BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 011482
         (484 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|224053042|ref|XP_002297678.1| predicted protein [Populus trichocarpa]
 gi|222844936|gb|EEE82483.1| predicted protein [Populus trichocarpa]
          Length = 482

 Score =  667 bits (1722), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 325/462 (70%), Positives = 384/462 (83%), Gaps = 7/462 (1%)

Query: 23  LLAKGAHCFEGKKKLHLHKLQWQQKSGSSSSCVSHQKSRIEMGAITLELKHKNYCSGKIV 82
           +   G  CF+GKK L +HK QW+Q S +SS+C+S Q++R E GA  LE+KHK+ CSGKI+
Sbjct: 24  IFDNGVQCFQGKKVLSMHKFQWKQGS-NSSTCLS-QETRWENGATILEMKHKDSCSGKIL 81

Query: 83  DWNEQQQNRLILDNLHVQYLQSRIKNMISG-NIKDVSNTEIPLTSGIRLQTLNYIATIEL 141
           DWN++ +  LI+D+  ++ LQSR+K++ISG NI D  +  IPLTSGIRLQTLNYI T+EL
Sbjct: 82  DWNKKLKKHLIMDDFQLRSLQSRMKSIISGRNIDDSVDAPIPLTSGIRLQTLNYIVTVEL 141

Query: 142 GGRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATG 201
           GGR MTVIVDTGSDL+WVQCQPCK CYNQQDPVF+PS SPSY+ VLC+S TC +L+ ATG
Sbjct: 142 GGRKMTVIVDTGSDLSWVQCQPCKRCYNQQDPVFNPSTSPSYRTVLCSSPTCQSLQSATG 201

Query: 202 NSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS-VNDFIFGCGRNNKGLFGGV 260
           N GVC S+ PP CNY V+YGDGSYTRGELG EHL LG ++ VN+FIFGCGRNN+GLFGG 
Sbjct: 202 NLGVCGSN-PPSCNYVVNYGDGSYTRGELGTEHLDLGNSTAVNNFIFGCGRNNQGLFGGA 260

Query: 261 SGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTN 320
           SGL+GLGRS LSL+SQTS +FGG+FSYCLP T+   ASGSL++GGNSSV+KN+TPI+YT 
Sbjct: 261 SGLVGLGRSSLSLISQTSAMFGGVFSYCLPITETE-ASGSLVMGGNSSVYKNTTPISYTR 319

Query: 321 MIPNPQLATFYILNLTGISIGGKQLQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLK 380
           MIPNPQL  FY LNLTGI++G   +QA  F K G++IDSGTVITRLPPSIY ALK EF+K
Sbjct: 320 MIPNPQLP-FYFLNLTGITVGSVAVQAPSFGKDGMMIDSGTVITRLPPSIYQALKDEFVK 378

Query: 381 QFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVC 440
           QFSGFPSAP F ILDTCFNLS YQEV IP +KM FEGNAE+ VDVTG+ YFVK+DASQVC
Sbjct: 379 QFSGFPSAPAFMILDTCFNLSGYQEVEIPNIKMHFEGNAELNVDVTGVFYFVKTDASQVC 438

Query: 441 LALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
           LA+ASLSYE+E GIIGNYQQKNQRVIYDTK S LGFA E C+
Sbjct: 439 LAIASLSYENEVGIIGNYQQKNQRVIYDTKGSMLGFAAEACT 480


>gi|255543383|ref|XP_002512754.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223547765|gb|EEF49257.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 414

 Score =  627 bits (1617), Expect = e-177,   Method: Compositional matrix adjust.
 Identities = 302/414 (72%), Positives = 354/414 (85%), Gaps = 4/414 (0%)

Query: 71  LKHKNYC--SGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGI 128
           +KH+++C  SGK  DWN++ Q  LILD+  V+ LQSRIK++ SGN  D  +++IPL+SG+
Sbjct: 1   MKHRDFCNSSGKSTDWNKKLQKSLILDDFRVRSLQSRIKSIFSGNNIDALDSQIPLSSGV 60

Query: 129 RLQTLNYIATIELGGRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLC 188
           RLQTLNYI T+E+GGRNMTVIVDTGSDLTWVQCQPC+ CYNQQDP+F+PS SPSY+ +LC
Sbjct: 61  RLQTLNYIVTVEIGGRNMTVIVDTGSDLTWVQCQPCRLCYNQQDPLFNPSGSPSYQTILC 120

Query: 189 NSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFG 248
           NSSTC +L++ATGN GVC S++P  CNY V+YGDGSYTRG+LG E L LG   V++FIFG
Sbjct: 121 NSSTCQSLQYATGNLGVCGSNTP-TCNYVVNYGDGSYTRGDLGMEQLNLGTTHVSNFIFG 179

Query: 249 CGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSS 308
           CGRNNKGLFGG SGLMGLG+SDLSLVSQTS IF G+FSYCLP+T  A ASGSLILGGNSS
Sbjct: 180 CGRNNKGLFGGASGLMGLGKSDLSLVSQTSAIFEGVFSYCLPTTA-ADASGSLILGGNSS 238

Query: 309 VFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGGILIDSGTVITRLPP 368
           V+KN+TPI+YT MI NPQL TFY LNLTGISIGG  LQA  + + GILIDSGTVITRLPP
Sbjct: 239 VYKNTTPISYTRMIANPQLPTFYFLNLTGISIGGVALQAPNYRQSGILIDSGTVITRLPP 298

Query: 369 SIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGI 428
            +Y  LKAEFLKQFSGFPSAP FSILDTCFNL+ Y EV+IP ++M+FEGNAE+TVDVTGI
Sbjct: 299 PVYRDLKAEFLKQFSGFPSAPPFSILDTCFNLNGYDEVDIPTIRMQFEGNAELTVDVTGI 358

Query: 429 VYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
            YFVK+DASQVCLALASLS++DE  IIGNYQQ+NQRVIY+TK S+LGFA E CS
Sbjct: 359 FYFVKTDASQVCLALASLSFDDEIPIIGNYQQRNQRVIYNTKESKLGFAAEACS 412


>gi|224072755|ref|XP_002303865.1| predicted protein [Populus trichocarpa]
 gi|222841297|gb|EEE78844.1| predicted protein [Populus trichocarpa]
          Length = 412

 Score =  620 bits (1600), Expect = e-175,   Method: Compositional matrix adjust.
 Identities = 298/413 (72%), Positives = 351/413 (84%), Gaps = 4/413 (0%)

Query: 71  LKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMI-SGNIKDVSNTEIPLTSGIR 129
           +KHK+ CSGKI+DWN++ Q RLI+DN  ++ LQSRIKN+I SGNI D  +T+IPLTSGIR
Sbjct: 1   MKHKDSCSGKILDWNKKLQKRLIMDNFQLRSLQSRIKNIILSGNIDDSVDTQIPLTSGIR 60

Query: 130 LQTLNYIATIELGGRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCN 189
           LQ+LNYI T+ELGGR MTVIVDTGSDL+WVQCQPC  CYNQQDPVF+PS SPSY+ VLCN
Sbjct: 61  LQSLNYIVTVELGGRKMTVIVDTGSDLSWVQCQPCNRCYNQQDPVFNPSKSPSYRTVLCN 120

Query: 190 SSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGC 249
           S TC +L+ ATGNSGVC S+ PP CNY V+YGDGSYT GE+G EHL LG  +VN+FIFGC
Sbjct: 121 SLTCRSLQLATGNSGVCGSN-PPTCNYVVNYGDGSYTSGEVGMEHLNLGNTTVNNFIFGC 179

Query: 250 GRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSV 309
           GR N+GLFGG SGL+GLGR+DLSL+SQ S +FGG+FSYCLP+T+ A ASGSL++GGNSSV
Sbjct: 180 GRKNQGLFGGASGLVGLGRTDLSLISQISPMFGGVFSYCLPTTE-AEASGSLVMGGNSSV 238

Query: 310 FKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGGILIDSGTVITRLPPS 369
           +KN+TPI+YT MI NP L  FY LNLTGI++GG ++QA  F K  ++IDSGTVI+RLPPS
Sbjct: 239 YKNTTPISYTRMIHNP-LLPFYFLNLTGITVGGVEVQAPSFGKDRMIIDSGTVISRLPPS 297

Query: 370 IYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIV 429
           IY ALKAEF+KQFSG+PSAP F ILD+CFNLS YQEV IP +KM FEG+AE+ VDVTG+ 
Sbjct: 298 IYQALKAEFVKQFSGYPSAPSFMILDSCFNLSGYQEVKIPDIKMYFEGSAELNVDVTGVF 357

Query: 430 YFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
           Y VK+DASQVCLA+ASL YEDE GIIGNYQQKNQR+IYDTK S LGFA E CS
Sbjct: 358 YSVKTDASQVCLAIASLPYEDEVGIIGNYQQKNQRIIYDTKGSMLGFAEEACS 410


>gi|297842769|ref|XP_002889266.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297335107|gb|EFH65525.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 489

 Score =  586 bits (1510), Expect = e-164,   Method: Compositional matrix adjust.
 Identities = 292/459 (63%), Positives = 350/459 (76%), Gaps = 9/459 (1%)

Query: 29  HCFEGKKKLHLHKLQWQQKSG--SSSSCVSHQKSRIEMGAITLELKHKNYCSGKIVDWNE 86
           H    KK L +H   W  K    +SSSC S    +    + TLE+KH+  CSGK +DW +
Sbjct: 30  HGVGEKKILSVHNNIWSPKKSYEASSSCFSRSLGKGRE-STTLEMKHRELCSGKTIDWGK 88

Query: 87  QQQNRLILDNLHVQYLQSRIKNMISGNI-KDVSNTEIPLTSGIRLQTLNYIATIELGGRN 145
           + +  L+LDN+ VQ LQ RIK M S    + VS T+IPLTSGI+L+TLNYI T+ELGG+N
Sbjct: 89  KMRRALLLDNIRVQSLQLRIKAMTSSTTEQSVSETQIPLTSGIKLETLNYIVTVELGGKN 148

Query: 146 MTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGV 205
           M++IVDTGSDLTWVQCQPC+SCYNQQ P++DPS+S SYK V CNSSTC  L  ATGNSG 
Sbjct: 149 MSLIVDTGSDLTWVQCQPCRSCYNQQGPLYDPSVSSSYKTVFCNSSTCQDLVAATGNSGP 208

Query: 206 CSSSS---PPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSG 262
           C   +      C Y VSYGDGSYTRG+L  E + LG   + + +FGCGRNNKGLFGG SG
Sbjct: 209 CGGFNGVVKTTCEYVVSYGDGSYTRGDLASESIVLGDTKLENLVFGCGRNNKGLFGGASG 268

Query: 263 LMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMI 322
           LMGLGRS +SLVSQT + F G+FSYCLPS +D GASG+L  G + SV+KNST + YT ++
Sbjct: 269 LMGLGRSSVSLVSQTLKTFNGVFSYCLPSLED-GASGTLSFGNDFSVYKNSTSVFYTPLV 327

Query: 323 PNPQLATFYILNLTGISIGGKQLQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQF 382
            NPQL +FYILNLTG SIGG +L+   F +G ILIDSGTVITRLPPSIY A+K EFLKQF
Sbjct: 328 QNPQLRSFYILNLTGASIGGVELKTLSFGRG-ILIDSGTVITRLPPSIYKAVKTEFLKQF 386

Query: 383 SGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLA 442
           SGFPSAPG+SILDTCFNL++Y++++IP +KM FEGNAE+ VDVTG+ YFVK DAS VCLA
Sbjct: 387 SGFPSAPGYSILDTCFNLTSYEDISIPTIKMIFEGNAELEVDVTGVFYFVKPDASLVCLA 446

Query: 443 LASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
           LASLSYE+E GIIGNYQQKNQRVIYDT   +LG AGE+C
Sbjct: 447 LASLSYENEVGIIGNYQQKNQRVIYDTTQERLGIAGENC 485


>gi|356526294|ref|XP_003531753.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 414

 Score =  574 bits (1479), Expect = e-161,   Method: Compositional matrix adjust.
 Identities = 281/413 (68%), Positives = 335/413 (81%), Gaps = 2/413 (0%)

Query: 71  LKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRL 130
           +K + +CS K +DWN + Q +LILD+L V+ +Q+RI+ + S +  + S T+IPL+SGI L
Sbjct: 1   MKDRGHCSEKKIDWNRRLQKQLILDDLRVRSMQNRIRRVASTHNVEASQTQIPLSSGINL 60

Query: 131 QTLNYIATIELGGRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNS 190
           QTLNYI T+ LG +NMTVI+DTGSDLTWVQC+PC SCYNQQ P+F PS S SY+ V CNS
Sbjct: 61  QTLNYIVTMGLGSKNMTVIIDTGSDLTWVQCEPCMSCYNQQGPIFKPSTSSSYQSVSCNS 120

Query: 191 STCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCG 250
           STC +L+FATGN+G C SS+P  CNY V+YGDGSYT GELG E L  G  SV+DF+FGCG
Sbjct: 121 STCQSLQFATGNTGACGSSNPSTCNYVVNYGDGSYTNGELGVEALSFGGVSVSDFVFGCG 180

Query: 251 RNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVF 310
           RNNKGLFGGVSGLMGLGRS LSLVSQT+  FGG+FSYCLP+T+ AG+SGSL++G  SSVF
Sbjct: 181 RNNKGLFGGVSGLMGLGRSYLSLVSQTNATFGGVFSYCLPTTE-AGSSGSLVMGNESSVF 239

Query: 311 KNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQAS-GFAKGGILIDSGTVITRLPPS 369
           KN+ PITYT M+ NPQL+ FYILNLTGI +GG  L+A   F  GGILIDSGTVITRLP S
Sbjct: 240 KNANPITYTRMLSNPQLSNFYILNLTGIDVGGVALKAPLSFGNGGILIDSGTVITRLPSS 299

Query: 370 IYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIV 429
           +Y ALKAEFLK+F+GFPSAPGFSILDTCFNL+ Y EV+IP + + FEGNA++ VD TG  
Sbjct: 300 VYKALKAEFLKKFTGFPSAPGFSILDTCFNLTGYDEVSIPTISLRFEGNAQLNVDATGTF 359

Query: 430 YFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
           Y VK DASQVCLALASLS   +T IIGNYQQ+NQRVIYDTK S++GFA E CS
Sbjct: 360 YVVKEDASQVCLALASLSDAYDTAIIGNYQQRNQRVIYDTKQSKVGFAEEPCS 412


>gi|356523155|ref|XP_003530207.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 412

 Score =  573 bits (1476), Expect = e-161,   Method: Compositional matrix adjust.
 Identities = 281/412 (68%), Positives = 331/412 (80%), Gaps = 2/412 (0%)

Query: 71  LKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRL 130
           +K + +CS K +DWN + Q +LI D+L V+ +Q+RI+ ++S +  + S T+IPL+SGI L
Sbjct: 1   MKDRGHCSEKKIDWNRRLQKQLISDDLRVRSMQNRIRRVVSSHNVEASQTQIPLSSGINL 60

Query: 131 QTLNYIATIELGGRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNS 190
           QTLNYI T+ LG  NMTVI+DTGSDLTWVQC+PC SCYNQQ P+F PS S SY+ V CNS
Sbjct: 61  QTLNYIVTMGLGSTNMTVIIDTGSDLTWVQCEPCMSCYNQQGPIFKPSTSSSYQSVSCNS 120

Query: 191 STCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCG 250
           STC +L+FATGN+G C S+ P  CNY V+YGDGSYT GELG E L  G  SV+DF+FGCG
Sbjct: 121 STCQSLQFATGNTGACGSN-PSTCNYVVNYGDGSYTNGELGVEQLSFGGVSVSDFVFGCG 179

Query: 251 RNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVF 310
           RNNKGLFGGVSGLMGLGRS LSLVSQT+  FGG+FSYCLP+T+ +GASGSL++G  SSVF
Sbjct: 180 RNNKGLFGGVSGLMGLGRSYLSLVSQTNATFGGVFSYCLPTTE-SGASGSLVMGNESSVF 238

Query: 311 KNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGGILIDSGTVITRLPPSI 370
           KN TPITYT M+PNPQL+ FYILNLTGI + G  LQ   F  GG+LIDSGTVITRLP S+
Sbjct: 239 KNVTPITYTRMLPNPQLSNFYILNLTGIDVDGVALQVPSFGNGGVLIDSGTVITRLPSSV 298

Query: 371 YSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVY 430
           Y ALKA FLKQF+GFPSAPGFSILDTCFNL+ Y EV+IP + M FEGNAE+ VD TG  Y
Sbjct: 299 YKALKALFLKQFTGFPSAPGFSILDTCFNLTGYDEVSIPTISMHFEGNAELKVDATGTFY 358

Query: 431 FVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
            VK DASQVCLALASLS   +T IIGNYQQ+NQRVIYDTK S++GFA E CS
Sbjct: 359 VVKEDASQVCLALASLSDAYDTAIIGNYQQRNQRVIYDTKQSKVGFAEESCS 410


>gi|388504358|gb|AFK40245.1| unknown [Medicago truncatula]
          Length = 480

 Score =  569 bits (1467), Expect = e-160,   Method: Compositional matrix adjust.
 Identities = 285/476 (59%), Positives = 357/476 (75%), Gaps = 6/476 (1%)

Query: 9   TILSLLLPLMVSLFLLAKGAHCFEGKKKLHLHKLQWQQKSGSSSSCVSHQKSRIEMGAIT 68
           T+L   L  +   F++A G    E KK   +  LQ   + GS    +   +SR E GAI 
Sbjct: 7   TMLPFFLSFVFLYFIIANGGCELEQKKMFKVQMLQRNHQFGSKGCILP--ESRKEKGAIV 64

Query: 69  LELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGN--IKDVSNTEIPLTS 126
           LE+K + YCS + ++WN + Q +LI D+L V+ +Q+RI+  +SG+   +  S  +IPL S
Sbjct: 65  LEMKDRGYCSERKINWNRKLQKQLIFDDLRVRSMQNRIRAKVSGHNSSEQSSEIQIPLAS 124

Query: 127 GIRLQTLNYIATIELGGRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKV 186
           GI L+TLNYI TI LG +NMTVI+DTGSDLTWVQC PC SCY+QQ PVF+PS S SY  +
Sbjct: 125 GINLETLNYIVTIGLGNQNMTVIIDTGSDLTWVQCDPCMSCYSQQGPVFNPSNSSSYNSL 184

Query: 187 LCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFI 246
           LCNSSTC  L+F TGN+  C S++P  CN+ VSYGDGS+T GELG EHL  G  SV++F+
Sbjct: 185 LCNSSTCQNLQFTTGNTEACESNNPSSCNHTVSYGDGSFTDGELGVEHLSFGGISVSNFV 244

Query: 247 FGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGN 306
           FGCGRNNKGLFGGVSG+MGLGRS+LS++SQT+  FGG+FSYCLP+T D+GASGSL++G  
Sbjct: 245 FGCGRNNKGLFGGVSGIMGLGRSNLSMISQTNTTFGGVFSYCLPTT-DSGASGSLVIGNE 303

Query: 307 SSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGGILIDSGTVITRL 366
           SS+FKN TPI YT+M+ NPQL+ FY+LNLTGI +GG  +Q + F  GGILIDSGTVITRL
Sbjct: 304 SSLFKNLTPIAYTSMVSNPQLSNFYVLNLTGIDVGGVAIQDTSFGNGGILIDSGTVITRL 363

Query: 367 PPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVT 426
            PS+Y+ALKAEFLKQFSG+P AP  SILDTCFNL+  +EV+IP + M FE N ++ VD  
Sbjct: 364 APSLYNALKAEFLKQFSGYPIAPALSILDTCFNLTGIEEVSIPTLSMHFENNVDLNVDAV 423

Query: 427 GIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
           GI+Y  K D SQVCLALASLS E++  IIGNYQQ+NQRVIYD K S++GFA EDCS
Sbjct: 424 GILYMPK-DGSQVCLALASLSDENDMAIIGNYQQRNQRVIYDAKQSKIGFAREDCS 478


>gi|356557203|ref|XP_003546907.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 470

 Score =  563 bits (1452), Expect = e-158,   Method: Compositional matrix adjust.
 Identities = 288/456 (63%), Positives = 343/456 (75%), Gaps = 22/456 (4%)

Query: 35  KKLHLH-KLQWQQKSGSSSSCVSHQKSRIEMGAITLELKHKNYCSG--KIVDWNEQQQNR 91
           K  HL  KLQ       +  C+  Q SR E GAI LE+K +  CS   +  DW E+Q   
Sbjct: 27  KTFHLQRKLQH-----GTPECLLPQ-SRKEKGAIILEMKDRGECSESERKGDWVEKQ--- 77

Query: 92  LILDNLHVQYLQSRI-KNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELGGRNMTVIV 150
           L+LD LHV+ +Q+ I K   S  I D S T++PLTSGI+ QTLNYI T+ LG +NM+VIV
Sbjct: 78  LVLDGLHVRSIQNHIRKRTSSSQIADSSETQVPLTSGIKFQTLNYIVTMGLGSQNMSVIV 137

Query: 151 DTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSS-- 208
           DTGSDLTWVQC+PC+SCYNQ  P+F PS SPSY+ +LCNS+TC +LE      G C S  
Sbjct: 138 DTGSDLTWVQCEPCRSCYNQNGPLFKPSTSPSYQPILCNSTTCQSLEL-----GACGSDP 192

Query: 209 SSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGR 268
           S+   C+Y V+YGDGSYT GELG E LG G  SV++F+FGCGRNNKGLFGG SGLMGLGR
Sbjct: 193 STSATCDYVVNYGDGSYTSGELGIEKLGFGGISVSNFVFGCGRNNKGLFGGASGLMGLGR 252

Query: 269 SDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLA 328
           S+LS++SQT+  FGG+FSYCLPST  AGASGSL++G  S VFKN TPI YT M+PN QL+
Sbjct: 253 SELSMISQTNATFGGVFSYCLPSTDQAGASGSLVMGNQSGVFKNVTPIAYTRMLPNLQLS 312

Query: 329 TFYILNLTGISIGGKQL--QASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFP 386
            FYILNLTGI +GG  L  QAS F  GG+++DSGTVI+RL PS+Y ALKA+FL+QFSGFP
Sbjct: 313 NFYILNLTGIDVGGVSLHVQASSFGNGGVILDSGTVISRLAPSVYKALKAKFLEQFSGFP 372

Query: 387 SAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASL 446
           SAPGFSILDTCFNL+ Y +VNIP + M FEGNAE+ VD TGI Y VK DAS+VCLALASL
Sbjct: 373 SAPGFSILDTCFNLTGYDQVNIPTISMYFEGNAELNVDATGIFYLVKEDASRVCLALASL 432

Query: 447 SYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
           S E E GIIGNYQQ+NQRV+YD K SQ+GFA E C+
Sbjct: 433 SDEYEMGIIGNYQQRNQRVLYDAKLSQVGFAKEPCT 468


>gi|18412482|ref|NP_565219.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|19699359|gb|AAL91289.1| At1g79720/F19K16_30 [Arabidopsis thaliana]
 gi|26450464|dbj|BAC42346.1| unknown protein [Arabidopsis thaliana]
 gi|115646741|gb|ABJ17101.1| At1g79720 [Arabidopsis thaliana]
 gi|332198170|gb|AEE36291.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 484

 Score =  560 bits (1444), Expect = e-157,   Method: Compositional matrix adjust.
 Identities = 287/460 (62%), Positives = 354/460 (76%), Gaps = 9/460 (1%)

Query: 28  AHCFEGKKKLHLHKLQWQQKSG--SSSSCVSHQKSRIEMGAITLELKHKNYCSGKIVDWN 85
            H  + KK L +H   W  K    +S+SC S    +    + TLE+KH+  CSGK +D  
Sbjct: 26  VHGVDEKKILSVHNNIWSPKKSYEASTSCFSRSLGK-GRESTTLEMKHRELCSGKTIDLG 84

Query: 86  EQQQNRLILDNLHVQYLQSRIKNMISGNI-KDVSNTEIPLTSGIRLQTLNYIATIELGGR 144
           ++ +  L+LDN+ VQ LQ +IK M S    + VS T+IPLTSGI+L++LNYI T+ELGG+
Sbjct: 85  KKMRRALVLDNIRVQSLQLKIKAMTSSTTEQSVSETQIPLTSGIKLESLNYIVTVELGGK 144

Query: 145 NMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSG 204
           NM++IVDTGSDLTWVQCQPC+SCYNQQ P++DPS+S SYK V CNSSTC  L  AT NSG
Sbjct: 145 NMSLIVDTGSDLTWVQCQPCRSCYNQQGPLYDPSVSSSYKTVFCNSSTCQDLVAATSNSG 204

Query: 205 VCSSSS---PPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVS 261
            C  ++      C Y VSYGDGSYTRG+L  E + LG   + +F+FGCGRNNKGLFGG S
Sbjct: 205 PCGGNNGVVKTPCEYVVSYGDGSYTRGDLASESILLGDTKLENFVFGCGRNNKGLFGGSS 264

Query: 262 GLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNM 321
           GLMGLGRS +SLVSQT + F G+FSYCLPS +D GASGSL  G +SSV+ NST ++YT +
Sbjct: 265 GLMGLGRSSVSLVSQTLKTFNGVFSYCLPSLED-GASGSLSFGNDSSVYTNSTSVSYTPL 323

Query: 322 IPNPQLATFYILNLTGISIGGKQLQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQ 381
           + NPQL +FYILNLTG SIGG +L++S F +G ILIDSGTVITRLPPSIY A+K EFLKQ
Sbjct: 324 VQNPQLRSFYILNLTGASIGGVELKSSSFGRG-ILIDSGTVITRLPPSIYKAVKIEFLKQ 382

Query: 382 FSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCL 441
           FSGFP+APG+SILDTCFNL++Y++++IP++KM F+GNAE+ VDVTG+ YFVK DAS VCL
Sbjct: 383 FSGFPTAPGYSILDTCFNLTSYEDISIPIIKMIFQGNAELEVDVTGVFYFVKPDASLVCL 442

Query: 442 ALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
           ALASLSYE+E GIIGNYQQKNQRVIYDT   +LG  GE+C
Sbjct: 443 ALASLSYENEVGIIGNYQQKNQRVIYDTTQERLGIVGENC 482


>gi|21595063|gb|AAM66069.1| putative aspartyl protease [Arabidopsis thaliana]
          Length = 484

 Score =  559 bits (1440), Expect = e-156,   Method: Compositional matrix adjust.
 Identities = 286/460 (62%), Positives = 354/460 (76%), Gaps = 9/460 (1%)

Query: 28  AHCFEGKKKLHLHKLQWQQKSG--SSSSCVSHQKSRIEMGAITLELKHKNYCSGKIVDWN 85
            H  + KK L +H   W  K    +S+SC S    +    + TLE+KH+  CSGK +D  
Sbjct: 26  VHGVDEKKILSVHNNIWSPKKSYEASTSCFSRSLGK-GRESTTLEMKHRELCSGKTIDLG 84

Query: 86  EQQQNRLILDNLHVQYLQSRIKNMISGNI-KDVSNTEIPLTSGIRLQTLNYIATIELGGR 144
           ++ +  L+LDN+ VQ LQ +IK M S    + VS T+IPLTSGI+L++LNYI T+ELGG+
Sbjct: 85  KKMRRALVLDNIRVQSLQLKIKAMTSSTTEQSVSETQIPLTSGIKLESLNYIVTVELGGK 144

Query: 145 NMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSG 204
           NM++IVDTGSDLTWVQCQPC+SCYNQQ P++DPS+S SYK V CNSSTC  L  AT NSG
Sbjct: 145 NMSLIVDTGSDLTWVQCQPCRSCYNQQGPLYDPSVSSSYKTVFCNSSTCQDLVAATSNSG 204

Query: 205 VCSSSS---PPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVS 261
            C  ++      C Y VSYGDGSYTRG+L  E + LG   + +F+FGCGRNNKGLFGG S
Sbjct: 205 PCGGNNGVVKTPCEYVVSYGDGSYTRGDLASESILLGDTKLENFVFGCGRNNKGLFGGSS 264

Query: 262 GLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNM 321
           GLMGLGRS +SLVSQT + F G+FSYCLPS +D GASGSL  G +SSV+ NST ++YT +
Sbjct: 265 GLMGLGRSSVSLVSQTLKTFNGVFSYCLPSLED-GASGSLSFGNDSSVYTNSTSVSYTPL 323

Query: 322 IPNPQLATFYILNLTGISIGGKQLQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQ 381
           + NPQL +FYILNLTG SIGG +L++S F +G ILIDSGTVITRLPPSIY A+K EFLKQ
Sbjct: 324 VQNPQLRSFYILNLTGASIGGVELKSSSFGRG-ILIDSGTVITRLPPSIYKAVKIEFLKQ 382

Query: 382 FSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCL 441
           FSGFP+APG+SILDTCFNL++Y++++IP++KM F+GNAE+ VDVTG+ YFVK DAS VCL
Sbjct: 383 FSGFPTAPGYSILDTCFNLTSYEDISIPIIKMIFQGNAELEVDVTGVFYFVKPDASLVCL 442

Query: 442 ALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
           ALASLSYE+E GIIGNYQQKNQRVIYD+   +LG  GE+C
Sbjct: 443 ALASLSYENEVGIIGNYQQKNQRVIYDSTQERLGIVGENC 482


>gi|7715602|gb|AAF68120.1|AC010793_15 F20B17.14 [Arabidopsis thaliana]
 gi|12324588|gb|AAG52249.1|AC011717_17 putative aspartyl protease; 105611-106921 [Arabidopsis thaliana]
          Length = 436

 Score =  551 bits (1420), Expect = e-154,   Method: Compositional matrix adjust.
 Identities = 276/420 (65%), Positives = 338/420 (80%), Gaps = 6/420 (1%)

Query: 66  AITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNI-KDVSNTEIPL 124
           + TLE+KH+  CSGK +D  ++ +  L+LDN+ VQ LQ +IK M S    + VS T+IPL
Sbjct: 17  STTLEMKHRELCSGKTIDLGKKMRRALVLDNIRVQSLQLKIKAMTSSTTEQSVSETQIPL 76

Query: 125 TSGIRLQTLNYIATIELGGRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYK 184
           TSGI+L++LNYI T+ELGG+NM++IVDTGSDLTWVQCQPC+SCYNQQ P++DPS+S SYK
Sbjct: 77  TSGIKLESLNYIVTVELGGKNMSLIVDTGSDLTWVQCQPCRSCYNQQGPLYDPSVSSSYK 136

Query: 185 KVLCNSSTCHALEFATGNSGVCSSSS---PPDCNYFVSYGDGSYTRGELGREHLGLGKAS 241
            V CNSSTC  L  AT NSG C  ++      C Y VSYGDGSYTRG+L  E + LG   
Sbjct: 137 TVFCNSSTCQDLVAATSNSGPCGGNNGVVKTPCEYVVSYGDGSYTRGDLASESILLGDTK 196

Query: 242 VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSL 301
           + +F+FGCGRNNKGLFGG SGLMGLGRS +SLVSQT + F G+FSYCLPS +D GASGSL
Sbjct: 197 LENFVFGCGRNNKGLFGGSSGLMGLGRSSVSLVSQTLKTFNGVFSYCLPSLED-GASGSL 255

Query: 302 ILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGGILIDSGT 361
             G +SSV+ NST ++YT ++ NPQL +FYILNLTG SIGG +L++S F +G ILIDSGT
Sbjct: 256 SFGNDSSVYTNSTSVSYTPLVQNPQLRSFYILNLTGASIGGVELKSSSFGRG-ILIDSGT 314

Query: 362 VITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEM 421
           VITRLPPSIY A+K EFLKQFSGFP+APG+SILDTCFNL++Y++++IP++KM F+GNAE+
Sbjct: 315 VITRLPPSIYKAVKIEFLKQFSGFPTAPGYSILDTCFNLTSYEDISIPIIKMIFQGNAEL 374

Query: 422 TVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
            VDVTG+ YFVK DAS VCLALASLSYE+E GIIGNYQQKNQRVIYDT   +LG  GE+C
Sbjct: 375 EVDVTGVFYFVKPDASLVCLALASLSYENEVGIIGNYQQKNQRVIYDTTQERLGIVGENC 434


>gi|449436215|ref|XP_004135889.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
          Length = 496

 Score =  551 bits (1419), Expect = e-154,   Method: Compositional matrix adjust.
 Identities = 281/468 (60%), Positives = 350/468 (74%), Gaps = 8/468 (1%)

Query: 20  SLFLLAKGAHCFEGKKKLHLHKLQWQQKSGSSSSCVSHQKSRIEMGAITLELKHKNYCSG 79
           S F L  G +  +G  +L      W++   +  +C+  QK +I  G  TLE+K ++YCSG
Sbjct: 31  SSFNLGNGDNHEKGLLQL-FQNFPWKEHGEAVVNCI-FQKPKITKGITTLEMKQRDYCSG 88

Query: 80  KIVDWNEQQQNRLILDNLHVQYLQSRIKNMI-SGNIKDVSNTEIPLTSGIRLQTLNYIAT 138
           KI DW +  QNR+ILD ++V  L S  K+ I  G    +S+++IP++SG RLQTLNYI T
Sbjct: 89  KITDWEKIFQNRIILDAINVNSLFSHFKSAIFPGQTHQLSDSQIPISSGARLQTLNYIVT 148

Query: 139 IELGGRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEF 198
           + +GG+N T+IVDTGSDLTWVQC PC+ CYNQQ+P+F+PS S S+  + CNS TC AL+ 
Sbjct: 149 VGIGGQNSTLIVDTGSDLTWVQCLPCRLCYNQQEPLFNPSNSSSFLSLPCNSPTCVALQP 208

Query: 199 ATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFG 258
             G+SG+CS+ +   C+Y + YGDGSY+RGELG E L LGK  +++FIFGCGRNNKGLFG
Sbjct: 209 TAGSSGLCSNKNSTSCDYQIDYGDGSYSRGELGFEKLTLGKTEIDNFIFGCGRNNKGLFG 268

Query: 259 GVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGG-NSSVFKNSTPIT 317
           G SGLMGL RS+LSLVSQTS +FG +FSYCLP+T   G+SGSL LGG + S FKN +PI+
Sbjct: 269 GASGLMGLARSELSLVSQTSSLFGSVFSYCLPTT-GVGSSGSLTLGGADFSNFKNISPIS 327

Query: 318 YTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKG-GI--LIDSGTVITRLPPSIYSAL 374
           YT MI NPQ++ FY LNLTGISIGG  L     +   G+  L+DSGTVITRL PSIY A 
Sbjct: 328 YTRMIQNPQMSNFYFLNLTGISIGGVNLNVPRLSSNEGVLSLLDSGTVITRLSPSIYKAF 387

Query: 375 KAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKS 434
           KAEF KQFSG+ + PGFSIL+TCFNL+ Y+EVNIP VK  FEGNAEM VDV G+ YFVKS
Sbjct: 388 KAEFEKQFSGYRTTPGFSILNTCFNLTGYEEVNIPTVKFIFEGNAEMIVDVEGVFYFVKS 447

Query: 435 DASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
           DASQ+CLA ASL YED+T IIGNYQQKNQRVIY++K S++GFAGE CS
Sbjct: 448 DASQICLAFASLGYEDQTMIIGNYQQKNQRVIYNSKESKVGFAGEPCS 495


>gi|449509162|ref|XP_004163513.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
          Length = 417

 Score =  533 bits (1374), Expect = e-149,   Method: Compositional matrix adjust.
 Identities = 266/417 (63%), Positives = 326/417 (78%), Gaps = 6/417 (1%)

Query: 71  LKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMI-SGNIKDVSNTEIPLTSGIR 129
           +K ++YCSGKI DW +  QNR+ILD ++V  L S  K+ I  G    +S+++IP++SG R
Sbjct: 1   MKQRDYCSGKITDWEKIFQNRIILDAINVNSLFSHFKSAIFPGQTHQLSDSQIPISSGAR 60

Query: 130 LQTLNYIATIELGGRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCN 189
           LQTLNYI T+ +GG+N T+IVDTGSDLTWVQC PC+ CYNQQ+P+F+PS S S+  + CN
Sbjct: 61  LQTLNYIVTVGIGGQNSTLIVDTGSDLTWVQCLPCRLCYNQQEPLFNPSNSSSFLSLPCN 120

Query: 190 SSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGC 249
           S TC AL+   G+SG+CS+ +   C+Y + YGDGSY+RGELG E L LGK  +++FIFGC
Sbjct: 121 SPTCVALQPTAGSSGLCSNKNSTSCDYQIDYGDGSYSRGELGFEKLTLGKTEIDNFIFGC 180

Query: 250 GRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGG-NSS 308
           GRNNKGLFGG SGLMGL RS+LSLVSQTS +FG +FSYCLP+T   G+SGSL LGG + S
Sbjct: 181 GRNNKGLFGGASGLMGLARSELSLVSQTSSLFGSVFSYCLPTT-GVGSSGSLTLGGADFS 239

Query: 309 VFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKG-GI--LIDSGTVITR 365
            FKN +PI+YT MI NPQ++ FY LNLTGISIGG  L     +   G+  L+DSGTVITR
Sbjct: 240 NFKNISPISYTRMIQNPQMSNFYFLNLTGISIGGVNLNVPRLSSNEGVLSLLDSGTVITR 299

Query: 366 LPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDV 425
           L PSIY A KAEF KQFSG+ + PGFSIL+TCFNL+ Y+EVNIP VK  FEGNAEM VDV
Sbjct: 300 LSPSIYKAFKAEFEKQFSGYRTTPGFSILNTCFNLTGYEEVNIPTVKFIFEGNAEMIVDV 359

Query: 426 TGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
            G+ YFVKSDASQ+CLA ASL YED+T IIGNYQQKNQRVIY++K S++GFAGE CS
Sbjct: 360 EGVFYFVKSDASQICLAFASLGYEDQTMIIGNYQQKNQRVIYNSKESKVGFAGEPCS 416


>gi|242081123|ref|XP_002445330.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
 gi|241941680|gb|EES14825.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
          Length = 543

 Score =  472 bits (1215), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 236/458 (51%), Positives = 313/458 (68%), Gaps = 23/458 (5%)

Query: 44  WQQKSGSSSSCV---SHQKSRIEMGAIT-LELKHKNYCSGKIVDWNEQQQNR-----LIL 94
           W ++ G +        H+K+     A T LELK  +  +  I D +    +R     L  
Sbjct: 86  WSRRYGDAKLAEMLGEHKKAGAARTATTVLELKRHSLVA--IPDDDPAAHDRYLRRLLAA 143

Query: 95  DNLHVQYLQSRIKNMISGNIKDVSNT-EIPLTSGIRLQTLNYIATIELGG-------RNM 146
           D       Q RI+N  +      S + E+PLTSGIR QTLNY+ TI LGG        N+
Sbjct: 144 DESRANSFQLRIRNDRAAAASTQSGSAEVPLTSGIRFQTLNYVTTIALGGGSSGSPAANL 203

Query: 147 TVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHA-LEFATGNSGV 205
           TVIVDTGSDLTWVQC+PC +CY Q+DP+FDP+ S +Y  V CN+S C A L+ ATG  G 
Sbjct: 204 TVIVDTGSDLTWVQCKPCSACYAQRDPLFDPAGSATYAAVRCNASACAASLKAATGTPGS 263

Query: 206 CSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMG 265
           C   +   C Y ++YGDGS++RG L  + + LG AS++ F+FGCG +N+GLFGG +GLMG
Sbjct: 264 CGGGNE-RCYYALAYGDGSFSRGVLATDTVALGGASLDGFVFGCGLSNRGLFGGTAGLMG 322

Query: 266 LGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNP 325
           LGR++LSLVSQT+  +GG+FSYCLP+T    ASGSL LGG++S ++N+TP+ YT MI +P
Sbjct: 323 LGRTELSLVSQTALRYGGVFSYCLPATTSGDASGSLSLGGDASSYRNTTPVAYTRMIADP 382

Query: 326 QLATFYILNLTGISIGGKQLQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQF--S 383
               FY LN+TG ++GG  L A G     +LIDSGTVITRL PS+Y  ++AEF +QF  +
Sbjct: 383 AQPPFYFLNVTGAAVGGTALAAQGLGASNVLIDSGTVITRLAPSVYRGVRAEFTRQFAAA 442

Query: 384 GFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLAL 443
           G+P+APGFSILDTC++L+ + EV +PL+ +  EG AE+TVD  G+++ V+ D SQVCLA+
Sbjct: 443 GYPTAPGFSILDTCYDLTGHDEVKVPLLTLRLEGGAEVTVDAAGMLFVVRKDGSQVCLAM 502

Query: 444 ASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
           ASLSYED+T IIGNYQQKN+RV+YDT  S+LGFA EDC
Sbjct: 503 ASLSYEDQTPIIGNYQQKNKRVVYDTVGSRLGFADEDC 540


>gi|293335955|ref|NP_001168399.1| uncharacterized protein LOC100382168 precursor [Zea mays]
 gi|223948009|gb|ACN28088.1| unknown [Zea mays]
 gi|413922066|gb|AFW61998.1| hypothetical protein ZEAMMB73_694403 [Zea mays]
          Length = 507

 Score =  462 bits (1190), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 236/478 (49%), Positives = 316/478 (66%), Gaps = 37/478 (7%)

Query: 37  LHLHKLQWQQKSGSSSSCVS---------------HQKSRIEMGAIT--LELKHKNYCS- 78
           L L +L  +++S +++   S               H+K+    GA T  LELK  +  + 
Sbjct: 31  LSLRELDGRRRSAATTDTRSSRYYVDAMLAETLGEHKKA----GAATSVLELKRHSLTAI 86

Query: 79  -GKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIA 137
               V  +   +  L  D       Q R     +      ++ E+PLTSGIRLQTLNY+ 
Sbjct: 87  PEDPVARDRYLRRLLAADESRANSFQPRRNKDRASASTQSASAEVPLTSGIRLQTLNYVT 146

Query: 138 TIELGGR------NMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSS 191
           TI LGG       N+TVIVDTGSDLTWVQC+PC +CY Q+DP+FDP+ S +Y  V CN+S
Sbjct: 147 TISLGGSSGSPAANLTVIVDTGSDLTWVQCKPCSACYAQRDPLFDPAGSATYAAVRCNAS 206

Query: 192 TC-HALEFATGNSGVCSSSSP--PDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFG 248
            C  +L  ATG  G C S+      C Y ++YGDGS++RG L  + + LG AS+  F+FG
Sbjct: 207 ACADSLRAATGTPGSCGSTGAGSEKCYYALAYGDGSFSRGVLATDTVALGGASLGGFVFG 266

Query: 249 CGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGN-- 306
           CG +N+GLFGG +GLMGLGR++LSLVSQT+  +GG+FSYCLP+     ASGSL LGG   
Sbjct: 267 CGLSNRGLFGGTAGLMGLGRTELSLVSQTASRYGGVFSYCLPAATSGDASGSLSLGGGDD 326

Query: 307 -SSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGGILIDSGTVITR 365
            +S ++N+TP+ YT MI +P    FY LN+TG ++GG  L A G     +LIDSGTVITR
Sbjct: 327 AASSYRNTTPVAYTRMIADPAQPPFYFLNVTGAAVGGTALAAQGLGASNVLIDSGTVITR 386

Query: 366 LPPSIYSALKAEFLKQF--SGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTV 423
           L PS+Y A++AEF++QF  +G+P+APGFSILDTC++L+ + EV +PL+ +  EG A++TV
Sbjct: 387 LAPSVYRAVRAEFMRQFGAAGYPAAPGFSILDTCYDLTGHDEVKVPLLTLRLEGGADVTV 446

Query: 424 DVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
           D  G+++ V+ D SQVCLA+ASLSYEDET IIGNYQQKN+RV+YDT  S+LGFA EDC
Sbjct: 447 DAAGMLFVVRKDGSQVCLAMASLSYEDETPIIGNYQQKNKRVVYDTLGSRLGFADEDC 504


>gi|326516344|dbj|BAJ92327.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 464

 Score =  461 bits (1186), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 228/430 (53%), Positives = 301/430 (70%), Gaps = 8/430 (1%)

Query: 59  KSRIEMGAITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKN---MISGNIK 115
           +SR E GA  LEL+H    S       E+    L  D   V  LQ RI +   + S +  
Sbjct: 33  RSRAESGATVLELRHHASFSSGGKSRAEEAHAVLASDAARVSSLQRRIGSYGLIRSSDAA 92

Query: 116 DVSN-TEIPLTSGIRLQTLNYIATIELGGRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPV 174
             S   ++P+TSG RL+TLNY+AT+ +GG   TVIVDT S+LTWVQC+PC +C++QQ+P+
Sbjct: 93  SASKLAQVPVTSGARLRTLNYVATVGIGGGEATVIVDTASELTWVQCEPCDACHDQQEPL 152

Query: 175 FDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREH 234
           FDPS SPSY  V CNSS+C AL  ATG SG      P  C+Y +SY DGSY+RG L  + 
Sbjct: 153 FDPSSSPSYAAVPCNSSSCDALRVATGMSGQACDDQPAACSYTLSYRDGSYSRGVLAHDR 212

Query: 235 LGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQD 294
           L L    +  F+FGCG +N+G FGG SGLMGLGRS LSL+SQT + FGG+FSYCLP  ++
Sbjct: 213 LSLAGEDIQGFVFGCGTSNQGPFGGTSGLMGLGRSQLSLISQTMDQFGGVFSYCLP-PKE 271

Query: 295 AGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGG 354
           +G+SGSL+LG ++SV++NSTPI YT M+ +P    FY+ NLTGI++GG+ +Q+ GF+ GG
Sbjct: 272 SGSSGSLVLGDDASVYRNSTPIVYTAMVSDPLQGPFYLANLTGITVGGEDVQSPGFSAGG 331

Query: 355 ---ILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLV 411
               ++DSGT+IT L PS+Y+A++AEF+ Q + +P A  FSILDTCF+L+  +EV +P +
Sbjct: 332 GGKAIVDSGTIITSLVPSVYAAVRAEFVSQLAEYPQAAPFSILDTCFDLTGLREVQVPSL 391

Query: 412 KMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKN 471
           K+ F+G AE+ VD  G++Y V  DASQVCLALASL  E +T IIGNYQQKN RVI+DT  
Sbjct: 392 KLVFDGGAEVEVDSKGVLYVVTGDASQVCLALASLKSEYDTPIIGNYQQKNLRVIFDTVG 451

Query: 472 SQLGFAGEDC 481
           SQ+GFA E C
Sbjct: 452 SQIGFAQETC 461


>gi|212274713|ref|NP_001130791.1| uncharacterized protein LOC100191895 precursor [Zea mays]
 gi|194690124|gb|ACF79146.1| unknown [Zea mays]
 gi|194708040|gb|ACF88104.1| unknown [Zea mays]
 gi|223950469|gb|ACN29318.1| unknown [Zea mays]
 gi|414885521|tpg|DAA61535.1| TPA: hypothetical protein ZEAMMB73_650724 [Zea mays]
          Length = 500

 Score =  458 bits (1179), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 236/477 (49%), Positives = 324/477 (67%), Gaps = 30/477 (6%)

Query: 27  GAHCF----EGKKKLHLHK--LQWQQKSGSSSSCVSHQKSRIEMGAITLELKHKNYCSGK 80
           G HC     E  ++ HL +  LQ +Q+         H +SR   GA  LEL+H ++    
Sbjct: 29  GVHCLDLDLEEGRRHHLSRRALQGRQRR-------HHLRSRAVGGATVLELRHHSFSPAP 81

Query: 81  IVDWNEQQQNRLILDNLHVQYLQSRIKN---MISGNIKDV----SNTEIPLTSGIRLQTL 133
                E+    L  D   V  LQ RI++     + +  +V    S  ++P++SG RL+TL
Sbjct: 82  ANSREEEADALLSTDAARVSSLQGRIEHYRLTTTSSSAEVAVTASKAQVPVSSGARLRTL 141

Query: 134 NYIATIELGGRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTC 193
           NY+AT+ LGG   TVIVDT S+LTWVQC PC+SC++QQ P+FDPS SPSY  V C+S +C
Sbjct: 142 NYVATVGLGGGEATVIVDTASELTWVQCAPCESCHDQQGPLFDPSSSPSYAAVPCDSPSC 201

Query: 194 HALE--FATG---NSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFG 248
            AL+   ATG    +  C +  P  C+Y +SY DGSY+RG L  + L L    ++ F+FG
Sbjct: 202 DALQQQLATGAGAGAPPCDAGRPAACSYALSYRDGSYSRGVLAHDRLSLAGEVIDGFVFG 261

Query: 249 CGRNNKGL-FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNS 307
           CG +N+G  FGG SGLMGLGRS LSLVSQT + FGG+FSYCLP ++++ ASGSL+LG + 
Sbjct: 262 CGTSNQGPPFGGTSGLMGLGRSQLSLVSQTVDQFGGVFSYCLPLSRESDASGSLVLGDDP 321

Query: 308 SVFKNSTPITYTNMIPN--PQL-ATFYILNLTGISIGGKQLQASGFAKGGILIDSGTVIT 364
           S ++NSTP+ YT+M+ N  P L   FY++NLTGI++GG++++++GF+   I +DSGTVIT
Sbjct: 322 SAYRNSTPVVYTSMVSNSDPLLQGPFYLVNLTGITVGGQEVESTGFSARAI-VDSGTVIT 380

Query: 365 RLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVD 424
            L PS+Y+A++AEF+ Q + +P APGFSILDTCFN++  +EV +P + + F+G AE+ VD
Sbjct: 381 SLVPSVYNAVRAEFMSQLAEYPQAPGFSILDTCFNMTGLKEVQVPSLTLVFDGGAEVEVD 440

Query: 425 VTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
             G++YFV SD+SQVCLA+ASL  EDET IIGNYQQKN RV++DT  SQ+GFA E C
Sbjct: 441 SGGVLYFVSSDSSQVCLAVASLKSEDETSIIGNYQQKNLRVVFDTSASQVGFAQETC 497


>gi|242044724|ref|XP_002460233.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
 gi|241923610|gb|EER96754.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
          Length = 512

 Score =  447 bits (1150), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 242/480 (50%), Positives = 323/480 (67%), Gaps = 26/480 (5%)

Query: 27  GAHCF----EGKKKLHLHKLQWQQKSGSSSSCVSHQKSRIEMGAITLELKHKNYCSGKIV 82
           G HC     +G +  H H L  +           H +SR   GA  LEL+H+++ S    
Sbjct: 31  GVHCLHLEEDGSRHRHQHHLSRRALRQGRQRHPHHLRSRAVGGATVLELRHRSFSSAPPA 90

Query: 83  DWNEQQQNRLI-LDNLHVQYLQSRIKN----MISGNIKDVSNT-----EIPLTSGIRLQT 132
              E++ + L+  D   V  LQ RI      MI+ + +          ++P+TSG +L+T
Sbjct: 91  SSREEEVDGLLSTDAARVSSLQRRIDRYRRLMITSSAEVAVAVAASKAQVPVTSGAKLRT 150

Query: 133 LNYIATIELGGRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
           LNY+AT+ LGG   TVIVDT S+LTWVQC PC+SC++QQDP+FDPS SPSY  V CNSS+
Sbjct: 151 LNYVATVGLGGGEATVIVDTASELTWVQCAPCESCHDQQDPLFDPSSSPSYAAVPCNSSS 210

Query: 193 CHALEFATGN----SGVCS--SSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFI 246
           C AL+ ATG     +  C     S   C+Y +SY DGSY+RG L  + L L    ++ F+
Sbjct: 211 CDALQLATGGTSGGAAACQGQDQSAAACSYTLSYRDGSYSRGVLAHDRLSLAGEVIDGFV 270

Query: 247 FGCGRNNKGL-FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGG 305
           FGCG +N+G  FGG SGLMGLGRS LSLVSQT + FGG+FSYCLP  +++ +SGSL++G 
Sbjct: 271 FGCGTSNQGPPFGGTSGLMGLGRSQLSLVSQTMDQFGGVFSYCLP-LKESDSSGSLVIGD 329

Query: 306 NSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGG----ILIDSGT 361
           +SSV++NSTPI Y +M+ +P    FY +NLTGI++GG+++++SGF+ GG     +IDSGT
Sbjct: 330 DSSVYRNSTPIVYASMVSDPLQGPFYFVNLTGITVGGQEVESSGFSSGGGGGKAIIDSGT 389

Query: 362 VITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEM 421
           VIT L PSIY+A+KAEFL QF+ +P APGFSILDTCFN++  +EV +P +K+ F+G  E+
Sbjct: 390 VITSLVPSIYNAVKAEFLSQFAEYPQAPGFSILDTCFNMTGLREVQVPSLKLVFDGGVEV 449

Query: 422 TVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
            VD  G++YFV SD+SQVCLA+A L  E ET IIGNYQQKN RVI+DT  SQ+GFA E C
Sbjct: 450 EVDSGGVLYFVSSDSSQVCLAMAPLKSEYETNIIGNYQQKNLRVIFDTSGSQVGFAQETC 509


>gi|115479237|ref|NP_001063212.1| Os09g0423500 [Oryza sativa Japonica Group]
 gi|50725896|dbj|BAD33424.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
           sativa Japonica Group]
 gi|50726136|dbj|BAD33657.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
           sativa Japonica Group]
 gi|113631445|dbj|BAF25126.1| Os09g0423500 [Oryza sativa Japonica Group]
 gi|215767614|dbj|BAG99842.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222641596|gb|EEE69728.1| hypothetical protein OsJ_29412 [Oryza sativa Japonica Group]
          Length = 473

 Score =  442 bits (1138), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 229/435 (52%), Positives = 300/435 (68%), Gaps = 14/435 (3%)

Query: 57  HQKSRIEMGAITLELKHKNYCSGKIVDWNEQQQNRLIL------DNLHVQYLQSRIKN-- 108
           H +SR E GA  LEL+H     G     +  +     L      D   V  LQ R     
Sbjct: 40  HLRSRAESGATILELRHHGGGGGGGSGKSGGRSREEELGGLFSSDAARVSSLQRRAGGGS 99

Query: 109 -MISGNIKDVSNTEIPLTSGIRLQTLNYIATIELGGRNMTVIVDTGSDLTWVQCQPCKSC 167
                     +   +P+TSG RL+TLNY+AT+ LGG   TVIVDT S+LTWVQC PC SC
Sbjct: 100 WAEDEAAAAAATGRVPVTSGARLRTLNYVATVGLGGGEATVIVDTASELTWVQCAPCASC 159

Query: 168 YNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSP-PDCNYFVSYGDGSYT 226
           ++QQ P+FDP+ SPSY  + CNSS+C AL+ ATG++         P C+Y +SY DGSY+
Sbjct: 160 HDQQGPLFDPASSPSYAVLPCNSSSCDALQVATGSAAGACGGGEQPSCSYTLSYRDGSYS 219

Query: 227 RGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFS 286
           +G L  + L L    ++ F+FGCG +N+G FGG SGLMGLGRS LSL+SQT + FGG+FS
Sbjct: 220 QGVLAHDKLSLAGEVIDGFVFGCGTSNQGPFGGTSGLMGLGRSQLSLISQTMDQFGGVFS 279

Query: 287 YCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ 346
           YCLP  +++ +SGSL+LG ++SV++NSTPI YT M+ +P    FY +NLTGI+IGG++++
Sbjct: 280 YCLP-LKESESSGSLVLGDDTSVYRNSTPIVYTTMVSDPVQGPFYFVNLTGITIGGQEVE 338

Query: 347 ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEV 406
           +S    G +++DSGT+IT L PS+Y+A+KAEFL QF+ +P APGFSILDTCFNL+ ++EV
Sbjct: 339 SSA---GKVIVDSGTIITSLVPSVYNAVKAEFLSQFAEYPQAPGFSILDTCFNLTGFREV 395

Query: 407 NIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVI 466
            IP +K  FEGN E+ VD +G++YFV SD+SQVCLALASL  E ET IIGNYQQKN RVI
Sbjct: 396 QIPSLKFVFEGNVEVEVDSSGVLYFVSSDSSQVCLALASLKSEYETSIIGNYQQKNLRVI 455

Query: 467 YDTKNSQLGFAGEDC 481
           +DT  SQ+GFA E C
Sbjct: 456 FDTLGSQIGFAQETC 470


>gi|125563764|gb|EAZ09144.1| hypothetical protein OsI_31414 [Oryza sativa Indica Group]
          Length = 472

 Score =  442 bits (1136), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 229/435 (52%), Positives = 300/435 (68%), Gaps = 14/435 (3%)

Query: 57  HQKSRIEMGAITLELKHKNYCSGKIVDWNEQQQNRLIL------DNLHVQYLQSRIKN-- 108
           H +SR E GA  LEL+H     G     +  +     L      D   V  LQ R     
Sbjct: 39  HLRSRAESGATILELRHHGGGGGGGSGKSGGRSREEELGGLFSSDAARVSSLQRRAGGGS 98

Query: 109 -MISGNIKDVSNTEIPLTSGIRLQTLNYIATIELGGRNMTVIVDTGSDLTWVQCQPCKSC 167
                     +   +P+TSG RL+TLNY+AT+ LGG   TVIVDT S+LTWVQC PC SC
Sbjct: 99  WAEDEAAAAAATGRVPVTSGARLRTLNYVATVGLGGGEATVIVDTASELTWVQCAPCASC 158

Query: 168 YNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSP-PDCNYFVSYGDGSYT 226
           ++QQ P+FDP+ SPSY  + CNSS+C AL+ ATG++         P C+Y +SY DGSY+
Sbjct: 159 HDQQGPLFDPASSPSYAVLPCNSSSCDALQVATGSAAGACGGGEQPSCSYTLSYRDGSYS 218

Query: 227 RGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFS 286
           +G L  + L L    ++ F+FGCG +N+G FGG SGLMGLGRS LSL+SQT + FGG+FS
Sbjct: 219 QGVLAHDKLSLAGEVIDGFVFGCGTSNQGPFGGTSGLMGLGRSQLSLISQTMDQFGGVFS 278

Query: 287 YCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ 346
           YCLP  +++ +SGSL+LG ++SV++NSTPI YT M+ +P    FY +NLTGI+IGG++++
Sbjct: 279 YCLP-LKESESSGSLVLGDDTSVYRNSTPIVYTTMVSDPVQGPFYFVNLTGITIGGQEVE 337

Query: 347 ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEV 406
           +S    G +++DSGT+IT L PS+Y+A+KAEFL QF+ +P APGFSILDTCFNL+ ++EV
Sbjct: 338 SSA---GKVIVDSGTIITSLVPSVYNAVKAEFLSQFAEYPQAPGFSILDTCFNLTGFREV 394

Query: 407 NIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVI 466
            IP +K  FEGN E+ VD +G++YFV SD+SQVCLALASL  E ET IIGNYQQKN RVI
Sbjct: 395 QIPSLKFVFEGNVEVEVDSSGVLYFVSSDSSQVCLALASLKSEYETSIIGNYQQKNLRVI 454

Query: 467 YDTKNSQLGFAGEDC 481
           +DT  SQ+GFA E C
Sbjct: 455 FDTLGSQIGFAQETC 469


>gi|357153697|ref|XP_003576537.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 474

 Score =  441 bits (1135), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 229/441 (51%), Positives = 303/441 (68%), Gaps = 19/441 (4%)

Query: 59  KSRIEMGAITLELKHKNYCSGKIVDWNEQQQNR-------LILDNLHVQYLQSRIKNMIS 111
           +SR E G+  LEL+H    S         + +R       L  D   V  LQ RI++  S
Sbjct: 32  RSRTESGSTILELRHHISSSFSPGPNRPSKTSRGEVDGGVLSSDAARVSSLQRRIESYRS 91

Query: 112 GNIKDVSNT-----EIPLTSGIRLQTLNYIATIELGGRNMTVIVDTGSDLTWVQCQPCKS 166
            +  +         ++P+TSG  L+TLNY+AT+ LG    TV+VDT S+LTWVQCQPC+S
Sbjct: 92  SSEGEEEEASKLALQVPITSGANLRTLNYVATVGLGAAEATVVVDTASELTWVQCQPCES 151

Query: 167 CYNQQDPVFDPSISPSYKKVLCNSSTCHALEFA-TGNSGVCSSSSP--PDCNYFVSYGDG 223
           C++QQDP+FDPS SPSY  V CNSS+C AL  A    +  C+  +   P C+Y +SY DG
Sbjct: 152 CHDQQDPLFDPSSSPSYAAVPCNSSSCDALRVAMAAGTSPCADDNEQQPACSYALSYRDG 211

Query: 224 SYTRGELGREHLGLGKASVNDFIFGCGRNNKGL-FGGVSGLMGLGRSDLSLVSQTSEIFG 282
           SY+RG L R+ L L    +  F+FGCG +N+G  FGG SGLMGLGRS +SLVSQT + FG
Sbjct: 212 SYSRGVLARDKLRLAGQDIEGFVFGCGTSNQGAPFGGTSGLMGLGRSHVSLVSQTMDQFG 271

Query: 283 GLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPN--PQLATFYILNLTGISI 340
           G+FSYCLP  +++G+SGSL+LG +SS ++NSTPI YT M+ +  P    FY LNLTGI++
Sbjct: 272 GVFSYCLP-MRESGSSGSLVLGDDSSAYRNSTPIVYTAMVSDSGPLQGPFYFLNLTGITV 330

Query: 341 GGKQLQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNL 400
           GG+++++  F+ G ++IDSGT+IT L PS+Y+A++AEFL Q + +P AP FSILDTCFNL
Sbjct: 331 GGQEVESPWFSAGRVIIDSGTIITTLVPSVYNAVRAEFLSQLAEYPQAPAFSILDTCFNL 390

Query: 401 SAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQ 460
           +  +EV +P +K  FEG+ E+ VD  G++YFV SDASQVCLALASL  E +T IIGNYQQ
Sbjct: 391 TGLKEVQVPSLKFVFEGSVEVEVDSKGVLYFVSSDASQVCLALASLKSEYDTSIIGNYQQ 450

Query: 461 KNQRVIYDTKNSQLGFAGEDC 481
           KN RVI+DT  SQ+GFA E C
Sbjct: 451 KNLRVIFDTLGSQIGFAQETC 471


>gi|359483137|ref|XP_002272278.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 402

 Score =  437 bits (1123), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 213/287 (74%), Positives = 240/287 (83%), Gaps = 2/287 (0%)

Query: 196 LEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKG 255
           +   +GNSGVC S++P  CNY ++YGDGS+TRGELG E L  G   V DFIFGCGRNNKG
Sbjct: 116 IPVTSGNSGVCGSAAP-ICNYAINYGDGSFTRGELGHEKLKFGTILVKDFIFGCGRNNKG 174

Query: 256 LFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTP 315
           LFGGVSGLMGLGRSDLSL+SQTS IFGG+FSYCLPST+  G SGSLILGGNSSV++NS+P
Sbjct: 175 LFGGVSGLMGLGRSDLSLISQTSGIFGGVFSYCLPSTERKG-SGSLILGGNSSVYRNSSP 233

Query: 316 ITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGGILIDSGTVITRLPPSIYSALK 375
           I+Y  MI NPQL  FY +NLTGISIGG  LQA       IL+DSGTVITRLPP+IY ALK
Sbjct: 234 ISYAKMIENPQLYNFYFINLTGISIGGVALQAPSVGPSRILVDSGTVITRLPPTIYKALK 293

Query: 376 AEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSD 435
           AEFLKQF+GFP AP FSILDTCFNLSAYQEV+IP +KM FEGNAE+TVDVTG+ YFVKSD
Sbjct: 294 AEFLKQFTGFPPAPAFSILDTCFNLSAYQEVDIPTIKMHFEGNAELTVDVTGVFYFVKSD 353

Query: 436 ASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
           ASQVCLALASL Y+DE  I+GNYQQKN RVIYDTK +++GFA E CS
Sbjct: 354 ASQVCLALASLEYQDEVAILGNYQQKNLRVIYDTKETKVGFALETCS 400



 Score =  102 bits (254), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 51/98 (52%), Positives = 68/98 (69%), Gaps = 2/98 (2%)

Query: 30  CFEGKKKLHLHKLQWQQKSGSSSSCVSHQKSRIEMGAITLELKHKNYCSGKIVDWNEQQQ 89
           C E K+ L L K+Q + +S + +SC S QKSR EMGA  LE+KH+++CSG   DWNE+ Q
Sbjct: 26  CLEEKRVLSLQKVQPKLQS-TDTSCFS-QKSRREMGATILEMKHRDHCSGVTRDWNEKLQ 83

Query: 90  NRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSG 127
            RL +D   V+ LQSRIK  +  N +DVSN +IP+TSG
Sbjct: 84  KRLTMDAERVKSLQSRIKRTVPSNTEDVSNAQIPVTSG 121


>gi|115475621|ref|NP_001061407.1| Os08g0267300 [Oryza sativa Japonica Group]
 gi|37806402|dbj|BAC99940.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
           sativa Japonica Group]
 gi|113623376|dbj|BAF23321.1| Os08g0267300 [Oryza sativa Japonica Group]
          Length = 524

 Score =  435 bits (1118), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 240/493 (48%), Positives = 314/493 (63%), Gaps = 53/493 (10%)

Query: 37  LHLHKLQWQQKSGSSSSCVSHQKSRIE------------------MGAITLELKHKNYCS 78
           L L +LQW    GSS      Q  R E                       LELKH +  +
Sbjct: 36  LQLRELQW----GSSGQVRYSQSKRFEKKMTGEHKKAAAAARTRTRSTTVLELKHHSLTA 91

Query: 79  GKIVDWNEQQQ---NRLIL-DNLHVQYLQSRIKNMISGNIKDVSNT-------EIPLTSG 127
             I D    Q+    RL+  D      LQ R K   + + K  +         E+PLTSG
Sbjct: 92  --IPDHPAAQETYLRRLLAADEARANSLQLRNKAAFTQSGKKATAAAAAAAGAEVPLTSG 149

Query: 128 IRLQTLNYIATIELGGR--------NMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSI 179
           IR QTLNY+ TI LGG         N+TVIVDTGSDLTWVQC+PC  CY Q+DP+FDPS 
Sbjct: 150 IRFQTLNYVTTIALGGGGSSRAGAGNLTVIVDTGSDLTWVQCKPCSVCYAQRDPLFDPSG 209

Query: 180 SPSYKKVLCNSSTCHA-LEFATGNSGVCSS-------SSPPDCNYFVSYGDGSYTRGELG 231
           S SY  V CN+S C A L+ ATG  G C++            C Y ++YGDGS++RG L 
Sbjct: 210 SASYAAVPCNASACEASLKAATGVPGSCATVGGGGGGGKSERCYYSLAYGDGSFSRGVLA 269

Query: 232 REHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPS 291
            + + LG ASV+ F+FGCG +N+GLFGG +GLMGLGR++LSLVSQT+  FGG+FSYCLP+
Sbjct: 270 TDTVALGGASVDGFVFGCGLSNRGLFGGTAGLMGLGRTELSLVSQTAPRFGGVFSYCLPA 329

Query: 292 TQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA 351
                A+GSL LGG++S ++N+TP++YT MI +P    FY +N+TG S+GG  + A+G  
Sbjct: 330 ATSGDAAGSLSLGGDTSSYRNATPVSYTRMIADPAQPPFYFMNVTGASVGGAAVAAAGLG 389

Query: 352 KGGILIDSGTVITRLPPSIYSALKAEFLKQFSG--FPSAPGFSILDTCFNLSAYQEVNIP 409
              +L+DSGTVITRL PS+Y A++AEF +QF    +P+AP FS+LD C+NL+ + EV +P
Sbjct: 390 AANVLLDSGTVITRLAPSVYRAVRAEFARQFGAERYPAAPPFSLLDACYNLTGHDEVKVP 449

Query: 410 LVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDT 469
           L+ +  EG A+MTVD  G+++  + D SQVCLA+ASLS+ED+T IIGNYQQKN+RV+YDT
Sbjct: 450 LLTLRLEGGADMTVDAAGMLFMARKDGSQVCLAMASLSFEDQTPIIGNYQQKNKRVVYDT 509

Query: 470 KNSQLGFAGEDCS 482
             S+LGFA EDCS
Sbjct: 510 VGSRLGFADEDCS 522


>gi|125560845|gb|EAZ06293.1| hypothetical protein OsI_28528 [Oryza sativa Indica Group]
          Length = 525

 Score =  432 bits (1111), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 238/490 (48%), Positives = 314/490 (64%), Gaps = 46/490 (9%)

Query: 37  LHLHKLQW---------QQKSGSSSSCVSHQKSRIEMGAI-----TLELKHKNYCSGKIV 82
           L L +LQW         Q K         H+K+             LELKH +  +  I 
Sbjct: 36  LQLRELQWGSSGQVRYSQSKHFEKKMTGEHKKAAAAARTRTRSTTVLELKHHSLTA--IP 93

Query: 83  DWNEQQQ---NRLIL-DNLHVQYLQSRIKNMISGNIKDVSNT--------EIPLTSGIRL 130
           D    Q+    RL+  D      LQ R K   + + K  +          E+PLTSGIR 
Sbjct: 94  DHPAAQETYLRRLLAADEARANSLQLRNKAAFTQSGKKATAAAAAAAAGAEVPLTSGIRF 153

Query: 131 QTLNYIATIELGGR--------NMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPS 182
           QTLNY+ TI LGG         N+TVIVDTGSDLTWVQC+PC  CY Q+DP+FDPS S S
Sbjct: 154 QTLNYVTTIALGGGGSSRAGAGNLTVIVDTGSDLTWVQCKPCSVCYAQRDPLFDPSGSAS 213

Query: 183 YKKVLCNSSTCHA-LEFATGNSGVCSS-------SSPPDCNYFVSYGDGSYTRGELGREH 234
           Y  V CN+S C A L+ ATG  G C++            C Y ++YGDGS++RG L  + 
Sbjct: 214 YAAVPCNASACEASLKAATGVPGSCATVGGGGGGGKSERCYYSLAYGDGSFSRGVLATDT 273

Query: 235 LGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQD 294
           + LG ASV+ F+FGCG +N+GLFGG +GLMGLGR++LSLVSQT+  FGG+FSYCLP+   
Sbjct: 274 VALGGASVDGFVFGCGLSNRGLFGGTAGLMGLGRTELSLVSQTAPRFGGVFSYCLPAATS 333

Query: 295 AGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGG 354
             A+GSL LGG++S ++N+TP++YT MI +P    FY +N+TG S+GG  + A+G     
Sbjct: 334 GDAAGSLSLGGDTSSYRNATPVSYTRMIADPAQPPFYFMNVTGASVGGAAVAAAGLGAAN 393

Query: 355 ILIDSGTVITRLPPSIYSALKAEFLKQFSG--FPSAPGFSILDTCFNLSAYQEVNIPLVK 412
           +L+DSGTVITRL PS+Y A++AEF +QF    +P+AP FS+LD C+NL+ + EV +PL+ 
Sbjct: 394 VLLDSGTVITRLAPSVYRAVRAEFARQFGAERYPAAPPFSLLDACYNLTGHDEVKVPLLT 453

Query: 413 MEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNS 472
           +  EG A+MTVD  G+++  + D SQVCLA+ASLS+ED+T IIGNYQQKN+RV+YDT  S
Sbjct: 454 LRLEGGADMTVDAAGMLFMARKDGSQVCLAMASLSFEDQTPIIGNYQQKNKRVVYDTVGS 513

Query: 473 QLGFAGEDCS 482
           +LGFA EDCS
Sbjct: 514 RLGFADEDCS 523


>gi|147776733|emb|CAN74676.1| hypothetical protein VITISV_038368 [Vitis vinifera]
          Length = 389

 Score =  426 bits (1095), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 207/274 (75%), Positives = 232/274 (84%), Gaps = 2/274 (0%)

Query: 200 TGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGG 259
           +GNSGVC S++P  CNY ++YGDGS+TRGELG E L  G   V DFIFGCGRNNKGLFGG
Sbjct: 63  SGNSGVCGSAAPI-CNYAINYGDGSFTRGELGHEKLKFGTILVKDFIFGCGRNNKGLFGG 121

Query: 260 VSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYT 319
           VSGLMGLGRSDLSL+SQTS IFGG+FSYCLPST+  G SGSLILGGNSSV++NS+PI+Y 
Sbjct: 122 VSGLMGLGRSDLSLISQTSGIFGGVFSYCLPSTERKG-SGSLILGGNSSVYRNSSPISYA 180

Query: 320 NMIPNPQLATFYILNLTGISIGGKQLQASGFAKGGILIDSGTVITRLPPSIYSALKAEFL 379
            MI NPQL  FY +NLTGISIGG  LQA       IL+DSGTVITRLPP+IY ALKAEFL
Sbjct: 181 KMIENPQLYNFYFINLTGISIGGVALQAPSVGPSRILVDSGTVITRLPPTIYKALKAEFL 240

Query: 380 KQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV 439
           KQF+GFP AP FSILDTCFNLSAYQEV+IP +KM FEGNAE+TVDVTG+ YFVKSDASQV
Sbjct: 241 KQFTGFPPAPAFSILDTCFNLSAYQEVDIPTIKMHFEGNAELTVDVTGVFYFVKSDASQV 300

Query: 440 CLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQ 473
           CLALASL Y+DE  I+GNYQQKN RVIYDTK ++
Sbjct: 301 CLALASLEYQDEVAILGNYQQKNLRVIYDTKETK 334



 Score = 80.5 bits (197), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 35/64 (54%), Positives = 46/64 (71%)

Query: 64  MGAITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIP 123
           MGA  LE+KH+++CSG   DWNE+ Q RL +D   V+ LQSRIK  +  N +DVSN +IP
Sbjct: 1   MGATILEMKHRDHCSGVTRDWNEKLQKRLTMDAERVKSLQSRIKRTVPSNTEDVSNAQIP 60

Query: 124 LTSG 127
           +TSG
Sbjct: 61  VTSG 64


>gi|357140068|ref|XP_003571594.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 533

 Score =  421 bits (1081), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 227/434 (52%), Positives = 297/434 (68%), Gaps = 28/434 (6%)

Query: 69  LELKHKNYCSGKIVDWNEQQQNR-----LILDNLHVQYLQSRIKN---MISGNIKDVSNT 120
           LELKH  + S   V  +   + R     L  D+     LQ R        +      +  
Sbjct: 108 LELKH--HSSTATVPDHPAARERYLKHLLAADSARAASLQLRKPKPASSTTTTQASAAAA 165

Query: 121 EIPLTSGIRLQTLNYIATIELGG---RNMTVIVDTGSDLTWVQCQPC--KSCYNQQDPVF 175
           E+PL SGIR QTLNY+ TI LGG   +N+TVIVDTGSDLTWVQC+PC   SCY Q+DP+F
Sbjct: 166 EVPLGSGIRYQTLNYVTTIALGGGGAKNLTVIVDTGSDLTWVQCEPCPGSSCYAQRDPLF 225

Query: 176 DPSISPSYKKVLCNSSTCHA-LEFATGNSGVCSSS---SPPDCNYFVSYGDGSYTRGELG 231
           DP+ SP++  V C S  C A L+ ATG  G C+ S   S   C Y +SYGDGS++RG L 
Sbjct: 226 DPAASPTFAAVPCGSPACAASLKDATGAPGSCARSAGNSEQRCYYALSYGDGSFSRGVLA 285

Query: 232 REHLGLGKAS-VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLP 290
           ++ LGLG  + ++ F+FGCG +N+GLFGG +GLMGLGR+DLSLVSQT+  FGG+FSYCLP
Sbjct: 286 QDTLGLGTTTKLDGFVFGCGLSNRGLFGGTAGLMGLGRTDLSLVSQTAARFGGVFSYCLP 345

Query: 291 STQDAGASGSLILG-GNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQ-LQAS 348
           +T  +  +GSL LG G SS F N   + YT MI +P    FY +N+TG ++GG   L A 
Sbjct: 346 ATTTS--TGSLSLGPGPSSSFPN---MAYTRMIADPTQPPFYFINITGAAVGGGAALTAP 400

Query: 349 GFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNI 408
           GF  G +L+DSGTVITRL PS+Y A++AEF ++F  +P+APGFSILD C++L+   EVN+
Sbjct: 401 GFGAGNVLVDSGTVITRLAPSVYKAVRAEFARRFE-YPAAPGFSILDACYDLTGRDEVNV 459

Query: 409 PLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYD 468
           PL+ +  EG A++TVD  G+++ V+ D SQVCLA+ASL YED+T IIGNYQQ+N+RV+YD
Sbjct: 460 PLLTLTLEGGAQVTVDAAGMLFVVRKDGSQVCLAMASLPYEDQTPIIGNYQQRNKRVVYD 519

Query: 469 TKNSQLGFAGEDCS 482
           T  S+LGFA EDC+
Sbjct: 520 TVGSRLGFADEDCT 533


>gi|388498308|gb|AFK37220.1| unknown [Lotus japonicus]
          Length = 363

 Score =  412 bits (1060), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 204/340 (60%), Positives = 259/340 (76%), Gaps = 11/340 (3%)

Query: 2   VTKVKPLTILSLLLPLMVSLFLLAKGAHCFEGKKKLHLHKLQWQQKSGSSSSCVSHQKSR 61
           + KVK L ++SL L       ++A G   FE KK  +L  LQ +Q+ GS   C+ H +SR
Sbjct: 21  MVKVKALLLVSLCL-------IIANGVSSFEEKKVFNLQILQRKQQLGSLG-CL-HPESR 71

Query: 62  IEMGAITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTE 121
            E GAI LE+K ++YCS K V+W+ +  N+L LD+LHV+ +Q+R++ M+S +  +VS  +
Sbjct: 72  QEKGAIMLEMKDRSYCSKKKVNWHRKLHNQLTLDDLHVRSMQNRLRKMVSSHSVEVSQIQ 131

Query: 122 IPLTSGIRLQTLNYIATIELGGRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISP 181
           IPL SG+  QTLNYI T+ELGG++MTVI+DTGSDLTWVQC+PC SCYNQQ PVF PS S 
Sbjct: 132 IPLASGVNFQTLNYIVTMELGGQDMTVIIDTGSDLTWVQCEPCMSCYNQQGPVFKPSTSS 191

Query: 182 SYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS 241
           SY+ + CNSSTC +L+  TGN+G C  S+P +C+Y V+YGDGSYT GELG EHL  G  S
Sbjct: 192 SYQSIPCNSSTCQSLQLTTGNAGAC-ESNPSNCSYAVNYGDGSYTNGELGAEHLSFGGIS 250

Query: 242 VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSL 301
           V++F+FGCG+NNKGLFGGVSGLMGLGRS+LSL+SQT+  FGG+FSYCLP T DAGASGSL
Sbjct: 251 VSNFVFGCGKNNKGLFGGVSGLMGLGRSNLSLISQTNSTFGGVFSYCLPPT-DAGASGSL 309

Query: 302 ILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIG 341
            +G  SSVFKN TPI YT M+PNPQL+ FY+LNLTGI +G
Sbjct: 310 AMGNESSVFKNLTPIAYTRMVPNPQLSNFYMLNLTGIDVG 349


>gi|218200805|gb|EEC83232.1| hypothetical protein OsI_28526 [Oryza sativa Indica Group]
          Length = 450

 Score =  362 bits (928), Expect = 3e-97,   Method: Compositional matrix adjust.
 Identities = 206/444 (46%), Positives = 275/444 (61%), Gaps = 52/444 (11%)

Query: 69  LELKHKNYCSGKIVDWNEQQQ---NRLIL-DNLHVQYLQSRIKNMISGNIKDVSNT---- 120
           LELKH +  +  I D    Q+    RL+  D      LQ R K   + + K  +      
Sbjct: 27  LELKHHSLTA--IPDHPAAQETYLRRLLAADEARANSLQLRNKAAFTQSGKKATAAAAAA 84

Query: 121 ----EIPLTSGIRLQTLNYIATIELGGR--------NMTVIVDTGSDLTWVQCQPCKSCY 168
               E+PLTSGIR QTLNY+ TI LGG         N+TVIVDTGSDLTWVQC+PC  CY
Sbjct: 85  AAGAEVPLTSGIRFQTLNYVTTIALGGGGSSRAGAGNLTVIVDTGSDLTWVQCKPCSVCY 144

Query: 169 NQQDPVFDPSISPSYKKVLCNSSTCHA-LEFATGNSGVCSS-------SSPPDCNYFVSY 220
            Q+DP+FDPS S SY  V CN+S C A L+ ATG  G C++            C Y ++Y
Sbjct: 145 AQRDPLFDPSGSASYAAVPCNASACEASLKAATGVPGSCATVGGGGGGGKSERCYYSLAY 204

Query: 221 GDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEI 280
           GDGS++RG L  + + LG ASV+ F+FGCG +N+GL           R   +  S T+  
Sbjct: 205 GDGSFSRGVLATDTVALGGASVDGFVFGCGLSNRGL----------RRPGSAASSPTAS- 253

Query: 281 FGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISI 340
                    P      A+GSL LGG++S ++N+TP++YT MI +P    FY +N+TG S+
Sbjct: 254 ---------PPGTSGDAAGSLSLGGDTSSYRNATPVSYTRMIADPAQPPFYFMNVTGASV 304

Query: 341 GGKQLQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSG--FPSAPGFSILDTCF 398
           GG  + A+G     +L+DSGTVITRL PS+Y A++AEF +QF    +P+AP FS+LD C+
Sbjct: 305 GGAAVAAAGLGAANVLLDSGTVITRLAPSVYRAVRAEFARQFGAERYPAAPPFSLLDACY 364

Query: 399 NLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNY 458
           NL+ + EV +PL+ +  E  A+MTVD  G+++  + D SQVCLA+ASLS+ED+T IIGNY
Sbjct: 365 NLTGHDEVKVPLLTLRLEAGADMTVDAAGMLFMARKDGSQVCLAMASLSFEDQTPIIGNY 424

Query: 459 QQKNQRVIYDTKNSQLGFAGEDCS 482
           QQKN+RV+YDT  S+LGFA EDCS
Sbjct: 425 QQKNKRVVYDTVGSRLGFADEDCS 448


>gi|298204765|emb|CBI25263.3| unnamed protein product [Vitis vinifera]
          Length = 359

 Score =  357 bits (917), Expect = 5e-96,   Method: Compositional matrix adjust.
 Identities = 182/283 (64%), Positives = 202/283 (71%), Gaps = 45/283 (15%)

Query: 200 TGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGG 259
           +GNSGVC S++P  CNY ++YGDGS+TRGELG E L  G   V DFIFGCGRNNKGLFGG
Sbjct: 120 SGNSGVCGSAAPI-CNYAINYGDGSFTRGELGHEKLKFGTILVKDFIFGCGRNNKGLFGG 178

Query: 260 VSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYT 319
           VSGLMGLGRSDLSL+SQTSE                                        
Sbjct: 179 VSGLMGLGRSDLSLISQTSE---------------------------------------- 198

Query: 320 NMIPNPQLATFYILNLTGISIGGKQLQASGFAKGGILIDSGTVITRLPPSIYSALKAEFL 379
               NPQL  FY +NLTGISIGG  LQA       IL+DSGTVITRLPP+IY ALKAEFL
Sbjct: 199 ----NPQLYNFYFINLTGISIGGVALQAPSVGPSRILVDSGTVITRLPPTIYKALKAEFL 254

Query: 380 KQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV 439
           KQF+GFP AP FSILDTCFNLSAYQEV+IP +KM FEGNAE+TVDVTG+ YFVKSDASQV
Sbjct: 255 KQFTGFPPAPAFSILDTCFNLSAYQEVDIPTIKMHFEGNAELTVDVTGVFYFVKSDASQV 314

Query: 440 CLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
           CLALASL Y+DE  I+GNYQQKN RVIYDTK +++GFA E CS
Sbjct: 315 CLALASLEYQDEVAILGNYQQKNLRVIYDTKETKVGFALETCS 357



 Score =  102 bits (254), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 51/98 (52%), Positives = 68/98 (69%), Gaps = 2/98 (2%)

Query: 30  CFEGKKKLHLHKLQWQQKSGSSSSCVSHQKSRIEMGAITLELKHKNYCSGKIVDWNEQQQ 89
           C E K+ L L K+Q + +S + +SC S QKSR EMGA  LE+KH+++CSG   DWNE+ Q
Sbjct: 26  CLEEKRVLSLQKVQPKLQS-TDTSCFS-QKSRREMGATILEMKHRDHCSGVTRDWNEKLQ 83

Query: 90  NRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSG 127
            RL +D   V+ LQSRIK  +  N +DVSN +IP+TSG
Sbjct: 84  KRLTMDAERVKSLQSRIKRTVPSNTEDVSNAQIPVTSG 121


>gi|125602787|gb|EAZ42112.1| hypothetical protein OsJ_26672 [Oryza sativa Japonica Group]
          Length = 477

 Score =  341 bits (875), Expect = 4e-91,   Method: Compositional matrix adjust.
 Identities = 211/493 (42%), Positives = 273/493 (55%), Gaps = 100/493 (20%)

Query: 37  LHLHKLQWQQKSGSSSSCVSHQKSRIE------------------MGAITLELKHKNYCS 78
           L L +LQW    GSS      Q  R E                       LELKH +  +
Sbjct: 36  LQLRELQW----GSSGQVRYSQSKRFEKKMTGEHKKAAAAARTRTRSTTVLELKHHSLTA 91

Query: 79  GKIVDWNEQQQ---NRLIL-DNLHVQYLQSRIKNMISGNIKDVSNT-------EIPLTSG 127
             I D    Q+    RL+  D      LQ R K   + + K  +         E+PLTSG
Sbjct: 92  --IPDHPAAQETYLRRLLAADEARANSLQLRNKAAFTQSGKKATAAAAAAAGAEVPLTSG 149

Query: 128 IRLQTLNYIATIELGGR--------NMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSI 179
           IR QTLNY+ TI LGG         N+TVIVDTGSDLTWVQC+PC  CY Q+DP+FDPS 
Sbjct: 150 IRFQTLNYVTTIALGGGGSSRAGAGNLTVIVDTGSDLTWVQCKPCSVCYAQRDPLFDPSG 209

Query: 180 SPSYKKVLCNSSTCHA-LEFATGNSGVCSS-------SSPPDCNYFVSYGDGSYTRGELG 231
           S SY  V CN+S C A L+ ATG  G C++            C Y ++YGDGS++RG L 
Sbjct: 210 SASYAAVPCNASACEASLKAATGVPGSCATVGGGGGGGKSERCYYSLAYGDGSFSRGVLA 269

Query: 232 REHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPS 291
            + + LG ASV+ F+FGCG +N+GLFGG +GLMGLG                      P 
Sbjct: 270 TDTVALGGASVDGFVFGCGLSNRGLFGGTAGLMGLG----------------------PD 307

Query: 292 TQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA 351
              AG                         +P+     FY +N+TG S+GG  + A+G  
Sbjct: 308 GALAG-------------------------LPDGAPPPFYFMNVTGASVGGAAVAAAGLG 342

Query: 352 KGGILIDSGTVITRLPPSIYSALKAEFLKQFSG--FPSAPGFSILDTCFNLSAYQEVNIP 409
              +L+DSGTVITRL PS+Y A++AEF +QF    +P+AP FS+LD C+NL+ + EV +P
Sbjct: 343 AANVLLDSGTVITRLAPSVYRAVRAEFARQFGAERYPAAPPFSLLDACYNLTGHDEVKVP 402

Query: 410 LVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDT 469
           L+ +  EG A+MTVD  G+++  + D SQVCLA+ASLS+ED+T IIGNYQQKN+RV+YDT
Sbjct: 403 LLTLRLEGGADMTVDAAGMLFMARKDGSQVCLAMASLSFEDQTPIIGNYQQKNKRVVYDT 462

Query: 470 KNSQLGFAGEDCS 482
             S+LGFA EDCS
Sbjct: 463 VGSRLGFADEDCS 475


>gi|225430555|ref|XP_002285593.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
          Length = 481

 Score =  337 bits (864), Expect = 9e-90,   Method: Compositional matrix adjust.
 Identities = 204/490 (41%), Positives = 286/490 (58%), Gaps = 28/490 (5%)

Query: 7   PLTILSLLLPLMVSLFLLAKGAHCFEGKKKL-----HLHKLQWQQKSGSSSSCVSHQKSR 61
           P++ + LL  L+ S  L +K    F+G+K        LH +       SS   V     +
Sbjct: 4   PISTIFLLKFLLYSALLSSKRGLAFQGRKTALSTPSTLHNVHITSLMPSS---VCSPSPK 60

Query: 62  IEMGAITLELKHKNYCSGKIVDWNEQQQNR---LILDNLHVQYLQSRI-KNMISGNIKDV 117
            +    +LE+ HK+    K+     +  +R   L  D   V  ++SR+ KN   G     
Sbjct: 61  GDDKRASLEVIHKHGPCSKLSQDKGRSPSRTQMLDQDESRVNSIRSRLAKNPADGGKLKG 120

Query: 118 SNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPC-KSCYNQQDPV 174
           S   +P  SG  + T NY+ T+ LG   R++T I DTGSDLTW QC+PC + CY+QQ+P+
Sbjct: 121 SKVTLPSKSGSTIGTGNYVVTVGLGTPKRDLTFIFDTGSDLTWTQCEPCARYCYHQQEPI 180

Query: 175 FDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREH 234
           F+PS S SY  + C+S TC  L+  TGNS  CS+S+   C Y + YGD SY+ G   ++ 
Sbjct: 181 FNPSKSTSYTNISCSSPTCDELKSGTGNSPSCSAST---CVYGIQYGDQSYSVGFFAQDK 237

Query: 235 LGLGKASV-NDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQ 293
           L L    V N+F+FGCG+NN+GLF GV+GL+GLGR+ LSLVSQT++ +G LFSYCLPST 
Sbjct: 238 LALTSTDVFNNFLFGCGQNNRGLFVGVAGLIGLGRNALSLVSQTAQKYGKLFSYCLPST- 296

Query: 294 DAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ--ASGFA 351
            + ++G L  G        S  + +T  + N Q  +FY LNL  IS+GG++L   AS F+
Sbjct: 297 -SSSTGYLTFGSGGGT---SKAVKFTPSLVNSQGPSFYFLNLIAISVGGRKLSTSASVFS 352

Query: 352 KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLV 411
             G +IDSGTVI+RLPP+ YS L+A F +Q S +P A   SILDTC++ S Y  V++P +
Sbjct: 353 TAGTIIDSGTVISRLPPTAYSDLRASFQQQMSKYPKAAPASILDTCYDFSQYDTVDVPKI 412

Query: 412 KMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKN 471
            + F   AEM +D +GI Y +  + SQVCLA A  S   +  I+GN QQK   V+YD   
Sbjct: 413 NLYFSDGAEMDLDPSGIFYIL--NISQVCLAFAGNSDATDIAILGNVQQKTFDVVYDVAG 470

Query: 472 SQLGFAGEDC 481
            ++GFA   C
Sbjct: 471 GRIGFAPGGC 480


>gi|225430551|ref|XP_002283470.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 490

 Score =  335 bits (860), Expect = 3e-89,   Method: Compositional matrix adjust.
 Identities = 199/489 (40%), Positives = 281/489 (57%), Gaps = 24/489 (4%)

Query: 7   PLTILSLLLPLMVSLFLLAKGAHCFEGKKKLHLHKLQWQQKS------GSSSSCVSHQKS 60
           P++ + LL  L+ +  L  K     EG++    H +Q    +        SS+C    K 
Sbjct: 11  PISTICLLRFLLYASLLSLKSGFAIEGRESAESHHVQPIHHNVHITSLMPSSACSPSPKG 70

Query: 61  RIEMGAITLELKHKNYCSGKIVDWNEQQQNRLIL-DNLHVQYLQSRI-KNMISGNIKDVS 118
             +  ++ +  KH      +    N     +++  D   V  +QSR+ KN+  G+    S
Sbjct: 71  HDQRASLEVVHKHGPCSKLRPHKANSPSHTQILAQDESRVASIQSRLAKNLAGGSNLKAS 130

Query: 119 NTEIPLTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKS-CYNQQDPVF 175
              +P  S   L + NY+ T+ LG   R++T I DTGSDLTW QC+PC   CY Q++ +F
Sbjct: 131 KATLPSKSASTLGSGNYVVTVGLGSPKRDLTFIFDTGSDLTWTQCEPCVGYCYQQREHIF 190

Query: 176 DPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHL 235
           DPS S SY  V C+S +C  LE ATGNS  CSSS+   C Y + YGDGSY+ G   RE L
Sbjct: 191 DPSTSLSYSNVSCDSPSCEKLESATGNSPGCSSST---CLYGIRYGDGSYSIGFFAREKL 247

Query: 236 GLGKASV-NDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQD 294
            L    V N+F FGCG+NN+GLFGG +GL+GL R+ LSLVSQT++ +G +FSYCLPS+  
Sbjct: 248 SLTSTDVFNNFQFGCGQNNRGLFGGTAGLLGLARNPLSLVSQTAQKYGKVFSYCLPSSSS 307

Query: 295 AGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ--ASGFAK 352
           +  +G L  G       +S  + +T    N    +FY L++ GIS+G ++L    S F+ 
Sbjct: 308 S--TGYLSFGSGDG---DSKAVKFTPSEVNSDYPSFYFLDMVGISVGERKLPIPKSVFST 362

Query: 353 GGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVK 412
            G +IDSGTVI+RLPP++YS+++  F +  S +P   G SILDTC++LS Y+ V +P + 
Sbjct: 363 AGTIIDSGTVISRLPPTVYSSVQKVFRELMSDYPRVKGVSILDTCYDLSKYKTVKVPKII 422

Query: 413 MEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNS 472
           + F G AEM +   GI+Y +K   SQVCLA A  S +DE  IIGN QQK   V+YD    
Sbjct: 423 LYFSGGAEMDLAPEGIIYVLK--VSQVCLAFAGNSDDDEVAIIGNVQQKTIHVVYDDAEG 480

Query: 473 QLGFAGEDC 481
           ++GFA   C
Sbjct: 481 RVGFAPSGC 489


>gi|24430421|dbj|BAC22609.1| 41 kD chloroplast nucleoid DNA binding protein (CND41) [Nicotiana
           sylvestris]
          Length = 502

 Score =  323 bits (828), Expect = 1e-85,   Method: Compositional matrix adjust.
 Identities = 209/514 (40%), Positives = 290/514 (56%), Gaps = 49/514 (9%)

Query: 1   MVTKVKPLTILSLLLPLMVSLFLLAKGAHCFEGKKKL--HLHKLQWQQKSGSSSSCVSHQ 58
           M T+   L   S    L++  F + K +H  E K+ +  H H LQ       SSSC +  
Sbjct: 6   MATRSYFLLFSSFTFLLILLSFPVEK-SHALEAKETIESHFHTLQLTSLL-PSSSCNTAT 63

Query: 59  KSRIEMGAITLELKHKNYCSGKIVDWNEQQQNRLIL------DNLHVQYLQSRIKNM--- 109
           K +   GA +LE+ ++    G     N++      L      D   V  +Q+R+ +    
Sbjct: 64  KGK-RRGA-SLEVVNRQ---GPCTQLNQKGAKAPTLTEILAHDQARVDSIQARVTDQSYD 118

Query: 110 ----------ISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLT 157
                             S   +P  SG+ L T NYI  + LG   +++++I DTGSDLT
Sbjct: 119 LFKKKDKKSSNKKKSVKDSKANLPAQSGLPLGTGNYIVNVGLGTPKKDLSLIFDTGSDLT 178

Query: 158 WVQCQPC-KSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNY 216
           W QCQPC KSCY QQ P+FDPS S +Y  + C S+ C  L+ ATGNS  CSSS   +C Y
Sbjct: 179 WTQCQPCVKSCYAQQQPIFDPSASKTYSNISCTSTACSGLKSATGNSPGCSSS---NCVY 235

Query: 217 FVSYGDGSYTRGELGREHLGLGKASVND-FIFGCGRNNKGLFGGVSGLMGLGRSDLSLVS 275
            + YGD S+T G   ++ L L +  V D F+FGCG+NN+GLFG  +GL+GLGR  LS+V 
Sbjct: 236 GIQYGDSSFTVGFFAKDTLTLTQNDVFDGFMFGCGQNNRGLFGKTAGLIGLGRDPLSIVQ 295

Query: 276 QTSEIFGGLFSYCLPSTQDAGASGSLILG-GN----SSVFKNSTPITYTNMIPNPQLATF 330
           QT++ FG  FSYCLP+++  G++G L  G GN    S   KN   IT+T    + Q ATF
Sbjct: 296 QTAQKFGKYFSYCLPTSR--GSNGHLTFGNGNGVKTSKAVKNG--ITFTPF-ASSQGATF 350

Query: 331 YILNLTGISIGGKQLQASG--FAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSA 388
           Y +++ GIS+GGK L  S   F   G +IDSGTVITRLP ++Y +LK+ F +  S +P+A
Sbjct: 351 YFIDVLGISVGGKALSISPMLFQNAGTIIDSGTVITRLPSTVYGSLKSTFKQFMSKYPTA 410

Query: 389 PGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSY 448
           P  S+LDTC++LS Y  ++IP +   F GNA + ++  GI+  + + ASQVCLA A    
Sbjct: 411 PALSLLDTCYDLSNYTSISIPKISFNFNGNANVDLEPNGIL--ITNGASQVCLAFAGNGD 468

Query: 449 EDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
           +D  GI GN QQ+   V+YD    QLGF  + CS
Sbjct: 469 DDTIGIFGNIQQQTLEVVYDVAGGQLGFGYKGCS 502


>gi|2541876|dbj|BAA22813.1| CND41, chloroplast nucleoid DNA binding protein [Nicotiana tabacum]
          Length = 502

 Score =  315 bits (806), Expect = 4e-83,   Method: Compositional matrix adjust.
 Identities = 177/373 (47%), Positives = 238/373 (63%), Gaps = 21/373 (5%)

Query: 121 EIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPC-KSCYNQQDPVFDP 177
            +P  SG+ L T NYI  + LG   +++++I DTGSDLTW QCQPC KSCY QQ P+FDP
Sbjct: 140 NLPAQSGLPLGTGNYIVNVGLGTPKKDLSLIFDTGSDLTWTQCQPCVKSCYAQQQPIFDP 199

Query: 178 SISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL 237
           S S +Y  + C S+ C +L+ ATGNS  CSSS   +C Y + YGD S+T G   ++ L L
Sbjct: 200 STSKTYSNISCTSAACSSLKSATGNSPGCSSS---NCVYGIQYGDSSFTIGFFAKDKLTL 256

Query: 238 GKASVND-FIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAG 296
            +  V D F+FGCG+NNKGLFG  +GL+GLGR  LS+V QT++ FG  FSYCLP+++  G
Sbjct: 257 TQNDVFDGFMFGCGQNNKGLFGKTAGLIGLGRDPLSIVQQTAQKFGKYFSYCLPTSR--G 314

Query: 297 ASGSLILG-GN----SSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASG-- 349
           ++G L  G GN    S   KN   IT+T    + Q   +Y +++ GIS+GGK L  S   
Sbjct: 315 SNGHLTFGNGNGVKASKAVKNG--ITFTPF-ASSQGTAYYFIDVLGISVGGKALSISPML 371

Query: 350 FAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIP 409
           F   G +IDSGTVITRLP + Y +LK+ F +  S +P+AP  S+LDTC++LS Y  ++IP
Sbjct: 372 FQNAGTIIDSGTVITRLPSTAYGSLKSAFKQFMSKYPTAPALSLLDTCYDLSNYTSISIP 431

Query: 410 LVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDT 469
            +   F GNA + +D  GI+  + + ASQVCLA A    +D  GI GN QQ+   V+YD 
Sbjct: 432 KISFNFNGNANVELDPNGIL--ITNGASQVCLAFAGNGDDDSIGIFGNIQQQTLEVVYDV 489

Query: 470 KNSQLGFAGEDCS 482
              QLGF  + CS
Sbjct: 490 AGGQLGFGYKGCS 502


>gi|356513697|ref|XP_003525547.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 252

 Score =  311 bits (797), Expect = 5e-82,   Method: Compositional matrix adjust.
 Identities = 152/234 (64%), Positives = 189/234 (80%), Gaps = 1/234 (0%)

Query: 71  LKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRL 130
           +K + +CS K +DWN + Q +LILD+L V+ +Q+RI+ + S +  + S T+IPL+SGI L
Sbjct: 1   MKDRGHCSEKKIDWNRRLQKQLILDDLRVRSMQNRIRRVASTHNVEASQTQIPLSSGINL 60

Query: 131 QTLNYIATIELGGRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNS 190
           QTLNYI T+ LG +NMTVI+DT SDLTWVQC+PC SCYNQQ P+F PS S SY+ V CNS
Sbjct: 61  QTLNYIVTMGLGSKNMTVIIDTRSDLTWVQCEPCMSCYNQQGPIFKPSTSSSYQSVSCNS 120

Query: 191 STCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCG 250
           STC +L+FATGN+G C SS+P  CNY V+YGDGSYT G+LG E L  G  SV+DF+FGCG
Sbjct: 121 STCQSLQFATGNTGACGSSNPSTCNYVVNYGDGSYTNGDLGVEALSFGGVSVSDFVFGCG 180

Query: 251 RNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILG 304
           RNNKGLFGGVSGLMGLGRS LSLVSQT+  FGG+FSYCLP+T +AG+SGSL++G
Sbjct: 181 RNNKGLFGGVSGLMGLGRSYLSLVSQTNATFGGVFSYCLPTT-EAGSSGSLVMG 233


>gi|116787398|gb|ABK24493.1| unknown [Picea sitchensis]
          Length = 479

 Score =  308 bits (788), Expect = 5e-81,   Method: Compositional matrix adjust.
 Identities = 188/489 (38%), Positives = 267/489 (54%), Gaps = 38/489 (7%)

Query: 11  LSLLLPLMVSLFLLAKG-AHCFEGKKKLHLHKLQWQQKSGSSSSC-VSHQKSRIEMGAIT 68
           + ++  LM+   L+    A   E    + +  L+W+ K    + C  S          + 
Sbjct: 13  IRVVAALMLQCLLMGSSTALDHENYHTISVDILKWKWKPPGFAKCPASFAGQEALKPGVK 72

Query: 69  LELKH--------KNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNT 120
           + L H        +   S   +D   Q  +R   DN  +  + S  KN  +G    +SN 
Sbjct: 73  IRLDHIHGACSPLRPINSSSWIDMVSQSFDR---DNDRLNTIWS--KN--NGTYSTMSN- 124

Query: 121 EIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPS 178
            +PL  G ++ T NYI T   G   +N  +I+DTGSD+TW+QC+PC  CY+Q DP+F+P 
Sbjct: 125 -LPLQPGSKVGTGNYIVTAGFGTPAKNSLLIIDTGSDVTWIQCKPCSDCYSQVDPIFEPQ 183

Query: 179 ISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG 238
            S SYK + C SS C   E  T N   C       C Y ++YGDGS ++G+  +E L LG
Sbjct: 184 QSSSYKHLSCLSSAC--TELTTMNH--CRLGG---CVYEINYGDGSRSQGDFSQETLTLG 236

Query: 239 KASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGAS 298
             S   F FGCG  N GLF G +GL+GLGR+ LS  SQT   +GG FSYCLP    + ++
Sbjct: 237 SDSFPSFAFGCGHTNTGLFKGSAGLLGLGRTALSFPSQTKSKYGGQFSYCLPDFVSSTST 296

Query: 299 GSLILGGNSSVFKNSTPIT--YTNMIPNPQLATFYILNLTGISIGGKQLQ--ASGFAKGG 354
           GS  +G      + S P T  +  ++ N    +FY + L GIS+GG++L    +   +GG
Sbjct: 297 GSFSVG------QGSIPATATFVPLVSNSNYPSFYFVGLNGISVGGERLSIPPAVLGRGG 350

Query: 355 ILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKME 414
            ++DSGTVITRL P  Y ALK  F  +    PSA  FSILDTC++LS+Y +V IP +   
Sbjct: 351 TIVDSGTVITRLVPQAYDALKTSFRSKTRNLPSAKPFSILDTCYDLSSYSQVRIPTITFH 410

Query: 415 FEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQL 474
           F+ NA++ V   GI++ ++SD SQVCLA AS S    T IIGN+QQ+  RV +DT   ++
Sbjct: 411 FQNNADVAVSAVGILFTIQSDGSQVCLAFASASQSISTNIIGNFQQQRMRVAFDTGAGRI 470

Query: 475 GFAGEDCSS 483
           GFA   C++
Sbjct: 471 GFAPGSCAT 479


>gi|356527091|ref|XP_003532147.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 482

 Score =  306 bits (783), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 168/398 (42%), Positives = 241/398 (60%), Gaps = 21/398 (5%)

Query: 94  LDNLHVQYLQSRIKNMISG--NIKDVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVI 149
           LDN  V+Y+QSR+   + G   +K++ +T +P  SG  + + +Y   + LG   R++++I
Sbjct: 97  LDNERVKYIQSRLSKNLGGENRVKELDSTTLPAKSGRLIGSADYYVVVGLGTPKRDLSLI 156

Query: 150 VDTGSDLTWVQCQPCK-SCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSS 208
            DTGS LTW QC+PC  SCY QQDP+FDPS S SY  + C SS C         S  CSS
Sbjct: 157 FDTGSYLTWTQCEPCAGSCYKQQDPIFDPSKSSSYTNIKCTSSLCTQFR-----SAGCSS 211

Query: 209 SSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS-VNDFIFGCGRNNKGLFGGVSGLMGLG 267
           S+   C Y V YGD S +RG L +E L +     V+DF+FGCG++N+GLF G +GLMGL 
Sbjct: 212 STDASCIYDVKYGDNSISRGFLSQERLTITATDIVHDFLFGCGQDNEGLFRGTAGLMGLS 271

Query: 268 RSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQL 327
           R  +S V QTS I+  +FSYCLPST  +   G L  G +++   N   + YT        
Sbjct: 272 RHPISFVQQTSSIYNKIFSYCLPSTPSS--LGHLTFGASAATNAN---LKYTPFSTISGE 326

Query: 328 ATFYILNLTGISIGGKQLQA---SGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSG 384
            +FY L++ GIS+GG +L A   S F+ GG +IDSGTVITRLPP+ Y+AL++ F +    
Sbjct: 327 NSFYGLDIVGISVGGTKLPAVSSSTFSAGGSIIDSGTVITRLPPTAYAALRSAFRQFMMK 386

Query: 385 FPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALA 444
           +P A G  +LDTC++ S Y+E+++P +  EF G  ++ + + GI+Y     A Q+CLA A
Sbjct: 387 YPVAYGTRLLDTCYDFSGYKEISVPRIDFEFAGGVKVELPLVGILY--GESAQQLCLAFA 444

Query: 445 SLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
           +    ++  I GN QQK   V+YD +  ++GF    C+
Sbjct: 445 ANGNGNDITIFGNVQQKTLEVVYDVEGGRIGFGAAGCN 482


>gi|8979711|emb|CAB96832.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
           thaliana]
          Length = 446

 Score =  305 bits (780), Expect = 4e-80,   Method: Compositional matrix adjust.
 Identities = 179/445 (40%), Positives = 262/445 (58%), Gaps = 21/445 (4%)

Query: 48  SGSSSSCVSHQKSRIEMGAITLELKHKNYCSGKIVDWNEQQQNR---LILDNLHVQYLQS 104
           S S+ S + H      +   +L + H++    ++ +      +    L LD   V  + S
Sbjct: 13  SKSALSSLHHHHLVFFLPESSLHVTHRHGTCSRLNNGKATSPDHVEILRLDQARVNSIHS 72

Query: 105 RI-KNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQC 161
           ++ K + + ++ +  +T++P   G  L + NYI T+ LG    ++++I DTGSDLTW QC
Sbjct: 73  KLSKKLATDHVSESKSTDLPAKDGSTLGSGNYIVTVGLGTPKNDLSLIFDTGSDLTWTQC 132

Query: 162 QPC-KSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSY 220
           QPC ++CY+Q++P+F+PS S SY  V C+S+ C +L  ATGN+G CS+S   +C Y + Y
Sbjct: 133 QPCVRTCYDQKEPIFNPSKSTSYYNVSCSSAACGSLSSATGNAGSCSAS---NCIYGIQY 189

Query: 221 GDGSYTRGELGREHLGLGKASVNDFI-FGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSE 279
           GD S++ G L +E   L  + V D + FGCG NN+GLF GV+GL+GLGR  LS  SQT+ 
Sbjct: 190 GDQSFSVGFLAKEKFTLTNSDVFDGVYFGCGENNQGLFTGVAGLLGLGRDKLSFPSQTAT 249

Query: 280 IFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGIS 339
            +  +FSYCLPS+  A  +G L  G  S+    S   T  + I +    +FY LN+  I+
Sbjct: 250 AYNKIFSYCLPSS--ASYTGHLTFG--SAGISRSVKFTPISTITDG--TSFYGLNIVAIT 303

Query: 340 IGGKQLQ--ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTC 397
           +GG++L   ++ F+  G LIDSGTVITRLPP  Y+AL++ F  + S +P+  G SILDTC
Sbjct: 304 VGGQKLPIPSTVFSTPGALIDSGTVITRLPPKAYAALRSSFKAKMSKYPTTSGVSILDTC 363

Query: 398 FNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGN 457
           F+LS ++ V IP V   F G A + +   GI Y  K   SQVCLA A  S +    I GN
Sbjct: 364 FDLSGFKTVTIPKVAFSFSGGAVVELGSKGIFYVFK--ISQVCLAFAGNSDDSNAAIFGN 421

Query: 458 YQQKNQRVIYDTKNSQLGFAGEDCS 482
            QQ+   V+YD    ++GFA   CS
Sbjct: 422 VQQQTLEVVYDGAGGRVGFAPNGCS 446


>gi|22326716|ref|NP_196638.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|18700103|gb|AAL77663.1| AT5g10770/T30N20_40 [Arabidopsis thaliana]
 gi|24111269|gb|AAN46758.1| At5g10770/T30N20_40 [Arabidopsis thaliana]
 gi|332004211|gb|AED91594.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 474

 Score =  305 bits (780), Expect = 4e-80,   Method: Compositional matrix adjust.
 Identities = 173/398 (43%), Positives = 245/398 (61%), Gaps = 18/398 (4%)

Query: 92  LILDNLHVQYLQSRI-KNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTV 148
           L LD   V  + S++ K + + ++ +  +T++P   G  L + NYI T+ LG    ++++
Sbjct: 88  LRLDQARVNSIHSKLSKKLATDHVSESKSTDLPAKDGSTLGSGNYIVTVGLGTPKNDLSL 147

Query: 149 IVDTGSDLTWVQCQPC-KSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCS 207
           I DTGSDLTW QCQPC ++CY+Q++P+F+PS S SY  V C+S+ C +L  ATGN+G CS
Sbjct: 148 IFDTGSDLTWTQCQPCVRTCYDQKEPIFNPSKSTSYYNVSCSSAACGSLSSATGNAGSCS 207

Query: 208 SSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFI-FGCGRNNKGLFGGVSGLMGL 266
           +S   +C Y + YGD S++ G L +E   L  + V D + FGCG NN+GLF GV+GL+GL
Sbjct: 208 AS---NCIYGIQYGDQSFSVGFLAKEKFTLTNSDVFDGVYFGCGENNQGLFTGVAGLLGL 264

Query: 267 GRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQ 326
           GR  LS  SQT+  +  +FSYCLPS+  A  +G L  G  S+    S   T  + I +  
Sbjct: 265 GRDKLSFPSQTATAYNKIFSYCLPSS--ASYTGHLTFG--SAGISRSVKFTPISTITDG- 319

Query: 327 LATFYILNLTGISIGGKQLQ--ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSG 384
             +FY LN+  I++GG++L   ++ F+  G LIDSGTVITRLPP  Y+AL++ F  + S 
Sbjct: 320 -TSFYGLNIVAITVGGQKLPIPSTVFSTPGALIDSGTVITRLPPKAYAALRSSFKAKMSK 378

Query: 385 FPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALA 444
           +P+  G SILDTCF+LS ++ V IP V   F G A + +   GI Y  K   SQVCLA A
Sbjct: 379 YPTTSGVSILDTCFDLSGFKTVTIPKVAFSFSGGAVVELGSKGIFYVFK--ISQVCLAFA 436

Query: 445 SLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
             S +    I GN QQ+   V+YD    ++GFA   CS
Sbjct: 437 GNSDDSNAAIFGNVQQQTLEVVYDGAGGRVGFAPNGCS 474


>gi|449466304|ref|XP_004150866.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
 gi|449485213|ref|XP_004157102.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 473

 Score =  303 bits (776), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 173/422 (40%), Positives = 245/422 (58%), Gaps = 18/422 (4%)

Query: 68  TLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISG--NIKDVSNTEIPLT 125
           +LE+ H++   G  V         L+ D   V ++ S+I   +     ++    T+IP  
Sbjct: 62  SLEVIHRHGPCGDEVSNAPTAAEMLVKDQSRVDFIHSKIAGELESVDRLRGSKATKIPAK 121

Query: 126 SGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPC-KSCYNQQDPVFDPSISPS 182
           SG  + + NYI ++ LG   + +++I DTGSDLTW QCQPC + CYNQ+DPVF PS S +
Sbjct: 122 SGATIGSGNYIVSVGLGTPKKYLSLIFDTGSDLTWTQCQPCARYCYNQKDPVFVPSQSTT 181

Query: 183 YKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASV 242
           Y  + C+S  C  LE  TGN   CS++    C Y + YGD S++ G   +E L L    V
Sbjct: 182 YSNISCSSPDCSQLESGTGNQPGCSAARA--CIYGIQYGDQSFSVGYFAKETLTLTSTDV 239

Query: 243 -NDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSL 301
             +F+FGCG+NN+GLFG  +GL+GLG+  +S+V QT++ +G +FSYCLP T  +    + 
Sbjct: 240 IENFLFGCGQNNRGLFGSAAGLIGLGQDKISIVKQTAQKYGQVFSYCLPKTSSSTGYLTF 299

Query: 302 ILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL--QASGFAKGGILIDS 359
             GG     K  TPIT  + + N     FY +++ G+ +GG Q+   +S F+  G +IDS
Sbjct: 300 GGGGGGGALKY-TPITKAHGVAN-----FYGVDIVGMKVGGTQIPISSSVFSTSGAIIDS 353

Query: 360 GTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNA 419
           GTVITRLPP  YSALK+ F K  + +P AP  SILDTC++LS Y  + IP V   F+G  
Sbjct: 354 GTVITRLPPDAYSALKSAFEKGMAKYPKAPELSILDTCYDLSKYSTIQIPKVGFVFKGGE 413

Query: 420 EMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGE 479
           E+ +D  GI+Y   +  SQVCLA A         IIGN QQK  +V+YD    ++GF   
Sbjct: 414 ELDLDGIGIMY--GASTSQVCLAFAGNQDPSTVAIIGNVQQKTLQVVYDVGGGKIGFGYN 471

Query: 480 DC 481
            C
Sbjct: 472 GC 473


>gi|356527089|ref|XP_003532146.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 488

 Score =  302 bits (774), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 169/415 (40%), Positives = 257/415 (61%), Gaps = 24/415 (5%)

Query: 68  TLELKHKNYCSGKIVDWNEQQQNR------LILDNLHVQYLQSRI-KNM-ISGNIKDVSN 119
           +LE+ HK+    ++ + + + +++      L  D   V+Y+ SRI KN+    ++ ++ +
Sbjct: 70  SLEVVHKHGPCSQLNNHDGKAKSKTPHSEILNQDKERVKYINSRISKNLGQDSSVSELDS 129

Query: 120 TEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPC-KSCYNQQDPVFD 176
             +P  SG  + + NY   + LG   R++++I DTGSDLTW QC+PC +SCY QQD +FD
Sbjct: 130 VTLPAKSGSLIGSGNYFVVVGLGTPKRDLSLIFDTGSDLTWTQCEPCARSCYKQQDAIFD 189

Query: 177 PSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLG 236
           PS S SY  + C S+ C  L  ATGN   CS+S+   C Y + YGD S++ G   RE L 
Sbjct: 190 PSKSTSYSNITCTSTLCTQLSTATGNEPGCSASTKA-CIYGIQYGDSSFSVGYFSRERLS 248

Query: 237 LGKAS-VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDA 295
           +     V++F+FGCG+NN+GLFGG +GL+GLGR  +S V QT+ ++  +FSYCLP+T  +
Sbjct: 249 VTATDIVDNFLFGCGQNNQGLFGGSAGLIGLGRHPISFVQQTAAVYRKIFSYCLPAT--S 306

Query: 296 GASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL--QASGFAKG 353
            ++G L  G  ++ +   TP +  +     + ++FY L++TGIS+GG +L   +S F+ G
Sbjct: 307 SSTGRLSFGTTTTSYVKYTPFSTIS-----RGSSFYGLDITGISVGGAKLPVSSSTFSTG 361

Query: 354 GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKM 413
           G +IDSGTVITRLPP+ Y+AL++ F +  S +PSA   SILDTC++LS Y+  +IP +  
Sbjct: 362 GAIIDSGTVITRLPPTAYTALRSAFRQGMSKYPSAGELSILDTCYDLSGYEVFSIPKIDF 421

Query: 414 EFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYD 468
            F G   + +   GI+Y   + A QVCLA A+   + +  I GN QQK   V+YD
Sbjct: 422 SFAGGVTVQLPPQGILYV--ASAKQVCLAFAANGDDSDVTIYGNVQQKTIEVVYD 474


>gi|297811185|ref|XP_002873476.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297319313|gb|EFH49735.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 475

 Score =  301 bits (771), Expect = 5e-79,   Method: Compositional matrix adjust.
 Identities = 172/398 (43%), Positives = 244/398 (61%), Gaps = 18/398 (4%)

Query: 92  LILDNLHVQYLQSRI-KNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTV 148
           L LD   V  + S++ K + + ++    +T++P   G  L + NYI T+ LG    ++++
Sbjct: 89  LRLDQARVNSIHSKLSKKLTTNHVSQSQSTDLPAKDGSTLGSGNYIVTVGLGTPKNDLSL 148

Query: 149 IVDTGSDLTWVQCQPC-KSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCS 207
           I DTGSDLTW QCQPC ++CY+Q++P+F+PS S SY  V C+S+ C +L  ATGN+G CS
Sbjct: 149 IFDTGSDLTWTQCQPCVRTCYDQKEPIFNPSKSTSYYNVSCSSAACGSLSSATGNAGSCS 208

Query: 208 SSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFI-FGCGRNNKGLFGGVSGLMGL 266
           +S   +C Y + YGD S++ G L ++   L  + V D + FGCG NN+GLF GV+GL+GL
Sbjct: 209 AS---NCIYGIQYGDQSFSVGFLAKDKFTLTSSDVFDGVYFGCGENNQGLFTGVAGLLGL 265

Query: 267 GRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQ 326
           GR  LS  SQT+  +  +FSYCLPS+  A  +G L  G  S+    S   T  + I +  
Sbjct: 266 GRDKLSFPSQTATAYNKIFSYCLPSS--ASYTGHLTFG--SAGISRSVKFTPISTITDG- 320

Query: 327 LATFYILNLTGISIGGKQLQ--ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSG 384
             +FY LN+  I++GG++L   ++ F+  G LIDSGTVITRLPP  Y+AL++ F  + S 
Sbjct: 321 -TSFYGLNIVAITVGGQKLPIPSTVFSTPGALIDSGTVITRLPPKAYAALRSSFKAKMSK 379

Query: 385 FPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALA 444
           +P+  G SILDTCF+LS ++ V IP V   F G A + +   GI Y  K   SQVCLA A
Sbjct: 380 YPTTSGVSILDTCFDLSGFKTVTIPKVAFSFSGGAVVELGSKGIFYAFK--ISQVCLAFA 437

Query: 445 SLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
             S +    I GN QQ+   V+YD    ++GFA   CS
Sbjct: 438 GNSDDSNAAIFGNVQQQTLEVVYDGAGGRVGFAPNGCS 475


>gi|356524289|ref|XP_003530762.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-2-like [Glycine max]
          Length = 392

 Score =  300 bits (769), Expect = 9e-79,   Method: Compositional matrix adjust.
 Identities = 167/397 (42%), Positives = 246/397 (61%), Gaps = 17/397 (4%)

Query: 94  LDNLHVQYLQSRI-KNMISGN-IKDVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVI 149
           LDN  V+Y+QSR+ KN+   N +KD+ +T +P  SG  + + NY+  + LG   R+++++
Sbjct: 3   LDNERVKYIQSRLSKNLGRENTVKDLDSTTLPAESGSLIGSANYVVVVGLGTPKRDLSLV 62

Query: 150 VDTGSDLTWVQCQPCK-SCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSS 208
            DTGSDLTW QC+PC  SCY QQD +FDPS S SY  + C SS C  L  + G    CSS
Sbjct: 63  FDTGSDLTWTQCEPCAGSCYKQQDAIFDPSKSSSYTNITCTSSLCTQLT-SDGIKSECSS 121

Query: 209 SSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS-VNDFIFGCGRNNKGLFGGVSGLMGLG 267
           S+   C Y   YGD S + G L +E L +     V+DF+FGCG++N+GLF G +GLMGLG
Sbjct: 122 STDASCIYDAKYGDNSTSVGFLSQERLTITATDIVDDFLFGCGQDNEGLFNGSAGLMGLG 181

Query: 268 RSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQL 327
           R  +S+V QTS  +  +FSYCLP+T  + + G L  G +++    +  + YT +      
Sbjct: 182 RHPISIVQQTSSNYNKIFSYCLPAT--SSSLGHLTFGASAAT---NASLIYTPLSTISGD 236

Query: 328 ATFYILNLTGISIGGKQLQA---SGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSG 384
            +FY L++  IS+GG +L A   S F+ GG +IDSGTVITRL P++Y+AL++ F +    
Sbjct: 237 NSFYGLDIVSISVGGTKLPAVSSSTFSAGGSIIDSGTVITRLAPTVYAALRSAFRRXMEK 296

Query: 385 FPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALA 444
           +P A    +LDTC++LS Y+E+++P +  EF G   + +   GI+  V+S+  QVCLA A
Sbjct: 297 YPVANEAGLLDTCYDLSGYKEISVPRIDFEFSGGVTVELXHRGILX-VESE-QQVCLAFA 354

Query: 445 SLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
           +   +++  + GN QQK   V+YD K  ++GF    C
Sbjct: 355 ANGSDNDITVFGNVQQKTLEVVYDVKGGRIGFGAAGC 391


>gi|356524287|ref|XP_003530761.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 481

 Score =  297 bits (760), Expect = 8e-78,   Method: Compositional matrix adjust.
 Identities = 168/397 (42%), Positives = 243/397 (61%), Gaps = 18/397 (4%)

Query: 94  LDNLHVQYLQSRI-KNMISGN-IKDVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVI 149
           LDN  V+Y+QSR+ KN+   N +K++ +T +P  SG  + + NY   + LG   R+++++
Sbjct: 93  LDNERVKYIQSRLSKNLGRENSVKELDSTTLPAKSGSLIGSANYFVVVGLGTPKRDLSLV 152

Query: 150 VDTGSDLTWVQCQPCK-SCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSS 208
            DTGSDLTW QC+PC  SCY QQD +FDPS S SY  + C SS C  L  A G    CSS
Sbjct: 153 FDTGSDLTWTQCEPCAGSCYKQQDAIFDPSKSSSYINITCTSSLCTQLTSA-GIKSRCSS 211

Query: 209 SSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS-VNDFIFGCGRNNKGLFGGVSGLMGLG 267
           S+   C Y + YGD S + G L +E L +     V+DF+FGCG++N+GLF G +GL+GLG
Sbjct: 212 STTA-CIYGIQYGDKSTSVGFLSQERLTITATDIVDDFLFGCGQDNEGLFSGSAGLIGLG 270

Query: 268 RSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQL 327
           R  +S V QTS I+  +FSYCLPST  + + G L  G +++   N   + YT +      
Sbjct: 271 RHPISFVQQTSSIYNKIFSYCLPST--SSSLGHLTFGASAATNAN---LKYTPLSTISGD 325

Query: 328 ATFYILNLTGISIGGKQLQA---SGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSG 384
            TFY L++ GIS+GG +L A   S F+ GG +IDSGTVITRL P+ Y+AL++ F +    
Sbjct: 326 NTFYGLDIVGISVGGTKLPAVSSSTFSAGGSIIDSGTVITRLAPTAYAALRSAFRQGMEK 385

Query: 385 FPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALA 444
           +P A    + DTC++ S Y+E+++P +  EF G   + + + GI+  +   A QVCLA A
Sbjct: 386 YPVANEDGLFDTCYDFSGYKEISVPKIDFEFAGGVTVELPLVGIL--IGRSAQQVCLAFA 443

Query: 445 SLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
           +   +++  I GN QQK   V+YD +  ++GF    C
Sbjct: 444 ANGNDNDITIFGNVQQKTLEVVYDVEGGRIGFGAAGC 480


>gi|255548664|ref|XP_002515388.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223545332|gb|EEF46837.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 494

 Score =  297 bits (760), Expect = 9e-78,   Method: Compositional matrix adjust.
 Identities = 202/479 (42%), Positives = 285/479 (59%), Gaps = 27/479 (5%)

Query: 18  MVSLFLLAKGAHC--FEGKK---KLHLHKLQWQQKSGSSSSCV-SHQKSRIEMGAITLEL 71
            +SL+LL    +C  FEG+K     H H          ++SC  S Q   IE  A  L++
Sbjct: 29  FLSLWLLFSFNNCYAFEGRKFAESQHTHTTIHLTSLLPAASCKPSTQVPSIENKAF-LKV 87

Query: 72  KHKN-YCSGKIVDWNEQQQNRLILDNLHVQYLQSRI-KNMISGNIKDVSNTEIPLTSGIR 129
            HK+  CS        + Q  L+ D   V  + S++ K+    ++K  + T +P   G  
Sbjct: 88  VHKHGPCSDLRQGHKAEAQYILLQDQSRVDSIHSKLSKDSGLSDVKATAATTLPAKDGSI 147

Query: 130 LQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPC-KSCYNQQDPVFDPSISPSYKKV 186
           + + NY  T+ LG   ++ ++I DTGSDLTW QC+PC KSCYNQ++ +F+PS S SY  +
Sbjct: 148 IGSGNYFVTVGLGTPKKDFSLIFDTGSDLTWTQCEPCVKSCYNQKEAIFNPSQSTSYANI 207

Query: 187 LCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASV-NDF 245
            C S+ C +L  ATGN   C+SS+   C Y + YGD S++ G  G+E L L    V NDF
Sbjct: 208 SCGSTLCDSLASATGNIFNCASST---CVYGIQYGDSSFSIGFFGKEKLSLTATDVFNDF 264

Query: 246 IFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGG 305
            FGCG+NNKGLFGG +GL+GLGR  LSLVSQT++ +  +FSYCLPS+  + ++G L  GG
Sbjct: 265 YFGCGQNNKGLFGGAAGLLGLGRDKLSLVSQTAQRYNKIFSYCLPSS--SSSTGFLTFGG 322

Query: 306 NSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASG--FAKGGILIDSGTVI 363
           ++S   + TP+   +       ++FY L+LTGIS+GG++L  S   F+  G +IDSGTVI
Sbjct: 323 STSKSASFTPLATIS-----GGSSFYGLDLTGISVGGRKLAISPSVFSTAGTIIDSGTVI 377

Query: 364 TRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTV 423
           TRLPP+ YSAL + F K  S +P+AP  SILDTCF+ S +  +++P + + F G   + +
Sbjct: 378 TRLPPAAYSALSSTFRKLMSQYPAAPALSILDTCFDFSNHDTISVPKIGLFFSGGVVVDI 437

Query: 424 DVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
           D TGI Y   +D +QVCLA A  S   +  I GN QQK   V+YD    ++GFA   CS
Sbjct: 438 DKTGIFYV--NDLTQVCLAFAGNSDASDVAIFGNVQQKTLEVVYDGAAGRVGFAPAGCS 494


>gi|148905906|gb|ABR16115.1| unknown [Picea sitchensis]
          Length = 482

 Score =  295 bits (754), Expect = 4e-77,   Method: Compositional matrix adjust.
 Identities = 180/483 (37%), Positives = 261/483 (54%), Gaps = 23/483 (4%)

Query: 11  LSLLLPLMVSLFLLAKGAHCFEGKKKLHL---HKLQWQQKSGSSSSCVSHQKSRIEMGAI 67
           +S++  LM+   L+  G+         HL      +W+   G +    S          +
Sbjct: 13  ISVVAVLMLQCLLM--GSSVAPDHDNYHLIPVENFKWKDPQGFAKCPASSAGQEALKPGV 70

Query: 68  TLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQ-SRIKNMISGNIKDVSNTEIPLTS 126
            + L H +     +   N      L+  +      + + I++  SG    +SN  +PL S
Sbjct: 71  KIRLDHIHGACSPLRPINSSSWIDLVSQSFERDNARLNTIRSKNSGPYTTMSN--LPLQS 128

Query: 127 GIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYK 184
           G  + T NYI T   G   +N  +I+DTGSDLTW+QC+PC  CY+Q D +F+P  S SYK
Sbjct: 129 GTTVGTGNYIVTAGFGTPAKNSLLIIDTGSDLTWIQCKPCADCYSQVDAIFEPKQSSSYK 188

Query: 185 KVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVND 244
            + C S+TC  L  +  N   C       C Y ++YGDGS ++G+  +E L LG  S  +
Sbjct: 189 TLPCLSATCTELITSESNPTPCLLGG---CVYEINYGDGSSSQGDFSQETLTLGSDSFQN 245

Query: 245 FIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILG 304
           F FGCG  N GLF G SGL+GLG++ LS  SQ+   +GG F+YCLP    + ++GS  +G
Sbjct: 246 FAFGCGHTNTGLFKGSSGLLGLGQNSLSFPSQSKSKYGGQFAYCLPDFGSSTSTGSFSVG 305

Query: 305 GNSSVFKNSTPIT--YTNMIPNPQLATFYILNLTGISIGGKQLQ--ASGFAKGGILIDSG 360
                 K S P +  +T ++ N    TFY + L GIS+GG +L    +   +G  ++DSG
Sbjct: 306 ------KGSIPASAVFTPLVSNFMYPTFYFVGLNGISVGGDRLSIPPAVLGRGSTIVDSG 359

Query: 361 TVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAE 420
           TVITRL P  Y+ALK  F  +    PSA  FSILDTC++LS + +V IP +   F+ NA+
Sbjct: 360 TVITRLLPQAYNALKTSFRSKTRDLPSAKPFSILDTCYDLSRHSQVRIPTITFHFQNNAD 419

Query: 421 MTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGED 480
           + V   GI+  V++  SQVCLA AS S  D   IIGN+QQ+  RV +DT   ++GFA   
Sbjct: 420 VAVSDVGILVPVQNGGSQVCLAFASASQMDGFNIIGNFQQQRMRVAFDTGAGRIGFASGS 479

Query: 481 CSS 483
           C++
Sbjct: 480 CAA 482


>gi|255548662|ref|XP_002515387.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223545331|gb|EEF46836.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 463

 Score =  292 bits (748), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 182/489 (37%), Positives = 273/489 (55%), Gaps = 42/489 (8%)

Query: 8   LTILSLLLPLMVSLFLLAKGAHCFEGKK---KLHLHKLQWQQKSGSSSSCVSHQKSRIEM 64
           L   S LL L + + L    A  FEG+K   + HL  +   + S    S      +++  
Sbjct: 3   LISFSHLLCLCLVISLSTTYAFGFEGRKIAQENHLQLIHAIEISNLLPSADCEHSTKVAQ 62

Query: 65  GAITLELKHKNYCSGKIVDWNEQQQNR------LILDNLHVQYLQSRIKNMISGNIKDVS 118
              +L++ HK+   G     N+Q  N       L+ D   V  + +++ +     +K+  
Sbjct: 63  NKASLKVVHKH---GPCSQLNQQNGNAPNLVEILLEDQSRVDSIHAKLSDH--SGVKETD 117

Query: 119 NTEIPLTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFD 176
             ++P  SG+ L T NYI +I LG   +++ +I DTGSDLTW +C   ++        FD
Sbjct: 118 AAKLPTKSGMSLGTGNYIVSIGLGSPKKDLMLIFDTGSDLTWARCSAAET--------FD 169

Query: 177 PSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLG 236
           P+ S SY  V C++  C ++  ATGN   C++S+   C Y + YGDGSY+ G LG+E L 
Sbjct: 170 PTKSTSYANVSCSTPLCSSVISATGNPSRCAAST---CVYGIQYGDGSYSIGFLGKERLT 226

Query: 237 LGKASV-NDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDA 295
           +G   + N+F FGCG++  GLFG  +GL+GLGR  LS+VSQT+  +  LFSYCLPS+   
Sbjct: 227 IGSTDIFNNFYFGCGQDVDGLFGKAAGLLGLGRDKLSVVSQTAPKYNQLFSYCLPSSS-- 284

Query: 296 GASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQA--SGFAKG 353
            ++G L  G + S     TP++       P  ++FY L+LTGI++GG++L    S F+  
Sbjct: 285 -STGFLSFGSSQSKSAKFTPLS-----SGP--SSFYNLDLTGITVGGQKLAIPLSVFSTA 336

Query: 354 GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKM 413
           G +IDSGTV+TRLPP+ YSAL++ F K  + +P     SILDTC++ S Y+ + +P + +
Sbjct: 337 GTIIDSGTVVTRLPPAAYSALRSAFRKAMASYPMGKPLSILDTCYDFSKYKTIKVPKIVI 396

Query: 414 EFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQ 473
            F G  ++ VD  GI  FV +   QVCLA A  +   +T I GN QQ+N  V+YD    +
Sbjct: 397 SFSGGVDVDVDQAGI--FVANGLKQVCLAFAGNTGARDTAIFGNTQQRNFEVVYDVSGGK 454

Query: 474 LGFAGEDCS 482
           +GFA   CS
Sbjct: 455 VGFAPASCS 463


>gi|356567196|ref|XP_003551807.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 490

 Score =  288 bits (738), Expect = 3e-75,   Method: Compositional matrix adjust.
 Identities = 166/415 (40%), Positives = 251/415 (60%), Gaps = 23/415 (5%)

Query: 68  TLELKHKNYCSGKIVDWNEQQQNR------LILDNLHVQYLQSRI-KNM-ISGNIKDVSN 119
           +LE+ HK+    ++ D + + ++       L  D   V+Y+ SR+ KN+    +++++ +
Sbjct: 71  SLEVVHKHGPCSQLNDHDGKAKSTTPHSDILNQDKERVKYINSRLSKNLGQDSSVEELDS 130

Query: 120 TEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPC-KSCYNQQDPVFD 176
             +P  SG  + + NY   + LG   R++++I DTGSDLTW QC+PC +SCY QQD +FD
Sbjct: 131 ATLPAKSGSLIGSGNYFVVVGLGTPKRDLSLIFDTGSDLTWTQCEPCARSCYKQQDVIFD 190

Query: 177 PSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLG 236
           PS S SY  + C S+ C  L  ATGN   CS+S+   C Y + YGD S++ G   RE L 
Sbjct: 191 PSKSTSYSNITCTSALCTQLSTATGNDPGCSASTKA-CIYGIQYGDSSFSVGYFSRERLT 249

Query: 237 LGKASVND-FIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDA 295
           +    V D F+FGCG+NN+GLFGG +GL+GLGR  +S V QT+  +  +FSYCLPST  +
Sbjct: 250 VTATDVVDNFLFGCGQNNQGLFGGSAGLIGLGRHPISFVQQTAAKYRKIFSYCLPST--S 307

Query: 296 GASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL--QASGFAKG 353
            ++G L  G  ++       + YT      + ++FY L++T I++GG +L   +S F+ G
Sbjct: 308 SSTGHLSFGPAAT----GRYLKYTPFSTISRGSSFYGLDITAIAVGGVKLPVSSSTFSTG 363

Query: 354 GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKM 413
           G +IDSGTVITRLPP+ Y AL++ F +  S +PSA   SILDTC++LS Y+  +IP ++ 
Sbjct: 364 GAIIDSGTVITRLPPTAYGALRSAFRQGMSKYPSAGELSILDTCYDLSGYKVFSIPTIEF 423

Query: 414 EFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYD 468
            F G   + +   GI++   +   QVCLA A+   + +  I GN QQ+   V+YD
Sbjct: 424 SFAGGVTVKLPPQGILFVASTK--QVCLAFAANGDDSDVTIYGNVQQRTIEVVYD 476


>gi|224092220|ref|XP_002309515.1| predicted protein [Populus trichocarpa]
 gi|222855491|gb|EEE93038.1| predicted protein [Populus trichocarpa]
          Length = 473

 Score =  287 bits (735), Expect = 7e-75,   Method: Compositional matrix adjust.
 Identities = 165/399 (41%), Positives = 239/399 (59%), Gaps = 27/399 (6%)

Query: 89  QNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GRNM 146
           Q+R  +D++H +        + S  +       +P+ SG  + + +Y  T+ LG   +  
Sbjct: 95  QDRHRVDSIHAR--------LSSHGVFQEKQATLPVQSGASIGSGDYAVTVGLGTPKKEF 146

Query: 147 TVIVDTGSDLTWVQCQPC-KSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGV 205
           T+I DTGSDLTW QC+PC K+CY Q++P  DP+ S SYK + C+S+ C  L+   G S  
Sbjct: 147 TLIFDTGSDLTWTQCEPCAKTCYKQKEPRLDPTKSTSYKNISCSSAFCKLLDTEGGES-- 204

Query: 206 CSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASV-NDFIFGCGRNNKGLFGGVSGLM 264
           CSS   P C Y V YGDGSY+ G    E L L  ++V  +F+FGCG+ N GLF G +GL+
Sbjct: 205 CSS---PTCLYQVQYGDGSYSIGFFATETLTLSSSNVFKNFLFGCGQQNSGLFRGAAGLL 261

Query: 265 GLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPN 324
           GLGR+ LSL SQT++ +  LFSYCLP++  + + G L  GG     + S  + +T +  +
Sbjct: 262 GLGRTKLSLPSQTAQKYKKLFSYCLPAS--SSSKGYLSFGG-----QVSKTVKFTPLSED 314

Query: 325 PQLATFYILNLTGISIGGKQL--QASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQF 382
            +   FY L++T +S+GG +L   AS F+  G +IDSGTVITRLP + YSAL + F K  
Sbjct: 315 FKSTPFYGLDITELSVGGNKLSIDASIFSTSGTVIDSGTVITRLPSTAYSALSSAFQKLM 374

Query: 383 SGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLA 442
           + +PS  G+SI DTC++ S  + + IP V + F+G  EM +DV+GI+Y V     +VCLA
Sbjct: 375 TDYPSTDGYSIFDTCYDFSKNETIKIPKVGVSFKGGVEMDIDVSGILYPVNG-LKKVCLA 433

Query: 443 LASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
            A    + +  I GN QQK  +V+YD    ++GFA   C
Sbjct: 434 FAGNGDDVKAAIFGNTQQKTYQVVYDDAKGRVGFAPSGC 472


>gi|326512608|dbj|BAJ99659.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 484

 Score =  286 bits (732), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 161/398 (40%), Positives = 219/398 (55%), Gaps = 26/398 (6%)

Query: 95  DNLHVQYLQSRIKNMISGNIKDVSNTE---IPLTSGIRLQTLNYIATIELG--GRNMTVI 149
           D   V  +  +I    S  +      +   +P   GI L T NY+ ++ LG   R+MTV+
Sbjct: 103 DQARVDSIHRKIAAAASPVLDQARGKKGVTLPAQRGISLGTGNYVVSMGLGTPARDMTVV 162

Query: 150 VDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSS 209
            DTGSDL+WVQC PC  CY Q+DP+FDP+ S +Y  V C S  C  L+     S  CS  
Sbjct: 163 FDTGSDLSWVQCTPCSDCYEQKDPLFDPARSSTYSAVPCASPECQGLD-----SRSCSRD 217

Query: 210 SPPDCNYFVSYGDGSYTRGELGREHLGLGKASV-NDFIFGCGRNNKGLFGGVSGLMGLGR 268
               C Y V YGD S T G L R+ L L ++ V   F+FGCG  + GLFG   GL+GLGR
Sbjct: 218 K--KCRYEVVYGDQSQTDGALARDTLTLTQSDVLPGFVFGCGEQDTGLFGRADGLVGLGR 275

Query: 269 SDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLA 328
             +SL SQ +  +G  FSYCLPS+    A+G L LGG +          +T M       
Sbjct: 276 EKVSLSSQAASKYGAGFSYCLPSSPS--AAGYLSLGGPAPANAR-----FTAMETRHDSP 328

Query: 329 TFYILNLTGISIGGKQLQASG--FAKGGILIDSGTVITRLPPSIYSALKAEFLKQFS--G 384
           +FY + L G+ + G+ ++ S   F+  G +IDSGTVITRLPP +Y+AL++ F +     G
Sbjct: 329 SFYYVRLVGVKVAGRTVRVSPIVFSAAGTVIDSGTVITRLPPRVYAALRSAFARSMGRYG 388

Query: 385 FPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALA 444
           +  AP  SILDTC++ + +  V IP V + F G A + +D +G++Y  K   SQ CLA A
Sbjct: 389 YKRAPALSILDTCYDFTGHTTVRIPSVALVFAGGAAVGLDFSGVLYVAK--VSQACLAFA 446

Query: 445 SLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
                 + GIIGN QQK   V+YD    ++GF    CS
Sbjct: 447 PNGDGADAGIIGNTQQKTLAVVYDVARQKIGFGANGCS 484


>gi|413922067|gb|AFW61999.1| hypothetical protein ZEAMMB73_694403, partial [Zea mays]
          Length = 328

 Score =  285 bits (728), Expect = 5e-74,   Method: Compositional matrix adjust.
 Identities = 154/324 (47%), Positives = 202/324 (62%), Gaps = 20/324 (6%)

Query: 52  SSCVSHQKSRIEMGAIT--LELKHKNYCS--GKIVDWNEQQQNRLILDNLHVQYLQSRIK 107
           SS   H+K+    GA T  LELK  +  +     V  +   +  L  D       Q R  
Sbjct: 9   SSSGEHKKA----GAATSVLELKRHSLTAIPEDPVARDRYLRRLLAADESRANSFQPRRN 64

Query: 108 NMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELGGR------NMTVIVDTGSDLTWVQC 161
              +      ++ E+PLTSGIRLQTLNY+ TI LGG       N+TVIVDTGSDLTWVQC
Sbjct: 65  KDRASASTQSASAEVPLTSGIRLQTLNYVTTISLGGSSGSPAANLTVIVDTGSDLTWVQC 124

Query: 162 QPCKSCYNQQDPVFDPSISPSYKKVLCNSSTC-HALEFATGNSGVCSSSSP--PDCNYFV 218
           +PC +CY Q+DP+FDP+ S +Y  V CN+S C  +L  ATG  G C S+      C Y +
Sbjct: 125 KPCSACYAQRDPLFDPAGSATYAAVRCNASACADSLRAATGTPGSCGSTGAGSEKCYYAL 184

Query: 219 SYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTS 278
           +YGDGS++RG L  + + LG AS+  F+FGCG +N+GLFGG +GLMGLGR++LSLVSQT+
Sbjct: 185 AYGDGSFSRGVLATDTVALGGASLGGFVFGCGLSNRGLFGGTAGLMGLGRTELSLVSQTA 244

Query: 279 EIFGGLFSYCLPSTQDAGASGSLILGGN---SSVFKNSTPITYTNMIPNPQLATFYILNL 335
             +GG+FSYCLP+     ASGSL LGG    +S ++N+TP+ YT MI +P    FY LN+
Sbjct: 245 SRYGGVFSYCLPAATSGDASGSLSLGGGDDAASSYRNTTPVAYTRMIADPAQPPFYFLNV 304

Query: 336 TGISIGGKQLQASGFAKGGILIDS 359
           TG ++GG  L A G     +LIDS
Sbjct: 305 TGAAVGGTALAAQGLGASNVLIDS 328


>gi|414885970|tpg|DAA61984.1| TPA: hypothetical protein ZEAMMB73_915310 [Zea mays]
          Length = 523

 Score =  283 bits (723), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 161/403 (39%), Positives = 226/403 (56%), Gaps = 30/403 (7%)

Query: 86  EQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELGG-- 143
           ++ Q+R+  D++H    +       +G         +P   G+RL T NYI ++ LG   
Sbjct: 145 DRDQDRV--DSIH----RMTAGPWTAGQSSASKGVSLPAHRGLRLGTANYIVSVGLGTPR 198

Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNS 203
           R++ V+ DTGSDL+WVQC+PC +CY Q DP+FDPS S +Y  V C +  C        +S
Sbjct: 199 RDLLVVFDTGSDLSWVQCKPCNNCYKQHDPLFDPSQSTTYSAVPCGAQECL-------DS 251

Query: 204 GVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS--VNDFIFGCGRNNKGLFGGVS 261
           G CSS     C Y V YGD S T G L R+ L LG +S  +  F+FGCG ++ GLFG   
Sbjct: 252 GTCSSGK---CRYEVVYGDMSQTDGNLARDTLTLGPSSDQLQGFVFGCGDDDTGLFGRAD 308

Query: 262 GLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNM 321
           GL GLGR  +SL SQ +  +G  FSYCLPS+    A G L LG  ++         +T M
Sbjct: 309 GLFGLGRDRVSLASQAAARYGAGFSYCLPSSWR--AEGYLSLGSAAA----PPHAQFTAM 362

Query: 322 IPNPQLATFYILNLTGISIGGKQLQASG--FAKGGILIDSGTVITRLPPSIYSALKAEFL 379
           +      +FY L+L GI + G+ ++ +   F   G +IDSGTVITRLP   YSAL++ F 
Sbjct: 363 VTRSDTPSFYYLDLVGIKVAGRTVRVAPAVFKAPGTVIDSGTVITRLPSRAYSALRSSFA 422

Query: 380 KQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV 439
                +  AP  SILDTC++ +   +V IP V + F+G A + +   G++Y   ++ SQ 
Sbjct: 423 GFMRRYKRAPALSILDTCYDFTGRTKVQIPSVALLFDGGATLNLGFGGVLYV--ANRSQA 480

Query: 440 CLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
           CLA AS   +   GI+GN QQK   V+YD  N ++GF  + CS
Sbjct: 481 CLAFASNGDDTSVGILGNMQQKTFAVVYDLANQKIGFGAKGCS 523


>gi|242066142|ref|XP_002454360.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
 gi|241934191|gb|EES07336.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
          Length = 466

 Score =  280 bits (717), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 173/441 (39%), Positives = 240/441 (54%), Gaps = 28/441 (6%)

Query: 52  SSCVSHQKSRIEMGAITLELKHKN-YCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMI 110
           S C   +  +   GA T+ L H++  CS          + RL  D L   Y+Q +     
Sbjct: 43  SVCSESKAVKSSTGAATVPLHHRHGPCSPLPTKKMPTLEERLHRDQLRAAYIQRKFSGGG 102

Query: 111 -------SGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQC 161
                  +G+++  S+  +P T G  L TL Y+ T+ LG  G++ T+++DTGSD++WVQC
Sbjct: 103 VNGSRGGAGDVQQ-SHATVPTTLGTSLDTLEYLITVRLGSPGKSQTMLIDTGSDVSWVQC 161

Query: 162 QPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYG 221
           +PC  C++Q DP+FDPS S +Y    C+S+ C  L    GN   CSSS    C Y V+YG
Sbjct: 162 KPCSQCHSQADPLFDPSSSSTYSPFSCSSAACAQLG-QEGNG--CSSS---QCQYTVTYG 215

Query: 222 DGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIF 281
           DGS T G    + L LG  +V  F FGC     G      GLMGLG    SLVSQT+  F
Sbjct: 216 DGSSTTGTYSSDTLALGSNAVRKFQFGCSNVESGFNDQTDGLMGLGGGAQSLVSQTAGTF 275

Query: 282 GGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIG 341
           G  FSYCLP+T  + +SG L LG  +S F        T M+ + Q+ TFY + +  I +G
Sbjct: 276 GAAFSYCLPAT--SSSSGFLTLGAGTSGFVK------TPMLRSSQVPTFYGVRIQAIRVG 327

Query: 342 GKQLQ-ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNL 400
           G+QL   +     G ++DSGTV+TRLPP+ YSAL + F      +PSAP   ILDTCF+ 
Sbjct: 328 GRQLSIPTSVFSAGTIMDSGTVLTRLPPTAYSALSSAFKAGMKQYPSAPPSGILDTCFDF 387

Query: 401 SAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQ 460
           S    V+IP V + F G A + +   GI+  +++  S +CLA A+ S +   GIIGN QQ
Sbjct: 388 SGQSSVSIPTVALVFSGGAVVDIASDGIM--LQTSNSILCLAFAANSDDSSLGIIGNVQQ 445

Query: 461 KNQRVIYDTKNSQLGFAGEDC 481
           +   V+YD     +GF    C
Sbjct: 446 RTFEVLYDVGGGAVGFKAGAC 466


>gi|224092218|ref|XP_002309514.1| predicted protein [Populus trichocarpa]
 gi|222855490|gb|EEE93037.1| predicted protein [Populus trichocarpa]
          Length = 474

 Score =  280 bits (716), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 185/490 (37%), Positives = 274/490 (55%), Gaps = 27/490 (5%)

Query: 1   MVTKVKPLTILSLLLPLMVSLFLLAKGAHCFEGKK--KLHLHKLQWQQKSGSSSSCVSHQ 58
           +++ +K    + + L  +  L  L KG +  E  +  K ++H L+      S S     Q
Sbjct: 4   LISSIKFTGFIYVFLLFLCPLCSLKKG-YAVEANEHIKKYVHTLEVNSLLASDSC---DQ 59

Query: 59  KSRIEMGAITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVS 118
            S++   A +L++ HK     ++++ +      L+ D L V  +Q+R+  +    I +  
Sbjct: 60  SSKVIDKASSLQVLHKYGPCMQVLN-DRSHVEFLLQDQLRVDSIQARLSKISGHGIFEEM 118

Query: 119 NTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPC-KSCYNQQDPVF 175
            T++P  SGI + T NY+ T+ LG    + T++ DTGS +TW QCQPC  SCY Q++  F
Sbjct: 119 VTKLPAQSGIAIGTGNYVVTVGLGTPKEDFTLVFDTGSGITWTQCQPCLGSCYPQKEQKF 178

Query: 176 DPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHL 235
           DP+ S SY  V C+S++C+ L   T   G  +S+S   C Y + YGD SY++G    E L
Sbjct: 179 DPTKSTSYNNVSCSSASCNLL--PTSERGCSASNS--TCLYQIIYGDQSYSQGFFATETL 234

Query: 236 GLGKASV-NDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQD 294
            +  + V  +F+FGCG++N GLFG  +GL+GL  S +SL SQT+E +   FSYCLPST  
Sbjct: 235 TISSSDVFTNFLFGCGQSNNGLFGQAAGLLGLSSSSVSLPSQTAEKYQKQFSYCLPST-- 292

Query: 295 AGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL--QASGFAK 352
             ++G L  GG  S     TPI+       P  ++FY +++ GIS+ G QL    S F  
Sbjct: 293 PSSTGYLNFGGKVSQTAGFTPIS-------PAFSSFYGIDIVGISVAGSQLPIDPSIFTT 345

Query: 353 GGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVK 412
            G +IDSGTVITRLPP+ Y ALK  F ++ S +P   G  +LDTC++ S Y  V+ P V 
Sbjct: 346 SGAIIDSGTVITRLPPTAYKALKEAFDEKMSNYPKTNGDELLDTCYDFSNYTTVSFPKVS 405

Query: 413 MEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNS 472
           + F+G  E+ +D +GI+Y V      VCLA A+   + E GI GN+QQK   V+YD    
Sbjct: 406 VSFKGGVEVDIDASGILYLVNG-VKMVCLAFAANKDDSEFGIFGNHQQKTYEVVYDGAKG 464

Query: 473 QLGFAGEDCS 482
            +GFA   CS
Sbjct: 465 MIGFAAGACS 474


>gi|145693992|gb|ABP93696.1| unknown protein isoform 1 [Lemna minor]
          Length = 350

 Score =  278 bits (712), Expect = 3e-72,   Method: Compositional matrix adjust.
 Identities = 163/367 (44%), Positives = 217/367 (59%), Gaps = 25/367 (6%)

Query: 121 EIPLTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCK-SCYNQQDPVFDP 177
            IP   G+ + T NY+ T+  G   +N TVI DTGS++ W+QC+PC  SCY QQ+P+FDP
Sbjct: 2   SIPARIGLYIGTANYVITVGFGTPKKNQTVIFDTGSNVNWIQCKPCVVSCYPQQEPLFDP 61

Query: 178 SISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL 237
           ++S +Y+ + C S+ C  L      S  CS S+   C Y V+YGDGS T G L  E   L
Sbjct: 62  TLSSTYRNISCTSAACTGLS-----SRGCSGST---CVYGVTYGDGSSTVGFLATETFTL 113

Query: 238 GKASV-NDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAG 296
              +V N+FIFGCG+NN+GLF G +GL+GLGRS  SL SQ +   G +FSYCLPST    
Sbjct: 114 AAGNVFNNFIFGCGQNNQGLFTGAAGLIGLGRSPYSLNSQLATSLGNIFSYCLPSTSS-- 171

Query: 297 ASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASG--FAKGG 354
           A+G L +G         TP  YT M+ N +  T Y ++L GIS+GG +L  S   F   G
Sbjct: 172 ATGYLNIGN-----PLRTP-GYTAMLTNSRAPTLYFIDLIGISVGGTRLALSSTVFQSVG 225

Query: 355 ILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKME 414
            +IDSGTVITRLPP+ Y AL+  F    + +  A   SILDTC++ S    V  P +K+ 
Sbjct: 226 TIIDSGTVITRLPPTAYGALRTAFRAAMTQYTRAAAASILDTCYDFSRTTTVTFPTIKLH 285

Query: 415 FEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQL 474
           + G  ++T+   G+ Y + S  SQVCLA A  S   + GIIGN QQ+   V YD    ++
Sbjct: 286 YTG-LDVTIPGAGVFYVISS--SQVCLAFAGNSDSTQIGIIGNVQQRTMEVTYDNALKRI 342

Query: 475 GFAGEDC 481
           GFA   C
Sbjct: 343 GFAAGAC 349


>gi|242045120|ref|XP_002460431.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
 gi|241923808|gb|EER96952.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
          Length = 481

 Score =  278 bits (710), Expect = 7e-72,   Method: Compositional matrix adjust.
 Identities = 163/411 (39%), Positives = 233/411 (56%), Gaps = 35/411 (8%)

Query: 86  EQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--G 143
           ++ Q+R+  D++H +   +R  +             +P   G+ L T NYI ++ LG   
Sbjct: 92  DRDQDRV--DSIH-RLAAARPSSTADDPSSASKGVSLPARRGVPLGTANYIVSVGLGTPK 148

Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNS 203
           R++ V+ DTGSDL+WVQC+PC  CY Q DP+FDPS S +Y  V C +  C  L+     S
Sbjct: 149 RDLLVVFDTGSDLSWVQCKPCDGCYQQHDPLFDPSQSTTYSAVPCGAQECRRLD-----S 203

Query: 204 GVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS-------VNDFIFGCGRNNKGL 256
           G CSS     C Y V YGD S T G L R+ L LG +S       + +F+FGCG ++ GL
Sbjct: 204 GSCSSGK---CRYEVVYGDMSQTDGNLARDTLTLGPSSSSSSSDQLQEFVFGCGDDDTGL 260

Query: 257 FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPI 316
           FG   GL GLGR  +SL SQ +  +G  FSYCLPS+  + A G L LG  S+   N+   
Sbjct: 261 FGKADGLFGLGRDRVSLASQAAAKYGAGFSYCLPSS--STAEGYLSLG--SAAPPNA--- 313

Query: 317 TYTNMIPNPQLATFYILNLTGISIGGKQLQASG--FAKGGILIDSGTVITRLPPSIYSAL 374
            +T M+      +FY LNL GI + G+ ++ S   F   G +IDSGTVITRLP   Y+AL
Sbjct: 314 RFTAMVTRSDTPSFYYLNLVGIKVAGRTVRVSPAVFRTPGTVIDSGTVITRLPSRAYAAL 373

Query: 375 KAEF---LKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYF 431
           ++ F   ++++S +  AP  SILDTC++ +   +V IP V + F+G A + +    ++Y 
Sbjct: 374 RSSFAGLMRRYS-YKRAPALSILDTCYDFTGRNKVQIPSVALLFDGGATLNLGFGEVLYV 432

Query: 432 VKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
             ++ SQ CLA AS   +    I+GN QQK   V+YD  N ++GF  + CS
Sbjct: 433 --ANKSQACLAFASNGDDTSIAILGNMQQKTFAVVYDVANQKIGFGAKGCS 481


>gi|125564143|gb|EAZ09523.1| hypothetical protein OsI_31798 [Oryza sativa Indica Group]
          Length = 485

 Score =  277 bits (708), Expect = 9e-72,   Method: Compositional matrix adjust.
 Identities = 162/404 (40%), Positives = 228/404 (56%), Gaps = 26/404 (6%)

Query: 86  EQQQNRLILDNLHVQYLQSR-IKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG-- 142
           E+ Q R+  D++H +   +    +++           +P   GI L T NY+ ++ LG  
Sbjct: 101 ERDQARV--DSIHRKVAGAGGAPSVVDPARASEQGVSLPAQRGISLGTGNYVVSVGLGTP 158

Query: 143 GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGN 202
            +   VI DTGSDL+WVQC+PC  CY QQDP+FDPS+S +Y  V C +  C  L+ A+G 
Sbjct: 159 AKQYAVIFDTGSDLSWVQCKPCADCYEQQDPLFDPSLSSTYAAVACGAPECQELD-ASG- 216

Query: 203 SGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCGRNNKGLFGGVS 261
              CSS S   C Y V YGD S T G L R+ L L  + ++  F+FGCG  N GLFG V 
Sbjct: 217 ---CSSDS--RCRYEVQYGDQSQTDGNLVRDTLTLSASDTLPGFVFGCGDQNAGLFGQVD 271

Query: 262 GLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNM 321
           GL GLGR  +SL SQ +  +G  F+YCLPS+      G L LGG        T +     
Sbjct: 272 GLFGLGREKVSLPSQGAPSYGPGFTYCLPSSSSG--RGYLSLGGAPPANAQFTALA---- 325

Query: 322 IPNPQLATFYILNLTGISIGGKQLQ---ASGFAKGGILIDSGTVITRLPPSIYSALKAEF 378
             +    +FY ++L GI +GG+ ++    +  A GG +IDSGTVITRLPP  Y+ L+A F
Sbjct: 326 --DGATPSFYYIDLVGIKVGGRAIRIPATAFAAAGGTVIDSGTVITRLPPRAYAPLRAAF 383

Query: 379 LKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQ 438
            +  + +  AP  SILDTC++ + ++   IP V++ F G A +++D TG++Y  K   SQ
Sbjct: 384 ARSMAQYKKAPALSILDTCYDFTGHRTAQIPTVELAFAGGATVSLDFTGVLYVSK--VSQ 441

Query: 439 VCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
            CLA A  + +    I+GN QQK   V YD  N ++GF  + CS
Sbjct: 442 ACLAFAPNADDSSIAILGNTQQKTFAVTYDVANQRIGFGAKGCS 485


>gi|115479815|ref|NP_001063501.1| Os09g0482200 [Oryza sativa Japonica Group]
 gi|50725878|dbj|BAD33407.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|50725881|dbj|BAD33410.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|113631734|dbj|BAF25415.1| Os09g0482200 [Oryza sativa Japonica Group]
 gi|125606112|gb|EAZ45148.1| hypothetical protein OsJ_29786 [Oryza sativa Japonica Group]
          Length = 485

 Score =  277 bits (708), Expect = 9e-72,   Method: Compositional matrix adjust.
 Identities = 162/404 (40%), Positives = 228/404 (56%), Gaps = 26/404 (6%)

Query: 86  EQQQNRLILDNLHVQYLQSR-IKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG-- 142
           E+ Q R+  D++H +   +    +++           +P   GI L T NY+ ++ LG  
Sbjct: 101 ERDQARV--DSIHRKVAGAGGAPSVVDPARASEQGVSLPAQRGISLGTGNYVVSVGLGTP 158

Query: 143 GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGN 202
            +   VI DTGSDL+WVQC+PC  CY QQDP+FDPS+S +Y  V C +  C  L+ A+G 
Sbjct: 159 AKQYAVIFDTGSDLSWVQCKPCADCYEQQDPLFDPSLSSTYAAVACGAPECQELD-ASG- 216

Query: 203 SGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCGRNNKGLFGGVS 261
              CSS S   C Y V YGD S T G L R+ L L  + ++  F+FGCG  N GLFG V 
Sbjct: 217 ---CSSDS--RCRYEVQYGDQSQTDGNLVRDTLTLSASDTLPGFVFGCGDQNAGLFGQVD 271

Query: 262 GLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNM 321
           GL GLGR  +SL SQ +  +G  F+YCLPS+      G L LGG        T +     
Sbjct: 272 GLFGLGREKVSLPSQGAPSYGPGFTYCLPSSSSG--RGYLSLGGAPPANAQFTALA---- 325

Query: 322 IPNPQLATFYILNLTGISIGGKQLQ---ASGFAKGGILIDSGTVITRLPPSIYSALKAEF 378
             +    +FY ++L GI +GG+ ++    +  A GG +IDSGTVITRLPP  Y+ L+A F
Sbjct: 326 --DGATPSFYYIDLVGIKVGGRAIRIPATAFAAAGGTVIDSGTVITRLPPRAYAPLRAAF 383

Query: 379 LKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQ 438
            +  + +  AP  SILDTC++ + ++   IP V++ F G A +++D TG++Y  K   SQ
Sbjct: 384 ARSMAQYKKAPALSILDTCYDFTGHRTAQIPTVELAFAGGATVSLDFTGVLYVSK--VSQ 441

Query: 439 VCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
            CLA A  + +    I+GN QQK   V YD  N ++GF  + CS
Sbjct: 442 ACLAFAPNADDSSIAILGNTQQKTFAVAYDVANQRIGFGAKGCS 485


>gi|413943688|gb|AFW76337.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
 gi|413943689|gb|AFW76338.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
          Length = 499

 Score =  277 bits (708), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 157/371 (42%), Positives = 222/371 (59%), Gaps = 28/371 (7%)

Query: 121 EIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKS-CYNQQDPVFDP 177
           ++P + G+ L T NY+  + LG      TV+ DTGSD TWVQCQPC + CY Q++P+FDP
Sbjct: 147 DLPASYGVALGTGNYVVPVRLGTPAERFTVVFDTGSDTTWVQCQPCVAYCYRQKEPLFDP 206

Query: 178 SISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL 237
           + S +Y  + C+SS C  L + +G SG         C Y + YGDGSYT G   ++ L L
Sbjct: 207 TKSATYANISCSSSYCSDL-YVSGCSGG-------HCLYGIQYGDGSYTIGFYAQDTLTL 258

Query: 238 GKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGA 297
              ++ +F FGCG  N+GLFG  +GL+GLGR   SL  Q  + +GG+F+YCLP+T  +  
Sbjct: 259 AYDTIKNFRFGCGEKNRGLFGRAAGLLGLGRGKTSLPVQAYDKYGGVFAYCLPAT--SAG 316

Query: 298 SGSLILG-GNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASG--FAKGG 354
           +G L LG G  +     TP+         +  TFY + +TGI +GG  L   G  F+  G
Sbjct: 317 TGFLDLGPGAPAANARLTPMLVD------RGPTFYYVGMTGIKVGGHVLPIPGSVFSTAG 370

Query: 355 ILIDSGTVITRLPPSIYSALKAEFLK--QFSGFPSAPGFSILDTCFNLSAYQ--EVNIPL 410
            L+DSGTVITRLPPS Y+ L++ F K  Q  G+ +AP FSILDTC++L+ ++   + +P 
Sbjct: 371 TLVDSGTVITRLPPSAYAPLRSAFSKAMQGLGYSAAPAFSILDTCYDLTGHKGGSIALPA 430

Query: 411 VKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTK 470
           V + F+G A + VD +GI+Y   +D SQ CLA A  + + +  I+GN QQK   V+YD  
Sbjct: 431 VSLVFQGGACLDVDASGILYV--ADVSQACLAFAPNADDTDVAIVGNTQQKTHGVLYDIG 488

Query: 471 NSQLGFAGEDC 481
              +GFA   C
Sbjct: 489 KKIVGFAPGAC 499


>gi|125540927|gb|EAY87322.1| hypothetical protein OsI_08726 [Oryza sativa Indica Group]
          Length = 464

 Score =  277 bits (708), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 175/467 (37%), Positives = 249/467 (53%), Gaps = 49/467 (10%)

Query: 34  KKKLHLHKLQWQQKSGSSSSCVSHQKSRIEMGAITLELKHKN---YCSGKIVDWNEQQQ- 89
           + +LH+    W          VS    R    A+ L L H++     +GK          
Sbjct: 28  RHRLHIQLRDWDSLR------VSAASPRNGTSAV-LRLTHRHGPCAPAGKASALGSPPSF 80

Query: 90  -NRLILDNLHVQYLQSRIKNMISG----NIKDVSNTEIPLTSGIRLQTLNYIATIELGGR 144
            + L  D    +Y+Q R+    +      +       +P   G  + TL Y+ T+ LG  
Sbjct: 81  LDTLRADQRRAEYIQRRVSGAAAAAPGMQLAGSKAATVPANLGFSIGTLQYVVTVSLGTP 140

Query: 145 NM--TVIVDTGSDLTWVQCQPCKS--CYNQQDPVFDPSISPSYKKVLCNSSTCHALE-FA 199
            +  T+ VDTGSD++WVQC+PC S  CY+Q+DP+FDP+ S SY  V C +++C  L  ++
Sbjct: 141 AVAQTLEVDTGSDVSWVQCKPCPSPPCYSQRDPLFDPTRSSSYSAVPCAAASCSQLALYS 200

Query: 200 TGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL-GKASVNDFIFGCGRNNKGLFG 258
            G SG         C Y VSYGDGS T G    + L L G  ++  F+FGCG   +GLF 
Sbjct: 201 NGCSGG-------QCGYVVSYGDGSTTTGVYSSDTLTLTGSNALKGFLFGCGHAQQGLFA 253

Query: 259 GVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITY 318
           GV GL+GLGR   SLVSQ S  +GG+FSYCLP TQ+  + G + LGG SS    ST    
Sbjct: 254 GVDGLLGLGRQGQSLVSQASSTYGGVFSYCLPPTQN--SVGYISLGGPSSTAGFST---- 307

Query: 319 TNMIPNPQLATFYILNLTGISIGGKQL--QASGFAKGGILIDSGTVITRLPPSIYSALKA 376
           T ++      T+YI+ L GIS+GG+ L   AS FA G + +D+GTV+TRLPP+ YSAL++
Sbjct: 308 TPLLTASNDPTYYIVMLAGISVGGQPLSIDASVFASGAV-VDTGTVVTRLPPTAYSALRS 366

Query: 377 EFLKQFS--GFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKS 434
            F    +  G+PSAP   ILDTC++ + Y  V +P + + F G A M +  +GI+     
Sbjct: 367 AFRAAMAPYGYPSAPATGILDTCYDFTRYGTVTLPTISIAFGGGAAMDLGTSGIL----- 421

Query: 435 DASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
             +  CLA A    + +  I+GN QQ++  V +D   S +GF    C
Sbjct: 422 --TSGCLAFAPTGGDSQASILGNVQQRSFEVRFD--GSTVGFMPASC 464


>gi|242093566|ref|XP_002437273.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
 gi|241915496|gb|EER88640.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
          Length = 503

 Score =  276 bits (707), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 171/476 (35%), Positives = 248/476 (52%), Gaps = 60/476 (12%)

Query: 48  SGSSSSCVSHQKSRIEMGAIT-LELKHKNYCSGKIVDWNEQQQNR-----LILDNLHVQY 101
           S +++SC + ++ R E G  T + + H++     + D    ++       L+ D   V+Y
Sbjct: 46  SAAAASCHTPEQ-RPEAGTATRMPIVHQHGPCSPLADDKHGKKAPSHTEILVADQRRVEY 104

Query: 102 LQSRIKNMISGNIKDVSNTE----------------------------IPLTSGIRLQTL 133
           +  R+    +G ++   ++                             +P  SG+ L T 
Sbjct: 105 IHRRVSE-TTGRVRRQKHSAPVVELRPGTPSSTRSSSSSLSSSATSTNLPAKSGLSLNTG 163

Query: 134 NYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKS-CYNQQDPVFDPSISPSYKKVLCNS 190
           NY+  I LG      TV+ DTGSD TWVQCQPC + CY Q++P+F P+ S +Y  + C S
Sbjct: 164 NYVVPIRLGTPAARFTVVFDTGSDTTWVQCQPCVAYCYQQKEPLFTPTKSATYANISCTS 223

Query: 191 STCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCG 250
           S C  L+    + G         C Y V YGDGSYT G   ++ L LG  +V DF FGCG
Sbjct: 224 SYCSDLDTRGCSGG--------HCLYAVQYGDGSYTVGFYAQDTLTLGYDTVKDFRFGCG 275

Query: 251 RNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVF 310
             N+GLFG  +GLMGLGR   S+  Q  + + G+F+YC+P+T           G  ++  
Sbjct: 276 EKNRGLFGKAAGLMGLGRGKTSVPVQAYDKYSGVFAYCIPATSSGTGFLDFGPGAPAAAN 335

Query: 311 KNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ--ASGFAKGGILIDSGTVITRLPP 368
              TP+   N        TFY + +TGI +GG  L   A+ F+  G L+DSGTVITRLPP
Sbjct: 336 ARLTPMLVDNG------PTFYYVGMTGIKVGGHLLSIPATVFSDAGALVDSGTVITRLPP 389

Query: 369 SIYSALKAEFLKQFS--GFPSAPGFSILDTCFNLSAYQ-EVNIPLVKMEFEGNAEMTVDV 425
           S Y  L++ F K     G+ +AP FSILDTC++L+ YQ  + +P V + F+G A + VD 
Sbjct: 390 SAYEPLRSAFAKGMEGLGYKTAPAFSILDTCYDLTGYQGSIALPAVSLVFQGGACLDVDA 449

Query: 426 TGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
           +GI+Y   +D SQ CLA A+   + +  I+GN QQK   V+YD     +GFA   C
Sbjct: 450 SGILYV--ADVSQACLAFAANDDDTDMTIVGNTQQKTYSVLYDLGKKVVGFAPGAC 503


>gi|224142001|ref|XP_002324349.1| predicted protein [Populus trichocarpa]
 gi|222865783|gb|EEF02914.1| predicted protein [Populus trichocarpa]
          Length = 490

 Score =  276 bits (706), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 179/400 (44%), Positives = 249/400 (62%), Gaps = 21/400 (5%)

Query: 92  LILDNLHVQYLQSRIKNMISGNIKDV---SNTEIPLTSGIRLQTLNYIATIELG--GRNM 146
           L+ D   V+ + SR+ N  +   KDV    +T IP   G  + + NYI T+ LG   +++
Sbjct: 103 LLQDQSRVKSIHSRLSNSKTSGGKDVKVTDSTTIPAKDGSTVGSGNYIVTVGLGTPKKDL 162

Query: 147 TVIVDTGSDLTWVQCQPC-KSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGV 205
           ++I DTGSD+TW QCQPC +SCY Q++ +FDPS S SY  + C+SS C++L  ATGN+  
Sbjct: 163 SLIFDTGSDITWTQCQPCARSCYKQKEQIFDPSQSTSYTNISCSSSICNSLTSATGNTPG 222

Query: 206 CSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCGRNNKGLFGGVSGLM 264
           C+SS+   C Y + YGD S++ G  G E L L    + N+  FGCG+NN+GLFGG +GL+
Sbjct: 223 CASSA---CVYGIQYGDSSFSVGFFGTEKLTLTSTDAFNNIYFGCGQNNQGLFGGSAGLL 279

Query: 265 GLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPN 324
           GLGR  LS+VSQT++ +  +FSYCLPS+  +  +G L  GG++S     TP++  +  P 
Sbjct: 280 GLGRDKLSVVSQTAQKYNKIFSYCLPSSSSS--TGFLTFGGSASKNAKFTPLSTISAGP- 336

Query: 325 PQLATFYILNLTGISIGGKQL--QASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQF 382
               +FY L+ TGIS+GGK+L   AS F+  G +IDSGTVITRLPP+ YSAL+A F    
Sbjct: 337 ----SFYGLDFTGISVGGKKLAISASVFSTAGAIIDSGTVITRLPPAAYSALRASFRNLM 392

Query: 383 SGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLA 442
           S +P     SILDTC++ S+Y  +++P +   F    E+ +D TGI+Y   S  SQVCLA
Sbjct: 393 SKYPMTKALSILDTCYDFSSYTTISVPKIGFSFSSGIEVDIDATGILY--ASSLSQVCLA 450

Query: 443 LASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
            A  S   +  I GN QQK   V YD    ++GFA   CS
Sbjct: 451 FAGNSDATDVFIFGNVQQKTLEVFYDGSAGKVGFAPGGCS 490


>gi|293332735|ref|NP_001168472.1| uncharacterized protein LOC100382248 [Zea mays]
 gi|223948487|gb|ACN28327.1| unknown [Zea mays]
          Length = 434

 Score =  276 bits (706), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 157/371 (42%), Positives = 222/371 (59%), Gaps = 28/371 (7%)

Query: 121 EIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKS-CYNQQDPVFDP 177
           ++P + G+ L T NY+  + LG      TV+ DTGSD TWVQCQPC + CY Q++P+FDP
Sbjct: 82  DLPASYGVALGTGNYVVPVRLGTPAERFTVVFDTGSDTTWVQCQPCVAYCYRQKEPLFDP 141

Query: 178 SISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL 237
           + S +Y  + C+SS C  L + +G SG         C Y + YGDGSYT G   ++ L L
Sbjct: 142 TKSATYANISCSSSYCSDL-YVSGCSGG-------HCLYGIQYGDGSYTIGFYAQDTLTL 193

Query: 238 GKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGA 297
              ++ +F FGCG  N+GLFG  +GL+GLGR   SL  Q  + +GG+F+YCLP+T  +  
Sbjct: 194 AYDTIKNFRFGCGEKNRGLFGRAAGLLGLGRGKTSLPVQAYDKYGGVFAYCLPAT--SAG 251

Query: 298 SGSLILG-GNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASG--FAKGG 354
           +G L LG G  +     TP+         +  TFY + +TGI +GG  L   G  F+  G
Sbjct: 252 TGFLDLGPGAPAANARLTPMLVD------RGPTFYYVGMTGIKVGGHVLPIPGSVFSTAG 305

Query: 355 ILIDSGTVITRLPPSIYSALKAEFLK--QFSGFPSAPGFSILDTCFNLSAYQ--EVNIPL 410
            L+DSGTVITRLPPS Y+ L++ F K  Q  G+ +AP FSILDTC++L+ ++   + +P 
Sbjct: 306 TLVDSGTVITRLPPSAYAPLRSAFSKAMQGLGYSAAPAFSILDTCYDLTGHKGGSIALPA 365

Query: 411 VKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTK 470
           V + F+G A + VD +GI+Y   +D SQ CLA A  + + +  I+GN QQK   V+YD  
Sbjct: 366 VSLVFQGGACLDVDASGILYV--ADVSQACLAFAPNADDTDVAIVGNTQQKTHGVLYDIG 423

Query: 471 NSQLGFAGEDC 481
              +GFA   C
Sbjct: 424 KKIVGFAPGAC 434


>gi|296082170|emb|CBI21175.3| unnamed protein product [Vitis vinifera]
          Length = 386

 Score =  275 bits (704), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 170/437 (38%), Positives = 235/437 (53%), Gaps = 60/437 (13%)

Query: 51  SSSCVSHQKSRIEMGAITLELKHKNYCSGKIVDWNEQQQNRLIL-DNLHVQYLQSRI-KN 108
           SS+C    K   +  ++ +  KH      +    N     +++  D   V  +QSR+ KN
Sbjct: 3   SSACSPSPKGHDQRASLEVVHKHGPCSKLRPHKANSPSHTQILAQDESRVASIQSRLAKN 62

Query: 109 MISGNIKDVSNTEIPLTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKS 166
           +  G+    S   +P  S   L + NY+ T+ LG   R++T I DTGSDLTW QC+PC  
Sbjct: 63  LAGGSNLKASKATLPSKSASTLGSGNYVVTVGLGSPKRDLTFIFDTGSDLTWTQCEPCVG 122

Query: 167 -CYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSY 225
            CY Q++ +FDPS S SY  V C+S +C  LE ATGNS  CSSS+   C Y + YGDGSY
Sbjct: 123 YCYQQREHIFDPSTSLSYSNVSCDSPSCEKLESATGNSPGCSSST---CLYGIRYGDGSY 179

Query: 226 TRGELGREHLGLGKASV-NDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGL 284
           + G   RE L L    V N+F FGCG+NN+GLFGG +GL+GL R+ LSLVSQT++ +G +
Sbjct: 180 SIGFFAREKLSLTSTDVFNNFQFGCGQNNRGLFGGTAGLLGLARNPLSLVSQTAQKYGKV 239

Query: 285 FSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQ 344
           FSYCLPS+  + ++G L  G       +S  + +T                         
Sbjct: 240 FSYCLPSS--SSSTGYLSFGSGDG---DSKAVKFT------------------------- 269

Query: 345 LQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQ 404
                               RLPP++YS+++  F +  S +P   G SILDTC++LS Y+
Sbjct: 270 -------------------PRLPPTVYSSVQKVFRELMSDYPRVKGVSILDTCYDLSKYK 310

Query: 405 EVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQR 464
            V +P + + F G AEM +   GI+Y +K   SQVCLA A  S +DE  IIGN QQK   
Sbjct: 311 TVKVPKIILYFSGGAEMDLAPEGIIYVLK--VSQVCLAFAGNSDDDEVAIIGNVQQKTIH 368

Query: 465 VIYDTKNSQLGFAGEDC 481
           V+YD    ++GFA   C
Sbjct: 369 VVYDDAEGRVGFAPSGC 385


>gi|413923780|gb|AFW63712.1| hypothetical protein ZEAMMB73_689747 [Zea mays]
          Length = 470

 Score =  275 bits (703), Expect = 4e-71,   Method: Compositional matrix adjust.
 Identities = 163/402 (40%), Positives = 224/402 (55%), Gaps = 31/402 (7%)

Query: 90  NRLILDNLHVQYLQSRIKNMISGNIKD--VSNTEIPLTSGIRLQTLNYIATIELG--GRN 145
           + L  D    +++  R+    +  + D   +   +P   G  + T NY+ T  LG  G  
Sbjct: 90  DTLRADQRRAEHILRRVSGRGAPQLWDYKAAAATVPANWGYDIGTSNYVVTASLGTPGMA 149

Query: 146 MTVIVDTGSDLTWVQCQPCK--SCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNS 203
            T+ VDTGSDL+WVQC+PC   SCY Q+DP+FDP+ S SY  V C  S C  L      +
Sbjct: 150 QTLEVDTGSDLSWVQCKPCAAPSCYRQKDPLFDPAQSSSYAAVPCGRSACAGLGI---YA 206

Query: 204 GVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGK-ASVNDFIFGCGR-NNKGLFGGVS 261
             CS++    C Y VSYGDGS T G    + L L   A+V  F+FGCG   + GLF G+ 
Sbjct: 207 SACSAA---QCGYVVSYGDGSNTTGVYSSDTLTLAANATVQGFLFGCGHAQSGGLFTGID 263

Query: 262 GLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNM 321
           GL+G GR   SLV QT+  +GG+FSYCLP+      +G L LGG S V       + T +
Sbjct: 264 GLLGFGREQPSLVQQTAGAYGGVFSYCLPTKSST--TGYLTLGGPSGVAPG---FSTTQL 318

Query: 322 IPNPQLATFYILNLTGISIGGKQLQ--ASGFAKGGILIDSGTVITRLPPSIYSALKAEFL 379
           +P+P   T+Y++ LTGIS+GG+ L   AS FA G  ++D+GTVITRLPP+ Y+AL++ F 
Sbjct: 319 LPSPNAPTYYVVMLTGISVGGQPLSVPASAFAAG-TVVDTGTVITRLPPAAYAALRSAFR 377

Query: 380 KQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV 439
              + +PSAP   ILDTC++ + Y  VN+  V + F   A MT+   GI+ F        
Sbjct: 378 SGMASYPSAPPIGILDTCYSFAGYGTVNLTSVALTFSSGATMTLGADGIMSF-------G 430

Query: 440 CLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
           CLA AS   +    I+GN QQ++  V  D   S +GF    C
Sbjct: 431 CLAFASSGSDGSMAILGNVQQRSFEVRID--GSSVGFRPSSC 470


>gi|326491519|dbj|BAJ94237.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326524456|dbj|BAK00611.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 499

 Score =  275 bits (703), Expect = 4e-71,   Method: Compositional matrix adjust.
 Identities = 161/400 (40%), Positives = 223/400 (55%), Gaps = 24/400 (6%)

Query: 89  QNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GRNM 146
           +++L      VQ    +   +  G+    S   +P TSG  + T NY+ T+ LG      
Sbjct: 117 RDKLTKHAAPVQPGPKKSPGIHPGHSASSSTPSLPATSGRAVSTGNYVVTVGLGTPASKY 176

Query: 147 TVIVDTGSDLTWVQCQPCK-SCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGV 205
           TV+ DTGSD TWVQC+PC   CY Q++P+FDP+ S +Y  V C  S C  L+      G 
Sbjct: 177 TVVFDTGSDTTWVQCRPCVVKCYKQKEPLFDPAKSSTYANVSCTDSACADLDTNGCTGG- 235

Query: 206 CSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMG 265
                   C Y V YGDGSYT G   ++ L +   ++  F FGCG  N GLFG  +GLMG
Sbjct: 236 -------HCLYAVQYGDGSYTVGFFAQDTLTIAHDAIKGFRFGCGEKNNGLFGKTAGLMG 288

Query: 266 LGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNP 325
           LGR   SL  Q    +GG F+YCLP+      +G L  G  S+   N+  +  T M+ + 
Sbjct: 289 LGRGKTSLTVQAYNKYGGAFAYCLPALTT--GTGYLDFGPGSA--GNNARL--TPMLTD- 341

Query: 326 QLATFYILNLTGISIGGKQLQA--SGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQF- 382
           +  TFY + +TGI +GG+Q+    S F+  G L+DSGTVITRLP + Y+AL + F K   
Sbjct: 342 KGQTFYYVGMTGIRVGGQQVPVAESVFSTAGTLVDSGTVITRLPATAYTALSSAFDKVML 401

Query: 383 -SGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCL 441
             G+  APG+SILDTC++ +   +V +P V + F+G A + VDV+GIVY +    +QVCL
Sbjct: 402 ARGYKKAPGYSILDTCYDFTGLSDVELPTVSLVFQGGACLDVDVSGIVYAISE--AQVCL 459

Query: 442 ALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
           A AS   ++   I+GN QQK   V+YD     +GFA   C
Sbjct: 460 AFASNGDDESVAIVGNTQQKTYGVLYDLGKKTVGFAPGSC 499


>gi|45735840|dbj|BAD12875.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
 gi|45735966|dbj|BAD12995.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
 gi|125583491|gb|EAZ24422.1| hypothetical protein OsJ_08175 [Oryza sativa Japonica Group]
          Length = 475

 Score =  274 bits (701), Expect = 6e-71,   Method: Compositional matrix adjust.
 Identities = 163/406 (40%), Positives = 230/406 (56%), Gaps = 37/406 (9%)

Query: 90  NRLILDNLHVQYLQSRIKNMISG----NIKDVSNTEIPLTSGIRLQTLNYIATIELGGRN 145
           + L  D    +Y+Q R+    +      +       +P   G  + TL Y+ T+ LG   
Sbjct: 93  DTLRADQRRAEYIQRRVSGAAAAAPGMQLAGSKAATVPANLGFSIGTLQYVVTVSLGTPA 152

Query: 146 M--TVIVDTGSDLTWVQCQPCKS--CYNQQDPVFDPSISPSYKKVLCNSSTCHALE-FAT 200
           +  T+ VDTGSD++WVQC+PC S  CY+Q+DP+FDP+ S SY  V C +++C  L  ++ 
Sbjct: 153 VAQTLEVDTGSDVSWVQCKPCPSPPCYSQRDPLFDPTRSSSYSAVPCAAASCSQLALYSN 212

Query: 201 GNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL-GKASVNDFIFGCGRNNKGLFGG 259
           G SG         C Y VSYGDGS T G    + L L G  ++  F+FGCG   +GLF G
Sbjct: 213 GCSGG-------QCGYVVSYGDGSTTTGVYSSDTLTLTGSNALKGFLFGCGHAQQGLFAG 265

Query: 260 VSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYT 319
           V GL+GLGR   SLVSQ S  +GG+FSYCLP TQ+  + G + LGG SS    ST    T
Sbjct: 266 VDGLLGLGRQGQSLVSQASSTYGGVFSYCLPPTQN--SVGYISLGGPSSTAGFST----T 319

Query: 320 NMIPNPQLATFYILNLTGISIGGKQL--QASGFAKGGILIDSGTVITRLPPSIYSALKAE 377
            ++      T+YI+ L GIS+GG+ L   AS FA G + +D+GTV+TRLPP+ YSAL++ 
Sbjct: 320 PLLTASNDPTYYIVMLAGISVGGQPLSIDASVFASGAV-VDTGTVVTRLPPTAYSALRSA 378

Query: 378 FLKQFS--GFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSD 435
           F    +  G+PSAP   ILDTC++ + Y  V +P + + F G A M +  +GI+      
Sbjct: 379 FRAAMAPYGYPSAPATGILDTCYDFTRYGTVTLPTISIAFGGGAAMDLGTSGIL------ 432

Query: 436 ASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
            +  CLA A    + +  I+GN QQ++  V +D   S +GF    C
Sbjct: 433 -TSGCLAFAPTGGDSQASILGNVQQRSFEVRFD--GSTVGFMPASC 475


>gi|326520736|dbj|BAJ92731.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 499

 Score =  273 bits (699), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 161/400 (40%), Positives = 222/400 (55%), Gaps = 24/400 (6%)

Query: 89  QNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GRNM 146
           +++L      VQ    +   +  G+    S   +P TSG  + T NY+ T+ LG      
Sbjct: 117 RDKLTKHAAPVQPGPKKSPGIHPGHSASSSTPSLPATSGRAVSTGNYVVTVGLGTPASKY 176

Query: 147 TVIVDTGSDLTWVQCQPCK-SCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGV 205
           TV+ DTGSD TWVQC+PC   CY Q+ P+FDP+ S +Y  V C  S C  L+      G 
Sbjct: 177 TVVFDTGSDTTWVQCRPCVVKCYKQKGPLFDPAKSSTYANVSCTDSACADLDTNGCTGG- 235

Query: 206 CSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMG 265
                   C Y V YGDGSYT G   ++ L +   ++  F FGCG  N GLFG  +GLMG
Sbjct: 236 -------HCLYAVQYGDGSYTVGFFAQDTLTIAHDAIKGFRFGCGEKNNGLFGKTAGLMG 288

Query: 266 LGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNP 325
           LGR   SL  Q    +GG F+YCLP+      +G L  G  S+   N+  +  T M+ + 
Sbjct: 289 LGRGKTSLTVQAYNKYGGAFAYCLPALTT--GTGYLDFGPGSA--GNNARL--TPMLTD- 341

Query: 326 QLATFYILNLTGISIGGKQLQA--SGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQF- 382
           +  TFY + +TGI +GG+Q+    S F+  G L+DSGTVITRLP + Y+AL + F K   
Sbjct: 342 KGQTFYYVGMTGIRVGGQQVPVAESVFSTAGTLVDSGTVITRLPATAYTALSSAFDKVML 401

Query: 383 -SGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCL 441
             G+  APG+SILDTC++ +   +V +P V + F+G A + VDV+GIVY +    +QVCL
Sbjct: 402 ARGYKKAPGYSILDTCYDFTGLSDVELPTVSLVFQGGACLDVDVSGIVYAISE--AQVCL 459

Query: 442 ALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
           A AS   ++   I+GN QQK   V+YD     +GFA   C
Sbjct: 460 AFASNGDDESVAIVGNTQQKTYGVLYDLGKKTVGFAPGSC 499


>gi|242066176|ref|XP_002454377.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
 gi|241934208|gb|EES07353.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
          Length = 474

 Score =  273 bits (697), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 181/496 (36%), Positives = 251/496 (50%), Gaps = 46/496 (9%)

Query: 5   VKPLTILSLLLPLMVSLFLLAKGAHCFEGKKKLHLHKLQWQQKSGSSSSCVS----HQKS 60
           V+   +LSL+    +     + GA    G   +   + +       SS+C S     Q+ 
Sbjct: 6   VRRALLLSLICAGALGFLPCSHGAAVAPGYVTVSAARFR------PSSTCSSLDPVAQRR 59

Query: 61  RIEMGAITLELKHKN-YCSGKIVD--WNEQQQNRLILDNLHVQYLQSRIKNMISGNIKD- 116
           R    A+ L L HK+  C+             + L  D    +Y+  R+    +  + D 
Sbjct: 60  RNGTSAV-LRLTHKHGPCAPSRASSLATPSVADTLRADQRRAEYILRRVSGRGTPQLWDS 118

Query: 117 ---VSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKS--CYN 169
               +   +P   G  + TLNY+ T+ LG  G   T+ VDTGSDL+WVQC PC +  CY+
Sbjct: 119 KAEAATATVPANWGFNIGTLNYVVTVSLGTPGVAQTLEVDTGSDLSWVQCTPCAAPACYS 178

Query: 170 QQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGE 229
           Q+DP+FDP+ S SY  V C    C  L          SS S   C Y VSYGDGS T G 
Sbjct: 179 QKDPLFDPAQSSSYAAVPCGGPVCGGLGI------YASSCSAAQCGYVVSYGDGSKTTGV 232

Query: 230 LGREHLGLGK-ASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYC 288
              + L L    +V  F FGCG    G F G  GL+GLGR + SLV QT+  +GG+FSYC
Sbjct: 233 YSSDTLTLSPNDAVRGFFFGCGHAQSG-FTGNDGLLGLGREEASLVEQTAGTYGGVFSYC 291

Query: 289 LPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQA- 347
           LP+      +G L LGG S         + T ++ +P  AT+Y++ LTGIS+GG+QL   
Sbjct: 292 LPTRPST--TGYLTLGGPSGAAPPG--FSTTQLLSSPNAATYYVVMLTGISVGGQQLSVP 347

Query: 348 SGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFS--GFPSAPGFSILDTCFNLSAYQE 405
           S    GG ++D+GTVITRLPP+ Y+AL++ F    +  G+PSAP   ILDTC+N S Y  
Sbjct: 348 SSVFAGGTVVDTGTVITRLPPTAYAALRSAFRSGMASYGYPSAPATGILDTCYNFSGYGT 407

Query: 406 VNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRV 465
           V +P V + F G A +T+   GI+ F        CLA A    +    I+GN QQ++  V
Sbjct: 408 VTLPNVALTFSGGATVTLGADGILSF-------GCLAFAPSGSDGGMAILGNVQQRSFEV 460

Query: 466 IYDTKNSQLGFAGEDC 481
             D   + +GF    C
Sbjct: 461 RID--GTSVGFKPSSC 474


>gi|242066140|ref|XP_002454359.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
 gi|241934190|gb|EES07335.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
          Length = 460

 Score =  271 bits (694), Expect = 4e-70,   Method: Compositional matrix adjust.
 Identities = 170/443 (38%), Positives = 232/443 (52%), Gaps = 37/443 (8%)

Query: 52  SSCVSHQKSRIEMGAITLELKHKN-YCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMI 110
           S C   +  R   GA T+ L H++  CS          ++RL  D L   Y    IK   
Sbjct: 42  SVCSESKAVRSSSGATTVPLHHRHGPCSPLPTKKMPSLEDRLHRDQLRAAY----IKRKF 97

Query: 111 SGNIK---------DVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWV 159
           SG++K         + S+  +P T G  L TL Y+ T+ LG   +  TV++D+GSD++WV
Sbjct: 98  SGDVKKDGQGAGGVEQSHVTVPTTLGTSLNTLEYLITVRLGSPAKTQTVLIDSGSDVSWV 157

Query: 160 QCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVS 219
           QC+PC  C++Q DP+FDPS+S +Y    C+S+ C  L    GN   CSSSS   C Y V 
Sbjct: 158 QCKPCLQCHSQVDPLFDPSLSSTYSPFSCSSAACAQLG-QDGNG--CSSSS--QCQYIVR 212

Query: 220 YGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSE 279
           Y DGS T G    + L LG  ++++F FGC     G      GLMGLG    SL SQT+ 
Sbjct: 213 YADGSSTTGTYSSDTLALGSNTISNFQFGCSHVESGFNDLTDGLMGLGGGAPSLASQTAG 272

Query: 280 IFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGIS 339
            FG  FSYCLP T    +SG L LG  +S F   TP+  ++ +P     TFY + L  I 
Sbjct: 273 TFGTAFSYCLPPTPS--SSGFLTLGAGTSGFVK-TPMLRSSPVP-----TFYGVRLEAIR 324

Query: 340 IGGKQLQ-ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCF 398
           +GG QL   +     G+++DSGT+ITRLP + YSAL + F      +  AP  SI+DTCF
Sbjct: 325 VGGTQLSIPTSVFSAGMVMDSGTIITRLPRTAYSALSSAFKAGMKQYRPAPPRSIMDTCF 384

Query: 399 NLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNY 458
           + S    V +P V + F G A + +D  GI+          CLA A+ S +   GI+GN 
Sbjct: 385 DFSGQSSVRLPSVALVFSGGAVVNLDANGIIL-------GNCLAFAANSDDSSPGIVGNV 437

Query: 459 QQKNQRVIYDTKNSQLGFAGEDC 481
           QQ+   V+YD     +GF    C
Sbjct: 438 QQRTFEVLYDVGGGAVGFKAGAC 460


>gi|86438622|emb|CAJ26370.1| chloroplast nucleoid binding protein [Brachypodium sylvaticum]
          Length = 443

 Score =  270 bits (690), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 167/411 (40%), Positives = 226/411 (54%), Gaps = 38/411 (9%)

Query: 95  DNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDT 152
           D   V  +   I N  +   +DVS   +P   GI + T NY+ ++ LG   R++TV+ DT
Sbjct: 48  DQARVDSIHRMIANETAVVGQDVS---LPAERGISVGTGNYVVSVGLGTPARDLTVVFDT 104

Query: 153 GSDLTWVQCQPCKS--CYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSS 210
           GSDL+WVQC PC S  CY+QQDP+F PS S ++  V C    C     +      CSSS 
Sbjct: 105 GSDLSWVQCGPCSSGGCYHQQDPLFAPSSSSTFSAVRCGEPECPRARQS------CSSSP 158

Query: 211 PPD-CNYFVSYGDGSYTRGELGREHLGLG-----KASVND------FIFGCGRNNKGLFG 258
             D C Y V YGD S T G LG + L LG      AS N+      F+FGCG NN GLFG
Sbjct: 159 GDDRCPYEVVYGDKSRTVGHLGNDTLTLGTTPSTNASENNSNKLPGFVFGCGENNTGLFG 218

Query: 259 GVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITY 318
              GL GLGR  +SL SQ +  +G  FSYCLPS+  + A G L LG  +    ++    +
Sbjct: 219 KADGLFGLGRGKVSLSSQAAGKYGEGFSYCLPSSS-SNAHGYLSLGTPAPAPAHA---RF 274

Query: 319 TNMIPNPQLATFYILNLTGISIGGKQLQAS---GFAKGGILIDSGTVITRLPPSIYSALK 375
           T M+      +FY + L GI + G+ ++ S        G+++DSGTVITRL P  YSAL+
Sbjct: 275 TPMLNRSNTPSFYYVKLVGIRVAGRAIKVSSRPALWPAGLIVDSGTVITRLAPRAYSALR 334

Query: 376 AEFLKQFS--GFPSAPGFSILDTCFNLSAYQE--VNIPLVKMEFEGNAEMTVDVTGIVYF 431
             FL      G+  AP  SILDTC++ +A+    V+IP V + F G A ++VD +G++Y 
Sbjct: 335 TAFLSAMGKYGYKRAPRLSILDTCYDFTAHANATVSIPAVALVFAGGATISVDFSGVLYV 394

Query: 432 VKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
            K   +Q CLA A        GI+GN QQ+   V+YD    ++GFA + CS
Sbjct: 395 AK--VAQACLAFAPNGNGRSAGILGNTQQRTVAVVYDVGRQKIGFAAKGCS 443


>gi|326495920|dbj|BAJ90582.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 476

 Score =  270 bits (689), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 169/434 (38%), Positives = 238/434 (54%), Gaps = 53/434 (12%)

Query: 69  LELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRI------------KNMISGNIKD 116
           L L H+   S     + E Q+     D   V+Y+Q R+            + + +G+   
Sbjct: 75  LRLAHRCGPSTASASFAEVQRA----DEQRVEYIQRRVSGGGARGAKGALQQLATGS--- 127

Query: 117 VSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKS--CYNQQD 172
             +  +P T G+   T  Y+ T+ LG  G + TV VDTGSD++WVQC+PC +  C +Q+D
Sbjct: 128 -RSATVPTTMGV--GTFQYVVTVSLGTPGVSQTVEVDTGSDVSWVQCKPCSAPACNSQRD 184

Query: 173 PVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGR 232
            +FDP+ S +Y  V C +  C  L         CS S    C Y VSYGDGS T G  G 
Sbjct: 185 QLFDPAKSSTYSAVPCGADACSELRIYEAG---CSGS---QCGYVVSYGDGSNTTGVYGS 238

Query: 233 EHLGLGKA-SVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPS 291
           + L L    +V  F+FGCG    G+F G+ GL+ LGR  +SL SQ +  +GG+FSYCLPS
Sbjct: 239 DTLALAPGNTVGTFLFGCGHAQAGMFAGIDGLLALGRQSMSLKSQAAGAYGGVFSYCLPS 298

Query: 292 TQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ--ASG 349
            Q   A+G L LGG SS    +T    T ++      TFY++ LTGIS+GG+Q+   AS 
Sbjct: 299 KQS--AAGYLTLGGPSSASGFAT----TGLLTAWAAPTFYMVMLTGISVGGQQVAVPASA 352

Query: 350 FAKGGILIDSGTVITRLPPSIYSALKAEFLKQFS--GFPSAPGFSILDTCFNLSAYQEVN 407
           FA GG ++D+GTVITRLPP+ Y+AL++ F    +  G+PSAP   ILDTC++ S Y  V 
Sbjct: 353 FA-GGTVVDTGTVITRLPPTAYAALRSAFRGAIAPCGYPSAPANGILDTCYDFSRYGVVT 411

Query: 408 IPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIY 467
           +P V + F G A + ++  GI+       S  CLA A    + +  I+GN QQ++  V +
Sbjct: 412 LPTVALTFSGGATLALEAPGIL-------SSGCLAFAPNGGDGDAAILGNVQQRSFAVRF 464

Query: 468 DTKNSQLGFAGEDC 481
           D   S +GF    C
Sbjct: 465 D--GSTVGFMPGAC 476


>gi|326531454|dbj|BAJ97731.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 476

 Score =  268 bits (686), Expect = 4e-69,   Method: Compositional matrix adjust.
 Identities = 168/434 (38%), Positives = 238/434 (54%), Gaps = 53/434 (12%)

Query: 69  LELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRI------------KNMISGNIKD 116
           L L H+   S     + E Q+     D   V+Y+Q R+            + + +G+   
Sbjct: 75  LRLAHRCGPSTASASFAEVQRA----DEQRVEYIQRRVSGGGARGAKGALQQLATGS--- 127

Query: 117 VSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKS--CYNQQD 172
             +  +P T G+   T  Y+ T+ LG  G + TV VDTGSD++WVQC+PC +  C +Q+D
Sbjct: 128 -RSATVPTTMGV--GTFQYVVTVSLGTPGVSQTVEVDTGSDVSWVQCKPCSAPACNSQRD 184

Query: 173 PVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGR 232
            +FDP+ S +Y  V C +  C  L         CS S    C Y VSYGDGS T G  G 
Sbjct: 185 QLFDPAKSSTYSAVPCGADACSELRIYEAG---CSGS---QCGYVVSYGDGSNTTGVYGS 238

Query: 233 EHLGLGKA-SVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPS 291
           + L L    +V  F+FGCG    G+F G+ GL+ LGR  +SL SQ +  +GG+FSYCLPS
Sbjct: 239 DTLALAPGNTVGTFLFGCGHAQAGMFAGIDGLLALGRQSMSLKSQAAGAYGGVFSYCLPS 298

Query: 292 TQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ--ASG 349
            Q   A+G L LGG +S    +T    T ++      TFY++ LTGIS+GG+Q+   AS 
Sbjct: 299 KQS--AAGYLTLGGPTSASGFAT----TGLLTAWAAPTFYMVMLTGISVGGQQVAVPASA 352

Query: 350 FAKGGILIDSGTVITRLPPSIYSALKAEFLKQFS--GFPSAPGFSILDTCFNLSAYQEVN 407
           FA GG ++D+GTVITRLPP+ Y+AL++ F    +  G+PSAP   ILDTC++ S Y  V 
Sbjct: 353 FA-GGTVVDTGTVITRLPPTAYAALRSAFRGAIAPYGYPSAPANGILDTCYDFSRYGVVT 411

Query: 408 IPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIY 467
           +P V + F G A + ++  GI+       S  CLA A    + +  I+GN QQ++  V +
Sbjct: 412 LPTVALTFSGGATLALEAPGIL-------SSGCLAFAPNGGDGDAAILGNVQQRSFAVRF 464

Query: 468 DTKNSQLGFAGEDC 481
           D   S +GF    C
Sbjct: 465 D--GSTVGFMPGAC 476


>gi|224142009|ref|XP_002324353.1| predicted protein [Populus trichocarpa]
 gi|222865787|gb|EEF02918.1| predicted protein [Populus trichocarpa]
          Length = 412

 Score =  266 bits (680), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 172/402 (42%), Positives = 244/402 (60%), Gaps = 27/402 (6%)

Query: 87  QQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GR 144
           + QNR+  D++H + L SR      G   +   T +P+ SG  +   +Y+ T+ LG   +
Sbjct: 32  RDQNRV--DSIHAR-LSSR------GMFPEKQATTLPVQSGASIGAGDYVVTVGLGTPKK 82

Query: 145 NMTVIVDTGSDLTWVQCQPC-KSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNS 203
             T+I DTGSD+TW QC+PC K+CY Q++P  +PS S SYK + C+S+ C  +      S
Sbjct: 83  EFTLIFDTGSDITWTQCEPCVKTCYKQKEPRLNPSTSTSYKNISCSSALCKLVASGKKFS 142

Query: 204 GVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASV-NDFIFGCGRNNKGLFGGVSG 262
             CSSS+   C Y V YGDGSY+ G    E L L  ++V  +F+FGCG+ N GLFGG +G
Sbjct: 143 QSCSSST---CLYQVQYGDGSYSIGFFATETLTLSSSNVFKNFLFGCGQQNNGLFGGAAG 199

Query: 263 LMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMI 322
           L+GLGR+ L+L SQT++ +  LFSYCLP++  + + G L LGG  S     TP++  +  
Sbjct: 200 LLGLGRTKLALPSQTAKTYKKLFSYCLPAS--SSSKGYLSLGGQVSKSVKFTPLS-ADFD 256

Query: 323 PNPQLATFYILNLTGISIGGKQL--QASGFAKGGILIDSGTVITRLPPSIYSALKAEFLK 380
             P    FY L++TG+S+GG+QL    S F+  G +IDSGTVITRL P+ YS L + F  
Sbjct: 257 STP----FYGLDITGLSVGGRQLSIDESAFS-AGTVIDSGTVITRLSPTAYSELSSAFQN 311

Query: 381 QFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVC 440
             + +PS  G+SI DTC++ S Y  V IP V + F+G  EM +DV+GI+Y V     +VC
Sbjct: 312 LMTDYPSTSGYSIFDTCYDFSKYDTVRIPKVGVTFKGGVEMDIDVSGILYPVNG-LKKVC 370

Query: 441 LALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
           LA A    + +T I GN QQ+  +V+YD    ++GFA   CS
Sbjct: 371 LAFAGNDDDSDTSIFGNVQQRTYQVVYDGAKGRVGFAPGGCS 412


>gi|224142005|ref|XP_002324351.1| predicted protein [Populus trichocarpa]
 gi|222865785|gb|EEF02916.1| predicted protein [Populus trichocarpa]
          Length = 472

 Score =  266 bits (679), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 171/402 (42%), Positives = 244/402 (60%), Gaps = 27/402 (6%)

Query: 87  QQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GR 144
           + QNR+  D++H + L SR      G   +   T +P+ SG  +   +Y+ T+ LG   +
Sbjct: 92  RDQNRV--DSIHAR-LSSR------GMFPEKQATTLPVQSGASIGAGDYVVTVGLGTPKK 142

Query: 145 NMTVIVDTGSDLTWVQCQPC-KSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNS 203
             T+I DTGSD+TW QC+PC K+CY Q++P  +PS S SYK + C+S+ C  +      S
Sbjct: 143 EFTLIFDTGSDITWTQCEPCVKTCYKQKEPRLNPSTSTSYKNISCSSALCKLVASGKKFS 202

Query: 204 GVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASV-NDFIFGCGRNNKGLFGGVSG 262
             CSSS+   C Y V YGDGSY+ G    E L L  ++V  +F+FGCG+ N GLFGG +G
Sbjct: 203 QSCSSST---CLYQVQYGDGSYSIGFFATETLTLSSSNVFKNFLFGCGQQNNGLFGGAAG 259

Query: 263 LMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMI 322
           L+GLGR+ L+L SQT++ +  LFSYCLP++  + + G L LGG  S     TP++  +  
Sbjct: 260 LLGLGRTKLALPSQTAKTYKKLFSYCLPAS--SSSKGYLSLGGQVSKSVKFTPLS-ADFD 316

Query: 323 PNPQLATFYILNLTGISIGGKQL--QASGFAKGGILIDSGTVITRLPPSIYSALKAEFLK 380
             P    FY L++TG+S+GG++L    S F+  G +IDSGTVITRL P+ YS L + F  
Sbjct: 317 STP----FYGLDITGLSVGGRKLSIDESAFS-AGTVIDSGTVITRLSPTAYSELSSAFQN 371

Query: 381 QFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVC 440
             + +PS  G+SI DTC++ S Y  V IP V + F+G  EM +DV+GI+Y V     +VC
Sbjct: 372 LMTDYPSTSGYSIFDTCYDFSKYDTVRIPKVGVTFKGGVEMDIDVSGILYPVNG-LKKVC 430

Query: 441 LALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
           LA A    + +T I GN QQ+  +V+YD    ++GFA   CS
Sbjct: 431 LAFAGNDDDSDTSIFGNVQQRTYQVVYDGAKGRVGFAPGGCS 472


>gi|224142007|ref|XP_002324352.1| predicted protein [Populus trichocarpa]
 gi|222865786|gb|EEF02917.1| predicted protein [Populus trichocarpa]
          Length = 460

 Score =  265 bits (678), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 171/402 (42%), Positives = 244/402 (60%), Gaps = 27/402 (6%)

Query: 87  QQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GR 144
           + QNR+  D++H + L SR      G   +   T +P+ SG  +   +Y+ T+ LG   +
Sbjct: 80  RDQNRV--DSIHAR-LSSR------GMFPEKQATTLPVQSGASIGAGDYVVTVGLGTPKK 130

Query: 145 NMTVIVDTGSDLTWVQCQPC-KSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNS 203
             T+I DTGSD+TW QC+PC K+CY Q++P  +PS S SYK + C+S+ C  +      S
Sbjct: 131 EFTLIFDTGSDITWTQCEPCVKTCYKQKEPRLNPSTSTSYKNISCSSALCKLVASGKKFS 190

Query: 204 GVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASV-NDFIFGCGRNNKGLFGGVSG 262
             CSSS+   C Y V YGDGSY+ G    E L L  ++V  +F+FGCG+ N GLFGG +G
Sbjct: 191 QSCSSST---CLYQVQYGDGSYSIGFFATETLTLSSSNVFKNFLFGCGQQNNGLFGGAAG 247

Query: 263 LMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMI 322
           L+GLGR+ L+L SQT++ +  LFSYCLP++  + + G L LGG  S     TP++  +  
Sbjct: 248 LLGLGRTKLALPSQTAKTYKKLFSYCLPAS--SSSKGYLSLGGQVSKSVKFTPLS-ADFD 304

Query: 323 PNPQLATFYILNLTGISIGGKQL--QASGFAKGGILIDSGTVITRLPPSIYSALKAEFLK 380
             P    FY L++TG+S+GG++L    S F+  G +IDSGTVITRL P+ YS L + F  
Sbjct: 305 STP----FYGLDITGLSVGGRKLSIDESAFS-AGTVIDSGTVITRLSPTAYSELSSAFQN 359

Query: 381 QFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVC 440
             + +PS  G+SI DTC++ S Y  V IP V + F+G  EM +DV+GI+Y V     +VC
Sbjct: 360 LMTDYPSTSGYSIFDTCYDFSKYDTVRIPKVGVTFKGGVEMDIDVSGILYPVNG-LKKVC 418

Query: 441 LALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
           LA A    + +T I GN QQ+  +V+YD    ++GFA   CS
Sbjct: 419 LAFAGNDDDSDTSIFGNVQQRTYQVVYDGAKGRVGFAPGGCS 460


>gi|357154085|ref|XP_003576664.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 509

 Score =  263 bits (672), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 164/408 (40%), Positives = 222/408 (54%), Gaps = 35/408 (8%)

Query: 95  DNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDT 152
           D   V  +   I N  S     VS   +P   GI + T NY+ ++ LG   R++TV+ DT
Sbjct: 117 DQARVDSILGMITNETSAVGPGVS---LPAERGISVGTGNYVVSVGLGTPARDLTVVFDT 173

Query: 153 GSDLTWVQCQPCKS--CYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSS 210
           GSDL+WVQC PC S  CY QQDP+F PS S ++  V C +  C A +   G+ G      
Sbjct: 174 GSDLSWVQCGPCSSGGCYKQQDPLFAPSDSSTFSAVRCGARECRARQSCGGSPG------ 227

Query: 211 PPDCNYFVSYGDGSYTRGELGREHLGLG------KASVND-----FIFGCGRNNKGLFGG 259
              C Y V YGD S T+G LG + L LG       ++ ND     F+FGCG NN GLFG 
Sbjct: 228 DDRCPYEVVYGDKSRTQGHLGNDTLTLGTMAPANASAENDNKLPGFVFGCGENNTGLFGQ 287

Query: 260 VSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYT 319
             GL GLGR  +SL SQ +  FG  FSYCLPS+  + A G L LG       ++    +T
Sbjct: 288 ADGLFGLGRGKVSLSSQAAGKFGEGFSYCLPSSSSS-APGYLSLGTPVPAPAHA---QFT 343

Query: 320 NMIPNPQLATFYILNLTGISIGGKQLQASGFAKG-GILIDSGTVITRLPPSIYSALKAEF 378
            M+      +FY + L GI + G+ ++ S       +++DSGTVITRL P  Y AL+A F
Sbjct: 344 PMLNRTTTPSFYYVKLVGIRVAGRAIRVSSPRVALPLIVDSGTVITRLAPRAYRALRAAF 403

Query: 379 LKQFS--GFPSAPGFSILDTCFNLSAYQE--VNIPLVKMEFEGNAEMTVDVTGIVYFVKS 434
           L      G+  AP  SILDTC++ +A+    V+IP V + F G A ++VD +G++Y  K 
Sbjct: 404 LSAMGKYGYKRAPRLSILDTCYDFTAHANATVSIPAVALVFAGGATISVDFSGVLYVAK- 462

Query: 435 DASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
             +Q CLA A        GI+GN QQ+   V+YD    ++GFA + CS
Sbjct: 463 -VAQACLAFAPNGDGRSAGILGNTQQRTLAVVYDVARQKIGFAAKGCS 509


>gi|195607464|gb|ACG25562.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
          Length = 478

 Score =  263 bits (671), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 161/409 (39%), Positives = 223/409 (54%), Gaps = 39/409 (9%)

Query: 90  NRLILDNLHVQYLQSRIKNMISGNIKDVSNTE-------IPLTSGIRLQTLNYIATIELG 142
           + L  D    +Y+  R+    SG    + +++       +P + G  + TLNY+ T  LG
Sbjct: 92  DTLRADQRRAEYILRRV----SGRAPQLWDSKAAAAAATVPASWGYDIGTLNYVVTASLG 147

Query: 143 --GRNMTVIVDTGSDLTWVQCQPC---KSCYNQQDPVFDPSISPSYKKVLCNSSTCHALE 197
             G   T+ VDTGSDL+WVQC+PC    SCY+Q+DP+FDP+ S SY  V C    C  L 
Sbjct: 148 TPGVAQTMEVDTGSDLSWVQCKPCSAAPSCYSQKDPLFDPAQSSSYAAVPCGGPVCAGLG 207

Query: 198 FATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS-VNDFIFGCGRNNKGL 256
               ++   +        Y VSYGDGS T G    + L L  +S V  F FGCG    GL
Sbjct: 208 IYAASACSAAQC-----GYVVSYGDGSNTTGVYSSDTLTLSASSAVQGFFFGCGHAQSGL 262

Query: 257 FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPI 316
           F GV GL+GLGR   SLV QT+  +GG+FSYCLP+        +L LGG S         
Sbjct: 263 FNGVDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPSTAGYLTLGLGGPSGAAPG---F 319

Query: 317 TYTNMIPNPQLATFYILNLTGISIGGKQLQ--ASGFAKGGILIDSGTVITRLPPSIYSAL 374
           + T ++P+P   T+Y++ LTGIS+GG+QL   AS FA GG ++D+GTVITRLPP+ Y+AL
Sbjct: 320 STTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFA-GGTVVDTGTVITRLPPTAYAAL 378

Query: 375 KAEFLKQFS--GFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFV 432
           ++ F    +  G+P+AP   ILDTC+N + Y  V +P V + F   A + +   GI+ F 
Sbjct: 379 RSAFRSGMASYGYPTAPSNGILDTCYNFAGYGTVTLPNVALTFGSGATVMLGADGILSF- 437

Query: 433 KSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
                  CLA A    +    I+GN QQ++  V  D   + +GF    C
Sbjct: 438 ------GCLAFAPSGSDGGMAILGNVQQRSFEVRID--GTSVGFKPSSC 478


>gi|242062704|ref|XP_002452641.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
 gi|241932472|gb|EES05617.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
          Length = 466

 Score =  262 bits (670), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 167/426 (39%), Positives = 228/426 (53%), Gaps = 27/426 (6%)

Query: 65  GAITLELKHKN-YCSGKIVDWNEQQ---QNRLILDNLHVQYLQSRIKNMISGNIKDVSNT 120
           G IT+ L H++  CS   V  N+     + RL  D L   Y++ +      G+++     
Sbjct: 59  GGITVPLHHRHGPCS--PVPSNKMPASLEERLQRDQLRAAYIKRKFSGAKGGDVEQSDAA 116

Query: 121 EIPLTSGIRLQTLNYIATIELGGRNMT--VIVDTGSDLTWVQCQPCKSCYNQQDPVFDPS 178
            +P T G  L TL Y+ T+ +G   +T  + +DTGSD++WVQC+PC  C+++ D +FDPS
Sbjct: 117 TVPTTLGTSLSTLEYVITVGIGSPAVTQTMSMDTGSDVSWVQCKPCSQCHSEVDSLFDPS 176

Query: 179 ISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG 238
            S +Y    C+S+ C  L  +   +G CSSS    C Y VSY DGS T G    + L LG
Sbjct: 177 ASSTYSPFSCSSAACVQLSQSQQGNG-CSSS---QCQYIVSYVDGSSTTGTYSSDTLTLG 232

Query: 239 KASVNDFIFGCGRNNKGLFGG-VSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGA 297
             ++  F FGC ++  G F     GLMGLG    SLVSQT+  FG  FSYCLP T   G+
Sbjct: 233 SNAIKGFQFGCSQSESGGFSDQTDGLMGLGGDAQSLVSQTAGTFGKAFSYCLPPT--PGS 290

Query: 298 SGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ--ASGFAKGGI 355
           SG L LG  S      TP+  +  IP     T+Y + L  I +GG+QL    S F+ G +
Sbjct: 291 SGFLTLGAASRSGFVKTPMLRSTQIP-----TYYGVLLEAIRVGGQQLNIPTSVFSAGSV 345

Query: 356 LIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEF 415
           + DSGTVITRLPP+ YSAL + F      +P A    ILDTCF+ S    V+IP V + F
Sbjct: 346 M-DSGTVITRLPPTAYSALSSAFKAGMKKYPPAQPSGILDTCFDFSGQSSVSIPSVALVF 404

Query: 416 EGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLG 475
            G A + +D  GI+     +    CLA A+ S +   G IGN QQ+   V+YD     +G
Sbjct: 405 SGGAVVNLDFNGIML----ELDNWCLAFAANSDDSSLGFIGNVQQRTFEVLYDVGGGAVG 460

Query: 476 FAGEDC 481
           F    C
Sbjct: 461 FRAGAC 466


>gi|255568540|ref|XP_002525244.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223535541|gb|EEF37210.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 460

 Score =  261 bits (667), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 156/407 (38%), Positives = 239/407 (58%), Gaps = 37/407 (9%)

Query: 95  DNLHVQYLQSRIK----------NMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELGG- 143
           D  HV++L SR++             SG++ + ++  IPL  G+ + + NY   + LG  
Sbjct: 70  DEEHVKFLSSRLRKKDVQGASFSRHKSGHLLEPNSANIPLNPGLSIGSGNYYLKLGLGSP 129

Query: 144 -RNMTVIVDTGSDLTWVQCQPCKS-CYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATG 201
            +  T+I+DTGS L+W+QC+PC   C++Q DP+F+PS S +Y+ + C+SS C  L+ AT 
Sbjct: 130 PKYYTMILDTGSSLSWLQCKPCVVYCHSQVDPLFEPSASNTYRPLYCSSSECSLLKAATL 189

Query: 202 NSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCGRNNKGLFGGV 260
           N  +C++S    C Y  SYGD SY+ G L R+ L L  + ++  F +GCG++N+GLFG  
Sbjct: 190 NDPLCTASG--VCVYTASYGDASYSMGYLSRDLLTLTPSQTLPSFTYGCGQDNEGLFGKA 247

Query: 261 SGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNS-TPITYT 319
           +G++GL R  LS+++Q S  +G  FSYCLP++  +G       GG  S+ K S +   +T
Sbjct: 248 AGIVGLARDKLSMLAQLSPKYGYAFSYCLPTSTSSG-------GGFLSIGKISPSSYKFT 300

Query: 320 NMIPNPQLATFYILNLTGISIGGKQLQASGFAKGG----ILIDSGTVITRLPPSIYSALK 375
            MI N Q  + Y L L  I++ G+ +   G A  G     +IDSGTV+TRLP SIY+AL+
Sbjct: 301 PMIRNSQNPSLYFLRLAAITVAGRPV---GVAAAGYQVPTIIDSGTVVTRLPISIYAALR 357

Query: 376 AEFLKQFS-GFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKS 434
             F+K  S  +  AP +SILDTCF  S       P ++M F+G A++++    I+  +++
Sbjct: 358 EAFVKIMSRRYEQAPAYSILDTCFKGSLKSMSGAPEIRMIFQGGADLSLRAPNIL--IEA 415

Query: 435 DASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
           D    CLA AS    ++  IIGN+QQ+   + YD   S++GFA   C
Sbjct: 416 DKGIACLAFAS---SNQIAIIGNHQQQTYNIAYDVSASKIGFAPGGC 459


>gi|242095602|ref|XP_002438291.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
 gi|241916514|gb|EER89658.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
          Length = 464

 Score =  260 bits (664), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 162/442 (36%), Positives = 239/442 (54%), Gaps = 31/442 (7%)

Query: 51  SSSCVSHQKSRIEMGAITLELKHKN-YCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNM 109
           S  C  H+ +  + G+ TL L H++  CS  I       +  L  D L   Y+Q+++ + 
Sbjct: 43  SEVCSGHKVTPSKNGS-TLALSHRHGPCSPVISKEKPSHEETLRRDQLRAAYIQAKVSSR 101

Query: 110 ISGNIKDV--SNTEIPLTSGIRLQTLNYIATIELGGRNMTVI--VDTGSDLTWVQCQPC- 164
            +   K++  S   IP +SG  L T  Y+ T+ +G   +T +  +DTGSD++WVQC PC 
Sbjct: 102 YNNVAKELQQSAVTIPTSSGYSLGTTEYVITVTIGTPAVTQVMSIDTGSDVSWVQCAPCA 161

Query: 165 -KSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGV-CSSSSPPDCNYFVSYGD 222
            +SC +Q+D +FDP++S +Y    C S+ C  L    G+ G  C  S    C Y V YGD
Sbjct: 162 AQSCSSQKDKLFDPAMSATYSAFSCGSAQCAQL----GDEGNGCLKS---QCQYIVKYGD 214

Query: 223 GSYTRGELGREHLGLGKA-SVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIF 281
           GS T G  G + L L  + +V  F FGC     G  G + GLMGLG    SLVSQT+  +
Sbjct: 215 GSNTAGTYGSDTLSLTSSDAVKSFQFGCSHRAAGFVGELDGLMGLGGDTESLVSQTAATY 274

Query: 282 GGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIG 341
           G  FSYCLP    +G  G L LG       +S+  ++T M+    + TFY + L GI++ 
Sbjct: 275 GKAFSYCLPPPSSSGG-GFLTLGAAGGA--SSSRYSHTPMV-RFSVPTFYGVFLQGITVA 330

Query: 342 GKQLQ--ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFN 399
           G  L   AS F+ G  ++DSGTVIT+LPP+ Y AL+  F K+   +PSA     LDTCF+
Sbjct: 331 GTMLNVPASVFS-GASVVDSGTVITQLPPTAYQALRTAFKKEMKAYPSAAPVGSLDTCFD 389

Query: 400 LSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQ 459
            S +  + +P V + F   A M +D++GI+Y         CLA  + +++ +TGI+GN Q
Sbjct: 390 FSGFNTITVPTVTLTFSRGAAMDLDISGILY-------AGCLAFTATAHDGDTGILGNVQ 442

Query: 460 QKNQRVIYDTKNSQLGFAGEDC 481
           Q+   +++D     +GF    C
Sbjct: 443 QRTFEMLFDVGGRTIGFRSGAC 464


>gi|226530755|ref|NP_001150335.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
 gi|195638492|gb|ACG38714.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
          Length = 461

 Score =  259 bits (662), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 169/424 (39%), Positives = 226/424 (53%), Gaps = 25/424 (5%)

Query: 65  GAITLELKHKN-YCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDV--SNTE 121
           GA T+ L H++  CS          +  L  D L   Y+Q +          DV  S+  
Sbjct: 56  GAATVPLHHRHGPCSPLPTKKMPTLEETLHRDQLRAAYIQRKFSGGGG-AGGDVQRSDAT 114

Query: 122 IPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSI 179
           +P   G  L TL Y+ T+ LG    + T+++DTGSD++WVQC+PC  C++Q DP+FDPS 
Sbjct: 115 VPTALGTSLNTLEYLITVGLGSPATSQTMLIDTGSDVSWVQCKPCSQCHSQADPLFDPSS 174

Query: 180 SPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGK 239
           S +Y    C S+ C  L    GN   CSSSS   C Y V+YGDGS T G    + L LG 
Sbjct: 175 SSTYSPFSCGSAACAQLG-QEGNG--CSSSS--QCQYIVTYGDGSSTTGTYSSDTLALGS 229

Query: 240 ASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASG 299
           ++V  F FGC     G      GLMGLG    SLVSQT+   G  FSYCLP T    +SG
Sbjct: 230 SAVKSFQFGCSNVESGFNDQTDGLMGLGGGAQSLVSQTAGTLGRAFSYCLPPTPS--SSG 287

Query: 300 SLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ--ASGFAKGGILI 357
            L L   ++    ++    T M+ + Q+ TFY + L  I +GG+QL   AS F+ G ++ 
Sbjct: 288 FLTL--GAAGGSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVFSAGTVM- 344

Query: 358 DSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEG 417
           DSGTVITRLPP+ YSAL + F      +P A    ILDTCF+ S    V+IP V + F G
Sbjct: 345 DSGTVITRLPPTAYSALSSAFKAGMKQYPPAQPSGILDTCFDFSGQSSVSIPSVALVFSG 404

Query: 418 NAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFA 477
            A +++D +GI+          CLA A+ S +   GIIGN QQ+   V+YD     +GF 
Sbjct: 405 GAVVSLDASGIIL-------SNCLAFAANSDDSSLGIIGNVQQRTFEVLYDVGRGVVGFR 457

Query: 478 GEDC 481
              C
Sbjct: 458 AGAC 461


>gi|125592062|gb|EAZ32412.1| hypothetical protein OsJ_16623 [Oryza sativa Japonica Group]
          Length = 473

 Score =  259 bits (661), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 152/425 (35%), Positives = 230/425 (54%), Gaps = 26/425 (6%)

Query: 68  TLELKHKNYCSGKIVDWNEQQQNRLIL-DNLHVQYLQSRIKNMISGNI-KDVSNTEIPLT 125
           +L L H++  SG        Q   L+  DN  V++L+ R+    S  + +D+ +  +P  
Sbjct: 64  SLSLVHRDAISGATYPSRRHQVVGLVARDNARVEHLEKRLVASTSPYLPEDLVSEVVP-- 121

Query: 126 SGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSY 183
            G+   +  Y   + +G    +  ++VD+GSD+ WVQC+PC+ CY Q DP+FDP+ S S+
Sbjct: 122 -GVDDGSGEYFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQCYAQTDPLFDPAASSSF 180

Query: 184 KKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVN 243
             V C S+ C  L       G  +      C+Y V+YGDGSYT+GEL  E L LG  +V 
Sbjct: 181 SGVSCGSAICRTLSGTGCGGGGDAGK----CDYSVTYGDGSYTKGELALETLTLGGTAVQ 236

Query: 244 DFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLIL 303
               GCG  N GLF G +GL+GLG   +SLV Q     GG+FSYCL +++ AG +GSL+L
Sbjct: 237 GVAIGCGHRNSGLFVGAAGLLGLGWGAMSLVGQLGGAAGGVFSYCL-ASRGAGGAGSLVL 295

Query: 304 GGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL-------QASGFAKGGIL 356
           G   +V   +    +  ++ N Q ++FY + LTGI +GG++L       Q +    GG++
Sbjct: 296 GRTEAVPVGA---VWVPLVRNNQASSFYYVGLTGIGVGGERLPLQDSLFQLTEDGAGGVV 352

Query: 357 IDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFE 416
           +D+GT +TRLP   Y+AL+  F       P +P  S+LDTC++LS Y  V +P V   F+
Sbjct: 353 MDTGTAVTRLPREAYAALRGAFDGAMGALPRSPAVSLLDTCYDLSGYASVRVPTVSFYFD 412

Query: 417 GNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGF 476
             A +T+    +   V+   +  CLA A  S      I+GN QQ+  ++  D+ N  +GF
Sbjct: 413 QGAVLTLPARNL--LVEVGGAVFCLAFAPSS--SGISILGNIQQEGIQITVDSANGYVGF 468

Query: 477 AGEDC 481
               C
Sbjct: 469 GPNTC 473


>gi|413923784|gb|AFW63716.1| aspartic proteinase nepenthesin-2 [Zea mays]
          Length = 531

 Score =  259 bits (661), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 169/424 (39%), Positives = 225/424 (53%), Gaps = 25/424 (5%)

Query: 65  GAITLELKHKN-YCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDV--SNTE 121
           GA T+ L H++  CS          +  L  D L   Y+Q +          DV  S+  
Sbjct: 126 GAATVPLHHRHGPCSPLPTKKMPTLEETLHRDQLRAAYIQRKFSGGGG-AGGDVQRSDAT 184

Query: 122 IPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSI 179
           +P   G  L TL Y+ T+ LG    + T+++DTGSD++WVQC+PC  C++Q DP+FDPS 
Sbjct: 185 VPTALGTSLNTLEYLITVGLGSPATSQTMLIDTGSDVSWVQCKPCSQCHSQADPLFDPSS 244

Query: 180 SPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGK 239
           S +Y    C S+ C  L    GN   CSSSS   C Y V+YGDGS T G    + L LG 
Sbjct: 245 SSTYSPFSCGSADCAQLG-QEGNG--CSSSS--QCQYIVTYGDGSSTTGTYSSDTLALGS 299

Query: 240 ASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASG 299
           ++V  F FGC     G      GLMGLG    SLVSQT+   G  FSYCLP T    +SG
Sbjct: 300 SAVRSFQFGCSNVESGFNDQTDGLMGLGGGAQSLVSQTAGTLGRAFSYCLPPTPS--SSG 357

Query: 300 SLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ--ASGFAKGGILI 357
            L L   ++    ++    T M+ + Q+ TFY + L  I +GG+QL   AS F+ G ++ 
Sbjct: 358 FLTL--GAAGGSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVFSAGTVM- 414

Query: 358 DSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEG 417
           DSGTVITRLPP+ YSAL + F      +P A    ILDTCF+ S    V+IP V + F G
Sbjct: 415 DSGTVITRLPPTAYSALSSAFKAGMKQYPPAQPSGILDTCFDFSGQSSVSIPSVALVFSG 474

Query: 418 NAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFA 477
            A +++D +GI+          CLA A  S +   GIIGN QQ+   V+YD     +GF 
Sbjct: 475 GAVVSLDASGIIL-------SNCLAFAGNSDDSSLGIIGNVQQRTFEVLYDVGRGVVGFR 527

Query: 478 GEDC 481
              C
Sbjct: 528 AGAC 531


>gi|194707866|gb|ACF88017.1| unknown [Zea mays]
          Length = 461

 Score =  258 bits (659), Expect = 5e-66,   Method: Compositional matrix adjust.
 Identities = 169/424 (39%), Positives = 225/424 (53%), Gaps = 25/424 (5%)

Query: 65  GAITLELKHKN-YCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDV--SNTE 121
           GA T+ L H++  CS          +  L  D L   Y+Q +          DV  S+  
Sbjct: 56  GAATVPLHHRHGPCSPLPTKKMPTLEETLHRDQLRAAYIQRKFSGGGG-AGGDVQRSDAT 114

Query: 122 IPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSI 179
           +P   G  L TL Y+ T+ LG    + T+++DTGSD++WVQC+PC  C++Q DP+FDPS 
Sbjct: 115 VPTALGTSLNTLEYLITVGLGSPATSQTMLIDTGSDVSWVQCKPCSQCHSQADPLFDPSS 174

Query: 180 SPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGK 239
           S +Y    C S+ C  L    GN   CSSSS   C Y V+YGDGS T G    + L LG 
Sbjct: 175 SSTYSPFSCGSADCAQLG-QEGNG--CSSSS--QCQYIVTYGDGSSTTGTYSSDTLALGS 229

Query: 240 ASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASG 299
           ++V  F FGC     G      GLMGLG    SLVSQT+   G  FSYCLP T    +SG
Sbjct: 230 SAVRSFQFGCSNVESGFNDQTDGLMGLGGGAQSLVSQTAGTLGRAFSYCLPPTPS--SSG 287

Query: 300 SLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ--ASGFAKGGILI 357
            L L   ++    ++    T M+ + Q+ TFY + L  I +GG+QL   AS F+ G ++ 
Sbjct: 288 FLTL--GAAGGSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVFSAGTVM- 344

Query: 358 DSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEG 417
           DSGTVITRLPP+ YSAL + F      +P A    ILDTCF+ S    V+IP V + F G
Sbjct: 345 DSGTVITRLPPTAYSALSSAFKAGMKQYPPAQPSGILDTCFDFSGQSSVSIPSVALVFSG 404

Query: 418 NAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFA 477
            A +++D +GI+          CLA A  S +   GIIGN QQ+   V+YD     +GF 
Sbjct: 405 GAVVSLDASGIIL-------SNCLAFAGNSDDSSLGIIGNVQQRTFEVLYDVGRGVVGFR 457

Query: 478 GEDC 481
              C
Sbjct: 458 AGAC 461


>gi|90399033|emb|CAJ86229.1| H0402C08.5 [Oryza sativa Indica Group]
 gi|125550227|gb|EAY96049.1| hypothetical protein OsI_17922 [Oryza sativa Indica Group]
          Length = 473

 Score =  258 bits (658), Expect = 6e-66,   Method: Compositional matrix adjust.
 Identities = 150/424 (35%), Positives = 228/424 (53%), Gaps = 24/424 (5%)

Query: 68  TLELKHKNYCSGKIVDWNEQQQNRLIL-DNLHVQYLQSRIKNMISGNIKDVSNTEIPLTS 126
           +L L H++  SG        Q   L+  DN  V++L+ R+    S  + +   +E+    
Sbjct: 64  SLSLVHRDAISGATYPSRRHQVVGLVARDNARVEHLEKRLVASTSPYLPEDLVSEV--VP 121

Query: 127 GIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYK 184
           G+   +  Y   + +G    +  ++VD+GSD+ WVQC+PC+ CY Q DP+FDP+ S S+ 
Sbjct: 122 GVDDGSGEYFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQCYAQTDPLFDPAASSSFS 181

Query: 185 KVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVND 244
            V C S+ C  L       G  +      C+Y V+YGDGSYT+GEL  E L LG  +V  
Sbjct: 182 GVSCGSAICRTLSGTGCGGGGDAGK----CDYSVTYGDGSYTKGELALETLTLGGTAVQG 237

Query: 245 FIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILG 304
              GCG  N GLF G +GL+GLG   +SL+ Q     GG+FSYCL +++ AG +GSL+LG
Sbjct: 238 VAIGCGHRNSGLFVGAAGLLGLGWGAMSLIGQLGGAAGGVFSYCL-ASRGAGGAGSLVLG 296

Query: 305 GNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL-------QASGFAKGGILI 357
              +V   +    +  ++ N Q ++FY + LTGI +GG++L       Q +    GG+++
Sbjct: 297 RTEAVPVGA---VWVPLVRNNQASSFYYVGLTGIGVGGERLPLQDGLFQLTEDGAGGVVM 353

Query: 358 DSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEG 417
           D+GT +TRLP   Y+AL+  F       P +P  S+LDTC++LS Y  V +P V   F+ 
Sbjct: 354 DTGTAVTRLPREAYAALRGAFDGAMGALPRSPAVSLLDTCYDLSGYASVRVPTVSFYFDQ 413

Query: 418 NAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFA 477
            A +T+    +   V+   +  CLA A  S      I+GN QQ+  ++  D+ N  +GF 
Sbjct: 414 GAVLTLPARNL--LVEVGGAVFCLAFAPSS--SGISILGNIQQEGIQITVDSANGYVGFG 469

Query: 478 GEDC 481
              C
Sbjct: 470 PNTC 473


>gi|118487542|gb|ABK95598.1| unknown [Populus trichocarpa]
          Length = 471

 Score =  257 bits (656), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 153/403 (37%), Positives = 232/403 (57%), Gaps = 23/403 (5%)

Query: 95  DNLHVQYLQSRIKNM----------ISGNIKDVSNTEIPLTSGIRLQTLNYIATIELGG- 143
           D  HV+ L  R+ N            SG++ + ++  IPL  G+ + + NY   + LG  
Sbjct: 75  DEEHVKALSDRLANKGLGSGSAKPPKSGHLLEPNSASIPLNPGLSIGSGNYYVKLGLGTP 134

Query: 144 -RNMTVIVDTGSDLTWVQCQPCKS-CYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATG 201
            +   +I+DTGS L+W+QCQPC   C+ Q DP++DPS+S +YKK+ C S  C  L+ AT 
Sbjct: 135 PKYYAMILDTGSSLSWLQCQPCAVYCHAQADPLYDPSVSKTYKKLSCASVECSRLKAATL 194

Query: 202 NSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCGRNNKGLFGGV 260
           N  +C + S   C Y  SYGD S++ G L ++ L L  + ++  F +GCG++N+GLFG  
Sbjct: 195 NDPLCETDSN-ACLYTASYGDTSFSIGYLSQDLLTLTSSQTLPQFTYGCGQDNQGLFGRA 253

Query: 261 SGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTN 320
           +G++GL R  LS+++Q S  +G  FSYCLP+     + G  +  G+ S     T   +T 
Sbjct: 254 AGIIGLARDKLSMLAQLSTKYGHAFSYCLPTANSGSSGGGFLSIGSIS----PTSYKFTP 309

Query: 321 MIPNPQLATFYILNLTGISIGGKQLQ-ASGFAKGGILIDSGTVITRLPPSIYSALKAEFL 379
           M+ + +  + Y L LT I++ G+ L  A+   +   LIDSGTVITRLP S+Y+AL+  F+
Sbjct: 310 MLTDSKNPSLYFLRLTAITVSGRPLDLAAAMYRVPTLIDSGTVITRLPMSMYAALRQAFV 369

Query: 380 KQFS-GFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQ 438
           K  S  +  AP +SILDTCF  S      +P +KM F+G A++T+    I+  +++D   
Sbjct: 370 KIMSTKYAKAPAYSILDTCFKGSLKSISAVPEIKMIFQGGADLTLRAPSIL--IEADKGI 427

Query: 439 VCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
            CLA A  S  ++  IIGN QQ+   + YD   S++GFA   C
Sbjct: 428 TCLAFAGSSGTNQIAIIGNRQQQTYNIAYDVSTSRIGFAPGSC 470


>gi|413953792|gb|AFW86441.1| hypothetical protein ZEAMMB73_342504 [Zea mays]
          Length = 459

 Score =  256 bits (654), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 163/480 (33%), Positives = 251/480 (52%), Gaps = 37/480 (7%)

Query: 15  LPLMVSLFLLAKGAHCFEGKKKLH-LHKLQWQQKSGSSSSCVSHQKSRIEMGA--ITLEL 71
           LPL+V   L    +    G ++ H L  +   + S  +++C + +   ++ G+  +++ L
Sbjct: 4   LPLLVCFILCTYNSLAHGGNEEEHVLVAVPTSRYSEPAATCSTSRVRWLDEGSNTVSVPL 63

Query: 72  KHKN-YCSGKIVDWNEQQ-QNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIR 129
            H++  C+      +E     RL       +Y+ SR            SN  IP   G  
Sbjct: 64  VHRHGPCAPSTRSSDEPSLSERLRRSRARSKYIMSRASK---------SNVSIPTHLGGS 114

Query: 130 LQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKS--CYNQQDPVFDPSISPSYKK 185
           + +L Y+ T+ LG    +  +++DTGSDL+WVQC PC S  CY Q+DP+FDPS S +Y  
Sbjct: 115 VDSLEYVVTVGLGTPAVSQVLLIDTGSDLSWVQCAPCNSTTCYPQKDPLFDPSRSSTYAP 174

Query: 186 VLCNSSTCHALEFATGNSGVCSSSSP--PDCNYFVSYGDGSYTRGELGREHLGLGKA-SV 242
           + CN+  C  L    G    C+S S     C Y ++YGDGS T G    E L +    +V
Sbjct: 175 IPCNTDACRDLT-RDGYGSDCTSGSGGGAQCGYAITYGDGSQTTGVYSNETLTMAPGVTV 233

Query: 243 NDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLI 302
            DF FGCG +  G      GL+GLG +  SLV QTS ++GG FSYCLP+  D   +G L 
Sbjct: 234 KDFHFGCGHDQDGPNDKYDGLLGLGGAPESLVVQTSSVYGGAFSYCLPAAND--QAGFLA 291

Query: 303 LGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA-KGGILIDSGT 361
           LG   +   +++   +T M+   Q  TFY++N+TGI++GG+ +     A  GG++IDSGT
Sbjct: 292 LG---APVNDASGFVFTPMVREQQ--TFYVVNMTGITVGGEPIDVPPSAFSGGMIIDSGT 346

Query: 362 VITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEM 421
           V+T L  + Y+AL+A F K  + +P  P    LDTC+N + +  V +P V + F G A +
Sbjct: 347 VVTELQHTAYAALQAAFRKAMAAYPLLPNGE-LDTCYNFTGHSNVTVPRVALTFSGGATV 405

Query: 422 TVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
            +DV   +          CLA      +++ GI+GN  Q+   V+YD  + ++GF  + C
Sbjct: 406 DLDVPDGILLDN------CLAFQEAGPDNQPGILGNVNQRTLEVLYDVGHGRVGFGADAC 459


>gi|414584783|tpg|DAA35354.1| TPA: hypothetical protein ZEAMMB73_186928 [Zea mays]
          Length = 464

 Score =  256 bits (653), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 148/397 (37%), Positives = 216/397 (54%), Gaps = 30/397 (7%)

Query: 95  DNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELGG--RNMTVIVDT 152
           DN   +YL SR+           S +E  + SG+   +  Y   + +G       ++VD+
Sbjct: 88  DNARAEYLASRLSPAAY-QPTGFSGSESKVVSGLDEGSGEYFVRVGIGSPPTEQYLVVDS 146

Query: 153 GSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFA-TGNSGVCSSSSP 211
           GSD+ WVQC+PC  CY Q DP+FDP+ S ++  V C S+ C  L  +  G+SG       
Sbjct: 147 GSDVIWVQCKPCLECYAQADPLFDPATSATFSAVPCGSAVCRTLRTSGCGDSG------- 199

Query: 212 PDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDL 271
             C+Y VSYGDGSYT+G L  E L LG  +V     GCG  N+GLF G +GL+GLG   +
Sbjct: 200 -GCDYEVSYGDGSYTKGALALETLTLGGTAVEGVAIGCGHRNRGLFVGAAGLLGLGWGPM 258

Query: 272 SLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFY 331
           SLV Q     GG FSYCL S      +GSL+LG + +V + +    +  ++ NPQ  +FY
Sbjct: 259 SLVGQLGGAAGGAFSYCLASR----GAGSLVLGRSEAVPEGA---VWVPLVRNPQAPSFY 311

Query: 332 ILNLTGISIGGKQL-------QASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSG 384
            + L+GI +G ++L       Q +    GG+++D+GT +TRLP   Y+AL+  F+     
Sbjct: 312 YVGLSGIGVGDERLPLQEDLFQLTEDGAGGVVMDTGTAVTRLPQEAYAALRDAFVAAVGA 371

Query: 385 FPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALA 444
            P APG S+LDTC++LS Y  V +P V   F+G A +T+    ++  ++ D    CLA A
Sbjct: 372 LPRAPGVSLLDTCYDLSGYTSVRVPTVSFYFDGAATLTLPARNLL--LEVDGGIYCLAFA 429

Query: 445 SLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
             S      I+GN QQ+  ++  D+ N  +GF    C
Sbjct: 430 PSS--SGPSILGNIQQEGIQITVDSANGYIGFGPTTC 464


>gi|145693994|gb|ABP93697.1| unknown protein isoform 2 [Lemna minor]
          Length = 351

 Score =  255 bits (651), Expect = 4e-65,   Method: Compositional matrix adjust.
 Identities = 157/368 (42%), Positives = 212/368 (57%), Gaps = 26/368 (7%)

Query: 121 EIPLTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCK-SCYNQQDPVFDP 177
            IP   G+ + + NY+ T+  G   R  TV+ DTGSD+ W+QC+PC   CY QQ+P+FDP
Sbjct: 2   SIPARIGLFIGSGNYVITVGFGTPTRTQTVVFDTGSDVNWLQCKPCAVRCYAQQEPLFDP 61

Query: 178 SISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL 237
           S+S +Y+ V C    C  L     ++  CSSS+   C Y V YGDGS T G L  +   L
Sbjct: 62  SLSSTYRNVSCTEPACVGL-----STRGCSSST---CLYGVFYGDGSSTIGFLAMDTFML 113

Query: 238 GKA-SVNDFIFGCGRNNKGLFGGVSGLMGLGRSD-LSLVSQTSEIFGGLFSYCLPSTQDA 295
             A    +FIFGCG+NN GLF G +GL+GLGRS   SL SQ +   G +FSYCLPST  A
Sbjct: 114 TPAQKFKNFIFGCGQNNTGLFQGTAGLVGLGRSSTYSLNSQVAPSLGNVFSYCLPSTSSA 173

Query: 296 GASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASG--FAKG 353
             +G L +G        +TP  YT M+ + ++ T Y ++L GIS+GG +L  S   F   
Sbjct: 174 --TGYLNIGN-----PQNTP-GYTAMLTDTRVPTLYFIDLIGISVGGTRLSLSSTVFQSV 225

Query: 354 GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKM 413
           G +IDSGTVITRLPP+ YSALK       + +  AP  +ILDTC++ S    V  P++ +
Sbjct: 226 GTIIDSGTVITRLPPTAYSALKTAVRAAMTQYTLAPAVTILDTCYDFSRTTSVVYPVIVL 285

Query: 414 EFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQ 473
            F G  ++ +  TG+ +   S  SQVCLA A  +     GIIGN QQ    V YD +  +
Sbjct: 286 HFAG-LDVRIPATGVFFVFNS--SQVCLAFAGNTDSTMIGIIGNVQQLTMEVTYDNELKR 342

Query: 474 LGFAGEDC 481
           +GF+   C
Sbjct: 343 IGFSAGAC 350


>gi|242077672|ref|XP_002448772.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
 gi|241939955|gb|EES13100.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
          Length = 471

 Score =  255 bits (651), Expect = 4e-65,   Method: Compositional matrix adjust.
 Identities = 149/402 (37%), Positives = 218/402 (54%), Gaps = 32/402 (7%)

Query: 95  DNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELGG--RNMTVIVDT 152
           DN   +YL SR+         D   +E  + SG+   +  Y   + +G       ++VD+
Sbjct: 87  DNARAEYLASRLSPAY--QPTDFFGSESKVVSGLDEGSGEYFVRVGIGSPPTEQYLVVDS 144

Query: 153 GSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFA-TGNSGVCSSSSP 211
           GSD+ WVQC+PC  CY Q DP+FDP+ S ++  V C S+ C  L  +  G+SG       
Sbjct: 145 GSDVIWVQCKPCLECYAQADPLFDPASSATFSAVSCGSAICRTLRTSGCGDSG------- 197

Query: 212 PDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDL 271
             C Y VSYGDGSYT+G L  E L LG  +V     GCG  N+GLF G +GL+GLG   +
Sbjct: 198 -GCEYEVSYGDGSYTKGTLALETLTLGGTAVEGVAIGCGHRNRGLFVGAAGLLGLGWGPM 256

Query: 272 SLVSQTSEIFGGLFSYCLPSTQDAG-----ASGSLILGGNSSVFKNSTPITYTNMIPNPQ 326
           SLV Q     GG FSYCL S   +G     A+GSL+LG + +V + +    +  ++ NPQ
Sbjct: 257 SLVGQLGGAAGGAFSYCLASRGGSGSGAADAAGSLVLGRSEAVPEGA---VWVPLVRNPQ 313

Query: 327 LATFYILNLTGISIGGKQL-------QASGFAKGGILIDSGTVITRLPPSIYSALKAEFL 379
             +FY + ++GI +G ++L       Q +    GG+++D+GT +TRLP   Y+AL+  F+
Sbjct: 314 APSFYYVGVSGIGVGDERLPLQDGLFQLTEDGGGGVVMDTGTAVTRLPQEAYAALRDAFV 373

Query: 380 KQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV 439
                 P APG S+LDTC++LS Y  V +P V   F+G A +T+    ++  ++ D    
Sbjct: 374 GAVGALPRAPGVSLLDTCYDLSGYTSVRVPTVSFYFDGAATLTLPARNLL--LEVDGGIY 431

Query: 440 CLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
           CLA A  S      I+GN QQ+  ++  D+ N  +GF    C
Sbjct: 432 CLAFAPSS--SGLSILGNIQQEGIQITVDSANGYIGFGPATC 471


>gi|225217056|gb|ACN85339.1| aspartic proteinase nepenthesin-1 precursor [Oryza granulata]
          Length = 521

 Score =  254 bits (650), Expect = 6e-65,   Method: Compositional matrix adjust.
 Identities = 153/368 (41%), Positives = 209/368 (56%), Gaps = 29/368 (7%)

Query: 125 TSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKS-CYNQQDPVFDPSISP 181
           +SG  L T NY+ TI LG      TV+ DTGSD TWVQCQPC   CY QQ+ +FDP+ S 
Sbjct: 172 SSGRALGTGNYVVTIGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYKQQEKLFDPARSS 231

Query: 182 SYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGK-A 240
           +Y  V C +  C  L +  G SG         C Y V YGDGSY+ G    + L L    
Sbjct: 232 TYANVSCAAPACSDL-YTRGCSGG-------HCLYSVQYGDGSYSIGFFAMDTLTLSSYD 283

Query: 241 SVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGS 300
           +V  F FGCG  N+GLFG  +GL+GLGR   SL  QT + +GG+F++CLP+      +G 
Sbjct: 284 AVKGFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARSS--GTGY 341

Query: 301 LILGGNSSV---FKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ--ASGFAKGGI 355
           L  G  S      + +TP+   N        TFY + +TGI +GG+ L    S F+  G 
Sbjct: 342 LDFGPGSPAAVGARQTTPMLTDNG------PTFYYVGMTGIRVGGQLLSIPQSVFSTAGT 395

Query: 356 LIDSGTVITRLPPSIYSALKAEFLKQFS--GFPSAPGFSILDTCFNLSAYQEVNIPLVKM 413
           ++DSGTVITRLPP+ YS+L++ F    +  G+  AP  S+LDTC++ +   EV IP V +
Sbjct: 396 IVDSGTVITRLPPAAYSSLRSAFASAMAARGYKKAPALSLLDTCYDFTGMSEVAIPKVSL 455

Query: 414 EFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQ 473
            F+G A + V+ +GI+Y   +  SQVCL  A+   +D+ GI+GN Q K   V+YD     
Sbjct: 456 LFQGGAYLDVNASGIMY--AASLSQVCLGFAANEDDDDVGIVGNTQLKTFGVVYDIGKKT 513

Query: 474 LGFAGEDC 481
           +GF+   C
Sbjct: 514 VGFSPGAC 521


>gi|357137788|ref|XP_003570481.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 455

 Score =  254 bits (649), Expect = 7e-65,   Method: Compositional matrix adjust.
 Identities = 152/370 (41%), Positives = 206/370 (55%), Gaps = 25/370 (6%)

Query: 121 EIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCK-SCYNQQDPVFDP 177
            +PLT G  +   NY+  + LG   +   ++VDTGS LTW+QC PC+ SC+ Q  PVFDP
Sbjct: 103 SVPLTPGTSVGVGNYVTRMGLGTPAKPYIMVVDTGSSLTWLQCSPCRVSCHRQSGPVFDP 162

Query: 178 SISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL 237
             S SY  V C+S  C  L  AT N  VCS S+   C Y  SYGD S++ G L ++ +  
Sbjct: 163 KTSSSYAAVSCSSPQCDGLSTATLNPAVCSPSN--VCIYQASYGDSSFSVGYLSKDTVSF 220

Query: 238 GKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAG- 296
           G  SV +F +GCG++N+GLFG  +GLMGL R+ LSL+ Q +   G  FSYCLPST  +G 
Sbjct: 221 GANSVPNFYYGCGQDNEGLFGRSAGLMGLARNKLSLLYQLAPTLGYSFSYCLPSTSSSGY 280

Query: 297 -ASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASG--FAKG 353
            + GS   GG S          YT M+ N    + Y ++L+G+++ GK L  S   +   
Sbjct: 281 LSIGSYNPGGYS----------YTPMVSNTLDDSLYFISLSGMTVAGKPLAVSSSEYTSL 330

Query: 354 GILIDSGTVITRLPPSIYSALKAEFLKQFSG-FPSAPGFSILDTCFNLSAYQEVNIPLVK 412
             +IDSGTVITRLP S+Y+AL         G    A  +SILDTCF   A +   +P V 
Sbjct: 331 PTIIDSGTVITRLPTSVYTALSKAVAAAMKGSTKRAAAYSILDTCFEGQASKLRAVPAVS 390

Query: 413 MEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNS 472
           M F G A + +    ++  V  D +  CLA A         IIGN QQ+   V+YD K++
Sbjct: 391 MAFSGGATLKLSAGNLL--VDVDGATTCLAFAP---ARSAAIIGNTQQQTFSVVYDVKSN 445

Query: 473 QLGFAGEDCS 482
           ++GFA   CS
Sbjct: 446 RIGFAAAGCS 455


>gi|225217008|gb|ACN85295.1| aspartic proteinase nepenthesin-1 precursor [Oryza coarctata]
          Length = 500

 Score =  253 bits (646), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 160/429 (37%), Positives = 229/429 (53%), Gaps = 33/429 (7%)

Query: 69  LELKHKNYCSGKIVDWNE----QQQNRLILDNLHVQYLQSRIKNM--ISGNIKDVSNTEI 122
           + + H++     + D ++      +  L  D    + +Q R+     +S      +   +
Sbjct: 89  MPIVHRHGPCSPLADAHDGKLPSHEEILAADQNRAKSIQRRVSTTTTVSRGKPKRNRPSL 148

Query: 123 PLTSGIRLQTLNYIATIELG---GRNMTVIVDTGSDLTWVQCQPCKS-CYNQQDPVFDPS 178
           P +SG  L T NY+ TI LG   GR  TV+ DTGSD TWVQC+PC   CY QQ+ +FDP+
Sbjct: 149 PASSGSALGTGNYVVTIGLGTPAGR-YTVVFDTGSDTTWVQCEPCVVVCYKQQEKLFDPA 207

Query: 179 ISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG 238
            S +Y  + C +  C  L +  G SG         C Y V YGDGSY+ G    + L L 
Sbjct: 208 RSSTYANISCAAPACSDL-YIKGCSG-------GHCLYGVQYGDGSYSIGFFAMDTLTLS 259

Query: 239 K-ASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGA 297
              ++  F FGCG  N+GL+G  +GL+GLGR   SL  Q  + +GG+F++C P    A +
Sbjct: 260 SYDAIKGFRFGCGERNEGLYGEAAGLLGLGRGKTSLPVQAYDKYGGVFAHCFP----ARS 315

Query: 298 SGSLILG-GNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ--ASGFAKGG 354
           SG+  L  G  S+   S  +T   ++ N    TFY + LTGI +GGK L    S F   G
Sbjct: 316 SGTGYLDFGPGSLPAVSAKLTTPMLVDNGP--TFYYVGLTGIRVGGKLLSIPQSVFTTSG 373

Query: 355 ILIDSGTVITRLPPSIYSALKAEFLKQFS--GFPSAPGFSILDTCFNLSAYQEVNIPLVK 412
            ++DSGTVITRLPP+ YS+L++ F    +  G+  AP  S+LDTC++ +   EV IP V 
Sbjct: 374 TIVDSGTVITRLPPAAYSSLRSAFASAMAERGYKKAPALSLLDTCYDFTGMSEVAIPTVS 433

Query: 413 MEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNS 472
           + F+G A + V  +GI+Y   +  SQ CL  A    +D+ GI+GN Q K   V+YD    
Sbjct: 434 LLFQGGASLDVHASGIIY--AASVSQACLGFAGNKEDDDVGIVGNTQLKTFGVVYDIGKK 491

Query: 473 QLGFAGEDC 481
            +GF    C
Sbjct: 492 VVGFCPGAC 500


>gi|242095586|ref|XP_002438283.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
 gi|241916506|gb|EER89650.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
          Length = 470

 Score =  253 bits (646), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 161/440 (36%), Positives = 230/440 (52%), Gaps = 42/440 (9%)

Query: 69  LELKH-KNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNM------------------ 109
           L L H ++ CS   +  +      L  D+    +L SR+                     
Sbjct: 47  LTLHHPQSPCSPAPLPSDLPFSTVLTHDDARAAHLASRLATTSNAPSRRPTTSLRKPKAA 106

Query: 110 --ISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCK 165
              SG   D S   +PLT G  +   NY+  + LG    +  ++VDTGS LTW+QC PC 
Sbjct: 107 AGASGGPLDDSLASVPLTPGTSVGVGNYVTELGLGTPATSYAMVVDTGSSLTWLQCSPCV 166

Query: 166 -SCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGS 224
            SC+ Q  P++DP  S +Y  V C++S C  L+ AT N   CS  +   C Y  SYGD S
Sbjct: 167 VSCHRQVGPLYDPRASSTYATVPCSASQCDELQAATLNPSACSVRN--VCIYQASYGDSS 224

Query: 225 YTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGL 284
           ++ G L R+ +  G  S  +F +GCG++N+GLFG  +GL+GL R+ LSL+ Q +   G  
Sbjct: 225 FSVGYLSRDTVSFGSGSYPNFYYGCGQDNEGLFGRSAGLIGLARNKLSLLYQLAPSLGYS 284

Query: 285 FSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQ 344
           FSYCLP+     ++G L +G  +S   + TP+  +++      A+ Y + L+G+S+GG  
Sbjct: 285 FSYCLPT---PASTGYLSIGPYTSGHYSYTPMASSSLD-----ASLYFVTLSGMSVGGSP 336

Query: 345 LQASG--FAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSA 402
           L  S   ++    +IDSGTVITRLP ++Y+AL         G  SAP FSILDTCF   A
Sbjct: 337 LAVSPAEYSSLPTIIDSGTVITRLPTAVYTALSKAVAAAMVGVQSAPAFSILDTCFQGQA 396

Query: 403 YQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKN 462
            Q + +P V M F G A + +    ++  +  D S  CLA A     D T IIGN QQ+ 
Sbjct: 397 SQ-LRVPAVAMAFAGGATLKLATQNVL--IDVDDSTTCLAFAP---TDSTTIIGNTQQQT 450

Query: 463 QRVIYDTKNSQLGFAGEDCS 482
             V+YD   S++GFA   CS
Sbjct: 451 FSVVYDVAQSRIGFAAGGCS 470


>gi|225216930|gb|ACN85225.1| aspartic proteinase nepenthesin-1 precursor [Oryza punctata]
 gi|225216938|gb|ACN85232.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
          Length = 516

 Score =  253 bits (645), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 152/364 (41%), Positives = 205/364 (56%), Gaps = 27/364 (7%)

Query: 127 GIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKS-CYNQQDPVFDPSISPSY 183
           G  L T NY+ T+ LG      TV+ DTGSD TWVQCQPC   CY Q++ +FDP+ S +Y
Sbjct: 171 GRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQREKLFDPARSSTY 230

Query: 184 KKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGK-ASV 242
             V C +  C  L+    + G         C Y V YGDGSY+ G    + L L    +V
Sbjct: 231 ANVSCAAPACSDLDTRGCSGG--------HCLYGVQYGDGSYSIGFFAMDTLTLSSYDAV 282

Query: 243 NDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLI 302
             F FGCG  N+GLFG  +GL+GLGR   SL  QT + +GG+F++CLP+      +G L 
Sbjct: 283 KGFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARST--GTGYLD 340

Query: 303 LGGNSSVFK-NSTPITYTNMIPNPQLATFYILNLTGISIGGKQL--QASGFAKGGILIDS 359
            G  S   +  +TP+   N    P   TFY + LTGI +GG+ L    S FA  G ++DS
Sbjct: 341 FGAGSPAARLTTTPMLVDN---GP---TFYYVGLTGIRVGGRLLYIPQSVFATAGTIVDS 394

Query: 360 GTVITRLPPSIYSALKAEFLKQFS--GFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEG 417
           GTVITRLPP+ YS+L++ F    S  G+  AP  S+LDTC++ +   +V IP V + F+G
Sbjct: 395 GTVITRLPPAAYSSLRSAFAAAMSARGYKKAPAVSLLDTCYDFAGMSQVAIPTVSLLFQG 454

Query: 418 NAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFA 477
            A + VD +GI+Y   + ASQVCLA A+     + GI+GN Q K   V YD     + F+
Sbjct: 455 GARLDVDASGIMY--AASASQVCLAFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVVSFS 512

Query: 478 GEDC 481
              C
Sbjct: 513 PGAC 516


>gi|357123876|ref|XP_003563633.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 503

 Score =  252 bits (644), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 152/363 (41%), Positives = 208/363 (57%), Gaps = 22/363 (6%)

Query: 127 GIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCK-SCYNQQDPVFDPSISPSY 183
           G+ L T NY+  I LG      TV+ DTGSD TWVQC+PC  SCY Q+D +FDP+ S +Y
Sbjct: 155 GLSLGTANYVVPIGLGTPPSRFTVVFDTGSDTTWVQCRPCVVSCYKQKDRLFDPAKSSTY 214

Query: 184 KKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVN 243
             V C    C  L+ +  N+G         C Y + YGDGSYT G   ++ L + + ++ 
Sbjct: 215 ANVSCADPACADLDASGCNAG--------HCLYGIQYGDGSYTVGFFAKDTLAVAQDAIK 266

Query: 244 DFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLIL 303
            F FGCG  N+GLFG  +GL+GLGR   S+  Q  E +GG FSYCLP++  + A+G L  
Sbjct: 267 GFKFGCGEKNRGLFGQTAGLLGLGRGPTSITVQAYEKYGGSFSYCLPAS--SAATGYLEF 324

Query: 304 GGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQA---SGFAKGGILIDSG 360
           G  S     S   T T M+ + +  TFY + LTGI +GGKQL A   S F+  G L+DSG
Sbjct: 325 GPLSPSSSGSNAKT-TPMLTD-KGPTFYYVGLTGIRVGGKQLGAIPESVFSNSGTLVDSG 382

Query: 361 TVITRLPPS--IYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGN 418
           TVITRLP +     +         SG+  A  +SILDTC++ +   +V++P V + F+G 
Sbjct: 383 TVITRLPDTAYAALSSAFAAAMAASGYKKAAAYSILDTCYDFTGLSQVSLPTVSLVFQGG 442

Query: 419 AEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAG 478
           A + +D +GIVY +    SQVCL  AS   ++  GI+GN QQ+   V+YD     +GFA 
Sbjct: 443 ACLDLDASGIVYAISQ--SQVCLGFASNGDDESVGIVGNTQQRTYGVLYDVSKKVVGFAP 500

Query: 479 EDC 481
             C
Sbjct: 501 GAC 503


>gi|226504334|ref|NP_001141706.1| uncharacterized protein LOC100273835 precursor [Zea mays]
 gi|194705620|gb|ACF86894.1| unknown [Zea mays]
 gi|414885968|tpg|DAA61982.1| TPA: hypothetical protein ZEAMMB73_231717 [Zea mays]
          Length = 477

 Score =  252 bits (644), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 143/362 (39%), Positives = 205/362 (56%), Gaps = 16/362 (4%)

Query: 127 GIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYK 184
           G  L T NY  ++ LG    ++ V +DTGSD +W+QC+PC  CY Q + +FDPS S +Y 
Sbjct: 126 GKYLDTTNYFTSLRLGTPATDLLVELDTGSDQSWIQCKPCPDCYEQHEALFDPSKSSTYS 185

Query: 185 KVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA-SVN 243
            + C+S  C  L    G+S   + SS   C Y ++Y D SYT G L R+ L L    +V 
Sbjct: 186 DITCSSRECQEL----GSSHKHNCSSDKKCPYEITYADDSYTVGNLARDTLTLSPTDAVP 241

Query: 244 DFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLIL 303
            F+FGCG NN G FG + GL+GLGR   SL SQ +  +G  FSYCLPS+    A+G L  
Sbjct: 242 GFVFGCGHNNAGSFGEIDGLLGLGRGKASLSSQVAARYGAGFSYCLPSSPS--ATGYLSF 299

Query: 304 GGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQA--SGFA-KGGILIDSG 360
            G ++     T   +T M+   Q  +FY LNLTGI++ G+ ++   S FA   G +IDSG
Sbjct: 300 SGAAA--AAPTNAQFTEMVAG-QHPSFYYLNLTGITVAGRAIKVPPSVFATAAGTIIDSG 356

Query: 361 TVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAE 420
           T  + LPPS Y+AL++        +  AP  +I DTC++L+ ++ V IP V + F   A 
Sbjct: 357 TAFSCLPPSAYAALRSSVRSAMGRYKRAPSSTIFDTCYDLTGHETVRIPSVALVFADGAT 416

Query: 421 MTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGED 480
           + +  +G++Y   S+ SQ CLA      +   G++GN QQ+   VIYD  N ++GF    
Sbjct: 417 VHLHPSGVLY-TWSNVSQTCLAFLPNPDDTSLGVLGNTQQRTLAVIYDVDNQKVGFGANG 475

Query: 481 CS 482
           C+
Sbjct: 476 CA 477


>gi|297603570|ref|NP_001054261.2| Os04g0677100 [Oryza sativa Japonica Group]
 gi|255675885|dbj|BAF16175.2| Os04g0677100 [Oryza sativa Japonica Group]
          Length = 464

 Score =  252 bits (644), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 150/425 (35%), Positives = 227/425 (53%), Gaps = 35/425 (8%)

Query: 68  TLELKHKNYCSGKIVDWNEQQQNRLIL-DNLHVQYLQSRIKNMISGNI-KDVSNTEIPLT 125
           +L L H++  SG        Q   L+  DN  V++L+ R+    S  + +D+ +  +P  
Sbjct: 64  SLSLVHRDAISGATYPSRRHQVVGLVARDNARVEHLEKRLVASTSPYLPEDLVSEVVP-- 121

Query: 126 SGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSY 183
            G+   +  Y   + +G    +  ++VD+GSD+ WVQC+PC+ CY Q DP+FDP+ S S+
Sbjct: 122 -GVDDGSGEYFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQCYAQTDPLFDPAASSSF 180

Query: 184 KKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVN 243
             V C S+ C  L       G  +      C+Y V+YGDGSYT+GEL  E L LG  +V 
Sbjct: 181 SGVSCGSAICRTLSGTGCGGGGDAGK----CDYSVTYGDGSYTKGELALETLTLGGTAVQ 236

Query: 244 DFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLIL 303
               GCG  N GLF G +GL+GLG   +SLV Q     GG+FSYCL +++ AG +GSL+L
Sbjct: 237 GVAIGCGHRNSGLFVGAAGLLGLGWGAMSLVGQLGGAAGGVFSYCL-ASRGAGGAGSLVL 295

Query: 304 GGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL-------QASGFAKGGIL 356
           G   +V +              + ++FY + LTGI +GG++L       Q +    GG++
Sbjct: 296 GRTEAVPRGR------------RASSFYYVGLTGIGVGGERLPLQDSLFQLTEDGAGGVV 343

Query: 357 IDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFE 416
           +D+GT +TRLP   Y+AL+  F       P +P  S+LDTC++LS Y  V +P V   F+
Sbjct: 344 MDTGTAVTRLPREAYAALRGAFDGAMGALPRSPAVSLLDTCYDLSGYASVRVPTVSFYFD 403

Query: 417 GNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGF 476
             A +T+    ++  V+   +  CLA A  S      I+GN QQ+  ++  D+ N  +GF
Sbjct: 404 QGAVLTLPARNLL--VEVGGAVFCLAFAPSS--SGISILGNIQQEGIQITVDSANGYVGF 459

Query: 477 AGEDC 481
               C
Sbjct: 460 GPNTC 464


>gi|226494448|ref|NP_001141341.1| uncharacterized protein LOC100273432 precursor [Zea mays]
 gi|194704078|gb|ACF86123.1| unknown [Zea mays]
 gi|413953775|gb|AFW86424.1| aspartic proteinase nepenthesin-2 [Zea mays]
          Length = 471

 Score =  252 bits (643), Expect = 4e-64,   Method: Compositional matrix adjust.
 Identities = 162/442 (36%), Positives = 236/442 (53%), Gaps = 44/442 (9%)

Query: 69  LELKH-KNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRI----------------KNMIS 111
           L L H ++ CS   +  +      L  D+  V +L SR+                K    
Sbjct: 46  LTLHHPQSPCSPAPLPSDLPFSTVLTHDDARVAHLASRLAASDPPSRRPTSLRKQKKAAG 105

Query: 112 G-----NIKDVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPC 164
           G     ++ D S   +PL+ G  +   NY+  + LG    +  ++VDTGS LTW+QC PC
Sbjct: 106 GASGGHHLDDDSLASVPLSPGTSVGVGNYVTQLGLGTPSTSYAMVVDTGSSLTWLQCSPC 165

Query: 165 K-SCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDG 223
             SC+ Q  P+FDP  S +Y  V C++S C  L+ AT N   CS+S+   C Y  SYGD 
Sbjct: 166 VVSCHRQVGPLFDPRASSTYTSVRCSASQCDELQAATLNPSACSASN--VCIYQASYGDS 223

Query: 224 SYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGG 283
           S++ G L  + +  G  S   F +GCG++N+GLFG  +GL+GL R+ LSL+ Q +   G 
Sbjct: 224 SFSVGYLSTDTVSFGSTSYPSFYYGCGQDNEGLFGRSAGLIGLARNKLSLLYQLAPSLGY 283

Query: 284 LFSYCLPSTQDAGASGSLILGG-NSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGG 342
            FSYCLP+   A ++G L +G  N+  + + TP+  +++      A+ Y + L+G+S+GG
Sbjct: 284 SFSYCLPT---AASTGYLSIGPYNTGHYYSYTPMASSSLD-----ASLYFITLSGMSVGG 335

Query: 343 KQLQASG--FAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNL 400
             L  S   ++    +IDSGTVITRLP ++++AL     +  +G   AP FSILDTCF  
Sbjct: 336 SPLAVSPSEYSSLPTIIDSGTVITRLPTAVHTALSKAVAQAMAGAQRAPAFSILDTCFEG 395

Query: 401 SAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQ 460
            A Q + +P V M F G A M +    ++  +  D S  CLA A     D T IIGN QQ
Sbjct: 396 QASQ-LRVPTVVMAFAGGASMKLTTRNVL--IDVDDSTTCLAFAP---TDSTAIIGNTQQ 449

Query: 461 KNQRVIYDTKNSQLGFAGEDCS 482
           +   VIYD   S++GF+   CS
Sbjct: 450 QTFSVIYDVAQSRIGFSAGGCS 471


>gi|293333012|ref|NP_001168410.1| uncharacterized protein LOC100382179 precursor [Zea mays]
 gi|223948083|gb|ACN28125.1| unknown [Zea mays]
 gi|413953783|gb|AFW86432.1| hypothetical protein ZEAMMB73_737575 [Zea mays]
          Length = 466

 Score =  251 bits (642), Expect = 4e-64,   Method: Compositional matrix adjust.
 Identities = 165/444 (37%), Positives = 235/444 (52%), Gaps = 35/444 (7%)

Query: 52  SSCVSHQKSRIEMGAITLELKHKN-YCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMI 110
           S   S QK        TL L H++  CS  +       +  L  D L    + +++ +  
Sbjct: 44  SEVCSGQKVTSSKNGATLPLVHRHGPCSPVMSKEKPSHEETLGRDQLRAANIHAKLSSPR 103

Query: 111 SGNIKDV--SNTEIPLTSGIRLQTLNYIATIELGGRNMTVI--VDTGSDLTWVQCQPC-- 164
           + + K++  S   IP +SG  L T  Y+ T+ LG   +T +  +DTGSD++WVQC PC  
Sbjct: 104 NSSAKELQQSGVTIPTSSGYSLGTPEYVITVSLGTPAVTQVMSIDTGSDVSWVQCAPCAA 163

Query: 165 KSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGS 224
           +SC +Q+D +FDP+ S +Y    C+S+ C  L    G    C +S    C Y V Y D S
Sbjct: 164 QSCSSQKDKLFDPAKSATYSAFSCSSAQCAQLG---GEGNGCLNS---HCQYIVKYVDHS 217

Query: 225 YTRGELGREHLGLGKA-SVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGG 283
            T G  G + LGL  + +V +F FGC     G  G + GLMGLG    SLVSQT+  +G 
Sbjct: 218 NTTGTYGSDTLGLTTSDAVKNFQFGCSHRANGFVGQLDGLMGLGGDTESLVSQTAATYGK 277

Query: 284 LFSYCLPSTQDAGASGSLILG----GNSSVFKNSTPITYTNMIPNPQLATFYILNLTGIS 339
            FSYCLP +  + A G L LG    G SS   + TP+   N      + TFY + L  I+
Sbjct: 278 AFSYCLPPSSSS-AGGFLTLGAAAGGTSSSRYSRTPLVRFN------VPTFYGVFLQAIT 330

Query: 340 IGGKQLQ--ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTC 397
           + G +L   AS F+ G  ++DSGTVIT+LPP+ Y AL+  F K+   +PSA    ILDTC
Sbjct: 331 VAGTKLNVPASVFS-GASVVDSGTVITQLPPTAYQALRTAFKKEMKAYPSAAPVGILDTC 389

Query: 398 FNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGN 457
           F+ S  + V +P+V + F   A M +DV+GI Y         CLA  + + + +TGI+GN
Sbjct: 390 FDFSGIKTVRVPVVTLTFSRGAVMDLDVSGIFY-------AGCLAFTATAQDGDTGILGN 442

Query: 458 YQQKNQRVIYDTKNSQLGFAGEDC 481
            QQ+   +++D   S LGF    C
Sbjct: 443 VQQRTFEMLFDVGGSTLGFRPGAC 466


>gi|195625122|gb|ACG34391.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
          Length = 471

 Score =  251 bits (642), Expect = 4e-64,   Method: Compositional matrix adjust.
 Identities = 161/442 (36%), Positives = 235/442 (53%), Gaps = 44/442 (9%)

Query: 69  LELKH-KNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRI----------------KNMIS 111
           L L H ++ CS   +  +      L  D+  V +L SR+                K    
Sbjct: 46  LTLHHPQSPCSPAPLPSDLPFSTVLTHDDARVAHLASRLAASDPPSRRPTSLRKQKKAAG 105

Query: 112 G-----NIKDVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPC 164
           G     ++ D S   +PL+ G  +   NY+  + LG    +  ++VDTGS LTW+QC PC
Sbjct: 106 GASGGHHLDDDSLASVPLSPGTSVGVGNYVTQLGLGTPSTSYAMVVDTGSSLTWLQCSPC 165

Query: 165 K-SCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDG 223
             SC+ Q  P+FDP  S +Y  V C++S C  L+ AT N   CS+S+   C Y  SYGD 
Sbjct: 166 VVSCHRQVGPLFDPRASSTYASVRCSASQCDELQAATLNPSACSASN--VCIYQASYGDS 223

Query: 224 SYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGG 283
           S++ G L  + +  G      F +GCG++N+GLFG  +GL+GL R+ LSL+ Q +   G 
Sbjct: 224 SFSVGSLSTDTVSFGSTRYPSFYYGCGQDNEGLFGRSAGLIGLARNKLSLLYQLAPSLGY 283

Query: 284 LFSYCLPSTQDAGASGSLILGG-NSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGG 342
            FSYCLP+   A ++G L +G  N+  + + TP+  +++      A+ Y + L+G+S+GG
Sbjct: 284 SFSYCLPT---AASTGYLSIGPYNTGHYYSYTPMASSSLD-----ASLYFITLSGMSVGG 335

Query: 343 KQLQASG--FAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNL 400
             L  S   ++    +IDSGTVITRLP ++++AL     +  +G   AP FSILDTCF  
Sbjct: 336 SPLAVSPSEYSSLPTIIDSGTVITRLPTAVHTALSKAVAQAMAGAQRAPAFSILDTCFEG 395

Query: 401 SAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQ 460
            A Q + +P V M F G A M +    ++  +  D S  CLA A     D T IIGN QQ
Sbjct: 396 QASQ-LRVPTVAMAFAGGASMKLTTRNVL--IDVDDSTTCLAFAP---TDSTAIIGNTQQ 449

Query: 461 KNQRVIYDTKNSQLGFAGEDCS 482
           +   VIYD   S++GF+   CS
Sbjct: 450 QTFSVIYDVAQSRIGFSAGGCS 471


>gi|242058093|ref|XP_002458192.1| hypothetical protein SORBIDRAFT_03g028480 [Sorghum bicolor]
 gi|241930167|gb|EES03312.1| hypothetical protein SORBIDRAFT_03g028480 [Sorghum bicolor]
          Length = 468

 Score =  251 bits (641), Expect = 5e-64,   Method: Compositional matrix adjust.
 Identities = 149/399 (37%), Positives = 217/399 (54%), Gaps = 19/399 (4%)

Query: 90  NRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELGGRNMT-- 147
           +RL  +    +Y+ SR+   + G+  DVS   IP   G  + +L Y+ T+ LG  +++  
Sbjct: 82  DRLRRNRARSKYIMSRVSKGMMGDDADVS---IPTHLGGSVDSLEYVVTVGLGTPSVSQV 138

Query: 148 VIVDTGSDLTWVQCQPCKS--CYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGV 205
           +++DTGSDL+WVQCQPC S  CY Q+DP+FDPS S +Y  + CN+  C  L       G 
Sbjct: 139 LLIDTGSDLSWVQCQPCNSTTCYPQKDPLFDPSKSSTYAPIPCNTDACRDLTDDGYGGGC 198

Query: 206 CSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCGRNNKGLFGGVSGLM 264
            S      C + ++YGDGS TRG    E L L    +V DF FGCG +  G      GL+
Sbjct: 199 ASGDGAAQCGFAITYGDGSQTRGVYSNETLALAPGVAVKDFRFGCGHDQDGANDKYDGLL 258

Query: 265 GLGRSDLSLVSQTSEIFGGLFSYCLPSTQD-AGASGSLILGGNSSVFKNSTPITYTNMIP 323
           GLG +  SLV QT+ ++GG FSYCLP+  +  G       G  S    N++   +T MI 
Sbjct: 259 GLGGAPESLVVQTASVYGGAFSYCLPALNNQVGFLALGGGGAPSGGVVNTSGFVFTPMIR 318

Query: 324 NPQLATFYILNLTGISIGGKQLQASGFA-KGGILIDSGTVITRLPPSIYSALKAEFLKQF 382
             +  TFY++N+TGI++GG+ +     A  GG++IDSGTV+T L  + Y+AL+A F K  
Sbjct: 319 EEE--TFYVVNMTGITVGGEPIDVPPSAFSGGMIIDSGTVVTELQHTAYNALQAAFRKAM 376

Query: 383 SGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLA 442
           + +P       LDTC++ S Y  V +P V + F G A + +DV   +          CLA
Sbjct: 377 AAYPLVRNGE-LDTCYDFSGYSNVTLPKVALTFSGGATIDLDVPNGILL------DDCLA 429

Query: 443 LASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
                 +D+ GI+GN  Q+   V+YD    ++GF    C
Sbjct: 430 FQESGPDDQPGILGNVNQRTLEVLYDAGRGRVGFRAAVC 468


>gi|223950123|gb|ACN29145.1| unknown [Zea mays]
 gi|413923785|gb|AFW63717.1| hypothetical protein ZEAMMB73_445506 [Zea mays]
          Length = 385

 Score =  251 bits (641), Expect = 6e-64,   Method: Compositional matrix adjust.
 Identities = 162/399 (40%), Positives = 215/399 (53%), Gaps = 24/399 (6%)

Query: 89  QNRLILDNLHVQYLQSRIKNMISGNIKDV--SNTEIPLTSGIRLQTLNYIATIELG--GR 144
           +  L  D L   Y+Q +          DV  S+  +P   G  L TL Y+ T+ LG    
Sbjct: 5   EETLHRDQLRAAYIQRKFSGGGG-AGGDVQRSDATVPTALGTSLNTLEYLITVGLGSPAT 63

Query: 145 NMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSG 204
           + T+++DTGSD++WVQC+PC  C++Q DP+FDPS S +Y    C S+ C  L    GN  
Sbjct: 64  SQTMLIDTGSDVSWVQCKPCSQCHSQADPLFDPSSSSTYSPFSCGSADCAQLG-QEGNG- 121

Query: 205 VCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLM 264
            CSSSS   C Y V+YGDGS T G    + L LG ++V  F FGC     G      GLM
Sbjct: 122 -CSSSS--QCQYIVTYGDGSSTTGTYSSDTLALGSSAVRSFQFGCSNVESGFNDQTDGLM 178

Query: 265 GLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPN 324
           GLG    SLVSQT+   G  FSYCLP T    +SG L L   ++    ++    T M+ +
Sbjct: 179 GLGGGAQSLVSQTAGTLGRAFSYCLPPTPS--SSGFLTL--GAAGGSGTSGFVKTPMLRS 234

Query: 325 PQLATFYILNLTGISIGGKQLQ--ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQF 382
            Q+ TFY + L  I +GG+QL   AS F+ G ++ DSGTVITRLPP+ YSAL + F    
Sbjct: 235 SQVPTFYGVRLQAIRVGGRQLSIPASVFSAGTVM-DSGTVITRLPPTAYSALSSAFKAGM 293

Query: 383 SGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLA 442
             +P A    ILDTCF+ S    V+IP V + F G A +++D +GI+          CLA
Sbjct: 294 KQYPPAQPSGILDTCFDFSGQSSVSIPSVALVFSGGAVVSLDASGIIL-------SNCLA 346

Query: 443 LASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
            A  S +   GIIGN QQ+   V+YD     +GF    C
Sbjct: 347 FAGNSDDSSLGIIGNVQQRTFEVLYDVGRGVVGFRAGAC 385


>gi|242092898|ref|XP_002436939.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
 gi|241915162|gb|EER88306.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
          Length = 469

 Score =  250 bits (639), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 163/454 (35%), Positives = 234/454 (51%), Gaps = 32/454 (7%)

Query: 45  QQKSGSSSSCVSHQKSRIE--MGAITLELKHK-------NYCSGKIVDWNEQ-QQNRLIL 94
           Q++S  S +  S  K  +E     +++ L H+        Y +      +E  +++R   
Sbjct: 31  QRRSYDSETVCSASKVNLEPSSATVSMSLVHRYGPCAPSQYSNVPTPSISETLRRSRART 90

Query: 95  DNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELGGRNM--TVIVDT 152
           + +  Q  +S    M S    D +   IP   G  + +L Y+ T+  G  ++   +++DT
Sbjct: 91  NYIMSQASKSMGMGMASTPDDDDAAVTIPTRLGGFVDSLEYVVTLGFGTPSVPQVLLMDT 150

Query: 153 GSDLTWVQCQPCKS--CYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSS 210
           GSD++WVQC PC S  CY Q+DP+FDPS S +Y  + CN+  C  L     N   C+S  
Sbjct: 151 GSDVSWVQCTPCNSTKCYPQKDPLFDPSKSSTYAPIACNTDACRKLGDHYHNG--CTSGG 208

Query: 211 PPDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCGRNNKGLFGGVSGLMGLGRS 269
              C Y V Y DGS++RG    E L L    +V DF FGCGR+ +G      GL+GLG +
Sbjct: 209 T-QCGYSVEYADGSHSRGVYSNETLTLAPGITVEDFHFGCGRDQRGPSDKYDGLLGLGGA 267

Query: 270 DLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLAT 329
            +SLV QTS ++GG FSYCLP+      +G L+LG   S   N +   +T M   P  AT
Sbjct: 268 PVSLVVQTSSVYGGAFSYCLPALNS--EAGFLVLGSPPS--GNKSAFVFTPMRHLPGYAT 323

Query: 330 FYILNLTGISIGGKQLQASGFA-KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSA 388
           FY++ +TGIS+GGK L     A +GG++IDSGTV T LP + Y+AL+A   K    +P  
Sbjct: 324 FYMVTMTGISVGGKPLHIPQSAFRGGMIIDSGTVDTELPETAYNALEAALRKALKAYPLV 383

Query: 389 PGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDV-TGIVYFVKSDASQVCLALASLS 447
           P     DTC+N + Y  + +P V   F G A + +DV  GI+          CLA     
Sbjct: 384 PS-DDFDTCYNFTGYSNITVPRVAFTFSGGATIDLDVPNGILV-------NDCLAFQESG 435

Query: 448 YEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
            +D  GIIGN  Q+   V+YD     +GF    C
Sbjct: 436 PDDGLGIIGNVNQRTLEVLYDAGRGNVGFRAGAC 469


>gi|225216988|gb|ACN85278.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
          Length = 518

 Score =  249 bits (635), Expect = 3e-63,   Method: Compositional matrix adjust.
 Identities = 151/369 (40%), Positives = 205/369 (55%), Gaps = 31/369 (8%)

Query: 125 TSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKS-CYNQQDPVFDPSISP 181
           +SG  L T NY+ T+ LG      TV+ DTGSD TWVQCQPC   CY QQ+ +FDP+ S 
Sbjct: 169 SSGRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQQEKLFDPARSS 228

Query: 182 SYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGK-A 240
           +Y  V C +  C  L+    + G         C Y V YGDGSY+ G    + L L    
Sbjct: 229 TYANVSCAAPACFDLDTRGCSGG--------HCLYGVQYGDGSYSIGFFAMDTLTLSSYD 280

Query: 241 SVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGS 300
           +V  F FGCG  N+GLFG  +GL+GLGR   SL  QT + +GG+F++CLP    A +SG+
Sbjct: 281 AVKGFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLP----ARSSGT 336

Query: 301 LILGGNSSVFKNSTPITYTNMIPNPQLA----TFYILNLTGISIGGKQLQ--ASGFAKGG 354
             L      F   +P      +  P L     TFY + +TGI +GG+ L    S FA  G
Sbjct: 337 GYLD-----FGPGSPAAAGARLTTPMLTDNGPTFYYVGMTGIRVGGQLLSIPQSVFATAG 391

Query: 355 ILIDSGTVITRLPPSIYSALKAEFLKQFS--GFPSAPGFSILDTCFNLSAYQEVNIPLVK 412
            ++DSGTVITRLPP  YS+L++ F+   +  G+  AP  S+LDTC++ +   +V IP V 
Sbjct: 392 TIVDSGTVITRLPPPAYSSLRSAFVSAMAARGYKKAPAVSLLDTCYDFTGMSQVAIPTVS 451

Query: 413 MEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNS 472
           + F+G A + VD +GI+Y   +  SQVCL  A+     + GI+GN Q K   V YD    
Sbjct: 452 LLFQGGAILDVDASGIMY--AASVSQVCLGFAANEDGGDVGIVGNTQLKTFGVAYDIGKK 509

Query: 473 QLGFAGEDC 481
            +GF+   C
Sbjct: 510 VVGFSPGAC 518


>gi|125540928|gb|EAY87323.1| hypothetical protein OsI_08727 [Oryza sativa Indica Group]
          Length = 463

 Score =  248 bits (633), Expect = 5e-63,   Method: Compositional matrix adjust.
 Identities = 163/447 (36%), Positives = 226/447 (50%), Gaps = 33/447 (7%)

Query: 50  SSSSCVSHQKSRIEMGAITLELKHKNYCSGKIVDWNEQ--QQNRLILDNLHVQYLQSRIK 107
           S + C         +   T+ L H++     +    ++  ++  L  D L  +++Q +  
Sbjct: 35  SEAVCSERNAISSSLSGTTVALNHRHGPCSPVPSSKKRPTEEELLKRDQLRAEHIQRKFA 94

Query: 108 -NMISGNIKDVSNTEI----PLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQ 160
            N       D+  +++    P   G  L TL Y+ ++ LG      TV +DTGSD++WVQ
Sbjct: 95  MNAAVDGAGDLQQSKVSSSVPTKLGSSLDTLEYVISVGLGTPAVTQTVTIDTGSDVSWVQ 154

Query: 161 CQPCKS--CYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFV 218
           C PC +  C+ Q   +FDP+ S +Y+ V C ++ C  LE      G  +     +C Y V
Sbjct: 155 CNPCPNPPCHAQTGALFDPAKSSTYRAVSCAAAECAQLEQQGNGCGATNY----ECQYGV 210

Query: 219 SYGDGSYTRGELGREHLGLGKAS--VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQ 276
            YGDGS T G   R+ L L  AS  V  F FGC     G      GLMGLG    SLVSQ
Sbjct: 211 QYGDGSTTNGTYSRDTLTLSGASDAVKGFQFGCSHLESGFSDQTDGLMGLGGGAQSLVSQ 270

Query: 277 TSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLT 336
           T+  +G  FSYCLP T     SGS            S  +T T M+ + Q+ TFY   L 
Sbjct: 271 TAAAYGNSFSYCLPPT-----SGSSGFLTLGGGGGASGFVT-TRMLRSKQIPTFYGARLQ 324

Query: 337 GISIGGKQL--QASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSIL 394
            I++GGKQL    S FA G + +DSGT+ITRLPP+ YSAL + F      + SAP  SIL
Sbjct: 325 DIAVGGKQLGLSPSVFAAGSV-VDSGTIITRLPPTAYSALSSAFKAGMKQYRSAPARSIL 383

Query: 395 DTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGI 454
           DTCF+ +   +++IP V + F G A + +D  GI+Y         CLA A+   +  TGI
Sbjct: 384 DTCFDFAGQTQISIPTVALVFSGGAAIDLDPNGIMY-------GNCLAFAATGDDGTTGI 436

Query: 455 IGNYQQKNQRVIYDTKNSQLGFAGEDC 481
           IGN QQ+   V+YD  +S LGF    C
Sbjct: 437 IGNVQQRTFEVLYDVGSSTLGFRSGAC 463


>gi|302817380|ref|XP_002990366.1| hypothetical protein SELMODRAFT_43971 [Selaginella moellendorffii]
 gi|300141928|gb|EFJ08635.1| hypothetical protein SELMODRAFT_43971 [Selaginella moellendorffii]
          Length = 420

 Score =  247 bits (631), Expect = 9e-63,   Method: Compositional matrix adjust.
 Identities = 150/368 (40%), Positives = 215/368 (58%), Gaps = 25/368 (6%)

Query: 123 PLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSIS 180
           PL SGI   + +Y A I +G   R++ ++ DTGSD++W+QC PC+ CY QQDP+F+PS+S
Sbjct: 69  PLISGIAGGSGDYFARIGVGTPARSVYMVADTGSDVSWLQCSPCRKCYRQQDPIFNPSLS 128

Query: 181 PSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA 240
            S+K + C SS C  L+        CS  +  +C Y VSYGDGS+T G+   E L  G+ 
Sbjct: 129 SSFKPLACASSICGKLKIKG-----CSRKN--ECMYQVSYGDGSFTVGDFSTETLSFGEH 181

Query: 241 SVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGS 300
           +V     GCGRNN+GLF G +GL+GLGR  LS  SQT   +  +FSYCLP  + A A+ S
Sbjct: 182 AVRSVAMGCGRNNQGLFHGAAGLLGLGRGPLSFPSQTGTSYASVFSYCLPRRESAIAA-S 240

Query: 301 LILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGK--QLQASGFA-----KG 353
           L+ G ++   K      +T ++PN +L T+Y + L  I + G    +    FA      G
Sbjct: 241 LVFGPSAVPEK----ARFTKLLPNRRLDTYYYVGLARIRVAGSPVNIPPDAFAMGSRGTG 296

Query: 354 GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKM 413
           G+++DSGT I+RL    Y+AL+  F +    FPSAPG S+ DTC++LS+ +   +P V +
Sbjct: 297 GVIVDSGTAISRLTTPAYTALRDAF-RSLVTFPSAPGISLFDTCYDLSSMKTATLPAVVL 355

Query: 414 EFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQ 473
           +F+G A M +   GI+  V  D    CLA A    E+   IIGN QQ+  R+  D +  Q
Sbjct: 356 DFDGGASMPLPADGILVNVD-DEGTYCLAFA--PEEEAFSIIGNVQQQTFRISIDNQKEQ 412

Query: 474 LGFAGEDC 481
           +G A + C
Sbjct: 413 MGIAPDQC 420


>gi|357118064|ref|XP_003560779.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 472

 Score =  247 bits (630), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 157/432 (36%), Positives = 233/432 (53%), Gaps = 32/432 (7%)

Query: 68  TLELKHKN-YCSGKIVDWNEQQQ----NRLILDNLHVQYLQSRI--KNMISGNIKDVSNT 120
           ++ L H++  C+ K     ++++     RL  D     ++  +   + M+S    +    
Sbjct: 55  SVPLAHRHGPCAPKGSSATDKKKPSFAERLRSDRARADHILRKASGRRMMS----EGGGA 110

Query: 121 EIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKS--CYNQQDPVFD 176
            IP   G  + +L Y+ T+ +G      TV++DTGSDL+WVQC+PC +  CY Q+DP+FD
Sbjct: 111 SIPTYLGGFVDSLEYVVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCNASDCYPQKDPLFD 170

Query: 177 PSISPSYKKVLCNSSTCHALEFATGNSGVCSSSS--PPDCNYFVSYGDGSYTRGELGREH 234
           PS S ++  + C S  C  L     ++G  +++S  PP C Y + YG+G+ T G    E 
Sbjct: 171 PSKSSTFATIPCASDACKQLPVDGYDNGCTNNTSGMPPQCGYAIEYGNGAITEGVYSTET 230

Query: 235 LGLGK-ASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQ 293
           L LG  A V  F FGCG +  G +    GL+GLG +  SLVSQT+ ++GG FSYCLP   
Sbjct: 231 LALGSSAVVKSFRFGCGSDQHGPYDKFDGLLGLGGAPESLVSQTASVYGGAFSYCLPPLN 290

Query: 294 DAGASGSLILGGNSSVFKNSTPITYTNMIP-NPQLATFYILNLTGISIGGKQLQ--ASGF 350
               +G L LG  +S   +++   +T M   +P++ATFY++ LTGIS+GGK L    + F
Sbjct: 291 S--GAGFLTLGAPNSTNNSNSGFVFTPMHAFSPKIATFYVVTLTGISVGGKALDIPPAVF 348

Query: 351 AKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFP-SAPGFSILDTCFNLSAYQEVNIP 409
           AKG I +DSGTVIT +P + Y AL+  F    + +P   P  S LDTC+N + +  V +P
Sbjct: 349 AKGNI-VDSGTVITGIPTTAYKALRTAFRSAMAEYPLLPPADSALDTCYNFTGHGTVTVP 407

Query: 410 LVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDT 469
            V + F G A + +DV   V        + CLA A    +   GIIGN   +   V+YD+
Sbjct: 408 KVALTFVGGATVDLDVPSGVLV------EDCLAFADAG-DGSFGIIGNVNTRTIEVLYDS 460

Query: 470 KNSQLGFAGEDC 481
               LGF    C
Sbjct: 461 GKGHLGFRAGAC 472


>gi|242092900|ref|XP_002436940.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
 gi|241915163|gb|EER88307.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
          Length = 465

 Score =  247 bits (630), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 148/390 (37%), Positives = 211/390 (54%), Gaps = 19/390 (4%)

Query: 98  HVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELGGRNM--TVIVDTGSD 155
              Y++SR    ++    D + T +P   G  + +L Y+ T+  G  ++   +++DTGSD
Sbjct: 89  RTNYIKSRASTGMASTPDDAAVT-VPTRLGGFVDSLEYMVTLGFGTPSVPQVLLMDTGSD 147

Query: 156 LTWVQCQPCKS--CYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPD 213
           ++WVQC PC S  CY Q+DP+FDPS S +Y  + C +  C+ L     N   C+S     
Sbjct: 148 VSWVQCAPCNSTECYPQKDPLFDPSKSSTYAPIACGADACNKLGDHYRNG--CTSGGT-Q 204

Query: 214 CNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLS 272
           C Y V YGDGS TRG    E +      +V DF FGCG + +G      GL+GLG +  S
Sbjct: 205 CGYRVEYGDGSSTRGVYSNETITFAPGITVKDFHFGCGHDQRGPSDKFDGLLGLGGAPES 264

Query: 273 LVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYI 332
           LV QT+ ++GG FSYCLP+      +G L LG   S   N++   +T M   P  AT Y+
Sbjct: 265 LVVQTASVYGGAFSYCLPALNS--EAGFLALGVRPSAATNTSAFVFTPMWHLPMDATSYM 322

Query: 333 LNLTGISIGGKQLQASGFA-KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGF 391
           +N+TGIS+GGK L     A +GG+LIDSGT++T LP + Y+AL A   K F+ +P     
Sbjct: 323 VNMTGISVGGKPLDIPRSAFRGGMLIDSGTIVTELPETAYNALNAALRKAFAAYPMV-AS 381

Query: 392 SILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDE 451
              DTC+N + Y  V +P V + F G A + +DV   +  VK      CLA      +  
Sbjct: 382 EDFDTCYNFTGYSNVTVPRVALTFSGGATIDLDVPNGI-LVKD-----CLAFRESGPDVG 435

Query: 452 TGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
            GIIGN  Q+   V+YD  + ++GF    C
Sbjct: 436 LGIIGNVNQRTLEVLYDAGHGKVGFRAGAC 465


>gi|225217022|gb|ACN85307.1| aspartic proteinase nepenthesin-1 precursor [Oryza ridleyi]
          Length = 525

 Score =  247 bits (630), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 150/364 (41%), Positives = 206/364 (56%), Gaps = 31/364 (8%)

Query: 130 LQTLNYIATIELG---GRNMTVIVDTGSDLTWVQCQPCKS-CYNQQDPVFDPSISPSYKK 185
           L T NY+ TI LG   GR  TV+ DTGSD TWVQC+PC   CY QQ+ +FDP+ S +   
Sbjct: 181 LGTGNYVVTIGLGTPAGR-YTVVFDTGSDTTWVQCEPCVVVCYEQQEKLFDPARSSTDAN 239

Query: 186 VLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGK-ASVND 244
           + C +  C  L +  G SG         C Y V YGDGSY+ G    + L L    ++  
Sbjct: 240 ISCAAPACSDL-YTKGCSGG-------HCLYGVQYGDGSYSIGFFAMDTLTLSSYDAIKG 291

Query: 245 FIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILG 304
           F FGCG  N+GLFG  +GL+GLGR   SL  Q  + +GG+F++C P+   +  +G L  G
Sbjct: 292 FRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQAYDKYGGVFAHCFPAR--SSGTGYLDFG 349

Query: 305 GNSSVF---KNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ--ASGFAKGGILIDS 359
             SS     K +TP+   N +      TFY + LTGI +GGK L    S F   G ++DS
Sbjct: 350 PGSSPAVSTKLTTPMLVDNGL------TFYYVGLTGIRVGGKLLSIPPSVFTTAGTIVDS 403

Query: 360 GTVITRLPPSIYSALKAEFLKQFS--GFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEG 417
           GTVITRLPP+ YS+L++ F    +  G+  AP  S+LDTC++ +   +V IP V + F+G
Sbjct: 404 GTVITRLPPAAYSSLRSAFASAIAARGYKKAPALSLLDTCYDFTGMSQVAIPTVSLLFQG 463

Query: 418 NAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFA 477
            A + VD +GI+Y   +  SQ CL  A+   +D+ GI+GN Q K   V+YD     +GF+
Sbjct: 464 GASLDVDASGIIY--AASVSQACLGFAANEEDDDVGIVGNTQLKTFGVVYDIGKKVVGFS 521

Query: 478 GEDC 481
              C
Sbjct: 522 PGAC 525


>gi|225216995|gb|ACN85284.1| aspartic proteinase nepenthesin-1 precursor [Oryza australiensis]
          Length = 519

 Score =  246 bits (629), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 148/369 (40%), Positives = 205/369 (55%), Gaps = 31/369 (8%)

Query: 125 TSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKS-CYNQQDPVFDPSISP 181
           +SG  L T NY+ T+ LG      TV+ DTGSD TWVQCQPC   CY Q++ +FDP+ S 
Sbjct: 170 SSGRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQREKLFDPARSS 229

Query: 182 SYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGK-A 240
           +Y  + C +  C  L+    + G        +C Y V YGDGSY+ G    + L L    
Sbjct: 230 TYANISCAAPACSDLDTRGCSGG--------NCLYGVQYGDGSYSIGFFAMDTLTLSSYD 281

Query: 241 SVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGS 300
           +V  F FGCG  N+GLFG  +GL+GLGR   SL  QT + +GG+F++CLP    A +SG+
Sbjct: 282 AVKGFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLP----ARSSGT 337

Query: 301 LILGGNSSVFKNSTPITYTNMIPNPQLA----TFYILNLTGISIGGKQLQ--ASGFAKGG 354
             L      F   +P      +  P L     TFY + +TGI +GG+ L    S F   G
Sbjct: 338 GYLD-----FGPGSPAAAGARLTTPMLTDNGPTFYYVGMTGIRVGGQLLSIPQSVFTTAG 392

Query: 355 ILIDSGTVITRLPPSIYSALKAEFLKQFS--GFPSAPGFSILDTCFNLSAYQEVNIPLVK 412
            ++DSGTVITRLPP+ YS+L++ F    +  G+  AP  S+LDTC++ +   +V IP V 
Sbjct: 393 TIVDSGTVITRLPPAAYSSLRSAFASAMAARGYKKAPAVSLLDTCYDFTGMSQVAIPTVS 452

Query: 413 MEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNS 472
           + F+G A + VD +GI+Y   +  SQVCL  A+     + GI+GN Q K   V YD    
Sbjct: 453 LLFQGGARLDVDASGIMY--AASVSQVCLGFAANEDGGDVGIVGNTQLKTFGVAYDIGKK 510

Query: 473 QLGFAGEDC 481
            +GF+   C
Sbjct: 511 VVGFSPGAC 519


>gi|115448349|ref|NP_001047954.1| Os02g0720600 [Oryza sativa Japonica Group]
 gi|45735841|dbj|BAD12876.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
           [Oryza sativa Japonica Group]
 gi|45735967|dbj|BAD12996.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
           [Oryza sativa Japonica Group]
 gi|113537485|dbj|BAF09868.1| Os02g0720600 [Oryza sativa Japonica Group]
 gi|125583492|gb|EAZ24423.1| hypothetical protein OsJ_08176 [Oryza sativa Japonica Group]
 gi|215740989|dbj|BAG97484.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 463

 Score =  246 bits (629), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 163/447 (36%), Positives = 224/447 (50%), Gaps = 33/447 (7%)

Query: 50  SSSSCVSHQKSRIEMGAITLELKHKNYCSGKIVDWNEQ--QQNRLILDNLHVQYLQSRIK 107
           S + C         +   T+ L H++     +    ++  ++  L  D L  +++Q +  
Sbjct: 35  SEAVCSERNAISSSLSGTTVALNHRHGPCSPVPSSKKRPTEEELLKRDQLRAEHIQRKFA 94

Query: 108 -NMISGNIKDVSNTEI----PLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQ 160
            N       D+  +++    P   G  L TL Y+ ++ LG      TV +DTGSD++WVQ
Sbjct: 95  MNAAVDGAGDLQQSKVSSSVPTKLGSSLDTLEYVISVGLGTPAVTQTVTIDTGSDVSWVQ 154

Query: 161 CQPCKS--CYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFV 218
           C PC +  CY Q   +FDP+ S +Y+ V C ++ C  LE      G  +     +C Y V
Sbjct: 155 CNPCPNPPCYAQTGALFDPAKSSTYRAVSCAAAECAQLEQQGNGCGATNY----ECQYGV 210

Query: 219 SYGDGSYTRGELGREHLGLGKAS--VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQ 276
            YGDGS T G   R+ L L  AS  V  F FGC     G      GLMGLG    SLVSQ
Sbjct: 211 QYGDGSTTNGTYSRDTLTLSGASDAVKGFQFGCSHVESGFSDQTDGLMGLGGGAQSLVSQ 270

Query: 277 TSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLT 336
           T+  +G  FSYCLP T     SGS            S     T M+ + Q+ TFY   L 
Sbjct: 271 TAAAYGNSFSYCLPPT-----SGSSGFLTLGGGGGVSG-FVTTRMLRSRQIPTFYGARLQ 324

Query: 337 GISIGGKQL--QASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSIL 394
            I++GGKQL    S FA G + +DSGT+ITRLPP+ YSAL + F      + SAP  SIL
Sbjct: 325 DIAVGGKQLGLSPSVFAAGSV-VDSGTIITRLPPTAYSALSSAFKAGMKQYRSAPARSIL 383

Query: 395 DTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGI 454
           DTCF+ +   +++IP V + F G A + +D  GI+Y         CLA A+   +  TGI
Sbjct: 384 DTCFDFAGQTQISIPTVALVFSGGAAIDLDPNGIMY-------GNCLAFAATGDDGTTGI 436

Query: 455 IGNYQQKNQRVIYDTKNSQLGFAGEDC 481
           IGN QQ+   V+YD  +S LGF    C
Sbjct: 437 IGNVQQRTFEVLYDVGSSTLGFRSGAC 463


>gi|225216879|gb|ACN85177.1| aspartic proteinase nepenthesin-1 precursor [Oryza nivara]
 gi|225216896|gb|ACN85193.1| aspartic proteinase nepenthesin-1 precursor [Oryza rufipogon]
          Length = 515

 Score =  246 bits (628), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 151/363 (41%), Positives = 203/363 (55%), Gaps = 26/363 (7%)

Query: 127 GIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCK-SCYNQQDPVFDPSISPSY 183
           G  L T NY+ T+ LG      TV+ DTGSD TWVQCQPC  +CY Q++ +FDP+ S +Y
Sbjct: 171 GRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVACYEQREKLFDPASSSTY 230

Query: 184 KKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGK-ASV 242
             V C +  C  L+ +  + G         C Y V YGDGSY+ G    + L L    +V
Sbjct: 231 ANVSCAAPACSDLDVSGCSGG--------HCLYGVQYGDGSYSIGFFAMDTLTLSSYDAV 282

Query: 243 NDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLI 302
             F FGCG  N GLFG  +GL+GLGR   SL  QT   +GG+F++CLP+      +G L 
Sbjct: 283 KGFRFGCGERNDGLFGEAAGLLGLGRGKTSLPVQTYGKYGGVFAHCLPARST--GTGYLD 340

Query: 303 LGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL--QASGFAKGGILIDSG 360
            G  S     +TP+   N    P   TFY + +TGI +GG+ L    S FA  G ++DSG
Sbjct: 341 FGAGSPPATTTTPMLTGN---GP---TFYYVGMTGIRVGGRLLPIAPSVFAAAGTIVDSG 394

Query: 361 TVITRLPPSIYSALKAEFLKQFS--GFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGN 418
           TVITRLPP+ YS+L++ F    +  G+  A   S+LDTC++ +   +V IP V + F+G 
Sbjct: 395 TVITRLPPAAYSSLRSAFAAAMAARGYRKAAAVSLLDTCYDFTGMSQVAIPTVSLLFQGG 454

Query: 419 AEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAG 478
           A + VD +GI+Y V   ASQVCLA A      + GI+GN Q K   V YD     +GF+ 
Sbjct: 455 AALDVDASGIMYTVS--ASQVCLAFAGNEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFSP 512

Query: 479 EDC 481
             C
Sbjct: 513 GAC 515


>gi|413923782|gb|AFW63714.1| hypothetical protein ZEAMMB73_300584 [Zea mays]
          Length = 458

 Score =  246 bits (627), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 172/478 (35%), Positives = 245/478 (51%), Gaps = 27/478 (5%)

Query: 9   TILSLLLPLMVSLFLLAKGAHCFEGKKKLHLHKLQWQQKSGSSSSCVSHQKSRIEMGAIT 68
           +I   LL L+ S   L   AH  + ++    HK+        SS+  S  K       +T
Sbjct: 3   SISKFLLALLFSYHTLI--AHAADDRR----HKVLSVGSLMKSSTACSEPKVTPPSTGVT 56

Query: 69  LELKHK-NYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSG 127
           + L H+ + CS          + RL  D L   Y++ +     +G+I+      +P T G
Sbjct: 57  VPLHHRYDPCSPVPSKKVPTLEERLRRDQLRAAYIKRKFSG--AGDIEQSDAATVPTTLG 114

Query: 128 IRLQTLNYIATIELGGRNMT--VIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKK 185
             L TL Y+ T+ +G   +T  + +DTGSD++WVQC+PC  C+++ D +FDPS S +Y  
Sbjct: 115 TSLSTLEYVITVGIGSPAVTQTMSMDTGSDVSWVQCKPCSQCHSEVDSLFDPSSSSTYSP 174

Query: 186 VLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDF 245
             C+S+ C  L  +   +G  SS     C Y V+YGD S T G    + L LG +++ DF
Sbjct: 175 FSCSSAPCAQLSQSQEGNGCMSS----QCQYIVNYGDSSSTTGTYSSDTLTLGSSAMTDF 230

Query: 246 IFGCGRNNKGLFGG-VSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILG 304
            FGC ++  G F     GLMGLG    SL SQT+  FG  FSYCLP T  +G+SG L LG
Sbjct: 231 QFGCSQSESGGFNDQTDGLMGLGGGAQSLASQTAGTFGTAFSYCLPPT--SGSSGFLTLG 288

Query: 305 GNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ-ASGFAKGGILIDSGTVI 363
             SS F        T M+ + Q+ T+Y++ L  I +G +QL   +     G L+DSGT+I
Sbjct: 289 TGSSGFVK------TPMLRSTQIPTYYVVLLESIKVGSQQLNLPTSVFSAGSLMDSGTII 342

Query: 364 TRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTV 423
           TRLPP+ YSAL + F      +P A    ILDTCF+ S    ++IP V + F G A + +
Sbjct: 343 TRLPPTAYSALSSAFKAGMQQYPPATPSGILDTCFDFSGQSSISIPTVTLVFSGGAAVDL 402

Query: 424 DVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
              GI+  + S  S  CLA      +   GIIGN QQ+   V+YD     +GF    C
Sbjct: 403 AFDGIMLEISS--SIRCLAFTPNGDDSSLGIIGNVQQRTFEVLYDVGGGAVGFKAGAC 458


>gi|115468912|ref|NP_001058055.1| Os06g0610800 [Oryza sativa Japonica Group]
 gi|51091964|dbj|BAD35493.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
           sativa Japonica Group]
 gi|113596095|dbj|BAF19969.1| Os06g0610800 [Oryza sativa Japonica Group]
 gi|218198535|gb|EEC80962.1| hypothetical protein OsI_23679 [Oryza sativa Indica Group]
          Length = 519

 Score =  246 bits (627), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 151/363 (41%), Positives = 203/363 (55%), Gaps = 26/363 (7%)

Query: 127 GIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCK-SCYNQQDPVFDPSISPSY 183
           G  L T NY+ T+ LG      TV+ DTGSD TWVQCQPC  +CY Q++ +FDP+ S +Y
Sbjct: 175 GRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVACYEQREKLFDPASSSTY 234

Query: 184 KKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGK-ASV 242
             V C +  C  L+ +  + G         C Y V YGDGSY+ G    + L L    +V
Sbjct: 235 ANVSCAAPACSDLDVSGCSGG--------HCLYGVQYGDGSYSIGFFAMDTLTLSSYDAV 286

Query: 243 NDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLI 302
             F FGCG  N GLFG  +GL+GLGR   SL  QT   +GG+F++CLP+      +G L 
Sbjct: 287 KGFRFGCGERNDGLFGEAAGLLGLGRGKTSLPVQTYGKYGGVFAHCLPARST--GTGYLD 344

Query: 303 LGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL--QASGFAKGGILIDSG 360
            G  S     +TP+   N    P   TFY + +TGI +GG+ L    S FA  G ++DSG
Sbjct: 345 FGAGSPPATTTTPMLTGN---GP---TFYYVGMTGIRVGGRLLPIAPSVFAAAGTIVDSG 398

Query: 361 TVITRLPPSIYSALKAEFLKQFS--GFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGN 418
           TVITRLPP+ YS+L++ F    +  G+  A   S+LDTC++ +   +V IP V + F+G 
Sbjct: 399 TVITRLPPAAYSSLRSAFAAAMAARGYRKAAAVSLLDTCYDFTGMSQVAIPTVSLLFQGG 458

Query: 419 AEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAG 478
           A + VD +GI+Y V   ASQVCLA A      + GI+GN Q K   V YD     +GF+ 
Sbjct: 459 AALDVDASGIMYTVS--ASQVCLAFAGNEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFSP 516

Query: 479 EDC 481
             C
Sbjct: 517 GAC 519


>gi|225216949|gb|ACN85242.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
          Length = 519

 Score =  245 bits (626), Expect = 3e-62,   Method: Compositional matrix adjust.
 Identities = 151/368 (41%), Positives = 204/368 (55%), Gaps = 29/368 (7%)

Query: 125 TSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKS-CYNQQDPVFDPSISP 181
           +SG  L T NY+ T+ LG      TV+ DTGSD TWVQCQPC   CY Q++ +FDP+ S 
Sbjct: 170 SSGRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQREKLFDPARSS 229

Query: 182 SYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGK-A 240
           +Y  V C +  C  L     + G         C Y V YGDGSY+ G    + L L    
Sbjct: 230 TYANVSCAAPACSDLNIHGCSGG--------HCLYGVQYGDGSYSIGFFAMDTLTLSSYD 281

Query: 241 SVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGS 300
           +V  F FGCG  N+GLFG  +GL+GLGR   SL  QT + +GG+F++CLP+      +G 
Sbjct: 282 AVKGFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARST--GTGY 339

Query: 301 LILGGNS---SVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ--ASGFAKGGI 355
           L  G  S   +  + +TP+   N    P   TFY + +TGI +GG+ L    S FA  G 
Sbjct: 340 LDFGAGSLAAARARLTTPMLTEN---GP---TFYYVGMTGIRVGGQLLSIPQSVFATAGT 393

Query: 356 LIDSGTVITRLPPSIYSALK--AEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKM 413
           ++DSGTVITRLPP+ YS+L+          G+  AP  S+LDTC++ +   +V IP V +
Sbjct: 394 IVDSGTVITRLPPAAYSSLRYAFAAAMAARGYKKAPAVSLLDTCYDFTGMSQVAIPTVSL 453

Query: 414 EFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQ 473
            F+G A + VD +GI+Y   + ASQVCLA A+     + GI+GN Q K   V YD     
Sbjct: 454 LFQGGARLDVDASGIMY--AASASQVCLAFAANEDGGDVGIVGNTQLKTFGVAYDIGKKV 511

Query: 474 LGFAGEDC 481
           +GF    C
Sbjct: 512 VGFYPGAC 519


>gi|225216914|gb|ACN85210.1| aspartic proteinase nepenthesin-1 precursor [Oryza glaberrima]
          Length = 516

 Score =  245 bits (626), Expect = 4e-62,   Method: Compositional matrix adjust.
 Identities = 151/363 (41%), Positives = 202/363 (55%), Gaps = 26/363 (7%)

Query: 127 GIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCK-SCYNQQDPVFDPSISPSY 183
           G  L T NY+ T+ LG      TV+ DTGSD TWVQCQPC  +CY Q++ +FDP+ S +Y
Sbjct: 172 GRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVACYEQREKLFDPASSSTY 231

Query: 184 KKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGK-ASV 242
             V C +  C  L+ +  + G         C Y V YGDGSY+ G    + L L    +V
Sbjct: 232 ANVSCAAPACSDLDVSGCSGG--------HCLYGVQYGDGSYSIGFFAMDTLTLSSYDAV 283

Query: 243 NDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLI 302
             F FGCG  N GLFG  +GL+GLGR   SL  QT   +GG+F++CLP       +G L 
Sbjct: 284 KGFRFGCGERNDGLFGEAAGLLGLGRGKTSLPVQTYGKYGGVFAHCLPPRST--GTGYLD 341

Query: 303 LGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL--QASGFAKGGILIDSG 360
            G  S     +TP+   N    P   TFY + +TGI +GG+ L    S FA  G ++DSG
Sbjct: 342 FGAGSPPATTTTPMLTGN---GP---TFYYVGMTGIRVGGRLLPIAPSVFAAAGTIVDSG 395

Query: 361 TVITRLPPSIYSALKAEFLKQFS--GFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGN 418
           TVITRLPP+ YS+L++ F    +  G+  A   S+LDTC++ +   +V IP V + F+G 
Sbjct: 396 TVITRLPPAAYSSLRSAFAAAMAARGYRKAAAVSLLDTCYDFTGMSQVAIPTVSLLFQGG 455

Query: 419 AEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAG 478
           A + VD +GI+Y V   ASQVCLA A      + GI+GN Q K   V YD     +GF+ 
Sbjct: 456 AALDVDASGIMYTVS--ASQVCLAFAGNEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFSP 513

Query: 479 EDC 481
             C
Sbjct: 514 GAC 516


>gi|302795261|ref|XP_002979394.1| hypothetical protein SELMODRAFT_53966 [Selaginella moellendorffii]
 gi|300153162|gb|EFJ19802.1| hypothetical protein SELMODRAFT_53966 [Selaginella moellendorffii]
          Length = 353

 Score =  244 bits (624), Expect = 5e-62,   Method: Compositional matrix adjust.
 Identities = 150/368 (40%), Positives = 214/368 (58%), Gaps = 25/368 (6%)

Query: 123 PLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSIS 180
           PL SGI   + +Y A I +G   R++ ++ DTGSD++W+QC PC+ CY QQDP+F+PS+S
Sbjct: 2   PLISGIAGGSGDYFARIGVGTPARSVYMVADTGSDVSWLQCSPCRKCYRQQDPIFNPSLS 61

Query: 181 PSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA 240
            S+K + C SS C  L+        CS  +   C Y VSYGDGS+T G+   E L  G+ 
Sbjct: 62  SSFKPLACASSICGKLKIKG-----CSRKN--KCMYQVSYGDGSFTVGDFSTETLSFGEH 114

Query: 241 SVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGS 300
           +V     GCGRNN+GLF G +GL+GLGR  LS  SQT   +  +FSYCLP  + A A+ S
Sbjct: 115 AVRSVAMGCGRNNQGLFHGAAGLLGLGRGPLSFPSQTGTSYASVFSYCLPRRESAIAA-S 173

Query: 301 LILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGK--QLQASGFA-----KG 353
           L+ G ++   K      +T ++PN +L T+Y + L  I + G    +    FA      G
Sbjct: 174 LVFGPSAVPEK----ARFTKLLPNRRLDTYYYVGLARIRVAGSPVNIPPDAFAMGSRGTG 229

Query: 354 GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKM 413
           G+++DSGT I+RL    Y+AL+  F +    FPSAPG S+ DTC++LS+ +   +P V +
Sbjct: 230 GVIVDSGTAISRLTTPAYTALRDAF-RSLVTFPSAPGISLFDTCYDLSSMKTATLPAVVL 288

Query: 414 EFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQ 473
           +F+G A M +   GI+  V  D    CLA A    E+   IIGN QQ+  R+  D +  Q
Sbjct: 289 DFDGGASMPLPADGILVNVD-DEGTYCLAFA--PEEEAFSIIGNVQQQTFRISIDNQKEQ 345

Query: 474 LGFAGEDC 481
           +G A + C
Sbjct: 346 MGIAPDQC 353


>gi|242063796|ref|XP_002453187.1| hypothetical protein SORBIDRAFT_04g001340 [Sorghum bicolor]
 gi|241933018|gb|EES06163.1| hypothetical protein SORBIDRAFT_04g001340 [Sorghum bicolor]
          Length = 493

 Score =  242 bits (618), Expect = 3e-61,   Method: Compositional matrix adjust.
 Identities = 167/449 (37%), Positives = 240/449 (53%), Gaps = 35/449 (7%)

Query: 59  KSRIEMGAITLELKHKN-YCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGN---- 113
           +SR      T+ L H++  CS          + RL  D L   Y+  ++           
Sbjct: 54  ESRAPAVHATVPLHHRHGPCSPLPNKKMPTLEERLHRDKLRAAYIHRKLSRGKKQGGGGA 113

Query: 114 -----IKDVSNTEIPLTSGIRLQTLNYIATIELG---GRNMTVIVDTGSDLTWVQCQPC- 164
                ++      +P T G  L TL Y+ T+ LG   G++ T+++DTGSD++WV+C+PC 
Sbjct: 114 GGDVVVQQSHAMTVPTTLGTSLDTLEYVITVRLGSPPGKSQTMLIDTGSDISWVRCKPCW 173

Query: 165 KSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGS 224
           + C  Q DP+FDPS+S +Y    C+S+ C  L F  GN+  CSSS    C Y   YGDGS
Sbjct: 174 QQCRPQVDPLFDPSLSSTYSPFSCSSAACAQL-FQEGNANGCSSSG--QCQYIAMYGDGS 230

Query: 225 Y-TRGELGREHLGLGKAS----VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSE 279
             T G    + L LG  S    V+ F FGC     G+ G  +GLMGLG    SLVSQT+ 
Sbjct: 231 VGTTGTYSSDTLALGSNSNTVVVSKFRFGCSHAETGITGLTAGLMGLGGGAQSLVSQTAG 290

Query: 280 IFGG-LFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGI 338
            FG   FSYCLP T  +  SG L LG   +   +S     T M+ + Q+  FY + L  I
Sbjct: 291 TFGTTAFSYCLPPTPSS--SGFLTLGAAGT---SSAGFVKTPMLRSSQVPAFYGVRLEAI 345

Query: 339 SIGGKQLQ-ASGFAKGGILIDSGTVITRLPPSIYSALKAEF---LKQFSGFPSAPGFSIL 394
            +GG+QL   +     G+++DSGTV+TRLPP+ YS+L + F   +KQ+   PS+ G   L
Sbjct: 346 RVGGRQLSIPTTVFSAGMIMDSGTVVTRLPPTAYSSLSSAFKAGMKQYPPAPSSAGGGFL 405

Query: 395 DTCFNLSAYQEVNIPLVKMEFE--GNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDET 452
           DTCF++S    V++P V + F   G A + +D +GI+  +++ +S  CLA  + S +  T
Sbjct: 406 DTCFDMSGQSSVSMPTVALVFSGAGGAVVNLDASGILLQMET-SSIFCLAFVATSDDGST 464

Query: 453 GIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
           GIIGN QQ+  +V+YD     +GF    C
Sbjct: 465 GIIGNVQQRTFQVLYDVAGGAVGFKAGAC 493


>gi|359497446|ref|XP_003635520.1| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
           vinifera]
          Length = 354

 Score =  242 bits (617), Expect = 3e-61,   Method: Compositional matrix adjust.
 Identities = 147/368 (39%), Positives = 219/368 (59%), Gaps = 24/368 (6%)

Query: 123 PLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKS-CYNQQDPVFDPSI 179
           PL  G  + + NY   + LG   R  ++IVDTGS L+W+QC+PC   C+ Q DP+FDPS 
Sbjct: 1   PLNPGASIGSGNYYVKVGLGSPARYYSMIVDTGSSLSWLQCKPCVVYCHVQADPLFDPSA 60

Query: 180 SPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGK 239
           S +YK + C SS C +L  AT N+ +C +SS   C Y  SYGD SY+ G L ++ L L  
Sbjct: 61  SKTYKSLSCTSSQCSSLVDATLNNPLCETSSN-VCVYTASYGDSSYSMGYLSQDLLTLAP 119

Query: 240 A-SVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGAS 298
           + ++  F++GCG++++GLFG  +G++GLGR+ LS++ Q S  FG  FSYCLP+    G  
Sbjct: 120 SQTLPGFVYGCGQDSEGLFGRAAGILGLGRNKLSMLGQVSSKFGYAFSYCLPTR---GGG 176

Query: 299 GSLILGGNS---SVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ-ASGFAKGG 354
           G L +G  S   S +K      +T M  +P   + Y L LT I++GG+ L  A+   +  
Sbjct: 177 GFLSIGKASLAGSAYK------FTPMTTDPGNPSLYFLRLTAITVGGRALGVAAAQYRVP 230

Query: 355 ILIDSGTVITRLPPSIYSALKAEFLKQFSG-FPSAPGFSILDTCFNLSAYQEVNIPLVKM 413
            +IDSGTVITRLP S+Y+  +  F+K  S  +  APGFSILDTCF  +     ++P V++
Sbjct: 231 TIIDSGTVITRLPMSVYTPFQQAFVKIMSSKYARAPGFSILDTCFKGNLKDMQSVPEVRL 290

Query: 414 EFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQ 473
            F+G A++ +    ++  ++ D    CLA A     +   IIGN+QQ+  +V +D   ++
Sbjct: 291 IFQGGADLNLRPVNVL--LQVDEGLTCLAFAG---NNGVAIIGNHQQQTFKVAHDISTAR 345

Query: 474 LGFAGEDC 481
           +GFA   C
Sbjct: 346 IGFATGGC 353


>gi|326506682|dbj|BAJ91382.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326525815|dbj|BAJ88954.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 473

 Score =  242 bits (617), Expect = 3e-61,   Method: Compositional matrix adjust.
 Identities = 145/372 (38%), Positives = 203/372 (54%), Gaps = 21/372 (5%)

Query: 116 DVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCK-SCYNQQD 172
           D S   +PLT G      NY+  + LG   +   ++VDTGS LTW+QC PC+ SC+ Q  
Sbjct: 118 DGSLASVPLTPGTSYGVGNYVTRMGLGTPAKPYIMVVDTGSSLTWLQCSPCRVSCHRQSG 177

Query: 173 PVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGR 232
           PVFDP  S SY  V C++  C+ L  AT N   CSSS    C Y  SYGD S++ G L +
Sbjct: 178 PVFDPKTSSSYAAVSCSTPQCNDLSTATLNPAACSSSD--VCIYQASYGDSSFSVGYLSK 235

Query: 233 EHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPST 292
           + +  G  SV +F +GCG++N+GLFG  +GLMGL R+ LSL+ Q +   G  FSYCLPS+
Sbjct: 236 DTVSFGSNSVPNFYYGCGQDNEGLFGRSAGLMGLARNKLSLLYQLAPTLGYSFSYCLPSS 295

Query: 293 QDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASG--F 350
             +G               N    +YT M+ +    + Y + L+G+++ GK L  S   +
Sbjct: 296 SSSGYLSIGSY--------NPGQYSYTPMVSSTLDDSLYFIKLSGMTVAGKPLAVSSSEY 347

Query: 351 AKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPL 410
           +    +IDSGTVITRLP ++Y AL         G   A  +SILDTCF +     + +P 
Sbjct: 348 SSLPTIIDSGTVITRLPTTVYDALSKAVAGAMKGTKRADAYSILDTCF-VGQASSLRVPA 406

Query: 411 VKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTK 470
           V M F G A + +    ++  V  D+S  CLA A         IIGN QQ+   V+YD K
Sbjct: 407 VSMAFSGGAALKLSAQNLL--VDVDSSTTCLAFAP---ARSAAIIGNTQQQTFSVVYDVK 461

Query: 471 NSQLGFAGEDCS 482
           ++++GFA   C+
Sbjct: 462 SNRIGFAAGGCT 473


>gi|224124882|ref|XP_002329972.1| predicted protein [Populus trichocarpa]
 gi|222871994|gb|EEF09125.1| predicted protein [Populus trichocarpa]
          Length = 332

 Score =  242 bits (617), Expect = 4e-61,   Method: Compositional matrix adjust.
 Identities = 137/338 (40%), Positives = 204/338 (60%), Gaps = 11/338 (3%)

Query: 148 VIVDTGSDLTWVQCQPCKS-CYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVC 206
           +I+DTGS L+W+QCQPC   C+ Q DP++DPS+S +YKK+ C S  C  L+ AT N  +C
Sbjct: 1   MILDTGSSLSWLQCQPCAVYCHAQADPLYDPSVSKTYKKLSCASVECSRLKAATLNDPLC 60

Query: 207 SSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCGRNNKGLFGGVSGLMG 265
            + S   C Y  SYGD S++ G L ++ L L  + ++  F +GCG++N+GLFG  +G++G
Sbjct: 61  ETDSNA-CLYTASYGDTSFSIGYLSQDLLTLTSSQTLPQFTYGCGQDNQGLFGRAAGIIG 119

Query: 266 LGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNP 325
           L R  LS+++Q S  +G  FSYCLP+     + G  +  G+ S     T   +T M+ + 
Sbjct: 120 LARDKLSMLAQLSTKYGHAFSYCLPTANSGSSGGGFLSIGSIS----PTSYKFTPMLTDS 175

Query: 326 QLATFYILNLTGISIGGKQLQ-ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFS- 383
           +  + Y L LT I++ G+ L  A+   +   LIDSGTVITRLP S+Y+AL+  F+K  S 
Sbjct: 176 KNPSLYFLRLTAITVSGRPLDLAAAMYRVPTLIDSGTVITRLPMSMYAALRQAFVKIMST 235

Query: 384 GFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLAL 443
            +  AP +SILDTCF  S      +P +KM F+G A++T+    I+  +++D    CLA 
Sbjct: 236 KYAKAPAYSILDTCFKGSLKSISAVPEIKMIFQGGADLTLRAPSIL--IEADKGITCLAF 293

Query: 444 ASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
           A  S  ++  IIGN QQ+   + YD   S++GFA   C
Sbjct: 294 AGSSGTNQIAIIGNRQQQTYNIAYDVSTSRIGFAPGSC 331


>gi|413938607|gb|AFW73158.1| aspartic proteinase nepenthesin-2 [Zea mays]
          Length = 478

 Score =  241 bits (616), Expect = 5e-61,   Method: Compositional matrix adjust.
 Identities = 159/409 (38%), Positives = 223/409 (54%), Gaps = 39/409 (9%)

Query: 90  NRLILDNLHVQYLQSRIKNMISGNIKDVSNTE-------IPLTSGIRLQTLNYIATIELG 142
           + L  D    +Y+  R+    SG    + +++       +P + G  + TLNY+ T  LG
Sbjct: 92  DTLRADQRRAEYILRRV----SGRAPQLWDSKAAAAAATVPASWGYDIGTLNYVVTASLG 147

Query: 143 --GRNMTVIVDTGSDLTWVQCQPC---KSCYNQQDPVFDPSISPSYKKVLCNSSTCHALE 197
             G   T+ VDTGSDL+WVQC+PC    SCY+Q+DP+FDP+ S SY  V C    C  L 
Sbjct: 148 TPGVAQTMEVDTGSDLSWVQCKPCSAAPSCYSQKDPLFDPAQSSSYAAVPCGGPVCAGLG 207

Query: 198 FATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS-VNDFIFGCGRNNKGL 256
               ++   +        Y VSYGDGS T G    + L L  +S V  F FGCG    GL
Sbjct: 208 IYAASACSAAQC-----GYVVSYGDGSNTTGVYSSDTLTLSASSAVQGFFFGCGHAQSGL 262

Query: 257 FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPI 316
           F GV GL+GLGR   SLV QT+  +GG+FSYCLP+        +L +GG S         
Sbjct: 263 FNGVDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPSTAGYLTLGVGGPSGAAPG---F 319

Query: 317 TYTNMIPNPQLATFYILNLTGISIGGKQLQ--ASGFAKGGILIDSGTVITRLPPSIYSAL 374
           + T ++P+P   T+Y++ LTGIS+GG+QL   AS FA  G ++D+GTV+TRLPP+ Y+AL
Sbjct: 320 STTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAG-GTVVDTGTVVTRLPPTAYAAL 378

Query: 375 KAEFLKQFS--GFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFV 432
           ++ F    +  G+P+AP   ILDTC+N + Y  V +P V + F   A +T+   GI+ F 
Sbjct: 379 RSAFRSGMASYGYPTAPSNGILDTCYNFAGYGTVTLPNVALTFGSGATVTLGADGILSF- 437

Query: 433 KSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
                  CLA A    +    I+GN QQ++  V  D   + +GF    C
Sbjct: 438 ------GCLAFAPSGSDGGMAILGNVQQRSFEVRID--GTSVGFKPSSC 478


>gi|38344196|emb|CAE05761.2| OSJNBa0064G10.12 [Oryza sativa Japonica Group]
          Length = 451

 Score =  241 bits (616), Expect = 5e-61,   Method: Compositional matrix adjust.
 Identities = 148/425 (34%), Positives = 219/425 (51%), Gaps = 48/425 (11%)

Query: 68  TLELKHKNYCSGKIVDWNEQQQNRLIL-DNLHVQYLQSRIKNMISGNI-KDVSNTEIPLT 125
           +L L H++  SG        Q   L+  DN  V++L+ R+    S  + +D+ +  +P  
Sbjct: 64  SLSLVHRDAISGATYPSRRHQVVGLVARDNARVEHLEKRLVASTSPYLPEDLVSEVVP-- 121

Query: 126 SGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSY 183
            G+   +  Y   + +G    +  ++VD+GSD+ WVQC+PC+ CY Q DP+FDP+ S S+
Sbjct: 122 -GVDDGSGEYFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQCYAQTDPLFDPAASSSF 180

Query: 184 KKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVN 243
             V C S+ C  L       G  +      C+Y V+YGDGSYT+GEL  E L LG  +V 
Sbjct: 181 SGVSCGSAICRTLSGTGCGGGGDAGK----CDYSVTYGDGSYTKGELALETLTLGGTAVQ 236

Query: 244 DFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLIL 303
               GCG  N GLF G +GL+GLG   +SLV Q     GG+FSYCL S + AG +GSL  
Sbjct: 237 GVAIGCGHRNSGLFVGAAGLLGLGWGAMSLVGQLGGAAGGVFSYCLAS-RGAGGAGSLA- 294

Query: 304 GGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL-------QASGFAKGGIL 356
                                   ++FY + LTGI +GG++L       Q +    GG++
Sbjct: 295 ------------------------SSFYYVGLTGIGVGGERLPLQDSLFQLTEDGAGGVV 330

Query: 357 IDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFE 416
           +D+GT +TRLP   Y+AL+  F       P +P  S+LDTC++LS Y  V +P V   F+
Sbjct: 331 MDTGTAVTRLPREAYAALRGAFDGAMGALPRSPAVSLLDTCYDLSGYASVRVPTVSFYFD 390

Query: 417 GNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGF 476
             A +T+    ++  V+   +  CLA A  S      I+GN QQ+  ++  D+ N  +GF
Sbjct: 391 QGAVLTLPARNLL--VEVGGAVFCLAFAPSS--SGISILGNIQQEGIQITVDSANGYVGF 446

Query: 477 AGEDC 481
               C
Sbjct: 447 GPNTC 451


>gi|356555807|ref|XP_003546221.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 457

 Score =  241 bits (616), Expect = 5e-61,   Method: Compositional matrix adjust.
 Identities = 159/405 (39%), Positives = 240/405 (59%), Gaps = 26/405 (6%)

Query: 95  DNLHVQYLQSRIKNMISGNIKDVSN-------TEIPLTSGIRLQTLNYIATIELG--GRN 145
           D   V++L SR+ N  S +    ++          PL SG+ + + NY   I +G   + 
Sbjct: 60  DEERVRFLHSRLTNKESASNSATTDKLGGPSLVSTPLKSGLSIGSGNYYVKIGVGTPAKY 119

Query: 146 MTVIVDTGSDLTWVQCQPCK-SCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSG 204
            ++IVDTGS L+W+QCQPC   C+ Q DP+F PS+S +YK + C+SS C +L+ +T N+ 
Sbjct: 120 FSMIVDTGSSLSWLQCQPCVIYCHVQVDPIFTPSVSKTYKALSCSSSQCSSLKSSTLNAP 179

Query: 205 VCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL--GKASVNDFIFGCGRNNKGLFGGVSG 262
            CS+++   C Y  SYGD S++ G L ++ L L    A  + F++GCG++N+GLFG  +G
Sbjct: 180 GCSNATG-ACVYKASYGDTSFSIGYLSQDVLTLTPSAAPSSGFVYGCGQDNQGLFGRSAG 238

Query: 263 LMGLGRSDLSLVSQTSEIFGGLFSYCLPST----QDAGASGSLILGGNSSVFKNSTPITY 318
           ++GL    LS++ Q S  +G  FSYCLPS+     ++  SG L +G +S    +S+P  +
Sbjct: 239 IIGLANDKLSMLGQLSNKYGNAFSYCLPSSFSAQPNSSVSGFLSIGASSL---SSSPYKF 295

Query: 319 TNMIPNPQLATFYILNLTGISIGGKQLQASGFAKG-GILIDSGTVITRLPPSIYSALKAE 377
           T ++ NP++ + Y L LT I++ GK L  S  +     +IDSGTVITRLP +IY+ALK  
Sbjct: 296 TPLVKNPKIPSLYFLGLTTITVAGKPLGVSASSYNVPTIIDSGTVITRLPVAIYNALKKS 355

Query: 378 FLKQFS-GFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDA 436
           F+   S  +  APGFSILDTCF  S  +   +P +++ F G A + + V   +  V+ + 
Sbjct: 356 FVMIMSKKYAQAPGFSILDTCFKGSVKEMSTVPEIRIIFRGGAGLELKVHNSL--VEIEK 413

Query: 437 SQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
              CLA+A+ S  +   IIGNYQQ+   V YD  NS++GFA   C
Sbjct: 414 GTTCLAIAASS--NPISIIGNYQQQTFTVAYDVANSKIGFAPGGC 456


>gi|294461757|gb|ADE76437.1| unknown [Picea sitchensis]
          Length = 325

 Score =  241 bits (615), Expect = 7e-61,   Method: Compositional matrix adjust.
 Identities = 135/341 (39%), Positives = 195/341 (57%), Gaps = 22/341 (6%)

Query: 146 MTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGV 205
           M +++DTGSD+TW+QC PC  CY QQD +F P+ S +YK + CNS+ C  L+     S  
Sbjct: 1   MFLLIDTGSDITWIQCDPCPQCYKQQDSLFQPAGSATYKPLPCNSTMCQQLQ---SFSHS 57

Query: 206 CSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGK-----ASVNDFIFGCGRNNKGLFGGV 260
           C +SS   CNY VSYGD S TRG+   E L L        SV +F FGCG  NKGLF G 
Sbjct: 58  CLNSS---CNYMVSYGDKSTTRGDFALETLTLRSDDTILVSVPNFAFGCGHANKGLFNGA 114

Query: 261 SGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTN 320
           +GLMGLG+S +   +QTS  FG +FSYCLPS      SG L  G  + +  +   + +T 
Sbjct: 115 AGLMGLGKSSIGFPAQTSVAFGKVFSYCLPSVSSTIPSGILHFGEAAMLDYD---VRFTP 171

Query: 321 MIPNPQLATFYILNLTGISIGGKQLQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLK 380
           ++ +    + Y +++TGI++G + L  S      +++DSGTVI+R   S Y  L+  F +
Sbjct: 172 LVDSSSGPSQYFVSMTGINVGDELLPIS----ATVMVDSGTVISRFEQSAYERLRDAFTQ 227

Query: 381 QFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVC 440
              G  +A   +  DTCF +S   ++NIPL+ + F  +AE+ +    I+Y V  D   +C
Sbjct: 228 ILPGLQTAVSVAPFDTCFRVSTVDDINIPLITLHFRDDAELRLSPVHILYPV--DDGVMC 285

Query: 441 LALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
            A A  S      ++GN+QQ+N R +YD   S+LG +  +C
Sbjct: 286 FAFAPSS--SGRSVLGNFQQQNLRFVYDIPKSRLGISAFEC 324


>gi|414584780|tpg|DAA35351.1| TPA: hypothetical protein ZEAMMB73_696016 [Zea mays]
          Length = 524

 Score =  241 bits (614), Expect = 7e-61,   Method: Compositional matrix adjust.
 Identities = 144/405 (35%), Positives = 214/405 (52%), Gaps = 32/405 (7%)

Query: 95  DNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELGG--RNMTVIVDT 152
           DN   +YL +R+           S +E  + SG+   +  Y+  + +G       ++VD+
Sbjct: 133 DNARAEYLATRLSPAY--QPPGFSGSESKVVSGLDEGSGEYLVRVSVGSPPTEQYLVVDS 190

Query: 153 GSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPP 212
           GSD+ WVQC+PC  CY Q DP+FDP+ S ++  V C S+ C  L      +  C      
Sbjct: 191 GSDVMWVQCKPCLECYVQADPLFDPATSATFSGVSCGSAICRILP-----TSACGDGELG 245

Query: 213 DCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLS 272
            C Y VSY DGSYT+G L  E L LG  +V   + GCG  N+GLF G +GLMGLG   +S
Sbjct: 246 GCEYEVSYADGSYTKGALALETLTLGGTAVEGVVIGCGHRNRGLFVGAAGLMGLGWGPMS 305

Query: 273 LVSQTSEIFGGLFSYCLPSTQDAGA------SGSLILGGNSSVFKNSTPITYTNMIPNPQ 326
           LV Q     GG FSYCL S    G+      +G L+LG + +V + +    +  ++ NP+
Sbjct: 306 LVGQLGGEVGGAFSYCLASRGGYGSGAADDDAGWLVLGRSEAVPEGA---VWVPLVRNPR 362

Query: 327 LATFYILNLTGISIGGKQ--LQASGF-----AKGGILIDSGTVITRLPPSIYSALKAEFL 379
             +FY + L+GI +G ++  LQA  F       G +++D+GT +TRLP   Y+AL+  F+
Sbjct: 363 APSFYYVGLSGIEVGDERLPLQAGLFQLTEDGAGDVVMDTGTTVTRLPQEAYAALRDAFV 422

Query: 380 KQFSG-FPSAPGF--SILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDA 436
              +G  P A G   S+LDTC++LS Y  V +P V   F+G+A + +    +   ++ D 
Sbjct: 423 GALAGAVPRAQGVSSSVLDTCYDLSGYASVRVPTVSFCFDGDARLILAARNV--LLEVDM 480

Query: 437 SQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
              CLA A  S      I+GN QQ   ++  D+ N  +GF   +C
Sbjct: 481 GIYCLAFAPSS--SGLSIMGNTQQAGIQITVDSANGYIGFGPANC 523


>gi|225216960|gb|ACN85252.1| aspartic proteinase nepenthesin-1 precursor [Oryza officinalis]
          Length = 519

 Score =  240 bits (613), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 149/369 (40%), Positives = 202/369 (54%), Gaps = 31/369 (8%)

Query: 125 TSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKS-CYNQQDPVFDPSISP 181
           +SG  L T NY+ T+ LG      TV+ DTGSD TWVQCQPC   CY Q++ +FDP+ S 
Sbjct: 170 SSGRALGTGNYVVTVGLGTPVSRYTVVFDTGSDTTWVQCQPCVVVCYEQREKLFDPARSS 229

Query: 182 SYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGK-A 240
           +Y  V C +  C  L     + G         C Y V YGDGSY+ G    + L L    
Sbjct: 230 TYANVSCAAPACSDLNIHGCSGG--------HCLYGVQYGDGSYSIGFFAMDTLTLSSYD 281

Query: 241 SVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGS 300
           +V  F FGCG  N+GLFG  +GL+GLGR   SL  QT + +GG+F++CLP    A ++G+
Sbjct: 282 AVKGFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLP----ARSTGT 337

Query: 301 LILGGNSSVFKNSTPITYTNMIPNPQLA----TFYILNLTGISIGGKQLQ--ASGFAKGG 354
             L      F   +    +  +  P L     TFY + +TGI +GG+ L    S FA  G
Sbjct: 338 GYLD-----FGAGSLAAASARLTTPMLTDNGPTFYYVGMTGIRVGGQLLSIPQSVFATAG 392

Query: 355 ILIDSGTVITRLPPSIYSALK--AEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVK 412
            ++DSGTVITRLPP+ YS+L+          G+  AP  S+LDTC++ +   +V IP V 
Sbjct: 393 TIVDSGTVITRLPPAAYSSLRYAFAAAMAARGYKKAPAVSLLDTCYDFTGMSQVAIPTVS 452

Query: 413 MEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNS 472
           + F+G A + VD +GI+Y   + ASQVCLA A+     + GI+GN Q K   V YD    
Sbjct: 453 LLFQGGARLDVDASGIMY--AASASQVCLAFAANEDGGDVGIVGNTQLKTFGVAYDIGKK 510

Query: 473 QLGFAGEDC 481
            +GF    C
Sbjct: 511 VVGFYPGAC 519


>gi|225216973|gb|ACN85264.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
          Length = 517

 Score =  240 bits (613), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 150/364 (41%), Positives = 200/364 (54%), Gaps = 31/364 (8%)

Query: 125 TSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKS-CYNQQDPVFDPSISP 181
           +SG  L T NY+ T+ LG      TV+ DTGSD TWVQCQPC   CY QQ+ +FDP  S 
Sbjct: 168 SSGRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQQEKLFDPVRSS 227

Query: 182 SYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGK-A 240
           +Y  V C +  C  L     + G         C Y V YGDGSY+ G    + L L    
Sbjct: 228 TYANVSCAAPACSDLNIHGCSGG--------HCLYGVQYGDGSYSIGFFAMDTLTLSSYD 279

Query: 241 SVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGS 300
           +V  F FGCG  N+GLFG  +GL+GLGR   SL  QT + +GG+F++CLP    A ++G+
Sbjct: 280 AVKGFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLP----ARSTGT 335

Query: 301 LILGGNSSVFKNSTPITYTNMIPNPQLA----TFYILNLTGISIGGKQLQ--ASGFAKGG 354
             L      F   +P   +  +  P L     TFY + +TGI +GG+ L    S FA  G
Sbjct: 336 GYLD-----FGAGSPAAASARLTTPMLTDNGPTFYYIGMTGIRVGGQLLSIPQSVFATAG 390

Query: 355 ILIDSGTVITRLPPSIYSALK--AEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVK 412
            ++DSGTVITRLPP  YS+L+          G+  AP  S+LDTC++ +   +V IP V 
Sbjct: 391 TIVDSGTVITRLPPPAYSSLRYAFAAAMAARGYKKAPAVSLLDTCYDFTGMSQVAIPTVS 450

Query: 413 MEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNS 472
           + F+G A + VD +GI+Y   + ASQVCLA A+     + GI+GN Q K   V YD    
Sbjct: 451 LLFQGGARLDVDASGIMY--AASASQVCLAFAANEDGGDVGIVGNTQLKTFGVAYDIGKK 508

Query: 473 QLGF 476
            +GF
Sbjct: 509 VVGF 512


>gi|413944392|gb|AFW77041.1| hypothetical protein ZEAMMB73_800604 [Zea mays]
          Length = 476

 Score =  240 bits (612), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 139/367 (37%), Positives = 189/367 (51%), Gaps = 19/367 (5%)

Query: 122 IPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKS-CYNQQDPVFDPS 178
           IP ++G  L TL ++ T+  G   +  TVI DTGSD++W+QC PC   CY Q DP+FDP+
Sbjct: 122 IPDSTGTSLDTLEFVVTVGFGTPAQTYTVIFDTGSDVSWIQCLPCSGHCYKQHDPIFDPT 181

Query: 179 ISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG 238
            S +Y  V C    C     A  +   CS+ +   C Y V YGDGS + G L  E L L 
Sbjct: 182 KSATYSVVPCGHPQC-----AAADGSKCSNGT---CLYKVEYGDGSSSAGVLSHETLSLT 233

Query: 239 KA-SVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGA 297
              ++  F FGCG+ N G FG V GL+GLGR  LSL SQ +  FGG FSYCLPS  D   
Sbjct: 234 STRALPGFAFGCGQTNLGDFGDVDGLIGLGRGQLSLSSQAAASFGGTFSYCLPS--DNTT 291

Query: 298 SGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASG--FAKGGI 355
            G L +G  +    +   + YT M+      +FY + L  I IGG  L      F   G 
Sbjct: 292 HGYLTIGPTTPASNDD--VQYTAMVQKQDYPSFYFVELVSIDIGGYILPVPPTLFTDDGT 349

Query: 356 LIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEF 415
            +DSGT++T LPP  Y+AL+  F    + +  AP +   DTC++ +    + IP V  +F
Sbjct: 350 FLDSGTILTYLPPEAYTALRDRFKFTMTQYKPAPAYDPFDTCYDFTGQSAIFIPAVSFKF 409

Query: 416 EGNAEMTVDVTGIVYFVKSDASQV-CLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQL 474
              +   +   GI+ F    A  + CL   +        I+GN QQ+N  VIYD    ++
Sbjct: 410 SDGSVFDLSFFGILIFPDDTAPAIGCLGFVARPSAMPFTIVGNMQQRNTEVIYDVAAEKI 469

Query: 475 GFAGEDC 481
           GFA   C
Sbjct: 470 GFASASC 476


>gi|255548660|ref|XP_002515386.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223545330|gb|EEF46835.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 387

 Score =  239 bits (611), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 155/397 (39%), Positives = 233/397 (58%), Gaps = 17/397 (4%)

Query: 92  LILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELGGR--NMTVI 149
           L+ D L V+ + +R  N  +G+       +IP+ SGI L   NY+  + LG    ++++ 
Sbjct: 2   LLQDQLRVKSMHARFSNKNAGSHFKEMQADIPVQSGIPLGAGNYLVKMALGTPKLSLSLA 61

Query: 150 VDTGSDLTWVQCQPC-KSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSS 208
           +DTGSD+TW QC+PC  SCY Q    FDP  S SYK V C+SS+C  +  + G  G  SS
Sbjct: 62  LDTGSDITWTQCEPCVGSCYRQAQTKFDPRKSSSYKNVSCSSSSCRIITDSGGARGCVSS 121

Query: 209 SSPPDCNYFVSYGDGSYTRGELGREHLGLGKASV-NDFIFGCGRNNKGLFGGVSGLMGLG 267
           +    C Y V YGDGSY+ G    E L +  + V ++F+FGCG+ N G FG ++GL+GLG
Sbjct: 122 T----CIYKVQYGDGSYSVGFFATEKLTISPSDVISNFLFGCGQQNAGRFGRIAGLLGLG 177

Query: 268 RSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQL 327
           R  LSL  QTSE +  LF+YCLPS   + ++G L LGG     +    + +T + P  + 
Sbjct: 178 RGKLSLALQTSEKYNNLFTYCLPSFSSS-STGHLTLGG-----QVPKSVKFTPLSPAFKN 231

Query: 328 ATFYILNLTGISIGGKQL--QASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGF 385
             FY +++ G+S+GG  L   AS F+  G +IDSGTVITRL P++YSAL ++F +    +
Sbjct: 232 TPFYGIDIKGLSVGGHVLPIDASVFSNAGAIIDSGTVITRLQPTVYSALSSKFQQLMKDY 291

Query: 386 PSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALAS 445
           P   GFSILDTC++ S  + +++P +   F+G  E+ +   GI+  + +   +VCLA A 
Sbjct: 292 PKTDGFSILDTCYDFSGNESISVPRISFFFKGGVEVDIKFFGILTVINA-WDKVCLAFAP 350

Query: 446 LSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
              + +  + GN QQ+   V++D    ++GFA   C+
Sbjct: 351 NDDDGDFVVFGNSQQQTYDVVHDLAKGRIGFAPSGCN 387


>gi|413944387|gb|AFW77036.1| hypothetical protein ZEAMMB73_461996 [Zea mays]
          Length = 472

 Score =  239 bits (610), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 151/383 (39%), Positives = 212/383 (55%), Gaps = 17/383 (4%)

Query: 107 KNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPC 164
           K   SG    +S+  IP + G  + +L Y+ T+ +G      TV++DTGSDL+WVQC+PC
Sbjct: 99  KAKASGRTTTLSDVSIPTSLGAAVDSLEYVVTLGIGTPAVQQTVLIDTGSDLSWVQCKPC 158

Query: 165 KS--CYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGD 222
            S  CY Q+DP++DP+ S +Y  V C+S  C  L     + G  +SS    C Y + YG+
Sbjct: 159 NSSSCYPQKDPLYDPTASSTYAPVPCDSKACKDLVPDAYDHGCTNSSGTSLCQYGIEYGN 218

Query: 223 GSYTRGELGREHLGLG-KASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIF 281
              T G    E L L  + SV DF FGCG   +G F    GL+GLG +  SLVSQT+E +
Sbjct: 219 RDTTVGVYSTETLTLSPQVSVKDFGFGCGLVQQGTFDLFDGLLGLGGAPESLVSQTAETY 278

Query: 282 GGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIG 341
           GG FSYCLP       +G L LG  ++   ++    +T +   P+ ATFY++NLTG+S+G
Sbjct: 279 GGAFSYCLPPGNS--TTGFLALGAPTN-NNDTAGFLFTPLHSLPEQATFYLVNLTGVSVG 335

Query: 342 GKQLQ-ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAP--GFSILDTCF 398
           GK L        GG++IDSGT+IT LP + YSAL+  F    S +P  P     +LDTC+
Sbjct: 336 GKPLDIPPTVLSGGMIIDSGTIITGLPDTAYSALRTAFRTAMSAYPLLPPNNDDVLDTCY 395

Query: 399 NLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNY 458
           N +    V +P V + F+G A + +DV   V        Q CLA A  + + + GIIGN 
Sbjct: 396 NFTGIANVTVPTVALTFDGGATIDLDVPSGVLI------QDCLAFAGGASDGDVGIIGNV 449

Query: 459 QQKNQRVIYDTKNSQLGFAGEDC 481
            Q+   V+YD+    +GF    C
Sbjct: 450 NQRTFEVLYDSGRGHVGFRPGAC 472


>gi|54290717|dbj|BAD62387.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
           sativa Japonica Group]
 gi|215734915|dbj|BAG95637.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 469

 Score =  239 bits (610), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 160/440 (36%), Positives = 230/440 (52%), Gaps = 41/440 (9%)

Query: 69  LELKH-KNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKN----------MISGNIK-- 115
           L L H ++ CS   +  +      +  D+  + +L SR+ N          ++ G+ K  
Sbjct: 45  LTLHHPQSPCSPAPLPSDLPFSAVVTHDDARIAHLASRLANNHPTSPSSSSLLHGHRKKK 104

Query: 116 -------DVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCK- 165
                    S++ +PLT G  +   NY+  + LG    +  ++VDTGS LTW+QC PC  
Sbjct: 105 AGGVGGSQASSSSVPLTPGASVAVGNYVTRLGLGTPATSYVMVVDTGSSLTWLQCSPCSV 164

Query: 166 SCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSY 225
           SC+ Q  PVFDP  S +Y  V C+SS C  L+ AT N   CS S+   C Y  SYGD SY
Sbjct: 165 SCHRQAGPVFDPRASGTYAAVQCSSSECGELQAATLNPSACSVSN--VCIYQASYGDSSY 222

Query: 226 TRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLF 285
           + G L ++ +  G  S   F +GCG++N+GLFG  +GL+GL ++ LSL+ Q +   G  F
Sbjct: 223 SVGYLSKDTVSFGSGSFPGFYYGCGQDNEGLFGRSAGLIGLAKNKLSLLYQLAPSLGYAF 282

Query: 286 SYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL 345
           SYCLP++  + A+G L +G       N    +YT M  +   A+ Y + L+GIS+ G  L
Sbjct: 283 SYCLPTS--SAAAGYLSIGS-----YNPGQYSYTPMASSSLDASLYFVTLSGISVAGAPL 335

Query: 346 QA--SGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPG-FSILDTCFNLSA 402
               S +     +IDSGTVITRLPP++Y+AL        +        +SILDTCF  SA
Sbjct: 336 AVPPSEYRSLPTIIDSGTVITRLPPNVYTALSRAVAAAMASAAPRAPTYSILDTCFRGSA 395

Query: 403 YQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKN 462
              + +P V M F G A + +    ++  +  D S  CLA A       T IIGN QQ+ 
Sbjct: 396 -AGLRVPRVDMAFAGGATLALSPGNVL--IDVDDSTTCLAFAP---TGGTAIIGNTQQQT 449

Query: 463 QRVIYDTKNSQLGFAGEDCS 482
             V+YD   S++GFA   CS
Sbjct: 450 FSVVYDVAQSRIGFAAGGCS 469


>gi|242066168|ref|XP_002454373.1| hypothetical protein SORBIDRAFT_04g029630 [Sorghum bicolor]
 gi|241934204|gb|EES07349.1| hypothetical protein SORBIDRAFT_04g029630 [Sorghum bicolor]
          Length = 458

 Score =  239 bits (609), Expect = 3e-60,   Method: Compositional matrix adjust.
 Identities = 159/431 (36%), Positives = 224/431 (51%), Gaps = 32/431 (7%)

Query: 69  LELKH-KNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDV---------- 117
           L L H ++ CS   +  +      L  D+  +  L +R+    S     +          
Sbjct: 43  LTLHHPRSPCSPAPLPADVPFSAVLTHDHARIASLAARLAKTPSSRPTKLRRGSSSSPDA 102

Query: 118 -SNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPC-KSCYNQQDP 173
            S   +PL  G  +   NY+  + LG   ++  ++VDTGS LTW+QC PC  SC+ Q  P
Sbjct: 103 ESLASVPLGPGTSVGVGNYVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCLVSCHRQSGP 162

Query: 174 VFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGRE 233
           VF+P  S SY  V C++  C AL  AT N   CS+S+   C Y  SYGD S++ G L ++
Sbjct: 163 VFNPRSSSSYASVSCSAPQCDALTTATLNPSTCSTSN--VCIYQASYGDSSFSVGYLSKD 220

Query: 234 HLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQ 293
            +  G  SV +F +GCG++N+GLFG  +GL+GL R+ LSL+ Q +   G  FSYCLP++ 
Sbjct: 221 TVSFGSTSVPNFYYGCGQDNEGLFGQSAGLIGLARNKLSLLYQLAPSMGYSFSYCLPTSS 280

Query: 294 DAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL--QASGFA 351
            +           S    N    +YT M  +    + Y + +TGI++ GK L   AS ++
Sbjct: 281 SSSGY-------LSIGSYNPGQYSYTPMAKSSLDDSLYFIKMTGITVAGKPLSVSASAYS 333

Query: 352 KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLV 411
               +IDSGTVITRLP  +YSAL         G P A  FSILDTCF   A   + +P V
Sbjct: 334 SLPTIIDSGTVITRLPTDVYSALSKAVAGAMKGTPRASAFSILDTCFQGQA-SRLRVPQV 392

Query: 412 KMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKN 471
            M F G A + +  T ++  V  D++  CLA A         IIGN QQ+   V+YD KN
Sbjct: 393 SMAFAGGAALKLKATNLL--VDVDSATTCLAFAP---ARSAAIIGNTQQQTFSVVYDVKN 447

Query: 472 SQLGFAGEDCS 482
           S++GFA   CS
Sbjct: 448 SKIGFAAGGCS 458


>gi|413938615|gb|AFW73166.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
          Length = 386

 Score =  239 bits (609), Expect = 3e-60,   Method: Compositional matrix adjust.
 Identities = 153/370 (41%), Positives = 209/370 (56%), Gaps = 28/370 (7%)

Query: 122 IPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCK---SCYNQQDPVFD 176
           +P + G  + TLNY+ T  LG  G   T+ VDTGSDL+WVQC+PC    SCY+Q+DP+FD
Sbjct: 35  VPASWGYDIGTLNYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCAAAPSCYSQKDPLFD 94

Query: 177 PSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLG 236
           P+ S SY  V C    C  L     ++   +        Y VSYGDGS T G    + L 
Sbjct: 95  PAQSSSYAAVPCGGPVCAGLGIYAASACSAAQC-----GYVVSYGDGSNTTGVYSSDTLT 149

Query: 237 LGKAS-VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDA 295
           L  +S V  F FGCG    GLF GV GL+GLGR   SLV QT+  +GG+FSYCLP+    
Sbjct: 150 LSASSAVQGFFFGCGHAQSGLFNGVDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPST 209

Query: 296 GASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ--ASGFAKG 353
               +L +GG S         + T ++P+P   T+Y++ LTGIS+GG+QL   AS FA  
Sbjct: 210 AGYLTLGVGGPSGAAPG---FSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAG- 265

Query: 354 GILIDSGTVITRLPPSIYSALKAEFLKQFS--GFPSAPGFSILDTCFNLSAYQEVNIPLV 411
           G ++D+GTV+TRLPP+ Y+AL++ F    +  G+P+AP   ILDTC+N + Y  V +P V
Sbjct: 266 GTVVDTGTVVTRLPPTAYAALRSAFRSGMASYGYPTAPSNGILDTCYNFAGYGTVTLPNV 325

Query: 412 KMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKN 471
            + F   A +T+   GI+ F        CLA A    +    I+GN QQ++  V  D   
Sbjct: 326 ALTFGSGATVTLGADGILSF-------GCLAFAPSGSDGGMAILGNVQQRSFEVRID--G 376

Query: 472 SQLGFAGEDC 481
           + +GF    C
Sbjct: 377 TSVGFKPSSC 386


>gi|356532386|ref|XP_003534754.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 463

 Score =  238 bits (608), Expect = 4e-60,   Method: Compositional matrix adjust.
 Identities = 158/409 (38%), Positives = 241/409 (58%), Gaps = 32/409 (7%)

Query: 95  DNLHVQYLQSRIKNMISGNIKDVSNTE-----------IPLTSGIRLQTLNYIATIELG- 142
           D   V++L SR+ N  S  +++ + T+            PL SG+ + + NY   I LG 
Sbjct: 64  DEERVRFLHSRLTNKES--VRNSATTDKLRGGPSLVSTTPLKSGLSIGSGNYYVKIGLGT 121

Query: 143 -GRNMTVIVDTGSDLTWVQCQPCK-SCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFAT 200
             +  ++IVDTGS L+W+QCQPC   C+ Q DP+F PS S +YK + C+SS C +L+ +T
Sbjct: 122 PAKYFSMIVDTGSSLSWLQCQPCVIYCHVQVDPIFTPSTSKTYKALPCSSSQCSSLKSST 181

Query: 201 GNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL--GKASVNDFIFGCGRNNKGLFG 258
            N+  CS+++   C Y  SYGD S++ G L ++ L L   +A  + F++GCG++N+GLFG
Sbjct: 182 LNAPGCSNATG-ACVYKASYGDTSFSIGYLSQDVLTLTPSEAPSSGFVYGCGQDNQGLFG 240

Query: 259 GVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGA----SGSLILGGNSSVFKNST 314
             SG++GL    +S++ Q S+ +G  FSYCLPS+  A      SG L +G +S     S+
Sbjct: 241 RSSGIIGLANDKISMLGQLSKKYGNAFSYCLPSSFSAPNSSSLSGFLSIGASS---LTSS 297

Query: 315 PITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKG-GILIDSGTVITRLPPSIYSA 373
           P  +T ++ N ++ + Y L+LT I++ GK L  S  +     +IDSGTVITRLP ++Y+A
Sbjct: 298 PYKFTPLVKNQKIPSLYFLDLTTITVAGKPLGVSASSYNVPTIIDSGTVITRLPVAVYNA 357

Query: 374 LKAEFLKQFS-GFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFV 432
           LK  F+   S  +  APGFSILDTCF  S  +   +P +++ F G A + +     +  V
Sbjct: 358 LKKSFVLIMSKKYAQAPGFSILDTCFKGSVKEMSTVPEIQIIFRGGAGLELKAHNSL--V 415

Query: 433 KSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
           + +    CLA+A+ S  +   IIGNYQQ+  +V YD  N ++GFA   C
Sbjct: 416 EIEKGTTCLAIAASS--NPISIIGNYQQQTFKVAYDVANFKIGFAPGGC 462


>gi|15238250|ref|NP_196637.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|8979710|emb|CAB96831.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
           thaliana]
 gi|18176136|gb|AAL59990.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
 gi|22136986|gb|AAM91722.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
 gi|110740988|dbj|BAE98588.1| nucleoid DNA-binding protein cnd41 - like protein [Arabidopsis
           thaliana]
 gi|332004210|gb|AED91593.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 464

 Score =  238 bits (607), Expect = 5e-60,   Method: Compositional matrix adjust.
 Identities = 172/479 (35%), Positives = 258/479 (53%), Gaps = 28/479 (5%)

Query: 10  ILSLLLPLMVSL-FLLAKGAHCFEGKKKLHLHKLQWQQKSGSSSSCVSHQKSRIEMGAIT 68
            LS+++ L V L +  A+GA   +  K L  + +Q      SSSSCV   K+     ++ 
Sbjct: 7   FLSMIIMLCVCLNWCFAEGAEKSDSGKVLDSYTIQVSSLFPSSSSCVPSSKASNTKSSLR 66

Query: 69  LELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGI 128
           +   H   CS    D        +  D   V+ + S++    +  + +  +TE+P  SGI
Sbjct: 67  VVHMH-GACSHLSSDARVDHDEIIRRDQARVESIYSKLSKNSANEVSEAKSTELPAKSGI 125

Query: 129 RLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPC-KSCYNQQDPVFDPSISPSYKK 185
            L + NYI TI +G    +++++ DTGSDLTW QC+PC  SCY+Q++P F+PS S +Y+ 
Sbjct: 126 TLGSGNYIVTIGIGTPKHDLSLVFDTGSDLTWTQCEPCLGSCYSQKEPKFNPSSSSTYQN 185

Query: 186 VLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASV-ND 244
           V C+S  C   E          S S  +C Y + YGD S+T+G L +E   L  + V  D
Sbjct: 186 VSCSSPMCEDAE----------SCSASNCVYSIVYGDKSFTQGFLAKEKFTLTNSDVLED 235

Query: 245 FIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILG 304
             FGCG NN+GLF GV+GL+GLG   LSL +QT+  +  +FSYCLPS   + ++G L  G
Sbjct: 236 VYFGCGENNQGLFDGVAGLLGLGPGKLSLPAQTTTTYNNIFSYCLPSFT-SNSTGHLTFG 294

Query: 305 GNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQAS--GFAKGGILIDSGTV 362
             S+    S   T  +  P+   A  Y +++ GIS+G K+L  +   F+  G +IDSGTV
Sbjct: 295 --SAGISESVKFTPISSFPS---AFNYGIDIIGISVGDKELAITPNSFSTEGAIIDSGTV 349

Query: 363 ITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMT 422
            TRLP  +Y+ L++ F ++ S + S  G+ + DTC++ +    V  P +   F G+  + 
Sbjct: 350 FTRLPTKVYAELRSVFKEKMSSYKSTSGYGLFDTCYDFTGLDTVTYPTIAFSFAGSTVVE 409

Query: 423 VDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
           +D +GI   +K   SQVCLA A    +D   I GN QQ    V+YD    ++GFA   C
Sbjct: 410 LDGSGISLPIK--ISQVCLAFA--GNDDLPAIFGNVQQTTLDVVYDVAGGRVGFAPNGC 464


>gi|224143371|ref|XP_002324933.1| predicted protein [Populus trichocarpa]
 gi|222866367|gb|EEF03498.1| predicted protein [Populus trichocarpa]
          Length = 463

 Score =  238 bits (607), Expect = 5e-60,   Method: Compositional matrix adjust.
 Identities = 160/461 (34%), Positives = 247/461 (53%), Gaps = 45/461 (9%)

Query: 36  KLHLHKLQWQQKSGSSSSCVSHQKSRIEMGAITLELKHK-------NYCSGKIVDWNE-Q 87
           + + H L+      S+  C    K+  E G+ +L+L H+          +     +NE  
Sbjct: 32  RAYFHTLKISSLP-STEVCKESSKALNE-GSSSLKLVHRFGPCNPHRTSTAPASSFNEIL 89

Query: 88  QQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GRN 145
           ++++L +D++    +Q+R    ++ +++ + ++ +P     ++   +YI  + +G   + 
Sbjct: 90  RRDKLRVDSI----IQARRSMNLTSSVEHMKSS-VPFYGLSKITASDYIVNVGIGTPKKE 144

Query: 146 MTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGV 205
           M +I DTGS L W QC+PCK+CY +  PVFDP+ S S+K + C+S  C ++         
Sbjct: 145 MPLIFDTGSGLIWTQCKPCKACYPKV-PVFDPTKSASFKGLPCSSKLCQSIRQG------ 197

Query: 206 CSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG--KASVNDFIFGCGRNNKGLFGGVSGL 263
           CSS   P C Y  +Y D S + G L  E +     K    + + GC     G   G SG+
Sbjct: 198 CSS---PKCTYLTAYVDNSSSTGTLATETISFSHLKYDFKNILIGCSDQVSGESLGESGI 254

Query: 264 MGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIP 323
           MGL RS +SL SQT+ I+  LFSYC+PST   G++G L  GG        +P++ T    
Sbjct: 255 MGLNRSPISLASQTANIYDKLFSYCIPST--PGSTGHLTFGGKVPNDVRFSPVSKT---- 308

Query: 324 NPQLATFYILNLTGISIGGKQL--QASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQ 381
               ++ Y + +TGIS+GG++L   AS F K    IDSG V+TRLPP  YSAL++ F + 
Sbjct: 309 --APSSDYDIKMTGISVGGRKLLIDASAF-KIASTIDSGAVLTRLPPKAYSALRSVFREM 365

Query: 382 FSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV-C 440
             G+P       LDTC++ S Y  V IP + + FEG  EM +DV+GI++ V    S+V C
Sbjct: 366 MKGYPLLDQDDFLDTCYDFSNYSTVAIPSISVFFEGGVEMDIDVSGIMWQVP--GSKVYC 423

Query: 441 LALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
           LA A L  +DE  I GN+QQK   V++D    ++GFA   C
Sbjct: 424 LAFAEL--DDEVSIFGNFQQKTYTVVFDGAKERIGFAPGGC 462


>gi|326490656|dbj|BAJ89995.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 456

 Score =  238 bits (607), Expect = 6e-60,   Method: Compositional matrix adjust.
 Identities = 163/426 (38%), Positives = 230/426 (53%), Gaps = 33/426 (7%)

Query: 65  GAITLELKHKNYCSGKIVDWNEQQ-QNRLILDNLHVQYLQSRIKNMISGNIKDV--SNTE 121
           G +T+ L H++     +   N    ++ L  D L   Y+ +R  + ++G+  DV  S+  
Sbjct: 55  GVVTVPLHHRHGPCSTVPSTNAPTLEDMLRRDQLRAAYI-TRKYSGVNGSAGDVEGSDVT 113

Query: 122 IPLTSGIRLQTLNYIATIELGGRNM--TVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSI 179
           +P T G  L TL Y+ T+ +G   +  T+++DTGSD++WVQC+PC  C++Q D +FDPS 
Sbjct: 114 VPTTLGTSLDTLEYLITVGMGSPAVAQTMLIDTGSDVSWVQCKPCSQCHSQADSLFDPSS 173

Query: 180 SPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGK 239
           S +Y    C S+ C  L       G CSSS    C Y V YGDGS   G    + L LG 
Sbjct: 174 SSTYSAFSCTSAACAQLR----QRG-CSSS---QCQYTVKYGDGSTGSGTYSSDTLALGS 225

Query: 240 ASVNDFIFGCGRNNKG--LFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGA 297
           ++V +F FGC ++  G  L    +GLMGLG    SL +QT+  FG  FSYCLP T   G+
Sbjct: 226 STVENFQFGCSQSESGNLLQDQTAGLMGLGGGAESLATQTAGTFGKAFSYCLPPT--PGS 283

Query: 298 SGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ--ASGFAKGGI 355
           SG L LG ++S F   TP+  +  +P+     +Y + L  I +GG+QL   AS F+ G I
Sbjct: 284 SGFLTLGASTSGFVVKTPMLRSTQVPS-----YYGVLLQAIRVGGRQLNIPASAFSAGSI 338

Query: 356 LIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEF 415
           + DSGT+ITRLP + YSAL + F      +P A    I DTCF+ S    V+IP V + F
Sbjct: 339 M-DSGTIITRLPRTAYSALSSAFKAGMKQYPPAQPMGIFDTCFDFSGQSSVSIPTVALVF 397

Query: 416 EGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLG 475
            G A + +   GI+          CLA A+ S +   GIIGN QQ+   V+YD     +G
Sbjct: 398 SGGAVVDLASDGIIL-------GSCLAFAANSDDTSLGIIGNVQQRTFEVLYDVGGGAVG 450

Query: 476 FAGEDC 481
           F    C
Sbjct: 451 FKAGAC 456


>gi|24417232|gb|AAN60226.1| unknown [Arabidopsis thaliana]
          Length = 464

 Score =  238 bits (606), Expect = 6e-60,   Method: Compositional matrix adjust.
 Identities = 172/479 (35%), Positives = 257/479 (53%), Gaps = 28/479 (5%)

Query: 10  ILSLLLPLMVSL-FLLAKGAHCFEGKKKLHLHKLQWQQKSGSSSSCVSHQKSRIEMGAIT 68
            LS+++ L V L +  A+GA   +  K L  + +Q      SSSSCV   K+     ++ 
Sbjct: 7   FLSMIIMLCVCLNWCFAEGAEKSDSGKVLDSYTIQVSSLFPSSSSCVPSSKASNTKSSLR 66

Query: 69  LELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGI 128
           +   H   CS    D        +  D   V+ + S++    +  + +  +TE+P  SGI
Sbjct: 67  VVHMH-GACSHLSSDARVDHDEIIRRDQARVESIYSKLSKNSANEVSEAKSTELPAKSGI 125

Query: 129 RLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPC-KSCYNQQDPVFDPSISPSYKK 185
            L + NYI TI +G    +++++ DTGSDLTW QC+PC  SCY+Q++P F+PS S +Y+ 
Sbjct: 126 TLGSGNYIVTIGIGTPKHDLSLVFDTGSDLTWTQCEPCLGSCYSQKEPKFNPSSSSTYQN 185

Query: 186 VLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASV-ND 244
           V C+S  C   E          S S  +C Y + YGD S+T+G L +E   L  + V  D
Sbjct: 186 VSCSSPMCEDAE----------SCSASNCVYSIGYGDKSFTQGFLAKEKFTLTNSDVLED 235

Query: 245 FIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILG 304
             FGCG NN+GLF GV+GL+GLG   LSL +QT+  +  +FSYCLPS   + ++G L  G
Sbjct: 236 VYFGCGENNQGLFDGVAGLLGLGPGKLSLPAQTTTTYNNIFSYCLPSFT-SNSTGHLTFG 294

Query: 305 GNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQAS--GFAKGGILIDSGTV 362
             S+    S   T  +  P+   A  Y +++ GIS+G K+L  +   F+  G +IDSGTV
Sbjct: 295 --SAGISESVKFTPISSFPS---AFNYGIDIIGISVGDKELAITPNSFSTEGAIIDSGTV 349

Query: 363 ITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMT 422
            TRLP  +Y+ L++ F ++ S + S  G+ + DTC++ +    V  P +   F G   + 
Sbjct: 350 FTRLPTKVYAELRSVFKEKMSSYKSTSGYGLFDTCYDFTGLDTVTYPTIAFSFAGGTVVE 409

Query: 423 VDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
           +D +GI   +K   SQVCLA A    +D   I GN QQ    V+YD    ++GFA   C
Sbjct: 410 LDGSGISLPIK--ISQVCLAFA--GNDDLPAIFGNVQQTTLDVVYDVAGGRVGFAPNGC 464


>gi|226509408|ref|NP_001141440.1| uncharacterized protein LOC100273550 precursor [Zea mays]
 gi|194704586|gb|ACF86377.1| unknown [Zea mays]
 gi|413938617|gb|AFW73168.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
          Length = 478

 Score =  237 bits (605), Expect = 9e-60,   Method: Compositional matrix adjust.
 Identities = 152/365 (41%), Positives = 206/365 (56%), Gaps = 28/365 (7%)

Query: 127 GIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPC---KSCYNQQDPVFDPSISP 181
           G  + TLNY+ T  LG  G   T+ VDTGSDL+WVQC+PC    SCY+Q+DP+FDP+ S 
Sbjct: 132 GYDIGTLNYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCAAAPSCYSQKDPLFDPAQSS 191

Query: 182 SYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS 241
           SY  V C    C  L     ++   +        Y VSYGDGS T G    + L L  +S
Sbjct: 192 SYAAVPCGGPVCAGLGIYAASACSAAQC-----GYVVSYGDGSNTTGVYSSDTLTLSASS 246

Query: 242 -VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGS 300
            V  F FGCG    GLF GV GL+GLGR   SLV QT+  +GG+FSYCLP+        +
Sbjct: 247 AVQGFFFGCGHAQSGLFNGVDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPSTAGYLT 306

Query: 301 LILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ--ASGFAKGGILID 358
           L +GG S         + T ++P+P   T+Y++ LTGIS+GG+QL   AS FA  G ++D
Sbjct: 307 LGVGGPSGAAPG---FSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAG-GTVVD 362

Query: 359 SGTVITRLPPSIYSALKAEFLKQFS--GFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFE 416
           +GTV+TRLPP+ Y+AL++ F    +  G+P+AP   ILDTC+N + Y  V +P V + F 
Sbjct: 363 TGTVVTRLPPTAYAALRSAFRSGMASYGYPTAPSNGILDTCYNFAGYGTVTLPNVALTFG 422

Query: 417 GNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGF 476
             A +T+   GI+ F        CLA A    +    I+GN QQ++  V  D   + +GF
Sbjct: 423 SGATVTLGADGILSF-------GCLAFAPSGSDGGMAILGNVQQRSFEVRID--GTSVGF 473

Query: 477 AGEDC 481
               C
Sbjct: 474 KPSSC 478


>gi|302774174|ref|XP_002970504.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
 gi|300162020|gb|EFJ28634.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
          Length = 483

 Score =  237 bits (604), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 166/453 (36%), Positives = 249/453 (54%), Gaps = 46/453 (10%)

Query: 58  QKSRIEMGAITLELKHKNYCSGKIVDWNEQQQNRLI----LDNLHVQYLQSRIKNMISGN 113
           Q S  + G ++LEL H+N    +  +     +  L+     D   V++++S  K  ++G 
Sbjct: 47  QLSPRDGGTLSLELIHRNSLLREAKEKLHTHEQLLLETLQRDEQRVRWIES--KAQLAGK 104

Query: 114 IKD-VSNTEI--PLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCY 168
            KD  S+T++  P+TSG+   +  Y   + +G   R++ ++VDTGSDL W+QCQPCKSCY
Sbjct: 105 KKDEASSTDLNGPVTSGLLYGSGEYFVRLGVGTPARSLFMVVDTGSDLPWLQCQPCKSCY 164

Query: 169 NQQDPVFDPSISPSYKKVLCNSSTCHALEF--ATGNSGVCSSSSPPDCNYFVSYGDGSYT 226
            Q DP+FDP  S S++++ C S  C ALE    +G+ G  S      C+Y V+YGDGS++
Sbjct: 165 KQADPIFDPRNSSSFQRIPCLSPLCKALEIHSCSGSRGATSR-----CSYQVAYGDGSFS 219

Query: 227 RGELGREHLGLGKAS-VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQ-----TSEI 280
            G+   +   LG  S      FGCG +N+GLF G +GL+GLG   LS  SQ     T+  
Sbjct: 220 VGDFSSDLFTLGTGSKAMSVAFGCGFDNEGLFAGAAGLLGLGAGKLSFPSQIFASSTNSS 279

Query: 281 FGGLFSYCL-----PSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNL 335
               FSYCL     P T+   +S SLI G  +      +    + ++ NP+L TFY   +
Sbjct: 280 TANSFSYCLVDRSNPMTR---SSSSLIFGAAAI----PSTAALSPLLKNPKLDTFYYAAM 332

Query: 336 TGISIGGKQ-------LQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSA 388
            G+S+GG Q       LQ S    GG++IDSGT +TR P S+Y+ ++  F    +  PSA
Sbjct: 333 IGVSVGGAQLPISLKSLQLSQSGSGGVIIDSGTSVTRFPTSVYATIRDAFRNATTNLPSA 392

Query: 389 PGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSY 448
           P +S+ DTC+N S    V++P + + FE  A++ +  T  +  + + A   CLA A  S 
Sbjct: 393 PRYSLFDTCYNFSGKASVDVPALVLHFENGADLQLPPTNYLIPINT-AGSFCLAFAPTSM 451

Query: 449 EDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
             E GIIGN QQ++ R+ +D + S L FA + C
Sbjct: 452 --ELGIIGNIQQQSFRIGFDLQKSHLAFAPQQC 482


>gi|125524353|gb|EAY72467.1| hypothetical protein OsI_00323 [Oryza sativa Indica Group]
          Length = 500

 Score =  237 bits (604), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 153/389 (39%), Positives = 219/389 (56%), Gaps = 35/389 (8%)

Query: 108 NMISGNIK--DVSNTEI--PLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQC 161
           +++  N+   + S  EI  P+ SG+ L +  Y + + +G   R + +++DTGSD+TWVQC
Sbjct: 132 DLVPANVTAFEASAAEIQGPVVSGVGLGSGEYFSRVGVGSPARQLYMVLDTGSDVTWVQC 191

Query: 162 QPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYG 221
           QPC  CY Q DPVFDPS+S SY  V C++  CH L+ A      C +S+   C Y V+YG
Sbjct: 192 QPCADCYQQSDPVFDPSLSTSYASVACDNPRCHDLDAA-----ACRNSTGA-CLYEVAYG 245

Query: 222 DGSYTRGELGREHLGLGK-ASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEI 280
           DGSYT G+   E L LG  A V+    GCG +N+GLF G +GL+ LG   LS  SQ S  
Sbjct: 246 DGSYTVGDFATETLTLGDSAPVSSVAIGCGHDNEGLFVGAAGLLALGGGPLSFPSQISAT 305

Query: 281 FGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISI 340
               FSYCL   +D+ +S +L   G+++  + + P     +I +P+ +TFY + L+GIS+
Sbjct: 306 ---TFSYCL-VDRDSPSSSTLQF-GDAADAEVTAP-----LIRSPRTSTFYYVGLSGISV 355

Query: 341 GGKQLQ--ASGFA-----KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI 393
           GG+ L    S FA      GG+++DSGT +TRL  S Y+AL+  F++     P   G S+
Sbjct: 356 GGQILSIPPSAFAMDGTGAGGVIVDSGTAVTRLQSSAYAALRDAFVRGTQSLPRTSGVSL 415

Query: 394 LDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSD-ASQVCLALASLSYEDET 452
            DTC++LS    V +P V + F G  E+ +      Y +  D A   CLA A  +     
Sbjct: 416 FDTCYDLSDRTSVEVPAVSLRFAGGGELRLPAKN--YLIPVDGAGTYCLAFAPTNA--AV 471

Query: 453 GIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
            IIGN QQ+  RV +DT  S +GF    C
Sbjct: 472 SIIGNVQQQGTRVSFDTAKSTVGFTSNKC 500


>gi|217074470|gb|ACJ85595.1| unknown [Medicago truncatula]
 gi|388505166|gb|AFK40649.1| unknown [Medicago truncatula]
          Length = 452

 Score =  237 bits (604), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 158/402 (39%), Positives = 234/402 (58%), Gaps = 24/402 (5%)

Query: 95  DNLHVQYLQSRIKNMISGN--IKDVSN--TEIPLTSGIRLQTLNYIATIELGG--RNMTV 148
           D   ++Y  SR+      N   K V      IPL SG+ + + NY   + LG   +  T+
Sbjct: 59  DEERIRYFHSRLAKNSDANASFKKVGPKLAGIPLKSGLSMGSGNYYVKMGLGSPTKYYTM 118

Query: 149 IVDTGSDLTWVQCQPCK-SCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCS 207
           IVDTGS  +W+QCQPC   C+ Q+DPVF+PS S +YK V C+SS C +L+ AT N   CS
Sbjct: 119 IVDTGSSFSWLQCQPCTIYCHIQEDPVFNPSASKTYKTVPCSSSQCSSLKSATLNEPTCS 178

Query: 208 SSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCGRNNKGLFGGVSGLMGL 266
             S   C Y  SYGD S++ G L ++ L L  + +++ F++GCG++N+GLFG   G++GL
Sbjct: 179 KQSN-ACVYKASYGDSSFSLGYLSQDVLTLTPSQTLSSFVYGCGQDNQGLFGRTDGIIGL 237

Query: 267 GRSDLSLVSQTSEIFGGLFSYCLP---STQDAGASGSLILGGNSSVFKNSTPITYTNMIP 323
             ++LS++SQ S  +G  FSYCLP   ST ++   G L +G  +S    S+   +T ++ 
Sbjct: 238 ANNELSMLSQLSGKYGNAFSYCLPTSFSTPNSPKEGFLSIG--TSSLTPSSSYKFTPLLK 295

Query: 324 NPQLATFYILNLTGISIGGKQL-QASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQF 382
           NP   + Y ++L  I++ G+ L  A+   K   +IDSGTVITRLP  +Y+ LK  ++   
Sbjct: 296 NPNNPSLYFIDLESITVAGRPLGVAASSYKVPTIIDSGTVITRLPTPVYTTLKNAYVTIL 355

Query: 383 S-GFPSAPGFSILDTCF--NLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV 439
           S  +  APG S+LDTCF  +L+   EV  P +++ F+G A++   + G    V+ +    
Sbjct: 356 SKKYQQAPGISLLDTCFKGSLAGISEV-APDIRIIFKGGADL--QLKGHNSLVELETGIT 412

Query: 440 CLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
           CLA+A  S      IIGNYQQ+  +V YD  NS++GFA   C
Sbjct: 413 CLAMAGSS---SIAIIGNYQQQTVKVAYDVGNSRVGFAPGGC 451


>gi|357448247|ref|XP_003594399.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355483447|gb|AES64650.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 452

 Score =  236 bits (603), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 158/402 (39%), Positives = 234/402 (58%), Gaps = 24/402 (5%)

Query: 95  DNLHVQYLQSRIKNMISGNI--KDVSN--TEIPLTSGIRLQTLNYIATIELGG--RNMTV 148
           D   ++Y  SR+      N   K V      IPL SG+ + + NY   + LG   +  T+
Sbjct: 59  DEERIRYFHSRLAKNSDANASSKKVGPKLAGIPLKSGLSMGSGNYYVKMGLGSPTKYYTM 118

Query: 149 IVDTGSDLTWVQCQPCK-SCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCS 207
           IVDTGS  +W+QCQPC   C+ Q+DPVF+PS S +YK V C+SS C +L+ AT N   CS
Sbjct: 119 IVDTGSSFSWLQCQPCTIYCHIQEDPVFNPSASKTYKTVPCSSSQCSSLKSATLNEPTCS 178

Query: 208 SSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCGRNNKGLFGGVSGLMGL 266
             S   C Y  SYGD S++ G L ++ L L  + +++ F++GCG++N+GLFG   G++GL
Sbjct: 179 KQSN-ACVYKASYGDSSFSLGYLSQDVLTLTPSQTLSSFVYGCGQDNQGLFGRTDGIIGL 237

Query: 267 GRSDLSLVSQTSEIFGGLFSYCLP---STQDAGASGSLILGGNSSVFKNSTPITYTNMIP 323
             ++LS++SQ S  +G  FSYCLP   ST ++   G L +G  +S    S+   +T ++ 
Sbjct: 238 ANNELSMLSQLSGKYGNAFSYCLPTSFSTPNSPKEGFLSIG--TSSLTPSSSYKFTPLLK 295

Query: 324 NPQLATFYILNLTGISIGGKQL-QASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQF 382
           NP   + Y ++L  I++ G+ L  A+   K   +IDSGTVITRLP  +Y+ LK  ++   
Sbjct: 296 NPNNPSLYFIDLESITVAGRPLGVAASSYKVPTIIDSGTVITRLPTPVYTTLKNAYVTIL 355

Query: 383 S-GFPSAPGFSILDTCF--NLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV 439
           S  +  APG S+LDTCF  +L+   EV  P +++ F+G A++   + G    V+ +    
Sbjct: 356 SKKYQQAPGISLLDTCFKGSLAGISEV-APDIRIIFKGGADL--QLKGHNSLVELETGIT 412

Query: 440 CLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
           CLA+A  S      IIGNYQQ+  +V YD  NS++GFA   C
Sbjct: 413 CLAMAGSS---SIAIIGNYQQQTVKVAYDVGNSRVGFAPGGC 451


>gi|222635451|gb|EEE65583.1| hypothetical protein OsJ_21095 [Oryza sativa Japonica Group]
          Length = 441

 Score =  236 bits (602), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 158/435 (36%), Positives = 228/435 (52%), Gaps = 30/435 (6%)

Query: 66  AITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIK--NMISGNIKDVSN--TE 121
           ++ L  +H                 RL  D     Y+ ++       +  + D +   T 
Sbjct: 18  SVPLVHRHGPCAPSAASGGKPSLAERLRRDRARTNYIVTKATGGRTAATALSDAAGGGTS 77

Query: 122 IPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKS--CYNQQDPVFDP 177
           IP   G  + +L Y+ T+ +G      TV++DTGSDL+WVQC+PC +  CY Q+DP+FDP
Sbjct: 78  IPTFLGDSVNSLEYVVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCGAGECYAQKDPLFDP 137

Query: 178 SISPSYKKVLCNSSTCHALE---FATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREH 234
           S S SY  V C+S  C  L    +  G +GV S  +   C Y + YG+ + T G    E 
Sbjct: 138 SSSSSYASVPCDSDACRKLAAGAYGHGCTGV-SGGAAALCEYGIEYGNRATTTGVYSTET 196

Query: 235 LGLGKASV-NDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQ 293
           L L    V  DF FGCG +  G +    GL+GLG +  SLVSQTS  FGG FSYCLP T 
Sbjct: 197 LTLKPGVVVADFGFGCGDHQHGPYEKFDGLLGLGGAPESLVSQTSSQFGGPFSYCLPPT- 255

Query: 294 DAGASGSLILGG--NSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ--ASG 349
            +G +G L LG   NSS    ++ +++T M   P + TFYI+ LTGIS+GG  L    S 
Sbjct: 256 -SGGAGFLTLGAPPNSSSSTAASGLSFTPMRRLPSVPTFYIVTLTGISVGGAPLAIPPSA 314

Query: 350 FAKGGILIDSGTVITRLPPSIYSALKAEF---LKQFSGFPSAPGFSILDTCFNLSAYQEV 406
           F+  G++IDSGTVIT LP + Y+AL++ F   + ++   P + G  +LDTC++ + +  V
Sbjct: 315 FSS-GMVIDSGTVITGLPATAYAALRSAFRSAMSEYRLLPPSNG-GVLDTCYDFTGHANV 372

Query: 407 NIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVI 466
            +P + + F G A  T+D+      +       CLA A    ++  GIIGN  Q+   V+
Sbjct: 373 TVPTISLTFSGGA--TIDLAAPAGVLVDG----CLAFAGAGTDNAIGIIGNVNQRTFEVL 426

Query: 467 YDTKNSQLGFAGEDC 481
           YD+    +GF    C
Sbjct: 427 YDSGKGTVGFRAGAC 441


>gi|115448353|ref|NP_001047956.1| Os02g0720900 [Oryza sativa Japonica Group]
 gi|45735844|dbj|BAD12879.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
 gi|45735970|dbj|BAD12999.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
 gi|113537487|dbj|BAF09870.1| Os02g0720900 [Oryza sativa Japonica Group]
 gi|125540930|gb|EAY87325.1| hypothetical protein OsI_08729 [Oryza sativa Indica Group]
 gi|215692622|dbj|BAG88042.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 458

 Score =  236 bits (602), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 157/430 (36%), Positives = 221/430 (51%), Gaps = 32/430 (7%)

Query: 69  LELKH-KNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDV---------- 117
           LEL H ++ CS   V  +      L  D+  +  L +R+    S     +          
Sbjct: 45  LELHHPRSPCSPAPVPADLPFTAVLTHDDARISSLAARLAKTPSARATSLDADADAGLAG 104

Query: 118 SNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPC-KSCYNQQDPV 174
           S   +PL+ G  +   NY+  + LG       ++VDTGS LTW+QC PC  SC+ Q  PV
Sbjct: 105 SLASVPLSPGASVGVGNYVTRMGLGTPATQYVMVVDTGSSLTWLQCSPCLVSCHRQSGPV 164

Query: 175 FDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREH 234
           F+P  S +Y  V C++  C  L  AT N   CSSS+   C Y  SYGD S++ G L ++ 
Sbjct: 165 FNPKSSSTYASVGCSAQQCSDLPSATLNPSACSSSN--VCIYQASYGDSSFSVGYLSKDT 222

Query: 235 LGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQD 294
           +  G  S+ +F +GCG++N+GLFG  +GL+GL R+ LSL+ Q +   G  F+YCLPS+  
Sbjct: 223 VSFGSTSLPNFYYGCGQDNEGLFGRSAGLIGLARNKLSLLYQLAPSLGYSFTYCLPSSSS 282

Query: 295 AGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGG 354
           +G          S    N    +YT M+ +    + Y + L+G+++ G  L  S  A   
Sbjct: 283 SGYL--------SLGSYNPGQYSYTPMVSSSLDDSLYFIKLSGMTVAGNPLSVSSSAYSS 334

Query: 355 I--LIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVK 412
           +  +IDSGTVITRLP S+YSAL         G   A  +SILDTCF   A   V+ P V 
Sbjct: 335 LPTIIDSGTVITRLPTSVYSALSKAVAAAMKGTSRASAYSILDTCFKGQA-SRVSAPAVT 393

Query: 413 MEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNS 472
           M F G A + +    ++  V  D S  CLA A         IIGN QQ+   V+YD K+S
Sbjct: 394 MSFAGGAALKLSAQNLL--VDVDDSTTCLAFAP---ARSAAIIGNTQQQTFSVVYDVKSS 448

Query: 473 QLGFAGEDCS 482
           ++GFA   CS
Sbjct: 449 RIGFAAGGCS 458


>gi|357143328|ref|XP_003572882.1| PREDICTED: uncharacterized protein LOC100846829 [Brachypodium
           distachyon]
          Length = 836

 Score =  236 bits (601), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 167/408 (40%), Positives = 230/408 (56%), Gaps = 39/408 (9%)

Query: 92  LILDNLHVQYLQSRIKNMIS-GNIKDVS------NTEIPLTSGIRLQTLNYIATIELG-- 142
           L  D    +Y+Q R+      G ++  +      +  IP   G  + TL Y+ T+ LG  
Sbjct: 450 LRADERRAEYIQRRMSGAKGPGGLQQFTAASSSKSVTIPANIGHSIGTLQYVVTVSLGTP 509

Query: 143 GRNMTVIVDTGSDLTWVQCQPCKSCYN--QQDPVFDPSISPSYKKVLCNSSTCHALEFAT 200
           G   TV VDTGSD++WVQC PC +     Q+D +FDP+ S SY  V C +  C   E +T
Sbjct: 510 GVAQTVEVDTGSDVSWVQCAPCAAPACYAQKDQLFDPAKSSSYSAVPCAADACS--ELST 567

Query: 201 GNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCGRNNKGLFGG 259
              G C++ S   C Y VSYGDGS T G  G + L L  A +V  F+FGCG    GLF G
Sbjct: 568 YGHG-CAAGS--QCGYVVSYGDGSNTTGVYGSDTLTLTDADAVTGFLFGCGHAQAGLFAG 624

Query: 260 VSGLMGLGRSDLSLVSQTSEIF-GGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITY 318
           + GL+ LGR  +SL SQTS  + GG+FSYCLP +    ++G L LGG SS    +T    
Sbjct: 625 IDGLLALGRKGMSLTSQTSGAYGGGVFSYCLPPSPS--STGFLTLGGPSSASGFAT---- 678

Query: 319 TNMIPNPQLATFYILNLTGISIGGKQLQ---ASGFAKGGILIDSGTVITRLPPSIYSALK 375
           T ++    + TFY++ LTGI +GG+QL    AS FA GG ++D+GTVITRLPP+ Y+AL+
Sbjct: 679 TGLLTAWDVPTFYMVMLTGIGVGGQQLSGVPASAFA-GGTVVDTGTVITRLPPTAYAALR 737

Query: 376 AEFLKQFS--GFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVK 433
           A F    +  G+P+AP   ILDTC+N + Y  V +P V + F G A + +D  G +    
Sbjct: 738 AAFRAAMAPYGYPAAPATGILDTCYNFTDYGTVTLPTVSLTFSGGATLKLDAPGFL---- 793

Query: 434 SDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
              S  CLA A+ S + +  I+GN QQ++  V +D   S +GF    C
Sbjct: 794 ---SSGCLAFATNSGDGDPAILGNVQQRSFAVRFD--GSSVGFMPHSC 836


>gi|357124468|ref|XP_003563922.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 450

 Score =  236 bits (601), Expect = 3e-59,   Method: Compositional matrix adjust.
 Identities = 156/395 (39%), Positives = 221/395 (55%), Gaps = 22/395 (5%)

Query: 95  DNLHVQYLQSRIKNMISGNIKD-VSNTEIPLTSGIRLQTLNYIATIELGGRNMT--VIVD 151
           D   +  L SR+        KD V+ + +PL SG  +   NYI  + LG    T  ++VD
Sbjct: 71  DAARIAGLASRLAT----KDKDWVAASSVPLASGASVGVGNYITRLGLGTPTTTYVMVVD 126

Query: 152 TGSDLTWVQCQPCK-SCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSS 210
           +GS LTW+QC PC  SC+ Q  P++DP  S +Y  V C++  C  L+ AT N   CS S 
Sbjct: 127 SGSSLTWLQCAPCAVSCHPQAGPLYDPRASSTYAAVPCSAPQCAELQAATLNPSSCSGSG 186

Query: 211 PPDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCGRNNKGLFGGVSGLMGLGRS 269
              C Y  SYGDGS++ G L ++ + L  + S   F +GCG++N GLFG  +GL+GL R+
Sbjct: 187 --VCQYQASYGDGSFSFGYLSKDTVSLSSSGSFPGFYYGCGQDNVGLFGRAAGLIGLARN 244

Query: 270 DLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLAT 329
            LSL+SQ +   G  F+YCLP T  A ++G L  G NS   KN    +YT+M+ +   A+
Sbjct: 245 KLSLLSQLAPSVGNSFAYCLP-TSAAASAGYLSFGSNSDN-KNPGKYSYTSMVSSSLDAS 302

Query: 330 FYILNLTGISIGGKQLQASGFAKGGI--LIDSGTVITRLPPSIYSALKAEFLKQFSGFPS 387
            Y ++L G+S+ G  L       G +  +IDSGTVITRLP  +Y+AL ++ +      PS
Sbjct: 303 LYFVSLAGMSVAGSPLAVPSSEYGSLPTIIDSGTVITRLPTPVYTAL-SKAVGAALAAPS 361

Query: 388 APGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLS 447
           AP +SIL TCF      ++ +P V M F G A + +    ++  V  + +  CLA A   
Sbjct: 362 APAYSILQTCFK-GQVAKLPVPAVNMAFAGGATLRLTPGNVL--VDVNETTTCLAFAP-- 416

Query: 448 YEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
             D T IIGN QQ+   V+YD K S++GFA   CS
Sbjct: 417 -TDSTAIIGNTQQQTFSVVYDVKGSRIGFAAGGCS 450


>gi|54290731|dbj|BAD62401.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
          Length = 521

 Score =  236 bits (601), Expect = 3e-59,   Method: Compositional matrix adjust.
 Identities = 156/411 (37%), Positives = 223/411 (54%), Gaps = 30/411 (7%)

Query: 90  NRLILDNLHVQYLQSRIK--NMISGNIKDVSN--TEIPLTSGIRLQTLNYIATIELG--G 143
            RL  D     Y+ ++       +  + D +   T IP   G  + +L Y+ T+ +G   
Sbjct: 122 ERLRRDRARTNYIVTKATGGRTAATALSDAAGGGTSIPTFLGDSVNSLEYVVTLGIGTPA 181

Query: 144 RNMTVIVDTGSDLTWVQCQPCKS--CYNQQDPVFDPSISPSYKKVLCNSSTCHALE---F 198
              TV++DTGSDL+WVQC+PC +  CY Q+DP+FDPS S SY  V C+S  C  L    +
Sbjct: 182 VQQTVLIDTGSDLSWVQCKPCGAGECYAQKDPLFDPSSSSSYASVPCDSDACRKLAAGAY 241

Query: 199 ATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASV-NDFIFGCGRNNKGLF 257
             G +GV S  +   C Y + YG+ + T G    E L L    V  DF FGCG +  G +
Sbjct: 242 GHGCTGV-SGGAAALCEYGIEYGNRATTTGVYSTETLTLKPGVVVADFGFGCGDHQHGPY 300

Query: 258 GGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGG--NSSVFKNSTP 315
               GL+GLG +  SLVSQTS  FGG FSYCLP T  +G +G L LG   NSS    ++ 
Sbjct: 301 EKFDGLLGLGGAPESLVSQTSSQFGGPFSYCLPPT--SGGAGFLTLGAPPNSSSSTAASG 358

Query: 316 ITYTNMIPNPQLATFYILNLTGISIGGKQLQ--ASGFAKGGILIDSGTVITRLPPSIYSA 373
           +++T M   P + TFYI+ LTGIS+GG  L    S F+  G++IDSGTVIT LP + Y+A
Sbjct: 359 LSFTPMRRLPSVPTFYIVTLTGISVGGAPLAIPPSAFSS-GMVIDSGTVITGLPATAYAA 417

Query: 374 LKAEF---LKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVY 430
           L++ F   + ++   P + G  +LDTC++ + +  V +P + + F G A  T+D+     
Sbjct: 418 LRSAFRSAMSEYRLLPPSNG-GVLDTCYDFTGHANVTVPTISLTFSGGA--TIDLAAPAG 474

Query: 431 FVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
            +       CLA A    ++  GIIGN  Q+   V+YD+    +GF    C
Sbjct: 475 VLVDG----CLAFAGAGTDNAIGIIGNVNQRTFEVLYDSGKGTVGFRAGAC 521


>gi|115434442|ref|NP_001041979.1| Os01g0140100 [Oryza sativa Japonica Group]
 gi|12328547|dbj|BAB21205.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
 gi|113531510|dbj|BAF03893.1| Os01g0140100 [Oryza sativa Japonica Group]
 gi|125568961|gb|EAZ10476.1| hypothetical protein OsJ_00309 [Oryza sativa Japonica Group]
 gi|215697206|dbj|BAG91200.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 504

 Score =  235 bits (600), Expect = 3e-59,   Method: Compositional matrix adjust.
 Identities = 152/389 (39%), Positives = 219/389 (56%), Gaps = 35/389 (8%)

Query: 108 NMISGNIK--DVSNTEI--PLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQC 161
           +++  N+   + S  EI  P+ SG+ L +  Y + + +G   R + +++DTGSD+TWVQC
Sbjct: 136 DLVPANVTAFEASAAEIQGPVVSGVGLGSGEYFSRVGVGSPARQLYMVLDTGSDVTWVQC 195

Query: 162 QPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYG 221
           QPC  CY Q DPVFDPS+S SY  V C++  CH L+ A      C +S+   C Y V+YG
Sbjct: 196 QPCADCYQQSDPVFDPSLSTSYASVACDNPRCHDLDAA-----ACRNSTGA-CLYEVAYG 249

Query: 222 DGSYTRGELGREHLGLGK-ASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEI 280
           DGSYT G+   E L LG  A V+    GCG +N+GLF G +GL+ LG   LS  SQ S  
Sbjct: 250 DGSYTVGDFATETLTLGDSAPVSSVAIGCGHDNEGLFVGAAGLLALGGGPLSFPSQISAT 309

Query: 281 FGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISI 340
               FSYCL   +D+ +S +L   G+++  + + P     +I +P+ +TFY + L+G+S+
Sbjct: 310 ---TFSYCL-VDRDSPSSSTLQF-GDAADAEVTAP-----LIRSPRTSTFYYVGLSGLSV 359

Query: 341 GGKQLQ--ASGFA-----KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI 393
           GG+ L    S FA      GG+++DSGT +TRL  S Y+AL+  F++     P   G S+
Sbjct: 360 GGQILSIPPSAFAMDSTGAGGVIVDSGTAVTRLQSSAYAALRDAFVRGTQSLPRTSGVSL 419

Query: 394 LDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSD-ASQVCLALASLSYEDET 452
            DTC++LS    V +P V + F G  E+ +      Y +  D A   CLA A  +     
Sbjct: 420 FDTCYDLSDRTSVEVPAVSLRFAGGGELRLPAKN--YLIPVDGAGTYCLAFAPTNA--AV 475

Query: 453 GIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
            IIGN QQ+  RV +DT  S +GF    C
Sbjct: 476 SIIGNVQQQGTRVSFDTAKSTVGFTTNKC 504


>gi|413953772|gb|AFW86421.1| hypothetical protein ZEAMMB73_098827 [Zea mays]
          Length = 482

 Score =  235 bits (599), Expect = 4e-59,   Method: Compositional matrix adjust.
 Identities = 155/423 (36%), Positives = 225/423 (53%), Gaps = 39/423 (9%)

Query: 81  IVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIE 140
           + D +      L  D+  V+ +  R+         D + T IP + G+   +L Y+ TI 
Sbjct: 78  VPDHHPHYTGILRRDHNRVRSIHRRLTGA-----GDTAAT-IPASLGLAFHSLEYVVTIG 131

Query: 141 LG--GRNMTVIVDTGSDLTWVQCQPC-KSCYNQQDPVFDPSISPSYKKVLCNSSTCHALE 197
           +G   RN TV+ DTGSDLTWVQC+PC  SCY QQ+P+FDPS S +Y  V C +  C   +
Sbjct: 132 IGTPARNFTVLFDTGSDLTWVQCKPCTDSCYQQQEPLFDPSKSSTYVDVPCGTPQC---K 188

Query: 198 FATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS--VNDFIFGCGRN-NK 254
              G    C  ++   C Y V YGD S TRG L +E   L  ++      +FGC    + 
Sbjct: 189 IGGGQDLTCGGTT---CEYSVKYGDQSVTRGNLAQEAFTLSPSAPPAAGVVFGCSHEYSS 245

Query: 255 GLFGG-----VSGLMGLGRSDLSLVSQTSE-IFGGLFSYCLPSTQDAGASGSLILGGNSS 308
           G+ G      V+GL+GLGR D S++SQT     G +FSYCLP      ++G L +G  + 
Sbjct: 246 GVKGAEEEMSVAGLLGLGRGDSSILSQTRRGNSGDVFSYCLPPR--GSSAGYLTIGAAAP 303

Query: 309 VFKNSTPITYTNMIP-NPQLATFYILNLTGISIGGKQL--QASGFAKGGILIDSGTVITR 365
              N   +++T ++  N QL++ Y++NL GIS+ G  L   AS F  G + IDSGTVIT 
Sbjct: 304 PQSN---LSFTPLVTDNSQLSSVYVVNLVGISVSGAALPIDASAFYIGTV-IDSGTVITH 359

Query: 366 LPPSIYSALKAEFLKQFSGFPSAPGFSI--LDTCFNLSAYQEVNIPLVKMEFEGNAEMTV 423
           +P + Y  L+ EF +   G+   P   +  LDTC++++ +  V  P V +EF G A + V
Sbjct: 360 MPAAAYYVLRDEFRRHMGGYTMLPEGHVESLDTCYDVTGHDVVTAPPVALEFGGGARIDV 419

Query: 424 DVTGIVYFVKSDASQVCLALASLSYEDET----GIIGNYQQKNQRVIYDTKNSQLGFAGE 479
           D +GI+     DAS   L LA L++         IIGN QQ+   V++D +  ++GF   
Sbjct: 420 DASGILLVFAVDASGQSLTLACLAFVPTNLPGFVIIGNMQQRAYNVVFDVEGRRIGFGAN 479

Query: 480 DCS 482
            CS
Sbjct: 480 GCS 482


>gi|242045118|ref|XP_002460430.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
 gi|241923807|gb|EER96951.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
          Length = 488

 Score =  235 bits (599), Expect = 4e-59,   Method: Compositional matrix adjust.
 Identities = 145/370 (39%), Positives = 210/370 (56%), Gaps = 26/370 (7%)

Query: 127 GIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYK 184
           G  L T NY+A++ LG     + V +DTGSD +WVQC+PC  CY Q+DPVFDP+ S +Y 
Sbjct: 131 GKSLSTTNYVASLRLGTPATELVVELDTGSDQSWVQCKPCADCYEQRDPVFDPTASSTYS 190

Query: 185 KVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA---- 240
            V C +  C  L  ++ +    S ++  +C Y VSY D S+T G+L R+ L L  +    
Sbjct: 191 AVPCGARECQELASSSSSRNCSSDNN-KNCPYEVSYDDDSHTVGDLARDTLTLSPSPSPS 249

Query: 241 ---SVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGA 297
              +V  F+FGCG +N G FG V GL+GLG    SL SQ +  +G  FSYCLPS+    A
Sbjct: 250 PADTVPGFVFGCGHSNAGTFGEVDGLLGLGLGKASLPSQVAARYGAAFSYCLPSSPS--A 307

Query: 298 SGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ--ASGFA-KGG 354
           +G L  GG ++         +T M+   Q  T Y LNLTGI + G+ ++  AS FA   G
Sbjct: 308 AGYLSFGGAAARAN----AQFTEMVTG-QDPTSYYLNLTGIVVAGRAIKVPASAFATAAG 362

Query: 355 ILIDSGTVITRLPPSIYSALKAEFLKQFS--GFPSAPGFSILDTCFNLSAYQEVNIPLVK 412
            +IDSGT  +RLPPS Y+AL++ F        +  AP   I DTC++ + ++ V IP V+
Sbjct: 363 TIIDSGTAFSRLPPSAYAALRSSFRSAMGRYRYKRAPSSPIFDTCYDFTGHETVRIPAVE 422

Query: 413 MEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNS 472
           + F   A + +  +G++Y   +D +Q CLA        + GI+GN QQ+   VIYD  + 
Sbjct: 423 LVFADGATVHLHPSGVLY-TWNDVAQTCLAFVP---NHDLGILGNTQQRTLAVIYDVGSQ 478

Query: 473 QLGFAGEDCS 482
           ++GF  + C+
Sbjct: 479 RIGFGRKGCA 488


>gi|116786826|gb|ABK24255.1| unknown [Picea sitchensis]
 gi|148906052|gb|ABR16185.1| unknown [Picea sitchensis]
          Length = 485

 Score =  235 bits (599), Expect = 4e-59,   Method: Compositional matrix adjust.
 Identities = 156/454 (34%), Positives = 239/454 (52%), Gaps = 48/454 (10%)

Query: 58  QKSRIEMGAITLELKHKNYCSGKI-----VDWNEQQQNRLILDNLHVQYLQSRIKNMISG 112
           ++  +E+   ++ L H++   G       + + E+ Q RL  D   V  + SR++  ++G
Sbjct: 50  KEETLEIKPWSIPLVHRDAMKGNSNKNNELSYAERMQQRLKRDAARVAAINSRLELAVNG 109

Query: 113 NIKDV-------------SNTEIPLTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLT 157
             +               S+ + P+ SG+   +  Y + I +G   R+  +++DTGSD+T
Sbjct: 110 IKRSSLKPDSSSSFTMAESDFQSPVVSGMDQGSGEYFSRIGVGAPRRDQLMVLDTGSDVT 169

Query: 158 WVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYF 217
           W+QC+PC  CY Q DP+++P++S SYK V C ++ C  L+       V   S    C Y 
Sbjct: 170 WIQCEPCSDCYQQSDPIYNPALSSSYKLVGCQANLCQQLD-------VSGCSRNGSCLYQ 222

Query: 218 VSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQT 277
           VSYGDGSYT+G    E L LG A + +   GCG +N+GLF G +GL+GLG   LS  SQ 
Sbjct: 223 VSYGDGSYTQGNFATETLTLGGAPLQNVAIGCGHDNEGLFVGAAGLLGLGGGSLSFPSQL 282

Query: 278 SEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTP--ITYTNMIPNPQLATFYILNL 335
           ++  G +FSYCL   +D+ +S +L  G      + + P       M+ N +L TFY ++L
Sbjct: 283 TDENGKIFSYCLVD-RDSESSSTLQFG------RAAVPNGAVLAPMLKNSRLDTFYYVSL 335

Query: 336 TGISIGGKQLQAS-------GFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSA 388
           +GIS+GGK L  S           GG+++DSGT +TRL  + Y +L+  F       PS 
Sbjct: 336 SGISVGGKMLSISDSVFGIDASGNGGVIVDSGTAVTRLQTAAYDSLRDAFRAGTKNLPST 395

Query: 389 PGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDA-SQVCLALASLS 447
            G S+ DTC++LS+ + V++P V   F G   M++      Y V  D+    C A A  S
Sbjct: 396 DGVSLFDTCYDLSSKESVDVPTVVFHFSGGGSMSLPAKN--YLVPVDSMGTFCFAFAPTS 453

Query: 448 YEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
                 I+GN QQ+  RV +D  N+Q+GFA   C
Sbjct: 454 --SSLSIVGNIQQQGIRVSFDRANNQVGFAVNKC 485


>gi|449458736|ref|XP_004147103.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
 gi|449518669|ref|XP_004166359.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
          Length = 482

 Score =  234 bits (598), Expect = 5e-59,   Method: Compositional matrix adjust.
 Identities = 148/428 (34%), Positives = 220/428 (51%), Gaps = 34/428 (7%)

Query: 69  LELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKD----VSNTEIPL 124
           L L H++  S  +        +R+  D + V  L  R+ +     +KD    V+N    +
Sbjct: 74  LNLLHRDKLS-HVHGHRRGFNDRMKRDAIRVATLVRRLSHGAPAAVKDSRYKVANFATDV 132

Query: 125 TSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPS 182
            SG+   +  Y   I +G   RN  +++D+GSD+ WVQC+PC  CY Q DPVFDP+ S S
Sbjct: 133 ISGMEAGSGEYFVRIGVGSPPRNQYMVIDSGSDIVWVQCKPCSRCYQQSDPVFDPADSSS 192

Query: 183 YKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASV 242
           +  V C S  C  LE    N+G         C Y VSYGDGSYT+G L  E L +G+  +
Sbjct: 193 FAGVSCGSDVCDRLENTGCNAG--------RCRYEVSYGDGSYTKGTLALETLTVGQVMI 244

Query: 243 NDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLI 302
            D   GCG  N+G+F G +GL+GLG   +S + Q     GG FSYCL S +  G++G+L 
Sbjct: 245 RDVAIGCGHTNQGMFIGAAGLLGLGGGSMSFIGQLGGQTGGAFSYCLVS-RGTGSTGALE 303

Query: 303 LGGNSSVFKNSTPI--TYTNMIPNPQLATFYILNLTGISIGG-------KQLQASGFAKG 353
            G      + + P+  T+ ++I NP+  +FY + L GI +GG       +  Q + +   
Sbjct: 304 FG------RGALPVGATWISLIRNPRAPSFYYIGLAGIGVGGVRVSVPEETFQLTEYGTN 357

Query: 354 GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKM 413
           G+++D+GT +TR P + Y A +  F  Q S  P APG SI DTC++L+ ++ V +P V  
Sbjct: 358 GVVMDTGTAVTRFPTAAYVAFRDSFTAQTSNLPRAPGVSIFDTCYDLNGFESVRVPTVSF 417

Query: 414 EFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQ 473
            F     +T+     +  V    +  CLA A         IIGN QQ+  ++ +D  N  
Sbjct: 418 YFSDGPVLTLPARNFLIPVDGGGT-FCLAFA--PSPSGLSIIGNIQQEGIQISFDGANGF 474

Query: 474 LGFAGEDC 481
           +GF    C
Sbjct: 475 VGFGPNIC 482


>gi|326517992|dbj|BAK07248.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 500

 Score =  232 bits (592), Expect = 3e-58,   Method: Compositional matrix adjust.
 Identities = 149/381 (39%), Positives = 212/381 (55%), Gaps = 33/381 (8%)

Query: 114 IKDVSNTEI--PLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYN 169
           + + S  EI  P+ SG+   +  Y + + +G   R + +++DTGSD+TW+QCQPC  CY 
Sbjct: 140 VFEASAAEIQGPVVSGVGQGSGEYFSRVGVGRPARQLYMVLDTGSDVTWLQCQPCADCYA 199

Query: 170 QQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGE 229
           Q DPV+DPS+S SY  V C+S  C  L+ A      C +S+   C Y V+YGDGSYT G+
Sbjct: 200 QSDPVYDPSVSTSYATVGCDSPRCRDLDAA-----ACRNST-GSCLYEVAYGDGSYTVGD 253

Query: 230 LGREHLGLGK-ASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYC 288
              E L LG  A V++   GCG +N+GLF G +GL+ LG   LS  SQ S      FSYC
Sbjct: 254 FATETLTLGDSAPVSNVAIGCGHDNEGLFVGAAGLLALGGGPLSFPSQISAT---TFSYC 310

Query: 289 LPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ-- 346
           L   +D+ +S +L  G       +  P     +I +P+  TFY + L+GIS+GG+ L   
Sbjct: 311 L-VDRDSPSSSTLQFG------DSEQPAVTAPLIRSPRTNTFYYVALSGISVGGEALSIP 363

Query: 347 ASGFA-----KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLS 401
           +S FA      GG+++DSGT +TRL    Y AL+  F++     P A G S+ DTC++L+
Sbjct: 364 SSAFAMDDAGSGGVIVDSGTAVTRLQSGAYGALREAFVQGTQSLPRASGVSLFDTCYDLA 423

Query: 402 AYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV-CLALASLSYEDETGIIGNYQQ 460
               V +P V + FEG  E+ +      Y +  DA+   CLA A  S      IIGN QQ
Sbjct: 424 GRSSVQVPAVALWFEGGGELKLPAKN--YLIPVDAAGTYCLAFAGTS--GPVSIIGNVQQ 479

Query: 461 KNQRVIYDTKNSQLGFAGEDC 481
           +  RV +DT  + +GF  + C
Sbjct: 480 QGVRVSFDTAKNTVGFTADKC 500


>gi|125555058|gb|EAZ00664.1| hypothetical protein OsI_22685 [Oryza sativa Indica Group]
          Length = 465

 Score =  232 bits (591), Expect = 4e-58,   Method: Compositional matrix adjust.
 Identities = 156/461 (33%), Positives = 231/461 (50%), Gaps = 41/461 (8%)

Query: 43  QWQQKSGSSSSCVSHQKSRIEMGAITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYL 102
            ++ ++  S+S  +   +R    ++ L  +H                 RL  D     Y+
Sbjct: 24  SFEPEAACSTSSANSDPNR---ASVPLVHRHGPCAPSAASGGKPSLAERLRRDRARANYI 80

Query: 103 QSR----------IKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIV 150
            ++          + + + G       T IP   G  + +L Y+ T+ +G       V++
Sbjct: 81  VTKAAGGRTAATAVSDAVGGG-----GTSIPTFLGDSVDSLEYVVTLGIGTPAVQQIVLI 135

Query: 151 DTGSDLTWVQCQPCKS--CYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSS 208
           DTGSDL+WVQC+PC +  CY Q+DP+FDPS S SY  V C+S  C  L       G C+S
Sbjct: 136 DTGSDLSWVQCKPCGAGECYAQKDPLFDPSSSSSYASVPCDSDACRKLAAGAYGHG-CTS 194

Query: 209 SSPPDCNYFVSYGDGSYTRGELGREHLGLGKASV-NDFIFGCGRNNKGLFGGVSGLMGLG 267
            +   C Y + YG+ + T G    E L L    V  DF FGCG +  G +    GL+GLG
Sbjct: 195 GAAALCEYGIEYGNRATTTGVYSTETLTLKPGVVVADFGFGCGDHQHGPYEKFDGLLGLG 254

Query: 268 RSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTP--ITYTNMIPNP 325
            +  SLVSQTS  FGG FSYCLP T  +G +G L LG  +S   ++      +T M   P
Sbjct: 255 GAPESLVSQTSSQFGGPFSYCLPPT--SGGAGFLALGAPNSSSSSTAAAGFLFTPMRRIP 312

Query: 326 QLATFYILNLTGISIGGKQLQA--SGFAKGGILIDSGTVITRLPPSIYSALKAEF---LK 380
            + TFY++ LTGIS+GG  L    S F+  G++IDSGTVIT LP + Y+AL++ F   + 
Sbjct: 313 SVPTFYVVTLTGISVGGAPLAVPPSAFSS-GMVIDSGTVITGLPATAYAALRSAFRSAMS 371

Query: 381 QFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVC 440
           ++   P + G ++LDTC++ + +  V +P + + F G A + +     V          C
Sbjct: 372 EYRLLPPSNG-AVLDTCYDFTGHTNVTVPTIALTFSGGATIDLATPAGVLV------DGC 424

Query: 441 LALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
           LA A    +D  GIIGN  Q+   V+YD+    +GF    C
Sbjct: 425 LAFAGAGTDDTIGIIGNVNQRTFEVLYDSGKGTVGFRAGAC 465


>gi|302793638|ref|XP_002978584.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
 gi|300153933|gb|EFJ20570.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
          Length = 407

 Score =  232 bits (591), Expect = 4e-58,   Method: Compositional matrix adjust.
 Identities = 159/412 (38%), Positives = 232/412 (56%), Gaps = 42/412 (10%)

Query: 95  DNLHVQYLQSRIKNMISGNIKD-VSNTEI--PLTSGIRLQTLNYIATIELG--GRNMTVI 149
           D   V++++S+ K  ++G  KD  S+T++  P+TSG+   +  Y   + LG   R++ ++
Sbjct: 13  DERRVRWIESKAK--LAGKKKDEASSTDLNGPVTSGLLYGSGEYFVRLGLGTPARSLFMV 70

Query: 150 VDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEF--ATGNSGVCS 207
           VDTGSDL W+QCQPCKSCY Q DP+FDP  S S++++ C S  C ALE    +G+ G  S
Sbjct: 71  VDTGSDLPWLQCQPCKSCYKQADPIFDPRNSSSFQRIPCLSPLCKALEVHSCSGSRGATS 130

Query: 208 SSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS-VNDFIFGCGRNNKGLFGGVSGLMGL 266
                 C+Y V+YGDGS++ G+   +   LG  S      FGCG +N+GLF G +GL+GL
Sbjct: 131 R-----CSYQVAYGDGSFSVGDFSSDLFTLGTGSKAMSVAFGCGFDNEGLFAGAAGLLGL 185

Query: 267 GRSDLSLVSQ-----TSEIFGGLFSYCL-----PSTQDAGASGSLILGGNSSVFKNSTPI 316
           G   LS  SQ     T+      FSYCL     P T+   +S SLI G    V    +  
Sbjct: 186 GAGKLSFPSQIFASSTNSSTANSFSYCLVDRSNPMTR---SSSSLIFG----VAAIPSTA 238

Query: 317 TYTNMIPNPQLATFYILNLTGISIGGKQ-------LQASGFAKGGILIDSGTVITRLPPS 369
             + ++ NP+L TFY   + G+S+GG Q       LQ S    GG++IDSGT +TR P S
Sbjct: 239 ALSPLLKNPKLDTFYYAAMIGVSVGGAQLPISLKSLQLSQSGSGGVIIDSGTSVTRFPTS 298

Query: 370 IYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIV 429
           +Y+ ++  F       PSAP +S+ DTC+N S    V++P + + FE  A++ +  T  +
Sbjct: 299 VYATIRDAFRNATINLPSAPRYSLFDTCYNFSGKASVDVPALVLHFENGADLQLPPTNYL 358

Query: 430 YFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
             + + A   CLA A  S   E GIIGN QQ++ R+ +D + S L FA + C
Sbjct: 359 IPINT-AGSFCLAFAPTSM--ELGIIGNIQQQSFRIGFDLQKSHLAFAPQQC 407


>gi|224142013|ref|XP_002324355.1| predicted protein [Populus trichocarpa]
 gi|222865789|gb|EEF02920.1| predicted protein [Populus trichocarpa]
          Length = 477

 Score =  230 bits (587), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 174/492 (35%), Positives = 252/492 (51%), Gaps = 36/492 (7%)

Query: 7   PLTILSLLLPLMVSLFLLA-----KGAHCFEGKKKLHLHKLQWQQKSGSSSSCVSHQKSR 61
           PL+ +SL   L V L LL      K     EGK+    + ++  + +    S V  Q +R
Sbjct: 5   PLSPISLTFILYVFLVLLCPLCSLKKGLTVEGKETTK-NYIRTVRVNSLLPSNVCSQSTR 63

Query: 62  IEMGAITLELKHK-NYC---SGKIVDWNEQQQNRLIL-DNLHVQYLQSRIK-NMISGNIK 115
           +   A +L++ +K   C   +G     N       +L D L V+  Q R+  N  SG  K
Sbjct: 64  VLNRASSLKVVNKYGPCIPVTGAPKTINVPSTAEFLLQDQLRVKSFQVRLSMNPSSGVFK 123

Query: 116 DVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPC-KSCYNQQD 172
           ++  T IP  + I      Y+ T+ LG   ++ T+  DTGSDLTW QC+PC   C+ Q  
Sbjct: 124 EMQTT-IP--ASIVPTGGAYVVTVGLGTPKKDFTLSFDTGSDLTWTQCEPCLGGCFPQNQ 180

Query: 173 PVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGR 232
           P FDP+ S SYK V C+S  C  +      +  C S++   C Y + YG G YT G L  
Sbjct: 181 PKFDPTTSTSYKNVSCSSEFCKLIAEGNYPAQDCISNT---CLYGIQYGSG-YTIGFLAT 236

Query: 233 EHLGLGKASV-NDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPS 291
           E L +  + V  +F+FGC   ++G F G +GL+GLGRS ++L SQT+  +  LFSYCLP+
Sbjct: 237 ETLAIASSDVFKNFLFGCSEESRGTFNGTTGLLGLGRSPIALPSQTTNKYKNLFSYCLPA 296

Query: 292 TQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA 351
           +    ++G L  G   S    STPI+       P+L   Y LN  GIS+ G++L  +G +
Sbjct: 297 SPS--STGHLSFGVEVSQAAKSTPIS-------PKLKQLYGLNTVGISVRGRELPING-S 346

Query: 352 KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLS--AYQEVNIP 409
               +IDSGT  T LP   YSAL + F +  + +    G S    C++ S      + IP
Sbjct: 347 ISRTIIDSGTTFTFLPSPTYSALGSAFREMMANYTLTNGTSSFQPCYDFSNIGNGTLTIP 406

Query: 410 LVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDT 469
            + + FEG  E+ +DV+GI+  V +   +VCLA A    + +  I GNYQQK   VIYD 
Sbjct: 407 GISIFFEGGVEVEIDVSGIMIPV-NGLKEVCLAFADTGSDSDFAIFGNYQQKTYEVIYDV 465

Query: 470 KNSQLGFAGEDC 481
               +GFA + C
Sbjct: 466 AKGMVGFAPKGC 477


>gi|242056497|ref|XP_002457394.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
 gi|241929369|gb|EES02514.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
          Length = 509

 Score =  230 bits (586), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 143/371 (38%), Positives = 203/371 (54%), Gaps = 30/371 (8%)

Query: 123 PLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSIS 180
           P+ SG+   +  Y + + +G   R + +++DTGSD+TWVQCQPC  CY Q DPVFDPS+S
Sbjct: 157 PVVSGVGQGSGEYFSRVGIGSPARELYMVLDTGSDVTWVQCQPCADCYQQSDPVFDPSLS 216

Query: 181 PSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA 240
            SY  V C+S  C  L+ A   +   +      C Y V+YGDGSYT G+   E L LG +
Sbjct: 217 ASYAAVSCDSPRCRDLDTAACRNATGA------CLYEVAYGDGSYTVGDFATETLTLGDS 270

Query: 241 S-VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASG 299
           + V +   GCG +N+GLF G +GL+ LG   LS  SQ   I    FSYCL   +D+ A+ 
Sbjct: 271 TPVTNVAIGCGHDNEGLFVGAAGLLALGGGPLSFPSQ---ISASTFSYCL-VDRDSPAAS 326

Query: 300 SLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ--ASGFA------ 351
           +L  G + +     T      ++ +P+  TFY + L+GIS+GG+ L   +S FA      
Sbjct: 327 TLQFGADGAEADTVT----APLVRSPRTGTFYYVALSGISVGGQALSIPSSAFAMDATSG 382

Query: 352 KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLV 411
            GG+++DSGT +TRL  S Y+AL+  F++     P   G S+ DTC++LS    V +P V
Sbjct: 383 SGGVIVDSGTAVTRLQSSAYAALRDAFVRGTPSLPRTSGVSLFDTCYDLSDRTSVEVPAV 442

Query: 412 KMEFEGNAEMTVDVTGIVYFVKSD-ASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTK 470
            + FEG   + +      Y +  D A   CLA A  +      IIGN QQ+  RV +DT 
Sbjct: 443 SLRFEGGGALRLPAKN--YLIPVDGAGTYCLAFAPTNA--AVSIIGNVQQQGTRVSFDTA 498

Query: 471 NSQLGFAGEDC 481
              +GF    C
Sbjct: 499 KGVVGFTPNKC 509


>gi|414876414|tpg|DAA53545.1| TPA: hypothetical protein ZEAMMB73_483039 [Zea mays]
          Length = 506

 Score =  229 bits (584), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 143/371 (38%), Positives = 204/371 (54%), Gaps = 30/371 (8%)

Query: 123 PLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSIS 180
           P+ SG+   +  Y + + +G   R + +++DTGSD+TWVQCQPC  CY Q DPVFDPS+S
Sbjct: 154 PVVSGVGQGSGEYFSRVGIGSPARQLYMVLDTGSDVTWVQCQPCADCYQQSDPVFDPSLS 213

Query: 181 PSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA 240
            SY  V C+S  C  L+ A   +   +      C Y V+YGDGSYT G+   E L LG +
Sbjct: 214 ASYAAVSCDSQRCRDLDTAACRNATGA------CLYEVAYGDGSYTVGDFATETLTLGDS 267

Query: 241 S-VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASG 299
           + V +   GCG +N+GLF G +GL+ LG   LS  SQ   I    FSYCL   +D+ A+ 
Sbjct: 268 TPVGNVAIGCGHDNEGLFVGAAGLLALGGGPLSFPSQ---ISASTFSYCL-VDRDSPAAS 323

Query: 300 SLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ--ASGFA------ 351
           +L  G  ++     T      ++ +P+ +TFY + L+GIS+GG+ L   AS FA      
Sbjct: 324 TLQFGDGAAEAGTVT----APLVRSPRTSTFYYVALSGISVGGQPLSIPASAFAMDATSG 379

Query: 352 KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLV 411
            GG+++DSGT +TRL  + Y+AL+  F++     P   G S+ DTC++LS    V +P V
Sbjct: 380 SGGVIVDSGTAVTRLQSAAYAALRDAFVQGAPSLPRTSGVSLFDTCYDLSDRTSVEVPAV 439

Query: 412 KMEFEGNAEMTVDVTGIVYFVKSD-ASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTK 470
            + FEG   + +      Y +  D A   CLA A  +      IIGN QQ+  RV +DT 
Sbjct: 440 SLRFEGGGALRLPAKN--YLIPVDGAGTYCLAFAPTNA--AVSIIGNVQQQGTRVSFDTA 495

Query: 471 NSQLGFAGEDC 481
              +GF    C
Sbjct: 496 RGAVGFTPNKC 506


>gi|302776610|ref|XP_002971459.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
 gi|300160591|gb|EFJ27208.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
          Length = 357

 Score =  229 bits (583), Expect = 3e-57,   Method: Compositional matrix adjust.
 Identities = 142/369 (38%), Positives = 203/369 (55%), Gaps = 25/369 (6%)

Query: 124 LTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISP 181
           +TSG+   +  Y   + +G   +   +++DTGSD+ W+QC PCKSCY Q D VFDP  S 
Sbjct: 3   VTSGLAFGSGEYFVRVGIGSPTKLQYLVMDTGSDVPWIQCSPCKSCYKQNDAVFDPRASS 62

Query: 182 SYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS 241
           S++++ C++  C  L+        C+S+    C Y VSYGDGS+T G+L  +   + +  
Sbjct: 63  SFRRLSCSTPQCKLLDVK-----ACASTDN-RCLYQVSYGDGSFTVGDLASDSFSVSRGR 116

Query: 242 VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSL 301
            +  +FGCG +N+GLF G +GL+GLG   LS  SQ S      FSYCL S  +   + S 
Sbjct: 117 TSPVVFGCGHDNEGLFVGAAGLLGLGAGKLSFPSQLSS---RKFSYCLVSRDNGVRASSA 173

Query: 302 ILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ--------ASGFAKG 353
           +L G+S++   S    YT ++ NP+L TFY   L+GISIGG  L         +S   +G
Sbjct: 174 LLFGDSAL-PTSASFAYTQLLKNPKLDTFYYAGLSGISIGGTLLSIPSTAFKLSSSTGRG 232

Query: 354 GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKM 413
           G++IDSGT +TRLP   Y+ ++  F       P A  FS+ DTC++ SA   V IP V  
Sbjct: 233 GVIIDSGTSVTRLPTYAYTVMRDAFRSATQKLPRAADFSLFDTCYDFSALTSVTIPTVSF 292

Query: 414 EFEGNAEMTVDVTGIVYFVKSDAS-QVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNS 472
            FEG A + +  +   Y V  D S   C A +  S   +  IIGN QQ+  RV  D  +S
Sbjct: 293 HFEGGASVQLPPSN--YLVPVDTSGTFCFAFSKTSL--DLSIIGNIQQQTMRVAIDLDSS 348

Query: 473 QLGFAGEDC 481
           ++GFA   C
Sbjct: 349 RVGFAPRQC 357


>gi|326530426|dbj|BAJ97639.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 479

 Score =  229 bits (583), Expect = 3e-57,   Method: Compositional matrix adjust.
 Identities = 160/435 (36%), Positives = 229/435 (52%), Gaps = 40/435 (9%)

Query: 65  GAITLELKHKNYCSGKIVDWNEQQQNRLIL-DNLHVQYLQSRIKNMISGNIKDVSNTEI- 122
           G  +L L H++  SG+           L   D   V+YLQ R+             TE+ 
Sbjct: 67  GRPSLALLHRDAVSGRTYPSTRHAMLGLAARDGARVEYLQRRLS-------PTTMTTEVG 119

Query: 123 -PLTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSI 179
             + SGI   +  Y   + +G       ++VD+GSD+ W+QC+PC  CY Q DP+FDP+ 
Sbjct: 120 SEVVSGISEGSGEYFVRVGVGSPPTEQYLVVDSGSDVIWIQCRPCAECYQQADPLFDPAA 179

Query: 180 SPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGK 239
           S S+  V C+S  C  L    G+SG   S +   C Y VSYGDGSYT+G L  E L  G 
Sbjct: 180 SASFTAVPCDSGVCRTLP--GGSSGCADSGA---CRYQVSYGDGSYTQGVLAMETLTFGD 234

Query: 240 AS-VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPST-QDAGA 297
           ++ V     GCG  N+GLF G +GL+GLG   +SLV Q     GG FSYCL S   DAGA
Sbjct: 235 STPVQGVAIGCGHRNRGLFVGAAGLLGLGWGPMSLVGQLGGAAGGAFSYCLASRGADAGA 294

Query: 298 SGSLILGGNSSVFKNSTPI--TYTNMIPNPQLATFYILNLTGISIGGKQ--LQASGF--- 350
            GSL+ G +     ++ P+   +  ++ N Q  +FY + LTG+ +GG++  LQ   F   
Sbjct: 295 -GSLVFGRD-----DAMPVGAVWVPLLRNAQQPSFYYVGLTGLGVGGERLPLQDGLFDLT 348

Query: 351 --AKGGILIDSGTVITRLPPSIYSALKAEFLKQFSG-FPSAPGFSILDTCFNLSAYQEVN 407
               GG+++D+GT +TRLPP  Y+AL+  F     G  P APG S+LDTC++LS Y  V 
Sbjct: 349 EDGGGGVVMDTGTAVTRLPPDAYAALRDAFASTIGGDLPRAPGVSLLDTCYDLSGYASVR 408

Query: 408 IPLVKMEF-EGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVI 466
           +P V + F    A +T+    ++  V+      CLA A+ +      I+GN QQ+  ++ 
Sbjct: 409 VPTVALYFGRDGAALTLPARNLL--VEMGGGVYCLAFAASA--SGLSILGNIQQQGIQIT 464

Query: 467 YDTKNSQLGFAGEDC 481
            D+ N  +GF    C
Sbjct: 465 VDSANGYVGFGPSTC 479


>gi|242092892|ref|XP_002436936.1| hypothetical protein SORBIDRAFT_10g011730 [Sorghum bicolor]
 gi|241915159|gb|EER88303.1| hypothetical protein SORBIDRAFT_10g011730 [Sorghum bicolor]
          Length = 469

 Score =  229 bits (583), Expect = 3e-57,   Method: Compositional matrix adjust.
 Identities = 166/460 (36%), Positives = 235/460 (51%), Gaps = 41/460 (8%)

Query: 42  LQWQQKSGSSSSC-----VSHQKSRIEMGAITLELKHKNYCSGKIVDWNEQQQNRLILDN 96
           +Q    S S+++C     V+   SR  M    L  +H           N      ++  +
Sbjct: 31  VQTSTSSPSNAACSPAAQVTSDPSRASM---PLMYRHGPCAPASAAATNRPSPAEMLRRD 87

Query: 97  LHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELGGRNM--TVIVDTGS 154
              +  ++ I    SG  +      IP + G  + +L Y+ T+  G   +   +++DTGS
Sbjct: 88  ---RARRNHILRKASGR-RITLGVSIPTSLGAFVDSLQYVVTLGFGTPAVPQVLLIDTGS 143

Query: 155 DLTWVQCQPCKS--CYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPP 212
           DL+WVQCQPC S  CY Q+DPVFDPS S +Y  V C S  C  L+  +  +G  +SSS  
Sbjct: 144 DLSWVQCQPCNSSTCYPQKDPVFDPSASSTYAPVPCGSEACRDLDPDSYANGCTNSSSGA 203

Query: 213 D-CNYFVSYGDGSYTRGELGREHLGLGKAS---VNDFIFGCGRNNKGLFGGVSGLMGLGR 268
             C Y + YG+G  T G    E L L   +   VN+F FGCG   KG+F    GL+GLG 
Sbjct: 204 SLCQYGIQYGNGDTTVGVYSTETLTLSPEAATVVNNFSFGCGLVQKGVFDLFDGLLGLGG 263

Query: 269 SDLSLVSQTSEIFGGLFSYCLP---STQDAGASGSLILGGNSSVFKNSTPITYTNMIPNP 325
           +  SLVSQT+  +GG FSYCLP   ST    A G+   GGN++     TP+         
Sbjct: 264 APESLVSQTTGTYGGAFSYCLPAGNSTAGFLALGAPATGGNNTAGFQFTPLQVVET---- 319

Query: 326 QLATFYILNLTGISIGGKQL--QASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFS 383
              TFY++ LTGIS+GGKQL  + + FA GG++IDSGT++T LP + YSAL+  F    S
Sbjct: 320 ---TFYLVKLTGISVGGKQLDIEPTVFA-GGMIIDSGTIVTGLPETAYSALRTAFRSAMS 375

Query: 384 GFPSAP--GFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCL 441
            +P  P      LDTC++ +    V +P V + FEG   + +DV   V          CL
Sbjct: 376 AYPLLPPNDDEDLDTCYDFTGNTNVTVPTVALTFEGGVTIDLDVPSGVLL------DGCL 429

Query: 442 ALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
           A  + + + +TGIIGN  Q+   V+YD+    +GF    C
Sbjct: 430 AFVAGASDGDTGIIGNVNQRTFEVLYDSARGHVGFRAGAC 469


>gi|357118076|ref|XP_003560785.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 477

 Score =  229 bits (583), Expect = 3e-57,   Method: Compositional matrix adjust.
 Identities = 149/368 (40%), Positives = 196/368 (53%), Gaps = 22/368 (5%)

Query: 122 IPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKS-CYNQQDPVFDPS 178
           IP  +G  L TL ++  +  G   +   +I+DTGSDL+W+QC+PC   CY Q DP FDP+
Sbjct: 124 IPDHTGTNLDTLEFVVVVGFGTPAQTAAIILDTGSDLSWIQCKPCSGHCYRQHDPDFDPA 183

Query: 179 ISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG 238
            S SY  V C +  C A        G+C+ ++   C Y V YGDGS T G L R+ L   
Sbjct: 184 KSSSYAAVPCGTPVCAAA------GGMCNGTT---CLYGVQYGDGSSTTGVLSRDTLTFN 234

Query: 239 KAS-VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGA 297
            +S    F FGCG  N G FG V GL+GLGR  LSL SQ +  FGG+FSYCLPS      
Sbjct: 235 SSSKFTGFTFGCGEKNIGDFGEVDGLLGLGRGKLSLPSQAAPSFGGVFSYCLPSYNT--T 292

Query: 298 SGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL--QASGFAKGGI 355
            G L +G       ++ P+ YT MI  PQ  +FY + L  I+IGG  L    S F K G 
Sbjct: 293 PGYLNIGATKPT--STVPVQYTAMIKKPQYPSFYFIELVSINIGGYILPVPPSVFTKTGT 350

Query: 356 LIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEF 415
           L+DSGT++T LPP  Y++L+  F     G   AP +  LDTC++ +    + IP V   F
Sbjct: 351 LLDSGTILTYLPPPAYTSLRDRFKFTMQGNKPAPPYEPLDTCYDFTGQGAIVIPAVSFNF 410

Query: 416 EGNAEMTVDVTGIVYFVKSDASQV--CLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQ 473
              A   +D  GI+ F   DA  +  CLA  S        I+GN QQ+   VIYD  + +
Sbjct: 411 SDGAVFDLDFYGIMIF-PDDAKPLIGCLAFVSRPAAMPFSIVGNTQQRAAEVIYDVPSQK 469

Query: 474 LGFAGEDC 481
           +GF    C
Sbjct: 470 IGFIPISC 477


>gi|242095588|ref|XP_002438284.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
 gi|241916507|gb|EER89651.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
          Length = 487

 Score =  228 bits (582), Expect = 4e-57,   Method: Compositional matrix adjust.
 Identities = 150/424 (35%), Positives = 220/424 (51%), Gaps = 43/424 (10%)

Query: 93  ILDNLHVQYLQSRIKNMISGNIKDVSNTE-------IPLTSGIRLQTLNYIATIELGG-- 143
           + D+ H   +  R ++ +    + ++  E       IP   G+  Q+L Y+ TI +G   
Sbjct: 73  VPDHHHYTGILRRDRHRVRSIYRRLTAAETTTTTTTIPARLGLAFQSLEYVVTIGIGTPP 132

Query: 144 RNMTVIVDTGSDLTWVQCQPC--KSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATG 201
           RN TV+ DTGSDLTWVQC PC   SCY QQ+P+FDPS S +Y  V C++  CH       
Sbjct: 133 RNFTVLFDTGSDLTWVQCLPCPDSSCYPQQEPLFDPSKSSTYVDVPCSAPECHIGGVQQT 192

Query: 202 NSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS-----VNDFIFGCGRNNKGL 256
             G  S      C Y V YGD S T G L  E   L   S         +FGC      +
Sbjct: 193 RCGATS------CEYSVKYGDESETHGSLAEETFTLSPPSPLAPAATGVVFGCSHEYISV 246

Query: 257 FG----GVSGLMGLGRSDLSLVSQTSEIF---GGLFSYCLPSTQDAGASGSLILGGNSSV 309
           F     GV+GL+GLGR D S++SQT       GG+FSYCLP      ++G L +GG ++ 
Sbjct: 247 FNDTGMGVAGLLGLGRGDSSILSQTRRSINSGGGVFSYCLPPR--GSSTGYLTIGGGAAA 304

Query: 310 FKNS-TPITYTNMIPN-PQLATFYILNLTGISIGGK--QLQASGFAKGGILIDSGTVITR 365
            +   + +++T +I    QL + Y++NL G+S+ G    + AS F+ G + IDSGTV+T 
Sbjct: 305 PQQQYSNLSFTPLITTISQLRSAYVVNLAGVSVNGAAVDIPASAFSLGAV-IDSGTVVTH 363

Query: 366 LPPSIYSALKAEFLKQFSGFPSAP--GFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTV 423
           +P + Y  L+ EF      +   P     +LDTC++++    V  P V +EF G A + V
Sbjct: 364 MPAAAYYPLRDEFRLHMGSYKMLPEGSMKLLDTCYDVTGQDVVTAPRVALEFGGGARIDV 423

Query: 424 DVTGIVYFVKS-DASQVCLALASLSY--EDETG--IIGNYQQKNQRVIYDTKNSQLGFAG 478
           D +GI+  + + D S   L LA L++   +  G  I+GN QQ+   V++D    ++GF  
Sbjct: 424 DASGILLVLPAEDGSGQSLTLACLAFLPTNSAGLVIVGNMQQRAYNVVFDVDGGRIGFGP 483

Query: 479 EDCS 482
             CS
Sbjct: 484 NGCS 487


>gi|302768196|ref|XP_002967518.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
 gi|300165509|gb|EFJ32117.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
          Length = 398

 Score =  228 bits (582), Expect = 4e-57,   Method: Compositional matrix adjust.
 Identities = 151/408 (37%), Positives = 221/408 (54%), Gaps = 36/408 (8%)

Query: 99  VQYLQSRI--KNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGS 154
           VQ L+S++   ++I+  +    +      S +     +Y+ TI LG   +  +VI DTGS
Sbjct: 2   VQALRSKLAASSLITSEVPYPPSVSTDYESPVASGGGDYVTTISLGTPAKVFSVIADTGS 61

Query: 155 DLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDC 214
           DL W+QC+PC++C+NQ+DP+FDP  S SY  + C  + C +L   +     CS    PDC
Sbjct: 62  DLIWIQCKPCQACFNQKDPIFDPEGSSSYTTMSCGDTLCDSLPRKS-----CS----PDC 112

Query: 215 NYFVSYGDGSYTRGELGREHLGLG-----KASVNDFIFGCGRNNKGLFGGVSGLMGLGRS 269
           +Y   YGDGS TRG L  E + L      K +  +  FGCG  N+G F   SGL+GLGR 
Sbjct: 113 DYSYGYGDGSGTRGTLSSETVTLTSTQGEKLAAKNIAFGCGHLNRGSFNDASGLVGLGRG 172

Query: 270 DLSLVSQTSEIFGGLFSYCLPSTQDAGASGS-LILGGNSSVFKNSTPITY--TNMIPNPQ 326
           +LS VSQ  ++FG  FSYCL   +DA +  S +  G  SS   +   + Y  T MI NP 
Sbjct: 173 NLSFVSQLGDLFGHKFSYCLVPWRDAPSKTSPMFFGDESSSHSSGKKLHYAFTPMIHNPA 232

Query: 327 LATFYILNLTGISIGGKQLQ--ASGF-----AKGGILIDSGTVITRLPPSIYSALKAEFL 379
           + +FY + L  ISI G+ L+  A  F       GG++ DSGT +T LP + Y  +     
Sbjct: 233 MESFYYVKLKDISIAGRALRIPAGSFDIKPDGSGGMIFDSGTTLTLLPDAPYQIVLRALR 292

Query: 380 KQFSGFPSAPGFSI-LDTCFNLS---AYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSD 435
            + S FP   G S  LD C+++S   A  ++ IP +   FEG A+  + V    YF+ ++
Sbjct: 293 SKIS-FPKIDGSSAGLDLCYDVSGSKASYKMKIPAMVFHFEG-ADYQLPVEN--YFIAAN 348

Query: 436 ASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSS 483
            +   + LA +S   + GI GN  Q+N RV+YD  +S++G+A   C S
Sbjct: 349 DAGTIVCLAMVSSNMDIGIYGNMMQQNFRVMYDIGSSKIGWAPSQCDS 396


>gi|255544139|ref|XP_002513132.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223548143|gb|EEF49635.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 481

 Score =  228 bits (581), Expect = 5e-57,   Method: Compositional matrix adjust.
 Identities = 152/463 (32%), Positives = 230/463 (49%), Gaps = 40/463 (8%)

Query: 37  LHLHKLQWQQKSGSSSSCVSHQKSRIEMGAITLELKHKNYCSGKIVDWNEQQQN------ 90
           L++ +   + K+        +Q   +  G   L+L H++    KI  +N+   +      
Sbjct: 41  LNVKEAITETKASQYQELFDNQNDTLTEGKWKLKLVHRD----KITAFNKSSYDHSHNFH 96

Query: 91  -RLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELGG--RNMT 147
            R+  D   V  L  R+    + +   V      + SG+   +  Y   I +G   R   
Sbjct: 97  ARIQRDKKRVATLIRRLSPRDATSSYSVEEFGAEVVSGMNQGSGEYFIRIGVGSPPREQY 156

Query: 148 VIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCS 207
           V++D+GSD+ WVQCQPC  CY+Q DPVFDP+ S S+  V C+SS C  +E A  ++G   
Sbjct: 157 VVIDSGSDIVWVQCQPCTQCYHQTDPVFDPADSASFMGVPCSSSVCERIENAGCHAG--- 213

Query: 208 SSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLG 267
                 C Y V YGDGSYT+G L  E L  G+  V +   GCG  N+G+F G +GL+GLG
Sbjct: 214 -----GCRYEVMYGDGSYTKGTLALETLTFGRTVVRNVAIGCGHRNRGMFVGAAGLLGLG 268

Query: 268 RSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPI--TYTNMIPNP 325
              +SLV Q     GG FSYCL S +   ++GSL  G      + + P+   +  +I NP
Sbjct: 269 GGSMSLVGQLGGQTGGAFSYCLVS-RGTDSAGSLEFG------RGAMPVGAAWIPLIRNP 321

Query: 326 QLATFYILNLTGISIGGKQL-------QASGFAKGGILIDSGTVITRLPPSIYSALKAEF 378
           +  +FY + L+G+ +GG ++       Q +    GG+++D+GT +TR+P   Y A +  F
Sbjct: 322 RAPSFYYIRLSGVGVGGMKVPISEDVFQLNEMGNGGVVMDTGTAVTRIPTVAYVAFRDAF 381

Query: 379 LKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQ 438
           + Q    P A G SI DTC+NL+ +  V +P V   F G   +T+     +  V  D   
Sbjct: 382 IGQTGNLPRASGVSIFDTCYNLNGFVSVRVPTVSFYFAGGPILTLPARNFLIPVD-DVGT 440

Query: 439 VCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
            C A A  +      IIGN QQ+  ++ +D  N  +GF    C
Sbjct: 441 FCFAFA--ASPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 481


>gi|357127293|ref|XP_003565317.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial
           [Brachypodium distachyon]
          Length = 540

 Score =  228 bits (581), Expect = 5e-57,   Method: Compositional matrix adjust.
 Identities = 143/373 (38%), Positives = 212/373 (56%), Gaps = 30/373 (8%)

Query: 123 PLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSIS 180
           P+ SG+   +  Y + I +G   R + +++DTGSD+TW+QC PC  CY Q DP+FDP++S
Sbjct: 184 PVVSGVGQGSGEYFSRIGIGSPARQLYMVLDTGSDVTWLQCAPCADCYAQSDPLFDPALS 243

Query: 181 PSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHL---GL 237
            SY  V C+S  C AL+ +  ++   + +S   C Y V+YGDGSYT G+   E L   G 
Sbjct: 244 SSYATVPCDSPHCRALDASACHNNAANGNS--SCVYEVAYGDGSYTVGDFATETLTLGGD 301

Query: 238 GKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGA 297
           G A+V+D   GCG +N+GLF G +GL+ LG   LS  SQ S      FSYCL   +D+ +
Sbjct: 302 GSAAVHDVAIGCGHDNEGLFVGAAGLLALGGGPLSFPSQISAT---EFSYCL-VDRDSPS 357

Query: 298 SGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ---ASGFA--- 351
           + +L  G +     +S+ +T   ++ +P+  TFY + L GIS+GG+ L     + FA   
Sbjct: 358 ASTLQFGAS-----DSSTVT-APLMRSPRSNTFYYVALNGISVGGETLSDIPPAAFAMDE 411

Query: 352 --KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIP 409
              GG+++DSGT +TRL  S YSAL+  F++     P A G S+ DTC++L+    V +P
Sbjct: 412 QGSGGVIVDSGTAVTRLQSSAYSALRDAFVRGTQALPRASGVSLFDTCYDLAGRSSVQVP 471

Query: 410 LVKMEFEGNAEMTVDVTGIVYFVKSD-ASQVCLALASLSYEDETGIIGNYQQKNQRVIYD 468
            V + FEG  E+ +      Y +  D A   CLA A+        I+GN QQ+  RV +D
Sbjct: 472 AVSLRFEGGGELKLPAKN--YLIPVDGAGTYCLAFAATG--GAVSIVGNVQQQGIRVSFD 527

Query: 469 TKNSQLGFAGEDC 481
           T  + +GF+   C
Sbjct: 528 TAKNTVGFSPNKC 540


>gi|302763741|ref|XP_002965292.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
 gi|300167525|gb|EFJ34130.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
          Length = 423

 Score =  228 bits (581), Expect = 6e-57,   Method: Compositional matrix adjust.
 Identities = 154/420 (36%), Positives = 226/420 (53%), Gaps = 46/420 (10%)

Query: 89  QNRLILDNLHVQYLQSRIKNMISGNIKD-----VSNT--------EIPLTSGIRLQTLNY 135
           +NRL  D L +  + SRI   ++G  K      + NT        E PL SG+   +  Y
Sbjct: 22  RNRLHRDELRLLSISSRISLGVAGIPKSSLTNPLKNTNPFLQQDFETPLRSGLSDGSGEY 81

Query: 136 IATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTC 193
             ++ +G   R + ++ DTGSD+ W+QC PC+SCY Q DP+F+PS S +++ + C SS C
Sbjct: 82  FVSLGVGTPPRTVNMVADTGSDVLWLQCLPCQSCYGQTDPLFNPSFSSTFQSITCGSSLC 141

Query: 194 HALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNN 253
             L         C  +    C Y VSYGDGS+T GE   E L  G  +VN    GCG NN
Sbjct: 142 QQLLIRG-----CRRN---QCLYQVSYGDGSFTVGEFSTETLSFGSNAVNSVAIGCGHNN 193

Query: 254 KGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNS 313
           +GLF G +GL+GLG+  LS  SQ  +++G +FSYCLP+ +  G S  LI  GN +V  N+
Sbjct: 194 QGLFTGAAGLLGLGKGLLSFPSQVGQLYGSVFSYCLPTRESTG-SVPLIF-GNQAVASNA 251

Query: 314 TPITYTNMIPNPQLATFYILNLTGISIGGKQLQA--------SGFAKGGILIDSGTVITR 365
               +T ++ NP+L TFY + + GI +GG  +          S    GG+++DSGT +TR
Sbjct: 252 ---QFTTLLTNPKLDTFYYVEMVGIKVGGTSVNIPAGSLSLDSSTGNGGVILDSGTAVTR 308

Query: 366 LPPSIYSALKAEFLKQFSGFPS----APGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEM 421
           L  S Y+ ++  F    +G PS      GFS+ DTC++LS    + +P V   F G A M
Sbjct: 309 LVTSAYNPMRDAFR---AGMPSDAKMTSGFSLFDTCYDLSGRSSIMLPAVSFVFNGGATM 365

Query: 422 TVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
            +    I+  V  ++   CLA A  S  +   IIGN QQ++ R+ +D+  +++G     C
Sbjct: 366 ALPAQNIMVPVD-NSGTYCLAFAPNS--ENFSIIGNIQQQSFRMSFDSTGNRVGIGANQC 422


>gi|302765224|ref|XP_002966033.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
 gi|300166847|gb|EFJ33453.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
          Length = 357

 Score =  228 bits (580), Expect = 6e-57,   Method: Compositional matrix adjust.
 Identities = 142/369 (38%), Positives = 203/369 (55%), Gaps = 25/369 (6%)

Query: 124 LTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISP 181
           +TSG+   +  Y   + +G   +   +++DTGSD+ W+QC PCKSCY Q D VFDP  S 
Sbjct: 3   VTSGLAFGSGEYFVRVGIGSPTKLQYLVMDTGSDVPWIQCSPCKSCYKQNDAVFDPRASS 62

Query: 182 SYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS 241
           S++++ C++  C  L+        C+S+    C Y VSYGDGS+T G+L  +   + +  
Sbjct: 63  SFRRLSCSTPQCKLLDVK-----ACASTD-NRCLYQVSYGDGSFTVGDLASDSFLVSRGR 116

Query: 242 VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSL 301
            +  +FGCG +N+GLF G +GL+GLG   LS  SQ S      FSYCL S  +   + S 
Sbjct: 117 TSPVVFGCGHDNEGLFVGAAGLLGLGAGKLSFPSQLSS---RKFSYCLVSRDNGVRASSA 173

Query: 302 ILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ--------ASGFAKG 353
           +L G+S++   S    YT ++ NP+L TFY   L+GISIGG  L         +S   +G
Sbjct: 174 LLFGDSAL-PTSASFAYTQLLKNPKLDTFYYAGLSGISIGGTLLSIPSTAFKLSSSTGRG 232

Query: 354 GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKM 413
           G++IDSGT +TRLP   Y+ ++  F       P A  FS+ DTC++ SA   V IP V  
Sbjct: 233 GVIIDSGTSVTRLPTYAYTVMRDAFRSATQKLPRAADFSLFDTCYDFSALTSVTIPTVSF 292

Query: 414 EFEGNAEMTVDVTGIVYFVKSDAS-QVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNS 472
            FEG A + +  +   Y V  D S   C A +  S   +  IIGN QQ+  RV  D  +S
Sbjct: 293 HFEGGASVQLPPSN--YLVPVDTSGTFCFAFSKTSL--DLSIIGNIQQQTMRVAIDLDSS 348

Query: 473 QLGFAGEDC 481
           ++GFA   C
Sbjct: 349 RVGFAPRQC 357


>gi|302809855|ref|XP_002986620.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
 gi|300145803|gb|EFJ12477.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
          Length = 423

 Score =  228 bits (580), Expect = 6e-57,   Method: Compositional matrix adjust.
 Identities = 154/420 (36%), Positives = 226/420 (53%), Gaps = 46/420 (10%)

Query: 89  QNRLILDNLHVQYLQSRIKNMISGNIKD-----VSNT--------EIPLTSGIRLQTLNY 135
           +NRL  D L +  + SRI   ++G  K      + NT        E PL SG+   +  Y
Sbjct: 22  RNRLHRDELRLLSISSRISLGVAGIPKSSLTNPLKNTNPFLQQDFETPLRSGLSDGSGEY 81

Query: 136 IATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTC 193
             ++ +G   R + ++ DTGSD+ W+QC PC+SCY Q DP+F+PS S +++ + C SS C
Sbjct: 82  FVSLGVGTPPRTVNMVADTGSDVLWLQCLPCQSCYGQTDPLFNPSFSSTFQSITCGSSLC 141

Query: 194 HALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNN 253
             L         C  +    C Y VSYGDGS+T GE   E L  G  +VN    GCG NN
Sbjct: 142 QQLLIRG-----CRRN---QCLYQVSYGDGSFTVGEFSTETLSFGSNAVNSVAIGCGHNN 193

Query: 254 KGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNS 313
           +GLF G +GL+GLG+  LS  SQ  +++G +FSYCLP+ +  G S  LI  GN +V  N+
Sbjct: 194 QGLFTGAAGLLGLGKGLLSFPSQVGQLYGSVFSYCLPTRESTG-SVPLIF-GNQAVASNA 251

Query: 314 TPITYTNMIPNPQLATFYILNLTGISIGGKQLQA--------SGFAKGGILIDSGTVITR 365
               +T ++ NP+L TFY + + GI +GG  +          S    GG+++DSGT +TR
Sbjct: 252 ---QFTTLLTNPKLDTFYYVEMVGIKVGGTSVSIPAGSLSLDSSTGNGGVILDSGTAVTR 308

Query: 366 LPPSIYSALKAEFLKQFSGFPS----APGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEM 421
           L  S Y+ ++  F    +G PS      GFS+ DTC++LS    + +P V   F G A M
Sbjct: 309 LVTSAYNPMRDAFR---AGMPSDAKMTSGFSLFDTCYDLSGRSSIMLPAVSFVFNGGATM 365

Query: 422 TVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
            +    I+  V  ++   CLA A  S  +   IIGN QQ++ R+ +D+  +++G     C
Sbjct: 366 ALPAQNIMVPVD-NSGTYCLAFAPNS--ENFSIIGNIQQQSFRMSFDSTGNRVGIGANQC 422


>gi|225455876|ref|XP_002275164.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
          Length = 496

 Score =  228 bits (580), Expect = 7e-57,   Method: Compositional matrix adjust.
 Identities = 156/439 (35%), Positives = 230/439 (52%), Gaps = 44/439 (10%)

Query: 66  AITLELKHKNYCSG-KIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKD---VSNTE 121
           + +LEL  +    G    D+     +RL  D+  V+ + ++++  +SG  K      +TE
Sbjct: 79  SFSLELHPRELLHGGSHKDYRALMLSRLARDSARVKAINTKLQLAVSGTDKSDLVPMDTE 138

Query: 122 I--------PLTSGIRLQTLNYIATIELGGRNMT--VIVDTGSDLTWVQCQPCKSCYNQQ 171
           I        P+TSG    +  Y   + +G  + T  +++DTGSD+ W+QC+PC  CY Q 
Sbjct: 139 ILHPQDFSTPVTSGTSQGSGEYFLRVGIGRPSKTFYMVIDTGSDVNWLQCKPCDDCYQQV 198

Query: 172 DPVFDPSISPSYKKVLCNSSTCHALE-FATGNSGVCSSSSPPDCNYFVSYGDGSYTRGEL 230
           DP+FDP+ S S+ ++ C +  C  L+ FA  N           C Y VSYGDGSYT G+ 
Sbjct: 199 DPIFDPASSSSFSRLGCQTPQCRNLDVFACRNDS---------CLYQVSYGDGSYTVGDF 249

Query: 231 GREHLGLGKA-SVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL 289
             E +  G + SV+    GCG +N+GLF G +GL+GLG   LSL   TS+I    FSYCL
Sbjct: 250 ATETVSFGNSGSVDKVAIGCGHDNEGLFVGAAGLIGLGGGPLSL---TSQIKASSFSYCL 306

Query: 290 PSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL---- 345
              +D+  S +L           + PI       N ++ TFY + +TG+S+GG++L    
Sbjct: 307 -VNRDSVDSSTLEFNSAKPSDSVTAPI-----FKNSKVDTFYYVGITGMSVGGEKLAIPP 360

Query: 346 ---QASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSA 402
              +  G  KGGI++D GT +TRL    Y+AL+  F+K     PS  GF++ DTC+NLS+
Sbjct: 361 SIFEVDGSGKGGIIVDCGTAVTRLQTQAYNALRDTFVKLTKDLPSTSGFALFDTCYNLSS 420

Query: 403 YQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKN 462
              V +P V   F+G   + +  +  +  V S A   CLA A  +      IIGN QQ+ 
Sbjct: 421 RTSVRVPTVAFLFDGGKSLPLPPSNYLIPVDS-AGTFCLAFAPTTA--SLSIIGNVQQQG 477

Query: 463 QRVIYDTKNSQLGFAGEDC 481
            RV YD  NSQ+ F+   C
Sbjct: 478 TRVTYDLANSQVSFSSRKC 496


>gi|225463766|ref|XP_002267930.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 479

 Score =  227 bits (579), Expect = 9e-57,   Method: Compositional matrix adjust.
 Identities = 147/425 (34%), Positives = 218/425 (51%), Gaps = 31/425 (7%)

Query: 69  LELKHKNYCS-GKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSG 127
           +++ H++  S G   D   +   RL  D   V  L  R+ +   G+ + V +    + SG
Sbjct: 74  MKVVHRDQLSFGNSDDHRHRLDGRLKRDAKRVASLIRRLSSGGGGSYR-VDDFGTDVISG 132

Query: 128 IRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKK 185
           +   +  Y   I +G   R+  +++D+GSD+ WVQCQPC  CY+Q DPVFDP+ S S+  
Sbjct: 133 MEQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCTQCYHQSDPVFDPADSASFTG 192

Query: 186 VLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDF 245
           V C+SS C  LE A  ++G         C Y VSYGDGSYT+G L  E L  G+  V   
Sbjct: 193 VSCSSSVCDRLENAGCHAG--------RCRYEVSYGDGSYTKGTLALETLTFGRTMVRSV 244

Query: 246 IFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGG 305
             GCG  N+G+F G +GL+GLG   +S V Q     GG FSYCL S +   +SGSL+ G 
Sbjct: 245 AIGCGHRNRGMFVGAAGLLGLGGGSMSFVGQLGGQTGGAFSYCLVS-RGTDSSGSLVFG- 302

Query: 306 NSSVFKNSTP--ITYTNMIPNPQLATFYILNLTGISIGGKQLQAS-------GFAKGGIL 356
                + + P    +  ++ NP+  +FY + L G+ +GG ++  S           GG++
Sbjct: 303 -----REALPAGAAWVPLVRNPRAPSFYYIGLAGLGVGGIRVPISEEVFRLTELGDGGVV 357

Query: 357 IDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFE 416
           +D+GT +TRLP   Y A +  FL Q +  P A G +I DTC++L  +  V +P V   F 
Sbjct: 358 MDTGTAVTRLPTLAYQAFRDAFLAQTANLPRATGVAIFDTCYDLLGFVSVRVPTVSFYFS 417

Query: 417 GNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGF 476
           G   +T+     +     DA   C A A  +      I+GN QQ+  ++ +D  N  +GF
Sbjct: 418 GGPILTLPARNFL-IPMDDAGTFCFAFAPST--SGLSILGNIQQEGIQISFDGANGYVGF 474

Query: 477 AGEDC 481
               C
Sbjct: 475 GPNIC 479


>gi|125555056|gb|EAZ00662.1| hypothetical protein OsI_22683 [Oryza sativa Indica Group]
          Length = 491

 Score =  226 bits (577), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 151/372 (40%), Positives = 201/372 (54%), Gaps = 28/372 (7%)

Query: 122 IPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKS---CYNQQDPVFD 176
           IP  SG  L TL ++  + LG   +   +I DTGSDL+WVQCQPC S   C+ QQDP+FD
Sbjct: 136 IPDRSGTYLDTLEFVVAVGLGTPAQPSALIFDTGSDLSWVQCQPCGSSGHCHPQQDPLFD 195

Query: 177 PSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLG 236
           PS S +Y  V C    C A        G+CS  +   C Y V YGDGS T G L R+ L 
Sbjct: 196 PSKSSTYAAVHCGEPQCAA------AGGLCSEDNT-TCLYLVHYGDGSSTTGVLSRDTLA 248

Query: 237 LGKA-SVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDA 295
           L  + ++  F FGCG  N G FG V GL+GLGR +LSL SQ +  FG +FSYCLPS+   
Sbjct: 249 LTSSRALAGFPFGCGTRNLGDFGRVDGLLGLGRGELSLPSQAAASFGAVFSYCLPSSNS- 307

Query: 296 GASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQA--SGFAKG 353
             +G L +G   +   ++    YT M+  PQ  +FY + L  I IGG  L    + F +G
Sbjct: 308 -TTGYLTIGATPAT--DTGAAQYTAMLRKPQFPSFYFVELVSIDIGGYILPVPPAVFTRG 364

Query: 354 GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKM 413
           G L+DSGTV+T LP   Y  L+  F      +  AP   +LD C++ +   EV +P V  
Sbjct: 365 GTLLDSGTVLTYLPAQAYELLRDRFRLTMERYTPAPPNDVLDACYDFAGESEVIVPAVSF 424

Query: 414 EFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETG----IIGNYQQKNQRVIYDT 469
            F   A   +D  G++ F+  D +  CLA A++   D  G    IIGN QQ++  VIYD 
Sbjct: 425 RFGDGAVFELDFFGVMIFL--DENVGCLAFAAM---DAGGLPLSIIGNTQQRSAEVIYDV 479

Query: 470 KNSQLGFAGEDC 481
              ++GF    C
Sbjct: 480 AAEKIGFVPASC 491


>gi|356502456|ref|XP_003520035.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 484

 Score =  226 bits (576), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 173/505 (34%), Positives = 256/505 (50%), Gaps = 66/505 (13%)

Query: 3   TKVKPLTILSLLLPLMVSLFLLAKGAHCFEGKKKLHLHKLQWQQKSGS-SSSCVSHQKSR 61
           ++  P +  + LL ++ SL    + AH        HL++ Q QQ++   SSS   H +SR
Sbjct: 20  SRSTPHSSKTTLLDVVSSL----QNAHNAVAFTPHHLNQHQRQQEALLLSSSFGIHLRSR 75

Query: 62  IEMGAITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTE 121
               A   +  H++Y S  +        +RL  D+  V+ LQ+R+  ++    K VSN++
Sbjct: 76  ----ASIQKPSHRDYKSLTL--------SRLARDSARVKSLQTRLDLVL----KRVSNSD 119

Query: 122 I----------------PLTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQP 163
           +                P+ SG    +  Y   + +G       V++DTGSD++W+QC P
Sbjct: 120 LHPAESNAEFEANALQGPVVSGTSQGSGEYFLRVGIGKPPSQAYVVLDTGSDVSWIQCAP 179

Query: 164 CKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDG 223
           C  CY Q DP+FDP  S SY  + C++  C +L+ +   +G C         Y VSYGDG
Sbjct: 180 CSECYQQSDPIFDPVSSNSYSPIRCDAPQCKSLDLSECRNGTCL--------YEVSYGDG 231

Query: 224 SYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGG 283
           SYT GE   E + LG A+V +   GCG NN+GLF G +GL+GLG   LS  +Q   +   
Sbjct: 232 SYTVGEFATETVTLGTAAVENVAIGCGHNNEGLFVGAAGLLGLGGGKLSFPAQ---VNAT 288

Query: 284 LFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGK 343
            FSYCL   +D+ A  +L    NS + +N   +    +  NP+L TFY L L GIS+GG+
Sbjct: 289 SFSYCL-VNRDSDAVSTLEF--NSPLPRN---VVTAPLRRNPELDTFYYLGLKGISVGGE 342

Query: 344 QL-------QASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDT 396
            L       +      GGI+IDSGT +TRL   +Y AL+  F+K   G P A G S+ DT
Sbjct: 343 ALPIPESIFEVDAIGGGGIIIDSGTAVTRLRSEVYDALRDAFVKGAKGIPKANGVSLFDT 402

Query: 397 CFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIG 456
           C++LS+ + V +P V   F    E+ +     +  V S     C A A  +      I+G
Sbjct: 403 CYDLSSRESVQVPTVSFHFPEGRELPLPARNYLIPVDS-VGTFCFAFAPTT--SSLSIMG 459

Query: 457 NYQQKNQRVIYDTKNSQLGFAGEDC 481
           N QQ+  RV +D  NS +GF+ + C
Sbjct: 460 NVQQQGTRVGFDIANSLVGFSADSC 484


>gi|356498306|ref|XP_003517994.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 484

 Score =  226 bits (575), Expect = 3e-56,   Method: Compositional matrix adjust.
 Identities = 173/507 (34%), Positives = 252/507 (49%), Gaps = 70/507 (13%)

Query: 3   TKVKPLTILSLLLPLMVSLFLLAKGAHCFEGKKKLHLHKLQWQQKSG---SSSSCVSHQK 59
           ++  P +  + LL ++ SL    + AH        H +K Q QQ+S    SS     H +
Sbjct: 20  SRTTPHSPQTTLLDVVSSL----QNAHNVVAFTHHHPNKHQRQQESSLLTSSFGIQLHSR 75

Query: 60  SRIEMGAITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSN 119
           + I+  +      H +Y S  +        +RL  D+  V+ LQ+R+   +    K VSN
Sbjct: 76  ASIQKSS------HSDYKSLTL--------SRLARDSARVKALQTRLDLFL----KRVSN 117

Query: 120 TEI----------------PLTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQC 161
           +++                P+ SG    +  Y   + +G       V++DTGSD++W+QC
Sbjct: 118 SDLHPAESKAEFESNALQGPVVSGTSQGSGEYFLRVGIGKPPSQAYVVLDTGSDVSWIQC 177

Query: 162 QPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYG 221
            PC  CY Q DP+FDP  S SY  + C+   C +L+ +   +G C         Y VSYG
Sbjct: 178 APCSECYQQSDPIFDPISSNSYSPIRCDEPQCKSLDLSECRNGTCL--------YEVSYG 229

Query: 222 DGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIF 281
           DGSYT GE   E + LG A+V +   GCG NN+GLF G +GL+GLG   LS  +Q   + 
Sbjct: 230 DGSYTVGEFATETVTLGSAAVENVAIGCGHNNEGLFVGAAGLLGLGGGKLSFPAQ---VN 286

Query: 282 GGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIG 341
              FSYCL   +D+ A  +L    NS + +N+       ++ NP+L TFY L L GIS+G
Sbjct: 287 ATSFSYCL-VNRDSDAVSTLEF--NSPLPRNA---ATAPLMRNPELDTFYYLGLKGISVG 340

Query: 342 GKQL-------QASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSIL 394
           G+ L       +      GGI+IDSGT +TRL   +Y AL+  F+K   G P A G S+ 
Sbjct: 341 GEALPIPESSFEVDAIGGGGIIIDSGTAVTRLRSEVYDALRDAFVKGAKGIPKANGVSLF 400

Query: 395 DTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGI 454
           DTC++LS+ + V IP V   F    E+ +     +  V S     C A A  +      I
Sbjct: 401 DTCYDLSSRESVEIPTVSFRFPEGRELPLPARNYLIPVDS-VGTFCFAFAPTT--SSLSI 457

Query: 455 IGNYQQKNQRVIYDTKNSQLGFAGEDC 481
           IGN QQ+  RV +D  NS +GF+ + C
Sbjct: 458 IGNVQQQGTRVGFDIANSLVGFSVDSC 484


>gi|302753526|ref|XP_002960187.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
 gi|300171126|gb|EFJ37726.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
          Length = 398

 Score =  225 bits (574), Expect = 3e-56,   Method: Compositional matrix adjust.
 Identities = 152/409 (37%), Positives = 222/409 (54%), Gaps = 38/409 (9%)

Query: 99  VQYLQSRI--KNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGS 154
           VQ L+S++   ++I+  +    +      S +     +Y+ TI LG   +  +VI DTGS
Sbjct: 2   VQALRSKLAASSLITSEVPYPPSVSTDYESPVASGGGDYVTTISLGTPAKVFSVIADTGS 61

Query: 155 DLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDC 214
           DL W+QC+PC++C+NQ+DP+FDP  S SY  + C  + C +L   +     CS    P+C
Sbjct: 62  DLIWIQCKPCQACFNQKDPIFDPEGSSSYTTMSCGDTLCDSLPRKS-----CS----PNC 112

Query: 215 NYFVSYGDGSYTRGELGREHLGLG-----KASVNDFIFGCGRNNKGLFGGVSGLMGLGRS 269
           +Y   YGDGS TRG L  E + L      K +  +  FGCG  N+G F   SGL+GLGR 
Sbjct: 113 DYSYGYGDGSGTRGTLSSETVTLTSTQGEKLAAKNIAFGCGHLNRGSFNDASGLVGLGRG 172

Query: 270 DLSLVSQTSEIFGGLFSYCLPSTQDAGASGS-LILGGNSSVFKNSTPITY--TNMIPNPQ 326
           +LS VSQ  ++FG  FSYCL   +DA +  S +  G  SS   +   + Y  T MI NP 
Sbjct: 173 NLSFVSQLGDLFGHKFSYCLVPWRDAPSKTSPMFFGDESSSHSSGKKLHYAFTPMIHNPA 232

Query: 327 LATFYILNLTGISIGGKQLQ--ASGF-----AKGGILIDSGTVITRLPPSIYS-ALKAEF 378
           + +FY + L  ISI G+ L+  A  F       GG++ DSGT +T LP + Y   L+A  
Sbjct: 233 MESFYYVKLKDISIAGRALRIPAGSFDIKPDGSGGMIFDSGTTLTLLPDAPYQIVLRA-- 290

Query: 379 LKQFSGFPSAPGFSI-LDTCFNLS---AYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKS 434
           L+    FP   G S  LD C+++S   A  +  IP +   FEG A+  + V    YF+ +
Sbjct: 291 LRSKVSFPEIDGSSAGLDLCYDVSGSKASYKKKIPAMVFHFEG-ADHQLPVEN--YFIAA 347

Query: 435 DASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSS 483
           + +   + LA +S   + GI GN  Q+N RV+YD  +S++G+A   C S
Sbjct: 348 NDAGTIVCLAMVSSNMDIGIYGNMMQQNFRVMYDIGSSKIGWAPSQCDS 396


>gi|54290728|dbj|BAD62398.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
          Length = 486

 Score =  225 bits (574), Expect = 3e-56,   Method: Compositional matrix adjust.
 Identities = 150/373 (40%), Positives = 200/373 (53%), Gaps = 30/373 (8%)

Query: 122 IPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKS---CYNQQDPVFD 176
           IP  SG  L TL ++  + LG   +   +I DTGSDL+WVQCQPC S   C+ QQDP+FD
Sbjct: 131 IPDRSGTYLDTLEFVVAVGLGTPAQPSALIFDTGSDLSWVQCQPCGSSGHCHPQQDPLFD 190

Query: 177 PSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLG 236
           PS S +Y  V C    C A       +G   S     C Y V YGDGS T G L R+ L 
Sbjct: 191 PSKSSTYAAVHCGEPQCAA-------AGDLCSEDNTTCLYLVRYGDGSSTTGVLSRDTLA 243

Query: 237 LGKA-SVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDA 295
           L  + ++  F FGCG  N G FG V GL+GLGR +LSL SQ +  FG +FSYCLPS+   
Sbjct: 244 LTSSRALTGFPFGCGTRNLGDFGRVDGLLGLGRGELSLPSQAAASFGAVFSYCLPSSNS- 302

Query: 296 GASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASG--FAKG 353
             +G L +G   +   ++    YT M+  PQ  +FY + L  I IGG  L      F +G
Sbjct: 303 -TTGYLTIGATPAT--DTGAAQYTAMLRKPQFPSFYFVELVSIDIGGYVLPVPPAVFTRG 359

Query: 354 GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKM 413
           G L+DSGTV+T LP   Y+ L+  F      +  AP   +LD C++ +   EV +P V  
Sbjct: 360 GTLLDSGTVLTYLPAQAYALLRDRFRLTMERYTPAPPNDVLDACYDFAGESEVVVPAVSF 419

Query: 414 EFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETG-----IIGNYQQKNQRVIYD 468
            F   A   +D  G++ F+  D +  CLA A++    +TG     IIGN QQ++  VIYD
Sbjct: 420 RFGDGAVFELDFFGVMIFL--DENVGCLAFAAM----DTGGLPLSIIGNTQQRSAEVIYD 473

Query: 469 TKNSQLGFAGEDC 481
               ++GF    C
Sbjct: 474 VAAEKIGFVPASC 486


>gi|218188634|gb|EEC71061.1| hypothetical protein OsI_02803 [Oryza sativa Indica Group]
          Length = 479

 Score =  225 bits (574), Expect = 4e-56,   Method: Compositional matrix adjust.
 Identities = 157/435 (36%), Positives = 219/435 (50%), Gaps = 31/435 (7%)

Query: 65  GAITLELKHK-NYCSGKIVDWNEQQ---QNRLILDNLHVQYLQSRIKN---MISGNIKDV 117
           G  ++ L H+   CS    +  E++   +  L  D L   Y++ +        +G     
Sbjct: 58  GTSSVTLSHRYGPCSPADPNSGEKRPTDEELLRRDQLRADYIRRKFSGSNGTAAGEDGQS 117

Query: 118 SNTEIPLTSGIRLQTLNYIATIELGGRNMT--VIVDTGSDLTWVQCQPCKS---CYNQQD 172
           S   +P T G  L TL Y+ ++ LG   MT  V++DTGSD++WVQC+PC +   C+    
Sbjct: 118 SKVSVPTTLGSSLDTLEYVISVGLGSPAMTQRVVIDTGSDVSWVQCEPCPAPSPCHAHAG 177

Query: 173 PVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGR 232
            +FDP+ S +Y    C+++ C  L   +G +  C + S   C Y V YGDGS T G    
Sbjct: 178 ALFDPAASSTYAAFNCSAAACAQLG-DSGEANGCDAKS--RCQYIVKYGDGSNTTGTYSS 234

Query: 233 EHLGL-GKASVNDFIFGCGRNN--KGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL 289
           + L L G   V  F FGC       G+     GL+GLG    SLVSQT+  +G  FSYCL
Sbjct: 235 DVLTLSGSDVVRGFQFGCSHAELGAGMDDKTDGLIGLGGDAQSLVSQTAARYGKSFSYCL 294

Query: 290 PSTQDAGASGSLILGGNSSVFKNSTP-ITYTNMIPNPQLATFYILNLTGISIGGKQL--Q 346
           P+T    +SG L LG  +S           T M+ + ++ T+Y   L  I++GGK+L   
Sbjct: 295 PATP--ASSGFLTLGAPASGGGGGASRFATTPMLRSKKVPTYYFAALEDIAVGGKKLGLS 352

Query: 347 ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEV 406
            S FA G  L+DSGTVITRLPP+ Y+AL + F    + +  A    ILDTCFN +   +V
Sbjct: 353 PSVFAAGS-LVDSGTVITRLPPAAYAALSSAFRAGMTRYARAEPLGILDTCFNFTGLDKV 411

Query: 407 NIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVI 466
           +IP V + F G A + +D  GIV       S  CLA A    +   G IGN QQ+   V+
Sbjct: 412 SIPTVALVFAGGAVVDLDAHGIV-------SGGCLAFAPTRDDKAFGTIGNVQQRTFEVL 464

Query: 467 YDTKNSQLGFAGEDC 481
           YD      GF    C
Sbjct: 465 YDVGGGVFGFRAGAC 479


>gi|168064509|ref|XP_001784204.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162664276|gb|EDQ51002.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 367

 Score =  224 bits (571), Expect = 7e-56,   Method: Compositional matrix adjust.
 Identities = 131/378 (34%), Positives = 205/378 (54%), Gaps = 29/378 (7%)

Query: 121 EIPLTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPS 178
           E P+ SG+   T  Y A + +G   R+M ++VDTGSD+TW+QC PC +CY Q+D +F+PS
Sbjct: 2   EAPIFSGLAFGTGEYFAVVGVGTPRRDMYLVVDTGSDITWLQCAPCTNCYKQKDALFNPS 61

Query: 179 ISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHL--- 235
            S S+K + C+SS C  L+        C S+    C Y   YGDGS+T GEL  +++   
Sbjct: 62  SSSSFKVLDCSSSLCLNLDVMG-----CLSNK---CLYQADYGDGSFTMGELVTDNVVLD 113

Query: 236 ---GLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPST 292
              G G+  + +   GCG +N+G FG  +G++GLGR  LS  +        +FSYCLP  
Sbjct: 114 DAFGPGQVVLTNIPLGCGHDNEGTFGTAAGILGLGRGPLSFPNNLDASTRNIFSYCLPDR 173

Query: 293 QDAGASGSLILGGNSSVFKNST-PITYTNMIPNPQLATFYILNLTGISIGGKQL------ 345
           +      S ++ G++++   +T  + +   + NP++AT+Y + +TGIS+GG  L      
Sbjct: 174 ESDPNHKSTLVFGDAAIPHTATGSVKFIPQLRNPRVATYYYVQITGISVGGNLLTNIPAS 233

Query: 346 --QASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAY 403
             Q      GG + DSGT ITRL    Y+A++  F        SA  F I DTC++ +  
Sbjct: 234 VFQLDSHGNGGTIFDSGTTITRLEARAYTAVRDAFRAATMHLTSAADFKIFDTCYDFTGM 293

Query: 404 QEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQ 463
             +++P V   F+G+ +M +  +  +  V S+ +  C A A+        +IGN QQ++ 
Sbjct: 294 NSISVPTVTFHFQGDVDMRLPPSNYIVPV-SNNNIFCFAFAA---SMGPSVIGNVQQQSF 349

Query: 464 RVIYDTKNSQLGFAGEDC 481
           RVIYD  + Q+G   + C
Sbjct: 350 RVIYDNVHKQIGLLPDQC 367


>gi|224142011|ref|XP_002324354.1| predicted protein [Populus trichocarpa]
 gi|222865788|gb|EEF02919.1| predicted protein [Populus trichocarpa]
          Length = 471

 Score =  224 bits (571), Expect = 7e-56,   Method: Compositional matrix adjust.
 Identities = 149/393 (37%), Positives = 216/393 (54%), Gaps = 31/393 (7%)

Query: 103 QSRIKNMISGNIKDVSNTEIPLTSGIRLQTLN----YIATIELG--GRNMTVIVDTGSDL 156
           Q R+K++ + +  + S T +      R+ T +    Y  T+ LG   ++ +++ DTGSDL
Sbjct: 96  QLRVKSIRAKHSMNSSTTGVFNEMKTRVPTTHFGGGYAVTVGLGTPKKDFSLLFDTGSDL 155

Query: 157 TWVQCQPCKS-CYNQQDPVFDPSISPSYKKVLCNSSTCHAL--EFATGNSGVCSSSSPPD 213
           TW QC+PC   C+ Q D  FDP+ S SYK + C+S  C ++  E A G    CSSS+   
Sbjct: 156 TWTQCEPCSGGCFPQNDEKFDPTKSTSYKNLSCSSEPCKSIGKESAQG----CSSSN--S 209

Query: 214 CNYFVSYGDGSYTRGELGREHLGLGKASV-NDFIFGCGRNNKGLFGGVSGLMGLGRSDLS 272
           C Y V YG G YT G L  E L +  + V  +F+ GCG  N G F G +GL+GLGRS ++
Sbjct: 210 CLYGVKYGTG-YTVGFLATETLTITPSDVFENFVIGCGERNGGRFSGTAGLLGLGRSPVA 268

Query: 273 LVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYI 332
           L SQTS  +  LFSYCLP++  + ++G L  GG  S     TPI  T+ IP       Y 
Sbjct: 269 LPSQTSSTYKNLFSYCLPAS--SSSTGHLSFGGGVSQAAKFTPI--TSKIPE-----LYG 319

Query: 333 LNLTGISIGGKQL--QASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPG 390
           L+++GIS+GG++L    S F   G +IDSGT +T LP + +SAL + F +  + +    G
Sbjct: 320 LDVSGISVGGRKLPIDPSVFRTAGTIIDSGTTLTYLPSTAHSALSSAFQEMMTNYTLTKG 379

Query: 391 FSILDTCFNLS--AYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSY 448
            S L  C++ S  A   + IP + + FEG  E+ +D +GI +   +   +VCLA      
Sbjct: 380 TSGLQPCYDFSKHANDNITIPQISIFFEGGVEVDIDDSGI-FIAANGLEEVCLAFKDNGN 438

Query: 449 EDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
           + +  I GN QQK   V+YD     +GFA   C
Sbjct: 439 DTDVAIFGNVQQKTYEVVYDVAKGMVGFAPGGC 471


>gi|116787367|gb|ABK24480.1| unknown [Picea sitchensis]
          Length = 496

 Score =  224 bits (570), Expect = 9e-56,   Method: Compositional matrix adjust.
 Identities = 153/444 (34%), Positives = 229/444 (51%), Gaps = 46/444 (10%)

Query: 66  AITLELKHKNY-----CSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMI------SGNI 114
           A +++L H++       +     +  + + +L  +   V+ L+ RI+  +      +G+ 
Sbjct: 70  AWSVQLVHRDSLLFKGAANATASYERRLEEKLRREAARVRALEQRIERKLKLKKDPAGSY 129

Query: 115 KDVSNTEIP----LTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCY 168
           ++V+         + SG+   +  Y   I +G   R   +++DTGSD+ W+QC+PC+ CY
Sbjct: 130 ENVAGVTAEFGSEVVSGMEQGSGEYFTRIGIGTPTREQYMVLDTGSDVVWIQCEPCRECY 189

Query: 169 NQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRG 228
           +Q DP+F+PS S S+  V C+S+ C  L+    + G         C Y VSYGDGSYT G
Sbjct: 190 SQADPIFNPSSSVSFSTVGCDSAVCSQLDANDCHGG--------GCLYEVSYGDGSYTVG 241

Query: 229 ELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYC 288
               E L  G  S+ +   GCG +N GLF G +GL+GLG   LS  +Q     G  FSYC
Sbjct: 242 SYATETLTFGTTSIQNVAIGCGHDNVGLFVGAAGLLGLGAGSLSFPAQLGTQTGRAFSYC 301

Query: 289 LPSTQDAGASGSLILGGNSSVFKNSTPI--TYTNMIPNPQLATFYILNLTGISIGG---K 343
           L   +D+ +SG+L  G        S PI   +T ++ NP L TFY L++  IS+GG    
Sbjct: 302 L-VDRDSESSGTLEFG------PESVPIGSIFTPLVANPFLPTFYYLSMVAISVGGVILD 354

Query: 344 QLQASGF------AKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTC 397
            + +  F       +GGI+IDSGT +TRL  S Y AL+  F+      P A G SI DTC
Sbjct: 355 SVPSEAFRIDETTGRGGIIIDSGTAVTRLQTSAYDALRDAFIAGTQHLPRADGISIFDTC 414

Query: 398 FNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGN 457
           ++LSA Q V+IP V   F   A   +     +  + S  +  C A A    +    I+GN
Sbjct: 415 YDLSALQSVSIPAVGFHFSNGAGFILPAKNCLIPMDSMGT-FCFAFAPA--DSNLSIMGN 471

Query: 458 YQQKNQRVIYDTKNSQLGFAGEDC 481
            QQ+  RV +D+ NS +GFA + C
Sbjct: 472 IQQQGIRVSFDSANSLVGFAIDQC 495


>gi|357118074|ref|XP_003560784.1| PREDICTED: aspartic proteinase nepenthesin-2-like, partial
           [Brachypodium distachyon]
          Length = 452

 Score =  223 bits (567), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 146/373 (39%), Positives = 195/373 (52%), Gaps = 20/373 (5%)

Query: 116 DVSNTEIPLTSGIRLQTLNYIATIELGGRNMT--VIVDTGSDLTWVQCQPCKS-CYNQQD 172
           +  +  IP  +G  L+T  ++  +  G    T   + DTGSDL+W+QCQPC   CY Q D
Sbjct: 93  EAPSATIPDHTGTNLKTPEFVVVVGFGSPAQTSATMFDTGSDLSWIQCQPCSGHCYKQHD 152

Query: 173 PVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGR 232
           PVFDP+ S SY  V C ++ C A        G C+ ++   C Y V YGDGS T G L R
Sbjct: 153 PVFDPAKSSSYAVVPCGTTECAAA------GGECNGTT---CVYGVEYGDGSSTTGVLAR 203

Query: 233 EHLGLGKAS-VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPS 291
           E L    +S    FIFGCG  N G FG V GL+GLGR  LSL SQ +  FGG+FSYCLPS
Sbjct: 204 ETLTFSSSSEFTGFIFGCGETNLGDFGEVDGLLGLGRGSLSLSSQAAPAFGGIFSYCLPS 263

Query: 292 TQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL--QASG 349
                  G L +G  ++      P+ YT M+  P   +FY + L  I+IGG  L    S 
Sbjct: 264 YNT--TPGYLSIG--ATPVTGQIPVQYTAMVNKPDYPSFYFIELVSINIGGYVLPVPPSE 319

Query: 350 FAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIP 409
           F K G L+DSGT++T LPP  Y+AL+  F     G   AP +  LDTC++ +    + IP
Sbjct: 320 FTKTGTLLDSGTILTYLPPPAYTALRDRFKFTMQGSKPAPPYDELDTCYDFTGQSGILIP 379

Query: 410 LVKMEFEGNAEMTVDVTGIVYFVKSDASQV-CLALASLSYEDETGIIGNYQQKNQRVIYD 468
            V   F   A   ++  GI+ F       V CLA  S   +    ++G+  Q++  VIYD
Sbjct: 380 GVSFNFSDGAVFNLNFFGIMTFPDDTKPAVGCLAFVSRPADMPFSVVGSTTQRSAEVIYD 439

Query: 469 TKNSQLGFAGEDC 481
               ++GF    C
Sbjct: 440 VPAQKIGFIPASC 452


>gi|357137345|ref|XP_003570261.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 458

 Score =  223 bits (567), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 160/427 (37%), Positives = 217/427 (50%), Gaps = 35/427 (8%)

Query: 68  TLELKHKN-YCSGKIVDWNEQQQNRLILDNLHVQYLQSRIK-NMISGN--IKDVSNTEIP 123
           T+ L H++  CS             L  D L  +Y+Q+++  N  SG   ++  +   +P
Sbjct: 54  TVPLSHRHGPCSPAPSTVEPTMAELLRRDQLRAKYIQAKLSVNSGSGTDGVQQSAAITLP 113

Query: 124 LTSGIRLQTLNYIATIELGGRNMT--VIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISP 181
            T G  L TL Y+ T+ +G   MT  V++DTGSD++WV C       +     FDP  S 
Sbjct: 114 TTLGSALDTLAYVITVSIGTPAMTQAVMIDTGSDVSWVHCHARAGAGSSL--FFDPGKSS 171

Query: 182 SYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS 241
           +Y    C+S+ C  LE   G    CS +S   C Y V YGDGS T G  G + L L    
Sbjct: 172 TYTPFSCSSAACTRLE---GRDNGCSLNS--TCQYTVRYGDGSNTTGTYGSDTLALNSTE 226

Query: 242 -VNDFIFGCGRNNKGLFG----GVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAG 296
            V +F FGC   +    G       GLMGLG    SLVSQT+  +G  FSYCLP+T  + 
Sbjct: 227 KVENFQFGCSETSDPGEGLDEDQTDGLMGLGGGAPSLVSQTAATYGSAFSYCLPATTRS- 285

Query: 297 ASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASG--FAKGG 354
            SG L LG ++     ++    T M  + +  TFY + L GI++GG  +  S   FA G 
Sbjct: 286 -SGFLTLGAST----GTSGFVTTPMFRSRRAPTFYFVILQGINVGGDPVAISPTVFAAGS 340

Query: 355 ILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKME 414
           I+ DSGT+ITRLPP  YSAL A F      +P A  FSILDTCF+ +    V+IP V++ 
Sbjct: 341 IM-DSGTIITRLPPRAYSALSAAFRAGMRRYPRARAFSILDTCFDFTGQDNVSIPAVELV 399

Query: 415 FEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQL 474
           F G A + +D  GI+Y         CLA A  +    + IIGN QQ+   V++D   S L
Sbjct: 400 FSGGAVVDLDADGIMY-------GSCLAFAPATGGIGS-IIGNVQQRTFEVLHDVGQSVL 451

Query: 475 GFAGEDC 481
           GF    C
Sbjct: 452 GFRPGAC 458


>gi|413938618|gb|AFW73169.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
          Length = 324

 Score =  222 bits (566), Expect = 3e-55,   Method: Compositional matrix adjust.
 Identities = 142/340 (41%), Positives = 193/340 (56%), Gaps = 26/340 (7%)

Query: 150 VDTGSDLTWVQCQPC---KSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVC 206
           VDTGSDL+WVQC+PC    SCY+Q+DP+FDP+ S SY  V C    C  L     ++   
Sbjct: 3   VDTGSDLSWVQCKPCAAAPSCYSQKDPLFDPAQSSSYAAVPCGGPVCAGLGIYAASACSA 62

Query: 207 SSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS-VNDFIFGCGRNNKGLFGGVSGLMG 265
           +        Y VSYGDGS T G    + L L  +S V  F FGCG    GLF GV GL+G
Sbjct: 63  AQC-----GYVVSYGDGSNTTGVYSSDTLTLSASSAVQGFFFGCGHAQSGLFNGVDGLLG 117

Query: 266 LGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNP 325
           LGR   SLV QT+  +GG+FSYCLP+        +L +GG S         + T ++P+P
Sbjct: 118 LGREQPSLVEQTAGTYGGVFSYCLPTKPSTAGYLTLGVGGPSGAAPG---FSTTQLLPSP 174

Query: 326 QLATFYILNLTGISIGGKQLQ--ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFS 383
              T+Y++ LTGIS+GG+QL   AS FA  G ++D+GTV+TRLPP+ Y+AL++ F    +
Sbjct: 175 NAPTYYVVMLTGISVGGQQLSVPASAFAG-GTVVDTGTVVTRLPPTAYAALRSAFRSGMA 233

Query: 384 --GFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCL 441
             G+P+AP   ILDTC+N + Y  V +P V + F   A +T+   GI+ F        CL
Sbjct: 234 SYGYPTAPSNGILDTCYNFAGYGTVTLPNVALTFGSGATVTLGADGILSF-------GCL 286

Query: 442 ALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
           A A    +    I+GN QQ++  V  D   + +GF    C
Sbjct: 287 AFAPSGSDGGMAILGNVQQRSFEVRID--GTSVGFKPSSC 324


>gi|413953782|gb|AFW86431.1| hypothetical protein ZEAMMB73_038825 [Zea mays]
          Length = 462

 Score =  222 bits (566), Expect = 3e-55,   Method: Compositional matrix adjust.
 Identities = 150/423 (35%), Positives = 207/423 (48%), Gaps = 36/423 (8%)

Query: 84  WNEQQQNRLILDN-----LHVQYLQS--------RIKNMISGNIKDVSNTE-----IPLT 125
           WN+ +  RLI        L + YL +        R + +       +   E     IP +
Sbjct: 51  WNKSEVPRLISRTCNGRPLPLDYLWTYGPAPSPHRPRGIPISYPPTIPPAEAPAVTIPDS 110

Query: 126 SGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKS-CYNQQDPVFDPSISPS 182
           +G  L TL ++ T+  G   +  T++ DTGSD++W+QC PC   CY Q DP+FDP+ S +
Sbjct: 111 TGTSLGTLEFVVTVGFGTPAQTYTLMFDTGSDVSWIQCLPCSGHCYKQHDPIFDPTKSAT 170

Query: 183 YKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA-S 241
           Y  V C    C A        G CSS+    C Y V YGDGS T G L  E L L  A +
Sbjct: 171 YSAVPCGHPQCAA------AGGKCSSNG--TCLYKVQYGDGSSTAGVLSHETLSLTSARA 222

Query: 242 VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSL 301
           +  F FGCG  N G FG V GL+GLGR  LSL SQ +  FG  FSYCLPS     + G L
Sbjct: 223 LPGFAFGCGETNLGDFGDVDGLIGLGRGQLSLSSQAAASFGAAFSYCLPSYNT--SHGYL 280

Query: 302 ILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASG--FAKGGILIDS 359
            +G  +     S  + YT MI      +FY ++L  I +GG  L      F + G L+DS
Sbjct: 281 TIGTTTPA-SGSDGVRYTAMIQKQDYPSFYFVDLVSIVVGGFVLPVPPILFTRDGTLLDS 339

Query: 360 GTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNA 419
           GTV+T LPP  Y+AL+  F    + +  AP +   DTC++ +    + +PLV  +F   +
Sbjct: 340 GTVLTYLPPEAYTALRDRFKFTMTQYKPAPAYDPFDTCYDFAGQNAIFMPLVSFKFSDGS 399

Query: 420 EMTVDVTGIVYFVKSDASQV-CLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAG 478
              +   G++ F    A    CLA           I+GN QQ+N  +IYD    ++GF  
Sbjct: 400 SFDLSPFGVLIFPDDTAPATGCLAFVPRPSTMPFTIVGNTQQRNTEMIYDVAAEKIGFVS 459

Query: 479 EDC 481
             C
Sbjct: 460 GSC 462


>gi|255564685|ref|XP_002523337.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223537425|gb|EEF39053.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 469

 Score =  222 bits (565), Expect = 3e-55,   Method: Compositional matrix adjust.
 Identities = 147/368 (39%), Positives = 202/368 (54%), Gaps = 26/368 (7%)

Query: 126 SGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSY 183
           SG+   +  Y   I +G   R + +++DTGSD+ W+QC PCK CY Q DPVFDP  S S+
Sbjct: 117 SGLAQGSGEYFTRIGVGTPPRYVYMVLDTGSDIVWIQCAPCKRCYAQSDPVFDPRKSRSF 176

Query: 184 KKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVN 243
             + C S  CH L+     S  C++     C Y VSYGDGS+T G+   E L   +  V 
Sbjct: 177 ASIACRSPLCHRLD-----SPGCNTQKQ-TCMYQVSYGDGSFTFGDFSTETLTFRRTRVA 230

Query: 244 DFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLIL 303
               GCG +N+GLF G +GL+GLGR  LS  SQT   F   FSYCL     +    S++ 
Sbjct: 231 RVALGCGHDNEGLFVGAAGLLGLGRGRLSFPSQTGRRFNHKFSYCLVDRSASSKPSSMVF 290

Query: 304 GGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQ---LQASGF-----AKGGI 355
            G+S+V + +    +T ++ NP+L TFY + L GIS+GG +   + AS F       GG+
Sbjct: 291 -GDSAVSRTA---RFTPLVSNPKLDTFYYVELLGISVGGTRVPGITASLFKLDQTGNGGV 346

Query: 356 LIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEF 415
           +IDSGT +TRL    Y A +  F    S    AP FS+ DTCF+LS   EV +P V + F
Sbjct: 347 IIDSGTSVTRLTRPAYIAFRDAFRAGASNLKRAPQFSLFDTCFDLSGKTEVKVPTVVLHF 406

Query: 416 EGNAEMTVDVTGIVYFVKSDAS-QVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQL 474
            G A++++  +   Y +  D S   CLA A         IIGN QQ+  RV+YD   S++
Sbjct: 407 RG-ADVSLPASN--YLIPVDTSGNFCLAFAGT--MGGLSIIGNIQQQGFRVVYDLAGSRV 461

Query: 475 GFAGEDCS 482
           GFA   C+
Sbjct: 462 GFAPHGCA 469


>gi|356575389|ref|XP_003555824.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 473

 Score =  221 bits (564), Expect = 5e-55,   Method: Compositional matrix adjust.
 Identities = 148/450 (32%), Positives = 224/450 (49%), Gaps = 48/450 (10%)

Query: 55  VSHQKSRIEMGAIT-----LELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRI--- 106
           + HQK  I   A +     L+L H++    K+  +N    +R    N  +Q    R+   
Sbjct: 49  LQHQKLNIATEASSPAKYKLKLVHRD----KVPTFNTSHDHRTRF-NARMQRDTKRVAAL 103

Query: 107 -KNMISGN---IKDVSNTEIPLTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQ 160
            +++ +G     ++   +++   SG+   +  Y   I +G   RN  V++D+GSD+ WVQ
Sbjct: 104 RRHLAAGKPTYAEEAFGSDV--VSGMEQGSGEYFVRIGVGSPPRNQYVVIDSGSDIIWVQ 161

Query: 161 CQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSY 220
           C+PC  CY+Q DPVF+P+ S SY  V C S+ C  ++ A  + G         C Y VSY
Sbjct: 162 CEPCTQCYHQSDPVFNPADSSSYAGVSCASTVCSHVDNAGCHEG--------RCRYEVSY 213

Query: 221 GDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEI 280
           GDGSYT+G L  E L  G+  + +   GCG +N+G+F G +GL+GLG   +S V Q    
Sbjct: 214 GDGSYTKGTLALETLTFGRTLIRNVAIGCGHHNQGMFVGAAGLLGLGSGPMSFVGQLGGQ 273

Query: 281 FGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPI--TYTNMIPNPQLATFYILNLTG- 337
            GG FSYCL S +   +SG L  G      + + P+   +  +I NP+  +FY + L+G 
Sbjct: 274 AGGTFSYCLVS-RGIQSSGLLQFG------REAVPVGAAWVPLIHNPRAQSFYYVGLSGL 326

Query: 338 ------ISIGGKQLQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGF 391
                 + I     + S    GG+++D+GT +TRLP + Y A +  F+ Q +  P A G 
Sbjct: 327 GVGGLRVPISEDVFKLSELGDGGVVMDTGTAVTRLPTAAYEAFRDAFIAQTTNLPRASGV 386

Query: 392 SILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDE 451
           SI DTC++L  +  V +P V   F G   +T+     +  V  D    C A A  S    
Sbjct: 387 SIFDTCYDLFGFVSVRVPTVSFYFSGGPILTLPARNFLIPVD-DVGSFCFAFAPSS--SG 443

Query: 452 TGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
             IIGN QQ+   +  D  N  +GF    C
Sbjct: 444 LSIIGNIQQEGIEISVDGANGFVGFGPNVC 473


>gi|224130548|ref|XP_002320868.1| predicted protein [Populus trichocarpa]
 gi|222861641|gb|EEE99183.1| predicted protein [Populus trichocarpa]
          Length = 488

 Score =  221 bits (563), Expect = 7e-55,   Method: Compositional matrix adjust.
 Identities = 158/431 (36%), Positives = 229/431 (53%), Gaps = 33/431 (7%)

Query: 67  ITLELKHKNYCSGKIVDWNEQQ--QNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIP- 123
           ++++L H +  S    D + Q    +RL+ D   V+ L S    +   N+          
Sbjct: 76  LSVQLHHIDALSS---DKSSQDLFNSRLVRDAARVKSLISLAATVGGTNLTRARGPGFSS 132

Query: 124 -LTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSIS 180
            + SG+   +  Y   + +G   R + +++DTGSD+ W+QC PC  CY+Q DPVFDP+ S
Sbjct: 133 SVISGLAQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWIQCAPCIKCYSQTDPVFDPTKS 192

Query: 181 PSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA 240
            S+  + C S  C  L++       CS+     C Y VSYGDGS+T GE   E L     
Sbjct: 193 RSFANIPCGSPLCRRLDYPG-----CSTKKQ-ICLYQVSYGDGSFTVGEFSTETLTFRGT 246

Query: 241 SVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGS 300
            V   + GCG +N+GLF G +GL+GLGR  LS  SQ    F   FSYCL   + A +  S
Sbjct: 247 RVGRVVLGCGHDNEGLFVGAAGLLGLGRGRLSFPSQIGRRFNSKFSYCL-GDRSASSRPS 305

Query: 301 LILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQ---LQASGF-----AK 352
            I+ G+S++ + +    +T ++ NP+L TFY + L GIS+GG +   + AS F       
Sbjct: 306 SIVFGDSAISRTT---RFTPLLSNPKLDTFYYVELLGISVGGTRVSGISASLFKLDSTGN 362

Query: 353 GGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVK 412
           GG++IDSGT +TRL  + Y AL+  FL   S    AP FS+ DTCF+LS   EV +P V 
Sbjct: 363 GGVIIDSGTSVTRLTRAAYVALRDAFLVGASNLKRAPEFSLFDTCFDLSGKTEVKVPTVV 422

Query: 413 MEFEGNAEMTVDVTGIVYFVKSDAS-QVCLALASLSYEDETGIIGNYQQKNQRVIYDTKN 471
           + F G A++ +  +   Y +  D S   C A A  +      IIGN QQ+  RV+YD   
Sbjct: 423 LHFRG-ADVPLPASN--YLIPVDNSGSFCFAFAGTA--SGLSIIGNIQQQGFRVVYDLAT 477

Query: 472 SQLGFAGEDCS 482
           S++GFA   C+
Sbjct: 478 SRVGFAPRGCA 488


>gi|195638734|gb|ACG38835.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
          Length = 465

 Score =  221 bits (563), Expect = 7e-55,   Method: Compositional matrix adjust.
 Identities = 148/372 (39%), Positives = 204/372 (54%), Gaps = 19/372 (5%)

Query: 116 DVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCK-SCYNQQD 172
           D S   +PL  G  +   NY+  + LG   ++  ++VDTGS LTW+QC PC  SC+ Q  
Sbjct: 108 DESLASVPLGPGTSVGVGNYVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVSCHRQSG 167

Query: 173 PVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGR 232
           PVF+P  S SY  V C++  C  L  AT N   CS+S+   C Y  SYGD S++ G L +
Sbjct: 168 PVFNPKASSSYASVSCSAQQCSDLTTATLNPASCSTSN--VCIYQASYGDSSFSVGYLSK 225

Query: 233 EHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPST 292
           + +  G  SV +F +GCG++N+GLFG  +GL+GL R+ LSL+ Q +   G  FSYCLP++
Sbjct: 226 DTVSFGSTSVPNFYYGCGQDNEGLFGQSAGLIGLARNKLSLLYQLAPSMGYSFSYCLPTS 285

Query: 293 QDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAK 352
             + +    I   N   +      +YT M  +    + Y + +TGI + GK L  S  A 
Sbjct: 286 SSSSSGYLSIGSYNPGQY------SYTPMASSSLDDSLYFIKMTGIKVAGKPLSVSSSAY 339

Query: 353 GGI--LIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPL 410
             +  +IDSGTVITRLP  +YSAL         G P A  FSILDTCF   A   + +P 
Sbjct: 340 SSLPTIIDSGTVITRLPTGVYSALSKAVAGAMKGTPRASAFSILDTCFQGQA-ARLRVPE 398

Query: 411 VKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTK 470
           V M F G A + +    ++  V  D++  CLA A         IIGN QQ+   V+YD K
Sbjct: 399 VTMAFAGGAALKLAARNLL--VDVDSATTCLAFAP---ARSAAIIGNTQQQTFSVVYDVK 453

Query: 471 NSQLGFAGEDCS 482
           NS++GFA   CS
Sbjct: 454 NSKIGFAAAGCS 465


>gi|45735845|dbj|BAD12880.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
 gi|45735971|dbj|BAD13000.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
          Length = 333

 Score =  220 bits (561), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 137/338 (40%), Positives = 188/338 (55%), Gaps = 19/338 (5%)

Query: 148 VIVDTGSDLTWVQCQPC-KSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVC 206
           ++VDTGS LTW+QC PC  SC+ Q  PVF+P  S +Y  V C++  C  L  AT N   C
Sbjct: 12  MVVDTGSSLTWLQCSPCLVSCHRQSGPVFNPKSSSTYASVGCSAQQCSDLPSATLNPSAC 71

Query: 207 SSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGL 266
           SSS+   C Y  SYGD S++ G L ++ +  G  S+ +F +GCG++N+GLFG  +GL+GL
Sbjct: 72  SSSN--VCIYQASYGDSSFSVGYLSKDTVSFGSTSLPNFYYGCGQDNEGLFGRSAGLIGL 129

Query: 267 GRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQ 326
            R+ LSL+ Q +   G  F+YCLPS+  +G          S    N    +YT M+ +  
Sbjct: 130 ARNKLSLLYQLAPSLGYSFTYCLPSSSSSGYL--------SLGSYNPGQYSYTPMVSSSL 181

Query: 327 LATFYILNLTGISIGGKQLQASGFAKGGI--LIDSGTVITRLPPSIYSALKAEFLKQFSG 384
             + Y + L+G+++ G  L  S  A   +  +IDSGTVITRLP S+YSAL         G
Sbjct: 182 DDSLYFIKLSGMTVAGNPLSVSSSAYSSLPTIIDSGTVITRLPTSVYSALSKAVAAAMKG 241

Query: 385 FPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALA 444
              A  +SILDTCF   A   V+ P V M F G A + +    ++  V  D S  CLA A
Sbjct: 242 TSRASAYSILDTCFKGQA-SRVSAPAVTMSFAGGAALKLSAQNLL--VDVDDSTTCLAFA 298

Query: 445 SLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
                    IIGN QQ+   V+YD K+S++GFA   CS
Sbjct: 299 P---ARSAAIIGNTQQQTFSVVYDVKSSRIGFAAGGCS 333


>gi|223975883|gb|ACN32129.1| unknown [Zea mays]
 gi|223975971|gb|ACN32173.1| unknown [Zea mays]
 gi|224034191|gb|ACN36171.1| unknown [Zea mays]
 gi|413938623|gb|AFW73174.1| aspartic proteinase nepenthesin-1 isoform 1 [Zea mays]
 gi|413938624|gb|AFW73175.1| aspartic proteinase nepenthesin-1 isoform 2 [Zea mays]
          Length = 465

 Score =  220 bits (560), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 148/372 (39%), Positives = 204/372 (54%), Gaps = 19/372 (5%)

Query: 116 DVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCK-SCYNQQD 172
           D S   +PL  G  +   NY+  + LG   ++  ++VDTGS LTW+QC PC  SC+ Q  
Sbjct: 108 DESLASVPLGPGTSVGVGNYVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVSCHRQSG 167

Query: 173 PVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGR 232
           PVF+P  S SY  V C++  C  L  AT N   CS+S+   C Y  SYGD S++ G L +
Sbjct: 168 PVFNPKASSSYASVSCSAQQCSDLTTATLNPASCSTSN--VCIYQASYGDSSFSVGYLSK 225

Query: 233 EHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPST 292
           + +  G  SV +F +GCG++N+GLFG  +GL+GL R+ LSL+ Q +   G  FSYCLP++
Sbjct: 226 DTVSFGSTSVPNFYYGCGQDNEGLFGQSAGLIGLARNKLSLLYQLAPSMGYSFSYCLPTS 285

Query: 293 QDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAK 352
             + +    I   N   +      +YT M  +    + Y + +TGI + GK L  S  A 
Sbjct: 286 SSSSSGYLSIGSYNPGQY------SYTPMASSSLDDSLYFIKMTGIKVAGKPLSVSSSAY 339

Query: 353 GGI--LIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPL 410
             +  +IDSGTVITRLP  +YSAL         G P A  FSILDTCF   A   + +P 
Sbjct: 340 SSLPTIIDSGTVITRLPTGVYSALSKAVAGAMKGTPRASAFSILDTCFQGQA-ARLRVPE 398

Query: 411 VKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTK 470
           V M F G A + +    ++  V  D++  CLA A         IIGN QQ+   V+YD K
Sbjct: 399 VTMAFAGGAALKLAARNLL--VDVDSATTCLAFAP---ARSAAIIGNTQQQTFSVVYDVK 453

Query: 471 NSQLGFAGEDCS 482
           NS++GFA   CS
Sbjct: 454 NSKIGFAAGGCS 465


>gi|194699094|gb|ACF83631.1| unknown [Zea mays]
 gi|413938606|gb|AFW73157.1| hypothetical protein ZEAMMB73_333672 [Zea mays]
          Length = 452

 Score =  220 bits (560), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 149/408 (36%), Positives = 209/408 (51%), Gaps = 63/408 (15%)

Query: 90  NRLILDNLHVQYLQSRIKNMISGNIKDVSNTE-------IPLTSGIRLQTLNYIATIELG 142
           + L  D    +Y+  R+    SG    + +++       +P + G  + TLNY+ T  LG
Sbjct: 92  DTLRADQRRAEYILRRV----SGRAPQLWDSKAAAAAATVPASWGYDIGTLNYVVTASLG 147

Query: 143 --GRNMTVIVDTGSDLTWVQCQPC---KSCYNQQDPVFDPSISPSYKKVLCNSSTCHALE 197
             G   T+ VDTGSDL+WVQC+PC    SCY+Q+DP+FDP+ S SY  V C    C  L 
Sbjct: 148 TPGVAQTMEVDTGSDLSWVQCKPCSAAPSCYSQKDPLFDPAQSSSYAAVPCGGPVCAGL- 206

Query: 198 FATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLF 257
                                    G Y          G    +V  F FGCG    GLF
Sbjct: 207 -------------------------GIYAASACSAAQCG----AVQGFFFGCGHAQSGLF 237

Query: 258 GGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPIT 317
            GV GL+GLGR   SLV QT+  +GG+FSYCLP+        +L +GG S         +
Sbjct: 238 NGVDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPSTAGYLTLGVGGPSGAAPG---FS 294

Query: 318 YTNMIPNPQLATFYILNLTGISIGGKQLQ--ASGFAKGGILIDSGTVITRLPPSIYSALK 375
            T ++P+P   T+Y++ LTGIS+GG+QL   AS FA  G ++D+GTV+TRLPP+ Y+AL+
Sbjct: 295 TTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAG-GTVVDTGTVVTRLPPTAYAALR 353

Query: 376 AEFLKQFS--GFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVK 433
           + F    +  G+P+AP   ILDTC+N + Y  V +P V + F   A +T+   GI+ F  
Sbjct: 354 SAFRSGMASYGYPTAPSNGILDTCYNFAGYGTVTLPNVALTFGSGATVTLGADGILSF-- 411

Query: 434 SDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
                 CLA A    +    I+GN QQ++  V  D   + +GF    C
Sbjct: 412 -----GCLAFAPSGSDGGMAILGNVQQRSFEVRID--GTSVGFKPSSC 452


>gi|225470916|ref|XP_002263964.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
 gi|147788999|emb|CAN64659.1| hypothetical protein VITISV_009613 [Vitis vinifera]
          Length = 489

 Score =  220 bits (560), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 147/370 (39%), Positives = 207/370 (55%), Gaps = 27/370 (7%)

Query: 124 LTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISP 181
           +TSG+   +  Y   + +G   + + +++DTGSD+ W+QC PC+ CY+Q DPVFDP  S 
Sbjct: 136 VTSGLAQGSGEYFTRLGVGTPPKYVYMVLDTGSDVVWIQCAPCRKCYSQTDPVFDPKKSG 195

Query: 182 SYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS 241
           S+  + C S  C  L+     S  C+S     C Y V+YGDGS+T GE   E L      
Sbjct: 196 SFSSISCRSPLCLRLD-----SPGCNSRQ--SCLYQVAYGDGSFTFGEFSTETLTFRGTR 248

Query: 242 VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSL 301
           V     GCG +N+GLF G +GL+GLGR  LS  +QT   FG  FSYCL   + A +  S 
Sbjct: 249 VPKVALGCGHDNEGLFVGAAGLLGLGRGRLSFPTQTGLRFGRKFSYCLVD-RSASSKPSS 307

Query: 302 ILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQ---LQASGF-----AKG 353
           ++ G S+V + +    +T +I NP+L TFY L LTGIS+GG +   + AS F       G
Sbjct: 308 VVFGQSAVSRTA---VFTPLITNPKLDTFYYLELTGISVGGARVAGITASLFKLDTAGNG 364

Query: 354 GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKM 413
           G++IDSGT +TRL    Y +L+  F    +    AP +S+ DTCF+LS   EV +P V M
Sbjct: 365 GVIIDSGTSVTRLTRRAYVSLRDAFRAGAADLKRAPDYSLFDTCFDLSGKTEVKVPTVVM 424

Query: 414 EFEGNAEMTVDVTGIVYFVKSDASQV-CLALASLSYEDETGIIGNYQQKNQRVIYDTKNS 472
            F G A++++  T   Y +  D + V C A A         IIGN QQ+  RV++D   S
Sbjct: 425 HFRG-ADVSLPATN--YLIPVDTNGVFCFAFAGT--MSGLSIIGNIQQQGFRVVFDVAAS 479

Query: 473 QLGFAGEDCS 482
           ++GFA   C+
Sbjct: 480 RIGFAARGCA 489


>gi|15222611|ref|NP_173922.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|12321511|gb|AAG50814.1|AC079281_16 hypothetical protein [Arabidopsis thaliana]
 gi|20466516|gb|AAM20575.1| unknown protein [Arabidopsis thaliana]
 gi|23198172|gb|AAN15613.1| unknown protein [Arabidopsis thaliana]
 gi|110736960|dbj|BAF00436.1| hypothetical protein [Arabidopsis thaliana]
 gi|332192515|gb|AEE30636.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 483

 Score =  220 bits (560), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 171/498 (34%), Positives = 253/498 (50%), Gaps = 58/498 (11%)

Query: 9   TILSLLLP----LMVSLFLLAKGAHCFEGKKKLHLHKLQWQQKSGSSSSCVSHQKSRIEM 64
           ++ S +LP       S+  +A   H  +      L++ + Q  S SSS  +    SR+ +
Sbjct: 19  SVFSRILPETSTTTTSILNVADSIHRTKYTSSFRLNQQEEQTHSASSSFSL-QLHSRVSV 77

Query: 65  GAITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSR----IKNMISGNIKDVS-- 118
                  +H +Y S  +         RL  D   V+ L +R    I N+   ++K +S  
Sbjct: 78  RGT----EHSDYKSLTLA--------RLNRDTARVKSLITRLDLAINNISKADLKPISTM 125

Query: 119 ------NTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQ 170
                 + E PL SG    +  Y   + +G   R + +++DTGSD+ W+QC PC  CY+Q
Sbjct: 126 YTTEEQDIEAPLISGTTQGSGEYFTRVGIGKPAREVYMVLDTGSDVNWLQCTPCADCYHQ 185

Query: 171 QDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGEL 230
            +P+F+PS S SY+ + C++  C+ALE +      C +++   C Y VSYGDGSYT G+ 
Sbjct: 186 TEPIFEPSSSSSYEPLSCDTPQCNALEVSE-----CRNAT---CLYEVSYGDGSYTVGDF 237

Query: 231 GREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLP 290
             E L +G   V +   GCG +N+GLF G +GL+GLG   L+L SQ +      FSYCL 
Sbjct: 238 ATETLTIGSTLVQNVAVGCGHSNEGLFVGAAGLLGLGGGLLALPSQLNTTS---FSYCLV 294

Query: 291 STQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ--AS 348
             +D+ ++ ++  G + S      P     ++ N QL TFY L LTGIS+GG+ LQ   S
Sbjct: 295 D-RDSDSASTVDFGTSLSPDAVVAP-----LLRNHQLDTFYYLGLTGISVGGELLQIPQS 348

Query: 349 GF-----AKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAY 403
            F       GGI+IDSGT +TRL   IY++L+  F+K       A G ++ DTC+NLSA 
Sbjct: 349 SFEMDESGSGGIIIDSGTAVTRLQTEIYNSLRDSFVKGTLDLEKAAGVAMFDTCYNLSAK 408

Query: 404 QEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQ 463
             V +P V   F G   + +     +  V S     CLA A  +      IIGN QQ+  
Sbjct: 409 TTVEVPTVAFHFPGGKMLALPAKNYMIPVDS-VGTFCLAFAPTA--SSLAIIGNVQQQGT 465

Query: 464 RVIYDTKNSQLGFAGEDC 481
           RV +D  NS +GF+   C
Sbjct: 466 RVTFDLANSLIGFSSNKC 483


>gi|219886223|gb|ACL53486.1| unknown [Zea mays]
 gi|238015146|gb|ACR38608.1| unknown [Zea mays]
 gi|413938611|gb|AFW73162.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
 gi|413938612|gb|AFW73163.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
 gi|413938613|gb|AFW73164.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
 gi|413938614|gb|AFW73165.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
          Length = 467

 Score =  219 bits (559), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 146/367 (39%), Positives = 202/367 (55%), Gaps = 19/367 (5%)

Query: 121 EIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCK-SCYNQQDPVFDP 177
            +PL  G  +   NY+  + LG   ++  ++VDTGS LTW+QC PC  SC+ Q  PVF+P
Sbjct: 115 SVPLGPGTSVGVGNYVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVSCHRQSGPVFNP 174

Query: 178 SISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL 237
             S SY  V C++  C  L  AT N   CS+S+   C Y  SYGD S++ G L ++ +  
Sbjct: 175 KASSSYTSVSCSAQQCSDLTTATLNPASCSTSN--VCIYQASYGDSSFSVGYLSKDTVSF 232

Query: 238 GKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGA 297
           G  SV +F +GCG++N+GLFG  +GL+GL R+ LSL+ Q +   G  FSYCLP++  + +
Sbjct: 233 GSTSVPNFYYGCGQDNEGLFGQSAGLIGLARNKLSLLYQLAPSMGYSFSYCLPTSSSSSS 292

Query: 298 SGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGGI-- 355
               I   N   +      +YT M  +    + Y + +TGI + GK L  S  A   +  
Sbjct: 293 GYLSIGSYNPGQY------SYTPMASSSLDDSLYFIKMTGIKVAGKPLSVSSSAYSSLPT 346

Query: 356 LIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEF 415
           +IDSGTVITRLP  +YSAL         G P A  FSILDTCF   A   + +P V M F
Sbjct: 347 IIDSGTVITRLPTGVYSALSKAVAGAMKGTPRASAFSILDTCFQGQA-ARLRVPEVTMAF 405

Query: 416 EGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLG 475
            G A + +    ++  V  D++  CLA A         IIGN QQ+   V+YD KNS++G
Sbjct: 406 AGGAALKLAARNLL--VDVDSATTCLAFAP---ARSAAIIGNTQQQTFSVVYDVKNSKIG 460

Query: 476 FAGEDCS 482
           FA   CS
Sbjct: 461 FAAGGCS 467


>gi|116787333|gb|ABK24467.1| unknown [Picea sitchensis]
          Length = 497

 Score =  219 bits (558), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 155/449 (34%), Positives = 230/449 (51%), Gaps = 47/449 (10%)

Query: 63  EMGAITLELKHKNYCSGKIVDW--NEQQQNRLILDNLHVQYLQSRIKNMISGNIK----- 115
           E  +I L++ H++  S         E  Q RL  D   V  + +R++    G  K     
Sbjct: 64  EKNSIVLQVVHRDSLSSSSNTSLVKEILQERLKRDAARVDSINARVQLAAMGVSKAEMKP 123

Query: 116 ----------DVSNTEIPLTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQP 163
                     D  +    + SG+   +  Y   + +G   R   +++DTGSD+ W+QC P
Sbjct: 124 LNGSSIDARFDAKDFSSSIISGLAQGSGEYFTRLGVGTPPRYTYMVLDTGSDIMWIQCLP 183

Query: 164 CKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDG 223
           C  CY Q DP+F+P+ S +Y+KV C +  C  L+       +    +   C Y VSYGDG
Sbjct: 184 CAKCYGQTDPLFNPAASSTYRKVPCATPLCKKLD-------ISGCRNKRYCEYQVSYGDG 236

Query: 224 SYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGG 283
           S+T G+   E L      +     GCG +N+GLF G +GL+GLGR  LS  SQT   F  
Sbjct: 237 SFTVGDFSTETLTFRGQVIRRVALGCGHDNEGLFIGAAGLLGLGRGSLSFPSQTGAQFSK 296

Query: 284 LFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGK 343
            FSYCL     +G + SLI  G +++ K++    +T ++ NP+L TFY + L GIS+GG+
Sbjct: 297 RFSYCLVDRSASGTASSLIF-GKAAIPKSA---IFTPLLSNPKLDTFYYVELVGISVGGR 352

Query: 344 QLQ---ASGF-----AKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILD 395
           +L    AS F       GG++IDSGT +TRL  S YS ++  F        SA GFS+ D
Sbjct: 353 RLTSIPASVFRMDATGNGGVIIDSGTSVTRLVDSAYSTMRDAFRVGTGNLKSAGGFSLFD 412

Query: 396 TCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETG-- 453
           TC++LS  + V +P +   F+G A +++  T  +  V S A+  C A A       TG  
Sbjct: 413 TCYDLSGLKTVKVPTLVFHFQGGAHISLPATNYLIPVDSSAT-FCFAFAG-----NTGGL 466

Query: 454 -IIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
            IIGN QQ+  RV++D+  +++GF    C
Sbjct: 467 SIIGNIQQQGYRVVFDSLANRVGFKAGSC 495


>gi|168008816|ref|XP_001757102.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162691600|gb|EDQ77961.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 406

 Score =  219 bits (558), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 145/414 (35%), Positives = 217/414 (52%), Gaps = 41/414 (9%)

Query: 95  DNLHVQYLQSRIKNMISGNIKDVS----------NTEIPLTSGIRLQTLNYIATIELGG- 143
           DNL V  +  RI   ++G  +  S          + + P+ SG+ L +  Y   I +G  
Sbjct: 8   DNLRVASIHGRINQTVNGLTRSRSRDRQTKVPSQDFQAPVVSGLSLGSGEYFIRISVGTP 67

Query: 144 -RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGN 202
            R M +++DTGSD+ W+QC PC +CY+Q D +FDP  S +Y  + C++  C  L+     
Sbjct: 68  PRRMYLVMDTGSDILWLQCAPCVNCYHQSDAIFDPYKSSTYSTLGCSTRQCLNLDI---- 123

Query: 203 SGVCSSSSPPDCNYFVSYGDGSYTRGELGREHL------GLGKASVNDFIFGCGRNNKGL 256
            G C ++    C Y V YGDGS+T GE G + +      G+G+  +N    GCG +N+G 
Sbjct: 124 -GTCQANK---CLYQVDYGDGSFTTGEFGTDDVSLNSTSGVGQVVLNKIPLGCGHDNEGY 179

Query: 257 FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPI 316
           F G +GL+GLG+  LS  +Q     GG FSYCL   +     GS ++ G ++V       
Sbjct: 180 FVGAAGLLGLGKGPLSFPNQVDPQNGGRFSYCLTDRETDSTEGSSLVFGEAAVPPAGA-- 237

Query: 317 TYTNMIPNPQLATFYILNLTGISIGGKQL-------QASGFAKGGILIDSGTVITRLPPS 369
            +T    N ++ TFY L +TGIS+GG  L       Q      GG++IDSGT +TRL  +
Sbjct: 238 RFTPQDSNMRVPTFYYLKMTGISVGGTILTIPTSAFQLDSLGNGGVIIDSGTSVTRLQNA 297

Query: 370 IYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIV 429
            Y++L+  F    S      GFS+ DTC++LS    V++P V + F+G  ++ +  +   
Sbjct: 298 AYASLRDAFRAGTSDLAPTAGFSLFDTCYDLSGLASVDVPTVTLHFQGGTDLKLPASN-- 355

Query: 430 YFVKSDASQV-CLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
           Y +  D S   CLA A  +      IIGN QQ+  RVIYD  ++Q+GF    C+
Sbjct: 356 YLIPVDNSNTFCLAFAGTT---GPSIIGNIQQQGFRVIYDNLHNQVGFVPSQCN 406


>gi|449432044|ref|XP_004133810.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 471

 Score =  219 bits (557), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 147/368 (39%), Positives = 206/368 (55%), Gaps = 27/368 (7%)

Query: 126 SGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSY 183
           SG+   +  Y   I +G   + + +++DTGSD+ W+QC PCK+CY+Q DPVF+P  S S+
Sbjct: 120 SGLAQGSGEYFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKSGSF 179

Query: 184 KKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVN 243
            KVLC +  C  LE     S  C+      C Y VSYGDGSYT GE   E L   +  V 
Sbjct: 180 AKVLCRTPLCRRLE-----SPGCNQRQ--TCLYQVSYGDGSYTTGEFVTETLTFRRTKVE 232

Query: 244 DFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLIL 303
               GCG +N+GLF G +GL+GLGR  LS  SQ    F   FSYCL   + A +  S ++
Sbjct: 233 QVALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQAGRTFNQKFSYCL-VDRSASSKPSSVV 291

Query: 304 GGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQ---LQASGF-----AKGGI 355
            GNS+V + +    +T ++ NP+L TFY + L GIS+GG     + AS F       GG+
Sbjct: 292 FGNSAVSRTA---RFTPLLTNPRLDTFYYVELLGISVGGTPVSGITASHFKLDRTGNGGV 348

Query: 356 LIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEF 415
           +ID GT +TRL    Y AL+  F    S   SAP FS+ DTC++LS    V +P V + F
Sbjct: 349 IIDCGTSVTRLNKPAYIALRDAFRAGASSLKSAPEFSLFDTCYDLSGKTTVKVPTVVLHF 408

Query: 416 EGNAEMTVDVTGIVYFVKSDAS-QVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQL 474
            G A++++  +   Y +  D S + C A A  +      IIGN QQ+  RV+YD  +S++
Sbjct: 409 RG-ADVSLPASN--YLIPVDGSGRFCFAFAGTT--SGLSIIGNIQQQGFRVVYDLASSRV 463

Query: 475 GFAGEDCS 482
           GF+   C+
Sbjct: 464 GFSPRGCA 471


>gi|293336306|ref|NP_001168599.1| uncharacterized protein LOC100382383 [Zea mays]
 gi|223949441|gb|ACN28804.1| unknown [Zea mays]
          Length = 326

 Score =  219 bits (557), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 137/344 (39%), Positives = 191/344 (55%), Gaps = 28/344 (8%)

Query: 148 VIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCS 207
           +++DTGSD+TWVQCQPC  CY Q DPVFDPS+S SY  V C+S  C  L+ A   +   +
Sbjct: 1   MVLDTGSDVTWVQCQPCADCYQQSDPVFDPSLSASYAAVSCDSQRCRDLDTAACRNATGA 60

Query: 208 SSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS-VNDFIFGCGRNNKGLFGGVSGLMGL 266
                 C Y V+YGDGSYT G+   E L LG ++ V +   GCG +N+GLF G +GL+ L
Sbjct: 61  ------CLYEVAYGDGSYTVGDFATETLTLGDSTPVGNVAIGCGHDNEGLFVGAAGLLAL 114

Query: 267 GRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQ 326
           G   LS  SQ   I    FSYCL   +D+ A+ +L  G  ++     T      ++ +P+
Sbjct: 115 GGGPLSFPSQ---ISASTFSYCL-VDRDSPAASTLQFGDGAAEAGTVT----APLVRSPR 166

Query: 327 LATFYILNLTGISIGGKQLQ--ASGFA------KGGILIDSGTVITRLPPSIYSALKAEF 378
            +TFY + L+GIS+GG+ L   AS FA       GG+++DSGT +TRL  + Y+AL+  F
Sbjct: 167 TSTFYYVALSGISVGGQPLSIPASAFAMDATSGSGGVIVDSGTAVTRLQSAAYAALRDAF 226

Query: 379 LKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSD-AS 437
           ++     P   G S+ DTC++LS    V +P V + FEG   + +      Y +  D A 
Sbjct: 227 VQGAPSLPRTSGVSLFDTCYDLSDRTSVEVPAVSLRFEGGGALRLPAKN--YLIPVDGAG 284

Query: 438 QVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
             CLA A  +      IIGN QQ+  RV +DT    +GF    C
Sbjct: 285 TYCLAFAPTNA--AVSIIGNVQQQGTRVSFDTARGAVGFTPNKC 326


>gi|297848386|ref|XP_002892074.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297337916|gb|EFH68333.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 485

 Score =  218 bits (556), Expect = 5e-54,   Method: Compositional matrix adjust.
 Identities = 153/430 (35%), Positives = 227/430 (52%), Gaps = 28/430 (6%)

Query: 66  AITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNT---EI 122
           +ITL L H +  S       E   +RL  D+  V+ + +    +   N+     T     
Sbjct: 71  SITLNLDHIDALSSNKTP-QELFSSRLQRDSRRVKSIATLAAQIPGRNVTHAPRTGGFSS 129

Query: 123 PLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSIS 180
            + SG+   +  Y   + +G   R + +++DTGSD+ W+QC PC+ CY+Q DP+FDP  S
Sbjct: 130 SVVSGLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKS 189

Query: 181 PSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA 240
            +Y  + C+S  C  L+ A  N+   +      C Y VSYGDGS+T G+   E L   + 
Sbjct: 190 KTYATIPCSSPHCRRLDSAGCNTRRKT------CLYQVSYGDGSFTVGDFSTETLTFRRN 243

Query: 241 SVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGS 300
            V     GCG +N+GLF G +GL+GLG+  LS   QT   F   FSYCL   + A +  S
Sbjct: 244 RVKGVALGCGHDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCL-VDRSASSKPS 302

Query: 301 LILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQ---LQASGF-----AK 352
            ++ GN++V   S    +T ++ NP+L TFY + L GIS+GG +   + AS F       
Sbjct: 303 SVVFGNAAV---SRIARFTPLLSNPKLDTFYYVELLGISVGGTRVPGVAASLFKLDQIGN 359

Query: 353 GGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVK 412
           GG++IDSGT +TRL    Y A++  F         AP FS+ DTCF+LS   EV +P V 
Sbjct: 360 GGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKALKRAPDFSLFDTCFDLSNMNEVKVPTVV 419

Query: 413 MEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNS 472
           + F G A++++  T  +  V ++  + C A A         IIGN QQ+  RV+YD  +S
Sbjct: 420 LHFRG-ADVSLPATNYLIPVDTNG-KFCFAFAGT--MGGLSIIGNIQQQGFRVVYDLASS 475

Query: 473 QLGFAGEDCS 482
           ++GFA   C+
Sbjct: 476 RVGFAPGGCA 485


>gi|212275300|ref|NP_001130675.1| uncharacterized protein LOC100191778 precursor [Zea mays]
 gi|194706308|gb|ACF87238.1| unknown [Zea mays]
          Length = 467

 Score =  218 bits (555), Expect = 5e-54,   Method: Compositional matrix adjust.
 Identities = 145/367 (39%), Positives = 202/367 (55%), Gaps = 19/367 (5%)

Query: 121 EIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCK-SCYNQQDPVFDP 177
            +PL  G  +   NY+  + LG   ++  ++VDTGS LTW+QC PC  SC+ Q  PVF+P
Sbjct: 115 SVPLGPGTSVGVGNYVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVSCHRQSGPVFNP 174

Query: 178 SISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL 237
             S SY  V C++  C  L  AT +   CS+S+   C Y  SYGD S++ G L ++ +  
Sbjct: 175 KASSSYTSVSCSAQQCSDLTTATLSPASCSTSN--VCIYQASYGDSSFSVGYLSKDTVSF 232

Query: 238 GKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGA 297
           G  SV +F +GCG++N+GLFG  +GL+GL R+ LSL+ Q +   G  FSYCLP++  + +
Sbjct: 233 GSTSVPNFYYGCGQDNEGLFGQSAGLIGLARNKLSLLYQLAPSMGYSFSYCLPTSSSSSS 292

Query: 298 SGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGGI-- 355
               I   N   +      +YT M  +    + Y + +TGI + GK L  S  A   +  
Sbjct: 293 GYLSIGSYNPGQY------SYTPMASSSLDDSLYFIKMTGIKVAGKPLSVSSSAYSSLPT 346

Query: 356 LIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEF 415
           +IDSGTVITRLP  +YSAL         G P A  FSILDTCF   A   + +P V M F
Sbjct: 347 IIDSGTVITRLPTGVYSALSKAVAGAMKGTPRASAFSILDTCFQGQA-ARLRVPEVTMAF 405

Query: 416 EGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLG 475
            G A + +    ++  V  D++  CLA A         IIGN QQ+   V+YD KNS++G
Sbjct: 406 AGGAALKLAARNLL--VDVDSATTCLAFAP---ARSAAIIGNTQQQTFSVVYDVKNSKIG 460

Query: 476 FAGEDCS 482
           FA   CS
Sbjct: 461 FAAGGCS 467


>gi|449529638|ref|XP_004171805.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like, partial
           [Cucumis sativus]
          Length = 384

 Score =  218 bits (554), Expect = 7e-54,   Method: Compositional matrix adjust.
 Identities = 147/370 (39%), Positives = 207/370 (55%), Gaps = 27/370 (7%)

Query: 124 LTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISP 181
           + SG+   +  Y   I +G   + + +++DTGSD+ W+QC PCK+CY+Q DPVF+P  S 
Sbjct: 31  VISGLAQGSGEYFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKSG 90

Query: 182 SYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS 241
           S+ KVLC +  C  LE     S  C+      C Y VSYGDGSYT GE   E L   +  
Sbjct: 91  SFAKVLCRTPLCRRLE-----SPGCNQRQ--TCLYQVSYGDGSYTTGEFVTETLTFRRTK 143

Query: 242 VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSL 301
           V     GCG +N+GLF G +GL+GLGR  LS  SQ    F   FSYCL   + A +  S 
Sbjct: 144 VEQVALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQAGRTFNQKFSYCL-VDRSASSKPSS 202

Query: 302 ILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQ---LQASGF-----AKG 353
           ++ GNS+V + +    +T ++ NP+L TFY + L GIS+GG     + AS F       G
Sbjct: 203 VVFGNSAVSRTA---RFTPLLTNPRLDTFYYVELLGISVGGTPVSGITASHFKLDRTGNG 259

Query: 354 GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKM 413
           G++ID GT +TRL    Y AL+  F    S   SAP FS+ DTC++LS    V +P V +
Sbjct: 260 GVIIDCGTSVTRLNKPAYIALRDAFRAGASSLKSAPEFSLFDTCYDLSGKTTVKVPTVVL 319

Query: 414 EFEGNAEMTVDVTGIVYFVKSDAS-QVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNS 472
            F G A++++  +   Y +  D S + C A A  +      IIGN QQ+  RV+YD  +S
Sbjct: 320 HFRG-ADVSLPASN--YLIPVDGSGRFCFAFAGTT--SGLSIIGNIQQQGFRVVYDLASS 374

Query: 473 QLGFAGEDCS 482
           ++GF+   C+
Sbjct: 375 RVGFSPRGCA 384


>gi|297845610|ref|XP_002890686.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297336528|gb|EFH66945.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 486

 Score =  218 bits (554), Expect = 7e-54,   Method: Compositional matrix adjust.
 Identities = 171/498 (34%), Positives = 252/498 (50%), Gaps = 59/498 (11%)

Query: 10  ILSLLLP----LMVSLFLLAKGAHCFEGKKKLHLHKLQWQQKSGSSSSCVSHQKSRIEMG 65
           + S +LP       S+  +A   H  +      L++ + Q  S SSS  +    SR+ + 
Sbjct: 22  VFSRILPKTSVTTTSILNVADSIHRTKYTSSFRLNQQEEQTHSRSSSFSL-QLHSRVSVR 80

Query: 66  AITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSR----IKNMISGNIKDVS--- 118
                 +H +Y S  +         RL  D   V+ L +R    I N+   ++K V+   
Sbjct: 81  GT----EHSDYKSLTLA--------RLNRDTARVKSLITRLDLAINNISKADLKPVTTMY 128

Query: 119 ------NTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQ 170
                 + E PL SG    +  Y   + +G   R + +++DTGSD+ W+QC PC  CY+Q
Sbjct: 129 TTTEEEDIEAPLISGTTQGSGEYFTRVGIGNPAREVYMVLDTGSDVNWLQCTPCADCYHQ 188

Query: 171 QDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGEL 230
            +P+F+PS S SY+ + C++  C+ALE +      C +++   C Y VSYGDGSYT G+ 
Sbjct: 189 TEPIFEPSSSSSYEPLSCDTPQCNALEVSE-----CRNAT---CLYEVSYGDGSYTVGDF 240

Query: 231 GREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLP 290
             E L +G   V +   GCG +N+GLF G +GL+GLG   L+L SQ +      FSYCL 
Sbjct: 241 ATETLTIGSTLVQNVAVGCGHSNEGLFVGAAGLLGLGGGLLALPSQLNTTS---FSYCLV 297

Query: 291 STQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ--AS 348
             +D+ ++ ++  G +        P     ++ N QL TFY L LTGIS+GG+ LQ   S
Sbjct: 298 D-RDSDSASTVEFGTSLPPDAVVAP-----LLRNHQLDTFYYLGLTGISVGGELLQIPQS 351

Query: 349 GF-----AKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAY 403
            F       GGI+IDSGT +TRL   IY++L+  FLK  S    A G ++ DTC+NLSA 
Sbjct: 352 SFEMDESGSGGIIIDSGTAVTRLQTGIYNSLRDSFLKGTSDLEKAAGVAMFDTCYNLSAK 411

Query: 404 QEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQ 463
             + +P V   F G   + +     +  V S     CLA A  +      IIGN QQ+  
Sbjct: 412 TTIEVPTVAFHFPGGKMLALPAKNYMIPVDS-VGTFCLAFAPTA--SSLAIIGNVQQQGT 468

Query: 464 RVIYDTKNSQLGFAGEDC 481
           RV +D  NS +GF+   C
Sbjct: 469 RVTFDLANSLIGFSSNKC 486


>gi|312282359|dbj|BAJ34045.1| unnamed protein product [Thellungiella halophila]
          Length = 484

 Score =  217 bits (552), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 153/409 (37%), Positives = 212/409 (51%), Gaps = 29/409 (7%)

Query: 91  RLILDNLHVQYLQSRI-----KNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--G 143
           RL  D+L V+ L S       +N+     +        + SG+   +  Y   + +G   
Sbjct: 87  RLQRDSLRVESLTSLAAVSAGRNVTKRPPRSAGGFSGVVISGLSQGSGEYFMRLGVGTPA 146

Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNS 203
            NM +++DTGSD+ W+QC PCK CYNQ DPVF+P+ S ++  V C S  C  L+    +S
Sbjct: 147 TNMYMVLDTGSDVVWLQCSPCKVCYNQSDPVFNPAKSKTFATVPCGSRLCRRLD----DS 202

Query: 204 GVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGL 263
             C S     C Y VSYGDGS+T G+   E L    A V+    GCG +N+GLF G +GL
Sbjct: 203 SECVSRRSKACLYQVSYGDGSFTVGDFSTETLTFHGARVDHVALGCGHDNEGLFVGAAGL 262

Query: 264 MGLGRSDLSLVSQTSEIFGGLFSYCL---PSTQDAGASGSLILGGNSSVFKNSTPITYTN 320
           +GLGR  LS  SQT   + G FSYCL    S+  +    S I+ GN +V K +    +T 
Sbjct: 263 LGLGRGGLSFPSQTKNRYNGKFSYCLVDRTSSGSSSKPPSTIVFGNGAVPKTA---VFTP 319

Query: 321 MIPNPQLATFYILNLTGISIGG--------KQLQASGFAKGGILIDSGTVITRLPPSIYS 372
           ++ NP+L TFY L L GIS+GG         Q +      GG++IDSGT +TRL  S Y 
Sbjct: 320 LLTNPKLDTFYYLQLLGISVGGSRVPGVSESQFKLDATGNGGVIIDSGTSVTRLTQSAYV 379

Query: 373 ALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFV 432
           AL+  F    +    AP +S+ DTCF+LS    V +P V   F G  E+++  +  +  V
Sbjct: 380 ALRDAFRLGATRLKRAPSYSLFDTCFDLSGMTTVKVPTVVFHFTG-GEVSLPASNYLIPV 438

Query: 433 KSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
            +   + C A A         IIGN QQ+  RV YD   S++GF    C
Sbjct: 439 NNQG-RFCFAFAGT--MGSLSIIGNIQQQGFRVAYDLVGSRVGFLSRAC 484


>gi|18461217|dbj|BAB84414.1| chloroplast nucleoid DNA-binding protein cnd41-like [Oryza sativa
           Japonica Group]
          Length = 446

 Score =  217 bits (552), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 122/383 (31%), Positives = 202/383 (52%), Gaps = 32/383 (8%)

Query: 123 PLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSIS 180
           P+ SGI  ++  Y A + +G       +++DTGSDL W+QC PC+ CY Q+  VFDP  S
Sbjct: 74  PVFSGIPFESGEYFALVGVGTPSTKAMLVIDTGSDLVWLQCSPCRRCYAQRGQVFDPRRS 133

Query: 181 PSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA 240
            +Y++V C+S  C AL F   +SG  +      C Y V+YGDGS + G+L  + L     
Sbjct: 134 STYRRVPCSSPQCRALRFPGCDSGGAAGGG---CRYMVAYGDGSSSTGDLATDKLAFAND 190

Query: 241 S-VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASG 299
           + VN+   GCGR+N+GLF   +GL+G+GR  +S+ +Q +  +G +F YCL          
Sbjct: 191 TYVNNVTLGCGRDNEGLFDSAAGLLGVGRGKISISTQVAPAYGSVFEYCLGDRTSRSTRS 250

Query: 300 SLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA-------- 351
           S ++ G +          +T ++ NP+  + Y +++ G S+GG+++  +GF+        
Sbjct: 251 SYLVFGRT---PEPPSTAFTALLSNPRRPSLYYVDMAGFSVGGERV--TGFSNASLALDT 305

Query: 352 ---KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPG---FSILDTCFNLSAYQE 405
              +GG+++DSGT I+R     Y+AL+  F  +             S+ D C++L     
Sbjct: 306 ATGRGGVVVDSGTAISRFARDAYAALRDAFDARARAAGMRRLAGEHSVFDACYDLRGRPA 365

Query: 406 VNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLA-----LASLSYEDETGIIGNYQQ 460
            + PL+ + F G A+M +      YF+  D  +   A     L   + +D   +IGN QQ
Sbjct: 366 ASAPLIVLHFAGGADMALPPEN--YFLPVDGGRRRAASYRRCLGFEAADDGLSVIGNVQQ 423

Query: 461 KNQRVIYDTKNSQLGFAGEDCSS 483
           +  RV++D +  ++GFA + C+S
Sbjct: 424 QGFRVVFDVEKERIGFAPKGCTS 446


>gi|15223368|ref|NP_171637.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|9665144|gb|AAF97328.1|AC023628_9 Unknown protein [Arabidopsis thaliana]
 gi|22135930|gb|AAM91547.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
           thaliana]
 gi|30387595|gb|AAP31963.1| At1g01300 [Arabidopsis thaliana]
 gi|332189147|gb|AEE27268.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 485

 Score =  216 bits (551), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 156/433 (36%), Positives = 231/433 (53%), Gaps = 36/433 (8%)

Query: 67  ITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIP--- 123
           ITL L H +  S      +E   +RL  D+  V+ + + +   I G  ++V++   P   
Sbjct: 72  ITLNLDHIDALSSNKTP-DELFSSRLQRDSRRVKSIAT-LAAQIPG--RNVTHAPRPGGF 127

Query: 124 ---LTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPS 178
              + SG+   +  Y   + +G   R + +++DTGSD+ W+QC PC+ CY+Q DP+FDP 
Sbjct: 128 SSSVVSGLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPR 187

Query: 179 ISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG 238
            S +Y  + C+S  C  L+ A  N+          C Y VSYGDGS+T G+   E L   
Sbjct: 188 KSKTYATIPCSSPHCRRLDSAGCNT------RRKTCLYQVSYGDGSFTVGDFSTETLTFR 241

Query: 239 KASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGAS 298
           +  V     GCG +N+GLF G +GL+GLG+  LS   QT   F   FSYCL   + A + 
Sbjct: 242 RNRVKGVALGCGHDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCL-VDRSASSK 300

Query: 299 GSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQ---LQASGF----- 350
            S ++ GN++V   S    +T ++ NP+L TFY + L GIS+GG +   + AS F     
Sbjct: 301 PSSVVFGNAAV---SRIARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQI 357

Query: 351 AKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPL 410
             GG++IDSGT +TRL    Y A++  F         AP FS+ DTCF+LS   EV +P 
Sbjct: 358 GNGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPDFSLFDTCFDLSNMNEVKVPT 417

Query: 411 VKMEFEGNAEMTVDVTGIVYFVKSDAS-QVCLALASLSYEDETGIIGNYQQKNQRVIYDT 469
           V + F G A++++  T   Y +  D + + C A A         IIGN QQ+  RV+YD 
Sbjct: 418 VVLHFRG-ADVSLPATN--YLIPVDTNGKFCFAFAGT--MGGLSIIGNIQQQGFRVVYDL 472

Query: 470 KNSQLGFAGEDCS 482
            +S++GFA   C+
Sbjct: 473 ASSRVGFAPGGCA 485


>gi|125527523|gb|EAY75637.1| hypothetical protein OsI_03542 [Oryza sativa Indica Group]
          Length = 446

 Score =  216 bits (550), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 122/383 (31%), Positives = 201/383 (52%), Gaps = 32/383 (8%)

Query: 123 PLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSIS 180
           P+ SGI  ++  Y A + +G       +++DTGSDL W+QC PC+ CY Q+  VFDP  S
Sbjct: 74  PVFSGIPFESGEYFALVGVGTPSTKAMLVIDTGSDLVWLQCSPCRRCYAQRGQVFDPRRS 133

Query: 181 PSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA 240
            +Y++V C+S  C AL F   +SG  +      C Y V+YGDGS + GEL  + L     
Sbjct: 134 STYRRVPCSSPQCRALRFPGCDSGGAAGGG---CRYMVAYGDGSSSTGELATDKLAFAND 190

Query: 241 S-VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASG 299
           + VN+   GCGR+N+GLF   +GL+G+ R  +S+ +Q +  +G +F YCL          
Sbjct: 191 TYVNNVTLGCGRDNEGLFDSAAGLLGVARGKISISTQVAPAYGSVFEYCLGDRTSRSTRS 250

Query: 300 SLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA-------- 351
           S ++ G +          +T ++ NP+  + Y +++ G S+GG+++  +GF+        
Sbjct: 251 SYLVFGRT---PEPPSTAFTALLSNPRRPSLYYVDMAGFSVGGERV--TGFSNASLALDT 305

Query: 352 ---KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPG---FSILDTCFNLSAYQE 405
              +GG+++DSGT I+R     Y+AL+  F  +             S+ D C++L     
Sbjct: 306 ATGRGGVVVDSGTAISRFARDAYAALRDAFDARARAAGMRRLAGEHSVFDACYDLRGRPA 365

Query: 406 VNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLA-----LASLSYEDETGIIGNYQQ 460
            + PL+ + F G A+M +      YF+  D  +   A     L   + +D   +IGN QQ
Sbjct: 366 ASAPLIVLHFAGGADMALPPEN--YFLPVDGGRRRAASYRRCLGFEAADDGLSVIGNVQQ 423

Query: 461 KNQRVIYDTKNSQLGFAGEDCSS 483
           +  RV++D +  ++GFA + C+S
Sbjct: 424 QGFRVVFDVEKERIGFAPKGCTS 446


>gi|297742733|emb|CBI35367.3| unnamed protein product [Vitis vinifera]
          Length = 521

 Score =  216 bits (550), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 141/423 (33%), Positives = 207/423 (48%), Gaps = 46/423 (10%)

Query: 69  LELKHKNYCS-GKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSG 127
           +++ H++  S G   D   +   RL  D   V  L  R+ +   G+ + V +    + SG
Sbjct: 135 MKVVHRDQLSFGNSDDHRHRLDGRLKRDAKRVASLIRRLSSGGGGSYR-VDDFGTDVISG 193

Query: 128 IRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKK 185
           +   +  Y   I +G   R+  +++D+GSD+ WVQCQPC  CY+Q DPVFDP+ S S+  
Sbjct: 194 MEQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCTQCYHQSDPVFDPADSASFTG 253

Query: 186 VLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDF 245
           V C+SS C  LE A  ++G         C Y VSYGDGSYT+G L  E L  G+  V   
Sbjct: 254 VSCSSSVCDRLENAGCHAG--------RCRYEVSYGDGSYTKGTLALETLTFGRTMVRSV 305

Query: 246 IFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGG 305
             GCG  N+G+F G +GL+GLG   +S V Q     GG FSYCL S              
Sbjct: 306 AIGCGHRNRGMFVGAAGLLGLGGGSMSFVGQLGGQTGGAFSYCLVSA------------- 352

Query: 306 NSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQAS-------GFAKGGILID 358
                       +  ++ NP+  +FY + L G+ +GG ++  S           GG+++D
Sbjct: 353 -----------AWVPLVRNPRAPSFYYIGLAGLGVGGIRVPISEEVFRLTELGDGGVVMD 401

Query: 359 SGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGN 418
           +GT +TRLP   Y A +  FL Q +  P A G +I DTC++L  +  V +P V   F G 
Sbjct: 402 TGTAVTRLPTLAYQAFRDAFLAQTANLPRATGVAIFDTCYDLLGFVSVRVPTVSFYFSGG 461

Query: 419 AEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAG 478
             +T+     +     DA   C A A  +      I+GN QQ+  ++ +D  N  +GF  
Sbjct: 462 PILTLPARNFL-IPMDDAGTFCFAFAPST--SGLSILGNIQQEGIQISFDGANGYVGFGP 518

Query: 479 EDC 481
             C
Sbjct: 519 NIC 521


>gi|242092878|ref|XP_002436929.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
 gi|241915152|gb|EER88296.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
          Length = 505

 Score =  216 bits (550), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 138/368 (37%), Positives = 184/368 (50%), Gaps = 18/368 (4%)

Query: 122 IPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKS-CYNQQDPVFDPS 178
           IP ++G  L TL ++ T+  G   +N T+ +DTGSD++W+QC PC   CY Q DPVFDP+
Sbjct: 148 IPDSTGTSLDTLEFVVTVGFGSPAQNYTLSIDTGSDVSWIQCLPCSGHCYKQHDPVFDPT 207

Query: 179 ISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG 238
            S +Y  V C    C A      NSG C         Y V+YGDGS T G L  E L L 
Sbjct: 208 KSATYSAVPCGHPQCAAAGGKCSNSGTCL--------YKVTYGDGSSTAGVLSHETLSLS 259

Query: 239 KA-SVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGA 297
               +  F FGCG+ N G FGGV GL+GLGR  LSL SQ +  FG  FSYCLPS      
Sbjct: 260 STRDLPGFAFGCGQTNLGEFGGVDGLVGLGRGALSLPSQAAATFGATFSYCLPSYDT--T 317

Query: 298 SGSLILGGNSSVFKN-STPITYTNMIPNPQLATFYILNLTGISIGGKQLQASG--FAKGG 354
            G L +G  +    N    + YT MI      + Y + +  I IGG  L      F + G
Sbjct: 318 HGYLTMGSTTPAASNDDDDVQYTAMIQKEDYPSLYFVEVVSIDIGGYILPVPPTVFTRDG 377

Query: 355 ILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKME 414
            L DSGT++T LPP  Y++L+  F    + +  AP +   DTC++ + +  + +P V  +
Sbjct: 378 TLFDSGTILTYLPPEAYASLRDRFKFTMTQYKPAPAYDPFDTCYDFTGHNAIFMPAVAFK 437

Query: 415 FEGNAEMTVDVTGIVYFVKSDASQV-CLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQ 473
           F   A   +    I+ +    A    CLA           IIGN QQ+   VIYD    +
Sbjct: 438 FSDGAVFDLSPVAILIYPDDTAPATGCLAFVPRPSTMPFNIIGNTQQRGTEVIYDVAAEK 497

Query: 474 LGFAGEDC 481
           +GF    C
Sbjct: 498 IGFGQFTC 505


>gi|297821064|ref|XP_002878415.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297324253|gb|EFH54674.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 486

 Score =  216 bits (550), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 162/466 (34%), Positives = 232/466 (49%), Gaps = 49/466 (10%)

Query: 42  LQWQQKSGSSSSCVSHQKSRIEMGAITLELKH----KNYCSGKIVDWNEQQQNRLILDNL 97
           L W +    S   VS   +     ++++ L H     ++     VD  +    RL  D+L
Sbjct: 44  LSWPESKSFSDESVSESTT-----SLSVHLSHVDALSSFSDASPVDLFKL---RLQRDSL 95

Query: 98  HVQYLQSRI-----KNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIV 150
            V+ + S       +N      +        + SG+   +  Y   + +G    N+ +++
Sbjct: 96  RVKSITSLAAVSTGRNATKRTPRSAGGFSGAVISGLSQGSGEYFMRLGVGTPATNVYMVL 155

Query: 151 DTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSS 210
           DTGSD+ W+QC PCK+CYNQ D +FDP  S ++  V C S  C  L+    +S  C +  
Sbjct: 156 DTGSDVVWLQCSPCKACYNQSDVIFDPKKSKTFATVPCGSRLCRRLD----DSSECVTRR 211

Query: 211 PPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSD 270
              C Y VSYGDGS+T G+   E L    A V+    GCG +N+GLF G +GL+GLGR  
Sbjct: 212 SKTCLYQVSYGDGSFTEGDFSTETLTFHGARVDHVPLGCGHDNEGLFVGAAGLLGLGRGG 271

Query: 271 LSLVSQTSEIFGGLFSYCL---PSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQL 327
           LS  SQT   + G FSYCL    S+  +    S I+ GN +V K S    +T ++ NP+L
Sbjct: 272 LSFPSQTKSRYNGKFSYCLVDRTSSGSSSKPPSTIVFGNDAVPKTS---VFTPLLTNPKL 328

Query: 328 ATFYILNLTGISIGG--------KQLQASGFAKGGILIDSGTVITRLPPSIYSALKAEFL 379
            TFY L L GIS+GG         Q +      GG++IDSGT +TRL  S Y AL+  F 
Sbjct: 329 DTFYYLQLLGISVGGSRVPGVSESQFKLDATGNGGVIIDSGTSVTRLTQSAYVALRDAFR 388

Query: 380 KQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV 439
              +    AP +S+ DTCF+LS    V +P V   F G  E+++  +  +  V ++  + 
Sbjct: 389 LGATKLKRAPSYSLFDTCFDLSGMTTVKVPTVVFHF-GGGEVSLPASNYLIPVNTEG-RF 446

Query: 440 CLALA----SLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
           C A A    SLS      IIGN QQ+  RV YD   S++GF    C
Sbjct: 447 CFAFAGTMGSLS------IIGNIQQQGFRVAYDLVGSRVGFLSRAC 486


>gi|302762735|ref|XP_002964789.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
 gi|300167022|gb|EFJ33627.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
          Length = 390

 Score =  216 bits (549), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 142/402 (35%), Positives = 214/402 (53%), Gaps = 30/402 (7%)

Query: 95  DNLHVQYLQSRIKNMISGNIKDVSNTEIP-LTSGIRLQTLNYIATIELGG--RNMTVIVD 151
           D   ++++  RI++    + +  S  +   ++SG+ L +  Y A + +G   R+  + +D
Sbjct: 4   DEARLRWIHHRIQSSDHRHRRGRSLLQTAQVSSGLSLGSGEYFARMGIGSPQRSYYLELD 63

Query: 152 TGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSP 211
           TGSD+TW+QC PC SCY+Q DP++DPS S SY++V C S+ C AL+++      C     
Sbjct: 64  TGSDVTWIQCAPCSSCYSQVDPIYDPSNSSSYRRVYCGSALCQALDYS-----ACQGMG- 117

Query: 212 PDCNYFVSYGDGSYTRGELGREHLGLGKAS---VNDFIFGCGRNNKGLFGGVSGLMGLGR 268
             C+Y V YGD S + G+LG E   LG  S   + +  FGCG +N GLF G +GL+G+G 
Sbjct: 118 --CSYRVVYGDSSASSGDLGIESFYLGPNSSTAMRNIAFGCGHSNSGLFRGEAGLLGMGG 175

Query: 269 SDLSLVSQTSEIFGGLFSYCLPS--TQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQ 326
             LS  SQ +   G  FSYCL    +Q    S  LI G  +  F       +T ++ NP+
Sbjct: 176 GTLSFFSQIAASIGPAFSYCLVDRYSQLQSRSSPLIFGRTAIPFA----ARFTPLLKNPR 231

Query: 327 LATFYILNLTGISIGG-------KQLQASGFAKGGILIDSGTVITRLPPSIYSALKAEFL 379
           + TFY   LTGIS+GG        Q   +G   GG ++DSGT +TR+ P+ Y+ L+  + 
Sbjct: 232 IDTFYYAILTGISVGGTALPIPPAQFALTGNGTGGAILDSGTSVTRVVPAAYAVLRDAYR 291

Query: 380 KQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV 439
                 P APG  +LDTCFN      V IP + + F+ + +M +    I+  V    +  
Sbjct: 292 AASRNLPPAPGVYLLDTCFNFQGLPTVQIPSLVLHFDNDVDMVLPGGNILIPVDRSGT-F 350

Query: 440 CLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
           CLA A  S      +IGN QQ+  R+ +D + S +  A  +C
Sbjct: 351 CLAFAPSSM--PISVIGNVQQQTFRIGFDLQRSLIAIAPREC 390


>gi|413938616|gb|AFW73167.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
          Length = 452

 Score =  216 bits (549), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 142/364 (39%), Positives = 192/364 (52%), Gaps = 52/364 (14%)

Query: 127 GIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPC---KSCYNQQDPVFDPSISP 181
           G  + TLNY+ T  LG  G   T+ VDTGSDL+WVQC+PC    SCY+Q+DP+FDP+ S 
Sbjct: 132 GYDIGTLNYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCAAAPSCYSQKDPLFDPAQSS 191

Query: 182 SYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS 241
           SY  V C    C  L                          G Y          G    +
Sbjct: 192 SYAAVPCGGPVCAGL--------------------------GIYAASACSAAQCG----A 221

Query: 242 VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSL 301
           V  F FGCG    GLF GV GL+GLGR   SLV QT+  +GG+FSYCLP+        +L
Sbjct: 222 VQGFFFGCGHAQSGLFNGVDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPSTAGYLTL 281

Query: 302 ILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ--ASGFAKGGILIDS 359
            +GG S         + T ++P+P   T+Y++ LTGIS+GG+QL   AS FA  G ++D+
Sbjct: 282 GVGGPSGAAPG---FSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAG-GTVVDT 337

Query: 360 GTVITRLPPSIYSALKAEFLKQFS--GFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEG 417
           GTV+TRLPP+ Y+AL++ F    +  G+P+AP   ILDTC+N + Y  V +P V + F  
Sbjct: 338 GTVVTRLPPTAYAALRSAFRSGMASYGYPTAPSNGILDTCYNFAGYGTVTLPNVALTFGS 397

Query: 418 NAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFA 477
            A +T+   GI+ F        CLA A    +    I+GN QQ++  V  D   + +GF 
Sbjct: 398 GATVTLGADGILSF-------GCLAFAPSGSDGGMAILGNVQQRSFEVRID--GTSVGFK 448

Query: 478 GEDC 481
              C
Sbjct: 449 PSSC 452


>gi|148908573|gb|ABR17396.1| unknown [Picea sitchensis]
          Length = 350

 Score =  215 bits (548), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 141/361 (39%), Positives = 194/361 (53%), Gaps = 31/361 (8%)

Query: 134 NYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSS 191
            Y   I +G   R   +++DTGSD+ W+QC+PC+ CY+Q DP+F+PS S S+  V C+S+
Sbjct: 7   EYFTRIGIGTPTREQYMVLDTGSDVVWIQCEPCRECYSQADPIFNPSSSVSFSTVGCDSA 66

Query: 192 TCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGR 251
            C  L+    + G         C Y VSYGDGSYT G    E L  G  S+ +   GCG 
Sbjct: 67  VCSQLDANDCHGG--------GCLYEVSYGDGSYTVGSYATETLTFGTTSIQNVAIGCGH 118

Query: 252 NNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFK 311
           +N GLF G +GL+GLG   LS  +Q     G  FSYCL   +D+ +SG+L  G       
Sbjct: 119 DNVGLFVGAAGLLGLGAGSLSFPAQLGTQTGRAFSYCL-VDRDSESSGTLEFG------P 171

Query: 312 NSTPI--TYTNMIPNPQLATFYILNLTGISIGG---KQLQASGF------AKGGILIDSG 360
            S PI   +T ++ NP L TFY L++  IS+GG     + +  F       +GGI+IDSG
Sbjct: 172 ESVPIGSIFTPLVANPFLPTFYYLSMVAISVGGVILDSVPSEAFRIDETTGRGGIIIDSG 231

Query: 361 TVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAE 420
           T +TRL  S Y AL+  F+      P A G SI DTC++LSA Q V+IP V   F   A 
Sbjct: 232 TAVTRLQTSAYDALRDAFIAGTQHLPRADGISIFDTCYDLSALQSVSIPAVGFHFSNGAG 291

Query: 421 MTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGED 480
             +     +  + S  +  C A A    +    I+GN QQ+  RV +D+ NS +GFA + 
Sbjct: 292 FILPAKNCLIPMDSMGT-FCFAFAPA--DSNLSIMGNIQQQGIRVSFDSANSLVGFAIDQ 348

Query: 481 C 481
           C
Sbjct: 349 C 349


>gi|297720195|ref|NP_001172459.1| Os01g0608366 [Oryza sativa Japonica Group]
 gi|53792202|dbj|BAD52835.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
           Group]
 gi|255673454|dbj|BAH91189.1| Os01g0608366 [Oryza sativa Japonica Group]
          Length = 452

 Score =  215 bits (547), Expect = 5e-53,   Method: Compositional matrix adjust.
 Identities = 151/422 (35%), Positives = 214/422 (50%), Gaps = 31/422 (7%)

Query: 65  GAITLELKHK-NYCSGKIVDWNEQQ---QNRLILDNLHVQYLQSRIKN---MISGNIKDV 117
           G  ++ L H+   CS    +  E++   +  L  D L   Y++ +        +G     
Sbjct: 31  GTSSVTLSHRYGPCSPADPNSGEKRPTDEELLRRDQLRADYIRRKFSGSNGTAAGEDGQS 90

Query: 118 SNTEIPLTSGIRLQTLNYIATIELGGRNMT--VIVDTGSDLTWVQCQPCKS---CYNQQD 172
           S   +P T G  L TL Y+ ++ LG   +T  V++DTGSD++WVQC+PC +   C+    
Sbjct: 91  SKVSVPTTLGSSLDTLEYVISVGLGSPAVTQRVVIDTGSDVSWVQCEPCPAPSPCHAHAG 150

Query: 173 PVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGR 232
            +FDP+ S +Y    C+++ C  L   +G +  C + S   C Y V YGDGS T G    
Sbjct: 151 ALFDPAASSTYAAFNCSAAACAQLG-DSGEANGCDAKS--RCQYIVKYGDGSNTTGTYSS 207

Query: 233 EHLGL-GKASVNDFIFGCGRNN--KGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL 289
           + L L G   V  F FGC       G+     GL+GLG    S VSQT+  +G  F YCL
Sbjct: 208 DVLTLSGSDVVRGFQFGCSHAELGAGMDDKTDGLIGLGGDAQSPVSQTAARYGKSFFYCL 267

Query: 290 PSTQDAGASGSLILGGNSSVFKNSTP-ITYTNMIPNPQLATFYILNLTGISIGGKQL--Q 346
           P+T    +SG L LG  +S           T M+ + ++ T+Y   L  I++GGK+L   
Sbjct: 268 PATP--ASSGFLTLGAPASGGGGGASRFATTPMLRSKKVPTYYFAALEDIAVGGKKLGLS 325

Query: 347 ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEV 406
            S FA G  L+DSGTVITRLPP+ Y+AL + F    + +  A    ILDTCFN +   +V
Sbjct: 326 PSVFAAGS-LVDSGTVITRLPPAAYAALSSAFRAGMTRYARAEPLGILDTCFNFTGLDKV 384

Query: 407 NIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVI 466
           +IP V + F G A + +D  GIV       S  CLA A    +   G IGN QQ+   V+
Sbjct: 385 SIPTVALVFAGGAVVDLDAHGIV-------SGGCLAFAPTRDDKAFGTIGNVQQRTFEVL 437

Query: 467 YD 468
           YD
Sbjct: 438 YD 439


>gi|356504173|ref|XP_003520873.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 461

 Score =  215 bits (547), Expect = 5e-53,   Method: Compositional matrix adjust.
 Identities = 146/359 (40%), Positives = 197/359 (54%), Gaps = 26/359 (7%)

Query: 134 NYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSS 191
            Y   I +G   R + +++DTGSD+ W+QC PC+ CY Q D VFDP+ S +Y  + C + 
Sbjct: 117 EYFTRIGVGTPARYVYMVLDTGSDVVWLQCAPCRKCYTQTDHVFDPTKSRTYAGIPCGAP 176

Query: 192 TCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGR 251
            C  L+     S  CS+ +   C Y VSYGDGS+T G+   E L   +  V     GCG 
Sbjct: 177 LCRRLD-----SPGCSNKNKV-CQYQVSYGDGSFTFGDFSTETLTFRRNRVTRVALGCGH 230

Query: 252 NNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFK 311
           +N+GLF G +GL+GLGR  LS   QT   F   FSYCL   + A A  S ++ G+S+V +
Sbjct: 231 DNEGLFTGAAGLLGLGRGRLSFPVQTGRRFNHKFSYCL-VDRSASAKPSSVIFGDSAVSR 289

Query: 312 NSTPITYTNMIPNPQLATFYILNLTGISIGG---KQLQASGF-----AKGGILIDSGTVI 363
            +    +T +I NP+L TFY L L GIS+GG   + L AS F       GG++IDSGT +
Sbjct: 290 TA---HFTPLIKNPKLDTFYYLELLGISVGGAPVRGLSASLFRLDAAGNGGVIIDSGTSV 346

Query: 364 TRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTV 423
           TRL    Y AL+  F    S    AP FS+ DTCF+LS   EV +P V + F G A++++
Sbjct: 347 TRLTRPAYIALRDAFRIGASHLKRAPEFSLFDTCFDLSGLTEVKVPTVVLHFRG-ADVSL 405

Query: 424 DVTGIVYFVKSDAS-QVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
             T   Y +  D S   C A A         IIGN QQ+  R+ YD   S++GFA   C
Sbjct: 406 PATN--YLIPVDNSGSFCFAFAGTM--SGLSIIGNIQQQGFRISYDLTGSRVGFAPRGC 460


>gi|293329689|dbj|BAJ04354.1| pollen allergen CPA63 [Cryptomeria japonica]
          Length = 472

 Score =  215 bits (547), Expect = 5e-53,   Method: Compositional matrix adjust.
 Identities = 131/396 (33%), Positives = 203/396 (51%), Gaps = 29/396 (7%)

Query: 104 SRIKNMISGN-------IKDVSNTEIPLTSGIRLQTLNYIATIELGG--RNMTVIVDTGS 154
           +R + M+ G        +    + +IPL SG  + + NYI  +  G   ++   ++DTGS
Sbjct: 86  ARYRAMVKGGWSAGKTMVNPQEDADIPLASGQAISSSNYIIKLGFGTPPQSFYTVLDTGS 145

Query: 155 DLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDC 214
           ++ W+ C PC  C ++Q P F+PS S +Y  + C S  C  L   T       S +  +C
Sbjct: 146 NIAWIPCNPCSGCSSKQQP-FEPSKSSTYNYLTCASQQCQLLRVCT------KSDNSVNC 198

Query: 215 NYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLV 274
           +    YGD S     L  E L +G   V +F+FGC    +GL      L+G GR+ LS V
Sbjct: 199 SLTQRYGDQSEVDEILSSETLSVGSQQVENFVFGCSNAARGLIQRTPSLVGFGRNPLSFV 258

Query: 275 SQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILN 334
           SQT+ ++   FSYCLPS   +  +GSL+LG  +    ++  + +T ++ N +  +FY + 
Sbjct: 259 SQTATLYDSTFSYCLPSLFSSAFTGSLLLGKEA---LSAQGLKFTPLLSNSRYPSFYYVG 315

Query: 335 LTGISIGGK-------QLQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPS 387
           L GIS+G +        L        G +IDSGTVITRL    Y+A++  F  Q S    
Sbjct: 316 LNGISVGEELVSIPAGTLSLDESTGRGTIIDSGTVITRLVEPAYNAMRDSFRSQLSNLTM 375

Query: 388 APGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLA--LAS 445
           A    + DTC+N  +  +V  PL+ + F+ N ++T+ +  I+Y    D S +CLA  L  
Sbjct: 376 ASPTDLFDTCYNRPS-GDVEFPLITLHFDDNLDLTLPLDNILYPGNDDGSVLCLAFGLPP 434

Query: 446 LSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
              +D     GNYQQ+  R+++D   S+LG A E+C
Sbjct: 435 GGGDDVLSTFGNYQQQKLRIVHDVAESRLGIASENC 470


>gi|219362525|ref|NP_001136612.1| uncharacterized protein LOC100216735 [Zea mays]
 gi|194696366|gb|ACF82267.1| unknown [Zea mays]
 gi|413953802|gb|AFW86451.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
          Length = 411

 Score =  215 bits (547), Expect = 5e-53,   Method: Compositional matrix adjust.
 Identities = 135/371 (36%), Positives = 198/371 (53%), Gaps = 30/371 (8%)

Query: 119 NTEIPLTSGIRLQTLNYIATIELGGRNM--TVIVDTGSDLTWVQCQPCKS--CYNQQDPV 174
              +P   G  + +L Y+  +  G   +   V++DTGSD++W+QC+PC S  C+ Q+DP+
Sbjct: 63  KVSVPAHLGTSVMSLEYVVRVSFGTPAVPQVVVIDTGSDVSWLQCKPCSSGQCFPQKDPL 122

Query: 175 FDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREH 234
           +DPS S +Y  V C S  C  L      SG C+S     C + +SY DG+ T G   ++ 
Sbjct: 123 YDPSHSSTYSAVPCASDVCKKLAADAYGSG-CTSGK--QCGFAISYADGTSTVGAYSQDK 179

Query: 235 LGLGK-ASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQ 293
           L L   A V +F FGCG     + G   G++GLGR   SL ++    +GG+FSYCLPS  
Sbjct: 180 LTLAPGAIVQNFYFGCGHGKHAVRGLFDGVLGLGRLRESLGAR----YGGVFSYCLPSVS 235

Query: 294 DAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQ--LQASGFA 351
                G L LG      KN +   +T M   P   TF  + L GI++GGK+  L+ S F+
Sbjct: 236 S--KPGFLALGAG----KNPSGFVFTPMGTVPGQPTFSTVTLAGINVGGKKLDLRPSAFS 289

Query: 352 KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLV 411
            GG+++DSGTVIT L  + Y AL++ F K    +   P    LDTC+NL+ Y+ V +P +
Sbjct: 290 -GGMIVDSGTVITGLQSTAYRALRSAFRKAMEAYRLLPN-GDLDTCYNLTGYKNVVVPKI 347

Query: 412 KMEFEGNAEMTVDV-TGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTK 470
            + F G A + +DV  GI+          CLA A    +   G++GN  Q+   V++DT 
Sbjct: 348 ALTFTGGATINLDVPNGILV-------NGCLAFAESGPDGSAGVLGNVNQRAFEVLFDTS 400

Query: 471 NSQLGFAGEDC 481
            S+ GF  + C
Sbjct: 401 TSKFGFRAKAC 411


>gi|15228618|ref|NP_191741.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|6850873|emb|CAB71112.1| putative protein [Arabidopsis thaliana]
 gi|332646739|gb|AEE80260.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 483

 Score =  214 bits (546), Expect = 5e-53,   Method: Compositional matrix adjust.
 Identities = 152/413 (36%), Positives = 214/413 (51%), Gaps = 37/413 (8%)

Query: 91  RLILDNLHVQYLQSRI-----KNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--G 143
           RL  D+L V+ + S       +N      +        + SG+   +  Y   + +G   
Sbjct: 86  RLQRDSLRVKSITSLAAVSTGRNATKRTPRTAGGFSGAVISGLSQGSGEYFMRLGVGTPA 145

Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNS 203
            N+ +++DTGSD+ W+QC PCK+CYNQ D +FDP  S ++  V C S  C  L+    +S
Sbjct: 146 TNVYMVLDTGSDVVWLQCSPCKACYNQTDAIFDPKKSKTFATVPCGSRLCRRLD----DS 201

Query: 204 GVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGL 263
             C +     C Y VSYGDGS+T G+   E L    A V+    GCG +N+GLF G +GL
Sbjct: 202 SECVTRRSKTCLYQVSYGDGSFTEGDFSTETLTFHGARVDHVPLGCGHDNEGLFVGAAGL 261

Query: 264 MGLGRSDLSLVSQTSEIFGGLFSYCL---PSTQDAGASGSLILGGNSSVFKNSTPITYTN 320
           +GLGR  LS  SQT   + G FSYCL    S+  +    S I+ GN++V K S    +T 
Sbjct: 262 LGLGRGGLSFPSQTKNRYNGKFSYCLVDRTSSGSSSKPPSTIVFGNAAVPKTS---VFTP 318

Query: 321 MIPNPQLATFYILNLTGISIGG--------KQLQASGFAKGGILIDSGTVITRLPPSIYS 372
           ++ NP+L TFY L L GIS+GG         Q +      GG++IDSGT +TRL    Y 
Sbjct: 319 LLTNPKLDTFYYLQLLGISVGGSRVPGVSESQFKLDATGNGGVIIDSGTSVTRLTQPAYV 378

Query: 373 ALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFV 432
           AL+  F    +    AP +S+ DTCF+LS    V +P V   F G  E+++  +  +  V
Sbjct: 379 ALRDAFRLGATKLKRAPSYSLFDTCFDLSGMTTVKVPTVVFHF-GGGEVSLPASNYLIPV 437

Query: 433 KSDASQVCLALA----SLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
            ++  + C A A    SLS      IIGN QQ+  RV YD   S++GF    C
Sbjct: 438 NTEG-RFCFAFAGTMGSLS------IIGNIQQQGFRVAYDLVGSRVGFLSRAC 483


>gi|224067990|ref|XP_002302634.1| predicted protein [Populus trichocarpa]
 gi|222844360|gb|EEE81907.1| predicted protein [Populus trichocarpa]
          Length = 490

 Score =  214 bits (546), Expect = 6e-53,   Method: Compositional matrix adjust.
 Identities = 144/370 (38%), Positives = 205/370 (55%), Gaps = 26/370 (7%)

Query: 124 LTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISP 181
           +TSG+   +  Y   + +G   R + +++DTGSD+ W+QC PCK CY+Q DPVF+P+ S 
Sbjct: 136 VTSGLAQGSGEYFTRLGVGTPARYVFMVLDTGSDVVWIQCAPCKKCYSQTDPVFNPTKSR 195

Query: 182 SYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS 241
           S+  + C S  C  L+     S  CS+     C Y VSYGDGS+T GE   E L      
Sbjct: 196 SFANIPCGSPLCRRLD-----SPGCSTKKH-ICLYQVSYGDGSFTYGEFSTETLTFRGTR 249

Query: 242 VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSL 301
           V     GCG +N+GLF G +GL+GLGR  LS  SQ    F   FSYCL   + A +  S 
Sbjct: 250 VGRVALGCGHDNEGLFIGAAGLLGLGRGRLSFPSQIGRRFSRKFSYCL-VDRSASSKPSY 308

Query: 302 ILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQ---LQASGF-----AKG 353
           ++ G+S++ + +    +T ++ NP+L TFY + L G+S+GG +   + AS F       G
Sbjct: 309 MVFGDSAISRTA---RFTPLVSNPKLDTFYYVELLGVSVGGTRVPGITASLFKLDSTGNG 365

Query: 354 GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKM 413
           G++IDSGT +TRL    Y AL+  F    S    AP FS+ DTCF+LS   EV +P V +
Sbjct: 366 GVIIDSGTSVTRLTRPAYVALRDAFRVGASNLKRAPEFSLFDTCFDLSGKTEVKVPTVVL 425

Query: 414 EFEGNAEMTVDVTGIVYFVKSDAS-QVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNS 472
            F G A++++  +   Y +  D S   C A A         I+GN QQ+  RV+YD   S
Sbjct: 426 HFRG-ADVSLPASN--YLIPVDNSGSFCFAFAGT--MSGLSIVGNIQQQGFRVVYDLAAS 480

Query: 473 QLGFAGEDCS 482
           ++GFA   C+
Sbjct: 481 RVGFAPRGCA 490


>gi|21594980|gb|AAM66061.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
           thaliana]
          Length = 485

 Score =  214 bits (546), Expect = 6e-53,   Method: Compositional matrix adjust.
 Identities = 155/433 (35%), Positives = 229/433 (52%), Gaps = 36/433 (8%)

Query: 67  ITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIP--- 123
           ITL L H +  S       E   +RL  D+  V+ + + +   I G  ++V++   P   
Sbjct: 72  ITLNLDHIDALSSNKTP-QELFSSRLQRDSRRVRSIAT-LAAQIPG--RNVTHAPRPGGF 127

Query: 124 ---LTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPS 178
              + SG+   +  Y   + +G   R + +++DTGSD+ W+QC PC+ CY+Q DP+FDP 
Sbjct: 128 SSSVVSGLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPR 187

Query: 179 ISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG 238
            S +Y  + C+S  C  L+ A  N+          C Y VSYGDGS+T G+   E L   
Sbjct: 188 KSKTYATIPCSSPHCRRLDSAGCNT------RRKTCLYQVSYGDGSFTVGDFSTETLTFR 241

Query: 239 KASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGAS 298
           +  V     GCG +N+GLF G +GL+GLG+  LS   QT   F   FSYCL   + A + 
Sbjct: 242 RNRVKGVALGCGHDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCL-VDRSASSK 300

Query: 299 GSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQ---LQASGF----- 350
            S ++ GN++V   S    +T ++ NP+L TFY + L GIS+GG +   + AS F     
Sbjct: 301 PSSVVFGNAAV---SRIARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQI 357

Query: 351 AKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPL 410
             GG++IDSGT +TRL    Y A++  F         AP FS+ DTCF+LS   EV +P 
Sbjct: 358 GNGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPNFSLFDTCFDLSNMNEVKVPT 417

Query: 411 VKMEFEGNAEMTVDVTGIVYFVKSDAS-QVCLALASLSYEDETGIIGNYQQKNQRVIYDT 469
           V + F   A++++  T   Y +  D + + C A A         IIGN QQ+  RV+YD 
Sbjct: 418 VVLHFR-RADVSLPATN--YLIPVDTNGKFCFAFAGT--MGGLSIIGNIQQQGFRVVYDL 472

Query: 470 KNSQLGFAGEDCS 482
            +S++GFA   C+
Sbjct: 473 ASSRVGFAPGGCA 485


>gi|194696934|gb|ACF82551.1| unknown [Zea mays]
 gi|413936470|gb|AFW71021.1| hypothetical protein ZEAMMB73_589717 [Zea mays]
 gi|413953801|gb|AFW86450.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
          Length = 445

 Score =  214 bits (546), Expect = 6e-53,   Method: Compositional matrix adjust.
 Identities = 135/371 (36%), Positives = 198/371 (53%), Gaps = 30/371 (8%)

Query: 119 NTEIPLTSGIRLQTLNYIATIELGGRNM--TVIVDTGSDLTWVQCQPCKS--CYNQQDPV 174
              +P   G  + +L Y+  +  G   +   V++DTGSD++W+QC+PC S  C+ Q+DP+
Sbjct: 97  KVSVPAHLGTSVMSLEYVVRVSFGTPAVPQVVVIDTGSDVSWLQCKPCSSGQCFPQKDPL 156

Query: 175 FDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREH 234
           +DPS S +Y  V C S  C  L      SG C+S     C + +SY DG+ T G   ++ 
Sbjct: 157 YDPSHSSTYSAVPCASDVCKKLAADAYGSG-CTSGK--QCGFAISYADGTSTVGAYSQDK 213

Query: 235 LGLGK-ASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQ 293
           L L   A V +F FGCG     + G   G++GLGR   SL ++    +GG+FSYCLPS  
Sbjct: 214 LTLAPGAIVQNFYFGCGHGKHAVRGLFDGVLGLGRLRESLGAR----YGGVFSYCLPSVS 269

Query: 294 DAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQ--LQASGFA 351
                G L LG      KN +   +T M   P   TF  + L GI++GGK+  L+ S F+
Sbjct: 270 S--KPGFLALGAG----KNPSGFVFTPMGTVPGQPTFSTVTLAGINVGGKKLDLRPSAFS 323

Query: 352 KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLV 411
            GG+++DSGTVIT L  + Y AL++ F K    +   P    LDTC+NL+ Y+ V +P +
Sbjct: 324 -GGMIVDSGTVITGLQSTAYRALRSAFRKAMEAYRLLPN-GDLDTCYNLTGYKNVVVPKI 381

Query: 412 KMEFEGNAEMTVDV-TGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTK 470
            + F G A + +DV  GI+          CLA A    +   G++GN  Q+   V++DT 
Sbjct: 382 ALTFTGGATINLDVPNGILV-------NGCLAFAESGPDGSAGVLGNVNQRAFEVLFDTS 434

Query: 471 NSQLGFAGEDC 481
            S+ GF  + C
Sbjct: 435 TSKFGFRAKAC 445


>gi|148907930|gb|ABR17085.1| unknown [Picea sitchensis]
          Length = 498

 Score =  214 bits (545), Expect = 8e-53,   Method: Compositional matrix adjust.
 Identities = 158/510 (30%), Positives = 246/510 (48%), Gaps = 56/510 (10%)

Query: 7   PLTILSLLLPLMVSLFLLAKGAHCFEGKKKLHLHKLQ----WQQKSGSSSSCVSHQKSRI 62
           PL   + LL + + LFL +  +      +    H L      ++   ++      ++++ 
Sbjct: 10  PLLPFTFLLCVGMLLFLQSAQSRPISVPEVPAYHALDVASSLRETDTAAGGAEYKRETKP 69

Query: 63  EMGAITLELKHKNY-----CSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDV 117
                ++E+ H++       +     +  + + +L  + + V+ L+ +I+  ++ N   V
Sbjct: 70  RRSPWSVEVVHRDALLLKNAANATASYERRLKEKLRREAVRVRGLERQIERTLTLNKDPV 129

Query: 118 SNTEI----------PLTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCK 165
           +  E            + SG+   +  Y   I +G   R   +++DTGSD+ W+QC+PC+
Sbjct: 130 NRYENVAEVDADFGGEVVSGMEQGSGEYFTRIGVGTPTREQYMVLDTGSDVAWIQCEPCR 189

Query: 166 SCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSY 225
            CY+Q DP+F+PS S S+  V C+S+ C  L+    +SG         C Y  SYGDGSY
Sbjct: 190 ECYSQADPIFNPSYSASFSTVGCDSAVCSQLDAYDCHSG--------GCLYEASYGDGSY 241

Query: 226 TRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLF 285
           + G    E L  G  SV +   GCG  N GLF G +GL+GLG   LS  +Q     G  F
Sbjct: 242 STGSFATETLTFGTTSVANVAIGCGHKNVGLFIGAAGLLGLGAGALSFPNQIGTQTGHTF 301

Query: 286 SYCLPSTQDAGASGSLILGGNSSVFKNSTPI--TYTNMIPNPQLATFYILNLTGISIGGK 343
           SYCL   +++ +SG L  G        S P+   +T +  NP L TFY L++T IS+GG 
Sbjct: 302 SYCL-VDRESDSSGPLQFG------PKSVPVGSIFTPLEKNPHLPTFYYLSVTAISVGGA 354

Query: 344 QL-----------QASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFS 392
            L           + SG   GG +IDSGTV+TRL  S Y A++  F+      P     S
Sbjct: 355 LLDSIPPEVFRIDETSG--HGGFIIDSGTVVTRLVTSAYDAVRDAFVAGTGQLPRTDAVS 412

Query: 393 ILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSD-ASQVCLALASLSYEDE 451
           I DTC++LS  Q V++P V   F   A + +      Y +  D     C A A  +    
Sbjct: 413 IFDTCYDLSGLQFVSVPTVGFHFSNGASLILPAKN--YLIPMDTVGTFCFAFAPAA--SS 468

Query: 452 TGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
             I+GN QQ++ RV +D+ NS +GFA + C
Sbjct: 469 VSIMGNTQQQHIRVSFDSANSLVGFAFDQC 498


>gi|357135412|ref|XP_003569303.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 494

 Score =  213 bits (543), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 143/389 (36%), Positives = 202/389 (51%), Gaps = 32/389 (8%)

Query: 112 GNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYN 169
           G  +  S    P+ SG+   +  Y   I +G       +++DTGSD+ W+QC PC+ CY+
Sbjct: 119 GTRRTGSGVVAPVVSGLAQGSGEYFTKIGVGTPATPALMVLDTGSDVVWLQCAPCRRCYD 178

Query: 170 QQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGE 229
           Q   VFDP  S SY  V C++  C  L+     SG C       C Y V+YGDGS T G+
Sbjct: 179 QSGQVFDPRRSRSYGAVGCSAPLCRRLD-----SGGCDLRRKA-CLYQVAYGDGSVTAGD 232

Query: 230 LGREHLGL-GKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYC 288
              E L   G A V     GCG +N+GLF   +GL+GLGR  LS  +Q S  +G  FSYC
Sbjct: 233 FATETLTFAGGARVARIALGCGHDNEGLFVAAAGLLGLGRGSLSFPAQISRRYGRSFSYC 292

Query: 289 L----PSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQ 344
           L     S   A  S ++  G  S    ++   ++T M+ NP++ TFY + L GIS+GG +
Sbjct: 293 LVDRTSSANPASHSSTVTFG--SGAVGSTVAASFTPMVKNPRMETFYYVQLVGISVGGAR 350

Query: 345 LQASGFA-----------KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAP-GFS 392
           +  SG A           +GG+++DSGT +TRL    YSAL+  F    +G   +P GFS
Sbjct: 351 V--SGVADSDLRLDPSSGRGGVIVDSGTSVTRLARPAYSALRDAFRAAAAGLRLSPGGFS 408

Query: 393 ILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDET 452
           + DTC++LS  + V +P V M F G AE  +     +  V S  +  C A A    +   
Sbjct: 409 LFDTCYDLSGRKVVKVPTVSMHFAGGAEAALPPENYLIPVDSKGT-FCFAFAGT--DGGV 465

Query: 453 GIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
            IIGN QQ+  RV++D    ++GF  + C
Sbjct: 466 SIIGNIQQQGFRVVFDGDGQRVGFVPKGC 494


>gi|326515172|dbj|BAK03499.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 494

 Score =  213 bits (541), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 126/344 (36%), Positives = 186/344 (54%), Gaps = 24/344 (6%)

Query: 145 NMTVIVDTGSDLTWVQCQPCK--SCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGN 202
           + TV+VDT SD+ WVQC PC    C+ Q+DP++DP+ S ++  + C S  C  L  + GN
Sbjct: 168 SQTVVVDTSSDIPWVQCLPCPIPQCHLQKDPLYDPAKSSTFAPIPCGSPACKELGSSYGN 227

Query: 203 SGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCGRNNKGLFGGV- 260
              CS ++  +C Y V+YGDG  T G    + L +     V DF FGC    +G F    
Sbjct: 228 G--CSPTTD-ECKYIVNYGDGKATTGTYVTDTLTMSPTIVVKDFRFGCSHAVRGSFSNQN 284

Query: 261 SGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTN 320
           +G++ LG    SL+ QT++ +G  FSYC+P    AG    L LGG     + S   +YT 
Sbjct: 285 AGILALGGGRGSLLEQTADAYGNAFSYCIPKPSSAGF---LSLGGP---VEASLKFSYTP 338

Query: 321 MIPNPQLATFYILNLTGISIGGKQLQA--SGFAKGGILIDSGTVITRLPPSIYSALKAEF 378
           +I N    TFYI++L  I + GKQL    + FA G ++ DSG V+T+LPP +Y+AL+A F
Sbjct: 339 LIKNKHAPTFYIVHLEAIIVAGKQLAVPPTAFATGAVM-DSGAVVTQLPPQVYAALRAAF 397

Query: 379 LKQFSGF-PSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDAS 437
               + + P A     LDTC++ + + +V +P V + F G A + ++   I+        
Sbjct: 398 RSAMAAYGPLAAPVRNLDTCYDFTRFPDVKVPKVSLVFAGGATLDLEPASIIL------- 450

Query: 438 QVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
             CLA A+   E+  G IGN QQ+   V+YD    ++GF    C
Sbjct: 451 DGCLAFAATPGEESVGFIGNVQQQTYEVLYDVGGGKVGFRRGAC 494


>gi|356573235|ref|XP_003554768.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 472

 Score =  213 bits (541), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 145/359 (40%), Positives = 197/359 (54%), Gaps = 26/359 (7%)

Query: 134 NYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSS 191
            Y   I +G   R + +++DTGSD+ W+QC PC+ CY Q DPVFDP+ S +Y  + C + 
Sbjct: 128 EYFTRIGVGTPARYVYMVLDTGSDVVWLQCAPCRKCYTQADPVFDPTKSRTYAGIPCGAP 187

Query: 192 TCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGR 251
            C  L+     S  C++ +   C Y VSYGDGS+T G+   E L   +  V     GCG 
Sbjct: 188 LCRRLD-----SPGCNNKNKV-CQYQVSYGDGSFTFGDFSTETLTFRRTRVTRVALGCGH 241

Query: 252 NNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFK 311
           +N+GLF G +GL+GLGR  LS   QT   F   FSYCL   + A A  S ++ G+S+V +
Sbjct: 242 DNEGLFIGAAGLLGLGRGRLSFPVQTGRRFNQKFSYCL-VDRSASAKPSSVVFGDSAVSR 300

Query: 312 NSTPITYTNMIPNPQLATFYILNLTGISIGG---KQLQASGF-----AKGGILIDSGTVI 363
            +    +T +I NP+L TFY L L GIS+GG   + L AS F       GG++IDSGT +
Sbjct: 301 TA---RFTPLIKNPKLDTFYYLELLGISVGGSPVRGLSASLFRLDAAGNGGVIIDSGTSV 357

Query: 364 TRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTV 423
           TRL    Y AL+  F    S    A  FS+ DTCF+LS   EV +P V + F G A++++
Sbjct: 358 TRLTRPAYIALRDAFRVGASHLKRAAEFSLFDTCFDLSGLTEVKVPTVVLHFRG-ADVSL 416

Query: 424 DVTGIVYFVKSDAS-QVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
             T   Y +  D S   C A A         IIGN QQ+  RV +D   S++GFA   C
Sbjct: 417 PATN--YLIPVDNSGSFCFAFAGT--MSGLSIIGNIQQQGFRVSFDLAGSRVGFAPRGC 471


>gi|255576511|ref|XP_002529147.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223531426|gb|EEF33260.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 479

 Score =  213 bits (541), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 151/439 (34%), Positives = 230/439 (52%), Gaps = 46/439 (10%)

Query: 67  ITLELKHKNYCSG-KIVDWNEQQQNRLILDNLHVQYLQSRIKNMISG----NIKDV---- 117
           +T+EL  +      K  D+     +RL  D+  V+ + +R+   I G    ++K +    
Sbjct: 63  LTMELHSRTSVQKTKHPDYRSLTLSRLERDSARVKSINTRLDLAIHGLSTSDLKPLDTDS 122

Query: 118 ----SNTEIPLTSGIRLQTLNYIATIELGGRNMTV--IVDTGSDLTWVQCQPCKSCYNQQ 171
                + + P+ SG    +  Y + + +G  +  V  ++DTGSD+ W+QC PC  CY+Q 
Sbjct: 123 QFRAEDLQGPIISGTSQGSGEYFSRVGIGKPSSPVYMVLDTGSDVNWIQCAPCADCYHQA 182

Query: 172 DPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELG 231
           DP+F+P+ S SY  + C++  C +L+ +      C +++   C Y VSYGDGSYT G+  
Sbjct: 183 DPIFEPASSTSYSPLSCDTKQCQSLDVSE-----CRNNT---CLYEVSYGDGSYTVGDFV 234

Query: 232 REHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPS 291
            E + LG ASV++   GCG NN+GLF G +GL+GLG   LS  SQ   I    FSYCL  
Sbjct: 235 TETITLGSASVDNVAIGCGHNNEGLFIGAAGLLGLGGGKLSFPSQ---INASSFSYCL-- 289

Query: 292 TQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGK-------- 343
             D  +  +  L  NS++  ++  IT   ++ N +L TFY + +TG+S+GG+        
Sbjct: 290 -VDRDSDSASTLEFNSALLPHA--IT-APLLRNRELDTFYYVGMTGLSVGGELLSIPESM 345

Query: 344 -QLQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSA 402
            ++  SG   GGI+IDSGT +TRL  + Y+AL+  F+K     P     ++ DTC++LS 
Sbjct: 346 FEMDESG--NGGIIIDSGTAVTRLQTAAYNALRDAFVKGTKDLPVTSEVALFDTCYDLSR 403

Query: 403 YQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKN 462
              V +P V     G   + +  T  +  V SD +  C A A  S      IIGN QQ+ 
Sbjct: 404 KTSVEVPTVTFHLAGGKVLPLPATNYLIPVDSDGT-FCFAFAPTS--SALSIIGNVQQQG 460

Query: 463 QRVIYDTKNSQLGFAGEDC 481
            RV +D  NS +GF    C
Sbjct: 461 TRVGFDLANSLVGFEPRQC 479


>gi|79407941|ref|NP_188636.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|75273243|sp|Q9LHE3.1|ASPG2_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 2;
           Short=AtASPG2; Flags: Precursor
 gi|11994777|dbj|BAB03167.1| nucleoid chloroplast DNA-binding protein-like [Arabidopsis
           thaliana]
 gi|28392860|gb|AAO41867.1| unknown protein [Arabidopsis thaliana]
 gi|332642798|gb|AEE76319.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 470

 Score =  213 bits (541), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 150/431 (34%), Positives = 220/431 (51%), Gaps = 37/431 (8%)

Query: 68  TLELKHKN-YCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMI---SGNIKDVSNTEIP 123
           TL L H++ + S    + + +   R+  D   V  +  RI   +   S +  +V++    
Sbjct: 60  TLRLLHRDRFPSVTYRNHHHRLHARMRRDTDRVSAILRRISGKVIPSSDSRYEVNDFGSD 119

Query: 124 LTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISP 181
           + SG+   +  Y   I +G   R+  +++D+GSD+ WVQCQPCK CY Q DPVFDP+ S 
Sbjct: 120 IVSGMDQGSGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKLCYKQSDPVFDPAKSG 179

Query: 182 SYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS 241
           SY  V C SS C  +E    NSG C S     C Y V YGDGSYT+G L  E L   K  
Sbjct: 180 SYTGVSCGSSVCDRIE----NSG-CHSGG---CRYEVMYGDGSYTKGTLALETLTFAKTV 231

Query: 242 VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSL 301
           V +   GCG  N+G+F G +GL+G+G   +S V Q S   GG F YCL S +   ++GSL
Sbjct: 232 VRNVAMGCGHRNRGMFIGAAGLLGIGGGSMSFVGQLSGQTGGAFGYCLVS-RGTDSTGSL 290

Query: 302 ILGGNSSVFKNSTPI--TYTNMIPNPQLATFYILNLTG-------ISIGGKQLQASGFAK 352
           + G      + + P+  ++  ++ NP+  +FY + L G       I +       +    
Sbjct: 291 VFG------REALPVGASWVPLVRNPRAPSFYYVGLKGLGVGGVRIPLPDGVFDLTETGD 344

Query: 353 GGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVK 412
           GG+++D+GT +TRLP + Y A +  F  Q +  P A G SI DTC++LS +  V +P V 
Sbjct: 345 GGVVMDTGTAVTRLPTAAYVAFRDGFKSQTANLPRASGVSIFDTCYDLSGFVSVRVPTVS 404

Query: 413 MEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETG--IIGNYQQKNQRVIYDTK 470
             F     +T+     +  V  D+   C A A+      TG  IIGN QQ+  +V +D  
Sbjct: 405 FYFTEGPVLTLPARNFLMPVD-DSGTYCFAFAA----SPTGLSIIGNIQQEGIQVSFDGA 459

Query: 471 NSQLGFAGEDC 481
           N  +GF    C
Sbjct: 460 NGFVGFGPNVC 470


>gi|225423917|ref|XP_002281973.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
          Length = 491

 Score =  213 bits (541), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 153/419 (36%), Positives = 217/419 (51%), Gaps = 40/419 (9%)

Query: 83  DWNEQQQNRLILDNLHVQYLQSRIKNMISG----NIKDVSNTE------IPLTSGIRLQT 132
           D+      RL  D+  V+ L +R+   I+G    ++K V           PL SG    +
Sbjct: 93  DYKSLVLARLERDSDRVRSLATRMDLAIAGITKSDLKPVEKELEAEALETPLVSGASQGS 152

Query: 133 LNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNS 190
             Y + + +G   +++ ++VDTGSD+ WVQC PC  CY Q DP+F+PS S SY  + C +
Sbjct: 153 GEYFSRVGIGSPPKHVYMVVDTGSDVNWVQCAPCADCYQQADPIFEPSFSSSYAPLTCET 212

Query: 191 STCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL-GKASVNDFIFGC 249
             C +L+ +      C + S   C Y VSYGDGSYT G+   E + L G AS+N+   GC
Sbjct: 213 HQCKSLDVSE-----CRNDS---CLYEVSYGDGSYTVGDFATETITLDGSASLNNVAIGC 264

Query: 250 GRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSV 309
           G +N+GLF G +GL+GLG   LS  SQ   I    FSYCL +     AS    L  NS +
Sbjct: 265 GHDNEGLFVGAAGLLGLGGGSLSFPSQ---INASSFSYCLVNRDTDSAS---TLEFNSPI 318

Query: 310 FKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ--ASGF-----AKGGILIDSGTV 362
             +S       ++ N QL TFY L +TGI +GG+ L    S F       GGI++DSGT 
Sbjct: 319 PSHSVT---APLLRNNQLDTFYYLGMTGIGVGGQMLSIPRSSFEVDESGNGGIIVDSGTA 375

Query: 363 ITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMT 422
           +TRL   +Y++L+  F++     PS  G ++ DTC++LS+   V +P V   F     + 
Sbjct: 376 VTRLQSDVYNSLRDSFVRGTQHLPSTSGVALFDTCYDLSSRSSVEVPTVSFHFPDGKYLA 435

Query: 423 VDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
           +     +  V S A   C A A  +      IIGN QQ+  RV YD  NS +GF+   C
Sbjct: 436 LPAKNYLIPVDS-AGTFCFAFAPTT--SALSIIGNVQQQGTRVSYDLSNSLVGFSPNGC 491


>gi|148910443|gb|ABR18297.1| unknown [Picea sitchensis]
          Length = 452

 Score =  212 bits (540), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 159/481 (33%), Positives = 252/481 (52%), Gaps = 51/481 (10%)

Query: 10  ILSLLLPLMVSLFLLAKGAHCFEGKKKLHLHKLQWQQKSGSSSSCVSHQKSRIEMGAITL 69
           ILSL +  M  +  +A G +C    K L+       +K G  S  VS     I       
Sbjct: 14  ILSLAITFMCGVAEIAPGLNCRSSDKILN-------RKVGKRSHSVSFPLIHIYSECSPF 66

Query: 70  ELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIR 129
              ++         W      ++  D   +++L+       S + K  +N  +P+ SG  
Sbjct: 67  RPPNRT--------WESLMSEKIRGDANRLRFLK-----RTSRSSKQDANANVPVRSG-- 111

Query: 130 LQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVL 187
             +  YI  ++ G   ++M  ++DTGSD+ W+ C+ C+ C++   P+FDP+ S SYK   
Sbjct: 112 --SGEYIIQVDFGTPKQSMYTLIDTGSDVAWIPCKQCQGCHSTA-PIFDPAKSSSYKPFA 168

Query: 188 CNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIF 247
           C+S  C  +      SG C  +S   C + VSYGDG+   G L  + + LG   + +F F
Sbjct: 169 CDSQPCQEI------SGNCGGNS--KCQFEVSYGDGTQVDGTLASDAITLGSQYLPNFSF 220

Query: 248 GCGRNNKGLFGGVSGLMGLGRSDLSLVSQ--TSEIFGGLFSYCLPSTQDAGASGSLILGG 305
           GC  +         GLMGLG   LSL++Q  T+E+FGG FSYCLPS+  + +SGSL+LG 
Sbjct: 221 GCAESLSEDTSPSPGLMGLGGGSLSLLTQAPTAELFGGTFSYCLPSS--STSSGSLVLGK 278

Query: 306 NSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASG---FAKGGILIDSGTV 362
            ++V  +S+ + +T +I +P + TFY + L  IS+G  ++   G    + GG +IDSGT 
Sbjct: 279 EAAV--SSSSLKFTTLIKDPSIPTFYFVTLKAISVGNTRISVPGTNIASGGGTIIDSGTT 336

Query: 363 ITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMT 422
           IT L PS Y+AL+  F +Q S     P    +DTC++LS+   V++P + +  + N ++ 
Sbjct: 337 ITHLVPSAYTALRDAFRQQLSSLQPTP-VEDMDTCYDLSS-SSVDVPTITLHLDRNVDLV 394

Query: 423 VDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
           +    I+  +  ++   CLA +S    D   IIGN QQ+N R+++D  NSQ+GFA E C+
Sbjct: 395 LPKENIL--ITQESGLACLAFSS---TDSRSIIGNVQQQNWRIVFDVPNSQVGFAQEQCA 449

Query: 483 S 483
           +
Sbjct: 450 A 450


>gi|297834938|ref|XP_002885351.1| pepsin A [Arabidopsis lyrata subsp. lyrata]
 gi|297331191|gb|EFH61610.1| pepsin A [Arabidopsis lyrata subsp. lyrata]
          Length = 471

 Score =  212 bits (540), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 150/432 (34%), Positives = 220/432 (50%), Gaps = 38/432 (8%)

Query: 68  TLELKHKN-YCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMI----SGNIKDVSNTEI 122
           TL L H++ + S    + + +   R+  D   V  +  RI   +    S +  +V++   
Sbjct: 60  TLRLLHRDRFPSVTYRNHHHRLHARMRRDTDRVSAILRRISGKVVVASSDSRYEVNDFGS 119

Query: 123 PLTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSIS 180
            + SG+   +  Y   I +G   R+  +++D+GSD+ WVQCQPCK CY Q DPVFDP+ S
Sbjct: 120 DVVSGMDQGSGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKLCYKQSDPVFDPAKS 179

Query: 181 PSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA 240
            SY  V C SS C  +E    NSG C S     C Y V YGDGSYT+G L  E L   K 
Sbjct: 180 GSYTGVSCGSSVCDRIE----NSG-CHSGG---CRYEVMYGDGSYTKGTLALETLTFAKT 231

Query: 241 SVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGS 300
            V +   GCG  N+G+F G +GL+G+G   +S V Q S   GG F YCL S +   ++GS
Sbjct: 232 VVRNVAMGCGHRNRGMFIGAAGLLGIGGGSMSFVGQLSGQTGGAFGYCLVS-RGTDSTGS 290

Query: 301 LILGGNSSVFKNSTPI--TYTNMIPNPQLATFYILNLTG-------ISIGGKQLQASGFA 351
           L+ G      + + P+  ++  ++ NP+  +FY + L G       I +       +   
Sbjct: 291 LVFG------REALPVGASWVPLVRNPRAPSFYYVGLKGLGVGGVRIPLPDGVFDLTETG 344

Query: 352 KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLV 411
            GG+++D+GT +TRLP   Y+A +  F  Q +  P A G SI DTC++LS +  V +P V
Sbjct: 345 DGGVVMDTGTAVTRLPTGAYAAFRDGFKSQTANLPRASGVSIFDTCYDLSGFVSVRVPTV 404

Query: 412 KMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETG--IIGNYQQKNQRVIYDT 469
              F     +T+     +  V  D+   C A A+      TG  IIGN QQ+  +V +D 
Sbjct: 405 SFYFTEGPVLTLPARNFLMPVD-DSGTYCFAFAA----SPTGLSIIGNIQQEGIQVSFDG 459

Query: 470 KNSQLGFAGEDC 481
            N  +GF    C
Sbjct: 460 ANGFVGFGPNVC 471


>gi|302756591|ref|XP_002961719.1| hypothetical protein SELMODRAFT_64161 [Selaginella moellendorffii]
 gi|300170378|gb|EFJ36979.1| hypothetical protein SELMODRAFT_64161 [Selaginella moellendorffii]
          Length = 357

 Score =  211 bits (538), Expect = 5e-52,   Method: Compositional matrix adjust.
 Identities = 138/372 (37%), Positives = 199/372 (53%), Gaps = 29/372 (7%)

Query: 124 LTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISP 181
           ++SG+ L +  Y A + +G   R+  + +DTGSD+TW+QC PC SCY+Q DP++DPS S 
Sbjct: 1   ISSGLSLGSGEYFARMGIGNPQRSYYLELDTGSDVTWIQCAPCSSCYSQVDPIYDPSNSS 60

Query: 182 SYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS 241
           SY++V C S+ C AL+++      C       C+Y V YGD S + G+LG E   LG  S
Sbjct: 61  SYRRVYCGSALCQALDYS-----ACQGMG---CSYRVVYGDSSASSGDLGIESFYLGPNS 112

Query: 242 ---VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPS--TQDAG 296
              + +  FGCG +N GLF G +GL+G+G   LS  SQ +   G  FSYCL    +Q   
Sbjct: 113 STAMRNIAFGCGHSNSGLFRGEAGLLGMGGGTLSFFSQIAASIGPAFSYCLVDRYSQLQS 172

Query: 297 ASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGG-------KQLQASG 349
            S  LI G  +  F       +T ++ NP++ TFY   LTGIS+GG        Q   +G
Sbjct: 173 RSSPLIFGRTAIPFA----ARFTPLLKNPRINTFYYAVLTGISVGGTPLPIPPAQFALTG 228

Query: 350 FAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIP 409
              GG ++DSGT +TR+ P  Y+ L+  +       P APG  +LDTCFN      V IP
Sbjct: 229 NGTGGAILDSGTSVTRVVPPAYAVLRDAYRAASRNLPPAPGVYLLDTCFNFQGLPTVQIP 288

Query: 410 LVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDT 469
            + + F+   +M +    I+  V    +  CLA A  S      +IGN QQ+  R+ +D 
Sbjct: 289 SLVLHFDNGVDMVLPGGNILIPVDRSGT-FCLAFAPSSM--PISVIGNVQQQTFRIGFDL 345

Query: 470 KNSQLGFAGEDC 481
           + S +  A  +C
Sbjct: 346 QRSLIAIAPREC 357


>gi|224093400|ref|XP_002309912.1| predicted protein [Populus trichocarpa]
 gi|222852815|gb|EEE90362.1| predicted protein [Populus trichocarpa]
          Length = 382

 Score =  211 bits (538), Expect = 5e-52,   Method: Compositional matrix adjust.
 Identities = 137/396 (34%), Positives = 204/396 (51%), Gaps = 31/396 (7%)

Query: 99  VQYLQSRIKNMISGNIK--DVSNTEIPLTSGIRLQTLNYIATIELGG--RNMTVIVDTGS 154
           V+ + S I  + SG+    +V +    + SG+   +  Y   I LG   R+  +++D+GS
Sbjct: 5   VKRVASLIHRLSSGSAAKYEVEDFGSDVVSGMNQGSGEYFVRIGLGSPPRSQYMVIDSGS 64

Query: 155 DLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDC 214
           D+ WVQC+PC  CY+Q DP+FDP+ S S+  V C+S+ C  +E A  NSG         C
Sbjct: 65  DIVWVQCKPCTQCYHQTDPLFDPADSASFMGVSCSSAVCDRVENAGCNSG--------RC 116

Query: 215 NYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLV 274
            Y VSYGDGSYT+G L  E L  G+  V +   GCG +N+G+F G +GL+GLG   +S +
Sbjct: 117 RYEVSYGDGSYTKGTLALETLTFGRTVVRNVAIGCGHSNRGMFVGAAGLLGLGGGSMSFM 176

Query: 275 SQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPI--TYTNMIPNPQLATFYI 332
            Q S   G  FSYCL S +    +G L  G        + P+   +  ++ NP+  +FY 
Sbjct: 177 GQLSGQTGNAFSYCLVS-RGTNTNGFLEFG------SEAMPVGAAWIPLVRNPRAPSFYY 229

Query: 333 LNLTGISIGGKQL-------QASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGF 385
           + L G+ +G  ++       Q +    GG+++D+GT +TR P   Y A +  F++Q    
Sbjct: 230 IRLLGLGVGDTRVPVSEDVFQLNELGSGGVVMDTGTAVTRFPTVAYEAFRNAFIEQTQNL 289

Query: 386 PSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALAS 445
           P A G SI DTC+NL  +  V +P V   F G   +T+     +  V  DA   C A A 
Sbjct: 290 PRASGVSIFDTCYNLFGFLSVRVPTVSFYFSGGPILTIPANNFLIPVD-DAGTFCFAFA- 347

Query: 446 LSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
                   I+GN QQ+  ++  D  N  +GF    C
Sbjct: 348 -PSPSGLSILGNIQQEGIQISVDEANEFVGFGPNIC 382


>gi|356531224|ref|XP_003534178.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 492

 Score =  211 bits (537), Expect = 6e-52,   Method: Compositional matrix adjust.
 Identities = 146/429 (34%), Positives = 214/429 (49%), Gaps = 57/429 (13%)

Query: 85  NEQQQN-------RLILDNLHVQYLQSRIKNMISG-NIKDVSNTEI----------PLTS 126
           NEQ  N       RL  D   V  L ++++  +S  N  D+  TE           P++S
Sbjct: 89  NEQHPNYKTLVLSRLARDTARVNSLNTKLQLALSSLNRSDLYPTETELLRPEDLSTPVSS 148

Query: 127 GIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYK 184
           G    +  Y + + +G   +   +++DTGSD+ W+QC+PC  CY Q DP+FDP+ S SY 
Sbjct: 149 GTAQGSGEYFSRVGVGQPSKPFYMVLDTGSDVNWLQCKPCSDCYQQSDPIFDPTASSSYN 208

Query: 185 KVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVND 244
            + C++  C  LE +   +G         C Y VSYGDGS+T GE   E +  G  SVN 
Sbjct: 209 PLTCDAQQCQDLEMSACRNG--------KCLYQVSYGDGSFTVGEYVTETVSFGAGSVNR 260

Query: 245 FIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILG 304
              GCG +N+GLF    G  GL       +S TS+I    FSYCL   +D+G S +L   
Sbjct: 261 VAIGCGHDNEGLF---VGSAGLLGLGGGPLSLTSQIKATSFSYCL-VDRDSGKSSTLEFN 316

Query: 305 ----GNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGK--QLQASGFA-----KG 353
               G+S V           ++ N ++ TFY + LTG+S+GG+   +    FA      G
Sbjct: 317 SPRPGDSVV---------APLLKNQKVNTFYYVELTGVSVGGEIVTVPPETFAVDQSGAG 367

Query: 354 GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKM 413
           G+++DSGT ITRL    Y++++  F ++ S    A G ++ DTC++LS+ Q V +P V  
Sbjct: 368 GVIVDSGTAITRLRTQAYNSVRDAFKRKTSNLRPAEGVALFDTCYDLSSLQSVRVPTVSF 427

Query: 414 EFEGNAEMTVDVTGIVYFVKSD-ASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNS 472
            F G+    +      Y +  D A   C A A  +      IIGN QQ+  RV +D  NS
Sbjct: 428 HFSGDRAWALPAKN--YLIPVDGAGTYCFAFAPTT--SSMSIIGNVQQQGTRVSFDLANS 483

Query: 473 QLGFAGEDC 481
            +GF+   C
Sbjct: 484 LVGFSPNKC 492


>gi|356522504|ref|XP_003529886.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 473

 Score =  211 bits (537), Expect = 7e-52,   Method: Compositional matrix adjust.
 Identities = 140/348 (40%), Positives = 195/348 (56%), Gaps = 24/348 (6%)

Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNS 203
           + + +++DTGSD+ W+QC+PC  CY+Q D +FDPS S S+  + C S  C  L+     S
Sbjct: 141 KYLYMVLDTGSDVVWLQCKPCTKCYSQTDQIFDPSKSKSFAGIPCYSPLCRRLD-----S 195

Query: 204 GVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGL 263
             CS  +   C Y VSYGDGS+T G+   E L   +A+V     GCG +N+GLF G +GL
Sbjct: 196 PGCSLKNN-LCQYQVSYGDGSFTFGDFSTETLTFRRAAVPRVAIGCGHDNEGLFVGAAGL 254

Query: 264 MGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIP 323
           +GLGR  LS  +QT   F   FSYCL + + A A  S I+ G+S+V + +    +T ++ 
Sbjct: 255 LGLGRGGLSFPTQTGTRFNNKFSYCL-TDRTASAKPSSIVFGDSAVSRTA---RFTPLVK 310

Query: 324 NPQLATFYILNLTGISIGG---KQLQASGF-----AKGGILIDSGTVITRLPPSIYSALK 375
           NP+L TFY + L GIS+GG   + + AS F       GG++IDSGT +TRL    Y +L+
Sbjct: 311 NPKLDTFYYVELLGISVGGAPVRGISASFFRLDSTGNGGVIIDSGTSVTRLTRPAYVSLR 370

Query: 376 AEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSD 435
             F    S    AP FS+ DTC++LS   EV +P V + F G     V +    Y V  D
Sbjct: 371 DAFRVGASHLKRAPEFSLFDTCYDLSGLSEVKVPTVVLHFRG---ADVSLPAANYLVPVD 427

Query: 436 AS-QVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
            S   C A A         IIGN QQ+  RV++D   S++GFA   C+
Sbjct: 428 NSGSFCFAFAGT--MSGLSIIGNIQQQGFRVVFDLAGSRVGFAPRGCA 473


>gi|148906646|gb|ABR16474.1| unknown [Picea sitchensis]
          Length = 538

 Score =  211 bits (536), Expect = 9e-52,   Method: Compositional matrix adjust.
 Identities = 149/424 (35%), Positives = 214/424 (50%), Gaps = 47/424 (11%)

Query: 84  WNEQQQNRLILDNLHVQYLQSRIKNMI------SGNIKDVSNTEIP----LTSGIRLQTL 133
           +  + +  L  D   V+ L+ RI+  +      +G+ ++V+         + SG+   + 
Sbjct: 136 YERRLEETLRRDARRVRGLEQRIEKRLRLNKDPAGSHENVAEVAAEFGGEVVSGMAQGSG 195

Query: 134 NYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSS 191
            Y   I +G   R   +++DTGSD+ W+QC+PC  CY+Q DP+F+PS+S S+  + CNS+
Sbjct: 196 EYFTRIGVGTPMREQYMVLDTGSDVVWIQCEPCSKCYSQVDPIFNPSLSASFSTLGCNSA 255

Query: 192 TCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGR 251
            C  L+    + G         C Y VSYGDGSYT G    E L  G  SV +   GCG 
Sbjct: 256 VCSYLDAYNCHGG--------GCLYKVSYGDGSYTIGSFATEMLTFGTTSVRNVAIGCGH 307

Query: 252 NNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFK 311
           +N GLF G +GL+GLG   LS  SQ     G  FSYCL   + + +SG+L  G       
Sbjct: 308 DNAGLFVGAAGLLGLGAGLLSFPSQLGTQTGRAFSYCL-VDRFSESSGTLEFG------P 360

Query: 312 NSTPI--TYTNMIPNPQLATFYILNLTGISIGGKQL-----------QASGFAKGGILID 358
            S P+    T ++ NP L TFY + L  IS+GG  L           + SG  +GG ++D
Sbjct: 361 ESVPLGSILTPLLTNPSLPTFYYVPLISISVGGALLDSVPPDVFRIDETSG--RGGFIVD 418

Query: 359 SGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGN 418
           SGT +TRL   +Y A++  F+      P A G SI DTC++LS    VN+P V   F   
Sbjct: 419 SGTAVTRLQTPVYDAVRDAFVAGTRQLPKAEGVSIFDTCYDLSGLPLVNVPTVVFHFSNG 478

Query: 419 AEMTVDVTGIVYFVKSD-ASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFA 477
           A + +      Y +  D     C A A  +   +  I+GN QQ+  RV +DT NS +GFA
Sbjct: 479 ASLILPAKN--YMIPMDFMGTFCFAFAPAT--SDLSIMGNIQQQGIRVSFDTANSLVGFA 534

Query: 478 GEDC 481
              C
Sbjct: 535 LRQC 538


>gi|168000296|ref|XP_001752852.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162696015|gb|EDQ82356.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 384

 Score =  210 bits (535), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 140/368 (38%), Positives = 192/368 (52%), Gaps = 43/368 (11%)

Query: 135 YIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
           Y+ T+ LG   ++  VIVDTGSDL WVQC PC+ CY Q  P FDPS S S++K  C  + 
Sbjct: 39  YLMTLTLGSPPQSFDVIVDTGSDLNWVQCLPCRVCYQQPGPKFDPSKSRSFRKAACTDNL 98

Query: 193 CH--ALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHL----GLGKASVNDFI 246
           C+  AL      + V        C Y  +YGD S T G+L  E +    G G  SV +F 
Sbjct: 99  CNVSALPLKACAANV--------CQYQYTYGDQSNTNGDLAFETISLNNGAGTQSVPNFA 150

Query: 247 FGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGAS----GSLI 302
           FGCG  N G F G +GL+GLG+  LSL SQ S  F   FSYCL S     AS    GS+ 
Sbjct: 151 FGCGTQNLGTFAGAAGLVGLGQGPLSLNSQLSHTFANKFSYCLVSLNSLSASPLTFGSIA 210

Query: 303 LGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGK--QLQASGFA------KGG 354
              N         I YT+++ N +  T+Y + L  I +GG+   L  S FA      +GG
Sbjct: 211 AAAN---------IQYTSIVVNARHPTYYYVQLNSIEVGGQPLNLAPSVFAIDQSTGRGG 261

Query: 355 ILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI-LDTCFNLSAYQEVNIPLVKM 413
            +IDSGT IT L    YSA+   + + F  +P   G +  LD CFN++     ++P +  
Sbjct: 262 TIIDSGTTITMLTLPAYSAVLRAY-ESFVNYPRLDGSAYGLDLCFNIAGVSNPSVPDMVF 320

Query: 414 EFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQ 473
           +F+G A+  +    +   V + A+ +CLA+          IIGN QQ+N  V+YD +  +
Sbjct: 321 KFQG-ADFQMRGENLFVLVDTSATTLCLAMGG---SQGFSIIGNIQQQNHLVVYDLEAKK 376

Query: 474 LGFAGEDC 481
           +GFA  DC
Sbjct: 377 IGFATADC 384


>gi|224111722|ref|XP_002315953.1| predicted protein [Populus trichocarpa]
 gi|222864993|gb|EEF02124.1| predicted protein [Populus trichocarpa]
          Length = 484

 Score =  209 bits (533), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 153/458 (33%), Positives = 231/458 (50%), Gaps = 46/458 (10%)

Query: 48  SGSSSSCVSHQKSRIEMGAITLELKHKNYCSGKI-VDWNEQQQNRLILDNLHVQYLQSR- 105
           SG   S  + Q+       +T+EL  +          +     +RL  D+  V+ L +R 
Sbjct: 49  SGPKMSPFNQQEKETTSSELTVELLSRTSIQKTTHTGYKSLTLSRLQRDSARVKSLVTRL 108

Query: 106 ---IKNMISGNIKDVS--------NTEIPLTSGIRLQTLNYIATIELGG--RNMTVIVDT 152
              I ++ S ++K +         + + P+ SG    +  Y + + +G       +I+DT
Sbjct: 109 DLAINSISSSDLKPLETDSEFKPEDLQSPIISGTSQGSGEYFSRVGIGKPPSQAYLILDT 168

Query: 153 GSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPP 212
           GSD+ WVQC PC  CY Q DP+F+P+ S S+  + CN+  C +L+ +      C + +  
Sbjct: 169 GSDVNWVQCAPCADCYQQADPIFEPASSASFSTLSCNTRQCRSLDVSE-----CRNDT-- 221

Query: 213 DCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLS 272
            C Y VSYGDGSYT G+   E + LG A V++   GCG NN+GLF G +GL+GLG   LS
Sbjct: 222 -CLYEVSYGDGSYTVGDFVTETITLGSAPVDNVAIGCGHNNEGLFVGAAGLLGLGGGSLS 280

Query: 273 LVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYI 332
             SQ   I    FSYCL       AS    L  NS++  N+       ++ N  L TFY 
Sbjct: 281 FPSQ---INATSFSYCLVDRDSESAS---TLEFNSTLPPNA---VSAPLLRNHHLDTFYY 331

Query: 333 LNLTGISIGGK---------QLQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFS 383
           + LTG+S+GG+         Q+  SG   GG+++DSGT ITRL   +Y++L+  F+K+  
Sbjct: 332 VGLTGLSVGGELVSIPESAFQIDESG--NGGVIVDSGTAITRLQTDVYNSLRDAFVKRTR 389

Query: 384 GFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLAL 443
             PS  G ++ DTC++LS+   V +P V   F    E+ +     +  + S+ +  C A 
Sbjct: 390 DLPSTNGIALFDTCYDLSSKGNVEVPTVSFHFPDGKELPLPAKNYLVPLDSEGT-FCFAF 448

Query: 444 ASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
           A  +      IIGN QQ+  RV+YD  N  +GF    C
Sbjct: 449 APTA--SSLSIIGNVQQQGTRVVYDLVNHLVGFVPNKC 484


>gi|225438315|ref|XP_002272802.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 436

 Score =  209 bits (532), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 152/435 (34%), Positives = 222/435 (51%), Gaps = 45/435 (10%)

Query: 61  RIEMGAITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNT 120
           R E     + L+H +  SG      E+ Q  +    L +Q L ++  +         S+ 
Sbjct: 36  RPEKTWFRVSLRHVD--SGGNYTKFERLQRAMKRGKLRLQRLSAKTASF-------ESSV 86

Query: 121 EIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPS 178
           E P+ +G       ++  + +G      + I+DTGSDL W QC+PCK C++Q  P+FDP 
Sbjct: 87  EAPVHAG----NGEFLMKLAIGTPAETYSAIMDTGSDLIWTQCKPCKDCFDQPTPIFDPK 142

Query: 179 ISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG 238
            S S+ K+ C+S  C AL  ++ + G         C Y  SYGD S T+G L  E    G
Sbjct: 143 KSSSFSKLPCSSDLCAALPISSCSDG---------CEYLYSYGDYSSTQGVLATETFAFG 193

Query: 239 KASVNDFIFGCGRNNKGL-FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGA 297
            ASV+   FGCG +N G  F   +GL+GLGR  LSL+SQ  E     FSYCL S  D+  
Sbjct: 194 DASVSKIGFGCGEDNDGSGFSQGAGLVGLGRGPLSLISQLGE---PKFSYCLTSMDDSKG 250

Query: 298 SGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL--QASGFA---- 351
             SL++G  +++ KN+  IT T +I NP   +FY L+L GIS+G   L  + S F+    
Sbjct: 251 ISSLLVGSEATM-KNA--IT-TPLIQNPSQPSFYYLSLEGISVGDTLLPIEKSTFSIQND 306

Query: 352 -KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSA-YQEVNIP 409
             GG++IDSGT IT L  S ++ALK EF+ Q        G + LD CF L      V++P
Sbjct: 307 GSGGLIIDSGTTITYLEDSAFAALKKEFISQLKLDVDESGSTGLDLCFTLPPDASTVDVP 366

Query: 410 LVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDT 469
            +   FEG A++ +     +    S    +CL + S S      I GN+QQ+N  V++D 
Sbjct: 367 QLVFHFEG-ADLKLPAENYI-IADSGLGVICLTMGSSS---GMSIFGNFQQQNIVVLHDL 421

Query: 470 KNSQLGFAGEDCSSM 484
           +   + FA   C+ +
Sbjct: 422 EKETISFAPAQCNQL 436


>gi|449443786|ref|XP_004139658.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
 gi|449475416|ref|XP_004154449.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 453

 Score =  209 bits (531), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 151/430 (35%), Positives = 217/430 (50%), Gaps = 44/430 (10%)

Query: 67  ITLELKHKNYCSGKIVDWNEQQQN----RLILDNLHVQYLQSRIKNMISGNIKDVSNTEI 122
           +TL+L H +  S      N+   +    RL  D L V  L SR     S           
Sbjct: 54  LTLDLHHLDSLS-----LNKTPTDLFNLRLHRDTLRVHALNSRAAGFSSS---------- 98

Query: 123 PLTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSIS 180
            + SG+   +  Y   + +G   R + +++DTGSD+ W+QC PC+ CY+Q DP+F+P  S
Sbjct: 99  -VVSGLSQGSGEYFTRLGVGTPPRYLYMVLDTGSDVVWLQCSPCRKCYSQSDPIFNPYKS 157

Query: 181 PSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA 240
            S+  + C+S  C  L+     S  CS+     C Y VSYGDGS+T G+   E L     
Sbjct: 158 KSFAGIPCSSPLCRRLD-----SSGCSTRRH-TCLYQVSYGDGSFTTGDFATETLTFRGN 211

Query: 241 SVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGS 300
            +     GCG +N+GLF G +GL+GLGR  LS  SQT   F   FSYCL     +    S
Sbjct: 212 KIAKVALGCGHHNEGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSKPSS 271

Query: 301 LILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQA--------SGFAK 352
           ++  G++++   S    +T +I NP+L TFY + L GIS+GG +++              
Sbjct: 272 MVF-GDAAI---SRLARFTPLIRNPKLDTFYYVGLIGISVGGVRVRGVSPSLFKLDSAGN 327

Query: 353 GGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVK 412
           GG++IDSGT +TRL    Y+AL+  F          P FS+ DTC++LS    V +P V 
Sbjct: 328 GGVIIDSGTSVTRLTRPAYTALRDAFRVGARHLKRGPEFSLFDTCYDLSGQSSVKVPTVV 387

Query: 413 MEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNS 472
           + F G A+M +  T  +  V  + S  C A A         IIGN QQ+  RV+YD   S
Sbjct: 388 LHFRG-ADMALPATNYLIPVDENGS-FCFAFAGT--ISGLSIIGNIQQQGFRVVYDLAGS 443

Query: 473 QLGFAGEDCS 482
           ++GFA   C+
Sbjct: 444 RIGFAPRGCT 453


>gi|413953788|gb|AFW86437.1| hypothetical protein ZEAMMB73_618532 [Zea mays]
          Length = 469

 Score =  209 bits (531), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 149/458 (32%), Positives = 225/458 (49%), Gaps = 27/458 (5%)

Query: 34  KKKLHLHKLQWQQKSGSSSSCVSHQKSRIEMGAITLELKHKNYCSGKIVDWNEQQQNRLI 93
           ++K  +    + Q S   +SC S  +       +++ L H+N     +    E  +  ++
Sbjct: 29  ERKFTVVPTAFLQSSSEEASC-STPRGTPHANRVSVPLAHRNGPCSPVRGKGELPRAEML 87

Query: 94  L-DNLHVQYLQSRIKNMISGNIKDVSNT-EIPLTSGIRLQTLNYIATIELGGRNM--TVI 149
             D    +Y+  R        ++D ++   +P   G    +  Y+AT+ LG   +  T+I
Sbjct: 88  RRDRERTEYIIRRASRSRR--LQDNNDAVSVPTQLGSSYDSQEYVATVGLGTPAVPQTLI 145

Query: 150 VDTGSDLTWVQCQPCKS--CYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCS 207
           +DTGS LTWVQC+PC S  CY Q+ P+FDP+ S SY  V C+S  C AL       G C+
Sbjct: 146 LDTGSSLTWVQCKPCNSSQCYPQRLPLFDPNTSSSYSPVPCDSQECRALAAGIDGDG-CT 204

Query: 208 SSSPPDCNYFVSYGDGSYTRGELGREHLGLGK-ASVNDFIFGCGRNN-KGLFGGVSGLMG 265
           S     C Y + YG G+   GE   + L LG  A V  F FGCG +  +G F    G++G
Sbjct: 205 SDGDWGCAYEIHYGSGATPAGEYSTDALTLGPGAIVKRFHFGCGHHQQRGKFDMADGVLG 264

Query: 266 LGRSDLSLVSQTS-EIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPN 324
           LGR   SL  Q S    GG+FS+CLP T    ++G L LG       +++   +T ++  
Sbjct: 265 LGRLPQSLAWQASARRGGGVFSHCLPPT--GVSTGFLALGAP----HDTSAFVFTPLLTM 318

Query: 325 PQLATFYILNLTGISIGGKQLQ-ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFS 383
                FY L  T IS+ G+ L       + G++ DSGTV++ L  + Y+AL+  F    +
Sbjct: 319 DDQPWFYQLMPTAISVAGQLLDIPPAVFREGVITDSGTVLSALQETAYTALRTAFRSAMA 378

Query: 384 GFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLAL 443
            +P AP    LDTCFN + Y  V +P V + F G A + +D +  V          CLA 
Sbjct: 379 EYPLAPPVGHLDTCFNFTGYDNVTVPTVSLTFRGGATVHLDASSGVLM------DGCLAF 432

Query: 444 ASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
            S S ++ TG+IG+  Q+   V+YD    ++GF    C
Sbjct: 433 WS-SGDEYTGLIGSVSQRTIEVLYDMPGRKVGFRTGAC 469


>gi|242053503|ref|XP_002455897.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
 gi|241927872|gb|EES01017.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
          Length = 493

 Score =  208 bits (529), Expect = 6e-51,   Method: Compositional matrix adjust.
 Identities = 138/375 (36%), Positives = 200/375 (53%), Gaps = 25/375 (6%)

Query: 123 PLTSGIRLQTLNYIATIELGGRNMT--VIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSIS 180
           P+ SG+   +  Y   I +G  +    +++DTGSD+ W+QC PC+ CY+Q  PVFDP  S
Sbjct: 128 PVVSGLAQGSGEYFTKIGVGTPSTPALMVLDTGSDVVWLQCAPCRRCYDQSGPVFDPRRS 187

Query: 181 PSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL-GK 239
            SY  V C +  C  L+     SG C       C Y V+YGDGS T G+   E L   G 
Sbjct: 188 SSYGAVDCAAPLCRRLD-----SGGCDLRRRA-CLYQVAYGDGSVTAGDFATETLTFAGG 241

Query: 240 ASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASG 299
           A V     GCG +N+GLF   +GL+GLGR  LS  +Q S  +G  FSYCL     + +SG
Sbjct: 242 ARVARVALGCGHDNEGLFVAAAGLLGLGRGSLSFPTQISRRYGKSFSYCLVDRTSSSSSG 301

Query: 300 SLILGGNSSVF---KNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQA--------- 347
           +     +S+V     +++  ++T M+ NP++ TFY + L GIS+GG ++           
Sbjct: 302 AASRSRSSTVTFGPPSASAASFTPMVRNPRMETFYYVQLVGISVGGARVPGVAESDLRLD 361

Query: 348 SGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAP-GFSILDTCFNLSAYQEV 406
               +GG+++DSGT +TRL    YSAL+  F    +G   +P GFS+ DTC++L   + V
Sbjct: 362 PSTGRGGVIVDSGTSVTRLARPSYSALRDAFRAAAAGLRLSPGGFSLFDTCYDLGGRKVV 421

Query: 407 NIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVI 466
            +P V M F G AE  +     +  V S  +  C A A    +    IIGN QQ+  RV+
Sbjct: 422 KVPTVSMHFAGGAEAALPPENYLIPVDSRGT-FCFAFAGT--DGGVSIIGNIQQQGFRVV 478

Query: 467 YDTKNSQLGFAGEDC 481
           +D    ++GFA + C
Sbjct: 479 FDGDGQRVGFAPKGC 493


>gi|255541796|ref|XP_002511962.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223549142|gb|EEF50631.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 495

 Score =  207 bits (528), Expect = 7e-51,   Method: Compositional matrix adjust.
 Identities = 143/426 (33%), Positives = 226/426 (53%), Gaps = 54/426 (12%)

Query: 83  DWNEQQQNRLILDNLHVQYLQSRIKNMISG----NIKDVSNTEI-------PLTSGIRLQ 131
           D+     +RL  D+  VQ + +R++ +++G    ++K +  TEI       P++SG    
Sbjct: 97  DYKALVLSRLHRDSSRVQAITTRLQLILNGVSKSDLKPL-QTEIQPQDLSTPVSSGTSQG 155

Query: 132 TLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCN 189
           +  Y   + +G   ++  +++DTGSD+ W+QCQPC  CY Q DP+F P+ S SY  + C+
Sbjct: 156 SGEYFTRVGVGNPAKSYYMVLDTGSDINWIQCQPCSDCYQQSDPIFTPAASSSYSPLTCD 215

Query: 190 SSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL-GKASVNDFIFG 248
           S  C++L+ ++  +G         C Y V+YGDGS+T G+   E +   G  +VN    G
Sbjct: 216 SQQCNSLQMSSCRNG--------QCRYQVNYGDGSFTFGDFVTETMSFGGSGTVNSIALG 267

Query: 249 CGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSS 308
           CG +N+GLF G +GL+GLG   LSL   TS++    FSYCL   +D+ AS +L       
Sbjct: 268 CGHDNEGLFVGAAGLLGLGGGPLSL---TSQLKATSFSYCL-VNRDSAASSTLDF----- 318

Query: 309 VFKNSTPITYTNMIP---NPQLATFYILNLTGISIGGK---------QLQASGFAKGGIL 356
              NS P+  + + P   + ++ TFY + L+G+S+GG+         +L  SG   GG++
Sbjct: 319 ---NSAPVGDSVIAPLLKSSKIDTFYYVGLSGMSVGGELLRIPQEVFKLDDSG--DGGVI 373

Query: 357 IDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFE 416
           +D GT ITRL    Y++L+  F+       S  G ++ DTC++LS    V +P V   F+
Sbjct: 374 VDCGTAITRLQSEAYNSLRDSFVSMSRHLRSTSGVALFDTCYDLSGQSSVKVPTVSFHFD 433

Query: 417 GNAEMTVDVTGIVYFVKSD-ASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLG 475
           G      D+    Y +  D A   C A A  +      IIGN QQ+  RV +D  N+++G
Sbjct: 434 GGKSW--DLPAANYLIPVDSAGTYCFAFAPTT--SSLSIIGNVQQQGTRVSFDLANNRVG 489

Query: 476 FAGEDC 481
           F+   C
Sbjct: 490 FSTNKC 495


>gi|359476204|ref|XP_002262813.2| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-2-like [Vitis vinifera]
          Length = 460

 Score =  207 bits (528), Expect = 8e-51,   Method: Compositional matrix adjust.
 Identities = 152/419 (36%), Positives = 225/419 (53%), Gaps = 47/419 (11%)

Query: 77  CSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYI 136
           CSG         Q     D   V ++ S+     SGN+K+ ++      + +  +  N++
Sbjct: 75  CSGSGHSQPPSPQEIFGRDESRVSFINSKCNQYTSGNLKNHAHN-----NNLFDEDGNFL 129

Query: 137 ATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCH 194
             +  G     + +I+DTGS +TW QC+ C +C    +  FD S S +Y    C  ST  
Sbjct: 130 VDVAFGTPXTEIXLILDTGSSITWTQCKACVNCLQDSNRYFDSSASSTYSFGSCIPSTVE 189

Query: 195 ALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASV-NDFIFGCGRNN 253
                               NY ++YGD S + G  G + + L  + V   F FGCGRNN
Sbjct: 190 N-------------------NYNMTYGDDSTSVGNYGCDTMTLEPSDVFQKFQFGCGRNN 230

Query: 254 KGLFG-GVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKN 312
           KG FG GV G++GLG+  LS VSQT+  F  +FSYCLP   +  + GSL+ G  ++    
Sbjct: 231 KGDFGSGVDGMLGLGQGQLSTVSQTASKFNKVFSYCLP---EEDSIGSLLFGEKAT--SQ 285

Query: 313 STPITYTNMIPNP---QLATFYILNLTGISIGGKQLQ--ASGFAKGGILIDSGTVITRLP 367
           S+ + +T+++  P   Q + +Y +NL+ IS+G ++L   +S FA  G +IDS TVITRLP
Sbjct: 286 SSSLKFTSLVNGPGTLQESGYYFVNLSDISVGNERLNIPSSVFASPGTIIDSRTVITRLP 345

Query: 368 PSIYSALKAEFLKQFSGFPSAPGF----SILDTCFNLSAYQEVNIPLVKMEFEGNAEMTV 423
              YSALKA F K  + +P + G      ILDTC+NLS  ++V +P + + F G A++ +
Sbjct: 346 QRAYSALKAAFKKAMAKYPLSNGRRKKGDILDTCYNLSGRKDVLLPEIVLHFGGGADVRL 405

Query: 424 DVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
           + T IV+   SDAS++CLA A  S   E  IIGN QQ +  V+YD +  ++GF G  CS
Sbjct: 406 NGTNIVW--GSDASRLCLAFAGTS---ELTIIGNRQQLSLTVLYDIQGRRIGFGGNGCS 459


>gi|168025534|ref|XP_001765289.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162683608|gb|EDQ70017.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 372

 Score =  207 bits (527), Expect = 9e-51,   Method: Compositional matrix adjust.
 Identities = 130/380 (34%), Positives = 200/380 (52%), Gaps = 28/380 (7%)

Query: 116 DVSNTEIPLTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDP 173
           D  + E P ++G       ++  I LG   +   VI+DTGSDLTW+Q +PC++C+ Q DP
Sbjct: 10  DNESYEFPESAGYG----EFLVPIYLGTPPQKAVVIIDTGSDLTWIQSEPCRACFEQADP 65

Query: 174 VFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGRE 233
           +FDPS S +Y K+ C+SS C  L       G  + S+  +C Y   YGDGS TRG   +E
Sbjct: 66  IFDPSKSSTYNKIACSSSACADL------LGTQTCSAAANCIYAYGYGDGSVTRGYFSKE 119

Query: 234 HLGLGKASVNDFIFGCGRNNKGLFG--GVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPS 291
            +     +  +  FG    N G FG  G  G++GLG+  +S+ SQ   + G  FSYCL  
Sbjct: 120 TITATDTAGEEVKFGASVYNTGTFGDTGGEGILGLGQGPVSMPSQLGSVLGNKFSYCLVD 179

Query: 292 TQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL------ 345
              AG+  S +  G+++V   S  + YT ++PN    T+Y + + GIS+GG  L      
Sbjct: 180 WLSAGSETSTMYFGDAAV--PSGEVQYTPIVPNADHPTYYYIAVQGISVGGSLLDIDQSV 237

Query: 346 -QASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQ 404
            +      GG +IDSGT IT L   +++AL A +  Q   +P+    + LD CFN     
Sbjct: 238 YEIDSGGSGGTIIDSGTTITYLQQEVFNALVAAYTSQVR-YPTTTSATGLDLCFNTRGTG 296

Query: 405 EVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQR 464
               P + +  +G   + +++     F+  + + +CLA AS + +    I GN QQ+N  
Sbjct: 297 SPVFPAMTIHLDG---VHLELPTANTFISLETNIICLAFAS-ALDFPIAIFGNIQQQNFD 352

Query: 465 VIYDTKNSQLGFAGEDCSSM 484
           ++YD  N ++GFA  DC+S+
Sbjct: 353 IVYDLDNMRIGFAPADCASL 372


>gi|414881704|tpg|DAA58835.1| TPA: hypothetical protein ZEAMMB73_701358 [Zea mays]
          Length = 485

 Score =  207 bits (527), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 148/418 (35%), Positives = 207/418 (49%), Gaps = 35/418 (8%)

Query: 85  NEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG-- 142
            E  ++RL  D      +            K V+    P+ SG+   +  Y   I +G  
Sbjct: 82  GELLKHRLQRDKRRAARISEAAGAGGGNGRKGVA---APVVSGLAQGSGEYFTKIGVGTP 138

Query: 143 GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGN 202
                +++DTGSD+ WVQC PC+ CY Q  PVFDP  S SY  V C ++ C  L+     
Sbjct: 139 ATQALMVLDTGSDVVWVQCAPCRRCYEQSGPVFDPRRSSSYGAVGCGAALCRRLD----- 193

Query: 203 SGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL-GKASVNDFIFGCGRNNKGLFGGVS 261
           SG C       C Y V+YGDGS T G+   E L   G A V     GCG +N+GLF   +
Sbjct: 194 SGGCDLRRGA-CMYQVAYGDGSVTAGDFVTETLTFAGGARVARVALGCGHDNEGLFVAAA 252

Query: 262 GLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASG-------SLILGGNSSVFKNST 314
           GL+GLGR  LS  +Q S  +G  FSYCL     +GA         S +  G  SV  +S 
Sbjct: 253 GLLGLGRGGLSFPTQISRRYGRSFSYCLVDRTSSGAGAAPGSHRSSTVSFGAGSVGASSA 312

Query: 315 PITYTNMIPNPQLATFYILNLTGISIGGKQLQASG---------FAKGGILIDSGTVITR 365
             ++T M+ NP++ TFY + L GIS+GG ++               +GG+++DSGT +TR
Sbjct: 313 --SFTPMVRNPRMETFYYVQLVGISVGGARVPGVAESDLRLDPSTGRGGVIVDSGTSVTR 370

Query: 366 LPPSIYSALKAEFLKQFSG--FPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTV 423
           L  + YSAL+  F    +G    S  GFS+ DTC++L   + V +P V M F G AE  +
Sbjct: 371 LARASYSALRDAFRAAAAGGLRLSPGGFSLFDTCYDLGGRRVVKVPTVSMHFAGGAEAAL 430

Query: 424 DVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
                +  V S  +  C A A    +    IIGN QQ+  RV++D    ++GFA + C
Sbjct: 431 PPENYLIPVDSRGT-FCFAFAGT--DGGVSIIGNIQQQGFRVVFDGDGQRVGFAPKGC 485


>gi|225437854|ref|XP_002264056.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
          Length = 436

 Score =  206 bits (524), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 148/436 (33%), Positives = 224/436 (51%), Gaps = 47/436 (10%)

Query: 61  RIEMGAITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNT 120
           R E     + L+H +  SG      E+ Q  +    L +Q L ++  +          + 
Sbjct: 36  RPEKNGFRVSLRHVD--SGGNYTKFERLQRAVKRGRLRLQRLSAKTASF-------EPSV 86

Query: 121 EIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPS 178
           E P+ +G       ++  + +G      + I+DTGSDL W QC+PCK C++Q  P+FDP 
Sbjct: 87  EAPVHAG----NGEFLMNLAIGTPAETYSAIMDTGSDLIWTQCKPCKVCFDQPTPIFDPE 142

Query: 179 ISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG 238
            S S+ K+ C+S  C AL  ++ + G         C Y  SYGD S T+G L  E    G
Sbjct: 143 KSSSFSKLPCSSDLCVALPISSCSDG---------CEYRYSYGDHSSTQGVLATETFTFG 193

Query: 239 KASVNDFIFGCGRNNKG-LFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGA 297
            ASV+   FGCG +N+G  +   +GL+GLGR  LSL+SQ   +    FSYCL S  D+  
Sbjct: 194 DASVSKIGFGCGEDNRGRAYSQGAGLVGLGRGPLSLISQ---LGVPKFSYCLTSIDDSKG 250

Query: 298 SGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL--QASGFA---- 351
             +L++G  ++V K++ P   T +I NP   +FY L+L GIS+G   L  + S F+    
Sbjct: 251 ISTLLVGSEATV-KSAIP---TPLIQNPSRPSFYYLSLEGISVGDTLLPIEKSTFSIQDD 306

Query: 352 -KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAY-QEVNIP 409
             GG++IDSGT IT L  S ++ALK EF+ Q      A G + L+ CF L      V++P
Sbjct: 307 GSGGLIIDSGTTITYLKDSAFAALKKEFISQMKLDVDASGSTELELCFTLPPDGSPVDVP 366

Query: 410 LVKMEFEGNAEMTVDVTGIVYFVKSDASQV-CLALASLSYEDETGIIGNYQQKNQRVIYD 468
            +   FEG   + + +    Y ++  A +V CL + S S      I GN+QQ+N  V++D
Sbjct: 367 QLVFHFEG---VDLKLPKENYIIEDSALRVICLTMGSSS---GMSIFGNFQQQNIVVLHD 420

Query: 469 TKNSQLGFAGEDCSSM 484
            +   + FA   C+ +
Sbjct: 421 LEKETISFAPAQCNQL 436


>gi|168013126|ref|XP_001759252.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162689565|gb|EDQ75936.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 385

 Score =  206 bits (524), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 139/377 (36%), Positives = 203/377 (53%), Gaps = 31/377 (8%)

Query: 121 EIPLTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPS 178
           + P+ SG+ L +  Y   + +G   R M +++DTGSD+ W+QC PC SCY+Q D VFDP 
Sbjct: 23  QAPVISGLSLGSGEYFIRVSVGTPPRGMYLVMDTGSDILWLQCAPCVSCYHQCDEVFDPY 82

Query: 179 ISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL- 237
            S +Y  + CNS  C  L+      G C  +    C Y V YGDGS++ GE   + + L 
Sbjct: 83  KSSTYSTLGCNSRQCLNLDV-----GGCVGNK---CLYQVDYGDGSFSTGEFATDAVSLN 134

Query: 238 -----GKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPST 292
                G+  +N    GCG +N+G F G +GL+GLG+  LS  +Q +   GG FSYCL   
Sbjct: 135 STSGGGQVVLNKIPLGCGHDNEGYFVGAAGLLGLGKGPLSFPNQINSENGGRFSYCLTGR 194

Query: 293 QDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL------- 345
                  S ++ G+++V      + +T    N +++TFY L +TGIS+GG  L       
Sbjct: 195 DTDSTERSSLIFGDAAV--PPAGVRFTPQASNLRVSTFYYLKMTGISVGGSILTIPTSAF 252

Query: 346 QASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQE 405
           Q      GG++IDSGT +TRL  + Y++L+  F    S       FS+ DTC+NLS    
Sbjct: 253 QLDSLGNGGVIIDSGTSVTRLQNAAYASLREAFRAGTSDLVLTTEFSLFDTCYNLSDLSS 312

Query: 406 VNIPLVKMEFEGNAEMTVDVTGIVYFVKSD-ASQVCLALASLSYEDETGIIGNYQQKNQR 464
           V++P V + F+G A++ +  +   Y V  D +S  CLA A  +      IIGN QQ+  R
Sbjct: 313 VDVPTVTLHFQGGADLKLPASN--YLVPVDNSSTFCLAFAGTT---GPSIIGNIQQQGFR 367

Query: 465 VIYDTKNSQLGFAGEDC 481
           VIYD  ++Q+GF    C
Sbjct: 368 VIYDNLHNQVGFVPSQC 384


>gi|449434646|ref|XP_004135107.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 486

 Score =  206 bits (523), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 146/425 (34%), Positives = 216/425 (50%), Gaps = 47/425 (11%)

Query: 83  DWNEQQQNRLILDNLHVQYLQSRIKNMISG----NIKDV------------SNTEIPLTS 126
           D+     +RL  D+  V+ L +RI   I G    +++ +             + E P+ S
Sbjct: 83  DYKSLTLSRLKRDSARVRSLTARIDLAIRGITGTDLEPLGNGGGGGSQFGTEDFESPIVS 142

Query: 127 GIRLQTLNYIATIELGGRNMTV--IVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYK 184
           G    +  Y + + +G     V  ++DTGSD++WVQC PC  CY Q DP+F+P+ S S+ 
Sbjct: 143 GASQGSGEYFSRVGIGRPPSPVYMVLDTGSDVSWVQCAPCAECYEQTDPIFEPTSSASFT 202

Query: 185 KVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVND 244
            + C +  C +L+ +   +G C         Y VSYGDGSYT G+   E + LG  S+ +
Sbjct: 203 SLSCETEQCKSLDVSECRNGTCL--------YEVSYGDGSYTVGDFVTETVTLGSTSLGN 254

Query: 245 FIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILG 304
              GCG NN+GLF G +GL+GLG   LS  SQ   +    FSYCL    D  +  +  L 
Sbjct: 255 IAIGCGHNNEGLFIGAAGLLGLGGGSLSFPSQ---LNASSFSYCL---VDRDSDSTSTLD 308

Query: 305 GNSSVFKNSTPITYTNMI-PNPQLATFYILNLTGISIGGKQL-------QASGFAKGGIL 356
            NS +    TP   T  +  NP L TF+ L LTG+S+GG  L       Q S    GGI+
Sbjct: 309 FNSPI----TPDAVTAPLHRNPNLDTFFYLGLTGMSVGGAVLPIPETSFQMSEDGNGGII 364

Query: 357 IDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFE 416
           +DSGT +TRL  ++Y+ L+  F+K      +A G ++ DTC++LS+   V +P V   F 
Sbjct: 365 VDSGTAVTRLQTTVYNVLRDAFVKSTHDLQTARGVALFDTCYDLSSKSRVEVPTVSFHFA 424

Query: 417 GNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGF 476
              E+ +     +  V S+ +  C A A    +    I+GN QQ+  RV +D  NS +GF
Sbjct: 425 NGNELPLPAKNYLIPVDSEGT-FCFAFAPT--DSTLSILGNAQQQGTRVGFDLANSLVGF 481

Query: 477 AGEDC 481
           +   C
Sbjct: 482 SPNKC 486


>gi|115466068|ref|NP_001056633.1| Os06g0119600 [Oryza sativa Japonica Group]
 gi|55296446|dbj|BAD68569.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|55296924|dbj|BAD68375.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|113594673|dbj|BAF18547.1| Os06g0119600 [Oryza sativa Japonica Group]
 gi|215694767|dbj|BAG89958.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215737752|dbj|BAG96882.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 495

 Score =  205 bits (522), Expect = 4e-50,   Method: Compositional matrix adjust.
 Identities = 125/347 (36%), Positives = 185/347 (53%), Gaps = 32/347 (9%)

Query: 147 TVIVDTGSDLTWVQCQPCK--SCYNQQDPVFDPSISPSYKKVLCNSSTCHAL-EFATGNS 203
           TVI+D+GSD++WVQC+PC    C+ Q+DP+FDP++S +Y  V C S+ C  L  +  G  
Sbjct: 169 TVIIDSGSDVSWVQCKPCPLPMCHRQRDPLFDPAMSTTYAAVPCTSAACAQLGPYRRG-- 226

Query: 204 GVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASV-NDFIFGCGRNNKG--LFGGV 260
             CS+++   C + ++YGDGS   G    + L LG   V   F FGC   ++G      V
Sbjct: 227 --CSANA--QCQFGINYGDGSTATGTYSFDDLTLGPYDVIRGFRFGCAHADRGSAFDYDV 282

Query: 261 SGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFK-----NSTP 315
           +G + LG    SLV QT+  +G +FSYCLP T  A + G L+LG      +      STP
Sbjct: 283 AGSLALGGGSQSLVQQTATRYGRVFSYCLPPT--ASSLGFLVLGVPPERAQLIPSFVSTP 340

Query: 316 ITYTNMIPNPQLATFYILNLTGISIGGKQLQAS-GFAKGGILIDSGTVITRLPPSIYSAL 374
           +  ++M P     TFY + L  I + G+ L           +IDS T+I+RLPP+ Y AL
Sbjct: 341 LLSSSMAP-----TFYRVLLRAIIVAGRPLAVPPAVFSASSVIDSSTIISRLPPTAYQAL 395

Query: 375 KAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKS 434
           +A F    + + +AP  SILDTC++ +  + + +P + + F+G A + +D  GI+     
Sbjct: 396 RAAFRSAMTMYRAAPPVSILDTCYDFTGVRSITLPSIALVFDGGATVNLDAAGILL---- 451

Query: 435 DASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
                CLA A  + +   G IGN QQK   V+YD     + F    C
Sbjct: 452 ---GSCLAFAPTASDRMPGFIGNVQQKTLEVVYDVPAKAMRFRTAAC 495


>gi|118484458|gb|ABK94105.1| unknown [Populus trichocarpa]
          Length = 499

 Score =  205 bits (521), Expect = 5e-50,   Method: Compositional matrix adjust.
 Identities = 144/435 (33%), Positives = 218/435 (50%), Gaps = 53/435 (12%)

Query: 70  ELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIK----NMISGNIKDVSNTEI--- 122
           ++ HK+Y S  +        +RL  D +    L +R++    ++   ++K +  TEI   
Sbjct: 94  KIHHKDYKSLVL--------SRLHRDTVRFNSLTARLQLALEDISKSDLKPL-ETEIKPE 144

Query: 123 ----PLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFD 176
               P+TSG    +  Y   + +G   R   +++DTGSD+ W+QCQPC  CY Q DP+FD
Sbjct: 145 DLSTPVTSGTSQGSGEYFTRVGVGNPARQFYMVLDTGSDINWLQCQPCTDCYQQTDPIFD 204

Query: 177 PSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLG 236
           P+ S +Y  V C S  C +LE ++  SG         C Y V+YGDGSYT G+   E + 
Sbjct: 205 PTASSTYAPVTCQSQQCSSLEMSSCRSG--------QCLYQVNYGDGSYTFGDFATESVS 256

Query: 237 LGKA-SVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDA 295
            G + SV +   GCG +N+GLF G +GL+GLG   LSL   T+++    FSYCL +   A
Sbjct: 257 FGNSGSVKNVALGCGHDNEGLFVGAAGLLGLGGGPLSL---TNQLKATSFSYCLVNRDSA 313

Query: 296 GASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGK---------QLQ 346
           G+S          V   + P     ++ N ++ TFY + L+G+S+GG+         +L 
Sbjct: 314 GSSTLDFNSAQLGVDSVTAP-----LMKNRKIDTFYYVGLSGMSVGGQMVSIPESTFRLD 368

Query: 347 ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEV 406
            SG   GGI++D GT ITRL    Y+ L+  F++           ++ DTC++LS    V
Sbjct: 369 ESG--NGGIIVDCGTAITRLQTQAYNPLRDAFVRMTQNLKLTSAVALFDTCYDLSGQASV 426

Query: 407 NIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVI 466
            +P V   F       +     +  V S A   C A A  +      IIGN QQ+  RV 
Sbjct: 427 RVPTVSFHFADGKSWNLPAANYLIPVDS-AGTYCFAFAPTT--SSLSIIGNVQQQGTRVT 483

Query: 467 YDTKNSQLGFAGEDC 481
           +D  N+++GF+   C
Sbjct: 484 FDLANNRMGFSPNKC 498


>gi|357143680|ref|XP_003573011.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 510

 Score =  204 bits (520), Expect = 6e-50,   Method: Compositional matrix adjust.
 Identities = 133/378 (35%), Positives = 194/378 (51%), Gaps = 23/378 (6%)

Query: 124 LTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISP 181
           + SG+ + +  Y+  + +G   R   +I+DTGSDL W+QC PC  C++Q+ PVFDP  S 
Sbjct: 139 VESGVAVGSGEYLVEVYVGTPPRRFQMIMDTGSDLNWLQCAPCLDCFDQRGPVFDPMAST 198

Query: 182 SYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS 241
           SY+ V C  + C  L         C SS    C Y+  YGD S T G+L  E   +   +
Sbjct: 199 SYRNVTCGDTRC-GLVSPPAAPRTCRSSRSDPCPYYYWYGDQSNTTGDLALEAFTVNLTA 257

Query: 242 -----VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAG 296
                V+  + GCG  N+GLF G +GL+GLGR  LS  SQ   ++G  FSYCL       
Sbjct: 258 SSSRRVDGVVLGCGHRNRGLFHGAAGLLGLGRGPLSFASQLRAVYGHAFSYCL--VDHGS 315

Query: 297 ASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQAS----GFAK 352
           A GS I+ G+ +V  +   + YT   P+    TFY + L GI +GG+ L       G +K
Sbjct: 316 AVGSKIVFGDDNVLLSHPQLNYTAFAPSAAENTFYYVQLKGILVGGEMLDIPSNTWGVSK 375

Query: 353 ----GGILIDSGTVITRLPPSIYSALKAEFLKQFS-GFPSAPGFSILDTCFNLSAYQEVN 407
               GG +IDSGT ++  P   Y A++  F+ +    +P    F +L  C+N+S  + V 
Sbjct: 376 EDGSGGTIIDSGTTLSYFPEPAYKAIRQAFVDRMDKAYPLIADFPVLSPCYNVSGVERVE 435

Query: 408 IPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV-CLALASLSYEDETGIIGNYQQKNQRVI 466
           +P   + F   A    D     YF++ D   + CLA+          IIGNYQQ+N  V+
Sbjct: 436 VPEFSLLFADGA--VWDFPAENYFIRLDTEGIMCLAVLGTP-RSAMSIIGNYQQQNFHVL 492

Query: 467 YDTKNSQLGFAGEDCSSM 484
           YD  +++LGFA   C+ +
Sbjct: 493 YDLHHNRLGFAPRRCAEV 510


>gi|242073260|ref|XP_002446566.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
 gi|241937749|gb|EES10894.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
          Length = 452

 Score =  204 bits (520), Expect = 7e-50,   Method: Compositional matrix adjust.
 Identities = 143/356 (40%), Positives = 190/356 (53%), Gaps = 37/356 (10%)

Query: 148 VIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCS 207
            IVDTGSDL W QC+PC  C+ Q  PVFDPS S +Y  V C+S+ C  L  +T     C+
Sbjct: 115 AIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSSSTYATVPCSSALCSDLPTST-----CT 169

Query: 208 SSSPPDCNYFVSYGDGSYTRGELGREHLGLG--KASVNDFIFGCGRNNKGL-FGGVSGLM 264
           S+S   C Y  +YGD S T+G L  E   LG  K  +    FGCG  N+G  F   +GL+
Sbjct: 170 SAS--KCGYTYTYGDASSTQGVLASETFTLGKEKKKLPGVAFGCGDTNEGDGFTQGAGLV 227

Query: 265 GLGRSDLSLVSQTSEIFGGL--FSYCLPSTQDAGASGSLILGG---NSSVFKNSTPITYT 319
           GLGR  LSLVSQ      GL  FSYCL S  D      L+LGG     S    + P+  T
Sbjct: 228 GLGRGPLSLVSQL-----GLDKFSYCLTSLDDGDGKSPLLLGGSAAAISESAATAPVQTT 282

Query: 320 NMIPNPQLATFYILNLTGISIGGKQ--LQASGFA-----KGGILIDSGTVITRLPPSIYS 372
            ++ NP   +FY ++LTG+++G  +  L AS FA      GG+++DSGT IT L    Y 
Sbjct: 283 PLVKNPSQPSFYYVSLTGLTVGSTRITLPASAFAIQDDGTGGVIVDSGTSITYLELQGYR 342

Query: 373 ALKAEFLKQFSGFPSAPGFSI-LDTCFNLSA--YQEVNIPLVKMEFEGNAEMTVDVTGIV 429
           ALK  F+ Q +  P+  G  I LD CF   A    EV +P + + F+G A++  D+    
Sbjct: 343 ALKKAFVAQMA-LPTVDGSEIGLDLCFQGPAKGVDEVQVPKLVLHFDGGADL--DLPAEN 399

Query: 430 YFVKSDAS-QVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
           Y V   AS  +CL +A         IIGN+QQ+N + +YD     L FA   C+ +
Sbjct: 400 YMVLDSASGALCLTVAP---SRGLSIIGNFQQQNFQFVYDVAGDTLSFAPVQCNKL 452


>gi|449530542|ref|XP_004172253.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
           CELL 1-like [Cucumis sativus]
          Length = 486

 Score =  204 bits (519), Expect = 8e-50,   Method: Compositional matrix adjust.
 Identities = 146/425 (34%), Positives = 215/425 (50%), Gaps = 47/425 (11%)

Query: 83  DWNEQQQNRLILDNLHVQYLQSRIKNMISG----NIKDV------------SNTEIPLTS 126
           D+     +RL  D+  V+ L +RI   I G    +++ +             + E P+ S
Sbjct: 83  DYKSLTLSRLKRDSARVRSLTARIDLAIRGITGTDLEPLGNGGGGGSQFGTEDFESPIVS 142

Query: 127 GIRLQTLNYIATIELGGRNMTV--IVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYK 184
           G    +  Y + + +G     V  ++DTGSD++WVQC PC  CY Q DP F+P+ S S+ 
Sbjct: 143 GASQGSGEYFSRVGIGRPPSPVYMVLDTGSDVSWVQCAPCAECYEQTDPXFEPTSSASFT 202

Query: 185 KVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVND 244
            + C +  C +L+ +   +G C         Y VSYGDGSYT G+   E + LG  S+ +
Sbjct: 203 SLSCETEQCKSLDVSECRNGTCL--------YEVSYGDGSYTVGDFVTETVTLGSTSLGN 254

Query: 245 FIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILG 304
              GCG NN+GLF G +GL+GLG   LS  SQ   +    FSYCL    D  +  +  L 
Sbjct: 255 IAIGCGHNNEGLFIGAAGLLGLGGGSLSFPSQ---LNASSFSYCL---VDRDSDSTSTLD 308

Query: 305 GNSSVFKNSTPITYTNMI-PNPQLATFYILNLTGISIGGKQL-------QASGFAKGGIL 356
            NS +    TP   T  +  NP L TF+ L LTG+S+GG  L       Q S    GGI+
Sbjct: 309 FNSPI----TPDAVTAPLHRNPNLDTFFYLGLTGMSVGGAVLPIPETSFQMSEDGNGGII 364

Query: 357 IDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFE 416
           +DSGT +TRL  ++Y+ L+  F+K      +A G ++ DTC++LS+   V +P V   F 
Sbjct: 365 VDSGTAVTRLQTTVYNVLRDAFVKSTHDLQTARGVALFDTCYDLSSKSRVEVPTVSFHFA 424

Query: 417 GNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGF 476
              E+ +     +  V S+ +  C A A    +    I+GN QQ+  RV +D  NS +GF
Sbjct: 425 NGNELPLPAKNYLIPVDSEGT-FCFAFAPT--DSTLSILGNAQQQGTRVGFDLANSLVGF 481

Query: 477 AGEDC 481
           +   C
Sbjct: 482 SPNKC 486


>gi|224286159|gb|ACN40790.1| unknown [Picea sitchensis]
          Length = 452

 Score =  204 bits (519), Expect = 8e-50,   Method: Compositional matrix adjust.
 Identities = 153/481 (31%), Positives = 246/481 (51%), Gaps = 51/481 (10%)

Query: 10  ILSLLLPLMVSLFLLAKGAHCFEGKKKLHLHKLQWQQKSGSSSSCVSHQKSRIEMGAITL 69
           ILSL +  M  +  +A G +C    K L+       +K G  S  VS     I       
Sbjct: 14  ILSLAITFMCGVAEIAPGLNCRSSDKILN-------RKVGKRSHSVSFPLIHIYSECSPF 66

Query: 70  ELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIR 129
              ++         W      ++  D   +++L+       S + K+ +N  +P+ SG  
Sbjct: 67  RPPNRT--------WESLMSEKIRGDANRLRFLK-----RTSRSSKEDANANVPVRSG-- 111

Query: 130 LQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVL 187
             +  YI  ++ G   ++M  ++DTGSD+ W+ C+ C+ C++   P+FDP+ S SYK   
Sbjct: 112 --SGEYIIQVDFGTPKQSMYTLIDTGSDVAWIPCKQCQGCHSTA-PIFDPAKSSSYKPFA 168

Query: 188 CNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIF 247
           C+S  C  +      SG C  +S   C + V YGDG+   G L  + + LG   + +F F
Sbjct: 169 CDSQPCQEI------SGNCGGNS--KCQFEVLYGDGTQVDGTLASDAITLGSQYLPNFSF 220

Query: 248 GCGRN-NKGLFGGVSGLMGLGRSDLSLV-SQTSEIFGGLFSYCLPSTQDAGASGSLILGG 305
           GC  + ++  +     +   G S   L  + T+E+FGG FSYCLPS+  + +SGSL+LG 
Sbjct: 221 GCAESLSEDTYSSPGLMGLGGGSLSLLTQAPTAELFGGTFSYCLPSS--STSSGSLVLGK 278

Query: 306 NSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ--ASGFAKGG-ILIDSGTV 362
            ++V  +S+ + +T +I +P   TFY + L  IS+G  ++   A+  A GG  +IDSGT 
Sbjct: 279 EAAV--SSSSLKFTTLIKDPSFPTFYFVTLKAISVGNTRISVPATNIASGGGTIIDSGTT 336

Query: 363 ITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMT 422
           IT L PS Y  L+  F +Q S     P    +DTC++LS+   V++P + +  + N ++ 
Sbjct: 337 ITYLVPSAYKDLRDAFRQQLSSLQPTP-VEDMDTCYDLSS-SSVDVPTITLHLDRNVDLV 394

Query: 423 VDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
           +    I+   +S  S  CLA +S    D   IIGN QQ+N R+++D  NSQ+GFA E C+
Sbjct: 395 LPKENILITQESGLS--CLAFSS---TDSRSIIGNVQQQNWRIVFDVPNSQVGFAQEQCA 449

Query: 483 S 483
           +
Sbjct: 450 A 450


>gi|147862576|emb|CAN79341.1| hypothetical protein VITISV_006338 [Vitis vinifera]
          Length = 436

 Score =  204 bits (519), Expect = 8e-50,   Method: Compositional matrix adjust.
 Identities = 147/436 (33%), Positives = 223/436 (51%), Gaps = 47/436 (10%)

Query: 61  RIEMGAITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNT 120
           R E     + L+H +  SG      E+ Q  +    L +Q L ++  +          + 
Sbjct: 36  RPEKNGFRVSLRHVD--SGGNYTKFERLQRAVKRGRLRLQRLSAKTASF-------EPSV 86

Query: 121 EIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPS 178
           E P+ +G       ++  + +G      + I+DTGSDL W QC+PCK C++Q  P+FDP 
Sbjct: 87  EAPVHAG----NGEFLMNLAIGTPAETYSAIMDTGSDLIWTQCKPCKVCFDQPTPIFDPE 142

Query: 179 ISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG 238
            S S+ K+ C+S  C AL  ++ + G         C Y  SYGD S T+G L  E    G
Sbjct: 143 KSSSFSKLPCSSDLCVALPISSCSDG---------CEYRYSYGDHSSTQGVLATETFTFG 193

Query: 239 KASVNDFIFGCGRNNKG-LFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGA 297
            ASV+   FGCG +N+G  +   +GL+GLGR  LSL+SQ   +    FSYCL S  D+  
Sbjct: 194 DASVSKIGFGCGEDNRGRAYSQGAGLVGLGRGPLSLISQ---LGVPKFSYCLTSIDDSKG 250

Query: 298 SGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL--QASGFA---- 351
             +L++G  ++V K++ P   T +I NP   +FY L+L GIS+G   L  + S F+    
Sbjct: 251 ISTLLVGSEATV-KSAIP---TPLIQNPSRPSFYYLSLEGISVGDTLLPIEKSTFSIQDD 306

Query: 352 -KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAY-QEVNIP 409
             GG++IDSGT IT L  + ++ALK EF+ Q      A G + L+ CF L      V +P
Sbjct: 307 GSGGLIIDSGTTITYLKDNAFAALKKEFISQMKLDVDASGSTELELCFTLPPDGSPVEVP 366

Query: 410 LVKMEFEGNAEMTVDVTGIVYFVKSDASQV-CLALASLSYEDETGIIGNYQQKNQRVIYD 468
            +   FEG   + + +    Y ++  A +V CL + S S      I GN+QQ+N  V++D
Sbjct: 367 QLVFHFEG---VDLKLPKENYIIEDSALRVICLTMGSSS---GMSIFGNFQQQNIVVLHD 420

Query: 469 TKNSQLGFAGEDCSSM 484
            +   + FA   C+ +
Sbjct: 421 LEKETISFAPAQCNQL 436


>gi|297834758|ref|XP_002885261.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297331101|gb|EFH61520.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 500

 Score =  204 bits (519), Expect = 8e-50,   Method: Compositional matrix adjust.
 Identities = 162/517 (31%), Positives = 253/517 (48%), Gaps = 67/517 (12%)

Query: 10  ILSLLLPLMVSLFLLAKGAHCFEGKKKLHLHKLQWQQKSGSSSSCVSHQKSRIEMGA--- 66
            LSLL  + +SLFL A  A             L        + + +S   +R  + A   
Sbjct: 6   FLSLLTTVTLSLFLTATDASSRSLSTSTKTTVLDVVSSLQQTQTILSLDPTRSSLTATKP 65

Query: 67  --------------ITLELKHKN-YCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMIS 111
                         ++LEL  ++   + +  D+     +RL  D+  V  + ++I+  + 
Sbjct: 66  ESISDPVFFNSSSPLSLELHSRDTLVASQHKDYKSLVLSRLERDSSRVAGIAAKIRFAVE 125

Query: 112 G----NIKDVSNTEI---------PLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDL 156
           G    ++K V+N +          P+ SG+   +  Y + I +G   + M +++DTGSD+
Sbjct: 126 GIDRSDLKPVNNEDTRYQPEALTTPVVSGVSQGSGEYFSRIGVGTPAKEMYLVLDTGSDV 185

Query: 157 TWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNY 216
            W+QC+PC  CY Q DPVF+P+ S +YK + C++  C  LE     +  C S+    C Y
Sbjct: 186 NWIQCEPCSDCYQQSDPVFNPTSSSTYKSLTCSAPQCSLLE-----TSACRSNK---CLY 237

Query: 217 FVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVS 275
            VSYGDGS+T GEL  + +  G +  +ND   GCG +N+GLF G +GL+GLG   LS+  
Sbjct: 238 QVSYGDGSFTVGELATDTVTFGNSGKINDVALGCGHDNEGLFTGAAGLLGLGGGALSI-- 295

Query: 276 QTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNL 335
            T+++    FSYCL   +D+G S SL     +SV   S   T   ++ N ++ TFY + L
Sbjct: 296 -TNQMKATSFSYCLVD-RDSGKSSSLDF---NSVQLGSGDAT-APLLRNQKIDTFYYVGL 349

Query: 336 TGISIGGKQ---------LQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFP 386
           +G S+GG++         + ASG   GG+++D GT +TRL    Y++L+  FLK  +   
Sbjct: 350 SGFSVGGQKVMMPDAIFDVDASG--SGGVILDCGTAVTRLQTQAYNSLRDAFLKLTTNLK 407

Query: 387 S-APGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVK-SDASQVCLALA 444
                 S+ DTC++ S+   V +P V   F G   +  D+    Y +   D    C A A
Sbjct: 408 KGTSSISLFDTCYDFSSLSSVKVPTVAFHFTGGKSL--DLPAKNYLIPVDDNGTFCFAFA 465

Query: 445 SLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
             S      IIGN QQ+  R+ YD  N  +G +G  C
Sbjct: 466 PTS--SSLSIIGNVQQQGTRITYDLANKIIGLSGNKC 500


>gi|326506192|dbj|BAJ86414.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326516358|dbj|BAJ92334.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 492

 Score =  204 bits (519), Expect = 9e-50,   Method: Compositional matrix adjust.
 Identities = 133/351 (37%), Positives = 186/351 (52%), Gaps = 30/351 (8%)

Query: 148 VIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCS 207
           +++DTGSD+ W+QC PC+ CY Q   VFDP  S SY  V C +  C  L+     SG C 
Sbjct: 155 MVLDTGSDVVWLQCAPCRRCYEQSGQVFDPRRSRSYNAVGCAAPLCRRLD-----SGGCD 209

Query: 208 SSSPPDCNYFVSYGDGSYTRGELGREHLGL-GKASVNDFIFGCGRNNKGLFGGVSGLMGL 266
                 C Y V+YGDGS T G+   E L   G A V     GCG +N+GLF   +GL+GL
Sbjct: 210 LRRSA-CLYQVAYGDGSVTAGDFATETLTFAGGARVARVALGCGHDNEGLFVAAAGLLGL 268

Query: 267 GRSDLSLVSQTSEIFGGLFSYCL----PSTQDAGASGSLILGGNSSVFKNSTPITYTNMI 322
           GR  LS  +Q S  +G  FSYCL     S   A  S ++  G  S    ++   ++T M+
Sbjct: 269 GRGSLSFPTQISRRYGRSFSYCLVDRTSSANTASRSSTVTFG--SGAVGSTVASSFTPMV 326

Query: 323 PNPQLATFYILNLTGISIGGKQL-----------QASGFAKGGILIDSGTVITRLPPSIY 371
            NP++ TFY + L GIS+GG ++            +SG  +GG+++DSGT +TRL    Y
Sbjct: 327 KNPRMETFYYVQLIGISVGGARVPGVANSDLRLDPSSG--RGGVIVDSGTSVTRLARPAY 384

Query: 372 SALKAEFLKQFSGFPSAP-GFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVY 430
           SAL+  F    +G   +P GFS+ DTC++LS  + V +P V M F G AE  +     + 
Sbjct: 385 SALRDAFRGAAAGLRLSPGGFSLFDTCYDLSGRKVVKVPTVSMHFAGGAEAALPPENYLI 444

Query: 431 FVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
            V S  +  C A A    +    IIGN QQ+  RV++D    ++ F  + C
Sbjct: 445 PVDSKGT-FCFAFAGT--DGGVSIIGNIQQQGFRVVFDGDGQRVAFTPKGC 492


>gi|326532056|dbj|BAK01404.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 506

 Score =  204 bits (519), Expect = 9e-50,   Method: Compositional matrix adjust.
 Identities = 144/449 (32%), Positives = 218/449 (48%), Gaps = 45/449 (10%)

Query: 66  AITLELKHKNYCSGK------IVDWNEQQQNRLILDNLHVQ---------YLQSRIKNMI 110
           ++ L + H++  +G+       +D  E+   R+  D +H +            S  +  +
Sbjct: 73  SLKLHMTHRSAAAGETGKGSFFLDSAEKDAVRI--DTMHRRAALSGSAAARRDSAPRRAL 130

Query: 111 SGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCY 168
           S  +     + +P+ SG       Y+  + LG   R   +I+DTGSDL W+QC PC  C+
Sbjct: 131 SERVVATVESGVPVGSG------EYLVDVYLGTPPRRFRMIMDTGSDLNWLQCAPCLDCF 184

Query: 169 NQQDPVFDPSISPSYKKVLCNSSTCHALE-FATGNSGVCSSSSPPDCNYFVSYGDGSYTR 227
            Q  P+FDP+ S SY+ V C    C  +   A      C       C Y+  YGD S T 
Sbjct: 185 EQSGPIFDPAASISYRNVTCGDDRCRLVSPPAESAPRECRRPRSDPCPYYYWYGDQSNTT 244

Query: 228 GELGREHLGL-----GKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFG 282
           G+L  E   +     G   V+   FGCG  N+GLF G +GL+GLGR  LS  SQ   ++G
Sbjct: 245 GDLALEAFTVNLTQSGTRRVDGVAFGCGHRNRGLFHGAAGLLGLGRGPLSFASQLRGVYG 304

Query: 283 G-LFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIG 341
           G  FSYCL   +   A+GS I+ G+         + YT   P     TFY L L  I +G
Sbjct: 305 GHAFSYCL--VEHGSAAGSKIIFGHDDALLAHPQLNYTAFAPTTDADTFYYLQLKSILVG 362

Query: 342 GKQLQASG--FAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAP---GFSILDT 396
           G+ +  S    + GG +IDSGT ++  P   Y A++  F+ + S  PS P   GF +L  
Sbjct: 363 GEAVNISSDTLSAGGTIIDSGTTLSYFPEPAYQAIRQAFIDRMS--PSYPLILGFPVLSP 420

Query: 397 CFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV-CLALASLSYEDETGII 455
           C+N+S  ++V +P + + F   A          YF++ +   + CLA+          II
Sbjct: 421 CYNVSGAEKVEVPELSLVFADGAAWEFPAEN--YFIRLEPEGIMCLAVLGTP-RSGMSII 477

Query: 456 GNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
           GNYQQ+N  V+YD ++++LGFA   C+ +
Sbjct: 478 GNYQQQNFHVLYDLEHNRLGFAPRRCADV 506


>gi|449464952|ref|XP_004150193.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
 gi|449526850|ref|XP_004170426.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
          Length = 476

 Score =  204 bits (518), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 149/445 (33%), Positives = 222/445 (49%), Gaps = 51/445 (11%)

Query: 69  LELKHKNYCSGKIVDWNEQQQNRLIL---DNLHVQY---LQSRIKNMISGNIK------- 115
           LEL H+NY   ++ + + Q Q +L L   D L + +      R K  IS + K       
Sbjct: 51  LEL-HENYPIFELDNNSSQSQWKLKLFHRDKLPLNFDPDHPRRFKERISRDSKRVSSLLR 109

Query: 116 --------DVSNTEIPLTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCK 165
                    V++    + SG    +  Y   I +G   R+  V++D+GSD+ WVQCQPC 
Sbjct: 110 LLSSGSDEQVTDFGSDVVSGTEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCS 169

Query: 166 SCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSY 225
            CY Q DPVFDP+ S +Y  + C+SS C  L+ A  N G         C Y VSYGDGSY
Sbjct: 170 ECYQQSDPVFDPAGSATYAGISCDSSVCDRLDNAGCNDG--------RCRYEVSYGDGSY 221

Query: 226 TRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLF 285
           TRG L  E L  G+  + +   GCG  N+G+F G +GL+GLG   +S V Q     GG F
Sbjct: 222 TRGTLALETLTFGRVLIRNIAIGCGHMNRGMFIGAAGLLGLGGGAMSFVGQLGGQTGGAF 281

Query: 286 SYCLPSTQDAGASGSLILGGNSSVFKNSTPI--TYTNMIPNPQLATFYILNLTG------ 337
           SYCL S +   ++G+L  G      + + P+   +  +I NP+  +FY + L+G      
Sbjct: 282 SYCLVS-RGTESTGTLEFG------RGAMPVGAAWVPLIRNPRAPSFYYVGLSGLGVGGI 334

Query: 338 -ISIGGKQLQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDT 396
            + I  +  + +    GG+++D+GT +TRLP   Y A +  F+ Q +  P +   SI DT
Sbjct: 335 RVPIPEQIFELTDLGYGGVVMDTGTAVTRLPAPAYEAFRDTFIGQTANLPRSDRVSIFDT 394

Query: 397 CFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIG 456
           C+NL+ +  V +P V   F G   +T+     +  V  + +  C A A+ +      IIG
Sbjct: 395 CYNLNGFVSVRVPTVSFYFSGGPILTLPARNFLIPVDGEGT-FCFAFAASA--SGLSIIG 451

Query: 457 NYQQKNQRVIYDTKNSQLGFAGEDC 481
           N QQ+  ++  D  N  +GF    C
Sbjct: 452 NIQQEGIQISIDGSNGFVGFGPTIC 476


>gi|115438214|ref|NP_001043485.1| Os01g0598600 [Oryza sativa Japonica Group]
 gi|15290061|dbj|BAB63755.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
           Group]
 gi|113533016|dbj|BAF05399.1| Os01g0598600 [Oryza sativa Japonica Group]
 gi|125526702|gb|EAY74816.1| hypothetical protein OsI_02707 [Oryza sativa Indica Group]
          Length = 500

 Score =  204 bits (518), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 138/375 (36%), Positives = 194/375 (51%), Gaps = 25/375 (6%)

Query: 123 PLTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSIS 180
           P+ SG+   +  Y   I +G       +++DTGSD+ W+QC PC+ CY+Q   +FDP  S
Sbjct: 135 PVVSGLAQGSGEYFTKIGVGTPVTPALMVLDTGSDVVWLQCAPCRRCYDQSGQMFDPRAS 194

Query: 181 PSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGK- 239
            SY  V C +  C  L+     SG C       C Y V+YGDGS T G+   E L     
Sbjct: 195 HSYGAVDCAAPLCRRLD-----SGGCDLRRKA-CLYQVAYGDGSVTAGDFATETLTFASG 248

Query: 240 ASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL---PSTQDAG 296
           A V     GCG +N+GLF   +GL+GLGR  LS  SQ S  FG  FSYCL    S+  + 
Sbjct: 249 ARVPRVALGCGHDNEGLFVAAAGLLGLGRGSLSFPSQISRRFGRSFSYCLVDRTSSSASA 308

Query: 297 ASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA----- 351
            S S  +   S     S   ++T M+ NP++ TFY + L GIS+GG ++     +     
Sbjct: 309 TSRSSTVTFGSGAVGPSAAASFTPMVKNPRMETFYYVQLMGISVGGARVPGVAVSDLRLD 368

Query: 352 ----KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAP-GFSILDTCFNLSAYQEV 406
               +GG+++DSGT +TRL    Y+AL+  F    +G   +P GFS+ DTC++LS  + V
Sbjct: 369 PSTGRGGVIVDSGTSVTRLARPAYAALRDAFRAAAAGLRLSPGGFSLFDTCYDLSGLKVV 428

Query: 407 NIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVI 466
            +P V M F G AE  +     +  V S  +  C A A    +    IIGN QQ+  RV+
Sbjct: 429 KVPTVSMHFAGGAEAALPPENYLIPVDSRGT-FCFAFAGT--DGGVSIIGNIQQQGFRVV 485

Query: 467 YDTKNSQLGFAGEDC 481
           +D    +LGF  + C
Sbjct: 486 FDGDGQRLGFVPKGC 500


>gi|242092902|ref|XP_002436941.1| hypothetical protein SORBIDRAFT_10g011760 [Sorghum bicolor]
 gi|241915164|gb|EER88308.1| hypothetical protein SORBIDRAFT_10g011760 [Sorghum bicolor]
          Length = 445

 Score =  203 bits (516), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 145/387 (37%), Positives = 209/387 (54%), Gaps = 34/387 (8%)

Query: 103 QSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELGGRNM--TVIVDTGSDLTWVQ 160
            +R+  ++SG  K VS   +P   G  +++L Y+AT+  G   +   V++DTGSDLTW+Q
Sbjct: 85  HARLSYIVSG--KKVS---VPAHLGTSVKSLEYVATVSFGTPAVPQVVVIDTGSDLTWLQ 139

Query: 161 CQPCKS--CYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFV 218
           C+PC S  C  Q+DP+FDPS S +Y  V C S  C  L      SG CS+  P  C + +
Sbjct: 140 CKPCSSGQCSPQKDPLFDPSHSSTYSAVPCASGECKKLAADAYGSG-CSNGQP--CGFAI 196

Query: 219 SYGDGSYTRGELGREHLGLGK-ASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQT 277
           SY DG+ T G  G++ L L   A V DF FGCG +   L G   GL+GLGR   SL +Q 
Sbjct: 197 SYVDGTSTVGVYGKDKLTLAPGAIVKDFYFGCGHSKSSLPGLFDGLLGLGRLSESLGAQY 256

Query: 278 SEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTG 337
                  FSYCLP+       G L  G      +N +   +T M   P   TF  + L G
Sbjct: 257 GGGG--GFSYCLPAVNS--KPGFLAFGAG----RNPSGFVFTPMGRVPGQPTFSTVTLAG 308

Query: 338 ISIGGKQ--LQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILD 395
           I++GGK+  L+ S F+ GG+++DSGTV+T L  ++Y AL+A F +    +    G   LD
Sbjct: 309 ITVGGKKLDLRPSAFS-GGMIVDSGTVVTVLQSTVYRALRAAFREAMKAYRLVHG--DLD 365

Query: 396 TCFNLSAYQEVNIPLVKMEFEGNAEMTVDV-TGIVYFVKSDASQVCLALASLSYEDETGI 454
           TC++L+ Y+ V +P + + F G A + +DV  GI+          CLA A    +   G+
Sbjct: 366 TCYDLTGYKNVVVPKIALTFSGGATINLDVPNGILV-------NGCLAFAETGKDGTAGV 418

Query: 455 IGNYQQKNQRVIYDTKNSQLGFAGEDC 481
           +GN  Q+   V++DT  S+ GF  + C
Sbjct: 419 LGNVNQRTFEVLFDTSASKFGFRAKAC 445


>gi|357166728|ref|XP_003580821.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 479

 Score =  203 bits (516), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 151/439 (34%), Positives = 215/439 (48%), Gaps = 42/439 (9%)

Query: 68  TLELKHKNYCSGKIVDWNEQQQNRLIL---DNLHVQYLQSRIKNMISGNIK---DVSNTE 121
           +L+L H++  SG       ++   L L   D   V YLQ R+    S +     +   T 
Sbjct: 58  SLQLLHRDTVSGT--KHPSRRHAVLALASRDTARVAYLQRRLSPSPSPSSTSSVESGGTI 115

Query: 122 IPLTSGIRLQTLNYIATIELGGRNMT--VIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSI 179
           +   SG       Y+  + +G   +   ++ DTGSD+ WVQC PC  CY Q DP+FDP+ 
Sbjct: 116 VSHGSG------EYLVRVGIGSPPLEQHLVADTGSDVIWVQCSPCSDCYAQGDPLFDPAN 169

Query: 180 SPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL-G 238
           S S+  V CNS  C A   A   S         +C Y VSYGD SYT G L  E L L G
Sbjct: 170 SASFSPVPCNSGVCRA---AARYSSSSCGGGGGECEYKVSYGDKSYTNGVLALETLTLDG 226

Query: 239 KASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLP--STQDAG 296
              V     GCG  N+GLF   +GL+GLG   +SLV Q     GG FSYCL    + +  
Sbjct: 227 GTEVQGVAMGCGHENRGLFAEAAGLLGLGWGPMSLVGQLGGAAGGAFSYCLAGYYSGEGS 286

Query: 297 ASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ-------ASG 349
            SGSL+LG   +     T   +  ++ NP   +FY + + G+ + G++LQ          
Sbjct: 287 GSGSLVLGREDAA---PTGAVWVPLVRNPDAPSFYYVGVNGLGVAGERLQLQDGLFDLGD 343

Query: 350 FAKGGILIDSGTVITRLPPSIYSALKAEFLKQF-SGFPSAPGFSILDTCFNLSAYQEVNI 408
              GG+++D+GT +TRLP   Y+AL+  F   F  G P APG S+ DTC++LS Y  V +
Sbjct: 344 DGGGGVVMDTGTAVTRLPAEAYAALRGAFAGAFEEGAPRAPGVSLFDTCYDLSGYASVRV 403

Query: 409 PLVKMEFEGN------AEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKN 462
           P V + F G       A +T+    ++  V  D    CLA A+++      I+GN QQ+ 
Sbjct: 404 PTVALYFGGGGQGQEAASLTLPARNLLVPVD-DGGTYCLAFAAVA--SGPSILGNIQQQG 460

Query: 463 QRVIYDTKNSQLGFAGEDC 481
             +  D+ +  +GF    C
Sbjct: 461 IEITVDSASGYVGFGPATC 479


>gi|357143657|ref|XP_003573000.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-2-like [Brachypodium distachyon]
          Length = 464

 Score =  202 bits (515), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 167/491 (34%), Positives = 237/491 (48%), Gaps = 56/491 (11%)

Query: 13  LLLP--LMVSLFLLAKGAHCFEGKKKLHLHKLQWQQKSGSSSSCVSHQKSRIEMGAITLE 70
           LLLP  +M++   L   A   +  K L    L+        + C   +         T+ 
Sbjct: 8   LLLPCIIMITYHALVARAGDEKSYKVLSASSLK------PGAVCAEPKVRDSSSSGATVP 61

Query: 71  LKHKNYCSGKIVDWNEQQ---QNRLILDNLHVQYLQSRIKNMISGNIKDVSNTE--IPLT 125
           L H++     +    ++Q      L  D L   Y+Q +  +        +  +E  +P+ 
Sbjct: 62  LNHRHGPCSPVPSGKKKQPTFTELLRRDQLRANYIQRQFSDEHYPRTGGLQQSEATVPIA 121

Query: 126 SGIRLQTLNYIATIELGGRNM--TVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSY 183
            G  L TL Y+ T+ +G   +  T+ +DTGSD++W++C+           ++DP  S +Y
Sbjct: 122 LGSLLNTLEYVITVSIGSPAVAXTMFIDTGSDVSWLRCK---------SRLYDPGTSSTY 172

Query: 184 KKVLCNSSTCHALEFATGNSGV-CSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS- 241
               C++  C  L    G  G  CSS S   C Y V YGDGS T G  G + L L   S 
Sbjct: 173 APFSCSAPACAQL----GRRGTGCSSGS--TCVYSVKYGDGSNTTGTYGSDTLTLAGTSE 226

Query: 242 --VNDFIFGCGRNNKGLF-GGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGAS 298
             ++ F FGC     G       GLMGLG    S VSQT+  +G  FSYCLP T +  +S
Sbjct: 227 PLISGFQFGCSAVEHGFEEDNTDGLMGLGGDAQSFVSQTAATYGSAFSYCLPPTWN--SS 284

Query: 299 GSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ--ASGFAKGGIL 356
           G L LG  SS    +   + T M+ + Q ATFY L L GIS+GGK L+  +S F+ G I 
Sbjct: 285 GFLTLGAPSSSTSAAF--STTPMLRSKQAATFYGLLLRGISVGGKTLEIPSSVFSAGSI- 341

Query: 357 IDSGTVITRLPPSIYSALKAEF---LKQFSGFPSAPGFSILDTCFNLSAYQEVN---IPL 410
           +DSGTVITRLPP+ Y AL A F   + ++   P+AP   +LDTCF+ + + E N   +P 
Sbjct: 342 VDSGTVITRLPPTAYGALSAAFRDGMARYQYQPAAP-RGLLDTCFDFTGHGEGNNFTVPS 400

Query: 411 VKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTK 470
           V +  +G A + +   GIV          CLA A+   +  TGIIGN QQ+   V+YD  
Sbjct: 401 VALVLDGGAVVDLHPNGIV-------QDGCLAFAATDDDGRTGIIGNVQQRTFEVLYDVG 453

Query: 471 NSQLGFAGEDC 481
            S  GF    C
Sbjct: 454 QSVFGFRPGAC 464


>gi|255558694|ref|XP_002520371.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
 gi|223540418|gb|EEF41987.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
          Length = 557

 Score =  202 bits (515), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 136/392 (34%), Positives = 200/392 (51%), Gaps = 46/392 (11%)

Query: 124 LTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISP 181
           L SG+ L +  Y   + +G   R+ ++I+DTGSDL W+QC PC  C+ Q  P +DP  S 
Sbjct: 181 LESGVSLGSGEYFMDVFIGTPPRHFSLILDTGSDLNWIQCVPCYDCFVQNGPYYDPKESS 240

Query: 182 SYKKVLCNSSTCHALEFATGNSGVCSSSSPPD--------CNYFVSYGDGSYTRGELGRE 233
           S+K + C+   CH          + SS  PP         C YF  YGD S T G+   E
Sbjct: 241 SFKNIGCHDPRCH----------LVSSPDPPQPCKAENQTCPYFYWYGDSSNTTGDFALE 290

Query: 234 HLGLGKAS---------VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGL 284
              +   S         V + +FGCG  N+GLF G +GL+GLGR  LS  SQ   ++G  
Sbjct: 291 TFTVNLTSPAGKSEFKRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHS 350

Query: 285 FSYCL-PSTQDAGASGSLILGGNSSVFKNSTPITYTNMIP---NPQLATFYILNLTGISI 340
           FSYCL     D   S  LI G +  +  N   + +T+++    NP + TFY + +  I +
Sbjct: 351 FSYCLVDRNSDTNVSSKLIFGEDKDLL-NHPEVNFTSLVAGKENP-VDTFYYVQIKSIMV 408

Query: 341 GGKQLQA-------SGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI 393
           GG+ L+        S    GG ++DSGT ++      Y  +K  F+K+  G+P    F I
Sbjct: 409 GGEVLKIPEETWHLSPEGAGGTIVDSGTTLSYFAEPSYEIIKDAFVKKVKGYPVIKDFPI 468

Query: 394 LDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQ-VCLALASLSYEDET 452
           LD C+N+S  +++ +P  ++ FE  A     V    YF+K +  + VCLA+         
Sbjct: 469 LDPCYNVSGVEKMELPEFRILFEDGAVWNFPVEN--YFIKLEPEEIVCLAILGTP-RSAL 525

Query: 453 GIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
            IIGNYQQ+N  ++YDTK S+LG+A   C+ +
Sbjct: 526 SIIGNYQQQNFHILYDTKKSRLGYAPMKCADV 557


>gi|224080963|ref|XP_002306246.1| predicted protein [Populus trichocarpa]
 gi|222855695|gb|EEE93242.1| predicted protein [Populus trichocarpa]
          Length = 382

 Score =  202 bits (514), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 135/396 (34%), Positives = 203/396 (51%), Gaps = 31/396 (7%)

Query: 99  VQYLQSRIKNMISGNIKD--VSNTEIPLTSGIRLQTLNYIATIELGG--RNMTVIVDTGS 154
           V+ + S I+ + SG+     V +    + SG+   +  Y   I +G   R+  +++D+GS
Sbjct: 5   VKRVVSLIRRVSSGSTASYGVEDFGSEVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGS 64

Query: 155 DLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDC 214
           D+ WVQC+PC  CY+Q DP+FDP+ S S+  V C+S+ C  ++ A  NSG         C
Sbjct: 65  DIVWVQCKPCTQCYHQTDPLFDPADSASFMGVSCSSAVCDQVDNAGCNSG--------RC 116

Query: 215 NYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLV 274
            Y VSYGDGS T+G L  E L LG+  V +   GCG  N+G+F G +GL+GLG   +S V
Sbjct: 117 RYEVSYGDGSSTKGTLALETLTLGRTVVQNVAIGCGHMNQGMFVGAAGLLGLGGGSMSFV 176

Query: 275 SQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPI--TYTNMIPNPQLATFYI 332
            Q S   G  FSYCL S +   ++G L  G        + P+   +  +I NP   ++Y 
Sbjct: 177 GQLSRERGNAFSYCLVS-RVTNSNGFLEFG------SEAMPVGAAWIPLIRNPHSPSYYY 229

Query: 333 LNLTGISIGGKQL-------QASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGF 385
           + L+G+ +G  ++       + +    GG+++D+GT +TR P   Y A +  F+ Q    
Sbjct: 230 IGLSGLGVGDMKVPISEDIFELTELGNGGVVMDTGTAVTRFPTVAYEAFRDAFIDQTGNL 289

Query: 386 PSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALAS 445
           P A G SI DTC+NL  +  V +P V   F G   +T+     +  V  DA   C A A 
Sbjct: 290 PRASGVSIFDTCYNLFGFLSVRVPTVSFYFSGGPILTLPANNFLIPVD-DAGTFCFAFA- 347

Query: 446 LSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
                   I+GN QQ+  ++  D  N  +GF    C
Sbjct: 348 -PSPSGLSILGNIQQEGIQISVDGANEFVGFGPNVC 382


>gi|356536463|ref|XP_003536757.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 475

 Score =  202 bits (513), Expect = 4e-49,   Method: Compositional matrix adjust.
 Identities = 132/369 (35%), Positives = 192/369 (52%), Gaps = 29/369 (7%)

Query: 124 LTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISP 181
           + SG+   +  Y   I +G   RN  V++D+GSD+ WVQC+PC  CY+Q DPVF+P+ S 
Sbjct: 125 VVSGMEQGSGEYFVRIGVGSPPRNQYVVMDSGSDIIWVQCEPCTQCYHQSDPVFNPADSS 184

Query: 182 SYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS 241
           S+  V C S+ C  ++ A  + G         C Y VSYGDGSYT+G L  E +  G+  
Sbjct: 185 SFSGVSCASTVCSHVDNAACHEG--------RCRYEVSYGDGSYTKGTLALETITFGRTL 236

Query: 242 VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSL 301
           + +   GCG +N+G+F G +GL+GLG   +S V Q     GG FSYCL S +   +SG L
Sbjct: 237 IRNVAIGCGHHNQGMFVGAAGLLGLGGGPMSFVGQLGGQTGGAFSYCLVS-RGIESSGLL 295

Query: 302 ILGGNSSVFKNSTPI--TYTNMIPNPQLATFYILNLTG-------ISIGGKQLQASGFAK 352
             G      + + P+   +  +I NP+  +FY + L+G       +SI     + S    
Sbjct: 296 EFG------REAMPVGAAWVPLIHNPRAQSFYYIGLSGLGVGGLRVSISEDVFKLSELGD 349

Query: 353 GGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVK 412
           GG+++D+GT +TRLP   Y A +  F+ Q +  P A G SI DTC++L  +  V +P V 
Sbjct: 350 GGVVMDTGTAVTRLPTVAYEAFRDGFIAQTTNLPRASGVSIFDTCYDLFGFVSVRVPTVS 409

Query: 413 MEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNS 472
             F G   +T+     +  V  D    C A A  S      IIGN QQ+  ++  D  N 
Sbjct: 410 FYFSGGPILTLPARNFLIPVD-DVGTFCFAFAPSS--SGLSIIGNIQQEGIQISVDGANG 466

Query: 473 QLGFAGEDC 481
            +GF    C
Sbjct: 467 FVGFGPNVC 475


>gi|19424106|gb|AAL87345.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
           thaliana]
          Length = 500

 Score =  202 bits (513), Expect = 4e-49,   Method: Compositional matrix adjust.
 Identities = 146/448 (32%), Positives = 232/448 (51%), Gaps = 60/448 (13%)

Query: 67  ITLELKHKN-YCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISG----NIKDVSNTE 121
           ++LEL  ++ + + +  D+     +RL  D+  V  + ++I+  + G    ++K V N +
Sbjct: 80  LSLELHSRDTFVASQHKDYKSLTLSRLERDSSRVAGIVAKIRFAVEGVDRSDLKPVYNED 139

Query: 122 I---------PLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQ 170
                     P+ SG    +  Y + I +G   ++M +++DTGSD+ W+QC+PC  CY Q
Sbjct: 140 TRYQTEDLTTPVVSGASQGSGEYFSRIGVGTPAKDMYLVLDTGSDVNWIQCEPCADCYQQ 199

Query: 171 QDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGEL 230
            DPVF+P+ S +YK + C++  C  LE     +  C S+    C Y VSYGDGS+T GEL
Sbjct: 200 SDPVFNPTSSSTYKSLTCSAPQCSLLE-----TSACRSNK---CLYQVSYGDGSFTVGEL 251

Query: 231 GREHLGLGKA-SVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL 289
             + +  G +  +N+   GCG +N+GLF G +GL+GLG   LS+   T+++    FSYCL
Sbjct: 252 ATDTVTFGNSGKINNVALGCGHDNEGLFTGAAGLLGLGGGVLSI---TNQMKATSFSYCL 308

Query: 290 PSTQDAGASGSLI-----LGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQ 344
              +D+G S SL      LGG  +            ++ N ++ TFY + L+G S+GG++
Sbjct: 309 VD-RDSGKSSSLDFNSVQLGGGDAT---------APLLRNKKIDTFYYVGLSGFSVGGEK 358

Query: 345 ---------LQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPS-APGFSIL 394
                    + ASG   GG+++D GT +TRL    Y++L+  FLK        +   S+ 
Sbjct: 359 VVLPDAIFDVDASG--SGGVILDCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSSISLF 416

Query: 395 DTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDAS-QVCLALASLSYEDETG 453
           DTC++ S+   V +P V   F G   +  D+    Y +  D S   C A A  S      
Sbjct: 417 DTCYDFSSLSTVKVPTVAFHFTGGKSL--DLPAKNYLIPVDDSGTFCFAFAPTS--SSLS 472

Query: 454 IIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
           IIGN QQ+  R+ YD   + +G +G  C
Sbjct: 473 IIGNVQQQGTRITYDLSKNVIGLSGNKC 500


>gi|308044575|ref|NP_001183392.1| uncharacterized protein LOC100501808 [Zea mays]
 gi|238011188|gb|ACR36629.1| unknown [Zea mays]
          Length = 342

 Score =  202 bits (513), Expect = 4e-49,   Method: Compositional matrix adjust.
 Identities = 136/353 (38%), Positives = 187/353 (52%), Gaps = 30/353 (8%)

Query: 148 VIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCS 207
           +++DTGSD+ WVQC PC+ CY Q  PVFDP  S SY  V C ++ C  L+     SG C 
Sbjct: 1   MVLDTGSDVVWVQCAPCRRCYEQSGPVFDPRRSSSYGAVGCGAALCRRLD-----SGGCD 55

Query: 208 SSSPPDCNYFVSYGDGSYTRGELGREHLGL-GKASVNDFIFGCGRNNKGLFGGVSGLMGL 266
                 C Y V+YGDGS T G+   E L   G A V     GCG +N+GLF   +GL+GL
Sbjct: 56  LRRGA-CMYQVAYGDGSVTAGDFVTETLTFAGGARVARVALGCGHDNEGLFVAAAGLLGL 114

Query: 267 GRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASG-------SLILGGNSSVFKNSTPITYT 319
           GR  LS  +Q S  +G  FSYCL     +GA         S +  G  SV  +S   ++T
Sbjct: 115 GRGGLSFPTQISRRYGRSFSYCLVDRTSSGAGAAPGSHRSSTVSFGAGSVGASSA--SFT 172

Query: 320 NMIPNPQLATFYILNLTGISIGGKQLQA---------SGFAKGGILIDSGTVITRLPPSI 370
            M+ NP++ TFY + L GIS+GG ++               +GG+++DSGT +TRL  + 
Sbjct: 173 PMVRNPRMETFYYVQLVGISVGGARVPGVAESDLRLDPSTGRGGVIVDSGTSVTRLARAS 232

Query: 371 YSALKAEFLKQFSG--FPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGI 428
           YSAL+  F    +G    S  GFS+ DTC++L   + V +P V M F G AE  +     
Sbjct: 233 YSALRDAFRAAAAGGLRLSPGGFSLFDTCYDLGGRRVVKVPTVSMHFAGGAEAALPPENY 292

Query: 429 VYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
           +  V S  +  C A A    +    IIGN QQ+  RV++D    ++GFA + C
Sbjct: 293 LIPVDSRGT-FCFAFAGT--DGGVSIIGNIQQQGFRVVFDGDGQRVGFAPKGC 342


>gi|302806531|ref|XP_002985015.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
 gi|300147225|gb|EFJ13890.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
          Length = 533

 Score =  201 bits (512), Expect = 6e-49,   Method: Compositional matrix adjust.
 Identities = 157/467 (33%), Positives = 238/467 (50%), Gaps = 63/467 (13%)

Query: 66  AITLELKHKNYCSGKIVDWNEQQQNR--LILDNL-----HVQYLQSRIKNMISGNIKDVS 118
           ++ +ELKH+        D  +  +NR  L+L++L      +Q  Q R+   ++ +    +
Sbjct: 82  SLKMELKHR--------DHGQPTRNRRSLLLESLKRDITRLQSFQKRVSEKLTASANPEA 133

Query: 119 NTEIP---------------------LTSGIRLQTLNYIATIELGG--RNMTVIVDTGSD 155
             E+                      + SG  L    Y   + +G   R+  +I+DTGSD
Sbjct: 134 YLEMTNSSSTKSPPSPSSSWEEVDSTVESGAELGAGEYFMDVFVGNPPRHFLLIIDTGSD 193

Query: 156 LTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHAL--EFATGNSGVCSSSSPPD 213
           LTW+QC+PCK+C++Q  PVFDPS S S+K + CN++ C  +  +    NS   S +SP  
Sbjct: 194 LTWLQCKPCKACFDQSGPVFDPSQSTSFKIIPCNAAACDLVVHDECRDNS---SKTSPKT 250

Query: 214 CNYFVSYGDGSYTRGELGREHLGLGKA------SVNDFIFGCGRNNKGLFGGVSGLMGLG 267
           C YF  YGD S T G+L  E L +  +       + D + GCG +NKGLF G  GL+GLG
Sbjct: 251 CKYFYWYGDSSRTSGDLALESLSVSLSDHPSSLEIRDMVIGCGHSNKGLFQGAGGLLGLG 310

Query: 268 RSDLSLVSQ-TSEIFGGLFSYCL-PSTQDAGASGSLILGGNSSVFKNSTPITYTNMI-PN 324
           +  LS  SQ  S   G  FSYCL   T +   S ++  G   ++ ++   + +T  +  N
Sbjct: 311 QGALSFPSQLRSSPIGQSFSYCLVDRTNNLSVSSAISFGAGFALSRHFDQMRFTPFVRTN 370

Query: 325 PQLATFYILNLTGISIGGKQL--QASGFA-----KGGILIDSGTVITRLPPSIYSALKAE 377
             + TFY L + GI I  + L   A  FA      GG +IDSGT +T L    Y A+++ 
Sbjct: 371 NSVETFYYLGIQGIKIDQELLPIPAERFAIAPNGSGGTIIDSGTTLTYLNRDAYRAVESA 430

Query: 378 FLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDAS 437
           FL + S +P A  F IL  C+N +    V  P + + F+  AE+  D+    YF++ D  
Sbjct: 431 FLARIS-YPRADPFDILGICYNATGRTAVPFPTLSIVFQNGAEL--DLPQENYFIQPDPQ 487

Query: 438 QVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
           +    LA L   D   IIGN+QQ+N   +YD ++++LGFA  DCS++
Sbjct: 488 EAKHCLAILP-TDGMSIIGNFQQQNIHFLYDVQHARLGFANTDCSAL 533


>gi|15229656|ref|NP_188478.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|75273882|sp|Q9LS40.1|ASPG1_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 1;
           Short=AtASPG1; Flags: Precursor
 gi|11994113|dbj|BAB01116.1| CND41, chloroplast nucleoid DNA binding protein-like [Arabidopsis
           thaliana]
 gi|23297732|gb|AAN13013.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
           thaliana]
 gi|332642583|gb|AEE76104.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 500

 Score =  201 bits (511), Expect = 6e-49,   Method: Compositional matrix adjust.
 Identities = 146/448 (32%), Positives = 231/448 (51%), Gaps = 60/448 (13%)

Query: 67  ITLELKHKN-YCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISG----NIKDVSNTE 121
           ++LEL  ++ + + +  D+     +RL  D+  V  + ++I+  + G    ++K V N +
Sbjct: 80  LSLELHSRDTFVASQHKDYKSLTLSRLERDSSRVAGIVAKIRFAVEGVDRSDLKPVYNED 139

Query: 122 I---------PLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQ 170
                     P+ SG    +  Y + I +G   + M +++DTGSD+ W+QC+PC  CY Q
Sbjct: 140 TRYQTEDLTTPVVSGASQGSGEYFSRIGVGTPAKEMYLVLDTGSDVNWIQCEPCADCYQQ 199

Query: 171 QDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGEL 230
            DPVF+P+ S +YK + C++  C  LE     +  C S+    C Y VSYGDGS+T GEL
Sbjct: 200 SDPVFNPTSSSTYKSLTCSAPQCSLLE-----TSACRSNK---CLYQVSYGDGSFTVGEL 251

Query: 231 GREHLGLGKA-SVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL 289
             + +  G +  +N+   GCG +N+GLF G +GL+GLG   LS+   T+++    FSYCL
Sbjct: 252 ATDTVTFGNSGKINNVALGCGHDNEGLFTGAAGLLGLGGGVLSI---TNQMKATSFSYCL 308

Query: 290 PSTQDAGASGSLI-----LGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQ 344
              +D+G S SL      LGG  +            ++ N ++ TFY + L+G S+GG++
Sbjct: 309 VD-RDSGKSSSLDFNSVQLGGGDAT---------APLLRNKKIDTFYYVGLSGFSVGGEK 358

Query: 345 ---------LQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPS-APGFSIL 394
                    + ASG   GG+++D GT +TRL    Y++L+  FLK        +   S+ 
Sbjct: 359 VVLPDAIFDVDASG--SGGVILDCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSSISLF 416

Query: 395 DTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDAS-QVCLALASLSYEDETG 453
           DTC++ S+   V +P V   F G   +  D+    Y +  D S   C A A  S      
Sbjct: 417 DTCYDFSSLSTVKVPTVAFHFTGGKSL--DLPAKNYLIPVDDSGTFCFAFAPTS--SSLS 472

Query: 454 IIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
           IIGN QQ+  R+ YD   + +G +G  C
Sbjct: 473 IIGNVQQQGTRITYDLSKNVIGLSGNKC 500


>gi|302809015|ref|XP_002986201.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
 gi|300146060|gb|EFJ12732.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
          Length = 449

 Score =  201 bits (511), Expect = 8e-49,   Method: Compositional matrix adjust.
 Identities = 143/381 (37%), Positives = 208/381 (54%), Gaps = 27/381 (7%)

Query: 124 LTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISP 181
           + SG  L    Y   + +G   R+  +I+DTGSDLTW+QC+PCK+C++Q  PVFDPS S 
Sbjct: 76  VESGAELGAGEYFMDVFVGNPPRHFLLIIDTGSDLTWLQCKPCKACFDQSGPVFDPSQST 135

Query: 182 SYKKVLCNSSTCHAL--EFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGK 239
           S+K + CN++ C  +  +    NS   S +SP  C YF  YGD S T G+L  E L +  
Sbjct: 136 SFKIIPCNAAACDLVVHDECRDNS---SKTSPKTCKYFYWYGDSSRTSGDLALESLSVSL 192

Query: 240 A------SVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQ-TSEIFGGLFSYCL-PS 291
           +       + D + GCG +NKGLF G  GL+GLG+  LS  SQ  S   G  FSYCL   
Sbjct: 193 SDHPSSLEIRDMVIGCGHSNKGLFQGAGGLLGLGQGALSFPSQLRSSPIGQSFSYCLVDR 252

Query: 292 TQDAGASGSLILGGNSSVFKNSTPITYTNMI-PNPQLATFYILNLTGISIGGKQL--QAS 348
           T +   S ++  G   ++ ++   + +T  +  N  + TFY L + GI I  + L   A 
Sbjct: 253 TNNLSVSSAISFGAGFALSRHFDQMKFTPFVRTNNSVETFYYLGIQGIKIDQELLPIPAE 312

Query: 349 GFA-----KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAY 403
            FA      GG +IDSGT +T L    Y A+++ FL + S +P A  F IL  C+N +  
Sbjct: 313 RFAIATNGSGGTIIDSGTTLTYLNRDAYRAVESAFLARIS-YPRADPFDILGICYNATGR 371

Query: 404 QEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQ 463
             V  P + + F+  AE+  D+    YF++ D  +    LA L   D   IIGN+QQ+N 
Sbjct: 372 AAVPFPALSIVFQNGAEL--DLPQENYFIQPDPQEAKHCLAILP-TDGMSIIGNFQQQNI 428

Query: 464 RVIYDTKNSQLGFAGEDCSSM 484
             +YD ++++LGFA  DCS++
Sbjct: 429 HFLYDVQHARLGFANTDCSAL 449


>gi|224085379|ref|XP_002307559.1| predicted protein [Populus trichocarpa]
 gi|222857008|gb|EEE94555.1| predicted protein [Populus trichocarpa]
          Length = 455

 Score =  201 bits (510), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 135/392 (34%), Positives = 200/392 (51%), Gaps = 46/392 (11%)

Query: 124 LTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISP 181
           L SG+ L +  Y   + +G   ++ ++I+DTGSDL W+QC PC  C+ Q  P +DP  S 
Sbjct: 79  LESGVTLGSGEYFMDVFIGTPPKHYSLILDTGSDLNWIQCVPCHDCFEQNGPYYDPKESS 138

Query: 182 SYKKVLCNSSTCHALEFATGNSGVCSSSSPP--------DCNYFVSYGDGSYTRGELGRE 233
           S++ + C+   CH          + SS  PP         C YF  YGD S T G+   E
Sbjct: 139 SFRNIGCHDPRCH----------LVSSPDPPLPCKAENQTCPYFYWYGDSSNTTGDFATE 188

Query: 234 HLGL------GKAS---VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGL 284
              +      GK+    V + +FGCG  N+GLF G SGL+GLGR  LS  SQ   ++G  
Sbjct: 189 TFTVNLTSPTGKSEFKRVENVMFGCGHWNRGLFHGASGLLGLGRGPLSFSSQLQSLYGHS 248

Query: 285 FSYCL-PSTQDAGASGSLILGGNSSVFKNSTPITYTNMI---PNPQLATFYILNLTGISI 340
           FSYCL     D   S  LI G +  +  N   + +T ++    NP + TFY + +  I +
Sbjct: 249 FSYCLVDRNSDTNVSSKLIFGEDKDLL-NHPELNFTTLVGGKENP-VDTFYYVQIKSIMV 306

Query: 341 GGKQL-------QASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI 393
           GG+ L         +    GG ++DSGT ++      Y  +K  F+K+  G+P    F I
Sbjct: 307 GGEVLNIPESTWNMTSDGVGGTIVDSGTTLSYFTEPAYQIIKDAFVKKVKGYPIVQDFPI 366

Query: 394 LDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQ-VCLALASLSYEDET 452
           LD C+N+S  +++++P   + F   A     V    YF++ D  + VCLA+         
Sbjct: 367 LDPCYNVSGVEKIDLPDFGILFADGAVWNFPVEN--YFIRLDPEEVVCLAILGTP-RSAL 423

Query: 453 GIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
            IIGNYQQ+N  V+YDTK S+LG+A  +C+ +
Sbjct: 424 SIIGNYQQQNFHVLYDTKKSRLGYAPMNCADV 455


>gi|357439021|ref|XP_003589787.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355478835|gb|AES60038.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 456

 Score =  201 bits (510), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 124/367 (33%), Positives = 188/367 (51%), Gaps = 37/367 (10%)

Query: 124 LTSGIRLQTLNYIATIELGGRNMT--VIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISP 181
           + SG    +  Y   I +G   +   +++D+GSD+ W+QC+PC  CYNQ DP+F+P+ S 
Sbjct: 118 VVSGTEEGSGEYFVRIGIGSPAIYQYMVIDSGSDIVWIQCEPCDQCYNQTDPIFNPATSA 177

Query: 182 SYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS 241
           S+  V C+S+ C+ L+    +   C       C Y V+YGDGSYT+G L  E + +G+  
Sbjct: 178 SFIGVACSSNVCNQLD----DDVACRKGR---CGYQVAYGDGSYTKGTLALETITIGRTV 230

Query: 242 VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSL 301
           + D   GCG  N+G+F G +GL+GLG   +S V Q     GG F YCL        S ++
Sbjct: 231 IQDTAIGCGHWNEGMFVGAAGLLGLGGGPMSFVGQLGAQTGGAFGYCL-------VSRAM 283

Query: 302 ILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL-------QASGFAKGG 354
            +G             +  +I NP   +FY ++L+G+++GG ++       Q +    GG
Sbjct: 284 PVGA-----------MWVPLIHNPFYPSFYYVSLSGLAVGGIRVPISEQIFQLTDIGTGG 332

Query: 355 ILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKME 414
           +++D+GT ITRLP   Y+A +  F+ Q +  P APG SI DTC++L+ +  V +P V   
Sbjct: 333 VVMDTGTAITRLPTVAYNAFRDAFIAQTTNLPRAPGVSIFDTCYDLNGFVTVRVPTVSFY 392

Query: 415 FEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQL 474
           F G   +T      +     D    C A A         IIGN QQ+  +V  D  N  +
Sbjct: 393 FSGGQILTFPARNFL-IPADDVGTFCFAFA--PSPSGLSIIGNIQQEGIQVSIDGTNGFV 449

Query: 475 GFAGEDC 481
           GF    C
Sbjct: 450 GFGPNVC 456


>gi|449453902|ref|XP_004144695.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-1-like, partial [Cucumis sativus]
          Length = 716

 Score =  200 bits (509), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 137/358 (38%), Positives = 193/358 (53%), Gaps = 35/358 (9%)

Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNS 203
           R+ + I+DTGSDL W QC+PC+ C++Q  P+FDP  S S+ K+ C+S  C AL  +T   
Sbjct: 377 RSFSAIMDTGSDLIWTQCKPCQQCFDQSTPIFDPKQSSSFYKISCSSELCGALPTST--- 433

Query: 204 GVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA-----SVNDFIFGCGRNNKGL-F 257
             CSS     C Y  +YGD S T+G L  E    G +     S+    FGCG +N G  F
Sbjct: 434 --CSSDG---CEYLYTYGDSSSTQGVLAFETFTFGDSTEDQISIPGLGFGCGNDNNGDGF 488

Query: 258 GGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNST--P 315
              +GL+GLGR  LSLVSQ  E     F+YCL +  D+  S SL+LG  +++   ++   
Sbjct: 489 SQGAGLVGLGRGPLSLVSQLKE---QKFAYCLTAIDDSKPS-SLLLGSLANITPKTSKDE 544

Query: 316 ITYTNMIPNPQLATFYILNLTGISIGGKQLQ--ASGF-----AKGGILIDSGTVITRLPP 368
           +  T +I NP   +FY L+L GIS+GG QL    S F       GG++IDSGT IT +  
Sbjct: 545 MKTTPLIKNPSQPSFYYLSLQGISVGGTQLSIPKSTFELHDDGSGGVIIDSGTTITYVEN 604

Query: 369 SIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSA-YQEVNIPLVKMEFEGNAEMTVDVTG 427
           S +++LK EF+ Q +      G   LD CFNL A   +V +P +   F+G     +++ G
Sbjct: 605 SAFTSLKNEFIAQMNLPVDDSGTGGLDLCFNLPAGTNQVEVPKLTFHFKG---ADLELPG 661

Query: 428 IVYFV-KSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
             Y +  S A  +CLA+ S        I GN QQ+N  V++D +   L F    C S+
Sbjct: 662 ENYMIGDSKAGLLCLAIGS---SRGMSIFGNLQQQNFMVVHDLQEETLSFLPTQCDSI 716


>gi|224126221|ref|XP_002329620.1| predicted protein [Populus trichocarpa]
 gi|222870359|gb|EEF07490.1| predicted protein [Populus trichocarpa]
          Length = 357

 Score =  200 bits (509), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 131/371 (35%), Positives = 191/371 (51%), Gaps = 33/371 (8%)

Query: 123 PLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSIS 180
           P+TSG    +  Y   + +G   R   +++DTGSD+ W+QCQPC  CY Q DP+FDP+ S
Sbjct: 8   PVTSGTSQGSGEYFTRVGVGNPARQFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPTAS 67

Query: 181 PSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA 240
            +Y  V C S  C +LE ++  SG         C Y V+YGDGSYT G+   E +  G +
Sbjct: 68  STYAPVTCQSQQCSSLEMSSCRSG--------QCLYQVNYGDGSYTFGDFATESVSFGNS 119

Query: 241 -SVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASG 299
            SV +   GCG +N+GLF G +GL+GLG   LSL   T+++    FSYCL +   AG+S 
Sbjct: 120 GSVKNVALGCGHDNEGLFVGAAGLLGLGGGPLSL---TNQLKATSFSYCLVNRDSAGSST 176

Query: 300 SLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGK---------QLQASGF 350
                    V   + P     ++ N ++ TFY + L+G+S+GG+         +L  SG 
Sbjct: 177 LDFNSAQLGVDSVTAP-----LMKNRKIDTFYYVGLSGMSVGGQMVSIPESTFRLDESG- 230

Query: 351 AKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPL 410
             GGI++D GT ITRL    Y+ L+  F++           ++ DTC++LS    V +P 
Sbjct: 231 -NGGIIVDCGTAITRLQTQAYNPLRDAFVRMTQNLKLTSAVALFDTCYDLSGQASVRVPT 289

Query: 411 VKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTK 470
           V   F       +     +  V S A   C A A  +      IIGN QQ+  RV +D  
Sbjct: 290 VSFHFADGKSWNLPAANYLIPVDS-AGTYCFAFAPTT--SSLSIIGNVQQQGTRVTFDLA 346

Query: 471 NSQLGFAGEDC 481
           N+++GF+   C
Sbjct: 347 NNRMGFSPNKC 357


>gi|449523529|ref|XP_004168776.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
          Length = 461

 Score =  200 bits (508), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 137/358 (38%), Positives = 193/358 (53%), Gaps = 35/358 (9%)

Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNS 203
           R+ + I+DTGSDL W QC+PC+ C++Q  P+FDP  S S+ K+ C+S  C AL  +T   
Sbjct: 122 RSFSAIMDTGSDLIWTQCKPCQQCFDQSTPIFDPKQSSSFYKISCSSELCGALPTST--- 178

Query: 204 GVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA-----SVNDFIFGCGRNNKGL-F 257
             CSS     C Y  +YGD S T+G L  E    G +     S+    FGCG +N G  F
Sbjct: 179 --CSSDG---CEYLYTYGDSSSTQGVLAFETFTFGDSTEDQISIPGLGFGCGNDNNGDGF 233

Query: 258 GGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNST--P 315
              +GL+GLGR  LSLVSQ  E     F+YCL +  D+  S SL+LG  +++   ++   
Sbjct: 234 SQGAGLVGLGRGPLSLVSQLKE---QKFAYCLTAIDDSKPS-SLLLGSLANITPKTSKDE 289

Query: 316 ITYTNMIPNPQLATFYILNLTGISIGGKQLQ--ASGF-----AKGGILIDSGTVITRLPP 368
           +  T +I NP   +FY L+L GIS+GG QL    S F       GG++IDSGT IT +  
Sbjct: 290 MKTTPLIKNPSQPSFYYLSLQGISVGGTQLSIPKSTFELHDDGSGGVIIDSGTTITYVEN 349

Query: 369 SIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSA-YQEVNIPLVKMEFEGNAEMTVDVTG 427
           S +++LK EF+ Q +      G   LD CFNL A   +V +P +   F+G     +++ G
Sbjct: 350 SAFTSLKNEFIAQMNLPVDDSGTGGLDLCFNLPAGTNQVEVPKLTFHFKG---ADLELPG 406

Query: 428 IVYFV-KSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
             Y +  S A  +CLA+ S        I GN QQ+N  V++D +   L F    C S+
Sbjct: 407 ENYMIGDSKAGLLCLAIGS---SRGMSIFGNLQQQNFMVVHDLQEETLSFLPTQCDSI 461


>gi|242041575|ref|XP_002468182.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
 gi|241922036|gb|EER95180.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
          Length = 490

 Score =  200 bits (508), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 136/369 (36%), Positives = 188/369 (50%), Gaps = 36/369 (9%)

Query: 134 NYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSS 191
            YIA I +G  G    + +DT SDLTW+QCQPC+ CY Q  PVFDP  S SY+++  N++
Sbjct: 137 EYIAKIAVGTPGVEALLALDTASDLTWLQCQPCRRCYPQSGPVFDPRHSTSYREMSFNAA 196

Query: 192 TCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL-GKASVNDFIFGCG 250
            C AL  + G      +     C Y V YGDGS T G+   E L   G   +     GCG
Sbjct: 197 DCQALGRSGGGDAKRGT-----CVYTVGYGDGSTTVGDFIEETLTFAGGVRLPRISIGCG 251

Query: 251 RNNKGLFGG-VSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSV 309
            +NKGLFG   +G++GLGR  +S  +Q      G FSYCL        S S  L   +  
Sbjct: 252 HDNKGLFGAPAAGILGLGRGLMSFPNQIDH--NGTFSYCLVDFLSGPGSLSSTLTFGAGA 309

Query: 310 FKNSTPITYTNMIPNPQLATFYILNLTGISIGG--------KQLQASGF-AKGGILIDSG 360
              S P+++T  + N  + TFY + LTGIS+GG        + LQ   +  +GG+++DSG
Sbjct: 310 VDTSPPVSFTPTVLNLNMPTFYYVRLTGISVGGVRVPGVTERDLQLDPYTGRGGVIVDSG 369

Query: 361 TVITRLPPSIYSALKAEF------LKQFS-GFPSAPGFSILDTCFNLSAYQEVNIPLVKM 413
           T +TRL    Y+A +  F      L Q S G PS  GF   DTC+ +       +P V M
Sbjct: 370 TAVTRLARPAYTAFRDAFRAVAVDLGQVSIGGPS--GF--FDTCYTVGGRGMKKVPTVSM 425

Query: 414 EFEGNAEMTVDVTGIVYFVKSDA-SQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNS 472
            F G+ E+ +      Y +  D+   VC A A+   +    IIGN QQ+  R++YD    
Sbjct: 426 HFAGSVEVKLQPKN--YLIPVDSMGTVCFAFAATG-DHSVSIIGNIQQQGFRIVYDI-GG 481

Query: 473 QLGFAGEDC 481
           ++GFA   C
Sbjct: 482 RVGFAPNSC 490


>gi|449486856|ref|XP_004157423.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 491

 Score =  200 bits (508), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 148/421 (35%), Positives = 213/421 (50%), Gaps = 44/421 (10%)

Query: 83  DWNEQQQNRLILDNLHVQYLQSRIKNMISGNIK---DVSNTEI-------PLTSGIRLQT 132
           D+     +RL  D+  V+ +  R++  +S   +   +   TEI       P+ SG    +
Sbjct: 93  DYKSLVLSRLSRDSSRVKSIYDRLEFALSELKRSDLEPLKTEILPEDLSTPIISGTSQGS 152

Query: 133 LNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNS 190
             Y + + +G   +   +++DTGSD+ W+QCQPC  CY Q DP+FDP  S S+  + C S
Sbjct: 153 GEYFSRVGVGQPAKPFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPRSSSSFASLPCES 212

Query: 191 STCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS-VNDFIFGC 249
             C ALE     SG C +S    C Y VSYGDGS+T GE   E L  G +  +ND   GC
Sbjct: 213 QQCQALE----TSG-CRASK---CLYQVSYGDGSFTVGEFVTETLTFGNSGMINDVAVGC 264

Query: 250 GRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSV 309
           G +N+GLF    G  GL       +S TS++    FSYCL    D  +S S  L  NS+ 
Sbjct: 265 GHDNEGLF---VGSAGLLGLGGGPLSLTSQMKASSFSYCL---VDRDSSSSSDLEFNSAA 318

Query: 310 FKNSTPITYTNMIPNPQLATFYILNLTGISIGGK---------QLQASGFAKGGILIDSG 360
             +S       ++ + ++ TFY + LTG+S+GG+         Q+  SG+  GGI++DSG
Sbjct: 319 PSDS---VNAPLLKSGKVDTFYYVGLTGMSVGGQLLSIPPNLFQMDDSGY--GGIIVDSG 373

Query: 361 TVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAE 420
           T ITRL    Y+ L+  F+ +        GF++ DTC++LS+   V IP V  EF G   
Sbjct: 374 TAITRLQTQAYNTLRDAFVSRTPYLKKTNGFALFDTCYDLSSQSRVTIPTVSFEFAGGKS 433

Query: 421 MTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGED 480
           + +     +  V S     C A A  +      IIGN QQ+  RV YD  NS +GF+   
Sbjct: 434 LQLPPKNYLIPVDS-VGTFCFAFAPTT--SSLSIIGNVQQQGTRVHYDLANSVVGFSPHK 490

Query: 481 C 481
           C
Sbjct: 491 C 491


>gi|449440933|ref|XP_004138238.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 487

 Score =  200 bits (508), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 146/422 (34%), Positives = 217/422 (51%), Gaps = 43/422 (10%)

Query: 83  DWNEQQQNRLILDNLHVQYLQSRIKNMISGNI---KDVSNTEI------PLTSG-IRLQT 132
           D+N   + RL  D   VQ+L   ++  ++G     + ++ + I      P+ SG  +   
Sbjct: 86  DYNTLVRARLTRDAARVQFLNRNLERSLNGGTHFGESINESLIGDSITAPVVSGQSKGSG 145

Query: 133 LNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKS---CYNQQDPVFDPSISPSYKKVL 187
             Y+A I +G   +   ++ DTGSD+TW+QCQPC S   CY Q DP+FDP  S SY  + 
Sbjct: 146 AEYLAQIGVGQPVKLFYLVPDTGSDVTWLQCQPCASENTCYKQFDPIFDPKSSSSYSPLS 205

Query: 188 CNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFI 246
           CNS  C  L+ A  NS  C         Y V YGDGS+T GEL  E L  G + S+ +  
Sbjct: 206 CNSQQCKLLDKANCNSDTCI--------YQVHYGDGSFTTGELATETLSFGNSNSIPNLP 257

Query: 247 FGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGN 306
            GCG +N+GLF G +GL+GLG   +SL SQ   +    FSYCL +  D+ +S +L    N
Sbjct: 258 IGCGHDNEGLFAGGAGLIGLGGGAISLSSQ---LKASSFSYCLVNL-DSDSSSTLEFNSN 313

Query: 307 SSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGF-------AKGGILIDS 359
                 ++P     ++ N +  ++  + + GIS+GGK L  S           GGI++DS
Sbjct: 314 MPSDSLTSP-----LVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEIDESGLGGIIVDS 368

Query: 360 GTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNA 419
           GT+I+RLP  +Y +L+  F+K  S    APG S+ DTC+N S    V +P +        
Sbjct: 369 GTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFSGQSNVEVPTIAFVLSEGT 428

Query: 420 EMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGE 479
            + +     +  + + A   CLA   +  +    IIG++QQ+  RV YD  NS +GF+  
Sbjct: 429 SLRLPARNYLIMLDT-AGTYCLAF--IKTKSSLSIIGSFQQQGIRVSYDLTNSLVGFSTN 485

Query: 480 DC 481
            C
Sbjct: 486 KC 487


>gi|357167693|ref|XP_003581287.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 468

 Score =  200 bits (508), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 136/350 (38%), Positives = 183/350 (52%), Gaps = 29/350 (8%)

Query: 149 IVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSS 208
           IVDTGSDL W QC+PC  C+NQ  PVFDPS S +Y  + C+SS C  L  +T     C+S
Sbjct: 134 IVDTGSDLVWTQCKPCVECFNQSTPVFDPSSSSTYSTLPCSSSLCSDLPTST-----CTS 188

Query: 209 SSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGL-FGGVSGLMGLG 267
           ++  DC Y  +YGD S T+G L  E   L K  +    FGCG  N+G  F   +GL+GLG
Sbjct: 189 AA-KDCGYTYTYGDASSTQGVLAAETFTLAKTKLPGVAFGCGDTNEGDGFTQGAGLVGLG 247

Query: 268 RSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLI--LGGNSSVFKNSTPITYTNMIPNP 325
           R  LSLVSQ      G FSYCL S  D   S  L+  L   S+   ++  I  T +I NP
Sbjct: 248 RGPLSLVSQLGL---GKFSYCLTSLDDTSKSPLLLGSLAAISTDTASAAAIQTTPLIKNP 304

Query: 326 QLATFYILNLTGISIGGKQ--LQASGFA-----KGGILIDSGTVITRLPPSIYSALKAEF 378
              +FY + L  +++G  +  L  S FA      GG+++DSGT IT L    Y  LK  F
Sbjct: 305 SQPSFYYVTLKALTVGSTRIPLPGSAFAVQDDGTGGVIVDSGTSITYLELQGYRPLKKAF 364

Query: 379 LKQFSGFPSAPGFSI-LDTCFNLSA--YQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSD 435
             Q    P A G ++ LD CF   A    +V +P + + F+G A++  D+    Y V   
Sbjct: 365 AAQMK-LPVADGSAVGLDLCFKAPASGVDDVEVPKLVLHFDGGADL--DLPAENYMVLDS 421

Query: 436 AS-QVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
           AS  +CL +          IIGN+QQ+N + +YD     L FA   C+ +
Sbjct: 422 ASGALCLTVMG---SRGLSIIGNFQQQNIQFVYDVDKDTLSFAPVQCAKL 468


>gi|115465771|ref|NP_001056485.1| Os05g0590000 [Oryza sativa Japonica Group]
 gi|49328116|gb|AAT58814.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
 gi|113580036|dbj|BAF18399.1| Os05g0590000 [Oryza sativa Japonica Group]
          Length = 481

 Score =  199 bits (505), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 135/376 (35%), Positives = 195/376 (51%), Gaps = 27/376 (7%)

Query: 123 PLTSGIRLQTLNYIATIELGGRNMT--VIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSIS 180
           PL SG+   +  Y A + +G    T  +++DTGSD+ W+QC PC+ CY Q   VFDP  S
Sbjct: 116 PLLSGLPQGSGEYFAQVGVGTPATTALMVLDTGSDVVWLQCAPCRHCYAQSGRVFDPRRS 175

Query: 181 PSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGK- 239
            SY  V C +  C  L+ A  +    S      C Y V+YGDGS T G+   E L   + 
Sbjct: 176 RSYAAVDCVAPICRRLDSAGCDRRRNS------CLYQVAYGDGSVTAGDFASETLTFARG 229

Query: 240 ASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL---PSTQDAG 296
           A V     GCG +N+GLF   SGL+GLGR  LS  SQ +  FG  FSYCL    S+    
Sbjct: 230 ARVQRVAIGCGHDNEGLFIAASGLLGLGRGRLSFPSQIARSFGRSFSYCLVDRTSSVRPS 289

Query: 297 ASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQA--------- 347
           ++ S  +   +     +   ++T M  NP++ATFY ++L G S+GG +++          
Sbjct: 290 STRSSTVTFGAGAVAAAAGASFTPMGRNPRMATFYYVHLLGFSVGGARVKGVSQSDLRLN 349

Query: 348 SGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAP-GFSILDTCFNLSAYQEV 406
               +GG+++DSGT +TRL   +Y A++  F     G   +P GFS+ DTC+NLS  + V
Sbjct: 350 PTTGRGGVILDSGTSVTRLARPVYEAVRDAFRAAAVGLRVSPGGFSLFDTCYNLSGRRVV 409

Query: 407 NIPLVKMEFEGNAEMTVDVTGIVYFVKSDAS-QVCLALASLSYEDETGIIGNYQQKNQRV 465
            +P V M   G A + +      Y +  D S   C A+A    +    IIGN QQ+  RV
Sbjct: 410 KVPTVSMHLAGGASVALPPEN--YLIPVDTSGTFCFAMAGT--DGGVSIIGNIQQQGFRV 465

Query: 466 IYDTKNSQLGFAGEDC 481
           ++D    ++GF  + C
Sbjct: 466 VFDGDAQRVGFVPKSC 481


>gi|226492391|ref|NP_001140482.1| uncharacterized protein LOC100272542 precursor [Zea mays]
 gi|224030447|gb|ACN34299.1| unknown [Zea mays]
 gi|414887506|tpg|DAA63520.1| TPA: hypothetical protein ZEAMMB73_432695 [Zea mays]
          Length = 512

 Score =  199 bits (505), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 142/467 (30%), Positives = 221/467 (47%), Gaps = 37/467 (7%)

Query: 47  KSGSSSSCVSHQKSRIEMGAITLELKHKNYCSG------KIVDWNEQQQNRLILDNLHVQ 100
           K G++      QK      ++ L + H+    G        +D  E+   R+  + +H +
Sbjct: 54  KLGAAEDAADEQKPASPSSSLKLHMTHRRGAEGGRTRKGSFLDLAEKDAVRV--EAMHRR 111

Query: 101 YLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTW 158
              S         + +       + SG+ + +  Y+  + +G   R   +I+DTGSDL W
Sbjct: 112 VASSSSSPRRGRALSESERVVATVESGVAVGSAEYLMDVYVGTPPRRFQMIMDTGSDLNW 171

Query: 159 VQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTC-HALEFATGNSGVCSSSSPPDCNYF 217
           +QC PC  C+ Q+ PVFDP+ S SY+ + C    C H           C       C Y+
Sbjct: 172 LQCAPCLDCFEQRGPVFDPAASSSYRNLTCGDPRCGHVAPPEAPAPRACRRPGEDPCPYY 231

Query: 218 VSYGDGSYTRGELGREHLGL------GKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDL 271
             YGD S + G+L  E   +        + V+  +FGCG  N+GLF G +GL+GLGR  L
Sbjct: 232 YWYGDQSNSTGDLALESFTVNLTAPGASSRVDGVVFGCGHRNRGLFHGAAGLLGLGRGPL 291

Query: 272 SLVSQTSEIFGG-LFSYCLPSTQDAGASGSLILGGNSSVFKNSTP-ITYTNMIPNPQLA- 328
           S  SQ   ++GG  FSYCL     +  +  ++ G + ++   + P + YT   P    A 
Sbjct: 292 SFASQLRAVYGGHTFSYCL-VDHGSDVASKVVFGEDDALALAAHPRLKYTAFAPASSPAD 350

Query: 329 TFYILNLTGISIGGKQL-------QASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQ 381
           TFY + LTG+ +GG+ L        AS    GG +IDSGT ++      Y  ++  F+ +
Sbjct: 351 TFYYVRLTGVLVGGELLNISSDTWDASEGGSGGTIIDSGTTLSYFVEPAYQVIRRAFIDR 410

Query: 382 FSG-FPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV- 439
            SG +P  P F +L  C+N+S  +   +P + + F   A    D     YF++ D   + 
Sbjct: 411 MSGSYPPVPDFPVLSPCYNVSGVERPEVPELSLLFADGA--VWDFPAENYFIRLDPDGIM 468

Query: 440 CLALASLSYEDETG--IIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
           CLA+        TG  IIGN+QQ+N  V YD  N++LGFA   C+ +
Sbjct: 469 CLAVLGTP---RTGMSIIGNFQQQNFHVAYDLHNNRLGFAPRRCAEV 512


>gi|47777372|gb|AAT38006.1| unknow protein [Oryza sativa Japonica Group]
          Length = 475

 Score =  199 bits (505), Expect = 4e-48,   Method: Compositional matrix adjust.
 Identities = 135/376 (35%), Positives = 195/376 (51%), Gaps = 27/376 (7%)

Query: 123 PLTSGIRLQTLNYIATIELGGRNMT--VIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSIS 180
           PL SG+   +  Y A + +G    T  +++DTGSD+ W+QC PC+ CY Q   VFDP  S
Sbjct: 110 PLLSGLPQGSGEYFAQVGVGTPATTALMVLDTGSDVVWLQCAPCRHCYAQSGRVFDPRRS 169

Query: 181 PSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGK- 239
            SY  V C +  C  L+ A  +    S      C Y V+YGDGS T G+   E L   + 
Sbjct: 170 RSYAAVDCVAPICRRLDSAGCDRRRNS------CLYQVAYGDGSVTAGDFASETLTFARG 223

Query: 240 ASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL---PSTQDAG 296
           A V     GCG +N+GLF   SGL+GLGR  LS  SQ +  FG  FSYCL    S+    
Sbjct: 224 ARVQRVAIGCGHDNEGLFIAASGLLGLGRGRLSFPSQIARSFGRSFSYCLVDRTSSVRPS 283

Query: 297 ASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQA--------- 347
           ++ S  +   +     +   ++T M  NP++ATFY ++L G S+GG +++          
Sbjct: 284 STRSSTVTFGAGAVAAAAGASFTPMGRNPRMATFYYVHLLGFSVGGARVKGVSQSDLRLN 343

Query: 348 SGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAP-GFSILDTCFNLSAYQEV 406
               +GG+++DSGT +TRL   +Y A++  F     G   +P GFS+ DTC+NLS  + V
Sbjct: 344 PTTGRGGVILDSGTSVTRLARPVYEAVRDAFRAAAVGLRVSPGGFSLFDTCYNLSGRRVV 403

Query: 407 NIPLVKMEFEGNAEMTVDVTGIVYFVKSDAS-QVCLALASLSYEDETGIIGNYQQKNQRV 465
            +P V M   G A + +      Y +  D S   C A+A    +    IIGN QQ+  RV
Sbjct: 404 KVPTVSMHLAGGASVALPPEN--YLIPVDTSGTFCFAMAGT--DGGVSIIGNIQQQGFRV 459

Query: 466 IYDTKNSQLGFAGEDC 481
           ++D    ++GF  + C
Sbjct: 460 VFDGDAQRVGFVPKSC 475


>gi|297817972|ref|XP_002876869.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297322707|gb|EFH53128.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 462

 Score =  198 bits (504), Expect = 4e-48,   Method: Compositional matrix adjust.
 Identities = 141/395 (35%), Positives = 204/395 (51%), Gaps = 39/395 (9%)

Query: 110 ISGNIKDVSNTEIPLTSGIRLQTLNYIATIELGGRNM--TVIVDTGSDLTWVQCQPCKSC 167
           ++ N  D +N + P   G    +  ++  + +G   +    IVDTGSDL W QC+PC  C
Sbjct: 87  VASNPDDTNNIKAPTHGG----SGEFLMELSIGNPAVKYAAIVDTGSDLIWTQCKPCTEC 142

Query: 168 YNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTR 227
           ++Q  P+FDP  S SY KV C+S  C+AL  +  N    S      C Y  +YGD S TR
Sbjct: 143 FDQPTPIFDPEKSSSYSKVGCSSGLCNALPRSNCNEDKDS------CEYLYTYGDYSSTR 196

Query: 228 GELGREHLGL-GKASVNDFIFGCGRNNKGL-FGGVSGLMGLGRSDLSLVSQTSEIFGGLF 285
           G L  E      + S++   FGCG  N+G  F   SGL+GLGR  LSL+SQ  E     F
Sbjct: 197 GLLATETFTFEDENSISGIGFGCGVENEGDGFSQGSGLVGLGRGPLSLISQLKET---KF 253

Query: 286 SYCLPSTQDAGASGSLILGGNSSVFKNST------PITYT-NMIPNPQLATFYILNLTGI 338
           SYCL S +D+ AS SL +G  +S   N T       +T T +++ NP   +FY L L GI
Sbjct: 254 SYCLTSIEDSEASSSLFIGSLASGIVNKTGANLDGEVTKTMSLLRNPDQPSFYYLELQGI 313

Query: 339 SIGGKQL-------QASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGF 391
           ++G K+L       + S    GG++IDSGT IT L  + +  LK EF  + S      G 
Sbjct: 314 TVGAKRLSVEKSTFELSEDGTGGMIIDSGTTITYLEETAFKVLKEEFTSRMSLPVDDSGS 373

Query: 392 SILDTCFNL-SAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV-CLALASLSYE 449
           + LD CF L +A + + +P +   F+G     +++ G  Y V   ++ V CLA+ S    
Sbjct: 374 TGLDLCFKLPNAAKNIAVPKLIFHFKG---ADLELPGENYMVADSSTGVLCLAMGS---S 427

Query: 450 DETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
           +   I GN QQ+N  V++D +   + F   +C  +
Sbjct: 428 NGMSIFGNVQQQNFNVLHDLEKETVTFVPTECGKL 462


>gi|449453872|ref|XP_004144680.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
           CELL 1-like [Cucumis sativus]
          Length = 757

 Score =  198 bits (503), Expect = 6e-48,   Method: Compositional matrix adjust.
 Identities = 137/393 (34%), Positives = 198/393 (50%), Gaps = 47/393 (11%)

Query: 124 LTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISP 181
           L SG+ L +  Y   + +G   ++ ++I+DTGSDL W+QC PC  C+ Q  P +DP  S 
Sbjct: 185 LESGVSLGSGEYFIDVFIGSPPKHFSLILDTGSDLNWIQCVPCFDCFEQNGPYYDPKDSI 244

Query: 182 SYKKVLCNSSTCHALEFATGNSGVCSSSSPP--------DCNYFVSYGDGSYTRGELGRE 233
           S++ + CN   C           + SS  PP         C YF  YGD S T G+   E
Sbjct: 245 SFRNITCNDPRCQ----------LVSSPDPPRPCKFETQSCPYFYWYGDSSNTTGDFALE 294

Query: 234 HLGL-------GKAS---VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGG 283
              +       GK+    V + +FGCG  N+GLF G +GL+GLGR  LS  SQ   ++G 
Sbjct: 295 TFTVNLTSSTTGKSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGH 354

Query: 284 LFSYCL-PSTQDAGASGSLILGGNSSVFKNSTPITYTNMIP---NPQLATFYILNLTGIS 339
            FSYCL     D   S  LI G +  +  +   + +T++I    NP + TFY L +  I 
Sbjct: 355 SFSYCLVDRDSDTSVSSKLIFGEDKDLLTHPE-LNFTSLIAGKENP-VDTFYYLQIKSIF 412

Query: 340 IGGKQLQA-------SGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFS 392
           +GG++LQ        S    GG +IDSGT ++      Y  +K  FL++  G+     F 
Sbjct: 413 VGGEKLQIPEENWNLSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKLVEDFP 472

Query: 393 ILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVK-SDASQVCLALASLSYEDE 451
           IL  C+N+S   E+N P   ++F   A     V    YF++      VCLA+     +  
Sbjct: 473 ILHPCYNVSGTDELNFPEFLIQFADGAVWNFPVEN--YFIRIQQLDIVCLAMLGTP-KSA 529

Query: 452 TGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
             IIGNYQQ+N  ++YDTKNS+LG+A   C+ +
Sbjct: 530 LSIIGNYQQQNFHILYDTKNSRLGYAPMRCAEI 562


>gi|223949775|gb|ACN28971.1| unknown [Zea mays]
 gi|414590177|tpg|DAA40748.1| TPA: hypothetical protein ZEAMMB73_257146 [Zea mays]
          Length = 510

 Score =  197 bits (502), Expect = 8e-48,   Method: Compositional matrix adjust.
 Identities = 138/445 (31%), Positives = 220/445 (49%), Gaps = 34/445 (7%)

Query: 66  AITLELKHKNYCSGKIVD---WNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTE- 121
           ++ L +KH++   G+       ++ +++ + ++ +H +  +S +  M + +    + +E 
Sbjct: 74  SLQLRMKHRSAEGGRTRKESFLDKAEKDAVRIETMHRRAARSGVARMPASSSPRRALSER 133

Query: 122 --IPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDP 177
               + SG+ + +  Y+  + +G   R   +I+DTGSDL W+QC PC  C+ Q+ PVFDP
Sbjct: 134 MVATVESGVAVGSGEYLIDVYVGTPPRRFRMIMDTGSDLNWLQCAPCLDCFEQRGPVFDP 193

Query: 178 SISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL 237
           + S SY+ V C    C  L         C   +   C Y+  YGD S T G+L  E   +
Sbjct: 194 AASSSYRNVTCGDQRC-GLVAPPEAPRACRRPAEDSCPYYYWYGDQSNTTGDLALESFTV 252

Query: 238 ------GKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPS 291
                     V+  +FGCG  N+GLF G +GL+GLGR  LS  SQ   ++G  FSYCL  
Sbjct: 253 NLTAPGASRRVDGVVFGCGHRNRGLFHGAAGLLGLGRGPLSFASQLRAVYGHTFSYCL-- 310

Query: 292 TQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLA-TFYILNLTGISIGGKQLQASG- 349
            +    +GS ++ G   +      + YT   P    A TFY + L G+ +GG  L  S  
Sbjct: 311 VEHGSDAGSKVVFGEDYLVLAHPQLKYTAFAPTSSPADTFYYVKLKGVLVGGDLLNISSD 370

Query: 350 ------FAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSG-FPSAPGFSILDTCFNLSA 402
                    GG +IDSGT ++      Y  ++  F+   S  +P  P F +L+ C+N+S 
Sbjct: 371 TWDVGKDGSGGTIIDSGTTLSYFVEPAYQVIRQAFVDLMSRLYPLIPDFPVLNPCYNVSG 430

Query: 403 YQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV-CLALASLSYEDETG--IIGNYQ 459
            +   +P + + F   A    D     YFV+ D   + CLA+        TG  IIGN+Q
Sbjct: 431 VERPEVPELSLLFADGA--VWDFPAENYFVRLDPDGIMCLAVRGTP---RTGMSIIGNFQ 485

Query: 460 QKNQRVIYDTKNSQLGFAGEDCSSM 484
           Q+N  V+YD +N++LGFA   C+ +
Sbjct: 486 QQNFHVVYDLQNNRLGFAPRRCAEV 510


>gi|125553531|gb|EAY99240.1| hypothetical protein OsI_21202 [Oryza sativa Indica Group]
          Length = 475

 Score =  197 bits (502), Expect = 8e-48,   Method: Compositional matrix adjust.
 Identities = 134/376 (35%), Positives = 195/376 (51%), Gaps = 27/376 (7%)

Query: 123 PLTSGIRLQTLNYIATIELGGRNMT--VIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSIS 180
           PL SG+   +  Y A + +G    T  +++DTGSD+ W+QC PC+ CY Q   VFDP  S
Sbjct: 110 PLLSGLPQGSGEYFAQVGVGTPATTALMVLDTGSDVVWLQCAPCRHCYAQSGRVFDPRRS 169

Query: 181 PSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGK- 239
            SY  V C +  C  L+ A  +    S      C Y V+YGDGS T G+   E L   + 
Sbjct: 170 RSYAAVDCVAPICRRLDSAGCDRRRNS------CLYQVAYGDGSVTAGDFASETLTFARG 223

Query: 240 ASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL---PSTQDAG 296
           A V     GCG +N+GLF   SGL+GLGR  LS  +Q +  FG  FSYCL    S+    
Sbjct: 224 ARVQRVAIGCGHDNEGLFIAASGLLGLGRGRLSFPTQIARSFGRSFSYCLVDRTSSVRPS 283

Query: 297 ASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQA--------- 347
           ++ S  +   +     +   ++T M  NP++ATFY ++L G S+GG +++          
Sbjct: 284 STRSSTVTFGAGAVAAAAGASFTPMGRNPRMATFYYVHLLGFSVGGARVKGVSQSDLRLN 343

Query: 348 SGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAP-GFSILDTCFNLSAYQEV 406
               +GG+++DSGT +TRL   +Y A++  F     G   +P GFS+ DTC+NLS  + V
Sbjct: 344 PTTGRGGVILDSGTSVTRLARPVYEAVRDAFRAAAVGLRVSPGGFSLFDTCYNLSGRRVV 403

Query: 407 NIPLVKMEFEGNAEMTVDVTGIVYFVKSDAS-QVCLALASLSYEDETGIIGNYQQKNQRV 465
            +P V M   G A + +      Y +  D S   C A+A    +    IIGN QQ+  RV
Sbjct: 404 KVPTVSMHLAGGASVALPPEN--YLIPVDTSGTFCFAMAGT--DGGVSIIGNIQQQGFRV 459

Query: 466 IYDTKNSQLGFAGEDC 481
           ++D    ++GF  + C
Sbjct: 460 VFDGDAQRVGFVPKSC 475


>gi|449519146|ref|XP_004166596.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 752

 Score =  197 bits (502), Expect = 8e-48,   Method: Compositional matrix adjust.
 Identities = 137/393 (34%), Positives = 198/393 (50%), Gaps = 47/393 (11%)

Query: 124 LTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISP 181
           L SG+ L +  Y   + +G   ++ ++I+DTGSDL W+QC PC  C+ Q  P +DP  S 
Sbjct: 185 LESGVSLGSGEYFIDVFIGSPPKHFSLILDTGSDLNWIQCVPCFDCFEQNGPYYDPKDSI 244

Query: 182 SYKKVLCNSSTCHALEFATGNSGVCSSSSPP--------DCNYFVSYGDGSYTRGELGRE 233
           S++ + CN   C           + SS  PP         C YF  YGD S T G+   E
Sbjct: 245 SFRNITCNDPRCQ----------LVSSPDPPRPCKFETQSCPYFYWYGDSSNTTGDFALE 294

Query: 234 HLGL-------GKAS---VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGG 283
              +       GK+    V + +FGCG  N+GLF G +GL+GLGR  LS  SQ   ++G 
Sbjct: 295 TFTVNLTSSTTGKSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGH 354

Query: 284 LFSYCL-PSTQDAGASGSLILGGNSSVFKNSTPITYTNMIP---NPQLATFYILNLTGIS 339
            FSYCL     D   S  LI G +  +  +   + +T++I    NP + TFY L +  I 
Sbjct: 355 SFSYCLVDRDSDTSVSSKLIFGEDKDLLTHPE-LNFTSLIAGKENP-VDTFYYLQIKSIF 412

Query: 340 IGGKQLQA-------SGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFS 392
           +GG++LQ        S    GG +IDSGT ++      Y  +K  FL++  G+     F 
Sbjct: 413 VGGEKLQIPEENWNLSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKLVEDFP 472

Query: 393 ILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVK-SDASQVCLALASLSYEDE 451
           IL  C+N+S   E+N P   ++F   A     V    YF++      VCLA+     +  
Sbjct: 473 ILHPCYNVSGTDELNFPEFLIQFADGAVWNFPVEN--YFIRIQQLDIVCLAMLGTP-KSA 529

Query: 452 TGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
             IIGNYQQ+N  ++YDTKNS+LG+A   C+ +
Sbjct: 530 LSIIGNYQQQNFHILYDTKNSRLGYAPMRCAEI 562


>gi|312281631|dbj|BAJ33681.1| unnamed protein product [Thellungiella halophila]
          Length = 502

 Score =  197 bits (501), Expect = 9e-48,   Method: Compositional matrix adjust.
 Identities = 139/440 (31%), Positives = 224/440 (50%), Gaps = 44/440 (10%)

Query: 67  ITLELKHKN-YCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISG----NIK--DVSN 119
           ++LEL  ++   + +  D+     +RL  D+  V  + ++I+  + G    ++K  D+  
Sbjct: 82  LSLELHSRDTLVASQHKDYKSLVLSRLERDSSRVAGIAAKIRFAVEGIDRSDLKPVDIDE 141

Query: 120 TEI-------PLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQ 170
           T         P+ SG    +  Y + I +G   + M V++DTGSD+ W+QC PC  CY Q
Sbjct: 142 TRFQPEDLTTPVVSGTSQGSGEYFSRIGVGTPAKEMYVVLDTGSDVNWIQCLPCSECYQQ 201

Query: 171 QDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGEL 230
            DP+FDP+ S ++K + C+   C +L+ +      C S+    C Y VSYGDGS+T G  
Sbjct: 202 SDPIFDPTSSSTFKSLTCSDPKCASLDVS-----ACRSNK---CLYQVSYGDGSFTVGNY 253

Query: 231 GREHLGLGKA-SVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL 289
             + +  G++  VND   GCG +N+GLF G +GL+GLG   LS+   T++I    FSYCL
Sbjct: 254 ATDTVTFGESGKVNDVALGCGHDNEGLFTGAAGLLGLGGGALSM---TNQIKAKSFSYCL 310

Query: 290 PSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL---- 345
              +D+  S SL          ++T      ++ N ++ TFY + L+G S+GG+Q+    
Sbjct: 311 VD-RDSAKSSSLDFNSVQIGAGDAT----APLLRNSKMDTFYYVGLSGFSVGGQQVSIPS 365

Query: 346 ---QASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPG-FSILDTCFNLS 401
              +      GG+++D GT +TRL    Y++L+  F+K  + F       S+ DTC++ S
Sbjct: 366 SLFEVDASGAGGVILDCGTAVTRLQTQAYNSLRDAFVKLTTDFKKGTSPISLFDTCYDFS 425

Query: 402 AYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQK 461
           +   V +P V   F G   + +     +  +  DA   C A A  S      IIGN QQ+
Sbjct: 426 SLSTVKVPTVTFHFTGGKSLNLPAKNYLIPID-DAGTFCFAFAPTS--SSLSIIGNVQQQ 482

Query: 462 NQRVIYDTKNSQLGFAGEDC 481
             R+ YD  N+ +G +   C
Sbjct: 483 GTRITYDLANNLIGLSANKC 502


>gi|194690050|gb|ACF79109.1| unknown [Zea mays]
          Length = 166

 Score =  197 bits (501), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 89/152 (58%), Positives = 123/152 (80%), Gaps = 1/152 (0%)

Query: 330 FYILNLTGISIGGKQLQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAP 389
           FY++NLTGI++GG++++++GF+   I +DSGTVIT L PS+Y+A++AEF+ Q + +P AP
Sbjct: 13  FYLVNLTGITVGGQEVESTGFSARAI-VDSGTVITSLVPSVYNAVRAEFMSQLAEYPQAP 71

Query: 390 GFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYE 449
           GFSILDTCFN++  +EV +P + + F+G AE+ VD  G++YFV SD+SQVCLA+ASL  E
Sbjct: 72  GFSILDTCFNMTGLKEVQVPSLTLVFDGGAEVEVDSGGVLYFVSSDSSQVCLAVASLKSE 131

Query: 450 DETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
           DET IIGNYQQKN RV++DT  SQ+GFA E C
Sbjct: 132 DETSIIGNYQQKNLRVVFDTSASQVGFAQETC 163


>gi|449439383|ref|XP_004137465.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 491

 Score =  197 bits (501), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 147/421 (34%), Positives = 213/421 (50%), Gaps = 44/421 (10%)

Query: 83  DWNEQQQNRLILDNLHVQYLQSRIKNMISGNIK---DVSNTEI-------PLTSGIRLQT 132
           D+     +RL  D+  V+ +  R++  +S   +   +   TEI       P+ SG    +
Sbjct: 93  DYKSLVLSRLSRDSSRVKSIYDRLEFALSELKRSDLEPLKTEILPEDLSTPIISGTSQGS 152

Query: 133 LNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNS 190
             Y + + +G   +   +++DTGSD+ W+QCQPC  CY Q DP+FDP  S S+  + C S
Sbjct: 153 GEYFSRVGVGQPAKPFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPRSSSSFASLPCES 212

Query: 191 STCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS-VNDFIFGC 249
             C ALE     SG C +S    C Y VSYGDGS+T GE   E L  G +  +N+   GC
Sbjct: 213 QQCQALE----TSG-CRASK---CLYQVSYGDGSFTVGEFVIETLTFGNSGMINNVAVGC 264

Query: 250 GRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSV 309
           G +N+GLF    G  GL       +S TS++    FSYCL    D  +S S  L  NS+ 
Sbjct: 265 GHDNEGLF---VGSAGLLGLGGGSLSLTSQMKASSFSYCL---VDRDSSSSSDLEFNSAA 318

Query: 310 FKNSTPITYTNMIPNPQLATFYILNLTGISIGGK---------QLQASGFAKGGILIDSG 360
             +S       ++ + ++ TFY + LTG+S+GG+         Q+  SG+  GGI++DSG
Sbjct: 319 PSDS---VNAPLLKSGKVDTFYYVGLTGMSVGGQLLSIPPNLFQMDDSGY--GGIIVDSG 373

Query: 361 TVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAE 420
           T ITRL    Y+ L+  F+ +        GF++ DTC++LS+   V IP V  EF G   
Sbjct: 374 TAITRLQTQAYNTLRDAFVSRTPYLKKTNGFALFDTCYDLSSQSRVTIPTVSFEFAGGKS 433

Query: 421 MTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGED 480
           + +     +  V S     C A A  +      IIGN QQ+  RV YD  NS +GF+   
Sbjct: 434 LQLPPKNYLIPVDS-VGTFCFAFAPTT--SSLSIIGNVQQQGTRVHYDLANSVVGFSPHK 490

Query: 481 C 481
           C
Sbjct: 491 C 491


>gi|30678047|ref|NP_565298.2| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|30102688|gb|AAP21262.1| At2g03200 [Arabidopsis thaliana]
 gi|110736021|dbj|BAE99983.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|330250580|gb|AEC05674.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 461

 Score =  197 bits (501), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 135/356 (37%), Positives = 188/356 (52%), Gaps = 33/356 (9%)

Query: 147 TVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVC 206
           + IVDTGSDL W QC+PC  C++Q  P+FDP  S SY KV C+S  C+AL  +  N    
Sbjct: 121 SAIVDTGSDLIWTQCKPCTECFDQPTPIFDPEKSSSYSKVGCSSGLCNALPRSNCN---- 176

Query: 207 SSSSPPDCNYFVSYGDGSYTRGELGREHLGL-GKASVNDFIFGCGRNNKGL-FGGVSGLM 264
                  C Y  +YGD S TRG L  E      + S++   FGCG  N+G  F   SGL+
Sbjct: 177 --EDKDACEYLYTYGDYSSTRGLLATETFTFEDENSISGIGFGCGVENEGDGFSQGSGLV 234

Query: 265 GLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNST------PITY 318
           GLGR  LSL+SQ  E     FSYCL S +D+ AS SL +G  +S   N T       +T 
Sbjct: 235 GLGRGPLSLISQLKET---KFSYCLTSIEDSEASSSLFIGSLASGIVNKTGASLDGEVTK 291

Query: 319 T-NMIPNPQLATFYILNLTGISIGGKQL--QASGF-----AKGGILIDSGTVITRLPPSI 370
           T +++ NP   +FY L L GI++G K+L  + S F       GG++IDSGT IT L  + 
Sbjct: 292 TMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFELAEDGTGGMIIDSGTTITYLEETA 351

Query: 371 YSALKAEFLKQFSGFPSAPGFSILDTCFNL-SAYQEVNIPLVKMEFEGNAEMTVDVTGIV 429
           +  LK EF  + S      G + LD CF L  A + + +P +   F+G     +++ G  
Sbjct: 352 FKVLKEEFTSRMSLPVDDSGSTGLDLCFKLPDAAKNIAVPKMIFHFKG---ADLELPGEN 408

Query: 430 YFVKSDASQV-CLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
           Y V   ++ V CLA+ S    +   I GN QQ+N  V++D +   + F   +C  +
Sbjct: 409 YMVADSSTGVLCLAMGS---SNGMSIFGNVQQQNFNVLHDLEKETVSFVPTECGKL 461


>gi|168059885|ref|XP_001781930.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162666576|gb|EDQ53226.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 355

 Score =  197 bits (500), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 126/363 (34%), Positives = 194/363 (53%), Gaps = 30/363 (8%)

Query: 135 YIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
           Y+AT+ LG   R  +VIVDTGSDLTWVQC PC +CY+Q D +F P+ S S+ K+ C +  
Sbjct: 3   YLATVRLGTPERVFSVIVDTGSDLTWVQCSPCGTCYSQNDSLFIPNTSTSFTKLACGTEL 62

Query: 193 CHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG-----KASVNDFIF 247
           C+ L +      +C+ ++   C Y+ SYGDGS + G+   + + +      K  V +F F
Sbjct: 63  CNGLPYP-----MCNQTT---CVYWYSYGDGSLSTGDFVYDTITMDGINGQKQQVPNFAF 114

Query: 248 GCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNS 307
           GCG +N+G F G  G++GLG+  LS  SQ   +F G FSYCL          S +L G++
Sbjct: 115 GCGHDNEGSFAGADGILGLGQGPLSFPSQLKTVFNGKFSYCLVDWLAPPTQTSPLLFGDA 174

Query: 308 SVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA-------KGGILIDSG 360
           +V      + Y +++ NP++ T+Y + L GIS+GGK L  S  A       + G + DSG
Sbjct: 175 AV-PTFPGVKYISLLTNPKVPTYYYVKLNGISVGGKLLNISSTAFDIDSVGRAGTIFDSG 233

Query: 361 TVITRLPPSIYSALKAEFLKQFSGFP-SAPGFSILDTCF-NLSAYQEVNIPLVKMEFEGN 418
           T +T+L   ++  + A        +P  +   S LD C    +  Q   +P +   FEG 
Sbjct: 234 TTVTQLAGEVHQEVLAAMNASTMDYPRKSDDSSGLDLCLGGFAEGQLPTVPSMTFHFEG- 292

Query: 419 AEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAG 478
            +M +  +    F++S  S  C ++ S     +  IIG+ QQ+N +V YDT   ++GF  
Sbjct: 293 GDMELPPSNYFIFLESSQS-YCFSMVS---SPDVTIIGSIQQQNFQVYYDTVGRKIGFVP 348

Query: 479 EDC 481
           + C
Sbjct: 349 KSC 351


>gi|449527149|ref|XP_004170575.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 487

 Score =  196 bits (499), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 146/422 (34%), Positives = 218/422 (51%), Gaps = 43/422 (10%)

Query: 83  DWNEQQQNRLILDNLHVQYLQSRIKNMISGNI---KDVSNTEI------PLTSG-IRLQT 132
           D+N   + RL  D   VQ+L   ++  ++G     + ++ + I      P+ SG  +   
Sbjct: 86  DYNTLVRARLTRDAARVQFLNRNLERSLNGGTHFGESINESLIGDSITAPVVSGQSKGSG 145

Query: 133 LNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKS---CYNQQDPVFDPSISPSYKKVL 187
             Y+A I +G   +   ++ DTGSD+TW+QCQPC S   CY Q DP+FDP  S SY  + 
Sbjct: 146 AEYLAQIGVGQPVKLFYLVPDTGSDVTWLQCQPCASENTCYKQFDPIFDPKSSSSYSPLS 205

Query: 188 CNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFI 246
           CNS  C  L+ A  NS  C         Y V YGDGS+T GEL  E L  G + S+ +  
Sbjct: 206 CNSQQCKLLDKANCNSDTCI--------YQVHYGDGSFTTGELATETLSFGNSNSIPNLP 257

Query: 247 FGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGN 306
            GCG +N+GLF G +GL+GLG   +SL SQ   +    FSYCL +  D+ +S +L     
Sbjct: 258 IGCGHDNEGLFAGGAGLIGLGGGAISLSSQ---LKASSFSYCLVNL-DSDSSSTLEFNS- 312

Query: 307 SSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGF-------AKGGILIDS 359
              +  S  +T + ++ N +  ++  + + GIS+GGK L  S           GGI++DS
Sbjct: 313 ---YMPSDSLT-SPLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEIDESGLGGIIVDS 368

Query: 360 GTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNA 419
           GT+I+RLP  +Y +L+  F+K  S    APG S+ DTC+N S    V +P +        
Sbjct: 369 GTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFSGQSNVEVPTIAFVLSEGT 428

Query: 420 EMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGE 479
            + +     +  + + A   CLA   +  +    IIG++QQ+  RV YD  NS +GF+  
Sbjct: 429 SLRLPARNYLIMLDT-AGTYCLAF--IKTKSSLSIIGSFQQQGIRVSYDLTNSIVGFSTN 485

Query: 480 DC 481
            C
Sbjct: 486 KC 487


>gi|357130848|ref|XP_003567056.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 448

 Score =  196 bits (499), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 117/396 (29%), Positives = 202/396 (51%), Gaps = 35/396 (8%)

Query: 106 IKNMISGNI--KDVSNTEIPLTSGIRLQTLNYIATIELGGRNMT--VIVDTGSDLTWVQC 161
           + ++I G++   D  +   P+ SG+   +  Y A++ +G       +++DTGSD+ W+QC
Sbjct: 68  VASLIIGSLTAHDDDHLHSPVISGLPFASGEYFASVGVGTPPTPALLVIDTGSDVVWLQC 127

Query: 162 QPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYG 221
           +PC  CY Q  P++DP  S +Y +  C+   C   +   G +G         C Y + YG
Sbjct: 128 KPCVHCYRQLSPLYDPRGSSTYAQTPCSPPQCRNPQTCDGTTG--------GCGYRIVYG 179

Query: 222 DGSYTRGELGREHLGLG-KASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEI 280
           D S T G L  + L      SV +   GCG +N+GLFG  +GL+G+ R + S  +Q ++ 
Sbjct: 180 DASSTSGNLATDRLVFSNDTSVGNVTLGCGHDNEGLFGSAAGLLGVARGNNSFATQVADS 239

Query: 281 FGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISI 340
           +G  F+YCL     +G+S S ++ G ++    S+   +T +  NP+  + Y +++ G S+
Sbjct: 240 YGRYFAYCLGDRTRSGSSSSYLVFGRTAPEPPSS--VFTPLRSNPRRPSLYYVDMVGFSV 297

Query: 341 GGKQLQASGFA-----------KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGF---P 386
           GG+ +  +GF+           +GG+++DSGT ITR     Y AL+  F  + +      
Sbjct: 298 GGEPV--TGFSNASLSLDPATGRGGVVVDSGTSITRFARDAYGALRDAFDARAAKVGMRK 355

Query: 387 SAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV-CLALAS 445
              G S+ D C++L      + P V + F G A++ +      Y V  ++ +  C AL +
Sbjct: 356 VGRGISVFDACYDLRGVAVADAPGVVLHFAGGADVALPPEN--YLVPEESGRYHCFALEA 413

Query: 446 LSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
             + D   +IGN  Q+  RV++D +N ++GF    C
Sbjct: 414 AGH-DGLSVIGNVLQQRFRVVFDVENERVGFEPNGC 448


>gi|168041176|ref|XP_001773068.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162675615|gb|EDQ62108.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 365

 Score =  196 bits (498), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 130/375 (34%), Positives = 193/375 (51%), Gaps = 32/375 (8%)

Query: 124 LTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISP 181
            T+ +      Y+AT+ LG   R  +VIVDTGSDLTWVQC PC  CY+Q D +F P+ S 
Sbjct: 2   FTAPVAAARGEYLATVRLGTPERVFSVIVDTGSDLTWVQCSPCGKCYSQNDALFLPNTST 61

Query: 182 SYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG--- 238
           S+ K+ C S+ C+ L F      +C+ ++   C Y+ SYGDGS T G+   + + +    
Sbjct: 62  SFTKLACGSALCNGLPFP-----MCNQTT---CVYWYSYGDGSLTTGDFVYDTITMDGIN 113

Query: 239 --KASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAG 296
             K  V +F FGCG +N+G F G  G++GLG+  LS  SQ   ++ G FSYCL       
Sbjct: 114 GQKQQVPNFAFGCGHDNEGSFAGADGILGLGQGPLSFHSQLKSVYNGKFSYCLVDWLAPP 173

Query: 297 ASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQAS-------G 349
              S +L G+++V      + Y  ++ NP++ T+Y + L GIS+G   L  S        
Sbjct: 174 TQTSPLLFGDAAV-PILPDVKYLPILANPKVPTYYYVKLNGISVGDNLLNISSTVFDIDS 232

Query: 350 FAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSA-PGFSILDTCFN-LSAYQEVN 407
               G + DSGT +T+L  + Y  + A        +       S LD C +     Q   
Sbjct: 233 VGGAGTIFDSGTTVTQLAEAAYKEVLAAMNASTMAYSRKIDDISRLDLCLSGFPKDQLPT 292

Query: 408 IPLVKMEFEGNAEMTVDVTGIVYFVKSDASQ-VCLALASLSYEDETGIIGNYQQKNQRVI 466
           +P +   FEG  +M +  +   YF+  ++SQ  C A+ S     +  IIG+ QQ+N +V 
Sbjct: 293 VPAMTFHFEG-GDMVLPPSN--YFIYLESSQSYCFAMTS---SPDVNIIGSVQQQNFQVY 346

Query: 467 YDTKNSQLGFAGEDC 481
           YDT   +LGF  +DC
Sbjct: 347 YDTAGRKLGFVPKDC 361


>gi|20197342|gb|AAC34482.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
          Length = 353

 Score =  196 bits (497), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 135/356 (37%), Positives = 188/356 (52%), Gaps = 33/356 (9%)

Query: 147 TVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVC 206
           + IVDTGSDL W QC+PC  C++Q  P+FDP  S SY KV C+S  C+AL  +  N    
Sbjct: 13  SAIVDTGSDLIWTQCKPCTECFDQPTPIFDPEKSSSYSKVGCSSGLCNALPRSNCN---- 68

Query: 207 SSSSPPDCNYFVSYGDGSYTRGELGREHLGL-GKASVNDFIFGCGRNNKGL-FGGVSGLM 264
                  C Y  +YGD S TRG L  E      + S++   FGCG  N+G  F   SGL+
Sbjct: 69  --EDKDACEYLYTYGDYSSTRGLLATETFTFEDENSISGIGFGCGVENEGDGFSQGSGLV 126

Query: 265 GLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNST------PITY 318
           GLGR  LSL+SQ  E     FSYCL S +D+ AS SL +G  +S   N T       +T 
Sbjct: 127 GLGRGPLSLISQLKET---KFSYCLTSIEDSEASSSLFIGSLASGIVNKTGASLDGEVTK 183

Query: 319 T-NMIPNPQLATFYILNLTGISIGGKQL--QASGF-----AKGGILIDSGTVITRLPPSI 370
           T +++ NP   +FY L L GI++G K+L  + S F       GG++IDSGT IT L  + 
Sbjct: 184 TMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFELAEDGTGGMIIDSGTTITYLEETA 243

Query: 371 YSALKAEFLKQFSGFPSAPGFSILDTCFNL-SAYQEVNIPLVKMEFEGNAEMTVDVTGIV 429
           +  LK EF  + S      G + LD CF L  A + + +P +   F+G     +++ G  
Sbjct: 244 FKVLKEEFTSRMSLPVDDSGSTGLDLCFKLPDAAKNIAVPKMIFHFKG---ADLELPGEN 300

Query: 430 YFVKSDASQV-CLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
           Y V   ++ V CLA+ S    +   I GN QQ+N  V++D +   + F   +C  +
Sbjct: 301 YMVADSSTGVLCLAMGS---SNGMSIFGNVQQQNFNVLHDLEKETVSFVPTECGKL 353


>gi|449515031|ref|XP_004164553.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 430

 Score =  196 bits (497), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 132/339 (38%), Positives = 183/339 (53%), Gaps = 21/339 (6%)

Query: 149 IVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSS 208
           I DTGSDLTW QC PC  CY Q  P+F+P  S S+  V CN+ TCHA++   G+ GV   
Sbjct: 108 IADTGSDLTWAQCLPCLKCYQQLRPIFNPLKSTSFSHVPCNTQTCHAVD--DGHCGVQGV 165

Query: 209 SSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGR 268
                C+Y  +YGD +Y++G+LG E + +G +SV   I GCG  + G FG  SG++GLG 
Sbjct: 166 -----CDYSYTYGDRTYSKGDLGFEKITIGSSSVKSVI-GCGHASSGGFGFASGVIGLGG 219

Query: 269 SDLSLVSQTSEIFG--GLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQ 326
             LSLVSQ S+  G    FSYCLP T  + A+G +  G N+ V   S P   +  + +  
Sbjct: 220 GQLSLVSQMSQTSGISRRFSYCLP-TLLSHANGKINFGENAVV---SGPGVVSTPLISKN 275

Query: 327 LATFYILNLTGISIGGKQLQASGFAK-GGILIDSGTVITRLPPSIYSALKAEFLKQFSGF 385
             T+Y + L  ISIG ++  A  FAK G ++IDSGT +T LP  +Y  + +  LK     
Sbjct: 276 TVTYYYITLEAISIGNERHMA--FAKQGNVIIDSGTTLTILPKELYDGVVSSLLKVVKAK 333

Query: 386 PSAPGFSILDTCFN--LSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLAL 443
                   LD CF+  ++A   + IP++   F G A   V++  I  F K   +  CL L
Sbjct: 334 RVKDPHGSLDLCFDDGINAAASLGIPVITAHFSGGAN--VNLLPINTFRKVADNVNCLTL 391

Query: 444 ASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
            + S   E GIIGN  Q N  + YD +  +L F    C+
Sbjct: 392 KAASPTTEFGIIGNLAQANFLIGYDLEAKRLSFKPTVCA 430


>gi|61214233|sp|Q766C3.1|NEP1_NEPGR RecName: Full=Aspartic proteinase nepenthesin-1; AltName:
           Full=Nepenthesin-I; Flags: Precursor
 gi|41016421|dbj|BAD07474.1| aspartic proteinase nepenthesin I [Nepenthes gracilis]
          Length = 437

 Score =  195 bits (496), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 133/360 (36%), Positives = 189/360 (52%), Gaps = 33/360 (9%)

Query: 135 YIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
           Y+  + +G   +  + I+DTGSDL W QCQPC  C+NQ  P+F+P  S S+  + C+S  
Sbjct: 95  YLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPCTQCFNQSTPIFNPQGSSSFSTLPCSSQL 154

Query: 193 CHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRN 252
           C AL     +S  CS++    C Y   YGDGS T+G +G E L  G  S+ +  FGCG N
Sbjct: 155 CQAL-----SSPTCSNNF---CQYTYGYGDGSETQGSMGTETLTFGSVSIPNITFGCGEN 206

Query: 253 NKGLF-GGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFK 311
           N+G   G  +GL+G+GR  LSL SQ        FSYC+     +  S +L+LG  ++   
Sbjct: 207 NQGFGQGNGAGLVGMGRGPLSLPSQLDVT---KFSYCMTPIGSSTPS-NLLLGSLANSVT 262

Query: 312 NSTPITYTNMIPNPQLATFYILNLTGISIGGKQL--QASGFA------KGGILIDSGTVI 363
             +P   T +I + Q+ TFY + L G+S+G  +L    S FA       GGI+IDSGT +
Sbjct: 263 AGSP--NTTLIQSSQIPTFYYITLNGLSVGSTRLPIDPSAFALNSNNGTGGIIIDSGTTL 320

Query: 364 TRLPPSIYSALKAEFLKQFSGFPSAPGFSI-LDTCFNL-SAYQEVNIPLVKMEFEGNAEM 421
           T    + Y +++ EF+ Q +  P   G S   D CF   S    + IP   M F+G    
Sbjct: 321 TYFVNNAYQSVRQEFISQIN-LPVVNGSSSGFDLCFQTPSDPSNLQIPTFVMHFDGG--- 376

Query: 422 TVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
            +++    YF+      +CLA+ S S      I GN QQ+N  V+YDT NS + FA   C
Sbjct: 377 DLELPSENYFISPSNGLICLAMGSSS--QGMSIFGNIQQQNMLVVYDTGNSVVSFASAQC 434


>gi|165292434|dbj|BAF98915.1| aspartic proteinase nepenthesin I [Nepenthes alata]
          Length = 437

 Score =  194 bits (494), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 136/389 (34%), Positives = 199/389 (51%), Gaps = 39/389 (10%)

Query: 105 RIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQ 162
           R++ M++G     S  E P+ +G       Y+  + +G   +  + I+DTGSDL W QCQ
Sbjct: 73  RLEAMLNG----PSGVETPVYAGDG----EYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQ 124

Query: 163 PCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGD 222
           PC  C+NQ  P+F+P  S S+  + C+S  C AL+  T     CS++S   C Y   YGD
Sbjct: 125 PCTQCFNQSTPIFNPQGSSSFSTLPCSSQLCQALQSPT-----CSNNS---CQYTYGYGD 176

Query: 223 GSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLF-GGVSGLMGLGRSDLSLVSQTSEIF 281
           GS T+G +G E L  G  S+ +  FGCG NN+G   G  +GL+G+GR  LSL SQ     
Sbjct: 177 GSETQGSMGTETLTFGSVSIPNITFGCGENNQGFGQGNGAGLVGMGRGPLSLPSQLDVT- 235

Query: 282 GGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIG 341
              FSYC+     + +S +L+LG  ++     +P   T +I + Q+ TFY + L G+S+G
Sbjct: 236 --KFSYCMTPIGSSNSS-TLLLGSLANSVTAGSP--NTTLIQSSQIPTFYYITLNGLSVG 290

Query: 342 GKQLQA--------SGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI 393
              L          S    GGI+IDSGT +T    + Y A++  F+ Q +        S 
Sbjct: 291 STPLPIDPSVFKLNSNNGTGGIIIDSGTTLTYFVDNAYQAVRQAFISQMNLSVVNGSSSG 350

Query: 394 LDTCFNLSAYQ-EVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDET 452
            D CF + + Q  + IP   M F+G  ++ +      YF+      +CLA+ S S     
Sbjct: 351 FDLCFQMPSDQSNLQIPTFVMHFDG-GDLVLPSEN--YFISPSNGLICLAMGSSS--QGM 405

Query: 453 GIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
            I GN QQ+N  V+YDT NS + F    C
Sbjct: 406 SIFGNIQQQNLLVVYDTGNSVVSFLSAQC 434


>gi|409179878|gb|AFV26024.1| aspartic proteinase nepenthesin 1 [Nepenthes mirabilis]
          Length = 437

 Score =  194 bits (493), Expect = 8e-47,   Method: Compositional matrix adjust.
 Identities = 136/389 (34%), Positives = 199/389 (51%), Gaps = 39/389 (10%)

Query: 105 RIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQ 162
           R++ M++G     S  E P+ +G       Y+  + +G   +  + I+DTGSDL W QCQ
Sbjct: 73  RLEAMLNG----PSGVETPVYAGDG----EYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQ 124

Query: 163 PCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGD 222
           PC  C+NQ  P+F+P  S S+  + C+S  C AL+     S  CS++S   C Y   YGD
Sbjct: 125 PCTQCFNQSTPIFNPQGSSSFSTLPCSSQLCQALQ-----SPTCSNNS---CQYTYGYGD 176

Query: 223 GSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLF-GGVSGLMGLGRSDLSLVSQTSEIF 281
           GS T+G +G E L  G  S+ +  FGCG NN+G   G  +GL+G+GR  LSL SQ     
Sbjct: 177 GSETQGSMGTETLTFGSVSIPNITFGCGENNQGFGQGNGAGLVGMGRGPLSLPSQLDVT- 235

Query: 282 GGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIG 341
              FSYC+ +   +  S +L+LG  ++     +P   T +I + Q+ TFY + L G+S+G
Sbjct: 236 --KFSYCM-TPIGSSTSSTLLLGSLANSVTAGSP--NTTLIESSQIPTFYYITLNGLSVG 290

Query: 342 GKQLQA--------SGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI 393
              L          S    GGI+IDSGT +T    + Y A++  F+ Q +        S 
Sbjct: 291 STPLPIDPSVFKLNSNNGTGGIIIDSGTTLTYFADNAYQAVRQAFISQMNLSVVNGSSSG 350

Query: 394 LDTCFNLSAYQ-EVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDET 452
            D CF + + Q  + IP   M F+G  ++ +      YF+      +CLA+ S S     
Sbjct: 351 FDLCFQMPSDQSNLQIPTFVMHFDG-GDLVLPSEN--YFISPSNGLICLAMGSSS--QGM 405

Query: 453 GIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
            I GN QQ+N  V+YDT NS + F    C
Sbjct: 406 SIFGNIQQQNLLVVYDTGNSVVSFLFAQC 434


>gi|168040957|ref|XP_001772959.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162675692|gb|EDQ62184.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 351

 Score =  194 bits (493), Expect = 8e-47,   Method: Compositional matrix adjust.
 Identities = 123/358 (34%), Positives = 182/358 (50%), Gaps = 24/358 (6%)

Query: 135 YIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
           Y+  I LG   +  + IVDTGSDL WVQC PC  C+ Q DP+F P  S SY    C  S 
Sbjct: 8   YVLQISLGTPPQQFSAIVDTGSDLCWVQCAPCARCFEQPDPLFIPLASSSYSNASCTDSL 67

Query: 193 CHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRN 252
           C AL   T     CS  +   C Y  SYGDGS TRG+   E + L  +++    FGCG N
Sbjct: 68  CDALPRPT-----CSMRN--TCTYSYSYGDGSNTRGDFAFETVTLNGSTLARIGFGCGHN 120

Query: 253 NKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKN 312
            +G F G  GL+GLG+  LSL SQ +  F  +FSYCL      G    +  G  +   + 
Sbjct: 121 QEGTFAGADGLIGLGQGPLSLPSQLNSSFTHIFSYCLVDQSTTGTFSPITFGNAAENSRA 180

Query: 313 STPITYTNMIPNPQLATFYILNLTGISIGGKQLQA--SGF-----AKGGILIDSGTVITR 365
           S    +T ++ N    ++Y + +  IS+G +++    S F       GG+++DSGT IT 
Sbjct: 181 S----FTPLLQNEDNPSYYYVGVESISVGNRRVPTPPSAFRIDANGVGGVILDSGTTITY 236

Query: 366 LPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFE-GNAEMTVD 424
              + +  + AE  +Q S   + P    L+ C+++S+    ++ L  M     N +  + 
Sbjct: 237 WRLAAFIPILAELRRQISYPEADPTPYGLNLCYDISSVSASSLTLPSMTVHLTNVDFEIP 296

Query: 425 VTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
           V+ +   V +    VC A+++    D+  IIGN QQ+N  ++ D  NS++GF   DCS
Sbjct: 297 VSNLWVLVDNFGETVCTAMST---SDQFSIIGNVQQQNNLIVTDVANSRVGFLATDCS 351


>gi|218197465|gb|EEC79892.1| hypothetical protein OsI_21411 [Oryza sativa Indica Group]
          Length = 720

 Score =  194 bits (493), Expect = 9e-47,   Method: Compositional matrix adjust.
 Identities = 120/328 (36%), Positives = 178/328 (54%), Gaps = 32/328 (9%)

Query: 147 TVIVDTGSDLTWVQCQPCK--SCYNQQDPVFDPSISPSYKKVLCNSSTCHAL-EFATGNS 203
           TVI+D+GSD++WVQC+PC    C+ Q+DP+FDP++S +Y  V C S+ C  L  +  G  
Sbjct: 169 TVIIDSGSDVSWVQCKPCPLPMCHRQRDPLFDPAMSTTYAAVPCTSAACAQLGPYRRG-- 226

Query: 204 GVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASV-NDFIFGCGRNNKG--LFGGV 260
             CS+++   C + ++YGDGS   G    + L LG   V   F FGC   ++G      V
Sbjct: 227 --CSANA--QCQFGINYGDGSTATGTYSFDDLTLGPYDVIRGFRFGCAHADRGSAFDYDV 282

Query: 261 SGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFK-----NSTP 315
           +G + LG    SLV QT+  +G +FSYCLP T  A + G L+LG      +      STP
Sbjct: 283 AGSLALGGGSQSLVQQTATRYGRVFSYCLPPT--ASSLGFLVLGVPPERAQLIPSFVSTP 340

Query: 316 ITYTNMIPNPQLATFYILNLTGISIGGKQLQAS-GFAKGGILIDSGTVITRLPPSIYSAL 374
           +  ++M P     TFY + L  I + G+ L           +IDS T+I+RLPP+ Y AL
Sbjct: 341 LLSSSMAP-----TFYRVLLRAIIVAGRPLAVPPAVFSASSVIDSSTIISRLPPTAYQAL 395

Query: 375 KAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKS 434
           +A F    + + +AP  SILDTC++ +  + + +P + + F+G A + +D  GI+     
Sbjct: 396 RAAFRSAMTMYRAAPPVSILDTCYDFTGVRSITLPSIALVFDGGATVNLDAAGILL---- 451

Query: 435 DASQVCLALASLSYEDETGIIGNYQQKN 462
                CLA A  + +   G IGN QQK 
Sbjct: 452 ---GSCLAFAPTASDRMPGFIGNVQQKT 476



 Score =  116 bits (290), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 83/283 (29%), Positives = 135/283 (47%), Gaps = 49/283 (17%)

Query: 206 CSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMG 265
           CS+++   C + ++YGDGS   G    + L LG   V+                      
Sbjct: 480 CSANA--QCQFGINYGDGSTATGTYSFDDLTLGPYDVD---------------------- 515

Query: 266 LGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILG---GNSSVFKN--STPITYTN 320
             R  L L  +T+  +G +FSYC+P +  +   G + LG     +++     STP+  ++
Sbjct: 516 --RQGLPL--RTATQYGRVFSYCIPPSPSS--LGFITLGVPPQRAALVPTFVSTPLLSSS 569

Query: 321 MIPNPQLATFYILNLTGISIGGKQLQASG--FAKGGILIDSGTVITRLPPSIYSALKAEF 378
            +P     TFY + L  I + G+ L      F+   + I S TVI+RLPP+ Y AL+A F
Sbjct: 570 SMP----PTFYRVLLRAIIVAGRPLPVPPTVFSTSSV-IASTTVISRLPPTAYQALRAAF 624

Query: 379 LKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQ 438
            +  + + +AP  SILDTC++ +  + + +P + + F+G A + +D  GI+        Q
Sbjct: 625 RRAMTMYRTAPPVSILDTCYDFTGVRSITLPSIALVFDGGATVNLDAAGILL-------Q 677

Query: 439 VCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
            CLA A  + +   G IGN QQ+   V+YD     + F    C
Sbjct: 678 GCLAFAPTATDRMPGFIGNVQQRTLEVVYDVPGKAIRFRSAAC 720


>gi|125595861|gb|EAZ35641.1| hypothetical protein OsJ_19928 [Oryza sativa Japonica Group]
          Length = 629

 Score =  194 bits (492), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 120/328 (36%), Positives = 178/328 (54%), Gaps = 32/328 (9%)

Query: 147 TVIVDTGSDLTWVQCQPCK--SCYNQQDPVFDPSISPSYKKVLCNSSTCHAL-EFATGNS 203
           TVI+D+GSD++WVQC+PC    C+ Q+DP+FDP++S +Y  V C S+ C  L  +  G  
Sbjct: 78  TVIIDSGSDVSWVQCKPCPLPMCHRQRDPLFDPAMSTTYAAVPCTSAACAQLGPYRRG-- 135

Query: 204 GVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASV-NDFIFGCGRNNKG--LFGGV 260
             CS+++   C + ++YGDGS   G    + L LG   V   F FGC   ++G      V
Sbjct: 136 --CSANA--QCQFGINYGDGSTATGTYSFDDLTLGPYDVIRGFRFGCAHADRGSAFDYDV 191

Query: 261 SGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFK-----NSTP 315
           +G + LG    SLV QT+  +G +FSYCLP T  A + G L+LG      +      STP
Sbjct: 192 AGSLALGGGSQSLVQQTATRYGRVFSYCLPPT--ASSLGFLVLGVPPERAQLIPSFVSTP 249

Query: 316 ITYTNMIPNPQLATFYILNLTGISIGGKQLQAS-GFAKGGILIDSGTVITRLPPSIYSAL 374
           +  ++M P     TFY + L  I + G+ L           +IDS T+I+RLPP+ Y AL
Sbjct: 250 LLSSSMAP-----TFYRVLLRAIIVAGRPLAVPPAVFSASSVIDSSTIISRLPPTAYQAL 304

Query: 375 KAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKS 434
           +A F    + + +AP  SILDTC++ +  + + +P + + F+G A + +D  GI+     
Sbjct: 305 RAAFRSAMTMYRAAPPVSILDTCYDFTGVRSITLPSIALVFDGGATVNLDAAGILL---- 360

Query: 435 DASQVCLALASLSYEDETGIIGNYQQKN 462
                CLA A  + +   G IGN QQK 
Sbjct: 361 ---GSCLAFAPTASDRMPGFIGNVQQKT 385



 Score =  115 bits (288), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 83/283 (29%), Positives = 135/283 (47%), Gaps = 49/283 (17%)

Query: 206 CSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMG 265
           CS+++   C + ++YGDGS   G    + L LG   V+                      
Sbjct: 389 CSANA--QCQFGINYGDGSTATGTYSFDDLTLGPYDVD---------------------- 424

Query: 266 LGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILG---GNSSVFKN--STPITYTN 320
             R  L L  +T+  +G +FSYC+P +  +   G + LG     +++     STP+  ++
Sbjct: 425 --RQGLPL--RTATQYGRVFSYCIPPSPSS--LGFITLGVPPQRAALVPTFVSTPLLSSS 478

Query: 321 MIPNPQLATFYILNLTGISIGGKQLQASG--FAKGGILIDSGTVITRLPPSIYSALKAEF 378
            +P     TFY + L  I + G+ L      F+   + I S TVI+RLPP+ Y AL+A F
Sbjct: 479 SMP----PTFYRVLLRAIIVAGRPLPVPPTVFSTSSV-IASTTVISRLPPTAYQALRAAF 533

Query: 379 LKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQ 438
            +  + + +AP  SILDTC++ +  + + +P + + F+G A + +D  GI+        Q
Sbjct: 534 RRAMTMYRTAPPVSILDTCYDFTGVRSITLPSIALVFDGGATVNLDAAGILL-------Q 586

Query: 439 VCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
            CLA A  + +   G IGN QQ+   V+YD     + F    C
Sbjct: 587 GCLAFAPTATDRMPGFIGNVQQRTLEVVYDVPGKAIRFRSAAC 629


>gi|238010910|gb|ACR36490.1| unknown [Zea mays]
 gi|413942664|gb|AFW75313.1| hypothetical protein ZEAMMB73_520329 [Zea mays]
          Length = 481

 Score =  194 bits (492), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 128/349 (36%), Positives = 187/349 (53%), Gaps = 27/349 (7%)

Query: 140 ELGGRNMTVIVDTGSDLTWVQCQPCK--SCYNQQDPVFDPSISPSYKKVLCNSSTCHAL- 196
           +L G   TV++D+ SD+ WVQC PC    C+ Q D  +DPS SPS     C+S TC AL 
Sbjct: 153 KLPGVIQTVVLDSASDVPWVQCVPCPIPPCHPQVDSFYDPSRSPSSAPFSCSSPTCTALG 212

Query: 197 EFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCGRNNKG 255
            +A G    C+++    C Y V Y DGS T G    + L L    +V+ F FGC    +G
Sbjct: 213 PYANG----CANN---QCQYLVRYPDGSSTSGAYIADLLTLDAGNAVSGFKFGCSHAEQG 265

Query: 256 LFGG-VSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNST 314
            F    +G+M LG    SL+SQT+  +G  FSYC+P+T  A  SG   LG      + S+
Sbjct: 266 SFDARAAGIMALGGGPESLLSQTASRYGNAFSYCIPAT--ASDSGFFTLGVPR---RASS 320

Query: 315 PITYTNMIPNPQLATFYILNLTGISIGGKQLQASG--FAKGGILIDSGTVITRLPPSIYS 372
               T M+   Q ATFY + L  I++GG++L  +   FA G +L DS T ITRLPP+ Y 
Sbjct: 321 RYVVTPMVRFRQAATFYGVLLRTITVGGQRLGVAPAVFAAGSVL-DSRTAITRLPPTAYQ 379

Query: 373 ALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFV 432
           AL++ F    + + SAP    LDTC++ +    + +P + + F+ NA + +D +GI++  
Sbjct: 380 ALRSAFRSSMTMYRSAPPKGYLDTCYDFTGVVNIRLPKISLVFDRNAVLPLDPSGILF-- 437

Query: 433 KSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
                  CLA  S + +   G++G+ QQ+   V+YD     +GF    C
Sbjct: 438 -----NDCLAFTSNADDRMPGVLGSVQQQTIEVLYDVGGGAVGFRQGAC 481


>gi|357158688|ref|XP_003578209.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 443

 Score =  194 bits (492), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 144/377 (38%), Positives = 188/377 (49%), Gaps = 49/377 (12%)

Query: 135 YIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
           Y+ ++ +G   R  + I+DTGSDL W QC PC  C +Q  P FDP+ SPSY K+ CNS  
Sbjct: 89  YLMSMGIGTPPRYYSAILDTGSDLIWTQCAPCMLCVDQPTPFFDPAQSPSYAKLPCNSPM 148

Query: 193 CHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG----KASVNDFIFG 248
           C+AL +      VC         YF  YGD + T G L  E    G    + +V    FG
Sbjct: 149 CNALYYPLCYRNVCVY------QYF--YGDSANTAGVLSNETFTFGTNDTRVTVPRIAFG 200

Query: 249 CGRNNKG-LFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL-------PSTQDAGASGS 300
           CG  N G LF G SG++G GR  LSLVSQ        FSYCL       PS    GA  +
Sbjct: 201 CGNLNAGSLFNG-SGMVGFGRGPLSLVSQLGS---PRFSYCLTSFMSPVPSRLYFGAYAT 256

Query: 301 LILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL--QASGFA------K 352
           L    NS+      P+  T  I NP L T Y LN+TGIS+GG+ L    S FA       
Sbjct: 257 L----NSTSASTGEPVQSTPFIVNPGLPTMYYLNMTGISVGGELLPIDPSVFAINDADGT 312

Query: 353 GGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFS---ILDTCFNLSAYQE--VN 407
           GG++IDSG+ IT L  + Y  +   F  Q  G P     S   +LDTCF         V 
Sbjct: 313 GGVIIDSGSTITYLARAAYDMVHQAFADQV-GLPLTNATSLADVLDTCFVWPPPPRKIVT 371

Query: 408 IPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIY 467
           +P +   FEG A M + +   +  +  D   +CLA+A+    D+  IIG++Q +N  V+Y
Sbjct: 372 MPELAFHFEG-ANMELPLENYM-LIDGDTGNLCLAIAA---SDDGSIIGSFQHQNFHVLY 426

Query: 468 DTKNSQLGFAGEDCSSM 484
           D +NS L F    C+ M
Sbjct: 427 DNENSLLSFTPATCNVM 443


>gi|226491934|ref|NP_001140743.1| uncharacterized protein LOC100272818 [Zea mays]
 gi|194700872|gb|ACF84520.1| unknown [Zea mays]
          Length = 351

 Score =  193 bits (491), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 128/349 (36%), Positives = 187/349 (53%), Gaps = 27/349 (7%)

Query: 140 ELGGRNMTVIVDTGSDLTWVQCQPCK--SCYNQQDPVFDPSISPSYKKVLCNSSTCHAL- 196
           +L G   TV++D+ SD+ WVQC PC    C+ Q D  +DPS SP+     C+S TC AL 
Sbjct: 23  KLPGVIQTVVLDSASDVPWVQCVPCPIPPCHPQVDSFYDPSRSPTSAAFSCSSPTCTALG 82

Query: 197 EFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCGRNNKG 255
            +A G    C+++    C Y V Y DGS T G    + L L    +V+ F FGC    +G
Sbjct: 83  PYANG----CANN---QCQYLVRYPDGSSTSGAYIADLLTLDAGNAVSGFKFGCSHAEQG 135

Query: 256 LFGG-VSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNST 314
            F    +G+M LG    SL+SQT+  +G  FSYC+P+T  A  SG   LG      + S+
Sbjct: 136 SFDARAAGIMALGGGPESLLSQTASRYGNAFSYCIPAT--ASDSGFFTLGVPR---RASS 190

Query: 315 PITYTNMIPNPQLATFYILNLTGISIGGKQLQASG--FAKGGILIDSGTVITRLPPSIYS 372
               T M+   Q ATFY + L  I++GG++L  +   FA G +L DS T ITRLPP+ Y 
Sbjct: 191 RYVVTPMVRFRQAATFYGVLLRTITVGGQRLGVAPAVFAAGSVL-DSRTAITRLPPTAYQ 249

Query: 373 ALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFV 432
           AL+A F    + + SAP    LDTC++ +    + +P + + F+ NA + +D +GI++  
Sbjct: 250 ALRAAFRSSMTMYRSAPPKGYLDTCYDFTGVVNIRLPKISLVFDRNAVLPLDPSGILF-- 307

Query: 433 KSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
                  CLA  S + +   G++G+ QQ+   V+YD     +GF    C
Sbjct: 308 -----NDCLAFTSNADDRMPGVLGSVQQQTIEVLYDVGGGAVGFRQGAC 351


>gi|115458644|ref|NP_001052922.1| Os04g0448300 [Oryza sativa Japonica Group]
 gi|113564493|dbj|BAF14836.1| Os04g0448300 [Oryza sativa Japonica Group]
 gi|215766465|dbj|BAG98773.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215767943|dbj|BAH00172.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 454

 Score =  193 bits (491), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 136/354 (38%), Positives = 188/354 (53%), Gaps = 34/354 (9%)

Query: 147 TVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVC 206
           + IVDTGSDL W QC+PC  C+ Q  PVFDPS S +Y  V C+S++C  L  +      C
Sbjct: 119 SAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSSSTYATVPCSSASCSDLPTSK-----C 173

Query: 207 SSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGL-FGGVSGLMG 265
           +S+S   C Y  +YGD S T+G L  E   L K+ +   +FGCG  N+G  F   +GL+G
Sbjct: 174 TSAS--KCGYTYTYGDSSSTQGVLATETFTLAKSKLPGVVFGCGDTNEGDGFSQGAGLVG 231

Query: 266 LGRSDLSLVSQTSEIFGGL--FSYCLPSTQDAGASGSLI--LGGNSSVFKNSTPITYTNM 321
           LGR  LSLVSQ      GL  FSYCL S  D   S  L+  L G S     ++ +  T +
Sbjct: 232 LGRGPLSLVSQL-----GLDKFSYCLTSLDDTNNSPLLLGSLAGISEASAAASSVQTTPL 286

Query: 322 IPNPQLATFYILNLTGISIGGKQ--LQASGFA-----KGGILIDSGTVITRLPPSIYSAL 374
           I NP   +FY ++L  I++G  +  L +S FA      GG+++DSGT IT L    Y AL
Sbjct: 287 IKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTGGVIVDSGTSITYLEVQGYRAL 346

Query: 375 KAEFLKQFSGFPSAPGFSI-LDTCFNLSA--YQEVNIPLVKMEFEGNAEMTVDVTGIVYF 431
           K  F  Q +  P+A G  + LD CF   A    +V +P +   F+G A++  D+    Y 
Sbjct: 347 KKAFAAQMA-LPAADGSGVGLDLCFRAPAKGVDQVEVPRLVFHFDGGADL--DLPAENYM 403

Query: 432 VKSDAS-QVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
           V    S  +CL +          IIGN+QQ+N + +YD  +  L FA   C+ +
Sbjct: 404 VLDGGSGALCLTVMG---SRGLSIIGNFQQQNFQFVYDVGHDTLSFAPVQCNKL 454


>gi|38344829|emb|CAD40873.2| OSJNBa0064H22.10 [Oryza sativa Japonica Group]
 gi|116310063|emb|CAH67084.1| H0818E04.1 [Oryza sativa Indica Group]
 gi|116310186|emb|CAH67198.1| OSIGBa0152K17.10 [Oryza sativa Indica Group]
          Length = 444

 Score =  193 bits (490), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 136/354 (38%), Positives = 188/354 (53%), Gaps = 34/354 (9%)

Query: 147 TVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVC 206
           + IVDTGSDL W QC+PC  C+ Q  PVFDPS S +Y  V C+S++C  L  +      C
Sbjct: 109 SAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSSSTYATVPCSSASCSDLPTSK-----C 163

Query: 207 SSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGL-FGGVSGLMG 265
           +S+S   C Y  +YGD S T+G L  E   L K+ +   +FGCG  N+G  F   +GL+G
Sbjct: 164 TSAS--KCGYTYTYGDSSSTQGVLATETFTLAKSKLPGVVFGCGDTNEGDGFSQGAGLVG 221

Query: 266 LGRSDLSLVSQTSEIFGGL--FSYCLPSTQDAGASGSLI--LGGNSSVFKNSTPITYTNM 321
           LGR  LSLVSQ      GL  FSYCL S  D   S  L+  L G S     ++ +  T +
Sbjct: 222 LGRGPLSLVSQL-----GLDKFSYCLTSLDDTNNSPLLLGSLAGISEASAAASSVQTTPL 276

Query: 322 IPNPQLATFYILNLTGISIGGKQ--LQASGFA-----KGGILIDSGTVITRLPPSIYSAL 374
           I NP   +FY ++L  I++G  +  L +S FA      GG+++DSGT IT L    Y AL
Sbjct: 277 IKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTGGVIVDSGTSITYLEVQGYRAL 336

Query: 375 KAEFLKQFSGFPSAPGFSI-LDTCFNLSA--YQEVNIPLVKMEFEGNAEMTVDVTGIVYF 431
           K  F  Q +  P+A G  + LD CF   A    +V +P +   F+G A++  D+    Y 
Sbjct: 337 KKAFAAQMA-LPAADGSGVGLDLCFRAPAKGVDQVEVPRLVFHFDGGADL--DLPAENYM 393

Query: 432 VKSDAS-QVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
           V    S  +CL +          IIGN+QQ+N + +YD  +  L FA   C+ +
Sbjct: 394 VLDGGSGALCLTVMG---SRGLSIIGNFQQQNFQFVYDVGHDTLSFAPVQCNKL 444


>gi|125548488|gb|EAY94310.1| hypothetical protein OsI_16079 [Oryza sativa Indica Group]
          Length = 423

 Score =  192 bits (489), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 136/354 (38%), Positives = 188/354 (53%), Gaps = 34/354 (9%)

Query: 147 TVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVC 206
           + IVDTGSDL W QC+PC  C+ Q  PVFDPS S +Y  V C+S++C  L      +  C
Sbjct: 88  SAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSSSTYATVPCSSASCSDLP-----TSKC 142

Query: 207 SSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGL-FGGVSGLMG 265
           +S+S   C Y  +YGD S T+G L  E   L K+ +   +FGCG  N+G  F   +GL+G
Sbjct: 143 TSAS--KCGYTYTYGDSSSTQGVLATETFTLAKSKLPGVVFGCGDTNEGDGFSQGAGLVG 200

Query: 266 LGRSDLSLVSQTSEIFGGL--FSYCLPSTQDAGASGSLI--LGGNSSVFKNSTPITYTNM 321
           LGR  LSLVSQ      GL  FSYCL S  D   S  L+  L G S     ++ +  T +
Sbjct: 201 LGRGPLSLVSQL-----GLDKFSYCLTSLDDTNNSPLLLGSLAGISEASAAASSVQTTPL 255

Query: 322 IPNPQLATFYILNLTGISIGGKQ--LQASGFA-----KGGILIDSGTVITRLPPSIYSAL 374
           I NP   +FY ++L  I++G  +  L +S FA      GG+++DSGT IT L    Y AL
Sbjct: 256 IKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTGGVIVDSGTSITYLEVQGYRAL 315

Query: 375 KAEFLKQFSGFPSAPGFSI-LDTCFNLSA--YQEVNIPLVKMEFEGNAEMTVDVTGIVYF 431
           K  F  Q +  P+A G  + LD CF   A    +V +P +   F+G A++  D+    Y 
Sbjct: 316 KKAFAAQMA-LPAADGSGVGLDLCFRAPAKGVDQVEVPRLVFHFDGGADL--DLPAENYM 372

Query: 432 VKSDAS-QVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
           V    S  +CL +          IIGN+QQ+N + +YD  +  L FA   C+ +
Sbjct: 373 VLDGGSGALCLTVMG---SRGLSIIGNFQQQNFQFVYDVGHDTLSFAPVQCNKL 423


>gi|242058537|ref|XP_002458414.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
 gi|241930389|gb|EES03534.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
          Length = 448

 Score =  192 bits (489), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 117/407 (28%), Positives = 199/407 (48%), Gaps = 31/407 (7%)

Query: 98  HVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELGG--RNMTVIVDTGSD 155
           H     +++ +  S    D      P+ SG+   +  Y A I +G       V++DTGSD
Sbjct: 51  HAAPFTAQVASFHSIAADDDDRLRSPVMSGVPFDSGEYFAVINVGDPPTRALVVIDTGSD 110

Query: 156 LTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCN 215
           L W+QC PC+ CY Q  P++DP  S +++++ C S  C  +    G    C + +   C 
Sbjct: 111 LIWLQCVPCRHCYRQVTPLYDPRSSSTHRRIPCASPRCRDVLRYPG----CDART-GGCV 165

Query: 216 YFVSYGDGSYTRGELGREHLGL-GKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLV 274
           Y V YGDGS + G+L  + L       V++   GCG +N GL    +GL+G+GR  LS  
Sbjct: 166 YMVVYGDGSASSGDLATDRLVFPDDTHVHNVTLGCGHDNVGLLESAAGLLGVGRGQLSFP 225

Query: 275 SQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILN 334
           +Q +  +G +FSYCL        +GS  L    +    ST   +T +  NP+  + Y ++
Sbjct: 226 TQLAPAYGHVFSYCLGDRLSRAQNGSSYLVFGRTPEPPST--AFTPLRTNPRRPSLYYVD 283

Query: 335 LTGISIGGKQLQASGFA-----------KGGILIDSGTVITRLPPSIYSALKAEFLKQFS 383
           + G S+GG+++  +GF+           +GGI++DSGT I+R     Y+A++  F    +
Sbjct: 284 MVGFSVGGERV--TGFSNASLALNPATGRGGIVVDSGTAISRFARDAYAAVRDAFDSHAA 341

Query: 384 GFPS----APGFSILDTCFNL----SAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSD 435
              +    A  FS+ D C++L    +    V +P + + F G A+M +     +  V+  
Sbjct: 342 AAGTMRKLATKFSVFDACYDLRGNGAPAAAVRVPSIVLHFAGGADMALPQANYLIPVQGG 401

Query: 436 ASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
             +    L   + +D   ++GN QQ+   +++D +  ++GF    CS
Sbjct: 402 DRRTYFCLGLQAADDGLNVLGNVQQQGFGLVFDVERGRIGFTPNGCS 448


>gi|225217039|gb|ACN85323.1| aspartic proteinase nepenthesin-1 precursor [Oryza brachyantha]
          Length = 287

 Score =  192 bits (489), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 112/277 (40%), Positives = 153/277 (55%), Gaps = 19/277 (6%)

Query: 214 CNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLS 272
           C Y V YGDGSYT G    + L L    ++  F FGCG  N+GLFG  +GL+GLGR   S
Sbjct: 21  CLYGVQYGDGSYTIGFFAMDTLTLSSHDAIKGFRFGCGERNEGLFGEAAGLLGLGRGKTS 80

Query: 273 LVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQL----A 328
           L  QT + +GG+F++C P+      +G L  G        S+P     +   P L     
Sbjct: 81  LPVQTYDKYGGVFAHCFPARSS--GTGYLEFG------PGSSPAVSAKLSTTPMLIDTGP 132

Query: 329 TFYILNLTGISIGGKQL--QASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFS--G 384
           TFY + +TGI +GGK L    S FA  G ++DSGTVITRLPP+ YS+L++ F    +  G
Sbjct: 133 TFYYVGMTGIRVGGKLLPIPQSVFAAAGTIVDSGTVITRLPPAAYSSLRSAFAASMAARG 192

Query: 385 FPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALA 444
           +  AP  S+LDTC++L+   EV IP V + F+G   + VD +GI+Y   +  SQ CL  A
Sbjct: 193 YKRAPALSLLDTCYDLTGASEVAIPTVSLLFQGGVSLDVDASGIIY--AASVSQACLGFA 250

Query: 445 SLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
                D+  I+GN Q K   V+YD  +  +GF    C
Sbjct: 251 GNEAADDVAIVGNTQLKTFGVVYDIASKVVGFCPGAC 287


>gi|225427550|ref|XP_002266461.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
          Length = 439

 Score =  192 bits (489), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 145/436 (33%), Positives = 210/436 (48%), Gaps = 46/436 (10%)

Query: 65  GAITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPL 124
           G  +++L H++       D ++ Q  RL  D        SR+     G  +  + T   +
Sbjct: 30  GGFSVDLIHRDSPHSPFFDPSKTQAERLT-DAFRRSV--SRV-----GRFRPTAMTSDGI 81

Query: 125 TSGIRLQTLNYIATIELGGRNMTVI--VDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPS 182
            S I      Y+  + +G   + VI  VDTGSDLTW QC+PC  CY Q  P+FDP  S +
Sbjct: 82  QSRIVPSAGEYLMNLYIGTPPVPVIAIVDTGSDLTWTQCRPCTHCYKQVVPLFDPKNSST 141

Query: 183 YKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL----G 238
           Y+   C +S C AL    G    CS      C +  SY DGS+T G L  E L +    G
Sbjct: 142 YRDSSCGTSFCLAL----GKDRSCSKEK--KCTFRYSYADGSFTGGNLASETLTVDSTAG 195

Query: 239 K-ASVNDFIFGCGRNNKGLFG-GVSGLMGLGRSDLSLVSQTSEIFGGLFSYC-LPSTQDA 295
           K  S   F FGCG ++ G+F    SG++GLG  +LSL+SQ      GLFSYC LP + D+
Sbjct: 196 KPVSFPGFAFGCGHSSGGIFDKSSSGIVGLGGGELSLISQLKSTINGLFSYCLLPVSTDS 255

Query: 296 GASGSLILGGNSSVF---KNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAK 352
             S  +  G +  V      STP+   +  P+    TFY L L GIS+G K+L   G++K
Sbjct: 256 SISSRINFGASGRVSGYGTVSTPLVQKS--PD----TFYYLTLEGISVGKKRLPYKGYSK 309

Query: 353 ------GGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEV 406
                 G I++DSGT  T LP   YS L+        G        I   C+N +A  E+
Sbjct: 310 KTEVEEGNIIVDSGTTYTFLPQEFYSKLEKSVANSIKGKRVRDPNGIFSLCYNTTA--EI 367

Query: 407 NIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVI 466
           N P++   F+   +  V++  +  F++     VC  +A  S   + G++GN  Q N  V 
Sbjct: 368 NAPIITAHFK---DANVELQPLNTFMRMQEDLVCFTVAPTS---DIGVLGNLAQVNFLVG 421

Query: 467 YDTKNSQLGFAGEDCS 482
           +D +  ++ F   DC+
Sbjct: 422 FDLRKKRVSFKAADCT 437


>gi|326494754|dbj|BAJ94496.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326514480|dbj|BAJ96227.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 449

 Score =  192 bits (489), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 136/354 (38%), Positives = 185/354 (52%), Gaps = 38/354 (10%)

Query: 148 VIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCS 207
            I+DTGSDL W QC+PC  C+NQ  PVFDPS S +Y  + C+S+ C  L      S  C+
Sbjct: 117 AIIDTGSDLVWTQCKPCVECFNQSTPVFDPSSSSTYAALPCSSTLCSDLP-----SSKCT 171

Query: 208 SSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGL-FGGVSGLMGL 266
           S+    C Y  +YGD S T+G L  E   L K  + D  FGCG  N+G  F   +GL+GL
Sbjct: 172 SAK---CGYTYTYGDSSSTQGVLAAETFTLAKTKLPDVAFGCGDTNEGDGFTQGAGLVGL 228

Query: 267 GRSDLSLVSQTSEIFGGL--FSYCLPSTQDAGASGSLILGGNSSV---FKNSTPITYTNM 321
           GR  LSLVSQ      GL  FSYCL S  D   S  L+LG  +++      ++ +  T +
Sbjct: 229 GRGPLSLVSQL-----GLNKFSYCLTSLDDTSKS-PLLLGSLATISESAAAASSVQTTPL 282

Query: 322 IPNPQLATFYILNLTGISIGGKQ--LQASGFA-----KGGILIDSGTVITRLPPSIYSAL 374
           I NP   +FY +NL G+++G     L +S FA      GG+++DSGT IT L    Y AL
Sbjct: 283 IRNPSQPSFYYVNLKGLTVGSTHITLPSSAFAVQDDGTGGVIVDSGTSITYLELQGYRAL 342

Query: 375 KAEFLKQFSGFPSAPGFSI-LDTCFNLSA--YQEVNIPLVKMEFEGNAEMTVDVTGIVYF 431
           K  F  Q    P+A G  I LDTCF   A    +V +P +    +G     +D+    Y 
Sbjct: 343 KKAFAAQMK-LPAADGSGIGLDTCFEAPASGVDQVEVPKLVFHLDG---ADLDLPAENYM 398

Query: 432 V-KSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
           V  S +  +CL +          IIGN+QQ+N + +YD   + L FA   C+ +
Sbjct: 399 VLDSGSGALCLTVMG---SRGLSIIGNFQQQNIQFVYDVGENTLSFAPVQCAKL 449


>gi|359485189|ref|XP_002279141.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 546

 Score =  192 bits (488), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 131/392 (33%), Positives = 199/392 (50%), Gaps = 46/392 (11%)

Query: 124 LTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISP 181
           L SG+ L +  Y   + +G   ++ ++I+DTGSDL W+QC PC  C+ Q  P +DP  S 
Sbjct: 170 LESGVSLGSGEYFIDVFVGTPPKHFSLILDTGSDLNWIQCVPCYECFEQNGPHYDPGQSS 229

Query: 182 SYKKVLCNSSTCHALEFATGNSGVCSSSSPPD--------CNYFVSYGDGSYTRGELGRE 233
           SY+ + C+ S CH          + SS  PP         C Y+  YGD S T G+   E
Sbjct: 230 SYRNIGCHDSRCH----------LVSSPDPPQPCKAENQTCPYYYWYGDSSNTTGDFALE 279

Query: 234 HLGL------GKAS---VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGL 284
              +      GK     V + +FGCG  N+GLF G +GL+GLGR  LS  SQ   ++G  
Sbjct: 280 TFTVNLTMSSGKPELRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHS 339

Query: 285 FSYCL-PSTQDAGASGSLILGGNSSVFKNSTPITYTNMIP---NPQLATFYILNLTGISI 340
           FSYCL     DA  S  LI G +  +  +   + +T ++    NP + TFY + +  I +
Sbjct: 340 FSYCLVDRNSDANVSSKLIFGEDKDLLSHPE-LNFTTLVAGKENP-VDTFYYVQIKSIVV 397

Query: 341 GG-------KQLQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI 393
           GG       ++ Q +    GG +IDSGT ++      Y  +K  F+ +  G+P    F +
Sbjct: 398 GGEVVNIPEEKWQIATDGSGGTIIDSGTTLSYFAEPAYQVIKEAFMAKVKGYPVVKDFPV 457

Query: 394 LDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQ-VCLALASLSYEDET 452
           L+ C+N++  ++ ++P   + F   A     V    YF++ +  + VCLA+         
Sbjct: 458 LEPCYNVTGVEQPDLPDFGIVFSDGAVWNFPVEN--YFIEIEPREVVCLAILGTP-PSAL 514

Query: 453 GIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
            IIGNYQQ+N  ++YDTK S+LGFA   C+ +
Sbjct: 515 SIIGNYQQQNFHILYDTKKSRLGFAPTKCADV 546


>gi|125539168|gb|EAY85563.1| hypothetical protein OsI_06935 [Oryza sativa Indica Group]
          Length = 514

 Score =  192 bits (488), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 133/381 (34%), Positives = 196/381 (51%), Gaps = 27/381 (7%)

Query: 124 LTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISP 181
           + SG+ + +  Y+  + +G   R   +I+DTGSDL W+QC PC  C+ Q+ PVFDP+ S 
Sbjct: 141 VESGVAVGSGEYLVDLYVGTPPRRFQMIMDTGSDLNWLQCAPCLDCFEQRGPVFDPATSL 200

Query: 182 SYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL---- 237
           SY+ V C    C  +   T         S P C Y+  YGD S T G+L  E   +    
Sbjct: 201 SYRNVTCGDPRCGLVAPPTAPRACRRPHSDP-CPYYYWYGDQSNTTGDLALEAFTVNLTA 259

Query: 238 --GKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDA 295
                 V+D +FGCG +N+GLF G +GL+GLGR  LS  SQ   ++G  FSYCL    D 
Sbjct: 260 PGASRRVDDVVFGCGHSNRGLFHGAAGLLGLGRGALSFASQLRAVYGHAFSYCL---VDH 316

Query: 296 GAS-GSLILGGNSSVFKNSTPITYTNMIPNPQLA--TFYILNLTGISIGGKQLQASGF-- 350
           G+S GS I+ G+         + YT   P+   A  TFY + L G+ +GG++L  S    
Sbjct: 317 GSSVGSKIVFGDDDALLGHPRLNYTAFAPSAAAAADTFYYVQLKGVLVGGEKLNISPSTW 376

Query: 351 -----AKGGILIDSGTVITRLPPSIYSALKAEFLKQFS-GFPSAPGFSILDTCFNLSAYQ 404
                  GG +IDSGT ++      Y  ++  F+++    +P    F +L  C+N+S  +
Sbjct: 377 DVGKDGSGGTIIDSGTTLSYFAEPAYEVIRRAFVERMDKAYPLVADFPVLSPCYNVSGVE 436

Query: 405 EVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV-CLALASLSYEDETGIIGNYQQKNQ 463
            V +P   + F   A    D     YFV+ D   + CLA+   +      IIGN+QQ+N 
Sbjct: 437 RVEVPEFSLLFADGA--VWDFPAENYFVRLDPDGIMCLAVLG-TPRSAMSIIGNFQQQNF 493

Query: 464 RVIYDTKNSQLGFAGEDCSSM 484
            V+YD +N++LGFA   C+ +
Sbjct: 494 HVLYDLQNNRLGFAPRRCAEV 514


>gi|115445765|ref|NP_001046662.1| Os02g0314600 [Oryza sativa Japonica Group]
 gi|46391044|dbj|BAD15987.1| putative chloroplast nucleoid DNA binding protein [Oryza sativa
           Japonica Group]
 gi|113536193|dbj|BAF08576.1| Os02g0314600 [Oryza sativa Japonica Group]
 gi|125581836|gb|EAZ22767.1| hypothetical protein OsJ_06441 [Oryza sativa Japonica Group]
 gi|215697168|dbj|BAG91162.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 514

 Score =  192 bits (487), Expect = 4e-46,   Method: Compositional matrix adjust.
 Identities = 133/381 (34%), Positives = 196/381 (51%), Gaps = 27/381 (7%)

Query: 124 LTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISP 181
           + SG+ + +  Y+  + +G   R   +I+DTGSDL W+QC PC  C+ Q+ PVFDP+ S 
Sbjct: 141 VESGVAVGSGEYLVDLYVGTPPRRFQMIMDTGSDLNWLQCAPCLDCFEQRGPVFDPAASL 200

Query: 182 SYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL---- 237
           SY+ V C    C  +   T         S P C Y+  YGD S T G+L  E   +    
Sbjct: 201 SYRNVTCGDPRCGLVAPPTAPRACRRPHSDP-CPYYYWYGDQSNTTGDLALEAFTVNLTA 259

Query: 238 --GKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDA 295
                 V+D +FGCG +N+GLF G +GL+GLGR  LS  SQ   ++G  FSYCL    D 
Sbjct: 260 PGASRRVDDVVFGCGHSNRGLFHGAAGLLGLGRGALSFASQLRAVYGHAFSYCL---VDH 316

Query: 296 GAS-GSLILGGNSSVFKNSTPITYTNMIPNPQLA--TFYILNLTGISIGGKQLQASGF-- 350
           G+S GS I+ G+         + YT   P+   A  TFY + L G+ +GG++L  S    
Sbjct: 317 GSSVGSKIVFGDDDALLGHPRLNYTAFAPSAAAAADTFYYVQLKGVLVGGEKLNISPSTW 376

Query: 351 -----AKGGILIDSGTVITRLPPSIYSALKAEFLKQFS-GFPSAPGFSILDTCFNLSAYQ 404
                  GG +IDSGT ++      Y  ++  F+++    +P    F +L  C+N+S  +
Sbjct: 377 DVGKDGSGGTIIDSGTTLSYFAEPAYEVIRRAFVERMDKAYPLVADFPVLSPCYNVSGVE 436

Query: 405 EVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV-CLALASLSYEDETGIIGNYQQKNQ 463
            V +P   + F   A    D     YFV+ D   + CLA+   +      IIGN+QQ+N 
Sbjct: 437 RVEVPEFSLLFADGA--VWDFPAENYFVRLDPDGIMCLAVLG-TPRSAMSIIGNFQQQNF 493

Query: 464 RVIYDTKNSQLGFAGEDCSSM 484
            V+YD +N++LGFA   C+ +
Sbjct: 494 HVLYDLQNNRLGFAPRRCAEV 514


>gi|125590542|gb|EAZ30892.1| hypothetical protein OsJ_14967 [Oryza sativa Japonica Group]
          Length = 516

 Score =  192 bits (487), Expect = 4e-46,   Method: Compositional matrix adjust.
 Identities = 136/354 (38%), Positives = 188/354 (53%), Gaps = 34/354 (9%)

Query: 147 TVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVC 206
           + IVDTGSDL W QC+PC  C+ Q  PVFDPS S +Y  V C+S++C  L  +      C
Sbjct: 181 SAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSSSTYATVPCSSASCSDLPTSK-----C 235

Query: 207 SSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGL-FGGVSGLMG 265
           +S+S   C Y  +YGD S T+G L  E   L K+ +   +FGCG  N+G  F   +GL+G
Sbjct: 236 TSAS--KCGYTYTYGDSSSTQGVLATETFTLAKSKLPGVVFGCGDTNEGDGFSQGAGLVG 293

Query: 266 LGRSDLSLVSQTSEIFGGL--FSYCLPSTQDAGASGSLI--LGGNSSVFKNSTPITYTNM 321
           LGR  LSLVSQ      GL  FSYCL S  D   S  L+  L G S     ++ +  T +
Sbjct: 294 LGRGPLSLVSQL-----GLDKFSYCLTSLDDTNNSPLLLGSLAGISEASAAASSVQTTPL 348

Query: 322 IPNPQLATFYILNLTGISIGGKQ--LQASGFA-----KGGILIDSGTVITRLPPSIYSAL 374
           I NP   +FY ++L  I++G  +  L +S FA      GG+++DSGT IT L    Y AL
Sbjct: 349 IKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTGGVIVDSGTSITYLEVQGYRAL 408

Query: 375 KAEFLKQFSGFPSAPGFSI-LDTCFNLSA--YQEVNIPLVKMEFEGNAEMTVDVTGIVYF 431
           K  F  Q +  P+A G  + LD CF   A    +V +P +   F+G A++  D+    Y 
Sbjct: 409 KKAFAAQMA-LPAADGSGVGLDLCFRAPAKGVDQVEVPRLVFHFDGGADL--DLPAENYM 465

Query: 432 VKSDAS-QVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
           V    S  +CL +          IIGN+QQ+N + +YD  +  L FA   C+ +
Sbjct: 466 VLDGGSGALCLTVMG---SRGLSIIGNFQQQNFQFVYDVGHDTLSFAPVQCNKL 516


>gi|147809812|emb|CAN71447.1| hypothetical protein VITISV_040904 [Vitis vinifera]
          Length = 988

 Score =  190 bits (483), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 141/408 (34%), Positives = 217/408 (53%), Gaps = 49/408 (12%)

Query: 95  DNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELGG--RNMTVIVDT 152
           D   V ++ S+     SGN+K+ ++      + +  +  N++  +  G   +   +I+DT
Sbjct: 92  DESRVSFINSKCNQYTSGNLKNHAHN-----NNLFDEDGNFLVDVAFGTPPQKFKLILDT 146

Query: 153 GSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPP 212
           GS +TW QC+ C  C       FD   S +Y    C  ST        GN+         
Sbjct: 147 GSSITWTQCKACVHCLKDSHRHFDSLASSTYSFGSCIPST-------VGNT--------- 190

Query: 213 DCNYFVSYGDGSYTRGELGREHLGLGKASV-NDFIFGCGRNNKGLFG-GVSGLMGLGRSD 270
              Y ++YGD S + G  G + + L  + V   F FGCGRNN+G FG G  G++GLG+  
Sbjct: 191 ---YNMTYGDKSTSVGNYGCDTMTLEPSDVFQKFQFGCGRNNEGDFGSGADGMLGLGQGQ 247

Query: 271 LSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNP----- 325
           LS VSQT+  F  +FSYCLP   +  + GSL+ G  ++    S+ + +T+++  P     
Sbjct: 248 LSTVSQTASKFKKVFSYCLP---EENSIGSLLFGEKAT--SQSSSLKFTSLVNGPGTSGL 302

Query: 326 QLATFYILNLTGISIGGKQLQ--ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFS 383
           + + +Y + L  IS+G K+L   +S FA  G +IDSGTVITRLP   YSALKA F K  +
Sbjct: 303 EESGYYFVKLLDISVGNKRLNIPSSVFASPGTIIDSGTVITRLPQRAYSALKAAFKKAMA 362

Query: 384 GFPSAPGF----SILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV 439
            +P + G      +LDTC+NLS  ++V +P   + F   A++ ++   +V+   +DAS++
Sbjct: 363 KYPLSNGRRKENDMLDTCYNLSGRKDVLLPEXVLHFGDGADVRLNGKRVVW--GNDASRL 420

Query: 440 CLALASLS---YEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
           CLA A  S      E  IIGN QQ +  V+YD +  ++GF G  CS++
Sbjct: 421 CLAFAGNSKSTMNPELTIIGNRQQVSLTVLYDIRGRRIGFGGNGCSNL 468


>gi|255563827|ref|XP_002522914.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223537841|gb|EEF39457.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 442

 Score =  189 bits (481), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 145/401 (36%), Positives = 207/401 (51%), Gaps = 39/401 (9%)

Query: 99  VQYLQSRIKNMISGNIKDVSNTEI--PLTSGIRLQTLNYIATIELGG--RNMTVIVDTGS 154
           ++    R++ + +  +   SN EI  P+ SG       ++  + +G      + I+DTGS
Sbjct: 66  IKRANHRLERLNAMVLAASSNAEINSPVLSG----NGEFLMNLAIGTPPETYSAIMDTGS 121

Query: 155 DLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDC 214
           DL W QC+PC  C++Q  P+FDP  S S+ K+ C+S  C AL  ++     CS S    C
Sbjct: 122 DLIWTQCKPCTQCFDQPSPIFDPKKSSSFSKLSCSSQLCKALPQSS-----CSDS----C 172

Query: 215 NYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGL-FGGVSGLMGLGRSDLSL 273
            Y  +YGD S T+G +  E    GK S+ +  FGCG +N+G  F   SGL+GLGR  LSL
Sbjct: 173 EYLYTYGDYSSTQGTMATETFTFGKVSIPNVGFGCGEDNEGDGFTQGSGLVGLGRGPLSL 232

Query: 274 VSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYIL 333
           VSQ  E     FSYCL S  D   S +L++G  +SV   S  I  T +I NP   +FY L
Sbjct: 233 VSQLKE---AKFSYCLTSIDDTKTS-TLLMGSLASVNGTSAAIRTTPLIQNPLQPSFYYL 288

Query: 334 NLTGISIGGKQL-------QASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFP 386
           +L GIS+GG +L       Q      GG++IDSGT IT L  S +  +K EF  Q  G P
Sbjct: 289 SLEGISVGGTRLPIKESTFQLQDDGTGGLIIDSGTTITYLEESAFDLVKKEFTSQM-GLP 347

Query: 387 -SAPGFSILDTCFNL-SAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFV-KSDASQVCLAL 443
               G + L+ C+NL S   E+ +P + + F G     +++ G  Y +  S    +CLA+
Sbjct: 348 VDNSGATGLELCYNLPSDTSELEVPKLVLHFTG---ADLELPGENYMIADSSMGVICLAM 404

Query: 444 ASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
            S        I GN QQ+N  V +D +   L F   +C  +
Sbjct: 405 GS---SGGMSIFGNVQQQNMFVSHDLEKETLSFLPTNCGQL 442


>gi|449462553|ref|XP_004149005.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 418

 Score =  189 bits (480), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 128/339 (37%), Positives = 181/339 (53%), Gaps = 21/339 (6%)

Query: 149 IVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSS 208
           I DTGSDLTW QC PC  CY Q  P+F+P  S S+  V CN+ TCHA++   G+ GV   
Sbjct: 96  IADTGSDLTWAQCLPCLKCYQQLRPIFNPLKSTSFSHVPCNTQTCHAVD--DGHCGVQGV 153

Query: 209 SSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGR 268
                C+Y  +YGD +Y++G+LG E + +G +SV   I GCG  + G FG  SG++GLG 
Sbjct: 154 -----CDYSYTYGDRTYSKGDLGFEKITIGSSSVKSVI-GCGHASSGGFGFASGVIGLGG 207

Query: 269 SDLSLVSQTSEIFG--GLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQ 326
             LSLVSQ S+  G    FSYCLP T  + A+G +  G N+ V   S P   +  + +  
Sbjct: 208 GQLSLVSQMSQTSGISRRFSYCLP-TLLSHANGKINFGQNAVV---SGPGVVSTPLISKN 263

Query: 327 LATFYILNLTGISIGGKQLQASGFAK-GGILIDSGTVITRLPPSIYSALKAEFLKQFSGF 385
             T+Y + L  ISIG ++  A  FAK G ++IDSGT ++ LP  +Y  + +  LK     
Sbjct: 264 TVTYYYITLEAISIGNERHMA--FAKQGNVIIDSGTTLSFLPKELYDGVVSSLLKVVKAK 321

Query: 386 PSAPGFSILDTCFN--LSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLAL 443
                 +  D CF+  ++      IP++  +F G A   V++  +  F K   +  CL L
Sbjct: 322 RVKDPGNFWDLCFDDGINVATSSGIPIITAQFSGGAN--VNLLPVNTFQKVANNVNCLTL 379

Query: 444 ASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
              S  DE GIIGN    N  + YD +  +L F    C+
Sbjct: 380 TPASPTDEFGIIGNLALANFLIGYDLEAKRLSFKPTVCT 418


>gi|413918484|gb|AFW58416.1| hypothetical protein ZEAMMB73_998053 [Zea mays]
          Length = 475

 Score =  189 bits (479), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 134/354 (37%), Positives = 185/354 (52%), Gaps = 26/354 (7%)

Query: 148 VIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCS 207
            IVDTGSDL W QC+PC  C+NQ  PVFDP+ S +Y  + C+S+ C  L  +T  S   S
Sbjct: 131 AIVDTGSDLVWTQCKPCVECFNQTTPVFDPAASSTYAALPCSSALCADLPTSTCASSSSS 190

Query: 208 SSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGL-FGGVSGLMGL 266
           SS+   C Y  +YGD S T+G L  E   L +  V    FGCG  N+G  F   +GL+GL
Sbjct: 191 SSASSPCGYTYTYGDASSTQGVLATETFTLARQKVPGVAFGCGDTNEGDGFTQGAGLVGL 250

Query: 267 GRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLIL--GGNSSVFKNSTPITYTNMIPN 324
           GR  LSLVSQ        FSYCL S  DA     L+L      S    + P   T ++ N
Sbjct: 251 GRGPLSLVSQLGI---DRFSYCLTSLDDAAGRSPLLLGSAAGISASAATAPAQTTPLVKN 307

Query: 325 PQLATFYILNLTGISIGGKQLQ--ASGFA-----KGGILIDSGTVITRLPPSIYSALKAE 377
           P   +FY ++LTG+++G  +L   +S FA      GG+++DSGT IT L    Y AL+  
Sbjct: 308 PSQPSFYYVSLTGLTVGSTRLALPSSAFAIQDDGTGGVIVDSGTSITYLELRAYRALRKA 367

Query: 378 FLKQFSGFPSAPGFSI-LDTCFNLSAYQ-----EVNIPLVKMEFEGNAEMTVDVTGIVYF 431
           F+   S  P+     I LD CF   A       +V +P + + F+G A++  D+    Y 
Sbjct: 368 FVAHMS-LPTVDASEIGLDLCFQGPAGAVDQDVQVQVPKLVLHFDGGADL--DLPAENYM 424

Query: 432 VKSDAS-QVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
           V   AS  +CL + +        IIGN+QQ+N + +YD     L FA  +C+ +
Sbjct: 425 VLDSASGALCLTVMA---SRGLSIIGNFQQQNFQFVYDVAGDTLSFAPAECNKL 475


>gi|359496797|ref|XP_002277380.2| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
           vinifera]
          Length = 358

 Score =  188 bits (478), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 117/301 (38%), Positives = 173/301 (57%), Gaps = 27/301 (8%)

Query: 92  LILDNLHVQYLQSRIKN---------MISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG 142
           L  D+  V+ L SR+           +   +I+   +  +PL  G  + + NY   +  G
Sbjct: 66  LAWDDARVKTLNSRLTRKDTRFPKSVLTKKDIRFPKSVSVPLNPGASIGSGNYYVKVGFG 125

Query: 143 --GRNMTVIVDTGSDLTWVQCQPCKS-CYNQQDPVFDPSISPSYKKVLCNSSTCHALEFA 199
              R  ++IVDTGS L+W+QC+PC   C+ Q DP+FDPS S +YK + C SS C +L  A
Sbjct: 126 SPARYYSMIVDTGSSLSWLQCKPCVVYCHVQADPLFDPSASKTYKSLSCTSSQCSSLVDA 185

Query: 200 TGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCGRNNKGLFG 258
           T N+ +C +SS   C Y  SYGD SY+ G L ++ L L  + ++  F++GCG+++ GLFG
Sbjct: 186 TLNNPLCETSSN-VCVYTASYGDSSYSMGYLSQDLLTLAPSQTLPGFVYGCGQDSDGLFG 244

Query: 259 GVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNS---SVFKNSTP 315
             +G++GLGR+ LS++ Q S  FG  FSYCLP+    G  G L +G  S   S +K    
Sbjct: 245 RAAGILGLGRNKLSMLGQVSSKFGYAFSYCLPTR---GGGGFLSIGKASLAGSAYK---- 297

Query: 316 ITYTNMIPNPQLATFYILNLTGISIGGKQLQ-ASGFAKGGILIDSGTVITRLPPSIYSAL 374
             +T M  +P   + Y L LT I++GG+ L  A+   +   +IDSGTVITRLP S+Y+  
Sbjct: 298 --FTPMTTDPGNPSLYFLRLTAITVGGRALGVAAAQYRVPTIIDSGTVITRLPMSVYTPF 355

Query: 375 K 375
           +
Sbjct: 356 Q 356


>gi|224060469|ref|XP_002300215.1| predicted protein [Populus trichocarpa]
 gi|222847473|gb|EEE85020.1| predicted protein [Populus trichocarpa]
          Length = 439

 Score =  188 bits (478), Expect = 5e-45,   Method: Compositional matrix adjust.
 Identities = 133/345 (38%), Positives = 188/345 (54%), Gaps = 29/345 (8%)

Query: 147 TVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVC 206
           + I+DTGSDL W QC+PC  C++Q  P+FDP  S S+ K+ C+S  C AL  ++ N+G  
Sbjct: 111 SAILDTGSDLIWTQCKPCTQCFHQSTPIFDPKKSSSFSKLSCSSQLCEALPQSSCNNG-- 168

Query: 207 SSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGL-FGGVSGLMG 265
                  C Y  SYGD S T+G L  E L  GKASV +  FGCG +N+G  F   +GL+G
Sbjct: 169 -------CEYLYSYGDYSSTQGILASETLTFGKASVPNVAFGCGADNEGSGFSQGAGLVG 221

Query: 266 LGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNP 325
           LGR  LSLVSQ  E     FSYCL +  D   S +L++G  +SV  +S+ I  T +I +P
Sbjct: 222 LGRGPLSLVSQLKE---PKFSYCLTTVDDTKTS-TLLMGSLASVNASSSAIKTTPLIHSP 277

Query: 326 QLATFYILNLTGISIGGKQL--QASGFA-----KGGILIDSGTVITRLPPSIYSALKAEF 378
              +FY L+L GIS+G  +L  + S F+      GG++IDSGT IT L  S ++ +  EF
Sbjct: 278 AHPSFYYLSLEGISVGDTRLPIKKSTFSLQDDGSGGLIIDSGTTITYLEESAFNLVAKEF 337

Query: 379 LKQFSGFPSAPGFSILDTCFNL-SAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFV-KSDA 436
             + +    + G + LD CF L S    + +P +   F+G     +++    Y +  S  
Sbjct: 338 TAKINLPVDSSGSTGLDVCFTLPSGSTNIEVPKLVFHFDG---ADLELPAENYMIGDSSM 394

Query: 437 SQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
              CLA+ S S      I GN QQ+N  V++D +   L F    C
Sbjct: 395 GVACLAMGSSS---GMSIFGNVQQQNMLVLHDLEKETLSFLPTQC 436


>gi|326503794|dbj|BAK02683.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 456

 Score =  187 bits (476), Expect = 7e-45,   Method: Compositional matrix adjust.
 Identities = 145/399 (36%), Positives = 202/399 (50%), Gaps = 46/399 (11%)

Query: 95  DNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDT 152
           D L  +Y+Q ++         D++   +P T G  L T+ Y+ T+ +G      T+++DT
Sbjct: 92  DQLRAKYIQRKLSGTDGLQPLDLT---VPTTLGSALDTMEYVITVGIGSPAVTQTMMIDT 148

Query: 153 GSDLTWVQCQPCKSCYNQQD--PVFDPSISPSYKKVLCNSSTCHALEFATGNSGV-CSSS 209
           GSD++WV+C       N  D   +FDPS S +Y    C+S+ C  L    GN+G  CS+S
Sbjct: 149 GSDVSWVRC-------NSTDGLTLFDPSKSTTYAPFSCSSAACAQL----GNNGDGCSNS 197

Query: 210 SPPDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCGRNNKGLFG-GVSGLMGLG 267
               C Y V YGDGS T G    + L L  + +V DF FGC  + +   G  + GLMGLG
Sbjct: 198 G---CQYRVQYGDGSNTTGTYSSDTLALSASDTVTDFHFGCSHHEEDFDGEKIDGLMGLG 254

Query: 268 RSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQL 327
               SLVSQT+  +G  FSYCLP T     SG L  G  +     S     T M+  P+ 
Sbjct: 255 GDAQSLVSQTAATYGKSFSYCLPPTNR--TSGFLTFGAPNG---TSGGFVTTPMLRWPKA 309

Query: 328 ATFYILNLTGISIGGKQL--QASGFAKGGILIDSGTVITRLPPSIYSALKAEF---LKQF 382
            T Y + L  IS+GG  L  Q S  + G ++ DSGTVIT LP   YSAL + F   + + 
Sbjct: 310 PTLYGVLLQDISVGGTPLGIQPSVLSNGSVM-DSGTVITWLPRRAYSALSSAFRSSMTRL 368

Query: 383 SGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLA 442
               +AP   ILDTC++ +    V+IP V +  +G A + +D  GI+        Q CLA
Sbjct: 369 RHQRAAP-LGILDTCYDFTGLVNVSIPAVSLVLDGGAVVDLDGNGIMI-------QDCLA 420

Query: 443 LASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
            A+ S +    IIGN QQ+   V++D      GF    C
Sbjct: 421 FAATSGDS---IIGNVQQRTFEVLHDVGQGVFGFRSGAC 456


>gi|224126751|ref|XP_002329464.1| predicted protein [Populus trichocarpa]
 gi|222870144|gb|EEF07275.1| predicted protein [Populus trichocarpa]
          Length = 439

 Score =  187 bits (476), Expect = 7e-45,   Method: Compositional matrix adjust.
 Identities = 135/349 (38%), Positives = 187/349 (53%), Gaps = 31/349 (8%)

Query: 147 TVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVC 206
           + I+DTGSDL W QC+PC  C++Q  P+FDP  S S+ K+ C+S  C AL  +T + G  
Sbjct: 111 SAIMDTGSDLIWTQCKPCTQCFDQPTPIFDPKKSSSFSKLSCSSKLCEALPQSTCSDG-- 168

Query: 207 SSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGL-FGGVSGLMG 265
                  C Y   YGD S T+G L  E L  GK SV +  FGCG +N+G  F   SGL+G
Sbjct: 169 -------CEYLYGYGDYSSTQGMLASETLTFGKVSVPEVAFGCGEDNEGSGFSQGSGLVG 221

Query: 266 LGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNP 325
           LGR  LSLVSQ  E     FSYCL S  D  AS +L++G  +SV  + + I  T +I N 
Sbjct: 222 LGRGPLSLVSQLKE---PKFSYCLTSVDDTKAS-TLLMGSLASVKASDSEIKTTPLIQNS 277

Query: 326 QLATFYILNLTGISIGGKQL--QASGFA-----KGGILIDSGTVITRLPPSIYSALKAEF 378
              +FY L+L GIS+G   L  + S F+      GG++IDSGT IT L  S +  +  EF
Sbjct: 278 AQPSFYYLSLEGISVGDTSLPIKKSTFSLQEDGSGGLIIDSGTTITYLEQSAFDLVAKEF 337

Query: 379 LKQFSGFPSAPGFSILDTCFNL-SAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDAS 437
             Q +      G + L+ CF L S   ++ +P +   F+G A++ +       ++ +DAS
Sbjct: 338 TSQINLPVDNSGSTGLEVCFTLPSGSTDIEVPKLVFHFDG-ADLELPAEN---YMIADAS 393

Query: 438 Q--VCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
               CLA+ S S      I GN QQ+N  V++D +   L F    C  +
Sbjct: 394 MGVACLAMGSSS---GMSIFGNIQQQNMLVLHDLEKETLSFLPTQCDEL 439


>gi|297605079|ref|NP_001056639.2| Os06g0121800 [Oryza sativa Japonica Group]
 gi|255676668|dbj|BAF18553.2| Os06g0121800 [Oryza sativa Japonica Group]
          Length = 487

 Score =  187 bits (476), Expect = 8e-45,   Method: Compositional matrix adjust.
 Identities = 122/340 (35%), Positives = 173/340 (50%), Gaps = 26/340 (7%)

Query: 150 VDTGSDLTWVQCQPCK--SCYNQQDPVFDPSISPSYKKVLCNSSTCHAL-EFATGNSGVC 206
           +DT  DL W+QC PC    CY QQ+ +FDP  S +   V C S+ C  L  +  G    C
Sbjct: 166 IDTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGELGRYGAG----C 221

Query: 207 SSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASV-NDFIFGCGRNNKGLFGG-VSGLM 264
           S++    C YFV YGDG  T G    + L L  ++V  +F FGC    +G F    SG M
Sbjct: 222 SNN---QCQYFVDYGDGRATSGTYMVDALTLNPSTVVMNFRFGCSHAVRGNFSASTSGTM 278

Query: 265 GLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPN 324
            LG    SL+SQT+  FG  FSYC+P   D  +SG L LGG +           T ++ N
Sbjct: 279 SLGGGRQSLLSQTAATFGNAFSYCVP---DPSSSGFLSLGGPADGGGAGR-FARTPLVRN 334

Query: 325 PQ-LATFYILNLTGISIGGKQLQASGFA-KGGILIDSGTVITRLPPSIYSALKAEFLKQF 382
           P  + T Y++ L GI +GG++L        GG ++DS  +IT+LPP+ Y AL+  F    
Sbjct: 335 PSIIPTLYLVRLRGIEVGGRRLNVPPVVFAGGAVMDSSVIITQLPPTAYRALRLAFRSAM 394

Query: 383 SGFPS-APGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCL 441
           + +P  A G + LDTC++   +  V +P V + F+G A + +D  G++        + CL
Sbjct: 395 AAYPRVAGGRAGLDTCYDFVRFTSVTVPAVSLVFDGGAVVRLDAMGVMV-------EGCL 447

Query: 442 ALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
           A      +   G IGN QQ+   V+YD     +GF    C
Sbjct: 448 AFVPTPGDFALGFIGNVQQQTHEVLYDVGGGSVGFRRGAC 487


>gi|326500408|dbj|BAK06293.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 475

 Score =  187 bits (476), Expect = 9e-45,   Method: Compositional matrix adjust.
 Identities = 118/344 (34%), Positives = 173/344 (50%), Gaps = 26/344 (7%)

Query: 147 TVIVDTGSDLTWVQCQPCK--SCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSG 204
           T+ +DT  D+ W+QC PC    CY Q+DP+FDP+ S +   V C S  C +L    GN G
Sbjct: 149 TMAIDTTVDVPWIQCAPCPIPQCYPQRDPLFDPTTSSTAAAVRCRSPACRSLG-PYGN-G 206

Query: 205 VCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL-GKASVNDFIFGCGRNNKGLFGGV-SG 262
             + S+  +C Y + Y D   T G    + L + G  +V +F FGC    +G F  + +G
Sbjct: 207 CSNRSANAECRYLIEYSDDRATAGTYMTDTLTISGTTAVRNFRFGCSHAVRGRFSDLTAG 266

Query: 263 LMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGG----NSSVFKNSTPITY 318
            M LG    SL++QT+   G  FSYC+P    A ASG L +GG    NS+    +TP+  
Sbjct: 267 TMSLGGGAQSLLAQTARSLGNAFSYCVP---QASASGFLSIGGPATTNSTTVFATTPLVR 323

Query: 319 TNMIPNPQLATFYILNLTGISIGGKQLQASGFA-KGGILIDSGTVITRLPPSIYSALKAE 377
           + +  NP L   Y++ L GI + G++L     A   G ++DS  VIT+LPP+ Y AL+  
Sbjct: 324 SAI--NPSL---YLVRLQGIVVAGRRLGIPPVAFSAGAVMDSSAVITQLPPTAYRALRRA 378

Query: 378 FLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDAS 437
           F      +P +     LDTC++      V +P V + F G A + +D   ++        
Sbjct: 379 FRNAMRAYPRSGATGTLDTCYDFLGLTNVRVPAVSLVFGGGAVVVLDPPAVMI------- 431

Query: 438 QVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
             CLA  + S +   G IGN QQ+   V+YD     +GF    C
Sbjct: 432 GGCLAFTATSSDLALGFIGNVQQQTHEVLYDVAAGGVGFRRGAC 475


>gi|218197468|gb|EEC79895.1| hypothetical protein OsI_21423 [Oryza sativa Indica Group]
          Length = 471

 Score =  187 bits (476), Expect = 9e-45,   Method: Compositional matrix adjust.
 Identities = 123/340 (36%), Positives = 173/340 (50%), Gaps = 26/340 (7%)

Query: 150 VDTGSDLTWVQCQPCK--SCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGV-C 206
           +DT  DL W+QC PC    CY QQ+ +FDP  S +   V C S+ C  L    G  G  C
Sbjct: 150 IDTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGEL----GRYGAGC 205

Query: 207 SSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASV-NDFIFGCGRNNKGLFGG-VSGLM 264
           S++    C YFV YGDG  T G    + L L  ++V  +F FGC    +G F    SG M
Sbjct: 206 SNN---QCQYFVDYGDGRATSGTYMVDALTLNPSTVVMNFRFGCSHAVRGNFSASTSGTM 262

Query: 265 GLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPN 324
            LG    SL+SQT+  FG  FSYC+P   D  +SG L LGG +           T ++ N
Sbjct: 263 SLGGGRQSLLSQTAATFGNAFSYCVP---DPSSSGFLSLGGPADGGGAGR-FARTPLVRN 318

Query: 325 PQ-LATFYILNLTGISIGGKQLQASGFA-KGGILIDSGTVITRLPPSIYSALKAEFLKQF 382
           P  + T Y++ L GI +GG++L        GG ++DS  +IT+LPP+ Y AL+  F    
Sbjct: 319 PSIIPTLYLVRLRGIEVGGRRLNVPPVVFAGGAVMDSSVIITQLPPTAYRALRLAFRSAM 378

Query: 383 SGFPS-APGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCL 441
           + +P  A G + LDTC++   +  V +P V + F+G A + +D  G++        + CL
Sbjct: 379 AAYPRVAGGRAGLDTCYDFVRFTSVTVPAVSLVFDGGAVVRLDAMGVMV-------EGCL 431

Query: 442 ALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
           A      +   G IGN QQ+   V+YD     +GF    C
Sbjct: 432 AFVPTPGDFALGFIGNVQQQTHEVLYDVGGGSVGFRRGAC 471


>gi|414880708|tpg|DAA57839.1| TPA: hypothetical protein ZEAMMB73_997765 [Zea mays]
          Length = 452

 Score =  187 bits (475), Expect = 9e-45,   Method: Compositional matrix adjust.
 Identities = 114/385 (29%), Positives = 195/385 (50%), Gaps = 39/385 (10%)

Query: 123 PLTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSIS 180
           P+ SG+   +  Y A I +G    +  V++DTGSDL W+QC PC+ CY Q  P++DP  S
Sbjct: 80  PVMSGVPFDSGEYFAVIGVGDPPTHALVVIDTGSDLIWLQCLPCRRCYRQVTPLYDPRNS 139

Query: 181 PSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL-GK 239
            +++++ C S  C  +    G    C + +   C Y V YGDGS + G+L  + L L   
Sbjct: 140 KTHRRIPCASPQCRGVLRYPG----CDART-GGCVYMVVYGDGSASSGDLATDTLVLPDD 194

Query: 240 ASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL-PSTQDAGAS 298
             V++   GCG +N+GL    +GL+G GR  LS  +Q +  +G +FSYCL      A  S
Sbjct: 195 TRVHNVTLGCGHDNEGLLASAAGLLGAGRGQLSFPTQLAPAYGHVFSYCLGDRMSRARNS 254

Query: 299 GSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA------- 351
            S ++ G +    ++    +T +  NP+  + Y +++ G S+GG+++  +GF+       
Sbjct: 255 SSYLVFGRTPELPST---AFTPLRTNPRRPSLYYVDMVGFSVGGERV--AGFSNASLALN 309

Query: 352 ----KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPG-------FSILDTCFNL 400
               +GG+++DSGT I+R     Y+A++  F+       +A G       FS+ DTC+++
Sbjct: 310 PATGRGGVVVDSGTAISRFTRDAYAAVRDAFVSHA----AAAGMRRLRNKFSVFDTCYDV 365

Query: 401 SAYQE---VNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGN 457
                   V +P + + F   A+M +     +  V     +    L   + +D   ++GN
Sbjct: 366 HGNGPGTGVRVPSIVLHFAAAADMALPQANYLIPVVGGDRRTYFCLGLQAADDGLNVLGN 425

Query: 458 YQQKNQRVIYDTKNSQLGFAGEDCS 482
            QQ+   V++D +  ++GF    CS
Sbjct: 426 VQQQGFGVVFDVERGRIGFTPNGCS 450


>gi|125553822|gb|EAY99427.1| hypothetical protein OsI_21398 [Oryza sativa Indica Group]
          Length = 469

 Score =  187 bits (475), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 129/425 (30%), Positives = 208/425 (48%), Gaps = 58/425 (13%)

Query: 92  LILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLN--------YIATIELGG 143
           L  D     Y+Q R+   ++G ++     ++P+++    Q++         Y A   +  
Sbjct: 68  LWSDQHRADYIQWRLSGSVAGVLQPAD--DVPVSTNYEQQSIEGDLNYGTYYPAPAPMSS 125

Query: 144 RNM----------------TVIVDTGSDLTWVQCQPCKS--CYNQQDPVFDPSISPSYKK 185
           + M                T+++DT SD+TWVQC PC +  CY Q+D ++DP+ S S   
Sbjct: 126 KAMNPAATGGGGGGPGVTQTMVLDTASDVTWVQCSPCPTPPCYPQKDVLYDPTKSSSSGV 185

Query: 186 VLCNSSTCHAL-EFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA-SVN 243
             CNS TC  L  +A G    C++++   C Y V Y DG+ T G    + L +  A +V 
Sbjct: 186 FSCNSPTCTQLGPYANG----CTNNN--QCQYRVRYPDGTSTAGTYISDLLTITPATAVR 239

Query: 244 DFIFGCGRNNKGLFG---GVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGS 300
            F FGC    +G F      +G+M LG    SLVSQT+  +G +FS+C P        G 
Sbjct: 240 SFQFGCSHGVQGSFSFGSSAAGIMALGGGPESLVSQTAATYGRVFSHCFPPPTR---RGF 296

Query: 301 LILG-GNSSVFKNSTPITYTNMIPNPQLA-TFYILNLTGISIGGKQLQASG--FAKGGIL 356
             LG    + ++       T M+ NP +  TFY++ L  I++ G+++      FA G  L
Sbjct: 297 FTLGVPRVAAWR----YVLTPMLKNPAIPPTFYMVRLEAIAVAGQRIAVPPTVFAAGAAL 352

Query: 357 IDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFE 416
            DS T ITRLPP+ Y AL+  F  + + +  AP    LDTC++++  +   +P + + F+
Sbjct: 353 -DSRTAITRLPPTAYQALRQAFRDRMAMYQPAPPKGPLDTCYDMAGVRSFALPRITLVFD 411

Query: 417 GNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGF 476
            NA + +D +G+++       Q CLA  +   +   GIIGN Q +   V+Y+   + +GF
Sbjct: 412 KNAAVELDPSGVLF-------QGCLAFTAGPNDQVPGIIGNIQLQTLEVLYNIPAALVGF 464

Query: 477 AGEDC 481
               C
Sbjct: 465 RHAAC 469


>gi|115466060|ref|NP_001056629.1| Os06g0118700 [Oryza sativa Japonica Group]
 gi|55296436|dbj|BAD68559.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|113594669|dbj|BAF18543.1| Os06g0118700 [Oryza sativa Japonica Group]
 gi|215767921|dbj|BAH00150.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 494

 Score =  187 bits (474), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 129/425 (30%), Positives = 208/425 (48%), Gaps = 58/425 (13%)

Query: 92  LILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLN--------YIATIELGG 143
           L  D     Y+Q R+   ++G ++     ++P+++    Q++         Y A   +  
Sbjct: 93  LWSDQHRADYIQWRLSGSVAGVLQPAD--DVPVSTNYEQQSIEGDLNYGTYYPAPAPMSS 150

Query: 144 RNM----------------TVIVDTGSDLTWVQCQPCKS--CYNQQDPVFDPSISPSYKK 185
           + M                T+++DT SD+TWVQC PC +  CY Q+D ++DP+ S S   
Sbjct: 151 KAMNPAATGGGGGGPGVTQTMVLDTASDVTWVQCSPCPTPPCYPQKDVLYDPTKSSSSGV 210

Query: 186 VLCNSSTCHAL-EFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA-SVN 243
             CNS TC  L  +A G    C++++   C Y V Y DG+ T G    + L +  A +V 
Sbjct: 211 FSCNSPTCTQLGPYANG----CTNNN--QCQYRVRYPDGTSTAGTYISDLLTITPATAVR 264

Query: 244 DFIFGCGRNNKGLFG---GVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGS 300
            F FGC    +G F      +G+M LG    SLVSQT+  +G +FS+C P        G 
Sbjct: 265 SFQFGCSHGVQGSFSFGSSAAGIMALGGGPESLVSQTAATYGRVFSHCFPPPTR---RGF 321

Query: 301 LILG-GNSSVFKNSTPITYTNMIPNPQLA-TFYILNLTGISIGGKQLQASG--FAKGGIL 356
             LG    + ++       T M+ NP +  TFY++ L  I++ G+++      FA G  L
Sbjct: 322 FTLGVPRVAAWR----YVLTPMLKNPAIPPTFYMVRLEAIAVAGQRIAVPPTVFAAGAAL 377

Query: 357 IDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFE 416
            DS T ITRLPP+ Y AL+  F  + + +  AP    LDTC++++  +   +P + + F+
Sbjct: 378 -DSRTAITRLPPTAYQALRQAFRDRMAMYQPAPPKGPLDTCYDMAGVRSFALPRITLVFD 436

Query: 417 GNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGF 476
            NA + +D +G+++       Q CLA  +   +   GIIGN Q +   V+Y+   + +GF
Sbjct: 437 KNAAVELDPSGVLF-------QGCLAFTAGPNDQVPGIIGNIQLQTLEVLYNIPAALVGF 489

Query: 477 AGEDC 481
               C
Sbjct: 490 RHAAC 494


>gi|413952718|gb|AFW85367.1| hypothetical protein ZEAMMB73_231535 [Zea mays]
          Length = 443

 Score =  187 bits (474), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 124/369 (33%), Positives = 184/369 (49%), Gaps = 26/369 (7%)

Query: 132 TLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCN 189
           T  Y+  + +G   R + + +DTGSDL W QC PC+ C++Q  PV DP+ S +Y  + C 
Sbjct: 81  TNEYLVRLAVGTPRRPVALTLDTGSDLVWTQCAPCRDCFDQDLPVLDPAASSTYAALPCG 140

Query: 190 SSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA-------SV 242
           ++ C AL F +   GV +  +   C Y   YGD S T GE+  +    G +         
Sbjct: 141 AARCRALPFTS--CGVRTLGNHRSCIYAYHYGDKSLTVGEIATDRFTFGDSGGSGESLHT 198

Query: 243 NDFIFGCGRNNKGLF-GGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSL 301
               FGCG  NKG+F    +G+ G GR   SL SQ +      FSYC  S  ++ +S  +
Sbjct: 199 RRLTFGCGHLNKGVFQSNETGIAGFGRGRWSLPSQLNVTS---FSYCFTSMFESKSS-LV 254

Query: 302 ILGGNSSVF---KNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGGILID 358
            LGG+ +      +S  +  T ++ NP   + Y L+L GIS+G  +L          +ID
Sbjct: 255 TLGGSPAALYSHAHSGEVRTTPILKNPSQPSLYFLSLKGISVGKTRLPVPETKFRSTIID 314

Query: 359 SGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNL---SAYQEVNIPLVKMEF 415
           SG  IT LP  +Y A+KAEF  Q    PS    S LD CF L   + ++   +P + +  
Sbjct: 315 SGASITTLPEEVYEAVKAEFAAQVGLPPSGVEGSALDLCFALPVTALWRRPAVPSLTLHL 374

Query: 416 EGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLG 475
           EG A+  +  +  V F    A  +C+ L +     E  +IGN+QQ+N  V+YD +N +L 
Sbjct: 375 EG-ADWELPRSNYV-FEDLGARVMCIVLDAA--PGEQTVIGNFQQQNTHVVYDLENDRLS 430

Query: 476 FAGEDCSSM 484
           FA   C  +
Sbjct: 431 FAPARCDRL 439


>gi|125553832|gb|EAY99437.1| hypothetical protein OsI_21406 [Oryza sativa Indica Group]
          Length = 409

 Score =  187 bits (474), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 116/351 (33%), Positives = 181/351 (51%), Gaps = 35/351 (9%)

Query: 145 NMTVIVDTGSDLTWVQCQPCK--SCYNQQDPVFDPSISPSYKKVLCNSSTCHAL-EFATG 201
           + TVI+D+GSD+ WVQCQPC    C+ Q+DP+FDP+ S +Y  V C+S+ C  L  +  G
Sbjct: 80  SQTVIIDSGSDVPWVQCQPCPLLVCHPQRDPLFDPATSTTYAAVPCSSAACARLGPYRRG 139

Query: 202 NSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASV-NDFIFGCGRNNKG--LFG 258
               C ++S   C + ++Y +G+   G    + L LG   V   F+FGC   ++G     
Sbjct: 140 ----CLANS--QCQFGITYANGATATGTYSSDDLTLGPYDVVRGFLFGCAHADQGSTFSY 193

Query: 259 GVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNS------SVFKN 312
            V+G + LG    S V QT+  +  +FSYC+P +  +   G ++ G           F +
Sbjct: 194 DVAGTLALGGGSQSFVQQTASQYSRVFSYCVPPSTSS--FGFIMFGVPPQRAALVPTFVS 251

Query: 313 STPITYTNMIPNPQLATFYILNLTGISIGGKQLQASG--FAKGGILIDSGTVITRLPPSI 370
           +  ++ + M P     TFY + L  I + G+ L      F+   + IDS TVI+R+PP+ 
Sbjct: 252 TPLLSSSTMSP-----TFYRVLLRSIIVAGRPLPVPPTVFSASSV-IDSATVISRIPPTA 305

Query: 371 YSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVY 430
           Y AL+A F    + +  AP  SILDTC++ S  + + +P + + F+G A + +D  GI+ 
Sbjct: 306 YQALRAAFRSAMTMYRPAPPVSILDTCYDFSGVRSITLPSIALVFDGGATVNLDAAGILL 365

Query: 431 FVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
                  Q CLA A  + +   G IGN QQ+   V+YD     + F    C
Sbjct: 366 -------QGCLAFAPTASDRMPGFIGNVQQRTLEVVYDVPGKAIRFRSAAC 409


>gi|359476193|ref|XP_003631802.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-2-like [Vitis vinifera]
          Length = 496

 Score =  187 bits (474), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 150/422 (35%), Positives = 221/422 (52%), Gaps = 51/422 (12%)

Query: 77  CSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKD-VSNTEIPLTSGIRLQTLNY 135
           CSG         Q     D   V ++ S+       N+KD   N ++    G      N+
Sbjct: 109 CSGSGHSQPPSPQEIFGRDESRVSFINSKFNQYAPENLKDHTPNNKLFDEDG------NF 162

Query: 136 IATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTC 193
           +  +  G   +  T+I+DTGS +TW QC+PC  C       FDPS S +Y    C  ST 
Sbjct: 163 LVDVAFGTPPQKFTLILDTGSSITWTQCKPCVRCLKASRRHFDPSASLTYSLGSCIPST- 221

Query: 194 HALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASV-NDFIFGCGRN 252
                  GN+            Y ++YGD S + G  G + + L  + V   F FGCGRN
Sbjct: 222 ------VGNT------------YNMTYGDKSTSVGNYGCDTMTLEHSDVFPKFQFGCGRN 263

Query: 253 NKGLFG-GVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFK 311
           N+G FG G  G++GLG+  LS VSQT+  F  +FSYCLP   +  + GSL+ G  ++   
Sbjct: 264 NEGDFGSGADGMLGLGQGQLSTVSQTASKFKKVFSYCLP---EEDSIGSLLFGEKAT--S 318

Query: 312 NSTPITYTNMIPNP-----QLATFYILNLTGISIGGKQLQ--ASGFAKGGILIDSGTVIT 364
            S+ + +T+++  P     + + +Y + L  IS+G K+L   +S FA  G +IDSGTVIT
Sbjct: 319 QSSSLKFTSLVNGPGTSGLEESGYYFVKLLDISVGNKRLNIPSSVFASPGTIIDSGTVIT 378

Query: 365 RLPPSIYSALKAEFLKQFSGFPSAPGF----SILDTCFNLSAYQEVNIPLVKMEFEGNAE 420
           RLP   YSALKA F K  + +P + G      ILDTC+NLS  ++V +P + + F   A+
Sbjct: 379 RLPQRAYSALKAAFKKAMAKYPLSNGRRKKGDILDTCYNLSGRKDVLLPEIVLHFGEGAD 438

Query: 421 MTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGED 480
           + ++   +++   +DAS++CLA A  S   E  IIGN QQ +  V+YD +  ++GF G  
Sbjct: 439 VRLNGKRVIW--GNDASRLCLAFAGNS---ELTIIGNRQQVSLTVLYDIQGGRIGFGGNG 493

Query: 481 CS 482
           CS
Sbjct: 494 CS 495


>gi|242072067|ref|XP_002451310.1| hypothetical protein SORBIDRAFT_05g027510 [Sorghum bicolor]
 gi|241937153|gb|EES10298.1| hypothetical protein SORBIDRAFT_05g027510 [Sorghum bicolor]
          Length = 509

 Score =  187 bits (474), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 130/349 (37%), Positives = 183/349 (52%), Gaps = 28/349 (8%)

Query: 143 GRNMTVIVDTGSDLTWVQCQPCKS--CYNQQDPVFDPSISPSYKKVLCNSSTCHAL-EFA 199
           G    +++DT SD+ WVQC PC +  CY Q D ++DPS S S +   C+S TC  L  +A
Sbjct: 179 GVRQLMLLDTASDVAWVQCFPCPASQCYAQTDVLYDPSKSRSSESFACSSPTCRQLGPYA 238

Query: 200 TGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS-VNDFIFGCGRNNKGLFG 258
            G S   SS+S   C Y V Y DGS T G L  + L L   S V  F FGC    +G F 
Sbjct: 239 NGCSS--SSNSAGQCQYRVRYPDGSTTSGTLVADQLSLSPTSQVPKFEFGCSHAARGSFS 296

Query: 259 --GVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPI 316
               +G+M LGR   SLVSQTS  +G +FSYC P T  A   G  +LG      ++S+  
Sbjct: 297 RSKTAGIMALGRGVQSLVSQTSTKYGQVFSYCFPPT--ASHKGFFVLGVPR---RSSSRY 351

Query: 317 TYTNMIPNPQLATFYILNLTGISIGGKQLQA--SGFAKGGILIDSGTVITRLPPSIYSAL 374
             T M+  P L   Y + L  I++ G++L    + FA G  L DS TVITRLPP+ Y AL
Sbjct: 352 AVTPMLKTPML---YQVRLEAIAVAGQRLDVPPTVFAAGAAL-DSRTVITRLPPTAYQAL 407

Query: 375 KAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGN-AEMTVDVTGIVYFVK 433
           ++ F  + S +  A     LDTC++ +    + +P + + F+   A + +D +G+++   
Sbjct: 408 RSAFRDKMSMYRPAAANGQLDTCYDFTGVSSIMLPTISLVFDRTGAGVQLDPSGVLF--- 464

Query: 434 SDASQVCLALASLSYEDE-TGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
                 CLA AS + +D  TGIIG  Q +   V+Y+     +GF    C
Sbjct: 465 ----GSCLAFASTAGDDRATGIIGFLQLQTIEVLYNVAGGSVGFRRGAC 509


>gi|449527151|ref|XP_004170576.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 523

 Score =  186 bits (473), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 134/372 (36%), Positives = 193/372 (51%), Gaps = 33/372 (8%)

Query: 123 PLTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPC---KSCYNQQDPVFDP 177
           P+TSG       Y A I +G   ++   + DTGSD++W+QCQPC     CY Q  P+FDP
Sbjct: 172 PVTSGASQGAGEYFARIGVGQPVQSYFFVPDTGSDVSWLQCQPCDGENGCYKQIGPIFDP 231

Query: 178 SISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL 237
             S SY  + C+S  CH L+ A      C ++S   C Y V YGDGS+T GEL  E    
Sbjct: 232 KSSSSYSPLSCDSEQCHLLDEA-----ACDANS---CIYEVEYGDGSFTVGELATETFSF 283

Query: 238 GKA-SVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAG 296
             + S+ +   GCG +N+GLF G +GL+GLG   +SL SQ   +    FSYCL    D+ 
Sbjct: 284 RHSNSIPNLPIGCGHDNEGLFVGAAGLIGLGGGAISLSSQ---LEATSFSYCL-VDLDSE 339

Query: 297 ASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA----- 351
           +S +L    +      ++P     ++ N +  TF  + + G+S+GGK L  S  +     
Sbjct: 340 SSSTLDFNADQPSDSLTSP-----LVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFEIDE 394

Query: 352 --KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIP 409
              GGI++DSGT IT +P  +Y  L+  F+      P APG S  DTC++LS+   V +P
Sbjct: 395 SGSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPAPGVSPFDTCYDLSSQSNVEVP 454

Query: 410 LVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDT 469
            +     G   + +     ++ V S A   CLA    ++     IIGN QQ+  RV YD 
Sbjct: 455 TIAFILPGENSLQLPAKNCLFQVDS-AGTFCLAFLPSTF--PLSIIGNVQQQGIRVSYDL 511

Query: 470 KNSQLGFAGEDC 481
            NS +GF+ + C
Sbjct: 512 ANSLVGFSTDKC 523


>gi|297805038|ref|XP_002870403.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297316239|gb|EFH46662.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 440

 Score =  186 bits (473), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 156/450 (34%), Positives = 221/450 (49%), Gaps = 41/450 (9%)

Query: 51  SSSCVSHQKSRIEMGAITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMI 110
           SS  +S+  ++ ++G  T +L H++       +  E    RL  + +H     SR+ +  
Sbjct: 16  SSPFLSNANAKSKLG-FTADLIHRDSPKSPFYNPTETSSQRL-RNAIHRSV--SRVFHFT 71

Query: 111 SGNIKDVSNT--EIPLTSGIRLQTLNYIATIELGGRNMTV--IVDTGSDLTWVQCQPCKS 166
             + KD S+   +I LTS     +  Y+  I LG     +  I DTGSDL W QC+PC  
Sbjct: 72  DISQKDASDNAPQIDLTS----NSGEYLMNISLGTPPFPIMAIADTGSDLLWTQCKPCDD 127

Query: 167 CYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYT 226
           CY Q DP+FDP  S +YK V C+SS C ALE    N   CS+     C+Y  SYGD SYT
Sbjct: 128 CYTQVDPLFDPKASSTYKDVSCSSSQCTALE----NQASCSTED-NTCSYSTSYGDRSYT 182

Query: 227 RGELGREHLGLGKAS-----VNDFIFGCGRNNKGLFG-GVSGLMGLGRSDLSLVSQTSEI 280
           +G +  + L LG        + + I GCG NN G F    SG++GLG   +SL++Q  + 
Sbjct: 183 KGNIAVDTLTLGSTDTRPVQLKNIIIGCGHNNAGTFNKKGSGIVGLGGGAVSLITQLGDS 242

Query: 281 FGGLFSYCL-PSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGIS 339
             G FSYCL P T +   +  +  G N+ V  + T +  T +I   Q  TFY L L  IS
Sbjct: 243 IDGKFSYCLVPLTSENDRTSKINFGTNAVV--SGTGVVSTPLIAKSQ-ETFYYLTLKSIS 299

Query: 340 IGGKQLQ----ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILD 395
           +G K++Q     SG  +G I+IDSGT +T LP   YS L+                + L 
Sbjct: 300 VGSKEVQYPGSDSGSGEGNIIIDSGTTLTLLPTEFYSELEDAVASSIDAEKKQDPQTGLS 359

Query: 396 TCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLAL-ASLSYEDETGI 454
            C+  SA  ++ +P + M F+G     V++     FV+     VC A   S S+     I
Sbjct: 360 LCY--SATGDLKVPAITMHFDG---ADVNLKPSNCFVQISEDLVCFAFRGSPSF----SI 410

Query: 455 IGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
            GN  Q N  V YDT +  + F   DC+ M
Sbjct: 411 YGNVAQMNFLVGYDTVSKTVSFKPTDCAKM 440


>gi|357494221|ref|XP_003617399.1| 60S ribosomal protein L18a [Medicago truncatula]
 gi|355518734|gb|AET00358.1| 60S ribosomal protein L18a [Medicago truncatula]
          Length = 749

 Score =  186 bits (471), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 128/389 (32%), Positives = 194/389 (49%), Gaps = 43/389 (11%)

Query: 124 LTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISP 181
           L SG+ L +  Y   + +G   ++ ++I+DTGSDL W+QC PC +C+ Q  P +DP  S 
Sbjct: 181 LESGVSLGSGEYFMDVFIGTPPKHYSLILDTGSDLNWIQCVPCIACFEQSGPYYDPKESS 240

Query: 182 SYKKVLCNSSTCHALEFATGNSGVCSSSSPPD--------CNYFVSYGDGSYTRGELGRE 233
           S++ + C+   C           + SS  PP         C YF  YGD S T G+   E
Sbjct: 241 SFENITCHDPRC----------KLVSSPDPPKPCKDENQTCPYFYWYGDSSNTTGDFALE 290

Query: 234 HLGL------GKAS---VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGL 284
              +      GK+    V + +FGCG  N+GLF G +GL+GLGR  LS  SQ   I+G  
Sbjct: 291 TFTVNLTTPNGKSEQKHVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFASQLQSIYGHS 350

Query: 285 FSYCL-PSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQ--LATFYILNLTGISIG 341
           FSYCL     D   S  LI G +  +  +   + +T+ +   +  + TFY + +  I + 
Sbjct: 351 FSYCLVDRNSDTSVSSKLIFGEDKELLSHPN-LNFTSFVGGEENSVDTFYYVGIKSIMVD 409

Query: 342 GKQLQA-------SGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSIL 394
           G+ L+        S    GG +IDSGT +T      Y  +K  F+K+  G+    GF  L
Sbjct: 410 GEVLKIPEETWHLSKEGGGGTIIDSGTTLTYFAEPAYEIIKEAFMKKIKGYELVEGFPPL 469

Query: 395 DTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGI 454
             C+N+S  +++ +P   + F   A     V    YF++ +   VCLA+     +    I
Sbjct: 470 KPCYNVSGIEKMELPDFGILFSDGAMWDFPVEN--YFIQIEPDLVCLAILGTP-KSALSI 526

Query: 455 IGNYQQKNQRVIYDTKNSQLGFAGEDCSS 483
           IGNYQQ+N  ++YD K S+LG+A   C++
Sbjct: 527 IGNYQQQNFHILYDMKKSRLGYAPMKCTA 555


>gi|356508918|ref|XP_003523200.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 453

 Score =  185 bits (470), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 145/430 (33%), Positives = 214/430 (49%), Gaps = 44/430 (10%)

Query: 71  LKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRL 130
           L+H +  SGK +   E+ Q+ +      +Q L + +  + +  +      E P+ +G   
Sbjct: 52  LRHVD--SGKNLTKLERVQHGIKRGKSRLQRLNAMV--LAASTLDSEDQLEAPIHAG--- 104

Query: 131 QTLNYIATIELGGRNMT--VIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLC 188
               Y+  + +G   ++   ++DTGSDL W QC+PC  CY Q  P+FDP  S S+ KV C
Sbjct: 105 -NGEYLMELAIGTPPVSYPAVLDTGSDLIWTQCKPCTQCYKQPTPIFDPKKSSSFSKVSC 163

Query: 189 NSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA----SVND 244
            SS C A+  +T + G         C Y  SYGD S T+G L  E    GK+    SV++
Sbjct: 164 GSSLCSAVPSSTCSDG---------CEYVYSYGDYSMTQGVLATETFTFGKSKNKVSVHN 214

Query: 245 FIFGCGRNNKGL-FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLIL 303
             FGCG +N+G  F   SGL+GLGR  LSLVSQ  E     FSYCL    D     S++L
Sbjct: 215 IGFGCGEDNEGDGFEQASGLVGLGRGPLSLVSQLKE---PRFSYCLTPMDDTKE--SILL 269

Query: 304 GGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL--QASGF-----AKGGIL 356
            G+    K++  +  T ++ NP   +FY L+L GIS+G  +L  + S F       GG++
Sbjct: 270 LGSLGKVKDAKEVVTTPLLKNPLQPSFYYLSLEGISVGDTRLSIEKSTFEVGDDGNGGVI 329

Query: 357 IDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNL-SAYQEVNIPLVKMEF 415
           IDSGT IT +    + ALK EF+ Q          + LD CF+L S   +V IP +   F
Sbjct: 330 IDSGTTITYIEQKAFEALKKEFISQTKLPLDKTSSTGLDLCFSLPSGSTQVEIPKIVFHF 389

Query: 416 EGNAEMTVDVTGIVYFV-KSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQL 474
           +G     +++    Y +  S+    CLA+ + S      I GN QQ+N  V +D +   +
Sbjct: 390 KGG---DLELPAENYMIGDSNLGVACLAMGASS---GMSIFGNVQQQNILVNHDLEKETI 443

Query: 475 GFAGEDCSSM 484
            F    C  +
Sbjct: 444 SFVPTSCDQL 453


>gi|449440931|ref|XP_004138237.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 523

 Score =  184 bits (468), Expect = 6e-44,   Method: Compositional matrix adjust.
 Identities = 134/372 (36%), Positives = 191/372 (51%), Gaps = 33/372 (8%)

Query: 123 PLTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPC---KSCYNQQDPVFDP 177
           P+TSG       Y A I +G   ++   + DTGSD++W+QCQPC     CY Q  P+FDP
Sbjct: 172 PVTSGASQGAGEYFARIGVGQPVQSYFFVPDTGSDVSWLQCQPCDGENGCYKQIGPIFDP 231

Query: 178 SISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL 237
             S SY  + C+S  CH L+ A      C ++S   C Y V YGDGS+T GEL  E    
Sbjct: 232 KSSSSYSPLSCDSEQCHLLDEA-----ACDANS---CIYEVEYGDGSFTVGELATETFSF 283

Query: 238 GKA-SVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAG 296
             + S+ +   GCG +N+GLF G  GL+GLG   +SL SQ   +    FSYCL    D+ 
Sbjct: 284 RHSNSIPNLPIGCGHDNEGLFVGADGLIGLGGGAISLSSQ---LEATSFSYCLVDL-DSE 339

Query: 297 ASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA----- 351
           +S +L    +      ++P     ++ N +  TF  + + G+S+GGK L  S  +     
Sbjct: 340 SSSTLDFNADQPSDSLTSP-----LVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFEIDE 394

Query: 352 --KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIP 409
              GGI++DSGT IT +P  +Y  L+  F+      P APG S  DTC++LS+   V +P
Sbjct: 395 SGSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPAPGVSPFDTCYDLSSQSNVEVP 454

Query: 410 LVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDT 469
            +     G   + +     +  V S A   CLA    ++     IIGN QQ+  RV YD 
Sbjct: 455 TIAFILPGENSLQLPAKNCLIQVDS-AGTFCLAFLPSTF--PLSIIGNVQQQGIRVSYDL 511

Query: 470 KNSQLGFAGEDC 481
            NS +GF+ + C
Sbjct: 512 ANSLVGFSTDKC 523


>gi|356569916|ref|XP_003553140.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 560

 Score =  184 bits (466), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 129/392 (32%), Positives = 195/392 (49%), Gaps = 46/392 (11%)

Query: 124 LTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISP 181
           L SG+ L +  Y   + +G   ++ ++I+DTGSDL W+QC PC +C+ Q  P +DP  S 
Sbjct: 184 LESGVSLGSGEYFMDVFVGTPPKHFSLILDTGSDLNWIQCVPCYACFEQNGPYYDPKDSS 243

Query: 182 SYKKVLCNSSTCHALEFATGNSGVCSSSSPPD--------CNYFVSYGDGSYTRGELGRE 233
           S+K + C+   C           + SS  PP         C YF  YGD S T G+   E
Sbjct: 244 SFKNITCHDPRCQ----------LVSSPDPPQPCKGETQSCPYFYWYGDSSNTTGDFALE 293

Query: 234 HLGL------GKAS---VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGL 284
              +      GK     V + +FGCG  N+GLF G +GL+GLGR  LS  +Q   ++G  
Sbjct: 294 TFTVNLTTPEGKPELKIVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFATQLQSLYGHS 353

Query: 285 FSYCL-PSTQDAGASGSLILGGNSSVFKNSTPITYTNMI---PNPQLATFYILNLTGISI 340
           FSYCL     ++  S  LI G +  +  +   + +T+ +    NP + TFY + +  I +
Sbjct: 354 FSYCLVDRNSNSSVSSKLIFGEDKELLSHPN-LNFTSFVGGKENP-VDTFYYVLIKSIMV 411

Query: 341 GGKQLQA-------SGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI 393
           GG+ L+        S    GG +IDSGT +T      Y  +K  F+++  GFP    F  
Sbjct: 412 GGEVLKIPEETWHLSAQGGGGTIIDSGTTLTYFAEPAYEIIKEAFMRKIKGFPLVETFPP 471

Query: 394 LDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQ-VCLALASLSYEDET 452
           L  C+N+S  +++ +P   + F   A     V    YF++ +    VCLA+         
Sbjct: 472 LKPCYNVSGVEKMELPEFAILFADGAMWDFPVEN--YFIQIEPEDVVCLAILGTP-RSAL 528

Query: 453 GIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
            IIGNYQQ+N  ++YD K S+LG+A   C+ +
Sbjct: 529 SIIGNYQQQNFHILYDLKKSRLGYAPMKCADV 560


>gi|125542690|gb|EAY88829.1| hypothetical protein OsI_10302 [Oryza sativa Indica Group]
          Length = 440

 Score =  184 bits (466), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 116/368 (31%), Positives = 188/368 (51%), Gaps = 28/368 (7%)

Query: 134 NYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSS 191
           +Y+    LG   + + + +DT +D TW  C PC +C +    +F P+ S SY  + C+SS
Sbjct: 80  SYVVRAGLGSPSQQLLLALDTSADATWAHCSPCGTCPSSS--LFAPANSSSYASLPCSSS 137

Query: 192 TCHALEFAT------GNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDF 245
            C   +         G       ++ P C +   + D S+ +  L  + L LGK ++ ++
Sbjct: 138 WCPLFQGQACPAPQGGGDAAPPPATLPTCAFSKPFADASF-QAALASDTLRLGKDAIPNY 196

Query: 246 IFGCGRNNKGLFGGVS--GLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLIL 303
            FGC  +  G    +   GL+GLGR  ++L+SQ   ++ G+FSYCLPS +    SGSL L
Sbjct: 197 TFGCVSSVTGPTTNMPRQGLLGLGRGPMALLSQAGSLYNGVFSYCLPSYRSYYFSGSLRL 256

Query: 304 GGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGK--QLQASGFA-----KGGIL 356
           G      ++   + YT M+ NP  ++ Y +N+TG+S+G    ++ A  FA       G +
Sbjct: 257 GAGGGQPRS---VRYTPMLRNPHRSSLYYVNVTGLSVGRAWVKVPAGSFAFDAATGAGTV 313

Query: 357 IDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFE 416
           +DSGTVITR    +Y+AL+ EF +Q +           DTCFN         P V +  +
Sbjct: 314 VDSGTVITRWTAPVYAALREEFRRQVAAPSGYTSLGAFDTCFNTDEVAAGGAPAVTVHMD 373

Query: 417 GNAEMTVDVTGIVYFVKSDASQV-CLALASL--SYEDETGIIGNYQQKNQRVIYDTKNSQ 473
           G  ++ + +   +  + S A+ + CLA+A    +      +I N QQ+N RV++D  NS+
Sbjct: 374 GGVDLALPMENTL--IHSSATPLACLAMAEAPQNVNSVVNVIANLQQQNIRVVFDVANSR 431

Query: 474 LGFAGEDC 481
           +GFA E C
Sbjct: 432 IGFAKESC 439


>gi|297828001|ref|XP_002881883.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297327722|gb|EFH58142.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 529

 Score =  184 bits (466), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 145/466 (31%), Positives = 227/466 (48%), Gaps = 46/466 (9%)

Query: 57  HQKSRIEMGAITLELKHK-NYCSGKIVDWNEQQQNRLILDNLHVQYLQSR------IKNM 109
           H +  +++     E+K +    +  +VD   Q   R+    LH ++ +S+      +K  
Sbjct: 72  HTRESVKLHLRRREIKQETKRTTHSVVDLQIQDLTRI--QTLHARFKKSKKQRNEKVKKK 129

Query: 110 ISGNIKDVSNTEIP-------LTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQ 160
           I+ +I  V   E+        L SG+ L +  Y   + +G   ++ ++I+DTGSDL W+Q
Sbjct: 130 ITSDISLVGAPEVSPGKLIATLESGMTLGSGEYFMDVLVGTPPKHFSLILDTGSDLNWLQ 189

Query: 161 CQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSY 220
           C PC  C++Q +  +DP  S S+K + CN   C  +  ++    V   S    C YF  Y
Sbjct: 190 CLPCYDCFHQNEAFYDPKTSASFKNITCNDPRCSLI--SSPEPPVQCKSDNQSCPYFYWY 247

Query: 221 GDGSYTRGELGREHLGL------GKAS---VNDFIFGCGRNNKGLFGGVSGLMGLGRSDL 271
           GD S T G+   E   +      G++S   V + +FGCG  N+GLF G SGL+GLGR  L
Sbjct: 248 GDRSNTTGDFAVETFTVNLTTTEGRSSEYKVENMMFGCGHWNRGLFSGASGLLGLGRGPL 307

Query: 272 SLVSQTSEIFGGLFSYCL-PSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQ--LA 328
           S  SQ   ++G  FSYCL     D   S  LI G +  +  N T + +T+ +   +  + 
Sbjct: 308 SFSSQLQSLYGHSFSYCLVDRNSDTNVSSKLIFGEDKDLL-NHTNLNFTSFVNGKENSVE 366

Query: 329 TFYILNLTGISIGGKQLQ-------ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQ 381
           TFY + +  I +GG+ L         S    GG +IDSGT ++      Y  +K +F ++
Sbjct: 367 TFYYIQIKSILVGGEALDIPEETWNISPDGAGGTIIDSGTTLSYFAEPAYEIIKNKFAEK 426

Query: 382 F-SGFPSAPGFSILDTCFNLSAYQEVNI--PLVKMEFEGNAEMTVDVTGIVYFVKSDASQ 438
               +     F +LD CFN+S  +E NI  P + + F   A           ++  D   
Sbjct: 427 MKENYLVFRDFPVLDPCFNVSGIEENNIHLPELGIAFADGAVWNFPAENSFIWLSEDL-- 484

Query: 439 VCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
           VCLA+   + +    IIGNYQQ+N  ++YDTK S+LGF    C+ +
Sbjct: 485 VCLAILG-TPKSTFSIIGNYQQQNFHILYDTKMSRLGFTPTKCADI 529


>gi|115451209|ref|NP_001049205.1| Os03g0186900 [Oryza sativa Japonica Group]
 gi|49532749|dbj|BAD26705.1| Radc1 [Oryza sativa Japonica Group]
 gi|108706569|gb|ABF94364.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113547676|dbj|BAF11119.1| Os03g0186900 [Oryza sativa Japonica Group]
 gi|215692805|dbj|BAG88249.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215767626|dbj|BAG99854.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 438

 Score =  183 bits (465), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 116/368 (31%), Positives = 188/368 (51%), Gaps = 28/368 (7%)

Query: 134 NYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSS 191
           +Y+    LG   + + + +DT +D TW  C PC +C +    +F P+ S SY  + C+SS
Sbjct: 78  SYVVRAGLGSPSQQLLLALDTSADATWAHCSPCGTCPSSS--LFAPANSSSYASLPCSSS 135

Query: 192 TCHALEFAT------GNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDF 245
            C   +         G       ++ P C +   + D S+ +  L  + L LGK ++ ++
Sbjct: 136 WCPLFQGQACPAPQGGGDAAPPPATLPTCAFSKPFADASF-QAALASDTLRLGKDAIPNY 194

Query: 246 IFGCGRNNKGLFGGVS--GLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLIL 303
            FGC  +  G    +   GL+GLGR  ++L+SQ   ++ G+FSYCLPS +    SGSL L
Sbjct: 195 TFGCVSSVTGPTTNMPRQGLLGLGRGPMALLSQAGSLYNGVFSYCLPSYRSYYFSGSLRL 254

Query: 304 GGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGK--QLQASGFA-----KGGIL 356
           G      ++   + YT M+ NP  ++ Y +N+TG+S+G    ++ A  FA       G +
Sbjct: 255 GAGGGQPRS---VRYTPMLRNPHRSSLYYVNVTGLSVGHAWVKVPAGSFAFDAATGAGTV 311

Query: 357 IDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFE 416
           +DSGTVITR    +Y+AL+ EF +Q +           DTCFN         P V +  +
Sbjct: 312 VDSGTVITRWTAPVYAALREEFRRQVAAPSGYTSLGAFDTCFNTDEVAAGGAPAVTVHMD 371

Query: 417 GNAEMTVDVTGIVYFVKSDASQV-CLALASL--SYEDETGIIGNYQQKNQRVIYDTKNSQ 473
           G  ++ + +   +  + S A+ + CLA+A    +      +I N QQ+N RV++D  NS+
Sbjct: 372 GGVDLALPMENTL--IHSSATPLACLAMAEAPQNVNSVVNVIANLQQQNIRVVFDVANSR 429

Query: 474 LGFAGEDC 481
           +GFA E C
Sbjct: 430 VGFAKESC 437


>gi|356516413|ref|XP_003526889.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 453

 Score =  183 bits (465), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 147/430 (34%), Positives = 212/430 (49%), Gaps = 43/430 (10%)

Query: 71  LKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRL 130
           L+H +  SGK +   E+ Q+ +      +Q L + +    S       ++E  L + I  
Sbjct: 51  LRHVD--SGKNLTKLERVQHGIKRGKSRLQKLNAMVLAASS-----TPDSEDQLEAPIHA 103

Query: 131 QTLNYIATIELGGRNMT--VIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLC 188
               Y+  + +G   ++   ++DTGSDL W QC+PC  CY Q  P+FDP  S S+ KV C
Sbjct: 104 GNGEYLIELAIGTPPVSYPAVLDTGSDLIWTQCKPCTRCYKQPTPIFDPKKSSSFSKVSC 163

Query: 189 NSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA----SVND 244
            SS C AL  +T + G         C Y  SYGD S T+G L  E    GK+    SV++
Sbjct: 164 GSSLCSALPSSTCSDG---------CEYVYSYGDYSMTQGVLATETFTFGKSKNKVSVHN 214

Query: 245 FIFGCGRNNKGL-FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLIL 303
             FGCG +N+G  F   SGL+GLGR  LSLVSQ  E     FSYCL    D     S++L
Sbjct: 215 IGFGCGEDNEGDGFEQASGLVGLGRGPLSLVSQLKE---QRFSYCLTPIDDTKE--SVLL 269

Query: 304 GGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL--QASGF-----AKGGIL 356
            G+    K++  +  T ++ NP   +FY L+L  IS+G  +L  + S F       GG++
Sbjct: 270 LGSLGKVKDAKEVVTTPLLKNPLQPSFYYLSLEAISVGDTRLSIEKSTFEVGDDGNGGVI 329

Query: 357 IDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNL-SAYQEVNIPLVKMEF 415
           IDSGT IT +    Y ALK EF+ Q          + LD CF+L S   +V IP +   F
Sbjct: 330 IDSGTTITYVQQKAYEALKKEFISQTKLALDKTSSTGLDLCFSLPSGSTQVEIPKLVFHF 389

Query: 416 EGNAEMTVDVTGIVYFV-KSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQL 474
           +G     +++    Y +  S+    CLA+ + S      I GN QQ+N  V +D +   +
Sbjct: 390 KGG---DLELPAENYMIGDSNLGVACLAMGASS---GMSIFGNVQQQNILVNHDLEKETI 443

Query: 475 GFAGEDCSSM 484
            F    C  +
Sbjct: 444 SFVPTSCDQL 453


>gi|357131735|ref|XP_003567490.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 458

 Score =  182 bits (462), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 129/408 (31%), Positives = 205/408 (50%), Gaps = 37/408 (9%)

Query: 99  VQYLQSRIKNMISGNIKDVSNTEIPLTSGIR-LQTLNYIATIELGGRNMTVIV--DTGSD 155
           V  L ++ K    G+ +   +T +P+ +G + L+T +Y+A   LG    T++V  D  +D
Sbjct: 66  VATLAAKPKPKPKGHSR---HTFVPIAAGRQILRTPSYVARARLGTPPQTLLVAIDPSND 122

Query: 156 LTWVQCQPCKSCY-NQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDC 214
             WV C  C  C      P FDP+ S +Y+ V C +  C  +  AT +   C +     C
Sbjct: 123 AAWVPCSACLGCAPGASSPSFDPTQSSTYRPVRCGAPQCAQVPPATPS---CPAGPGASC 179

Query: 215 NYFVSYGDGSYTRGELGREHLGL----GKASVND-FIFGCGRNNKGLFGGVS--GLMGLG 267
            + +SY   S     LG++ L L    G A  +D + FGC R   G  G V   GL+G G
Sbjct: 180 AFNLSYAS-STLHAVLGQDALSLSDSNGAAVPDDHYTFGCLRVVTGSGGSVPPQGLVGFG 238

Query: 268 RSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQL 327
           R  LS +SQT   +G +FSYCLPS + +  SG+L LG      +    I  T ++ NP  
Sbjct: 239 RGPLSFLSQTKATYGSIFSYCLPSYKSSNFSGTLRLGPAGQPRR----IKTTPLLSNPHR 294

Query: 328 ATFYILNLTGISIGGK--QLQASGFA------KGGILIDSGTVITRLPPSIYSALKAEFL 379
            + Y + + G+ + GK   + AS  A      +GG ++D+GT+ TRL P  Y+AL+  F 
Sbjct: 295 PSLYYVAMVGVRVNGKAVPIPASALALDAATGRGGTIVDAGTMFTRLSPPAYAALRNAFR 354

Query: 380 KQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV 439
           +  S  P+AP     DTC+ ++  + V  P V   F G A +T+    +V    +     
Sbjct: 355 RGVSA-PAAPALGGFDTCYYVNGTKSV--PAVAFVFAGGARVTLPEENVV-ISSTSGGVA 410

Query: 440 CLALASLSYEDETG---IIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
           CLA+A+   +       ++ + QQ+N RV++D  N ++GF+ E C+++
Sbjct: 411 CLAMAAGPSDGVNAGLNVLASMQQQNHRVVFDVGNGRVGFSRELCTAV 458


>gi|108707838|gb|ABF95633.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
          Length = 391

 Score =  182 bits (462), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 136/376 (36%), Positives = 191/376 (50%), Gaps = 35/376 (9%)

Query: 130 LQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVL 187
           + T  Y+  + +G   + + + +DTGSDL W QCQPC +C++Q  P FDPS S +     
Sbjct: 30  VPTTEYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTS 89

Query: 188 CNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL--GKASVNDF 245
           C+S+ C  L  A+  S     +    C Y  SYGD S T G L  +        ASV   
Sbjct: 90  CDSTLCQGLPVASCGSPKFWPNQ--TCVYTYSYGDKSVTTGFLEVDKFTFVGAGASVPGV 147

Query: 246 IFGCGRNNKGLF-GGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILG 304
            FGCG  N G+F    +G+ G GR  LSL SQ      G FS+C  +T       +++L 
Sbjct: 148 AFGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKV---GNFSHCF-TTITGAIPSTVLLD 203

Query: 305 GNSSVFKN------STP-ITYTNMIPNPQLATFYILNLTGISIGGKQLQA--SGFA---- 351
             + +F N      +TP I Y     NP   T Y L+L GI++G  +L    S FA    
Sbjct: 204 LPADLFSNGQGAVQTTPLIQYAKNEANP---TLYYLSLKGITVGSTRLPVPESAFALTNG 260

Query: 352 KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILD-TCFNLSAYQEVNIPL 410
            GG +IDSGT IT LPP +Y  ++ EF  Q    P  PG +    TCF+  +  + ++P 
Sbjct: 261 TGGTIIDSGTSITSLPPQVYQVVRDEFAAQIK-LPVVPGNATGHYTCFSAPSQAKPDVPK 319

Query: 411 VKMEFEGNAEMTVDVTGIVYFVKSDA--SQVCLALASLSYEDETGIIGNYQQKNQRVIYD 468
           + + FEG A M +     V+ V  DA  S +CLA   ++  DET IIGN+QQ+N  V+YD
Sbjct: 320 LVLHFEG-ATMDLPRENYVFEVPDDAGNSIICLA---INKGDETTIIGNFQQQNMHVLYD 375

Query: 469 TKNSQLGFAGEDCSSM 484
            +N+ L F    C  +
Sbjct: 376 LQNNMLSFVAAQCDKL 391


>gi|167997964|ref|XP_001751688.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162696786|gb|EDQ83123.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 418

 Score =  182 bits (462), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 128/389 (32%), Positives = 183/389 (47%), Gaps = 18/389 (4%)

Query: 106 IKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQP 163
           I  +I+G        + P+ SG  L +  Y     LG   +  ++IVD+GSDL WVQC P
Sbjct: 35  ITAVIAGPPSHDYGFQSPVVSGSTLGSGQYFVDFFLGTPPQKFSLIVDSGSDLLWVQCSP 94

Query: 164 CKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDG 223
           C+ CY Q  P++ PS S ++  V C SS C  +    G    C    P  C Y   Y D 
Sbjct: 95  CRQCYAQDSPLYVPSNSSTFSPVPCLSSDCLLIPATEGFP--CDFRYPGACAYEYLYADT 152

Query: 224 SYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGG 283
           S ++G    E   +    ++   FGCG +N+G F    G++GLG+  LS  SQ    +G 
Sbjct: 153 SSSKGVFAYESATVDGVRIDKVAFGCGSDNQGSFAAAGGVLGLGQGPLSFGSQVGYAYGN 212

Query: 284 LFSYCLPSTQD-AGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGG 342
            F+YCL +  D    S SLI G    +      + YT ++ NP+  T Y + +  +++GG
Sbjct: 213 KFAYCLVNYLDPTSVSSSLIFG--DELISTIHDMQYTPIVSNPKSPTLYYVQIEKVTVGG 270

Query: 343 KQLQASGFA-------KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILD 395
           K L  S  A        GG + DSGT +T   PS YS + A F      +P A     LD
Sbjct: 271 KSLPISDSAWEIDLLGNGGSIFDSGTTLTYWFPSAYSHILAAFDSGVH-YPRAESVQGLD 329

Query: 396 TCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYE-DETGI 454
            C  L+   + + P   +EF+  A    +     YFV    +  CLA+A L+        
Sbjct: 330 LCVELTGVDQPSFPSFTIEFDDGAVFQPEAEN--YFVDVAPNVRCLAMAGLASPLGGFNT 387

Query: 455 IGNYQQKNQRVIYDTKNSQLGFAGEDCSS 483
           IGN  Q+N  V YD + + +GFA   CSS
Sbjct: 388 IGNLLQQNFFVQYDREENLIGFAPAKCSS 416


>gi|168043550|ref|XP_001774247.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162674374|gb|EDQ60883.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 419

 Score =  182 bits (462), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 133/365 (36%), Positives = 189/365 (51%), Gaps = 40/365 (10%)

Query: 135 YIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
           Y+  I  G   +  +VIVDTGSDL W QC PC++C      +FDP  S +Y  V C S+ 
Sbjct: 80  YLIDISFGSPPQKASVIVDTGSDLIWTQCLPCETCNAAASVIFDPVKSSTYDTVSCASNF 139

Query: 193 CHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRN 252
           C +L F +     C++S    C Y   YGDGS T G L  E + +G  ++ +  FGCG  
Sbjct: 140 CSSLPFQS-----CTTS----CKYDYMYGDGSSTSGALSTETVTVGTGTIPNVAFGCGHT 190

Query: 253 NKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKN 312
           N G F G +G++GLG+  LSL+SQ S I    FSYCL        S  LI  G+S+    
Sbjct: 191 NLGSFAGAAGIVGLGQGPLSLISQASSITSKKFSYCLVPLGSTKTSPMLI--GDSAAAGG 248

Query: 313 STPITYTNMIPNPQLATFYILNLTGISIGGK---------QLQASGFAKGGILIDSGTVI 363
              + YT ++ N    TFY  +LTGIS+ GK          + ASG  +GG ++DSGT +
Sbjct: 249 ---VAYTALLTNTANPTFYYADLTGISVSGKAVTYPVGTFSIDASG--QGGFILDSGTTL 303

Query: 364 TRLPPSIYSALKAEFLKQFSGFPSAPG-FSILDTCFNLSAYQEVNIPLVKMEFEGNAEMT 422
           T L    ++AL A  LK    FP A G    LD CF+ +       P +   F+G A+  
Sbjct: 304 TYLETGAFNALVAA-LKAEVPFPEADGSLYGLDYCFSTAGVANPTYPTMTFHFKG-ADYE 361

Query: 423 VDVTGIVYFVKSD-ASQVCLALASLSYEDETG--IIGNYQQKNQRVIYDTKNSQLGFAGE 479
           +    +  FV  D    +CLA+A+      TG  I+GN QQ+N  +++D  N ++GF   
Sbjct: 362 LPPENV--FVALDTGGSICLAMAA-----STGFSIMGNIQQQNHLIVHDLVNQRVGFKEA 414

Query: 480 DCSSM 484
           +C ++
Sbjct: 415 NCETI 419


>gi|115476828|ref|NP_001062010.1| Os08g0469000 [Oryza sativa Japonica Group]
 gi|42407407|dbj|BAD09565.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
 gi|113623979|dbj|BAF23924.1| Os08g0469000 [Oryza sativa Japonica Group]
 gi|125603713|gb|EAZ43038.1| hypothetical protein OsJ_27627 [Oryza sativa Japonica Group]
          Length = 448

 Score =  182 bits (461), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 132/372 (35%), Positives = 178/372 (47%), Gaps = 41/372 (11%)

Query: 134 NYIATIELGGRNM--TVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSS 191
            Y+  + +G   +  T +VDTGSDL W QC PC  C +Q  P F P+ S +Y+ V C S 
Sbjct: 91  EYLMDLAIGTPPLRYTAMVDTGSDLIWTQCAPCVLCADQPTPYFRPARSATYRLVPCRSP 150

Query: 192 TCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS-----VNDFI 246
            C AL +       C   S   C Y   YGD + T G L  E    G A+     V+D  
Sbjct: 151 LCAALPYPA-----CFQRS--VCVYQYYYGDEASTAGVLASETFTFGAANSSKVMVSDVA 203

Query: 247 FGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL-------PSTQDAGASG 299
           FGCG  N G     SG++GLGR  LSLVSQ        FSYCL       PS  + G   
Sbjct: 204 FGCGNINSGQLANSSGMVGLGRGPLSLVSQLGP---SRFSYCLTSFLSPEPSRLNFGVFA 260

Query: 300 SLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA-------K 352
           +L  G N+S   + +P+  T ++ N  L + Y ++L GIS+G K+L              
Sbjct: 261 TLN-GTNAS--SSGSPVQSTPLVVNAALPSLYFMSLKGISLGQKRLPIDPLVFAINDDGT 317

Query: 353 GGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI-LDTCFNLSAYQE--VNIP 409
           GG+ IDSGT +T L    Y A++ E +      P      I L+TCF         V +P
Sbjct: 318 GGVFIDSGTSLTWLQQDAYDAVRRELVSVLRPLPPTNDTEIGLETCFPWPPPPSVAVTVP 377

Query: 410 LVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDT 469
            +++ F+G A MTV       ++  D +   L LA +   D T IIGNYQQ+N  ++YD 
Sbjct: 378 DMELHFDGGANMTVPPEN---YMLIDGATGFLCLAMIRSGDAT-IIGNYQQQNMHILYDI 433

Query: 470 KNSQLGFAGEDC 481
            NS L F    C
Sbjct: 434 ANSLLSFVPAPC 445


>gi|357113696|ref|XP_003558637.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 432

 Score =  182 bits (461), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 116/364 (31%), Positives = 185/364 (50%), Gaps = 24/364 (6%)

Query: 134 NYIATIELGGRNMTVIV--DTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSS 191
           +Y+    LG     +++  DT +D TW  C PC +C +    +F P+ S SY  + C+S+
Sbjct: 76  SYVVRAGLGSPAQPILLALDTSADATWAHCSPCGTCPSSGS-LFAPANSTSYAPLPCSST 134

Query: 192 TCHALE--FATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGC 249
            C  L+           SS+  P C +   + D S+ +  L  + L LGK ++ ++ FGC
Sbjct: 135 MCTVLQGQPCPAQDPYDSSAPLPMCAFTKPFADASF-QASLASDWLHLGKDAIPNYAFGC 193

Query: 250 GRNNKGLFGGVS--GLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNS 307
                G    +   GL+GLGR  ++L+SQ   ++ G+FSYCLPS +    SGSL LG   
Sbjct: 194 VSAVSGPTANLPKQGLLGLGRGPMALLSQVGNMYNGVFSYCLPSYKSYYFSGSLRLGAAG 253

Query: 308 SVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGK--QLQASGFA-----KGGILIDSG 360
                   + YT M+ NP  ++ Y +N+TG+S+G    ++ A  FA       G ++DSG
Sbjct: 254 QPRG----VRYTPMLKNPNRSSLYYVNVTGLSVGRAPVKVPAGSFAFDPATGAGTVVDSG 309

Query: 361 TVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAE 420
           TVITR  P +Y+AL+ EF +  +           DTCFN         P V +  +G  +
Sbjct: 310 TVITRWTPPVYAALREEFRRHVAAPSGYTSLGAFDTCFNTDEVAAGVAPAVTVHMDGGLD 369

Query: 421 MTVDVTGIVYFVKSDASQV-CLALASL--SYEDETGIIGNYQQKNQRVIYDTKNSQLGFA 477
           + + +   +  + S A+ + CLA+A    +      ++ N QQ+N RV++D  NS++GFA
Sbjct: 370 LALPMENTL--IHSSATPLACLAMAEAPQNVNAVVNVLANLQQQNLRVVFDVANSRVGFA 427

Query: 478 GEDC 481
            E C
Sbjct: 428 RESC 431


>gi|125561848|gb|EAZ07296.1| hypothetical protein OsI_29544 [Oryza sativa Indica Group]
          Length = 448

 Score =  182 bits (461), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 132/372 (35%), Positives = 178/372 (47%), Gaps = 41/372 (11%)

Query: 134 NYIATIELGGRNM--TVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSS 191
            Y+  + +G   +  T +VDTGSDL W QC PC  C +Q  P F P+ S +Y+ V C S 
Sbjct: 91  EYLMDLAIGTPPLRYTAMVDTGSDLIWTQCAPCVLCADQPTPYFRPARSATYRLVPCRSP 150

Query: 192 TCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS-----VNDFI 246
            C AL +       C   S   C Y   YGD + T G L  E    G A+     V+D  
Sbjct: 151 LCAALPYPA-----CFQRS--VCVYQYYYGDEASTAGVLASETFTFGAANSSKVMVSDVA 203

Query: 247 FGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL-------PSTQDAGASG 299
           FGCG  N G     SG++GLGR  LSLVSQ        FSYCL       PS  + G   
Sbjct: 204 FGCGNINSGQLANSSGMVGLGRGPLSLVSQLGP---SRFSYCLTSFLSPEPSRLNFGVFA 260

Query: 300 SLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA-------K 352
           +L  G N+S   + +P+  T ++ N  L + Y ++L GIS+G K+L              
Sbjct: 261 TLN-GTNAS--SSGSPVQSTPLVVNAALPSLYFMSLKGISLGQKRLPIDPLVFAINDDGT 317

Query: 353 GGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI-LDTCFNLSAYQE--VNIP 409
           GG+ IDSGT +T L    Y A++ E +      P      I L+TCF         V +P
Sbjct: 318 GGVFIDSGTSLTWLQQDAYDAVRHELVSVLRPLPPTNDTEIGLETCFPWPPPPSVAVTVP 377

Query: 410 LVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDT 469
            +++ F+G A MTV       ++  D +   L LA +   D T IIGNYQQ+N  ++YD 
Sbjct: 378 DMELHFDGGANMTVPPEN---YMLIDGATGFLCLAMIRSGDAT-IIGNYQQQNMHILYDI 433

Query: 470 KNSQLGFAGEDC 481
            NS L F    C
Sbjct: 434 ANSLLSFVPAPC 445


>gi|15228044|ref|NP_181826.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|20197868|gb|AAM15292.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|20197965|gb|AAD21712.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|330255100|gb|AEC10194.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 527

 Score =  181 bits (460), Expect = 5e-43,   Method: Compositional matrix adjust.
 Identities = 137/433 (31%), Positives = 210/433 (48%), Gaps = 39/433 (9%)

Query: 87  QQQNRLILDNLHVQYLQSR------IKNMISGNIKDVSNTEIP-------LTSGIRLQTL 133
           Q Q+   +  LH ++ +S+      ++  I+ +I  V   E+        L SG+ L + 
Sbjct: 99  QIQDLTRIKTLHARFNKSKKQKNEKVRKKITSDISLVGAPEVSPGKLIATLESGMTLGSG 158

Query: 134 NYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSS 191
            Y   + +G   ++ ++I+DTGSDL W+QC PC  C++Q    +DP  S S+K + CN  
Sbjct: 159 EYFMDVLVGTPPKHFSLILDTGSDLNWLQCLPCYDCFHQNGMFYDPKTSASFKNITCNDP 218

Query: 192 TCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHL---------GLGKASV 242
            C  +  ++ +  V   S    C YF  YGD S T G+   E           G  +  V
Sbjct: 219 RCSLI--SSPDPPVQCESDNQSCPYFYWYGDRSNTTGDFAVETFTVNLTTTEGGSSEYKV 276

Query: 243 NDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL-PSTQDAGASGSL 301
            + +FGCG  N+GLF G SGL+GLGR  LS  SQ   ++G  FSYCL     +   S  L
Sbjct: 277 GNMMFGCGHWNRGLFSGASGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSNTNVSSKL 336

Query: 302 ILGGNSSVFKNSTPITYTNMIPNPQ--LATFYILNLTGISIGGKQLQ-------ASGFAK 352
           I G +  +  N T + +T+ +   +  + TFY + +  I +GGK L         S    
Sbjct: 337 IFGEDKDLL-NHTNLNFTSFVNGKENSVETFYYIQIKSILVGGKALDIPEETWNISSDGD 395

Query: 353 GGILIDSGTVITRLPPSIYSALKAEFLKQF-SGFPSAPGFSILDTCFNLSAYQEVNIPLV 411
           GG +IDSGT ++      Y  +K +F ++    +P    F +LD CFN+S  +E NI L 
Sbjct: 396 GGTIIDSGTTLSYFAEPAYEIIKNKFAEKMKENYPIFRDFPVLDPCFNVSGIEENNIHLP 455

Query: 412 KMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKN 471
           ++          +      F+      VCLA+     +    IIGNYQQ+N  ++YDTK 
Sbjct: 456 ELGIAFVDGTVWNFPAENSFIWLSEDLVCLAILGTP-KSTFSIIGNYQQQNFHILYDTKR 514

Query: 472 SQLGFAGEDCSSM 484
           S+LGF    C+ +
Sbjct: 515 SRLGFTPTKCADI 527


>gi|356553775|ref|XP_003545228.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 559

 Score =  181 bits (460), Expect = 6e-43,   Method: Compositional matrix adjust.
 Identities = 141/476 (29%), Positives = 228/476 (47%), Gaps = 73/476 (15%)

Query: 66  AITLELKHKNYCSG-----KIVDWNEQQQNRLILDNLHVQYLQSRIKNMISG-------- 112
           ++ L LKH++   G      ++D   +   R+   NLH + +++R +N IS         
Sbjct: 100 SVKLHLKHRSGSKGAEPKNSVIDSTVRDLTRI--QNLHRRVIENRNQNTISRLQRLQKEQ 157

Query: 113 ---NIKDV----SNTEIP--------LTSGIRLQTLNYIATIELG--GRNMTVIVDTGSD 155
              + K V    +++  P        L SG+ L +  Y   + +G   ++ ++I+DTGSD
Sbjct: 158 PKQSFKPVFAPAASSTSPVSGQLVATLESGVSLGSGEYFMDVFVGTPPKHFSLILDTGSD 217

Query: 156 LTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPD-- 213
           L W+QC PC +C+ Q  P +DP  S S++ + C+   C           + SS  PP+  
Sbjct: 218 LNWIQCVPCIACFEQSGPYYDPKDSSSFRNISCHDPRCQ----------LVSSPDPPNPC 267

Query: 214 ------CNYFVSYGDGSYTRGELGREHLGL------GKAS---VNDFIFGCGRNNKGLFG 258
                 C YF  YGDGS T G+   E   +      GK+    V + +FGCG  N+GLF 
Sbjct: 268 KAENQSCPYFYWYGDGSNTTGDFALETFTVNLTTPNGKSELKHVENVMFGCGHWNRGLFH 327

Query: 259 GVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL-PSTQDAGASGSLILGGNSSVFKNSTPIT 317
           G +GL+GLG+  LS  SQ   ++G  FSYCL     +A  S  LI G +  +  +   + 
Sbjct: 328 GAAGLLGLGKGPLSFASQMQSLYGQSFSYCLVDRNSNASVSSKLIFGEDKELLSHPN-LN 386

Query: 318 YTNM--IPNPQLATFYILNLTGISIGGKQLQA-------SGFAKGGILIDSGTVITRLPP 368
           +T+     +  + TFY + +  + +  + L+        S    GG +IDSGT +T    
Sbjct: 387 FTSFGGGKDGSVDTFYYVQINSVMVDDEVLKIPEETWHLSSEGAGGTIIDSGTTLTYFAE 446

Query: 369 SIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGI 428
             Y  +K  F+++  G+    G   L  C+N+S  +++ +P   + F   A     V   
Sbjct: 447 PAYEIIKEAFVRKIKGYELVEGLPPLKPCYNVSGIEKMELPDFGILFADGAVWNFPVEN- 505

Query: 429 VYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
            YF++ D   VCLA+   +      IIGNYQQ+N  ++YD K S+LG+A   C+ +
Sbjct: 506 -YFIQIDPDVVCLAILG-NPRSALSIIGNYQQQNFHILYDMKKSRLGYAPMKCADV 559


>gi|357143847|ref|XP_003573077.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 420

 Score =  181 bits (459), Expect = 8e-43,   Method: Compositional matrix adjust.
 Identities = 121/348 (34%), Positives = 183/348 (52%), Gaps = 24/348 (6%)

Query: 148 VIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCS 207
            + DTGSDLTW QCQPCK C+ Q  PV+DPS S ++  + C+S+TC  L   + N   C+
Sbjct: 86  ALADTGSDLTWTQCQPCKLCFPQDTPVYDPSASSTFSPLPCSSATC--LPIWSRN---CT 140

Query: 208 SSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA----SVNDFIFGCGRNNKGLFGGVSGL 263
            SS   C Y  +YGDG+Y+ G LG E L LG +    SV    FGCG +N G     +G 
Sbjct: 141 PSS--LCRYRYAYGDGAYSAGILGTETLTLGPSSAPVSVGGVAFGCGTDNGGDSLNSTGT 198

Query: 264 MGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIP 323
           +GLGR  LSL++Q      G FSYCL    ++      +LG  + +    + +  T ++ 
Sbjct: 199 VGLGRGTLSLLAQLGV---GKFSYCLTDFFNSALDSPFLLGTLAELAPGPSTVQSTPLLQ 255

Query: 324 NPQLATFYILNLTGISIGGKQL-------QASGFAKGGILIDSGTVITRLPPSIYSALKA 376
           +PQ  + Y ++L GIS+G  +L          G   GG+++DSGT  T L  S +  +  
Sbjct: 256 SPQNPSRYFVSLQGISLGDVRLPIPNGTFDLRGDGTGGMIVDSGTTFTILAESGFREVVG 315

Query: 377 EFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDA 436
              +   G P     S+   CF   A +   +P + + F G A+M +     + + + D+
Sbjct: 316 RVARVL-GQPPVNASSLDAPCFPAPAGEPPYMPDLVLHFAGGADMRLYRDNYMSYNEEDS 374

Query: 437 SQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
           S  CL +A  + E  T ++GN+QQ+N ++++DT   QL F   DCS +
Sbjct: 375 S-FCLNIAGTTPE-STSVLGNFQQQNIQMLFDTTVGQLSFLPTDCSKL 420


>gi|359476195|ref|XP_002268758.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
 gi|296082174|emb|CBI21179.3| unnamed protein product [Vitis vinifera]
          Length = 460

 Score =  181 bits (459), Expect = 8e-43,   Method: Compositional matrix adjust.
 Identities = 124/348 (35%), Positives = 180/348 (51%), Gaps = 39/348 (11%)

Query: 144 RNMTVIVDTGSDLTWVQCQPCK--SCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATG 201
           +   +I+DTGSD TW+QC  C   +C+N++   F+PS+S SY    C  ST         
Sbjct: 140 QKFNLIIDTGSDTTWIQCNSCSLGNCHNKK--TFNPSLSSSYSNRSCIPST--------- 188

Query: 202 NSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVS 261
                      D NY + Y D SY++G    + + L       F FGCG +  G FG  S
Sbjct: 189 -----------DTNYTMKYEDNSYSKGVFVCDEVTLKPDVFPKFQFGCGDSGGGEFGTAS 237

Query: 262 GLMGLGRSD-LSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTN 320
           G++GL + +  SL+SQT+  F   FSYC P  +     GSL+ G        S  + +T 
Sbjct: 238 GVLGLAKGEQYSLISQTASKFKKKFSYCFPPKEH--TLGSLLFG--EKAISASPSLKFTQ 293

Query: 321 MIPNPQLATFYILNLTGISIGGKQLQASG--FAKGGILIDSGTVITRLPPSIYSALKAEF 378
           ++ NP     Y + L GIS+  K+L  S   FA  G +IDSGTVITRLP + Y AL+  F
Sbjct: 294 LL-NPPSGLGYFVELIGISVAKKRLNVSSSLFASPGTIIDSGTVITRLPTAAYEALRTAF 352

Query: 379 LKQFSGFPS---APGFSILDTCFNLSAY--QEVNIPLVKMEFEGNAEMTVDVTGIVYFVK 433
            ++    PS    P   +LDTC+NL     + + +P + + F G  ++++  +GI++   
Sbjct: 353 QQEMLHCPSISPPPQEKLLDTCYNLKGCGGRNIKLPEIVLHFVGEVDVSLHPSGILW-AN 411

Query: 434 SDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
            D +Q CLA A  S      IIGN QQ + +V+YD +  +LGF G DC
Sbjct: 412 GDLTQACLAFARKSNPSHVTIIGNRQQVSLKVVYDIEGGRLGF-GNDC 458


>gi|222618833|gb|EEE54965.1| hypothetical protein OsJ_02555 [Oryza sativa Japonica Group]
          Length = 393

 Score =  181 bits (459), Expect = 8e-43,   Method: Compositional matrix adjust.
 Identities = 134/420 (31%), Positives = 190/420 (45%), Gaps = 84/420 (20%)

Query: 65  GAITLELKHK-NYCSGKIVDWNEQQ---QNRLILDNLHVQYLQSRIKN---MISGNIKDV 117
           G  ++ L H+   CS    +  E++   +  L  D L   Y++ +        +G     
Sbjct: 29  GTSSVTLSHRYGPCSPADPNSGEKRPTDEELLRRDQLRADYIRRKFSGSNGTAAGEDGQS 88

Query: 118 SNTEIPLTSGIRLQTLNYIATIELGGRNMT--VIVDTGSDLTWVQCQPCKS---CYNQQD 172
           S   +P T G  L TL Y+ ++ LG   +T  V++DTGSD++WVQC+PC +   C+    
Sbjct: 89  SKVSVPTTLGSSLDTLEYVISVGLGSPAVTQRVVIDTGSDVSWVQCEPCPAPSPCHAHAG 148

Query: 173 PVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGR 232
            +FDP+ S +Y    C+++ C  L   +G +  C + S   C Y V YGDGS T G    
Sbjct: 149 ALFDPAASSTYAAFNCSAAACAQLG-DSGEANGCDAKS--RCQYIVKYGDGSNTTG---- 201

Query: 233 EHLGLGKASVNDFIFGCGRNN--KGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLP 290
                       F FGC       G+     GL+GLG    SLVSQT+            
Sbjct: 202 ----------TGFQFGCSHAELGAGMDDKTDGLIGLGGDAQSLVSQTA------------ 239

Query: 291 STQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL--QAS 348
                                       +  +P     T+Y   L  I++GGK+L    S
Sbjct: 240 --------------------------ARSKKVP-----TYYFAALEDIAVGGKKLGLSPS 268

Query: 349 GFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNI 408
            FA G  L+DSGTVITRLPP+ Y+AL + F    + +  A    ILDTCFN +   +V+I
Sbjct: 269 VFAAGS-LVDSGTVITRLPPAAYAALSSAFRAGMTRYARAEPLGILDTCFNFTGLDKVSI 327

Query: 409 PLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYD 468
           P V + F G A + +D  GIV       S  CLA A    +   G IGN QQ+   V+YD
Sbjct: 328 PTVALVFAGGAVVDLDAHGIV-------SGGCLAFAPTRDDKAFGTIGNVQQRTFEVLYD 380


>gi|125543638|gb|EAY89777.1| hypothetical protein OsI_11319 [Oryza sativa Indica Group]
          Length = 390

 Score =  181 bits (458), Expect = 9e-43,   Method: Compositional matrix adjust.
 Identities = 134/377 (35%), Positives = 192/377 (50%), Gaps = 38/377 (10%)

Query: 130 LQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVL 187
           + T  Y+  + +G   + + + +DTGSDL W QC+PC SC++Q  P FD S S +   + 
Sbjct: 30  VPTTEYLVHLAIGTPPQPVQLTLDTGSDLIWTQCKPCVSCFDQPLPYFDTSRSSTNALLP 89

Query: 188 CNSSTCHALEFATGNSGVCS--SSSPPDCNYFVSYGDGSYTRGELGREHLG-LGKASVND 244
           C S+ C      T    VC   + +   C Y+ SYGD S T G L  +    +   S+  
Sbjct: 90  CESTQCKLDPTVT----VCVKLNQTVQTCAYYTSYGDNSVTIGLLAADKFTFVAGTSLPG 145

Query: 245 FIFGCGRNNKGLFG-GVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLIL 303
             FGCG NN G+F    +G+ G GR  LSL SQ      G FS+C  +T       +++L
Sbjct: 146 VTFGCGLNNTGVFNSNETGIAGFGRGPLSLPSQLKV---GNFSHCF-TTITGAIPSTVLL 201

Query: 304 GGNSSVFKN------STP-ITYTNMIPNPQLATFYILNLTGISIGGKQLQA--SGFA--- 351
              + +F N      +TP I Y     NP   T Y L+L GI++G  +L    S FA   
Sbjct: 202 DLPADLFSNGQGAVQTTPLIQYAKNEANP---TLYYLSLKGITVGSTRLPVPESAFALTN 258

Query: 352 -KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILD-TCFNLSAYQEVNIP 409
             GG +IDSGT IT LPP +Y  ++ EF  Q    P  PG +    TCF+  +  + ++P
Sbjct: 259 GTGGTIIDSGTSITSLPPQVYQVVRDEFAAQIK-LPVVPGNATGHYTCFSAPSQAKPDVP 317

Query: 410 LVKMEFEGNAEMTVDVTGIVYFVKSDA--SQVCLALASLSYEDETGIIGNYQQKNQRVIY 467
            + + FEG A M +     V+ V  DA  S +CLA   ++  DET IIGN+QQ+N  V+Y
Sbjct: 318 KLVLHFEG-ATMDLPRENYVFEVPDDAGNSIICLA---INKGDETTIIGNFQQQNMHVLY 373

Query: 468 DTKNSQLGFAGEDCSSM 484
           D +N+ L F    C  +
Sbjct: 374 DLQNNMLSFVAAQCDKL 390


>gi|15231625|ref|NP_191467.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|15983376|gb|AAL11556.1|AF424562_1 AT3g59080/F17J16_130 [Arabidopsis thaliana]
 gi|7529751|emb|CAB86936.1| putative protein [Arabidopsis thaliana]
 gi|20466704|gb|AAM20669.1| putative protein [Arabidopsis thaliana]
 gi|23198236|gb|AAN15645.1| putative protein [Arabidopsis thaliana]
 gi|332646352|gb|AEE79873.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 535

 Score =  181 bits (458), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 136/440 (30%), Positives = 204/440 (46%), Gaps = 64/440 (14%)

Query: 94  LDNLHVQYLQSRIKNMIS----GNIKDVSNTEIPLTSGIRLQTLNYIATIELG------- 142
           +  LH + L+   +N +S     N K+V  T  P+ S +  Q    +AT+E G       
Sbjct: 111 IQTLHKRVLEKNNQNTVSQKQKKNDKEVVTT-TPVASSVEEQAGQLVATLESGMTLGSGE 169

Query: 143 ----------GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
                      ++ ++I+DTGSDL W+QC PC  C+ Q    +DP  S SYK + CN   
Sbjct: 170 YFMDVLVGSPPKHFSLILDTGSDLNWIQCLPCYDCFQQNGAFYDPKASASYKNITCNDQR 229

Query: 193 CHALEFATGNSGVCSSSSPP--------DCNYFVSYGDGSYTRGELGREHLGLGKAS--- 241
           C+          + SS  PP         C Y+  YGD S T G+   E   +   +   
Sbjct: 230 CN----------LVSSPDPPMPCKSDNQSCPYYYWYGDSSNTTGDFAVETFTVNLTTNGG 279

Query: 242 ------VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL-PSTQD 294
                 V + +FGCG  N+GLF G +GL+GLGR  LS  SQ   ++G  FSYCL     D
Sbjct: 280 SSELYNVENMMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSD 339

Query: 295 AGASGSLILGGNSSVFKNSTPITYTNMIPNPQ--LATFYILNLTGISIGGKQL------- 345
              S  LI G +  +  +   + +T+ +   +  + TFY + +  I + G+ L       
Sbjct: 340 TNVSSKLIFGEDKDLLSHPN-LNFTSFVAGKENLVDTFYYVQIKSILVAGEVLNIPEETW 398

Query: 346 QASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSG-FPSAPGFSILDTCFNLSAYQ 404
             S    GG +IDSGT ++      Y  +K +  ++  G +P    F ILD CFN+S   
Sbjct: 399 NISSDGAGGTIIDSGTTLSYFAEPAYEFIKNKIAEKAKGKYPVYRDFPILDPCFNVSGIH 458

Query: 405 EVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQR 464
            V +P + + F   A           ++  D   VCLA+     +    IIGNYQQ+N  
Sbjct: 459 NVQLPELGIAFADGAVWNFPTENSFIWLNEDL--VCLAMLGTP-KSAFSIIGNYQQQNFH 515

Query: 465 VIYDTKNSQLGFAGEDCSSM 484
           ++YDTK S+LG+A   C+ +
Sbjct: 516 ILYDTKRSRLGYAPTKCADI 535


>gi|449441139|ref|XP_004138341.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
 gi|449477464|ref|XP_004155031.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 336

 Score =  180 bits (457), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 126/344 (36%), Positives = 176/344 (51%), Gaps = 31/344 (9%)

Query: 149 IVDTGSDLTWVQCQPC---KSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGV 205
           ++DTGSD+TW+QC PC     CY Q  P+FDP +S SY  V C+S  C  L+ A  N   
Sbjct: 13  VLDTGSDVTWLQCLPCAGKNGCYEQITPIFDPELSSSYNPVSCDSEQCQLLDEAGCNVN- 71

Query: 206 CSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCGRNNKGLFGGVSGLM 264
                   C Y V YGDGS+T GEL  E L    + S+ +   GCG +N+GLF G  GL+
Sbjct: 72  -------SCIYKVEYGDGSFTIGELATETLTFVHSNSIPNISIGCGHDNEGLFVGADGLI 124

Query: 265 GLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPN 324
           GLG   +S+ SQ   +    FSYCL    D  +     L  N+    +S     + ++ N
Sbjct: 125 GLGGGAISISSQ---LKASSFSYCL---VDIDSPSFSTLDFNTDPPSDSL---ISPLVKN 175

Query: 325 PQLATFYILNLTGISIGGKQLQASGF-------AKGGILIDSGTVITRLPPSIYSALKAE 377
            +  +F  + + G+S+GGK L  S           GGI++DSGT IT+LP  +Y  L+  
Sbjct: 176 DRFPSFRYVKVIGMSVGGKPLPISSSRFEIDESGLGGIIVDSGTTITQLPSDVYEVLREA 235

Query: 378 FLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDAS 437
           FL   +  P AP  S  DTC++LS+   V +P +     G   + +     +  V S A 
Sbjct: 236 FLGLTTNLPPAPEISPFDTCYDLSSQSNVEVPTIAFILPGENSLQLPAKNCLIQVDS-AG 294

Query: 438 QVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
             CLA  S ++     IIGN+QQ+  RV YD  NS +GF+   C
Sbjct: 295 TFCLAFVSATF--PLSIIGNFQQQGIRVSYDLTNSLVGFSTNKC 336


>gi|242084330|ref|XP_002442590.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
 gi|241943283|gb|EES16428.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
          Length = 494

 Score =  180 bits (457), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 126/373 (33%), Positives = 183/373 (49%), Gaps = 36/373 (9%)

Query: 134 NYIATIELGGRNMTVIV--DTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSS 191
            Y+A I +G   +  ++  DT SDLTW+QCQPC+ CY Q  PVFDP  S SY ++  ++ 
Sbjct: 133 EYMAKIAVGTPAVQALLALDTASDLTWLQCQPCRRCYPQSGPVFDPRHSTSYGEMNYDAP 192

Query: 192 TCHALEFATGNSGVCSSSSPPDCNYFVSYGDG----SYTRGELGREHLGLGKASVNDFI- 246
            C AL  + G      +     C Y V YGDG    S + G+L  E L         ++ 
Sbjct: 193 DCQALGRSGGGDAKRGT-----CIYTVQYGDGHGSTSTSVGDLVEETLTFAGGVRQAYLS 247

Query: 247 FGCGRNNKGLFGG-VSGLMGLGRSDLSLVSQTSEI-FGGLFSYCLPSTQDAGASGSLILG 304
            GCG +NKGLFG   +G++GLGR  +S+  Q + + +   FSYCL        S S  L 
Sbjct: 248 IGCGHDNKGLFGAPAAGILGLGRGQISIPHQIAFLGYNASFSYCLVDFISGPGSPSSTLT 307

Query: 305 GNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGG--------KQLQASGF-AKGGI 355
             +     S P ++T  + N  + TFY + L G+S+GG        + LQ   +  +GG+
Sbjct: 308 FGAGAVDTSPPASFTPTVLNQNMPTFYYVRLIGVSVGGVRVPGVTERDLQLDPYTGRGGV 367

Query: 356 LIDSGTVITRLPPSIY------SALKAEFLKQFS-GFPSAPGFSILDTCFNLSAYQEVNI 408
           ++DSGT +TRL    Y          A  L Q S G PS     + DTC+ +     V +
Sbjct: 368 ILDSGTTVTRLARPAYVAFRDAFRAAATSLGQVSTGGPSG----LFDTCYTVGGRAGVKV 423

Query: 409 PLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYD 468
           P V M F G  E+++     +  V S  + VC A A    +    +IGN  Q+  RV+YD
Sbjct: 424 PAVSMHFAGGVEVSLQPKNYLIPVDSRGT-VCFAFAGTG-DRSVSVIGNILQQGFRVVYD 481

Query: 469 TKNSQLGFAGEDC 481
               ++GFA  +C
Sbjct: 482 LAGQRVGFAPNNC 494


>gi|115442309|ref|NP_001045434.1| Os01g0954900 [Oryza sativa Japonica Group]
 gi|20161865|dbj|BAB90778.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
 gi|113534965|dbj|BAF07348.1| Os01g0954900 [Oryza sativa Japonica Group]
 gi|215766867|dbj|BAG99095.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 445

 Score =  180 bits (457), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 122/384 (31%), Positives = 194/384 (50%), Gaps = 34/384 (8%)

Query: 115 KDVSNTEIPLTSGIRLQTL-NYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQ 171
           K+ +N  +P+  G ++ ++ NYIA   LG   + + V +D  +D  WV C  C  C    
Sbjct: 81  KNRANPPVPIAPGRQILSIPNYIARAGLGTPAQTLLVAIDPSNDAAWVPCSACAGCA-AS 139

Query: 172 DPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELG 231
            P F P+ S +Y+ V C S  C  +   +  +GV SS     C + ++Y   ++ +  LG
Sbjct: 140 SPSFSPTQSSTYRTVPCGSPQCAQVPSPSCPAGVGSS-----CGFNLTYAASTF-QAVLG 193

Query: 232 REHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPS 291
           ++ L L    V  + FGC R   G      GL+G GR  LS +SQT + +G +FSYCLP+
Sbjct: 194 QDSLALENNVVVSYTFGCLRVVSGNSVPPQGLIGFGRGPLSFLSQTKDTYGSVFSYCLPN 253

Query: 292 TQDAGASGSLILG--GNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASG 349
            + +  SG+L LG  G     K +TP+ Y     NP   + Y +N+ GI +G K +Q   
Sbjct: 254 YRSSNFSGTLKLGPIGQPKRIK-TTPLLY-----NPHRPSLYYVNMIGIRVGSKVVQVPQ 307

Query: 350 FAKG-------GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSA 402
            A         G +ID+GT+ TRL   +Y+A++  F  +    P AP     DTC+N++ 
Sbjct: 308 SALAFNPVTGSGTIIDAGTMFTRLAAPVYAAVRDAFRGRVR-TPVAPPLGGFDTCYNVT- 365

Query: 403 YQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALA---SLSYEDETGIIGNYQ 459
              V++P V   F G   +T+    ++    S     CLA+A   S        ++ + Q
Sbjct: 366 ---VSVPTVTFMFAGAVAVTLPEENVMIH-SSSGGVACLAMAAGPSDGVNAALNVLASMQ 421

Query: 460 QKNQRVIYDTKNSQLGFAGEDCSS 483
           Q+NQRV++D  N ++GF+ E C++
Sbjct: 422 QQNQRVLFDVANGRVGFSRELCTA 445


>gi|125529158|gb|EAY77272.1| hypothetical protein OsI_05246 [Oryza sativa Indica Group]
          Length = 426

 Score =  180 bits (457), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 122/384 (31%), Positives = 194/384 (50%), Gaps = 34/384 (8%)

Query: 115 KDVSNTEIPLTSGIRLQTL-NYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQ 171
           K+ +N  +P+  G ++ ++ NYIA   LG   + + V +D  +D  WV C  C  C    
Sbjct: 62  KNRANPPVPIAPGRQILSIPNYIARAGLGTPAQTLLVAIDPSNDAAWVPCSACAGCA-AS 120

Query: 172 DPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELG 231
            P F P+ S +Y+ V C S  C  +   +  +GV SS     C + ++Y   ++ +  LG
Sbjct: 121 SPSFSPTQSSTYRTVPCGSPQCAQVPSPSCPAGVGSS-----CGFNLTYAASTF-QAVLG 174

Query: 232 REHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPS 291
           ++ L L    V  + FGC R   G      GL+G GR  LS +SQT + +G +FSYCLP+
Sbjct: 175 QDSLALENNVVVSYTFGCLRVVSGNSVPPQGLIGFGRGPLSFLSQTKDTYGSVFSYCLPN 234

Query: 292 TQDAGASGSLILG--GNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASG 349
            + +  SG+L LG  G     K +TP+ Y     NP   + Y +N+ GI +G K +Q   
Sbjct: 235 YRSSNFSGTLKLGPIGQPKRIK-TTPLLY-----NPHRPSLYYVNMIGIRVGSKVVQVPQ 288

Query: 350 FAKG-------GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSA 402
            A         G +ID+GT+ TRL   +Y+A++  F  +    P AP     DTC+N++ 
Sbjct: 289 SALAFNPVTGSGTIIDAGTMFTRLAAPVYAAVRDAFRGRVR-TPVAPPLGGFDTCYNVT- 346

Query: 403 YQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALA---SLSYEDETGIIGNYQ 459
              V++P V   F G   +T+    ++    S     CLA+A   S        ++ + Q
Sbjct: 347 ---VSVPTVTFMFAGAVAVTLPEENVMIH-SSSGGVACLAMAAGPSDGVNAALNVLASMQ 402

Query: 460 QKNQRVIYDTKNSQLGFAGEDCSS 483
           Q+NQRV++D  N ++GF+ E C++
Sbjct: 403 QQNQRVLFDVANGRVGFSRELCTA 426


>gi|449454652|ref|XP_004145068.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
 gi|449470630|ref|XP_004153019.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 435

 Score =  180 bits (456), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 140/435 (32%), Positives = 205/435 (47%), Gaps = 48/435 (11%)

Query: 68  TLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNT-EIPLTS 126
           T+EL H++  S K   +N  +         H   +   ++  IS N   V+NT E P+ +
Sbjct: 31  TVELIHRD--SPKSPMYNPLEN--------HYHRVADTLRRSISHNTGLVTNTVEAPIYN 80

Query: 127 GIRLQTLNYIATIELGGRNMTVIV--DTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYK 184
                   Y+  + +G     +I   DTGSD+ W QC+PC +CY Q  P+F+PS S +Y+
Sbjct: 81  ----NRGEYLMKLSVGTPPFPIIAVADTGSDIIWTQCEPCTNCYQQDLPMFNPSKSTTYR 136

Query: 185 KVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVND 244
           KV C+S  C      TG    CS    PDC Y +SYGD S+++G+   + L +G  S   
Sbjct: 137 KVSCSSPVCS----FTGEDNSCSFK--PDCTYSISYGDNSHSQGDFAVDTLTMGSTSGRV 190

Query: 245 FIF-----GCGRNNKGLF-GGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL-PSTQDAGA 297
             F     GCG +N G F   VSG++GLG    SL+ Q     GG FSYCL P   D G 
Sbjct: 191 VAFPRTAIGCGHDNAGSFDANVSGIVGLGLGPASLIKQMGSAVGGKFSYCLTPIGNDDGG 250

Query: 298 SGSLILGGNSSVFKN---STPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGF---- 350
           S  L  G N++V  +   STPI  ++     +  +FY L L  +S+G      S      
Sbjct: 251 SNKLNFGSNANVSGSGAVSTPIYISD-----KFKSFYSLKLKAVSVGRNNTFYSTANSIL 305

Query: 351 -AKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIP 409
             K  I+IDSGT +T LP  +Y           +   +      L+ CF  +   +  +P
Sbjct: 306 GGKANIIIDSGTTLTLLPVDLYHNFAKAISNSINLQRTDDPNQFLEYCFETTT-DDYKVP 364

Query: 410 LVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDT 469
            + M FEG A + +    ++  V  +   +CLA A  + +++  I GN  Q N  V YD 
Sbjct: 365 FIAMHFEG-ANLRLQRENVLIRVSDNV--ICLAFAG-AQDNDISIYGNIAQINFLVGYDV 420

Query: 470 KNSQLGFAGEDCSSM 484
            N  L F   +C +M
Sbjct: 421 TNMSLSFKPMNCVAM 435


>gi|15242803|ref|NP_198319.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|75130158|sp|Q6XBF8.1|CDR1_ARATH RecName: Full=Aspartic proteinase CDR1; AltName: Full=Protein
           CONSTITUTIVE DISEASE RESISTANCE 1; Flags: Precursor
 gi|37935737|gb|AAP72988.1| CDR1 [Arabidopsis thaliana]
 gi|91806924|gb|ABE66189.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|109946613|gb|ABG48485.1| At5g33340 [Arabidopsis thaliana]
 gi|332006513|gb|AED93896.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 437

 Score =  180 bits (456), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 144/403 (35%), Positives = 202/403 (50%), Gaps = 43/403 (10%)

Query: 105 RIKNMISGNIKDV------SNT---EIPLTSGIRLQTLNYIATIELGGRNMTV--IVDTG 153
           R++N I  ++  V       NT   +I LTS     +  Y+  + +G     +  I DTG
Sbjct: 55  RLRNAIHRSVNRVFHFTEKDNTPQPQIDLTSN----SGEYLMNVSIGTPPFPIMAIADTG 110

Query: 154 SDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPD 213
           SDL W QC PC  CY Q DP+FDP  S +YK V C+SS C ALE    N   CS++    
Sbjct: 111 SDLLWTQCAPCDDCYTQVDPLFDPKTSSTYKDVSCSSSQCTALE----NQASCSTND-NT 165

Query: 214 CNYFVSYGDGSYTRGELGREHLGLGKAS-----VNDFIFGCGRNNKGLFG-GVSGLMGLG 267
           C+Y +SYGD SYT+G +  + L LG +      + + I GCG NN G F    SG++GLG
Sbjct: 166 CSYSLSYGDNSYTKGNIAVDTLTLGSSDTRPMQLKNIIIGCGHNNAGTFNKKGSGIVGLG 225

Query: 268 RSDLSLVSQTSEIFGGLFSYCL-PSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQ 326
              +SL+ Q  +   G FSYCL P T     +  +  G N+ V  + + +  T +I    
Sbjct: 226 GGPVSLIKQLGDSIDGKFSYCLVPLTSKKDQTSKINFGTNAIV--SGSGVVSTPLIAKAS 283

Query: 327 LATFYILNLTGISIGGKQLQ----ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQF 382
             TFY L L  IS+G KQ+Q     S  ++G I+IDSGT +T LP   YS L+       
Sbjct: 284 QETFYYLTLKSISVGSKQIQYSGSDSESSEGNIIIDSGTTLTLLPTEFYSELEDAVASSI 343

Query: 383 SGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLA 442
                    S L  C+  SA  ++ +P++ M F+G A++ +D +    FV+     VC A
Sbjct: 344 DAEKKQDPQSGLSLCY--SATGDLKVPVITMHFDG-ADVKLDSSNA--FVQVSEDLVCFA 398

Query: 443 L-ASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
              S S+     I GN  Q N  V YDT +  + F   DC+ M
Sbjct: 399 FRGSPSF----SIYGNVAQMNFLVGYDTVSKTVSFKPTDCAKM 437


>gi|356537728|ref|XP_003537377.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 543

 Score =  179 bits (455), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 139/433 (32%), Positives = 195/433 (45%), Gaps = 56/433 (12%)

Query: 87  QQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELGG--R 144
           QQQN L   N  V  L+S  K   SGNI         L SG  L T  Y   + +G   +
Sbjct: 132 QQQNNLA--NAFVASLESS-KGEFSGNIMAT------LESGASLGTGEYFLDMFVGTPPK 182

Query: 145 NMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSG 204
           ++ +I+DTGSDL+W+QC PC  C+ Q    + P  S +Y+ + C    C           
Sbjct: 183 HVWLILDTGSDLSWIQCDPCYDCFEQNGSHYYPKDSSTYRNISCYDPRCQ---------- 232

Query: 205 VCSSSSP--------PDCNYFVSYGDGSYTRGELGREHLGL------GKAS---VNDFIF 247
           + SSS P          C YF  Y DGS T G+   E   +      GK     V D +F
Sbjct: 233 LVSSSDPLQHCKAENQTCPYFYDYADGSNTTGDFASETFTVNLTWPNGKEKFKQVVDVMF 292

Query: 248 GCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPST-QDAGASGSLILGGN 306
           GCG  NKG F G SGL+GLGR  +S  SQ   I+G  FSYCL     +   S  LI G +
Sbjct: 293 GCGHWNKGFFYGASGLLGLGRGPISFPSQIQSIYGHSFSYCLTDLFSNTSVSSKLIFGED 352

Query: 307 SSVFKNSTPITYTNMIPNPQL--ATFYILNLTGISIGGKQLQAS------------GFAK 352
             +  N   + +T ++   +    TFY L +  I +GG+ L  S              A 
Sbjct: 353 KELLNNHN-LNFTTLLAGEETPDETFYYLQIKSIMVGGEVLDISEQTWHWSSEGAAADAG 411

Query: 353 GGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLS-AYQEVNIPLV 411
           GG +IDSG+ +T  P S Y  +K  F K+      A    ++  C+N+S A  +V +P  
Sbjct: 412 GGTIIDSGSTLTFFPDSAYDIIKEAFEKKIKLQQIAADDFVMSPCYNVSGAMMQVELPDF 471

Query: 412 KMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKN 471
            + F              Y  + D   +CLA+          IIGN  Q+N  ++YD K 
Sbjct: 472 GIHFADGGVWNFPAENYFYQYEPDEV-ICLAIMKTPNHSHLTIIGNLLQQNFHILYDVKR 530

Query: 472 SQLGFAGEDCSSM 484
           S+LG++   C+ +
Sbjct: 531 SRLGYSPRRCAEV 543


>gi|116831531|gb|ABK28718.1| unknown [Arabidopsis thaliana]
          Length = 438

 Score =  179 bits (455), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 134/364 (36%), Positives = 186/364 (51%), Gaps = 30/364 (8%)

Query: 135 YIATIELGGRNMTV--IVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
           Y+  + +G     +  I DTGSDL W QC PC  CY Q DP+FDP  S +YK V C+SS 
Sbjct: 90  YLMNVSIGTPPFPIMAIADTGSDLLWTQCAPCDDCYTQVDPLFDPKTSSTYKDVSCSSSQ 149

Query: 193 CHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS-----VNDFIF 247
           C ALE    N   CS++    C+Y +SYGD SYT+G +  + L LG +      + + I 
Sbjct: 150 CTALE----NQASCSTND-NTCSYSLSYGDNSYTKGNIAVDTLTLGSSDTRPMQLKNIII 204

Query: 248 GCGRNNKGLFG-GVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL-PSTQDAGASGSLILGG 305
           GCG NN G F    SG++GLG   +SL+ Q  +   G FSYCL P T     +  +  G 
Sbjct: 205 GCGHNNAGTFNKKGSGIVGLGGGPVSLIKQLGDSIDGKFSYCLVPLTSKKDQTSKINFGT 264

Query: 306 NSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ----ASGFAKGGILIDSGT 361
           N+ V  + + +  T +I      TFY L L  IS+G KQ+Q     S  ++G I+IDSGT
Sbjct: 265 NAIV--SGSGVVSTPLIAKASQETFYYLTLKSISVGSKQIQYSGSDSESSEGNIIIDSGT 322

Query: 362 VITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEM 421
            +T LP   YS L+                S L  C+  SA  ++ +P++ M F+G A++
Sbjct: 323 TLTLLPTEFYSELEDAVASSIDAEKKQDPQSGLSLCY--SATGDLKVPVITMHFDG-ADV 379

Query: 422 TVDVTGIVYFVKSDASQVCLAL-ASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGED 480
            +D +    FV+     VC A   S S+     I GN  Q N  V YDT +  + F   D
Sbjct: 380 KLDSSNA--FVQVSEDLVCFAFRGSPSF----SIYGNVAQMNFLVGYDTVSKTVSFKPTD 433

Query: 481 CSSM 484
           C+ M
Sbjct: 434 CAKM 437


>gi|242046218|ref|XP_002460980.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
 gi|241924357|gb|EER97501.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
          Length = 517

 Score =  179 bits (455), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 130/384 (33%), Positives = 188/384 (48%), Gaps = 29/384 (7%)

Query: 124 LTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISP 181
           + SG+ + +  Y+  + +G   R   +I+DTGSDL W+QC PC  C++Q  PVFDP+ S 
Sbjct: 140 VESGVAVGSGEYLMDVYVGTPPRRFRMIMDTGSDLNWLQCAPCLDCFDQVGPVFDPAASS 199

Query: 182 SYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL---- 237
           SY+ V C    C  L         C       C Y+  YGD S T G+L  E   +    
Sbjct: 200 SYRNVTCGDQRC-GLVAPPEPPRACRRPGEDSCPYYYWYGDQSNTTGDLALESFTVNLTA 258

Query: 238 --GKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDA 295
                 V+D +FGCG  N+GLF G +GL+GLGR  LS  SQ   ++G  FSYCL      
Sbjct: 259 PGASRRVDDVVFGCGHWNRGLFHGAAGLLGLGRGPLSFASQLRAVYGHTFSYCLVDHGSD 318

Query: 296 GASGSLILGGNSSVFKNSTP-ITYTNMIPNPQLA-TFYILNLTGISIGGKQLQASG---- 349
            AS  +    ++     + P + YT   P    A TFY + L G+ +GG+ L  S     
Sbjct: 319 VASKVVFGEDDALALAAAHPQLNYTAFAPASSPADTFYYVKLKGVLVGGELLNISSDTWG 378

Query: 350 -----FAKGGILIDSGTVITRLPPSIYSALKAEFLKQF-SGFPSAPGFSILDTCFNLSAY 403
                   GG +IDSGT ++      Y  ++  F+ +    +P  P F +L  C+N+S  
Sbjct: 379 VGEGEGGSGGTIIDSGTTLSYFVEPAYQVIRQAFIDRMGRSYPLIPDFPVLSPCYNVSGV 438

Query: 404 QEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV-CLALASLSYEDETG--IIGNYQQ 460
               +P + + F   A    D     YF++ D   + CLA+        TG  IIGN+QQ
Sbjct: 439 DRPEVPELSLLFADGA--VWDFPAENYFIRLDPDGIMCLAVLGTP---RTGMSIIGNFQQ 493

Query: 461 KNQRVIYDTKNSQLGFAGEDCSSM 484
           +N  V+YD KN++LGFA   C+ +
Sbjct: 494 QNFHVVYDLKNNRLGFAPRRCAEV 517


>gi|297817208|ref|XP_002876487.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297322325|gb|EFH52746.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 520

 Score =  179 bits (454), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 134/442 (30%), Positives = 203/442 (45%), Gaps = 69/442 (15%)

Query: 94  LDNLHVQYLQSRIKNMISGNIKDVSNTEI---PLTSGIRLQTLNYIATIELG-------- 142
           +  LH + L  + +N +S   K   N E+   P+ S +  Q    +AT+E G        
Sbjct: 97  IQTLHKRVLAKKNQNTVSQKQKK-KNKEVVTTPVASSVEEQAGQLVATLESGMTLGSGEY 155

Query: 143 ---------GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTC 193
                     ++ ++I+DTGSDL W+QC PC  C+ Q    +DP  S SYK + CN   C
Sbjct: 156 FMDVLVGSPPKHFSLILDTGSDLNWIQCLPCHDCFQQNGAFYDPKASASYKNITCNDPRC 215

Query: 194 HALEFATGNSGVCSSSSPPD-----------CNYFVSYGDGSYTRGELGREHLGLGKAS- 241
           + +             SPPD           C Y+  YGD S T G+   E   +   + 
Sbjct: 216 NLV-------------SPPDPPKPCKSDNQSCPYYYWYGDSSNTTGDFAVETFTVNLTTS 262

Query: 242 --------VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL-PST 292
                   V + +FGCG  N+GLF G +GL+GLGR  LS  SQ   ++G  FSYCL    
Sbjct: 263 GGSSELYNVENMMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRN 322

Query: 293 QDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQ--LATFYILNLTGISIGGKQL----- 345
            D   S  LI G +  +  +   + +T+ +   +  + TFY + +  I + G+ L     
Sbjct: 323 SDTNVSSKLIFGEDKDLLSHPN-LNFTSFVARKENLVDTFYYVQIKSIIVAGEVLNIPEE 381

Query: 346 --QASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSG-FPSAPGFSILDTCFNLSA 402
               S    GG +IDSGT ++      Y  +K +  ++  G +P    F ILD CFN+S 
Sbjct: 382 TWNISSDGAGGTIIDSGTTLSYFAEPAYEFIKNKIAEKAKGKYPVYRDFPILDPCFNVSG 441

Query: 403 YQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKN 462
              + +P + + F   A           ++  D   VCLA+     +    IIGNYQQ+N
Sbjct: 442 IDSIQLPELGIAFADGAVWNFPTENSFIWLNEDL--VCLAILGTP-KSAFSIIGNYQQQN 498

Query: 463 QRVIYDTKNSQLGFAGEDCSSM 484
             ++YDTK S+LG+A   C+ +
Sbjct: 499 FHILYDTKRSRLGYAPTKCADI 520


>gi|15217764|ref|NP_176663.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|5042418|gb|AAD38257.1|AC006193_13 Hypothetical Protein [Arabidopsis thaliana]
 gi|332196174|gb|AEE34295.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 431

 Score =  179 bits (454), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 133/371 (35%), Positives = 191/371 (51%), Gaps = 46/371 (12%)

Query: 135 YIATIELGGRNMTV--IVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
           Y+  I +G   + +  I DTGSDL W QC PC+ CY Q  P+FDP  S +Y+KV C+SS 
Sbjct: 86  YLMNISIGTPPVPILAIADTGSDLIWTQCNPCEDCYQQTSPLFDPKESSTYRKVSCSSSQ 145

Query: 193 CHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA-----SVNDFIF 247
           C ALE A+     CS+     C+Y ++YGD SYT+G++  + + +G +     S+ + I 
Sbjct: 146 CRALEDAS-----CSTDENT-CSYTITYGDNSYTKGDVAVDTVTMGSSGRRPVSLRNMII 199

Query: 248 GCGRNNKGLFG-GVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL-PSTQDAGASGSLILGG 305
           GCG  N G F    SG++GLG    SLVSQ  +   G FSYCL P T + G +  +  G 
Sbjct: 200 GCGHENTGTFDPAGSGIIGLGGGSTSLVSQLRKSINGKFSYCLVPFTSETGLTSKINFGT 259

Query: 306 NSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQAS----GFAKGGILIDSGT 361
           N  V  +   +  T+M+     AT+Y LNL  IS+G K++Q +    G  +G I+IDSGT
Sbjct: 260 NGIVSGDG--VVSTSMV-KKDPATYYFLNLEAISVGSKKIQFTSTIFGTGEGNIVIDSGT 316

Query: 362 VITRLPPSIY--------SALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKM 413
            +T LP + Y        S +KAE ++   G        IL  C+  S+     +P + +
Sbjct: 317 TLTLLPSNFYYELESVVASTIKAERVQDPDG--------ILSLCYRDSS--SFKVPDITV 366

Query: 414 EFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQ 473
            F+G     V +  +  FV       C A A+    ++  I GN  Q N  V YDT +  
Sbjct: 367 HFKGG---DVKLGNLNTFVAVSEDVSCFAFAA---NEQLTIFGNLAQMNFLVGYDTVSGT 420

Query: 474 LGFAGEDCSSM 484
           + F   DCS M
Sbjct: 421 VSFKKTDCSQM 431


>gi|409179880|gb|AFV26025.1| aspartic proteinase nepenthesin 2 [Nepenthes mirabilis]
          Length = 437

 Score =  179 bits (453), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 128/390 (32%), Positives = 194/390 (49%), Gaps = 38/390 (9%)

Query: 105 RIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQ 162
           R    I+  ++  S  E P+ +G    +  Y+  + +G    +++ I+DTGSDL W QC+
Sbjct: 70  RRMRSINAMLQSSSGIETPVYAG----SGEYLMNVAIGTPASSLSAIMDTGSDLIWTQCE 125

Query: 163 PCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGD 222
           PC  C++Q  P+F+P  S S+  + C S  C  L          S S   DC Y   YGD
Sbjct: 126 PCTQCFSQPTPIFNPQDSSSFSTLPCESQYCQDLP---------SESCYNDCQYTYGYGD 176

Query: 223 GSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLF-GGVSGLMGLGRSDLSLVSQTSEIF 281
           GS T+G +  E      +SV +  FGCG +N+G   G  +GL+G+G   LSL SQ   + 
Sbjct: 177 GSSTQGYMATETFTFETSSVPNIAFGCGEDNQGFGQGNGAGLIGMGWGPLSLPSQ---LG 233

Query: 282 GGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIG 341
            G FSYC+ ++  + +  +L LG  +S     +P   T +I +    T+Y + L GI++G
Sbjct: 234 VGQFSYCM-TSSGSSSPSTLALGSAASGVPEGSP--STTLIHSSLNPTYYYITLQGITVG 290

Query: 342 GK---------QLQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFS 392
           G          QLQ  G   GG++IDSGT +T LP   Y+A+   F  Q +  P     S
Sbjct: 291 GDNLGIPSSTFQLQDDG--TGGMIIDSGTTLTYLPQDAYNAVAQAFTDQINLSPVDESSS 348

Query: 393 ILDTCFNL-SAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDE 451
            L TCF L S    V +P + M+F+G     +++      +      +CLA+ S S +  
Sbjct: 349 GLSTCFQLPSDGSTVQVPEISMQFDGGV---LNLGEENVLISPAEGVICLAMGS-SSQQG 404

Query: 452 TGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
             I GN QQ+  +V+YD +N  + F    C
Sbjct: 405 ISIFGNIQQQETQVLYDLQNLAVSFVPTQC 434


>gi|449499012|ref|XP_004160696.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 435

 Score =  178 bits (452), Expect = 5e-42,   Method: Compositional matrix adjust.
 Identities = 140/435 (32%), Positives = 204/435 (46%), Gaps = 48/435 (11%)

Query: 68  TLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNT-EIPLTS 126
           T+EL H++  S K   +N  +         H   +   ++  IS N   V+NT E P+ +
Sbjct: 31  TVELIHRD--SPKSPMYNPLEN--------HYHRVADTLRRSISHNTGLVTNTVEAPIYN 80

Query: 127 GIRLQTLNYIATIELGGRNMTVIV--DTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYK 184
                   Y+  + +G     +I   DTGSD+ W QC PC +CY Q  P+F+PS S +Y+
Sbjct: 81  ----NRGEYLMKLSVGTPPFPIIAVADTGSDIIWTQCVPCTNCYQQDLPMFNPSKSTTYR 136

Query: 185 KVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVND 244
           KV C+S  C      TG    CS    PDC Y +SYGD S+++G+   + L +G  S   
Sbjct: 137 KVSCSSPVCS----FTGEDNSCSFK--PDCTYSISYGDNSHSQGDFAVDTLTMGSTSGRV 190

Query: 245 FIF-----GCGRNNKGLF-GGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL-PSTQDAGA 297
             F     GCG +N G F   VSG++GLG    SL+ Q     GG FSYCL P   D G 
Sbjct: 191 VAFPRTAIGCGHDNAGSFDANVSGIVGLGLGPASLIKQMGSAVGGKFSYCLTPIGNDDGG 250

Query: 298 SGSLILGGNSSVFKN---STPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGF---- 350
           S  L  G N++V  +   STPI  ++     +  +FY L L  +S+G      S      
Sbjct: 251 SNKLNFGSNANVSGSGAVSTPIYISD-----KFKSFYSLKLKAVSVGRNNTFYSTANSIL 305

Query: 351 -AKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIP 409
             K  I+IDSGT +T LP  +Y           +   +      L+ CF  +   +  +P
Sbjct: 306 GGKANIIIDSGTTLTLLPVDLYHNFAKAISNSINLQRTDDPNQFLEYCFETTT-DDYKVP 364

Query: 410 LVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDT 469
            + M FEG A + +    ++  V  +   +CLA A  + +++  I GN  Q N  V YD 
Sbjct: 365 FIAMHFEG-ANLRLQRENVLIRVSDNV--ICLAFAG-AQDNDISIYGNIAQINFLVGYDV 420

Query: 470 KNSQLGFAGEDCSSM 484
            N  L F   +C +M
Sbjct: 421 TNMSLSFKPMNCVAM 435


>gi|297843774|ref|XP_002889768.1| hypothetical protein ARALYDRAFT_471076 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297335610|gb|EFH66027.1| hypothetical protein ARALYDRAFT_471076 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 449

 Score =  178 bits (451), Expect = 6e-42,   Method: Compositional matrix adjust.
 Identities = 139/443 (31%), Positives = 212/443 (47%), Gaps = 48/443 (10%)

Query: 56  SHQKSRIEMGAITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIK 115
           SH  S I + A      H  + S  ++D      +    D+    YL S    +++G  K
Sbjct: 37  SHDLSIIPINAKCSPFAHT-HVSASVIDTVLHMASS---DSHRFTYLSS----LVAGKSK 88

Query: 116 DVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDP 173
               T +P+ SG +L   NY+    LG   + M +++DT +D  W+ C  C  C N    
Sbjct: 89  P---TSVPVASGNQLHIGNYVVRARLGTPPQLMFMVLDTSNDAVWLPCSGCSGCSNASTS 145

Query: 174 VFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGRE 233
               S S +Y  V C+++ C     A G +   S+  P  C++  SYG  S     L ++
Sbjct: 146 FNTNSSS-TYSTVSCSTTQCTQ---ARGLTCPSSTPQPSICSFNQSYGGDSSFSANLVQD 201

Query: 234 HLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQ 293
            L L    + +F FGC  +  G      GLMGLGR  +SLVSQT+ ++ G+FSYCLPS +
Sbjct: 202 TLTLSPDVIPNFSFGCINSASGNSLPPQGLMGLGRGPMSLVSQTTSLYSGVFSYCLPSFR 261

Query: 294 DAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGF--- 350
               SGSL LG    +      I YT ++ NP+  + Y +NLTG+S+G  Q+        
Sbjct: 262 SFYFSGSLKLG----LLGQPKSIRYTPLLRNPRRPSLYYVNLTGVSVGSVQVPVDPVYLT 317

Query: 351 ----AKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSIL---DTCF---NL 400
               +  G +IDSGTVITR    +Y A++ EF KQ +G      FS L   DTCF   N 
Sbjct: 318 FDSNSGAGTIIDSGTVITRFAQPVYEAIRDEFRKQVNG-----SFSTLGAFDTCFSADNE 372

Query: 401 SAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDET--GIIGNY 458
           +   ++ + +  ++ +   E T+          S  +  CL++A +         +I N 
Sbjct: 373 NVTPKITLHMTSLDLKLPMENTL-------IHSSAGTLTCLSMAGIRQNANAVLNVIANL 425

Query: 459 QQKNQRVIYDTKNSQLGFAGEDC 481
           QQ+N R+++D  NS++G A E C
Sbjct: 426 QQQNLRILFDVPNSRIGIAPEPC 448


>gi|449462551|ref|XP_004149004.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
 gi|449515029|ref|XP_004164552.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 434

 Score =  178 bits (451), Expect = 6e-42,   Method: Compositional matrix adjust.
 Identities = 131/358 (36%), Positives = 186/358 (51%), Gaps = 45/358 (12%)

Query: 145 NMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSG 204
           N+  I DTGSDLTW QC PC+ C+NQ  P+F+P  S SY+KV C S TC +LE       
Sbjct: 102 NVIAIADTGSDLTWTQCLPCRECFNQSQPIFNPRRSSSYRKVSCASDTCRSLE------- 154

Query: 205 VCSSSSPPD---CNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVS 261
             S    PD   C+Y  SYGD S+T G+L  + + +G   +   + GCG  N G FGGV+
Sbjct: 155 --SYHCGPDLQSCSYGYSYGDRSFTYGDLASDQITIGSFKLPKTVIGCGHQNGGTFGGVT 212

Query: 262 -GLMGLGRSDLSLVSQTSEIFG--GLFSYCLPS-TQDAGASGSLILGGNSSVFKNSTPIT 317
            G++GLG   LSLVSQ   I G    FSYCLP+   +A  +G++  G  + V  +   + 
Sbjct: 213 SGIIGLGGGSLSLVSQMRTIAGVKPRFSYCLPTFFSNANITGTISFGRKAVV--SGRQVV 270

Query: 318 YTNMIPNPQLATFYILNLTGISIGGKQLQAS-GFA----KGGILIDSGTVITRLPPSIY- 371
            T ++P     TFY L L  IS+G K+ +A+ G +     G I+IDSGT +T LP S+Y 
Sbjct: 271 STPLVPR-SPDTFYFLTLEAISVGKKRFKAANGISAMTNHGNIIIDSGTTLTLLPRSLYY 329

Query: 372 -------SALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVD 424
                    +KA+ +   SG        IL+ C++     ++NIP++   F G A+  V 
Sbjct: 330 GVFSTLARVIKAKRVDDPSG--------ILELCYSAGQVDDLNIPIITAHFAGGAD--VK 379

Query: 425 VTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
           +  +  F     +  CL  A  +   +  I GN  Q N  V YD  N +L F  + C+
Sbjct: 380 LLPVNTFAPVADNVTCLTFAPAT---QVAIFGNLAQINFEVGYDLGNKRLSFEPKLCA 434


>gi|297805036|ref|XP_002870402.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297316238|gb|EFH46661.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 435

 Score =  178 bits (451), Expect = 7e-42,   Method: Compositional matrix adjust.
 Identities = 151/449 (33%), Positives = 213/449 (47%), Gaps = 44/449 (9%)

Query: 51  SSSCVSHQKSRIEMGAITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMI 110
           SS  +S+  ++ ++G  T +L H++       +  E    R I + +H  +  +R+ +  
Sbjct: 16  SSHILSNVNAKPKLG-FTTDLIHRDSPKSPFYNPAETPSQR-IRNAIHRSF--NRVSHFT 71

Query: 111 SGNIKDVS----NTEIPLTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPC 164
             +  D S     T+I    G       Y+  + LG     +  + DTGS+L W QC+PC
Sbjct: 72  DLSEMDASLNSPQTDITPCGG------EYLMNLSLGTPPSPIMAVADTGSNLIWTQCKPC 125

Query: 165 KSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGS 224
             CY Q DP+FDP  S +YK V C+SS C ALE    N   CS+     C+Y VSY DGS
Sbjct: 126 DDCYTQVDPLFDPKASSTYKDVSCSSSQCTALE----NQASCSTED-KTCSYLVSYADGS 180

Query: 225 YTRGELGREHLGLGKAS-----VNDFIFGCGRNNKGLF-GGVSGLMGLGRSDLSLVSQTS 278
           YT G+   + L LG        + + I GCG+NN   F    SG++GLG   +SL+ Q  
Sbjct: 181 YTMGKFAVDTLTLGSTDNRPVQLKNIIIGCGQNNAVTFRNKSSGVVGLGGGAVSLIKQLG 240

Query: 279 EIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGI 338
           +   G FSYCL    D   +  +  G N+ V   S P T +  +      TFY L L  I
Sbjct: 241 DSIDGKFSYCLVPEND--QTSKINFGTNAVV---SGPGTVSTPLVVKSRDTFYYLTLKSI 295

Query: 339 SIGGKQLQA-SGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTC 397
           S+G K +Q      KG ++IDSGT +T LP   Y  ++       +   S         C
Sbjct: 296 SVGSKNMQTPDSNIKGNMVIDSGTTLTLLPVKYYIEIENAVASLINADKSKDERIGSSLC 355

Query: 398 FNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVY--FVKSDASQVCLALASLSYEDETGII 455
           +N +A  ++NIP++ M FEG      DV    Y  F K     VCLA     Y +  GI 
Sbjct: 356 YNATA--DLNIPVITMHFEG-----ADVKLYPYNSFFKVTEDLVCLAFGMSFYRN--GIY 406

Query: 456 GNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
           GN  QKN  V YDT +  + F   DC+ M
Sbjct: 407 GNVAQKNFLVGYDTASKTMSFKPTDCAKM 435


>gi|224130878|ref|XP_002320947.1| predicted protein [Populus trichocarpa]
 gi|222861720|gb|EEE99262.1| predicted protein [Populus trichocarpa]
          Length = 440

 Score =  178 bits (451), Expect = 7e-42,   Method: Compositional matrix adjust.
 Identities = 123/362 (33%), Positives = 178/362 (49%), Gaps = 32/362 (8%)

Query: 135 YIATIELGGRNMTV--IVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
           Y+ ++ LG     +  I DTGSDL W QC+PC+ CY Q DP+FDP  S +Y+   C++  
Sbjct: 95  YLMSLSLGTPPFKIMGIADTGSDLIWTQCKPCERCYKQVDPLFDPKSSKTYRDFSCDARQ 154

Query: 193 CHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL-----GKASVNDFIF 247
           C  L+ +T +  +        C Y  SYGD SYT G +  + + L        S    + 
Sbjct: 155 CSLLDQSTCSGNI--------CQYQYSYGDRSYTMGNVASDTITLDSTTGSPVSFPKTVI 206

Query: 248 GCGRNNKGLFGGV-SGLMGLGRSDLSLVSQTSEIFGGLFSYCL-PSTQDAGASGSLILGG 305
           GCG  N G F    SG++GLG   LSL+SQ     GG FSYCL P +  AG S  L  G 
Sbjct: 207 GCGHENDGTFSDKGSGIVGLGAGPLSLISQMGSSVGGKFSYCLVPLSSRAGNSSKLNFGS 266

Query: 306 NSSVFKNSTP-ITYTNMIPNPQLATFYILNLTGISIGGKQLQ----ASGFAKGGILIDSG 360
           N+ V   S P +  T ++ +  +++FY L L  +S+G ++++    + G  +G I+IDSG
Sbjct: 267 NAVV---SGPGVQSTPLLSSETMSSFYFLTLEAMSVGNERIKFGDSSLGTGEGNIIIDSG 323

Query: 361 TVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAE 420
           T +T +P   +S L      Q  G  +      L  C+  SA  ++ +P +   F G   
Sbjct: 324 TTLTIVPDDFFSNLSTAVGNQVEGRRAEDPSGFLSVCY--SATSDLKVPAITAHFTG--- 378

Query: 421 MTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGED 480
             V +  I  FV+     VCLA AS +      I GN  Q N  V Y+ +   L F   D
Sbjct: 379 ADVKLKPINTFVQVSDDVVCLAFASTT--SGISIYGNVAQMNFLVEYNIQGKSLSFKPTD 436

Query: 481 CS 482
           C+
Sbjct: 437 CT 438


>gi|357118871|ref|XP_003561171.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 506

 Score =  177 bits (450), Expect = 7e-42,   Method: Compositional matrix adjust.
 Identities = 141/463 (30%), Positives = 212/463 (45%), Gaps = 50/463 (10%)

Query: 48  SGSSSSCVSHQKSRIEMGAITLELKHKNYCSGKIVDWNEQQQ---------NRLILDNLH 98
           SG S + +SH  S     A   +           + W+E +          N   +D+  
Sbjct: 65  SGGSWAPLSHLHSPCSPAAGGRDSAPPPKTLSATLQWDEHRAGHIQRKLSGNAAPMDDAG 124

Query: 99  VQYLQS-RIKNMISGNIK-DVSNTEIPLTSGIRLQTLNYIATIELGGRNMTVIVDTGSDL 156
            +  QS ++ +  + N+    S+T+     GI           +L G   +++VDT SD+
Sbjct: 125 EETPQSTQVTSSPAANVNVGKSSTDSAFEQGIVPAATGPGGQKKLPGVAQSMVVDTASDV 184

Query: 157 TWVQCQPCKS--CYNQQDPVFDPSISPSYKKVLCNSSTCHAL-EFATGNSGVCSSSSPPD 213
            WVQC PC    CY Q D ++DP+ S       C+S  C +L  +A G +G  ++ +   
Sbjct: 185 PWVQCAPCPQPQCYAQSDVLYDPTKSILSAPFPCSSPQCRSLGRYANGCTGAGNTGT--- 241

Query: 214 CNYFVSYGDGSYTRGELGREHLGLG---KASVNDFIFGCGR--------NNKGLFGGVSG 262
           C Y V Y DGS T G    + L L    K +V+ F FGC          NNK      +G
Sbjct: 242 CQYRVLYPDGSGTSGTYVSDLLTLNADPKGAVSKFQFGCSHALLRPGSFNNK-----TAG 296

Query: 263 LMGLGRSDLSLVSQTSEIF--GGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTN 320
            M LGR   SL SQT   F  G +FSYCLP T       SL +  +++     TP+  + 
Sbjct: 297 FMALGRGAQSLSSQTKGTFSKGNVFSYCLPPTGSHKGFLSLGVPQHAASRYAVTPMLKSK 356

Query: 321 MIPNPQLATFYILNLTGISIGGKQLQASG--FAKGGILIDSGTVITRLPPSIYSALKAEF 378
           M P       Y++ L GI + G++L      FA    + DS T+ITRLPP+ Y AL+A F
Sbjct: 357 MAP-----MIYMVRLIGIDVAGQRLPVPPAVFAANAAM-DSRTIITRLPPTAYMALRAAF 410

Query: 379 LKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQ 438
             Q   + +      LDTC++ +    V +P V + F+ NA + +D +G++         
Sbjct: 411 RAQMRAYRAVAPKGQLDTCYDFTGVPMVRLPKVTLVFDRNAAVELDPSGVML-------D 463

Query: 439 VCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
            CLA A  + +   GIIGN QQ+   V+Y+   + +GF    C
Sbjct: 464 SCLAFAPNANDFMPGIIGNVQQQTLEVLYNVDGASVGFRRAAC 506


>gi|242045564|ref|XP_002460653.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
 gi|241924030|gb|EER97174.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
          Length = 525

 Score =  177 bits (450), Expect = 7e-42,   Method: Compositional matrix adjust.
 Identities = 140/460 (30%), Positives = 215/460 (46%), Gaps = 51/460 (11%)

Query: 66  AITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLT 125
           ++ L L H+    G+  +  E   +    D + ++ +  R      G +   S+    L+
Sbjct: 76  SLKLRLNHRAAEGGRTRE--ESLLDLAEKDAVRIETMYRRAARSGGGRMPASSSPRRALS 133

Query: 126 --------SGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVF 175
                   SG+ + +  Y+  + +G   R   +I+DTGSDL W+QC PC  C+ Q+ PVF
Sbjct: 134 ERMVATVESGVAVGSGEYLMDVYVGTPPRRFRMIMDTGSDLNWLQCAPCLDCFEQRGPVF 193

Query: 176 DPSISPSYKKVLCNSSTCHALEFATGNSG----VCSSSSPPDCNYFVSYGDGSYTRGELG 231
           DP+ S SY+ V C    C  +             C       C Y+  YGD S T G+L 
Sbjct: 194 DPAASSSYRNVTCGDHRCGHVAPPPEPEASSPRTCRRPGEDPCPYYYWYGDQSNTTGDLA 253

Query: 232 REHLGL------GKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLF 285
            E   +          V+  +FGCG  N+GLF G +GL+GLGR  LS  SQ   ++G  F
Sbjct: 254 LESFTVNLTAPGASRRVDGVVFGCGHRNRGLFHGAAGLLGLGRGPLSFASQLRAVYGHTF 313

Query: 286 SYCLPSTQDAGAS-GSLILGG---NSSVFKNSTPITYTNMIPNPQLA----TFYILNLTG 337
           SYCL    D G+  GS ++ G   ++        + YT   P    +    TFY + L G
Sbjct: 314 SYCL---VDHGSDVGSKVVFGEDDDALALAAHPQLKYTAFAPASSSSSPADTFYYVKLKG 370

Query: 338 ISIGGKQLQASG-------FAKGGILIDSGTVITRLPPSIYSALKAEFLKQFS-GFPSAP 389
           + +GG+ L  S           GG +IDSGT ++      Y  ++  F+ + S  +P  P
Sbjct: 371 VLVGGELLNISSDTWDVGKDGSGGTIIDSGTTLSYFVEPAYQVIRHAFMDRMSRSYPLVP 430

Query: 390 GFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSD---ASQVCLALASL 446
            F +L  C+N+S  +   +P + + F   A    D     YF++ D    S +CLA+   
Sbjct: 431 EFPVLSPCYNVSGVERPEVPELSLLFADGA--VWDFPAENYFIRLDPDGGSIMCLAVLGT 488

Query: 447 SYEDETG--IIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
                TG  IIGN+QQ+N  V+YD +N++LGFA   C+ +
Sbjct: 489 P---RTGMSIIGNFQQQNFHVVYDLQNNRLGFAPRRCAEV 525


>gi|116790042|gb|ABK25480.1| unknown [Picea sitchensis]
          Length = 460

 Score =  177 bits (449), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 139/398 (34%), Positives = 209/398 (52%), Gaps = 41/398 (10%)

Query: 103 QSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELGGRNMT--VIVDTGSDLTWVQ 160
           Q R++ +   ++ +V   E P+ +G       ++  + +G  +++   I+DTGSDLTW Q
Sbjct: 88  QDRLEKL-QMSVDEVKAVEAPVYAG----NGEFLMKMAIGTPSLSFSAILDTGSDLTWTQ 142

Query: 161 CQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSY 220
           C+PC  CY Q  P++DPS S +Y KV C+SS C AL   +     CS +   +C Y  SY
Sbjct: 143 CKPCTDCYPQPTPIYDPSQSSTYSKVPCSSSMCQALPMYS-----CSGA---NCEYLYSY 194

Query: 221 GDGSYTRGELGREHLGLGKASVNDFIFGCGRNNK-GLFGGVSGLMGLGRSDLSLVSQTSE 279
           GD S T+G L  E   L   S+    FGCG+ N+ G F    GL+G GR  LSL+SQ  +
Sbjct: 195 GDQSSTQGILSYESFTLTSQSLPHIAFGCGQENEGGGFSQGGGLVGFGRGPLSLISQLGQ 254

Query: 280 IFGGLFSYCLPSTQDAGASGS-LILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGI 338
             G  FSYCL S  D+ +  S L +G  +S+  N+  ++ T ++ +    TFY L+L GI
Sbjct: 255 SLGNKFSYCLVSITDSPSKTSPLFIGKTASL--NAKTVSSTPLVQSRSRPTFYYLSLEGI 312

Query: 339 SIGGK---------QLQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAP 389
           S+GG+          LQ  G   GG++IDSGT +T L  S Y  +K   +   +  P   
Sbjct: 313 SVGGQLLDIADGTFDLQLDG--TGGVIIDSGTTVTYLEQSGYDVVKKAVISSIN-LPQVD 369

Query: 390 GFSI-LDTCFN-LSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV-CLALASL 446
           G +I LD CF   S     + P +   FEG A+  +     +Y   +D+S + CLA+   
Sbjct: 370 GSNIGLDLCFEPQSGSSTSHFPTITFHFEG-ADFNLPKENYIY---TDSSGIACLAMLP- 424

Query: 447 SYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
              +   I GN QQ+N +++YD + + L FA   C ++
Sbjct: 425 --SNGMSIFGNIQQQNYQILYDNERNVLSFAPTVCDTL 460


>gi|293335828|ref|NP_001170221.1| uncharacterized protein LOC100384173 precursor [Zea mays]
 gi|224034427|gb|ACN36289.1| unknown [Zea mays]
          Length = 443

 Score =  177 bits (448), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 135/364 (37%), Positives = 179/364 (49%), Gaps = 44/364 (12%)

Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNS 203
           R  + I+DTGSDL W QC PC  C +Q  P FDP+ S +Y+ + C S  C+AL +     
Sbjct: 101 RYYSAILDTGSDLIWTQCAPCLLCVDQPTPYFDPARSATYRSLGCASPACNALYYPLCYQ 160

Query: 204 GVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG----KASVNDFIFGCGRNNKGLFGG 259
            VC         YF  YGD + T G L  E    G    + S+    FGCG  N GL   
Sbjct: 161 KVCVY------QYF--YGDSASTAGVLANETFTFGTNETRVSLPGISFGCGNLNAGLLAN 212

Query: 260 VSGLMGLGRSDLSLVSQTSEIFGGLFSYCL-------PSTQDAGASGSLILGGNSSVFKN 312
            SG++G GR  LSLVSQ        FSYCL       PS    G   +L     +S   +
Sbjct: 213 GSGMVGFGRGSLSLVSQLGS---PRFSYCLTSFLSPVPSRLYFGVYATL-----NSTNAS 264

Query: 313 STPITYTNMIPNPQLATFYILNLTGISIGGKQL--QASGFA------KGGILIDSGTVIT 364
           S P+  T  + NP L T Y LN+TGIS+GG  L    + FA       GG +IDSGT IT
Sbjct: 265 SEPVQSTPFVVNPALPTMYFLNMTGISVGGYLLPIDPAVFAINDTDGTGGTIIDSGTTIT 324

Query: 365 RLPPSIYSALKAEFLKQFSGFP--SAPGFSILDTCFNL--SAYQEVNIPLVKMEFEGNAE 420
            L    Y A++A F  Q +  P  +    S+LDTCF       Q V +P + + F+G A+
Sbjct: 325 YLAEPAYDAVRAAFASQIT-LPLLNVTDASVLDTCFQWPPPPRQSVTLPQLVLHFDG-AD 382

Query: 421 MTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGED 480
             + +   +    S    +CLA+AS S   +  IIG+YQ +N  V+YD +NS + F    
Sbjct: 383 WELPLQNYMLVDPSTGGGLCLAMASSS---DGSIIGSYQHQNFNVLYDLENSLMSFVPAP 439

Query: 481 CSSM 484
           C  M
Sbjct: 440 CHLM 443


>gi|242084336|ref|XP_002442593.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
 gi|241943286|gb|EES16431.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
          Length = 482

 Score =  177 bits (448), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 123/342 (35%), Positives = 177/342 (51%), Gaps = 25/342 (7%)

Query: 151 DTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSS 210
           D GSD+TW+QC PC  CY+Q  PV++   S S   V C +  C AL    G+SG C    
Sbjct: 148 DMGSDVTWLQCMPCFRCYHQPGPVYNRLKSSSASDVGCYAPACRAL----GSSGGCVQFL 203

Query: 211 PPDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCGRNNKGLF-GGVSGLMGLGR 268
             +C Y V YGDGS + G+ G E L       V     GCG +N+GLF    +G++GLGR
Sbjct: 204 -NECQYKVEYGDGSSSAGDFGVETLTFPPGVRVPGVAIGCGSDNQGLFPAPAAGILGLGR 262

Query: 269 SDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILG-GNSSVFKNSTPITYTNMIPNPQL 327
             LS  SQ +  +G  FSYCL      G S +L  G G S+    +TP ++T M+ N ++
Sbjct: 263 GSLSFPSQIAGRYGRSFSYCLAGQGTGGRSSTLTFGSGASATTTTTTPPSFTPMLTNSRM 322

Query: 328 ATFYILNLTGISIGGKQLQA---------SGFAKGGILIDSGTVITRLPPSIYSALKAEF 378
            TFY + L GIS+GG +++               GG+++DSGT +TRL    Y+A +  F
Sbjct: 323 YTFYYVGLVGISVGGVRVRGVTESDLRLDPSTGHGGVIVDSGTAVTRLSGPAYAAFRDAF 382

Query: 379 ----LKQFSGFPSAPG-FSILDTCF-NLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFV 432
               +K+  G+PS  G F+  DTC+ ++       +P V M F G  E+ +     +  V
Sbjct: 383 RVAAVKEL-GWPSPGGPFAFFDTCYSSVRGRVMKKVPAVSMHFAGGVEVKLPPQNYLIPV 441

Query: 433 KSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQL 474
            S+   +C A A  S +    IIGN Q +  RV+YD    ++
Sbjct: 442 DSNKGTMCFAFAG-SGDRGVSIIGNIQLQGFRVVYDVDGQRV 482


>gi|115467014|ref|NP_001057106.1| Os06g0209100 [Oryza sativa Japonica Group]
 gi|51091210|dbj|BAD35903.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
 gi|113595146|dbj|BAF19020.1| Os06g0209100 [Oryza sativa Japonica Group]
 gi|125554496|gb|EAZ00102.1| hypothetical protein OsI_22105 [Oryza sativa Indica Group]
          Length = 454

 Score =  177 bits (448), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 126/384 (32%), Positives = 187/384 (48%), Gaps = 35/384 (9%)

Query: 124 LTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQ-QDPVFDPSIS 180
           L +G  + T  Y+  + +G   R + + +DTGSDL W QC PC  C+ Q   PV DP+ S
Sbjct: 79  LGAGGGIVTNEYLMHVSVGTPPRPVALTLDTGSDLVWTQCAPCLDCFEQGAAPVLDPAAS 138

Query: 181 PSYKKVLCNSSTCHALEFATGNSGVCSSSSPPD--CNYFVSYGDGSYTRGELGREHLGLG 238
            ++  + C++  C AL F +     C   S  D  C Y   YGD S T G+L  +    G
Sbjct: 139 STHAALPCDAPLCRALPFTS-----CGGRSWGDRSCVYVYHYGDRSLTVGQLATDSFTFG 193

Query: 239 K------ASVNDFIFGCGRNNKGLF-GGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPS 291
                   +     FGCG  NKG+F    +G+ G GR   SL SQ +      FSYC  S
Sbjct: 194 GDDNAGGLAARRVTFGCGHINKGIFQANETGIAGFGRGRWSLPSQLNVTS---FSYCFTS 250

Query: 292 TQDAGASGSLILGGNSSVF------KNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL 345
             D  +S  + LG  ++         ++  +  T +I NP   + Y + L GIS+GG ++
Sbjct: 251 MFDTKSSSVVTLGAAAAELLHTHHAAHTGDVRTTRLIKNPSQPSLYFVPLRGISVGGARV 310

Query: 346 QA-SGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFP-SAPGFSILDTCFNL--- 400
                  +   +IDSG  IT LP  +Y A+KAEF+ Q  G P +A G + LD CF L   
Sbjct: 311 AVPESRLRSSTIIDSGASITTLPEDVYEAVKAEFVSQV-GLPAAAAGSAALDLCFALPVA 369

Query: 401 SAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQ 460
           + ++   +P + +  +G A+  +   G   F    A  +C+ L + + E    +IGNYQQ
Sbjct: 370 ALWRRPAVPALTLHLDGGADWELP-RGNYVFEDYAARVLCVVLDAAAGEQV--VIGNYQQ 426

Query: 461 KNQRVIYDTKNSQLGFAGEDCSSM 484
           +N  V+YD +N  L FA   C  +
Sbjct: 427 QNTHVVYDLENDVLSFAPARCDKL 450


>gi|302757235|ref|XP_002962041.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
 gi|300170700|gb|EFJ37301.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
          Length = 367

 Score =  177 bits (448), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 135/380 (35%), Positives = 190/380 (50%), Gaps = 49/380 (12%)

Query: 135 YIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
           Y   IELG   +    IVDTGSDL W+QC+PC  CY+Q DP++DPS S ++ K  C++S+
Sbjct: 4   YTMEIELGSPPKKFNAIVDTGSDLVWIQCKPCSQCYSQSDPIYDPSASSTFAKTSCSTSS 63

Query: 193 CHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA-----SVNDFIF 247
           C +L  A+G    CSSS+   C Y   YGD S T+G+   E L L  +     +  +F F
Sbjct: 64  CQSLP-ASG----CSSSA-KTCIYGYQYGDSSSTQGDFALETLTLRSSGGSSKAFPNFQF 117

Query: 248 GCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPS-TQDAGASGSLILGGN 306
           GCGR N G FGG +G++GLG+  +SL +Q        FSYCL     D+  +  LI G +
Sbjct: 118 GCGRLNSGSFGGAAGIVGLGQGKISLSTQLGSAINNKFSYCLVDFDDDSSKTSPLIFGSS 177

Query: 307 SSVFKN--STPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA------------- 351
           +S      STPI     IPN   +T+Y + L GIS+GGKQL  +  A             
Sbjct: 178 ASTGSGAISTPI-----IPNSGRSTYYFVGLEGISVGGKQLSLATRAIDFLSVRSKKKLR 232

Query: 352 -------KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI-LDTCFNLSAY 403
                   GG + DSGT +T L  ++YS +K+ F    S  P+    S   D C+++S  
Sbjct: 233 VRALEVNSGGTIFDSGTTLTLLDDAVYSKVKSAFASSVS-LPTVDASSSGFDLCYDVSKS 291

Query: 404 QEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQ--VCLALASLSYEDETGIIGNYQQK 461
           +    P + + F+G            YFV  D ++   CLA+          IIGN  Q+
Sbjct: 292 KNFKFPALTLAFKGTKFSPPQKN---YFVIVDTAETVACLAMGGSGSLGLG-IIGNLMQQ 347

Query: 462 NQRVIYDTKNSQLGFAGEDC 481
           N  V+YD   S +  +   C
Sbjct: 348 NYHVVYDRGTSTISMSPAQC 367


>gi|449515033|ref|XP_004164554.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 430

 Score =  176 bits (446), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 132/369 (35%), Positives = 190/369 (51%), Gaps = 27/369 (7%)

Query: 121 EIPLTSGIRLQTLNYIATIELGGRNMTVI--VDTGSDLTWVQCQPCKSCYNQQDPVFDPS 178
           + PLT G    +  Y+ ++ +G   +  I   DTGSDL W QC PC  CY Q  P+FDP 
Sbjct: 82  QAPLTPG----SGEYLMSVSIGTPPVDYIGMADTGSDLMWAQCLPCLKCYKQSRPIFDPL 137

Query: 179 ISPSYKKVLCNSSTCHALEFA-TGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL 237
            S S+  V CNS  C A++ +  G  GV        C+Y  +YGD +YT+G+LG E + +
Sbjct: 138 KSTSFSHVPCNSQNCKAIDDSHCGAQGV--------CDYSYTYGDQTYTKGDLGFEKITI 189

Query: 238 GKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFG--GLFSYCLPSTQDA 295
           G +SV   I GCG  + G FG  SG++GLG   LSLVSQ S+  G    FSYCLP T  +
Sbjct: 190 GSSSVKSVI-GCGHESGGGFGFASGVIGLGGGQLSLVSQMSQTSGISRRFSYCLP-TLLS 247

Query: 296 GASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGGI 355
            A+G +  G N+ V   S P   +  + +    T+Y + L  ISIG ++  AS   +G +
Sbjct: 248 HANGKINFGQNAVV---SGPGVVSTPLISKNPVTYYYVTLEAISIGNERHMASA-KQGNV 303

Query: 356 LIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFN--LSAYQEVNIPLVKM 413
           +IDSGT ++ LP  +Y  + +  LK           +  D CF+  ++      IP++  
Sbjct: 304 IIDSGTTLSFLPKELYDGVVSSLLKVVKAKRVKDPGNFWDLCFDDGINVATSSGIPIITA 363

Query: 414 EFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQ 473
           +F G A   V++  +  F K   +  CL L   S  DE GIIGN    N  + YD +  +
Sbjct: 364 QFSGGAN--VNLLPVNTFQKVANNVNCLTLTPASPTDEFGIIGNLALANFLIGYDLEAKR 421

Query: 474 LGFAGEDCS 482
           L F    C+
Sbjct: 422 LSFKPTVCT 430


>gi|356499344|ref|XP_003518501.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 561

 Score =  176 bits (446), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 120/382 (31%), Positives = 190/382 (49%), Gaps = 27/382 (7%)

Query: 124 LTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISP 181
           L SG+ L +  Y   + +G   ++ ++I+DTGSDL W+QC PC +C+ Q  P +DP  S 
Sbjct: 186 LESGVSLGSGEYFMDVFVGTPPKHFSLILDTGSDLNWIQCVPCIACFEQSGPYYDPKDSS 245

Query: 182 SYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS 241
           S++ + C+   C  +  A      C + +   C YF  YGDGS T G+   E   +   +
Sbjct: 246 SFRNISCHDPRCQLVS-APDPPKPCKAEN-QSCPYFYWYGDGSNTTGDFALETFTVNLTT 303

Query: 242 ---------VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL-PS 291
                    V + +FGCG  N+GLF G +GL+GLG+  LS  SQ   ++G  FSYCL   
Sbjct: 304 PNGTSELKHVENVMFGCGHWNRGLFHGAAGLLGLGKGPLSFASQMQSLYGQSFSYCLVDR 363

Query: 292 TQDAGASGSLILGGNSSVFKNSTPITYTNM--IPNPQLATFYILNLTGISIGGKQLQA-- 347
             +A  S  LI G +  +  +   + +T+     +  + TFY + +  + +  + L+   
Sbjct: 364 NSNASVSSKLIFGEDKELLSHPN-LNFTSFGGGKDGSVDTFYYVQIKSVMVDDEVLKIPE 422

Query: 348 -----SGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSA 402
                S    GG +IDSGT +T      Y  +K  F+++  G+    G   L  C+N+S 
Sbjct: 423 ETWHLSSEGAGGTIIDSGTTLTYFAEPAYEIIKEAFVRKIKGYQLVEGLPPLKPCYNVSG 482

Query: 403 YQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKN 462
            +++ +P   + F   A     V    YF+  D   VCLA+   +      IIGNYQQ+N
Sbjct: 483 IEKMELPDFGILFADEAVWNFPVEN--YFIWIDPEVVCLAILG-NPRSALSIIGNYQQQN 539

Query: 463 QRVIYDTKNSQLGFAGEDCSSM 484
             ++YD K S+LG+A   C+ +
Sbjct: 540 FHILYDMKKSRLGYAPMKCADV 561


>gi|18391062|ref|NP_563851.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|2160166|gb|AAB60729.1| F21M12.13 gene product [Arabidopsis thaliana]
 gi|21593996|gb|AAM65914.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
 gi|26983826|gb|AAN86165.1| unknown protein [Arabidopsis thaliana]
 gi|332190367|gb|AEE28488.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 449

 Score =  176 bits (445), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 131/403 (32%), Positives = 200/403 (49%), Gaps = 41/403 (10%)

Query: 95  DNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDT 152
           D+  + YL S    +++G  K    T +P+ SG +L   NY+   +LG   + M +++DT
Sbjct: 71  DSHRLTYLSS----LVAGKPKP---TSVPVASGNQLHIGNYVVRAKLGTPPQLMFMVLDT 123

Query: 153 GSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPP 212
            +D  W+ C  C  C N        S S +Y  V C+++ C     A G +   SS  P 
Sbjct: 124 SNDAVWLPCSGCSGCSNASTSFNTNSSS-TYSTVSCSTAQCTQ---ARGLTCPSSSPQPS 179

Query: 213 DCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLS 272
            C++  SYG  S     L ++ L L    + +F FGC  +  G      GLMGLGR  +S
Sbjct: 180 VCSFNQSYGGDSSFSASLVQDTLTLAPDVIPNFSFGCINSASGNSLPPQGLMGLGRGPMS 239

Query: 273 LVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYI 332
           LVSQT+ ++ G+FSYCLPS +    SGSL LG    +      I YT ++ NP+  + Y 
Sbjct: 240 LVSQTTSLYSGVFSYCLPSFRSFYFSGSLKLG----LLGQPKSIRYTPLLRNPRRPSLYY 295

Query: 333 LNLTGISIGGKQLQASGF-------AKGGILIDSGTVITRLPPSIYSALKAEFLKQ--FS 383
           +NLTG+S+G  Q+            +  G +IDSGTVITR    +Y A++ EF KQ   S
Sbjct: 296 VNLTGVSVGSVQVPVDPVYLTFDANSGAGTIIDSGTVITRFAQPVYEAIRDEFRKQVNVS 355

Query: 384 GFPSAPGFSILDTCF---NLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVC 440
            F +   F   DTCF   N +   ++ + +  ++ +   E T+          S  +  C
Sbjct: 356 SFSTLGAF---DTCFSADNENVAPKITLHMTSLDLKLPMENTL-------IHSSAGTLTC 405

Query: 441 LALASLSYEDET--GIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
           L++A +         +I N QQ+N R+++D  NS++G A E C
Sbjct: 406 LSMAGIRQNANAVLNVIANLQQQNLRILFDVPNSRIGIAPEPC 448


>gi|225427552|ref|XP_002266498.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
          Length = 441

 Score =  176 bits (445), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 133/433 (30%), Positives = 201/433 (46%), Gaps = 38/433 (8%)

Query: 65  GAITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPL 124
           G  +++L H++       D ++ +  RL  D  H     SR+     G  +  + T   +
Sbjct: 30  GGFSVDLIHRDSPHSPFFDPSKTRTERLT-DAFHRS--ASRV-----GRFRQSAMTSDGI 81

Query: 125 TSGIRLQTLNYIATIELGGRNMTVI--VDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPS 182
            S +      YI  + +G   + VI  VDTGSDLTW QC+PC  CY Q  P FDP  S +
Sbjct: 82  QSRLVPSAGEYIMNLSIGTPPVPVIAIVDTGSDLTWTQCRPCTHCYKQVVPFFDPKNSST 141

Query: 183 YKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGK--- 239
           Y+   C +S C AL    GN   C +     C +  SY DGS+T G L  E L +     
Sbjct: 142 YRDSSCGTSFCLAL----GNDRSCRNGK--KCTFMYSYADGSFTGGNLAVETLTVASTAG 195

Query: 240 --ASVNDFIFGCGRNNKGLFG-GVSGLMGLGRSDLSLVSQTSEIFGGLFSYC-LPSTQDA 295
              S   F FGC   + G+F    SG++GLG ++LS++SQ      G FSYC LP   D+
Sbjct: 196 KPVSFPGFAFGCVHRSGGIFDEHSSGIVGLGVAELSMISQLKSTINGRFSYCLLPVFTDS 255

Query: 296 GASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAK--- 352
             S S I  G S +   +  ++   ++  P    +Y++ L G S+G K+L   GF+K   
Sbjct: 256 SMS-SRINFGRSGIVSGAGTVSTPLVMKGPD-TYYYLITLEGFSVGKKRLSYKGFSKKAE 313

Query: 353 ---GGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIP 409
              G I++DSGT  T LP   Y  L+        G        I   C+N +  Q ++ P
Sbjct: 314 VEEGNIIVDSGTTYTYLPLEFYVKLEESVAHSIKGKRVRDPNGISSLCYNTTVDQ-IDAP 372

Query: 410 LVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDT 469
           ++   F+   +  V++     F++     VC  +   S   + GI+GN  Q N  V +D 
Sbjct: 373 IITAHFK---DANVELQPWNTFLRMQEDLVCFTVLPTS---DIGILGNLAQVNFLVGFDL 426

Query: 470 KNSQLGFAGEDCS 482
           +  ++ F   DC+
Sbjct: 427 RKKRVSFKAADCT 439


>gi|357118738|ref|XP_003561107.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 491

 Score =  176 bits (445), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 122/358 (34%), Positives = 178/358 (49%), Gaps = 36/358 (10%)

Query: 142 GGRNMTVIVDTGSDLTWVQCQPCKS--CYNQQDPVFDPSISPSYKKVLCNSSTCHAL-EF 198
           GG   T+++DT SD+ WVQC PC +  C+ Q D ++DPS S S     C+S  C  L  +
Sbjct: 152 GGVAQTMVIDTASDVPWVQCAPCPAPHCHAQTDVLYDPSKSSSSAAFPCSSPACRNLGPY 211

Query: 199 ATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA----SVNDFIFGCGRN-- 252
           A G    C+ +    C Y V Y DGS + G    + L L  A    ++++F FGC     
Sbjct: 212 ANG----CTPAGD-QCQYRVQYPDGSASAGTYISDVLTLNPAKPASAISEFRFGCSHALL 266

Query: 253 NKGLFGG-VSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFK 311
             G F    SG+M LGR   SL +QT   +G +FSYCLP T     SG  ILG       
Sbjct: 267 QPGSFSNKTSGIMALGRGAQSLPTQTKATYGDVFSYCLPPTPVH--SGFFILGVPRVAAS 324

Query: 312 NSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASG--FAKGGILIDSGTVITRLPPS 369
                  T M+ +      Y++ L  I + GK+L      FA G ++ DS T++TRLPP+
Sbjct: 325 R---YAVTPMLRSKAAPMLYLVRLIAIEVAGKRLPVPPAVFAAGAVM-DSRTIVTRLPPT 380

Query: 370 IYSALKAEFLKQFSGFPSAPGFSILDTCFNLS-----AYQEVNIPLVKMEFEG-NAEMTV 423
            Y AL+A F+ +   + +A     LDTC++ S         V +P + + F+G N  + +
Sbjct: 381 AYMALRAAFVAEMRAYRAAAPKEHLDTCYDFSGAAPGGGGGVKLPKITLVFDGPNGAVEL 440

Query: 424 DVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
           D +G++          CLA A  + +  TGIIGN QQ+   V+Y+   + +GF    C
Sbjct: 441 DPSGVLL-------DGCLAFAPNTDDQMTGIIGNVQQQALEVLYNVDGATVGFRRGAC 491


>gi|359476191|ref|XP_003631801.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 439

 Score =  175 bits (444), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 143/423 (33%), Positives = 210/423 (49%), Gaps = 76/423 (17%)

Query: 77  CSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKD-VSNTEIPLTSGIRLQTLNY 135
           CSG         Q     D   V ++ S+       N+KD   N ++    G      N+
Sbjct: 75  CSGSGHSQPPSPQEIFGRDESRVSFINSKFNQYAPENLKDHTPNNKLFDEDG------NF 128

Query: 136 IATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTC 193
           +  +  G   +N T+I+DTGS +TW QC+ C                             
Sbjct: 129 LVDVAFGTPPQNFTLILDTGSSITWTQCKACTV--------------------------- 161

Query: 194 HALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASV-NDFIFGCGRN 252
                              + NY ++YGD S + G  G + + L  + V   F FG GRN
Sbjct: 162 -------------------ENNYNMTYGDDSTSVGNYGCDTMTLEPSDVFQKFQFGRGRN 202

Query: 253 NKGLFG-GVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFK 311
           NKG FG GV G++GLG+  LS VSQT+  F  +FSYCLP   +  + GSL+ G  ++   
Sbjct: 203 NKGDFGSGVDGMLGLGQGQLSTVSQTASKFNKVFSYCLP---EEDSIGSLLFGEKAT--S 257

Query: 312 NSTPITYTNMIPNP---QLATFYILNLTGISIGGKQLQ--ASGFAKGGILIDSGTVITRL 366
            S+ + +T+++  P   Q + +Y +NL+ IS+G ++L   +S FA  G +IDS TVITRL
Sbjct: 258 QSSSLKFTSLVNGPGTLQESGYYFVNLSDISVGNERLNIPSSVFASPGTIIDSRTVITRL 317

Query: 367 PPSIYSALKAEFLKQFSGFPSAPGF----SILDTCFNLSAYQEVNIPLVKMEFEGNAEMT 422
           P   YSALKA F K  + +P + G      ILDTC+NLS  ++V +P + + F G A++ 
Sbjct: 318 PQRAYSALKAAFKKAMAKYPLSNGRRKKGDILDTCYNLSGRKDVLLPEIVLHFGGGADVR 377

Query: 423 VDVTGIVYFVKSDASQVCLALASLS---YEDETGIIGNYQQKNQRVIYDTKNSQLGFAGE 479
           ++ T IV+   SD S++CLA A  S      E  IIGN QQ +  V+YD +  ++GF   
Sbjct: 378 LNGTNIVW--GSDESRLCLAFAGNSKSTMNPELTIIGNRQQLSLTVLYDIQGGRIGFRSN 435

Query: 480 DCS 482
            CS
Sbjct: 436 GCS 438


>gi|356546370|ref|XP_003541599.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 434

 Score =  175 bits (444), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 125/353 (35%), Positives = 184/353 (52%), Gaps = 41/353 (11%)

Query: 149 IVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSS 208
           I+DTGSD+ W+QC+PC+ CYNQ   +FDPS S +YK +  +S+TC ++E  +     CSS
Sbjct: 102 IIDTGSDMIWLQCKPCEKCYNQTTRIFDPSKSNTYKILPFSSTTCQSVEDTS-----CSS 156

Query: 209 SSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIF-----GCGRNNKGLF-GGVSG 262
            +   C Y + YGDGSY++G+L  E L LG  + +   F     GCGRNN   F G  SG
Sbjct: 157 DNRKMCEYTIYYGDGSYSQGDLSVETLTLGSTNGSSVKFRRTVIGCGRNNTVSFEGKSSG 216

Query: 263 LMGLGRSDLSLVSQ---TSEIFGGLFSYCLPSTQDAGAS----GSLILGGNSSVFKNSTP 315
           ++GLG   +SL++Q    S   G  FSYCL S  +  +      + ++ G+ +V   STP
Sbjct: 217 IVGLGNGPVSLINQLRRRSSSIGRKFSYCLASMSNISSKLNFGDAAVVSGDGTV---STP 273

Query: 316 ITYTNMIPNPQLATFYILNLTGISIGGKQLQ--ASGF---AKGGILIDSGTVITRLPPSI 370
           I    +  +P++  FY L L   S+G  +++  +S F    KG I+IDSGT +T LP  I
Sbjct: 274 I----VTHDPKV--FYYLTLEAFSVGNNRIEFTSSSFRFGEKGNIIIDSGTTLTLLPNDI 327

Query: 371 YSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVY 430
           YS L++                 L  C+  S + E+N P++   F G     V +  +  
Sbjct: 328 YSKLESAVADLVELDRVKDPLKQLSLCYR-STFDELNAPVIMAHFSG---ADVKLNAVNT 383

Query: 431 FVKSDASQVCLALASLSYEDETG-IIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
           F++ +    CLA  S     + G I GN  Q+N  V YD +   + F   DCS
Sbjct: 384 FIEVEQGVTCLAFIS----SKIGPIFGNMAQQNFLVGYDLQKKIVSFKPTDCS 432


>gi|356569424|ref|XP_003552901.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 536

 Score =  175 bits (444), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 137/419 (32%), Positives = 196/419 (46%), Gaps = 34/419 (8%)

Query: 87  QQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELGG--R 144
           QQQN L   N  V  L+S  K+  SGNI         L SG  L T  Y   + +G   +
Sbjct: 131 QQQNNLA--NAVVASLKSS-KDEFSGNIMAT------LESGASLGTGEYFIDMFVGTPPK 181

Query: 145 NMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSG 204
           ++ +I+DTGSDL+W+QC PC  C+ Q  P ++P+ S SY+ + C    C  +  ++ +  
Sbjct: 182 HVWLILDTGSDLSWIQCDPCYDCFEQNGPHYNPNESSSYRNISCYDPRCQLV--SSPDPL 239

Query: 205 VCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL------GKAS---VNDFIFGCGRNNKG 255
               +    C YF  Y DGS T G+   E   +      GK     V D +FGCG  NKG
Sbjct: 240 QHCKTENQTCPYFYDYADGSNTTGDFALETFTVNLTWPNGKEKFKHVVDVMFGCGHWNKG 299

Query: 256 LFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPST-QDAGASGSLILGGNSSVFKNST 314
            F G  GL+GLGR  LS  SQ   I+G  FSYCL     +   S  LI G +  +  N  
Sbjct: 300 FFHGAGGLLGLGRGPLSFPSQLQSIYGHSFSYCLTDLFSNTSVSSKLIFGEDKELL-NHH 358

Query: 315 PITYTNMIPNPQLA--TFYILNLTGISIGG-------KQLQASGFAKGGILIDSGTVITR 365
            + +T ++   +    TFY L +  I +GG       K    S    GG +IDSG+ +T 
Sbjct: 359 NLNFTKLLAGEETPDDTFYYLQIKSIVVGGEVLDIPEKTWHWSSEGVGGTIIDSGSTLTF 418

Query: 366 LPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDV 425
            P S Y  +K  F K+      A    I+  C+N+S   +V +P   + F   A      
Sbjct: 419 FPDSAYDVIKEAFEKKIKLQQIAADDFIMSPCYNVSGAMQVELPDYGIHFADGAVWNFPA 478

Query: 426 TGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
               Y  + D   +CLA+          IIGN  Q+N  ++YD K S+LG++   C+ +
Sbjct: 479 ENYFYQYEPDEV-ICLAILKTPNHSHLTIIGNLLQQNFHILYDVKRSRLGYSPRRCAEV 536


>gi|359476197|ref|XP_003631803.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 414

 Score =  175 bits (444), Expect = 5e-41,   Method: Compositional matrix adjust.
 Identities = 127/343 (37%), Positives = 187/343 (54%), Gaps = 43/343 (12%)

Query: 156 LTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCN 215
           +TW QC+PC  C       FDPS S +Y    C  ST        GN+            
Sbjct: 98  ITWTQCKPCVRCLKDSHRHFDPSASLTYSLGSCIPST-------VGNT------------ 138

Query: 216 YFVSYGDGSYTRGELGREHLGLGKASV-NDFIFGCGRNNKGLFG-GVSGLMGLGRSDLSL 273
           Y ++YGD S + G  G + + L  + V   F FGCGRNN+G FG G  G++GLG+  LS 
Sbjct: 139 YNMTYGDKSTSVGNYGCDTMTLEPSDVFPKFQFGCGRNNEGDFGSGADGMLGLGQGQLST 198

Query: 274 VSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNP-----QLA 328
           VSQT+  F  +FSYCLP   +  + GSL+ G  ++   + + + +T+++  P     + +
Sbjct: 199 VSQTASKFKKVFSYCLP---EEDSIGSLLFGEKAT---SQSSLKFTSLVNGPGTSGLEES 252

Query: 329 TFYILNLTGISIGGKQLQ--ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFP 386
            +Y + L  IS+G K+L   +S FA  G +IDSGTVIT LP   YSAL A F K  + +P
Sbjct: 253 GYYFVKLLDISVGNKRLNVPSSVFASPGTIIDSGTVITCLPQRAYSALTAAFKKAMAKYP 312

Query: 387 SAPGF----SILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLA 442
            + G      ILDTC+NLS  ++V +P + + F   A++ ++   +++   +DAS++CLA
Sbjct: 313 LSNGRRKKGDILDTCYNLSGRKDVLLPEIVLHFGEGADVRLNGKRVIW--GNDASRLCLA 370

Query: 443 LASLS---YEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
            A  S      E  IIGN QQ +  V+YD +  ++GF G  CS
Sbjct: 371 FAGNSKSTMNSELTIIGNRQQVSLTVLYDIQGGRIGFGGNGCS 413


>gi|359492937|ref|XP_002283889.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 439

 Score =  174 bits (442), Expect = 7e-41,   Method: Compositional matrix adjust.
 Identities = 135/415 (32%), Positives = 199/415 (47%), Gaps = 40/415 (9%)

Query: 80  KIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTL-NYIAT 138
           K   W     N    D   V YL S + +  +        T +P+ SG ++  + NY+  
Sbjct: 51  KAGSWVNTVINMASKDPARVTYLSSLVASPKA--------TSVPIASGQQVLNIGNYVVR 102

Query: 139 IELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHAL 196
           ++LG  G+ M +++DT  D  WV C  C  C     P F P+ S +Y  + C+   C  +
Sbjct: 103 VKLGTPGQLMFMVLDTSRDAAWVPCADCAGC---SSPTFSPNTSSTYASLQCSVPQCTQV 159

Query: 197 EFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGL 256
              +     C ++    C +  +YG  S     L ++ LGL   ++  + FGC     G 
Sbjct: 160 RGLS-----CPTTGTAACFFNQTYGGDSSFSAMLSQDSLGLAVDTLPSYSFGCVNAVSGS 214

Query: 257 FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPI 316
                GL+GLGR  +SL+SQ+  ++ G+FSYC PS +    SGSL LG           I
Sbjct: 215 TLPPQGLLGLGRGPMSLLSQSGSLYSGVFSYCFPSFKSYYFSGSLRLGP----LGQPKNI 270

Query: 317 TYTNMIPNPQLATFYILNLTGISIG------GKQLQASGFAKG-GILIDSGTVITRLPPS 369
             T ++ NP   T Y +NLTG+S+G        +L A     G G +IDSGTVITR    
Sbjct: 271 RTTPLLRNPHRPTLYYVNLTGVSVGRVLVPVAPELLAFDPNTGAGTIIDSGTVITRFVEP 330

Query: 370 IYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEG-NAEMTVDVTGI 428
           +Y+A++ EF KQ  G P A      DTCF  +A  E   P V   F G + ++ ++ T I
Sbjct: 331 VYAAIRDEFRKQVKG-PFA-TIGAFDTCF--AATNEDIAPPVTFHFTGMDLKLPLENTLI 386

Query: 429 VYFVKSDASQVCLALASL--SYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
                S  S  CLA+A+   +      +I N QQ+N R+++D  NS+LG A E C
Sbjct: 387 ---HSSAGSLACLAMAAAPNNVNSVLNVIANLQQQNLRIMFDVTNSRLGIARELC 438


>gi|242044886|ref|XP_002460314.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
 gi|241923691|gb|EER96835.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
          Length = 444

 Score =  174 bits (442), Expect = 7e-41,   Method: Compositional matrix adjust.
 Identities = 127/360 (35%), Positives = 178/360 (49%), Gaps = 35/360 (9%)

Query: 143 GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGN 202
            R  + I+DTGSDL W QC PC  C +Q  P FDP+ S +Y+ + C++  C+AL +    
Sbjct: 102 ARFYSAILDTGSDLIWTQCAPCLLCVDQPTPYFDPANSSTYRSLGCSAPACNALYYP--- 158

Query: 203 SGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG----KASVNDFIFGCGRNNKGLFG 258
             +C   +   C Y   YGD + T G L  E    G    + ++    FGCG  N G   
Sbjct: 159 --LCYQKT---CVYQYFYGDSASTAGVLANETFTFGTNDTRVTLPRISFGCGNLNAGSLA 213

Query: 259 GVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSV-FKNSTPIT 317
             SG++G GR  LSLVSQ   +    FSYCL S      S  L  G  +++   N++ + 
Sbjct: 214 NGSGMVGFGRGSLSLVSQ---LGSPRFSYCLTSFLSPVRS-RLYFGAYATLNSTNASTVQ 269

Query: 318 YTNMIPNPQLATFYILNLTGISIGGKQLQASGFA--------KGGILIDSGTVITRLPPS 369
            T  I NP L T Y LN+TGIS+GG +L               GG +IDSGT IT L   
Sbjct: 270 STPFIINPALPTMYFLNMTGISVGGNRLPIDPAVLAINDTDGTGGTIIDSGTTITYLAEP 329

Query: 370 IYSALKAEFLKQF-SGFP--SAPGFSILDTCFNL--SAYQEVNIPLVKMEFEGNAEMTVD 424
            Y A++  F+    S  P       S+LDTCF       Q V +P + + F+G A+  + 
Sbjct: 330 AYYAVREAFVLYLNSTLPLLDVTETSVLDTCFQWPPPPRQSVTLPQLVLHFDG-ADWELP 388

Query: 425 VTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
           +   +  V      +CLA+A+ S   +  IIG+YQ +N  V+YD +NS L F    C+ M
Sbjct: 389 LQNYM-LVDPSTGGLCLAMATSS---DGSIIGSYQHQNFNVLYDLENSLLSFVPAPCNLM 444


>gi|449454654|ref|XP_004145069.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
 gi|449470632|ref|XP_004153020.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
 gi|449499016|ref|XP_004160697.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 434

 Score =  174 bits (442), Expect = 7e-41,   Method: Compositional matrix adjust.
 Identities = 141/434 (32%), Positives = 213/434 (49%), Gaps = 42/434 (9%)

Query: 67  ITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTS 126
            T+EL H++     + + +E   +R I++ L     +S  +N +   + +    E P+ +
Sbjct: 27  FTVELIHRDSPKSPMYNSSETHFDR-IVNALR----RSSHRNTV---VLESDTAEAPIFN 78

Query: 127 GIRLQTLNYIATIELGGRNMTVIV--DTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYK 184
                   Y+  I +G    +++   DTGSD+ W QC+PC +CY Q  P+FDPS S +YK
Sbjct: 79  ----NGGEYLVEISVGTPPFSIVAVADTGSDVIWTQCKPCSNCYQQNAPMFDPSKSTTYK 134

Query: 185 KVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVND 244
            V C+S  C      +G+   CS  S  +C Y ++YGD S+++G L  + + +   S   
Sbjct: 135 NVACSSPVCS----YSGDGSSCSDDS--ECLYSIAYGDDSHSQGNLAVDTVTMQSTSGRP 188

Query: 245 FIF-----GCGRNNKGLF-GGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL-P-STQDAG 296
             F     GCG +N G F   VSG++GLGR   SLV+Q     GG FSYCL P  T    
Sbjct: 189 VAFPRTVIGCGHDNAGTFNANVSGIVGLGRGPASLVTQLGPATGGKFSYCLIPIGTGSTN 248

Query: 297 ASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ-ASGFAKGG- 354
            S  L  G N++V  + T    T +  + Q  TFY L L  +S+G  +     G +K G 
Sbjct: 249 DSTKLNFGSNANVSGSGT--VSTPIYSSAQYKTFYSLKLEAVSVGDTKFNFPEGASKLGG 306

Query: 355 ---ILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFS-ILDTCFNLSAYQEVNIPL 410
              I+IDSGT +T LP ++ ++  +  + Q    P A   S  LD CF  +   +  +P 
Sbjct: 307 ESNIIIDSGTTLTYLPSALLNSFGSA-ISQSMSLPHAQDPSEFLDYCFATTT-DDYEMPP 364

Query: 411 VKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTK 470
           V M FEG A++ +    +  FV+     +CLA  S   +D   I GN  Q N  V YD K
Sbjct: 365 VTMHFEG-ADVPLQRENL--FVRLSDDTICLAFGSFP-DDNIFIYGNIAQSNFLVGYDIK 420

Query: 471 NSQLGFAGEDCSSM 484
           N  + F    C ++
Sbjct: 421 NLAVSFQPAHCGAV 434


>gi|242092368|ref|XP_002436674.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
 gi|241914897|gb|EER88041.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
          Length = 461

 Score =  174 bits (442), Expect = 7e-41,   Method: Compositional matrix adjust.
 Identities = 124/378 (32%), Positives = 184/378 (48%), Gaps = 34/378 (8%)

Query: 132 TLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCN 189
           T  Y+  + +G   R + + +DTGSDL W QC PC+ C++Q  P+ DP+ S +Y  + C 
Sbjct: 89  TNEYLVHLAVGTPPRPVALTLDTGSDLVWTQCAPCRDCFHQGLPLLDPAASSTYAALPCG 148

Query: 190 SSTCHALEFATGNSGVCSS--SSPPDCNYFVSYGDGSYTRGELGREHLGLG--------K 239
           +  C AL F +   G  SS  +    C Y   YGD S T GE+  +    G        +
Sbjct: 149 APRCRALPFTSCGGGGRSSWGNGNRSCAYIYHYGDKSVTVGEIATDRFTFGGDNGDGDSR 208

Query: 240 ASVNDFIFGCGRNNKGLF-GGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGAS 298
                  FGCG  NKG+F    +G+ G GR   SL SQ +      FSYC  S  ++ +S
Sbjct: 209 LPTRRLTFGCGHFNKGVFQSNETGIAGFGRGRWSLPSQLNVT---TFSYCFTSMFESKSS 265

Query: 299 GSLILGGN-------SSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA 351
             + LGG        S     S  +  T ++ NP   + Y L+L GIS+G  +L      
Sbjct: 266 -LVTLGGAPAAALLYSHAAHISGEVRTTPLLKNPSQPSLYFLSLKGISVGKTRLAVPEAK 324

Query: 352 KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGF--SILDTCFNL---SAYQEV 406
               +IDSG  IT LP ++Y A+KAEF  Q  G P       S LD CF L   + ++  
Sbjct: 325 LRSTIIDSGASITTLPEAVYEAVKAEFAAQV-GLPPTGVVEGSALDLCFALPVTALWRRP 383

Query: 407 NIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVI 466
            +P + +  +G A+  +     V+  +  A++V   +   +  D+T +IGN+QQ+N  V+
Sbjct: 384 PVPSLTLHLDG-ADWELPRGNYVF--EDLAARVMCVVLDAAPGDQT-VIGNFQQQNTHVV 439

Query: 467 YDTKNSQLGFAGEDCSSM 484
           YD +N  L FA   C S+
Sbjct: 440 YDLENDWLSFAPARCDSL 457


>gi|218192703|gb|EEC75130.1| hypothetical protein OsI_11317 [Oryza sativa Indica Group]
          Length = 440

 Score =  174 bits (442), Expect = 7e-41,   Method: Compositional matrix adjust.
 Identities = 125/350 (35%), Positives = 178/350 (50%), Gaps = 28/350 (8%)

Query: 148 VIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCS 207
           + +DTGSDL W QCQPC  C+NQ  P +D S S ++    C+S+ C      T    +C 
Sbjct: 106 LTLDTGSDLVWTQCQPCAVCFNQSLPYYDASRSSTFALPSCDSTQCKLDPSVT----MCV 161

Query: 208 SSSPPDCNYFVSYGDGSYTRGELGREHLG-LGKASVNDFIFGCGRNNKGLF-GGVSGLMG 265
           + +   C +  SYGD S T G L  E +  +  ASV   +FGCG NN G+F    +G+ G
Sbjct: 162 NQTVQTCAFSYSYGDKSATIGFLDVETVSFVAGASVPGVVFGCGLNNTGIFRSNETGIAG 221

Query: 266 LGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNST-PITYTNMIPN 324
            GR  LSL SQ      G FS+C  +      S +++    + ++KN    +  T +I N
Sbjct: 222 FGRGPLSLPSQLKV---GNFSHCFTAVSGRKPS-TVLFDLPADLYKNGRGTVQTTPLIKN 277

Query: 325 PQLATFYILNLTGISIGGKQLQA--SGFA----KGGILIDSGTVITRLPPSIYSALKAEF 378
           P   TFY L+L GI++G  +L    S FA     GG +IDSGT  T LPP +Y  +  EF
Sbjct: 278 PAHPTFYYLSLKGITVGSTRLPVPESAFALKNGTGGTIIDSGTAFTSLPPRVYRLVHDEF 337

Query: 379 LK--QFSGFPSAPGFSILDTCFNLSAY-QEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSD 435
               +    PS     +L  CF+     +  ++P + + FEG A M +     V+  K  
Sbjct: 338 AAHVKLPVVPSNETGPLL--CFSAPPLGKAPHVPKLVLHFEG-ATMHLPRENYVFEAKDG 394

Query: 436 AS-QVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
            +  +CLA+     E E  IIGN+QQ+N  V+YD KNS+L F    C  +
Sbjct: 395 GNCSICLAI----IEGEMTIIGNFQQQNMHVLYDLKNSKLSFVRAKCDKL 440


>gi|21450872|gb|AAK44106.2|AF370291_1 unknown protein [Arabidopsis thaliana]
          Length = 375

 Score =  174 bits (441), Expect = 8e-41,   Method: Compositional matrix adjust.
 Identities = 128/392 (32%), Positives = 196/392 (50%), Gaps = 37/392 (9%)

Query: 106 IKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQP 163
           + ++++G  K    T +P+ SG +L   NY+   +LG   + M +++DT +D  W+ C  
Sbjct: 4   LSSLVAGKPKP---TSVPVASGNQLHIGNYVVRAKLGTPPQLMFMVLDTSNDAVWLPCSG 60

Query: 164 CKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDG 223
           C  C N        S S +Y  V C+++ C     A G +   SS  P  C++  SYG  
Sbjct: 61  CSGCSNASTSFNTNSSS-TYSTVSCSTAQCTQ---ARGLTCPSSSPQPSVCSFNQSYGGD 116

Query: 224 SYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGG 283
           S     L ++ L L    + +F FGC  +  G      GLMGLGR  +SLVSQT+ ++ G
Sbjct: 117 SSFSASLVQDTLTLAPDVIPNFSFGCINSASGNSLPPQGLMGLGRGPMSLVSQTTSLYSG 176

Query: 284 LFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGK 343
           +FSYCLPS +    SGSL LG    +      I YT ++ NP+  + Y +NLTG+S+G  
Sbjct: 177 VFSYCLPSFRSFYFSGSLKLG----LLGQPKSIRYTPLLRNPRRPSLYYVNLTGVSVGSV 232

Query: 344 Q-------LQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQ--FSGFPSAPGFSIL 394
           Q       L     +  G +IDSGTVITR    +Y A++ EF KQ   S F +   F   
Sbjct: 233 QVPVDPVYLTFDANSGAGTIIDSGTVITRFAQPVYEAIRDEFRKQVNVSSFSTLGAF--- 289

Query: 395 DTCF---NLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDE 451
           DTCF   N +   ++ + +  ++ +   E T+          S  +  CL++A +     
Sbjct: 290 DTCFSADNENVAPKITLHMTSLDLKLPMENTL-------IHSSAGTLTCLSMAGIRQNAN 342

Query: 452 T--GIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
               +I N QQ+N R+++D  NS++G A E C
Sbjct: 343 AVLNVIANLQQQNLRILFDVPNSRIGIAPEPC 374


>gi|357492389|ref|XP_003616483.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355517818|gb|AES99441.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 434

 Score =  174 bits (441), Expect = 9e-41,   Method: Compositional matrix adjust.
 Identities = 121/368 (32%), Positives = 191/368 (51%), Gaps = 38/368 (10%)

Query: 135 YIATIELGGRNMTV--IVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
           Y+ T  +G     +  I DTGSD+ W+QC+PC+ CYNQ  P+F+PS S SYK + C+S  
Sbjct: 87  YLMTYSVGTPPTKIYGIADTGSDIVWLQCEPCEQCYNQTTPIFNPSKSSSYKNIPCSSKL 146

Query: 193 CHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL-----GKASVNDFIF 247
           CH++   +     CS  +   C Y +SYGD S+++G+L  + L L        S    + 
Sbjct: 147 CHSVRDTS-----CSDQN--SCQYKISYGDSSHSQGDLSVDTLSLESTSGSPVSFPKIVI 199

Query: 248 GCGRNNKGLFGGV-SGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILG-G 305
           GCG +N G FGG  SG++GLG   +SL++Q     GG FSYCL    +  ++ S IL  G
Sbjct: 200 GCGTDNAGTFGGASSGIVGLGGGPVSLITQLGSSIGGKFSYCLVPLLNKESNASSILSFG 259

Query: 306 NSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGG-----ILIDSG 360
           +++V      ++   +  +P    FY L L   S+G K+++  G ++GG     I+IDSG
Sbjct: 260 DAAVVSGDGVVSTPLIKKDP---VFYFLTLQAFSVGNKRVEFGGSSEGGDDEGNIIIDSG 316

Query: 361 TVITRLPPSIYSALKA---EFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEG 417
           T +T +P  +Y+ L++   + +K          FS+   C++L +  E + P++ + F+G
Sbjct: 317 TTLTLIPSDVYTNLESAVVDLVKLDRVDDPNQQFSL---CYSLKS-NEYDFPIITVHFKG 372

Query: 418 NAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETG-IIGNYQQKNQRVIYDTKNSQLGF 476
                V++  I  FV      VC A        + G I GN  Q+N  V YD +   + F
Sbjct: 373 ---ADVELHSISTFVPITDGIVCFAFQP---SPQLGSIFGNLAQQNLLVGYDLQQKTVSF 426

Query: 477 AGEDCSSM 484
              DC+ +
Sbjct: 427 KPTDCTKV 434


>gi|414589630|tpg|DAA40201.1| TPA: hypothetical protein ZEAMMB73_629620 [Zea mays]
          Length = 443

 Score =  174 bits (441), Expect = 9e-41,   Method: Compositional matrix adjust.
 Identities = 134/364 (36%), Positives = 178/364 (48%), Gaps = 44/364 (12%)

Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNS 203
           R  + I+DTGSDL W QC PC  C +Q  P FDP+ S +Y+ + C S  C+AL +     
Sbjct: 101 RYYSAILDTGSDLIWTQCAPCLLCVDQPTPYFDPARSATYRSLGCASPACNALYYPLCYQ 160

Query: 204 GVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG----KASVNDFIFGCGRNNKGLFGG 259
            VC         YF  YGD + T G L  E    G    + S+    FGCG  N G    
Sbjct: 161 KVCVY------QYF--YGDSASTAGVLANETFTFGTNETRVSLPGISFGCGNLNAGSLAN 212

Query: 260 VSGLMGLGRSDLSLVSQTSEIFGGLFSYCL-------PSTQDAGASGSLILGGNSSVFKN 312
            SG++G GR  LSLVSQ        FSYCL       PS    G   +L     +S   +
Sbjct: 213 GSGMVGFGRGSLSLVSQLGS---PRFSYCLTSFLSPVPSRLYFGVYATL-----NSTNAS 264

Query: 313 STPITYTNMIPNPQLATFYILNLTGISIGGKQL--QASGFA------KGGILIDSGTVIT 364
           S P+  T  + NP L T Y LN+TGIS+GG  L    + FA       GG +IDSGT IT
Sbjct: 265 SEPVQSTPFVVNPALPTMYFLNMTGISVGGYLLPIDPAVFAINDTDGTGGTIIDSGTTIT 324

Query: 365 RLPPSIYSALKAEFLKQFSGFP--SAPGFSILDTCFNL--SAYQEVNIPLVKMEFEGNAE 420
            L    Y A++A F  Q +  P  +    S+LDTCF       Q V +P + + F+G A+
Sbjct: 325 YLAEPAYDAVRAAFASQIT-LPLLNVTDASVLDTCFQWPPPPRQSVTLPQLVLHFDG-AD 382

Query: 421 MTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGED 480
             + +   +    S    +CLA+AS S   +  IIG+YQ +N  V+YD +NS + F    
Sbjct: 383 WELPLQNYMLVDPSTGGGLCLAMASSS---DGSIIGSYQHQNFNVLYDLENSLMSFVPAP 439

Query: 481 CSSM 484
           C  M
Sbjct: 440 CHLM 443


>gi|168014386|ref|XP_001759733.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162689272|gb|EDQ75645.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 392

 Score =  174 bits (441), Expect = 9e-41,   Method: Compositional matrix adjust.
 Identities = 125/376 (33%), Positives = 174/376 (46%), Gaps = 22/376 (5%)

Query: 123 PLTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSIS 180
           PL SG  L +  Y     LG   +   +IVDTGSDL +VQC PC  CY Q  P++ PS S
Sbjct: 22  PLVSGTTLGSGQYFVDFSLGTPEQKFHLIVDTGSDLAFVQCAPCDLCYEQDGPLYQPSNS 81

Query: 181 PSYKKVLCNSSTCHALEFATGNSGVCSSS---SPPD--CNYFVSYGDGSYTRGELGREHL 235
            ++  V C+S+ C  +    G    CSSS   SPP   C+Y   YGD S T G    E  
Sbjct: 82  STFTPVPCDSAECLLIPAPVG--APCSSSYPESPPQGACSYEYRYGDNSSTVGVFAYETA 139

Query: 236 GLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPS-TQD 294
            +G   VN   FGCG  N+G F    G++GLG+  LS  SQ    F   F+YCL S    
Sbjct: 140 TVGGIRVNHVAFGCGNRNQGSFVSAGGVLGLGQGALSFTSQAGYAFENKFAYCLTSYLSP 199

Query: 295 AGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL-------QA 347
                SLI G +  +      + +T ++ NP   + Y + +  I  GG+ L       + 
Sbjct: 200 TSVFSSLIFGDD--MMSTIHDLQFTPLVSNPLNPSVYYVQIVRICFGGETLLIPDSAWKI 257

Query: 348 SGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVN 407
                GG + DSGT +T   P  Y+ + A F K      + P    L  C N+S      
Sbjct: 258 DSVGNGGTIFDSGTTVTYWSPQAYARIIAAFEKSVPYPRAPPSPQGLPLCVNVSGIDHPI 317

Query: 408 IPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIY 467
            P   +EF+  A    +     YF++   +  CLA+   S  D   +IGN  Q+N  V Y
Sbjct: 318 YPSFTIEFDQGATYRPNQGN--YFIEVSPNIDCLAMLESS-SDGFNVIGNIIQQNYLVQY 374

Query: 468 DTKNSQLGFAGEDCSS 483
           D +  ++GFA  +C +
Sbjct: 375 DREEHRIGFAHANCDA 390


>gi|115479485|ref|NP_001063336.1| Os09g0452400 [Oryza sativa Japonica Group]
 gi|51535935|dbj|BAD38017.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
 gi|113631569|dbj|BAF25250.1| Os09g0452400 [Oryza sativa Japonica Group]
 gi|215693279|dbj|BAG88661.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 441

 Score =  174 bits (440), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 133/378 (35%), Positives = 187/378 (49%), Gaps = 44/378 (11%)

Query: 128 IRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKK 185
           +R     Y+  + +G   R  + ++DTGSDL W QC PC  C  Q  P F+P+ S SY  
Sbjct: 81  LRFSEGEYLMDVGIGSPPRYFSAMIDTGSDLIWTQCAPCLLCVEQPTPYFEPAKSTSYAS 140

Query: 186 VLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG----KAS 241
           + C+S+ C+AL      S +C  ++   C Y   YGD + + G L  E    G    + +
Sbjct: 141 LPCSSAMCNALY-----SPLCFQNA---CVYQAFYGDSASSAGVLANETFTFGTNSTRVA 192

Query: 242 VNDFIFGCGRNNKG-LFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGS 300
           V    FGCG  N G LF G SG++G GR  LSLVSQ        FSYCL S     A+  
Sbjct: 193 VPRVSFGCGNMNAGTLFNG-SGMVGFGRGALSLVSQLGS---PRFSYCLTSFMSP-ATSR 247

Query: 301 LILGG----NSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL--QASGFA--- 351
           L  G     NS+   +S P+  T  I NP L T Y LN+TGIS+ G  L    S FA   
Sbjct: 248 LYFGAYATLNSTNTSSSGPVQSTPFIVNPALPTMYFLNMTGISVAGDLLPIDPSVFAINE 307

Query: 352 ---KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGF--SILDTCFNL--SAYQ 404
               GG++IDSGT +T L    Y+ ++  F+  + G P A        DTCF       +
Sbjct: 308 TDGTGGVIIDSGTTVTFLAQPAYAMVQGAFVA-WVGLPRANATPSDTFDTCFKWPPPPRR 366

Query: 405 EVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDAS-QVCLALASLSYEDETGIIGNYQQKNQ 463
            V +P + + F+G A+M + +    Y V    +  +CLA+      D+  IIG++Q +N 
Sbjct: 367 MVTLPEMVLHFDG-ADMELPLEN--YMVMDGGTGNLCLAMLP---SDDGSIIGSFQHQNF 420

Query: 464 RVIYDTKNSQLGFAGEDC 481
            ++YD +NS L F    C
Sbjct: 421 HMLYDLENSLLSFVPAPC 438


>gi|125563957|gb|EAZ09337.1| hypothetical protein OsI_31609 [Oryza sativa Indica Group]
 gi|125605916|gb|EAZ44952.1| hypothetical protein OsJ_29595 [Oryza sativa Japonica Group]
          Length = 438

 Score =  174 bits (440), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 133/378 (35%), Positives = 187/378 (49%), Gaps = 44/378 (11%)

Query: 128 IRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKK 185
           +R     Y+  + +G   R  + ++DTGSDL W QC PC  C  Q  P F+P+ S SY  
Sbjct: 78  LRFSEGEYLMDVGIGSPPRYFSAMIDTGSDLIWTQCAPCLLCVEQPTPYFEPAKSTSYAS 137

Query: 186 VLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG----KAS 241
           + C+S+ C+AL      S +C  ++   C Y   YGD + + G L  E    G    + +
Sbjct: 138 LPCSSAMCNALY-----SPLCFQNA---CVYQAFYGDSASSAGVLANETFTFGTNSTRVA 189

Query: 242 VNDFIFGCGRNNKG-LFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGS 300
           V    FGCG  N G LF G SG++G GR  LSLVSQ        FSYCL S     A+  
Sbjct: 190 VPRVSFGCGNMNAGTLFNG-SGMVGFGRGALSLVSQLGS---PRFSYCLTSFMSP-ATSR 244

Query: 301 LILGG----NSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL--QASGFA--- 351
           L  G     NS+   +S P+  T  I NP L T Y LN+TGIS+ G  L    S FA   
Sbjct: 245 LYFGAYATLNSTNTSSSGPVQSTPFIVNPALPTMYFLNMTGISVAGDLLPIDPSVFAINE 304

Query: 352 ---KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGF--SILDTCFNL--SAYQ 404
               GG++IDSGT +T L    Y+ ++  F+  + G P A        DTCF       +
Sbjct: 305 TDGTGGVIIDSGTTVTFLAQPAYAMVQGAFVA-WVGLPRANATPSDTFDTCFKWPPPPRR 363

Query: 405 EVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDAS-QVCLALASLSYEDETGIIGNYQQKNQ 463
            V +P + + F+G A+M + +    Y V    +  +CLA+      D+  IIG++Q +N 
Sbjct: 364 MVTLPEMVLHFDG-ADMELPLEN--YMVMDGGTGNLCLAMLP---SDDGSIIGSFQHQNF 417

Query: 464 RVIYDTKNSQLGFAGEDC 481
            ++YD +NS L F    C
Sbjct: 418 HMLYDLENSLLSFVPAPC 435


>gi|224286173|gb|ACN40797.1| unknown [Picea sitchensis]
          Length = 383

 Score =  173 bits (439), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 136/403 (33%), Positives = 196/403 (48%), Gaps = 41/403 (10%)

Query: 99  VQYLQSRIKNM---ISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTG 153
           +Q  Q R++ +    + N   + + E P+T  I   +  Y+  + +G    +++ I+DTG
Sbjct: 5   IQRSQERLEKLQITSAVNTHQMKDIETPVTPDIG--SGEYLIQMAIGTPALSLSAIMDTG 62

Query: 154 SDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALE-FATGNSGVCSSSSPP 212
           SDL W +C PC  C          S S +Y KVLC SS C     F+  N G        
Sbjct: 63  SDLVWTKCNPCTDCSTSSIYDP--SSSSTYSKVLCQSSLCQPPSIFSCNNDG-------- 112

Query: 213 DCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLS 272
           DC Y   YGD S T G L  E   +   S+ +  FGCG +N+G F  V GL+G GR  LS
Sbjct: 113 DCEYVYPYGDRSSTSGILSDETFSISSQSLPNITFGCGHDNQG-FDKVGGLVGFGRGSLS 171

Query: 273 LVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYI 332
           LVSQ     G  FSYCL S  D+  +  L +G  +S+   +T +  T ++ +     +Y 
Sbjct: 172 LVSQLGPSMGNKFSYCLVSRTDSSKTSPLFIGNTASL--EATTVGSTPLVQSSSTNHYY- 228

Query: 333 LNLTGISIGGKQL---------QASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFS 383
           L+L GIS+GG+ L         Q+ G   GG++IDSGT +T L  + Y A+K   +   +
Sbjct: 229 LSLEGISVGGQSLAIPTGTFDIQSDG--SGGLIIDSGTTLTFLQQTAYDAVKEAMVSSIN 286

Query: 384 GFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVY-FVKSDASQVCLA 442
             P A G   LD CFN         P +   F+G      DV    Y F  S +  VCLA
Sbjct: 287 -LPQADG--QLDLCFNQQGSSNPGFPSMTFHFKG---ADYDVPKENYLFPDSTSDIVCLA 340

Query: 443 -LASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
            + + S      I GN QQ+N +++YD +N+ L FA   C ++
Sbjct: 341 MMPTNSNLGNMAIFGNVQQQNYQILYDNENNVLSFAPTACDTL 383


>gi|195627138|gb|ACG35399.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
          Length = 431

 Score =  173 bits (438), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 130/402 (32%), Positives = 199/402 (49%), Gaps = 34/402 (8%)

Query: 95  DNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTL-NYIATIELGG--RNMTVIVD 151
           D+  + +L S+  +  SG +     T  P+ SG   QT  +Y+    LG   + + + +D
Sbjct: 48  DDARLLFLSSKAAS--SGGV-----TSAPVASG---QTPPSYVVRAGLGTPVQQLLLALD 97

Query: 152 TGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSP 211
           T +D TW  C PC +C       F P+ S SY  + C S  C   E     +   +S+  
Sbjct: 98  TSADATWSHCAPCDTCPAGSR--FIPASSSSYASLPCASDWCPLFEGQPCPANQDASAPL 155

Query: 212 PDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVS--GLMGLGRS 269
           P C +   + D S+ +  LG + L LGK ++  + FGC     G    +   GL+GLGR 
Sbjct: 156 PACAFSKPFADTSF-QASLGSDTLRLGKDAIAGYAFGCVGAVAGPTTNLPKQGLLGLGRG 214

Query: 270 DLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLAT 329
            +SL+SQT   + G+FSYCLPS +    SGSL LG      +N   + YT ++ NP   +
Sbjct: 215 PMSLLSQTGSTYNGVFSYCLPSYRSYYFSGSLRLGAAGQP-RN---VRYTPLLTNPHRPS 270

Query: 330 FYILNLTGISIGGK--QLQASGFA-----KGGILIDSGTVITRLPPSIYSALKAEFLKQF 382
            Y +N+TG+S+G    ++ A  FA       G +IDSGTVITR    +Y+AL+ EF +Q 
Sbjct: 271 LYYVNVTGLSVGRTWVKVPAGSFAFDPATGAGTVIDSGTVITRWTAPVYAALREEFRRQV 330

Query: 383 SGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV-CL 441
           +           DTCFN         P V +  +G  ++T+ +   +  + S A+ + CL
Sbjct: 331 AAPSGYTSLGAFDTCFNTDEVAAGGAPPVTLHMDGGVDLTLPMENTL--IHSSATPLACL 388

Query: 442 ALASLS--YEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
           A+A           ++ N QQ+N RV+ D   S++GFA E C
Sbjct: 389 AMAEAPQNVNAVVNVVANLQQQNVRVVVDVAGSRVGFAREPC 430


>gi|326495450|dbj|BAJ85821.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 491

 Score =  172 bits (437), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 114/346 (32%), Positives = 168/346 (48%), Gaps = 25/346 (7%)

Query: 147 TVIVDTGSDLTWVQCQPC--KSCYNQQDPVFDPSISPSYKKVLCNSSTCHALE-FATGNS 203
           T+ +DT  D+ W+QC PC    CY Q++  FDP  S +   V C S  C  L  +A G S
Sbjct: 160 TMAIDTTEDVPWIQCLPCLIPQCYPQRNAFFDPRRSSTGAPVRCGSRACRTLGGYANGCS 219

Query: 204 GVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVN-DFIFGCGRNNKGLFGG-VS 261
              S+    DC Y + Y D   T G    + L +  ++   +F FGC    +G F    S
Sbjct: 220 KPNSTG---DCLYRIEYSDHRLTLGTYMTDTLTISPSTTFLNFRFGCSHAVRGKFSAQAS 276

Query: 262 GLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAG--ASGSLI---LGGNSSVFKNSTPI 316
           G M LG    SL+SQT+  +G  FSYC+P    AG  + G  +    GG S  F  +TP+
Sbjct: 277 GTMSLGGGPQSLLSQTARAYGNAFSYCVPGPSAAGFLSIGGPVNGDDGGGSGAFA-TTPL 335

Query: 317 TYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA-KGGILIDSGTVITRLPPSIYSALK 375
             +  + NP   T Y++ L GI + G++L        GG ++DS  VIT+LPP+ Y AL+
Sbjct: 336 VRSANVINP---TIYVVRLQGIEVAGRRLNVPPVVFSGGTVMDSSAVITQLPPTAYRALR 392

Query: 376 AEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSD 435
             F      + +      LDTCF+     +V +P V + F+G A + + +  ++      
Sbjct: 393 LAFRNAMRAYKTRAPTGNLDTCFDFVGVSKVTVPTVSLVFDGGAVIELGLLSVLL----- 447

Query: 436 ASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
               CLA A ++ +   G IGN QQ+   V+YD     +GF    C
Sbjct: 448 --DSCLAFAPMAADFALGFIGNVQQQTHEVLYDVAGGAVGFRHGAC 491


>gi|226492465|ref|NP_001150925.1| aspartic proteinase nepenthesin-1 [Zea mays]
 gi|195642996|gb|ACG40966.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
          Length = 472

 Score =  172 bits (437), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 125/377 (33%), Positives = 191/377 (50%), Gaps = 43/377 (11%)

Query: 134 NYIATIELGGRNM--TVIVDTGSDLTWVQCQPCKS-CYNQQDPVFDPSISPSYKKVLCNS 190
            Y+ T+ +G   +    + DTGSDL W QC PC + C+ Q  P+++P+ S ++  + CNS
Sbjct: 113 EYLMTLAIGTPPLPYAAVADTGSDLIWTQCAPCGTQCFEQPAPLYNPASSTTFSVLPCNS 172

Query: 191 S--TCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG-----KASVN 243
           S   C            C+      C Y+ +YG G +T G  G E    G     +A V 
Sbjct: 173 SLSMCAGALAGAAPPPGCA------CMYYQTYGTG-WTAGVQGSETFTFGSSAADQARVP 225

Query: 244 DFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLIL 303
              FGC   +   + G +GL+GLGR  LSLVSQ   +  G FSYCL   QD  ++ +L+L
Sbjct: 226 GVAFGCSNASSSDWNGSAGLVGLGRGSLSLVSQ---LGAGRFSYCLTPFQDTNSTSTLLL 282

Query: 304 GGNSSVFKNSTPITYTNMIPNPQ---LATFYILNLTGISIGGKQLQASGFA-------KG 353
           G ++++  N T +  T  + +P    ++T+Y LNLTGIS+G K L  S  A        G
Sbjct: 283 GPSAAL--NGTGVRSTPFVASPARAPMSTYYYLNLTGISLGAKALPISPGAFSLKPDGTG 340

Query: 354 GILIDSGTVITRLPPSIYSALKAEFLKQF-SGFPSAPG--FSILDTCFNLSAYQEVN--- 407
           G++IDSGT IT L  + Y  ++A    Q  +  P+  G   + LD CF L A        
Sbjct: 341 GLIIDSGTTITSLANAAYQQVRAAVKSQLVTTLPTVDGSDSTGLDLCFALPAPTSAPPAV 400

Query: 408 IPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIY 467
           +P + + F+G A+M +       ++ S +   CLA+ + + +      GNYQQ+N  ++Y
Sbjct: 401 LPSMTLHFDG-ADMVLPADS---YMISGSGVWCLAMRNQT-DGAMSTFGNYQQQNMHILY 455

Query: 468 DTKNSQLGFAGEDCSSM 484
           D +   L FA   CS++
Sbjct: 456 DVREETLSFAPAKCSTL 472


>gi|212722554|ref|NP_001131154.1| uncharacterized protein LOC100192462 precursor [Zea mays]
 gi|194690728|gb|ACF79448.1| unknown [Zea mays]
          Length = 431

 Score =  172 bits (437), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 131/402 (32%), Positives = 199/402 (49%), Gaps = 34/402 (8%)

Query: 95  DNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTL-NYIATIELGG--RNMTVIVD 151
           D+  + +L S+  +  SG I     T  P+ SG   QT  +Y+    LG   + + + +D
Sbjct: 48  DDARLLFLSSKAAS--SGGI-----TSAPVASG---QTPPSYVVRAGLGTPVQQLLLALD 97

Query: 152 TGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSP 211
           T +D TW  C PC +C       F P+ S SY  + C S  C   E     +   +S+  
Sbjct: 98  TSADATWSHCAPCDTCPAGSR--FIPASSSSYASLPCASDWCPLFEGQPCPANQDASAPL 155

Query: 212 PDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVS--GLMGLGRS 269
           P C +   + D S+ +  LG + L LGK ++  + FGC     G    +   GL+GLGR 
Sbjct: 156 PACAFSKPFADTSF-QASLGSDTLRLGKDAIAGYAFGCVGAVAGPTTNLPKQGLLGLGRG 214

Query: 270 DLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLAT 329
            +SL+SQT   + G+FSYCLPS +    SGSL LG      +N   + YT ++ NP   +
Sbjct: 215 PMSLLSQTGSRYNGVFSYCLPSYRSYYFSGSLRLGAAGQP-RN---VRYTPLLTNPHRPS 270

Query: 330 FYILNLTGISIGGK--QLQASGFA-----KGGILIDSGTVITRLPPSIYSALKAEFLKQF 382
            Y +N+TG+S+G    ++ A  FA       G +IDSGTVITR    +Y+AL+ EF +Q 
Sbjct: 271 LYYVNVTGLSVGRTWVKVPAGSFAFDPATGAGTVIDSGTVITRWTAPVYAALREEFRRQV 330

Query: 383 SGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV-CL 441
           +           DTCFN         P V +  +G  ++T+ +   +  + S A+ + CL
Sbjct: 331 AAPSGYTSLGAFDTCFNTDEVAAGGAPPVTLHMDGGVDLTLPMENTL--IHSSATPLACL 388

Query: 442 ALASLS--YEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
           A+A           ++ N QQ+N RV+ D   S++GFA E C
Sbjct: 389 AMAEAPQNVNAVVNVVANLQQQNVRVVVDVAGSRVGFAREPC 430


>gi|242086412|ref|XP_002443631.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
 gi|241944324|gb|EES17469.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
          Length = 507

 Score =  172 bits (436), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 124/379 (32%), Positives = 182/379 (48%), Gaps = 42/379 (11%)

Query: 134 NYIATIELGGRNMTVIV--DTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSS 191
           +YIA I +G   +  ++  DT SDLTW+QCQPC+ CY Q  PVFDP  S SY ++  ++ 
Sbjct: 140 DYIAKIAVGTPAVEALLALDTASDLTWLQCQPCRRCYPQSGPVFDPRHSTSYGEMNYDAP 199

Query: 192 TCHALEFATGNSGVCSSSSPPDCNYFVSYGDG------SYTRGELGREHLGLGKASVNDF 245
            C AL  + G      +     C Y V YGDG      S + G+L  E L         +
Sbjct: 200 DCQALGRSGGGDAKRGT-----CIYTVLYGDGDGHGSTSTSVGDLVEETLTFAGGVRQAY 254

Query: 246 I-FGCGRNNKGLFGG-VSGLMGLGRSDLSLVSQTSEI-FGGLFSYCLPSTQDAGASGSLI 302
           +  GCG +NKGLFG   +G++GL R  +S+  Q + + +   FSYCL        S S  
Sbjct: 255 LSIGCGHDNKGLFGAPAAGILGLSRGQISIPHQIAFLGYNASFSYCLVDFISGPGSPSST 314

Query: 303 LGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGG--------KQLQASGF-AKG 353
           L   +     S P ++T  + N  + TFY + L G+S+GG        + LQ   +   G
Sbjct: 315 LTFGAGAVDTSPPASFTPTVLNQNMPTFYYVRLIGVSVGGVRVPGVTERDLQLDPYTGHG 374

Query: 354 GILIDSGTVITRLPPSIYSALKAEFLKQFSGF-------PSAPGFSILDTCFNLSAYQE- 405
           G+++DSGT +TRL    Y+A +  F    +G        PS     + DTC+ +      
Sbjct: 375 GVILDSGTTVTRLARPAYTAFRDAFRAAATGLGQVSTGGPSG----LFDTCYTVGGRAGL 430

Query: 406 ---VNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKN 462
              V +P V M F G  E+++     +  V S  + VC A A    +    +IGN  Q+ 
Sbjct: 431 RHCVKVPAVSMHFAGGVELSLQPKNYLITVDSRGT-VCFAFAGTG-DRSVSVIGNILQQG 488

Query: 463 QRVIYDTKNSQLGFAGEDC 481
            RV+YD    ++GFA   C
Sbjct: 489 FRVVYDIGGQRVGFAPNSC 507


>gi|222624819|gb|EEE58951.1| hypothetical protein OsJ_10630 [Oryza sativa Japonica Group]
          Length = 440

 Score =  172 bits (436), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 125/350 (35%), Positives = 177/350 (50%), Gaps = 28/350 (8%)

Query: 148 VIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCS 207
           + +DTGS L W QCQPC  C+NQ  P +D S S ++    C+S+ C      T    +C 
Sbjct: 106 LTLDTGSVLVWTQCQPCAVCFNQSLPYYDASRSSTFALPSCDSTQCKLDPSVT----MCV 161

Query: 208 SSSPPDCNYFVSYGDGSYTRGELGREHLG-LGKASVNDFIFGCGRNNKGLF-GGVSGLMG 265
           + +   C Y  SYGD S T G L  E +  +  ASV   +FGCG NN G+F    +G+ G
Sbjct: 162 NQTVQTCAYSYSYGDKSATIGFLDVETVSFVAGASVPGVVFGCGLNNTGIFRSNETGIAG 221

Query: 266 LGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNST-PITYTNMIPN 324
            GR  LSL SQ      G FS+C  +      S +++    + ++KN    +  T +I N
Sbjct: 222 FGRGPLSLPSQLKV---GNFSHCFTAVSGRKPS-TVLFDLPADLYKNGRGTVQTTPLIKN 277

Query: 325 PQLATFYILNLTGISIGGKQLQA--SGFA----KGGILIDSGTVITRLPPSIYSALKAEF 378
           P   TFY L+L GI++G  +L    S FA     GG +IDSGT  T LPP +Y  +  EF
Sbjct: 278 PAHPTFYYLSLKGITVGSTRLPVPESAFALKNGTGGTIIDSGTAFTSLPPRVYRLVHDEF 337

Query: 379 LK--QFSGFPSAPGFSILDTCFNLSAY-QEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSD 435
               +    PS     +L  CF+     +  ++P + + FEG A M +     V+  K  
Sbjct: 338 AAHVKLPVVPSNETGPLL--CFSAPPLGKAPHVPKLVLHFEG-ATMHLPRENYVFEAKDG 394

Query: 436 AS-QVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
            +  +CLA+     E E  IIGN+QQ+N  V+YD KNS+L F    C  +
Sbjct: 395 GNCSICLAI----IEGEMTIIGNFQQQNMHVLYDLKNSKLSFVRAKCDKL 440


>gi|224067042|ref|XP_002302336.1| predicted protein [Populus trichocarpa]
 gi|222844062|gb|EEE81609.1| predicted protein [Populus trichocarpa]
          Length = 438

 Score =  172 bits (436), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 133/436 (30%), Positives = 204/436 (46%), Gaps = 39/436 (8%)

Query: 58  QKSRIEMGAITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDV 117
           Q    ++  I +  K   +   K   W          D   ++YL +         + D 
Sbjct: 29  QSDTSDLSVIPIYSKCSPFVPPKQESWVNTVITMASKDPERLKYLST---------LADQ 79

Query: 118 SNTEIPLTSGIR-LQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPV 174
             T +P+  G + L+  NY+  ++LG  G+ M +++DT +D  WV C  C  C +     
Sbjct: 80  KTTAVPIAPGQQVLKIANYVVRVKLGTPGQQMFMVLDTSNDAAWVPCSGCTGCSST---T 136

Query: 175 FDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREH 234
           F P+ S +   + C+ + C  +    G S  C ++    C +  SYG  S     L ++ 
Sbjct: 137 FLPNASTTLGSLDCSGAQCSQVR---GFS--CPATGSSACLFNQSYGGDSSLTATLVQDA 191

Query: 235 LGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQD 294
           + L    +  F FGC     G      GL+GLGR  +SL+SQ   ++ G+FSYCLPS + 
Sbjct: 192 ITLANDVIPGFTFGCINAVSGGSIPPQGLLGLGRGPISLISQAGAMYSGVFSYCLPSFKS 251

Query: 295 AGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIG-------GKQLQA 347
              SGSL LG           I  T ++ NP   + Y +NLTG+S+G        +QL  
Sbjct: 252 YYFSGSLKLGP----VGQPKSIRTTPLLRNPHRPSLYYVNLTGVSVGRIKVPIPSEQLVF 307

Query: 348 SGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVN 407
                 G +IDSGTVITR    +Y A++ EF KQ +G  S+ G    DTCF  +A  E  
Sbjct: 308 DPNTGAGTIIDSGTVITRFVQPVYFAIRDEFRKQVNGPISSLG--AFDTCF--AATNEAE 363

Query: 408 IPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASL--SYEDETGIIGNYQQKNQRV 465
            P + + FEG   +      +++   S  S  CL++A+   +      +I N QQ+N R+
Sbjct: 364 APAITLHFEGLNLVLPMENSLIH--SSSGSLACLSMAAAPNNVNSVLNVIANLQQQNLRI 421

Query: 466 IYDTKNSQLGFAGEDC 481
           ++DT NS+LG A E C
Sbjct: 422 MFDTTNSRLGIARELC 437


>gi|296082172|emb|CBI21177.3| unnamed protein product [Vitis vinifera]
          Length = 376

 Score =  172 bits (436), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 111/283 (39%), Positives = 160/283 (56%), Gaps = 19/283 (6%)

Query: 7   PLTILSLLLPLMVSLFLLAKGAHCFEGKKKL-----HLHKLQWQQKSGSSSSCVSHQKSR 61
           P++ + LL  L+ S  L +K    F+G+K        LH +       SS   V     +
Sbjct: 4   PISTIFLLKFLLYSALLSSKRGLAFQGRKTALSTPSTLHNVHITSLMPSS---VCSPSPK 60

Query: 62  IEMGAITLELKHKNYCSGKIVDWNEQQQNR---LILDNLHVQYLQSRI-KNMISGNIKDV 117
            +    +LE+ HK+    K+     +  +R   L  D   V  ++SR+ KN   G     
Sbjct: 61  GDDKRASLEVIHKHGPCSKLSQDKGRSPSRTQMLDQDESRVNSIRSRLAKNPADGGKLKG 120

Query: 118 SNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPC-KSCYNQQDPV 174
           S   +P  SG  + T NY+ T+ LG   R++T I DTGSDLTW QC+PC + CY+QQ+P+
Sbjct: 121 SKVTLPSKSGSTIGTGNYVVTVGLGTPKRDLTFIFDTGSDLTWTQCEPCARYCYHQQEPI 180

Query: 175 FDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREH 234
           F+PS S SY  + C+S TC  L+  TGNS  CS+S+   C Y + YGD SY+ G   ++ 
Sbjct: 181 FNPSKSTSYTNISCSSPTCDELKSGTGNSPSCSAST---CVYGIQYGDQSYSVGFFAQDK 237

Query: 235 LGLGKASV-NDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQ 276
           L L    V N+F+FGCG+NN+GLF GV+GL+GLGR+ LSL+S+
Sbjct: 238 LALTSTDVFNNFLFGCGQNNRGLFVGVAGLIGLGRNALSLMSK 280



 Score = 84.0 bits (206), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 43/103 (41%), Positives = 59/103 (57%), Gaps = 2/103 (1%)

Query: 379 LKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQ 438
           L   S +P A   SILDTC++ S Y  V++P + + F   AEM +D +GI Y +  + SQ
Sbjct: 275 LSLMSKYPKAAPASILDTCYDFSQYDTVDVPKINLYFSDGAEMDLDPSGIFYIL--NISQ 332

Query: 439 VCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
           VCLA A  S   +  I+GN QQK   V+YD    ++GFA   C
Sbjct: 333 VCLAFAGNSDATDIAILGNVQQKTFDVVYDVAGGRIGFAPGGC 375


>gi|357487631|ref|XP_003614103.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355515438|gb|AES97061.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 431

 Score =  172 bits (436), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 131/430 (30%), Positives = 209/430 (48%), Gaps = 42/430 (9%)

Query: 66  AITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKD-VSNTEIPL 124
           + + EL H++     +    +  QN+      HV     R  N  +   KD +SNT    
Sbjct: 27  SFSFELIHRDSSKSPLY---KPAQNKF----QHVVNAARRSINRANRLFKDSLSNTP--- 76

Query: 125 TSGIRLQTLNYIATIELGGRNMTV--IVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPS 182
            S + +    Y+ T  +G     V  +VDTGSD+ W+QC+PC+ CY Q  P+F+PS S S
Sbjct: 77  ESTVYVNGGEYLMTYSVGTPPFNVYGVVDTGSDIVWLQCKPCEQCYKQTTPIFNPSKSSS 136

Query: 183 YKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG---- 238
           YK + C+S+ C ++ + + N           C Y +++ D SY++GEL  E L L     
Sbjct: 137 YKNIPCSSNLCQSVRYTSCNKQ-------NSCEYTINFSDQSYSQGELSVETLTLDSTTG 189

Query: 239 -KASVNDFIFGCGRNNKGLF-GGVSGLMGLGRSDLSLVSQTSEIFGGLFSYC-LPSTQDA 295
              S    + GCG NN+G+F G  SG++GLG   +SL +Q     GG FSYC LP   D+
Sbjct: 190 HSVSFPKTVIGCGHNNRGMFQGETSGIVGLGIGPVSLTTQLKSSIGGKFSYCLLPLLVDS 249

Query: 296 GASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGF---AK 352
             +  L   G+++V      ++   +  +PQ   FY L L   S+G K+++        +
Sbjct: 250 NKTSKLNF-GDAAVVSGDGVVSTPFVKKDPQ--AFYYLTLEAFSVGNKRIEFEVLDDSEE 306

Query: 353 GGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVK 412
           G I++DSGT +T LP  +Y+ L++   +            +L+ C+++++ Q  + P++ 
Sbjct: 307 GNIILDSGTTLTLLPSHVYTNLESAVAQLVKLDRVDDPNQLLNLCYSITSDQ-YDFPIIT 365

Query: 413 MEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETG-IIGNYQQKNQRVIYDTKN 471
             F+G     + +  I  F       VCLA  S     +TG I GN  Q N  V YD + 
Sbjct: 366 AHFKG---ADIKLNPISTFAHVADGVVCLAFTS----SQTGPIFGNLAQLNLLVGYDLQQ 418

Query: 472 SQLGFAGEDC 481
           + + F   DC
Sbjct: 419 NIVSFKPSDC 428


>gi|125558631|gb|EAZ04167.1| hypothetical protein OsI_26309 [Oryza sativa Indica Group]
          Length = 441

 Score =  172 bits (436), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 140/446 (31%), Positives = 207/446 (46%), Gaps = 57/446 (12%)

Query: 67  ITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTE----- 121
           + LEL H +   G +      ++ R   D  H      R  N   G I+  S+T      
Sbjct: 25  VRLELTHADDRGGYV----GAERVRRAADRSH------RRVNGFLGAIEGPSSTARLGID 74

Query: 122 ----IPLTSGIRLQTLNYIATIELGGRNM--TVIVDTGSDLTWVQCQ-PCKSCYNQQDPV 174
                   + +   T  Y+  I +G   +  T ++DTGSDL W QC  PC+ C+ Q  P+
Sbjct: 75  GAGAGGAEASVHASTATYLVDIAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQPAPL 134

Query: 175 FDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPD--CNYFVSYGDGSYTRGELGR 232
           + P+ S +Y  V C S  C AL+         S  SPPD  C Y+ SYGDG+ T G L  
Sbjct: 135 YAPARSATYANVSCRSPMCQALQ------SPWSRCSPPDTGCAYYFSYGDGTSTDGVLAT 188

Query: 233 EHLGLGK-ASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPS 291
           E   LG   +V    FGCG  N G     SGL+G+GR  LSLVSQ        FSYC  +
Sbjct: 189 ETFTLGSDTAVRGVAFGCGTENLGSTDNSSGLVGMGRGPLSLVSQLGVT---RFSYCF-T 244

Query: 292 TQDAGASGSLILGGNSSVFKNSTPITYTNMIPNP-----QLATFYILNLTGISIGGKQL- 345
             +A A+  L LG ++   + S+    T  +P+P     + +++Y L+L GI++G   L 
Sbjct: 245 PFNATAASPLFLGSSA---RLSSAAKTTPFVPSPSGGARRRSSYYYLSLEGITVGDTLLP 301

Query: 346 ------QASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI-LDTCF 398
                 + +    GG++IDSGT  T L  S + AL A  L      P A G  + L  CF
Sbjct: 302 IDPAVFRLTPMGDGGVIIDSGTTFTALEESAFVAL-ARALASRVRLPLASGAHLGLSLCF 360

Query: 399 NLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNY 458
             ++ + V +P + + F+G A+M +     V   +S A   CL + S        ++G+ 
Sbjct: 361 AAASPEAVEVPRLVLHFDG-ADMELRRESYVVEDRS-AGVACLGMVS---ARGMSVLGSM 415

Query: 459 QQKNQRVIYDTKNSQLGFAGEDCSSM 484
           QQ+N  ++YD +   L F    C  +
Sbjct: 416 QQQNTHILYDLERGILSFEPAKCGEL 441


>gi|224074591|ref|XP_002304395.1| predicted protein [Populus trichocarpa]
 gi|222841827|gb|EEE79374.1| predicted protein [Populus trichocarpa]
          Length = 443

 Score =  172 bits (436), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 114/350 (32%), Positives = 174/350 (49%), Gaps = 28/350 (8%)

Query: 146 MTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGV 205
           + VI DTGSDLTWVQC PC  CY Q+ P+FDPS S SY+ +LC S  C+AL+ +      
Sbjct: 107 VIVIADTGSDLTWVQCLPCDPCYRQKSPLFDPSRSSSYRHMLCGSRFCNALDVS---EQA 163

Query: 206 CSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS-----VNDFIFGCGRNNKGLFGGV 260
           C+  +   C Y  SYGD SYT G L  E   +G  S     ++  +FGCG  N G F  +
Sbjct: 164 CTMDT-NICEYHYSYGDKSYTNGNLATEKFTIGSTSSRPVHLSPIVFGCGTGNGGTFDEL 222

Query: 261 SGLMGLGRSD-LSLVSQTSEIFGGLFSYCL-PSTQDAGASGSLILGGNSSVFKNSTPITY 318
              +       LSLVSQ S I  G FSYCL P ++ +  +  +  G +S +   S P   
Sbjct: 223 GSGIVGLGGGALSLVSQLSSIIKGKFSYCLVPLSEQSNVTSKIKFGTDSVI---SGPQVV 279

Query: 319 TNMIPNPQLATFYILNLTGISIGGKQLQAS------GFAKGGILIDSGTVITRLPPSIYS 372
           +  + + Q  T+Y + L  IS+G K+L  +         KG ++IDSGT +T L    ++
Sbjct: 280 STPLVSKQPDTYYYVTLEAISVGNKRLPYTNGLLNGNVEKGNVIIDSGTTLTFLDSEFFT 339

Query: 373 ALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFV 432
            L+    +       +    +   CF  +   ++++P++ + F    +  V +  +  FV
Sbjct: 340 ELERVLEETVKAERVSDPRGLFSVCFRSAG--DIDLPVIAVHFN---DADVKLQPLNTFV 394

Query: 433 KSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
           K+D   +C  + S    ++ GI GN  Q +  V YD +   + F   DC+
Sbjct: 395 KADEDLLCFTMIS---SNQIGIFGNLAQMDFLVGYDLEKRTVSFKPTDCT 441


>gi|194698750|gb|ACF83459.1| unknown [Zea mays]
 gi|194703964|gb|ACF86066.1| unknown [Zea mays]
 gi|219886221|gb|ACL53485.1| unknown [Zea mays]
 gi|219886359|gb|ACL53554.1| unknown [Zea mays]
 gi|223950085|gb|ACN29126.1| unknown [Zea mays]
 gi|414865218|tpg|DAA43775.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
          Length = 431

 Score =  172 bits (436), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 130/402 (32%), Positives = 199/402 (49%), Gaps = 34/402 (8%)

Query: 95  DNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTL-NYIATIELGG--RNMTVIVD 151
           D+  + +L S+  +  SG +     T  P+ SG   QT  +Y+    LG   + + + +D
Sbjct: 48  DDARLLFLSSKAAS--SGGV-----TSAPVASG---QTPPSYVVRAGLGTPVQQLLLALD 97

Query: 152 TGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSP 211
           T +D TW  C PC +C       F P+ S SY  + C S  C   E     +   +S+  
Sbjct: 98  TSADATWSHCAPCDTCPAGSR--FIPASSSSYASLPCASDWCPLFEGQPCPANQDASAPL 155

Query: 212 PDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVS--GLMGLGRS 269
           P C +   + D S+ +  LG + L LGK ++  + FGC     G    +   GL+GLGR 
Sbjct: 156 PACAFSKPFADTSF-QASLGSDTLRLGKDAIAGYAFGCVGAVAGPTTNLPKQGLLGLGRG 214

Query: 270 DLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLAT 329
            +SL+SQT   + G+FSYCLPS +    SGSL LG      +N   + YT ++ NP   +
Sbjct: 215 PMSLLSQTGSRYNGVFSYCLPSYRSYYFSGSLRLGAAGQP-RN---VRYTPLLTNPHRPS 270

Query: 330 FYILNLTGISIGGK--QLQASGFA-----KGGILIDSGTVITRLPPSIYSALKAEFLKQF 382
            Y +N+TG+S+G    ++ A  FA       G +IDSGTVITR    +Y+AL+ EF +Q 
Sbjct: 271 LYYVNVTGLSVGRTWVKVPAGSFAFDPATGAGTVIDSGTVITRWTAPVYAALREEFRRQV 330

Query: 383 SGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV-CL 441
           +           DTCFN         P V +  +G  ++T+ +   +  + S A+ + CL
Sbjct: 331 AAPSGYTSLGAFDTCFNTDEVAAGGAPPVTLHMDGGVDLTLPMENTL--IHSSATPLACL 388

Query: 442 ALASLS--YEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
           A+A           ++ N QQ+N RV+ D   S++GFA E C
Sbjct: 389 AMAEAPQNVNAVVNVVANLQQQNVRVVVDVAGSRVGFAREPC 430


>gi|414877221|tpg|DAA54352.1| TPA: hypothetical protein ZEAMMB73_214154 [Zea mays]
          Length = 447

 Score =  172 bits (435), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 121/354 (34%), Positives = 182/354 (51%), Gaps = 31/354 (8%)

Query: 148 VIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCS 207
            + DTGSDLTW QCQPCK C+ Q  P++D ++S S+  V C S+TC  +     +S  C+
Sbjct: 108 ALADTGSDLTWTQCQPCKLCFPQDTPIYDTAVSSSFSPVPCASATCLPIW----SSRNCT 163

Query: 208 SSSPPDCNYFVSYGDGSYTRGELGREHL---GLGKASVNDFIFGCGRNNKGLFGGVSGLM 264
           +SS P C Y  +YGDG+Y+ G LG E L   G    SV    FGCG +N GL    +G +
Sbjct: 164 ASSSP-CRYRYAYGDGAYSAGVLGTETLTFPGAPGVSVGGIAFGCGVDNGGLSYNSTGTV 222

Query: 265 GLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNST--PITYTNMI 322
           GLGR  LSLV+Q      G FSYCL    +      ++ G  + +   ST   +  T ++
Sbjct: 223 GLGRGSLSLVAQLGV---GKFSYCLTDFFNTSLGSPVLFGALAELAAPSTGAAVQSTPLV 279

Query: 323 PNPQLATFYILNLTGISIGGKQLQASGF-------AKGGILIDSGTVITRLPPSIYSAL- 374
            +P + T+Y ++L GIS+G  +L              GG+++DSGT  T L  S +  + 
Sbjct: 280 QSPYVPTWYYVSLEGISLGDARLPIPNGTFDLRDDGSGGMIVDSGTTFTFLVESAFRVVV 339

Query: 375 --KAEFLKQFSGFPSAPGFSILDTCFNLSA--YQEVNIPLVKMEFEGNAEMTVDVTGIVY 430
              A  L+Q    P     S+   CF  +    Q   +P + + F G A+M +     + 
Sbjct: 340 DHVAGVLRQ----PVVNASSLDSPCFPAATGEQQLPAMPDMVLHFAGGADMRLHRDNYMS 395

Query: 431 FVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
           F + ++S  CL +A  S   +  I+GN+QQ+N ++++D    QL F   DC  +
Sbjct: 396 FNQEESS-FCLNIAG-SPSADVSILGNFQQQNIQMLFDITVGQLSFMPTDCGKL 447


>gi|168054484|ref|XP_001779661.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162668975|gb|EDQ55572.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 419

 Score =  172 bits (435), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 124/417 (29%), Positives = 189/417 (45%), Gaps = 19/417 (4%)

Query: 80  KIVDWNEQQQNRLILDN---LHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYI 136
           ++  + +Q+    + DN    H       I  +I G      + + P+ SG  L +  Y 
Sbjct: 7   RLASFRKQRGRHKLSDNDNGAHNSANPPVITAVIEGPPSHDHDFQSPVVSGSTLGSGQYF 66

Query: 137 ATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCH 194
               LG   +  ++IVD+GSDL WVQC PC  CY Q  P++ PS S ++  V C S  C 
Sbjct: 67  VDFFLGTPPQKFSLIVDSGSDLLWVQCAPCLQCYAQDTPLYAPSNSSTFNPVPCLSPECL 126

Query: 195 ALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNK 254
            +    G    C    P  C Y   Y D S ++G    E   +    ++   FGCGR+N+
Sbjct: 127 LIPATEGFP--CDFHYPGACAYEYRYADTSLSKGVFAYESATVDDVRIDKVAFGCGRDNQ 184

Query: 255 GLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNST 314
           G F    G++GLG+  LS  SQ    +G  F+YCL +  D  +  S ++ G+  +     
Sbjct: 185 GSFAAAGGVLGLGQGPLSFGSQVGYAYGNKFAYCLVNYLDPTSVSSWLIFGD-ELISTIH 243

Query: 315 PITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA-------KGGILIDSGTVITRLP 367
            + +T ++ N +  T Y + +  + +GG+ L  S  A        GG + DSGT +T   
Sbjct: 244 DLQFTPIVSNSRNPTLYYVQIEKVMVGGESLPISHSAWSLDFLGNGGSIFDSGTTVTYWL 303

Query: 368 PSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTG 427
           P  Y  + A F K    +P A     LD C +++   + + P   +   G A        
Sbjct: 304 PPAYRNILAAFDKNVR-YPRAASVQGLDLCVDVTGVDQPSFPSFTIVLGGGAVFQPQQGN 362

Query: 428 IVYFVKSDASQVCLALASL-SYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSS 483
             YFV    +  CLA+A L S       IGN  Q+N  V YD + +++GFA   CSS
Sbjct: 363 --YFVDVAPNVQCLAMAGLPSSVGGFNTIGNLLQQNFLVQYDREENRIGFAPAKCSS 417


>gi|362799904|dbj|BAL41445.1| aspartyl protease 1 [Linum grandiflorum]
          Length = 449

 Score =  172 bits (435), Expect = 5e-40,   Method: Compositional matrix adjust.
 Identities = 128/367 (34%), Positives = 187/367 (50%), Gaps = 46/367 (12%)

Query: 148 VIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCS 207
            I DTGSDLTW+Q +PC  CY Q+ P+FDPS S ++ K+ C ++ C+AL+         S
Sbjct: 95  AIADTGSDLTWLQSKPCDQCYPQKGPIFDPSNSTTFHKLPCTTAPCNALD-----ESARS 149

Query: 208 SSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASV--NDFIFGCGRNNKGLFGG-VSGLM 264
            + P  C Y  SYGD SYT G L  + + +G ASV   +  FGCG  N G F    SG++
Sbjct: 150 CTDPTTCGYTYSYGDHSYTTGYLASDTVTVGNASVQIRNVAFGCGTRNGGNFDEQGSGIV 209

Query: 265 GLGRSDLSLVSQTSEIFGGLFSYCL----------PSTQDAGASGSLILGGNSSVFKNST 314
           GLG  +LS VSQ  +  G  FSYCL          PS  D+ A+  ++ G N  VF +S+
Sbjct: 210 GLGGGNLSFVSQLGDTIGKKFSYCLLPLENEISSQPS--DSPATSRIVFGDN-PVFSSSS 266

Query: 315 P---ITYTNMIPNPQLATFYILNLTGISIGGKQL---------------QASGFAKGGIL 356
               +  T  + N + +T+Y L +  I++G K+L                 S   +G I+
Sbjct: 267 TNGVVFATTPLVNKEPSTYYYLTIEAITVGRKKLLYSSSSSKTASYDSGSKSSVEEGNII 326

Query: 357 IDSGTVITRLPPSIYSALKAEFLKQFS-GFPSAPGFSILDTCFNLSAYQEVNIPLVKMEF 415
           IDSGT +T L    Y AL+A  +++      +    S+   CF  S  +EV +PL+K+ F
Sbjct: 327 IDSGTTLTFLEEEFYGALEAALVEEIKMERVNDVKNSMFSLCFK-SGKEEVELPLMKVHF 385

Query: 416 EGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLG 475
            G A+  V++  +  FV+++   VC  +      ++ GI GN  Q N  V YD     + 
Sbjct: 386 RGGAD--VELKPVNTFVRAEEGLVCFTMLP---TNDVGIYGNLAQMNFVVGYDLGKRTVS 440

Query: 476 FAGEDCS 482
           F   DCS
Sbjct: 441 FLPADCS 447


>gi|61214232|sp|Q766C2.1|NEP2_NEPGR RecName: Full=Aspartic proteinase nepenthesin-2; AltName:
           Full=Nepenthesin-II; Flags: Precursor
 gi|41016423|dbj|BAD07475.1| aspartic proteinase nepenthesin II [Nepenthes gracilis]
          Length = 438

 Score =  171 bits (434), Expect = 5e-40,   Method: Compositional matrix adjust.
 Identities = 127/390 (32%), Positives = 190/390 (48%), Gaps = 37/390 (9%)

Query: 105 RIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQ 162
           R    I+  ++  S  E P+ +G       Y+  + +G    + + I+DTGSDL W QC+
Sbjct: 70  RRMRSINAMLQSSSGIETPVYAGDG----EYLMNVAIGTPDSSFSAIMDTGSDLIWTQCE 125

Query: 163 PCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGD 222
           PC  C++Q  P+F+P  S S+  + C S  C  L   T N+         +C Y   YGD
Sbjct: 126 PCTQCFSQPTPIFNPQDSSSFSTLPCESQYCQDLPSETCNNN--------ECQYTYGYGD 177

Query: 223 GSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLF-GGVSGLMGLGRSDLSLVSQTSEIF 281
           GS T+G +  E      +SV +  FGCG +N+G   G  +GL+G+G   LSL SQ   + 
Sbjct: 178 GSTTQGYMATETFTFETSSVPNIAFGCGEDNQGFGQGNGAGLIGMGWGPLSLPSQ---LG 234

Query: 282 GGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIG 341
            G FSYC+ S   +  S +L LG  +S     +P   T +I +    T+Y + L GI++G
Sbjct: 235 VGQFSYCMTSYGSSSPS-TLALGSAASGVPEGSP--STTLIHSSLNPTYYYITLQGITVG 291

Query: 342 GK---------QLQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFS 392
           G          QLQ  G   GG++IDSGT +T LP   Y+A+   F  Q +        S
Sbjct: 292 GDNLGIPSSTFQLQDDG--TGGMIIDSGTTLTYLPQDAYNAVAQAFTDQINLPTVDESSS 349

Query: 393 ILDTCFNL-SAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDE 451
            L TCF   S    V +P + M+F+G     +++      +      +CLA+ S S +  
Sbjct: 350 GLSTCFQQPSDGSTVQVPEISMQFDGGV---LNLGEQNILISPAEGVICLAMGS-SSQLG 405

Query: 452 TGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
             I GN QQ+  +V+YD +N  + F    C
Sbjct: 406 ISIFGNIQQQETQVLYDLQNLAVSFVPTQC 435


>gi|118486912|gb|ABK95290.1| unknown [Populus trichocarpa]
          Length = 438

 Score =  171 bits (434), Expect = 6e-40,   Method: Compositional matrix adjust.
 Identities = 133/436 (30%), Positives = 203/436 (46%), Gaps = 39/436 (8%)

Query: 58  QKSRIEMGAITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDV 117
           Q    ++  I +  K   +   K   W          D   ++YL +         + D 
Sbjct: 29  QSDTSDLSVIPIYSKCSPFVPPKQESWVNTVITMASKDPERLKYLST---------LADQ 79

Query: 118 SNTEIPLTSGIR-LQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPV 174
             T +P+  G + L+  NY+  ++LG  G+ M +++DT +D  WV   PC  C       
Sbjct: 80  KTTAVPIAPGQQVLKIANYVVRVKLGTPGQQMFMVLDTSNDAAWV---PCSGCTGFSSTT 136

Query: 175 FDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREH 234
           F P+ S +   + C+ + C  +    G S  C ++    C +  SYG  S     L ++ 
Sbjct: 137 FLPNASTTLGSLDCSGAQCSQVR---GFS--CPATGSSACLFNQSYGGDSSLTATLVQDA 191

Query: 235 LGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQD 294
           + L    +  F FGC     G      GL+GLGR  +SL+SQ   ++ G+FSYCLPS + 
Sbjct: 192 ITLANDVIPGFTFGCINAVSGGSIPPQGLLGLGRGPISLISQAGAMYSGVFSYCLPSFKS 251

Query: 295 AGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIG-------GKQLQA 347
              SGSL LG           I  T ++ NP   + Y +NLTG+S+G        +QL  
Sbjct: 252 YYFSGSLKLGP----VGQPKSIRTTPLLRNPHRPSLYYVNLTGVSVGRIKVPIPSEQLVF 307

Query: 348 SGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVN 407
                 G +IDSGTVITR    +Y A++ EF KQ +G  S+ G    DTCF  +A  E  
Sbjct: 308 DPNTGAGTIIDSGTVITRFVQPVYFAIRDEFRKQVNGPISSLG--AFDTCF--AATNEAE 363

Query: 408 IPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASL--SYEDETGIIGNYQQKNQRV 465
            P + + FEG   +      +++   S  S  CL++A+   +      +I N QQ+N R+
Sbjct: 364 APAITLHFEGLNLVLPMENSLIH--SSSGSLACLSMAAAPNNVNSVLNVIANLQQQNLRI 421

Query: 466 IYDTKNSQLGFAGEDC 481
           ++DT NS+LG A E C
Sbjct: 422 MFDTTNSRLGIARELC 437


>gi|108707835|gb|ABF95630.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
          Length = 384

 Score =  171 bits (434), Expect = 6e-40,   Method: Compositional matrix adjust.
 Identities = 128/351 (36%), Positives = 178/351 (50%), Gaps = 30/351 (8%)

Query: 148 VIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCS 207
           + +DTGS L W QCQPC  C+NQ  P +D S S ++    C+S+ C      T    +C 
Sbjct: 50  LTLDTGSVLVWTQCQPCAVCFNQSLPYYDASRSSTFALPSCDSTQCKLDPSVT----MCV 105

Query: 208 SSSPPDCNYFVSYGDGSYTRGELGREHLG-LGKASVNDFIFGCGRNNKGLF-GGVSGLMG 265
           + +   C Y  SYGD S T G L  E +  +  ASV   +FGCG NN G+F    +G+ G
Sbjct: 106 NQTVQTCAYSYSYGDKSATIGFLDVETVSFVAGASVPGVVFGCGLNNTGIFRSNETGIAG 165

Query: 266 LGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGG-NSSVFKNST-PITYTNMIP 323
            GR  LSL SQ      G FS+C   T  +G   S +L    + ++KN    +  T +I 
Sbjct: 166 FGRGPLSLPSQLKV---GNFSHCF--TAVSGRKPSTVLFDLPADLYKNGRGTVQTTPLIK 220

Query: 324 NPQLATFYILNLTGISIGGKQLQA--SGFA----KGGILIDSGTVITRLPPSIYSALKAE 377
           NP   TFY L+L GI++G  +L    S FA     GG +IDSGT  T LPP +Y  +  E
Sbjct: 221 NPAHPTFYYLSLKGITVGSTRLPVPESAFALKNGTGGTIIDSGTAFTSLPPRVYRLVHDE 280

Query: 378 FLK--QFSGFPSAPGFSILDTCFNLSAY-QEVNIPLVKMEFEGNAEMTVDVTGIVYFVKS 434
           F    +    PS     +L  CF+     +  ++P + + FEG A M +     V+  K 
Sbjct: 281 FAAHVKLPVVPSNETGPLL--CFSAPPLGKAPHVPKLVLHFEG-ATMHLPRENYVFEAKD 337

Query: 435 DAS-QVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
             +  +CLA+     E E  IIGN+QQ+N  V+YD KNS+L F    C  +
Sbjct: 338 GGNCSICLAI----IEGEMTIIGNFQQQNMHVLYDLKNSKLSFVRAKCDKL 384


>gi|449461377|ref|XP_004148418.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
 gi|449518059|ref|XP_004166061.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
          Length = 436

 Score =  171 bits (434), Expect = 6e-40,   Method: Compositional matrix adjust.
 Identities = 139/432 (32%), Positives = 207/432 (47%), Gaps = 40/432 (9%)

Query: 63  EMGAITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEI 122
           ++  I +  K   + + K   W     +    D   ++YL S         +        
Sbjct: 31  DLSVIPIYGKCSPFTAPKSESWMNTVIDMASKDPARIRYLSS---------LTAQKTVAA 81

Query: 123 PLTSGIR-LQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSI 179
           P+ SG + L   NY+  ++LG  G+ M +++DT +D  W  C  C  C +     F    
Sbjct: 82  PIASGQQVLNVGNYVVRVQLGTPGQTMYMVLDTSNDAAWAPCSGCIGCSSTT--TFSAQN 139

Query: 180 SPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGK 239
           S ++  + C+   C     A G S  C ++   DC +  +YG  S     L ++ L LG 
Sbjct: 140 SSTFATLDCSKPECTQ---ARGLS--CPTTGNVDCLFNQTYGGDSTFSATLVQDSLHLGP 194

Query: 240 ASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASG 299
             + +F FGC  +  G      GLMGLGR  LSL+SQ+  ++ GLFSYCLPS +    SG
Sbjct: 195 NVIPNFSFGCISSASGSSIPPQGLMGLGRGPLSLISQSGSLYSGLFSYCLPSFKSYYFSG 254

Query: 300 SLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIG------GKQLQASGFAKG 353
           SL LG           I  T ++ NP   + Y +NLTGIS+G        +L A     G
Sbjct: 255 SLKLG----PVGQPKAIRTTPLLHNPHRPSLYYVNLTGISVGRVLVPISPELLAFDPNTG 310

Query: 354 -GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVK 412
            G +IDSGTVITR  P+IY+A++ EF KQ  G  S  G    DTCF  +   EV+ P + 
Sbjct: 311 AGTIIDSGTVITRFVPAIYTAVRDEFRKQVGGSFSPLG--AFDTCF--ATNNEVSAPAIT 366

Query: 413 MEFEG-NAEMTVDVTGIVYFVKSDASQVCLALAS--LSYEDETGIIGNYQQKNQRVIYDT 469
           +   G + ++ ++ + I     S  S  CLA+A+   +      +I N QQ+N R+++D 
Sbjct: 367 LHLSGLDLKLPMENSLI---HSSAGSLACLAMAAAPNNVNSVVNVIANLQQQNHRILFDI 423

Query: 470 KNSQLGFAGEDC 481
            NS+LG A E C
Sbjct: 424 NNSKLGIARELC 435


>gi|414590468|tpg|DAA41039.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
          Length = 469

 Score =  171 bits (434), Expect = 6e-40,   Method: Compositional matrix adjust.
 Identities = 124/376 (32%), Positives = 189/376 (50%), Gaps = 42/376 (11%)

Query: 134 NYIATIELGGRNM--TVIVDTGSDLTWVQCQPCKS-CYNQQDPVFDPSISPSYKKVLCNS 190
            Y+ T+ +G   +    + DTGSDL W QC PC + C+ Q  P+++P+ S ++  + CNS
Sbjct: 111 EYLMTLAIGTPPLPYAAVADTGSDLIWTQCAPCGTQCFEQPAPLYNPASSTTFSVLPCNS 170

Query: 191 S--TCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG-----KASVN 243
           S   C            C+      C Y  +YG G +T G  G E    G     +A V 
Sbjct: 171 SLSMCAGALAGAAPPPGCA------CMYNQTYGTG-WTAGVQGSETFTFGSSAADQARVP 223

Query: 244 DFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLIL 303
              FGC   +   + G +GL+GLGR  LSLVSQ   +  G FSYCL   QD  ++ +L+L
Sbjct: 224 GVAFGCSNASSSDWNGSAGLVGLGRGSLSLVSQ---LGAGRFSYCLTPFQDTNSTSTLLL 280

Query: 304 GGNSSVFKNSTPITYTNMIPNPQ---LATFYILNLTGISIGGKQLQASGFA-------KG 353
           G ++++  N T +  T  + +P    ++T+Y LNLTGIS+G K L  S  A        G
Sbjct: 281 GPSAAL--NGTGVRSTPFVASPARAPMSTYYYLNLTGISLGAKALPISPGAFSLKPDGTG 338

Query: 354 GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPG--FSILDTCFNLSAYQEVN---I 408
           G++IDSGT IT L  + Y  ++A      +  P+  G   + LD CF L A        +
Sbjct: 339 GLIIDSGTTITSLANAAYQQVRAAVKSLVTTLPTVDGSDSTGLDLCFALPAPTSAPPAVL 398

Query: 409 PLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYD 468
           P + + F+G A+M +       ++ S +   CLA+ + + +      GNYQQ+N  ++YD
Sbjct: 399 PSMTLHFDG-ADMVLPADS---YMISGSGVWCLAMRNQT-DGAMSTFGNYQQQNMHILYD 453

Query: 469 TKNSQLGFAGEDCSSM 484
            +   L FA   CS++
Sbjct: 454 VREETLSFAPAKCSTL 469


>gi|242050430|ref|XP_002462959.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
 gi|241926336|gb|EER99480.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
          Length = 448

 Score =  171 bits (433), Expect = 9e-40,   Method: Compositional matrix adjust.
 Identities = 125/377 (33%), Positives = 194/377 (51%), Gaps = 45/377 (11%)

Query: 134 NYIATIELGGRNMT--VIVDTGSDLTWVQCQPCK--SCYNQQDPVFDPSISPSYKKVLCN 189
            Y+ T+ +G   ++   I DTGSDL W QC PC    C+ Q  P+++P+ S ++  + CN
Sbjct: 91  EYLMTLSIGTPPLSYPAIADTGSDLIWTQCAPCSGDQCFAQPAPLYNPASSTTFGVLPCN 150

Query: 190 SSTCHALEFATGNSGVCSSSSPPD---CNYFVSYGDGSYTRGELGREHLGLGKASVND-- 244
           SS           +GV +  +PP    C Y  +YG G +T G  G E    G A+ +   
Sbjct: 151 SSLSMC-------AGVLAGKAPPPGCACMYNQTYGTG-WTAGVQGSETFTFGSAAADQAR 202

Query: 245 ---FIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSL 301
                FGC   +   + G +GL+GLGR  LSLVSQ   +  G FSYCL   QD  ++ +L
Sbjct: 203 VPGIAFGCSNASSSDWNGSAGLVGLGRGSLSLVSQ---LGAGRFSYCLTPFQDTNSTSTL 259

Query: 302 ILGGNSSVFKNSTPITYTNMIPNPQ---LATFYILNLTGISIGGKQLQASGFA------- 351
           +LG ++++  N T +  T  + +P    ++T+Y LNLTGIS+G K L  S  A       
Sbjct: 260 LLGPSAAL--NGTGVRSTPFVASPAKAPMSTYYYLNLTGISLGAKALSISPDAFSLKADG 317

Query: 352 KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGF--SILDTCFNLSAYQEV--N 407
            GG++IDSGT IT L  + Y  ++A  ++     P+  G   + LD C+ L         
Sbjct: 318 TGGLIIDSGTTITSLVNAAYQQVRAA-VQSLVTLPAIDGSDSTGLDLCYALPTPTSAPPA 376

Query: 408 IPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIY 467
           +P + + F+G A+M +       ++ S +   CLA+ + + +      GNYQQ+N  ++Y
Sbjct: 377 MPSMTLHFDG-ADMVLPADS---YMISGSGVWCLAMRNQT-DGAMSTFGNYQQQNMHILY 431

Query: 468 DTKNSQLGFAGEDCSSM 484
           D +N  L FA   CS++
Sbjct: 432 DVRNEMLSFAPAKCSTL 448


>gi|115472517|ref|NP_001059857.1| Os07g0533600 [Oryza sativa Japonica Group]
 gi|22831047|dbj|BAC15910.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|50508280|dbj|BAD32129.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|113611393|dbj|BAF21771.1| Os07g0533600 [Oryza sativa Japonica Group]
 gi|215766673|dbj|BAG98901.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 441

 Score =  171 bits (432), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 139/446 (31%), Positives = 206/446 (46%), Gaps = 57/446 (12%)

Query: 67  ITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTE----- 121
           + LEL H +   G +      ++ R   D  H      R  N   G I+  S+T      
Sbjct: 25  VRLELTHADDRGGYV----GAERVRRAADRSH------RRVNGFLGAIEGPSSTARLGSD 74

Query: 122 ----IPLTSGIRLQTLNYIATIELGGRNM--TVIVDTGSDLTWVQCQ-PCKSCYNQQDPV 174
                   + +   T  Y+  I +G   +  T ++DTGSDL W QC  PC+ C+ Q  P+
Sbjct: 75  GAGAGGAEASVHASTATYLVDIAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQPAPL 134

Query: 175 FDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPD--CNYFVSYGDGSYTRGELGR 232
           + P+ S +Y  V C S  C AL+         S  SPPD  C Y+ SYGDG+ T G L  
Sbjct: 135 YAPARSATYANVSCRSPMCQALQ------SPWSRCSPPDTGCAYYFSYGDGTSTDGVLAT 188

Query: 233 EHLGLGK-ASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPS 291
           E   LG   +V    FGCG  N G     SGL+G+GR  LSLVSQ        FSYC  +
Sbjct: 189 ETFTLGSDTAVRGVAFGCGTENLGSTDNSSGLVGMGRGPLSLVSQLGVT---RFSYCF-T 244

Query: 292 TQDAGASGSLILGGNSSVFKNSTPITYTNMIPNP-----QLATFYILNLTGISIGGKQL- 345
             +A A+  L LG ++   + S+    T  +P+P     + +++Y L+L GI++G   L 
Sbjct: 245 PFNATAASPLFLGSSA---RLSSAAKTTPFVPSPSGGARRRSSYYYLSLEGITVGDTLLP 301

Query: 346 ------QASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI-LDTCF 398
                 + +    GG++IDSGT  T L    + AL A  L      P A G  + L  CF
Sbjct: 302 IDPAVFRLTPMGDGGVIIDSGTTFTALEERAFVAL-ARALASRVRLPLASGAHLGLSLCF 360

Query: 399 NLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNY 458
             ++ + V +P + + F+G A+M +     V   +S A   CL + S        ++G+ 
Sbjct: 361 AAASPEAVEVPRLVLHFDG-ADMELRRESYVVEDRS-AGVACLGMVS---ARGMSVLGSM 415

Query: 459 QQKNQRVIYDTKNSQLGFAGEDCSSM 484
           QQ+N  ++YD +   L F    C  +
Sbjct: 416 QQQNTHILYDLERGILSFEPAKCGEL 441


>gi|357492401|ref|XP_003616489.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355517824|gb|AES99447.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 434

 Score =  171 bits (432), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 120/368 (32%), Positives = 189/368 (51%), Gaps = 38/368 (10%)

Query: 135 YIATIELGGRNMTV--IVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
           Y+ T  +G     +  I DTGSD+ W+QC+PC+ CYNQ  P+F+PS S SYK + C S  
Sbjct: 87  YLMTYSVGTPPTKIYGIADTGSDIVWLQCEPCEQCYNQTTPIFNPSKSSSYKNIPCLSKL 146

Query: 193 CHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL-----GKASVNDFIF 247
           CH++   +     CS  +   C Y +SYGD S+++G+L  + L L        S    + 
Sbjct: 147 CHSVRDTS-----CSDQN--SCQYKISYGDSSHSQGDLSVDTLSLESTSGSPVSFPKTVI 199

Query: 248 GCGRNNKGLFGGV-SGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILG-G 305
           GCG +N G FGG  SG++GLG   +SL++Q     GG FSYCL    +  ++ S IL  G
Sbjct: 200 GCGTDNAGTFGGASSGIVGLGGGPVSLITQLGSSIGGKFSYCLVPLLNKESNASSILSFG 259

Query: 306 NSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGG-----ILIDSG 360
           +++V      ++   +  +P    FY L L   S+G K+++  G ++GG     I+IDSG
Sbjct: 260 DAAVVSGDGVVSTPLIKKDP---VFYFLTLQAFSVGNKRVEFGGSSEGGDDEGNIIIDSG 316

Query: 361 TVITRLPPSIYSALKA---EFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEG 417
           T +T +P  +Y+ L++   + +K          FS+   C++L +  E + P++   F+G
Sbjct: 317 TTLTLIPSDVYTNLESAVVDLVKLDRVDDPNQQFSL---CYSLKS-NEYDFPIITAHFKG 372

Query: 418 NAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETG-IIGNYQQKNQRVIYDTKNSQLGF 476
                +++  I  FV      VC A        + G I GN  Q+N  V YD +   + F
Sbjct: 373 ---ADIELHSISTFVPITDGIVCFAFQP---SPQLGSIFGNLAQQNLLVGYDLQQKTVSF 426

Query: 477 AGEDCSSM 484
              DC+ +
Sbjct: 427 KPTDCTKV 434


>gi|225427554|ref|XP_002266533.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
           vinifera]
          Length = 447

 Score =  171 bits (432), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 134/373 (35%), Positives = 183/373 (49%), Gaps = 44/373 (11%)

Query: 135 YIATIELGGR--NMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
           Y+  I LG    +M  I DTGSDL W QC+PC SCY Q +P+FDP+ S +Y+ + C   +
Sbjct: 95  YLMNISLGTPPVSMHGIADTGSDLLWRQCKPCDSCYEQIEPIFDPAKSKTYQILSCEGKS 154

Query: 193 CHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGK-----ASVNDFIF 247
           C  L    G  G CS  +   C Y  SYGDGS+T G+L  + L +G       SV   +F
Sbjct: 155 CSNL----GGQGGCSDDN--TCIYSYSYGDGSHTSGDLAVDTLTIGSTTGRPVSVPKVVF 208

Query: 248 GCGRNNKGLF-GGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL-PSTQDAGASGSLILGG 305
           GCG NN G F    SGL+GLG   LS++SQ   + GG FSYCL P   D   S  +  G 
Sbjct: 209 GCGHNNGGTFELHGSGLVGLGGGPLSMISQLRPLIGGRFSYCLVPLGNDPSVSSKMHFGS 268

Query: 306 N---SSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAK---------- 352
               S     STP+       + Q  TFY L L  +S+G K+L   GF+K          
Sbjct: 269 RGIVSGAGAVSTPLA------SRQPDTFYYLTLESMSVGSKKLAYKGFSKVGSPLADADE 322

Query: 353 GGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCF-NLSAYQEVNIPLV 411
           G I+IDSGT +T LP   Y  L++  +    G P     ++   C+ NLS    + IP +
Sbjct: 323 GNIIIDSGTTLTLLPQDFYGTLESNVVSAIGGKPVRDPNNVFSLCYSNLSG---LRIPTI 379

Query: 412 KMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKN 471
              F G     +++  +  FV+      C A+  +S   +  I GN  Q N  V YD K+
Sbjct: 380 TAHFVG---ADLELKPLNTFVQVQEDLFCFAMIPVS---DLAIFGNLAQMNFLVGYDLKS 433

Query: 472 SQLGFAGEDCSSM 484
             + F   DC+ +
Sbjct: 434 RTVSFKPTDCTKI 446


>gi|413952720|gb|AFW85369.1| hypothetical protein ZEAMMB73_571116 [Zea mays]
          Length = 451

 Score =  170 bits (431), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 123/390 (31%), Positives = 190/390 (48%), Gaps = 41/390 (10%)

Query: 123 PLTSGIRLQTLNYIATIELG---GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSI 179
           P+T+     +  Y+    +G    + + + +DTGSDL W QC PC  C++Q  P+FDPS+
Sbjct: 75  PVTATAVPSSGEYLIHFNIGTPRPQRVALTMDTGSDLVWTQCTPCPVCFDQPFPLFDPSV 134

Query: 180 SPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHL---- 235
           S +++ V C    C     ++G S    +     C Y  SYGD S T G + ++      
Sbjct: 135 SSTFRAVACPDPICRP---SSGLSVSACALKTFRCFYLCSYGDKSITAGYIFKDTFTFMS 191

Query: 236 ----GLGKASVNDFIFGCGRNNKGLFGG-VSGLMGLGRSDLSLVSQTSEIFGGLFSYCLP 290
               G    +V+   FGCG  N G+F    SG+ G GR  LSL SQ   +  G FSYCL 
Sbjct: 192 PNGEGAPPVAVSGLAFGCGDYNTGVFASNESGIAGFGRGPLSLPSQ---LRVGRFSYCLT 248

Query: 291 S--TQDAGASGSLILGGNSSVFK--NSTPITYTNMIPNPQLATFYILNLTGISIGGKQL- 345
           S    ++  + ++ LG   +  +  +S P   T +I +P   TFY L+L GI++G  +L 
Sbjct: 249 SHDETESNKTSAVFLGTPPNGLRAHSSGPFRSTPIIHSPSFPTFYYLSLEGITVGKTRLP 308

Query: 346 -QASGFA-----KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFN 399
             +S FA      GG +IDSGT +T  P +++  LK EF+ Q       P +       N
Sbjct: 309 VDSSVFALKKDGSGGTVIDSGTGVTTFPAAVFEQLKNEFVAQL----PLPRYDNTSEVGN 364

Query: 400 LSAYQEVN----IPLVKMEFE-GNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGI 454
           L  +Q       +P+ K+ F   +A+M +       ++  D     + L     E +  +
Sbjct: 365 LLCFQRPKGGKQVPVPKLIFHLASADMDLPREN---YIPEDTDSGVMCLMINGAEVDMVL 421

Query: 455 IGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
           IGN+QQ+N  ++YD +NS+L FA   C  M
Sbjct: 422 IGNFQQQNMHIVYDVENSKLLFASAQCDKM 451


>gi|357124861|ref|XP_003564115.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 477

 Score =  170 bits (431), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 128/385 (33%), Positives = 181/385 (47%), Gaps = 42/385 (10%)

Query: 132 TLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQD-PVFDPSISPSYKKVLC 188
           T  Y+  + +G   R + + +DTGSDL W QC PC +C++Q   PV DP+ S ++  V C
Sbjct: 91  TNEYLVHLSVGTPPRPVALTLDTGSDLVWTQCAPCLNCFDQGAIPVLDPAASSTHAAVRC 150

Query: 189 NSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVND---- 244
           ++  C AL F +   G  SS     C Y   YGD S T G+L  +    G     D    
Sbjct: 151 DAPVCRALPFTSCGRG-GSSWGERSCVYVYHYGDKSITVGKLASDRFTFGPGDNADGGGV 209

Query: 245 ----FIFGCGRNNKGLF-GGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASG 299
                 FGCG  NKG+F    +G+ G GR   SL SQ        FSYC  S  ++  S 
Sbjct: 210 SERRLTFGCGHFNKGIFQANETGIAGFGRGRWSLPSQLGVTS---FSYCFTSMFES-TSS 265

Query: 300 SLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL----QASGFAKGGI 355
            + LG   +    +  +  T ++ +P   + Y L+L  I++G  ++    +     +   
Sbjct: 266 LVTLGVAPAELHLTGQVQSTPLLRDPSQPSLYFLSLKAITVGATRIPIPERRQRLREASA 325

Query: 356 LIDSGTVITRLPPSIYSALKAEFLKQFSGFP-SAPGFSILDTCFNL-------SAY---- 403
           +IDSG  IT LP  +Y A+KAEF+ Q  G P SA   S LD CF L       SA+    
Sbjct: 326 IIDSGASITTLPEDVYEAVKAEFVAQV-GLPVSAVEGSALDLCFALPSAAAPKSAFGWRW 384

Query: 404 ------QEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLAL-ASLSYEDETGIIG 456
                   V +P +     G A+  +     V F    A  +CL L A+    D+T +IG
Sbjct: 385 RGRGRAMPVRVPRLVFHLGGGADWELPRENYV-FEDYGARVMCLVLDAATGGGDQTVVIG 443

Query: 457 NYQQKNQRVIYDTKNSQLGFAGEDC 481
           NYQQ+N  V+YD +N  L FA   C
Sbjct: 444 NYQQQNTHVVYDLENDVLSFAPARC 468


>gi|357143850|ref|XP_003573078.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 431

 Score =  170 bits (430), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 119/369 (32%), Positives = 186/369 (50%), Gaps = 29/369 (7%)

Query: 133 LNYIATIELGGRNM--TVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNS 190
           + Y+  + +G   +    + DTGSDLTW QCQPCK C+ Q  PV+DPS S ++  V C+S
Sbjct: 75  VEYLMELAIGTPPVPFVALADTGSDLTWTQCQPCKLCFPQDTPVYDPSASSTFSPVPCSS 134

Query: 191 STCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA------SVND 244
           +TC         S  CS+ S   C Y  SY DG+Y+ G LG E L LG +      SV+D
Sbjct: 135 ATC----LPVLRSRNCSTPS-SLCRYGYSYSDGAYSAGILGTETLTLGSSVPGQAVSVSD 189

Query: 245 FIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILG 304
             FGCG +N G     +G +GLGR  LSL++Q      G FSYCL    ++      +LG
Sbjct: 190 VAFGCGTDNGGDSLNSTGTVGLGRGTLSLLAQLGV---GKFSYCLTDFFNSTLDSPFLLG 246

Query: 305 GNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIG-------GKQLQASGFAKGGILI 357
             + +      +  T ++ +P   + Y+++L GI++G        K       + GG+++
Sbjct: 247 TLAELAPGPGAVQSTPLLQSPLNPSRYVVSLQGITLGDVRLPIPNKTFDLHANSTGGMVV 306

Query: 358 DSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSA--YQEVNIPLVKMEF 415
           DSGT  + LP S +  +  + + Q  G P     S+   CF   A   Q   +P + + F
Sbjct: 307 DSGTTFSILPESGFRVV-VDHVAQVLGQPPVNASSLDSPCFPAPAGERQLPFMPDLVLHF 365

Query: 416 EGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLG 475
            G A+M +     + + + D+S  CL +   +      ++GN+QQ+N ++++D    QL 
Sbjct: 366 AGGADMRLHRDNYMSYNQEDSS-FCLNIVGTT--STWSMLGNFQQQNIQMLFDMTVGQLS 422

Query: 476 FAGEDCSSM 484
           F   DCS +
Sbjct: 423 FLPTDCSKL 431


>gi|357143854|ref|XP_003573079.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 417

 Score =  169 bits (428), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 121/352 (34%), Positives = 178/352 (50%), Gaps = 30/352 (8%)

Query: 148 VIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCS 207
            + DTGSDLTW QCQPCK C+ Q  PV+DPS S ++  V C+S+TC      T  S  CS
Sbjct: 81  ALADTGSDLTWTQCQPCKLCFPQDTPVYDPSASSTFSPVPCSSATC----LPTWRSRNCS 136

Query: 208 SSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA------SVNDFIFGCGRNNKGLFGGVS 261
           + S P C Y  SY DG+Y+ G LG E L +G +      SV    FGCG +N G     +
Sbjct: 137 NPSSP-CRYIYSYSDGAYSVGILGTETLTIGSSVPGQTVSVGSVAFGCGTDNGGDSLNST 195

Query: 262 GLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNM 321
           G +GLGR  LSL++Q      G FSYCL    ++       LG  + +      +  T +
Sbjct: 196 GTVGLGRGTLSLLAQLGV---GKFSYCLTDFFNSTMDSPFFLGTLAELAPGPGTVQSTPL 252

Query: 322 IPNPQLATFYILNLTGISIGGKQ---------LQASGFAKGGILIDSGTVITRLPPSIYS 372
           + +P   + Y +NL GIS+G  +         L+A G   GG+++DSGT  T L  S + 
Sbjct: 253 LQSPLNPSRYFVNLQGISLGDVRLPIPNGTFDLRADG--NGGMMVDSGTTFTILAKSGFR 310

Query: 373 ALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFV 432
            +  + + Q  G P     S+   CF  S   E  +P + + F G A+M +     + + 
Sbjct: 311 EV-VDRVAQLLGQPPVNASSLDSPCFP-SPDGEPFMPDLVLHFAGGADMRLHRDNYMSYN 368

Query: 433 KSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
           + D+S  CL +  +        +GN+QQ+N ++++D    QL F   DCS +
Sbjct: 369 EDDSS-FCLNI--VGSPSTWSRLGNFQQQNIQMLFDMTVGQLSFLPTDCSKL 417


>gi|359476206|ref|XP_002262837.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 462

 Score =  169 bits (427), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 123/348 (35%), Positives = 184/348 (52%), Gaps = 37/348 (10%)

Query: 144 RNMTVIVDTGSDLTWVQCQPCK--SCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATG 201
           +N+ +I+DTGSD TW++C  C   +C+N++ P F+PS+S SY    C  ST         
Sbjct: 140 QNLNLIIDTGSDTTWIRCNSCSLGNCHNKKIPTFNPSLSSSYSNRSCIPST--------- 190

Query: 202 NSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVS 261
                        NY ++Y D SY++G    + + L       F FGCG +  G FG  S
Sbjct: 191 -----------KTNYTMNYEDNSYSKGVFVCDEVTLKPDVFPKFQFGCGDSGGGDFGSAS 239

Query: 262 GLMGLGRSD-LSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTN 320
           G++GL + +  SL+SQT+  F   FSYC P  ++    GSL+ G        S  + +T 
Sbjct: 240 GVLGLAQGEQYSLISQTASKFKKKFSYCFPHNENT--RGSLLFG--EKAISASPSLKFTR 295

Query: 321 MIPNPQLATFYILNLTGISIGGKQLQASG--FAKGGILIDSGTVITRLPPSIYSALKAEF 378
           ++ NP   + Y + L GIS+  K+L  S   FA  G +IDSGTVIT LP + Y AL+  F
Sbjct: 296 LL-NPSSGSVYFVELIGISVAKKRLNVSSSLFASPGTIIDSGTVITHLPTAAYEALRTAF 354

Query: 379 LKQFSGFPSA---PGFSILDTCFNLSAY--QEVNIPLVKMEFEGNAEMTVDVTGIVYFVK 433
            ++    PS    P    LDTC+NL     + + +P + + F G  ++++  +GI++   
Sbjct: 355 QQEMLHCPSVSPPPQEKPLDTCYNLKGCGGRNIKLPEIVLHFVGEVDVSLHPSGILW-AN 413

Query: 434 SDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
            D +Q CLA A  S+     IIGN QQ + +V+YD +  +LGF G DC
Sbjct: 414 GDLTQACLAFARKSHPSHVTIIGNRQQVSLKVVYDIEGGRLGF-GNDC 460


>gi|168005153|ref|XP_001755275.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162693403|gb|EDQ79755.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 429

 Score =  168 bits (426), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 139/378 (36%), Positives = 188/378 (49%), Gaps = 42/378 (11%)

Query: 121 EIPLTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPS 178
           E P+ SG       Y+  I  G   +  T IVDTGSDL WVQC PCKSCY      FDPS
Sbjct: 80  ETPVASG----NGEYLIDISYGNPPQKSTAIVDTGSDLNWVQCLPCKSCYETLSAKFDPS 135

Query: 179 ISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG 238
            S SYK + C S+ C  L F +     C++S    C Y   YGDGS T G L  + + +G
Sbjct: 136 KSASYKTLGCGSNFCQDLPFQS-----CAAS----CQYDYMYGDGSSTSGALSTDDVTIG 186

Query: 239 KASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGAS 298
              + +  FGCG +N G F G  GL+GLG+  LSLVSQ        FSYCL        S
Sbjct: 187 TGKIPNVAFGCGNSNLGTFAGAGGLVGLGKGPLSLVSQLGGTATKKFSYCLVPLGSTKTS 246

Query: 299 GSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ--ASGF-----A 351
              I  G+S++   +  + YT M+ N    TFY   L GIS+ GK +   A+ F      
Sbjct: 247 PLYI--GDSTL---AGGVAYTPMLTNNNYPTFYYAELQGISVEGKAVNYPANTFDIAATG 301

Query: 352 KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPG-FSILDTCFNLSAYQEVNIPL 410
           +GG+++DSGT +T L    ++ + A  LK    +P A G F  L+ CF+ +       P 
Sbjct: 302 RGGLILDSGTTLTYLDVDAFNPMVAA-LKAALPYPEADGSFYGLEYCFSTAGVANPTYPT 360

Query: 411 VKMEFEG-NAEMTVDVTGIVYFVKSD-ASQVCLALASLSYEDETG--IIGNYQQKNQRVI 466
           V   F G +  +  D T    F+  D     CLA+AS      TG  I GN QQ N  ++
Sbjct: 361 VVFHFNGADVALAPDNT----FIALDFEGTTCLAMAS-----STGFSIFGNIQQLNHVIV 411

Query: 467 YDTKNSQLGFAGEDCSSM 484
           +D  N ++GF   +C ++
Sbjct: 412 HDLVNKRIGFKSANCETI 429


>gi|357122568|ref|XP_003562987.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 455

 Score =  167 bits (424), Expect = 8e-39,   Method: Compositional matrix adjust.
 Identities = 133/389 (34%), Positives = 191/389 (49%), Gaps = 57/389 (14%)

Query: 134 NYIATIELGGRNMT--VIVDTGSDLTWVQCQPC--------KSCYNQQDPVFDPSISPSY 183
            YI T+ +G   ++   I DTGSDL W QC PC          C+ Q   +++PS S ++
Sbjct: 86  EYIMTLSIGTPPLSYRAIADTGSDLIWTQCAPCGDTVTDTDNQCFKQSGCLYNPSSSTTF 145

Query: 184 KKVLCNS--STCHALEFATGNSGVCSSSSPPDCN--YFVSYGDGSYTRGELGREHLGLGK 239
             + CNS  S C A+            S PP C   Y  +YG G +T G    E    G 
Sbjct: 146 GVLPCNSPLSMCAAMA---------GPSPPPGCACMYNQTYGTG-WTAGVQSVETFTFGS 195

Query: 240 AS------VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQ 293
           +S      V +  FGC   +   + G +GL+GLGR  +SLVSQ   +  G FSYCL   Q
Sbjct: 196 SSTPPAVRVPNIAFGCSNASSNDWNGSAGLVGLGRGSMSLVSQ---LGAGAFSYCLTPFQ 252

Query: 294 DAGASGSLILGGNSSV-FKNSTPITYTNMIPNPQ---LATFYILNLTGISIG-------- 341
           DA ++ +L+LG +++   K + P+  T  +  P    ++T+Y LNLTGIS+G        
Sbjct: 253 DANSTSTLLLGPSAAAALKGTGPVRSTPFVAGPSKAPMSTYYYLNLTGISVGETALAIPP 312

Query: 342 -GKQLQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQF-SGFPSA--PGFSI-LDT 396
               L+A G   GG++IDSGT IT L  S Y  ++A       +  P A  P  S  LD 
Sbjct: 313 DAFSLRADG--TGGLIIDSGTTITTLVDSAYQQVRAAVRSLLVTRLPLAHGPDHSTGLDL 370

Query: 397 CFNLSA-YQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGII 455
           CF L A      +P + + FEG A+M + V   +      +   CLA+ + +      ++
Sbjct: 371 CFALKASTPPPAMPSMTLHFEGGADMVLPVENYMIL---GSGVWCLAMRNQTV-GAMSMV 426

Query: 456 GNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
           GNYQQ+N  V+YD +   L FA   CSS+
Sbjct: 427 GNYQQQNIHVLYDVRKETLSFAPAVCSSL 455


>gi|50508275|dbj|BAD32124.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
          Length = 451

 Score =  167 bits (423), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 132/379 (34%), Positives = 181/379 (47%), Gaps = 68/379 (17%)

Query: 145 NMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSG 204
             +V+ DTGS L W QC PC  C  +  P F P+ S ++ K+ C SS C   +F T    
Sbjct: 102 TFSVLADTGSSLIWTQCAPCTECAARPAPPFQPASSSTFSKLPCASSLC---QFLTSPYL 158

Query: 205 VCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLM 264
            C+++    C Y+  YG G +T G L  E L +G AS     FGC   N G+    SG++
Sbjct: 159 TCNATG---CVYYYPYGMG-FTAGYLATETLHVGGASFPGVAFGCSTEN-GVGNSSSGIV 213

Query: 265 GLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGAS----GSL--ILGGNSSVFKNSTPITY 318
           GLGRS LSLVSQ      G FSYCL S  DAG S    GSL  + GGN      STP   
Sbjct: 214 GLGRSPLSLVSQVGV---GRFSYCLRSDADAGDSPILFGSLAKVTGGN----VQSTP--- 263

Query: 319 TNMIPNPQL--ATFYILNLTGISIGGKQLQAS----GFAK-------GGILIDSGTVITR 365
             ++ NP++  +++Y +NLTGI++G   L  +    GF +       GG ++DSGT +T 
Sbjct: 264 --LLENPEMPSSSYYYVNLTGITVGATDLPVTSTTFGFTRGAGAGLVGGTIVDSGTTLTY 321

Query: 366 LPPSIYSALKAEFLKQFS-------------GFPSAPGFSILDTCFNLSAY---QEVNIP 409
           L    Y+ +K  FL Q +             GF         D CF+ +A      V +P
Sbjct: 322 LVKEGYAMVKRAFLSQMATANLTTTVNGTRFGF---------DLCFDATAAGGGSGVPVP 372

Query: 410 LVKMEFEGNAEMTVDVTGIVYFVKSD----ASQVCLALASLSYEDETGIIGNYQQKNQRV 465
            + + F G AE  V     V  V  D    A+  CL +   S +    IIGN  Q +  V
Sbjct: 373 TLVLRFAGGAEYAVRRRSYVGVVAVDSQGRAAVECLLVLPASEKLSISIIGNVMQMDLHV 432

Query: 466 IYDTKNSQLGFAGEDCSSM 484
           +YD       FA  DC+++
Sbjct: 433 LYDLDGGMFSFAPADCANV 451


>gi|312283333|dbj|BAJ34532.1| unnamed protein product [Thellungiella halophila]
          Length = 428

 Score =  166 bits (421), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 130/414 (31%), Positives = 193/414 (46%), Gaps = 50/414 (12%)

Query: 82  VDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRL-QTLNYIATIE 140
           V W    ++ L+ D   +QYL S  K              +P+ SG  + Q+  YI    
Sbjct: 52  VSW----ESTLLKDKARLQYLSSLAKK-----------PSVPIASGRAIVQSPTYIVRAN 96

Query: 141 LG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEF 198
           +G   + M V +DT +D  WV C  C  C +    +FDPS S S + + C++  C     
Sbjct: 97  IGTPAQPMLVALDTSNDAAWVPCSGCVGCASSV--LFDPSKSSSSRNLQCDAPQCKQAPN 154

Query: 199 ATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFG 258
            T  +G         C + ++YG GS     L ++ L L    +  + FGC     G   
Sbjct: 155 PTCTAGK-------SCGFNMTYG-GSTIEASLTQDTLTLANDVIKSYTFGCISKATGTSL 206

Query: 259 GVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITY 318
              GLMGLGR  LSL+SQT  ++   FSYCLP+++ +  SGSL LG      +    I  
Sbjct: 207 PAQGLMGLGRGPLSLISQTQNLYMSTFSYCLPNSKSSNFSGSLRLGPKYQPVR----IKT 262

Query: 319 TNMIPNPQLATFYILNLTGISIGGK--QLQASGFA-----KGGILIDSGTVITRLPPSIY 371
           T ++ NP+ ++ Y +NL GI +G K   +  S  A       G + DSGTV TRL    Y
Sbjct: 263 TPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDASTGAGTIFDSGTVFTRLVEPAY 322

Query: 372 SALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYF 431
            A++ EF ++     +A      DTC++ S    V  P V   F G   M V +      
Sbjct: 323 VAVRNEFRRRIKNA-NATSLGGFDTCYSGS----VVYPSVTFMFAG---MNVTLPPDNLL 374

Query: 432 VKSDA-SQVCLALASLSYEDET--GIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
           + S + S  CLA+A+      +   +I + QQ+N RV+ D  NS+LG + E C+
Sbjct: 375 IHSSSGSTSCLAMAAAPNNVNSVLNVIASMQQQNHRVLIDLPNSRLGISRETCT 428


>gi|356546372|ref|XP_003541600.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 433

 Score =  166 bits (421), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 138/437 (31%), Positives = 216/437 (49%), Gaps = 51/437 (11%)

Query: 65  GAITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPL 124
           G  ++E+ H++          E Q  R + + +H    ++   N ++ +    ++ E  +
Sbjct: 27  GGFSVEMIHRDSSRSPFFSPTETQFQR-VANAVHRSINRA---NHLNQSFVSPNSPETTV 82

Query: 125 TSGIRLQTLNYIATIELGGRNMTV--IVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPS 182
            S +      Y+ +  +G  ++ V  I+DTGSD+ W+QCQPCK CY Q  P+FD S S +
Sbjct: 83  ISALG----EYLISYSVGTPSLQVFGILDTGSDIIWLQCQPCKKCYEQTTPIFDSSKSQT 138

Query: 183 YKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASV 242
           YK + C S+TC +++        CSS     C Y + Y DGS + G+L  E L LG  + 
Sbjct: 139 YKTLPCPSNTCQSVQ-----GTFCSSRK--HCLYSIHYVDGSQSLGDLSVETLTLGSTNG 191

Query: 243 NDFIF-----GCGRNNK-GLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL-PSTQDA 295
           +   F     GCGR N  G+    SG++GLGR  +SL++Q S   GG FSYCL P    A
Sbjct: 192 SPVQFPGTVIGCGRYNAIGIEEKNSGIVGLGRGPMSLITQLSPSTGGKFSYCLVPGLSTA 251

Query: 296 GASGSLILGGNSSVFKN----STPITYTNMIPNPQLATFYILNLTGISIGGKQLQ----A 347
            +  +    GN++V       STP+   N +       FY L L   S+G  +++     
Sbjct: 252 SSKLNF---GNAAVVSGRGTVSTPLFSKNGL------VFYFLTLEAFSVGRNRIEFGSPG 302

Query: 348 SGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQ-EV 406
           SG  KG I+IDSGT +T LP  +YS L+A   K            +L  C+ ++  + + 
Sbjct: 303 SG-GKGNIIIDSGTTLTALPNGVYSKLEAAVAKTVILQRVRDPNQVLGLCYKVTPDKLDA 361

Query: 407 NIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETG-IIGNYQQKNQRV 465
           ++P++   F G A++T++   I  FV+     VC A        ETG + GN  Q+N  V
Sbjct: 362 SVPVITAHFSG-ADVTLN--AINTFVQVADDVVCFAFQ----PTETGAVFGNLAQQNLLV 414

Query: 466 IYDTKNSQLGFAGEDCS 482
            YD + + + F   DC+
Sbjct: 415 GYDLQMNTVSFKHTDCT 431


>gi|115472515|ref|NP_001059856.1| Os07g0532800 [Oryza sativa Japonica Group]
 gi|50508274|dbj|BAD32123.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
 gi|113611392|dbj|BAF21770.1| Os07g0532800 [Oryza sativa Japonica Group]
          Length = 436

 Score =  166 bits (421), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 122/368 (33%), Positives = 174/368 (47%), Gaps = 38/368 (10%)

Query: 141 LGGRNM-----------TVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCN 189
           +GG NM           +V+ DTGSDL W QC PC  C+ Q  P F P+ S ++ K+ C 
Sbjct: 83  VGGYNMNISVGTPLLTFSVVADTGSDLIWTQCAPCTKCFQQPAPPFQPASSSTFSKLPCT 142

Query: 190 SSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGC 249
           SS C   +F   +   C+++    C Y   YG G YT G L  E L +G AS     FGC
Sbjct: 143 SSFC---QFLPNSIRTCNATG---CVYNYKYGSG-YTAGYLATETLKVGDASFPSVAFGC 195

Query: 250 GRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSV 309
              N G+    SG+ GLGR  LSL+ Q      G FSYCL S   AGAS   IL G+ + 
Sbjct: 196 STEN-GVGNSTSGIAGLGRGALSLIPQLGV---GRFSYCLRSGSAAGASP--ILFGSLAN 249

Query: 310 FKNSTPITYTNMIPNPQL-ATFYILNLTGISIGGKQLQAS----GFAK----GGILIDSG 360
             +   +  T  + NP +  ++Y +NLTGI++G   L  +    GF +    GG ++DSG
Sbjct: 250 LTDGN-VQSTPFVNNPAVHPSYYYVNLTGITVGETDLPVTTSTFGFTQNGLGGGTIVDSG 308

Query: 361 TVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCF--NLSAYQEVNIPLVKMEFEGN 418
           T +T L    Y  +K  FL Q +   +  G   LD CF         + +P + + F+G 
Sbjct: 309 TTLTYLAKDGYEMVKQAFLSQTADVTTVNGTRGLDLCFKSTGGGGGGIAVPSLVLRFDGG 368

Query: 419 AEMTVDV--TGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGF 476
           AE  V     G+    +   +  CL +     +    +IGN  Q +  ++YD       F
Sbjct: 369 AEYAVPTYFAGVETDSQGSVTVACLMMLPAKGDQPMSVIGNVMQMDMHLLYDLDGGIFSF 428

Query: 477 AGEDCSSM 484
           A  DC+ +
Sbjct: 429 APADCAKV 436


>gi|302822373|ref|XP_002992845.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
 gi|300139393|gb|EFJ06135.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
          Length = 510

 Score =  166 bits (420), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 121/379 (31%), Positives = 181/379 (47%), Gaps = 33/379 (8%)

Query: 133 LNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNS 190
           L Y   ++LG     + +I+DTGSD++W+QC PCK C     P F+P  S S+ K+ C S
Sbjct: 136 LEYYVPLQLGTPAVEVVLIMDTGSDVSWIQCVPCKDCVPALRPPFNPRHSSSFFKLPCAS 195

Query: 191 STCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGRE-------HLGLGK-ASV 242
           STC       G    CS S    C + + YGDGS + G L  E       + G G+   +
Sbjct: 196 STC--TNVYQGVKPFCSPSG-RTCLFSIQYGDGSLSSGLLAMETIAGNTPNFGDGEPVKL 252

Query: 243 NDFIFGCGR-NNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSL 301
           ++   GC   + +GL  G SGL+G+ R  +S  SQ S  +   FS+C P       S  L
Sbjct: 253 SNITLGCADIDREGLPTGASGLLGMDRRPISFPSQLSSRYARKFSHCFPDKIAHLNSSGL 312

Query: 302 ILGGNSSVFKNSTPITYTNMIPNPQLAT----FYILNLTGISIGGKQLQASG-------- 349
           +  G S +   S  + YT ++ NP + +    +Y + L GIS+   +L  S         
Sbjct: 313 VFFGESDII--SPYLRYTPLVQNPAVPSASLDYYYVGLVGISVDESRLPLSHKNFDIDKV 370

Query: 350 FAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNL----SAYQE 405
              GG +IDSGT  T L    + A++ EFL + S        S    C+N+    +A + 
Sbjct: 371 TGSGGTIIDSGTAFTYLKKPAFQAMRREFLARTSHLAKVDDNSGFTPCYNITSGTAALES 430

Query: 406 VNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALA-SLSYEDETGIIGNYQQKNQR 464
             +P + + F G  ++ +    I+  V S   Q  L LA  +S +    IIGNYQQ+N  
Sbjct: 431 TILPSITLHFRGGLDVVLPKNSILIPVSSSEEQTTLCLAFQMSGDIPFNIIGNYQQQNLW 490

Query: 465 VIYDTKNSQLGFAGEDCSS 483
           V YD +  +LG A   C++
Sbjct: 491 VEYDLEKLRLGIAPAQCAT 509


>gi|302811785|ref|XP_002987581.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
 gi|300144735|gb|EFJ11417.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
          Length = 511

 Score =  165 bits (418), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 121/379 (31%), Positives = 180/379 (47%), Gaps = 33/379 (8%)

Query: 133 LNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNS 190
           L Y   +++G     + +I+DTGSD++W+QC PCK C     P F+P  S S+ K+ C S
Sbjct: 137 LEYYVPLQVGTPAVEVVLIMDTGSDVSWIQCVPCKDCVPALRPPFNPRHSSSFFKLPCAS 196

Query: 191 STCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGRE-------HLGLGK-ASV 242
           STC       G    CS S    C + + YGDGS + G L  E       + G G+   +
Sbjct: 197 STC--TNVYQGVKPFCSPSG-RTCLFSIQYGDGSLSSGLLAMETIAGNTPNFGDGEPVKL 253

Query: 243 NDFIFGCGR-NNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSL 301
           ++   GC   + +GL  G SGL+G+ R  +S  SQ S  +   FS+C P       S  L
Sbjct: 254 SNITLGCADIDREGLPTGASGLLGMDRRPISFPSQLSSRYARKFSHCFPDKIAHLNSSGL 313

Query: 302 ILGGNSSVFKNSTPITYTNMIPNPQLAT----FYILNLTGISIGGKQLQASG-------- 349
           +  G S +   S  + YT ++ NP + +    +Y + L GIS+   +L  S         
Sbjct: 314 VFFGESDII--SPYLRYTPLVQNPAVPSASLDYYYVGLVGISVDESRLPLSHKNFDIDKV 371

Query: 350 FAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLS----AYQE 405
              GG +IDSGT  T L    + A++ EFL + S        S    C+N++    A + 
Sbjct: 372 TGSGGTIIDSGTAFTYLKKPAFQAMRREFLARTSHLAKVDDNSGFTPCYNITSGTAALES 431

Query: 406 VNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYED-ETGIIGNYQQKNQR 464
             +P + + F G  ++ +    I+  V S   Q  L LA L   D    IIGNYQQ+N  
Sbjct: 432 TILPSITLHFRGGLDVVLPKNSILIPVSSSEEQTTLCLAFLMSGDIPFNIIGNYQQQNLW 491

Query: 465 VIYDTKNSQLGFAGEDCSS 483
           V YD +  +LG A   C++
Sbjct: 492 VEYDLEKLRLGIAPAQCAT 510


>gi|125558632|gb|EAZ04168.1| hypothetical protein OsI_26310 [Oryza sativa Indica Group]
 gi|125600539|gb|EAZ40115.1| hypothetical protein OsJ_24558 [Oryza sativa Japonica Group]
          Length = 453

 Score =  165 bits (417), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 130/376 (34%), Positives = 194/376 (51%), Gaps = 38/376 (10%)

Query: 134 NYIATIELG--GRNMTVIVDTGSDLTWVQCQPC-KSCYNQQDPVFDPSISPSYKKVLCNS 190
            YI T+ +G   ++   I DTGSDL W QC PC + C+ Q  P+++PS SP+++ + C+S
Sbjct: 91  EYIMTLAIGTPPQSYPAIADTGSDLVWTQCAPCGERCFKQPSPLYNPSSSPTFRVLPCSS 150

Query: 191 STCHALEFATGNSGVCSSSSPPDC--NYFVSYGDGSYTRGELGREHLGLG-----KASVN 243
               AL      + +  ++ PP C   Y  +YG G +T G  G E    G     +  V 
Sbjct: 151 ----ALNLCAAEARLAGATPPPGCACRYNQTYGTG-WTSGLQGSETFTFGSSPADQVRVP 205

Query: 244 DFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLIL 303
              FGC   +   + G +GL+GLGR  LSLVSQ   +  G+FSYCL   QD  +  +L+L
Sbjct: 206 GIAFGCSNASSDDWNGSAGLVGLGRGGLSLVSQ---LAAGMFSYCLTPFQDTKSKSTLLL 262

Query: 304 GGNSSVFK-NSTPITYTNMIPNPQ---LATFYILNLTGISIGGKQLQ--ASGFA-----K 352
           G  ++    N T +  T  +P+P    ++T+Y LNLTGIS+G   L      FA      
Sbjct: 263 GPAAAAAALNGTGVRSTPFVPSPSKPPMSTYYYLNLTGISVGAAALPIPPGAFALRADGT 322

Query: 353 GGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI--LDTCFNL--SAYQEVNI 408
           GG++IDSGT IT L  + Y  ++A  ++     P   G +   LD CF L  S+     +
Sbjct: 323 GGLIIDSGTTITSLVDAAYKRVRAA-VRSLVKLPVTDGSNATGLDLCFALPSSSAPPATL 381

Query: 409 PLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYD 468
           P + + F G A+M + V   +     D    CLA+ S + + E   +GNYQQ+N  ++YD
Sbjct: 382 PSMTLHFGGGADMVLPVENYMIL---DGGMWCLAMRSQT-DGELSTLGNYQQQNLHILYD 437

Query: 469 TKNSQLGFAGEDCSSM 484
            +   L FA   CS++
Sbjct: 438 VQKETLSFAPAKCSTL 453


>gi|356555042|ref|XP_003545848.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 431

 Score =  165 bits (417), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 124/370 (33%), Positives = 183/370 (49%), Gaps = 48/370 (12%)

Query: 134 NYIATIELGGRNMTV--IVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSS 191
           +Y+ +  LG     V  IVDT SD+ WVQCQ C++CYN   P+FDPS S +YK + C+S+
Sbjct: 87  DYLMSYSLGTPPFPVYGIVDTASDIIWVQCQLCETCYNDTSPMFDPSYSKTYKNLPCSST 146

Query: 192 TCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVND------- 244
           TC +++   G S  CSS     C + V+Y DGS+++G+L  E + LG  S ND       
Sbjct: 147 TCKSVQ---GTS--CSSDERKICEHTVNYKDGSHSQGDLIVETVTLG--SYNDPFVHFPR 199

Query: 245 FIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGAS----GS 300
            + GC RN    F  + G++GLG   +SLV Q S      FSYCL    D  +      +
Sbjct: 200 TVIGCIRNTNVSFDSI-GIVGLGGGPVSLVPQLSSSISKKFSYCLAPISDRSSKLKFGDA 258

Query: 301 LILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ-----ASGFAKGGI 355
            ++ G+ +V   ST I + +         FY L L   S+G  +++     +    KG I
Sbjct: 259 AMVSGDGTV---STRIVFKDW------KKFYYLTLEAFSVGNNRIEFRSSSSRSSGKGNI 309

Query: 356 LIDSGTVITRLPPSIYSALK---AEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVK 412
           +IDSGT  T LP  +YS L+   A+ +K          FS+   C+  S Y +V++P++ 
Sbjct: 310 IIDSGTTFTVLPDDVYSKLESAVADVVKLERAEDPLKQFSL---CYK-STYDKVDVPVIT 365

Query: 413 MEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNS 472
             F G     V +  +  F+ +    VCLA  S        I GN  Q+N  V YD +  
Sbjct: 366 AHFSG---ADVKLNALNTFIVASHRVVCLAFLS---SQSGAIFGNLAQQNFLVGYDLQRK 419

Query: 473 QLGFAGEDCS 482
            + F   DC+
Sbjct: 420 IVSFKPTDCT 429


>gi|242094478|ref|XP_002437729.1| hypothetical protein SORBIDRAFT_10g001440 [Sorghum bicolor]
 gi|241915952|gb|EER89096.1| hypothetical protein SORBIDRAFT_10g001440 [Sorghum bicolor]
          Length = 486

 Score =  164 bits (416), Expect = 7e-38,   Method: Compositional matrix adjust.
 Identities = 115/344 (33%), Positives = 174/344 (50%), Gaps = 28/344 (8%)

Query: 146 MTVIVDTGSDLTWVQCQPC--KSCYNQQDPVFDPSISPSYKKVLCNSSTCHAL-EFATGN 202
           +TV++DT  D+ W++C PC    C +     +DP+ S +Y    CNSS C  L  +A G 
Sbjct: 163 VTVVLDTAGDVPWMRCVPCTFAQCAD-----YDPTRSSTYSAFPCNSSACKQLGRYANG- 216

Query: 203 SGVCSSSSPPDCNYFV-SYGDGSYTRGELGREHLGLGKAS-VNDFIFGCGRNNKGLFGG- 259
              C ++    C Y V + GD   T G    + L +     V  F FGC +N +G F   
Sbjct: 217 ---CDANG--QCQYMVVTAGDSFTTSGTYSSDVLTINSGDRVEGFRFGCSQNEQGSFENQ 271

Query: 260 VSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYT 319
             G+M LGR   SL++QTS  +G  FSYCLP T+       + +   +S    +TP+   
Sbjct: 272 ADGIMALGRGVQSLMAQTSSTYGDAFSYCLPPTETTKGFFQIGVPIGASYRFVTTPMLKE 331

Query: 320 NMIPNPQLATFYILNLTGISIGGKQLQ--ASGFAKGGILIDSGTVITRLPPSIYSALKAE 377
               +   AT Y   L  I++ GK+L   A  FA G ++ DS T+ITRLP + Y AL+A 
Sbjct: 332 RGGASAAAATLYRALLLAITVDGKELNVPAEVFAAGTVM-DSRTIITRLPVTAYGALRAA 390

Query: 378 FLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDAS 437
           F  +   +  AP    LDTC++L+  +   +P + + F+GNA + +D +GI+        
Sbjct: 391 FRNRMR-YRVAPPQEELDTCYDLTGVRYPRLPRIALVFDGNAVVEMDRSGILL------- 442

Query: 438 QVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
             CLA AS   +    I+GN QQ+  +V++D    ++GF    C
Sbjct: 443 NGCLAFASNDDDSSPSILGNVQQQTIQVLHDVGGGRIGFRSAAC 486


>gi|115452685|ref|NP_001049943.1| Os03g0318400 [Oryza sativa Japonica Group]
 gi|108707841|gb|ABF95636.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113548414|dbj|BAF11857.1| Os03g0318400 [Oryza sativa Japonica Group]
          Length = 434

 Score =  164 bits (416), Expect = 8e-38,   Method: Compositional matrix adjust.
 Identities = 128/370 (34%), Positives = 183/370 (49%), Gaps = 27/370 (7%)

Query: 130 LQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVL 187
           + T  Y+  + +G   + + + +DTGSDL W QCQPC +C++Q  P FDPS S +     
Sbjct: 77  VPTTEYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTS 136

Query: 188 CNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL--GKASVNDF 245
           C+S+ C  L  A+  S     +    C Y  SYGD S T G L  +        ASV   
Sbjct: 137 CDSTLCQGLPVASCGSPKFWPNQ--TCVYTYSYGDKSVTTGFLEVDKFTFVGAGASVPGV 194

Query: 246 IFGCGRNNKGLF-GGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILG 304
            FGCG  N G+F    +G+ G GR  LSL SQ      G FS+C  +      S +++L 
Sbjct: 195 AFGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKV---GNFSHCFTAVNGLKPS-TVLLD 250

Query: 305 GNSSVFKNST-PITYTNMIPNPQLATFYILNLTGISIGGKQLQA--SGFA----KGGILI 357
             + ++K+    +  T +I NP   TFY L+L GI++G  +L    S FA     GG +I
Sbjct: 251 LPADLYKSGRGAVQSTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFALKNGTGGTII 310

Query: 358 DSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVN--IPLVKMEF 415
           DSGT +T LP  +Y  ++  F  Q    P   G +  D  F LSA       +P + + F
Sbjct: 311 DSGTAMTSLPTRVYRLVRDAFAAQVK-LPVVSG-NTTDPYFCLSAPLRAKPYVPKLVLHF 368

Query: 416 EGNAEMTVDVTGIVYFVKSDASQV-CLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQL 474
           EG A M +     V+ V+   S + CLA+       E   IGN+QQ+N  V+YD +NS+L
Sbjct: 369 EG-ATMDLPRENYVFEVEDAGSSILCLAIIE---GGEVTTIGNFQQQNMHVLYDLQNSKL 424

Query: 475 GFAGEDCSSM 484
            F    C  +
Sbjct: 425 SFVPAQCDKL 434


>gi|115472519|ref|NP_001059858.1| Os07g0533800 [Oryza sativa Japonica Group]
 gi|113611394|dbj|BAF21772.1| Os07g0533800 [Oryza sativa Japonica Group]
          Length = 458

 Score =  164 bits (416), Expect = 8e-38,   Method: Compositional matrix adjust.
 Identities = 130/376 (34%), Positives = 194/376 (51%), Gaps = 38/376 (10%)

Query: 134 NYIATIELG--GRNMTVIVDTGSDLTWVQCQPC-KSCYNQQDPVFDPSISPSYKKVLCNS 190
            YI T+ +G   ++   I DTGSDL W QC PC + C+ Q  P+++PS SP+++ + C+S
Sbjct: 96  EYIMTLAIGTPPQSYPAIADTGSDLVWTQCAPCGERCFKQPSPLYNPSSSPTFRVLPCSS 155

Query: 191 STCHALEFATGNSGVCSSSSPP--DCNYFVSYGDGSYTRGELGREHLGLG-----KASVN 243
               AL      + +  ++ PP   C Y  +YG G +T G  G E    G     +  V 
Sbjct: 156 ----ALNLCAAEARLAGATPPPGCACRYNQTYGTG-WTSGLQGSETFTFGSSPADQVRVP 210

Query: 244 DFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLIL 303
              FGC   +   + G +GL+GLGR  LSLVSQ   +  G+FSYCL   QD  +  +L+L
Sbjct: 211 GIAFGCSNASSDDWNGSAGLVGLGRGGLSLVSQ---LAAGMFSYCLTPFQDTKSKSTLLL 267

Query: 304 GGNSSVFK-NSTPITYTNMIPNPQ---LATFYILNLTGISIGGKQLQ--ASGFA-----K 352
           G  ++    N T +  T  +P+P    ++T+Y LNLTGIS+G   L      FA      
Sbjct: 268 GPAAAAAALNGTGVRSTPFVPSPSKPPMSTYYYLNLTGISVGPAALPIPPGAFALRADGT 327

Query: 353 GGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI--LDTCFNL--SAYQEVNI 408
           GG++IDSGT IT L  + Y  ++A  ++     P   G +   LD CF L  S+     +
Sbjct: 328 GGLIIDSGTTITSLVDAAYKRVRAA-VRSLVKLPVTDGSNATGLDLCFALPSSSAPPATL 386

Query: 409 PLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYD 468
           P + + F G A+M + V   +     D    CLA+ S + + E   +GNYQQ+N  ++YD
Sbjct: 387 PSMTLHFGGGADMVLPVENYMIL---DGGMWCLAMRSQT-DGELSTLGNYQQQNLHILYD 442

Query: 469 TKNSQLGFAGEDCSSM 484
            +   L FA   CS++
Sbjct: 443 VQKETLSFAPAKCSTL 458


>gi|242079447|ref|XP_002444492.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
 gi|241940842|gb|EES13987.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
          Length = 441

 Score =  164 bits (415), Expect = 8e-38,   Method: Compositional matrix adjust.
 Identities = 137/406 (33%), Positives = 184/406 (45%), Gaps = 49/406 (12%)

Query: 103 QSRIKNMISGNIKDVSNTEIPLTSGIRLQTLN---YIATIELGGRNM--TVIVDTGSDLT 157
           ++R+  + S  +      + P+T+   L T +   Y+  + +G   +  T I+DTGSDL 
Sbjct: 55  KARVAALQSAAVSPAPVAD-PITAARVLVTASSGEYLVDLAIGTPPLYYTAIMDTGSDLI 113

Query: 158 WVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYF 217
           W QC PC  C  Q  P FD   S +Y+ + C SS C AL     +S  C       C Y 
Sbjct: 114 WTQCAPCLLCAAQPTPYFDVKRSATYRALPCRSSRCAAL-----SSPSCFKKM---CVYQ 165

Query: 218 VSYGDGSYTRGELGREHLGLGKAS-----VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLS 272
             YGD + T G L  E    G AS       +  FGCG  N G     SG++G GR  LS
Sbjct: 166 YYYGDTASTAGVLANETFTFGAASSTKVRAANISFGCGSLNAGELANSSGMVGFGRGPLS 225

Query: 273 LVSQTSEIFGGLFSYCL-------PSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNP 325
           LVSQ        FSYCL       PS    G   +L    NS+   + +P+  T  + NP
Sbjct: 226 LVSQLGP---SRFSYCLTSYLSPTPSRLYFGVFANL----NSTNTSSGSPVQSTPFVINP 278

Query: 326 QLATFYILNLTGISIGGKQLQASGFA-------KGGILIDSGTVITRLPPSIYSALKAEF 378
            L   Y L++ GIS+G K+L              GG++IDSGT IT L    Y A++   
Sbjct: 279 ALPNMYFLSVKGISLGTKRLPIDPLVFAINDDGTGGVIIDSGTSITWLQQDAYEAVR-RG 337

Query: 379 LKQFSGFPSAPGFSI-LDTCFNLSAYQE--VNIPLVKMEFEGNAEMTVDVTGIVYFVKSD 435
           L      P+     I LDTCF         V +P     F+G A MT+     +  + S 
Sbjct: 338 LASTIPLPAMNDTDIGLDTCFQWPPPPNVTVTVPDFVFHFDG-ANMTLPPENYM-LIAST 395

Query: 436 ASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
              +CLA+A  S      IIGNYQQ+N  ++YD  NS L F    C
Sbjct: 396 TGYLCLAMAPTSVGT---IIGNYQQQNLHLLYDIANSFLSFVPAPC 438


>gi|414886964|tpg|DAA62978.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
          Length = 452

 Score =  164 bits (415), Expect = 8e-38,   Method: Compositional matrix adjust.
 Identities = 129/375 (34%), Positives = 186/375 (49%), Gaps = 39/375 (10%)

Query: 135 YIATIELGGRNMT--VIVDTGSDLTWVQCQPCKS-CYNQQDPVFDPSISPSYKKVLCNSS 191
           Y+  + +G   +    I DTGSDL W QC PC S C+ Q  P+++PS S ++  + CNSS
Sbjct: 92  YLMALAIGTPPLPYQAIADTGSDLIWTQCAPCTSQCFRQPTPLYNPSSSTTFAVLPCNSS 151

Query: 192 TCHALEFATGNSGVCSSSSPP--DCNYFVSYGDGSYTRGELGREHLGLGK-----ASVND 244
               L           ++ PP   C Y V+YG G +T    G E    G      A V  
Sbjct: 152 ----LSVCAAALAGTGTAPPPGCACTYNVTYGSG-WTSVFQGSETFTFGSTPAGHARVPG 206

Query: 245 FIFGCGRNNKGL-FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLIL 303
             FGC   + G      SGL+GLGR  LSLVSQ   +    FSYCL   QD  ++ +L+L
Sbjct: 207 IAFGCSTASSGFNASSASGLVGLGRGRLSLVSQ---LGVPKFSYCLTPYQDTNSTSTLLL 263

Query: 304 GGNSSVFKNSTPITYTNMIPNPQLA---TFYILNLTGISIGGKQLQASGFA-------KG 353
           G ++S+   +  ++ T  + +P  A   TFY LNLTGIS+G   L     A        G
Sbjct: 264 GPSASL-NGTAGVSSTPFVASPSTAPMNTFYYLNLTGISLGTTALSIPPDAFSLNADGTG 322

Query: 354 GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI--LDTCFNL--SAYQEVNIP 409
           G++IDSGT IT L  + Y  ++A  +   +  P+  G +   LD CF L  S      +P
Sbjct: 323 GLIIDSGTTITLLGNTAYQQVRAAVVSLVT-LPTTDGSADTGLDLCFMLPSSTSAPPAMP 381

Query: 410 LVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDT 469
            + + F G A+M +      Y +  D+   CLA+ + + + E  I+GNYQQ+N  ++YD 
Sbjct: 382 SMTLHFNG-ADMVLPADS--YMMSDDSGLWCLAMQNQT-DGEVNILGNYQQQNMHILYDI 437

Query: 470 KNSQLGFAGEDCSSM 484
               L FA   CS++
Sbjct: 438 GQETLSFAPAKCSAL 452


>gi|242050426|ref|XP_002462957.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
 gi|241926334|gb|EER99478.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
          Length = 452

 Score =  164 bits (415), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 124/360 (34%), Positives = 174/360 (48%), Gaps = 41/360 (11%)

Query: 148 VIVDTGSDLTWVQCQPCK-SCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVC 206
            I+DTGSDLTW QC PC  +C+ Q  P++DP+ S ++ K+ C S  C AL  A      C
Sbjct: 111 AIIDTGSDLTWTQCAPCTTACFAQPTPLYDPARSSTFSKLPCASPLCQALPSAFR---AC 167

Query: 207 SSSSPPDCNYFVSYGDGSYTRGELGREHLGL--------GKASVNDFIFGCGRNNKGLFG 258
           +++    C Y   Y  G +T G L  + L +          +S     FGC   N G   
Sbjct: 168 NATG---CVYDYRYAVG-FTAGYLAADTLAIGDGDGDGDASSSFAGVAFGCSTANGGDMD 223

Query: 259 GVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITY 318
           G SG++GLGRS LSL+SQ   I  G FSYCL S  DAGAS  ++ G  ++V  +   +  
Sbjct: 224 GASGIVGLGRSALSLLSQ---IGVGRFSYCLRSDADAGAS-PILFGALANVTGDK--VQS 277

Query: 319 TNMIPNP----QLATFYILNLTGISIGGKQLQAS----GF---AKGGILIDSGTVITRLP 367
           T ++ NP    + A +Y +NLTGI++G   L  +    GF     GG+++DSGT  T L 
Sbjct: 278 TALLRNPVAARRRAPYYYVNLTGIAVGSTDLPVTSSTFGFTAAGAGGVIVDSGTTFTYLA 337

Query: 368 PSIYSALKAEFLKQFSGF---PSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVD 424
            + Y+ L+  FL Q +G     S   F   D CF   A  +  +P +   F G AE  V 
Sbjct: 338 EAGYTMLRQAFLSQTAGLLTRVSGAQFD-FDLCFEAGA-ADTPVPRLVFRFAGGAEYAVP 395

Query: 425 VTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
                  V       CL +          +IGN  Q +  V+YD   +   FA  DC+S+
Sbjct: 396 RQSYFDAVDEGGRVACLLVLP---TRGVSVIGNVMQMDLHVLYDLDGATFSFAPADCASL 452


>gi|22831049|dbj|BAC15912.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|50508281|dbj|BAD32130.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
          Length = 453

 Score =  164 bits (415), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 130/376 (34%), Positives = 194/376 (51%), Gaps = 38/376 (10%)

Query: 134 NYIATIELG--GRNMTVIVDTGSDLTWVQCQPC-KSCYNQQDPVFDPSISPSYKKVLCNS 190
            YI T+ +G   ++   I DTGSDL W QC PC + C+ Q  P+++PS SP+++ + C+S
Sbjct: 91  EYIMTLAIGTPPQSYPAIADTGSDLVWTQCAPCGERCFKQPSPLYNPSSSPTFRVLPCSS 150

Query: 191 STCHALEFATGNSGVCSSSSPPDC--NYFVSYGDGSYTRGELGREHLGLG-----KASVN 243
               AL      + +  ++ PP C   Y  +YG G +T G  G E    G     +  V 
Sbjct: 151 ----ALNLCAAEARLAGATPPPGCACRYNQTYGTG-WTSGLQGSETFTFGSSPADQVRVP 205

Query: 244 DFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLIL 303
              FGC   +   + G +GL+GLGR  LSLVSQ   +  G+FSYCL   QD  +  +L+L
Sbjct: 206 GIAFGCSNASSDDWNGSAGLVGLGRGGLSLVSQ---LAAGMFSYCLTPFQDTKSKSTLLL 262

Query: 304 GGNSSVFK-NSTPITYTNMIPNPQ---LATFYILNLTGISIGGKQLQ--ASGFA-----K 352
           G  ++    N T +  T  +P+P    ++T+Y LNLTGIS+G   L      FA      
Sbjct: 263 GPAAAAAALNGTGVRSTPFVPSPSKPPMSTYYYLNLTGISVGPAALPIPPGAFALRADGT 322

Query: 353 GGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI--LDTCFNL--SAYQEVNI 408
           GG++IDSGT IT L  + Y  ++A  ++     P   G +   LD CF L  S+     +
Sbjct: 323 GGLIIDSGTTITSLVDAAYKRVRAA-VRSLVKLPVTDGSNATGLDLCFALPSSSAPPATL 381

Query: 409 PLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYD 468
           P + + F G A+M + V   +     D    CLA+ S + + E   +GNYQQ+N  ++YD
Sbjct: 382 PSMTLHFGGGADMVLPVENYMIL---DGGMWCLAMRSQT-DGELSTLGNYQQQNLHILYD 437

Query: 469 TKNSQLGFAGEDCSSM 484
            +   L FA   CS++
Sbjct: 438 VQKETLSFAPAKCSTL 453


>gi|226508080|ref|NP_001150678.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
 gi|195641018|gb|ACG39977.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
          Length = 450

 Score =  164 bits (415), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 128/375 (34%), Positives = 187/375 (49%), Gaps = 39/375 (10%)

Query: 135 YIATIELGGRNMT--VIVDTGSDLTWVQCQPCKS-CYNQQDPVFDPSISPSYKKVLCNSS 191
           Y+  + +G   +    I DTGSDL W QC PC S C+ Q  P+++PS S ++  + CNSS
Sbjct: 90  YLMALAIGTPPLPYQAIADTGSDLIWTQCAPCTSQCFRQPTPLYNPSSSTTFAVLPCNSS 149

Query: 192 TCHALEFATGNSGVCSSSSPP--DCNYFVSYGDGSYTRGELGREHLGLG-----KASVND 244
               L           ++ PP   C Y V+YG G +T    G E    G     ++ V  
Sbjct: 150 ----LSVCAAALAGTGTAPPPGCACTYNVTYGSG-WTSVFQGSETFTFGSTPAGQSRVPG 204

Query: 245 FIFGCGRNNKGL-FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLIL 303
             FGC   + G      SGL+GLGR  LSLVSQ   +    FSYCL   QD  ++ +L+L
Sbjct: 205 IAFGCSTASSGFNASSASGLVGLGRGRLSLVSQ---LGVPKFSYCLTPYQDTNSTSTLLL 261

Query: 304 GGNSSVFKNSTPITYTNMIPNPQLA---TFYILNLTGISIGGKQLQASGFA-------KG 353
           G ++S+   +  ++ T  + +P  A   TFY LNLTGIS+G   L     A        G
Sbjct: 262 GPSASL-NGTAGVSSTPFVASPSTAPMNTFYYLNLTGISLGTTALSIPPDAFLLNADGTG 320

Query: 354 GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI--LDTCFNL--SAYQEVNIP 409
           G++IDSGT IT L  + Y  ++A  +   +  P+  G +   LD CF L  S      +P
Sbjct: 321 GLIIDSGTTITLLGNTAYQQVRAAVVSLVT-LPTTDGSAATGLDLCFMLPSSTSAPPAMP 379

Query: 410 LVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDT 469
            + + F G A+M +      Y +  D+   CLA+ + + + E  I+GNYQQ+N  ++YD 
Sbjct: 380 SMTLHFNG-ADMVLPADS--YMMSDDSGLWCLAMQNQT-DGEVNILGNYQQQNMHILYDI 435

Query: 470 KNSQLGFAGEDCSSM 484
               L FA   CS++
Sbjct: 436 GQETLSFAPAKCSAL 450


>gi|326529233|dbj|BAK01010.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 441

 Score =  164 bits (414), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 137/446 (30%), Positives = 211/446 (47%), Gaps = 60/446 (13%)

Query: 67  ITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTS 126
           + ++L H +  +G   +  E+ +  + +    + Y Q + +   SG++          ++
Sbjct: 28  LRMKLTHVDDKAGYTTE--ERVRRAVAVSRERLAYTQQQQQLRASGDV----------SA 75

Query: 127 GIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQC-QPC--KSCYNQQDPVFDPSISP 181
            + L T  YIA   +G   +    ++DTGS+L W QC   C  K+C  Q  P ++ S S 
Sbjct: 76  PVHLATRQYIAEYLIGDPPQRAAALIDTGSNLIWTQCGTTCGLKACAKQDLPYYNLSRSS 135

Query: 182 SYKKVLCNSST--CHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGK 239
           ++  V C  S   C A       +GV        C +  SYG GS   G LG E     +
Sbjct: 136 TFAAVPCADSAKLCAA-------NGVHLCGLDGSCTFAASYGAGS-VFGSLGTEAFTF-Q 186

Query: 240 ASVNDFIFGC---GRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL-PSTQDA 295
           +      FGC    R  KG   G SGL+GLGR  LSLVSQT       FSYCL P  ++ 
Sbjct: 187 SGAAKLGFGCVSLTRITKGALNGASGLIGLGRGRLSLVSQTGAT---KFSYCLTPYLRNH 243

Query: 296 GASGSLILGGNSSVFKNSTPITYTNMIPNPQ---LATFYILNLTGISIGGKQLQ------ 346
           GAS  L +G ++S+      +T    + +P+    +TFY L L GIS+G  +L       
Sbjct: 244 GASSHLFVGASASLSGGGGAVTSIPFVKSPEDYPYSTFYYLPLVGISVGETKLPIPSAAF 303

Query: 347 -----ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFS-GFPSAPGFSILDTCFNL 400
                A+G+  GG++ID+G+ +T L  + YSAL  E  +Q +      P  + LD C   
Sbjct: 304 ELRRVAAGYWSGGVIIDTGSPVTSLAEAAYSALSDEVARQLNRSLVQPPADTGLDLCV-- 361

Query: 401 SAYQEVN--IPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNY 458
            A Q+V+  +P++   F G A+M V      Y+   D S  C+ +    YE    +IGN+
Sbjct: 362 -ARQDVDKVVPVLVFHFGGGADMAVSAGS--YWGPVDKSTACMLIEEGGYET---VIGNF 415

Query: 459 QQKNQRVIYDTKNSQLGFAGEDCSSM 484
           QQ++  ++YD    +L F   DCS +
Sbjct: 416 QQQDVHLLYDIGKGELSFQTADCSVL 441


>gi|302780643|ref|XP_002972096.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
 gi|300160395|gb|EFJ27013.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
          Length = 393

 Score =  164 bits (414), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 125/414 (30%), Positives = 194/414 (46%), Gaps = 33/414 (7%)

Query: 80  KIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATI 139
           K V  +E  +  +   +  V+++ +R  +    ++   ++ E PL          Y+  I
Sbjct: 4   KGVKRSEAIRALVAKSHARVRWMAARANSSSWSSMAGTTDVESPL----HPDGGGYVMDI 59

Query: 140 ELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALE 197
            +G  G+    I DTGSDL WVQ +PC  C      +FDP  S +++++ C+S  C  L 
Sbjct: 60  SVGTPGKRFRAIADTGSDLVWVQSEPCTGCSGGT--IFDPRQSSTFREMDCSSQLCAELP 117

Query: 198 FATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS-----VNDFIFGCGRN 252
                 G C   S   C+Y   YG G  T GE  R+ + LG  S        F  GCG  
Sbjct: 118 ------GSCEPGSS-TCSYSYEYGSGE-TEGEFARDTISLGTTSDGSQKFPSFAVGCGMV 169

Query: 253 NKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKN 312
           N G F GV GL+GLG+  +SL SQ S      FSYCL        S  L+ G ++++  +
Sbjct: 170 NSG-FDGVDGLVGLGQGPVSLTSQLSAAIDSKFSYCLVDINSQSESSPLLFGPSAAL--H 226

Query: 313 STPITYTNMI-PNPQLATFYILNLTGISIGGKQLQASGFAKGGILIDSGTVITRLPPSIY 371
            T I  T +  P+    T+Y+L + GI++ G+ +     + G  +IDSGT +T +P  +Y
Sbjct: 227 GTGIQSTKITPPSDTYPTYYLLTVNGIAVAGQTMG----SPGTTIIDSGTTLTYVPSGVY 282

Query: 372 SALKAEFLKQFSGFPSAPGFSI-LDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVY 430
             + +  ++     P   G S+ LD C++ S+ +    P + +   G A MT   +    
Sbjct: 283 GRVLSR-MESMVTLPRVDGSSMGLDLCYDRSSNRNYKFPALTIRLAG-ATMTPPSSNYFL 340

Query: 431 FVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
            V      VCLA+ S S      IIGN  Q+   ++YD  +S+L F    C S+
Sbjct: 341 VVDDSGDTVCLAMGSASGL-PVSIIGNVMQQGYHILYDRGSSELSFVQAKCESL 393


>gi|242089069|ref|XP_002440367.1| hypothetical protein SORBIDRAFT_09g030430 [Sorghum bicolor]
 gi|241945652|gb|EES18797.1| hypothetical protein SORBIDRAFT_09g030430 [Sorghum bicolor]
          Length = 462

 Score =  163 bits (413), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 126/429 (29%), Positives = 191/429 (44%), Gaps = 58/429 (13%)

Query: 67  ITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTS 126
           +   L H+   +       +   +RL  D    + +    +N+             P+ S
Sbjct: 78  VRFLLAHREAFAAPNATAAQLLAHRLARDAARAEAISVSARNVTRAG----GGFSAPVVS 133

Query: 127 GIRLQTLNYIATIELGGRNMT--VIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYK 184
           G+   +  Y A++ +G       +++DTGSD+ W+QC PC+ CY Q   VFDP  S SY 
Sbjct: 134 GLAQGSGEYFASVGVGTPPTPALLVLDTGSDVVWLQCAPCRQCYAQSGRVFDPRRSRSYA 193

Query: 185 KVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGK-ASVN 243
            V C +  C  L+   G        +   C Y V+YGDGS T G+L  E L   + A V 
Sbjct: 194 AVRCGAPPCRGLDAGGGGGCDRRRGT---CLYQVAYGDGSVTAGDLATETLWFARGARVP 250

Query: 244 DFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLIL 303
               GCG +N+GLF   +GL+GLGR  LSL +QT+  +G  FSYC               
Sbjct: 251 RVAVGCGHDNEGLFVAAAGLLGLGRGRLSLPTQTARRYGRRFSYC--------------- 295

Query: 304 GGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASG---------FAKGG 354
                 F+ S             L    I+      +GG +++  G           +GG
Sbjct: 296 ------FQGS------------DLDHRTIIRTVHQHVGGARVRGVGERSLRLDPSTGRGG 337

Query: 355 ILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAP-GFSILDTCFNLSAYQEVNIPLVKM 413
           +++DSGT +TRL   +Y A++  F     G   AP GFS+ DTC++L   + V +P V +
Sbjct: 338 VILDSGTSVTRLARPVYVAVREAFRAAAGGLRLAPGGFSLFDTCYDLRGRRVVKVPTVSV 397

Query: 414 EFEGNAEMTVDVTGIVYFVKSDAS-QVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNS 472
              G AE+ +      Y +  D     CLALA    +    I+GN QQ+  RV++D    
Sbjct: 398 HLAGGAEVALPPEN--YLIPVDTRGTFCLALAGT--DGGVSIVGNIQQQGFRVVFDGDRQ 453

Query: 473 QLGFAGEDC 481
           ++    + C
Sbjct: 454 RVALVPKSC 462


>gi|21617933|gb|AAM66983.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
          Length = 425

 Score =  163 bits (413), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 124/394 (31%), Positives = 192/394 (48%), Gaps = 35/394 (8%)

Query: 102 LQSRIKNMISGNIKDVSNTEIPLTSGIRL-QTLNYIATIELG--GRNMTVIVDTGSDLTW 158
           LQ + + +   ++  V  + +P+ SG  + Q+  YI    +G   + M V +DT +D  W
Sbjct: 54  LQDKARFLYLSSLAGVRKSSVPIASGRAIVQSPTYIVRANIGTPAQPMLVALDTSNDAAW 113

Query: 159 VQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFV 218
           + C  C  C +    +FDPS S S + + C +  C     A   S   S S    C + +
Sbjct: 114 IPCSGCVGCSSSV--LFDPSKSSSSRTLQCEAPQCKQ---APNPSCTVSKS----CGFNM 164

Query: 219 SYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTS 278
           +YG GS     L ++ L L    + ++ FGC     G      GLMGLGR  LSL+SQ+ 
Sbjct: 165 TYG-GSTIEAYLTQDTLTLASDVIPNYTFGCINKASGTSLPAQGLMGLGRGPLSLISQSQ 223

Query: 279 EIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGI 338
            ++   FSYCLP+++ +  SGSL LG  +   +    I  T ++ NP+ ++ Y +NL GI
Sbjct: 224 NLYQSTFSYCLPNSKSSNFSGSLRLGPKNQPIR----IKTTPLLKNPRRSSLYYVNLVGI 279

Query: 339 SIGGK--QLQASGFA-----KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGF 391
            +G K   +  S  A       G + DSGTV TRL    Y A++ EF ++     +A   
Sbjct: 280 RVGNKIVDIPTSALAFDPATGAGTIFDSGTVYTRLVEPAYVAVRNEFRRRVKNA-NATSL 338

Query: 392 SILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV-CLALAS--LSY 448
              DTC++ S    V  P V   F G   M V +      + S A  + CLA+A+  ++ 
Sbjct: 339 GGFDTCYSGS----VVFPSVTFMFAG---MNVTLPPDNLLIHSSAGNLSCLAMAAAPVNV 391

Query: 449 EDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
                +I + QQ+N RV+ D  NS+LG + E C+
Sbjct: 392 NSVLNVIASMQQQNHRVLIDVPNSRLGISRETCT 425


>gi|226493786|ref|NP_001142400.1| uncharacterized protein LOC100274575 [Zea mays]
 gi|194708650|gb|ACF88409.1| unknown [Zea mays]
          Length = 392

 Score =  163 bits (413), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 129/375 (34%), Positives = 186/375 (49%), Gaps = 39/375 (10%)

Query: 135 YIATIELGGRNMT--VIVDTGSDLTWVQCQPCKS-CYNQQDPVFDPSISPSYKKVLCNSS 191
           Y+  + +G   +    I DTGSDL W QC PC S C+ Q  P+++PS S ++  + CNSS
Sbjct: 32  YLMALAIGTPPLPYQAIADTGSDLIWTQCAPCTSQCFRQPTPLYNPSSSTTFAVLPCNSS 91

Query: 192 TCHALEFATGNSGVCSSSSPPDC--NYFVSYGDGSYTRGELGREHLGLGK-----ASVND 244
               L           ++ PP C   Y V+YG G +T    G E    G      A V  
Sbjct: 92  ----LSVCAAALAGTGTAPPPGCACTYNVTYGSG-WTSVFQGSETFTFGSTPAGHARVPG 146

Query: 245 FIFGCGRNNKGL-FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLIL 303
             FGC   + G      SGL+GLGR  LSLVSQ   +    FSYCL   QD  ++ +L+L
Sbjct: 147 IAFGCSTASSGFNASSASGLVGLGRGRLSLVSQ---LGVPKFSYCLTPYQDTNSTSTLLL 203

Query: 304 GGNSSVFKNSTPITYTNMIPNPQLA---TFYILNLTGISIGGKQLQASGFA-------KG 353
           G ++S+   +  ++ T  + +P  A   TFY LNLTGIS+G   L     A        G
Sbjct: 204 GPSASL-NGTAGVSSTPFVASPSTAPMNTFYYLNLTGISLGTTALSIPPDAFSLNADGTG 262

Query: 354 GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI--LDTCFNL--SAYQEVNIP 409
           G++IDSGT IT L  + Y  ++A  +   +  P+  G +   LD CF L  S      +P
Sbjct: 263 GLIIDSGTTITLLGNTAYQQVRAAVVSLVT-LPTTDGSADTGLDLCFMLPSSTSAPPAMP 321

Query: 410 LVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDT 469
            + + F G A+M +      Y +  D+   CLA+ + + + E  I+GNYQQ+N  ++YD 
Sbjct: 322 SMTLHFNG-ADMVLPADS--YMMSDDSGLWCLAMQNQT-DGEVNILGNYQQQNMHILYDI 377

Query: 470 KNSQLGFAGEDCSSM 484
               L FA   CS++
Sbjct: 378 GQETLSFAPAKCSAL 392


>gi|15232503|ref|NP_191008.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|7288018|emb|CAB81805.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
 gi|17979257|gb|AAL49945.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
 gi|21700851|gb|AAM70549.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
 gi|332645705|gb|AEE79226.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 425

 Score =  163 bits (413), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 124/394 (31%), Positives = 192/394 (48%), Gaps = 35/394 (8%)

Query: 102 LQSRIKNMISGNIKDVSNTEIPLTSGIRL-QTLNYIATIELG--GRNMTVIVDTGSDLTW 158
           LQ + + +   ++  V  + +P+ SG  + Q+  YI    +G   + M V +DT +D  W
Sbjct: 54  LQDKARFLYLSSLAGVRKSSVPIASGRAIVQSPTYIVRANIGTPAQPMLVALDTSNDAAW 113

Query: 159 VQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFV 218
           + C  C  C +    +FDPS S S + + C +  C     A   S   S S    C + +
Sbjct: 114 IPCSGCVGCSSSV--LFDPSKSSSSRTLQCEAPQCKQ---APNPSCTVSKS----CGFNM 164

Query: 219 SYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTS 278
           +YG GS     L ++ L L    + ++ FGC     G      GLMGLGR  LSL+SQ+ 
Sbjct: 165 TYG-GSTIEAYLTQDTLTLASDVIPNYTFGCINKASGTSLPAQGLMGLGRGPLSLISQSQ 223

Query: 279 EIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGI 338
            ++   FSYCLP+++ +  SGSL LG  +   +    I  T ++ NP+ ++ Y +NL GI
Sbjct: 224 NLYQSTFSYCLPNSKSSNFSGSLRLGPKNQPIR----IKTTPLLKNPRRSSLYYVNLVGI 279

Query: 339 SIGGK--QLQASGFA-----KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGF 391
            +G K   +  S  A       G + DSGTV TRL    Y A++ EF ++     +A   
Sbjct: 280 RVGNKIVDIPTSALAFDPATGAGTIFDSGTVYTRLVEPAYVAVRNEFRRRVKNA-NATSL 338

Query: 392 SILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV-CLALAS--LSY 448
              DTC++ S    V  P V   F G   M V +      + S A  + CLA+A+  ++ 
Sbjct: 339 GGFDTCYSGS----VVFPSVTFMFAG---MNVTLPPDNLLIHSSAGNLSCLAMAAAPVNV 391

Query: 449 EDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
                +I + QQ+N RV+ D  NS+LG + E C+
Sbjct: 392 NSVLNVIASMQQQNHRVLIDVPNSRLGISRETCT 425


>gi|125558622|gb|EAZ04158.1| hypothetical protein OsI_26300 [Oryza sativa Indica Group]
          Length = 435

 Score =  163 bits (412), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 115/349 (32%), Positives = 170/349 (48%), Gaps = 26/349 (7%)

Query: 148 VIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCS 207
           V+ DTGSDL W QC PC  C+ Q  P F P+ S ++ K+ C SS C   +F   +   C+
Sbjct: 101 VVADTGSDLIWTQCAPCTKCFQQPAPPFQPASSSTFSKLPCTSSFC---QFLPNSIRTCN 157

Query: 208 SSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLG 267
           ++    C Y   YG G YT G L  E L +G AS     FGC   N G+    SG+ GLG
Sbjct: 158 ATG---CVYNYKYGSG-YTAGYLATETLKVGDASFPSVAFGCSTEN-GVGNSTSGIAGLG 212

Query: 268 RSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQL 327
           R  LSL+ Q      G FSYCL S   AGAS  ++ G  +++   +  +  T  + NP +
Sbjct: 213 RGALSLIPQLGV---GRFSYCLRSGSAAGAS-PILFGSLANLTDGN--VQSTPFVNNPAV 266

Query: 328 -ATFYILNLTGISIGGKQLQAS----GFAK----GGILIDSGTVITRLPPSIYSALKAEF 378
             ++Y +NLTGI++G   L  +    GF +    GG ++DSGT +T L    Y  +K  F
Sbjct: 267 HPSYYYVNLTGITVGETDLPVTTSTFGFTQNGLGGGTIVDSGTTLTYLAKDGYEMVKQAF 326

Query: 379 LKQFSGFPSAPGFSILDTCF-NLSAYQEVNIPLVKMEFEGNAEMTVDV--TGIVYFVKSD 435
           L Q +   +  G   LD CF +      + +P + + F+G AE  V     G+    +  
Sbjct: 327 LSQTANVTTVNGTRGLDLCFKSTGGGGGIAVPSLVLRFDGGAEYAVPTYFAGVETDSQGS 386

Query: 436 ASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
            +  CL +     +    +IGN  Q +  ++YD       F+  DC+ +
Sbjct: 387 VTVACLMMLPAKGDQPMSVIGNVMQMDMHLLYDLDGGIFSFSPADCAKV 435


>gi|297820186|ref|XP_002877976.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297323814|gb|EFH54235.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 425

 Score =  163 bits (412), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 124/394 (31%), Positives = 192/394 (48%), Gaps = 35/394 (8%)

Query: 102 LQSRIKNMISGNIKDVSNTEIPLTSGIRL-QTLNYIATIELG--GRNMTVIVDTGSDLTW 158
           LQ + + +   ++  V+ + +P+ SG  + Q+  YI    +G   + M V +DT +D  W
Sbjct: 54  LQDKARFLYLSSLAGVTKSSVPIASGRGIVQSPTYIVRANIGTPAQAMLVALDTSNDAAW 113

Query: 159 VQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFV 218
           + C  C  C +    +FDPS S S + + C +  C     A   S   S S    C + +
Sbjct: 114 IPCSGCVGCSSSV--LFDPSKSSSSRTLQCEAPQCKQ---APNPSCTVSKS----CGFNM 164

Query: 219 SYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTS 278
           +YG GS     L ++ L L    + ++ FGC     G      GLMGLGR  LSL+SQ+ 
Sbjct: 165 TYG-GSAIEAYLTQDTLTLATDVIPNYTFGCINKASGTSLPAQGLMGLGRGPLSLISQSQ 223

Query: 279 EIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGI 338
            ++   FSYCLP+++ +  SGSL LG  +   +    I  T ++ NP+ ++ Y +NL GI
Sbjct: 224 NLYQSTFSYCLPNSKSSNFSGSLRLGPKNQPIR----IKTTPLLKNPRRSSLYYVNLVGI 279

Query: 339 SIGGK--QLQASGFA-----KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGF 391
            +G K   +  S  A       G + DSGTV TRL    Y A++ EF ++     +A   
Sbjct: 280 RVGNKIVDIPTSALAFDPATGAGTIFDSGTVYTRLVEPAYVAMRNEFRRRVKN-ANATSL 338

Query: 392 SILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV-CLALASLSYED 450
              DTC++ S    V  P V   F G   M V +      + S A  + CLA+A+     
Sbjct: 339 GGFDTCYSGS----VVFPSVTFMFAG---MNVTLPPDNLLIHSSAGNLSCLAMAAAPTNV 391

Query: 451 ET--GIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
            +   +I + QQ+N RV+ D  NS+LG + E C+
Sbjct: 392 NSVLNVIASMQQQNHRVLIDVPNSRLGISRETCT 425


>gi|125543640|gb|EAY89779.1| hypothetical protein OsI_11321 [Oryza sativa Indica Group]
          Length = 434

 Score =  162 bits (411), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 127/370 (34%), Positives = 182/370 (49%), Gaps = 27/370 (7%)

Query: 130 LQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVL 187
           + T  Y+  + +G   + + + +DTGSDL W QCQPC +C++Q  P FDPS S +     
Sbjct: 77  VPTTEYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTS 136

Query: 188 CNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL--GKASVNDF 245
           C+S+ C  L  A+  S     +    C Y  SYGD S T G L  +        ASV   
Sbjct: 137 CDSTLCQGLPVASCGSPKFWPNQ--TCVYTYSYGDKSVTTGFLEVDKFTFVGAGASVPGV 194

Query: 246 IFGCGRNNKGLF-GGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILG 304
            FGCG  N G+F    +G+ G GR  LSL SQ      G FS+C  +      S +++L 
Sbjct: 195 AFGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKV---GNFSHCFTAVNGLKPS-TVLLD 250

Query: 305 GNSSVFKNST-PITYTNMIPNPQLATFYILNLTGISIGGKQLQA--SGFA----KGGILI 357
             + ++K+    +  T +I NP   TFY L+L GI++G  +L    S F      GG +I
Sbjct: 251 LPADLYKSGRGAVQSTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFTLKNGTGGTII 310

Query: 358 DSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVN--IPLVKMEF 415
           DSGT +T LP  +Y  ++  F  Q    P   G +  D  F LSA       +P + + F
Sbjct: 311 DSGTAMTSLPTRVYRLVRDAFAAQVK-LPVVSG-NTTDPYFCLSAPLRAKPYVPKLVLHF 368

Query: 416 EGNAEMTVDVTGIVYFVKSDASQV-CLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQL 474
           EG A M +     V+ V+   S + CLA+       E   IGN+QQ+N  V+YD +NS+L
Sbjct: 369 EG-ATMDLPRENYVFEVEDAGSSILCLAIIE---GGEVTTIGNFQQQNMHVLYDLQNSKL 424

Query: 475 GFAGEDCSSM 484
            F    C  +
Sbjct: 425 SFVPAQCDKL 434


>gi|242084332|ref|XP_002442591.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
 gi|241943284|gb|EES16429.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
          Length = 493

 Score =  162 bits (411), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 137/390 (35%), Positives = 197/390 (50%), Gaps = 49/390 (12%)

Query: 120 TEIPLTSGIRLQTLNYIATIELGGRNMTVI--VDTGSDLTWVQCQPCKSCYNQQDPVFDP 177
           +  P TSG       Y+A I +G   +  +  +DTGSD+TW+QCQPC+ CY Q  PVFDP
Sbjct: 125 SRAPTTSG------EYMAKIAVGTPAVEALLAMDTGSDITWLQCQPCRRCYPQSGPVFDP 178

Query: 178 SISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYG-DGSYTRGELGREHLG 236
             S SY+++  ++  C AL  + G      +     C Y V YG DGS T G+   E L 
Sbjct: 179 RHSTSYREMGYDAPDCQALGRSGGGDAKRMT-----CVYAVGYGDDGSTTVGDFIEETLT 233

Query: 237 L-GKASVNDFIFGCGRNNKGLFGG-VSGLMGLGRSDLSLVSQTSEIFGGL--FSYCLP-- 290
             G   V     GCG +NKGLF    +G++GLGR  +S  SQ + +   +  FSYCL   
Sbjct: 234 FAGGVQVPHMSIGCGHDNKGLFAAPAAGILGLGRGQISCPSQIAALGYNVTSFSYCLADF 293

Query: 291 --STQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGG------ 342
             S+     S +L +G  ++    S P ++T  + N  +ATFY + L G+S+GG      
Sbjct: 294 FLSSPGRSVSSTLTIGDGAAA--GSPPPSFTPTVQNLNMATFYYVRLVGVSVGGVRVPGV 351

Query: 343 --KQLQASGF-AKGGILIDSGTVITRLPPSIYSALKAEF------LKQFS-GFPSAPGFS 392
               L+   +  +GG+++DSGT +TRL    Y A +  F      L Q S G PS  GF 
Sbjct: 352 TEDDLKLDPYTGRGGVILDSGTAVTRLARRAYIAFRDAFRAAAVDLGQVSIGGPS--GF- 408

Query: 393 ILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDA-SQVCLALASLSYEDE 451
             DTC+ +   + + +P V M F G  E+T+      Y +  D+   VC A A    +  
Sbjct: 409 -FDTCYTMGG-RAMKVPTVSMHFAGGVELTLPPKN--YLIPVDSMGTVCFAFAGTG-DRS 463

Query: 452 TGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
             IIGN QQ+  RV+Y+    ++GFA   C
Sbjct: 464 VSIIGNIQQQGFRVVYNIGGGRVGFAPNSC 493


>gi|195626958|gb|ACG35309.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
          Length = 450

 Score =  162 bits (410), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 123/391 (31%), Positives = 194/391 (49%), Gaps = 32/391 (8%)

Query: 104 SRIKNMISGNIKDVSNTEIPLTSGIRL-QTLNYIATIELG--GRNMTVIVDTGSDLTWVQ 160
           SR+  + S  ++  +    P+ SG +L QTL Y+    LG   + + + VDT +D +W+ 
Sbjct: 80  SRLLYLDSLAVRGRARAYAPIASGRQLLQTLTYVVRASLGTPPQQLLLAVDTSNDASWIP 139

Query: 161 CQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPP---DCNYF 217
           C  C  C       FDP+ S SY+ V C S  C     A   +  C    PP    C + 
Sbjct: 140 CAGCAGCPTSSAAPFDPAASASYRTVPCGSPLC-----AQAPNAAC----PPGGKACGFS 190

Query: 218 VSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQT 277
           ++Y D S  +  L ++ L +   +V  + FGC +   G      GL+GLGR  LS +SQT
Sbjct: 191 LTYADSSL-QAALSQDSLAVAGNAVKAYTFGCLQRATGTAAPPQGLLGLGRGPLSFLSQT 249

Query: 278 SEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTG 337
            +++   FSYCLPS +    SG+L LG N    +    I  T ++ NP  ++ Y +N+TG
Sbjct: 250 KDMYEATFSYCLPSFKSLNFSGTLRLGRNGQPQR----IKTTPLLANPHRSSLYYVNMTG 305

Query: 338 ISIGGKQLQASGF--AKG-GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI- 393
           + +G K +    F  A G G ++DSGT+ TRL    Y A++ E  ++      AP  S+ 
Sbjct: 306 VRVGRKVVPIPAFDPATGAGTVLDSGTMFTRLVAPAYVAVRDEVRRRV----GAPVSSLG 361

Query: 394 -LDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDET 452
             DTCFN +A   V  P + + F+G      +   +++      S + +A A        
Sbjct: 362 GFDTCFNTTA---VAWPPMTLLFDGMQVTLPEENVVIHSTYGTISCLAMAAAPDGVNTVL 418

Query: 453 GIIGNYQQKNQRVIYDTKNSQLGFAGEDCSS 483
            +I + QQ+N RV++D  N ++GFA E C++
Sbjct: 419 NVIASMQQQNHRVLFDVPNGRVGFARERCTA 449


>gi|54290724|dbj|BAD62394.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
          Length = 523

 Score =  162 bits (410), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 111/356 (31%), Positives = 173/356 (48%), Gaps = 37/356 (10%)

Query: 144 RNMTVIVDTGSDLTWVQCQPC---KSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFAT 200
           +   V  DT   ++ ++C+PC     C    DP F+PS S S+  + C S  C A+E   
Sbjct: 187 QRFPVAFDTNFGVSVLRCKPCVGGAPC----DPAFEPSRSSSFAAIPCGSPEC-AVE--- 238

Query: 201 GNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGK-ASVNDFIFGCGR--NNKGLF 257
                C+ +S   C + + +G+ +   G L R+ L L   A+   F FGC     +   F
Sbjct: 239 -----CTGAS---CPFTIQFGNVTVANGTLVRDTLTLPPSATFAGFTFGCIEVGADADTF 290

Query: 258 GGVSGLMGLGRSDLSLVSQT----SEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNS 313
            G  GL+ L RS  SL S+     +      FSYCLPS+    + G L +G +   +   
Sbjct: 291 DGAVGLIDLSRSSHSLASRVISNGATTSAAAFSYCLPSSSATSSRGFLSIGASRPEYSGG 350

Query: 314 TPITYTNMIPNPQLATFYILNLTGISIGGKQLQASG--FAKGGILIDSGTVITRLPPSIY 371
             I Y  M  NP     Y ++L GIS+GG+ L      FA  G L+++ T  T L P+ Y
Sbjct: 351 D-IKYAPMSSNPNHPNSYFVDLVGISVGGEDLPVPPAVFAAHGTLLEAATEFTFLAPAAY 409

Query: 372 SALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYF 431
           +AL+  F K  + +P+AP F +LDTC+NL+    + +P V + F G  E+ +DV  ++YF
Sbjct: 410 AALRDAFRKDMAPYPAAPPFRVLDTCYNLTGLASLAVPAVALRFAGGTELELDVRQMMYF 469

Query: 432 VKSDASQVCLALA------SLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
             +D S V  ++A      +        +IG   Q++  V+YD +  ++GF    C
Sbjct: 470 --ADPSSVFSSVACLAFAAAPLPAFPVSVIGTLAQRSTEVVYDLRGGRVGFIPGRC 523


>gi|125596976|gb|EAZ36756.1| hypothetical protein OsJ_21092 [Oryza sativa Japonica Group]
          Length = 435

 Score =  162 bits (410), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 111/356 (31%), Positives = 173/356 (48%), Gaps = 37/356 (10%)

Query: 144 RNMTVIVDTGSDLTWVQCQPC---KSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFAT 200
           +   V  DT   ++ ++C+PC     C    DP F+PS S S+  + C S  C A+E   
Sbjct: 99  QRFPVAFDTNFGVSVLRCKPCVGGAPC----DPAFEPSRSSSFAAIPCGSPEC-AVE--- 150

Query: 201 GNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGK-ASVNDFIFGCGR--NNKGLF 257
                C+ +S   C + + +G+ +   G L R+ L L   A+   F FGC     +   F
Sbjct: 151 -----CTGAS---CPFTIQFGNVTVANGTLVRDTLTLPPSATFAGFTFGCIEVGADADTF 202

Query: 258 GGVSGLMGLGRSDLSLVSQT----SEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNS 313
            G  GL+ L RS  SL S+     +      FSYCLPS+    + G L +G +   +   
Sbjct: 203 DGAVGLIDLSRSSHSLASRVISNGATTSAAAFSYCLPSSSATSSRGFLSIGASRPEYSGG 262

Query: 314 TPITYTNMIPNPQLATFYILNLTGISIGGKQLQASG--FAKGGILIDSGTVITRLPPSIY 371
             I Y  M  NP     Y ++L GIS+GG+ L      FA  G L+++ T  T L P+ Y
Sbjct: 263 D-IKYAPMSSNPNHPNSYFVDLVGISVGGEDLPVPPAVFAAHGTLLEAATEFTFLAPAAY 321

Query: 372 SALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYF 431
           +AL+  F K  + +P+AP F +LDTC+NL+    + +P V + F G  E+ +DV  ++YF
Sbjct: 322 AALRDAFRKDMAPYPAAPPFRVLDTCYNLTGLASLAVPAVALRFAGGTELELDVRQMMYF 381

Query: 432 VKSDASQVCLALA------SLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
             +D S V  ++A      +        +IG   Q++  V+YD +  ++GF    C
Sbjct: 382 --ADPSSVFSSVACLAFAAAPLPAFPVSVIGTLAQRSTEVVYDLRGGRVGFIPGRC 435


>gi|357163818|ref|XP_003579856.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 467

 Score =  162 bits (409), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 126/381 (33%), Positives = 176/381 (46%), Gaps = 59/381 (15%)

Query: 147 TVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVC 206
           T  +DT SDL W QCQPC  CY Q DPVF+P  S SY  V CNS TC  L+     +  C
Sbjct: 102 TAAIDTASDLIWTQCQPCVKCYKQLDPVFNPVASTSYAVVPCNSDTCDELD-----THRC 156

Query: 207 SSSSPPD----CNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNN-KGLFGGVS 261
           +     D    C Y  SYG  + TRG L  + L +G       +FGC  ++  G    VS
Sbjct: 157 ARDGDSDDEDACQYTYSYGGNATTRGILAVDRLAIGDDVFRGVVFGCSSSSVGGPPPQVS 216

Query: 262 GLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSV-FKNSTPITYTN 320
           G++GLGR  LSLVSQ S      F YCLP      A G L+LG +++   +N++      
Sbjct: 217 GVVGLGRGALSLVSQLSV---RRFMYCLPPPVSRSA-GRLVLGADAAATVRNASERVVVP 272

Query: 321 MIPNPQLATFYILNLTGISIGGKQLQ----------ASGFAKG----------------- 353
           M    +  ++Y LNL GISIG + +             G A G                 
Sbjct: 273 MSTGSRYPSYYYLNLDGISIGDRAMSFRSRNRMNATTPGTAAGAPASPVSGSGDGDGSGT 332

Query: 354 -----GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI-LDTCFNLS---AYQ 404
                G++ID  + IT L  S+Y  +  +  ++    P   G  + LD CF L       
Sbjct: 333 GPDAYGMIIDIASTITFLEESLYEEMVDDLEEEIR-LPRGSGSDLGLDLCFILPEGVPMS 391

Query: 405 EVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDAS-QVCLALASLSYEDETGIIGNYQQKNQ 463
            V  P V + FEG   + +D   +  FV+  AS  +CL +      D   I+GNYQQ+N 
Sbjct: 392 RVYAPPVSLAFEG-VWLRLDKEQM--FVEDRASGMMCLMVGK---TDGVSILGNYQQQNM 445

Query: 464 RVIYDTKNSQLGFAGEDCSSM 484
           +V+Y+ +  ++ F    C S+
Sbjct: 446 QVMYNLRRGRITFIKTACESV 466


>gi|242046812|ref|XP_002461152.1| hypothetical protein SORBIDRAFT_02g041760 [Sorghum bicolor]
 gi|241924529|gb|EER97673.1| hypothetical protein SORBIDRAFT_02g041760 [Sorghum bicolor]
          Length = 452

 Score =  162 bits (409), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 129/422 (30%), Positives = 201/422 (47%), Gaps = 50/422 (11%)

Query: 78  SGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRL-QTLNYI 136
           +G + D   +  +RL    L++  L +R K          +    P+ SG +L QT  Y+
Sbjct: 66  AGFLADQASRDASRL----LYLDSLAARGK----------ARAYAPIASGRQLLQTPTYV 111

Query: 137 ATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCH 194
               LG   + + + VDT +D  W+ C  C  C     P FDP+ S SY+ V C S  C 
Sbjct: 112 VRARLGTPPQQLLLAVDTSNDAAWIPCAGCAGCPTSSAPPFDPAASTSYRSVPCGSPLC- 170

Query: 195 ALEFATGNSGVCSSSSPP---DCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGR 251
               A   +  C    PP    C + ++Y D S  +  L ++ L +   +V  + FGC +
Sbjct: 171 ----AQAPNAAC----PPGGKACGFSLTYADSSL-QAALSQDSLAVAGDAVKTYTFGCLQ 221

Query: 252 NNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFK 311
              G      GL+GLGR  LS +SQT +++ G FSYCLPS +    SG+L LG N    +
Sbjct: 222 KATGTAAPPQGLLGLGRGPLSFLSQTRDMYQGTFSYCLPSFKSLNFSGTLRLGRNGQPPR 281

Query: 312 NSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA-------KGGILIDSGTVIT 364
               I  T ++ NP  ++ Y +N+TGI +G K +     A         G ++DSGT+ T
Sbjct: 282 ----IKTTPLLANPHRSSLYYVNMTGIRVGRKVVPIPPPALAFDPATGAGTVLDSGTMFT 337

Query: 365 RLPPSIYSALKAEFLKQFSGFPSAPGFSI--LDTCFNLSAYQEVNIPLVKMEFEGNAEMT 422
           RL    Y A++ E  ++      AP  S+   DTCFN +A   V  P V + F+G     
Sbjct: 338 RLVAPAYVAVRDEVRRRV----GAPVSSLGGFDTCFNTTA---VAWPPVTLLFDGMQVTL 390

Query: 423 VDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
            +   +++      S + +A A         +I + QQ+N RV++D  N ++GFA E C+
Sbjct: 391 PEENVVIHSTYGTISCLAMAAAPDGVNTVLNVIASMQQQNHRVLFDVPNGRVGFARERCT 450

Query: 483 SM 484
           ++
Sbjct: 451 AV 452


>gi|115489316|ref|NP_001067145.1| Os12g0583300 [Oryza sativa Japonica Group]
 gi|77556903|gb|ABA99699.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113649652|dbj|BAF30164.1| Os12g0583300 [Oryza sativa Japonica Group]
 gi|125537189|gb|EAY83677.1| hypothetical protein OsI_38901 [Oryza sativa Indica Group]
          Length = 446

 Score =  161 bits (408), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 127/382 (33%), Positives = 181/382 (47%), Gaps = 43/382 (11%)

Query: 128 IRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPC--KSCYNQQDPVFDPSISPSY 183
           +R  TL Y+A   +G   +    ++DTGSDL W QC  C  K C  Q  P ++ S S ++
Sbjct: 83  VRWATLQYVAEYLIGDPPQRAEALIDTGSDLVWTQCSTCLRKVCARQALPYYNSSASSTF 142

Query: 184 KKVLCNSSTCHA----LEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGK 239
             V C +  C A    + F    +G         C+    YG G    G LG E     +
Sbjct: 143 APVPCAARICAANDDIIHFCDLAAG---------CSVIAGYGAG-VVAGTLGTEAFAF-Q 191

Query: 240 ASVNDFIFGC---GRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL-PSTQDA 295
           +   +  FGC    R  +G   G SGL+GLGR  LSLVSQT       FSYCL P   + 
Sbjct: 192 SGTAELAFGCVTFTRIVQGALHGASGLIGLGRGRLSLVSQTGAT---KFSYCLTPYFHNN 248

Query: 296 GASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ--------- 346
           GA+G L +G ++S+  +   +T T  +  P+ + FY L L G+++G  +L          
Sbjct: 249 GATGHLFVGASASLGGHGDVMT-TQFVKGPKGSPFYYLPLIGLTVGETRLPIPATVFDLR 307

Query: 347 --ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQ 404
             A G   GG++IDSG+  T L    Y AL +E   + +G   AP     D    + A +
Sbjct: 308 EVAPGLFSGGVIIDSGSPFTSLVHDAYDALASELAARLNGSLVAPPPDADDGALCV-ARR 366

Query: 405 EVN--IPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKN 462
           +V   +P V   F G A+M V      Y+   D +  C+A+AS        +IGNYQQ+N
Sbjct: 367 DVGRVVPAVVFHFRGGADMAVPAES--YWAPVDKAAACMAIASAGPYRRQSVIGNYQQQN 424

Query: 463 QRVIYDTKNSQLGFAGEDCSSM 484
            RV+YD  N    F   DCS++
Sbjct: 425 MRVLYDLANGDFSFQPADCSAL 446


>gi|125558629|gb|EAZ04165.1| hypothetical protein OsI_26307 [Oryza sativa Indica Group]
          Length = 455

 Score =  161 bits (408), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 116/363 (31%), Positives = 179/363 (49%), Gaps = 33/363 (9%)

Query: 145 NMTVIVDTGSDLTWVQCQPCKSCYNQQD--PVFDPSISPSYKKVLCNSSTCHALEFATGN 202
           +  VIVDTGS+L W QC PC  C+ +    PV  P+ S ++ ++ CN S C  L  ++  
Sbjct: 103 DFPVIVDTGSNLIWAQCAPCTRCFPRPTPAPVLQPARSSTFSRLPCNGSFCQYLPTSS-R 161

Query: 203 SGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSG 262
              C++++   C Y  +YG G YT G L  E L +G  +     FGC   N       SG
Sbjct: 162 PRTCNATA--ACAYNYTYGSG-YTAGYLATETLTVGDGTFPKVAFGCSTENG--VDNSSG 216

Query: 263 LMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMI 322
           ++GLGR  LSLVSQ +    G FSYCL S    G +  ++ G  + + + S  +  T ++
Sbjct: 217 IVGLGRGPLSLVSQLAV---GRFSYCLRSDMADGGASPILFGSLAKLTERSV-VQSTPLL 272

Query: 323 PNP--QLATFYILNLTGISIGGKQLQAS----GFAK----GGILIDSGTVITRLPPSIYS 372
            NP  Q +T Y +NLTGI++   +L  +    GF +    GG ++DSGT +T L    Y+
Sbjct: 273 KNPYLQRSTHYYVNLTGIAVDSTELPVTGSTFGFTQTGLGGGTIVDSGTTLTYLAKDGYA 332

Query: 373 ALKAEFLKQFSGF----PSAPGFSILDTCFNLSA---YQEVNIPLVKMEFEGNAEMTVDV 425
            +K  F  Q +      P++     LD C+  SA    + V +P + + F G A+  V V
Sbjct: 333 MVKQAFQSQMANLNQTTPASGAPYDLDLCYKPSAGGGGKAVRVPRLALRFAGGAKYNVPV 392

Query: 426 TGIVYFVKSDA----SQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
                 V++D+    +  CL +   + +    IIGN  Q +  ++YD       FA  DC
Sbjct: 393 QNYFAGVEADSQGRVTVACLLVLPATDDLPISIIGNLMQMDMHLLYDIDGGMFSFAPADC 452

Query: 482 SSM 484
           + +
Sbjct: 453 AKL 455


>gi|302781726|ref|XP_002972637.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
 gi|300160104|gb|EFJ26723.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
          Length = 393

 Score =  161 bits (408), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 123/412 (29%), Positives = 192/412 (46%), Gaps = 33/412 (8%)

Query: 82  VDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIEL 141
           V  +E  +  +   +  V+++ +R  +    ++   ++ E PL          Y+  I +
Sbjct: 6   VKRSEAIRGLVAKSHARVRWMAARANSSSWSSMAGTTDVESPL----HPDGGGYVMDISV 61

Query: 142 G--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFA 199
           G  G+    I DTGSDL WVQ +PC  C      +FDP  S +++++ C+S  C  L   
Sbjct: 62  GTPGKRFRAIADTGSDLVWVQSEPCTGCSGGT--IFDPRQSSTFREMDCSSQLCTELP-- 117

Query: 200 TGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS-----VNDFIFGCGRNNK 254
               G C   S   C+Y   YG G  T GE  R+ + LG  S        F  GCG  N 
Sbjct: 118 ----GSCEPGSSA-CSYSYEYGSGE-TEGEFARDTISLGTTSGGSQKFPSFAVGCGMVNS 171

Query: 255 GLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNST 314
           G F GV GL+GLG+  +SL SQ S      FSYCL        S  L+ G ++++  + T
Sbjct: 172 G-FDGVDGLVGLGQGPVSLTSQLSAAIDSKFSYCLVDINSQSESSPLLFGPSAAL--HGT 228

Query: 315 PITYTNMI-PNPQLATFYILNLTGISIGGKQLQASGFAKGGILIDSGTVITRLPPSIYSA 373
            I  T +  P+    T+Y+L + GI++ G+ + + G      +IDSGT +T +P  +Y  
Sbjct: 229 GIQSTKITPPSDTYPTYYLLTVNGIAVAGQTMGSPGTT----IIDSGTTLTYVPSGVYGR 284

Query: 374 LKAEFLKQFSGFPSAPGFSI-LDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFV 432
           + +  ++     P   G S+ LD C++ S+ +    P + +   G A MT   +     V
Sbjct: 285 VLSR-MESMVTLPRVDGSSMGLDLCYDRSSNRNYKFPALTIRLAG-ATMTPPSSNYFLVV 342

Query: 433 KSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
                 VCLA+ S        IIGN  Q+   ++YD  +S+L F    C S+
Sbjct: 343 DDSGDTVCLAMGSAGGL-PVSIIGNVMQQGYHILYDRGSSELSFVQAKCESL 393


>gi|326507654|dbj|BAK03220.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 442

 Score =  161 bits (408), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 129/376 (34%), Positives = 191/376 (50%), Gaps = 47/376 (12%)

Query: 135 YIATIELGGRNMT--VIVDTGSDLTWVQCQPCKS-CYNQQDPVFDPSISPSYKKVLCNSS 191
           YI T+ +G   ++   I DTGSDL W QC PC S C+ Q    ++PS S ++  + CNSS
Sbjct: 88  YIMTLAIGTPPLSYPAIADTGSDLIWTQCAPCGSQCFKQAGQPYNPSSSTTFGVLPCNSS 147

Query: 192 T--CHALEFATGNSGVCSSSSPPDCN--YFVSYGDGSYTRGELGREHLGLG-----KASV 242
              C AL            S PP C+  Y  +YG G +T G    E    G     +  V
Sbjct: 148 VSMCAALA---------GPSPPPGCSCMYNQTYGTG-WTAGIQSVETFTFGSTPADQTRV 197

Query: 243 NDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLI 302
               FGC   +   + G +GL+GLGR  +SLVSQ   +  G+FSYCL   QDA ++ +L+
Sbjct: 198 PGIAFGCSNASSDDWNGSAGLVGLGRGSMSLVSQ---LGAGMFSYCLTPFQDANSTSTLL 254

Query: 303 LGGNSSVFKNSTPITYTNMIPNPQ---LATFYILNLTGISIGGKQLQ--ASGFA-----K 352
           LG ++++  N T +  T  + +P    ++T+Y LNLTGISIG   L    + FA      
Sbjct: 255 LGPSAAL--NGTGVLTTPFVASPSKAPMSTYYYLNLTGISIGTTALSIPPNAFALRTDGT 312

Query: 353 GGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPG--FSILDTCFNLSAYQEV--NI 408
           GG++IDSGT IT L  + Y  ++A  ++     P A G   + LD CF L++      ++
Sbjct: 313 GGLIIDSGTTITSLVDAAYQQVRAA-IESLVTLPVADGSDSTGLDLCFALTSETSTPPSM 371

Query: 409 PLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYD 468
           P +   F+G A+M + V   +      +   CLA+ + +        GNYQQ+N  ++YD
Sbjct: 372 PSMTFHFDG-ADMVLPVDNYMIL---GSGVWCLAMRNQTV-GAMSTFGNYQQQNVHLLYD 426

Query: 469 TKNSQLGFAGEDCSSM 484
                L FA   CS++
Sbjct: 427 IHEETLSFAPAKCSTL 442


>gi|50508279|dbj|BAD32128.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
           Group]
 gi|125600536|gb|EAZ40112.1| hypothetical protein OsJ_24555 [Oryza sativa Japonica Group]
          Length = 455

 Score =  161 bits (407), Expect = 7e-37,   Method: Compositional matrix adjust.
 Identities = 116/363 (31%), Positives = 179/363 (49%), Gaps = 33/363 (9%)

Query: 145 NMTVIVDTGSDLTWVQCQPCKSCYNQQD--PVFDPSISPSYKKVLCNSSTCHALEFATGN 202
           +  VIVDTGS+L W QC PC  C+ +    PV  P+ S ++ ++ CN S C  L  ++  
Sbjct: 103 DFPVIVDTGSNLIWAQCAPCTRCFPRPTPAPVLQPARSSTFSRLPCNGSFCQYLPTSS-R 161

Query: 203 SGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSG 262
              C++++   C Y  +YG G YT G L  E L +G  +     FGC   N       SG
Sbjct: 162 PRTCNATA--ACAYNYTYGSG-YTAGYLATETLTVGDGTFPKVAFGCSTENG--VDNSSG 216

Query: 263 LMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMI 322
           ++GLGR  LSLVSQ +    G FSYCL S    G +  ++ G  + + + S  +  T ++
Sbjct: 217 IVGLGRGPLSLVSQLAV---GRFSYCLRSDMADGGASPILFGSLAKLTEGSV-VQSTPLL 272

Query: 323 PNP--QLATFYILNLTGISIGGKQLQAS----GFAK----GGILIDSGTVITRLPPSIYS 372
            NP  Q +T Y +NLTGI++   +L  +    GF +    GG ++DSGT +T L    Y+
Sbjct: 273 KNPYLQRSTHYYVNLTGIAVDSTELPVTGSTFGFTQTGLGGGTIVDSGTTLTYLAKDGYA 332

Query: 373 ALKAEFLKQFSGF----PSAPGFSILDTCFNLSA---YQEVNIPLVKMEFEGNAEMTVDV 425
            +K  F  Q +      P++     LD C+  SA    + V +P + + F G A+  V V
Sbjct: 333 MVKQAFQSQMANLNQTTPASGAPYDLDLCYKPSAGGGGKAVRVPRLALRFAGGAKYNVPV 392

Query: 426 TGIVYFVKSDA----SQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
                 V++D+    +  CL +   + +    IIGN  Q +  ++YD       FA  DC
Sbjct: 393 QNYFAGVEADSQGRVTVACLLVLPATDDLPISIIGNLMQMDMHLLYDIDGGMFSFAPADC 452

Query: 482 SSM 484
           + +
Sbjct: 453 AKL 455


>gi|255553149|ref|XP_002517617.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223543249|gb|EEF44781.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 449

 Score =  161 bits (407), Expect = 8e-37,   Method: Compositional matrix adjust.
 Identities = 114/367 (31%), Positives = 178/367 (48%), Gaps = 31/367 (8%)

Query: 135 YIATIELGGRNMTV--IVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
           Y+  I +G   + +  I DTGSDL WVQCQPC+ CY Q  P+FDP  S SY+ VLC +  
Sbjct: 93  YLMRISIGNPQVEILAIADTGSDLIWVQCQPCEMCYKQNSPIFDPRRSSSYRNVLCGNEF 152

Query: 193 CHALEFATGNSGVCSSSS-PPDCNYFVSYGDGSYTRGELGREHLGLGKASVN-------- 243
           C+ L+   G +  C +      C Y  SYGD S++ G L  E  G+G  + N        
Sbjct: 153 CNKLD---GEARSCDARGFVKTCGYTYSYGDQSFSDGHLAIERFGIGSTNSNTSAAIAYF 209

Query: 244 -DFIFGCGRNNKGLFG-GVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL-PSTQDAGASGS 300
            +  FGCG  N G F    SG++GLG   +SLVSQ      G FSYCL P+++ +  +  
Sbjct: 210 QEVAFGCGTKNGGTFDELGSGIIGLGGGSMSLVSQLGPKLSGKFSYCLVPTSEQSNYTSK 269

Query: 301 LILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGF-----AKGGI 355
           +  G + ++  ++  +  T ++P  +  T+Y L L  IS+  K+L  +        KG I
Sbjct: 270 INFGNDINISGSNYNVVSTPLLPK-KPETYYYLTLEAISVENKRLPYTNLWNGEVEKGNI 328

Query: 356 LIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEF 415
           +IDSGT +T L    ++ L +   +   G   +    + + CF     + + +P++   F
Sbjct: 329 IIDSGTTLTFLDSEFFNNLDSAVEEAVKGERVSDPHGLFNICFKDE--KAIELPIITAHF 386

Query: 416 EGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLG 475
            G     V++  +  F K +   +C  +      ++  I GN  Q N  V YD +   + 
Sbjct: 387 TG---ADVELQPVNTFAKVEEDLLCFTMIP---SNDIAIFGNLAQMNFLVGYDLEKKAVS 440

Query: 476 FAGEDCS 482
           F   DC+
Sbjct: 441 FLPTDCT 447


>gi|115458646|ref|NP_001052923.1| Os04g0448500 [Oryza sativa Japonica Group]
 gi|38344830|emb|CAD40872.2| OSJNBa0064H22.11 [Oryza sativa Japonica Group]
 gi|113564494|dbj|BAF14837.1| Os04g0448500 [Oryza sativa Japonica Group]
          Length = 464

 Score =  160 bits (406), Expect = 9e-37,   Method: Compositional matrix adjust.
 Identities = 112/378 (29%), Positives = 175/378 (46%), Gaps = 53/378 (14%)

Query: 145 NMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSG 204
             T  +DT SDL W QCQPC  CY+Q DP+F+P +S +Y  + C+S TC  L+       
Sbjct: 101 KFTAAIDTASDLIWTQCQPCTGCYHQVDPMFNPRVSSTYAALPCSSDTCDELDVHR---- 156

Query: 205 VCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGG----V 260
            C       C Y  +Y   + T G L  + L +G+ +     FGC  ++ G  G      
Sbjct: 157 -CGHDDDESCQYTYTYSGNATTEGTLAVDKLVIGEDAFRGVAFGCSTSSTG--GAPPPQA 213

Query: 261 SGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTN 320
           SG++GLGR  LSLVSQ S      F+YCLP    +   G L+LG ++   +N+T      
Sbjct: 214 SGVVGLGRGPLSLVSQLSV---RRFAYCLPPPA-SRIPGKLVLGADADAARNATNRIAVP 269

Query: 321 MIPNPQLATFYILNLTGISIGGKQL-------------------------QASGFAKG-- 353
           M  +P+  ++Y LNL G+ IG + +                          A+  A G  
Sbjct: 270 MRRDPRYPSYYYLNLDGLLIGDRAMSLPPTTTTTATATATAPAPAPTPSPNATAVAVGDA 329

Query: 354 ---GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI-LDTCFNLS---AYQEV 406
              G++ID  + IT L  S+Y  L  +   +    P   G S+ LD CF L    A+  V
Sbjct: 330 NRYGMIIDIASTITFLEASLYDELVNDLEVEIR-LPRGTGSSLGLDLCFILPDGVAFDRV 388

Query: 407 NIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVI 466
            +P V + F+G   + +D   +  F +   S +   +   +      I+GN+QQ+N +V+
Sbjct: 389 YVPAVALAFDGR-WLRLDKARL--FAEDRESGMMCLMVGRAEAGSVSILGNFQQQNMQVL 445

Query: 467 YDTKNSQLGFAGEDCSSM 484
           Y+ +  ++ F    C ++
Sbjct: 446 YNLRRGRVTFVQSPCGAL 463


>gi|302758750|ref|XP_002962798.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
 gi|300169659|gb|EFJ36261.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
          Length = 427

 Score =  160 bits (406), Expect = 9e-37,   Method: Compositional matrix adjust.
 Identities = 118/393 (30%), Positives = 188/393 (47%), Gaps = 49/393 (12%)

Query: 124 LTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQD---PVFDPS 178
           L SG  + +  Y   + +G   +   +IVDTGSDLTW+QC P  +  N      P +D S
Sbjct: 48  LVSGSSIGSGQYFVELRVGTPAKKFPLIVDTGSDLTWIQCNPPNTTANSSSPPAPWYDKS 107

Query: 179 ISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL- 237
            S SY+++ C    C  L    G+S  CS +SP  C+Y   Y D S T G L  E + + 
Sbjct: 108 SSSSYREIPCTDDECQFLPAPIGSS--CSITSPSPCDYTYGYSDQSRTTGILAYETISMK 165

Query: 238 -----GKAS---------VNDFIFGCGRNNKGL-FGGVSGLMGLGRSDLSLVSQTSEI-F 281
                GK +         + +   GC R + G  F G SG++GLG+  +SL +QT     
Sbjct: 166 SRKRSGKRAGNHKTRRIRIKNVALGCSRESVGASFLGASGVLGLGQGPISLATQTRHTAL 225

Query: 282 GGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIG 341
           GG+FSYCL        + S ++ G +   K    + +T ++ NP   +FY +N+TG+++ 
Sbjct: 226 GGIFSYCLVDYLRGSNASSFLVMGRTHWRK----LAHTPIVRNPAAQSFYYVNVTGVAVD 281

Query: 342 GKQLQA--------SGFAKGGILIDSGTVITRLPPSIYS----ALKAE-FLKQFSGFPSA 388
           GK +           G    G + DSGT ++ L    YS    AL A  +L +    P  
Sbjct: 282 GKPVDGIASSDWGIDGDGNKGTIFDSGTTLSYLREPAYSKVLGALNASIYLPRAQEIPE- 340

Query: 389 PGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSY 448
            GF +   C+N++   E  +P + +EF+G A M +     +  V  +    C+AL  ++ 
Sbjct: 341 -GFEL---CYNVTR-MEKGMPKLGVEFQGGAVMELPWNNYMVLVAENVQ--CVALQKVTT 393

Query: 449 EDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
            + + I+GN  Q++  + YD   +++GF    C
Sbjct: 394 TNGSNILGNLLQQDHHIEYDLAKARIGFKWSPC 426


>gi|116310064|emb|CAH67085.1| H0818E04.2 [Oryza sativa Indica Group]
 gi|116310187|emb|CAH67199.1| OSIGBa0152K17.11 [Oryza sativa Indica Group]
          Length = 464

 Score =  160 bits (406), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 112/378 (29%), Positives = 175/378 (46%), Gaps = 53/378 (14%)

Query: 145 NMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSG 204
             T  +DT SDL W QCQPC  CY+Q DP+F+P +S +Y  + C+S TC  L+       
Sbjct: 101 KFTAAIDTASDLIWTQCQPCTGCYHQVDPMFNPRVSSTYAALPCSSDTCDELDVHR---- 156

Query: 205 VCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGG----V 260
            C       C Y  +Y   + T G L  + L +G+ +     FGC  ++ G  G      
Sbjct: 157 -CGHDDDESCQYTYTYSGNATTEGTLAVDKLVIGEDAFRGVAFGCSTSSTG--GAPPPQA 213

Query: 261 SGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTN 320
           SG++GLGR  LSLVSQ S      F+YCLP    +   G L+LG ++   +N+T      
Sbjct: 214 SGVVGLGRGPLSLVSQLSV---RRFAYCLPPPA-SRIPGKLVLGADADAARNATNRIAVP 269

Query: 321 MIPNPQLATFYILNLTGISIGGKQL-------------------------QASGFAKG-- 353
           M  +P+  ++Y LNL G+ IG + +                          A+  A G  
Sbjct: 270 MRRDPRYPSYYYLNLDGLLIGDRTMSLPPTTTTTATATATAPAPAPTPSPNATAVAVGDA 329

Query: 354 ---GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI-LDTCFNLS---AYQEV 406
              G++ID  + IT L  S+Y  L  +   +    P   G S+ LD CF L    A+  V
Sbjct: 330 NRYGMIIDIASTITFLEASLYDELVNDLEVEIR-LPRGTGSSLGLDLCFILPDGVAFDRV 388

Query: 407 NIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVI 466
            +P V + F+G   + +D   +  F +   S +   +   +      I+GN+QQ+N +V+
Sbjct: 389 YVPAVALAFDGR-WLRLDKARL--FAEDRESGMMCLMVGRAEAGSVSILGNFQQQNMQVL 445

Query: 467 YDTKNSQLGFAGEDCSSM 484
           Y+ +  ++ F    C ++
Sbjct: 446 YNLRRGRVTFVQSPCGAL 463


>gi|212275143|ref|NP_001130306.1| uncharacterized protein LOC100191400 precursor [Zea mays]
 gi|194688798|gb|ACF78483.1| unknown [Zea mays]
 gi|194703430|gb|ACF85799.1| unknown [Zea mays]
 gi|194707192|gb|ACF87680.1| unknown [Zea mays]
 gi|223944599|gb|ACN26383.1| unknown [Zea mays]
 gi|223948667|gb|ACN28417.1| unknown [Zea mays]
 gi|414887962|tpg|DAA63976.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
          Length = 450

 Score =  160 bits (406), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 124/391 (31%), Positives = 193/391 (49%), Gaps = 32/391 (8%)

Query: 104 SRIKNMISGNIKDVSNTEIPLTSGIRL-QTLNYIATIELG--GRNMTVIVDTGSDLTWVQ 160
           SR+  + S  ++  +    P+ SG +L QT  Y+    LG   + + + VDT +D +W+ 
Sbjct: 80  SRLLYLDSLAVRGRARAYAPIASGRQLLQTPTYVVRASLGTPPQQLLLAVDTSNDASWIP 139

Query: 161 CQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPP---DCNYF 217
           C  C  C       FDP+ S SY+ V C S  C     A   +  C    PP    C + 
Sbjct: 140 CAGCAGCPTSSAAPFDPASSASYRTVPCGSPLC-----AQAPNAAC----PPGGKACGFS 190

Query: 218 VSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQT 277
           ++Y D S  +  L ++ L +   +V  + FGC +   G      GL+GLGR  LS +SQT
Sbjct: 191 LTYADSSL-QAALSQDSLAVAGNAVKAYTFGCLQRATGTAAPPQGLLGLGRGPLSFLSQT 249

Query: 278 SEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTG 337
            +++   FSYCLPS +    SG+L LG N    +    I  T ++ NP  ++ Y +N+TG
Sbjct: 250 KDMYEATFSYCLPSFKSLNFSGTLRLGRNGQPQR----IKTTPLLANPHRSSLYYVNMTG 305

Query: 338 ISIGGKQLQASGF--AKG-GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI- 393
           I +G K +    F  A G G ++DSGT+ TRL    Y A++ E  ++      AP  S+ 
Sbjct: 306 IRVGRKVVPIPAFDPATGAGTVLDSGTMFTRLVAPAYVAVRDEVRRRV----GAPVSSLG 361

Query: 394 -LDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDET 452
             DTCFN +A   V  P V + F+G      +   +++      S + +A A        
Sbjct: 362 GFDTCFNTTA---VAWPPVTLLFDGMQVTLPEENVVIHSTYGTISCLAMAAAPDGVNTVL 418

Query: 453 GIIGNYQQKNQRVIYDTKNSQLGFAGEDCSS 483
            +I + QQ+N RV++D  N ++GFA E C++
Sbjct: 419 NVIASMQQQNHRVLFDVPNGRVGFARERCTA 449


>gi|242050432|ref|XP_002462960.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
 gi|241926337|gb|EER99481.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
          Length = 445

 Score =  160 bits (406), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 124/377 (32%), Positives = 193/377 (51%), Gaps = 44/377 (11%)

Query: 135 YIATIELGGRNMT--VIVDTGSDLTWVQCQPCKS-CYNQQDPVFDPSISPSYKKVLCNSS 191
           Y+ T+ +G   ++   I DTGSDL W QC PC S C+ Q  P+++PS S ++  + CNSS
Sbjct: 86  YLMTLAIGTPPVSYQAIADTGSDLIWTQCAPCSSQCFQQPTPLYNPSSSTTFAVLPCNSS 145

Query: 192 TCHALEFATGNSGVCSSSSPPDCN--YFVSYGDGSYTRGELGREHLGLGKAS------VN 243
                  +   + +  ++ PP C   Y ++YG G +T    G E    G ++      V 
Sbjct: 146 ------LSMCAAALAGTTPPPGCTCMYNMTYGSG-WTSVYQGSETFTFGSSTPANQTGVP 198

Query: 244 DFIFGCGRNNKGL-FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLI 302
              FGC   + G      SGL+GLGR  LSLVSQ   +    FSYCL   QD  ++ +L+
Sbjct: 199 GIAFGCSNASGGFNTSSASGLVGLGRGSLSLVSQ---LGVPKFSYCLTPYQDTNSTSTLL 255

Query: 303 LGGNSSVFKNSTPITYTNMIPNPQ---LATFYILNLTGISIGGKQLQASGFA-------K 352
           LG ++S+  ++  ++ T  + +P    ++T+Y LNLTGIS+G   L     A        
Sbjct: 256 LGPSASL-NDTGGVSSTPFVASPSDAPMSTYYYLNLTGISLGTTALSIPTTALSLKADGT 314

Query: 353 GGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI---LDTCFNL--SAYQEVN 407
           GG +IDSGT IT L  + Y  ++A  +   +  P+  G S    LD CF L  S      
Sbjct: 315 GGFIIDSGTTITLLGNTAYQQVRAAVVSLVT-LPTTDGGSAATGLDLCFELPSSTSAPPT 373

Query: 408 IPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIY 467
           +P + + F+G A+M +       ++  D++  CLA+ + + +    I+GNYQQ+N  ++Y
Sbjct: 374 MPSMTLHFDG-ADMVLPADS---YMMLDSNLWCLAMQNQT-DGGVSILGNYQQQNMHILY 428

Query: 468 DTKNSQLGFAGEDCSSM 484
           D     L FA   CS++
Sbjct: 429 DVGQETLTFAPAKCSTL 445


>gi|125555051|gb|EAZ00657.1| hypothetical protein OsI_22678 [Oryza sativa Indica Group]
          Length = 435

 Score =  160 bits (406), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 110/357 (30%), Positives = 172/357 (48%), Gaps = 37/357 (10%)

Query: 143 GRNMTVIVDTGSDLTWVQCQPC---KSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFA 199
            +   V  DT   ++ ++C+PC     C    DP F+PS S S+  + C S  C A+E  
Sbjct: 98  AQRFPVAFDTNFGVSVLRCKPCVGGAPC----DPAFEPSRSSSFAAIPCGSPEC-AVE-- 150

Query: 200 TGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGK-ASVNDFIFGCGR--NNKGL 256
                 C+ +S   C + + +G+ +   G L R+ L L   A+   F FGC     +   
Sbjct: 151 ------CTGAS---CPFTIQFGNVTVANGTLVRDTLTLPPSATFAGFTFGCIEVGADADT 201

Query: 257 FGGVSGLMGLGRSDLSLVSQT----SEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKN 312
           F G  GL+ L RS  SL S+     +      FSYCLPS+    + G L +G +   +  
Sbjct: 202 FDGAVGLIDLSRSSHSLASRVISNGATTSAAAFSYCLPSSSATSSRGFLSIGASRPEYSG 261

Query: 313 STPITYTNMIPNPQLATFYILNLTGISIGGKQLQASG--FAKGGILIDSGTVITRLPPSI 370
              I Y  M  NP     Y + L GIS+GG+ L      FA  G L+++ T  T L P+ 
Sbjct: 262 GD-IKYAPMSSNPNHPNSYFVELVGISVGGEDLPVPPAVFAAHGTLLEAATEFTFLAPAA 320

Query: 371 YSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVY 430
           Y+AL+  F +  + +P+AP F +LDTC+NL+    + +P V + F G  E+ +DV  ++Y
Sbjct: 321 YAALRDAFRRDMAPYPAAPPFRVLDTCYNLTGLASLAVPTVALRFAGGTELELDVRQMMY 380

Query: 431 FVKSDASQVCLALA------SLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
           F  +D S V  ++A      +        +IG   Q++  V+YD +  ++GF    C
Sbjct: 381 F--ADPSSVFSSVACLAFAAAPLPAFPVSVIGTLAQRSTEVVYDLRGGRVGFIPGRC 435


>gi|297823357|ref|XP_002879561.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297325400|gb|EFH55820.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 447

 Score =  160 bits (406), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 141/418 (33%), Positives = 209/418 (50%), Gaps = 46/418 (11%)

Query: 93  ILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELGGRNMTV--IV 150
           + D L+  +L+S  ++    NI  +S T+  L SG+      +  +I +G   M V  I 
Sbjct: 47  VTDRLNAAFLRSISRSRRLNNI--LSQTD--LQSGLIGADGEFFMSITIGTPPMKVFAIA 102

Query: 151 DTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSS 210
           DTGSDLTWVQC+PC+ CY +  P+FD   S +YK   C+S  CHAL  ++   G   S +
Sbjct: 103 DTGSDLTWVQCKPCQQCYKENGPIFDKKKSSTYKSEPCDSRNCHAL--SSSERGCDESKN 160

Query: 211 PPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDF-----IFGCGRNNKGLFGGV-SGLM 264
              C Y  SYGD S+++G++  E + +  AS +       +FGCG NN G F    SG++
Sbjct: 161 V--CKYRYSYGDQSFSKGDVATETISIDSASGSPVSFPGTVFGCGYNNGGTFDETGSGII 218

Query: 265 GLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLI--LGGN---SSVFKNSTPITYT 319
           GLG   LSL+SQ        FSYCL S + A  +G+ +  LG N   SS+ K+S  I+  
Sbjct: 219 GLGGGHLSLISQLGSSISKKFSYCL-SHKSATTNGTSVINLGTNSIPSSLSKDSGVISTP 277

Query: 320 NMIPNPQLATFYILNLTGISIGGKQLQASGFA------------KGGILIDSGTVITRLP 367
            +   P+  T+Y L L  IS+G K++  +G +             G I+IDSGT +T L 
Sbjct: 278 LVDKEPR--TYYYLTLEAISVGKKKIPYTGSSYNPNDGGIFSETSGNIIIDSGTTLTLLD 335

Query: 368 PSIYSALKAEFLKQFSGFP--SAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDV 425
              +    A   +  +G    S P   +L  CF  S   E+ +P + + F G     V +
Sbjct: 336 SGFFDKFGAAVEELVTGAKRVSDPQ-GLLSHCFK-SGSAEIGLPEITVHFTG---ADVRL 390

Query: 426 TGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSS 483
           + I  FVK     VCL++   +   E  I GN+ Q +  V YD +   + F   DCS+
Sbjct: 391 SPINAFVKVSEDMVCLSMVPTT---EVAIYGNFAQMDFLVGYDLETRTVSFQRMDCSA 445


>gi|296085498|emb|CBI29230.3| unnamed protein product [Vitis vinifera]
          Length = 542

 Score =  160 bits (405), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 132/422 (31%), Positives = 194/422 (45%), Gaps = 50/422 (11%)

Query: 65  GAITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPL 124
           G  +++L H++       D ++ Q  RL  D        SR+     G  +  + T   +
Sbjct: 30  GGFSVDLIHRDSPHSPFFDPSKTQAERLT-DAFRRSV--SRV-----GRFRPTAMTSDGI 81

Query: 125 TSGIRLQTLNYIATIELGGRNMTVI--VDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPS 182
            S I      Y+  + +G   + VI  VDTGSDLTW QC+PC  CY Q  P+FDP  S +
Sbjct: 82  QSRIVPSAGEYLMNLYIGTPPVPVIAIVDTGSDLTWTQCRPCTHCYKQVVPLFDPKNSST 141

Query: 183 YKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL----G 238
           Y+   C +S C AL    G    CS      C +  SY DGS+T G L  E L +    G
Sbjct: 142 YRDSSCGTSFCLAL----GKDRSCSKEK--KCTFRYSYADGSFTGGNLASETLTVDSTAG 195

Query: 239 K-ASVNDFIFGCGRNNKGLFG-GVSGLMGLGRSDLSLVSQTSEIFGGLFSYC-LPSTQDA 295
           K  S   F FGCG ++ G+F    SG++GLG  +LSL+SQ      GLFSYC LP + D+
Sbjct: 196 KPVSFPGFAFGCGHSSGGIFDKSSSGIVGLGGGELSLISQLKSTINGLFSYCLLPVSTDS 255

Query: 296 GASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGGI 355
             S  +  G +  V    T            ++T   L   G S      + +   +G I
Sbjct: 256 SISSRINFGASGRVSGYGT------------VSTPLRLPYKGYS------KKTEVEEGNI 297

Query: 356 LIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEF 415
           ++DSGT  T LP   YS L+        G        I   C+N +A  E+N P++   F
Sbjct: 298 IVDSGTTYTFLPQEFYSKLEKSVANSIKGKRVRDPNGIFSLCYNTTA--EINAPIITAHF 355

Query: 416 EGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLG 475
           +   +  V++  +  F++     VC  +A  S   + G++GN  Q N  V +D +  + G
Sbjct: 356 K---DANVELQPLNTFMRMQEDLVCFTVAPTS---DIGVLGNLAQVNFLVGFDLRKKR-G 408

Query: 476 FA 477
           F+
Sbjct: 409 FS 410



 Score = 50.4 bits (119), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 38/149 (25%), Positives = 64/149 (42%), Gaps = 13/149 (8%)

Query: 340 IGGKQLQASGFAK------GGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI 393
           +G    +  GF+K      G I++DSGT  T LP   Y  L+        G        I
Sbjct: 399 VGFDLRKKRGFSKKAEVEEGNIIVDSGTTYTYLPLEFYVKLEESVAHSIKGKRVRDPNGI 458

Query: 394 LDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETG 453
              C+N +  Q ++ P++   F+   +  V++     F++     VC  +   S   + G
Sbjct: 459 SSLCYNTTVDQ-IDAPIITAHFK---DANVELQPWNTFLRMQEDLVCFTVLPTS---DIG 511

Query: 454 IIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
           I+GN  Q N  V +D +  ++ F   DC+
Sbjct: 512 ILGNLAQVNFLVGFDLRKKRVSFKAADCT 540


>gi|225427558|ref|XP_002266614.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
           vinifera]
          Length = 444

 Score =  160 bits (404), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 131/367 (35%), Positives = 178/367 (48%), Gaps = 36/367 (9%)

Query: 135 YIATIELGGRNMTV--IVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
           Y+  I LG   + +  I DTGSDL W QC PC +CY Q +P+FDP  S +YK + C++  
Sbjct: 94  YLMNISLGTPPVPMLGIADTGSDLIWRQCLPCPNCYEQVEPLFDPKESETYKTLDCDNEF 153

Query: 193 CHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGK-----ASVNDFIF 247
           C  L    G  G C   +   C Y  SYGD SYTRG+L  + L +G      AS     F
Sbjct: 154 CQDL----GQQGSCDDDN--TCTYSYSYGDRSYTRGDLSSDTLTIGSTEGDPASFPGIAF 207

Query: 248 GCGRNNKGLFGGVS-GLMGLGRSDLSLVSQTSEIFGGLFSYCL-PSTQDAGASGSLILGG 305
           GCG +N G F     GL+GLG   LSLV Q S   GG FSYCL P + D+  S S I  G
Sbjct: 208 GCGHDNGGTFNEKDGGLIGLGGGPLSLVMQLSSEVGGQFSYCLVPLSSDSTVS-SKINFG 266

Query: 306 NSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA----------KGGI 355
            S V   S  ++   +   P   TFY L L G+S+G + +   GF+          +G I
Sbjct: 267 KSGVVSGSGTVSTPLIKGTPD--TFYYLTLEGLSVGSETVAFKGFSENKSSPAAVEEGNI 324

Query: 356 LIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEF 415
           +IDSGT +T LP   Y+ +++       G  +     I   C+  S+   + IP +   F
Sbjct: 325 IIDSGTTLTLLPQDFYTDVESALTNAIGGQTTTDPNGIFSLCY--SSVNNLEIPTITAHF 382

Query: 416 EGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLG 475
            G     V +  +  FV+     VC ++   S      I GN  Q N  V YD KN+++ 
Sbjct: 383 TG---ADVQLPPLNTFVQVQEDLVCFSMIPSS---NLAIFGNLAQINFLVGYDLKNNKVS 436

Query: 476 FAGEDCS 482
           F   DC+
Sbjct: 437 FKQTDCT 443


>gi|357481199|ref|XP_003610885.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355512220|gb|AES93843.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 416

 Score =  159 bits (403), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 127/386 (32%), Positives = 176/386 (45%), Gaps = 35/386 (9%)

Query: 111 SGNIKDVSNTEIPLTSGIRLQTLNYIATIELGGRNMTVIVDTGSDLTWVQCQPCKSCYNQ 170
           S NI+++    I    G  L  + YI T  +    +T +VDTGSDL W+QC PC  CY Q
Sbjct: 50  SNNIQNIVQAPINAYIGQHLMEI-YIGTPPI---KITGLVDTGSDLIWIQCAPCLGCYKQ 105

Query: 171 QDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGEL 230
             P+FDP  S +Y  + C+S  CH L+     +GVCS      CNY   YGD S T+G L
Sbjct: 106 IKPMFDPLKSSTYNNISCDSPLCHKLD-----TGVCSPEK--RCNYTYGYGDNSLTKGVL 158

Query: 231 GREHLGL----GK-ASVNDFIFGCGRNNKGLFGGVS-GLMGLGRSDLSLVSQTSEIFGG- 283
            ++        GK  S++ F+FGCG NN G F     GL+GLG    SL+SQ   +FGG 
Sbjct: 159 AQDTATFTSNTGKPVSLSRFLFGCGHNNTGGFNDHEMGLIGLGGGPTSLISQIGPLFGGK 218

Query: 284 LFSYCL-PSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGG 342
            FS CL P   D   S  +  G  S V  N   +  T ++P  +  T Y + L GIS+  
Sbjct: 219 KFSQCLVPFLTDIKISSRMSFGKGSQVLGNG--VVTTPLVPR-EKDTSYFVTLLGISVED 275

Query: 343 KQLQA-SGFAKGGILIDSGTVITRLPPSIYSALKAEF-----LKQFSGFPSAPGFSILDT 396
                 S   K  +L+DSGT    LP  +Y  + AE      LK  +  PS      L T
Sbjct: 276 TYFPMNSTIGKANMLVDSGTPPILLPQQLYDKVFAEVRNKVALKPITDDPS------LGT 329

Query: 397 CFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIG 456
                    +  P +   F G   +   +   +          CLA+ + +  D  G+ G
Sbjct: 330 QLCYRTQTNLKGPTLTFHFVGANVLLTPIQTFIPPTPQTKGIFCLAIYNRTNSDP-GVYG 388

Query: 457 NYQQKNQRVIYDTKNSQLGFAGEDCS 482
           N+ Q N  + +D     + F   DC+
Sbjct: 389 NFAQSNYLIGFDLDRQVVSFKPTDCT 414


>gi|356525748|ref|XP_003531485.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 436

 Score =  159 bits (402), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 137/460 (29%), Positives = 205/460 (44%), Gaps = 42/460 (9%)

Query: 39  LHKLQWQQKSGSSSSCVSHQKSRIEMGAITLELKHKNYCSGKIVDWNEQQQNRLILDNLH 98
           +H L +   +    S VS ++        +++L H++         +    +R+I   L 
Sbjct: 1   MHPLVFLSLALYLLSTVSSREVSEGQRGFSIDLIHRDSPLSPFYKPSLTPSDRIINTALR 60

Query: 99  VQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELGGRNMTVIVDTGSDLTW 158
             Y  +R  +      K +    IP   G  L    YI T  +       I DT SDL W
Sbjct: 61  SIYQLNRASHSDLNEKKTLERVRIP-NHGEYLMRF-YIGTPPV---ERLAIADTASDLIW 115

Query: 159 VQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFV 218
           VQC PC++C+ Q  P+F+P  S ++  + C+S  C      + N   C       C Y  
Sbjct: 116 VQCSPCETCFPQDTPLFEPHKSSTFANLSCDSQPC-----TSSNIYYCPLVG-NLCLYTN 169

Query: 219 SYGDGSYTRGELGREHLGLGKASVN--DFIFGCGRNNKGLF---GGVSGLMGLGRSDLSL 273
           +YGDGS T+G L  E +  G  +V     IFGCG NN  +      V+G++GLG   LSL
Sbjct: 170 TYGDGSSTKGVLCTESIHFGSQTVTFPKTIFGCGSNNDFMHQISNKVTGIVGLGAGPLSL 229

Query: 274 VSQTSEIFGGLFSYC-LPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYI 332
           VSQ  +  G  FSYC LP T  + ++  L  G ++++  N   +  T +I +P   ++Y 
Sbjct: 230 VSQLGDQIGHKFSYCLLPFT--STSTIKLKFGNDTTITGNG--VVSTPLIIDPHYPSYYF 285

Query: 333 LNLTGISIGGKQLQA--SGFAKGGILIDSGTVITRLPPSIY--------SALKAEFLKQF 382
           L+L GI+IG K LQ   +    G I+ID GTV+T L  + Y         AL     K  
Sbjct: 286 LHLVGITIGQKMLQVRTTDHTNGNIIIDLGTVLTYLEVNFYHNFVTLLREALGISETKDD 345

Query: 383 SGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLA 442
             +P        D CF      + NI   K+ F+            ++F   D + +CLA
Sbjct: 346 IPYP-------FDFCFP----NQANITFPKIVFQFTGAKVFLSPKNLFFRFDDLNMICLA 394

Query: 443 LASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
           +    Y     + GN  Q + +V YD K  ++ FA  DCS
Sbjct: 395 VLPDFYAKGFSVFGNLAQVDFQVEYDRKGKKVSFAPADCS 434


>gi|297846526|ref|XP_002891144.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297336986|gb|EFH67403.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 445

 Score =  159 bits (402), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 137/388 (35%), Positives = 181/388 (46%), Gaps = 40/388 (10%)

Query: 120 TEIPLTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDP 177
           T+  L SG+      Y  +I +G        I DTGSDLTWVQC+PC+ CY Q  P+FD 
Sbjct: 70  TKTDLQSGLISNGGEYFMSISIGTPPSKFLAIADTGSDLTWVQCKPCQQCYKQNTPLFDK 129

Query: 178 SISPSYKKVLCNSSTCHAL-EFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLG 236
             S +YK   C+S TC+AL E   G    C  S    C Y  SYGD S+T+GE+  E + 
Sbjct: 130 KKSSTYKTESCDSITCNALSEHEEG----CDESRNA-CKYRYSYGDESFTKGEVATETIS 184

Query: 237 LGKASVNDF-----IFGCGRNNKGLFGGVSGLMGLGRSD-LSLVSQTSEIFGGLFSYCLP 290
           +  +S +        FGCG NN G F      +       LSLVSQ     G  FSYCL 
Sbjct: 185 IDSSSGSPVSFPGTAFGCGYNNGGTFEETGSGIIGLGGGPLSLVSQLGSSIGKKFSYCLS 244

Query: 291 STQDAGASGSLI-LGGNSSVFKNS--TPITYTNMI-PNPQLATFYILNLTGISIGGKQLQ 346
            T       S+I LG NS   K S  + I  T +I  +P+  T+Y L L  I++G  +L 
Sbjct: 245 HTSATTNGTSVINLGTNSMTSKPSKDSAILTTPLIQKDPE--TYYFLTLEAITVGKTKLP 302

Query: 347 ASG----------FAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFP--SAPGFSIL 394
            +G             G I+IDSGT +T L    Y    A   +  +G    S P   IL
Sbjct: 303 YTGGGGYSLNRKSKKTGNIIIDSGTTLTLLDSGFYDDFGAVVEESVTGAKRVSDPQ-GIL 361

Query: 395 DTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGI 454
             CF  S  +E+ +P + M F G     V ++ I  FVK     VCL++   +   E  I
Sbjct: 362 THCFK-SGDKEIGLPTITMHFTG---ADVKLSPINSFVKLSEDIVCLSMIPTT---EVAI 414

Query: 455 IGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
            GN  Q +  V YD +   + F   DCS
Sbjct: 415 YGNMVQMDFLVGYDLETKTVSFQRMDCS 442


>gi|225427556|ref|XP_002266575.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
          Length = 445

 Score =  159 bits (401), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 132/368 (35%), Positives = 179/368 (48%), Gaps = 36/368 (9%)

Query: 134 NYIATIELGGR--NMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSS 191
           +Y+  I LG    +M  I DTGSDL W QC PC  CY Q +P+FDP  S +YK + CN+ 
Sbjct: 93  SYLMNISLGTPPVSMLGIADTGSDLIWRQCLPCDDCYKQVEPLFDPKKSKTYKTLGCNND 152

Query: 192 TCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGK-----ASVNDFI 246
            C  L    G  G C   +   C    SYGD SYTR +L  E   +G      AS     
Sbjct: 153 FCQDL----GQQGSCGDDN--TCTSSYSYGDQSYTRRDLSSETFTIGSTEGDPASFPGLA 206

Query: 247 FGCGRNNKGLFGGVS-GLMGLGRSDLSLVSQTSEIFGGLFSYCL-PSTQDAGASGSLILG 304
           FGCG +N G F     GL+GLG   LSLV Q S   GG FSYCL P + D+ AS S I  
Sbjct: 207 FGCGHSNGGTFNEKDSGLIGLGGGPLSLVMQLSSKVGGQFSYCLVPLSSDSTAS-SKINF 265

Query: 305 GNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKG----------G 354
           G S+V   S  ++   +   P   TFY L L G+S+G +++   GF+K            
Sbjct: 266 GKSAVVSGSGTVSTPLIKGTPD--TFYYLTLEGMSLGSEKVAFKGFSKNKSSPAAAEESN 323

Query: 355 ILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKME 414
           I+IDSGT +T LP   Y+ +++   K   G  +         C+  S  +++ IP +   
Sbjct: 324 IIIDSGTTLTLLPRDFYTDMESALTKVIGGQTTTDPRGTFSLCY--SGVKKLEIPTITAH 381

Query: 415 FEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQL 474
           F G     V +  +  FV++    VC ++   S      I GN  Q N  V YD KN+++
Sbjct: 382 FIG---ADVQLPPLNTFVQAQEDLVCFSMIPSS---NLAIFGNLSQMNFLVGYDLKNNKV 435

Query: 475 GFAGEDCS 482
            F   DC+
Sbjct: 436 SFKPTDCT 443


>gi|242044888|ref|XP_002460315.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
 gi|241923692|gb|EER96836.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
          Length = 456

 Score =  158 bits (400), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 132/449 (29%), Positives = 211/449 (46%), Gaps = 58/449 (12%)

Query: 67  ITLELKHKNYCSGKIVDWNE---QQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIP 123
           + + LKH +  +GK +  +E   +   R       +  +++R  +       D   T  P
Sbjct: 32  VRVALKHVD--AGKQLSRSELIRRAMQRSKARAAALSAVRNRAASARFSGKNDDQRTTPP 89

Query: 124 LTSGIRLQ-TLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSIS 180
               +R    L Y+  + +G   + ++ ++DTGSDL W QC PC SC  Q DP+F P  S
Sbjct: 90  TGVSVRPSGDLEYVVDLAIGTPPQPVSALLDTGSDLIWTQCAPCASCLAQPDPLFAPGES 149

Query: 181 PSYKKVLCNSSTC-----HALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHL 235
            SY+ + C    C     H  E             P  C Y  +YGDG+ T G    E  
Sbjct: 150 ASYEPMRCAGQLCSDILHHGCEM------------PDTCTYRYNYGDGTMTMGVYATERF 197

Query: 236 GLGKASVNDFI-----FGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLP 290
               +  +  +     FGCG  N G     SG++G GR+ LSLVSQ S      FSYCL 
Sbjct: 198 TFTSSGGDRLMTVPLGFGCGSMNVGSLNNGSGIVGFGRNPLSLVSQLSI---RRFSYCL- 253

Query: 291 STQDAGASGSLILGGNS-SVFKNST-PITYTNMIPNPQLATFYILNLTGISIGGKQLQ-- 346
           ++  +G   +L+ G  S  V+ ++T P+  T ++ + Q  TFY ++L G+++G ++L+  
Sbjct: 254 TSYGSGRKSTLLFGSLSGGVYGDATGPVQTTPLLQSLQNPTFYYVHLAGLTVGARRLRIP 313

Query: 347 ASGFA-----KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILD-TCFNL 400
            S FA      GG+++DSGT +T LP ++ + +   F +Q    P A G +  D  CF +
Sbjct: 314 ESAFALRPDGSGGVIVDSGTALTLLPGAVLAEVVRAFRQQLR-LPFANGGNPEDGVCFLV 372

Query: 401 -------SAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDAS-QVCLALASLSYEDET 452
                  S+  +V +P +   F+   +  +D+    Y +      ++CL LA     D+ 
Sbjct: 373 PAAWRRSSSTSQVPVPRMVFHFQ---DADLDLPRRNYVLDDHRKGRLCLLLADSG--DDG 427

Query: 453 GIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
             IGN  Q++ RV+YD +   L FA   C
Sbjct: 428 STIGNLVQQDMRVLYDLEAETLSFAPAQC 456


>gi|125563959|gb|EAZ09339.1| hypothetical protein OsI_31611 [Oryza sativa Indica Group]
          Length = 453

 Score =  158 bits (400), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 121/377 (32%), Positives = 180/377 (47%), Gaps = 47/377 (12%)

Query: 133 LNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNS 190
           L Y+  + +G   + +T ++DTGSDL W QC  C +C  Q DP+F P +S SY+ + C  
Sbjct: 96  LEYVLDLAVGTPPQPITALLDTGSDLIWTQCDTCTACLRQPDPLFSPRMSSSYEPMRCAG 155

Query: 191 STCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL----GKASVNDFI 246
             C       G+    S   P  C Y  SYGDG+ T G    E        G+       
Sbjct: 156 QLC-------GDILHHSCVRPDTCTYRYSYGDGTTTLGYYATERFTFASSSGETQSVPLG 208

Query: 247 FGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGAS----GSLI 302
           FGCG  N G     SG++G GR  LSLVSQ S      FSYCL     +  S    GSL 
Sbjct: 209 FGCGTMNVGSLNNASGIVGFGRDPLSLVSQLSI---RRFSYCLTPYASSRKSTLQFGSL- 264

Query: 303 LGGNSSVFKNST-PITYTNMIPNPQLATFYILNLTGISIGGKQLQ--ASGFA-----KGG 354
              +  ++ ++T P+  T ++ + Q  TFY +  TG+++G ++L+  AS FA      GG
Sbjct: 265 --ADVGLYDDATGPVQTTPILQSAQNPTFYYVAFTGVTVGARRLRIPASAFALRPDGSGG 322

Query: 355 ILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILD-TCFNLSAY--------QE 405
           ++IDSGT +T  P ++ + +   F  Q    P A G S  D  CF   A         ++
Sbjct: 323 VIIDSGTALTLFPAAVLAEVVRAFRSQLR-LPFANGSSPDDGVCFAAPAVAAGGGRMARQ 381

Query: 406 VNIPLVKMEFEGNAEMTVDVTGIVYFVKSDAS-QVCLALASLSYEDETGIIGNYQQKNQR 464
           V +P +   F+G     +D+    Y ++      +C+ L      D+   IGN+ Q++ R
Sbjct: 382 VAVPRMVFHFQG---ADLDLPRENYVLEDHRRGHLCVLLGDSG--DDGATIGNFVQQDMR 436

Query: 465 VIYDTKNSQLGFAGEDC 481
           V+YD +   L FA  +C
Sbjct: 437 VVYDLERETLSFAPVEC 453


>gi|357450863|ref|XP_003595708.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355484756|gb|AES65959.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 407

 Score =  158 bits (400), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 121/353 (34%), Positives = 158/353 (44%), Gaps = 44/353 (12%)

Query: 150 VDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSS 209
            DTGSDL W QC PC  CY QQ+P+FDP  S SY  + C + +C+ L+     S +CS+ 
Sbjct: 77  ADTGSDLVWFQCIPCTKCYKQQNPMFDPRSSSSYTNITCGTESCNKLD-----SSLCSTD 131

Query: 210 SPPDCNYFVSYGDGSYTRGELGREHLGLGKAS-----VNDFIFGCGRNNKGLFGGVSGLM 264
               CNY  SY D S T+G L +E L L   +         IFGCG NN G      GL+
Sbjct: 132 Q-KTCNYTYSYADNSITQGVLAQETLTLTSTTGEPVAFQGIIFGCGHNNSGFNDREMGLI 190

Query: 265 GLGRSDLSLVSQTSEIFGG---LFSYCL-PSTQDAGASGSLILGGNSSVFKN---STPIT 317
           GLGR  LSL+SQ     G    +FS CL P   D   +  +  G  S V  N   STP+ 
Sbjct: 191 GLGRGPLSLISQIGSSLGAGGNMFSQCLVPFNTDPSITSQMNFGKGSEVLGNGTVSTPLI 250

Query: 318 YTNMIPNPQLATFYILNLTGISIGGKQLQASG------FAKGGILIDSGTVITRLPPSIY 371
             +        T Y   L GIS+    L  S         KG ILIDSGT IT LP   Y
Sbjct: 251 SKD-------GTGYFATLLGISVEDINLPFSNGSSLGTITKGNILIDSGTTITYLPEEFY 303

Query: 372 SALKAEFLKQFSGFPSAPGFSI--LDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIV 429
             L  +   + +  P    F I   + C+       +N P + + FEG     V +T   
Sbjct: 304 HRLIEQVRNKVALEP----FRIDGYELCYQTPT--NLNGPTLTIHFEGG---DVLLTPAQ 354

Query: 430 YFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
            F+       C A+      +E    GNY Q N  + +D +   + F   DC+
Sbjct: 355 MFIPVQDDNFCFAV--FDTNEEYVTYGNYAQSNYLIGFDLERQVVSFKATDCT 405


>gi|255566010|ref|XP_002523993.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223536720|gb|EEF38361.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 439

 Score =  158 bits (400), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 119/364 (32%), Positives = 176/364 (48%), Gaps = 34/364 (9%)

Query: 135 YIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
           Y+    LG    ++  I DTGSDL W QC+PC  CY Q  P+FDP  S +Y+ + C++  
Sbjct: 92  YLMKFSLGTPAFDILAIADTGSDLIWTQCKPCDQCYEQDAPLFDPKSSSTYRDISCSTKQ 151

Query: 193 CHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS-----VNDFIF 247
           C  L+        CS      C+Y  SYGD S+T G +  + + LG  S     +   I 
Sbjct: 152 CDLLK----EGASCSGEGNKTCHYSYSYGDRSFTSGNVAADTITLGSTSGRPVLLPKAII 207

Query: 248 GCGRNNKGLFG-GVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL-PSTQDAGASGSLILGG 305
           GCG NN G F    SG++GLG   +SL+SQ      G FSYCL P + +A  S  L  G 
Sbjct: 208 GCGHNNGGSFTEKGSGIVGLGGGPISLISQLGSTIDGKFSYCLVPLSSNATNSSKLNFGS 267

Query: 306 NSSVFK---NSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ----ASGFAKGGILID 358
           N  V      STP+   +  P+    TFY L L  +S+G ++++    + G ++G I+ID
Sbjct: 268 NGIVSGGGVQSTPLISKD--PD----TFYFLTLEAVSVGSERIKFPGSSFGTSEGNIIID 321

Query: 359 SGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGN 418
           SGT +T  P   +S L +      +G P      IL  C+++ A  ++  P +   F+G 
Sbjct: 322 SGTTLTLFPEDFFSELSSAVQDAVAGTPVEDPSGILSLCYSIDA--DLKFPSITAHFDG- 378

Query: 419 AEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAG 478
               V +  +  FV+   + +C A   +   +   I GN  Q N  V YD +   + F  
Sbjct: 379 --ADVKLNPLNTFVQVSDTVLCFAFNPI---NSGAIFGNLAQMNFLVGYDLEGKTVSFKP 433

Query: 479 EDCS 482
            DC+
Sbjct: 434 TDCT 437


>gi|413921976|gb|AFW61908.1| hypothetical protein ZEAMMB73_608282 [Zea mays]
          Length = 459

 Score =  158 bits (400), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 114/362 (31%), Positives = 179/362 (49%), Gaps = 37/362 (10%)

Query: 148 VIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCS 207
            + DTGSDLTW QC+PCK C+ Q  P++D + S S+  V C S+TC  +  ++ N   C+
Sbjct: 110 ALADTGSDLTWTQCKPCKLCFPQDTPIYDTAASASFSPVPCASATCLPIWRSSRN---CT 166

Query: 208 SSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA---------SVNDFIFGCGRNNKGLFG 258
           +++   C Y  +Y DG+Y+ G LG E L    +         SV    FGCG +N GL  
Sbjct: 167 ATTTSPCRYRYAYDDGAYSAGVLGTETLTFAGSSPGAPGPGVSVGGVAFGCGVDNGGLSY 226

Query: 259 GVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNST---- 314
             +G +GLGR  LSLV+Q      G FSYCL    +      ++ G  + +   ST    
Sbjct: 227 NSTGTVGLGRGSLSLVAQLGV---GKFSYCLTDFFNTSLGSPVLFGSLAELAAPSTIGGA 283

Query: 315 PITYTNMIPNPQLATFYILNLTGISIGGKQLQASGF-------AKGGILIDSGTVITRLP 367
            +  T ++  P   + Y ++L GIS+G  +L              GG+++DSGT+ T L 
Sbjct: 284 AVQSTPLVQGPYNPSRYYVSLEGISLGDARLPIPNGTFDLRDDGSGGMIVDSGTIFTVLV 343

Query: 368 PSIYSAL---KAEFLKQFSGFPSAPGFSILDTCFNLSAYQE--VNIPLVKMEFEGNAEMT 422
            S +  +    A  L Q    P     S+   CF  +A ++   ++P + + F G A+M 
Sbjct: 344 ESAFRVVVNHVAGVLNQ----PVVNASSLDSPCFPATAGEQQLPDMPDMLLHFAGGADMR 399

Query: 423 VDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
           +     + F   ++S  CL +A  +      I+GN+QQ+N ++++D    QL F   DCS
Sbjct: 400 LHRDNYMSF-NQESSSFCLNIAG-APSAYGSILGNFQQQNIQMLFDITVGQLSFVPTDCS 457

Query: 483 SM 484
            +
Sbjct: 458 KL 459


>gi|356542355|ref|XP_003539632.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 444

 Score =  158 bits (399), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 125/348 (35%), Positives = 179/348 (51%), Gaps = 29/348 (8%)

Query: 149 IVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSS 208
           IVDTGSD+ W+QCQPC+ CYNQ  P+FDPS S +YK + C+S+ C +++ A      CSS
Sbjct: 110 IVDTGSDIIWLQCQPCEDCYNQTTPIFDPSQSKTYKTLPCSSNICQSVQSAAS----CSS 165

Query: 209 SSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIF-----GCGRNNKGLF-GGVSG 262
           ++  +C Y ++YGD S+++G+L  E L LG    +   F     GCG NNKG F    SG
Sbjct: 166 NN-DECEYTITYGDNSHSQGDLSVETLTLGSTDGSSVQFPKTVIGCGHNNKGTFQREGSG 224

Query: 263 LMGLGRSDLSLVSQTSEIFGGLFSYCL-PSTQDAGASGSLILGGNSSVFKNSTPITYTNM 321
           ++GLG   +SL+SQ S   GG FSYCL P    + +S  L  G  + V    T    T +
Sbjct: 225 IVGLGGGPVSLISQLSSSIGGKFSYCLAPLFSQSNSSSKLNFGDEAVVSGRGT--VSTPI 282

Query: 322 IPNPQLATFYILNLTGISIGGKQL------QASGFAKGGILIDSGTVITRLPPSIYSALK 375
           +P   L  FY L L   S+G  ++        S   +G I+IDSGT +T LP   Y  L+
Sbjct: 283 VPKNGLG-FYFLTLEAFSVGDNRIEFGSSSFESSGGEGNIIIDSGTTLTILPEDDYLNLE 341

Query: 376 AEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSD 435
           +                 L  C+  ++  E+N+P++   F+G     V++  I  F++ D
Sbjct: 342 SAVADAIELERVEDPSKFLRLCYRTTSSDELNVPVITAHFKG---ADVELNPISTFIEVD 398

Query: 436 ASQVCLALASLSYEDETG-IIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
              VC A  S     + G I GN  Q+N  V YD     + F   DC+
Sbjct: 399 EGVVCFAFRS----SKIGPIFGNLAQQNLLVGYDLVKQTVSFKPTDCT 442


>gi|317106730|dbj|BAJ53226.1| JHL06P13.6 [Jatropha curcas]
          Length = 445

 Score =  158 bits (399), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 122/362 (33%), Positives = 178/362 (49%), Gaps = 27/362 (7%)

Query: 135 YIATIELGGRNMTVIV--DTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
           Y   I +G   + V+V  DTGSDL WVQCQPC+ CY Q+ P+F+P  S +Y++VLC +  
Sbjct: 94  YFMRISIGTPPIEVLVIADTGSDLIWVQCQPCQECYKQKSPIFNPKQSSTYRRVLCETRY 153

Query: 193 CHALEFATGNSGVCSSSS-PPDCNYFVSYGDGSYTRGELGREHLGLGKA--SVNDFIFGC 249
           C+AL     +   CS+      C Y  SYGD S+T G L  E   +G    S+ +  FGC
Sbjct: 154 CNAL---NSDMRACSAHGFFKACGYSYSYGDHSFTMGYLATERFIIGSTNNSIQELAFGC 210

Query: 250 GRNNKGLFG-GVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL-PSTQDAGAS-GSLILGGN 306
           G +N G F    SG++GLG   LSL+SQ        FSYCL P  + +  S G ++ G N
Sbjct: 211 GNSNGGNFDEVGSGIVGLGGGSLSLISQLGTKIDNKFSYCLVPILEKSNFSLGKIVFGDN 270

Query: 307 SSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQAS------GFAKGGILIDSG 360
           S +  + T ++   +   P+  TFY L L  IS+G ++L            KG I+IDSG
Sbjct: 271 SFISGSDTYVSTPLVSKEPE--TFYYLTLEAISVGNERLAYENSRNDGNVEKGNIIIDSG 328

Query: 361 TVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAE 420
           T +T L   +Y+ L+    K   G   +    I   CF       + +P++ + F    +
Sbjct: 329 TTLTFLDSKLYNKLELVLEKAVEGERVSDPNGIFSICFRDKI--GIELPIITVHF---TD 383

Query: 421 MTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGED 480
             V++  I  F K++   +C  +      +   I GN  Q N  V YD   + + F   D
Sbjct: 384 ADVELKPINTFAKAEEDLLCFTMIP---SNGIAIFGNLAQMNFLVGYDLDKNCVSFMPTD 440

Query: 481 CS 482
           CS
Sbjct: 441 CS 442


>gi|302758122|ref|XP_002962484.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
 gi|300169345|gb|EFJ35947.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
          Length = 395

 Score =  158 bits (399), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 117/393 (29%), Positives = 187/393 (47%), Gaps = 49/393 (12%)

Query: 124 LTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQD---PVFDPS 178
           L SG  + +  Y   + +G   +   +I+DTGSDLTW+QC P  +  N      P +D S
Sbjct: 16  LVSGSSIGSGQYFVELRVGTPAKKFPLIIDTGSDLTWIQCNPPNTTANSSSPPAPWYDKS 75

Query: 179 ISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL- 237
            S SY+++ C    C  L    G+S  CS  SP  C+Y   Y D S T G L  E + + 
Sbjct: 76  SSSSYREIPCTDDECLFLPAPIGSS--CSIKSPSPCDYTYGYSDQSRTTGILAYETISMK 133

Query: 238 -----GKAS---------VNDFIFGCGRNNKGL-FGGVSGLMGLGRSDLSLVSQTSEI-F 281
                GK +         + +   GC R + G  F G SG++GLG+  +SL +QT     
Sbjct: 134 SRKRSGKRAGNHKTRTIRIKNVALGCSRESVGASFLGASGVLGLGQGPISLATQTRHTAL 193

Query: 282 GGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIG 341
           GG+FSYCL        + S ++ G +   K    + +T ++ NP   +FY +N+TG+++ 
Sbjct: 194 GGIFSYCLVDYLRGSNASSFLVMGRTRWRK----LAHTPIVRNPAAQSFYYVNVTGVAVD 249

Query: 342 GKQLQA--------SGFAKGGILIDSGTVITRLPPSIYS----ALKAE-FLKQFSGFPSA 388
           GK +           G    G + DSGT ++ L    YS    AL A  +L +    P  
Sbjct: 250 GKPVDGIASSDWGIDGDGNKGTIFDSGTTLSYLREPAYSKVLGALNASIYLPRAQEIPE- 308

Query: 389 PGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSY 448
            GF +   C+N++   E  +P + +EF+G A M +     +  V  +    C+AL  ++ 
Sbjct: 309 -GFEL---CYNVTR-MEKGMPKLGVEFQGGAVMELPWNNYMVLVAENVQ--CVALQKVTT 361

Query: 449 EDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
            + + I+GN  Q++  + YD   +++GF    C
Sbjct: 362 TNGSNILGNLLQQDHHIEYDLAKARIGFKWSPC 394


>gi|15222357|ref|NP_174430.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|12322538|gb|AAG51267.1|AC027135_8 chloroplast nucleoid DNA binding protein, putative [Arabidopsis
           thaliana]
 gi|67633408|gb|AAY78629.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332193236|gb|AEE31357.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 445

 Score =  158 bits (399), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 141/441 (31%), Positives = 206/441 (46%), Gaps = 52/441 (11%)

Query: 67  ITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTS 126
           +T+EL H++     + +      +  + D L+  +L+S     IS + +  + T+  L S
Sbjct: 29  LTVELIHRDSPHSPLYN-----PHHTVSDRLNAAFLRS-----ISRSRRFTTKTD--LQS 76

Query: 127 GIRLQTLNYIATIELGGRNMTV--IVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYK 184
           G+      Y  +I +G     V  I DTGSDLTWVQC+PC+ CY Q  P+FD   S +YK
Sbjct: 77  GLISNGGEYFMSISIGTPPSKVFAIADTGSDLTWVQCKPCQQCYKQNSPLFDKKKSSTYK 136

Query: 185 KVLCNSSTCHAL-EFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVN 243
              C+S TC AL E   G    C  S    C Y  SYGD S+T+G++  E + +  +S +
Sbjct: 137 TESCDSKTCQALSEHEEG----CDESKDI-CKYRYSYGDNSFTKGDVATETISIDSSSGS 191

Query: 244 DF-----IFGCGRNNKGLFGGVSGLMGLGRSD-LSLVSQTSEIFGGLFSYCLPSTQDAGA 297
                  +FGCG NN G F      +       LSLVSQ     G  FSYCL  T     
Sbjct: 192 SVSFPGTVFGCGYNNGGTFEETGSGIIGLGGGPLSLVSQLGSSIGKKFSYCLSHTAATTN 251

Query: 298 SGSLILGGNSSVFKNSTPITYTNMIP----NPQLATFYILNLTGISIGGKQLQASGFA-- 351
             S+I  G +S+  N +  + T   P    +P+  T+Y L L  +++G  +L  +G    
Sbjct: 252 GTSVINLGTNSIPSNPSKDSATLTTPLIQKDPE--TYYFLTLEAVTVGKTKLPYTGGGYG 309

Query: 352 --------KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFP--SAPGFSILDTCFNLS 401
                    G I+IDSGT +T L    Y        +  +G    S P   +L  CF  S
Sbjct: 310 LNGKSSKRTGNIIIDSGTTLTLLDSGFYDDFGTAVEESVTGAKRVSDPQ-GLLTHCFK-S 367

Query: 402 AYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQK 461
             +E+ +P + M F  NA+  V ++ I  FVK +   VCL++   +   E  I GN  Q 
Sbjct: 368 GDKEIGLPAITMHFT-NAD--VKLSPINAFVKLNEDTVCLSMIPTT---EVAIYGNMVQM 421

Query: 462 NQRVIYDTKNSQLGFAGEDCS 482
           +  V YD +   + F   DCS
Sbjct: 422 DFLVGYDLETKTVSFQRMDCS 442


>gi|357514995|ref|XP_003627786.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355521808|gb|AET02262.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 436

 Score =  158 bits (399), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 114/354 (32%), Positives = 172/354 (48%), Gaps = 38/354 (10%)

Query: 149 IVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSS 208
           IVDTGSD+ W+QC+PC+ CYNQ  P+F+PS S SYK + C S  C ++E  + N      
Sbjct: 103 IVDTGSDIVWLQCEPCQECYNQTTPMFNPSKSSSYKNIPCPSKLCQSMEDTSCNDKNY-- 160

Query: 209 SSPPDCNYFVSYGDGSYTRGELGREHLGLGK-----ASVNDFIFGCGRNNKGLF-GGVSG 262
                C Y   YGD S++ G+L  + L L        S  + + GCG NN   + G  SG
Sbjct: 161 -----CEYSTYYGDNSHSGGDLSVDTLTLESTNGLTVSFPNIVIGCGTNNILSYEGASSG 215

Query: 263 LMGLGRSDLSLVSQTSEIFGGLFSYCLP-----STQDAGASGSLILGGNSSVFKN---ST 314
           ++G G    S ++Q     GG FSYCL      +   + A+  L  G  ++V  +   +T
Sbjct: 216 IVGFGSGPASFITQLGSSTGGKFSYCLTPLFSVTNIQSNATSKLNFGDAATVSGDGVVTT 275

Query: 315 PITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA----KGGILIDSGTVITRLPPSI 370
           PI    +  +P+  TFY L L   S+G ++++  G      +G I+IDSGT +T L    
Sbjct: 276 PI----LKKDPE--TFYYLTLEAFSVGNRRVEIGGVPNGDNEGNIIIDSGTTLTSLTKDD 329

Query: 371 YSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVY 430
           YS L++  +              L+ C+++ A +  + P++ M F+G     VD+  I  
Sbjct: 330 YSFLESAVVDLVKLERVDDPTQTLNLCYSVKA-EGYDFPIITMHFKG---ADVDLHPIST 385

Query: 431 FVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
           FV       CLA  S     +  I GN  Q+N  V YD +   + F   DC+ +
Sbjct: 386 FVSVADGVFCLAFES---SQDHAIFGNLAQQNLMVGYDLQQKIVSFKPSDCTKV 436


>gi|357521081|ref|XP_003630829.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355524851|gb|AET05305.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 526

 Score =  157 bits (398), Expect = 9e-36,   Method: Compositional matrix adjust.
 Identities = 111/362 (30%), Positives = 168/362 (46%), Gaps = 34/362 (9%)

Query: 124 LTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISP 181
           L  GI   T N++  I +GG  +   +I D  +D TW+QCQPC  CY+Q D +FDPS S 
Sbjct: 176 LNPGITTGTSNFLVQIGVGGPPQKFYMIFDLQTDFTWLQCQPCIKCYDQPDSIFDPSQSS 235

Query: 182 SYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS 241
           SY  + C +  C+ L     NS   S S    C Y ++Y DG+ T G L  E +    + 
Sbjct: 236 SYTLLSCETKHCNLLP----NS---SCSDDGYCRYNITYKDGTNTEGVLINETVSFESSG 288

Query: 242 VNDFI-FGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGS 300
             D +  GC   N+G F G  G  GLGR  LS  S+   I     SYCL  ++D  +S +
Sbjct: 289 WVDRVSLGCSNKNQGPFVGSDGTFGLGRGSLSFPSR---INASSMSYCLVESKDGYSSST 345

Query: 301 LILGGNSSVFKNSTPIT---YTNMIPNPQLATFYILNLTGISIGGKQLQASG-------F 350
           L          NS P +      ++ NP+    Y + L GI +GG+++           +
Sbjct: 346 LEF--------NSPPCSGSVKAKLLQNPKAENLYYVGLKGIKVGGEKIDVPNSTFTIDPY 397

Query: 351 AKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPL 410
             GG+++ S ++IT L    Y+ ++  F+ +         F   DTC+NLS+   V +P+
Sbjct: 398 GNGGMIVSSSSLITMLENDTYNVVRDAFVAKTQHLERLKAFLQFDTCYNLSSNNTVELPI 457

Query: 411 VKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTK 470
           ++ E        +     +Y V  + +  C A A    +    I+G  QQ   RV +D  
Sbjct: 458 LEFEVNDGKSWLLPKESYLYAVDKNGT-FCFAFAPS--KGSFSILGTLQQYGTRVTFDLV 514

Query: 471 NS 472
           NS
Sbjct: 515 NS 516


>gi|115479489|ref|NP_001063338.1| Os09g0452800 [Oryza sativa Japonica Group]
 gi|51535938|dbj|BAD38020.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
 gi|113631571|dbj|BAF25252.1| Os09g0452800 [Oryza sativa Japonica Group]
 gi|125605918|gb|EAZ44954.1| hypothetical protein OsJ_29597 [Oryza sativa Japonica Group]
 gi|215740967|dbj|BAG97462.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 453

 Score =  157 bits (398), Expect = 9e-36,   Method: Compositional matrix adjust.
 Identities = 121/377 (32%), Positives = 180/377 (47%), Gaps = 47/377 (12%)

Query: 133 LNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNS 190
           L Y+  + +G   + +T ++DTGSDL W QC  C +C  Q DP+F P +S SY+ + C  
Sbjct: 96  LEYVLDLAVGTPPQPITALLDTGSDLIWTQCDTCTACLRQPDPLFSPRMSSSYEPMRCAG 155

Query: 191 STCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL----GKASVNDFI 246
             C       G+    S   P  C Y  SYGDG+ T G    E        G+       
Sbjct: 156 QLC-------GDILHHSCVRPDTCTYRYSYGDGTTTLGYYATERFTFASSSGETQSVPLG 208

Query: 247 FGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGAS----GSLI 302
           FGCG  N G     SG++G GR  LSLVSQ S      FSYCL     +  S    GSL 
Sbjct: 209 FGCGTMNVGSLNNASGIVGFGRDPLSLVSQLSI---RRFSYCLTPYASSRKSTLQFGSL- 264

Query: 303 LGGNSSVFKNST-PITYTNMIPNPQLATFYILNLTGISIGGKQLQ--ASGFA-----KGG 354
              +  ++ ++T P+  T ++ + Q  TFY +  TG+++G ++L+  AS FA      GG
Sbjct: 265 --ADVGLYDDATGPVQTTPILQSAQNPTFYYVAFTGVTVGARRLRIPASAFALRPDGSGG 322

Query: 355 ILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILD-TCFNLSAY--------QE 405
           ++IDSGT +T  P ++ + +   F  Q    P A G S  D  CF   A         ++
Sbjct: 323 VIIDSGTALTLFPVAVLAEVVRAFRSQLR-LPFANGSSPDDGVCFAAPAVAAGGGRMARQ 381

Query: 406 VNIPLVKMEFEGNAEMTVDVTGIVYFVKSDAS-QVCLALASLSYEDETGIIGNYQQKNQR 464
           V +P +   F+G     +D+    Y ++      +C+ L      D+   IGN+ Q++ R
Sbjct: 382 VAVPRMVFHFQG---ADLDLPRENYVLEDHRRGHLCVLLGDSG--DDGATIGNFVQQDMR 436

Query: 465 VIYDTKNSQLGFAGEDC 481
           V+YD +   L FA  +C
Sbjct: 437 VVYDLERETLSFAPVEC 453


>gi|302781476|ref|XP_002972512.1| hypothetical protein SELMODRAFT_441822 [Selaginella moellendorffii]
 gi|300159979|gb|EFJ26598.1| hypothetical protein SELMODRAFT_441822 [Selaginella moellendorffii]
          Length = 496

 Score =  157 bits (398), Expect = 9e-36,   Method: Compositional matrix adjust.
 Identities = 135/421 (32%), Positives = 210/421 (49%), Gaps = 62/421 (14%)

Query: 110 ISGNIKDVSNTEIPL-----TSGIR--LQTLNYIA--TIELG----GRNMTVIVDTGSDL 156
           I   ++D  N  + L     TSG+R  +  L   A  +++LG     +N++ I+DTGS+ 
Sbjct: 64  IQSQVQDNQNNHLDLRGNRPTSGVRSVVTPLEDYALFSMQLGIGSLQKNLSAIIDTGSEA 123

Query: 157 TWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFAT--GNSGVCSSSSPPDC 214
             VQC       ++  PVFDP+ S SY++V C S  C A++  T  G+S  C +SS   C
Sbjct: 124 VLVQCG------SRSRPVFDPAASQSYRQVPCISQLCLAVQQQTSNGSSQPCVNSSA-TC 176

Query: 215 NYFVSYGDGSYTRGELGREHLGL------GKA-SVNDFIFGCGRNNKGLFG--GVSGLMG 265
            Y +SYGD   + G+  ++ + L      G+A    D  FGC  + +G     G  G++G
Sbjct: 177 TYSLSYGDSRNSTGDFSQDVIFLNSTNSSGQAVQFRDVAFGCAHSPQGFLVDLGSLGIVG 236

Query: 266 LGRSDLSLVSQTSEIFGG-LFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPN 324
             R +LSL SQ  +  GG  FSYC PS      +  +I  G+S + K  + + YT ++ N
Sbjct: 237 FNRGNLSLPSQLKDRLGGSKFSYCFPSQPWQPRATGVIFLGDSGLSK--SKVGYTPLLDN 294

Query: 325 P---QLATFYILNLTGISIGGKQLQ--ASGF------AKGGILIDSGTVITRLPPSIYSA 373
           P     +  Y + LT IS+ GK L    S F        GG ++DSGT  TR+    Y+A
Sbjct: 295 PVTPARSQLYYVGLTSISVDGKTLAIPESAFKLDPSTGDGGTVLDSGTTFTRVVDDAYTA 354

Query: 374 LKAEF-------LKQFSGFPSAPGFSILDTCFNLSAYQEV-NIPLVKMEFEGNAEMTVDV 425
            +  F       L++  G  +A GF   D C+N+SA   +  +P V++  + N  + +  
Sbjct: 355 FRNAFAASNRSGLRKKVG--AAAGF---DDCYNISAGSSLPGVPEVRLSLQNNVRLELRF 409

Query: 426 TGIVYFVKSDASQVCLALASLSYED----ETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
             +   V +  ++V + LA LS +     +  ++GNYQQ N  V YD + S++GF   DC
Sbjct: 410 EHLFVPVSAAGNEVTVCLAILSSQKSGFGKINVLGNYQQSNYLVEYDNERSRVGFERADC 469

Query: 482 S 482
           S
Sbjct: 470 S 470


>gi|356540371|ref|XP_003538663.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 374

 Score =  157 bits (398), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 120/369 (32%), Positives = 169/369 (45%), Gaps = 41/369 (11%)

Query: 134 NYIATIELGGRNMTV--IVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSS 191
           +Y+  + +G     +  I DTGSDLTW  C PC  CY Q++P+FDP  S SY+ + C+S 
Sbjct: 24  HYLMEVSIGTPPFKIYGIADTGSDLTWTSCVPCNKCYKQRNPIFDPQKSTSYRNISCDSK 83

Query: 192 TCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS-----VNDFI 246
            CH L+     +GVCS      CNY  +Y   + T+G L +E + L         +   +
Sbjct: 84  LCHKLD-----TGVCSPQK--HCNYTYAYASAAITQGVLAQETITLSSTKGESVPLKGIV 136

Query: 247 FGCGRNNKGLFGGVS-GLMGLGRSDLSLVSQTSEIFGG-LFSYCL-PSTQDAGASGSLIL 303
           FGCG NN G F     G++GLG   +S +SQ    FGG  FS CL P   D   S  + L
Sbjct: 137 FGCGHNNTGGFNDREMGIIGLGGGPVSFISQIGSSFGGKRFSQCLVPFHTDVSVSSKMSL 196

Query: 304 GGNSSVFKN---STPITYTNMIPNPQLATFYILNLTGISIGGKQLQASG-----FAKGGI 355
           G  S V      STP+         Q  T Y + L GIS+G   L  +G       KG +
Sbjct: 197 GKGSEVSGKGVVSTPLVAK------QDKTPYFVTLLGISVGNTYLHFNGSSSQSVEKGNV 250

Query: 356 LIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNI--PLVKM 413
            +DSGT  T LP  +Y  L A+   + +  P     + LD    L    + N+  P++  
Sbjct: 251 FLDSGTPPTILPTQLYDRLVAQVRSEVAMKPVT---NDLDLGPQLCYRTKNNLRGPVLTA 307

Query: 414 EFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQ 473
            FEG     V +     FV       CL   + S   + G+ GN+ Q N  + +D     
Sbjct: 308 HFEGG---DVKLLPTQTFVSPKDGVFCLGFTNTS--SDGGVYGNFAQSNYLIGFDLDRQV 362

Query: 474 LGFAGEDCS 482
           + F   DC+
Sbjct: 363 VSFKPMDCT 371


>gi|414589628|tpg|DAA40199.1| TPA: hypothetical protein ZEAMMB73_627989 [Zea mays]
          Length = 452

 Score =  157 bits (397), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 117/375 (31%), Positives = 181/375 (48%), Gaps = 42/375 (11%)

Query: 133 LNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNS 190
           L Y+  + +G   + ++ ++DTGSDL W QC PC SC +Q DP+F P  S SY+ + C  
Sbjct: 94  LEYVVDLAIGTPPQPVSALLDTGSDLIWTQCAPCASCLSQPDPLFAPGQSASYEPMRCAG 153

Query: 191 STCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFI---- 246
           + C  +          S   P  C Y  +YGDG+ T G    E      +          
Sbjct: 154 TLCSDILHH-------SCERPDTCTYRYNYGDGTMTVGVYATERFTFASSGGGGLTTTTV 206

Query: 247 ---FGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLIL 303
              FGCG  N G     SG++G GR+ LSLVSQ S      FSYCL S      S  L  
Sbjct: 207 PLGFGCGSVNVGSLNNGSGIVGFGRNPLSLVSQLSI---RRFSYCLTSYASRRQSTLLFG 263

Query: 304 GGNSSVFKNST-PITYTNMIPNPQLATFYILNLTGISIGGKQLQ--ASGFA-----KGGI 355
             +  V+ ++T  +  T ++ +PQ  TFY ++ TG+++G ++L+   S FA      GG+
Sbjct: 264 SLSDGVYGDATGRVQTTPLLQSPQNPTFYYVHFTGLTVGARRLRIPESAFALRPDGSGGV 323

Query: 356 LIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILD-TCFNL-------SAYQEVN 407
           ++DSGT +T LP ++ + +   F +Q    P A G +  D  CF +       S+  ++ 
Sbjct: 324 IVDSGTALTLLPAAVLAEVVRAFRQQLR-LPFANGGNPEDGVCFLVPAAWRRSSSTSQMP 382

Query: 408 IPLVKMEFEGNAEMTVDVTGIVYFVKSDAS-QVCLALASLSYEDETGIIGNYQQKNQRVI 466
           +P + + F+G     +D+    Y +      ++CL LA     D+   IGN  Q++ RV+
Sbjct: 383 VPRMVLHFQG---ADLDLPRRNYVLDDHRRGRLCLLLADSG--DDGSTIGNLVQQDMRVL 437

Query: 467 YDTKNSQLGFAGEDC 481
           YD +   L  A   C
Sbjct: 438 YDLEAETLSIAPARC 452


>gi|226503109|ref|NP_001147206.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
 gi|195608496|gb|ACG26078.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
 gi|413921850|gb|AFW61782.1| aspartic proteinase nepenthesin-2 [Zea mays]
          Length = 441

 Score =  157 bits (397), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 127/384 (33%), Positives = 171/384 (44%), Gaps = 44/384 (11%)

Query: 123 PLTSGIRLQTLN---YIATIELGGRNM--TVIVDTGSDLTWVQCQPCKSCYNQQDPVFDP 177
           P+T+   L T +   Y+  + +G   +  T I+DTGSDL W QC PC  C +Q  P FD 
Sbjct: 74  PITAARVLVTASSGEYLVDLAIGTPPLYYTAIMDTGSDLIWTQCAPCLLCADQPTPYFDV 133

Query: 178 SISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL 237
             S +Y+ + C SS C +L     +S  C       C Y   YGD + T G L  E    
Sbjct: 134 KKSATYRALPCRSSRCASL-----SSPSCFKKM---CVYQYYYGDTASTAGVLANETFTF 185

Query: 238 G-----KASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPST 292
           G     K    +  FGCG  N G     SG++G GR  LSLVSQ        FSYCL S 
Sbjct: 186 GAANSTKVRATNIAFGCGSLNAGDLANSSGMVGFGRGPLSLVSQLGP---SRFSYCLTSY 242

Query: 293 QDAGASGSLILGGNSSVFKNST----PITYTNMIPNPQLATFYILNLTGISIGGKQLQAS 348
             A  S  L  G  +++   +T    P+  T  + NP L   Y L+L  IS+G K L   
Sbjct: 243 LSATPS-RLYFGVYANLSSTNTSSGSPVQSTPFVINPALPNMYFLSLKAISLGTKLLPID 301

Query: 349 GFA-------KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI-LDTCFNL 400
                      GG++IDSGT IT L    Y A++   +      P+     I LDTCF  
Sbjct: 302 PLVFAINDDGTGGVIIDSGTSITWLQQDAYEAVRRGLVSAIP-LPAMNDTDIGLDTCFQW 360

Query: 401 SAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETG---IIGN 457
                V + +  + F  ++     +      + S    +CL +A       TG   IIGN
Sbjct: 361 PPPPNVTVTVPDLVFHFDSANMTLLPENYMLIASTTGYLCLVMA------PTGVGTIIGN 414

Query: 458 YQQKNQRVIYDTKNSQLGFAGEDC 481
           YQQ+N  ++YD  NS L F    C
Sbjct: 415 YQQQNLHLLYDIGNSFLSFVPAPC 438


>gi|357450869|ref|XP_003595711.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355484759|gb|AES65962.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 443

 Score =  157 bits (396), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 130/402 (32%), Positives = 184/402 (45%), Gaps = 49/402 (12%)

Query: 109 MISGNIKDVSNTEIPLTSGIRLQTLNYIATIELGGRNMTVI--VDTGSDLTWVQCQPCKS 166
           +I  N   V    I   + + +   +Y+  + +G   +     VDTGSDL W+QC PC +
Sbjct: 33  LIPRNSSQVLFNRITAQTPVSVHHYDYLMELSIGTPPVKTYAQVDTGSDLIWLQCIPCTN 92

Query: 167 CYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPD---CNYFVSYGDG 223
           CY Q +P+FDP  S +Y  +   S +C         S + S+S  PD   CNY  SY D 
Sbjct: 93  CYKQLNPMFDPQSSSTYSNIAYGSESC---------SKLYSTSCSPDQNNCNYTYSYEDD 143

Query: 224 SYTRGELGREHLGL----GK-ASVNDFIFGCGRNNKGLFGGVS-GLMGLGRSDLSLVSQT 277
           S T G L +E L L    GK  ++   IFGCG NN G+F     G++GLGR  LSLVSQ 
Sbjct: 144 SITEGVLAQETLTLTSTTGKPVALKGVIFGCGHNNNGVFNDKEMGIIGLGRGPLSLVSQI 203

Query: 278 SEIFGG-LFSYCL-PSTQDAGASGSLILGGNSSVFKN---STPITYTNMIPNPQLATFYI 332
              FGG +FS CL P   +   +  +  G  S V  N   STP+   N         FY 
Sbjct: 204 GSSFGGKMFSQCLVPFHTNPSITSPMSFGKGSEVLGNGVVSTPLVSKNT-----HQAFYF 258

Query: 333 LNLTGISIGGKQLQASG------FAKGGILIDSGTVITRLPPSIYSALKAEFLKQFS--G 384
           + L GIS+    L  +         KG ++IDSGT  T LP   Y  L  E   + +   
Sbjct: 259 VTLLGISVEDINLPFNDGSSLEPITKGNMVIDSGTPTTLLPEDFYHRLVEEVRNKVALDP 318

Query: 385 FPSAPGFSILDTCFNLSAYQEVNIP--LVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLA 442
            P  P        + L      N+    +   FEG A++ +  T I  F+       C A
Sbjct: 319 IPIDPTLG-----YQLCYRTPTNLKGTTLTAHFEG-ADVLLTPTQI--FIPVQDGIFCFA 370

Query: 443 LASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
             S ++ +E GI GN+ Q N  + +D +   + F   DC+++
Sbjct: 371 FTS-TFSNEYGIYGNHAQSNYLIGFDLEKQLVSFKATDCTNL 411


>gi|414587000|tpg|DAA37571.1| TPA: hypothetical protein ZEAMMB73_036171 [Zea mays]
          Length = 459

 Score =  156 bits (395), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 113/370 (30%), Positives = 170/370 (45%), Gaps = 49/370 (13%)

Query: 147 TVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVC 206
           +  +DT SDL W+QCQPC SCY Q DPVF+P +S SY  V C S TC  L+   G+   C
Sbjct: 106 SAAIDTASDLVWMQCQPCVSCYRQLDPVFNPKLSSSYAVVPCTSDTCAQLD---GHR--C 160

Query: 207 SSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNN-KGLFGGVSGLMG 265
                  C Y   Y     T+G L  + L +G    +  +FGC  ++  G     SGL+G
Sbjct: 161 HEDDDGACQYTYKYSGHGVTKGTLAIDKLAIGGDVFHAVVFGCSDSSVGGPAAQASGLVG 220

Query: 266 LGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNP 325
           LGR  LSLVSQ S      F YCLP    +  SG L+LG  +   +N +      M  + 
Sbjct: 221 LGRGPLSLVSQLSV---HRFMYCLPPPM-SRTSGKLVLGAGADAVRNMSDRVTVTMSSST 276

Query: 326 QLATFYILNLTGISIGGKQLQASGFAKG--------------------------GILIDS 359
           +  ++Y LNL G+++G +    +  A                            G+++D 
Sbjct: 277 RYPSYYYLNLDGLAVGDQTPGTTRNATSPPSGGAGGGGGGGGGGIVGAGGANAYGMIVDV 336

Query: 360 GTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI-LDTCFNLS---AYQEVNIPLVKMEF 415
            + I+ L  S+Y  L  +  ++     + P   + LD CF L        V +P V + F
Sbjct: 337 ASTISFLETSLYDELADDLEEEIRLPRATPSLRLGLDLCFILPEGVGMDRVYVPTVSLSF 396

Query: 416 EGN-AEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQL 474
           +G   E+  D      FV +D   +CL +   S      I+GN+Q +N RV+++ +  ++
Sbjct: 397 DGRWLELDRD----RLFV-TDGRMMCLMIGRTS---GVSILGNFQLQNMRVLFNLRRGKI 448

Query: 475 GFAGEDCSSM 484
            FA   C S+
Sbjct: 449 TFAKASCDSL 458


>gi|115473845|ref|NP_001060521.1| Os07g0658600 [Oryza sativa Japonica Group]
 gi|22775625|dbj|BAC15479.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
 gi|50510141|dbj|BAD31106.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
 gi|113612057|dbj|BAF22435.1| Os07g0658600 [Oryza sativa Japonica Group]
          Length = 449

 Score =  156 bits (395), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 125/418 (29%), Positives = 198/418 (47%), Gaps = 46/418 (11%)

Query: 78  SGKIVDWNEQQQNRLI-LDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRL-QTLNY 135
           +G + D   +  +RL+ LD+L V+                      P+ SG +L QT  Y
Sbjct: 65  AGFLADQAARDASRLLYLDSLAVK-----------------GRAYAPIASGRQLLQTPTY 107

Query: 136 IATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTC 193
           +    LG   + + + VDT +D  W+ C  C  C       F+P+ S SY+ V C S  C
Sbjct: 108 VVRARLGTPAQQLLLAVDTSNDAAWIPCSGCAGCPTSSP--FNPAASASYRPVPCGSPQC 165

Query: 194 HALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNN 253
                    +  CS ++   C + +SY D S  +  L ++ L +    V  + FGC +  
Sbjct: 166 -----VLAPNPSCSPNAK-SCGFSLSYADSSL-QAALSQDTLAVAGDVVKAYTFGCLQRA 218

Query: 254 KGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNS 313
            G      GL+GLGR  LS +SQT +++G  FSYCLPS +    SG+L LG N    +  
Sbjct: 219 TGTAAPPQGLLGLGRGPLSFLSQTKDMYGATFSYCLPSFKSLNFSGTLRLGRNGQPRR-- 276

Query: 314 TPITYTNMIPNPQLATFYILNLTGISIGGK--QLQASGFA-----KGGILIDSGTVITRL 366
             I  T ++ NP  ++ Y +N+TGI +G K   + AS  A       G ++DSGT+ TRL
Sbjct: 277 --IKTTPLLANPHRSSLYYVNMTGIRVGKKVVSIPASALAFDPATGAGTVLDSGTMFTRL 334

Query: 367 PPSIYSALKAEFLKQF-SGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDV 425
              +Y AL+ E  ++  +G  +       DTC+N +    V  P V + F+G      + 
Sbjct: 335 VAPVYLALRDEVRRRVGAGAAAVSSLGGFDTCYNTT----VAWPPVTLLFDGMQVTLPEE 390

Query: 426 TGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSS 483
             +++      S + +A A         +I + QQ+N RV++D  N ++GFA E C++
Sbjct: 391 NVVIHTTYGTTSCLAMAAAPDGVNTVLNVIASMQQQNHRVLFDVPNGRVGFARESCTA 448


>gi|226508202|ref|NP_001141111.1| hypothetical protein precursor [Zea mays]
 gi|194702684|gb|ACF85426.1| unknown [Zea mays]
 gi|414590469|tpg|DAA41040.1| TPA: hypothetical protein ZEAMMB73_571218 [Zea mays]
          Length = 439

 Score =  156 bits (394), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 122/376 (32%), Positives = 189/376 (50%), Gaps = 47/376 (12%)

Query: 135 YIATIELGGRNMT--VIVDTGSDLTWVQCQPC-KSCYNQQDPVFDPSISPSYKKVLCNSS 191
           ++ T+ +G   +    I DTGSDL W QC PC + C+ Q  P+++PS S ++  + CNSS
Sbjct: 85  FLMTLAIGTPPLPFLAIADTGSDLIWTQCAPCSRQCFQQPTPLYNPSSSTTFSALPCNSS 144

Query: 192 TCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFI----- 246
                       G+C+ +    C Y ++YG G +T    G E    G ++  D +     
Sbjct: 145 L-----------GLCAPAC--ACMYNMTYGSG-WTYVFQGTETFTFGSSTPADQVRVPGI 190

Query: 247 -FGCGRNNKGL-FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILG 304
            FGC   + G      SGL+GLGR  LSLVSQ   +    FSYCL   QD  ++ +L+LG
Sbjct: 191 AFGCSNASSGFNASSASGLVGLGRGSLSLVSQ---LGAPKFSYCLTPYQDTNSTSTLLLG 247

Query: 305 GNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA-------KGGILI 357
            ++S+  ++  ++ T  + +P  + +Y LNLTGIS+G   L     A        GG++I
Sbjct: 248 PSASL-NDTGVVSSTPFVASPS-SIYYYLNLTGISLGTTALPIPPNAFSLKADGTGGLII 305

Query: 358 DSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI--LDTCFNL--SAYQEVNIPLVKM 413
           DSGT IT L  + Y  ++A  L   +  P+  G +   LD CF L  S     ++P + +
Sbjct: 306 DSGTTITMLGNTAYQQVRAAVLSLVT-LPTTDGSAATGLDLCFELPSSTSAPPSMPSMTL 364

Query: 414 EFEGNAEMTVDVTGIVYFVKSDASQV---CLALASLSYED--ETGIIGNYQQKNQRVIYD 468
            F+G A+M +     +  +    S     CLA+ + +  D     I+GNYQQ+N  ++YD
Sbjct: 365 HFDG-ADMVLPADNYMMSLSDPDSDSSLWCLAMQNQTDTDGVVVSILGNYQQQNMHILYD 423

Query: 469 TKNSQLGFAGEDCSSM 484
                L FA   CS++
Sbjct: 424 VGKETLSFAPAKCSTL 439


>gi|356495754|ref|XP_003516738.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 420

 Score =  155 bits (393), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 129/422 (30%), Positives = 191/422 (45%), Gaps = 51/422 (12%)

Query: 92  LILDNLHVQYLQSRIKNMISGN-----------IKDVSNTEIPLT--SGIRLQTLNYIAT 138
           L+L  LH+   +    N+I  N           + ++S  E  LT  S I     +Y+  
Sbjct: 16  LMLLPLHISATEGFSVNLIRKNSSHAHVLPLRRLMELSAMEKTLTPQSPIYAYLGHYLME 75

Query: 139 IELGGRNMTV--IVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHAL 196
           + +G     +  I DTGSDLTW  C PC +CY Q++P+FDP  S +Y+ + C+S  CH L
Sbjct: 76  LSIGTPPFKIYGIADTGSDLTWTSCVPCNNCYKQRNPMFDPQKSTTYRNISCDSKLCHKL 135

Query: 197 EFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL----GKA-SVNDFIFGCGR 251
           +     +GVCS      CNY  +Y   + TRG L +E + L    GK+  +   +FGCG 
Sbjct: 136 D-----TGVCSPQK--RCNYTYAYASAAITRGVLAQETITLSSTKGKSVPLKGIVFGCGH 188

Query: 252 NNKGLFGGVS-GLMGLGRSDLSLVSQTSEIFGG-LFSYCL-PSTQDAGASGSLILGGNSS 308
           NN G F     G++GLG   +SL+SQ    FGG  FS CL P   D   S  +  G  S 
Sbjct: 189 NNTGGFNDHEMGIIGLGGGPVSLISQMGSSFGGKRFSQCLVPFHTDVSVSSKMSFGKGSK 248

Query: 309 VFKN---STPITYTNMIPNPQLATFYILNLTGISIGGKQLQASG----FAKGGILIDSGT 361
           V      STP+         Q  T Y + L GIS+    L  +G      KG + +DSGT
Sbjct: 249 VSGKGVVSTPLVAK------QDKTPYFVTLLGISVENTYLHFNGSSQNVEKGNMFLDSGT 302

Query: 362 VITRLPPSIYSALKAEFLKQFSGFPSAPGFSI-LDTCFNLSAYQEVNIPLVKMEFEGNAE 420
             T LP  +Y  + A+   + +  P      +    C+       +  P++   FEG   
Sbjct: 303 PPTILPTQLYDQVVAQVRSEVAMKPVTDDPDLGPQLCYRTK--NNLRGPVLTAHFEG--- 357

Query: 421 MTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGED 480
             V ++    F+       CL   + S   + G+ GN+ Q N  + +D     + F  +D
Sbjct: 358 ADVKLSPTQTFISPKDGVFCLGFTNTS--SDGGVYGNFAQSNYLIGFDLDRQVVSFKPKD 415

Query: 481 CS 482
           C+
Sbjct: 416 CT 417


>gi|222637611|gb|EEE67743.1| hypothetical protein OsJ_25435 [Oryza sativa Japonica Group]
          Length = 396

 Score =  155 bits (393), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 117/372 (31%), Positives = 183/372 (49%), Gaps = 28/372 (7%)

Query: 123 PLTSGIRL-QTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSI 179
           P+ SG +L QT  Y+    LG   + + + VDT +D  W+ C  C  C       F+P+ 
Sbjct: 41  PIASGRQLLQTPTYVVRARLGTPAQQLLLAVDTSNDAAWIPCSGCAGCPTSSP--FNPAA 98

Query: 180 SPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGK 239
           S SY+ V C S  C         +  CS ++   C + +SY D S  +  L ++ L +  
Sbjct: 99  SASYRPVPCGSPQC-----VLAPNPSCSPNAK-SCGFSLSYADSSL-QAALSQDTLAVAG 151

Query: 240 ASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASG 299
             V  + FGC +   G      GL+GLGR  LS +SQT +++G  FSYCLPS +    SG
Sbjct: 152 DVVKAYTFGCLQRATGTAAPPQGLLGLGRGPLSFLSQTKDMYGATFSYCLPSFKSLNFSG 211

Query: 300 SLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGK--QLQASGFA-----K 352
           +L LG N    +    I  T ++ NP  ++ Y +N+TGI +G K   + AS  A      
Sbjct: 212 TLRLGRNGQPRR----IKTTPLLANPHRSSLYYVNMTGIRVGKKVVSIPASALAFDPATG 267

Query: 353 GGILIDSGTVITRLPPSIYSALKAEFLKQF-SGFPSAPGFSILDTCFNLSAYQEVNIPLV 411
            G ++DSGT+ TRL   +Y AL+ E  ++  +G  +       DTC+N +    V  P V
Sbjct: 268 AGTVLDSGTMFTRLVAPVYLALRDEVRRRVGAGAAAVSSLGGFDTCYNTT----VAWPPV 323

Query: 412 KMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKN 471
            + F+G      +   +++      S + +A A         +I + QQ+N RV++D  N
Sbjct: 324 TLLFDGMQVTLPEENVVIHTTYGTTSCLAMAAAPDGVNTVLNVIASMQQQNHRVLFDVPN 383

Query: 472 SQLGFAGEDCSS 483
            ++GFA E C++
Sbjct: 384 GRVGFARESCTA 395


>gi|413936472|gb|AFW71023.1| hypothetical protein ZEAMMB73_589717 [Zea mays]
          Length = 289

 Score =  155 bits (392), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 105/291 (36%), Positives = 151/291 (51%), Gaps = 34/291 (11%)

Query: 206 CSSSSPP-----------DCNYFVSYGDGSYTRGELGREHLGLGK-ASVNDFIFGCGRNN 253
           C+ SSPP            C + +SY DG+ T G   ++ L L   A V +F FGCG   
Sbjct: 18  CARSSPPMRTAAAVTSGKQCGFAISYADGTSTVGAYSQDKLTLAPGAIVQNFYFGCGHGK 77

Query: 254 KGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNS 313
             + G   G++GLGR   SL ++    +GG+FSYCLPS       G L LG      KN 
Sbjct: 78  HAVRGLFDGVLGLGRLRESLGAR----YGGVFSYCLPSVSSK--PGFLALGAG----KNP 127

Query: 314 TPITYTNMIPNPQLATFYILNLTGISIGGKQL--QASGFAKGGILIDSGTVITRLPPSIY 371
           +   +T M   P   TF  + L GI++GGK+L  + S F+ GG+++DSGTVIT L  + Y
Sbjct: 128 SGFVFTPMGTVPGQPTFSTVTLAGINVGGKKLDLRPSAFS-GGMIVDSGTVITGLQSTAY 186

Query: 372 SALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDV-TGIVY 430
            AL++ F K    +   P    LDTC+NL+ Y+ V +P + + F G A + +DV  GI+ 
Sbjct: 187 RALRSAFRKAMEAYRLLPN-GDLDTCYNLTGYKNVVVPKIALTFTGGATINLDVPNGILV 245

Query: 431 FVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
                    CLA A    +   G++GN  Q+   V++DT  S+ GF  + C
Sbjct: 246 -------NGCLAFAESGPDGSAGVLGNVNQRAFEVLFDTSTSKFGFRAKAC 289


>gi|413951280|gb|AFW83929.1| hypothetical protein ZEAMMB73_279135 [Zea mays]
          Length = 451

 Score =  155 bits (392), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 109/377 (28%), Positives = 186/377 (49%), Gaps = 33/377 (8%)

Query: 122 IPLTSGIRLQTL-NYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPS 178
           +P+  G +L ++ +Y+A   LG   + + V +D  +D  WV C         + P FDP+
Sbjct: 93  VPIAPGRQLLSIPSYVARARLGTPAQALLVAIDPSNDAAWVPCA--ACAGCARAPSFDPT 150

Query: 179 ISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG 238
            S +Y+ V C +  C      +   G+ SS     C + +SY   ++ +  LG++ L L 
Sbjct: 151 RSSTYRPVRCGAPQCSQAPAPSCPGGLGSS-----CAFNLSYAASTF-QALLGQDALALH 204

Query: 239 KA--SVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAG 296
               +V  + FGC     G      GL+G GR  LS  SQT +++G +FSYCLPS + + 
Sbjct: 205 DDVDAVAAYTFGCLHVVTGGSVPPQGLVGFGRGPLSFPSQTKDVYGSVFSYCLPSYKSSN 264

Query: 297 ASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKG--- 353
            SG+L LG      +    I  T ++ NP   + Y +N+ GI +GG+ +     A     
Sbjct: 265 FSGTLRLGPAGQPKR----IKTTPLLSNPHRPSLYYVNMVGIRVGGRPVPVPASALAFDP 320

Query: 354 ----GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIP 409
               G ++D+GT+ TRL   +Y+A++  F  +    P A      DTC+N++    +++P
Sbjct: 321 TSGRGTIVDAGTMFTRLSAPVYAAVRDVFRSRVRA-PVAGPLGGFDTCYNVT----ISVP 375

Query: 410 LVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLS---YEDETGIIGNYQQKNQRVI 466
            V   F+G   +T+    +V    S     CLA+A+      +    ++ + QQ+N RV+
Sbjct: 376 TVTFSFDGRVSVTLPEENVV-IRSSSGGIACLAMAAGPPDGVDAALNVLASMQQQNHRVL 434

Query: 467 YDTKNSQLGFAGEDCSS 483
           +D  N ++GF+ E C++
Sbjct: 435 FDVANGRVGFSRELCTA 451


>gi|224126551|ref|XP_002329582.1| predicted protein [Populus trichocarpa]
 gi|222870291|gb|EEF07422.1| predicted protein [Populus trichocarpa]
          Length = 440

 Score =  155 bits (392), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 135/442 (30%), Positives = 202/442 (45%), Gaps = 52/442 (11%)

Query: 65  GAITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMIS--GNIKDVSNTEI 122
              T EL H++  S K   +N QQ         H+Q     ++  +S   + +  + T  
Sbjct: 29  AGFTTELVHRD--SPKSPLYNSQQT--------HLQRWNKAMRRSVSRVHHFQRTAATVS 78

Query: 123 P--LTSGIRLQTLNYIATIELGGRNMTV--IVDTGSDLTWVQCQPCKSCYNQQDPVFDPS 178
           P  + S I      Y+ ++ LG     +  I DTGSDL W QC PC  CY Q  P+FDP 
Sbjct: 79  PKEVESEIIANGGEYLMSLSLGTPPFEILAIADTGSDLIWTQCTPCDKCYKQIAPLFDPK 138

Query: 179 ISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL- 237
            S +Y+ + C++  C  L    G S  CSS     C Y   YGD S+T G L  + + L 
Sbjct: 139 SSKTYRDLSCDTRQCQNL----GESSSCSSEQL--CQYSYYYGDRSFTNGNLAVDTVTLP 192

Query: 238 ----GKASVNDFIFGCGRNNKGLFGGV-SGLMGLGRSDLSLVSQTSEIFGGLFSYCLP-- 290
               G       + GCGR N G F    SG++GLG   +SL+SQ     GG FSYCL   
Sbjct: 193 STNGGPVYFPKTVIGCGRRNNGTFDKKDSGIIGLGGGPMSLISQMGSSVGGKFSYCLVPF 252

Query: 291 STQDAGASGSLILGGNSSVFKN---STPITYTNMIPNPQLATFYILNLTGISIGGKQL-- 345
           S++ AG S  L  G N+ V  +   STP+    +  NP   TFY L L  +S+G K++  
Sbjct: 253 SSESAGNSSKLHFGRNAVVSGSGVQSTPL----ISKNPD--TFYYLTLEAMSVGDKKIEF 306

Query: 346 --QASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQ-FSGFPSAPGFSILDTCFNLSA 402
              + G ++G I+IDSGT +T  P + ++           +G  +     +L  C+  + 
Sbjct: 307 GGSSFGGSEGNIIIDSGTSLTLFPVNFFTEFATAVENAVINGERTQDASGLLSHCYRPTP 366

Query: 403 YQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKN 462
             ++ +P++   F G A++ +        +  D   +CLA  S        I GN  Q N
Sbjct: 367 --DLKVPVITAHFNG-ADVVLQTLNTFILISDDV--LCLAFNS---TQSGAIFGNVAQMN 418

Query: 463 QRVIYDTKNSQLGFAGEDCSSM 484
             + YD +   + F   DC+ +
Sbjct: 419 FLIGYDIQGKSVSFKPTDCTQL 440


>gi|125547728|gb|EAY93550.1| hypothetical protein OsI_15341 [Oryza sativa Indica Group]
          Length = 418

 Score =  155 bits (392), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 118/359 (32%), Positives = 178/359 (49%), Gaps = 57/359 (15%)

Query: 144 RNMTVIVDTGSDLTWVQCQ--PCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATG 201
           + + + +DTGSD+TW QC+  P  +C+NQ  P+FDPS S S+  + C+S  C       G
Sbjct: 99  QEVQLTLDTGSDITWTQCKRCPASACFNQTLPLFDPSASSSFASLPCSSPACETTPPCGG 158

Query: 202 NSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHL----GLGK---ASVNDFIFGCGRNNK 254
             G  ++S P  CNY +SYGDGS +RGE+GRE      G G+   A+V   +FGCG  N+
Sbjct: 159 --GNDATSRP--CNYSISYGDGSVSRGEIGREVFTFASGTGEGSSAAVPGLVFGCGHANR 214

Query: 255 GLF-GGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNS 313
           G+F    +G+ G GR  LSL SQ   +  G FS+C  +T     + +++LG       ++
Sbjct: 215 GVFTSNETGIAGFGRGSLSLPSQ---LKVGNFSHCF-TTITGSKTSAVLLGLPGVAPPSA 270

Query: 314 TPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGGILIDSGTVITRLPPSIYSA 373
           +P+                    G   G  + +++  +      +SGT IT LPP  Y A
Sbjct: 271 SPL--------------------GRRRGSYRCRSTPRSS-----NSGTSITSLPPRTYRA 305

Query: 374 LKAEFLKQFSGFPSAPGFSILD-TCFNLSAY-QEVNIPLVKMEFEGNAEMTVDVTGIVYF 431
           ++ EF  Q    P  PG +    TCF+      + ++P + + FEG A M +     V+ 
Sbjct: 306 VREEFAAQVK-LPVVPGNATDPFTCFSAPLRGPKPDVPTMALHFEG-ATMRLPQENYVFE 363

Query: 432 VKSDASQ------VCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
           V  D         +CLA+     E    I+GN QQ+N  V+YD +NS+L F    C  +
Sbjct: 364 VVDDDDAGNSSRIICLAV----IEGGEIILGNIQQQNMHVLYDLQNSKLSFVPAQCDQL 418


>gi|147811402|emb|CAN61225.1| hypothetical protein VITISV_006732 [Vitis vinifera]
          Length = 440

 Score =  155 bits (391), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 134/448 (29%), Positives = 202/448 (45%), Gaps = 52/448 (11%)

Query: 55  VSH-QKSRIEMGAITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGN 113
           V+H   + ++ G  +++L H++     + + +E    RL  D    +++ S  +  IS N
Sbjct: 22  VAHISAAEVKNGRFSIDLIHRDSPKSPLYNPSETPAERL--DRFFRRFM-SFSEASISPN 78

Query: 114 IKDVSNTEIPLTSGIRLQTLNYIATIELGGRNMTV--IVDTGSDLTWVQCQPCKSCYNQQ 171
                  E P++S        Y+  I +G     V  I DTGSDL W QC PC SCY Q+
Sbjct: 79  -----TPEPPVSS----NNGEYLMKISIGTPPFDVYGIYDTGSDLMWTQCLPCLSCYKQK 129

Query: 172 DPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPD-CNYFVSYGDGSYTRGEL 230
           +P+FDPS S S+K+V C S  C  L+         S S P   C++   YGDGS  +G +
Sbjct: 130 NPMFDPSKSTSFKEVSCESQQCRLLD-------TVSCSQPQKLCDFSYGYGDGSLAQGVI 182

Query: 231 GREHLGLG-----KASVNDFIFGCGRNNKGLFG-GVSGLMGLGRSDLSLVSQTSEIFGG- 283
             E L L        S+ + +FGCG NN G F     GL G G   LSL SQ     G  
Sbjct: 183 ATETLTLNSNSGQPXSIXNIVFGCGHNNSGTFNENEMGLFGTGGRPLSLTSQIMSTLGSG 242

Query: 284 -LFSYCL-PSTQDAGASGSLILGGNSSVFKN---STPITYTNMIPNPQLATFYILNLTGI 338
             FS CL P   D   +  +I G  + V  +   STP+   +   +P   T+Y + L GI
Sbjct: 243 RKFSQCLVPFRTDPSITSKIIFGPEAEVSGSXVVSTPLVTKD---DP---TYYFVTLDGI 296

Query: 339 SIGGKQLQASGFA----KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSIL 394
           S+G K    S  +    KG + ID+GT  T LP   Y+ L     +     P        
Sbjct: 297 SVGDKLFPFSSSSPMATKGNVFIDAGTPPTLLPRDFYNRLVQGVKEAIPMEPVQDPDLQP 356

Query: 395 DTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGI 454
             C+  +    ++ P++   F+G     V +  +  F+       C A+  +  + +TGI
Sbjct: 357 QLCYRSATL--IDGPILTAHFDG---ADVQLKPLNTFISPKEGVYCFAMQPI--DGDTGI 409

Query: 455 IGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
            GN+ Q N  + +D    ++ F   DC+
Sbjct: 410 FGNFVQMNFLIGFDLDGKKVSFKAVDCT 437


>gi|302821814|ref|XP_002992568.1| hypothetical protein SELMODRAFT_46291 [Selaginella moellendorffii]
 gi|300139637|gb|EFJ06374.1| hypothetical protein SELMODRAFT_46291 [Selaginella moellendorffii]
          Length = 368

 Score =  155 bits (391), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 120/373 (32%), Positives = 188/373 (50%), Gaps = 49/373 (13%)

Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFAT--G 201
           +N++ I+DTGS+   VQC       ++  PVFDP+ S SY++V C S  C A++  T  G
Sbjct: 10  KNLSAIIDTGSEAVLVQCG------SRSRPVFDPAASQSYRQVPCISQLCLAVQQQTSNG 63

Query: 202 NSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS-------VNDFIFGCGRNNK 254
           +S  C +SS   C Y +SYGD   + G+  ++ + L   +         D  FGC  + +
Sbjct: 64  SSQPCVNSSAA-CTYSLSYGDSRNSTGDFSQDVIFLNSTNSSSQAVQFRDVAFGCAHSPQ 122

Query: 255 GLFG--GVSGLMGLGRSDLSLVSQTSEIFGG-LFSYCLPSTQDAGASGSLILGGNSSVFK 311
           G     G  G++G  R +LSL SQ  +  GG  FSYC PS      +  +I  G+S + K
Sbjct: 123 GFLVDLGSLGIVGFNRGNLSLPSQLKDRLGGSKFSYCFPSQPWQPRATGVIFLGDSGLSK 182

Query: 312 NSTPITYTNMIPN---PQLATFYILNLTGISIGGKQLQ--ASGF------AKGGILIDSG 360
             + ++YT ++ N   P  +  Y + LT IS+ GK L    S F        GG ++DSG
Sbjct: 183 --SKVSYTPLLDNPVTPARSQLYYVGLTSISVDGKTLAIPESAFKLDPSTGDGGTVLDSG 240

Query: 361 TVITRLPPSIYSALKAEF-------LKQFSGFPSAPGFSILDTCFNLSAYQEV-NIPLVK 412
           T  TR+    Y+A +  F       L++  G  +A GF   D C+N+SA   +  +P V+
Sbjct: 241 TTFTRVVDDAYTAFRNAFAASNRSGLRKKVG--AAAGF---DDCYNISAGSSLPGVPEVR 295

Query: 413 MEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYED----ETGIIGNYQQKNQRVIYD 468
           +  + N  + +    +   V +  ++V + LA LS +     +  ++GNYQQ N  V YD
Sbjct: 296 LSLQNNVRLELRFEHLFVPVSAAGNEVTVCLAILSSQKSGFGKINVLGNYQQSNYLVEYD 355

Query: 469 TKNSQLGFAGEDC 481
            + S++GF   DC
Sbjct: 356 NERSRVGFERADC 368


>gi|356557014|ref|XP_003546813.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 435

 Score =  154 bits (390), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 132/450 (29%), Positives = 210/450 (46%), Gaps = 43/450 (9%)

Query: 51  SSSCVSHQKSRIEMGAITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMI 110
           S S +S +++R  +   +++L H++  S    + +     R+I   L       R+ + +
Sbjct: 13  SLSTLSSREAREGLRGFSVDLIHRDSPSSPFYNPSLTPSERIINAALRSMSRLQRVSHFL 72

Query: 111 SGNIKDVSNTEIPLTSGIRLQTLNYIATIELGGRNMT--VIVDTGSDLTWVQCQPCKSCY 168
             N K   +  IP           Y+    +G   +    +VDTGS L W+QC PC +C+
Sbjct: 73  DEN-KLPESLLIP-------DKGEYLMRFYIGSPPVERLAMVDTGSSLIWLQCSPCHNCF 124

Query: 169 NQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRG 228
            Q+ P+F+P  S +YK   C+S  C  L+ +  + G         C Y + YGD S++ G
Sbjct: 125 PQETPLFEPLKSSTYKYATCDSQPCTLLQPSQRDCGKLG-----QCIYGIMYGDKSFSVG 179

Query: 229 ELGREHLGLGK------ASVNDFIFGCGRNNKGLF---GGVSGLMGLGRSDLSLVSQTSE 279
            LG E L  G        S  + IFGCG +N         V G+ GLG   LSLVSQ   
Sbjct: 180 ILGTETLSFGSTGGAQTVSFPNTIFGCGVDNNFTIYTSNKVMGIAGLGAGPLSLVSQLGA 239

Query: 280 IFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGIS 339
             G  FSYCL    D+ ++  L  G  + +  N   +  T +I  P L T+Y LNL  ++
Sbjct: 240 QIGHKFSYCL-LPYDSTSTSKLKFGSEAIITTNG--VVSTPLIIKPSLPTYYFLNLEAVT 296

Query: 340 IGGKQLQASGFAKGGILIDSGTVITRLPPSIYSALKAEF-----LKQFSGFPSAPGFSIL 394
           IG K + ++G   G I+IDSGT +T L  + Y+   A       +K     PS      L
Sbjct: 297 IGQK-VVSTGQTDGNIVIDSGTPLTYLENTFYNNFVASLQETLGVKLLQDLPSP-----L 350

Query: 395 DTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGI 454
            TCF   A   + IP +  +F G A + +    ++  + +D++ +CLA+   S      +
Sbjct: 351 KTCFPNRA--NLAIPDIAFQFTG-ASVALRPKNVLIPL-TDSNILCLAVVP-SSGIGISL 405

Query: 455 IGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
            G+  Q + +V YD +  ++ FA  DC+ +
Sbjct: 406 FGSIAQYDFQVEYDLEGKKVSFAPTDCAKV 435


>gi|225436202|ref|XP_002271145.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
           vinifera]
          Length = 440

 Score =  154 bits (389), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 134/448 (29%), Positives = 202/448 (45%), Gaps = 52/448 (11%)

Query: 55  VSH-QKSRIEMGAITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGN 113
           V+H   + ++ G  +++L H++     + + +E    RL  D    +++ S  +  IS N
Sbjct: 22  VAHISAAEVKNGRFSIDLIHRDSPKSPLYNPSETPAERL--DRFFRRFM-SFSEASISPN 78

Query: 114 IKDVSNTEIPLTSGIRLQTLNYIATIELGGRNMTV--IVDTGSDLTWVQCQPCKSCYNQQ 171
                  E P++S        Y+  I +G     V  I DTGSDL W QC PC SCY Q+
Sbjct: 79  -----TPEPPVSS----NNGEYLMKISIGTPPFDVYGIYDTGSDLMWTQCLPCLSCYKQK 129

Query: 172 DPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPD-CNYFVSYGDGSYTRGEL 230
           +P+FDPS S S+K+V C S  C  L+         S S P   C++   YGDGS  +G +
Sbjct: 130 NPMFDPSKSTSFKEVSCESQQCRLLD-------TVSCSQPQKLCDFSYGYGDGSLAQGVI 182

Query: 231 GREHLGLG-----KASVNDFIFGCGRNNKGLFG-GVSGLMGLGRSDLSLVSQTSEIFGG- 283
             E L L        S+ + +FGCG NN G F     GL G G   LSL SQ     G  
Sbjct: 183 ATETLTLNSNSGQPTSILNIVFGCGHNNSGTFNENEMGLFGTGGRPLSLTSQIMSTLGSG 242

Query: 284 -LFSYCL-PSTQDAGASGSLILGGNSSVFKN---STPITYTNMIPNPQLATFYILNLTGI 338
             FS CL P   D   +  +I G  + V  +   STP+   +   +P   T+Y + L GI
Sbjct: 243 RKFSQCLVPFRTDPSITSKIIFGPEAEVSGSDVVSTPLVTKD---DP---TYYFVTLDGI 296

Query: 339 SIGGKQLQASGFA----KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSIL 394
           S+G K    S  +    KG + ID+GT  T LP   Y+ L     +     P        
Sbjct: 297 SVGDKLFPFSSSSPMATKGNVFIDAGTPPTLLPRDFYNRLVQGVKEAIPMEPVQDPDLQP 356

Query: 395 DTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGI 454
             C+  +    ++ P++   F+G     V +  +  F+       C A+  +  + +TGI
Sbjct: 357 QLCYRSATL--IDGPILTAHFDG---ADVQLKPLNTFISPKEGVYCFAMQPI--DGDTGI 409

Query: 455 IGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
            GN+ Q N  + +D    ++ F   DC+
Sbjct: 410 FGNFVQMNFLIGFDLDGKKVSFKAVDCT 437


>gi|357486591|ref|XP_003613583.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355514918|gb|AES96541.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 437

 Score =  154 bits (388), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 114/349 (32%), Positives = 165/349 (47%), Gaps = 33/349 (9%)

Query: 149 IVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSS 208
           ++DT +D  W QC PCK C+N   P+FDPS S +YK + C+S  C  +E     +  CSS
Sbjct: 105 VMDTANDNIWFQCNPCKPCFNTTSPMFDPSKSSTYKTIPCSSPKCKNVE-----NTHCSS 159

Query: 209 SSPPDCNYFVSYGDGSYTRGELGREHLGLGK-----ASVNDFIFGCGRNNKG-LFGGVSG 262
                C Y  +YG  +Y++G+L  + L L        S  + + GCG  NKG L G VSG
Sbjct: 160 DDKKVCEYSFTYGGEAYSQGDLSIDTLTLNSNNDTPISFKNIVIGCGHRNKGPLEGYVSG 219

Query: 263 LMGLGRSDLSLVSQTSEIFGGLFSYCL-PSTQDAGASGSLILGGNSSVF---KNSTPITY 318
            +GLGR  LS +SQ +   GG FSYCL P   + G SG L  G  S V      STPIT 
Sbjct: 220 NIGLGRGPLSFISQLNSSIGGKFSYCLVPLFSNEGISGKLHFGDKSVVSGVGTVSTPITA 279

Query: 319 TNMIPNPQLATFYILNLTGISIGGKQLQASGFAK-----GGILIDSGTVITRLPPSIYSA 373
             +         Y   L  +S+G   ++           G  +IDSGT +T LP ++YS 
Sbjct: 280 GEI--------GYSTTLNALSVGDHIIKFENSTSKNDNLGNTIIDSGTTLTILPENVYSR 331

Query: 374 LKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVK 433
           L++          +         C+  +  + +++P++   F G     V +  +  F  
Sbjct: 332 LESIVTSMVKLERAKSPNQQFKLCYK-ATLKNLDVPIITAHFNG---ADVHLNSLNTFYP 387

Query: 434 SDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
            D   VC A  S+     T IIGN  Q+N  V +D + + + F   DC+
Sbjct: 388 IDHEVVCFAFVSVGNFPGT-IIGNIAQQNFLVGFDLQKNIISFKPTDCT 435


>gi|255543963|ref|XP_002513044.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223548055|gb|EEF49547.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 431

 Score =  154 bits (388), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 120/420 (28%), Positives = 192/420 (45%), Gaps = 47/420 (11%)

Query: 76  YCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRL-QTLN 134
           +   K + W E        D   +Q+L S +             + +P+ SG ++ Q+  
Sbjct: 46  FWPSKPLKWEESVLQMQAKDQARLQFLSSLVAR----------KSVVPIASGRQIVQSPT 95

Query: 135 YIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
           YI   ++G   + M + +DT +D  W+ C  C  C +    VF+   S ++K V C +  
Sbjct: 96  YIVRAKIGTPAQTMLLAMDTSNDAAWIPCSGCVGCSST---VFNNVKSTTFKTVGCEAPQ 152

Query: 193 CHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRN 252
           C  +      +  C  S+   C + ++YG  S     L ++ + L   S+  + FGC   
Sbjct: 153 CKQVP-----NSKCGGSA---CAFNMTYGSSSIA-ANLSQDVVTLATDSIPSYTFGCLTE 203

Query: 253 NKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKN 312
             G      GL+GLGR  +SL+SQT  ++   FSYCLPS +    SGSL LG      + 
Sbjct: 204 ATGSSIPPQGLLGLGRGPMSLLSQTQNLYQSTFSYCLPSFRSLNFSGSLRLGPVGQPKR- 262

Query: 313 STPITYTNMIPNPQLATFYILNLTGISIGGK--QLQASGFA-----KGGILIDSGTVITR 365
              I  T ++ NP+ ++ Y +NL  I +G +   +  S  A       G + DSGTV TR
Sbjct: 263 ---IKTTPLLKNPRRSSLYYVNLMAIRVGRRVVDIPPSALAFNPTTGAGTIFDSGTVFTR 319

Query: 366 LPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDV 425
           L    Y+A++  F K+  G  +       DTC+       +  P +   F G   M V +
Sbjct: 320 LVAPAYTAVRDAFRKRV-GNATVTSLGGFDTCYT----SPIVAPTITFMFSG---MNVTL 371

Query: 426 TGIVYFVKSDASQV-CLALASL--SYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
                 + S AS + CLA+A+   +      +I N QQ+N R+++D  NS+LG A E C+
Sbjct: 372 PPDNLLIHSTASSITCLAMAAAPDNVNSVLNVIANMQQQNHRILFDVPNSRLGVAREPCT 431


>gi|226492150|ref|NP_001146362.1| hypothetical protein precursor [Zea mays]
 gi|219886805|gb|ACL53777.1| unknown [Zea mays]
 gi|414878074|tpg|DAA55205.1| TPA: hypothetical protein ZEAMMB73_415404 [Zea mays]
          Length = 440

 Score =  154 bits (388), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 130/388 (33%), Positives = 177/388 (45%), Gaps = 54/388 (13%)

Query: 134 NYIATIELGG--RNMTVIVDTGSDLTWVQCQPCK-SCYNQQDPVFDPSISPSYKKVLCNS 190
            YIA   +G   +    I+DTGS+L W QC  C+ +C+ Q  P +DPS S + + V CN 
Sbjct: 70  QYIAEYLIGDPPQRAEAIIDTGSNLIWTQCSRCRPTCFRQNLPYYDPSRSRAARAVGCND 129

Query: 191 STCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGC- 249
           + C     A G+   C S +   C     YG G+   G L  E+L     +V+  +FGC 
Sbjct: 130 AAC-----ALGSETQCLSDNK-TCAVVTGYGAGNIA-GTLATENLTFQSETVS-LVFGCI 181

Query: 250 --GRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL---------PSTQDAGAS 298
              + + G   G SG++GLGR  LSL SQ  +     FSYCL         PS    GAS
Sbjct: 182 VVTKLSPGSLNGASGIIGLGRGKLSLPSQLGDT---RFSYCLTPYFEDTIEPSHMVVGAS 238

Query: 299 GSLILGGNSSVFKNSTPITYTNMIPNPQ---LATFYILNLTGISIGGKQLQASGFA---- 351
             LI G  SS     TP+T    + +P     +TFY L LTGI+ G  +L     A    
Sbjct: 239 AGLINGSASS-----TPVTTVPFVRSPSDDPFSTFYYLPLTGITAGKVKLAVPSAAFDLR 293

Query: 352 ------KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAP--GFSILDTCFNLSAY 403
                   G  IDSG  +T L    Y AL+AE  +Q       P  G +  D C  L   
Sbjct: 294 QVAPGMWTGTFIDSGAPLTSLVDVAYQALRAELARQLGAALVQPLAGTTGFDLCVALKDA 353

Query: 404 QEVNIPLVKMEFEGNAEMTVD--VTGIVYFVKSDASQVCLALASLSYE-----DETGIIG 456
           + +  PLV + F G +    D  V    Y+   D++  C+ + S         +ET +IG
Sbjct: 354 ERLVPPLV-LHFGGGSGTGTDLVVPPANYWAPVDSATACMVVFSSVDRKSLPMNETTVIG 412

Query: 457 NYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
           NY Q+N  V+YD     L F   DCSS+
Sbjct: 413 NYMQQNMHVLYDLAGGVLSFQPADCSSI 440


>gi|222624820|gb|EEE58952.1| hypothetical protein OsJ_10633 [Oryza sativa Japonica Group]
          Length = 415

 Score =  153 bits (387), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 123/368 (33%), Positives = 174/368 (47%), Gaps = 49/368 (13%)

Query: 130 LQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVL 187
           + T  Y+  + +G   + + + +DTGSDL W QCQPC +C++Q  P FDPS S +     
Sbjct: 84  VPTTEYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTS 143

Query: 188 CNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIF 247
           C+S+ C  L  A          S P  + F   G G                ASV    F
Sbjct: 144 CDSTLCQGLPVA----------SLPRSDKFTFVGAG----------------ASVPGVAF 177

Query: 248 GCGRNNKGLF-GGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGN 306
           GCG  N G+F    +G+ G GR  LSL SQ      G FS+C  +T       +++L   
Sbjct: 178 GCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKV---GNFSHCF-TTITGAIPSTVLLDLP 233

Query: 307 SSVFKNST-PITYTNMIPNPQLATFYILNLTGISIGGKQLQA--SGFA----KGGILIDS 359
           + +F N    +  T +I NP   TFY L+L GI++G  +L    S FA     GG +IDS
Sbjct: 234 ADLFSNGQGAVQTTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFALKNGTGGTIIDS 293

Query: 360 GTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVN--IPLVKMEFEG 417
           GT +T LP  +Y  ++  F  Q    P   G +  D  F LSA       +P + + FEG
Sbjct: 294 GTAMTSLPTRVYRLVRDAFAAQVK-LPVVSG-NTTDPYFCLSAPLRAKPYVPKLVLHFEG 351

Query: 418 NAEMTVDVTGIVYFVKSDASQV-CLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGF 476
            A M +     V+ V+   S + CLA+       E   IGN+QQ+N  V+YD +NS+L F
Sbjct: 352 -ATMDLPRENYVFEVEDAGSSILCLAIIE---GGEVTTIGNFQQQNMHVLYDLQNSKLSF 407

Query: 477 AGEDCSSM 484
               C  +
Sbjct: 408 VPAQCDKL 415


>gi|357457681|ref|XP_003599121.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355488169|gb|AES69372.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 439

 Score =  153 bits (387), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 119/429 (27%), Positives = 198/429 (46%), Gaps = 44/429 (10%)

Query: 82  VDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIK---------DVSNTEIPLTSGIRLQT 132
           V+      +R    N+    LQ RI N+++ +IK          +S+ ++P  + I    
Sbjct: 29  VELIHPDSSRSPFYNIRETQLQ-RISNVVTHSIKRAHYLNHVFSLSHNDLPKPTIIPYAG 87

Query: 133 LNYIATIELGGRNMTV--IVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNS 190
             Y+ +  +G     +  +VDTGSD  W QC+PCK C NQ  P+F+PS S +YK + C+S
Sbjct: 88  SYYVMSYSIGTPPFQLYGVVDTGSDGIWFQCKPCKPCLNQTSPIFNPSKSSTYKNIRCSS 147

Query: 191 STCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGK-----ASVNDF 245
             C       G    CSS+    C Y ++Y D S ++G++ ++ L L        S    
Sbjct: 148 PIC-----KRGEKTRCSSNRKRKCEYEITYLDRSGSQGDISKDTLTLNSNDGSPISFPKI 202

Query: 246 IFGCGRNNKGLFGGV-SGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILG 304
           + GCG  N     G+ SG++G GR + S+VSQ     GG FSYCL S        S +  
Sbjct: 203 VIGCGHKNSLTTEGLASGIIGFGRGNFSIVSQLGSSIGGKFSYCLASLFSKANISSKLYF 262

Query: 305 GNSSVFKNSTPITYTNMIPNPQLATFYI----LNLTGISIGGK--QLQASGFA---KGGI 355
           G+ +V      ++   ++  P + +FY+     NL   S+G    +L+ S      +G  
Sbjct: 263 GDMAV------VSGHGVVSTPLIQSFYVGNYFTNLEAFSVGDHIIKLKDSSLIPDNEGNA 316

Query: 356 LIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEF 415
           +IDSG+ IT+LP  +YS L+   +              L  C+  +  ++  +P++   F
Sbjct: 317 VIDSGSTITQLPNDVYSQLETAVISMVKLKRVKDPTQQLSLCYK-TTLKKYEVPIITAHF 375

Query: 416 EGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLG 475
            G     V +     F++ +   +C A  S ++     + GN  Q+N  V YDT  + + 
Sbjct: 376 RG---ADVKLNAFNTFIQMNHEVMCFAFNSSAF--PWVVYGNIAQQNFLVGYDTLKNIIS 430

Query: 476 FAGEDCSSM 484
           F   +C+ +
Sbjct: 431 FKPTNCTKL 439


>gi|326490862|dbj|BAJ90098.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 447

 Score =  153 bits (387), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 114/360 (31%), Positives = 171/360 (47%), Gaps = 47/360 (13%)

Query: 150 VDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSS 209
           VDTGSD+ W QC+PC  C+ Q  P FD S S +   VLC    C AL       G     
Sbjct: 110 VDTGSDVVWTQCRPCFDCFTQPLPRFDTSASDTVHGVLCTDPICRALRPHACFLG----- 164

Query: 210 SPPDCNYFVSYGDGSYTRGELGREHL-----GLGKASVNDFIFGCGRNNKGLF-GGVSGL 263
               C Y V+YGD S T G+L ++       G GK +V D +FGCG+ N G F    +G+
Sbjct: 165 ---GCTYQVNYGDNSVTIGQLAKDSFTFDGKGGGKVTVPDLVFGCGQYNTGNFHSNETGI 221

Query: 264 MGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNST--PITYTNM 321
            G GR  LSL  Q        FSYC  +  ++ ++   + G  +   +     PI  T  
Sbjct: 222 AGFGRGPLSLPRQLGV---SSFSYCFTTIFESKSTPVFLGGAPADGLRAHATGPILSTPF 278

Query: 322 IPNPQLATFYILNLTGISIGGKQLQA--SGF-----AKGGILIDSGTVITRLPPSIYSAL 374
           +PN     +Y L+L GI++G  +L    S F       GG +IDSGT IT  P +++ +L
Sbjct: 279 LPN--HPEYYYLSLKGITVGKTRLAVPESAFVVKADGSGGTIIDSGTAITAFPRAVFRSL 336

Query: 375 KAEFLKQFSGFPSAPGFSILDT------CFNLSAYQE---VNIPLVKMEFEG-NAEMTVD 424
              F+ Q       P  S  DT      CF+  +  +   V +P + +  EG + E+  +
Sbjct: 337 WEAFVAQV----PLPHTSYNDTGEPTLQCFSTESVPDASKVPVPKMTLHLEGADWELPRE 392

Query: 425 VTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
                Y    D+ Q+C+ +  L+ +D+  +IGN+QQ+N  +++D   ++L      C  M
Sbjct: 393 NYMAEY---PDSDQLCVVV--LAGDDDRTMIGNFQQQNMHIVHDLAGNKLVIEPAQCDKM 447


>gi|242079451|ref|XP_002444494.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
 gi|241940844|gb|EES13989.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
          Length = 445

 Score =  153 bits (386), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 119/377 (31%), Positives = 185/377 (49%), Gaps = 43/377 (11%)

Query: 133 LNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNS 190
           L++  T+ +G   +  T+I+DTGSDL W QC+   +  +++ P++DP+ S S+    C+ 
Sbjct: 87  LHHTLTVSIGTPPQPRTLILDTGSDLIWTQCKLFDTRQHREKPLYDPAKSSSFAAAPCDG 146

Query: 191 STCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG---KASVNDFIF 247
             C    F T N   CS +    C Y  +YG  + T+GEL  E    G   + SV+   F
Sbjct: 147 RLCETGSFNTKN---CSRNK---CIYTYNYGSAT-TKGELASETFTFGEHRRVSVS-LDF 198

Query: 248 GCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNS 307
           GCG+   G   G SG++G+    LSLVSQ        FSYCL    D   +  +  G  +
Sbjct: 199 GCGKLTSGSLPGASGILGISPDRLSLVSQLQI---PRFSYCLTPFLDRNTTSHIFFGAMA 255

Query: 308 --SVFKNSTPITYTNMIPNPQLAT-FYILNLTGISIGGKQLQ--ASGFA-----KGGILI 357
             S ++ + PI  T+++ NP  +  +Y + L GIS+G K+L    S FA      GG  +
Sbjct: 256 DLSKYRTTGPIQTTSLVTNPDGSNYYYYVPLIGISVGTKRLNVPVSSFAIGRDGSGGTFV 315

Query: 358 DSGTVITRLPPSIYSALKAEFLKQFSGFP----SAPGFSILDTCFNL------SAYQEVN 407
           DSG     LP  +  ALK E + +    P    +  G+   + CF L      +    V 
Sbjct: 316 DSGDTTGMLPSVVMEALK-EAMVEAVKLPVVNATDHGYE-YELCFQLPRNGGGAVETAVQ 373

Query: 408 IPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIY 467
           +P +   F+G A M +      Y V+  A ++CL ++S +      IIGNYQQ+N  V++
Sbjct: 374 VPPLVYHFDGGAAMLLRRDS--YMVEVSAGRMCLVISSGA---RGAIIGNYQQQNMHVLF 428

Query: 468 DTKNSQLGFAGEDCSSM 484
           D +N +  FA   C+ +
Sbjct: 429 DVENHEFSFAPTQCNQI 445


>gi|302824729|ref|XP_002994005.1| hypothetical protein SELMODRAFT_431957 [Selaginella moellendorffii]
 gi|300138167|gb|EFJ04945.1| hypothetical protein SELMODRAFT_431957 [Selaginella moellendorffii]
          Length = 462

 Score =  153 bits (386), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 116/371 (31%), Positives = 177/371 (47%), Gaps = 33/371 (8%)

Query: 134 NYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSS 191
            Y  +I+LG  G+   +IVDTGS+LTW+QC PCK C    D ++D + S SY+ V CN+S
Sbjct: 99  EYYTSIKLGSPGQEAILIVDTGSELTWLQCLPCKVCAPSVDTIYDAARSASYRPVTCNNS 158

Query: 192 TCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHL------GLGKASVNDF 245
              +   + G    C+  S   C +   YGDGS++ G L  + L      G    +V DF
Sbjct: 159 QLCS-NSSQGTYAYCARGS--QCQFAAFYGDGSFSYGSLSTDTLIMETVVGGKPVTVQDF 215

Query: 246 IFGCGRNNKGLF-GGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILG 304
            FGC + +  L   G SG++GL    ++L  Q  + FG  FS+C P       S  ++  
Sbjct: 216 AFGCAQGDLELVPTGASGILGLNAGKMALPMQLGQRFGWKFSHCFPDRSSHLNSTGVVFF 275

Query: 305 GNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGGILI-DSGTVI 363
           GN+ +       T   +  +     FY + L G+SI   +L      +G ++I DSG+  
Sbjct: 276 GNAELPHEQVQYTSVALTNSELQRKFYHVALKGVSINSHELVF--LPRGSVVILDSGSSF 333

Query: 364 TRLPPSIYSALKAEFLKQFSGFPS-----APGFSILDTCFNLS--AYQEVN--IPLVKME 414
           +      +S L+  FLK     PS        F  L TCF +S     E++  +P + + 
Sbjct: 334 SSFVRPFHSQLREAFLKHRP--PSLKHLEGDSFGDLGTCFKVSNDDIDELHRTLPSLSLV 391

Query: 415 FEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYED----ETGIIGNYQQKNQRVIYDTK 470
           FE    + +   G++  V    + V +  A   +ED       +IGNYQQ+N  V YD +
Sbjct: 392 FEDGVTIGIPSIGVLLPVARFQNHVKMCFA---FEDGGPNPVNVIGNYQQQNLWVEYDIQ 448

Query: 471 NSQLGFAGEDC 481
            S++GFA   C
Sbjct: 449 RSRVGFARASC 459


>gi|357125298|ref|XP_003564331.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 524

 Score =  152 bits (385), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 124/375 (33%), Positives = 173/375 (46%), Gaps = 56/375 (14%)

Query: 147 TVIVDTGSDLTWVQCQPCKSCYNQ--QDPVFDPSISPSYKKVLCNSSTCHALEFATGNSG 204
           T+ +DT  D+ W+QC+PC        ++ +FDP+ S S   V C S  C AL    GN G
Sbjct: 166 TMAIDTTIDIPWIQCRPCPPPQCYPQRNALFDPTKSFSAAAVPCGSRACRAL----GNYG 221

Query: 205 ---------------VCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFG 248
                            S++S  DCNY V+Y DG  + G    + L +    S  +F FG
Sbjct: 222 NGCSNNSRRNKKKNKSKSNNSTGDCNYRVAYSDGRVSSGTYMTDILTISPGTSFLNFRFG 281

Query: 249 CGRNNKGLFGG-VSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNS 307
           C    +G F G  SG M LG    SL+SQT+  +G  FSYC+P      ASG L LGG  
Sbjct: 282 CSHGVRGSFSGETSGTMSLGGGRQSLLSQTARAYGNAFSYCVPKPS---ASGFLSLGGAI 338

Query: 308 SVFKN---------STPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA-KGGILI 357
           +   +         +TP+     I NP   T+Y++ L GI + G++L        GG L+
Sbjct: 339 NDGDSDSDSPSSFVTTPLMRNARIVNP---TYYVVRLQGIDVAGRRLNVPPVVFSGGTLM 395

Query: 358 DSGTVITRLPPSIYSALKAEFLKQFSGF-----------PSAPGFSILDTCFNLSAYQEV 406
           DS  V+T+LPP+ Y AL+  F     G+             A G  ILDTC++      V
Sbjct: 396 DSSAVVTQLPPTAYRALRLAFRNAMRGYRMNTRNGSTSSTPAGGEMILDTCYDFEGLDNV 455

Query: 407 NIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVI 466
            +P V + F G A + +D T  V        + CLA      + + G IGN QQ+   V+
Sbjct: 456 TVPTVSLVFFGGAVVDLDPTTAVMM------EGCLAFVPTPADFDLGFIGNVQQQTHEVL 509

Query: 467 YDTKNSQLGFAGEDC 481
           YD     +GF    C
Sbjct: 510 YDVGARNVGFRRGAC 524


>gi|302786560|ref|XP_002975051.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
 gi|300157210|gb|EFJ23836.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
          Length = 359

 Score =  152 bits (385), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 113/356 (31%), Positives = 170/356 (47%), Gaps = 39/356 (10%)

Query: 146 MTVIVDTGSDLTWVQCQPCKSCY--NQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNS 203
           +  ++DTGSDL W++C  C  C   +  + +F    S SYKK+ CNS+ C  +  A G  
Sbjct: 18  IPAMIDTGSDLVWLKCDNCDHCDLDHHGETIFFSDASSSYKKLPCNSTHCSGMSSA-GIG 76

Query: 204 GVCSSSSPPDCNYFVSYGDGSYTRGELGREHL-----GLG---KASVNDFIFGCGRNNKG 255
             C  +    C Y   YGDGS T G++G + +     G G   ++  + F+FGCGR  KG
Sbjct: 77  PRCEET----CKYKYEYGDGSRTSGDVGSDRISFRSHGAGEDHRSFFDGFLFGCGRKLKG 132

Query: 256 LFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKN--- 312
            +    GL+GLG+   SL+ Q  +  G  FSYCL S     ++ S +  G+S+  +    
Sbjct: 133 DWNFTQGLIGLGQKSHSLIQQLGDKLGYKFSYCLVSYDSPPSAKSFLFLGSSAALRGHDV 192

Query: 313 -STPITYTNMIPNPQLATFYILNLTGISIGGKQL----QASG-------FAKGGILIDSG 360
            STPI + + +      T Y ++L  I++GG  +    + SG       F     +IDSG
Sbjct: 193 VSTPILHGDHLDQ----TLYYVDLQSITVGGVPVVVYDKESGHNTSVGPFLANKTVIDSG 248

Query: 361 TVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAE 420
           T  T L P +Y A++    +Q    P+    + LD CFN S       P V   F    +
Sbjct: 249 TTYTLLTPPVYEAMRKSIEEQVI-LPTLGNSAGLDLCFNSSGDTSYGFPSVTFYFANQVQ 307

Query: 421 MTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGF 476
           + +    I      D   VCL++ S     +  IIGN QQ+N  ++YD   SQ+ F
Sbjct: 308 LVLPFENIFQVTSRDV--VCLSMDSSG--GDLSIIGNMQQQNFHILYDLVASQISF 359


>gi|357481205|ref|XP_003610888.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355512223|gb|AES93846.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 413

 Score =  152 bits (384), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 127/391 (32%), Positives = 175/391 (44%), Gaps = 26/391 (6%)

Query: 102 LQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELGGRNMTVIVDTGSDLTWVQC 161
           L  +  ++ S NI+D+    I    G  L  L YI T  +    ++  VDTGSDL WVQC
Sbjct: 37  LIRKSSHLSSNNIQDIVQAPINAYIGQYLMEL-YIGTPPI---KISGTVDTGSDLIWVQC 92

Query: 162 QPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYG 221
            PC  CYNQ +P+FDP  S +Y  + C+S  C+         G CS      C+Y   Y 
Sbjct: 93  VPCLGCYNQINPMFDPLKSSTYTNISCDSPLCYKPYI-----GECSPEK--RCDYTYGYA 145

Query: 222 DGSYTRGELGREHLGL----GKA-SVNDFIFGCGRNNKGLFGGVS-GLMGLGRSDLSLVS 275
           D S T+G L +E + L    GK  S+   +FGCG NN G F     GL+GLG    SLVS
Sbjct: 146 DSSLTKGVLAQETVTLTSNTGKPISLQGILFGCGHNNTGNFNDHEMGLIGLGGGPTSLVS 205

Query: 276 QTSEIFGG-LFSYCL-PSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYIL 333
           Q   +FGG  FS CL P   D   S  +  G  S V      +  T ++   Q  T Y +
Sbjct: 206 QIGPLFGGKKFSQCLVPFLTDITISSQMSFGKGSEVLGEG--VVTTPLVQREQDMTSYYV 263

Query: 334 NLTGISIGGKQLQA-SGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFS 392
            L GIS+    L   S   KG +L+DSGT    LP  +Y  +  E   +    P     S
Sbjct: 264 TLLGISVEDTYLPMNSTIEKGNMLVDSGTPPNILPQQLYDRVYVEVKNKVPLEPITDDPS 323

Query: 393 I-LDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDE 451
           +    C+       +  P +   FEG   +   +   +          CLA+ + +  D 
Sbjct: 324 LGPQLCYRTQT--NLKGPTLTYHFEGANLLLTPIQTFIPPTPETKGVFCLAITNCANSDP 381

Query: 452 TGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
            GI GN+ Q N  + +D     + F   DC+
Sbjct: 382 -GIYGNFAQTNYLIGFDLDRQIVSFKPTDCT 411


>gi|357158691|ref|XP_003578210.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 459

 Score =  152 bits (384), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 118/374 (31%), Positives = 175/374 (46%), Gaps = 41/374 (10%)

Query: 133 LNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNS 190
           L Y+  + +G   + ++ ++DTGSDL W QC PC SC  Q DP+F P  S SY+ + C  
Sbjct: 102 LEYLVDLAVGTPPQPVSALLDTGSDLIWTQCAPCASCLPQPDPIFSPGASSSYEPMRCAG 161

Query: 191 STCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVN------- 243
             C+ +          S   P  C Y  SYGDG+ TRG    E      +S         
Sbjct: 162 ELCNDILHH-------SCQRPDTCTYRYSYGDGTTTRGVYATERFTFSSSSSGGETTKLS 214

Query: 244 -DFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLI 302
               FGCG  NKG     SG++G GR+ LSLVSQ +      FSYCL +   +G   +L+
Sbjct: 215 APLGFGCGTMNKGSLNNGSGIVGFGRAPLSLVSQLAIR---RFSYCL-TPYASGRKSTLL 270

Query: 303 LGG-NSSVFKNSTP-ITYTNMIPNPQLATFYILNLTGISIGGKQLQ--ASGFA-----KG 353
            G     V+  +T  +  T ++ + Q  TFY +  TG+++G ++L+   S FA      G
Sbjct: 271 FGSLRGGVYDAATATVQTTRLLRSRQNPTFYYVPFTGVTVGARRLRIPISAFALRPDGSG 330

Query: 354 GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILD--TCFNLSAY---QEVNI 408
           G ++DSGT +T  P  + + +   F  Q     +A G S  D   CF  +A    +   +
Sbjct: 331 GAIVDSGTALTLFPAPVLAEVVRAFRSQLRLPFAANGSSGPDDGVCFAAAASRVPRPAVV 390

Query: 409 PLVKMEFEGNAEMTVDVTGIVYFVKSDAS-QVCLALASLSYEDETGIIGNYQQKNQRVIY 467
           P +    +G     +D+    Y +       +CL LA     D    IGN+ Q++ RV+Y
Sbjct: 391 PRMVFHLQG---ADLDLPRRNYVLDDQRKGNLCLLLADSG--DSGTTIGNFVQQDMRVLY 445

Query: 468 DTKNSQLGFAGEDC 481
           D +   L FA   C
Sbjct: 446 DLEADTLSFAPAQC 459


>gi|302794412|ref|XP_002978970.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
 gi|300153288|gb|EFJ19927.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
          Length = 462

 Score =  152 bits (384), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 116/371 (31%), Positives = 177/371 (47%), Gaps = 33/371 (8%)

Query: 134 NYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSS 191
            Y  +I+LG  G+   +IVDTGS+LTW++C PCK C    D ++D + S SYK V CN+S
Sbjct: 99  EYYTSIKLGSPGQEAILIVDTGSELTWLKCLPCKVCAPSVDTIYDAARSVSYKPVTCNNS 158

Query: 192 TCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHL------GLGKASVNDF 245
              +   + G    C+  S   C +   YGDGS++ G L  + L      G    +V DF
Sbjct: 159 QLCS-NSSQGTYAYCARGS--QCQFAAFYGDGSFSYGSLSTDTLIMETVVGGKPVTVQDF 215

Query: 246 IFGCGRNNKGLF-GGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILG 304
            FGC + +  L   G SG++GL    ++L  Q  + FG  FS+C P       S  ++  
Sbjct: 216 AFGCAQGDLELVPTGASGILGLNAGKMALPMQLGQRFGWKFSHCFPDRSSHLNSTGVVFF 275

Query: 305 GNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGGILI-DSGTVI 363
           GN+ +       T   +  +     FY + L G+SI   +L      +G ++I DSG+  
Sbjct: 276 GNAELPHEQVQYTSVALTNSELQRKFYHVALKGVSINSHELVL--LPRGSVVILDSGSSF 333

Query: 364 TRLPPSIYSALKAEFLKQFSGFPS-----APGFSILDTCFNLS--AYQEVN--IPLVKME 414
           +      +S L+  FLK     PS        F  L TCF +S     E++  +P + + 
Sbjct: 334 SSFVRPFHSQLREAFLKHRP--PSLKHLEGDSFGDLGTCFKVSNDDIDELHRTLPSLSLV 391

Query: 415 FEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYED----ETGIIGNYQQKNQRVIYDTK 470
           FE    + +   G++  V    + V +  A   +ED       +IGNYQQ+N  V YD +
Sbjct: 392 FEDGVTIGIPSIGVLLPVARYQNHVKMCFA---FEDGGPNPVNVIGNYQQQNLWVEYDIQ 448

Query: 471 NSQLGFAGEDC 481
            S++GFA   C
Sbjct: 449 RSRVGFARASC 459


>gi|222637180|gb|EEE67312.1| hypothetical protein OsJ_24552 [Oryza sativa Japonica Group]
          Length = 420

 Score =  152 bits (384), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 116/368 (31%), Positives = 167/368 (45%), Gaps = 54/368 (14%)

Query: 141 LGGRNM-----------TVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCN 189
           +GG NM           +V+ DTGSDL W QC PC  C+ Q  P F P+ S ++ K+ C 
Sbjct: 83  VGGYNMNISVGTPLLTFSVVADTGSDLIWTQCAPCTKCFQQPAPPFQPASSSTFSKLPCT 142

Query: 190 SSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGC 249
           SS C   +F   +   C+++    C Y   YG G YT G L  E L +G AS     FGC
Sbjct: 143 SSFC---QFLPNSIRTCNATG---CVYNYKYGSG-YTAGYLATETLKVGDASFPSVAFGC 195

Query: 250 GRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSV 309
              N           GLG+ DL +         G FSYCL S   AGAS   IL G+ + 
Sbjct: 196 STEN-----------GLGQLDLGV---------GRFSYCLRSGSAAGASP--ILFGSLAN 233

Query: 310 FKNSTPITYTNMIPNPQL-ATFYILNLTGISIGGKQLQAS----GFAK----GGILIDSG 360
             +   +  T  + NP +  ++Y +NLTGI++G   L  +    GF +    GG ++DSG
Sbjct: 234 LTDGN-VQSTPFVNNPAVHPSYYYVNLTGITVGETDLPVTTSTFGFTQNGLGGGTIVDSG 292

Query: 361 TVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCF--NLSAYQEVNIPLVKMEFEGN 418
           T +T L    Y  +K  FL Q +   +  G   LD CF         + +P + + F+G 
Sbjct: 293 TTLTYLAKDGYEMVKQAFLSQTADVTTVNGTRGLDLCFKSTGGGGGGIAVPSLVLRFDGG 352

Query: 419 AEMTVDV--TGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGF 476
           AE  V     G+    +   +  CL +     +    +IGN  Q +  ++YD       F
Sbjct: 353 AEYAVPTYFAGVETDSQGSVTVACLMMLPAKGDQPMSVIGNVMQMDMHLLYDLDGGIFSF 412

Query: 477 AGEDCSSM 484
           A  DC+ +
Sbjct: 413 APADCAKV 420


>gi|296085638|emb|CBI29432.3| unnamed protein product [Vitis vinifera]
          Length = 337

 Score =  151 bits (382), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 86/218 (39%), Positives = 128/218 (58%), Gaps = 14/218 (6%)

Query: 92  LILDNLHVQYLQSRIKN---------MISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG 142
           L  D+  V+ L SR+           +   +I+   +  +PL  G  + + NY   +  G
Sbjct: 66  LAWDDARVKTLNSRLTRKDTRFPKSVLTKKDIRFPKSVSVPLNPGASIGSGNYYVKVGFG 125

Query: 143 --GRNMTVIVDTGSDLTWVQCQPCK-SCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFA 199
              R  ++IVDTGS L+W+QC+PC   C+ Q DP+FDPS S +YK + C SS C +L  A
Sbjct: 126 SPARYYSMIVDTGSSLSWLQCKPCVVYCHVQADPLFDPSASKTYKSLSCTSSQCSSLVDA 185

Query: 200 TGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCGRNNKGLFG 258
           T N+ +C +SS   C Y  SYGD SY+ G L ++ L L  + ++  F++GCG+++ GLFG
Sbjct: 186 TLNNPLCETSSN-VCVYTASYGDSSYSMGYLSQDLLTLAPSQTLPGFVYGCGQDSDGLFG 244

Query: 259 GVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAG 296
             +G++GLGR+ LS++ Q S  FG  FSYCLP+    G
Sbjct: 245 RAAGILGLGRNKLSMLGQVSSKFGYAFSYCLPTRGGGG 282


>gi|223946005|gb|ACN27086.1| unknown [Zea mays]
          Length = 336

 Score =  151 bits (382), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 119/352 (33%), Positives = 156/352 (44%), Gaps = 39/352 (11%)

Query: 150 VDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSS 209
           +DTGSDL W QC PC  C +Q  P FD   S +Y+ + C SS C +L     +S  C   
Sbjct: 1   MDTGSDLIWTQCAPCLLCADQPTPYFDVKKSATYRALPCRSSRCASL-----SSPSCFKK 55

Query: 210 SPPDCNYFVSYGDGSYTRGELGREHLGLG-----KASVNDFIFGCGRNNKGLFGGVSGLM 264
               C Y   YGD + T G L  E    G     K    +  FGCG  N G     SG++
Sbjct: 56  M---CVYQYYYGDTASTAGVLANETFTFGAANSTKVRATNIAFGCGSLNAGDLANSSGMV 112

Query: 265 GLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNST----PITYTN 320
           G GR  LSLVSQ        FSYCL S   A  S  L  G  +++   +T    P+  T 
Sbjct: 113 GFGRGPLSLVSQLGP---SRFSYCLTSYLSATPS-RLYFGVYANLSSTNTSSGSPVQSTP 168

Query: 321 MIPNPQLATFYILNLTGISIGGKQLQASGFA-------KGGILIDSGTVITRLPPSIYSA 373
            + NP L   Y L+L  IS+G K L              GG++IDSGT IT L    Y A
Sbjct: 169 FVINPALPNMYFLSLKAISLGTKLLPIDPLVFAINDDGTGGVIIDSGTSITWLQQDAYEA 228

Query: 374 LKAEFLKQFSGFPSAPGFSI-LDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFV 432
           ++   +      P+     I LDTCF       V + +  + F  ++     +      +
Sbjct: 229 VRRGLVSAIP-LPAMNDTDIGLDTCFQWPPPPNVTVTVPDLVFHFDSANMTLLPENYMLI 287

Query: 433 KSDASQVCLALASLSYEDETG---IIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
            S    +CL +A       TG   IIGNYQQ+N  ++YD  NS L F    C
Sbjct: 288 ASTTGYLCLVMA------PTGVGTIIGNYQQQNLHLLYDIGNSFLSFVPAPC 333


>gi|302784853|ref|XP_002974198.1| hypothetical protein SELMODRAFT_54030 [Selaginella moellendorffii]
 gi|300157796|gb|EFJ24420.1| hypothetical protein SELMODRAFT_54030 [Selaginella moellendorffii]
          Length = 359

 Score =  151 bits (382), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 113/356 (31%), Positives = 169/356 (47%), Gaps = 39/356 (10%)

Query: 146 MTVIVDTGSDLTWVQCQPCKSCY--NQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNS 203
           +  ++DTGSDL W++C  C  C   +  + +F    S SYKK+ CNS+ C  +  A G  
Sbjct: 18  IPAMIDTGSDLVWLKCDNCDHCDLDHHGETIFFSDASSSYKKLPCNSTHCSGMSSA-GIG 76

Query: 204 GVCSSSSPPDCNYFVSYGDGSYTRGELGREHL-----GLG---KASVNDFIFGCGRNNKG 255
             C  +    C Y   YGDGS T G++G + +     G G   ++  + F+FGC R  KG
Sbjct: 77  PRCEET----CKYKYEYGDGSRTSGDVGSDRISFRSHGAGEDHRSFFDGFLFGCARKLKG 132

Query: 256 LFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKN--- 312
            +    GL+GLG+   SL+ Q  +  G  FSYCL S     ++ S +  G+S+  +    
Sbjct: 133 DWNFTQGLIGLGQKSHSLIQQLGDKLGYKFSYCLVSYDSPPSAKSFLFLGSSAALRGHDV 192

Query: 313 -STPITYTNMIPNPQLATFYILNLTGISIGGKQL----QASG-------FAKGGILIDSG 360
            STPI + + +      T Y ++L  I+IGG  +    + SG       F     +IDSG
Sbjct: 193 VSTPILHGDHLDQ----TLYYVDLQSITIGGVPVVVYDKESGHNTSVGPFLANKTVIDSG 248

Query: 361 TVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAE 420
           T  T L P +Y A++    +Q    P+    + LD CFN S       P V   F    +
Sbjct: 249 TTYTLLTPPVYEAMRKSIEEQVI-LPTLGNSAGLDLCFNSSGDTSYGFPSVTFYFANQVQ 307

Query: 421 MTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGF 476
           + +    I      D   VCL++ S     +  IIGN QQ+N  ++YD   SQ+ F
Sbjct: 308 LVLPFENIFQVTSRDV--VCLSMDSSG--GDLSIIGNMQQQNFHILYDLVASQISF 359


>gi|222632750|gb|EEE64882.1| hypothetical protein OsJ_19741 [Oryza sativa Japonica Group]
          Length = 456

 Score =  151 bits (382), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 121/377 (32%), Positives = 179/377 (47%), Gaps = 48/377 (12%)

Query: 123 PLTSGIRLQTLNYIATIELGGRNMT--VIVDTGSDLTW--VQCQP--CKSCYNQQDPVFD 176
           PL SG+   T  Y A + +G    T  +++DTGSD+ W  V+  P   ++          
Sbjct: 110 PLLSGLPQGTGEYFAQVGVGTPATTALMVLDTGSDVVWAPVRALPPLLRAVRQGSSTGAA 169

Query: 177 PSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLG 236
           P+ +P +    C +  C  L+ A  +    S      C Y V+YGDGS T G+   E L 
Sbjct: 170 PAPTPRWN---CVAPICRRLDSAGCDRRRNS------CLYQVAYGDGSVTAGDFASETLT 220

Query: 237 LGK-ASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDA 295
             + A V     GCG +N+GLF   SGL+GLGR  LS  SQ +  FG  FSYCL     +
Sbjct: 221 FARGARVQRVAIGCGHDNEGLFIAASGLLGLGRGRLSFPSQIARSFGRSFSYCLVDRTSS 280

Query: 296 GASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQA-------- 347
             +      G +                 P++ATFY ++L G S+GG +++         
Sbjct: 281 RRARPSRRWGGT-----------------PRMATFYYVHLLGFSVGGARVKGVSQSDLRL 323

Query: 348 -SGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAP-GFSILDTCFNLSAYQE 405
                +GG+++DSGT +TRL   +Y A++  F     G   +P GFS+ DTC+NLS  + 
Sbjct: 324 NPTTGRGGVILDSGTSVTRLARPVYEAVRDAFRAAAVGLRVSPGGFSLFDTCYNLSGRRV 383

Query: 406 VNIPLVKMEFEGNAEMTVDVTGIVYFVKSDAS-QVCLALASLSYEDETGIIGNYQQKNQR 464
           V +P V M   G A + +      Y +  D S   C A+A    +    IIGN QQ+  R
Sbjct: 384 VKVPTVSMHLAGGASVALPPEN--YLIPVDTSGTFCFAMAGT--DGGVSIIGNIQQQGFR 439

Query: 465 VIYDTKNSQLGFAGEDC 481
           V++D    ++GF  + C
Sbjct: 440 VVFDGDAQRVGFVPKSC 456


>gi|293332561|ref|NP_001170100.1| uncharacterized protein LOC100384018 precursor [Zea mays]
 gi|224033441|gb|ACN35796.1| unknown [Zea mays]
 gi|413944035|gb|AFW76684.1| hypothetical protein ZEAMMB73_746438 [Zea mays]
          Length = 456

 Score =  151 bits (381), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 103/319 (32%), Positives = 149/319 (46%), Gaps = 35/319 (10%)

Query: 117 VSNTEIPLTSGIR---------LQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCK 165
           +S+ E P+ + +R         + T  Y+  + +G   R + + +DTGSDL W QC PC+
Sbjct: 59  LSSHERPVRARVRAGLVAAAGGIATNEYLVHLAVGTPPRPVALTLDTGSDLVWTQCAPCR 118

Query: 166 SCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSY 225
            C++Q  P+ DP+ S +Y  + C +  C AL F +     C   S   C Y   YGD S 
Sbjct: 119 DCFDQGIPLLDPAASSTYAALPCGAPRCRALPFTS-----CGGRS---CVYVYHYGDKSV 170

Query: 226 TRGELGREHLGLGK----------ASVNDFIFGCGRNNKGLF-GGVSGLMGLGRSDLSLV 274
           T G++  +    G            +     FGCG  NKG+F    +G+ G GR   SL 
Sbjct: 171 TVGKIATDRFTFGDNGRRNGDGSLPATRRLTFGCGHFNKGVFQSNETGIAGFGRGRWSLP 230

Query: 275 SQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVF--KNSTPITYTNMIPNPQLATFYI 332
           SQ +      FSYC  S  D+ +S   + G  ++++   +S  +  T +  NP   + Y 
Sbjct: 231 SQLNATS---FSYCFTSMFDSKSSIVTLGGAPAALYSHAHSGEVRTTPLFKNPSQPSLYF 287

Query: 333 LNLTGISIGGKQLQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFS 392
           L+L GIS+G  +L          +IDSG  IT LP  +Y A+KAEF  Q    PS    S
Sbjct: 288 LSLKGISVGKTRLPVPETKFRSTIIDSGASITTLPEEVYEAVKAEFAAQVGLPPSGVEGS 347

Query: 393 ILDTCFNLSAYQEVNIPLV 411
            LD CF L        P V
Sbjct: 348 ALDVCFALPVSALWRRPAV 366


>gi|30686482|ref|NP_850251.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|122215044|sp|Q3EBM5.1|ASPR1_ARATH RecName: Full=Probable aspartic protease At2g35615; Flags:
           Precursor
 gi|330254036|gb|AEC09130.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 447

 Score =  151 bits (381), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 140/443 (31%), Positives = 216/443 (48%), Gaps = 49/443 (11%)

Query: 67  ITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTS 126
            ++EL H++     I  +N Q     + D L+  +L+S  ++    +   +S T+  L S
Sbjct: 26  FSVELIHRDSPLSPI--YNPQIT---VTDRLNAAFLRSVSRSRRFNH--QLSQTD--LQS 76

Query: 127 GIRLQTLNYIATIELGGRNMTV--IVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYK 184
           G+      +  +I +G   + V  I DTGSDLTWVQC+PC+ CY +  P+FD   S +YK
Sbjct: 77  GLIGADGEFFMSITIGTPPIKVFAIADTGSDLTWVQCKPCQQCYKENGPIFDKKKSSTYK 136

Query: 185 KVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVND 244
              C+S  C AL  ++   G   S++   C Y  SYGD S+++G++  E + +  AS + 
Sbjct: 137 SEPCDSRNCQAL--SSTERGCDESNNI--CKYRYSYGDQSFSKGDVATETVSIDSASGSP 192

Query: 245 F-----IFGCGRNNKGLFGGV-SGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGAS 298
                 +FGCG NN G F    SG++GLG   LSL+SQ        FSYCL S + A  +
Sbjct: 193 VSFPGTVFGCGYNNGGTFDETGSGIIGLGGGHLSLISQLGSSISKKFSYCL-SHKSATTN 251

Query: 299 GSLI--LGGNS--SVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA--- 351
           G+ +  LG NS  S     + +  T ++    L T+Y L L  IS+G K++  +G +   
Sbjct: 252 GTSVINLGTNSIPSSLSKDSGVVSTPLVDKEPL-TYYYLTLEAISVGKKKIPYTGSSYNP 310

Query: 352 ---------KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFP--SAPGFSILDTCFNL 400
                     G I+IDSGT +T L    +    +   +  +G    S P   +L  CF  
Sbjct: 311 NDDGILSETSGNIIIDSGTTLTLLEAGFFDKFSSAVEESVTGAKRVSDPQ-GLLSHCFK- 368

Query: 401 SAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQ 460
           S   E+ +P + + F G     V ++ I  FVK     VCL++   +   E  I GN+ Q
Sbjct: 369 SGSAEIGLPEITVHFTG---ADVRLSPINAFVKLSEDMVCLSMVPTT---EVAIYGNFAQ 422

Query: 461 KNQRVIYDTKNSQLGFAGEDCSS 483
            +  V YD +   + F   DCS+
Sbjct: 423 MDFLVGYDLETRTVSFQHMDCSA 445


>gi|242050428|ref|XP_002462958.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
 gi|241926335|gb|EER99479.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
          Length = 460

 Score =  151 bits (381), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 118/379 (31%), Positives = 175/379 (46%), Gaps = 33/379 (8%)

Query: 128 IRLQTLNYIATIELGGR--NMTVIVDTGSDLTWVQCQ-PCKSCYNQQDPVFDPSISPSYK 184
           +   T  Y+    +G     ++ ++DTGSDL W QC  PC+ C+ Q  P++ P+ S +Y 
Sbjct: 93  VHASTATYLVDFAIGTPPLALSAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSVTYA 152

Query: 185 KVLCNSSTCHALE-----FATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGK 239
            V C S  C AL           S    +     C Y+ SYGDGS T G L  E    G 
Sbjct: 153 NVSCGSRLCDALPSLRPSSRCSASASAPAPERGGCTYYYSYGDGSSTDGVLATETFTFGA 212

Query: 240 AS-VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGAS 298
            + V+D  FGCG +N G     SGL+G+GR  LSLVSQ        FSYC     D   S
Sbjct: 213 GTTVHDLAFGCGTDNLGGTDNSSGLVGMGRGPLSLVSQLGVT---KFSYCFTPFNDTTTS 269

Query: 299 GSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGK---------QLQASG 349
             L LG ++S+   +    +      P+ +++Y L+L GI++G           +L ASG
Sbjct: 270 SPLFLGSSASLSPAAKSTPFVPSPSGPRRSSYYYLSLEGITVGDTLLPIDPAVFRLTASG 329

Query: 350 FAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI-LDTCFNLS---AYQE 405
             +GG++IDSGT  T L    +  L      + +  P A G  + L  CF        + 
Sbjct: 330 --RGGLIIDSGTTFTALEERAFVVLARAVAARVA-LPLASGAHLGLSVCFAAPQGRGPEA 386

Query: 406 VNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRV 465
           V++P + + F+G A+M +  +  V   +  A   CL + S        ++G+ QQ+N  V
Sbjct: 387 VDVPRLVLHFDG-ADMELPRSSAVVEDRV-AGVACLGIVS---ARGMSVLGSMQQQNMHV 441

Query: 466 IYDTKNSQLGFAGEDCSSM 484
            YD     L F   +C  +
Sbjct: 442 RYDVGRDVLSFEPANCGEL 460


>gi|297810815|ref|XP_002873291.1| hypothetical protein ARALYDRAFT_487523 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297319128|gb|EFH49550.1| hypothetical protein ARALYDRAFT_487523 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 439

 Score =  150 bits (380), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 130/419 (31%), Positives = 205/419 (48%), Gaps = 52/419 (12%)

Query: 82  VDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIR-LQTLNYIATIE 140
           + W  +    L  D   +QYL S    +++G       + +P+ SG + LQ+  YI  + 
Sbjct: 55  LSWEARVLQTLAQDQARLQYLSS----LVAGR------SVVPIASGRQMLQSTTYIVKVL 104

Query: 141 LG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEF 198
           +G   + + + +DT SD+ W+ C  C  C    +  F P+ S S+K V C++  C  +  
Sbjct: 105 IGTPAQPLLLAMDTSSDVAWIPCSGCVGC--PSNTAFSPAKSTSFKNVSCSAPQCKQVP- 161

Query: 199 ATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFG 258
               +  C + +   C++ ++YG  S     L ++ + L    +  F FGC   NK   G
Sbjct: 162 ----NPACGARA---CSFNLTYGSSSIA-ANLSQDTIRLAADPIKAFTFGCV--NKVAGG 211

Query: 259 GV----SGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNST 314
           G      GL+GLGR  LSL+SQ   ++   FSYCLPS +    SGSL LG  S   +   
Sbjct: 212 GTIPPPQGLLGLGRGPLSLMSQAQSVYKSTFSYCLPSFRSLTFSGSLRLGPTSQPQR--- 268

Query: 315 PITYTNMIPNPQLATFYILNLTGISIGGK--QLQASGFA-----KGGILIDSGTVITRLP 367
            + YT ++ NP+ ++ Y +NL  I +G K   L  +  A       G + DSGTV TRL 
Sbjct: 269 -VKYTQLLRNPRRSSLYYVNLVAIRVGRKVVDLPPAAIAFNPSTGAGTIFDSGTVYTRLA 327

Query: 368 PSIYSALKAEFLKQFSGFPSAPGFSI--LDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDV 425
             +Y A++ EF K+    P+A   S+   DTC++     +V +P +   F+G   MT+  
Sbjct: 328 KPVYEAVRNEFRKRVKP-PTAVVTSLGGFDTCYS----GQVKVPTITFMFKG-VNMTMPA 381

Query: 426 TGIVYFVKSDASQVCLALASL--SYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
             ++    +  S  CLA+AS   +      +I + QQ+N RV+ D  N +LG A E CS
Sbjct: 382 DNLMLH-STAGSTSCLAMASAPENVNSVVNVIASMQQQNHRVLIDVPNGRLGLARERCS 439


>gi|302781668|ref|XP_002972608.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
 gi|300160075|gb|EFJ26694.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
          Length = 430

 Score =  150 bits (379), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 115/397 (28%), Positives = 176/397 (44%), Gaps = 43/397 (10%)

Query: 120 TEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQ---------PCKSCY 168
            E P+ SG  L    Y+ ++  G   + + +I DTGSDL W+QC          P K+C 
Sbjct: 39  AESPMESGAFLGLGQYLVSMAFGTPPQEVLLIADTGSDLIWLQCSTTAAPPAFCPKKAC- 97

Query: 169 NQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRG 228
             + P F  S S +   V C+++ C  +    G+   CS ++P  C Y   Y DGS T G
Sbjct: 98  -SRRPAFVASKSATLSVVPCSAAQCLLVPAPRGHGPSCSPAAPVPCGYAYDYADGSSTTG 156

Query: 229 ELGREHLGL-----GKASVNDFIFGCG-RNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFG 282
            L R+   +     G A+V    FGCG RN  G F G  G++GLG+  LS  +Q+  +F 
Sbjct: 157 FLARDTATISNGTSGGAAVRGVAFGCGTRNQGGSFSGTGGVIGLGQGQLSFPAQSGSLFA 216

Query: 283 GLFSYCLPSTQDA--GASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISI 340
             FSYCL   +    G S S +  G     +      YT ++ NP   TFY + +  I +
Sbjct: 217 QTFSYCLLDLEGGRRGRSSSFLFLGRP---ERRAAFAYTPLVSNPLAPTFYYVGVVAIRV 273

Query: 341 GGKQLQASG-------FAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPG--- 390
           G + L   G          GG +IDSG+ +T L    Y  L + F       P  P    
Sbjct: 274 GNRVLPVPGSEWAIDVLGNGGTVIDSGSTLTYLRLGAYLHLVSAFAASVH-LPRIPSSAT 332

Query: 391 -FSILDTCFNLSAYQEV-----NIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALA 444
            F  L+ C+N+S+   +       P + ++F     + +     +  V  D    CLA+ 
Sbjct: 333 FFQGLELCYNVSSSSSLAPANGGFPRLTIDFAQGLSLELPTGNYLVDVADDVK--CLAIR 390

Query: 445 SLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
                    ++GN  Q+   V +D  ++++GFA  +C
Sbjct: 391 PTLSPFAFNVLGNLMQQGYHVEFDRASARIGFARTEC 427


>gi|242041115|ref|XP_002467952.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
 gi|241921806|gb|EER94950.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
          Length = 774

 Score =  150 bits (378), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 116/358 (32%), Positives = 177/358 (49%), Gaps = 34/358 (9%)

Query: 148 VIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCS 207
           +I+DTGSDL W QC+PC  C+++     DPS S ++  + C+S  C  L +++       
Sbjct: 430 LILDTGSDLVWTQCRPCPVCFSRALGPLDPSNSSTFDVLPCSSPVCDNLTWSSCGKHNWG 489

Query: 208 SSSPPDCNYFVSYGDGSYTRGELGREHL------GLGKASVNDFIFGCGRNNKGLF-GGV 260
           + +   C Y  +Y DGS T G L  E        G G+A+V D  FGCG  N G+F    
Sbjct: 490 NQT---CVYVYAYADGSITTGHLDAETFTFAAADGTGQATVPDLAFGCGLFNNGIFTSNE 546

Query: 261 SGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNST-PITYT 319
           +G+ G GR  LSL SQ        FS+C  +   +  S S++LG  ++++ ++   +  T
Sbjct: 547 TGIAGFGRGALSLPSQLKV---DNFSHCFTAITGSEPS-SVLLGLPANLYSDADGAVQST 602

Query: 320 NMIPNPQLATFYILNLTGISIGGKQL--QASGFA-----KGGILIDSGTVITRLPPSIYS 372
            ++ N      Y L+L GI++G  +L    S FA      GG +IDSGT +T LP   Y 
Sbjct: 603 PLVQNFSSLRAYYLSLKGITVGSTRLPIPESTFALKQDGTGGTIIDSGTGMTTLPQDAYK 662

Query: 373 ALKAEFLKQFS-GFPSAPGFSILDTCFNLSAYQEV--NIPLVKMEFEGNAEMTVDVTGIV 429
            +   F  Q      +A   S+   CF+ S  +    ++P + + FEG    T+D+    
Sbjct: 663 LVHDAFTAQVRLPVDNATSSSLSRLCFSFSVPRRAKPDVPKLVLHFEG---ATLDLPREN 719

Query: 430 Y---FVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
           Y   F  +  S  CLA+ +    D+  IIGNYQQ+N  V+YD   + L F    C+ +
Sbjct: 720 YMFEFEDAGGSVTCLAINA---GDDLTIIGNYQQQNLHVLYDLVRNMLSFVPAQCNRL 774


>gi|30683732|ref|NP_180371.2| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|28392898|gb|AAO41885.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|56382011|gb|AAV85724.1| At2g28040 [Arabidopsis thaliana]
 gi|330252978|gb|AEC08072.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 395

 Score =  150 bits (378), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 113/367 (30%), Positives = 176/367 (47%), Gaps = 45/367 (12%)

Query: 130 LQTLNYIATIELGGRNMTV--IVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVL 187
             T  Y+  +++G     +  ++DTGS+  W QC PC  CYNQ  P+FDPS S ++K++ 
Sbjct: 60  FDTYEYLMKLQIGTPPFEIEAVLDTGSEHIWTQCLPCVHCYNQTAPIFDPSKSSTFKEIR 119

Query: 188 CNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDF-- 245
           C++   H+                  C Y + YG  SYT+G L  E + +   S   F  
Sbjct: 120 CDTHD-HS------------------CPYELVYGGKSYTKGTLVTETVTIHSTSGQPFVM 160

Query: 246 ---IFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLI 302
              I GCGRNN G   G +G++GL R   SL++Q    + GL SYC      AG   S I
Sbjct: 161 PETIIGCGRNNSGFKPGFAGVVGLDRGPKSLITQMGGEYPGLMSYCF-----AGKGTSKI 215

Query: 303 LGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA----KGGILID 358
             G +++      ++ T  +   +   FY LNL  +S+G  +++  G      KG I+ID
Sbjct: 216 NFGANAIVAGDGVVSTTVFVKTAKPG-FYYLNLDAVSVGNTRIETVGTPFHALKGNIVID 274

Query: 359 SGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNI-PLVKMEFEG 417
           SG+ +T  P S Y  L  + ++Q       P   IL  C+     + ++I P++ M F G
Sbjct: 275 SGSTLTYFPES-YCNLVRKAVEQVVTAVRFPRSDIL--CY---YSKTIDIFPVITMHFSG 328

Query: 418 NAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFA 477
            A++ +D   + Y   +     CLA+   S  +E  I GN  Q N  V YD+ +  + F 
Sbjct: 329 GADLVLDKYNM-YVASNTGGVFCLAIICNSPIEE-AIFGNRAQNNFLVGYDSSSLLVSFK 386

Query: 478 GEDCSSM 484
             +CS++
Sbjct: 387 PTNCSAL 393


>gi|79315693|ref|NP_001030891.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332646353|gb|AEE79874.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 499

 Score =  149 bits (377), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 126/432 (29%), Positives = 190/432 (43%), Gaps = 84/432 (19%)

Query: 94  LDNLHVQYLQSRIKNMIS----GNIKDVSNTEIPLTSGIRLQTLNYIATIELG------- 142
           +  LH + L+   +N +S     N K+V  T  P+ S +  Q    +AT+E G       
Sbjct: 111 IQTLHKRVLEKNNQNTVSQKQKKNDKEVVTT-TPVASSVEEQAGQLVATLESGMTLGSGE 169

Query: 143 ----------GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
                      ++ ++I+DTGSDL W+QC PC  C+ Q D                N S 
Sbjct: 170 YFMDVLVGSPPKHFSLILDTGSDLNWIQCLPCYDCFQQND----------------NQS- 212

Query: 193 CHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS---------VN 243
                                C Y+  YGD S T G+   E   +   +         V 
Sbjct: 213 ---------------------CPYYYWYGDSSNTTGDFAVETFTVNLTTNGGSSELYNVE 251

Query: 244 DFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL-PSTQDAGASGSLI 302
           + +FGCG  N+GLF G +GL+GLGR  LS  SQ   ++G  FSYCL     D   S  LI
Sbjct: 252 NMMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTNVSSKLI 311

Query: 303 LGGNSSVFKNSTPITYTNMIPNPQ--LATFYILNLTGISIGGKQL-------QASGFAKG 353
            G +  +  +   + +T+ +   +  + TFY + +  I + G+ L         S    G
Sbjct: 312 FGEDKDLLSHPN-LNFTSFVAGKENLVDTFYYVQIKSILVAGEVLNIPEETWNISSDGAG 370

Query: 354 GILIDSGTVITRLPPSIYSALKAEFLKQFSG-FPSAPGFSILDTCFNLSAYQEVNIPLVK 412
           G +IDSGT ++      Y  +K +  ++  G +P    F ILD CFN+S    V +P + 
Sbjct: 371 GTIIDSGTTLSYFAEPAYEFIKNKIAEKAKGKYPVYRDFPILDPCFNVSGIHNVQLPELG 430

Query: 413 MEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNS 472
           + F   A           ++  D   VCLA+     +    IIGNYQQ+N  ++YDTK S
Sbjct: 431 IAFADGAVWNFPTENSFIWLNEDL--VCLAMLGTP-KSAFSIIGNYQQQNFHILYDTKRS 487

Query: 473 QLGFAGEDCSSM 484
           +LG+A   C+ +
Sbjct: 488 RLGYAPTKCADI 499


>gi|4063754|gb|AAC98462.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|20197956|gb|AAM15330.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
          Length = 389

 Score =  149 bits (377), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 113/367 (30%), Positives = 176/367 (47%), Gaps = 45/367 (12%)

Query: 130 LQTLNYIATIELGGRNMTV--IVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVL 187
             T  Y+  +++G     +  ++DTGS+  W QC PC  CYNQ  P+FDPS S ++K++ 
Sbjct: 54  FDTYEYLMKLQIGTPPFEIEAVLDTGSEHIWTQCLPCVHCYNQTAPIFDPSKSSTFKEIR 113

Query: 188 CNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDF-- 245
           C++   H+                  C Y + YG  SYT+G L  E + +   S   F  
Sbjct: 114 CDTHD-HS------------------CPYELVYGGKSYTKGTLVTETVTIHSTSGQPFVM 154

Query: 246 ---IFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLI 302
              I GCGRNN G   G +G++GL R   SL++Q    + GL SYC      AG   S I
Sbjct: 155 PETIIGCGRNNSGFKPGFAGVVGLDRGPKSLITQMGGEYPGLMSYCF-----AGKGTSKI 209

Query: 303 LGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA----KGGILID 358
             G +++      ++ T  +   +   FY LNL  +S+G  +++  G      KG I+ID
Sbjct: 210 NFGANAIVAGDGVVSTTVFVKTAKPG-FYYLNLDAVSVGNTRIETVGTPFHALKGNIVID 268

Query: 359 SGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNI-PLVKMEFEG 417
           SG+ +T  P S Y  L  + ++Q       P   IL  C+     + ++I P++ M F G
Sbjct: 269 SGSTLTYFPES-YCNLVRKAVEQVVTAVRFPRSDIL--CY---YSKTIDIFPVITMHFSG 322

Query: 418 NAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFA 477
            A++ +D   + Y   +     CLA+   S  +E  I GN  Q N  V YD+ +  + F 
Sbjct: 323 GADLVLDKYNM-YVASNTGGVFCLAIICNSPIEE-AIFGNRAQNNFLVGYDSSSLLVSFK 380

Query: 478 GEDCSSM 484
             +CS++
Sbjct: 381 PTNCSAL 387


>gi|326524580|dbj|BAK00673.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 461

 Score =  149 bits (377), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 135/483 (27%), Positives = 202/483 (41%), Gaps = 77/483 (15%)

Query: 28  AHCFEGKKKLHL-HKLQWQQKSGSSSSCVSHQKSRIEMGAITLELKHKNYCSGKIVDWNE 86
           A  F G  +LHL H    +Q S       + Q+S+    A+++         GK     E
Sbjct: 27  ADAFAGDVRLHLTHVDAGKQMSRRELIRRAMQRSKARAAALSVARSGSGRVPGKSAQQGE 86

Query: 87  QQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GR 144
           Q Q             Q  +    SG+                   L Y+  + +G   +
Sbjct: 87  QHQ-------------QPGVPVRPSGD-------------------LEYLIDLAIGTPPQ 114

Query: 145 NMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSG 204
            ++ ++DTGSDL W QC PC SC  Q DP+F P+ S SY  + C+   C+ +        
Sbjct: 115 PVSALLDTGSDLIWTQCAPCASCLAQPDPLFAPAASSSYVPMRCSGQLCNDILHH----- 169

Query: 205 VCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFI----FGCGRNNKGLFGGV 260
             S   P  C Y  +YGDG+ T G    E      +S         FGCG  N G     
Sbjct: 170 --SCQRPDTCTYRYNYGDGTTTLGVYATERFTFASSSGEKLSVPLGFGCGTMNVGSLNNG 227

Query: 261 SGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSS-VFKNSTPIT-- 317
           SG++G GR  LSLVSQ S      FSYCL +   +    +L+ G  S  VF+     T  
Sbjct: 228 SGIVGFGRDPLSLVSQLSI---RRFSYCL-TPYTSTRKSTLMFGSLSDGVFEGDDAATGQ 283

Query: 318 --YTNMIPNPQLATFYILNLTGISIGGKQLQA--SGFA-----KGGILIDSGTVITRLPP 368
              T ++ + Q  TFY +  TG+++G ++L+   S FA      GG+++DSGT +T  P 
Sbjct: 284 VQTTRLLQSRQNPTFYYVPFTGVTVGTRRLRIPLSAFALRPDGSGGVIVDSGTALTLFPA 343

Query: 369 SIYSALKAEFLKQFSGFPSAPGFSILD-TCFNLSAYQE---------VNIPLVKMEFEGN 418
           ++ + +   F  Q    P     S  D  CF                V++P +   F+G 
Sbjct: 344 AVLTEVLRAFRAQLR-LPFTSSSSPDDGVCFATPMAAGGRRASAATVVSVPRMAFHFQG- 401

Query: 419 AEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAG 478
           A++ +       +V  D  +  L +      D    IGN+ Q++ RV+YD +   L FA 
Sbjct: 402 ADLELPRRN---YVLDDPRRGSLCILLADSGDSGATIGNFVQQDMRVLYDLEAETLSFAP 458

Query: 479 EDC 481
             C
Sbjct: 459 AQC 461


>gi|357159298|ref|XP_003578403.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 442

 Score =  149 bits (376), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 124/405 (30%), Positives = 183/405 (45%), Gaps = 46/405 (11%)

Query: 103 QSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQ 160
           Q + + +++G   DVS       + +   T  YIA+  +G   +    ++DTGSDL W Q
Sbjct: 61  QQQQQRLMAGAEDDVS-------AQVHRATRQYIASYLIGSPPQRTEALIDTGSDLIWTQ 113

Query: 161 CQPC---KSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYF 217
           C      KSC  Q  P ++ S S ++  V C         F   N GV        C + 
Sbjct: 114 CATTCLPKSCAKQGLPYYNLSQSSTFVPVPCADKA----GFCAAN-GVHLCGLDGSCTFI 168

Query: 218 VSYGDGSYTRGELGREHLGLGKASVNDFIFGC---GRNNKGLFGGVSGLMGLGRSDLSLV 274
            SYG G    G LG E     ++      FGC    R   G     SGL+GLGR  LSLV
Sbjct: 169 ASYGAGRVI-GSLGTESFAF-ESGTTSLAFGCVSLTRITSGALNDASGLIGLGRGRLSLV 226

Query: 275 SQTSEIFGGLFSYCL-PSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYIL 333
           SQ   I    FSYCL P    +GAS  L +G ++S+      + +     +   +TFY L
Sbjct: 227 SQ---IGATRFSYCLTPYFHSSGASSHLFVGASASLGGGGASMPFVKSPKDYPYSTFYYL 283

Query: 334 NLTGISIGGKQLQA------------SGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQ 381
            L GI++G  +L A             G+  GG++ID+G+ +T+L    Y ALK E   Q
Sbjct: 284 PLEGITVGKTRLPAVNSTTFQLRQLFKGYWAGGVIIDTGSPLTQLASHAYEALKEEVAAQ 343

Query: 382 F--SGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV 439
                   AP  S L+ C     +Q+V +P +   F G A+M V      Y+   D +  
Sbjct: 344 LGNGSLVPAPEDSGLELCVAREGFQKV-VPALVFHFGGGADMAVPAAS--YWAPVDKAAA 400

Query: 440 CLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
           C+ +    Y+    IIGN+QQ++  ++YD +  +  F   DC+ +
Sbjct: 401 CMMILEGGYDS---IIGNFQQQDMHLLYDLRRGRFSFQTADCTML 442


>gi|302780575|ref|XP_002972062.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
 gi|300160361|gb|EFJ26979.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
          Length = 429

 Score =  149 bits (376), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 116/397 (29%), Positives = 175/397 (44%), Gaps = 43/397 (10%)

Query: 120 TEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQ---------PCKSCY 168
            E P+ SG  L    Y+ ++  G   + + +I DTGSDL W+QC          P K+C 
Sbjct: 38  AESPMESGAFLGLGQYLVSMAFGTPPQEVLLIADTGSDLIWLQCSTTAAPPAFCPKKAC- 96

Query: 169 NQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRG 228
             + P F  S S +   V C+++ C  +    G+   CS ++P  C Y   Y DGS T G
Sbjct: 97  -SRRPAFVASKSATLSVVPCSAAQCLLVPAPRGHGPACSPAAPVPCGYAYDYADGSSTTG 155

Query: 229 ELGREHLGL-----GKASVNDFIFGCG-RNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFG 282
            L R+   +     G A+V    FGCG RN  G F G  G++GLG+  LS  +Q+  +F 
Sbjct: 156 FLARDTATISNGTSGGAAVRGVAFGCGTRNQGGSFSGTGGVIGLGQGQLSFPAQSGSLFA 215

Query: 283 GLFSYCLPSTQDA--GASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISI 340
             FSYCL   +    G S S +  G     +      YT ++ NP   TFY + +  I +
Sbjct: 216 QTFSYCLLDLEGGRRGRSSSFLFLGRP---ERRAAFAYTPLVSNPLAPTFYYVGVVAIRV 272

Query: 341 GGKQLQASG-------FAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPG--- 390
           G + L   G          GG +IDSG+ +T L    Y  L + F       P  P    
Sbjct: 273 GNRVLPVPGSEWAIDVLGNGGTVIDSGSTLTYLRLGAYLHLVSAFAASVH-LPRIPSSAT 331

Query: 391 -FSILDTCFNL-----SAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALA 444
            F  L+ C+N+     SA      P + ++F     + +     +  V  D    CLA+ 
Sbjct: 332 FFQGLELCYNVSSSSSSAPANGGFPRLTIDFAQGLSLELPTGNYLVDVADDVK--CLAIR 389

Query: 445 SLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
                    ++GN  Q+   V +D  ++++GFA  +C
Sbjct: 390 PTLSPFAFNVLGNLMQQGYHVEFDRASARIGFARTEC 426


>gi|297814776|ref|XP_002875271.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297321109|gb|EFH51530.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 451

 Score =  149 bits (376), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 126/411 (30%), Positives = 187/411 (45%), Gaps = 41/411 (9%)

Query: 102 LQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWV 159
           L +R  + +S   K V   + P+ SG    +  Y   + +G   +++ +I DTGSDL WV
Sbjct: 50  LDTRRLHFLSLRRKPVPFVKSPVVSGASSGSGQYFVDLRIGQPPQSLLLIADTGSDLVWV 109

Query: 160 QCQPCKSC-YNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSS-PPDCNYF 217
           +C  C++C ++    VF P  S ++    C    C  +    G +  C+ +     C Y 
Sbjct: 110 KCSACRNCSHHSPATVFFPRHSSTFSPAHCYDPVCRLVP-KPGRAPRCNHTRIHSTCPYE 168

Query: 218 VSYGDGSYTRGELGREHLGL----GK-ASVNDFIFGCGRNNKGL------FGGVSGLMGL 266
             Y DGS T G   RE   L    GK A +    FGCG    G       F G +G+MGL
Sbjct: 169 YGYADGSLTSGLFARETTSLKTSSGKEAKLKSVAFGCGFRISGQSVSGTSFNGANGVMGL 228

Query: 267 GRSDLSLVSQTSEIFGGLFSYCLPS-TQDAGASGSLILG-GNSSVFKNSTPITYTNMIPN 324
           GR  +S  SQ    FG  FSYCL   T     +  LI+G G  +V K    + +T ++ N
Sbjct: 229 GRGPISFASQLGRRFGNKFSYCLMDYTLSPPPTSYLIIGDGGDAVSK----LFFTPLLTN 284

Query: 325 PQLATFYILNLTGISIGGKQL-------QASGFAKGGILIDSGTVITRLPPSIYSALKAE 377
           P   TFY + L  + + G +L       +      GG ++DSGT +  L    Y  + A 
Sbjct: 285 PLSPTFYYVKLKSVFVNGAKLRIDPSIWEIDDSGNGGTVMDSGTTLAFLADPAYRLVIAA 344

Query: 378 FLKQFSGFPSA----PGFSILDTCFNLSAY--QEVNIPLVKMEFEGNAEMTVDVTGIVYF 431
            +KQ    P+A    PGF   D C N+S     E  +P +K EF G A          YF
Sbjct: 345 -VKQRIKLPNADELTPGF---DLCVNVSGVTKPEKILPRLKFEFSGGAVFVPPPRN--YF 398

Query: 432 VKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
           ++++    CLA+ S+  +    +IGN  Q+     +D   S+LGF+   C+
Sbjct: 399 IETEEQIQCLAIQSVDPKVGFSVIGNLMQQGFLFEFDRDRSRLGFSRRGCA 449


>gi|9759559|dbj|BAB11161.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
 gi|21553652|gb|AAM62745.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
 gi|109134179|gb|ABG25087.1| At5g07030 [Arabidopsis thaliana]
          Length = 439

 Score =  149 bits (375), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 131/424 (30%), Positives = 202/424 (47%), Gaps = 54/424 (12%)

Query: 78  SGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIR-LQTLNYI 136
           S   + W  +    L  D   +QYL S    +++G       + +P+ SG + LQ+  YI
Sbjct: 51  SSSPLSWEARVLQTLAQDQARLQYLSS----LVAGR------SVVPIASGRQMLQSTTYI 100

Query: 137 ATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCH 194
               +G   + + + +DT SD+ W+ C  C  C    +  F P+ S S+K V C++  C 
Sbjct: 101 VKALIGTPAQPLLLAMDTSSDVAWIPCSGCVGC--PSNTAFSPAKSTSFKNVSCSAPQCK 158

Query: 195 ALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNK 254
            +   T  +  CS        + ++YG  S     L ++ + L    +  F FGC   NK
Sbjct: 159 QVPNPTCGARACS--------FNLTYGSSSIA-ANLSQDTIRLAADPIKAFTFGCV--NK 207

Query: 255 GLFGGV----SGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVF 310
              GG      GL+GLGR  LSL+SQ   I+   FSYCLPS +    SGSL LG  S   
Sbjct: 208 VAGGGTIPPPQGLLGLGRGPLSLMSQAQSIYKSTFSYCLPSFRSLTFSGSLRLGPTSQPQ 267

Query: 311 KNSTPITYTNMIPNPQLATFYILNLTGISIGGK--QLQASGFA-----KGGILIDSGTVI 363
           +    + YT ++ NP+ ++ Y +NL  I +G K   L  +  A       G + DSGTV 
Sbjct: 268 R----VKYTQLLRNPRRSSLYYVNLVAIRVGRKVVDLPPAAIAFNPSTGAGTIFDSGTVY 323

Query: 364 TRLPPSIYSALKAEFLKQFSGFPSAPGFSIL---DTCFNLSAYQEVNIPLVKMEFEGNAE 420
           TRL   +Y A++ EF K+    P+    + L   DTC++     +V +P +   F+G   
Sbjct: 324 TRLAKPVYEAVRNEFRKRVK--PTTAVVTSLGGFDTCYS----GQVKVPTITFMFKG-VN 376

Query: 421 MTVDVTGIVYFVKSDASQVCLALASL--SYEDETGIIGNYQQKNQRVIYDTKNSQLGFAG 478
           MT+    ++    +  S  CLA+A+   +      +I + QQ+N RV+ D  N +LG A 
Sbjct: 377 MTMPADNLMLH-STAGSTSCLAMAAAPENVNSVVNVIASMQQQNHRVLIDVPNGRLGLAR 435

Query: 479 EDCS 482
           E CS
Sbjct: 436 ERCS 439


>gi|79507883|ref|NP_196320.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332003717|gb|AED91100.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 455

 Score =  149 bits (375), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 131/424 (30%), Positives = 202/424 (47%), Gaps = 54/424 (12%)

Query: 78  SGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIR-LQTLNYI 136
           S   + W  +    L  D   +QYL S    +++G       + +P+ SG + LQ+  YI
Sbjct: 67  SSSPLSWEARVLQTLAQDQARLQYLSS----LVAGR------SVVPIASGRQMLQSTTYI 116

Query: 137 ATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCH 194
               +G   + + + +DT SD+ W+ C  C  C    +  F P+ S S+K V C++  C 
Sbjct: 117 VKALIGTPAQPLLLAMDTSSDVAWIPCSGCVGC--PSNTAFSPAKSTSFKNVSCSAPQCK 174

Query: 195 ALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNK 254
            +   T  +  CS        + ++YG  S     L ++ + L    +  F FGC   NK
Sbjct: 175 QVPNPTCGARACS--------FNLTYGSSSIA-ANLSQDTIRLAADPIKAFTFGCV--NK 223

Query: 255 GLFGGV----SGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVF 310
              GG      GL+GLGR  LSL+SQ   I+   FSYCLPS +    SGSL LG  S   
Sbjct: 224 VAGGGTIPPPQGLLGLGRGPLSLMSQAQSIYKSTFSYCLPSFRSLTFSGSLRLGPTSQPQ 283

Query: 311 KNSTPITYTNMIPNPQLATFYILNLTGISIGGK--QLQASGFA-----KGGILIDSGTVI 363
           +    + YT ++ NP+ ++ Y +NL  I +G K   L  +  A       G + DSGTV 
Sbjct: 284 R----VKYTQLLRNPRRSSLYYVNLVAIRVGRKVVDLPPAAIAFNPSTGAGTIFDSGTVY 339

Query: 364 TRLPPSIYSALKAEFLKQFSGFPSAPGFSIL---DTCFNLSAYQEVNIPLVKMEFEGNAE 420
           TRL   +Y A++ EF K+    P+    + L   DTC++     +V +P +   F+G   
Sbjct: 340 TRLAKPVYEAVRNEFRKRVK--PTTAVVTSLGGFDTCYS----GQVKVPTITFMFKG-VN 392

Query: 421 MTVDVTGIVYFVKSDASQVCLALASL--SYEDETGIIGNYQQKNQRVIYDTKNSQLGFAG 478
           MT+    ++    +  S  CLA+A+   +      +I + QQ+N RV+ D  N +LG A 
Sbjct: 393 MTMPADNLMLH-STAGSTSCLAMAAAPENVNSVVNVIASMQQQNHRVLIDVPNGRLGLAR 451

Query: 479 EDCS 482
           E CS
Sbjct: 452 ERCS 455


>gi|326513755|dbj|BAJ87896.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 442

 Score =  149 bits (375), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 113/373 (30%), Positives = 172/373 (46%), Gaps = 43/373 (11%)

Query: 134 NYIATIELGG-RNMTVIV--DTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNS 190
            Y+  + +G  R+  V++  DTGSD+ W QC+PC  C+ Q  P FD + S + + V C+ 
Sbjct: 91  EYLIHLSIGAPRSQPVVLTLDTGSDVVWTQCEPCAECFTQPLPRFDTAASNTVRSVACSD 150

Query: 191 STCHALE----FATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHL------GLGKA 240
             C+A      F  G            C Y   YGDGS + G   R+        G GK 
Sbjct: 151 PLCNAHSEHGCFLHG------------CTYVSGYGDGSLSFGHFLRDSFTFDDGKGGGKV 198

Query: 241 SVNDFIFGCGRNNKGLFGGV-SGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASG 299
           +V D  FGCG  N G F    +G+ G GR  LSL SQ        FSYC  +T+    S 
Sbjct: 199 TVPDIGFGCGMYNAGRFLQTETGIAGFGRGPLSLPSQLKV---RQFSYCF-TTRFEAKSS 254

Query: 300 SLILGGNSSVFKNST-PITYTNMI---PNPQLATFYILNLTGISIGGKQL---QASGFAK 352
            + LGG   +  ++T PI  T  +   P     + Y+L+  G+++G  +L   +      
Sbjct: 255 PVFLGGAGDLKAHATGPILSTPFVRSLPPGTDNSHYVLSFKGVTVGKTRLPVPEIKADGS 314

Query: 353 GGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVK 412
           G   IDSGT IT  P +++  LK+ F+ Q +  P        D CF+    +   +P + 
Sbjct: 315 GATFIDSGTDITTFPDAVFRQLKSAFIAQ-AALPVNKTADEDDICFSWDGKKTAAMPKLV 373

Query: 413 MEFEGNAEMTVDVTGIVYFVKS-DASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKN 471
              EG      D+    Y  +  ++ QVC+A+++    D T +IGN+QQ+N  ++YD   
Sbjct: 374 FHLEG---ADWDLPRENYVTEDRESGQVCVAVSTSGQMDRT-LIGNFQQQNTHIVYDLAA 429

Query: 472 SQLGFAGEDCSSM 484
            +L      C  +
Sbjct: 430 GKLLLVPAQCDKL 442


>gi|356508308|ref|XP_003522900.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 439

 Score =  149 bits (375), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 136/438 (31%), Positives = 212/438 (48%), Gaps = 57/438 (13%)

Query: 68  TLELKHK-NYCSG----KIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEI 122
           TLE+ H  + CS     K + W E        D   +Q+L S    M++G       + +
Sbjct: 35  TLEVFHVFSPCSPFRPPKPLSWAESVLQLQAKDQARLQFLAS----MVAGR------SVV 84

Query: 123 PLTSGIRL-QTLNYIATIELGGRNMTVIV--DTGSDLTWVQCQPCKSCYNQQDPVFDPSI 179
           P+ SG ++ Q+  YI   ++G    T+++  DT +D  W+ C  C  C +    +F P  
Sbjct: 85  PIASGRQIIQSPTYIVRAKIGSPPQTLLLAMDTSNDAAWIPCTACDGCTST---LFAPEK 141

Query: 180 SPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGK 239
           S ++K V C S  C+ +      +  C +S+   C + ++YG  S     + ++ + L  
Sbjct: 142 STTFKNVSCGSPQCNQVP-----NPSCGTSA---CTFNLTYGSSSIA-ANVVQDTVTLAT 192

Query: 240 ASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASG 299
             + D+ FGC     G      GL+GLGR  LSL+SQT  ++   FSYCLPS +    SG
Sbjct: 193 DPIPDYTFGCVAKTTGASAPPQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPSFKSLNFSG 252

Query: 300 SLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL----QASGF--AKG 353
           SL LG  +   +    I YT ++ NP+ ++ Y +NL  I +G K +    +A  F  A G
Sbjct: 253 SLRLGPVAQPIR----IKYTPLLKNPRRSSLYYVNLVAIRVGRKVVDIPPEALAFNAATG 308

Query: 354 -GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI-----LDTCFNLSAYQEVN 407
            G + DSGTV TRL    Y+A++ EF ++ +    A   ++      DTC+ +     + 
Sbjct: 309 AGTVFDSGTVFTRLVAPAYTAVRDEFQRRVAIAAKA-NLTVTSLGGFDTCYTV----PIV 363

Query: 408 IPLVKMEFEGNAEMTVDVTGIVYFVKSDA-SQVCLALASL--SYEDETGIIGNYQQKNQR 464
            P +   F G   M V +      + S A S  CLA+AS   +      +I N QQ+N R
Sbjct: 364 APTITFMFSG---MNVTLPEDNILIHSTAGSTTCLAMASAPDNVNSVLNVIANMQQQNHR 420

Query: 465 VIYDTKNSQLGFAGEDCS 482
           V+YD  NS+LG A E C+
Sbjct: 421 VLYDVPNSRLGVARELCT 438


>gi|125571060|gb|EAZ12575.1| hypothetical protein OsJ_02481 [Oryza sativa Japonica Group]
          Length = 501

 Score =  148 bits (373), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 123/384 (32%), Positives = 171/384 (44%), Gaps = 42/384 (10%)

Query: 123 PLTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSIS 180
           P+ SG+   +  Y   I +G       +++DTGSD+ W+QC PC+ CY+Q   +FDP  S
Sbjct: 135 PVVSGLAQGSGEYFTKIGVGTPVTPALMVLDTGSDVVWLQCAPCRRCYDQSGQMFDPRAS 194

Query: 181 PSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGK- 239
            SY  V C +  C  L+     SG C       C Y V+YGDGS T G+   E L     
Sbjct: 195 HSYGAVDCAAPLCRRLD-----SGGCDLRR-KACLYQVAYGDGSVTAGDFATETLTFASG 248

Query: 240 ASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPS-------- 291
           A V     GCG +N+GLF   +GL+GLGR  LS  SQ S  FG  FSYCL          
Sbjct: 249 ARVPRVALGCGHDNEGLFVAAAGLLGLGRGSLSFPSQISRRFGRSFSYCLVDRTSSSASA 308

Query: 292 -------TQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQ 344
                  T  +GA G+L   G   +  +       +++                    + 
Sbjct: 309 TSRSSTVTFGSGARGAL---GRRVLHPDGEEPQDGDVLLRAAHGHQRRRRARPGRGRVRP 365

Query: 345 LQASGFAKGGILIDSG------TVITRLPP-SIYSALKAEFLKQFSGFPSAPGFSILDTC 397
                  +GG+++DSG          R PP +  S   A  L+   G     GFS+ DTC
Sbjct: 366 PPDPSTGRGGVIVDSGRPSPAWARAGRTPPCATRSRAAAAGLRLSPG-----GFSLFDTC 420

Query: 398 FNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGN 457
           ++LS  + V +P V M F G AE  +     +  V S  +  C A A    +    IIGN
Sbjct: 421 YDLSGLKVVKVPTVSMHFAGGAEAALPPENYLIPVDSRGT-FCFAFAGT--DGGVSIIGN 477

Query: 458 YQQKNQRVIYDTKNSQLGFAGEDC 481
            QQ+  RV++D    +LGF  + C
Sbjct: 478 IQQQGFRVVFDGDGQRLGFVPKGC 501


>gi|125595873|gb|EAZ35653.1| hypothetical protein OsJ_19940 [Oryza sativa Japonica Group]
          Length = 468

 Score =  148 bits (373), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 106/340 (31%), Positives = 151/340 (44%), Gaps = 45/340 (13%)

Query: 150 VDTGSDLTWVQCQPCK--SCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCS 207
           +DT  DL W+QC PC    CY QQ+ +FDP  S +   V C S+ C  L           
Sbjct: 166 IDTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGEL----------- 214

Query: 208 SSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFG---CGRNNKGLFGGVSGLM 264
                          G Y R  L +    L +            C           SG M
Sbjct: 215 ---------------GRYGRWLLQQPVPVLRRLRRRQGQPRGRTCHAVRGNFSASTSGTM 259

Query: 265 GLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPN 324
            LG    SL+SQT+  FG  FSYC+P   D  +SG L LGG +           T ++ N
Sbjct: 260 SLGGGRQSLLSQTAATFGNAFSYCVP---DPSSSGFLSLGGPADGGGAGR-FARTPLVRN 315

Query: 325 PQ-LATFYILNLTGISIGGKQLQASGFA-KGGILIDSGTVITRLPPSIYSALKAEFLKQF 382
           P  + T Y++ L GI +GG++L        GG ++DS  +IT+LPP+ Y AL+  F    
Sbjct: 316 PSIIPTLYLVRLRGIEVGGRRLNVPPVVFAGGAVMDSSVIITQLPPTAYRALRLAFRSAM 375

Query: 383 SGFPS-APGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCL 441
           + +P  A G + LDTC++   +  V +P V + F+G A + +D  G++        + CL
Sbjct: 376 AAYPRVAGGRAGLDTCYDFVRFTSVTVPAVSLVFDGGAVVRLDAMGVMV-------EGCL 428

Query: 442 ALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
           A      +   G IGN QQ+   V+YD     +GF    C
Sbjct: 429 AFVPTPGDFALGFIGNVQQQTHEVLYDVGGGSVGFRRGAC 468


>gi|125586057|gb|EAZ26721.1| hypothetical protein OsJ_10629 [Oryza sativa Japonica Group]
          Length = 474

 Score =  147 bits (372), Expect = 8e-33,   Method: Compositional matrix adjust.
 Identities = 128/455 (28%), Positives = 206/455 (45%), Gaps = 57/455 (12%)

Query: 60  SRIEMGAITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSN 119
           +R +  A+ L   H         D       R +L  +  +  ++R   ++SG       
Sbjct: 47  ARCDAAALRLHATH--------ADAGRGLSTRELLRRMAARS-KARSARLLSGRAASARM 97

Query: 120 TEIPLTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDP 177
                T G+      Y+  + +G   + + +I+DTGSDLTW QC PC SC+ Q  P F+P
Sbjct: 98  DPGSYTDGV--PDTEYLVHMAIGTPPQPVQLILDTGSDLTWTQCAPCVSCFRQSLPRFNP 155

Query: 178 SISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPD--CNYFVSYGDGSYTRGELGREHL 235
           S S ++  + C+   C  L +++     C   S  +  C Y  +Y D S T G L  +  
Sbjct: 156 SRSMTFSVLPCDLRICRDLTWSS-----CGEQSWGNGICVYAYAYADHSITTGHLDSDTF 210

Query: 236 -------GLGKASVNDFIFGCGRNNKGLF-GGVSGLMGLGRSDLSLVSQTSEIFGGLFSY 287
                   +G ASV D  FGCG  N G+F    +G+ G  R  LS+ +Q        FSY
Sbjct: 211 SFASADHAIGGASVPDLTFGCGLFNNGIFVSNETGIAGFSRGALSMPAQLKV---DNFSY 267

Query: 288 CL-------PSTQDAGASGSL---ILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTG 337
           C        PS    G   +L     GG   V +++  I Y     + QL  +YI +L G
Sbjct: 268 CFTAITGSEPSPVFLGVPPNLYSDAAGGGHGVVQSTALIRYH----SSQLKAYYI-SLKG 322

Query: 338 ISIGGKQL--QASGFA-----KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPG 390
           +++G  +L    S FA      GG ++DSGT +T LP ++Y+ +   F+ Q         
Sbjct: 323 VTVGTTRLPIPESVFALKEDGTGGTIVDSGTGMTMLPEAVYNLVCDAFVAQTKLTVHNST 382

Query: 391 FSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVY-FVKSDASQVCLALASLSYE 449
            S+   CF++    + ++P + + FEG    T+D+    Y F   +A  + L   +++  
Sbjct: 383 SSLSQLCFSVPPGAKPDVPALVLHFEG---ATLDLPRENYMFEIEEAGGIRLTCLAINAG 439

Query: 450 DETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
           ++  +IGN+QQ+N  V+YD  N  L F    C+ +
Sbjct: 440 EDLSVIGNFQQQNMHVLYDLANDMLSFVPARCNKI 474


>gi|115452683|ref|NP_001049942.1| Os03g0317300 [Oryza sativa Japonica Group]
 gi|108707833|gb|ABF95628.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113548413|dbj|BAF11856.1| Os03g0317300 [Oryza sativa Japonica Group]
          Length = 448

 Score =  147 bits (371), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 128/455 (28%), Positives = 206/455 (45%), Gaps = 57/455 (12%)

Query: 60  SRIEMGAITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSN 119
           +R +  A+ L   H         D       R +L  +  +  ++R   ++SG       
Sbjct: 21  ARCDAAALRLHATH--------ADAGRGLSTRELLRRMAARS-KARSARLLSGRAASARM 71

Query: 120 TEIPLTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDP 177
                T G+      Y+  + +G   + + +I+DTGSDLTW QC PC SC+ Q  P F+P
Sbjct: 72  DPGSYTDGV--PDTEYLVHMAIGTPPQPVQLILDTGSDLTWTQCAPCVSCFRQSLPRFNP 129

Query: 178 SISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPD--CNYFVSYGDGSYTRGELGREHL 235
           S S ++  + C+   C  L +++     C   S  +  C Y  +Y D S T G L  +  
Sbjct: 130 SRSMTFSVLPCDLRICRDLTWSS-----CGEQSWGNGICVYAYAYADHSITTGHLDSDTF 184

Query: 236 -------GLGKASVNDFIFGCGRNNKGLF-GGVSGLMGLGRSDLSLVSQTSEIFGGLFSY 287
                   +G ASV D  FGCG  N G+F    +G+ G  R  LS+ +Q        FSY
Sbjct: 185 SFASADHAIGGASVPDLTFGCGLFNNGIFVSNETGIAGFSRGALSMPAQLKV---DNFSY 241

Query: 288 CL-------PSTQDAGASGSL---ILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTG 337
           C        PS    G   +L     GG   V +++  I Y +     QL  +YI +L G
Sbjct: 242 CFTAITGSEPSPVFLGVPPNLYSDAAGGGHGVVQSTALIRYHSS----QLKAYYI-SLKG 296

Query: 338 ISIGGKQL--QASGFA-----KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPG 390
           +++G  +L    S FA      GG ++DSGT +T LP ++Y+ +   F+ Q         
Sbjct: 297 VTVGTTRLPIPESVFALKEDGTGGTIVDSGTGMTMLPEAVYNLVCDAFVAQTKLTVHNST 356

Query: 391 FSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVY-FVKSDASQVCLALASLSYE 449
            S+   CF++    + ++P + + FEG    T+D+    Y F   +A  + L   +++  
Sbjct: 357 SSLSQLCFSVPPGAKPDVPALVLHFEG---ATLDLPRENYMFEIEEAGGIRLTCLAINAG 413

Query: 450 DETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
           ++  +IGN+QQ+N  V+YD  N  L F    C+ +
Sbjct: 414 EDLSVIGNFQQQNMHVLYDLANDMLSFVPARCNKI 448


>gi|357500973|ref|XP_003620775.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|357500991|ref|XP_003620784.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355495790|gb|AES76993.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355495799|gb|AES77002.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 438

 Score =  147 bits (371), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 124/445 (27%), Positives = 217/445 (48%), Gaps = 40/445 (8%)

Query: 54  CVSHQKSRIEMGAITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGN 113
           C S   S+      ++EL H++           Q + + ++D +H      R  N ++ +
Sbjct: 15  CFSISFSQAVSNGFSIELIHRDSSKSPFYKPT-QNKYQHVVDAVH------RSINRVNHS 67

Query: 114 IKDVSNTEIPLTSGIRLQTLNYIATIELGGRNMTV--IVDTGSDLTWVQCQPCKSCYNQQ 171
            K+ S    P ++ I  +  +YI +  +G   +    IVDTGSD+ W+QC+PC+ CYNQ 
Sbjct: 68  NKN-SLASTPESTVISYEG-DYIMSYSVGTPPIKSYGIVDTGSDIVWLQCEPCEQCYNQT 125

Query: 172 DPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELG 231
            P F+PS S SYK + C+S  C ++   + N          +C Y ++YG+ S+++G+L 
Sbjct: 126 TPKFNPSKSSSYKNISCSSKLCQSVRDTSCN-------DKKNCEYSINYGNQSHSQGDLS 178

Query: 232 REHLGLGK-----ASVNDFIFGCGRNNKGLFG-GVSGLMGLGRSDLSLVSQTSEIFGGLF 285
            E L L        S    + GCG NN G F    SG++GLG    SL++Q     GG F
Sbjct: 179 LETLTLESTTGRPVSFPKTVIGCGTNNIGSFKRVSSGVVGLGGGPASLITQLGPSIGGKF 238

Query: 286 SYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQL----ATFYILNLTGISIG 341
           SYCL   + +    ++ +G +   F +   ++  N++  P +    + FY L +   S+G
Sbjct: 239 SYCL--VRMSITLKNMSMGSSKLNFGDVAIVSGHNVLSTPIVKKDHSFFYYLTIEAFSVG 296

Query: 342 GKQLQASGFAK----GGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTC 397
            K+++ +G +K    G I+IDS T++T +P  +Y+ L +  +   +             C
Sbjct: 297 DKRVEFAGSSKGVEEGNIIIDSSTIVTFVPSDVYTKLNSAIVDLVTLERVDDPNQQFSLC 356

Query: 398 FNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGN 457
           +N+S+ +E + P +   F+G A++ +  T    FV+     +C A A     +   I G+
Sbjct: 357 YNVSSDEEYDFPYMTAHFKG-ADILLYATNT--FVEVARDVLCFAFAP---SNGGAIFGS 410

Query: 458 YQQKNQRVIYDTKNSQLGFAGEDCS 482
           + Q++  V YD +   + F   DC+
Sbjct: 411 FSQQDFMVGYDLQQKTVSFKSVDCT 435


>gi|125586059|gb|EAZ26723.1| hypothetical protein OsJ_10631 [Oryza sativa Japonica Group]
          Length = 339

 Score =  147 bits (371), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 120/354 (33%), Positives = 172/354 (48%), Gaps = 45/354 (12%)

Query: 150 VDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSS 209
           ++ G++L W    P   C+ Q  P F+P    ++ + L  +S C + +F    +      
Sbjct: 12  LENGNELIWNHSNPSPECFEQAFPYFEPL---TFSRGLPFAS-CGSPKFWPNQT------ 61

Query: 210 SPPDCNYFVSYGDGSYTRGELGREHLGL--GKASVNDFIFGCGRNNKGLF-GGVSGLMGL 266
               C Y  SYGD S T G L  +        ASV    FGCG  N G+F    +G+ G 
Sbjct: 62  ----CVYTYSYGDKSVTTGFLEVDKFTFVGAGASVPGVAFGCGLFNNGVFKSNETGIAGF 117

Query: 267 GRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKN------STP-ITYT 319
           GR  LSL SQ      G FS+C  +T       +++L   + +F N      +TP I Y 
Sbjct: 118 GRGPLSLPSQLKV---GNFSHCF-TTITGAIPSTVLLDLPADLFSNGQGAVQTTPLIQYA 173

Query: 320 NMIPNPQLATFYILNLTGISIGGKQLQA--SGFA----KGGILIDSGTVITRLPPSIYSA 373
               NP   T Y L+L GI++G  +L    S FA     GG +IDSGT IT LPP +Y  
Sbjct: 174 KNEANP---TLYYLSLKGITVGSTRLPVPESAFALTNGTGGTIIDSGTSITSLPPQVYQV 230

Query: 374 LKAEFLKQFSGFPSAPGFSILD-TCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFV 432
           ++ EF  Q    P  PG +    TCF+  +  + ++P + + FEG A M +     V+ V
Sbjct: 231 VRDEFAAQIK-LPVVPGNATGHYTCFSAPSQAKPDVPKLVLHFEG-ATMDLPRENYVFEV 288

Query: 433 KSDA--SQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
             DA  S +CLA+   +  DET IIGN+QQ+N  V+YD +N+ L F    C  +
Sbjct: 289 PDDAGNSIICLAI---NKGDETTIIGNFQQQNMHVLYDLQNNMLSFVAAQCDKL 339


>gi|125543634|gb|EAY89773.1| hypothetical protein OsI_11315 [Oryza sativa Indica Group]
          Length = 474

 Score =  147 bits (371), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 128/455 (28%), Positives = 206/455 (45%), Gaps = 57/455 (12%)

Query: 60  SRIEMGAITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSN 119
           +R +  A+ L   H         D       R +L  +  +  ++R   ++SG       
Sbjct: 47  ARSDAAALRLHATH--------ADAGRGLSTRELLHRMAARS-KARSARLLSGRAASARV 97

Query: 120 TEIPLTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDP 177
                T G+      Y+  + +G   + + +I+DTGSDLTW QC PC SC+ Q  P F+P
Sbjct: 98  DPGSYTDGV--PDTEYLVHMAIGTPPQPVQLILDTGSDLTWTQCAPCVSCFRQSLPRFNP 155

Query: 178 SISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPD--CNYFVSYGDGSYTRGELGREHL 235
           S S ++  + C+   C  L +++     C   S  +  C Y  +Y D S T G L  +  
Sbjct: 156 SRSMTFSVLPCDLRICRDLTWSS-----CGEQSWGNGICVYAYAYADHSITTGHLDSDTF 210

Query: 236 -------GLGKASVNDFIFGCGRNNKGLF-GGVSGLMGLGRSDLSLVSQTSEIFGGLFSY 287
                   +G ASV D  FGCG  N G+F    +G+ G  R  LS+ +Q        FSY
Sbjct: 211 SFASADHAIGGASVPDLTFGCGLFNNGIFVSNETGIAGFSRGALSMPAQLKV---DNFSY 267

Query: 288 CL-------PSTQDAGASGSL---ILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTG 337
           C        PS    G   +L     GG   V +++  I Y     + QL  +YI +L G
Sbjct: 268 CFTAITGSEPSPVFLGVPPNLYSDAAGGGHGVVQSTALIRYH----SSQLKAYYI-SLKG 322

Query: 338 ISIGGKQL--QASGFA-----KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPG 390
           +++G  +L    S FA      GG ++DSGT +T LP ++Y+ +   F+ Q         
Sbjct: 323 VTVGTTRLPIPESVFALKEDGTGGTIVDSGTGMTMLPEAVYNLVCDAFVAQTKLTVHNST 382

Query: 391 FSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVY-FVKSDASQVCLALASLSYE 449
            S+   CF++    + ++P + + FEG    T+D+    Y F   +A  + L   +++  
Sbjct: 383 SSLSQLCFSVPPGAKPDVPALVLHFEG---ATLDLPRENYMFEIEEAGGIRLTCLAINAG 439

Query: 450 DETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
           ++  +IGN+QQ+N  V+YD  N  L F    C+ +
Sbjct: 440 EDLSVIGNFQQQNMHVLYDLANDMLSFVPARCNKI 474


>gi|326521034|dbj|BAJ92880.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 448

 Score =  147 bits (371), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 126/421 (29%), Positives = 200/421 (47%), Gaps = 54/421 (12%)

Query: 78  SGKIVDWNEQQQNRLI-LDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRL-QTLNY 135
           +G + D + +  +RL+ LD+L V                       P+ SG +L QT  Y
Sbjct: 66  AGFLADQSSRDASRLLYLDSLAV-----------------AGRAYAPIASGRQLLQTPTY 108

Query: 136 IATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTC 193
           +    LG   + + + VDT +D  W+ C  C  C       F+P+ S SY+ V C S  C
Sbjct: 109 VVRARLGTPPQQLLLAVDTSNDAAWIPCSGCAGCPTTTP--FNPAASKSYRAVPCGSPAC 166

Query: 194 HALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNN 253
                +   +  CS ++   C + ++Y D S     L ++ L +    V  + FGC +  
Sbjct: 167 -----SRAPNPSCSLNTK-SCGFSLTYADSSL-EAALSQDSLAVANDVVKSYTFGCLQKA 219

Query: 254 KGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNS 313
            G      GL+GLGR  LS +SQT +++ G FSYCLPS +    SG+L LG      +  
Sbjct: 220 TGTATPPQGLLGLGRGPLSFLSQTKDMYEGTFSYCLPSFKSLNFSGTLRLGRKGQPLR-- 277

Query: 314 TPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA-------KGGILIDSGTVITRL 366
             I  T ++ NP  ++ Y +++TGI +G K +     A         G ++DSGT+ TRL
Sbjct: 278 --IKTTPLLVNPHRSSLYYVSMTGIRVGKKVVPIPPAALAFDPATGAGTVLDSGTMFTRL 335

Query: 367 PPSIYSALKAEFLKQFSGFP--SAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVD 424
               Y A++ E  ++  G P  S  GF   DTC+N +    V  P V   F G  ++T+ 
Sbjct: 336 VAPAYVAVRDEVRRRIRGAPLSSLGGF---DTCYNTT----VKWPPVTFMFTG-MQVTLP 387

Query: 425 VTGIVYFVKSDASQVCLALASLSYEDET--GIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
              +V    +  +  CLA+A+      T   +I + QQ+N R+++D  N ++GFA E C+
Sbjct: 388 ADNLVIH-STYGTTSCLAMAAAPDGVNTVLNVIASMQQQNHRILFDVPNGRVGFAREQCT 446

Query: 483 S 483
           +
Sbjct: 447 A 447


>gi|356546376|ref|XP_003541602.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 450

 Score =  147 bits (370), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 131/444 (29%), Positives = 207/444 (46%), Gaps = 34/444 (7%)

Query: 55  VSHQKSRIEMGAITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNI 114
           +S   S++     ++E+ H++     +    E    R+         ++  I      N 
Sbjct: 23  ISFSNSKVLNSGFSVEMIHRDSSRSPLYRHTETPFQRV------ANAMRRSINRANHFNK 76

Query: 115 KDVSNTEIPLTSGIRLQTLNYIATIELGGRNMTV--IVDTGSDLTWVQCQPCKSCYNQQD 172
           K    +     S ++     Y+ +  +G     +  +VDTGS +TW+QCQ C+ CY Q  
Sbjct: 77  KSFVASTNTAESTVKASQGEYLMSYSVGTPPFEILGVVDTGSGITWMQCQRCEDCYEQTT 136

Query: 173 PVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGR 232
           P+FDPS S +YK + C+S+ C ++      S    SS    C Y + YGDGS+++G+L  
Sbjct: 137 PIFDPSKSKTYKTLPCSSNMCQSVI-----STPSCSSDKIGCKYTIKYGDGSHSQGDLSV 191

Query: 233 EHLGLGK---ASVN--DFIFGCGRNNKGLF-GGVSGLMGLGRSDLSLVSQTSEIFGGLFS 286
           E L LG    +SV   + + GCG NNKG F G  SG++GLG   +SL+SQ S   GG FS
Sbjct: 192 ETLTLGSTNGSSVQFPNTVIGCGHNNKGTFQGEGSGVVGLGGGPVSLISQLSSSIGGKFS 251

Query: 287 YCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ 346
           YCL        S S +  G+++V      ++ T ++       FY L L   S+G K+++
Sbjct: 252 YCLAPMFSQSNSSSKLNFGDAAVVSGLGAVS-TPLVSKTGSEVFYYLTLEAFSVGDKRIE 310

Query: 347 --------ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCF 398
                    S   +G I+IDSGT +T LP   YS L++           +   + L  C+
Sbjct: 311 FVGGSSSSGSSNGEGNIIIDSGTTLTLLPQEDYSNLESAVADAIQANRVSDPSNFLSLCY 370

Query: 399 NLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNY 458
             +   ++++P++   F+G     V++  I  FV+     VC A  S    +   I GN 
Sbjct: 371 QTTPSGQLDVPVITAHFKG---ADVELNPISTFVQVAEGVVCFAFHS---SEVVSIFGNL 424

Query: 459 QQKNQRVIYDTKNSQLGFAGEDCS 482
            Q N  V YD     + F   DC+
Sbjct: 425 AQLNLLVGYDLMEQTVSFKPTDCT 448


>gi|356546378|ref|XP_003541603.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 439

 Score =  147 bits (370), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 123/348 (35%), Positives = 176/348 (50%), Gaps = 31/348 (8%)

Query: 149 IVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSS 208
           IVDTGSD+ W+QC+PC+ CY Q  P+FDPS S +YK + C+S+TC +L      +  CSS
Sbjct: 107 IVDTGSDILWLQCEPCEDCYKQTTPIFDPSKSKTYKTLPCSSNTCESLR-----NTACSS 161

Query: 209 SSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIF-----GCGRNNKGLF-GGVSG 262
            +   C Y + YGDGS++ G+L  E L LG    +   F     GCG NN G F    SG
Sbjct: 162 DNV--CEYSIDYGDGSHSDGDLSVETLTLGSTDGSSVHFPKTVIGCGHNNGGTFQEEGSG 219

Query: 263 LMGLGRSDLSLVSQTSEIFGGLFSYCL-PSTQDAGASGSLILGGNSSVFKNSTPITYTNM 321
           ++GLG   +SL+SQ S   GG FSYCL P   ++ +S  L  G  + V    T  T  + 
Sbjct: 220 IVGLGGGPVSLISQLSSSIGGKFSYCLAPIFSESNSSSKLNFGDAAVVSGRGTVSTPLDP 279

Query: 322 IPNPQLATFYILNLTGISIGGKQLQ-------ASGFAKGGILIDSGTVITRLPPSIYSAL 374
           + N Q+  FY L L   S+G  +++        SG   G I+IDSGT +T LP   Y  L
Sbjct: 280 L-NGQV--FYFLTLEAFSVGDNRIEFSGSSSSGSGSGDGNIIIDSGTTLTLLPQEDYLNL 336

Query: 375 KAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKS 434
           ++          +     +L  C+  ++  E+++P++   F+G     V++  I  FV  
Sbjct: 337 ESAVSDVIKLERARDPSKLLSLCYKTTS-DELDLPVITAHFKG---ADVELNPISTFVPV 392

Query: 435 DASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
           +   VC A  S        I GN  Q+N  V YD     + F   DC+
Sbjct: 393 EKGVVCFAFISSKI---GAIFGNLAQQNLLVGYDLVKKTVSFKPTDCT 437


>gi|356528671|ref|XP_003532923.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 440

 Score =  147 bits (370), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 118/349 (33%), Positives = 163/349 (46%), Gaps = 31/349 (8%)

Query: 148 VIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHAL----EFATGNS 203
            I DTGSDL WVQC PC+ C  Q  P+FDP  S ++K V C+S  C  L        G S
Sbjct: 107 AIADTGSDLIWVQCAPCEKCVPQNAPLFDPRKSSTFKTVPCDSQPCTLLPPSQRACVGKS 166

Query: 204 GVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS----VNDFIFGCGRNNKGLFGG 259
           G         C Y   YGD +   G LG E +  G  +         FGC  +N      
Sbjct: 167 G--------QCYYQYIYGDHTLVSGILGFESINFGSKNNAIKFPKLTFGCTFSNNDTVDE 218

Query: 260 VS---GLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPI 316
                GL+GLG   LSL+SQ     G  FSYC P    +  S S +  GN ++ K    +
Sbjct: 219 SKRNMGLVGLGVGPLSLISQLGYQIGRKFSYCFPPL--SSNSTSKMRFGNDAIVKQIKGV 276

Query: 317 TYTNMIPNPQLATFYILNLTGISIGGKQLQAS-GFAKGGILIDSGTVITRLPPSIYSALK 375
             T +I      ++Y LNL G+SIG K+++ S     G ILIDSGT  T L  S Y+   
Sbjct: 277 VSTPLIIKSIGPSYYYLNLEGVSIGNKKVKTSESQTDGNILIDSGTSFTILKQSFYNKFV 336

Query: 376 AEFLKQFSGFPSA--PGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVK 433
           A  +K+  G  +   P   + + CF     ++   P V   F G A++ VD + +  F  
Sbjct: 337 A-LVKEVYGVEAVKIPPL-VYNFCFENKGKRK-RFPDVVFLFTG-AKVRVDASNL--FEA 390

Query: 434 SDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
            D + +C+     S ED++ I GN+ Q   +V YD +   + FA  DC+
Sbjct: 391 EDNNLLCMVALPTSDEDDS-IFGNHAQIGYQVEYDLQGGMVSFAPADCA 438


>gi|297798978|ref|XP_002867373.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297313209|gb|EFH43632.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 434

 Score =  147 bits (370), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 123/403 (30%), Positives = 190/403 (47%), Gaps = 60/403 (14%)

Query: 104 SRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELGGRNMT--VIVDTGSDLTWVQC 161
           SR+ N+ +  I D+ +   P+ +        ++A I +G   +   +++DTGSDLTW+QC
Sbjct: 62  SRLDNLWTTEIADIVSHVTPIPN-----PAAFLANISIGDPPVPQLLLIDTGSDLTWIQC 116

Query: 162 QPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALE--FATGNSGVCSSSSPPDCNYFVS 219
            PCK CY Q  P F PS S +Y+   C S+  HA+   F    +G        +C Y + 
Sbjct: 117 LPCK-CYPQTIPFFHPSRSSTYRNASCESAP-HAMPQIFRDEKTG--------NCRYHLR 166

Query: 220 YGDGSYTRGELGREHLGL-----GKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLV 274
           Y D S TRG L +E L       G  S  + +FGCG++N G F   SG++GLG    S+V
Sbjct: 167 YRDFSNTRGILAKEKLTFQTSDEGLISKPNIVFGCGQDNSG-FTQYSGVLGLGPGTFSIV 225

Query: 275 SQTSEIFGGLFSYCLPSTQDAGASGS-LILGGNSSVFKNSTPITYTNMIPNPQLATFYIL 333
           ++    FG  FSYC  S  D     + LILG  + +  + TP+              Y L
Sbjct: 226 TRN---FGSKFSYCFGSLIDPTYPHNFLILGNGARIEGDPTPLQI--------FQDRYYL 274

Query: 334 NLTGISIGGKQLQASG------FAKGGILIDSGTVITRLPPSIYSALKAEF-------LK 380
           +L  IS+G K L           +KGG +ID+G   T L    Y  L  E        L+
Sbjct: 275 DLQAISLGEKLLDIEPGIFQRYRSKGGTVIDTGCSPTILAREAYETLSEEIDFLLGEVLR 334

Query: 381 QFSGFPSAPGFSILDTCFNLSAYQEV-NIPLVKMEFEGNAEMTVDVTGIVYFVKSDA-SQ 438
           +   +         + C+  +   ++   P+V   F G AE+ +DV  +  FV S++   
Sbjct: 335 RVKDWE-----QYTNHCYEGNLKLDLYGFPVVTFHFAGGAELALDVESL--FVSSESGDS 387

Query: 439 VCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
            CLA+   +++D + +IG   Q+N  V Y+ +  ++ F   DC
Sbjct: 388 FCLAMTMNTFDDMS-VIGAMAQQNYNVGYNLRTMKVYFQRTDC 429


>gi|326520291|dbj|BAK07404.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 464

 Score =  147 bits (370), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 115/370 (31%), Positives = 180/370 (48%), Gaps = 40/370 (10%)

Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTC-HALEFATGN 202
           + + + +DTGSDL W QC  C  C++Q  PVF  S+S ++ +V C+   C HA+      
Sbjct: 106 QRVVLHLDTGSDLVWTQCA-CTVCFDQPVPVFRASVSHTFSRVPCSDPLCGHAVYLPLSG 164

Query: 203 SGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGK-------ASVNDFIFGCGRNNKG 255
                 S    C Y   Y D S T G++  +             A+V +  FGCG  N G
Sbjct: 165 CAARDRS----CFYAYGYMDHSITTGKMAEDTFTFKAPDRADTAAAVPNIRFGCGMMNYG 220

Query: 256 LF-GGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNST 314
           LF    SG+ G G   LSL SQ        FSYC  + +++  S  ++ G   ++  ++T
Sbjct: 221 LFTPNQSGIAGFGTGPLSLPSQLKV---RRFSYCFTAMEESRVSPVILGGEPENIEAHAT 277

Query: 315 -PITYTNMIPNPQLAT-----FYILNLTGISIGGKQL--QASGFA-----KGGILIDSGT 361
            PI  T   P P  A      FY L+L G+++G  +L   AS FA      GG  IDSGT
Sbjct: 278 GPIQSTPFAPGPAGAPVGSQPFYFLSLRGVTVGETRLPFNASTFALKGDGSGGTFIDSGT 337

Query: 362 VITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDT--CFNLSAYQEV-NIPLVKMEFEGN 418
            IT  P +++ +L+  F+ Q    P A G++  D   CF++ A ++   +P + +  EG 
Sbjct: 338 AITFFPQAVFRSLREAFVAQVP-LPVAKGYTDPDNLLCFSVPAKKKAPAVPKLILHLEG- 395

Query: 419 AEMTVDVTGIVYFVKSDAS----QVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQL 474
           A+  +     V     D S    ++C+ + S    + T IIGN+QQ+N  ++YD +++++
Sbjct: 396 ADWELPRENYVLDNDDDGSGAGRKLCVVILSAGNSNGT-IIGNFQQQNMHIVYDLESNKM 454

Query: 475 GFAGEDCSSM 484
            FA   C  +
Sbjct: 455 VFAPARCDKL 464


>gi|15230868|ref|NP_189198.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|11994761|dbj|BAB03090.1| chloroplast nucleoid DNA binding protein-like; nucellin-like
           protein [Arabidopsis thaliana]
 gi|189339286|gb|ACD89063.1| At3g25700 [Arabidopsis thaliana]
 gi|332643533|gb|AEE77054.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 452

 Score =  147 bits (370), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 118/409 (28%), Positives = 184/409 (44%), Gaps = 37/409 (9%)

Query: 102 LQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWV 159
           L +R  + +S   K +   + P+ SG    +  Y   + +G   +++ +I DTGSDL WV
Sbjct: 51  LDTRRLHFLSLRRKPIPFVKSPVVSGAASGSGQYFVDLRIGQPPQSLLLIADTGSDLVWV 110

Query: 160 QCQPCKSC-YNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSS-PPDCNYF 217
           +C  C++C ++    VF P  S ++    C    C  +      + +C+ +     C+Y 
Sbjct: 111 KCSACRNCSHHSPATVFFPRHSSTFSPAHCYDPVCRLVP-KPDRAPICNHTRIHSTCHYE 169

Query: 218 VSYGDGSYTRGELGREHLGL----GK-ASVNDFIFGCGRNNKGL------FGGVSGLMGL 266
             Y DGS T G   RE   L    GK A +    FGCG    G       F G +G+MGL
Sbjct: 170 YGYADGSLTSGLFARETTSLKTSSGKEARLKSVAFGCGFRISGQSVSGTSFNGANGVMGL 229

Query: 267 GRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQ 326
           GR  +S  SQ    FG  FSYCL     +    S ++ GN       + + +T ++ NP 
Sbjct: 230 GRGPISFASQLGRRFGNKFSYCLMDYTLSPPPTSYLIIGNGG--DGISKLFFTPLLTNPL 287

Query: 327 LATFYILNLTGISIGGKQL-------QASGFAKGGILIDSGTVITRLPPSIYSALKAEFL 379
             TFY + L  + + G +L       +      GG ++DSGT +  L    Y ++ A   
Sbjct: 288 SPTFYYVKLKSVFVNGAKLRIDPSIWEIDDSGNGGTVVDSGTTLAFLAEPAYRSVIAAVR 347

Query: 380 KQFSGFPSA----PGFSILDTCFNLSAY--QEVNIPLVKMEFEGNAEMTVDVTGIVYFVK 433
           ++    P A    PGF   D C N+S     E  +P +K EF G A          YF++
Sbjct: 348 RRVK-LPIADALTPGF---DLCVNVSGVTKPEKILPRLKFEFSGGAVFVPPPRN--YFIE 401

Query: 434 SDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
           ++    CLA+ S+  +    +IGN  Q+     +D   S+LGF+   C+
Sbjct: 402 TEEQIQCLAIQSVDPKVGFSVIGNLMQQGFLFEFDRDRSRLGFSRRGCA 450


>gi|356539555|ref|XP_003538263.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 438

 Score =  146 bits (369), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 131/438 (29%), Positives = 206/438 (47%), Gaps = 57/438 (13%)

Query: 68  TLELKH-----KNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEI 122
           TLE+ H       +   K + W E        D   +Q+L S    M++G       + +
Sbjct: 34  TLEVFHVFSPCSPFRPSKPLSWAESVLQLQAKDQARLQFLAS----MVAGR------SIV 83

Query: 123 PLTSGIRL-QTLNYIATIELGGRNMTVI--VDTGSDLTWVQCQPCKSCYNQQDPVFDPSI 179
           P+ SG ++ Q+  YI   ++G    T++  +DT +D  W+ C  C  C +    +F P  
Sbjct: 84  PIASGRQIIQSPTYIVRAKIGTPPQTLLLAIDTSNDAAWIPCTACDGCTST---LFAPEK 140

Query: 180 SPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGK 239
           S ++K V C S  C+ +      S  C +S+   C + ++YG  S     + ++ + L  
Sbjct: 141 STTFKNVSCGSPECNKVP-----SPSCGTSA---CTFNLTYGSSSIA-ANVVQDTVTLAT 191

Query: 240 ASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASG 299
             +  + FGC     G      GL+GLGR  LSL+SQT  ++   FSYCLPS +    SG
Sbjct: 192 DPIPGYTFGCVAKTTGPSTPPQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPSFKSLNFSG 251

Query: 300 SLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGK-------QLQASGFAK 352
           SL LG  +   +    I YT ++ NP+ ++ Y +NL  I +G K        L  +    
Sbjct: 252 SLRLGPVAQPIR----IKYTPLLKNPRRSSLYYVNLFAIRVGRKIVDIPPAALAFNAATG 307

Query: 353 GGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI-----LDTCFNLSAYQEVN 407
            G + DSGTV TRL   +Y+A++ EF ++ +    A   ++      DTC+ +     + 
Sbjct: 308 AGTVFDSGTVFTRLVAPVYTAVRDEFRRRVAMAAKA-NLTVTSLGGFDTCYTV----PIV 362

Query: 408 IPLVKMEFEGNAEMTVDVTGIVYFVKSDA-SQVCLALASL--SYEDETGIIGNYQQKNQR 464
            P +   F G   M V +      + S A S  CLA+AS   +      +I N QQ+N R
Sbjct: 363 APTITFMFSG---MNVTLPQDNILIHSTAGSTSCLAMASAPDNVNSVLNVIANMQQQNHR 419

Query: 465 VIYDTKNSQLGFAGEDCS 482
           V+YD  NS+LG A E C+
Sbjct: 420 VLYDVPNSRLGVARELCT 437


>gi|359476199|ref|XP_003631804.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 421

 Score =  146 bits (369), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 132/419 (31%), Positives = 194/419 (46%), Gaps = 86/419 (20%)

Query: 77  CSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYI 136
           CSG         Q     D   V ++ S+     SGN+K+ ++      + +  +  N++
Sbjct: 75  CSGSGHSQPPSPQEIFGRDESRVSFINSKCNQYTSGNLKNHAHN-----NNLFDEDGNFL 129

Query: 137 ATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCH 194
             +  G   +N  +I+DTGS +TW QC+ C +C       F+ S S +Y    C   T  
Sbjct: 130 VDVAFGTPPQNFMLILDTGSSITWTQCKACVNCLQDSHRYFNWSASSTYSSGSCIPGTVE 189

Query: 195 ALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASV-NDFIFGCGRNN 253
                               NY ++YGD S + G  G + + L  + V   F FGCGRNN
Sbjct: 190 N-------------------NYNMTYGDDSTSVGNYGCDTMTLEPSDVFQKFQFGCGRNN 230

Query: 254 KGLFG-GVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKN 312
           KG FG GV G++GLG+  LS VSQT+  F  +FSYCLP   +  + GSL+ G  ++    
Sbjct: 231 KGDFGSGVDGMLGLGQGQLSTVSQTASKFNKVFSYCLP---EEDSIGSLLFGEKAT--SQ 285

Query: 313 STPITYTNMIPNP---QLATFYILNLTGISIGGKQLQ--ASGFAKGGILIDSGTVITRLP 367
           S+ + +T+++  P   Q + +Y +NL+ IS+G ++L   +S FA  G +IDS TVITRLP
Sbjct: 286 SSSLKFTSLVNGPGTLQESGYYFVNLSDISVGNERLNIPSSVFASPGTIIDSRTVITRLP 345

Query: 368 PSIYSALKAEFLKQFSGFPSAPGF----SILDTCFNLSAYQEVNIPLVKMEFEGNAEMTV 423
              YSALKA F K  + +P + G      ILDTC+N                    E+T 
Sbjct: 346 QRAYSALKAAFKKAMAKYPLSNGRRKKGDILDTCYNXXXXXX-------------PELT- 391

Query: 424 DVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
                                         IIGN QQ +  V+YD +  ++GF    CS
Sbjct: 392 ------------------------------IIGNRQQLSLTVLYDIQGGRIGFRSNGCS 420


>gi|414878073|tpg|DAA55204.1| TPA: hypothetical protein ZEAMMB73_344109 [Zea mays]
          Length = 440

 Score =  146 bits (369), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 125/382 (32%), Positives = 177/382 (46%), Gaps = 44/382 (11%)

Query: 135 YIATIELGG--RNMTVIVDTGSDLTWVQCQPCKS--CYNQQDPVFDPSISPSYKKVLCNS 190
           YIA   +G   +    I+DTGS+L W QC  C+   C++Q    +DPS S + + V CN 
Sbjct: 71  YIAEYLIGDPPQQAEAIIDTGSNLIWTQCSTCQPAGCFSQNLSFYDPSRSRTARPVACND 130

Query: 191 STCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVN-DFIFGC 249
           + C     A G+   C+  +   C    +YG G    G LG E       S N    FGC
Sbjct: 131 TAC-----ALGSETRCARDNK-ACAVLTAYGAG-VIGGVLGTEAFTFQPQSENVSLAFGC 183

Query: 250 ---GRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL-PSTQDAGASGSLILGG 305
               R   G   G SG++GLGR +LSLVSQ  +     FSYCL P    +  +  L +G 
Sbjct: 184 IAATRLTPGSLDGASGIIGLGRGNLSLVSQLGD---NKFSYCLTPYFSQSTNTSRLFVGA 240

Query: 306 NSSVFKNSTPITYTNMIPNPQL---ATFYILNLTGISIGGKQL----------QASGFAK 352
           ++ +     P T    + NP +   +TFY L LTGI++G  +L          Q +    
Sbjct: 241 SAGLSSGGAPATSVPFLKNPDVDPFSTFYYLPLTGITVGDAKLAVPEAAFDLRQVATGLW 300

Query: 353 GGILIDSGTVITRLPPSIYSALKAEFLKQFSG--FPSAPGFSILDTCFNLSAYQEVN--I 408
            G LIDSG+  T L    Y AL+ E ++Q      P   G   LD C  + A+ +V   +
Sbjct: 301 AGTLIDSGSPFTSLVDVAYQALRDELVQQLGASIVPPPAGAEGLDLCAAV-AHGDVGKLV 359

Query: 409 PLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYED------ETGIIGNYQQKN 462
           P + + F G+    V V    Y+   D S  C+ + S    +      ET IIGNY Q++
Sbjct: 360 PPLVLHF-GSGGGDVAVPPENYWGPVDDSTACMVVFSSGGPNSTLPMNETTIIGNYMQQD 418

Query: 463 QRVIYDTKNSQLGFAGEDCSSM 484
             ++YD +   L F   DCSSM
Sbjct: 419 MHLLYDLEKGMLSFQPADCSSM 440


>gi|357456413|ref|XP_003598487.1| Peptidase A1, putative [Medicago truncatula]
 gi|355487535|gb|AES68738.1| Peptidase A1, putative [Medicago truncatula]
          Length = 414

 Score =  146 bits (369), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 129/415 (31%), Positives = 196/415 (47%), Gaps = 41/415 (9%)

Query: 86  EQQQNRLILDNLHV--QYLQSRIKNMISGNIKD---VSNTEIPLTSGIRL-QTLNYIATI 139
           + Q N   L  +HV    LQ + K+       D      + +P+ SG ++ Q+  YI   
Sbjct: 23  DVQDNGSTLQVIHVFKSVLQMQAKDTTRLQFLDSLVARKSVVPIASGRQIIQSPTYIVRA 82

Query: 140 ELGGRNMTVIV--DTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALE 197
           ++G    T+++  DT +D  W+ C  C  C +    +F P  S ++K V C +  C  + 
Sbjct: 83  KIGTPPQTLLLAMDTSNDAAWIPCTACDGCAST---LFAPEKSTTFKNVSCAAPECKQVP 139

Query: 198 FATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLF 257
               N G C  SS   CN+ ++YG  S     L ++ + L    V  + FGC     G  
Sbjct: 140 ----NPG-CGVSS---CNFNLTYGSSSIA-ANLVQDTITLATDPVPSYTFGCVSKTTGTS 190

Query: 258 GGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPIT 317
               GL+GLGR  LSL+SQT  ++   FSYCLPS +    SGSL LG  +   +    I 
Sbjct: 191 APPQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPSFKSLNFSGSLRLGPVAQPKR----IK 246

Query: 318 YTNMIPNPQLATFYILNLTGISIGGKQLQASGFA-------KGGILIDSGTVITRLPPSI 370
           YT ++ NP+ ++ Y +NL  I +G K +     A         G + DSGTV TRL   +
Sbjct: 247 YTPLLKNPRRSSLYYVNLEAIRVGRKVVDIPPAALAFNPTTGAGTIFDSGTVFTRLVAPV 306

Query: 371 YSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVY 430
           Y A++ EF ++     +       DTC+N+     + +P +   F G   M V +     
Sbjct: 307 YVAVRDEFRRRVGPKLTVTSLGGFDTCYNV----PIVVPTITFIFTG---MNVTLPQDNI 359

Query: 431 FVKSDA-SQVCLALASL--SYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
            + S A S  CLA+A    +      +I N QQ+N RV+YD  NS++G A E C+
Sbjct: 360 LIHSTAGSTTCLAMAGAPDNVNSVLNVIANMQQQNHRVLYDVPNSRVGVARELCT 414


>gi|190896584|gb|ACE96805.1| aspartyl protease [Populus tremula]
 gi|190896586|gb|ACE96806.1| aspartyl protease [Populus tremula]
 gi|190896588|gb|ACE96807.1| aspartyl protease [Populus tremula]
 gi|190896590|gb|ACE96808.1| aspartyl protease [Populus tremula]
 gi|190896592|gb|ACE96809.1| aspartyl protease [Populus tremula]
 gi|190896594|gb|ACE96810.1| aspartyl protease [Populus tremula]
 gi|190896596|gb|ACE96811.1| aspartyl protease [Populus tremula]
 gi|190896598|gb|ACE96812.1| aspartyl protease [Populus tremula]
 gi|190896600|gb|ACE96813.1| aspartyl protease [Populus tremula]
 gi|190896602|gb|ACE96814.1| aspartyl protease [Populus tremula]
 gi|190896604|gb|ACE96815.1| aspartyl protease [Populus tremula]
 gi|190896606|gb|ACE96816.1| aspartyl protease [Populus tremula]
 gi|190896610|gb|ACE96818.1| aspartyl protease [Populus tremula]
 gi|190896612|gb|ACE96819.1| aspartyl protease [Populus tremula]
 gi|190896614|gb|ACE96820.1| aspartyl protease [Populus tremula]
 gi|190896616|gb|ACE96821.1| aspartyl protease [Populus tremula]
 gi|190896618|gb|ACE96822.1| aspartyl protease [Populus tremula]
 gi|190896620|gb|ACE96823.1| aspartyl protease [Populus tremula]
 gi|190896622|gb|ACE96824.1| aspartyl protease [Populus tremula]
 gi|190896624|gb|ACE96825.1| aspartyl protease [Populus tremula]
 gi|190896626|gb|ACE96826.1| aspartyl protease [Populus tremula]
 gi|190896628|gb|ACE96827.1| aspartyl protease [Populus tremula]
 gi|190896630|gb|ACE96828.1| aspartyl protease [Populus tremula]
 gi|190896632|gb|ACE96829.1| aspartyl protease [Populus tremula]
 gi|190896634|gb|ACE96830.1| aspartyl protease [Populus tremula]
 gi|190896636|gb|ACE96831.1| aspartyl protease [Populus tremula]
 gi|190896638|gb|ACE96832.1| aspartyl protease [Populus tremula]
 gi|190896640|gb|ACE96833.1| aspartyl protease [Populus tremula]
 gi|190896642|gb|ACE96834.1| aspartyl protease [Populus tremula]
 gi|190896644|gb|ACE96835.1| aspartyl protease [Populus tremula]
 gi|190896646|gb|ACE96836.1| aspartyl protease [Populus tremula]
 gi|190896648|gb|ACE96837.1| aspartyl protease [Populus tremula]
 gi|190896650|gb|ACE96838.1| aspartyl protease [Populus tremula]
 gi|190896652|gb|ACE96839.1| aspartyl protease [Populus tremula]
 gi|190896654|gb|ACE96840.1| aspartyl protease [Populus tremula]
 gi|190896656|gb|ACE96841.1| aspartyl protease [Populus tremula]
 gi|190896658|gb|ACE96842.1| aspartyl protease [Populus tremula]
          Length = 339

 Score =  146 bits (368), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 106/315 (33%), Positives = 155/315 (49%), Gaps = 26/315 (8%)

Query: 113 NIKDVSNTEIPLTSGIR-LQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYN 169
            + D   T +P+  G + L+  NY+  ++LG  G+ M +++DT +D  WV C  C  C +
Sbjct: 22  TLADQKTTAVPIAPGQQVLKIANYVVRVKLGTPGQQMFMVLDTSNDAAWVPCSGCTGCSS 81

Query: 170 QQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGE 229
                F P+ S +   + C+ + C  +    G S  C ++    C +  SYG  S     
Sbjct: 82  T---TFLPNASTTLGSLDCSEAQCSQVR---GFS--CPATGSSACLFNQSYGGDSSLAAT 133

Query: 230 LGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL 289
           L ++ + L    +  F FGC     G      GL+GLGR  +SL+SQ   ++ G+FSYCL
Sbjct: 134 LVQDAITLANDVIPGFTFGCINAVSGGSIPPQGLLGLGRGPISLISQAGAMYSGVFSYCL 193

Query: 290 PSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIG-------G 342
           PS +    SGSL LG           I  T ++ NP   + Y +NLTG+S+G        
Sbjct: 194 PSFKSYYFSGSLKLGP----VGQPKSIRTTPLLRNPHRPSLYYVNLTGVSVGRIKVPIPS 249

Query: 343 KQLQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSA 402
           +QL        G +IDSGTVITR    +Y A++ EF KQ +G  S+ G    DTCF  +A
Sbjct: 250 EQLVFDPNTGAGTIIDSGTVITRFVQPVYFAIRDEFRKQVNGPISSLG--AFDTCF--AA 305

Query: 403 YQEVNIPLVKMEFEG 417
             E   P V + FEG
Sbjct: 306 TNEAEAPAVTLHFEG 320


>gi|356500756|ref|XP_003519197.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 451

 Score =  146 bits (368), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 134/428 (31%), Positives = 211/428 (49%), Gaps = 51/428 (11%)

Query: 74  KNYCSGK---IVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRL 130
           KNY +     I+D   +   R++       YL S     +  +++    +  P+ SG   
Sbjct: 56  KNYSTSWENIIIDMASKDPERVV-------YLSS-----LDASLRRKPISAAPIASGQAF 103

Query: 131 QTLNYIATIELGGRN--MTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLC 188
              +Y+  ++LG  N    +++DT +D  WV C  C  C +     + P  S +Y   + 
Sbjct: 104 GIGSYVVRVKLGSPNQLFFMVLDTSTDEAWVPCTGCTGC-SSSSTYYSPQASTTYGGAV- 161

Query: 189 NSSTCHALEFATGNSGV-CSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIF 247
               C+A   A     + C  +    C +  SY  GS     L ++ L LG  ++  + F
Sbjct: 162 ---ACYAPRCAQARGALPCPYTGSKACTFNQSYA-GSTFSATLVQDSLRLGIDTLPSYAF 217

Query: 248 GCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNS 307
           GC  +  G      GL+GLGR  LSL SQ+S+++ G+FSYCLPS Q +  SGSL LG   
Sbjct: 218 GCVNSASGWTLPAQGLLGLGRGPLSLPSQSSKLYSGIFSYCLPSFQSSYFSGSLKLGPTG 277

Query: 308 SVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ------ASGFAKG-GILIDSG 360
              +    I  T ++ NP+  + Y +NLTG+++G  ++       A    KG G ++DSG
Sbjct: 278 QPRR----IRTTPLLQNPRRPSLYYVNLTGVTVGRVKVPLPIEYLAFDPNKGSGTILDSG 333

Query: 361 TVITRLPPSIYSALKAEFLKQFSG-FPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNA 419
           TVITR    +YSA++ EF  Q  G F S  GF   DTCF +  Y+ +  PL+K+ F G  
Sbjct: 334 TVITRFVGPVYSAIRDEFRNQVKGPFFSRGGF---DTCF-VKTYENLT-PLIKLRFTG-- 386

Query: 420 EMTVDVT-----GIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQL 474
              +DVT      +++      + + +A A  +      +I NYQQ+N RV++DT N+++
Sbjct: 387 ---LDVTLPYENTLIHTAYGGMACLAMAAAPNNVNSVLNVIANYQQQNLRVLFDTVNNRV 443

Query: 475 GFAGEDCS 482
           G A E C+
Sbjct: 444 GIARELCN 451


>gi|55296886|dbj|BAD68338.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
           sativa Japonica Group]
 gi|55296941|dbj|BAD68392.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
           sativa Japonica Group]
          Length = 424

 Score =  146 bits (368), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 102/339 (30%), Positives = 148/339 (43%), Gaps = 71/339 (20%)

Query: 150 VDTGSDLTWVQCQPCK--SCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGV-C 206
           +DT  DL W+QC PC    CY QQ+ +FDP  S +   V C S+ C  L    G  G  C
Sbjct: 150 IDTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGEL----GRYGAGC 205

Query: 207 SSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASV-NDFIFGCGRNNKGLFGGVSGLMG 265
           S++    C YFV YGDG  T G    + L L  ++V  +F FGC                
Sbjct: 206 SNNQ---CQYFVDYGDGRATSGTYMVDALTLNPSTVVMNFRFGC---------------- 246

Query: 266 LGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNP 325
                                             S  + GN S   + T    T ++ NP
Sbjct: 247 ----------------------------------SHAVRGNFSASTSGTMFARTPLVRNP 272

Query: 326 QL-ATFYILNLTGISIGGKQLQASGFA-KGGILIDSGTVITRLPPSIYSALKAEFLKQFS 383
            +  T Y++ L GI +GG++L        GG ++DS  +IT+LPP+ Y AL+  F    +
Sbjct: 273 SIIPTLYLVRLRGIEVGGRRLNVPPVVFAGGAVMDSSVIITQLPPTAYRALRLAFRSAMA 332

Query: 384 GFPS-APGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLA 442
            +P  A G + LDTC++   +  V +P V + F+G A + +D  G++        + CLA
Sbjct: 333 AYPRVAGGRAGLDTCYDFVRFTSVTVPAVSLVFDGGAVVRLDAMGVMV-------EGCLA 385

Query: 443 LASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
                 +   G IGN QQ+   V+YD     +GF    C
Sbjct: 386 FVPTPGDFALGFIGNVQQQTHEVLYDVGGGSVGFRRGAC 424


>gi|55296937|dbj|BAD68388.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
           sativa Japonica Group]
 gi|218197467|gb|EEC79894.1| hypothetical protein OsI_21421 [Oryza sativa Indica Group]
          Length = 424

 Score =  146 bits (368), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 102/339 (30%), Positives = 148/339 (43%), Gaps = 71/339 (20%)

Query: 150 VDTGSDLTWVQCQPCK--SCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGV-C 206
           +DT  DL W+QC PC    CY QQ+ +FDP  S +   V C S+ C  L    G  G  C
Sbjct: 150 IDTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGEL----GRYGAGC 205

Query: 207 SSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASV-NDFIFGCGRNNKGLFGGVSGLMG 265
           S++    C YFV YGDG  T G    + L L  ++V  +F FGC                
Sbjct: 206 SNNQ---CQYFVDYGDGRATSGTYMVDALTLNPSTVVMNFRFGC---------------- 246

Query: 266 LGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNP 325
                                             S  + GN S   + T    T ++ NP
Sbjct: 247 ----------------------------------SHAVRGNFSASTSGTMFARTPLVRNP 272

Query: 326 QL-ATFYILNLTGISIGGKQLQASGFA-KGGILIDSGTVITRLPPSIYSALKAEFLKQFS 383
            +  T Y++ L GI +GG++L        GG ++DS  +IT+LPP+ Y AL+  F    +
Sbjct: 273 SIIPTLYLVRLRGIEVGGRRLNVPPVVFAGGAVMDSSVIITQLPPTAYRALRLAFRSAMA 332

Query: 384 GFPS-APGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLA 442
            +P  A G + LDTC++   +  V +P V + F+G A + +D  G++        + CLA
Sbjct: 333 AYPRVAGGRAGLDTCYDFVRFTSVTVPAVSLVFDGGAVVRLDAMGVMV-------EGCLA 385

Query: 443 LASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
                 +   G IGN QQ+   V+YD     +GF    C
Sbjct: 386 FVPTPGDFALGFIGNVQQQTHEVLYDVVGGSVGFRRGAC 424


>gi|115466078|ref|NP_001056638.1| Os06g0121500 [Oryza sativa Japonica Group]
 gi|113594678|dbj|BAF18552.1| Os06g0121500 [Oryza sativa Japonica Group]
          Length = 442

 Score =  146 bits (368), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 101/339 (29%), Positives = 147/339 (43%), Gaps = 71/339 (20%)

Query: 150 VDTGSDLTWVQCQPCK--SCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGV-C 206
           +DT  DL W+QC PC    CY QQ+ +FDP  S +   V C S+ C  L    G  G  C
Sbjct: 168 IDTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGEL----GRYGAGC 223

Query: 207 SSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASV-NDFIFGCGRNNKGLFGGVSGLMG 265
           S++    C YFV YGDG  T G    + L L  ++V  +F FGC    +           
Sbjct: 224 SNNQ---CQYFVDYGDGRATSGTYMVDALTLNPSTVVMNFRFGCSHAVR----------- 269

Query: 266 LGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNP 325
                                                  GN S   + T    T ++ NP
Sbjct: 270 ---------------------------------------GNFSASTSGTMFARTPLVRNP 290

Query: 326 QL-ATFYILNLTGISIGGKQLQASGFA-KGGILIDSGTVITRLPPSIYSALKAEFLKQFS 383
            +  T Y++ L GI +GG++L        GG ++DS  +IT+LPP+ Y AL+  F    +
Sbjct: 291 SIIPTLYLVRLRGIEVGGRRLNVPPVVFAGGAVMDSSVIITQLPPTAYRALRLAFRSAMA 350

Query: 384 GFPS-APGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLA 442
            +P  A G + LDTC++   +  V +P V + F+G A + +D  G++        + CLA
Sbjct: 351 AYPRVAGGRAGLDTCYDFVRFTSVTVPAVSLVFDGGAVVRLDAMGVMV-------EGCLA 403

Query: 443 LASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
                 +   G IGN QQ+   V+YD     +GF    C
Sbjct: 404 FVPTPGDFALGFIGNVQQQTHEVLYDVVGGSVGFRRGAC 442


>gi|449451908|ref|XP_004143702.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
 gi|449529900|ref|XP_004171936.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
          Length = 459

 Score =  146 bits (368), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 118/391 (30%), Positives = 180/391 (46%), Gaps = 42/391 (10%)

Query: 123 PLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSC-YNQQDPVFDPSI 179
           PL SG    +  Y   I LG   +++ ++ DTGSDL WV+C  C++C ++     F P  
Sbjct: 76  PLISGASTGSGQYFVDIRLGTPPQSLLLVADTGSDLVWVKCSACRNCSHHPPSSAFLPRH 135

Query: 180 SPSYKKVLCNSSTCHALEFATGNSGVCSSS---SPPDCNYFVSYGDGSYTRGELGREHLG 236
           S S+    C    C  L  A  +  +C+ +   SP  C +  SY DGS + G   +E   
Sbjct: 136 SSSFSPFHCFDPHCRLLPHAPHH--LCNHTRLHSP--CRFLYSYADGSLSSGFFSKETTT 191

Query: 237 LGKAS-----VNDFIFGCGRNNKGL------FGGVSGLMGLGRSDLSLVSQTSEIFGGLF 285
           L   S     +    FGCG    G       F G  G+MGLGR  +S  SQ    FG  F
Sbjct: 192 LKSLSGSEIHLKGLSFGCGFRISGPSVSGAQFNGARGVMGLGRGSISFSSQLGRRFGNKF 251

Query: 286 SYCLPS-TQDAGASGSLILGG--NSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGG 342
           SYCL   T     +  L++GG  +S    N+T I+YT +  NP   TFY + +  I+I G
Sbjct: 252 SYCLMDYTLSPPPTSFLMIGGGLHSLPLTNATKISYTPLQINPLSPTFYYITIHSITIDG 311

Query: 343 KQL-------QASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSA----PGF 391
            +L       +      GG ++DSGT +T L  + Y  +     ++    P+A    PGF
Sbjct: 312 VKLPINPAVWEIDEQGNGGTVVDSGTTLTYLTKTAYEEVLKSVRRRVK-LPNAAELTPGF 370

Query: 392 SILDTCFNLSAY-QEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYED 450
              D C N S   +  ++P ++    G A          YF++++   +CLA+ ++   +
Sbjct: 371 ---DLCVNASGESRRPSLPRLRFRLGGGAVFAPPPRN--YFLETEEGVMCLAIRAVESGN 425

Query: 451 ETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
              +IGN  Q+   + +D + S+LGF    C
Sbjct: 426 GFSVIGNLMQQGFLLEFDKEESRLGFTRRGC 456


>gi|242094480|ref|XP_002437730.1| hypothetical protein SORBIDRAFT_10g001450 [Sorghum bicolor]
 gi|241915953|gb|EER89097.1| hypothetical protein SORBIDRAFT_10g001450 [Sorghum bicolor]
          Length = 507

 Score =  145 bits (366), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 122/419 (29%), Positives = 193/419 (46%), Gaps = 52/419 (12%)

Query: 88  QQNRLILDNLHVQYLQS-----RIKNMISGNIKDVSNTE---------IPLTSGIRLQTL 133
           +Q++L +D++H++ L S     R+     G +K+   +E         I +T G   +  
Sbjct: 45  RQDQLRVDHIHMRLLSSSSQGVRVSKQKQGPVKEPVRSEVIHLHDQPVIQVTIGSERKGA 104

Query: 134 NYIATIEL-------GGRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPV--FDPSISPSYK 184
           +  +            G   TV++DT SD+ WVQC P  S          +DP+ S +Y 
Sbjct: 105 SGGSGGSGDQQQSQAAGVVQTVVLDTASDVPWVQCHPLASSATTDSSSSSYDPARSSTYY 164

Query: 185 KVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTR---GELGREHLGLGKAS 241
            + CNS+ C   E      G C ++    C Y V       +    G  G + L L    
Sbjct: 165 ALACNSAAC--TELGRLYRGACVNN---QCQYRVPIPSSPASSSSSGTYGSDLLKLTADP 219

Query: 242 VN----DFIFGC--GRNNKGLFGGV----SGLMGLGRSDLSLVSQTSEIFGGLFSYCLPS 291
            +     F FGC  G   +G  G +    +G+M LG    SLVSQ + ++G  FSYC+P+
Sbjct: 220 ADGASMSFKFGCSHGEAKQGGEGSIDNATAGIMALGGGPESLVSQNAAMYGSAFSYCIPA 279

Query: 292 TQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQA--SG 349
           T+       ++ GG       +     T M+   ++ T Y + L  I++ G+QL    S 
Sbjct: 280 TESRRPGFFVLGGGVGD-LSGAGGYAVTPMLRYARVPTLYRVRLLAIAVDGQQLNVTPSV 338

Query: 350 FAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIP 409
           FA G +L DS T ITRLPP+ Y AL+  F  + + +  AP    LDTC++ +    V +P
Sbjct: 339 FASGSVL-DSRTAITRLPPTAYQALREAFRSRMAMYREAPPQGNLDTCYDFAGAFLVMVP 397

Query: 410 LVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYD 468
            V +  +GNA + +D  GI++         CL   S + +   GI+GN QQ+   V+Y+
Sbjct: 398 RVALLLDGNAVVALDRQGILF-------HDCLVFTSNTDDRMPGILGNVQQQTMEVLYN 449


>gi|343198386|gb|AEM05966.1| nodulin 41 [Phaseolus vulgaris]
          Length = 437

 Score =  145 bits (366), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 130/432 (30%), Positives = 197/432 (45%), Gaps = 37/432 (8%)

Query: 67  ITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTS 126
            T++L H++       + +     R+I   L      +R+ N++  N K +  + + L +
Sbjct: 29  FTVDLIHRDSPLSPFYNPSLTPSQRIINAALRSISRLNRVSNLLDQNNK-LPQSVLILHN 87

Query: 127 GIRLQTLNYIATIELGGRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKV 186
           G  L    YI T  +         DTGSDL WVQC PC SC+ Q  P+F P  S ++   
Sbjct: 88  GEYLMRF-YIGTPPV---ERLATADTGSDLIWVQCSPCASCFPQSTPLFQPLKSSTFMPT 143

Query: 187 LCNSSTCHAL---EFATGNSGVCSSSSPPDCNYFVSYGDG-SYTRGELGREHL------G 236
            C S  C  L   +   G SG        +C Y   YGD  S++ G L  E L      G
Sbjct: 144 TCRSQPCTLLLPEQKGCGKSG--------ECIYTYKYGDQYSFSEGLLSTETLRFDSQGG 195

Query: 237 LGKASVNDFIFGCG-RNNKGLFGG--VSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQ 293
           +   +  +  FGCG  NN  +F    ++G+MGLG   LSLVSQ  +  G  FSYCL    
Sbjct: 196 VQTVAFPNSFFGCGLYNNITVFPSYKLTGIMGLGAGPLSLVSQIGDQIGHKFSYCL--LP 253

Query: 294 DAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKG 353
               S S +  GN S+      +  T MI  P L T+Y LNL  +++  K +  +G   G
Sbjct: 254 LGSTSTSKLKFGNESIITGEG-VVSTPMIIKPWLPTYYFLNLEAVTVAQKTV-PTGSTDG 311

Query: 354 GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQE-VNIPLVK 412
            ++IDSGT++T L  S Y    A   +  +        S L  CF    Y++    P + 
Sbjct: 312 NVIIDSGTLLTYLGESFYYNFAASLQESLAVELVQDVLSPLPFCF---PYRDNFVFPEIA 368

Query: 413 MEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNS 472
            +F G A +++    + + +  D + VCL +A  S      I G++ Q + +V YD +  
Sbjct: 369 FQFTG-ARVSLKPANL-FVMTEDRNTVCLMIAPSSVSG-ISIFGSFSQIDFQVEYDLEGK 425

Query: 473 QLGFAGEDCSSM 484
           ++ F   DCS +
Sbjct: 426 KVSFQPTDCSKV 437


>gi|356540510|ref|XP_003538731.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 417

 Score =  145 bits (365), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 120/405 (29%), Positives = 189/405 (46%), Gaps = 68/405 (16%)

Query: 134 NYIATIELGG---RNMTVIVDTGSDLTWVQCQP-----CKSCYNQQDPVFDPSISPSYKK 185
           +Y  +  LG    +++T+ +DTGSDL W  C P     C+  +N   P+   +I+ S++ 
Sbjct: 18  DYTLSFNLGSHPSQSITLYMDTGSDLVWFPCAPFECILCEGKFNATKPL---NITRSHR- 73

Query: 186 VLCNSSTCHALEFATGNSGVCSSSSPP-------DCN------YFVSYGDGSYTRGELGR 232
           V C S  C     +  +  +C+ +  P       DC+      ++ +YGDGS+    L R
Sbjct: 74  VSCQSPACSTAHSSVSSHDLCAIARCPLDNIETSDCSSATCPPFYYAYGDGSFI-AHLHR 132

Query: 233 EHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEI---FGGLFSYCL 289
           + L + +  + +F FGC           +G+ G GR  LSL +Q + +    G  FSYCL
Sbjct: 133 DTLSMSQLFLKNFTFGCAHT---ALAEPTGVAGFGRGLLSLPAQLATLSPNLGNRFSYCL 189

Query: 290 PS----TQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL 345
            S     +       LILG             YT+M+ NP+ + FY + LTGIS+G + +
Sbjct: 190 VSHSFDKERVRKPSPLILGHYDDYSSERVEFVYTSMLRNPKHSYFYCVGLTGISVGKRTI 249

Query: 346 QASGFAK-------GGILIDSGTVITRLPPSIYSALKAEF-------LKQFSGFPSAPGF 391
            A    +       GG+++DSGT  T LP S+Y+++ AEF        K+ S      G 
Sbjct: 250 LAPEMLRRVDRRGDGGVVVDSGTTFTMLPASLYNSVVAEFDRRVGRVHKRASEVEEKTG- 308

Query: 392 SILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVK-----SDASQVCLALASL 446
             L  C+ L    EV  P V   F GN    V +  + YF +      +A +    L  +
Sbjct: 309 --LGPCYFLEGLVEV--PTVTWHFLGNNS-NVMLPRMNYFYEFLDGEDEARRKVGCLMLM 363

Query: 447 SYEDET-------GIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
           +  D+T        I+GNYQQ+   V+YD +N ++GFA   C+S+
Sbjct: 364 NGGDDTELSGGPGAILGNYQQQGFEVVYDLENQRVGFAKRQCASL 408


>gi|242073262|ref|XP_002446567.1| hypothetical protein SORBIDRAFT_06g018180 [Sorghum bicolor]
 gi|241937750|gb|EES10895.1| hypothetical protein SORBIDRAFT_06g018180 [Sorghum bicolor]
          Length = 453

 Score =  144 bits (364), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 113/372 (30%), Positives = 167/372 (44%), Gaps = 55/372 (14%)

Query: 147 TVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVC 206
           +  +DT SDL W+QCQPC SCY Q DP+F+P +S SY  V C+S TC  L+   G+   C
Sbjct: 102 SAAIDTASDLVWLQCQPCVSCYRQLDPIFNPRLSSSYAVVPCSSDTCSQLD---GHR--C 156

Query: 207 SSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNN-KGLFGGVSGLMG 265
                  C Y   Y   + T G L  + L +G    +  + GC  ++  G     SGL+G
Sbjct: 157 DEDDDQACRYNYKYSGNAVTNGTLAIDKLAVGGNVFHAVVLGCSDSSVGGPPPQASGLVG 216

Query: 266 LGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILG---GNSSVFKNSTPITYTNMI 322
           L R  LSL+SQ S      F YCLP    +   G L+LG   G  +V   S  +T T M 
Sbjct: 217 LARGPLSLLSQLSV---RRFMYCLPPPM-SRTPGKLVLGAGAGADAVRNVSDRVTVT-MS 271

Query: 323 PNPQLATFYILNLTGISIGGKQLQASGFAKG-------------------------GILI 357
            + +  ++Y LN  G+++G    Q  G  +                          G+++
Sbjct: 272 SSTRYPSYYYLNFDGLAVGD---QTPGTIRRPTSPPATGGGVGGGGGDGGSGANAYGMIV 328

Query: 358 DSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI-LDTCFNLS---AYQEVNIPLVKM 413
           D  + I+ L  S+Y  L  +  ++     + P   + LD CF L        V +P V M
Sbjct: 329 DVASTISFLEASLYDELADDLEEEIRLPRATPSTRLGLDLCFILPEGVGIDRVYVPTVSM 388

Query: 414 EFEGN-AEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNS 472
            F+G   E+  D          D   +CL +   S      I+GNYQQ+N  V+Y+ +  
Sbjct: 389 SFDGRWLELERD-----RLFLEDGRMMCLMIGRTS---GVSILGNYQQQNMHVLYNLRRG 440

Query: 473 QLGFAGEDCSSM 484
           ++ FA   C S+
Sbjct: 441 KITFAKASCDSL 452


>gi|255545932|ref|XP_002514026.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223547112|gb|EEF48609.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 437

 Score =  144 bits (364), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 138/421 (32%), Positives = 203/421 (48%), Gaps = 43/421 (10%)

Query: 77  CSGKIVDWNEQQQNRLI----LDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQT 132
           CS  I    E   N +I     D   ++YL S    M          T +P+  G ++  
Sbjct: 43  CSPFIPPKQEPLVNTVIDMASKDPARLKYLSSLAAQM---------TTAVPIAPGQQVLN 93

Query: 133 L-NYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCN 189
           + NY+  ++LG  G+ M +++DT +D  WV C  C  C +        + S +Y  + C+
Sbjct: 94  IGNYVVRVKLGTPGQFMFMVLDTSNDAAWVPCSGCTGCSSTTFST---NTSSTYGSLDCS 150

Query: 190 SSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGC 249
            + C  +    G S  C ++    C +  SYG  S     L  + L L    + +F FGC
Sbjct: 151 MAQCTQVR---GFS--CPATGSSSCVFNQSYGGDSSFSATLVEDSLRLVNDVIPNFAFGC 205

Query: 250 GRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSV 309
             +  G      GL+GLGR  LSL++Q+  ++ GLFSYCLPS +    SGSL LG     
Sbjct: 206 INSISGGSVPPQGLLGLGRGPLSLIAQSGSLYSGLFSYCLPSFKSYYFSGSLKLGPAG-- 263

Query: 310 FKNSTPITYTNMIPNPQLATFYILNLTGISIG------GKQLQASGFAKG-GILIDSGTV 362
                 I YT ++ NP   + Y +NLTG+S+G        +L A     G G +IDSGTV
Sbjct: 264 --QPKSIRYTPLLRNPHRPSLYYVNLTGVSVGRTLVPIAPELLAFNPNTGAGTIIDSGTV 321

Query: 363 ITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMT 422
           ITR    IY+A++ EF KQ +G  S+ G    DTCF  +A  E   P V + F G   + 
Sbjct: 322 ITRFVQPIYTAIRDEFRKQVAGPFSSLG--AFDTCF--AATNEAVAPAVTLHFTGLNLVL 377

Query: 423 VDVTGIVYFVKSDASQVCLALASL--SYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGED 480
                +++   S  S  CLA+A+   +      +I N QQ+N R+++D  NS+LG A E 
Sbjct: 378 PMENSLIH--SSAGSLACLAMAAAPNNVNSVLNVIANLQQQNLRLLFDVPNSRLGIAREL 435

Query: 481 C 481
           C
Sbjct: 436 C 436


>gi|357481191|ref|XP_003610881.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355512216|gb|AES93839.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 427

 Score =  144 bits (364), Expect = 8e-32,   Method: Compositional matrix adjust.
 Identities = 118/365 (32%), Positives = 168/365 (46%), Gaps = 34/365 (9%)

Query: 134 NYIATIELGGRNMTV--IVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSS 191
           +Y+  + LG   + +  +VDTGSDL W QC PC  CY Q+ P+F+P  S +Y  + C S 
Sbjct: 81  DYLMKLTLGSPPVDIYGLVDTGSDLVWAQCTPCGGCYRQKSPMFEPLRSKTYSPIPCESE 140

Query: 192 TCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS-----VNDFI 246
            C    ++     +C+        Y  SY D S T+G L RE +           V D I
Sbjct: 141 QCSFFGYSCSPQKMCA--------YSYSYADSSVTKGVLAREAITFSSTDGDPVVVGDII 192

Query: 247 FGCGRNNKGLFG-GVSGLMGLGRSDLSLVSQTSEIFGG-LFSYCL-PSTQDAGASGSLIL 303
           FGCG +N G F     G++G+G   LSLVSQ   ++G   FS CL P   DA  SG++  
Sbjct: 193 FGCGHSNSGTFNENDMGIIGMGGGPLSLVSQIGTLYGSKRFSQCLVPFHTDAHTSGTINF 252

Query: 304 GGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ---ASGFAKGGILIDSG 360
           G  S V   S     T  + + +  T Y++ L GIS+G   ++   +   +KG I+IDSG
Sbjct: 253 GEESDV---SGEGVVTTPLASEEGQTSYLVTLEGISVGDTFVRFNSSETLSKGNIMIDSG 309

Query: 361 TVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNI--PLVKMEFEGN 418
           T  T +P   Y  L  E   Q S  P        D    L    E N+  P++   FEG 
Sbjct: 310 TPATYIPQEFYERLVEELKVQSSLLPIE---DDPDLGTQLCYRSETNLEGPILTAHFEG- 365

Query: 419 AEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAG 478
               V +  I  F+       C A+A  +  D   I GN+ Q N  + +D     + F  
Sbjct: 366 --ADVQLLPIQTFIPPKDGVFCFAMAGST--DGDYIFGNFAQSNILMGFDLDRKTISFKP 421

Query: 479 EDCSS 483
            DC++
Sbjct: 422 TDCTN 426


>gi|190896608|gb|ACE96817.1| aspartyl protease [Populus tremula]
          Length = 339

 Score =  144 bits (363), Expect = 9e-32,   Method: Compositional matrix adjust.
 Identities = 105/315 (33%), Positives = 154/315 (48%), Gaps = 26/315 (8%)

Query: 113 NIKDVSNTEIPLTSGIR-LQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYN 169
            + D   T +P+  G + L+  NY+  ++LG  G+ M +++DT +D  WV C  C  C +
Sbjct: 22  TLADQKTTAVPIAPGQQVLKIANYVVRVKLGTPGQQMFMVLDTSNDAAWVPCSGCTGCSS 81

Query: 170 QQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGE 229
                F P+ S +   + C+ + C  +    G S  C ++    C +  SYG  S     
Sbjct: 82  T---TFLPNASTTLGSLDCSEAQCSQVR---GFS--CPATGSSACLFNQSYGGDSSLAAT 133

Query: 230 LGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL 289
           L ++ + L    +  F FGC     G      GL+GLGR  +SL+SQ   ++ G+FSYCL
Sbjct: 134 LVQDAITLANDVIPGFTFGCINAVSGGSIPPQGLLGLGRGPISLISQAGAMYSGVFSYCL 193

Query: 290 PSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIG-------G 342
           PS +    SGSL LG           I  T ++ NP   + Y +NLTG+S+G        
Sbjct: 194 PSFKSYYFSGSLKLGP----VGQPKSIRTTPLLRNPHRPSLYYVNLTGVSVGRIKVPIPS 249

Query: 343 KQLQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSA 402
           +QL        G +IDSGTVITR    +Y A++ EF KQ +G  S+ G    DTCF  + 
Sbjct: 250 EQLVFDPNTGAGTIIDSGTVITRFVQPVYFAIRDEFRKQVNGPISSLG--AFDTCF--AE 305

Query: 403 YQEVNIPLVKMEFEG 417
             E   P V + FEG
Sbjct: 306 TNEAEAPAVTLHFEG 320


>gi|224057272|ref|XP_002299201.1| predicted protein [Populus trichocarpa]
 gi|118483775|gb|ABK93780.1| unknown [Populus trichocarpa]
 gi|222846459|gb|EEE84006.1| predicted protein [Populus trichocarpa]
          Length = 425

 Score =  144 bits (363), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 128/418 (30%), Positives = 196/418 (46%), Gaps = 51/418 (12%)

Query: 80  KIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRL-QTLNYIAT 138
           K V W +     L  D   +Q+L S +             + +P+ SG ++ Q+  YI  
Sbjct: 44  KPVSWEDSVLQMLAEDQARLQFLSSLVGR----------KSWVPIASGRQIVQSPTYIVK 93

Query: 139 IELGGRNMTVI--VDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHAL 196
             +G    T +  +DT +D  W+ C  C  C +    VF+   S ++K + C++  C  +
Sbjct: 94  ANVGTPAQTFLMALDTSNDAAWIPCNGCVGCSST---VFNSVTSTTFKTLGCDAPQCKQV 150

Query: 197 EFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGL 256
              T     C  S+   C +  +YG GS     L R+ + L    V  + FGC +   G 
Sbjct: 151 PNPT-----CGGST---CTWNTTYG-GSTILSNLTRDTIALSTDIVPGYTFGCIQKTTGS 201

Query: 257 FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPI 316
                GL+GLGR  LS +SQT +++   FSYCLPS +    SG+L LG      +    I
Sbjct: 202 SVPPQGLLGLGRGPLSFLSQTQDLYKSTFSYCLPSFRTLNFSGTLRLGPAGQPLR----I 257

Query: 317 TYTNMIPNPQLATFYILNLTGISIGGK--QLQASGFA-----KGGILIDSGTVITRLPPS 369
             T ++ NP+ ++ Y +NL GI +G K   + AS  A       G + DSGTV TRL   
Sbjct: 258 KTTPLLKNPRRSSLYYVNLIGIRVGRKIVDIPASALAFNPTTGAGTIFDSGTVFTRLVAP 317

Query: 370 IYSALKAEFLKQFSG--FPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTG 427
           +Y+A++ EF K+       S  GF   DTC+       +  P +   F G   M V +  
Sbjct: 318 VYTAVRDEFRKRVGNAIVSSLGGF---DTCYT----GPIVAPTMTFMFSG---MNVTLPP 367

Query: 428 IVYFVKSDA-SQVCLALASL--SYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
               ++S A S  CLA+A+   +      +I N QQ+N R+++D  NS++G A E CS
Sbjct: 368 DNLLIRSTAGSTSCLAMAAAPDNVNSVLNVIANMQQQNHRILFDVPNSRIGVAREPCS 425


>gi|118482048|gb|ABK92955.1| unknown [Populus trichocarpa]
          Length = 425

 Score =  144 bits (362), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 128/418 (30%), Positives = 196/418 (46%), Gaps = 51/418 (12%)

Query: 80  KIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRL-QTLNYIAT 138
           K V W +     L  D   +Q+L S +             + +P+ SG ++ Q+  YI  
Sbjct: 44  KPVSWEDSVLQMLAEDQARLQFLSSLVGR----------KSWVPIASGRQIVQSPTYIVK 93

Query: 139 IELGGRNMTVI--VDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHAL 196
             +G    T +  +DT +D  W+ C  C  C +    VF+   S ++K + C++  C  +
Sbjct: 94  ANVGTPAQTFLMALDTSNDAAWIPCNGCVGCSST---VFNSVTSTTFKTLGCDAPQCKQV 150

Query: 197 EFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGL 256
              T     C  S+   C +  +YG GS     L R+ + L    V  + FGC +   G 
Sbjct: 151 PNPT-----CGGST---CTWNTTYG-GSTILSNLTRDTIALSTDIVPGYTFGCIQKTTGS 201

Query: 257 FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPI 316
                GL+GLGR  LS +SQT +++   FSYCLPS +    SG+L LG      +    I
Sbjct: 202 SVPPQGLLGLGRGPLSFLSQTQDLYKSTFSYCLPSFRTLNFSGTLRLGPAGQPLR----I 257

Query: 317 TYTNMIPNPQLATFYILNLTGISIGGK--QLQASGFA-----KGGILIDSGTVITRLPPS 369
             T ++ NP+ ++ Y +NL GI +G K   + AS  A       G + DSGTV TRL   
Sbjct: 258 KTTPLLKNPRRSSLYYVNLIGIRVGRKIVDIPASALAFNPTTGAGTIFDSGTVFTRLVAP 317

Query: 370 IYSALKAEFLKQFSG--FPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTG 427
           +Y+A++ EF K+       S  GF   DTC+       +  P +   F G   M V +  
Sbjct: 318 VYTAVRDEFRKRVGNAIVSSLGGF---DTCYT----GPIVAPTMTFMFSG---MNVTLPT 367

Query: 428 IVYFVKSDA-SQVCLALASL--SYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
               ++S A S  CLA+A+   +      +I N QQ+N R+++D  NS++G A E CS
Sbjct: 368 DNLLIRSTAGSTSCLAMAAAPDNVNSVLNVIANMQQQNHRILFDVPNSRIGVAREPCS 425


>gi|357515001|ref|XP_003627789.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355521811|gb|AET02265.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 415

 Score =  144 bits (362), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 132/439 (30%), Positives = 188/439 (42%), Gaps = 78/439 (17%)

Query: 67  ITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSN------T 120
            TLEL H++  S K   + +  QN+             RI N +  +I  V++      T
Sbjct: 29  FTLELIHRD--SSK-SPFYQPTQNKY-----------ERIANAVRRSINRVNHFYKYSLT 74

Query: 121 EIPLTSGIRLQTLNYIATIELGGRNMTVI--VDTGSDLTWVQCQPCKSCYNQQDPVFDPS 178
             P  S +      Y+ +  +G     V   VDTGSDL W+QC+PCK CY Q  P+FDPS
Sbjct: 75  STP-QSTVNSDKGEYLMSYSIGTPPFKVFGFVDTGSDLVWLQCEPCKQCYPQITPIFDPS 133

Query: 179 ISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG 238
           +S SY+ + C S TCH++   +             C+           RG L  E L L 
Sbjct: 134 LSSSYQNIPCLSDTCHSMRTTS-------------CD----------VRGYLSVETLTLD 170

Query: 239 -----KASVNDFIFGCGRNNKGLFGGV-SGLMGLGRSDLSLVSQTSEIFGGLFSYCL--- 289
                  S    + GCG  N G F G  SG++GLG   +SL SQ     GG FSYCL   
Sbjct: 171 STTGYSVSFPKTMIGCGYRNTGTFHGPSSGIVGLGSGPMSLPSQLGTSIGGKFSYCLGPW 230

Query: 290 --PSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQA 347
              ST       + I+ G+ ++   +TPI   +        + Y L L   S+G K ++ 
Sbjct: 231 LPNSTSKLNFGDAAIVYGDGAM---TTPIVKKDA------QSGYYLTLEAFSVGNKLIEF 281

Query: 348 SGFAKGG----ILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAY 403
            G   GG    ILIDSGT  T LP  +Y   ++   +  +             C+N+ AY
Sbjct: 282 GGPTYGGNEGNILIDSGTTFTFLPYDVYYRFESAVAEYINLEHVEDPNGTFKLCYNV-AY 340

Query: 404 QEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQ 463
                PL+   F+G     + +  I  F+K      CLA        +T I GN  Q+N 
Sbjct: 341 HGFEAPLITAHFKG---ADIKLYYISTFIKVSDGIACLAFI----PSQTAIFGNVAQQNL 393

Query: 464 RVIYDTKNSQLGFAGEDCS 482
            V Y+   + + F   DC+
Sbjct: 394 LVGYNLVQNTVTFKPVDCT 412


>gi|297811181|ref|XP_002873474.1| hypothetical protein ARALYDRAFT_909035 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297319311|gb|EFH49733.1| hypothetical protein ARALYDRAFT_909035 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 293

 Score =  144 bits (362), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 83/201 (41%), Positives = 119/201 (59%), Gaps = 14/201 (6%)

Query: 92  LILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVI 149
           L  D   V+ + S++   I+  +    +T++P  +GI L + NYI TI +G    +++++
Sbjct: 91  LRRDEARVESIHSKLSKNIADEVSKAKSTKLPAKNGIILGSPNYIVTIGIGTPKHDISLM 150

Query: 150 VDTGSDLTWVQCQPC-KSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSS 208
            DTGSDLTW QC+PC  SCY+Q++P F+PS S SY  V C+S  C       GN   CS+
Sbjct: 151 FDTGSDLTWTQCEPCLGSCYSQKEPKFNPSSSSSYHNVSCSSPMC-------GNPESCSA 203

Query: 209 SSPPDCNYFVSYGDGSYTRGELGREHLGLGKASV-NDFIFGCGRNNKGLFGGVSGLMGLG 267
           S   +C Y + YGDGS T G L +E   L  + V +D  FGCG NNKG+F G +G++GLG
Sbjct: 204 S---NCLYGIGYGDGSVTVGFLAKEKFTLTNSDVLDDIYFGCGENNKGVFIGSAGILGLG 260

Query: 268 RSDLSLVSQTSEIFGGLFSYC 288
               S   QT+  +  +FSYC
Sbjct: 261 PGKFSFPLQTTTTYNNIFSYC 281


>gi|242091325|ref|XP_002441495.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
 gi|241946780|gb|EES19925.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
          Length = 466

 Score =  143 bits (361), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 117/407 (28%), Positives = 188/407 (46%), Gaps = 42/407 (10%)

Query: 101 YLQSRIKNM------ISGNIKDVSNTEIPLTSGIRLQTLNYIATIELGG--RNMTVIVDT 152
           Y+ +R+++       ++  +   S   +P++SG    T  Y   + +G   +  T++ DT
Sbjct: 76  YICARLRSRQGGSRRVAAEVASSSAVSLPMSSGAYSGTGQYFVKLRVGTPVQEFTLVADT 135

Query: 153 GSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCH-ALEFATGNSGVCSSSSP 211
           GSDLTWV+C    +  +    VF P  S S+  + C+S TC   + F   N   CSS + 
Sbjct: 136 GSDLTWVKC----AGASPPGRVFRPKTSRSWAPIPCSSDTCKLDVPFTLAN---CSSPAS 188

Query: 212 PDCNYFVSYGDGSY-TRGELGREHLGL----GK-ASVNDFIFGCGRNNKGL-FGGVSGLM 264
           P C Y   Y +GS   RG +G E   +    GK A + D + GC  ++ G  F    G++
Sbjct: 189 P-CTYDYRYKEGSAGARGIVGTESATIALPGGKVAQLKDVVLGCSSSHDGQSFRSADGVL 247

Query: 265 GLGRSDLSLVSQTSEIFGGLFSYCL-PSTQDAGASGSLILGGNSSVFKNSTPITYTNMIP 323
            LG + +S  +Q +  FGG FSYCL        A+G L  G         TP T T +  
Sbjct: 248 SLGNAKISFATQAAARFGGSFSYCLVDHLAPRNATGYLAFGPGQV---PRTPATQTKLFL 304

Query: 324 NPQLATFYILNLTGISIGGKQL----QASGFAKGGILIDSGTVITRLPPSIYSALKAEFL 379
           +P++  FY + +  I + GK L    +      GG+++DSG  +T L    Y A+ A   
Sbjct: 305 DPEM-PFYGVKVDAIHVAGKALDIPAEVWDAKSGGVILDSGNTLTVLAAPAYKAVVAALS 363

Query: 380 KQFSGFPSAPGFSILDTCFNLSAYQ----EVNIPLVKMEFEGNAEMTVDVTGIVYFVKSD 435
           K   G P    F   + C+N +A +    E+ IP + ++F G+A +       V  VK  
Sbjct: 364 KHLDGVPKV-SFPPFEHCYNWTARRPGAPEI-IPKLAVQFAGSARLEPPAKSYVIDVKPG 421

Query: 436 ASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
               C+ +    +   + +IGN  Q+     +D KN Q+ F   +C+
Sbjct: 422 VK--CIGVQEGEWPGLS-VIGNIMQQEHLWEFDLKNMQVRFKQSNCT 465


>gi|356515690|ref|XP_003526531.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 439

 Score =  143 bits (360), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 136/463 (29%), Positives = 218/463 (47%), Gaps = 44/463 (9%)

Query: 33  GKKKLHLHKLQWQQKSGSSSSCVSHQKSRIEMGAITLELKHKNYCSGKIVDWNEQQQNRL 92
           G   + +  + W  +  +   C S Q    ++  I +  K   +   K   W+ +  N  
Sbjct: 6   GTTLIVIFSVMWLMRVNAIDPCAS-QPDNSDLNVIPIYSKCSPFKPPKADTWDNRIINMA 64

Query: 93  ILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIV 150
             D + V+YL + +        K VS    P+ SG      NY+  ++LG  G+ + +++
Sbjct: 65  SKDPVRVKYLSTLVSQ------KTVSTA--PIASGQAFNIGNYVVRVKLGTPGQLLFMVL 116

Query: 151 DTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSS 210
           DT +D  +V C  C  C    D  F P  S SY  + C+   C  +   +     C ++ 
Sbjct: 117 DTSTDEAFVPCSGCTGC---SDTTFSPKASTSYGPLDCSVPQCGQVRGLS-----CPATG 168

Query: 211 PPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSD 270
              C++  SY   S++   L ++ L L    +  + FGC     G      GL+GLGR  
Sbjct: 169 TGACSFNQSYAGSSFS-ATLVQDALRLATDVIPYYSFGCVNAITGASVPAQGLLGLGRGP 227

Query: 271 LSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATF 330
           LSL+SQ+   + G+FSYCLPS +    SGSL LG           I  T ++ +P   + 
Sbjct: 228 LSLLSQSGSNYSGIFSYCLPSFKSYYFSGSLKLGP----VGQPKSIRTTPLLRSPHRPSL 283

Query: 331 YILNLTGISIG-------GKQLQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFS 383
           Y +N TGIS+G        + L  +     G +IDSGTVITR    +Y+A++ EF KQ  
Sbjct: 284 YYVNFTGISVGRVLVPFPSEYLGFNPNTGSGTIIDSGTVITRFVEPVYNAVREEFRKQVG 343

Query: 384 G--FPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEG-NAEMTVDVTGIVYFVKSDASQVC 440
           G  F S   F   DTCF +  Y+ +  P + + FEG + ++ ++ + I     S  S  C
Sbjct: 344 GTTFTSIGAF---DTCF-VKTYETL-APPITLHFEGLDLKLPLENSLI---HSSAGSLAC 395

Query: 441 LALASL--SYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
           LA+A+   +      +I N+QQ+N R+++D  N+++G A E C
Sbjct: 396 LAMAAAPDNVNSVLNVIANFQQQNLRILFDIVNNKVGIAREVC 438


>gi|225446261|ref|XP_002265547.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 440

 Score =  143 bits (360), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 107/353 (30%), Positives = 164/353 (46%), Gaps = 46/353 (13%)

Query: 150 VDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSS 209
           +DTGSDL WVQC+PC  C+ Q  P+FDPS S +Y  +  +S  C        NS     +
Sbjct: 108 IDTGSDLLWVQCRPCADCFRQSTPIFDPSKSSTYVDLSYDSPIC-------PNSPQKKYN 160

Query: 210 SPPDCNYFVSYGDGSYTRGELGREHLGL-----GKASVNDFIFGCGRNNKGLFGG-VSGL 263
               C Y  SY DGS + G L  E +       G  +V+  +FGCG +N+G F G  SG+
Sbjct: 161 HLNQCIYNASYADGSTSSGNLATEDIVFETSDQGTVTVSSVVFGCGHSNRGRFDGQQSGI 220

Query: 264 MGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGAS-GSLILGGNSSVFKNSTPITYTNMI 322
           +GL   D S+VS+     G  FSYC+    D   +   L+LG    +  +STP    N  
Sbjct: 221 LGLSAGDQSIVSR----LGSRFSYCIGDLFDPHYTHNQLVLGDGVKMEGSSTPFHTFN-- 274

Query: 323 PNPQLATFYILNLTGISIGGKQL-------QASGFAKGGILIDSGTVITRLPPSIYSALK 375
                  FY + L GIS+G  +L       Q +   +GG+++DSGT  T L    +  L 
Sbjct: 275 ------GFYYVTLEGISVGETRLDINPEVFQRTESGQGGVVMDSGTTATFLAKDGFDPLS 328

Query: 376 AEFLKQFSG------FPSAPGFSILDTCFNLSAYQEVN-IPLVKMEFEGNAEMTVDVTGI 428
            E  +   G      + + PG+     C+     +++   P +   F   A++ +D   +
Sbjct: 329 NEIQRLVRGHFQQVIYRTIPGW----LCYKGRVNEDLRGFPELAFHFAEGADLVLDANSL 384

Query: 429 VYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
             FV+ +    CLA+   + ++   +IG   Q++  V YD    ++ F   DC
Sbjct: 385 --FVQKNQDVFCLAVLESNLKNIGSVIGIMAQQHYNVAYDLIGKRVYFQRTDC 435


>gi|255566835|ref|XP_002524401.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
 gi|223536362|gb|EEF38012.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
          Length = 455

 Score =  143 bits (360), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 114/391 (29%), Positives = 178/391 (45%), Gaps = 42/391 (10%)

Query: 123 PLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSC-YNQQDPVFDPSI 179
           P+ SG    +  Y  ++ +G   + + ++ DTGSDL WV+C PC++C +      F    
Sbjct: 74  PVISGASSGSGQYFVSLRIGTPPQTLLLVADTGSDLIWVKCSPCRNCSHRSPGSAFFARH 133

Query: 180 SPSYKKVLCNSSTCHALEFATGNSGVCSSS---SPPDCNYFVSYGDGSYTRGELGREHLG 236
           S +Y  + C S  C  +     N   C+ +   SP  C Y  +Y D S T G   +E L 
Sbjct: 134 STTYSAIHCYSPQCQLVPHPHPNP--CNRTRLHSP--CRYQYTYADSSTTTGFFSKEALT 189

Query: 237 LGKAS-----VNDFIFGCGRNNKGL------FGGVSGLMGLGRSDLSLVSQTSEIFGGLF 285
           L  ++     +N   FGCG    G       F G  G+MGLGR+ +S  SQ    FG  F
Sbjct: 190 LNTSTGKVKKLNGLSFGCGFRISGPSLTGASFEGAQGVMGLGRAPISFSSQLGRRFGSKF 249

Query: 286 SYCLPS-TQDAGASGSLILGG--NSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGG 342
           SYCL   T     +  L +GG  N +V K    +++T ++ NP   TFY + + G+ + G
Sbjct: 250 SYCLMDYTLSPPPTSFLTIGGAQNVAVSKKGI-MSFTPLLINPLSPTFYYIAIKGVYVNG 308

Query: 343 KQLQAS-------GFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSA----PGF 391
            +L  +           GG +IDSGT +T +    Y+ +   F K+    PS     PGF
Sbjct: 309 VKLPINPSVWSIDDLGNGGTIIDSGTTLTFITEPAYTEILKAFKKRVK-LPSPAEPTPGF 367

Query: 392 SILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDE 451
              D C N+S      +P +     G +  +       YF+++     CLA+  +S +  
Sbjct: 368 ---DLCMNVSGVTRPALPRMSFNLAGGSVFSPPPRN--YFIETGDQIKCLAVQPVSQDGG 422

Query: 452 TGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
             ++GN  Q+   + +D   S+LGF    C+
Sbjct: 423 FSVLGNLMQQGFLLEFDRDKSRLGFTRRGCA 453


>gi|356528627|ref|XP_003532901.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 430

 Score =  143 bits (360), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 109/362 (30%), Positives = 171/362 (47%), Gaps = 31/362 (8%)

Query: 135 YIATIELGGRNMT--VIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
           Y+    LG  ++    I DTGSDL+W+QC PCK+CY Q+ P+FDP+ S +Y  V C S  
Sbjct: 88  YLMRFSLGTPSVERLAIFDTGSDLSWLQCTPCKTCYPQEAPLFDPTQSSTYVDVPCESQP 147

Query: 193 CHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREH-------LGLGKASVNDF 245
           C        N   C SS    C Y   YG  S+T G LG +        +G G A+    
Sbjct: 148 CTLFP---QNQRECGSSK--QCIYLHQYGTDSFTIGRLGYDTISFSSTGMGQGGATFPKS 202

Query: 246 IFGCGRNNKGLFG---GVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLI 302
           +FGC   +   F      +G +GLG   LSL SQ  +  G  FSYC+     + ++G L 
Sbjct: 203 VFGCAFYSNFTFKISTKANGFVGLGPGPLSLASQLGDQIGHKFSYCM-VPFSSTSTGKLK 261

Query: 303 LGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGGILIDSGTV 362
            G  +     +  +  T  + NP   ++Y+LNL GI++G K++  +G   G I+IDS  +
Sbjct: 262 FGSMAP----TNEVVSTPFMINPSYPSYYVLNLEGITVGQKKV-LTGQIGGNIIIDSVPI 316

Query: 363 ITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMT 422
           +T L   IY+   +   +  +   +    +  + C  +     +N P     F G A++ 
Sbjct: 317 LTHLEQGIYTDFISSVKEAINVEVAEDAPTPFEYC--VRNPTNLNFPEFVFHFTG-ADVV 373

Query: 423 VDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
           +    +  F+  D + VC+ +          I GN+ Q N +V YD    ++ FA  +CS
Sbjct: 374 LGPKNM--FIALDNNLVCMTVVP---SKGISIFGNWAQVNFQVEYDLGEKKVSFAPTNCS 428

Query: 483 SM 484
           ++
Sbjct: 429 TI 430


>gi|356557010|ref|XP_003546811.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 437

 Score =  142 bits (359), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 126/444 (28%), Positives = 205/444 (46%), Gaps = 32/444 (7%)

Query: 51  SSSCVSHQKSRIEMGAITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMI 110
           S S +S +++   +   +++L H++       D +     R+           +R+ + +
Sbjct: 16  SPSSISTREAGEGLRGFSIDLIHRDSPLSPFYDPSLTPSERITNAAFRSSSRLNRVSHFL 75

Query: 111 SGNIKDVSNTEIPLTSGIRLQTLNYIATIELGGRNMTVIVDTGSDLTWVQCQPCKSCYNQ 170
             N  ++  + +   +G  L TL YI T  +       I DTGSDL WVQC PC++C+ Q
Sbjct: 76  DEN--NLPESLLIPENGEYLMTL-YIGTPPV---ERLAIADTGSDLIWVQCSPCQNCFPQ 129

Query: 171 QDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGEL 230
             P+F+P  S ++K   C+S  C ++  +    G         C Y  SYGD S+T G +
Sbjct: 130 DTPLFEPLKSSTFKAATCDSQPCTSVPPSQRQCGKVG-----QCIYSYSYGDKSFTVGVV 184

Query: 231 GREHLGLGK------ASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEI---F 281
           G E L  G        S    IFGCG  N   F     + GL       +S  S++    
Sbjct: 185 GTETLSFGSTGDAQTVSFPSSIFGCGVYNNFTFHTSDKVTGLVGLGGGPLSLVSQLGPQI 244

Query: 282 GGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIG 341
           G  FSYCL     + ++  L  G  + V  N   +  T +I  P   +FY LNL  ++IG
Sbjct: 245 GYKFSYCL-LPFSSNSTSKLKFGSEAIVTTNG--VVSTPLIIKPLFPSFYFLNLEAVTIG 301

Query: 342 GKQLQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI-LDTCFNL 400
            K +  +G   G I+IDSGTV+T L  + Y+   A  L++     SA         CF  
Sbjct: 302 QK-VVPTGRTDGNIIIDSGTVLTYLEQTFYNNFVAS-LQEVLSVESAQDLPFPFKFCF-- 357

Query: 401 SAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQ 460
             Y+++ IP++  +F G A + +    ++  ++ D + +CLA+   S      I GN  Q
Sbjct: 358 -PYRDMTIPVIAFQFTG-ASVALQPKNLLIKLQ-DRNMLCLAVVPSSLSG-ISIFGNVAQ 413

Query: 461 KNQRVIYDTKNSQLGFAGEDCSSM 484
            + +V+YD +  ++ FA  DC+ +
Sbjct: 414 FDFQVVYDLEGKKVSFAPTDCTKV 437


>gi|357515189|ref|XP_003627883.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355521905|gb|AET02359.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 434

 Score =  142 bits (359), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 132/444 (29%), Positives = 204/444 (45%), Gaps = 54/444 (12%)

Query: 59  KSRIEMGAITLELKH-----KNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGN 113
           K  I+    TL++ H       +   K + W E   N    D   +QY  S +       
Sbjct: 25  KCDIQDDGSTLKVFHIFSQCSPFKPSKPMSWEESVLNLQAKDQARMQYFSSLVAR----- 79

Query: 114 IKDVSNTEIPLTSGIRL-QTLNYIATIELGGRNMTVIV--DTGSDLTWVQCQPCKSCYNQ 170
                 + +P+ S  ++ Q+  YI   + G    T+++  DT SD  W+ C  C  C   
Sbjct: 80  -----KSVVPIASARQIIQSPTYIVKAKFGTPPQTLLLALDTSSDAAWIPCSGCVGCSTS 134

Query: 171 QDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGEL 230
           +   F P  S S++ V C S  C  +   T     C  S+   C +  +YG  S     +
Sbjct: 135 KP--FAPIKSTSFRNVSCGSPHCKQVPNPT-----CGGSA---CAFNFTYGSSSIA-ASV 183

Query: 231 GREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLP 290
            ++ L L    +  + FGC     G      GL+GLGR  LSL+SQ+  ++   FSYCLP
Sbjct: 184 VQDTLTLATDPIPGYTFGCVNKTTGSSAPQQGLLGLGRGPLSLLSQSQNLYKSTFSYCLP 243

Query: 291 STQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGF 350
           S +    SGSL LG    V++    I YT ++ NP+ ++ Y +NL  I +G K +     
Sbjct: 244 SFKSINFSGSLRLG---PVYQPKR-IKYTPLLRNPRRSSLYYVNLVAIKVGRKIVDIPPA 299

Query: 351 A-------KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI--LDTCFNLS 401
           A         G + DSGTV TRL   +Y+A++ EF ++    P  P  ++   DTC+N+ 
Sbjct: 300 ALAFNPTTGAGTIFDSGTVFTRLAEPVYTAVRNEFRRRVG--PKLPVTTLGGFDTCYNV- 356

Query: 402 AYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDA-SQVCLALASL--SYEDETGIIGNY 458
               + +P +   F G   +T+    IV  + S A S  CLA+A    +      +I N 
Sbjct: 357 ---PIVVPTITFLFSG-MNVTLPPDNIV--IHSTAGSTTCLAMAGAPDNVNSVLNVIANM 410

Query: 459 QQKNQRVIYDTKNSQLGFAGEDCS 482
           QQ+N RV++D  NS++G A E C+
Sbjct: 411 QQQNHRVLFDVPNSRIGIARELCT 434


>gi|388495448|gb|AFK35790.1| unknown [Medicago truncatula]
          Length = 434

 Score =  142 bits (359), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 131/444 (29%), Positives = 202/444 (45%), Gaps = 54/444 (12%)

Query: 59  KSRIEMGAITLELKH-----KNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGN 113
           K  I+    TL++ H       +   K + W E   N    D   +QY  S +       
Sbjct: 25  KCDIQDDGSTLKVFHIFSQCSPFKPSKPMSWEESVLNLQAKDQARMQYFSSLVAR----- 79

Query: 114 IKDVSNTEIPLTSGIRL-QTLNYIATIELGGRNMTVIV--DTGSDLTWVQCQPCKSCYNQ 170
                 + +P+ S  ++ Q+  YI   + G    T+++  DT SD  W+ C  C  C   
Sbjct: 80  -----KSVVPIASARQIIQSPTYIVKAKFGTPPQTLLLALDTSSDAAWIPCSGCVGCSTS 134

Query: 171 QDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGEL 230
           +   F P  S S++ V C S  C  +   T     C  S+   C +  +YG  S     +
Sbjct: 135 KP--FAPIKSTSFRNVSCGSPHCKQVPNPT-----CGGSA---CAFNFTYGSSSIA-ASV 183

Query: 231 GREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLP 290
            ++ L L    +  + FGC     G      GL+GLGR  LSL+SQ+  ++   FSYCLP
Sbjct: 184 VQDTLTLAADPIPGYTFGCVNKTTGSSAPQQGLLGLGRGPLSLLSQSQNLYKSTFSYCLP 243

Query: 291 STQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGF 350
           S +    SGSL LG    V++    I YT ++ NP+ ++ Y +NL  I +G K +     
Sbjct: 244 SFKSINFSGSLRLG---PVYQPKR-IKYTPLLRNPRRSSLYYVNLVAIKVGRKIVDIPPA 299

Query: 351 A-------KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI--LDTCFNLS 401
           A         G + DSGTV TRL   +Y+A++ EF ++    P  P  ++   DTC+N+ 
Sbjct: 300 ALAFNPTTGAGTIFDSGTVFTRLAEPVYTAVRNEFRRRVG--PKLPVTTLGGFDTCYNV- 356

Query: 402 AYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDA-SQVCLALASL--SYEDETGIIGNY 458
               + +P +   F G   M V +      + S A S  CLA+A    +      +I N 
Sbjct: 357 ---PIVVPTITFLFSG---MNVALPPDNIVIHSTAGSTTCLAMAGAPDNVNSVLNVIANM 410

Query: 459 QQKNQRVIYDTKNSQLGFAGEDCS 482
           QQ+N RV++D  NS++G A E C+
Sbjct: 411 QQQNHRVLFDVPNSRIGIARELCT 434


>gi|296090291|emb|CBI40110.3| unnamed protein product [Vitis vinifera]
          Length = 408

 Score =  142 bits (359), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 108/355 (30%), Positives = 165/355 (46%), Gaps = 46/355 (12%)

Query: 148 VIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCS 207
           V +DTGSDL WVQC+PC  C+ Q  P+FDPS S +Y  +  +S  C        NS    
Sbjct: 74  VGIDTGSDLLWVQCRPCADCFRQSTPIFDPSKSSTYVDLSYDSPIC-------PNSPQKK 126

Query: 208 SSSPPDCNYFVSYGDGSYTRGELGREHLGL-----GKASVNDFIFGCGRNNKGLFGG-VS 261
            +    C Y  SY DGS + G L  E +       G  +V+  +FGCG +N+G F G  S
Sbjct: 127 YNHLNQCIYNASYADGSTSSGNLATEDIVFETSDQGTVTVSSVVFGCGHSNRGRFDGQQS 186

Query: 262 GLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGAS-GSLILGGNSSVFKNSTPITYTN 320
           G++GL   D S+VS+     G  FSYC+    D   +   L+LG    +  +STP    N
Sbjct: 187 GILGLSAGDQSIVSR----LGSRFSYCIGDLFDPHYTHNQLVLGDGVKMEGSSTPFHTFN 242

Query: 321 MIPNPQLATFYILNLTGISIGGKQL-------QASGFAKGGILIDSGTVITRLPPSIYSA 373
                    FY + L GIS+G  +L       Q +   +GG+++DSGT  T L    +  
Sbjct: 243 --------GFYYVTLEGISVGETRLDINPEVFQRTESGQGGVVMDSGTTATFLAKDGFDP 294

Query: 374 LKAEFLKQFSG------FPSAPGFSILDTCFNLSAYQEVN-IPLVKMEFEGNAEMTVDVT 426
           L  E  +   G      + + PG+     C+     +++   P +   F   A++ +D  
Sbjct: 295 LSNEIQRLVRGHFQQVIYRTIPGW----LCYKGRVNEDLRGFPELAFHFAEGADLVLDAN 350

Query: 427 GIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
            +  FV+ +    CLA+   + ++   +IG   Q++  V YD    ++ F   DC
Sbjct: 351 SL--FVQKNQDVFCLAVLESNLKNIGSVIGIMAQQHYNVAYDLIGKRVYFQRTDC 403


>gi|242069057|ref|XP_002449805.1| hypothetical protein SORBIDRAFT_05g023600 [Sorghum bicolor]
 gi|241935648|gb|EES08793.1| hypothetical protein SORBIDRAFT_05g023600 [Sorghum bicolor]
          Length = 430

 Score =  142 bits (359), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 117/387 (30%), Positives = 191/387 (49%), Gaps = 40/387 (10%)

Query: 117 VSNTEIPLTSGIRLQTLNYIATIELGGRNMTVIV--DTGSDLTWVQCQPCKSCYNQQDPV 174
           +S +  P  + +R     Y+  + +G   +  I   DTGSDLTW QC+PCK C+ Q  P+
Sbjct: 65  LSTSSDPGPARLRSGQAEYLMELAIGTPPVPFIALADTGSDLTWTQCKPCKLCFGQDTPI 124

Query: 175 FDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREH 234
           +D + S S+  + C+S+TC  +      S  CS+ S   C Y  +Y DG+Y+      E 
Sbjct: 125 YDTTTSSSFSPLPCSSATCLPIW-----SSRCSTPS-ATCRYRYAYDDGAYS-----PEC 173

Query: 235 LGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQD 294
            G+   SV    FGCG +N GL    +G +GLGR  LSLV+Q      G FSYCL    +
Sbjct: 174 AGI---SVGGIAFGCGVDNGGLSYNSTGTVGLGRGSLSLVAQLGV---GKFSYCLTDFFN 227

Query: 295 AGASGSLILGGNSSVFKNSTP-----ITYTNMIPNPQLATFYILNLTGISIGGKQLQASG 349
              S  +  G  + +  +S       +  T ++ +P   + Y ++L GIS+G  +L    
Sbjct: 228 TSLSSPVFFGSLAELAASSASADAAVVQSTPLVQSPYNPSRYYVSLEGISLGDARLPIPN 287

Query: 350 --------FAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLS 401
                      GG+++DSGT+ T L  + +  +  + +    G P     S+   CF   
Sbjct: 288 GTFDLNDDDGSGGMIVDSGTIFTILVETGFRVV-VDHVAGVLGQPVVNASSLDRPCFPAP 346

Query: 402 A--YQEV-NIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETG-IIGN 457
           A   QE+ ++P + + F G A+M +     + F + ++S  CL +  +  E  +G ++GN
Sbjct: 347 AAGVQELPDMPDMVLHFAGGADMRLHRDNYMSFNEEESS-FCLNI--VGTESASGSVLGN 403

Query: 458 YQQKNQRVIYDTKNSQLGFAGEDCSSM 484
           +QQ+N ++++D    QL F   DCS +
Sbjct: 404 FQQQNIQMLFDITVGQLSFMPTDCSKL 430


>gi|147786881|emb|CAN62311.1| hypothetical protein VITISV_008929 [Vitis vinifera]
          Length = 408

 Score =  142 bits (359), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 108/355 (30%), Positives = 165/355 (46%), Gaps = 46/355 (12%)

Query: 148 VIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCS 207
           V +DTGSDL WVQC+PC  C+ Q  P+FDPS S +Y  +  +S  C        NS    
Sbjct: 74  VGIDTGSDLLWVQCRPCADCFRQSTPIFDPSKSSTYVDLSYDSPIC-------PNSPQKK 126

Query: 208 SSSPPDCNYFVSYGDGSYTRGELGREHLGL-----GKASVNDFIFGCGRNNKGLFGG-VS 261
            +    C Y  SY DGS + G L  E +       G  +V+  +FGCG +N+G F G  S
Sbjct: 127 YNHLNQCIYNASYADGSTSSGNLATEDIVFETSDQGTVTVSSVVFGCGHSNRGRFDGQQS 186

Query: 262 GLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGAS-GSLILGGNSSVFKNSTPITYTN 320
           G++GL   D S+VS+     G  FSYC+    D   +   L+LG    +  +STP    N
Sbjct: 187 GILGLSAGDQSIVSR----LGSRFSYCIGDLFDPHYTHNQLVLGDGVKMEGSSTPFHTFN 242

Query: 321 MIPNPQLATFYILNLTGISIGGKQL-------QASGFAKGGILIDSGTVITRLPPSIYSA 373
                    FY + L GIS+G  +L       Q +   +GG+++DSGT  T L    +  
Sbjct: 243 --------GFYYVTLEGISVGETRLDINPEVFQRTESGQGGVVMDSGTTATFLAKDGFDP 294

Query: 374 LKAEFLKQFSG------FPSAPGFSILDTCFNLSAYQEVN-IPLVKMEFEGNAEMTVDVT 426
           L  E  +   G      + + PG+     C+     +++   P +   F   A++ +D  
Sbjct: 295 LSNEIQRLVRGHFQQVIYRTIPGW----LCYKGRVNEDLRGFPELAFHFAEGADLVLDAN 350

Query: 427 GIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
            +  FV+ +    CLA+   + ++   +IG   Q++  V YD    ++ F   DC
Sbjct: 351 SL--FVQKNQDVFCLAVLESNLKNIGSVIGIMAQQHYNVAYDLIGKRVYFQRTDC 403


>gi|358346443|ref|XP_003637277.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355503212|gb|AES84415.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 434

 Score =  142 bits (358), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 111/347 (31%), Positives = 168/347 (48%), Gaps = 32/347 (9%)

Query: 149 IVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSS 208
            +DTGS++ W+QCQPC +C+NQ  P+F+PS S SYK + C SSTC      T ++ +  S
Sbjct: 105 FMDTGSNIVWLQCQPCNTCFNQTSPIFNPSKSSSYKNIPCTSSTCK----DTNDTHISCS 160

Query: 209 SSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIF-----GCGRNNKGLFGG-VSG 262
           +    C Y ++YG  + ++G+L  + L L   S +  +F     GCG  N        SG
Sbjct: 161 NGGDVCEYSITYGGDAKSQGDLSNDSLTLDSTSGSSVLFPNIVIGCGHINVLQDNSQSSG 220

Query: 263 LMGLGRSDLSLVSQT-SEIFGGLFSYCL-PSTQDAGASGSLILGGNSSVFKN---STPIT 317
           ++G+GR  +SL+ Q  S   G  FSYCL P   D+ +S  LI G +  V      STP+ 
Sbjct: 221 VVGMGRGPMSLIKQVGSSSVGSKFSYCLIPYNSDSNSSSKLIFGEDVVVSGEIVVSTPMV 280

Query: 318 YTNMIPNPQLATFYILNLTGISIGGKQL---QASGFAKGGILIDSGTVITRLPPSIYSAL 374
             N   N     +Y L L   S+G  ++   + S  +   ILIDSGT +T LP    S L
Sbjct: 281 KVNGQEN-----YYFLTLEAFSVGNNRIEYGERSNASTQNILIDSGTPLTMLPNLFLSKL 335

Query: 375 KAEFLKQFSGFPS-APGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVK 433
              ++ Q    P   P    L  C+N +  +++N+P +   F G A++ ++  G   F  
Sbjct: 336 -VSYVAQEVKLPRIEPPDHHLSLCYNTTG-KQLNVPDITAHFNG-ADVKLNSNGT--FFP 390

Query: 434 SDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGED 480
            +   +C    S    +   I GN  Q N  + YD +   + F   D
Sbjct: 391 FEDGIMCFGFIS---SNGLEIFGNIAQNNLLIDYDLEKEIISFKPTD 434


>gi|302802500|ref|XP_002983004.1| hypothetical protein SELMODRAFT_13348 [Selaginella moellendorffii]
 gi|300149157|gb|EFJ15813.1| hypothetical protein SELMODRAFT_13348 [Selaginella moellendorffii]
          Length = 332

 Score =  142 bits (357), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 112/367 (30%), Positives = 168/367 (45%), Gaps = 57/367 (15%)

Query: 135 YIATIELGG--RNMTVIVDTGSDLTWVQCQPCK-SCYNQQDPVFDPSISPSYKKVLCNSS 191
           Y +TI LG   ++ ++++DTGSDLTWV+C PC   C +     FD   S +YK + C   
Sbjct: 3   YYSTITLGSPPKDFSLVMDTGSDLTWVRCDPCSPDCSS----TFDRLASNTYKALTCAD- 57

Query: 192 TCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVND------F 245
                                  +Y   YGDGS+T+G+L  + L +  A+ ++      F
Sbjct: 58  -----------------------DYSYGYGDGSFTQGDLSVDTLKMAGAASDELEEFPGF 94

Query: 246 IFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL--PSTQDAGASGSLIL 303
           +FGCG   KGL  G  G++ L    LS  SQ  E +G  FSYCL   + Q++     ++ 
Sbjct: 95  VFGCGSLLKGLISGEVGILALSPGSLSFPSQIGEKYGNKFSYCLLRQTAQNSLKKSPMVF 154

Query: 304 GGNSSVFKN--STPITYTNMIPNPQLATFYILNLTGISIGGKQLQ--ASGFAKG---GIL 356
           G  +   K   S  +      P  + + +Y + L GIS+G ++L    S F  G     +
Sbjct: 155 GEAAVELKEPGSGKLQELQYTPIGESSIYYTVRLDGISVGNQRLDLSPSAFLNGQDKPTI 214

Query: 357 IDSGTVITRLPPSIYSALKAEFLKQFSG--FPSAPGFSILDTCFNLSAYQEVNIPLVKME 414
            DSGT +T LPP +  ++K       SG  F +  G   LD CF +       +P +   
Sbjct: 215 FDSGTTLTMLPPGVCDSIKQSLASMVSGAEFVAIKG---LDACFRVPPSSGQGLPDITFH 271

Query: 415 FEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQL 474
           F G A+    VT    +V    S  CL        +E  I GN QQ++  V++D  N ++
Sbjct: 272 FNGGADF---VTRPSNYVIDLGSLQCLIFVP---TNEVSIFGNLQQQDFFVLHDMDNRRI 325

Query: 475 GFAGEDC 481
           GF   DC
Sbjct: 326 GFKETDC 332


>gi|242085924|ref|XP_002443387.1| hypothetical protein SORBIDRAFT_08g018620 [Sorghum bicolor]
 gi|241944080|gb|EES17225.1| hypothetical protein SORBIDRAFT_08g018620 [Sorghum bicolor]
          Length = 460

 Score =  141 bits (356), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 140/460 (30%), Positives = 199/460 (43%), Gaps = 74/460 (16%)

Query: 67  ITLELKH---KNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIP 123
           + LEL H   K  C+ K       ++ R   +  H      R+ +M  G  +        
Sbjct: 33  LRLELTHVDAKQNCTTK-------ERMRRATERTH-----RRLASMAGGGGE-------- 72

Query: 124 LTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKS--CYNQQDPVFDPSI 179
            ++ I      YIA   +G   +    I+DTGS+L W QC  C++  C+ Q    +DPS 
Sbjct: 73  ASAPIHWNETQYIAEYLIGDPPQQAAAIIDTGSNLIWTQCSTCRANGCFGQDLTFYDPSR 132

Query: 180 SPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGRE--HLGL 237
           S + K V CN + C       G+   C+      C    +YG G+   G LG E    G 
Sbjct: 133 SRTAKPVACNDTAC-----LLGSETRCARDGK-ACAVLTAYGAGAIG-GFLGTEVFTFGH 185

Query: 238 GKASVND--FIFGC---GRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL-PS 291
           G++S N+    FGC    R   G   G SG++GLGR  LSL SQ  +     FSYCL P 
Sbjct: 186 GQSSENNVSLAFGCITASRLTPGSLDGASGIIGLGRGKLSLPSQLGD---NKFSYCLTPY 242

Query: 292 TQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQ---LATFYILNLTGISIGGKQLQAS 348
             DA  + +L +G ++ +     P T    + NP      +FY L LTGI++G  +L   
Sbjct: 243 FSDAANTSTLFVGASAGLSGGGAPATSVPFLKNPDDDPFDSFYYLPLTGITVGTAKLDVP 302

Query: 349 GFA----------KGGILIDSGTVITRLPPSIYSALKAEFLKQF--SGFPSAPGFSILDT 396
             A           GG LIDSG+  T L    Y AL+ E ++Q   S  P   G   LD 
Sbjct: 303 AAAFDLREVAPAKWGGTLIDSGSPFTSLIDVAYQALRDELVRQLGASVVPPPAGAEGLDL 362

Query: 397 CFNLSAYQEVN--IPLVKMEFEGNAEMTVDVTGIV----YFVKSDASQVCLALASLSYED 450
           C    A  +    +P + + F        DV  +V    Y+   D S  C+ + S    +
Sbjct: 363 CVGGVAPGDAGKLVPPLVLHFGSGGGGGGDV--VVPPENYWGPVDDSTACMVVFSSGGPN 420

Query: 451 ------ETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
                 ET IIGNY Q++  ++YD     L F   DCSS+
Sbjct: 421 STLPLNETTIIGNYMQQDMHLLYDLGQGVLSFQPADCSSV 460


>gi|359473000|ref|XP_002278677.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 458

 Score =  141 bits (356), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 113/388 (29%), Positives = 171/388 (44%), Gaps = 36/388 (9%)

Query: 123 PLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDP--VFDPS 178
           P+ SG    +  Y   + LG   + + ++ DTGSDL WV+C  C++C  +  P   F   
Sbjct: 77  PVVSGASTGSGQYFVDLRLGTPPQKLLLVADTGSDLVWVKCSACRNC-TRHTPGSAFLAR 135

Query: 179 ISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG 238
            S ++    C  S C  +     +    +    P C Y  SYGDGS T G   +E   L 
Sbjct: 136 HSTTFSPNHCYDSACQLVPLPKHHRCNHARLHSP-CRYEYSYGDGSKTSGFFSKETTTLN 194

Query: 239 -----KASVNDFIFGCGRNNKGL------FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSY 287
                +A +    FGC     G       F G  G+MGLGR  +SL SQ    FG  FSY
Sbjct: 195 TSSGREAKLKGIAFGCAFRISGPSVSGASFNGAHGVMGLGRGPISLSSQLGHRFGNKFSY 254

Query: 288 CLPSTQDAGASGSLILGGNS--SVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL 345
           CL     + +  S +L G++   V      + +T +  NP   TFY + +  +S+ G +L
Sbjct: 255 CLMDHDISPSPTSYLLIGSTQNDVAPGKRRMRFTPLHINPLSPTFYYIGIESVSVDGIKL 314

Query: 346 QASG-------FAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSA----PGFSIL 394
             +           GG ++DSGT +T LP   Y  +    +K+    PS     PGF   
Sbjct: 315 PINPSVWALDELGNGGTIVDSGTTLTFLPEPAYLQI-LTVIKRRVRLPSPAEPTPGF--- 370

Query: 395 DTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGI 454
           D C N+S  +   +P  K+ F+   +         YFV +D    CLAL ++       +
Sbjct: 371 DLCVNVSEIEHPRLP--KLSFKLGGDSVFSPPPRNYFVDTDEDVKCLALQAVMTPSGFSV 428

Query: 455 IGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
           IGN  Q+   + +D   ++LGF+   C+
Sbjct: 429 IGNLMQQGFLLEFDKDRTRLGFSRHGCA 456


>gi|125543639|gb|EAY89778.1| hypothetical protein OsI_11320 [Oryza sativa Indica Group]
          Length = 488

 Score =  141 bits (356), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 114/325 (35%), Positives = 161/325 (49%), Gaps = 24/325 (7%)

Query: 173 PVFDPSISPSYKKVLCNSSTCHALEFAT-GNSGVCSSSSPPDCNYFVSYGDGSYTRGELG 231
           P FD S S +     C+S+ C  L  A+ GN+    + +   C Y   Y D S T G L 
Sbjct: 175 PYFDRSTSSTLLLTSCDSTLCQGLLVASCGNTKFWPNQT---CVYTYYYNDKSVTTGLLE 231

Query: 232 REHLGLGK-ASVNDFIFGCGRNNKGLF-GGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL 289
            +    G  ASV    FGCG  N G+F    +G+ G GR  LSL SQ      G FS+C 
Sbjct: 232 VDKFTFGAGASVPGVAFGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKV---GNFSHCF 288

Query: 290 PSTQDAGASGSLILGGNSSVFKNST-PITYTNMIPNPQLATFYILNLTGISIGGKQLQA- 347
            +      S +++L   + ++KN    +  T +I N    T Y L+L GI++G  +L   
Sbjct: 289 TAVNGLKQS-TVLLDLLADLYKNGRGAVQSTPLIQNSANPTLYYLSLKGITVGSTRLPVP 347

Query: 348 -SGFA----KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILD-TCFNLS 401
            S FA     GG +IDSGT IT LPP +Y  ++ EF  Q    P  PG +    TCF+  
Sbjct: 348 ESAFALTNGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQIK-LPVVPGNATGPYTCFSAP 406

Query: 402 AYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDA--SQVCLALASLSYEDETGIIGNYQ 459
           +  + ++P + + FEG A M +     V+ V  DA  S +CLA+  L   DE   IGN+Q
Sbjct: 407 SQAKPDVPKLVLHFEG-ATMDLPRENYVFEVPDDAGNSMICLAINELG--DERATIGNFQ 463

Query: 460 QKNQRVIYDTKNSQLGFAGEDCSSM 484
           Q+N  V+YD +N+ L F    C  +
Sbjct: 464 QQNMHVLYDLQNNMLSFVAAQCDKL 488



 Score = 77.0 bits (188), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 53/139 (38%), Positives = 76/139 (54%), Gaps = 14/139 (10%)

Query: 337 GISIGGKQLQA--SGFA----KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPG 390
           GI++G  +L    S FA     GG +IDSGT IT LPP +Y  ++ EF  Q    P  PG
Sbjct: 41  GITVGSTRLPVPESAFALTNGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQIK-LPVVPG 99

Query: 391 FSILD-TCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDA--SQVCLALASLS 447
            +    TCF+  +  + ++P + + FEG A M +     V+ V  DA  S +CLA   ++
Sbjct: 100 NATGPYTCFSAPSQAKPDVPKLVLHFEG-ATMDLPRENYVFEVPDDAGNSIICLA---IN 155

Query: 448 YEDETGIIGNYQQKNQRVI 466
             DET IIGN+QQ+N   +
Sbjct: 156 KGDETTIIGNFQQQNMHAL 174


>gi|388502342|gb|AFK39237.1| unknown [Lotus japonicus]
          Length = 440

 Score =  140 bits (354), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 128/373 (34%), Positives = 183/373 (49%), Gaps = 53/373 (14%)

Query: 134 NYIATIELGGRNMT--VIVDTGSDLTWVQCQPCKS--CYNQQDPVFDPSISPSYKKVLCN 189
           NY+  I +G  ++    I DTGSDLTWVQC PC +  C+ Q  P++DP  S ++  + C+
Sbjct: 95  NYLMRIYIGTPSVERLAIADTGSDLTWVQCSPCDNTKCFAQNTPLYDPLNSSTFTLLPCD 154

Query: 190 SSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGRE--HLGLGKASVNDFI- 246
           S  C  L ++     VCS     DC Y  +YGD SY+ G L  +   L L +   N  I 
Sbjct: 155 SQPCTQLPYS---QYVCSDYG--DCIYAYTYGDNSYSYGGLSSDSIRLMLLQLHYNSKIC 209

Query: 247 FGCGRNNKGLF---GGVSGLMGLGRSDLSLVSQTSEIFGGLFSYC-LPSTQDAGAS---- 298
           FGCG  NK      G  +G++GLG   LSLVSQ  +  G  FSYC LP + ++ +     
Sbjct: 210 FGCGFQNKFTADKSGKTTGIVGLGAGPLSLVSQLGDEIGHKFSYCLLPFSSNSNSKLKFG 269

Query: 299 GSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGGILID 358
            + I+ GN  V   STP     +I  P L  FY LNL GI++G K ++ +G   G I+ID
Sbjct: 270 EAAIVQGNGVV---STP-----LIIKPDLP-FYYLNLEGITVGAKTVK-TGQTDGNIIID 319

Query: 359 SGTVITRLPPSIYS---ALKAEFL----KQFSGFPSAPGFSILDTCFNLSAYQE--VNIP 409
           SG+ +T L  S Y+   +L  E +     Q+  +P        D CF    Y+E     P
Sbjct: 320 SGSTLTYLEESFYNEFVSLVKETVAVEEDQYIPYP-------FDFCF---TYKEGMSTPP 369

Query: 410 LVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDT 469
            V   F G   +   +  +V     + + +C  +   S+ D   I GN  Q +  V YD 
Sbjct: 370 DVVFHFTGGDVVLKPMNTLVLI---EDNLICSTVVP-SHFDGIAIFGNLGQIDFHVGYDI 425

Query: 470 KNSQLGFAGEDCS 482
           +  ++ FA  DCS
Sbjct: 426 QGGKVSFAPTDCS 438


>gi|224099307|ref|XP_002311432.1| predicted protein [Populus trichocarpa]
 gi|222851252|gb|EEE88799.1| predicted protein [Populus trichocarpa]
          Length = 458

 Score =  140 bits (354), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 117/397 (29%), Positives = 180/397 (45%), Gaps = 38/397 (9%)

Query: 118 SNTEIPLTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDP-- 173
           ++++ PL SG    +  Y  +I LG   + + ++ DTGSDLTWV+C  CK+  +   P  
Sbjct: 66  TSSKSPLMSGASSGSGQYFVSIRLGSPPQTLLLVADTGSDLTWVRCSACKTNCSIHPPGS 125

Query: 174 VFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSS-PPDCNYFVSYGDGSYTRGELGR 232
            F    S ++    C SS C  +     N   C+ +     C Y   Y DGS T G   +
Sbjct: 126 TFLARHSTTFSPTHCFSSLCQLV--PQPNPNPCNHTRLHSTCRYEYVYSDGSKTSGFFSK 183

Query: 233 EHLGLGKAS-----VNDFIFGCGRNNKGL------FGGVSGLMGLGRSDLSLVSQTSEIF 281
           E   L  +S     +    FGCG +  G       F G SG+MGLGR  +S  SQ    F
Sbjct: 184 ETTTLNTSSGREMKLKSIAFGCGFHASGPSLIGSSFNGASGVMGLGRGPISFASQLGRRF 243

Query: 282 GGLFSYC-LPSTQDAGASGSLILGGNSSVFK-NSTPITYTNMIPNPQLATFYILNLTGIS 339
           G  FSYC L  T     +  L++G   S  K N + +++T ++ NP+  TFY +++ G+ 
Sbjct: 244 GRSFSYCLLDYTLSPPPTSYLMIGDVVSTKKDNKSMMSFTPLLINPEAPTFYYISIKGVF 303

Query: 340 IGGKQLQASG-------FAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPG-- 390
           + G +L              GG +IDSGT +T L    Y  + + F ++       PG  
Sbjct: 304 VDGVKLHIDPSVWSLDELGNGGTVIDSGTTLTFLTEPAYREILSAFKREVKLPSPTPGGA 363

Query: 391 --FSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSY 448
              S  D C N++       P + +E  G +  +       YF+       CLA+  +  
Sbjct: 364 STRSGFDLCVNVTGVSRPRFPRLSLELGGESLYSPPPRN--YFIDISEGIKCLAIQPV-- 419

Query: 449 EDETG---IIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
           E E+G   +IGN  Q+   + +D   S+LGF+   C+
Sbjct: 420 EAESGRFSVIGNLMQQGFLLEFDRGKSRLGFSRRGCA 456


>gi|293333354|ref|NP_001169607.1| uncharacterized protein LOC100383488 [Zea mays]
 gi|224030351|gb|ACN34251.1| unknown [Zea mays]
          Length = 342

 Score =  140 bits (354), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 105/358 (29%), Positives = 160/358 (44%), Gaps = 49/358 (13%)

Query: 159 VQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFV 218
           +QCQPC SCY Q DPVF+P +S SY  V C S TC  L+   G+   C       C Y  
Sbjct: 1   MQCQPCVSCYRQLDPVFNPKLSSSYAVVPCTSDTCAQLD---GHR--CHEDDDGACQYTY 55

Query: 219 SYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNN-KGLFGGVSGLMGLGRSDLSLVSQT 277
            Y     T+G L  + L +G    +  +FGC  ++  G     SGL+GLGR  LSLVSQ 
Sbjct: 56  KYSGHGVTKGTLAIDKLAIGGDVFHAVVFGCSDSSVGGPAAQASGLVGLGRGPLSLVSQL 115

Query: 278 SEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTG 337
           S      F YCLP    +  SG L+LG  +   +N +      M  + +  ++Y LNL G
Sbjct: 116 SV---HRFMYCLPPPM-SRTSGKLVLGAGADAVRNMSDRVTVTMSSSTRYPSYYYLNLDG 171

Query: 338 ISIGGKQLQASGFAKG--------------------------GILIDSGTVITRLPPSIY 371
           +++G +    +  A                            G+++D  + I+ L  S+Y
Sbjct: 172 LAVGDQTPGTTRNATSPPSGGAGGGGGGGGGGIVGAGGANAYGMIVDVASTISFLETSLY 231

Query: 372 SALKAEFLKQFSGFPSAPGFSI-LDTCFNLS---AYQEVNIPLVKMEFEGN-AEMTVDVT 426
             L  +  ++     + P   + LD CF L        V +P V + F+G   E+  D  
Sbjct: 232 DELADDLEEEIRLPRATPSLRLGLDLCFILPEGVGMDRVYVPTVSLSFDGRWLELDRD-- 289

Query: 427 GIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
                  +D   +CL +   S      I+GN+Q +N RV+++ +  ++ FA   C S+
Sbjct: 290 ---RLFVTDGRMMCLMIGRTS---GVSILGNFQLQNMRVLFNLRRGKITFAKASCDSL 341


>gi|194701538|gb|ACF84853.1| unknown [Zea mays]
 gi|194703714|gb|ACF85941.1| unknown [Zea mays]
          Length = 208

 Score =  140 bits (353), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 91/220 (41%), Positives = 121/220 (55%), Gaps = 14/220 (6%)

Query: 264 MGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIP 323
           MGLG    SLVSQT+   G  FSYCLP T  +  SG L L   ++    ++    T M+ 
Sbjct: 1   MGLGGGAQSLVSQTAGTLGRAFSYCLPPTPSS--SGFLTL--GAAGGSGTSGFVKTPMLR 56

Query: 324 NPQLATFYILNLTGISIGGKQLQ--ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQ 381
           + Q+ TFY + L  I +GG+QL   AS F+ G ++ DSGTVITRLPP+ YSAL + F   
Sbjct: 57  SSQVPTFYGVRLQAIRVGGRQLSIPASVFSAGTVM-DSGTVITRLPPTAYSALSSAFKAG 115

Query: 382 FSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCL 441
              +P A    ILDTCF+ S    V+IP V + F G A +++D +GI+          CL
Sbjct: 116 MKQYPPAQPSGILDTCFDFSGQSSVSIPSVALVFSGGAVVSLDASGIIL-------SNCL 168

Query: 442 ALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
           A A  S +   GIIGN QQ+   V+YD     +GF    C
Sbjct: 169 AFAGNSDDSSLGIIGNVQQRTFEVLYDVGRGVVGFRAGAC 208


>gi|224072901|ref|XP_002303933.1| predicted protein [Populus trichocarpa]
 gi|222841365|gb|EEE78912.1| predicted protein [Populus trichocarpa]
          Length = 370

 Score =  140 bits (353), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 117/401 (29%), Positives = 183/401 (45%), Gaps = 47/401 (11%)

Query: 95  DNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGI-RLQTLNYIATIELGGRNMTVI--VD 151
           D   +Q+L S +             + +P+ SG   +Q+ +YI   ++G    T++  +D
Sbjct: 4   DQARLQFLSSLVAK----------KSVVPIASGRGVIQSPSYIVKAKVGTPPQTLLMALD 53

Query: 152 TGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSP 211
              D  W+   PCK C      VF+   S ++K + C +  C  +      + +C  S+ 
Sbjct: 54  NSYDAAWI---PCKGCVGCSSTVFNTVKSTTFKTLGCGAPQCKQVP-----NPICGGST- 104

Query: 212 PDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDL 271
             C +  +YG  S     L R+ + L    V  + FGC +   G      GL+G GR  L
Sbjct: 105 --CTWNTTYGS-STILSNLTRDTIALSMDPVPYYAFGCIQKATGSSVPPQGLLGFGRGPL 161

Query: 272 SLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFY 331
           S +SQT  ++   FSYCLPS +    SGSL LG      +    I  T ++ NP+ ++ Y
Sbjct: 162 SFLSQTQNLYKSTFSYCLPSFRTLNFSGSLRLGPVGQPPR----IKTTPLLKNPRRSSLY 217

Query: 332 ILNLTGISIGGK--QLQASGFA-----KGGILIDSGTVITRLPPSIYSALKAEFLKQFSG 384
            + L GI +G K   +  S  A       G + DSGTV TRL    Y A++ EF K+  G
Sbjct: 218 YVKLNGIRVGRKIVDIPRSALAFNPTTGAGTIFDSGTVFTRLVAPAYIAVRNEFRKRV-G 276

Query: 385 FPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV-CLAL 443
             +       DTC+++     +  P +   F G   M V +      + S A    CLA+
Sbjct: 277 NATVSSLGGFDTCYSV----PIVPPTITFMFSG---MNVTMPPENLLIHSTAGVTSCLAM 329

Query: 444 ASL--SYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
           A+   +      +I + QQ+N R+++D  NS+LG A E CS
Sbjct: 330 AAAPDNVNSVLNVIASMQQQNHRILFDVPNSRLGVAREQCS 370


>gi|357514989|ref|XP_003627783.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355521805|gb|AET02259.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 416

 Score =  140 bits (353), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 106/348 (30%), Positives = 159/348 (45%), Gaps = 46/348 (13%)

Query: 149 IVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSS 208
           I DTGSD+ W+QC+PCK CYNQ  P F PS S +YK + C+S  C + +           
Sbjct: 103 IADTGSDIVWLQCEPCKECYNQTTPKFKPSKSSTYKNIPCSSDLCKSGQ----------- 151

Query: 209 SSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLF-GGVSGLMGLG 267
                         G+ +   L  E       S    + GCG +N   F G  SG++GLG
Sbjct: 152 -------------QGNLSVDTLTLESSTGHPISFPKTVIGCGTDNTVSFEGASSGIVGLG 198

Query: 268 RSDLSLVSQTSEIFGGLFSYC-LPSTQDAGASGSLILGGNSSVFKN---STPITYTNMIP 323
               SL++Q        FSYC LP+  ++  +  L  G  + V  +   STPI   + I 
Sbjct: 199 GGPASLITQLGSSIDAKFSYCLLPNPVESNTTSKLNFGDTAVVSGDGVVSTPIVKKDPI- 257

Query: 324 NPQLATFYILNLTGISIGGKQLQASGFAKGG----ILIDSGTVITRLPPSIYSALKAEFL 379
                 FY L L   S+G K+++  G + GG    I+IDSGT +T +P  +Y+ L++  L
Sbjct: 258 -----VFYYLTLEAFSVGNKRIEFEGSSNGGHEGNIIIDSGTTLTVIPTDVYNNLESAVL 312

Query: 380 KQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV 439
           +            + + C+++++    + P++   F+G     V +  I  FV      V
Sbjct: 313 ELVKLKRVNDPTRLFNLCYSVTS-DGYDFPIITTHFKG---ADVKLHPISTFVDVADGIV 368

Query: 440 CLALASLSY---EDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
           CLA A+ S     D   I GN  Q+N  V YD +   + F   DCS +
Sbjct: 369 CLAFATTSAFIPSDVVSIFGNLAQQNLLVGYDLQQKIVSFKPTDCSKV 416


>gi|125600538|gb|EAZ40114.1| hypothetical protein OsJ_24557 [Oryza sativa Japonica Group]
          Length = 412

 Score =  140 bits (353), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 113/371 (30%), Positives = 163/371 (43%), Gaps = 57/371 (15%)

Query: 128 IRLQTLNYIATIELGGRNM--TVIVDTGSDLTWVQCQ-PCKSCYNQQDPVFDPSISPSYK 184
           +   T  Y+  I +G   +  T ++DTGSDL W QC  PC+ C+ Q  P++ P+ S +Y 
Sbjct: 85  VHASTATYLVDIAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSATYA 144

Query: 185 KVLCNSSTCHALEFATGNSGVCSSSSPPD--CNYFVSYGDGSYTRGELGREHLGLG-KAS 241
            V C S  C AL+         S  SPPD  C Y+ SYGDG+ T G L  E   LG   +
Sbjct: 145 NVSCRSPMCQALQ------SPWSRCSPPDTGCAYYFSYGDGTSTDGVLATETFTLGSDTA 198

Query: 242 VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSL 301
           V    FGCG  N G     SGL+G+GR  LSLVSQ            L  T+   +  + 
Sbjct: 199 VRGVAFGCGTENLGSTDNSSGLVGMGRGPLSLVSQ------------LGVTRPRRSCRAR 246

Query: 302 ILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL-------QASGFAKGG 354
                      ++P                   L GI++G   L       + +    GG
Sbjct: 247 AAARGGGAPTTTSP-------------------LEGITVGDTLLPIDPAVFRLTPMGDGG 287

Query: 355 ILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI-LDTCFNLSAYQEVNIPLVKM 413
           ++IDSGT  T L    + AL A  L      P A G  + L  CF  ++ + V +P + +
Sbjct: 288 VIIDSGTTFTALEERAFVAL-ARALASRVRLPLASGAHLGLSLCFAAASPEAVEVPRLVL 346

Query: 414 EFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQ 473
            F+G A+M +     V   +S A   CL + S        ++G+ QQ+N  ++YD +   
Sbjct: 347 HFDG-ADMELRRESYVVEDRS-AGVACLGMVS---ARGMSVLGSMQQQNTHILYDLERGI 401

Query: 474 LGFAGEDCSSM 484
           L F    C  +
Sbjct: 402 LSFEPAKCGEL 412


>gi|297805182|ref|XP_002870475.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297316311|gb|EFH46734.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 480

 Score =  139 bits (351), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 110/400 (27%), Positives = 193/400 (48%), Gaps = 56/400 (14%)

Query: 117 VSNTEIPLTSGIRLQTLN-YIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQD- 172
           ++N ++PL    R  ++  Y   I+LG   +   V VDTGSD+ WV C PC  C  + D 
Sbjct: 58  LANIDLPLGGDSRADSIGLYFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCPKCPVKTDL 117

Query: 173 ----PVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRG 228
                ++D   S + K V C  + C  +      S  C +  P  C+Y V YGDGS + G
Sbjct: 118 GIPLSLYDSKASSTSKNVGCEDAFCSFIM----QSETCGAKKP--CSYHVVYGDGSTSDG 171

Query: 229 ELGREHLGLGKASVN--------DFIFGCGRNNKGLFG----GVSGLMGLGRSDLSLVSQ 276
           +  ++++ L + + N        + +FGCG+N  G  G     V G+MG G+S+ S++SQ
Sbjct: 172 DFVKDNITLDQVTGNLRTAPLAQEVVFGCGKNQSGQLGQTESAVDGIMGFGQSNTSVISQ 231

Query: 277 TSEIFGG----LFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYI 332
            +   GG    +FS+CL    +    G   +G   S    +TP     ++PN      Y 
Sbjct: 232 LAA--GGSVKRIFSHCL---DNMNGGGIFAIGEVESPVVKTTP-----LVPN---QVHYN 278

Query: 333 LNLTGISIGGKQLQ-----ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPS 387
           + L G+ + G+ +      AS    GG +IDSGT +  LP ++Y++L    +++ +    
Sbjct: 279 VILKGMDVDGEPIDLPPSLASTNGDGGTIIDSGTTLAYLPQNLYNSL----IEKITAKQQ 334

Query: 388 APGFSILDT--CFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALAS 445
                + +T  CF+ ++  +   P+V + FE + +++V     ++ ++ D          
Sbjct: 335 VKLHMVQETFACFSFTSNTDKAFPVVNLHFEDSLKLSVYPHDYLFSLREDMYCFGWQSGG 394

Query: 446 LSYEDETGII--GNYQQKNQRVIYDTKNSQLGFAGEDCSS 483
           ++ +D   +I  G+    N+ V+YD +N  +G+A  +CSS
Sbjct: 395 MTTQDGADVILLGDLVLSNKLVVYDLENEVIGWADHNCSS 434


>gi|225464832|ref|XP_002272243.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 467

 Score =  139 bits (351), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 129/399 (32%), Positives = 186/399 (46%), Gaps = 63/399 (15%)

Query: 122 IPLTSGIRLQTLNYIATIELGGRNMTVIVDTGSDLTWVQCQP---CKSC-YNQQDP---V 174
           IPL+ G   QTL              +I+DTGSDL W  C     C++C ++  +P   +
Sbjct: 92  IPLSFGTPPQTL-------------PLIMDTGSDLVWFPCTHRYVCRNCSFSTSNPSSNI 138

Query: 175 FDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSP--PDCN-----YFVSYGDGSYTR 227
           F P  S S K + C +  C  +  +   S  C    P  P+C      Y V YG G  T 
Sbjct: 139 FIPKSSSSSKVLGCVNPKCGWIHGSKVQS-RCRDCEPTSPNCTQICPPYLVFYGSG-ITG 196

Query: 228 GELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGL--F 285
           G +  E L L    V +FI GC   +     G+SG    GR   SL SQ      GL  F
Sbjct: 197 GIMLSETLDLPGKGVPNFIVGCSVLSTSQPAGISGF---GRGPPSLPSQL-----GLKKF 248

Query: 286 SYCLPSTQ--DAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLA------TFYILNLTG 337
           SYCL S +  D   S SL+L G S   + +  ++YT  + NP++A       +Y L L  
Sbjct: 249 SYCLLSRRYDDTTESSSLVLDGESDSGEKTAGLSYTPFVQNPKVAGKHAFSVYYYLGLRH 308

Query: 338 ISIGGKQLQ-------ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSG--FPSA 388
           I++GGK ++             GG +IDSGT  T +   I+  + AEF KQ         
Sbjct: 309 ITVGGKHVKIPYKYLIPGADGDGGTIIDSGTTFTYMKGEIFELVAAEFEKQVQSKRATEV 368

Query: 389 PGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALAS--L 446
            G + L  CFN+S     + P + ++F G AEM + +   V F+  D   VCL + +   
Sbjct: 369 EGITGLRPCFNISGLNTPSFPELTLKFRGGAEMELPLANYVAFLGGD-DVVCLTIVTDGA 427

Query: 447 SYEDETG----IIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
           + ++ +G    I+GN+QQ+N  V YD +N +LGF  + C
Sbjct: 428 AGKEFSGGPAIILGNFQQQNFYVEYDLRNERLGFRQQSC 466


>gi|9759039|dbj|BAB09366.1| aspartyl protease-like [Arabidopsis thaliana]
          Length = 478

 Score =  139 bits (351), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 111/400 (27%), Positives = 191/400 (47%), Gaps = 56/400 (14%)

Query: 117 VSNTEIPLTSGIRLQTLN-YIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQD- 172
           ++N ++PL    R  ++  Y   I+LG   +   V VDTGSD+ WV C PC  C  + D 
Sbjct: 55  LANIDLPLGGDSRADSIGLYFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCPKCPVKTDL 114

Query: 173 ----PVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRG 228
                ++D   S + K V C    C  +      S  C +  P  C+Y V YGDGS + G
Sbjct: 115 GIPLSLYDSKTSSTSKNVGCEDDFCSFIM----QSETCGAKKP--CSYHVVYGDGSTSDG 168

Query: 229 ELGREHLGLGKASVN--------DFIFGCGRNNKGLFG----GVSGLMGLGRSDLSLVSQ 276
           +  ++++ L + + N        + +FGCG+N  G  G     V G+MG G+S+ S++SQ
Sbjct: 169 DFIKDNITLEQVTGNLRTAPLAQEVVFGCGKNQSGQLGQTDSAVDGIMGFGQSNTSIISQ 228

Query: 277 TSEIFGG----LFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYI 332
            +   GG    +FS+CL    +    G   +G   S    +TPI     +PN      Y 
Sbjct: 229 LAA--GGSTKRIFSHCL---DNMNGGGIFAVGEVESPVVKTTPI-----VPN---QVHYN 275

Query: 333 LNLTGISIGGKQLQ-----ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPS 387
           + L G+ + G  +      AS    GG +IDSGT +  LP ++Y++L    +++ +    
Sbjct: 276 VILKGMDVDGDPIDLPPSLASTNGDGGTIIDSGTTLAYLPQNLYNSL----IEKITAKQQ 331

Query: 388 APGFSILDT--CFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALAS 445
                + +T  CF+ ++  +   P+V + FE + +++V     ++ ++ D          
Sbjct: 332 VKLHMVQETFACFSFTSNTDKAFPVVNLHFEDSLKLSVYPHDYLFSLREDMYCFGWQSGG 391

Query: 446 LSYEDETGII--GNYQQKNQRVIYDTKNSQLGFAGEDCSS 483
           ++ +D   +I  G+    N+ V+YD +N  +G+A  +CSS
Sbjct: 392 MTTQDGADVILLGDLVLSNKLVVYDLENEVIGWADHNCSS 431


>gi|116789442|gb|ABK25248.1| unknown [Picea sitchensis]
          Length = 366

 Score =  139 bits (350), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 106/357 (29%), Positives = 172/357 (48%), Gaps = 38/357 (10%)

Query: 7   PLTILSLLLPLMVSLFLLAKGAHCFEGKKKLHLHKLQ----WQQKSGSSSSCVSHQKSRI 62
           PL   + LL + + LFL +  +      +    H L      ++   ++      ++++ 
Sbjct: 10  PLLPFTFLLCVGMLLFLQSAQSRPISVPEVPAYHALDVASSLRETDTAAGGAEYKRETKP 69

Query: 63  EMGAITLELKHKNY-----CSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDV 117
                ++E+ H++       +     +  + + +L  + + V+ L+ +I+  ++ N   V
Sbjct: 70  RRSPWSVEVVHRDALLLKNAANATASYERRLKEKLRREAVRVRGLERQIERTLTLNKDPV 129

Query: 118 SNTEI----------PLTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCK 165
           +  E            + SG+   +  Y   I +G   R   +++DTGSD+ W+QC+PC+
Sbjct: 130 NRYENVAEVDADFGGEVVSGMEQGSGEYFTRIGVGTPTREQYMVLDTGSDVAWIQCEPCR 189

Query: 166 SCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSY 225
            CY+Q DP+F+PS S S+  V C+S+ C  L+    +SG         C Y  SYGDGSY
Sbjct: 190 ECYSQADPIFNPSYSASFSTVGCDSAVCSQLDAYDCHSG--------GCLYEASYGDGSY 241

Query: 226 TRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLF 285
           + G    E L  G  SV +   GCG  N GLF G +GL+GLG   LS  +Q     G  F
Sbjct: 242 STGSFATETLTFGTTSVANVAIGCGHKNVGLFIGAAGLLGLGAGALSFPNQIGTQTGHTF 301

Query: 286 SYCLPSTQDAGASGSLILGGNSSVFKNSTPI--TYTNMIPNPQLATFYILNLTGISI 340
           SYCL   +++ +SG L  G        S P+   +T +  NP L TFY L++T ISI
Sbjct: 302 SYCL-VDRESDSSGPLQFG------PKSVPVGSIFTPLEKNPHLPTFYYLSVTAISI 351


>gi|30692930|ref|NP_198475.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|66792626|gb|AAY56415.1| At5g36260 [Arabidopsis thaliana]
 gi|332006680|gb|AED94063.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 482

 Score =  139 bits (350), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 111/400 (27%), Positives = 191/400 (47%), Gaps = 56/400 (14%)

Query: 117 VSNTEIPLTSGIRLQTLN-YIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQD- 172
           ++N ++PL    R  ++  Y   I+LG   +   V VDTGSD+ WV C PC  C  + D 
Sbjct: 59  LANIDLPLGGDSRADSIGLYFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCPKCPVKTDL 118

Query: 173 ----PVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRG 228
                ++D   S + K V C    C  +      S  C +  P  C+Y V YGDGS + G
Sbjct: 119 GIPLSLYDSKTSSTSKNVGCEDDFCSFIM----QSETCGAKKP--CSYHVVYGDGSTSDG 172

Query: 229 ELGREHLGLGKASVN--------DFIFGCGRNNKGLFG----GVSGLMGLGRSDLSLVSQ 276
           +  ++++ L + + N        + +FGCG+N  G  G     V G+MG G+S+ S++SQ
Sbjct: 173 DFIKDNITLEQVTGNLRTAPLAQEVVFGCGKNQSGQLGQTDSAVDGIMGFGQSNTSIISQ 232

Query: 277 TSEIFGG----LFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYI 332
            +   GG    +FS+CL    +    G   +G   S    +TPI     +PN      Y 
Sbjct: 233 LAA--GGSTKRIFSHCL---DNMNGGGIFAVGEVESPVVKTTPI-----VPN---QVHYN 279

Query: 333 LNLTGISIGGKQLQ-----ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPS 387
           + L G+ + G  +      AS    GG +IDSGT +  LP ++Y++L    +++ +    
Sbjct: 280 VILKGMDVDGDPIDLPPSLASTNGDGGTIIDSGTTLAYLPQNLYNSL----IEKITAKQQ 335

Query: 388 APGFSILDT--CFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALAS 445
                + +T  CF+ ++  +   P+V + FE + +++V     ++ ++ D          
Sbjct: 336 VKLHMVQETFACFSFTSNTDKAFPVVNLHFEDSLKLSVYPHDYLFSLREDMYCFGWQSGG 395

Query: 446 LSYEDETGII--GNYQQKNQRVIYDTKNSQLGFAGEDCSS 483
           ++ +D   +I  G+    N+ V+YD +N  +G+A  +CSS
Sbjct: 396 MTTQDGADVILLGDLVLSNKLVVYDLENEVIGWADHNCSS 435


>gi|255576064|ref|XP_002528927.1| pepsin A, putative [Ricinus communis]
 gi|223531629|gb|EEF33456.1| pepsin A, putative [Ricinus communis]
          Length = 493

 Score =  139 bits (350), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 129/434 (29%), Positives = 195/434 (44%), Gaps = 74/434 (17%)

Query: 113 NIKDVSNTEIPLTSGIRLQTLNYIATIELGG---RNMTVIVDTGSDLTWVQCQP--CKSC 167
           ++++     +PL+ G      +Y  +  L     +++++ +DTGSDL W  C+P  C  C
Sbjct: 65  HLRNRHQVSLPLSPGS-----DYTLSFTLNSNPPQHVSLYLDTGSDLVWFPCKPFECILC 119

Query: 168 Y----NQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPP-------DCN- 215
                N       P +S + + V C SS C A       S +C+ +  P       DC+ 
Sbjct: 120 EGKAENTTASTPPPRLSSTARSVHCKSSACSAAHSNLPTSDLCAIADCPLESIETSDCHS 179

Query: 216 -----YFVSYGDGSYTRGELGREHLGLGKA----SVNDFIFGCGRNNKGLFGGVSGLMGL 266
                ++ +YGDGS     L  + + L  A    S+++F FGC            G+ G 
Sbjct: 180 FSCPSFYYAYGDGSLV-ARLYHDSIKLPLATPSLSLHNFTFGCAHT---ALAEPVGVAGF 235

Query: 267 GRSDLSLVSQTSEI---FGGLFSYCLPS----TQDAGASGSLILGGNSS----VFKNSTP 315
           GR  LSL +Q +      G  FSYCL S    +        LILG +      V K+   
Sbjct: 236 GRGVLSLPAQLASFAPQLGNRFSYCLVSHSFNSDRLRLPSPLILGHSDDKEKRVNKDDVQ 295

Query: 316 ITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAK-------GGILIDSGTVITRLPP 368
             YT+M+ NP+   FY + L GISIG K++ A  F K       GG+++DSGT  T LP 
Sbjct: 296 FVYTSMLDNPKHPYFYCVGLEGISIGKKKIPAPEFLKRVDREGSGGVVVDSGTTFTMLPA 355

Query: 369 SIYSALKAEFLKQ----FSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVD 424
           S+Y+++ AEF  +    +         + L  C+       VNIP + + F GN E +V 
Sbjct: 356 SLYNSVVAEFDNRVGRVYERAKEVEDKTGLGPCYYYDTV--VNIPSLVLHFVGN-ESSVV 412

Query: 425 VTGIVYF---------VKSDASQVCLALASLSYEDE-TG----IIGNYQQKNQRVIYDTK 470
           +    YF         V+      CL L +   E E TG     +GNYQQ    V+YD +
Sbjct: 413 LPKKNYFYDFLDGGDGVRRKRRVGCLMLMNGGEEAELTGGPGATLGNYQQHGFEVVYDLE 472

Query: 471 NSQLGFAGEDCSSM 484
             ++GFA   C+S+
Sbjct: 473 QRRVGFARRKCASL 486


>gi|297822477|ref|XP_002879121.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297324960|gb|EFH55380.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 711

 Score =  139 bits (349), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 106/362 (29%), Positives = 167/362 (46%), Gaps = 44/362 (12%)

Query: 135 YIATIELGGRNMTV--IVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
           Y+  +++G     +  ++DTGS++TW QC PC  CY Q  P+FDPS S ++K+  C+  +
Sbjct: 380 YLMKLQVGTPPFEIEAVIDTGSEITWTQCLPCVHCYKQNAPIFDPSKSSTFKEKRCHDHS 439

Query: 193 CHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDF-----IF 247
           C                      Y V Y D +YT+G L  + + +   S   F     I 
Sbjct: 440 CP---------------------YEVDYFDKTYTKGTLATDTVTIHSTSGEPFVMAETII 478

Query: 248 GCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNS 307
           GCGRNN        G +GL    LSL++Q    + GL SYC      AG   S I  G +
Sbjct: 479 GCGRNNSWFRPSFEGFVGLNWGPLSLITQMGGEYPGLMSYCF-----AGNGTSKINFGTN 533

Query: 308 SVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA----KGGILIDSGTVI 363
           ++      ++ T M        FY LNL  +S+G  +++  G      +G I+IDSGT +
Sbjct: 534 AIVGGGGVVS-TTMFVTTARPGFYYLNLDAVSVGDTRIETLGTPFHALEGNIVIDSGTTL 592

Query: 364 TRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTV 423
           T  P S  + ++          P+A        C+  S   E+  P++ M F G A++ +
Sbjct: 593 TYFPESYCNLVRQAVEHVVPAVPAADPTGNDLLCY-YSNTTEI-FPVITMHFSGGADLVL 650

Query: 424 DVTGIVYFVKSDASQV-CLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
           D   +  F++S +  + CLA+   +   E  I GN  Q N  V YD+ +  + F   +CS
Sbjct: 651 DKYNM--FMESYSGGLFCLAIICNNPTQE-AIFGNRAQNNFLVGYDSSSLLVSFKPTNCS 707

Query: 483 SM 484
           ++
Sbjct: 708 AL 709



 Score =  110 bits (275), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 107/369 (28%), Positives = 154/369 (41%), Gaps = 73/369 (19%)

Query: 117 VSNTEI--PLTSGIRLQTLNYIATIELGGRNMTV--IVDTGSDLTWVQCQPCKSCYNQQD 172
           VSNT+   P    +   T  Y+  +++G     V  ++DTGS+L W QC PC  CY+Q+ 
Sbjct: 46  VSNTQAGSPYADTV-FDTYEYLMKLQIGTPPFEVEAVLDTGSELIWTQCLPCLHCYDQKA 104

Query: 173 PVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPD--CNYFVSYGDGSYTRGEL 230
           P+FDPS S ++K+  CN+                     PD  C Y + Y D SYT+G L
Sbjct: 105 PIFDPSKSSTFKETRCNT---------------------PDHSCPYKLVYDDKSYTQGTL 143

Query: 231 GREHLGLGKASVNDF-----IFGCGRNN--KGLFGGVSGLMGLGRSDLSLVSQTSEIFGG 283
             E + +   S   F     I GC RNN   G     SG++GL R  LSL+SQ    + G
Sbjct: 144 ATETVTIHSTSGVPFVMPETIIGCSRNNSGSGFRPSSSGIVGLSRGSLSLISQMGGAYPG 203

Query: 284 LFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGK 343
                     D   S                    T M         Y LNL  +S+G  
Sbjct: 204 ----------DGVVS--------------------TTMFAKTAKRGQYYLNLDAVSVGDT 233

Query: 344 QLQASGFA----KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFN 399
           +++  G       G I+IDSGT +T  P S Y  L  + +++          S  D    
Sbjct: 234 RIETVGTPFHALNGNIVIDSGTPLTYFPVS-YCNLVRKAVERVVTADRVVDPSRNDMLCY 292

Query: 400 LSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQ 459
            S   E+  P++ + F G A++ +D   + Y   +     CLA+   +   +  I GN  
Sbjct: 293 YSNTIEI-FPVITVHFSGGADLVLDKYNM-YMELNRGGVFCLAII-CNNPTQVAIFGNRA 349

Query: 460 QKNQRVIYD 468
           Q N  V YD
Sbjct: 350 QNNFLVGYD 358


>gi|356507997|ref|XP_003522749.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 440

 Score =  139 bits (349), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 129/412 (31%), Positives = 202/412 (49%), Gaps = 43/412 (10%)

Query: 84  WNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG- 142
           W+ +  N    D L  +YL + +        K VS    P+ SG      NY+  ++LG 
Sbjct: 57  WDNRIINMASKDPLRFKYLSTLVGQ------KTVSTA--PIASGQTFNIGNYVVRVKLGT 108

Query: 143 -GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATG 201
            G+ + +++DT +D  +V C  C  C    D  F P  S SY  + C+   C  +   + 
Sbjct: 109 PGQLLFMVLDTSTDEAFVPCSGCTGC---SDTTFSPKASTSYGPLDCSVPQCGQVRGLS- 164

Query: 202 NSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVS 261
               C ++    C++  SY   S++   L ++ L L    + ++ FGC     G      
Sbjct: 165 ----CPATGTGACSFNQSYAGSSFS-ATLVQDSLRLATDVIPNYSFGCVNAITGASVPAQ 219

Query: 262 GLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNM 321
           GL+GLGR  LSL+SQ+   + G+FSYCLPS +    SGSL LG           I  T +
Sbjct: 220 GLLGLGRGPLSLLSQSGSNYSGIFSYCLPSFKSYYFSGSLKLGP----VGQPKSIRTTPL 275

Query: 322 IPNPQLATFYILNLTGISIG-------GKQLQASGFAKGGILIDSGTVITRLPPSIYSAL 374
           + +P   + Y +N TGIS+G        + L  +     G +IDSGTVITR    +Y+A+
Sbjct: 276 LRSPHRPSLYYVNFTGISVGRVLVPFPSEYLGFNPNTGSGTIIDSGTVITRFVEPVYNAV 335

Query: 375 KAEFLKQFSG--FPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEG-NAEMTVDVTGIVYF 431
           + EF KQ  G  F S   F   DTCF +  Y+ +  P + + FEG + ++ ++ + I   
Sbjct: 336 REEFRKQVGGTTFTSIGAF---DTCF-VKTYETL-APPITLHFEGLDLKLPLENSLI--- 387

Query: 432 VKSDASQVCLALASL--SYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
             S  S  CLA+A+   +      +I N+QQ+N R+++DT N+++G A E C
Sbjct: 388 HSSAGSLACLAMAAAPDNVNSVLNVIANFQQQNLRILFDTVNNKVGIAREVC 439


>gi|15226315|ref|NP_180368.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|4510415|gb|AAD21501.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|330252975|gb|AEC08069.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 396

 Score =  139 bits (349), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 106/362 (29%), Positives = 164/362 (45%), Gaps = 44/362 (12%)

Query: 135 YIATIELGGRNMTV--IVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
           Y+  +++G     +  I+DTGS++TW QC PC  CY Q  P+FDPS S ++K+  C+   
Sbjct: 65  YLMKLQVGTPPFEIQAIIDTGSEITWTQCLPCVHCYEQNAPIFDPSKSSTFKEKRCD--- 121

Query: 193 CHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDF-----IF 247
                   G+S          C Y V Y D +YT G L  E + L   S   F     I 
Sbjct: 122 --------GHS----------CPYEVDYFDHTYTMGTLATETITLHSTSGEPFVMPETII 163

Query: 248 GCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNS 307
           GCG NN       SG++GL     SL++Q    + GL SYC         +  +  G N+
Sbjct: 164 GCGHNNSWFKPSFSGMVGLNWGPSSLITQMGGEYPGLMSYCF----SGQGTSKINFGANA 219

Query: 308 SVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA----KGGILIDSGTVI 363
            V  +   +  T M        FY LNL  +S+G  +++  G      +G I+IDSGT +
Sbjct: 220 IVAGDG--VVSTTMFMTTAKPGFYYLNLDAVSVGNTRIETMGTTFHALEGNIVIDSGTTL 277

Query: 364 TRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNI-PLVKMEFEGNAEMT 422
           T  P S  + ++       +   +A        C+N      ++I P++ M F G  ++ 
Sbjct: 278 TYFPVSYCNLVRQAVEHVVTAVRAADPTGNDMLCYN---SDTIDIFPVITMHFSGGVDLV 334

Query: 423 VDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
           +D   + Y   ++    CLA+   S   E  I GN  Q N  V YD+ +  + F+  +CS
Sbjct: 335 LDKYNM-YMESNNGGVFCLAIICNSPTQE-AIFGNRAQNNFLVGYDSSSLLVSFSPTNCS 392

Query: 483 SM 484
           ++
Sbjct: 393 AL 394


>gi|449449334|ref|XP_004142420.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
          Length = 441

 Score =  138 bits (348), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 121/417 (29%), Positives = 197/417 (47%), Gaps = 45/417 (10%)

Query: 80  KIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRL-QTLNYIAT 138
           K + W +        D   +Q+L S +             + +P+ S  +L Q+  ++  
Sbjct: 57  KPLSWADNVLQMQAKDQARLQFLSSLVAR----------RSFVPIASARQLIQSPTFVVR 106

Query: 139 IELGGRNMTVIV--DTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHAL 196
            ++G    T+++  DT +D  W+ C  C  C +    VF    S S++ + C S  C+ +
Sbjct: 107 AKIGTPAQTLLLALDTSNDAAWIPCSGCIGCPSTT--VFSSDKSSSFRPLPCQSPQCNQV 164

Query: 197 EFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGL 256
              +     CS S+   C + ++YG  S    +L +++L L   SV  + FGC R   G 
Sbjct: 165 PNPS-----CSGSA---CGFNLTYG-SSTVAADLVQDNLTLATDSVPSYTFGCIRKATGS 215

Query: 257 FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPI 316
                GL+GLGR  LSL+ Q+  ++   FSYCLPS +    SGSL LG  +   +    I
Sbjct: 216 SVPPQGLLGLGRGPLSLLGQSQSLYQSTFSYCLPSFKSVNFSGSLRLGPVAQPIR----I 271

Query: 317 TYTNMIPNPQLATFYILNLTGISIGGK--QLQASGFA-----KGGILIDSGTVITRLPPS 369
            YT ++ NP+ ++ Y +NL  I +G K   +  S  A       G +IDSGT  TRL   
Sbjct: 272 KYTPLLRNPRRSSLYYVNLISIRVGRKIVDIPPSALAFNSATGAGTVIDSGTTFTRLVAP 331

Query: 370 IYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIV 429
            Y+A++ EF ++     +       DTC+ +     +  P +   F G   M V +    
Sbjct: 332 AYTAVRDEFRRRVGRNVTVSSLGGFDTCYTV----PIISPTITFMFAG---MNVTLPPDN 384

Query: 430 YFVKSDA-SQVCLALASL--SYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSS 483
           + + S A S  CLA+A+   +      +I + QQ+N R+++D  NS++G A E CSS
Sbjct: 385 FLIHSTAGSTTCLAMAAAPDNVNSVLNVIASMQQQNHRILFDIPNSRVGVARESCSS 441


>gi|359806832|ref|NP_001241567.1| uncharacterized protein LOC100819698 precursor [Glycine max]
 gi|255638149|gb|ACU19388.1| unknown [Glycine max]
          Length = 437

 Score =  138 bits (348), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 126/418 (30%), Positives = 196/418 (46%), Gaps = 47/418 (11%)

Query: 80  KIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRL-QTLNYIAT 138
           K + W E        D   +QYL S +             + +P+ SG ++ Q+  YI  
Sbjct: 52  KPMSWEESVLKLQAKDQARMQYLSSLVAR----------RSIVPIASGRQITQSPTYIVK 101

Query: 139 IELGGRNMTVIV--DTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHAL 196
            ++G    T+++  DT +D +WV C  C  C       F P+ S ++KKV C +S C  +
Sbjct: 102 AKIGTPAQTLLLAMDTSNDASWVPCTACVGCSTTTP--FAPAKSTTFKKVGCGASQCKQV 159

Query: 197 EFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGL 256
              T     C  S+   C +  +YG  S     L ++ + L    V  + FGC +   G 
Sbjct: 160 RNPT-----CDGSA---CAFNFTYGTSSVA-ASLVQDTVTLATDPVPAYAFGCIQKVTGS 210

Query: 257 FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPI 316
                GL+GLGR  LSL++QT +++   FSYCLPS +    SGSL LG  +   +    I
Sbjct: 211 SVPPQGLLGLGRGPLSLLAQTQKLYQSTFSYCLPSFKTLNFSGSLRLGPVAQPKR----I 266

Query: 317 TYTNMIPNPQLATFYILNLTGISIGGKQL----QASGF---AKGGILIDSGTVITRLPPS 369
            +T ++ NP+ ++ Y +NL  I +G + +    +A  F      G + DSGTV TRL   
Sbjct: 267 KFTPLLKNPRRSSLYYVNLVAIRVGRRIVDIPPEALAFNANTGAGTVFDSGTVFTRLVEP 326

Query: 370 IYSALKAEFLKQFSGFPSAPGFSI--LDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTG 427
            Y+A++ EF ++ +        S+   DTC+       +  P +   F G   M V +  
Sbjct: 327 AYNAVRNEFRRRIAVHKKLTVTSLGGFDTCYT----APIVAPTITFMFSG---MNVTLPP 379

Query: 428 IVYFVKSDASQV-CLALASL--SYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
               + S A  V CLA+A    +      +I N QQ+N RV++D  NS+LG A E C+
Sbjct: 380 DNILIHSTAGSVTCLAMAPAPDNVNSVLNVIANMQQQNHRVLFDVPNSRLGVARELCT 437


>gi|242089623|ref|XP_002440644.1| hypothetical protein SORBIDRAFT_09g004500 [Sorghum bicolor]
 gi|241945929|gb|EES19074.1| hypothetical protein SORBIDRAFT_09g004500 [Sorghum bicolor]
          Length = 469

 Score =  138 bits (347), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 110/346 (31%), Positives = 166/346 (47%), Gaps = 27/346 (7%)

Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNS 203
           + +T + DTGSDL W +C             + P+ S ++ ++ C+   C AL   + + 
Sbjct: 111 QKLTALADTGSDLIWTKCDAGGGAAWGGSSSYHPNASSTFTRLPCSDRLCAALR--SYSL 168

Query: 204 GVCSSSSPPDCNYFVSYG---DGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGV 260
             C++    +C+Y  +YG   D  +T+G LG E   LG  +V    FGC    +G +G  
Sbjct: 169 ARCAAGGA-ECDYKYAYGLGDDPDFTQGFLGSETFTLGGDAVPGVGFGCTTALEGDYGEG 227

Query: 261 SGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTN 320
           +GL+GLGR  LSLVSQ   +  G F YCL  T DA  +  L+ G  +++      +  T 
Sbjct: 228 AGLVGLGRGPLSLVSQ---LDAGTFMYCL--TADASKASPLLFGALATMTGAGAGVQSTG 282

Query: 321 MIPNPQLATFYILNLTGISIGGKQLQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLK 380
           ++ +    TFY +NL  I+I G    A     GG++ DSGT +T L    Y+  KA FL 
Sbjct: 283 LLAS---TTFYAVNLRSITI-GSATTAGVGGPGGVVFDSGTTLTYLAEPAYTEAKAAFLS 338

Query: 381 QFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVC 440
           Q +      G    + C+       + IP + + F+G A+M + V    Y V+ D   VC
Sbjct: 339 QTTSLTPVEGRYGFEACYEKPDSARL-IPAMVLHFDGGADMALPVAN--YVVEVDDGVVC 395

Query: 441 LAL---ASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSS 483
             +    SLS      IIGN  Q N  V++D + S L F   +C S
Sbjct: 396 WVVQRSPSLS------IIGNIMQMNYLVLHDVRKSVLSFQPANCDS 435


>gi|413947545|gb|AFW80194.1| hypothetical protein ZEAMMB73_386053 [Zea mays]
          Length = 456

 Score =  138 bits (347), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 114/378 (30%), Positives = 176/378 (46%), Gaps = 35/378 (9%)

Query: 124 LTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQD--PVFDPSI 179
           + S I  ++  Y+  + +G     M  I DTGSDL WV C          D   VF PS 
Sbjct: 89  VESKIITRSFEYLMYVNVGTPPAQMLAIADTGSDLVWVNCSSNGGGGGASDGAVVFHPSR 148

Query: 180 SPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG- 238
           S +Y  + C S+ C AL  A+     C + S  +C Y  +YGDGS T G L  E      
Sbjct: 149 STTYSLLSCQSAACQALSQAS-----CDADS--ECQYQYAYGDGSRTIGVLSTETFSFAA 201

Query: 239 -------KASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQ--TSEIFGGLFSYCL 289
                  +  V    FGC   + G F    GL+GLG   LSLVSQ   +      FSYCL
Sbjct: 202 AGGGGEGQVRVPRVSFGCSTGSAGSFRS-DGLVGLGAGALSLVSQLGAAARIARRFSYCL 260

Query: 290 -PSTQDAGASGSLILGGNSSVFKNSTP-ITYTNMIPNPQLATFYILNLTGISIGGKQLQA 347
            P    A +S +L  G  + V   S P    T ++P+ ++ ++Y + L  +++ G+ + +
Sbjct: 261 VPPYAAANSSSTLSFGARAVV---SDPGAASTPLVPS-EVDSYYTVALESVAVAGQDVAS 316

Query: 348 SGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNL---SAYQ 404
           +  ++  I++DSGT +T L P++   L AE  ++     + P   +L  C+++   S  +
Sbjct: 317 ANSSR--IIVDSGTTLTFLDPALLRPLVAELERRIRLPRAQPPEQLLQLCYDVQGKSQAE 374

Query: 405 EVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQR 464
           +  IP V + F G A +T+        ++     +CL L  +S      I+GN  Q+N  
Sbjct: 375 DFGIPDVTLRFGGGASVTLRPENTFSLLEE--GTLCLVLVPVSESQPVSILGNIAQQNFH 432

Query: 465 VIYDTKNSQLGFAGEDCS 482
           V YD     + FA  DC+
Sbjct: 433 VGYDLDARTVTFAAVDCT 450


>gi|357116170|ref|XP_003559856.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 460

 Score =  138 bits (347), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 124/393 (31%), Positives = 189/393 (48%), Gaps = 48/393 (12%)

Query: 123 PLTSGIRL-QTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSI 179
           PL SG +L  T  Y+    LG   + + + VDT +D  WV C  C  C     P F+P+ 
Sbjct: 81  PLASGRQLLHTPTYLVRASLGTPPQRLLLAVDTSNDAAWVPCAGCHGCPTTA-PSFNPAS 139

Query: 180 SPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGK 239
           S +++ V C +  C      +  S   S +S   C + +SYGD S     L +++L +  
Sbjct: 140 SATFRPVPCGAPPCSQAPNPSCTSLAKSKNS---CGFSLSYGDSSLD-ATLSQDNLAVTA 195

Query: 240 --ASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGA 297
               +  + FGC   + G      GL+GLGR  L  V+QT  I+ G FSYCLPS   + A
Sbjct: 196 NGGVIKGYTFGCLTKSNGSAAPAQGLLGLGRGPLGFVAQTKGIYEGTFSYCLPSYYRSAA 255

Query: 298 --SGSLILG--GNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQ--LQASGFA 351
             SGSL LG  G  +  K  T    T ++ +P   + Y + +TG+ IG K   +  S  A
Sbjct: 256 NFSGSLTLGRKGQPAPEKMKT----TPLLASPHRPSLYYVAMTGVRIGKKSVPIPPSALA 311

Query: 352 -----KGGILIDSGTVITRLPPSIYSALKAEFLKQFSG-------------FPSAPGFSI 393
                  G ++DSGT+  RL    Y+A++ E  ++ +G               S  GF  
Sbjct: 312 FDAATGAGTVLDSGTMFARLAQPAYAAVRDEVRRRVAGSLRRRGGGGASVSVSSLGGF-- 369

Query: 394 LDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETG 453
            DTC+N+S    V  P V + F G  E+ +    +V    +  S  CLA+A+   +    
Sbjct: 370 -DTCYNVS---TVAWPAVTLVFGGGMEVRLPEENVV-IRSTYGSTSCLAMAASPADGVNA 424

Query: 454 ---IIGNYQQKNQRVIYDTKNSQLGFAGEDCSS 483
              +IG+ QQ+N RV++D  N+++GFA E C++
Sbjct: 425 ALNVIGSLQQQNHRVLFDVPNARVGFARERCTA 457


>gi|242056193|ref|XP_002457242.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
 gi|241929217|gb|EES02362.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
          Length = 457

 Score =  137 bits (346), Expect = 9e-30,   Method: Compositional matrix adjust.
 Identities = 116/378 (30%), Positives = 175/378 (46%), Gaps = 37/378 (9%)

Query: 124 LTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDP----VFDP 177
           + S I  ++  Y+  + +G     +  I DTGSDL WV C          D     VF P
Sbjct: 92  VESKIITRSFEYLMYVNVGTPPTQLLAIADTGSDLVWVNCSSSGGGLADADAGGNVVFQP 151

Query: 178 SISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHL-- 235
           + S +Y ++ C S+ C AL  A+     C + S  +C Y  SYGDGS T G L  E    
Sbjct: 152 TRSSTYSQLSCQSNACQALSQAS-----CDADS--ECQYQYSYGDGSRTIGVLSTETFSF 204

Query: 236 ----GLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQ---TSEIFGGLFSYC 288
               G G+  V    FGC   + G F    GL+GLG    SLVSQ   T+ I   L SYC
Sbjct: 205 VDGGGKGQVRVPRVNFGCSTASAGTFRS-DGLVGLGAGAFSLVSQLGATTHIDRKL-SYC 262

Query: 289 LPSTQDAGASGSLILGGNSSVFKNSTP-ITYTNMIPNPQLATFYILNLTGISIGGKQLQA 347
           L  + DA +S +L  G  + V   S P    T ++P+  + ++Y + L  +++GG+++  
Sbjct: 263 LIPSYDANSSSTLNFGSRAVV---SEPGAASTPLVPS-DVDSYYTVALESVAVGGQEVAT 318

Query: 348 SGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVN 407
                  I++DSGT +T L P++   L  E  ++       P   +L  C+++    E +
Sbjct: 319 H---DSRIIVDSGTTLTFLDPALLGPLVTELERRIKLQRVQPPEQLLQLCYDVQGKSETD 375

Query: 408 ---IPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQR 464
              IP V + F G A +T+        ++     +CL L  +S      I+GN  Q+N  
Sbjct: 376 NFGIPDVTLRFGGGAAVTLRPENTFSLLQE--GTLCLVLVPVSESQPVSILGNIAQQNFH 433

Query: 465 VIYDTKNSQLGFAGEDCS 482
           V YD     + FA  DC+
Sbjct: 434 VGYDLDARTVTFAAADCA 451


>gi|388502484|gb|AFK39308.1| unknown [Medicago truncatula]
          Length = 425

 Score =  137 bits (346), Expect = 9e-30,   Method: Compositional matrix adjust.
 Identities = 125/435 (28%), Positives = 196/435 (45%), Gaps = 51/435 (11%)

Query: 57  HQKSRIEMGAITLELKH-----KNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMIS 111
           + K  ++    TL++ H       +   K + W E        D   +Q+L S +     
Sbjct: 19  NPKCDVQDNGSTLQVIHVFSPCSPFRPSKPLSWEESVLQMQAKDTTRLQFLDSLVAR--- 75

Query: 112 GNIKDVSNTEIPLTSGIRL-QTLNYIATIELGGRNMTVIV--DTGSDLTWVQCQPCKSCY 168
                   + +P+ SG ++ Q+  YI   ++G    T+++  DT +D  W+ C  C  C 
Sbjct: 76  -------KSIVPIASGRQIIQSPTYIVRAKIGTPPQTLLLAMDTSNDAAWIPCTACDGCA 128

Query: 169 NQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRG 228
           +    +F P  S ++K V C +  C  +     N G   SS     N+ ++YG  S    
Sbjct: 129 ST---LFAPEKSTTFKNVSCAAPECKQVP----NPGCGVSSR----NFNLTYGSSSIA-A 176

Query: 229 ELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYC 288
            L ++ + L    V  + FGC     G      GL+GLGR  LSL+SQT  ++   FSYC
Sbjct: 177 NLVQDTITLATDPVPSYTFGCVSKTTGTSAPPQGLLGLGRGPLSLLSQTQNLYQSTFSYC 236

Query: 289 LPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQAS 348
           LPS +    SGSL LG  +   +    I YT ++ NP+ ++ Y +NL  I +G K +   
Sbjct: 237 LPSFKSLNFSGSLRLGPVAQPKR----IKYTPLLKNPRRSSLYYVNLEAIRVGRKVVDIP 292

Query: 349 GFA-------KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLS 401
             A         G + DSGTV TRL   +Y A++ EF ++     +       DTC+N+ 
Sbjct: 293 PAALAFNPTTGAGTIFDSGTVFTRLVAPVYVAVRDEFRRRVGPKLTVTSLGGFDTCYNV- 351

Query: 402 AYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDA-SQVCLALASL--SYEDETGIIGNY 458
               + +P +   F G   M V +      + S A S  CLA+A    +      +I N 
Sbjct: 352 ---PIVVPTITFIFTG---MNVTLPQDNILIHSTAGSTTCLAMAGAPDNVNSVLNVIANM 405

Query: 459 QQKNQRVIYDTKNSQ 473
           QQ+N RV+YD  NS+
Sbjct: 406 QQQNHRVLYDVPNSR 420


>gi|242086034|ref|XP_002443442.1| hypothetical protein SORBIDRAFT_08g019550 [Sorghum bicolor]
 gi|241944135|gb|EES17280.1| hypothetical protein SORBIDRAFT_08g019550 [Sorghum bicolor]
          Length = 443

 Score =  137 bits (345), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 114/389 (29%), Positives = 168/389 (43%), Gaps = 48/389 (12%)

Query: 124 LTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPC--KSCYNQQDPVFDPSI 179
           +++ +   T  YIA   +G   +    ++DTGS L W QC  C  K C  Q  P F+ S 
Sbjct: 75  VSAPVHWATRQYIAEYMVGDPPQRAEALIDTGSSLIWTQCTACLRKVCVRQDLPYFNASS 134

Query: 180 SPSYKKVLCNSSTCHA--LEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL 237
           S S+  V C    C    L F   +           C + V+YG G    G LG +    
Sbjct: 135 SGSFAPVPCQDKACAGNYLHFCALDG---------TCTFRVTYGAGGII-GFLGTDAFTF 184

Query: 238 GKASVNDFIFGCGRNNK----GLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL-PST 292
                    FGC    +     +  G SGL+GLGR  LSL SQT       FSYCL P  
Sbjct: 185 QSGGAT-LAFGCVSFTRFAAPDVLHGASGLIGLGRGRLSLASQTG---AKRFSYCLTPYF 240

Query: 293 QDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQ---LATFYILNLTGISIGGKQLQ--- 346
            + GAS  L +G  +S+      +     + +P+    +TFY L L GI++G  +L    
Sbjct: 241 HNNGASSHLFVGAAASLSGGGGAVMSMAFVESPKDYPYSTFYYLPLVGITVGETKLAIPS 300

Query: 347 --------ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGF---PSAPGFSILD 395
                     GF +GG++IDSG+  T L    Y  L  E  +Q +G    P       + 
Sbjct: 301 TAFDLQEVEEGFWEGGVIIDSGSPFTSLVEDAYEPLMGELARQLNGSLVPPPGEDDGGMA 360

Query: 396 TCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGII 455
            C        V +P + + F G A+M +      Y+   + S  C+A+     +    II
Sbjct: 361 LCVARGDLDRV-VPTLVLHFSGGADMALPPEN--YWAPLEKSTACMAIVRGYLQS---II 414

Query: 456 GNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
           GN+QQ+N  +++D    +L F   DCS++
Sbjct: 415 GNFQQQNMHILFDVGGGRLSFQNADCSTI 443


>gi|125536419|gb|EAY82907.1| hypothetical protein OsI_38120 [Oryza sativa Indica Group]
          Length = 448

 Score =  137 bits (345), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 101/289 (34%), Positives = 147/289 (50%), Gaps = 23/289 (7%)

Query: 150 VDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSS 209
           VDTGSDL WV+C PC  C     P++DP+ S S  K+ C+S  C AL      S  C S 
Sbjct: 104 VDTGSDLMWVKCSPCNGCNPPPSPLYDPARSRSSGKLPCSSQLCQALGRGRIISDQC-SD 162

Query: 210 SPPDCNYFVSYG-DGSY-TRGELGREHLGLGKASV-NDFIFGCGRNNKG-LFGGVSGLMG 265
            PP C Y  +YG  G + T+G LG E    G   V N+  FG      G  FGG +GL+G
Sbjct: 163 DPPLCGYHYAYGHSGDHSTQGVLGTETFTFGDGYVANNVSFGRSDTIDGSQFGGTAGLVG 222

Query: 266 LGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMI--P 323
           LGR  LSLVSQ   +  G F+YCL +  D     +++ G  +++  ++  ++ T ++  P
Sbjct: 223 LGRGHLSLVSQ---LGAGRFAYCLAA--DPNVYSTILFGSLAALDTSAGDVSSTPLVTNP 277

Query: 324 NPQLATFYILNLTGISIGGKQL--QASGFA-----KGGILIDSGTVITRLPPSIYSALKA 376
            P   T Y +NL GIS+GG +L  +   FA      GG+  DSG + T L  + Y  ++ 
Sbjct: 278 KPDRDTHYYVNLQGISVGGSRLPIKDGTFAINSDGSGGVFFDSGAIDTSLKDAAYQVVRQ 337

Query: 377 EFLKQFSGFPSAPGFSILDTCFNLSAYQEV-NIPLVKMEFEGNAEMTVD 424
               +        G    DTCF  +  Q V  +P + + F+  A+M+++
Sbjct: 338 AITSEIQRLGYDAGD---DTCFVAANQQAVAQMPPLVLHFDDGADMSLN 383


>gi|357465299|ref|XP_003602931.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355491979|gb|AES73182.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 438

 Score =  137 bits (345), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 136/450 (30%), Positives = 215/450 (47%), Gaps = 40/450 (8%)

Query: 44  WQQKSGSSSSCVSHQKSRIEMGAITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQ 103
           +   S ++  C S Q    ++  I +  K   +   K   W+ +  N    D   + YL 
Sbjct: 16  FMSMSNATDPCAS-QPDDSDLNVIPMYGKCSPFNPQKTDSWDNRVLNMASKDPARMSYLS 74

Query: 104 SRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQC 161
           S +        K VS+   P+ SG      NYI  +++G  G+ + +++DT +D  ++  
Sbjct: 75  SLVAQ------KTVSSA--PIASGQAFNIGNYIVRVKIGTPGQLLFMVLDTSTDEAFI-- 124

Query: 162 QPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYG 221
            P   C       F P+ S SY  + C+   C  +   +     C ++    C++  SY 
Sbjct: 125 -PSSGCIGCSATTFSPNASTSYVPLECSVPQCSQVRGLS-----CPATGSGACSFNKSYA 178

Query: 222 DGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIF 281
             +Y+   L ++ L L    +  + FG      G      GL+GLGR  LSL+SQT  ++
Sbjct: 179 GSTYS-ATLVQDSLRLATDVIPSYSFGSINAISGSSIPAQGLLGLGRGPLSLLSQTGSLY 237

Query: 282 GGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIG 341
            G+FSYCLPS +    SGSL LG           I  T ++ NP+  + Y +NLTGI++G
Sbjct: 238 SGVFSYCLPSFKSYYFSGSLKLGP----VGQPKSIRTTPLLRNPRRPSLYFVNLTGITVG 293

Query: 342 G------KQLQASGFAKG-GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSIL 394
                  K+L A     G G +IDSGTVITR    +Y+A++ EF KQ +G  S+ G    
Sbjct: 294 KVNVPFPKELLAFDVNTGSGTIIDSGTVITRFVEPVYNAVRDEFRKQVTGPFSSLG--AF 351

Query: 395 DTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYE---DE 451
           DTCF +  Y+ +  P + + F  + ++ + +   +    S  S  CLA+AS         
Sbjct: 352 DTCF-VKNYETL-APAITLHFT-DLDLKLPLENSLIH-SSSGSLACLAMASTPKNVNYTV 407

Query: 452 TGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
             +I NYQQ+N RV++DT N+++G A E C
Sbjct: 408 LNVIANYQQQNLRVLFDTVNNKVGIARELC 437


>gi|212722026|ref|NP_001131674.1| uncharacterized protein LOC100193034 precursor [Zea mays]
 gi|194692214|gb|ACF80191.1| unknown [Zea mays]
 gi|413946454|gb|AFW79103.1| hypothetical protein ZEAMMB73_752316 [Zea mays]
          Length = 441

 Score =  137 bits (345), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 114/393 (29%), Positives = 178/393 (45%), Gaps = 32/393 (8%)

Query: 107 KNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPC 164
           +  ++  +   S   +P++SG    T  Y   + +G   +  T++ DTGS+LTWV+C   
Sbjct: 63  RQRVAAEVASSSAVSLPMSSGAYAGTGQYFVKVLVGTPAQEFTLVADTGSELTWVKCAGG 122

Query: 165 KSCYNQQDPVFDPSISPSYKKVLCNSSTCH-ALEFATGNSGVCSSSSPPDCNYFVSYGDG 223
            S       VF P  S S+  V C+S TC   + F+  N   CSSS+ P C+Y   Y +G
Sbjct: 123 ASPPGL---VFRPEASKSWAPVPCSSDTCKLDVPFSLAN---CSSSASP-CSYDYRYKEG 175

Query: 224 SY-TRGELGREHLGL----GK-ASVNDFIFGCGRNNKGL-FGGVSGLMGLGRSDLSLVSQ 276
           S    G +G +   +    GK A + D + GC   + G  F  V G++ LG + +S  S+
Sbjct: 176 SAGALGVVGTDSATIALPGGKVAQLQDVVLGCSSTHDGQSFKSVDGVLSLGNAKISFASR 235

Query: 277 TSEIFGGLFSYCL-PSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNL 335
            +  FGG FSYCL        A+G L  G         TP T T +  +P +  FY + +
Sbjct: 236 AAARFGGSFSYCLVDHLAPRNATGYLAFGPGQV---PRTPATQTKLFLDPAM-PFYGVKV 291

Query: 336 TGISIGGKQLQASGFA----KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGF 391
             + + G+ L           GG+++DSGT +T L    Y A+ A   K  +G P    F
Sbjct: 292 DAVHVAGQALDIPAEVWDPKSGGVILDSGTTLTVLATPAYKAVVAALTKLLAGVPKV-DF 350

Query: 392 SILDTCFNLSAYQE--VNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYE 449
              + C+N +A +     IP + ++F G A +       V  VK      C+ L    + 
Sbjct: 351 PPFEHCYNWTAPRPGAPEIPKLAVQFTGCARLEPPAKSYVIDVKPGVK--CIGLQEGEWP 408

Query: 450 DETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
               +IGN  Q+     +D KN ++ F    C+
Sbjct: 409 G-VSVIGNIMQQEHLWEFDLKNMEVRFMPSTCT 440


>gi|356495752|ref|XP_003516737.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 396

 Score =  137 bits (344), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 120/366 (32%), Positives = 171/366 (46%), Gaps = 35/366 (9%)

Query: 134 NYIATIELGGRNMTV--IVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSS 191
           +Y+  + LG   + V  +VDTGSDL W QC PC+ CY Q+ P+F+P  S +Y  + C+S 
Sbjct: 49  DYLMKLTLGTPPVDVYGLVDTGSDLVWAQCTPCQGCYRQKSPMFEPLRSNTYTPIPCDSE 108

Query: 192 TCHALEFATGNSGVCSSSSPPD-CNYFVSYGDGSYTRGELGREHLGLGKAS-----VNDF 245
            C++L           S SP   C Y  +Y D S T+G L RE +           V D 
Sbjct: 109 ECNSL--------FGHSCSPQKLCAYSYAYADSSVTKGVLARETVTFSSTDGEPVVVGDI 160

Query: 246 IFGCGRNNKGLFGGVS-GLMGLGRSDLSLVSQTSEIFGG-LFSYCL-PSTQDAGASGSLI 302
           +FGCG +N G F     G++GLG   LSLVSQ   ++G   FS CL P   D    G++ 
Sbjct: 161 VFGCGHSNSGTFNENDMGIIGLGGGPLSLVSQFGNLYGSKRFSQCLVPFHADPHTLGTIS 220

Query: 303 LGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL---QASGFAKGGILIDS 359
            G  S V  +   +  T ++ + +  T Y++ L GIS+G   +    +   +KG I+IDS
Sbjct: 221 FGDASDV--SGEGVAATPLV-SEEGQTPYLVTLEGISVGDTFVSFNSSEMLSKGNIMIDS 277

Query: 360 GTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNI--PLVKMEFEG 417
           GT  T LP   Y  L  E   Q +  P        D    L    E N+  P++   FEG
Sbjct: 278 GTPATYLPQEFYDRLVKELKVQSNMLPID---DDPDLGTQLCYRSETNLEGPILIAHFEG 334

Query: 418 NAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFA 477
                V +  I  F+       C A+A  +  D   I GN+ Q N  + +D     + F 
Sbjct: 335 ---ADVQLMPIQTFIPPKDGVFCFAMAGTT--DGEYIFGNFAQSNVLIGFDLDRKTVSFK 389

Query: 478 GEDCSS 483
             DCS+
Sbjct: 390 ATDCSN 395


>gi|15234606|ref|NP_194732.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|4938479|emb|CAB43838.1| putative protein [Arabidopsis thaliana]
 gi|7269903|emb|CAB80996.1| putative protein [Arabidopsis thaliana]
 gi|67633774|gb|AAY78811.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332660310|gb|AEE85710.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 424

 Score =  137 bits (344), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 118/374 (31%), Positives = 175/374 (46%), Gaps = 59/374 (15%)

Query: 135 YIATIELGGRNMT--VIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
           ++A I +G   +   +++DTGSDLTW+ C PCK CY Q  P F PS S +Y+   C S+ 
Sbjct: 78  FLANISIGNPPVPQLLLIDTGSDLTWIHCLPCK-CYPQTIPFFHPSRSSTYRNASCVSAP 136

Query: 193 CHALE--FATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL-----GKASVNDF 245
            HA+   F    +G        +C Y + Y D S TRG L  E L       G  S  + 
Sbjct: 137 -HAMPQIFRDEKTG--------NCQYHLRYRDFSNTRGILAEEKLTFETSDDGLISKQNI 187

Query: 246 IFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPS-TQDAGASGSLILG 304
           +FGCG++N G F   SG++GLG    S+V++    FG  FSYC  S T        LILG
Sbjct: 188 VFGCGQDNSG-FTKYSGVLGLGPGTFSIVTRN---FGSKFSYCFGSLTNPTYPHNILILG 243

Query: 305 GNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL--QASGF----AKGGILID 358
             + +  + TP+              Y L+L  IS G K L  +   F    ++GG +ID
Sbjct: 244 NGAKIEGDPTPLQI--------FQDRYYLDLQAISFGEKLLDIEPGTFQRYRSQGGTVID 295

Query: 359 SGTVITRLPPSIYSALKAEF----------LKQFSGFPSAPGFSILDTCFNLSAYQEVNI 408
           +G   T L    Y  L  E           +K +  + + P +   +    L  Y     
Sbjct: 296 TGCSPTILAREAYETLSEEIDFLLGEVLRRVKDWDQY-TTPCY---EGNLKLDLY---GF 348

Query: 409 PLVKMEFEGNAEMTVDVTGIVYFVKSDA-SQVCLALASLSYEDETGIIGNYQQKNQRVIY 467
           P+V   F G AE+ +DV  +  FV S++    CLA+   +++D + +IG   Q+N  V Y
Sbjct: 349 PVVTFHFAGGAELALDVESL--FVSSESGDSFCLAMTMNTFDDMS-VIGAMAQQNYNVGY 405

Query: 468 DTKNSQLGFAGEDC 481
           + +  ++ F   DC
Sbjct: 406 NLRTMKVYFQRTDC 419


>gi|222637181|gb|EEE67313.1| hypothetical protein OsJ_24553 [Oryza sativa Japonica Group]
          Length = 414

 Score =  137 bits (344), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 119/357 (33%), Positives = 167/357 (46%), Gaps = 69/357 (19%)

Query: 167 CYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYT 226
           C  +  P F P+ S ++ K+ C SS C   +F T     C+++    C Y+  YG G +T
Sbjct: 88  CAARPAPPFQPASSSTFSKLPCASSLC---QFLTSPYLTCNATG---CVYYYPYGMG-FT 140

Query: 227 RGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFS 286
            G L  E L +G AS     FGC   N G+    SG++GLGRS LSLVSQ      G FS
Sbjct: 141 AGYLATETLHVGGASFPGVAFGCSTEN-GVGNSSSGIVGLGRSPLSLVSQVGV---GRFS 196

Query: 287 YCLPSTQDAGAS----GSL--ILGGNSSVFKNSTPITYTNMIPNPQL--ATFYILNLTGI 338
           YCL S  DAG S    GSL  + GG SS            ++ NP++  +++Y +NLTGI
Sbjct: 197 YCLRSDADAGDSPILFGSLAKVTGGKSS----------PAILENPEMPSSSYYYVNLTGI 246

Query: 339 SIGGKQLQAS----GFAKG-------GILIDSGTVITRLPPSIYSALKAEFLKQFS---- 383
           ++G   L  +    GF +G       G ++DSGT +T L    Y+ +K  FL Q +    
Sbjct: 247 TVGATDLPVTSTTFGFTRGAGAGLVGGTIVDSGTTLTYLVKEGYAMVKRAFLSQMATANL 306

Query: 384 ---------GFPSAPGFSILDTCFNLSAY---QEVNIPLVKMEFEGNAEMTVDVTGIVYF 431
                    GF         D CF+ +A      V +P + + F G AE  V     V  
Sbjct: 307 TTTVNGTRFGF---------DLCFDANAAGGGSGVPVPTLVLRFAGGAEYAVRRRSYVGV 357

Query: 432 VKSD----ASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
           V+ D    A+  CL +   S +    IIGN  Q +  V+YD       FA  DC+++
Sbjct: 358 VEVDSQGRAAVECLLVLPASEKLSISIIGNVMQMDLHVLYDLDGGMFSFAPADCANV 414


>gi|108707839|gb|ABF95634.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
          Length = 330

 Score =  137 bits (344), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 115/317 (36%), Positives = 162/317 (51%), Gaps = 27/317 (8%)

Query: 173 PVFDPSISPSYKKVLCNSSTCHALEFAT-GNSGVCSSSSPPDCNYFVSYGDGSYTRG--E 229
           P FD S S +     C+S+ C  L  A+ GN+    + +   C Y   Y D S T G  E
Sbjct: 23  PYFDRSTSSTLLLTSCDSTLCQGLLVASCGNTKFWPNQT---CVYTYYYNDKSVTTGLIE 79

Query: 230 LGREHLGLGKASVNDFIFGCGRNNKGLF-GGVSGLMGLGRSDLSLVSQTSEIFGGLFSYC 288
           + +   G G ASV    FGCG  N G+F    +G+ G GR  LSL SQ      G FS+C
Sbjct: 80  VDKFTFGAG-ASVPGVAFGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKV---GNFSHC 135

Query: 289 LPSTQDAGASGSLILGGNSSVFKNST-PITYTNMIPNPQLATFYILNLTGISIGGKQLQA 347
             +      S +++L   + ++KN    +  T +I N    TFY L+L GI++G  +L  
Sbjct: 136 FTAVNGLKQS-TVLLDLPADLYKNGRGAVQSTPLIQNSANPTFYYLSLKGITVGSTRLPV 194

Query: 348 --SGFA----KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILD-TCFNL 400
             S FA     GG +IDSGT IT LPP +Y  ++ EF  Q    P  PG +    TCF+ 
Sbjct: 195 PESAFALTNGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQIK-LPVVPGNATGPYTCFSA 253

Query: 401 SAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDA--SQVCLALASLSYEDETGIIGNY 458
            +  + ++P + + FEG A M +     V+ V  DA  S +CLA   ++  DET IIGN+
Sbjct: 254 PSQAKPDVPKLVLHFEG-ATMDLPRENYVFEVPDDAGNSIICLA---INKGDETTIIGNF 309

Query: 459 QQKNQRVIYDTKNSQLG 475
           QQ+N  V+YD +N   G
Sbjct: 310 QQQNMHVLYDLQNMHRG 326


>gi|222619890|gb|EEE56022.1| hypothetical protein OsJ_04800 [Oryza sativa Japonica Group]
          Length = 423

 Score =  136 bits (343), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 109/383 (28%), Positives = 176/383 (45%), Gaps = 54/383 (14%)

Query: 115 KDVSNTEIPLTSGIRLQTL-NYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQ 171
           K+ +N  +P+  G ++ ++ NYIA   LG   + + V +D  +D  WV C  C  C    
Sbjct: 81  KNRANPPVPIAPGRQILSIPNYIARAGLGTPAQTLLVAIDPSNDAAWVPCSACAGCA-AS 139

Query: 172 DPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELG 231
            P F P+ S +Y+ V C S  C  +   +  +GV SS     C + ++Y   ++ +  LG
Sbjct: 140 SPSFSPTQSSTYRTVPCGSPQCAQVPSPSCPAGVGSS-----CGFNLTYAASTF-QAVLG 193

Query: 232 REHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGL-GRSDLSLVSQTSEIFGGLFSYCLP 290
           ++ L L    V  + FGC R   G     +G   L  R+ L LV+               
Sbjct: 194 QDSLALENNVVVSYTFGCLRVVNGNSRAAAGAHRLRPRAALLLVA--------------- 238

Query: 291 STQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGF 350
              D G  G +   G     K +TP+ Y     NP   + Y +N+ GI +G K +Q    
Sbjct: 239 ---DQGHLGPI---GQPKRIK-TTPLLY-----NPHRPSLYYVNMIGIRVGSKVVQVPQS 286

Query: 351 AKG-------GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAY 403
           A         G +ID+GT+ TRL   +Y+A++  F  +    P AP     DTC+N++  
Sbjct: 287 ALAFNPVTGSGTIIDAGTMFTRLAAPVYAAVRDAFRGRVR-TPVAPPLGGFDTCYNVT-- 343

Query: 404 QEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALA---SLSYEDETGIIGNYQQ 460
             V++P V   F G   +T+    ++    S     CLA+A   S        ++ + QQ
Sbjct: 344 --VSVPTVTFMFAGAVAVTLPEENVMIH-SSSGGVACLAMAAGPSDGVNAALNVLASMQQ 400

Query: 461 KNQRVIYDTKNSQLGFAGEDCSS 483
           +NQRV++D  N ++GF+ E C++
Sbjct: 401 QNQRVLFDVANGRVGFSRELCTA 423


>gi|115483168|ref|NP_001065177.1| Os10g0538200 [Oryza sativa Japonica Group]
 gi|21717168|gb|AAM76361.1|AC074196_19 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433289|gb|AAP54827.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113639786|dbj|BAF27091.1| Os10g0538200 [Oryza sativa Japonica Group]
 gi|215686408|dbj|BAG87693.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 394

 Score =  136 bits (342), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 112/367 (30%), Positives = 171/367 (46%), Gaps = 33/367 (8%)

Query: 131 QTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLC 188
           Q +NY+A   +G   +  + ++D   +L W QC+ C  C+ Q  P+FDP+ S +Y+   C
Sbjct: 47  QAMNYVANFTIGTPPQPASAVIDLAGELVWTQCKQCSRCFEQDTPLFDPTASNTYRAEPC 106

Query: 189 NSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFG 248
            +  C ++   + N   CS +    C Y  S   G  T G++G +   +G A  +   FG
Sbjct: 107 GTPLCESIPSDSRN---CSGNV---CAYQASTNAGD-TGGKVGTDTFAVGTAKAS-LAFG 158

Query: 249 C-GRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNS 307
           C   ++    GG SG++GLGR+  SLV+QT       FSYCL +  DAG + +L LG ++
Sbjct: 159 CVVASDIDTMGGPSGIVGLGRTPWSLVTQTGV---AAFSYCL-APHDAGRNSALFLGSSA 214

Query: 308 SVF----KNSTPITYTNMIPN-PQLATFYILNLTGISIGGKQ--LQASGFAKGGILIDSG 360
            +       STP  + N+  N   L+ +Y + L G+  G     L  SG     +L+D+ 
Sbjct: 215 KLAGGGKAASTP--FVNISGNGNDLSNYYKVQLEGLKAGDAMIPLPPSG---STVLLDTF 269

Query: 361 TVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAE 420
           + I+ L    Y A+K          P A      D CF  S        LV   F G A 
Sbjct: 270 SPISFLVDGAYQAVKKAVTAAVGAPPMATPVEPFDLCFPKSGASGAAPDLV-FTFRGGAA 328

Query: 421 MTVDVTGIVYFVKSDASQVCLAL---ASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFA 477
           MTV  T   Y +      VCLA+   A L+   E  ++G+ QQ+N   ++D     L F 
Sbjct: 329 MTVPATN--YLLDYKNGTVCLAMLSSARLNSTTELSLLGSLQQENIHFLFDLDKETLSFE 386

Query: 478 GEDCSSM 484
             DC+ +
Sbjct: 387 PADCTKL 393


>gi|242079449|ref|XP_002444493.1| hypothetical protein SORBIDRAFT_07g022790 [Sorghum bicolor]
 gi|241940843|gb|EES13988.1| hypothetical protein SORBIDRAFT_07g022790 [Sorghum bicolor]
          Length = 449

 Score =  136 bits (342), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 113/365 (30%), Positives = 175/365 (47%), Gaps = 40/365 (10%)

Query: 147 TVIVDTGSDLTWVQC-------QPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFA 199
           T+IVDTGSDL W QC       +   S   Q++P+++P  S S+  + C+   C   +F+
Sbjct: 98  TLIVDTGSDLIWTQCSMLSRRTRTAASASRQREPLYEPRRSSSFAYLPCSDRLCQEGQFS 157

Query: 200 TGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG-KASVN-DFIFGCGRNNKGLF 257
             N   C+ ++   C Y   YG      G L  E    G  A V+    FGCG  + G  
Sbjct: 158 YKN---CARNN--RCMYDELYGSAE-AGGVLASETFTFGVNAKVSLPLGFGCGALSAGDL 211

Query: 258 GGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSV--FKNSTP 315
            G SGLMGL    +SLVSQ S      FSYCL    +   S  L+ G  + +  ++ +  
Sbjct: 212 VGASGLMGLSPGIMSLVSQLSV---PRFSYCLTPFAERKTS-PLLFGAMADLRRYRTTGT 267

Query: 316 ITYTNMIPNPQLAT-FYILNLTGISIGGKQLQAS----GFAK----GGILIDSGTVITRL 366
           +  T+++ NP + T +Y + L G+S+G K+L       G  K    GG ++DSG+ ++ L
Sbjct: 268 VQTTSILRNPAMETAYYYVPLVGLSLGTKRLDVPATSLGMIKPDGSGGTIVDSGSTMSYL 327

Query: 367 PPSIYSALKAEFLKQFSGFPSAPG----FSILDTCFNLS---AYQEVNIPLVKMEFEGNA 419
             + + A+K   ++     P A G    +   + CF L    A + V  P + + F+G A
Sbjct: 328 EETAFRAVKKAVVEAVR-LPVANGTDEDYDDYELCFALPTGVAMEAVKTPPLVLHFDGGA 386

Query: 420 EMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGE 479
            MT+      YF +  A  +CLA+ +        IIGN QQ+N  V++D +N +  FA  
Sbjct: 387 AMTLPRDN--YFQEPRAGLMCLAVGTSPDGFGVSIIGNVQQQNMHVLFDVRNQKFSFAPT 444

Query: 480 DCSSM 484
            C  +
Sbjct: 445 KCDDI 449


>gi|22165126|gb|AAM93742.1| putative chloroplast nucleoid DNA binding protein [Oryza sativa
           Japonica Group]
 gi|31433307|gb|AAP54836.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
 gi|125575547|gb|EAZ16831.1| hypothetical protein OsJ_32302 [Oryza sativa Japonica Group]
          Length = 405

 Score =  135 bits (341), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 113/369 (30%), Positives = 173/369 (46%), Gaps = 40/369 (10%)

Query: 135 YIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
           Y+A   +G   + ++ +VD   +L W QC PC+ C+ Q  P+FDP+ S +++ + C S  
Sbjct: 57  YVANFTIGTPPQPVSAVVDLTGELVWTQCTPCQPCFEQDLPLFDPTKSSTFRGLPCGSHL 116

Query: 193 CHALEFATGN--SGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGC- 249
           C ++  ++ N  S VC   +P         GD   T G+ G +   +G A      FGC 
Sbjct: 117 CESIPESSRNCTSDVCIYEAP------TKAGD---TGGKAGTDTFAIGAAK-ETLGFGCV 166

Query: 250 GRNNKGL--FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDA----GASGSLIL 303
              +K L   GG SG++GLGR+  SLV+Q +      FSYCL          GA+   + 
Sbjct: 167 VMTDKRLKTIGGPSGIVGLGRTPWSLVTQMNVT---AFSYCLAGKSSGALFLGATAKQLA 223

Query: 304 GG--NSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGGILIDSGT 361
           GG  +S+ F   T    ++   NP    +Y++ L GI  GG  LQA+  +   +L+D+ +
Sbjct: 224 GGKNSSTPFVIKTSAGSSDNGSNP----YYMVKLAGIKTGGAPLQAASSSGSTVLLDTVS 279

Query: 362 VITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEM 421
             + L    Y ALK          P A      D CF  +   +   P +   F+G A +
Sbjct: 280 RASYLADGAYKALKKALTAAVGVQPVASPPKPYDLCFPKAVAGDA--PELVFTFDGGAAL 337

Query: 422 TVDVTGIVYFVKSDASQVCLALASLSYEDETG------IIGNYQQKNQRVIYDTKNSQLG 475
           TV      Y + S    VCL + S +  + TG      I+G+ QQ+N  V++D K   L 
Sbjct: 338 TVPPAN--YLLASGNGTVCLTIGSSASLNLTGELEGASILGSLQQENVHVLFDLKEETLS 395

Query: 476 FAGEDCSSM 484
           F   DCSS+
Sbjct: 396 FKPADCSSL 404


>gi|125532796|gb|EAY79361.1| hypothetical protein OsI_34489 [Oryza sativa Indica Group]
          Length = 405

 Score =  135 bits (340), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 113/369 (30%), Positives = 173/369 (46%), Gaps = 40/369 (10%)

Query: 135 YIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
           Y+A   +G   + ++ +VD   +L W QC PC+ C+ Q  P+FDP+ S +++ + C S  
Sbjct: 57  YVANFTIGTPPQPVSAVVDLTGELVWTQCTPCQPCFEQDLPLFDPTKSSTFRGLPCGSHL 116

Query: 193 CHALEFATGN--SGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGC- 249
           C ++  ++ N  S VC   +P         GD   T G  G +   +G A      FGC 
Sbjct: 117 CESIPESSRNCTSDVCIYEAP------TKAGD---TGGMAGTDTFAIGAAK-ETLGFGCV 166

Query: 250 GRNNKGL--FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDA----GASGSLIL 303
              +K L   GG SG++GLGR+  SLV+Q +      FSYCL          GA+   + 
Sbjct: 167 VMTDKRLKTIGGPSGIVGLGRTPWSLVTQMNVT---AFSYCLAGKSSGALFLGATAKQLA 223

Query: 304 GG--NSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGGILIDSGT 361
           GG  +S+ F   T    ++   NP    +Y++ L GI  GG  LQA+  +   +L+D+ +
Sbjct: 224 GGKNSSTPFVIKTSAGSSDNGSNP----YYMVKLAGIKAGGAPLQAASSSGSTVLLDTVS 279

Query: 362 VITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEM 421
             + L    Y ALK          P A      D CF+ +   +   P +   F+G A +
Sbjct: 280 RASYLADGAYKALKKALTAAVGVQPVASPPKPYDLCFSKAVAGDA--PELVFTFDGGAAL 337

Query: 422 TVDVTGIVYFVKSDASQVCLALASLSYEDETG------IIGNYQQKNQRVIYDTKNSQLG 475
           TV      Y + S    VCL + S +  + TG      I+G+ QQ+N  V++D K   L 
Sbjct: 338 TVPPAN--YLLASGNGTVCLTIGSSASLNLTGELEGASILGSLQQENVHVLFDLKEETLS 395

Query: 476 FAGEDCSSM 484
           F   DCSS+
Sbjct: 396 FKPADCSSL 404


>gi|302143530|emb|CBI22091.3| unnamed protein product [Vitis vinifera]
          Length = 360

 Score =  135 bits (340), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 97/292 (33%), Positives = 149/292 (51%), Gaps = 26/292 (8%)

Query: 214 CNYFVSYGDGSYTRGELGREHLGL------GKAS---VNDFIFGCGRNNKGLFGGVSGLM 264
           C Y+  YGD S T G+   E   +      GK     V + +FGCG  N+GLF G +GL+
Sbjct: 74  CPYYYWYGDSSNTTGDFALETFTVNLTMSSGKPELRRVENVMFGCGHWNRGLFHGAAGLL 133

Query: 265 GLGRSDLSLVSQTSEIFGGLFSYCL-PSTQDAGASGSLILGGNSSVFKNSTPITYTNMIP 323
           GLGR  LS  SQ   ++G  FSYCL     DA  S  LI G +  +  +   + +T ++ 
Sbjct: 134 GLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDANVSSKLIFGEDKDLLSHPE-LNFTTLVA 192

Query: 324 ---NPQLATFYILNLTGISIGG-------KQLQASGFAKGGILIDSGTVITRLPPSIYSA 373
              NP + TFY + +  I +GG       ++ Q +    GG +IDSGT ++      Y  
Sbjct: 193 GKENP-VDTFYYVQIKSIVVGGEVVNIPEEKWQIATDGSGGTIIDSGTTLSYFAEPAYQV 251

Query: 374 LKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVK 433
           +K  F+ +  G+P    F +L+ C+N++  ++ ++P   + F   A     V    YF++
Sbjct: 252 IKEAFMAKVKGYPVVKDFPVLEPCYNVTGVEQPDLPDFGIVFSDGAVWNFPVEN--YFIE 309

Query: 434 SDASQ-VCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
            +  + VCLA+          IIGNYQQ+N  ++YDTK S+LGFA   C+ +
Sbjct: 310 IEPREVVCLAILGTP-PSALSIIGNYQQQNFHILYDTKKSRLGFAPTKCADV 360


>gi|125561849|gb|EAZ07297.1| hypothetical protein OsI_29545 [Oryza sativa Indica Group]
          Length = 451

 Score =  135 bits (339), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 117/408 (28%), Positives = 189/408 (46%), Gaps = 40/408 (9%)

Query: 101 YLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTW 158
           +L +++  ++S     VS  ++ L+    L    +  T+ +G   +   +IVDTGSDL W
Sbjct: 60  WLTAKLAGVLSNRRGGVSPADVRLSP---LSDQGHSLTVGIGTPPQPRKLIVDTGSDLIW 116

Query: 159 VQCQPCKS----CYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDC 214
            QC+   S      +   PV+DP  S ++  + C+   C   +F+  N   C+S +   C
Sbjct: 117 TQCKLSSSTAVAARHGSPPVYDPGESSTFAFLPCSDRLCQEGQFSFKN---CTSKN--RC 171

Query: 215 NYFVSYGDGSYTRGELGREHLGLG--KASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLS 272
            Y   YG  +   G L  E    G  +A      FGCG  + G   G +G++GL    LS
Sbjct: 172 VYEDVYGSAAAV-GVLASETFTFGARRAVSLRLGFGCGALSAGSLIGATGILGLSPESLS 230

Query: 273 LVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNST--PITYTNMIPNPQLATF 330
           L++Q        FSYCL    D   S  L+ G  + + ++ T  PI  T ++ NP    +
Sbjct: 231 LITQLKI---QRFSYCLTPFADKKTS-PLLFGAMADLSRHKTTRPIQTTAIVSNPVKTVY 286

Query: 331 YILNLTGISIGGKQLQ--ASGFAK-----GGILIDSGTVITRLPPSIYSALKAEFLKQFS 383
           Y + L GIS+G K+L   A+  A      GG ++DSG+ +  L  + + A+K E +    
Sbjct: 287 YYVPLVGISLGHKRLAVPAASLAMRPDGGGGTIVDSGSTVAYLVEAAFEAVK-EAVMDVV 345

Query: 384 GFPSA-PGFSILDTCFNL------SAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDA 436
             P A       + CF L      +A + V +P + + F+G A M +      YF +  A
Sbjct: 346 RLPVANRTVEDYELCFVLPRRTAAAAMEAVQVPPLVLHFDGGAAMVLPRDN--YFQEPRA 403

Query: 437 SQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
             +CLA+   +      IIGN QQ+N  V++D ++ +  FA   C  +
Sbjct: 404 GLMCLAVGKTTDGSGVSIIGNVQQQNMHVLFDVQHHKFSFAPTQCDQI 451


>gi|224101015|ref|XP_002312106.1| predicted protein [Populus trichocarpa]
 gi|222851926|gb|EEE89473.1| predicted protein [Populus trichocarpa]
          Length = 440

 Score =  134 bits (338), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 118/368 (32%), Positives = 177/368 (48%), Gaps = 46/368 (12%)

Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNS 203
           +N+T+++DTGS+L+W+ C+  +  +N    +F+P  S +Y K+ C+S TC   E  T + 
Sbjct: 78  QNITMVLDTGSELSWLHCKK-EPNFNS---IFNPLASKTYTKIPCSSPTC---ETRTRDL 130

Query: 204 GVCSSSSPPD-CNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCG----RNNKGLFG 258
            +  S  P   C++ +SY D S   G L  E   +G  +    +FGC      +N     
Sbjct: 131 PLPVSCDPAKLCHFIISYADASSVEGNLAFETFRVGSVTGPATVFGCMDSGFSSNSEEDA 190

Query: 259 GVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITY 318
             +GLMG+ R  LS V+Q        FSYC+    D  +SG L+LG  S  F    P+ Y
Sbjct: 191 KTTGLMGMNRGSLSFVNQMG---FRKFSYCI---SDRDSSGVLLLGEAS--FSWLKPLNY 242

Query: 319 TNMI----PNPQLATF-YILNLTGISIGGK--QLQASGFAK-----GGILIDSGTVITRL 366
           T ++    P P      Y + L GI +  K   L  S F       G  ++DSGT  T L
Sbjct: 243 TPLVEMSTPLPYFDRVAYSVQLEGIRVSDKVLSLPKSVFVPDHTGAGQTMVDSGTQFTFL 302

Query: 367 PPSIYSALKAEFLKQFSGF------PSAPGFSILDTCFNLSAYQEV--NIPLVKMEFEGN 418
              +YSALK EFL Q  G       P       +D C+ +   +    N+P+V + F G 
Sbjct: 303 LGPVYSALKQEFLLQTKGVLRVLNEPRYVFQGAMDLCYLIEPTRAALPNLPVVNLMFRG- 361

Query: 419 AEMTVDVTGIVYFVKSDA----SQVCLALA-SLSYEDETGIIGNYQQKNQRVIYDTKNSQ 473
           AEM+V    ++Y V  +     S  C     S S   E+ +IG++QQ+N  + YD + S+
Sbjct: 362 AEMSVSGQRLLYRVPGEVRGKDSVWCFTFGNSDSLGIESFVIGHHQQQNVWMEYDLEKSR 421

Query: 474 LGFAGEDC 481
           +GFA   C
Sbjct: 422 IGFAEVRC 429


>gi|356555248|ref|XP_003545946.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 453

 Score =  134 bits (338), Expect = 8e-29,   Method: Compositional matrix adjust.
 Identities = 127/476 (26%), Positives = 200/476 (42%), Gaps = 63/476 (13%)

Query: 42  LQWQQ--KSGSSSSCVSHQKSRIEMGAITLELKHKNY----CSGKIVDWNEQQQNRLILD 95
           +QW    K+    +        + + ++ LEL H+++      G  VD  E  +  +  D
Sbjct: 6   MQWNTITKASILVTITLLLILPVAVNSMRLELVHRHHERFAGGGGDVDRVEAVKGFVKRD 65

Query: 96  NLHVQYLQSR---IKNMISGN-----IKDVSNTEIPLTSGIRLQTLNYIATIELG--GRN 145
            L  Q +  R   + N  S           +  E+P+ SG       Y A +++G  G+ 
Sbjct: 66  KLRRQRMNQRWGVVSNYDSRRKGFEMTTTPAEVEMPMHSGRDDALGEYFAEVKVGSPGQR 125

Query: 146 MTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGV 205
             ++VDTGS+ TW+ C                  S S++ V C S  C        +  V
Sbjct: 126 FWLVVDTGSEFTWLNC------------------SKSFEAVTCASRKCKVDLSELFSLSV 167

Query: 206 CSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG-----KASVNDFIFGCGRNNKGLFGGV 260
           C   S P C Y +SY DGS  +G  G + + +G     +  +N+   GC    K +  GV
Sbjct: 168 CPKPSDP-CLYDISYADGSSAKGFFGTDSITVGLTNGKQGKLNNLTIGC---TKSMLNGV 223

Query: 261 S------GLMGLGRSDLSLVSQTSEIFGGLFSYCL-PSTQDAGASGSLILGGNSSVFKNS 313
           +      G++GLG +  S + + +  +G  FSYCL         S +L +GG+ +  K  
Sbjct: 224 NFNEETGGILGLGFAKDSFIDKAANKYGAKFSYCLVDHLSHRSVSSNLTIGGHHNA-KLL 282

Query: 314 TPITYTNMIPNPQLATFYILNLTGISIGGKQL----QASGF-AKGGILIDSGTVITRLPP 368
             I  T +I  P    FY +N+ GISIGG+ L    Q   F A+GG LIDSGT +T L  
Sbjct: 283 GEIRRTELILFPP---FYGVNVVGISIGGQMLKIPPQVWDFNAEGGTLIDSGTTLTSLLL 339

Query: 369 SIYSALKAEFLKQFSGFPSAPG--FSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVT 426
             Y A+     K  +      G  F  L+ CF+   + +  +P +   F G A     V 
Sbjct: 340 PAYEAVFEALTKSLTKVKRVTGEDFDALEFCFDAEGFDDSVVPRLVFHFAGGARFEPPVK 399

Query: 427 GIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
              Y +       C+ +  +       +IGN  Q+N    +D   + +GFA   C+
Sbjct: 400 S--YIIDVAPLVKCIGIVPIDGIGGASVIGNIMQQNHLWEFDLSTNTVGFAPSTCT 453


>gi|326523839|dbj|BAJ96930.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 473

 Score =  134 bits (338), Expect = 8e-29,   Method: Compositional matrix adjust.
 Identities = 108/398 (27%), Positives = 175/398 (43%), Gaps = 37/398 (9%)

Query: 111 SGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCY 168
           +  + + S   +PLTSG    T  Y     +G   +   ++ DTGSDLTWV+C+  ++  
Sbjct: 86  TAPMPEASAFAMPLTSGAYTGTGQYFVQFRVGTPAQPFVLVADTGSDLTWVKCRGRRASS 145

Query: 169 NQQDP-----VFDPSISPSYKKVLCNSSTCHA-LEFATGNSGVCSS--SSPPDCNYFVSY 220
               P     VF P+ S S+  + C+S TC + + F+  N   CS+  + P  C Y   Y
Sbjct: 146 PDASPLASPRVFRPANSKSWAPIPCSSDTCKSYVPFSLAN---CSAGTTPPAPCGYDYRY 202

Query: 221 GDGSYTRGELGREHLGLG--------KASVNDFIFGCGRNNKGL-FGGVSGLMGLGRSDL 271
            D S  RG +G +   +         KA + + + GC  +  G  F    G++ LG S++
Sbjct: 203 KDKSSARGVVGTDAATIALSGSGSDRKAKLQEVVLGCTTSYDGQSFQSSDGVLSLGNSNI 262

Query: 272 SLVSQTSEIFGGLFSYCL-PSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATF 330
           S  S+ +  FGG FSYCL        A+  L  G   +    S     T ++ + Q+A F
Sbjct: 263 SFASRAAARFGGRFSYCLVDHLAPRNATSYLTFGPVGAAHSPSR----TPLLLDAQVAPF 318

Query: 331 YILNLTGISIGGKQLQASGFA-----KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGF 385
           Y + +  +S+ GK L            GG ++DSGT +T L    Y A+ A   KQ +  
Sbjct: 319 YAVTVDAVSVAGKALNIPAEVWDVKKNGGAILDSGTSLTILATPAYKAVVAALSKQLARV 378

Query: 386 PSAPGFSILDTCFNLSAYQE-VNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALA 444
           P        + C+N +A +    +P +++ F G+A +        Y + +     C+ L 
Sbjct: 379 PRV-TMDPFEYCYNWTATRRPPAVPRLEVRFAGSARLRPPTKS--YVIDAAPGVKCIGLQ 435

Query: 445 SLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
              +     +IGN  Q+     +D  N  L F    C+
Sbjct: 436 EGVWPG-VSVIGNILQQEHLWEFDLANRWLRFQESRCA 472


>gi|357132618|ref|XP_003567926.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 468

 Score =  134 bits (337), Expect = 9e-29,   Method: Compositional matrix adjust.
 Identities = 110/390 (28%), Positives = 176/390 (45%), Gaps = 42/390 (10%)

Query: 122 IPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQ-----DPV 174
           +PLTSG    T  Y   + +G   +   ++ DTGSDLTWV+C    S  +         V
Sbjct: 91  MPLTSGAYTGTGQYFVRLRVGTPAQPFVLVADTGSDLTWVKCSSPSSSSSSPAASPPQRV 150

Query: 175 FDPSISPSYKKVLCNSSTCHA-LEFATGNSGVCSSSSPPD-CNYFVSYGDGSYTRGELGR 232
           F P+ S S+  + C+S TC + + F+  N      SSPPD C+Y   Y D S  RG +G 
Sbjct: 151 FRPAGSKSWSPLPCDSDTCKSYVPFSLANC-----SSPPDPCSYDYRYKDNSSARGVVGL 205

Query: 233 EHL--------GLGKASVNDFIFGCGRNNKGL-FGGVSGLMGLGRSDLSLVSQTSEIFGG 283
           +          G  KA + + + GC  +  G  F    G++ LG S++S  S+ +  FGG
Sbjct: 206 DSATVSLSGNDGTRKAKLQEVVLGCTTSYDGQSFKSSDGVLSLGNSNISFASRAASRFGG 265

Query: 284 LFSYCLPSTQDAGASGSLILGGN------SSVFKNSTPITYTNMIPNPQLATFYILNLTG 337
            FSYCL        + S +  GN             TP+    ++ + +   FY +++  
Sbjct: 266 RFSYCLVDHLAPRNATSFLTFGNGDSSPGDDSSSRRTPLV---LLEDARTRPFYFVSVDA 322

Query: 338 ISIGGKQLQ----ASGFAK-GGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFS 392
           +++ G++L+       F K GG ++DSGT +T L    Y A+     KQF+G P      
Sbjct: 323 VTVAGERLEILPDVWDFRKNGGAILDSGTSLTILATPAYDAVVKAISKQFAGVPRV-NMD 381

Query: 393 ILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDET 452
             + C+N +      IP +++ F G A  T+   G  Y + +     C+ +   ++    
Sbjct: 382 PFEYCYNWTGVS-AEIPRMELRFAGAA--TLAPPGKSYVIDTAPGVKCIGVVEGAWPG-V 437

Query: 453 GIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
            +IGN  Q+     +D  N  L F    C+
Sbjct: 438 SVIGNILQQEHLWEFDLANRWLRFKQSRCA 467


>gi|218192707|gb|EEC75134.1| hypothetical protein OsI_11325 [Oryza sativa Indica Group]
          Length = 401

 Score =  134 bits (337), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 105/302 (34%), Positives = 149/302 (49%), Gaps = 22/302 (7%)

Query: 130 LQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVL 187
           + T  Y+  + +G   + + + +DTGSDL W QCQPC +C++Q  P FDPS S +     
Sbjct: 77  VPTTEYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTS 136

Query: 188 CNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL--GKASVNDF 245
           C+S+ C  L  A+  S     +    C Y  SYGD S T G L  +        ASV   
Sbjct: 137 CDSTLCQGLPVASCGSPKFWPNQ--TCVYTYSYGDKSVTTGFLEVDKFTFVGAGASVPGV 194

Query: 246 IFGCGRNNKGLF-GGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILG 304
            FGCG  N G+F    +G+ G GR  LSL SQ      G FS+C  +      S +++L 
Sbjct: 195 AFGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKV---GNFSHCFTAVNGLKPS-TVLLD 250

Query: 305 GNSSVFKNST-PITYTNMIPNPQLATFYILNLTGISIGGKQLQA--SGFA----KGGILI 357
             + ++K+    +  T +I NP   TFY L+L GI++G  +L    S FA     GG +I
Sbjct: 251 LPADLYKSGRGAVQSTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFALKNGTGGTII 310

Query: 358 DSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVN--IPLVKMEF 415
           DSGT +T LP  +Y  ++  F  Q    P   G +  D  F LSA       +P + + F
Sbjct: 311 DSGTAMTSLPTRVYRLVRDAFAAQVK-LPVVSG-NTTDPYFCLSAPLRAKPYVPKLVLHF 368

Query: 416 EG 417
           EG
Sbjct: 369 EG 370


>gi|15226317|ref|NP_180370.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|4063755|gb|AAC98463.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|20197953|gb|AAM15327.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|330252977|gb|AEC08071.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 392

 Score =  134 bits (337), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 105/350 (30%), Positives = 159/350 (45%), Gaps = 52/350 (14%)

Query: 150 VDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSS 209
           +DTGSDL W QC PC +CY+Q  P+FDPS S ++K+  CN ++CH               
Sbjct: 78  IDTGSDLIWTQCMPCTNCYSQYAPIFDPSNSSTFKEKRCNGNSCH--------------- 122

Query: 210 SPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIF-----GCGRNNKGLFGGVSGLM 264
                 Y + Y D +Y++G L  E + +   S   F+      GCG N+       SG++
Sbjct: 123 ------YKIIYADTTYSKGTLATETVTIHSTSGEPFVMPETTIGCGHNSSWFKPTFSGMV 176

Query: 265 GLGRSDLSLVSQTSEIFGGLFSYCLPS--TQDAGASGSLILGGNSSVFKNSTPITYTNMI 322
           GL     SL++Q    + GL SYC  S  T       + I+ G+  V   ST +  T   
Sbjct: 177 GLSWGPSSLITQMGGEYPGLMSYCFASQGTSKINFGTNAIVAGDGVV---STTMFLTTAK 233

Query: 323 PNPQLATFYILNLTGISIGGKQLQASGFA----KGGILIDSGTVITRLPPSIYSALKAEF 378
           P       Y LNL  +S+G   ++  G      +G I+IDSGT +T  P S Y  L  E 
Sbjct: 234 PG-----LYYLNLDAVSVGDTHVETMGTTFHALEGNIIIDSGTTLTYFPVS-YCNLVREA 287

Query: 379 LKQFSGFPSAPGFSILDTCFN--LSAYQE-VNI-PLVKMEFEGNAEMTVDVTGIVYFVKS 434
           +  +            D   N  L  Y + ++I P++ M F G A++ +D   + Y    
Sbjct: 288 VDHY-----VTAVRTADPTGNDMLCYYTDTIDIFPVITMHFSGGADLVLDKYNM-YIETI 341

Query: 435 DASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
                CLA+   +   +  I GN  Q N  V YD+ +  + F+  +CS++
Sbjct: 342 TRGTFCLAIIC-NNPPQDAIFGNRAQNNFLVGYDSSSLLVSFSPTNCSAL 390


>gi|125532792|gb|EAY79357.1| hypothetical protein OsI_34486 [Oryza sativa Indica Group]
          Length = 396

 Score =  134 bits (337), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 103/383 (26%), Positives = 174/383 (45%), Gaps = 25/383 (6%)

Query: 113 NIKDVSNTEIP--LTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQC-QPCKSC 167
            + D + T  P  +T  +      Y+  + +G   + ++ I+D G +L W QC Q C+ C
Sbjct: 27  ELADDATTARPGGVTVPVHFSQAFYVVNLTIGTPPQPVSAIIDIGGELVWTQCAQHCRRC 86

Query: 168 YNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTR 227
           + Q  P+FD + S +++   C ++ C ++   +       +       Y  S   G  T 
Sbjct: 87  FKQDLPLFDTNASSTFRPEPCGAAVCESIPTRSCAGDGGGACG-----YEASTSFG-RTV 140

Query: 228 GELGREHLGLGKASVNDFIFGCG-RNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFS 286
           G +G + + +G A+     FGC   +      G SG +GLGR++LSL +Q +      FS
Sbjct: 141 GRIGTDAVAIGTAATARLAFGCAVASEMDTMWGSSGSVGLGRTNLSLAAQMNAT---AFS 197

Query: 287 YCLPSTQDAGASGSLILGGNSSVF-----KNSTPITYTNMIPNPQLATFYILNLTGISIG 341
           YCL +  D G S +L LG ++ +        +TP   T+  P+  L+  Y+L L  I  G
Sbjct: 198 YCL-APPDTGKSSALFLGASAKLAGAGKGAGTTPFVKTSTPPHSGLSRSYLLRLEAIRAG 256

Query: 342 GKQLQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLS 401
              + A   +   I++ + T +T L  S+Y  L+          P  P     D CF   
Sbjct: 257 NATI-AMPQSGNTIMVSTATPVTALVDSVYRDLRKAVADAVGAAPVPPPVQNYDLCFP-K 314

Query: 402 AYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQK 461
           A      P + + F+G AEMTV V+  ++   +D +  C+A+          I+G+ QQ 
Sbjct: 315 ASASGGAPDLVLAFQGGAEMTVPVSSYLFDAGNDTA--CVAILGSPALGGVSILGSLQQV 372

Query: 462 NQRVIYDTKNSQLGFAGEDCSSM 484
           N  +++D     L F   DCS++
Sbjct: 373 NIHLLFDLDKETLSFEPADCSAL 395


>gi|297803034|ref|XP_002869401.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
 gi|297315237|gb|EFH45660.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
          Length = 438

 Score =  134 bits (337), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 105/376 (27%), Positives = 174/376 (46%), Gaps = 57/376 (15%)

Query: 135 YIATIELGGRNMTVIV--DTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSS- 191
           ++  I +G   +T ++  DT SDL W+QC+PC +CY Q  P+FDPS S +++   C +S 
Sbjct: 85  FLVNISIGSPPVTQLLHMDTASDLLWLQCRPCINCYAQSLPIFDPSRSYTHRNESCRTSQ 144

Query: 192 -TCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG-------KASVN 243
            +  +L F         ++    C Y + Y DG+ ++G L +E L           A+++
Sbjct: 145 YSMPSLRF---------NAKTRSCEYSMRYMDGTGSKGILAKEMLMFNTIYDESSSAALH 195

Query: 244 DFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLIL 303
           D +FGCG +N G     +G++GLG  + SLV +    FG  FSYC  S  D     ++++
Sbjct: 196 DVVFGCGHDNYGEPLVGTGILGLGYGEFSLVHR----FGTKFSYCFGSLDDPSYPHNVLV 251

Query: 304 GGN--SSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA--------KG 353
            G+  +++  ++TP+   N         FY + +  IS+ G  L    +          G
Sbjct: 252 LGDDGANILGDTTPLEIYN--------GFYYVTIEAISVDGIILPIDPWVFNRNHQTGLG 303

Query: 354 GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQ--------E 405
           G +ID+G  +T L    Y  LK +    F G  +A   +  D  F +  Y         E
Sbjct: 304 GTIIDTGNSLTSLVEEAYKPLKNKIEDYFEGRFTAADVN-QDDMFKVECYNGNLERDLVE 362

Query: 406 VNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRV 465
              P+V   F   AE+++DV  +  F+K   +  CLA+           IG   Q++  +
Sbjct: 363 SGFPIVTFHFSDGAELSLDVKSV--FMKLSPNVFCLAVTP----GNMNSIGATAQQSYNI 416

Query: 466 IYDTKNSQLGFAGEDC 481
            YD +  ++ F   DC
Sbjct: 417 GYDLEAKKISFERIDC 432


>gi|125532788|gb|EAY79353.1| hypothetical protein OsI_34482 [Oryza sativa Indica Group]
          Length = 394

 Score =  134 bits (337), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 111/367 (30%), Positives = 171/367 (46%), Gaps = 33/367 (8%)

Query: 131 QTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLC 188
           Q +NY+A   +G   +  + ++D   +L W QC+ C  C+ Q  P+FDP+ S +Y+   C
Sbjct: 47  QAMNYVANFTIGTPPQPASAVIDLAGELVWTQCKQCSRCFEQDTPLFDPTASNTYRAEPC 106

Query: 189 NSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFG 248
            +  C ++   + N   CS +    C Y  S   G  T G++G +   +G A  +   FG
Sbjct: 107 GTPLCESIPSDSRN---CSGNV---CAYQASTNAGD-TGGKVGTDTFAVGTAKAS-LAFG 158

Query: 249 C-GRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNS 307
           C   ++    GG SG++GLGR+  SLV+QT       FSYCL +  DAG + +L LG ++
Sbjct: 159 CVVASDIDTMGGPSGIVGLGRTPWSLVTQTGV---AAFSYCL-APHDAGKNSALFLGSSA 214

Query: 308 SVF----KNSTPITYTNMIPNP-QLATFYILNLTGISIGGKQ--LQASGFAKGGILIDSG 360
            +       STP  + N+  N   L+ +Y + L G+  G     L  SG     +L+D+ 
Sbjct: 215 KLAGGGKAASTP--FVNISGNGNDLSNYYKVQLEGLKAGDAMIPLPPSG---STVLLDTF 269

Query: 361 TVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAE 420
           + I+ L    Y A+K          P A      D CF  S        LV   F G A 
Sbjct: 270 SPISFLVDGAYQAVKKAVTVAVGAPPMATPVEPFDLCFPKSGASGAAPDLV-FTFRGGAA 328

Query: 421 MTVDVTGIVYFVKSDASQVCLAL---ASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFA 477
           MTV  +   Y +      VCLA+   A L+   E  ++G+ QQ+N   ++D     L F 
Sbjct: 329 MTVAASN--YLLDYKNGTVCLAMLSSARLNSTTELSLLGSLQQENIHFLFDLDKETLSFE 386

Query: 478 GEDCSSM 484
             DC+ +
Sbjct: 387 PADCTKL 393


>gi|449527515|ref|XP_004170756.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial [Cucumis
           sativus]
          Length = 364

 Score =  134 bits (336), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 114/375 (30%), Positives = 185/375 (49%), Gaps = 35/375 (9%)

Query: 122 IPLTSGIRL-QTLNYIATIELGGRNMTVIV--DTGSDLTWVQCQPCKSCYNQQDPVFDPS 178
           +P+ S  +L Q+  ++   ++G    T+++  DT +D  W+ C  C  C +    VF   
Sbjct: 12  VPIASARQLIQSPTFVVRAKIGTPAQTLLLALDTSNDAAWIPCSGCIGCPSTT--VFSSD 69

Query: 179 ISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG 238
            S S++ + C S  C+ +   +     CS S+   C + ++YG  S    +L +++L L 
Sbjct: 70  KSSSFRPLPCQSPQCNQVPNPS-----CSGSA---CGFNLTYG-SSTVAADLVQDNLTLA 120

Query: 239 KASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGAS 298
             SV  + FGC R   G      GL+GLGR  LSL+ Q+  ++   FSYCLPS +    S
Sbjct: 121 TDSVPSYTFGCIRKATGSSVPPQGLLGLGRGPLSLLGQSQSLYQSTFSYCLPSFKSVNFS 180

Query: 299 GSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGK--QLQASGFA----- 351
           GSL LG  +   +    I YT ++ NP+ ++ Y +NL  I +G K   +  S  A     
Sbjct: 181 GSLRLGPVAQPIR----IKYTPLLRNPRRSSLYYVNLISIRVGRKIVDIPPSALAFNSAT 236

Query: 352 KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLV 411
             G +IDSGT  TRL    Y+A++ EF ++     +       DTC+ +     +  P +
Sbjct: 237 GAGTVIDSGTTFTRLVAPAYTAVRDEFRRRVGRNVTVSSLGGFDTCYTV----PIISPTI 292

Query: 412 KMEFEGNAEMTVDVTGIVYFVKSDA-SQVCLALASL--SYEDETGIIGNYQQKNQRVIYD 468
              F G   M V +    + + S + S  CLA+A+   +      +I + QQ+N R+++D
Sbjct: 293 TFMFAG---MNVTLPPDNFLIHSTSGSTTCLAMAAAPDNVNSVLNVIASMQQQNHRILFD 349

Query: 469 TKNSQLGFAGEDCSS 483
             NS++G A E CSS
Sbjct: 350 IPNSRVGVARESCSS 364


>gi|21717157|gb|AAM76350.1|AC074196_8 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433294|gb|AAP54832.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
          Length = 396

 Score =  134 bits (336), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 104/383 (27%), Positives = 173/383 (45%), Gaps = 25/383 (6%)

Query: 113 NIKDVSNTEIP--LTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQC-QPCKSC 167
            + D + T  P  +T  +      Y+  + +G   + ++ I+D G +L W QC Q C+ C
Sbjct: 27  ELADDATTARPGGVTVPVHFSQAFYVVNLTIGTPPQPVSAIIDIGGELVWTQCAQHCRRC 86

Query: 168 YNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTR 227
           + Q  P+FD + S +++   C ++ C ++   +       +       Y  S   G  T 
Sbjct: 87  FKQDLPLFDTNASSTFRPEPCGAAVCESIPTRSCAGDGGGACG-----YEASTSFG-RTV 140

Query: 228 GELGREHLGLGKASVNDFIFGCG-RNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFS 286
           G +G + + +G A+     FGC   +      G SG +GLGR++LSL +Q +      FS
Sbjct: 141 GRIGTDAVAIGTAATARLAFGCAVASEMDTMWGSSGSVGLGRTNLSLAAQMNAT---AFS 197

Query: 287 YCLPSTQDAGASGSLILGGNSSVF-----KNSTPITYTNMIPNPQLATFYILNLTGISIG 341
           YCL +  D G S +L LG ++ +        +TP   T+  PN  L+  Y+L L  I  G
Sbjct: 198 YCL-APPDTGKSSALFLGASAKLAGAGKGAGTTPFVKTSTPPNSGLSRSYLLRLEAIRAG 256

Query: 342 GKQLQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLS 401
              + A   +   I + + T +T L  S+Y  L+          P  P     D CF   
Sbjct: 257 NATI-AMPQSGNTITVSTATPVTALVDSVYRDLRKAVADAVGAAPVPPPVQNYDLCFP-K 314

Query: 402 AYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQK 461
           A      P + + F+G AEMTV V+  ++   +D +  C+A+          I+G+ QQ 
Sbjct: 315 ASASGGAPDLVLAFQGGAEMTVPVSSYLFDAGNDTA--CVAILGSPALGGVSILGSLQQV 372

Query: 462 NQRVIYDTKNSQLGFAGEDCSSM 484
           N  +++D     L F   DCS++
Sbjct: 373 NIHLLFDLDKETLSFEPADCSAL 395


>gi|110740049|dbj|BAF01928.1| nucleoid DNA-binding protein cnd41 - like protein [Arabidopsis
           thaliana]
          Length = 183

 Score =  134 bits (336), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 71/156 (45%), Positives = 96/156 (61%), Gaps = 4/156 (2%)

Query: 329 TFYILNLTGISIGGKQLQ--ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFP 386
           +FY LN+  I++GG++L   ++ F+  G LIDSGTVITRLPP  Y+AL++ F  + S +P
Sbjct: 30  SFYGLNIVAITVGGQKLPIPSTVFSTPGALIDSGTVITRLPPKAYAALRSSFKAKMSKYP 89

Query: 387 SAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASL 446
           +  G SILDTCF+LS ++ V IP V   F G A + +   GI Y  K   SQVCLA A  
Sbjct: 90  TTSGVSILDTCFDLSGFKTVTIPKVAFSFSGGAVVELGSKGIFYVFK--ISQVCLAFAGN 147

Query: 447 SYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
           S +    I GN QQ+   V+YD    ++GFA   CS
Sbjct: 148 SDDSNAAIFGNVQQQTLEVVYDGAGGRVGFAPNGCS 183


>gi|115483166|ref|NP_001065176.1| Os10g0537800 [Oryza sativa Japonica Group]
 gi|21717159|gb|AAM76352.1|AC074196_10 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433285|gb|AAP54823.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113639785|dbj|BAF27090.1| Os10g0537800 [Oryza sativa Japonica Group]
 gi|215692411|dbj|BAG87831.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 394

 Score =  134 bits (336), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 112/367 (30%), Positives = 170/367 (46%), Gaps = 33/367 (8%)

Query: 131 QTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLC 188
           Q +NY+A   +G   +  + ++D   +L W QC+ C  C+ Q  P+FDP+ S +Y+   C
Sbjct: 47  QAMNYVANFTIGTPPQPASAVIDLAGELVWTQCKQCGRCFEQGTPLFDPTASNTYRAEPC 106

Query: 189 NSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFG 248
            +  C ++     N   CS +    C Y  S   G  T G++G +   +G A  +   FG
Sbjct: 107 GTPLCESIPSDVRN---CSGNV---CAYEASTNAGD-TGGKVGTDTFAVGTAKAS-LAFG 158

Query: 249 C-GRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNS 307
           C   ++    GG SG++GLGR+  SLV+QT       FSYCL +  DAG + +L LG ++
Sbjct: 159 CVVASDIDTMGGPSGIVGLGRTPWSLVTQTGV---AAFSYCL-APHDAGKNSALFLGSSA 214

Query: 308 SVF----KNSTPITYTNMIPNP-QLATFYILNLTGISIGGKQ--LQASGFAKGGILIDSG 360
            +       STP  + N+  N   L+ +Y + L G+  G     L  SG     +L+D+ 
Sbjct: 215 KLAGGGKAASTP--FVNISGNGNDLSNYYKVQLEGLKAGDAMIPLPPSG---STVLLDTF 269

Query: 361 TVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAE 420
           + I+ L    Y A+K          P A      D CF  S        LV   F G A 
Sbjct: 270 SPISFLVDGAYQAVKKAVTVAVGAPPMATPVEPFDLCFPKSGASGAAPDLV-FTFRGGAA 328

Query: 421 MTVDVTGIVYFVKSDASQVCLAL---ASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFA 477
           MTV  T   Y +      VCLA+   A L+   E  ++G+ QQ+N   ++D     L F 
Sbjct: 329 MTVPATN--YLLDYKNGTVCLAMLSSARLNSTTELSLLGSLQQENIHFLFDLDKETLSFE 386

Query: 478 GEDCSSM 484
             DC+ +
Sbjct: 387 PADCTKL 393


>gi|115476830|ref|NP_001062011.1| Os08g0469100 [Oryza sativa Japonica Group]
 gi|42407408|dbj|BAD09566.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
 gi|113623980|dbj|BAF23925.1| Os08g0469100 [Oryza sativa Japonica Group]
          Length = 373

 Score =  134 bits (336), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 108/359 (30%), Positives = 168/359 (46%), Gaps = 35/359 (9%)

Query: 148 VIVDTGSDLTWVQCQ----PCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNS 203
           +IVDTGSDL W QC+       +  +   PV+DP  S ++  + C+   C   +F+  N 
Sbjct: 28  LIVDTGSDLIWTQCKLSSSTAAAARHGSPPVYDPGESSTFAFLPCSDRLCQEGQFSFKN- 86

Query: 204 GVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG--KASVNDFIFGCGRNNKGLFGGVS 261
             C+S +   C Y   YG  +   G L  E    G  +A      FGCG  + G   G +
Sbjct: 87  --CTSKN--RCVYEDVYGSAAAV-GVLASETFTFGARRAVSLRLGFGCGALSAGSLIGAT 141

Query: 262 GLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNST--PITYT 319
           G++GL    LSL++Q        FSYCL    D   S  L+ G  + + ++ T  PI  T
Sbjct: 142 GILGLSPESLSLITQLKI---QRFSYCLTPFADKKTS-PLLFGAMADLSRHKTTRPIQTT 197

Query: 320 NMIPNPQLATFYILNLTGISIGGKQLQ--ASGFAK-----GGILIDSGTVITRLPPSIYS 372
            ++ NP    +Y + L GIS+G K+L   A+  A      GG ++DSG+ +  L  + + 
Sbjct: 198 AIVSNPVETVYYYVPLVGISLGHKRLAVPAASLAMRPDGGGGTIVDSGSTVAYLVEAAFE 257

Query: 373 ALKAEFLKQFSGFPSA-PGFSILDTCFNL------SAYQEVNIPLVKMEFEGNAEMTVDV 425
           A+K E +      P A       + CF L      +A + V +P + + F+G A M +  
Sbjct: 258 AVK-EAVMDVVRLPVANRTVEDYELCFVLPRRTAAAAMEAVQVPPLVLHFDGGAAMVLPR 316

Query: 426 TGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
               YF +  A  +CLA+   +      IIGN QQ+N  V++D ++ +  FA   C  +
Sbjct: 317 DN--YFQEPRAGLMCLAVGKTTDGSGVSIIGNVQQQNMHVLFDVQHHKFSFAPTQCDQI 373


>gi|224114179|ref|XP_002332420.1| predicted protein [Populus trichocarpa]
 gi|222832373|gb|EEE70850.1| predicted protein [Populus trichocarpa]
          Length = 449

 Score =  133 bits (335), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 113/362 (31%), Positives = 163/362 (45%), Gaps = 51/362 (14%)

Query: 150 VDTGSDLTWVQCQPCKS----CYNQQDPVFDPSISPSYKKVLCNS-STCHALEFATGNSG 204
           +DTG++L+W+QC+ C++    C+  +DP +  S S SYK V CN  S C   +   G   
Sbjct: 105 IDTGNELSWIQCEGCQNKGNMCFPHKDPPYTSSQSKSYKPVSCNQHSFCEPNQCKEG--- 161

Query: 205 VCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL----GK-ASVNDFIFGCGRNNKGLF-- 257
                    C Y V+YG GSYT G L  E        GK  ++    FGC  +++ +   
Sbjct: 162 --------LCAYNVTYGPGSYTSGNLANETFTFYSNHGKHTALKSISFGCSTDSRNMIYA 213

Query: 258 -----GGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKN 312
                  VSG++G+G    S ++Q   I  G FSYC+     A  + +  L     V K+
Sbjct: 214 FLLDKNPVSGVLGMGWGPRSFLAQLGSISHGKFSYCI----TANNTHNTYLRFGKHVVKS 269

Query: 313 STPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGF-------AKGGILIDSGTVITR 365
               T   M   P  A  Y +NL GIS+ G +L  +            G +ID+GT+ T 
Sbjct: 270 KNLQTTKIMQVKPSAA--YHVNLLGISVNGVKLNITKTDLAVRKDGSRGCIIDAGTLATL 327

Query: 366 LPPSIYSALKAEFLKQFSGFPSAPGFSIL----DTCFN-LSAYQEVNIPLVKMEFEGNAE 420
           L   I+  L        S   +   + I     D C+  LS     N+P+V    E NA+
Sbjct: 328 LVKPIFDTLHTALSNHLSSNQNLKRWVIHKLHKDLCYEQLSDAGRKNLPVVTFHLE-NAD 386

Query: 421 MTVDVTGIVYFVKSDASQV-CLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGE 479
           + V    I  F + +   V CL++ S   +D   IIG YQQ  Q+ +YDTK   L F  E
Sbjct: 387 LEVKPEAIFLFREFEGKNVFCLSMLS---DDSKTIIGAYQQMKQKFVYDTKARVLSFGPE 443

Query: 480 DC 481
           DC
Sbjct: 444 DC 445


>gi|14532550|gb|AAK64003.1| AT3g61820/F15G16_210 [Arabidopsis thaliana]
          Length = 362

 Score =  133 bits (334), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 94/251 (37%), Positives = 134/251 (53%), Gaps = 17/251 (6%)

Query: 91  RLILDNLHVQYLQSRI-----KNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--G 143
           RL  D+L V+ + S       +N      +        + SG+   +  Y   + +G   
Sbjct: 86  RLQRDSLRVKSITSLAAVSTGRNATKRTPRTAGGFSGAVISGLSQGSGEYFMRLGVGTPA 145

Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNS 203
            N+ +++DTGSD+ W+QC PCK+CYNQ D +FDP  S ++  V C S  C  L+    +S
Sbjct: 146 TNVYMVLDTGSDVVWLQCSPCKACYNQTDAIFDPKKSKTFATVPCGSRLCRRLD----DS 201

Query: 204 GVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGL 263
             C +     C Y VSYGDGS+T G+   E L    A V+    GCG +N+GLF G +GL
Sbjct: 202 SECVTRRSKTCLYQVSYGDGSFTEGDFSTETLTFHGARVDHVPLGCGHDNEGLFVGAAGL 261

Query: 264 MGLGRSDLSLVSQTSEIFGGLFSYCL---PSTQDAGASGSLILGGNSSVFKNSTPITYTN 320
           +GLGR  LS  SQT   + G FSYCL    S+  +    S I+ GN++V K S    +T 
Sbjct: 262 LGLGRGGLSFPSQTKNRYNGKFSYCLVDRTSSGSSSKPPSTIVFGNAAVPKTS---VFTP 318

Query: 321 MIPNPQLATFY 331
           ++ NP+L TFY
Sbjct: 319 LLTNPKLDTFY 329


>gi|242081367|ref|XP_002445452.1| hypothetical protein SORBIDRAFT_07g019450 [Sorghum bicolor]
 gi|241941802|gb|EES14947.1| hypothetical protein SORBIDRAFT_07g019450 [Sorghum bicolor]
          Length = 459

 Score =  133 bits (334), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 104/354 (29%), Positives = 161/354 (45%), Gaps = 33/354 (9%)

Query: 148 VIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCS 207
           VI+D GSDL W QC        Q +PVFD + S S+  + C+S  C A  F    +  C+
Sbjct: 122 VILDLGSDLLWTQCSLVGPTAKQLEPVFDAARSSSFSVLPCDSKLCEAGTF---TNKTCT 178

Query: 208 SSSPPDCNYFVSYGDGSYTRGELGREHLGLGK---ASVNDFIFGCGRNNKGLFGGVSGLM 264
                 C Y   YG  + T G L  E    G     S N   FGCG+   G     SG++
Sbjct: 179 DRK---CAYENDYGIMTAT-GVLATETFTFGAHHGVSAN-LTFGCGKLANGTIAEASGIL 233

Query: 265 GLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSV--FKNSTPITYTNMI 322
           GL    LS++ Q +      FSYCL    D   S  ++ G  + +  +K +  +    ++
Sbjct: 234 GLSPGPLSMLKQLAIT---KFSYCLTPFADRKTS-PVMFGAMADLGKYKTTGKVQTIPLL 289

Query: 323 PNPQLATFYILNLTGISIGGKQLQA-------SGFAKGGILIDSGTVITRLPPSIYSALK 375
            NP    +Y + + G+S+G K+L              GG ++DS T +  L    ++ LK
Sbjct: 290 KNPVEDIYYYVPMVGMSVGSKRLDVPQETLAIKPDGTGGTVLDSATTLAYLVEPAFTELK 349

Query: 376 AEFLKQFSGFPSAPGFSILD--TCFNLS---AYQEVNIPLVKMEFEGNAEMTVDVTGIVY 430
              ++     P A   S+ D   CF L    + + V +P + + F+G+AEM++      Y
Sbjct: 350 KAVMEGIK-LPVA-NRSVDDYPVCFELPRGMSMEGVQVPPLVLHFDGDAEMSLPRDN--Y 405

Query: 431 FVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
           F +     +CLA+    +E    +IGN QQ+N  V+YD  N +  +A   C S+
Sbjct: 406 FQEPSPGMMCLAVMQAPFEGAPNVIGNVQQQNMHVLYDVGNRKFSYAPTKCDSI 459


>gi|54290725|dbj|BAD62395.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
          Length = 500

 Score =  132 bits (333), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 121/455 (26%), Positives = 196/455 (43%), Gaps = 64/455 (14%)

Query: 69  LELKHK-NYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSG 127
           L + H+ N CS       +   + + + +   + L+S    + SG+    +      + G
Sbjct: 68  LPVLHRLNPCSPLNAGGKQSTTSSVDVSHRAGRRLRSLFAAVQSGDDAAPAPAPAAASGG 127

Query: 128 IRLQTLNYIATIELGGRNMTVIV-------------DTGSDLTWVQCQPCKS---CYNQQ 171
           + + T         G  + TV+V             DTG  ++ V+C  C+    C    
Sbjct: 128 VTIPTTGTPEPGAPGFHDYTVVVGYGTPAQQLAMAFDTGLGISLVRCAACRPGAPCDGLA 187

Query: 172 DPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELG 231
              FDPS S ++  V C S  C         SG CSS S P C    S+    +  G + 
Sbjct: 188 S--FDPSRSSTFAPVPCGSPDC--------RSG-CSSGSTPSCP-LTSF---PFLSGAVA 232

Query: 232 REHLGL-GKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLP 290
           ++ L L   ASV+DF FGC   + G   G +GL+ L R   S+ S+ +   GG FSYCLP
Sbjct: 233 QDVLTLTPSASVDDFTFGCVEGSSGEPLGAAGLLDLSRDSRSVASRLAADAGGTFSYCLP 292

Query: 291 STQDAGASGSLILGGNSSVFKNST-------PITYTNMIPNPQLATFYILNLTGISIGGK 343
            +    +S   +  G + V  N T       P+ Y    PN      Y+++L G+S+GG+
Sbjct: 293 LSTT--SSHGFLAIGEADVPHNRTARVTAVAPLVYDPAFPN-----HYVIDLAGVSLGGR 345

Query: 344 QL---QASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNL 400
            +     +  A   +++D+    T + PS+Y+ L+  F +  + +P AP    LDTC+N 
Sbjct: 346 DIPIPPHAATASAAMVLDTALPYTYMKPSMYAPLRDAFRRAMARYPRAPAMGDLDTCYNF 405

Query: 401 SAYQ-EVNIPLVKMEFEGNAEMTVDVTGIV----YFVKSDA----SQVCLALASLSYEDE 451
           +  + EV IPLV + F G           +     F  S+     S  CLA A+L  + +
Sbjct: 406 TGVRHEVLIPLVHLTFRGIGGGGGGQVLGLGADQMFYMSEPGNFFSVTCLAFAALPSDGD 465

Query: 452 TG-----IIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
                  ++G   Q +  V++D    ++GF    C
Sbjct: 466 AEAPLAMVMGTLAQSSMEVVHDVPGGKIGFIPGSC 500


>gi|357482031|ref|XP_003611301.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355512636|gb|AES94259.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 481

 Score =  132 bits (333), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 118/405 (29%), Positives = 184/405 (45%), Gaps = 64/405 (15%)

Query: 134 NYIATIELGGRN---MTVIVDTGSDLTWVQCQP--CKSCYNQQDPVFDPSISPSYKKVLC 188
           +Y  +  LG      +T+ +DTGSDL W  C P  C  C  +       +I+     V C
Sbjct: 74  DYTLSFNLGSNPPQLITLYMDTGSDLVWFPCSPFECILCEGKPQTTKPANITKQTHSVSC 133

Query: 189 NSSTCHALEFATGNSGVCSSSSPP-------DCN------YFVSYGDGSYTRGELGREHL 235
            S  C A   +  +S +C+ S  P       DC+      ++ +YGDGS+    L ++ L
Sbjct: 134 QSPACSAAHASMSSSNLCAISRCPLDYIETSDCSSFSCPPFYYAYGDGSFV-ANLYQQTL 192

Query: 236 GLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEI---FGGLFSYCLPST 292
            L    + +F FGC           +G+ G GR  LSL +Q S +    G  FSYCL S 
Sbjct: 193 SLSSLHLQNFTFGCAHT---ALAEPTGVAGFGRGILSLPAQLSTLSPHLGNRFSYCLVSH 249

Query: 293 QDAG----ASGSLILGGNSSVFK-----NSTPITYTNMIPNPQLATFYILNLTGISIGGK 343
              G        LILG ++          S    YT+M+ NP+   +Y + L GIS+G +
Sbjct: 250 SFDGDRLRRPSPLILGRHNDTITGAGDGESVEFVYTSMLSNPKHPYYYCVGLAGISVGKR 309

Query: 344 QLQASGFAK-------GGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGF----S 392
            + A    K       GG+++DSGT  T LP S Y+A+  EF K+ + F          +
Sbjct: 310 TVPAPEILKRVDEKGNGGMVVDSGTTFTMLPESFYNAVVNEFDKRVNRFHKRASEIETKT 369

Query: 393 ILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYF--------VKSDASQVCLALA 444
            L  C+ L+   +  IP++K+ F GN    V      ++        ++      C+ L 
Sbjct: 370 GLGPCYYLNGLSQ--IPVLKLHFVGNNSDVVLPRKNYFYEFMDGGDGIRRKGKVGCMML- 426

Query: 445 SLSYEDETGI-------IGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
            ++ EDET +       +GNYQQ+   V+YD +  ++GFA ++C+
Sbjct: 427 -MNGEDETELDGGPGATLGNYQQQGFEVVYDLEKERVGFAKKECA 470


>gi|125558627|gb|EAZ04163.1| hypothetical protein OsI_26305 [Oryza sativa Indica Group]
          Length = 404

 Score =  132 bits (333), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 117/355 (32%), Positives = 160/355 (45%), Gaps = 67/355 (18%)

Query: 145 NMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSG 204
             +V+ DTGS L W QC PC  C  +  P F P+ S ++ K+ C SS C   +F T    
Sbjct: 102 TFSVLADTGSSLIWTQCAPCTECAARPAPPFQPASSSTFSKLPCASSLC---QFLTSPYR 158

Query: 205 VCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLM 264
            C+++    C Y+  YG G +T G L  E L +G AS     FGC   N G+    SG++
Sbjct: 159 TCNATG---CVYYYPYGMG-FTAGYLATETLHVGGASFPGVTFGCSTEN-GVGNSSSGIV 213

Query: 265 GLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGAS----GSL--ILGGNSSVFKNSTPITY 318
           GLGRS LSLVSQ        FSYCL S  DAG S    GSL  + GGN      STP   
Sbjct: 214 GLGRSPLSLVSQVGV---ARFSYCLRSNADAGDSPILFGSLAKVTGGN----VQSTP--- 263

Query: 319 TNMIPNPQL--ATFYILNLTGISIGGKQLQASGFAKGGILIDSGTVITRLPPSIYSALKA 376
             ++ NP++  +++Y +NLTGI++G   L     A   +   +GT               
Sbjct: 264 --LLENPEMPSSSYYYVNLTGITVGATDLP---MAMANLTTVNGTRF------------- 305

Query: 377 EFLKQFSGFPSAPGFSILDTCFN---LSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVK 433
                  GF         D CF+         V +P + + F G AE  V        V+
Sbjct: 306 -------GF---------DLCFDATAAGGGGGVPVPTLVLRFAGGAEYAVRRRSYFGVVE 349

Query: 434 SD----ASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
            D    A+  CL +   S +    IIGN  Q +  V+YD       FA  DC+++
Sbjct: 350 VDSQGRAAVECLLVLPASEKLSISIIGNVMQMDLHVLYDLDGGMFSFAPADCANV 404


>gi|67633548|gb|AAY78698.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 392

 Score =  132 bits (333), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 105/350 (30%), Positives = 159/350 (45%), Gaps = 52/350 (14%)

Query: 150 VDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSS 209
           +DTGSDL W QC PC +CY+Q  P+FDPS S ++K+  CN ++CH               
Sbjct: 78  IDTGSDLIWTQCMPCTNCYSQYAPIFDPSNSSTFKEKRCNGNSCH--------------- 122

Query: 210 SPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIF-----GCGRNNKGLFGGVSGLM 264
                 Y + Y D +Y++G L  E + +   S   F+      GCG N+       SG++
Sbjct: 123 ------YKIIYADTTYSKGTLATETVTIHSTSGEPFVMPETTIGCGHNSSWFKPTFSGMV 176

Query: 265 GLGRSDLSLVSQTSEIFGGLFSYCLPS--TQDAGASGSLILGGNSSVFKNSTPITYTNMI 322
           GL     SL++Q    + GL SYC  S  T       + I+ G+  V   ST +  T   
Sbjct: 177 GLSWGPSSLITQMGGEYPGLMSYCFASQGTSKINFGTNAIVAGDGVV---STTMFLTTAK 233

Query: 323 PNPQLATFYILNLTGISIGGKQLQASGFA----KGGILIDSGTVITRLPPSIYSALKAEF 378
           P       Y LNL  +S+G   ++  G      +G I+IDSGT +T  P S Y  L  E 
Sbjct: 234 PG-----LYYLNLDAVSVGDTHVETMGTTFHALEGNIIIDSGTTLTYFPVS-YCNLVREA 287

Query: 379 LKQFSGFPSAPGFSILDTCFN--LSAYQE-VNI-PLVKMEFEGNAEMTVDVTGIVYFVKS 434
           +  +            D   N  L  Y + ++I P++ M F G A++ +D   + Y    
Sbjct: 288 VDHY-----VTAVRTADPTGNDMLCYYTDTIDIFPVITMHFSGGADLVLDKYNM-YIETI 341

Query: 435 DASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
                CLA+   +   +  I GN  Q N  V YD+ +  + F+  +CS++
Sbjct: 342 TRGTFCLAIIC-NNPPQDAIFGNRAQNNFLVGYDSSSLLVFFSPTNCSAL 390


>gi|224109494|ref|XP_002315215.1| predicted protein [Populus trichocarpa]
 gi|222864255|gb|EEF01386.1| predicted protein [Populus trichocarpa]
          Length = 444

 Score =  132 bits (333), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 118/387 (30%), Positives = 182/387 (47%), Gaps = 60/387 (15%)

Query: 132 TLNYIATIELGGRNMTVIVDTGSDLTWVQCQPCKSCYNQQDP----VFDPSISPSYKKVL 187
           TL    TI    +N+T+++DTGS+L+W++C        +++P    +F+P  S +Y K+ 
Sbjct: 66  TLTASLTIGTPPQNITMVLDTGSELSWLRC--------KKEPNFTSIFNPLASKTYTKIP 117

Query: 188 CNSSTCHALEFATGNSGVCSSSSPPD-CNYFVSYGDGSYTRGELGREHLGLGKASVNDFI 246
           C+S TC      T +  +  +  P   C++ +SY D S   G L  E    G  +    +
Sbjct: 118 CSSQTCKT---RTSDLTLPVTCDPAKLCHFIISYADASSVEGHLAFETFRFGSLTRPATV 174

Query: 247 FGC----GRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLI 302
           FGC      +N       +GLMG+ R  LS V+Q        FSYC+       ++G L+
Sbjct: 175 FGCMDSGSSSNTEEDAKTTGLMGMNRGSLSFVNQMG---FRKFSYCI---SGLDSTGFLL 228

Query: 303 LGGNSSVFKNSTPITYTNMI----PNPQL-ATFYILNLTGISIGGK--QLQASGFA---- 351
           LG   + +    P+ YT ++    P P      Y + L GI +  K   L  S F     
Sbjct: 229 LG--EARYSWLKPLNYTPLVQISTPLPYFDRVAYSVQLEGIKVNNKVLPLPKSVFVPDHT 286

Query: 352 -KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGF------PSAPGFSILDTCFNLSAYQ 404
             G  ++DSGT  T L   +YSAL+ EFL Q +G       P       +D C+ + +  
Sbjct: 287 GAGQTMVDSGTQFTFLLGPVYSALRKEFLLQTAGVLRVLNEPQYVFQGAMDLCYLIDSTS 346

Query: 405 EV--NIPLVKMEFEGNAEMTVDVTGIVYFVKSDA----SQVCLALASLSYEDETGI---- 454
               N+P+VK+ F G AEM+V    ++Y V  +     S  C    +    DE GI    
Sbjct: 347 STLPNLPVVKLMFRG-AEMSVSGQRLLYRVPGEVRGKDSVWCFTFGN---SDELGISSFL 402

Query: 455 IGNYQQKNQRVIYDTKNSQLGFAGEDC 481
           IG++QQ+N  + YD +NS++GFA   C
Sbjct: 403 IGHHQQQNVWMEYDLENSRIGFAELRC 429


>gi|255565759|ref|XP_002523869.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223536957|gb|EEF38595.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 447

 Score =  132 bits (333), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 115/374 (30%), Positives = 171/374 (45%), Gaps = 51/374 (13%)

Query: 144 RNMTVIVDTGSDLTWVQCQP---CKSC-YNQQDPVFDPSISPSYKKVLCNSSTC---HAL 196
           + ++ ++DTGS   W  C     C +C +  +   F P  S S K + C +  C   H  
Sbjct: 88  QTLSFVMDTGSSFVWFPCTLRYLCNNCSFTSRISPFLPKHSSSSKIIGCKNPKCSWIHQT 147

Query: 197 EF----ATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRN 252
           +        NS  CS   PP   Y + YG G+ T G    E L L    V +F+ GC   
Sbjct: 148 DLRCTDCDNNSRNCSQICPP---YLILYGSGT-TGGVALSETLHLHGLIVPNFLVGCS-- 201

Query: 253 NKGLFGG--VSGLMGLGRSDLSLVSQTSEIFGGL--FSYCLPSTQ--DAGASGSLILGGN 306
              +F     +G+ G GR   SL SQ      GL  FSYCL S +  D   S SL+L   
Sbjct: 202 ---VFSSRQPAGIAGFGRGPSSLPSQL-----GLTKFSYCLLSHKFDDTQESSSLVLDSQ 253

Query: 307 SSVFKNSTPITYTNMIPNPQL------ATFYILNLTGISIGG-------KQLQASGFAKG 353
           S   K +  + YT ++ NP++      + +Y ++L  ISIGG       K L       G
Sbjct: 254 SDSDKKTAALMYTPLVKNPKVQDKPAFSVYYYVSLRRISIGGRSVKIPYKYLSPDKDGNG 313

Query: 354 GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSA---PGFSILDTCFNLSAYQEVNIPL 410
           G +IDSGT  T +    +  L  EF+ Q   +  A      S L  CFN+S  +E+ +P 
Sbjct: 314 GTIIDSGTTFTYMSTEAFEILSNEFISQVKNYERALMVEALSGLKPCFNVSGAKELELPQ 373

Query: 411 VKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETG---IIGNYQQKNQRVIY 467
           +++ F+G A++ + +     F+ S     C  + +   E  +G   I+GN+Q +N  V Y
Sbjct: 374 LRLHFKGGADVELPLENYFAFLGS-REVACFTVVTDGAEKASGPGMILGNFQMQNFYVEY 432

Query: 468 DTKNSQLGFAGEDC 481
           D +N +LGF  E C
Sbjct: 433 DLQNERLGFKKESC 446


>gi|21717175|gb|AAM76368.1|AC074196_26 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433292|gb|AAP54830.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
          Length = 418

 Score =  132 bits (332), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 102/359 (28%), Positives = 166/359 (46%), Gaps = 26/359 (7%)

Query: 138 TIELGGRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALE 197
           TI    +  + I+D   +L W QC  C  C+ Q  P+F P+ S +++   C +  C ++ 
Sbjct: 72  TIGTPPQPASAIIDVAGELVWTQCSMCSRCFKQDLPLFVPNASSTFRPEPCGTDACKSIP 131

Query: 198 FATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGC-GRNNKGL 256
            +  +S +C+          ++   G +T G +  +   +G A+ +   FGC   +    
Sbjct: 132 TSNCSSNMCTYEG------TINSKLGGHTLGIVATDTFAIGTATAS-LGFGCVVASGIDT 184

Query: 257 FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVF--KNST 314
            GG SGL+GLGR+  SLVSQ +      FSYCL +  D+G +  L+LG ++ +    NST
Sbjct: 185 MGGPSGLIGLGRAPSSLVSQMNIT---KFSYCL-TPHDSGKNSRLLLGSSAKLAGGGNST 240

Query: 315 PITYTNMIPNPQLATFYILNLTGISIGGK--QLQASGFAKGGILIDSGTVITRLPPSIYS 372
              +    P   ++ +Y + L GI  G     L  SG     +L+ +   ++ L  S Y 
Sbjct: 241 TTPFVKTSPGDDMSQYYPIQLDGIKAGDAAIALPPSG---NTVLVQTLAPMSFLVDSAYQ 297

Query: 373 ALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEF-EGNAEMTVDVTGIVYF 431
           ALK E  K     P+A      D CF  +     + P +   F +G A +TV     +  
Sbjct: 298 ALKKEVTKAVGAAPTATPLQPFDLCFPKAGLSNASAPDLVFTFQQGAAALTVPPPKYLID 357

Query: 432 VKSDASQVCLALASLSYEDETG------IIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
           V  +   VC+A+ S S+ + T       I+G+ QQ+N   + D +   L F   DCSS+
Sbjct: 358 VGEEKGTVCMAILSTSWLNTTALDENLNILGSLQQENTHFLLDLEKKTLSFEPADCSSL 416


>gi|297737850|emb|CBI27051.3| unnamed protein product [Vitis vinifera]
          Length = 256

 Score =  132 bits (331), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 102/262 (38%), Positives = 140/262 (53%), Gaps = 23/262 (8%)

Query: 91  RLILDNLHV--QYLQSRIKNMISGNIKDVSNT-EIPLTSGIRLQTLNYIATIELGG--RN 145
           RL L   H     L +    ++  +IK ++   E PL SG    +  Y + + +G   ++
Sbjct: 6   RLTLMVFHCCKSILATYFHVILLFSIKTIAEALETPLVSGASQGSGEYFSRVGIGSPPKH 65

Query: 146 MTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGV 205
           + ++VDTGSD+ WVQC PC  CY Q DP+F+PS S SY  + C +  C +L+ +      
Sbjct: 66  VYMVVDTGSDVNWVQCAPCADCYQQADPIFEPSFSSSYAPLTCETHQCKSLDVSE----- 120

Query: 206 CSSSSPPDCNYFVSYGDGSYTRGELGREHLGL-GKASVNDFIFGCGRNNKGLFGGVSGLM 264
           C + S   C Y VSYGDGSYT G+   E + L G AS+N+   GCG +N+GLF G +GL+
Sbjct: 121 CRNDS---CLYEVSYGDGSYTVGDFATETITLDGSASLNNVAIGCGHDNEGLFVGAAGLL 177

Query: 265 GLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPN 324
           GLG   LS  SQ   I    FSYCL +     AS    L  NS +  +S       ++ N
Sbjct: 178 GLGGGSLSFPSQ---INASSFSYCLVNRDTDSAS---TLEFNSPIPSHSVT---APLLRN 228

Query: 325 PQLATFYILNLTGISIGGKQLQ 346
            QL TFY L +TGI    K LQ
Sbjct: 229 NQLDTFYYLGMTGIGESYKILQ 250


>gi|115434870|ref|NP_001042193.1| Os01g0178600 [Oryza sativa Japonica Group]
 gi|55296112|dbj|BAD67831.1| putative CDR1 [Oryza sativa Japonica Group]
 gi|55296252|dbj|BAD67993.1| putative CDR1 [Oryza sativa Japonica Group]
 gi|113531724|dbj|BAF04107.1| Os01g0178600 [Oryza sativa Japonica Group]
 gi|125569253|gb|EAZ10768.1| hypothetical protein OsJ_00604 [Oryza sativa Japonica Group]
          Length = 454

 Score =  131 bits (330), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 115/382 (30%), Positives = 177/382 (46%), Gaps = 43/382 (11%)

Query: 124 LTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCK---SCYNQQDPVFDPS 178
           + S +  ++  Y+ T+ LG   R+M  I DTGSDL WV+C+      S        FDPS
Sbjct: 90  VVSKVVSRSFEYLMTVNLGSPPRSMLAIADTGSDLVWVKCKKGNNDTSSAAAPTTQFDPS 149

Query: 179 ISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHL--- 235
            S +Y +V C +  C AL  AT + G        +C Y  +YGDGS T G L  E     
Sbjct: 150 RSSTYGRVSCQTDACEALGRATCDDG-------SNCAYLYAYGDGSNTTGVLSTETFTFD 202

Query: 236 --GLGKA----SVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQT--SEIFGGLFSY 287
             G G++     V    FGC     G F     +       +SLV+Q   +   G  FSY
Sbjct: 203 DGGSGRSPRQVRVGGVKFGCSTATAGSFPADGLVGLG-GGAVSLVTQLGGATSLGRRFSY 261

Query: 288 CLPSTQDAGASGSLILGGNSSVFK---NSTPITYTNMIPNPQLATFYILNLTGISIGGKQ 344
           CL       AS +L  G  + V +    STP+   +      + T+Y + L  + +G K 
Sbjct: 262 CL-VPHSVNASSALNFGALADVTEPGAASTPLVAGD------VDTYYTVVLDSVKVGNKT 314

Query: 345 LQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQ 404
           + ++  ++  I++DSGT +T L PS+   +  E  ++ +  P      +L  C+N+ A +
Sbjct: 315 VASAASSR--IIVDSGTTLTFLDPSLLGPIVDELSRRITLPPVQSPDGLLQLCYNV-AGR 371

Query: 405 EV----NIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQ 460
           EV    +IP + +EF G A + +       FV      +CLA+ + + +    I+GN  Q
Sbjct: 372 EVEAGESIPDLTLEFGGGAAVALKPENA--FVAVQEGTLCLAIVATTEQQPVSILGNLAQ 429

Query: 461 KNQRVIYDTKNSQLGFAGEDCS 482
           +N  V YD     + FAG DC+
Sbjct: 430 QNIHVGYDLDAGTVTFAGADCA 451


>gi|125548492|gb|EAY94314.1| hypothetical protein OsI_16081 [Oryza sativa Indica Group]
          Length = 417

 Score =  131 bits (330), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 92/289 (31%), Positives = 133/289 (46%), Gaps = 43/289 (14%)

Query: 145 NMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSG 204
             T  +DT SDL W QCQPC  CY+Q DP+F+P +S +Y  + C+S TC  L+       
Sbjct: 101 KFTAAIDTASDLIWTQCQPCTGCYHQVDPMFNPRVSSTYAALPCSSDTCDELDVHR---- 156

Query: 205 VCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGG--VSG 262
            C       C Y  +Y   + T G L  + L +G+ +     FGC  ++ G       SG
Sbjct: 157 -CGHDDDESCQYTYTYSGNATTEGTLAVDKLVIGEDAFRGVAFGCSTSSTGGAPPPQASG 215

Query: 263 LMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMI 322
           ++GLGR  LSLVSQ S      F+YCLP    +   G L+LG ++   +N+T      M 
Sbjct: 216 VVGLGRGPLSLVSQLSVR---RFAYCLPPPA-SRIPGKLVLGADADAARNATNRIAVPMR 271

Query: 323 PNPQLATFYILNLTGISIGGKQL-------------------------QASGFAKG---- 353
            +P+  ++Y LNL G+ IG + +                          A+  A G    
Sbjct: 272 RDPRYPSYYYLNLDGLLIGDRTMSLPPTTTTTATATATATAPAPTPSPNATAVAVGDANR 331

Query: 354 -GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI-LDTCFNL 400
            G++ID  + IT L  S+Y  L  +   +    P   G S+ LD CF L
Sbjct: 332 YGMIIDIASTITFLEASLYDELVNDLEVEIR-LPRGTGSSLGLDLCFIL 379


>gi|147814824|emb|CAN65806.1| hypothetical protein VITISV_015630 [Vitis vinifera]
          Length = 449

 Score =  131 bits (329), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 111/389 (28%), Positives = 173/389 (44%), Gaps = 35/389 (8%)

Query: 121 EIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQ-PCKS--CYNQQ---- 171
           E+P+          Y    ++G   +   ++ DTGSDLTW+ C+  C+S  C N++    
Sbjct: 69  EVPMHPAADYGIGQYFVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRI 128

Query: 172 --DPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGE 229
               VF  ++S S+K + C +  C        +   C +   P C Y   Y DGS   G 
Sbjct: 129 RHKRVFHANLSSSFKTIPCLTDMCKIELMDLFSLTNCPTPLTP-CGYDYRYSDGSTALGF 187

Query: 230 LGREHLGL-----GKASVNDFIFGCGRNNKGL-FGGVSGLMGLGRSDLSLVSQTSEIFGG 283
              E + +      K  +++ + GC  + +G  F    G+MGLG S  S   + +E FGG
Sbjct: 188 FANETVTVELKEGRKMKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSFAIKAAEKFGG 247

Query: 284 LFSYCL-PSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGG 342
            FSYCL         S  L  G + S       +TYT ++    + +FY +N+ GISIGG
Sbjct: 248 KFSYCLVDHLSHKNVSNYLTFGSSRSKEALLNNMTYTELVLG-MVNSFYAVNMMGISIGG 306

Query: 343 KQLQASGFA-----KGGILIDSGTVITRLPPSIY----SALKAEFLKQFSGFPSAPGFSI 393
             L+           GG ++DSG+ +T L    Y    +AL+   LK F       G   
Sbjct: 307 AMLKIPSEVWDVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLK-FRKVEMDIG--P 363

Query: 394 LDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETG 453
           L+ CFN + ++E  +P +   F   AE    V    Y + +     CL   S+++   T 
Sbjct: 364 LEYCFNSTGFEESLVPRLVFHFADGAEFEPPVKS--YVISAADGVRCLGFVSVAWPG-TS 420

Query: 454 IIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
           ++GN  Q+N    +D    +LGFA   C+
Sbjct: 421 VVGNIMQQNHLWEFDLGLKKLGFAPSSCT 449


>gi|125555054|gb|EAZ00660.1| hypothetical protein OsI_22681 [Oryza sativa Indica Group]
          Length = 337

 Score =  130 bits (328), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 102/352 (28%), Positives = 165/352 (46%), Gaps = 39/352 (11%)

Query: 151 DTGSDLTWVQCQPCKS---CYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCS 207
           DTG  ++  +C  C+    C       FDPS S ++  V C S  C         SG CS
Sbjct: 4   DTGLGISLARCAACRPGAPCDGLAS--FDPSRSSTFAPVPCGSPDC--------RSG-CS 52

Query: 208 SSSPPDCNYFVSYGDGSYTRGELGREHLGL-GKASVNDFIFGCGRNNKGLFGGVSGLMGL 266
           S S P C    S+    +  G + ++ L L   ASV+DF FGC   + G   G +GL+ L
Sbjct: 53  SGSTPSCP-LTSF---PFLSGAVAQDVLTLTPSASVDDFTFGCVEGSSGEPLGAAGLLDL 108

Query: 267 GRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPIT-YTNMIPNP 325
            R   SL S+ +   GG FSYCLP +  + + G L++G        S  +T    ++ +P
Sbjct: 109 SRDSRSLASRLAAGAGGTFSYCLPLSTTS-SHGFLVIGEADVPHNRSARVTAVAPLVYDP 167

Query: 326 QLATFYILNLTGISIGGKQLQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGF 385
                Y+++L G+S+GG+ +     A   +++D+    T + PS+Y+ L+  F +  + +
Sbjct: 168 AFPNHYVIDLAGVSLGGRDIPIPPHAA--MVLDTALPYTYMKPSMYAPLRDAFRRAMARY 225

Query: 386 PSAPGFSILDTCFNLSAYQ-EVNIPLVKMEFEGNAEMTVDVTG--------IVYFVKSDA 436
           P AP    LDTC+N +  + EV IPLV + F G +                ++Y  +   
Sbjct: 226 PRAPAMGDLDTCYNFTGVRHEVLIPLVHLTFRGISGGGGGEGQVLGLGADQMLYMSEPGN 285

Query: 437 --SQVCLALASLSYEDETG-----IIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
             S  CLA A+L  + +       ++G   Q +  V++D +  ++GF    C
Sbjct: 286 FFSVTCLAFAALPSDGDAAAPLAMVMGTLAQSSMEVVHDVQGGKIGFIPGSC 337


>gi|30682289|ref|NP_187876.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|11994412|dbj|BAB02414.1| chloroplast nucleoid DNA binding protein-like [Arabidopsis
           thaliana]
 gi|332641715|gb|AEE75236.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 461

 Score =  130 bits (328), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 118/410 (28%), Positives = 177/410 (43%), Gaps = 44/410 (10%)

Query: 104 SRIKNMISGNIKDVS----------NTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVD 151
           SRI+++I  + K  S            ++ L SGI   T  Y   I +G   +   V+VD
Sbjct: 65  SRIEDVIGADQKRHSLISRKRNSTVGVKMDLGSGIDYGTAQYFTEIRVGTPAKKFRVVVD 124

Query: 152 TGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSP 211
           TGS+LTWV C+  ++       VF    S S+K V C + TC        +   C + S 
Sbjct: 125 TGSELTWVNCR-YRARGKDNRRVFRADESKSFKTVGCLTQTCKVDLMNLFSLTTCPTPST 183

Query: 212 PDCNYFVSYGDGSYTRGELGREHLGLG-----KASVNDFIFGCGRNNKGL-FGGVSGLMG 265
           P C+Y   Y DGS  +G   +E + +G      A +   + GC  +  G  F G  G++G
Sbjct: 184 P-CSYDYRYADGSAAQGVFAKETITVGLTNGRMARLPGHLIGCSSSFTGQSFQGADGVLG 242

Query: 266 LGRSDLSLVSQTSEIFGGLFSYCL-PSTQDAGASGSLILGGNSS---VFKNSTPITYTNM 321
           L  SD S  S  + ++G  FSYCL     +   S  LI G + S    F+ +TP+  T +
Sbjct: 243 LAFSDFSFTSTATSLYGAKFSYCLVDHLSNKNVSNYLIFGSSRSTKTAFRRTTPLDLTRI 302

Query: 322 IPNPQLATFYILNLTGISIGGKQLQA-----SGFAKGGILIDSGTVITRLPPSIYSALK- 375
            P      FY +N+ GIS+G   L          + GG ++DSGT +T L  + Y  +  
Sbjct: 303 PP------FYAINVIGISLGYDMLDIPSQVWDATSGGGTILDSGTSLTLLADAAYKQVVT 356

Query: 376 --AEFLKQFSGFPSAPGFSILDTCFNL-SAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFV 432
             A +L +       P    ++ CF+  S +    +P +    +G A          Y V
Sbjct: 357 GLARYLVELKRV--KPEGVPIEYCFSFTSGFNVSKLPQLTFHLKGGARFEPHRKS--YLV 412

Query: 433 KSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
            +     CL   S      T +IGN  Q+N    +D   S L FA   C+
Sbjct: 413 DAAPGVKCLGFVSAG-TPATNVIGNIMQQNYLWEFDLMASTLSFAPSACT 461


>gi|357476865|ref|XP_003608718.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355509773|gb|AES90915.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 482

 Score =  130 bits (328), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 125/419 (29%), Positives = 184/419 (43%), Gaps = 68/419 (16%)

Query: 122 IPLTSGIRLQTLNYIATIELGGRN--MTVIVDTGSDLTWVQCQP--CKSCYNQQDPVFDP 177
           +PL+ G      +Y  +  LG  +  +T+ +DTGSDL W  C P  C  C  +     DP
Sbjct: 67  LPLSPGS-----DYTLSFNLGPHSQPITLYMDTGSDLVWFPCTPFNCILCELKPKLTSDP 121

Query: 178 S----ISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPP-------DCN------YFVSY 220
           S    IS S   + CNS  C     +T +S +C+ +  P       DC       ++ +Y
Sbjct: 122 SPPTNISHS-TPISCNSHACSVAHSSTPSSDLCTMAHCPLDSIETKDCGSFHCPPFYYAY 180

Query: 221 GDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQ---T 277
           GDGS     L R+ L L    + +F FGC       F   +G+ G GR  LSL +Q    
Sbjct: 181 GDGSLI-ASLYRDTLSLSTLQLTNFTFGCAHTT---FSEPTGVAGFGRGLLSLPAQLATH 236

Query: 278 SEIFGGLFSYCLPS----TQDAGASGSLILGG-NSSVFKNSTPIT---YTNMIPNPQLAT 329
           S   G  FSYCL S    ++       LILG  N     N   +    YT+M+ NP+ + 
Sbjct: 237 SPQLGNRFSYCLVSHSFRSERIRKPSPLILGRYNDEKQSNGDEVVEFVYTSMLENPKHSY 296

Query: 330 FYILNLTGISIGGKQLQASGFAK-------GGILIDSGTVITRLPPSIYSALKAEF---- 378
           FY + L GIS+G K + A    +       GG+++DSGT  T LP   Y+++   F    
Sbjct: 297 FYTVGLKGISVGKKTVPAPKILRRVNKKGDGGVVVDSGTTFTMLPEKFYNSVVEGFDRRA 356

Query: 379 LKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEG-NAEMTVDVTGIVYF------ 431
            K     P     + L  C+ L+      +P V + F G N+ + +      Y       
Sbjct: 357 RKSNRRAPEIEQKTGLSPCYYLNT--AAIVPAVTLRFVGMNSSVVLPRKNYFYEFMDGGD 414

Query: 432 -VKSDASQVCLALASLSYEDET-----GIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
            V+      CL   +   E E      G++GNYQQ+   V YD +  ++GFA   C+S+
Sbjct: 415 GVRRKERVGCLMFMNGGDEAEMSGGPGGVLGNYQQQGFEVEYDLEKKRVGFARRKCASL 473


>gi|218184944|gb|EEC67371.1| hypothetical protein OsI_34484 [Oryza sativa Indica Group]
          Length = 396

 Score =  130 bits (328), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 105/372 (28%), Positives = 172/372 (46%), Gaps = 29/372 (7%)

Query: 128 IRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKK 185
           IR     Y+A   +G   +  + IVD   +L W QC  C+ C+ Q  PVF P+ S ++K 
Sbjct: 38  IRWSPPYYVANFTIGTPPQPASAIVDVAGELVWTQCSACRRCFKQDLPVFVPNASSTFKP 97

Query: 186 VLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDF 245
             C ++ C ++   + +  VCS   PP      +   G+ T G    +   +G A+V   
Sbjct: 98  EPCGTAVCESIPTRSCSGDVCSYKGPP------TQLRGN-TSGFAATDTFAIGTATVR-L 149

Query: 246 IFGC-GRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILG 304
            FGC   ++     G SG +GLGR+  SLV+Q        FSYCL S ++ G S  L LG
Sbjct: 150 AFGCVVASDIDTMDGPSGFIGLGRTPWSLVAQMKLT---RFSYCL-SPRNTGKSSRLFLG 205

Query: 305 GNSSVF--KNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGGILI-DSGT 361
            ++ +   ++++   +    P+   + +Y+L+L  I  G   +  +    GGIL+  + +
Sbjct: 206 SSAKLAGSESTSTAPFIKTSPDDDGSNYYLLSLDAIRAGNTTIATA--QSGGILVMHTVS 263

Query: 362 VITRLPPSIYSALKAEFLKQFSG---FPSAPGFSILDTCFNLSA-YQEVNIPLVKMEFEG 417
             + L  S Y A K    +   G    P A      D CF  +A +     P +   F+G
Sbjct: 264 PFSLLVDSAYKAFKKAVTEAVGGAAAPPMATPPQPFDLCFKKAAGFSRATAPDLVFTFQG 323

Query: 418 NAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETG-----IIGNYQQKNQRVIYDTKNS 472
            A +TV     +  V  +    C A+ S+++ + TG     ++G+ QQ++   +YD K  
Sbjct: 324 AAALTVPPAKYLIDVGEEKDTACAAILSMAWLNRTGLEGVSVLGSLQQEDVHFLYDLKKE 383

Query: 473 QLGFAGEDCSSM 484
            L F   DCSS+
Sbjct: 384 TLSFEPADCSSL 395


>gi|17979392|gb|AAL49921.1| unknown protein [Arabidopsis thaliana]
          Length = 439

 Score =  130 bits (327), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 118/410 (28%), Positives = 177/410 (43%), Gaps = 44/410 (10%)

Query: 104 SRIKNMISGNIKDVS----------NTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVD 151
           SRI+++I  + K  S            ++ L SGI   T  Y   I +G   +   V+VD
Sbjct: 43  SRIEDVIGADQKRHSLISRKRNSTVGVKMDLGSGIDYGTAQYFTEIRVGTPAKKFRVVVD 102

Query: 152 TGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSP 211
           TGS+LTWV C+  ++       VF    S S+K V C + TC        +   C + S 
Sbjct: 103 TGSELTWVNCR-YRARGKDNRRVFRADESKSFKTVGCLTQTCKVDLMNLFSLTTCPTPST 161

Query: 212 PDCNYFVSYGDGSYTRGELGREHLGLG-----KASVNDFIFGCGRNNKGL-FGGVSGLMG 265
           P C+Y   Y DGS  +G   +E + +G      A +   + GC  +  G  F G  G++G
Sbjct: 162 P-CSYDYRYADGSAAQGVFAKETITVGLTNGRMARLPGHLIGCSSSFTGQSFQGADGVLG 220

Query: 266 LGRSDLSLVSQTSEIFGGLFSYCL-PSTQDAGASGSLILGGNSS---VFKNSTPITYTNM 321
           L  SD S  S  + ++G  FSYCL     +   S  LI G + S    F+ +TP+  T +
Sbjct: 221 LAFSDFSFTSTATSLYGAKFSYCLVDHLSNKNVSNYLIFGSSRSTKTAFRRTTPLDLTRI 280

Query: 322 IPNPQLATFYILNLTGISIGGKQLQA-----SGFAKGGILIDSGTVITRLPPSIYSALK- 375
            P      FY +N+ GIS+G   L          + GG ++DSGT +T L  + Y  +  
Sbjct: 281 PP------FYAINVIGISLGYDMLDIPSQVWDATSGGGTILDSGTSLTLLADAAYKQVVT 334

Query: 376 --AEFLKQFSGFPSAPGFSILDTCFNL-SAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFV 432
             A +L +       P    ++ CF+  S +    +P +    +G A          Y V
Sbjct: 335 GLARYLVELKRV--KPEGVPIEYCFSFTSGFNVSKLPQLTFHLKGGARFEPHRKS--YLV 390

Query: 433 KSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
            +     CL   S      T +IGN  Q+N    +D   S L FA   C+
Sbjct: 391 DAAPGVKCLGFVSAG-TPATNVIGNIMQQNYLWEFDLMASTLSFAPSACT 439


>gi|125555046|gb|EAZ00652.1| hypothetical protein OsI_22673 [Oryza sativa Indica Group]
          Length = 340

 Score =  130 bits (327), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 89/270 (32%), Positives = 131/270 (48%), Gaps = 23/270 (8%)

Query: 172 DPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELG 231
           D  FDPS S S+  + C S  C A+E        C+ +S   C + + +G+ +   G L 
Sbjct: 30  DVAFDPSRSSSFAAIPCGSPEC-AVE--------CTGAS---CPFTIQFGNVTVANGTLV 77

Query: 232 REHLGLGK-ASVNDFIFGCGR--NNKGLFGGVSGLMGLGRSDLSLVSQT-----SEIFGG 283
           R+ L L   A+   F FGC     +   F G  GL+ L RS  SL S+      +     
Sbjct: 78  RDTLTLSPSATFAGFTFGCIEVGADADTFDGAVGLIDLSRSSHSLASRVISNGATTTTTA 137

Query: 284 LFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGK 343
            FSYCLPS     + G L +G +   +     I Y  M  NP     Y ++L GIS+GG+
Sbjct: 138 AFSYCLPSLSSTRSRGFLSIGASRPEYSGGD-IKYAPMSSNPNHPNSYFVDLVGISVGGE 196

Query: 344 QLQASG--FAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLS 401
            L       A  G L+++ T  T L P+ Y+AL+  F    + +P+AP F +LDTC+NL+
Sbjct: 197 DLPVPPAVLAAHGTLLEAATEFTFLAPAAYAALRDAFRNDMAQYPAAPPFRVLDTCYNLT 256

Query: 402 AYQEVNIPLVKMEFEGNAEMTVDVTGIVYF 431
               + +P V + F G  E+ +DV   +YF
Sbjct: 257 GLASLAVPAVALRFAGGTELELDVRQTMYF 286


>gi|125533812|gb|EAY80360.1| hypothetical protein OsI_35532 [Oryza sativa Indica Group]
          Length = 428

 Score =  130 bits (326), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 105/362 (29%), Positives = 175/362 (48%), Gaps = 33/362 (9%)

Query: 130 LQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVL 187
           LQT  Y+ ++ LG   +   V +DTGS  +WV C+ C  C+      F  S S +  KV 
Sbjct: 77  LQTSLYVISVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNPR-TFLQSRSTTCAKVS 134

Query: 188 CNSSTCHALEFATGNSGVCSSSSP-PDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDF 245
           C +S C       G+   C  S   PDC + VSY DGS + G L ++ L       +  F
Sbjct: 135 CGTSMC----LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPSF 190

Query: 246 IFGCGRNNKGL--FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQD-----AGAS 298
            FGC  ++ G   FG V GL+G+G   +S++ Q+S  F G FSYCLP  +      +  +
Sbjct: 191 TFGCNLDSFGANEFGNVDGLLGMGAGPMSVLKQSSPRFDG-FSYCLPLQKSERGFFSKTT 249

Query: 299 GSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQ--LQASGFAKGGIL 356
           G   LG  ++     T + YT M+   +    + ++L  IS+ G++  L  S F++ G++
Sbjct: 250 GYFSLGKVAT----RTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVV 305

Query: 357 IDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFE 416
            DSG+ ++ +P    S L ++ +++      A        C+++ +  E ++P + + F+
Sbjct: 306 FDSGSELSYIPDRALSVL-SQRIRELLLRRGAAEEESERNCYDMRSVDEGDMPAISLHFD 364

Query: 417 GNAEMTVDVTGIVYFVKSDASQ---VCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQ 473
             A   +   G+  FV+    +    CLA A     +   IIG+  Q ++ V+YD K   
Sbjct: 365 DGARFDLGSHGV--FVERSVQEQDVWCLAFAP---TESVSIIGSLMQTSKEVVYDLKRQL 419

Query: 474 LG 475
           +G
Sbjct: 420 IG 421


>gi|359474399|ref|XP_003631454.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 485

 Score =  130 bits (326), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 122/425 (28%), Positives = 183/425 (43%), Gaps = 77/425 (18%)

Query: 122 IPLTSGIRLQTLNYIATIELGG---RNMTVIVDTGSDLTWVQCQP--CKSCYNQQDPVFD 176
           +PL+ G      +Y  +  LG    + +++ +DTGSDL W  C P  C  C  + D    
Sbjct: 65  LPLSPGS-----DYTLSFNLGSHPPQPISLYMDTGSDLVWFPCAPFECILCEGKYDTAAT 119

Query: 177 PSISP----SYKKVLCNSSTCHALEFATGNSGVCSSSSPP-------DCN------YFVS 219
             +SP    S   V C S  C A   +  +S +C+ +  P       DC+      ++ +
Sbjct: 120 GGLSPPNITSSASVSCKSPACSAAHTSLSSSDLCAMARCPLELIETSDCSSFSCPPFYYA 179

Query: 220 YGDGSYTRGELGREHLGLGKAS---VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQ 276
           YGDGS     L R+ L +  +S   +++F FGC        G   G+ G GR  LSL +Q
Sbjct: 180 YGDGSLV-ARLYRDSLSMPASSPLVLHNFTFGCAHTA---LGEPVGVAGFGRGVLSLPAQ 235

Query: 277 TSEI---FGGLFSYCLPS----TQDAGASGSLILGGNS-------SVFKNSTPITYTNMI 322
            +      G  FSYCL S             LILG  S        V  +     YT M+
Sbjct: 236 LASFSPHLGNQFSYCLVSHSFDADRVRRPSPLILGRYSLDDEKKKRVGHDRGEFVYTAML 295

Query: 323 PNPQLATFYILNLTGISIGGKQLQASGFAK-------GGILIDSGTVITRLPPSIYSALK 375
            NP+   FY + L GI++G +++      K       GG+++DSGT  T LP  +Y +L 
Sbjct: 296 DNPKHPYFYCVGLEGITVGNRKIPVPEILKRVDRRGNGGMVVDSGTTFTMLPAGLYESLV 355

Query: 376 AEF-------LKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGI 428
            EF        K+ +      G   L  C+  S      +P V + F GN+ + +     
Sbjct: 356 TEFNHRMGRVYKRATQIEERTG---LGPCY-YSDDSAAKVPAVALHFVGNSTVILPRNNY 411

Query: 429 VY--FVKSDASQV-----CLALASLSYEDETG----IIGNYQQKNQRVIYDTKNSQLGFA 477
            Y  F   D  +      CL L +   E E+G     +GNYQQ+   V+YD +  ++GFA
Sbjct: 412 YYEFFDGRDGQKKKRKVGCLMLMNGGDEAESGGPAATLGNYQQQGFEVVYDLEKHRVGFA 471

Query: 478 GEDCS 482
              C+
Sbjct: 472 RRKCA 476


>gi|51038078|gb|AAT93881.1| hypothetical protein [Oryza sativa Japonica Group]
          Length = 481

 Score =  129 bits (325), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 135/483 (27%), Positives = 198/483 (40%), Gaps = 87/483 (18%)

Query: 66  AITLELKHKNYCSGKIVDWNE----QQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTE 121
           A+ LEL H        VD NE    +++ R   +  H + L      + +          
Sbjct: 22  ALRLELAH--------VDANEHCTMEERVRRATERTHHRRL------LHASTAAAAGGVA 67

Query: 122 IPLTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCK----------SCYN 169
            PL    + Q   YIA+  +G   +    +VDTGSDL W QC  C+           C+ 
Sbjct: 68  APLRWSGKTQ---YIASYGIGDPPQPAEAVVDTGSDLVWTQCSTCRLPAAAAAGGGGCFP 124

Query: 170 QQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPD--CNYFVSYGDGSYTR 227
           Q  P ++ S+S + + V C+         A   +G        D  C    SYG G    
Sbjct: 125 QNLPYYNFSLSRTARAVPCDDDDGALCGVAPETAGCARGGGSGDDACVVAASYGAG-VAL 183

Query: 228 GELGREHLGLGKASVNDFIFGC---GRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGL 284
           G LG +      +S     FGC    R + G   G SG++GLGR  LSLVSQ +      
Sbjct: 184 GVLGTDAFTFPSSSSVTLAFGCVSQTRISPGALNGASGIIGLGRGALSLVSQLNAT---E 240

Query: 285 FSYCL-PSTQDAGASGSLILGGNSSVFKNST---------PITYTNMIPNPQ---LATFY 331
           FSYCL P  +D  +   L +G       ++          P+T      NP+    +TFY
Sbjct: 241 FSYCLTPYFRDTVSPSHLFVGDGELAGLSAAAGGGGGGGAPVTTVPFAKNPKDSPFSTFY 300

Query: 332 ILNLTGISIGGK--QLQASGFA---------KGGILIDSGTVITRLPPSIYSALKAEFLK 380
            L L G++ G     L A  F           GG LIDSG+  TRL    + AL  E  +
Sbjct: 301 YLPLVGLAAGNATVALPAGAFDLREAAPKVWAGGALIDSGSPFTRLVDPAHRALTKELAR 360

Query: 381 QFSGF-----PSAPGFSILDTCFNL----SAYQEVNIPLVKMEFE----GNAEMTVDVTG 427
           Q  G      P A     L+ C        +     +P + + F+    G  E+ +    
Sbjct: 361 QLRGSGSLVPPPAKLGGALELCVEAGDDGDSLAAAAVPPLVLRFDDGVGGGRELVIPAEK 420

Query: 428 IVYFVKSDASQVCLALASLSY------EDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
             Y+ + +AS  C+A+ S +        +ET IIGN+ Q++ RV+YD  N  L F   +C
Sbjct: 421 --YWARVEASTWCMAVVSSASGNATLPTNETTIIGNFMQQDMRVLYDLANGLLSFQPANC 478

Query: 482 SSM 484
           S++
Sbjct: 479 SAV 481


>gi|413951979|gb|AFW84628.1| putative aspartic protease family protein [Zea mays]
          Length = 435

 Score =  129 bits (325), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 114/367 (31%), Positives = 167/367 (45%), Gaps = 43/367 (11%)

Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNS 203
           +N+T+++DTGS+L+W+ C   ++     D  F P  S ++  V C S+ C + +     S
Sbjct: 72  QNVTMVLDTGSELSWLLCATGRAAAAAAD-SFRPRASATFAAVPCGSARCSSRDLPAPPS 130

Query: 204 GVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGC---GRNNKGLFGGV 260
             C ++S   C   +SY DGS + G L  +   +G A      FGC     ++       
Sbjct: 131 --CDAASR-RCRVSLSYADGSASDGALATDVFAVGDAPPLRSAFGCMSAAYDSSPDAVAT 187

Query: 261 SGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTN 320
           +GL+G+ R  LS V+Q S      FSYC+    DAG    L+LG +   F    P+ YT 
Sbjct: 188 AGLLGMNRGALSFVTQAST---RRFSYCISDRDDAGV---LLLGHSDLPF---LPLNYTP 238

Query: 321 MI-PNPQLATF----YILNLTGISIGGKQLQ-------ASGFAKGGILIDSGTVITRLPP 368
           +  P P L  F    Y + L GI +GGK L              G  ++DSGT  T L  
Sbjct: 239 LYQPTPPLPYFDRVAYSVQLLGIRVGGKPLPIPPSVLAPDHTGAGQTMVDSGTQFTFLLG 298

Query: 369 SIYSALKAEFLKQFSGFPSA---PGFSI---LDTCFNLS---AYQEVNIPLVKMEFEGNA 419
             YSA+KAEFLKQ      A   P F+     DTCF +          +P V + F G A
Sbjct: 299 DAYSAVKAEFLKQTKPLLPALEDPSFAFQEAFDTCFRVPKGRPPPSARLPPVTLLFNG-A 357

Query: 420 EMTVDVTGIVYFVKSDASQV----CLALASLSYEDETG-IIGNYQQKNQRVIYDTKNSQL 474
           +M+V    ++Y V  +        CL   +      T  +IG++ Q N  V YD +  ++
Sbjct: 358 QMSVAGDRLLYKVPGERRGADGVWCLTFGNADMVPLTAYVIGHHHQMNLWVEYDLERGRV 417

Query: 475 GFAGEDC 481
           G A   C
Sbjct: 418 GLAPVKC 424


>gi|359494621|ref|XP_002265771.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 449

 Score =  129 bits (325), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 111/389 (28%), Positives = 173/389 (44%), Gaps = 35/389 (8%)

Query: 121 EIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQ-PCKS--CYNQQ---- 171
           E+P+          Y    ++G   +   ++ DTGSDLTW+ C+  C+S  C N++    
Sbjct: 69  EVPMHPAADYGIGQYSVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRI 128

Query: 172 --DPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGE 229
               VF  ++S S+K + C +  C        +   C +   P C Y   Y DGS   G 
Sbjct: 129 RHKRVFHANLSSSFKTIPCLTDMCKIELMDLFSLTNCPTPLTP-CGYDYRYSDGSTALGF 187

Query: 230 LGREHLGL-----GKASVNDFIFGCGRNNKGL-FGGVSGLMGLGRSDLSLVSQTSEIFGG 283
              E + +      K  +++ + GC  + +G  F    G+MGLG S  S   + +E FGG
Sbjct: 188 FANETVTVELKEGRKMKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSFAIKAAEKFGG 247

Query: 284 LFSYCL-PSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGG 342
            FSYCL         S  L  G + S       +TYT ++    + +FY +N+ GISIGG
Sbjct: 248 KFSYCLVDHLSHKNVSNYLTFGSSRSKEALLNNMTYTELVLG-MVNSFYAVNMMGISIGG 306

Query: 343 KQLQASGFA-----KGGILIDSGTVITRLPPSIY----SALKAEFLKQFSGFPSAPGFSI 393
             L+           GG ++DSG+ +T L    Y    +AL+   LK F       G   
Sbjct: 307 AMLKIPSEVWDVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLK-FRKVEMDIG--P 363

Query: 394 LDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETG 453
           L+ CFN + ++E  +P +   F   AE    V    Y + +     CL   S+++   T 
Sbjct: 364 LEYCFNSTGFEESLVPRLVFHFADGAEFEPPVKS--YVISAADGVRCLGFVSVAWPG-TS 420

Query: 454 IIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
           ++GN  Q+N    +D    +LGFA   C+
Sbjct: 421 VVGNIMQQNHLWEFDLGLKKLGFAPSSCT 449


>gi|21717173|gb|AAM76366.1|AC074196_24 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433291|gb|AAP54829.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
          Length = 413

 Score =  129 bits (325), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 105/372 (28%), Positives = 171/372 (45%), Gaps = 29/372 (7%)

Query: 128 IRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKK 185
           IR     Y+A   +G   +  + IVD   +L W QC  C+ C+ Q  PVF P+ S ++K 
Sbjct: 55  IRWSPPYYVANFTIGTPPQPASAIVDVAGELVWTQCSACRRCFKQDLPVFVPNASSTFKP 114

Query: 186 VLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDF 245
             C ++ C ++   + +  VCS   PP      +   G+ T G    +   +G A+V   
Sbjct: 115 EPCGTAVCESIPTRSCSGDVCSYKGPP------TQLRGN-TSGFAATDTFAIGTATVR-L 166

Query: 246 IFGC-GRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILG 304
            FGC   ++     G SG +GLGR+  SLV+Q        FSYCL S ++ G S  L LG
Sbjct: 167 AFGCVVASDIDTMDGPSGFIGLGRTPWSLVAQMKLT---RFSYCL-SPRNTGKSSRLFLG 222

Query: 305 GNSSVF--KNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGGILI-DSGT 361
            ++ +   ++++   +    P+     +Y+L+L  I  G   +  +    GGIL+  + +
Sbjct: 223 SSAKLAGGESTSTAPFIKTSPDDDSHHYYLLSLDAIRAGNTTIATA--QSGGILVMHTVS 280

Query: 362 VITRLPPSIYSALKAEFLKQFSG---FPSAPGFSILDTCFNLSA-YQEVNIPLVKMEFEG 417
             + L  S Y A K    +   G    P A      D CF  +A +     P +   F+G
Sbjct: 281 PFSLLVDSAYRAFKKAVTEAVGGAAAPPMATPPQPFDLCFKKAAGFSRATAPDLVFTFQG 340

Query: 418 NAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETG-----IIGNYQQKNQRVIYDTKNS 472
            A +TV     +  V  +    C A+ S+++ + TG     ++G+ QQ++   +YD K  
Sbjct: 341 AAALTVPPAKYLIDVGEEKDTACAAILSMAWLNRTGLEGVSVLGSLQQEDVHFLYDLKKE 400

Query: 473 QLGFAGEDCSSM 484
            L F   DCSS+
Sbjct: 401 TLSFEPADCSSL 412


>gi|225465837|ref|XP_002264626.1| PREDICTED: aspartic proteinase nepenthesin-1-like isoform 1 [Vitis
           vinifera]
          Length = 437

 Score =  129 bits (325), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 122/416 (29%), Positives = 188/416 (45%), Gaps = 50/416 (12%)

Query: 82  VDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRL-QTLNYIATIE 140
           + W E        D   +Q+L S +             + +P+ SG ++ Q   YI   +
Sbjct: 57  LSWEESVLQMQAKDKARLQFLSSLVAR----------KSVVPIASGRQIVQNPTYIVRAK 106

Query: 141 LG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEF 198
           +G   + M + +DT SD+ W+ C  C  C +    +F+   S +YK + C ++ C  +  
Sbjct: 107 IGTPAQTMLMAMDTSSDVAWIPCNGCLGCSST---LFNSPASTTYKSLGCQAAQCKQVPK 163

Query: 199 ATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFG 258
            T   GVCS        + ++YG GS     L ++ + L   +V  + FGC +   G   
Sbjct: 164 PTCGGGVCS--------FNLTYG-GSSLAANLSQDTITLATDAVPGYSFGCIQKATGGSL 214

Query: 259 GVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITY 318
              GL+GLGR  LSL+SQT  ++   FSYCLPS +    SGSL LG      +    I Y
Sbjct: 215 PAQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPSFKSLNFSGSLRLGPVGQPKR----IKY 270

Query: 319 TNMIPNPQLATFYILNLTGISI---------GGKQLQASGFAKGGILIDSGTVITRLPPS 369
           T ++ NP+  + Y +NL  + +         G      S  A  G + DSGTV TRL   
Sbjct: 271 TPLLKNPRRPSLYFVNLMAVRVGRRVVDVPPGSFTFNPSTGA--GTIFDSGTVFTRLVTP 328

Query: 370 IYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIV 429
            Y A++  F  +     +       DTC+ +     +  P +   F G   M V +    
Sbjct: 329 AYIAVRDAFRNRVGRNLTVTSLGGFDTCYTV----PIAAPTITFMFTG---MNVTLPPDN 381

Query: 430 YFVKSDA-SQVCLALASL--SYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
             + S A S  CLA+A+   +      +I N QQ+N R++YD  NS+LG A E C+
Sbjct: 382 LLIHSTAGSTTCLAMAAAPDNVNSVLNVIANLQQQNHRLLYDVPNSRLGVARELCT 437


>gi|302790323|ref|XP_002976929.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
 gi|300155407|gb|EFJ22039.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
          Length = 373

 Score =  129 bits (324), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 113/366 (30%), Positives = 176/366 (48%), Gaps = 33/366 (9%)

Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNS 203
           R + ++VDT S+LTWVQ   C +C   + P F+P +S S+    C SS C       G  
Sbjct: 10  REVLLLVDTASELTWVQGTSCTNCSPTKVPPFNPGLSSSFISEPCTSSVCLGRS-KLGFQ 68

Query: 204 GVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL----GKAS-VNDFIFGCGRNN-KGLF 257
             C+ S+   C++ V+Y DGS   G + RE   L    G AS + D IFGC   + +   
Sbjct: 69  SACNRST-GSCSFQVAYLDGSEAYGVIAREIFSLQSWDGAASTLGDVIFGCASKDLQRPV 127

Query: 258 GGVSGLMGLGRSDLSLVSQT-SEIFGGL---FSYCLPSTQDAGASGSLILGGNSSVFKNS 313
              SG +GL R   S  +Q  S    GL   FSYC P+  +   S  +I+ G+S +  + 
Sbjct: 128 DFSSGTLGLNRGSFSFPAQIGSRSKSGLSDRFSYCFPNRAEHLNSSGVIIFGDSGIPAHH 187

Query: 314 TPITYTNMIPNPQLAT---FYILNLTGISIGGKQLQ--ASGF-----AKGGILIDSGTVI 363
               Y ++   P +A+   FY + L GIS+GG+ L    S F       GG   DSGT +
Sbjct: 188 --FQYLSLEQEPPIASIVDFYYVGLQGISVGGELLHIPRSAFKIDRLGNGGTYFDSGTTV 245

Query: 364 TRLPPSIYSALKAEFLKQFSGFPSAPGFSIL-DTCFNLSA--YQEVNIPLVKMEFEGNAE 420
           + L    ++AL   F ++        G     + C++++A   +    PLV + F+ N +
Sbjct: 246 SFLVEPAHTALVEAFGRRVLHLNRTSGSDFTKELCYDVAAGDARLPTAPLVTLHFKNNVD 305

Query: 421 MTVDVTGIVYFVKSDASQV---CLALASLSYEDETG--IIGNYQQKNQRVIYDTKNSQLG 475
           M +     V+   +   QV   CLA  +     + G  +IGNYQQ++  + +D + S++G
Sbjct: 306 MELREAS-VWVPLARTPQVVTICLAFVNAGAVAQGGVNVIGNYQQQDYLIEHDLERSRIG 364

Query: 476 FAGEDC 481
           FA  +C
Sbjct: 365 FAPANC 370


>gi|15226358|ref|NP_180389.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|4803959|gb|AAD29831.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|330252998|gb|AEC08092.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 756

 Score =  129 bits (324), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 106/371 (28%), Positives = 162/371 (43%), Gaps = 58/371 (15%)

Query: 135 YIATIELGGRNMTVI--VDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
           Y+  +++G     ++  +DTGSD+ W QC PC +CY+Q  P+FDPS S ++++  CN ++
Sbjct: 421 YLMKLQVGTPPFEIVAEIDTGSDIIWTQCMPCPNCYSQFAPIFDPSKSSTFREQRCNGNS 480

Query: 193 CHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIF----- 247
           CH                     Y + Y D +Y++G L  E + +   S   F+      
Sbjct: 481 CH---------------------YEIIYADKTYSKGILATETVTIPSTSGEPFVMAETKI 519

Query: 248 GCGRNN-----KGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLI 302
           GCG +N      G     SG++GL    LSL+SQ    + GL SYC      +G   S I
Sbjct: 520 GCGLDNTNLQYSGFASSSSGIVGLNMGPLSLISQMDLPYPGLISYCF-----SGQGTSKI 574

Query: 303 LGGNSSVFKNSTPITYTNMIP--NPQLATFYILNLTGISIGGKQLQASGFA----KGGIL 356
             G +++      +     I   NP    FY LNL  +S+    +   G       G I 
Sbjct: 575 NFGTNAIVAGDGTVAADMFIKKDNP----FYYLNLDAVSVEDNLIATLGTPFHAEDGNIF 630

Query: 357 IDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNI---PLVKM 413
           IDSGT +T  P S Y  L  E ++Q       P         NL  Y    I   P++ M
Sbjct: 631 IDSGTTLTYFPMS-YCNLVREAVEQVVTAVKVPDMG----SDNLLCYYSDTIDIFPVITM 685

Query: 414 EFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQ 473
            F G A++ +D   + Y         CLA+   +      + GN  Q N  V YD  ++ 
Sbjct: 686 HFSGGADLVLDKYNM-YLETITGGIFCLAIG-CNDPSMPAVFGNRAQNNFLVGYDPSSNV 743

Query: 474 LGFAGEDCSSM 484
           + F+  +CS++
Sbjct: 744 ISFSPTNCSAL 754



 Score =  124 bits (312), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 101/338 (29%), Positives = 149/338 (44%), Gaps = 50/338 (14%)

Query: 150 VDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSS 209
           +DTGSDL W QC PC  CY+Q DP+FDPS S ++ +  C+  +CH               
Sbjct: 99  IDTGSDLIWTQCMPCPDCYSQFDPIFDPSKSSTFNEQRCHGKSCH--------------- 143

Query: 210 SPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIF-----GCG-----RNNKGLFGG 259
                 Y + Y D +Y++G L  E + +   S   F+      GCG      +N G    
Sbjct: 144 ------YEIIYEDNTYSKGILATETVTIHSTSGEPFVMAETTIGCGLHNTDLDNSGFASS 197

Query: 260 VSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYT 319
            SG++GL     SL+SQ    + GL SYC      +G   S I  G +++      +   
Sbjct: 198 SSGIVGLNMGPRSLISQMDLPYPGLISYCF-----SGQGTSKINFGTNAIVAGDGTVAAD 252

Query: 320 NMIP--NPQLATFYILNLTGISIGGKQLQASGFA----KGGILIDSGTVITRLPPSIYSA 373
             I   NP    FY LNL  +S+   +++  G       G I+IDSG+ +T  P S Y  
Sbjct: 253 MFIKKDNP----FYYLNLDAVSVEDNRIETLGTPFHAEDGNIVIDSGSTVTYFPVS-YCN 307

Query: 374 LKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVK 433
           L  + ++Q       P  S  D     S   ++  P++ M F G A++ +D   + Y   
Sbjct: 308 LVRKAVEQVVTAVRVPDPSGNDMLCYFSETIDI-FPVITMHFSGGADLVLDKYNM-YMES 365

Query: 434 SDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKN 471
           +     CLA+   S   E  I GN  Q N  V YD+ +
Sbjct: 366 NSGGLFCLAIICNSPTQE-AIFGNRAQNNFLVGYDSSS 402


>gi|388516465|gb|AFK46294.1| unknown [Medicago truncatula]
          Length = 434

 Score =  129 bits (324), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 132/442 (29%), Positives = 210/442 (47%), Gaps = 40/442 (9%)

Query: 44  WQQKSGSSSSCVSHQKSRIEMGAITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQ 103
           +   S ++  C S Q    ++  I +  K   +   K   W+ +  N    D   + YL 
Sbjct: 16  FMSMSNATDPCAS-QPDDSDLNVIPMYGKCSPFNPQKTDSWDNRVLNMASKDPARMSYLS 74

Query: 104 SRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQC 161
           S +        K VS+   P+ SG      NYI  +++G  G+ + +++DT +D  ++  
Sbjct: 75  SLVAQ------KTVSSA--PIASGQAFNIGNYIVRVKIGTPGQLLFMVLDTSTDEAFIPS 126

Query: 162 QPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYG 221
             C  C       F P+ S SY  + C+   C  +   +     C ++    C++  SY 
Sbjct: 127 SGCIGC---SATTFSPNASTSYVPLECSVPQCSQVRGLS-----CPATGSGACSFNKSYA 178

Query: 222 DGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIF 281
             +Y+   L ++ L L    +  + FG      G      GL+GLGR  LSL+SQT  ++
Sbjct: 179 GSTYS-ATLVQDSLRLATDVIPSYSFGSINAISGSSIPAQGLLGLGRGPLSLLSQTGSLY 237

Query: 282 GGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIG 341
            G+FSYCLPS +    SGSL LG           I  T ++ NP+  + Y +NLTGI++G
Sbjct: 238 SGVFSYCLPSFKSYYFSGSLKLGP----VGQPKSIRTTPLLRNPRRPSLYFVNLTGITVG 293

Query: 342 G------KQLQASGFAKG-GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSIL 394
                  K+L A     G G +IDSGTVITR    +Y+A++ EF KQ +G  S+ G    
Sbjct: 294 KVNVPFPKELLAFDVNTGSGTIIDSGTVITRFVEPVYNAVRDEFRKQVTGPFSSLG--AF 351

Query: 395 DTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYE---DE 451
           DTCF +  Y+ +  P + + F  + ++ + +   +    S  S  CLA+AS         
Sbjct: 352 DTCF-VKNYETL-APAITLHFT-DLDLKLPLENSLIH-SSSGSLACLAMASTPKNVNYTV 407

Query: 452 TGIIGNYQQKNQRVIYDTKNSQ 473
             +I NYQQ+N RV++DT N++
Sbjct: 408 LNVIANYQQQNLRVLFDTVNNK 429


>gi|297840891|ref|XP_002888327.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297334168|gb|EFH64586.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 473

 Score =  129 bits (324), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 107/395 (27%), Positives = 183/395 (46%), Gaps = 49/395 (12%)

Query: 117 VSNTEIPLTSGIRLQTLN-YIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQD- 172
           +++ ++PL    R+ ++  Y   I+LG   +   V VDTGSD+ WV C+PC  C ++ + 
Sbjct: 55  LASIDLPLGGDSRVDSVGLYFTKIKLGSPPKEYHVQVDTGSDILWVNCKPCPECPSKTNL 114

Query: 173 ----PVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRG 228
                +FD + S + KKV C+   C  +      S  C  +    C+Y + Y D S + G
Sbjct: 115 NFHLSLFDVNASSTSKKVGCDDDFCSFIS----QSDSCQPAV--GCSYHIVYADESTSEG 168

Query: 229 ELGREHLGLGKASVN--------DFIFGCGRNNKGLFG----GVSGLMGLGRSDLSLVSQ 276
              R+ L L + + +        + +FGCG +  G  G     V G+MG G+S+ S++SQ
Sbjct: 169 NFIRDKLTLEQVTGDLQTGPLGQEVVFGCGSDQSGQLGKSDSAVDGVMGFGQSNTSVLSQ 228

Query: 277 TSEIFGG--LFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILN 334
            +       +FS+CL + +  G     ++        +S  +  T M+PN      Y + 
Sbjct: 229 LAATGDAKRVFSHCLDNVKGGGIFAVGVV--------DSPKVKTTPMVPN---QMHYNVM 277

Query: 335 LTGISIGGKQLQ--ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFS 392
           L G+ + G  L    S    GG ++DSGT +   P  +Y +L    L +           
Sbjct: 278 LMGMDVDGTALDLPPSIMRNGGTIVDSGTTLAYFPKVLYDSLIETILAR----QPVKLHI 333

Query: 393 ILDT--CFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYED 450
           + DT  CF+ S   +V  P V  EFE + ++TV     ++ ++ +          L+  +
Sbjct: 334 VEDTFQCFSFSENVDVAFPPVSFEFEDSVKLTVYPHDYLFTLEKELYCFGWQAGGLTTGE 393

Query: 451 ETGII--GNYQQKNQRVIYDTKNSQLGFAGEDCSS 483
            T +I  G+    N+ V+YD +N  +G+A  +CSS
Sbjct: 394 RTEVILLGDLVLSNKLVVYDLENEVIGWADHNCSS 428


>gi|242070719|ref|XP_002450636.1| hypothetical protein SORBIDRAFT_05g008470 [Sorghum bicolor]
 gi|241936479|gb|EES09624.1| hypothetical protein SORBIDRAFT_05g008470 [Sorghum bicolor]
          Length = 410

 Score =  129 bits (324), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 124/390 (31%), Positives = 177/390 (45%), Gaps = 41/390 (10%)

Query: 98  HVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELGGRNMTVIVDTGSDLT 157
            +  L +R+ +  SG+ +    T + L SG     + +  +I    + ++ + DTGSDL 
Sbjct: 53  RLSMLAARLDDAASGSAQ----TPLQLDSGGGAYDMTF--SIGTPPQELSALADTGSDLI 106

Query: 158 WVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYF 217
           W +C  C  C  Q  P + P+ S S+ K+ C+ S C  L      S  CS+    +C+Y 
Sbjct: 107 WAKCGACTRCVPQGSPSYYPNKSSSFSKLPCSGSLCSDLP-----SSQCSAGGA-ECDYK 160

Query: 218 VSYGDGS----YTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSL 273
            SYG  S    YT+G LG E   LG  +V    FGC   ++G +G  SGL+GLGR  LSL
Sbjct: 161 YSYGLASDPHHYTQGYLGSETFTLGSDAVPGIGFGCTTMSEGGYGSGSGLVGLGRGPLSL 220

Query: 274 VSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNS--SVFKNSTPITYTNMIPNPQLATFY 331
           VSQ +    G FSYCL  T DA  +  L+ G  +       STP+  T+         +Y
Sbjct: 221 VSQLNV---GAFSYCL--TSDAAKTSPLLFGSGALTGAGVQSTPLLRTSTY-------YY 268

Query: 332 ILNLTGISIGGKQLQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGF 391
            +NL  ISIG      +G    GI+ DSGT +  L    Y+  K   L Q +    A G 
Sbjct: 269 TVNLESISIGAATTAGTG--SSGIIFDSGTTVAFLAEPAYTLAKEAVLSQTTNLTMASGR 326

Query: 392 SILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDE 451
              + CF  S       P + + F+G     +D+    YF   D S  C  +        
Sbjct: 327 DGYEVCFQTSG---AVFPSMVLHFDGG---DMDLPTENYFGAVDDSVSCWIVQK---SPS 377

Query: 452 TGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
             I+GN  Q N  + YD + S L F   +C
Sbjct: 378 LSIVGNIMQMNYHIRYDVEKSMLSFQPANC 407


>gi|115484725|ref|NP_001067506.1| Os11g0215400 [Oryza sativa Japonica Group]
 gi|77549255|gb|ABA92052.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113644728|dbj|BAF27869.1| Os11g0215400 [Oryza sativa Japonica Group]
          Length = 428

 Score =  129 bits (324), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 105/362 (29%), Positives = 175/362 (48%), Gaps = 33/362 (9%)

Query: 130 LQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVL 187
           LQT  Y+ ++ LG   +   V +DTGS  +WV C+ C  C+      F  S S +  KV 
Sbjct: 77  LQTSLYVISVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNPR-TFLQSRSTTCAKVS 134

Query: 188 CNSSTCHALEFATGNSGVCSSSSP-PDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDF 245
           C +S C       G+   C  S   PDC + VSY DGS + G L ++ L       +  F
Sbjct: 135 CGTSMC----LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPGF 190

Query: 246 IFGCGRNNKGL--FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQD-----AGAS 298
            FGC  ++ G   FG V GL+G+G   +S++ Q+S  F   FSYCLP  +      +  +
Sbjct: 191 SFGCNMDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTF-DCFSYCLPLQKSERGFFSKTT 249

Query: 299 GSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQ--LQASGFAKGGIL 356
           G   LG  ++     T + YT M+   +    + ++LT IS+ G++  L  S F++ G++
Sbjct: 250 GYFSLGKVAT----RTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSVFSRKGVV 305

Query: 357 IDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFE 416
            DSG+ ++ +P    S L ++ +++      A        C+++ +  E ++P + + F+
Sbjct: 306 FDSGSELSYIPDRALSVL-SQRIRELLLKRGAAEEESERNCYDMRSVDEGDMPAISLHFD 364

Query: 417 GNAEMTVDVTGIVYFVKSDASQ---VCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQ 473
             A   +   G+  FV+    +    CLA A     +   IIG+  Q ++ V+YD K   
Sbjct: 365 DGARFDLGSHGV--FVERSVQEQDVWCLAFAP---TESVSIIGSLMQTSKEVVYDLKRQL 419

Query: 474 LG 475
           +G
Sbjct: 420 IG 421


>gi|413925432|gb|AFW65364.1| hypothetical protein ZEAMMB73_378208 [Zea mays]
          Length = 418

 Score =  129 bits (323), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 116/358 (32%), Positives = 169/358 (47%), Gaps = 28/358 (7%)

Query: 135 YIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
           Y  T  +G   + ++ + DTGSDL W +C  CK C  +    + P+ S S+ K+ C+S+ 
Sbjct: 81  YDMTFSMGTPPQTLSALADTGSDLIWAKCGACKRCAPRGSASYYPTKSSSFSKLPCSSAL 140

Query: 193 CHALEFATGNSGVCSSSSPPDCNYFVSYGDGS----YTRGELGREHLGLGKASVNDFIFG 248
           C  LE  +  +   + +    C+Y  SYG  S    YT+G +G E   LG  +V    FG
Sbjct: 141 CRTLESQSLATCGGTRARGAVCSYRYSYGLSSNPHHYTQGYMGSETFTLGSDAVQGIGFG 200

Query: 249 CGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSS 308
           C   ++G +G  SGL+GLGR  LSLV Q   +  G FSYCL  T D   S  L+ G  + 
Sbjct: 201 CTTMSEGGYGSGSGLVGLGRGKLSLVRQ---LKVGAFSYCL--TSDPSTSSPLLFGAGAL 255

Query: 309 VFK--NSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGGILIDSGTVITRL 366
                 STP+       N + +TFY +NL  ISIG  +   +G  + GI+ DSGT +T L
Sbjct: 256 TGPGVQSTPLV------NLKTSTFYTVNLDSISIGAAKTPGTG--RHGIIFDSGTTLTFL 307

Query: 367 PPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVT 426
               Y+  +A  L Q +     PG    + CF  S       P + + F+G  +M +   
Sbjct: 308 AEPAYTLAEAGLLSQTTNLTRVPGTDGYEVCFQTSG--GAVFPSMVLHFDGG-DMALKTE 364

Query: 427 GIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
              YF   + S  C  +       E  I+GN  Q +  + YD   S L F   +C S+
Sbjct: 365 N--YFGAVNDSVSCWLVQ--KSPSEMSIVGNIMQMDYHIRYDLDKSVLSFQPTNCDSV 418


>gi|297826117|ref|XP_002880941.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297326780|gb|EFH57200.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 397

 Score =  129 bits (323), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 108/367 (29%), Positives = 159/367 (43%), Gaps = 49/367 (13%)

Query: 135 YIATIELGGRNMTVI--VDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
           Y+  ++LG     ++  +DTGSDL W QC PC +CY Q  P+FDPS S ++K+  C+   
Sbjct: 61  YLMRLQLGTPPFEIVAEIDTGSDLIWTQCMPCPNCYTQFAPIFDPSKSSTFKEKRCH--- 117

Query: 193 CHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIF----- 247
                   GNS          C Y + Y D SY+ G L  E + +   S   F+      
Sbjct: 118 --------GNS----------CPYEIIYADESYSTGILATETVTIQSTSGEPFVMAETSI 159

Query: 248 GCGRNNKGLF-----GGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLI 302
           GCG NN  L         SG++GL     SL+SQ      GL SYC  S      +  + 
Sbjct: 160 GCGLNNSNLMTPGYAASSSGIVGLNMGPSSLISQMDLPIPGLISYCFSSQ----GTSKIN 215

Query: 303 LGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA----KGGILID 358
            G N+ V  + T +     I   Q   FY LNL  +S+G K+++  G       G I ID
Sbjct: 216 FGTNAVVAGDGT-VAADMFIKKDQ--PFYYLNLDAVSVGDKRIETLGTPFHAQDGNIFID 272

Query: 359 SGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILD-TCFNLSAYQEVNIPLVKMEFEG 417
           SGT  T LP S  + ++             P  S  +  C+N    +    P++ + F G
Sbjct: 273 SGTTYTYLPTSYCNLVREAVAASVVAANQVPDPSSENLLCYNWDTMEI--FPVITLHFAG 330

Query: 418 NAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFA 477
            A++ +D   + Y         CLA+  +       I GN    N  V YD+    + F+
Sbjct: 331 GADLVLDKYNM-YVETITGGTFCLAIGCVD-PSMPAIFGNRAHNNLLVGYDSSTLVISFS 388

Query: 478 GEDCSSM 484
             +CS++
Sbjct: 389 PTNCSAL 395


>gi|242041951|ref|XP_002468370.1| hypothetical protein SORBIDRAFT_01g044790 [Sorghum bicolor]
 gi|241922224|gb|EER95368.1| hypothetical protein SORBIDRAFT_01g044790 [Sorghum bicolor]
          Length = 408

 Score =  128 bits (322), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 113/384 (29%), Positives = 167/384 (43%), Gaps = 64/384 (16%)

Query: 120 TEIPLTSGIRLQTL-NYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFD 176
           T  P+ SG   QT  +Y+    LG   + + + +DT +D TW  C PC +C       F 
Sbjct: 66  TSAPVASG---QTPPSYVVRAGLGTPVQQLLLALDTSADATWSHCAPCDTCPAGSR--FI 120

Query: 177 PSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLG 236
           P+ S SY  + C S  C                                  GE GR    
Sbjct: 121 PASSSSYASLPCASDWCPLFRRPA-------------------------VPGEPGR---- 151

Query: 237 LGKASVNDFIFGCGRNNK-GLFGGVSGLMGLGRSD--------LSLVSQTSEIFGGLFSY 287
           +G A+    +    R  + G+        G  R+         +SL+SQT   + G+FSY
Sbjct: 152 VGAAADVRLLQAASRTPRSGVLAATR--CGWARTPSPATRSGPMSLLSQTGSRYNGVFSY 209

Query: 288 CLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQA 347
           CLPS +    SGSL LG      +N   + YT ++ NP   + Y +N+TG+S+G   ++A
Sbjct: 210 CLPSYRSYYFSGSLRLGAAGQP-RN---VRYTPLLTNPHRPSLYYVNVTGLSVGRALVKA 265

Query: 348 SG--FA-----KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNL 400
               FA       G +IDSGTVITR    +Y+AL+ EF +Q +           DTCFN 
Sbjct: 266 PAGSFAFDPSTGAGTVIDSGTVITRWTAPVYAALRDEFRRQVAAPSGYTSLGAFDTCFNT 325

Query: 401 SAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV-CLALASLS--YEDETGIIGN 457
                   P V +   G  ++T+ +      + S A+ + CLA+A           ++ N
Sbjct: 326 DEVAAGGAPPVTLHMGGGVDLTLPMENT--LIHSSATPLACLAMAEAPQNVNSVVNVVAN 383

Query: 458 YQQKNQRVIYDTKNSQLGFAGEDC 481
            QQ+N RV+ D   S++GFA E C
Sbjct: 384 LQQQNVRVVVDVAGSRVGFAREPC 407


>gi|297825301|ref|XP_002880533.1| hypothetical protein ARALYDRAFT_481251 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297326372|gb|EFH56792.1| hypothetical protein ARALYDRAFT_481251 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 430

 Score =  127 bits (320), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 110/415 (26%), Positives = 185/415 (44%), Gaps = 52/415 (12%)

Query: 98  HVQYL----QSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELGGRNMT--VIVD 151
           H+Q++     +R K + +  +K++ +++  +     ++T  +     +G   +    I+D
Sbjct: 27  HIQHMTDISSARFKYLQNSIVKELGSSDFQVDVHQAIKTSLFFVNFSVGQPPVPQFTIMD 86

Query: 152 TGSDLTWVQCQPCKSCYNQQ--DPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSS 209
           TGS L W+QC PCK C +     PVF+P++S ++ +  C+   C         +G CSS+
Sbjct: 87  TGSSLLWIQCHPCKHCSSNHMIHPVFNPALSSTFVECSCDDRFCRY-----APNGHCSSN 141

Query: 210 SPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFI-----FGCGRNN-KGLFGGVSGL 263
               C Y   Y  G+ ++G L +E L     + N  +     FGCG  N + L    +G+
Sbjct: 142 K---CVYEQVYISGTGSKGVLAKERLTFTTPNGNTVVTQPIAFGCGHENGEQLESEFTGI 198

Query: 264 MGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAG-ASGSLILGGNSSVFKNSTPITYTNMI 322
           +GLG    SL  Q     G  FSYC+    +       L+LG ++ +  + TPI +    
Sbjct: 199 LGLGAKPTSLAVQ----LGSKFSYCIGDLANKNYGYNQLVLGEDADILGDPTPIEFETE- 253

Query: 323 PNPQLATFYILNLTGISIGGKQLQASGF------AKGGILIDSGTVITRLPPSIYSALKA 376
                   Y +NL GIS+G KQL           ++ G+++D+GT+ T L    Y  L  
Sbjct: 254 -----NGIYYMNLEGISVGDKQLNIEPVVFKRRGSRTGVILDTGTLYTWLADIAYRELYN 308

Query: 377 EFLKQFSGFPSAPGFSILD-TCFNLSAYQE-VNIPLVKMEFEGNAEMTVDVTGIVY-FVK 433
           E        P    F   D  C++    +E +  P+V   F G AE+ ++ T + Y   +
Sbjct: 309 EIKSILD--PKLERFWFRDFLCYHGRVNEELIGFPVVTFHFAGGAELAMEATSMFYPMTE 366

Query: 434 SDASQ--VCLALASLS-----YEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
           SD      C+++   +     Y+D T  IG   Q+   + YD K   +     DC
Sbjct: 367 SDTYHNVFCMSVRPTTEHGGEYKDFTA-IGLMAQQYYNIAYDLKERNIYLQRIDC 420


>gi|84453222|dbj|BAE71208.1| hypothetical protein [Trifolium pratense]
 gi|84453226|dbj|BAE71210.1| hypothetical protein [Trifolium pratense]
          Length = 437

 Score =  127 bits (320), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 131/440 (29%), Positives = 205/440 (46%), Gaps = 44/440 (10%)

Query: 56  SHQKSRIEMGAITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIK 115
           + Q    ++  I +  K   +   K   W+ +  N    D   + YL + +    +    
Sbjct: 27  ASQPDDSDLNVIPMYGKCSPFNPPKADSWDNRVINMASKDPARMSYLSTLVAQKTA---- 82

Query: 116 DVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDP 173
               T  P+ SG      NY+  +++G  G+ + +++DT +D  +V    C  C      
Sbjct: 83  ----TSAPIASGQTFNIGNYVVRVKIGTPGQLLFMVLDTSTDEAFVPSSGCIGC---SAT 135

Query: 174 VFDPSISPSYKKVLCNSSTC---HALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGEL 230
            F P++S S+  + C+   C     L      SG CS        +  SY  GS     L
Sbjct: 136 TFYPNVSTSFVPLDCSVPQCGQVRGLSCPATGSGACS--------FNQSYA-GSTFSATL 186

Query: 231 GREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLP 290
            ++ L L    +  + FG      G      GL+GLGR  LSL+SQ+  I+ G+FSYCLP
Sbjct: 187 VQDSLRLATDVIPSYSFGSINAISGSSVPAQGLLGLGRGPLSLLSQSGAIYSGVFSYCLP 246

Query: 291 STQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIG------GKQ 344
           S +    SGSL LG           I  T ++ NP   + Y +NLT IS+G        +
Sbjct: 247 SFKSYYFSGSLKLGP----VGQPKSIRTTPLLHNPHRPSLYYVNLTAISVGRVYVPLPSE 302

Query: 345 LQASGFAKG-GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAY 403
           L A   + G G +IDSGTVITR    IY+A++ EF KQ +G  S+ G    DTCF +  Y
Sbjct: 303 LLAFNPSTGAGTIIDSGTVITRFVEPIYNAVRDEFRKQVTGPFSSLG--AFDTCF-VKNY 359

Query: 404 QEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDET--GIIGNYQQK 461
           + +  P + + F  + ++ + +   +    S  S  CLA+A+      +   +I N+QQ+
Sbjct: 360 ETL-APAITLHFT-DLDLKLPLENSLIH-SSSGSLACLAMAAAPSNVNSVLNVIANFQQQ 416

Query: 462 NQRVIYDTKNSQLGFAGEDC 481
           N RV++DT N+++G A E C
Sbjct: 417 NLRVLFDTVNNKVGIARELC 436


>gi|357127503|ref|XP_003565419.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
           distachyon]
          Length = 486

 Score =  127 bits (319), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 113/386 (29%), Positives = 170/386 (44%), Gaps = 49/386 (12%)

Query: 131 QTLNYIATIELGGRNMTV--IVDTGSDLTWVQCQPCKSCYNQQDP---VFDPSISPSYKK 185
           +   Y+  IE+G   + V  I DTGSDL WV+C+   +  N   P    F PS S +Y +
Sbjct: 106 RQFEYLMAIEVGTPPVRVLAIADTGSDLVWVKCKGKDNDNNSTAPPSVYFVPSASSTYGR 165

Query: 186 VLCNSSTCHALEFATGNSGVCSSSSPPD--CNYFVSYGDGSYTRGELGREHLGLG----- 238
           V C++  C AL  A        +S  PD  C Y  SYGDGS   G+L  E          
Sbjct: 166 VGCDTKACRALSSA--------ASCSPDGSCEYLYSYGDGSRASGQLSTETFTFSTIADS 217

Query: 239 -----------------KASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQ--TSE 279
                            +  +    FGC     G F    GL+GLG   +SL SQ   + 
Sbjct: 218 SKTNSHGNNNNNSSSHGQVEIAKLDFGCSTTTTGTFRA-DGLVGLGGGPVSLASQLGATT 276

Query: 280 IFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGIS 339
             G  FSYCL    +  AS +L  G  + V   S P   +  +   ++ T+Y + L  I+
Sbjct: 277 SLGRKFSYCLAPYANTNASSALNFGSRAVV---SEPGAASTPLITGEVETYYTIALDSIN 333

Query: 340 IGGKQLQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFN 399
           + G +   +  A+  I++DSGT +T L  ++ + L  +  ++     +     ILD C++
Sbjct: 334 VAGTKRPTTA-AQAHIIVDSGTTLTYLDSALLTPLVKDLTRRIKLPRAESPEKILDLCYD 392

Query: 400 LSAYQ---EVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIG 456
           +S  +    + IP V +   G  E+T+       FV      +CLAL + S      I+G
Sbjct: 393 ISGVRGEDALGIPDVTLVLGGGGEVTLKPDNT--FVVVQEGVLCLALVATSERQSVSILG 450

Query: 457 NYQQKNQRVIYDTKNSQLGFAGEDCS 482
           N  Q+N  V YD +   + FA  DC+
Sbjct: 451 NIAQQNLHVGYDLEKGTVTFAAADCA 476


>gi|224138580|ref|XP_002326638.1| predicted protein [Populus trichocarpa]
 gi|222833960|gb|EEE72437.1| predicted protein [Populus trichocarpa]
          Length = 496

 Score =  127 bits (319), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 127/435 (29%), Positives = 186/435 (42%), Gaps = 73/435 (16%)

Query: 113 NIKDVSNTEIPLTSGIRLQTLNYIATIELGGRNMTVIVDTGSDLTWVQCQP--CKSCYNQ 170
           N  +     +PL+ G      +Y  +  L  + + + +DTGSDL W  CQP  C  C  +
Sbjct: 65  NTHNHRQVSLPLSPGS-----DYTLSFTLDSQPIFLYLDTGSDLVWFPCQPFECILCEGK 119

Query: 171 QD-----PVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPP-------DCN--- 215
            +         P +S +   V C SS C A      +S +C+ S+ P       DC    
Sbjct: 120 AENTSLASTPPPKLSKTATPVSCKSSACSAAHSNLPSSDLCAISNCPLESIETSDCQKHS 179

Query: 216 ---YFVSYGDGSYTRGELGREHLGLGKAS-----VNDFIFGCGRNNKGLFGGVSGLMGLG 267
              ++ +YGDGS     L R+ + L  ++     VN+F FGC            G+ G G
Sbjct: 180 CPQFYYAYGDGSLI-ARLYRDSISLPLSNPTNLIVNNFTFGCAHT---ALAEPIGVAGFG 235

Query: 268 RSDLSLVSQTSEI---FGGLFSYCLPS----TQDAGASGSLILGGNSSVFK-------NS 313
           R  LSL +Q + +    G  FSYCL S    +        LILG      K       N 
Sbjct: 236 RGVLSLPAQLATLSPQLGNQFSYCLVSHSFDSDRLRRPSPLILGRYDHDEKERRVNGVNK 295

Query: 314 TPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAK-------GGILIDSGTVITRL 366
               YT+M+ N +   FY + L GISIG K++ A GF +       GG+++DSGT  T L
Sbjct: 296 PRFVYTSMLDNLEHPYFYCVGLEGISIGRKKIPAPGFLRKVDGEGSGGLVVDSGTTFTML 355

Query: 367 PPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVK---MEFEGNAEMTV 423
           P S+Y ++ AEF  +             DT  +   Y + N+  V    + F GN    V
Sbjct: 356 PASLYGSVVAEFENRVGRVNERARVIEEDTGLSPCYYFDNNVVNVPSVVLHFVGNGSSVV 415

Query: 424 DVTGIVYFVK---------SDASQVCLALASLSYEDET-----GIIGNYQQKNQRVIYDT 469
            +    YF +               CL L +   E E        +GNYQQ+   V+YD 
Sbjct: 416 -LPRRNYFYEFLDGGDGKGKKRKVGCLMLMNGGEEAELSGGPGATLGNYQQQGFEVVYDL 474

Query: 470 KNSQLGFAGEDCSSM 484
           +N ++GFA   C+S+
Sbjct: 475 ENKRVGFARRQCASL 489


>gi|300681506|emb|CBH32600.1| pepsin A, putative, expressed [Triticum aestivum]
          Length = 477

 Score =  127 bits (319), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 115/445 (25%), Positives = 181/445 (40%), Gaps = 54/445 (12%)

Query: 81  IVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIE 140
           + D     + R+     H +          S      +   +PLTSG       Y     
Sbjct: 43  LADLARSDRQRMAFIASHGRRRTRETAAGSSSASSAAAAFAMPLTSGAYTGIGQYFVRFR 102

Query: 141 LG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPV---------FDPSISPSYKKVLCN 189
           +G   +   ++ DTGSDLTWV+C+   S  +   P          F P  S ++  + C 
Sbjct: 103 VGTPAQPFLLVADTGSDLTWVKCRRPASANSSLSPADSGPGPGRAFRPEDSRTWAPISCA 162

Query: 190 SSTC-HALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG-------KAS 241
           S TC  +L F+      C +   P C Y   Y DGS  RG +G E   +        KA 
Sbjct: 163 SDTCTKSLPFSLAT---CPTPGSP-CAYDYRYKDGSAARGTVGTESATIALSGREERKAK 218

Query: 242 VNDFIFGCGRNNKGL-FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL-PSTQDAGASG 299
           +   + GC  +  G  F    G++ LG S +S  S  +  FGG FSYCL        A+ 
Sbjct: 219 LKGLVLGCSSSYTGPSFEASDGVLSLGYSGISFASHAASRFGGRFSYCLVDHLSPRNATS 278

Query: 300 SLILGGNSSVFKNSTP-------------ITYTNMIPNPQLATFYILNLTGISIGGKQLQ 346
            L  G N +V   S+P                T ++ + ++  FY ++L  IS+ G+ L+
Sbjct: 279 YLTFGPNPAV---SSPRASPSSCAAAAPRARQTPLLLDRRMRPFYDVSLKAISVAGEFLK 335

Query: 347 ASGF-----AKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLS 401
                    A GG+++DSGT +T L    Y A+ A   K  +G P        + C+N +
Sbjct: 336 IPRAVWDVEAGGGVILDSGTSLTVLAKPAYRAVVAALSKGLAGLPRV-TMDPFEYCYNWT 394

Query: 402 AYQ----EVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGN 457
           +      +V +P + + F G A +  +  G  Y + +     C+ L    +     +IGN
Sbjct: 395 SPSGKDADVAVPKMAVHFAGAARL--EPPGKSYVIDAAPGVKCIGLQEGPWPG-ISVIGN 451

Query: 458 YQQKNQRVIYDTKNSQLGFAGEDCS 482
             Q+     +D KN +L F    C+
Sbjct: 452 ILQQEHLWEFDIKNRRLKFQRSRCT 476


>gi|242092874|ref|XP_002436927.1| hypothetical protein SORBIDRAFT_10g011140 [Sorghum bicolor]
 gi|241915150|gb|EER88294.1| hypothetical protein SORBIDRAFT_10g011140 [Sorghum bicolor]
          Length = 484

 Score =  127 bits (319), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 107/349 (30%), Positives = 160/349 (45%), Gaps = 31/349 (8%)

Query: 144 RNMTVIVDTGSD-LTWVQCQPC---KSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFA 199
           +  TV  DT +   T +QC+PC   + C++     FDPS S S   V C S  C    F 
Sbjct: 156 QQFTVGFDTTTTGATQLQCKPCAADEPCHH----AFDPSASSSIAHVPCGSPDC---PFN 208

Query: 200 TGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS-VNDFIFGCGRNNKGLFG 258
            G    CS  S   C   VS  +          + L L   + V+DF F C         
Sbjct: 209 KG----CSGHS---CTLSVSINNTLLGNATFFTDKLTLTPWNIVDDFRFVCLEAGFRPDD 261

Query: 259 GVSGLMGLGRSDLSLVSQT--SEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPI 316
             +G++ L R+  SL S+   S      FSYCLPS       G L LG           +
Sbjct: 262 DSTGILDLSRNSHSLASRAAPSSPDAVAFSYCLPSYPSD--VGFLSLGATKPELLGRK-V 318

Query: 317 TYTNMIPNPQLATFYILNLTGISIGGKQLQA--SGFAKGGILIDSGTVITRLPPSIYSAL 374
           +YT +  N      Y++ L G+ +GG  L    +  A GG +++  T  T L P +Y+AL
Sbjct: 319 SYTPLRSNRHNGNLYVVELVGLGLGGVDLPVPRAAIAGGGTILELHTTFTYLKPKVYAAL 378

Query: 375 KAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKS 434
           + EF K  S +P AP    LDTC+N +A    ++P V ++F+G AE  + +  ++YF + 
Sbjct: 379 RDEFRKSMSQYPVAPPQGSLDTCYNFTALSSYSVPAVTLKFDGGAEFDLWIDEMMYFPEP 438

Query: 435 DA--SQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
            +  S  CLA  +   +D   +IG+  Q +  V+YD +  ++GF    C
Sbjct: 439 GSYFSVGCLAFVA---QDGGAVIGSMAQMSTEVVYDVRGGKVGFVPYRC 484


>gi|242059211|ref|XP_002458751.1| hypothetical protein SORBIDRAFT_03g039590 [Sorghum bicolor]
 gi|241930726|gb|EES03871.1| hypothetical protein SORBIDRAFT_03g039590 [Sorghum bicolor]
          Length = 444

 Score =  127 bits (319), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 120/373 (32%), Positives = 172/373 (46%), Gaps = 48/373 (12%)

Query: 144 RNMTVIVDTGSDLTWVQCQPCK------SCYNQQDPVFDPSISPSYKKVLCNSSTCHALE 197
           +N+T+++DTGS+L+W+ C   +               F P  S ++  V C S+ C + +
Sbjct: 74  QNVTMVLDTGSELSWLLCATGRQGSAAAGAAAAMGESFRPRASATFAAVPCGSTQCSSRD 133

Query: 198 FATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGC-----GRN 252
                S  C  +S   C+  +SY DGS + G L  +   +G+A      FGC       +
Sbjct: 134 LPAPPS--CDGASR-QCHVSLSYADGSASDGALATDVFAVGEAPPLRSAFGCMSTAYDSS 190

Query: 253 NKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFK- 311
             G+    +GL+G+ R  LS V+Q S      FSYC+    DAG    L+LG +   F  
Sbjct: 191 PDGV--ATAGLLGMNRGTLSFVTQAST---RRFSYCISDRDDAGV---LLLGHSDLPFLP 242

Query: 312 -NSTPITYTNMIPNPQL-ATFYILNLTGISIGGKQLQ--ASGFAK-----GGILIDSGTV 362
            N TP+ Y   +P P      Y + L GI +GGK L   AS  A      G  ++DSGT 
Sbjct: 243 LNYTPL-YQPTLPLPYFDRVAYSVQLLGIRVGGKALPIPASVLAPDHTGAGQTMVDSGTQ 301

Query: 363 ITRLPPSIYSALKAEFLKQFSGFPSA---PGFSI---LDTCFNLSAYQE---VNIPLVKM 413
            T L    YSALKAEFLKQ      A   P F+    LDTCF + A +      +P V +
Sbjct: 302 FTFLLGDAYSALKAEFLKQTKPLLRALDDPSFAFQEALDTCFRVPAGRPPPSARLPPVTL 361

Query: 414 EFEGNAEMTVDVTGIVYFVKSDASQV----CLALASLSYEDETG-IIGNYQQKNQRVIYD 468
            F G AEM+V    ++Y V  +        CL   +      T  +IG++ Q N  V YD
Sbjct: 362 LFNG-AEMSVAGDRLLYKVPGEHRGADGVWCLTFGNADMVPLTAYVIGHHHQMNLWVEYD 420

Query: 469 TKNSQLGFAGEDC 481
            +  ++G A   C
Sbjct: 421 LERGRVGLAPVKC 433


>gi|15234607|ref|NP_194733.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|4938480|emb|CAB43839.1| putative protein [Arabidopsis thaliana]
 gi|7269904|emb|CAB80997.1| putative protein [Arabidopsis thaliana]
 gi|67633776|gb|AAY78812.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332660311|gb|AEE85711.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 427

 Score =  127 bits (318), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 105/370 (28%), Positives = 171/370 (46%), Gaps = 55/370 (14%)

Query: 135 YIATIELGGRNMTVIV--DTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSS- 191
           ++  I +G   +T ++  DT SDL W+QC PC +CY Q  P+FDPS S +++   C +S 
Sbjct: 85  FLVNISIGSPPITQLLHMDTASDLLWIQCLPCINCYAQSLPIFDPSRSYTHRNETCRTSQ 144

Query: 192 -TCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG-------KASVN 243
            +  +L+F               C Y + Y D + ++G L RE L           A+++
Sbjct: 145 YSMPSLKFNANTRS---------CEYSMRYVDDTGSKGILAREMLLFNTIYDESSSAALH 195

Query: 244 DFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLIL 303
           D +FGCG +N G     +G++GLG  + SLV +    FG  FSYC  S  D     ++++
Sbjct: 196 DVVFGCGHDNYGEPLVGTGILGLGYGEFSLVHR----FGKKFSYCFGSLDDPSYPHNVLV 251

Query: 304 GGN--SSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL--QASGFAK------G 353
            G+  +++  ++TP+   N         FY + +  IS+ G  L      F +      G
Sbjct: 252 LGDDGANILGDTTPLEIHN--------GFYYVTIEAISVDGIILPIDPRVFNRNHQTGLG 303

Query: 354 GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDT----CFNLSAYQ---EV 406
           G +ID+G  +T L    Y  LK      F G  +A   S  D     C+N +  +   E 
Sbjct: 304 GTIIDTGNSLTSLVEEAYKPLKNRIEDIFEGRFTAADVSQDDMIKMECYNGNFERDLVES 363

Query: 407 NIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVI 466
             P+V   F   AE+++DV  +  F+K   +  CLA+           IG   Q++  + 
Sbjct: 364 GFPIVTFHFSEGAELSLDVKSL--FMKLSPNVFCLAVTP----GNLNSIGATAQQSYNIG 417

Query: 467 YDTKNSQLGF 476
           YD +  ++ F
Sbjct: 418 YDLEAMEVSF 427


>gi|294463081|gb|ADE77078.1| unknown [Picea sitchensis]
          Length = 370

 Score =  127 bits (318), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 116/380 (30%), Positives = 176/380 (46%), Gaps = 56/380 (14%)

Query: 150 VDTGSDLTWVQCQPCKSCYN-----QQDPVFDPSISPSYKKVLCNSSTCHAL-------- 196
           +DTGSDL WV C    SC N       + VF P +S S   V C  S C  L        
Sbjct: 1   MDTGSDLVWVPCTRNYSCINCPEDSASNGVFLPRMSSSLHLVTCADSNCKTLYGNNTELL 60

Query: 197 -EFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL------GKASVNDFIFGC 249
            +   G+   CS + PP   Y + YG GS T G L  E L L      G  ++  F  GC
Sbjct: 61  CQSCAGSLKNCSETCPP---YGIQYGRGS-TAGLLLTETLNLPLENGEGARAITHFAVGC 116

Query: 250 GRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFG-GLFSYCLPSTQ-DAGASGSLILGGNS 307
              +       SG+ G GR  LS+ SQ  E  G   F+YCL S + D     SL++ G+ 
Sbjct: 117 SIVSSQQ---PSGIAGFGRGALSMPSQLGEHIGKDRFAYCLQSHRFDEENKKSLMVLGDK 173

Query: 308 SVFKNSTPITYTNMI------PNPQLATFYILNLTGISIGGKQLQA--------SGFAKG 353
           ++  N+ P+ YT  +      P+ Q   +Y + L G+SIGGK+L+              G
Sbjct: 174 AL-PNNIPLNYTPFLTNSRAPPSSQYGVYYYIGLRGVSIGGKRLKQLPSKLLRFDTKGNG 232

Query: 354 GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSA---PGFSILDTCFNLSAYQEVNIPL 410
           G +IDSGT  T     I+  + A F  Q  G+  A      + +  C++++  + + +P 
Sbjct: 233 GTIIDSGTTFTVFSDEIFKHIAAGFASQI-GYRRAGEVEDKTGMGLCYDVTGLENIVLPE 291

Query: 411 VKMEFEGNAEMTVDVTGIV-YFVKSDASQVCLALASLS--YEDETG---IIGNYQQKNQR 464
               F+G ++M + V     YF   D+  +CL + S     E ++G   I+GN QQ++  
Sbjct: 292 FAFHFKGGSDMVLPVANYFSYFSSFDS--ICLTMISSRGLLEVDSGPAVILGNDQQQDFY 349

Query: 465 VIYDTKNSQLGFAGEDCSSM 484
           ++YD + ++LGF  + C + 
Sbjct: 350 LLYDREKNRLGFTQQTCKTF 369


>gi|224101053|ref|XP_002334311.1| predicted protein [Populus trichocarpa]
 gi|222871031|gb|EEF08162.1| predicted protein [Populus trichocarpa]
          Length = 496

 Score =  127 bits (318), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 127/435 (29%), Positives = 186/435 (42%), Gaps = 73/435 (16%)

Query: 113 NIKDVSNTEIPLTSGIRLQTLNYIATIELGGRNMTVIVDTGSDLTWVQCQP--CKSCYNQ 170
           N  +     +PL+ G      +Y  +  L  + + + +DTGSDL W  CQP  C  C  +
Sbjct: 65  NTHNHRQVSLPLSPGS-----DYTLSFTLDSQPIFLYLDTGSDLVWFPCQPFECILCEGK 119

Query: 171 QD-----PVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPP-------DCN--- 215
            +         P +S +   V C SS C A      +S +C+ S+ P       DC    
Sbjct: 120 AENTSLASTPPPKLSKTATPVSCKSSACSAAHSNLPSSDLCAISNCPLESIETSDCQKHS 179

Query: 216 ---YFVSYGDGSYTRGELGREHLGLGKAS-----VNDFIFGCGRNNKGLFGGVSGLMGLG 267
              ++ +YGDGS     L R+ + L  ++     VN+F FGC            G+ G G
Sbjct: 180 CPQFYYAYGDGSLI-ARLYRDSISLPLSNPTNLIVNNFTFGCAHT---ALAEPIGVAGFG 235

Query: 268 RSDLSLVSQTSEI---FGGLFSYCLPS----TQDAGASGSLILGGNSSVFK-------NS 313
           R  LSL +Q + +    G  FSYCL S    +        LILG      K       N 
Sbjct: 236 RGVLSLPAQLATLSPQLGNQFSYCLVSHSFDSDRLRRPSPLILGRYDHDEKERRVNGVNK 295

Query: 314 TPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAK-------GGILIDSGTVITRL 366
               YT+M+ N +   FY + L GISIG K++ A GF +       GG+++DSGT  T L
Sbjct: 296 PRFVYTSMLDNLEHPYFYCVGLEGISIGRKKIPAPGFLRKVDGEGSGGLVVDSGTTFTML 355

Query: 367 PPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVK---MEFEGNAEMTV 423
           P S+Y ++ AEF  +             DT  +   Y + N+  V    + F GN    V
Sbjct: 356 PASLYGSVVAEFENRVGRVNERARVIEEDTGLSPCYYFDNNVVNVPSVVLHFVGNGSSVV 415

Query: 424 DVTGIVYFVK---------SDASQVCLALASLSYEDET-----GIIGNYQQKNQRVIYDT 469
            +    YF +               CL L +   E E        +GNYQQ+   V+YD 
Sbjct: 416 -LPRRNYFYEFLDGGDGKGKKRKVGCLMLMNGGDEAELSGGPGATLGNYQQQGFEVVYDL 474

Query: 470 KNSQLGFAGEDCSSM 484
           +N ++GFA   C+S+
Sbjct: 475 ENKRVGFARRQCASL 489


>gi|222631382|gb|EEE63514.1| hypothetical protein OsJ_18330 [Oryza sativa Japonica Group]
          Length = 464

 Score =  127 bits (318), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 116/394 (29%), Positives = 166/394 (42%), Gaps = 64/394 (16%)

Query: 149 IVDTGSDLTWVQCQPCK----------SCYNQQDPVFDPSISPSYKKVLCNSSTCHALEF 198
           +VDTGSDL W QC  C+           C+ Q  P ++ S+S + + V C+         
Sbjct: 77  VVDTGSDLVWTQCSTCRLPAVAAAGGGGCFPQNLPYYNFSLSRTARAVPCDDDDGALCGV 136

Query: 199 ATGNSGVCSSSSPPD--CNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGC---GRNN 253
           A   +G        D  C    SYG G    G LG +      +S     FGC    R +
Sbjct: 137 APETAGCARGGGSGDDACVVAASYGAG-VALGVLGTDAFTFPSSSSVTLAFGCVSQTRIS 195

Query: 254 KGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL-PSTQDAGASGSLILGGNSSVFKN 312
            G   G SG++GLGR  LSLVSQ +      FSYCL P  +D  +   L +G        
Sbjct: 196 PGALNGASGIIGLGRGALSLVSQLNAT---EFSYCLTPYFRDTVSPSHLFVGDGELAGLR 252

Query: 313 ST---------PITYTNMIPNPQ---LATFYILNLTGISIGGK--QLQASGFA------- 351
           +          P+T      NP+    +TFY L L G++ G     L A  F        
Sbjct: 253 AAAGGGGGGGAPVTTVPFAKNPKDSPFSTFYYLPLVGLAAGNATVALPAGAFDLREAAPK 312

Query: 352 --KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGF-----PSAPGFSILDTCFNL---- 400
              GG LIDSG+  TRL    + AL  E  +Q  G      P A     L+ C       
Sbjct: 313 VWAGGALIDSGSPFTRLVDPAHRALTKELARQLRGSGSLVPPPAKLGGALELCVEAGDDG 372

Query: 401 SAYQEVNIPLVKMEFE----GNAEMTVDVTGIVYFVKSDASQVCLALASLSY------ED 450
            +     +P + + F+    G  E+ +      Y+ + +AS  C+A+ S +        +
Sbjct: 373 DSLAAAAVPPLVLRFDDGVGGGRELVIPAEK--YWARVEASTWCMAVVSSASGNATLPTN 430

Query: 451 ETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
           ET IIGN+ Q++ RV+YD  N  L F   +CS++
Sbjct: 431 ETTIIGNFMQQDMRVLYDLANGLLSFQPANCSAV 464


>gi|413921849|gb|AFW61781.1| hypothetical protein ZEAMMB73_702843 [Zea mays]
          Length = 379

 Score =  127 bits (318), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 103/307 (33%), Positives = 135/307 (43%), Gaps = 33/307 (10%)

Query: 123 PLTSGIRLQTLN---YIATIELGGRNM--TVIVDTGSDLTWVQCQPCKSCYNQQDPVFDP 177
           P+T+   L T +   Y+  + +G   +  T I+DTGSDL W QC PC  C +Q  P FD 
Sbjct: 74  PITAARVLVTASSGEYLVDLAIGTPPLYYTAIMDTGSDLIWTQCAPCLLCADQPTPYFDV 133

Query: 178 SISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL 237
             S +Y+ + C SS C +L     +S  C       C Y   YGD + T G L  E    
Sbjct: 134 KKSATYRALPCRSSRCASL-----SSPSCFKKM---CVYQYYYGDTASTAGVLANETFTF 185

Query: 238 G-----KASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPST 292
           G     K    +  FGCG  N G     SG++G GR  LSLVSQ        FSYCL S 
Sbjct: 186 GAANSTKVRATNIAFGCGSLNAGDLANSSGMVGFGRGPLSLVSQLGP---SRFSYCLTSY 242

Query: 293 QDAGASGSLILGGNSSVFKNST----PITYTNMIPNPQLATFYILNLTGISIGGKQLQAS 348
             A  S  L  G  +++   +T    P+  T  + NP L   Y L+L  IS+G K L   
Sbjct: 243 LSATPS-RLYFGVYANLSSTNTSSGSPVQSTPFVINPALPNMYFLSLKAISLGTKLLPID 301

Query: 349 GFA-------KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLS 401
                      GG++IDSGT IT L    Y A++   +              LDTCF   
Sbjct: 302 PLVFAINDDGTGGVIIDSGTSITWLQQDAYEAVRRGLVSAIPLTAMNDTDIGLDTCFQWP 361

Query: 402 AYQEVNI 408
               V +
Sbjct: 362 PPPNVTV 368


>gi|255563741|ref|XP_002522872.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223537956|gb|EEF39570.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 448

 Score =  126 bits (317), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 113/375 (30%), Positives = 165/375 (44%), Gaps = 50/375 (13%)

Query: 135 YIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
           Y+  + +G  G  + ++ DTGS L W QC+PC   + Q  P+F+ + S +Y+ + C    
Sbjct: 91  YLVKVIIGSPGVPLYLVPDTGSGLFWTQCEPCTRRFRQLPPIFNSTASRTYRDLPCQH-- 148

Query: 193 CHALEFATGNSGV--CSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCG 250
               +F T N  V  C       C Y ++Y  GS T G   ++ L   +     F FGC 
Sbjct: 149 ----QFCTNNQNVFQCRDDK---CVYRIAYAGGSATAGVAAQDILQSAENDRIPFYFGCS 201

Query: 251 RNNKGL-----FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLP--STQDAGASGSLIL 303
           R+N+        G   G++GL  S +SL+ Q + I    FSYCL          + SL+ 
Sbjct: 202 RDNQNFSTFESSGKGGGIIGLNMSPVSLLQQMNHITKNRFSYCLNLFDLSSPSHATSLLR 261

Query: 304 GGN----SSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ--ASGFA-----K 352
            GN    S     STP      +PN      Y LNL  +S+ G ++Q     FA      
Sbjct: 262 FGNDIRKSRRKYLSTPFVSPRGMPN------YFLNLIDVSVAGNRMQIPPGTFALKPDGT 315

Query: 353 GGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILD------TCFNLSAYQEV 406
           GG +IDSGT +T +  + Y  +   F   F       GF  ++       C+    +   
Sbjct: 316 GGTIIDSGTAVTYISQTAYFPVITAFKNYFDQH----GFQRVNIQLSGYICYKQQGHTFH 371

Query: 407 NIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVI 466
           N P +   F+G A+  V+    VY    D    C+AL  +S +  T IIG   Q N + I
Sbjct: 372 NYPSMAFHFQG-ADFFVE-PEYVYLTVQDRGAFCVALQPISPQQRT-IIGALNQANTQFI 428

Query: 467 YDTKNSQLGFAGEDC 481
           YD  N QL F  E+C
Sbjct: 429 YDAANRQLLFTPENC 443


>gi|356563324|ref|XP_003549914.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 480

 Score =  126 bits (317), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 118/417 (28%), Positives = 183/417 (43%), Gaps = 65/417 (15%)

Query: 122 IPLTSGIRLQTLNYIATIELGGRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISP 181
           +PL+ G    TL++    +   + +T+ +DTGSDL W  C P K    +  P  +P+ SP
Sbjct: 62  LPLSPGSDY-TLSFNLGPQAQAQPITLYMDTGSDLVWFPCAPFKCILCEGKPN-EPNASP 119

Query: 182 SYK-----KVLCNSSTCHALEFATGNSGVCSSSSPP-------DCN------YFVSYGDG 223
                    V C S  C A       S +C+++  P       DC       ++ +YGDG
Sbjct: 120 PTNITQSVAVSCKSPACSAAHNLAPPSDLCAAARCPLESIETSDCANFKCPPFYYAYGDG 179

Query: 224 SYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEI--- 280
           S     L R+ L L    + +F FGC           +G+ G GR  LSL +Q + +   
Sbjct: 180 SLI-ARLYRDTLSLSSLFLRNFTFGCAHTT---LAEPTGVAGFGRGLLSLPAQLATLSPQ 235

Query: 281 FGGLFSYCLPS----TQDAGASGSLILGGNSSVFKNS-----TPITYTNMIPNPQLATFY 331
            G  FSYCL S    ++       LILG      K           YT+M+ NP+   FY
Sbjct: 236 LGNRFSYCLVSHSFDSERVRKPSPLILGRYEEKEKEKIGGGVAEFVYTSMLENPKHPYFY 295

Query: 332 ILNLTGISIGGKQLQASGFAK-------GGILIDSGTVITRLPPSIYSALKAEFLKQFSG 384
            ++L GI++G + + A    +       GG+++DSGT  T LP   Y+++  EF ++  G
Sbjct: 296 TVSLIGIAVGKRTIPAPEMLRRVNNRGDGGVVVDSGTTFTMLPAGFYNSVVDEFDRRV-G 354

Query: 385 FPSAPGFSI-----LDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVK-SDASQ 438
             +     I     L  C+ L++  +V  P + + F G    +V +    YF + SD S 
Sbjct: 355 RDNKRARKIEEKTGLAPCYYLNSVADV--PALTLRFAGGKNSSVVLPRKNYFYEFSDGSD 412

Query: 439 --------VCLALASLSYEDET-----GIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
                    CL L +   E +        +GNYQQ+   V YD +  ++GFA   C+
Sbjct: 413 GAKGKRKVGCLMLMNGGDEADLSGGPGATLGNYQQQGFEVEYDLEEKRVGFARRQCA 469


>gi|449463971|ref|XP_004149703.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 641

 Score =  126 bits (317), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 106/357 (29%), Positives = 161/357 (45%), Gaps = 38/357 (10%)

Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNS 203
           +   +IVDTGS +T+V C  C+ C   QDP FDP  S +YK + CN      ++    + 
Sbjct: 94  QQFALIVDTGSTVTYVPCSTCEQCGRHQDPKFDPESSSTYKPIKCN------IDCICDSD 147

Query: 204 GVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS---VNDFIFGCGRNNKG-LFG- 258
           GV        C Y   Y + S + G LG + +  G  S       +FGC     G LF  
Sbjct: 148 GV-------QCVYERQYAEMSTSSGVLGEDVISFGNQSELIPQRAVFGCENMETGDLFSQ 200

Query: 259 GVSGLMGLGRSDLSLVSQTSE--IFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPI 316
              G+MGLG  DLSLV Q  E       FS C       G  G+++LGG S    +    
Sbjct: 201 RADGIMGLGTGDLSLVDQLVEKGAINDSFSLCYGGMDIGG--GAMVLGGISP--PSDMIF 256

Query: 317 TYTNMIPNPQLATFYILNLTGISIGGKQLQ-ASGFAKG--GILIDSGTVITRLPPSIYSA 373
           TY++ + +P    +Y ++L  I + GK+L  +SG   G  G ++DSGT    LP   +SA
Sbjct: 257 TYSDPVRSP----YYNVDLKEIHVAGKKLPLSSGIFDGRYGAVLDSGTTYAYLPAEAFSA 312

Query: 374 LKAEFLKQFSGFP--SAPGFSILDTCFNLSAYQEVNI----PLVKMEFEGNAEMTVDVTG 427
            K   + +         P  +  D CF+ +      +    P V M FE   ++++    
Sbjct: 313 FKDAIMDEIHSLKKIDGPDPNFKDICFSGAGSDAAELSNKFPTVDMVFENGQKLSLTPEN 372

Query: 428 IVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
             +         CL +   +  D+T ++G    +N  V+YD  NS++GF   +CS +
Sbjct: 373 YFFRHSKVHGAYCLGIFE-NGNDQTTLLGGIVVRNTLVMYDRANSKIGFWKTNCSEL 428


>gi|449508297|ref|XP_004163275.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 641

 Score =  126 bits (317), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 106/357 (29%), Positives = 161/357 (45%), Gaps = 38/357 (10%)

Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNS 203
           +   +IVDTGS +T+V C  C+ C   QDP FDP  S +YK + CN      ++    + 
Sbjct: 94  QQFALIVDTGSTVTYVPCSTCEQCGRHQDPKFDPESSSTYKPIKCN------IDCICDSD 147

Query: 204 GVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS---VNDFIFGCGRNNKG-LFG- 258
           GV        C Y   Y + S + G LG + +  G  S       +FGC     G LF  
Sbjct: 148 GV-------QCVYERQYAEMSTSSGVLGEDVISFGNQSELIPQRAVFGCENMETGDLFSQ 200

Query: 259 GVSGLMGLGRSDLSLVSQTSE--IFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPI 316
              G+MGLG  DLSLV Q  E       FS C       G  G+++LGG S    +    
Sbjct: 201 RADGIMGLGTGDLSLVDQLVEKGAINDSFSLCYGGMDIGG--GAMVLGGISP--PSDMIF 256

Query: 317 TYTNMIPNPQLATFYILNLTGISIGGKQLQ-ASGFAKG--GILIDSGTVITRLPPSIYSA 373
           TY++ + +P    +Y ++L  I + GK+L  +SG   G  G ++DSGT    LP   +SA
Sbjct: 257 TYSDPVRSP----YYNVDLKEIHVAGKKLPLSSGIFDGRYGAVLDSGTTYAYLPAEAFSA 312

Query: 374 LKAEFLKQFSGFP--SAPGFSILDTCFNLSAYQEVNI----PLVKMEFEGNAEMTVDVTG 427
            K   + +         P  +  D CF+ +      +    P V M FE   ++++    
Sbjct: 313 FKDAIMDEIHSLKKIDGPDPNFKDICFSGAGSDAAELSNKFPTVDMVFENGQKLSLTPEN 372

Query: 428 IVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
             +         CL +   +  D+T ++G    +N  V+YD  NS++GF   +CS +
Sbjct: 373 YFFRHSKVHGAYCLGIFE-NGNDQTTLLGGIVVRNTLVMYDRANSKIGFWKTNCSEL 428


>gi|297736090|emb|CBI24128.3| unnamed protein product [Vitis vinifera]
          Length = 378

 Score =  126 bits (316), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 107/364 (29%), Positives = 165/364 (45%), Gaps = 33/364 (9%)

Query: 144 RNMTVIVDTGSDLTWVQCQ-PCKS--CYNQQ------DPVFDPSISPSYKKVLCNSSTCH 194
           +   ++ DTGSDLTW+ C+  C+S  C N++        VF  ++S S+K + C +  C 
Sbjct: 23  QKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHKRVFHANLSSSFKTIPCLTDMCK 82

Query: 195 ALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL-----GKASVNDFIFGC 249
                  +   C +   P C Y   Y DGS   G    E + +      K  +++ + GC
Sbjct: 83  IELMDLFSLTNCPTPLTP-CGYDYRYSDGSTALGFFANETVTVELKEGRKMKLHNVLIGC 141

Query: 250 GRNNKGL-FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL-PSTQDAGASGSLILGGNS 307
             + +G  F    G+MGLG S  S   + +E FGG FSYCL         S  L  G + 
Sbjct: 142 SESFQGQSFQAADGVMGLGYSKYSFAIKAAEKFGGKFSYCLVDHLSHKNVSNYLTFGSSR 201

Query: 308 SVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA-----KGGILIDSGTV 362
           S       +TYT ++    + +FY +N+ GISIGG  L+           GG ++DSG+ 
Sbjct: 202 SKEALLNNMTYTELVLG-MVNSFYAVNMMGISIGGAMLKIPSEVWDVKGAGGTILDSGSS 260

Query: 363 ITRLPPSIY----SALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGN 418
           +T L    Y    +AL+   LK F       G   L+ CFN + ++E  +P +   F   
Sbjct: 261 LTFLTEPAYQPVMAALRVSLLK-FRKVEMDIG--PLEYCFNSTGFEESLVPRLVFHFADG 317

Query: 419 AEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAG 478
           AE    V    Y + +     CL   S+++   T ++GN  Q+N    +D    +LGFA 
Sbjct: 318 AEFEPPVKS--YVISAADGVRCLGFVSVAWPG-TSVVGNIMQQNHLWEFDLGLKKLGFAP 374

Query: 479 EDCS 482
             C+
Sbjct: 375 SSCT 378


>gi|356499109|ref|XP_003518386.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 428

 Score =  126 bits (316), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 118/378 (31%), Positives = 180/378 (47%), Gaps = 47/378 (12%)

Query: 132 TLNYIATIELGGRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSS 191
           TL    T+    +N+T+++DTGS+L+W+ C+   +     +  F+P +S SY    CNSS
Sbjct: 59  TLTVSLTVGSPPQNVTMVLDTGSELSWLHCKKLPNL----NSTFNPLLSSSYTPTPCNSS 114

Query: 192 TCHALEFATGNSGVCSSSSPPD--CNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGC 249
            C      T +  + +S  P +  C+  VSY D S   G L  E   L  A+    +FGC
Sbjct: 115 ICTT---RTRDLTIPASCDPNNKLCHVIVSYADASSAEGTLAAETFSLAGAAQPGTLFGC 171

Query: 250 GRNNKGLFGGV------SGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLIL 303
             ++ G    +      +GLMG+ R  LSLV+Q S      FSYC+ S +D  A G L+L
Sbjct: 172 -MDSAGYTSDINEDSKTTGLMGMNRGSLSLVTQMSL---PKFSYCI-SGED--ALGVLLL 224

Query: 304 GGNSSVFKNSTPITYTNMIPNPQLATF-----YILNLTGISIGGK--QLQASGFA----- 351
           G  +      +P+ YT ++     + +     Y + L GI +  K  QL  S F      
Sbjct: 225 GDGTDA---PSPLQYTPLVTATTSSPYFNRVAYTVQLEGIKVSEKLLQLPKSVFVPDHTG 281

Query: 352 KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSA---PGFSI---LDTCFNLSAYQE 405
            G  ++DSGT  T L  S+YS+LK EFL+Q  G  +    P F     +D C++  A   
Sbjct: 282 AGQTMVDSGTQFTFLLGSVYSSLKDEFLEQTKGVLTRIEDPNFVFEGAMDLCYHAPA-SF 340

Query: 406 VNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV-CLALASLSYED-ETGIIGNYQQKNQ 463
             +P V + F G AEM V    ++Y V   +  V C    +      E  +IG++ Q+N 
Sbjct: 341 AAVPAVTLVFSG-AEMRVSGERLLYRVSKGSDWVYCFTFGNSDLLGIEAYVIGHHHQQNV 399

Query: 464 RVIYDTKNSQLGFAGEDC 481
            + +D   S++GF    C
Sbjct: 400 WMEFDLLKSRVGFTQTTC 417


>gi|224074147|ref|XP_002304273.1| predicted protein [Populus trichocarpa]
 gi|222841705|gb|EEE79252.1| predicted protein [Populus trichocarpa]
          Length = 496

 Score =  126 bits (316), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 120/426 (28%), Positives = 179/426 (42%), Gaps = 73/426 (17%)

Query: 122 IPLTSGIRLQTLNYIATIELGGRNMTVIVDTGSDLTWVQCQP--CKSCYNQQD-----PV 174
           +PL+ G      +Y  +  +  + +++ +DTGSDL W  CQP  C  C  + +       
Sbjct: 74  LPLSPGS-----DYTLSFTINSQPISLYLDTGSDLVWFPCQPFECILCEGKAENASLAST 128

Query: 175 FDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPP-------DCN------YFVSYG 221
             P +S +   V C SS C A+     +S +C+ S+ P       DC       ++ +YG
Sbjct: 129 PPPKLSKTATPVSCKSSACSAVHSNLPSSDLCAISNCPLESIEISDCRKHSCPQFYYAYG 188

Query: 222 DGSYTRGELGREHLGLGKAS-----VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQ 276
           DGS     L R+ + L  ++      N+F FGC            G+ G GR  LSL +Q
Sbjct: 189 DGSLI-ARLYRDSIRLPLSNQTNLIFNNFTFGCAHTT---LAEPIGVAGFGRGVLSLPAQ 244

Query: 277 TSEI---FGGLFSYCLPS----TQDAGASGSLILGGNSSVFKN-------STPITYTNMI 322
            + +    G  FSYCL S    +        LILG      K             YT+M+
Sbjct: 245 LATLSPQLGNQFSYCLVSHSFDSDRVRRPSPLILGRYDHDEKERRVNGVKKPSFVYTSML 304

Query: 323 PNPQLATFYILNLTGISIGGKQLQASGFAK-------GGILIDSGTVITRLPPSIYSALK 375
            NP+   FY + L GISIG K++ A  F +       GG+++DSGT  T LP S+Y  + 
Sbjct: 305 DNPRHPYFYCVGLEGISIGRKKIPAPDFLRKVDRKGSGGVVVDSGTTFTMLPASLYDFVV 364

Query: 376 AEFLKQFSGFPSAPGF----SILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYF 431
           AEF  +              + L  C+            V + F GN    V      ++
Sbjct: 365 AEFENRVGRVNERASVIEENTGLSPCYYFDNNVVNVP-RVVLHFVGNGSSVVLPRRNYFY 423

Query: 432 VKSDASQV--------CLALASLSYEDET-----GIIGNYQQKNQRVIYDTKNSQLGFAG 478
              D            CL L +   E E        +GNYQQ+   V+YD +N ++GFA 
Sbjct: 424 EFLDGGHGKGKKRKVGCLMLMNGGDEAELSGGPGATLGNYQQQGFEVVYDLENRRVGFAR 483

Query: 479 EDCSSM 484
             C+S+
Sbjct: 484 RQCASL 489


>gi|356513737|ref|XP_003525567.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial [Glycine
           max]
          Length = 455

 Score =  126 bits (316), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 121/419 (28%), Positives = 184/419 (43%), Gaps = 68/419 (16%)

Query: 122 IPLTSGIRLQTLNYIATIELGGRN----MTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDP 177
           +PL+ G      +Y  +  LG R     +T+ +DTGSDL W  C P K    +  P   P
Sbjct: 40  LPLSPGS-----DYTLSFNLGPRAQAQPITLYMDTGSDLVWFPCAPFKCILCEGKPNASP 94

Query: 178 SISPSYK-KVLCNSSTCHALEFATGNSGVCSSSSPP-------DCN------YFVSYGDG 223
            ++ +    V C S  C A       S +C+++  P       DC       ++ +YGDG
Sbjct: 95  PVNTTRSVAVSCKSPACSAAHNLASPSDLCAAARCPLESIETSDCANFKCPPFYYAYGDG 154

Query: 224 SYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEI--- 280
           S     L R+ L L    + +F FGC           +G+ G GR  LSL +Q + +   
Sbjct: 155 SLI-ARLYRDTLSLSSLFLRNFTFGCAYTT---LAEPTGVAGFGRGLLSLPAQLATLSPQ 210

Query: 281 FGGLFSYCLPS----TQDAGASGSLILG------GNSSVFKNSTPITYTNMIPNPQLATF 330
            G  FSYCL S    ++       LILG          V        YT M+ NP+   F
Sbjct: 211 LGNRFSYCLVSHSFDSERVRKPSPLILGRYEEEEEEEKVGGGVAEFVYTPMLENPKHPYF 270

Query: 331 YILNLTGISIGGKQLQASGFAK-------GGILIDSGTVITRLPPSIYSALKAEFLKQFS 383
           Y + L GIS+G + + A    +       GG+++DSGT  T LP   Y+++  EF +   
Sbjct: 271 YTVGLIGISVGKRIVPAPEMLRRVNNRGDGGVVVDSGTTFTMLPAGFYNSVVDEFDRGV- 329

Query: 384 GFPSAPGFSI-----LDTCFNLSAYQEVNIPLVKMEFEG-NAEMTVDVTGIVY-FVK-SD 435
           G  +     I     L  C+ L++  EV  P++ + F G N+ + +      Y F+   D
Sbjct: 330 GRVNERARKIEEKTGLAPCYYLNSVAEV--PVLTLRFAGGNSSVVLPRKNYFYEFLDGRD 387

Query: 436 ASQV-----CLALASLSYEDET-----GIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
           A++      CL L +   E E        +GNYQQ+   V YD +  ++GFA   C+S+
Sbjct: 388 AAKGKRRVGCLMLMNGGDEAELSGGPGATLGNYQQQGFEVEYDLEEKRVGFARRQCASL 446


>gi|147771308|emb|CAN69536.1| hypothetical protein VITISV_043237 [Vitis vinifera]
          Length = 372

 Score =  126 bits (316), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 120/403 (29%), Positives = 184/403 (45%), Gaps = 50/403 (12%)

Query: 95  DNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRL-QTLNYIATIELG--GRNMTVIVD 151
           D   +Q+L S +             + +P+ SG ++ Q   YI   ++G   + M + +D
Sbjct: 5   DKARLQFLSSLVAR----------KSVVPIASGRQIVQNPTYIVRAKIGTPAQTMLMAMD 54

Query: 152 TGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSP 211
           T SD+ W+   PC  C      +F+   S +YK + C ++ C  +   T   GVCS    
Sbjct: 55  TSSDVAWI---PCNGCLGCSSTLFNSPASTTYKSLGCQAAQCKQVPKPTCGGGVCS---- 107

Query: 212 PDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDL 271
               + ++YG GS     L ++ + L   +V  + FGC +   G      GL+GLGR  L
Sbjct: 108 ----FNLTYG-GSSLAANLSQDTITLATDAVPGYSFGCIQKATGGSLPAQGLLGLGRGPL 162

Query: 272 SLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFY 331
           SL+SQT  ++   FSYCLPS +    SGSL LG      +    I YT ++ NP+  + Y
Sbjct: 163 SLLSQTQNLYQSTFSYCLPSFKSLNFSGSLRLGPVGQPKR----IKYTPLLKNPRRPSLY 218

Query: 332 ILNLTGISI---------GGKQLQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQF 382
            +NL  + +         G      S  A  G + DSGTV TRL    Y A++  F  + 
Sbjct: 219 FVNLMAVRVGRRVVDVPPGSFTFNPSTGA--GTIFDSGTVFTRLVTPAYIAVRDAFRNRV 276

Query: 383 SGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDA-SQVCL 441
               +       DTC+ +     +  P +   F G   M V +      + S A S  CL
Sbjct: 277 GRNLTVTSLGGFDTCYTV----PIAAPTITFMFTG---MNVTLPPDNLLIHSTAGSTTCL 329

Query: 442 ALASL--SYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
           A+A+   +      +I N QQ+N R++YD  NS+LG A E C+
Sbjct: 330 AMAAAPDNVNSVLNVIANLQQQNHRLLYDVPNSRLGVARELCT 372


>gi|255585473|ref|XP_002533429.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223526717|gb|EEF28949.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 448

 Score =  126 bits (316), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 104/350 (29%), Positives = 158/350 (45%), Gaps = 36/350 (10%)

Query: 148 VIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCS 207
            I+DTGS++ WV+C PCK C  Q  P+ DPS S +Y  + C ++ CH        S  C+
Sbjct: 114 AIMDTGSNILWVRCAPCKRCTQQNGPLLDPSKSSTYASLPCTNTMCH-----YAPSAYCN 168

Query: 208 SSSPPDCNYFVSYGDGSYTRGELGREHLGL-----GKASVNDFIFGCGRNNKGLFG-GVS 261
             +   C Y +SY  G  + G L  E L       G  +V   +FGC   N        +
Sbjct: 169 RLN--QCGYNLSYATGLSSAGVLATEQLIFHSSDEGVNAVPSVVFGCSHENGDYKDRRFT 226

Query: 262 GLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKN-STPITYTN 320
           G+ GLG+   S V++     G  FSYCL +  D     + ++ G  + F+  STP+   N
Sbjct: 227 GVFGLGKGITSFVTR----MGSKFSYCLGNIADPHYGYNQLVFGEKANFEGYSTPLKVVN 282

Query: 321 MIPNPQLATFYILNLTGISIGGKQL--QASGFAKGG----ILIDSGTVITRLPPSIYSAL 374
                     Y + L GIS+G K+L   ++ F+  G     LIDSGT +T L  S + AL
Sbjct: 283 --------GHYYVTLEGISVGEKRLDIDSTAFSMKGNEKSALIDSGTALTWLAESAFRAL 334

Query: 375 KAEFLKQFSGFPSAPGFSILDTCFNLSAYQE-VNIPLVKMEFEGNAEMTVDVTGIVYFVK 433
             E  +   G    P +     C+  +  Q+ +  P+V   F G A++ +D   + Y   
Sbjct: 335 DNEVRQLLDGV-LMPFWRGSFACYKGTVSQDLIGFPVVTFHFSGGADLDLDTESMFYQAT 393

Query: 434 SDASQVCLALASLSYED--ETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
            D   + +  AS    D     +IG   Q+   + YD  +++L F   DC
Sbjct: 394 PDILCIAVRQASAYGNDFKSFSVIGLMAQQYYNMAYDLNSNKLFFQRIDC 443


>gi|255581508|ref|XP_002531560.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223528821|gb|EEF30826.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 407

 Score =  125 bits (315), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 119/403 (29%), Positives = 189/403 (46%), Gaps = 48/403 (11%)

Query: 110 ISGNIKDVSNTEIPLTSGIRLQTLNYIATIELGGRNMTVIVDTGSDLTWVQCQPCKSCYN 169
           I  N    S  ++P    I   +L    T+    +N+++++DTGS+L+W+ C    +  +
Sbjct: 11  IPSNSFPRSPNKLPFRHNI---SLTVSLTVGTPPQNVSMVIDTGSELSWLYCNKTTTTTS 67

Query: 170 QQDPVFDPSISPSYKKVLCNSSTC--HALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTR 227
                F+ + S SY+ + C+SSTC     +F+   S  C S+S   C+  +SY D S + 
Sbjct: 68  YPT-TFNQTRSISYRPIPCSSSTCTNQTRDFSIPAS--CDSNS--LCHATLSYADASSSE 122

Query: 228 GELGREHLGLGKASVNDFIFGCG----RNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGG 283
           G L  +   +G + +   +FGC      +N       +GLMG+ R  LS VSQ       
Sbjct: 123 GNLASDTFHMGASDIPGMVFGCMDSVFSSNSDEDSKNTGLMGMNRGSLSFVSQMG---FP 179

Query: 284 LFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMI----PNPQLATF-YILNLTGI 338
            FSYC+  T     SG L+LG   S F  + P+ YT ++    P P      Y + L GI
Sbjct: 180 KFSYCISGTD---FSGMLLLG--ESNFTWAVPLNYTPLVQISTPLPYFDRIAYTVQLEGI 234

Query: 339 SIGGKQL-------QASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSA--- 388
            +  + L       +      G  ++DSGT  T L    Y+AL++EFL Q +GF      
Sbjct: 235 KVSDRLLPIPKSVFEPDHTGAGQTMVDSGTQFTFLLGPAYTALRSEFLNQTTGFLRVLED 294

Query: 389 PGFSI---LDTCFNLSAYQEV--NIPLVKMEFEGNAEMTVDVTGIVYFV----KSDASQV 439
           P F     +D C+ +   Q V   +P V + F G AEMTV    ++Y V    + + S  
Sbjct: 295 PDFVFQGAMDLCYRVPISQRVLPRLPTVSLVFNG-AEMTVADERVLYRVPGEIRGNDSVH 353

Query: 440 CLALASLSYED-ETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
           CL+  +      E  +IG++ Q+N  + +D + S++G A   C
Sbjct: 354 CLSFGNSDLLGVEAYVIGHHHQQNVWMEFDLERSRIGLAQVRC 396


>gi|255537017|ref|XP_002509575.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223549474|gb|EEF50962.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 459

 Score =  125 bits (315), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 119/445 (26%), Positives = 197/445 (44%), Gaps = 55/445 (12%)

Query: 67  ITLELKHKNYCSGKIVDWNEQQQNRL--ILDNL-----HVQYLQSRIKNMISGNIKDVSN 119
           +T +L H++       + N+  ++R   +L N      +VQ +  R   ++  +  D S 
Sbjct: 35  VTTKLIHRDSIFSPAYNPNDSIKDRAKRMLKNSNARFDYVQAISKRNSAVVDYDGGDTSA 94

Query: 120 TEIPLTSGIRLQTLNYIATIELGGRNMT--VIVDTGSDLTWVQCQPCKSCYNQQDPVFDP 177
            +    + +  +   ++    +G   +    ++DTGS LTW+QC+PC +C+ Q+ P+++P
Sbjct: 95  ADDAYEASLLSELCTFLVNFSIGQPPVPQYAVMDTGSSLTWIQCEPCINCHQQKGPLYNP 154

Query: 178 SISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL 237
           S S         S+     +F   ++   ++    DCNY  +Y D + TRG   RE L  
Sbjct: 155 SSS---------STYVSCSDFDRTDTTFTATHG-SDCNYSQTYADKTTTRGTYAREQLLF 204

Query: 238 -----GKASVNDFIFGCGRNNK---GLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL 289
                G   ++D IFGCG NN    G  G  SG+ GLG S  S++S+     G  FSYC+
Sbjct: 205 ETPDDGITIMHDVIFGCGHNNTQLPGPTGYASGVFGLGDSGSSIISK----LGFGFSYCI 260

Query: 290 PSTQDA-GASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQAS 348
            +  D       L LG    +   STP     ++P       Y + L GISIG ++L   
Sbjct: 261 GNIGDPLYGFHRLTLGNKLKIEGYSTP-----LVPR----GLYYITLVGISIGQERLDID 311

Query: 349 GFA---------KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGF--SILDTC 397
                          I+IDSG  ++ +P   Y+ ++ +     SGF S   +    L  C
Sbjct: 312 PIVFQRVDLNGISSRIVIDSGATLSYIPRQAYNVVRDKVSSILSGFLSRYRYIARHLSLC 371

Query: 398 FNLSAYQEVN-IPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIG 456
           +     Q++   P         A++   V G+  F +   + +CLAL     ++ET +IG
Sbjct: 372 YIGKLNQDLQGFPDATFHLADGADLVFQVEGL--FFQYTDNVLCLALVPTESDEETCLIG 429

Query: 457 NYQQKNQRVIYDTKNSQLGFAGEDC 481
              Q+   V YD K  +L F   +C
Sbjct: 430 LLAQQYYNVAYDLKQQKLYFQRIEC 454


>gi|357482719|ref|XP_003611646.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355512981|gb|AES94604.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 640

 Score =  125 bits (315), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 104/361 (28%), Positives = 168/361 (46%), Gaps = 46/361 (12%)

Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNS 203
           +   +IVDTGS +T+V C  C+ C   QDP F P +S +Y+ V C +  C+      G++
Sbjct: 100 QRFALIVDTGSTVTYVPCSTCEHCGRHQDPKFQPDLSETYQPVKC-TPDCN----CDGDT 154

Query: 204 GVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS---VNDFIFGCGRNNKG-LFG- 258
                     C Y   Y + S + G LG + +  G  S       +FGC  +  G L+  
Sbjct: 155 N--------QCMYDRQYAEMSSSSGVLGEDVVSFGNLSELAPQRAVFGCENDETGDLYSQ 206

Query: 259 GVSGLMGLGRSDLSLVSQ--TSEIFGGLFSYCLPSTQDAGASGSLILGGNSS----VFKN 312
              G+MGLGR DLS++ Q    ++    FS C     D G  G++ILGG S     VF +
Sbjct: 207 RADGIMGLGRGDLSIMDQLVDKKVISDSFSLCY-GGMDVGG-GAMILGGISPPEDMVFTH 264

Query: 313 STPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA---KGGILIDSGTVITRLPPS 369
           S          +P  + +Y +NL  + + GK+LQ +      K G ++DSGT    LP +
Sbjct: 265 S----------DPDRSPYYNINLKEMHVAGKKLQLNPKVFDGKHGTVLDSGTTYAYLPET 314

Query: 370 IYSALKAEFLKQFSGFP--SAPGFSILDTCFNLSAYQ----EVNIPLVKMEFEGNAEMTV 423
            + A K   +K+ +     + P  +  D CF  +         + P+V M FE   ++++
Sbjct: 315 AFLAFKRAIMKERNSLKQINGPDPNYKDICFTGAGIDVSQLAKSFPVVDMVFENGHKLSL 374

Query: 424 DVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSS 483
                ++         CL + S +  D T ++G    +N  V+YD +NS++GF   +CS 
Sbjct: 375 SPENYLFRHSKVRGAYCLGVFS-NGRDPTTLLGGIFVRNTLVMYDRENSKIGFWKTNCSE 433

Query: 484 M 484
           +
Sbjct: 434 L 434


>gi|115441003|ref|NP_001044781.1| Os01g0844500 [Oryza sativa Japonica Group]
 gi|19571042|dbj|BAB86469.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
           Group]
 gi|20160609|dbj|BAB89555.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
           Group]
 gi|113534312|dbj|BAF06695.1| Os01g0844500 [Oryza sativa Japonica Group]
 gi|125572614|gb|EAZ14129.1| hypothetical protein OsJ_04051 [Oryza sativa Japonica Group]
          Length = 442

 Score =  125 bits (313), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 117/368 (31%), Positives = 170/368 (46%), Gaps = 43/368 (11%)

Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPV--FDPSISPSYKKVLCNSSTCHALEFATG 201
           +N+T+++DTGS+L+W+ C P             F P  S ++  V C+S+ C + +  + 
Sbjct: 77  QNVTMVLDTGSELSWLLCAPGGGGGGGGRSALSFRPRASLTFASVPCDSAQCRSRDLPSP 136

Query: 202 NSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGC-----GRNNKGL 256
            +  C  +S   C   +SY DGS + G L  E   +G+       FGC       +  G+
Sbjct: 137 PA--CDGAS-KQCRVSLSYADGSSSDGALATEVFTVGQGPPLRAAFGCMATAFDTSPDGV 193

Query: 257 FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVF--KNST 314
               +GL+G+ R  LS VSQ S      FSYC+    DAG    L+LG +   F   N T
Sbjct: 194 --ATAGLLGMNRGALSFVSQAST---RRFSYCISDRDDAGV---LLLGHSDLPFLPLNYT 245

Query: 315 PITYTNMIPNPQL-ATFYILNLTGISIGGKQL--QASGFAK-----GGILIDSGTVITRL 366
           P+ Y   +P P      Y + L GI +GGK L   AS  A      G  ++DSGT  T L
Sbjct: 246 PL-YQPAMPLPYFDRVAYSVQLLGIRVGGKPLPIPASVLAPDHTGAGQTMVDSGTQFTFL 304

Query: 367 PPSIYSALKAEFLKQFSGFPSA---PGFSI---LDTCFNLSAYQ--EVNIPLVKMEFEGN 418
               YSALKAEF +Q   +  A   P F+     DTCF +   +     +P V + F G 
Sbjct: 305 LGDAYSALKAEFSRQTKPWLPALNDPNFAFQEAFDTCFRVPQGRAPPARLPAVTLLFNG- 363

Query: 419 AEMTVDVTGIVYFVKSDAS----QVCLALASLSYEDETG-IIGNYQQKNQRVIYDTKNSQ 473
           A+MTV    ++Y V  +        CL   +      T  +IG++ Q N  V YD +  +
Sbjct: 364 AQMTVAGDRLLYKVPGERRGGDGVWCLTFGNADMVPITAYVIGHHHQMNVWVEYDLERGR 423

Query: 474 LGFAGEDC 481
           +G A   C
Sbjct: 424 VGLAPIRC 431


>gi|297741705|emb|CBI32837.3| unnamed protein product [Vitis vinifera]
          Length = 455

 Score =  125 bits (313), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 110/370 (29%), Positives = 177/370 (47%), Gaps = 45/370 (12%)

Query: 135 YIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
           ++A + +G    N+ V++DTGSDL W+QC+PC  CY Q+DP+++ + S SY ++LCN   
Sbjct: 106 FLANLSIGNPPTNVYVVLDTGSDLFWIQCEPCDVCYKQKDPIYNRTKSDSYTEMLCNEPP 165

Query: 193 CHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVND-----FIF 247
           C +L    G  G CS S    C Y  SY DGS T G L  E +       ++       F
Sbjct: 166 CLSL----GREGQCSDSG--SCLYQTSYADGSRTSGLLSYEKVAFTSHYSDEDKTAQVGF 219

Query: 248 GCGRNNKGLFGGVSG--LMGLGRSDLSLVSQTSEI--FGGLFSYCLPSTQDAGASGSLIL 303
           GCG  N           ++GLG   +SLVSQ S I      F+YC  +  +  A G L+ 
Sbjct: 220 GCGLQNLNFVTSSRDGGVLGLGPGLVSLVSQLSAIGKVSKSFAYCFGNLSNPNAGGFLVF 279

Query: 304 GGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQ----LQASGFAK-----GG 354
           G  + +  + TP+          +A FY +NL GI +G ++    + +S F +     GG
Sbjct: 280 GDATYLNGDMTPMV---------IAEFYYVNLLGIGLGVEEPRLDINSSSFERKPDGSGG 330

Query: 355 ILIDSGTVITRLPPSIYSALKAEFLKQF-SGFPSAPGFSILDTCFNLSAYQEVNI-PLVK 412
           ++IDSG+ ++  PP +Y  ++   + +   G+  +P  S  D CF     +++ + P + 
Sbjct: 331 VIIDSGSTLSIFPPEVYEVVRNAVVDKLKKGYNISPLTSSPD-CFEGKIGRDLPLFPTLV 389

Query: 413 MEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNS 472
           +  E    +  D   I  F++      CL   S    +   IIG   Q++ +  Y+ + S
Sbjct: 390 LYLESTGILN-DRWSI--FLQRYDELFCLGFTS---GEGLSIIGTLAQQSYKFGYNLELS 443

Query: 473 QLGF-AGEDC 481
            L   +  DC
Sbjct: 444 TLSIESNPDC 453


>gi|297806153|ref|XP_002870960.1| hypothetical protein ARALYDRAFT_908082 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297316797|gb|EFH47219.1| hypothetical protein ARALYDRAFT_908082 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 453

 Score =  125 bits (313), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 115/374 (30%), Positives = 174/374 (46%), Gaps = 51/374 (13%)

Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPV--FDPSISPSYKKVLCNSSTCHALEFATG 201
           +N+++++DTGS+L+W++C       +  +PV  FDP+ S SY  + C+S TC        
Sbjct: 84  QNISMVIDTGSELSWLRCNRS----SNPNPVNNFDPTRSSSYSPIPCSSPTCRTRTRDFL 139

Query: 202 NSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVND--FIFGCGRNNKG---- 255
               C S     C+  +SY D S + G L  E    G  S ND   IFGC  +  G    
Sbjct: 140 IPASCDSDK--LCHATLSYADASSSEGNLAAEIFHFGN-STNDSNLIFGCMGSVSGSDPE 196

Query: 256 LFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTP 315
                +GL+G+ R  LS +SQ        FSYC+  T D    G L+LG   S F   TP
Sbjct: 197 EDTKTTGLLGMNRGSLSFISQMGF---PKFSYCISGTDD--FPGFLLLG--DSNFTWLTP 249

Query: 316 ITYTNMI----PNPQLATF-YILNLTGISIGGKQLQ-------ASGFAKGGILIDSGTVI 363
           + YT +I    P P      Y + LTGI + GK L              G  ++DSGT  
Sbjct: 250 LNYTPLIRISTPLPYFDRVAYTVQLTGIKVNGKLLPIPKSVLLPDHTGAGQTMVDSGTQF 309

Query: 364 TRLPPSIYSALKAEFLKQFSGFPSA---PGFSI---LDTCFNLSAYQEV-----NIPLVK 412
           T L   +Y+AL+++FL Q +G  +    P F     +D C+ +S ++        +P V 
Sbjct: 310 TFLLGPVYTALRSDFLNQTNGILTVYEDPEFVFQGTMDLCYRISPFRIRTGILHRLPTVS 369

Query: 413 MEFEGNAEMTVDVTGIVYFVKS----DASQVCLALASLSYED-ETGIIGNYQQKNQRVIY 467
           + FEG AE+ V    ++Y V      + S  C    +      E  +IG++ Q+N  + +
Sbjct: 370 LVFEG-AEIAVSGQPLLYRVPHLTAGNDSVYCFTFGNSDLMGMEAYVIGHHHQQNMWIEF 428

Query: 468 DTKNSQLGFAGEDC 481
           D + S++G A   C
Sbjct: 429 DLQRSRIGLAPVQC 442


>gi|296090179|emb|CBI39998.3| unnamed protein product [Vitis vinifera]
          Length = 334

 Score =  124 bits (312), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 106/361 (29%), Positives = 152/361 (42%), Gaps = 66/361 (18%)

Query: 135 YIATIELGGRNMTV--IVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
           Y+  I +G     V  I DTGSDL W QC PC SCY Q++P+FDPS S S+K+V C S  
Sbjct: 24  YLMKISIGTPPFDVYGIYDTGSDLMWTQCLPCLSCYKQKNPMFDPSKSTSFKEVSCESQQ 83

Query: 193 CHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRN 252
           C  L+  T                                        S+ + +FGCG N
Sbjct: 84  CRLLDTPT----------------------------------------SILNIVFGCGHN 103

Query: 253 NKGLFG-GVSGLMGLGRSDLSLVSQTSEIFGG--LFSYCL-PSTQDAGASGSLILGGNSS 308
           N G F     GL G G   LSL SQ     G    FS CL P   D   +  +I G  + 
Sbjct: 104 NSGTFNENEMGLFGTGGRPLSLTSQIMSTLGSGRKFSQCLVPFRTDPSITSKIIFGPEAE 163

Query: 309 VFKN---STPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA----KGGILIDSGT 361
           V  +   STP+   +   +P   T+Y + L GIS+G K    S  +    KG + ID+GT
Sbjct: 164 VSGSDVVSTPLVTKD---DP---TYYFVTLDGISVGDKLFPFSSSSPMATKGNVFIDAGT 217

Query: 362 VITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEM 421
             T LP   Y+ L     +     P          C+  +    ++ P++   F+G    
Sbjct: 218 PPTLLPRDFYNRLVQGVKEAIPMEPVQDPDLQPQLCYRSATL--IDGPILTAHFDG---A 272

Query: 422 TVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
            V +  +  F+       C A+  +  + +TGI GN+ Q N  + +D    ++ F   DC
Sbjct: 273 DVQLKPLNTFISPKEGVYCFAMQPI--DGDTGIFGNFVQMNFLIGFDLDGKKVSFKAVDC 330

Query: 482 S 482
           +
Sbjct: 331 T 331


>gi|356556809|ref|XP_003546713.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-1-like [Glycine max]
          Length = 444

 Score =  124 bits (312), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 123/418 (29%), Positives = 188/418 (44%), Gaps = 48/418 (11%)

Query: 80  KIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRL-QTLNYIAT 138
           K + W E        D   +QYL     N+++        + +P+ SG ++ Q+  YI  
Sbjct: 60  KPMSWEESVLQLQAKDQARMQYL----SNLVA------RRSIVPIASGRQITQSPTYIVR 109

Query: 139 IELGGRNMTVI--VDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHAL 196
            + G    T++  +DT +D  WV C  C  C       F P  S ++KKV C +S C  +
Sbjct: 110 AKFGTPAQTLLLAMDTSNDAAWVPCTACVGCSTTTP--FAPPKSTTFKKVGCGASQCKQV 167

Query: 197 EFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGL 256
              T     C  S+   C +  +YG  S     L ++ + L    V  + FGC +   G 
Sbjct: 168 RNPT-----CDGSA---CAFNFTYGTSSVA-ASLVQDTVTLATDPVPAYTFGCIQKATGS 218

Query: 257 FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPI 316
                GL+GLGR  LSL++QT +++   FSYCLPS +    SG   L   +       P 
Sbjct: 219 SLPPQGLLGLGRGPLSLLAQTQKLYQSTFSYCLPSFKTLNFSGHXDLXPVAQPRDQVYP- 277

Query: 317 TYTNMIPNPQLATFYILNLTGISIGGKQL----QASGFAK---GGILIDSGTVITRLPPS 369
                  NP+ ++ Y +NL  I +G + +    +A  F      G + DSGTV TRL   
Sbjct: 278 ----SFKNPRRSSLYYVNLVAIRVGRRIVDIPPEALAFNPXTGAGTVFDSGTVFTRLVEP 333

Query: 370 IYSALKAEFLKQFSGFPSAPGFSI--LDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTG 427
            Y+A++ EF ++ S        S+   DTC+ +     +  P +   F G   M V +  
Sbjct: 334 AYTAVRNEFRRRVSVHKKLTVTSLGGFDTCYTV----PIVAPTITFMFSG---MNVTLPP 386

Query: 428 IVYFVKSDASQV-CLALASL--SYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
               + S A  V CLA+A    +      +I N QQ+N RV++D  NS+LG A E C+
Sbjct: 387 DNILIHSTAGSVTCLAMAPAPDNVNSVLNVIANMQQQNHRVLFDVPNSRLGVARELCT 444


>gi|413944378|gb|AFW77027.1| hypothetical protein ZEAMMB73_570500 [Zea mays]
          Length = 484

 Score =  124 bits (312), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 120/428 (28%), Positives = 188/428 (43%), Gaps = 51/428 (11%)

Query: 85  NEQQQNRLILDNLHVQYLQSRIKNMISGNIKD----------VSNTEIPLTSGIRLQTL- 133
           N+Q + R + D LH   L  R+++++     +               IP + G  ++ L 
Sbjct: 77  NQQPERRSVADVLHRDAL--RLRSLLHREEDNHRTPAPAAPPGGGVSIP-SRGEPIEELP 133

Query: 134 -----NYIATIELGGRNMTVIVDTGSD-LTWVQCQPCKSCYNQQDPVFDPSISPSYKKVL 187
                + +A      + + V  DT +   T +QC PC S     D  FDPS S S  +V 
Sbjct: 134 GAFEYHVVAGFGTPMQKLPVGFDTTTTGATLLQCTPCGS---GADHAFDPSASSSVSQVP 190

Query: 188 CNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGD---GSYTRGELGREHLGLGKASVND 244
           C S  C    F  G SG       P C   VS+ +   G+ T             A+V+ 
Sbjct: 191 CGSPDC---PF-HGCSGR------PSCTLSVSFNNTLLGNATFFTDTLTLTPSSSATVDK 240

Query: 245 FIFGC--GRNNKGLFGGVSGLMGLGRSDLSLVSQ---TSEIFGGLFSYCLP-STQDAGAS 298
           F F C  G        G +G++ L R+  SL S+   +S      FSYCLP ST D G  
Sbjct: 241 FRFACLEGIAPGPAEDGSAGILDLSRNSHSLPSRLVASSPPHAVAFSYCLPASTADVGF- 299

Query: 299 GSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGG--IL 356
             L LG           ++YT +  +P     Y+++L G+ +GG  L     A  G   +
Sbjct: 300 --LSLGATKPELLGRK-VSYTPLRGSPSNGNLYVVDLVGLGLGGPDLPIPPAAIAGDDTI 356

Query: 357 IDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFE 416
           ++  T  T L P +Y  L+  F K  S +P+AP    LDTC+N +     ++P V ++F 
Sbjct: 357 LELHTTFTYLKPQVYKVLRDSFRKSMSEYPAAPPLGSLDTCYNFTGLDAFSVPAVTLKFA 416

Query: 417 GNAEMTVDVTGIVYFVKSDA--SQVCLALASLSYEDETG-IIGNYQQKNQRVIYDTKNSQ 473
           G A++ + +  ++YF   D   S  CLA  +   + + G +IG+  Q +  V+YD +  +
Sbjct: 417 GGADVDLWMDEMMYFTDPDNHFSIGCLAFVAQDDDCDGGTVIGSMAQMSTEVVYDVRGGK 476

Query: 474 LGFAGEDC 481
           +GF    C
Sbjct: 477 VGFVPYRC 484


>gi|293334661|ref|NP_001168795.1| uncharacterized protein LOC100382594 precursor [Zea mays]
 gi|223973065|gb|ACN30720.1| unknown [Zea mays]
          Length = 631

 Score =  124 bits (312), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 116/387 (29%), Positives = 166/387 (42%), Gaps = 62/387 (16%)

Query: 124 LTSGIRLQTLNYIATIELGGRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSY 183
           LT+G     L YI T     +   +IVD+GS +T+V C  C+ C N QDP F P +S SY
Sbjct: 83  LTNGYYTTRL-YIGTPP---QEFALIVDSGSTVTYVPCSSCEQCGNHQDPRFQPDLSSSY 138

Query: 184 KKVLCN-SSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS- 241
             V CN   TC               S    C Y   Y + S + G LG + +  G+ S 
Sbjct: 139 SPVKCNVDCTC--------------DSDKKQCTYERQYAEMSSSSGVLGEDIVSFGRESE 184

Query: 242 --VNDFIFGCGRNNKG-LFG-GVSGLMGLGRSDLSLVSQTSE--IFGGLFSYCLPSTQDA 295
                 IFGC  +  G LF     G+MGLGR  LS++ Q  E  +    FS C       
Sbjct: 185 LKPQHAIFGCENSETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDIG 244

Query: 296 GASGSLILGG----NSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGF- 350
           G  G+++LGG       +F NS P+           + +Y + L  I + GK L+     
Sbjct: 245 G--GAMVLGGMLAPPDMIFSNSDPLR----------SPYYNIELKEIHVAGKALRVESRI 292

Query: 351 --AKGGILIDSGTVITRLPPSIYSALKAEF------LKQFSGFPSAPGFSILDTCF---- 398
             +K G ++DSGT    LP   + A K         LK+  G    P  S  D CF    
Sbjct: 293 FNSKHGTVLDSGTTYAYLPEQAFVAFKEAVTSKVHSLKKIRG----PDPSYKDICFAGAG 348

Query: 399 -NLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGN 457
            N+S   EV  P V M F    ++++     ++         CL +   + +D T ++G 
Sbjct: 349 RNVSKLHEV-FPDVDMVFGNGQKLSLTPENYLFRHSKVDGAYCLGVFQ-NGKDPTTLLGG 406

Query: 458 YQQKNQRVIYDTKNSQLGFAGEDCSSM 484
              +N  V YD  N ++GF   +CS +
Sbjct: 407 IIVRNTLVTYDRHNEKIGFWKTNCSEL 433


>gi|125575542|gb|EAZ16826.1| hypothetical protein OsJ_32298 [Oryza sativa Japonica Group]
          Length = 396

 Score =  124 bits (312), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 98/360 (27%), Positives = 162/360 (45%), Gaps = 25/360 (6%)

Query: 138 TIELGGRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALE 197
           TI    +  + I+D   +L W QC  C  C+ Q  P+F P+ S +++   C +  C +  
Sbjct: 48  TIGTPPQPASAIIDVAGELVWTQCSRCSRCFKQDLPLFIPNASSTFRPEPCGTDACKSTP 107

Query: 198 FATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGC-GRNNKGL 256
            +  +  VC+  S  +        D   T G +G E   +G A+ +   FGC   ++   
Sbjct: 108 TSNCSGDVCTYESTTNIRL-----DRHTTLGIVGTETFAIGTATAS-LAFGCVVASDIDT 161

Query: 257 FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVF--KNST 314
             G SG +GLGR+  SLV+Q        FSYCL S +  G S  L LG ++ +   ++++
Sbjct: 162 MDGTSGFIGLGRTPRSLVAQMKLT---KFSYCL-SPRGTGKSSRLFLGSSAKLAGGESTS 217

Query: 315 PITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGGILI-DSGTVITRLPPSIYSA 373
              +    P+     +Y+L+L  I  G   +  +    GGIL+  + +  + L  S Y A
Sbjct: 218 TAPFIKTSPDDDSHHYYLLSLDAIRAGNTTIATA--QSGGILVMHTVSPFSLLVDSAYRA 275

Query: 374 LKAEFLKQFSGF---PSAPGFSILDTCFNLSA-YQEVNIPLVKMEFEGNAEMTVDVTGIV 429
            K    +   G    P A      D CF  +A +     P +   F+G A +TV     +
Sbjct: 276 FKKAVTEAVGGAAEQPMATPPQPFDLCFKKAAGFSRATAPDLVFTFQGAAALTVPPAKYL 335

Query: 430 YFVKSDASQVCLALASLSYEDETG-----IIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
             V  +    C A+ S+++ + TG     ++G+ QQ++   +YD K   L F   DCSS+
Sbjct: 336 IDVGEEKDTACAAILSMAWLNRTGLEGVSVLGSLQQEDVHFLYDLKKETLSFEPADCSSL 395


>gi|21668075|gb|AAM74221.1|AF518565_1 putative chloroplast nucleoid DNA-binding protein [Brassica
           oleracea]
          Length = 165

 Score =  124 bits (312), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 67/156 (42%), Positives = 95/156 (60%), Gaps = 4/156 (2%)

Query: 329 TFYILNLTGISIGGKQLQ--ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFP 386
           +FY L++ GIS+GG++L    + F+  G LIDSGTVI+RLPP  Y+AL+  F  + S + 
Sbjct: 12  SFYGLDIVGISVGGQKLAIPQTVFSTPGALIDSGTVISRLPPKAYAALRGAFKAKMSQYK 71

Query: 387 SAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASL 446
           +    SILDTCF+L+ ++ V IP V   F G A + +   G++Y  K   SQVCLA A  
Sbjct: 72  NTSAVSILDTCFDLTGFKTVTIPTVSFYFNGGAVVELGSKGVLYAFK--MSQVCLAFAGN 129

Query: 447 SYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
           S ++   I GN QQ+   V+YD    ++GFA   CS
Sbjct: 130 SDDNNAAIFGNVQQQTLEVVYDGAAGRVGFAPNGCS 165


>gi|15217887|ref|NP_176703.1| aspartic proteinase-like protein 2 [Arabidopsis thaliana]
 gi|118572746|sp|Q9S9K4.2|ASPL2_ARATH RecName: Full=Aspartic proteinase-like protein 2; Flags: Precursor
 gi|332196226|gb|AEE34347.1| aspartic proteinase-like protein 2 [Arabidopsis thaliana]
          Length = 475

 Score =  124 bits (311), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 104/396 (26%), Positives = 182/396 (45%), Gaps = 51/396 (12%)

Query: 117 VSNTEIPLTSGIRLQTLN-YIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSC-----Y 168
           +++ ++PL    R+ ++  Y   I+LG   +   V VDTGSD+ W+ C+PC  C      
Sbjct: 55  LASIDLPLGGDSRVDSVGLYFTKIKLGSPPKEYHVQVDTGSDILWINCKPCPKCPTKTNL 114

Query: 169 NQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRG 228
           N +  +FD + S + KKV C+   C  +      S  C  +    C+Y + Y D S + G
Sbjct: 115 NFRLSLFDMNASSTSKKVGCDDDFCSFIS----QSDSCQPAL--GCSYHIVYADESTSDG 168

Query: 229 ELGREHLGLGKASVN--------DFIFGCGRNNKGLFG----GVSGLMGLGRSDLSLVSQ 276
           +  R+ L L + + +        + +FGCG +  G  G     V G+MG G+S+ S++SQ
Sbjct: 169 KFIRDMLTLEQVTGDLKTGPLGQEVVFGCGSDQSGQLGNGDSAVDGVMGFGQSNTSVLSQ 228

Query: 277 TSEIFGG--LFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILN 334
            +       +FS+CL + +  G     ++        +S  +  T M+PN      Y + 
Sbjct: 229 LAATGDAKRVFSHCLDNVKGGGIFAVGVV--------DSPKVKTTPMVPN---QMHYNVM 277

Query: 335 LTGISIGGKQLQ--ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFS 392
           L G+ + G  L    S    GG ++DSGT +   P  +Y +L    L +           
Sbjct: 278 LMGMDVDGTSLDLPRSIVRNGGTIVDSGTTLAYFPKVLYDSLIETILAR-----QPVKLH 332

Query: 393 ILDT---CFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYE 449
           I++    CF+ S   +   P V  EFE + ++TV     ++ ++ +          L+ +
Sbjct: 333 IVEETFQCFSFSTNVDEAFPPVSFEFEDSVKLTVYPHDYLFTLEEELYCFGWQAGGLTTD 392

Query: 450 DETGII--GNYQQKNQRVIYDTKNSQLGFAGEDCSS 483
           + + +I  G+    N+ V+YD  N  +G+A  +CSS
Sbjct: 393 ERSEVILLGDLVLSNKLVVYDLDNEVIGWADHNCSS 428


>gi|242095592|ref|XP_002438286.1| hypothetical protein SORBIDRAFT_10g011130 [Sorghum bicolor]
 gi|241916509|gb|EER89653.1| hypothetical protein SORBIDRAFT_10g011130 [Sorghum bicolor]
          Length = 495

 Score =  124 bits (311), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 94/350 (26%), Positives = 155/350 (44%), Gaps = 37/350 (10%)

Query: 154 SDLTWVQCQPCKS------CYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCS 207
           S ++ ++C+PC S           D  FDPS+S S++ VLC S  C     + G S    
Sbjct: 158 SGMSNMRCKPCFSGSSGGETTTTCDVAFDPSMSSSFRSVLCGSPDCGGHSCSAGGS---- 213

Query: 208 SSSPPDCNYFVSYGDGSYTRGELGREHLGLGK-ASVNDFIFGCGRNNKGLFGGVSGLMGL 266
                 C + +      +  G +  + L L   A+  +F  GC + +  LF   +  + +
Sbjct: 214 ------CTFTLQNSTFVFGNGTIVMDTLTLSPSATFENFAVGCMQLDNDLF---TDGVAV 264

Query: 267 GRSDLSL--------VSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITY 318
           G  DLSL        V  +S      FSYCLP+  D    G L +    S + +   + Y
Sbjct: 265 GNIDLSLSRHSLATRVLNSSPPGMAAFSYCLPA--DTDTHGFLTIAPALSDYSDHAGVKY 322

Query: 319 TNMIPNPQLATFYILNLTGISIGGKQLQ--ASGFAKGGILIDSGTVITRLPPSIYSALKA 376
             ++ NP    FY ++L  I+I G+ L    + F   G +IDS +  T L P IY+AL+ 
Sbjct: 323 VPLVTNPTGPNFYYVDLVAIAINGEDLPIPPALFTGNGTMIDSQSAFTYLNPPIYAALRD 382

Query: 377 EFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDA 436
           EF K    +   P F  LDTC+N +  + + +P + + F     M +D    +YF +   
Sbjct: 383 EFRKAMLQYQPVPAFGGLDTCYNFTLAENIYLPDITLRFSNGETMDLDDRQFMYFFREHL 442

Query: 437 SQ----VCLALASLSYED-ETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
           +      CLA A+   ++     +G+  Q+ + ++YD +   + F    C
Sbjct: 443 TDGFPFGCLAFAAAPDQNFPWNYLGSQVQRTKEIVYDVRGGMVAFVPSRC 492


>gi|242091327|ref|XP_002441496.1| hypothetical protein SORBIDRAFT_09g028060 [Sorghum bicolor]
 gi|241946781|gb|EES19926.1| hypothetical protein SORBIDRAFT_09g028060 [Sorghum bicolor]
          Length = 466

 Score =  124 bits (311), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 106/422 (25%), Positives = 180/422 (42%), Gaps = 47/422 (11%)

Query: 95  DNLHVQ-YLQSRIKNMISGNIKD---VSNTEIPLTSGIRLQTLNYIATIELG--GRNMTV 148
           D+LH   Y++S++ +   G        S   +PL+SG    T  Y     +G   +   +
Sbjct: 57  DDLHRHAYIRSQLASSRRGRRAAEVGASAFAMPLSSGAYTGTGQYFVRFRVGTPAQPFVL 116

Query: 149 IVDTGSDLTWVQCQPCKSCYNQQDP----VFDPSISPSYKKVLCNSSTCHA-LEFATGNS 203
           + DTGSDLTWV+C+   +           VF  + S S+  + C+S TC + + F+  N 
Sbjct: 117 VADTGSDLTWVKCRGAGAAAGTGAGSPARVFRTAASKSWAPIACSSDTCTSYVPFSLAN- 175

Query: 204 GVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG----------------KASVNDFIF 247
             CSS + P C Y   Y DGS  RG +G +   +                 +A +   + 
Sbjct: 176 --CSSPASP-CAYDYRYRDGSAARGVVGTDSATIALSSGSGRGGGDSSGGRRAKLQGVVL 232

Query: 248 GCGRNNKGL-FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL-PSTQDAGASGSLILGG 305
           GC     G  F    G++ LG S++S  S+ +  FGG FSYCL        A+  L  G 
Sbjct: 233 GCAATYDGQSFQSSDGVLSLGNSNISFASRAAARFGGRFSYCLVDHLAPRNATSYLTFGP 292

Query: 306 NSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA-----KGGILIDSG 360
            ++      P   T ++ + ++  FY + +  + + G+ L            GG ++DSG
Sbjct: 293 GATA-----PAAQTPLLLDRRMTPFYAVTVDAVYVAGEALDIPADVWDVDRNGGAILDSG 347

Query: 361 TVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAE 420
           T +T L    Y A+     K  +G P        + C+N +    + IP +++ F G+A 
Sbjct: 348 TSLTILATPAYRAVVTALSKHLAGLPRV-TMDPFEYCYNWTDAGALEIPKMEVHFAGSAR 406

Query: 421 MTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGED 480
           +  +     Y + +     C+ +   S+     +IGN  Q+     +D ++  L F    
Sbjct: 407 L--EPPAKSYVIDAAPGVKCIGVQEGSWPG-VSVIGNILQQEHLWEFDLRDRWLRFKHTR 463

Query: 481 CS 482
           C+
Sbjct: 464 CA 465


>gi|125528357|gb|EAY76471.1| hypothetical protein OsI_04407 [Oryza sativa Indica Group]
          Length = 441

 Score =  124 bits (311), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 117/368 (31%), Positives = 169/368 (45%), Gaps = 43/368 (11%)

Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPV--FDPSISPSYKKVLCNSSTCHALEFATG 201
           +N+T+++DTGS+L+W+ C P             F P  S ++  V C S+ C + +  + 
Sbjct: 76  QNVTMVLDTGSELSWLLCAPGGGGGGGGRSALSFRPRASLTFASVPCGSAQCRSRDLPSP 135

Query: 202 NSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGC-----GRNNKGL 256
            +  C  +S   C   +SY DGS + G L  E   +G+       FGC       +  G+
Sbjct: 136 PA--CDGAS-KQCRVSLSYADGSSSDGALATEVFTVGQGPPLRAAFGCMATAFDTSPDGV 192

Query: 257 FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVF--KNST 314
               +GL+G+ R  LS VSQ S      FSYC+    DAG    L+LG +   F   N T
Sbjct: 193 --ATAGLLGMNRGALSFVSQAST---RRFSYCISDRDDAGV---LLLGHSDLPFLPLNYT 244

Query: 315 PITYTNMIPNPQL-ATFYILNLTGISIGGKQL--QASGFAK-----GGILIDSGTVITRL 366
           P+ Y   +P P      Y + L GI +GGK L   AS  A      G  ++DSGT  T L
Sbjct: 245 PL-YQPAMPLPYFDRVAYSVQLLGIRVGGKPLPIPASVLAPDHTGAGQTMVDSGTQFTFL 303

Query: 367 PPSIYSALKAEFLKQFSGFPSA---PGFSI---LDTCFNLSAYQ--EVNIPLVKMEFEGN 418
               YSALKAEF +Q   +  A   P F+     DTCF +   +     +P V + F G 
Sbjct: 304 LGDAYSALKAEFSRQTKPWLPALNDPNFAFQEAFDTCFRVPQGRAPPARLPAVTLLFNG- 362

Query: 419 AEMTVDVTGIVYFVKSDAS----QVCLALASLSYEDETG-IIGNYQQKNQRVIYDTKNSQ 473
           A+MTV    ++Y V  +        CL   +      T  +IG++ Q N  V YD +  +
Sbjct: 363 AQMTVAGDRLLYKVPGERRGGDGVWCLTFGNADMVPITAYVIGHHHQMNVWVEYDLERGR 422

Query: 474 LGFAGEDC 481
           +G A   C
Sbjct: 423 VGLAPIRC 430


>gi|356498789|ref|XP_003518231.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 446

 Score =  124 bits (310), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 108/360 (30%), Positives = 164/360 (45%), Gaps = 54/360 (15%)

Query: 149 IVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSS 208
           ++DTGS LTWV C PC SC  Q  P+FDPS S +Y  + C  S C+  +   G       
Sbjct: 109 VMDTGSSLTWVMCHPCSSCSQQSVPIFDPSKSSTYSNLSC--SECNKCDVVNG------- 159

Query: 209 SSPPDCNYFVSY-GDGS----YTRGELGREHLGLGKASVNDFIFGCGR-----NNKGLFG 258
               +C Y V Y G GS    Y R +L  E +      V   IFGCGR     +N   + 
Sbjct: 160 ----ECPYSVEYVGSGSSQGIYAREQLTLETIDESIIKVPSLIFGCGRKFSISSNGYPYQ 215

Query: 259 GVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGAS-GSLILGGNSSVFKNSTPIT 317
           G++G+ GLG    SL+      FG  FSYC+ + ++       L+LG  +++  +ST + 
Sbjct: 216 GINGVFGLGSGRFSLLPS----FGKKFSYCIGNLRNTNYKFNRLVLGDKANMQGDSTTLN 271

Query: 318 YTNMIPNPQLATFYILNLTGISIGGKQL--QASGFAK------GGILIDSGTVITRLPPS 369
             N +        Y +NL  ISIGG++L    + F +       G++IDSG   T L   
Sbjct: 272 VINGL--------YYVNLEAISIGGRKLDIDPTLFERSITDNNSGVIIDSGADHTWLTKY 323

Query: 370 IYSALKAEFLKQFSG---FPSAPGFSILDTCFNLSAYQEVN-IPLVKMEFEGNAEMTVDV 425
            +  L  E      G          +    C++    Q+++  PLV   F   A + +DV
Sbjct: 324 GFEVLSFEVENLLEGVLVLAQQDKHNPYTLCYSGVVSQDLSGFPLVTFHFAEGAVLDLDV 383

Query: 426 TGIVYFVKSDASQVCLALASLSY--EDETGI--IGNYQQKNQRVIYDTKNSQLGFAGEDC 481
           T +  F+++  ++ C+A+   +Y  +D      IG   Q+N  V YD    ++ F   DC
Sbjct: 384 TSM--FIQTTENEFCMAMLPGNYFGDDYESFSSIGMLAQQNYNVGYDLNRMRVYFQRIDC 441


>gi|356553832|ref|XP_003545255.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 427

 Score =  124 bits (310), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 117/378 (30%), Positives = 178/378 (47%), Gaps = 47/378 (12%)

Query: 132 TLNYIATIELGGRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSS 191
           TL    TI    +N+T+++DTGS+L+W+ C+   +     +  F+P +S SY    CNSS
Sbjct: 58  TLTISLTIGSPPQNVTMVLDTGSELSWLHCKKLPNL----NSTFNPLLSSSYTPTPCNSS 113

Query: 192 TCHALEFATGNSGVCSSSSPPD--CNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGC 249
            C      T +  + +S  P +  C+  VSY D S   G L  E   L  A+    +FGC
Sbjct: 114 VCMT---RTRDLTIPASCDPNNKLCHVIVSYADASSAEGTLAAETFSLAGAAQPGTLFGC 170

Query: 250 GRNNKGLFGGV------SGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLIL 303
             ++ G    +      +GLMG+ R  LSLV+Q   +    FSYC+ S +D  A G L+L
Sbjct: 171 -MDSAGYTSDINEDAKTTGLMGMNRGSLSLVTQ---MVLPKFSYCI-SGED--AFGVLLL 223

Query: 304 GGNSSVFKNSTPITYTNMIPNPQLATF-----YILNLTGISIGGK--QLQASGFAK---- 352
           G   S     +P+ YT ++     + +     Y + L GI +  K  QL  S F      
Sbjct: 224 GDGPSA---PSPLQYTPLVTATTSSPYFDRVAYTVQLEGIKVSEKLLQLPKSVFVPDHTG 280

Query: 353 -GGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSA---PGFSI---LDTCFNLSAYQE 405
            G  ++DSGT  T L   +Y++LK EFL+Q  G  +    P F     +D C++  A   
Sbjct: 281 AGQTMVDSGTQFTFLLGPVYNSLKDEFLEQTKGVLTRIEDPNFVFEGAMDLCYHAPA-SL 339

Query: 406 VNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV-CLALASLSYED-ETGIIGNYQQKNQ 463
             +P V + F G AEM V    ++Y V      V C    +      E  +IG++ Q+N 
Sbjct: 340 AAVPAVTLVFSG-AEMRVSGERLLYRVSKGRDWVYCFTFGNSDLLGIEAYVIGHHHQQNV 398

Query: 464 RVIYDTKNSQLGFAGEDC 481
            + +D   S++GF    C
Sbjct: 399 WMEFDLVKSRVGFTETTC 416


>gi|255566008|ref|XP_002523992.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223536719|gb|EEF38360.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 442

 Score =  124 bits (310), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 122/433 (28%), Positives = 188/433 (43%), Gaps = 38/433 (8%)

Query: 63  EMGAITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEI 122
           E  + T EL H++  +  + + +E    RL           +R  ++IS +I   +  E 
Sbjct: 33  EKLSFTTELIHRDSPNSPLFNASETTDIRLANAVERSADRVNRFNDLISNSI---TAAEF 89

Query: 123 PLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQD-PVFDPSI 179
           P      L   +++  I +G     + V V TGSDL W+ C   K C +  D   FDP  
Sbjct: 90  PSI----LDNGDFLMKISIGIPPTELLVNVATGSDLVWIPCLSFKPCTHNCDLRFFDPME 145

Query: 180 SPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGK 239
           S +YK V C+S  C     AT     C  S  P           S   G+L  + L L  
Sbjct: 146 SSTYKNVPCDSYRCQITNAATCQFSDCFYSCDPR-------HQDSCPDGDLAMDTLTLNS 198

Query: 240 ASVNDFI-----FGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQD 294
            +   F+     F CG    G + GV G++GLG   LSL+++ S +  G FS+C+     
Sbjct: 199 TTGKSFMLPNTGFICGNRIGGDYPGV-GILGLGHGSLSLLNRISHLIDGKFSHCI-VPYS 256

Query: 295 AGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASG----F 350
           +  +  L  G  + V  ++   T  +M   P     Y L+  GIS+G K + A G    +
Sbjct: 257 SNQTSKLSFGDKAVVSGSAMFSTRLDMTGGPYS---YTLSFYGISVGNKSISAGGIGSDY 313

Query: 351 AKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFS-ILDTCFNLSAYQEVNIP 409
              G+ +DSGT+ T  P   YS L+ +        P  P  +  L  C+  S   + + P
Sbjct: 314 YMNGLGMDSGTMFTYFPEYFYSQLEYDVRYAIQQEPLYPDPTRRLRLCYRYSP--DFSPP 371

Query: 410 LVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDT 469
            + M FEG +   V+++    F++     VCLA A+ S E +  + G +QQ N  + YD 
Sbjct: 372 TITMHFEGGS---VELSSSNSFIRMTEDIVCLAFATSSSEQD-AVFGYWQQTNLLIGYDL 427

Query: 470 KNSQLGFAGEDCS 482
               L F   DC+
Sbjct: 428 DAGFLSFLKTDCT 440


>gi|356537928|ref|XP_003537458.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 445

 Score =  123 bits (309), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 123/385 (31%), Positives = 182/385 (47%), Gaps = 54/385 (14%)

Query: 132 TLNYIATIELGGRNMTVIVDTGSDLTWVQCQPCKSCYNQQ--DPVFDPSISPSYKKVLCN 189
           TL    T+    +++T+++DTGS+L+W+ C+       QQ  + VF+P +S SY  + C 
Sbjct: 69  TLTVSLTVGTPPQSVTMVLDTGSELSWLHCK------KQQNINSVFNPHLSSSYTPIPCM 122

Query: 190 SSTC--HALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIF 247
           S  C     +F    S  C S++   C+  VSY D +   G L  +   +  +     IF
Sbjct: 123 SPICKTRTRDFLIPVS--CDSNN--LCHVTVSYADFTSLEGNLASDTFAISGSGQPGIIF 178

Query: 248 GCG----RNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLIL 303
           G       +N       +GLMG+ R  LS V+Q        FSYC+ S +D  ASG L+ 
Sbjct: 179 GSMDSGFSSNANEDSKTTGLMGMNRGSLSFVTQMGF---PKFSYCI-SGKD--ASGVLLF 232

Query: 304 GGNSSVFKNSTPITYTNMIP-NPQLATF----YILNLTGISIGGKQLQASG--FA----- 351
           G   + FK   P+ YT ++  N  L  F    Y + L GI +G K LQ     FA     
Sbjct: 233 G--DATFKWLGPLKYTPLVKMNTPLPYFDRVAYTVRLMGIRVGSKPLQVPKEIFAPDHTG 290

Query: 352 KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSA---PGFSI---LDTCFNLSAYQE 405
            G  ++DSGT  T L  S+Y+AL+ EF+ Q  G  +    P F     +D CF +     
Sbjct: 291 AGQTMVDSGTRFTFLLGSVYTALRNEFVAQTRGVLTLLEDPNFVFEGAMDLCFRVRRGGV 350

Query: 406 V-NIPLVKMEFEGNAEMTVDVTGIVYFV-------KSDASQVCLALASLSYED-ETGIIG 456
           V  +P V M FEG AEM+V    ++Y V       K +    CL   +      E  +IG
Sbjct: 351 VPAVPAVTMVFEG-AEMSVSGERLLYRVGGDGDVAKGNGDVYCLTFGNSDLLGIEAYVIG 409

Query: 457 NYQQKNQRVIYDTKNSQLGFAGEDC 481
           ++ Q+N  + +D  NS++GFA   C
Sbjct: 410 HHHQQNVWMEFDLVNSRVGFADTKC 434


>gi|255547548|ref|XP_002514831.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
 gi|223545882|gb|EEF47385.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
          Length = 488

 Score =  123 bits (309), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 105/384 (27%), Positives = 171/384 (44%), Gaps = 58/384 (15%)

Query: 135 YIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQD-----PVFDPSISPSYKKVL 187
           Y A I LG   ++  V VDTGSD+ WV C  C  C  + D      ++DP  S S  ++ 
Sbjct: 82  YFAKIGLGNPPKDYYVQVDTGSDILWVNCANCDKCPTKSDLGVKLTLYDPQSSTSATRIY 141

Query: 188 CNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVN---- 243
           C+   C A     G    C+   P  C Y V YGDGS T G   +++L   + + N    
Sbjct: 142 CDDDFCAAT--YNGVLQGCTKDLP--CQYSVVYGDGSSTAGFFVKDNLQFDRVTGNLQTS 197

Query: 244 ----DFIFGCGRNNKGLFG----GVSGLMGLGRSDLSLVSQTSEI--FGGLFSYCLPSTQ 293
                 IFGCG    G  G     + G++G G+++ S++SQ +       +F++CL + +
Sbjct: 198 SANGSVIFGCGAKQSGELGTSSEALDGILGFGQANSSMISQLAAAGKVKRVFAHCLDNVK 257

Query: 294 DAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGK--QLQASGFA 351
             G      +G   S   N+TP     M+PN      Y + +  I +GG   +L    F 
Sbjct: 258 GGGI---FAIGEVVSPKVNTTP-----MVPN---QPHYNVVMKEIEVGGNVLELPTDIFD 306

Query: 352 KG---GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILD-----TCFNLSAY 403
            G   G +IDSGT +  LP  +Y ++  + + +       PG  +       TCF  +  
Sbjct: 307 TGDRRGTIIDSGTTLAYLPEVVYESMMTKIVSE------QPGLKLHTVEEQFTCFQYTGN 360

Query: 404 QEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETG----IIGNYQ 459
                P+VK  F G+  +TV+    ++ +  +    C    +   + + G    ++G+  
Sbjct: 361 VNEGFPVVKFHFNGSLSLTVNPHDYLFQIHEEVW--CFGWQNSGMQSKDGRDMTLLGDLV 418

Query: 460 QKNQRVIYDTKNSQLGFAGEDCSS 483
             N+ V+YD +N  +G+   +CSS
Sbjct: 419 LSNKLVLYDLENQAIGWTDYNCSS 442


>gi|147858841|emb|CAN78694.1| hypothetical protein VITISV_037475 [Vitis vinifera]
          Length = 442

 Score =  123 bits (309), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 110/374 (29%), Positives = 176/374 (47%), Gaps = 53/374 (14%)

Query: 135 YIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
           ++A + +G    N+ V++DTGSDL W+QC+PC  CY Q+DP+++ + S SY ++LCN   
Sbjct: 93  FLANLSIGNPPTNVYVVLDTGSDLFWIQCEPCDVCYKQKDPIYNRTKSDSYTEMLCNEPP 152

Query: 193 CHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL-----GKASVNDFIF 247
           C +L    G  G CS S    C Y  +Y DG+ T G L  E +        +       F
Sbjct: 153 CVSL----GREGQCSDSG--SCLYQTAYADGARTSGLLSYEKVAFTSHYSDEDKTAQVGF 206

Query: 248 GCGRNNKGLFGGVSG--LMGLGRSDLSLVSQTSEI--FGGLFSYCLPSTQDAGASGSLIL 303
           GCG  N           ++GLG   +SLVSQ S I      F+YC  +  +  A G L+ 
Sbjct: 207 GCGLQNLNFITSNRDGGVLGLGPGLVSLVSQLSAIGKVSKSFAYCFGNISNPNAGGFLVF 266

Query: 304 GGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQ----LQASGFAK-----GG 354
           G  + +  + TP+          +A FY +NL GI +G  +    + +S F +     GG
Sbjct: 267 GDATYLNGDMTPMV---------IAEFYYVNLLGIGLGVGEPRLDINSSSFERKPDGSGG 317

Query: 355 ILIDSGTVITRLPPSIYSALKAEFLKQF-SGFPSAPGFSILDTCFNLSAYQEVNIPLVKM 413
           ++IDSG+ ++  PP +Y  ++   + +   G+  +P  S  D CF      E ++PL   
Sbjct: 318 VIIDSGSTLSVFPPEVYEVVRNAVVDKLKKGYNISPLTSSPD-CFE--GKIERDLPLFP- 373

Query: 414 EFEGNAEMTVDVTGIV-----YFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYD 468
                  + ++ TGI+      F++      CL   S    +   IIG   Q++ +  Y+
Sbjct: 374 ----TLVLYLESTGILNDRWSIFLQRYDELFCLGFTS---GEGLSIIGTLAQQSYKFGYN 426

Query: 469 TKNSQLGF-AGEDC 481
            + S L   +  DC
Sbjct: 427 LELSTLSIESNPDC 440


>gi|38605896|emb|CAD41523.2| OSJNBb0020O11.8 [Oryza sativa Japonica Group]
          Length = 519

 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 106/391 (27%), Positives = 172/391 (43%), Gaps = 58/391 (14%)

Query: 145 NMTVIVDTGSDLTWVQCQP--CKSCYNQQDP------VFDPSISPSYKKVLCNSSTCHAL 196
           ++++ +DTGSDL W  C P  C  C  +  P         P I    +++ C S  C A 
Sbjct: 102 SVSLFLDTGSDLVWFPCAPFTCMLCEGKATPGGNHSSPLPPPID--SRRISCASPLCSAA 159

Query: 197 EFATGNSGVCSSSSPP-------DCN------YFVSYGDGSYTRGELGREHLGLGKA-SV 242
             +   S +C+++  P        C        + +YGDGS     L R  +GL  + +V
Sbjct: 160 HSSAPTSDLCAAARCPLDAIETDSCASHACPPLYYAYGDGSLV-ANLRRGRVGLAASMAV 218

Query: 243 NDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAG----AS 298
            +F F C            G+ G GR  LSL +Q +    G FSYCL +          S
Sbjct: 219 ENFTFACAHT---ALAEPVGVAGFGRGPLSLPAQLAPSLSGRFSYCLVAHSFRADRLIRS 275

Query: 299 GSLILGGN---SSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGF----- 350
             LILG +   +++  + T   YT ++ NP+   FY + L  +S+GGK++QA        
Sbjct: 276 SPLILGRSTDAAAIGASETDFVYTPLLHNPKHPYFYSVALEAVSVGGKRIQAQPELGDVD 335

Query: 351 --AKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPS-----APGFSILDTCFNLSAY 403
               GG+++DSGT  T LP   ++ +  EF +  +         A   + L  C++ S  
Sbjct: 336 RDGNGGMVVDSGTTFTMLPSDTFARVADEFARAMAAARFTRAEGAEAQTGLAPCYHYSPS 395

Query: 404 QEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV--CLALASLSYEDE--------TG 453
               +P V + F GNA + +         KS+  +   CL L ++   ++         G
Sbjct: 396 DRA-VPPVALHFRGNATVALPRRNYFMGFKSEEGRSVGCLMLMNVGGNNDDGEDGGGPAG 454

Query: 454 IIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
            +GN+QQ+   V+YD    ++GFA   C+ +
Sbjct: 455 TLGNFQQQGFEVVYDVDAGRVGFARRRCTDL 485


>gi|125527257|gb|EAY75371.1| hypothetical protein OsI_03267 [Oryza sativa Indica Group]
          Length = 484

 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 108/446 (24%), Positives = 186/446 (41%), Gaps = 70/446 (15%)

Query: 94  LDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVD 151
           +D   + ++ SR +   +   +  S   +PL+SG    T  Y     +G   +   ++ D
Sbjct: 49  MDRERMAFISSRGRRRAA---ETASAFAMPLSSGAYTGTGQYFVRFRVGTPAQPFLLVAD 105

Query: 152 TGSDLTWVQCQ----------------PCKSCYNQQDPVFDPSISPSYKKVLCNSSTCH- 194
           TGSDLTWV+C                 P  +  + +   F P  S ++  + C+S+TC  
Sbjct: 106 TGSDLTWVKCHRAAAAASASPRNASSLPAPAPASPRR-TFRPDKSRTWAPIPCSSATCRE 164

Query: 195 ALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG-------KASVNDFIF 247
           +L F+      C++ + P C Y   Y DGS  RG +G +   +        KA +   + 
Sbjct: 165 SLPFSLA---ACATPANP-CAYDYRYKDGSAARGTVGVDSATIALSGRAARKAKLRGVVL 220

Query: 248 GCGRNNKGL-FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGN 306
           GC  +  G  F    G++ LG S++S  S+ +  FGG FSYCL        + S +  G 
Sbjct: 221 GCTTSYNGQSFLASDGVLSLGYSNISFASRAASRFGGRFSYCLVDHLAPRNATSYLTFGP 280

Query: 307 SSVFKNSTP----------------------ITYTNMIPNPQLATFYILNLTGISIGGKQ 344
           +  F +  P                         T ++ + +   FY + + G+S+ G+ 
Sbjct: 281 NPAFSSRRPSEGIASCKPAPAPTPAPAGAPGARQTPLVLDHRTRPFYAVTVKGVSVAGEL 340

Query: 345 LQASGFA-----KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFN 399
           L+           GG ++DSGT +T L    Y A+ A   K+ +G P        D C+N
Sbjct: 341 LKIPRAVWDVEQGGGAILDSGTSLTMLAKPAYRAVVAALSKRLAGLPRV-TMDPFDYCYN 399

Query: 400 LSAYQEVNI----PLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGII 455
            ++    ++    P++ + F G+A +  +     Y + +     C+ L    +   + +I
Sbjct: 400 WTSPSGSDVAAPLPMLAVHFAGSARL--EPPAKSYVIDAAPGVKCIGLQEGPWPGLS-VI 456

Query: 456 GNYQQKNQRVIYDTKNSQLGFAGEDC 481
           GN  Q+     YD KN +L F    C
Sbjct: 457 GNILQQEHLWEYDLKNRRLRFKRSRC 482


>gi|21717162|gb|AAM76355.1|AC074196_13 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433286|gb|AAP54824.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
          Length = 397

 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 93/353 (26%), Positives = 157/353 (44%), Gaps = 21/353 (5%)

Query: 138 TIELGGRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALE 197
           TI    +  +  +D   +L W QC  C  C+ Q  PVF P+ S ++K   C +  C ++ 
Sbjct: 59  TIGTPPQAASAFIDLTGELVWTQCSQCIHCFKQDLPVFVPNASSTFKPEPCGTDVCKSIP 118

Query: 198 FATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGC-GRNNKGL 256
                S VC+        Y    G G +T G +  +   +G A+     FGC   ++   
Sbjct: 119 TPKCASDVCA--------YDGVTGLGGHTVGIVATDTFAIGTAAPASLGFGCVVASDIDT 170

Query: 257 FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPI 316
            GG SG +GLGR+  SLV+Q        FSYCL +  D G +  L LG ++ +       
Sbjct: 171 MGGPSGFIGLGRTPWSLVAQMKLT---RFSYCL-APHDTGKNSRLFLGASAKLAGGGAWT 226

Query: 317 TYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGGILIDSGTV-ITRLPPSIYSALK 375
            +    PN  ++ +Y + L  I  G   +      +  +L+ +  V ++ L  S+Y   K
Sbjct: 227 PFVKTSPNDGMSQYYPIELEEIKAGDATITMPR-GRNTVLVQTAVVRVSLLVDSVYQEFK 285

Query: 376 AEFLKQFSGFPSA-PGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKS 434
              +      P+A P  +  + CF  +       P +   F+  A +TV     ++ V +
Sbjct: 286 KAVMASVGAAPTATPVGAPFEVCFPKAGVS--GAPDLVFTFQAGAALTVPPANYLFDVGN 343

Query: 435 DA---SQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
           D    S + +AL +++  D   I+G++QQ+N  +++D     L F   DCSS+
Sbjct: 344 DTVCLSVMSIALLNITALDGLNILGSFQQENVHLLFDLDKDMLSFEPADCSSL 396


>gi|414869114|tpg|DAA47671.1| TPA: hypothetical protein ZEAMMB73_872184 [Zea mays]
          Length = 492

 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 105/380 (27%), Positives = 176/380 (46%), Gaps = 39/380 (10%)

Query: 120 TEIPLTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDP 177
           T IP+        L+Y   +  G   +   + +DT   ++ V C+PC       DP FD 
Sbjct: 134 TIIPIDGSPDAGALDYTVNVGYGTPEQQFPMFLDTIFGVSLVLCKPCAPGSTSCDPAFDT 193

Query: 178 SISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL 237
           S S ++  V C+S  C +    T N   CS+ S    N F       +  G   ++ L +
Sbjct: 194 SQSTTFTHVPCDSPDCPS----TAN---CSAGSVCPFNLF-------FVEGTFSQDVLTV 239

Query: 238 GKA-SVNDFIFGCGRNNKGLFGGVS--GLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQD 294
             + +V DF F C   + G   G+   G + L R   SL S+ +      FSYC+P   D
Sbjct: 240 APSVAVQDFTFVC--LDAGASDGMPEVGTLDLSRDRNSLPSRLAGSASAAFSYCMPQYPD 297

Query: 295 AGASGSLILGGNSSVFKNS----TPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGF 350
           +   G L LG +++V  ++     P+  ++   +P LA  Y +++ G+S+G   L     
Sbjct: 298 S--PGFLSLGDDATVRGDNCTAHAPLLSSD---DPDLANMYFIDVVGMSLGDVDLPIPSG 352

Query: 351 AKG---GILIDSGTVITRLPPSIYSALKAEFLKQFSGF-PSAPGFSILDTCFNLSAYQEV 406
             G     ++++GT  T L P  Y+ L+  F +  + +  S PGF   DTC+N +  QE+
Sbjct: 353 TFGNNASTIVEAGTTFTMLAPDAYTPLRDAFRQAMAQYNRSVPGFYDFDTCYNFTGLQEL 412

Query: 407 NIPLVKMEFEGNAEMTVDVTGIVYF-VKSDA--SQVCLALASL--SYEDETGIIGNYQQK 461
            +PLV+ +F     + +D   ++Y+ + S+   +  CLA ++L    +D + +IG Y   
Sbjct: 413 TVPLVEFKFGNGDSLLIDGDQMLYYDIPSEGPFTVTCLAFSTLDVDDDDVSAVIGAYSLA 472

Query: 462 NQRVIYDTKNSQLGFAGEDC 481
              V+YD     +GF  E C
Sbjct: 473 TTEVVYDVAGGTVGFIPESC 492


>gi|115465373|ref|NP_001056286.1| Os05g0557100 [Oryza sativa Japonica Group]
 gi|113579837|dbj|BAF18200.1| Os05g0557100 [Oryza sativa Japonica Group]
 gi|125553268|gb|EAY98977.1| hypothetical protein OsI_20935 [Oryza sativa Indica Group]
          Length = 494

 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 106/405 (26%), Positives = 175/405 (43%), Gaps = 52/405 (12%)

Query: 122 IPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDP------ 173
           +PL+SG    T  Y     +G   +   +I DTGSDLTWV+C+   S  +          
Sbjct: 97  MPLSSGAYTGTGQYFVRFRVGTPAQPFVLIADTGSDLTWVKCRGAASPSHATATASPAAA 156

Query: 174 ---------VFDPSISPSYKKVLCNSSTCHA-LEFATGNSGVCSSSSPPDCNYFVSYGDG 223
                    VF P  S ++  + C+S TC + + F+  N   CSSS+   C+Y   Y D 
Sbjct: 157 PSPAVAPPRVFRPGDSKTWSPIPCSSETCKSTIPFSLAN---CSSSTA-ACSYDYRYNDN 212

Query: 224 SYTRGELGREHLGLG-------------KASVNDFIFGCGRNNKGL-FGGVSGLMGLGRS 269
           S  RG +G +   +              KA +   + GC   + G  F    G++ LG S
Sbjct: 213 SAARGVVGTDSATVALSGGRGGGGGGDRKAKLQGVVLGCTTAHAGQGFEASDGVLSLGYS 272

Query: 270 DLSLVSQTSEIFGGLFSYCL-PSTQDAGASGSLILGGNSSVFKNSTPI--TYTNMIPNPQ 326
           ++S  S+ +  FGG FSYCL        A+  L  G       +S P   + T ++ + +
Sbjct: 273 NISFASRAASRFGGRFSYCLVDHLAPRNATSYLTFGAGPDAASSSAPAPGSRTPLLLDAR 332

Query: 327 LATFYILNLTGISIGGKQLQASGF-----AKGGILIDSGTVITRLPPSIYSALKAEFLKQ 381
           +  FY + +  +S+ G  L          + GG +IDSGT +T L    Y A+ A   +Q
Sbjct: 333 VRPFYAVAVDSVSVDGVALDIPAEVWDVGSNGGTIIDSGTSLTVLATPAYKAVVAALSEQ 392

Query: 382 FSGFPSAPGFSILDTCFNLSAY----QEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDAS 437
            +G P        D C+N +A      ++ +P + ++F G+A +  +     Y + +   
Sbjct: 393 LAGLPRV-AMDPFDYCYNWTARGDGGGDLAVPKLAVQFAGSARL--EPPAKSYVIDAAPG 449

Query: 438 QVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
             C+ +   ++   + +IGN  Q+     +D  N  L F    C+
Sbjct: 450 VKCIGVQEGAWPGVS-VIGNILQQEHLWEFDLNNRWLRFRQTSCT 493


>gi|115459640|ref|NP_001053420.1| Os04g0535200 [Oryza sativa Japonica Group]
 gi|113564991|dbj|BAF15334.1| Os04g0535200 [Oryza sativa Japonica Group]
 gi|116310090|emb|CAH67110.1| H0502G05.1 [Oryza sativa Indica Group]
 gi|116310464|emb|CAH67468.1| OSIGBa0159I10.13 [Oryza sativa Indica Group]
 gi|215715343|dbj|BAG95094.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215765807|dbj|BAG87504.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215767550|dbj|BAG99778.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|218195278|gb|EEC77705.1| hypothetical protein OsI_16781 [Oryza sativa Indica Group]
          Length = 492

 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 106/391 (27%), Positives = 172/391 (43%), Gaps = 58/391 (14%)

Query: 145 NMTVIVDTGSDLTWVQCQP--CKSCYNQQDP------VFDPSISPSYKKVLCNSSTCHAL 196
           ++++ +DTGSDL W  C P  C  C  +  P         P I    +++ C S  C A 
Sbjct: 102 SVSLFLDTGSDLVWFPCAPFTCMLCEGKATPGGNHSSPLPPPID--SRRISCASPLCSAA 159

Query: 197 EFATGNSGVCSSSSPP-------DCN------YFVSYGDGSYTRGELGREHLGLGKA-SV 242
             +   S +C+++  P        C        + +YGDGS     L R  +GL  + +V
Sbjct: 160 HSSAPTSDLCAAARCPLDAIETDSCASHACPPLYYAYGDGSLV-ANLRRGRVGLAASMAV 218

Query: 243 NDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAG----AS 298
            +F F C            G+ G GR  LSL +Q +    G FSYCL +          S
Sbjct: 219 ENFTFACAHT---ALAEPVGVAGFGRGPLSLPAQLAPSLSGRFSYCLVAHSFRADRLIRS 275

Query: 299 GSLILGGN---SSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGF----- 350
             LILG +   +++  + T   YT ++ NP+   FY + L  +S+GGK++QA        
Sbjct: 276 SPLILGRSTDAAAIGASETDFVYTPLLHNPKHPYFYSVALEAVSVGGKRIQAQPELGDVD 335

Query: 351 --AKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPS-----APGFSILDTCFNLSAY 403
               GG+++DSGT  T LP   ++ +  EF +  +         A   + L  C++ S  
Sbjct: 336 RDGNGGMVVDSGTTFTMLPSDTFARVADEFARAMAAARFTRAEGAEAQTGLAPCYHYSPS 395

Query: 404 QEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV--CLALASLSYEDE--------TG 453
               +P V + F GNA + +         KS+  +   CL L ++   ++         G
Sbjct: 396 DRA-VPPVALHFRGNATVALPRRNYFMGFKSEEGRSVGCLMLMNVGGNNDDGEDGGGPAG 454

Query: 454 IIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
            +GN+QQ+   V+YD    ++GFA   C+ +
Sbjct: 455 TLGNFQQQGFEVVYDVDAGRVGFARRRCTDL 485


>gi|125571841|gb|EAZ13356.1| hypothetical protein OsJ_03278 [Oryza sativa Japonica Group]
          Length = 447

 Score =  123 bits (308), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 60/146 (41%), Positives = 87/146 (59%), Gaps = 6/146 (4%)

Query: 123 PLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSIS 180
           P+ SGI  ++  Y A + +G       +++DTGSDL W+QC PC+ CY Q+  VFDP  S
Sbjct: 74  PVFSGIPFESGEYFALVGVGTPSTKAMLVIDTGSDLVWLQCSPCRRCYAQRGQVFDPRRS 133

Query: 181 PSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA 240
            +Y++V C+S  C AL F   +SG  +      C Y V+YGDGS + G+L  + L     
Sbjct: 134 STYRRVPCSSPQCRALRFPGCDSGGAAGGG---CRYMVAYGDGSSSTGDLATDKLAFAND 190

Query: 241 S-VNDFIFGCGRNNKGLFGGVSGLMG 265
           + VN+   GCGR+N+GLF   +GL+G
Sbjct: 191 TYVNNVTLGCGRDNEGLFDSAAGLLG 216



 Score = 60.8 bits (146), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 39/134 (29%), Positives = 64/134 (47%), Gaps = 10/134 (7%)

Query: 358 DSGTVITRLPPSIYSALKAEFLKQFSGFPSAPG---FSILDTCFNLSAYQEVNIPLVKME 414
           DSGT I+R     Y+AL+  F  +             S+ D C++L      + PL+ + 
Sbjct: 316 DSGTAISRFARDAYAALRDAFDARARAAGMRRLAGEHSVFDACYDLRGRPAASAPLIVLH 375

Query: 415 FEGNAEMTVDVTGIVYFVKSDASQVCLA-----LASLSYEDETGIIGNYQQKNQRVIYDT 469
           F G A+M +      YF+  D  +   A     L   + +D   +IGN QQ+  RV++D 
Sbjct: 376 FAGGADMALPPEN--YFLPVDGGRRRAASYRRCLGFEAADDGLSVIGNVQQQGFRVVFDV 433

Query: 470 KNSQLGFAGEDCSS 483
           +  ++GFA + C+S
Sbjct: 434 EKERIGFAPKGCTS 447


>gi|15229663|ref|NP_190574.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|6522926|emb|CAB62113.1| putative protein [Arabidopsis thaliana]
 gi|53828539|gb|AAU94379.1| At3g50050 [Arabidopsis thaliana]
 gi|55733749|gb|AAV59271.1| At3g50050 [Arabidopsis thaliana]
 gi|332645100|gb|AEE78621.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 632

 Score =  123 bits (308), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 103/356 (28%), Positives = 166/356 (46%), Gaps = 39/356 (10%)

Query: 146 MTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGV 205
             +IVD+GS +T+V C  C+ C   QDP F P +S +Y+ V CN   C+           
Sbjct: 106 FALIVDSGSTVTYVPCSDCEQCGKHQDPKFQPEMSSTYQPVKCNMD-CN----------- 153

Query: 206 CSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS---VNDFIFGCGRNNKG-LFG-GV 260
           C       C Y   Y + S ++G LG + +  G  S       +FGC     G L+    
Sbjct: 154 CDDDR-EQCVYEREYAEHSSSKGVLGEDLISFGNESQLTPQRAVFGCETVETGDLYSQRA 212

Query: 261 SGLMGLGRSDLSLVSQTSE--IFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITY 318
            G++GLG+ DLSLV Q  +  +    F  C     D G  GS+ILGG    F   + + +
Sbjct: 213 DGIIGLGQGDLSLVDQLVDKGLISNSFGLCY-GGMDVGG-GSMILGG----FDYPSDMVF 266

Query: 319 TNMIPNPQLATFYILNLTGISIGGKQLQASGFA---KGGILIDSGTVITRLPPSIYSALK 375
           T+   +P  + +Y ++LTGI + GKQL         + G ++DSGT    LP + ++A +
Sbjct: 267 TDS--DPDRSPYYNIDLTGIRVAGKQLSLHSRVFDGEHGAVLDSGTTYAYLPDAAFAAFE 324

Query: 376 AEFLKQFSGFP--SAPGFSILDTCFNLSAYQEVN-----IPLVKMEFEGNAEMTVDVTGI 428
              +++ S       P  +  DTCF ++A   V+      P V+M F+      +     
Sbjct: 325 EAVMREVSTLKQIDGPDPNFKDTCFQVAASNYVSELSKIFPSVEMVFKSGQSWLLSPENY 384

Query: 429 VYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
           ++         CL +   + +D T ++G    +N  V+YD +NS++GF   +CS +
Sbjct: 385 MFRHSKVHGAYCLGVFP-NGKDHTTLLGGIVVRNTLVVYDRENSKVGFWRTNCSEL 439


>gi|15241713|ref|NP_195839.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|75181297|sp|Q9LZL3.1|PCS1L_ARATH RecName: Full=Aspartic proteinase PCS1; AltName: Full=Aspartic
           protease 38; Short=AtASP38; AltName: Full=Protein EMBRYO
           DEFECTIVE 24; AltName: Full=Protein PROMOTION OF CELL
           SURVIVAL 1; Flags: Precursor
 gi|7340693|emb|CAB82992.1| putative protein [Arabidopsis thaliana]
 gi|50897174|gb|AAT85726.1| At5g02190 [Arabidopsis thaliana]
 gi|53828617|gb|AAU94418.1| At5g02190 [Arabidopsis thaliana]
 gi|110742159|dbj|BAE99007.1| hypothetical protein [Arabidopsis thaliana]
 gi|332003059|gb|AED90442.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 453

 Score =  123 bits (308), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 114/374 (30%), Positives = 174/374 (46%), Gaps = 51/374 (13%)

Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPV--FDPSISPSYKKVLCNSSTCHALEFATG 201
           +N+++++DTGS+L+W++C       +  +PV  FDP+ S SY  + C+S TC        
Sbjct: 84  QNISMVIDTGSELSWLRCNRS----SNPNPVNNFDPTRSSSYSPIPCSSPTCRTRTRDFL 139

Query: 202 NSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVND--FIFGCGRNNKG---- 255
               C S     C+  +SY D S + G L  E    G  S ND   IFGC  +  G    
Sbjct: 140 IPASCDSDK--LCHATLSYADASSSEGNLAAEIFHFGN-STNDSNLIFGCMGSVSGSDPE 196

Query: 256 LFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTP 315
                +GL+G+ R  LS +SQ        FSYC+  T D    G L+LG   S F   TP
Sbjct: 197 EDTKTTGLLGMNRGSLSFISQMGF---PKFSYCISGTDD--FPGFLLLG--DSNFTWLTP 249

Query: 316 ITYTNMI----PNPQLATF-YILNLTGISIGGKQLQ-------ASGFAKGGILIDSGTVI 363
           + YT +I    P P      Y + LTGI + GK L              G  ++DSGT  
Sbjct: 250 LNYTPLIRISTPLPYFDRVAYTVQLTGIKVNGKLLPIPKSVLVPDHTGAGQTMVDSGTQF 309

Query: 364 TRLPPSIYSALKAEFLKQFSGFPSA---PGFSI---LDTCFNLSAYQEVN-----IPLVK 412
           T L   +Y+AL++ FL + +G  +    P F     +D C+ +S  +  +     +P V 
Sbjct: 310 TFLLGPVYTALRSHFLNRTNGILTVYEDPDFVFQGTMDLCYRISPVRIRSGILHRLPTVS 369

Query: 413 MEFEGNAEMTVDVTGIVYFVKS----DASQVCLALASLSYED-ETGIIGNYQQKNQRVIY 467
           + FEG AE+ V    ++Y V      + S  C    +      E  +IG++ Q+N  + +
Sbjct: 370 LVFEG-AEIAVSGQPLLYRVPHLTVGNDSVYCFTFGNSDLMGMEAYVIGHHHQQNMWIEF 428

Query: 468 DTKNSQLGFAGEDC 481
           D + S++G A  +C
Sbjct: 429 DLQRSRIGLAPVEC 442


>gi|326503602|dbj|BAJ86307.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 461

 Score =  122 bits (307), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 115/369 (31%), Positives = 166/369 (44%), Gaps = 45/369 (12%)

Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNS 203
           +N+T+++DTGS+L+W+ C P  +        F P  S ++  V C S+ C + +  +  +
Sbjct: 96  QNVTMVLDTGSELSWLLCAPAGARNKFSAMSFRPRASSTFAAVPCASAQCRSRDLPSPPA 155

Query: 204 GVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGC-----GRNNKGLFG 258
              +SS    C+  +SY DGS + G L  +   +G        FGC       +  G+  
Sbjct: 156 CDGASSR---CSVSLSYADGSSSDGALATDVFAVGSGPPLRAAFGCMSSAFDSSPDGV-- 210

Query: 259 GVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITY 318
             +GL+G+ R  LS VSQ S      FSYC+    DAG    L+LG   S      P+ Y
Sbjct: 211 ASAGLLGMNRGALSFVSQAST---RRFSYCISDRDDAGV---LLLG--HSDLPTFLPLNY 262

Query: 319 TNM----IPNPQL-ATFYILNLTGISIGGKQL--QASGFAK-----GGILIDSGTVITRL 366
           T M    +P P      Y + L GI +GGK L   AS  A      G  ++DSGT  T L
Sbjct: 263 TPMYQPALPLPYFDRVAYSVQLLGIRVGGKHLPIPASVLAPDHTGAGQTMVDSGTQFTFL 322

Query: 367 PPSIYSALKAEFLKQFSGFPSA---PGFSI---LDTCFNLS---AYQEVNIPLVKMEFEG 417
               YSALKAEF +Q      A   P F+     DTCF +    +     +P V + F G
Sbjct: 323 LGDAYSALKAEFTRQARPLLPALDDPSFAFQEAFDTCFRVPQGRSPPTARLPGVTLLFNG 382

Query: 418 NAEMTVDVTGIVYFVKSDASQ----VCLALASLSYEDETG-IIGNYQQKNQRVIYDTKNS 472
            AEM V    ++Y V  +        CL   +         +IG++ Q N  V YD +  
Sbjct: 383 -AEMAVAGDRLLYKVPGERRGGDGVWCLTFGNADMVPIMAYVIGHHHQMNVWVEYDLERG 441

Query: 473 QLGFAGEDC 481
           ++G A   C
Sbjct: 442 RVGLAPVRC 450


>gi|242051593|ref|XP_002454942.1| hypothetical protein SORBIDRAFT_03g001790 [Sorghum bicolor]
 gi|241926917|gb|EES00062.1| hypothetical protein SORBIDRAFT_03g001790 [Sorghum bicolor]
          Length = 431

 Score =  122 bits (307), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 117/397 (29%), Positives = 178/397 (44%), Gaps = 50/397 (12%)

Query: 104 SRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELGGRNM--TVIVDTGSDLTWVQC 161
           +R++  ++G++       +PL    R+    Y  TI +G      T+I DT SDLTW QC
Sbjct: 69  ARLEARLTGDM------SVPLA---RISDEGYTVTIGIGTPPQLHTLIADTASDLTWTQC 119

Query: 162 QPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGV--CSSSSPPDCNYFVS 219
                   Q +P+FDP+ S S+  V C+S  C        N G   CS+ +   C Y   
Sbjct: 120 NLFNDTAKQVEPLFDPAKSSSFAFVTCSSKLC-----TEDNPGTKRCSNKT---CRYVYP 171

Query: 220 YGDGSYTRGELGREHLGLGKASVN---DFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQ 276
           Y       G L  E   L   + +    F FGCG    G   G SG++G+  + LS+VSQ
Sbjct: 172 YVSVE-AAGVLAYESFTLSDNNQHICMSFGFGCGALTDGNLLGASGILGMSPAILSMVSQ 230

Query: 277 TSEIFGGLFSYCLPSTQDAGASGSLILGGNSSV--FKNSTPITYTNMIPNPQLATFYILN 334
            +      FSYCL    D  +S  L  G  + +  +K + PI          L  +Y + 
Sbjct: 231 LAI---PKFSYCLTPYTDRKSS-PLFFGAWADLGRYKTTGPI-------QKSLTFYYYVP 279

Query: 335 LTGISIGGKQLQ--ASGFA--KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPG 390
           L G+S+G ++L   A+ FA  +GG ++D G  + +L    ++ALK   L   +   +   
Sbjct: 280 LVGLSLGTRRLDVPAATFALKQGGTVVDLGCTVGQLAEPAFTALKEAVLHTLNLPLTNRT 339

Query: 391 FSILDTCFNLS---AYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLS 447
                 CF L    A   V  P + + F+G A+M +      YF +  A  +CLAL    
Sbjct: 340 VKDYKVCFALPSGVAMGAVQTPPLVLYFDGGADMVLPRDN--YFQEPTAGLMCLALVP-- 395

Query: 448 YEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
                 IIGN QQ+N  +++D  +S+  FA   C  +
Sbjct: 396 -GGGMSIIGNVQQQNFHLLFDVHDSKFLFAPTICDDI 431


>gi|413944596|gb|AFW77245.1| hypothetical protein ZEAMMB73_545774 [Zea mays]
 gi|414876929|tpg|DAA54060.1| TPA: hypothetical protein ZEAMMB73_875469 [Zea mays]
          Length = 459

 Score =  122 bits (306), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 116/351 (33%), Positives = 168/351 (47%), Gaps = 34/351 (9%)

Query: 144 RNMTVIVDTGSDLTWVQC--QPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATG 201
           + +T + DTGSDL W +C      SC  Q  P + P+ S ++ K+ C+   C  L     
Sbjct: 102 QKLTALADTGSDLIWAKCGGACTTSCEPQGSPSYLPNASSTFAKLPCSDRLCSLLR---S 158

Query: 202 NSGVCSSSSPPDCNYFVSYG----DGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLF 257
           +S    +++  +C+Y  SYG    D  YT+G L RE   LG  +V    FGC   ++G +
Sbjct: 159 DSVAWCAAAGAECDYRYSYGLGDDDHHYTQGFLARETFTLGADAVPSVRFGCTTASEGGY 218

Query: 258 GGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPIT 317
           G  SGL+GLGR  LSLVSQ   +    F YCL  T DA  +  L+ G  +S+      + 
Sbjct: 219 GSGSGLVGLGRGPLSLVSQ---LNASTFMYCL--TSDASKASPLLFGSLASL--TGAQVQ 271

Query: 318 YTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGGILIDSGTVITRLPPSIYSALKAE 377
            T ++ +    TFY +NL  ISIG       G  + G++ DSGT +T L    YS  KA 
Sbjct: 272 STGLLAS---TTFYAVNLRSISIGSATTPGVGEPE-GVVFDSGTTLTYLAEPAYSEAKAA 327

Query: 378 FLKQFS--GFPSAPGFSILDTCFNLSAYQEVN---IPLVKMEFEGNAEMTVDVTGIVYFV 432
           FL Q S        GF   + CF   A   ++   +P + + F+G A+M + V    Y V
Sbjct: 328 FLSQTSLDQVEDTDGF---EACFQKPANGRLSNAAVPTMVLHFDG-ADMALPVAN--YVV 381

Query: 433 KSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSS 483
           + +   VC  +          IIGN  Q N  V++D   S L F   +C +
Sbjct: 382 EVEDGVVCWIVQR---SPSLSIIGNIMQVNYLVLHDVHRSVLSFQPANCDT 429


>gi|220702733|gb|ACL81165.1| aspartyl protease [Mirabilis jalapa]
          Length = 499

 Score =  122 bits (306), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 114/419 (27%), Positives = 179/419 (42%), Gaps = 68/419 (16%)

Query: 122 IPLTSGIRLQTLNYIATIELGGRNMTVIVDTGSDLTWVQCQP--CKSCYNQQDP-VFDPS 178
           +PL+ G      +Y  T  +  + ++V +DTGSD+ W  C P  C  C  + +P    P 
Sbjct: 86  LPLSPGT-----DYTLTFSINSQTLSVYMDTGSDIVWFPCSPFECILCEGKFEPGTLTPL 140

Query: 179 ISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPP-------DCN------YFVSYGDGSY 225
                  + C S  C     +   S +C+ +  P       DC+      ++ +YGDGS 
Sbjct: 141 NVSKSSLISCKSRACSTAHNSPSTSDLCAIAKCPLDEIETSDCSNYHCPSFYYAYGDGSL 200

Query: 226 TRGELGREHLGLGKAS-----VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEI 280
              +L + +L +   S     + DF FGC  +     G   G+ G G   LSL +Q + +
Sbjct: 201 I-AKLHKHNLIMPSTSNKPFSLKDFTFGCAHS---ALGEPIGVAGFGFGSLSLPAQLANL 256

Query: 281 ---FGGLFSYCLPS----TQDAGASGSLILGG-NSSVFKNSTPITYTNMIPNPQLATFYI 332
               G  FSYCL S    +        LILG      F   T   YT M+ NP+   FY 
Sbjct: 257 SPDLGNQFSYCLVSHSFDSTKLHHPSPLILGKVKERDFDEITQFVYTPMLDNPKHPYFYS 316

Query: 333 LNLTGISIGGKQLQASGF-------AKGGILIDSGTVITRLPPSIYSALKAEF------- 378
           +++  IS+G  +++A            GG+++DSGT  T LP   Y+++  E        
Sbjct: 317 VSMEAISVGSSRVRAPNALIRIDRDGNGGVVVDSGTTYTMLPTGFYNSVATELDRRVGRV 376

Query: 379 LKQFSGFPSAPGFSILDTCFNLSA----YQEVNIPLVKMEFEGNAEMTVDVTGIVY-FVK 433
            K+ S   S  G   L  C+ L         + +P +   F GN  + +      Y F+ 
Sbjct: 377 FKRASETESKTG---LSPCYYLEGNGVERLGLVVPRLAFHFGGNYSVVLPRRNYFYEFLD 433

Query: 434 SDASQV-----CLALASLSYEDETG---IIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
            +  +      CL L     E E G    +GNYQQ+  +V+YD +  ++GFA   C+S+
Sbjct: 434 GEDEKKGRKVGCLMLMDGGDESEGGPGATLGNYQQQGFQVVYDLEERRVGFAPRKCASL 492


>gi|357487593|ref|XP_003614084.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355515419|gb|AES97042.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 412

 Score =  122 bits (305), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 107/376 (28%), Positives = 157/376 (41%), Gaps = 58/376 (15%)

Query: 121 EIPLTSGIRLQTLNYIATIELGGRNMTV--IVDTGSDLTWVQCQPCKSCYNQQDPVFDPS 178
           ++PL+S        Y+ +  +G     +  ++DTG+D  W QC+PCK C NQ  P+F PS
Sbjct: 79  DVPLSS---FMGAGYVMSYSIGTPPFQLYSLIDTGNDNIWFQCKPCKPCLNQTSPMFHPS 135

Query: 179 ISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG 238
            S +YK + C S  C                            DG Y    LG + L L 
Sbjct: 136 KSSTYKTIPCTSPICKN-------------------------ADGHY----LGVDTLTLN 166

Query: 239 K-----ASVNDFIFGCGRNNKG-LFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL-PS 291
                  S  + + GCG  N+G L G VSG +GL R  LS +SQ +   GG FSYCL P 
Sbjct: 167 SNNGTPISFKNIVIGCGHRNQGPLEGYVSGNIGLARGPLSFISQLNSSIGGKFSYCLVPL 226

Query: 292 TQDAGASGSLILGGNSSVF---KNSTPITYTNMIPNPQLATFYILNLTGISIGGK--QLQ 346
                 S  L  G  S+V      STPI   N          Y ++L   S+G    +L+
Sbjct: 227 FSKENVSSKLHFGDKSTVSGLGTVSTPIKEENG---------YFVSLEAFSVGDHIIKLE 277

Query: 347 ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEV 406
            S   +G  +IDSGT +T LP  +YS L++  L               + C+  ++   +
Sbjct: 278 NSD-NRGNSIIDSGTTMTILPKDVYSRLESVVLDMVKLKRVKDPSQQFNLCYQTTSTTLL 336

Query: 407 NIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVI 466
              L+       +E+ ++     Y +  +   +C A  S        I GN  Q+N  V 
Sbjct: 337 TKVLIITAHFSGSEVHLNALNTFYPITDEV--ICFAFVSGGNFSSLAIFGNVVQQNFLVG 394

Query: 467 YDTKNSQLGFAGEDCS 482
           +D     + F   DC+
Sbjct: 395 FDLNKKTISFKPTDCT 410


>gi|297819684|ref|XP_002877725.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297323563|gb|EFH53984.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 633

 Score =  121 bits (304), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 102/356 (28%), Positives = 168/356 (47%), Gaps = 39/356 (10%)

Query: 146 MTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGV 205
             +IVD+GS +T+V C  C+ C   QDP F P +S +Y+ V CN   C+           
Sbjct: 107 FALIVDSGSTVTYVPCSDCEQCGKHQDPKFQPELSSTYQPVKCNMD-CN----------- 154

Query: 206 CSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS---VNDFIFGCGRNNKG-LFG-GV 260
           C       C Y   Y + S ++G LG + +  G  S       +FGC     G L+    
Sbjct: 155 CDDDK-EQCVYEREYAEHSSSKGVLGEDLISFGNESQLTPQRAVFGCETVETGDLYSQRA 213

Query: 261 SGLMGLGRSDLSLVSQTSE--IFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITY 318
            G++GLG+ DLSLV Q  +  +    F  C     D G  GS+ILGG    F   + + +
Sbjct: 214 DGIIGLGQGDLSLVDQLVDKGLISNSFGLCY-GGMDVGG-GSMILGG----FDYPSDMIF 267

Query: 319 TNMIPNPQLATFYILNLTGISIGGKQLQASGFA---KGGILIDSGTVITRLPPSIYSALK 375
           T+   +P  + +Y ++LTGI + GK+L  +      + G ++DSGT    LP + ++A +
Sbjct: 268 TDS--DPDRSPYYNIDLTGIRVAGKKLSLNSRVFDGEHGAVLDSGTTYAYLPDAAFAAFE 325

Query: 376 AEFLKQFSGFP--SAPGFSILDTCFNLSAYQEVN-----IPLVKMEFEGNAEMTVDVTGI 428
              +++ S       P  +  DTCF ++A  +V+      P V+M F+      +     
Sbjct: 326 EAVMREVSPLKQIDGPDPNFKDTCFLVAASNDVSELSKIFPSVEMIFKSGQSWLLSPENY 385

Query: 429 VYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
           ++         CL +   + +D T ++G    +N  V+YD +NS++GF   +CS +
Sbjct: 386 MFRHSKVHGAYCLGVFP-NGKDHTTLLGGIVVRNTLVVYDRENSKVGFWRTNCSEL 440


>gi|356532672|ref|XP_003534895.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 449

 Score =  121 bits (304), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 119/430 (27%), Positives = 188/430 (43%), Gaps = 72/430 (16%)

Query: 85  NEQQQNRLILDNLH----VQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIE 140
           NE  ++R+ LD  H      Y+Q+RI+  +      VSN E        L     +A I 
Sbjct: 53  NETAKDRMELDIQHSAARFAYIQARIEGSL------VSNNEYKARVSPSLTGRTIMANIS 106

Query: 141 LGGRNMT--VIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEF 198
           +G   +   V++DTGSD+ WV C PC +C N    +FDPS+S ++   LC +      +F
Sbjct: 107 IGQPPIPQLVVMDTGSDILWVMCTPCTNCDNHLGLLFDPSMSSTFSP-LCKT----PCDF 161

Query: 199 ATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL-----GKASVNDFIFGCGRN- 252
                  CS   P    + V+Y D S   G  GR+ +       G + + D +FGCG N 
Sbjct: 162 KG-----CSRCDP--IPFTVTYADNSTASGMFGRDTVVFETTDEGTSRIPDVLFGCGHNI 214

Query: 253 NKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGAS-GSLILGGNSSVFK 311
            +    G +G++GL     SL ++     G  FSYC+    D   +   LILG  + +  
Sbjct: 215 GQDTDPGHNGILGLNNGPDSLATK----IGQKFSYCIGDLADPYYNYHQLILGEGADLEG 270

Query: 312 NSTPITYTNMIPNPQLATFYILNLTGISIGGKQL-------QASGFAKGGILIDSGTVIT 364
            STP    N         FY + + GIS+G K+L       +      GG++ID+G+ IT
Sbjct: 271 YSTPFEVHN--------GFYYVTMEGISVGEKRLDIAPETFEMKKNRTGGVIIDTGSTIT 322

Query: 365 RLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQE----------VNIPLVKME 414
            L  S++  L  E            G+S   T    S + +          V  P+V   
Sbjct: 323 FLVDSVHRLLSKEVRNLL-------GWSFRQTTIEKSPWMQCFYGSISRDLVGFPVVTFH 375

Query: 415 FEGNAEMTVDVTGIVYFVKSDASQVCLA---LASLSYEDETGIIGNYQQKNQRVIYDTKN 471
           F   A++ +D     +F + + +  C+    ++SL+ + +  +IG   Q++  V YD  N
Sbjct: 376 FADGADLALDSGS--FFNQLNDNVFCMTVGPVSSLNLKSKPSLIGLLAQQSYSVGYDLVN 433

Query: 472 SQLGFAGEDC 481
             + F   DC
Sbjct: 434 QFVYFQRIDC 443


>gi|326499093|dbj|BAK06037.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 471

 Score =  121 bits (304), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 118/379 (31%), Positives = 169/379 (44%), Gaps = 44/379 (11%)

Query: 133 LNYIATIELG--GRNMTVIVDTGSDLTWVQCQ-----PCKSCYNQQDPV-----FDPSIS 180
             Y+  + +G     M  I DTGSDL W+ C      P  +     D       FDPS S
Sbjct: 98  FEYLMAVNIGTPPTRMVAIADTGSDLIWLNCSYGGDGPGLAAARDADAQPPGVQFDPSKS 157

Query: 181 PSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA 240
            +++ V C+S  C  L  A+     C + S   C Y  SYGDGS+T G L  E      A
Sbjct: 158 TTFRLVDCDSVACSELPEAS-----CGADS--KCRYSYSYGDGSHTSGVLSTETFTFADA 210

Query: 241 S----------VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQ--TSEIFGGLFSYC 288
                      V +  FGC     G      GL+GLG  DLSLVSQ       G  FSYC
Sbjct: 211 PGARGDGTTTRVANVNFGCSTTFVG-SSVGDGLVGLGGGDLSLVSQLGADTSLGRRFSYC 269

Query: 289 LPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQAS 348
           L       AS +L  G  ++V         T +IP+ Q+  +YI+ L  + +G K  +A 
Sbjct: 270 L-VPYSVKASSALNFGPRAAVTDPGA--VTTPLIPS-QVKAYYIVELRSVKVGNKTFEAP 325

Query: 349 GFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQE--- 405
              +  +++DSGT +T LP ++   L  E   +    P+     +L  CF++S  +E   
Sbjct: 326 D--RSPLIVDSGTTLTFLPEALVDPLVKELTGRIKLPPAQSPERLLPLCFDVSGVREGQV 383

Query: 406 -VNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQR 464
              IP V +   G A +T+       FV+     +CLA++++S +    IIGN  Q+N  
Sbjct: 384 AAMIPDVTVGLGGGAAVTLKAENT--FVEVQEGTLCLAVSAMSEQFPASIIGNIAQQNMH 441

Query: 465 VIYDTKNSQLGFAGEDCSS 483
           V YD     + FA   C+S
Sbjct: 442 VGYDLDKGTVTFAPAACAS 460


>gi|357160409|ref|XP_003578755.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 373

 Score =  121 bits (304), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 96/349 (27%), Positives = 157/349 (44%), Gaps = 28/349 (8%)

Query: 148 VIVDTGSDLTWVQCQPC-KSCYNQQD---PVFDPSISPSYKKVLCNSSTCHALEFATGNS 203
           V +DTGS ++WVQCQ C   CY Q     P F+ S S +Y++V C++  CH +  +    
Sbjct: 38  VTIDTGSTISWVQCQYCIVHCYTQDQRAGPTFNTSSSSTYRRVGCSAQVCHDMHVSQNIP 97

Query: 204 GVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCGRNNKGLFGGVSG 262
             C       C Y + Y  G Y+ G L ++ L L  + S+  FIFGCG +N+   G  +G
Sbjct: 98  SGCVEEE-DSCIYSLRYASGEYSAGYLSQDRLTLANSYSIQKFIFGCGSDNR-YNGHSAG 155

Query: 263 LMGLGRSDLSLVSQTSEIFG-GLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNM 321
           ++G G    S  +Q +++     FSYC PS Q+    G L +G      ++S  +  T +
Sbjct: 156 IIGFGNKSYSFFNQIAQLTNYSAFSYCFPSNQE--NEGFLSIG---PYVRDSNKLILTQL 210

Query: 322 IPNPQLATFYILNLTGISIGGKQLQASG--FAKGGILIDSGTVITRLPPSIYSALKAEFL 379
                    Y L    + + G +LQ     +     ++DSGTV T +   ++ AL     
Sbjct: 211 FDYGAHLPVYALQQFDMMVNGMRLQVDPPVYTTRMTVVDSGTVETFVLSPVFRALDRALT 270

Query: 380 KQFSGFPSAPGFSILDTCF--NLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDAS 437
           K         G    + CF  N  +     +P+V+++F   + + +    + Y+  SD S
Sbjct: 271 KAMVAEGYVRGSDSKEICFHSNGDSVDWSKLPVVEIKFS-RSILKLPAENVFYYETSDGS 329

Query: 438 QVCLALASLSYEDETG-----IIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
                + S    D+ G     I+GN   ++ RV++D +    GF    C
Sbjct: 330 -----ICSTFQPDDAGVPGVQILGNRATRSFRVVFDIQQRNFGFEAGAC 373


>gi|77808087|gb|AAS48510.2| aspartic protease [Fagopyrum esculentum]
 gi|82780908|gb|ABB88696.2| aspartic proteinase-like protein [Fagopyrum esculentum]
          Length = 447

 Score =  121 bits (303), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 113/377 (29%), Positives = 178/377 (47%), Gaps = 58/377 (15%)

Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVF---DPSISPSYKK--------VLCNSST 192
           + +++++DTGS L W  C    + Y  Q+  F   DP+  P Y +        + C S  
Sbjct: 85  QKVSLVLDTGSSLVWTPCTIPTATYTCQNCTFSGVDPTKIPIYARNKSSTVQSLPCRSPK 144

Query: 193 CHALEFATGNSGVCSSSSPPDCNYF-VSYGDGSYTRGELGREHLGLGKAS-VNDFIFGCG 250
           C+   +  G+   CS++    C Y+ + YG GS T G+L  + LGL K + + DF+FGC 
Sbjct: 145 CN---WVFGSDLNCSTTK--RCPYYGLEYGLGS-TTGQLVSDVLGLSKLNRIPDFLFGCS 198

Query: 251 R-NNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGL--FSYCLPSTQ--DAGASGSLILG- 304
             +N+       G+ G GR   S+ +Q      GL  FSYCL S +  D   SG L+L  
Sbjct: 199 LVSNRQ----PEGIAGFGRGLASIPAQL-----GLTKFSYCLVSHRFDDTPQSGDLVLHR 249

Query: 305 GNSSVFKNSTPITYTNMIPNPQLATF---YILNLTGISIGGKQ-------LQASGFAKGG 354
           G       +  + Y     +P L+ +   Y ++L+ I +GGK        L  S    GG
Sbjct: 250 GRRHADAAANGVAYAPFTKSPALSPYSEYYYISLSKILVGGKDVPIPPRYLVPSKEGDGG 309

Query: 355 ILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGF---SILDTCFNLSAYQEVNIPLV 411
           +++DSG+  T +   I+  +  E  K  + +  A      S L  C+N++   EV++P +
Sbjct: 310 MIVDSGSTFTFMERIIFDPVARELEKHMTKYKRAKEIEDSSGLGPCYNITGQSEVDVPKL 369

Query: 412 KMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETG-------IIGNYQQKNQR 464
              F+G A M + +T   YF       VC+ +  L+  DE G       I+GNYQQ+N  
Sbjct: 370 TFSFKGGANMDLPLTD--YFSLVTDGVVCMTV--LTDPDEPGSTTGPAIILGNYQQQNFY 425

Query: 465 VIYDTKNSQLGFAGEDC 481
           + YD K  + GF  + C
Sbjct: 426 IEYDLKKQRFGFKPQQC 442


>gi|242050744|ref|XP_002463116.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
 gi|241926493|gb|EER99637.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
          Length = 632

 Score =  121 bits (303), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 114/387 (29%), Positives = 167/387 (43%), Gaps = 62/387 (16%)

Query: 124 LTSGIRLQTLNYIATIELGGRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSY 183
           LT+G     L YI T     +   +IVD+GS +T+V C  C+ C N QDP F P +S SY
Sbjct: 84  LTNGYYTTRL-YIGTPP---QEFALIVDSGSTVTYVPCASCEQCGNHQDPRFQPDLSSSY 139

Query: 184 KKVLCN-SSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS- 241
             V CN   TC               S    C Y   Y + S + G LG + +  G+ S 
Sbjct: 140 SPVKCNVDCTC--------------DSDKKQCTYERQYAEMSSSSGVLGEDIVSFGRESE 185

Query: 242 --VNDFIFGCGRNNKG-LFG-GVSGLMGLGRSDLSLVSQTSE--IFGGLFSYCLPSTQDA 295
                 +FGC  +  G LF     G+MGLGR  LS++ Q  E  +    FS C       
Sbjct: 186 LKPQRAVFGCENSETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDIG 245

Query: 296 GASGSLILGG----NSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGF- 350
           G  G+++LGG    +  VF +S P+           + +Y + L  I + GK L+     
Sbjct: 246 G--GAMVLGGVPAPSDMVFSHSDPLR----------SPYYNIELKEIHVAGKALRVDSRV 293

Query: 351 --AKGGILIDSGTVITRLPPSIYSAL------KAEFLKQFSGFPSAPGFSILDTCF---- 398
             +K G ++DSGT    LP   + A       K   LK+  G    P  +  D CF    
Sbjct: 294 FNSKHGTVLDSGTTYAYLPEQAFVAFKDAVTSKVHSLKKIRG----PDPNYKDICFAGAG 349

Query: 399 -NLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGN 457
            N+S   EV  P V M F    ++++     ++         CL +   + +D T ++G 
Sbjct: 350 RNVSKLHEV-FPDVDMVFGNGQKLSLTPENYLFRHSKVDGAYCLGVFQ-NGKDPTTLLGG 407

Query: 458 YQQKNQRVIYDTKNSQLGFAGEDCSSM 484
              +N  V YD  N ++GF   +CS +
Sbjct: 408 IIVRNTLVTYDRHNEKIGFWKTNCSEL 434


>gi|147866226|emb|CAN79938.1| hypothetical protein VITISV_027777 [Vitis vinifera]
          Length = 454

 Score =  121 bits (303), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 116/399 (29%), Positives = 175/399 (43%), Gaps = 76/399 (19%)

Query: 122 IPLTSGIRLQTLNYIATIELGGRNMTVIVDTGSDLTWVQCQP---CKSC-YNQQDP---V 174
           IPL+ G   QTL              +I+DTGSDL W  C     C++C ++  +P   +
Sbjct: 92  IPLSFGTPPQTL-------------PLIMDTGSDLVWFPCTHRYVCRNCSFSTSNPSSNI 138

Query: 175 FDPSISPSYKKVLCNSSTCHALEFATGNSGV---------CSSSSPPDCNYFVSYGDGSY 225
           F P  S S K + C +  C  +  +   S           C+   PP  N F+ + D  +
Sbjct: 139 FIPKSSSSSKVLGCVNPKCGWIHGSKVQSRCRDCEPTSPNCTQICPPYLN-FLRFWD--H 195

Query: 226 TRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLF 285
            R +  R  L     S    I G GR    L       +GL +                F
Sbjct: 196 RRSQFHRRMLCPLHQSTRREISGFGRGPPSL----PSQLGLKK----------------F 235

Query: 286 SYCLPSTQ--DAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLA------TFYILNLTG 337
           SYCL S +  D   S SL+L G S   + +  ++YT  + NP++A       +Y L L  
Sbjct: 236 SYCLLSRRYDDTTESSSLVLDGESDSGEKTAGLSYTPFVQNPKVAGKHAFSVYYYLGLRH 295

Query: 338 ISIGGKQLQ-------ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSG--FPSA 388
           I++GGK ++             GG +IDSGT  T +   I+  + AEF KQ         
Sbjct: 296 ITVGGKHVKIPYKYLIPGADGDGGTIIDSGTTFTYMKGEIFELVAAEFEKQVQSKRATEV 355

Query: 389 PGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALAS--L 446
            G + L  CFN+S     + P + ++F G AEM + +   V F+  D   VCL + +   
Sbjct: 356 EGITGLRPCFNISGLNTPSFPELTLKFRGGAEMELPLANYVAFLGGD-DVVCLTIVTDGA 414

Query: 447 SYEDETG----IIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
           + ++ +G    I+GN+QQ+N  V YD +N +LGF  + C
Sbjct: 415 AGKEFSGGPAIILGNFQQQNFYVEYDLRNERLGFRQQSC 453


>gi|449511696|ref|XP_004164029.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 639

 Score =  121 bits (303), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 121/422 (28%), Positives = 189/422 (44%), Gaps = 65/422 (15%)

Query: 90  NRLILDNLH-VQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELGG--RNM 146
           +R +LD  H +++LQ+ +K   S N +   + ++ LT+G       Y   + +G   +  
Sbjct: 51  HRRVLDRDHRLRHLQNLVKPH-SSNARMRLHDDL-LTNGY------YTTRLWIGSPPQEF 102

Query: 147 TVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVC 206
            +IVDTGS +T+V C  C  C N QDP F P +S +Y+ V CN+  C+  E     +GV 
Sbjct: 103 ALIVDTGSTVTYVPCSNCVQCGNHQDPRFQPELSSTYQPVKCNAD-CNCDE-----NGV- 155

Query: 207 SSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS---VNDFIFGCGRNNKGLF--GGVS 261
                  C Y   Y + S + G L  + +  GK S       +FGC     G        
Sbjct: 156 ------QCTYERRYAEMSTSSGVLAEDVMSFGKESELVPQRAVFGCETMESGDLYTQRAD 209

Query: 262 GLMGLGRSDLSLVSQT--SEIFGGLFSYCLPSTQDAGASGSLILGGNSS----VFKNSTP 315
           G+MGLGR  LS++ Q     +    FS C     D G  G+++LGG SS    VF +S  
Sbjct: 210 GIMGLGRGTLSVMDQLVGKGVVSNSFSLCY-GGMDVGG-GAMVLGGISSPPGMVFSHS-- 265

Query: 316 ITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA---KGGILIDSGTVITRLPPSIYS 372
                   +P  + +Y + L  I + GK L+ +      K G ++DSGT     P   Y 
Sbjct: 266 --------DPSRSPYYNIELKEIHVAGKPLKLNPRTFDGKYGAILDSGTTYAYFPEKAYY 317

Query: 373 AL------KAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPL----VKMEFEGNAEMT 422
           A       K  FLKQ SG    P  +  D CF+ +      +P     V M F    +++
Sbjct: 318 AFKDAIMKKISFLKQISG----PDPNFKDICFSGAGRDVTELPKVFPEVDMVFANGQKIS 373

Query: 423 VDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
           +     ++     +   CL +   +  D+T ++G    +N  V Y+ +NS +GF   +CS
Sbjct: 374 LSPENYLFRHTKVSGAYCLGIFK-NGNDQTTLLGGIIVRNTLVTYNRENSTIGFWKTNCS 432

Query: 483 SM 484
            +
Sbjct: 433 EL 434


>gi|226532674|ref|NP_001151415.1| pepsin A precursor [Zea mays]
 gi|195646632|gb|ACG42784.1| pepsin A [Zea mays]
          Length = 492

 Score =  121 bits (303), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 117/421 (27%), Positives = 190/421 (45%), Gaps = 64/421 (15%)

Query: 102 LQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLN--YIATIELGG--RNMTVIVDTGSDLT 157
           L + +++ +  N + +   ++PL  G+ L T    Y   IE+G   +   V VDTGSD+ 
Sbjct: 51  LAALLRHDMGRNGRLLGAVDLPL-GGVGLPTATGLYYTRIEIGSPPKGYYVQVDTGSDIL 109

Query: 158 WVQCQPCKSCYNQ-----QDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPP 212
           WV    C  C  +     +   +DP+ S +   V C    C A   A+G    C S++ P
Sbjct: 110 WVNGISCDGCPTRSGLGIELTQYDPAGSGT--TVGCEQEFCVANSAASGVPPACPSAASP 167

Query: 213 DCNYFVSYGDGSYTRGELGREHLGLGKASVN--------DFIFGCGRNNKGLFG----GV 260
            C + ++YGDGS T G    + +   + S N           FGCG    G  G     +
Sbjct: 168 -CQFRITYGDGSSTTGFYVTDFVQYNQVSGNGQTTPSNVSITFGCGAQLGGDLGSSSQAL 226

Query: 261 SGLMGLGRSDLSLVSQ--TSEIFGGLFSYCLPSTQDAG--ASGSLILGGNSSVFKNSTPI 316
            G++G G+SD S++SQ   +     +F++CL + +  G  A G+++            PI
Sbjct: 227 DGILGFGQSDASMLSQLAAARKVRKIFAHCLDTVRGGGIFAIGNVV----------QPPI 276

Query: 317 TYTN-MIPNPQLATFYILNLTGISIGGKQLQ--ASGFAKG---GILIDSGTVITRLPPSI 370
             T  ++PN   AT Y +NL GIS+GG  LQ   S F  G   G +IDSGT +  LP  +
Sbjct: 277 VKTTPLVPN---ATHYNVNLQGISVGGATLQLPTSTFDSGDSKGTIIDSGTTLAYLPREV 333

Query: 371 YSALKAEFLKQFSGFPSAPGFSILD----TCFNLSAYQEVNIPLVKMEFEGNAEMTVDVT 426
           Y  L          F   P  ++ +     CF  S   +   P++   FEG  ++T++V 
Sbjct: 334 YRTLLTAV------FDKHPDLAVRNYEDFICFQFSGSLDEEFPVITFSFEG--DLTLNVY 385

Query: 427 GIVYFVKSDASQVCLALASLSYEDETG----IIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
              Y  ++     C+       + + G    ++G+    N+ V+YD +   +G+   +CS
Sbjct: 386 PHDYLFQNGNDLYCMGFLDGGVQTKDGKDMVLLGDLVLSNKLVVYDLEKQVIGWTDYNCS 445

Query: 483 S 483
           S
Sbjct: 446 S 446


>gi|357118734|ref|XP_003561105.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 404

 Score =  121 bits (303), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 81/242 (33%), Positives = 119/242 (49%), Gaps = 21/242 (8%)

Query: 247 FGCGRNNKGLFGG-VSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAG--ASGSLIL 303
           FGC  + +G F G  SG M LG    SL SQT+  +G  FSYC+P    +G  + G  I 
Sbjct: 177 FGCSHSVRGRFSGQTSGTMSLGGGRQSLRSQTASAYGDAFSYCVPQPSASGFLSLGGAIG 236

Query: 304 GGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQA-SGFAKGGILIDSGTV 362
              S     STP+  T    NP   TFY++ L GI + G++L         G L+DS  V
Sbjct: 237 SSGSGSGFASTPLVATA---NP---TFYVVRLQGIDVAGRRLNVPPAVFSAGTLMDSSAV 290

Query: 363 ITRLPPSIYSALKAEF---LKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNA 419
           +T+LPP+ Y AL+  F   ++++   P+  G  ILDTC++      V +P V + F G A
Sbjct: 291 VTQLPPTAYRALRRAFRNAMRRYRRVPAG-GKQILDTCYDFEGLGNVTVPAVSLVFSGGA 349

Query: 420 EMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGE 479
            + ++   ++        + CLA      + + G IGN QQ+   V+YD     +GF   
Sbjct: 350 VVRLEPMAVMM-------EGCLAFVPTPADSDLGFIGNVQQQTHEVLYDVGARNVGFRRG 402

Query: 480 DC 481
            C
Sbjct: 403 AC 404


>gi|218184943|gb|EEC67370.1| hypothetical protein OsI_34481 [Oryza sativa Indica Group]
          Length = 367

 Score =  121 bits (303), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 92/353 (26%), Positives = 155/353 (43%), Gaps = 21/353 (5%)

Query: 138 TIELGGRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALE 197
           TI    +  +  +D   +L W QC  C  C+ Q  PVF P+ S ++K   C +  C ++ 
Sbjct: 29  TIGTPPQAASAFIDLTGELVWTQCSQCIHCFKQDLPVFVPNASSTFKPEPCGTDVCKSIP 88

Query: 198 FATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGC-GRNNKGL 256
                S VC+             G G +T G +  +   +G A+     FGC   ++   
Sbjct: 89  TPKCASDVCAFDG--------VTGLGGHTVGIVATDTFAIGTAAPASLGFGCVVASDIDT 140

Query: 257 FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPI 316
            GG SG +GLGR+  SLV+Q        FSYCL +  D G +  L LG ++ +       
Sbjct: 141 MGGPSGFIGLGRTPWSLVAQMKLT---RFSYCL-APHDTGKNSRLFLGASAKLAGGGAWT 196

Query: 317 TYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGGILIDSGTV-ITRLPPSIYSALK 375
            +    PN  ++ +Y + L  I  G   +      +  +L+ +  V ++ L  S+Y   K
Sbjct: 197 PFVKTSPNDGMSQYYPIELEEIKAGDATITMPR-GRNTVLVQTAVVRVSLLVDSVYQEFK 255

Query: 376 AEFLKQFSGFPSA-PGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKS 434
              +      P+A P     + CF  +       P +   F+  A +TV     ++ V +
Sbjct: 256 KAVMASVGAAPTATPVGEPFEVCFPKAGVS--GAPDLVFTFQAGAALTVPPANYLFDVGN 313

Query: 435 DA---SQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
           D    S + +AL +++  D   I+G++QQ+N  +++D     L F   DCSS+
Sbjct: 314 DTVCLSVMSIALLNITALDGLNILGSFQQENVHLLFDLDKDMLSFEPADCSSL 366


>gi|225465839|ref|XP_002264668.1| PREDICTED: aspartic proteinase nepenthesin-1-like isoform 2 [Vitis
           vinifera]
          Length = 451

 Score =  121 bits (303), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 120/422 (28%), Positives = 187/422 (44%), Gaps = 48/422 (11%)

Query: 82  VDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRL-QTLNYIATIE 140
           + W E        D   +Q+L S +             + +P+ SG ++ Q   YI   +
Sbjct: 57  LSWEESVLQMQAKDKARLQFLSSLVAR----------KSVVPIASGRQIVQNPTYIVRAK 106

Query: 141 LG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEF 198
           +G   + M + +DT SD+ W+ C  C  C +    +F+   S +YK + C ++ C  +  
Sbjct: 107 IGTPAQTMLMAMDTSSDVAWIPCNGCLGCSST---LFNSPASTTYKSLGCQAAQCKQVLH 163

Query: 199 ATGNSGVCSSSSPPD------CNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRN 252
                    S  P        C++ ++YG GS     L ++ + L   +V  + FGC + 
Sbjct: 164 LLSPLLTSPSVVPKPTCGGGVCSFNLTYG-GSSLAANLSQDTITLATDAVPGYSFGCIQK 222

Query: 253 NKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKN 312
             G      GL+GLGR  LSL+SQT  ++   FSYCLPS +    SGSL LG      + 
Sbjct: 223 ATGGSLPAQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPSFKSLNFSGSLRLGPVGQPKR- 281

Query: 313 STPITYTNMIPNPQLATFYILNLTGISI---------GGKQLQASGFAKGGILIDSGTVI 363
              I YT ++ NP+  + Y +NL  + +         G      S  A  G + DSGTV 
Sbjct: 282 ---IKYTPLLKNPRRPSLYFVNLMAVRVGRRVVDVPPGSFTFNPSTGA--GTIFDSGTVF 336

Query: 364 TRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTV 423
           TRL    Y A++  F  +     +       DTC+ +     +  P +   F G   M V
Sbjct: 337 TRLVTPAYIAVRDAFRNRVGRNLTVTSLGGFDTCYTV----PIAAPTITFMFTG---MNV 389

Query: 424 DVTGIVYFVKSDA-SQVCLALASL--SYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGED 480
            +      + S A S  CLA+A+   +      +I N QQ+N R++YD  NS+LG A E 
Sbjct: 390 TLPPDNLLIHSTAGSTTCLAMAAAPDNVNSVLNVIANLQQQNHRLLYDVPNSRLGVAREL 449

Query: 481 CS 482
           C+
Sbjct: 450 CT 451


>gi|449447285|ref|XP_004141399.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 609

 Score =  121 bits (303), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 120/422 (28%), Positives = 188/422 (44%), Gaps = 65/422 (15%)

Query: 90  NRLILDNLH-VQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELGG--RNM 146
           +R +LD  H +++LQ+ +K   S N +   + ++ LT+G       Y   + +G   +  
Sbjct: 51  HRRVLDRDHRLRHLQNLVKPH-SSNARMRLHDDL-LTNGY------YTTRLWIGSPPQEF 102

Query: 147 TVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVC 206
            +IVDTGS +T+V C  C  C N QDP F P +S +Y+ V CN+  C+  E     +GV 
Sbjct: 103 ALIVDTGSTVTYVPCSNCVQCGNHQDPRFQPELSSTYQPVKCNAD-CNCDE-----NGV- 155

Query: 207 SSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS---VNDFIFGCGRNNKGLF--GGVS 261
                  C Y   Y + S + G L  + +  GK S       +FGC     G        
Sbjct: 156 ------QCTYERRYAEMSTSSGVLAEDVMSFGKESELVPQRAVFGCETMESGDLYTQRAD 209

Query: 262 GLMGLGRSDLSLVSQT--SEIFGGLFSYCLPSTQDAGASGSLILGGNSS----VFKNSTP 315
           G+MGLGR  LS++ Q     +    FS C       G  G+++LGG SS    VF +S  
Sbjct: 210 GIMGLGRGTLSVMDQLVGKGVVSNSFSLCYGGMDVGG--GAMVLGGISSPPGMVFSHS-- 265

Query: 316 ITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA---KGGILIDSGTVITRLPPSIYS 372
                   +P  + +Y + L  I + GK L+ +      K G ++DSGT     P   Y 
Sbjct: 266 --------DPSRSPYYNIELKEIHVAGKPLKLNPRTFDGKYGAILDSGTTYAYFPEKAYY 317

Query: 373 AL------KAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPL----VKMEFEGNAEMT 422
           A       K  FLKQ SG    P  +  D CF+ +      +P     V M F    +++
Sbjct: 318 AFKDAIMKKISFLKQISG----PDPNFKDICFSGAGRDVTELPKVFPEVDMVFANGQKIS 373

Query: 423 VDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
           +     ++     +   CL +   +  D+T ++G    +N  V Y+ +NS +GF   +CS
Sbjct: 374 LSPENYLFRHTKVSGAYCLGIFK-NGNDQTTLLGGIIVRNTLVTYNRENSTIGFWKTNCS 432

Query: 483 SM 484
            +
Sbjct: 433 EL 434


>gi|359492825|ref|XP_002284255.2| PREDICTED: aspartic proteinase-like protein 1-like [Vitis vinifera]
          Length = 531

 Score =  120 bits (302), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 111/373 (29%), Positives = 166/373 (44%), Gaps = 50/373 (13%)

Query: 139 IELGGRNMTVIV--DTGSDLTWVQCQPCKSCYNQQDPVFD----------PSISPSYKKV 186
           I++G  N++ +V  D GSDL WV C  C  C       +D          PS+S + K +
Sbjct: 107 IDIGTPNVSFLVALDAGSDLLWVPCD-CMQCAPLSASYYDRLGRDLNEYSPSLSSTSKPL 165

Query: 187 LCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSY-GDGSYTRGELGREHLGLGKASVN-- 243
            CN   C   E  +     C SS  P C Y  SY  + + + G L  + L L   S +  
Sbjct: 166 SCNDQLC---ELGSD----CKSSKDP-CPYLASYYSENTSSSGLLIEDRLHLAPFSEHAS 217

Query: 244 ------DFIFGCGRNNKGLF---GGVSGLMGLGRSDLSLVSQTSE--IFGGLFSYCLPST 292
                   I GCGR   G F       GLMGLG  DLS+ S  ++  +    FS C    
Sbjct: 218 RSSVWASVIIGCGRKQSGAFSDGAAPDGLMGLGPGDLSVPSLLAKAGLVRNTFSICF--- 274

Query: 293 QDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAK 352
            D   SG+++ G    V + ST     + +P       Y++ + G  +G   L+ +GF  
Sbjct: 275 -DDNHSGTILFGDQGLVTQKST-----SFVPLEGKFVTYLIEVEGYLVGSSSLKTAGFQA 328

Query: 353 GGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVK 412
              L+DSGT  T LP  IY  +  EF KQ +   S+   S    C+N S+ + +NIP V 
Sbjct: 329 ---LVDSGTSFTFLPYEIYEKIVVEFDKQVNATRSSFKGSPWKYCYNSSSQELLNIPTVT 385

Query: 413 MEFEGNAEMTVDVTGIVYFVKSDASQV-CLALASLSYEDETGIIGNYQQKNQRVIYDTKN 471
           + F  N    V    I    +++   V CL +  +   +E GIIG       R+++D +N
Sbjct: 386 LVFAMNQSFIVHNPVIKLISENEEFNVFCLPIQPI--HEEFGIIGQNFMWGYRMVFDREN 443

Query: 472 SQLGFAGEDCSSM 484
            +LG++  +C  +
Sbjct: 444 LKLGWSTSNCQDI 456


>gi|242086414|ref|XP_002443632.1| hypothetical protein SORBIDRAFT_08g022630 [Sorghum bicolor]
 gi|241944325|gb|EES17470.1| hypothetical protein SORBIDRAFT_08g022630 [Sorghum bicolor]
          Length = 556

 Score =  120 bits (302), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 116/381 (30%), Positives = 175/381 (45%), Gaps = 49/381 (12%)

Query: 132 TLNYIATIELGG--RNMTVIVDTGS-DLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLC 188
           TL+Y   +  G   +   V +DT S   + ++C+PC S     DP FD S+S ++  VLC
Sbjct: 194 TLDYSVLVSYGTPEQQFPVFLDTSSVGASMIRCKPCASGSVDCDPAFDTSLSSTFNHVLC 253

Query: 189 NSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYT--RGELGREHLGLGKAS-VNDF 245
            S  C            CS     D + F    DG+Y+   G    + L L  ++ +NDF
Sbjct: 254 GSPDCPT---------NCSGDG--DGDSFCPL-DGTYSVINGTFVEDVLTLAPSTAINDF 301

Query: 246 IFGCGRNNK-GLFGGVSGLMGLGRS--------DLSLVSQTSEIFGGLFSYCLPSTQDAG 296
            F C   +K  +     G + L R           S  S         FSYCLP  + + 
Sbjct: 302 KFVCLDVHKPDVLQTAVGTLDLSRDRNSLPSQLSSSSSSSGQASAAAAFSYCLP--KSSS 359

Query: 297 ASGSLILGGNSSVFKNSTPITYTNMIP--NPQLATFYILNLTGISIGGKQLQ--ASGFAK 352
           + G L LG N++V K+     +  ++   NP+LA+ Y ++L GIS+G + L   A  F  
Sbjct: 360 SQGFLSLGINATV-KDDNATAHATLVSSGNPELASMYFIDLVGISLGDEDLSIPAGTFGN 418

Query: 353 GGILIDSGTVITRLPPSIYSALKAEFLKQFS--GFPSAP-----GFSILDTCFNLSAYQE 405
               +D GT  T L P  Y+AL+  F +Q S   F S+P     GF   DTCFN +   +
Sbjct: 419 RSTNLDVGTTFTILAPDAYTALRESFKRQMSQYNFSSSPTDIAGGF---DTCFNFTDLND 475

Query: 406 VNIPLVKMEFEGNAEMTVDVTGIVYF-VKSDA---SQVCLALASLSYEDE-TGIIGNYQQ 460
           + IP V+++F     + +D   ++Y+   +DA   +  CLA +SL   D    +IG+Y  
Sbjct: 476 LVIPNVQLKFSNGDMLVIDADQMLYYDDDTDAAPFTMACLAFSSLDAGDSFAAVIGSYTL 535

Query: 461 KNQRVIYDTKNSQLGFAGEDC 481
               V+YD    Q+GF    C
Sbjct: 536 ATTEVVYDVAGGQVGFIPWSC 556


>gi|255571588|ref|XP_002526740.1| aspartic-type endopeptidase, putative [Ricinus communis]
 gi|223533929|gb|EEF35654.1| aspartic-type endopeptidase, putative [Ricinus communis]
          Length = 471

 Score =  120 bits (302), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 118/402 (29%), Positives = 179/402 (44%), Gaps = 48/402 (11%)

Query: 105 RIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELGGRNMT--VIVDTGSDLTWVQC- 161
           RI+ + S  I   SN+     S I +    Y+    +G   +    I DTGS++ W+QC 
Sbjct: 81  RIRKIRSSGI---SNSRKYPVSRISIIDKVYVMKFNIGSPPVETYAIPDTGSNIVWIQCG 137

Query: 162 QP-CKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSY 220
            P C +CY Q+ P+F+P+ S +Y   LC    C    +  G    C SS    C Y +SY
Sbjct: 138 SPICTNCYKQKIPLFNPTKSSTYAIRLCGHRECKQALWGLGEYLGCKSSVQV-CRYHISY 196

Query: 221 GDGSYTRGELGR------EHLG-LGKASVNDFIFGCGRNNKGLFG------GVSGLMGLG 267
            D S++ G +        EH+   G  S+  F FGCG NN    G         G++GLG
Sbjct: 197 EDHSFSEGTISTDIITFPEHIAEFGNYSLRMF-FGCGYNNSETPGQDPNSFTAPGVVGLG 255

Query: 268 RSDLSLVSQTSEIFGGLFSYCL--PSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNP 325
               SLV Q +    G FSYC+  P  Q    +  +  G  +S+  +ST +         
Sbjct: 256 NEMASLVGQLTL---GQFSYCISTPDVQKPNGTIEIRFGLAASISGHSTALA-------N 305

Query: 326 QLATFYIL-NLTGISIGGKQLQASG-----FAKGGI---LIDSGTVITRLPPSIYSALKA 376
            L  +YI  N+ GI +   +++        FA+GGI   ++DSGT  T L  S   AL  
Sbjct: 306 NLEGWYIFQNVDGIYVDDTKVKGYPEWVFQFAEGGIGGLIMDSGTTYTELYFSALDALIG 365

Query: 377 EFLKQFSGFPSAPGF--SILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKS 434
           E  +Q    P       S    C+N + +    +P ++++F  N E     T    ++ +
Sbjct: 366 ELKEQIELAPDTQDHSNSNYSLCYNAANFLLTYVPAIELKFTDNKEAYFPFTLRNAWIDN 425

Query: 435 DASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGF 476
              Q CLA+   S      IIG YQ ++ ++ YD K + + F
Sbjct: 426 GNDQYCLAMFGTS---GISIIGIYQHRDIKIGYDLKYNLVSF 464


>gi|302141912|emb|CBI19115.3| unnamed protein product [Vitis vinifera]
          Length = 521

 Score =  120 bits (302), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 111/373 (29%), Positives = 166/373 (44%), Gaps = 50/373 (13%)

Query: 139 IELGGRNMTVIV--DTGSDLTWVQCQPCKSCYNQQDPVFD----------PSISPSYKKV 186
           I++G  N++ +V  D GSDL WV C  C  C       +D          PS+S + K +
Sbjct: 97  IDIGTPNVSFLVALDAGSDLLWVPCD-CMQCAPLSASYYDRLGRDLNEYSPSLSSTSKPL 155

Query: 187 LCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSY-GDGSYTRGELGREHLGLGKASVN-- 243
            CN   C   E  +     C SS  P C Y  SY  + + + G L  + L L   S +  
Sbjct: 156 SCNDQLC---ELGSD----CKSSKDP-CPYLASYYSENTSSSGLLIEDRLHLAPFSEHAS 207

Query: 244 ------DFIFGCGRNNKGLF---GGVSGLMGLGRSDLSLVSQTSE--IFGGLFSYCLPST 292
                   I GCGR   G F       GLMGLG  DLS+ S  ++  +    FS C    
Sbjct: 208 RSSVWASVIIGCGRKQSGAFSDGAAPDGLMGLGPGDLSVPSLLAKAGLVRNTFSICF--- 264

Query: 293 QDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAK 352
            D   SG+++ G    V + ST     + +P       Y++ + G  +G   L+ +GF  
Sbjct: 265 -DDNHSGTILFGDQGLVTQKST-----SFVPLEGKFVTYLIEVEGYLVGSSSLKTAGFQA 318

Query: 353 GGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVK 412
              L+DSGT  T LP  IY  +  EF KQ +   S+   S    C+N S+ + +NIP V 
Sbjct: 319 ---LVDSGTSFTFLPYEIYEKIVVEFDKQVNATRSSFKGSPWKYCYNSSSQELLNIPTVT 375

Query: 413 MEFEGNAEMTVDVTGIVYFVKSDASQV-CLALASLSYEDETGIIGNYQQKNQRVIYDTKN 471
           + F  N    V    I    +++   V CL +  +   +E GIIG       R+++D +N
Sbjct: 376 LVFAMNQSFIVHNPVIKLISENEEFNVFCLPIQPI--HEEFGIIGQNFMWGYRMVFDREN 433

Query: 472 SQLGFAGEDCSSM 484
            +LG++  +C  +
Sbjct: 434 LKLGWSTSNCQDI 446


>gi|18409620|ref|NP_566966.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|13430562|gb|AAK25903.1|AF360193_1 unknown protein [Arabidopsis thaliana]
 gi|4886277|emb|CAB43423.1| putative protein [Arabidopsis thaliana]
 gi|14532764|gb|AAK64083.1| unknown protein [Arabidopsis thaliana]
 gi|15450892|gb|AAK96717.1| Unknown protein [Arabidopsis thaliana]
 gi|30387567|gb|AAP31949.1| At3g52500 [Arabidopsis thaliana]
 gi|332645431|gb|AEE78952.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 469

 Score =  120 bits (302), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 118/393 (30%), Positives = 177/393 (45%), Gaps = 59/393 (15%)

Query: 135 YIATIELGGRNMTV--IVDTGSDLTWVQCQP---CKSC-YNQQDPVFDPSISP----SYK 184
           Y  ++  G  + T+  + DTGS L W+ C     C  C ++  DP   P   P    S K
Sbjct: 90  YSVSLSFGTPSQTIPFVFDTGSSLVWLPCTSRYLCSGCDFSGLDPTLIPRFIPKNSSSSK 149

Query: 185 KVLCNSSTCHAL-------EFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL 237
            + C S  C  L            N+  C+   PP   Y + YG GS T G L  E L  
Sbjct: 150 IIGCQSPKCQFLYGPNVQCRGCDPNTRNCTVGCPP---YILQYGLGS-TAGVLITEKLDF 205

Query: 238 GKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQ--DA 295
              +V DF+ GC   +       +G+ G GR  +SL SQ +      FS+CL S +  D 
Sbjct: 206 PDLTVPDFVVGCSIISTRQ---PAGIAGFGRGPVSLPSQMNL---KRFSHCLVSRRFDDT 259

Query: 296 GASGSLIL----GGNSSVFKNSTP-ITYTNMIPNPQLAT-----FYILNLTGISIGGKQL 345
             +  L L    G NS    + TP +TYT    NP ++      +Y LNL  I +G K +
Sbjct: 260 NVTTDLDLDTGSGHNSG---SKTPGLTYTPFRKNPNVSNKAFLEYYYLNLRRIYVGRKHV 316

Query: 346 Q------ASGF-AKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI---LD 395
           +      A G    GG ++DSG+  T +   ++  +  EF  Q S +           L 
Sbjct: 317 KIPYKYLAPGTNGDGGSIVDSGSTFTFMERPVFELVAEEFASQMSNYTREKDLEKETGLG 376

Query: 396 TCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETG-- 453
            CFN+S   +V +P +  EF+G A++ + ++    FV  +   VCL + S    + +G  
Sbjct: 377 PCFNISGKGDVTVPELIFEFKGGAKLELPLSNYFTFV-GNTDTVCLTVVSDKTVNPSGGT 435

Query: 454 ----IIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
               I+G++QQ+N  V YD +N + GFA + CS
Sbjct: 436 GPAIILGSFQQQNYLVEYDLENDRFGFAKKKCS 468


>gi|18400416|ref|NP_565559.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|20197296|gb|AAM15014.1| predicted protein [Arabidopsis thaliana]
 gi|330252412|gb|AEC07506.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 458

 Score =  120 bits (301), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 101/360 (28%), Positives = 159/360 (44%), Gaps = 44/360 (12%)

Query: 149 IVDTGSDLTWVQCQPCKSCYNQQ--DPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVC 206
           I+DTGS L W+QCQPCK C +     PVF+P++S ++ +  C+   C         +G C
Sbjct: 112 IMDTGSSLLWIQCQPCKHCSSDHMIHPVFNPALSSTFVECSCDDRFCRY-----APNGHC 166

Query: 207 SSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFI-----FGCG-RNNKGLFGGV 260
            SS+   C Y   Y  G+ ++G L +E L     + N  +     FGCG  N + L    
Sbjct: 167 GSSN--KCVYEQVYISGTGSKGVLAKERLTFTTPNGNTVVTQPIAFGCGYENGEQLESHF 224

Query: 261 SGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAG-ASGSLILGGNSSVFKNSTPITYT 319
           +G++GLG    SL  Q     G  FSYC+    +       L+LG ++ +  + TPI + 
Sbjct: 225 TGILGLGAKPTSLAVQ----LGSKFSYCIGDLANKNYGYNQLVLGEDADILGDPTPIEFE 280

Query: 320 NMIPNPQLATFYILNLTGISIGGKQLQASGFA------KGGILIDSGTVITRLPPSIYSA 373
                    + Y +NL GIS+G  QL            + G+++DSGT+ T L    Y  
Sbjct: 281 TE------NSIYYMNLEGISVGDTQLNIEPVVFKRRGPRTGVILDSGTLYTWLADIAYRE 334

Query: 374 LKAEFLKQFSGFPSAPGFSILD-TCFNLSAYQE-VNIPLVKMEFEGNAEMTVDVTGIVYF 431
           L  E        P    F   D  C++    +E +  P+V   F G AE+ ++ T + Y 
Sbjct: 335 LYNEIKSILD--PKLERFWFRDFLCYHGRVSEELIGFPVVTFHFAGGAELAMEATSMFYP 392

Query: 432 VKSDAS--QVCLALASL-----SYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
           +    +    C+++         Y++ T  IG   Q+   + YD K   +     DC  +
Sbjct: 393 LSEPNTFNVFCMSVKPTKEHGGEYKEFTA-IGLMAQQYYNIGYDLKEKNIYLQRIDCVQL 451


>gi|147821993|emb|CAN70318.1| hypothetical protein VITISV_016757 [Vitis vinifera]
          Length = 429

 Score =  120 bits (301), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 109/374 (29%), Positives = 171/374 (45%), Gaps = 58/374 (15%)

Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNS 203
           + +T+++DTGS+L+W+ C+   + ++    VFDP  S SY  + C S TC          
Sbjct: 67  QTVTMVLDTGSELSWLHCKKAPNLHS----VFDPLRSSSYSPIPCTSPTCRTRTRDFSIP 122

Query: 204 GVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCG----RNNKGLFGG 259
             C       C+  +SY D S   G L  +   +G +++   IFGC      +N      
Sbjct: 123 VSCDKKK--LCHAIISYADASSIEGNLASDTFHIGNSAIPATIFGCMDSGFSSNSDEDSK 180

Query: 260 VSGLMGLGRSDLSLVSQTSEIFGGL--FSYCLPSTQDAGASGSLILGGNS---------- 307
            +GL+G+ R  LS V+Q      GL  FSYC+ S QD  +SG L+ G +S          
Sbjct: 181 TTGLIGMNRGSLSFVTQM-----GLQKFSYCI-SGQD--SSGILLFGESSFSWLKALKYT 232

Query: 308 SVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ--ASGFAK-----GGILIDSG 360
            + + STP+ Y + +        Y + L GI +    LQ   S +A      G  ++DSG
Sbjct: 233 PLVQISTPLPYFDRVA-------YTVQLEGIKVANSMLQLPKSVYAPDHTGAGQTMVDSG 285

Query: 361 TVITRLPPSIYSALKAEFLKQFSG---FPSAPGFSI---LDTCFNLSAYQEV--NIPLVK 412
           T  T L   +Y+ALK EF++Q          P F     +D C+ +   +     +P V 
Sbjct: 286 TQFTFLLGPVYTALKNEFVRQTKASLKVLEDPNFVFQGAMDLCYRVPLTRRTLPPLPTVT 345

Query: 413 MEFEGNAEMTVDVTGIVY----FVKSDASQVCLALA-SLSYEDETGIIGNYQQKNQRVIY 467
           + F G AEM+V    ++Y     ++   S  C     S     E+ IIG++ Q+N  + +
Sbjct: 346 LMFRG-AEMSVSAERLMYRVPGVIRGSDSVYCFTFGNSELLGVESYIIGHHHQQNVWMEF 404

Query: 468 DTKNSQLGFAGEDC 481
           D   S++GFA   C
Sbjct: 405 DLAKSRVGFAEVRC 418


>gi|223942467|gb|ACN25317.1| unknown [Zea mays]
 gi|413936886|gb|AFW71437.1| pepsin A [Zea mays]
          Length = 491

 Score =  120 bits (301), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 113/387 (29%), Positives = 173/387 (44%), Gaps = 49/387 (12%)

Query: 127 GIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQ-----QDPVFDPSI 179
           G+   T  Y   IE+G   +   V VDTGSD+ WV C  C  C  +     +   +DP+ 
Sbjct: 76  GLPTDTGLYYTRIEIGSPPKGYYVQVDTGSDILWVNCIRCDGCPTRSGLGIELTQYDPAG 135

Query: 180 SPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGK 239
           S +   V C    C A   A G    C S+S P C + ++YGDGS T G    + +   +
Sbjct: 136 SGT--TVGCEQEFCVA-NSAGGVPPTCPSTSSP-CQFRITYGDGSTTTGFYVTDFVQYNQ 191

Query: 240 ASVN--------DFIFGCGRNNKGLFG----GVSGLMGLGRSDLSLVSQ--TSEIFGGLF 285
            S N           FGCG    G  G     + G++G G+SD S++SQ   +     +F
Sbjct: 192 VSGNGQTTTSNASITFGCGAQLGGDLGSSNQALDGILGFGQSDSSMLSQLAAARRVRKIF 251

Query: 286 SYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL 345
           ++CL + +     G +   GN    K  T    T ++PN    T Y +NL GIS+GG  L
Sbjct: 252 AHCLDTVR----GGGIFAIGNVVQPKVKT----TPLVPN---VTHYNVNLQGISVGGATL 300

Query: 346 Q--ASGFAKG---GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNL 400
           Q   S F  G   G +IDSGT +  LP  +Y  L A    ++   P       +  CF  
Sbjct: 301 QLPTSTFDSGDSKGTIIDSGTTLAYLPREVYRTLLAAVFDKYQDLPLHNYQDFV--CFQF 358

Query: 401 SAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETG----IIG 456
           S   +   P++   FEG  ++T++V    Y  ++     C+       + + G    ++G
Sbjct: 359 SGSIDDGFPVITFSFEG--DLTLNVYPDDYLFQNRNDLYCMGFLDGGVQTKDGKDMLLLG 416

Query: 457 NYQQKNQRVIYDTKNSQLGFAGEDCSS 483
           +    N+ V+YD +   +G+   +CSS
Sbjct: 417 DLVLSNKLVVYDLEKEVIGWTDYNCSS 443


>gi|225449446|ref|XP_002283126.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 436

 Score =  120 bits (301), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 109/374 (29%), Positives = 171/374 (45%), Gaps = 58/374 (15%)

Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNS 203
           + +T+++DTGS+L+W+ C+   + ++    VFDP  S SY  + C S TC          
Sbjct: 74  QTVTMVLDTGSELSWLHCKKAPNLHS----VFDPLRSSSYSPIPCTSPTCRTRTRDFSIP 129

Query: 204 GVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCG----RNNKGLFGG 259
             C       C+  +SY D S   G L  +   +G +++   IFGC      +N      
Sbjct: 130 VSCDKKK--LCHAIISYADASSIEGNLASDTFHIGNSAIPATIFGCMDSGFSSNSDEDSK 187

Query: 260 VSGLMGLGRSDLSLVSQTSEIFGGL--FSYCLPSTQDAGASGSLILGGNS---------- 307
            +GL+G+ R  LS V+Q      GL  FSYC+ S QD  +SG L+ G +S          
Sbjct: 188 TTGLIGMNRGSLSFVTQM-----GLQKFSYCI-SGQD--SSGILLFGESSFSWLKALKYT 239

Query: 308 SVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ--ASGFAK-----GGILIDSG 360
            + + STP+ Y + +        Y + L GI +    LQ   S +A      G  ++DSG
Sbjct: 240 PLVQISTPLPYFDRVA-------YTVQLEGIKVANSMLQLPKSVYAPDHTGAGQTMVDSG 292

Query: 361 TVITRLPPSIYSALKAEFLKQFSG---FPSAPGFSI---LDTCFNLSAYQEV--NIPLVK 412
           T  T L   +Y+ALK EF++Q          P F     +D C+ +   +     +P V 
Sbjct: 293 TQFTFLLGPVYTALKNEFVRQTKASLKVLEDPNFVFQGAMDLCYRVPLTRRTLPPLPTVT 352

Query: 413 MEFEGNAEMTVDVTGIVY----FVKSDASQVCLALA-SLSYEDETGIIGNYQQKNQRVIY 467
           + F G AEM+V    ++Y     ++   S  C     S     E+ IIG++ Q+N  + +
Sbjct: 353 LMFRG-AEMSVSAERLMYRVPGVIRGSDSVYCFTFGNSELLGVESYIIGHHHQQNVWMEF 411

Query: 468 DTKNSQLGFAGEDC 481
           D   S++GFA   C
Sbjct: 412 DLAKSRVGFAEVRC 425


>gi|224090744|ref|XP_002309070.1| predicted protein [Populus trichocarpa]
 gi|222855046|gb|EEE92593.1| predicted protein [Populus trichocarpa]
          Length = 404

 Score =  120 bits (301), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 113/375 (30%), Positives = 179/375 (47%), Gaps = 48/375 (12%)

Query: 138 TIELGGRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTC--HA 195
           T+    +N+++++DTGS+L+W+ C    S        FDP+ S SY+ + C+S TC    
Sbjct: 36  TVGTPPQNVSMVIDTGSELSWLHCNKTLSYPT----TFDPTRSTSYQTIPCSSPTCTNRT 91

Query: 196 LEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCG----R 251
            +F    S  C S++   C+  +SY D S + G L  +   +G + ++  +FGC      
Sbjct: 92  QDFPIPAS--CDSNN--LCHATLSYADASSSDGNLASDVFHIGSSDISGLVFGCMDSVFS 147

Query: 252 NNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFK 311
           +N       +GLMG+ R  LS VSQ   +    FSYC+  T     SG L+LG ++  + 
Sbjct: 148 SNSDEDSKSTGLMGMNRGSLSFVSQ---LGFPKFSYCISGTD---FSGLLLLGESNLTW- 200

Query: 312 NSTPITYTNMI----PNPQLATF-YILNLTGISIGGKQL-------QASGFAKGGILIDS 359
            S P+ YT +I    P P      Y + L GI +  K L       +      G  ++DS
Sbjct: 201 -SVPLNYTPLIQISTPLPYFDRVAYTVQLEGIKVLDKLLPIPKSTFEPDHTGAGQTMVDS 259

Query: 360 GTVITRLPPSIYSALKAEFLKQFSGFPSA---PGFSI---LDTCFNLSAYQEV--NIPLV 411
           GT  T L   +Y+AL++ FL Q S        P F     +D C+ +   Q V   +P V
Sbjct: 260 GTQFTFLLGPVYNALRSAFLNQTSSVLRVLEDPDFVFQGAMDLCYLVPLSQRVLPLLPTV 319

Query: 412 KMEFEGNAEMTVDVTGIVYFV----KSDASQVCLALASLSYED-ETGIIGNYQQKNQRVI 466
            + F G AEMTV    ++Y V    + + S  CL+  +      E  +IG++ Q+N  + 
Sbjct: 320 TLVFRG-AEMTVSGDRVLYRVPGELRGNDSVHCLSFGNSDLLGVEAYVIGHHHQQNVWME 378

Query: 467 YDTKNSQLGFAGEDC 481
           +D + S++G A   C
Sbjct: 379 FDLEKSRIGLAQVRC 393


>gi|359482097|ref|XP_002271077.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 458

 Score =  120 bits (301), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 115/373 (30%), Positives = 170/373 (45%), Gaps = 44/373 (11%)

Query: 138 TIELGGRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALE 197
           T+    +N+++++DTGS+L+W++C   ++        FDP+ S SY  V C+S TC    
Sbjct: 90  TVGTPPQNVSMVLDTGSELSWLRCNKTQTFQT----TFDPNRSSSYSPVPCSSLTCTDRT 145

Query: 198 FATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRN----N 253
                   C S+    C+  +SY D S + G L  +   +G + +   IFGC  +    N
Sbjct: 146 RDFPIPASCDSNQ--LCHAILSYADASSSEGNLASDTFYIGNSDMPGTIFGCMDSSFSTN 203

Query: 254 KGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNS 313
                  +GLMG+ R  LS VSQ        FSYC+    D+  SG L+LG  +  F   
Sbjct: 204 TEEDSKNTGLMGMNRGSLSFVSQMD---FPKFSYCI---SDSDFSGVLLLGDAN--FSWL 255

Query: 314 TPITYTNMI----PNPQLATF-YILNLTGISIGGK--QLQASGFAK-----GGILIDSGT 361
            P+ YT +I    P P      Y + L GI +  K   L  S F       G  ++DSGT
Sbjct: 256 MPLNYTPLIQISTPLPYFDRVAYTVQLEGIKVSSKLLPLPKSVFVPDHTGAGQTMVDSGT 315

Query: 362 VITRLPPSIYSALKAEFLKQFSGFPSA---PGFSI---LDTCFN--LSAYQEVNIPLVKM 413
             T L   +YSAL+ EFL Q S        P +     +D C+   LS      +P V +
Sbjct: 316 QFTFLLGPVYSALRNEFLNQTSQILRVLEDPNYVFQGGMDLCYRVPLSQTSLPWLPTVSL 375

Query: 414 EFEGNAEMTVDVTGIVYFVKSDA----SQVCLALA-SLSYEDETGIIGNYQQKNQRVIYD 468
            F G AEM V    ++Y V  +     S  C     S     E  +IG++ Q+N  + +D
Sbjct: 376 MFRG-AEMKVSGDRLLYRVPGEVRGSDSVYCFTFGNSDLLAVEAYVIGHHHQQNVWMEFD 434

Query: 469 TKNSQLGFAGEDC 481
            + S++GFA   C
Sbjct: 435 LEKSRIGFAQVQC 447


>gi|15232960|ref|NP_186923.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|6728988|gb|AAF26986.1|AC018363_31 putative aspartyl protease [Arabidopsis thaliana]
 gi|21593593|gb|AAM65560.1| putative aspartyl protease [Arabidopsis thaliana]
 gi|332640332|gb|AEE73853.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 488

 Score =  120 bits (301), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 116/424 (27%), Positives = 189/424 (44%), Gaps = 66/424 (15%)

Query: 100 QYLQSRIKNMISGNIKDV-------SNTEIPLTSGIRLQTLN-YIATIELG--GRNMTVI 149
           ++   R+K++ +    DV       S  +IPL    + +++  Y A I LG   R+  V 
Sbjct: 42  KFAGKRVKDLGALRAHDVHRHSRLLSAIDIPLGGDSQPESIGLYFAKIGLGTPSRDFHVQ 101

Query: 150 VDTGSDLTWVQCQPCKSCYNQQDPV----FDPSISPSYKKVLCNSSTCHALEFATGNSGV 205
           VDTGSD+ WV C  C  C  + D V    +D   S + K V C+ + C  +         
Sbjct: 102 VDTGSDILWVNCAGCIRCPRKSDLVELTPYDVDASSTAKSVSCSDNFCSYVN----QRSE 157

Query: 206 CSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVN--------DFIFGCGRNNKGLF 257
           C S S   C Y + YGDGS T G L ++ + L   + N          IFGCG    G  
Sbjct: 158 CHSGST--CQYVIMYGDGSSTNGYLVKDVVHLDLVTGNRQTGSTNGTIIFGCGSKQSGQL 215

Query: 258 G----GVSGLMGLGRSDLSLVSQTSE--IFGGLFSYCLPSTQDAG--ASGSLILGGNSSV 309
           G     V G+MG G+S+ S +SQ +        F++CL +    G  A G ++       
Sbjct: 216 GESQAAVDGIMGFGQSNSSFISQLASQGKVKRSFAHCLDNNNGGGIFAIGEVV------- 268

Query: 310 FKNSTPITYTNMIPNPQLATFYILNLTGISIGGK--QLQASGFAKG---GILIDSGTVIT 364
              S  +  T M+     +  Y +NL  I +G    +L ++ F  G   G++IDSGT + 
Sbjct: 269 ---SPKVKTTPMLSK---SAHYSVNLNAIEVGNSVLELSSNAFDSGDDKGVIIDSGTTLV 322

Query: 365 RLPPSIYSALKAEFLKQFSGFPSAPGFSILD--TCFNLSAYQEVNIPLVKMEFEGNAEMT 422
            LP ++Y+ L  E L   +  P     ++ +  TCF+ +   +   P V  +F+ +  + 
Sbjct: 323 YLPDAVYNPLLNEIL---ASHPELTLHTVQESFTCFHYTDKLD-RFPTVTFQFDKSVSLA 378

Query: 423 VDVTGIVYFVKSDASQVCLALASLSYEDETG----IIGNYQQKNQRVIYDTKNSQLGFAG 478
           V     ++ V+ D    C    +   + + G    I+G+    N+ V+YD +N  +G+  
Sbjct: 379 VYPREYLFQVREDT--WCFGWQNGGLQTKGGASLTILGDMALSNKLVVYDIENQVIGWTN 436

Query: 479 EDCS 482
            +CS
Sbjct: 437 HNCS 440


>gi|413944032|gb|AFW76681.1| hypothetical protein ZEAMMB73_606599 [Zea mays]
          Length = 315

 Score =  120 bits (300), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 110/332 (33%), Positives = 163/332 (49%), Gaps = 43/332 (12%)

Query: 179 ISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPD---CNYFVSYGDGSYTRGELGREHL 235
           +S ++K V C    C        +SGV  S+   +   C Y  SYGD S T G + ++  
Sbjct: 1   MSSTFKAVACPDPICRP------SSGVSVSACAMENFQCFYLCSYGDRSITAGHIFKDTF 54

Query: 236 GLG-----KASVNDFIFGCGRNNKGLF-GGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL 289
                     +V++  FGCG  N GLF    SG+ G GR   SL SQ      G FSYCL
Sbjct: 55  TFMSPNGVPVAVSELAFGCGDYNTGLFVSNESGIAGFGRGPQSLPSQLKV---GRFSYCL 111

Query: 290 PSTQDAGASGSLILGGNSSV----FKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL 345
               ++ +S  +ILG            + P   T +I NP + TFY L+L GI++G  +L
Sbjct: 112 TLVTESKSS-VVILGTPPDPDGLRAHTTGPFQSTPIIYNPLIPTFYYLSLEGITVGKTRL 170

Query: 346 --QASGFA-----KGGILIDSGTVITRLPPSIYSALKAEFLKQFS--GFPSAPGFSILDT 396
               S FA      GG +IDSGT +T LP +++  L+ E + QF    + + P   + D 
Sbjct: 171 PFDKSVFALKKDGSGGTVIDSGTSLTTLPEAVFELLQEELVAQFPLPRYDNTP--EVGDR 228

Query: 397 -CFNLS-AYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKS-DASQVCLALASLSYEDETG 453
            CF      ++V +P + +   G A+M  D+    YFV+  D+  +CL +     ED T 
Sbjct: 229 LCFRRPKGGKQVPVPKLILHLAG-ADM--DLPRDNYFVEEPDSGVMCLQINGA--EDTTM 283

Query: 454 I-IGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
           + IGN+QQ+N  V+YD +N++L FA   C  +
Sbjct: 284 VLIGNFQQQNMHVVYDVENNKLLFAPAQCDKL 315


>gi|224102847|ref|XP_002312826.1| predicted protein [Populus trichocarpa]
 gi|222849234|gb|EEE86781.1| predicted protein [Populus trichocarpa]
          Length = 445

 Score =  120 bits (300), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 111/380 (29%), Positives = 167/380 (43%), Gaps = 54/380 (14%)

Query: 144 RNMTVIVDTGSDLTWVQCQP---CKSCYNQQDPV------FDPSISPSYKKVLCNSSTCH 194
           + ++ I+DTGSD+ W  C     CK C             F P  S S K + C +  C 
Sbjct: 78  QTLSFIMDTGSDIVWFPCTSHYLCKHCSFSSSSPSSRIQPFIPKESSSSKLLGCKNPKCS 137

Query: 195 ALEFATGNSGV-CSSSS------PPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIF 247
            +  +  N    CS  S      PP   Y + YG G+ T G    E L L   S  +F+ 
Sbjct: 138 WIHHSNINCDQDCSIKSCLNQTCPP---YMIFYGSGT-TGGVALSETLHLHSLSKPNFLV 193

Query: 248 GCGRNNKGLFGG--VSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQ---DAGASGSLI 302
           GC      +F     +G+ G GR   SL SQ      G FSYCL S +   D   S SL+
Sbjct: 194 GCS-----VFSSHQPAGIAGFGRGLSSLPSQLGL---GKFSYCLLSHRFDDDTKKSSSLV 245

Query: 303 LGGNS-SVFKNSTPITYTNMIPNPQL------ATFYILNLTGISIGG-------KQLQAS 348
           L        K +  + YT  + NP++      + +Y L L  I++GG       K L   
Sbjct: 246 LDMEQLDSDKKTNALVYTPFVKNPKVDNKSSFSVYYYLGLRRITVGGHHVKVPYKYLSPG 305

Query: 349 GFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI---LDTCFNLSAYQE 405
               GG++IDSGT  T +    +  L  EF++Q   +           L  CFN+S  + 
Sbjct: 306 EDGNGGVIIDSGTTFTFMAREAFEPLSDEFIRQIKDYRRVKEIEDAIGLRPCFNVSDAKT 365

Query: 406 VNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETG----IIGNYQQK 461
           V+ P +++ F+G A++ + V     FV  + + + +    ++  +  G    I+GN+Q +
Sbjct: 366 VSFPELRLYFKGGADVALPVENYFAFVGGEVACLTVVTDGVAGPERVGGPGMILGNFQMQ 425

Query: 462 NQRVIYDTKNSQLGFAGEDC 481
           N  V YD +N +LGF  E C
Sbjct: 426 NFYVEYDLRNERLGFKQEKC 445


>gi|225438908|ref|XP_002279194.1| PREDICTED: aspartic proteinase-like protein 2-like [Vitis vinifera]
          Length = 634

 Score =  120 bits (300), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 107/374 (28%), Positives = 163/374 (43%), Gaps = 52/374 (13%)

Query: 135 YIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCN-SS 191
           Y   I +G   +   +IVDTGS LT+V C  C+ C   QDP F P  S +Y+ + C+   
Sbjct: 92  YTTRIWIGTPPQTFALIVDTGSTLTYVPCSTCEQCGKHQDPNFQPDWSSTYQPLKCSMEC 151

Query: 192 TCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS---VNDFIFG 248
           TC               S    C Y   Y + S + G LG + +  GK S       +FG
Sbjct: 152 TC--------------DSEMMHCVYDRQYAEMSSSSGVLGEDIVSFGKQSELKPQRTVFG 197

Query: 249 CGRNNKGLFGG--VSGLMGLGRSDLSLVSQTSE--IFGGLFSYCLPSTQDAGASGSLILG 304
           C     G        G+MGLGR DLS+V Q  E  + G  FS C     D G  G+++LG
Sbjct: 198 CENVETGDIYSQRADGIMGLGRGDLSIVDQLVEKGVIGNSFSLCY-GGMDVGG-GAMVLG 255

Query: 305 GNSS----VFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA---KGGILI 357
           G S     VF +S          +P  + +Y ++L  I I GKQL  +      K G ++
Sbjct: 256 GISPPAGMVFTHS----------DPARSAYYNIDLKEIHIAGKQLPINPMVFDGKYGTIL 305

Query: 358 DSGTVITRLPPSIYSALKAEFLKQFSGFP--SAPGFSILDTCF-----NLSAYQEVNIPL 410
           DSGT    LP   + A K   +K+ +       P  +  D CF     ++S   +   P 
Sbjct: 306 DSGTTYAYLPEPAFKAFKDAIMKELNSLKLIQGPDRNYNDICFSGVGSDVSQLSK-TFPA 364

Query: 411 VKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTK 470
           V + F     +++     ++         CL +   +  D+T ++G    +N  V+YD +
Sbjct: 365 VDLVFSNGNRLSLSPENYLFQHSKAHGAYCLGIFQ-NENDQTTLLGGIIVRNTLVMYDRE 423

Query: 471 NSQLGFAGEDCSSM 484
           + ++GF   +CS +
Sbjct: 424 HLKIGFWKTNCSEI 437


>gi|296087361|emb|CBI33735.3| unnamed protein product [Vitis vinifera]
          Length = 633

 Score =  120 bits (300), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 107/374 (28%), Positives = 163/374 (43%), Gaps = 52/374 (13%)

Query: 135 YIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCN-SS 191
           Y   I +G   +   +IVDTGS LT+V C  C+ C   QDP F P  S +Y+ + C+   
Sbjct: 92  YTTRIWIGTPPQTFALIVDTGSTLTYVPCSTCEQCGKHQDPNFQPDWSSTYQPLKCSMEC 151

Query: 192 TCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS---VNDFIFG 248
           TC               S    C Y   Y + S + G LG + +  GK S       +FG
Sbjct: 152 TC--------------DSEMMHCVYDRQYAEMSSSSGVLGEDIVSFGKQSELKPQRTVFG 197

Query: 249 CGRNNKGLFGG--VSGLMGLGRSDLSLVSQTSE--IFGGLFSYCLPSTQDAGASGSLILG 304
           C     G        G+MGLGR DLS+V Q  E  + G  FS C     D G  G+++LG
Sbjct: 198 CENVETGDIYSQRADGIMGLGRGDLSIVDQLVEKGVIGNSFSLCY-GGMDVGG-GAMVLG 255

Query: 305 GNSS----VFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA---KGGILI 357
           G S     VF +S          +P  + +Y ++L  I I GKQL  +      K G ++
Sbjct: 256 GISPPAGMVFTHS----------DPARSAYYNIDLKEIHIAGKQLPINPMVFDGKYGTIL 305

Query: 358 DSGTVITRLPPSIYSALKAEFLKQFSGFP--SAPGFSILDTCF-----NLSAYQEVNIPL 410
           DSGT    LP   + A K   +K+ +       P  +  D CF     ++S   +   P 
Sbjct: 306 DSGTTYAYLPEPAFKAFKDAIMKELNSLKLIQGPDRNYNDICFSGVGSDVSQLSK-TFPA 364

Query: 411 VKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTK 470
           V + F     +++     ++         CL +   +  D+T ++G    +N  V+YD +
Sbjct: 365 VDLVFSNGNRLSLSPENYLFQHSKAHGAYCLGIFQ-NENDQTTLLGGIIVRNTLVMYDRE 423

Query: 471 NSQLGFAGEDCSSM 484
           + ++GF   +CS +
Sbjct: 424 HLKIGFWKTNCSEI 437


>gi|115473125|ref|NP_001060161.1| Os07g0592200 [Oryza sativa Japonica Group]
 gi|29027762|dbj|BAC65898.1| putative CND41, chloroplast nucleoid DNA binding protein [Oryza
           sativa Japonica Group]
 gi|113611697|dbj|BAF22075.1| Os07g0592200 [Oryza sativa Japonica Group]
          Length = 631

 Score =  120 bits (300), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 114/387 (29%), Positives = 167/387 (43%), Gaps = 62/387 (16%)

Query: 124 LTSGIRLQTLNYIATIELGGRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSY 183
           LT+G     L YI T     +   +IVD+GS +T+V C  C+ C N QDP F P +S +Y
Sbjct: 86  LTNGYYTTRL-YIGT---PSQEFALIVDSGSTVTYVPCATCEQCGNHQDPRFQPDLSSTY 141

Query: 184 KKVLCN-SSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS- 241
             V CN   TC               +    C Y   Y + S + G LG + +  GK S 
Sbjct: 142 SPVKCNVDCTC--------------DNERSQCTYERQYAEMSSSSGVLGEDIMSFGKESE 187

Query: 242 --VNDFIFGCGRNNKG-LFG-GVSGLMGLGRSDLSLVSQTSE--IFGGLFSYCLPSTQDA 295
                 +FGC     G LF     G+MGLGR  LS++ Q  E  +    FS C     D 
Sbjct: 188 LKPQRAVFGCENTETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCY-GGMDV 246

Query: 296 GASGSLILGGNSS----VFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGF- 350
           G  G+++LGG  +    VF +S P+           + +Y + L  I + GK L+     
Sbjct: 247 GG-GTMVLGGMPAPPDMVFSHSNPVR----------SPYYNIELKEIHVAGKALRLDPKI 295

Query: 351 --AKGGILIDSGTVITRLPPSIYSALKAEF------LKQFSGFPSAPGFSILDTCF---- 398
             +K G ++DSGT    LP   + A K         LK+  G    P  +  D CF    
Sbjct: 296 FNSKHGTVLDSGTTYAYLPEQAFVAFKDAVTNKVNSLKKIRG----PDPNYKDICFAGAG 351

Query: 399 -NLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGN 457
            N+S   EV  P V M F    ++++     ++         CL +   + +D T ++G 
Sbjct: 352 RNVSQLSEV-FPDVDMVFGNGQKLSLSPENYLFRHSKVEGAYCLGVFQ-NGKDPTTLLGG 409

Query: 458 YQQKNQRVIYDTKNSQLGFAGEDCSSM 484
              +N  V YD  N ++GF   +CS +
Sbjct: 410 IVVRNTLVTYDRHNEKIGFWKTNCSEL 436


>gi|21717166|gb|AAM76359.1|AC074196_17 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433306|gb|AAP54835.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|125575546|gb|EAZ16830.1| hypothetical protein OsJ_32301 [Oryza sativa Japonica Group]
          Length = 373

 Score =  119 bits (299), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 104/365 (28%), Positives = 168/365 (46%), Gaps = 35/365 (9%)

Query: 135 YIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
           Y+A + +G   +  + I+    +  W QC PC+ C+ Q  P+F+ S S +Y+   C ++ 
Sbjct: 28  YMANLTIGTPPQPASAIIHLAGEFVWTQCSPCRRCFKQDLPLFNRSASSTYRPEPCGTAL 87

Query: 193 CHALEFAT-GNSGVCSSSSPPDCNYFVS--YGDGSYTRGELGREHLGLGKASVNDFIFGC 249
           C ++  +T    GVCS        Y V   +GD   T G  G +   +G A+ +   FGC
Sbjct: 88  CESVPASTCSGDGVCS--------YEVETMFGD---TSGIGGTDTFAIGTATAS-LAFGC 135

Query: 250 G--RNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNS 307
               N K L G  SG++GLGR+  SLV Q +      FSYCL     AG   +L+LG ++
Sbjct: 136 AMDSNIKQLLGA-SGVVGLGRTPWSLVGQMNAT---AFSYCLAPHGAAGKKSALLLGASA 191

Query: 308 SVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGGILIDSGTVITRLP 367
            +    +  T T ++     ++ Y+++L GI   G  + A       +L+D+   ++ L 
Sbjct: 192 KLAGGKSAAT-TPLVNTSDDSSDYMIHLEGIKF-GDVIIAPPPNGSVVLVDTIFGVSFLV 249

Query: 368 PSIYSALKAEFLKQFSGFPSAPGFSILDTCF-----NLSAYQEVNIPLVKMEFEGNAEMT 422
            + + A+K          P A      D CF        A   + +P V + F+G A +T
Sbjct: 250 DAAFQAIKKAVTVAVGAAPMATPTKPFDLCFPKAAAAAGANSSLPLPDVVLTFQGAAALT 309

Query: 423 VDVTGIVYFVKSDASQVCLAL---ASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGE 479
           V  +  +Y   +    VCLA+   A L+   E  I+G   Q+N   ++D     L F   
Sbjct: 310 VPPSKYMY--DAGNGTVCLAMMSSAMLNLTTELSILGRLHQENIHFLFDLDKETLSFEPA 367

Query: 480 DCSSM 484
           DCSS+
Sbjct: 368 DCSSL 372


>gi|345292859|gb|AEN82921.1| AT5G10770-like protein, partial [Capsella rubella]
 gi|345292861|gb|AEN82922.1| AT5G10770-like protein, partial [Capsella rubella]
 gi|345292863|gb|AEN82923.1| AT5G10770-like protein, partial [Capsella rubella]
 gi|345292865|gb|AEN82924.1| AT5G10770-like protein, partial [Capsella rubella]
 gi|345292867|gb|AEN82925.1| AT5G10770-like protein, partial [Capsella rubella]
 gi|345292869|gb|AEN82926.1| AT5G10770-like protein, partial [Capsella rubella]
 gi|345292871|gb|AEN82927.1| AT5G10770-like protein, partial [Capsella rubella]
 gi|345292873|gb|AEN82928.1| AT5G10770-like protein, partial [Capsella rubella]
          Length = 161

 Score =  119 bits (299), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 73/165 (44%), Positives = 100/165 (60%), Gaps = 8/165 (4%)

Query: 271 LSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATF 330
           LS  SQT+  +  +FSYCLPS+  A  +G L  G  S+    S   T  + I +    +F
Sbjct: 1   LSFPSQTATAYNKIFSYCLPSS--ASYTGHLTFG--SAGISRSVKFTPISTISDGN--SF 54

Query: 331 YILNLTGISIGGKQLQ--ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSA 388
           Y LN+ GI++GG++L   ++ F+  G LIDSGTVITRLPP  Y+AL++ F  Q S +P+A
Sbjct: 55  YGLNIVGITVGGQKLAIPSTVFSTPGALIDSGTVITRLPPKAYAALRSSFKAQMSKYPTA 114

Query: 389 PGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVK 433
            G SILDTCF+LS ++ V IP V   F G A + +   GI Y  K
Sbjct: 115 SGVSILDTCFDLSGFKTVTIPKVAFSFSGGAVVELGSKGIFYAFK 159


>gi|307136234|gb|ADN34070.1| aspartic proteinase nepenthesin-1 precursor [Cucumis melo subsp.
           melo]
          Length = 412

 Score =  119 bits (299), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 114/385 (29%), Positives = 179/385 (46%), Gaps = 57/385 (14%)

Query: 132 TLNYIATIELGGRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSS 191
           TL    T+    + +T+++DTGS+L+W+ C+   +  +    VF+P  S SY  + C+S 
Sbjct: 39  TLTVSLTVGSPPQQVTMVLDTGSELSWLHCKKSPNLTS----VFNPLSSSSYSPIPCSSP 94

Query: 192 TCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCG- 250
            C        N   C       C+  VSY D S   G L  ++  +G +++   +FGC  
Sbjct: 95  VCRTRTRDLPNPVTCDPKK--LCHAIVSYADASSLEGNLASDNFRIGSSALPGTLFGCMD 152

Query: 251 ---RNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGL--FSYCLPSTQDAGASGSLILG- 304
               +N       +GLMG+ R  LS V+Q      GL  FSYC+ S +D  +SG L+ G 
Sbjct: 153 SGFSSNSEEDAKTTGLMGMNRGSLSFVTQL-----GLPKFSYCI-SGRD--SSGVLLFGD 204

Query: 305 ------GN---SSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGK--QLQASGFAK- 352
                 GN   + + + STP+ Y + +        Y + L GI +G K   L  S FA  
Sbjct: 205 SHLSWLGNLTYTPLVQISTPLPYFDRVA-------YTVQLDGIRVGNKILPLPKSIFAPD 257

Query: 353 ----GGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSA---PGFSI---LDTCFNLSA 402
               G  ++DSGT  T L   +Y+AL+ EFL+Q  G  +    P F     +D C+ + A
Sbjct: 258 HTGAGQTMVDSGTQFTFLLGPVYTALRNEFLEQTKGVLAPLGDPNFVFQGAMDLCYRVPA 317

Query: 403 YQEV-NIPLVKMEFEGNAEMTVDVTGIVY----FVKSDASQVCLALASLSYED-ETGIIG 456
             ++  +P V + F G AEM V    ++Y     +K      CL   +      E  +IG
Sbjct: 318 GGKLPELPAVSLMFRG-AEMVVGGEVLLYKVPGMMKGKEWVYCLTFGNSDLLGIEAFVIG 376

Query: 457 NYQQKNQRVIYDTKNSQLGFAGEDC 481
           ++ Q+N  + +D   S++GF    C
Sbjct: 377 HHHQQNVWMEFDLVKSRVGFVETRC 401


>gi|242086416|ref|XP_002443633.1| hypothetical protein SORBIDRAFT_08g022640 [Sorghum bicolor]
 gi|241944326|gb|EES17471.1| hypothetical protein SORBIDRAFT_08g022640 [Sorghum bicolor]
          Length = 503

 Score =  119 bits (299), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 109/370 (29%), Positives = 173/370 (46%), Gaps = 33/370 (8%)

Query: 132 TLNYIATIELGG--RNMTVIVDTGS-DLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLC 188
           TL Y   +  G   +   V++DT S  ++ ++C+PC S  +     FD S S ++  VLC
Sbjct: 147 TLQYSVLVSYGTPEQQFPVLLDTSSIGMSLLRCKPCASGSDDCHLAFDTSRSSTFAHVLC 206

Query: 189 NSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFG 248
            S  C       G+     S  P D  Y  S  DG++    L    L     ++ +F F 
Sbjct: 207 GSPDCPTNCSGDGDG---DSFCPLDSTY--SIIDGAFAEDVLT---LAPSSKAIENFRFV 258

Query: 249 C---GRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFG---GLFSYCLPSTQDAGASGSLI 302
           C      +  L   V+G + L R   SL SQ S   G     FSYCLP  +   + G L 
Sbjct: 259 CLDVDEPDDDL--PVAGTLDLSRDRNSLPSQLSSSPGQATAAFSYCLP--KSPSSQGYLS 314

Query: 303 LGGNSSVFKNSTPITYTNMIPN---PQLATFYILNLTGISIGGKQLQ---ASGFAKGGIL 356
           L  +++V ++     +  ++ N   P+LA+ Y ++L G+S+G   +    A  F   G+ 
Sbjct: 315 LAVDATV-RHDKVTAHAPLVSNGGDPELASMYFIDLVGMSLGVDDIPIPPAGSFGNNGVN 373

Query: 357 IDSGTVITRLPPSIYSALKAEFLKQFS-GFPSAPGFSILDTCFNLSAYQEVNIPLVKMEF 415
           +D GT  T+L P +Y  L+  F KQ S    S  GF   DTCFNL+  +++ +PL+  +F
Sbjct: 374 LDLGTTFTKLTPEVYMTLRDSFRKQMSQNNHSLLGFDGFDTCFNLTGVRDLAMPLLWFKF 433

Query: 416 EGNAEMTVDVTGIVYFVKSDA---SQVCLALASLSYEDE-TGIIGNYQQKNQRVIYDTKN 471
                + +D+  ++Y+    A   +  CLA +SL   D  + +IG +   +  VIYD   
Sbjct: 434 SNGERLLIDLDQMLYYDDPAAAPFTMACLAFSSLDAGDSFSAVIGTHTLASTEVIYDVAG 493

Query: 472 SQLGFAGEDC 481
            ++GF    C
Sbjct: 494 GKVGFIPRSC 503


>gi|356541713|ref|XP_003539318.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 640

 Score =  119 bits (298), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 104/364 (28%), Positives = 164/364 (45%), Gaps = 52/364 (14%)

Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNS 203
           +   +IVDTGS +T+V C  CK C + QDP F P  S +Y+ V C +  C+         
Sbjct: 104 QRFALIVDTGSTVTYVPCSTCKHCGSHQDPKFRPEASETYQPVKC-TWQCN--------- 153

Query: 204 GVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG---KASVNDFIFGCGRNNKGLFGG- 259
             C       C Y   Y + S + G LG + +  G   + S    IFGC  +  G     
Sbjct: 154 --CDDDR-KQCTYERRYAEMSTSSGVLGEDVVSFGNQSELSPQRAIFGCENDETGDIYNQ 210

Query: 260 -VSGLMGLGRSDLSLVSQTSE--IFGGLFSYCLPSTQDAGASGSLILGGNSS----VFKN 312
              G+MGLGR DLS++ Q  E  +    FS C          G+++LGG S     VF +
Sbjct: 211 RADGIMGLGRGDLSIMDQLVEKKVISDAFSLCY--GGMGVGGGAMVLGGISPPADMVFTH 268

Query: 313 STPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA---KGGILIDSGTVITRLPPS 369
           S P+           + +Y ++L  I + GK+L  +      K G ++DSGT    LP S
Sbjct: 269 SDPVR----------SPYYNIDLKEIHVAGKRLHLNPKVFDGKHGTVLDSGTTYAYLPES 318

Query: 370 IYSALKAEFLKQFSGFP--SAPGFSILDTCFNLSAYQEVNI-------PLVKMEFEGNAE 420
            + A K   +K+       S P     D CF+ +   E+N+       P+V+M F    +
Sbjct: 319 AFLAFKHAIMKETHSLKRISGPDPHYNDICFSGA---EINVSQLSKSFPVVEMVFGNGHK 375

Query: 421 MTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGED 480
           +++     ++         CL + S +  D T ++G    +N  V+YD ++S++GF   +
Sbjct: 376 LSLSPENYLFRHSKVRGAYCLGVFS-NGNDPTTLLGGIVVRNTLVMYDREHSKIGFWKTN 434

Query: 481 CSSM 484
           CS +
Sbjct: 435 CSEL 438


>gi|255587337|ref|XP_002534234.1| pepsin A, putative [Ricinus communis]
 gi|223525662|gb|EEF28148.1| pepsin A, putative [Ricinus communis]
          Length = 468

 Score =  119 bits (298), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 118/397 (29%), Positives = 177/397 (44%), Gaps = 61/397 (15%)

Query: 135 YIATIELGGRNMTV--IVDTGSDLTWVQCQP---CKSC-YNQQD----PVFDPSISPSYK 184
           Y  ++ LG  + TV  I+DTGS L W  C     C SC +   D    P F P +S S K
Sbjct: 84  YSMSLSLGTPSQTVKLIMDTGSSLVWFPCTSRYVCASCNFPNTDITKIPKFMPRLSSSSK 143

Query: 185 KVLCNSSTCHALEFATGNSGVCSSSSPPDCN-------YFVSYGDGSYTRGELGREHLGL 237
            + C +  C A  F +     C + +P   N       Y + YG GS T G L  E +  
Sbjct: 144 LIGCKNPKC-AWVFGSSVQSKCHNCNPQAQNCTQACPPYIIQYGLGS-TAGLLLSETINF 201

Query: 238 GKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGL--FSYCLPSTQ-- 293
              +++DF+ GC   +        G+ G GRS  SL  Q      GL  FSYCL S +  
Sbjct: 202 PNKTISDFLAGCSLLSTR---QPEGIAGFGRSQESLPLQL-----GLKKFSYCLVSRRFD 253

Query: 294 DAGASGSLILG-GNSSVFKNSTPITYTNMIPN------PQLATFYILNLTGISIGGKQLQ 346
           D+  S  LIL  G S+    +T ++YT    N      P    +Y + L  I +G   ++
Sbjct: 254 DSPVSSDLILDMGPSTSDSKTTGLSYTPFQKNLASQSNPAFQEYYYVMLRKIIVGKTHVK 313

Query: 347 AS-------GFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPG---FSILDT 396
                        GG ++DSG+  T +   ++  L  EF KQ + +  A      + L  
Sbjct: 314 VPYSFLVPGSDGNGGTIVDSGSTFTFVEGHVFELLAKEFEKQMANYTVATNVQKLTGLRP 373

Query: 397 CFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETG--- 453
           CF++S  + V IP +  +F+G A+M + ++   YF   D   VCL + S +     G   
Sbjct: 374 CFDISGEKSVVIPDLTFQFKGGAKMQLPLSN--YFAFVDMGVVCLTIVSDNAAALGGDGG 431

Query: 454 --------IIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
                   I+GN+QQ+N  + YD +N + GF  + C+
Sbjct: 432 VRSSGPAIILGNFQQQNFYIEYDLENDRFGFKEQSCA 468


>gi|413937238|gb|AFW71789.1| hypothetical protein ZEAMMB73_638381 [Zea mays]
          Length = 598

 Score =  119 bits (298), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 84/268 (31%), Positives = 137/268 (51%), Gaps = 26/268 (9%)

Query: 230 LGREHLGLGKAS--VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSY 287
           LG++ L L      V  + FGC R   G      GL+G G   LS  SQ  +++G +FSY
Sbjct: 343 LGQDALALHDDVDVVAAYTFGCLRVVTGGSVPPQGLVGFGCGPLSFPSQNKDVYGFVFSY 402

Query: 288 CLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL-- 345
           CLPS + +  S +L LG      +    I  T ++ NP   + Y +N+ GI +GG+ +  
Sbjct: 403 CLPSYKSSNFSSTLRLGPAGQPKR----IKMTPLLSNPHRPSLYYVNMVGIHVGGRPMLV 458

Query: 346 QASGFAKG-----GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAP--GFSILDTCF 398
            AS  A       G ++D+GT+ TRL   +Y+A++  F  +     + P  GF   DTC+
Sbjct: 459 PASALAFDPASGRGTIVDAGTMFTRLSAPVYAAVRDVFRSRVRAPVTGPLGGF---DTCY 515

Query: 399 NLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALA---SLSYEDETGII 455
           N++    +++P V   F+G   +T+    +V    SD    CLA+A   S   +    ++
Sbjct: 516 NVT----ISVPTVTFSFDGRVSVTLPEENVVIRSSSDGIA-CLAMAAGPSDGVDAVLNVL 570

Query: 456 GNYQQKNQRVIYDTKNSQLGFAGEDCSS 483
            + QQ+N RV++D  N ++GF+ E C++
Sbjct: 571 ASMQQQNHRVLFDVANGRVGFSRELCTT 598


>gi|295830679|gb|ADG39008.1| AT5G10770-like protein [Capsella grandiflora]
          Length = 159

 Score =  119 bits (298), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 73/165 (44%), Positives = 99/165 (60%), Gaps = 8/165 (4%)

Query: 271 LSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATF 330
           LS  SQT+  +  +FSYCLPS+  A  +G L  G  S+    S   T    I +    +F
Sbjct: 1   LSFPSQTATAYNKIFSYCLPSS--ASYTGHLTFG--SAGISRSVKFTPIXTISDGN--SF 54

Query: 331 YILNLTGISIGGKQLQ--ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSA 388
           Y LN+ GI++GG++L   ++ F+  G LIDSGTVITRLPP  Y+AL++ F  Q S +P+A
Sbjct: 55  YGLNIVGITVGGQKLAIPSTVFSTPGALIDSGTVITRLPPKAYAALRSSFKAQMSKYPTA 114

Query: 389 PGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVK 433
            G SILDTCF+LS ++ V IP V   F G A + +   GI Y  K
Sbjct: 115 SGVSILDTCFDLSGFKTVTIPKVAFSFSGGAVVELGSKGIFYAFK 159


>gi|297819968|ref|XP_002877867.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297323705|gb|EFH54126.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 469

 Score =  119 bits (298), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 116/390 (29%), Positives = 175/390 (44%), Gaps = 53/390 (13%)

Query: 135 YIATIELGGRNMTV--IVDTGSDLTWVQCQP---CKSCY-----NQQDPVFDPSISPSYK 184
           Y  ++  G  + T+  + DTGS L W  C     C  C        Q P F P  S S +
Sbjct: 90  YSVSLSFGTPSQTIPFVFDTGSSLVWFPCTSRYLCSDCNFSGLDPTQIPRFIPKNSSSSR 149

Query: 185 KVLCNSSTCHALEFAT-------GNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL 237
            + C +  C  L  A         N+  C+   PP   Y + YG GS T G L  E L  
Sbjct: 150 VIGCQNPKCQFLFGANVQCRGCDPNTRNCTVPCPP---YILQYGLGS-TAGILISEKLDF 205

Query: 238 GKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGA 297
              +V DF+ GC   +       +G+ G GR   SL SQ        FS+CL S +    
Sbjct: 206 PDLTVPDFVVGCSVISTRT---PAGIAGFGRGPESLPSQMKL---KSFSHCLVSRRFDDT 259

Query: 298 SGSLILG---GNSSVFKNSTP-ITYTNMIPNPQLAT-----FYILNLTGISIGGKQLQ-- 346
           + +  LG   G+     + TP ++YT    NP ++      +Y LNL  I +G K ++  
Sbjct: 260 NVTTDLGLDTGSGHKSGSKTPGLSYTPFRKNPNVSNTAFLEYYYLNLRRIYVGSKHVKIP 319

Query: 347 ----ASGF-AKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGF---SILDTCF 398
               A G    GG ++DSG+  T +   ++  +  EF  Q S +         S +  CF
Sbjct: 320 YKFLAPGTNGNGGSIVDSGSTFTFMERPVFELVAEEFATQMSNYTREKDLEKVSGIAPCF 379

Query: 399 NLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETG----- 453
           N+S   +V +P +  EF+G A+M + ++    FV  +A  VCL + S +  +  G     
Sbjct: 380 NISGKGDVTVPELIFEFKGGAKMELPLSNYFSFV-GNADTVCLTVVSDNTVNPGGGTGPA 438

Query: 454 -IIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
            I+G++QQ+N  V YD +N + GFA + CS
Sbjct: 439 IILGSFQQQNYLVEYDLENDRFGFAKKKCS 468


>gi|295830681|gb|ADG39009.1| AT5G10770-like protein [Capsella grandiflora]
 gi|295830683|gb|ADG39010.1| AT5G10770-like protein [Capsella grandiflora]
 gi|295830685|gb|ADG39011.1| AT5G10770-like protein [Capsella grandiflora]
 gi|295830687|gb|ADG39012.1| AT5G10770-like protein [Capsella grandiflora]
          Length = 159

 Score =  119 bits (298), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 73/165 (44%), Positives = 99/165 (60%), Gaps = 8/165 (4%)

Query: 271 LSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATF 330
           LS  SQT+  +  +FSYCLPS+  A  +G L  G  S+    S   T    I +    +F
Sbjct: 1   LSFPSQTATAYNKIFSYCLPSS--ASYTGHLTFG--SAGISRSVKFTPIATISDGN--SF 54

Query: 331 YILNLTGISIGGKQLQ--ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSA 388
           Y LN+ GI++GG++L   ++ F+  G LIDSGTVITRLPP  Y+AL++ F  Q S +P+A
Sbjct: 55  YGLNIVGITVGGQKLAIPSTVFSTPGALIDSGTVITRLPPKAYAALRSSFKAQMSKYPTA 114

Query: 389 PGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVK 433
            G SILDTCF+LS ++ V IP V   F G A + +   GI Y  K
Sbjct: 115 SGVSILDTCFDLSGFKTVTIPKVAFSFSGGAVVELGSKGIFYAFK 159


>gi|255581545|ref|XP_002531578.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223528808|gb|EEF30814.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 442

 Score =  119 bits (298), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 111/367 (30%), Positives = 170/367 (46%), Gaps = 44/367 (11%)

Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNS 203
           +N+T+++DTGS+L+W+ C+  +      + VF+P  S +Y KV C S TC          
Sbjct: 80  QNVTMVLDTGSELSWLHCKKTQFL----NSVFNPLSSKTYSKVPCLSPTCKTRTRDLTIP 135

Query: 204 GVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCG----RNNKGLFGG 259
             C ++    C+  VSY D +   G L  E   LG  +    IFGC      +N      
Sbjct: 136 VSCDATKL--CHVIVSYADATSIEGNLAFETFRLGSLTKPATIFGCMDSGFSSNSEEDSK 193

Query: 260 VSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYT 319
            +GL+G+ R  LS V+Q        FSYC+     AG    ++L GN+S F    P++YT
Sbjct: 194 TTGLIGMNRGSLSFVNQMGY---PKFSYCISGFDSAG----VLLLGNAS-FPWLKPLSYT 245

Query: 320 NMI----PNPQLATF-YILNLTGISIGGK--QLQASGFA-----KGGILIDSGTVITRLP 367
            ++    P P      Y + L GI +  K   L  S F       G  ++DSGT  T L 
Sbjct: 246 PLVQISTPLPYFDRVAYTVQLEGIKVKNKVLSLPKSVFVPDHTGAGQTMVDSGTQFTFLL 305

Query: 368 PSIYSALKAEFLKQFSGFPSA---PGFSI---LDTCFNLSAYQE--VNIPLVKMEFEGNA 419
             +Y+ALK EFL Q  G         F     +D C+ L + +    N+P+V + F+G A
Sbjct: 306 GPVYTALKNEFLSQTRGILKVLNDDNFVFQGAMDLCYLLDSSRPNLQNLPVVSLMFQG-A 364

Query: 420 EMTVDVTGIVYFVKSDA----SQVCLALASLSYED-ETGIIGNYQQKNQRVIYDTKNSQL 474
           EM+V    ++Y V  +     S  C    +      E  +IG++ Q+N  + +D + S++
Sbjct: 365 EMSVSGERLLYRVPGEVRGRDSVWCFTFGNSDLLGVEAFVIGHHHQQNVWMEFDLEKSRI 424

Query: 475 GFAGEDC 481
           G A   C
Sbjct: 425 GLADVRC 431


>gi|356540369|ref|XP_003538662.1| PREDICTED: LOW QUALITY PROTEIN: probable aspartic protease
           At2g35615-like [Glycine max]
          Length = 364

 Score =  119 bits (297), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 119/361 (32%), Positives = 162/361 (44%), Gaps = 44/361 (12%)

Query: 134 NYIATIELGGRNMTV--IVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSS 191
           +Y+  + LG   + V  +VDT SDL W QC PC+ CY Q++P+FDP          CNS 
Sbjct: 30  DYLMKLTLGTPPVDVYGLVDTDSDLVWAQCTPCQGCYKQKNPMFDPLKE-------CNSF 82

Query: 192 TCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL----GKASVNDFIF 247
             H+          CS      C+Y  +Y D S T+G L +E        GK  V   IF
Sbjct: 83  FDHS----------CSPEKA--CDYVYAYADDSATKGMLAKEIATFSSTDGKPIVESIIF 130

Query: 248 GCGRNNKGLFGGVS-GLMGLGRSDLSLVSQTSEIFGG-LFSYCL-PSTQDAGASGSLILG 304
           GCG NN G+F     GL+GLG   LSLVSQ   ++G   FS CL P   D   SG++ LG
Sbjct: 131 GCGHNNTGVFNENDMGLIGLGGGPLSLVSQMGNLYGSKRFSQCLVPFHADPHTSGTISLG 190

Query: 305 GNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL---QASGFAKGGILIDSGT 361
             S V   S     T  + + +  T Y++ L GIS+G   +    +   +KG I+IDSGT
Sbjct: 191 EASDV---SGEGVVTTPLVSEEGQTPYLVTLEGISVGDTFVPFNSSEMLSKGNIMIDSGT 247

Query: 362 VITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNI--PLVKMEFEGNA 419
             T LP   Y  L  E   Q +     P     D    L    E N+  P++   FEG  
Sbjct: 248 PETYLPQEFYDRLVEELKVQIN---LPPIHVDPDLGTQLCYKSETNLEGPILTAHFEG-- 302

Query: 420 EMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGE 479
              V +  +  F+       C A+   +  D   I GN+ Q N  + +D     + F   
Sbjct: 303 -ADVKLLPLQTFIPPKDGVFCFAMTGTT--DGLYIFGNFAQSNVLIGFDLDKRIVFFKPT 359

Query: 480 D 480
           D
Sbjct: 360 D 360


>gi|4646203|gb|AAD26876.1|AC007230_10 Belongs to PF|00026 Eukaryotic aspartyl protease family
           [Arabidopsis thaliana]
          Length = 449

 Score =  119 bits (297), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 101/393 (25%), Positives = 179/393 (45%), Gaps = 51/393 (12%)

Query: 117 VSNTEIPLTSGIRLQTLN-YIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSC-----Y 168
           +++ ++PL    R+ ++  Y   I+LG   +   V VDTGSD+ W+ C+PC  C      
Sbjct: 55  LASIDLPLGGDSRVDSVGLYFTKIKLGSPPKEYHVQVDTGSDILWINCKPCPKCPTKTNL 114

Query: 169 NQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRG 228
           N +  +FD + S + KKV C+   C  +      S  C  +    C+Y + Y D S + G
Sbjct: 115 NFRLSLFDMNASSTSKKVGCDDDFCSFI----SQSDSCQPAL--GCSYHIVYADESTSDG 168

Query: 229 ELGREHLGLGKASVN--------DFIFGCGRNNKGLFG----GVSGLMGLGRSDLSLVSQ 276
           +  R+ L L + + +        + +FGCG +  G  G     V G+MG G+S+ S++SQ
Sbjct: 169 KFIRDMLTLEQVTGDLKTGPLGQEVVFGCGSDQSGQLGNGDSAVDGVMGFGQSNTSVLSQ 228

Query: 277 TSEIFGG--LFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILN 334
            +       +FS+CL + +  G     ++        +S  +  T M+PN      Y + 
Sbjct: 229 LAATGDAKRVFSHCLDNVKGGGIFAVGVV--------DSPKVKTTPMVPN---QMHYNVM 277

Query: 335 LTGISIGGKQLQ--ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFS 392
           L G+ + G  L    S    GG ++DSGT +   P  +Y +L    L +           
Sbjct: 278 LMGMDVDGTSLDLPRSIVRNGGTIVDSGTTLAYFPKVLYDSLIETILAR-----QPVKLH 332

Query: 393 ILDT---CFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYE 449
           I++    CF+ S   +   P V  EFE + ++TV     ++ ++ +          L+ +
Sbjct: 333 IVEETFQCFSFSTNVDEAFPPVSFEFEDSVKLTVYPHDYLFTLEEELYCFGWQAGGLTTD 392

Query: 450 DETGII--GNYQQKNQRVIYDTKNSQLGFAGED 480
           + + +I  G+    N+ V+YD  N  +G+A  +
Sbjct: 393 ERSEVILLGDLVLSNKLVVYDLDNEVIGWADHN 425


>gi|226492633|ref|NP_001149953.1| LOC100283580 precursor [Zea mays]
 gi|195635701|gb|ACG37319.1| pepsin A [Zea mays]
          Length = 491

 Score =  118 bits (296), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 112/387 (28%), Positives = 173/387 (44%), Gaps = 49/387 (12%)

Query: 127 GIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQ-----QDPVFDPSI 179
           G+   T  Y   IE+G   +   V VDTGSD+ WV C  C  C  +     +   +DP+ 
Sbjct: 76  GLPTDTGLYYTRIEIGSPPKGYYVQVDTGSDILWVNCIRCDGCPTRSGLGIELTQYDPAG 135

Query: 180 SPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGK 239
           S +   V C    C A   A G    C S+S P C + ++YGDGS T G    + +   +
Sbjct: 136 SGT--TVGCEQEFCVA-NSAGGVPPTCPSTSSP-CQFRITYGDGSTTTGFYVTDFVQYNQ 191

Query: 240 ASVN--------DFIFGCGRNNKGLFG----GVSGLMGLGRSDLSLVSQ--TSEIFGGLF 285
            S N           FGCG    G  G     + G++G G+SD S++SQ   +     +F
Sbjct: 192 VSGNGQTTTSNASITFGCGAQLGGDLGSSNQALDGILGFGQSDSSMLSQLAAARRVRKIF 251

Query: 286 SYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL 345
           ++CL + +     G +   GN    K  T    T ++PN    T Y +NL GIS+GG  L
Sbjct: 252 AHCLDTVR----GGGIFAIGNVVQPKVKT----TPLVPN---VTHYNVNLQGISVGGATL 300

Query: 346 Q--ASGFAKG---GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNL 400
           Q   S F  G   G +IDSGT +  LP  +Y  L A    ++   P       +  CF  
Sbjct: 301 QLPTSTFDSGDSKGTIIDSGTTLAYLPREVYRTLLAAVFDKYQDLPLHNYQDFV--CFQF 358

Query: 401 SAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETG----IIG 456
           S   +   P++   F+G  ++T++V    Y  ++     C+       + + G    ++G
Sbjct: 359 SGSIDDGFPVITFSFKG--DLTLNVYPDDYLFQNRNDLYCMGFLDGGVQTKDGKDMLLLG 416

Query: 457 NYQQKNQRVIYDTKNSQLGFAGEDCSS 483
           +    N+ V+YD +   +G+   +CSS
Sbjct: 417 DLVLSNKLVVYDLEKEVIGWTDYNCSS 443


>gi|356558300|ref|XP_003547445.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 447

 Score =  118 bits (296), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 117/422 (27%), Positives = 187/422 (44%), Gaps = 57/422 (13%)

Query: 85  NEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELGGR 144
           NE  ++R+ LD  H     + I+  I G++  VSN +        L     +A I +G  
Sbjct: 53  NETAKDRMELDIQHSAARLANIQARIEGSL--VSNNDYKARVSPSLTGRTIMANISIGQP 110

Query: 145 NMT--VIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYK---KVLCNSSTCHALEFA 199
            +   V++DTGSD+ WV C PC +C N    +FDPS S ++    K  C+   C      
Sbjct: 111 PIPQLVVMDTGSDILWVMCTPCTNCDNDLGLLFDPSKSSTFSPLCKTPCDFEGCRCDPIP 170

Query: 200 TGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL-----GKASVNDFIFGCGRN-N 253
                           + V+Y D S   G  GR+ +       G + ++D +FGCG N  
Sbjct: 171 ----------------FTVTYADNSTASGTFGRDTVVFETTDEGTSRISDVLFGCGHNIG 214

Query: 254 KGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGAS-GSLILGGNSSVFKN 312
                G +G++GL     SLV++     G  FSYC+ +  D   +   LILG  + +   
Sbjct: 215 HDTDPGHNGILGLNNGPDSLVTK----LGQKFSYCIGNLADPYYNYHQLILGEGADLEGY 270

Query: 313 STPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA-------KGGILIDSGTVITR 365
           STP    N         FY + + GIS+G K+L  +           GG++ID+G+ IT 
Sbjct: 271 STPFEVYN--------GFYYVTMEGISVGEKRLDIAPETFEMKENRAGGVIIDTGSTITF 322

Query: 366 LPPSIYSALKAEF--LKQFSGFPSAPGFSILDTCFNLSAYQE-VNIPLVKMEFEGNAEMT 422
           L  S++  L  E   L  +S   +    S    CF  S  ++ V  P+V   F   A++ 
Sbjct: 323 LVDSVHKLLSKEVRNLLGWSFRQATIEKSPWMQCFYGSISRDLVGFPVVTFHFSDGADLA 382

Query: 423 VDVTGIVYFVKSDASQVCLA---LASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGE 479
           +D     +F + + +  C+    ++SL+ + +  +IG   Q++  V YD  N  + F   
Sbjct: 383 LDSGS--FFNQLNDNVFCMTVGPVSSLNIKSKPSLIGLLAQQSYNVGYDLVNQFVYFQRI 440

Query: 480 DC 481
           DC
Sbjct: 441 DC 442


>gi|88174565|gb|ABD39357.1| chloroplast nucleoid DNA-binding protein [Oryza glumipatula]
          Length = 323

 Score =  118 bits (296), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 97/326 (29%), Positives = 159/326 (48%), Gaps = 28/326 (8%)

Query: 135 YIATIELGGRNMTVIV--DTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
           Y+ ++ LG  + T IV  DTGS  +WV C+ C  C+      F  S S +  KV C +S 
Sbjct: 1   YVISVGLGTPSKTQIVEIDTGSSTSWVFCE-CDGCHTNPR-TFLQSRSTTCAKVSCGTSM 58

Query: 193 CHALEFATGNSGVC-SSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCG 250
           C       G+   C  S + PDC + VSY DGS + G L ++ L       +  F FGC 
Sbjct: 59  C----LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPGFTFGCN 114

Query: 251 RNNKGL--FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQD-----AGASGSLIL 303
            ++ G   FG V GL+G+G   +S++ Q+S  F G FSYCLP         +  +G   L
Sbjct: 115 MDSFGANEFGNVDGLLGMGAGQMSVLKQSSPTFDG-FSYCLPLQMSERGFFSKTTGYFSL 173

Query: 304 GGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQ--LQASGFAKGGILIDSGT 361
           GG   +    T + YT M+   +    + ++LT IS+ G++  L  S F++ G++ DSG+
Sbjct: 174 GGK--IAATRTDVRYTKMVARRKNTELFFVDLTAISVDGERLGLSPSIFSRKGVVFDSGS 231

Query: 362 VITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEM 421
            ++ +P    S L ++ +++      A        C+++ +  E ++P + + F+  A  
Sbjct: 232 ELSYIPDRALSVL-SQRIRELLLRRGAAEEESERNCYDMRSVDEGDMPAISLHFDDGARF 290

Query: 422 TVDVTGIVYFVKSDASQ---VCLALA 444
            +   G+  FV+    +    CLA A
Sbjct: 291 DLGRHGV--FVERSVQEQDVWCLAFA 314


>gi|413937239|gb|AFW71790.1| hypothetical protein ZEAMMB73_638381 [Zea mays]
          Length = 537

 Score =  118 bits (296), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 82/268 (30%), Positives = 134/268 (50%), Gaps = 26/268 (9%)

Query: 230 LGREHLGLGKAS--VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSY 287
           LG++ L L      V  + FGC R   G      GL+G G   LS  SQ  +++G +FSY
Sbjct: 282 LGQDALALHDDVDVVAAYTFGCLRVVTGGSVPPQGLVGFGCGPLSFPSQNKDVYGFVFSY 341

Query: 288 CLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQA 347
           CLPS + +  S +L LG           I  T ++ NP   + Y +N+ GI +GG+ +  
Sbjct: 342 CLPSYKSSNFSSTLRLGPAG----QPKRIKMTPLLSNPHRPSLYYVNMVGIHVGGRPMLV 397

Query: 348 SGFAKG-------GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAP--GFSILDTCF 398
              A         G ++D+GT+ TRL   +Y+A++  F  +     + P  GF   DTC+
Sbjct: 398 PASALAFDPASGRGTIVDAGTMFTRLSAPVYAAVRDVFRSRVRAPVTGPLGGF---DTCY 454

Query: 399 NLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALA---SLSYEDETGII 455
           N++    +++P V   F+G   +T+    +V    SD    CLA+A   S   +    ++
Sbjct: 455 NVT----ISVPTVTFSFDGRVSVTLPEENVVIRSSSDGIA-CLAMAAGPSDGVDAVLNVL 509

Query: 456 GNYQQKNQRVIYDTKNSQLGFAGEDCSS 483
            + QQ+N RV++D  N ++GF+ E C++
Sbjct: 510 ASMQQQNHRVLFDVANGRVGFSRELCTT 537


>gi|357164972|ref|XP_003580227.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 492

 Score =  118 bits (295), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 108/395 (27%), Positives = 173/395 (43%), Gaps = 63/395 (15%)

Query: 146 MTVIVDTGSDLTWVQCQP--CKSCY---------NQQDPVFDPSISPSYKKVLCNSSTCH 194
           +++ +DTGSDL W  C P  C  C          N  +P+  P+ S   +++ C S  C 
Sbjct: 98  VSLFLDTGSDLVWFPCAPFTCMLCEGKPTPPGNNNSSNPLPPPTDS---RRIPCASPFCS 154

Query: 195 ALEFATGNSGVCSSSSPP-------DC-------NYFVSYGDGSYTRGELGREHLGLGKA 240
           A   +   + +C+++  P        C         + +YGDGS     L R  +G+  +
Sbjct: 155 AAHSSAPPADLCAAARCPLDDIETGSCAASHACPPLYYAYGDGSLV-ARLRRGRVGIAAS 213

Query: 241 -SVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEI-FGGLFSYCLPS----TQD 294
            +V +F F C        G   G+ G GR  LSL +Q +     G FSYCL +       
Sbjct: 214 VAVENFTFACAHT---ALGEPVGVAGFGRGPLSLPAQLAPAALSGRFSYCLVAHSFRADR 270

Query: 295 AGASGSLILGGNSSVFKNS-TPITYTNMIPNPQLATFYILNLTGISIGGKQLQAS----- 348
                 LILG +      S T I YT ++ NP+   FY + L  +S+GG ++ A      
Sbjct: 271 PIRPSPLILGRSPGEDPASETGIVYTPLLHNPKHPYFYSVALEAVSVGGTRIPARPELGR 330

Query: 349 -GFA-KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILD-----TCF--- 398
            G A  GG+++DSGT  T LP   Y+ +  EF +  +        +  D      C+   
Sbjct: 331 VGRAGDGGMVVDSGTTFTMLPNETYARVAEEFGRAMAAARFERAEAAEDQTGLAPCYYYD 390

Query: 399 -NLSAYQEVN---IPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV--CLALASLSYED-- 450
            + SA +E +   +P + M F G A + +         +S+  +   CL L +   +D  
Sbjct: 391 HDASAAEEGSARAVPPLAMHFRGEATVVLPRRNYFMGFRSEERRRVGCLMLMNGGEDDGG 450

Query: 451 -ETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
              G +GN+QQ+   V+YD    ++GFA   C+ +
Sbjct: 451 GPAGTLGNFQQQGFEVVYDVDAGRVGFARRRCTDL 485


>gi|88174567|gb|ABD39358.1| chloroplast nucleoid DNA-binding protein [Oryza glumipatula]
          Length = 323

 Score =  118 bits (295), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 95/326 (29%), Positives = 158/326 (48%), Gaps = 28/326 (8%)

Query: 135 YIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
           Y+ ++ LG   +   V +DTGS  +WV C+ C  C+      F  S S +  KV C +S 
Sbjct: 1   YVISVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNPR-TFLQSRSTTCAKVSCGTSM 58

Query: 193 CHALEFATGNSGVC-SSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCG 250
           C       G+   C  S + PDC + VSY DGS + G L ++ L       +  F FGC 
Sbjct: 59  C----LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPGFTFGCN 114

Query: 251 RNNKGL--FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQD-----AGASGSLIL 303
            ++ G   FG V GL+G+G   +S++ Q+S  F G FSYCLP         +  +G   L
Sbjct: 115 MDSFGANEFGNVDGLLGMGAGQMSVLKQSSPTFDG-FSYCLPLQMSERGFFSKTTGYFSL 173

Query: 304 GGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQ--LQASGFAKGGILIDSGT 361
           GG   +    T + YT M+   +    + ++LT IS+ G++  L  S F++ G++ DSG+
Sbjct: 174 GGK--IAATRTDVRYTKMVARRKNTELFFVDLTAISVDGERLGLSPSIFSRKGVVFDSGS 231

Query: 362 VITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEM 421
            ++ +P    S L ++ +++      A        C+++ +  E ++P + + F+  A  
Sbjct: 232 ELSYIPDRALSVL-SQRIRELLLRRGAAEEESERNCYDMRSVDEGDMPAISLHFDDGARF 290

Query: 422 TVDVTGIVYFVKSDASQ---VCLALA 444
            +   G+  FV+    +    CLA A
Sbjct: 291 DLGSHGV--FVERSVQEQDVWCLAFA 314


>gi|302764208|ref|XP_002965525.1| hypothetical protein SELMODRAFT_406966 [Selaginella moellendorffii]
 gi|300166339|gb|EFJ32945.1| hypothetical protein SELMODRAFT_406966 [Selaginella moellendorffii]
          Length = 464

 Score =  118 bits (295), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 108/389 (27%), Positives = 167/389 (42%), Gaps = 64/389 (16%)

Query: 116 DVSNTEIPLTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCK-SCYNQQD 172
           D++ T +  T+G       Y ++I LG   ++ ++++DTGSDLTWV+C PC   C +   
Sbjct: 110 DLAQTPVSFTNGG-----VYYSSITLGSPPKDFSLVMDTGSDLTWVRCDPCSPDCSS--- 161

Query: 173 PVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGR 232
             FD   S +YK + C                             +      +  G   R
Sbjct: 162 -TFDRLASNTYKALTCADDL--------------------RLPVLLRLWRRLFHSGRSLR 200

Query: 233 EHLGLGKASVND------FIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFS 286
           + L +  A+ ++      F+FGCG   KGL  G  G++ L    LS  SQ  E +G  FS
Sbjct: 201 DTLKMAGAASDELEEFPGFVFGCGSLLKGLISGEVGILALSPGSLSFPSQIGEKYGNKFS 260

Query: 287 YCL--PSTQDAGASGSLILGGNSSVFKN-----STPITYTNMIPNPQLATFYILNLTGIS 339
           YCL   + Q++     ++ G  +   K         + YT   P  + + +Y + L GIS
Sbjct: 261 YCLLRQTAQNSLKKSPMVFGEAAVELKEPGSGKPQELQYT---PIGESSIYYTVRLDGIS 317

Query: 340 IGGKQLQ--ASGFAKG---GILIDSGTVITRLPPSIYSALKAEFLKQFSG--FPSAPGFS 392
           +G ++L    S F  G     + DSGT +T LP  +  ++K       SG  F +  G  
Sbjct: 318 VGNQRLDLSPSTFLNGQDKPTIFDSGTTLTMLPSGVCDSIKQSLASMVSGAEFVAIKG-- 375

Query: 393 ILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDET 452
            LD CF +       +P +   F G A+    VT    +V    S  CL        +E 
Sbjct: 376 -LDACFRVPPSSGQGLPDITFHFNGGADF---VTRPSNYVIDLGSLQCLIFVP---TNEV 428

Query: 453 GIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
            I GN QQ++  V++D  N ++GF   DC
Sbjct: 429 SIFGNLQQQDFFVLHDMDNRRIGFKETDC 457


>gi|297795137|ref|XP_002865453.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297311288|gb|EFH41712.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 665

 Score =  118 bits (295), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 102/366 (27%), Positives = 154/366 (42%), Gaps = 57/366 (15%)

Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNS 203
           +   +IVDTGS +T+V C  CK C   QDP F P +S SYK + CN              
Sbjct: 91  QEFALIVDTGSTVTYVPCSTCKQCGKHQDPKFQPELSSSYKALKCN-------------- 136

Query: 204 GVCSSSSPPDCN---------YFVSYGDGSYTRGELGREHLGLGKAS---VNDFIFGCGR 251
                   PDCN         Y   Y + S + G L  + +  G  S       +FGC  
Sbjct: 137 --------PDCNCDDEGKLCVYERRYAEMSSSSGVLSEDLISFGNESQLTPQRAVFGCEN 188

Query: 252 NNKG-LFG-GVSGLMGLGRSDLSLVSQTSE--IFGGLFSYCLPSTQDAGASGSLILGGNS 307
              G LF     G+MGLGR  LS+V Q  +  +   +FS C    +  G  G+++LG   
Sbjct: 189 VETGDLFSQRADGIMGLGRGKLSVVDQLVDKGVIEDVFSLCYGGMEVGG--GAMVLG--- 243

Query: 308 SVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA---KGGILIDSGTVIT 364
              K S P        +P  + +Y ++L  + + GK L+ +      K G ++DSGT   
Sbjct: 244 ---KISPPAGMVFSHSDPFRSPYYNIDLKQMHVAGKSLKLNPKVFNGKHGTVLDSGTTYA 300

Query: 365 RLPPSIYSALKAEFLKQFSGFPS--APGFSILDTCFNLSAYQEVNI----PLVKMEFEGN 418
             P   + A+K   +K+         P  +  D CF+ +      I    P + MEF GN
Sbjct: 301 YFPKEAFIAIKDAIIKEIPSLKRIHGPDPNYDDVCFSGAGRDVAEIHNFFPEIDMEF-GN 359

Query: 419 AEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAG 478
            +  + ++   Y  +    +    L      D T ++G    +N  V YD +N +LGF  
Sbjct: 360 GQKLI-LSPENYLFRHTKVRGAYCLGIFPDRDSTTLLGGIVVRNTLVTYDRENDKLGFLK 418

Query: 479 EDCSSM 484
            +CS +
Sbjct: 419 TNCSDL 424


>gi|297745479|emb|CBI40559.3| unnamed protein product [Vitis vinifera]
          Length = 436

 Score =  117 bits (294), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 80/250 (32%), Positives = 112/250 (44%), Gaps = 64/250 (25%)

Query: 124 LTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISP 181
           +TSG+   +  Y   + +G   + + +++DTGSD+ W+QC PC+ CY+Q DPVFDP  S 
Sbjct: 163 VTSGLAQGSGEYFTRLGVGTPPKYVYMVLDTGSDVVWIQCAPCRKCYSQTDPVFDPKKSG 222

Query: 182 SYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS 241
           S+  + C S  C  L+     S  C+S     C Y V+YGDGS+T GE   E L      
Sbjct: 223 SFSSISCRSPLCLRLD-----SPGCNSRQ--SCLYQVAYGDGSFTFGEFSTETLTFRGTR 275

Query: 242 VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSL 301
           V     GCG +N+GLF G +GL+GLGR                     P        G+ 
Sbjct: 276 VPKVALGCGHDNEGLFVGAAGLLGLGRQ--------------------PRLNRPPVGGAR 315

Query: 302 ILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGGILIDSGT 361
           + G  +S+FK                                 L  +G   GG++IDSGT
Sbjct: 316 VAGITASLFK---------------------------------LDTAG--NGGVIIDSGT 340

Query: 362 VITRLPPSIY 371
            +TRL    Y
Sbjct: 341 SVTRLTRRAY 350


>gi|358345197|ref|XP_003636668.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355502603|gb|AES83806.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 294

 Score =  117 bits (294), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 84/239 (35%), Positives = 115/239 (48%), Gaps = 25/239 (10%)

Query: 115 KDVSNTEIPLTSGIRLQTLNYIATIELGGRNMTVIV--DTGSDLTWVQCQPCKSCYNQQD 172
           KD  N    + S +     +Y+  + +G   + +    DTGSDL W+QC PC +CY Q +
Sbjct: 40  KDFFNRNT-IQSPVSANHYDYLMELSIGTPPVKIYAQADTGSDLIWLQCIPCTNCYKQLN 98

Query: 173 PVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPD---CNYFVSYGDGSYTRGE 229
           P+FD   S ++  + C S +C  L          S+S  PD   C Y  SY DGS T+G 
Sbjct: 99  PMFDSQSSSTFSNIACGSESCSKLY---------STSCSPDQINCKYNYSYVDGSETQGV 149

Query: 230 LGREHLGLG-----KASVNDFIFGCGRNNKGLFGGVS-GLMGLGRSDLSLVSQTSEIFGG 283
           L +E L L        +    IFGCG NN G F     G++GLGR  LSLVSQ     GG
Sbjct: 150 LAQETLTLTSTTGEPVAFKGVIFGCGHNNNGAFNDKEMGIIGLGRGPLSLVSQIGSSLGG 209

Query: 284 -LFSYCL-PSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISI 340
            +FS CL P   +   S  +  G  S V  N   +  T ++      +FY + L GIS+
Sbjct: 210 NMFSQCLVPFNTNPSISSPMSFGKGSEVLGNG--VVSTPLVSKTTYQSFYFVTLLGISV 266


>gi|242076594|ref|XP_002448233.1| hypothetical protein SORBIDRAFT_06g023740 [Sorghum bicolor]
 gi|241939416|gb|EES12561.1| hypothetical protein SORBIDRAFT_06g023740 [Sorghum bicolor]
          Length = 508

 Score =  117 bits (294), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 108/407 (26%), Positives = 166/407 (40%), Gaps = 82/407 (20%)

Query: 146 MTVIVDTGSDLTWVQCQP--CKSCYNQQDPVFDPSISPSY--------KKVLCNSSTCHA 195
           +++ +DTGSDL W  C P  C  C  +  P    S S           ++V C S  C A
Sbjct: 109 VSLFLDTGSDLVWFPCAPFTCMLCEGKPTPSGGHSSSAPLPLPPPPDSRRVPCASPLCSA 168

Query: 196 LEFATGNSGVCSSSSPP-------DCN--------YFVSYGDGSYTRGELGREHLGLGKA 240
              +   S +C+++  P        C          + +YGDGS     L R  +GLG +
Sbjct: 169 AHASAPPSDLCAAAGCPLEDIETGSCRGASHACPPLYYAYGDGSLV-AHLRRGRVGLGAS 227

Query: 241 -SVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAG--- 296
            +V++F F C        G   G+ G GR  LSL  Q +    G FSYCL S        
Sbjct: 228 VAVDNFTFACAHT---ALGEPVGVAGFGRGPLSLPGQLAPQLSGRFSYCLVSHSFRADRL 284

Query: 297 -ASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASG------ 349
                LILG +      +    YT ++ NP+   FY + L  +S+G  ++QA        
Sbjct: 285 IRPSPLILGRSPDAAAETGGFVYTPLLHNPKHPYFYSVALEAVSVGATRIQARPELARVD 344

Query: 350 -FAKGGILIDSGTVITRLPPSIYSALKA--------------EFLKQFSGFPSAPGFSIL 394
               GG+++DSGT  T LP   Y+ +                E  ++ +G         L
Sbjct: 345 RAGNGGMVVDSGTTFTMLPNETYARVAEAFARAMAAAGFARAERAEEQTG---------L 395

Query: 395 DTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV---------CLAL-- 443
             C++ +A  +  +P + + F GNA + +         KS+             CL L  
Sbjct: 396 TPCYHYAA-SDRGVPPLALHFRGNATVALPRRNYFMGFKSEEEAGGAGRKDDVGCLMLMN 454

Query: 444 -ASLSYED-----ETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
              +S ED       G +GN+QQ+   V+YD    ++GFA   C+ +
Sbjct: 455 GGDVSGEDGGDDGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRCTEL 501


>gi|388518257|gb|AFK47190.1| unknown [Lotus japonicus]
          Length = 478

 Score =  117 bits (293), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 117/433 (27%), Positives = 198/433 (45%), Gaps = 65/433 (15%)

Query: 86  EQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELGG-- 143
           +++Q  L     H    + RI + +  N+       +P  +G+      Y   I LG   
Sbjct: 29  QRRQASLTGIKAHDSSRRGRILSAVDFNL---GGNGLPTVTGL------YFTKIGLGSPS 79

Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYNQQD-----PVFDPSISPSYKKVLCNSSTCHALEF 198
           ++  V VDTGSD+ WV C  C  C  + D      ++DP  S + + V C  + C +   
Sbjct: 80  KDYYVQVDTGSDILWVNCVECTRCPRKSDIGIGLTLYDPKRSKTSEFVSCEHNFCSST-- 137

Query: 199 ATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVN--------DFIFGCG 250
             G    C + +P  C Y +SYGDGS T G   +++L   + + N          IFGCG
Sbjct: 138 YEGRILGCKAENP--CPYSISYGDGSATTGYYVQDYLTFNRVNGNPHTATQNSSIIFGCG 195

Query: 251 RNNKGLFG-----GVSGLMGLGRSDLSLVSQ--TSEIFGGLFSYCLPSTQDAGASGSLIL 303
               G F       + G++G G+++ S++SQ   S     +FS+CL +          + 
Sbjct: 196 AAQSGTFASSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSHCLDTN---------VG 246

Query: 304 GGNSSVFKNSTP-ITYTNMIPNPQLATFYILNLTGISIGGK--QLQASGFAK---GGILI 357
           GG  S+ +   P +  T ++PN  +A + ++ L  I + G   QL +  F      G +I
Sbjct: 247 GGIFSIGEVVEPKVKTTPLVPN--MAHYNVI-LKNIEVDGDILQLPSDTFDSENGKGTVI 303

Query: 358 DSGTVITRLPPSIYSALKAEFL-KQFSGFPSAPGFSILD--TCFNLSAYQEVNIPLVKME 414
           DSGT +  LP  +Y  L ++ L KQ    P    + + +  +CF  +   +   P+VK+ 
Sbjct: 304 DSGTTLAYLPRIVYDQLMSKVLAKQ----PRLKVYLVEEQYSCFQYTGNVDSGFPIVKLH 359

Query: 415 FEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETG----IIGNYQQKNQRVIYDTK 470
           FE +  +TV     ++  K D S  C+     + E + G    ++G++   N+ V+YD +
Sbjct: 360 FEDSLSLTVYPHDYLFNYKGD-SYWCIGWQKSASETKNGKDMTLLGDFVLSNKLVVYDLE 418

Query: 471 NSQLGFAGEDCSS 483
           N  +G+   +CSS
Sbjct: 419 NMTIGWTDYNCSS 431


>gi|297828736|ref|XP_002882250.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297328090|gb|EFH58509.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 488

 Score =  117 bits (293), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 112/401 (27%), Positives = 177/401 (44%), Gaps = 61/401 (15%)

Query: 117 VSNTEIPLTSGIRLQTLN-YIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDP 173
           +S  ++PL    + +++  Y A I LG   R+  V VDTGSD+ WV C  C  C  + D 
Sbjct: 66  LSAIDLPLGGDSQPESIGLYFAKIGLGTPSRDFHVQVDTGSDILWVNCAGCIRCPRKSDL 125

Query: 174 V----FDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGE 229
           V    +D   S + K V C+ + C  +         C S S   C Y + YGDGS T G 
Sbjct: 126 VELTPYDADASSTAKSVSCSDNFCSYVN----QRSECHSGST--CQYVILYGDGSSTNGY 179

Query: 230 LGREHLGLGKASVN--------DFIFGCGRNNKGLFG----GVSGLMGLGRSDLSLVSQT 277
           L R+ + L   + N          IFGCG    G  G     V G+MG G+S+ S +SQ 
Sbjct: 180 LVRDVVHLDLVTGNRQTGSTNGTIIFGCGSKQSGQLGESQAAVDGIMGFGQSNSSFISQL 239

Query: 278 SE--IFGGLFSYCLPSTQDAG--ASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYIL 333
           +        F++CL +    G  A G ++          S  +  T M+     +  Y +
Sbjct: 240 ASQGKVKRSFAHCLDNNNGGGIFAIGEVV----------SPKVKTTPMLSK---SAHYSV 286

Query: 334 NLTGISIGGK--QLQASGFAKG---GILIDSGTVITRLPPSIYSALKAEFL---KQFSGF 385
           NL  I +G    QL +  F  G   G++IDSGT +  LP ++Y+ L  + L   ++ +  
Sbjct: 287 NLNAIEVGNSVLQLSSDAFDSGDDKGVIIDSGTTLVYLPDAVYNPLMNQILASHQELNLH 346

Query: 386 PSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALAS 445
                F    TCF+     +   P V  +F+ +  + V     ++ V+ D    C    +
Sbjct: 347 TVQDSF----TCFHYIDRLD-RFPTVTFQFDKSVSLAVYPQEYLFQVREDT--WCFGWQN 399

Query: 446 LSYEDETG----IIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
              + + G    I+G+    N+ V+YD +N  +G+   +CS
Sbjct: 400 GGLQTKGGASLTILGDMALSNKLVVYDIENQVIGWTNHNCS 440


>gi|255538124|ref|XP_002510127.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223550828|gb|EEF52314.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 641

 Score =  117 bits (293), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 108/366 (29%), Positives = 160/366 (43%), Gaps = 56/366 (15%)

Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNS 203
           +   +IVDTGS +T+V C  C+ C   QDP F P  S +YK + CN S C+         
Sbjct: 99  QEFALIVDTGSTVTYVPCSTCEQCGKHQDPRFQPESSSTYKPMQCNPS-CN--------- 148

Query: 204 GVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS---VNDFIFGCGRNNKG-LFG- 258
             C       C Y   Y + S + G L  + L  G  S       IFGC     G LF  
Sbjct: 149 --CDDEG-KQCTYERRYAEMSSSSGLLAEDVLSFGNESELTPQRAIFGCETVETGELFSQ 205

Query: 259 GVSGLMGLGRSDLSLVSQ--TSEIFGGLFSYCLPSTQDAGASGSLILGG----NSSVFKN 312
              G+MGLGR  LS+V Q    E+ G  FS C       G  G+++LG        VF +
Sbjct: 206 RADGIMGLGRGPLSVVDQLVIKEVVGNSFSLCYGGMDVVG--GAMVLGNIPPPPDMVFAH 263

Query: 313 STPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA---KGGILIDSGTVITRLPPS 369
           S          +P  + +Y + L  + + GK+L+ +      K G ++DSGT    LP  
Sbjct: 264 S----------DPYRSAYYNIELKELHVAGKRLKLNPRVFDGKHGTVLDSGTTYAYLPEE 313

Query: 370 IYSALK------AEFLKQFSGFPSAPGFSILDTCFNLSAYQEVN-----IPLVKMEFEGN 418
            + A K       +FLKQ  G    P  S  D CF+  A ++V+      P V M F   
Sbjct: 314 AFVAFKDAIIKEIKFLKQIHG----PDPSYNDICFS-GAGRDVSQLSKIFPEVNMVFGNG 368

Query: 419 AEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAG 478
            ++++     ++     +   CL +   + +D T ++G    +N  V YD  N ++GF  
Sbjct: 369 QKLSLSPENYLFRHTKVSGAYCLGIFQ-NGKDPTTLLGGIVVRNTLVTYDRDNDKIGFWK 427

Query: 479 EDCSSM 484
            +CS +
Sbjct: 428 TNCSEL 433


>gi|356523171|ref|XP_003530215.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 442

 Score =  117 bits (293), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 112/373 (30%), Positives = 168/373 (45%), Gaps = 41/373 (10%)

Query: 138 TIELGGRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALE 197
           T+    +NM++++DTGS+L+W+ C    +      P F+P+IS SY  + C+S TC    
Sbjct: 71  TVGTPPQNMSMVIDTGSELSWLHCN-TNTTATIPYPFFNPNISSSYTPISCSSPTCTTRT 129

Query: 198 FATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRN----N 253
                   C S++   C+  +SY D S + G L  +  G G +     +FGC  +    N
Sbjct: 130 RDFPIPASCDSNN--LCHATLSYADASSSEGNLASDTFGFGSSFNPGIVFGCMNSSYSTN 187

Query: 254 KGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNS 313
                  +GLMG+    LSLVSQ        FSYC+     +  SG L+LG   S F   
Sbjct: 188 SESDSNTTGLMGMNLGSLSLVSQLKI---PKFSYCI---SGSDFSGILLLG--ESNFSWG 239

Query: 314 TPITYTNMI----PNPQL-ATFYILNLTGISIGGKQLQASG-------FAKGGILIDSGT 361
             + YT ++    P P    + Y + L GI I  K L  SG          G  + D GT
Sbjct: 240 GSLNYTPLVQISTPLPYFDRSAYTVRLEGIKISDKLLNISGNLFVPDHTGAGQTMFDLGT 299

Query: 362 VITRLPPSIYSALKAEFLKQFSGFPSA---PGFSI---LDTCFNLSAYQE--VNIPLVKM 413
             + L   +Y+AL+ EFL Q +G   A   P F     +D C+ +   Q     +P V +
Sbjct: 300 QFSYLLGPVYNALRDEFLNQTNGTLRALDDPNFVFQIAMDLCYRVPVNQSELPELPSVSL 359

Query: 414 EFEGNAEMTVDVTGIVY----FVKSDASQVCLALASLSYED-ETGIIGNYQQKNQRVIYD 468
            FEG AEM V    ++Y    FV  + S  C    +      E  IIG++ Q++  + +D
Sbjct: 360 VFEG-AEMRVFGDQLLYRVPGFVWGNDSVYCFTFGNSDLLGVEAFIIGHHHQQSMWMEFD 418

Query: 469 TKNSQLGFAGEDC 481
               ++G A   C
Sbjct: 419 LVEHRVGLAHARC 431


>gi|18405138|ref|NP_565911.1| Eukaryotic aspartyl protease family protein [Arabidopsis thaliana]
 gi|13877759|gb|AAK43957.1|AF370142_1 unknown protein [Arabidopsis thaliana]
 gi|15293231|gb|AAK93726.1| unknown protein [Arabidopsis thaliana]
 gi|20196976|gb|AAB87120.2| expressed protein [Arabidopsis thaliana]
 gi|20197046|gb|AAM14894.1| expressed protein [Arabidopsis thaliana]
 gi|330254616|gb|AEC09710.1| Eukaryotic aspartyl protease family protein [Arabidopsis thaliana]
          Length = 442

 Score =  117 bits (293), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 107/368 (29%), Positives = 173/368 (47%), Gaps = 49/368 (13%)

Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNS 203
           +N+++++DTGS+L+W+ C+   +  +    VF+P  S +Y  V C+S  C      T + 
Sbjct: 76  QNISMVLDTGSELSWLHCKKSPNLGS----VFNPVSSSTYSPVPCSSPICRT---RTRDL 128

Query: 204 GVCSSSSPPD--CNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCG----RNNKGLF 257
            + +S  P    C+  +SY D +   G L  E   +G  +    +FGC      +N    
Sbjct: 129 PIPASCDPKTHLCHVAISYADATSIEGNLAHETFVIGSVTRPGTLFGCMDSGLSSNSEED 188

Query: 258 GGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPIT 317
              +GLMG+ R  LS V+Q   +    FSYC+  +    +SG L+LG  S  +    PI 
Sbjct: 189 AKSTGLMGMNRGSLSFVNQ---LGFSKFSYCISGSD---SSGFLLLGDASYSWLG--PIQ 240

Query: 318 YTNMI----PNPQLATF-YILNLTGISIGGK--QLQASGFA-----KGGILIDSGTVITR 365
           YT ++    P P      Y + L GI +G K   L  S F       G  ++DSGT  T 
Sbjct: 241 YTPLVLQSTPLPYFDRVAYTVQLEGIRVGSKILSLPKSVFVPDHTGAGQTMVDSGTQFTF 300

Query: 366 LPPSIYSALKAEFLKQFSG---FPSAPGFSI---LDTCFNLSAYQEVN---IPLVKMEFE 416
           L   +Y+ALK EF+ Q          P F     +D C+ + +    N   +P+V + F 
Sbjct: 301 LMGPVYTALKNEFITQTKSVLRLVDDPDFVFQGTMDLCYKVGSTTRPNFSGLPMVSLMFR 360

Query: 417 GNAEMTVDVTGIVYFVKSDASQVCLALASLSYED------ETGIIGNYQQKNQRVIYDTK 470
           G AEM+V    ++Y V    S+    +   ++ +      E  +IG++ Q+N  + +D  
Sbjct: 361 G-AEMSVSGQKLLYRVNGAGSEGKEEVYCFTFGNSDLLGIEAFVIGHHHQQNVWMEFDLA 419

Query: 471 NSQLGFAG 478
            S++GFAG
Sbjct: 420 KSRVGFAG 427


>gi|356565521|ref|XP_003550988.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 634

 Score =  117 bits (293), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 101/355 (28%), Positives = 163/355 (45%), Gaps = 38/355 (10%)

Query: 146 MTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGV 205
             +IVDTGS +T+V C  C+ C   QDP F P  S +Y+ V C +  C+           
Sbjct: 97  FALIVDTGSTVTYVPCSTCEQCGRHQDPKFQPESSSTYQPVKC-TIDCN----------- 144

Query: 206 CSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS---VNDFIFGCGRNNKG-LFG-GV 260
           C S     C Y   Y + S + G LG + +  G  S       +FGC     G L+    
Sbjct: 145 CDSDR-MQCVYERQYAEMSTSSGVLGEDLISFGNQSELAPQRAVFGCENVETGDLYSQHA 203

Query: 261 SGLMGLGRSDLSLVSQTSE--IFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITY 318
            G+MGLGR DLS++ Q  +  +    FS C     D G  G+++LGG S    +     Y
Sbjct: 204 DGIMGLGRGDLSIMDQLVDKNVISDSFSLCY-GGMDVGG-GAMVLGGISP--PSDMAFAY 259

Query: 319 TNMIPNPQLATFYILNLTGISIGGKQ--LQASGF-AKGGILIDSGTVITRLPPSIYSALK 375
           ++ + +P    +Y ++L  I + GK+  L A+ F  K G ++DSGT    LP + + A K
Sbjct: 260 SDPVRSP----YYNIDLKEIHVAGKRLPLNANVFDGKHGTVLDSGTTYAYLPEAAFLAFK 315

Query: 376 AEFLKQFSGFP--SAPGFSILDTCFNLSAYQ----EVNIPLVKMEFEGNAEMTVDVTGIV 429
              +K+       S P  +  D CF+ +         + P+V M FE   + T+     +
Sbjct: 316 DAIVKELQSLKKISGPDPNYNDICFSGAGIDVSQLSKSFPVVDMVFENGQKYTLSPENYM 375

Query: 430 YFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
           +         CL +   +  D+T ++G    +N  V+YD + +++GF   +C+ +
Sbjct: 376 FRHSKVRGAYCLGVFQ-NGNDQTTLLGGIIVRNTLVVYDREQTKIGFWKTNCAEL 429


>gi|224136884|ref|XP_002326969.1| predicted protein [Populus trichocarpa]
 gi|222835284|gb|EEE73719.1| predicted protein [Populus trichocarpa]
          Length = 626

 Score =  117 bits (293), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 102/361 (28%), Positives = 161/361 (44%), Gaps = 46/361 (12%)

Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNS 203
           +   +IVDTGS +T+V C  C+ C   QDP F P +S +Y+ V CN S C+         
Sbjct: 88  QEFALIVDTGSTVTYVPCSSCEQCGKHQDPRFQPDLSSTYRPVKCNPS-CN--------- 137

Query: 204 GVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS---VNDFIFGCGRNNKG-LFG- 258
             C       C Y   Y + S + G +  + +  G  S       +FGC     G L+  
Sbjct: 138 --CDDEG-KQCTYERRYAEMSSSSGVIAEDVVSFGNESELKPQRAVFGCENVETGDLYSQ 194

Query: 259 GVSGLMGLGRSDLSLVSQTSE--IFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPI 316
              G+MGLGR  LS+V Q  +  + G  FS C          G + +GG + V    +P 
Sbjct: 195 RADGIMGLGRGRLSVVDQLVDKGVIGDSFSLCY---------GGMDVGGGAMVLGQISPP 245

Query: 317 TYTNMI---PNPQLATFYILNLTGISIGGK--QLQASGF-AKGGILIDSGTVITRLPPSI 370
              NM+    NP  + +Y + L  + + GK  +L+   F  K G ++DSGT     P + 
Sbjct: 246 --PNMVFSHSNPYRSPYYNIELKELHVAGKPLKLKPKVFDEKHGTVLDSGTTYAYFPEAA 303

Query: 371 YSALKAEFLKQFSGFPSAPG--FSILDTCFNLSAYQEVN-----IPLVKMEFEGNAEMTV 423
           + ALK   +K+       PG   +  D CF+  A +EV+      P V M F    ++++
Sbjct: 304 FHALKDAIMKEIRHLKQIPGPDPNYHDICFS-GAGREVSHLSKVFPEVNMVFGSGQKLSL 362

Query: 424 DVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSS 483
                ++     +   CL +   +  D T ++G    +N  V YD +N ++GF   +CS 
Sbjct: 363 SPENYLFRHTKVSGAYCLGIFQ-NGNDLTTLLGGIVVRNTLVTYDRENDKIGFWKTNCSE 421

Query: 484 M 484
           +
Sbjct: 422 L 422


>gi|414871328|tpg|DAA49885.1| TPA: hypothetical protein ZEAMMB73_545054 [Zea mays]
          Length = 565

 Score =  117 bits (292), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 80/267 (29%), Positives = 135/267 (50%), Gaps = 24/267 (8%)

Query: 230 LGREHLGLGKA--SVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSY 287
           LG++ L L     ++  + FGC     G      GL+G  R  LS  SQ   ++G +FSY
Sbjct: 310 LGQDALALHDDVDAIAAYTFGCLCVVTGGSVPSQGLVGFNRGPLSFPSQNKNVYGSVFSY 369

Query: 288 CLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQA 347
           CLPS + +  SG+L LG      +  T    T ++ NP   + Y +N+ GI +GG+ +  
Sbjct: 370 CLPSYKSSNFSGTLRLGPAGQPKRIKT----TPLLSNPHRPSLYYVNMVGIRVGGRPVAV 425

Query: 348 SGFAKG-------GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNL 400
              A         G ++D+GT+ TRL   +Y+A+   F  +    P A      DTC+N+
Sbjct: 426 PASALAFDPASGHGTIVDAGTMFTRLSAPVYAAVCDVFRSRVRA-PVAGPLGGFDTCYNV 484

Query: 401 SAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV-CLALA---SLSYEDETGIIG 456
           +    +++P V   F+G   +T+    +V  ++S    + CLA+A   S S +    ++ 
Sbjct: 485 T----ISVPTVTFLFDGRVSVTLPEENVV--IRSSLDGIACLAMAAGPSDSVDAVLNVMA 538

Query: 457 NYQQKNQRVIYDTKNSQLGFAGEDCSS 483
           + QQ+N RV++D  N ++GF+ E C++
Sbjct: 539 SMQQQNHRVLFDVANGRVGFSRELCTA 565


>gi|449437856|ref|XP_004136706.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
          Length = 457

 Score =  117 bits (292), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 121/401 (30%), Positives = 175/401 (43%), Gaps = 68/401 (16%)

Query: 123 PLTSGIRLQTLNYIATIELGGRNMTVIVDTGSDLTWVQCQP---CKSC-YNQQDPV---- 174
           PL+ G   QTL+             +I DTGS L W  C     C  C + + DP     
Sbjct: 84  PLSFGTPQQTLH-------------LIFDTGSSLVWFPCTSRYLCSECSFPKIDPTGIPR 130

Query: 175 FDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCN-------YFVSYGDGSYTR 227
           F P +S S K V C +  C  + F       C S +P   N       Y V YG GS T 
Sbjct: 131 FVPKLSSSSKLVGCQNPKCSWI-FGPDVKSQCRSCNPKTENCTQTCPAYVVQYGSGS-TA 188

Query: 228 GELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGL--F 285
           G L  E L      + +F+ GC   +       SG+ G GR   SL SQ      GL  F
Sbjct: 189 GLLLSETLDFPDKKIPNFVVGCSFLS---IHQPSGIAGFGRGSESLPSQM-----GLKKF 240

Query: 286 SYCLPSTQ--DAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLAT-----FYILNLTGI 338
           +YCL S +  D+  SG LIL  +S+  K+S  +TYT    NP ++      +Y LN+  I
Sbjct: 241 AYCLASRKFDDSPHSGQLIL--DSTGVKSSG-LTYTPFRQNPSVSNNAYKEYYYLNIRKI 297

Query: 339 SIGGKQLQAS-------GFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGF 391
            +G + ++             GG +IDSG+  T +   +   +  EF KQ + +  A   
Sbjct: 298 IVGNQAVKVPYKFLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVAREFEKQLANWTRATDV 357

Query: 392 SILD---TCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSY 448
             L     CF++S  + V  P +  +F+G A+  + +      V S +   CL + +   
Sbjct: 358 ETLTGLRPCFDISKEKSVKFPELIFQFKGGAKWALPLNNYFALV-SSSGVACLTVVTHQM 416

Query: 449 ED-------ETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
           ED        + I+G +QQ+N  V YD  N +LGF  + CS
Sbjct: 417 EDGGGGGGGPSVILGAFQQQNFYVEYDLVNQRLGFRQQTCS 457


>gi|357148754|ref|XP_003574882.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
           distachyon]
          Length = 488

 Score =  117 bits (292), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 111/398 (27%), Positives = 176/398 (44%), Gaps = 50/398 (12%)

Query: 117 VSNTEIPLTS-GIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQD- 172
           ++  ++PL   G+   T  Y   IE+G   +   V VDTGSD+ WV C  C  C  + D 
Sbjct: 64  LAAADVPLGGLGLPTDTGLYYTEIEIGTPPKQYHVQVDTGSDILWVNCISCNKCPRKSDL 123

Query: 173 ----PVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRG 228
                ++DP  S S   V C+   C A     G    C+ + P  C Y V YGDGS T G
Sbjct: 124 GIDLRLYDPKGSSSGSTVSCDQKFCAAT--YGGKLPGCAKNIP--CEYSVMYGDGSSTTG 179

Query: 229 ELGREHL--------GLGKASVNDFIFGCGRNNKGLFG----GVSGLMGLGRSDLSLVSQ 276
               + L        G  + +    IFGCG    G  G     + G++G G+S+ S++SQ
Sbjct: 180 YFVSDSLQYNQVSGDGQTRHANASVIFGCGAQQGGDLGSTNQALDGIIGFGQSNTSMLSQ 239

Query: 277 TSEI--FGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILN 334
            +       +FS+CL + +     G   +G        STP     ++P+      Y +N
Sbjct: 240 LAAAGEVKKIFSHCLDTIK---GGGIFAIGDVVQPKVKSTP-----LVPD---MPHYNVN 288

Query: 335 LTGISIGGKQLQASGF-----AKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAP 389
           L  I++GG  LQ          K G +IDSGT +T LP  +Y  + A     F+  P   
Sbjct: 289 LESINVGGTTLQLPSHMFETGEKKGTIIDSGTTLTYLPELVYKDVLAAV---FAKHPDTT 345

Query: 390 GFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYE 449
             S+ D    +  +Q V+    K+ F    ++ ++V    YF ++  +  C    +   +
Sbjct: 346 FHSVQD-FLCIQYFQSVDDGFPKITFHFEDDLGLNVYPHDYFFQNGDNLYCFGFQNGGLQ 404

Query: 450 DETG----IIGNYQQKNQRVIYDTKNSQLGFAGEDCSS 483
            + G    ++G+    N+ V+YD +N  +G+   +CSS
Sbjct: 405 SKDGKDMVLLGDLVLSNKVVVYDLENQVVGWTDYNCSS 442


>gi|356495496|ref|XP_003516613.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 645

 Score =  117 bits (292), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 104/372 (27%), Positives = 166/372 (44%), Gaps = 48/372 (12%)

Query: 135 YIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
           Y A + +G   +   +IVDTGS +T+V C  C+ C + QDP F P  S +Y+ V C +  
Sbjct: 93  YTARLWIGTPPQRFALIVDTGSTVTYVPCSTCRHCGSHQDPKFRPEDSETYQPVKC-TWQ 151

Query: 193 CHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG---KASVNDFIFGC 249
           C+           C +     C Y   Y + S + G LG + +  G   + S    IFGC
Sbjct: 152 CN-----------CDNDR-KQCTYERRYAEMSTSSGALGEDVVSFGNQTELSPQRAIFGC 199

Query: 250 GRNNKGLFGG--VSGLMGLGRSDLSLVSQTSE--IFGGLFSYCLPSTQDAGASGSLILGG 305
             +  G        G+MGLGR DLS++ Q  E  +    FS C          G+++LGG
Sbjct: 200 ENDETGDIYNQRADGIMGLGRGDLSIMDQLVEKKVISDSFSLCY--GGMGVGGGAMVLGG 257

Query: 306 NSS----VFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA---KGGILID 358
            S     VF  S P+           + +Y ++L  I + GK+L  +      K G ++D
Sbjct: 258 ISPPADMVFTRSDPVR----------SPYYNIDLKEIHVAGKRLHLNPKVFDGKHGTVLD 307

Query: 359 SGTVITRLPPSIYSALKAEFLKQFSGFP--SAPGFSILDTCFNLSAYQEVNI----PLVK 412
           SGT    LP S + A K   +K+       S P     D CF+ +      I    P+V+
Sbjct: 308 SGTTYAYLPESAFLAFKHAIMKETHSLKRISGPDPRYNDICFSGAEIDVSQISKSFPVVE 367

Query: 413 MEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNS 472
           M F    ++++     ++         CL + S +  D T ++G    +N  V+YD +++
Sbjct: 368 MVFGNGHKLSLSPENYLFRHSKVRGAYCLGVFS-NGNDPTTLLGGIVVRNTLVMYDREHT 426

Query: 473 QLGFAGEDCSSM 484
           ++GF   +CS +
Sbjct: 427 KIGFWKTNCSEL 438


>gi|357130715|ref|XP_003566992.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 479

 Score =  117 bits (292), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 102/404 (25%), Positives = 168/404 (41%), Gaps = 51/404 (12%)

Query: 122 IPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQ--------- 170
           +PLTS        Y     +G   +   ++ DTGSDLTWV+C+P K+             
Sbjct: 82  MPLTSAAYTGIGQYFVRFRVGTPAQPFLLVADTGSDLTWVKCRPAKAAAASTNSSSSASA 141

Query: 171 --QDPVFDPSISPSYKKVLCNSSTC-HALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTR 227
                 F P  S ++  + C S TC  +L F+      C +   P C Y   Y DGS  R
Sbjct: 142 SSPRRAFRPEKSKTWAPIPCASDTCSKSLPFSLST---CPTPGSP-CAYDYRYKDGSAAR 197

Query: 228 GELGREHLGLG-------------KASVNDFIFGCGRNNKGL-FGGVSGLMGLGRSDLSL 273
           G +G E   +              KA +   + GC  +  G  F    G++ LG S++S 
Sbjct: 198 GTVGTESATIALSSSSSSSKNKVKKAKLQGLVLGCTGSYTGPSFEASDGVLSLGYSNVSF 257

Query: 274 VSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITY------TNMIPNPQL 327
            S  +  FGG FSYCL        + S +  G +S      P         T ++ + ++
Sbjct: 258 ASHAASRFGGRFSYCLVDHLSPRNATSYLTFGPNSALSGPCPAAAGPGARQTPLVLDSRM 317

Query: 328 ATFYILNLTGISIGGKQLQASG-----FAKGGILIDSGTVITRLPPSIYSALKAEFLKQF 382
             FY +++  IS+ G+ L+           GG+++DSGT +T L    Y A+ A   K+ 
Sbjct: 318 RPFYDVSIKAISVDGELLKIPRDVWEVDGGGGVIVDSGTSLTVLAKPAYRAVVAALGKKL 377

Query: 383 SGFPSAPGFSILDTCFNLSAYQEV----NIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQ 438
           + FP        + C+N ++        ++P + + F G+A +  +     Y + +    
Sbjct: 378 ARFPRV-AMDPFEYCYNWTSPSRKDEGDDLPKLAVHFAGSARL--EPPSKSYVIDAAPGV 434

Query: 439 VCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
            C+ +    +     +IGN  Q+     +D KN +L F    C+
Sbjct: 435 KCIGVQEGPWPG-ISVIGNILQQEHLWEFDLKNRRLRFKRSRCT 477


>gi|343172998|gb|AEL99202.1| aspartyl protease family protein, partial [Silene latifolia]
          Length = 584

 Score =  117 bits (292), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 107/367 (29%), Positives = 163/367 (44%), Gaps = 58/367 (15%)

Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNS 203
           +   +IVDTGS +T+V C  C  C N QDP F P +S +Y  V CN              
Sbjct: 7   QEFALIVDTGSTVTYVPCNSCDQCGNHQDPKFQPDLSDTYHPVKCNPD------------ 54

Query: 204 GVCSSSSPPD-CNYFVSYGDGSYTRGELGREHLGLGKAS---VNDFIFGCGRNNKG-LFG 258
             C+  +  D C Y   Y + S + G LG + +  G  S       +FGC     G LF 
Sbjct: 55  --CTCDTENDQCTYERQYAEMSSSSGILGEDLVSFGNMSELKPQRAVFGCENAETGDLFS 112

Query: 259 -GVSGLMGLGRSDLSLVSQTSE--IFGGLFSYCLPSTQDAGASGSLILGGNSS----VFK 311
               G+MGLGR DLS+V Q  E  +    FS C    +  G  G+++LG  S     VF 
Sbjct: 113 QHADGIMGLGRGDLSIVDQLVEKGVINDSFSLCYGGMEVGG--GAMVLGQISPPSDMVFS 170

Query: 312 NSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASG---FAKGGILIDSGTVITRLPP 368
           +S          +P  + +Y + L G+ + GK+L  +      K G ++DSGT    LP 
Sbjct: 171 HS----------DPDRSPYYNIELRGLHVAGKKLDINPQVFDGKHGTILDSGTTYAYLPE 220

Query: 369 SIY----SALKAEF--LKQFSGFPSAPGFSILDTCFNLSAYQEV-----NIPLVKMEFEG 417
           + +     A+ +E   LKQ  G    P  +  D CF+  A  E+       P V M F+ 
Sbjct: 221 AAFLPFIQAITSELHGLKQIRG----PDPNYNDVCFS-GAGSEIPELYKTFPSVDMVFDN 275

Query: 418 NAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFA 477
             + ++     ++         CL +   + +D T ++G    +N  V YD ++S++GF 
Sbjct: 276 GEKYSLSPENYLFKHSKVHGAYCLGVFQ-NGKDPTTLLGGIVVRNTLVTYDREHSKVGFW 334

Query: 478 GEDCSSM 484
             +CS +
Sbjct: 335 KTNCSVL 341


>gi|343172996|gb|AEL99201.1| aspartyl protease family protein, partial [Silene latifolia]
          Length = 584

 Score =  117 bits (292), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 107/367 (29%), Positives = 163/367 (44%), Gaps = 58/367 (15%)

Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNS 203
           +   +IVDTGS +T+V C  C  C N QDP F P +S +Y  V CN              
Sbjct: 7   QEFALIVDTGSTVTYVPCNSCDQCGNHQDPKFQPDLSDTYHPVKCNPD------------ 54

Query: 204 GVCSSSSPPD-CNYFVSYGDGSYTRGELGREHLGLGKAS---VNDFIFGCGRNNKG-LFG 258
             C+  +  D C Y   Y + S + G LG + +  G  S       +FGC     G LF 
Sbjct: 55  --CTCDTENDQCTYERQYAEMSSSSGILGEDLVSFGNMSELKPQRAVFGCENAETGDLFS 112

Query: 259 -GVSGLMGLGRSDLSLVSQTSE--IFGGLFSYCLPSTQDAGASGSLILGGNSS----VFK 311
               G+MGLGR DLS+V Q  E  +    FS C    +  G  G+++LG  S     VF 
Sbjct: 113 QHADGIMGLGRGDLSIVDQLVEKGVINDSFSLCYGGMEVGG--GAMVLGQISPPSDMVFS 170

Query: 312 NSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASG---FAKGGILIDSGTVITRLPP 368
           +S          +P  + +Y + L G+ + GK+L  +      K G ++DSGT    LP 
Sbjct: 171 HS----------DPDRSPYYNIELRGLHVAGKKLDINPQVFDGKHGTILDSGTTYAYLPE 220

Query: 369 SIY----SALKAEF--LKQFSGFPSAPGFSILDTCFNLSAYQEV-----NIPLVKMEFEG 417
           + +     A+ +E   LKQ  G    P  +  D CF+  A  E+       P V M F+ 
Sbjct: 221 AAFLPFIQAITSELHGLKQIRG----PDPNYNDVCFS-GAGSEIPELYKTFPSVDMVFDN 275

Query: 418 NAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFA 477
             + ++     ++         CL +   + +D T ++G    +N  V YD ++S++GF 
Sbjct: 276 GEKYSLSPENYLFKHSKVHGAYCLGVFQ-NGKDPTTLLGGIVVRNTLVTYDREHSKVGFW 334

Query: 478 GEDCSSM 484
             +CS +
Sbjct: 335 KTNCSVL 341


>gi|449522369|ref|XP_004168199.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-2-like [Cucumis sativus]
          Length = 457

 Score =  117 bits (292), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 123/401 (30%), Positives = 174/401 (43%), Gaps = 68/401 (16%)

Query: 123 PLTSGIRLQTLNYIATIELGGRNMTVIVDTGSDLTWVQCQP---CKSC-YNQQDPV---- 174
           PL+ G   QTL+             +I DTGS L W  C     C  C + + DP     
Sbjct: 84  PLSFGTPQQTLH-------------LIFDTGSSLVWFPCTSRYLCSECSFPKIDPTGIPR 130

Query: 175 FDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCN-------YFVSYGDGSYTR 227
           F P +S S K V C +  C  + F       C S +P   N       Y V YG GS T 
Sbjct: 131 FVPKLSSSSKLVGCQNPKCSWI-FGPDVKSQCRSCNPKTENCTQTCPAYVVQYGSGS-TA 188

Query: 228 GELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGL--F 285
           G L  E L      + +F+ GC   +       SG+ G GR   SL SQ      GL  F
Sbjct: 189 GLLLSETLDFPDKXIPNFVVGCSFLS---IHQPSGIAGFGRGSESLPSQM-----GLKKF 240

Query: 286 SYCLPSTQ--DAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLAT-----FYILNLTGI 338
           +YCL S +  D+  SG LIL  +S+  K+S  +TYT    NP ++      +Y LN+  I
Sbjct: 241 AYCLASRKFDDSPHSGQLIL--DSTGVKSSG-LTYTPFRQNPSVSNNAYKEYYYLNIRKI 297

Query: 339 SIGG-------KQLQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGF 391
            +G        K L       GG +IDSG+  T +   +   +  EF KQ + +  A   
Sbjct: 298 IVGNQAVKVPYKFLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVAREFEKQLANWTRATDV 357

Query: 392 SILD---TCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSY 448
             L     CF++S  + V  P +  +F+G A+  + +      V S +   CL + +   
Sbjct: 358 ETLTGLRPCFDISKEKSVKFPELIFQFKGGAKWALPLNNYFALV-SSSGVACLTVVTHQM 416

Query: 449 ED-------ETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
           ED        + I+G +QQ+N  V YD  N +LGF  + CS
Sbjct: 417 EDGGGGGGGPSVILGAFQQQNFYVEYDLVNQRLGFRQQTCS 457


>gi|125571687|gb|EAZ13202.1| hypothetical protein OsJ_03122 [Oryza sativa Japonica Group]
          Length = 453

 Score =  117 bits (292), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 124/404 (30%), Positives = 186/404 (46%), Gaps = 36/404 (8%)

Query: 99  VQYLQSRIKNMISGNIKDV-----SNTEIPLTSGIRLQTLNY-IATIELGGRNMTVIVDT 152
           VQ  +SR+  + +  + +       + + PL  G     +++ I T   G   ++   DT
Sbjct: 55  VQRSRSRLSMLAARAVSNAGAAPGESAQTPLKKGSGDYAMSFGIGTPATG---LSGEADT 111

Query: 153 GSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPP 212
           GSDL W +C  C  C  +  P + P+ S S   V C   TC  L     ++     S   
Sbjct: 112 GSDLIWTKCGACARCSPRGSPSYYPTSSSSAAFVACGDRTCGELPRPLCSNVAGGGSGSG 171

Query: 213 DCNYFVSYGDGS----YTRGELGREHLGLGK--ASVNDFIFGCGRNNKGLFGGVSGLMGL 266
           +C+Y  +YG+      YT G L  E    G   A+     FGC   ++G FG  SGL+GL
Sbjct: 172 NCSYHYAYGNARDTHHYTEGILMTETFTFGDDAAAFPGIAFGCTLRSEGGFGTGSGLVGL 231

Query: 267 GRSDLSLVSQTS-EIFGGLFSYCLPSTQDAGASGSL--ILGGNSSVFKNSTPITYTNMIP 323
           GR  LSLV+Q + E FG   S  L S     + GSL  + GGN   F  STP+  TN  P
Sbjct: 232 GRGKLSLVTQLNVEAFGYRLSSDL-SAPSPISFGSLADVTGGNGDSFM-STPL-LTN--P 286

Query: 324 NPQLATFYILNLTGISIGGK--QLQASGFA------KGGILIDSGTVITRLPPSIYSALK 375
             Q   FY + LTGIS+GGK  Q+ +  F+       GG++ DSGT +T LP   Y+ ++
Sbjct: 287 VVQDLPFYYVGLTGISVGGKLVQIPSGTFSFDRSTGAGGVIFDSGTTLTMLPDPAYTLVR 346

Query: 376 AEFLKQFSGFPSAPGFSILDT--CFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVK 433
            E L Q  GF   P  +  D   CF          P + + F+G A+M +     +  ++
Sbjct: 347 DELLSQM-GFQKPPPAANDDDLICFT-GGSSTTTFPSMVLHFDGGADMDLSTENYLPQMQ 404

Query: 434 SDASQVCLALASLSYEDETGIIGNYQQKNQRVIYD-TKNSQLGF 476
               +     + +       IIGN  Q +  V++D + N+++ F
Sbjct: 405 GQNGETARCWSVVKSSQALTIIGNIMQMDFHVVFDLSGNARMLF 448


>gi|224140036|ref|XP_002323393.1| predicted protein [Populus trichocarpa]
 gi|222868023|gb|EEF05154.1| predicted protein [Populus trichocarpa]
          Length = 459

 Score =  116 bits (291), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 112/376 (29%), Positives = 168/376 (44%), Gaps = 57/376 (15%)

Query: 149 IVDTGSDLTWVQCQP---CKSC-----YNQQDPVFDPSISPSYKKVLCNSSTCHAL---- 196
           ++DTGS L W  C     C  C          P F P +S S K + C +  C  +    
Sbjct: 99  VMDTGSSLVWFPCTSRYLCSECNFPNIKKTGIPTFLPKLSSSSKLIGCKNPRCSMIFGPE 158

Query: 197 -----EFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL-GKASVNDFIFGCG 250
                +     +  C+ + PP   Y + YG GS T G L  E L    K ++ DF+ GC 
Sbjct: 159 IQSKCQECDSTAQNCTQTCPP---YVIQYGSGS-TAGLLLSETLDFPNKKTIPDFLVGCS 214

Query: 251 RNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGL--FSYCLPST--QDAGASGSLIL--G 304
             +        G+ G GRS  SL SQ      GL  FSYCL S    D   S  L+L  G
Sbjct: 215 IFS---IKQPEGIAGFGRSPESLPSQL-----GLKKFSYCLVSHAFDDTPTSSDLVLDTG 266

Query: 305 GNSSVFKNSTPITYTNMIPNPQLA--TFYILNLTGISIGG-------KQLQASGFAKGGI 355
             S V K +  +++T  + NP  A   +Y + L  I IG        K L       GG 
Sbjct: 267 SGSGVTKTAG-LSHTPFLKNPTTAFRDYYYVLLRNIVIGDTHVKVPYKFLVPGTDGNGGT 325

Query: 356 LIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAP---GFSILDTCFNLSAYQEVNIPLVK 412
           ++DSGT  T +   +Y  +  EF KQ + +  A      + L  C+N+S  + +++P + 
Sbjct: 326 IVDSGTTFTFMENPVYELVAKEFEKQMAHYTVATEIQNLTGLRPCYNISGEKSLSVPDLI 385

Query: 413 MEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETG------IIGNYQQKNQRVI 466
            +F+G A+M + ++   YF   D+  +CL + S +            I+GNYQQ+N  V 
Sbjct: 386 FQFKGGAKMALPLSN--YFSIVDSGVICLTIVSDNVAGPGLGGGPAIILGNYQQRNFYVE 443

Query: 467 YDTKNSQLGFAGEDCS 482
           +D +N + GF  + C+
Sbjct: 444 FDLENEKFGFKQQSCA 459


>gi|388520263|gb|AFK48193.1| unknown [Lotus japonicus]
          Length = 157

 Score =  116 bits (291), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 67/157 (42%), Positives = 93/157 (59%), Gaps = 4/157 (2%)

Query: 327 LATFYILNLTGISIGGKQLQ-ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSG- 384
           L T Y L+LT I++GGK L  A+   K   +IDSGTVITRLP  +Y+ALK  F++  S  
Sbjct: 2   LPTLYGLDLTAITVGGKPLGLAASSYKVPTIIDSGTVITRLPMPVYTALKNSFVRIMSKK 61

Query: 385 FPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALA 444
           +  APG SILDTCF  +  +   +P ++M F G A++ +        ++ D    CLA+A
Sbjct: 62  YAQAPGISILDTCFKGNVKEMSEVPEIQMIFGGGADLPLKAHNT--LIELDKGVTCLAIA 119

Query: 445 SLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
             S  +   IIGNYQQ+  +V YD  NS++GFA   C
Sbjct: 120 GSSENNPIAIIGNYQQQTFKVAYDVANSKIGFAAGGC 156


>gi|9757837|dbj|BAB08274.1| unnamed protein product [Arabidopsis thaliana]
          Length = 586

 Score =  116 bits (291), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 102/370 (27%), Positives = 156/370 (42%), Gaps = 65/370 (17%)

Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNS 203
           +   +IVDTGS +T+V C  CK C   QDP F P +S SY+ + CN              
Sbjct: 87  QEFALIVDTGSTVTYVPCSTCKQCGKHQDPKFQPELSTSYQALKCN-------------- 132

Query: 204 GVCSSSSPPDCN---------YFVSYGDGSYTRGELGREHLGLG---KASVNDFIFGCGR 251
                   PDCN         Y   Y + S + G L  + +  G   + S    +FGC  
Sbjct: 133 --------PDCNCDDEGKLCVYERRYAEMSSSSGVLSEDLISFGNESQLSPQRAVFGCEN 184

Query: 252 NNKG-LFG-GVSGLMGLGRSDLSLVSQTSE--IFGGLFSYCLPSTQDAGASGSLILGGNS 307
              G LF     G+MGLGR  LS+V Q  +  +   +FS C    +  G  G+++LG  S
Sbjct: 185 EETGDLFSQRADGIMGLGRGKLSVVDQLVDKGVIEDVFSLCYGGMEVGG--GAMVLGKIS 242

Query: 308 S----VFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA---KGGILIDSG 360
                VF +S P            + +Y ++L  + + GK L+ +      K G ++DSG
Sbjct: 243 PPPGMVFSHSDPFR----------SPYYNIDLKQMHVAGKSLKLNPKVFNGKHGTVLDSG 292

Query: 361 TVITRLPPSIYSALKAEFLKQFSGFPS--APGFSILDTCFNLSAYQEVNI----PLVKME 414
           T     P   + A+K   +K+         P  +  D CF+ +      I    P + ME
Sbjct: 293 TTYAYFPKEAFIAIKDAVIKEIPSLKRIHGPDPNYDDVCFSGAGRDVAEIHNFFPEIAME 352

Query: 415 FEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQL 474
           F GN +  + ++   Y  +    +    L      D T ++G    +N  V YD +N +L
Sbjct: 353 F-GNGQKLI-LSPENYLFRHTKVRGAYCLGIFPDRDSTTLLGGIVVRNTLVTYDRENDKL 410

Query: 475 GFAGEDCSSM 484
           GF   +CS +
Sbjct: 411 GFLKTNCSDI 420


>gi|42568291|ref|NP_199124.3| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332007527|gb|AED94910.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 631

 Score =  116 bits (291), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 101/366 (27%), Positives = 155/366 (42%), Gaps = 57/366 (15%)

Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNS 203
           +   +IVDTGS +T+V C  CK C   QDP F P +S SY+ + CN              
Sbjct: 87  QEFALIVDTGSTVTYVPCSTCKQCGKHQDPKFQPELSTSYQALKCN-------------- 132

Query: 204 GVCSSSSPPDCN---------YFVSYGDGSYTRGELGREHLGLG---KASVNDFIFGCGR 251
                   PDCN         Y   Y + S + G L  + +  G   + S    +FGC  
Sbjct: 133 --------PDCNCDDEGKLCVYERRYAEMSSSSGVLSEDLISFGNESQLSPQRAVFGCEN 184

Query: 252 NNKG-LFG-GVSGLMGLGRSDLSLVSQTSE--IFGGLFSYCLPSTQDAGASGSLILGGNS 307
              G LF     G+MGLGR  LS+V Q  +  +   +FS C    +  G  G+++LG   
Sbjct: 185 EETGDLFSQRADGIMGLGRGKLSVVDQLVDKGVIEDVFSLCYGGMEVGG--GAMVLG--- 239

Query: 308 SVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA---KGGILIDSGTVIT 364
              K S P        +P  + +Y ++L  + + GK L+ +      K G ++DSGT   
Sbjct: 240 ---KISPPPGMVFSHSDPFRSPYYNIDLKQMHVAGKSLKLNPKVFNGKHGTVLDSGTTYA 296

Query: 365 RLPPSIYSALKAEFLKQFSGFPS--APGFSILDTCFNLSAYQEVNI----PLVKMEFEGN 418
             P   + A+K   +K+         P  +  D CF+ +      I    P + MEF GN
Sbjct: 297 YFPKEAFIAIKDAVIKEIPSLKRIHGPDPNYDDVCFSGAGRDVAEIHNFFPEIAMEF-GN 355

Query: 419 AEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAG 478
            +  + ++   Y  +    +    L      D T ++G    +N  V YD +N +LGF  
Sbjct: 356 GQKLI-LSPENYLFRHTKVRGAYCLGIFPDRDSTTLLGGIVVRNTLVTYDRENDKLGFLK 414

Query: 479 EDCSSM 484
            +CS +
Sbjct: 415 TNCSDI 420


>gi|326501496|dbj|BAK02537.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 476

 Score =  116 bits (291), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 115/444 (25%), Positives = 186/444 (41%), Gaps = 55/444 (12%)

Query: 81  IVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIE 140
           + D     + R+     H +    R +   +G+    +  E+PLTSG       Y     
Sbjct: 45  LADLARSDRQRMAFIASHGR---RRARETAAGS--SAAAFEMPLTSGAYTGIGQYFVRFR 99

Query: 141 LG--GRNMTVIVDTGSDLTWVQCQ-PCKSCYNQQDP---VFDPSISPSYKKVLCNSSTC- 193
           +G   +   ++ DTGSDLTWV+C+ P  +           F P  S ++  + C S TC 
Sbjct: 100 VGTPAQPFLLVADTGSDLTWVKCRRPAANSSESGSGSGRAFRPEDSRTWAPISCASDTCT 159

Query: 194 HALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG---------KASVND 244
            +L F+      C +   P C Y   Y DGS  RG +G E   +          KA +  
Sbjct: 160 KSLPFSLAT---CPTPGSP-CAYDYRYKDGSAARGTVGTESATIALSGRGREERKAKLKG 215

Query: 245 FIFGCGRNNKGLFGGVS-GLMGLGRSDLSLVSQTSEIFGGLFSYCL-PSTQDAGASGSLI 302
            + GC  +  G    VS G++ LG SD+S  S  +  F G FSYCL        A+  L 
Sbjct: 216 LVLGCTSSYTGPSFEVSDGVLSLGYSDVSFASHAASRFAGRFSYCLVDHLSPRNATSYLT 275

Query: 303 LGGNSSVFKNSTPITY------------------TNMIPNPQLATFYILNLTGISIGGKQ 344
            G N +V  +S+P +                   T ++ + ++  FY + +  +S+ G+ 
Sbjct: 276 FGPNPAVASSSSPSSPAPASCTAAAPRPRPRARQTPLLLDRRMRPFYDVAVKAVSVAGQF 335

Query: 345 LQASGF-----AKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFN 399
           L+         A GG+++DSGT +T L    Y A+ A   +  +G P        + C+N
Sbjct: 336 LKIPRAVWDVDAGGGVILDSGTSLTVLAKPAYRAVVAALSEGLAGLPRV-TMDPFEYCYN 394

Query: 400 L-SAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNY 458
             S   +V +P + + F G A +  +  G  Y + +     C+ L    +     +IGN 
Sbjct: 395 WTSPSGDVTLPKMAVHFAGAARL--EPPGKSYVIDAAPGVKCIGLQEGPWPG-ISVIGNI 451

Query: 459 QQKNQRVIYDTKNSQLGFAGEDCS 482
            Q+     +D KN +L F    C+
Sbjct: 452 LQQEHLWEFDIKNRRLKFQRSRCT 475


>gi|125527370|gb|EAY75484.1| hypothetical protein OsI_03384 [Oryza sativa Indica Group]
          Length = 453

 Score =  116 bits (291), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 124/404 (30%), Positives = 186/404 (46%), Gaps = 36/404 (8%)

Query: 99  VQYLQSRIKNMISGNIKDV-----SNTEIPLTSGIRLQTLNY-IATIELGGRNMTVIVDT 152
           VQ  +SR+  + +  + +       + + PL  G     +++ I T   G   ++   DT
Sbjct: 55  VQRSRSRLSMLAARAVSNAGAAPGESAQTPLKKGSGDYAMSFGIGTPATG---LSGEADT 111

Query: 153 GSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPP 212
           GSDL W +C  C  C  +  P + P+ S S   V C   TC  L     ++     S   
Sbjct: 112 GSDLIWTKCGACARCSPRGSPSYYPTSSSSAAFVACGDRTCGELPRPLCSNVAGGGSGSG 171

Query: 213 DCNYFVSYGDGS----YTRGELGREHLGLGK--ASVNDFIFGCGRNNKGLFGGVSGLMGL 266
           +C+Y  +YG+      YT G L  E    G   A+     FGC   ++G FG  SGL+GL
Sbjct: 172 NCSYHYAYGNARDTHHYTEGILMTETFTFGDDAAAFPGIAFGCTLRSEGGFGTGSGLVGL 231

Query: 267 GRSDLSLVSQTS-EIFGGLFSYCLPSTQDAGASGSL--ILGGNSSVFKNSTPITYTNMIP 323
           GR  LSLV+Q + E FG   S  L S     + GSL  + GGN   F  STP+  TN  P
Sbjct: 232 GRGKLSLVTQLNVEAFGYRLSSDL-SAPSPISFGSLADVTGGNGDSFM-STPL-LTN--P 286

Query: 324 NPQLATFYILNLTGISIGGK--QLQASGFA------KGGILIDSGTVITRLPPSIYSALK 375
             Q   FY + LTGIS+GGK  Q+ +  F+       GG++ DSGT +T LP   Y+ ++
Sbjct: 287 VVQDLPFYYVGLTGISVGGKLVQIPSGTFSFDRSTGAGGVIFDSGTTLTMLPDPAYTLVR 346

Query: 376 AEFLKQFSGFPSAPGFSILDT--CFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVK 433
            E L Q  GF   P  +  D   CF          P + + F+G A+M +     +  ++
Sbjct: 347 DELLSQM-GFQKPPPAANDDDLICFT-GGSSTTTFPSMVLHFDGGADMDLSTENYLPQMQ 404

Query: 434 SDASQVCLALASLSYEDETGIIGNYQQKNQRVIYD-TKNSQLGF 476
               +     + +       IIGN  Q +  V++D + N+++ F
Sbjct: 405 GQNGETARCWSVVKSSQALTIIGNIMQMDFHVVFDLSGNARMLF 448


>gi|302757345|ref|XP_002962096.1| hypothetical protein SELMODRAFT_403622 [Selaginella moellendorffii]
 gi|300170755|gb|EFJ37356.1| hypothetical protein SELMODRAFT_403622 [Selaginella moellendorffii]
          Length = 506

 Score =  116 bits (291), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 131/466 (28%), Positives = 214/466 (45%), Gaps = 76/466 (16%)

Query: 60  SRIEMGAITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSN 119
           S++  G+  L+LKH+       ++ + +Q  +  +   H + L    +      + +V  
Sbjct: 21  SKVTCGSGVLKLKHRF----SELEGSSKQSGKRGMSEEHFRQLMDHTRARSRRFLLEVD- 75

Query: 120 TEIPLTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQ------- 170
               + +G       Y A I +G   + +  IVDTGSD+ W +C+ C+ C ++       
Sbjct: 76  ---LMLNGSSTSDATYYAQIGVGHPVQFLNAIVDTGSDILWFKCKLCQGCSSKKNVIVCS 132

Query: 171 ----QDPV--FDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGS 224
               Q P+  +DP +S +     C+   C       GN+  C+        Y +SY D S
Sbjct: 133 SIIMQGPITLYDPELSITASPATCSDPLCSEGGSCRGNNNSCA--------YDISYEDTS 184

Query: 225 YTRGELGREHLGLG-KASVNDFIF-GCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFG 282
            + G   R+ + LG KAS+N  +F GC  +  GL+  V G+MG GRS +S+ +Q +   G
Sbjct: 185 SSTGIYFRDVVHLGHKASLNTTMFLGCATSISGLW-PVDGIMGFGRSKVSVPNQLAAQAG 243

Query: 283 G--LFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLAT--FYILNLTGI 338
              +F +CL   ++ G  G L+LG N           +  M+  P LA    Y + L  +
Sbjct: 244 SYNIFYHCLSGEKEGG--GILVLGKNDE---------FPEMVYTPMLANDIVYNVKLVSL 292

Query: 339 SIGGKQL--QASGF------AKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGF----P 386
           S+  K L  +AS F        GG +IDSGT     P    S   A F+K  S F    P
Sbjct: 293 SVNSKALPIEASEFEYNATVGNGGTIIDSGTSSATFP----SKALALFVKAVSKFTTAIP 348

Query: 387 SAPGFSILDTCF-NLSAYQ--EVNIPLVKMEFEGNAEMTVD----VTGIVYFVKSDASQ- 438
           +AP  S    CF ++S     EV+ P V ++F+G A M +     +  +V    S+++  
Sbjct: 349 TAPLESSGSPCFISISDRNSVEVDFPNVTLKFDGGATMELTAHNYLEAVVSRKLSESTHF 408

Query: 439 --VCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
             V L   S S  + T I+G+   K++ V+YD + S++G+  +D S
Sbjct: 409 QGVRLVCISWSVGNST-ILGDAILKDKVVVYDMEKSRIGWVKQDLS 453


>gi|88174563|gb|ABD39356.1| chloroplast nucleoid DNA-binding protein [Oryza longistaminata]
          Length = 323

 Score =  116 bits (291), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 96/326 (29%), Positives = 158/326 (48%), Gaps = 28/326 (8%)

Query: 135 YIATIELGGRNMTVI--VDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
           Y+ ++ LG  + T I  +DTGS  +WV C+ C  C+      F  S S +  KV C +S 
Sbjct: 1   YVISVGLGTPSKTQILEIDTGSSTSWVFCE-CDGCHTNPR-TFLQSRSTTCAKVSCGTSM 58

Query: 193 CHALEFATGNSGVCSSSSP-PDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCG 250
           C       G+   C  S   PDC + VSY DGS + G L ++ L       +  F FGC 
Sbjct: 59  C----LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPGFSFGCN 114

Query: 251 RNNKGL--FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQD-----AGASGSLIL 303
            ++ G   FG V GL+G+G   +S++ Q+S  F G FSYCLP         +  +G   L
Sbjct: 115 MDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDG-FSYCLPLQMSERGFFSKTTGYFSL 173

Query: 304 GGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQ--LQASGFAKGGILIDSGT 361
           GG   +    T + YT M+   +    + ++LT IS+ G++  L  S F++ G++ DSG+
Sbjct: 174 GGK--IAATRTDVRYTKMVARRKNTELFFVDLTAISVDGERLGLSPSIFSRKGVVFDSGS 231

Query: 362 VITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEM 421
            ++ +P    S L ++ +++      A        C+++ +  E ++P + + F+  A  
Sbjct: 232 ELSYIPDRALSVL-SQRIRELLLRRGAAEEESERNCYDMRSVDEGDMPAISLHFDDGARF 290

Query: 422 TVDVTGIVYFVKSDASQ---VCLALA 444
            +   G+  FV+    +    CLA A
Sbjct: 291 DLGSHGV--FVERSVQEQDVWCLAFA 314


>gi|224091907|ref|XP_002309394.1| predicted protein [Populus trichocarpa]
 gi|222855370|gb|EEE92917.1| predicted protein [Populus trichocarpa]
          Length = 469

 Score =  116 bits (291), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 122/384 (31%), Positives = 166/384 (43%), Gaps = 71/384 (18%)

Query: 149 IVDTGSDLTWVQCQP---CKSCYNQQD-----PVFDPSISPSYKKVLCNS---------- 190
           ++DTGS L W  C     C  C          P F P  S S   + C +          
Sbjct: 108 VMDTGSSLVWFPCTSRYLCSRCDFPNIEVTGIPTFIPKQSSSSNLIGCKNHKCSWLFGPK 167

Query: 191 --STCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG-KASVNDFIF 247
             S C   +  T N   C+ S PP   Y + YG GS T G L  E L    K ++  F+ 
Sbjct: 168 VQSKCQECDPTTQN---CTQSCPP---YVIQYGLGS-TAGLLLSETLDFPHKKTIPGFLV 220

Query: 248 GCGRNNKGLFG--GVSGLMGLGRSDLSLVSQTSEIFGGL--FSYCLPST--QDAGASGSL 301
           GC      LF      G+ G GRS  SL SQ      GL  FSYCL S    D  AS  L
Sbjct: 221 GCS-----LFSIRQPEGIAGFGRSPESLPSQL-----GLKKFSYCLVSHAFDDTPASSDL 270

Query: 302 ILGGNSSVFKNSTP-ITYTNMIPNPQLA--TFYILNLTGISIGG-------KQLQASGFA 351
           +L   S      TP ++YT    NP  A   +Y + L  I IG        K L      
Sbjct: 271 VLDTGSGSDDTKTPGLSYTPFQKNPTAAFRDYYYVLLRNIVIGDTHVKVPYKFLVPGSDG 330

Query: 352 KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGF---SILDTCFNLSAYQEVNI 408
            GG ++DSGT  T +   +Y  +  EF KQ + +  A      + L  CFN+S  + V++
Sbjct: 331 NGGTIVDSGTTFTFMEKPVYELVAKEFEKQVAHYTVATEVQNQTGLRPCFNISGEKSVSV 390

Query: 409 PLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETG---------IIGNYQ 459
           P     F+G A+M + +     FV  D+  +CL + S   ++ +G         I+GNYQ
Sbjct: 391 PEFIFHFKGGAKMALPLANYFSFV--DSGVICLTIVS---DNMSGSGIGGGPAIILGNYQ 445

Query: 460 QKNQRVIYDTKNSQLGFAGEDCSS 483
           Q+N  V +D KN + GF  ++C S
Sbjct: 446 QRNFHVEFDLKNERFGFKQQNCVS 469


>gi|449441618|ref|XP_004138579.1| PREDICTED: uncharacterized protein LOC101220661 [Cucumis sativus]
          Length = 2819

 Score =  116 bits (291), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 115/388 (29%), Positives = 183/388 (47%), Gaps = 61/388 (15%)

Query: 132  TLNYIATIELGGRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSS 191
            TL    T+    + +T+++DTGS+L+W+ C+   +  +    VF+P  S SY  + C+S 
Sbjct: 999  TLTVSLTVGSPPQQVTMVLDTGSELSWLHCKKSPNLTS----VFNPLSSSSYSPIPCSSP 1054

Query: 192  TCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCG- 250
             C        N   C       C+  VSY D S   G L  ++  +G +++   +FGC  
Sbjct: 1055 ICRTRTRDLPNPVTCDPKKL--CHAIVSYADASSLEGNLASDNFRIGSSALPGTLFGCMD 1112

Query: 251  ---RNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGL--FSYCLPSTQDAGASGSLILG- 304
                +N       +GLMG+ R  LS V+Q      GL  FSYC+ S +D  +SG L+ G 
Sbjct: 1113 SGFSSNSEEDAKTTGLMGMNRGSLSFVTQL-----GLPKFSYCI-SGRD--SSGVLLFGD 1164

Query: 305  ------GN---SSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGK--QLQASGFA-- 351
                  GN   + + + STP+ Y + +        Y + L GI +G K   L  S FA  
Sbjct: 1165 LHLSWLGNLTYTPLVQISTPLPYFDRVA-------YTVQLDGIRVGNKILPLPKSIFAPD 1217

Query: 352  ---KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSA---PGFSI---LDTCFNLSA 402
                G  ++DSGT  T L   +Y+AL+ EFL+Q  G  +    P F     +D C++++A
Sbjct: 1218 HTGAGQTMVDSGTQFTFLLGPVYTALRNEFLEQTKGVLAPLGDPNFVFQGAMDLCYSVAA 1277

Query: 403  YQEV-NIPLVKMEFEGNAEMTVDVTGIVYFV----KSDASQVCLALASLSYED-ETGIIG 456
              ++  +P V + F G AEM V    ++Y V    K +    CL   +      E  +IG
Sbjct: 1278 GGKLPTLPSVSLMFRG-AEMVVGGEVLLYRVPEMMKGNEWVYCLTFGNSDLLGIEAFVIG 1336

Query: 457  NYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
            ++ Q+N  + +D     + FA + C S+
Sbjct: 1337 HHHQQNVWMEFDL----VAFAADLCGSI 1360


>gi|358346736|ref|XP_003637421.1| Aspartic proteinase nepenthesin-1, partial [Medicago truncatula]
 gi|355503356|gb|AES84559.1| Aspartic proteinase nepenthesin-1, partial [Medicago truncatula]
          Length = 280

 Score =  116 bits (291), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 80/225 (35%), Positives = 118/225 (52%), Gaps = 19/225 (8%)

Query: 68  TLELK-HKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTS 126
           TL L+ H         D+     +RL  D+  V+Y+ +++      N   +S    P+ S
Sbjct: 69  TLSLQLHSRASLSSHADYKSLTLSRLDRDSARVKYITTKLNQNF--NTDKLSG---PIIS 123

Query: 127 GIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYK 184
           G    +  Y + I +G       +++DTGSD++WVQC PC  CY Q DP+F+P+ S SY 
Sbjct: 124 GTSQGSGEYFSRIGIGEPPSQAYMVLDTGSDISWVQCAPCADCYRQADPIFEPTASASYA 183

Query: 185 KVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVND 244
            + C ++ C  L+ +   +G        +C Y VSYGDGSYT G+   E + +G   V +
Sbjct: 184 PLSCEAAQCRYLDQSQCRNG--------NCLYQVSYGDGSYTVGDFVTETVTIGVNKVKN 235

Query: 245 FIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL 289
              GCG NN+GLF G +GL+GLG   LS  +Q +      FSYCL
Sbjct: 236 VALGCGHNNEGLFVGAAGLIGLGGGPLSFPAQLNST---SFSYCL 277


>gi|21717154|gb|AAM76347.1|AC074196_5 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433293|gb|AAP54831.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
 gi|125532791|gb|EAY79356.1| hypothetical protein OsI_34485 [Oryza sativa Indica Group]
          Length = 397

 Score =  116 bits (291), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 99/361 (27%), Positives = 161/361 (44%), Gaps = 26/361 (7%)

Query: 138 TIELGGRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALE 197
           TI    +  + I+D   +L W QC  C  C+ Q  P+F P+ S +++   C +  C +  
Sbjct: 48  TIGTPPQPASAIIDVAGELVWTQCSRCSRCFKQDLPLFIPNASSTFRPEPCGTDACKSTP 107

Query: 198 FATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGC-GRNNKGL 256
            +  +  VC+  S  +        D   T G +G E   +G A+ +   FGC   ++   
Sbjct: 108 TSNCSGDVCTYESTTNIRL-----DRHTTLGIVGTETFAIGTATAS-LAFGCVVASDIDT 161

Query: 257 FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVF--KNST 314
             G SG +GLGR+  SLV+Q        FSYCL S +  G S  L LG ++ +   ++++
Sbjct: 162 MDGTSGFIGLGRTPRSLVAQMKLT---KFSYCL-SPRGTGKSSRLFLGSSAKLAGGESTS 217

Query: 315 PITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGGILI-DSGTVITRLPPSIYSA 373
              +    P+     +Y+L+L  I  G   +  +    GGIL+  + +  + L  S Y A
Sbjct: 218 TAPFIKTSPDDDSHHYYLLSLDAIRAGNTTIATA--QSGGILVMHTVSPFSLLVDSAYRA 275

Query: 374 LKAEFLKQFSG---FPSAPGFSILDTCFNLSA-YQEVNIPLVKMEFE-GNAEMTVDVTGI 428
            K    +   G    P A      D CF  +A +     P +   F+ G A +TV     
Sbjct: 276 FKKAVTEAVGGAAAPPMATPPQPFDLCFKKAAGFSRATAPDLVFTFQGGGAALTVPPAKY 335

Query: 429 VYFVKSDASQVCLALASLSYEDETG-----IIGNYQQKNQRVIYDTKNSQLGFAGEDCSS 483
           +  V  +    C A+ S++  + TG     ++G+ QQ+N   +YD K   L F   DCSS
Sbjct: 336 LIDVGEEKDTACAAILSMARLNRTGLEGVSVLGSLQQENVHFLYDLKKETLSFEPADCSS 395

Query: 484 M 484
           +
Sbjct: 396 L 396


>gi|356537161|ref|XP_003537098.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 601

 Score =  116 bits (290), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 114/378 (30%), Positives = 169/378 (44%), Gaps = 58/378 (15%)

Query: 149 IVDTGSDLTWVQC------QPCKSCYNQQDPVFDPSISPSYKKVLCNSSTC--------- 193
           ++DTGS L W+ C        C S  N   P F P  S S K V C +  C         
Sbjct: 232 VLDTGSSLVWLPCYSHYLCSKCNSFSNNNTPKFIPKDSFSSKFVGCRNPKCAWVFGSDVT 291

Query: 194 -HALEFATG---NSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGC 249
            H  + A     N+  CS + P    Y V YG GS T G L  E+L     +V+DF+ GC
Sbjct: 292 SHCCKLAKAAFSNNNNCSQTCP---AYTVQYGLGS-TAGFLLSENLNFPAKNVSDFLVGC 347

Query: 250 GRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQ--DAGASGSLIL-GGN 306
              +    GG++G    GR + SL +Q +      FSYCL S Q  ++  +  L++   N
Sbjct: 348 SVVSVYQPGGIAGF---GRGEESLPAQMNLT---RFSYCLLSHQFDESPENSDLVMEATN 401

Query: 307 SSVFKNSTPITYTNMIPNPQ-----LATFYILNLTGISIGGKQ-------LQASGFAKGG 354
           S   K +  ++YT  + NP         +Y + L  I +G K+       L+      GG
Sbjct: 402 SGEGKKTNGVSYTAFLKNPSTKKPAFGAYYYITLRKIVVGEKRVRVPRRMLEPDVNGDGG 461

Query: 355 ILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI---LDTCFNLSAYQE-VNIPL 410
            ++DSG+ +T +   I+  +  EF+KQ + +  A        L  CF L+   E  + P 
Sbjct: 462 FIVDSGSTLTFMERPIFDLVAEEFVKQVN-YTRARELEKQFGLSPCFVLAGGAETASFPE 520

Query: 411 VKMEFEGNAEMTVDVTGIVYFVKSDASQV-CLALASLSYEDETG------IIGNYQQKNQ 463
           ++ EF G A+M + V    YF +     V CL + S     + G      I+GNYQQ+N 
Sbjct: 521 MRFEFRGGAKMRLPVAN--YFSRVGKGDVACLTIVSDDVAGQGGAVGPAVILGNYQQQNF 578

Query: 464 RVIYDTKNSQLGFAGEDC 481
            V  D +N + GF  + C
Sbjct: 579 YVECDLENERFGFRSQSC 596


>gi|297829808|ref|XP_002882786.1| hypothetical protein ARALYDRAFT_478632 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297328626|gb|EFH59045.1| hypothetical protein ARALYDRAFT_478632 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 449

 Score =  116 bits (290), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 124/439 (28%), Positives = 180/439 (41%), Gaps = 50/439 (11%)

Query: 82  VDWNEQQQNRLIL---DNLHVQYLQSRIKNMISGNIKDVS----------NTEIPLTSGI 128
            D  E    RL L   D L    L SRI+++I  + K  S            ++ L SGI
Sbjct: 23  ADSTEDTAVRLKLAHRDTLWPNPL-SRIEDIIGADQKRHSLISRKRKFKGGVKMDLGSGI 81

Query: 129 RLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYN-------QQDPVFDPSI 179
              T  Y   + +G   +   V+VDTGS+LTWV C+     Y        +   VF    
Sbjct: 82  DYGTAQYFTEVRVGTPAKKFRVVVDTGSELTWVNCR-----YRGRGKGKVKNRRVFRAEE 136

Query: 180 SPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG- 238
           S S+K V C + TC        +   C + S P C+Y   Y DGS  +G   +E + +G 
Sbjct: 137 SKSFKTVGCFTQTCKVDLMNLFSLSTCPTPSTP-CSYDYRYADGSAAQGVFAKETITVGL 195

Query: 239 ----KASVNDFIFGCGRNNKGLFGGVS-GLMGLGRSDLSLVSQTSEIFGGLFSYCL-PST 292
               KA +   + GC  +  G     + G++GL  SD S  S  + +FG   SYCL    
Sbjct: 196 TNGRKARLRGLLVGCSSSFSGQSFQGADGVLGLAFSDFSFTSTATSLFGAKLSYCLVDHL 255

Query: 293 QDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQA----- 347
            +   S  LI G +SS     T    T  +    +  FY +N+ GISIG   L       
Sbjct: 256 SNKNISNYLIFGYSSSSTSTKTAPGRTTPLDLTLIPPFYAINIIGISIGDDMLDIPTQVW 315

Query: 348 SGFAKGGILIDSGTVITRLPPSIYSALK---AEFLKQFSGFPSAPGFSILDTCF-NLSAY 403
                GG ++DSGT +T L  + Y  +    A +L +       P    ++ CF + S +
Sbjct: 316 DATTGGGTILDSGTSLTLLAEAAYKPVVTGLARYLVELKRV--KPEGIPIEYCFSSTSGF 373

Query: 404 QEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQ 463
            E  +P +    +G A          Y V +     CL   S      T ++GN  Q+N 
Sbjct: 374 NESKLPQLTFHLKGGARFEPHRKS--YLVDAAPGVKCLGFMSAG-TPATNVVGNIMQQNY 430

Query: 464 RVIYDTKNSQLGFAGEDCS 482
              +D   S L FA   C+
Sbjct: 431 LWEFDLMASTLSFAPSTCT 449


>gi|357120129|ref|XP_003561782.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 452

 Score =  116 bits (290), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 111/383 (28%), Positives = 170/383 (44%), Gaps = 57/383 (14%)

Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNS 203
           +N+T+++DTGS+L+W+ C       ++ D  FD S S SY  V C+S  C  L       
Sbjct: 74  QNVTMVLDTGSELSWLLCN-----GSRHDAPFDASASSSYAPVPCSSPACTWLGRDLPVR 128

Query: 204 GVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGC----GRNNKGLFGG 259
             C SS+   C   +SY D S   G L  +   LG + +   +FGC      +       
Sbjct: 129 PFCDSSA---CRVSLSYADASSADGLLAADTFLLGSSPMPA-LFGCITSYSSSTDPSETP 184

Query: 260 VSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTP---- 315
            +GL+G+ R  LS V+QT+      F+YC+ + Q     G L+LGGN +    ++P    
Sbjct: 185 PTGLLGMNRGGLSFVTQTATR---RFAYCIAAGQ---GPGILLLGGNDTETPLTSPPQQQ 238

Query: 316 ITYTNMI----PNPQL-ATFYILNLTGISIGGKQLQASGF-------AKGGILIDSGTVI 363
           + YT ++    P P      Y + L GI +G   L              G  ++DSGT  
Sbjct: 239 LNYTPLVEISQPLPYFDRAAYTVQLEGIRVGSALLAIPKHLLTPDHTGAGQTMVDSGTRF 298

Query: 364 TRLPPSIYSALKAEFLKQFS-----GFPS--APGF---SILDTCF-----NLSAYQEVN- 407
           T L P  Y+ALKAEF  Q +     G      PGF      D CF      +SA      
Sbjct: 299 TFLLPDAYAALKAEFANQLTRSLDGGLAPLGEPGFVFQGAFDACFRGTEARVSAAAAGGL 358

Query: 408 IPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSY--EDETG----IIGNYQQK 461
           +P V +   G   +      ++Y V  +       +  L++   D  G    +IG++ Q+
Sbjct: 359 LPEVGLVLRGAEVVVAGAEKLLYRVPGERRGEGEGVWCLTFGSSDMAGVSAYVIGHHHQQ 418

Query: 462 NQRVIYDTKNSQLGFAGEDCSSM 484
           +  V YD +N++LGFA   C+ +
Sbjct: 419 DVWVEYDLRNARLGFAAARCADL 441


>gi|296087864|emb|CBI35120.3| unnamed protein product [Vitis vinifera]
          Length = 320

 Score =  116 bits (290), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 108/345 (31%), Positives = 160/345 (46%), Gaps = 37/345 (10%)

Query: 150 VDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSS 209
           +DT SD+ W+   PC  C      +F+   S +YK + C ++ C  +   T   GVCS  
Sbjct: 1   MDTSSDVAWI---PCNGCLGCSSTLFNSPASTTYKSLGCQAAQCKQVPKPTCGGGVCS-- 55

Query: 210 SPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRS 269
                 + ++YG GS     L ++ + L   +V  + FGC +   G      GL+GLGR 
Sbjct: 56  ------FNLTYG-GSSLAANLSQDTITLATDAVPGYSFGCIQKATGGSLPAQGLLGLGRG 108

Query: 270 DLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLAT 329
            LSL+SQT  ++   FSYCLPS +    SGSL LG      +    I YT ++ NP+  +
Sbjct: 109 PLSLLSQTQNLYQSTFSYCLPSFKSLNFSGSLRLGPVGQPKR----IKYTPLLKNPRRPS 164

Query: 330 FYILNLTGISI---------GGKQLQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLK 380
            Y +NL  + +         G      S  A  G + DSGTV TRL    Y A++  F  
Sbjct: 165 LYFVNLMAVRVGRRVVDVPPGSFTFNPSTGA--GTIFDSGTVFTRLVTPAYIAVRDAFRN 222

Query: 381 QFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDA-SQV 439
           +     +       DTC+ +     +  P +   F G   M V +      + S A S  
Sbjct: 223 RVGRNLTVTSLGGFDTCYTV----PIAAPTITFMFTG---MNVTLPPDNLLIHSTAGSTT 275

Query: 440 CLALASL--SYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
           CLA+A+   +      +I N QQ+N R++YD  NS+LG A E C+
Sbjct: 276 CLAMAAAPDNVNSVLNVIANLQQQNHRLLYDVPNSRLGVARELCT 320


>gi|168014188|ref|XP_001759635.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162689174|gb|EDQ75547.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 485

 Score =  116 bits (290), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 106/375 (28%), Positives = 159/375 (42%), Gaps = 48/375 (12%)

Query: 135 YIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQ---DPVFDPSISPSYKKVLCN 189
           Y + + +G   +   +IVDTGS +T+V C  C  C + Q   DP F P  S SY+ V CN
Sbjct: 99  YTSRVFIGTPAQEFALIVDTGSTVTYVPCSSCTHCGHHQACFDPRFKPDNSSSYQTVSCN 158

Query: 190 SSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS---VNDFI 246
           S  C         + +C +     C Y   Y + S ++G LG++ LG G  S    +  +
Sbjct: 159 SPDC--------ITKMCDARV-HQCKYERVYAEMSSSKGVLGKDLLGFGNGSRLQPHPLL 209

Query: 247 FGCGRNNKG--LFGGVSGLMGLGRSDLSLVSQT--SEIFGGLFSYCLPSTQDAGASGSLI 302
           FGC     G        G+MGLGR  LS+V Q   +      FS C     + G  GS++
Sbjct: 210 FGCETAETGDLYLQHADGIMGLGRGPLSIVDQLVGTGAMEDSFSLCYGGMDEGG--GSMV 267

Query: 303 LGG----NSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGF---AKGGI 355
           LG      + VF  S          +P  + +Y L L+ I + G  L         + G 
Sbjct: 268 LGAIPPPPAMVFAKS----------DPNRSNYYNLELSEIQVQGVSLNVPSEVFNGRLGT 317

Query: 356 LIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPG--FSILDTCF----NLSAYQEVNIP 409
           ++DSGT    LP   + A K    +Q     + PG   S  D CF    + S     + P
Sbjct: 318 VLDSGTTYAYLPDKAFDAFKDAITQQLGSLQAVPGPDPSYPDVCFAGAGSDSKALGKHFP 377

Query: 410 LVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDT 469
            V   F GN ++ +     ++         CL       +D T ++G    +N  V YD 
Sbjct: 378 PVDFVFSGNQKVFLAPENYLFKHTKVPGAYCLGF--FKNQDATTLLGGIVVRNTLVTYDR 435

Query: 470 KNSQLGFAGEDCSSM 484
            N Q+GF   +C+++
Sbjct: 436 ANHQIGFFKTNCTNL 450


>gi|115448347|ref|NP_001047953.1| Os02g0720500 [Oryza sativa Japonica Group]
 gi|113537484|dbj|BAF09867.1| Os02g0720500, partial [Oryza sativa Japonica Group]
          Length = 172

 Score =  116 bits (290), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 71/183 (38%), Positives = 103/183 (56%), Gaps = 18/183 (9%)

Query: 303 LGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL--QASGFAKGGILIDSG 360
           LGG SS    ST    T ++      T+YI+ L GIS+GG+ L   AS FA G + +D+G
Sbjct: 4   LGGPSSTAGFST----TPLLTASNDPTYYIVMLAGISVGGQPLSIDASVFASGAV-VDTG 58

Query: 361 TVITRLPPSIYSALKAEFLKQFS--GFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGN 418
           TV+TRLPP+ YSAL++ F    +  G+PSAP   ILDTC++ + Y  V +P + + F G 
Sbjct: 59  TVVTRLPPTAYSALRSAFRAAMAPYGYPSAPATGILDTCYDFTRYGTVTLPTISIAFGGG 118

Query: 419 AEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAG 478
           A M +  +GI+       +  CLA A    + +  I+GN QQ++  V +D   S +GF  
Sbjct: 119 AAMDLGTSGIL-------TSGCLAFAPTGGDSQASILGNVQQRSFEVRFD--GSTVGFMP 169

Query: 479 EDC 481
             C
Sbjct: 170 ASC 172


>gi|356503843|ref|XP_003520712.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 474

 Score =  116 bits (290), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 112/372 (30%), Positives = 161/372 (43%), Gaps = 48/372 (12%)

Query: 149 IVDTGSDLTWVQCQP---CKSCY-----NQQDPVFDPSISPSYKKVLCNSSTCHALEFAT 200
           ++DTGS L W  C     C  C        + P F P  S + K + C +  C  + F +
Sbjct: 108 VLDTGSSLVWFPCTSRYLCSHCNFPNIDTTKIPTFIPKNSSTAKLLGCRNPKCGYI-FGS 166

Query: 201 GNSGVCSSSSPPDCN-------YFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNN 253
                C    P   N       Y + YG GS T G L  ++L     +V  F+ GC   +
Sbjct: 167 DVQFRCPQCKPESQNCSLTCPAYIIQYGLGS-TAGFLLLDNLNFPGKTVPQFLVGCSILS 225

Query: 254 KGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQ--DAGASGSLILGGNSSVFK 311
                  SG+ G GR   SL SQ +      FSYCL S +  D   S  L+L  +S+   
Sbjct: 226 ---IRQPSGIAGFGRGQESLPSQMNL---KRFSYCLVSHRFDDTPQSSDLVLQISSTGDT 279

Query: 312 NSTPITYTNM-----IPNPQLATFYILNLTGISIGGKQ-------LQASGFAKGGILIDS 359
            +  ++YT         NP    +Y L L  + +GGK        L+      GG ++DS
Sbjct: 280 KTNGLSYTPFRSNPSTNNPAFKEYYYLTLRKVIVGGKDVKIPYTFLEPGSDGNGGTIVDS 339

Query: 360 GTVITRLPPSIYSALKAEFLKQ----FSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEF 415
           G+  T +   +Y+ +  EF+KQ    +S    A   S L  CFN+S  + V  P +  +F
Sbjct: 340 GSTFTFMERPVYNLVAQEFVKQLEKNYSRAEDAETQSGLSPCFNISGVKTVTFPELTFKF 399

Query: 416 EGNAEMTVDVTGIVYFVKSDASQVCLALAS---LSYEDETG---IIGNYQQKNQRVIYDT 469
           +G A+MT  +      V  DA  VCL + S         TG   I+GNYQQ+N  + YD 
Sbjct: 400 KGGAKMTQPLQNYFSLV-GDAEVVCLTVVSDGGAGPPKTTGPAIILGNYQQQNFYIEYDL 458

Query: 470 KNSQLGFAGEDC 481
           +N + GF    C
Sbjct: 459 ENERFGFGPRSC 470


>gi|356553263|ref|XP_003544977.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 445

 Score =  115 bits (289), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 112/391 (28%), Positives = 172/391 (43%), Gaps = 70/391 (17%)

Query: 132 TLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPV--FDPSISPSYKKVL 187
           ++  + T+ +G   +   +++DTGS L+W+QC      +N+  P   FDPS+S S+  + 
Sbjct: 85  SMALVVTLPIGTPPQPQQMVLDTGSQLSWIQC------HNKTPPTASFDPSLSSSFYVLP 138

Query: 188 CNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFI 246
           C    C            C  +    C+Y   Y DG+Y  G L RE L    + +    I
Sbjct: 139 CTHPLCKPRVPDFTLPTTCDQNR--LCHYSYFYADGTYAEGNLVREKLAFSPSQTTPPLI 196

Query: 247 FGCG---RNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAG----ASG 299
            GC    R+ +G+ G     M LGR      ++ ++     FSYC+P+ Q A      +G
Sbjct: 197 LGCSSESRDARGILG-----MNLGRLSFPFQAKVTK-----FSYCVPTRQPANNNNFPTG 246

Query: 300 SLILGGNSSVFKNSTPITYTNMIPNPQLATF-------YILNLTGISIGGKQL------- 345
           S  LG N     NS    Y +M+  PQ           Y + + GI IGG++L       
Sbjct: 247 SFYLGNN----PNSARFRYVSMLTFPQSQRMPNLDPLAYTVPMQGIRIGGRKLNIPPSVF 302

Query: 346 QASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGF---SILDTCFNLSA 402
           + +    G  ++DSG+  T L    Y  ++ E ++   G     G+    + D CF+ +A
Sbjct: 303 RPNAGGSGQTMVDSGSEFTFLVDVAYDRVREEIIRVL-GPRVKKGYVYGGVADMCFDGNA 361

Query: 403 YQEVNIPL--VKMEFEGNAEMTV-------DVTGIVYFVKSDASQVCLALASLSYEDETG 453
             E+   L  V  EFE   E+ V       DV G V+ V    S+  L  AS        
Sbjct: 362 -MEIGRLLGDVAFEFEKGVEIVVPKERVLADVGGGVHCVGIGRSER-LGAAS-------N 412

Query: 454 IIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
           IIGN+ Q+N  V +D  N ++GF   DCS +
Sbjct: 413 IIGNFHQQNLWVEFDLANRRIGFGVADCSRL 443


>gi|238011160|gb|ACR36615.1| unknown [Zea mays]
          Length = 461

 Score =  115 bits (288), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 109/431 (25%), Positives = 172/431 (39%), Gaps = 84/431 (19%)

Query: 122 IPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSC-----------Y 168
           +PL+SG    T  Y     +G   R   ++ DTGSDLTWV+C+   +            Y
Sbjct: 42  MPLSSGAYTGTGQYFVRFRVGTPARPFLLVADTGSDLTWVKCRRHAAPAPAPAPAPGYNY 101

Query: 169 NQQDP-----------------VFDPSISPSYKKVLCNSSTCHA-LEFATGNSGVCSSSS 210
               P                 VF P  S ++  + C+S TC A L F+      C +  
Sbjct: 102 GYGAPASNDSSSVSAAASSPARVFRPDRSRTWAPIPCSSDTCTASLPFSLA---ACPTPG 158

Query: 211 PPDCNYFVSYGDGSYTRGELGREHLGLG-----------KASVNDFIFGCGRNNKGL-FG 258
            P C Y   Y DGS  RG +G +   +            +A +   + GC  +  G  F 
Sbjct: 159 SP-CAYEYRYKDGSAARGTVGTDSATIALSGRRAGKKQRRAKLRGVVLGCTTSYTGESFL 217

Query: 259 GVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL------------------PSTQDAGASGS 300
              G++ LG S++S  S+ +  FGG FSYCL                  P+   A AS +
Sbjct: 218 ASDGVLSLGYSNVSFASRAAARFGGRFSYCLVDHLAPRNATSYLTFGPNPAVSSASASRT 277

Query: 301 LILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA-----KGGI 355
              G  ++     TP+   +     ++  FY + + G+S+ G+ L+           GG 
Sbjct: 278 ACAGSAAAPGARQTPLLLDH-----RMRPFYAVAVNGVSVDGELLRIPRLVWDVQKGGGA 332

Query: 356 LIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAY-----QEVNIPL 410
           ++DSGT +T L    Y A+ A   K+  G P        D C+N ++        V +P 
Sbjct: 333 ILDSGTSLTVLVSPAYRAVVAALGKKLVGLPRV-AMDPFDYCYNWTSPLTGEDLAVAVPA 391

Query: 411 VKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTK 470
           + + F G+A +        Y + +     C+ L    +   + +IGN  Q+     +D K
Sbjct: 392 LAVHFAGSARLQPPPKS--YVIDAAPGVKCIGLQEGDWPGVS-VIGNILQQEHLWEFDLK 448

Query: 471 NSQLGFAGEDC 481
           N +L F    C
Sbjct: 449 NRRLRFKRSRC 459


>gi|88174591|gb|ABD39370.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
          Length = 321

 Score =  115 bits (288), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 93/326 (28%), Positives = 158/326 (48%), Gaps = 30/326 (9%)

Query: 135 YIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
           Y+ ++ LG   +   V +DTGS  +WV C+ C  C+      F  S S +  KV C +S 
Sbjct: 1   YVTSVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNPR-TFLQSRSTTCAKVSCGTSM 58

Query: 193 CHALEFATGNSGVCSSSSP-PDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCG 250
           C       G+   C  S   PDC + VSY DGS + G L ++ L       +  F FGC 
Sbjct: 59  C----LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPSFTFGCN 114

Query: 251 RNNKGL--FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQD-----AGASGSLIL 303
            ++ G   FG V GL+G+G   +S++ Q+S  F G FSYCLP  +      +  +G   L
Sbjct: 115 LDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDG-FSYCLPLQKSERGFFSKTTGYFSL 173

Query: 304 GGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQ--LQASGFAKGGILIDSGT 361
           G  ++     T + YT M+   +    + ++L  IS+ G++  L  S F++ G++ DSG+
Sbjct: 174 GKVAT----RTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDSGS 229

Query: 362 VITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEM 421
            ++ +P    S L ++ +++      A        C+++ +  E ++P + + F+  A  
Sbjct: 230 ELSYIPDRALSVL-SQRIRELLLRRGAAEEESERNCYDMRSVDEGDMPAISLHFDDGARF 288

Query: 422 TVDVTGIVYFVKSDASQ---VCLALA 444
            + + G+  FV+    +    CLA A
Sbjct: 289 DLGIHGV--FVERSVQEQDVWCLAFA 312


>gi|26451756|dbj|BAC42973.1| unknown protein [Arabidopsis thaliana]
          Length = 442

 Score =  115 bits (288), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 104/368 (28%), Positives = 171/368 (46%), Gaps = 49/368 (13%)

Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNS 203
           +N+++++DTGS+L+W+ C+   +  +    VF+P  S +Y  V C+S  C      T + 
Sbjct: 76  QNISMVLDTGSELSWLHCKKSPNLGS----VFNPVSSSTYSPVPCSSPICRT---RTRDL 128

Query: 204 GVCSSSSPPD--CNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCG----RNNKGLF 257
            + +S  P    C+  +SY D +   G L  E   +G  +    +FGC      +N    
Sbjct: 129 PIPASCDPKTHLCHVAISYADATSIEGNLAHETFVIGSVTRPGTLFGCMDSGLSSNSEED 188

Query: 258 GGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPIT 317
              +GLMG+ R  LS V+Q   +    FSYC+     +G+  S+ L    + +    PI 
Sbjct: 189 AKSTGLMGMNRGSLSFVNQ---LGFSKFSYCI-----SGSDSSVFLLLGDASYSWLGPIQ 240

Query: 318 YTNMI----PNPQLATF-YILNLTGISIGGK--QLQASGFA-----KGGILIDSGTVITR 365
           YT ++    P P      Y + L GI +G K   L  S F       G  ++DSGT  T 
Sbjct: 241 YTPLVLQSTPLPYFDRVAYTVQLEGIRVGSKILSLPKSVFVPDHTGAGQTMVDSGTQFTF 300

Query: 366 LPPSIYSALKAEFLKQFSG---FPSAPGFSI---LDTCFNLSAYQEVN---IPLVKMEFE 416
           L   +Y+ALK EF+ Q          P F     +D C+ + +    N   +P+V + F 
Sbjct: 301 LMGPVYTALKNEFITQTKSVLRLVDDPDFVFQGTMDLCYKVGSTTRPNFSGLPMVSLMFR 360

Query: 417 GNAEMTVDVTGIVYFVKSDASQVCLALASLSYED------ETGIIGNYQQKNQRVIYDTK 470
           G AEM+V    ++Y V    S+    +   ++ +      E  +IG++ Q+N  + +D  
Sbjct: 361 G-AEMSVSGQKLLYRVNGAGSEGKEEVYCFTFGNSDLLGIEAFVIGHHHQQNVWMEFDLA 419

Query: 471 NSQLGFAG 478
            S++GFAG
Sbjct: 420 KSRVGFAG 427


>gi|88174577|gb|ABD39363.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
           Group]
          Length = 321

 Score =  115 bits (288), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 93/326 (28%), Positives = 159/326 (48%), Gaps = 30/326 (9%)

Query: 135 YIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
           Y+ ++ LG   +   V +DTGS ++WV C+ C  C+      F  S S +  KV C +S 
Sbjct: 1   YVTSVGLGTPAKTQIVEIDTGSSISWVFCE-CDGCHTNPR-TFLQSRSTTCAKVSCGTSM 58

Query: 193 CHALEFATGNSGVCSSSSP-PDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCG 250
           C       G+   C  S   PDC + VSY DGS + G L ++ L       +  F FGC 
Sbjct: 59  C----LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPSFTFGCN 114

Query: 251 RNNKGL--FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQD-----AGASGSLIL 303
            ++ G   FG V GL+G+G   +S++ Q+S  F G FSYCLP  +      +  +G   L
Sbjct: 115 LDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDG-FSYCLPLQKSERGFFSKTTGYFSL 173

Query: 304 GGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQ--LQASGFAKGGILIDSGT 361
           G  ++     T + YT M+   +    + ++L  IS+ G++  L  S F++ G++ DSG+
Sbjct: 174 GKVAT----RTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDSGS 229

Query: 362 VITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEM 421
            ++ +P    S L ++ +++      A        C+++ +  E ++P + + F+  A  
Sbjct: 230 ELSYIPDRALSVL-SQRIRELLLRRGAAEEESERNCYDMRSVDEGDMPAISLHFDDGARF 288

Query: 422 TVDVTGIVYFVKSDASQ---VCLALA 444
            +  +G+  FV+    +    CLA A
Sbjct: 289 DLGSSGV--FVERSVQEQDVWCLAFA 312


>gi|242065058|ref|XP_002453818.1| hypothetical protein SORBIDRAFT_04g018520 [Sorghum bicolor]
 gi|241933649|gb|EES06794.1| hypothetical protein SORBIDRAFT_04g018520 [Sorghum bicolor]
          Length = 490

 Score =  115 bits (288), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 113/417 (27%), Positives = 184/417 (44%), Gaps = 52/417 (12%)

Query: 99  VQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLN--YIATIELGG--RNMTVIVDTGS 154
            ++L +  ++ +  + + +   ++PL  G+ L T    Y   IE+G   +   V VDTGS
Sbjct: 48  AEHLAALRRHDVGRHGRLLGAVDLPL-GGVGLPTATGLYYTQIEIGSPSKGYYVQVDTGS 106

Query: 155 DLTWVQCQPCKSCYNQ-----QDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSS 209
           D+ WV C  C  C        +   +DP+ S +   V C+   C A     G    C S+
Sbjct: 107 DILWVNCIRCDGCPTTSGLGIELTQYDPAGSGT--TVGCDQEFCVA-NSPNGLPPACPST 163

Query: 210 SPPDCNYFVSYGDGSYTRGELGREHLGLGKASVN--------DFIFGCGRNNKGLFG--- 258
           S P C + ++YGDGS T G    + +   + S N           FGCG    G  G   
Sbjct: 164 SSP-CQFRIAYGDGSSTTGFYVSDSVQYNQVSGNGQTTPSNASITFGCGAQLGGDLGSSS 222

Query: 259 -GVSGLMGLGRSDLSLVSQ--TSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTP 315
             + G++G G++D S++SQ   +     +F++CL +    G      +G        +TP
Sbjct: 223 QALDGILGFGQADSSMLSQLAAARKVRKIFAHCLDTVHGGGI---FAIGNVVQPKVKTTP 279

Query: 316 ITYTNMIPNPQLATFYILNLTGISIGGKQLQ--ASGFAKG---GILIDSGTVITRLPPSI 370
           +         Q  T Y +NL GIS+GG  LQ  +S F  G   G +IDSGT +  LP  +
Sbjct: 280 LV--------QNVTHYNVNLQGISVGGATLQLPSSTFDSGDSKGTIIDSGTTLAYLPREV 331

Query: 371 YSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVY 430
           Y  L      ++           +  CF  S   +   P+V   FEG  E+T++V    Y
Sbjct: 332 YRTLLTAVFDKYQDLALHNYQDFV--CFQFSGSIDDGFPVVTFSFEG--EITLNVYPHDY 387

Query: 431 FVKSDASQVCLALASLSYEDETG----IIGNYQQKNQRVIYDTKNSQLGFAGEDCSS 483
             +++    C+       + + G    ++G+    N+ V+YD +   +G+A  +CSS
Sbjct: 388 LFQNENDLYCMGFLDGGVQTKDGKDMVLLGDLVLSNKLVVYDLEKQVIGWADYNCSS 444


>gi|194707292|gb|ACF87730.1| unknown [Zea mays]
          Length = 216

 Score =  115 bits (287), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 77/222 (34%), Positives = 117/222 (52%), Gaps = 16/222 (7%)

Query: 271 LSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATF 330
           +SL+SQT   + G+FSYCLPS +    SGSL LG      +N   + YT ++ NP   + 
Sbjct: 1   MSLLSQTGSRYNGVFSYCLPSYRSYYFSGSLRLGAAGQP-RN---VRYTPLLTNPHRPSL 56

Query: 331 YILNLTGISIGGK--QLQASGFA-----KGGILIDSGTVITRLPPSIYSALKAEFLKQFS 383
           Y +N+TG+S+G    ++ A  FA       G +IDSGTVITR    +Y+AL+ EF +Q +
Sbjct: 57  YYVNVTGLSVGRTWVKVPAGSFAFDPATGAGTVIDSGTVITRWTAPVYAALREEFRRQVA 116

Query: 384 GFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV-CLA 442
                      DTCFN         P V +  +G  ++T+ +   +  + S A+ + CLA
Sbjct: 117 APSGYTSLGAFDTCFNTDEVAAGGAPPVTLHMDGGVDLTLPMENTL--IHSSATPLACLA 174

Query: 443 LASLS--YEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
           +A           ++ N QQ+N RV+ D   S++GFA E C+
Sbjct: 175 MAEAPQNVNAVVNVVANLQQQNVRVVVDVAGSRVGFAREPCN 216


>gi|308813706|ref|XP_003084159.1| Aspartyl protease (ISS) [Ostreococcus tauri]
 gi|116056042|emb|CAL58575.1| Aspartyl protease (ISS) [Ostreococcus tauri]
          Length = 478

 Score =  115 bits (287), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 105/373 (28%), Positives = 176/373 (47%), Gaps = 36/373 (9%)

Query: 130 LQTLNYIATIELGG-RNMTVIVDTGSDLTWVQCQPCKSC-YNQQDPVFDPSISPSYKKVL 187
           L+T   +A+ EL G +   +IVDTGS  T++ C+ C SC  ++    +D   S  + +V 
Sbjct: 30  LETGVLVASFELAGAQTFELIVDTGSSRTYLPCKGCASCGAHEAGRYYDYDASADFSRVE 89

Query: 188 CNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVN-DFI 246
           C  S C  +    G SGV        C Y V Y +GS + G L R+ + LG +  N   +
Sbjct: 90  C--SACAGIGGKCGTSGV--------CRYDVHYLEGSGSEGYLVRDVVSLGGSVGNATVV 139

Query: 247 FGCGRNNKGLFG--GVSGLMGLGRSDLSLVSQ--TSEIFGGLFSYCLPSTQDAGAS--GS 300
           FGC     G        GL G GR   +L +Q  ++ +   LFS C+   +       G 
Sbjct: 140 FGCEERELGSIKQQSADGLFGFGRQAYALRAQLASASVIDDLFSMCVEGYEKLSGEHVGG 199

Query: 301 LILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGGI-LIDS 359
           L+  GN     ++  + YT M+ +   A +Y +  T  ++G   ++ S   +G + +IDS
Sbjct: 200 LLTLGNFDFGADAPALVYTPMVSS---AMYYQVTTTSWTLGNSVVEGS---RGVLTIIDS 253

Query: 360 GTVITRLPPSIYSAL--KAEFLKQFSGFPS-APGFSILDTCFNLS---AYQEVN--IPLV 411
           GT  T +P ++++     AE   + SG    AP     D CF  S    +  V+   P +
Sbjct: 254 GTSYTYVPGNMHARFLQLAEDAARESGLEKVAPPEDYPDLCFGNSGGLGWSTVSEYFPAL 313

Query: 412 KMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKN 471
           K+E+ G+A +T+     +Y+ + +AS  C+ +  L ++D   ++G    +N    +D   
Sbjct: 314 KIEYHGSARLTLSPETYLYWHQKNASAFCVGI--LEHDDNRILLGQITMRNTFTEFDVAR 371

Query: 472 SQLGFAGEDCSSM 484
           SQ+G A  +C  +
Sbjct: 372 SQVGMASANCEML 384


>gi|88174558|gb|ABD39354.1| chloroplast nucleoid DNA-binding protein [Oryza meridionalis]
          Length = 321

 Score =  115 bits (287), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 95/326 (29%), Positives = 157/326 (48%), Gaps = 30/326 (9%)

Query: 135 YIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
           Y+ ++ LG   +   V +DTGS  +WV C+ C  C+      F  S S +  KV C +S 
Sbjct: 1   YVISVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNPR-TFLQSRSTTCAKVSCGTSM 58

Query: 193 CHALEFATGNSGVCSSSSP-PDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCG 250
           C       G+   C  S   PDC + VSY DGS + G L ++ L       +  F FGC 
Sbjct: 59  C----LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPGFTFGCN 114

Query: 251 RNNKGL--FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQD-----AGASGSLIL 303
            ++ G   FG V GL+G+G   +S++ Q+S  F G FSYCLP         +  +G   L
Sbjct: 115 LDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDG-FSYCLPLQMSERGFFSKTTGYFSL 173

Query: 304 GGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQ--LQASGFAKGGILIDSGT 361
           G  ++     T + YT M+   +    + ++LT IS+ G++  L  S F++ G++ DSG+
Sbjct: 174 GKVAT----RTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSVFSRKGVVFDSGS 229

Query: 362 VITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEM 421
            ++ +P    S L+    +      +A   S  + C+++ +  E ++P + + F+  A  
Sbjct: 230 ELSYIPDRALSVLRQRIRELLLKRGAAEEESERN-CYDMRSVDEGDMPAISLHFDDGARF 288

Query: 422 TVDVTGIVYFVKSDASQ---VCLALA 444
            +   G+  FV+    +    CLA A
Sbjct: 289 DLGSHGV--FVERSVQEQDVWCLAFA 312


>gi|302769978|ref|XP_002968408.1| hypothetical protein SELMODRAFT_89951 [Selaginella moellendorffii]
 gi|300164052|gb|EFJ30662.1| hypothetical protein SELMODRAFT_89951 [Selaginella moellendorffii]
          Length = 492

 Score =  115 bits (287), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 101/368 (27%), Positives = 163/368 (44%), Gaps = 41/368 (11%)

Query: 135 YIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
           Y + +++G      ++IVDTGS +T+V C  C  C N QDP F P++S SYK + C S  
Sbjct: 35  YTSRVKIGTPPHEFSLIVDTGSTVTYVPCSSCTHCGNHQDPRFSPALSSSYKPLECGSEC 94

Query: 193 CHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASV---NDFIFGC 249
                    ++G C  S      Y   Y + S + G LG++ +G   +S       +FGC
Sbjct: 95  ---------STGFCDGSR----KYQRQYAEKSTSSGVLGKDVIGFSNSSDLGGQRLVFGC 141

Query: 250 GRNNKG-LFGGVS-GLMGLGRSDLSLVSQTSE--IFGGLFSYCLPSTQDAGASGSLILGG 305
                G L+   + G++GLGR  LS++ Q  E      +FS C     + G  G++ILGG
Sbjct: 142 ETAETGDLYDQTADGIIGLGRGPLSIIDQLVEKNAMEDVFSLCYGGMDEGG--GAMILGG 199

Query: 306 NSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGK--QLQASGF-AKGGILIDSGTV 362
               F+    + +T    +P  + +Y L L GI +GG   +L+   F  K G ++DSGT 
Sbjct: 200 ----FQPPKDMVFT--ASDPHRSPYYNLMLKGIRVGGSPLRLKPEVFDGKYGTVLDSGTT 253

Query: 363 ITRLPPSIYSALKAEFLKQFSGFPSAPG--FSILDTCFNLSAYQEVNI----PLVKMEFE 416
               P + + A K+   +Q       PG      D C+  +     N+    P V   F 
Sbjct: 254 YAYFPGAAFQAFKSAVKEQVGSLKEVPGPDEKFKDICYAGAGTNVSNLSQFFPSVDFVFG 313

Query: 417 GNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGF 476
               +T+     ++     +   CL +      D T ++G    +N  V Y+   + +GF
Sbjct: 314 DGQSVTLSPENYLFRHTKISGAYCLGV--FENGDPTTLLGGIIVRNMLVTYNRGKASIGF 371

Query: 477 AGEDCSSM 484
               C+ +
Sbjct: 372 LKTKCNDL 379


>gi|449458942|ref|XP_004147205.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
 gi|449505000|ref|XP_004162350.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
          Length = 480

 Score =  115 bits (287), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 120/416 (28%), Positives = 176/416 (42%), Gaps = 83/416 (19%)

Query: 134 NYIATIELGGRN--MTVIVDTGSDLTWVQCQP-----CKSCYNQQDPVFDPSISPSYKKV 186
           +Y  +  LG  +  +++ +DTGSDL W  C P     C+     Q P+  P I+ + K V
Sbjct: 75  DYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQSPL--PKIA-NNKSV 131

Query: 187 -------------------LCNSSTC--HALEFATGNSGVCSS-SSPPDCNYFVSYGDGS 224
                              LC  S C   ++E +      CSS S PP   ++ +YGDGS
Sbjct: 132 SCSAAACSAAHGGSLSASHLCAISRCPLESIEISE-----CSSFSCPP---FYYAYGDGS 183

Query: 225 YTRGELGREHLGLGKAS------VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTS 278
                L R+ L L   +      V +F FGC        G   G+ G GR  LS+ SQ +
Sbjct: 184 LV-ARLYRDSLSLPTPAPSPPINVRNFTFGCAHTT---LGEPVGVAGFGRGVLSMPSQLA 239

Query: 279 EI---FGGLFSYCLPSTQDAG----ASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFY 331
                 G  FSYCL S   A         LILG     +   T   YT+++ NP+   FY
Sbjct: 240 TFSPQLGNRFSYCLVSHSFAADRVRRPSPLILG---RYYTGETEFIYTSLLENPKHPYFY 296

Query: 332 ILNLTGISIGGKQLQASGF-------AKGGILIDSGTVITRLPPSIYSALKAEFLKQFSG 384
            + L GIS+G  ++ A  F         GG+++DSGT  T LP  +Y ++ AEF  +   
Sbjct: 297 SVGLAGISVGNIRIPAPEFLTKVDEGGSGGVVVDSGTTFTMLPAGLYESVVAEFENRTGK 356

Query: 385 FPSAPGFSILDTCFNLSAYQE--VNIPLVKMEFEGNAEMTVDVTGIVYF---------VK 433
             +       +T  +   Y E  V +P V + F G     V      ++         V 
Sbjct: 357 VANRARRIEENTGLSPCYYYENSVGVPRVVLHFVGEKSNVVLPRKNYFYEFLDGGDGVVG 416

Query: 434 SDASQVCLALASLSYEDETG-----IIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
                 CL L +   E E        +GNYQQ+   V+YD + +++GFA   CS++
Sbjct: 417 RKRKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCSTL 472


>gi|356568507|ref|XP_003552452.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 481

 Score =  115 bits (287), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 110/380 (28%), Positives = 169/380 (44%), Gaps = 49/380 (12%)

Query: 135 YIATIELGGRNMTVIVDTGSDLTWVQCQPCKSCYNQQD-----PVFDPSISPSYKKVLCN 189
           Y   I LG ++  V VDTGSD  WV C  C +C  +        ++DP++S + K V C+
Sbjct: 76  YYTKIGLGPKDYYVQVDTGSDTLWVNCVGCTACPKKSGLGMDLTLYDPNLSKTSKAVPCD 135

Query: 190 SSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA-----SVND 244
              C     +T +  +   +    C Y ++YGDGS T G   ++ L   +      +V D
Sbjct: 136 DEFCT----STYDGQISGCTKGMSCPYSITYGDGSTTSGSYIKDDLTFDRVVGDLRTVPD 191

Query: 245 ---FIFGCGRNNKGLFGGVS-----GLMGLGRSDLSLVSQTSEI--FGGLFSYCLPSTQD 294
               IFGCG    G     +     G++G G+++ S++SQ +       +FS+CL S   
Sbjct: 192 NTSVIFGCGSKQSGTLSSTTDTSLDGIIGFGQANSSVLSQLAAAGKVKRIFSHCLDSI-- 249

Query: 295 AGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ-----ASG 349
              SG    GG  ++ +   P   T   P  Q    Y + L  I + G  +Q        
Sbjct: 250 ---SG----GGIFAIGEVVQPKVKTT--PLLQGMAHYNVVLKDIEVAGDPIQLPSDILDS 300

Query: 350 FAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILD--TCFNLSAYQEVN 407
            +  G +IDSGT +  LP SIY  L  + L Q SG      + + D  TCF+ S  + V+
Sbjct: 301 SSGRGTIIDSGTTLAYLPVSIYDQLLEKILAQRSGMKL---YLVEDQFTCFHYSDEESVD 357

Query: 408 --IPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYED--ETGIIGNYQQKNQ 463
              P VK  FE    +T      ++  K D   V    +    +D  E  ++G+    N+
Sbjct: 358 DLFPTVKFTFEEGLTLTTYPRDYLFLFKEDMWCVGWQKSMAQTKDGKELILLGDLVLANK 417

Query: 464 RVIYDTKNSQLGFAGEDCSS 483
            V+YD  N  +G+A  +CSS
Sbjct: 418 LVVYDLDNMAIGWADYNCSS 437


>gi|242053991|ref|XP_002456141.1| hypothetical protein SORBIDRAFT_03g031170 [Sorghum bicolor]
 gi|241928116|gb|EES01261.1| hypothetical protein SORBIDRAFT_03g031170 [Sorghum bicolor]
          Length = 519

 Score =  115 bits (287), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 112/437 (25%), Positives = 178/437 (40%), Gaps = 88/437 (20%)

Query: 122 IPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDP------ 173
           +PL+SG    T  Y     +G   R   ++ DTGSDLTWV+C   +  ++   P      
Sbjct: 94  MPLSSGAYTGTGQYFVRFRVGTPARPFLLVADTGSDLTWVKCH--RHDHDAPAPGYGYAA 151

Query: 174 -----------------------VFDPSISPSYKKVLCNSSTCHA-LEFATGNSGVCSSS 209
                                  VF P  S ++  + C+S TC A L F+      C + 
Sbjct: 152 PASNDSSTSSLSAAAASSSSHARVFRPDRSRTWAPIPCSSDTCTASLPFSL---AACPTP 208

Query: 210 SPPDCNYFVSYGDGSYTRGELGREHLGLG-----------KASVNDFIFGCGRNNKG-LF 257
             P C Y   Y DGS  RG +G +   +            +A +   + GC  +  G  F
Sbjct: 209 GSP-CAYDYRYKDGSAARGTVGTDSATIALSGRGAKKKQRQAKLRGVVLGCTTSYTGDSF 267

Query: 258 GGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL-PSTQDAGASGSLILGGNSSVFKNSTPI 316
               G++ LG S++S  S+ +  FGG FSYCL        A+  L  G N +V  +S+P 
Sbjct: 268 LASDGVLSLGYSNISFASRAAARFGGRFSYCLVDHLAPRNATSYLTFGPNPAV--SSSPP 325

Query: 317 TYTN---------------------MIPNPQLATFYILNLTGISIGGKQLQASGF----A 351
           + T                      ++ + ++  FY + + GIS+ G+ L+        A
Sbjct: 326 SKTACAGGGSPAAAPPGPGGARQTPLLLDHRMRPFYAVTVNGISVDGELLRIPRLVWDVA 385

Query: 352 K-GGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQ-----E 405
           K GG ++DSGT +T L    Y A+ A   K+ +G P        D C+N ++        
Sbjct: 386 KGGGAILDSGTSLTVLVSPAYRAVVAALNKKLAGLPRV-TMDPFDYCYNWTSPSTGEDLT 444

Query: 406 VNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRV 465
           V +P + + F G+A +        Y + +     C+ L    +     +IGN  Q+    
Sbjct: 445 VAMPELAVHFAGSARLQPPAKS--YVIDAAPGVKCIGLQEGEWPG-VSVIGNILQQEHLW 501

Query: 466 IYDTKNSQLGFAGEDCS 482
            +D KN +L F    C+
Sbjct: 502 EFDLKNRRLRFKRSRCT 518


>gi|125528511|gb|EAY76625.1| hypothetical protein OsI_04577 [Oryza sativa Indica Group]
          Length = 492

 Score =  115 bits (287), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 96/337 (28%), Positives = 154/337 (45%), Gaps = 28/337 (8%)

Query: 150 VDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSS 209
           +D  SDL W  C             F+P  S +   V C    C   +FA    G  + +
Sbjct: 117 LDISSDLVWTACGATAP--------FNPVRSTTVADVPCTDDACQ--QFAPQTCGAGAGA 166

Query: 210 SPPDCNYFVSYGDGSY-TRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGR 268
              +C Y   YG G+  T G LG E    G   ++  +FGCG  N G F GVSG++GLGR
Sbjct: 167 GSSECAYTYMYGGGAANTTGLLGTEAFTFGDTRIDGVVFGCGLQNVGDFSGVSGVIGLGR 226

Query: 269 SDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLA 328
            +LSLVSQ        FSY   +  D+  + S IL G+ +  + S  ++ T ++ +    
Sbjct: 227 GNLSLVSQLQV---DRFSYHF-APDDSVDTQSFILFGDDATPQTSHTLS-TRLLASDANP 281

Query: 329 TFYILNLTGISIGGKQLQ-ASGF-------AKGGILIDSGTVITRLPPSIYSALKAEFLK 380
           + Y + L GI + GK L   SG          GG+ +    ++T L  + Y  L+     
Sbjct: 282 SLYYVELAGIQVDGKDLAIPSGTFDLRNKDGSGGVFLSITDLVTVLEEAAYKPLRQAVAS 341

Query: 381 QFSGFPSAPGFSI-LDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV 439
           +  G P+  G ++ LD C+   +  +  +P + + F G A M +++ G  +++ S     
Sbjct: 342 KI-GLPAVNGSALGLDLCYTGESLAKAKVPSMALVFAGGAVMELEL-GNYFYMDSTTGLA 399

Query: 440 CLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGF 476
           CL +   S  D + ++G+  Q    ++YD   S+L F
Sbjct: 400 CLTILPSSAGDGS-VLGSLIQVGTHMMYDINGSKLVF 435


>gi|16209647|gb|AAL14384.1| AT3g52500/F22O6_120 [Arabidopsis thaliana]
          Length = 469

 Score =  115 bits (287), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 117/393 (29%), Positives = 176/393 (44%), Gaps = 59/393 (15%)

Query: 135 YIATIELGGRNMTV--IVDTGSDLTWVQCQP---CKSC-YNQQDPVFDPSISP----SYK 184
           Y  ++  G  + T+  + DTGS L  + C     C  C ++  DP   P   P    S K
Sbjct: 90  YSVSLSFGTPSQTIPFVFDTGSSLVCLPCTSRYLCSGCDFSGLDPTLIPRFIPKNSSSSK 149

Query: 185 KVLCNSSTCHAL-------EFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL 237
            + C S  C  L            N+  C+   PP   Y + YG GS T G L  E L  
Sbjct: 150 IIGCQSPKCQFLYGPNVQCRGCDPNTRNCTVGCPP---YILQYGLGS-TAGVLITEKLDF 205

Query: 238 GKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQ--DA 295
              +V DF+ GC   +       +G+ G GR  +SL SQ +      FS+CL S +  D 
Sbjct: 206 PDLTVPDFVVGCSIISTRQ---PAGIAGFGRGPVSLPSQMNL---KRFSHCLVSRRFDDT 259

Query: 296 GASGSLIL----GGNSSVFKNSTP-ITYTNMIPNPQLAT-----FYILNLTGISIGGKQL 345
             +  L L    G NS    + TP +TYT    NP ++      +Y LNL  I +G K +
Sbjct: 260 NVTTDLDLDTGSGHNSG---SKTPGLTYTPFRKNPNVSNKAFLEYYYLNLRRIYVGRKHV 316

Query: 346 Q------ASGF-AKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI---LD 395
           +      A G    GG ++DSG+  T +   ++  +  EF  Q S +           L 
Sbjct: 317 KIPYKYLAPGTNGDGGSIVDSGSTFTFMERPVFELVAEEFASQMSNYTREKDLEKETGLG 376

Query: 396 TCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETG-- 453
            CFN+S   +V +P +  EF+G A++ + ++    FV  +   VCL + S    + +G  
Sbjct: 377 PCFNISGKGDVTVPELIFEFKGGAKLELPLSNYFTFV-GNTDTVCLTVVSDKTVNPSGGT 435

Query: 454 ----IIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
               I+G++QQ+N  V YD +N + GFA + CS
Sbjct: 436 GPAIILGSFQQQNYLVEYDLENDRFGFAKKKCS 468


>gi|302783200|ref|XP_002973373.1| hypothetical protein SELMODRAFT_98841 [Selaginella moellendorffii]
 gi|300159126|gb|EFJ25747.1| hypothetical protein SELMODRAFT_98841 [Selaginella moellendorffii]
          Length = 389

 Score =  115 bits (287), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 104/362 (28%), Positives = 168/362 (46%), Gaps = 37/362 (10%)

Query: 145 NMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSG 204
           N T+ VD+G   +WV C    +       +F P +S S+ K+ C S +C A    + + G
Sbjct: 13  NFTLAVDSG--FSWVACSSSCAINCTTASLFQPGLSTSHTKLPCGSPSCSAFSAVSTSCG 70

Query: 205 VCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL----GKASVNDFIFGCGRNNKGLFG-- 258
             SS     C+Y  SYG    + G+L  +   +     +    +   GCGR++ GL    
Sbjct: 71  PSSS-----CSYNTSYGTNFSSAGDLVSDIATMDSVRNRKVAANLSLGCGRDSGGLLELL 125

Query: 259 GVSGLMGLGRSDLSLVSQTSEI-FGGLFSYCLPSTQDAGASGSLILGG----NSSVFKNS 313
             SG +G  + ++S + Q S + +   F YCLPS       G L++G     N+S+   S
Sbjct: 126 DTSGFVGFDKGNVSFMGQLSALGYRSKFIYCLPSDT---FRGKLVIGNYKLRNASI---S 179

Query: 314 TPITYTNMIPNPQLATFYILNLTGISIGGKQLQA--SGF---AKGGILIDSGTVITRLPP 368
           + + YT MI NPQ A  Y +NL+ ISI   + Q    GF     GG +ID+ T ++ L  
Sbjct: 180 SSMAYTPMITNPQAAELYFINLSTISIDKNKFQVPIQGFLSNGTGGTVIDTTTFLSYLTS 239

Query: 369 SIYSALKAEFLKQFSGFPSAPGFSILDT-----CFNLSAYQEVNIP-LVKMEFEGNAEMT 422
             Y+ L  + +K ++        S+ D      C+N+SA  +   P  +   F G A + 
Sbjct: 240 DFYTQL-VQAIKNYTTNLVEVSSSVADALGVELCYNISANSDFPPPATLTYHFLGGAGVE 298

Query: 423 VDVTGIVYFVKSDASQVCLALA-SLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
           V    ++    S  + +C+A+  S S      +IG YQQ +  V YD +  + GF  + C
Sbjct: 299 VSTWFLLDDSDSVNNTICMAIGRSESVGPNLNVIGTYQQLDLTVEYDLEQMRYGFGAQGC 358

Query: 482 SS 483
           ++
Sbjct: 359 NT 360


>gi|357142295|ref|XP_003572524.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
           distachyon]
          Length = 494

 Score =  114 bits (286), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 118/406 (29%), Positives = 174/406 (42%), Gaps = 64/406 (15%)

Query: 117 VSNTEIPLT-SGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQD- 172
           ++  ++PL  +GI   T  Y   I +G   +   V VDTGSD+ WV C  C SC  +   
Sbjct: 70  LTAVDLPLGGNGIPTDTGLYFTQIGIGTPSKGYYVQVDTGSDILWVNCISCDSCPRKSGL 129

Query: 173 ----PVFDPSISPSYKKVLCNSSTCHALEFATGNSGV---CSSSSPPDCNYFVSYGDGSY 225
                ++DP+ S S K V C    C        N GV   C+++SP  C Y ++YGDGS 
Sbjct: 130 GIDLTLYDPTASASSKTVTCGQEFCATAT----NGGVPPSCAANSP--CQYSITYGDGSS 183

Query: 226 TRGEL-----------GREHLGLGKASVNDFIFGCGRNNKGLFG----GVSGLMGLGRSD 270
           T G             G     L  ASV    FGCG    G  G     + G++G G+++
Sbjct: 184 TTGFFVADFLQYDQVSGDGQTNLANASVT---FGCGAKIGGALGSSNVALDGILGFGQAN 240

Query: 271 LSLVSQTSEI--FGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLA 328
            S++SQ +       +FS+CL    D    G +   GN    K  T    T ++P     
Sbjct: 241 SSMLSQLTSAGKVTKIFSHCL----DTVNGGGIFAIGNVVQPKVKT----TPLVPG---M 289

Query: 329 TFYILNLTGISIGGKQLQAS------GFAKGGILIDSGTVITRLPPSIYSALKAEFLKQF 382
             Y + L  I +GG  LQ        G    G +IDSGT +  LP  +Y   KA     F
Sbjct: 290 PHYNVVLKTIDVGGSTLQLPTNIFDIGGGSRGTIIDSGTTLAYLPEVVY---KAVLSAVF 346

Query: 383 SGFPSAPGFSILD-TCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCL 441
           S  P     ++ D  CF  S   +   P V   F+G+  + V      Y  ++     C+
Sbjct: 347 SNHPDVTLKNVQDFLCFQYSGSVDNGFPEVTFHFDGDLPLVVYPHD--YLFQNTEDVYCV 404

Query: 442 ALASLSYEDETG----IIGNYQQKNQRVIYDTKNSQLGFAGEDCSS 483
              S   + + G    ++G+    N+ V+YD +N  +G+   +CSS
Sbjct: 405 GFQSGGVQSKDGKDMVLLGDLALSNKLVVYDLENQVIGWTNYNCSS 450


>gi|88174569|gb|ABD39359.1| chloroplast nucleoid DNA-binding protein [Oryza nivara]
          Length = 321

 Score =  114 bits (286), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 94/326 (28%), Positives = 157/326 (48%), Gaps = 30/326 (9%)

Query: 135 YIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
           Y+ ++ LG   +   V +DTGS  TWV C+ C  C+      F  S S +  KV C +S 
Sbjct: 1   YVISVGLGTPAKTQIVEIDTGSSTTWVFCE-CDGCHTNPR-TFLQSRSTTCAKVSCGTSM 58

Query: 193 CHALEFATGNSGVCSSSSP-PDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCG 250
           C       G+   C  S   PDC + VSY DGS + G L ++ L       +  F FGC 
Sbjct: 59  C----LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPSFTFGCN 114

Query: 251 RNNKGL--FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQD-----AGASGSLIL 303
            ++ G   FG V GL+G+G   +S++ Q+S  F G FSYCLP  +      +  +G   L
Sbjct: 115 LDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDG-FSYCLPLQKSERGFFSKTTGYFSL 173

Query: 304 GGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQ--LQASGFAKGGILIDSGT 361
           G  ++     T + YT M+   +    + ++L  IS+ G++  L  S F++ G++ DSG+
Sbjct: 174 GKVAT----RTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDSGS 229

Query: 362 VITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEM 421
            ++ +P    S L ++ +++      A        C+++ +  E ++P + + F+  A  
Sbjct: 230 ELSYIPDRALSVL-SQRIRELLLRRGAAEEESERNCYDMRSVDEGDMPAISLHFDDGARF 288

Query: 422 TVDVTGIVYFVKSDASQ---VCLALA 444
            +   G+  FV+    +    CLA A
Sbjct: 289 DLGSRGV--FVERSVQEQDVWCLAFA 312


>gi|356528623|ref|XP_003532899.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 507

 Score =  114 bits (286), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 124/516 (24%), Positives = 197/516 (38%), Gaps = 89/516 (17%)

Query: 42  LQWQQKSGSSS--SCVSHQKSRIEMGAITLELKHKNY----CSGKIVDWNEQQQNRLILD 95
           +QW   + +S   +   H    + + ++ LEL H+++      G  VD  E  +  +  D
Sbjct: 6   MQWNTITKASILITITLHLILPVAVNSMRLELVHRHHERFSGGGGDVDQVEAVKGFVNRD 65

Query: 96  NLHVQYLQSRI------KNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GRNMT 147
            L  Q +  R       +          +  E+P+ +G       Y   +++G  G+   
Sbjct: 66  GLRRQRMNQRWGVSNYDRRRKGLETTTTTEVEMPMRAGRDDALGEYFTEVKVGSPGQRFW 125

Query: 148 VIVDTGSDLTWVQC---------------------------------------------Q 162
           +  DTGS+ TW  C                                              
Sbjct: 126 LAADTGSEFTWFNCVMRNATTTATTKKTRKNKTKKKHHHHSKRNRTRTTRRTKKKKAKSN 185

Query: 163 PCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGD 222
           PCK        VF P  S S++ V C S  C        +  +C   S P C Y +SY D
Sbjct: 186 PCKG-------VFCPHRSKSFQAVTCASQKCKIDLSQLFSLSLCPKPSDP-CLYDISYAD 237

Query: 223 GSYTRGELGREHLGLG-----KASVNDFIFGCGR---NNKGLFGGVSGLMGLGRSDLSLV 274
           GS  +G  G + + +      +  +N+   GC +   N         G++GLG +  S +
Sbjct: 238 GSSAKGFFGTDTITVDLKNGKEGKLNNLTIGCTKSMENGVNFNEDTGGILGLGFAKDSFI 297

Query: 275 SQTSEIFGGLFSYCL-PSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYIL 333
            + +  +G  FSYCL         S  L +GG+ +  K    I  T +I  P    FY +
Sbjct: 298 DKAAYEYGAKFSYCLVDHLSHRNVSSYLTIGGHHNA-KLLGEIKRTELILFP---PFYGV 353

Query: 334 NLTGISIGGKQL----QASGF-AKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSA 388
           N+ GISIGG+ L    Q   F ++GG LIDSGT +T L    Y  +    +K  +     
Sbjct: 354 NVVGISIGGQMLKIPPQVWDFNSQGGTLIDSGTTLTALLVPAYEPVFEALIKSLTKVKRV 413

Query: 389 PG--FSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASL 446
            G  F  LD CF+   + +  +P +   F G A     V    Y +       C+ +  +
Sbjct: 414 TGEDFGALDFCFDAEGFDDSVVPRLVFHFAGGARFEPPVKS--YIIDVAPLVKCIGIVPI 471

Query: 447 SYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
                  +IGN  Q+N    +D   + +GFA   C+
Sbjct: 472 DGIGGASVIGNIMQQNHLWEFDLSTNTIGFAPSICT 507


>gi|194702702|gb|ACF85435.1| unknown [Zea mays]
 gi|414885969|tpg|DAA61983.1| TPA: hypothetical protein ZEAMMB73_231717 [Zea mays]
          Length = 163

 Score =  114 bits (286), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 60/160 (37%), Positives = 92/160 (57%), Gaps = 4/160 (2%)

Query: 326 QLATFYILNLTGISIGGKQLQA--SGFA-KGGILIDSGTVITRLPPSIYSALKAEFLKQF 382
           Q  +FY LNLTGI++ G+ ++   S FA   G +IDSGT  + LPPS Y+AL++      
Sbjct: 5   QHPSFYYLNLTGITVAGRAIKVPPSVFATAAGTIIDSGTAFSCLPPSAYAALRSSVRSAM 64

Query: 383 SGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLA 442
             +  AP  +I DTC++L+ ++ V IP V + F   A + +  +G++Y   S+ SQ CLA
Sbjct: 65  GRYKRAPSSTIFDTCYDLTGHETVRIPSVALVFADGATVHLHPSGVLY-TWSNVSQTCLA 123

Query: 443 LASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
                 +   G++GN QQ+   VIYD  N ++GF    C+
Sbjct: 124 FLPNPDDTSLGVLGNTQQRTLAVIYDVDNQKVGFGANGCA 163


>gi|295830689|gb|ADG39013.1| AT5G10770-like protein [Neslia paniculata]
          Length = 159

 Score =  114 bits (286), Expect = 9e-23,   Method: Compositional matrix adjust.
 Identities = 70/165 (42%), Positives = 100/165 (60%), Gaps = 8/165 (4%)

Query: 271 LSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATF 330
           LS  SQT+  +  +FSYCLPS+  A  +G L  G  S+    S   T  + I +    +F
Sbjct: 1   LSFPSQTATAYNKIFSYCLPSS--ASYTGHLTFG--SAGISRSVKFTPISTITDGT--SF 54

Query: 331 YILNLTGISIGGKQLQ--ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSA 388
           Y L++  I++GG++L   ++ F+  G LIDSGTVITRLPP  Y+AL++EF  + S +P+ 
Sbjct: 55  YGLSIVAITVGGQKLPIPSTVFSTPGALIDSGTVITRLPPKAYAALRSEFKAKMSKYPTT 114

Query: 389 PGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVK 433
            G SILDTCF+LS ++ V IP V   F G A + +   GI+Y  K
Sbjct: 115 SGVSILDTCFDLSGFKTVTIPKVAFSFSGGAVVELGSKGILYAFK 159


>gi|326511104|dbj|BAK03251.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 631

 Score =  114 bits (285), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 103/370 (27%), Positives = 158/370 (42%), Gaps = 44/370 (11%)

Query: 135 YIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCN-SS 191
           Y   + +G   +   +IVD+GS +T+V C  C+ C N QDP F P +S +Y  V CN   
Sbjct: 88  YTTRLHIGTPPQEFALIVDSGSTVTYVPCASCEQCGNHQDPRFQPDLSSTYSPVKCNVDC 147

Query: 192 TCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS---VNDFIFG 248
           TC               S    C Y   Y + S + G LG + +  G  S       +FG
Sbjct: 148 TC--------------DSDKNQCTYERQYAEMSSSSGVLGEDIVSFGTESELKPQRAVFG 193

Query: 249 CGRNNKG-LFG-GVSGLMGLGRSDLSLVSQTSE--IFGGLFSYCLPSTQDAGASGSLILG 304
           C  +  G LF     G+MGLGR  LS++ Q  +  + G  FS C       G  G+++LG
Sbjct: 194 CENSETGDLFSQHADGIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGGMDIGG--GAMVLG 251

Query: 305 GNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGF---AKGGILIDSGT 361
              +        T++N + +P    +Y + L  + + GK L+        K G ++DSGT
Sbjct: 252 AMPA--PPGMIYTHSNAVRSP----YYNIELKEMHVAGKALRVDPRIFDGKHGTVLDSGT 305

Query: 362 VITRLPPSIYSALKAEFLKQFSGFPS--APGFSILDTCF-----NLSAYQEVNIPLVKME 414
               LP   + A K     Q         P  +  D CF     N+S   EV  P V M 
Sbjct: 306 TYAYLPEQAFVAFKDAVSSQVHPLKKIRGPDSNYKDICFAGAGRNVSQLSEV-FPKVDMV 364

Query: 415 FEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQL 474
           F    ++++     ++         CL +   + +D T ++G    +N  V YD  N ++
Sbjct: 365 FGNGQKLSLSPENYLFRHSKVEGAYCLGVFQ-NGKDPTTLLGGIVVRNTLVTYDRHNEKI 423

Query: 475 GFAGEDCSSM 484
           GF   +CS +
Sbjct: 424 GFWKTNCSEL 433


>gi|226495677|ref|NP_001146995.1| pepsin A precursor [Zea mays]
 gi|195606284|gb|ACG24972.1| pepsin A [Zea mays]
          Length = 504

 Score =  114 bits (285), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 108/406 (26%), Positives = 165/406 (40%), Gaps = 80/406 (19%)

Query: 146 MTVIVDTGSDLTWVQCQP--CKSCYNQQDPVFDPSISPS--YKKVLCNSSTCHALEFATG 201
           +++ +DTGSDL W  C P  C  C  +  P     + P    +++ C S  C A   +  
Sbjct: 105 VSLFLDTGSDLVWFPCAPFTCMLCEGKPTPGRSGPLPPPPDSRRIPCASPLCSAAHASAP 164

Query: 202 NSGVCSSSSPP-------DCN-------YFVSYGDGSYTRG-ELGREHLGLGK-----AS 241
            S +C+++  P        C         + +YGDGS       GR  LG G       +
Sbjct: 165 PSDLCAAARCPLEDIETGSCGASHACPPLYYAYGDGSLVAHLRRGRVALGAGARASVAVA 224

Query: 242 VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAG----A 297
           V++F F C        G   G+ G GR  LSL  Q S    G FSYCL S          
Sbjct: 225 VDNFTFACAHTA---LGEPVGVAGFGRGPLSLPGQLSPQLSGRFSYCLVSHSFRADRLIR 281

Query: 298 SGSLILGGNSSVFKNSTPIT----YTNMIPNPQLATFYILNLTGISIGGKQLQASG---- 349
              LILG +      +   T    YT ++ NP+   FY + L  +S+G  ++QA      
Sbjct: 282 PSPLILGRSPDDADAAAAETDGFVYTPLLHNPKHPYFYSVALEAVSVGAARIQARPELAR 341

Query: 350 ---FAKGGILIDSGTVITRLPPSIYSALKA--------------EFLKQFSGFPSAPGFS 392
                 GG+++DSGT  T LP  +Y+ +                E  ++ +G        
Sbjct: 342 VDRAGNGGMVVDSGTTFTMLPNEMYARVAEAFARAMAAAGFARAERAEEQTG-------- 393

Query: 393 ILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV--------CLAL- 443
            L  C+  +A  +  +P + + F GNA + +         KS+ +          CL L 
Sbjct: 394 -LTPCYRYAA-SDRGVPPLALHFRGNATVALPRRNYFMGFKSEDAGAGTRKDDVGCLMLM 451

Query: 444 ----ASLSYED-ETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
               AS    D   G +GN+QQ+   V+YD    ++GFA   C+ +
Sbjct: 452 NGGDASGEEGDGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRCTDL 497


>gi|125579874|gb|EAZ21020.1| hypothetical protein OsJ_36669 [Oryza sativa Japonica Group]
          Length = 382

 Score =  114 bits (285), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 86/238 (36%), Positives = 121/238 (50%), Gaps = 21/238 (8%)

Query: 261 SGLMGLGRSDLSLVSQTSEIFGGLFSYCL-PSTQDAGASGSLILGGNSSVFKNSTPITYT 319
           SGLMGLGR  LSLVSQT       FSYCL P   + GA+G L +G ++S+  +   +T T
Sbjct: 152 SGLMGLGRGRLSLVSQTGAT---KFSYCLTPYFHNNGATGHLFVGASASLGGHGDVMT-T 207

Query: 320 NMIPNPQLATFYILNLTGISIGGKQLQ-----------ASGFAKGGILIDSGTVITRLPP 368
             +  P+ + FY L L G+++G  +L            A G   GG++IDSG+  T L  
Sbjct: 208 QFVKGPKGSPFYYLPLIGLTVGETRLPIPATVFDLREVAPGLFSGGVIIDSGSPFTSLVH 267

Query: 369 SIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVN--IPLVKMEFEGNAEMTVDVT 426
             Y AL +E   + +G   AP     D    + A ++V   +P V   F G A+M V   
Sbjct: 268 DAYDALASELAARLNGSLVAPPPDADDGALCV-ARRDVGRVVPAVVFHFRGGADMAVPAE 326

Query: 427 GIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
              Y+   D +  C+A+AS        +IGNYQQ+N RV+YD  N    F   DCS++
Sbjct: 327 S--YWAPVDKAAACMAIASAGPYRRQSVIGNYQQQNMRVLYDLANGDFSFQPADCSAL 382


>gi|357476337|ref|XP_003608454.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355509509|gb|AES90651.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 683

 Score =  114 bits (284), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 101/359 (28%), Positives = 159/359 (44%), Gaps = 46/359 (12%)

Query: 146 MTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGV 205
             +IVDTGS +T+V C  C+ C   QDP F P +S +Y+ V C       L+    N  +
Sbjct: 94  FALIVDTGSTVTYVPCSTCEQCGRHQDPKFQPDLSSTYQPVKC------TLDCNCDNDRM 147

Query: 206 CSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS---VNDFIFGCGRNNKG-LFG-GV 260
                   C Y   Y + S + G LG + +  G  S       +FGC     G L+    
Sbjct: 148 -------QCVYERQYAEMSTSSGVLGEDVVSFGNQSELAPQRAVFGCENVETGDLYSQHA 200

Query: 261 SGLMGLGRSDLSLVSQTSE--IFGGLFSYCLPSTQDAGASGSLILGGNSS----VFKNST 314
            G+MGLGR DLS++ Q  +  +    FS C     D G  G+++LGG S     VF  S 
Sbjct: 201 DGIMGLGRGDLSIMDQLVDKNVVSDSFSLCY-GGMDVGG-GAMVLGGISPPSDMVFAQSD 258

Query: 315 PITYTNMIPNPQLATFYILNLTGISIGGKQ--LQASGF-AKGGILIDSGTVITRLPPSIY 371
           P+           + +Y ++L  I + GK+  L  S F  K G ++DSGT    LP   +
Sbjct: 259 PVR----------SPYYNIDLKEIHVAGKRLPLNPSVFDGKHGSVLDSGTTYAYLPEEAF 308

Query: 372 SALKAEFLKQFSGFP--SAPGFSILDTCFNLSAYQ----EVNIPLVKMEFEGNAEMTVDV 425
            A K   +K+   F   S P  +  D CF+ +           P+V M F    + ++  
Sbjct: 309 LAFKEAIVKELQSFSQISGPDPNYNDLCFSGAGIDVSQLSKTFPVVDMIFGNGHKYSLSP 368

Query: 426 TGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
              ++         CL +   + +D T ++G    +N  V+YD + +++GF   +C+ +
Sbjct: 369 ENYMFRHSKVRGAYCLGIFQ-NGKDPTTLLGGIVVRNTLVLYDREQTKIGFWKTNCAEL 426


>gi|88174573|gb|ABD39361.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
           Group]
          Length = 321

 Score =  114 bits (284), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 95/326 (29%), Positives = 158/326 (48%), Gaps = 30/326 (9%)

Query: 135 YIATIELGGRNMTVIV--DTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
           Y+ ++ LG  + T IV  DTGS  +WV C+ C  C+      F  S S +  KV C +S 
Sbjct: 1   YVTSVGLGTPSKTQIVEIDTGSSTSWVFCE-CDGCHTNPR-TFLQSRSTTCAKVSCGTSM 58

Query: 193 CHALEFATGNSGVCSSSSP-PDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCG 250
           C       G+   C  S   PDC + VSY DGS + G L ++ L       +  F FGC 
Sbjct: 59  C----LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPSFTFGCN 114

Query: 251 RNNKGL--FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQD-----AGASGSLIL 303
            ++ G   FG V GL+G+G   +S++ Q+S  F G FSYCLP  +      +  +G   L
Sbjct: 115 LDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDG-FSYCLPLQKSERGFFSKTTGYFSL 173

Query: 304 GGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQ--LQASGFAKGGILIDSGT 361
           G  ++     T + YT M+   +    + ++L  IS+ G++  L  S F++ G++ DSG+
Sbjct: 174 GKVAT----RTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDSGS 229

Query: 362 VITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEM 421
            ++ +P    S L ++ +++      A        C+++ +  E ++P + + F+  A  
Sbjct: 230 ELSYIPDRALSVL-SQRIRELLLRRGAAEEESERNCYDMRSVDEGDMPAISLHFDDGARF 288

Query: 422 TVDVTGIVYFVKSDASQ---VCLALA 444
            +   G+  FV+    +    CLA A
Sbjct: 289 DLGSRGV--FVERSVQEQDVWCLAFA 312


>gi|326501422|dbj|BAK02500.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 631

 Score =  114 bits (284), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 103/370 (27%), Positives = 158/370 (42%), Gaps = 44/370 (11%)

Query: 135 YIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCN-SS 191
           Y   + +G   +   +IVD+GS +T+V C  C+ C N QDP F P +S +Y  V CN   
Sbjct: 88  YTTRLHIGTPPQEFALIVDSGSTVTYVPCASCEQCGNHQDPRFQPDLSSTYSPVKCNVDC 147

Query: 192 TCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS---VNDFIFG 248
           TC               S    C Y   Y + S + G LG + +  G  S       +FG
Sbjct: 148 TC--------------DSDKNQCTYERQYAEMSSSSGVLGEDIVSFGTESELKPQRAVFG 193

Query: 249 CGRNNKG-LFG-GVSGLMGLGRSDLSLVSQTSE--IFGGLFSYCLPSTQDAGASGSLILG 304
           C  +  G LF     G+MGLGR  LS++ Q  +  + G  FS C       G  G+++LG
Sbjct: 194 CENSETGDLFSQHADGIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGGMDIGG--GAMVLG 251

Query: 305 GNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGF---AKGGILIDSGT 361
              +        T++N + +P    +Y + L  + + GK L+        K G ++DSGT
Sbjct: 252 AMPA--PPGMIYTHSNAVRSP----YYNIELKEMHVAGKALRVDPRIFDGKHGTVLDSGT 305

Query: 362 VITRLPPSIYSALKAEFLKQFSGFPS--APGFSILDTCF-----NLSAYQEVNIPLVKME 414
               LP   + A K     Q         P  +  D CF     N+S   EV  P V M 
Sbjct: 306 TYAYLPEQAFVAFKDAVSSQVHPLKKIRGPDPNYKDICFAGAGRNVSQLSEV-FPKVDMV 364

Query: 415 FEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQL 474
           F    ++++     ++         CL +   + +D T ++G    +N  V YD  N ++
Sbjct: 365 FGNGQKLSLSPENYLFRHSKVEGAYCLGVFQ-NGKDPTTLLGGIVVRNTLVTYDRHNEKI 423

Query: 475 GFAGEDCSSM 484
           GF   +CS +
Sbjct: 424 GFWKTNCSEL 433


>gi|297827577|ref|XP_002881671.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297327510|gb|EFH57930.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 438

 Score =  114 bits (284), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 111/398 (27%), Positives = 186/398 (46%), Gaps = 58/398 (14%)

Query: 121 EIPLTSGIRL---QTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVF 175
           ++P +S  +L     +    T+ +G   +N+++++DTGS+L+W+ C+   +  +    VF
Sbjct: 44  KLPRSSSDKLSFRHNVTLTVTLAVGSPPQNISMVLDTGSELSWLHCKKSPNLGS----VF 99

Query: 176 DPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPD--CNYFVSYGDGSYTRGELGRE 233
           +P  S +Y  V C+S  C      T +  + +S  P    C+  +SY D +   G L  +
Sbjct: 100 NPVSSSTYSPVPCSSPICRT---RTRDLPIPASCDPKTHFCHVAISYADATSIEGNLAHD 156

Query: 234 HLGLGKASVNDFIFGCGRNNKGLF------GGVSGLMGLGRSDLSLVSQTSEIFGGLFSY 287
              +G  +    +FGC   + GL          +GLMG+ R  LS V+Q   +    FSY
Sbjct: 157 TFVIGSVTRPGTLFGC--MDSGLSSDSEEDAKSTGLMGMNRGSLSFVNQ---LGFSKFSY 211

Query: 288 CLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMI----PNPQLATF-YILNLTGISIGG 342
           C+  +    +SG L+LG  S  +    PI YT ++    P P      Y + L GI +G 
Sbjct: 212 CISGSD---SSGILLLGDAS--YSWLGPIQYTPLVLQTTPLPYFDRVAYTVQLEGIRVGS 266

Query: 343 K--QLQASGFA-----KGGILIDSGTVITRLPPSIYSALKAEFLKQFSG---FPSAPGFS 392
           K   L  S F       G  ++DSGT  T L   +Y+ALK EF+ Q          P F 
Sbjct: 267 KILSLPKSVFVPDHTGAGQTMVDSGTQFTFLMGPVYTALKNEFIAQTKSVLRIVDDPNFV 326

Query: 393 I---LDTCFNLSAYQEVN---IPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASL 446
               +D C+ + +    N   +P++ + F G AEM+V    ++Y V    S+    +   
Sbjct: 327 FQGTMDLCYRVGSSTRPNFTGLPVISLMFRG-AEMSVSGQKLLYRVNGAGSEGKEEVYCF 385

Query: 447 SYED------ETGIIGNYQQKNQRVIYDTKNSQLGFAG 478
           ++ +      E  +IG++ Q+N  + +D   S++GFAG
Sbjct: 386 TFGNSDLLGIEAFVIGHHHQQNVWMEFDLAKSRVGFAG 423


>gi|357491933|ref|XP_003616254.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355517589|gb|AES99212.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 442

 Score =  114 bits (284), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 102/355 (28%), Positives = 151/355 (42%), Gaps = 29/355 (8%)

Query: 148 VIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKK---VLCNSSTCHALEFATGNSG 204
           +++DTGS L+W+QC   K+   +Q P               + CN   C           
Sbjct: 97  MVLDTGSQLSWIQCHNKKTPQKKQPPTTSSFDPSLSSSFFVLPCNHPLCKPRVPDFSLPT 156

Query: 205 VCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCGRNNKGLFGGVSGL 263
            C ++S   C+Y   Y DG+Y  G L RE +    + +    I GC   +        G+
Sbjct: 157 DCDANS--LCHYSYFYADGTYAEGNLVREKIAFSPSQTTPPIILGCATQSDD----ARGI 210

Query: 264 MGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGN--SSVFKNSTPITYTNM 321
           +G+    L   SQ        FSYC+P+ Q   ASGS  LG N  SS F+    +T+   
Sbjct: 211 LGMNLGRLGFPSQAKIT---KFSYCVPTKQAQPASGSFYLGNNPASSSFRYVNLLTFGQS 267

Query: 322 IPNPQLATF-YILNLTGISIGGKQL-------QASGFAKGGILIDSGTVITRLPPSIYSA 373
              P L    Y L L GISIGGK+L       + +    G  +IDSG+  T L    Y+ 
Sbjct: 268 QRMPNLDPLAYTLPLQGISIGGKKLNIPPSVFKPNAGGSGQTMIDSGSEFTYLVDEAYNV 327

Query: 374 LKAEFLKQFSGFPSAPGF---SILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVY 430
           ++ E +K+  G     G+    + D CF+  A  E+   +  M FE    + + +     
Sbjct: 328 IREELVKKV-GPKIKKGYMYGGVADICFDGDAI-EIGRLVGDMVFEFEKGVQIVIPKERV 385

Query: 431 FVKSDASQVCLALASLSYEDETG-IIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
               D    CL +         G IIGN+ Q+N  V +D  N ++GF   DCS +
Sbjct: 386 LATVDGGVHCLGMGRSERLGAGGNIIGNFHQQNLWVEFDLANRRVGFGEADCSKL 440


>gi|255554715|ref|XP_002518395.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
 gi|223542240|gb|EEF43782.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
          Length = 489

 Score =  113 bits (283), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 113/411 (27%), Positives = 178/411 (43%), Gaps = 34/411 (8%)

Query: 95  DNLHVQYLQSRIKNMISGNIKDVSNT-EIPLTSGIRLQTLNYIATIELG---GRNMTVIV 150
           DN   Q + S +++       +VS+T +IP+ SG       Y  +I +G    +   ++ 
Sbjct: 79  DNARRQMISS-LRHGTRRKAFEVSHTAQIPIHSGADSGQSQYFVSIRIGTPRPQKFILVT 137

Query: 151 DTGSDLTWVQCQ-PCKSCYNQQDP----VFDPSISPSYKKVLCNSSTCHALEFATGNSGV 205
           DTGSDLTW+ C+  CKSC  + +P    VF  + S S++ + C+S  C        +   
Sbjct: 138 DTGSDLTWMNCEYWCKSC-PKPNPHPGRVFRANDSSSFRTIPCSSDDCKIELQDYFSLTE 196

Query: 206 CSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG-----KASVNDFIFGCGRNNKGLFGGV 260
           C + + P C +   Y +G    G    E + +G     K  + D + GC  +     G  
Sbjct: 197 CPNPNAP-CLFDYRYLNGPRAIGVFANETVTVGLNDHKKIRLFDVLIGCTESFNETNGFP 255

Query: 261 SGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTN 320
            G+MGLG    SL  + +EIFG  FSYCL     +    + +  G+    K    + +T 
Sbjct: 256 DGVMGLGYRKHSLALRLAEIFGNKFSYCLVDHLSSSNHKNFLSFGDIPEMKLPK-MQHTE 314

Query: 321 MIPNPQLATFYILNLTGISIGGKQLQASG-----FAKGGILIDSGTVITRLPPSIY---- 371
           ++    +  FY +N++GIS+GG  L  S         GG+++DSGT +T L    Y    
Sbjct: 315 LLLG-YINAFYPVNVSGISVGGSMLSISSDIWNVTGVGGMIVDSGTSLTMLAGEAYDKVV 373

Query: 372 SALKAEFLKQFSGFP-SAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVY 430
            ALK  F K     P   P  +  + CF    +    +P + + F   A     V    Y
Sbjct: 374 DALKPIFDKHKKVVPIELPELN--NFCFEDKGFDRAAVPRLLIHFADGAIFKPPVKS--Y 429

Query: 431 FVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
            +       CL +    +   + I+GN  Q+N    YD    +LGF    C
Sbjct: 430 IIDVAEGIKCLGIIKADFPGSS-ILGNVMQQNHLWEYDLGRGKLGFGPSSC 479


>gi|326512066|dbj|BAJ96014.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 485

 Score =  113 bits (283), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 108/398 (27%), Positives = 176/398 (44%), Gaps = 59/398 (14%)

Query: 121 EIPLT-SGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQ-----QD 172
           ++PL  +G+  +T  Y   I +G   ++  V VDTGSD+ WV C  C +C  +     + 
Sbjct: 66  DLPLGGNGLPTETGLYFTQIGIGTPAKSYYVQVDTGSDILWVNCVFCDTCPRKSGLGIEL 125

Query: 173 PVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSP-PDCNYFVSYGDGSYTRGELG 231
            ++DPS S S   V C    C A      + GV  S  P   C Y +SYGDGS T G   
Sbjct: 126 TLYDPSGSSSGTGVTCGQDFCVAT-----HGGVIPSCVPAAPCQYSISYGDGSSTTGFFV 180

Query: 232 REHLGLGKASVN--------DFIFGCGRNNKGLFG----GVSGLMGLGRSDLSLVSQTSE 279
            + L   + S N           FGCG    G  G     + G++G G+S+ S++SQ + 
Sbjct: 181 TDFLQYNQVSGNSQTTLANTSITFGCGAKIGGDLGSSSQALDGILGFGQSNSSMLSQLAA 240

Query: 280 I--FGGLFSYCLPSTQDAG--ASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNL 335
                 +F++CL +    G  A G ++             ++ T ++P       Y +NL
Sbjct: 241 AGKVRKVFAHCLDTINGGGIFAIGDVV----------QPKVSTTPLVPG---MPHYNVNL 287

Query: 336 TGISIGGKQLQAS------GFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAP 389
             I +GG +LQ        G +KG I IDSGT +  LP  +Y+A+ ++   Q+   P   
Sbjct: 288 EAIDVGGVKLQLPTNIFDIGESKGTI-IDSGTTLAYLPGVVYNAIMSKVFAQYGDMPLKN 346

Query: 390 GFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYE 449
                  CF  S   +   P++   FEG   + +     ++    +    C+   +   +
Sbjct: 347 DQDF--QCFRYSGSVDDGFPIITFHFEGGLPLNIHPHDYLF---QNGELYCMGFQTGGLQ 401

Query: 450 DETG----IIGNYQQKNQRVIYDTKNSQLGFAGEDCSS 483
            + G    ++G+    N+ V+YD +N  +G+   +CSS
Sbjct: 402 TKDGKDMVLLGDLAFSNRLVLYDLENQVIGWTDYNCSS 439


>gi|357155293|ref|XP_003577072.1| PREDICTED: LOW QUALITY PROTEIN: probable aspartic protease
           At2g35615-like [Brachypodium distachyon]
          Length = 429

 Score =  113 bits (283), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 110/381 (28%), Positives = 173/381 (45%), Gaps = 36/381 (9%)

Query: 123 PLTSGIRLQTLNYIATIELGGRNMT--VIVDTGSDLTWVQCQPCK-SCYN---QQDPVFD 176
           P+     +    +   I LG   +   V VDTGS L+WV CQ C+ SC+    +   VFD
Sbjct: 63  PVVGNHEIHEGKFFMDISLGTPPVANLVTVDTGSTLSWVVCQRCQISCHTTAPEAGSVFD 122

Query: 177 PSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYG---DGSYTRGELGRE 233
           P  S +Y+ V C+S  C  ++ +      C   +   C Y + YG    G Y+ G LG +
Sbjct: 123 PDKSTTYELVGCSSRDCADVQRSLVAPFGCIEET-DTCLYSLRYGSGPSGQYSAGRLGTD 181

Query: 234 HLGLGKAS--VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTS-EIFGGLFSYCLP 290
            L L  +S  ++ FIFGC  ++    G  SG++G G ++ S  +Q + +     FSYC P
Sbjct: 182 KLTLASSSSIIDGFIFGCSGDDS-FKGYESGVIGFGGANFSFFNQVARQTNYRAFSYCFP 240

Query: 291 STQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQA--S 348
              D  A G L +G           + YTN+IP+    + Y L    + + G +LQ   S
Sbjct: 241 G--DHTAEGFLSIGAYP-----KDELVYTNLIPHFGDRSVYSLQQIDMMVDGNRLQVDQS 293

Query: 349 GFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI----LDTCFNLSAYQ 404
            + K  +++DSGTV T L   ++ A    F K  +    A GF       +TCF  +   
Sbjct: 294 EYTKRMMVVDSGTVDTFLLGPVFDA----FSKAMASAMQAKGFLSDTVGTETCFRPNGGD 349

Query: 405 EV---NIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLAL-ASLSYEDETGIIGNYQQ 460
            V   ++P V+M F G   + +    + + +     ++CLA    ++      I+GN   
Sbjct: 350 SVDSGDLPTVEMRFIGTT-LKLPPENVFHDLLPSHDKICLAFKPDVAGVRNVQILGNKAT 408

Query: 461 KNQRVIYDTKNSQLGFAGEDC 481
            + RV+YD +    GF    C
Sbjct: 409 XSFRVVYDLQAMYFGFQAGAC 429


>gi|88174575|gb|ABD39362.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
           Group]
          Length = 321

 Score =  113 bits (283), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 93/326 (28%), Positives = 157/326 (48%), Gaps = 30/326 (9%)

Query: 135 YIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
           Y+ ++ LG   +   V +DTGS  +WV C+ C  C+      F  S S +  KV C +S 
Sbjct: 1   YVTSVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNPR-TFLQSRSTTCAKVSCGTSM 58

Query: 193 CHALEFATGNSGVCSSSSP-PDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCG 250
           C       G+   C  S   PDC + VSY DGS + G L ++ L       +  F FGC 
Sbjct: 59  C----LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPSFTFGCN 114

Query: 251 RNNKGL--FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQD-----AGASGSLIL 303
            ++ G   FG V GL+G+G   +S++ Q+S  F G FSYCLP  +      +  +G   L
Sbjct: 115 LDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDG-FSYCLPLQKSERGFFSKTTGYFSL 173

Query: 304 GGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQ--LQASGFAKGGILIDSGT 361
           G  ++     T + YT M+   +    + ++L  IS+ G++  L  S F++ G++ DSG+
Sbjct: 174 GKVAT----RTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDSGS 229

Query: 362 VITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEM 421
            ++ +P    S L ++ +++      A        C+++ +  E ++P + + F+  A  
Sbjct: 230 ELSYIPDRALSVL-SQRIRELLLRRGAAEEESERNCYDMRSVDEGDMPAISLHFDDGARF 288

Query: 422 TVDVTGIVYFVKSDASQ---VCLALA 444
            +   G+  FV+    +    CLA A
Sbjct: 289 DLGRHGV--FVERSVQEQDVWCLAFA 312


>gi|88174561|gb|ABD39355.1| chloroplast nucleoid DNA-binding protein [Oryza longistaminata]
          Length = 321

 Score =  113 bits (283), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 95/326 (29%), Positives = 158/326 (48%), Gaps = 30/326 (9%)

Query: 135 YIATIELGGRNMTVI--VDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
           Y+ ++ LG  + T I  +DTGS  +WV C+ C  C+      F  S S +  KV C +S 
Sbjct: 1   YVISVGLGTPSKTQILEIDTGSSTSWVFCE-CDGCHTNPR-TFLQSRSTTCAKVSCGTSM 58

Query: 193 CHALEFATGNSGVCSSSSP-PDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCG 250
           C       G+   C  S   PDC + VSY DGS + G L ++ L       +  F FGC 
Sbjct: 59  C----LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPSFSFGCN 114

Query: 251 RNNKGL--FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQD-----AGASGSLIL 303
            ++ G   FG V GL+G+G   +S++ Q+S  F G FSYCLP         +  +G   L
Sbjct: 115 MDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDG-FSYCLPLQMSERGFFSKTTGYFSL 173

Query: 304 GGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQ--LQASGFAKGGILIDSGT 361
           G  ++     T + YT M+   +    + ++LT IS+ G++  L  S F++ G++ DSG+
Sbjct: 174 GKVAT----RTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSIFSRKGVVFDSGS 229

Query: 362 VITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEM 421
            ++ +P    S L ++ +++      A        C+++ +  E ++P + + F+  A  
Sbjct: 230 ELSYIPDRALSVL-SQRIRELLLRRGAAEEESERNCYDMRSVDEGDMPAISLHFDDGARF 288

Query: 422 TVDVTGIVYFVKSDASQ---VCLALA 444
            +   G+  FV+    +    CLA A
Sbjct: 289 DLGSHGV--FVERSVQEQDVWCLAFA 312


>gi|224030719|gb|ACN34435.1| unknown [Zea mays]
          Length = 216

 Score =  113 bits (283), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 76/222 (34%), Positives = 117/222 (52%), Gaps = 16/222 (7%)

Query: 271 LSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATF 330
           +SL+SQT   + G+FSYCLPS +    SGSL LG      +N   + +T ++ NP   + 
Sbjct: 1   MSLLSQTGSRYNGVFSYCLPSYRSYYFSGSLRLGAAGQP-RN---VRHTPLLTNPHRPSL 56

Query: 331 YILNLTGISIGGK--QLQASGFA-----KGGILIDSGTVITRLPPSIYSALKAEFLKQFS 383
           Y +N+TG+S+G    ++ A  FA       G +IDSGTVITR    +Y+AL+ EF +Q +
Sbjct: 57  YYVNVTGLSVGRTWVKVPAGSFAFDPATGAGTVIDSGTVITRWTAPVYAALREEFRRQVA 116

Query: 384 GFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV-CLA 442
                      DTCFN         P V +  +G  ++T+ +   +  + S A+ + CLA
Sbjct: 117 APSGYTSLGAFDTCFNTDEVAAGGAPPVTLHMDGGVDLTLPMENTL--IHSSATPLACLA 174

Query: 443 LASLS--YEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
           +A           ++ N QQ+N RV+ D   S++GFA E C+
Sbjct: 175 MAEAPQNVNAVVNVVANLQQQNVRVVVDVAGSRVGFAREPCN 216


>gi|222640709|gb|EEE68841.1| hypothetical protein OsJ_27628 [Oryza sativa Japonica Group]
          Length = 375

 Score =  113 bits (283), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 102/353 (28%), Positives = 155/353 (43%), Gaps = 48/353 (13%)

Query: 148 VIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCS 207
           +IVDTGSDL W QC+   S          P    +  +    + TC A   A G   V +
Sbjct: 55  LIVDTGSDLIWTQCKLSSSTAAAARHGSPPLSRTAPARTGAFTRTCTASAAAVG---VLA 111

Query: 208 SSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLG 267
           S +              +T G      L LG        FGCG  + G   G +G++GL 
Sbjct: 112 SET--------------FTFGARRAVSLRLG--------FGCGALSAGSLIGATGILGLS 149

Query: 268 RSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNST--PITYTNMIPNP 325
              LSL++Q        FSYCL    D   S  L+ G  + + ++ T  PI  T ++ NP
Sbjct: 150 PESLSLITQLKI---QRFSYCLTPFADKKTS-PLLFGAMADLSRHKTTRPIQTTAIVSNP 205

Query: 326 QLATFYILNLTGISIGGKQLQ--ASGFAK-----GGILIDSGTVITRLPPSIYSALKAEF 378
               +Y + L GIS+G K+L   A+  A      GG ++DSG+ +  L  + + A+K E 
Sbjct: 206 VETVYYYVPLVGISLGHKRLAVPAASLAMRPDGGGGTIVDSGSTVAYLVEAAFEAVK-EA 264

Query: 379 LKQFSGFPSA-PGFSILDTCFNL------SAYQEVNIPLVKMEFEGNAEMTVDVTGIVYF 431
           +      P A       + CF L      +A + V +P + + F+G A M +      YF
Sbjct: 265 VMDVVRLPVANRTVEDYELCFVLPRRTAAAAMEAVQVPPLVLHFDGGAAMVLPRDN--YF 322

Query: 432 VKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
            +  A  +CLA+   +      IIGN QQ+N  V++D ++ +  FA   C  +
Sbjct: 323 QEPRAGLMCLAVGKTTDGSGVSIIGNVQQQNMHVLFDVQHHKFSFAPTQCDQI 375


>gi|242086418|ref|XP_002443634.1| hypothetical protein SORBIDRAFT_08g022645 [Sorghum bicolor]
 gi|241944327|gb|EES17472.1| hypothetical protein SORBIDRAFT_08g022645 [Sorghum bicolor]
          Length = 486

 Score =  113 bits (283), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 105/358 (29%), Positives = 162/358 (45%), Gaps = 45/358 (12%)

Query: 133 LNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNS 190
           L+YI  +  G   +   V + T    + ++C+PC S  +  +P FD   S ++  V C+S
Sbjct: 149 LDYIVLVSYGSPEQQFPVFLGTNVGTSLLRCKPCASGSDDCNPAFDTLQSSTFAHVPCSS 208

Query: 191 STCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS--VNDFIFG 248
             C            CSSS    C ++  YG      G    + L L  +S  V+DF F 
Sbjct: 209 PDCPV---------NCSSSV---CPFYDLYG---TVGGTFATDVLTLAPSSMAVHDFRFV 253

Query: 249 C-GRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFG-----GLFSYCLPSTQDAGASGSLI 302
           C    +       +G + L R   SL SQ S   G       FSYCLP  Q   + G L 
Sbjct: 254 CMDVESPSPDLPEAGSIDLSRHRNSLPSQLSSSSGIAPTAASFSYCLP--QSRNSQGFLS 311

Query: 303 LGGNSSVFKNS------TPITYTNMIPNPQLATFYILNLTGISIGGKQLQ--ASGFAKGG 354
           LGG+++V  +        P+ + N   +P LA+ Y ++L G+S+GG+ L   +  F    
Sbjct: 312 LGGDATVVGDDDNLTVHAPMVWNN---DPDLASMYFIDLVGMSLGGEDLPIPSGTFGNAS 368

Query: 355 ILIDSGTVITRLPPSIYSALKAEFLKQFSGF---PSAPGFSILDTCFNLSAYQEVNIPLV 411
             +D G   T L P  Y+ L+  F K+ S +    S  GF   DTCFN +   E+ +PLV
Sbjct: 369 TNLDVGATFTMLAPEAYTTLRDAFRKEMSQYNNRSSPAGFDGFDTCFNFTGLNELVVPLV 428

Query: 412 KMEFEGNAEMTVDVTGIVYFVKSDA---SQVCLALASLSYEDE-TGIIGNYQQKNQRV 465
           +++F     + +D   ++Y+    A   +  CLA +SL   D  + +IG Y   +  V
Sbjct: 429 QLKFSNGESLMIDGDQMLYYHDPAAGPFTMACLAFSSLDVGDSFSAVIGTYTLASTEV 486


>gi|88174587|gb|ABD39368.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
           Group]
          Length = 321

 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 95/326 (29%), Positives = 159/326 (48%), Gaps = 30/326 (9%)

Query: 135 YIATIELGGRNMTVIV--DTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
           Y+ ++ LG  + T IV  DTGS  +WV C+ C  C+      F  S S +  KV C +S 
Sbjct: 1   YVISVGLGTPSKTQIVEIDTGSSASWVFCE-CDGCHTNPR-TFLQSRSTTCAKVSCGTSM 58

Query: 193 CHALEFATGNSGVC-SSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCG 250
           C       G+   C  S + PDC + VSY DGS + G L ++ L       +  F FGC 
Sbjct: 59  C----LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPSFTFGCN 114

Query: 251 RNNKGL--FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQD-----AGASGSLIL 303
            ++ G   FG V GL+G+G   +S++ Q+S  F G FSYCLP  +      +  +G   L
Sbjct: 115 LDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDG-FSYCLPLQKSERGFFSKTTGYFSL 173

Query: 304 GGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQ--LQASGFAKGGILIDSGT 361
           G  ++     T + YT M+   +    + ++L  IS+ G++  L  S F++ G++ DSG+
Sbjct: 174 GKVAT----RTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDSGS 229

Query: 362 VITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEM 421
            ++ +P    S L ++ +++      A        C+++ +  E ++P + + F+  A  
Sbjct: 230 ELSYIPDRALSVL-SQRIRELLLRRGAAEEESERNCYDMRSVDEGDMPAISLHFDDGARF 288

Query: 422 TVDVTGIVYFVKSDASQ---VCLALA 444
            +   G+  FV+    +    CLA A
Sbjct: 289 DLGSHGV--FVERSVQEQDVWCLAFA 312


>gi|225440722|ref|XP_002275223.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
 gi|147841923|emb|CAN65212.1| hypothetical protein VITISV_039022 [Vitis vinifera]
          Length = 458

 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 106/374 (28%), Positives = 165/374 (44%), Gaps = 49/374 (13%)

Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYN------QQDPVFDPSISPSYKKVLCNSSTCH--- 194
           + ++ +VDTGS + W  C    +C N      ++ P+F+P +S S K + C    C    
Sbjct: 98  QKLSFLVDTGSHVVWAPCTTHYTCTNCSFSNPKKVPIFNPELSSSDKILGCRDPKCANTS 157

Query: 195 ------ALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFG 248
                       GNS  CS + P    Y + YG G+   G    E+L     +++ F+ G
Sbjct: 158 SPDVHLGCPRCNGNSKKCSHACP---QYTLQYGTGA-ASGFFLLENLDFPGKTIHKFLVG 213

Query: 249 CGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQ--DAGASGSLILGGN 306
           C   +         L G GR+  SL  Q        F+YCL S    D   SG LIL  +
Sbjct: 214 C-TTSADREPSSDALAGFGRTMFSLPMQMGV---KKFAYCLNSHDYDDTRNSGKLILDYS 269

Query: 307 SSVFKNSTPITYTNMIPNPQLATFYI-LNLTGISIGGKQLQASGF-------AKGGILID 358
               +    ++Y   + NP    FY  L +  + IG K L+  G        ++GG++ID
Sbjct: 270 DGETQG---LSYAPFLKNPPDYPFYYYLGVKDMKIGNKLLRIPGKYLTPGSDSRGGVMID 326

Query: 359 SGTVITRLPPSIYSALKAEFLKQFSGFP---SAPGFSILDTCFNLSAYQEVNIPLVKMEF 415
           SG     +   ++  +  E  KQ S +     A   S L  C+N + ++ + IP +  +F
Sbjct: 327 SGFAYGYMTLPVFKIVTNELKKQMSKYRRSLEAETQSGLTPCYNFTGHKSIKIPDLIYQF 386

Query: 416 EGNAEMTVDVTGIVYFVK-SDASQVCLALASLSYEDE-------TGIIGNYQQKNQRVIY 467
            G A M V   G+ YF+  S+AS  C  + + S  +        + I+GNYQQ +  V +
Sbjct: 387 TGGANMVVP--GMNYFLLFSEASLGCFPVTTDSPTNNLEFTPGPSIILGNYQQVDHYVEF 444

Query: 468 DTKNSQLGFAGEDC 481
           D KN +LGF  + C
Sbjct: 445 DLKNERLGFRQQTC 458


>gi|88174597|gb|ABD39373.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
 gi|88174601|gb|ABD39375.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
 gi|88174603|gb|ABD39376.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
          Length = 321

 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 93/326 (28%), Positives = 158/326 (48%), Gaps = 30/326 (9%)

Query: 135 YIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
           Y+ ++ LG   +   V +DTGS  +WV C+ C  C+      F  S S +  KV C +S 
Sbjct: 1   YVISVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNPR-TFLQSRSTTCAKVSCGTSM 58

Query: 193 CHALEFATGNSGVC-SSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCG 250
           C       G+   C  S + PDC + VSY DGS + G L ++ L       +  F FGC 
Sbjct: 59  C----LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPSFTFGCN 114

Query: 251 RNNKGL--FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQD-----AGASGSLIL 303
            ++ G   FG V GL+G+G   +S++ Q+S  F G FSYCLP  +      +  +G   L
Sbjct: 115 LDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDG-FSYCLPLQKSERGFFSKTTGYFSL 173

Query: 304 GGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQ--LQASGFAKGGILIDSGT 361
           G  ++     T + YT M+   +    + ++L  IS+ G++  L  S F++ G++ DSG+
Sbjct: 174 GKVAT----RTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDSGS 229

Query: 362 VITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEM 421
            ++ +P    S L ++ +++      A        C+++ +  E ++P + + F+  A  
Sbjct: 230 ELSYIPDRALSVL-SQRIRELLLRRGAAEEESERNCYDMRSVDEGDMPAISLHFDDGARF 288

Query: 422 TVDVTGIVYFVKSDASQ---VCLALA 444
            +   G+  FV+    +    CLA A
Sbjct: 289 DLGSHGV--FVERSVQEQDVWCLAFA 312


>gi|88174589|gb|ABD39369.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
          Length = 321

 Score =  113 bits (282), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 93/326 (28%), Positives = 158/326 (48%), Gaps = 30/326 (9%)

Query: 135 YIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
           Y+ ++ LG   +   V +DTGS  +WV C+ C  C+      F  S S +  KV C +S 
Sbjct: 1   YVISVGLGTPAKTQIVEIDTGSSASWVFCE-CDGCHTNPR-TFLQSRSTTCAKVSCGTSM 58

Query: 193 CHALEFATGNSGVC-SSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCG 250
           C       G+   C  S + PDC + VSY DGS + G L ++ L       +  F FGC 
Sbjct: 59  C----LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPSFTFGCN 114

Query: 251 RNNKGL--FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQD-----AGASGSLIL 303
            ++ G   FG V GL+G+G   +S++ Q+S  F G FSYCLP  +      +  +G   L
Sbjct: 115 LDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDG-FSYCLPLQKSERGFFSKTTGYFSL 173

Query: 304 GGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQ--LQASGFAKGGILIDSGT 361
           G  ++     T + YT M+   +    + ++L  IS+ G++  L  S F++ G++ DSG+
Sbjct: 174 GKVAT----RTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDSGS 229

Query: 362 VITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEM 421
            ++ +P    S L ++ +++      A        C+++ +  E ++P + + F+  A  
Sbjct: 230 ELSYIPDRALSVL-SQRIRELLLRRGAAEEESERNCYDMRSVDEGDMPAISLHFDDGARF 288

Query: 422 TVDVTGIVYFVKSDASQ---VCLALA 444
            +   G+  FV+    +    CLA A
Sbjct: 289 DLGSHGV--FVERSVQEQDVWCLAFA 312


>gi|242117573|dbj|BAH80056.1| hypothetical protein [Oryza sativa Indica Group]
          Length = 469

 Score =  112 bits (281), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 102/352 (28%), Positives = 167/352 (47%), Gaps = 38/352 (10%)

Query: 149 IVDTGSDLTWVQCQPCKSCYNQQDP---VFDPSISPSYKKVLCNSSTCHAL--EFATGNS 203
           +VD  S   W QC PC +      P    F P+ S ++  + C+S  C  +  E      
Sbjct: 105 LVDITSYFVWAQCAPCAAAAGCLPPPATAFRPNGSATFSPLPCSSDMCLPVLRETCGRAG 164

Query: 204 GVCSSSSPPDCN-YFVSYG-DGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVS 261
              ++++   C+ Y ++YG   + T G L  +    G  +V   +FGC   + G F G S
Sbjct: 165 AAANATAGARCDSYSLTYGGSAANTSGYLATDTFTFGATAVPGVVFGCSDASYGDFAGAS 224

Query: 262 GLMGLGRSDLSLVSQTSEIFGGLFSYCL--PSTQDAGASGSLILGGNSSVFK----NSTP 315
           G++G+GR +LSL+SQ      G FSY L  P   D G++ S+I  G+ +V K     STP
Sbjct: 225 GVIGIGRGNLSLISQLQF---GKFSYQLLAPEATDDGSADSVIRFGDDAVPKTKRGRSTP 281

Query: 316 ITYTNMIPNPQLATFYILNLTGISIGGKQLQA--------SGFAKGGILIDSGTVITRLP 367
           +  + + P+     FY +NLTG+ + G +L A             GG+++ S T +T L 
Sbjct: 282 LLSSTLYPD-----FYYVNLTGVRVDGNRLDAIPAGTFDLRANGTGGVILSSTTPVTYLE 336

Query: 368 PSIYSALKAEFLKQFSGFPSAPGFSI--LDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDV 425
            + Y  ++A    +  G P+  G +   LD C+N S+  +V +P + + F+G A+M  D+
Sbjct: 337 QAAYDVVRAAVASRI-GLPAVNGSAALELDLCYNASSMAKVKVPKLTLVFDGGADM--DL 393

Query: 426 TGIVYF-VKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGF 476
           +   YF + +D    CL +          ++G   Q    +IYD    +L F
Sbjct: 394 SAANYFYIDNDTGLECLTMLP---SQGGSVLGTLLQTGTNMIYDVDAGRLTF 442


>gi|125536523|gb|EAY83011.1| hypothetical protein OsI_38231 [Oryza sativa Indica Group]
          Length = 469

 Score =  112 bits (281), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 93/323 (28%), Positives = 156/323 (48%), Gaps = 35/323 (10%)

Query: 175 FDPSISPSYKKVLCNSSTCHAL--EFATGNSGVCSSSSPPDCN-YFVSYG-DGSYTRGEL 230
           F P+ S ++  + C+S  C  +  E         ++++   C+ Y ++YG   + T G L
Sbjct: 134 FRPNGSATFSPLPCSSDMCLPVLRETCGRAGAAANATAGARCDSYSLTYGGSAANTSGYL 193

Query: 231 GREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCL- 289
             +    G  +V   +FGC   + G F G SG++G+GR +LSL+SQ      G FSY L 
Sbjct: 194 ATDTFTFGATAVPGVVFGCSDASYGDFAGASGVIGIGRGNLSLISQLQF---GKFSYQLL 250

Query: 290 -PSTQDAGASGSLILGGNSSVFKN----STPITYTNMIPNPQLATFYILNLTGISIGGKQ 344
            P   D G++ S+I  G+ +V K     STP+  + + P+     FY +NLTG+ + G +
Sbjct: 251 APEATDDGSADSVIRFGDDAVPKTKRGQSTPLLSSTLYPD-----FYYVNLTGVRVDGNR 305

Query: 345 LQA--------SGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI--L 394
           L A             GG+++ S T +T L  + Y  ++A    +  G P+  G +   L
Sbjct: 306 LDAIPAGTFDLRANGTGGVILSSTTPVTYLEQAAYDVVRAAVASRI-GLPAVNGSAALEL 364

Query: 395 DTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYF-VKSDASQVCLALASLSYEDETG 453
           D C+N S+  +V +P + + F+G A+M  D++   YF + +D    CL +          
Sbjct: 365 DLCYNASSMAKVKVPKLTLVFDGGADM--DLSAANYFYIDNDTGLECLTMLP---SQGGS 419

Query: 454 IIGNYQQKNQRVIYDTKNSQLGF 476
           ++G   Q    +IYD    +L F
Sbjct: 420 VLGTLLQTGTNMIYDVDAGRLTF 442


>gi|88174581|gb|ABD39365.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
           Group]
          Length = 321

 Score =  112 bits (281), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 93/326 (28%), Positives = 157/326 (48%), Gaps = 30/326 (9%)

Query: 135 YIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
           Y+ ++ LG   +   V +DTGS  +WV C+ C  C+      F  S S +  KV C +S 
Sbjct: 1   YVISVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNPR-TFLQSRSTTCAKVSCGTSM 58

Query: 193 CHALEFATGNSGVCSSSSP-PDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCG 250
           C       G+   C  S   PDC + VSY DGS + G L ++ L       +  F FGC 
Sbjct: 59  C----LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPSFTFGCN 114

Query: 251 RNNKGL--FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQD-----AGASGSLIL 303
            ++ G   FG V GL+G+G   +S++ Q+S  F G FSYCLP  +      +  +G   L
Sbjct: 115 LDSFGANEFGNVDGLLGMGAGPMSVLKQSSPRFDG-FSYCLPLQKSERGFFSKTTGYFSL 173

Query: 304 GGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQ--LQASGFAKGGILIDSGT 361
           G  ++     T + YT M+   +    + ++L  IS+ G++  L  S F++ G++ DSG+
Sbjct: 174 GKVAT----RTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDSGS 229

Query: 362 VITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEM 421
            ++ +P    S L ++ +++      A        C+++ +  E ++P + + F+  A  
Sbjct: 230 ELSYIPDRALSVL-SQRIRELLLRRGAAEEESERNCYDMRSVDEGDMPAISLHFDDGARF 288

Query: 422 TVDVTGIVYFVKSDASQ---VCLALA 444
            +   G+  FV+    +    CLA A
Sbjct: 289 DLGRRGV--FVERSVQEQDVWCLAFA 312


>gi|226508052|ref|NP_001150337.1| LOC100283967 precursor [Zea mays]
 gi|195638522|gb|ACG38729.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
          Length = 507

 Score =  112 bits (281), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 113/387 (29%), Positives = 173/387 (44%), Gaps = 62/387 (16%)

Query: 135 YIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSC-----YNQQDPVFDPSISPSYKKVL 187
           Y   ++LG   +   V +DTGSD+ WV C PC  C      N Q   F+P  S +  ++ 
Sbjct: 89  YFTRVKLGNPAKEFFVQIDTGSDILWVTCSPCTGCPTSSGLNIQLESFNPDSSSTASRIT 148

Query: 188 CNSSTCHALEFATGNSGVC----SSSSPPDCNYFVSYGDGSYTRGE-----------LGR 232
           C+   C A  F TG + +C    S SSP  C Y  +YGDGS T G            +G 
Sbjct: 149 CSDDRCTA-GFQTGEA-ICQTSNSQSSP--CGYTFTYGDGSGTSGYYVSDTMFFETVMGN 204

Query: 233 EHLGLGKASVNDFIFGCGRNNKGLFG----GVSGLMGLGRSDLSLVSQTSE--IFGGLFS 286
           E      AS+   +FGC  +  G        V G+ G G+  LS++SQ +   +   +FS
Sbjct: 205 EQTANSSASI---VFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFS 261

Query: 287 YCLPSTQDAGASGSLILGGNSSVFKNSTP-ITYTNMIPNPQLATFYILNLTGISIGGKQL 345
           +CL  + + G  G L+LG      +   P + YT ++P+      Y LNL  I++ G++L
Sbjct: 262 HCLKGSDNGG--GILVLG------EIVEPGLVYTPLVPS---QPHYNLNLESIAVNGQKL 310

Query: 346 --QASGFAKG---GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGF-SILDTCFN 399
              +S F      G ++DSGT +  L    Y    +      S  PS     S    CF 
Sbjct: 311 PIDSSLFTTSNTQGTIVDSGTTLAYLADGAYDPFVSAIAAAVS--PSVRSLVSKGSQCFI 368

Query: 400 LSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETG----II 455
            S+  + + P V + F G   M+V       ++   AS     L  + ++   G    I+
Sbjct: 369 TSSSVDSSFPTVTLYFMGGVAMSVKPEN---YLLQQASVDNSVLWCIGWQRNQGQEITIL 425

Query: 456 GNYQQKNQRVIYDTKNSQLGFAGEDCS 482
           G+   K++  +YD  N ++G+A  DCS
Sbjct: 426 GDLVLKDKIFVYDLANMRMGWADYDCS 452


>gi|88174583|gb|ABD39366.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
           Group]
          Length = 321

 Score =  112 bits (281), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 95/326 (29%), Positives = 158/326 (48%), Gaps = 30/326 (9%)

Query: 135 YIATIELGGRNMTVIV--DTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
           Y+ ++ LG  + T IV  DTGS  +WV C+ C  C+      F  S S +  KV C +S 
Sbjct: 1   YVISVGLGTPSKTQIVEIDTGSSTSWVFCE-CDGCHTNPR-TFLQSRSTTCAKVSCGTSM 58

Query: 193 CHALEFATGNSGVCSSSSP-PDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCG 250
           C       G+   C  S   PDC + VSY DGS + G L ++ L       +  F FGC 
Sbjct: 59  C----LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPSFTFGCN 114

Query: 251 RNNKGL--FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQD-----AGASGSLIL 303
            ++ G   FG V GL+G+G   +S++ Q+S  F G FSYCLP  +      +  +G   L
Sbjct: 115 LDSFGANEFGNVDGLLGMGAGPMSVLKQSSPRFDG-FSYCLPLQKSERGFFSKTTGYFSL 173

Query: 304 GGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQ--LQASGFAKGGILIDSGT 361
           G  ++     T + YT M+   +    + ++L  IS+ G++  L  S F++ G++ DSG+
Sbjct: 174 GKVAT----RTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDSGS 229

Query: 362 VITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEM 421
            ++ +P    S L ++ +++      A        C+++ +  E ++P + + F+  A  
Sbjct: 230 ELSYIPDRALSVL-SQRIRELLLRRGAAEEESERNCYDMRSVDEGDMPAISLHFDDGARF 288

Query: 422 TVDVTGIVYFVKSDASQ---VCLALA 444
            +   G+  FV+    +    CLA A
Sbjct: 289 DLGSHGV--FVERSVQEQDVWCLAFA 312


>gi|413952263|gb|AFW84912.1| aspartic proteinase nepenthesin-2 [Zea mays]
          Length = 509

 Score =  112 bits (281), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 113/387 (29%), Positives = 173/387 (44%), Gaps = 62/387 (16%)

Query: 135 YIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSC-----YNQQDPVFDPSISPSYKKVL 187
           Y   ++LG   +   V +DTGSD+ WV C PC  C      N Q   F+P  S +  ++ 
Sbjct: 91  YFTRVKLGNPAKEFFVQIDTGSDILWVTCSPCTGCPTSSGLNIQLESFNPDSSSTASRIT 150

Query: 188 CNSSTCHALEFATGNSGVC----SSSSPPDCNYFVSYGDGSYTRGE-----------LGR 232
           C+   C A  F TG + +C    S SSP  C Y  +YGDGS T G            +G 
Sbjct: 151 CSDDRCTA-GFQTGEA-ICQTSNSQSSP--CGYTFTYGDGSGTSGYYVSDTMFFETVMGN 206

Query: 233 EHLGLGKASVNDFIFGCGRNNKGLFG----GVSGLMGLGRSDLSLVSQTSE--IFGGLFS 286
           E      AS+   +FGC  +  G        V G+ G G+  LS++SQ +   +   +FS
Sbjct: 207 EQTANSSASI---VFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFS 263

Query: 287 YCLPSTQDAGASGSLILGGNSSVFKNSTP-ITYTNMIPNPQLATFYILNLTGISIGGKQL 345
           +CL  + + G  G L+LG      +   P + YT ++P+      Y LNL  I++ G++L
Sbjct: 264 HCLKGSDNGG--GILVLG------EIVEPGLVYTPLVPS---QPHYNLNLESIAVNGQKL 312

Query: 346 --QASGFAKG---GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGF-SILDTCFN 399
              +S F      G ++DSGT +  L    Y    +      S  PS     S    CF 
Sbjct: 313 PIDSSLFTTSNTQGTIVDSGTTLAYLADGAYDPFVSAIAAAVS--PSVRSLVSKGSQCFI 370

Query: 400 LSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETG----II 455
            S+  + + P V + F G   M+V       ++   AS     L  + ++   G    I+
Sbjct: 371 TSSSVDSSFPTVTLYFMGGVAMSVKPEN---YLLQQASVDNSVLWCIGWQRNQGQEITIL 427

Query: 456 GNYQQKNQRVIYDTKNSQLGFAGEDCS 482
           G+   K++  +YD  N ++G+A  DCS
Sbjct: 428 GDLVLKDKIFVYDLANMRMGWADYDCS 454


>gi|449516339|ref|XP_004165204.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 456

 Score =  112 bits (281), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 126/462 (27%), Positives = 200/462 (43%), Gaps = 68/462 (14%)

Query: 51  SSSCVSHQKSRIEMGAITLELKHKNYCSGKIVDWNEQQQNRLILDNL----HVQYLQSRI 106
           SS+ ++ + SR+       +L H+N     + D NE  ++R   +         +L+S+I
Sbjct: 27  SSTLITTKPSRLAT-----KLIHRNSYLHPLYDQNETVEDRSKREQTSSIERFDFLESKI 81

Query: 107 KNMIS-GNIKDVSNTEIPLTSGIRLQTLNYIATIELGGRNMT--VIVDTGSDLTWVQCQP 163
           K + S GN  +  ++ IP   G       ++  + +G   +T  V+VDTGS L WVQC P
Sbjct: 82  KELKSVGN--EARSSLIPFNRGS-----GFLVNLSIGSPPVTQLVVVDTGSSLLWVQCLP 134

Query: 164 CKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDG 223
           C +C+ Q    FDP  S S+K + C     + +     N   C+  +  +  Y + Y  G
Sbjct: 135 CINCFQQSTSWFDPLKSVSFKTLGCGFPGYNYI-----NGYKCNRFNQAE--YKLRYLGG 187

Query: 224 SYTRGELGREHLGL-----GKASVNDFIFGCGR-----NNKGLFGGVSGLMGLGRSDLSL 273
             ++G L +E L       GK   ++  FGCG      NN   + GV GL       +++
Sbjct: 188 DSSQGILAKESLLFETLDEGKIKKSNITFGCGHMNIKTNNDDAYNGVFGLGAY--PHITM 245

Query: 274 VSQTSEIFGGLFSYCLPSTQDA-GASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYI 332
            +Q     G  FSYC+    +       L+LG  S +  +STP+             +Y+
Sbjct: 246 ATQ----LGNKFSYCIGDINNPLYTHNHLVLGQGSYIEGDSTPLQI-------HFGHYYV 294

Query: 333 LNLTGISIGGKQL-------QASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGF 385
             L  IS+G K L       + S    GG+LIDSG   T+L    +  L  E +    G 
Sbjct: 295 -TLQSISVGSKTLKIDPNAFKISSDGSGGVLIDSGMTYTKLANGGFELLYDEIVDLMKGL 353

Query: 386 ----PSAPGFSILDTCFN-LSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVC 440
               P+   F  L  CF  + +   V  P V   F G A++ ++   +  F +    + C
Sbjct: 354 LERIPTQRKFEGL--CFKGVVSRDLVGFPAVTFHFAGGADLVLESGSL--FRQHGGDRFC 409

Query: 441 LA-LASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
           LA L S S      +IG   Q+N  V +D +  ++ F   DC
Sbjct: 410 LAILPSNSELLNLSVIGILAQQNYNVGFDLEQMKVFFRRIDC 451


>gi|255566006|ref|XP_002523991.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223536718|gb|EEF38359.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 455

 Score =  112 bits (281), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 115/442 (26%), Positives = 192/442 (43%), Gaps = 46/442 (10%)

Query: 66  AITLELKHKNYCSGKIVDWNEQQQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLT 125
           + T EL H +  +    + +E   +RL       + LQ R  N ++  +  +SN++  + 
Sbjct: 37  SFTAELIHIDSPNSPFFNASETTTHRL------AKALQ-RSANRVA-RLNPLSNSDEGVH 88

Query: 126 SGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSY 183
           + I     NY+  + +G     +   +DTGS++ W+ C  CK C+NQ   +F+P  S +Y
Sbjct: 89  ASIFSGDGNYLMKLLIGTPPTEIHAAIDTGSNVIWIPCINCKDCFNQSSSIFNPLASSTY 148

Query: 184 KKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVN 243
           +   C+S  C     +  +  VC  S    C+        +   G +  + + L  +   
Sbjct: 149 QDAPCDSYQCETTSSSCQSDNVCLYS----CD---EKHQLNCPNGRIAVDTMTLTSSDGR 201

Query: 244 DFI-----FGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGAS 298
            F      F CG +    F GV G++GLGR  LSL S+   +  G FSYCL        S
Sbjct: 202 PFPLPYSDFVCGNSIYKTFAGV-GVIGLGRGALSLTSKLYHLSDGKFSYCLADYYSKQPS 260

Query: 299 GSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ----ASGFAK-- 352
             +  G  S +  +   +  T +  +     +Y+  L GIS+G K+         FA   
Sbjct: 261 -KINFGLQSFISDDDLEVVSTTLGHHRHSGNYYV-TLEGISVGEKRQDLYYVDDPFAPPV 318

Query: 353 GGILIDSGTVITRLPPSIY----SALKAEFLKQFSGFPSAPGFSI-LDTCFNLSA----Y 403
           G +LIDSGT+ T LP   Y    S +     +     P    F   +D    LS     Y
Sbjct: 319 GNMLIDSGTMFTLLPKDFYDYLWSTVSYAIPENPQNHPHNSRFPFSMDNTLKLSPCFWYY 378

Query: 404 QEVNIPLVKMEF-EGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKN 462
            E+  P + + F + + E++ D +    F++     VC A A+ +   ++ + G++QQ N
Sbjct: 379 PELKFPKITIHFTDADVELSDDNS----FIRVAEDVVCFAFAA-TQPGQSTVYGSWQQMN 433

Query: 463 QRVIYDTKNSQLGFAGEDCSSM 484
             + YD K   + F   DCS +
Sbjct: 434 FILGYDLKRGTVSFKRTDCSKL 455


>gi|224091849|ref|XP_002309371.1| predicted protein [Populus trichocarpa]
 gi|222855347|gb|EEE92894.1| predicted protein [Populus trichocarpa]
          Length = 438

 Score =  112 bits (281), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 109/350 (31%), Positives = 154/350 (44%), Gaps = 51/350 (14%)

Query: 149 IVDTGSDLTWVQCQPCKSCYNQ-QDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCS 207
           I+DTGS L W+QC PCKSC  Q   P+FDPSIS +Y  + C +  C         SG C 
Sbjct: 118 IMDTGSSLLWIQCAPCKSCSQQIIGPMFDPSISSTYDSLSCKNIICRY-----APSGECD 172

Query: 208 SSSPPDCNYFVSYGDGSYTRGELGREHLGL-----GKASVNDFIFGCG-RNNKGLFGGVS 261
           SSS   C Y  +Y +G  + G +  E L       G+ +VN+ +FGC  RN        +
Sbjct: 173 SSS--QCVYNQTYVEGLPSVGVIATEQLIFGSSDEGRNAVNNVLFGCSHRNGNYKDRRFT 230

Query: 262 GLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGAS-GSLILGGNSSVFKNSTPITYTN 320
           G+ GLG    S+V+Q     G  FSYC+ +  D   S   L+L    ++   STP+   +
Sbjct: 231 GVFGLGSGITSVVNQ----MGSKFSYCIGNIADPDYSYNQLVLSEGVNMEGYSTPLDVVD 286

Query: 321 MIPNPQLATFYILNLTGISIGGKQL--QASGFAKG----GILIDSGTVITRLPPSIYSAL 374
                     Y + L GIS+G  +L    S F +      ++IDSGT  T L  + Y AL
Sbjct: 287 --------GHYQVILEGISVGETRLVIDPSAFKRTEKQRRVIIDSGTAPTWLAENEYRAL 338

Query: 375 KAEFLKQFSGFPSAPGFSILDTCFNLSAYQE-VNIPLVKMEFEGNAEMTVDVTGIVYFVK 433
           + E       F + P       C+     Q+ V  P V   F   A++ VD         
Sbjct: 339 EREVRNLLDRFLT-PFMRESFLCYKGKVGQDLVGFPAVTFHFAEGADLVVDTE------- 390

Query: 434 SDASQVCLALASLSYED--ETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
                  +  AS+  +D  +  +IG   Q+   V YD    +L F   DC
Sbjct: 391 -------MRQASVYGKDFKDFSVIGLMAQQYYNVAYDLNKHKLFFQRIDC 433


>gi|357122155|ref|XP_003562781.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
           distachyon]
          Length = 629

 Score =  112 bits (281), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 109/387 (28%), Positives = 165/387 (42%), Gaps = 62/387 (16%)

Query: 124 LTSGIRLQTLNYIATIELGGRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSY 183
           LT+G     L YI T     +   +IVD+GS +T+V C  C+ C N QDP F P +S +Y
Sbjct: 80  LTNGYYTTRL-YIGTPP---QEFALIVDSGSTVTYVPCASCEQCGNHQDPRFQPDLSSTY 135

Query: 184 KKVLCNSS-TCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS- 241
             V C++  TC               S    C Y   Y + S + G LG + +  G  S 
Sbjct: 136 SPVKCSADCTC--------------DSDKSQCTYERQYAEMSSSSGVLGEDIVSFGTESE 181

Query: 242 --VNDFIFGCGRNNKG-LFG-GVSGLMGLGRSDLSLVSQTSE--IFGGLFSYCLPSTQDA 295
                 +FGC  +  G LF     G+MGLGR  LS++ Q  +  + G  FS C       
Sbjct: 182 LKPQRAVFGCENSETGDLFSQHADGIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGGMDIG 241

Query: 296 GASGSLILGGNSS----VFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGF- 350
           G  G+++LG   +    VF  S P+           + +Y + L  I + GK L+     
Sbjct: 242 G--GAMVLGAMPAPPDMVFSRSDPVR----------SPYYNIELKEIHVAGKALRLDPRI 289

Query: 351 --AKGGILIDSGTVITRLPPSIYSAL------KAEFLKQFSGFPSAPGFSILDTCF---- 398
             +K G ++DSGT    LP   + A       K   LK+  G    P  +  D CF    
Sbjct: 290 FDSKHGTVLDSGTTYAYLPEQAFVAFKDAVTSKVRPLKKIRG----PDPNYKDICFAGAG 345

Query: 399 -NLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGN 457
            N+S   +   P V M F    ++++     ++         CL +   + +D T ++G 
Sbjct: 346 RNVSQLSQA-FPDVDMVFGDGQKLSLSPENYLFRHSKVEGAYCLGVFQ-NGKDPTTLLGG 403

Query: 458 YQQKNQRVIYDTKNSQLGFAGEDCSSM 484
              +N  V YD  N ++GF   +CS +
Sbjct: 404 IVVRNTLVTYDRHNEKIGFWKTNCSEL 430


>gi|194707632|gb|ACF87900.1| unknown [Zea mays]
          Length = 423

 Score =  112 bits (281), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 113/387 (29%), Positives = 173/387 (44%), Gaps = 62/387 (16%)

Query: 135 YIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSC-----YNQQDPVFDPSISPSYKKVL 187
           Y   ++LG   +   V +DTGSD+ WV C PC  C      N Q   F+P  S +  ++ 
Sbjct: 5   YFTRVKLGNPAKEFFVQIDTGSDILWVTCSPCTGCPTSSGLNIQLESFNPDSSSTASRIT 64

Query: 188 CNSSTCHALEFATGNSGVC----SSSSPPDCNYFVSYGDGSYTRGE-----------LGR 232
           C+   C A  F TG + +C    S SSP  C Y  +YGDGS T G            +G 
Sbjct: 65  CSDDRCTA-GFQTGEA-ICQTSNSQSSP--CGYTFTYGDGSGTSGYYVSDTMFFETVMGN 120

Query: 233 EHLGLGKASVNDFIFGCGRNNKGLFG----GVSGLMGLGRSDLSLVSQTSE--IFGGLFS 286
           E      AS+   +FGC  +  G        V G+ G G+  LS++SQ +   +   +FS
Sbjct: 121 EQTANSSASI---VFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFS 177

Query: 287 YCLPSTQDAGASGSLILGGNSSVFKNSTP-ITYTNMIPNPQLATFYILNLTGISIGGKQL 345
           +CL  + + G  G L+LG      +   P + YT ++P+      Y LNL  I++ G++L
Sbjct: 178 HCLKGSDNGG--GILVLG------EIVEPGLVYTPLVPS---QPHYNLNLESIAVNGQKL 226

Query: 346 --QASGFAKG---GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGF-SILDTCFN 399
              +S F      G ++DSGT +  L    Y    +      S  PS     S    CF 
Sbjct: 227 PIDSSLFTTSNTQGTIVDSGTTLAYLADGAYDPFVSAIAAAVS--PSVRSLVSKGSQCFI 284

Query: 400 LSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETG----II 455
            S+  + + P V + F G   M+V       ++   AS     L  + ++   G    I+
Sbjct: 285 TSSSVDSSFPTVTLYFMGGVAMSVKPEN---YLLQQASVDNSVLWCIGWQRNQGQEITIL 341

Query: 456 GNYQQKNQRVIYDTKNSQLGFAGEDCS 482
           G+   K++  +YD  N ++G+A  DCS
Sbjct: 342 GDLVLKDKIFVYDLANMRMGWADYDCS 368


>gi|326515330|dbj|BAK03578.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 445

 Score =  112 bits (281), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 99/363 (27%), Positives = 162/363 (44%), Gaps = 44/363 (12%)

Query: 148 VIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCS 207
           +++DT S L W++C  C     Q+ PVFDPS S SY+ +   S  C A          CS
Sbjct: 91  LVLDTASSLPWMRCAHCLPVQRQRSPVFDPSDSSSYRPLHPTSPLCRAPNPVLPAGDKCS 150

Query: 208 SSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS--VNDFIFGCGRNNKGL--FGGVSGL 263
              P + +            G +G + + LG  +  ++   FGC ++ +G    G  +G 
Sbjct: 151 FHLPGEAH------------GYVGTDTIILGNPTLPIHSVAFGCAQSTEGFDTKGTFAGT 198

Query: 264 MGLGRSDLSLVSQTSEIFGGLFSYCLPST-QDAGASGSLILGGNSS-----VFKNSTPIT 317
           +G+G+   SL+ Q  +  G  FSYCL       G +G +  G +       V      + 
Sbjct: 199 LGMGKLPTSLIMQIKDRVGSRFSYCLIGLGHSPGRNGFIRFGADIPDPTLLVHHRIKILP 258

Query: 318 YTNMIPNPQLATFYILNLTGISIGGKQ---LQASGFAK-----GGILIDSGTVITRLPPS 369
               +P+    + Y + L GIS+ G     ++ + F +     GG  +D+GT +T L P+
Sbjct: 259 TPPHLPHGVADSAYYVKLLGISLNGTPIPGIRQAMFERRSDGSGGCFVDAGTQVTHLVPA 318

Query: 370 IYSALK---AEFLKQFSGFPSA--PGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVD 424
            Y+ ++   A  ++Q+ G+     P FS+   CF        +IP + ++FEG A  TV 
Sbjct: 319 AYAVVEEAVAHMVQQW-GYKRVRDPNFSL---CFREHPGIWSHIPKLTLDFEGPASRTVA 374

Query: 425 VTGIV---YFVKSDASQ-VCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGED 480
              IV    F+K D    VC  +   S    T ++G  QQ + R I+D   + + F  E 
Sbjct: 375 HLEIVSRNLFLKVDNQPLVCFGVYRTSRGSPT-VVGAMQQVDTRFIFDLHANTITFHRES 433

Query: 481 CSS 483
           C +
Sbjct: 434 CEA 436


>gi|88174571|gb|ABD39360.1| chloroplast nucleoid DNA-binding protein [Oryza nivara]
          Length = 321

 Score =  112 bits (280), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 93/326 (28%), Positives = 157/326 (48%), Gaps = 30/326 (9%)

Query: 135 YIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
           Y+ ++ LG   +   V +DTGS  +WV C+ C  C+      F  S S +  KV C +S 
Sbjct: 1   YVISVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNPR-TFLQSRSTTCAKVSCGTSM 58

Query: 193 CHALEFATGNSGVCSSSSP-PDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCG 250
           C       G+   C  S   PDC + VSY DGS + G L ++ L       +  F FGC 
Sbjct: 59  C----LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPSFTFGCN 114

Query: 251 RNNKGL--FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQD-----AGASGSLIL 303
            ++ G   FG V GL+G+G   +S++ Q+S  F G FSYCLP  +      +  +G   L
Sbjct: 115 LDSFGANEFGNVDGLLGMGAGPMSVLKQSSPRFDG-FSYCLPLQKSERGFFSKTTGYFSL 173

Query: 304 GGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQ--LQASGFAKGGILIDSGT 361
           G  ++     T + YT M+   +    + ++L  IS+ G++  L  S F++ G++ DSG+
Sbjct: 174 GKVAT----RTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDSGS 229

Query: 362 VITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEM 421
            ++ +P    S L ++ +++      A        C+++ +  E ++P + + F+  A  
Sbjct: 230 ELSYIPDRALSVL-SQRIRELLLRRGAAEEESERNCYDMRSVDEGDMPAISLHFDDGARF 288

Query: 422 TVDVTGIVYFVKSDASQ---VCLALA 444
            +   G+  FV+    +    CLA A
Sbjct: 289 DLGSKGV--FVERSVQEQDVWCLAFA 312


>gi|356498711|ref|XP_003518193.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 466

 Score =  112 bits (280), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 116/391 (29%), Positives = 167/391 (42%), Gaps = 54/391 (13%)

Query: 135 YIATIELGGRNMTV--IVDTGSDLTWVQCQP---CKSCYN-QQDPVFDPSISPSYKKVLC 188
           Y   +E G  + T   ++DTGS L W+ C     C  C +    P F P  S S K V C
Sbjct: 86  YSIDLEFGTPSQTFPFVLDTGSTLVWLPCSSHYLCSKCNSFSNTPKFIPKNSSSSKFVGC 145

Query: 189 NSSTC----------HALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG 238
            +  C          H           CS + P    Y V YG GS T G L  E+L   
Sbjct: 146 TNPKCAWVFGPDVKSHCCRQDKAAFNNCSQTCP---AYTVQYGLGS-TAGFLLSENLNFP 201

Query: 239 KASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQ---DA 295
               +DF+ GC   +       +G+ G GR + SL SQ +      FSYCL S Q    A
Sbjct: 202 TKKYSDFLLGCSVVS---VYQPAGIAGFGRGEESLPSQMNLT---RFSYCLLSHQFDDSA 255

Query: 296 GASGSLILGGNSSVFKNSTPITYTNMIPNP------QLATFYILNLTGISIGGKQ----- 344
             + +L+L   SS    +  ++YT  + NP          +Y + L  I +G K+     
Sbjct: 256 TITSNLVLETASSRDGKTNGVSYTPFLKNPTTKKNPAFGAYYYITLKRIVVGEKRVRVPR 315

Query: 345 --LQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQ--FSGFPSAPGFSILDTCFNL 400
             L+ +    GG ++DSG+  T +   I+  +  EF KQ  ++    A     L  CF L
Sbjct: 316 RLLEPNVDGDGGFIVDSGSTFTFMERPIFDLVAQEFAKQVSYTRAREAEKQFGLSPCFVL 375

Query: 401 SAYQE-VNIPLVKMEFEGNAEMTVDVTGIVYFV-KSDASQVCLALASLSYEDETG----- 453
           +   E  + P ++ EF G A+M + V      V K D +  CL + S       G     
Sbjct: 376 AGGAETASFPELRFEFRGGAKMRLPVANYFSLVGKGDVA--CLTIVSDDVAGSGGTVGPA 433

Query: 454 -IIGNYQQKNQRVIYDTKNSQLGFAGEDCSS 483
            I+GNYQQ+N  V YD +N + GF  + C +
Sbjct: 434 VILGNYQQQNFYVEYDLENERFGFRSQSCQT 464


>gi|168030587|ref|XP_001767804.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162680886|gb|EDQ67318.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 399

 Score =  112 bits (280), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 106/370 (28%), Positives = 158/370 (42%), Gaps = 54/370 (14%)

Query: 146 MTVIVDTGSDLTWVQCQPCKSCYNQQ-----------DPVFDPSISPSYKKVLCNSSTCH 194
             +IVDTGS +T+V C  C  C + Q           DP F P  S SY+K+ C SS C 
Sbjct: 53  FALIVDTGSTVTYVPCSSCTHCGHHQASFSTHRLFCRDPRFKPENSSSYQKIGCRSSDC- 111

Query: 195 ALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASV---NDFIFGCGR 251
                   +G+C S+S   C Y   Y + S ++G LG++ L  G AS        FGC  
Sbjct: 112 -------ITGLCDSNS-HQCKYERMYAEMSTSKGVLGKDLLDFGPASRLQSQLLSFGCET 163

Query: 252 NNKG-LFGGVS-GLMGLGRSDLSLVSQT--SEIFGGLFSYCLPSTQDAGASGSLILGG-- 305
              G L+  V+ G+MGLGR  LS+V Q   +      FS C     + G  GS++LG   
Sbjct: 164 AESGDLYLQVADGIMGLGRGPLSIVDQLVGNGAIEDSFSLCYGGMDEGG--GSMVLGAIP 221

Query: 306 --NSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASG---FAKGGILIDSG 360
             +  VF  S          +P+ + +Y L LT I + G  L+        K G ++DSG
Sbjct: 222 APSGMVFAKS----------DPRRSNYYNLELTEIQVQGASLKLDSNVFNGKFGTILDSG 271

Query: 361 TVITRLPPSIYSALKAEFLKQFSGFPS--APGFSILDTCFNLSAYQEVNI----PLVKME 414
           T    LP   + A     + Q     +   P  +  D C+  +      +    PLV   
Sbjct: 272 TTYAYLPDRAFEAFTDAVVAQLGSLQAVDGPDPNYPDICYAGAGTDTKELGKHFPLVDFV 331

Query: 415 FEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQL 474
           F  N ++++     ++         CL       +D T ++G    +N  V YD  N Q+
Sbjct: 332 FAENQKVSLAPENYLFKHTKVPGAYCLGF--FKNQDATTLLGGIIVRNMLVTYDRYNHQI 389

Query: 475 GFAGEDCSSM 484
           GF   +C+ +
Sbjct: 390 GFLKTNCTEL 399


>gi|413946455|gb|AFW79104.1| hypothetical protein ZEAMMB73_209101 [Zea mays]
          Length = 480

 Score =  112 bits (280), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 102/390 (26%), Positives = 177/390 (45%), Gaps = 38/390 (9%)

Query: 122 IPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPC-KSCYNQQDPVFDPS 178
           +PL+SG    T  Y     +G   +   ++ DTGSDLTWV+C        +    VF  +
Sbjct: 99  MPLSSGAYTGTGQYFVRFRVGTPAQPFVLVADTGSDLTWVKCSGAGDGTGDAPRRVFRAA 158

Query: 179 ISPSYKKVLCNSSTCHA-LEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHL-- 235
            S S+  + C+S TC + + F+  N   CSS + P C Y   Y DGS  RG +G +    
Sbjct: 159 ASRSWAPIACSSDTCTSYVPFSLAN---CSSPASP-CAYDYRYNDGSAARGVVGTDSATI 214

Query: 236 ----------GLGKASVNDFIFGCGRNNKGL-FGGVSGLMGLGRSDLSLVSQTSEIFGGL 284
                     G  +A +   + GC  +  G  F    G++ LG S++S  S+ +  FGG 
Sbjct: 215 ALSGSESRDGGGRRAKLQGVVLGCTASYDGQSFQSSDGVLSLGNSNISFASRAAARFGGR 274

Query: 285 FSYCLPSTQDAGASGSLIL-------GGNSSVFKNSTPITYTNMIPNPQLATFYILNLTG 337
           FSYCL        + S +        GG ++   +S+    T ++ + +++ FY + +  
Sbjct: 275 FSYCLVDHLAPRNATSYLTFGPPGPEGGAAASSSSSSAAARTPLLLDRRMSPFYAVAVDA 334

Query: 338 ISIGGKQLQASG----FAK-GGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFS 392
           + + G+ L         A+ GG ++DSGT +T L    Y A+ A   ++ +G P      
Sbjct: 335 VHVAGEALDIPADVWDVARGGGAILDSGTSLTVLATPAYRAVVAALSERLAGLPRV-SMD 393

Query: 393 ILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDET 452
             + C+N +A   + IP +++ F G+A +        Y V +     C+ +   ++   +
Sbjct: 394 PFEYCYNWTA-AALEIPGLEVRFAGSARLQPPAKS--YVVDAAPGVKCIGVQEGAWPGVS 450

Query: 453 GIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
            +IGN  Q++    +D ++  L F    C+
Sbjct: 451 -VIGNILQQDHLWEFDLRDRWLRFKHTRCA 479


>gi|356514298|ref|XP_003525843.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
          Length = 663

 Score =  112 bits (280), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 98/356 (27%), Positives = 156/356 (43%), Gaps = 40/356 (11%)

Query: 146 MTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCN-SSTCHALEFATGNSG 204
             +IVDTGS +T+V C  C+ C   QDP F P  S +Y+ V C     C           
Sbjct: 125 FALIVDTGSTVTYVPCSTCEQCGRHQDPKFQPESSSTYQPVKCTIDCNCDGDRM------ 178

Query: 205 VCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS---VNDFIFGCGRNNKG-LFG-G 259
                    C Y   Y + S + G LG + +  G  S       +FGC     G L+   
Sbjct: 179 --------QCVYERQYAEMSTSSGVLGEDVISFGNQSELAPQRAVFGCENVETGDLYSQH 230

Query: 260 VSGLMGLGRSDLSLVSQ--TSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPIT 317
             G+MGLGR DLS++ Q    ++    FS C     D G  G+++LGG S       P  
Sbjct: 231 ADGIMGLGRGDLSIMDQLVDKKVISDSFSLCY-GGMDVGG-GAMVLGGISP------PSD 282

Query: 318 YTNMIPNPQLATFYILNLTGISIGGKQ--LQASGF-AKGGILIDSGTVITRLPPSIYSAL 374
            T    +P  + +Y ++L  + + GK+  L A+ F  K G ++DSGT    LP + + A 
Sbjct: 283 MTFAYSDPDRSPYYNIDLKEMHVAGKRLPLNANVFDGKHGTVLDSGTTYAYLPEAAFLAF 342

Query: 375 KAEFLKQFSGFP--SAPGFSILDTCF----NLSAYQEVNIPLVKMEFEGNAEMTVDVTGI 428
           K   +K+       S P  +  D CF    N  +    + P+V M F    + ++     
Sbjct: 343 KDAIVKELQSLKQISGPDPNYNDICFSGAGNDVSQLSKSFPVVDMVFGNGHKYSLSPENY 402

Query: 429 VYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
           ++         CL +   +  D+T ++G    +N  V+YD + +++GF   +C+ +
Sbjct: 403 MFRHSKVRGAYCLGIFQ-NGNDQTTLLGGIIVRNTLVMYDREQTKIGFWKTNCAEL 457


>gi|20160862|dbj|BAB89801.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
          Length = 488

 Score =  112 bits (280), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 97/337 (28%), Positives = 154/337 (45%), Gaps = 32/337 (9%)

Query: 150 VDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSS 209
           +D  SDL W  C             F+P  S +   V C    C   +FA    G  +S 
Sbjct: 117 LDISSDLVWTACGATAP--------FNPVRSTTVADVPCTDDACQ--QFAPQTCGAGAS- 165

Query: 210 SPPDCNYFVSYGDGSY-TRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGR 268
              +C Y   YG G+  T G LG E    G   ++  +FGCG  N G F GVSG++GLGR
Sbjct: 166 ---ECAYTYMYGGGAANTTGLLGTEAFTFGDTRIDGVVFGCGLKNVGDFSGVSGVIGLGR 222

Query: 269 SDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLA 328
            +LSLVSQ        FSY   +  D+  + S IL G+ +  + S  ++ T ++ +    
Sbjct: 223 GNLSLVSQLQV---DRFSYHF-APDDSVDTQSFILFGDDATPQTSHTLS-TRLLASDANP 277

Query: 329 TFYILNLTGISIGGKQLQ-ASGF-------AKGGILIDSGTVITRLPPSIYSALKAEFLK 380
           + Y + L GI + GK L   SG          GG+ +    ++T L  + Y  L+     
Sbjct: 278 SLYYVELAGIQVDGKDLAIPSGTFDLRNKDGSGGVFLSITDLVTVLEEAAYKPLRQAVAS 337

Query: 381 QFSGFPSAPGFSI-LDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV 439
           +  G P+  G ++ LD C+   +  +  +P + + F G A M +++ G  +++ S     
Sbjct: 338 KI-GLPAVNGSALGLDLCYTGESLAKAKVPSMALVFAGGAVMELEL-GNYFYMDSTTGLA 395

Query: 440 CLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGF 476
           CL +   S  D + ++G+  Q    ++YD   S+L F
Sbjct: 396 CLTILPSSAGDGS-VLGSLIQVGTHMMYDINGSKLVF 431


>gi|125553836|gb|EAY99441.1| hypothetical protein OsI_21409 [Oryza sativa Indica Group]
          Length = 376

 Score =  112 bits (280), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 87/276 (31%), Positives = 137/276 (49%), Gaps = 34/276 (12%)

Query: 92  LILDNLHVQYLQSRIK-----NMISGNIKDVSNTEI----PLTS-GIRLQTLNYIATIE- 140
           L  D   V Y+Q R+      N ++G   D   T++    P ++ G+  + +   A  + 
Sbjct: 96  LDADQHRVAYIQKRLAGGDTSNGVAGASWDGQTTDVGTYLPASNVGVGAKMIGTTAAPDG 155

Query: 141 LGGRNMTVIVDTGSDLTWVQCQPCK--SCYNQQDPVFDPSISPSYKKVLCNSSTCHAL-E 197
                 TVI+D+GSD+ WVQCQPC    C+ Q+DP+FDP+ S +Y  V C+S+ C  L  
Sbjct: 156 TSAVRQTVIIDSGSDVPWVQCQPCPLLVCHPQRDPLFDPATSTTYSAVPCSSAACARLGP 215

Query: 198 FATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS-VNDFIFGCGRNNKG- 255
           +  G    CS++    C +  +Y DG+   G    + L LG    V  F+FGC   ++G 
Sbjct: 216 YRRG----CSANV--QCQFGFTYTDGATATGTYSSDDLTLGPYDVVRGFLFGCAHADRGS 269

Query: 256 -LFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILG---GNSSVFK 311
                VSG + LG    S V QT+  +G +FSYC+P +    + G + LG     +++  
Sbjct: 270 TFSFDVSGTLALGGGAQSFVQQTATQYGRVFSYCIPPSPS--SLGFITLGVPPQRAALVP 327

Query: 312 N--STPITYTNMIPNPQLATFYILNLTGISIGGKQL 345
              STP+  ++ +P     TFY + L  I + G+ L
Sbjct: 328 TFVSTPLLSSSSMP----PTFYRVLLRAIIVAGRPL 359


>gi|218199944|gb|EEC82371.1| hypothetical protein OsI_26705 [Oryza sativa Indica Group]
          Length = 642

 Score =  112 bits (280), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 114/397 (28%), Positives = 167/397 (42%), Gaps = 72/397 (18%)

Query: 124 LTSGIRLQTLNYIATIELGGRNMTVIVDTGSDLTWVQCQPCKSCYNQQ----------DP 173
           LT+G     L YI T     +   +IVD+GS +T+V C  C+ C N Q          DP
Sbjct: 87  LTNGYYTTRL-YIGT---PSQEFALIVDSGSTVTYVPCATCEQCGNHQSESPNIIEAHDP 142

Query: 174 VFDPSISPSYKKVLCN-SSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGR 232
            F P +S +Y  V CN   TC               +    C Y   Y + S + G LG 
Sbjct: 143 RFQPDLSSTYSPVKCNVDCTC--------------DNERSQCTYERQYAEMSSSSGVLGE 188

Query: 233 EHLGLGKAS---VNDFIFGCGRNNKG-LFG-GVSGLMGLGRSDLSLVSQTSE--IFGGLF 285
           + +  GK S       +FGC     G LF     G+MGLGR  LS++ Q  E  +    F
Sbjct: 189 DIMSFGKESELKPQRAVFGCENTETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSF 248

Query: 286 SYCLPSTQDAGASGSLILGGNSS----VFKNSTPITYTNMIPNPQLATFYILNLTGISIG 341
           S C     D G  G+++LGG  +    VF +S P+           + +Y + L  I + 
Sbjct: 249 SLCY-GGMDVGG-GTMVLGGMPAPPDMVFSHSNPVR----------SPYYNIELKEIHVA 296

Query: 342 GKQLQASGF---AKGGILIDSGTVITRLPPSIYSALKAEF------LKQFSGFPSAPGFS 392
           GK L+       +K G ++DSGT    LP   + A K         LK+  G    P  +
Sbjct: 297 GKALRLDPKIFNSKHGTVLDSGTTYAYLPEQAFVAFKDAVTNKVNSLKKIRG----PDPN 352

Query: 393 ILDTCF-----NLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLS 447
             D CF     N+S   EV  P V M F    ++++     ++         CL +   +
Sbjct: 353 YKDICFAGAGRNVSQLSEV-FPDVDMVFGNGQKLSLSPENYLFRHSKVEGAYCLGVFQ-N 410

Query: 448 YEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
            +D T ++G    +N  V YD  N ++GF   +CS +
Sbjct: 411 GKDPTTLLGGIVVRNTLVTYDRHNEKIGFWKTNCSEL 447


>gi|88174605|gb|ABD39377.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
          Length = 321

 Score =  112 bits (280), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 93/326 (28%), Positives = 157/326 (48%), Gaps = 30/326 (9%)

Query: 135 YIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
           Y+ ++ LG   +   V +DTGS  +WV C+ C  C+      F  S S +  KV C +S 
Sbjct: 1   YVISVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNPR-TFLQSRSTTCAKVSCGTSM 58

Query: 193 CHALEFATGNSGVCSSSSP-PDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCG 250
           C       G+   C  S   PDC + VSY DGS + G L ++ L       +  F FGC 
Sbjct: 59  C----LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPSFTFGCN 114

Query: 251 RNNKGL--FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQD-----AGASGSLIL 303
            ++ G   FG V GL+G+G   +S++ Q+S  F G FSYCLP  +      +  +G   L
Sbjct: 115 LDSFGANEFGNVDGLLGMGAGPMSVLKQSSPRFDG-FSYCLPLQKSERGFFSKTTGYFSL 173

Query: 304 GGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQ--LQASGFAKGGILIDSGT 361
           G  ++     T + YT M+   +    + ++L  IS+ G++  L  S F++ G++ DSG+
Sbjct: 174 GKVAT----RTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDSGS 229

Query: 362 VITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEM 421
            ++ +P    S L ++ +++      A        C+++ +  E ++P + + F+  A  
Sbjct: 230 ELSYIPDRALSVL-SQRIRELLLRRGAAEEESERNCYDMRSVDEGDMPAISLHFDDGARF 288

Query: 422 TVDVTGIVYFVKSDASQ---VCLALA 444
            +   G+  FV+    +    CLA A
Sbjct: 289 DLGSHGV--FVERSVQEQDVWCLAFA 312


>gi|449493359|ref|XP_004159266.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
          Length = 511

 Score =  112 bits (280), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 113/384 (29%), Positives = 161/384 (41%), Gaps = 60/384 (15%)

Query: 144 RNMTVIVDTGSDLTWVQCQP---CKSC-YNQQDPV----FDPSISPSYKKVLCNSSTCHA 195
           +N++ I DTGS L W  C     C  C +   DP     F P +S S K V C +  C  
Sbjct: 143 QNLSFIFDTGSSLVWFPCTAGYRCSRCSFPYVDPATISKFVPKLSSSVKVVGCRNPKCAW 202

Query: 196 L---------EFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFI 246
           +               S  CS S P    Y + YG G+ T G L  E L L    V DF+
Sbjct: 203 IFGPNLKSRCRNCNSKSRKCSDSCP---GYGLQYGSGA-TAGILLSETLDLENKRVPDFL 258

Query: 247 FGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPST--QDAGASGSLILG 304
            GC   +       +G+ G GR   SL SQ        FS+CL S    D+  S  L+L 
Sbjct: 259 VGCSVMS---VHQPAGIAGFGRGPESLPSQMRL---KRFSHCLVSRGFDDSPVSSPLVLD 312

Query: 305 GNSSVFKNST------PITYTNMIPNPQLATFYILNLTGISIGGKQLQ-------ASGFA 351
             S   ++ T      P      + N     +Y L+L  I IGGK ++            
Sbjct: 313 SGSESDESKTKSFIYAPFRENPSVSNAAFREYYYLSLRRILIGGKPVKFPYKYLVPDSTG 372

Query: 352 KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGF---SILDTCFNLSAYQE-VN 407
            GG +IDSG+  T L   I+ A+  E  KQ   +P A      S L  CFN+   +E   
Sbjct: 373 NGGAIIDSGSTFTFLDKPIFEAIADELEKQLVKYPRAKDVEAQSGLRPCFNIPKEEESAE 432

Query: 408 IPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETG---------IIGNY 458
            P V ++F+G  ++++     +  V +D   VCL + +    DE           I+G +
Sbjct: 433 FPDVVLKFKGGGKLSLAAENYLAMV-TDEGVVCLTMMT----DEAVVGGGGGPAIILGAF 487

Query: 459 QQKNQRVIYDTKNSQLGFAGEDCS 482
           QQ+N  V YD    ++GF  + C+
Sbjct: 488 QQQNVLVEYDLAKQRIGFRKQKCT 511


>gi|222637379|gb|EEE67511.1| hypothetical protein OsJ_24961 [Oryza sativa Japonica Group]
          Length = 641

 Score =  112 bits (280), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 114/397 (28%), Positives = 167/397 (42%), Gaps = 72/397 (18%)

Query: 124 LTSGIRLQTLNYIATIELGGRNMTVIVDTGSDLTWVQCQPCKSCYNQQ----------DP 173
           LT+G     L YI T     +   +IVD+GS +T+V C  C+ C N Q          DP
Sbjct: 86  LTNGYYTTRL-YIGT---PSQEFALIVDSGSTVTYVPCATCEQCGNHQSESPNIIEAHDP 141

Query: 174 VFDPSISPSYKKVLCN-SSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGR 232
            F P +S +Y  V CN   TC               +    C Y   Y + S + G LG 
Sbjct: 142 RFQPDLSSTYSPVKCNVDCTC--------------DNERSQCTYERQYAEMSSSSGVLGE 187

Query: 233 EHLGLGKAS---VNDFIFGCGRNNKG-LFG-GVSGLMGLGRSDLSLVSQTSE--IFGGLF 285
           + +  GK S       +FGC     G LF     G+MGLGR  LS++ Q  E  +    F
Sbjct: 188 DIMSFGKESELKPQRAVFGCENTETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSF 247

Query: 286 SYCLPSTQDAGASGSLILGGNSS----VFKNSTPITYTNMIPNPQLATFYILNLTGISIG 341
           S C     D G  G+++LGG  +    VF +S P+           + +Y + L  I + 
Sbjct: 248 SLCY-GGMDVGG-GTMVLGGMPAPPDMVFSHSNPVR----------SPYYNIELKEIHVA 295

Query: 342 GKQLQASGF---AKGGILIDSGTVITRLPPSIYSALKAEF------LKQFSGFPSAPGFS 392
           GK L+       +K G ++DSGT    LP   + A K         LK+  G    P  +
Sbjct: 296 GKALRLDPKIFNSKHGTVLDSGTTYAYLPEQAFVAFKDAVTNKVNSLKKIRG----PDPN 351

Query: 393 ILDTCF-----NLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLS 447
             D CF     N+S   EV  P V M F    ++++     ++         CL +   +
Sbjct: 352 YKDICFAGAGRNVSQLSEV-FPDVDMVFGNGQKLSLSPENYLFRHSKVEGAYCLGVFQ-N 409

Query: 448 YEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
            +D T ++G    +N  V YD  N ++GF   +CS +
Sbjct: 410 GKDPTTLLGGIVVRNTLVTYDRHNEKIGFWKTNCSEL 446


>gi|293332531|ref|NP_001169558.1| uncharacterized protein LOC100383437 precursor [Zea mays]
 gi|224030089|gb|ACN34120.1| unknown [Zea mays]
 gi|413925069|gb|AFW65001.1| hypothetical protein ZEAMMB73_160528 [Zea mays]
          Length = 491

 Score =  112 bits (280), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 110/400 (27%), Positives = 178/400 (44%), Gaps = 54/400 (13%)

Query: 117 VSNTEIPLTS-GIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQD- 172
           ++  ++PL   G+   T  Y   I+LG   ++  V VDTGSD+ WV C  C+ C ++   
Sbjct: 67  LAAADLPLGGLGLPTDTGLYYTEIKLGTPPKHYYVQVDTGSDILWVNCITCEQCPHKSGL 126

Query: 173 ----PVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRG 228
                ++DP  S +   V+C+ + C A  F  G    C ++ P  C Y V+YGDGS T G
Sbjct: 127 GLDLTLYDPKASSTGSMVMCDQAFC-AATFG-GKLPKCGANVP--CEYSVTYGDGSSTIG 182

Query: 229 ELGREHLGLGKASVN--------DFIFGCGRNNKGLFG----GVSGLMGLGRSDLSLVSQ 276
               + L   + + +          IFGCG    G  G     + G++G G ++ S++SQ
Sbjct: 183 SFVTDALQFDQVTRDGQTQPANASVIFGCGAQQGGDLGSSNQALDGILGFGEANTSMLSQ 242

Query: 277 --TSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILN 334
             T+     +F++CL + +     G   +G        +TP+    +   P     Y +N
Sbjct: 243 LTTAGKVKKIFAHCLDTIK---GGGIFSIGDVVQPKVKTTPL----VADKPH----YNVN 291

Query: 335 LTGISIGGKQLQASGF-----AKGGILIDSGTVITRLPPSIY-SALKAEFLK-QFSGFPS 387
           L  I +GG  LQ          K G +IDSGT +T LP  ++   + A F K Q   F  
Sbjct: 292 LKTIDVGGTTLQLPAHIFEPGEKKGTIIDSGTTLTYLPELVFKEVMLAVFNKHQDITFHD 351

Query: 388 APGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLS 447
             GF     CF      +   P +   FE   ++ + V    YF  +     C+   + +
Sbjct: 352 VQGF----LCFQYPGSVDDGFPTITFHFED--DLALHVYPHEYFFANGNDVYCVGFQNGA 405

Query: 448 YEDETG----IIGNYQQKNQRVIYDTKNSQLGFAGEDCSS 483
            + + G    ++G+    N+ VIYD +N  +G+   +CSS
Sbjct: 406 SQSKDGKDIVLMGDLVLSNKLVIYDLENRVIGWTDYNCSS 445


>gi|242062640|ref|XP_002452609.1| hypothetical protein SORBIDRAFT_04g028990 [Sorghum bicolor]
 gi|241932440|gb|EES05585.1| hypothetical protein SORBIDRAFT_04g028990 [Sorghum bicolor]
          Length = 557

 Score =  112 bits (279), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 103/375 (27%), Positives = 162/375 (43%), Gaps = 43/375 (11%)

Query: 135 YIATIELGG--RNMTVIVDTGSDLTWVQCQ-PCKSCYNQQDPVFDPSISPSYKKVLCNSS 191
           Y  +I +G   R   + VDTGSDLTW+QC  PC +C     P++ P+     K V     
Sbjct: 187 YYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLYKPTKE---KIVPPRDL 243

Query: 192 TCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL----GKASVNDFIF 247
            C  L+   GN   C +     C+Y + Y D S + G L R+ + L    G     DF+F
Sbjct: 244 LCQELQ---GNQNYCETCK--QCDYEIEYADQSSSMGVLARDDMHLIATNGGREKLDFVF 298

Query: 248 GCGRNNKGLF----GGVSGLMGLGRSDLSLVSQTSE--IFGGLFSYCLPSTQDAGASGSL 301
           GC  + +G          G++GL  + +SL SQ +   I   +F +C+  T++ G  G +
Sbjct: 299 GCAYDQQGQLLSSPAKTDGILGLSNAAISLPSQLASHGIISNIFGHCI--TREQGGGGYM 356

Query: 302 ILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGG--ILIDS 359
            LG +   +     IT+T++   P     Y      +  G +QL+    A     ++ DS
Sbjct: 357 FLGDD---YVPRWGITWTSIRSGPD--NLYHTEAHHVKYGDQQLRMREQAGNTVQVIFDS 411

Query: 360 GTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFN-------LSAYQEVNIPLVK 412
           G+  T LP  IY  L A       GF        L  C+        L   ++   PL  
Sbjct: 412 GSSYTYLPDEIYENLVAAIKYASPGFVQDSSDRTLPLCWKADFPVRYLEDVKQFFKPL-N 470

Query: 413 MEFEGN---AEMTVDVTGIVYFVKSDASQVCLALASLSYEDE--TGIIGNYQQKNQRVIY 467
           + F         T  ++   Y + SD   VCL L + +  +   T I+G+   + + V+Y
Sbjct: 471 LHFGKKWLFMSKTFTISPEDYLIISDKGNVCLGLLNGTEINHGSTIIVGDVSLRGKLVVY 530

Query: 468 DTKNSQLGFAGEDCS 482
           D +  Q+G+   DC+
Sbjct: 531 DNQRRQIGWTNSDCT 545


>gi|218189149|gb|EEC71576.1| hypothetical protein OsI_03949 [Oryza sativa Indica Group]
          Length = 504

 Score =  112 bits (279), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 109/383 (28%), Positives = 167/383 (43%), Gaps = 54/383 (14%)

Query: 135 YIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSC-----YNQQDPVFDPSISPSYKKVL 187
           Y   ++LG   +   V +DTGSD+ WV C PC  C      N Q   F+P  S +  K+ 
Sbjct: 91  YFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSKIP 150

Query: 188 CNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGE-----------LGREHLG 236
           C+   C A      +  VC +S    C Y  +YGDGS T G            +G E   
Sbjct: 151 CSDDRCTAA--LQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDSVMGNEQTA 208

Query: 237 LGKASVNDFIFGCGRNNKGLFG----GVSGLMGLGRSDLSLVSQTSE--IFGGLFSYCLP 290
              AS+   +FGC  +  G        V G+ G G+  LS+VSQ +   +   +FS+CL 
Sbjct: 209 NSSASI---VFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCLK 265

Query: 291 STQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL--QAS 348
            + + G  G L+LG    + +    + YT ++P+      Y LNL  I + G++L   +S
Sbjct: 266 GSDNGG--GILVLG---EIVEPG--LVYTPLVPS---QPHYNLNLESIVVNGQKLPIDSS 315

Query: 349 GFAKG---GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGF-SILDTCFNLSAYQ 404
            F      G ++DSGT +  L    Y           S  PS     S  + CF  S+  
Sbjct: 316 LFTTSNTQGTIVDSGTTLAYLADGAYDPFVNAITAAVS--PSVRSLVSKGNQCFVTSSSV 373

Query: 405 EVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETG----IIGNYQQ 460
           + + P V + F G   MTV       ++   AS     L  + ++   G    I+G+   
Sbjct: 374 DSSFPTVSLYFMGGVAMTVKPEN---YLLQQASIDNNVLWCIGWQRNQGQQITILGDLVL 430

Query: 461 KNQRVIYDTKNSQLGFAGEDCSS 483
           K++  +YD  N ++G+   DCS+
Sbjct: 431 KDKIFVYDLANMRMGWTDYDCST 453


>gi|356539818|ref|XP_003538390.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 457

 Score =  112 bits (279), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 101/361 (27%), Positives = 160/361 (44%), Gaps = 41/361 (11%)

Query: 148 VIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHAL--EFATGNSGV 205
           +++DTGS L+W+QC             FDPS+S ++  + C    C     +F    S  
Sbjct: 112 MVLDTGSQLSWIQCHKKAPAKPPPTASFDPSLSSTFSTLPCTHPVCKPRIPDFTLPTS-- 169

Query: 206 CSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVND-FIFGCGRNNKGLFGGVSGLM 264
           C  +    C+Y   Y DG+Y  G L RE     ++      I GC   +        G++
Sbjct: 170 CDQNR--LCHYSYFYADGTYAEGNLVREKFTFSRSLFTPPLILGCATESTD----PRGIL 223

Query: 265 GLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAG---ASGSLILG--GNSSVFKNSTPITYT 319
           G+ R  LS  SQ+       FSYC+P+         +GS  LG   NS+ F+    +T+ 
Sbjct: 224 GMNRGRLSFASQSKIT---KFSYCVPTRVTRPGYTPTGSFYLGHNPNSNTFRYIEMLTFA 280

Query: 320 NMIPNPQLATF-YILNLTGISIGGKQL-------QASGFAKGGILIDSGTVITRLPPSIY 371
                P L    Y + L GI IGG++L       +A     G  ++DSG+  T L    Y
Sbjct: 281 RSQRMPNLDPLAYTVALQGIRIGGRKLNISPAVFRADAGGSGQTMLDSGSEFTYLVNEAY 340

Query: 372 SALKAEFLKQFSGFPSAPGF---SILDTCFNLSAYQEVN-IPLVKMEFEGNAEMTVDVTG 427
             ++AE ++   G     G+    + D CF+ +A +    I  +  EFE   ++ V    
Sbjct: 341 DKVRAEVVRAV-GPRMKKGYVYGGVADMCFDGNAIEIGRLIGDMVFEFEKGVQIVVPKER 399

Query: 428 IVYFVKSDASQVCLALASLSYEDETG----IIGNYQQKNQRVIYDTKNSQLGFAGEDCSS 483
           ++  V+      C+ +A+    D+ G    IIGN+ Q+N  V +D  N ++GF   DCS 
Sbjct: 400 VLATVEGGVH--CIGIAN---SDKLGAASNIIGNFHQQNLWVEFDLVNRRMGFGTADCSR 454

Query: 484 M 484
           +
Sbjct: 455 L 455


>gi|88174554|gb|ABD39352.1| chloroplast nucleoid DNA-binding protein [Oryza barthii]
          Length = 321

 Score =  111 bits (278), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 95/326 (29%), Positives = 158/326 (48%), Gaps = 30/326 (9%)

Query: 135 YIATIELGGRNMTVIV--DTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
           Y+ ++ LG  + T IV  DTGS  +WV C+ C  C+      F  S S +  KV C +S 
Sbjct: 1   YVISVGLGTPSKTQIVEIDTGSSTSWVFCE-CDGCHTNPR-TFLQSRSTTCAKVSCGTSM 58

Query: 193 CHALEFATGNSGVCSSSSP-PDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCG 250
           C       G+   C  S   PDC + VSY DGS + G L ++ L       +  F FGC 
Sbjct: 59  C----LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPGFSFGCN 114

Query: 251 RNNKGL--FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQD-----AGASGSLIL 303
            ++ G   FG V GL+G+G   +S++ Q+S  F   FSYCLP  +      +  +G   L
Sbjct: 115 MDSFGANEFGNVDGLLGMGAGAMSVLKQSSPTF-DCFSYCLPLQKSERGFFSKTTGYFSL 173

Query: 304 GGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQ--LQASGFAKGGILIDSGT 361
           G  ++     T + YT M+   +    + ++LT IS+ G++  L  S F++ G++ DSG+
Sbjct: 174 GKVAT----RTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSIFSRKGVVFDSGS 229

Query: 362 VITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEM 421
            ++ +P    S L ++ +++      A        C+++ +  E ++P + + F+  A  
Sbjct: 230 ELSYIPDRALSVL-SQRIRELLLRRGAAEEESERNCYDMRSVDEGDMPAISLHFDDGARF 288

Query: 422 TVDVTGIVYFVKSDASQ---VCLALA 444
            +   G+  FV+    +    CLA A
Sbjct: 289 DLGSHGV--FVERSVQEQDVWCLAFA 312


>gi|21717169|gb|AAM76362.1|AC074196_20 putative nucleoid DNA binding protein, 3'-partial [Oryza sativa
           Japonica Group]
          Length = 377

 Score =  111 bits (278), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 99/332 (29%), Positives = 153/332 (46%), Gaps = 34/332 (10%)

Query: 135 YIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
           Y+A   +G   + ++ +VD   +L W QC PC+ C+ Q  P+FDP+ S +++ + C S  
Sbjct: 57  YVANFTIGTPPQPVSAVVDLTGELVWTQCTPCQPCFEQDLPLFDPTKSSTFRGLPCGSHL 116

Query: 193 CHALEFATGN--SGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGC- 249
           C ++  ++ N  S VC   +P         GD   T G+ G +   +G A      FGC 
Sbjct: 117 CESIPESSRNCTSDVCIYEAP------TKAGD---TGGKAGTDTFAIGAAK-ETLGFGCV 166

Query: 250 GRNNKGL--FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDA----GASGSLIL 303
              +K L   GG SG++GLGR+  SLV+Q +      FSYCL          GA+   + 
Sbjct: 167 VMTDKRLKTIGGPSGIVGLGRTPWSLVTQMNVT---AFSYCLAGKSSGALFLGATAKQLA 223

Query: 304 GG--NSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGGILIDSGT 361
           GG  +S+ F   T    ++   NP    +Y++ L GI  GG  LQA+  +   +L+D+ +
Sbjct: 224 GGKNSSTPFVIKTSAGSSDNGSNP----YYMVKLAGIKTGGAPLQAASSSGSTVLLDTVS 279

Query: 362 VITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEM 421
             + L    Y ALK          P A      D CF  +   +   P +   F+G A +
Sbjct: 280 RASYLADGAYKALKKALTAAVGVQPVASPPKPYDLCFPKAVAGDA--PELVFTFDGGAAL 337

Query: 422 TVDVTGIVYFVKSDASQVCLALASLSYEDETG 453
           TV      Y + S    VCL + S +  + TG
Sbjct: 338 TVPPAN--YLLASGNGTVCLTIGSSASLNLTG 367


>gi|224081804|ref|XP_002306494.1| predicted protein [Populus trichocarpa]
 gi|222855943|gb|EEE93490.1| predicted protein [Populus trichocarpa]
          Length = 564

 Score =  111 bits (278), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 97/361 (26%), Positives = 156/361 (43%), Gaps = 46/361 (12%)

Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNS 203
           +   +IVDTGS +T+V C  C+ C   QDP F P +S +Y+ V CN   C+         
Sbjct: 24  QRFALIVDTGSSVTYVPCSSCEQCGRHQDPKFQPDLSSTYQSVKCNID-CN--------- 73

Query: 204 GVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASV---NDFIFGCGRNNKG-LFG- 258
             C       C Y   Y + S + G LG + +  G  S       +FGC     G L+  
Sbjct: 74  --CDDEK-QQCVYERQYAEMSTSSGVLGEDIISFGNLSALAPQRAVFGCENMETGDLYSQ 130

Query: 259 GVSGLMGLGRSDLSLVSQTSE--IFGGLFSYCLPSTQDAGASGSLILGGNSS----VFKN 312
              G+MG+GR DLS+V    +  +    FS C          G+++LGG S     VF  
Sbjct: 131 HADGIMGMGRGDLSIVDHLVDKGVINDSFSLCY--GGMGIGGGAMVLGGISPPSNMVFSQ 188

Query: 313 STPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA---KGGILIDSGTVITRLPPS 369
           S P+           + +Y ++L  I + GK L  +      K G ++DSGT    LP +
Sbjct: 189 SDPVR----------SPYYNIDLKEIHVAGKPLPLNPTVFDGKHGTILDSGTTYAYLPEA 238

Query: 370 IYSALKAEFLKQFSGFP--SAPGFSILDTCFNLSAYQ----EVNIPLVKMEFEGNAEMTV 423
            + + K   +K+         P  +  D CF+ +         + P V+M F    ++ +
Sbjct: 239 AFVSFKDAIMKELHSLKPIRGPDPNYNDICFSGAGSDISQLSSSFPAVEMVFGNGQKLLL 298

Query: 424 DVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSS 483
                ++         CL +   + +D T ++G    +N  V+YD +NS++GF   +CS 
Sbjct: 299 SPENYLFRHSKVHGAYCLGIFQ-NGKDPTTLLGGIVVRNTLVLYDRENSKIGFWKTNCSE 357

Query: 484 M 484
           +
Sbjct: 358 L 358


>gi|88174579|gb|ABD39364.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
           Group]
 gi|88174585|gb|ABD39367.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
           Group]
 gi|88174595|gb|ABD39372.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
 gi|88174599|gb|ABD39374.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
 gi|88174607|gb|ABD39378.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
          Length = 321

 Score =  111 bits (278), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 94/326 (28%), Positives = 157/326 (48%), Gaps = 30/326 (9%)

Query: 135 YIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
           Y+ ++ LG   +   V +DTGS  +WV C+ C  C+      F  S S +  KV C +S 
Sbjct: 1   YVISVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNPR-TFLQSRSTTCAKVSCGTSM 58

Query: 193 CHALEFATGNSGVC-SSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCG 250
           C       G+   C  S + PDC + VSY DGS + G L ++ L       +  F FGC 
Sbjct: 59  C----LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPGFSFGCN 114

Query: 251 RNNKGL--FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQD-----AGASGSLIL 303
            ++ G   FG V GL+G+G   +S++ Q+S  F   FSYCLP  +      +  +G   L
Sbjct: 115 MDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTF-DCFSYCLPLQKSERGFFSKTTGYFSL 173

Query: 304 GGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQ--LQASGFAKGGILIDSGT 361
           G  ++     T + YT M+   +    + ++LT IS+ G++  L  S F++ G++ DSG+
Sbjct: 174 GKVAT----RTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSVFSRKGVVFDSGS 229

Query: 362 VITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEM 421
            ++ +P    S L     +      +A   S  + C+++ +  E ++P + + F+  A  
Sbjct: 230 ELSYIPDRALSVLSQRIRELLLKRGAAEEESERN-CYDMRSVDEGDMPAISLHFDDGARF 288

Query: 422 TVDVTGIVYFVKSDASQ---VCLALA 444
            +   G+  FV+    +    CLA A
Sbjct: 289 DLGSHGV--FVERSVQEQDVWCLAFA 312


>gi|53791672|dbj|BAD53242.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
           Group]
          Length = 504

 Score =  111 bits (278), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 110/384 (28%), Positives = 167/384 (43%), Gaps = 56/384 (14%)

Query: 135 YIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSC-----YNQQDPVFDPSISPSYKKVL 187
           Y   ++LG   +   V +DTGSD+ WV C PC  C      N Q   F+P  S +  K+ 
Sbjct: 91  YFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSKIP 150

Query: 188 CNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGE-----------LGREHLG 236
           C+   C A      +  VC +S    C Y  +YGDGS T G            +G E   
Sbjct: 151 CSDDRCTAA--LQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDTVMGNEQTA 208

Query: 237 LGKASVNDFIFGCGRNNKGLFG----GVSGLMGLGRSDLSLVSQTSE--IFGGLFSYCLP 290
              AS+   +FGC  +  G        V G+ G G+  LS+VSQ +   +   +FS+CL 
Sbjct: 209 NSSASI---VFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCLK 265

Query: 291 STQDAGASGSLILGGNSSVFKNSTP-ITYTNMIPNPQLATFYILNLTGISIGGKQL--QA 347
            + + G  G L+LG      +   P + YT ++P+      Y LNL  I + G++L   +
Sbjct: 266 GSDNGG--GILVLG------EIVEPGLVYTPLVPS---QPHYNLNLESIVVNGQKLPIDS 314

Query: 348 SGFAKG---GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGF-SILDTCFNLSAY 403
           S F      G ++DSGT +  L    Y           S  PS     S  + CF  S+ 
Sbjct: 315 SLFTTSNTQGTIVDSGTTLAYLADGAYDPFVNAITAAVS--PSVRSLVSKGNQCFVTSSS 372

Query: 404 QEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETG----IIGNYQ 459
            + + P V + F G   MTV       ++   AS     L  + ++   G    I+G+  
Sbjct: 373 VDSSFPTVSLYFMGGVAMTVKPEN---YLLQQASIDNNVLWCIGWQRNQGQQITILGDLV 429

Query: 460 QKNQRVIYDTKNSQLGFAGEDCSS 483
            K++  +YD  N ++G+   DCS+
Sbjct: 430 LKDKIFVYDLANMRMGWTDYDCST 453


>gi|115446115|ref|NP_001046837.1| Os02g0473200 [Oryza sativa Japonica Group]
 gi|47497549|dbj|BAD19621.1| putative aspartic proteinase nepenthesin [Oryza sativa Japonica
           Group]
 gi|47847591|dbj|BAD21978.1| putative aspartic proteinase nepenthesin [Oryza sativa Japonica
           Group]
 gi|113536368|dbj|BAF08751.1| Os02g0473200 [Oryza sativa Japonica Group]
          Length = 494

 Score =  111 bits (278), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 112/398 (28%), Positives = 179/398 (44%), Gaps = 58/398 (14%)

Query: 121 EIPLT-SGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQD----- 172
           ++PL  SG+  +T  Y   I +G   +   V VDTGSD+ WV C  C  C  + +     
Sbjct: 75  DLPLGGSGLATETGLYFTRIGIGTPAKRYYVQVDTGSDILWVNCVSCDGCPRKSNLGIEL 134

Query: 173 PVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGR 232
            ++DP  S S + V C+   C A       S  C+S+SP  C Y +SYGDGS T G    
Sbjct: 135 TMYDPRGSQSGELVTCDQQFCVANYGGVLPS--CTSTSP--CEYSISYGDGSSTAGFFVT 190

Query: 233 EHLGLGKASVN--------DFIFGCGRNNKGLFG----GVSGLMGLGRSDLSLVSQTSEI 280
           + L   + S +           FGCG    G  G     + G++G G+S+ S++SQ +  
Sbjct: 191 DFLQYNQVSGDGQTTPANASVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAA 250

Query: 281 --FGGLFSYCLPSTQDAG--ASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLT 336
                +F++CL +    G  A G+++             +  T ++P+      Y + L 
Sbjct: 251 GKVRKMFAHCLDTVNGGGIFAIGNVV----------QPKVKTTPLVPD---MPHYNVILK 297

Query: 337 GISIGGKQLQ------ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPG 390
           GI +GG  L        SG +KG I IDSGT +  +P  +Y AL A    +         
Sbjct: 298 GIDVGGTALGLPTNIFDSGNSKGTI-IDSGTTLAYVPEGVYKALFAMVFDKHQDISVQ-- 354

Query: 391 FSILD-TCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYE 449
            ++ D +CF  S   +   P V   FEG+  + V      Y  ++  +  C+   +   +
Sbjct: 355 -TLQDFSCFQYSGSVDDGFPEVTFHFEGDVSLIVSPHD--YLFQNGKNLYCMGFQNGGVQ 411

Query: 450 DETG----IIGNYQQKNQRVIYDTKNSQLGFAGEDCSS 483
            + G    ++G+    N+ V+YD +N  +G+A  +CSS
Sbjct: 412 TKDGKDMVLLGDLVLSNKLVLYDLENQAIGWADYNCSS 449


>gi|388515789|gb|AFK45956.1| unknown [Medicago truncatula]
          Length = 225

 Score =  111 bits (278), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 76/235 (32%), Positives = 118/235 (50%), Gaps = 19/235 (8%)

Query: 256 LFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTP 315
           +F G +GL+GLG   +S V Q     GG FSYCL S +   +SGSL  G      + S P
Sbjct: 1   MFVGAAGLLGLGSGPMSFVGQLGGQAGGTFSYCLVS-RGTESSGSLEFG------RESVP 53

Query: 316 I--TYTNMIPNPQLATFYILNLTG-------ISIGGKQLQASGFAKGGILIDSGTVITRL 366
           +  ++ ++I NP+  +FY + L+G       + I     + +   +GG+++D+GT +TRL
Sbjct: 54  VGASWVSLIHNPRAPSFYYIGLSGLGVGGLRVPISEDIFRLNELGEGGVVMDTGTAVTRL 113

Query: 367 PPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVT 426
           P + Y+A +  F+ Q +  P   G SI DTC++L+ +  V +P +   F G   +T+   
Sbjct: 114 PAAAYNAFRDAFVAQTTNLPKTSGVSIFDTCYDLNGFVTVRVPTISFYFLGGPILTLPAR 173

Query: 427 GIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
             +  V S     C A A  S      IIGN QQ+   +  D  N  +GF    C
Sbjct: 174 NFLIPVDS-VGTFCFAFAPSS--SGLSIIGNIQQEGIEISVDGANGYIGFGPNIC 225


>gi|88174556|gb|ABD39353.1| chloroplast nucleoid DNA-binding protein [Oryza barthii]
          Length = 321

 Score =  111 bits (278), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 95/326 (29%), Positives = 158/326 (48%), Gaps = 30/326 (9%)

Query: 135 YIATIELGGRNMTVIV--DTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
           Y+ ++ LG  + T IV  DTGS  +WV C+ C  C+      F  S S +  KV C +S 
Sbjct: 1   YVISVGLGTPSKTQIVEIDTGSSTSWVFCE-CDGCHTNPR-TFLQSRSTTCAKVSCGTSM 58

Query: 193 CHALEFATGNSGVCSSSSP-PDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCG 250
           C       G+   C  S   PDC + VSY DGS + G L ++ L       +  F FGC 
Sbjct: 59  C----LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPGFSFGCN 114

Query: 251 RNNKGL--FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQD-----AGASGSLIL 303
            ++ G   FG V GL+G+G   +S++ Q+S  F   FSYCLP  +      +  +G   L
Sbjct: 115 MDSFGANEFGNVDGLLGMGAGAMSVLKQSSPTF-DCFSYCLPLQKSERGFFSKTTGYFSL 173

Query: 304 GGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQ--LQASGFAKGGILIDSGT 361
           G  ++     T + YT M+   +    + ++LT IS+ G++  L  S F++ G++ DSG+
Sbjct: 174 GKVAT----RTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSIFSRKGVVFDSGS 229

Query: 362 VITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEM 421
            ++ +P    S L ++ +++      A        C+++ +  E ++P + + F+  A  
Sbjct: 230 ELSYIPDRALSVL-SQRIRELLLRRGAAEEESERNCYDMRSVDEGDMPAISLHFDDGARF 288

Query: 422 TVDVTGIVYFVKSDASQ---VCLALA 444
            +   G+  FV+    +    CLA A
Sbjct: 289 DLGRGGV--FVERSVQEQDVWCLAFA 312


>gi|449440014|ref|XP_004137780.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
 gi|449483406|ref|XP_004156582.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
          Length = 449

 Score =  111 bits (277), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 112/380 (29%), Positives = 171/380 (45%), Gaps = 55/380 (14%)

Query: 138 TIELGGRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTC--HA 195
           T+    +N+T+++DTGS+L+W+ C   ++  +     F+P  S SY  + C+SSTC    
Sbjct: 78  TVGTPPQNVTMVIDTGSELSWLHCNTSQNSSSSSS-TFNPVWSSSYSPIPCSSSTCTDQT 136

Query: 196 LEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCG----R 251
            +F    S  C S+    C+  +SY D S + G L  +   +G + + + +FGC      
Sbjct: 137 RDFPIRPS--CDSNQ--FCHATLSYADASSSEGNLATDTFYIGSSGIPNVVFGCMDSIFS 192

Query: 252 NNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFK 311
           +N       +GLMG+ R  LS VSQ        FSYC+    +   SG L+LG  +  F 
Sbjct: 193 SNSEEDSKNTGLMGMNRGSLSFVSQMG---FPKFSYCI---SEYDFSGLLLLGDAN--FS 244

Query: 312 NSTPITYTNMI----PNPQLATF-YILNLTGISIGGKQL-------QASGFAKGGILIDS 359
              P+ YT +I    P P      Y + L GI +  K L       +      G  ++DS
Sbjct: 245 WLAPLNYTPLIEMSTPLPYFDRVAYTVQLEGIKVAHKLLPIPESVFEPDHTGAGQTMVDS 304

Query: 360 GTVITRLPPSIYSALKAEFLKQFSG-----------FPSAPGFSILDTCFNLSAYQEVNI 408
           GT  T L    Y+AL+  FL + +G           F  A     +D C+ +   Q    
Sbjct: 305 GTQFTFLLGPAYTALRDHFLNKTAGSLRVYEDSNFVFQGA-----MDLCYRVPTNQTRLP 359

Query: 409 PL--VKMEFEGNAEMTVDVTGIVYFV----KSDASQVCLALASLSYED-ETGIIGNYQQK 461
           PL  V + F G AEMTV    I+Y V    + + S  C    +      E  +IG+  Q+
Sbjct: 360 PLPSVTLVFRG-AEMTVTGDRILYRVPGERRGNDSIHCFTFGNSDLLGVEAFVIGHLHQQ 418

Query: 462 NQRVIYDTKNSQLGFAGEDC 481
           N  + +D K S++G A   C
Sbjct: 419 NVWMEFDLKKSRIGLAEIRC 438


>gi|356570895|ref|XP_003553619.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 470

 Score =  111 bits (277), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 110/388 (28%), Positives = 171/388 (44%), Gaps = 50/388 (12%)

Query: 135 YIATIELGGRNMT--VIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSY--------K 184
           Y   + LG    T   ++DTGS L W  C     C +   P  DP+  P++        K
Sbjct: 88  YSIDLNLGTPPQTSPFVLDTGSSLVWFPCTSHYLCSHCNFPNIDPTKIPTFIPKNSSTAK 147

Query: 185 KVLCNSSTCHALEFATGNSGVCSSSSPP---DCN-----YFVSYGDGSYTRGELGREHLG 236
            + C +  C  L F       C     P   +C+     Y + YG G+ T G L  ++L 
Sbjct: 148 LLGCRNPKCGYL-FGPDVESRCPQCKKPGSQNCSLTCPSYIIQYGLGA-TAGFLLLDNLN 205

Query: 237 LGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQ--D 294
               +V  F+ GC   +       SG+ G GR   SL SQ +      FSYCL S +  D
Sbjct: 206 FPGKTVPQFLVGCSILS---IRQPSGIAGFGRGQESLPSQMNL---KRFSYCLVSHRFDD 259

Query: 295 AGASGSLILGGNSSVFKNSTPITYTNMIPNPQ----LATFYILNLTGISIGG-------K 343
              S  L+L  +S+    +  ++YT    NP        +Y + L  + +GG       K
Sbjct: 260 TPQSSDLVLQISSTGDTKTNGLSYTPFRSNPSNNSVFREYYYVTLRKLIVGGVDVKIPYK 319

Query: 344 QLQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQ----FSGFPSAPGFSILDTCFN 399
            L+      GG ++DSG+  T +   +Y+ +  EFL+Q    +S   +    S L  CFN
Sbjct: 320 FLEPGSDGNGGTIVDSGSTFTFMERPVYNLVAQEFLRQLGKKYSREENVEAQSGLSPCFN 379

Query: 400 LSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASL--SYEDETG---- 453
           +S  + ++ P    +F+G A+M+  +     FV  DA  +C  + S   + + +T     
Sbjct: 380 ISGVKTISFPEFTFQFKGGAKMSQPLLNYFSFV-GDAEVLCFTVVSDGGAGQPKTAGPAI 438

Query: 454 IIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
           I+GNYQQ+N  V YD +N + GF   +C
Sbjct: 439 ILGNYQQQNFYVEYDLENERFGFGPRNC 466


>gi|224063191|ref|XP_002301033.1| predicted protein [Populus trichocarpa]
 gi|222842759|gb|EEE80306.1| predicted protein [Populus trichocarpa]
          Length = 536

 Score =  111 bits (277), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 105/382 (27%), Positives = 167/382 (43%), Gaps = 50/382 (13%)

Query: 127 GIRLQTLNYIATIELGGRNMTVIV--DTGSDLTWV-----QCQPCKSCY-----NQQDPV 174
           G  L  L+Y   I++G  N++ +V  D GSDL WV     QC P  + Y     ++    
Sbjct: 100 GNELDWLHY-TWIDIGTPNVSFLVALDAGSDLLWVPCDCIQCAPLSASYYNISLDRDLSE 158

Query: 175 FDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGD--GSYTRGELGR 232
           + PS+S + + + C+   C   E+ +     C +   P C Y  +Y D   + + G L  
Sbjct: 159 YSPSLSSTSRHLSCDHQLC---EWGSN----CKNPKDP-CPYIFNYDDFENTTSAGFLVE 210

Query: 233 EHLGLGKASVNDF----------IFGCGRNNKGLF---GGVSGLMGLGRSDLSLVSQTSE 279
           + L L  ASV D           + GCGR   G F       G+MGLG  D+S+ S  ++
Sbjct: 211 DKLHL--ASVGDHTARKMLQASVVLGCGRKQGGSFFDGAAPDGVMGLGPGDISVPSLLAK 268

Query: 280 IFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGIS 339
              GL   C     D   SG ++ G      + STP      +P       Y + +    
Sbjct: 269 --AGLIQNCFSLCFDENDSGRILFGDRGHASQQSTP-----FLPIQGTYVAYFVGVESYC 321

Query: 340 IGGKQLQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFN 399
           +G   L+ SGF     L+DSG+  T LP  +Y+ L +EF KQ +    +    + D C+N
Sbjct: 322 VGNSCLKRSGFKA---LVDSGSSFTYLPSEVYNELVSEFDKQVNAKRISFQDGLWDYCYN 378

Query: 400 LSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQ 459
            S+ +  +IP ++++F  N    V             +  CL+L     +   GIIG   
Sbjct: 379 ASSQELHDIPAIQLKFPRNQNFVVHNPTYSIPHHQGFTMFCLSLQPT--DGSYGIIGQNF 436

Query: 460 QKNQRVIYDTKNSQLGFAGEDC 481
               R+++D +N +LG++   C
Sbjct: 437 MIGYRMVFDIENLKLGWSNSSC 458


>gi|357133002|ref|XP_003568117.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
           distachyon]
          Length = 497

 Score =  111 bits (277), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 103/383 (26%), Positives = 165/383 (43%), Gaps = 55/383 (14%)

Query: 135 YIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQD-----PVFDPSISPSYKKVL 187
           Y   IE+G   +   V VDTGSD+ WV C  C  C  +        ++DP  S S   V 
Sbjct: 87  YYTKIEIGTPPKPFHVQVDTGSDILWVNCVSCDKCPTKSGLGIDLALYDPKGSSSGSAVS 146

Query: 188 CNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVN---- 243
           C++  C A   +      C++  P  C Y   YGDGS T G    + L   + S N    
Sbjct: 147 CDNKFCAATYGSGEKLPGCTAGKP--CEYRAEYGDGSSTAGSFVSDSLQYNQLSGNAQTR 204

Query: 244 ----DFIFGCGRNNKGLFG----GVSGLMGLGRSDLSLVSQTSEI--FGGLFSYCLPSTQ 293
               + IFGCG    G        + G++G G+S+ S +SQ +       +FS+CL + +
Sbjct: 205 HAKANVIFGCGAQQGGDLESTNQALDGIIGFGQSNTSTLSQLASAGEVKKIFSHCLDTIK 264

Query: 294 DAG--ASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGF- 350
             G  A G ++             +  T ++PN    + Y +NL  I + G  LQ     
Sbjct: 265 GGGIFAIGEVV----------QPKVKSTPLLPN---MSHYNVNLQSIDVAGNALQLPPHI 311

Query: 351 ----AKGGILIDSGTVITRLPPSIYSALKAEFLKQFSG--FPSAPGFSILDTCFNLSAYQ 404
                K G +IDSGT +T LP  +Y  + A   ++     F +  GF     CF  S   
Sbjct: 312 FETSEKRGTIIDSGTTLTYLPELVYKDILAAVFQKHQDITFRTIQGF----LCFEYSESV 367

Query: 405 EVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETG----IIGNYQQ 460
           +   P +   FE   ++ ++V    YF ++  +  CL   +  ++ +      ++G+   
Sbjct: 368 DDGFPKITFHFE--DDLGLNVYPHDYFFQNGDNLYCLGFQNGGFQPKDAKDMVLLGDLVL 425

Query: 461 KNQRVIYDTKNSQLGFAGEDCSS 483
            N+ V+YD +   +G+   +CSS
Sbjct: 426 SNKVVVYDLEKQVIGWTDYNCSS 448


>gi|222619345|gb|EEE55477.1| hypothetical protein OsJ_03658 [Oryza sativa Japonica Group]
          Length = 530

 Score =  111 bits (277), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 109/383 (28%), Positives = 167/383 (43%), Gaps = 54/383 (14%)

Query: 135 YIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSC-----YNQQDPVFDPSISPSYKKVL 187
           Y   ++LG   +   V +DTGSD+ WV C PC  C      N Q   F+P  S +  K+ 
Sbjct: 117 YFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSKIP 176

Query: 188 CNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGE-----------LGREHLG 236
           C+   C A      +  VC +S    C Y  +YGDGS T G            +G E   
Sbjct: 177 CSDDRCTAA--LQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDTVMGNEQTA 234

Query: 237 LGKASVNDFIFGCGRNNKGLFG----GVSGLMGLGRSDLSLVSQTSE--IFGGLFSYCLP 290
              AS+   +FGC  +  G        V G+ G G+  LS+VSQ +   +   +FS+CL 
Sbjct: 235 NSSASI---VFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCLK 291

Query: 291 STQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL--QAS 348
            + + G  G L+LG    + +    + YT ++P+      Y LNL  I + G++L   +S
Sbjct: 292 GSDNGG--GILVLG---EIVEPG--LVYTPLVPS---QPHYNLNLESIVVNGQKLPIDSS 341

Query: 349 GFAKG---GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGF-SILDTCFNLSAYQ 404
            F      G ++DSGT +  L    Y           S  PS     S  + CF  S+  
Sbjct: 342 LFTTSNTQGTIVDSGTTLAYLADGAYDPFVNAITAAVS--PSVRSLVSKGNQCFVTSSSV 399

Query: 405 EVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETG----IIGNYQQ 460
           + + P V + F G   MTV       ++   AS     L  + ++   G    I+G+   
Sbjct: 400 DSSFPTVSLYFMGGVAMTVKPEN---YLLQQASIDNNVLWCIGWQRNQGQQITILGDLVL 456

Query: 461 KNQRVIYDTKNSQLGFAGEDCSS 483
           K++  +YD  N ++G+   DCS+
Sbjct: 457 KDKIFVYDLANMRMGWTDYDCST 479


>gi|242079765|ref|XP_002444651.1| hypothetical protein SORBIDRAFT_07g025440 [Sorghum bicolor]
 gi|241941001|gb|EES14146.1| hypothetical protein SORBIDRAFT_07g025440 [Sorghum bicolor]
          Length = 493

 Score =  110 bits (276), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 109/399 (27%), Positives = 176/399 (44%), Gaps = 52/399 (13%)

Query: 117 VSNTEIPLTS-GIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQD- 172
           ++  ++PL   G+   T  Y   + LG   +   V VDTGSD+ WV C  C  C ++   
Sbjct: 69  LATADLPLGGLGLPTDTGLYYTEVRLGTPPKRFYVQVDTGSDILWVNCITCDQCPHKSGL 128

Query: 173 ----PVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRG 228
                ++DP  S +   V+C+   C A  F  G    CS++ P  C Y V+YGDGS T G
Sbjct: 129 GLDLTLYDPKASSTGSTVMCDQGFC-ADTFG-GRLPKCSANVP--CEYSVTYGDGSSTVG 184

Query: 229 ELGREHL--------GLGKASVNDFIFGCGRNNKGLFG----GVSGLMGLGRSDLSLVSQ 276
               + L        G  + +    IFGCG    G  G     + G++G G ++ S++SQ
Sbjct: 185 SFVNDALQFDQVTGDGQTQPANASVIFGCGAQQGGDLGSSSQALDGILGFGEANTSMLSQ 244

Query: 277 --TSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILN 334
             T+     +F++CL + +     G   +G        +TP+    +   P     Y +N
Sbjct: 245 LATAGKVKKIFAHCLDTIK---GGGIFAIGDVVQPKVKTTPL----VADKPH----YNVN 293

Query: 335 LTGISIGGKQLQ--ASGFAKG---GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAP 389
           L  I +GG  L+  A  F  G   G +IDSGT +T LP  ++   K   L  F+      
Sbjct: 294 LKTIDVGGTTLELPADIFKPGEKRGTIIDSGTTLTYLPELVF---KKVMLAVFNKHQDIT 350

Query: 390 GFSILD-TCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSY 448
              + D  CF  S   +   P +   FE   ++ + V    YF  +     C+   + + 
Sbjct: 351 FHDVQDFLCFEYSGSVDDGFPTLTFHFE--DDLALHVYPHEYFFPNGNDVYCVGFQNGAL 408

Query: 449 EDETG----IIGNYQQKNQRVIYDTKNSQLGFAGEDCSS 483
           + + G    ++G+    N+ V+YD +N  +G+   +CSS
Sbjct: 409 QSKDGKDIVLMGDLVLSNKLVVYDLENRVIGWTDYNCSS 447


>gi|224031303|gb|ACN34727.1| unknown [Zea mays]
 gi|413923868|gb|AFW63800.1| hypothetical protein ZEAMMB73_012138 [Zea mays]
          Length = 557

 Score =  110 bits (276), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 102/376 (27%), Positives = 163/376 (43%), Gaps = 43/376 (11%)

Query: 134 NYIATIELGG--RNMTVIVDTGSDLTWVQCQ-PCKSCYNQQDPVFDPSISPSYKKVLCNS 190
            Y  +I +G   R   + VDTGSDLTW+QC  PC +C     P++ P+     K V    
Sbjct: 186 QYYTSIFIGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLYKPAKE---KIVPPRD 242

Query: 191 STCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL----GKASVNDFI 246
             C  L+   GN   C +     C+Y + Y D S + G L R+ + +    G     DF+
Sbjct: 243 LLCQELQ---GNQNYCETCK--QCDYEIEYADQSSSMGVLARDDMHMIATNGGREKLDFV 297

Query: 247 FGCGRNNKGLF----GGVSGLMGLGRSDLSLVSQTSE--IFGGLFSYCLPSTQDAGASGS 300
           FGC  + +G          G++GL  + +S  SQ +   I   +F +C+  T++ G  G 
Sbjct: 298 FGCAYDQQGQLLSSPAKTDGILGLSSAAISFPSQLASHGIIANVFGHCI--TREQGGGGY 355

Query: 301 LILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGG--ILID 358
           + LG +   +     +T+T++   P     Y      +  G +QL+    A     ++ D
Sbjct: 356 MFLGDD---YVPRWGVTWTSIRSGPD--NLYHTQAHHVKYGDQQLRRPEQAGSTVQVIFD 410

Query: 359 SGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFE-- 416
           SG+  T LP  IY  L A       GF        L  C+  + +    +  VK  FE  
Sbjct: 411 SGSSYTYLPNEIYENLVAAIKYASPGFVQDTSDRTLPLCWK-ADFPVRYLEDVKQFFEPL 469

Query: 417 ----GNAEMTVDVTGIV----YFVKSDASQVCLALASLSY--EDETGIIGNYQQKNQRVI 466
               G   + +  T  +    Y + SD   VCL L + +      T I+G+   + + V+
Sbjct: 470 NLHFGKKWLFMSKTFTISPEDYLIISDKGNVCLGLLNGTEINHGSTIIVGDVSLRGKLVV 529

Query: 467 YDTKNSQLGFAGEDCS 482
           YD +  Q+G+A  DC+
Sbjct: 530 YDNQRKQIGWADSDCT 545


>gi|326529727|dbj|BAK04810.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 488

 Score =  110 bits (276), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 128/415 (30%), Positives = 187/415 (45%), Gaps = 81/415 (19%)

Query: 135 YIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQD--------PVFDPSISPSYK 184
           Y  ++ LG   + + V++DTGS LTWV   PC S Y  Q+        PVF P  S S  
Sbjct: 86  YAFSLSLGTPPQPLPVLLDTGSHLTWV---PCTSNYQCQNCSAAAGSFPVFHPKSSSSSL 142

Query: 185 KVLCNSSTC---HA---LEFATGNSGVCSSSS-----------PPDCNYFVSYGDGSYTR 227
            V C+S +C   H+   L     +S  C  S+           PP   Y V YG GS T 
Sbjct: 143 LVSCSSPSCLWIHSKSHLSDCARDSAPCRPSTANCSATATNVCPP---YLVVYGSGS-TA 198

Query: 228 GELGREHLGLGK--ASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLF 285
           G L  + L L    A+  +F  GC   +  +    SGL G GR   S+ +Q        F
Sbjct: 199 GLLVSDTLRLSPRGAASRNFAVGCSLAS--VHQPPSGLAGFGRGAPSVPAQLGV---NKF 253

Query: 286 SYCLPSTQ---DAGASGSLILGGNSSVFKNSTPITYTNMIPN----PQLATFYILNLTGI 338
           SYCL S +   DA  SG L+LG  SS  K    + Y  ++ N    P  + +Y L+LTGI
Sbjct: 254 SYCLLSRRFDDDAAISGELVLGA-SSAGKAKAMMQYAPLLKNAGARPPYSVYYYLSLTGI 312

Query: 339 SIGGKQLQASGFAKGGI--------LIDSGTVITRLPPSIYSALKAEFLKQFSGF----P 386
           ++GGK +     A   +        +IDSGT  T L P+++  + A  +    G      
Sbjct: 313 AVGGKSVALPARALAPVSGGGGGGAIIDSGTTFTYLDPTVFKPVAAAMVAAVGGRYNRSK 372

Query: 387 SAPGFSILDTCFNLSA-YQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQ------V 439
              G   L  CF L A  + +++P + + F G AEM + +    YF+ +  +       +
Sbjct: 373 DVEGALGLRPCFALPAGARTMDLPELSLHFSGGAEMRLPIEN--YFLAAGPASGVAPEAI 430

Query: 440 CLALAS-----------LSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSS 483
           CLA+ S                   I+G++QQ+N +V YD + ++LGF  + CSS
Sbjct: 431 CLAVVSDVSSASGGAGVSGGGGPAIILGSFQQQNYQVEYDLEKNRLGFRQQPCSS 485


>gi|224029721|gb|ACN33936.1| unknown [Zea mays]
 gi|413946782|gb|AFW79431.1| pepsin A [Zea mays]
          Length = 534

 Score =  110 bits (276), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 98/408 (24%), Positives = 173/408 (42%), Gaps = 54/408 (13%)

Query: 121 EIPLTSGIRLQTLN-YIATIELGGRNM--TVIVDTGSDLTWVQCQPCK---SCYNQQDP- 173
           E+P+ S + +  +  Y+ ++ +G   +   +++DT +DLTW+ C+  +     Y +Q   
Sbjct: 110 ELPMRSALNIAHVGMYLVSVRIGTPALPYNLVLDTATDLTWINCRLRRRKGKHYGRQSTG 169

Query: 174 ----------------VFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPD-CNY 216
                            + P+ S S++++ C+   C  L + T     C S S  + C+Y
Sbjct: 170 QTMSMGGEGAKEASKNWYRPAKSSSWRRIRCSQKECAVLPYNT-----CQSPSKAESCSY 224

Query: 217 FVSYGDGSYTRGELGREHLGL----GK-ASVNDFIFGCG-RNNKGLFGGVSGLMGLGRSD 270
           F    DG+ T G  G+E   +    G+ A +   I GC      G      G++ LG  D
Sbjct: 225 FQKTQDGTVTIGIYGKEKATVTVSDGRMAKLPGLILGCSVLEAGGSVDAHDGVLSLGNGD 284

Query: 271 LSLVSQTSEIFGGLFSYCLPSTQDA-GASGSLILGGNSSVFKNSTPITYTNMIPNPQLAT 329
           +S     ++ FG  FS+CL S   +  AS  L  G N +V    T    T+++ N  +  
Sbjct: 285 MSFAVHAAKRFGQRFSFCLLSANSSRDASSYLTFGPNPAVMGPGT--METDILYNVDVKP 342

Query: 330 FYILNLTGISIGGKQL-------QASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQF 382
            Y   +TG+ +GG++L        A  F  GG+++D+ T +T L P  Y+ + A   +  
Sbjct: 343 AYGAQVTGVLVGGERLDIPDEVWDAERFVGGGVILDTSTSVTSLVPEAYAPVTAALDRHL 402

Query: 383 SGFPSAPGFSILDTCFN-------LSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSD 435
           S  P        + C+        +     V IP   +E  G A +  +   +V   + +
Sbjct: 403 SHLPRVYELEGFEYCYKWTFTGDGVDPAHNVTIPSFTVEMAGGARLEPEAKSVV-MPEVE 461

Query: 436 ASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSS 483
               CLA   L      GI+GN   +      D  + ++ F  + C++
Sbjct: 462 PGVACLAFRKL-LRGGPGILGNVFMQEYIWEIDHGDGKIRFRKDKCNT 508


>gi|21717171|gb|AAM76364.1|AC074196_22 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433290|gb|AAP54828.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|125532789|gb|EAY79354.1| hypothetical protein OsI_34483 [Oryza sativa Indica Group]
          Length = 382

 Score =  110 bits (276), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 108/373 (28%), Positives = 167/373 (44%), Gaps = 48/373 (12%)

Query: 138 TIELGGRNMTVIVDTGSDLTWVQCQPCKSC--YNQQDPVFDPSISPSYKKVLCNSSTCHA 195
           TI    +  +  +D G  L W QC  C S   +NQ+ P FDP+ S +Y+   C ++ C  
Sbjct: 29  TIGTPPQPASAFIDVGGLLVWTQCSQCSSSSCFNQELPPFDPTKSSTYRPEPCGTALC-- 86

Query: 196 LEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGC--GRNN 253
            EF   +   CS      C Y  S     +T G++G + + +G A+     FGC    + 
Sbjct: 87  -EFFPASIRNCSGDV---CAYEASTQLFEHTSGKIGTDAVAIGTATAASVAFGCVMASDI 142

Query: 254 KGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKN- 312
           K + GG SG +GL R+ LSLV+Q +      FS+CL +  D G       G NS +F   
Sbjct: 143 KLMDGGPSGFVGLARTPLSLVAQMNVT---AFSHCL-APHDGGG------GKNSRLFLGA 192

Query: 313 -------------STPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGGILIDS 359
                        +TP   ++  P+   + +Y++NL GI  G + +     +   +L+ +
Sbjct: 193 AAKLAGGGKSAAMTTPFVKSS--PDDIKSLYYLINLEGIKAGDEAIITVPQSGRTVLLQT 250

Query: 360 GTVITRLPPSIYSALKAEFLKQFSGFPSAPGF---SILDTCFNLSAYQEVNIPLVKMEFE 416
            + ++ L   +Y  LK        G  + P     SI D CF          P V + F+
Sbjct: 251 FSPVSFLVDGVYQDLKKAVTAAVGGPTATPPEQFQSIFDLCFKRGGVS--GAPDVVLTFQ 308

Query: 417 GNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDET-----GIIGNYQQKNQRVIYDTKN 471
           G A +TV  T  +  V  D   VC+A+AS +  + T      I+G  QQ+N   +YD + 
Sbjct: 309 GAAALTVPPTNYLLDVGDD--TVCVAIASSARLNSTEVAGMSILGGLQQQNVHFLYDLEK 366

Query: 472 SQLGFAGEDCSSM 484
             L F   DCSS+
Sbjct: 367 ETLSFEAADCSSL 379


>gi|218186446|gb|EEC68873.1| hypothetical protein OsI_37494 [Oryza sativa Indica Group]
          Length = 353

 Score =  110 bits (276), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 94/350 (26%), Positives = 158/350 (45%), Gaps = 33/350 (9%)

Query: 148 VIVDTGSDLTWVQCQPCK-SCYNQQDP---VFDPSISPSYKKVLCNSSTCHALEFATGNS 203
           V +DTGS L+WVQC+ C+  CY+Q      +F+P  S +Y KV C++  C+ +       
Sbjct: 21  VTIDTGSTLSWVQCKNCQIKCYDQAAKAGQIFNPYNSSTYSKVGCSTEACNGMHMDLAVE 80

Query: 204 GVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG-KASVNDFIFGCGRNNKGLFGGV-S 261
             C       C Y + YG G Y+ G LG++ L L    S+++FIFGCG +N  L+ GV +
Sbjct: 81  YGCVEEDDT-CIYSLRYGSGEYSVGYLGKDRLTLASNRSIDNFIFGCGEDN--LYNGVNA 137

Query: 262 GLMGLGRSDLSLVSQT-SEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTN 320
           G++G G    S  +Q   +     FSYC P  +D    GSL +G     +     + +T 
Sbjct: 138 GIIGFGTKSYSFFNQVCQQTDYTAFSYCFP--RDHENEGSLTIGP----YARDINLMWTK 191

Query: 321 MIPNPQLATFYILNLTGISIGGKQLQASG--FAKGGILIDSGTVITRLPPSIYSALKAEF 378
           +I       + I  L  + + G +L+     +     ++DSGT  T +   ++ AL    
Sbjct: 192 LIYYDHKPAYAIQQLD-MMVNGIRLEIDPYIYISKMTIVDSGTADTYILSPVFDALDKAM 250

Query: 379 LKQFSGFPSAPGFSILDTCF--NLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDA 436
            K+        G+     CF  N  +    + P V+M+       T+ +     F +S  
Sbjct: 251 TKEMQAKGYTRGWDERRICFISNSGSANWNDFPTVEMKL---IRSTLKLPVENAFYESSN 307

Query: 437 SQVCLALASLSYEDETGI-----IGNYQQKNQRVIYDTKNSQLGFAGEDC 481
           + +C    S    D+ G+     +GN   ++ ++++D +    GF    C
Sbjct: 308 NVIC----STFLPDDAGVRGVQMLGNRAVRSFKLVFDIQAMNFGFKARAC 353


>gi|218187618|gb|EEC70045.1| hypothetical protein OsI_00635 [Oryza sativa Indica Group]
          Length = 570

 Score =  110 bits (275), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 112/375 (29%), Positives = 169/375 (45%), Gaps = 60/375 (16%)

Query: 124 LTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCK---SCYNQQDPVFDPS 178
           + S +  ++  Y+ T+ LG   R+M  I DTGSDL WV+C+      S        FDPS
Sbjct: 90  VVSKVVSRSFEYLMTVNLGSPPRSMLAIADTGSDLVWVKCKKGNNDTSSAAAPTTQFDPS 149

Query: 179 ISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHL--- 235
            S +Y +V C +  C AL  AT + G        +C Y  +YGDGS T G L  E     
Sbjct: 150 RSSTYGRVSCQTDACEALGRATCDDGS-------NCAYLYAYGDGSNTTGVLSTETFTFD 202

Query: 236 --GLGKA----SVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQT--SEIFGGLFSY 287
             G G++     +    FGC     G F    GL+GLG   +SLV+Q   +   G  FSY
Sbjct: 203 DGGAGRSPRQVRIGGVKFGCSTATAGSF-PADGLVGLGGGAVSLVTQLGGATSLGRRFSY 261

Query: 288 CLPSTQDAGASGSLILGGNSSVFK---NSTPITYTNMIPNPQLATFYILNLTGISIGGKQ 344
           CL       AS +L  G  + V +    STP+                       +G K 
Sbjct: 262 CL-VPHSVNASSALNFGALADVTEPGAASTPL-----------------------VGNKT 297

Query: 345 LQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQ 404
           + ++  ++  I++DSGT +T L PS+   +  E  ++ +  P      +L  C+N+ A +
Sbjct: 298 VASAASSR--IIVDSGTTLTFLDPSLLGPIVDELSRRITLPPVQSPDGLLQLCYNV-AGR 354

Query: 405 EV----NIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQ 460
           EV    +IP + +EF G A + +       FV      +CLA+ + + +    I+GN  Q
Sbjct: 355 EVEAGESIPDLTLEFGGGAAVALKPENA--FVAVQEGTLCLAIVATTEQQPVSILGNLAQ 412

Query: 461 KNQRVIYDTKNSQLG 475
           +N  V YD     +G
Sbjct: 413 QNIHVGYDLDAGTVG 427



 Score = 61.2 bits (147), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 43/153 (28%), Positives = 77/153 (50%), Gaps = 9/153 (5%)

Query: 334 NLTGISIGGKQLQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI 393
           +L   ++G K + ++  ++  I++DSGT +T L PS+   +  E  ++ +  P      +
Sbjct: 420 DLDAGTVGNKTVASAASSR--IIVDSGTTLTFLDPSLLGPIVDELSRRITLPPVQSPDGL 477

Query: 394 LDTCFNLSAYQEV----NIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYE 449
           L  C+N+ A +EV    +IP + +EF G A + +       FV      +CLA+ + + +
Sbjct: 478 LQLCYNV-AGREVEAGESIPDLTLEFGGGAAVALKPENA--FVAVQEGTLCLAIVATTEQ 534

Query: 450 DETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
               I+GN  Q+N  V YD     + FA  DC+
Sbjct: 535 QPVSILGNLAQQNIHVGYDLDAGTVTFAVADCA 567


>gi|222629275|gb|EEE61407.1| hypothetical protein OsJ_15596 [Oryza sativa Japonica Group]
          Length = 466

 Score =  110 bits (275), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 100/384 (26%), Positives = 163/384 (42%), Gaps = 70/384 (18%)

Query: 145 NMTVIVDTGSDLTWVQCQP--CKSCYNQQDP------VFDPSISPSYKKVLCNSSTCHAL 196
           ++++ +DTGSDL W  C P  C  C  +  P         P I    +++ C S  C A 
Sbjct: 102 SVSLFLDTGSDLVWFPCAPFTCMLCEGKATPGGNHSSPLPPPID--SRRISCASPLCSAA 159

Query: 197 EFATGNSGVCSSSSPP-------DCN------YFVSYGDGSYTRGELGREHLGLGKA-SV 242
             +   S +C+++  P        C        + +YGDGS     L R  +GL  + +V
Sbjct: 160 HSSAPTSDLCAAARCPLDAIETDSCASHACPPLYYAYGDGSLV-ANLRRGRVGLAASMAV 218

Query: 243 NDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLI 302
            +F F C            G+ G GR  LSL +Q +    G        + DA A G+  
Sbjct: 219 ENFTFACAHT---ALAEPVGVAGFGRGPLSLPAQLAPSLSG--------STDAAAIGA-- 265

Query: 303 LGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGF-------AKGGI 355
                    + T   YT ++ NP+   FY + L  +S+GGK++QA            GG+
Sbjct: 266 ---------SETDFVYTPLLHNPKHPYFYSVALEAVSVGGKRIQAQPELGDVDRDGNGGM 316

Query: 356 LIDSGTVITRLPPSIYSALKAEFLKQFSGFPS-----APGFSILDTCFNLSAYQEVNIPL 410
           ++DSGT  T LP   ++ +  EF +  +         A   + L  C++ S      +P 
Sbjct: 317 VVDSGTTFTMLPSDTFARVADEFARAMAAARFTRAEGAEAQTGLAPCYHYSPSDRA-VPP 375

Query: 411 VKMEFEGNAEMTVDVTGIVYFVKSDASQV--CLALASLSYEDE--------TGIIGNYQQ 460
           V + F GNA + +         KS+  +   CL L ++   ++         G +GN+QQ
Sbjct: 376 VALHFRGNATVALPRRNYFMGFKSEEGRSVGCLMLMNVGGNNDDGEDGGGPAGTLGNFQQ 435

Query: 461 KNQRVIYDTKNSQLGFAGEDCSSM 484
           +   V+YD    ++GFA   C+ +
Sbjct: 436 QGFEVVYDVDAGRVGFARRRCTDL 459


>gi|356532674|ref|XP_003534896.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 446

 Score =  110 bits (275), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 130/464 (28%), Positives = 205/464 (44%), Gaps = 76/464 (16%)

Query: 50  SSSSCVSHQKSRIEMGAITLELKHKNYCSGKIVDWNEQQQNRLILDNLH----VQYLQSR 105
           SS+S VS  K R     +  +L H           NE  ++R+ LD  H    + Y+Q+R
Sbjct: 22  SSTSTVSSAKPR----RLVSKLIHPGSVHHPHYKPNETAKDRMELDIEHSAARLAYIQAR 77

Query: 106 IKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELGGRNMT--VIVDTGSDLTWVQCQP 163
           I+  +  N  D + +  P  +G  +     +  + +G  ++   V++DTGSD+ W+ C P
Sbjct: 78  IEGSLVYN-NDYTASVSPSLTGRTI-----LVNLSIGQPSIPQLVVMDTGSDILWIMCNP 131

Query: 164 CKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDG 223
           C +C N    +FDPS+S ++   LC +          G  G C     P   + +SY D 
Sbjct: 132 CTNCDNHLGLLFDPSMSSTFSP-LCKT--------PCGFKG-CKCDPIP---FTISYVDN 178

Query: 224 SYTRGELGREHLGL-----GKASVNDFIFGCGRN---NKGLFGGVSGLMGLGRSDLSLVS 275
           S   G  GR+ L       G + ++D I GCG N   N     G +G++GL     SL +
Sbjct: 179 SSASGTFGRDILVFETTDEGTSQISDVIIGCGHNIGFNSD--PGYNGILGLNNGPNSLAT 236

Query: 276 QTSEIFGGLFSYCLPSTQDAGAS-GSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILN 334
           Q     G  FSYC+ +  D   +   L LG  + +   STP    +         FY + 
Sbjct: 237 Q----IGRKFSYCIGNLADPYYNYNQLRLGEGADLEGYSTPFEVYH--------GFYYVT 284

Query: 335 LTGISIGGKQL-------QASGFAKGGILIDSGTVITRLPPSIYSALKAEF--LKQFSG- 384
           + GIS+G K+L       +      GG+++DSGT IT L  S +  L  E   L ++S  
Sbjct: 285 MEGISVGEKRLDIALETFEMKRNGTGGVILDSGTTITYLVDSAHKLLYNEVRNLLKWSFR 344

Query: 385 ---FPSAPGFSILDTC-FNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVC 440
              F +AP       C + + +   V  P+V   F   A++ +D TG  +  + D    C
Sbjct: 345 QVIFENAP----WKLCYYGIISRDLVGFPVVTFHFVDGADLALD-TGSFFSQRDDI--FC 397

Query: 441 LALASLSYEDET---GIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
           + ++  S  + T    +IG   Q++  V YD  N  + F   DC
Sbjct: 398 MTVSPASILNTTISPSVIGLLAQQSYNVGYDLVNQFVYFQRIDC 441


>gi|168042409|ref|XP_001773681.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162675069|gb|EDQ61569.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 399

 Score =  110 bits (275), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 109/393 (27%), Positives = 174/393 (44%), Gaps = 61/393 (15%)

Query: 123 PLTSGIRLQTLNYIATIELGGRNMTVIVDTGSDLTWVQCQPCKSCYNQ-QDP-----VFD 176
           P  +G+    + Y+ T  +G     V VDTGSD+TW+ C PC SC  + Q P      +D
Sbjct: 31  PFVTGLYYTKI-YLGTPPVG---YYVQVDTGSDVTWLNCAPCTSCVTETQLPSIKLTTYD 86

Query: 177 PSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLG 236
           PS S +   + C  S C A   A G++ V S +S   C Y  +YGDGS T+G   ++ + 
Sbjct: 87  PSRSSTDGALSCRDSNCGA---ALGSNEV-SCTSAGYCAYSTTYGDGSSTQGYFIQDVMT 142

Query: 237 L----------GKASVNDFIFGCGRNNKGLF----GGVSGLMGLGRSDLSLVSQTSEI-- 280
                      G ASV    FGCG    G        + GL+G G++ +S+ SQ + +  
Sbjct: 143 FQEIHNNTQVNGTASV---YFGCGTTQSGNLLMSSRALDGLIGFGQAAVSIPSQLASMGK 199

Query: 281 FGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISI 340
            G  F++CL      G  G++++G  S    + TPI   N          Y + +  I++
Sbjct: 200 VGNRFAHCLQGDNQGG--GTIVIGSVSEPNISYTPIVSRN---------HYAVGMQNIAV 248

Query: 341 GGKQL------QASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSIL 394
            G+ +        +  + GG+++DSGT +  L    Y+    +F+   S F S+  FS  
Sbjct: 249 NGRNVTTPASFDTTSTSAGGVIMDSGTTLAYLVDPAYT----QFVNAVSTFESSM-FSSH 303

Query: 395 DTCFNLSAYQ-EVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETG 453
             C  L+    + + P VK+ F+  A M +     +Y       Q    +       + G
Sbjct: 304 SQCLQLAWCSLQADFPTVKLFFDAGAVMNLTPRNYLYSQPLQNGQAAYCMGWQKSTTKAG 363

Query: 454 -----IIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
                I+G+   K+  V+YD  N  +G+   DC
Sbjct: 364 YLSYSILGDIVLKDHLVVYDNDNRVVGWKSFDC 396


>gi|77553049|gb|ABA95845.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
          Length = 372

 Score =  110 bits (275), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 94/350 (26%), Positives = 158/350 (45%), Gaps = 33/350 (9%)

Query: 148 VIVDTGSDLTWVQCQPCK-SCYNQQDP---VFDPSISPSYKKVLCNSSTCHALEFATGNS 203
           V +DTGS L+WVQC+ C+  CY+Q      +F+P  S +Y KV C++  C+ +       
Sbjct: 40  VTIDTGSTLSWVQCKNCQIKCYDQAAKAGQIFNPYNSSTYSKVGCSTEACNGMHMDLAVE 99

Query: 204 GVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG-KASVNDFIFGCGRNNKGLFGGV-S 261
             C       C Y + YG G Y+ G LG++ L L    S+++FIFGCG +N  L+ GV +
Sbjct: 100 YGCVEEDDT-CIYSLRYGSGEYSVGYLGKDRLTLASNRSIDNFIFGCGEDN--LYNGVNA 156

Query: 262 GLMGLGRSDLSLVSQT-SEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTN 320
           G++G G    S  +Q   +     FSYC P  +D    GSL +G     +     + +T 
Sbjct: 157 GIIGFGTKSYSFFNQVCQQTDYTAFSYCFP--RDHENEGSLTIGP----YARDINLMWTK 210

Query: 321 MIPNPQLATFYILNLTGISIGGKQLQASG--FAKGGILIDSGTVITRLPPSIYSALKAEF 378
           +I       + I  L  + + G +L+     +     ++DSGT  T +   ++ AL    
Sbjct: 211 LIYYDHKPAYAIQQLD-MMVNGIRLEIDPYIYISKMTIVDSGTADTYILSPVFDALDKAM 269

Query: 379 LKQFSGFPSAPGFSILDTCF--NLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDA 436
            K+        G+     CF  N  +    + P V+M+       T+ +     F +S  
Sbjct: 270 TKEMQAKGYTRGWDERRICFISNSGSANWNDFPTVEMKL---IRSTLKLPVENAFYESSN 326

Query: 437 SQVCLALASLSYEDETGI-----IGNYQQKNQRVIYDTKNSQLGFAGEDC 481
           + +C    S    D+ G+     +GN   ++ ++++D +    GF    C
Sbjct: 327 NVIC----STFLPDDAGVRGVQMLGNRAVRSFKLVFDIQAMNFGFKARAC 372


>gi|222616654|gb|EEE52786.1| hypothetical protein OsJ_35257 [Oryza sativa Japonica Group]
          Length = 346

 Score =  110 bits (275), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 94/350 (26%), Positives = 158/350 (45%), Gaps = 33/350 (9%)

Query: 148 VIVDTGSDLTWVQCQPCK-SCYNQQDP---VFDPSISPSYKKVLCNSSTCHALEFATGNS 203
           V +DTGS L+WVQC+ C+  CY+Q      +F+P  S +Y KV C++  C+ +       
Sbjct: 14  VTIDTGSTLSWVQCKNCQIKCYDQAAKAGQIFNPYNSSTYSKVGCSTEACNGMHMDLAVE 73

Query: 204 GVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLG-KASVNDFIFGCGRNNKGLFGGV-S 261
             C       C Y + YG G Y+ G LG++ L L    S+++FIFGCG +N  L+ GV +
Sbjct: 74  YGCVEEDDT-CIYSLRYGSGEYSVGYLGKDRLTLASNRSIDNFIFGCGEDN--LYNGVNA 130

Query: 262 GLMGLGRSDLSLVSQT-SEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTN 320
           G++G G    S  +Q   +     FSYC P  +D    GSL +G     +     + +T 
Sbjct: 131 GIIGFGTKSYSFFNQVCQQTDYTAFSYCFP--RDHENEGSLTIGP----YARDINLMWTK 184

Query: 321 MIPNPQLATFYILNLTGISIGGKQLQASG--FAKGGILIDSGTVITRLPPSIYSALKAEF 378
           +I       + I  L  + + G +L+     +     ++DSGT  T +   ++ AL    
Sbjct: 185 LIYYDHKPAYAIQQLD-MMVNGIRLEIDPYIYISKMTIVDSGTADTYILSPVFDALDKAM 243

Query: 379 LKQFSGFPSAPGFSILDTCF--NLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDA 436
            K+        G+     CF  N  +    + P V+M+       T+ +     F +S  
Sbjct: 244 TKEMQAKGYTRGWDERRICFISNSGSANWNDFPTVEMKL---IRSTLKLPVENAFYESSN 300

Query: 437 SQVCLALASLSYEDETGI-----IGNYQQKNQRVIYDTKNSQLGFAGEDC 481
           + +C    S    D+ G+     +GN   ++ ++++D +    GF    C
Sbjct: 301 NVIC----STFLPDDAGVRGVQMLGNRAVRSFKLVFDIQAMNFGFKARAC 346


>gi|77555282|gb|ABA98078.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
          Length = 409

 Score =  110 bits (275), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 83/273 (30%), Positives = 133/273 (48%), Gaps = 31/273 (11%)

Query: 221 GDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEI 280
           G  + T G L  +    G  +V   +FGC   + G F G SG++G+GR +LSL+SQ    
Sbjct: 124 GSAANTSGYLATDTFTFGATAVPGVVFGCSDASYGDFAGASGVIGIGRGNLSLISQLQF- 182

Query: 281 FGGLFSYCL--PSTQDAGASGSLILGGNSSVFK----NSTPITYTNMIPNPQLATFYILN 334
             G FSY L  P   D G++ S+I  G+ +V K     STP+  + + P+     FY +N
Sbjct: 183 --GKFSYQLLAPEATDDGSADSVIRFGDDAVPKTKRGRSTPLLSSTLYPD-----FYYVN 235

Query: 335 LTGISIGGKQLQA--------SGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFP 386
           LTG+ + G +L A             GG+++ S T +T L  + Y  ++A    +  G P
Sbjct: 236 LTGVRVDGNRLDAIPAGTFDLRANGTGGVILSSTTPVTYLEQAAYDVVRAAVASRI-GLP 294

Query: 387 SAPGFSI--LDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYF-VKSDASQVCLAL 443
           +  G +   LD C+N S+  +V +P + + F+G A+M  D++   YF + +D    CL +
Sbjct: 295 AVNGSAALELDLCYNASSMAKVKVPKLTLVFDGGADM--DLSAANYFYIDNDTGLECLTM 352

Query: 444 ASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGF 476
                     ++G   Q    +IYD    +L F
Sbjct: 353 LP---SQGGSVLGTLLQTGTNMIYDVDAGRLTF 382


>gi|218190722|gb|EEC73149.1| hypothetical protein OsI_07179 [Oryza sativa Indica Group]
          Length = 494

 Score =  110 bits (274), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 114/396 (28%), Positives = 176/396 (44%), Gaps = 54/396 (13%)

Query: 121 EIPLT-SGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQD----- 172
           ++PL  SG+  +T  Y   I +G   +   V VDTGSD+ WV C  C  C  + +     
Sbjct: 75  DLPLGGSGLATETGLYFTRIGIGTPAKRYYVQVDTGSDILWVNCVSCDGCPRKSNLGIEL 134

Query: 173 PVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGR 232
            ++DP  S S + V C+   C A       S  C+S+SP  C Y +SYGDGS T G    
Sbjct: 135 TMYDPRGSQSGELVTCDQQFCVANYGGVLPS--CTSTSP--CEYSISYGDGSSTAGFFVT 190

Query: 233 EHLGLGKASVN--------DFIFGCGRNNKGLFG----GVSGLMGLGRSDLSLVSQTSEI 280
           + L   + S +           FGCG    G  G     + G++G G+S+ S++SQ +  
Sbjct: 191 DFLQYNQVSGDGQTTPANASVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAA 250

Query: 281 --FGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGI 338
                +F++CL    D    G +   GN    K  T    ++M         Y + L GI
Sbjct: 251 GKVRKMFAHCL----DTVNGGGIFAIGNVVQPKVKTTPLVSDM-------PHYNVILKGI 299

Query: 339 SIGGKQLQ------ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFS 392
            +GG  L        SG +KG I IDSGT +  +P  +Y AL A    +          +
Sbjct: 300 DVGGTALGLPTNIFDSGNSKGTI-IDSGTTLAYVPEGVYKALFAMVFDKHQDISVQ---T 355

Query: 393 ILD-TCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDE 451
           + D +CF  S   +   P V   FEG+  + V      Y  ++  +  C+   +   + +
Sbjct: 356 LQDFSCFQYSGSVDDGFPEVTFHFEGDVSLIVSPHD--YLFQNGKNLYCMGFQNGGVQTK 413

Query: 452 TG----IIGNYQQKNQRVIYDTKNSQLGFAGEDCSS 483
            G    ++G+    N+ V+YD +N  +G+A  +CSS
Sbjct: 414 DGKDMVLLGDLVLSNKLVLYDLENQAIGWADYNCSS 449


>gi|255567949|ref|XP_002524952.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223535787|gb|EEF37449.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 394

 Score =  110 bits (274), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 94/302 (31%), Positives = 137/302 (45%), Gaps = 49/302 (16%)

Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCN-SSTCHALEFATGN 202
           +   +IVDTGS +T+V C  C+ C   QDP F+P +S +Y+ V CN   TC         
Sbjct: 101 QTFALIVDTGSTVTYVPCSTCEQCGRHQDPKFEPELSSTYQPVSCNIDCTC--------- 151

Query: 203 SGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS---VNDFIFGCGRNNKG-LFG 258
                 +    C Y   Y + S + G LG + +  G  S       IFGC     G L+ 
Sbjct: 152 -----DNERKQCVYERQYAEMSSSSGVLGEDIISFGNQSELVPQRAIFGCENQETGDLYS 206

Query: 259 -GVSGLMGLGRSDLSLVSQTSE--IFGGLFSYCLPSTQDAGASGSLILGGNSS----VFK 311
               G+MGLGR DLS+V Q  E  +    FS C       G  G++ILGG S     VF 
Sbjct: 207 QRADGIMGLGRGDLSIVDQLVEKGVISDSFSLCYGGMDIGG--GAMILGGISPPSGMVFA 264

Query: 312 NSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ--ASGF-AKGGILIDSGTVITRLPP 368
            S P+           + +Y ++L  I + GKQL    S F  K G ++DSGT    LP 
Sbjct: 265 ESDPVR----------SQYYNIDLKAIHVAGKQLHLDPSIFDGKHGTVLDSGTTYAYLPE 314

Query: 369 SIYSALKAEFLKQFSGFPS--APGFSILDTCFNLSAYQEVN-----IPLVKMEFEGNAEM 421
           + ++A K   +K+ +       P  +  D CF+  A  +V+      P V+M F    ++
Sbjct: 315 AAFTAFKDAMMKELTSLKQIHGPDPNYNDICFS-GAESDVSQLSNTFPAVEMVFSNGQKL 373

Query: 422 TV 423
           ++
Sbjct: 374 SL 375


>gi|226531872|ref|NP_001147022.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
 gi|195606574|gb|ACG25117.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
          Length = 491

 Score =  110 bits (274), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 123/417 (29%), Positives = 181/417 (43%), Gaps = 62/417 (14%)

Query: 119 NTEIPLTSGIRLQTLN-YIATIELG--GRNMTVIVDTGSDLTWV------QCQPCKSCYN 169
           +  +P T+ +   +   Y  T  LG   + + V++DTGS LTWV      +C+ C S   
Sbjct: 82  HPSVPATAALYPHSYGGYAFTASLGTPPQPLPVLLDTGSHLTWVPCTSSYECRNCSSPSA 141

Query: 170 QQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSS--SPPDCN-----------Y 216
              PVF P  S S + V C + +C  +  A   +  C  +  SP   N           Y
Sbjct: 142 SAVPVFHPKNSSSSRLVGCRNPSCQWVHSAANLATKCRRAPCSPGAANCPAAASNVCPPY 201

Query: 217 FVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQ 276
            V YG GS T G L  + L     +V  F+ GC   +  +    SGL G GR   S+ +Q
Sbjct: 202 AVVYGSGS-TAGLLIADTLRAPGRAVPGFVLGCSLVS--VHQPPSGLAGFGRGAPSVPAQ 258

Query: 277 TSEIFGGL--FSYCLPSTQ---DAGASGSLILGGNSSVFK-NSTPITYTNMIPNPQLATF 330
                 GL  FSYCL S +   +A  SGSL+LGG          P+  +          +
Sbjct: 259 L-----GLPKFSYCLLSRRFDDNAAVSGSLVLGGTGGGEGMQYVPLVKSAAGDKLPYGVY 313

Query: 331 YILNLTGISIGGK--QLQASGFA-----KGGILIDSGTVITRLPPSIYSALKAEFLKQFS 383
           Y L L G+++GGK  +L A  FA      GG ++DSGT  T L P+++  +    +    
Sbjct: 314 YYLALRGVTVGGKAVRLPARAFAGNAAGSGGTIVDSGTTFTYLDPTVFQPVADAVVAAVG 373

Query: 384 GF----PSAPGFSILDTCFNLS-AYQEVNIPLVKMEFEGNAEMTVDVTGIVYFV---KSD 435
           G       A     L  CF L    + + +P +   FEG A M + V    YFV   +  
Sbjct: 374 GRYKRSKDAEDGLGLHPCFALPQGARSMALPELSFHFEGGAVMQLPVEN--YFVVAGRGA 431

Query: 436 ASQVCLALAS---------LSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSS 483
              +CLA+ +                 I+G++QQ+N  V YD +  +LGF  + C+S
Sbjct: 432 VEAICLAVVTDFGGGSGAGNEGSGPAIILGSFQQQNYLVEYDLEKERLGFRRQSCTS 488


>gi|414586111|tpg|DAA36682.1| TPA: pepsin A [Zea mays]
          Length = 503

 Score =  110 bits (274), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 108/400 (27%), Positives = 165/400 (41%), Gaps = 69/400 (17%)

Query: 146 MTVIVDTGSDLTWVQCQP--CKSCY----------------NQQDPVFDPSISPSYKKV- 186
           +++ +DTGSDL W  C P  C  C                 +++ P   P  S ++    
Sbjct: 105 VSLFLDTGSDLVWFPCAPFTCMLCEGKPTPGRLGPLPPPPDSRRIPCASPLCSAAHASAP 164

Query: 187 ---LCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRG-ELGREHLGLGK--- 239
              LC  + C   +  TG+ G  S + PP    + +YGDGS       GR  LG G    
Sbjct: 165 PSDLCAVARCPLEDIETGSCGA-SHACPP---LYYAYGDGSLVAHLRRGRVALGAGARAS 220

Query: 240 --ASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAG- 296
              +V++F F C        G   G+ G GR  LSL  Q S    G FSYCL S      
Sbjct: 221 VAVAVDNFTFACAHTA---LGEPVGVAGFGRGPLSLPGQLSPQLSGRFSYCLVSHSFRAD 277

Query: 297 ---ASGSLILGGNSSVFK---NSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASG- 349
                  LILG +         +    YT ++ NP+   FY + L  +S+G  ++QA   
Sbjct: 278 RLIRPSPLILGRSPDDAAAAAETDGFVYTPLLHNPKHPYFYSVALEAVSVGAARIQARPE 337

Query: 350 ------FAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGF-----PSAPGFSILDTCF 398
                    GG+++DSGT  T LP  +Y+ +   F +  +         A   + L  C+
Sbjct: 338 LARVDRAGNGGMVVDSGTTFTMLPNEMYARVAEAFARAMAAAGFARAERAEEQTGLTPCY 397

Query: 399 NLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV--------CLAL---ASLS 447
             +A  +  +P + + F GNA + +         KS+ +          CL L      S
Sbjct: 398 RYAA-SDRGVPPLALHFRGNATVALPRRNYFMGFKSEDAGAGTRKDDVGCLMLMNGGDAS 456

Query: 448 YED---ETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
            E+     G +GN+QQ+   V+YD    ++GFA   C+ +
Sbjct: 457 GEEGDGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRCTDL 496


>gi|225458774|ref|XP_002283258.1| PREDICTED: aspartic proteinase-like protein 2 [Vitis vinifera]
 gi|302142232|emb|CBI19435.3| unnamed protein product [Vitis vinifera]
          Length = 659

 Score =  109 bits (273), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 104/366 (28%), Positives = 157/366 (42%), Gaps = 57/366 (15%)

Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNS 203
           +   +IVDTGS +T+V C  C+ C   QDP F P  S +Y  V CN      ++    + 
Sbjct: 99  QEFALIVDTGSTVTYVPCSDCEHCGKHQDPRFQPDESSTYHPVKCN------MDCNCDHD 152

Query: 204 GVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS---VNDFIFGCGRNNKG-LFG- 258
           GV       +C Y   Y + S + G LG + +  G  S       +FGC     G L+  
Sbjct: 153 GV-------NCVYERRYAEMSSSSGVLGEDIISFGNQSEVVPQRAVFGCENVETGDLYSQ 205

Query: 259 GVSGLMGLGRSDLSLVSQTSE--IFGGLFSYCLPSTQDAGASGSLILGG----NSSVFKN 312
              G+MGLGR  LS+V Q  +  +    FS C       G  G+++LGG       VF  
Sbjct: 206 RADGIMGLGRGQLSIVDQLVDKNVINDSFSLCYGGMHVGG--GAMVLGGIPPPPDMVFSR 263

Query: 313 STPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA---KGGILIDSGTVITRLPPS 369
           S          +P  + +Y + L  I + GK L+ S      K G ++DSGT    LP  
Sbjct: 264 S----------DPYRSPYYNIELKEIHVAGKPLKLSPSTFDRKHGTVLDSGTTYAYLPEE 313

Query: 370 IYSAL------KAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVN-----IPLVKMEFEGN 418
            + A       K+  LKQ  G    P  +  D CF+  A ++V+      P V M F   
Sbjct: 314 AFVAFRDAIIKKSHNLKQIHG----PDPNYNDICFS-GAGRDVSQLSKAFPEVDMVFSNG 368

Query: 419 AEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAG 478
            ++++     ++         CL +      D T ++G    +N  V YD +N ++GF  
Sbjct: 369 QKLSLTPENYLFQHTKVHGAYCLGI--FRNGDSTTLLGGIIVRNTLVTYDRENEKIGFWK 426

Query: 479 EDCSSM 484
            +CS +
Sbjct: 427 TNCSEL 432


>gi|226530102|ref|NP_001152414.1| PCS1 precursor [Zea mays]
 gi|195656033|gb|ACG47484.1| PCS1 [Zea mays]
          Length = 452

 Score =  109 bits (273), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 108/378 (28%), Positives = 159/378 (42%), Gaps = 49/378 (12%)

Query: 144 RNMTVIVDTGSDLTWVQCQPCK--SCYNQQDP-VFDPSISPSYKKVLCNSSTCHALEFAT 200
           +N+T+++DTGS+L+W++C   +  S    Q P  F+ S S +Y    C+S  C       
Sbjct: 71  QNVTMVLDTGSELSWLRCNGSRVPSTPPPQAPAAFNGSASSTYAAAHCSSPECQWRGRDL 130

Query: 201 GNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGC-------GRNN 253
                C+      C   +SY D S   G L  +   LG A     +FGC          N
Sbjct: 131 PVPPFCAGPPSXSCRVSLSYADASSADGILAADTFLLGGAPPVXALFGCVTSYSSATATN 190

Query: 254 KGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNS 313
                  +GL+G+ R  LS V+QT+ +    F+YC+         G L+LGG+ +     
Sbjct: 191 SSDSEAATGLLGMNRGSLSFVTQTATL---RFAYCI---APGDGPGLLVLGGDGAALAPQ 244

Query: 314 TPITYTNMI----PNPQL-ATFYILNLTGISIGGK-------QLQASGFAKGGILIDSGT 361
             + YT +I    P P      Y + L GI +G          L       G  ++DSGT
Sbjct: 245 --LNYTPLIQISRPLPYFDRVAYSVQLEGIRVGAALLPIPKSVLAPDHTGAGQTMVDSGT 302

Query: 362 VITRLPPSIYSALKAEFLKQFSGFPSAPGFS------ILDTCFNLS----AYQEVNIPLV 411
             T L    Y+ LK EFL Q S   +  G S        D CF  S    A     +P V
Sbjct: 303 QFTFLLADAYAPLKGEFLNQTSALLAPLGESDFVFQGAFDACFRASEARVAAASXMLPEV 362

Query: 412 KMEFEGNAEMTVDVTGIVYFVKSD------ASQV-CLALASLSYEDETG-IIGNYQQKNQ 463
            +   G AE+ V    ++Y V  +      A  V CL   +      +  +IG++ Q+N 
Sbjct: 363 GLVLRG-AEVAVGGEKLLYRVPGERRGEGGAEAVWCLTFGNSDMAGMSAYVIGHHHQQNV 421

Query: 464 RVIYDTKNSQLGFAGEDC 481
            V YD +N ++GFA   C
Sbjct: 422 WVEYDLQNGRVGFAPARC 439


>gi|38344991|emb|CAE01597.2| OSJNBa0008A08.5 [Oryza sativa Japonica Group]
 gi|116309515|emb|CAH66581.1| OSIGBa0137O04.7 [Oryza sativa Indica Group]
 gi|222628622|gb|EEE60754.1| hypothetical protein OsJ_14310 [Oryza sativa Japonica Group]
          Length = 494

 Score =  109 bits (273), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 115/401 (28%), Positives = 178/401 (44%), Gaps = 56/401 (13%)

Query: 117 VSNTEIPLTSGIRLQTLNYIATIELG----GRNMTVIVDTGSDLTWVQCQPCKSCYNQ-- 170
           ++  +IPL  G+ L T   +   E+G     +   V VDTGSD+ WV C  C  C  +  
Sbjct: 70  LAAADIPL-GGLGLPTDTGLYYTEIGIGTPTKRYYVQVDTGSDILWVNCISCDRCPRKSG 128

Query: 171 ---QDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTR 227
              +  ++DP  S +  KV C+   C A     G    C++S P  C Y V+YGDGS T 
Sbjct: 129 LGLELTLYDPKDSSTGSKVSCDQGFCAAT--YGGLLPGCTTSLP--CEYSVTYGDGSSTT 184

Query: 228 GELGREHL--------GLGKASVNDFIFGCGRNNKGLFG----GVSGLMGLGRSDLSLVS 275
           G    + L        G  + + +   FGCG    G  G     + G++G G+S+ S++S
Sbjct: 185 GYFVSDLLQFDQVSGDGQTRPANSTVTFGCGSQQGGDLGSSNQALDGIIGFGQSNTSMLS 244

Query: 276 QTSEI--FGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYIL 333
           Q S       +F++CL    D    G +   GN    K  T    T ++PN      Y +
Sbjct: 245 QLSAAGKVKKIFAHCL----DTINGGGIFAIGNVVQPKVKT----TPLVPN---MPHYNV 293

Query: 334 NLTGISIGGKQLQASGF-----AKGGILIDSGTVITRLPPSIYSALK-AEFLKQFS-GFP 386
           NL  I +GG  L+          K G +IDSGT +T LP  +Y  +  A F K     F 
Sbjct: 294 NLKSIDVGGTALKLPSHMFDTGEKKGTIIDSGTTLTYLPEIVYKEIMLAVFAKHKDITFH 353

Query: 387 SAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLAL--A 444
           +   F     CF      + + P +   FE   ++ ++V    YF ++  +  C+     
Sbjct: 354 NVQEF----LCFQYVGRVDDDFPKITFHFEN--DLPLNVYPHDYFFENGDNLYCVGFQNG 407

Query: 445 SLSYEDETGII--GNYQQKNQRVIYDTKNSQLGFAGEDCSS 483
            L  +D  G++  G+    N+ V+YD +N  +G+   +CSS
Sbjct: 408 GLQSKDGKGMVLLGDLVLSNKLVVYDLENQVIGWTEYNCSS 448


>gi|357137832|ref|XP_003570503.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
          Length = 564

 Score =  109 bits (273), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 98/375 (26%), Positives = 164/375 (43%), Gaps = 41/375 (10%)

Query: 134 NYIATIELGG--RNMTVIVDTGSDLTWVQCQ-PCKSCYNQQDPVFDPSISPSYKKVLCNS 190
            Y  +I +G   R   + VDTGSDLTW+QC  PC +C     P++ P+     K V    
Sbjct: 193 QYYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLYKPAKE---KIVPPRD 249

Query: 191 STCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL----GKASVNDFI 246
             C  L+   G+   C++     C+Y + Y D S + G L ++ + +    G     DF+
Sbjct: 250 LLCQELQ---GDQNYCATCK--QCDYEIEYADRSSSMGVLAKDDMHMIATNGGREKLDFV 304

Query: 247 FGCGRNNKGLF----GGVSGLMGLGRSDLSLVSQTSE--IFGGLFSYCLPSTQDAGASGS 300
           FGC  + +G          G++GL  + +SL SQ +   I   +F +C+  T++    G 
Sbjct: 305 FGCAYDQQGQLLTSPAKTDGILGLSSAAISLPSQLASQGIISNVFGHCI--TKEPNGGGY 362

Query: 301 LILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGGI--LID 358
           + LG +   +     +T+  +   P     Y      ++ G +QL+  G A   I  + D
Sbjct: 363 MFLGDD---YVPRWGMTWAPIRGGPD--NLYHTEAQKVNYGDQQLRMHGQAGSSIQVIFD 417

Query: 359 SGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTC----FNLSAYQEVNIPLVKME 414
           SG+  T LP  IY  L       +  F      + L  C    F++   ++V      + 
Sbjct: 418 SGSSYTYLPDEIYKKLVTAIKYDYPSFVQDTSDTTLPLCWKADFDVRYLEDVKQFFKPLN 477

Query: 415 FE-GNAEMTVDVTGIV----YFVKSDASQVCLALASLSYEDE--TGIIGNYQQKNQRVIY 467
              GN    +  T  +    Y + SD   VCL L + +  D   T I+G+   + + V+Y
Sbjct: 478 LHFGNRWFVIPRTFTILPDDYLIISDKGNVCLGLLNGAEIDHASTLIVGDVSLRGKLVVY 537

Query: 468 DTKNSQLGFAGEDCS 482
           D +  Q+G+A  +C+
Sbjct: 538 DNERRQIGWADSECT 552


>gi|359488213|ref|XP_002263620.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 434

 Score =  109 bits (272), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 113/375 (30%), Positives = 167/375 (44%), Gaps = 42/375 (11%)

Query: 132 TLNYIATIELGGRNMT--VIVDTGSDLTWVQCQ-PCKSCYNQQDPVFDPSISPSYKKVLC 188
           ++  I ++ +G    T  +++DTGS L+W+QC+ P K+        FDP +S S+  + C
Sbjct: 75  SMALIVSLPIGTPPQTQQMVLDTGSQLSWIQCKVPPKT----PPTAFDPLLSSSFSVLPC 130

Query: 189 NSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIF 247
           N S C            C  +    C+Y   Y DG+Y  G L RE      + +    I 
Sbjct: 131 NHSLCKPRVPDYTLPTSCDQNR--LCHYSYFYADGTYAEGNLVREKFTFSSSQTTPPLIL 188

Query: 248 GCGRNN---KGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLP---STQDAGASGSL 301
           GC  ++   +G+ G     M LGR   S +++ S+     FSYC+P   S   +  +GS 
Sbjct: 189 GCATDSSDTQGILG-----MNLGRLSFSSLAKISK-----FSYCVPPRRSQSGSSPTGSF 238

Query: 302 ILGGNSSV--FKNSTPITYTNMIPNPQLATF-YILNLTGISIGGKQLQASGFA------- 351
            LG N S   FK    +TY      P L    Y L + GI I GK+L  S  A       
Sbjct: 239 YLGPNPSSAGFKYVNLMTYRQSQRMPNLDPLAYTLPMLGIRINGKKLNISTSAFRADPSG 298

Query: 352 KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI---LDTCFNLSAYQ-EVN 407
            G  LIDSGT  T L    YS +K E +K  +G     G+     LD CF+  A      
Sbjct: 299 AGQTLIDSGTWFTFLVDEAYSKVKEEIVK-LAGPKLKKGYVYGGSLDMCFDGDAMVIGRM 357

Query: 408 IPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIY 467
           I  +  EFE   E+ V+   ++  V      + +  + L     + IIGN+ Q++  V +
Sbjct: 358 IGNMAFEFENGVEIVVEREKMLADVGGGVQCLGIGRSDL-LGVASNIIGNFHQQDLWVEF 416

Query: 468 DTKNSQLGFAGEDCS 482
           D    ++GF   DCS
Sbjct: 417 DLVGRRVGFGRTDCS 431


>gi|88174593|gb|ABD39371.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
          Length = 321

 Score =  109 bits (272), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 92/326 (28%), Positives = 157/326 (48%), Gaps = 30/326 (9%)

Query: 135 YIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
           Y+ ++ LG   +   V +DTGS  +WV C+ C  C+      F  S S +  KV C +S 
Sbjct: 1   YVISVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNPR-TFLQSRSTTCAKVSCGTSM 58

Query: 193 CHALEFATGNSGVC-SSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCG 250
           C       G+   C  S + PDC + VSY DGS + G L ++ L       +  F FGC 
Sbjct: 59  C----LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPGFSFGCN 114

Query: 251 RNNKGL--FGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQD-----AGASGSLIL 303
            ++ G   FG V GL+G+G   +S++ Q+S  F   FSYCLP  +      +  +G   L
Sbjct: 115 MDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTF-DCFSYCLPLQKSERGFFSKTTGYFSL 173

Query: 304 GGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQ--LQASGFAKGGILIDSGT 361
           G  ++     T + YT M+   +    + ++L  IS+ G++  L  S F++ G++ DSG+
Sbjct: 174 GKVAT----RTDVRYTKMVARKKNTELFFVDLIAISVDGERLGLSPSVFSRKGVVFDSGS 229

Query: 362 VITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEM 421
            ++ +P    S L ++ +++      A        C+++ +  E ++P + + F+  A  
Sbjct: 230 ELSYIPDRALSVL-SQRIRELLLKRGAAEEESERNCYDMRSVDEGDMPAISLHFDDAARF 288

Query: 422 TVDVTGIVYFVKSDASQ---VCLALA 444
            +   G+  FV+    +    CLA A
Sbjct: 289 DLGSHGV--FVERSVQEQDVWCLAFA 312


>gi|357507803|ref|XP_003624190.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355499205|gb|AES80408.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 476

 Score =  109 bits (272), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 112/401 (27%), Positives = 176/401 (43%), Gaps = 63/401 (15%)

Query: 121 EIPLT-SGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQD----- 172
           ++PL  +G+   T  Y   + LG   +   V VDTGSD+ WV C  C +C  +       
Sbjct: 57  DVPLGGNGLPSSTGLYYTKVGLGSPAKEFYVQVDTGSDILWVNCAGCTACPKKSGLGMDL 116

Query: 173 PVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGR 232
            ++DP+ S +   V C    C      T +  +        C Y ++YGDGS T G    
Sbjct: 117 TLYDPNGSKTSNAVPCGDGFCT----DTYSGPISGCKQDMSCPYSITYGDGSTTSGSFVN 172

Query: 233 EHLGLGKASVN--------DFIFGCGRNNKGLFG-----GVSGLMGLGRSDLSLVSQ--T 277
           + L   + S N          IFGCG    G         + G++G G+++ S++SQ   
Sbjct: 173 DSLTFDEVSGNLHTKPDNSSVIFGCGAKQSGSLSSNSDEALDGIIGFGQANSSVLSQLAA 232

Query: 278 SEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTG 337
           S     +FS+CL S    G      +G       N+TP+        P++A + ++ L  
Sbjct: 233 SGKVKRIFSHCLDSHHGGGI---FSIGQVMEPKFNTTPLV-------PRMAHYNVI-LKD 281

Query: 338 ISIGGKQ------LQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGF 391
           + + G+       L  SG  +G I IDSGT +  LP SIY+ L  + L +       PG 
Sbjct: 282 MDVDGEPILLPLYLFDSGSGRGTI-IDSGTTLAYLPLSIYNQLLPKVLGR------QPGL 334

Query: 392 SILD-----TCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASL 446
            ++      TCF+ S   +   P+VK  FEG   +TV     ++  K D    C+     
Sbjct: 335 KLMIVEDQFTCFHYSDKLDEGFPVVKFHFEG-LSLTVHPHDYLFLYKEDI--YCIGWQKS 391

Query: 447 SYEDETG----IIGNYQQKNQRVIYDTKNSQLGFAGEDCSS 483
           S + + G    +IG+    N+ V+YD +N  +G+   +CSS
Sbjct: 392 STQTKEGRDLILIGDLVLSNKLVVYDLENMVIGWTNFNCSS 432


>gi|168063189|ref|XP_001783556.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162664943|gb|EDQ51645.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 414

 Score =  109 bits (272), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 105/363 (28%), Positives = 165/363 (45%), Gaps = 47/363 (12%)

Query: 150 VDTGSDLTWVQCQ-PCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSS 208
           +DTGSDLTW+QC  PC+SC      ++DP  +   + V C   TC  ++   G    CS 
Sbjct: 48  MDTGSDLTWLQCDAPCRSCAVGPHGLYDPKRA---RVVDCRRPTCAQVQ--RGGQFTCSG 102

Query: 209 SSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDF----IFGCGRNNKGLFGGV---- 260
                C+Y V Y DGS T G L  + + L   +   F    + GCG + +G         
Sbjct: 103 DV-RQCDYEVDYVDGSSTMGILVEDTITLVLTNGTRFQTRAVIGCGYDQQGTLAKAPAVT 161

Query: 261 SGLMGLGRSDLSLVSQTSE--IFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITY 318
            G++GL  S +SL SQ +   I   +  +CL    + G  G L  G        +  +T+
Sbjct: 162 DGVIGLSSSKISLPSQLAAKGIANNVIGHCLAGGSNGG--GYLFFG---DTLVPALGMTW 216

Query: 319 TNMIPNPQLATFYILNLTGISIGGKQLQASGFAK--GGILIDSGTVITRLPPSIYSALKA 376
           T MI  P L   Y   L  I  GG+ L+  G     GG + DSGT  T L P+ Y+A+ +
Sbjct: 217 TPMIGRP-LVEGYQARLRSIKYGGEVLELEGTTDDVGGAMFDSGTSFTYLVPNAYTAVLS 275

Query: 377 EFLKQF--SGFPSAPGFSILDTCF----------NLSAYQEVNIPLVKMEFEGNAEMT-- 422
             ++Q   SG       + L  C+          ++SAY +     V ++F G+   +  
Sbjct: 276 AVVRQAQRSGLERIKTDTTLPFCWRGPSPFESVADVSAYFKT----VTLDFGGSTWWSSG 331

Query: 423 --VDVTGIVYFVKSDASQVCLAL--ASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAG 478
             ++++   Y + S    VCL +  AS++  + T I+G+   +   V+YD    Q+G+  
Sbjct: 332 KLLELSPEGYLIVSTQGNVCLGVLDASVASLEVTNILGDISMRGYLVVYDNMREQIGWVR 391

Query: 479 EDC 481
            +C
Sbjct: 392 RNC 394


>gi|380719867|gb|AFD63134.1| aspartyl protease [Vitis quinquangularis]
          Length = 458

 Score =  109 bits (272), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 103/374 (27%), Positives = 164/374 (43%), Gaps = 49/374 (13%)

Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYN------QQDPVFDPSISPSYKKVLCNSSTCH--- 194
           + ++ ++DTGS + W  C    +C N      ++ P+F+P +S S K + C    C    
Sbjct: 98  QKLSFLMDTGSHVVWAPCTTHYTCTNCSFSNPKKVPIFNPELSSSDKILGCRDPKCADTS 157

Query: 195 ------ALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFG 248
                       GNS  CS + P    Y + YG G+   G    E+L     +++ F+ G
Sbjct: 158 SPBVHLGXPRCNGNSKKCSHACP---QYTLQYGTGA-ASGFFLLENLDFPGKTIHKFLVG 213

Query: 249 CGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQ--DAGASGSLILGGN 306
           C   +         L G GR+  SL  Q        F+YCL S    D   SG LIL  +
Sbjct: 214 C-TTSADREPSSDALAGFGRTMFSLPMQMGV---KKFAYCLNSHDYDDTRNSGKLILDYS 269

Query: 307 SSVFKNSTPITYTNMIPNP-QLATFYILNLTGISIGGKQLQASGF-------AKGGILID 358
                 +  ++Y     NP     +Y L +  + IG K L+  G        ++GG++ID
Sbjct: 270 DG---ETQGLSYAPFXKNPPDYPIYYYLGVKDMKIGNKVLRIPGKYLTPGSDSRGGVVID 326

Query: 359 SGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSI---LDTCFNLSAYQEVNIPLVKMEF 415
           SG   + +   ++  +  E  KQ S +  +        +  C+N + ++ + IP +  +F
Sbjct: 327 SGFAYSYMTLPVFKIVTNELKKQMSKYRRSLELEAQTGVTPCYNFTGHKSIKIPDLIYQF 386

Query: 416 EGNAEMTVDVTGIVYFVK-SDASQVCLALASLS----YEDETG---IIGNYQQKNQRVIY 467
            G A M V   G+ YF+  S+AS  C  + + S     E   G   I+GNYQQ +  V +
Sbjct: 387 TGGANMVVP--GMNYFLLFSEASLGCFPVTTDSPTSNLEFTPGPSIILGNYQQVDHYVEF 444

Query: 468 DTKNSQLGFAGEDC 481
           D KN +LGF  + C
Sbjct: 445 DLKNERLGFRQQTC 458


>gi|359482287|ref|XP_002263129.2| PREDICTED: aspartic proteinase-like protein 2 isoform 2 [Vitis
           vinifera]
 gi|297740017|emb|CBI30199.3| unnamed protein product [Vitis vinifera]
          Length = 502

 Score =  108 bits (271), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 120/408 (29%), Positives = 184/408 (45%), Gaps = 44/408 (10%)

Query: 103 QSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQ 160
           Q+R   ++ G +  V +  +  TS   L  L Y   ++LG   R   V +DTGSD+ WV 
Sbjct: 55  QARHGRLLRGVVGGVVDFTVYGTSDPYLVGL-YFTKVKLGSPPREFNVQIDTGSDILWVT 113

Query: 161 CQPCKSCYNQ-----QDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCN 215
           C  C  C        +   FDPS S +   V C+   C +L   T     CS  S   C+
Sbjct: 114 CNSCNDCPRTSGLGIELSFFDPSSSSTTSLVSCSHPICTSLVQTTAAE--CSPQS-NQCS 170

Query: 216 YFVSYGDGSYTRGELGREHL--------GLGKASVNDFIFGCGRNNKG----LFGGVSGL 263
           Y   YGDGS T G    + L         L   S    +FGC     G    +   + G+
Sbjct: 171 YSFHYGDGSGTTGYYVSDMLYFDTVLGDSLIANSSASIVFGCSTYQSGDLTKVDKAIDGI 230

Query: 264 MGLGRSDLSLVSQTSE--IFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNM 321
            G G+ DLS+VSQ S   I   +FS+CL    D G  G L+LG    + + +  I Y+ +
Sbjct: 231 FGFGQQDLSVVSQLSSLGITPKVFSHCLKGEGDGG--GKLVLG---EILEPN--IIYSPL 283

Query: 322 IPNPQLATFYILNLTGISIGGKQL--QASGFAKG---GILIDSGTVITRLPPSIYSALKA 376
           +P+    + Y LNL  IS+ G+ L    + FA     G ++DSGT +T L  + Y    +
Sbjct: 284 VPS---QSHYNLNLQSISVNGQLLPIDPAVFATSNNQGTIVDSGTTLTYLVETAYDPFVS 340

Query: 377 EFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDV-TGIVYFVKSD 435
                 S   + P  S  + C+ +S   +   P V + F G A M +     +++   SD
Sbjct: 341 AITATVSS-STTPVLSKGNQCYLVSTSVDEIFPPVSLNFAGGASMVLKPGEYLMHLGFSD 399

Query: 436 -ASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
            A+  C+    ++ E    I+G+   K++  +YD  + ++G+A  DCS
Sbjct: 400 GAAMWCIGFQKVA-EPGITILGDLVLKDKIFVYDLAHQRIGWANYDCS 446


>gi|357118398|ref|XP_003560942.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 478

 Score =  108 bits (271), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 111/390 (28%), Positives = 165/390 (42%), Gaps = 72/390 (18%)

Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNS 203
           + + + +DTGSDL W QC  C  C+ Q  P FD   S +   V C+   C + ++     
Sbjct: 112 QRVALTLDTGSDLVWTQCA-CHVCFAQPFPTFDALASQTTLAVPCSDPICTSGKYPLSGC 170

Query: 204 GVCSSSSPPDCNYFVSYGDGSYTRGEL---------------GREHLGLGKASVNDFIFG 248
               ++    C Y   Y D S T G +                + H G+   +V +  FG
Sbjct: 171 TFNDNT----CFYLYDYADKSITSGRIVEDTFTFRSPQGNNGSKAHAGV---AVPNVRFG 223

Query: 249 CGRNNKGLF-GGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDA--------GASG 299
           CG+ NKG+F    SG+ G  R  +SL SQ        FS+C  +  DA        GA G
Sbjct: 224 CGQYNKGIFKSNESGIAGFSRGPMSLPSQLKV---ARFSHCFTAIADARTSPVFLGGAPG 280

Query: 300 SLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFA-------- 351
              LG +++    STP   +N        + Y L L GI++G  +L  +  A        
Sbjct: 281 PDNLGAHATGPVQSTPFANSN-------GSLYYLTLKGITVGKTRLPLNALAFAGKGTGS 333

Query: 352 -KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDT----CFNLS----- 401
             GG +IDSGT I  LP  +Y +L+A F+ +    P A   S  D     CF  +     
Sbjct: 334 GSGGTIIDSGTGIRTLPGPMYRSLRAAFVARVK-LPVA-NESAADAESTLCFEAARSASL 391

Query: 402 -------AYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGI 454
                  A  +V + +   +++   E  V    ++       S +CL + S    D T I
Sbjct: 392 PPEAPAPALPKVVLHVAGADWDLPRESYV--LDLLEDEDGSGSGLCLVMNSAGDSDLT-I 448

Query: 455 IGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
           IGN+QQ+N  V YD + ++L F    C  M
Sbjct: 449 IGNFQQQNMHVAYDLEKNKLVFVPARCDKM 478


>gi|225440729|ref|XP_002275391.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
 gi|147789748|emb|CAN67404.1| hypothetical protein VITISV_025615 [Vitis vinifera]
          Length = 450

 Score =  108 bits (271), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 108/376 (28%), Positives = 162/376 (43%), Gaps = 52/376 (13%)

Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYN--------QQDPVFDPSISPSYKKVLCNSSTCHA 195
           + ++ +VDTGSD+ W  C    +C N        ++ P+FDP +S S K + C +  C +
Sbjct: 89  QKLSFLVDTGSDVVWAPCTTDYTCTNCSFSAADPKKVPIFDPKLSSSSKILDCRNPKCVS 148

Query: 196 LEFA---------TGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFI 246
             F           GNS  CS +    C Y   YG G+ + G    E+L   + ++ +F+
Sbjct: 149 TYFPYVHLGCPRCNGNSKHCSYA----CPYSTQYGTGA-SSGYFLLENLKFPRKTIRNFL 203

Query: 247 FGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPST--QDAGASGSLILG 304
            GC  +          L G GRS  SL  Q        F+YCL S    D   SG LIL 
Sbjct: 204 LGCTTSAARELSS-DALAGFGRSMFSLPIQMGV---KKFAYCLNSHDYDDTRNSGKLILD 259

Query: 305 GNSSVFKNSTPITYTNMIPNPQLATFYI-LNLTGISIGGKQLQ------ASGF-AKGGIL 356
                 K    ++YT  + +P  + FY  L +  I IG K L+      A G   + G++
Sbjct: 260 YRDGKTKG---LSYTPFLKSPPASAFYYHLGVKDIKIGNKLLRIPSKYLAPGSDGRSGVI 316

Query: 357 IDSGT-VITRLPPSIYSALKAEFLKQFSGFP---SAPGFSILDTCFNLSAYQEVNIPLVK 412
           IDSG      +   ++  +  E  KQ S +     A   + L  C+N + ++ + IP + 
Sbjct: 317 IDSGYGGAGYMTGPVFKIVTNELKKQMSKYRRSLEAETQTGLTPCYNFTGHKSIKIPPLI 376

Query: 413 MEFEGNAEMTVDVTGIVYF-VKSDASQVCLAL------ASLSYEDETGIIGNYQQKNQRV 465
            +F G A M   V G  YF +    S  C  +      A     D + I+GN Q  +  V
Sbjct: 377 YQFRGGANMV--VPGKNYFGISPQESLACFLMDTNGTNALEITPDPSIILGNSQHVDYYV 434

Query: 466 IYDTKNSQLGFAGEDC 481
            YD KN + GF  + C
Sbjct: 435 EYDLKNDRFGFRRQTC 450


>gi|255571584|ref|XP_002526738.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223533927|gb|EEF35652.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 457

 Score =  108 bits (271), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 109/401 (27%), Positives = 180/401 (44%), Gaps = 54/401 (13%)

Query: 106 IKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQP 163
           I++++SGNI   S+ + P+ S +      Y+    +G    +   I D+GS L W+QC  
Sbjct: 75  IRSIMSGNI--TSSMKYPI-SRMSYTDKAYVMKFSIGSPAVDTYAIPDSGSSLVWLQCGT 131

Query: 164 --CKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYG 221
             C++CY Q+ P+F+PS S +Y K LCN++ C     A G+           C Y   Y 
Sbjct: 132 PYCRNCYRQKIPLFNPSKSVTYMKRLCNTAECRV---ALGDEYWRCKKPNQICKYHEDYL 188

Query: 222 DGSYTRGELGR------EHL-GLGKASVNDFIFGCGRNNKG-LFGGVSGLMGLGRSDLSL 273
           D SYT G +        EH+ G G  ++   IFGCG NN         GL+GL  +  SL
Sbjct: 189 DDSYTEGVISTDIFTFPEHISGFGNYTLR-IIFGCGYNNSDPQHFYPPGLVGLTNNKASL 247

Query: 274 VSQTSEIFGGLFSYC--LPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFY 331
           V Q        FSYC  + + Q+   S  +  G  +S+  +S     T ++PN     +Y
Sbjct: 248 VGQMDV---DQFSYCVSIDTEQNLKGSMEIRFGLAASISGHS-----TQLVPNSD--GWY 297

Query: 332 IL-NLTGISIGGKQLQASGF----------AKGGILIDSGTVITRLPPSIYSALKAEFLK 380
           I  N+ GI +   + +  G+           +GG+ +D+GT  T L  S+   L     +
Sbjct: 298 IFKNVDGIYV--NEFEVEGYPAWVFKYTEGGQGGLTMDTGTTYTELHNSVMDPLIKLLEE 355

Query: 381 QFSGFP----SAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDA 436
             +  P    S  GF +   C+    +    +P +++ F  N +          +  +  
Sbjct: 356 HITIVPEKDYSNSGFEL---CYFSDDFLGATLPDIELRFTDNKDTYFSFNTRNAWTPNGR 412

Query: 437 SQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFA 477
           SQ+CLA+      +   IIG +Q ++ ++ YD  ++ + F 
Sbjct: 413 SQMCLAMFR---TNGMSIIGMHQLRDIKIGYDLHHNIVSFT 450


>gi|225440720|ref|XP_002275202.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 479

 Score =  108 bits (270), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 117/424 (27%), Positives = 180/424 (42%), Gaps = 64/424 (15%)

Query: 98  HVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELGGRNMTVIVDTGSDLT 157
           H+++ ++     IS +        IPL+ G   Q L+++             VDTGS + 
Sbjct: 65  HLKHGKTSPLTQISLSPHSYGGHSIPLSFGTPPQKLSFL-------------VDTGSHVV 111

Query: 158 WVQCQPCKSCYN--------QQDPVFDPSISPSYKKVLCNSSTC--------H-ALEFAT 200
           W  C    +C N        ++ P+F+P +S S K + C +  C        H       
Sbjct: 112 WAPCTTHYTCTNCSFSDAEPKKVPIFNPKLSSSSKILGCRNPKCVNTSSPDVHLGCPPCN 171

Query: 201 GNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGV 260
           GNS  CS + PP   Y + YG G+ + G+   E+L     ++++F+ GC  +  G     
Sbjct: 172 GNSKNCSHACPP---YSLQYGTGA-SSGDFLLENLNFPGKTIHEFLVGCTTSAVGEVTSA 227

Query: 261 SGLMGLGRSDLSLVSQTSEIFGGLFSYCLPST--QDAGASGSLILGGNSSVFKNSTPITY 318
           + L G GRS  SL  Q        F+YCL S    D   S  LIL  +    K    ++Y
Sbjct: 228 A-LAGFGRSMFSLPMQMGV---KKFAYCLNSHDYDDTRNSSKLILDYSDGETKG---LSY 280

Query: 319 TNMIPN-PQLATFYILNLTGISIGGKQLQ------ASGF-AKGGILIDSGTVITRLPPSI 370
              + N P    +Y L +  I IG K L+      A G   +GG++IDSG     +   +
Sbjct: 281 APFLKNPPDFPIYYYLGVKDIKIGNKLLRIPSKYLAPGSDGRGGLMIDSGFAYGYMTGPV 340

Query: 371 YSALKAEFLKQFSGFP---SAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTG 427
           +  +  E  K+ S +     A     +  C+N +  + + IP +  +F G A M V   G
Sbjct: 341 FKKVTNELKKRMSKYRRSLEAEAEIGVTPCYNFTGQKSIKIPDLIYQFRGGATMVVP--G 398

Query: 428 IVYFVK-SDASQVCLALA------SLSYEDETGII-GNYQQKNQRVIYDTKNSQLGFAGE 479
             YFV   + S  C  L       +L +     II GN Q  +  V +D KN +LGF  +
Sbjct: 399 KNYFVLIPEISLACFPLTTDAGTNTLEFTPGPSIILGNSQHVDYYVEFDLKNERLGFRQQ 458

Query: 480 DCSS 483
            C S
Sbjct: 459 TCQS 462


>gi|242074844|ref|XP_002447358.1| hypothetical protein SORBIDRAFT_06g033560 [Sorghum bicolor]
 gi|241938541|gb|EES11686.1| hypothetical protein SORBIDRAFT_06g033560 [Sorghum bicolor]
          Length = 497

 Score =  108 bits (270), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 121/424 (28%), Positives = 189/424 (44%), Gaps = 73/424 (17%)

Query: 119 NTEIPLTSGIRLQTLN-YIATIELG--GRNMTVIVDTGSDLTWV------QCQPCKSCYN 169
           +  IP T+ +   +   Y  T  LG   + + V++DTGS LTWV       C+ C S + 
Sbjct: 86  HKSIPATAALYPHSYGGYAFTASLGTPPQPLPVLLDTGSQLTWVPCTSNYDCRNCSSPFA 145

Query: 170 QQDPVFDPSISPSYKKVLCNSSTC---HALEFATGNSGVCSSSS---------PPDCNYF 217
              PVF P  S S + V C + +C   H+ E        CS  +         PP   Y 
Sbjct: 146 AAVPVFHPKNSSSSRLVGCRNPSCLWVHSAEHVAKCRAPCSRGANCTPASNVCPP---YA 202

Query: 218 VSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQT 277
           V YG GS T G L  + L     +V+ F+ GC   +  +    SGL G GR   S+ +Q 
Sbjct: 203 VVYGSGS-TAGLLIADTLRAPGRAVSGFVLGCSLVS--VHQPPSGLAGFGRGAPSVPAQL 259

Query: 278 SEIFGGL--FSYCLPSTQ---DAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYI 332
                GL  FSYCL S +   +A  SGSL+LGG++   +   P+  +        A +Y 
Sbjct: 260 -----GLSKFSYCLLSRRFDDNAAVSGSLVLGGDNDGMQY-VPLVKSAAGDKQPYAVYYY 313

Query: 333 LNLTGISIGGKQLQ-------ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSG- 384
           L L+G+++GGK ++       A+    GG ++DSGT  T L P+++  +    +    G 
Sbjct: 314 LALSGVTVGGKAVRLPARAFAANAAGSGGAIVDSGTTFTYLDPTVFQPVADAVVAAVGGR 373

Query: 385 FPSAPGFSI---LDTCFNLS-AYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQV- 439
           +  +        L  CF L    + + +P + + F+G A M + +    YFV +  + V 
Sbjct: 374 YKRSKDVEEGLGLHPCFALPQGAKSMALPELSLHFKGGAVMQLPLEN--YFVVAGRAPVP 431

Query: 440 ------------CLALAS--------LSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGE 479
                       CLA+ +                I+G++QQ+N  V YD +  +LGF  +
Sbjct: 432 GAGAGAGAAEAICLAVVTDFGGSGAGDEGGGPAIILGSFQQQNYLVEYDLEKERLGFRRQ 491

Query: 480 DCSS 483
            C+S
Sbjct: 492 PCAS 495


>gi|226491620|ref|NP_001149154.1| pepsin A precursor [Zea mays]
 gi|195625132|gb|ACG34396.1| pepsin A [Zea mays]
          Length = 537

 Score =  108 bits (270), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 98/412 (23%), Positives = 173/412 (41%), Gaps = 58/412 (14%)

Query: 121 EIPLTSGIRLQTLN-YIATIELGGRNM--TVIVDTGSDLTWVQCQPCK---SCYNQQ--- 171
           E+P+ S + +  +  Y+ ++ +G   +   +++DT +DLTW+ C+  +     Y +Q   
Sbjct: 109 ELPMRSALNIAHVGMYLVSVRIGTPALPYNLVLDTATDLTWINCRLRRRKGKHYGRQSMG 168

Query: 172 ------------------DPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPD 213
                                + P+ S S++++ C+   C  L + T     C S S  +
Sbjct: 169 QTMSVGGEGATAAKKEASKNWYRPAKSSSWRRIRCSQKECAVLPYNT-----CQSPSKAE 223

Query: 214 -CNYFVSYGDGSYTRGELGREHLGL----GK-ASVNDFIFGCG-RNNKGLFGGVSGLMGL 266
            C+YF    DG+ T G  G+E   +    G+ A +   I GC      G      G++ L
Sbjct: 224 SCSYFQKTQDGTVTIGIYGKEKATVTVSDGRMAKLPGLILGCSVLEAGGSVDAHDGVLSL 283

Query: 267 GRSDLSLVSQTSEIFGGLFSYCLPSTQDA-GASGSLILGGNSSVFKNSTPITYTNMIPNP 325
           G  D+S     ++ FG  FS+CL S   +  AS  L  G N +V    T    T+++ N 
Sbjct: 284 GNGDMSFAVHAAKRFGQRFSFCLLSANSSRDASSYLTFGPNPAVMGPGT--METDILYNV 341

Query: 326 QLATFYILNLTGISIGGKQL-------QASGFAKGGILIDSGTVITRLPPSIYSALKAEF 378
            +   Y   +TG+ +GG++L        A  F  GG+++D+ T +T L P  Y+ + A  
Sbjct: 342 DVKPAYGAKVTGVLVGGERLDIPDEVWDAERFVGGGVILDTSTSVTSLVPEAYAPVTAAL 401

Query: 379 LKQFSGFPSAPGFSILDTCFN-------LSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYF 431
            +  S  P        + C+        +     V IP   +E  G A +  +   +V  
Sbjct: 402 DRHLSHLPRVYELEGFEYCYKWTFTGDGVXPAHNVTIPSFTVEMAGGARLEPEAKSVV-M 460

Query: 432 VKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSS 483
            + +    CLA   L      GI+GN   +      D  + ++ F  + C++
Sbjct: 461 PEVEPGVACLAFRKL-LRGGPGILGNVFMQEYIWEIDHGDGKIRFRKDKCNT 511


>gi|302783112|ref|XP_002973329.1| hypothetical protein SELMODRAFT_413603 [Selaginella moellendorffii]
 gi|300159082|gb|EFJ25703.1| hypothetical protein SELMODRAFT_413603 [Selaginella moellendorffii]
          Length = 437

 Score =  108 bits (270), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 107/371 (28%), Positives = 168/371 (45%), Gaps = 38/371 (10%)

Query: 135 YIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
           Y   I LG   + + VIVDTGSD+ WV+C PC+SC ++QD +  P    +      +S +
Sbjct: 83  YYTEIGLGNPVQKLKVIVDTGSDILWVKCSPCRSCLSKQD-IIPPLSIYNLSASSTSSVS 141

Query: 193 CHALEFATGNSGVCSSS-SPPDCNYFVSYGDGSYTRGELGREHLGL----GKASVNDFIF 247
             +    TG   VCS S S   C Y +SY D S + G   ++ +      G A+ +   F
Sbjct: 142 SCSDPLCTGEQAVCSRSGSNSACAYGISYQDKSTSIGAYVKDDMHYVLQGGNATTSHIFF 201

Query: 248 GCGRNNKGLFGGVSGLMGLGRSDLSLVSQ--TSEIFGGLFSYCLPSTQDAGASGSLILGG 305
           GC  N  G +    G+MG G+   ++ +Q  T      +FS+CL   +  G  G L  G 
Sbjct: 202 GCAINITGSW-PADGIMGFGQISKTVPNQIATQRNMSRVFSHCLGGEKHGG--GILEFGE 258

Query: 306 NSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ---------ASGFAKGGIL 356
                 N+T + +T ++    + T Y ++L  IS+  K L          ++   + G++
Sbjct: 259 EP----NTTEMVFTPLL---NVTTHYNVDLLSISVNSKVLPIDSKEFSYVSNSTNETGVI 311

Query: 357 IDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSA--YQEVNIPLVKME 414
           IDSGT    L       L +E +K  +     P    L  CF L +    E + P V + 
Sbjct: 312 IDSGTSFALLATKANRILFSE-IKNLTTAKLGPKLEGLQ-CFYLKSGLTVETSFPNVTLT 369

Query: 415 FEGNAEMTVDVTGIVYFV--KSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNS 472
           F G + M +     +  V  K   +  C A +S    D   I G    K++ V YD +N 
Sbjct: 370 FSGGSTMKLKPDNYLVMVELKKKRNGYCYAWSS---ADGLTIFGEIVLKDKLVFYDVENR 426

Query: 473 QLGFAGEDCSS 483
           ++G+ G++CSS
Sbjct: 427 RIGWKGQNCSS 437


>gi|15242307|ref|NP_199325.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|9758987|dbj|BAB09497.1| chloroplast nucleoid DNA-binding protein-like [Arabidopsis
           thaliana]
 gi|332007824|gb|AED95207.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 491

 Score =  108 bits (270), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 123/438 (28%), Positives = 188/438 (42%), Gaps = 77/438 (17%)

Query: 103 QSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQ 160
           Q RIK       K +S+ ++ +   +R     Y+ T+ +G   + + V +DTGSDLTWV 
Sbjct: 59  QERIK-------KPLSSVDV-VMEPLREVRDGYLITLNIGTPPQAVQVYLDTGSDLTWVP 110

Query: 161 CQ----PCKSCYNQQD------PVFDPSISPSYKKVLCNSSTCHALEFATGNSGVC---- 206
           C      C  CY+ ++       VF P  S +  +  C SS C  +  +      C    
Sbjct: 111 CGNLSFDCIECYDLKNNDLKSPSVFSPLHSSTSFRDSCASSFCVEIHSSDNPFDPCAVAG 170

Query: 207 --------SSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFG 258
                   S+   P  ++  +YG+G    G L R+ L      V  F FGC  +    + 
Sbjct: 171 CSVSMLLKSTCVRPCPSFAYTYGEGGLISGILTRDILKARTRDVPRFSFGCVTST---YR 227

Query: 259 GVSGLMGLGRSDLSLVSQTSEIFGGLFSYC-LPS--TQDAGASGSLILGGNSSVFKNSTP 315
              G+ G GR  LSL SQ   +  G FS+C LP     +   S  LILG ++     +  
Sbjct: 228 EPIGIAGFGRGLLSLPSQLGFLEKG-FSHCFLPFKFVNNPNISSPLILGASALSINLTDS 286

Query: 316 ITYTNMIPNPQLATFYILNLTGISIGG-----------KQLQASGFAKGGILIDSGTVIT 364
           + +T M+  P     Y + L  I+IG            +Q  + G   GG+L+DSGT  T
Sbjct: 287 LQFTPMLNTPMYPNSYYIGLESITIGTNITPTQVPLTLRQFDSQG--NGGMLVDSGTTYT 344

Query: 365 RLPPSIYSALKAEFLKQFSGFPSAP------GFSILDTCF-------NLSAYQE---VNI 408
            LP   YS L    L+    +P A       GF   D C+       NL++ +    +  
Sbjct: 345 HLPEPFYSQLLTT-LQSTITYPRATETESRTGF---DLCYKVPCPNNNLTSLENDVMMIF 400

Query: 409 PLVKMEFEGNAEMTVDVTGIVYFV--KSDASQV-CLALASLSYED--ETGIIGNYQQKNQ 463
           P +   F  NA + +      Y +   SD S V CL   ++   D    G+ G++QQ+N 
Sbjct: 401 PSITFHFLNNATLLLPQGNSFYAMSAPSDGSVVQCLLFQNMEDGDYGPAGVFGSFQQQNV 460

Query: 464 RVIYDTKNSQLGFAGEDC 481
           +V+YD +  ++GF   DC
Sbjct: 461 KVVYDLEKERIGFQAMDC 478


>gi|414866064|tpg|DAA44621.1| TPA: putative aspartic protease family protein [Zea mays]
          Length = 454

 Score =  108 bits (270), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 108/378 (28%), Positives = 159/378 (42%), Gaps = 49/378 (12%)

Query: 144 RNMTVIVDTGSDLTWVQCQPCK--SCYNQQDP-VFDPSISPSYKKVLCNSSTCHALEFAT 200
           +N+T+++DTGS+L+W++C   +  S    Q P  F+ S S +Y    C+S  C       
Sbjct: 73  QNVTMVLDTGSELSWLRCNGSRVPSTPPPQAPAAFNGSASSTYAAAHCSSPECQWRGRDL 132

Query: 201 GNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGC-------GRNN 253
                C+      C   +SY D S   G L  +   LG A     +FGC          N
Sbjct: 133 PVPPFCAGPPSNSCRVSLSYADASSADGILAADTFLLGGAPPVRALFGCVTSYSSATATN 192

Query: 254 KGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNS 313
                  +GL+G+ R  LS V+QT+ +    F+YC+         G L+LGG+ +     
Sbjct: 193 SSDSEAATGLLGMNRGSLSFVTQTATL---RFAYCI---APGDGPGLLVLGGDGAALAPQ 246

Query: 314 TPITYTNMI----PNPQL-ATFYILNLTGISIGGK-------QLQASGFAKGGILIDSGT 361
             + YT +I    P P      Y + L GI +G          L       G  ++DSGT
Sbjct: 247 --LNYTPLIQISRPLPYFDRVAYSVQLEGIRVGAALLPIPKSVLAPDHTGAGQTMVDSGT 304

Query: 362 VITRLPPSIYSALKAEFLKQFSGFPSAPGFS------ILDTCFNLS----AYQEVNIPLV 411
             T L    Y+ LK EFL Q S   +  G S        D CF  S    A     +P V
Sbjct: 305 QFTFLLADAYAPLKGEFLNQTSALLAPLGESDFVFQGAFDACFRASEARVAAASQMLPEV 364

Query: 412 KMEFEGNAEMTVDVTGIVYFVKSD------ASQV-CLALASLSYEDETG-IIGNYQQKNQ 463
            +   G AE+ V    ++Y V  +      A  V CL   +      +  +IG++ Q+N 
Sbjct: 365 GLVLRG-AEVAVGGEKLLYRVPGERRGEGGAEAVWCLTFGNSDMAGMSAYVIGHHHQQNV 423

Query: 464 RVIYDTKNSQLGFAGEDC 481
            V YD +N ++GFA   C
Sbjct: 424 WVEYDLQNGRVGFAPARC 441


>gi|168064205|ref|XP_001784055.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162664441|gb|EDQ51161.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 459

 Score =  108 bits (269), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 111/414 (26%), Positives = 178/414 (42%), Gaps = 64/414 (15%)

Query: 100 QYLQSRIKNMISGNIK-DVSNTEIPLTSGIRLQTLNYIATIELGG--RNMTVIVDTGSDL 156
           ++ Q R++ ++   +   +S  +   T+G+      Y   I LG   +   V VDTGSD+
Sbjct: 18  EHDQRRLRRILPEVVAFPISGDDDTFTTGL------YYTRIYLGTPPQQFYVHVDTGSDV 71

Query: 157 TWVQCQPCKSCYNQQD-----PVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSP 211
            WV C PC +C    +      +FDP  S S   + C    C+       ++  CS +S 
Sbjct: 72  AWVNCVPCTNCKRASNVALPISIFDPEKSTSKTSISCTDEECYL-----ASNSKCSFNS- 125

Query: 212 PDCNYFVSYGDGSYTRGELGREHLGLGKASVND---------FIFGCGRNNKGLFGGVSG 262
             C Y   YGDGS T G L  + L   +    +           FGCG N  G +    G
Sbjct: 126 MSCPYSTLYGDGSSTAGYLINDVLSFNQVPSGNSTATSGTARLTFGCGSNQTGTW-LTDG 184

Query: 263 LMGLGRSDLSLVSQTSE--IFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTN 320
           L+G G++++SL SQ S+  +   +F++CL    D   SG+L++G           + YT 
Sbjct: 185 LVGFGQAEVSLPSQLSKQNVSVNIFAHCLQG--DNKGSGTLVIG-----HIREPGLVYTP 237

Query: 321 MIPNPQLATFYILNLTGISIGGKQLQASGFA---KGGILIDSGTVITRLPPSIYSALKAE 377
           ++P        +LN+ G+S G      + F     GG+++DSGT +T L    Y   +A+
Sbjct: 238 IVPKQSHYNVELLNI-GVS-GTNVTTPTAFDLSNSGGVIMDSGTTLTYLVQPAYDQFQAK 295

Query: 378 FLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVY--FVKSD 435
                          +L   F      E   P V + F G A M +  +  +Y   + + 
Sbjct: 296 VRDCMRS-------GVLPVAFQFFCTIEGYFPNVTLYFAGGAAMLLSPSSYLYKEMLTTG 348

Query: 436 ASQVCLALAS-------LSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
            S  C +          LSY     I G+   K+Q V+YD  N+++G+   DC+
Sbjct: 349 LSAYCFSWLESTSVYGYLSYT----IFGDNVLKDQLVVYDNVNNRIGWKNFDCT 398


>gi|326525377|dbj|BAK07958.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 463

 Score =  108 bits (269), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 100/419 (23%), Positives = 169/419 (40%), Gaps = 58/419 (13%)

Query: 99  VQYLQSRIKNMISGNIKDVSNTEIPLTSGIR--LQTLNYIATIELGGRNMTVIVDTGSDL 156
           +Q  + ++K  I   I   SN   PL   +   +  +      E G +N  + +D G  L
Sbjct: 62  LQRAKEQVKCRIKHQILPTSNEMRPLMCPLEDAVYAVVVGVGTEAGFQNYQLALDMGGGL 121

Query: 157 TWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNY 216
           +W+QC PC+ C  Q  PVFDP+ SP++  +  +++      +    +G C         +
Sbjct: 122 SWMQCLPCRHCLLQMSPVFDPTKSPTFSNIPAHNTVWCRPPYQPLANGACG--------F 173

Query: 217 FVSYGDGSYTRGELGREHLGLGKASVNDF------IFGCGRNNKGLFG--GVSGLMGL-- 266
            ++Y D ++  G L R+      A  +DF      +FGC    +       V+G++GL  
Sbjct: 174 DIAYRDNTHASGYLARDTFSF-PAGNDDFVPLSAIVFGCAHQTEHFKNQRAVAGILGLGM 232

Query: 267 ---GRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGN------SSVFKNSTPIT 317
              G+   +   Q     GG FSYC P          L  G +       +V + STP+ 
Sbjct: 233 GPAGKPPTAFTKQVLPAHGGRFSYC-PFVPGMSMYSYLRFGSDIPSHPPPNVHRQSTPV- 290

Query: 318 YTNMIPNPQLATFYILNLTGISIGGKQL--------QASGFAKGGILIDSGTVITRLPPS 369
               +     +  Y + L G+S+G  +L        + +    GG ++D GT +T    S
Sbjct: 291 ----LAPAHNSEAYFVKLAGVSVGANRLSGVTPAMFRRNAHGAGGCVVDIGTRMTAFIHS 346

Query: 370 IY----SALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDV 425
            Y     A++    ++ +      G    +TC    A     +P + + FE  A + V  
Sbjct: 347 AYVHIDHAVRQHLQRRGAHIVVVRG----NTCVQQPAPHHDVLPSMTLHFENGAWLRVMP 402

Query: 426 TGI-VYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNS--QLGFAGEDC 481
             + + FV       C    S     +  +IG  QQ N R I+D  ++   + F  EDC
Sbjct: 403 EHVFMPFVVGGHHYQCFGFVS---STDLTVIGARQQVNHRFIFDLHDTIPIMSFNPEDC 458


>gi|242057809|ref|XP_002458050.1| hypothetical protein SORBIDRAFT_03g026140 [Sorghum bicolor]
 gi|241930025|gb|EES03170.1| hypothetical protein SORBIDRAFT_03g026140 [Sorghum bicolor]
          Length = 489

 Score =  107 bits (268), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 111/403 (27%), Positives = 170/403 (42%), Gaps = 60/403 (14%)

Query: 117 VSNTEIPLTS-GIRLQTLNYIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQD- 172
           ++  ++PL   G+   T  Y   I+LG   +   V VDTGSD+ WV C  C+ C  +   
Sbjct: 65  LAAADLPLGGLGLPTDTGLYFTEIKLGTPPKRYYVQVDTGSDILWVNCISCEKCPRKSGL 124

Query: 173 ----PVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRG 228
                 +DP  S S   V C+   C A     G    C+++ P  C Y V YGDGS T G
Sbjct: 125 GLDLTFYDPKASSSGSTVSCDQGFCAATY--GGKLPGCTANVP--CEYSVMYGDGSSTTG 180

Query: 229 ELGREHLGL-----------GKASVNDFIFGCGRNNKGLFG----GVSGLMGLGRSDLSL 273
               + L             G A+V    FGCG    G  G     + G++G G+++ S+
Sbjct: 181 FFVTDALQFDQVTGDGQTQPGNATVT---FGCGAQQGGDLGSSNQALDGILGFGQANTSM 237

Query: 274 VSQTSEI--FGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFY 331
           +SQ +       +F++CL    D    G +   GN    K  T     +M         Y
Sbjct: 238 LSQLAAAGKVKKIFAHCL----DTIKGGGIFAIGNVVQPKVKTTPLVADM-------PHY 286

Query: 332 ILNLTGISIGGKQLQ--ASGFAKG---GILIDSGTVITRLPPSIYSALKAEFLKQFSG-- 384
            +NL  I +GG  LQ  A  F  G   G +IDSGT +T LP  ++  + A    +     
Sbjct: 287 NVNLKSIDVGGTTLQLPAHVFETGERKGTIIDSGTTLTYLPELVFKEVMAAIFNKHQDIV 346

Query: 385 FPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALA 444
           F +   F     CF      +   P +   FE   ++ + V    YF  +     C+   
Sbjct: 347 FHNVQDF----MCFQYPGSVDDGFPTITFHFE--DDLALHVYPHEYFFPNGNDMYCVGFQ 400

Query: 445 SLSYEDETG----IIGNYQQKNQRVIYDTKNSQLGFAGEDCSS 483
           + + + + G    ++G+    N+ VIYD +N  +G+   +CSS
Sbjct: 401 NGALQSKDGKDIVLMGDLVLSNKLVIYDLENQVIGWTDYNCSS 443


>gi|212274314|ref|NP_001130524.1| uncharacterized protein LOC100191623 [Zea mays]
 gi|194689376|gb|ACF78772.1| unknown [Zea mays]
 gi|224031455|gb|ACN34803.1| unknown [Zea mays]
 gi|238011528|gb|ACR36799.1| unknown [Zea mays]
 gi|238015454|gb|ACR38762.1| unknown [Zea mays]
          Length = 304

 Score =  107 bits (268), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 91/298 (30%), Positives = 141/298 (47%), Gaps = 31/298 (10%)

Query: 207 SSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFI-------FGCGRNNKGLFGG 259
           S   P  C Y  +YGDG+ T G    E      +             FGCG  N G    
Sbjct: 15  SCERPDTCTYRYNYGDGTMTVGVYATERFTFASSGGGGLTTTTVPLGFGCGSVNVGSLNN 74

Query: 260 VSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNST-PITY 318
            SG++G GR+ LSLVSQ S      FSYCL S      S  L    +  V+ ++T  +  
Sbjct: 75  GSGIVGFGRNPLSLVSQLSI---RRFSYCLTSYASRRQSTLLFGSLSDGVYGDATGRVQT 131

Query: 319 TNMIPNPQLATFYILNLTGISIGGKQLQ--ASGFA-----KGGILIDSGTVITRLPPSIY 371
           T ++ +PQ  TFY ++ TG+++G ++L+   S FA      GG+++DSGT +T LP ++ 
Sbjct: 132 TPLLQSPQNPTFYYVHFTGLTVGARRLRIPESAFALRPDGSGGVIVDSGTALTLLPAAVL 191

Query: 372 SALKAEFLKQFSGFPSAPGFSILD-TCFNL-------SAYQEVNIPLVKMEFEGNAEMTV 423
           + +   F +Q    P A G +  D  CF +       S+  ++ +P + + F+G A++ +
Sbjct: 192 AEVVRAFRQQLR-LPFANGGNPEDGVCFLVPAAWRRSSSTSQMPVPRMVLHFQG-ADLDL 249

Query: 424 DVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
                V        ++CL LA     D+   IGN  Q++ RV+YD +   L  A   C
Sbjct: 250 PRRNYV-LDDHRRGRLCLLLADSG--DDGSTIGNLVQQDMRVLYDLEAETLSIAPARC 304


>gi|326524806|dbj|BAK04339.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 460

 Score =  107 bits (267), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 99/374 (26%), Positives = 163/374 (43%), Gaps = 51/374 (13%)

Query: 142 GGRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATG 201
           G R   + +D  ++L W+QC+P +  + Q  P F+P+ SPS++++  N++ C  L    G
Sbjct: 95  GRRTYVLALDMTTNLLWMQCKPVQEPFTQLPPPFEPAKSPSFRRLPGNNAFC--LPAPRG 152

Query: 202 NSGVCSSSSPPDCNYFVSYGDGSY-TRGELGREHLGLG-----KASVNDFIFGCGRNNKG 255
           +           C +     DGS   RG L  E L        +  V   + GC  N+KG
Sbjct: 153 HRRTVQDP----CKFHSIRLDGSADARGVLSNETLAFAASGQQQTEVTGVVIGCTHNSKG 208

Query: 256 L----FGGVSGLMGLGRSDLSLVSQTSEIFGGL-----FSYCLPSTQDAGASGSLILGGN 306
                 G ++G++GLGR   SL+    +   G      FSYCLPS   + +     L  +
Sbjct: 209 FNFNSHGVLAGVLGLGRQAPSLIWTLGQHRHGTVQVHRFSYCLPSHGSSSSDHHTFLRFD 268

Query: 307 SSVFKN----STPITYTNMIPNPQLATFYILNLTGISIGGKQLQ--ASGFAK-------- 352
             V       ST I Y +   +     +++ +LTGIS+ GK LQ     F +        
Sbjct: 269 DDVPNTQHMVSTKIMYMDSTTSRDFRAYFV-SLTGISVAGKPLQDVKELFKRHVHGQVWT 327

Query: 353 GGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSIL----DTCFNLSAYQEVNI 408
            G   D+GT    +    Y+ LK   ++         G  I+      CF  ++    ++
Sbjct: 328 SGCAFDAGTPTMVMIMPAYNKLKDAVVRHLKPL----GLQIVSGQYHLCFRATSQLWQHL 383

Query: 409 PLVKMEF-EGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIY 467
           P V ++F E  A + +    +   V  D   +CLA+   SY  +  IIG  QQ ++R +Y
Sbjct: 384 PTVMLQFAETEARLVLPPQRLFVAVGYD---ICLAVVR-SY--DITIIGAMQQVDKRFVY 437

Query: 468 DTKNSQLGFAGEDC 481
           D ++ ++ F  E+ 
Sbjct: 438 DVRHGRIYFVPENA 451


>gi|218194599|gb|EEC77026.1| hypothetical protein OsI_15382 [Oryza sativa Indica Group]
          Length = 409

 Score =  107 bits (267), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 107/370 (28%), Positives = 164/370 (44%), Gaps = 51/370 (13%)

Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYNQ-----QDPVFDPSISPSYKKVLCNSSTCHALEF 198
           +   V VDTGSD+ WV C  C  C  +     +  ++DP  S +  KV C+   C A   
Sbjct: 15  KRYYVQVDTGSDILWVNCISCDRCPRKSGLGLELTLYDPKDSSTGSKVSCDQGFCAAT-- 72

Query: 199 ATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHL--------GLGKASVNDFIFGCG 250
             G    C++S P  C Y V+YGDGS T G    + L        G  + + +   FGCG
Sbjct: 73  YGGLLPGCTTSLP--CEYSVTYGDGSSTTGYFVSDLLQFDQVSGDGQTRPANSTVTFGCG 130

Query: 251 RNNKGLFG----GVSGLMGLGRSDLSLVSQTSEI--FGGLFSYCLPSTQDAGASGSLILG 304
               G  G     + G++G G+S+ S++SQ S       +F++CL    D    G +   
Sbjct: 131 SQQGGDLGSSNQALDGIIGFGQSNTSMLSQLSAAGKVKKIFAHCL----DTINGGGIFAI 186

Query: 305 GNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGF-----AKGGILIDS 359
           GN    K  T    T ++PN      Y +NL  I +GG  L+          K G +IDS
Sbjct: 187 GNVVQPKVKT----TPLVPN---MPHYNVNLKSIDVGGTALKLPSHMFDTGEKKGTIIDS 239

Query: 360 GTVITRLPPSIYSALK-AEFLKQFS-GFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEG 417
           GT +T LP  +Y  +  A F K     F +   F     CF      + + P +   FE 
Sbjct: 240 GTTLTYLPEIVYKEIMLAVFAKHKDITFHNVQEF----LCFQYVGRVDDDFPKITFHFEN 295

Query: 418 NAEMTVDVTGIVYFVKSDASQVCLAL--ASLSYEDETGII--GNYQQKNQRVIYDTKNSQ 473
             ++ ++V    YF ++  +  C+      L  +D  G++  G+    N+ V+YD +N  
Sbjct: 296 --DLPLNVYPHDYFFENGDNLYCVGFQNGGLQSKDGKGMVLLGDLVLSNKLVVYDLENQV 353

Query: 474 LGFAGEDCSS 483
           +G+   +CSS
Sbjct: 354 IGWTEYNCSS 363


>gi|255545620|ref|XP_002513870.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223546956|gb|EEF48453.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 535

 Score =  107 bits (267), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 104/386 (26%), Positives = 167/386 (43%), Gaps = 52/386 (13%)

Query: 127 GIRLQTLNYIATIELGGRNMTVIV--DTGSDLTWV-----QCQPCK-SCYNQQD---PVF 175
           G  L  L+Y   I++G  N++ +V  D GSDL+WV     QC P   S Y   D     +
Sbjct: 95  GNDLDWLHY-TWIDIGTPNVSFLVALDAGSDLSWVPCDCIQCAPLSASLYKPLDRDLSEY 153

Query: 176 DPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHL 235
            PS+S + + + CN   C   E  +     C +   P C Y   Y D + +      E +
Sbjct: 154 RPSLSTTSRHLSCNHQLC---ELGSH----CKNLKDP-CPYIADYADPNTSSSGFLVEDI 205

Query: 236 GLGKASVND------------FIFGCGRNNKGLF---GGVSGLMGLGRSDLSLVSQTSE- 279
            L  ASV+D             I GCGR   G +       G+MGLG   +S+ S  ++ 
Sbjct: 206 -LHLASVSDDSNSTQKRVQASVILGCGRKQTGGYLDGAAPDGVMGLGPGSISVPSLLAKA 264

Query: 280 -IFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGI 338
            +    FS C     D   SG+++ G      + STP     ++P       Y++ +   
Sbjct: 265 GLIRKSFSLCF----DVNGSGTILFGDQGHTSQKSTP-----LLPTQGNYDAYLIEVESY 315

Query: 339 SIGGKQLQASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCF 398
            +G   L+ SGF     L+DSG   T LP  +Y+ +  EF KQ +    +      + C+
Sbjct: 316 CVGNSCLKQSGFKA---LVDSGASFTYLPIDVYNKIVLEFDKQVNAQRISSQGGPWNYCY 372

Query: 399 NLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNY 458
           N S+ Q  N+P +++ F  N  + +  +   Y+V  +       L     +   GIIG  
Sbjct: 373 NTSSKQLDNVPAMRLSFLMNQSLLIHNS--TYYVPQNQEFAVFCLTLQPTDLNYGIIGQN 430

Query: 459 QQKNQRVIYDTKNSQLGFAGEDCSSM 484
                RV++D +N +LG++  +C  +
Sbjct: 431 YMTGYRVVFDMENLKLGWSSSNCKDI 456


>gi|125595855|gb|EAZ35635.1| hypothetical protein OsJ_19925 [Oryza sativa Japonica Group]
          Length = 335

 Score =  107 bits (267), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 89/345 (25%), Positives = 140/345 (40%), Gaps = 97/345 (28%)

Query: 145 NMTVIVDTGSDLTWVQCQPCK--SCYNQQDPVFDPSISPSYKKVLCNSSTCHAL-EFATG 201
           + TVI+D+GSD+ WVQCQPC    C+ Q+DP+FDP+ S +Y  V C+S+ C  L  +  G
Sbjct: 80  SQTVIIDSGSDVPWVQCQPCPLLVCHPQRDPLFDPATSTTYAAVPCSSAACARLGPYRRG 139

Query: 202 NSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASV-NDFIFGCGRNNKGLFGGV 260
               C ++S   C + ++Y +G+   G    + L LG   V   F+FGC   ++G     
Sbjct: 140 ----CLANS--QCQFGITYANGATATGTYSSDDLTLGPYDVVRGFLFGCAHADQG----- 188

Query: 261 SGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTN 320
                                   FSY          +G+L LGG S  F   T   Y+ 
Sbjct: 189 ----------------------STFSY--------DVAGTLALGGGSQSFVQQTASQYSR 218

Query: 321 M----IPNPQLATFYILNLTGISIGGKQLQASGFAKGGILIDSGTVITRLPPSIYSALKA 376
           +    +P P  ++F                             G ++  +PP      +A
Sbjct: 219 VFSYCVP-PSTSSF-----------------------------GFIMFGVPPQ-----RA 243

Query: 377 EFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDA 436
             +  F   P      +L +      +  + +P + + F+G A + +D  GI+       
Sbjct: 244 ALVPTFVSTP------LLSSSTMSPTFYSITLPSIALVFDGGATVNLDAAGILL------ 291

Query: 437 SQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
            Q CLA A  + +   G IGN QQ+   V+YD     + F    C
Sbjct: 292 -QGCLAFAPTASDRMPGFIGNVQQRTLEVVYDVPGKAIRFRSAAC 335


>gi|224065128|ref|XP_002301682.1| predicted protein [Populus trichocarpa]
 gi|222843408|gb|EEE80955.1| predicted protein [Populus trichocarpa]
          Length = 441

 Score =  107 bits (267), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 117/368 (31%), Positives = 167/368 (45%), Gaps = 59/368 (16%)

Query: 148 VIVDTGSDLTWVQCQPCKSCYNQQDP--VFDPSISPSYKKVLCNSSTCHAL--EFATGNS 203
           +I+DTGS L+W+QC   K    +  P  VFDPS+S S+  + CN   C     +F    S
Sbjct: 97  MILDTGSQLSWIQCH--KKVPRKPPPSSVFDPSLSSSFSVLPCNHPLCKPRIPDFTLPTS 154

Query: 204 GVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCGRNN---KGLFGG 259
             C  +    C+Y   Y DG+   G L RE +   ++ S    I GC   +   KG+ G 
Sbjct: 155 --CDQNR--LCHYSYFYADGTLAEGNLVREKITFSRSQSTPPLILGCAEESSDAKGILG- 209

Query: 260 VSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQ---DAGASGSLILG--GNSSVFKNST 314
               M LGR  LS  SQ        FSYC+P+ Q       +GS  LG   NS  F+   
Sbjct: 210 ----MNLGR--LSFASQAKLT---KFSYCVPTRQVRPGFTPTGSFYLGENPNSGGFRYIN 260

Query: 315 PITYTNMIPNPQLATF-YILNLTGISIGGKQLQA--SGF-----AKGGILIDSGTVITRL 366
            +T++     P L    Y + + GI IG ++L    S F       G  +IDSG+  T L
Sbjct: 261 LLTFSQSQRMPNLDPLAYTVAMQGIRIGNQKLNIPISAFRPDPSGAGQTMIDSGSEFTYL 320

Query: 367 PPSIYSALKAEFLKQFSGFPSAPGF---SILDTCFNLSAYQEVNIPLVKM--EFEGNAEM 421
               Y+ ++ E ++   G     G+    + D CFN +A  E+   +  M  EF+   E+
Sbjct: 321 VDEAYNKVREEVVR-LVGARLKKGYVYGGVSDMCFNGNAI-EIGRLIGNMVFEFDKGVEI 378

Query: 422 TV-------DVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQL 474
            V       DV G V+ V    S++ L  AS        IIGN+ Q+N  V +D  N ++
Sbjct: 379 VVEKERVLADVGGGVHCVGIGRSEM-LGAAS-------NIIGNFHQQNIWVEFDLANRRV 430

Query: 475 GFAGEDCS 482
           GF   DCS
Sbjct: 431 GFGKADCS 438


>gi|384252236|gb|EIE25712.1| acid protease [Coccomyxa subellipsoidea C-169]
          Length = 599

 Score =  107 bits (266), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 107/427 (25%), Positives = 184/427 (43%), Gaps = 53/427 (12%)

Query: 88  QQNRLILDNLHVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GRN 145
           QQ+ L+L       L+ R   +I+  +  + N  +PL   ++     Y AT+ LG   R 
Sbjct: 24  QQDSLVLP------LRRRDGGIIARGL--LRNATLPLHGAVKDYGYFY-ATLHLGTPARQ 74

Query: 146 MTVIVDTGSDLTWVQCQPC-KSC-YNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNS 203
             VIVDTGS +T+V C  C ++C  + +D  FDP+ S S   + C+S  C          
Sbjct: 75  FAVIVDTGSTITYVPCASCGRNCGPHHKDAAFDPASSSSSAVIGCDSDKC------ICGR 128

Query: 204 GVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGG--VS 261
             C  S   +C Y  +Y + S + G L  + L L   +V + +FGC     G        
Sbjct: 129 PPCGCSEKRECTYQRTYAEQSSSAGLLVSDQLQLRDGAV-EVVFGCETKETGEIYNQEAD 187

Query: 262 GLMGLGRSDLSLVSQT--SEIFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYT 319
           G++GLG S++SLV+Q   S +   +F+ C  S +  GA    ++ G+    +    + YT
Sbjct: 188 GILGLGNSEVSLVNQLAGSGVIDDVFALCFGSVEGDGA----LMLGDVDAAEYDVALQYT 243

Query: 320 NMIPNPQLATFYILNLTGISIGGKQL--QASGFAKG-GILIDSGTVITRLPPSIYSALKA 376
            ++ +     +Y + L  + +GG+QL  +   + +G G ++DSGT  T LP   +   K 
Sbjct: 244 ALLSSLAHPHYYSVQLEALWVGGQQLPVKPERYEEGYGTVLDSGTTFTYLPSEAFQLFK- 302

Query: 377 EFLKQFS---GFPSAPG--------FSILDTCFNLSAYQ--------EVNIPLVKMEFEG 417
           E +  ++   G  S  G            D CF  + +         E   P+ +++F  
Sbjct: 303 EAVSAYALEHGLNSVKGPDPKEKSFAQFHDICFGGAPHAGHADQSKLEKVFPVFELQFAD 362

Query: 418 NAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFA 477
              +       ++    +    CL +          ++G    +N  V YD +N ++GF 
Sbjct: 363 GVRLRTGPLNYLFMHTGEMGAYCLGV--FDNGASGTLLGGISFRNILVQYDRRNRRVGFG 420

Query: 478 GEDCSSM 484
              C  +
Sbjct: 421 AASCQEI 427


>gi|226495667|ref|NP_001146721.1| uncharacterized protein LOC100280323 [Zea mays]
 gi|219888491|gb|ACL54620.1| unknown [Zea mays]
          Length = 557

 Score =  107 bits (266), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 101/376 (26%), Positives = 162/376 (43%), Gaps = 43/376 (11%)

Query: 134 NYIATIELGG--RNMTVIVDTGSDLTWVQCQ-PCKSCYNQQDPVFDPSISPSYKKVLCNS 190
            Y  +I +G   R   + VDTGSDLTW+QC  PC +      P++ P+     K V    
Sbjct: 186 QYYTSIFIGNPPRPYFLDVDTGSDLTWIQCDAPCTNFAKGPHPLYKPAKE---KIVPPRD 242

Query: 191 STCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL----GKASVNDFI 246
             C  L+   GN   C +     C+Y + Y D S + G L R+ + +    G     DF+
Sbjct: 243 LLCQELQ---GNQNYCETCK--QCDYEIEYADQSSSMGVLARDDMHMIATNGGREKLDFV 297

Query: 247 FGCGRNNKGLF----GGVSGLMGLGRSDLSLVSQTSE--IFGGLFSYCLPSTQDAGASGS 300
           FGC  + +G          G++GL  + +S  SQ +   I   +F +C+  T++ G  G 
Sbjct: 298 FGCAYDQQGQLLSSPAKTDGILGLSSAAISFPSQLASHGIIANVFGHCI--TREQGGGGY 355

Query: 301 LILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGG--ILID 358
           + LG +   +     +T+T++   P     Y      +  G +QL+    A     ++ D
Sbjct: 356 MFLGDD---YVPRWGVTWTSIRSGPD--NLYHTQAHHVKYGDQQLRRPEQAGSTVQVIFD 410

Query: 359 SGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFE-- 416
           SG+  T LP  IY  L A       GF        L  C+  + +    +  VK  FE  
Sbjct: 411 SGSSYTYLPNEIYENLVAAIKYASPGFVQDTSDRTLPLCWK-ADFPVRYLEDVKQFFEPL 469

Query: 417 ----GNAEMTVDVTGIV----YFVKSDASQVCLALASLSY--EDETGIIGNYQQKNQRVI 466
               G   + +  T  +    Y + SD   VCL L + +      T I+G+   + + V+
Sbjct: 470 NLHFGKKWLFMSKTFTISPEDYLIISDKGNVCLGLLNGTEINHGSTIIVGDVSLRGKLVV 529

Query: 467 YDTKNSQLGFAGEDCS 482
           YD +  Q+G+A  DC+
Sbjct: 530 YDNQRKQIGWADSDCT 545


>gi|357125326|ref|XP_003564345.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
           distachyon]
          Length = 506

 Score =  107 bits (266), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 109/381 (28%), Positives = 168/381 (44%), Gaps = 52/381 (13%)

Query: 135 YIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSC-----YNQQDPVFDPSISPSYKKVL 187
           Y   ++LG   +   V +DTGSD+ WV C PC  C      N Q   F+P  S +  ++ 
Sbjct: 89  YFTRVKLGNPAKEYFVQIDTGSDILWVACSPCTGCPTSSGLNIQLEFFNPDSSSTSSRIP 148

Query: 188 CNSSTCHALEFATGNSGVCSSSSPPD--CNYFVSYGDGSYTRG-----------ELGREH 234
           C+   C A    TG + VC SS  P   C Y  +YGDGS T G            +G E 
Sbjct: 149 CSDDRCTA-ALQTGEA-VCQSSDSPSSPCGYTFTYGDGSGTSGFYVSDTMYFDTVMGNEQ 206

Query: 235 LGLGKASVNDFIFGCGRNNKGLF----GGVSGLMGLGRSDLSLVSQTSE--IFGGLFSYC 288
                ASV   +FGC  +  G        V G+ G G+  LS+VSQ     +    FS+C
Sbjct: 207 TANSSASV---VFGCSNSQSGDLMKTDRAVDGIFGFGQHQLSVVSQLYSLGVSPKTFSHC 263

Query: 289 LPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL--Q 346
           L  + + G  G L+LG    + +    + +T ++P+      Y LNL  I++ G++L   
Sbjct: 264 LKGSDNGG--GILVLG---EIVEPG--LVFTPLVPS---QPHYNLNLESIAVSGQKLPID 313

Query: 347 ASGFAKG---GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAY 403
           +S FA     G ++DSGT +  L    Y           S    +     +  CF  ++ 
Sbjct: 314 SSLFATSNTQGTIVDSGTTLVYLVDGAYDPFINAIAAAVSPSVRSVVSKGIQ-CFVTTSS 372

Query: 404 QEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETG--IIGNYQQK 461
            + + P   + F+G   MTV     +    S  + V   L  + ++   G  I+G+   K
Sbjct: 373 VDSSFPTATLYFKGGVSMTVKPENYLLQQGSVDNNV---LWCIGWQRSQGITILGDLVLK 429

Query: 462 NQRVIYDTKNSQLGFAGEDCS 482
           ++  +YD  N ++G+A  DCS
Sbjct: 430 DKIFVYDLANMRMGWADYDCS 450


>gi|302789618|ref|XP_002976577.1| hypothetical protein SELMODRAFT_416663 [Selaginella moellendorffii]
 gi|300155615|gb|EFJ22246.1| hypothetical protein SELMODRAFT_416663 [Selaginella moellendorffii]
          Length = 437

 Score =  107 bits (266), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 108/371 (29%), Positives = 168/371 (45%), Gaps = 38/371 (10%)

Query: 135 YIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSST 192
           Y   I LG   + + VIVDTGSD+ WV+C PC+SC ++QD +  P    +      +S +
Sbjct: 83  YYTEIGLGNPVQKLKVIVDTGSDILWVKCSPCRSCLSKQD-IIPPLSIYNLSASSTSSVS 141

Query: 193 CHALEFATGNSGVCSSS-SPPDCNYFVSYGDGSYTRGELGREHLGL----GKASVNDFIF 247
             +    TG   VCS S +   C Y  SY D S + G   R+ +      G A+ +   F
Sbjct: 142 SCSDPLCTGEEVVCSRSGNNSACAYVSSYQDKSASVGAYVRDDMHYVLHGGNATTSRIFF 201

Query: 248 GCGRNNKGLFGGVSGLMGLGRSDLSLVSQ--TSEIFGGLFSYCLPSTQDAGASGSLILGG 305
           GC  N  G +  V G+MG G    ++ +Q  T      +FS+CL   +  G  G L  G 
Sbjct: 202 GCATNITGSW-PVDGIMGFGLISKTVPNQIATQRNMSRVFSHCLGGEKHGG--GILEFGE 258

Query: 306 NSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL----QASGFAKG-----GIL 356
                 N+T + +T ++    + T Y ++L  IS+  K L    +   + +      G++
Sbjct: 259 AP----NTTEMVFTPLL---NVTTHYNVDLLSISVNSKVLPIDPKEFSYVRNSTNNTGVI 311

Query: 357 IDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSA--YQEVNIPLVKME 414
           IDSGT    L       L  E +K  +     P    L+ CF L +    E + P V + 
Sbjct: 312 IDSGTTFVLLTTKANRMLFQE-IKSLTTAKLGPKLEGLE-CFYLKSGLTMETSFPNVTLT 369

Query: 415 FEGNAEMTV--DVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNS 472
           F G + M +  D   ++   K   +  C A +S    D   I G    K++ V YD +N 
Sbjct: 370 FSGGSTMKLKPDNYLVMAEYKKKRNGYCYAWSS---ADGLTIFGEIVLKDKLVFYDVENR 426

Query: 473 QLGFAGEDCSS 483
           ++G+ G++CSS
Sbjct: 427 RIGWKGQNCSS 437


>gi|449482385|ref|XP_004156266.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 491

 Score =  106 bits (265), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 106/382 (27%), Positives = 168/382 (43%), Gaps = 57/382 (14%)

Query: 135 YIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQ-----QDPVFDPSISPSYKKVL 187
           Y   ++LG   R   V +DTGSD+ WV C PC  C +      +  +FD + S S + + 
Sbjct: 84  YFTKVKLGNPAREFNVQIDTGSDILWVTCSPCDGCPDSSGLGIELNLFDTTKSSSARVLP 143

Query: 188 CNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLG----LGKASVN 243
           C    C A+   T     C + +   C+Y   Y D S T G    + +     LG++++ 
Sbjct: 144 CTDPICAAVSTTTDQ---CLTQT-DHCSYSFHYRDRSGTSGFYVTDSMHFDILLGESTIA 199

Query: 244 D----FIFGCG--------RNNKGLFGGVSGLMGLGRSDLSLVSQTSE--IFGGLFSYCL 289
           +     +FGC         R  K L     G+ G G+ + S++SQ S   I   +FS+CL
Sbjct: 200 NSSATIVFGCSIYQYGDLTRATKAL----DGIFGFGQGEFSVISQLSSRGITPKVFSHCL 255

Query: 290 PSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL-QAS 348
              ++ G  G L+LG    + + S  I Y+ +IP+      Y L L  I++ G+     +
Sbjct: 256 KGGENGG--GILVLG---EILEPS--IVYSPLIPS---QPHYTLKLQSIALSGQLFPNPT 305

Query: 349 GFA---KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQE 405
            F     G  +IDSGT +  L   +Y  + +      S   + P  S    CF +S    
Sbjct: 306 MFPISNAGETIIDSGTTLAYLVEEVYDWIVSVITSAVSQ-SATPTISRGSQCFRVSMSVA 364

Query: 406 VNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASL------SYEDETGIIGNYQ 459
              P+++  FEG A M V     + F   D+   C   ASL        ED   I+G+  
Sbjct: 365 DIFPVLRFNFEGIASMVVTPEEYLQF---DSIVSCYKFASLWCIGFQKAEDGLNILGDLV 421

Query: 460 QKNQRVIYDTKNSQLGFAGEDC 481
            K++ ++YD    ++G+A  DC
Sbjct: 422 LKDKIIVYDLAQQRIGWANYDC 443


>gi|21717160|gb|AAM76353.1|AC074196_11 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433304|gb|AAP54833.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
 gi|125575544|gb|EAZ16828.1| hypothetical protein OsJ_32300 [Oryza sativa Japonica Group]
          Length = 419

 Score =  106 bits (265), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 108/376 (28%), Positives = 168/376 (44%), Gaps = 43/376 (11%)

Query: 134 NYIATIELGGRNMTV--IVDTGSDLTWVQCQPCKS--CYNQQDPVFDPSISPSYKKVLCN 189
           +Y+A   +G     V  IVD   +L W QC  C+S  C+ Q+ PVFDPS S +Y+   C 
Sbjct: 61  HYVANFTIGTPPQAVSGIVDLSGELVWTQCAACRSSGCFKQELPVFDPSASNTYRAEQCG 120

Query: 190 SSTCHALEFATGNSGVCSSSSPPDCNYFVS--YGDGSYTRGELGREHLGLGKASVNDFIF 247
           S  C ++   T N   CS     +C Y     +GD   T G    + + +G A      F
Sbjct: 121 SPLCKSIP--TRN---CSGDG--ECGYEAPSMFGD---TFGIASTDAIAIGNAE-GRLAF 169

Query: 248 GC----GRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLIL 303
           GC      +  G   G SG +GLGR+  SLV Q++      FSYCL +    G   +L L
Sbjct: 170 GCVVASDGSIDGAMDGPSGFVGLGRTPWSLVGQSNVT---AFSYCL-ALHGPGKKSALFL 225

Query: 304 GGNSSV--FKNSTPIT-----YTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGGIL 356
           G ++ +     S P T     + +   +     +Y + L GI  G   + A+    G I 
Sbjct: 226 GASAKLAGAGKSNPPTPLLGQHASNTSDDGSDPYYTVQLEGIKAGDVAVAAASSGGGAIT 285

Query: 357 I---DSGTVITRLPPSIYSALKAEFLKQFSGFPS-APGFSILDTCFNLSAYQEVNIPLVK 412
           +   ++   ++ LP + Y AL+ + +    G PS A      D CF  +A     +P + 
Sbjct: 286 VLQLETFRPLSYLPDAAYQALE-KVVTAALGSPSMANPPEPFDLCFQNAAVS--GVPDLV 342

Query: 413 MEFEGNAEMTVDVTGIVYFVKSDASQVCLALASL----SYEDETGIIGNYQQKNQRVIYD 468
             F+G A +T   +  +    +    VCL++ S     S +D   I+G+  Q+N   ++D
Sbjct: 343 FTFQGGATLTAQPSKYLLGDGNGNGTVCLSILSSTRLDSADDGVSILGSLLQENVHFLFD 402

Query: 469 TKNSQLGFAGEDCSSM 484
            +   L F   DCSS+
Sbjct: 403 LEKETLSFEPADCSSL 418


>gi|125532793|gb|EAY79358.1| hypothetical protein OsI_34487 [Oryza sativa Indica Group]
          Length = 419

 Score =  106 bits (264), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 109/375 (29%), Positives = 167/375 (44%), Gaps = 43/375 (11%)

Query: 135 YIATIELGGRNMTV--IVDTGSDLTWVQCQPCKS--CYNQQDPVFDPSISPSYKKVLCNS 190
           Y+A   +G     V  IVD   +L W QC  C+S  C+ Q+ PVFDPS S +Y+   C S
Sbjct: 62  YVANFTIGTPPQAVSGIVDLSGELVWTQCAACRSSGCFKQELPVFDPSASNTYRAEQCGS 121

Query: 191 STCHALEFATGNSGVCSSSSPPDCNYFVS--YGDGSYTRGELGREHLGLGKASVNDFIFG 248
             C ++   T N   CS     +C Y     +GD   T G    + + +G A      FG
Sbjct: 122 PLCKSIP--TRN---CSGDG--ECGYEAPSMFGD---TFGIASTDAIAIGNAE-GRLAFG 170

Query: 249 C----GRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAGASGSLILG 304
           C      +  G   G SG +GLGR+  SLV Q++      FSYCL +    G   +L LG
Sbjct: 171 CVVASDGSIDGAMDGPSGFVGLGRTPWSLVGQSNVT---AFSYCL-APHGPGKKSALFLG 226

Query: 305 GNSSV--FKNSTPIT-----YTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGGILI 357
            ++ +     S P T     + +   +     +Y + L GI  G   + A+    G I I
Sbjct: 227 ASAKLAGAGKSNPPTPLLGQHASNTSDDGSDPYYTVQLEGIKAGDVAVAAASSGGGAITI 286

Query: 358 ---DSGTVITRLPPSIYSALKAEFLKQFSGFPS-APGFSILDTCFNLSAYQEVNIPLVKM 413
              ++   ++ LP + Y AL+ + +    G PS A      D CF  +A     +P +  
Sbjct: 287 LQLETFRPLSYLPDAAYQALE-KVVTAALGSPSMANPPEPFDLCFQNAAVS--GVPDLVF 343

Query: 414 EFEGNAEMTVDVTGIVYFVKSDASQVCLALASL----SYEDETGIIGNYQQKNQRVIYDT 469
            F+G A +T   +  +    +    VCL++ S     S +D   I+G+  Q+N   ++D 
Sbjct: 344 TFQGGATLTAPPSKYLLGDGNGNGTVCLSILSSTRLDSADDGVSILGSLLQENVHFLFDL 403

Query: 470 KNSQLGFAGEDCSSM 484
           +   L F   DCSS+
Sbjct: 404 EKETLSFEPADCSSL 418


>gi|302817726|ref|XP_002990538.1| hypothetical protein SELMODRAFT_131679 [Selaginella moellendorffii]
 gi|300141706|gb|EFJ08415.1| hypothetical protein SELMODRAFT_131679 [Selaginella moellendorffii]
          Length = 434

 Score =  106 bits (264), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 116/374 (31%), Positives = 166/374 (44%), Gaps = 50/374 (13%)

Query: 135 YIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQD---PV--FDPSISPSYKKVL 187
           Y   ++LG   R   + VDTGSDL WV C PC  C    D   P+  +D   S S  KV 
Sbjct: 36  YFTQVQLGTPPRTYNLQVDTGSDLLWVNCHPCIGCPAFSDLKIPIVPYDVKASASSSKVP 95

Query: 188 CNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIF 247
           C+  +C  L      SG C+  +   C Y   YGDGS T G L  + L     +    IF
Sbjct: 96  CSDPSC-TLITQISESG-CNDQN--QCGYSFQYGDGSGTLGYLVEDVLHYMVNATATVIF 151

Query: 248 GCGRNNKGLFG----GVSGLMGLGRSDLSLVSQTSE--IFGGLFSYCLPSTQDAGASGSL 301
           GCG    G        + G++G G SDLS  SQ ++      +F++CL   +  G  G L
Sbjct: 152 GCGFKQSGDLSTSERALDGIIGFGASDLSFNSQLAKQGKTPNVFAHCLDGGERGG--GIL 209

Query: 302 ILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQA-----SGFAKGGIL 356
           +LG   +V +    I YT ++P     + Y + L  IS+    L       S     G +
Sbjct: 210 VLG---NVIEPD--IQYTPLVP---YMSHYNVVLQSISVNNANLTIDPKLFSNDVMQGTI 261

Query: 357 IDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFE 416
            DSGT +  LP   Y A       Q      AP F + DT   LS +     P V + FE
Sbjct: 262 FDSGTTLAYLPDEAYQA-----FTQAVSLVVAP-FLLCDT--RLSRFIYKLFPNVVLYFE 313

Query: 417 GNAEMTVDVTGIVYFVK----SDASQVCL---ALASLSYEDETGIIGNYQQKNQRVIYDT 469
           G A MT  +T   Y ++    ++A   C+   ++ S   E +  I G+   KN+ V+YD 
Sbjct: 314 G-ASMT--LTPAEYLIRQASAANAPIWCMGWQSMGSAESELQYTIFGDLVLKNKLVVYDL 370

Query: 470 KNSQLGFAGEDCSS 483
           +  ++G+   DC +
Sbjct: 371 ERGRIGWRPFDCKT 384


>gi|449457263|ref|XP_004146368.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 469

 Score =  106 bits (264), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 125/475 (26%), Positives = 200/475 (42%), Gaps = 81/475 (17%)

Query: 51  SSSCVSHQKSRIEMGAITLELKHKNYCSGKIVDWNEQQQNRLILDNL----HVQYLQSRI 106
           SS+ ++ + SR+       +L H+N     + D NE  ++R   +         +L+S+I
Sbjct: 27  SSTLITTKPSRLAT-----KLIHRNSYLHPLYDQNETVEDRSKREQTSSIERFDFLESKI 81

Query: 107 KNMIS-GNIKDVSNTEIPLTSGIRLQTLNYIATIELGGRNMT--VIVDTGSDLTWVQCQP 163
           K + S GN  +  ++ IP   G       ++  + +G   +T  V+VDTGS L WVQC P
Sbjct: 82  KELKSVGN--EARSSLIPFNRGS-----GFLVNLSIGSPPVTQLVVVDTGSSLLWVQCLP 134

Query: 164 CKSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDG 223
           C +C+ Q    FDP  S S+K + C     + +     N   C+  +  +  Y + Y  G
Sbjct: 135 CINCFQQSTSWFDPLKSVSFKTLGCGFPGYNYI-----NGYKCNRFNQAE--YKLRYLGG 187

Query: 224 SYTRGELGREHL------------------GLGKASVNDFIFGCGR-----NNKGLFGGV 260
             ++G L +E L                   + K   ++  FGCG      NN   + GV
Sbjct: 188 DSSQGILAKESLLFETLDEGRVFQYNAISTQISKIKKSNITFGCGHMNIKTNNDDAYNGV 247

Query: 261 SGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDA-GASGSLILGGNSSVFKNSTPITYT 319
            GL       +++ +Q     G  FSYC+    +       L+LG  S +  +STP+   
Sbjct: 248 FGLGAY--PHITMATQ----LGNKFSYCIGDINNPLYTHNHLVLGQGSYIEGDSTPLQI- 300

Query: 320 NMIPNPQLATFYILNLTGISIGGKQL-------QASGFAKGGILIDSGTVITRLPPSIYS 372
                     +Y+  L  IS+G K L       + S    GG+LIDSG   T+L    + 
Sbjct: 301 ------HFGHYYV-TLQSISVGSKTLKIDPNAFKISSDGSGGVLIDSGMTYTKLANGGFE 353

Query: 373 ALKAEFLKQFSGF----PSAPGFSILDTCFN-LSAYQEVNIPLVKMEFEGNAEMTVDVTG 427
            L  E +    G     P+   F  L  CF  + +   V  P V   F G A++ ++   
Sbjct: 354 LLYDEIVDLMKGLLERIPTQRKFEGL--CFKGVVSRDLVGFPAVTFHFAGGADLVLESGS 411

Query: 428 IVYFVKSDASQVCLA-LASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
           +  F +    + CLA L S S      +IG   Q+N  V +D +  ++ F   DC
Sbjct: 412 L--FRQHGGDRFCLAILPSNSELLNLSVIGILAQQNYNVGFDLEQMKVFFRRIDC 464


>gi|348690234|gb|EGZ30048.1| pepsin-like aspartic protease A1 [Phytophthora sojae]
          Length = 654

 Score =  106 bits (264), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 99/362 (27%), Positives = 162/362 (44%), Gaps = 45/362 (12%)

Query: 144 RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSYKKVLCNSSTCH-ALEFATGN 202
           +  +VI DTGS L    C  C  C +  D  F    S +   V C+    H   +  T  
Sbjct: 76  QRASVIADTGSGLMAFPCSGCDGCGSHTDQPFQADNSSTLIHVTCSQQQSHFQCKECTEK 135

Query: 203 SGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL-GKASVND----------FIFGCGR 251
           S  C+ S         SY +GS  +  +  + + L G++S +D          F FGC  
Sbjct: 136 SDTCAISQ--------SYMEGSSWKASVVEDVVYLGGESSFHDEAMRDRYGTHFQFGCQS 187

Query: 252 NNKGLFGG--VSGLMGLGRSDLSLVS---QTSEIFGGLFSYCLPSTQDAGASGSLILGGN 306
           +  GLF      G+MGL  SD  +V+   + ++I   LFS C         +G  +  G 
Sbjct: 188 SETGLFVTQVADGIMGLSNSDTHIVAKLHRENKIPSNLFSLCFTE------NGGTMSVGE 241

Query: 307 SSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQA--SGFAKGGILIDSGTVIT 364
            +   +   I+Y  +I +     FY +N+  I IGGK + A    + +G  ++DSGT  +
Sbjct: 242 PNTKAHRGEISYAKVIKDRSAGHFYNVNMKDIRIGGKSINAKEEAYTRGHYIVDSGTTDS 301

Query: 365 RLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEG----NAE 420
            LP     A+K EFL+ F    +   + +  +C   +     ++P +++  E     N E
Sbjct: 302 YLP----RAMKNEFLQVFKEV-AGRDYQVGTSCHGYTNEDLASLPKIQLVMEAYGDENGE 356

Query: 421 MTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGED 480
           + +D+    Y + +D S  C ++  LS E+  G+IG     N+ VI+D  N ++GF   D
Sbjct: 357 VIIDIPPEQYLLHNDNS-YCGSIY-LS-ENAGGVIGANLMMNRDVIFDNGNQRVGFVDAD 413

Query: 481 CS 482
           C+
Sbjct: 414 CA 415


>gi|212722898|ref|NP_001132197.1| pepsin A precursor [Zea mays]
 gi|194693730|gb|ACF80949.1| unknown [Zea mays]
 gi|195605492|gb|ACG24576.1| pepsin A [Zea mays]
 gi|413938914|gb|AFW73465.1| pepsin A [Zea mays]
          Length = 519

 Score =  106 bits (264), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 104/368 (28%), Positives = 158/368 (42%), Gaps = 42/368 (11%)

Query: 135 YIATIELGGRNMTVIV--DTGSDLTWV-----QCQPCKSCYNQQDP---VFDPSISPSYK 184
           Y A +++G    + +V  DTGSDL WV     QC P  S     D    ++ P+ S + +
Sbjct: 100 YYAWVDVGTPTTSFLVALDTGSDLFWVPCDCIQCAPLSSYRGNLDRDLGIYKPAESTTSR 159

Query: 185 KVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSY-GDGSYTRGELGREHLGL----GK 239
            + C+   C         SG  +   P  C Y + Y  + + + G L  + L L    G 
Sbjct: 160 HLPCSHELCQP------GSGCTNPKQP--CTYNIDYFSENTTSSGLLIEDSLHLNSREGH 211

Query: 240 ASVN-DFIFGCGRNNKGLF-GGVS--GLMGLGRSDLSLVS--QTSEIFGGLFSYCLPSTQ 293
           A VN   I GCGR   G +  G++  GL+GLG +D+S+ S    + +    FS C     
Sbjct: 212 APVNASVIIGCGRKQSGDYLDGIAPDGLLGLGMADISVPSFLARAGLVRNSFSMCF---- 267

Query: 294 DAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKG 353
              +SG +  G      + STP      +P       Y +N+    IG K L+ S F   
Sbjct: 268 KEDSSGRIFFGDQGVSSQQSTPF-----VPLYGKLQTYAVNVDKSCIGHKCLEGSSFQA- 321

Query: 354 GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKM 413
             L+DSGT  T LPP +Y A   EF KQ +        S    C++ S  +  ++P + +
Sbjct: 322 --LVDSGTSFTSLPPDVYKAFTTEFDKQINASRVPYEDSTWKYCYSASPLEMPDVPTIIL 379

Query: 414 EFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQ 473
            F  N      V  I+ F     +     LA L   +  GIIG        V++D ++ +
Sbjct: 380 AFAANKSFQA-VNPILPFNDEQGALARFCLAVLPSTEPIGIIGQNFLVGYHVVFDRESMK 438

Query: 474 LGFAGEDC 481
           LG+   +C
Sbjct: 439 LGWYRSEC 446


>gi|326520109|dbj|BAK03979.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 463

 Score =  106 bits (264), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 103/390 (26%), Positives = 177/390 (45%), Gaps = 33/390 (8%)

Query: 105 RIKNMISGNIKD-VSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQC 161
           R+ N +S    D  + + +   +G+     +Y+  ++LG   +   V+VDT S L+WV C
Sbjct: 95  RLANRLSSCPADEATASGLIFANGVPWDYYSYVTQVQLGTPAKTHNVLVDTASSLSWVGC 154

Query: 162 QPC-KSCYNQQDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSY 220
           +PC  +C     P F+P+ S +YK V C S+ C+A+  AT     C + +   C+Y  SY
Sbjct: 155 EPCINACL---IPTFNPNASSTYKVVGCGSALCNAVPSATMARKSCMAPT-EGCSYRQSY 210

Query: 221 GDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEI 280
            D S + G +  + L  G  S   FIFGC    +G+ G  SG++G+  +  SL SQ +  
Sbjct: 211 HDYSLSVGVVSSDTLTYGLGS-QKFIFGCCNLFRGVGGRYSGILGMSVNKFSLFSQMT-- 267

Query: 281 FGGLF---SYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTG 337
            G  +   SYC P  ++ G     +  G     K+    T   +  N      Y ++++ 
Sbjct: 268 VGHRYRAMSYCFPHPRNQG----FLQFGRYDEHKSLLRFTPLYIDGNN-----YFVHVSN 318

Query: 338 ISIGGKQL--QASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILD 395
           + +    L  Q+SG        D+GT  T LP S++ +L         G+    G S   
Sbjct: 319 VMVETMSLDVQSSGNQTMRCFFDTGTPYTMLPQSLFVSLSDTVGNLVEGYYRV-GASTGQ 377

Query: 396 TCFNLSAYQ---EVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDET 452
           TCF         ++ +P VK+EF+  A +T++   +++  + +    CLA       D  
Sbjct: 378 TCFQADGNWIEGDLYMPTVKIEFQNGARITLNSEDLMFMEEPNV--FCLAFKMNDGGDI- 434

Query: 453 GIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
            ++G+        + D +   +G  G+ C+
Sbjct: 435 -VLGSRHLMGVHTVVDLEMMTMGLRGQGCN 463


>gi|356531884|ref|XP_003534506.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 482

 Score =  106 bits (264), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 103/382 (26%), Positives = 167/382 (43%), Gaps = 53/382 (13%)

Query: 135 YIATIELGGRNMTVIVDTGSDLTWVQCQPCKSCYNQ-----QDPVFDPSISPSYKKVLCN 189
           Y   I LG  +  V VDTGSD  WV C  C +C  +     +  ++DP+ S + K V C+
Sbjct: 77  YYTKIGLGPNDYYVQVDTGSDTLWVNCVGCTTCPKKSGLGMELTLYDPNSSKTSKVVPCD 136

Query: 190 SSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA-----SVND 244
              C     +T +  +        C Y ++YGDGS T G   ++ L   +      +V D
Sbjct: 137 DEFCT----STYDGPISGCKKDMSCPYSITYGDGSTTSGSYIKDDLTFDRVVGDLRTVPD 192

Query: 245 ---FIFGCGRNNKGLFGGVS-----GLMGLGRSDLSLVSQTSEI--FGGLFSYCLPSTQD 294
               IFGCG    G     +     G++G G+++ S++SQ +       +FS+CL +   
Sbjct: 193 NTSVIFGCGSKQSGTLSSTTDTSLDGIIGFGQANSSVLSQLAAAGKVKRVFSHCLDTVN- 251

Query: 295 AGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQA-----SG 349
               G   +G        +TP+        P++A + ++ L  I + G  +Q        
Sbjct: 252 --GGGIFAIGEVVQPKVKTTPLV-------PRMAHYNVV-LKDIEVAGDPIQLPTDIFDS 301

Query: 350 FAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILD--TCFNLSAYQEVN 407
            +  G +IDSGT +  LP SIY  L  + L Q SG      + + D  TCF+ S  + ++
Sbjct: 302 TSGRGTIIDSGTTLAYLPVSIYDQLLEKTLAQRSGMEL---YLVEDQFTCFHYSDEKSLD 358

Query: 408 --IPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETG----IIGNYQQK 461
              P VK  FE    +T      ++  K D    C+     + + + G    ++G+    
Sbjct: 359 DAFPTVKFTFEEGLTLTAYPHDYLFPFKED--MWCIGWQKSTAQTKDGKDLILLGDLVLT 416

Query: 462 NQRVIYDTKNSQLGFAGEDCSS 483
           N+  IYD  N  +G+   +CSS
Sbjct: 417 NKLFIYDLDNMSIGWTDYNCSS 438


>gi|195645150|gb|ACG42043.1| aspartic proteinase Asp1 precursor [Zea mays]
          Length = 415

 Score =  106 bits (264), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 112/401 (27%), Positives = 177/401 (44%), Gaps = 68/401 (16%)

Query: 118 SNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQ-PCKSCYNQQDPV 174
           S+T +    G    T +Y  T+ +G   +   + VDTGSDLTW+QC  PC+SC     P+
Sbjct: 36  SSTAVFQLQGDVYPTGHYYVTMNIGNPAKPYFLDVDTGSDLTWLQCDAPCRSCNKVPHPL 95

Query: 175 FDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREH 234
           + P+ +   + V C ++ C AL    G++  C S  P  C+Y + Y D + ++G L  + 
Sbjct: 96  YRPTAN---RLVPCANALCTALHSGQGSNNKCPS--PKQCDYQIKYTDSASSQGVLINDS 150

Query: 235 LGLGKASVN---DFIFGCGRN-----NKGLFGGVSGLMGLGRSDLSLVSQTSE--IFGGL 284
             L   S N      FGCG +     N  +   + G++GLGR  +SLVSQ  +  I   +
Sbjct: 151 FSLPMRSSNIRPGLTFGCGYDQQVGKNGAVQAAIDGMLGLGRGSVSLVSQLKQQGITKNV 210

Query: 285 FSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLAT--FY-----ILNLTG 337
             +CL +       G  +  G+  V     P +    +P  Q  +  +Y      L    
Sbjct: 211 VGHCLSTN-----GGGFLFFGDDVV-----PSSRVTWVPMAQRTSGNYYSPGSGTLYFDR 260

Query: 338 ISIGGKQLQASGFAKGGILIDSGTVITRLPPSIY----SALK---AEFLKQFSGFPSAP- 389
            S+G K ++        ++ DSG+  T      Y    SALK   ++ LKQ S  P+ P 
Sbjct: 261 RSLGVKPME--------VVFDSGSTYTYFTAQPYQAVVSALKGGLSKSLKQVSD-PTLPL 311

Query: 390 ---GFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLAL--- 443
              G     + F++    E     +      NA M +      Y + +    VCL +   
Sbjct: 312 CWKGQKAFKSVFDVK--NEFKSMFLSFSSAKNAAMEIPPEN--YLIVTKNGNVCLGILDG 367

Query: 444 --ASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
             A LS+     +IG+   ++Q VIYD + SQLG+A   C+
Sbjct: 368 TAAKLSFN----VIGDITMQDQMVIYDNEKSQLGWARGACT 404


>gi|223973231|gb|ACN30803.1| unknown [Zea mays]
          Length = 459

 Score =  106 bits (264), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 121/418 (28%), Positives = 184/418 (44%), Gaps = 64/418 (15%)

Query: 119 NTEIPLTSGIRLQTLN-YIATIELG--GRNMTVIVDTGSDLTWVQCQ---PCKSCYNQQD 172
           +  +P T+ +   +   Y  T  LG   + + V++DTGS LTWV C     C++C +   
Sbjct: 50  HPSVPATAALYPHSYGGYAFTASLGTPPQPLPVLLDTGSHLTWVPCTSSYECRNCSSPSA 109

Query: 173 ---PVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSS--SPPDCN-----------Y 216
              PVF P  S S + V C + +C  +  A   +  C  +  SP   N           Y
Sbjct: 110 SAVPVFHPKNSSSSRLVGCRNPSCQWVHSAANLATKCRRAPCSPGAANCPAAASNVCPPY 169

Query: 217 FVSYGDGSYTRGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQ 276
            V YG GS T G L  + L     +V  F+ GC   +  +    SGL G GR   S+ +Q
Sbjct: 170 AVVYGSGS-TAGLLIADTLRAPGRAVPGFVLGCSLVS--VHQPPSGLAGFGRGAPSVPAQ 226

Query: 277 TSEIFGGL--FSYCLPSTQ---DAGASGSLILGGNSSVFK-NSTPITYTNMIPNPQLATF 330
                 GL  FSYCL S +   +A  SGSL+LGG          P+  +          +
Sbjct: 227 L-----GLPKFSYCLLSRRFDDNAAVSGSLVLGGTGGGEGMQYVPLVKSAAGDKLPYGVY 281

Query: 331 YILNLTGISIGGKQLQ-------ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFS 383
           Y L L G+++GGK ++       A+    GG ++DSGT  T L P+++  +    +    
Sbjct: 282 YYLALRGVTVGGKAVRLPARAFAANAAGSGGTIVDSGTTFTYLDPTVFQPVADAVVAAVG 341

Query: 384 GF----PSAPGFSILDTCFNLS-AYQEVNIPLVKMEFEGNAEMTVDVTGIVYFV---KSD 435
           G       A     L  CF L    + + +P +   FEG A M + V    YFV   +  
Sbjct: 342 GRYKRSKDAEDELGLHPCFALPQGARSMALPELSFHFEGGAVMQLPVEN--YFVVAGRGA 399

Query: 436 ASQVCLALASLSYEDETG----------IIGNYQQKNQRVIYDTKNSQLGFAGEDCSS 483
              +CLA+ +  +   +G          I+G++QQ+N  V YD +  +LGF  + C+S
Sbjct: 400 VEAICLAVVT-DFSGGSGAGNEGSGPAIILGSFQQQNYLVEYDLEKERLGFRRQSCTS 456


>gi|219886219|gb|ACL53484.1| unknown [Zea mays]
 gi|219888509|gb|ACL54629.1| unknown [Zea mays]
 gi|414588374|tpg|DAA38945.1| TPA: nucellin-like aspartic protease [Zea mays]
          Length = 415

 Score =  105 bits (263), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 112/401 (27%), Positives = 177/401 (44%), Gaps = 68/401 (16%)

Query: 118 SNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQ-PCKSCYNQQDPV 174
           S+T +    G    T +Y  T+ +G   +   + VDTGSDLTW+QC  PC+SC     P+
Sbjct: 36  SSTAVFQLQGDVYPTGHYYVTMNIGNPAKPYFLDVDTGSDLTWLQCDAPCRSCNKVPHPL 95

Query: 175 FDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREH 234
           + P+ +   + V C ++ C AL    G++  C S  P  C+Y + Y D + ++G L  + 
Sbjct: 96  YRPTAN---RLVPCANALCTALHSGQGSNNKCPS--PKQCDYQIKYTDSASSQGVLINDS 150

Query: 235 LGLGKASVN---DFIFGCGRN-----NKGLFGGVSGLMGLGRSDLSLVSQTSE--IFGGL 284
             L   S N      FGCG +     N  +   + G++GLGR  +SLVSQ  +  I   +
Sbjct: 151 FSLPMRSSNIRPGLTFGCGYDQQVGKNGAVQAAIDGMLGLGRGSVSLVSQLKQQGITKNV 210

Query: 285 FSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLAT--FY-----ILNLTG 337
             +CL +       G  +  G+  V     P +    +P  Q  +  +Y      L    
Sbjct: 211 VGHCLSTN-----GGGFLFFGDDVV-----PSSRVTWVPMAQRTSGNYYSPGSGTLYFDR 260

Query: 338 ISIGGKQLQASGFAKGGILIDSGTVITRLPPSIY----SALK---AEFLKQFSGFPSAP- 389
            S+G K ++        ++ DSG+  T      Y    SALK   ++ LKQ S  P+ P 
Sbjct: 261 RSLGVKPME--------VVFDSGSTYTYFTAQPYQAVVSALKGGLSKSLKQVSD-PTLPL 311

Query: 390 ---GFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLAL--- 443
              G     + F++    E     +      NA M +      Y + +    VCL +   
Sbjct: 312 CWKGQKAFKSVFDVK--NEFKSMFLSFASAKNAAMEIPPEN--YLIVTKNGNVCLGILDG 367

Query: 444 --ASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
             A LS+     +IG+   ++Q VIYD + SQLG+A   C+
Sbjct: 368 TAAKLSFN----VIGDITMQDQMVIYDNEKSQLGWARGACT 404


>gi|18390865|ref|NP_563808.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|11993877|gb|AAG42922.1|AF329505_1 unknown protein [Arabidopsis thaliana]
 gi|20260142|gb|AAM12969.1| unknown protein [Arabidopsis thaliana]
 gi|22136092|gb|AAM91124.1| unknown protein [Arabidopsis thaliana]
 gi|332190140|gb|AEE28261.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 492

 Score =  105 bits (263), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 118/416 (28%), Positives = 182/416 (43%), Gaps = 61/416 (14%)

Query: 98  HVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELGG--RNMTVIVDTGSD 155
           H + LQS +  +++  +   S+   P   G+      Y   ++LG   R   V +DTGSD
Sbjct: 56  HGRLLQSPVGGVVNFPVDGASD---PFLVGL------YYTKVKLGTPPREFNVQIDTGSD 106

Query: 156 LTWVQCQPCKSCYNQ-----QDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSS 210
           + WV C  C  C        Q   FDP +S S   V C+   C++      N    S  S
Sbjct: 107 VLWVSCTSCNGCPKTSELQIQLSFFDPGVSSSASLVSCSDRRCYS------NFQTESGCS 160

Query: 211 PPD-CNYFVSYGDGSYTRGELGREHLG--------LGKASVNDFIFGCGRNNKGLF---- 257
           P + C+Y   YGDGS T G    + +         L   S   F+FGC     G      
Sbjct: 161 PNNLCSYSFKYGDGSGTSGYYISDFMSFDTVITSTLAINSSAPFVFGCSNLQSGDLQRPR 220

Query: 258 GGVSGLMGLGRSDLSLVSQTSE--IFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTP 315
             V G+ GLG+  LS++SQ +   +   +FS+CL   +  G  G ++LG    + +  T 
Sbjct: 221 RAVDGIFGLGQGSLSVISQLAVQGLAPRVFSHCLKGDKSGG--GIMVLG---QIKRPDT- 274

Query: 316 ITYTNMIPNPQLATFYILNLTGISIGGKQLQAS----GFAKG-GILIDSGTVITRLPPSI 370
             YT ++P+      Y +NL  I++ G+ L         A G G +ID+GT +  LP   
Sbjct: 275 -VYTPLVPS---QPHYNVNLQSIAVNGQILPIDPSVFTIATGDGTIIDTGTTLAYLPDEA 330

Query: 371 YSALKAEFLKQFSGFPSAPGFSILDT---CFNLSAYQEVNIPLVKMEFEGNAEMTVDVTG 427
           YS     F++  +   S  G  I      CF ++A      P V + F G A M +    
Sbjct: 331 YS----PFIQAVANAVSQYGRPITYESYQCFEITAGDVDVFPQVSLSFAGGASMVLGPRA 386

Query: 428 IVYFVKSDASQV-CLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
            +    S  S + C+    +S+   T I+G+   K++ V+YD    ++G+A  DCS
Sbjct: 387 YLQIFSSSGSSIWCIGFQRMSHRRIT-ILGDLVLKDKVVVYDLVRQRIGWAEYDCS 441


>gi|297800470|ref|XP_002868119.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297313955|gb|EFH44378.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 499

 Score =  105 bits (263), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 123/409 (30%), Positives = 171/409 (41%), Gaps = 88/409 (21%)

Query: 150 VDTGSDLTWVQCQP--CKSCYNQQDPVFDPSISPSYKKV-------------------LC 188
           +DTGSDL W  C+P  C  C ++  P   P    S                       LC
Sbjct: 98  LDTGSDLVWFPCRPFTCILCESKPLPPSPPPTLSSSATTVSCSSPSCSAAHSSLPSSDLC 157

Query: 189 NSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVNDFIFG 248
             S C      TG+   C++SS P   ++ +YGDGS    +L  + L L   SV +F FG
Sbjct: 158 AISNCPLDYIETGD---CNTSSYPCPPFYYAYGDGSLV-AKLFSDSLSLPSVSVANFTFG 213

Query: 249 CGRNNKGLFGGVSGLMGLGRSDLSLVSQ---TSEIFGGLFSYCLPS----TQDAGASGSL 301
           C            G+ G GR  LSL +Q    S   G  FSYCL S    +        L
Sbjct: 214 CAHTT---LAEPIGVAGFGRGRLSLPAQLSVHSPHLGNSFSYCLVSHSFDSDRVRRPSPL 270

Query: 302 ILG-----------------GNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQ 344
           ILG                       K      +T M+ NP+   FY ++L GISIG + 
Sbjct: 271 ILGRFVDKKEKRVATTDDDDDGDETKKKKNEFVFTEMLVNPKHPYFYSVSLQGISIGKRN 330

Query: 345 LQASGFAK-------GGILIDSGTVITRLPPSIYSALKAEFLKQFSGF--------PSAP 389
           + A    +       GG+++DSGT  T LP   Y+++  EF  +            PS  
Sbjct: 331 IPAPAMLRRIDKNGGGGVVVDSGTTFTMLPAKFYNSVVEEFDSRVGRVHERADRVEPS-- 388

Query: 390 GFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFV--------KSDASQV-C 440
             S +  C+ L+  Q V +P + + F GN   TV +    YF         K +  +V C
Sbjct: 389 --SGMSPCYYLN--QTVKVPALVLHFAGNGS-TVTLPRRNYFYEFMDGGDGKEEKRKVGC 443

Query: 441 LALASLSYEDE----TG-IIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
           L L +   E E    TG I+GNYQQ+   V+YD  N ++GFA   C+S+
Sbjct: 444 LMLMNGGDESELRGGTGAILGNYQQQGFEVVYDLLNRRVGFAKRKCASL 492


>gi|224079535|ref|XP_002305886.1| predicted protein [Populus trichocarpa]
 gi|222848850|gb|EEE86397.1| predicted protein [Populus trichocarpa]
          Length = 436

 Score =  105 bits (263), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 113/368 (30%), Positives = 169/368 (45%), Gaps = 59/368 (16%)

Query: 148 VIVDTGSDLTWVQCQPCKSCYNQQDP--VFDPSISPSYKKVLCNSSTCHAL--EFATGNS 203
           +I+DTGS L+W+QC   K    +  P  VFDPS+S S+  + CN   C     +F    S
Sbjct: 92  MILDTGSQLSWIQCH--KKVPRKPPPSTVFDPSLSSSFSVLPCNHPLCKPRIPDFTLPTS 149

Query: 204 GVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIFGCGRN---NKGLFGG 259
             C  +    C+Y   Y DG+   G L RE +    + S    I GC  +   +KG+ G 
Sbjct: 150 --CDLNR--LCHYSYFYADGTLAEGNLVREKITFSTSQSTPPLILGCAEDASDDKGILG- 204

Query: 260 VSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQDAG---ASGSLILG--GNSSVFKNST 314
               M LGR  LS  SQ        FSYC+P+ Q       +GS  LG   NS+ F+  +
Sbjct: 205 ----MNLGR--LSFASQAKIT---KFSYCVPTRQVRPGFTPTGSFYLGENPNSAGFQYIS 255

Query: 315 PITYTNMIPNPQLATF-YILNLTGISIGGKQL-------QASGFAKGGILIDSGTVITRL 366
            +T++     P L    + + L GI IG K+L       +A     G  +IDSG+  T L
Sbjct: 256 LLTFSQSQRMPNLDPLAHTVALQGIRIGNKKLNIPVSAFRADPSGAGQSMIDSGSEFTYL 315

Query: 367 PPSIYSALKAEFLKQFSGFPSAPGF---SILDTCFNLSAYQEVNIPLVKM--EFEGNAEM 421
               Y+ ++ E ++  +G     G+    + D CF+ +A  E+   +  M  EF+   E+
Sbjct: 316 VDVAYNKVREEVVR-LAGPRLKKGYVYSGVSDMCFDGNA-MEIGRLIGNMVFEFDKGVEI 373

Query: 422 TV-------DVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQL 474
            +       DV G V+ V    S++  A         + IIGN+ Q+N  V +D  N ++
Sbjct: 374 VIEKGRVLADVGGGVHCVGIGRSEMLGA--------ASNIIGNFHQQNLWVEFDIANRRV 425

Query: 475 GFAGEDCS 482
           GF   DCS
Sbjct: 426 GFGKADCS 433


>gi|297849132|ref|XP_002892447.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297338289|gb|EFH68706.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 493

 Score =  105 bits (263), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 115/413 (27%), Positives = 178/413 (43%), Gaps = 55/413 (13%)

Query: 98  HVQYLQSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELGG--RNMTVIVDTGSD 155
           H + LQS +  +++  +   S+   P   G+      Y   ++LG   R   V +DTGSD
Sbjct: 56  HGRLLQSPVGGVVNFPVDGASD---PFLVGL------YYTKVKLGTPPREFNVQIDTGSD 106

Query: 156 LTWVQCQPCKSCYNQ-----QDPVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSS 210
           + WV C  C  C        Q   FDP +S S   V C+   C++      N    S  S
Sbjct: 107 VLWVSCTSCNGCPKTSELQIQLSFFDPGVSSSASLVSCSDRRCYS------NFQTESGCS 160

Query: 211 PPD-CNYFVSYGDGSYTRGELGREHLG--------LGKASVNDFIFGCGRNNKGLF---- 257
           P + C+Y   YGDGS T G    + +         L   S   F+FGC     G      
Sbjct: 161 PNNLCSYSFKYGDGSGTSGFYISDFMSFDTVITSTLAINSSAPFVFGCSNLQTGDLQRPR 220

Query: 258 GGVSGLMGLGRSDLSLVSQTSE--IFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTP 315
             V G+ GLG+  LS++SQ +   +   +FS+CL   +  G  G ++LG    + +  T 
Sbjct: 221 RAVDGIFGLGQGSLSVISQLAVQGLAPRVFSHCLKGDKSGG--GIMVLG---QIKRPDT- 274

Query: 316 ITYTNMIPNPQLATFYILNLTGISIGGKQLQAS----GFAKG-GILIDSGTVITRLPPSI 370
             YT ++P+      Y +NL  I++ G+ L         A G G +ID+GT +  LP   
Sbjct: 275 -VYTPLVPS---QPHYNVNLQSIAVNGQILPIDPSVFTIATGDGTIIDTGTTLAYLPDEA 330

Query: 371 YSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVY 430
           YS          S +     +     CF ++A      P V + F G A M +     + 
Sbjct: 331 YSPFIQAIANAVSQYGRPITYESYQ-CFEITAGDVDVFPEVSLSFAGGASMVLRPHAYLQ 389

Query: 431 FVKSDASQV-CLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
              S  S + C+    +S+   T I+G+   K++ V+YD    ++G+A  DCS
Sbjct: 390 IFSSSGSSIWCIGFQRMSHRRIT-ILGDLVLKDKVVVYDLVRQRIGWAEYDCS 441


>gi|357481195|ref|XP_003610883.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355512218|gb|AES93841.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 315

 Score =  105 bits (263), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 96/309 (31%), Positives = 134/309 (43%), Gaps = 30/309 (9%)

Query: 188 CNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL----GK-ASV 242
           C+S  CH L+     +GVCS      CNY   YGD S T+G L ++        GK  S+
Sbjct: 21  CDSPLCHKLD-----TGVCSPEK--RCNYTYGYGDNSLTKGVLAQDTATFTSNTGKLVSL 73

Query: 243 NDFIFGCGRNNKGLFGGVS-GLMGLGRSDLSLVSQTSEIFGG-LFSYCL-PSTQDAGASG 299
           + F+FGCG NN G F     GL+GLG    SL+SQ   +FGG  FS CL P   D   S 
Sbjct: 74  SRFLFGCGHNNTGGFNDHEMGLIGLGGGPTSLISQIGPLFGGKKFSQCLVPFLTDIKISS 133

Query: 300 SLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQA-SGFAKGGILID 358
            +  G  S V  +   +  T ++   Q  T Y + L GIS+    L   S   KG +L+D
Sbjct: 134 RMSFGKGSQVLGDG--VVTTPLVQREQDMTSYFVTLLGISVEDTYLPMNSTIEKGNMLVD 191

Query: 359 SGTVITRLPPSIYSALKAEF-----LKQFSGFPSAPGFSILDTCFNLSAYQEVNIPLVKM 413
           SGT    LP  +Y  +  E      L+  +  PS        T  NL        P +  
Sbjct: 192 SGTPPNILPQQLYDRVYVEVKNNVPLELITNDPSLGPQLCYRTQTNLKG------PTLTY 245

Query: 414 EFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVIYDTKNSQ 473
            FEG   +   +   +          CLA+ + +     G+ GN+ Q N  + +D     
Sbjct: 246 HFEGANLLLTPIQTFIPPTPETKGVFCLAINNYT-NSNGGVYGNFAQSNYLIGFDLDRQV 304

Query: 474 LGFAGEDCS 482
           + F   DC+
Sbjct: 305 VSFKATDCT 313


>gi|326503488|dbj|BAJ86250.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 551

 Score =  105 bits (263), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 94/368 (25%), Positives = 161/368 (43%), Gaps = 39/368 (10%)

Query: 135 YIATIELGG--RNMTVIVDTGSDLTWVQCQ-PCKSCYNQQDPVFDPSISPSYKKVLCNSS 191
           Y  +I +G   R   + VDTGSDLTW+QC  PC +C     P++ P+     K V    S
Sbjct: 191 YYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLYKPAKE---KIVPPRDS 247

Query: 192 TCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGL----GKASVNDFIF 247
            C  L+   G+   C +     C+Y + Y D S + G L ++ + L    G     DF+F
Sbjct: 248 LCQELQ---GDQNYCETCK--QCDYEIEYADRSSSMGVLAKDDMHLIATNGGREKLDFVF 302

Query: 248 GCGRNNKGLF----GGVSGLMGLGRSDLSLVSQTSE--IFGGLFSYCLPSTQDAGASGSL 301
           GC  + +G          G++GL  + +SL SQ +   I   +F +C+  T++    G +
Sbjct: 303 GCAYDQQGQLLSSPAKTDGILGLSSAAISLPSQLASKGIISNVFGHCI--TRETNGGGYM 360

Query: 302 ILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGGILIDSGT 361
            LG +   +     +T+  +   P     Y      ++ G ++L A    +  ++ DSG+
Sbjct: 361 FLGDD---YVPRWGMTWAPIRGGPD--NLYHTEAQKVNYGDQELHAGNSVQ--VIFDSGS 413

Query: 362 VITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNL-----SAYQEVNIPLVKMEFE 416
             T LP  +Y  L     +    F      + L  C+       S ++ +N+   +  F 
Sbjct: 414 SYTYLPEEMYKNLIDAIKEDSPSFVQDSSDTTLPLCWKADFSVRSFFKPLNLHFGRRWFV 473

Query: 417 GNAEMTVDVTGIVYFVKSDASQVCLALASLSY--EDETGIIGNYQQKNQRVIYDTKNSQL 474
                T  +    Y + SD   VCL L + +      T I+G+   + + V+YD +  Q+
Sbjct: 474 --VPKTFTIVPDDYLIISDKGNVCLGLLNGTEINHGSTIIVGDVSLRGKLVVYDNERRQI 531

Query: 475 GFAGEDCS 482
           G+A  +C+
Sbjct: 532 GWANSECT 539


>gi|449476186|ref|XP_004154665.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
           2-like [Cucumis sativus]
          Length = 478

 Score =  105 bits (262), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 100/382 (26%), Positives = 167/382 (43%), Gaps = 54/382 (14%)

Query: 135 YIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQD-----PVFDPSISPSYKKVL 187
           Y A I +G    +  V VDTGSD+ WV C  C +C  + D      +++P  S +   + 
Sbjct: 73  YYARIGIGSPPNDFHVQVDTGSDILWVNCVGCSNCPKKSDIGVDLQLYNPKSSSTSTLIT 132

Query: 188 CNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVN---- 243
           C+   C     AT ++ +        C Y V YGDGS T G    +++ L +A  N    
Sbjct: 133 CDQPFCS----ATYDAPIPGCKPDLLCQYKVIYGDGSATAGYFVNDYIQLQRAVGNHKTS 188

Query: 244 ----DFIFGCGRNNKGLFG----GVSGLMGLGRSDLSLVSQTSEI--FGGLFSYCLPSTQ 293
                 +FGCG    G  G     + G++G G+++ S++SQ +       +F++CL S  
Sbjct: 189 ETNGSIVFGCGAKQSGELGSSSEALDGILGFGQANSSMISQLAATGKVKKIFAHCLDSIS 248

Query: 294 DAG--ASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ----- 346
             G  A G ++             +  T ++PN      Y + L G+ +G   L      
Sbjct: 249 GGGIFAIGEVV----------EPKLXNTPVVPN---QAHYNVVLNGVKVGDTALDLPLGL 295

Query: 347 -ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILD--TCFNLSAY 403
             + + +G I IDSGT +  LP SIY  L  + L      P     ++ D  TCF     
Sbjct: 296 FETSYKRGAI-IDSGTTLAYLPESIYLPLMEKIL---GAQPDLKLRTVDDQFTCFVFDKN 351

Query: 404 QEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYED--ETGIIGNYQQK 461
            +   P V  +FE +  +T+     ++ ++ D   V    +    +D  E  ++G+   +
Sbjct: 352 VDDGFPTVTFKFEESLILTIYPHEYLFQIRDDVWCVGWQNSGAQSKDGNEVTLLGDLVLQ 411

Query: 462 NQRVIYDTKNSQLGFAGEDCSS 483
           N+ V Y+ +N  +G+   +CSS
Sbjct: 412 NKLVYYNLENQTIGWTEYNCSS 433


>gi|297847186|ref|XP_002891474.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297337316|gb|EFH67733.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 578

 Score =  105 bits (262), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 110/401 (27%), Positives = 175/401 (43%), Gaps = 52/401 (12%)

Query: 116 DVSNTEIPLTSGIRLQTLNYIATIELG----GRNMTVIVDTGSDLTWVQCQ-PCKSCYNQ 170
           D S T  P+   +    L Y   I +G    G+   + +DTGSDLTW+QC  PC SC   
Sbjct: 180 DSSTTIFPVGGNVYPDGL-YYTRILVGKPEDGQYYHLDIDTGSDLTWIQCDAPCTSCAKG 238

Query: 171 QDPVFDPSISPSYKKVLCNSSTCHALEFATGN-SGVCSSSSPPDCNYFVSYGDGSYTRGE 229
            + ++ P      K  L  SS    +E      +  C S     C+Y + Y D SY+ G 
Sbjct: 239 ANQLYKPR-----KDNLVRSSEPFCVEVQRNQLTEHCESCH--QCDYEIEYADHSYSMGV 291

Query: 230 LGRE--HLGL--GKASVNDFIFGCGRNNKGLFGG----VSGLMGLGRSDLSLVSQTSE-- 279
           L ++  HL L  G  + +D +FGCG + +GL         G++GL R+ +SL SQ +   
Sbjct: 292 LTKDKFHLKLHNGSLAESDIVFGCGYDQQGLLLNTLLKTDGILGLSRAKISLPSQLASRG 351

Query: 280 IFGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGIS 339
           I   +  +CL S  D    G + +G +      S  +T+  M+ +P L   Y + +T +S
Sbjct: 352 IISNVVGHCLAS--DLNGEGYIFMGSD---LVPSHGMTWVPMLHHPHLEV-YQMQVTKMS 405

Query: 340 IGGKQLQASGF--AKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTC 397
            G   L   G     G +L D+G+  T  P   YS L    L++ S        S  D  
Sbjct: 406 YGNAMLSLDGENGRVGKVLFDTGSSYTYFPNQAYSQLVTS-LQEVSDLELTRDDS--DEA 462

Query: 398 FNLSAYQEVNIPL-----VKMEFE------GNAEMTVDVTGIV----YFVKSDASQVCLA 442
             +    + N P+     VK  F       G+  + +    ++    Y + S+   VCL 
Sbjct: 463 LPICWRAKTNSPISSLSDVKKFFRPITLQIGSKWLIISKKLLIQPEDYLIISNKGNVCLG 522

Query: 443 L--ASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDC 481
           +   S  ++  T IIG+   + + ++YD    ++G+   DC
Sbjct: 523 ILDGSNVHDGSTIIIGDISMRGRLIVYDNVKQRIGWMKSDC 563


>gi|224104765|ref|XP_002313558.1| predicted protein [Populus trichocarpa]
 gi|222849966|gb|EEE87513.1| predicted protein [Populus trichocarpa]
          Length = 468

 Score =  105 bits (262), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 115/384 (29%), Positives = 170/384 (44%), Gaps = 59/384 (15%)

Query: 135 YIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQDPV----------FDPSISPS 182
           Y   ++LG   R+  V +DTGSD+ WV C  C  C     PV          FDP  SP+
Sbjct: 52  YYTRLQLGTPPRDFYVQIDTGSDVLWVSCGSCNGC-----PVNSGLHIPLNFFDPGSSPT 106

Query: 183 YKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLG----LG 238
              + C+   C +L   + +S VCS+ +   C Y   YGDGS T G    + L     LG
Sbjct: 107 ASLISCSDQRC-SLGLQSSDS-VCSAQNNL-CGYNFQYGDGSGTSGYYVSDLLHFDTVLG 163

Query: 239 KASVND----FIFGCGRNNKGLFG----GVSGLMGLGRSDLSLVSQTSE--IFGGLFSYC 288
            + +N+     +FGC     G        V G+ G G+ D+S+VSQ +   I    FS+C
Sbjct: 164 GSVMNNSSAPIVFGCSALQTGDLTKSDRAVDGIFGFGQQDMSVVSQLASQGISPRAFSHC 223

Query: 289 LPSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQAS 348
           L      G  G L+LG    V  N   I YT ++P+      Y LN+  IS+ G+ L   
Sbjct: 224 LKGDDSGG--GILVLG--EIVEPN---IVYTPLVPS---QPHYNLNMQSISVNGQTLAID 273

Query: 349 GFAKG-----GILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSA-PGFSILDTCFNLSA 402
               G     G +IDSGT +  L  + Y    +      S  PS  P  S  + C+ +S+
Sbjct: 274 PSVFGTSSSQGTIIDSGTTLAYLAEAAYDPFISAITSIVS--PSVRPYLSKGNHCYLISS 331

Query: 403 YQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSD----ASQVCLALASLSYEDETGIIGNY 458
                 P V + F G A M +      Y ++      A+  C+    +  +  T I+G+ 
Sbjct: 332 SINDIFPQVSLNFAGGASMILIPQD--YLIQQSSIGGAALWCIGFQKIQGQGIT-ILGDL 388

Query: 459 QQKNQRVIYDTKNSQLGFAGEDCS 482
             K++  +YD  N ++G+A  DCS
Sbjct: 389 VLKDKIFVYDIANQRIGWANYDCS 412


>gi|449442641|ref|XP_004139089.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 478

 Score =  105 bits (262), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 100/382 (26%), Positives = 167/382 (43%), Gaps = 54/382 (14%)

Query: 135 YIATIELGG--RNMTVIVDTGSDLTWVQCQPCKSCYNQQD-----PVFDPSISPSYKKVL 187
           Y A I +G    +  V VDTGSD+ WV C  C +C  + D      +++P  S +   + 
Sbjct: 73  YYARIGIGSPPNDFHVQVDTGSDILWVNCVGCSNCPKKSDIGVDLQLYNPKSSSTSTLIT 132

Query: 188 CNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKASVN---- 243
           C+   C     AT ++ +        C Y V YGDGS T G    +++ L +A  N    
Sbjct: 133 CDQPFCS----ATYDAPIPGCKPDLLCQYKVIYGDGSATAGYFVNDYIQLQRAVGNHKTS 188

Query: 244 ----DFIFGCGRNNKGLFG----GVSGLMGLGRSDLSLVSQTSEI--FGGLFSYCLPSTQ 293
                 +FGCG    G  G     + G++G G+++ S++SQ +       +F++CL S  
Sbjct: 189 ETNGSIVFGCGAKQSGELGSSSEALDGILGFGQANSSMISQLAATGKVKKIFAHCLDSIS 248

Query: 294 DAG--ASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQ----- 346
             G  A G ++             +  T ++PN      Y + L G+ +G   L      
Sbjct: 249 GGGIFAIGEVV----------EPKLKTTPVVPN---QAHYNVVLNGVKVGDTALDLPLGL 295

Query: 347 -ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILD--TCFNLSAY 403
             + + +G I IDSGT +  LP SIY  L  + L      P     ++ D  TCF     
Sbjct: 296 FETSYKRGAI-IDSGTTLAYLPDSIYLPLMEKIL---GAQPDLKLRTVDDQFTCFVFDKN 351

Query: 404 QEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYED--ETGIIGNYQQK 461
            +   P V  +FE +  +T+     ++ ++ D   V    +    +D  E  ++G+   +
Sbjct: 352 VDDGFPTVTFKFEESLILTIYPHEYLFQIRDDVWCVGWQNSGAQSKDGNEVTLLGDLVLQ 411

Query: 462 NQRVIYDTKNSQLGFAGEDCSS 483
           N+ V Y+ +N  +G+   +CSS
Sbjct: 412 NKLVYYNLENQTIGWTEYNCSS 433


>gi|115457772|ref|NP_001052486.1| Os04g0334700 [Oryza sativa Japonica Group]
 gi|113564057|dbj|BAF14400.1| Os04g0334700 [Oryza sativa Japonica Group]
          Length = 482

 Score =  105 bits (262), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 106/415 (25%), Positives = 172/415 (41%), Gaps = 70/415 (16%)

Query: 103 QSRIKNMISGNIKDVSNTEIPLTSGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQ 160
           + R +N+++  +  +    IP  +G+      Y   I +G       V +DTGS   WV 
Sbjct: 58  RHRRRNLMAAELP-LGGFNIPYGTGL------YYTDIGIGTPAVKYYVQLDTGSKAFWVN 110

Query: 161 CQPCKSCYNQQDPV-----FDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPP--- 212
              CK C ++ D +     +DP  S S K+V C+ + C              +S PP   
Sbjct: 111 GISCKQCPHESDILRKLTFYDPRSSVSSKEVKCDDTIC--------------TSRPPCNM 156

Query: 213 --DCNYFVSYGDGSYTRGELGREHL--------GLGKASVNDFIFGCGRNNKGLFG---- 258
              C Y   Y DG  T G L  + L        G  + +     FGCG    G       
Sbjct: 157 TLRCPYITGYADGGLTMGILFTDLLHYHQLYGNGQTQPTSTSVTFGCGLQQSGSLNNSAV 216

Query: 259 GVSGLMGLGRSDLSLVSQTSEI--FGGLFSYCLPSTQDAGASGSLILGGNSSVFKNSTPI 316
            + G++G G S+ + +SQ +       +FS+CL ST   G      +G        +TPI
Sbjct: 217 AIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCLDSTNGGGI---FAIGEVVEPKVKTTPI 273

Query: 317 TYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKG-----GILIDSGTVITRLPPSIY 371
              N +       ++++NL  I++ G  LQ      G     G  IDSG+ +  LP  IY
Sbjct: 274 VKNNEV-------YHLVNLKSINVAGTTLQLPANIFGTTKTKGTFIDSGSTLVYLPEIIY 326

Query: 372 SALKAEFLKQFSGFPSAPGFSILD-TCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVY 430
           S L    L  F+  P     ++ +  CF+     +   P +   FE   ++T+DV    Y
Sbjct: 327 SEL---ILAVFAKHPDITMGAMYNFQCFHFLGSVDDKFPKITFHFEN--DLTLDVYPYDY 381

Query: 431 FVKSDASQVCLAL--ASLSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSS 483
            ++ + +Q C     A +    +  I+G+    N+ V+YD +   +G+   +CSS
Sbjct: 382 LLEYEGNQYCFGFQDAGIHGYKDMIILGDMVISNKVVVYDMEKQAIGWTEHNCSS 436


>gi|222622847|gb|EEE56979.1| hypothetical protein OsJ_06707 [Oryza sativa Japonica Group]
          Length = 494

 Score =  105 bits (262), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 112/398 (28%), Positives = 179/398 (44%), Gaps = 58/398 (14%)

Query: 121 EIPLT-SGIRLQTLNYIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQQD----- 172
           ++PL  SG+  +T  Y   I +G   +   V VDTGSD+ WV C  C  C  + +     
Sbjct: 75  DLPLGGSGLATETGLYFTRIGIGTPAKRYYVQVDTGSDILWVNCVSCDGCPRKSNLGIEL 134

Query: 173 PVFDPSISPSYKKVLCNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGR 232
            ++DP  S S + V C+   C A       S  C+S+SP  C Y +SYGDGS T G    
Sbjct: 135 TMYDPRGSQSGELVTCDQQFCVANYGGVLPS--CTSTSP--CEYSISYGDGSSTAGFFVT 190

Query: 233 EHLGLGKASVN--------DFIFGCGRNNKGLFG----GVSGLMGLGRSDLSLVSQTSEI 280
           + L   + S +           FGCG    G  G     + G++G G+S+ S++SQ +  
Sbjct: 191 DFLQYNQVSGDGQTTPANASVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAA 250

Query: 281 --FGGLFSYCLPSTQDAG--ASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLT 336
                +F++CL +    G  A G+++             +  T ++P+      Y + L 
Sbjct: 251 GKVRKMFAHCLDTVNGGGIFAIGNVV----------QPKVKTTPLVPD---MPHYNVILK 297

Query: 337 GISIGGKQLQ------ASGFAKGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPG 390
           GI +GG  L        SG +KG I IDSGT +  +P  +Y AL A    +         
Sbjct: 298 GIDVGGTALGLPTNIFDSGNSKGTI-IDSGTTLAYVPEGVYKALFAMVFDKHQDISVQ-- 354

Query: 391 FSILD-TCFNLSAYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYE 449
            ++ D +CF  S   +   P V   FEG+  + V      Y  ++  +  C+   +   +
Sbjct: 355 -TLQDFSCFQYSGSVDDGFPEVTFHFEGDVSLIVSPHD--YLFQNGKNLYCMGFQNGGGK 411

Query: 450 DETG----IIGNYQQKNQRVIYDTKNSQLGFAGEDCSS 483
            + G    ++G+    N+ V+YD +N  +G+A  +CSS
Sbjct: 412 TKDGKDLGLLGDLVLSNKLVLYDLENQAIGWADYNCSS 449


>gi|326513976|dbj|BAJ92138.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 342

 Score =  105 bits (262), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 91/275 (33%), Positives = 133/275 (48%), Gaps = 29/275 (10%)

Query: 227 RGELGREHLGLGKASVNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFS 286
           R EL RE + + +A      FGCG  + G   G SGLMGL    +SL+SQ S      FS
Sbjct: 80  RQELHREPVHVRRA----LGFGCGALSAGSLVGASGLMGLSPGTMSLISQLSV---PRFS 132

Query: 287 YCLPSTQDAGASGSLILGGNSSVFKNST--PITYTNMIPNPQLATF-YILNLTGISIGGK 343
           YCL    +   S  ++ G  + + K +T  PI  T ++ NP + TF Y + L G+S+G K
Sbjct: 133 YCLTPFAERKTS-PMLFGAMADLRKYNTTGPIQTTAILRNPAMDTFYYYVPLVGLSLGTK 191

Query: 344 QLQ--ASGFA-----KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPG-FSILD 395
           +L+  A+  A      GG ++DSG+ +  L    + A+K   L+     P   G     +
Sbjct: 192 RLRVPAASLAINPDGTGGTIVDSGSTMAHLAGKAFDAVKKAVLEAVK-LPVFNGTVEDYE 250

Query: 396 TCFNLS---AYQEVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYED-- 450
            CF +    A   V  P + + F+G A M +      YF +  A  +CLA+A  S ED  
Sbjct: 251 LCFAVPSGVAMAAVKTPPLVLHFDGGAAMALPRDN--YFQEPRAGLMCLAVAR-SPEDLG 307

Query: 451 -ETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCSSM 484
               IIGN QQ+N  V++D  N +  FA   C  +
Sbjct: 308 APISIIGNVQQQNMHVLFDVHNQKFSFAPTKCHDI 342


>gi|414887402|tpg|DAA63416.1| TPA: hypothetical protein ZEAMMB73_414910 [Zea mays]
          Length = 407

 Score =  105 bits (261), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 103/323 (31%), Positives = 139/323 (43%), Gaps = 62/323 (19%)

Query: 124 LTSGIRLQTLNYIATIELGGRNMTVIVDTGSDLTWVQCQPCKSCYNQQDPVFDPSISPSY 183
           LT+G     L YI T     +   +IVD+GS +T+V C  C+ C N QDP F P +S SY
Sbjct: 84  LTNGYYTTRL-YIGTPP---QEFALIVDSGSTVTYVPCASCEQCGNHQDPRFQPDLSSSY 139

Query: 184 KKVLCN-SSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKAS- 241
             V CN   TC               S    C Y   Y + S + G LG + +  G+ S 
Sbjct: 140 SPVKCNVDCTC--------------DSDKKQCTYERQYAEMSSSSGVLGEDIVSFGRESE 185

Query: 242 --VNDFIFGCGRNNKG-LFG-GVSGLMGLGRSDLSLVSQTSE--IFGGLFSYCLPSTQDA 295
                 +FGC  +  G LF     G+MGLGR  LS++ Q  E  +    FS C       
Sbjct: 186 LKAQRAVFGCENSETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVINDSFSLCYGGMDIG 245

Query: 296 GASGSLILGG----NSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGF- 350
           G  G+++LGG    +  VF  S P+           + +Y + L  I + GK L+     
Sbjct: 246 G--GAMVLGGVPTPSDMVFSRSDPLR----------SPYYNIELKEIHVAGKALRVDSRI 293

Query: 351 --AKGGILIDSGTVITRLPPSIYSAL------KAEFLKQFSGFPSAPGFSILDTCF---- 398
             +K G ++DSGT    LP   + A       K   LK+  G    P  S  D CF    
Sbjct: 294 FDSKHGTVLDSGTTYAYLPEQAFMAFKDAVTSKVHSLKKIRG----PDPSYKDICFAGAR 349

Query: 399 -NLSAYQEVNIPLVKMEFEGNAE 420
            N+S   EV  P V M F GN +
Sbjct: 350 RNVSKLHEV-FPDVDMVF-GNGQ 370


>gi|357469591|ref|XP_003605080.1| Aspartic proteinase Asp1 [Medicago truncatula]
 gi|355506135|gb|AES87277.1| Aspartic proteinase Asp1 [Medicago truncatula]
          Length = 425

 Score =  105 bits (261), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 104/381 (27%), Positives = 165/381 (43%), Gaps = 51/381 (13%)

Query: 135 YIATIELGG--RNMTVIVDTGSDLTWVQCQ----PCKSCYNQQDPVFDPSISPSYKKVLC 188
           Y  +I +G   +   + +DTGSDLTWVQC     PCK C   +D ++ P+     + V C
Sbjct: 62  YTVSINIGNPPKPYELDIDTGSDLTWVQCDGPDAPCKGCTMPKDKLYKPN---GKQVVKC 118

Query: 189 NSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGRE--HLGLGKASVNDFI 246
           +   C A +       +CS  SPP C Y V Y D + T G L R+  H+G   +S  D +
Sbjct: 119 SDPICVATQSTHVLGQICSKQSPP-CVYNVQYADHASTLGVLVRDYMHIGSPSSSTKDPL 177

Query: 247 --FGCGRNNKGLFGGVS-------GLMGLGRSDLSLVSQTSEI--FGGLFSYCLPSTQDA 295
             FGCG   K  F G +       G++GLG    S++SQ + I     +  +CL     A
Sbjct: 178 VAFGCGYEQK--FSGPTPPHSKPAGILGLGNGKTSILSQLTSIGFIHNVLGHCL----SA 231

Query: 296 GASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQLQASGFAKGGI 355
              G L LG     F  S+ I +T +I +  L   Y      +   GK   A G     I
Sbjct: 232 EGGGYLFLGDK---FVPSSGIVWTPIIQS-SLEKHYNTGPVDLFFNGKPTPAKGLQ---I 284

Query: 356 LIDSGTVITRLPPSIYSALKAEFLKQFSGFP-SAPGFSILDTCFN----LSAYQEVN--I 408
           + DSG+  T     +Y+ +         G P S      L  C+       +  EVN   
Sbjct: 285 IFDSGSSYTYFSSPVYTIVANMVNNDLKGKPLSRVKDPSLPICWKGVKPFKSLNEVNNYF 344

Query: 409 PLVKMEFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETG-----IIGNYQQKNQ 463
             + + F  +  +   +  + Y + +    VCL + +    +E G     ++G+   +++
Sbjct: 345 KPLTLSFTKSKNLQFQLPPVAYLIITKYGNVCLGILN---GNEAGLGNRNVVGDISLQDK 401

Query: 464 RVIYDTKNSQLGFAGEDCSSM 484
            V+YD +  Q+G+A  +C  +
Sbjct: 402 VVVYDNEKQQIGWASANCKQI 422


>gi|255583547|ref|XP_002532530.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223527742|gb|EEF29846.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 440

 Score =  104 bits (260), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 104/376 (27%), Positives = 166/376 (44%), Gaps = 40/376 (10%)

Query: 132 TLNYIATIELGGRNMT--VIVDTGSDLTWVQCQPCKSCYNQQDPV-FDPSISPSYKKVLC 188
           ++  I ++ +G    T  +++DTGS L+W+QC              FDPS+S S+  + C
Sbjct: 77  SMALIVSLPIGTPPQTQQMVLDTGSQLSWIQCHKKSVPKKPPPTTSFDPSLSSSFSVLPC 136

Query: 189 NSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLGLGKA-SVNDFIF 247
           N   C            C  +    C+Y   Y DG+Y  G L RE +    + S    I 
Sbjct: 137 NHPLCKPRIPDFTLPTTCDQNR--LCHYSYFYADGTYAEGSLVREKITFSSSQSTPPLIL 194

Query: 248 GCGR---NNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSYCLPSTQ-DAG--ASGSL 301
           GC     + KG+ G     M LGR   +  ++ S+     FSYC+P+ Q  AG  ++GS 
Sbjct: 195 GCAEASTDEKGILG-----MNLGRRSFASQAKISK-----FSYCVPTRQARAGLSSTGSF 244

Query: 302 ILGGN--SSVFKNSTPITYTNMIPNPQLATF-YILNLTGISIGGKQLQASGF-------A 351
            LG N  S  F+    +T+T    +P L    Y + + GI +G  +L  S          
Sbjct: 245 YLGNNPNSGRFQYINLLTFTPSQRSPNLDPLAYTIPMQGIRMGNARLNISATLFRPDPSG 304

Query: 352 KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGF---SILDTCFNLSAYQEVNI 408
            G  +IDSG+  T L    Y+ ++ E ++   G     G+    + D CF+ +   E+  
Sbjct: 305 AGQTIIDSGSEFTYLVDEAYNKVREEVVR-LVGPKLKKGYVYGGVSDMCFDGNP-MEIGR 362

Query: 409 PLVKM--EFEGNAEMTVDVTGIVYFVKSDASQVCLALASLSYEDETGIIGNYQQKNQRVI 466
            +  M  EFE   E+ +D   ++  V      + +  + +     + IIGN+ Q+N  V 
Sbjct: 363 LIGNMVFEFEKGVEIVIDKWRVLADVGGGVHCIGIGRSEM-LGAASNIIGNFHQQNLWVE 421

Query: 467 YDTKNSQLGFAGEDCS 482
           YD  N ++G    DCS
Sbjct: 422 YDLANRRIGLGKADCS 437


>gi|357162717|ref|XP_003579500.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 488

 Score =  104 bits (260), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 123/411 (29%), Positives = 178/411 (43%), Gaps = 79/411 (19%)

Query: 135 YIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYN--------QQDPVFDPSISPSYK 184
           Y  ++ LG   + + V++DTGS L+WV C     C N            VF P  S S +
Sbjct: 91  YAFSVSLGTPPQPLPVLLDTGSHLSWVPCTSSYQCRNCSSSPSAMSAMAVFHPKNSSSSR 150

Query: 185 KVLCNSSTCHALEF-------ATGNSG---VCSSSSPPDCNYFVSYGDGSYTRGELGREH 234
            V C +  C  +         +TGN+G   VC    PP   Y V YG GS T G L  + 
Sbjct: 151 LVGCRNPACRWIHSKSPSTCGSTGNNGNGDVC----PP---YLVVYGSGS-TSGLLISDT 202

Query: 235 LGLGKAS-------VNDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLVSQTSEIFGGLFSY 287
           L L  +S         +F  GC  +   +    SGL G GR   S+ SQ        FSY
Sbjct: 203 LRLSPSSSSSAPAPFRNFAIGC--SIVSVHQPPSGLAGFGRGAPSVPSQLKVP---KFSY 257

Query: 288 CLPSTQ---DAGASGSLILG-GNSSVFKNSTPITYTNMIPN----PQLATFYILNLTGIS 339
           CL S +   ++  SG L+LG       K  T + Y  ++ N    P  + +Y L LTGIS
Sbjct: 258 CLLSRRFDDNSAVSGELVLGDAMVPAGKKKTTMQYVPLLNNAASKPPYSVYYYLALTGIS 317

Query: 340 IGGK--QLQASGFAK---GGILIDSGTVITRLPPSIYSALKAEFLKQFSGF-----PSAP 389
           +GGK   L +  F     GG +IDSGT  T L P+++  + A       G      P   
Sbjct: 318 VGGKPVNLPSRAFVPSSGGGAIIDSGTTFTYLDPTVFKPVAAAMESAVGGRYNRSRPVED 377

Query: 390 GFSILDTCFNLSAYQ--EVNIPLVKMEFEGNAEMTVDVTGIVYFVKSDASQ--------V 439
               L  CF L       + +P ++++F+G A M + V    YFV +  +         +
Sbjct: 378 ALG-LRPCFALPPGPGGAMELPDLELKFKGGAVMRLPVEN--YFVAAGPAGGPAAGPVAI 434

Query: 440 CLALAS--------LSYEDETGIIGNYQQKNQRVIYDTKNSQLGFAGEDCS 482
           CLA+ S         +      I+G++QQ+N  + YD    +LGF  + C+
Sbjct: 435 CLAVVSDLPASGGDGAAAGPAIILGSFQQQNYHIEYDLGKERLGFRQQPCA 485


>gi|449451076|ref|XP_004143288.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 488

 Score =  104 bits (260), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 102/378 (26%), Positives = 167/378 (44%), Gaps = 52/378 (13%)

Query: 135 YIATIELG--GRNMTVIVDTGSDLTWVQCQPCKSCYNQ-----QDPVFDPSISPSYKKVL 187
           Y   ++LG   R   V +DTGSD+ WV C PC  C +      +  +FD + S S + + 
Sbjct: 84  YFTKVKLGNPAREFNVQIDTGSDILWVTCSPCDGCPDSSGLGIELNLFDTTKSSSARVLP 143

Query: 188 CNSSTCHALEFATGNSGVCSSSSPPDCNYFVSYGDGSYTRGELGREHLG----LGKASVN 243
           C    C A+   T     C + +   C+Y   Y D S T G    + +     LG++++ 
Sbjct: 144 CTDPICAAVSTTTDQ---CLTQT-DHCSYSFHYRDRSGTSGFYVTDSMHFDILLGESTIA 199

Query: 244 D----FIFGCG--------RNNKGLFGGVSGLMGLGRSDLSLVSQTSE--IFGGLFSYCL 289
           +     +FGC         R  K L     G+ G G+ + S++SQ S   I   +FS+CL
Sbjct: 200 NSSATIVFGCSIYQYGDLTRATKAL----DGIFGFGQGEFSVISQLSSRGITPKVFSHCL 255

Query: 290 PSTQDAGASGSLILGGNSSVFKNSTPITYTNMIPNPQLATFYILNLTGISIGGKQL-QAS 348
              ++ G  G L+LG    + + S  I Y+ +IP+      Y L L  I++ G+     +
Sbjct: 256 KGGENGG--GILVLG---EILEPS--IVYSPLIPS---QPHYTLKLQSIALSGQLFPNPT 305

Query: 349 GFA---KGGILIDSGTVITRLPPSIYSALKAEFLKQFSGFPSAPGFSILDTCFNLSAYQE 405
            F     G  +IDSGT +  L   +Y  + +      S   + P  S    CF +S    
Sbjct: 306 MFPISNAGETIIDSGTTLAYLVEEVYDWIVSVITSAVSQ-SATPTISRGSQCFRVSMSVA 364

Query: 406 VNIPLVKMEFEGNAEMTVDVTGIVYF--VKSDASQVCLALASLSYEDETGIIGNYQQKNQ 463
              P+++  FEG A M V     + F  +  + +  C+       ED   I+G+   K++
Sbjct: 365 DIFPVLRFNFEGIASMVVTPEEYLQFDSIVREPALWCIGFQKA--EDGLNILGDLVLKDK 422

Query: 464 RVIYDTKNSQLGFAGEDC 481
            ++YD    ++G+A  DC
Sbjct: 423 IIVYDLARQRIGWANYDC 440


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.318    0.134    0.397 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 7,647,504,905
Number of Sequences: 23463169
Number of extensions: 335431734
Number of successful extensions: 859219
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 894
Number of HSP's successfully gapped in prelim test: 2806
Number of HSP's that attempted gapping in prelim test: 849690
Number of HSP's gapped (non-prelim): 4478
length of query: 484
length of database: 8,064,228,071
effective HSP length: 147
effective length of query: 337
effective length of database: 8,910,109,524
effective search space: 3002706909588
effective search space used: 3002706909588
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 79 (35.0 bits)