BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 011045
         (495 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|118484458|gb|ABK94105.1| unknown [Populus trichocarpa]
          Length = 499

 Score =  635 bits (1637), Expect = e-179,   Method: Compositional matrix adjust.
 Identities = 317/460 (68%), Positives = 380/460 (82%), Gaps = 1/460 (0%)

Query: 37  VLDVSSALQQTEHILSFEPETLEPFAEESETAAESFPLNSSSSFSLPLHSREILHKTRHN 96
           +LDV+S+LQQ  +ILSF+ +T +     + T +     NSS SFSL LH RE ++K  H 
Sbjct: 39  ILDVASSLQQAHNILSFDLQTQKSSTHTTITTSTPSFSNSSLSFSLELHPRETIYKIHHK 98

Query: 97  DYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGS 156
           DY+SLVLSRL RD+ R N+L  +LQLA+ ++ + +LKP E +I PED STPV SG SQGS
Sbjct: 99  DYKSLVLSRLHRDTVRFNSLTARLQLALEDISKSDLKPLETEIKPEDLSTPVTSGTSQGS 158

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAA 216
           GEYF+R+GVG P RQF MVLDTGSDINWLQC+PCT+CYQQ+DPIFDP  SS+Y+P+ C +
Sbjct: 159 GEYFTRVGVGNPARQFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPTASSTYAPVTCQS 218

Query: 217 PQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLF 276
            QC SL++S+CR+ +CLYQV YGDGS+T GD  TE+VSFGNSGSVK +ALGCGHDNEGLF
Sbjct: 219 QQCSSLEMSSCRSGQCLYQVNYGDGSYTFGDFATESVSFGNSGSVKNVALGCGHDNEGLF 278

Query: 277 VGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSAR-GGDAVTAPLIRNK 335
           VG+AGLLGLGGG LSLT Q+KATS +YCLV+RDS  S  L+FNSA+ G D+VTAPL++N+
Sbjct: 279 VGAAGLLGLGGGPLSLTNQLKATSFSYCLVNRDSAGSSTLDFNSAQLGVDSVTAPLMKNR 338

Query: 336 KVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSF 395
           K+DTFYYVGL+G SVGGQ V IP S F +DE+G+GGIIVDCGTAITRLQTQAYN LRD+F
Sbjct: 339 KIDTFYYVGLSGMSVGGQMVSIPESTFRLDESGNGGIIVDCGTAITRLQTQAYNPLRDAF 398

Query: 396 VRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTF 455
           VR+  NLK TS VALFDTCYD SG  SVRVPTVS HF  GK+ +LPA NYLIPVDSAGT+
Sbjct: 399 VRMTQNLKLTSAVALFDTCYDLSGQASVRVPTVSFHFADGKSWNLPAANYLIPVDSAGTY 458

Query: 456 CFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           CFAFAPT+S+LSIIGNVQQQGTRV+FDLANNR+GF+PNKC
Sbjct: 459 CFAFAPTTSSLSIIGNVQQQGTRVTFDLANNRMGFSPNKC 498


>gi|312281631|dbj|BAJ33681.1| unnamed protein product [Thellungiella halophila]
          Length = 502

 Score =  630 bits (1626), Expect = e-178,   Method: Compositional matrix adjust.
 Identities = 325/500 (65%), Positives = 394/500 (78%), Gaps = 11/500 (2%)

Query: 6   PFVLFTITTILFSFCLFTS-ASSRGLSET-ATTVLDVSSALQQTEHILSFEPETLEPFAE 63
           P  L  ++ +  S CL T+ ASSR LS +  TTVLDV S+LQQT+HILS +P      A 
Sbjct: 4   PRFLSLLSVVTLSICLTTTDASSRSLSTSHKTTVLDVVSSLQQTQHILSVDPTRSSLTAR 63

Query: 64  ESETAAESFP--LNSSSSFSLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQ 121
             E   ES P  LNSSS  SL LHSR+ L  ++H DY+SLVLSRLERDS+RV  +  K++
Sbjct: 64  IPEFKPESDPVFLNSSSPLSLELHSRDTLVASQHKDYKSLVLSRLERDSSRVAGIAAKIR 123

Query: 122 LAIYNVDRHELKPA---EAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDT 178
            A+  +DR +LKP    E +  PED +TPVVSG SQGSGEYFSRIGVGTP ++  +VLDT
Sbjct: 124 FAVEGIDRSDLKPVDIDETRFQPEDLTTPVVSGTSQGSGEYFSRIGVGTPAKEMYVVLDT 183

Query: 179 GSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAY 238
           GSD+NW+QC PC+ECYQQSDPIFDP +SS++  L C+ P+C SLDVSACR+N+CLYQV+Y
Sbjct: 184 GSDVNWIQCLPCSECYQQSDPIFDPTSSSTFKSLTCSDPKCASLDVSACRSNKCLYQVSY 243

Query: 239 GDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKA 298
           GDGSFTVG+  T+TV+FG SG V  +ALGCGHDNEGLF G+AGLLGLGGG LS+T QIKA
Sbjct: 244 GDGSFTVGNYATDTVTFGESGKVNDVALGCGHDNEGLFTGAAGLLGLGGGALSMTNQIKA 303

Query: 299 TSLAYCLVDRDSPASGVLEFNSAR--GGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQ 356
            S +YCLVDRDS  S  L+FNS +   GDA TAPL+RN K+DTFYYVGL+GFSVGGQ V 
Sbjct: 304 KSFSYCLVDRDSAKSSSLDFNSVQIGAGDA-TAPLLRNSKMDTFYYVGLSGFSVGGQQVS 362

Query: 357 IPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKP-TSGVALFDTCY 415
           IP SLFE+D +G GG+I+DCGTA+TRLQTQAYNSLRD+FV+L  + K  TS ++LFDTCY
Sbjct: 363 IPSSLFEVDASGAGGVILDCGTAVTRLQTQAYNSLRDAFVKLTTDFKKGTSPISLFDTCY 422

Query: 416 DFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQ 475
           DFS L +V+VPTV+ HF  GK+L+LPAKNYLIP+D AGTFCFAFAPTSS+LSIIGNVQQQ
Sbjct: 423 DFSSLSTVKVPTVTFHFTGGKSLNLPAKNYLIPIDDAGTFCFAFAPTSSSLSIIGNVQQQ 482

Query: 476 GTRVSFDLANNRVGFTPNKC 495
           GTR+++DLANN +G + NKC
Sbjct: 483 GTRITYDLANNLIGLSANKC 502


>gi|297834758|ref|XP_002885261.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297331101|gb|EFH61520.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 500

 Score =  624 bits (1608), Expect = e-176,   Method: Compositional matrix adjust.
 Identities = 316/498 (63%), Positives = 393/498 (78%), Gaps = 9/498 (1%)

Query: 6   PFVLFTITTILFS-FCLFTSASSRGLS-ETATTVLDVSSALQQTEHILSFEPETLEPFAE 63
           P  L  +TT+  S F   T ASSR LS  T TTVLDV S+LQQT+ ILS +P      A 
Sbjct: 4   PRFLSLLTTVTLSLFLTATDASSRSLSTSTKTTVLDVVSSLQQTQTILSLDPTRSSLTAT 63

Query: 64  ESETAAESFPLNSSSSFSLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLA 123
           + E+ ++    NSSS  SL LHSR+ L  ++H DY+SLVLSRLERDS+RV  +  K++ A
Sbjct: 64  KPESISDPVFFNSSSPLSLELHSRDTLVASQHKDYKSLVLSRLERDSSRVAGIAAKIRFA 123

Query: 124 IYNVDRHELKPA---EAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGS 180
           +  +DR +LKP    + +  PE  +TPVVSG SQGSGEYFSRIGVGTP ++  +VLDTGS
Sbjct: 124 VEGIDRSDLKPVNNEDTRYQPEALTTPVVSGVSQGSGEYFSRIGVGTPAKEMYLVLDTGS 183

Query: 181 DINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGD 240
           D+NW+QC PC++CYQQSDP+F+P +SS+Y  L C+APQC  L+ SACR+N+CLYQV+YGD
Sbjct: 184 DVNWIQCEPCSDCYQQSDPVFNPTSSSTYKSLTCSAPQCSLLETSACRSNKCLYQVSYGD 243

Query: 241 GSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS 300
           GSFTVG+L T+TV+FGNSG +  +ALGCGHDNEGLF G+AGLLGLGGG LS+T Q+KATS
Sbjct: 244 GSFTVGELATDTVTFGNSGKINDVALGCGHDNEGLFTGAAGLLGLGGGALSITNQMKATS 303

Query: 301 LAYCLVDRDSPASGVLEFNSAR--GGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIP 358
            +YCLVDRDS  S  L+FNS +   GDA TAPL+RN+K+DTFYYVGL+GFSVGGQ V +P
Sbjct: 304 FSYCLVDRDSGKSSSLDFNSVQLGSGDA-TAPLLRNQKIDTFYYVGLSGFSVGGQKVMMP 362

Query: 359 PSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKP-TSGVALFDTCYDF 417
            ++F++D +G GG+I+DCGTA+TRLQTQAYNSLRD+F++L  NLK  TS ++LFDTCYDF
Sbjct: 363 DAIFDVDASGSGGVILDCGTAVTRLQTQAYNSLRDAFLKLTTNLKKGTSSISLFDTCYDF 422

Query: 418 SGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGT 477
           S L SV+VPTV+ HF  GK+LDLPAKNYLIPVD  GTFCFAFAPTSS+LSIIGNVQQQGT
Sbjct: 423 SSLSSVKVPTVAFHFTGGKSLDLPAKNYLIPVDDNGTFCFAFAPTSSSLSIIGNVQQQGT 482

Query: 478 RVSFDLANNRVGFTPNKC 495
           R+++DLAN  +G + NKC
Sbjct: 483 RITYDLANKIIGLSGNKC 500


>gi|255541796|ref|XP_002511962.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223549142|gb|EEF50631.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 495

 Score =  617 bits (1590), Expect = e-174,   Method: Compositional matrix adjust.
 Identities = 318/491 (64%), Positives = 390/491 (79%), Gaps = 3/491 (0%)

Query: 5   KPFVLFTITTILFSFCLFTSASSRGLSETATTVLDVSSALQQTEHILSFEPETLEPFAEE 64
           KPF  F + TI+FS  L  S        T TT+LDVSS+LQQ  +ILSF P+     +++
Sbjct: 8   KPF--FFLFTIIFSLTLALSRDLLPPHATKTTILDVSSSLQQALNILSFNPQQQTALSQQ 65

Query: 65  SETAAESFPLNSSSSFSLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAI 124
            +    + P   +SSFSL L+ R+ +HKT H DY++LVLSRL RDS+RV  + T+LQL +
Sbjct: 66  QQQTI-AIPSFLNSSFSLSLNPRDTIHKTPHKDYKALVLSRLHRDSSRVQAITTRLQLIL 124

Query: 125 YNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINW 184
             V + +LKP + +I P+D STPV SG SQGSGEYF+R+GVG P + + MVLDTGSDINW
Sbjct: 125 NGVSKSDLKPLQTEIQPQDLSTPVSSGTSQGSGEYFTRVGVGNPAKSYYMVLDTGSDINW 184

Query: 185 LQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFT 244
           +QC+PC++CYQQSDPIF P  SSSYSPL C + QC SL +S+CR  +C YQV YGDGSFT
Sbjct: 185 IQCQPCSDCYQQSDPIFTPAASSSYSPLTCDSQQCNSLQMSSCRNGQCRYQVNYGDGSFT 244

Query: 245 VGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYC 304
            GD VTET+SFG SG+V  IALGCGHDNEGLFVG+AGLLGLGGG LSLT Q+KATS +YC
Sbjct: 245 FGDFVTETMSFGGSGTVNSIALGCGHDNEGLFVGAAGLLGLGGGPLSLTSQLKATSFSYC 304

Query: 305 LVDRDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEM 364
           LV+RDS AS  L+FNSA  GD+V APL+++ K+DTFYYVGL+G SVGG+ ++IP  +F++
Sbjct: 305 LVNRDSAASSTLDFNSAPVGDSVIAPLLKSSKIDTFYYVGLSGMSVGGELLRIPQEVFKL 364

Query: 365 DEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVR 424
           D++GDGG+IVDCGTAITRLQ++AYNSLRDSFV ++ +L+ TSGVALFDTCYD SG  SV+
Sbjct: 365 DDSGDGGVIVDCGTAITRLQSEAYNSLRDSFVSMSRHLRSTSGVALFDTCYDLSGQSSVK 424

Query: 425 VPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLA 484
           VPTVS HF  GK+ DLPA NYLIPVDSAGT+CFAFAPT+S+LSIIGNVQQQGTRVSFDLA
Sbjct: 425 VPTVSFHFDGGKSWDLPAANYLIPVDSAGTYCFAFAPTTSSLSIIGNVQQQGTRVSFDLA 484

Query: 485 NNRVGFTPNKC 495
           NNRVGF+ NKC
Sbjct: 485 NNRVGFSTNKC 495


>gi|15229656|ref|NP_188478.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|75273882|sp|Q9LS40.1|ASPG1_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 1;
           Short=AtASPG1; Flags: Precursor
 gi|11994113|dbj|BAB01116.1| CND41, chloroplast nucleoid DNA binding protein-like [Arabidopsis
           thaliana]
 gi|23297732|gb|AAN13013.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
           thaliana]
 gi|332642583|gb|AEE76104.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 500

 Score =  615 bits (1585), Expect = e-173,   Method: Compositional matrix adjust.
 Identities = 306/484 (63%), Positives = 385/484 (79%), Gaps = 8/484 (1%)

Query: 19  FCLFTSASSRGLSET-ATTVLDVSSALQQTEHILSFEPETLEPFAEESETAAESFPLNSS 77
           F   T ASSR LS    T VLDV S+LQQT+ ILS +P        + E+ ++    NSS
Sbjct: 18  FLTTTDASSRSLSTPPKTNVLDVVSSLQQTQTILSLDPTRSSLTTTKPESLSDPVFFNSS 77

Query: 78  SSFSLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPA-- 135
           S  SL LHSR+    ++H DY+SL LSRLERDS+RV  ++ K++ A+  VDR +LKP   
Sbjct: 78  SPLSLELHSRDTFVASQHKDYKSLTLSRLERDSSRVAGIVAKIRFAVEGVDRSDLKPVYN 137

Query: 136 -EAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECY 194
            + +   ED +TPVVSGASQGSGEYFSRIGVGTP ++  +VLDTGSD+NW+QC PC +CY
Sbjct: 138 EDTRYQTEDLTTPVVSGASQGSGEYFSRIGVGTPAKEMYLVLDTGSDVNWIQCEPCADCY 197

Query: 195 QQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVS 254
           QQSDP+F+P +SS+Y  L C+APQC  L+ SACR+N+CLYQV+YGDGSFTVG+L T+TV+
Sbjct: 198 QQSDPVFNPTSSSTYKSLTCSAPQCSLLETSACRSNKCLYQVSYGDGSFTVGELATDTVT 257

Query: 255 FGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASG 314
           FGNSG +  +ALGCGHDNEGLF G+AGLLGLGGG+LS+T Q+KATS +YCLVDRDS  S 
Sbjct: 258 FGNSGKINNVALGCGHDNEGLFTGAAGLLGLGGGVLSITNQMKATSFSYCLVDRDSGKSS 317

Query: 315 VLEFNSAR--GGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGI 372
            L+FNS +  GGDA TAPL+RNKK+DTFYYVGL+GFSVGG+ V +P ++F++D +G GG+
Sbjct: 318 SLDFNSVQLGGGDA-TAPLLRNKKIDTFYYVGLSGFSVGGEKVVLPDAIFDVDASGSGGV 376

Query: 373 IVDCGTAITRLQTQAYNSLRDSFVRLAGNLKP-TSGVALFDTCYDFSGLRSVRVPTVSLH 431
           I+DCGTA+TRLQTQAYNSLRD+F++L  NLK  +S ++LFDTCYDFS L +V+VPTV+ H
Sbjct: 377 ILDCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSSISLFDTCYDFSSLSTVKVPTVAFH 436

Query: 432 FGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFT 491
           F  GK+LDLPAKNYLIPVD +GTFCFAFAPTSS+LSIIGNVQQQGTR+++DL+ N +G +
Sbjct: 437 FTGGKSLDLPAKNYLIPVDDSGTFCFAFAPTSSSLSIIGNVQQQGTRITYDLSKNVIGLS 496

Query: 492 PNKC 495
            NKC
Sbjct: 497 GNKC 500


>gi|19424106|gb|AAL87345.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
           thaliana]
          Length = 500

 Score =  614 bits (1583), Expect = e-173,   Method: Compositional matrix adjust.
 Identities = 306/484 (63%), Positives = 384/484 (79%), Gaps = 8/484 (1%)

Query: 19  FCLFTSASSRGLSET-ATTVLDVSSALQQTEHILSFEPETLEPFAEESETAAESFPLNSS 77
           F   T ASSR LS    T VLDV S+LQQT+ ILS +P        + E+ ++    NSS
Sbjct: 18  FLTTTDASSRSLSTPPKTNVLDVVSSLQQTQTILSLDPTRSSLTTTKPESLSDPVFFNSS 77

Query: 78  SSFSLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPA-- 135
           S  SL LHSR+    ++H DY+SL LSRLERDS+RV  ++ K++ A+  VDR +LKP   
Sbjct: 78  SPLSLELHSRDTFVASQHKDYKSLTLSRLERDSSRVAGIVAKIRFAVEGVDRSDLKPVYN 137

Query: 136 -EAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECY 194
            + +   ED +TPVVSGASQGSGEYFSRIGVGTP +   +VLDTGSD+NW+QC PC +CY
Sbjct: 138 EDTRYQTEDLTTPVVSGASQGSGEYFSRIGVGTPAKDMYLVLDTGSDVNWIQCEPCADCY 197

Query: 195 QQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVS 254
           QQSDP+F+P +SS+Y  L C+APQC  L+ SACR+N+CLYQV+YGDGSFTVG+L T+TV+
Sbjct: 198 QQSDPVFNPTSSSTYKSLTCSAPQCSLLETSACRSNKCLYQVSYGDGSFTVGELATDTVT 257

Query: 255 FGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASG 314
           FGNSG +  +ALGCGHDNEGLF G+AGLLGLGGG+LS+T Q+KATS +YCLVDRDS  S 
Sbjct: 258 FGNSGKINNVALGCGHDNEGLFTGAAGLLGLGGGVLSITNQMKATSFSYCLVDRDSGKSS 317

Query: 315 VLEFNSAR--GGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGI 372
            L+FNS +  GGDA TAPL+RNKK+DTFYYVGL+GFSVGG+ V +P ++F++D +G GG+
Sbjct: 318 SLDFNSVQLGGGDA-TAPLLRNKKIDTFYYVGLSGFSVGGEKVVLPDAIFDVDASGSGGV 376

Query: 373 IVDCGTAITRLQTQAYNSLRDSFVRLAGNLKP-TSGVALFDTCYDFSGLRSVRVPTVSLH 431
           I+DCGTA+TRLQTQAYNSLRD+F++L  NLK  +S ++LFDTCYDFS L +V+VPTV+ H
Sbjct: 377 ILDCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSSISLFDTCYDFSSLSTVKVPTVAFH 436

Query: 432 FGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFT 491
           F  GK+LDLPAKNYLIPVD +GTFCFAFAPTSS+LSIIGNVQQQGTR+++DL+ N +G +
Sbjct: 437 FTGGKSLDLPAKNYLIPVDDSGTFCFAFAPTSSSLSIIGNVQQQGTRITYDLSKNVIGLS 496

Query: 492 PNKC 495
            NKC
Sbjct: 497 GNKC 500


>gi|356531224|ref|XP_003534178.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 492

 Score =  608 bits (1567), Expect = e-171,   Method: Compositional matrix adjust.
 Identities = 318/462 (68%), Positives = 377/462 (81%), Gaps = 4/462 (0%)

Query: 35  TTVLDVSSALQQTEHILSFEPETLEPFAEESETAAESFPLNSSSSFSLPLHSREILHKTR 94
           T VLDVSS+L Q   ILSF P+ LE   + SET   + P +SSSSFSL LH RE L   +
Sbjct: 34  TNVLDVSSSLHQAHQILSFNPQLLE--EQSSETETPTSPSSSSSSFSLQLHPRETLLNEQ 91

Query: 95  HNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQIL-PEDFSTPVVSGAS 153
           H +Y++LVLSRL RD+ARVN+L TKLQLA+ +++R +L P E ++L PED STPV SG +
Sbjct: 92  HPNYKTLVLSRLARDTARVNSLNTKLQLALSSLNRSDLYPTETELLRPEDLSTPVSSGTA 151

Query: 154 QGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLP 213
           QGSGEYFSR+GVG P + F MVLDTGSD+NWLQC+PC++CYQQSDPIFDP  SSSY+PL 
Sbjct: 152 QGSGEYFSRVGVGQPSKPFYMVLDTGSDVNWLQCKPCSDCYQQSDPIFDPTASSSYNPLT 211

Query: 214 CAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNE 273
           C A QC+ L++SACR  +CLYQV+YGDGSFTVG+ VTETVSFG +GSV  +A+GCGHDNE
Sbjct: 212 CDAQQCQDLEMSACRNGKCLYQVSYGDGSFTVGEYVTETVSFG-AGSVNRVAIGCGHDNE 270

Query: 274 GLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGDAVTAPLIR 333
           GLFVGSAGLLGLGGG LSLT QIKATS +YCLVDRDS  S  LEFNS R GD+V APL++
Sbjct: 271 GLFVGSAGLLGLGGGPLSLTSQIKATSFSYCLVDRDSGKSSTLEFNSPRPGDSVVAPLLK 330

Query: 334 NKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRD 393
           N+KV+TFYYV LTG SVGG+ V +PP  F +D++G GG+IVD GTAITRL+TQAYNS+RD
Sbjct: 331 NQKVNTFYYVELTGVSVGGEIVTVPPETFAVDQSGAGGVIVDSGTAITRLRTQAYNSVRD 390

Query: 394 SFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAG 453
           +F R   NL+P  GVALFDTCYD S L+SVRVPTVS HF   +A  LPAKNYLIPVD AG
Sbjct: 391 AFKRKTSNLRPAEGVALFDTCYDLSSLQSVRVPTVSFHFSGDRAWALPAKNYLIPVDGAG 450

Query: 454 TFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           T+CFAFAPT+S++SIIGNVQQQGTRVSFDLAN+ VGF+PNKC
Sbjct: 451 TYCFAFAPTTSSMSIIGNVQQQGTRVSFDLANSLVGFSPNKC 492


>gi|225455876|ref|XP_002275164.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
          Length = 496

 Score =  605 bits (1561), Expect = e-170,   Method: Compositional matrix adjust.
 Identities = 302/473 (63%), Positives = 373/473 (78%), Gaps = 4/473 (0%)

Query: 27  SRGLS---ETATTVLDVSSALQQTEHILSFEPETLEPFAEESETAAESFPLNSSSSFSLP 83
           SR LS   ++ ++VLDVS ++++T  +LS +    +P  +  E      P + +SSFSL 
Sbjct: 24  SRELSLDTDSHSSVLDVSGSIRKTLDVLSHKSSVSKPSDQRDEKTTSFSPTSLASSFSLE 83

Query: 84  LHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQIL-PE 142
           LH RE+LH   H DYR+L+LSRL RDSARV  + TKLQLA+   D+ +L P + +IL P+
Sbjct: 84  LHPRELLHGGSHKDYRALMLSRLARDSARVKAINTKLQLAVSGTDKSDLVPMDTEILHPQ 143

Query: 143 DFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFD 202
           DFSTPV SG SQGSGEYF R+G+G P + F MV+DTGSD+NWLQC+PC +CYQQ DPIFD
Sbjct: 144 DFSTPVTSGTSQGSGEYFLRVGIGRPSKTFYMVIDTGSDVNWLQCKPCDDCYQQVDPIFD 203

Query: 203 PKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVK 262
           P +SSS+S L C  PQC++LDV ACR + CLYQV+YGDGS+TVGD  TETVSFGNSGSV 
Sbjct: 204 PASSSSFSRLGCQTPQCRNLDVFACRNDSCLYQVSYGDGSYTVGDFATETVSFGNSGSVD 263

Query: 263 GIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSAR 322
            +A+GCGHDNEGLFVG+AGL+GLGGG LSLT QIKA+S +YCLV+RDS  S  LEFNSA+
Sbjct: 264 KVAIGCGHDNEGLFVGAAGLIGLGGGPLSLTSQIKASSFSYCLVNRDSVDSSTLEFNSAK 323

Query: 323 GGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITR 382
             D+VTAP+ +N KVDTFYYVG+TG SVGG+ + IPPS+FE+D +G GGIIVDCGTA+TR
Sbjct: 324 PSDSVTAPIFKNSKVDTFYYVGITGMSVGGEKLAIPPSIFEVDGSGKGGIIVDCGTAVTR 383

Query: 383 LQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPA 442
           LQTQAYN+LRD+FV+L  +L  TSG ALFDTCY+ S   SVRVPTV+  F  GK+L LP 
Sbjct: 384 LQTQAYNALRDTFVKLTKDLPSTSGFALFDTCYNLSSRTSVRVPTVAFLFDGGKSLPLPP 443

Query: 443 KNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            NYLIPVDSAGTFC AFAPT+++LSIIGNVQQQGTRV++DLAN++V F+  KC
Sbjct: 444 SNYLIPVDSAGTFCLAFAPTTASLSIIGNVQQQGTRVTYDLANSQVSFSSRKC 496


>gi|449486856|ref|XP_004157423.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 491

 Score =  598 bits (1541), Expect = e-168,   Method: Compositional matrix adjust.
 Identities = 300/458 (65%), Positives = 367/458 (80%), Gaps = 4/458 (0%)

Query: 38  LDVSSALQQTEHILSFEPETLEPFAEESETAAESFPLNSSSSFSLPLHSREILHKTRHND 97
           LDVS++LQQ   +L F+P     F ++        P NSS SFSL LH R+ LH   H D
Sbjct: 38  LDVSASLQQANQVLKFDPTASISFQQQVHLV----PSNSSFSFSLQLHPRDSLHNAGHKD 93

Query: 98  YRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSG 157
           Y+SLVLSRL RDS+RV ++  +L+ A+  + R +L+P + +ILPED STP++SG SQGSG
Sbjct: 94  YKSLVLSRLSRDSSRVKSIYDRLEFALSELKRSDLEPLKTEILPEDLSTPIISGTSQGSG 153

Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAP 217
           EYFSR+GVG P + F MVLDTGSDINWLQC+PCT+CYQQ+DPIFDP++SSS++ LPC + 
Sbjct: 154 EYFSRVGVGQPAKPFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPRSSSSFASLPCESQ 213

Query: 218 QCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFV 277
           QC++L+ S CRA++CLYQV+YGDGSFTVG+ VTET++FGNSG +  +A+GCGHDNEGLFV
Sbjct: 214 QCQALETSGCRASKCLYQVSYGDGSFTVGEFVTETLTFGNSGMINDVAVGCGHDNEGLFV 273

Query: 278 GSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNKKV 337
           GSAGLLGLGGG LSLT Q+KA+S +YCLVDRDS +S  LEFNSA   D+V APL+++ KV
Sbjct: 274 GSAGLLGLGGGPLSLTSQMKASSFSYCLVDRDSSSSSDLEFNSAAPSDSVNAPLLKSGKV 333

Query: 338 DTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVR 397
           DTFYYVGLTG SVGGQ + IPP+LF+MD++G GGIIVD GTAITRLQTQAYN+LRD+FV 
Sbjct: 334 DTFYYVGLTGMSVGGQLLSIPPNLFQMDDSGYGGIIVDSGTAITRLQTQAYNTLRDAFVS 393

Query: 398 LAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCF 457
               LK T+G ALFDTCYD S    V +PTVS  F  GK+L LP KNYLIPVDS GTFCF
Sbjct: 394 RTPYLKKTNGFALFDTCYDLSSQSRVTIPTVSFEFAGGKSLQLPPKNYLIPVDSVGTFCF 453

Query: 458 AFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           AFAPT+S+LSIIGNVQQQGTRV +DLAN+ VGF+P+KC
Sbjct: 454 AFAPTTSSLSIIGNVQQQGTRVHYDLANSVVGFSPHKC 491


>gi|449439383|ref|XP_004137465.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 491

 Score =  597 bits (1538), Expect = e-168,   Method: Compositional matrix adjust.
 Identities = 299/458 (65%), Positives = 366/458 (79%), Gaps = 4/458 (0%)

Query: 38  LDVSSALQQTEHILSFEPETLEPFAEESETAAESFPLNSSSSFSLPLHSREILHKTRHND 97
           LDVS++LQQ   +L F+P     F ++        P NSS SFSL LH R+ LH   H D
Sbjct: 38  LDVSASLQQANQVLKFDPTASISFQQQVHLV----PSNSSFSFSLQLHPRDSLHNAGHKD 93

Query: 98  YRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSG 157
           Y+SLVLSRL RDS+RV ++  +L+ A+  + R +L+P + +ILPED STP++SG SQGSG
Sbjct: 94  YKSLVLSRLSRDSSRVKSIYDRLEFALSELKRSDLEPLKTEILPEDLSTPIISGTSQGSG 153

Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAP 217
           EYFSR+GVG P + F MVLDTGSDINWLQC+PCT+CYQQ+DPIFDP++SSS++ LPC + 
Sbjct: 154 EYFSRVGVGQPAKPFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPRSSSSFASLPCESQ 213

Query: 218 QCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFV 277
           QC++L+ S CRA++CLYQV+YGDGSFTVG+ V ET++FGNSG +  +A+GCGHDNEGLFV
Sbjct: 214 QCQALETSGCRASKCLYQVSYGDGSFTVGEFVIETLTFGNSGMINNVAVGCGHDNEGLFV 273

Query: 278 GSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNKKV 337
           GSAGLLGLGGG LSLT Q+KA+S +YCLVDRDS +S  LEFNSA   D+V APL+++ KV
Sbjct: 274 GSAGLLGLGGGSLSLTSQMKASSFSYCLVDRDSSSSSDLEFNSAAPSDSVNAPLLKSGKV 333

Query: 338 DTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVR 397
           DTFYYVGLTG SVGGQ + IPP+LF+MD++G GGIIVD GTAITRLQTQAYN+LRD+FV 
Sbjct: 334 DTFYYVGLTGMSVGGQLLSIPPNLFQMDDSGYGGIIVDSGTAITRLQTQAYNTLRDAFVS 393

Query: 398 LAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCF 457
               LK T+G ALFDTCYD S    V +PTVS  F  GK+L LP KNYLIPVDS GTFCF
Sbjct: 394 RTPYLKKTNGFALFDTCYDLSSQSRVTIPTVSFEFAGGKSLQLPPKNYLIPVDSVGTFCF 453

Query: 458 AFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           AFAPT+S+LSIIGNVQQQGTRV +DLAN+ VGF+P+KC
Sbjct: 454 AFAPTTSSLSIIGNVQQQGTRVHYDLANSVVGFSPHKC 491


>gi|255576511|ref|XP_002529147.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223531426|gb|EEF33260.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 479

 Score =  567 bits (1460), Expect = e-159,   Method: Compositional matrix adjust.
 Identities = 284/463 (61%), Positives = 352/463 (76%), Gaps = 10/463 (2%)

Query: 35  TTVLDVSSALQQTEHILSFEPETLEPFAEESETAAESFPLNSSSSFSLPLHSREILHKTR 94
           TT+LDV +++Q+ E I +     + PF ++         + SSS  ++ LHSR  + KT+
Sbjct: 25  TTLLDVEASIQKAEAIFTSSATKMTPFNQQE-------IVTSSSQLTMELHSRTSVQKTK 77

Query: 95  HNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKP--AEAQILPEDFSTPVVSGA 152
           H DYRSL LSRLERDSARV ++ T+L LAI+ +   +LKP   ++Q   ED   P++SG 
Sbjct: 78  HPDYRSLTLSRLERDSARVKSINTRLDLAIHGLSTSDLKPLDTDSQFRAEDLQGPIISGT 137

Query: 153 SQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPL 212
           SQGSGEYFSR+G+G P     MVLDTGSD+NW+QC PC +CY Q+DPIF+P +S+SYSPL
Sbjct: 138 SQGSGEYFSRVGIGKPSSPVYMVLDTGSDVNWIQCAPCADCYHQADPIFEPASSTSYSPL 197

Query: 213 PCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDN 272
            C   QC+SLDVS CR N CLY+V+YGDGS+TVGD VTET++ G S SV  +A+GCGH+N
Sbjct: 198 SCDTKQCQSLDVSECRNNTCLYEVSYGDGSYTVGDFVTETITLG-SASVDNVAIGCGHNN 256

Query: 273 EGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGDAVTAPLI 332
           EGLF+G+AGLLGLGGG LS   QI A+S +YCLVDRDS ++  LEFNSA    A+TAPL+
Sbjct: 257 EGLFIGAAGLLGLGGGKLSFPSQINASSFSYCLVDRDSDSASTLEFNSALLPHAITAPLL 316

Query: 333 RNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLR 392
           RN+++DTFYYVG+TG SVGG+ + IP S+FEMDE+G+GGII+D GTA+TRLQT AYN+LR
Sbjct: 317 RNRELDTFYYVGMTGLSVGGELLSIPESMFEMDESGNGGIIIDSGTAVTRLQTAAYNALR 376

Query: 393 DSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSA 452
           D+FV+   +L  TS VALFDTCYD S   SV VPTV+ H   GK L LPA NYLIPVDS 
Sbjct: 377 DAFVKGTKDLPVTSEVALFDTCYDLSRKTSVEVPTVTFHLAGGKVLPLPATNYLIPVDSD 436

Query: 453 GTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           GTFCFAFAPTSSALSIIGNVQQQGTRV FDLAN+ VGF P +C
Sbjct: 437 GTFCFAFAPTSSALSIIGNVQQQGTRVGFDLANSLVGFEPRQC 479


>gi|224111722|ref|XP_002315953.1| predicted protein [Populus trichocarpa]
 gi|222864993|gb|EEF02124.1| predicted protein [Populus trichocarpa]
          Length = 484

 Score =  556 bits (1433), Expect = e-156,   Method: Compositional matrix adjust.
 Identities = 283/466 (60%), Positives = 359/466 (77%), Gaps = 13/466 (2%)

Query: 33  TATTVLDVSSALQQTEHILSFEPETLEPF-AEESETAAESFPLNSSSSFSLPLHSREILH 91
           + TTVLDV++++Q+T++I S  P+ + PF  +E ET        +SS  ++ L SR  + 
Sbjct: 29  SETTVLDVAASIQRTKNIFSSGPK-MSPFNQQEKET--------TSSELTVELLSRTSIQ 79

Query: 92  KTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAE--AQILPEDFSTPVV 149
           KT H  Y+SL LSRL+RDSARV +L+T+L LAI ++   +LKP E  ++  PED  +P++
Sbjct: 80  KTTHTGYKSLTLSRLQRDSARVKSLVTRLDLAINSISSSDLKPLETDSEFKPEDLQSPII 139

Query: 150 SGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSY 209
           SG SQGSGEYFSR+G+G PP Q  ++LDTGSD+NW+QC PC +CYQQ+DPIF+P +S+S+
Sbjct: 140 SGTSQGSGEYFSRVGIGKPPSQAYLILDTGSDVNWVQCAPCADCYQQADPIFEPASSASF 199

Query: 210 SPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCG 269
           S L C   QC+SLDVS CR + CLY+V+YGDGS+TVGD VTET++ G S  V  +A+GCG
Sbjct: 200 STLSCNTRQCRSLDVSECRNDTCLYEVSYGDGSYTVGDFVTETITLG-SAPVDNVAIGCG 258

Query: 270 HDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGDAVTA 329
           H+NEGLFVG+AGLLGLGGG LS   QI ATS +YCLVDRDS ++  LEFNS    +AV+A
Sbjct: 259 HNNEGLFVGAAGLLGLGGGSLSFPSQINATSFSYCLVDRDSESASTLEFNSTLPPNAVSA 318

Query: 330 PLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYN 389
           PL+RN  +DTFYYVGLTG SVGG+ V IP S F++DE+G+GG+IVD GTAITRLQT  YN
Sbjct: 319 PLLRNHHLDTFYYVGLTGLSVGGELVSIPESAFQIDESGNGGVIVDSGTAITRLQTDVYN 378

Query: 390 SLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPV 449
           SLRD+FV+   +L  T+G+ALFDTCYD S   +V VPTVS HF  GK L LPAKNYL+P+
Sbjct: 379 SLRDAFVKRTRDLPSTNGIALFDTCYDLSSKGNVEVPTVSFHFPDGKELPLPAKNYLVPL 438

Query: 450 DSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           DS GTFCFAFAPT+S+LSIIGNVQQQGTRV +DL N+ VGF PNKC
Sbjct: 439 DSEGTFCFAFAPTASSLSIIGNVQQQGTRVVYDLVNHLVGFVPNKC 484


>gi|225423917|ref|XP_002281973.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
          Length = 491

 Score =  556 bits (1432), Expect = e-155,   Method: Compositional matrix adjust.
 Identities = 293/489 (59%), Positives = 368/489 (75%), Gaps = 6/489 (1%)

Query: 10  FTITTILFSFCLF--TSASSRGLSETATTVLDVSSALQQTEHILSFEPETLEPFAEESET 67
           F +  +   FC +  +  ++R LS   TTVLDVS +++++ ++LS  P+  +    E + 
Sbjct: 6   FLLCVLFAFFCTWGVSLVNARRLSLPRTTVLDVSGSIRESLNVLSLNPQYEQ---MEFQH 62

Query: 68  AAESFPLNSSSSFSLPLHS-REILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYN 126
              SFP +SSSS        R  +HK+ H DY+SLVL+RLERDS RV +L T++ LAI  
Sbjct: 63  QERSFPSSSSSSSLTLSLHSRTSIHKSSHKDYKSLVLARLERDSDRVRSLATRMDLAIAG 122

Query: 127 VDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQ 186
           + + +LKP E ++  E   TP+VSGASQGSGEYFSR+G+G+PP+   MV+DTGSD+NW+Q
Sbjct: 123 ITKSDLKPVEKELEAEALETPLVSGASQGSGEYFSRVGIGSPPKHVYMVVDTGSDVNWVQ 182

Query: 187 CRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVG 246
           C PC +CYQQ+DPIF+P  SSSY+PL C   QCKSLDVS CR + CLY+V+YGDGS+TVG
Sbjct: 183 CAPCADCYQQADPIFEPSFSSSYAPLTCETHQCKSLDVSECRNDSCLYEVSYGDGSYTVG 242

Query: 247 DLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLV 306
           D  TET++   S S+  +A+GCGHDNEGLFVG+AGLLGLGGG LS   QI A+S +YCLV
Sbjct: 243 DFATETITLDGSASLNNVAIGCGHDNEGLFVGAAGLLGLGGGSLSFPSQINASSFSYCLV 302

Query: 307 DRDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDE 366
           +RD+ ++  LEFNS     +VTAPL+RN ++DTFYY+G+TG  VGGQ + IP S FE+DE
Sbjct: 303 NRDTDSASTLEFNSPIPSHSVTAPLLRNNQLDTFYYLGMTGIGVGGQMLSIPRSSFEVDE 362

Query: 367 AGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVP 426
           +G+GGIIVD GTA+TRLQ+  YNSLRDSFVR   +L  TSGVALFDTCYD S   SV VP
Sbjct: 363 SGNGGIIVDSGTAVTRLQSDVYNSLRDSFVRGTQHLPSTSGVALFDTCYDLSSRSSVEVP 422

Query: 427 TVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANN 486
           TVS HF  GK L LPAKNYLIPVDSAGTFCFAFAPT+SALSIIGNVQQQGTRVS+DL+N+
Sbjct: 423 TVSFHFPDGKYLALPAKNYLIPVDSAGTFCFAFAPTTSALSIIGNVQQQGTRVSYDLSNS 482

Query: 487 RVGFTPNKC 495
            VGF+PN C
Sbjct: 483 LVGFSPNGC 491


>gi|224126221|ref|XP_002329620.1| predicted protein [Populus trichocarpa]
 gi|222870359|gb|EEF07490.1| predicted protein [Populus trichocarpa]
          Length = 357

 Score =  549 bits (1415), Expect = e-153,   Method: Compositional matrix adjust.
 Identities = 268/356 (75%), Positives = 310/356 (87%), Gaps = 1/356 (0%)

Query: 141 PEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPI 200
           PED STPV SG SQGSGEYF+R+GVG P RQF MVLDTGSDINWLQC+PCT+CYQQ+DPI
Sbjct: 2   PEDLSTPVTSGTSQGSGEYFTRVGVGNPARQFYMVLDTGSDINWLQCQPCTDCYQQTDPI 61

Query: 201 FDPKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGS 260
           FDP  SS+Y+P+ C + QC SL++S+CR+ +CLYQV YGDGS+T GD  TE+VSFGNSGS
Sbjct: 62  FDPTASSTYAPVTCQSQQCSSLEMSSCRSGQCLYQVNYGDGSYTFGDFATESVSFGNSGS 121

Query: 261 VKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNS 320
           VK +ALGCGHDNEGLFVG+AGLLGLGGG LSLT Q+KATS +YCLV+RDS  S  L+FNS
Sbjct: 122 VKNVALGCGHDNEGLFVGAAGLLGLGGGPLSLTNQLKATSFSYCLVNRDSAGSSTLDFNS 181

Query: 321 AR-GGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTA 379
           A+ G D+VTAPL++N+K+DTFYYVGL+G SVGGQ V IP S F +DE+G+GGIIVDCGTA
Sbjct: 182 AQLGVDSVTAPLMKNRKIDTFYYVGLSGMSVGGQMVSIPESTFRLDESGNGGIIVDCGTA 241

Query: 380 ITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALD 439
           ITRLQTQAYN LRD+FVR+  NLK TS VALFDTCYD SG  SVRVPTVS HF  GK+ +
Sbjct: 242 ITRLQTQAYNPLRDAFVRMTQNLKLTSAVALFDTCYDLSGQASVRVPTVSFHFADGKSWN 301

Query: 440 LPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           LPA NYLIPVDSAGT+CFAFAPT+S+LSIIGNVQQQGTRV+FDLANNR+GF+PNKC
Sbjct: 302 LPAANYLIPVDSAGTYCFAFAPTTSSLSIIGNVQQQGTRVTFDLANNRMGFSPNKC 357


>gi|15222611|ref|NP_173922.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|12321511|gb|AAG50814.1|AC079281_16 hypothetical protein [Arabidopsis thaliana]
 gi|20466516|gb|AAM20575.1| unknown protein [Arabidopsis thaliana]
 gi|23198172|gb|AAN15613.1| unknown protein [Arabidopsis thaliana]
 gi|110736960|dbj|BAF00436.1| hypothetical protein [Arabidopsis thaliana]
 gi|332192515|gb|AEE30636.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 483

 Score =  538 bits (1386), Expect = e-150,   Method: Compositional matrix adjust.
 Identities = 276/478 (57%), Positives = 359/478 (75%), Gaps = 17/478 (3%)

Query: 23  TSASSRGLSETATT---VLDVSSALQQTEHILSFEPETLEPFAEESETAAESFPLNSSSS 79
           +S  SR L ET+TT   +L+V+ ++ +T++  SF    L    E++ +A        SSS
Sbjct: 18  SSVFSRILPETSTTTTSILNVADSIHRTKYTSSFR---LNQQEEQTHSA--------SSS 66

Query: 80  FSLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQI 139
           FSL LHSR  +  T H+DY+SL L+RL RD+ARV +LIT+L LAI N+ + +LKP     
Sbjct: 67  FSLQLHSRVSVRGTEHSDYKSLTLARLNRDTARVKSLITRLDLAINNISKADLKPISTMY 126

Query: 140 LPE--DFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQS 197
             E  D   P++SG +QGSGEYF+R+G+G P R+  MVLDTGSD+NWLQC PC +CY Q+
Sbjct: 127 TTEEQDIEAPLISGTTQGSGEYFTRVGIGKPAREVYMVLDTGSDVNWLQCTPCADCYHQT 186

Query: 198 DPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGN 257
           +PIF+P +SSSY PL C  PQC +L+VS CR   CLY+V+YGDGS+TVGD  TET++ G 
Sbjct: 187 EPIFEPSSSSSYEPLSCDTPQCNALEVSECRNATCLYEVSYGDGSYTVGDFATETLTIG- 245

Query: 258 SGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLE 317
           S  V+ +A+GCGH NEGLFVG+AGLLGLGGG+L+L  Q+  TS +YCLVDRDS ++  ++
Sbjct: 246 STLVQNVAVGCGHSNEGLFVGAAGLLGLGGGLLALPSQLNTTSFSYCLVDRDSDSASTVD 305

Query: 318 FNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCG 377
           F ++   DAV APL+RN ++DTFYY+GLTG SVGG+ +QIP S FEMDE+G GGII+D G
Sbjct: 306 FGTSLSPDAVVAPLLRNHQLDTFYYLGLTGISVGGELLQIPQSSFEMDESGSGGIIIDSG 365

Query: 378 TAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKA 437
           TA+TRLQT+ YNSLRDSFV+   +L+  +GVA+FDTCY+ S   +V VPTV+ HF  GK 
Sbjct: 366 TAVTRLQTEIYNSLRDSFVKGTLDLEKAAGVAMFDTCYNLSAKTTVEVPTVAFHFPGGKM 425

Query: 438 LDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           L LPAKNY+IPVDS GTFC AFAPT+S+L+IIGNVQQQGTRV+FDLAN+ +GF+ NKC
Sbjct: 426 LALPAKNYMIPVDSVGTFCLAFAPTASSLAIIGNVQQQGTRVTFDLANSLIGFSSNKC 483


>gi|449434646|ref|XP_004135107.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 486

 Score =  532 bits (1370), Expect = e-148,   Method: Compositional matrix adjust.
 Identities = 280/475 (58%), Positives = 348/475 (73%), Gaps = 17/475 (3%)

Query: 28  RGLSETATT-VLDVSSALQQTEHILSFEPETLEPFAEESETAAESFPLNSSSSFSLPLHS 86
           R L  T TT VLDV++++Q+T+ + + EP++  P     ET      ++  SS SL L+S
Sbjct: 22  RTLHPTPTTSVLDVAASIQRTQQVFAVEPKSSTP----DETT-----VSDPSSLSLQLNS 72

Query: 87  REILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKP------AEAQIL 140
           R  + K  H+DY+SL LSRL+RDSARV +L  ++ LAI  +   +L+P        +Q  
Sbjct: 73  RISVMKASHSDYKSLTLSRLKRDSARVRSLTARIDLAIRGITGTDLEPLGNGGGGGSQFG 132

Query: 141 PEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPI 200
            EDF +P+VSGASQGSGEYFSR+G+G PP    MVLDTGSD++W+QC PC ECY+Q+DPI
Sbjct: 133 TEDFESPIVSGASQGSGEYFSRVGIGRPPSPVYMVLDTGSDVSWVQCAPCAECYEQTDPI 192

Query: 201 FDPKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGS 260
           F+P +S+S++ L C   QCKSLDVS CR   CLY+V+YGDGS+TVGD VTETV+ G S S
Sbjct: 193 FEPTSSASFTSLSCETEQCKSLDVSECRNGTCLYEVSYGDGSYTVGDFVTETVTLG-STS 251

Query: 261 VKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNS 320
           +  IA+GCGH+NEGLF+G+AGLLGLGGG LS   Q+ A+S +YCLVDRDS ++  L+FNS
Sbjct: 252 LGNIAIGCGHNNEGLFIGAAGLLGLGGGSLSFPSQLNASSFSYCLVDRDSDSTSTLDFNS 311

Query: 321 ARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAI 380
               DAVTAPL RN  +DTF+Y+GLTG SVGG  + IP + F+M E G+GGIIVD GTA+
Sbjct: 312 PITPDAVTAPLHRNPNLDTFFYLGLTGMSVGGAVLPIPETSFQMSEDGNGGIIVDSGTAV 371

Query: 381 TRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDL 440
           TRLQT  YN LRD+FV+   +L+   GVALFDTCYD S    V VPTVS HF  G  L L
Sbjct: 372 TRLQTTVYNVLRDAFVKSTHDLQTARGVALFDTCYDLSSKSRVEVPTVSFHFANGNELPL 431

Query: 441 PAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           PAKNYLIPVDS GTFCFAFAPT S LSI+GN QQQGTRV FDLAN+ VGF+PNKC
Sbjct: 432 PAKNYLIPVDSEGTFCFAFAPTDSTLSILGNAQQQGTRVGFDLANSLVGFSPNKC 486


>gi|449530542|ref|XP_004172253.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
           CELL 1-like [Cucumis sativus]
          Length = 486

 Score =  529 bits (1363), Expect = e-147,   Method: Compositional matrix adjust.
 Identities = 279/475 (58%), Positives = 347/475 (73%), Gaps = 17/475 (3%)

Query: 28  RGLSETATT-VLDVSSALQQTEHILSFEPETLEPFAEESETAAESFPLNSSSSFSLPLHS 86
           R L  T TT VLDV++++Q+T+ + + EP++  P     ET      ++  SS SL L+S
Sbjct: 22  RTLHPTPTTSVLDVAASIQRTQQVFAVEPKSSTP----DETT-----VSDPSSLSLQLNS 72

Query: 87  REILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKP------AEAQIL 140
           R  + K  H+DY+SL LSRL+RDSARV +L  ++ LAI  +   +L+P        +Q  
Sbjct: 73  RISVMKASHSDYKSLTLSRLKRDSARVRSLTARIDLAIRGITGTDLEPLGNGGGGGSQFG 132

Query: 141 PEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPI 200
            EDF +P+VSGASQGSGEYFSR+G+G PP    MVLDTGSD++W+QC PC ECY+Q+DP 
Sbjct: 133 TEDFESPIVSGASQGSGEYFSRVGIGRPPSPVYMVLDTGSDVSWVQCAPCAECYEQTDPX 192

Query: 201 FDPKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGS 260
           F+P +S+S++ L C   QCKSLDVS CR   CLY+V+YGDGS+TVGD VTETV+ G S S
Sbjct: 193 FEPTSSASFTSLSCETEQCKSLDVSECRNGTCLYEVSYGDGSYTVGDFVTETVTLG-STS 251

Query: 261 VKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNS 320
           +  IA+GCGH+NEGLF+G+AGLLGLGGG LS   Q+ A+S +YCLVDRDS ++  L+FNS
Sbjct: 252 LGNIAIGCGHNNEGLFIGAAGLLGLGGGSLSFPSQLNASSFSYCLVDRDSDSTSTLDFNS 311

Query: 321 ARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAI 380
               DAVTAPL RN  +DTF+Y+GLTG SVGG  + IP + F+M E G+GGIIVD GTA+
Sbjct: 312 PITPDAVTAPLHRNPNLDTFFYLGLTGMSVGGAVLPIPETSFQMSEDGNGGIIVDSGTAV 371

Query: 381 TRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDL 440
           TRLQT  YN LRD+FV+   +L+   GVALFDTCYD S    V VPTVS HF  G  L L
Sbjct: 372 TRLQTTVYNVLRDAFVKSTHDLQTARGVALFDTCYDLSSKSRVEVPTVSFHFANGNELPL 431

Query: 441 PAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           PAKNYLIPVDS GTFCFAFAPT S LSI+GN QQQGTRV FDLAN+ VGF+PNKC
Sbjct: 432 PAKNYLIPVDSEGTFCFAFAPTDSTLSILGNAQQQGTRVGFDLANSLVGFSPNKC 486


>gi|297845610|ref|XP_002890686.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297336528|gb|EFH66945.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 486

 Score =  528 bits (1360), Expect = e-147,   Method: Compositional matrix adjust.
 Identities = 268/468 (57%), Positives = 350/468 (74%), Gaps = 15/468 (3%)

Query: 31  SETATTVLDVSSALQQTEHILSFEPETLEPFAEESETAAESFPLNSSSSFSLPLHSREIL 90
           S T T++L+V+ ++ +T++  SF         +E +T + S      SSFSL LHSR  +
Sbjct: 31  SVTTTSILNVADSIHRTKYTSSFRLN-----QQEEQTHSRS------SSFSLQLHSRVSV 79

Query: 91  HKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPE---DFSTP 147
             T H+DY+SL L+RL RD+ARV +LIT+L LAI N+ + +LKP           D   P
Sbjct: 80  RGTEHSDYKSLTLARLNRDTARVKSLITRLDLAINNISKADLKPVTTMYTTTEEEDIEAP 139

Query: 148 VVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSS 207
           ++SG +QGSGEYF+R+G+G P R+  MVLDTGSD+NWLQC PC +CY Q++PIF+P +SS
Sbjct: 140 LISGTTQGSGEYFTRVGIGNPAREVYMVLDTGSDVNWLQCTPCADCYHQTEPIFEPSSSS 199

Query: 208 SYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALG 267
           SY PL C  PQC +L+VS CR   CLY+V+YGDGS+TVGD  TET++ G S  V+ +A+G
Sbjct: 200 SYEPLSCDTPQCNALEVSECRNATCLYEVSYGDGSYTVGDFATETLTIG-STLVQNVAVG 258

Query: 268 CGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGDAV 327
           CGH NEGLFVG+AGLLGLGGG+L+L  Q+  TS +YCLVDRDS ++  +EF ++   DAV
Sbjct: 259 CGHSNEGLFVGAAGLLGLGGGLLALPSQLNTTSFSYCLVDRDSDSASTVEFGTSLPPDAV 318

Query: 328 TAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQA 387
            APL+RN ++DTFYY+GLTG SVGG+ +QIP S FEMDE+G GGII+D GTA+TRLQT  
Sbjct: 319 VAPLLRNHQLDTFYYLGLTGISVGGELLQIPQSSFEMDESGSGGIIIDSGTAVTRLQTGI 378

Query: 388 YNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLI 447
           YNSLRDSF++   +L+  +GVA+FDTCY+ S   ++ VPTV+ HF  GK L LPAKNY+I
Sbjct: 379 YNSLRDSFLKGTSDLEKAAGVAMFDTCYNLSAKTTIEVPTVAFHFPGGKMLALPAKNYMI 438

Query: 448 PVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           PVDS GTFC AFAPT+S+L+IIGNVQQQGTRV+FDLAN+ +GF+ NKC
Sbjct: 439 PVDSVGTFCLAFAPTASSLAIIGNVQQQGTRVTFDLANSLIGFSSNKC 486


>gi|356502456|ref|XP_003520035.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 484

 Score =  519 bits (1336), Expect = e-144,   Method: Compositional matrix adjust.
 Identities = 273/465 (58%), Positives = 344/465 (73%), Gaps = 9/465 (1%)

Query: 33  TATTVLDVSSALQQTEHILSFEPETLEPFAEESETAAESFPLNSSSSFSLPLHSREILHK 92
           + TT+LDV S+LQ   + ++F P  L     + E       L  SSSF + L SR  + K
Sbjct: 27  SKTTLLDVVSSLQNAHNAVAFTPHHLNQHQRQQEA------LLLSSSFGIHLRSRASIQK 80

Query: 93  TRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAE--AQILPEDFSTPVVS 150
             H DY+SL LSRL RDSARV +L T+L L +  V   +L PAE  A+        PVVS
Sbjct: 81  PSHRDYKSLTLSRLARDSARVKSLQTRLDLVLKRVSNSDLHPAESNAEFEANALQGPVVS 140

Query: 151 GASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYS 210
           G SQGSGEYF R+G+G PP Q  +VLDTGSD++W+QC PC+ECYQQSDPIFDP +S+SYS
Sbjct: 141 GTSQGSGEYFLRVGIGKPPSQAYVVLDTGSDVSWIQCAPCSECYQQSDPIFDPVSSNSYS 200

Query: 211 PLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGH 270
           P+ C APQCKSLD+S CR   CLY+V+YGDGS+TVG+  TETV+ G + +V+ +A+GCGH
Sbjct: 201 PIRCDAPQCKSLDLSECRNGTCLYEVSYGDGSYTVGEFATETVTLG-TAAVENVAIGCGH 259

Query: 271 DNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGDAVTAP 330
           +NEGLFVG+AGLLGLGGG LS   Q+ ATS +YCLV+RDS A   LEFNS    + VTAP
Sbjct: 260 NNEGLFVGAAGLLGLGGGKLSFPAQVNATSFSYCLVNRDSDAVSTLEFNSPLPRNVVTAP 319

Query: 331 LIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNS 390
           L RN ++DTFYY+GL G SVGG+A+ IP S+FE+D  G GGII+D GTA+TRL+++ Y++
Sbjct: 320 LRRNPELDTFYYLGLKGISVGGEALPIPESIFEVDAIGGGGIIIDSGTAVTRLRSEVYDA 379

Query: 391 LRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVD 450
           LRD+FV+ A  +   +GV+LFDTCYD S   SV+VPTVS HF  G+ L LPA+NYLIPVD
Sbjct: 380 LRDAFVKGAKGIPKANGVSLFDTCYDLSSRESVQVPTVSFHFPEGRELPLPARNYLIPVD 439

Query: 451 SAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           S GTFCFAFAPT+S+LSI+GNVQQQGTRV FD+AN+ VGF+ + C
Sbjct: 440 SVGTFCFAFAPTTSSLSIMGNVQQQGTRVGFDIANSLVGFSADSC 484


>gi|356498306|ref|XP_003517994.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 484

 Score =  512 bits (1319), Expect = e-142,   Method: Compositional matrix adjust.
 Identities = 269/463 (58%), Positives = 342/463 (73%), Gaps = 9/463 (1%)

Query: 35  TTVLDVSSALQQTEHILSFEPETLEPFAEESETAAESFPLNSSSSFSLPLHSREILHKTR 94
           TT+LDV S+LQ   ++++F          + E++  +      SSF + LHSR  + K+ 
Sbjct: 29  TTLLDVVSSLQNAHNVVAFTHHHPNKHQRQQESSLLT------SSFGIQLHSRASIQKSS 82

Query: 95  HNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAE--AQILPEDFSTPVVSGA 152
           H+DY+SL LSRL RDSARV  L T+L L +  V   +L PAE  A+        PVVSG 
Sbjct: 83  HSDYKSLTLSRLARDSARVKALQTRLDLFLKRVSNSDLHPAESKAEFESNALQGPVVSGT 142

Query: 153 SQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPL 212
           SQGSGEYF R+G+G PP Q  +VLDTGSD++W+QC PC+ECYQQSDPIFDP +S+SYSP+
Sbjct: 143 SQGSGEYFLRVGIGKPPSQAYVVLDTGSDVSWIQCAPCSECYQQSDPIFDPISSNSYSPI 202

Query: 213 PCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDN 272
            C  PQCKSLD+S CR   CLY+V+YGDGS+TVG+  TETV+ G S +V+ +A+GCGH+N
Sbjct: 203 RCDEPQCKSLDLSECRNGTCLYEVSYGDGSYTVGEFATETVTLG-SAAVENVAIGCGHNN 261

Query: 273 EGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGDAVTAPLI 332
           EGLFVG+AGLLGLGGG LS   Q+ ATS +YCLV+RDS A   LEFNS    +A TAPL+
Sbjct: 262 EGLFVGAAGLLGLGGGKLSFPAQVNATSFSYCLVNRDSDAVSTLEFNSPLPRNAATAPLM 321

Query: 333 RNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLR 392
           RN ++DTFYY+GL G SVGG+A+ IP S FE+D  G GGII+D GTA+TRL+++ Y++LR
Sbjct: 322 RNPELDTFYYLGLKGISVGGEALPIPESSFEVDAIGGGGIIIDSGTAVTRLRSEVYDALR 381

Query: 393 DSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSA 452
           D+FV+ A  +   +GV+LFDTCYD S   SV +PTVS  F  G+ L LPA+NYLIPVDS 
Sbjct: 382 DAFVKGAKGIPKANGVSLFDTCYDLSSRESVEIPTVSFRFPEGRELPLPARNYLIPVDSV 441

Query: 453 GTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           GTFCFAFAPT+S+LSIIGNVQQQGTRV FD+AN+ VGF+ + C
Sbjct: 442 GTFCFAFAPTTSSLSIIGNVQQQGTRVGFDIANSLVGFSVDSC 484


>gi|115434442|ref|NP_001041979.1| Os01g0140100 [Oryza sativa Japonica Group]
 gi|12328547|dbj|BAB21205.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
 gi|113531510|dbj|BAF03893.1| Os01g0140100 [Oryza sativa Japonica Group]
 gi|125568961|gb|EAZ10476.1| hypothetical protein OsJ_00309 [Oryza sativa Japonica Group]
 gi|215697206|dbj|BAG91200.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 504

 Score =  488 bits (1255), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 266/471 (56%), Positives = 325/471 (69%), Gaps = 10/471 (2%)

Query: 34  ATTVLDVSSALQQTEHILSFEPETLEPFAEESETAAESFPLNSSSSFSLPLHSREIL--- 90
           AT  LDV+++L +    +S E   L   A  + +       +     +L LHSR+ L   
Sbjct: 35  ATETLDVAASLSRARAAVSAEAVPLHQSAAAAVSTEVVGEEHEEGRLALRLHSRDFLPEE 94

Query: 91  -HKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEA---QILPEDFST 146
             + RH  YRSLVL+RL RDSAR   +  +  +A   V R +L PA     +    +   
Sbjct: 95  QGRQRHASYRSLVLARLRRDSARAAAVSARAAMAADGVSRFDLVPANVTAFEASAAEIQG 154

Query: 147 PVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTS 206
           PVVSG   GSGEYFSR+GVG+P RQ  MVLDTGSD+ W+QC+PC +CYQQSDP+FDP  S
Sbjct: 155 PVVSGVGLGSGEYFSRVGVGSPARQLYMVLDTGSDVTWVQCQPCADCYQQSDPVFDPSLS 214

Query: 207 SSYSPLPCAAPQCKSLDVSACR--ANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGI 264
           +SY+ + C  P+C  LD +ACR     CLY+VAYGDGS+TVGD  TET++ G+S  V  +
Sbjct: 215 TSYASVACDNPRCHDLDAAACRNSTGACLYEVAYGDGSYTVGDFATETLTLGDSAPVSSV 274

Query: 265 ALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGG 324
           A+GCGHDNEGLFVG+AGLL LGGG LS   QI AT+ +YCLVDRDSP+S  L+F  A   
Sbjct: 275 AIGCGHDNEGLFVGAAGLLALGGGPLSFPSQISATTFSYCLVDRDSPSSSTLQFGDAADA 334

Query: 325 DAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQ 384
           + VTAPLIR+ +  TFYYVGL+G SVGGQ + IPPS F MD  G GG+IVD GTA+TRLQ
Sbjct: 335 E-VTAPLIRSPRTSTFYYVGLSGLSVGGQILSIPPSAFAMDSTGAGGVIVDSGTAVTRLQ 393

Query: 385 TQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKN 444
           + AY +LRD+FVR   +L  TSGV+LFDTCYD S   SV VP VSL F  G  L LPAKN
Sbjct: 394 SSAYAALRDAFVRGTQSLPRTSGVSLFDTCYDLSDRTSVEVPAVSLRFAGGGELRLPAKN 453

Query: 445 YLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           YLIPVD AGT+C AFAPT++A+SIIGNVQQQGTRVSFD A + VGFT NKC
Sbjct: 454 YLIPVDGAGTYCLAFAPTNAAVSIIGNVQQQGTRVSFDTAKSTVGFTTNKC 504


>gi|414876414|tpg|DAA53545.1| TPA: hypothetical protein ZEAMMB73_483039 [Zea mays]
          Length = 506

 Score =  483 bits (1243), Expect = e-134,   Method: Compositional matrix adjust.
 Identities = 274/507 (54%), Positives = 343/507 (67%), Gaps = 16/507 (3%)

Query: 4   IKPFVLFTITTILFSFCLFTSASS------RGLSETATTVLDVSSALQQTEHILSFEPET 57
           ++P  L  +  ++ +  L  +A S      R  S   T  LDV+++L +    LS +  +
Sbjct: 1   MQPPTLLPLGAVVVAILLLATAPSPAVSRHRHSSAADTETLDVAASLSRARAALSTDAVS 60

Query: 58  LEPFAEESETAAESFPLNSSSSFSLPLHSREIL--HKTRHNDYRSLVLSRLERDSARVNT 115
           L   A  +  A  S P       +L LHSR+ L   + RH  YRSLVLSRL RDSAR   
Sbjct: 61  LHQSAAAAAGAKRS-PRAREGGLTLRLHSRDFLPEEQGRHETYRSLVLSRLRRDSARAAA 119

Query: 116 LITKLQLAIYNVDRHELKPAEAQIL---PEDFSTPVVSGASQGSGEYFSRIGVGTPPRQF 172
           +  +  LA   V R +L+PA    +         PVVSG  QGSGEYFSR+G+G+P RQ 
Sbjct: 120 VSARATLAADGVTRLDLRPANGSAVFAASAAIQGPVVSGVGQGSGEYFSRVGIGSPARQL 179

Query: 173 SMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACR--AN 230
            MVLDTGSD+ W+QC+PC +CYQQSDP+FDP  S+SY+ + C + +C+ LD +ACR    
Sbjct: 180 YMVLDTGSDVTWVQCQPCADCYQQSDPVFDPSLSASYAAVSCDSQRCRDLDTAACRNATG 239

Query: 231 RCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGML 290
            CLY+VAYGDGS+TVGD  TET++ G+S  V  +A+GCGHDNEGLFVG+AGLL LGGG L
Sbjct: 240 ACLYEVAYGDGSYTVGDFATETLTLGDSTPVGNVAIGCGHDNEGLFVGAAGLLALGGGPL 299

Query: 291 SLTKQIKATSLAYCLVDRDSPASGVLEF-NSARGGDAVTAPLIRNKKVDTFYYVGLTGFS 349
           S   QI A++ +YCLVDRDSPA+  L+F + A     VTAPL+R+ +  TFYYV L+G S
Sbjct: 300 SFPSQISASTFSYCLVDRDSPAASTLQFGDGAAEAGTVTAPLVRSPRTSTFYYVALSGIS 359

Query: 350 VGGQAVQIPPSLFEMDE-AGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGV 408
           VGGQ + IP S F MD  +G GG+IVD GTA+TRLQ+ AY +LRD+FV+ A +L  TSGV
Sbjct: 360 VGGQPLSIPASAFAMDATSGSGGVIVDSGTAVTRLQSAAYAALRDAFVQGAPSLPRTSGV 419

Query: 409 ALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSI 468
           +LFDTCYD S   SV VP VSL F  G AL LPAKNYLIPVD AGT+C AFAPT++A+SI
Sbjct: 420 SLFDTCYDLSDRTSVEVPAVSLRFEGGGALRLPAKNYLIPVDGAGTYCLAFAPTNAAVSI 479

Query: 469 IGNVQQQGTRVSFDLANNRVGFTPNKC 495
           IGNVQQQGTRVSFD A   VGFTPNKC
Sbjct: 480 IGNVQQQGTRVSFDTARGAVGFTPNKC 506


>gi|125524353|gb|EAY72467.1| hypothetical protein OsI_00323 [Oryza sativa Indica Group]
          Length = 500

 Score =  482 bits (1241), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 256/425 (60%), Positives = 307/425 (72%), Gaps = 10/425 (2%)

Query: 80  FSLPLHSREIL----HKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPA 135
            +L LHSR+ L     + RH  YRSLVL+RL RDSAR   +  +  +A   V R +L PA
Sbjct: 77  LALRLHSRDFLPEEQGRQRHASYRSLVLARLRRDSARAAAVSARAAMAADGVSRFDLVPA 136

Query: 136 EA---QILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTE 192
                +    +   PVVSG   GSGEYFSR+GVG+P RQ  MVLDTGSD+ W+QC+PC +
Sbjct: 137 NVTAFEASAAEIQGPVVSGVGLGSGEYFSRVGVGSPARQLYMVLDTGSDVTWVQCQPCAD 196

Query: 193 CYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACR--ANRCLYQVAYGDGSFTVGDLVT 250
           CYQQSDP+FDP  S+SY+ + C  P+C  LD +ACR     CLY+VAYGDGS+TVGD  T
Sbjct: 197 CYQQSDPVFDPSLSTSYASVACDNPRCHDLDAAACRNSTGACLYEVAYGDGSYTVGDFAT 256

Query: 251 ETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDS 310
           ET++ G+S  V  +A+GCGHDNEGLFVG+AGLL LGGG LS   QI AT+ +YCLVDRDS
Sbjct: 257 ETLTLGDSAPVSSVAIGCGHDNEGLFVGAAGLLALGGGPLSFPSQISATTFSYCLVDRDS 316

Query: 311 PASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDG 370
           P+S  L+F  A   + VTAPLIR+ +  TFYYVGL+G SVGGQ + IPPS F MD  G G
Sbjct: 317 PSSSTLQFGDAADAE-VTAPLIRSPRTSTFYYVGLSGISVGGQILSIPPSAFAMDGTGAG 375

Query: 371 GIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSL 430
           G+IVD GTA+TRLQ+ AY +LRD+FVR   +L  TSGV+LFDTCYD S   SV VP VSL
Sbjct: 376 GVIVDSGTAVTRLQSSAYAALRDAFVRGTQSLPRTSGVSLFDTCYDLSDRTSVEVPAVSL 435

Query: 431 HFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGF 490
            F  G  L LPAKNYLIPVD AGT+C AFAPT++A+SIIGNVQQQGTRVSFD A + VGF
Sbjct: 436 RFAGGGELRLPAKNYLIPVDGAGTYCLAFAPTNAAVSIIGNVQQQGTRVSFDTAKSTVGF 495

Query: 491 TPNKC 495
           T NKC
Sbjct: 496 TSNKC 500


>gi|242056497|ref|XP_002457394.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
 gi|241929369|gb|EES02514.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
          Length = 509

 Score =  481 bits (1239), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 257/427 (60%), Positives = 310/427 (72%), Gaps = 11/427 (2%)

Query: 80  FSLPLHSREIL--HKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEA 137
            +L LHSR+ L   + RH  YRSLV SRL RDSAR   L  +  LA   V R +L+PA  
Sbjct: 83  LTLRLHSRDFLPEAQQRHATYRSLVQSRLRRDSARAAALSARATLAADGVTRQDLRPANE 142

Query: 138 QI-----LPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTE 192
                  L      PVVSG  QGSGEYFSR+G+G+P R+  MVLDTGSD+ W+QC+PC +
Sbjct: 143 SAVFGASLAAAIQGPVVSGVGQGSGEYFSRVGIGSPARELYMVLDTGSDVTWVQCQPCAD 202

Query: 193 CYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACR--ANRCLYQVAYGDGSFTVGDLVT 250
           CYQQSDP+FDP  S+SY+ + C +P+C+ LD +ACR     CLY+VAYGDGS+TVGD  T
Sbjct: 203 CYQQSDPVFDPSLSASYAAVSCDSPRCRDLDTAACRNATGACLYEVAYGDGSYTVGDFAT 262

Query: 251 ETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDS 310
           ET++ G+S  V  +A+GCGHDNEGLFVG+AGLL LGGG LS   QI A++ +YCLVDRDS
Sbjct: 263 ETLTLGDSTPVTNVAIGCGHDNEGLFVGAAGLLALGGGPLSFPSQISASTFSYCLVDRDS 322

Query: 311 PASGVLEFNS-ARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDE-AG 368
           PA+  L+F +     D VTAPL+R+ +  TFYYV L+G SVGGQA+ IP S F MD  +G
Sbjct: 323 PAASTLQFGADGAEADTVTAPLVRSPRTGTFYYVALSGISVGGQALSIPSSAFAMDATSG 382

Query: 369 DGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTV 428
            GG+IVD GTA+TRLQ+ AY +LRD+FVR   +L  TSGV+LFDTCYD S   SV VP V
Sbjct: 383 SGGVIVDSGTAVTRLQSSAYAALRDAFVRGTPSLPRTSGVSLFDTCYDLSDRTSVEVPAV 442

Query: 429 SLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRV 488
           SL F  G AL LPAKNYLIPVD AGT+C AFAPT++A+SIIGNVQQQGTRVSFD A   V
Sbjct: 443 SLRFEGGGALRLPAKNYLIPVDGAGTYCLAFAPTNAAVSIIGNVQQQGTRVSFDTAKGVV 502

Query: 489 GFTPNKC 495
           GFTPNKC
Sbjct: 503 GFTPNKC 509


>gi|326517992|dbj|BAK07248.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 500

 Score =  481 bits (1237), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 262/471 (55%), Positives = 327/471 (69%), Gaps = 11/471 (2%)

Query: 35  TTVLDVSSALQQTEHILSFEPETL--EPFAEESETAAESFPLNSSSSFSLPLHSREIL-- 90
           T  LDVS++L +    +S +   L  +  A     A       S    +L LHSR+ L  
Sbjct: 31  TETLDVSASLSRARAAVSTDARPLLHQSLASTDTDALVKEEQRSGGKLALRLHSRDFLPE 90

Query: 91  HKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPE----DFST 146
            + RH  Y SLVL+RL RDSAR   L  +  LA   + R +L+PA A  + E    +   
Sbjct: 91  EQGRHESYSSLVLARLRRDSARAAALSARASLAADGISRADLRPANATPVFEASAAEIQG 150

Query: 147 PVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTS 206
           PVVSG  QGSGEYFSR+GVG P RQ  MVLDTGSD+ WLQC+PC +CY QSDP++DP  S
Sbjct: 151 PVVSGVGQGSGEYFSRVGVGRPARQLYMVLDTGSDVTWLQCQPCADCYAQSDPVYDPSVS 210

Query: 207 SSYSPLPCAAPQCKSLDVSACR--ANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGI 264
           +SY+ + C +P+C+ LD +ACR     CLY+VAYGDGS+TVGD  TET++ G+S  V  +
Sbjct: 211 TSYATVGCDSPRCRDLDAAACRNSTGSCLYEVAYGDGSYTVGDFATETLTLGDSAPVSNV 270

Query: 265 ALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGG 324
           A+GCGHDNEGLFVG+AGLL LGGG LS   QI AT+ +YCLVDRDSP+S  L+F  +   
Sbjct: 271 AIGCGHDNEGLFVGAAGLLALGGGPLSFPSQISATTFSYCLVDRDSPSSSTLQFGDSEQ- 329

Query: 325 DAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQ 384
            AVTAPLIR+ + +TFYYV L+G SVGG+A+ IP S F MD+AG GG+IVD GTA+TRLQ
Sbjct: 330 PAVTAPLIRSPRTNTFYYVALSGISVGGEALSIPSSAFAMDDAGSGGVIVDSGTAVTRLQ 389

Query: 385 TQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKN 444
           + AY +LR++FV+   +L   SGV+LFDTCYD +G  SV+VP V+L F  G  L LPAKN
Sbjct: 390 SGAYGALREAFVQGTQSLPRASGVSLFDTCYDLAGRSSVQVPAVALWFEGGGELKLPAKN 449

Query: 445 YLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           YLIPVD+AGT+C AFA TS  +SIIGNVQQQG RVSFD A N VGFT +KC
Sbjct: 450 YLIPVDAAGTYCLAFAGTSGPVSIIGNVQQQGVRVSFDTAKNTVGFTADKC 500


>gi|116786826|gb|ABK24255.1| unknown [Picea sitchensis]
 gi|148906052|gb|ABR16185.1| unknown [Picea sitchensis]
          Length = 485

 Score =  448 bits (1152), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 250/473 (52%), Positives = 322/473 (68%), Gaps = 22/473 (4%)

Query: 35  TTVLDVSSALQQTEHILSFEPETLEPFAEESETAAESFPLNSSSSFSLPLHSREIL---- 90
           +T  D+ S L     +     E ++P  EE+    E  P      +S+PL  R+ +    
Sbjct: 23  STQKDIYSTLDVQATLRVARGEVVQPAKEET---LEIKP------WSIPLVHRDAMKGNS 73

Query: 91  HKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQ---ILPEDFSTP 147
           +K     Y   +  RL+RD+ARV  + ++L+LA+  + R  LKP  +    +   DF +P
Sbjct: 74  NKNNELSYAERMQQRLKRDAARVAAINSRLELAVNGIKRSSLKPDSSSSFTMAESDFQSP 133

Query: 148 VVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSS 207
           VVSG  QGSGEYFSRIGVG P R   MVLDTGSD+ W+QC PC++CYQQSDPI++P  SS
Sbjct: 134 VVSGMDQGSGEYFSRIGVGAPRRDQLMVLDTGSDVTWIQCEPCSDCYQQSDPIYNPALSS 193

Query: 208 SYSPLPCAAPQCKSLDVSAC-RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIAL 266
           SY  + C A  C+ LDVS C R   CLYQV+YGDGS+T G+  TET++ G +  ++ +A+
Sbjct: 194 SYKLVGCQANLCQQLDVSGCSRNGSCLYQVSYGDGSYTQGNFATETLTLGGA-PLQNVAI 252

Query: 267 GCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS---LAYCLVDRDSPASGVLEF-NSAR 322
           GCGHDNEGLFVG+AGLLGLGGG LS   Q+   +    +YCLVDRDS +S  L+F  +A 
Sbjct: 253 GCGHDNEGLFVGAAGLLGLGGGSLSFPSQLTDENGKIFSYCLVDRDSESSSTLQFGRAAV 312

Query: 323 GGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITR 382
              AV AP+++N ++DTFYYV L+G SVGG+ + I  S+F +D +G+GG+IVD GTA+TR
Sbjct: 313 PNGAVLAPMLKNSRLDTFYYVSLSGISVGGKMLSISDSVFGIDASGNGGVIVDSGTAVTR 372

Query: 383 LQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPA 442
           LQT AY+SLRD+F     NL  T GV+LFDTCYD S   SV VPTV  HF  G ++ LPA
Sbjct: 373 LQTAAYDSLRDAFRAGTKNLPSTDGVSLFDTCYDLSSKESVDVPTVVFHFSGGGSMSLPA 432

Query: 443 KNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           KNYL+PVDS GTFCFAFAPTSS+LSI+GN+QQQG RVSFD ANN+VGF  NKC
Sbjct: 433 KNYLVPVDSMGTFCFAFAPTSSSLSIVGNIQQQGIRVSFDRANNQVGFAVNKC 485


>gi|357127293|ref|XP_003565317.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial
           [Brachypodium distachyon]
          Length = 540

 Score =  443 bits (1140), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 226/358 (63%), Positives = 268/358 (74%), Gaps = 10/358 (2%)

Query: 147 PVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTS 206
           PVVSG  QGSGEYFSRIG+G+P RQ  MVLDTGSD+ WLQC PC +CY QSDP+FDP  S
Sbjct: 184 PVVSGVGQGSGEYFSRIGIGSPARQLYMVLDTGSDVTWLQCAPCADCYAQSDPLFDPALS 243

Query: 207 SSYSPLPCAAPQCKSLDVSACRAN------RCLYQVAYGDGSFTVGDLVTETVSFGNSGS 260
           SSY+ +PC +P C++LD SAC  N       C+Y+VAYGDGS+TVGD  TET++ G  GS
Sbjct: 244 SSYATVPCDSPHCRALDASACHNNAANGNSSCVYEVAYGDGSYTVGDFATETLTLGGDGS 303

Query: 261 --VKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEF 318
             V  +A+GCGHDNEGLFVG+AGLL LGGG LS   QI AT  +YCLVDRDSP++  L+F
Sbjct: 304 AAVHDVAIGCGHDNEGLFVGAAGLLALGGGPLSFPSQISATEFSYCLVDRDSPSASTLQF 363

Query: 319 NSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAV-QIPPSLFEMDEAGDGGIIVDCG 377
             A     VTAPL+R+ + +TFYYV L G SVGG+ +  IPP+ F MDE G GG+IVD G
Sbjct: 364 G-ASDSSTVTAPLMRSPRSNTFYYVALNGISVGGETLSDIPPAAFAMDEQGSGGVIVDSG 422

Query: 378 TAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKA 437
           TA+TRLQ+ AY++LRD+FVR    L   SGV+LFDTCYD +G  SV+VP VSL F  G  
Sbjct: 423 TAVTRLQSSAYSALRDAFVRGTQALPRASGVSLFDTCYDLAGRSSVQVPAVSLRFEGGGE 482

Query: 438 LDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           L LPAKNYLIPVD AGT+C AFA T  A+SI+GNVQQQG RVSFD A N VGF+PNKC
Sbjct: 483 LKLPAKNYLIPVDGAGTYCLAFAATGGAVSIVGNVQQQGIRVSFDTAKNTVGFSPNKC 540


>gi|449440933|ref|XP_004138238.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 487

 Score =  441 bits (1133), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 248/495 (50%), Positives = 336/495 (67%), Gaps = 22/495 (4%)

Query: 7   FVLFTITTILFSFCLFTSASSRGL--SETATTVLDVSSALQQTEHILSFEPETLEPFAEE 64
           F+  TI T L     F S  SR L  S  +T++ DVS++  Q    LS +P+ L+  +  
Sbjct: 9   FLFLTIFTSL----QFPSILSRKLTPSSYSTSIFDVSASTNQALDALSIKPKPLQNHSH- 63

Query: 65  SETAAESFPLNSSSSFSLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAI 124
                   P   +S FSLPL+ R  LH   + DY +LV +RL RD+ARV  L   L+ ++
Sbjct: 64  -------LP---NSPFSLPLYPRLALHNPSYKDYNTLVRARLTRDAARVQFLNRNLERSL 113

Query: 125 YNVDRHELKPAEAQILPEDFSTPVVSGASQGSG-EYFSRIGVGTPPRQFSMVLDTGSDIN 183
            N   H  +     ++ +  + PVVSG S+GSG EY ++IGVG P + F +V DTGSD+ 
Sbjct: 114 -NGGTHFGESINESLIGDSITAPVVSGQSKGSGAEYLAQIGVGQPVKLFYLVPDTGSDVT 172

Query: 184 WLQCRPCTE---CYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGD 240
           WLQC+PC     CY+Q DPIFDPK+SSSYSPL C + QCK LD + C ++ C+YQV YGD
Sbjct: 173 WLQCQPCASENTCYKQFDPIFDPKSSSSYSPLSCNSQQCKLLDKANCNSDTCIYQVHYGD 232

Query: 241 GSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS 300
           GSFT G+L TET+SFGNS S+  + +GCGHDNEGLF G AGL+GLGGG +SL+ Q+KA+S
Sbjct: 233 GSFTTGELATETLSFGNSNSIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKASS 292

Query: 301 LAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPS 360
            +YCLV+ DS +S  LEFNS    D++T+PL++N +  ++ YV + G SVGG+ + I P+
Sbjct: 293 FSYCLVNLDSDSSSTLEFNSNMPSDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPT 352

Query: 361 LFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGL 420
            FE+DE+G GGIIVD GT I+RL +  Y SLR++FV+L  +L P  G+++FDTCY+FSG 
Sbjct: 353 RFEIDESGLGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFSGQ 412

Query: 421 RSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVS 480
            +V VPT++     G +L LPA+NYLI +D+AGT+C AF  T S+LSIIG+ QQQG RVS
Sbjct: 413 SNVEVPTIAFVLSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVS 472

Query: 481 FDLANNRVGFTPNKC 495
           +DL N+ VGF+ NKC
Sbjct: 473 YDLTNSLVGFSTNKC 487


>gi|449527149|ref|XP_004170575.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 487

 Score =  440 bits (1132), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 248/495 (50%), Positives = 336/495 (67%), Gaps = 22/495 (4%)

Query: 7   FVLFTITTILFSFCLFTSASSRGL--SETATTVLDVSSALQQTEHILSFEPETLEPFAEE 64
           F+  TI T L     F S  SR L  S  +T++ DVS++  Q    LS +P+ L+  +  
Sbjct: 9   FLFLTIFTSL----QFPSILSRKLTPSSYSTSIFDVSASTNQALDALSIKPKPLQNHSH- 63

Query: 65  SETAAESFPLNSSSSFSLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAI 124
                   P   +S FSLPL+ R  LH   + DY +LV +RL RD+ARV  L   L+ ++
Sbjct: 64  -------LP---NSPFSLPLYPRLALHNPSYKDYNTLVRARLTRDAARVQFLNRNLERSL 113

Query: 125 YNVDRHELKPAEAQILPEDFSTPVVSGASQGSG-EYFSRIGVGTPPRQFSMVLDTGSDIN 183
            N   H  +     ++ +  + PVVSG S+GSG EY ++IGVG P + F +V DTGSD+ 
Sbjct: 114 -NGGTHFGESINESLIGDSITAPVVSGQSKGSGAEYLAQIGVGQPVKLFYLVPDTGSDVT 172

Query: 184 WLQCRPCTE---CYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGD 240
           WLQC+PC     CY+Q DPIFDPK+SSSYSPL C + QCK LD + C ++ C+YQV YGD
Sbjct: 173 WLQCQPCASENTCYKQFDPIFDPKSSSSYSPLSCNSQQCKLLDKANCNSDTCIYQVHYGD 232

Query: 241 GSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS 300
           GSFT G+L TET+SFGNS S+  + +GCGHDNEGLF G AGL+GLGGG +SL+ Q+KA+S
Sbjct: 233 GSFTTGELATETLSFGNSNSIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKASS 292

Query: 301 LAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPS 360
            +YCLV+ DS +S  LEFNS    D++T+PL++N +  ++ YV + G SVGG+ + I P+
Sbjct: 293 FSYCLVNLDSDSSSTLEFNSYMPSDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPT 352

Query: 361 LFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGL 420
            FE+DE+G GGIIVD GT I+RL +  Y SLR++FV+L  +L P  G+++FDTCY+FSG 
Sbjct: 353 RFEIDESGLGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFSGQ 412

Query: 421 RSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVS 480
            +V VPT++     G +L LPA+NYLI +D+AGT+C AF  T S+LSIIG+ QQQG RVS
Sbjct: 413 SNVEVPTIAFVLSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVS 472

Query: 481 FDLANNRVGFTPNKC 495
           +DL N+ VGF+ NKC
Sbjct: 473 YDLTNSIVGFSTNKC 487


>gi|116787333|gb|ABK24467.1| unknown [Picea sitchensis]
          Length = 497

 Score =  424 bits (1091), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 221/409 (54%), Positives = 287/409 (70%), Gaps = 13/409 (3%)

Query: 99  RSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKP-----AEAQILPEDFSTPVVSGAS 153
           + ++  RL+RD+ARV+++  ++QLA   V + E+KP      +A+   +DFS+ ++SG +
Sbjct: 88  KEILQERLKRDAARVDSINARVQLAAMGVSKAEMKPLNGSSIDARFDAKDFSSSIISGLA 147

Query: 154 QGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLP 213
           QGSGEYF+R+GVGTPPR   MVLDTGSDI W+QC PC +CY Q+DP+F+P  SS+Y  +P
Sbjct: 148 QGSGEYFTRLGVGTPPRYTYMVLDTGSDIMWIQCLPCAKCYGQTDPLFNPAASSTYRKVP 207

Query: 214 CAAPQCKSLDVSACRANR-CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDN 272
           CA P CK LD+S CR  R C YQV+YGDGSFTVGD  TET++F     ++ +ALGCGHDN
Sbjct: 208 CATPLCKKLDISGCRNKRYCEYQVSYGDGSFTVGDFSTETLTF-RGQVIRRVALGCGHDN 266

Query: 273 EGLFVGSAGLLGLGGGMLSLTKQIKA---TSLAYCLVDRDSP--ASGVLEFNSARGGDAV 327
           EGLF+G+AGLLGLG G LS   Q  A      +YCLVDR +   AS ++   +A    A+
Sbjct: 267 EGLFIGAAGLLGLGRGSLSFPSQTGAQFSKRFSYCLVDRSASGTASSLIFGKAAIPKSAI 326

Query: 328 TAPLIRNKKVDTFYYVGLTGFSVGGQAV-QIPPSLFEMDEAGDGGIIVDCGTAITRLQTQ 386
             PL+ N K+DTFYYV L G SVGG+ +  IP S+F MD  G+GG+I+D GT++TRL   
Sbjct: 327 FTPLLSNPKLDTFYYVELVGISVGGRRLTSIPASVFRMDATGNGGVIIDSGTSVTRLVDS 386

Query: 387 AYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYL 446
           AY+++RD+F    GNLK   G +LFDTCYD SGL++V+VPT+  HF  G  + LPA NYL
Sbjct: 387 AYSTMRDAFRVGTGNLKSAGGFSLFDTCYDLSGLKTVKVPTLVFHFQGGAHISLPATNYL 446

Query: 447 IPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           IPVDS+ TFCFAFA  +  LSIIGN+QQQG RV FD   NRVGF    C
Sbjct: 447 IPVDSSATFCFAFAGNTGGLSIIGNIQQQGYRVVFDSLANRVGFKAGSC 495


>gi|293336306|ref|NP_001168599.1| uncharacterized protein LOC100382383 [Zea mays]
 gi|223949441|gb|ACN28804.1| unknown [Zea mays]
          Length = 326

 Score =  402 bits (1034), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 207/326 (63%), Positives = 251/326 (76%), Gaps = 4/326 (1%)

Query: 174 MVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACR--ANR 231
           MVLDTGSD+ W+QC+PC +CYQQSDP+FDP  S+SY+ + C + +C+ LD +ACR     
Sbjct: 1   MVLDTGSDVTWVQCQPCADCYQQSDPVFDPSLSASYAAVSCDSQRCRDLDTAACRNATGA 60

Query: 232 CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLS 291
           CLY+VAYGDGS+TVGD  TET++ G+S  V  +A+GCGHDNEGLFVG+AGLL LGGG LS
Sbjct: 61  CLYEVAYGDGSYTVGDFATETLTLGDSTPVGNVAIGCGHDNEGLFVGAAGLLALGGGPLS 120

Query: 292 LTKQIKATSLAYCLVDRDSPASGVLEF-NSARGGDAVTAPLIRNKKVDTFYYVGLTGFSV 350
              QI A++ +YCLVDRDSPA+  L+F + A     VTAPL+R+ +  TFYYV L+G SV
Sbjct: 121 FPSQISASTFSYCLVDRDSPAASTLQFGDGAAEAGTVTAPLVRSPRTSTFYYVALSGISV 180

Query: 351 GGQAVQIPPSLFEMDE-AGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVA 409
           GGQ + IP S F MD  +G GG+IVD GTA+TRLQ+ AY +LRD+FV+ A +L  TSGV+
Sbjct: 181 GGQPLSIPASAFAMDATSGSGGVIVDSGTAVTRLQSAAYAALRDAFVQGAPSLPRTSGVS 240

Query: 410 LFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSII 469
           LFDTCYD S   SV VP VSL F  G AL LPAKNYLIPVD AGT+C AFAPT++A+SII
Sbjct: 241 LFDTCYDLSDRTSVEVPAVSLRFEGGGALRLPAKNYLIPVDGAGTYCLAFAPTNAAVSII 300

Query: 470 GNVQQQGTRVSFDLANNRVGFTPNKC 495
           GNVQQQGTRVSFD A   VGFTPNKC
Sbjct: 301 GNVQQQGTRVSFDTARGAVGFTPNKC 326


>gi|449440931|ref|XP_004138237.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 523

 Score =  400 bits (1029), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 240/468 (51%), Positives = 317/468 (67%), Gaps = 13/468 (2%)

Query: 31  SETATTVLDVSSALQQTEHILSFEPETLEPFAEESETAAESFPLNSSSSFSLPLHSREIL 90
           S  +T   DVS+++ Q  + LS +P+   PF    +T   ++  +S  S SL  H R  +
Sbjct: 66  SPYSTNTFDVSASINQALNALSIKPK---PF----QTTHSNYHSSSPLSLSL--HPRLTV 116

Query: 91  HKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVS 150
           H   + DY SLV +RL R +AR  +L  KL+L++    +   +           + PV S
Sbjct: 117 HNPSYEDYGSLVRARLARGAARAQSLNRKLELSLKGGKQFGRR-INGSDSTNSLTAPVTS 175

Query: 151 GASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPC---TECYQQSDPIFDPKTSS 207
           GASQG+GEYF+RIGVG P + +  V DTGSD++WLQC+PC     CY+Q  PIFDPK+SS
Sbjct: 176 GASQGAGEYFARIGVGQPVQSYFFVPDTGSDVSWLQCQPCDGENGCYKQIGPIFDPKSSS 235

Query: 208 SYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALG 267
           SYSPL C + QC  LD +AC AN C+Y+V YGDGSFTVG+L TET SF +S S+  + +G
Sbjct: 236 SYSPLSCDSEQCHLLDEAACDANSCIYEVEYGDGSFTVGELATETFSFRHSNSIPNLPIG 295

Query: 268 CGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGDAV 327
           CGHDNEGLFVG+ GL+GLGGG +SL+ Q++ATS +YCLVD DS +S  L+FN+ +  D++
Sbjct: 296 CGHDNEGLFVGADGLIGLGGGAISLSSQLEATSFSYCLVDLDSESSSTLDFNADQPSDSL 355

Query: 328 TAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQA 387
           T+PL++N +  TF YV + G SVGG+ + I  S FE+DE+G GGIIVD GT IT + +  
Sbjct: 356 TSPLVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFEIDESGSGGIIVDSGTTITEIPSDV 415

Query: 388 YNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLI 447
           Y+ LRD+FV L  NL P  GV+ FDTCYD S   +V VPT++       +L LPAKN LI
Sbjct: 416 YDVLRDAFVGLTKNLPPAPGVSPFDTCYDLSSQSNVEVPTIAFILPGENSLQLPAKNCLI 475

Query: 448 PVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            VDSAGTFC AF P++  LSIIGNVQQQG RVS+DLAN+ VGF+ +KC
Sbjct: 476 QVDSAGTFCLAFLPSTFPLSIIGNVQQQGIRVSYDLANSLVGFSTDKC 523


>gi|449527151|ref|XP_004170576.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 523

 Score =  399 bits (1025), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 240/468 (51%), Positives = 317/468 (67%), Gaps = 13/468 (2%)

Query: 31  SETATTVLDVSSALQQTEHILSFEPETLEPFAEESETAAESFPLNSSSSFSLPLHSREIL 90
           S  +T   DVS+++ Q  + LS +P+   PF    +T   ++  +S  S SL  H R  +
Sbjct: 66  SPYSTNTFDVSASINQALNALSIKPK---PF----QTTHSNYHSSSPLSLSL--HPRLTV 116

Query: 91  HKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVS 150
           H   + DY SLV +RL R +AR  +L  KL+L++    +   +           + PV S
Sbjct: 117 HNPSYEDYGSLVRARLARGAARAQSLNRKLELSLKGGKQFGRR-INGSDSTNSLTAPVTS 175

Query: 151 GASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPC---TECYQQSDPIFDPKTSS 207
           GASQG+GEYF+RIGVG P + +  V DTGSD++WLQC+PC     CY+Q  PIFDPK+SS
Sbjct: 176 GASQGAGEYFARIGVGQPVQSYFFVPDTGSDVSWLQCQPCDGENGCYKQIGPIFDPKSSS 235

Query: 208 SYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALG 267
           SYSPL C + QC  LD +AC AN C+Y+V YGDGSFTVG+L TET SF +S S+  + +G
Sbjct: 236 SYSPLSCDSEQCHLLDEAACDANSCIYEVEYGDGSFTVGELATETFSFRHSNSIPNLPIG 295

Query: 268 CGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGDAV 327
           CGHDNEGLFVG+AGL+GLGGG +SL+ Q++ATS +YCLVD DS +S  L+FN+ +  D++
Sbjct: 296 CGHDNEGLFVGAAGLIGLGGGAISLSSQLEATSFSYCLVDLDSESSSTLDFNADQPSDSL 355

Query: 328 TAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQA 387
           T+PL++N +  TF YV + G SVGG+ + I  S FE+DE+G GGIIVD GT IT + +  
Sbjct: 356 TSPLVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFEIDESGSGGIIVDSGTTITEIPSDV 415

Query: 388 YNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLI 447
           Y+ LRD+FV L  NL P  GV+ FDTCYD S   +V VPT++       +L LPAKN L 
Sbjct: 416 YDVLRDAFVGLTKNLPPAPGVSPFDTCYDLSSQSNVEVPTIAFILPGENSLQLPAKNCLF 475

Query: 448 PVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            VDSAGTFC AF P++  LSIIGNVQQQG RVS+DLAN+ VGF+ +KC
Sbjct: 476 QVDSAGTFCLAFLPSTFPLSIIGNVQQQGIRVSYDLANSLVGFSTDKC 523


>gi|225463766|ref|XP_002267930.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 479

 Score =  396 bits (1018), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 210/423 (49%), Positives = 280/423 (66%), Gaps = 22/423 (5%)

Query: 80  FSLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLA---IYNVDRHELKPAE 136
           + + +  R+ L     +D+R  +  RL+RD+ RV +LI +L       Y VD        
Sbjct: 72  WMMKVVHRDQLSFGNSDDHRHRLDGRLKRDAKRVASLIRRLSSGGGGSYRVD-------- 123

Query: 137 AQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQ 196
                 DF T V+SG  QGSGEYF RIGVG+PPR   MV+D+GSDI W+QC+PCT+CY Q
Sbjct: 124 ------DFGTDVISGMEQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCTQCYHQ 177

Query: 197 SDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFG 256
           SDP+FDP  S+S++ + C++  C  L+ + C A RC Y+V+YGDGS+T G L  ET++FG
Sbjct: 178 SDPVFDPADSASFTGVSCSSSVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTFG 237

Query: 257 NSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS---LAYCLVDRDSPAS 313
            +  V+ +A+GCGH N G+FVG+AGLLGLGGG +S   Q+   +    +YCLV R + +S
Sbjct: 238 RT-MVRSVAIGCGHRNRGMFVGAAGLLGLGGGSMSFVGQLGGQTGGAFSYCLVSRGTDSS 296

Query: 314 GVLEFN-SARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGI 372
           G L F   A    A   PL+RN +  +FYY+GL G  VGG  V I   +F + E GDGG+
Sbjct: 297 GSLVFGREALPAGAAWVPLVRNPRAPSFYYIGLAGLGVGGIRVPISEEVFRLTELGDGGV 356

Query: 373 IVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHF 432
           ++D GTA+TRL T AY + RD+F+    NL   +GVA+FDTCYD  G  SVRVPTVS +F
Sbjct: 357 VMDTGTAVTRLPTLAYQAFRDAFLAQTANLPRATGVAIFDTCYDLLGFVSVRVPTVSFYF 416

Query: 433 GAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTP 492
             G  L LPA+N+LIP+D AGTFCFAFAP++S LSI+GN+QQ+G ++SFD AN  VGF P
Sbjct: 417 SGGPILTLPARNFLIPMDDAGTFCFAFAPSTSGLSILGNIQQEGIQISFDGANGYVGFGP 476

Query: 493 NKC 495
           N C
Sbjct: 477 NIC 479


>gi|356575389|ref|XP_003555824.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 473

 Score =  395 bits (1016), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 209/427 (48%), Positives = 278/427 (65%), Gaps = 20/427 (4%)

Query: 75  NSSSSFSLPLHSREIL--HKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHEL 132
           +S + + L L  R+ +    T H D+R+   +R++RD+ RV  L            R  L
Sbjct: 61  SSPAKYKLKLVHRDKVPTFNTSH-DHRTRFNARMQRDTKRVAAL------------RRHL 107

Query: 133 KPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTE 192
              +     E F + VVSG  QGSGEYF RIGVG+PPR   +V+D+GSDI W+QC PCT+
Sbjct: 108 AAGKPTYAEEAFGSDVVSGMEQGSGEYFVRIGVGSPPRNQYVVIDSGSDIIWVQCEPCTQ 167

Query: 193 CYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTET 252
           CY QSDP+F+P  SSSY+ + CA+  C  +D + C   RC Y+V+YGDGS+T G L  ET
Sbjct: 168 CYHQSDPVFNPADSSSYAGVSCASTVCSHVDNAGCHEGRCRYEVSYGDGSYTKGTLALET 227

Query: 253 VSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRD 309
           ++FG +  ++ +A+GCGH N+G+FVG+AGLLGLG G +S   Q+      + +YCLV R 
Sbjct: 228 LTFGRT-LIRNVAIGCGHHNQGMFVGAAGLLGLGSGPMSFVGQLGGQAGGTFSYCLVSRG 286

Query: 310 SPASGVLEF-NSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAG 368
             +SG+L+F   A    A   PLI N +  +FYYVGL+G  VGG  V I   +F++ E G
Sbjct: 287 IQSSGLLQFGREAVPVGAAWVPLIHNPRAQSFYYVGLSGLGVGGLRVPISEDVFKLSELG 346

Query: 369 DGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTV 428
           DGG+++D GTA+TRL T AY + RD+F+    NL   SGV++FDTCYD  G  SVRVPTV
Sbjct: 347 DGGVVMDTGTAVTRLPTAAYEAFRDAFIAQTTNLPRASGVSIFDTCYDLFGFVSVRVPTV 406

Query: 429 SLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRV 488
           S +F  G  L LPA+N+LIPVD  G+FCFAFAP+SS LSIIGN+QQ+G  +S D AN  V
Sbjct: 407 SFYFSGGPILTLPARNFLIPVDDVGSFCFAFAPSSSGLSIIGNIQQEGIEISVDGANGFV 466

Query: 489 GFTPNKC 495
           GF PN C
Sbjct: 467 GFGPNVC 473


>gi|297848386|ref|XP_002892074.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297337916|gb|EFH68333.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 485

 Score =  394 bits (1011), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 236/454 (51%), Positives = 293/454 (64%), Gaps = 22/454 (4%)

Query: 51  LSFEPETLEPFAEESETAAESFPLNSSSSFSLPLHSREILHKTRHNDYRSLVLSRLERDS 110
           +SF+PE+ EP +E        F   S S  S+ L+   I   + +   + L  SRL+RDS
Sbjct: 44  ISFQPES-EPDSES--LLGSEFESGSDSESSITLNLDHIDALSSNKTPQELFSSRLQRDS 100

Query: 111 ARVNTLIT-KLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPP 169
            RV ++ T   Q+   NV  H  +          FS+ VVSG SQGSGEYF+R+GVGTP 
Sbjct: 101 RRVKSIATLAAQIPGRNVT-HAPRTG-------GFSSSVVSGLSQGSGEYFTRLGVGTPA 152

Query: 170 RQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRA 229
           R   MVLDTGSDI WLQC PC  CY QSDPIFDP+ S +Y+ +PC++P C+ LD + C  
Sbjct: 153 RYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKSKTYATIPCSSPHCRRLDSAGCNT 212

Query: 230 NR--CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGG 287
            R  CLYQV+YGDGSFTVGD  TET++F     VKG+ALGCGHDNEGLFVG+AGLLGLG 
Sbjct: 213 RRKTCLYQVSYGDGSFTVGDFSTETLTF-RRNRVKGVALGCGHDNEGLFVGAAGLLGLGK 271

Query: 288 GMLSLTKQIKA---TSLAYCLVDR--DSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYY 342
           G LS   Q         +YCLVDR   S  S V+  N+A    A   PL+ N K+DTFYY
Sbjct: 272 GKLSFPGQTGHRFNQKFSYCLVDRSASSKPSSVVFGNAAVSRIARFTPLLSNPKLDTFYY 331

Query: 343 VGLTGFSVGGQAVQ-IPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGN 401
           V L G SVGG  V  +  SLF++D+ G+GG+I+D GT++TRL   AY ++RD+F   A  
Sbjct: 332 VELLGISVGGTRVPGVAASLFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKA 391

Query: 402 LKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAP 461
           LK     +LFDTC+D S +  V+VPTV LHF  G  + LPA NYLIPVD+ G FCFAFA 
Sbjct: 392 LKRAPDFSLFDTCFDLSNMNEVKVPTVVLHF-RGADVSLPATNYLIPVDTNGKFCFAFAG 450

Query: 462 TSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           T   LSIIGN+QQQG RV +DLA++RVGF P  C
Sbjct: 451 TMGGLSIIGNIQQQGFRVVYDLASSRVGFAPGGC 484


>gi|15223368|ref|NP_171637.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|9665144|gb|AAF97328.1|AC023628_9 Unknown protein [Arabidopsis thaliana]
 gi|22135930|gb|AAM91547.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
           thaliana]
 gi|30387595|gb|AAP31963.1| At1g01300 [Arabidopsis thaliana]
 gi|332189147|gb|AEE27268.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 485

 Score =  392 bits (1008), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 225/404 (55%), Positives = 273/404 (67%), Gaps = 19/404 (4%)

Query: 101 LVLSRLERDSARVNTLIT-KLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEY 159
           L  SRL+RDS RV ++ T   Q+   NV  H  +P         FS+ VVSG SQGSGEY
Sbjct: 91  LFSSRLQRDSRRVKSIATLAAQIPGRNVT-HAPRPG-------GFSSSVVSGLSQGSGEY 142

Query: 160 FSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQC 219
           F+R+GVGTP R   MVLDTGSDI WLQC PC  CY QSDPIFDP+ S +Y+ +PC++P C
Sbjct: 143 FTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKSKTYATIPCSSPHC 202

Query: 220 KSLDVSACRANR--CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFV 277
           + LD + C   R  CLYQV+YGDGSFTVGD  TET++F     VKG+ALGCGHDNEGLFV
Sbjct: 203 RRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTF-RRNRVKGVALGCGHDNEGLFV 261

Query: 278 GSAGLLGLGGGMLSLTKQIKA---TSLAYCLVDR--DSPASGVLEFNSARGGDAVTAPLI 332
           G+AGLLGLG G LS   Q         +YCLVDR   S  S V+  N+A    A   PL+
Sbjct: 262 GAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASSKPSSVVFGNAAVSRIARFTPLL 321

Query: 333 RNKKVDTFYYVGLTGFSVGGQAVQ-IPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSL 391
            N K+DTFYYVGL G SVGG  V  +  SLF++D+ G+GG+I+D GT++TRL   AY ++
Sbjct: 322 SNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGTSVTRLIRPAYIAM 381

Query: 392 RDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDS 451
           RD+F   A  LK     +LFDTC+D S +  V+VPTV LHF  G  + LPA NYLIPVD+
Sbjct: 382 RDAFRVGAKTLKRAPDFSLFDTCFDLSNMNEVKVPTVVLHF-RGADVSLPATNYLIPVDT 440

Query: 452 AGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            G FCFAFA T   LSIIGN+QQQG RV +DLA++RVGF P  C
Sbjct: 441 NGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLASSRVGFAPGGC 484


>gi|21594980|gb|AAM66061.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
           thaliana]
          Length = 485

 Score =  391 bits (1004), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 224/406 (55%), Positives = 273/406 (67%), Gaps = 19/406 (4%)

Query: 99  RSLVLSRLERDSARVNTLIT-KLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSG 157
           + L  SRL+RDS RV ++ T   Q+   NV  H  +P         FS+ VVSG SQGSG
Sbjct: 89  QELFSSRLQRDSRRVRSIATLAAQIPGRNVT-HAPRPG-------GFSSSVVSGLSQGSG 140

Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAP 217
           EYF+R+GVGTP R   MVLDTGSDI WLQC PC  CY QSDPIFDP+ S +Y+ +PC++P
Sbjct: 141 EYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKSKTYATIPCSSP 200

Query: 218 QCKSLDVSACRANR--CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGL 275
            C+ LD + C   R  CLYQV+YGDGSFTVGD  TET++F     VKG+ALGCGHDNEGL
Sbjct: 201 HCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTF-RRNRVKGVALGCGHDNEGL 259

Query: 276 FVGSAGLLGLGGGMLSLTKQIKA---TSLAYCLVDR--DSPASGVLEFNSARGGDAVTAP 330
           FVG+AGLLGLG G LS   Q         +YCLVDR   S  S V+  N+A    A   P
Sbjct: 260 FVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASSKPSSVVFGNAAVSRIARFTP 319

Query: 331 LIRNKKVDTFYYVGLTGFSVGGQAVQ-IPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYN 389
           L+ N K+DTFYYVGL G SVGG  V  +  SLF++D+ G+GG+I+D GT++TRL   AY 
Sbjct: 320 LLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGTSVTRLIRPAYI 379

Query: 390 SLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPV 449
           ++RD+F   A  LK     +LFDTC+D S +  V+VPTV LHF     + LPA NYLIPV
Sbjct: 380 AMRDAFRVGAKTLKRAPNFSLFDTCFDLSNMNEVKVPTVVLHFRRAD-VSLPATNYLIPV 438

Query: 450 DSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           D+ G FCFAFA T   LSIIGN+QQQG RV +DLA++RVGF P  C
Sbjct: 439 DTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLASSRVGFAPGGC 484


>gi|255544139|ref|XP_002513132.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223548143|gb|EEF49635.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 481

 Score =  389 bits (999), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 215/468 (45%), Positives = 304/468 (64%), Gaps = 35/468 (7%)

Query: 37  VLDVSSALQQTEHILSFEPETLEPFAEESETAAESFPLNSSSSFSLPLHSREIL---HKT 93
           +L+V  A+ +T+       +  E F  +++T  E         + L L  R+ +   +K+
Sbjct: 40  LLNVKEAITETK-----ASQYQELFDNQNDTLTEG-------KWKLKLVHRDKITAFNKS 87

Query: 94  RHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGAS 153
            + D+     +R++RD  RV TLI +L            + A +    E+F   VVSG +
Sbjct: 88  SY-DHSHNFHARIQRDKKRVATLIRRL----------SPRDATSSYSVEEFGAEVVSGMN 136

Query: 154 QGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLP 213
           QGSGEYF RIGVG+PPR+  +V+D+GSDI W+QC+PCT+CY Q+DP+FDP  S+S+  +P
Sbjct: 137 QGSGEYFIRIGVGSPPREQYVVIDSGSDIVWVQCQPCTQCYHQTDPVFDPADSASFMGVP 196

Query: 214 CAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNE 273
           C++  C+ ++ + C A  C Y+V YGDGS+T G L  ET++FG +  V+ +A+GCGH N 
Sbjct: 197 CSSSVCERIENAGCHAGGCRYEVMYGDGSYTKGTLALETLTFGRT-VVRNVAIGCGHRNR 255

Query: 274 GLFVGSAGLLGLGGGMLSLTKQIKATS---LAYCLVDRDSPASGVLEFNSARGGDAVTA- 329
           G+FVG+AGLLGLGGG +SL  Q+   +    +YCLV R + ++G LEF   RG   V A 
Sbjct: 256 GMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTDSAGSLEF--GRGAMPVGAA 313

Query: 330 --PLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQA 387
             PLIRN +  +FYY+ L+G  VGG  V I   +F+++E G+GG+++D GTA+TR+ T A
Sbjct: 314 WIPLIRNPRAPSFYYIRLSGVGVGGMKVPISEDVFQLNEMGNGGVVMDTGTAVTRIPTVA 373

Query: 388 YNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLI 447
           Y + RD+F+   GNL   SGV++FDTCY+ +G  SVRVPTVS +F  G  L LPA+N+LI
Sbjct: 374 YVAFRDAFIGQTGNLPRASGVSIFDTCYNLNGFVSVRVPTVSFYFAGGPILTLPARNFLI 433

Query: 448 PVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           PVD  GTFCFAFA + S LSIIGN+QQ+G ++SFD AN  VGF PN C
Sbjct: 434 PVDDVGTFCFAFAASPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 481


>gi|449443786|ref|XP_004139658.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
 gi|449475416|ref|XP_004154449.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 453

 Score =  386 bits (992), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 221/401 (55%), Positives = 270/401 (67%), Gaps = 35/401 (8%)

Query: 105 RLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIG 164
           RL RD+ RV+ L ++                        FS+ VVSG SQGSGEYF+R+G
Sbjct: 77  RLHRDTLRVHALNSR---------------------AAGFSSSVVSGLSQGSGEYFTRLG 115

Query: 165 VGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDV 224
           VGTPPR   MVLDTGSD+ WLQC PC +CY QSDPIF+P  S S++ +PC++P C+ LD 
Sbjct: 116 VGTPPRYLYMVLDTGSDVVWLQCSPCRKCYSQSDPIFNPYKSKSFAGIPCSSPLCRRLDS 175

Query: 225 SAC--RANRCLYQVAYGDGSFTVGDLVTETVSF-GNSGSVKGIALGCGHDNEGLFVGSAG 281
           S C  R + CLYQV+YGDGSFT GD  TET++F GN   +  +ALGCGH NEGLFVG+AG
Sbjct: 176 SGCSTRRHTCLYQVSYGDGSFTTGDFATETLTFRGN--KIAKVALGCGHHNEGLFVGAAG 233

Query: 282 LLGLGGGMLSLTKQIKAT---SLAYCLVDRDS---PASGVLEFNSARGGDAVTAPLIRNK 335
           LLGLG G LS   Q         +YCLVDR +   P+S V   ++A    A   PLIRN 
Sbjct: 234 LLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSKPSSMVFG-DAAISRLARFTPLIRNP 292

Query: 336 KVDTFYYVGLTGFSVGGQAVQ-IPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDS 394
           K+DTFYYVGL G SVGG  V+ + PSLF++D AG+GG+I+D GT++TRL   AY +LRD+
Sbjct: 293 KLDTFYYVGLIGISVGGVRVRGVSPSLFKLDSAGNGGVIIDSGTSVTRLTRPAYTALRDA 352

Query: 395 FVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGT 454
           F   A +LK     +LFDTCYD SG  SV+VPTV LHF  G  + LPA NYLIPVD  G+
Sbjct: 353 FRVGARHLKRGPEFSLFDTCYDLSGQSSVKVPTVVLHF-RGADMALPATNYLIPVDENGS 411

Query: 455 FCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           FCFAFA T S LSIIGN+QQQG RV +DLA +R+GF P  C
Sbjct: 412 FCFAFAGTISGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGC 452


>gi|148907930|gb|ABR17085.1| unknown [Picea sitchensis]
          Length = 498

 Score =  385 bits (988), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 235/510 (46%), Positives = 312/510 (61%), Gaps = 38/510 (7%)

Query: 3   PIKPFVLFTITTILFSFCLFTSASSRGLSET---ATTVLDVSSALQQTEHILSFEPETLE 59
           P+ PF       +L       SA SR +S     A   LDV+S+L++T+           
Sbjct: 10  PLLPFTFLLCVGMLL---FLQSAQSRPISVPEVPAYHALDVASSLRETDTA--------- 57

Query: 60  PFAEESETAAESFPLNSSSSFSLPLHSREILHKTRHN---DYRSLVLSRLERDSARVNTL 116
             A  +E   E+ P  S  S  + +H   +L K   N    Y   +  +L R++ RV  L
Sbjct: 58  --AGGAEYKRETKPRRSPWSVEV-VHRDALLLKNAANATASYERRLKEKLRREAVRVRGL 114

Query: 117 ITKLQLAIY----NVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQF 172
             +++  +      V+R+E   AE      DF   VVSG  QGSGEYF+RIGVGTP R+ 
Sbjct: 115 ERQIERTLTLNKDPVNRYE-NVAEVD---ADFGGEVVSGMEQGSGEYFTRIGVGTPTREQ 170

Query: 173 SMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANRC 232
            MVLDTGSD+ W+QC PC ECY Q+DPIF+P  S+S+S + C +  C  LD   C +  C
Sbjct: 171 YMVLDTGSDVAWIQCEPCRECYSQADPIFNPSYSASFSTVGCDSAVCSQLDAYDCHSGGC 230

Query: 233 LYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSL 292
           LY+ +YGDGS++ G   TET++FG + SV  +A+GCGH N GLF+G+AGLLGLG G LS 
Sbjct: 231 LYEASYGDGSYSTGSFATETLTFGTT-SVANVAIGCGHKNVGLFIGAAGLLGLGAGALSF 289

Query: 293 TKQI---KATSLAYCLVDRDSPASGVLEF--NSARGGDAVTAPLIRNKKVDTFYYVGLTG 347
             QI      + +YCLVDR+S +SG L+F   S   G   T PL +N  + TFYY+ +T 
Sbjct: 290 PNQIGTQTGHTFSYCLVDRESDSSGPLQFGPKSVPVGSIFT-PLEKNPHLPTFYYLSVTA 348

Query: 348 FSVGGQAVQ-IPPSLFEMDE-AGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPT 405
            SVGG  +  IPP +F +DE +G GG I+D GT +TRL T AY+++RD+FV   G L  T
Sbjct: 349 ISVGGALLDSIPPEVFRIDETSGHGGFIIDSGTVVTRLVTSAYDAVRDAFVAGTGQLPRT 408

Query: 406 SGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSA 465
             V++FDTCYD SGL+ V VPTV  HF  G +L LPAKNYLIP+D+ GTFCFAFAP +S+
Sbjct: 409 DAVSIFDTCYDLSGLQFVSVPTVGFHFSNGASLILPAKNYLIPMDTVGTFCFAFAPAASS 468

Query: 466 LSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           +SI+GN QQQ  RVSFD AN+ VGF  ++C
Sbjct: 469 VSIMGNTQQQHIRVSFDSANSLVGFAFDQC 498


>gi|356536463|ref|XP_003536757.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 475

 Score =  384 bits (985), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 215/442 (48%), Positives = 291/442 (65%), Gaps = 21/442 (4%)

Query: 59  EPFAEESETAAESFPLNSSSSFSLPL-HSREILHKTRHNDYRSLVLSRLERDSARVNTLI 117
            P  ++  +A E+   +SS+ + L L H  ++     ++D+R+   +R++RD+ R  +L+
Sbjct: 50  HPHNKKLNSATEA---SSSAKYKLKLVHRDKVPTFNTYHDHRTRFNARMQRDTKRAASLL 106

Query: 118 TKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLD 177
            +L            KP  A    E F + VVSG  QGSGEYF RIGVG+PPR   +V+D
Sbjct: 107 RRLAAG---------KPTYAA---EAFGSDVVSGMEQGSGEYFVRIGVGSPPRNQYVVMD 154

Query: 178 TGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVA 237
           +GSDI W+QC PCT+CY QSDP+F+P  SSS+S + CA+  C  +D +AC   RC Y+V+
Sbjct: 155 SGSDIIWVQCEPCTQCYHQSDPVFNPADSSSFSGVSCASTVCSHVDNAACHEGRCRYEVS 214

Query: 238 YGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIK 297
           YGDGS+T G L  ET++FG +  ++ +A+GCGH N+G+FVG+AGLLGLGGG +S   Q+ 
Sbjct: 215 YGDGSYTKGTLALETITFGRT-LIRNVAIGCGHHNQGMFVGAAGLLGLGGGPMSFVGQLG 273

Query: 298 ATS---LAYCLVDRDSPASGVLEF-NSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQ 353
             +    +YCLV R   +SG+LEF   A    A   PLI N +  +FYY+GL+G  VGG 
Sbjct: 274 GQTGGAFSYCLVSRGIESSGLLEFGREAMPVGAAWVPLIHNPRAQSFYYIGLSGLGVGGL 333

Query: 354 AVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDT 413
            V I   +F++ E GDGG+++D GTA+TRL T AY + RD F+    NL   SGV++FDT
Sbjct: 334 RVSISEDVFKLSELGDGGVVMDTGTAVTRLPTVAYEAFRDGFIAQTTNLPRASGVSIFDT 393

Query: 414 CYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQ 473
           CYD  G  SVRVPTVS +F  G  L LPA+N+LIPVD  GTFCFAFAP+SS LSIIGN+Q
Sbjct: 394 CYDLFGFVSVRVPTVSFYFSGGPILTLPARNFLIPVDDVGTFCFAFAPSSSGLSIIGNIQ 453

Query: 474 QQGTRVSFDLANNRVGFTPNKC 495
           Q+G ++S D AN  VGF PN C
Sbjct: 454 QEGIQISVDGANGFVGFGPNVC 475


>gi|297742733|emb|CBI35367.3| unnamed protein product [Vitis vinifera]
          Length = 521

 Score =  384 bits (985), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 204/422 (48%), Positives = 272/422 (64%), Gaps = 39/422 (9%)

Query: 80  FSLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLA---IYNVDRHELKPAE 136
           + + +  R+ L     +D+R  +  RL+RD+ RV +LI +L       Y VD        
Sbjct: 133 WMMKVVHRDQLSFGNSDDHRHRLDGRLKRDAKRVASLIRRLSSGGGGSYRVD-------- 184

Query: 137 AQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQ 196
                 DF T V+SG  QGSGEYF RIGVG+PPR   MV+D+GSDI W+QC+PCT+CY Q
Sbjct: 185 ------DFGTDVISGMEQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCTQCYHQ 238

Query: 197 SDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFG 256
           SDP+FDP  S+S++ + C++  C  L+ + C A RC Y+V+YGDGS+T G L  ET++FG
Sbjct: 239 SDPVFDPADSASFTGVSCSSSVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTFG 298

Query: 257 NSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS---LAYCLVDRDSPAS 313
            +  V+ +A+GCGH N G+FVG+AGLLGLGGG +S   Q+   +    +YCLV       
Sbjct: 299 RT-MVRSVAIGCGHRNRGMFVGAAGLLGLGGGSMSFVGQLGGQTGGAFSYCLVS------ 351

Query: 314 GVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGII 373
                       A   PL+RN +  +FYY+GL G  VGG  V I   +F + E GDGG++
Sbjct: 352 ------------AAWVPLVRNPRAPSFYYIGLAGLGVGGIRVPISEEVFRLTELGDGGVV 399

Query: 374 VDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFG 433
           +D GTA+TRL T AY + RD+F+    NL   +GVA+FDTCYD  G  SVRVPTVS +F 
Sbjct: 400 MDTGTAVTRLPTLAYQAFRDAFLAQTANLPRATGVAIFDTCYDLLGFVSVRVPTVSFYFS 459

Query: 434 AGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPN 493
            G  L LPA+N+LIP+D AGTFCFAFAP++S LSI+GN+QQ+G ++SFD AN  VGF PN
Sbjct: 460 GGPILTLPARNFLIPMDDAGTFCFAFAPSTSGLSILGNIQQEGIQISFDGANGYVGFGPN 519

Query: 494 KC 495
            C
Sbjct: 520 IC 521


>gi|168008816|ref|XP_001757102.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162691600|gb|EDQ77961.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 406

 Score =  382 bits (982), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 210/402 (52%), Positives = 265/402 (65%), Gaps = 13/402 (3%)

Query: 106 LERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGV 165
           + RD+ RV ++  ++   +  + R   +  + ++  +DF  PVVSG S GSGEYF RI V
Sbjct: 5   ISRDNLRVASIHGRINQTVNGLTRSRSRDRQTKVPSQDFQAPVVSGLSLGSGEYFIRISV 64

Query: 166 GTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVS 225
           GTPPR+  +V+DTGSDI WLQC PC  CY QSD IFDP  SS+YS L C+  QC +LD+ 
Sbjct: 65  GTPPRRMYLVMDTGSDILWLQCAPCVNCYHQSDAIFDPYKSSTYSTLGCSTRQCLNLDIG 124

Query: 226 ACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSV-----KGIALGCGHDNEGLFVGSA 280
            C+AN+CLYQV YGDGSFT G+  T+ VS  ++  V       I LGCGHDNEG FVG+A
Sbjct: 125 TCQANKCLYQVDYGDGSFTTGEFGTDDVSLNSTSGVGQVVLNKIPLGCGHDNEGYFVGAA 184

Query: 281 GLLGLGGGMLSLTKQIKATS---LAYCLVDR--DSPASGVLEFNSAR--GGDAVTAPLIR 333
           GLLGLG G LS   Q+   +    +YCL DR  DS     L F  A      A   P   
Sbjct: 185 GLLGLGKGPLSFPNQVDPQNGGRFSYCLTDRETDSTEGSSLVFGEAAVPPAGARFTPQDS 244

Query: 334 NKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRD 393
           N +V TFYY+ +TG SVGG  + IP S F++D  G+GG+I+D GT++TRLQ  AY SLRD
Sbjct: 245 NMRVPTFYYLKMTGISVGGTILTIPTSAFQLDSLGNGGVIIDSGTSVTRLQNAAYASLRD 304

Query: 394 SFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAG 453
           +F     +L PT+G +LFDTCYD SGL SV VPTV+LHF  G  L LPA NYLIPVD++ 
Sbjct: 305 AFRAGTSDLAPTAGFSLFDTCYDLSGLASVDVPTVTLHFQGGTDLKLPASNYLIPVDNSN 364

Query: 454 TFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           TFC AFA T+   SIIGN+QQQG RV +D  +N+VGF P++C
Sbjct: 365 TFCLAFAGTTGP-SIIGNIQQQGFRVIYDNLHNQVGFVPSQC 405


>gi|116787367|gb|ABK24480.1| unknown [Picea sitchensis]
          Length = 496

 Score =  382 bits (980), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 211/426 (49%), Positives = 278/426 (65%), Gaps = 20/426 (4%)

Query: 84  LHSREILHKTRHN---DYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPA----E 136
           +H   +L K   N    Y   +  +L R++ARV  L  +++  +    + +  PA     
Sbjct: 76  VHRDSLLFKGAANATASYERRLEEKLRREAARVRALEQRIERKL----KLKKDPAGSYEN 131

Query: 137 AQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQ 196
              +  +F + VVSG  QGSGEYF+RIG+GTP R+  MVLDTGSD+ W+QC PC ECY Q
Sbjct: 132 VAGVTAEFGSEVVSGMEQGSGEYFTRIGIGTPTREQYMVLDTGSDVVWIQCEPCRECYSQ 191

Query: 197 SDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFG 256
           +DPIF+P +S S+S + C +  C  LD + C    CLY+V+YGDGS+TVG   TET++FG
Sbjct: 192 ADPIFNPSSSVSFSTVGCDSAVCSQLDANDCHGGGCLYEVSYGDGSYTVGSYATETLTFG 251

Query: 257 NSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQI---KATSLAYCLVDRDSPAS 313
            + S++ +A+GCGHDN GLFVG+AGLLGLG G LS   Q+      + +YCLVDRDS +S
Sbjct: 252 TT-SIQNVAIGCGHDNVGLFVGAAGLLGLGAGSLSFPAQLGTQTGRAFSYCLVDRDSESS 310

Query: 314 GVLEF--NSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQ-IPPSLFEMDEA-GD 369
           G LEF   S   G   T PL+ N  + TFYY+ +   SVGG  +  +P   F +DE  G 
Sbjct: 311 GTLEFGPESVPIGSIFT-PLVANPFLPTFYYLSMVAISVGGVILDSVPSEAFRIDETTGR 369

Query: 370 GGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVS 429
           GGII+D GTA+TRLQT AY++LRD+F+    +L    G+++FDTCYD S L+SV +P V 
Sbjct: 370 GGIIIDSGTAVTRLQTSAYDALRDAFIAGTQHLPRADGISIFDTCYDLSALQSVSIPAVG 429

Query: 430 LHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVG 489
            HF  G    LPAKN LIP+DS GTFCFAFAP  S LSI+GN+QQQG RVSFD AN+ VG
Sbjct: 430 FHFSNGAGFILPAKNCLIPMDSMGTFCFAFAPADSNLSIMGNIQQQGIRVSFDSANSLVG 489

Query: 490 FTPNKC 495
           F  ++C
Sbjct: 490 FAIDQC 495


>gi|224130548|ref|XP_002320868.1| predicted protein [Populus trichocarpa]
 gi|222861641|gb|EEE99183.1| predicted protein [Populus trichocarpa]
          Length = 488

 Score =  381 bits (978), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 227/446 (50%), Positives = 292/446 (65%), Gaps = 21/446 (4%)

Query: 59  EPFAEESETAAESFPLNSSSSF-SLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLI 117
           EP  +       S P +S+++F S+ LH  + L   + +  + L  SRL RD+ARV +LI
Sbjct: 54  EPGTQTFTDQTTSEPSSSATTFLSVQLHHIDALSSDKSS--QDLFNSRLVRDAARVKSLI 111

Query: 118 TKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLD 177
           +   LA   V    L  A        FS+ V+SG +QGSGEYF+R+GVGTP R   MVLD
Sbjct: 112 S---LAA-TVGGTNLTRARG----PGFSSSVISGLAQGSGEYFTRLGVGTPARYVYMVLD 163

Query: 178 TGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANR--CLYQ 235
           TGSDI W+QC PC +CY Q+DP+FDP  S S++ +PC +P C+ LD   C   +  CLYQ
Sbjct: 164 TGSDIVWIQCAPCIKCYSQTDPVFDPTKSRSFANIPCGSPLCRRLDYPGCSTKKQICLYQ 223

Query: 236 VAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQ 295
           V+YGDGSFTVG+  TET++F  +  V  + LGCGHDNEGLFVG+AGLLGLG G LS   Q
Sbjct: 224 VSYGDGSFTVGEFSTETLTFRGT-RVGRVVLGCGHDNEGLFVGAAGLLGLGRGRLSFPSQ 282

Query: 296 IKA---TSLAYCLVDRDSPA--SGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSV 350
           I     +  +YCL DR + +  S ++  +SA        PL+ N K+DTFYYV L G SV
Sbjct: 283 IGRRFNSKFSYCLGDRSASSRPSSIVFGDSAISRTTRFTPLLSNPKLDTFYYVELLGISV 342

Query: 351 GGQAVQ-IPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVA 409
           GG  V  I  SLF++D  G+GG+I+D GT++TRL   AY +LRD+F+  A NLK     +
Sbjct: 343 GGTRVSGISASLFKLDSTGNGGVIIDSGTSVTRLTRAAYVALRDAFLVGASNLKRAPEFS 402

Query: 410 LFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSII 469
           LFDTC+D SG   V+VPTV LHF  G  + LPA NYLIPVD++G+FCFAFA T+S LSII
Sbjct: 403 LFDTCFDLSGKTEVKVPTVVLHF-RGADVPLPASNYLIPVDNSGSFCFAFAGTASGLSII 461

Query: 470 GNVQQQGTRVSFDLANNRVGFTPNKC 495
           GN+QQQG RV +DLA +RVGF P  C
Sbjct: 462 GNIQQQGFRVVYDLATSRVGFAPRGC 487


>gi|255564685|ref|XP_002523337.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223537425|gb|EEF39053.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 469

 Score =  380 bits (975), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 221/451 (49%), Positives = 287/451 (63%), Gaps = 25/451 (5%)

Query: 54  EPETLEPFAEESETAAESFPLNSSSSFSLPLHSREILHKTRHNDYRSLVLSRLERDSARV 113
            P   +P    +++ + +    SS++FS+ LH  + L  + ++   +L  +RL+RD+ARV
Sbjct: 34  NPLRSQPTLSWTDSESPTDTAESSATFSVQLHHVDAL--SFNSTPETLFTTRLQRDAARV 91

Query: 114 NTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFS 173
             +    + A              + +   FS+ V+SG +QGSGEYF+RIGVGTPPR   
Sbjct: 92  EAISYLAETA-----------GTGKRVGTGFSSSVISGLAQGSGEYFTRIGVGTPPRYVY 140

Query: 174 MVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANR-- 231
           MVLDTGSDI W+QC PC  CY QSDP+FDP+ S S++ + C +P C  LD   C   +  
Sbjct: 141 MVLDTGSDIVWIQCAPCKRCYAQSDPVFDPRKSRSFASIACRSPLCHRLDSPGCNTQKQT 200

Query: 232 CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLS 291
           C+YQV+YGDGSFT GD  TET++F  +  V  +ALGCGHDNEGLFVG+AGLLGLG G LS
Sbjct: 201 CMYQVSYGDGSFTFGDFSTETLTFRRT-RVARVALGCGHDNEGLFVGAAGLLGLGRGRLS 259

Query: 292 LTKQIKAT---SLAYCLVDRDS---PASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGL 345
              Q         +YCLVDR +   P+S V   +SA    A   PL+ N K+DTFYYV L
Sbjct: 260 FPSQTGRRFNHKFSYCLVDRSASSKPSSMVFG-DSAVSRTARFTPLVSNPKLDTFYYVEL 318

Query: 346 TGFSVGGQAVQ-IPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKP 404
            G SVGG  V  I  SLF++D+ G+GG+I+D GT++TRL   AY + RD+F   A NLK 
Sbjct: 319 LGISVGGTRVPGITASLFKLDQTGNGGVIIDSGTSVTRLTRPAYIAFRDAFRAGASNLKR 378

Query: 405 TSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSS 464
               +LFDTC+D SG   V+VPTV LHF  G  + LPA NYLIPVD++G FC AFA T  
Sbjct: 379 APQFSLFDTCFDLSGKTEVKVPTVVLHF-RGADVSLPASNYLIPVDTSGNFCLAFAGTMG 437

Query: 465 ALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            LSIIGN+QQQG RV +DLA +RVGF P+ C
Sbjct: 438 GLSIIGNIQQQGFRVVYDLAGSRVGFAPHGC 468


>gi|148906646|gb|ABR16474.1| unknown [Picea sitchensis]
          Length = 538

 Score =  379 bits (972), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 236/504 (46%), Positives = 307/504 (60%), Gaps = 26/504 (5%)

Query: 6   PFVLFTITTILFSFCLFTSASSRGLS--ETAT-TVLDVSSALQQTEHILSFEPETLEPFA 62
           P V ++    +    L  SA SR +S  E A    LD+++ L +T+      P       
Sbjct: 47  PLVPYSFLLCIQLLLLLQSAHSRPISAPEPANYHTLDIAAWLIETKT----APAPGRDEY 102

Query: 63  EESETAAESFPLNSSSSFSLPLHSREILHKTRHN---DYRSLVLSRLERDSARVNTLITK 119
           E+ ET     P +        +H   +L K   N    Y   +   L RD+ RV  L  +
Sbjct: 103 EKRETKPRQTPWSVQV-----VHRDSLLVKDAANATASYERRLEETLRRDARRVRGLEQR 157

Query: 120 LQLAI-YNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDT 178
           ++  +  N D        A++  E F   VVSG +QGSGEYF+RIGVGTP R+  MVLDT
Sbjct: 158 IEKRLRLNKDPAGSHENVAEVAAE-FGGEVVSGMAQGSGEYFTRIGVGTPMREQYMVLDT 216

Query: 179 GSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAY 238
           GSD+ W+QC PC++CY Q DPIF+P  S+S+S L C +  C  LD   C    CLY+V+Y
Sbjct: 217 GSDVVWIQCEPCSKCYSQVDPIFNPSLSASFSTLGCNSAVCSYLDAYNCHGGGCLYKVSY 276

Query: 239 GDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQI-- 296
           GDGS+T+G   TE ++FG + SV+ +A+GCGHDN GLFVG+AGLLGLG G+LS   Q+  
Sbjct: 277 GDGSYTIGSFATEMLTFGTT-SVRNVAIGCGHDNAGLFVGAAGLLGLGAGLLSFPSQLGT 335

Query: 297 -KATSLAYCLVDRDSPASGVLEF--NSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQ 353
               + +YCLVDR S +SG LEF   S   G  +T PL+ N  + TFYYV L   SVGG 
Sbjct: 336 QTGRAFSYCLVDRFSESSGTLEFGPESVPLGSILT-PLLTNPSLPTFYYVPLISISVGGA 394

Query: 354 AVQ-IPPSLFEMDE-AGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALF 411
            +  +PP +F +DE +G GG IVD GTA+TRLQT  Y+++RD+FV     L    GV++F
Sbjct: 395 LLDSVPPDVFRIDETSGRGGFIVDSGTAVTRLQTPVYDAVRDAFVAGTRQLPKAEGVSIF 454

Query: 412 DTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGN 471
           DTCYD SGL  V VPTV  HF  G +L LPAKNY+IP+D  GTFCFAFAP +S LSI+GN
Sbjct: 455 DTCYDLSGLPLVNVPTVVFHFSNGASLILPAKNYMIPMDFMGTFCFAFAPATSDLSIMGN 514

Query: 472 VQQQGTRVSFDLANNRVGFTPNKC 495
           +QQQG RVSFD AN+ VGF   +C
Sbjct: 515 IQQQGIRVSFDTANSLVGFALRQC 538


>gi|449458736|ref|XP_004147103.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
 gi|449518669|ref|XP_004166359.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
          Length = 482

 Score =  379 bits (972), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 212/458 (46%), Positives = 285/458 (62%), Gaps = 31/458 (6%)

Query: 46  QTEHILSFEPETLEPFAEESETAAESFPLNSSSSFSLPLHSREILHKTRHNDYRSLVLSR 105
           Q  H L+F     +    +S+    +F LN        LH  ++ H   H   R     R
Sbjct: 48  QILHALNFSDGHRQVSGYKSDN--NTFKLNL-------LHRDKLSHVHGH---RRGFNDR 95

Query: 106 LERDSARVNTLITKLQLAIYNVDRHELKPA--EAQILPEDFSTPVVSGASQGSGEYFSRI 163
           ++RD+ RV TL+ +L         H    A  +++    +F+T V+SG   GSGEYF RI
Sbjct: 96  MKRDAIRVATLVRRLS--------HGAPAAVKDSRYKVANFATDVISGMEAGSGEYFVRI 147

Query: 164 GVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLD 223
           GVG+PPR   MV+D+GSDI W+QC+PC+ CYQQSDP+FDP  SSS++ + C +  C  L+
Sbjct: 148 GVGSPPRNQYMVIDSGSDIVWVQCKPCSRCYQQSDPVFDPADSSSFAGVSCGSDVCDRLE 207

Query: 224 VSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLL 283
            + C A RC Y+V+YGDGS+T G L  ET++ G    ++ +A+GCGH N+G+F+G+AGLL
Sbjct: 208 NTGCNAGRCRYEVSYGDGSYTKGTLALETLTVGQV-MIRDVAIGCGHTNQGMFIGAAGLL 266

Query: 284 GLGGGMLSLTKQIKATS---LAYCLVDRDSPASGVLEFNSARGGDAVTA---PLIRNKKV 337
           GLGGG +S   Q+   +    +YCLV R + ++G LEF   RG   V A    LIRN + 
Sbjct: 267 GLGGGSMSFIGQLGGQTGGAFSYCLVSRGTGSTGALEF--GRGALPVGATWISLIRNPRA 324

Query: 338 DTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVR 397
            +FYY+GL G  VGG  V +P   F++ E G  G+++D GTA+TR  T AY + RDSF  
Sbjct: 325 PSFYYIGLAGIGVGGVRVSVPEETFQLTEYGTNGVVMDTGTAVTRFPTAAYVAFRDSFTA 384

Query: 398 LAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCF 457
              NL    GV++FDTCYD +G  SVRVPTVS +F  G  L LPA+N+LIPVD  GTFC 
Sbjct: 385 QTSNLPRAPGVSIFDTCYDLNGFESVRVPTVSFYFSDGPVLTLPARNFLIPVDGGGTFCL 444

Query: 458 AFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           AFAP+ S LSIIGN+QQ+G ++SFD AN  VGF PN C
Sbjct: 445 AFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNIC 482


>gi|449432044|ref|XP_004133810.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 471

 Score =  377 bits (968), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 214/398 (53%), Positives = 268/398 (67%), Gaps = 18/398 (4%)

Query: 105 RLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIG 164
           RL+RD+ RV  L      ++    R+  KP         FS+ V+SG +QGSGEYF+RIG
Sbjct: 84  RLQRDAIRVKKLS-----SLGATSRNLSKPGGT----TGFSSSVISGLAQGSGEYFTRIG 134

Query: 165 VGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDV 224
           VGTPP+   MVLDTGSDI WLQC PC  CY Q+DP+F+P  S S++ + C  P C+ L+ 
Sbjct: 135 VGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKSGSFAKVLCRTPLCRRLES 194

Query: 225 SACRANR-CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLL 283
             C   + CLYQV+YGDGS+T G+ VTET++F  +  V+ +ALGCGHDNEGLFVG+AGLL
Sbjct: 195 PGCNQRQTCLYQVSYGDGSYTTGEFVTETLTFRRT-KVEQVALGCGHDNEGLFVGAAGLL 253

Query: 284 GLGGGMLSLTKQIKAT---SLAYCLVDR--DSPASGVLEFNSARGGDAVTAPLIRNKKVD 338
           GLG G LS   Q   T     +YCLVDR   S  S V+  NSA    A   PL+ N ++D
Sbjct: 254 GLGRGGLSFPSQAGRTFNQKFSYCLVDRSASSKPSSVVFGNSAVSRTARFTPLLTNPRLD 313

Query: 339 TFYYVGLTGFSVGGQAVQ-IPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVR 397
           TFYYV L G SVGG  V  I  S F++D  G+GG+I+DCGT++TRL   AY +LRD+F  
Sbjct: 314 TFYYVELLGISVGGTPVSGITASHFKLDRTGNGGVIIDCGTSVTRLNKPAYIALRDAFRA 373

Query: 398 LAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCF 457
            A +LK     +LFDTCYD SG  +V+VPTV LHF  G  + LPA NYLIPVD +G FCF
Sbjct: 374 GASSLKSAPEFSLFDTCYDLSGKTTVKVPTVVLHF-RGADVSLPASNYLIPVDGSGRFCF 432

Query: 458 AFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           AFA T+S LSIIGN+QQQG RV +DLA++RVGF+P  C
Sbjct: 433 AFAGTTSGLSIIGNIQQQGFRVVYDLASSRVGFSPRGC 470


>gi|356522504|ref|XP_003529886.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 473

 Score =  376 bits (965), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 215/399 (53%), Positives = 266/399 (66%), Gaps = 19/399 (4%)

Query: 105 RLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIG 164
           RLERD+ARV TL T L  A      ++ +PA             +S   QGSGEYF+R+G
Sbjct: 85  RLERDAARVKTL-THLAAAT-----NKTRPANPGSGFSSSVVSGLS---QGSGEYFTRLG 135

Query: 165 VGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDV 224
           VGTPP+   MVLDTGSD+ WLQC+PCT+CY Q+D IFDP  S S++ +PC +P C+ LD 
Sbjct: 136 VGTPPKYLYMVLDTGSDVVWLQCKPCTKCYSQTDQIFDPSKSKSFAGIPCYSPLCRRLDS 195

Query: 225 SAC--RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGL 282
             C  + N C YQV+YGDGSFT GD  TET++F    +V  +A+GCGHDNEGLFVG+AGL
Sbjct: 196 PGCSLKNNLCQYQVSYGDGSFTFGDFSTETLTF-RRAAVPRVAIGCGHDNEGLFVGAAGL 254

Query: 283 LGLGGGMLSLTKQIKA---TSLAYCLVDRDSPA--SGVLEFNSARGGDAVTAPLIRNKKV 337
           LGLG G LS   Q         +YCL DR + A  S ++  +SA    A   PL++N K+
Sbjct: 255 LGLGRGGLSFPTQTGTRFNNKFSYCLTDRTASAKPSSIVFGDSAVSRTARFTPLVKNPKL 314

Query: 338 DTFYYVGLTGFSVGGQAVQ-IPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFV 396
           DTFYYV L G SVGG  V+ I  S F +D  G+GG+I+D GT++TRL   AY SLRD+F 
Sbjct: 315 DTFYYVELLGISVGGAPVRGISASFFRLDSTGNGGVIIDSGTSVTRLTRPAYVSLRDAFR 374

Query: 397 RLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFC 456
             A +LK     +LFDTCYD SGL  V+VPTV LHF  G  + LPA NYL+PVD++G+FC
Sbjct: 375 VGASHLKRAPEFSLFDTCYDLSGLSEVKVPTVVLHF-RGADVSLPAANYLVPVDNSGSFC 433

Query: 457 FAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           FAFA T S LSIIGN+QQQG RV FDLA +RVGF P  C
Sbjct: 434 FAFAGTMSGLSIIGNIQQQGFRVVFDLAGSRVGFAPRGC 472


>gi|225470916|ref|XP_002263964.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
 gi|147788999|emb|CAN64659.1| hypothetical protein VITISV_009613 [Vitis vinifera]
          Length = 489

 Score =  376 bits (965), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 222/440 (50%), Positives = 282/440 (64%), Gaps = 16/440 (3%)

Query: 64  ESETAAESFPLNSSS-SFSLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQL 122
           E+ET   + P++ +  + ++ L  R++L      +  +L   RL+RD+ RV  L      
Sbjct: 57  ETETQISTLPVSETDPTMTMHLEHRDVLAFNATPE--ALFNLRLQRDAFRVEALSKMAAA 114

Query: 123 AIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDI 182
           A           A+       FS+ V SG +QGSGEYF+R+GVGTPP+   MVLDTGSD+
Sbjct: 115 AGGRRAGRNGTHAQGG----GFSSSVTSGLAQGSGEYFTRLGVGTPPKYVYMVLDTGSDV 170

Query: 183 NWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANR-CLYQVAYGDG 241
            W+QC PC +CY Q+DP+FDPK S S+S + C +P C  LD   C + + CLYQVAYGDG
Sbjct: 171 VWIQCAPCRKCYSQTDPVFDPKKSGSFSSISCRSPLCLRLDSPGCNSRQSCLYQVAYGDG 230

Query: 242 SFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIK---A 298
           SFT G+  TET++F  +  V  +ALGCGHDNEGLFVG+AGLLGLG G LS   Q      
Sbjct: 231 SFTFGEFSTETLTFRGT-RVPKVALGCGHDNEGLFVGAAGLLGLGRGRLSFPTQTGLRFG 289

Query: 299 TSLAYCLVDR--DSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQ 356
              +YCLVDR   S  S V+   SA    AV  PLI N K+DTFYY+ LTG SVGG  V 
Sbjct: 290 RKFSYCLVDRSASSKPSSVVFGQSAVSRTAVFTPLITNPKLDTFYYLELTGISVGGARVA 349

Query: 357 -IPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCY 415
            I  SLF++D AG+GG+I+D GT++TRL  +AY SLRD+F   A +LK     +LFDTC+
Sbjct: 350 GITASLFKLDTAGNGGVIIDSGTSVTRLTRRAYVSLRDAFRAGAADLKRAPDYSLFDTCF 409

Query: 416 DFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQ 475
           D SG   V+VPTV +HF  G  + LPA NYLIPVD+ G FCFAFA T S LSIIGN+QQQ
Sbjct: 410 DLSGKTEVKVPTVVMHF-RGADVSLPATNYLIPVDTNGVFCFAFAGTMSGLSIIGNIQQQ 468

Query: 476 GTRVSFDLANNRVGFTPNKC 495
           G RV FD+A +R+GF    C
Sbjct: 469 GFRVVFDVAASRIGFAARGC 488


>gi|224080963|ref|XP_002306246.1| predicted protein [Populus trichocarpa]
 gi|222855695|gb|EEE93242.1| predicted protein [Populus trichocarpa]
          Length = 382

 Score =  375 bits (963), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 198/394 (50%), Positives = 265/394 (67%), Gaps = 16/394 (4%)

Query: 106 LERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGV 165
           ++RD  RV +LI ++              + A    EDF + VVSG  QGSGEYF RIGV
Sbjct: 1   MQRDVKRVVSLIRRVS-----------SGSTASYGVEDFGSEVVSGMDQGSGEYFVRIGV 49

Query: 166 GTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVS 225
           G+PPR   MV+D+GSDI W+QC+PCT+CY Q+DP+FDP  S+S+  + C++  C  +D +
Sbjct: 50  GSPPRSQYMVIDSGSDIVWVQCKPCTQCYHQTDPLFDPADSASFMGVSCSSAVCDQVDNA 109

Query: 226 ACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGL 285
            C + RC Y+V+YGDGS T G L  ET++ G +  V+ +A+GCGH N+G+FVG+AGLLGL
Sbjct: 110 GCNSGRCRYEVSYGDGSSTKGTLALETLTLGRT-VVQNVAIGCGHMNQGMFVGAAGLLGL 168

Query: 286 GGGMLSLTKQI---KATSLAYCLVDRDSPASGVLEFNS-ARGGDAVTAPLIRNKKVDTFY 341
           GGG +S   Q+   +  + +YCLV R + ++G LEF S A    A   PLIRN    ++Y
Sbjct: 169 GGGSMSFVGQLSRERGNAFSYCLVSRVTNSNGFLEFGSEAMPVGAAWIPLIRNPHSPSYY 228

Query: 342 YVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGN 401
           Y+GL+G  VG   V I   +FE+ E G+GG+++D GTA+TR  T AY + RD+F+   GN
Sbjct: 229 YIGLSGLGVGDMKVPISEDIFELTELGNGGVVMDTGTAVTRFPTVAYEAFRDAFIDQTGN 288

Query: 402 LKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAP 461
           L   SGV++FDTCY+  G  SVRVPTVS +F  G  L LPA N+LIPVD AGTFCFAFAP
Sbjct: 289 LPRASGVSIFDTCYNLFGFLSVRVPTVSFYFSGGPILTLPANNFLIPVDDAGTFCFAFAP 348

Query: 462 TSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           + S LSI+GN+QQ+G ++S D AN  VGF PN C
Sbjct: 349 SPSGLSILGNIQQEGIQISVDGANEFVGFGPNVC 382


>gi|224067990|ref|XP_002302634.1| predicted protein [Populus trichocarpa]
 gi|222844360|gb|EEE81907.1| predicted protein [Populus trichocarpa]
          Length = 490

 Score =  373 bits (957), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 213/405 (52%), Positives = 272/405 (67%), Gaps = 18/405 (4%)

Query: 99  RSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGE 158
           + L  SRL RD++RV +L T L  A+ + +R   +          FS+ V SG +QGSGE
Sbjct: 95  QDLFNSRLARDASRVKSL-TSLAAAVGSTNRTRARG-------PGFSSSVTSGLAQGSGE 146

Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQ 218
           YF+R+GVGTP R   MVLDTGSD+ W+QC PC +CY Q+DP+F+P  S S++ +PC +P 
Sbjct: 147 YFTRLGVGTPARYVFMVLDTGSDVVWIQCAPCKKCYSQTDPVFNPTKSRSFANIPCGSPL 206

Query: 219 CKSLDVSACRANR--CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLF 276
           C+ LD   C   +  CLYQV+YGDGSFT G+  TET++F  +  V  +ALGCGHDNEGLF
Sbjct: 207 CRRLDSPGCSTKKHICLYQVSYGDGSFTYGEFSTETLTFRGT-RVGRVALGCGHDNEGLF 265

Query: 277 VGSAGLLGLGGGMLSLTKQIK---ATSLAYCLVDR--DSPASGVLEFNSARGGDAVTAPL 331
           +G+AGLLGLG G LS   QI    +   +YCLVDR   S  S ++  +SA    A   PL
Sbjct: 266 IGAAGLLGLGRGRLSFPSQIGRRFSRKFSYCLVDRSASSKPSYMVFGDSAISRTARFTPL 325

Query: 332 IRNKKVDTFYYVGLTGFSVGGQAVQ-IPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNS 390
           + N K+DTFYYV L G SVGG  V  I  SLF++D  G+GG+I+D GT++TRL   AY +
Sbjct: 326 VSNPKLDTFYYVELLGVSVGGTRVPGITASLFKLDSTGNGGVIIDSGTSVTRLTRPAYVA 385

Query: 391 LRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVD 450
           LRD+F   A NLK     +LFDTC+D SG   V+VPTV LHF  G  + LPA NYLIPVD
Sbjct: 386 LRDAFRVGASNLKRAPEFSLFDTCFDLSGKTEVKVPTVVLHF-RGADVSLPASNYLIPVD 444

Query: 451 SAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           ++G+FCFAFA T S LSI+GN+QQQG RV +DLA +RVGF P  C
Sbjct: 445 NSGSFCFAFAGTMSGLSIVGNIQQQGFRVVYDLAASRVGFAPRGC 489


>gi|356573235|ref|XP_003554768.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 472

 Score =  370 bits (951), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 221/423 (52%), Positives = 279/423 (65%), Gaps = 20/423 (4%)

Query: 81  SLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQIL 140
           +L LH   I   + +     L   RL+RD+ RV  ++    LA  N        + A+  
Sbjct: 61  ALSLHLHHIDALSSNKTPEQLFQLRLQRDAKRVEGVVA---LAALN-------QSHARRS 110

Query: 141 PEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPI 200
              FS+ ++SG +QGSGEYF+RIGVGTP R   MVLDTGSD+ WLQC PC +CY Q+DP+
Sbjct: 111 GSSFSSSIISGLAQGSGEYFTRIGVGTPARYVYMVLDTGSDVVWLQCAPCRKCYTQADPV 170

Query: 201 FDPKTSSSYSPLPCAAPQCKSLDVSAC--RANRCLYQVAYGDGSFTVGDLVTETVSFGNS 258
           FDP  S +Y+ +PC AP C+ LD   C  +   C YQV+YGDGSFT GD  TET++F  +
Sbjct: 171 FDPTKSRTYAGIPCGAPLCRRLDSPGCNNKNKVCQYQVSYGDGSFTFGDFSTETLTFRRT 230

Query: 259 GSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKA---TSLAYCLVDRDSPA--S 313
             V  +ALGCGHDNEGLF+G+AGLLGLG G LS   Q         +YCLVDR + A  S
Sbjct: 231 -RVTRVALGCGHDNEGLFIGAAGLLGLGRGRLSFPVQTGRRFNQKFSYCLVDRSASAKPS 289

Query: 314 GVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQ-IPPSLFEMDEAGDGGI 372
            V+  +SA    A   PLI+N K+DTFYY+ L G SVGG  V+ +  SLF +D AG+GG+
Sbjct: 290 SVVFGDSAVSRTARFTPLIKNPKLDTFYYLELLGISVGGSPVRGLSASLFRLDAAGNGGV 349

Query: 373 IVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHF 432
           I+D GT++TRL   AY +LRD+F   A +LK  +  +LFDTC+D SGL  V+VPTV LHF
Sbjct: 350 IIDSGTSVTRLTRPAYIALRDAFRVGASHLKRAAEFSLFDTCFDLSGLTEVKVPTVVLHF 409

Query: 433 GAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTP 492
             G  + LPA NYLIPVD++G+FCFAFA T S LSIIGN+QQQG RVSFDLA +RVGF P
Sbjct: 410 -RGADVSLPATNYLIPVDNSGSFCFAFAGTMSGLSIIGNIQQQGFRVSFDLAGSRVGFAP 468

Query: 493 NKC 495
             C
Sbjct: 469 RGC 471


>gi|449529638|ref|XP_004171805.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like, partial
           [Cucumis sativus]
          Length = 384

 Score =  370 bits (950), Expect = e-99,   Method: Compositional matrix adjust.
 Identities = 204/359 (56%), Positives = 253/359 (70%), Gaps = 9/359 (2%)

Query: 144 FSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDP 203
           FS+ V+SG +QGSGEYF+RIGVGTPP+   MVLDTGSDI WLQC PC  CY Q+DP+F+P
Sbjct: 27  FSSSVISGLAQGSGEYFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNP 86

Query: 204 KTSSSYSPLPCAAPQCKSLDVSACRANR-CLYQVAYGDGSFTVGDLVTETVSFGNSGSVK 262
             S S++ + C  P C+ L+   C   + CLYQV+YGDGS+T G+ VTET++F  +  V+
Sbjct: 87  VKSGSFAKVLCRTPLCRRLESPGCNQRQTCLYQVSYGDGSYTTGEFVTETLTFRRT-KVE 145

Query: 263 GIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDR--DSPASGVLE 317
            +ALGCGHDNEGLFVG+AGLLGLG G LS   Q   T     +YCLVDR   S  S V+ 
Sbjct: 146 QVALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQAGRTFNQKFSYCLVDRSASSKPSSVVF 205

Query: 318 FNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQ-IPPSLFEMDEAGDGGIIVDC 376
            NSA    A   PL+ N ++DTFYYV L G SVGG  V  I  S F++D  G+GG+I+DC
Sbjct: 206 GNSAVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVSGITASHFKLDRTGNGGVIIDC 265

Query: 377 GTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGK 436
           GT++TRL   AY +LRD+F   A +LK     +LFDTCYD SG  +V+VPTV LHF  G 
Sbjct: 266 GTSVTRLNKPAYIALRDAFRAGASSLKSAPEFSLFDTCYDLSGKTTVKVPTVVLHF-RGA 324

Query: 437 ALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            + LPA NYLIPVD +G FCFAFA T+S LSIIGN+QQQG RV +DLA++RVGF+P  C
Sbjct: 325 DVSLPASNYLIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLASSRVGFSPRGC 383


>gi|224093400|ref|XP_002309912.1| predicted protein [Populus trichocarpa]
 gi|222852815|gb|EEE90362.1| predicted protein [Populus trichocarpa]
          Length = 382

 Score =  369 bits (948), Expect = 2e-99,   Method: Compositional matrix adjust.
 Identities = 192/394 (48%), Positives = 263/394 (66%), Gaps = 16/394 (4%)

Query: 106 LERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGV 165
           + RD  RV +LI +L              + A+   EDF + VVSG +QGSGEYF RIG+
Sbjct: 1   MHRDVKRVASLIHRLS-----------SGSAAKYEVEDFGSDVVSGMNQGSGEYFVRIGL 49

Query: 166 GTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVS 225
           G+PPR   MV+D+GSDI W+QC+PCT+CY Q+DP+FDP  S+S+  + C++  C  ++ +
Sbjct: 50  GSPPRSQYMVIDSGSDIVWVQCKPCTQCYHQTDPLFDPADSASFMGVSCSSAVCDRVENA 109

Query: 226 ACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGL 285
            C + RC Y+V+YGDGS+T G L  ET++FG +  V+ +A+GCGH N G+FVG+AGLLGL
Sbjct: 110 GCNSGRCRYEVSYGDGSYTKGTLALETLTFGRT-VVRNVAIGCGHSNRGMFVGAAGLLGL 168

Query: 286 GGGMLSLTKQIKA---TSLAYCLVDRDSPASGVLEFNS-ARGGDAVTAPLIRNKKVDTFY 341
           GGG +S   Q+      + +YCLV R +  +G LEF S A    A   PL+RN +  +FY
Sbjct: 169 GGGSMSFMGQLSGQTGNAFSYCLVSRGTNTNGFLEFGSEAMPVGAAWIPLVRNPRAPSFY 228

Query: 342 YVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGN 401
           Y+ L G  VG   V +   +F+++E G GG+++D GTA+TR  T AY + R++F+    N
Sbjct: 229 YIRLLGLGVGDTRVPVSEDVFQLNELGSGGVVMDTGTAVTRFPTVAYEAFRNAFIEQTQN 288

Query: 402 LKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAP 461
           L   SGV++FDTCY+  G  SVRVPTVS +F  G  L +PA N+LIPVD AGTFCFAFAP
Sbjct: 289 LPRASGVSIFDTCYNLFGFLSVRVPTVSFYFSGGPILTIPANNFLIPVDDAGTFCFAFAP 348

Query: 462 TSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           + S LSI+GN+QQ+G ++S D AN  VGF PN C
Sbjct: 349 SPSGLSILGNIQQEGIQISVDEANEFVGFGPNIC 382


>gi|302763741|ref|XP_002965292.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
 gi|300167525|gb|EFJ34130.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
          Length = 423

 Score =  369 bits (947), Expect = 2e-99,   Method: Compositional matrix adjust.
 Identities = 200/404 (49%), Positives = 268/404 (66%), Gaps = 10/404 (2%)

Query: 101 LVLSRLERDSARVNTLITKLQLAIYNVDRHEL-KPAEAQ--ILPEDFSTPVVSGASQGSG 157
           LV +RL RD  R+ ++ +++ L +  + +  L  P +     L +DF TP+ SG S GSG
Sbjct: 20  LVRNRLHRDELRLLSISSRISLGVAGIPKSSLTNPLKNTNPFLQQDFETPLRSGLSDGSG 79

Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAP 217
           EYF  +GVGTPPR  +MV DTGSD+ WLQC PC  CY Q+DP+F+P  SS++  + C + 
Sbjct: 80  EYFVSLGVGTPPRTVNMVADTGSDVLWLQCLPCQSCYGQTDPLFNPSFSSTFQSITCGSS 139

Query: 218 QCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFV 277
            C+ L +  CR N+CLYQV+YGDGSFTVG+  TET+SFG S +V  +A+GCGH+N+GLF 
Sbjct: 140 LCQQLLIRGCRRNQCLYQVSYGDGSFTVGEFSTETLSFG-SNAVNSVAIGCGHNNQGLFT 198

Query: 278 GSAGLLGLGGGMLSLTKQIK---ATSLAYCLVDRDSPASGVLEF-NSARGGDAVTAPLIR 333
           G+AGLLGLG G+LS   Q+     +  +YCL  R+S  S  L F N A   +A    L+ 
Sbjct: 199 GAAGLLGLGKGLLSFPSQVGQLYGSVFSYCLPTRESTGSVPLIFGNQAVASNAQFTTLLT 258

Query: 334 NKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEA-GDGGIIVDCGTAITRLQTQAYNSLR 392
           N K+DTFYYV + G  VGG +V IP     +D + G+GG+I+D GTA+TRL T AYN +R
Sbjct: 259 NPKLDTFYYVEMVGIKVGGTSVNIPAGSLSLDSSTGNGGVILDSGTAVTRLVTSAYNPMR 318

Query: 393 DSF-VRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDS 451
           D+F   +  + K TSG +LFDTCYD SG  S+ +P VS  F  G  + LPA+N ++PVD+
Sbjct: 319 DAFRAGMPSDAKMTSGFSLFDTCYDLSGRSSIMLPAVSFVFNGGATMALPAQNIMVPVDN 378

Query: 452 AGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           +GT+C AFAP S   SIIGN+QQQ  R+SFD   NRVG   N+C
Sbjct: 379 SGTYCLAFAPNSENFSIIGNIQQQSFRMSFDSTGNRVGIGANQC 422


>gi|302809855|ref|XP_002986620.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
 gi|300145803|gb|EFJ12477.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
          Length = 423

 Score =  369 bits (946), Expect = 2e-99,   Method: Compositional matrix adjust.
 Identities = 200/404 (49%), Positives = 268/404 (66%), Gaps = 10/404 (2%)

Query: 101 LVLSRLERDSARVNTLITKLQLAIYNVDRHEL-KPAEAQ--ILPEDFSTPVVSGASQGSG 157
           LV +RL RD  R+ ++ +++ L +  + +  L  P +     L +DF TP+ SG S GSG
Sbjct: 20  LVRNRLHRDELRLLSISSRISLGVAGIPKSSLTNPLKNTNPFLQQDFETPLRSGLSDGSG 79

Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAP 217
           EYF  +GVGTPPR  +MV DTGSD+ WLQC PC  CY Q+DP+F+P  SS++  + C + 
Sbjct: 80  EYFVSLGVGTPPRTVNMVADTGSDVLWLQCLPCQSCYGQTDPLFNPSFSSTFQSITCGSS 139

Query: 218 QCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFV 277
            C+ L +  CR N+CLYQV+YGDGSFTVG+  TET+SFG S +V  +A+GCGH+N+GLF 
Sbjct: 140 LCQQLLIRGCRRNQCLYQVSYGDGSFTVGEFSTETLSFG-SNAVNSVAIGCGHNNQGLFT 198

Query: 278 GSAGLLGLGGGMLSLTKQIK---ATSLAYCLVDRDSPASGVLEF-NSARGGDAVTAPLIR 333
           G+AGLLGLG G+LS   Q+     +  +YCL  R+S  S  L F N A   +A    L+ 
Sbjct: 199 GAAGLLGLGKGLLSFPSQVGQLYGSVFSYCLPTRESTGSVPLIFGNQAVASNAQFTTLLT 258

Query: 334 NKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEA-GDGGIIVDCGTAITRLQTQAYNSLR 392
           N K+DTFYYV + G  VGG +V IP     +D + G+GG+I+D GTA+TRL T AYN +R
Sbjct: 259 NPKLDTFYYVEMVGIKVGGTSVSIPAGSLSLDSSTGNGGVILDSGTAVTRLVTSAYNPMR 318

Query: 393 DSF-VRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDS 451
           D+F   +  + K TSG +LFDTCYD SG  S+ +P VS  F  G  + LPA+N ++PVD+
Sbjct: 319 DAFRAGMPSDAKMTSGFSLFDTCYDLSGRSSIMLPAVSFVFNGGATMALPAQNIMVPVDN 378

Query: 452 AGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           +GT+C AFAP S   SIIGN+QQQ  R+SFD   NRVG   N+C
Sbjct: 379 SGTYCLAFAPNSENFSIIGNIQQQSFRMSFDSTGNRVGIGANQC 422


>gi|414584783|tpg|DAA35354.1| TPA: hypothetical protein ZEAMMB73_186928 [Zea mays]
          Length = 464

 Score =  368 bits (945), Expect = 3e-99,   Method: Compositional matrix adjust.
 Identities = 192/422 (45%), Positives = 268/422 (63%), Gaps = 24/422 (5%)

Query: 81  SLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQIL 140
           S  L  R+ +  + +   R  VL  + RD+AR   L ++L  A Y               
Sbjct: 60  SFALVRRDAVTGSTYPSRRHAVLDLVARDNARAEYLASRLSPAAYQ-------------- 105

Query: 141 PEDFS---TPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQS 197
           P  FS   + VVSG  +GSGEYF R+G+G+PP +  +V+D+GSD+ W+QC+PC ECY Q+
Sbjct: 106 PTGFSGSESKVVSGLDEGSGEYFVRVGIGSPPTEQYLVVDSGSDVIWVQCKPCLECYAQA 165

Query: 198 DPIFDPKTSSSYSPLPCAAPQCKSLDVSAC-RANRCLYQVAYGDGSFTVGDLVTETVSFG 256
           DP+FDP TS+++S +PC +  C++L  S C  +  C Y+V+YGDGS+T G L  ET++ G
Sbjct: 166 DPLFDPATSATFSAVPCGSAVCRTLRTSGCGDSGGCDYEVSYGDGSYTKGALALETLTLG 225

Query: 257 NSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS---LAYCLVDRDSPAS 313
            + +V+G+A+GCGH N GLFVG+AGLLGLG G +SL  Q+   +    +YCL  R +  S
Sbjct: 226 GT-AVEGVAIGCGHRNRGLFVGAAGLLGLGWGPMSLVGQLGGAAGGAFSYCLASRGA-GS 283

Query: 314 GVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGII 373
            VL  + A    AV  PL+RN +  +FYYVGL+G  VG + + +   LF++ E G GG++
Sbjct: 284 LVLGRSEAVPEGAVWVPLVRNPQAPSFYYVGLSGIGVGDERLPLQEDLFQLTEDGAGGVV 343

Query: 374 VDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFG 433
           +D GTA+TRL  +AY +LRD+FV   G L    GV+L DTCYD SG  SVRVPTVS +F 
Sbjct: 344 MDTGTAVTRLPQEAYAALRDAFVAAVGALPRAPGVSLLDTCYDLSGYTSVRVPTVSFYFD 403

Query: 434 AGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPN 493
               L LPA+N L+ VD  G +C AFAP+SS  SI+GN+QQ+G +++ D AN  +GF P 
Sbjct: 404 GAATLTLPARNLLLEVD-GGIYCLAFAPSSSGPSILGNIQQEGIQITVDSANGYIGFGPT 462

Query: 494 KC 495
            C
Sbjct: 463 TC 464


>gi|125592062|gb|EAZ32412.1| hypothetical protein OsJ_16623 [Oryza sativa Japonica Group]
          Length = 473

 Score =  367 bits (942), Expect = 8e-99,   Method: Compositional matrix adjust.
 Identities = 194/424 (45%), Positives = 264/424 (62%), Gaps = 23/424 (5%)

Query: 81  SLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQIL 140
           SL L  R+ +    +   R  V+  + RD+ARV  L               L  + +  L
Sbjct: 64  SLSLVHRDAISGATYPSRRHQVVGLVARDNARVEHL------------EKRLVASTSPYL 111

Query: 141 PEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPI 200
           PED  + VV G   GSGEYF R+GVG+PP    +V+D+GSD+ W+QCRPC +CY Q+DP+
Sbjct: 112 PEDLVSEVVPGVDDGSGEYFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQCYAQTDPL 171

Query: 201 FDPKTSSSYSPLPCAAPQCKSLDVSACRAN----RCLYQVAYGDGSFTVGDLVTETVSFG 256
           FDP  SSS+S + C +  C++L  + C       +C Y V YGDGS+T G+L  ET++ G
Sbjct: 172 FDPAASSSFSGVSCGSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTLG 231

Query: 257 NSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS---LAYCLVDRDSPAS 313
            + +V+G+A+GCGH N GLFVG+AGLLGLG G +SL  Q+   +    +YCL  R +  +
Sbjct: 232 GT-AVQGVAIGCGHRNSGLFVGAAGLLGLGWGAMSLVGQLGGAAGGVFSYCLASRGAGGA 290

Query: 314 G--VLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGG 371
           G  VL    A    AV  PL+RN +  +FYYVGLTG  VGG+ + +  SLF++ E G GG
Sbjct: 291 GSLVLGRTEAVPVGAVWVPLVRNNQASSFYYVGLTGIGVGGERLPLQDSLFQLTEDGAGG 350

Query: 372 IIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLH 431
           +++D GTA+TRL  +AY +LR +F    G L  +  V+L DTCYD SG  SVRVPTVS +
Sbjct: 351 VVMDTGTAVTRLPREAYAALRGAFDGAMGALPRSPAVSLLDTCYDLSGYASVRVPTVSFY 410

Query: 432 FGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFT 491
           F  G  L LPA+N L+ V  A  FC AFAP+SS +SI+GN+QQ+G +++ D AN  VGF 
Sbjct: 411 FDQGAVLTLPARNLLVEVGGA-VFCLAFAPSSSGISILGNIQQEGIQITVDSANGYVGFG 469

Query: 492 PNKC 495
           PN C
Sbjct: 470 PNTC 473


>gi|90399033|emb|CAJ86229.1| H0402C08.5 [Oryza sativa Indica Group]
 gi|125550227|gb|EAY96049.1| hypothetical protein OsI_17922 [Oryza sativa Indica Group]
          Length = 473

 Score =  365 bits (938), Expect = 2e-98,   Method: Compositional matrix adjust.
 Identities = 193/424 (45%), Positives = 263/424 (62%), Gaps = 23/424 (5%)

Query: 81  SLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQIL 140
           SL L  R+ +    +   R  V+  + RD+ARV  L               L  + +  L
Sbjct: 64  SLSLVHRDAISGATYPSRRHQVVGLVARDNARVEHL------------EKRLVASTSPYL 111

Query: 141 PEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPI 200
           PED  + VV G   GSGEYF R+GVG+PP    +V+D+GSD+ W+QCRPC +CY Q+DP+
Sbjct: 112 PEDLVSEVVPGVDDGSGEYFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQCYAQTDPL 171

Query: 201 FDPKTSSSYSPLPCAAPQCKSLDVSACRAN----RCLYQVAYGDGSFTVGDLVTETVSFG 256
           FDP  SSS+S + C +  C++L  + C       +C Y V YGDGS+T G+L  ET++ G
Sbjct: 172 FDPAASSSFSGVSCGSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTLG 231

Query: 257 NSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS---LAYCLVDRDSPAS 313
            + +V+G+A+GCGH N GLFVG+AGLLGLG G +SL  Q+   +    +YCL  R +  +
Sbjct: 232 GT-AVQGVAIGCGHRNSGLFVGAAGLLGLGWGAMSLIGQLGGAAGGVFSYCLASRGAGGA 290

Query: 314 G--VLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGG 371
           G  VL    A    AV  PL+RN +  +FYYVGLTG  VGG+ + +   LF++ E G GG
Sbjct: 291 GSLVLGRTEAVPVGAVWVPLVRNNQASSFYYVGLTGIGVGGERLPLQDGLFQLTEDGAGG 350

Query: 372 IIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLH 431
           +++D GTA+TRL  +AY +LR +F    G L  +  V+L DTCYD SG  SVRVPTVS +
Sbjct: 351 VVMDTGTAVTRLPREAYAALRGAFDGAMGALPRSPAVSLLDTCYDLSGYASVRVPTVSFY 410

Query: 432 FGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFT 491
           F  G  L LPA+N L+ V  A  FC AFAP+SS +SI+GN+QQ+G +++ D AN  VGF 
Sbjct: 411 FDQGAVLTLPARNLLVEVGGA-VFCLAFAPSSSGISILGNIQQEGIQITVDSANGYVGFG 469

Query: 492 PNKC 495
           PN C
Sbjct: 470 PNTC 473


>gi|357135412|ref|XP_003569303.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 494

 Score =  363 bits (933), Expect = 9e-98,   Method: Compositional matrix adjust.
 Identities = 202/365 (55%), Positives = 240/365 (65%), Gaps = 16/365 (4%)

Query: 147 PVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTS 206
           PVVSG +QGSGEYF++IGVGTP     MVLDTGSD+ WLQC PC  CY QS  +FDP+ S
Sbjct: 130 PVVSGLAQGSGEYFTKIGVGTPATPALMVLDTGSDVVWLQCAPCRRCYDQSGQVFDPRRS 189

Query: 207 SSYSPLPCAAPQCKSLDVSAC--RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGI 264
            SY  + C+AP C+ LD   C  R   CLYQVAYGDGS T GD  TET++F     V  I
Sbjct: 190 RSYGAVGCSAPLCRRLDSGGCDLRRKACLYQVAYGDGSVTAGDFATETLTFAGGARVARI 249

Query: 265 ALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIK---ATSLAYCLVDRDSPA-----SGVL 316
           ALGCGHDNEGLFV +AGLLGLG G LS   QI      S +YCLVDR S A     S  +
Sbjct: 250 ALGCGHDNEGLFVAAAGLLGLGRGSLSFPAQISRRYGRSFSYCLVDRTSSANPASHSSTV 309

Query: 317 EFNSARGGDAVTA---PLIRNKKVDTFYYVGLTGFSVGGQAVQ-IPPSLFEMD-EAGDGG 371
            F S   G  V A   P+++N +++TFYYV L G SVGG  V  +  S   +D  +G GG
Sbjct: 310 TFGSGAVGSTVAASFTPMVKNPRMETFYYVQLVGISVGGARVSGVADSDLRLDPSSGRGG 369

Query: 372 IIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPT-SGVALFDTCYDFSGLRSVRVPTVSL 430
           +IVD GT++TRL   AY++LRD+F   A  L+ +  G +LFDTCYD SG + V+VPTVS+
Sbjct: 370 VIVDSGTSVTRLARPAYSALRDAFRAAAAGLRLSPGGFSLFDTCYDLSGRKVVKVPTVSM 429

Query: 431 HFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGF 490
           HF  G    LP +NYLIPVDS GTFCFAFA T   +SIIGN+QQQG RV FD    RVGF
Sbjct: 430 HFAGGAEAALPPENYLIPVDSKGTFCFAFAGTDGGVSIIGNIQQQGFRVVFDGDGQRVGF 489

Query: 491 TPNKC 495
            P  C
Sbjct: 490 VPKGC 494


>gi|148908573|gb|ABR17396.1| unknown [Picea sitchensis]
          Length = 350

 Score =  363 bits (933), Expect = 9e-98,   Method: Compositional matrix adjust.
 Identities = 193/349 (55%), Positives = 245/349 (70%), Gaps = 9/349 (2%)

Query: 154 QGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLP 213
           QGSGEYF+RIG+GTP R+  MVLDTGSD+ W+QC PC ECY Q+DPIF+P +S S+S + 
Sbjct: 3   QGSGEYFTRIGIGTPTREQYMVLDTGSDVVWIQCEPCRECYSQADPIFNPSSSVSFSTVG 62

Query: 214 CAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNE 273
           C +  C  LD + C    CLY+V+YGDGS+TVG   TET++FG + S++ +A+GCGHDN 
Sbjct: 63  CDSAVCSQLDANDCHGGGCLYEVSYGDGSYTVGSYATETLTFGTT-SIQNVAIGCGHDNV 121

Query: 274 GLFVGSAGLLGLGGGMLSLTKQI---KATSLAYCLVDRDSPASGVLEF--NSARGGDAVT 328
           GLFVG+AGLLGLG G LS   Q+      + +YCLVDRDS +SG LEF   S   G   T
Sbjct: 122 GLFVGAAGLLGLGAGSLSFPAQLGTQTGRAFSYCLVDRDSESSGTLEFGPESVPIGSIFT 181

Query: 329 APLIRNKKVDTFYYVGLTGFSVGGQAVQ-IPPSLFEMDEA-GDGGIIVDCGTAITRLQTQ 386
            PL+ N  + TFYY+ +   SVGG  +  +P   F +DE  G GGII+D GTA+TRLQT 
Sbjct: 182 -PLVANPFLPTFYYLSMVAISVGGVILDSVPSEAFRIDETTGRGGIIIDSGTAVTRLQTS 240

Query: 387 AYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYL 446
           AY++LRD+F+    +L    G+++FDTCYD S L+SV +P V  HF  G    LPAKN L
Sbjct: 241 AYDALRDAFIAGTQHLPRADGISIFDTCYDLSALQSVSIPAVGFHFSNGAGFILPAKNCL 300

Query: 447 IPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           IP+DS GTFCFAFAP  S LSI+GN+QQQG RVSFD AN+ VGF  ++C
Sbjct: 301 IPMDSMGTFCFAFAPADSNLSIMGNIQQQGIRVSFDSANSLVGFAIDQC 349


>gi|449464952|ref|XP_004150193.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
 gi|449526850|ref|XP_004170426.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
          Length = 476

 Score =  362 bits (928), Expect = 3e-97,   Method: Compositional matrix adjust.
 Identities = 217/474 (45%), Positives = 298/474 (62%), Gaps = 36/474 (7%)

Query: 34  ATTVLDVSSALQQTEHILSFEPETLEPFAEESETAAESFPL------NSSSSFSLPLHSR 87
           AT +L+V   +++ E   S  P+ LE          E++P+      +S S + L L  R
Sbjct: 27  ATQLLNVKDTIKEAETAPSRLPQDLE--------LHENYPIFELDNNSSQSQWKLKLFHR 78

Query: 88  EILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTP 147
           + L      D+      R+ RDS RV++L+  L              ++ Q+   DF + 
Sbjct: 79  DKLPLNFDPDHPRRFKERISRDSKRVSSLLRLLSSG-----------SDEQV--TDFGSD 125

Query: 148 VVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSS 207
           VVSG  QGSGEYF RIGVG+PPR   +V+D+GSDI W+QC+PC+ECYQQSDP+FDP  S+
Sbjct: 126 VVSGTEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECYQQSDPVFDPAGSA 185

Query: 208 SYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALG 267
           +Y+ + C +  C  LD + C   RC Y+V+YGDGS+T G L  ET++FG    ++ IA+G
Sbjct: 186 TYAGISCDSSVCDRLDNAGCNDGRCRYEVSYGDGSYTRGTLALETLTFGRV-LIRNIAIG 244

Query: 268 CGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS---LAYCLVDRDSPASGVLEFNSARGG 324
           CGH N G+F+G+AGLLGLGGG +S   Q+   +    +YCLV R + ++G LEF   RG 
Sbjct: 245 CGHMNRGMFIGAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVSRGTESTGTLEF--GRGA 302

Query: 325 DAVTA---PLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAIT 381
             V A   PLIRN +  +FYYVGL+G  VGG  V IP  +FE+ + G GG+++D GTA+T
Sbjct: 303 MPVGAAWVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLGYGGVVMDTGTAVT 362

Query: 382 RLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLP 441
           RL   AY + RD+F+    NL  +  V++FDTCY+ +G  SVRVPTVS +F  G  L LP
Sbjct: 363 RLPAPAYEAFRDTFIGQTANLPRSDRVSIFDTCYNLNGFVSVRVPTVSFYFSGGPILTLP 422

Query: 442 AKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           A+N+LIPVD  GTFCFAFA ++S LSIIGN+QQ+G ++S D +N  VGF P  C
Sbjct: 423 ARNFLIPVDGEGTFCFAFAASASGLSIIGNIQQEGIQISIDGSNGFVGFGPTIC 476


>gi|242077672|ref|XP_002448772.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
 gi|241939955|gb|EES13100.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
          Length = 471

 Score =  361 bits (926), Expect = 5e-97,   Method: Compositional matrix adjust.
 Identities = 190/430 (44%), Positives = 266/430 (61%), Gaps = 32/430 (7%)

Query: 81  SLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQIL 140
           S  L  R+ +    +   R  VL  + RD+AR   L ++L  A                 
Sbjct: 59  SFALVRRDAVTGATYPSPRHAVLDLVSRDNARAEYLASRLSPAYQ--------------- 103

Query: 141 PEDF---STPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQS 197
           P DF    + VVSG  +GSGEYF R+G+G+PP +  +V+D+GSD+ W+QC+PC ECY Q+
Sbjct: 104 PTDFFGSESKVVSGLDEGSGEYFVRVGIGSPPTEQYLVVDSGSDVIWVQCKPCLECYAQA 163

Query: 198 DPIFDPKTSSSYSPLPCAAPQCKSLDVSAC-RANRCLYQVAYGDGSFTVGDLVTETVSFG 256
           DP+FDP +S+++S + C +  C++L  S C  +  C Y+V+YGDGS+T G L  ET++ G
Sbjct: 164 DPLFDPASSATFSAVSCGSAICRTLRTSGCGDSGGCEYEVSYGDGSYTKGTLALETLTLG 223

Query: 257 NSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS---LAYCLVDRDSPAS 313
            + +V+G+A+GCGH N GLFVG+AGLLGLG G +SL  Q+   +    +YCL  R    S
Sbjct: 224 GT-AVEGVAIGCGHRNRGLFVGAAGLLGLGWGPMSLVGQLGGAAGGAFSYCLASRGGSGS 282

Query: 314 G--------VLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMD 365
           G        VL  + A    AV  PL+RN +  +FYYVG++G  VG + + +   LF++ 
Sbjct: 283 GAADAAGSLVLGRSEAVPEGAVWVPLVRNPQAPSFYYVGVSGIGVGDERLPLQDGLFQLT 342

Query: 366 EAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRV 425
           E G GG+++D GTA+TRL  +AY +LRD+FV   G L    GV+L DTCYD SG  SVRV
Sbjct: 343 EDGGGGVVMDTGTAVTRLPQEAYAALRDAFVGAVGALPRAPGVSLLDTCYDLSGYTSVRV 402

Query: 426 PTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLAN 485
           PTVS +F     L LPA+N L+ VD  G +C AFAP+SS LSI+GN+QQ+G +++ D AN
Sbjct: 403 PTVSFYFDGAATLTLPARNLLLEVD-GGIYCLAFAPSSSGLSILGNIQQEGIQITVDSAN 461

Query: 486 NRVGFTPNKC 495
             +GF P  C
Sbjct: 462 GYIGFGPATC 471


>gi|297834938|ref|XP_002885351.1| pepsin A [Arabidopsis lyrata subsp. lyrata]
 gi|297331191|gb|EFH61610.1| pepsin A [Arabidopsis lyrata subsp. lyrata]
          Length = 471

 Score =  360 bits (925), Expect = 6e-97,   Method: Compositional matrix adjust.
 Identities = 201/448 (44%), Positives = 287/448 (64%), Gaps = 15/448 (3%)

Query: 56  ETLEPFAEESETAAE----SFPLNSSSSFSLPLHSREILHKTRHNDYRSLVLSRLERDSA 111
           + L+P    +ET  +     F  +S+S ++L L  R+      + ++   + +R+ RD+ 
Sbjct: 31  DVLQPPLTVTETLPDFNNTHFSDDSNSKYTLRLLHRDRFPSVTYRNHHHRLHARMRRDTD 90

Query: 112 RVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQ 171
           RV+ ++ ++   +       +  ++++    DF + VVSG  QGSGEYF RIGVG+PPR 
Sbjct: 91  RVSAILRRISGKVV------VASSDSRYEVNDFGSDVVSGMDQGSGEYFVRIGVGSPPRD 144

Query: 172 FSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANR 231
             MV+D+GSD+ W+QC+PC  CY+QSDP+FDP  S SY+ + C +  C  ++ S C +  
Sbjct: 145 QYMVIDSGSDMVWVQCQPCKLCYKQSDPVFDPAKSGSYTGVSCGSSVCDRIENSGCHSGG 204

Query: 232 CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLS 291
           C Y+V YGDGS+T G L  ET++F  +  V+ +A+GCGH N G+F+G+AGLLG+GGG +S
Sbjct: 205 CRYEVMYGDGSYTKGTLALETLTFAKT-VVRNVAMGCGHRNRGMFIGAAGLLGIGGGSMS 263

Query: 292 LTKQIKATS---LAYCLVDRDSPASGVLEF-NSARGGDAVTAPLIRNKKVDTFYYVGLTG 347
              Q+   +     YCLV R + ++G L F   A    A   PL+RN +  +FYYVGL G
Sbjct: 264 FVGQLSGQTGGAFGYCLVSRGTDSTGSLVFGREALPVGASWVPLVRNPRAPSFYYVGLKG 323

Query: 348 FSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSG 407
             VGG  + +P  +F++ E GDGG+++D GTA+TRL T AY + RD F     NL   SG
Sbjct: 324 LGVGGVRIPLPDGVFDLTETGDGGVVMDTGTAVTRLPTGAYAAFRDGFKSQTANLPRASG 383

Query: 408 VALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALS 467
           V++FDTCYD SG  SVRVPTVS +F  G  L LPA+N+L+PVD +GT+CFAFA + + LS
Sbjct: 384 VSIFDTCYDLSGFVSVRVPTVSFYFTEGPVLTLPARNFLMPVDDSGTYCFAFAASPTGLS 443

Query: 468 IIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           IIGN+QQ+G +VSFD AN  VGF PN C
Sbjct: 444 IIGNIQQEGIQVSFDGANGFVGFGPNVC 471


>gi|79407941|ref|NP_188636.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|75273243|sp|Q9LHE3.1|ASPG2_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 2;
           Short=AtASPG2; Flags: Precursor
 gi|11994777|dbj|BAB03167.1| nucleoid chloroplast DNA-binding protein-like [Arabidopsis
           thaliana]
 gi|28392860|gb|AAO41867.1| unknown protein [Arabidopsis thaliana]
 gi|332642798|gb|AEE76319.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 470

 Score =  360 bits (925), Expect = 8e-97,   Method: Compositional matrix adjust.
 Identities = 197/428 (46%), Positives = 278/428 (64%), Gaps = 12/428 (2%)

Query: 72  FPLNSSSSFSLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHE 131
           F   SSS ++L L  R+      + ++   + +R+ RD+ RV+ ++ ++   +       
Sbjct: 51  FSDESSSKYTLRLLHRDRFPSVTYRNHHHRLHARMRRDTDRVSAILRRISGKV------- 103

Query: 132 LKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCT 191
           +  ++++    DF + +VSG  QGSGEYF RIGVG+PPR   MV+D+GSD+ W+QC+PC 
Sbjct: 104 IPSSDSRYEVNDFGSDIVSGMDQGSGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPCK 163

Query: 192 ECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTE 251
            CY+QSDP+FDP  S SY+ + C +  C  ++ S C +  C Y+V YGDGS+T G L  E
Sbjct: 164 LCYKQSDPVFDPAKSGSYTGVSCGSSVCDRIENSGCHSGGCRYEVMYGDGSYTKGTLALE 223

Query: 252 TVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS---LAYCLVDR 308
           T++F  +  V+ +A+GCGH N G+F+G+AGLLG+GGG +S   Q+   +     YCLV R
Sbjct: 224 TLTFAKT-VVRNVAMGCGHRNRGMFIGAAGLLGIGGGSMSFVGQLSGQTGGAFGYCLVSR 282

Query: 309 DSPASGVLEF-NSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEA 367
            + ++G L F   A    A   PL+RN +  +FYYVGL G  VGG  + +P  +F++ E 
Sbjct: 283 GTDSTGSLVFGREALPVGASWVPLVRNPRAPSFYYVGLKGLGVGGVRIPLPDGVFDLTET 342

Query: 368 GDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPT 427
           GDGG+++D GTA+TRL T AY + RD F     NL   SGV++FDTCYD SG  SVRVPT
Sbjct: 343 GDGGVVMDTGTAVTRLPTAAYVAFRDGFKSQTANLPRASGVSIFDTCYDLSGFVSVRVPT 402

Query: 428 VSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNR 487
           VS +F  G  L LPA+N+L+PVD +GT+CFAFA + + LSIIGN+QQ+G +VSFD AN  
Sbjct: 403 VSFYFTEGPVLTLPARNFLMPVDDSGTYCFAFAASPTGLSIIGNIQQEGIQVSFDGANGF 462

Query: 488 VGFTPNKC 495
           VGF PN C
Sbjct: 463 VGFGPNVC 470


>gi|302776610|ref|XP_002971459.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
 gi|300160591|gb|EFJ27208.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
          Length = 357

 Score =  360 bits (924), Expect = 9e-97,   Method: Compositional matrix adjust.
 Identities = 189/356 (53%), Positives = 249/356 (69%), Gaps = 9/356 (2%)

Query: 148 VVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSS 207
           V SG + GSGEYF R+G+G+P +   +V+DTGSD+ W+QC PC  CY+Q+D +FDP+ SS
Sbjct: 3   VTSGLAFGSGEYFVRVGIGSPTKLQYLVMDTGSDVPWIQCSPCKSCYKQNDAVFDPRASS 62

Query: 208 SYSPLPCAAPQCKSLDVSACRA--NRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIA 265
           S+  L C+ PQCK LDV AC +  NRCLYQV+YGDGSFTVGDL +++ S  + G    + 
Sbjct: 63  SFRRLSCSTPQCKLLDVKACASTDNRCLYQVSYGDGSFTVGDLASDSFSV-SRGRTSPVV 121

Query: 266 LGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDS---PASGVLEFNSAR 322
            GCGHDNEGLFVG+AGLLGLG G LS   Q+ +   +YCLV RD+    +S +L  +SA 
Sbjct: 122 FGCGHDNEGLFVGAAGLLGLGAGKLSFPSQLSSRKFSYCLVSRDNGVRASSALLFGDSAL 181

Query: 323 GGDAVTA--PLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEA-GDGGIIVDCGTA 379
              A  A   L++N K+DTFYY GL+G S+GG  + IP + F++  + G GG+I+D GT+
Sbjct: 182 PTSASFAYTQLLKNPKLDTFYYAGLSGISIGGTLLSIPSTAFKLSSSTGRGGVIIDSGTS 241

Query: 380 ITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALD 439
           +TRL T AY  +RD+F      L   +  +LFDTCYDFS L SV +PTVS HF  G ++ 
Sbjct: 242 VTRLPTYAYTVMRDAFRSATQKLPRAADFSLFDTCYDFSALTSVTIPTVSFHFEGGASVQ 301

Query: 440 LPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           LP  NYL+PVD++GTFCFAF+ TS  LSIIGN+QQQ  RV+ DL ++RVGF P +C
Sbjct: 302 LPPSNYLVPVDTSGTFCFAFSKTSLDLSIIGNIQQQTMRVAIDLDSSRVGFAPRQC 357


>gi|356504173|ref|XP_003520873.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 461

 Score =  360 bits (923), Expect = 1e-96,   Method: Compositional matrix adjust.
 Identities = 209/399 (52%), Positives = 263/399 (65%), Gaps = 24/399 (6%)

Query: 105 RLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIG 164
           RL+RD+ RV  L+ ++                 +     FS+ ++SG +QGSGEYF+RIG
Sbjct: 78  RLQRDAKRVEALLNQIH--------------ARRSAGSSFSSSIISGLAQGSGEYFTRIG 123

Query: 165 VGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDV 224
           VGTP R   MVLDTGSD+ WLQC PC +CY Q+D +FDP  S +Y+ +PC AP C+ LD 
Sbjct: 124 VGTPARYVYMVLDTGSDVVWLQCAPCRKCYTQTDHVFDPTKSRTYAGIPCGAPLCRRLDS 183

Query: 225 SAC--RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGL 282
             C  +   C YQV+YGDGSFT GD  TET++F     V  +ALGCGHDNEGLF G+AGL
Sbjct: 184 PGCSNKNKVCQYQVSYGDGSFTFGDFSTETLTF-RRNRVTRVALGCGHDNEGLFTGAAGL 242

Query: 283 LGLGGGMLSLTKQIKAT---SLAYCLVDRDSPA--SGVLEFNSARGGDAVTAPLIRNKKV 337
           LGLG G LS   Q         +YCLVDR + A  S V+  +SA    A   PLI+N K+
Sbjct: 243 LGLGRGRLSFPVQTGRRFNHKFSYCLVDRSASAKPSSVIFGDSAVSRTAHFTPLIKNPKL 302

Query: 338 DTFYYVGLTGFSVGGQAVQ-IPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFV 396
           DTFYY+ L G SVGG  V+ +  SLF +D AG+GG+I+D GT++TRL   AY +LRD+F 
Sbjct: 303 DTFYYLELLGISVGGAPVRGLSASLFRLDAAGNGGVIIDSGTSVTRLTRPAYIALRDAFR 362

Query: 397 RLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFC 456
             A +LK     +LFDTC+D SGL  V+VPTV LHF  G  + LPA NYLIPVD++G+FC
Sbjct: 363 IGASHLKRAPEFSLFDTCFDLSGLTEVKVPTVVLHF-RGADVSLPATNYLIPVDNSGSFC 421

Query: 457 FAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           FAFA T S LSIIGN+QQQG R+S+DL  +RVGF P  C
Sbjct: 422 FAFAGTMSGLSIIGNIQQQGFRISYDLTGSRVGFAPRGC 460


>gi|297821064|ref|XP_002878415.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297324253|gb|EFH54674.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 486

 Score =  360 bits (923), Expect = 1e-96,   Method: Compositional matrix adjust.
 Identities = 219/448 (48%), Positives = 282/448 (62%), Gaps = 25/448 (5%)

Query: 64  ESETAAESFPLNSSSSFSLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLA 123
           ES++ ++     S++S S+ L   + L          L   RL+RDS RV ++ +   LA
Sbjct: 48  ESKSFSDESVSESTTSLSVHLSHVDALSSFSDASPVDLFKLRLQRDSLRVKSITS---LA 104

Query: 124 IYNVDRHELK--PAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSD 181
             +  R+  K  P  A      FS  V+SG SQGSGEYF R+GVGTP     MVLDTGSD
Sbjct: 105 AVSTGRNATKRTPRSAG----GFSGAVISGLSQGSGEYFMRLGVGTPATNVYMVLDTGSD 160

Query: 182 INWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSA-C---RANRCLYQVA 237
           + WLQC PC  CY QSD IFDPK S +++ +PC +  C+ LD S+ C   R+  CLYQV+
Sbjct: 161 VVWLQCSPCKACYNQSDVIFDPKKSKTFATVPCGSRLCRRLDDSSECVTRRSKTCLYQVS 220

Query: 238 YGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIK 297
           YGDGSFT GD  TET++F +   V  + LGCGHDNEGLFVG+AGLLGLG G LS   Q K
Sbjct: 221 YGDGSFTEGDFSTETLTF-HGARVDHVPLGCGHDNEGLFVGAAGLLGLGRGGLSFPSQTK 279

Query: 298 AT---SLAYCLVDRDSPASG------VLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGF 348
           +      +YCLVDR S  S       ++  N A    +V  PL+ N K+DTFYY+ L G 
Sbjct: 280 SRYNGKFSYCLVDRTSSGSSSKPPSTIVFGNDAVPKTSVFTPLLTNPKLDTFYYLQLLGI 339

Query: 349 SVGGQAV-QIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSG 407
           SVGG  V  +  S F++D  G+GG+I+D GT++TRL   AY +LRD+F   A  LK    
Sbjct: 340 SVGGSRVPGVSESQFKLDATGNGGVIIDSGTSVTRLTQSAYVALRDAFRLGATKLKRAPS 399

Query: 408 VALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALS 467
            +LFDTC+D SG+ +V+VPTV  HFG G+ + LPA NYLIPV++ G FCFAFA T  +LS
Sbjct: 400 YSLFDTCFDLSGMTTVKVPTVVFHFGGGE-VSLPASNYLIPVNTEGRFCFAFAGTMGSLS 458

Query: 468 IIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           IIGN+QQQG RV++DL  +RVGF    C
Sbjct: 459 IIGNIQQQGFRVAYDLVGSRVGFLSRAC 486


>gi|414881704|tpg|DAA58835.1| TPA: hypothetical protein ZEAMMB73_701358 [Zea mays]
          Length = 485

 Score =  359 bits (922), Expect = 2e-96,   Method: Compositional matrix adjust.
 Identities = 209/418 (50%), Positives = 254/418 (60%), Gaps = 39/418 (9%)

Query: 101 LVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYF 160
           L+  RL+RD  R   +              E   A      +  + PVVSG +QGSGEYF
Sbjct: 84  LLKHRLQRDKRRAARI-------------SEAAGAGGGNGRKGVAAPVVSGLAQGSGEYF 130

Query: 161 SRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCK 220
           ++IGVGTP  Q  MVLDTGSD+ W+QC PC  CY+QS P+FDP+ SSSY  + C A  C+
Sbjct: 131 TKIGVGTPATQALMVLDTGSDVVWVQCAPCRRCYEQSGPVFDPRRSSSYGAVGCGAALCR 190

Query: 221 SLDVSAC--RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVG 278
            LD   C  R   C+YQVAYGDGS T GD VTET++F     V  +ALGCGHDNEGLFV 
Sbjct: 191 RLDSGGCDLRRGACMYQVAYGDGSVTAGDFVTETLTFAGGARVARVALGCGHDNEGLFVA 250

Query: 279 SAGLLGLGGGMLSLTKQIK---ATSLAYCLVDRDSPASGV---------LEFNSARGG-- 324
           +AGLLGLG G LS   QI      S +YCLVDR S  +G          + F +   G  
Sbjct: 251 AAGLLGLGRGGLSFPTQISRRYGRSFSYCLVDRTSSGAGAAPGSHRSSTVSFGAGSVGAS 310

Query: 325 DAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMD-----EAGDGGIIVDCGTA 379
            A   P++RN +++TFYYV L G SVGG  V   P + E D       G GG+IVD GT+
Sbjct: 311 SASFTPMVRNPRMETFYYVQLVGISVGGARV---PGVAESDLRLDPSTGRGGVIVDSGTS 367

Query: 380 ITRLQTQAYNSLRDSF-VRLAGNLKPT-SGVALFDTCYDFSGLRSVRVPTVSLHFGAGKA 437
           +TRL   +Y++LRD+F    AG L+ +  G +LFDTCYD  G R V+VPTVS+HF  G  
Sbjct: 368 VTRLARASYSALRDAFRAAAAGGLRLSPGGFSLFDTCYDLGGRRVVKVPTVSMHFAGGAE 427

Query: 438 LDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
             LP +NYLIPVDS GTFCFAFA T   +SIIGN+QQQG RV FD    RVGF P  C
Sbjct: 428 AALPPENYLIPVDSRGTFCFAFAGTDGGVSIIGNIQQQGFRVVFDGDGQRVGFAPKGC 485


>gi|242053503|ref|XP_002455897.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
 gi|241927872|gb|EES01017.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
          Length = 493

 Score =  359 bits (921), Expect = 2e-96,   Method: Compositional matrix adjust.
 Identities = 209/418 (50%), Positives = 250/418 (59%), Gaps = 35/418 (8%)

Query: 101 LVLSRLERD---SARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSG 157
           L+  RL+RD   +AR++           N  R       A         PVVSG +QGSG
Sbjct: 88  LLRHRLQRDKRRAARISKAAAGGGAGAANGTRSRGGAVAA---------PVVSGLAQGSG 138

Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAP 217
           EYF++IGVGTP     MVLDTGSD+ WLQC PC  CY QS P+FDP+ SSSY  + CAAP
Sbjct: 139 EYFTKIGVGTPSTPALMVLDTGSDVVWLQCAPCRRCYDQSGPVFDPRRSSSYGAVDCAAP 198

Query: 218 QCKSLDVSAC--RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGL 275
            C+ LD   C  R   CLYQVAYGDGS T GD  TET++F     V  +ALGCGHDNEGL
Sbjct: 199 LCRRLDSGGCDLRRRACLYQVAYGDGSVTAGDFATETLTFAGGARVARVALGCGHDNEGL 258

Query: 276 FVGSAGLLGLGGGMLSLTKQIK---ATSLAYCLVDR---------DSPASGVLEFNSARG 323
           FV +AGLLGLG G LS   QI      S +YCLVDR             S  + F     
Sbjct: 259 FVAAAGLLGLGRGSLSFPTQISRRYGKSFSYCLVDRTSSSSSGAASRSRSSTVTFGPPSA 318

Query: 324 GDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMD-----EAGDGGIIVDCGT 378
             A   P++RN +++TFYYV L G SVGG  V   P + E D       G GG+IVD GT
Sbjct: 319 SAASFTPMVRNPRMETFYYVQLVGISVGGARV---PGVAESDLRLDPSTGRGGVIVDSGT 375

Query: 379 AITRLQTQAYNSLRDSFVRLAGNLKPT-SGVALFDTCYDFSGLRSVRVPTVSLHFGAGKA 437
           ++TRL   +Y++LRD+F   A  L+ +  G +LFDTCYD  G + V+VPTVS+HF  G  
Sbjct: 376 SVTRLARPSYSALRDAFRAAAAGLRLSPGGFSLFDTCYDLGGRKVVKVPTVSMHFAGGAE 435

Query: 438 LDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
             LP +NYLIPVDS GTFCFAFA T   +SIIGN+QQQG RV FD    RVGF P  C
Sbjct: 436 AALPPENYLIPVDSRGTFCFAFAGTDGGVSIIGNIQQQGFRVVFDGDGQRVGFAPKGC 493


>gi|357439021|ref|XP_003589787.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355478835|gb|AES60038.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 456

 Score =  358 bits (920), Expect = 3e-96,   Method: Compositional matrix adjust.
 Identities = 206/463 (44%), Positives = 280/463 (60%), Gaps = 43/463 (9%)

Query: 38  LDVSSALQQTEHILSFEPETLEPFAEESETAAESFPLNSSSSFSLPLHSREI-LHKTRHN 96
           L+V +A+ +T+         L+P  +++    +  P   +  F    H   I L KT H 
Sbjct: 32  LNVENAISETK---------LKPLKQQNHNTQQ--PQWKTKLF----HRDNINLKKTTH- 75

Query: 97  DYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGS 156
             ++  +SR+ RD  RV  L+ +L       +++  +          F + VVSG  +GS
Sbjct: 76  --KTRFISRINRDIKRVTFLLNRL-------NKNTQEQQTTTATEASFGSDVVSGTEEGS 126

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAA 216
           GEYF RIG+G+P     MV+D+GSDI W+QC PC +CY Q+DPIF+P TS+S+  + C++
Sbjct: 127 GEYFVRIGIGSPAIYQYMVIDSGSDIVWIQCEPCDQCYNQTDPIFNPATSASFIGVACSS 186

Query: 217 PQCKSLDVS-ACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGL 275
             C  LD   ACR  RC YQVAYGDGS+T G L  ET++ G +  ++  A+GCGH NEG+
Sbjct: 187 NVCNQLDDDVACRKGRCGYQVAYGDGSYTKGTLALETITIGRT-VIQDTAIGCGHWNEGM 245

Query: 276 FVGSAGLLGLGGGMLSLTKQIKATS---LAYCLVDRDSPASGVLEFNSARGGDAVTAPLI 332
           FVG+AGLLGLGGG +S   Q+ A +     YCLV R  P              A+  PLI
Sbjct: 246 FVGAAGLLGLGGGPMSFVGQLGAQTGGAFGYCLVSRAMPVG------------AMWVPLI 293

Query: 333 RNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLR 392
            N    +FYYV L+G +VGG  V I   +F++ + G GG+++D GTAITRL T AYN+ R
Sbjct: 294 HNPFYPSFYYVSLSGLAVGGIRVPISEQIFQLTDIGTGGVVMDTGTAITRLPTVAYNAFR 353

Query: 393 DSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSA 452
           D+F+    NL    GV++FDTCYD +G  +VRVPTVS +F  G+ L  PA+N+LIP D  
Sbjct: 354 DAFIAQTTNLPRAPGVSIFDTCYDLNGFVTVRVPTVSFYFSGGQILTFPARNFLIPADDV 413

Query: 453 GTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           GTFCFAFAP+ S LSIIGN+QQ+G +VS D  N  VGF PN C
Sbjct: 414 GTFCFAFAPSPSGLSIIGNIQQEGIQVSIDGTNGFVGFGPNVC 456


>gi|302765224|ref|XP_002966033.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
 gi|300166847|gb|EFJ33453.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
          Length = 357

 Score =  358 bits (919), Expect = 4e-96,   Method: Compositional matrix adjust.
 Identities = 188/356 (52%), Positives = 248/356 (69%), Gaps = 9/356 (2%)

Query: 148 VVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSS 207
           V SG + GSGEYF R+G+G+P +   +V+DTGSD+ W+QC PC  CY+Q+D +FDP+ SS
Sbjct: 3   VTSGLAFGSGEYFVRVGIGSPTKLQYLVMDTGSDVPWIQCSPCKSCYKQNDAVFDPRASS 62

Query: 208 SYSPLPCAAPQCKSLDVSACRA--NRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIA 265
           S+  L C+ PQCK LDV AC +  NRCLYQV+YGDGSFTVGDL +++    + G    + 
Sbjct: 63  SFRRLSCSTPQCKLLDVKACASTDNRCLYQVSYGDGSFTVGDLASDSF-LVSRGRTSPVV 121

Query: 266 LGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDS---PASGVLEFNSAR 322
            GCGHDNEGLFVG+AGLLGLG G LS   Q+ +   +YCLV RD+    +S +L  +SA 
Sbjct: 122 FGCGHDNEGLFVGAAGLLGLGAGKLSFPSQLSSRKFSYCLVSRDNGVRASSALLFGDSAL 181

Query: 323 GGDAVTA--PLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEA-GDGGIIVDCGTA 379
              A  A   L++N K+DTFYY GL+G S+GG  + IP + F++  + G GG+I+D GT+
Sbjct: 182 PTSASFAYTQLLKNPKLDTFYYAGLSGISIGGTLLSIPSTAFKLSSSTGRGGVIIDSGTS 241

Query: 380 ITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALD 439
           +TRL T AY  +RD+F      L   +  +LFDTCYDFS L SV +PTVS HF  G ++ 
Sbjct: 242 VTRLPTYAYTVMRDAFRSATQKLPRAADFSLFDTCYDFSALTSVTIPTVSFHFEGGASVQ 301

Query: 440 LPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           LP  NYL+PVD++GTFCFAF+ TS  LSIIGN+QQQ  RV+ DL ++RVGF P +C
Sbjct: 302 LPPSNYLVPVDTSGTFCFAFSKTSLDLSIIGNIQQQTMRVAIDLDSSRVGFAPRQC 357


>gi|312282359|dbj|BAJ34045.1| unnamed protein product [Thellungiella halophila]
          Length = 484

 Score =  357 bits (917), Expect = 5e-96,   Method: Compositional matrix adjust.
 Identities = 212/407 (52%), Positives = 266/407 (65%), Gaps = 25/407 (6%)

Query: 105 RLERDSARVNTLITKLQLAIYNVDRHELK--PAEAQILPEDFSTPVVSGASQGSGEYFSR 162
           RL+RDS RV +L +   LA  +  R+  K  P  A      FS  V+SG SQGSGEYF R
Sbjct: 87  RLQRDSLRVESLTS---LAAVSAGRNVTKRPPRSA----GGFSGVVISGLSQGSGEYFMR 139

Query: 163 IGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSL 222
           +GVGTP     MVLDTGSD+ WLQC PC  CY QSDP+F+P  S +++ +PC +  C+ L
Sbjct: 140 LGVGTPATNMYMVLDTGSDVVWLQCSPCKVCYNQSDPVFNPAKSKTFATVPCGSRLCRRL 199

Query: 223 DVSA-C---RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVG 278
           D S+ C   R+  CLYQV+YGDGSFTVGD  TET++F +   V  +ALGCGHDNEGLFVG
Sbjct: 200 DDSSECVSRRSKACLYQVSYGDGSFTVGDFSTETLTF-HGARVDHVALGCGHDNEGLFVG 258

Query: 279 SAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASG------VLEFNSARGGDAVTA 329
           +AGLLGLG G LS   Q K       +YCLVDR S  S       ++  N A    AV  
Sbjct: 259 AAGLLGLGRGGLSFPSQTKNRYNGKFSYCLVDRTSSGSSSKPPSTIVFGNGAVPKTAVFT 318

Query: 330 PLIRNKKVDTFYYVGLTGFSVGGQAVQ-IPPSLFEMDEAGDGGIIVDCGTAITRLQTQAY 388
           PL+ N K+DTFYY+ L G SVGG  V  +  S F++D  G+GG+I+D GT++TRL   AY
Sbjct: 319 PLLTNPKLDTFYYLQLLGISVGGSRVPGVSESQFKLDATGNGGVIIDSGTSVTRLTQSAY 378

Query: 389 NSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIP 448
            +LRD+F   A  LK     +LFDTC+D SG+ +V+VPTV  HF  G+ + LPA NYLIP
Sbjct: 379 VALRDAFRLGATRLKRAPSYSLFDTCFDLSGMTTVKVPTVVFHFTGGE-VSLPASNYLIP 437

Query: 449 VDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           V++ G FCFAFA T  +LSIIGN+QQQG RV++DL  +RVGF    C
Sbjct: 438 VNNQGRFCFAFAGTMGSLSIIGNIQQQGFRVAYDLVGSRVGFLSRAC 484


>gi|302774174|ref|XP_002970504.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
 gi|300162020|gb|EFJ28634.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
          Length = 483

 Score =  357 bits (917), Expect = 7e-96,   Method: Compositional matrix adjust.
 Identities = 199/431 (46%), Positives = 263/431 (61%), Gaps = 28/431 (6%)

Query: 84  LHSREILH--KTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILP 141
           +H   +L   K + + +  L+L  L+RD  RV  + +K QLA    D             
Sbjct: 61  IHRNSLLREAKEKLHTHEQLLLETLQRDEQRVRWIESKAQLAGKKKDEAS---------S 111

Query: 142 EDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIF 201
            D + PV SG   GSGEYF R+GVGTP R   MV+DTGSD+ WLQC+PC  CY+Q+DPIF
Sbjct: 112 TDLNGPVTSGLLYGSGEYFVRLGVGTPARSLFMVVDTGSDLPWLQCQPCKSCYKQADPIF 171

Query: 202 DPKTSSSYSPLPCAAPQCKSLDVSACRANR-----CLYQVAYGDGSFTVGDLVTETVSFG 256
           DP+ SSS+  +PC +P CK+L++ +C  +R     C YQVAYGDGSF+VGD  ++  + G
Sbjct: 172 DPRNSSSFQRIPCLSPLCKALEIHSCSGSRGATSRCSYQVAYGDGSFSVGDFSSDLFTLG 231

Query: 257 NSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQI--------KATSLAYCLVDR 308
                  +A GCG DNEGLF G+AGLLGLG G LS   QI         A S +YCLVDR
Sbjct: 232 TGSKAMSVAFGCGFDNEGLFAGAAGLLGLGAGKLSFPSQIFASSTNSSTANSFSYCLVDR 291

Query: 309 DSP---ASGVLEFNSAR-GGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEM 364
            +P   +S  L F +A     A  +PL++N K+DTFYY  + G SVGG  + I     ++
Sbjct: 292 SNPMTRSSSSLIFGAAAIPSTAALSPLLKNPKLDTFYYAAMIGVSVGGAQLPISLKSLQL 351

Query: 365 DEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVR 424
            ++G GG+I+D GT++TR  T  Y ++RD+F     NL      +LFDTCY+FSG  SV 
Sbjct: 352 SQSGSGGVIIDSGTSVTRFPTSVYATIRDAFRNATTNLPSAPRYSLFDTCYNFSGKASVD 411

Query: 425 VPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLA 484
           VP + LHF  G  L LP  NYLIP+++AG+FC AFAPTS  L IIGN+QQQ  R+ FDL 
Sbjct: 412 VPALVLHFENGADLQLPPTNYLIPINTAGSFCLAFAPTSMELGIIGNIQQQSFRIGFDLQ 471

Query: 485 NNRVGFTPNKC 495
            + + F P +C
Sbjct: 472 KSHLAFAPQQC 482


>gi|297603570|ref|NP_001054261.2| Os04g0677100 [Oryza sativa Japonica Group]
 gi|255675885|dbj|BAF16175.2| Os04g0677100 [Oryza sativa Japonica Group]
          Length = 464

 Score =  357 bits (916), Expect = 8e-96,   Method: Compositional matrix adjust.
 Identities = 188/422 (44%), Positives = 259/422 (61%), Gaps = 28/422 (6%)

Query: 81  SLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQIL 140
           SL L  R+ +    +   R  V+  + RD+ARV  L               L  + +  L
Sbjct: 64  SLSLVHRDAISGATYPSRRHQVVGLVARDNARVEHL------------EKRLVASTSPYL 111

Query: 141 PEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPI 200
           PED  + VV G   GSGEYF R+GVG+PP    +V+D+GSD+ W+QCRPC +CY Q+DP+
Sbjct: 112 PEDLVSEVVPGVDDGSGEYFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQCYAQTDPL 171

Query: 201 FDPKTSSSYSPLPCAAPQCKSLDVSACRAN----RCLYQVAYGDGSFTVGDLVTETVSFG 256
           FDP  SSS+S + C +  C++L  + C       +C Y V YGDGS+T G+L  ET++ G
Sbjct: 172 FDPAASSSFSGVSCGSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTLG 231

Query: 257 NSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS---LAYCLVDRDSPAS 313
            + +V+G+A+GCGH N GLFVG+AGLLGLG G +SL  Q+   +    +YCL  R +  +
Sbjct: 232 GT-AVQGVAIGCGHRNSGLFVGAAGLLGLGWGAMSLVGQLGGAAGGVFSYCLASRGAGGA 290

Query: 314 GVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGII 373
           G L           T  + R ++  +FYYVGLTG  VGG+ + +  SLF++ E G GG++
Sbjct: 291 GSLVLGR-------TEAVPRGRRASSFYYVGLTGIGVGGERLPLQDSLFQLTEDGAGGVV 343

Query: 374 VDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFG 433
           +D GTA+TRL  +AY +LR +F    G L  +  V+L DTCYD SG  SVRVPTVS +F 
Sbjct: 344 MDTGTAVTRLPREAYAALRGAFDGAMGALPRSPAVSLLDTCYDLSGYASVRVPTVSFYFD 403

Query: 434 AGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPN 493
            G  L LPA+N L+ V  A  FC AFAP+SS +SI+GN+QQ+G +++ D AN  VGF PN
Sbjct: 404 QGAVLTLPARNLLVEVGGA-VFCLAFAPSSSGISILGNIQQEGIQITVDSANGYVGFGPN 462

Query: 494 KC 495
            C
Sbjct: 463 TC 464


>gi|15228618|ref|NP_191741.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|6850873|emb|CAB71112.1| putative protein [Arabidopsis thaliana]
 gi|332646739|gb|AEE80260.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 483

 Score =  357 bits (915), Expect = 9e-96,   Method: Compositional matrix adjust.
 Identities = 210/407 (51%), Positives = 266/407 (65%), Gaps = 25/407 (6%)

Query: 105 RLERDSARVNTLITKLQLAIYNVDRHELK--PAEAQILPEDFSTPVVSGASQGSGEYFSR 162
           RL+RDS RV ++ +   LA  +  R+  K  P  A      FS  V+SG SQGSGEYF R
Sbjct: 86  RLQRDSLRVKSITS---LAAVSTGRNATKRTPRTAG----GFSGAVISGLSQGSGEYFMR 138

Query: 163 IGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSL 222
           +GVGTP     MVLDTGSD+ WLQC PC  CY Q+D IFDPK S +++ +PC +  C+ L
Sbjct: 139 LGVGTPATNVYMVLDTGSDVVWLQCSPCKACYNQTDAIFDPKKSKTFATVPCGSRLCRRL 198

Query: 223 DVSA-C---RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVG 278
           D S+ C   R+  CLYQV+YGDGSFT GD  TET++F +   V  + LGCGHDNEGLFVG
Sbjct: 199 DDSSECVTRRSKTCLYQVSYGDGSFTEGDFSTETLTF-HGARVDHVPLGCGHDNEGLFVG 257

Query: 279 SAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASG------VLEFNSARGGDAVTA 329
           +AGLLGLG G LS   Q K       +YCLVDR S  S       ++  N+A    +V  
Sbjct: 258 AAGLLGLGRGGLSFPSQTKNRYNGKFSYCLVDRTSSGSSSKPPSTIVFGNAAVPKTSVFT 317

Query: 330 PLIRNKKVDTFYYVGLTGFSVGGQAVQ-IPPSLFEMDEAGDGGIIVDCGTAITRLQTQAY 388
           PL+ N K+DTFYY+ L G SVGG  V  +  S F++D  G+GG+I+D GT++TRL   AY
Sbjct: 318 PLLTNPKLDTFYYLQLLGISVGGSRVPGVSESQFKLDATGNGGVIIDSGTSVTRLTQPAY 377

Query: 389 NSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIP 448
            +LRD+F   A  LK     +LFDTC+D SG+ +V+VPTV  HFG G+ + LPA NYLIP
Sbjct: 378 VALRDAFRLGATKLKRAPSYSLFDTCFDLSGMTTVKVPTVVFHFGGGE-VSLPASNYLIP 436

Query: 449 VDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           V++ G FCFAFA T  +LSIIGN+QQQG RV++DL  +RVGF    C
Sbjct: 437 VNTEGRFCFAFAGTMGSLSIIGNIQQQGFRVAYDLVGSRVGFLSRAC 483


>gi|326506192|dbj|BAJ86414.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326516358|dbj|BAJ92334.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 492

 Score =  355 bits (910), Expect = 4e-95,   Method: Compositional matrix adjust.
 Identities = 196/359 (54%), Positives = 238/359 (66%), Gaps = 16/359 (4%)

Query: 153 SQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPL 212
           +QGSGEYF++IGVGTP     MVLDTGSD+ WLQC PC  CY+QS  +FDP+ S SY+ +
Sbjct: 134 AQGSGEYFTKIGVGTPATPALMVLDTGSDVVWLQCAPCRRCYEQSGQVFDPRRSRSYNAV 193

Query: 213 PCAAPQCKSLDVSAC--RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGH 270
            CAAP C+ LD   C  R + CLYQVAYGDGS T GD  TET++F     V  +ALGCGH
Sbjct: 194 GCAAPLCRRLDSGGCDLRRSACLYQVAYGDGSVTAGDFATETLTFAGGARVARVALGCGH 253

Query: 271 DNEGLFVGSAGLLGLGGGMLSLTKQIK---ATSLAYCLVDRDSPA-----SGVLEFNSAR 322
           DNEGLFV +AGLLGLG G LS   QI      S +YCLVDR S A     S  + F S  
Sbjct: 254 DNEGLFVAAAGLLGLGRGSLSFPTQISRRYGRSFSYCLVDRTSSANTASRSSTVTFGSGA 313

Query: 323 GGDAVTA---PLIRNKKVDTFYYVGLTGFSVGGQAVQ-IPPSLFEMD-EAGDGGIIVDCG 377
            G  V +   P+++N +++TFYYV L G SVGG  V  +  S   +D  +G GG+IVD G
Sbjct: 314 VGSTVASSFTPMVKNPRMETFYYVQLIGISVGGARVPGVANSDLRLDPSSGRGGVIVDSG 373

Query: 378 TAITRLQTQAYNSLRDSFVRLAGNLKPT-SGVALFDTCYDFSGLRSVRVPTVSLHFGAGK 436
           T++TRL   AY++LRD+F   A  L+ +  G +LFDTCYD SG + V+VPTVS+HF  G 
Sbjct: 374 TSVTRLARPAYSALRDAFRGAAAGLRLSPGGFSLFDTCYDLSGRKVVKVPTVSMHFAGGA 433

Query: 437 ALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
              LP +NYLIPVDS GTFCFAFA T   +SIIGN+QQQG RV FD    RV FTP  C
Sbjct: 434 EAALPPENYLIPVDSKGTFCFAFAGTDGGVSIIGNIQQQGFRVVFDGDGQRVAFTPKGC 492


>gi|115438214|ref|NP_001043485.1| Os01g0598600 [Oryza sativa Japonica Group]
 gi|15290061|dbj|BAB63755.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
           Group]
 gi|113533016|dbj|BAF05399.1| Os01g0598600 [Oryza sativa Japonica Group]
 gi|125526702|gb|EAY74816.1| hypothetical protein OsI_02707 [Oryza sativa Indica Group]
          Length = 500

 Score =  353 bits (907), Expect = 1e-94,   Method: Compositional matrix adjust.
 Identities = 200/369 (54%), Positives = 242/369 (65%), Gaps = 17/369 (4%)

Query: 144 FSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDP 203
           F  PVVSG +QGSGEYF++IGVGTP     MVLDTGSD+ WLQC PC  CY QS  +FDP
Sbjct: 132 FVAPVVSGLAQGSGEYFTKIGVGTPVTPALMVLDTGSDVVWLQCAPCRRCYDQSGQMFDP 191

Query: 204 KTSSSYSPLPCAAPQCKSLDVSAC--RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSV 261
           + S SY  + CAAP C+ LD   C  R   CLYQVAYGDGS T GD  TET++F +   V
Sbjct: 192 RASHSYGAVDCAAPLCRRLDSGGCDLRRKACLYQVAYGDGSVTAGDFATETLTFASGARV 251

Query: 262 KGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIK---ATSLAYCLVDRDSPASGV--- 315
             +ALGCGHDNEGLFV +AGLLGLG G LS   QI      S +YCLVDR S ++     
Sbjct: 252 PRVALGCGHDNEGLFVAAAGLLGLGRGSLSFPSQISRRFGRSFSYCLVDRTSSSASATSR 311

Query: 316 ---LEFNSARGGDAVTA---PLIRNKKVDTFYYVGLTGFSVGGQAVQ-IPPSLFEMDEA- 367
              + F S   G +  A   P+++N +++TFYYV L G SVGG  V  +  S   +D + 
Sbjct: 312 SSTVTFGSGAVGPSAAASFTPMVKNPRMETFYYVQLMGISVGGARVPGVAVSDLRLDPST 371

Query: 368 GDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPT-SGVALFDTCYDFSGLRSVRVP 426
           G GG+IVD GT++TRL   AY +LRD+F   A  L+ +  G +LFDTCYD SGL+ V+VP
Sbjct: 372 GRGGVIVDSGTSVTRLARPAYAALRDAFRAAAAGLRLSPGGFSLFDTCYDLSGLKVVKVP 431

Query: 427 TVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANN 486
           TVS+HF  G    LP +NYLIPVDS GTFCFAFA T   +SIIGN+QQQG RV FD    
Sbjct: 432 TVSMHFAGGAEAALPPENYLIPVDSRGTFCFAFAGTDGGVSIIGNIQQQGFRVVFDGDGQ 491

Query: 487 RVGFTPNKC 495
           R+GF P  C
Sbjct: 492 RLGFVPKGC 500


>gi|168013126|ref|XP_001759252.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162689565|gb|EDQ75936.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 385

 Score =  352 bits (903), Expect = 2e-94,   Method: Compositional matrix adjust.
 Identities = 197/376 (52%), Positives = 249/376 (66%), Gaps = 21/376 (5%)

Query: 136 EAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQ 195
           + ++  +DF  PV+SG S GSGEYF R+ VGTPPR   +V+DTGSDI WLQC PC  CY 
Sbjct: 14  QTKVPSQDFQAPVISGLSLGSGEYFIRVSVGTPPRGMYLVMDTGSDILWLQCAPCVSCYH 73

Query: 196 QSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSF 255
           Q D +FDP  SS+YS L C + QC +LDV  C  N+CLYQV YGDGSF+ G+  T+ VS 
Sbjct: 74  QCDEVFDPYKSSTYSTLGCNSRQCLNLDVGGCVGNKCLYQVDYGDGSFSTGEFATDAVSL 133

Query: 256 GNSGSVKG------IALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS---LAYCLV 306
            NS S  G      I LGCGHDNEG FVG+AGLLGLG G LS   QI + +    +YCL 
Sbjct: 134 -NSTSGGGQVVLNKIPLGCGHDNEGYFVGAAGLLGLGKGPLSFPNQINSENGGRFSYCLT 192

Query: 307 DRDSPASGVLEFNSARGGDAVT-------APLIRNKKVDTFYYVGLTGFSVGGQAVQIPP 359
            RD+ ++   E +S   GDA          P   N +V TFYY+ +TG SVGG  + IP 
Sbjct: 193 GRDTDST---ERSSLIFGDAAVPPAGVRFTPQASNLRVSTFYYLKMTGISVGGSILTIPT 249

Query: 360 SLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSG 419
           S F++D  G+GG+I+D GT++TRLQ  AY SLR++F     +L  T+  +LFDTCY+ S 
Sbjct: 250 SAFQLDSLGNGGVIIDSGTSVTRLQNAAYASLREAFRAGTSDLVLTTEFSLFDTCYNLSD 309

Query: 420 LRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRV 479
           L SV VPTV+LHF  G  L LPA NYL+PVD++ TFC AFA T+   SIIGN+QQQG RV
Sbjct: 310 LSSVDVPTVTLHFQGGADLKLPASNYLVPVDNSSTFCLAFAGTTGP-SIIGNIQQQGFRV 368

Query: 480 SFDLANNRVGFTPNKC 495
            +D  +N+VGF P++C
Sbjct: 369 IYDNLHNQVGFVPSQC 384


>gi|449441139|ref|XP_004138341.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
 gi|449477464|ref|XP_004155031.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 336

 Score =  349 bits (896), Expect = 1e-93,   Method: Compositional matrix adjust.
 Identities = 179/334 (53%), Positives = 236/334 (70%), Gaps = 3/334 (0%)

Query: 165 VGTPPRQFSMVLDTGSDINWLQCRPC---TECYQQSDPIFDPKTSSSYSPLPCAAPQCKS 221
           VG P +    VLDTGSD+ WLQC PC     CY+Q  PIFDP+ SSSY+P+ C + QC+ 
Sbjct: 3   VGQPQQPSFFVLDTGSDVTWLQCLPCAGKNGCYEQITPIFDPELSSSYNPVSCDSEQCQL 62

Query: 222 LDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAG 281
           LD + C  N C+Y+V YGDGSFT+G+L TET++F +S S+  I++GCGHDNEGLFVG+ G
Sbjct: 63  LDEAGCNVNSCIYKVEYGDGSFTIGELATETLTFVHSNSIPNISIGCGHDNEGLFVGADG 122

Query: 282 LLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFY 341
           L+GLGGG +S++ Q+KA+S +YCLVD DSP+   L+FN+    D++ +PL++N +  +F 
Sbjct: 123 LIGLGGGAISISSQLKASSFSYCLVDIDSPSFSTLDFNTDPPSDSLISPLVKNDRFPSFR 182

Query: 342 YVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGN 401
           YV + G SVGG+ + I  S FE+DE+G GGIIVD GT IT+L +  Y  LR++F+ L  N
Sbjct: 183 YVKVIGMSVGGKPLPISSSRFEIDESGLGGIIVDSGTTITQLPSDVYEVLREAFLGLTTN 242

Query: 402 LKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAP 461
           L P   ++ FDTCYD S   +V VPT++       +L LPAKN LI VDSAGTFC AF  
Sbjct: 243 LPPAPEISPFDTCYDLSSQSNVEVPTIAFILPGENSLQLPAKNCLIQVDSAGTFCLAFVS 302

Query: 462 TSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            +  LSIIGN QQQG RVS+DL N+ VGF+ NKC
Sbjct: 303 ATFPLSIIGNFQQQGIRVSYDLTNSLVGFSTNKC 336


>gi|302793638|ref|XP_002978584.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
 gi|300153933|gb|EFJ20570.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
          Length = 407

 Score =  349 bits (895), Expect = 2e-93,   Method: Compositional matrix adjust.
 Identities = 195/415 (46%), Positives = 256/415 (61%), Gaps = 26/415 (6%)

Query: 98  YRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSG 157
           +  L+L  L+RD  RV  + +K +LA    D              D + PV SG   GSG
Sbjct: 2   HEQLLLETLQRDERRVRWIESKAKLAGKKKDEAS---------STDLNGPVTSGLLYGSG 52

Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAP 217
           EYF R+G+GTP R   MV+DTGSD+ WLQC+PC  CY+Q+DPIFDP+ SSS+  +PC +P
Sbjct: 53  EYFVRLGLGTPARSLFMVVDTGSDLPWLQCQPCKSCYKQADPIFDPRNSSSFQRIPCLSP 112

Query: 218 QCKSLDVSACRANR-----CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDN 272
            CK+L+V +C  +R     C YQVAYGDGSF+VGD  ++  + G       +A GCG DN
Sbjct: 113 LCKALEVHSCSGSRGATSRCSYQVAYGDGSFSVGDFSSDLFTLGTGSKAMSVAFGCGFDN 172

Query: 273 EGLFVGSAGLLGLGGGMLSLTKQI--------KATSLAYCLVDRDSP---ASGVLEFN-S 320
           EGLF G+AGLLGLG G LS   QI         A S +YCLVDR +P   +S  L F  +
Sbjct: 173 EGLFAGAAGLLGLGAGKLSFPSQIFASSTNSSTANSFSYCLVDRSNPMTRSSSSLIFGVA 232

Query: 321 ARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAI 380
           A    A  +PL++N K+DTFYY  + G SVGG  + I     ++ ++G GG+I+D GT++
Sbjct: 233 AIPSTAALSPLLKNPKLDTFYYAAMIGVSVGGAQLPISLKSLQLSQSGSGGVIIDSGTSV 292

Query: 381 TRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDL 440
           TR  T  Y ++RD+F     NL      +LFDTCY+FSG  SV VP + LHF  G  L L
Sbjct: 293 TRFPTSVYATIRDAFRNATINLPSAPRYSLFDTCYNFSGKASVDVPALVLHFENGADLQL 352

Query: 441 PAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           P  NYLIP+++AG+FC AFAPTS  L IIGN+QQQ  R+ FDL  + + F P +C
Sbjct: 353 PPTNYLIPINTAGSFCLAFAPTSMELGIIGNIQQQSFRIGFDLQKSHLAFAPQQC 407


>gi|38344196|emb|CAE05761.2| OSJNBa0064G10.12 [Oryza sativa Japonica Group]
          Length = 451

 Score =  348 bits (893), Expect = 4e-93,   Method: Compositional matrix adjust.
 Identities = 186/422 (44%), Positives = 254/422 (60%), Gaps = 41/422 (9%)

Query: 81  SLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQIL 140
           SL L  R+ +    +   R  V+  + RD+ARV  L               L  + +  L
Sbjct: 64  SLSLVHRDAISGATYPSRRHQVVGLVARDNARVEHL------------EKRLVASTSPYL 111

Query: 141 PEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPI 200
           PED  + VV G   GSGEYF R+GVG+PP    +V+D+GSD+ W+QCRPC +CY Q+DP+
Sbjct: 112 PEDLVSEVVPGVDDGSGEYFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQCYAQTDPL 171

Query: 201 FDPKTSSSYSPLPCAAPQCKSLDVSACRAN----RCLYQVAYGDGSFTVGDLVTETVSFG 256
           FDP  SSS+S + C +  C++L  + C       +C Y V YGDGS+T G+L  ET++ G
Sbjct: 172 FDPAASSSFSGVSCGSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTLG 231

Query: 257 NSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS---LAYCLVDRDSPAS 313
            + +V+G+A+GCGH N GLFVG+AGLLGLG G +SL  Q+   +    +YCL  R +  +
Sbjct: 232 GT-AVQGVAIGCGHRNSGLFVGAAGLLGLGWGAMSLVGQLGGAAGGVFSYCLASRGAGGA 290

Query: 314 GVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGII 373
           G L                      +FYYVGLTG  VGG+ + +  SLF++ E G GG++
Sbjct: 291 GSL--------------------ASSFYYVGLTGIGVGGERLPLQDSLFQLTEDGAGGVV 330

Query: 374 VDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFG 433
           +D GTA+TRL  +AY +LR +F    G L  +  V+L DTCYD SG  SVRVPTVS +F 
Sbjct: 331 MDTGTAVTRLPREAYAALRGAFDGAMGALPRSPAVSLLDTCYDLSGYASVRVPTVSFYFD 390

Query: 434 AGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPN 493
            G  L LPA+N L+ V  A  FC AFAP+SS +SI+GN+QQ+G +++ D AN  VGF PN
Sbjct: 391 QGAVLTLPARNLLVEVGGA-VFCLAFAPSSSGISILGNIQQEGIQITVDSANGYVGFGPN 449

Query: 494 KC 495
            C
Sbjct: 450 TC 451


>gi|326530426|dbj|BAJ97639.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 479

 Score =  347 bits (890), Expect = 8e-93,   Method: Compositional matrix adjust.
 Identities = 199/426 (46%), Positives = 273/426 (64%), Gaps = 27/426 (6%)

Query: 81  SLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQIL 140
           SL L  R+ +    +   R  +L    RD ARV  L            +  L P     +
Sbjct: 70  SLALLHRDAVSGRTYPSTRHAMLGLAARDGARVEYL------------QRRLSPTT---M 114

Query: 141 PEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPI 200
             +  + VVSG S+GSGEYF R+GVG+PP +  +V+D+GSD+ W+QCRPC ECYQQ+DP+
Sbjct: 115 TTEVGSEVVSGISEGSGEYFVRVGVGSPPTEQYLVVDSGSDVIWIQCRPCAECYQQADPL 174

Query: 201 FDPKTSSSYSPLPCAAPQCKSL--DVSACR-ANRCLYQVAYGDGSFTVGDLVTETVSFGN 257
           FDP  S+S++ +PC +  C++L    S C  +  C YQV+YGDGS+T G L  ET++FG+
Sbjct: 175 FDPAASASFTAVPCDSGVCRTLPGGSSGCADSGACRYQVSYGDGSYTQGVLAMETLTFGD 234

Query: 258 SGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQI---KATSLAYCLVDRDSPA-S 313
           S  V+G+A+GCGH N GLFVG+AGLLGLG G +SL  Q+      + +YCL  R + A +
Sbjct: 235 STPVQGVAIGCGHRNRGLFVGAAGLLGLGWGPMSLVGQLGGAAGGAFSYCLASRGADAGA 294

Query: 314 GVLEF--NSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGG 371
           G L F  + A    AV  PL+RN +  +FYYVGLTG  VGG+ + +   LF++ E G GG
Sbjct: 295 GSLVFGRDDAMPVGAVWVPLLRNAQQPSFYYVGLTGLGVGGERLPLQDGLFDLTEDGGGG 354

Query: 372 IIVDCGTAITRLQTQAYNSLRDSFVR-LAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSL 430
           +++D GTA+TRL   AY +LRD+F   + G+L    GV+L DTCYD SG  SVRVPTV+L
Sbjct: 355 VVMDTGTAVTRLPPDAYAALRDAFASTIGGDLPRAPGVSLLDTCYDLSGYASVRVPTVAL 414

Query: 431 HFGA-GKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVG 489
           +FG  G AL LPA+N L+ +   G +C AFA ++S LSI+GN+QQQG +++ D AN  VG
Sbjct: 415 YFGRDGAALTLPARNLLVEM-GGGVYCLAFAASASGLSILGNIQQQGIQITVDSANGYVG 473

Query: 490 FTPNKC 495
           F P+ C
Sbjct: 474 FGPSTC 479


>gi|115465771|ref|NP_001056485.1| Os05g0590000 [Oryza sativa Japonica Group]
 gi|49328116|gb|AAT58814.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
 gi|113580036|dbj|BAF18399.1| Os05g0590000 [Oryza sativa Japonica Group]
          Length = 481

 Score =  340 bits (872), Expect = 1e-90,   Method: Compositional matrix adjust.
 Identities = 188/369 (50%), Positives = 241/369 (65%), Gaps = 17/369 (4%)

Query: 144 FSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDP 203
           F+ P++SG  QGSGEYF+++GVGTP     MVLDTGSD+ WLQC PC  CY QS  +FDP
Sbjct: 113 FAAPLLSGLPQGSGEYFAQVGVGTPATTALMVLDTGSDVVWLQCAPCRHCYAQSGRVFDP 172

Query: 204 KTSSSYSPLPCAAPQCKSLDVSAC--RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSV 261
           + S SY+ + C AP C+ LD + C  R N CLYQVAYGDGS T GD  +ET++F     V
Sbjct: 173 RRSRSYAAVDCVAPICRRLDSAGCDRRRNSCLYQVAYGDGSVTAGDFASETLTFARGARV 232

Query: 262 KGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPA------ 312
           + +A+GCGHDNEGLF+ ++GLLGLG G LS   QI  +   S +YCLVDR S        
Sbjct: 233 QRVAIGCGHDNEGLFIAASGLLGLGRGRLSFPSQIARSFGRSFSYCLVDRTSSVRPSSTR 292

Query: 313 SGVLEFNSARGGDAVTA---PLIRNKKVDTFYYVGLTGFSVGGQAVQ-IPPSLFEMD-EA 367
           S  + F +     A  A   P+ RN ++ TFYYV L GFSVGG  V+ +  S   ++   
Sbjct: 293 SSTVTFGAGAVAAAAGASFTPMGRNPRMATFYYVHLLGFSVGGARVKGVSQSDLRLNPTT 352

Query: 368 GDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPT-SGVALFDTCYDFSGLRSVRVP 426
           G GG+I+D GT++TRL    Y ++RD+F   A  L+ +  G +LFDTCY+ SG R V+VP
Sbjct: 353 GRGGVILDSGTSVTRLARPVYEAVRDAFRAAAVGLRVSPGGFSLFDTCYNLSGRRVVKVP 412

Query: 427 TVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANN 486
           TVS+H   G ++ LP +NYLIPVD++GTFCFA A T   +SIIGN+QQQG RV FD    
Sbjct: 413 TVSMHLAGGASVALPPENYLIPVDTSGTFCFAMAGTDGGVSIIGNIQQQGFRVVFDGDAQ 472

Query: 487 RVGFTPNKC 495
           RVGF P  C
Sbjct: 473 RVGFVPKSC 481


>gi|47777372|gb|AAT38006.1| unknow protein [Oryza sativa Japonica Group]
          Length = 475

 Score =  340 bits (871), Expect = 1e-90,   Method: Compositional matrix adjust.
 Identities = 188/369 (50%), Positives = 241/369 (65%), Gaps = 17/369 (4%)

Query: 144 FSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDP 203
           F+ P++SG  QGSGEYF+++GVGTP     MVLDTGSD+ WLQC PC  CY QS  +FDP
Sbjct: 107 FAAPLLSGLPQGSGEYFAQVGVGTPATTALMVLDTGSDVVWLQCAPCRHCYAQSGRVFDP 166

Query: 204 KTSSSYSPLPCAAPQCKSLDVSAC--RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSV 261
           + S SY+ + C AP C+ LD + C  R N CLYQVAYGDGS T GD  +ET++F     V
Sbjct: 167 RRSRSYAAVDCVAPICRRLDSAGCDRRRNSCLYQVAYGDGSVTAGDFASETLTFARGARV 226

Query: 262 KGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPA------ 312
           + +A+GCGHDNEGLF+ ++GLLGLG G LS   QI  +   S +YCLVDR S        
Sbjct: 227 QRVAIGCGHDNEGLFIAASGLLGLGRGRLSFPSQIARSFGRSFSYCLVDRTSSVRPSSTR 286

Query: 313 SGVLEFNSARGGDAVTA---PLIRNKKVDTFYYVGLTGFSVGGQAVQ-IPPSLFEMD-EA 367
           S  + F +     A  A   P+ RN ++ TFYYV L GFSVGG  V+ +  S   ++   
Sbjct: 287 SSTVTFGAGAVAAAAGASFTPMGRNPRMATFYYVHLLGFSVGGARVKGVSQSDLRLNPTT 346

Query: 368 GDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPT-SGVALFDTCYDFSGLRSVRVP 426
           G GG+I+D GT++TRL    Y ++RD+F   A  L+ +  G +LFDTCY+ SG R V+VP
Sbjct: 347 GRGGVILDSGTSVTRLARPVYEAVRDAFRAAAVGLRVSPGGFSLFDTCYNLSGRRVVKVP 406

Query: 427 TVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANN 486
           TVS+H   G ++ LP +NYLIPVD++GTFCFA A T   +SIIGN+QQQG RV FD    
Sbjct: 407 TVSMHLAGGASVALPPENYLIPVDTSGTFCFAMAGTDGGVSIIGNIQQQGFRVVFDGDAQ 466

Query: 487 RVGFTPNKC 495
           RVGF P  C
Sbjct: 467 RVGFVPKSC 475


>gi|125553531|gb|EAY99240.1| hypothetical protein OsI_21202 [Oryza sativa Indica Group]
          Length = 475

 Score =  340 bits (871), Expect = 1e-90,   Method: Compositional matrix adjust.
 Identities = 188/369 (50%), Positives = 241/369 (65%), Gaps = 17/369 (4%)

Query: 144 FSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDP 203
           F+ P++SG  QGSGEYF+++GVGTP     MVLDTGSD+ WLQC PC  CY QS  +FDP
Sbjct: 107 FAAPLLSGLPQGSGEYFAQVGVGTPATTALMVLDTGSDVVWLQCAPCRHCYAQSGRVFDP 166

Query: 204 KTSSSYSPLPCAAPQCKSLDVSAC--RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSV 261
           + S SY+ + C AP C+ LD + C  R N CLYQVAYGDGS T GD  +ET++F     V
Sbjct: 167 RRSRSYAAVDCVAPICRRLDSAGCDRRRNSCLYQVAYGDGSVTAGDFASETLTFARGARV 226

Query: 262 KGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPA------ 312
           + +A+GCGHDNEGLF+ ++GLLGLG G LS   QI  +   S +YCLVDR S        
Sbjct: 227 QRVAIGCGHDNEGLFIAASGLLGLGRGRLSFPTQIARSFGRSFSYCLVDRTSSVRPSSTR 286

Query: 313 SGVLEFNSARGGDAVTA---PLIRNKKVDTFYYVGLTGFSVGGQAVQ-IPPSLFEMD-EA 367
           S  + F +     A  A   P+ RN ++ TFYYV L GFSVGG  V+ +  S   ++   
Sbjct: 287 SSTVTFGAGAVAAAAGASFTPMGRNPRMATFYYVHLLGFSVGGARVKGVSQSDLRLNPTT 346

Query: 368 GDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPT-SGVALFDTCYDFSGLRSVRVP 426
           G GG+I+D GT++TRL    Y ++RD+F   A  L+ +  G +LFDTCY+ SG R V+VP
Sbjct: 347 GRGGVILDSGTSVTRLARPVYEAVRDAFRAAAVGLRVSPGGFSLFDTCYNLSGRRVVKVP 406

Query: 427 TVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANN 486
           TVS+H   G ++ LP +NYLIPVD++GTFCFA A T   +SIIGN+QQQG RV FD    
Sbjct: 407 TVSMHLAGGASVALPPENYLIPVDTSGTFCFAMAGTDGGVSIIGNIQQQGFRVVFDGDAQ 466

Query: 487 RVGFTPNKC 495
           RVGF P  C
Sbjct: 467 RVGFVPKSC 475


>gi|302817380|ref|XP_002990366.1| hypothetical protein SELMODRAFT_43971 [Selaginella moellendorffii]
 gi|300141928|gb|EFJ08635.1| hypothetical protein SELMODRAFT_43971 [Selaginella moellendorffii]
          Length = 420

 Score =  338 bits (868), Expect = 3e-90,   Method: Compositional matrix adjust.
 Identities = 193/422 (45%), Positives = 258/422 (61%), Gaps = 27/422 (6%)

Query: 99  RSLVLSRLERDSARVNTLITKLQLAIYNVDRHELK---PAEAQILP-------------- 141
           + L+L+RL +D  R   +   + LA     + +L+   P +++ L               
Sbjct: 1   KQLLLARLRKDELRSKAIAATIALATNGWRKSDLRHPLPGQSESLAVAGLASGRGGRGHG 60

Query: 142 ---EDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD 198
                F++P++SG + GSG+YF+RIGVGTP R   MV DTGSD++WLQC PC +CY+Q D
Sbjct: 61  GARRGFASPLISGIAGGSGDYFARIGVGTPARSVYMVADTGSDVSWLQCSPCRKCYRQQD 120

Query: 199 PIFDPKTSSSYSPLPCAAPQCKSLDVSAC-RANRCLYQVAYGDGSFTVGDLVTETVSFGN 257
           PIF+P  SSS+ PL CA+  C  L +  C R N C+YQV+YGDGSFTVGD  TET+SFG 
Sbjct: 121 PIFNPSLSSSFKPLACASSICGKLKIKGCSRKNECMYQVSYGDGSFTVGDFSTETLSFGE 180

Query: 258 SGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIK---ATSLAYCLVDRDSPASG 314
             +V+ +A+GCG +N+GLF G+AGLLGLG G LS   Q     A+  +YCL  R+S  + 
Sbjct: 181 H-AVRSVAMGCGRNNQGLFHGAAGLLGLGRGPLSFPSQTGTSYASVFSYCLPRRESAIAA 239

Query: 315 VLEFN-SARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGII 373
            L F  SA    A    L+ N+++DT+YYVGL    V G  V IPP  F M   G GG+I
Sbjct: 240 SLVFGPSAVPEKARFTKLLPNRRLDTYYYVGLARIRVAGSPVNIPPDAFAMGSRGTGGVI 299

Query: 374 VDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFG 433
           VD GTAI+RL T AY +LRD+F  L        G++LFDTCYD S +++  +P V L F 
Sbjct: 300 VDSGTAISRLTTPAYTALRDAFRSLV-TFPSAPGISLFDTCYDLSSMKTATLPAVVLDFD 358

Query: 434 AGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPN 493
            G ++ LPA   L+ VD  GT+C AFAP   A SIIGNVQQQ  R+S D    ++G  P+
Sbjct: 359 GGASMPLPADGILVNVDDEGTYCLAFAPEEEAFSIIGNVQQQTFRISIDNQKEQMGIAPD 418

Query: 494 KC 495
           +C
Sbjct: 419 QC 420


>gi|302795261|ref|XP_002979394.1| hypothetical protein SELMODRAFT_53966 [Selaginella moellendorffii]
 gi|300153162|gb|EFJ19802.1| hypothetical protein SELMODRAFT_53966 [Selaginella moellendorffii]
          Length = 353

 Score =  335 bits (860), Expect = 2e-89,   Method: Compositional matrix adjust.
 Identities = 181/355 (50%), Positives = 234/355 (65%), Gaps = 7/355 (1%)

Query: 146 TPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKT 205
           +P++SG + GSG+YF+RIGVGTP R   MV DTGSD++WLQC PC +CY+Q DPIF+P  
Sbjct: 1   SPLISGIAGGSGDYFARIGVGTPARSVYMVADTGSDVSWLQCSPCRKCYRQQDPIFNPSL 60

Query: 206 SSSYSPLPCAAPQCKSLDVSAC-RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGI 264
           SSS+ PL CA+  C  L +  C R N+C+YQV+YGDGSFTVGD  TET+SFG   +V+ +
Sbjct: 61  SSSFKPLACASSICGKLKIKGCSRKNKCMYQVSYGDGSFTVGDFSTETLSFGEH-AVRSV 119

Query: 265 ALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIK---ATSLAYCLVDRDSPASGVLEFN-S 320
           A+GCG +N+GLF G+AGLLGLG G LS   Q     A+  +YCL  R+S  +  L F  S
Sbjct: 120 AMGCGRNNQGLFHGAAGLLGLGRGPLSFPSQTGTSYASVFSYCLPRRESAIAASLVFGPS 179

Query: 321 ARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAI 380
           A    A    L+ N+++DT+YYVGL    V G  V IPP  F M   G GG+IVD GTAI
Sbjct: 180 AVPEKARFTKLLPNRRLDTYYYVGLARIRVAGSPVNIPPDAFAMGSRGTGGVIVDSGTAI 239

Query: 381 TRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDL 440
           +RL T AY +LRD+F  L        G++LFDTCYD S +++  +P V L F  G ++ L
Sbjct: 240 SRLTTPAYTALRDAFRSLV-TFPSAPGISLFDTCYDLSSMKTATLPAVVLDFDGGASMPL 298

Query: 441 PAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           PA   L+ VD  GT+C AFAP   A SIIGNVQQQ  R+S D    ++G  P++C
Sbjct: 299 PADGILVNVDDEGTYCLAFAPEEEAFSIIGNVQQQTFRISIDNQKEQMGIAPDQC 353


>gi|168064509|ref|XP_001784204.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162664276|gb|EDQ51002.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 367

 Score =  335 bits (860), Expect = 3e-89,   Method: Compositional matrix adjust.
 Identities = 174/368 (47%), Positives = 235/368 (63%), Gaps = 17/368 (4%)

Query: 144 FSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDP 203
           F  P+ SG + G+GEYF+ +GVGTP R   +V+DTGSDI WLQC PCT CY+Q D +F+P
Sbjct: 1   FEAPIFSGLAFGTGEYFAVVGVGTPRRDMYLVVDTGSDITWLQCAPCTNCYKQKDALFNP 60

Query: 204 KTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETV----SFGNSG 259
            +SSS+  L C++  C +LDV  C +N+CLYQ  YGDGSFT+G+LVT+ V    +FG   
Sbjct: 61  SSSSSFKVLDCSSSLCLNLDVMGCLSNKCLYQADYGDGSFTMGELVTDNVVLDDAFGPGQ 120

Query: 260 SV-KGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS---LAYCLVDRDSPAS-- 313
            V   I LGCGHDNEG F  +AG+LGLG G LS    + A++    +YCL DR+S  +  
Sbjct: 121 VVLTNIPLGCGHDNEGTFGTAAGILGLGRGPLSFPNNLDASTRNIFSYCLPDRESDPNHK 180

Query: 314 GVLEFNSA-----RGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAV-QIPPSLFEMDEA 367
             L F  A       G     P +RN +V T+YYV +TG SVGG  +  IP S+F++D  
Sbjct: 181 STLVFGDAAIPHTATGSVKFIPQLRNPRVATYYYVQITGISVGGNLLTNIPASVFQLDSH 240

Query: 368 GDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPT 427
           G+GG I D GT ITRL+ +AY ++RD+F     +L   +   +FDTCYDF+G+ S+ VPT
Sbjct: 241 GNGGTIFDSGTTITRLEARAYTAVRDAFRAATMHLTSAADFKIFDTCYDFTGMNSISVPT 300

Query: 428 VSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNR 487
           V+ HF     + LP  NY++PV +   FCFAFA  S   S+IGNVQQQ  RV +D  + +
Sbjct: 301 VTFHFQGDVDMRLPPSNYIVPVSNNNIFCFAFA-ASMGPSVIGNVQQQSFRVIYDNVHKQ 359

Query: 488 VGFTPNKC 495
           +G  P++C
Sbjct: 360 IGLLPDQC 367


>gi|414584780|tpg|DAA35351.1| TPA: hypothetical protein ZEAMMB73_696016 [Zea mays]
          Length = 524

 Score =  333 bits (854), Expect = 1e-88,   Method: Compositional matrix adjust.
 Identities = 188/436 (43%), Positives = 259/436 (59%), Gaps = 38/436 (8%)

Query: 81  SLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQIL 140
           SL L  R+ +  + +   R  VL  + RD+AR   L T+L  A                 
Sbjct: 105 SLALVRRDEVTGSTYPSLRHAVLDLVARDNARAEYLATRLSPAYQ--------------- 149

Query: 141 PEDFS---TPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQS 197
           P  FS   + VVSG  +GSGEY  R+ VG+PP +  +V+D+GSD+ W+QC+PC ECY Q+
Sbjct: 150 PPGFSGSESKVVSGLDEGSGEYLVRVSVGSPPTEQYLVVDSGSDVMWVQCKPCLECYVQA 209

Query: 198 DPIFDPKTSSSYSPLPCAAPQCKSLDVSAC---RANRCLYQVAYGDGSFTVGDLVTETVS 254
           DP+FDP TS+++S + C +  C+ L  SAC       C Y+V+Y DGS+T G L  ET++
Sbjct: 210 DPLFDPATSATFSGVSCGSAICRILPTSACGDGELGGCEYEVSYADGSYTKGALALETLT 269

Query: 255 FGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSP 311
            G + +V+G+ +GCGH N GLFVG+AGL+GLG G +SL  Q+      + +YCL  R   
Sbjct: 270 LGGT-AVEGVVIGCGHRNRGLFVGAAGLMGLGWGPMSLVGQLGGEVGGAFSYCLASRGGY 328

Query: 312 ASG---------VLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLF 362
            SG         VL  + A    AV  PL+RN +  +FYYVGL+G  VG + + +   LF
Sbjct: 329 GSGAADDDAGWLVLGRSEAVPEGAVWVPLVRNPRAPSFYYVGLSGIEVGDERLPLQAGLF 388

Query: 363 EMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFV-RLAGNLKPTSGV--ALFDTCYDFSG 419
           ++ E G G +++D GT +TRL  +AY +LRD+FV  LAG +    GV  ++ DTCYD SG
Sbjct: 389 QLTEDGAGDVVMDTGTTVTRLPQEAYAALRDAFVGALAGAVPRAQGVSSSVLDTCYDLSG 448

Query: 420 LRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRV 479
             SVRVPTVS  F     L L A+N L+ VD  G +C AFAP+SS LSI+GN QQ G ++
Sbjct: 449 YASVRVPTVSFCFDGDARLILAARNVLLEVD-MGIYCLAFAPSSSGLSIMGNTQQAGIQI 507

Query: 480 SFDLANNRVGFTPNKC 495
           + D AN  +GF P  C
Sbjct: 508 TVDSANGYIGFGPANC 523


>gi|302762735|ref|XP_002964789.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
 gi|300167022|gb|EFJ33627.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
          Length = 390

 Score =  325 bits (832), Expect = 4e-86,   Method: Compositional matrix adjust.
 Identities = 183/399 (45%), Positives = 250/399 (62%), Gaps = 18/399 (4%)

Query: 106 LERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGV 165
           +ERD AR+  +  ++Q + +   R       AQ         V SG S GSGEYF+R+G+
Sbjct: 1   MERDEARLRWIHHRIQSSDHRHRRGRSLLQTAQ---------VSSGLSLGSGEYFARMGI 51

Query: 166 GTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVS 225
           G+P R + + LDTGSD+ W+QC PC+ CY Q DPI+DP  SSSY  + C +  C++LD S
Sbjct: 52  GSPQRSYYLELDTGSDVTWIQCAPCSSCYSQVDPIYDPSNSSSYRRVYCGSALCQALDYS 111

Query: 226 ACRANRCLYQVAYGDGSFTVGDLVTETVSFG--NSGSVKGIALGCGHDNEGLFVGSAGLL 283
           AC+   C Y+V YGD S + GDL  E+   G  +S +++ IA GCGH N GLF G AGLL
Sbjct: 112 ACQGMGCSYRVVYGDSSASSGDLGIESFYLGPNSSTAMRNIAFGCGHSNSGLFRGEAGLL 171

Query: 284 GLGGGMLSLTKQIKAT---SLAYCLVDR----DSPASGVLEFNSARGGDAVTAPLIRNKK 336
           G+GGG LS   QI A+   + +YCLVDR     S +S ++   +A    A   PL++N +
Sbjct: 172 GMGGGTLSFFSQIAASIGPAFSYCLVDRYSQLQSRSSPLIFGRTAIPFAARFTPLLKNPR 231

Query: 337 VDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFV 396
           +DTFYY  LTG SVGG A+ IPP+ F +   G GG I+D GT++TR+   AY  LRD++ 
Sbjct: 232 IDTFYYAILTGISVGGTALPIPPAQFALTGNGTGGAILDSGTSVTRVVPAAYAVLRDAYR 291

Query: 397 RLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFC 456
             + NL P  GV L DTC++F GL +V++P++ LHF     + LP  N LIPVD +GTFC
Sbjct: 292 AASRNLPPAPGVYLLDTCFNFQGLPTVQIPSLVLHFDNDVDMVLPGGNILIPVDRSGTFC 351

Query: 457 FAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            AFAP+S  +S+IGNVQQQ  R+ FDL  + +   P +C
Sbjct: 352 LAFAPSSMPISVIGNVQQQTFRIGFDLQRSLIAIAPREC 390


>gi|308044575|ref|NP_001183392.1| uncharacterized protein LOC100501808 [Zea mays]
 gi|238011188|gb|ACR36629.1| unknown [Zea mays]
          Length = 342

 Score =  320 bits (819), Expect = 2e-84,   Method: Compositional matrix adjust.
 Identities = 182/345 (52%), Positives = 219/345 (63%), Gaps = 26/345 (7%)

Query: 174 MVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSAC--RANR 231
           MVLDTGSD+ W+QC PC  CY+QS P+FDP+ SSSY  + C A  C+ LD   C  R   
Sbjct: 1   MVLDTGSDVVWVQCAPCRRCYEQSGPVFDPRRSSSYGAVGCGAALCRRLDSGGCDLRRGA 60

Query: 232 CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLS 291
           C+YQVAYGDGS T GD VTET++F     V  +ALGCGHDNEGLFV +AGLLGLG G LS
Sbjct: 61  CMYQVAYGDGSVTAGDFVTETLTFAGGARVARVALGCGHDNEGLFVAAAGLLGLGRGGLS 120

Query: 292 LTKQIK---ATSLAYCLVDRDSPASGV---------LEFNSARGG--DAVTAPLIRNKKV 337
              QI      S +YCLVDR S  +G          + F +   G   A   P++RN ++
Sbjct: 121 FPTQISRRYGRSFSYCLVDRTSSGAGAAPGSHRSSTVSFGAGSVGASSASFTPMVRNPRM 180

Query: 338 DTFYYVGLTGFSVGGQAVQIPPSLFEMD-----EAGDGGIIVDCGTAITRLQTQAYNSLR 392
           +TFYYV L G SVGG  V   P + E D       G GG+IVD GT++TRL   +Y++LR
Sbjct: 181 ETFYYVQLVGISVGGARV---PGVAESDLRLDPSTGRGGVIVDSGTSVTRLARASYSALR 237

Query: 393 DSF-VRLAGNLKPT-SGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVD 450
           D+F    AG L+ +  G +LFDTCYD  G R V+VPTVS+HF  G    LP +NYLIPVD
Sbjct: 238 DAFRAAAAGGLRLSPGGFSLFDTCYDLGGRRVVKVPTVSMHFAGGAEAALPPENYLIPVD 297

Query: 451 SAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           S GTFCFAFA T   +SIIGN+QQQG RV FD    RVGF P  C
Sbjct: 298 SRGTFCFAFAGTDGGVSIIGNIQQQGFRVVFDGDGQRVGFAPKGC 342


>gi|357521081|ref|XP_003630829.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355524851|gb|AET05305.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 526

 Score =  316 bits (809), Expect = 2e-83,   Method: Compositional matrix adjust.
 Identities = 176/468 (37%), Positives = 249/468 (53%), Gaps = 49/468 (10%)

Query: 23  TSASSRGLSETATTVLDVSSALQQTEHILSFEPETLEPFAEESETAAESFPLNSSSSFSL 82
           TS ++     + T++ +V++A  +T  ILSF+P     F  + E        +S      
Sbjct: 98  TSKANSSSEYSITSIFNVTAANHKTSQILSFKP-----FHNQEEFPQTFSSSSSFKLKLY 152

Query: 83  PLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPE 142
           P  S    H  +H +Y SL                                         
Sbjct: 153 PAASLYNTHH-QHKNYYSL----------------------------------------- 170

Query: 143 DFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFD 202
           D +  +  G + G+  +  +IGVG PP++F M+ D  +D  WLQC+PC +CY Q D IFD
Sbjct: 171 DLNASLNPGITTGTSNFLVQIGVGGPPQKFYMIFDLQTDFTWLQCQPCIKCYDQPDSIFD 230

Query: 203 PKTSSSYSPLPCAAPQCKSLDVSACRAN-RCLYQVAYGDGSFTVGDLVTETVSFGNSGSV 261
           P  SSSY+ L C    C  L  S+C  +  C Y + Y DG+ T G L+ ETVSF +SG V
Sbjct: 231 PSQSSSYTLLSCETKHCNLLPNSSCSDDGYCRYNITYKDGTNTEGVLINETVSFESSGWV 290

Query: 262 KGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVD-RDSPASGVLEFNS 320
             ++LGC + N+G FVGS G  GLG G LS   +I A+S++YCLV+ +D  +S  LEFNS
Sbjct: 291 DRVSLGCSNKNQGPFVGSDGTFGLGRGSLSFPSRINASSMSYCLVESKDGYSSSTLEFNS 350

Query: 321 ARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAI 380
                +V A L++N K +  YYVGL G  VGG+ + +P S F +D  G+GG+IV   + I
Sbjct: 351 PPCSGSVKAKLLQNPKAENLYYVGLKGIKVGGEKIDVPNSTFTIDPYGNGGMIVSSSSLI 410

Query: 381 TRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDL 440
           T L+   YN +RD+FV    +L+       FDTCY+ S   +V +P +      GK+  L
Sbjct: 411 TMLENDTYNVVRDAFVAKTQHLERLKAFLQFDTCYNLSSNNTVELPILEFEVNDGKSWLL 470

Query: 441 PAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRV 488
           P ++YL  VD  GTFCFAFAP+  + SI+G +QQ GTRV+FDL N+ V
Sbjct: 471 PKESYLYAVDKNGTFCFAFAPSKGSFSILGTLQQYGTRVTFDLVNSFV 518


>gi|302756591|ref|XP_002961719.1| hypothetical protein SELMODRAFT_64161 [Selaginella moellendorffii]
 gi|300170378|gb|EFJ36979.1| hypothetical protein SELMODRAFT_64161 [Selaginella moellendorffii]
          Length = 357

 Score =  315 bits (808), Expect = 3e-83,   Method: Compositional matrix adjust.
 Identities = 172/357 (48%), Positives = 233/357 (65%), Gaps = 9/357 (2%)

Query: 148 VVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSS 207
           + SG S GSGEYF+R+G+G P R + + LDTGSD+ W+QC PC+ CY Q DPI+DP  SS
Sbjct: 1   ISSGLSLGSGEYFARMGIGNPQRSYYLELDTGSDVTWIQCAPCSSCYSQVDPIYDPSNSS 60

Query: 208 SYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFG--NSGSVKGIA 265
           SY  + C +  C++LD SAC+   C Y+V YGD S + GDL  E+   G  +S +++ IA
Sbjct: 61  SYRRVYCGSALCQALDYSACQGMGCSYRVVYGDSSASSGDLGIESFYLGPNSSTAMRNIA 120

Query: 266 LGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDR----DSPASGVLEF 318
            GCGH N GLF G AGLLG+GGG LS   QI A+   + +YCLVDR     S +S ++  
Sbjct: 121 FGCGHSNSGLFRGEAGLLGMGGGTLSFFSQIAASIGPAFSYCLVDRYSQLQSRSSPLIFG 180

Query: 319 NSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGT 378
            +A    A   PL++N +++TFYY  LTG SVGG  + IPP+ F +   G GG I+D GT
Sbjct: 181 RTAIPFAARFTPLLKNPRINTFYYAVLTGISVGGTPLPIPPAQFALTGNGTGGAILDSGT 240

Query: 379 AITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKAL 438
           ++TR+   AY  LRD++   + NL P  GV L DTC++F GL +V++P++ LHF  G  +
Sbjct: 241 SVTRVVPPAYAVLRDAYRAASRNLPPAPGVYLLDTCFNFQGLPTVQIPSLVLHFDNGVDM 300

Query: 439 DLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            LP  N LIPVD +GTFC AFAP+S  +S+IGNVQQQ  R+ FDL  + +   P +C
Sbjct: 301 VLPGGNILIPVDRSGTFCLAFAPSSMPISVIGNVQQQTFRIGFDLQRSLIAIAPREC 357


>gi|357166728|ref|XP_003580821.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 479

 Score =  311 bits (797), Expect = 6e-82,   Method: Compositional matrix adjust.
 Identities = 183/451 (40%), Positives = 257/451 (56%), Gaps = 33/451 (7%)

Query: 65  SETAAESFPLNSSSSFSLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAI 124
           +  AA S P +++   SL L  R+ +  T+H   R  VL+   RD+ARV  L  +L  + 
Sbjct: 42  TAAAAPSVPSSTTRRPSLQLLHRDTVSGTKHPSRRHAVLALASRDTARVAYLQRRLSPSP 101

Query: 125 YNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINW 184
                  ++     +             S GSGEY  R+G+G+PP +  +V DTGSD+ W
Sbjct: 102 SPSSTSSVESGGTIV-------------SHGSGEYLVRVGIGSPPLEQHLVADTGSDVIW 148

Query: 185 LQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKS-----LDVSACRANRCLYQVAYG 239
           +QC PC++CY Q DP+FDP  S+S+SP+PC +  C++               C Y+V+YG
Sbjct: 149 VQCSPCSDCYAQGDPLFDPANSASFSPVPCNSGVCRAAARYSSSSCGGGGGECEYKVSYG 208

Query: 240 DGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQI--- 296
           D S+T G L  ET++      V+G+A+GCGH+N GLF  +AGLLGLG G +SL  Q+   
Sbjct: 209 DKSYTNGVLALETLTLDGGTEVQGVAMGCGHENRGLFAEAAGLLGLGWGPMSLVGQLGGA 268

Query: 297 KATSLAYCLVDRDSPASG-----VLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVG 351
              + +YCL    S         VL    A    AV  PL+RN    +FYYVG+ G  V 
Sbjct: 269 AGGAFSYCLAGYYSGEGSGSGSLVLGREDAAPTGAVWVPLVRNPDAPSFYYVGVNGLGVA 328

Query: 352 GQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTS-GVAL 410
           G+ +Q+   LF++ + G GG+++D GTA+TRL  +AY +LR +F        P + GV+L
Sbjct: 329 GERLQLQDGLFDLGDDGGGGVVMDTGTAVTRLPAEAYAALRGAFAGAFEEGAPRAPGVSL 388

Query: 411 FDTCYDFSGLRSVRVPTVSLHFGA------GKALDLPAKNYLIPVDSAGTFCFAFAPTSS 464
           FDTCYD SG  SVRVPTV+L+FG         +L LPA+N L+PVD  GT+C AFA  +S
Sbjct: 389 FDTCYDLSGYASVRVPTVALYFGGGGQGQEAASLTLPARNLLVPVDDGGTYCLAFAAVAS 448

Query: 465 ALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
             SI+GN+QQQG  ++ D A+  VGF P  C
Sbjct: 449 GPSILGNIQQQGIEITVDSASGYVGFGPATC 479


>gi|242041575|ref|XP_002468182.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
 gi|241922036|gb|EER95180.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
          Length = 490

 Score =  300 bits (769), Expect = 9e-79,   Method: Compositional matrix adjust.
 Identities = 193/483 (39%), Positives = 260/483 (53%), Gaps = 37/483 (7%)

Query: 37  VLDVSSALQQTEHILSFEPETLEPFAEESETAAESFPLNSSSSFSLPLHSREILHKTR-- 94
           V+ +++ ++   H  +  P +   ++  +    ++      ++ S  LH R +LH+ R  
Sbjct: 21  VVGLATPVEYEYHSYAVTPLSPHAYSAPAAADDDAQAQEDVAASSSTLHIR-LLHRDRFA 79

Query: 95  -HNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGAS 153
            +     L+  RL+RD  R   +I+K              P         F  PVVS A 
Sbjct: 80  ANATPAQLLARRLQRDVLRAAWIISK------AAANGTPPPVAGLSSARGFVAPVVSRAP 133

Query: 154 QGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLP 213
             SGEY ++I VGTP  +  + LDT SD+ WLQC+PC  CY QS P+FDP+ S+SY  + 
Sbjct: 134 T-SGEYIAKIAVGTPGVEALLALDTASDLTWLQCQPCRRCYPQSGPVFDPRHSTSYREMS 192

Query: 214 CAAPQCKSLDVSA---CRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGH 270
             A  C++L  S     +   C+Y V YGDGS TVGD + ET++F     +  I++GCGH
Sbjct: 193 FNAADCQALGRSGGGDAKRGTCVYTVGYGDGSTTVGDFIEETLTFAGGVRLPRISIGCGH 252

Query: 271 DNEGLF-VGSAGLLGLGGGMLSLTKQIKAT-SLAYCLVDRDSPASGVLEFNSARGGDAVT 328
           DN+GLF   +AG+LGLG G++S   QI    + +YCLVD  S   G L      G  AV 
Sbjct: 253 DNKGLFGAPAAGILGLGRGLMSFPNQIDHNGTFSYCLVDFLS-GPGSLSSTLTFGAGAVD 311

Query: 329 -------APLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMD-----EAGDGGIIVDC 376
                   P + N  + TFYYV LTG SVGG  V   P + E D       G GG+IVD 
Sbjct: 312 TSPPVSFTPTVLNLNMPTFYYVRLTGISVGGVRV---PGVTERDLQLDPYTGRGGVIVDS 368

Query: 377 GTAITRLQTQAYNSLRDSFVRLAGNLKPTS---GVALFDTCYDFSGLRSVRVPTVSLHFG 433
           GTA+TRL   AY + RD+F  +A +L   S       FDTCY   G    +VPTVS+HF 
Sbjct: 369 GTAVTRLARPAYTAFRDAFRAVAVDLGQVSIGGPSGFFDTCYTVGGRGMKKVPTVSMHFA 428

Query: 434 AGKALDLPAKNYLIPVDSAGTFCFAFAPTSS-ALSIIGNVQQQGTRVSFDLANNRVGFTP 492
               + L  KNYLIPVDS GT CFAFA T   ++SIIGN+QQQG R+ +D+   RVGF P
Sbjct: 429 GSVEVKLQPKNYLIPVDSMGTVCFAFAATGDHSVSIIGNIQQQGFRIVYDI-GGRVGFAP 487

Query: 493 NKC 495
           N C
Sbjct: 488 NSC 490


>gi|357130848|ref|XP_003567056.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 448

 Score =  295 bits (756), Expect = 3e-77,   Method: Compositional matrix adjust.
 Identities = 161/367 (43%), Positives = 211/367 (57%), Gaps = 13/367 (3%)

Query: 142 EDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIF 201
           +   +PV+SG    SGEYF+ +GVGTPP    +V+DTGSD+ WLQC+PC  CY+Q  P++
Sbjct: 82  DHLHSPVISGLPFASGEYFASVGVGTPPTPALLVIDTGSDVVWLQCKPCVHCYRQLSPLY 141

Query: 202 DPKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSV 261
           DP+ SS+Y+  PC+ PQC++          C Y++ YGD S T G+L T+ + F N  SV
Sbjct: 142 DPRGSSTYAQTPCSPPQCRNPQTCDGTTGGCGYRIVYGDASSTSGNLATDRLVFSNDTSV 201

Query: 262 KGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIK---ATSLAYCLVDRDSPASG---- 314
             + LGCGHDNEGLF  +AGLLG+  G  S   Q+        AYCL DR    S     
Sbjct: 202 GNVTLGCGHDNEGLFGSAAGLLGVARGNNSFATQVADSYGRYFAYCLGDRTRSGSSSSYL 261

Query: 315 VLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQ-IPPSLFEMDEA-GDGGI 372
           V    +     +V  PL  N +  + YYV + GFSVGG+ V     +   +D A G GG+
Sbjct: 262 VFGRTAPEPPSSVFTPLRSNPRRPSLYYVDMVGFSVGGEPVTGFSNASLSLDPATGRGGV 321

Query: 373 IVDCGTAITRLQTQAYNSLRDSFVRLA---GNLKPTSGVALFDTCYDFSGLRSVRVPTVS 429
           +VD GT+ITR    AY +LRD+F   A   G  K   G+++FD CYD  G+     P V 
Sbjct: 322 VVDSGTSITRFARDAYGALRDAFDARAAKVGMRKVGRGISVFDACYDLRGVAVADAPGVV 381

Query: 430 LHFGAGKALDLPAKNYLIPVDSAGTFCFAF-APTSSALSIIGNVQQQGTRVSFDLANNRV 488
           LHF  G  + LP +NYL+P +S    CFA  A     LS+IGNV QQ  RV FD+ N RV
Sbjct: 382 LHFAGGADVALPPENYLVPEESGRYHCFALEAAGHDGLSVIGNVLQQRFRVVFDVENERV 441

Query: 489 GFTPNKC 495
           GF PN C
Sbjct: 442 GFEPNGC 448


>gi|242084330|ref|XP_002442590.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
 gi|241943283|gb|EES16428.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
          Length = 494

 Score =  291 bits (745), Expect = 5e-76,   Method: Compositional matrix adjust.
 Identities = 197/489 (40%), Positives = 265/489 (54%), Gaps = 42/489 (8%)

Query: 35  TTVLDVSSALQQTEHILSFEPETLEPFAEESETAAESFPLNSSSSFSLPLHSREILHKTR 94
           T V+ +++ ++   H     P +  P++  +  A ++F ++SSS+  + L  R+      
Sbjct: 20  TAVVGLATPVEYEYHSYVVTPLSPHPYSAPA-AADDNFSVSSSSALHIHLLHRDSF--AV 76

Query: 95  HNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQ 154
           +     L+  RL+RD  R   +I+K              P            PVVS A  
Sbjct: 77  NATAAELLARRLQRDELRAAWIISKAAA------NGTPPPVVGLSTGRGLVAPVVSRAPT 130

Query: 155 GSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPC 214
            SGEY ++I VGTP  Q  + LDT SD+ WLQC+PC  CY QS P+FDP+ S+SY  +  
Sbjct: 131 -SGEYMAKIAVGTPAVQALLALDTASDLTWLQCQPCRRCYPQSGPVFDPRHSTSYGEMNY 189

Query: 215 AAPQCKSLDVSA---CRANRCLYQVAYGDG----SFTVGDLVTETVSFGNSGSVKGIALG 267
            AP C++L  S     +   C+Y V YGDG    S +VGDLV ET++F        +++G
Sbjct: 190 DAPDCQALGRSGGGDAKRGTCIYTVQYGDGHGSTSTSVGDLVEETLTFAGGVRQAYLSIG 249

Query: 268 CGHDNEGLF-VGSAGLLGLGGGMLSLTKQIK----ATSLAYCLVD----RDSPASGVLEF 318
           CGHDN+GLF   +AG+LGLG G +S+  QI       S +YCLVD      SP+S  L F
Sbjct: 250 CGHDNKGLFGAPAAGILGLGRGQISIPHQIAFLGYNASFSYCLVDFISGPGSPSS-TLTF 308

Query: 319 NSARGGDAVTA---PLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMD-----EAGDG 370
            +     +  A   P + N+ + TFYYV L G SVGG  V   P + E D       G G
Sbjct: 309 GAGAVDTSPPASFTPTVLNQNMPTFYYVRLIGVSVGGVRV---PGVTERDLQLDPYTGRG 365

Query: 371 GIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSG---VALFDTCYDFSGLRSVRVPT 427
           G+I+D GT +TRL   AY + RD+F   A +L   S      LFDTCY   G   V+VP 
Sbjct: 366 GVILDSGTTVTRLARPAYVAFRDAFRAAATSLGQVSTGGPSGLFDTCYTVGGRAGVKVPA 425

Query: 428 VSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSS-ALSIIGNVQQQGTRVSFDLANN 486
           VS+HF  G  + L  KNYLIPVDS GT CFAFA T   ++S+IGN+ QQG RV +DLA  
Sbjct: 426 VSMHFAGGVEVSLQPKNYLIPVDSRGTVCFAFAGTGDRSVSVIGNILQQGFRVVYDLAGQ 485

Query: 487 RVGFTPNKC 495
           RVGF PN C
Sbjct: 486 RVGFAPNNC 494


>gi|125571060|gb|EAZ12575.1| hypothetical protein OsJ_02481 [Oryza sativa Japonica Group]
          Length = 501

 Score =  288 bits (738), Expect = 3e-75,   Method: Compositional matrix adjust.
 Identities = 185/375 (49%), Positives = 214/375 (57%), Gaps = 28/375 (7%)

Query: 144 FSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDP 203
           F  PVVSG +QGSGEYF++IGVGTP     MVLDTGSD+ WLQC PC  CY QS  +FDP
Sbjct: 132 FVAPVVSGLAQGSGEYFTKIGVGTPVTPALMVLDTGSDVVWLQCAPCRRCYDQSGQMFDP 191

Query: 204 KTSSSYSPLPCAAPQCKSLDVSAC--RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSV 261
           + S SY  + CAAP C+ LD   C  R   CLYQVAYGDGS T GD  TET++F +   V
Sbjct: 192 RASHSYGAVDCAAPLCRRLDSGGCDLRRKACLYQVAYGDGSVTAGDFATETLTFASGARV 251

Query: 262 KGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIK---ATSLAYCLVDRDSPASGV--- 315
             +ALGCGHDNEGLFV +AGLLGLG G LS   QI      S +YCLVDR S ++     
Sbjct: 252 PRVALGCGHDNEGLFVAAAGLLGLGRGSLSFPSQISRRFGRSFSYCLVDRTSSSASATSR 311

Query: 316 ---LEFNS-ARG--GDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQI------PPSLFE 363
              + F S ARG  G  V  P     +          G     +A         PP    
Sbjct: 312 SSTVTFGSGARGALGRRVLHPDGEEPQDGDVLLRAAHGHQRRRRARPGRGRVRPPPD--- 368

Query: 364 MDEAGDGGIIVDCGT---AITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGL 420
               G GG+IVD G    A  R       + R         L P  G +LFDTCYD SGL
Sbjct: 369 -PSTGRGGVIVDSGRPSPAWARAGRTPPCATRSRAAAAGLRLSP-GGFSLFDTCYDLSGL 426

Query: 421 RSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVS 480
           + V+VPTVS+HF  G    LP +NYLIPVDS GTFCFAFA T   +SIIGN+QQQG RV 
Sbjct: 427 KVVKVPTVSMHFAGGAEAALPPENYLIPVDSRGTFCFAFAGTDGGVSIIGNIQQQGFRVV 486

Query: 481 FDLANNRVGFTPNKC 495
           FD    R+GF P  C
Sbjct: 487 FDGDGQRLGFVPKGC 501


>gi|242089069|ref|XP_002440367.1| hypothetical protein SORBIDRAFT_09g030430 [Sorghum bicolor]
 gi|241945652|gb|EES18797.1| hypothetical protein SORBIDRAFT_09g030430 [Sorghum bicolor]
          Length = 462

 Score =  288 bits (737), Expect = 4e-75,   Method: Compositional matrix adjust.
 Identities = 180/404 (44%), Positives = 227/404 (56%), Gaps = 48/404 (11%)

Query: 101 LVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYF 160
           L+  RL RD+AR   +     ++  NV R              FS PVVSG +QGSGEYF
Sbjct: 98  LLAHRLARDAARAEAI----SVSARNVTRAG----------GGFSAPVVSGLAQGSGEYF 143

Query: 161 SRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCK 220
           + +GVGTPP    +VLDTGSD+ WLQC PC +CY QS  +FDP+ S SY+ + C AP C+
Sbjct: 144 ASVGVGTPPTPALLVLDTGSDVVWLQCAPCRQCYAQSGRVFDPRRSRSYAAVRCGAPPCR 203

Query: 221 SLDVSACRANR-----CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGL 275
            LD             CLYQVAYGDGS T GDL TET+ F     V  +A+GCGHDNEGL
Sbjct: 204 GLDAGGGGGCDRRRGTCLYQVAYGDGSVTAGDLATETLWFARGARVPRVAVGCGHDNEGL 263

Query: 276 FVGSAGLLGLGGGMLSLTKQIK---ATSLAYCLVDRDSPASGVLEFNSARGGDAVTAPLI 332
           FV +AGLLGLG G LSL  Q         +YC                 +G D     +I
Sbjct: 264 FVAAAGLLGLGRGRLSLPTQTARRYGRRFSYCF----------------QGSDLDHRTII 307

Query: 333 RNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLR 392
           R       +  G     VG +++++ PS       G GG+I+D GT++TRL    Y ++R
Sbjct: 308 RTVHQ---HVGGARVRGVGERSLRLDPS------TGRGGVILDSGTSVTRLARPVYVAVR 358

Query: 393 DSFVRLAGNLK-PTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDS 451
           ++F   AG L+    G +LFDTCYD  G R V+VPTVS+H   G  + LP +NYLIPVD+
Sbjct: 359 EAFRAAAGGLRLAPGGFSLFDTCYDLRGRRVVKVPTVSVHLAGGAEVALPPENYLIPVDT 418

Query: 452 AGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            GTFC A A T   +SI+GN+QQQG RV FD    RV   P  C
Sbjct: 419 RGTFCLALAGTDGGVSIVGNIQQQGFRVVFDGDRQRVALVPKSC 462


>gi|116787398|gb|ABK24493.1| unknown [Picea sitchensis]
          Length = 479

 Score =  288 bits (737), Expect = 5e-75,   Method: Compositional matrix adjust.
 Identities = 166/416 (39%), Positives = 225/416 (54%), Gaps = 33/416 (7%)

Query: 90  LHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVV 149
           L     + +  +V    +RD+ R+NT+ +K       +    L+P               
Sbjct: 85  LRPINSSSWIDMVSQSFDRDNDRLNTIWSKNNGTYSTMSNLPLQP--------------- 129

Query: 150 SGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSY 209
            G+  G+G Y    G GTP +   +++DTGSD+ W+QC+PC++CY Q DPIF+P+ SSSY
Sbjct: 130 -GSKVGTGNYIVTAGFGTPAKNSLLIIDTGSDVTWIQCKPCSDCYSQVDPIFEPQQSSSY 188

Query: 210 SPLPCAAPQCKSL-DVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGC 268
             L C +  C  L  ++ CR   C+Y++ YGDGS + GD   ET++ G S S    A GC
Sbjct: 189 KHLSCLSSACTELTTMNHCRLGGCVYEINYGDGSRSQGDFSQETLTLG-SDSFPSFAFGC 247

Query: 269 GHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASGVLEFNSARGGD 325
           GH N GLF GSAGLLGLG   LS   Q K+      +YCL D  S  S    F+  +G  
Sbjct: 248 GHTNTGLFKGSAGLLGLGRTALSFPSQTKSKYGGQFSYCLPDFVSSTS-TGSFSVGQGSI 306

Query: 326 AVTA---PLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITR 382
             TA   PL+ N    +FY+VGL G SVGG+ + IPP++      G GG IVD GT ITR
Sbjct: 307 PATATFVPLVSNSNYPSFYFVGLNGISVGGERLSIPPAVL-----GRGGTIVDSGTVITR 361

Query: 383 LQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPA 442
           L  QAY++L+ SF     NL      ++ DTCYD S    VR+PT++ HF     + + A
Sbjct: 362 LVPQAYDALKTSFRSKTRNLPSAKPFSILDTCYDLSSYSQVRIPTITFHFQNNADVAVSA 421

Query: 443 KNYLIPVDSAGT-FCFAFAPTSSALS--IIGNVQQQGTRVSFDLANNRVGFTPNKC 495
              L  + S G+  C AFA  S ++S  IIGN QQQ  RV+FD    R+GF P  C
Sbjct: 422 VGILFTIQSDGSQVCLAFASASQSISTNIIGNFQQQRMRVAFDTGAGRIGFAPGSC 477


>gi|222632750|gb|EEE64882.1| hypothetical protein OsJ_19741 [Oryza sativa Japonica Group]
          Length = 456

 Score =  287 bits (734), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 173/366 (47%), Positives = 225/366 (61%), Gaps = 30/366 (8%)

Query: 144 FSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCR---PCTECYQQSDPI 200
           F+ P++SG  QG+GEYF+++GVGTP     MVLDTGSD+ W   R   P     +Q    
Sbjct: 107 FAAPLLSGLPQGTGEYFAQVGVGTPATTALMVLDTGSDVVWAPVRALPPLLRAVRQGS-- 164

Query: 201 FDPKTSSSYSPLP---CAAPQCKSLDVSAC--RANRCLYQVAYGDGSFTVGDLVTETVSF 255
               T ++ +P P   C AP C+ LD + C  R N CLYQVAYGDGS T GD  +ET++F
Sbjct: 165 ---STGAAPAPTPRWNCVAPICRRLDSAGCDRRRNSCLYQVAYGDGSVTAGDFASETLTF 221

Query: 256 GNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPA 312
                V+ +A+GCGHDNEGLF+ ++GLLGLG G LS   QI  +   S +YCLVDR S  
Sbjct: 222 ARGARVQRVAIGCGHDNEGLFIAASGLLGLGRGRLSFPSQIARSFGRSFSYCLVDRTSSR 281

Query: 313 SGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQ-IPPSLFEMD-EAGDG 370
                 +   GG           ++ TFYYV L GFSVGG  V+ +  S   ++   G G
Sbjct: 282 R--ARPSRRWGG---------TPRMATFYYVHLLGFSVGGARVKGVSQSDLRLNPTTGRG 330

Query: 371 GIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPT-SGVALFDTCYDFSGLRSVRVPTVS 429
           G+I+D GT++TRL    Y ++RD+F   A  L+ +  G +LFDTCY+ SG R V+VPTVS
Sbjct: 331 GVILDSGTSVTRLARPVYEAVRDAFRAAAVGLRVSPGGFSLFDTCYNLSGRRVVKVPTVS 390

Query: 430 LHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVG 489
           +H   G ++ LP +NYLIPVD++GTFCFA A T   +SIIGN+QQQG RV FD    RVG
Sbjct: 391 MHLAGGASVALPPENYLIPVDTSGTFCFAMAGTDGGVSIIGNIQQQGFRVVFDGDAQRVG 450

Query: 490 FTPNKC 495
           F P  C
Sbjct: 451 FVPKSC 456


>gi|414880708|tpg|DAA57839.1| TPA: hypothetical protein ZEAMMB73_997765 [Zea mays]
          Length = 452

 Score =  286 bits (731), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 164/373 (43%), Positives = 211/373 (56%), Gaps = 21/373 (5%)

Query: 144 FSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDP 203
             +PV+SG    SGEYF+ IGVG PP    +V+DTGSD+ WLQC PC  CY+Q  P++DP
Sbjct: 77  LRSPVMSGVPFDSGEYFAVIGVGDPPTHALVVIDTGSDLIWLQCLPCRRCYRQVTPLYDP 136

Query: 204 KTSSSYSPLPCAAPQCKS-LDVSAC--RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGS 260
           + S ++  +PCA+PQC+  L    C  R   C+Y V YGDGS + GDL T+T+   +   
Sbjct: 137 RNSKTHRRIPCASPQCRGVLRYPGCDARTGGCVYMVVYGDGSASSGDLATDTLVLPDDTR 196

Query: 261 VKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIK---ATSLAYCLVDRDSPA---SG 314
           V  + LGCGHDNEGL   +AGLLG G G LS   Q+        +YCL DR S A   S 
Sbjct: 197 VHNVTLGCGHDNEGLLASAAGLLGAGRGQLSFPTQLAPAYGHVFSYCLGDRMSRARNSSS 256

Query: 315 VLEF-NSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQ--IPPSLFEMDEAGDGG 371
            L F  +         PL  N +  + YYV + GFSVGG+ V      SL      G GG
Sbjct: 257 YLVFGRTPELPSTAFTPLRTNPRRPSLYYVDMVGFSVGGERVAGFSNASLALNPATGRGG 316

Query: 372 IIVDCGTAITRLQTQAYNSLRDSFVR---LAGNLKPTSGVALFDTCYDFSGL---RSVRV 425
           ++VD GTAI+R    AY ++RD+FV     AG  +  +  ++FDTCYD  G      VRV
Sbjct: 317 VVVDSGTAISRFTRDAYAAVRDAFVSHAAAAGMRRLRNKFSVFDTCYDVHGNGPGTGVRV 376

Query: 426 PTVSLHFGAGKALDLPAKNYLIPV---DSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFD 482
           P++ LHF A   + LP  NYLIPV   D    FC         L+++GNVQQQG  V FD
Sbjct: 377 PSIVLHFAAAADMALPQANYLIPVVGGDRRTYFCLGLQAADDGLNVLGNVQQQGFGVVFD 436

Query: 483 LANNRVGFTPNKC 495
           +   R+GFTPN C
Sbjct: 437 VERGRIGFTPNGC 449


>gi|449453902|ref|XP_004144695.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-1-like, partial [Cucumis sativus]
          Length = 716

 Score =  278 bits (712), Expect = 4e-72,   Method: Compositional matrix adjust.
 Identities = 170/449 (37%), Positives = 241/449 (53%), Gaps = 40/449 (8%)

Query: 71  SFPLNSSSSFS---------LPLHSREILHKTRHNDY-RSLV-LSRLERDSARVNTLITK 119
           +FP   +SS S         LP H   +  + +H D+ ++L    RL R  AR    + +
Sbjct: 281 TFPSTPNSSLSRRALQKPNKLPSHGFRV--RLKHVDHVKNLTRFERLRRGVARGKNRLHR 338

Query: 120 LQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTG 179
           L   +       L  A A +  +    PVV+G    +GE+  ++ +G+PPR FS ++DTG
Sbjct: 339 LNAMV-------LAAANATV-GDQVKAPVVAG----NGEFLMKLAIGSPPRSFSAIMDTG 386

Query: 180 SDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYG 239
           SD+ W QC+PC +C+ QS PIFDPK SSS+  + C++  C +L  S C ++ C Y   YG
Sbjct: 387 SDLIWTQCKPCQQCFDQSTPIFDPKQSSSFYKISCSSELCGALPTSTCSSDGCEYLYTYG 446

Query: 240 DGSFTVGDLVTETVSFGNSG----SVKGIALGCGHDNEGL-FVGSAGLLGLGGGMLSLTK 294
           D S T G L  ET +FG+S     S+ G+  GCG+DN G  F   AGL+GLG G LSL  
Sbjct: 447 DSSSTQGVLAFETFTFGDSTEDQISIPGLGFGCGNDNNGDGFSQGAGLVGLGRGPLSLVS 506

Query: 295 QIKATSLAYCL--VDRDSPASGVL----EFNSARGGDAV-TAPLIRNKKVDTFYYVGLTG 347
           Q+K    AYCL  +D   P+S +L            D + T PLI+N    +FYY+ L G
Sbjct: 507 QLKEQKFAYCLTAIDDSKPSSLLLGSLANITPKTSKDEMKTTPLIKNPSQPSFYYLSLQG 566

Query: 348 FSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSG 407
            SVGG  + IP S FE+ + G GG+I+D GT IT ++  A+ SL++ F+         SG
Sbjct: 567 ISVGGTQLSIPKSTFELHDDGSGGVIIDSGTTITYVENSAFTSLKNEFIAQMNLPVDDSG 626

Query: 408 VALFDTCYDF-SGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSAL 466
               D C++  +G   V VP ++ HF  G  L+LP +NY+I    AG  C A   +S  +
Sbjct: 627 TGGLDLCFNLPAGTNQVEVPKLTFHF-KGADLELPGENYMIGDSKAGLLCLAIG-SSRGM 684

Query: 467 SIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           SI GN+QQQ   V  DL    + F P +C
Sbjct: 685 SIFGNLQQQNFMVVHDLQEETLSFLPTQC 713


>gi|242058537|ref|XP_002458414.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
 gi|241930389|gb|EES03534.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
          Length = 448

 Score =  278 bits (712), Expect = 4e-72,   Method: Compositional matrix adjust.
 Identities = 161/377 (42%), Positives = 214/377 (56%), Gaps = 23/377 (6%)

Query: 142 EDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIF 201
           +   +PV+SG    SGEYF+ I VG PP +  +V+DTGSD+ WLQC PC  CY+Q  P++
Sbjct: 71  DRLRSPVMSGVPFDSGEYFAVINVGDPPTRALVVIDTGSDLIWLQCVPCRHCYRQVTPLY 130

Query: 202 DPKTSSSYSPLPCAAPQCKS-LDVSAC--RANRCLYQVAYGDGSFTVGDLVTETVSFGNS 258
           DP++SS++  +PCA+P+C+  L    C  R   C+Y V YGDGS + GDL T+ + F + 
Sbjct: 131 DPRSSSTHRRIPCASPRCRDVLRYPGCDARTGGCVYMVVYGDGSASSGDLATDRLVFPDD 190

Query: 259 GSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIK---ATSLAYCLVDRDSPA--- 312
             V  + LGCGHDN GL   +AGLLG+G G LS   Q+        +YCL DR S A   
Sbjct: 191 THVHNVTLGCGHDNVGLLESAAGLLGVGRGQLSFPTQLAPAYGHVFSYCLGDRLSRAQNG 250

Query: 313 SGVLEF-NSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQ--IPPSLFEMDEAGD 369
           S  L F  +         PL  N +  + YYV + GFSVGG+ V      SL      G 
Sbjct: 251 SSYLVFGRTPEPPSTAFTPLRTNPRRPSLYYVDMVGFSVGGERVTGFSNASLALNPATGR 310

Query: 370 GGIIVDCGTAITRLQTQAYNSLRDSF---VRLAGNL-KPTSGVALFDTCYDFSG----LR 421
           GGI+VD GTAI+R    AY ++RD+F      AG + K  +  ++FD CYD  G      
Sbjct: 311 GGIVVDSGTAISRFARDAYAAVRDAFDSHAAAAGTMRKLATKFSVFDACYDLRGNGAPAA 370

Query: 422 SVRVPTVSLHFGAGKALDLPAKNYLIPV---DSAGTFCFAFAPTSSALSIIGNVQQQGTR 478
           +VRVP++ LHF  G  + LP  NYLIPV   D    FC         L+++GNVQQQG  
Sbjct: 371 AVRVPSIVLHFAGGADMALPQANYLIPVQGGDRRTYFCLGLQAADDGLNVLGNVQQQGFG 430

Query: 479 VSFDLANNRVGFTPNKC 495
           + FD+   R+GFTPN C
Sbjct: 431 LVFDVERGRIGFTPNGC 447


>gi|449523529|ref|XP_004168776.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
          Length = 461

 Score =  278 bits (711), Expect = 4e-72,   Method: Compositional matrix adjust.
 Identities = 165/429 (38%), Positives = 234/429 (54%), Gaps = 31/429 (7%)

Query: 82  LPLHSREILHKTRHNDY-RSLV-LSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQI 139
           LP H   +  + +H D+ ++L    RL R  AR    + +L   +       L  A A +
Sbjct: 46  LPSHGFRV--RLKHVDHVKNLTRFERLRRGVARGKNRLHRLNAMV-------LAAANATV 96

Query: 140 LPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDP 199
             +    PVV+G    +GE+  ++ +G+PPR FS ++DTGSD+ W QC+PC +C+ QS P
Sbjct: 97  -GDQVKAPVVAG----NGEFLMKLAIGSPPRSFSAIMDTGSDLIWTQCKPCQQCFDQSTP 151

Query: 200 IFDPKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSG 259
           IFDPK SSS+  + C++  C +L  S C ++ C Y   YGD S T G L  ET +FG+S 
Sbjct: 152 IFDPKQSSSFYKISCSSELCGALPTSTCSSDGCEYLYTYGDSSSTQGVLAFETFTFGDST 211

Query: 260 ----SVKGIALGCGHDNEGL-FVGSAGLLGLGGGMLSLTKQIKATSLAYCL--VDRDSPA 312
               S+ G+  GCG+DN G  F   AGL+GLG G LSL  Q+K    AYCL  +D   P+
Sbjct: 212 EDQISIPGLGFGCGNDNNGDGFSQGAGLVGLGRGPLSLVSQLKEQKFAYCLTAIDDSKPS 271

Query: 313 SGVL----EFNSARGGDAV-TAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEA 367
           S +L            D + T PLI+N    +FYY+ L G SVGG  + IP S FE+ + 
Sbjct: 272 SLLLGSLANITPKTSKDEMKTTPLIKNPSQPSFYYLSLQGISVGGTQLSIPKSTFELHDD 331

Query: 368 GDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDF-SGLRSVRVP 426
           G GG+I+D GT IT ++  A+ SL++ F+         SG    D C++  +G   V VP
Sbjct: 332 GSGGVIIDSGTTITYVENSAFTSLKNEFIAQMNLPVDDSGTGGLDLCFNLPAGTNQVEVP 391

Query: 427 TVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANN 486
            ++ HF  G  L+LP +NY+I    AG  C A   +S  +SI GN+QQQ   V  DL   
Sbjct: 392 KLTFHF-KGADLELPGENYMIGDSKAGLLCLAIG-SSRGMSIFGNLQQQNFMVVHDLQEE 449

Query: 487 RVGFTPNKC 495
            + F P +C
Sbjct: 450 TLSFLPTQC 458


>gi|242084332|ref|XP_002442591.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
 gi|241943284|gb|EES16429.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
          Length = 493

 Score =  276 bits (705), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 190/437 (43%), Positives = 244/437 (55%), Gaps = 44/437 (10%)

Query: 100 SLVLSRLERDSARVNTLITKLQLAIYNVDRHELK-----------------PAEAQILPE 142
           +L +  L RDS  VN   T  QL    + R EL+                 P        
Sbjct: 60  ALHVRLLHRDSFAVNA--TPAQLLARRLQRDELRAAWIIKAAAPAAAANDTPVVGLSSGG 117

Query: 143 DFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFD 202
            F  PVVS A   SGEY ++I VGTP  +  + +DTGSDI WLQC+PC  CY QS P+FD
Sbjct: 118 AFVAPVVSRAPTTSGEYMAKIAVGTPAVEALLAMDTGSDITWLQCQPCRRCYPQSGPVFD 177

Query: 203 PKTSSSYSPLPCAAPQCKSLDVSA---CRANRCLYQVAYG-DGSFTVGDLVTETVSFGNS 258
           P+ S+SY  +   AP C++L  S     +   C+Y V YG DGS TVGD + ET++F   
Sbjct: 178 PRHSTSYREMGYDAPDCQALGRSGGGDAKRMTCVYAVGYGDDGSTTVGDFIEETLTFAGG 237

Query: 259 GSVKGIALGCGHDNEGLFVG-SAGLLGLGGGMLSLTKQIKA-----TSLAYCLVDRDSPA 312
             V  +++GCGHDN+GLF   +AG+LGLG G +S   QI A     TS +YCL D    +
Sbjct: 238 VQVPHMSIGCGHDNKGLFAAPAAGILGLGRGQISCPSQIAALGYNVTSFSYCLADFFLSS 297

Query: 313 SGVLEFNSARGGDAVTA--------PLIRNKKVDTFYYVGLTGFSVGGQAVQIP-PSLFE 363
            G    ++   GD   A        P ++N  + TFYYV L G SVGG  V        +
Sbjct: 298 PGRSVSSTLTIGDGAAAGSPPPSFTPTVQNLNMATFYYVRLVGVSVGGVRVPGVTEDDLK 357

Query: 364 MDE-AGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTS---GVALFDTCYDFSG 419
           +D   G GG+I+D GTA+TRL  +AY + RD+F   A +L   S       FDTCY   G
Sbjct: 358 LDPYTGRGGVILDSGTAVTRLARRAYIAFRDAFRAAAVDLGQVSIGGPSGFFDTCYTMGG 417

Query: 420 LRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSS-ALSIIGNVQQQGTR 478
            R+++VPTVS+HF  G  L LP KNYLIPVDS GT CFAFA T   ++SIIGN+QQQG R
Sbjct: 418 -RAMKVPTVSMHFAGGVELTLPPKNYLIPVDSMGTVCFAFAGTGDRSVSIIGNIQQQGFR 476

Query: 479 VSFDLANNRVGFTPNKC 495
           V +++   RVGF PN C
Sbjct: 477 VVYNIGGGRVGFAPNSC 493


>gi|148905906|gb|ABR16115.1| unknown [Picea sitchensis]
          Length = 482

 Score =  275 bits (704), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 166/419 (39%), Positives = 218/419 (52%), Gaps = 35/419 (8%)

Query: 90  LHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVV 149
           L     + +  LV    ERD+AR+NT+ +K       +                 + P+ 
Sbjct: 84  LRPINSSSWIDLVSQSFERDNARLNTIRSKNSGPYTTMS----------------NLPLQ 127

Query: 150 SGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSY 209
           SG + G+G Y    G GTP +   +++DTGSD+ W+QC+PC +CY Q D IF+PK SSSY
Sbjct: 128 SGTTVGTGNYIVTAGFGTPAKNSLLIIDTGSDLTWIQCKPCADCYSQVDAIFEPKQSSSY 187

Query: 210 SPLPCAAPQCKSLDVSA-----CRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGI 264
             LPC +  C  L  S      C    C+Y++ YGDGS + GD   ET++ G S S +  
Sbjct: 188 KTLPCLSATCTELITSESNPTPCLLGGCVYEINYGDGSSSQGDFSQETLTLG-SDSFQNF 246

Query: 265 ALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASGVLEF--N 319
           A GCGH N GLF GS+GLLGLG   LS   Q K+      AYCL D  S  S        
Sbjct: 247 AFGCGHTNTGLFKGSSGLLGLGQNSLSFPSQSKSKYGGQFAYCLPDFGSSTSTGSFSVGK 306

Query: 320 SARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTA 379
            +    AV  PL+ N    TFY+VGL G SVGG  + IPP++      G G  IVD GT 
Sbjct: 307 GSIPASAVFTPLVSNFMYPTFYFVGLNGISVGGDRLSIPPAVL-----GRGSTIVDSGTV 361

Query: 380 ITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALD 439
           ITRL  QAYN+L+ SF     +L      ++ DTCYD S    VR+PT++ HF     + 
Sbjct: 362 ITRLLPQAYNALKTSFRSKTRDLPSAKPFSILDTCYDLSRHSQVRIPTITFHFQNNADVA 421

Query: 440 LPAKNYLIPVDSAGT-FCFAFAPTSS--ALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           +     L+PV + G+  C AFA  S     +IIGN QQQ  RV+FD    R+GF    C
Sbjct: 422 VSDVGILVPVQNGGSQVCLAFASASQMDGFNIIGNFQQQRMRVAFDTGAGRIGFASGSC 480


>gi|297737850|emb|CBI27051.3| unnamed protein product [Vitis vinifera]
          Length = 256

 Score =  275 bits (704), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 139/252 (55%), Positives = 184/252 (73%), Gaps = 1/252 (0%)

Query: 106 LERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGV 165
           ++RD  R+  ++     +I     H +     + + E   TP+VSGASQGSGEYFSR+G+
Sbjct: 1   MDRD-LRLTLMVFHCCKSILATYFHVILLFSIKTIAEALETPLVSGASQGSGEYFSRVGI 59

Query: 166 GTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVS 225
           G+PP+   MV+DTGSD+NW+QC PC +CYQQ+DPIF+P  SSSY+PL C   QCKSLDVS
Sbjct: 60  GSPPKHVYMVVDTGSDVNWVQCAPCADCYQQADPIFEPSFSSSYAPLTCETHQCKSLDVS 119

Query: 226 ACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGL 285
            CR + CLY+V+YGDGS+TVGD  TET++   S S+  +A+GCGHDNEGLFVG+AGLLGL
Sbjct: 120 ECRNDSCLYEVSYGDGSYTVGDFATETITLDGSASLNNVAIGCGHDNEGLFVGAAGLLGL 179

Query: 286 GGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGL 345
           GGG LS   QI A+S +YCLV+RD+ ++  LEFNS     +VTAPL+RN ++DTFYY+G+
Sbjct: 180 GGGSLSFPSQINASSFSYCLVNRDTDSASTLEFNSPIPSHSVTAPLLRNNQLDTFYYLGM 239

Query: 346 TGFSVGGQAVQI 357
           TG     + +QI
Sbjct: 240 TGIGESYKILQI 251


>gi|18461217|dbj|BAB84414.1| chloroplast nucleoid DNA-binding protein cnd41-like [Oryza sativa
           Japonica Group]
          Length = 446

 Score =  273 bits (698), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 172/441 (39%), Positives = 233/441 (52%), Gaps = 46/441 (10%)

Query: 79  SFSLPLHSREILHKTRHNDYR-SLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEA 137
           +  +P+  R+ L        R SL+  RL  D+AR  +L+                    
Sbjct: 26  TLHVPVFHRDALFPPPPGAKRGSLLRQRLAADAARYASLVDATG---------------- 69

Query: 138 QILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQS 197
                   +PV SG    SGEYF+ +GVGTP  +  +V+DTGSD+ WLQC PC  CY Q 
Sbjct: 70  -----RLHSPVFSGIPFESGEYFALVGVGTPSTKAMLVIDTGSDLVWLQCSPCRRCYAQR 124

Query: 198 DPIFDPKTSSSYSPLPCAAPQCKSLDVSACRA-----NRCLYQVAYGDGSFTVGDLVTET 252
             +FDP+ SS+Y  +PC++PQC++L    C +       C Y VAYGDGS + GDL T+ 
Sbjct: 125 GQVFDPRRSSTYRRVPCSSPQCRALRFPGCDSGGAAGGGCRYMVAYGDGSSSTGDLATDK 184

Query: 253 VSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIK---ATSLAYCLVDRD 309
           ++F N   V  + LGCG DNEGLF  +AGLLG+G G +S++ Q+     +   YCL DR 
Sbjct: 185 LAFANDTYVNNVTLGCGRDNEGLFDSAAGLLGVGRGKISISTQVAPAYGSVFEYCLGDRT 244

Query: 310 SPA--SGVLEFNSARG--GDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQ-IPPSLFEM 364
           S +  S  L F         A TA L+ N +  + YYV + GFSVGG+ V     +   +
Sbjct: 245 SRSTRSSYLVFGRTPEPPSTAFTA-LLSNPRRPSLYYVDMAGFSVGGERVTGFSNASLAL 303

Query: 365 DEA-GDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGV---ALFDTCYDFSGL 420
           D A G GG++VD GTAI+R    AY +LRD+F   A            ++FD CYD  G 
Sbjct: 304 DTATGRGGVVVDSGTAISRFARDAYAALRDAFDARARAAGMRRLAGEHSVFDACYDLRGR 363

Query: 421 RSVRVPTVSLHFGAGKALDLPAKNYLIPVDS----AGTF--CFAFAPTSSALSIIGNVQQ 474
            +   P + LHF  G  + LP +NY +PVD     A ++  C  F      LS+IGNVQQ
Sbjct: 364 PAASAPLIVLHFAGGADMALPPENYFLPVDGGRRRAASYRRCLGFEAADDGLSVIGNVQQ 423

Query: 475 QGTRVSFDLANNRVGFTPNKC 495
           QG RV FD+   R+GF P  C
Sbjct: 424 QGFRVVFDVEKERIGFAPKGC 444


>gi|242086412|ref|XP_002443631.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
 gi|241944324|gb|EES17469.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
          Length = 507

 Score =  270 bits (691), Expect = 9e-70,   Method: Compositional matrix adjust.
 Identities = 195/487 (40%), Positives = 259/487 (53%), Gaps = 56/487 (11%)

Query: 52  SFEPETLEPFAEESETAAE----SFPLNSSSSFSLPLHSREILHKTR---HNDYRSLVLS 104
           S+    L P A  S  AAE    +   + ++S S  +H R +LH+     +     L+  
Sbjct: 34  SYAVTPLSPHAHSSPEAAEDGAHAHQEDMAASSSSAMHVR-LLHRDSFAVNATGAELLAR 92

Query: 105 RLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILP--EDFSTPVVSGASQGSGEYFSR 162
           RL+RD  R   +I+           +   P +   L        PVVS A   SG+Y ++
Sbjct: 93  RLQRDELRAAWIIS-------TAAANGTPPPDVVGLSTGRGLVAPVVSRAPT-SGDYIAK 144

Query: 163 IGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSL 222
           I VGTP  +  + LDT SD+ WLQC+PC  CY QS P+FDP+ S+SY  +   AP C++L
Sbjct: 145 IAVGTPAVEALLALDTASDLTWLQCQPCRRCYPQSGPVFDPRHSTSYGEMNYDAPDCQAL 204

Query: 223 DVSA---CRANRCLYQVAYGDG------SFTVGDLVTETVSFGNSGSVKGIALGCGHDNE 273
             S     +   C+Y V YGDG      S +VGDLV ET++F        +++GCGHDN+
Sbjct: 205 GRSGGGDAKRGTCIYTVLYGDGDGHGSTSTSVGDLVEETLTFAGGVRQAYLSIGCGHDNK 264

Query: 274 GLF-VGSAGLLGLGGGMLSLTKQIK----ATSLAYCLVD----RDSPASGVLEFNSARGG 324
           GLF   +AG+LGL  G +S+  QI       S +YCLVD      SP+S  L F +    
Sbjct: 265 GLFGAPAAGILGLSRGQISIPHQIAFLGYNASFSYCLVDFISGPGSPSS-TLTFGAGAVD 323

Query: 325 DAVTA---PLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMD-----EAGDGGIIVDC 376
            +  A   P + N+ + TFYYV L G SVGG  V   P + E D       G GG+I+D 
Sbjct: 324 TSPPASFTPTVLNQNMPTFYYVRLIGVSVGGVRV---PGVTERDLQLDPYTGHGGVILDS 380

Query: 377 GTAITRLQTQAYNSLRDSFVRLA---GNLKPTSGVALFDTCYDF---SGLRS-VRVPTVS 429
           GT +TRL   AY + RD+F   A   G +       LFDTCY     +GLR  V+VP VS
Sbjct: 381 GTTVTRLARPAYTAFRDAFRAAATGLGQVSTGGPSGLFDTCYTVGGRAGLRHCVKVPAVS 440

Query: 430 LHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSS-ALSIIGNVQQQGTRVSFDLANNRV 488
           +HF  G  L L  KNYLI VDS GT CFAFA T   ++S+IGN+ QQG RV +D+   RV
Sbjct: 441 MHFAGGVELSLQPKNYLITVDSRGTVCFAFAGTGDRSVSVIGNILQQGFRVVYDIGGQRV 500

Query: 489 GFTPNKC 495
           GF PN C
Sbjct: 501 GFAPNSC 507


>gi|358346736|ref|XP_003637421.1| Aspartic proteinase nepenthesin-1, partial [Medicago truncatula]
 gi|355503356|gb|AES84559.1| Aspartic proteinase nepenthesin-1, partial [Medicago truncatula]
          Length = 280

 Score =  270 bits (691), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 156/286 (54%), Positives = 195/286 (68%), Gaps = 27/286 (9%)

Query: 25  ASSRGLSETA-TTVLDVSSALQQTEHILSFEPETLEPFAEESETAAESFPLNSSSS-FSL 82
           A SR +   A TT+LDV S++Q+T  +L+F     +   ++S       P  SS+S  SL
Sbjct: 20  AHSRNIPHNAKTTILDVVSSIQKTYQVLNFNQNLKQQQQQKS-------PFTSSTSTLSL 72

Query: 83  PLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPE 142
            LHSR  L  + H DY+SL LSRL+RDSARV  + TKL    +N D+             
Sbjct: 73  QLHSRASL--SSHADYKSLTLSRLDRDSARVKYITTKLNQN-FNTDK------------- 116

Query: 143 DFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFD 202
             S P++SG SQGSGEYFSRIG+G PP Q  MVLDTGSDI+W+QC PC +CY+Q+DPIF+
Sbjct: 117 -LSGPIISGTSQGSGEYFSRIGIGEPPSQAYMVLDTGSDISWVQCAPCADCYRQADPIFE 175

Query: 203 PKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVK 262
           P  S+SY+PL C A QC+ LD S CR   CLYQV+YGDGS+TVGD VTETV+ G    VK
Sbjct: 176 PTASASYAPLSCEAAQCRYLDQSQCRNGNCLYQVSYGDGSYTVGDFVTETVTIG-VNKVK 234

Query: 263 GIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDR 308
            +ALGCGH+NEGLFVG+AGL+GLGGG LS   Q+ +TS +YCLVDR
Sbjct: 235 NVALGCGHNNEGLFVGAAGLIGLGGGPLSFPAQLNSTSFSYCLVDR 280


>gi|224092220|ref|XP_002309515.1| predicted protein [Populus trichocarpa]
 gi|222855491|gb|EEE93038.1| predicted protein [Populus trichocarpa]
          Length = 473

 Score =  270 bits (689), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 151/383 (39%), Positives = 211/383 (55%), Gaps = 21/383 (5%)

Query: 128 DRHELKPAEAQ-----ILPEDFST-PVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSD 181
           DRH +    A+     +  E  +T PV SGAS GSG+Y   +G+GTP ++F+++ DTGSD
Sbjct: 96  DRHRVDSIHARLSSHGVFQEKQATLPVQSGASIGSGDYAVTVGLGTPKKEFTLIFDTGSD 155

Query: 182 INWLQCRPCTE-CYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVS---ACRANRCLYQVA 237
           + W QC PC + CY+Q +P  DP  S+SY  + C++  CK LD     +C +  CLYQV 
Sbjct: 156 LTWTQCEPCAKTCYKQKEPRLDPTKSTSYKNISCSSAFCKLLDTEGGESCSSPTCLYQVQ 215

Query: 238 YGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSL---TK 294
           YGDGS+++G   TET++  +S   K    GCG  N GLF G+AGLLGLG   LSL   T 
Sbjct: 216 YGDGSYSIGFFATETLTLSSSNVFKNFLFGCGQQNSGLFRGAAGLLGLGRTKLSLPSQTA 275

Query: 295 QIKATSLAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQA 354
           Q      +YCL    S + G L F           PL  + K   FY + +T  SVGG  
Sbjct: 276 QKYKKLFSYCL-PASSSSKGYLSFGGQVSKTVKFTPLSEDFKSTPFYGLDITELSVGGNK 334

Query: 355 VQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTC 414
           + I  S+F        G ++D GT ITRL + AY++L  +F +L  +   T G ++FDTC
Sbjct: 335 LSIDASIFSTS-----GTVIDSGTVITRLPSTAYSALSSAFQKLMTDYPSTDGYSIFDTC 389

Query: 415 YDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSAL--SIIGNV 472
           YDFS   ++++P V + F  G  +D+     L PV+     C AFA     +  +I GN 
Sbjct: 390 YDFSKNETIKIPKVGVSFKGGVEMDIDVSGILYPVNGLKKVCLAFAGNGDDVKAAIFGNT 449

Query: 473 QQQGTRVSFDLANNRVGFTPNKC 495
           QQ+  +V +D A  RVGF P+ C
Sbjct: 450 QQKTYQVVYDDAKGRVGFAPSGC 472


>gi|125527523|gb|EAY75637.1| hypothetical protein OsI_03542 [Oryza sativa Indica Group]
          Length = 446

 Score =  269 bits (688), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 170/441 (38%), Positives = 232/441 (52%), Gaps = 46/441 (10%)

Query: 79  SFSLPLHSREILHKTRHNDYR-SLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEA 137
           +  +P+  R+ L        R SL+  RL  D+AR  +L+                    
Sbjct: 26  TLHVPVFHRDALFPPPPGAKRGSLLRQRLAADAARYASLVDATG---------------- 69

Query: 138 QILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQS 197
                   +PV SG    SGEYF+ +GVGTP  +  +V+DTGSD+ WLQC PC  CY Q 
Sbjct: 70  -----RLHSPVFSGIPFESGEYFALVGVGTPSTKAMLVIDTGSDLVWLQCSPCRRCYAQR 124

Query: 198 DPIFDPKTSSSYSPLPCAAPQCKSLDVSACRA-----NRCLYQVAYGDGSFTVGDLVTET 252
             +FDP+ SS+Y  +PC++PQC++L    C +       C Y VAYGDGS + G+L T+ 
Sbjct: 125 GQVFDPRRSSTYRRVPCSSPQCRALRFPGCDSGGAAGGGCRYMVAYGDGSSSTGELATDK 184

Query: 253 VSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIK---ATSLAYCLVDRD 309
           ++F N   V  + LGCG DNEGLF  +AGLLG+  G +S++ Q+     +   YCL DR 
Sbjct: 185 LAFANDTYVNNVTLGCGRDNEGLFDSAAGLLGVARGKISISTQVAPAYGSVFEYCLGDRT 244

Query: 310 SPA--SGVLEFNSARG--GDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQ-IPPSLFEM 364
           S +  S  L F         A TA L+ N +  + YYV + GFSVGG+ V     +   +
Sbjct: 245 SRSTRSSYLVFGRTPEPPSTAFTA-LLSNPRRPSLYYVDMAGFSVGGERVTGFSNASLAL 303

Query: 365 DEA-GDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGV---ALFDTCYDFSGL 420
           D A G GG++VD GTAI+R    AY +LRD+F   A            ++FD CYD  G 
Sbjct: 304 DTATGRGGVVVDSGTAISRFARDAYAALRDAFDARARAAGMRRLAGEHSVFDACYDLRGR 363

Query: 421 RSVRVPTVSLHFGAGKALDLPAKNYLIPVDS----AGTF--CFAFAPTSSALSIIGNVQQ 474
            +   P + LHF  G  + LP +NY +PVD     A ++  C  F      LS+IGNVQQ
Sbjct: 364 PAASAPLIVLHFAGGADMALPPENYFLPVDGGRRRAASYRRCLGFEAADDGLSVIGNVQQ 423

Query: 475 QGTRVSFDLANNRVGFTPNKC 495
           QG RV FD+   R+GF P  C
Sbjct: 424 QGFRVVFDVEKERIGFAPKGC 444


>gi|125564143|gb|EAZ09523.1| hypothetical protein OsI_31798 [Oryza sativa Indica Group]
          Length = 485

 Score =  269 bits (688), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 157/398 (39%), Positives = 225/398 (56%), Gaps = 21/398 (5%)

Query: 106 LERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGV 165
           LERD ARV+++  K+  A        + PA A    +  S P   G S G+G Y   +G+
Sbjct: 100 LERDQARVDSIHRKVAGA--GGAPSVVDPARAS--EQGVSLPAQRGISLGTGNYVVSVGL 155

Query: 166 GTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVS 225
           GTP +Q++++ DTGSD++W+QC+PC +CY+Q DP+FDP  SS+Y+ + C AP+C+ LD S
Sbjct: 156 GTPAKQYAVIFDTGSDLSWVQCKPCADCYEQQDPLFDPSLSSTYAAVACGAPECQELDAS 215

Query: 226 ACRAN-RCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLG 284
            C ++ RC Y+V YGD S T G+LV +T++   S ++ G   GCG  N GLF    GL G
Sbjct: 216 GCSSDSRCRYEVQYGDQSQTDGNLVRDTLTLSASDTLPGFVFGCGDQNAGLFGQVDGLFG 275

Query: 285 LGGGMLSLTKQIKAT---SLAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFY 341
           LG   +SL  Q   +      YCL    S   G L    A   +A    L  +    +FY
Sbjct: 276 LGREKVSLPSQGAPSYGPGFTYCLPSSSS-GRGYLSLGGAPPANAQFTALA-DGATPSFY 333

Query: 342 YVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGN 401
           Y+ L G  VGG+A++IP + F          ++D GT ITRL  +AY  LR +F R    
Sbjct: 334 YIDLVGIKVGGRAIRIPATAFAAAGG----TVIDSGTVITRLPPRAYAPLRAAFARSMAQ 389

Query: 402 LKPTSGVALFDTCYDFSGLRSVRVPTVSLHF--GAGKALDLPAKNYLIPVDSAGTFCFAF 459
            K    +++ DTCYDF+G R+ ++PTV L F  GA  +LD     Y+  V  A   C AF
Sbjct: 390 YKKAPALSILDTCYDFTGHRTAQIPTVELAFAGGATVSLDFTGVLYVSKVSQA---CLAF 446

Query: 460 APTS--SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           AP +  S+++I+GN QQ+   V++D+AN R+GF    C
Sbjct: 447 APNADDSSIAILGNTQQKTFAVTYDVANQRIGFGAKGC 484


>gi|115479815|ref|NP_001063501.1| Os09g0482200 [Oryza sativa Japonica Group]
 gi|50725878|dbj|BAD33407.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|50725881|dbj|BAD33410.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|113631734|dbj|BAF25415.1| Os09g0482200 [Oryza sativa Japonica Group]
 gi|125606112|gb|EAZ45148.1| hypothetical protein OsJ_29786 [Oryza sativa Japonica Group]
          Length = 485

 Score =  269 bits (687), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 157/398 (39%), Positives = 225/398 (56%), Gaps = 21/398 (5%)

Query: 106 LERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGV 165
           LERD ARV+++  K+  A        + PA A    +  S P   G S G+G Y   +G+
Sbjct: 100 LERDQARVDSIHRKVAGA--GGAPSVVDPARAS--EQGVSLPAQRGISLGTGNYVVSVGL 155

Query: 166 GTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVS 225
           GTP +Q++++ DTGSD++W+QC+PC +CY+Q DP+FDP  SS+Y+ + C AP+C+ LD S
Sbjct: 156 GTPAKQYAVIFDTGSDLSWVQCKPCADCYEQQDPLFDPSLSSTYAAVACGAPECQELDAS 215

Query: 226 ACRAN-RCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLG 284
            C ++ RC Y+V YGD S T G+LV +T++   S ++ G   GCG  N GLF    GL G
Sbjct: 216 GCSSDSRCRYEVQYGDQSQTDGNLVRDTLTLSASDTLPGFVFGCGDQNAGLFGQVDGLFG 275

Query: 285 LGGGMLSLTKQIKAT---SLAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFY 341
           LG   +SL  Q   +      YCL    S   G L    A   +A    L  +    +FY
Sbjct: 276 LGREKVSLPSQGAPSYGPGFTYCLPSSSS-GRGYLSLGGAPPANAQFTALA-DGATPSFY 333

Query: 342 YVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGN 401
           Y+ L G  VGG+A++IP + F          ++D GT ITRL  +AY  LR +F R    
Sbjct: 334 YIDLVGIKVGGRAIRIPATAFAAAGG----TVIDSGTVITRLPPRAYAPLRAAFARSMAQ 389

Query: 402 LKPTSGVALFDTCYDFSGLRSVRVPTVSLHF--GAGKALDLPAKNYLIPVDSAGTFCFAF 459
            K    +++ DTCYDF+G R+ ++PTV L F  GA  +LD     Y+  V  A   C AF
Sbjct: 390 YKKAPALSILDTCYDFTGHRTAQIPTVELAFAGGATVSLDFTGVLYVSKVSQA---CLAF 446

Query: 460 APTS--SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           AP +  S+++I+GN QQ+   V++D+AN R+GF    C
Sbjct: 447 APNADDSSIAILGNTQQKTFAVAYDVANQRIGFGAKGC 484


>gi|326491519|dbj|BAJ94237.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326524456|dbj|BAK00611.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 499

 Score =  266 bits (681), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 150/360 (41%), Positives = 204/360 (56%), Gaps = 18/360 (5%)

Query: 145 STPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCT-ECYQQSDPIFDP 203
           S P  SG +  +G Y   +G+GTP  ++++V DTGSD  W+QCRPC  +CY+Q +P+FDP
Sbjct: 149 SLPATSGRAVSTGNYVVTVGLGTPASKYTVVFDTGSDTTWVQCRPCVVKCYKQKEPLFDP 208

Query: 204 KTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKG 263
             SS+Y+ + C    C  LD + C    CLY V YGDGS+TVG    +T++  +  ++KG
Sbjct: 209 AKSSTYANVSCTDSACADLDTNGCTGGHCLYAVQYGDGSYTVGFFAQDTLTIAHD-AIKG 267

Query: 264 IALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQI---KATSLAYCLVDRDSPASGVLEFN- 319
              GCG  N GLF  +AGL+GLG G  SLT Q       + AYCL    +  +G L+F  
Sbjct: 268 FRFGCGEKNNGLFGKTAGLMGLGRGKTSLTVQAYNKYGGAFAYCLPALTT-GTGYLDFGP 326

Query: 320 SARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTA 379
            + G +A   P++ +K   TFYYVG+TG  VGGQ V +  S+F        G +VD GT 
Sbjct: 327 GSAGNNARLTPMLTDKG-QTFYYVGMTGIRVGGQQVPVAESVFST-----AGTLVDSGTV 380

Query: 380 ITRLQTQAYNSLRDSF--VRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKA 437
           ITRL   AY +L  +F  V LA   K   G ++ DTCYDF+GL  V +PTVSL F  G  
Sbjct: 381 ITRLPATAYTALSSAFDKVMLARGYKKAPGYSILDTCYDFTGLSDVELPTVSLVFQGGAC 440

Query: 438 LDLPAKNYLIPVDSAGTFCFAFAPTS--SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           LD+     +  +  A   C AFA      +++I+GN QQ+   V +DL    VGF P  C
Sbjct: 441 LDVDVSGIVYAISEA-QVCLAFASNGDDESVAIVGNTQQKTYGVLYDLGKKTVGFAPGSC 499


>gi|357137788|ref|XP_003570481.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 455

 Score =  266 bits (680), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 170/433 (39%), Positives = 234/433 (54%), Gaps = 28/433 (6%)

Query: 75  NSSSSFSLPLHS-REILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELK 133
           +SSS+  LPLH  R        +   S VL+    D+AR+ +   +L             
Sbjct: 38  HSSSAVHLPLHHPRGPCSPLSADIPFSAVLTH---DAARIASFAARLAKKSSPSSASATT 94

Query: 134 PAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPC-TE 192
            A    L    S P+  G S G G Y +R+G+GTP + + MV+DTGS + WLQC PC   
Sbjct: 95  QAAGSSLA---SVPLTPGTSVGVGNYVTRMGLGTPAKPYIMVVDTGSSLTWLQCSPCRVS 151

Query: 193 CYQQSDPIFDPKTSSSYSPLPCAAPQC-----KSLDVSACR-ANRCLYQVAYGDGSFTVG 246
           C++QS P+FDPKTSSSY+ + C++PQC      +L+ + C  +N C+YQ +YGD SF+VG
Sbjct: 152 CHRQSGPVFDPKTSSSYAAVSCSSPQCDGLSTATLNPAVCSPSNVCIYQASYGDSSFSVG 211

Query: 247 DLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAY 303
            L  +TVSFG + SV     GCG DNEGLF  SAGL+GL    LSL  Q+  T   S +Y
Sbjct: 212 YLSKDTVSFG-ANSVPNFYYGCGQDNEGLFGRSAGLMGLARNKLSLLYQLAPTLGYSFSY 270

Query: 304 CLVDRDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFE 363
           CL    S  SG L   S   G     P++ N   D+ Y++ L+G +V G+ + +  S + 
Sbjct: 271 CLPSTSS--SGYLSIGSYNPGGYSYTPMVSNTLDDSLYFISLSGMTVAGKPLAVSSSEYT 328

Query: 364 MDEAGDGGIIVDCGTAITRLQTQAYNSLRDSF-VRLAGNLKPTSGVALFDTCYDFSGLRS 422
                    I+D GT ITRL T  Y +L  +    + G+ K  +  ++ DTC++    + 
Sbjct: 329 SLP-----TIIDSGTVITRLPTSVYTALSKAVAAAMKGSTKRAAAYSILDTCFEGQASKL 383

Query: 423 VRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFD 482
             VP VS+ F  G  L L A N L+ VD A T C AFAP  SA +IIGN QQQ   V +D
Sbjct: 384 RAVPAVSMAFSGGATLKLSAGNLLVDVDGA-TTCLAFAPARSA-AIIGNTQQQTFSVVYD 441

Query: 483 LANNRVGFTPNKC 495
           + +NR+GF    C
Sbjct: 442 VKSNRIGFAAAGC 454


>gi|165292434|dbj|BAF98915.1| aspartic proteinase nepenthesin I [Nepenthes alata]
          Length = 437

 Score =  265 bits (678), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 163/427 (38%), Positives = 234/427 (54%), Gaps = 21/427 (4%)

Query: 80  FSLPLHSREILHKTRHNDYRSLVLS-RLERDSARVNTLITKLQLAIYNVDR--HELKPAE 136
           F  P HS        H++ +       LE   +  N  +TK +L    V+R    L+  E
Sbjct: 18  FVAPTHSTSRTALNHHHEPKVAGFQIMLEHVDSGKN--LTKFELLERAVERGSRRLQRLE 75

Query: 137 AQIL-PEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQ 195
           A +  P    TPV +G     GEY   + +GTP + FS ++DTGSD+ W QC+PCT+C+ 
Sbjct: 76  AMLNGPSGVETPVYAG----DGEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPCTQCFN 131

Query: 196 QSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSF 255
           QS PIF+P+ SSS+S LPC++  C++L    C  N C Y   YGDGS T G + TET++F
Sbjct: 132 QSTPIFNPQGSSSFSTLPCSSQLCQALQSPTCSNNSCQYTYGYGDGSETQGSMGTETLTF 191

Query: 256 GNSGSVKGIALGCGHDNEGLFVGS-AGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASG 314
           G S S+  I  GCG +N+G   G+ AGL+G+G G LSL  Q+  T  +YC+    S  S 
Sbjct: 192 G-SVSIPNITFGCGENNQGFGQGNGAGLVGMGRGPLSLPSQLDVTKFSYCMTPIGSSNSS 250

Query: 315 VLEFNSARGGDAVTAP---LIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMD-EAGDG 370
            L   S        +P   LI++ ++ TFYY+ L G SVG   + I PS+F+++   G G
Sbjct: 251 TLLLGSLANSVTAGSPNTTLIQSSQIPTFYYITLNGLSVGSTPLPIDPSVFKLNSNNGTG 310

Query: 371 GIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVAL-FDTCYDF-SGLRSVRVPTV 428
           GII+D GT +T     AY ++R +F+    NL   +G +  FD C+   S   ++++PT 
Sbjct: 311 GIIIDSGTTLTYFVDNAYQAVRQAFISQM-NLSVVNGSSSGFDLCFQMPSDQSNLQIPTF 369

Query: 429 SLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRV 488
            +HF  G  L LP++NY I   S G  C A   +S  +SI GN+QQQ   V +D  N+ V
Sbjct: 370 VMHFDGGD-LVLPSENYFIS-PSNGLICLAMGSSSQGMSIFGNIQQQNLLVVYDTGNSVV 427

Query: 489 GFTPNKC 495
            F   +C
Sbjct: 428 SFLSAQC 434


>gi|326520736|dbj|BAJ92731.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 499

 Score =  265 bits (677), Expect = 4e-68,   Method: Compositional matrix adjust.
 Identities = 150/360 (41%), Positives = 203/360 (56%), Gaps = 18/360 (5%)

Query: 145 STPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCT-ECYQQSDPIFDP 203
           S P  SG +  +G Y   +G+GTP  ++++V DTGSD  W+QCRPC  +CY+Q  P+FDP
Sbjct: 149 SLPATSGRAVSTGNYVVTVGLGTPASKYTVVFDTGSDTTWVQCRPCVVKCYKQKGPLFDP 208

Query: 204 KTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKG 263
             SS+Y+ + C    C  LD + C    CLY V YGDGS+TVG    +T++  +  ++KG
Sbjct: 209 AKSSTYANVSCTDSACADLDTNGCTGGHCLYAVQYGDGSYTVGFFAQDTLTIAHD-AIKG 267

Query: 264 IALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQI---KATSLAYCLVDRDSPASGVLEFN- 319
              GCG  N GLF  +AGL+GLG G  SLT Q       + AYCL    +  +G L+F  
Sbjct: 268 FRFGCGEKNNGLFGKTAGLMGLGRGKTSLTVQAYNKYGGAFAYCLPALTT-GTGYLDFGP 326

Query: 320 SARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTA 379
            + G +A   P++ +K   TFYYVG+TG  VGGQ V +  S+F        G +VD GT 
Sbjct: 327 GSAGNNARLTPMLTDKG-QTFYYVGMTGIRVGGQQVPVAESVFST-----AGTLVDSGTV 380

Query: 380 ITRLQTQAYNSLRDSF--VRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKA 437
           ITRL   AY +L  +F  V LA   K   G ++ DTCYDF+GL  V +PTVSL F  G  
Sbjct: 381 ITRLPATAYTALSSAFDKVMLARGYKKAPGYSILDTCYDFTGLSDVELPTVSLVFQGGAC 440

Query: 438 LDLPAKNYLIPVDSAGTFCFAFAPTS--SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           LD+     +  +  A   C AFA      +++I+GN QQ+   V +DL    VGF P  C
Sbjct: 441 LDVDVSGIVYAISEA-QVCLAFASNGDDESVAIVGNTQQKTYGVLYDLGKKTVGFAPGSC 499


>gi|409179878|gb|AFV26024.1| aspartic proteinase nepenthesin 1 [Nepenthes mirabilis]
          Length = 437

 Score =  265 bits (677), Expect = 4e-68,   Method: Compositional matrix adjust.
 Identities = 163/427 (38%), Positives = 233/427 (54%), Gaps = 21/427 (4%)

Query: 80  FSLPLHSREILHKTRHNDYRSLVLS-RLERDSARVNTLITKLQLAIYNVDR--HELKPAE 136
           F  P HS        H++ +       LE   +  N  +TK +L    V+R    L+  E
Sbjct: 18  FVAPTHSTSRTALNHHHEPKVAGFQIMLEHVDSGKN--LTKFELLERAVERGSRRLQRLE 75

Query: 137 AQIL-PEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQ 195
           A +  P    TPV +G     GEY   + +GTP + FS ++DTGSD+ W QC+PCT+C+ 
Sbjct: 76  AMLNGPSGVETPVYAG----DGEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPCTQCFN 131

Query: 196 QSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSF 255
           QS PIF+P+ SSS+S LPC++  C++L    C  N C Y   YGDGS T G + TET++F
Sbjct: 132 QSTPIFNPQGSSSFSTLPCSSQLCQALQSPTCSNNSCQYTYGYGDGSETQGSMGTETLTF 191

Query: 256 GNSGSVKGIALGCGHDNEGLFVGS-AGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASG 314
           G S S+  I  GCG +N+G   G+ AGL+G+G G LSL  Q+  T  +YC+    S  S 
Sbjct: 192 G-SVSIPNITFGCGENNQGFGQGNGAGLVGMGRGPLSLPSQLDVTKFSYCMTPIGSSTSS 250

Query: 315 VLEFNSARGGDAVTAP---LIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMD-EAGDG 370
            L   S        +P   LI + ++ TFYY+ L G SVG   + I PS+F+++   G G
Sbjct: 251 TLLLGSLANSVTAGSPNTTLIESSQIPTFYYITLNGLSVGSTPLPIDPSVFKLNSNNGTG 310

Query: 371 GIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVAL-FDTCYDF-SGLRSVRVPTV 428
           GII+D GT +T     AY ++R +F+    NL   +G +  FD C+   S   ++++PT 
Sbjct: 311 GIIIDSGTTLTYFADNAYQAVRQAFISQM-NLSVVNGSSSGFDLCFQMPSDQSNLQIPTF 369

Query: 429 SLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRV 488
            +HF  G  L LP++NY I   S G  C A   +S  +SI GN+QQQ   V +D  N+ V
Sbjct: 370 VMHFDGGD-LVLPSENYFIS-PSNGLICLAMGSSSQGMSIFGNIQQQNLLVVYDTGNSVV 427

Query: 489 GFTPNKC 495
            F   +C
Sbjct: 428 SFLFAQC 434


>gi|225216879|gb|ACN85177.1| aspartic proteinase nepenthesin-1 precursor [Oryza nivara]
 gi|225216896|gb|ACN85193.1| aspartic proteinase nepenthesin-1 precursor [Oryza rufipogon]
          Length = 515

 Score =  265 bits (677), Expect = 4e-68,   Method: Compositional matrix adjust.
 Identities = 150/353 (42%), Positives = 204/353 (57%), Gaps = 16/353 (4%)

Query: 151 GASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCT-ECYQQSDPIFDPKTSSSY 209
           G + G+G Y   +G+GTP  ++++V DTGSD  W+QC+PC   CY+Q + +FDP +SS+Y
Sbjct: 171 GRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVACYEQREKLFDPASSSTY 230

Query: 210 SPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCG 269
           + + CAAP C  LDVS C    CLY V YGDGS+++G    +T++  +  +VKG   GCG
Sbjct: 231 ANVSCAAPACSDLDVSGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYDAVKGFRFGCG 290

Query: 270 HDNEGLFVGSAGLLGLGGGMLSLTKQIKATS---LAYCLVDRDSPASGVLEFNSARGGDA 326
             N+GLF  +AGLLGLG G  SL  Q         A+CL  R S  +G L+F +      
Sbjct: 291 ERNDGLFGEAAGLLGLGRGKTSLPVQTYGKYGGVFAHCLPAR-STGTGYLDFGAGSPPAT 349

Query: 327 VTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQ 386
            T P++      TFYYVG+TG  VGG+ + I PS+F        G IVD GT ITRL   
Sbjct: 350 TTTPMLTGNG-PTFYYVGMTGIRVGGRLLPIAPSVFAA-----AGTIVDSGTVITRLPPA 403

Query: 387 AYNSLRDSFVRLAG--NLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKN 444
           AY+SLR +F         +  + V+L DTCYDF+G+  V +PTVSL F  G ALD+ A  
Sbjct: 404 AYSSLRSAFAAAMAARGYRKAAAVSLLDTCYDFTGMSQVAIPTVSLLFQGGAALDVDASG 463

Query: 445 YLIPVDSAGTFCFAFAPTSSA--LSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            +  V SA   C AFA       + I+GN Q +   V++D+    VGF+P  C
Sbjct: 464 IMYTV-SASQVCLAFAGNEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 515


>gi|225216914|gb|ACN85210.1| aspartic proteinase nepenthesin-1 precursor [Oryza glaberrima]
          Length = 516

 Score =  265 bits (676), Expect = 5e-68,   Method: Compositional matrix adjust.
 Identities = 150/353 (42%), Positives = 204/353 (57%), Gaps = 16/353 (4%)

Query: 151 GASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCT-ECYQQSDPIFDPKTSSSY 209
           G + G+G Y   +G+GTP  ++++V DTGSD  W+QC+PC   CY+Q + +FDP +SS+Y
Sbjct: 172 GRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVACYEQREKLFDPASSSTY 231

Query: 210 SPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCG 269
           + + CAAP C  LDVS C    CLY V YGDGS+++G    +T++  +  +VKG   GCG
Sbjct: 232 ANVSCAAPACSDLDVSGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYDAVKGFRFGCG 291

Query: 270 HDNEGLFVGSAGLLGLGGGMLSLTKQIKATS---LAYCLVDRDSPASGVLEFNSARGGDA 326
             N+GLF  +AGLLGLG G  SL  Q         A+CL  R S  +G L+F +      
Sbjct: 292 ERNDGLFGEAAGLLGLGRGKTSLPVQTYGKYGGVFAHCLPPR-STGTGYLDFGAGSPPAT 350

Query: 327 VTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQ 386
            T P++      TFYYVG+TG  VGG+ + I PS+F        G IVD GT ITRL   
Sbjct: 351 TTTPMLTGNG-PTFYYVGMTGIRVGGRLLPIAPSVFAA-----AGTIVDSGTVITRLPPA 404

Query: 387 AYNSLRDSFVRLAG--NLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKN 444
           AY+SLR +F         +  + V+L DTCYDF+G+  V +PTVSL F  G ALD+ A  
Sbjct: 405 AYSSLRSAFAAAMAARGYRKAAAVSLLDTCYDFTGMSQVAIPTVSLLFQGGAALDVDASG 464

Query: 445 YLIPVDSAGTFCFAFAPTSSA--LSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            +  V SA   C AFA       + I+GN Q +   V++D+    VGF+P  C
Sbjct: 465 IMYTV-SASQVCLAFAGNEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 516


>gi|115468912|ref|NP_001058055.1| Os06g0610800 [Oryza sativa Japonica Group]
 gi|51091964|dbj|BAD35493.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
           sativa Japonica Group]
 gi|113596095|dbj|BAF19969.1| Os06g0610800 [Oryza sativa Japonica Group]
 gi|218198535|gb|EEC80962.1| hypothetical protein OsI_23679 [Oryza sativa Indica Group]
          Length = 519

 Score =  265 bits (676), Expect = 5e-68,   Method: Compositional matrix adjust.
 Identities = 150/353 (42%), Positives = 204/353 (57%), Gaps = 16/353 (4%)

Query: 151 GASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCT-ECYQQSDPIFDPKTSSSY 209
           G + G+G Y   +G+GTP  ++++V DTGSD  W+QC+PC   CY+Q + +FDP +SS+Y
Sbjct: 175 GRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVACYEQREKLFDPASSSTY 234

Query: 210 SPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCG 269
           + + CAAP C  LDVS C    CLY V YGDGS+++G    +T++  +  +VKG   GCG
Sbjct: 235 ANVSCAAPACSDLDVSGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYDAVKGFRFGCG 294

Query: 270 HDNEGLFVGSAGLLGLGGGMLSLTKQIKATS---LAYCLVDRDSPASGVLEFNSARGGDA 326
             N+GLF  +AGLLGLG G  SL  Q         A+CL  R S  +G L+F +      
Sbjct: 295 ERNDGLFGEAAGLLGLGRGKTSLPVQTYGKYGGVFAHCLPAR-STGTGYLDFGAGSPPAT 353

Query: 327 VTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQ 386
            T P++      TFYYVG+TG  VGG+ + I PS+F        G IVD GT ITRL   
Sbjct: 354 TTTPMLTGNG-PTFYYVGMTGIRVGGRLLPIAPSVFAA-----AGTIVDSGTVITRLPPA 407

Query: 387 AYNSLRDSFVRLAG--NLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKN 444
           AY+SLR +F         +  + V+L DTCYDF+G+  V +PTVSL F  G ALD+ A  
Sbjct: 408 AYSSLRSAFAAAMAARGYRKAAAVSLLDTCYDFTGMSQVAIPTVSLLFQGGAALDVDASG 467

Query: 445 YLIPVDSAGTFCFAFAPTSSA--LSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            +  V SA   C AFA       + I+GN Q +   V++D+    VGF+P  C
Sbjct: 468 IMYTV-SASQVCLAFAGNEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 519


>gi|225216988|gb|ACN85278.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
          Length = 518

 Score =  265 bits (676), Expect = 6e-68,   Method: Compositional matrix adjust.
 Identities = 152/357 (42%), Positives = 203/357 (56%), Gaps = 19/357 (5%)

Query: 150 SGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTE-CYQQSDPIFDPKTSSS 208
           SG + G+G Y   +G+GTP  ++++V DTGSD  W+QC+PC   CY+Q + +FDP  SS+
Sbjct: 170 SGRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQQEKLFDPARSST 229

Query: 209 YSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGC 268
           Y+ + CAAP C  LD   C    CLY V YGDGS+++G    +T++  +  +VKG   GC
Sbjct: 230 YANVSCAAPACFDLDTRGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYDAVKGFRFGC 289

Query: 269 GHDNEGLFVGSAGLLGLGGGMLSLTKQI---KATSLAYCLVDRDSPASGVLEF---NSAR 322
           G  NEGLF  +AGLLGLG G  SL  Q         A+CL  R S  +G L+F   + A 
Sbjct: 290 GERNEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARSS-GTGYLDFGPGSPAA 348

Query: 323 GGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITR 382
            G  +T P++ +    TFYYVG+TG  VGGQ + IP S+F        G IVD GT ITR
Sbjct: 349 AGARLTTPMLTDNG-PTFYYVGMTGIRVGGQLLSIPQSVFAT-----AGTIVDSGTVITR 402

Query: 383 LQTQAYNSLRDSFVR--LAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDL 440
           L   AY+SLR +FV    A   K    V+L DTCYDF+G+  V +PTVSL F  G  LD+
Sbjct: 403 LPPPAYSSLRSAFVSAMAARGYKKAPAVSLLDTCYDFTGMSQVAIPTVSLLFQGGAILDV 462

Query: 441 PAKNYLIPVDSAGTFCFAFAPTSSA--LSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            A   +    S    C  FA       + I+GN Q +   V++D+    VGF+P  C
Sbjct: 463 DASGIMYAA-SVSQVCLGFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 518


>gi|225216995|gb|ACN85284.1| aspartic proteinase nepenthesin-1 precursor [Oryza australiensis]
          Length = 519

 Score =  264 bits (674), Expect = 8e-68,   Method: Compositional matrix adjust.
 Identities = 151/357 (42%), Positives = 202/357 (56%), Gaps = 19/357 (5%)

Query: 150 SGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTE-CYQQSDPIFDPKTSSS 208
           SG + G+G Y   +G+GTP  ++++V DTGSD  W+QC+PC   CY+Q + +FDP  SS+
Sbjct: 171 SGRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQREKLFDPARSST 230

Query: 209 YSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGC 268
           Y+ + CAAP C  LD   C    CLY V YGDGS+++G    +T++  +  +VKG   GC
Sbjct: 231 YANISCAAPACSDLDTRGCSGGNCLYGVQYGDGSYSIGFFAMDTLTLSSYDAVKGFRFGC 290

Query: 269 GHDNEGLFVGSAGLLGLGGGMLSLTKQI---KATSLAYCLVDRDSPASGVLEF---NSAR 322
           G  NEGLF  +AGLLGLG G  SL  Q         A+CL  R S  +G L+F   + A 
Sbjct: 291 GERNEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARSS-GTGYLDFGPGSPAA 349

Query: 323 GGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITR 382
            G  +T P++ +    TFYYVG+TG  VGGQ + IP S+F        G IVD GT ITR
Sbjct: 350 AGARLTTPMLTDNG-PTFYYVGMTGIRVGGQLLSIPQSVFTT-----AGTIVDSGTVITR 403

Query: 383 LQTQAYNSLRDSFVR--LAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDL 440
           L   AY+SLR +F     A   K    V+L DTCYDF+G+  V +PTVSL F  G  LD+
Sbjct: 404 LPPAAYSSLRSAFASAMAARGYKKAPAVSLLDTCYDFTGMSQVAIPTVSLLFQGGARLDV 463

Query: 441 PAKNYLIPVDSAGTFCFAFAPTSSA--LSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            A   +    S    C  FA       + I+GN Q +   V++D+    VGF+P  C
Sbjct: 464 DASGIMYAA-SVSQVCLGFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 519


>gi|226492391|ref|NP_001140482.1| uncharacterized protein LOC100272542 precursor [Zea mays]
 gi|224030447|gb|ACN34299.1| unknown [Zea mays]
 gi|414887506|tpg|DAA63520.1| TPA: hypothetical protein ZEAMMB73_432695 [Zea mays]
          Length = 512

 Score =  261 bits (668), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 170/459 (37%), Positives = 237/459 (51%), Gaps = 38/459 (8%)

Query: 66  ETAAESFPLNSSSSFSLPLHSREILH--KTRHNDYRSLVLSRLERDSARVNTLITKLQLA 123
           + A E  P + SSS  L +  R      +TR   +  L     E+D+ RV  +  ++  +
Sbjct: 60  DAADEQKPASPSSSLKLHMTHRRGAEGGRTRKGSFLDLA----EKDAVRVEAMHRRVASS 115

Query: 124 IYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDIN 183
             +  R        +++       V SG + GS EY   + VGTPPR+F M++DTGSD+N
Sbjct: 116 SSSPRRGRALSESERVV-----ATVESGVAVGSAEYLMDVYVGTPPRRFQMIMDTGSDLN 170

Query: 184 WLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSL------DVSACR---ANRCLY 234
           WLQC PC +C++Q  P+FDP  SSSY  L C  P+C  +         ACR    + C Y
Sbjct: 171 WLQCAPCLDCFEQRGPVFDPAASSSYRNLTCGDPRCGHVAPPEAPAPRACRRPGEDPCPY 230

Query: 235 QVAYGDGSFTVGDLVTETVSF-----GNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGM 289
              YGD S + GDL  E+ +      G S  V G+  GCGH N GLF G+AGLLGLG G 
Sbjct: 231 YYWYGDQSNSTGDLALESFTVNLTAPGASSRVDGVVFGCGHRNRGLFHGAAGLLGLGRGP 290

Query: 290 LSLTKQIKAT----SLAYCLVDRDSPASGVLEFNSARGGDAVTAPLIR-------NKKVD 338
           LS   Q++A     + +YCLVD  S  +  + F           P ++       +   D
Sbjct: 291 LSFASQLRAVYGGHTFSYCLVDHGSDVASKVVFGEDDALALAAHPRLKYTAFAPASSPAD 350

Query: 339 TFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFV-R 397
           TFYYV LTG  VGG+ + I    ++  E G GG I+D GT ++     AY  +R +F+ R
Sbjct: 351 TFYYVRLTGVLVGGELLNISSDTWDASEGGSGGTIIDSGTTLSYFVEPAYQVIRRAFIDR 410

Query: 398 LAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCF 457
           ++G+  P     +   CY+ SG+    VP +SL F  G   D PA+NY I +D  G  C 
Sbjct: 411 MSGSYPPVPDFPVLSPCYNVSGVERPEVPELSLLFADGAVWDFPAENYFIRLDPDGIMCL 470

Query: 458 AFAPT-SSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           A   T  + +SIIGN QQQ   V++DL NNR+GF P +C
Sbjct: 471 AVLGTPRTGMSIIGNFQQQNFHVAYDLHNNRLGFAPRRC 509


>gi|223949775|gb|ACN28971.1| unknown [Zea mays]
 gi|414590177|tpg|DAA40748.1| TPA: hypothetical protein ZEAMMB73_257146 [Zea mays]
          Length = 510

 Score =  261 bits (667), Expect = 6e-67,   Method: Compositional matrix adjust.
 Identities = 172/446 (38%), Positives = 232/446 (52%), Gaps = 29/446 (6%)

Query: 73  PLNSSSSFSLPLHSREILH-KTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHE 131
           P +SS S  L +  R     +TR   +    L + E+D+ R+ T+    + A   V R  
Sbjct: 68  PASSSPSLQLRMKHRSAEGGRTRKESF----LDKAEKDAVRIETM--HRRAARSGVARMP 121

Query: 132 LKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCT 191
              +  + L E     V SG + GSGEY   + VGTPPR+F M++DTGSD+NWLQC PC 
Sbjct: 122 ASSSPRRALSERMVATVESGVAVGSGEYLIDVYVGTPPRRFRMIMDTGSDLNWLQCAPCL 181

Query: 192 ECYQQSDPIFDPKTSSSYSPLPCAAPQCKSL----DVSACR---ANRCLYQVAYGDGSFT 244
           +C++Q  P+FDP  SSSY  + C   +C  +       ACR    + C Y   YGD S T
Sbjct: 182 DCFEQRGPVFDPAASSSYRNVTCGDQRCGLVAPPEAPRACRRPAEDSCPYYYWYGDQSNT 241

Query: 245 VGDLVTETVSF-----GNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT 299
            GDL  E+ +      G S  V G+  GCGH N GLF G+AGLLGLG G LS   Q++A 
Sbjct: 242 TGDLALESFTVNLTAPGASRRVDGVVFGCGHRNRGLFHGAAGLLGLGRGPLSFASQLRAV 301

Query: 300 ---SLAYCLVDRDSPASGVLEFNS-----ARGGDAVTAPLIRNKKVDTFYYVGLTGFSVG 351
              + +YCLV+  S A   + F       A      TA    +   DTFYYV L G  VG
Sbjct: 302 YGHTFSYCLVEHGSDAGSKVVFGEDYLVLAHPQLKYTAFAPTSSPADTFYYVKLKGVLVG 361

Query: 352 GQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKP-TSGVAL 410
           G  + I    +++ + G GG I+D GT ++     AY  +R +FV L   L P      +
Sbjct: 362 GDLLNISSDTWDVGKDGSGGTIIDSGTTLSYFVEPAYQVIRQAFVDLMSRLYPLIPDFPV 421

Query: 411 FDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPT-SSALSII 469
            + CY+ SG+    VP +SL F  G   D PA+NY + +D  G  C A   T  + +SII
Sbjct: 422 LNPCYNVSGVERPEVPELSLLFADGAVWDFPAENYFVRLDPDGIMCLAVRGTPRTGMSII 481

Query: 470 GNVQQQGTRVSFDLANNRVGFTPNKC 495
           GN QQQ   V +DL NNR+GF P +C
Sbjct: 482 GNFQQQNFHVVYDLQNNRLGFAPRRC 507


>gi|61214233|sp|Q766C3.1|NEP1_NEPGR RecName: Full=Aspartic proteinase nepenthesin-1; AltName:
           Full=Nepenthesin-I; Flags: Precursor
 gi|41016421|dbj|BAD07474.1| aspartic proteinase nepenthesin I [Nepenthes gracilis]
          Length = 437

 Score =  261 bits (667), Expect = 6e-67,   Method: Compositional matrix adjust.
 Identities = 167/429 (38%), Positives = 238/429 (55%), Gaps = 25/429 (5%)

Query: 80  FSLPLHS--REILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDR--HELKPA 135
           F  P HS  R  L+  RH    +     LE   +  N  +TK QL    ++R    L+  
Sbjct: 18  FVAPTHSTSRTALNH-RHEAKVTGFQIMLEHVDSGKN--LTKFQLLERAIERGSRRLQRL 74

Query: 136 EAQIL-PEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECY 194
           EA +  P    T V +G     GEY   + +GTP + FS ++DTGSD+ W QC+PCT+C+
Sbjct: 75  EAMLNGPSGVETSVYAG----DGEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPCTQCF 130

Query: 195 QQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVS 254
            QS PIF+P+ SSS+S LPC++  C++L    C  N C Y   YGDGS T G + TET++
Sbjct: 131 NQSTPIFNPQGSSSFSTLPCSSQLCQALSSPTCSNNFCQYTYGYGDGSETQGSMGTETLT 190

Query: 255 FGNSGSVKGIALGCGHDNEGLFVGS-AGLLGLGGGMLSLTKQIKATSLAYCL--VDRDSP 311
           FG S S+  I  GCG +N+G   G+ AGL+G+G G LSL  Q+  T  +YC+  +   +P
Sbjct: 191 FG-SVSIPNITFGCGENNQGFGQGNGAGLVGMGRGPLSLPSQLDVTKFSYCMTPIGSSTP 249

Query: 312 ASGVLE--FNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMD-EAG 368
           ++ +L    NS   G   T  LI++ ++ TFYY+ L G SVG   + I PS F ++   G
Sbjct: 250 SNLLLGSLANSVTAGSPNTT-LIQSSQIPTFYYITLNGLSVGSTRLPIDPSAFALNSNNG 308

Query: 369 DGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVAL-FDTCYDF-SGLRSVRVP 426
            GGII+D GT +T     AY S+R  F+    NL   +G +  FD C+   S   ++++P
Sbjct: 309 TGGIIIDSGTTLTYFVNNAYQSVRQEFISQI-NLPVVNGSSSGFDLCFQTPSDPSNLQIP 367

Query: 427 TVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANN 486
           T  +HF  G  L+LP++NY I   S G  C A   +S  +SI GN+QQQ   V +D  N+
Sbjct: 368 TFVMHFDGGD-LELPSENYFIS-PSNGLICLAMGSSSQGMSIFGNIQQQNMLVVYDTGNS 425

Query: 487 RVGFTPNKC 495
            V F   +C
Sbjct: 426 VVSFASAQC 434


>gi|242084336|ref|XP_002442593.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
 gi|241943286|gb|EES16431.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
          Length = 482

 Score =  260 bits (664), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 178/416 (42%), Positives = 229/416 (55%), Gaps = 46/416 (11%)

Query: 101 LVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYF 160
           L+  RL+RD  R   +ITK              PA+    PE+ +  VV+GA   SGEY 
Sbjct: 85  LLARRLQRDMRRAAWIITK-----------AATPAD----PENGT--VVTGAPT-SGEYI 126

Query: 161 SRIGVGTPPRQ---FSMVL--DTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCA 215
           ++I VGTP      F  +L  D GSD+ WLQC PC  CY Q  P+++   SSS S + C 
Sbjct: 127 AKITVGTPYENDSSFEALLSPDMGSDVTWLQCMPCFRCYHQPGPVYNRLKSSSASDVGCY 186

Query: 216 APQCKSLDVS-ACRA--NRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDN 272
           AP C++L  S  C    N C Y+V YGDGS + GD   ET++F     V G+A+GCG DN
Sbjct: 187 APACRALGSSGGCVQFLNECQYKVEYGDGSSSAGDFGVETLTFPPGVRVPGVAIGCGSDN 246

Query: 273 EGLFVG-SAGLLGLGGGMLSLTKQIK---ATSLAYCLVDRDSPA-SGVLEFNSARGG--- 324
           +GLF   +AG+LGLG G LS   QI      S +YCL  + +   S  L F S       
Sbjct: 247 QGLFPAPAAGILGLGRGSLSFPSQIAGRYGRSFSYCLAGQGTGGRSSTLTFGSGASATTT 306

Query: 325 ---DAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQ-IPPSLFEMDEA-GDGGIIVDCGTA 379
                   P++ N ++ TFYYVGL G SVGG  V+ +  S   +D + G GG+IVD GTA
Sbjct: 307 TTTPPSFTPMLTNSRMYTFYYVGLVGISVGGVRVRGVTESDLRLDPSTGHGGVIVDSGTA 366

Query: 380 ITRLQTQAYNSLRDSF----VRLAGNLKPTSGVALFDTCY-DFSGLRSVRVPTVSLHFGA 434
           +TRL   AY + RD+F    V+  G   P    A FDTCY    G    +VP VS+HF  
Sbjct: 367 VTRLSGPAYAAFRDAFRVAAVKELGWPSPGGPFAFFDTCYSSVRGRVMKKVPAVSMHFAG 426

Query: 435 GKALDLPAKNYLIPVDS-AGTFCFAFAPTSS-ALSIIGNVQQQGTRVSFDLANNRV 488
           G  + LP +NYLIPVDS  GT CFAFA +    +SIIGN+Q QG RV +D+   RV
Sbjct: 427 GVEVKLPPQNYLIPVDSNKGTMCFAFAGSGDRGVSIIGNIQLQGFRVVYDVDGQRV 482


>gi|225216930|gb|ACN85225.1| aspartic proteinase nepenthesin-1 precursor [Oryza punctata]
 gi|225216938|gb|ACN85232.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
          Length = 516

 Score =  260 bits (664), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 148/353 (41%), Positives = 197/353 (55%), Gaps = 15/353 (4%)

Query: 151 GASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTE-CYQQSDPIFDPKTSSSY 209
           G + G+G Y   +G+GTP  ++++V DTGSD  W+QC+PC   CY+Q + +FDP  SS+Y
Sbjct: 171 GRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQREKLFDPARSSTY 230

Query: 210 SPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCG 269
           + + CAAP C  LD   C    CLY V YGDGS+++G    +T++  +  +VKG   GCG
Sbjct: 231 ANVSCAAPACSDLDTRGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYDAVKGFRFGCG 290

Query: 270 HDNEGLFVGSAGLLGLGGGMLSLTKQI---KATSLAYCLVDRDSPASGVLEFNSARGGDA 326
             NEGLF  +AGLLGLG G  SL  Q         A+CL  R S  +G L+F +      
Sbjct: 291 ERNEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPAR-STGTGYLDFGAGSPAAR 349

Query: 327 VTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQ 386
           +T   +      TFYYVGLTG  VGG+ + IP S+F        G IVD GT ITRL   
Sbjct: 350 LTTTPMLVDNGPTFYYVGLTGIRVGGRLLYIPQSVFAT-----AGTIVDSGTVITRLPPA 404

Query: 387 AYNSLRDSFVRL--AGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKN 444
           AY+SLR +F     A   K    V+L DTCYDF+G+  V +PTVSL F  G  LD+ A  
Sbjct: 405 AYSSLRSAFAAAMSARGYKKAPAVSLLDTCYDFAGMSQVAIPTVSLLFQGGARLDVDASG 464

Query: 445 YLIPVDSAGTFCFAFAPTSSA--LSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            +    SA   C AFA       + I+GN Q +   V++D+    V F+P  C
Sbjct: 465 IMYAA-SASQVCLAFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVVSFSPGAC 516


>gi|326512608|dbj|BAJ99659.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 484

 Score =  259 bits (661), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 152/400 (38%), Positives = 217/400 (54%), Gaps = 26/400 (6%)

Query: 106 LERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGV 165
           L  D ARV+++  K+  A   V    L  A  +   +  + P   G S G+G Y   +G+
Sbjct: 100 LNDDQARVDSIHRKIAAAASPV----LDQARGK---KGVTLPAQRGISLGTGNYVVSMGL 152

Query: 166 GTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVS 225
           GTP R  ++V DTGSD++W+QC PC++CY+Q DP+FDP  SS+YS +PCA+P+C+ LD  
Sbjct: 153 GTPARDMTVVFDTGSDLSWVQCTPCSDCYEQKDPLFDPARSSTYSAVPCASPECQGLDSR 212

Query: 226 AC-RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLG 284
           +C R  +C Y+V YGD S T G L  +T++   S  + G   GCG  + GLF  + GL+G
Sbjct: 213 SCSRDKKCRYEVVYGDQSQTDGALARDTLTLTQSDVLPGFVFGCGEQDTGLFGRADGLVG 272

Query: 285 LGGGMLSLTKQIKA---TSLAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFY 341
           LG   +SL+ Q  +      +YCL    S A+G L        +A    +       +FY
Sbjct: 273 LGREKVSLSSQAASKYGAGFSYCLPSSPS-AAGYLSLGGPAPANARFTAMETRHDSPSFY 331

Query: 342 YVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGN 401
           YV L G  V G+ V++ P +F        G ++D GT ITRL  + Y +LR +F R  G 
Sbjct: 332 YVRLVGVKVAGRTVRVSPIVFSA-----AGTVIDSGTVITRLPPRVYAALRSAFARSMGR 386

Query: 402 --LKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKA--LDLPAKNYLIPVDSAGTFCF 457
              K    +++ DTCYDF+G  +VR+P+V+L F  G A  LD     Y+  V  A   C 
Sbjct: 387 YGYKRAPALSILDTCYDFTGHTTVRIPSVALVFAGGAAVGLDFSGVLYVAKVSQA---CL 443

Query: 458 AFAPTSSALS--IIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           AFAP        IIGN QQ+   V +D+A  ++GF  N C
Sbjct: 444 AFAPNGDGADAGIIGNTQQKTLAVVYDVARQKIGFGANGC 483


>gi|8979711|emb|CAB96832.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
           thaliana]
          Length = 446

 Score =  258 bits (660), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 161/435 (37%), Positives = 225/435 (51%), Gaps = 35/435 (8%)

Query: 80  FSLPLHSREILHK----TRHNDYRSLVLSRLE---RDSARVNTLITKLQLAIYNVDRHEL 132
           F LP  S  + H+    +R N+ ++     +E    D ARVN++ +KL   +      E 
Sbjct: 27  FFLPESSLHVTHRHGTCSRLNNGKATSPDHVEILRLDQARVNSIHSKLSKKLATDHVSES 86

Query: 133 KPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTE 192
           K  +          P   G++ GSG Y   +G+GTP    S++ DTGSD+ W QC+PC  
Sbjct: 87  KSTDL---------PAKDGSTLGSGNYIVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVR 137

Query: 193 -CYQQSDPIFDPKTSSSYSPLPCAAPQCKSL-----DVSACRANRCLYQVAYGDGSFTVG 246
            CY Q +PIF+P  S+SY  + C++  C SL     +  +C A+ C+Y + YGD SF+VG
Sbjct: 138 TCYDQKEPIFNPSKSTSYYNVSCSSAACGSLSSATGNAGSCSASNCIYGIQYGDQSFSVG 197

Query: 247 DLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS---LAY 303
            L  E  +  NS    G+  GCG +N+GLF G AGLLGLG   LS   Q         +Y
Sbjct: 198 FLAKEKFTLTNSDVFDGVYFGCGENNQGLFTGVAGLLGLGRDKLSFPSQTATAYNKIFSY 257

Query: 304 CLVDRDSPASGVLEFNSARGGDAVT-APLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLF 362
           CL    S  +G L F SA    +V   P+       +FY + +   +VGGQ + IP ++F
Sbjct: 258 CLPSSAS-YTGHLTFGSAGISRSVKFTPISTITDGTSFYGLNIVAITVGGQKLPIPSTVF 316

Query: 363 EMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRS 422
                   G ++D GT ITRL  +AY +LR SF         TSGV++ DTC+D SG ++
Sbjct: 317 STP-----GALIDSGTVITRLPPKAYAALRSSFKAKMSKYPTTSGVSILDTCFDLSGFKT 371

Query: 423 VRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTS--SALSIIGNVQQQGTRVS 480
           V +P V+  F  G  ++L +K  +  V      C AFA  S  S  +I GNVQQQ   V 
Sbjct: 372 VTIPKVAFSFSGGAVVELGSKG-IFYVFKISQVCLAFAGNSDDSNAAIFGNVQQQTLEVV 430

Query: 481 FDLANNRVGFTPNKC 495
           +D A  RVGF PN C
Sbjct: 431 YDGAGGRVGFAPNGC 445


>gi|242073260|ref|XP_002446566.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
 gi|241937749|gb|EES10894.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
          Length = 452

 Score =  258 bits (658), Expect = 7e-66,   Method: Compositional matrix adjust.
 Identities = 159/423 (37%), Positives = 229/423 (54%), Gaps = 31/423 (7%)

Query: 88  EILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTP 147
            + H   H +Y  L L  L+R + R +  +++L      V        +A     D   P
Sbjct: 43  RLTHVDAHGNYSRLQL--LQRAARRSHHRMSRLVARATGV--------KAVAGGGDLQVP 92

Query: 148 VVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSS 207
           V      G+GE+   + +GTP   ++ ++DTGSD+ W QC+PC +C++QS P+FDP +SS
Sbjct: 93  V----HAGNGEFLMDVAIGTPALSYAAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSSS 148

Query: 208 SYSPLPCAAPQCKSLDVSAC-RANRCLYQVAYGDGSFTVGDLVTETVSFGN-SGSVKGIA 265
           +Y+ +PC++  C  L  S C  A++C Y   YGD S T G L +ET + G     + G+A
Sbjct: 149 TYATVPCSSALCSDLPTSTCTSASKCGYTYTYGDASSTQGVLASETFTLGKEKKKLPGVA 208

Query: 266 LGCGHDNEG-LFVGSAGLLGLGGGMLSLTKQIKATSLAYCLV-----DRDSP----ASGV 315
            GCG  NEG  F   AGL+GLG G LSL  Q+     +YCL      D  SP     S  
Sbjct: 209 FGCGDTNEGDGFTQGAGLVGLGRGPLSLVSQLGLDKFSYCLTSLDDGDGKSPLLLGGSAA 268

Query: 316 LEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVD 375
               SA      T PL++N    +FYYV LTG +VG   + +P S F + + G GG+IVD
Sbjct: 269 AISESAATAPVQTTPLVKNPSQPSFYYVSLTGLTVGSTRITLPASAFAIQDDGTGGVIVD 328

Query: 376 CGTAITRLQTQAYNSLRDSFV-RLAGNLKPTSGVALFDTCYD--FSGLRSVRVPTVSLHF 432
            GT+IT L+ Q Y +L+ +FV ++A      S + L D C+     G+  V+VP + LHF
Sbjct: 329 SGTSITYLELQGYRALKKAFVAQMALPTVDGSEIGL-DLCFQGPAKGVDEVQVPKLVLHF 387

Query: 433 GAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTP 492
             G  LDLPA+NY++   ++G  C   AP S  LSIIGN QQQ  +  +D+A + + F P
Sbjct: 388 DGGADLDLPAENYMVLDSASGALCLTVAP-SRGLSIIGNFQQQNFQFVYDVAGDTLSFAP 446

Query: 493 NKC 495
            +C
Sbjct: 447 VQC 449


>gi|356516413|ref|XP_003526889.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 453

 Score =  257 bits (657), Expect = 7e-66,   Method: Compositional matrix adjust.
 Identities = 164/398 (41%), Positives = 220/398 (55%), Gaps = 26/398 (6%)

Query: 117 ITKLQLAIYNVDRHE--LKPAEAQILPEDFSTP-----VVSGASQGSGEYFSRIGVGTPP 169
           +TKL+   + + R +  L+   A +L    STP     + +    G+GEY   + +GTPP
Sbjct: 60  LTKLERVQHGIKRGKSRLQKLNAMVLAAS-STPDSEDQLEAPIHAGNGEYLIELAIGTPP 118

Query: 170 RQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRA 229
             +  VLDTGSD+ W QC+PCT CY+Q  PIFDPK SSS+S + C +  C +L  S C +
Sbjct: 119 VSYPAVLDTGSDLIWTQCKPCTRCYKQPTPIFDPKKSSSFSKVSCGSSLCSALPSSTC-S 177

Query: 230 NRCLYQVAYGDGSFTVGDLVTETVSFG---NSGSVKGIALGCGHDNEGL-FVGSAGLLGL 285
           + C Y  +YGD S T G L TET +FG   N  SV  I  GCG DNEG  F  ++GL+GL
Sbjct: 178 DGCEYVYSYGDYSMTQGVLATETFTFGKSKNKVSVHNIGFGCGEDNEGDGFEQASGLVGL 237

Query: 286 GGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNS----ARGGDAVTAPLIRNKKVDTFY 341
           G G LSL  Q+K    +YCL   D     VL   S        + VT PL++N    +FY
Sbjct: 238 GRGPLSLVSQLKEQRFSYCLTPIDDTKESVLLLGSLGKVKDAKEVVTTPLLKNPLQPSFY 297

Query: 342 YVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFV---RL 398
           Y+ L   SVG   + I  S FE+ + G+GG+I+D GT IT +Q +AY +L+  F+   +L
Sbjct: 298 YLSLEAISVGDTRLSIEKSTFEVGDDGNGGVIIDSGTTITYVQQKAYEALKKEFISQTKL 357

Query: 399 AGNLKPTSGVALFDTCYDF-SGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCF 457
           A  L  TS   L D C+   SG   V +P +  HF  G  L+LPA+NY+I   + G  C 
Sbjct: 358 A--LDKTSSTGL-DLCFSLPSGSTQVEIPKLVFHFKGGD-LELPAENYMIGDSNLGVACL 413

Query: 458 AFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           A    SS +SI GNVQQQ   V+ DL    + F P  C
Sbjct: 414 AMG-ASSGMSIFGNVQQQNILVNHDLEKETISFVPTSC 450


>gi|225216949|gb|ACN85242.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
          Length = 519

 Score =  257 bits (657), Expect = 8e-66,   Method: Compositional matrix adjust.
 Identities = 150/357 (42%), Positives = 202/357 (56%), Gaps = 19/357 (5%)

Query: 150 SGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTE-CYQQSDPIFDPKTSSS 208
           SG + G+G Y   +G+GTP  ++++V DTGSD  W+QC+PC   CY+Q + +FDP  SS+
Sbjct: 171 SGRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQREKLFDPARSST 230

Query: 209 YSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGC 268
           Y+ + CAAP C  L++  C    CLY V YGDGS+++G    +T++  +  +VKG   GC
Sbjct: 231 YANVSCAAPACSDLNIHGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYDAVKGFRFGC 290

Query: 269 GHDNEGLFVGSAGLLGLGGGMLSLTKQI---KATSLAYCLVDRDSPASGVLEFNSARGGD 325
           G  NEGLF  +AGLLGLG G  SL  Q         A+CL  R S  +G L+F +     
Sbjct: 291 GERNEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPAR-STGTGYLDFGAGSLAA 349

Query: 326 A---VTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITR 382
           A   +T P++  +   TFYYVG+TG  VGGQ + IP S+F        G IVD GT ITR
Sbjct: 350 ARARLTTPML-TENGPTFYYVGMTGIRVGGQLLSIPQSVFAT-----AGTIVDSGTVITR 403

Query: 383 LQTQAYNSLR--DSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDL 440
           L   AY+SLR   +    A   K    V+L DTCYDF+G+  V +PTVSL F  G  LD+
Sbjct: 404 LPPAAYSSLRYAFAAAMAARGYKKAPAVSLLDTCYDFTGMSQVAIPTVSLLFQGGARLDV 463

Query: 441 PAKNYLIPVDSAGTFCFAFAPTSSA--LSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            A   +    SA   C AFA       + I+GN Q +   V++D+    VGF P  C
Sbjct: 464 DASGIMYAA-SASQVCLAFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFYPGAC 519


>gi|125539168|gb|EAY85563.1| hypothetical protein OsI_06935 [Oryza sativa Indica Group]
          Length = 514

 Score =  257 bits (657), Expect = 8e-66,   Method: Compositional matrix adjust.
 Identities = 166/414 (40%), Positives = 219/414 (52%), Gaps = 31/414 (7%)

Query: 108 RDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGT 167
           +D AR++T++ ++  A          P  A  L E     V SG + GSGEY   + VGT
Sbjct: 103 KDVARIHTMLRRVAGAGGGRAATNSTPRRA--LAERIVATVESGVAVGSGEYLVDLYVGT 160

Query: 168 PPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSL----D 223
           PPR+F M++DTGSD+NWLQC PC +C++Q  P+FDP TS SY  + C  P+C  +     
Sbjct: 161 PPRRFQMIMDTGSDLNWLQCAPCLDCFEQRGPVFDPATSLSYRNVTCGDPRCGLVAPPTA 220

Query: 224 VSACR---ANRCLYQVAYGDGSFTVGDLVTETVSF-----GNSGSVKGIALGCGHDNEGL 275
             ACR   ++ C Y   YGD S T GDL  E  +      G S  V  +  GCGH N GL
Sbjct: 221 PRACRRPHSDPCPYYYWYGDQSNTTGDLALEAFTVNLTAPGASRRVDDVVFGCGHSNRGL 280

Query: 276 FVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASGVLEFNSARGGDAVTAPLI 332
           F G+AGLLGLG G LS   Q++A    + +YCLVD  S     + F      DA+     
Sbjct: 281 FHGAAGLLGLGRGALSFASQLRAVYGHAFSYCLVDHGSSVGSKIVFGD---DDALLGHPR 337

Query: 333 RN---------KKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRL 383
            N            DTFYYV L G  VGG+ + I PS +++ + G GG I+D GT ++  
Sbjct: 338 LNYTAFAPSAAAAADTFYYVQLKGVLVGGEKLNISPSTWDVGKDGSGGTIIDSGTTLSYF 397

Query: 384 QTQAYNSLRDSFVRLAGNLKP-TSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPA 442
              AY  +R +FV       P  +   +   CY+ SG+  V VP  SL F  G   D PA
Sbjct: 398 AEPAYEVIRRAFVERMDKAYPLVADFPVLSPCYNVSGVERVEVPEFSLLFADGAVWDFPA 457

Query: 443 KNYLIPVDSAGTFCFAFAPT-SSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           +NY + +D  G  C A   T  SA+SIIGN QQQ   V +DL NNR+GF P +C
Sbjct: 458 ENYFVRLDPDGIMCLAVLGTPRSAMSIIGNFQQQNFHVLYDLQNNRLGFAPRRC 511


>gi|225438315|ref|XP_002272802.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 436

 Score =  257 bits (656), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 152/391 (38%), Positives = 211/391 (53%), Gaps = 11/391 (2%)

Query: 109 DSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTP 168
           DS    T   +LQ A+    R +L+          F + V +    G+GE+  ++ +GTP
Sbjct: 50  DSGGNYTKFERLQRAM---KRGKLRLQRLSAKTASFESSVEAPVHAGNGEFLMKLAIGTP 106

Query: 169 PRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACR 228
              +S ++DTGSD+ W QC+PC +C+ Q  PIFDPK SSS+S LPC++  C +L +S+C 
Sbjct: 107 AETYSAIMDTGSDLIWTQCKPCKDCFDQPTPIFDPKKSSSFSKLPCSSDLCAALPISSC- 165

Query: 229 ANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGL-FVGSAGLLGLGG 287
           ++ C Y  +YGD S T G L TET +FG++ SV  I  GCG DN+G  F   AGL+GLG 
Sbjct: 166 SDGCEYLYSYGDYSSTQGVLATETFAFGDA-SVSKIGFGCGEDNDGSGFSQGAGLVGLGR 224

Query: 288 GMLSLTKQIKATSLAYCLVDRDSPA--SGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGL 345
           G LSL  Q+     +YCL   D     S +L  + A   +A+T PLI+N    +FYY+ L
Sbjct: 225 GPLSLISQLGEPKFSYCLTSMDDSKGISSLLVGSEATMKNAITTPLIQNPSQPSFYYLSL 284

Query: 346 TGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPT 405
            G SVG   + I  S F +   G GG+I+D GT IT L+  A+ +L+  F+         
Sbjct: 285 EGISVGDTLLPIEKSTFSIQNDGSGGLIIDSGTTITYLEDSAFAALKKEFISQLKLDVDE 344

Query: 406 SGVALFDTCYDF-SGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSS 464
           SG    D C+       +V VP +  HF  G  L LPA+NY+I     G  C     +SS
Sbjct: 345 SGSTGLDLCFTLPPDASTVDVPQLVFHF-EGADLKLPAENYIIADSGLGVICLTMG-SSS 402

Query: 465 ALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            +SI GN QQQ   V  DL    + F P +C
Sbjct: 403 GMSIFGNFQQQNIVVLHDLEKETISFAPAQC 433


>gi|357143680|ref|XP_003573011.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 510

 Score =  256 bits (655), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 178/461 (38%), Positives = 239/461 (51%), Gaps = 36/461 (7%)

Query: 61  FAEESETAAESFPLNSSSSFSLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKL 120
            AEE E        + S S  L +  R     T     +   L   ++D  R+ T+  ++
Sbjct: 57  LAEEEEQK------DRSPSLKLHMSRRSPAEATAGRTRKDSFLESAQKDGVRIATMHRRV 110

Query: 121 QL-AIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTG 179
            L A     R     +  + L E     V SG + GSGEY   + VGTPPR+F M++DTG
Sbjct: 111 ALQAQAQPGRRSASSSPRRALSERLVATVESGVAVGSGEYLVEVYVGTPPRRFQMIMDTG 170

Query: 180 SDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSA----CRANR---C 232
           SD+NWLQC PC +C+ Q  P+FDP  S+SY  + C   +C  +   A    CR++R   C
Sbjct: 171 SDLNWLQCAPCLDCFDQRGPVFDPMASTSYRNVTCGDTRCGLVSPPAAPRTCRSSRSDPC 230

Query: 233 LYQVAYGDGSFTVGDLVTE--TVSFGNSGS--VKGIALGCGHDNEGLFVGSAGLLGLGGG 288
            Y   YGD S T GDL  E  TV+   S S  V G+ LGCGH N GLF G+AGLLGLG G
Sbjct: 231 PYYYWYGDQSNTTGDLALEAFTVNLTASSSRRVDGVVLGCGHRNRGLFHGAAGLLGLGRG 290

Query: 289 MLSLTKQIKAT---SLAYCLVDRDSPASGVLEFNSARGGDAV--TAPLIR------NKKV 337
            LS   Q++A    + +YCLVD  S     + F    G D V  + P +       +   
Sbjct: 291 PLSFASQLRAVYGHAFSYCLVDHGSAVGSKIVF----GDDNVLLSHPQLNYTAFAPSAAE 346

Query: 338 DTFYYVGLTGFSVGGQAVQIPPSLFEM-DEAGDGGIIVDCGTAITRLQTQAYNSLRDSFV 396
           +TFYYV L G  VGG+ + IP + + +  E G GG I+D GT ++     AY ++R +FV
Sbjct: 347 NTFYYVQLKGILVGGEMLDIPSNTWGVSKEDGSGGTIIDSGTTLSYFPEPAYKAIRQAFV 406

Query: 397 RLAGNLKP-TSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTF 455
                  P  +   +   CY+ SG+  V VP  SL F  G   D PA+NY I +D+ G  
Sbjct: 407 DRMDKAYPLIADFPVLSPCYNVSGVERVEVPEFSLLFADGAVWDFPAENYFIRLDTEGIM 466

Query: 456 CFAFAPT-SSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           C A   T  SA+SIIGN QQQ   V +DL +NR+GF P +C
Sbjct: 467 CLAVLGTPRSAMSIIGNYQQQNFHVLYDLHHNRLGFAPRRC 507


>gi|356508918|ref|XP_003523200.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 453

 Score =  256 bits (655), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 157/394 (39%), Positives = 218/394 (55%), Gaps = 19/394 (4%)

Query: 117 ITKLQLAIYNVDRHE--LKPAEAQILPE---DFSTPVVSGASQGSGEYFSRIGVGTPPRQ 171
           +TKL+   + + R +  L+   A +L     D    + +    G+GEY   + +GTPP  
Sbjct: 61  LTKLERVQHGIKRGKSRLQRLNAMVLAASTLDSEDQLEAPIHAGNGEYLMELAIGTPPVS 120

Query: 172 FSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANR 231
           +  VLDTGSD+ W QC+PCT+CY+Q  PIFDPK SSS+S + C +  C ++  S C ++ 
Sbjct: 121 YPAVLDTGSDLIWTQCKPCTQCYKQPTPIFDPKKSSSFSKVSCGSSLCSAVPSSTC-SDG 179

Query: 232 CLYQVAYGDGSFTVGDLVTETVSFG---NSGSVKGIALGCGHDNEGL-FVGSAGLLGLGG 287
           C Y  +YGD S T G L TET +FG   N  SV  I  GCG DNEG  F  ++GL+GLG 
Sbjct: 180 CEYVYSYGDYSMTQGVLATETFTFGKSKNKVSVHNIGFGCGEDNEGDGFEQASGLVGLGR 239

Query: 288 GMLSLTKQIKATSLAYCLVDRDSPASGVLEFNS----ARGGDAVTAPLIRNKKVDTFYYV 343
           G LSL  Q+K    +YCL   D     +L   S        + VT PL++N    +FYY+
Sbjct: 240 GPLSLVSQLKEPRFSYCLTPMDDTKESILLLGSLGKVKDAKEVVTTPLLKNPLQPSFYYL 299

Query: 344 GLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFV-RLAGNL 402
            L G SVG   + I  S FE+ + G+GG+I+D GT IT ++ +A+ +L+  F+ +    L
Sbjct: 300 SLEGISVGDTRLSIEKSTFEVGDDGNGGVIIDSGTTITYIEQKAFEALKKEFISQTKLPL 359

Query: 403 KPTSGVALFDTCYDF-SGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAP 461
             TS   L D C+   SG   V +P +  HF  G  L+LPA+NY+I   + G  C A   
Sbjct: 360 DKTSSTGL-DLCFSLPSGSTQVEIPKIVFHFKGGD-LELPAENYMIGDSNLGVACLAMG- 416

Query: 462 TSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            SS +SI GNVQQQ   V+ DL    + F P  C
Sbjct: 417 ASSGMSIFGNVQQQNILVNHDLEKETISFVPTSC 450


>gi|255563827|ref|XP_002522914.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223537841|gb|EEF39457.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 442

 Score =  256 bits (654), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 155/388 (39%), Positives = 213/388 (54%), Gaps = 13/388 (3%)

Query: 117 ITKLQLAIYNVDR--HELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSM 174
           +TK Q   + + R  H L+   A +L    +  + S    G+GE+   + +GTPP  +S 
Sbjct: 56  LTKFQRIQHGIKRANHRLERLNAMVLAASSNAEINSPVLSGNGEFLMNLAIGTPPETYSA 115

Query: 175 VLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANRCLY 234
           ++DTGSD+ W QC+PCT+C+ Q  PIFDPK SSS+S L C++  CK+L  S+C ++ C Y
Sbjct: 116 IMDTGSDLIWTQCKPCTQCFDQPSPIFDPKKSSSFSKLSCSSQLCKALPQSSC-SDSCEY 174

Query: 235 QVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGL-FVGSAGLLGLGGGMLSLT 293
              YGD S T G + TET +FG   S+  +  GCG DNEG  F   +GL+GLG G LSL 
Sbjct: 175 LYTYGDYSSTQGTMATETFTFGKV-SIPNVGFGCGEDNEGDGFTQGSGLVGLGRGPLSLV 233

Query: 294 KQIKATSLAYCLVDRDSPASGVL---EFNSARGGDAV--TAPLIRNKKVDTFYYVGLTGF 348
            Q+K    +YCL   D   +  L      S  G  A   T PLI+N    +FYY+ L G 
Sbjct: 234 SQLKEAKFSYCLTSIDDTKTSTLLMGSLASVNGTSAAIRTTPLIQNPLQPSFYYLSLEGI 293

Query: 349 SVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGV 408
           SVGG  + I  S F++ + G GG+I+D GT IT L+  A++ ++  F    G     SG 
Sbjct: 294 SVGGTRLPIKESTFQLQDDGTGGLIIDSGTTITYLEESAFDLVKKEFTSQMGLPVDNSGA 353

Query: 409 ALFDTCYDF-SGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALS 467
              + CY+  S    + VP + LHF  G  L+LP +NY+I   S G  C A   +S  +S
Sbjct: 354 TGLELCYNLPSDTSELEVPKLVLHF-TGADLELPGENYMIADSSMGVICLAMG-SSGGMS 411

Query: 468 IIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           I GNVQQQ   VS DL    + F P  C
Sbjct: 412 IFGNVQQQNMFVSHDLEKETLSFLPTNC 439


>gi|225217008|gb|ACN85295.1| aspartic proteinase nepenthesin-1 precursor [Oryza coarctata]
          Length = 500

 Score =  256 bits (654), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 151/362 (41%), Positives = 200/362 (55%), Gaps = 19/362 (5%)

Query: 145 STPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTE-CYQQSDPIFDP 203
           S P  SG++ G+G Y   IG+GTP  ++++V DTGSD  W+QC PC   CY+Q + +FDP
Sbjct: 147 SLPASSGSALGTGNYVVTIGLGTPAGRYTVVFDTGSDTTWVQCEPCVVVCYKQQEKLFDP 206

Query: 204 KTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKG 263
             SS+Y+ + CAAP C  L +  C    CLY V YGDGS+++G    +T++  +  ++KG
Sbjct: 207 ARSSTYANISCAAPACSDLYIKGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYDAIKG 266

Query: 264 IALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQI---KATSLAYCLVDRDSPASGVLEFNS 320
              GCG  NEGL+  +AGLLGLG G  SL  Q         A+C   R S  +G L+F  
Sbjct: 267 FRFGCGERNEGLYGEAAGLLGLGRGKTSLPVQAYDKYGGVFAHCFPARSS-GTGYLDFGP 325

Query: 321 ARGGDAVTAPLIRNKKVD---TFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCG 377
                AV+A L     VD   TFYYVGLTG  VGG+ + IP S+F        G IVD G
Sbjct: 326 GS-LPAVSAKLTTPMLVDNGPTFYYVGLTGIRVGGKLLSIPQSVFTTS-----GTIVDSG 379

Query: 378 TAITRLQTQAYNSLRDSFVRLAGN--LKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAG 435
           T ITRL   AY+SLR +F         K    ++L DTCYDF+G+  V +PTVSL F  G
Sbjct: 380 TVITRLPPAAYSSLRSAFASAMAERGYKKAPALSLLDTCYDFTGMSEVAIPTVSLLFQGG 439

Query: 436 KALDLPAKNYLIPVDSAGTFCFAFAPTSS--ALSIIGNVQQQGTRVSFDLANNRVGFTPN 493
            +LD+ A   +I   S    C  FA       + I+GN Q +   V +D+    VGF P 
Sbjct: 440 ASLDVHASG-IIYAASVSQACLGFAGNKEDDDVGIVGNTQLKTFGVVYDIGKKVVGFCPG 498

Query: 494 KC 495
            C
Sbjct: 499 AC 500


>gi|326506682|dbj|BAJ91382.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326525815|dbj|BAJ88954.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 473

 Score =  256 bits (653), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 153/362 (42%), Positives = 209/362 (57%), Gaps = 23/362 (6%)

Query: 145 STPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPC-TECYQQSDPIFDP 203
           S P+  G S G G Y +R+G+GTP + + MV+DTGS + WLQC PC   C++QS P+FDP
Sbjct: 123 SVPLTPGTSYGVGNYVTRMGLGTPAKPYIMVVDTGSSLTWLQCSPCRVSCHRQSGPVFDP 182

Query: 204 KTSSSYSPLPCAAPQCK-----SLDVSAC-RANRCLYQVAYGDGSFTVGDLVTETVSFGN 257
           KTSSSY+ + C+ PQC      +L+ +AC  ++ C+YQ +YGD SF+VG L  +TVSFG 
Sbjct: 183 KTSSSYAAVSCSTPQCNDLSTATLNPAACSSSDVCIYQASYGDSSFSVGYLSKDTVSFG- 241

Query: 258 SGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCL-VDRDSPAS 313
           S SV     GCG DNEGLF  SAGL+GL    LSL  Q+  T   S +YCL     S   
Sbjct: 242 SNSVPNFYYGCGQDNEGLFGRSAGLMGLARNKLSLLYQLAPTLGYSFSYCLPSSSSSGYL 301

Query: 314 GVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGII 373
            +  +N    G     P++ +   D+ Y++ L+G +V G+ + +  S     E      I
Sbjct: 302 SIGSYNP---GQYSYTPMVSSTLDDSLYFIKLSGMTVAGKPLAVSSS-----EYSSLPTI 353

Query: 374 VDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFG 433
           +D GT ITRL T  Y++L  +        K     ++ DTC+      S+RVP VS+ F 
Sbjct: 354 IDSGTVITRLPTTVYDALSKAVAGAMKGTKRADAYSILDTCF-VGQASSLRVPAVSMAFS 412

Query: 434 AGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPN 493
            G AL L A+N L+ VDS+ T C AFAP  SA +IIGN QQQ   V +D+ +NR+GF   
Sbjct: 413 GGAALKLSAQNLLVDVDSSTT-CLAFAPARSA-AIIGNTQQQTFSVVYDVKSNRIGFAAG 470

Query: 494 KC 495
            C
Sbjct: 471 GC 472


>gi|115445765|ref|NP_001046662.1| Os02g0314600 [Oryza sativa Japonica Group]
 gi|46391044|dbj|BAD15987.1| putative chloroplast nucleoid DNA binding protein [Oryza sativa
           Japonica Group]
 gi|113536193|dbj|BAF08576.1| Os02g0314600 [Oryza sativa Japonica Group]
 gi|125581836|gb|EAZ22767.1| hypothetical protein OsJ_06441 [Oryza sativa Japonica Group]
 gi|215697168|dbj|BAG91162.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 514

 Score =  256 bits (653), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 165/414 (39%), Positives = 218/414 (52%), Gaps = 31/414 (7%)

Query: 108 RDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGT 167
           +D AR++T++ ++  A          P  A  L E     V SG + GSGEY   + VGT
Sbjct: 103 KDVARIHTMLRRVAGAGGGRAATNSTPRRA--LAERIVATVESGVAVGSGEYLVDLYVGT 160

Query: 168 PPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSL----D 223
           PPR+F M++DTGSD+NWLQC PC +C++Q  P+FDP  S SY  + C  P+C  +     
Sbjct: 161 PPRRFQMIMDTGSDLNWLQCAPCLDCFEQRGPVFDPAASLSYRNVTCGDPRCGLVAPPTA 220

Query: 224 VSACR---ANRCLYQVAYGDGSFTVGDLVTETVSF-----GNSGSVKGIALGCGHDNEGL 275
             ACR   ++ C Y   YGD S T GDL  E  +      G S  V  +  GCGH N GL
Sbjct: 221 PRACRRPHSDPCPYYYWYGDQSNTTGDLALEAFTVNLTAPGASRRVDDVVFGCGHSNRGL 280

Query: 276 FVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASGVLEFNSARGGDAVTAPLI 332
           F G+AGLLGLG G LS   Q++A    + +YCLVD  S     + F      DA+     
Sbjct: 281 FHGAAGLLGLGRGALSFASQLRAVYGHAFSYCLVDHGSSVGSKIVFGD---DDALLGHPR 337

Query: 333 RN---------KKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRL 383
            N            DTFYYV L G  VGG+ + I PS +++ + G GG I+D GT ++  
Sbjct: 338 LNYTAFAPSAAAAADTFYYVQLKGVLVGGEKLNISPSTWDVGKDGSGGTIIDSGTTLSYF 397

Query: 384 QTQAYNSLRDSFVRLAGNLKP-TSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPA 442
              AY  +R +FV       P  +   +   CY+ SG+  V VP  SL F  G   D PA
Sbjct: 398 AEPAYEVIRRAFVERMDKAYPLVADFPVLSPCYNVSGVERVEVPEFSLLFADGAVWDFPA 457

Query: 443 KNYLIPVDSAGTFCFAFAPT-SSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           +NY + +D  G  C A   T  SA+SIIGN QQQ   V +DL NNR+GF P +C
Sbjct: 458 ENYFVRLDPDGIMCLAVLGTPRSAMSIIGNFQQQNFHVLYDLQNNRLGFAPRRC 511


>gi|61214232|sp|Q766C2.1|NEP2_NEPGR RecName: Full=Aspartic proteinase nepenthesin-2; AltName:
           Full=Nepenthesin-II; Flags: Precursor
 gi|41016423|dbj|BAD07475.1| aspartic proteinase nepenthesin II [Nepenthes gracilis]
          Length = 438

 Score =  255 bits (652), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 158/419 (37%), Positives = 226/419 (53%), Gaps = 25/419 (5%)

Query: 89  ILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFS--- 145
           +LH  +      L +   + DS +    +TK +L    + R E +      + +  S   
Sbjct: 30  LLHHGQKRPQPGLRVDLEQVDSGKN---LTKYELIKRAIKRGERRMRSINAMLQSSSGIE 86

Query: 146 TPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKT 205
           TPV +G     GEY   + +GTP   FS ++DTGSD+ W QC PCT+C+ Q  PIF+P+ 
Sbjct: 87  TPVYAG----DGEYLMNVAIGTPDSSFSAIMDTGSDLIWTQCEPCTQCFSQPTPIFNPQD 142

Query: 206 SSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIA 265
           SSS+S LPC +  C+ L    C  N C Y   YGDGS T G + TET +F  S SV  IA
Sbjct: 143 SSSFSTLPCESQYCQDLPSETCNNNECQYTYGYGDGSTTQGYMATETFTFETS-SVPNIA 201

Query: 266 LGCGHDNEGLFVGS-AGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGG 324
            GCG DN+G   G+ AGL+G+G G LSL  Q+     +YC+    S +   L   SA  G
Sbjct: 202 FGCGEDNQGFGQGNGAGLIGMGWGPLSLPSQLGVGQFSYCMTSYGSSSPSTLALGSAASG 261

Query: 325 DAVTAP---LIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAIT 381
               +P   LI +    T+YY+ L G +VGG  + IP S F++ + G GG+I+D GT +T
Sbjct: 262 VPEGSPSTTLIHSSLNPTYYYITLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSGTTLT 321

Query: 382 RLQTQAYNSLRDSF---VRLAGNLKPTSGVALFDTCYDF-SGLRSVRVPTVSLHFGAGKA 437
            L   AYN++  +F   + L    + +SG++   TC+   S   +V+VP +S+ F  G  
Sbjct: 322 YLPQDAYNAVAQAFTDQINLPTVDESSSGLS---TCFQQPSDGSTVQVPEISMQFDGG-V 377

Query: 438 LDLPAKNYLIPVDSAGTFCFAFAPTSS-ALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           L+L  +N LI   + G  C A   +S   +SI GN+QQQ T+V +DL N  V F P +C
Sbjct: 378 LNLGEQNILIS-PAEGVICLAMGSSSQLGISIFGNIQQQETQVLYDLQNLAVSFVPTQC 435


>gi|326532056|dbj|BAK01404.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 506

 Score =  255 bits (651), Expect = 4e-65,   Method: Compositional matrix adjust.
 Identities = 169/422 (40%), Positives = 225/422 (53%), Gaps = 36/422 (8%)

Query: 100 SLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEY 159
           S  L   E+D+ R++T+  +  L+     R +  P  A  L E     V SG   GSGEY
Sbjct: 92  SFFLDSAEKDAVRIDTMHRRAALSGSAAARRDSAPRRA--LSERVVATVESGVPVGSGEY 149

Query: 160 FSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQC 219
              + +GTPPR+F M++DTGSD+NWLQC PC +C++QS PIFDP  S SY  + C   +C
Sbjct: 150 LVDVYLGTPPRRFRMIMDTGSDLNWLQCAPCLDCFEQSGPIFDPAASISYRNVTCGDDRC 209

Query: 220 KSLDVSA------CRANR---CLYQVAYGDGSFTVGDLVTE--TVSFGNSGS--VKGIAL 266
           + +   A      CR  R   C Y   YGD S T GDL  E  TV+   SG+  V G+A 
Sbjct: 210 RLVSPPAESAPRECRRPRSDPCPYYYWYGDQSNTTGDLALEAFTVNLTQSGTRRVDGVAF 269

Query: 267 GCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT----SLAYCLVDRDSPASGVLEFNSAR 322
           GCGH N GLF G+AGLLGLG G LS   Q++      + +YCLV+  S A   + F    
Sbjct: 270 GCGHRNRGLFHGAAGLLGLGRGPLSFASQLRGVYGGHAFSYCLVEHGSAAGSKIIFGH-- 327

Query: 323 GGDAVTAPLIRN-------KKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVD 375
             DA+ A    N          DTFYY+ L    VGG+AV I       D    GG I+D
Sbjct: 328 -DDALLAHPQLNYTAFAPTTDADTFYYLQLKSILVGGEAVNI-----SSDTLSAGGTIID 381

Query: 376 CGTAITRLQTQAYNSLRDSFV-RLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGA 434
            GT ++     AY ++R +F+ R++ +     G  +   CY+ SG   V VP +SL F  
Sbjct: 382 SGTTLSYFPEPAYQAIRQAFIDRMSPSYPLILGFPVLSPCYNVSGAEKVEVPELSLVFAD 441

Query: 435 GKALDLPAKNYLIPVDSAGTFCFAFAPT-SSALSIIGNVQQQGTRVSFDLANNRVGFTPN 493
           G A + PA+NY I ++  G  C A   T  S +SIIGN QQQ   V +DL +NR+GF P 
Sbjct: 442 GAAWEFPAENYFIRLEPEGIMCLAVLGTPRSGMSIIGNYQQQNFHVLYDLEHNRLGFAPR 501

Query: 494 KC 495
           +C
Sbjct: 502 RC 503


>gi|22326716|ref|NP_196638.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|18700103|gb|AAL77663.1| AT5g10770/T30N20_40 [Arabidopsis thaliana]
 gi|24111269|gb|AAN46758.1| At5g10770/T30N20_40 [Arabidopsis thaliana]
 gi|332004211|gb|AED91594.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 474

 Score =  255 bits (651), Expect = 4e-65,   Method: Compositional matrix adjust.
 Identities = 154/402 (38%), Positives = 211/402 (52%), Gaps = 28/402 (6%)

Query: 106 LERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGV 165
           L  D ARVN++ +KL   +      E K  +          P   G++ GSG Y   +G+
Sbjct: 88  LRLDQARVNSIHSKLSKKLATDHVSESKSTDL---------PAKDGSTLGSGNYIVTVGL 138

Query: 166 GTPPRQFSMVLDTGSDINWLQCRPCTE-CYQQSDPIFDPKTSSSYSPLPCAAPQCKSL-- 222
           GTP    S++ DTGSD+ W QC+PC   CY Q +PIF+P  S+SY  + C++  C SL  
Sbjct: 139 GTPKNDLSLIFDTGSDLTWTQCQPCVRTCYDQKEPIFNPSKSTSYYNVSCSSAACGSLSS 198

Query: 223 ---DVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGS 279
              +  +C A+ C+Y + YGD SF+VG L  E  +  NS    G+  GCG +N+GLF G 
Sbjct: 199 ATGNAGSCSASNCIYGIQYGDQSFSVGFLAKEKFTLTNSDVFDGVYFGCGENNQGLFTGV 258

Query: 280 AGLLGLGGGMLSLTKQIKATS---LAYCLVDRDSPASGVLEFNSARGGDAVT-APLIRNK 335
           AGLLGLG   LS   Q         +YCL    S  +G L F SA    +V   P+    
Sbjct: 259 AGLLGLGRDKLSFPSQTATAYNKIFSYCLPSSAS-YTGHLTFGSAGISRSVKFTPISTIT 317

Query: 336 KVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSF 395
              +FY + +   +VGGQ + IP ++F        G ++D GT ITRL  +AY +LR SF
Sbjct: 318 DGTSFYGLNIVAITVGGQKLPIPSTVFSTP-----GALIDSGTVITRLPPKAYAALRSSF 372

Query: 396 VRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTF 455
                    TSGV++ DTC+D SG ++V +P V+  F  G  ++L +K  +  V      
Sbjct: 373 KAKMSKYPTTSGVSILDTCFDLSGFKTVTIPKVAFSFSGGAVVELGSKG-IFYVFKISQV 431

Query: 456 CFAFAPTS--SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           C AFA  S  S  +I GNVQQQ   V +D A  RVGF PN C
Sbjct: 432 CLAFAGNSDDSNAAIFGNVQQQTLEVVYDGAGGRVGFAPNGC 473


>gi|225216960|gb|ACN85252.1| aspartic proteinase nepenthesin-1 precursor [Oryza officinalis]
          Length = 519

 Score =  255 bits (651), Expect = 5e-65,   Method: Compositional matrix adjust.
 Identities = 150/357 (42%), Positives = 202/357 (56%), Gaps = 19/357 (5%)

Query: 150 SGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTE-CYQQSDPIFDPKTSSS 208
           SG + G+G Y   +G+GTP  ++++V DTGSD  W+QC+PC   CY+Q + +FDP  SS+
Sbjct: 171 SGRALGTGNYVVTVGLGTPVSRYTVVFDTGSDTTWVQCQPCVVVCYEQREKLFDPARSST 230

Query: 209 YSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGC 268
           Y+ + CAAP C  L++  C    CLY V YGDGS+++G    +T++  +  +VKG   GC
Sbjct: 231 YANVSCAAPACSDLNIHGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYDAVKGFRFGC 290

Query: 269 GHDNEGLFVGSAGLLGLGGGMLSLTKQI---KATSLAYCLVDRDSPASGVLEFNSARGGD 325
           G  NEGLF  +AGLLGLG G  SL  Q         A+CL  R S  +G L+F +     
Sbjct: 291 GERNEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPAR-STGTGYLDFGAGSLAA 349

Query: 326 A---VTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITR 382
           A   +T P++ +    TFYYVG+TG  VGGQ + IP S+F        G IVD GT ITR
Sbjct: 350 ASARLTTPMLTDNG-PTFYYVGMTGIRVGGQLLSIPQSVFAT-----AGTIVDSGTVITR 403

Query: 383 LQTQAYNSLR--DSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDL 440
           L   AY+SLR   +    A   K    V+L DTCYDF+G+  V +PTVSL F  G  LD+
Sbjct: 404 LPPAAYSSLRYAFAAAMAARGYKKAPAVSLLDTCYDFTGMSQVAIPTVSLLFQGGARLDV 463

Query: 441 PAKNYLIPVDSAGTFCFAFAPTSSA--LSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            A   +    SA   C AFA       + I+GN Q +   V++D+    VGF P  C
Sbjct: 464 DASGIMYAA-SASQVCLAFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFYPGAC 519


>gi|30678047|ref|NP_565298.2| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|30102688|gb|AAP21262.1| At2g03200 [Arabidopsis thaliana]
 gi|110736021|dbj|BAE99983.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|330250580|gb|AEC05674.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 461

 Score =  254 bits (650), Expect = 6e-65,   Method: Compositional matrix adjust.
 Identities = 158/416 (37%), Positives = 217/416 (52%), Gaps = 27/416 (6%)

Query: 103 LSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQIL------PEDFSTPVVSGASQGS 156
           LS    DS +  T I K+Q  I N   H L    A  +      P+D +  + +    GS
Sbjct: 47  LSLRHVDSGKNLTKIQKIQRGI-NRGFHRLNRLGAVAVLAVASKPDD-TNNIKAPTHGGS 104

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAA 216
           GE+   + +G P  ++S ++DTGSD+ W QC+PCTEC+ Q  PIFDP+ SSSYS + C++
Sbjct: 105 GEFLMELSIGNPAVKYSAIVDTGSDLIWTQCKPCTECFDQPTPIFDPEKSSSYSKVGCSS 164

Query: 217 PQCKSLDVSACRANR--CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEG 274
             C +L  S C  ++  C Y   YGD S T G L TET +F +  S+ GI  GCG +NEG
Sbjct: 165 GLCNALPRSNCNEDKDACEYLYTYGDYSSTRGLLATETFTFEDENSISGIGFGCGVENEG 224

Query: 275 L-FVGSAGLLGLGGGMLSLTKQIKATSLAYCLVD-RDSPASGVLEFNSARGGDA------ 326
             F   +GL+GLG G LSL  Q+K T  +YCL    DS AS  L   S   G        
Sbjct: 225 DGFSQGSGLVGLGRGPLSLISQLKETKFSYCLTSIEDSEASSSLFIGSLASGIVNKTGAS 284

Query: 327 ------VTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAI 380
                  T  L+RN    +FYY+ L G +VG + + +  S FE+ E G GG+I+D GT I
Sbjct: 285 LDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFELAEDGTGGMIIDSGTTI 344

Query: 381 TRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDF-SGLRSVRVPTVSLHFGAGKALD 439
           T L+  A+  L++ F          SG    D C+      +++ VP +  HF  G  L+
Sbjct: 345 TYLEETAFKVLKEEFTSRMSLPVDDSGSTGLDLCFKLPDAAKNIAVPKMIFHF-KGADLE 403

Query: 440 LPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           LP +NY++   S G  C A   +S+ +SI GNVQQQ   V  DL    V F P +C
Sbjct: 404 LPGENYMVADSSTGVLCLAMG-SSNGMSIFGNVQQQNFNVLHDLEKETVSFVPTEC 458


>gi|225217056|gb|ACN85339.1| aspartic proteinase nepenthesin-1 precursor [Oryza granulata]
          Length = 521

 Score =  254 bits (649), Expect = 7e-65,   Method: Compositional matrix adjust.
 Identities = 149/357 (41%), Positives = 198/357 (55%), Gaps = 19/357 (5%)

Query: 150 SGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTE-CYQQSDPIFDPKTSSS 208
           SG + G+G Y   IG+GTP  ++++V DTGSD  W+QC+PC   CY+Q + +FDP  SS+
Sbjct: 173 SGRALGTGNYVVTIGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYKQQEKLFDPARSST 232

Query: 209 YSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGC 268
           Y+ + CAAP C  L    C    CLY V YGDGS+++G    +T++  +  +VKG   GC
Sbjct: 233 YANVSCAAPACSDLYTRGCSGGHCLYSVQYGDGSYSIGFFAMDTLTLSSYDAVKGFRFGC 292

Query: 269 GHDNEGLFVGSAGLLGLGGGMLSLTKQI---KATSLAYCLVDRDSPASGVLEF---NSAR 322
           G  NEGLF  +AGLLGLG G  SL  Q         A+CL  R S  +G L+F   + A 
Sbjct: 293 GERNEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARSS-GTGYLDFGPGSPAA 351

Query: 323 GGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITR 382
            G   T P++ +    TFYYVG+TG  VGGQ + IP S+F        G IVD GT ITR
Sbjct: 352 VGARQTTPMLTDNG-PTFYYVGMTGIRVGGQLLSIPQSVFST-----AGTIVDSGTVITR 405

Query: 383 LQTQAYNSLRDSFVR--LAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDL 440
           L   AY+SLR +F     A   K    ++L DTCYDF+G+  V +P VSL F  G  LD+
Sbjct: 406 LPPAAYSSLRSAFASAMAARGYKKAPALSLLDTCYDFTGMSEVAIPKVSLLFQGGAYLDV 465

Query: 441 PAKNYLIPVDSAGTFCFAFAPTS--SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            A   +    S    C  FA       + I+GN Q +   V +D+    VGF+P  C
Sbjct: 466 NASGIMYAA-SLSQVCLGFAANEDDDDVGIVGNTQLKTFGVVYDIGKKTVGFSPGAC 521


>gi|224085379|ref|XP_002307559.1| predicted protein [Populus trichocarpa]
 gi|222857008|gb|EEE94555.1| predicted protein [Populus trichocarpa]
          Length = 455

 Score =  254 bits (649), Expect = 7e-65,   Method: Compositional matrix adjust.
 Identities = 161/451 (35%), Positives = 232/451 (51%), Gaps = 48/451 (10%)

Query: 92  KTRHNDYRSLVLSRLERDSARVNTLITKL-----QLAIYNVDRHELKPAEAQIL------ 140
           K R ++ +   +    RD AR+ TL T++     Q  I  + + + +P E QI       
Sbjct: 3   KDRKSEGKESFVESTNRDLARIQTLHTRIIEKKNQNDISRLKKDKERP-EKQIKTVVATA 61

Query: 141 --PEDFST--------PVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPC 190
             PE + T         + SG + GSGEYF  + +GTPP+ +S++LDTGSD+NW+QC PC
Sbjct: 62  ASPESYGTGLSGQLMATLESGVTLGSGEYFMDVFIGTPPKHYSLILDTGSDLNWIQCVPC 121

Query: 191 TECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVS----ACRANR--CLYQVAYGDGSFT 244
            +C++Q+ P +DPK SSS+  + C  P+C  +        C+A    C Y   YGD S T
Sbjct: 122 HDCFEQNGPYYDPKESSSFRNIGCHDPRCHLVSSPDPPLPCKAENQTCPYFYWYGDSSNT 181

Query: 245 VGDLVTETVSF------GNS--GSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQI 296
            GD  TET +       G S    V+ +  GCGH N GLF G++GLLGLG G LS + Q+
Sbjct: 182 TGDFATETFTVNLTSPTGKSEFKRVENVMFGCGHWNRGLFHGASGLLGLGRGPLSFSSQL 241

Query: 297 KA---TSLAYCLVDRDSPASGVLEFNSARGGDAVTAPLI--------RNKKVDTFYYVGL 345
           ++    S +YCLVDR+S  +   +       D +  P +        +   VDTFYYV +
Sbjct: 242 QSLYGHSFSYCLVDRNSDTNVSSKLIFGEDKDLLNHPELNFTTLVGGKENPVDTFYYVQI 301

Query: 346 TGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPT 405
               VGG+ + IP S + M   G GG IVD GT ++     AY  ++D+FV+        
Sbjct: 302 KSIMVGGEVLNIPESTWNMTSDGVGGTIVDSGTTLSYFTEPAYQIIKDAFVKKVKGYPIV 361

Query: 406 SGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPT-SS 464
               + D CY+ SG+  + +P   + F  G   + P +NY I +D     C A   T  S
Sbjct: 362 QDFPILDPCYNVSGVEKIDLPDFGILFADGAVWNFPVENYFIRLDPEEVVCLAILGTPRS 421

Query: 465 ALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           ALSIIGN QQQ   V +D   +R+G+ P  C
Sbjct: 422 ALSIIGNYQQQNFHVLYDTKKSRLGYAPMNC 452


>gi|297817972|ref|XP_002876869.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297322707|gb|EFH53128.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 462

 Score =  254 bits (648), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 156/416 (37%), Positives = 220/416 (52%), Gaps = 27/416 (6%)

Query: 103 LSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQIL------PEDFSTPVVSGASQGS 156
           LS    DS +  T I K+Q  I N   H L    A  +      P+D +  + +    GS
Sbjct: 48  LSLRHVDSGKNLTKIQKIQRGI-NRGFHRLNRLGAVAVLAVASNPDD-TNNIKAPTHGGS 105

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAA 216
           GE+   + +G P  +++ ++DTGSD+ W QC+PCTEC+ Q  PIFDP+ SSSYS + C++
Sbjct: 106 GEFLMELSIGNPAVKYAAIVDTGSDLIWTQCKPCTECFDQPTPIFDPEKSSSYSKVGCSS 165

Query: 217 PQCKSLDVSACRANR--CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEG 274
             C +L  S C  ++  C Y   YGD S T G L TET +F +  S+ GI  GCG +NEG
Sbjct: 166 GLCNALPRSNCNEDKDSCEYLYTYGDYSSTRGLLATETFTFEDENSISGIGFGCGVENEG 225

Query: 275 L-FVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRD-----------SPASGVLEFNSAR 322
             F   +GL+GLG G LSL  Q+K T  +YCL   +           S ASG++    A 
Sbjct: 226 DGFSQGSGLVGLGRGPLSLISQLKETKFSYCLTSIEDSEASSSLFIGSLASGIVNKTGAN 285

Query: 323 --GGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAI 380
             G    T  L+RN    +FYY+ L G +VG + + +  S FE+ E G GG+I+D GT I
Sbjct: 286 LDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFELSEDGTGGMIIDSGTTI 345

Query: 381 TRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDF-SGLRSVRVPTVSLHFGAGKALD 439
           T L+  A+  L++ F          SG    D C+   +  +++ VP +  HF  G  L+
Sbjct: 346 TYLEETAFKVLKEEFTSRMSLPVDDSGSTGLDLCFKLPNAAKNIAVPKLIFHF-KGADLE 404

Query: 440 LPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           LP +NY++   S G  C A   +S+ +SI GNVQQQ   V  DL    V F P +C
Sbjct: 405 LPGENYMVADSSTGVLCLAMG-SSNGMSIFGNVQQQNFNVLHDLEKETVTFVPTEC 459


>gi|224060469|ref|XP_002300215.1| predicted protein [Populus trichocarpa]
 gi|222847473|gb|EEE85020.1| predicted protein [Populus trichocarpa]
          Length = 439

 Score =  253 bits (645), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 164/418 (39%), Positives = 228/418 (54%), Gaps = 19/418 (4%)

Query: 87  REILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDR--HELKPAEAQILPEDF 144
           R + H      +R     RL+   +  N  +TKL+   + V R  + L+  +A  L    
Sbjct: 29  RALEHPKMQKGFRV----RLKHVDSGKN--LTKLERIRHGVKRGRNRLQRLQAMALVASS 82

Query: 145 STPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPK 204
           S+ + +    G+GE+  ++ +GTPP  +S +LDTGSD+ W QC+PCT+C+ QS PIFDPK
Sbjct: 83  SSEIEAPVLPGNGEFLMKLAIGTPPETYSAILDTGSDLIWTQCKPCTQCFHQSTPIFDPK 142

Query: 205 TSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGI 264
            SSS+S L C++  C++L  S+C  N C Y  +YGD S T G L +ET++FG + SV  +
Sbjct: 143 KSSSFSKLSCSSQLCEALPQSSCN-NGCEYLYSYGDYSSTQGILASETLTFGKA-SVPNV 200

Query: 265 ALGCGHDNEGL-FVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARG 323
           A GCG DNEG  F   AGL+GLG G LSL  Q+K    +YCL   D   +  L   S   
Sbjct: 201 AFGCGADNEGSGFSQGAGLVGLGRGPLSLVSQLKEPKFSYCLTTVDDTKTSTLLMGSLAS 260

Query: 324 GDA-----VTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGT 378
            +A      T PLI +    +FYY+ L G SVG   + I  S F + + G GG+I+D GT
Sbjct: 261 VNASSSAIKTTPLIHSPAHPSFYYLSLEGISVGDTRLPIKKSTFSLQDDGSGGLIIDSGT 320

Query: 379 AITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDF-SGLRSVRVPTVSLHFGAGKA 437
            IT L+  A+N +   F         +SG    D C+   SG  ++ VP +  HF  G  
Sbjct: 321 TITYLEESAFNLVAKEFTAKINLPVDSSGSTGLDVCFTLPSGSTNIEVPKLVFHFD-GAD 379

Query: 438 LDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           L+LPA+NY+I   S G  C A   +SS +SI GNVQQQ   V  DL    + F P +C
Sbjct: 380 LELPAENYMIGDSSMGVACLAMG-SSSGMSIFGNVQQQNMLVLHDLEKETLSFLPTQC 436


>gi|225216973|gb|ACN85264.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
          Length = 517

 Score =  253 bits (645), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 149/357 (41%), Positives = 202/357 (56%), Gaps = 19/357 (5%)

Query: 150 SGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTE-CYQQSDPIFDPKTSSS 208
           SG + G+G Y   +G+GTP  ++++V DTGSD  W+QC+PC   CY+Q + +FDP  SS+
Sbjct: 169 SGRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQQEKLFDPVRSST 228

Query: 209 YSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGC 268
           Y+ + CAAP C  L++  C    CLY V YGDGS+++G    +T++  +  +VKG   GC
Sbjct: 229 YANVSCAAPACSDLNIHGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYDAVKGFRFGC 288

Query: 269 GHDNEGLFVGSAGLLGLGGGMLSLTKQI---KATSLAYCLVDRDSPASGVLEFNSARGGD 325
           G  NEGLF  +AGLLGLG G  SL  Q         A+CL  R S  +G L+F +     
Sbjct: 289 GERNEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPAR-STGTGYLDFGAGSPAA 347

Query: 326 A---VTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITR 382
           A   +T P++ +    TFYY+G+TG  VGGQ + IP S+F        G IVD GT ITR
Sbjct: 348 ASARLTTPMLTDNG-PTFYYIGMTGIRVGGQLLSIPQSVFAT-----AGTIVDSGTVITR 401

Query: 383 LQTQAYNSLR--DSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDL 440
           L   AY+SLR   +    A   K    V+L DTCYDF+G+  V +PTVSL F  G  LD+
Sbjct: 402 LPPPAYSSLRYAFAAAMAARGYKKAPAVSLLDTCYDFTGMSQVAIPTVSLLFQGGARLDV 461

Query: 441 PAKNYLIPVDSAGTFCFAFAPTSSA--LSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            A   +    SA   C AFA       + I+GN Q +   V++D+    VGF P  C
Sbjct: 462 DASGIMYAA-SASQVCLAFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFYPGVC 517


>gi|225430555|ref|XP_002285593.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
          Length = 481

 Score =  252 bits (644), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 167/505 (33%), Positives = 255/505 (50%), Gaps = 53/505 (10%)

Query: 13  TTILFSFCLFTS--ASSRGLS-ETATTVLDVSSALQQTEHILSFEPETL---EPFAEESE 66
           T  L  F L+++  +S RGL+ +   T L   S L    HI S  P ++    P  ++  
Sbjct: 7   TIFLLKFLLYSALLSSKRGLAFQGRKTALSTPSTLHNV-HITSLMPSSVCSPSPKGDDKR 65

Query: 67  TAAESFPLNSSSSFSLPLHSREILHKTRHNDYRSLVLSR-LERDSARVNTLITKLQLAIY 125
            + E             +H      K   +  RS   ++ L++D +RVN++  + +LA  
Sbjct: 66  ASLEV------------IHKHGPCSKLSQDKGRSPSRTQMLDQDESRVNSI--RSRLAKN 111

Query: 126 NVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWL 185
             D  +LK ++  +       P  SG++ G+G Y   +G+GTP R  + + DTGSD+ W 
Sbjct: 112 PADGGKLKGSKVTL-------PSKSGSTIGTGNYVVTVGLGTPKRDLTFIFDTGSDLTWT 164

Query: 186 QCRPCTE-CYQQSDPIFDPKTSSSYSPLPCAAPQCKSL-----DVSACRANRCLYQVAYG 239
           QC PC   CY Q +PIF+P  S+SY+ + C++P C  L     +  +C A+ C+Y + YG
Sbjct: 165 QCEPCARYCYHQQEPIFNPSKSTSYTNISCSSPTCDELKSGTGNSPSCSASTCVYGIQYG 224

Query: 240 DGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSL---TKQI 296
           D S++VG    + ++  ++        GCG +N GLFVG AGL+GLG   LSL   T Q 
Sbjct: 225 DQSYSVGFFAQDKLALTSTDVFNNFLFGCGQNNRGLFVGVAGLIGLGRNALSLVSQTAQK 284

Query: 297 KATSLAYCLVDRDSPASGVLEFNSARGGDAVT--APLIRNKKVDTFYYVGLTGFSVGGQA 354
                +YCL    S ++G L F S  G        P + N +  +FY++ L   SVGG+ 
Sbjct: 285 YGKLFSYCLPSTSS-STGYLTFGSGGGTSKAVKFTPSLVNSQGPSFYFLNLIAISVGGRK 343

Query: 355 VQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTC 414
           +    S+F        G I+D GT I+RL   AY+ LR SF +        +  ++ DTC
Sbjct: 344 LSTSASVFST-----AGTIIDSGTVISRLPPTAYSDLRASFQQQMSKYPKAAPASILDTC 398

Query: 415 YDFSGLRSVRVPTVSLHFGAGKALDLPAKN--YLIPVDSAGTFCFAFAPTSSA--LSIIG 470
           YDFS   +V VP ++L+F  G  +DL      Y++ +      C AFA  S A  ++I+G
Sbjct: 399 YDFSQYDTVDVPKINLYFSDGAEMDLDPSGIFYILNISQV---CLAFAGNSDATDIAILG 455

Query: 471 NVQQQGTRVSFDLANNRVGFTPNKC 495
           NVQQ+   V +D+A  R+GF P  C
Sbjct: 456 NVQQKTFDVVYDVAGGRIGFAPGGC 480


>gi|413944392|gb|AFW77041.1| hypothetical protein ZEAMMB73_800604 [Zea mays]
          Length = 476

 Score =  251 bits (642), Expect = 4e-64,   Method: Compositional matrix adjust.
 Identities = 139/360 (38%), Positives = 199/360 (55%), Gaps = 17/360 (4%)

Query: 147 PVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCT-ECYQQSDPIFDPKT 205
           P  +G S  + E+   +G GTP + ++++ DTGSD++W+QC PC+  CY+Q DPIFDP  
Sbjct: 123 PDSTGTSLDTLEFVVTVGFGTPAQTYTVIFDTGSDVSWIQCLPCSGHCYKQHDPIFDPTK 182

Query: 206 SSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIA 265
           S++YS +PC  PQC + D S C    CLY+V YGDGS + G L  ET+S  ++ ++ G A
Sbjct: 183 SATYSVVPCGHPQCAAADGSKCSNGTCLYKVEYGDGSSSAGVLSHETLSLTSTRALPGFA 242

Query: 266 LGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASGVLEF---N 319
            GCG  N G F    GL+GLG G LSL+ Q  A+   + +YCL   D+   G L      
Sbjct: 243 FGCGQTNLGDFGDVDGLIGLGRGQLSLSSQAAASFGGTFSYCL-PSDNTTHGYLTIGPTT 301

Query: 320 SARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTA 379
            A   D     +++ +   +FY+V L    +GG  + +PP+LF      D G  +D GT 
Sbjct: 302 PASNDDVQYTAMVQKQDYPSFYFVELVSIDIGGYILPVPPTLFT-----DDGTFLDSGTI 356

Query: 380 ITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALD 439
           +T L  +AY +LRD F       KP      FDTCYDF+G  ++ +P VS  F  G   D
Sbjct: 357 LTYLPPEAYTALRDRFKFTMTQYKPAPAYDPFDTCYDFTGQSAIFIPAVSFKFSDGSVFD 416

Query: 440 LPAKNYLI-PVDSA---GTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           L     LI P D+A   G   F   P++   +I+GN+QQ+ T V +D+A  ++GF    C
Sbjct: 417 LSFFGILIFPDDTAPAIGCLGFVARPSAMPFTIVGNMQQRNTEVIYDVAAEKIGFASASC 476


>gi|242045120|ref|XP_002460431.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
 gi|241923808|gb|EER96952.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
          Length = 481

 Score =  251 bits (642), Expect = 5e-64,   Method: Compositional matrix adjust.
 Identities = 147/403 (36%), Positives = 220/403 (54%), Gaps = 26/403 (6%)

Query: 106 LERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGV 165
           L+RD  RV+++    +LA          P+ A    +  S P   G   G+  Y   +G+
Sbjct: 91  LDRDQDRVDSI---HRLAAARPSSTADDPSSAS---KGVSLPARRGVPLGTANYIVSVGL 144

Query: 166 GTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVS 225
           GTP R   +V DTGSD++W+QC+PC  CYQQ DP+FDP  S++YS +PC A +C+ LD  
Sbjct: 145 GTPKRDLLVVFDTGSDLSWVQCKPCDGCYQQHDPLFDPSQSTTYSAVPCGAQECRRLDSG 204

Query: 226 ACRANRCLYQVAYGDGSFTVGDLVTETVSFG------NSGSVKGIALGCGHDNEGLFVGS 279
           +C + +C Y+V YGD S T G+L  +T++ G      +S  ++    GCG D+ GLF  +
Sbjct: 205 SCSSGKCRYEVVYGDMSQTDGNLARDTLTLGPSSSSSSSDQLQEFVFGCGDDDTGLFGKA 264

Query: 280 AGLLGLGGGMLSLTKQIKA---TSLAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNKK 336
            GL GLG   +SL  Q  A      +YCL    S A G L   SA   +A    ++    
Sbjct: 265 DGLFGLGRDRVSLASQAAAKYGAGFSYCLPS-SSTAEGYLSLGSAAPPNARFTAMVTRSD 323

Query: 337 VDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFV 396
             +FYY+ L G  V G+ V++ P++F        G ++D GT ITRL ++AY +LR SF 
Sbjct: 324 TPSFYYLNLVGIKVAGRTVRVSPAVFRTP-----GTVIDSGTVITRLPSRAYAALRSSFA 378

Query: 397 RLAG--NLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGT 454
            L    + K    +++ DTCYDF+G   V++P+V+L F  G  L+L     L  V +   
Sbjct: 379 GLMRRYSYKRAPALSILDTCYDFTGRNKVQIPSVALLFDGGATLNLGFGEVLY-VANKSQ 437

Query: 455 FCFAFAPT--SSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            C AFA     ++++I+GN+QQ+   V +D+AN ++GF    C
Sbjct: 438 ACLAFASNGDDTSIAILGNMQQKTFAVVYDVANQKIGFGAKGC 480


>gi|225217022|gb|ACN85307.1| aspartic proteinase nepenthesin-1 precursor [Oryza ridleyi]
          Length = 525

 Score =  251 bits (640), Expect = 7e-64,   Method: Compositional matrix adjust.
 Identities = 148/352 (42%), Positives = 194/352 (55%), Gaps = 19/352 (5%)

Query: 155 GSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTE-CYQQSDPIFDPKTSSSYSPLP 213
           G+G Y   IG+GTP  ++++V DTGSD  W+QC PC   CY+Q + +FDP  SS+ + + 
Sbjct: 182 GTGNYVVTIGLGTPAGRYTVVFDTGSDTTWVQCEPCVVVCYEQQEKLFDPARSSTDANIS 241

Query: 214 CAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNE 273
           CAAP C  L    C    CLY V YGDGS+++G    +T++  +  ++KG   GCG  NE
Sbjct: 242 CAAPACSDLYTKGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYDAIKGFRFGCGERNE 301

Query: 274 GLFVGSAGLLGLGGGMLSLTKQI---KATSLAYCLVDRDSPASGVLEFNSARGGDAVTAP 330
           GLF  +AGLLGLG G  SL  Q         A+C   R S  +G L+F       AV+  
Sbjct: 302 GLFGEAAGLLGLGRGKTSLPVQAYDKYGGVFAHCFPARSS-GTGYLDFGPGS-SPAVSTK 359

Query: 331 LIRNKKVD---TFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQA 387
           L     VD   TFYYVGLTG  VGG+ + IPPS+F        G IVD GT ITRL   A
Sbjct: 360 LTTPMLVDNGLTFYYVGLTGIRVGGKLLSIPPSVFTT-----AGTIVDSGTVITRLPPAA 414

Query: 388 YNSLRDSFVR--LAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNY 445
           Y+SLR +F     A   K    ++L DTCYDF+G+  V +PTVSL F  G +LD+ A   
Sbjct: 415 YSSLRSAFASAIAARGYKKAPALSLLDTCYDFTGMSQVAIPTVSLLFQGGASLDVDASG- 473

Query: 446 LIPVDSAGTFCFAFAPTSS--ALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           +I   S    C  FA       + I+GN Q +   V +D+    VGF+P  C
Sbjct: 474 IIYAASVSQACLGFAANEEDDDVGIVGNTQLKTFGVVYDIGKKVVGFSPGAC 525


>gi|357167693|ref|XP_003581287.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 468

 Score =  250 bits (638), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 157/421 (37%), Positives = 218/421 (51%), Gaps = 30/421 (7%)

Query: 89  ILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPV 148
           + H   H +Y  L L  L R + R +  +++L      V R      +A   P D   PV
Sbjct: 61  LTHVDAHGNYTKLQL--LRRAARRSHHRMSRL------VARTATGSVKAAAAP-DLQVPV 111

Query: 149 VSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSS 208
                 G+GE+   + +GTP   ++ ++DTGSD+ W QC+PC EC+ QS P+FDP +SS+
Sbjct: 112 ----HAGNGEFLMDMSIGTPALAYAAIVDTGSDLVWTQCKPCVECFNQSTPVFDPSSSST 167

Query: 209 YSPLPCAAPQCKSLDVSACR--ANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIAL 266
           YS LPC++  C  L  S C   A  C Y   YGD S T G L  ET +   +  + G+A 
Sbjct: 168 YSTLPCSSSLCSDLPTSTCTSAAKDCGYTYTYGDASSTQGVLAAETFTLAKT-KLPGVAF 226

Query: 267 GCGHDNEGL-FVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNS----- 320
           GCG  NEG  F   AGL+GLG G LSL  Q+     +YCL   D  +   L   S     
Sbjct: 227 GCGDTNEGDGFTQGAGLVGLGRGPLSLVSQLGLGKFSYCLTSLDDTSKSPLLLGSLAAIS 286

Query: 321 ---ARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCG 377
              A      T PLI+N    +FYYV L   +VG   + +P S F + + G GG+IVD G
Sbjct: 287 TDTASAAAIQTTPLIKNPSQPSFYYVTLKALTVGSTRIPLPGSAFAVQDDGTGGVIVDSG 346

Query: 378 TAITRLQTQAYNSLRDSF-VRLAGNLKPTSGVALFDTCYD--FSGLRSVRVPTVSLHFGA 434
           T+IT L+ Q Y  L+ +F  ++   +   S V L D C+    SG+  V VP + LHF  
Sbjct: 347 TSITYLELQGYRPLKKAFAAQMKLPVADGSAVGL-DLCFKAPASGVDDVEVPKLVLHFDG 405

Query: 435 GKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNK 494
           G  LDLPA+NY++   ++G  C      S  LSIIGN QQQ  +  +D+  + + F P +
Sbjct: 406 GADLDLPAENYMVLDSASGALCLTVM-GSRGLSIIGNFQQQNIQFVYDVDKDTLSFAPVQ 464

Query: 495 C 495
           C
Sbjct: 465 C 465


>gi|326494754|dbj|BAJ94496.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326514480|dbj|BAJ96227.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 449

 Score =  250 bits (638), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 142/356 (39%), Positives = 196/356 (55%), Gaps = 22/356 (6%)

Query: 155 GSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPC 214
           G+GE+   + +GTP   ++ ++DTGSD+ W QC+PC EC+ QS P+FDP +SS+Y+ LPC
Sbjct: 98  GNGEFLMDMSIGTPAVAYAAIIDTGSDLVWTQCKPCVECFNQSTPVFDPSSSSTYAALPC 157

Query: 215 AAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEG 274
           ++  C  L  S C + +C Y   YGD S T G L  ET +   +  +  +A GCG  NEG
Sbjct: 158 SSTLCSDLPSSKCTSAKCGYTYTYGDSSSTQGVLAAETFTLAKT-KLPDVAFGCGDTNEG 216

Query: 275 -LFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNS--------ARGGD 325
             F   AGL+GLG G LSL  Q+     +YCL   D  +   L   S        A    
Sbjct: 217 DGFTQGAGLVGLGRGPLSLVSQLGLNKFSYCLTSLDDTSKSPLLLGSLATISESAAAASS 276

Query: 326 AVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQT 385
             T PLIRN    +FYYV L G +VG   + +P S F + + G GG+IVD GT+IT L+ 
Sbjct: 277 VQTTPLIRNPSQPSFYYVNLKGLTVGSTHITLPSSAFAVQDDGTGGVIVDSGTSITYLEL 336

Query: 386 QAYNSLRDSFVRLAGNLK----PTSGVALFDTCYD--FSGLRSVRVPTVSLHFGAGKALD 439
           Q Y +L+ +F   A  +K      SG+ L DTC++   SG+  V VP +  H   G  LD
Sbjct: 337 QGYRALKKAF---AAQMKLPAADGSGIGL-DTCFEAPASGVDQVEVPKLVFHLD-GADLD 391

Query: 440 LPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           LPA+NY++    +G  C      S  LSIIGN QQQ  +  +D+  N + F P +C
Sbjct: 392 LPAENYMVLDSGSGALCLTVM-GSRGLSIIGNFQQQNIQFVYDVGENTLSFAPVQC 446


>gi|224126751|ref|XP_002329464.1| predicted protein [Populus trichocarpa]
 gi|222870144|gb|EEF07275.1| predicted protein [Populus trichocarpa]
          Length = 439

 Score =  249 bits (637), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 160/419 (38%), Positives = 228/419 (54%), Gaps = 18/419 (4%)

Query: 86  SREIL-HKTRHNDYRSLVLSRLER-DSARVNTLITKLQLAIYNVDRHELKPAEAQILPED 143
           SR +L H    N +R+    +L+  DS +  T   ++Q  +    RH L+  +A  L   
Sbjct: 27  SRRVLEHPKVQNGFRA----KLKHVDSGKNLTKFERIQHGVKR-GRHRLQRFKAMALVAS 81

Query: 144 FSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDP 203
            ++ + +    G+GE+  ++ +GTPP  +S ++DTGSD+ W QC+PCT+C+ Q  PIFDP
Sbjct: 82  SNSEIDAPVLPGNGEFLMKLAIGTPPETYSAIMDTGSDLIWTQCKPCTQCFDQPTPIFDP 141

Query: 204 KTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKG 263
           K SSS+S L C++  C++L  S C ++ C Y   YGD S T G L +ET++FG   SV  
Sbjct: 142 KKSSSFSKLSCSSKLCEALPQSTC-SDGCEYLYGYGDYSSTQGMLASETLTFGKV-SVPE 199

Query: 264 IALGCGHDNEGL-FVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNS-- 320
           +A GCG DNEG  F   +GL+GLG G LSL  Q+K    +YCL   D   +  L   S  
Sbjct: 200 VAFGCGEDNEGSGFSQGSGLVGLGRGPLSLVSQLKEPKFSYCLTSVDDTKASTLLMGSLA 259

Query: 321 ---ARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCG 377
              A   +  T PLI+N    +FYY+ L G SVG  ++ I  S F + E G GG+I+D G
Sbjct: 260 SVKASDSEIKTTPLIQNSAQPSFYYLSLEGISVGDTSLPIKKSTFSLQEDGSGGLIIDSG 319

Query: 378 TAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDF-SGLRSVRVPTVSLHFGAGK 436
           T IT L+  A++ +   F          SG    + C+   SG   + VP +  HF  G 
Sbjct: 320 TTITYLEQSAFDLVAKEFTSQINLPVDNSGSTGLEVCFTLPSGSTDIEVPKLVFHFD-GA 378

Query: 437 ALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            L+LPA+NY+I   S G  C A   +SS +SI GN+QQQ   V  DL    + F P +C
Sbjct: 379 DLELPAENYMIADASMGVACLAMG-SSSGMSIFGNIQQQNMLVLHDLEKETLSFLPTQC 436


>gi|115458644|ref|NP_001052922.1| Os04g0448300 [Oryza sativa Japonica Group]
 gi|113564493|dbj|BAF14836.1| Os04g0448300 [Oryza sativa Japonica Group]
 gi|215766465|dbj|BAG98773.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215767943|dbj|BAH00172.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 454

 Score =  249 bits (637), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 143/366 (39%), Positives = 200/366 (54%), Gaps = 20/366 (5%)

Query: 143 DFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFD 202
           D   PV      G+GE+   + +GTP   +S ++DTGSD+ W QC+PC +C++QS P+FD
Sbjct: 93  DLQVPV----HAGNGEFLMDVSIGTPALAYSAIVDTGSDLVWTQCKPCVDCFKQSTPVFD 148

Query: 203 PKTSSSYSPLPCAAPQCKSLDVSAC-RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSV 261
           P +SS+Y+ +PC++  C  L  S C  A++C Y   YGD S T G L TET +   S  +
Sbjct: 149 PSSSSTYATVPCSSASCSDLPTSKCTSASKCGYTYTYGDSSSTQGVLATETFTLAKS-KL 207

Query: 262 KGIALGCGHDNEG-LFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNS 320
            G+  GCG  NEG  F   AGL+GLG G LSL  Q+     +YCL   D   +  L   S
Sbjct: 208 PGVVFGCGDTNEGDGFSQGAGLVGLGRGPLSLVSQLGLDKFSYCLTSLDDTNNSPLLLGS 267

Query: 321 ARG--------GDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGI 372
             G            T PLI+N    +FYYV L   +VG   + +P S F + + G GG+
Sbjct: 268 LAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTGGV 327

Query: 373 IVDCGTAITRLQTQAYNSLRDSF-VRLAGNLKPTSGVALFDTCYD--FSGLRSVRVPTVS 429
           IVD GT+IT L+ Q Y +L+ +F  ++A      SGV L D C+     G+  V VP + 
Sbjct: 328 IVDSGTSITYLEVQGYRALKKAFAAQMALPAADGSGVGL-DLCFRAPAKGVDQVEVPRLV 386

Query: 430 LHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVG 489
            HF  G  LDLPA+NY++    +G  C      S  LSIIGN QQQ  +  +D+ ++ + 
Sbjct: 387 FHFDGGADLDLPAENYMVLDGGSGALCLTVM-GSRGLSIIGNFQQQNFQFVYDVGHDTLS 445

Query: 490 FTPNKC 495
           F P +C
Sbjct: 446 FAPVQC 451


>gi|357123876|ref|XP_003563633.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 503

 Score =  249 bits (636), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 155/358 (43%), Positives = 202/358 (56%), Gaps = 22/358 (6%)

Query: 151 GASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCT-ECYQQSDPIFDPKTSSSY 209
           G S G+  Y   IG+GTPP +F++V DTGSD  W+QCRPC   CY+Q D +FDP  SS+Y
Sbjct: 155 GLSLGTANYVVPIGLGTPPSRFTVVFDTGSDTTWVQCRPCVVSCYKQKDRLFDPAKSSTY 214

Query: 210 SPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCG 269
           + + CA P C  LD S C A  CLY + YGDGS+TVG    +T++     ++KG   GCG
Sbjct: 215 ANVSCADPACADLDASGCNAGHCLYGIQYGDGSYTVGFFAKDTLAVAQD-AIKGFKFGCG 273

Query: 270 HDNEGLFVGSAGLLGLGGGMLSLTKQI---KATSLAYCLVDRDSPASGVLEF----NSAR 322
             N GLF  +AGLLGLG G  S+T Q       S +YCL    S A+G LEF     S+ 
Sbjct: 274 EKNRGLFGQTAGLLGLGRGPTSITVQAYEKYGGSFSYCL-PASSAATGYLEFGPLSPSSS 332

Query: 323 GGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAV-QIPPSLFEMDEAGDGGIIVDCGTAIT 381
           G +A T P++ +K   TFYYVGLTG  VGG+ +  IP S+F      + G +VD GT IT
Sbjct: 333 GSNAKTTPMLTDKG-PTFYYVGLTGIRVGGKQLGAIPESVFS-----NSGTLVDSGTVIT 386

Query: 382 RL--QTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALD 439
           RL     A  S   +    A   K  +  ++ DTCYDF+GL  V +PTVSL F  G  LD
Sbjct: 387 RLPDTAYAALSSAFAAAMAASGYKKAAAYSILDTCYDFTGLSQVSLPTVSLVFQGGACLD 446

Query: 440 LPAKNYLIPVDSAGTFCFAFAPTS--SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           L A   +  + S    C  FA      ++ I+GN QQ+   V +D++   VGF P  C
Sbjct: 447 LDASGIVYAI-SQSQVCLGFASNGDDESVGIVGNTQQRTYGVLYDVSKKVVGFAPGAC 503


>gi|38344829|emb|CAD40873.2| OSJNBa0064H22.10 [Oryza sativa Japonica Group]
 gi|116310063|emb|CAH67084.1| H0818E04.1 [Oryza sativa Indica Group]
 gi|116310186|emb|CAH67198.1| OSIGBa0152K17.10 [Oryza sativa Indica Group]
          Length = 444

 Score =  249 bits (636), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 143/366 (39%), Positives = 200/366 (54%), Gaps = 20/366 (5%)

Query: 143 DFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFD 202
           D   PV      G+GE+   + +GTP   +S ++DTGSD+ W QC+PC +C++QS P+FD
Sbjct: 83  DLQVPV----HAGNGEFLMDVSIGTPALAYSAIVDTGSDLVWTQCKPCVDCFKQSTPVFD 138

Query: 203 PKTSSSYSPLPCAAPQCKSLDVSAC-RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSV 261
           P +SS+Y+ +PC++  C  L  S C  A++C Y   YGD S T G L TET +   S  +
Sbjct: 139 PSSSSTYATVPCSSASCSDLPTSKCTSASKCGYTYTYGDSSSTQGVLATETFTLAKS-KL 197

Query: 262 KGIALGCGHDNEG-LFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNS 320
            G+  GCG  NEG  F   AGL+GLG G LSL  Q+     +YCL   D   +  L   S
Sbjct: 198 PGVVFGCGDTNEGDGFSQGAGLVGLGRGPLSLVSQLGLDKFSYCLTSLDDTNNSPLLLGS 257

Query: 321 ARG--------GDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGI 372
             G            T PLI+N    +FYYV L   +VG   + +P S F + + G GG+
Sbjct: 258 LAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTGGV 317

Query: 373 IVDCGTAITRLQTQAYNSLRDSF-VRLAGNLKPTSGVALFDTCYD--FSGLRSVRVPTVS 429
           IVD GT+IT L+ Q Y +L+ +F  ++A      SGV L D C+     G+  V VP + 
Sbjct: 318 IVDSGTSITYLEVQGYRALKKAFAAQMALPAADGSGVGL-DLCFRAPAKGVDQVEVPRLV 376

Query: 430 LHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVG 489
            HF  G  LDLPA+NY++    +G  C      S  LSIIGN QQQ  +  +D+ ++ + 
Sbjct: 377 FHFDGGADLDLPAENYMVLDGGSGALCLTVM-GSRGLSIIGNFQQQNFQFVYDVGHDTLS 435

Query: 490 FTPNKC 495
           F P +C
Sbjct: 436 FAPVQC 441


>gi|356553775|ref|XP_003545228.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 559

 Score =  249 bits (636), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 168/483 (34%), Positives = 245/483 (50%), Gaps = 56/483 (11%)

Query: 62  AEESETAAESFPLNSSSSFSLPLHSREILHKT--RHNDYRSLVLSRLERDSARVNTL--- 116
            EE++  +E+FP       S+ LH +   H++  +  + ++ V+    RD  R+  L   
Sbjct: 81  GEETDEESEAFPAPKPHKNSVKLHLK---HRSGSKGAEPKNSVIDSTVRDLTRIQNLHRR 137

Query: 117 ---------ITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSG---------ASQGSGE 158
                    I++LQ       +   KP  A   P   ST  VSG          S GSGE
Sbjct: 138 VIENRNQNTISRLQRLQKEQPKQSFKPVFA---PAASSTSPVSGQLVATLESGVSLGSGE 194

Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQ 218
           YF  + VGTPP+ FS++LDTGSD+NW+QC PC  C++QS P +DPK SSS+  + C  P+
Sbjct: 195 YFMDVFVGTPPKHFSLILDTGSDLNWIQCVPCIACFEQSGPYYDPKDSSSFRNISCHDPR 254

Query: 219 CKSLDV----SACRANR--CLYQVAYGDGSFTVGDLVTETVSF------GNS--GSVKGI 264
           C+ +      + C+A    C Y   YGDGS T GD   ET +       G S    V+ +
Sbjct: 255 CQLVSSPDPPNPCKAENQSCPYFYWYGDGSNTTGDFALETFTVNLTTPNGKSELKHVENV 314

Query: 265 ALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKA---TSLAYCLVDRDSPASGVLEFNSA 321
             GCGH N GLF G+AGLLGLG G LS   Q+++    S +YCLVDR+S AS   +    
Sbjct: 315 MFGCGHWNRGLFHGAAGLLGLGKGPLSFASQMQSLYGQSFSYCLVDRNSNASVSSKLIFG 374

Query: 322 RGGDAVTAPLI--------RNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGII 373
              + ++ P +        ++  VDTFYYV +    V  + ++IP   + +   G GG I
Sbjct: 375 EDKELLSHPNLNFTSFGGGKDGSVDTFYYVQINSVMVDDEVLKIPEETWHLSSEGAGGTI 434

Query: 374 VDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFG 433
           +D GT +T     AY  ++++FVR     +   G+     CY+ SG+  + +P   + F 
Sbjct: 435 IDSGTTLTYFAEPAYEIIKEAFVRKIKGYELVEGLPPLKPCYNVSGIEKMELPDFGILFA 494

Query: 434 AGKALDLPAKNYLIPVDSAGTFCFA-FAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTP 492
            G   + P +NY I +D     C A      SALSIIGN QQQ   + +D+  +R+G+ P
Sbjct: 495 DGAVWNFPVENYFIQID-PDVVCLAILGNPRSALSIIGNYQQQNFHILYDMKKSRLGYAP 553

Query: 493 NKC 495
            KC
Sbjct: 554 MKC 556


>gi|125548488|gb|EAY94310.1| hypothetical protein OsI_16079 [Oryza sativa Indica Group]
          Length = 423

 Score =  249 bits (636), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 148/383 (38%), Positives = 207/383 (54%), Gaps = 18/383 (4%)

Query: 126 NVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWL 185
           N  RH+L    A+      S  V   A  G+GE+   + +GTP   +S ++DTGSD+ W 
Sbjct: 43  NYSRHQLLRRAARRSHHRMSRLVPVHA--GNGEFLMDVSIGTPALAYSAIVDTGSDLVWT 100

Query: 186 QCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSAC-RANRCLYQVAYGDGSFT 244
           QC+PC +C++QS P+FDP +SS+Y+ +PC++  C  L  S C  A++C Y   YGD S T
Sbjct: 101 QCKPCVDCFKQSTPVFDPSSSSTYATVPCSSASCSDLPTSKCTSASKCGYTYTYGDSSST 160

Query: 245 VGDLVTETVSFGNSGSVKGIALGCGHDNEG-LFVGSAGLLGLGGGMLSLTKQIKATSLAY 303
            G L TET +   S  + G+  GCG  NEG  F   AGL+GLG G LSL  Q+     +Y
Sbjct: 161 QGVLATETFTLAKS-KLPGVVFGCGDTNEGDGFSQGAGLVGLGRGPLSLVSQLGLDKFSY 219

Query: 304 CLVDRDSPASGVLEFNSARG--------GDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAV 355
           CL   D   +  L   S  G            T PLI+N    +FYYV L   +VG   +
Sbjct: 220 CLTSLDDTNNSPLLLGSLAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRI 279

Query: 356 QIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSF-VRLAGNLKPTSGVALFDTC 414
            +P S F + + G GG+IVD GT+IT L+ Q Y +L+ +F  ++A      SGV L D C
Sbjct: 280 SLPSSAFAVQDDGTGGVIVDSGTSITYLEVQGYRALKKAFAAQMALPAADGSGVGL-DLC 338

Query: 415 YD--FSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNV 472
           +     G+  V VP +  HF  G  LDLPA+NY++    +G  C      S  LSIIGN 
Sbjct: 339 FRAPAKGVDQVEVPRLVFHFDGGADLDLPAENYMVLDGGSGALCLTVM-GSRGLSIIGNF 397

Query: 473 QQQGTRVSFDLANNRVGFTPNKC 495
           QQQ  +  +D+ ++ + F P +C
Sbjct: 398 QQQNFQFVYDVGHDTLSFAPVQC 420


>gi|409179880|gb|AFV26025.1| aspartic proteinase nepenthesin 2 [Nepenthes mirabilis]
          Length = 437

 Score =  249 bits (636), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 158/403 (39%), Positives = 223/403 (55%), Gaps = 27/403 (6%)

Query: 106 LERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFS---TPVVSGASQGSGEYFSR 162
           LE+  + +N  +TK +L    + R E +      + +  S   TPV +G    SGEY   
Sbjct: 46  LEQVDSGMN--LTKYELIKRAIKRGERRMRSINAMLQSSSGIETPVYAG----SGEYLMN 99

Query: 163 IGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSL 222
           + +GTP    S ++DTGSD+ W QC PCT+C+ Q  PIF+P+ SSS+S LPC +  C+ L
Sbjct: 100 VAIGTPASSLSAIMDTGSDLIWTQCEPCTQCFSQPTPIFNPQDSSSFSTLPCESQYCQDL 159

Query: 223 DVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGS-AG 281
              +C  N C Y   YGDGS T G + TET +F  S SV  IA GCG DN+G   G+ AG
Sbjct: 160 PSESCY-NDCQYTYGYGDGSSTQGYMATETFTFETS-SVPNIAFGCGEDNQGFGQGNGAG 217

Query: 282 LLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGDAVTAP---LIRNKKVD 338
           L+G+G G LSL  Q+     +YC+    S +   L   SA  G    +P   LI +    
Sbjct: 218 LIGMGWGPLSLPSQLGVGQFSYCMTSSGSSSPSTLALGSAASGVPEGSPSTTLIHSSLNP 277

Query: 339 TFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRL 398
           T+YY+ L G +VGG  + IP S F++ + G GG+I+D GT +T L   AYN++  +F   
Sbjct: 278 TYYYITLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSGTTLTYLPQDAYNAVAQAFTDQ 337

Query: 399 AGNLKP----TSGVALFDTCYDF-SGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAG 453
             NL P    +SG++   TC+   S   +V+VP +S+ F  G  L+L  +N LI   + G
Sbjct: 338 I-NLSPVDESSSGLS---TCFQLPSDGSTVQVPEISMQFDGG-VLNLGEENVLIS-PAEG 391

Query: 454 TFCFAFAPTS-SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
             C A   +S   +SI GN+QQQ T+V +DL N  V F P +C
Sbjct: 392 VICLAMGSSSQQGISIFGNIQQQETQVLYDLQNLAVSFVPTQC 434


>gi|297811185|ref|XP_002873476.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297319313|gb|EFH49735.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 475

 Score =  249 bits (635), Expect = 3e-63,   Method: Compositional matrix adjust.
 Identities = 150/402 (37%), Positives = 213/402 (52%), Gaps = 28/402 (6%)

Query: 106 LERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGV 165
           L  D ARVN++ +KL      +  + +  +++  LP         G++ GSG Y   +G+
Sbjct: 89  LRLDQARVNSIHSKLS---KKLTTNHVSQSQSTDLPAK------DGSTLGSGNYIVTVGL 139

Query: 166 GTPPRQFSMVLDTGSDINWLQCRPCTE-CYQQSDPIFDPKTSSSYSPLPCAAPQCKSL-- 222
           GTP    S++ DTGSD+ W QC+PC   CY Q +PIF+P  S+SY  + C++  C SL  
Sbjct: 140 GTPKNDLSLIFDTGSDLTWTQCQPCVRTCYDQKEPIFNPSKSTSYYNVSCSSAACGSLSS 199

Query: 223 ---DVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGS 279
              +  +C A+ C+Y + YGD SF+VG L  +  +  +S    G+  GCG +N+GLF G 
Sbjct: 200 ATGNAGSCSASNCIYGIQYGDQSFSVGFLAKDKFTLTSSDVFDGVYFGCGENNQGLFTGV 259

Query: 280 AGLLGLGGGMLSLTKQIKATS---LAYCLVDRDSPASGVLEFNSARGGDAVT-APLIRNK 335
           AGLLGLG   LS   Q         +YCL    S  +G L F SA    +V   P+    
Sbjct: 260 AGLLGLGRDKLSFPSQTATAYNKIFSYCLPSSAS-YTGHLTFGSAGISRSVKFTPISTIT 318

Query: 336 KVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSF 395
              +FY + +   +VGGQ + IP ++F        G ++D GT ITRL  +AY +LR SF
Sbjct: 319 DGTSFYGLNIVAITVGGQKLPIPSTVFSTP-----GALIDSGTVITRLPPKAYAALRSSF 373

Query: 396 VRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTF 455
                    TSGV++ DTC+D SG ++V +P V+  F  G  ++L +K        +   
Sbjct: 374 KAKMSKYPTTSGVSILDTCFDLSGFKTVTIPKVAFSFSGGAVVELGSKGIFYAFKIS-QV 432

Query: 456 CFAFAPTS--SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           C AFA  S  S  +I GNVQQQ   V +D A  RVGF PN C
Sbjct: 433 CLAFAGNSDDSNAAIFGNVQQQTLEVVYDGAGGRVGFAPNGC 474


>gi|242095586|ref|XP_002438283.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
 gi|241916506|gb|EER89650.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
          Length = 470

 Score =  248 bits (634), Expect = 4e-63,   Method: Compositional matrix adjust.
 Identities = 152/363 (41%), Positives = 205/363 (56%), Gaps = 25/363 (6%)

Query: 145 STPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPC-TECYQQSDPIFDP 203
           S P+  G S G G Y + +G+GTP   ++MV+DTGS + WLQC PC   C++Q  P++DP
Sbjct: 120 SVPLTPGTSVGVGNYVTELGLGTPATSYAMVVDTGSSLTWLQCSPCVVSCHRQVGPLYDP 179

Query: 204 KTSSSYSPLPCAAPQC-----KSLDVSACRA-NRCLYQVAYGDGSFTVGDLVTETVSFGN 257
           + SS+Y+ +PC+A QC      +L+ SAC   N C+YQ +YGD SF+VG L  +TVSFG 
Sbjct: 180 RASSTYATVPCSASQCDELQAATLNPSACSVRNVCIYQASYGDSSFSVGYLSRDTVSFG- 238

Query: 258 SGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPAS- 313
           SGS      GCG DNEGLF  SAGL+GL    LSL  Q+  +   S +YCL    +PAS 
Sbjct: 239 SGSYPNFYYGCGQDNEGLFGRSAGLIGLARNKLSLLYQLAPSLGYSFSYCL---PTPAST 295

Query: 314 GVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGII 373
           G L       G     P+  +    + Y+V L+G SVGG  + + P+     E      I
Sbjct: 296 GYLSIGPYTSGHYSYTPMASSSLDASLYFVTLSGMSVGGSPLAVSPA-----EYSSLPTI 350

Query: 374 VDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRS-VRVPTVSLHF 432
           +D GT ITRL T  Y +L  +       ++     ++ DTC  F G  S +RVP V++ F
Sbjct: 351 IDSGTVITRLPTAVYTALSKAVAAAMVGVQSAPAFSILDTC--FQGQASQLRVPAVAMAF 408

Query: 433 GAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTP 492
             G  L L  +N LI VD + T C AFAPT S  +IIGN QQQ   V +D+A +R+GF  
Sbjct: 409 AGGATLKLATQNVLIDVDDS-TTCLAFAPTDST-TIIGNTQQQTFSVVYDVAQSRIGFAA 466

Query: 493 NKC 495
             C
Sbjct: 467 GGC 469


>gi|449453872|ref|XP_004144680.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
           CELL 1-like [Cucumis sativus]
          Length = 757

 Score =  248 bits (633), Expect = 5e-63,   Method: Compositional matrix adjust.
 Identities = 164/465 (35%), Positives = 239/465 (51%), Gaps = 54/465 (11%)

Query: 81  SLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKL----------QLAIYNVDRH 130
           S+ LH ++    T +    S+  S + RD AR+ TL T++          +L   NV+R 
Sbjct: 99  SVKLHLKKRSTNTANKPKESITESAV-RDLARIQTLHTRITERKNQDTTSRLKKSNVERK 157

Query: 131 E-----LKPAEAQILPEDFS--------TPVVSGASQGSGEYFSRIGVGTPPRQFSMVLD 177
           +       PAE+   PE ++          + SG S GSGEYF  + +G+PP+ FS++LD
Sbjct: 158 KPMEEVSSPAES---PESYADYFSGQLMATLESGVSLGSGEYFIDVFIGSPPKHFSLILD 214

Query: 178 TGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDV----SACR--ANR 231
           TGSD+NW+QC PC +C++Q+ P +DPK S S+  + C  P+C+ +        C+     
Sbjct: 215 TGSDLNWIQCVPCFDCFEQNGPYYDPKDSISFRNITCNDPRCQLVSSPDPPRPCKFETQS 274

Query: 232 CLYQVAYGDGSFTVGDLVTETVSFGNSGS---------VKGIALGCGHDNEGLFVGSAGL 282
           C Y   YGD S T GD   ET +   + S         V+ +  GCGH N GLF G+AGL
Sbjct: 275 CPYFYWYGDSSNTTGDFALETFTVNLTSSTTGKSEFRRVENVMFGCGHWNRGLFHGAAGL 334

Query: 283 LGLGGGMLSLTKQIKA---TSLAYCLVDRDSPASGVLEFNSARGGDAVTAP------LIR 333
           LGLG G LS + Q+++    S +YCLVDRDS  S   +       D +T P      LI 
Sbjct: 335 LGLGRGPLSFSSQLQSLYGHSFSYCLVDRDSDTSVSSKLIFGEDKDLLTHPELNFTSLIA 394

Query: 334 NKK--VDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSL 391
            K+  VDTFYY+ +    VGG+ +QIP   + +   G GG I+D GT ++     AY  +
Sbjct: 395 GKENPVDTFYYLQIKSIFVGGEKLQIPEENWNLSADGAGGTIIDSGTTLSYFSDPAYRII 454

Query: 392 RDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDS 451
           +++F+R     K      +   CY+ SG   +  P   + F  G   + P +NY I +  
Sbjct: 455 KEAFLRKVKGYKLVEDFPILHPCYNVSGTDELNFPEFLIQFADGAVWNFPVENYFIRIQQ 514

Query: 452 AGTFCFAFAPT-SSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
               C A   T  SALSIIGN QQQ   + +D  N+R+G+ P +C
Sbjct: 515 LDIVCLAMLGTPKSALSIIGNYQQQNFHILYDTKNSRLGYAPMRC 559


>gi|449519146|ref|XP_004166596.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 752

 Score =  248 bits (633), Expect = 6e-63,   Method: Compositional matrix adjust.
 Identities = 164/465 (35%), Positives = 239/465 (51%), Gaps = 54/465 (11%)

Query: 81  SLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKL----------QLAIYNVDRH 130
           S+ LH ++    T +    S+  S + RD AR+ TL T++          +L   NV+R 
Sbjct: 99  SVKLHLKKRSTNTANKPKESITESAV-RDLARIQTLHTRITERKNQDTTSRLKKSNVERK 157

Query: 131 E-----LKPAEAQILPEDFS--------TPVVSGASQGSGEYFSRIGVGTPPRQFSMVLD 177
           +       PAE+   PE ++          + SG S GSGEYF  + +G+PP+ FS++LD
Sbjct: 158 KPMEEVSSPAES---PESYADYFSGQLMATLESGVSLGSGEYFIDVFIGSPPKHFSLILD 214

Query: 178 TGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDV----SACR--ANR 231
           TGSD+NW+QC PC +C++Q+ P +DPK S S+  + C  P+C+ +        C+     
Sbjct: 215 TGSDLNWIQCVPCFDCFEQNGPYYDPKDSISFRNITCNDPRCQLVSSPDPPRPCKFETQS 274

Query: 232 CLYQVAYGDGSFTVGDLVTETVSFGNSGS---------VKGIALGCGHDNEGLFVGSAGL 282
           C Y   YGD S T GD   ET +   + S         V+ +  GCGH N GLF G+AGL
Sbjct: 275 CPYFYWYGDSSNTTGDFALETFTVNLTSSTTGKSEFRRVENVMFGCGHWNRGLFHGAAGL 334

Query: 283 LGLGGGMLSLTKQIKA---TSLAYCLVDRDSPASGVLEFNSARGGDAVTAP------LIR 333
           LGLG G LS + Q+++    S +YCLVDRDS  S   +       D +T P      LI 
Sbjct: 335 LGLGRGPLSFSSQLQSLYGHSFSYCLVDRDSDTSVSSKLIFGEDKDLLTHPELNFTSLIA 394

Query: 334 NKK--VDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSL 391
            K+  VDTFYY+ +    VGG+ +QIP   + +   G GG I+D GT ++     AY  +
Sbjct: 395 GKENPVDTFYYLQIKSIFVGGEKLQIPEENWNLSADGAGGTIIDSGTTLSYFSDPAYRII 454

Query: 392 RDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDS 451
           +++F+R     K      +   CY+ SG   +  P   + F  G   + P +NY I +  
Sbjct: 455 KEAFLRKVKGYKLVEDFPILHPCYNVSGTDELNFPEFLIQFADGAVWNFPVENYFIRIQQ 514

Query: 452 AGTFCFAFAPT-SSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
               C A   T  SALSIIGN QQQ   + +D  N+R+G+ P +C
Sbjct: 515 LDIVCLAMLGTPKSALSIIGNYQQQNFHILYDTKNSRLGYAPMRC 559


>gi|255558694|ref|XP_002520371.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
 gi|223540418|gb|EEF41987.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
          Length = 557

 Score =  248 bits (632), Expect = 6e-63,   Method: Compositional matrix adjust.
 Identities = 161/462 (34%), Positives = 233/462 (50%), Gaps = 45/462 (9%)

Query: 76  SSSSFSLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKL-----QLAIYNVDRH 130
           S  +  L L  R I    R + ++   ++   RD  R+ TL  ++     Q A+  +++ 
Sbjct: 96  SKQTLKLHLKHRWI---NRDSTHKESFVASTTRDLTRIQTLHKRILEKKNQNALSRLNKE 152

Query: 131 ELK-----PAE------AQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTG 179
           E K     PA       A  L       + SG S GSGEYF  + +GTPPR FS++LDTG
Sbjct: 153 EPKQPVVAPAASPESYPANGLSGQLMATLESGVSLGSGEYFMDVFIGTPPRHFSLILDTG 212

Query: 180 SDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDV----SACRANR--CL 233
           SD+NW+QC PC +C+ Q+ P +DPK SSS+  + C  P+C  +        C+A    C 
Sbjct: 213 SDLNWIQCVPCYDCFVQNGPYYDPKESSSFKNIGCHDPRCHLVSSPDPPQPCKAENQTCP 272

Query: 234 YQVAYGDGSFTVGDLVTETVSF------GNS--GSVKGIALGCGHDNEGLFVGSAGLLGL 285
           Y   YGD S T GD   ET +       G S    V+ +  GCGH N GLF G+AGLLGL
Sbjct: 273 YFYWYGDSSNTTGDFALETFTVNLTSPAGKSEFKRVENVMFGCGHWNRGLFHGAAGLLGL 332

Query: 286 GGGMLSLTKQIKA---TSLAYCLVDRDSPASGVLEFNSARGGDAVTAPLI--------RN 334
           G G LS + Q+++    S +YCLVDR+S  +   +       D +  P +        + 
Sbjct: 333 GRGPLSFSSQLQSLYGHSFSYCLVDRNSDTNVSSKLIFGEDKDLLNHPEVNFTSLVAGKE 392

Query: 335 KKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDS 394
             VDTFYYV +    VGG+ ++IP   + +   G GG IVD GT ++     +Y  ++D+
Sbjct: 393 NPVDTFYYVQIKSIMVGGEVLKIPEETWHLSPEGAGGTIVDSGTTLSYFAEPSYEIIKDA 452

Query: 395 FVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGT 454
           FV+            + D CY+ SG+  + +P   + F  G   + P +NY I ++    
Sbjct: 453 FVKKVKGYPVIKDFPILDPCYNVSGVEKMELPEFRILFEDGAVWNFPVENYFIKLEPEEI 512

Query: 455 FCFAFAPT-SSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            C A   T  SALSIIGN QQQ   + +D   +R+G+ P KC
Sbjct: 513 VCLAILGTPRSALSIIGNYQQQNFHILYDTKKSRLGYAPMKC 554


>gi|168041176|ref|XP_001773068.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162675615|gb|EDQ62108.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 365

 Score =  248 bits (632), Expect = 6e-63,   Method: Compositional matrix adjust.
 Identities = 141/368 (38%), Positives = 202/368 (54%), Gaps = 24/368 (6%)

Query: 144 FSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDP 203
           F+ PV +      GEY + + +GTP R FS+++DTGSD+ W+QC PC +CY Q+D +F P
Sbjct: 2   FTAPVAAA----RGEYLATVRLGTPERVFSVIVDTGSDLTWVQCSPCGKCYSQNDALFLP 57

Query: 204 KTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSF----GNSG 259
            TS+S++ L C +  C  L    C    C+Y  +YGDGS T GD V +T++     G   
Sbjct: 58  NTSTSFTKLACGSALCNGLPFPMCNQTTCVYWYSYGDGSLTTGDFVYDTITMDGINGQKQ 117

Query: 260 SVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASGV- 315
            V   A GCGHDNEG F G+ G+LGLG G LS   Q+K+      +YCLVD  +P +   
Sbjct: 118 QVPNFAFGCGHDNEGSFAGADGILGLGQGPLSFHSQLKSVYNGKFSYCLVDWLAPPTQTS 177

Query: 316 -LEFNSARG---GDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGG 371
            L F  A      D    P++ N KV T+YYV L G SVG   + I  ++F++D  G  G
Sbjct: 178 PLLFGDAAVPILPDVKYLPILANPKVPTYYYVKLNGISVGDNLLNISSTVFDIDSVGGAG 237

Query: 372 IIVDCGTAITRLQTQAYNSLRDSF-VRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTV-- 428
            I D GT +T+L   AY  +  +         +    ++  D C   SG    ++PTV  
Sbjct: 238 TIFDSGTTVTQLAEAAYKEVLAAMNASTMAYSRKIDDISRLDLC--LSGFPKDQLPTVPA 295

Query: 429 -SLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNR 487
            + HF  G  + LP  NY I ++S+ ++CFA   +S  ++IIG+VQQQ  +V +D A  +
Sbjct: 296 MTFHFEGGDMV-LPPSNYFIYLESSQSYCFAMT-SSPDVNIIGSVQQQNFQVYYDTAGRK 353

Query: 488 VGFTPNKC 495
           +GF P  C
Sbjct: 354 LGFVPKDC 361


>gi|357494221|ref|XP_003617399.1| 60S ribosomal protein L18a [Medicago truncatula]
 gi|355518734|gb|AET00358.1| 60S ribosomal protein L18a [Medicago truncatula]
          Length = 749

 Score =  248 bits (632), Expect = 6e-63,   Method: Compositional matrix adjust.
 Identities = 164/465 (35%), Positives = 236/465 (50%), Gaps = 44/465 (9%)

Query: 73  PLNSSSSFSLPLHS--REILHKTRHNDYRSLVLSRLERDSARV-----NTLITKLQLAIY 125
           P N S  F L   S   EI  K    DY    L+R++    RV        I++LQ +  
Sbjct: 91  PQNQSVKFHLKHISMKNEIEPKKSVIDYSIRDLTRIQTLHTRVIEKKNQNTISRLQKSTK 150

Query: 126 NV--DRHELKPAEAQIL---PEDFSTPVV----SGASQGSGEYFSRIGVGTPPRQFSMVL 176
                +   KPA + +    PE +S+ +V    SG S GSGEYF  + +GTPP+ +S++L
Sbjct: 151 KQTNSKQSYKPAVSPVAAASPE-YSSQLVATLESGVSLGSGEYFMDVFIGTPPKHYSLIL 209

Query: 177 DTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDV----SACRANR- 231
           DTGSD+NW+QC PC  C++QS P +DPK SSS+  + C  P+CK +        C+    
Sbjct: 210 DTGSDLNWIQCVPCIACFEQSGPYYDPKESSSFENITCHDPRCKLVSSPDPPKPCKDENQ 269

Query: 232 -CLYQVAYGDGSFTVGDLVTETVSFG--------NSGSVKGIALGCGHDNEGLFVGSAGL 282
            C Y   YGD S T GD   ET +              V+ +  GCGH N GLF G+AGL
Sbjct: 270 TCPYFYWYGDSSNTTGDFALETFTVNLTTPNGKSEQKHVENVMFGCGHWNRGLFHGAAGL 329

Query: 283 LGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASGVLEFNSARGGDAVTAPLI------- 332
           LGLG G LS   Q+++    S +YCLVDR+S  S   +       + ++ P +       
Sbjct: 330 LGLGRGPLSFASQLQSIYGHSFSYCLVDRNSDTSVSSKLIFGEDKELLSHPNLNFTSFVG 389

Query: 333 -RNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSL 391
                VDTFYYVG+    V G+ ++IP   + + + G GG I+D GT +T     AY  +
Sbjct: 390 GEENSVDTFYYVGIKSIMVDGEVLKIPEETWHLSKEGGGGTIIDSGTTLTYFAEPAYEII 449

Query: 392 RDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDS 451
           +++F++     +   G      CY+ SG+  + +P   + F  G   D P +NY I ++ 
Sbjct: 450 KEAFMKKIKGYELVEGFPPLKPCYNVSGIEKMELPDFGILFSDGAMWDFPVENYFIQIE- 508

Query: 452 AGTFCFAFAPT-SSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
               C A   T  SALSIIGN QQQ   + +D+  +R+G+ P KC
Sbjct: 509 PDLVCLAILGTPKSALSIIGNYQQQNFHILYDMKKSRLGYAPMKC 553


>gi|356499344|ref|XP_003518501.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 561

 Score =  247 bits (631), Expect = 8e-63,   Method: Compositional matrix adjust.
 Identities = 162/478 (33%), Positives = 238/478 (49%), Gaps = 44/478 (9%)

Query: 62  AEESETAAESFPLNSSSSFSLPLH------SREILHKTRHNDYRSLVLSRLERDSARV-- 113
            EE++  +E+FP        +  H      S++   K    D+    L+R++    RV  
Sbjct: 81  GEETDEESEAFPAQKPHQNLVKFHLKHRSGSKDAEPKQSVVDFTLSDLTRIQNLHRRVIE 140

Query: 114 ---NTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVV--------SGASQGSGEYFSR 162
                 I++LQ +     +   KP  A       ++PV         SG S GSGEYF  
Sbjct: 141 KKNQNTISRLQKSQKEQPKQSYKPVVAAPAASRTTSPVSGQLVATLESGVSLGSGEYFMD 200

Query: 163 IGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSL 222
           + VGTPP+ FS++LDTGSD+NW+QC PC  C++QS P +DPK SSS+  + C  P+C+ +
Sbjct: 201 VFVGTPPKHFSLILDTGSDLNWIQCVPCIACFEQSGPYYDPKDSSSFRNISCHDPRCQLV 260

Query: 223 DV----SACRANR--CLYQVAYGDGSFTVGDLVTETVSF------GNS--GSVKGIALGC 268
                   C+A    C Y   YGDGS T GD   ET +       G S    V+ +  GC
Sbjct: 261 SAPDPPKPCKAENQSCPYFYWYGDGSNTTGDFALETFTVNLTTPNGTSELKHVENVMFGC 320

Query: 269 GHDNEGLFVGSAGLLGLGGGMLSLTKQIKA---TSLAYCLVDRDSPASGVLEFNSARGGD 325
           GH N GLF G+AGLLGLG G LS   Q+++    S +YCLVDR+S AS   +       +
Sbjct: 321 GHWNRGLFHGAAGLLGLGKGPLSFASQMQSLYGQSFSYCLVDRNSNASVSSKLIFGEDKE 380

Query: 326 AVTAPLI--------RNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCG 377
            ++ P +        ++  VDTFYYV +    V  + ++IP   + +   G GG I+D G
Sbjct: 381 LLSHPNLNFTSFGGGKDGSVDTFYYVQIKSVMVDDEVLKIPEETWHLSSEGAGGTIIDSG 440

Query: 378 TAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKA 437
           T +T     AY  ++++FVR     +   G+     CY+ SG+  + +P   + F     
Sbjct: 441 TTLTYFAEPAYEIIKEAFVRKIKGYQLVEGLPPLKPCYNVSGIEKMELPDFGILFADEAV 500

Query: 438 LDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            + P +NY I +D             SALSIIGN QQQ   + +D+  +R+G+ P KC
Sbjct: 501 WNFPVENYFIWIDPEVVCLAILGNPRSALSIIGNYQQQNFHILYDMKKSRLGYAPMKC 558


>gi|225430551|ref|XP_002283470.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 490

 Score =  247 bits (631), Expect = 9e-63,   Method: Compositional matrix adjust.
 Identities = 149/403 (36%), Positives = 214/403 (53%), Gaps = 29/403 (7%)

Query: 106 LERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGV 165
           L +D +RV ++ ++L   +       LK ++A +       P  S ++ GSG Y   +G+
Sbjct: 103 LAQDESRVASIQSRLAKNL--AGGSNLKASKATL-------PSKSASTLGSGNYVVTVGL 153

Query: 166 GTPPRQFSMVLDTGSDINWLQCRPCT-ECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDV 224
           G+P R  + + DTGSD+ W QC PC   CYQQ + IFDP TS SYS + C +P C+ L+ 
Sbjct: 154 GSPKRDLTFIFDTGSDLTWTQCEPCVGYCYQQREHIFDPSTSLSYSNVSCDSPSCEKLES 213

Query: 225 S-----ACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGS 279
           +      C ++ CLY + YGDGS+++G    E +S  ++        GCG +N GLF G+
Sbjct: 214 ATGNSPGCSSSTCLYGIRYGDGSYSIGFFAREKLSLTSTDVFNNFQFGCGQNNRGLFGGT 273

Query: 280 AGLLGLGGGMLSL---TKQIKATSLAYCLVDRDSPASGVLEFNSARGGDAVT--APLIRN 334
           AGLLGL    LSL   T Q      +YCL    S ++G L F S  G        P   N
Sbjct: 274 AGLLGLARNPLSLVSQTAQKYGKVFSYCL-PSSSSSTGYLSFGSGDGDSKAVKFTPSEVN 332

Query: 335 KKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDS 394
               +FY++ + G SVG + + IP S+F        G I+D GT I+RL    Y+S++  
Sbjct: 333 SDYPSFYFLDMVGISVGERKLPIPKSVFST-----AGTIIDSGTVISRLPPTVYSSVQKV 387

Query: 395 FVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGT 454
           F  L  +     GV++ DTCYD S  ++V+VP + L+F  G  +DL A   +I V     
Sbjct: 388 FRELMSDYPRVKGVSILDTCYDLSKYKTVKVPKIILYFSGGAEMDL-APEGIIYVLKVSQ 446

Query: 455 FCFAFAPTS--SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            C AFA  S    ++IIGNVQQ+   V +D A  RVGF P+ C
Sbjct: 447 VCLAFAGNSDDDEVAIIGNVQQKTIHVVYDDAEGRVGFAPSGC 489


>gi|147862576|emb|CAN79341.1| hypothetical protein VITISV_006338 [Vitis vinifera]
          Length = 436

 Score =  247 bits (631), Expect = 9e-63,   Method: Compositional matrix adjust.
 Identities = 153/414 (36%), Positives = 215/414 (51%), Gaps = 14/414 (3%)

Query: 86  SREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFS 145
           SR +  +   N +R   +S    DS    T   +LQ A   V R  L+          F 
Sbjct: 30  SRSLDRRPEKNGFR---VSLRHVDSGGNYTKFERLQRA---VKRGRLRLQRLSAKTASFE 83

Query: 146 TPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKT 205
             V +    G+GE+   + +GTP   +S ++DTGSD+ W QC+PC  C+ Q  PIFDP+ 
Sbjct: 84  PSVEAPVHAGNGEFLMNLAIGTPAETYSAIMDTGSDLIWTQCKPCKVCFDQPTPIFDPEK 143

Query: 206 SSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIA 265
           SSS+S LPC++  C +L +S+C ++ C Y+ +YGD S T G L TET +FG++ SV  I 
Sbjct: 144 SSSFSKLPCSSDLCVALPISSC-SDGCEYRYSYGDHSSTQGVLATETFTFGDA-SVSKIG 201

Query: 266 LGCGHDNEG-LFVGSAGLLGLGGGMLSLTKQIKATSLAYCL--VDRDSPASGVLEFNSAR 322
            GCG DN G  +   AGL+GLG G LSL  Q+     +YCL  +D     S +L  + A 
Sbjct: 202 FGCGEDNRGRAYSQGAGLVGLGRGPLSLISQLGVPKFSYCLTSIDDSKGISTLLVGSEAT 261

Query: 323 GGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITR 382
              A+  PLI+N    +FYY+ L G SVG   + I  S F + + G GG+I+D GT IT 
Sbjct: 262 VKSAIPTPLIQNPSRPSFYYLSLEGISVGDTLLPIEKSTFSIQDDGSGGLIIDSGTTITY 321

Query: 383 LQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRS-VRVPTVSLHFGAGKALDLP 441
           L+  A+ +L+  F+         SG    + C+      S V VP +  HF  G  L LP
Sbjct: 322 LKDNAFAALKKEFISQMKLDVDASGSTELELCFTLPPDGSPVEVPQLVFHF-EGVDLKLP 380

Query: 442 AKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            +NY+I   +    C     +SS +SI GN QQQ   V  DL    + F P +C
Sbjct: 381 KENYIIEDSALRVICLTMG-SSSGMSIFGNFQQQNIVVLHDLEKETISFAPAQC 433


>gi|168040957|ref|XP_001772959.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162675692|gb|EDQ62184.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 351

 Score =  246 bits (629), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 143/352 (40%), Positives = 199/352 (56%), Gaps = 12/352 (3%)

Query: 153 SQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPL 212
           S GSGEY  +I +GTPP+QFS ++DTGSD+ W+QC PC  C++Q DP+F P  SSSYS  
Sbjct: 2   SAGSGEYVLQISLGTPPQQFSAIVDTGSDLCWVQCAPCARCFEQPDPLFIPLASSSYSNA 61

Query: 213 PCAAPQCKSLDVSACRA-NRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHD 271
            C    C +L    C   N C Y  +YGDGS T GD   ETV+  N  ++  I  GCGH+
Sbjct: 62  SCTDSLCDALPRPTCSMRNTCTYSYSYGDGSNTRGDFAFETVTL-NGSTLARIGFGCGHN 120

Query: 272 NEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPA--SGVLEFNSARGGDA 326
            EG F G+ GL+GLG G LSL  Q+ ++     +YCLVD+ +    S +   N+A    A
Sbjct: 121 QEGTFAGADGLIGLGQGPLSLPSQLNSSFTHIFSYCLVDQSTTGTFSPITFGNAAENSRA 180

Query: 327 VTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQ 386
              PL++N+   ++YYVG+   SVG + V  PPS F +D  G GG+I+D GT IT  +  
Sbjct: 181 SFTPLLQNEDNPSYYYVGVESISVGNRRVPTPPSAFRIDANGVGGVILDSGTTITYWRLA 240

Query: 387 AYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGL--RSVRVPTVSLHFGAGKALDLPAKN 444
           A+  +     R     +        + CYD S +   S+ +P++++H       ++P  N
Sbjct: 241 AFIPILAELRRQISYPEADPTPYGLNLCYDISSVSASSLTLPSMTVHL-TNVDFEIPVSN 299

Query: 445 YLIPVDSAG-TFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
             + VD+ G T C A + TS   SIIGNVQQQ   +  D+AN+RVGF    C
Sbjct: 300 LWVLVDNFGETVCTAMS-TSDQFSIIGNVQQQNNLIVTDVANSRVGFLATDC 350


>gi|24417232|gb|AAN60226.1| unknown [Arabidopsis thaliana]
          Length = 464

 Score =  246 bits (628), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 157/394 (39%), Positives = 212/394 (53%), Gaps = 22/394 (5%)

Query: 106 LERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGV 165
           + RD ARV ++ +KL     N    E+  A++  LP        SG + GSG Y   IG+
Sbjct: 89  IRRDQARVESIYSKLSKNSAN----EVSEAKSTELPAK------SGITLGSGNYIVTIGI 138

Query: 166 GTPPRQFSMVLDTGSDINWLQCRPCT-ECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDV 224
           GTP    S+V DTGSD+ W QC PC   CY Q +P F+P +SS+Y  + C++P C+  D 
Sbjct: 139 GTPKHDLSLVFDTGSDLTWTQCEPCLGSCYSQKEPKFNPSSSSTYQNVSCSSPMCE--DA 196

Query: 225 SACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLG 284
            +C A+ C+Y + YGD SFT G L  E  +  NS  ++ +  GCG +N+GLF G AGLLG
Sbjct: 197 ESCSASNCVYSIGYGDKSFTQGFLAKEKFTLTNSDVLEDVYFGCGENNQGLFDGVAGLLG 256

Query: 285 LGGGMLSLTKQIKATS---LAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFY 341
           LG G LSL  Q   T     +YCL    S ++G L F SA   ++V    I +      Y
Sbjct: 257 LGPGKLSLPAQTTTTYNNIFSYCLPSFTSNSTGHLTFGSAGISESVKFTPISSFPSAFNY 316

Query: 342 YVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGN 401
            + + G SVG + + I P+ F  +     G I+D GT  TRL T+ Y  LR  F     +
Sbjct: 317 GIDIIGISVGDKELAITPNSFSTE-----GAIIDSGTVFTRLPTKVYAELRSVFKEKMSS 371

Query: 402 LKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAP 461
            K TSG  LFDTCYDF+GL +V  PT++  F  G  ++L      +P+      C AFA 
Sbjct: 372 YKSTSGYGLFDTCYDFTGLDTVTYPTIAFSFAGGTVVELDGSGISLPI-KISQVCLAFAG 430

Query: 462 TSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
                +I GNVQQ    V +D+A  RVGF PN C
Sbjct: 431 NDDLPAIFGNVQQTTLDVVYDVAGGRVGFAPNGC 464


>gi|414885970|tpg|DAA61984.1| TPA: hypothetical protein ZEAMMB73_915310 [Zea mays]
          Length = 523

 Score =  246 bits (627), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 143/397 (36%), Positives = 214/397 (53%), Gaps = 25/397 (6%)

Query: 106 LERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGV 165
           L+RD  RV++ I ++    +   +            +  S P   G   G+  Y   +G+
Sbjct: 144 LDRDQDRVDS-IHRMTAGPWTAGQSSAS--------KGVSLPAHRGLRLGTANYIVSVGL 194

Query: 166 GTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVS 225
           GTP R   +V DTGSD++W+QC+PC  CY+Q DP+FDP  S++YS +PC A +C  LD  
Sbjct: 195 GTPRRDLLVVFDTGSDLSWVQCKPCNNCYKQHDPLFDPSQSTTYSAVPCGAQEC--LDSG 252

Query: 226 ACRANRCLYQVAYGDGSFTVGDLVTETVSFG-NSGSVKGIALGCGHDNEGLFVGSAGLLG 284
            C + +C Y+V YGD S T G+L  +T++ G +S  ++G   GCG D+ GLF  + GL G
Sbjct: 253 TCSSGKCRYEVVYGDMSQTDGNLARDTLTLGPSSDQLQGFVFGCGDDDTGLFGRADGLFG 312

Query: 285 LGGGMLSLTKQIKA---TSLAYCLVDRDSPASGVLEFNSARG-GDAVTAPLIRNKKVDTF 340
           LG   +SL  Q  A      +YCL      A G L   SA     A    ++      +F
Sbjct: 313 LGRDRVSLASQAAARYGAGFSYCLPS-SWRAEGYLSLGSAAAPPHAQFTAMVTRSDTPSF 371

Query: 341 YYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAG 400
           YY+ L G  V G+ V++ P++F+       G ++D GT ITRL ++AY++LR SF     
Sbjct: 372 YYLDLVGIKVAGRTVRVAPAVFKAP-----GTVIDSGTVITRLPSRAYSALRSSFAGFMR 426

Query: 401 NLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFA 460
             K    +++ DTCYDF+G   V++P+V+L F  G  L+L     L  V +    C AFA
Sbjct: 427 RYKRAPALSILDTCYDFTGRTKVQIPSVALLFDGGATLNLGFGGVLY-VANRSQACLAFA 485

Query: 461 PT--SSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
                +++ I+GN+QQ+   V +DLAN ++GF    C
Sbjct: 486 SNGDDTSVGILGNMQQKTFAVVYDLANQKIGFGAKGC 522


>gi|168043550|ref|XP_001774247.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162674374|gb|EDQ60883.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 419

 Score =  245 bits (625), Expect = 5e-62,   Method: Compositional matrix adjust.
 Identities = 158/430 (36%), Positives = 222/430 (51%), Gaps = 28/430 (6%)

Query: 74  LNSSSSFSLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELK 133
           +NS   FS  +   E++H+                   R NT  T  ++ +  V R   +
Sbjct: 7   INSFYDFSFQVLRTELIHREH------------PSSPLRSNTSKTTTEIFLAAVKRGAER 54

Query: 134 PAE--AQILPED--FSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRP 189
            A+    IL E   FSTPV SG    +GEY   I  G+PP++ S+++DTGSD+ W QC P
Sbjct: 55  RAQLSKHILAEGRLFSTPVASG----NGEYLIDISFGSPPQKASVIVDTGSDLIWTQCLP 110

Query: 190 CTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLV 249
           C  C   +  IFDP  SS+Y  + CA+  C SL   +C  + C Y   YGDGS T G L 
Sbjct: 111 CETCNAAASVIFDPVKSSTYDTVSCASNFCSSLPFQSCTTS-CKYDYMYGDGSSTSGAL- 168

Query: 250 TETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQ---IKATSLAYCLV 306
           +       +G++  +A GCGH N G F G+AG++GLG G LSL  Q   I +   +YCLV
Sbjct: 169 STETVTVGTGTIPNVAFGCGHTNLGSFAGAAGIVGLGQGPLSLISQASSITSKKFSYCLV 228

Query: 307 DRDS-PASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMD 365
              S   S +L  +SA  G      L+ N    TFYY  LTG SV G+AV  P   F +D
Sbjct: 229 PLGSTKTSPMLIGDSAAAGGVAYTALLTNTANPTFYYADLTGISVSGKAVTYPVGTFSID 288

Query: 366 EAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRV 425
            +G GG I+D GT +T L+T A+N+L  +        +    +   D C+  +G+ +   
Sbjct: 289 ASGQGGFILDSGTTLTYLETGAFNALVAALKAEVPFPEADGSLYGLDYCFSTAGVANPTY 348

Query: 426 PTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLAN 485
           PT++ HF  G   +LP +N  + +D+ G+ C A A  S+  SI+GN+QQQ   +  DL N
Sbjct: 349 PTMTFHF-KGADYELPPENVFVALDTGGSICLAMA-ASTGFSIMGNIQQQNHLIVHDLVN 406

Query: 486 NRVGFTPNKC 495
            RVGF    C
Sbjct: 407 QRVGFKEANC 416


>gi|225437854|ref|XP_002264056.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
          Length = 436

 Score =  244 bits (624), Expect = 5e-62,   Method: Compositional matrix adjust.
 Identities = 152/413 (36%), Positives = 214/413 (51%), Gaps = 14/413 (3%)

Query: 87  REILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFST 146
           R +  +   N +R   +S    DS    T   +LQ A   V R  L+          F  
Sbjct: 31  RSLDRRPEKNGFR---VSLRHVDSGGNYTKFERLQRA---VKRGRLRLQRLSAKTASFEP 84

Query: 147 PVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTS 206
            V +    G+GE+   + +GTP   +S ++DTGSD+ W QC+PC  C+ Q  PIFDP+ S
Sbjct: 85  SVEAPVHAGNGEFLMNLAIGTPAETYSAIMDTGSDLIWTQCKPCKVCFDQPTPIFDPEKS 144

Query: 207 SSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIAL 266
           SS+S LPC++  C +L +S+C ++ C Y+ +YGD S T G L TET +FG++ SV  I  
Sbjct: 145 SSFSKLPCSSDLCVALPISSC-SDGCEYRYSYGDHSSTQGVLATETFTFGDA-SVSKIGF 202

Query: 267 GCGHDNEG-LFVGSAGLLGLGGGMLSLTKQIKATSLAYCL--VDRDSPASGVLEFNSARG 323
           GCG DN G  +   AGL+GLG G LSL  Q+     +YCL  +D     S +L  + A  
Sbjct: 203 GCGEDNRGRAYSQGAGLVGLGRGPLSLISQLGVPKFSYCLTSIDDSKGISTLLVGSEATV 262

Query: 324 GDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRL 383
             A+  PLI+N    +FYY+ L G SVG   + I  S F + + G GG+I+D GT IT L
Sbjct: 263 KSAIPTPLIQNPSRPSFYYLSLEGISVGDTLLPIEKSTFSIQDDGSGGLIIDSGTTITYL 322

Query: 384 QTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRS-VRVPTVSLHFGAGKALDLPA 442
           +  A+ +L+  F+         SG    + C+      S V VP +  HF  G  L LP 
Sbjct: 323 KDSAFAALKKEFISQMKLDVDASGSTELELCFTLPPDGSPVDVPQLVFHF-EGVDLKLPK 381

Query: 443 KNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           +NY+I   +    C     +SS +SI GN QQQ   V  DL    + F P +C
Sbjct: 382 ENYIIEDSALRVICLTMG-SSSGMSIFGNFQQQNIVVLHDLEKETISFAPAQC 433


>gi|242045564|ref|XP_002460653.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
 gi|241924030|gb|EER97174.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
          Length = 525

 Score =  244 bits (623), Expect = 7e-62,   Method: Compositional matrix adjust.
 Identities = 170/470 (36%), Positives = 235/470 (50%), Gaps = 42/470 (8%)

Query: 62  AEESETAAESFPLNSSSSFSLPLHSREILH-KTRHNDYRSLVLSRLERDSARVNTLITKL 120
           A E     +  P + S S  L L+ R     +TR       +L   E+D+ R+ T+  + 
Sbjct: 59  AAEEALDEQKQPASPSPSLKLRLNHRAAEGGRTREES----LLDLAEKDAVRIETMYRRA 114

Query: 121 QLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGS 180
             A     R     +  + L E     V SG + GSGEY   + VGTPPR+F M++DTGS
Sbjct: 115 --ARSGGGRMPASSSPRRALSERMVATVESGVAVGSGEYLMDVYVGTPPRRFRMIMDTGS 172

Query: 181 DINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSA---------CR--- 228
           D+NWLQC PC +C++Q  P+FDP  SSSY  + C   +C  +             CR   
Sbjct: 173 DLNWLQCAPCLDCFEQRGPVFDPAASSSYRNVTCGDHRCGHVAPPPEPEASSPRTCRRPG 232

Query: 229 ANRCLYQVAYGDGSFTVGDLVTETVSF-----GNSGSVKGIALGCGHDNEGLFVGSAGLL 283
            + C Y   YGD S T GDL  E+ +      G S  V G+  GCGH N GLF G+AGLL
Sbjct: 233 EDPCPYYYWYGDQSNTTGDLALESFTVNLTAPGASRRVDGVVFGCGHRNRGLFHGAAGLL 292

Query: 284 GLGGGMLSLTKQIKAT---SLAYCLVDRDSPASGVLEFNSARGGDAVTA-PLIR------ 333
           GLG G LS   Q++A    + +YCLVD  S     + F       A+ A P ++      
Sbjct: 293 GLGRGPLSFASQLRAVYGHTFSYCLVDHGSDVGSKVVFGEDDDALALAAHPQLKYTAFAP 352

Query: 334 ----NKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYN 389
               +   DTFYYV L G  VGG+ + I    +++ + G GG I+D GT ++     AY 
Sbjct: 353 ASSSSSPADTFYYVKLKGVLVGGELLNISSDTWDVGKDGSGGTIIDSGTTLSYFVEPAYQ 412

Query: 390 SLRDSFV-RLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIP 448
            +R +F+ R++ +        +   CY+ SG+    VP +SL F  G   D PA+NY I 
Sbjct: 413 VIRHAFMDRMSRSYPLVPEFPVLSPCYNVSGVERPEVPELSLLFADGAVWDFPAENYFIR 472

Query: 449 VDSAG--TFCFAFAPT-SSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           +D  G    C A   T  + +SIIGN QQQ   V +DL NNR+GF P +C
Sbjct: 473 LDPDGGSIMCLAVLGTPRTGMSIIGNFQQQNFHVVYDLQNNRLGFAPRRC 522


>gi|15238250|ref|NP_196637.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|8979710|emb|CAB96831.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
           thaliana]
 gi|18176136|gb|AAL59990.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
 gi|22136986|gb|AAM91722.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
 gi|110740988|dbj|BAE98588.1| nucleoid DNA-binding protein cnd41 - like protein [Arabidopsis
           thaliana]
 gi|332004210|gb|AED91593.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 464

 Score =  244 bits (622), Expect = 9e-62,   Method: Compositional matrix adjust.
 Identities = 156/394 (39%), Positives = 211/394 (53%), Gaps = 22/394 (5%)

Query: 106 LERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGV 165
           + RD ARV ++ +KL     N    E+  A++  LP        SG + GSG Y   IG+
Sbjct: 89  IRRDQARVESIYSKLSKNSAN----EVSEAKSTELPAK------SGITLGSGNYIVTIGI 138

Query: 166 GTPPRQFSMVLDTGSDINWLQCRPCT-ECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDV 224
           GTP    S+V DTGSD+ W QC PC   CY Q +P F+P +SS+Y  + C++P C+  D 
Sbjct: 139 GTPKHDLSLVFDTGSDLTWTQCEPCLGSCYSQKEPKFNPSSSSTYQNVSCSSPMCE--DA 196

Query: 225 SACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLG 284
            +C A+ C+Y + YGD SFT G L  E  +  NS  ++ +  GCG +N+GLF G AGLLG
Sbjct: 197 ESCSASNCVYSIVYGDKSFTQGFLAKEKFTLTNSDVLEDVYFGCGENNQGLFDGVAGLLG 256

Query: 285 LGGGMLSLTKQIKATS---LAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFY 341
           LG G LSL  Q   T     +YCL    S ++G L F SA   ++V    I +      Y
Sbjct: 257 LGPGKLSLPAQTTTTYNNIFSYCLPSFTSNSTGHLTFGSAGISESVKFTPISSFPSAFNY 316

Query: 342 YVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGN 401
            + + G SVG + + I P+ F  +     G I+D GT  TRL T+ Y  LR  F     +
Sbjct: 317 GIDIIGISVGDKELAITPNSFSTE-----GAIIDSGTVFTRLPTKVYAELRSVFKEKMSS 371

Query: 402 LKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAP 461
            K TSG  LFDTCYDF+GL +V  PT++  F     ++L      +P+      C AFA 
Sbjct: 372 YKSTSGYGLFDTCYDFTGLDTVTYPTIAFSFAGSTVVELDGSGISLPI-KISQVCLAFAG 430

Query: 462 TSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
                +I GNVQQ    V +D+A  RVGF PN C
Sbjct: 431 NDDLPAIFGNVQQTTLDVVYDVAGGRVGFAPNGC 464


>gi|449466304|ref|XP_004150866.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
 gi|449485213|ref|XP_004157102.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 473

 Score =  244 bits (622), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 144/403 (35%), Positives = 218/403 (54%), Gaps = 28/403 (6%)

Query: 106 LERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGV 165
           L +D +RV+ + +K+   + +VDR  L+ ++A  +P        SGA+ GSG Y   +G+
Sbjct: 86  LVKDQSRVDFIHSKIAGELESVDR--LRGSKATKIPAK------SGATIGSGNYIVSVGL 137

Query: 166 GTPPRQFSMVLDTGSDINWLQCRPCTE-CYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDV 224
           GTP +  S++ DTGSD+ W QC+PC   CY Q DP+F P  S++YS + C++P C  L+ 
Sbjct: 138 GTPKKYLSLIFDTGSDLTWTQCQPCARYCYNQKDPVFVPSQSTTYSNISCSSPDCSQLES 197

Query: 225 S-----ACRANR-CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVG 278
                  C A R C+Y + YGD SF+VG    ET++  ++  ++    GCG +N GLF  
Sbjct: 198 GTGNQPGCSAARACIYGIQYGDQSFSVGYFAKETLTLTSTDVIENFLFGCGQNNRGLFGS 257

Query: 279 SAGLLGLGGGMLSLTKQIK---ATSLAYCLVDRDSPASGVLEFNSARGGDAVT-APLIRN 334
           +AGL+GLG   +S+ KQ         +YCL  + S ++G L F    GG A+   P+ + 
Sbjct: 258 AAGLIGLGQDKISIVKQTAQKYGQVFSYCL-PKTSSSTGYLTFGGGGGGGALKYTPITKA 316

Query: 335 KKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDS 394
             V  FY V + G  VGG  + I  S+F        G I+D GT ITRL   AY++L+ +
Sbjct: 317 HGVANFYGVDIVGMKVGGTQIPISSSVFSTS-----GAIIDSGTVITRLPPDAYSALKSA 371

Query: 395 FVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGT 454
           F +          +++ DTCYD S   ++++P V   F  G+ LDL     +    S   
Sbjct: 372 FEKGMAKYPKAPELSILDTCYDLSKYSTIQIPKVGFVFKGGEELDLDGIGIMYGA-STSQ 430

Query: 455 FCFAFAPTS--SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            C AFA     S ++IIGNVQQ+  +V +D+   ++GF  N C
Sbjct: 431 VCLAFAGNQDPSTVAIIGNVQQKTLQVVYDVGGGKIGFGYNGC 473


>gi|226494448|ref|NP_001141341.1| uncharacterized protein LOC100273432 precursor [Zea mays]
 gi|194704078|gb|ACF86123.1| unknown [Zea mays]
 gi|413953775|gb|AFW86424.1| aspartic proteinase nepenthesin-2 [Zea mays]
          Length = 471

 Score =  243 bits (621), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 146/362 (40%), Positives = 209/362 (57%), Gaps = 22/362 (6%)

Query: 145 STPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPC-TECYQQSDPIFDP 203
           S P+  G S G G Y +++G+GTP   ++MV+DTGS + WLQC PC   C++Q  P+FDP
Sbjct: 120 SVPLSPGTSVGVGNYVTQLGLGTPSTSYAMVVDTGSSLTWLQCSPCVVSCHRQVGPLFDP 179

Query: 204 KTSSSYSPLPCAAPQC-----KSLDVSACRA-NRCLYQVAYGDGSFTVGDLVTETVSFGN 257
           + SS+Y+ + C+A QC      +L+ SAC A N C+YQ +YGD SF+VG L T+TVSFG+
Sbjct: 180 RASSTYTSVRCSASQCDELQAATLNPSACSASNVCIYQASYGDSSFSVGYLSTDTVSFGS 239

Query: 258 SGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASG 314
           + S      GCG DNEGLF  SAGL+GL    LSL  Q+  +   S +YCL    S  +G
Sbjct: 240 T-SYPSFYYGCGQDNEGLFGRSAGLIGLARNKLSLLYQLAPSLGYSFSYCLPTAAS--TG 296

Query: 315 VLEFNSARGGDAVTAPLIRNKKVD-TFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGII 373
            L       G   +   + +  +D + Y++ L+G SVGG  + + PS     E      I
Sbjct: 297 YLSIGPYNTGHYYSYTPMASSSLDASLYFITLSGMSVGGSPLAVSPS-----EYSSLPTI 351

Query: 374 VDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFG 433
           +D GT ITRL T  + +L  +  +     +     ++ DTC++    + +RVPTV + F 
Sbjct: 352 IDSGTVITRLPTAVHTALSKAVAQAMAGAQRAPAFSILDTCFEGQASQ-LRVPTVVMAFA 410

Query: 434 AGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPN 493
            G ++ L  +N LI VD + T C AFAPT S  +IIGN QQQ   V +D+A +R+GF+  
Sbjct: 411 GGASMKLTTRNVLIDVDDS-TTCLAFAPTDST-AIIGNTQQQTFSVIYDVAQSRIGFSAG 468

Query: 494 KC 495
            C
Sbjct: 469 GC 470


>gi|195625122|gb|ACG34391.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
          Length = 471

 Score =  243 bits (621), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 145/362 (40%), Positives = 209/362 (57%), Gaps = 22/362 (6%)

Query: 145 STPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPC-TECYQQSDPIFDP 203
           S P+  G S G G Y +++G+GTP   ++MV+DTGS + WLQC PC   C++Q  P+FDP
Sbjct: 120 SVPLSPGTSVGVGNYVTQLGLGTPSTSYAMVVDTGSSLTWLQCSPCVVSCHRQVGPLFDP 179

Query: 204 KTSSSYSPLPCAAPQC-----KSLDVSACRA-NRCLYQVAYGDGSFTVGDLVTETVSFGN 257
           + SS+Y+ + C+A QC      +L+ SAC A N C+YQ +YGD SF+VG L T+TVSFG+
Sbjct: 180 RASSTYASVRCSASQCDELQAATLNPSACSASNVCIYQASYGDSSFSVGSLSTDTVSFGS 239

Query: 258 SGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASG 314
           +        GCG DNEGLF  SAGL+GL    LSL  Q+  +   S +YCL    S  +G
Sbjct: 240 T-RYPSFYYGCGQDNEGLFGRSAGLIGLARNKLSLLYQLAPSLGYSFSYCLPTAAS--TG 296

Query: 315 VLEFNSARGGDAVTAPLIRNKKVD-TFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGII 373
            L       G   +   + +  +D + Y++ L+G SVGG  + + PS     E      I
Sbjct: 297 YLSIGPYNTGHYYSYTPMASSSLDASLYFITLSGMSVGGSPLAVSPS-----EYSSLPTI 351

Query: 374 VDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFG 433
           +D GT ITRL T  + +L  +  +     +     ++ DTC++    + +RVPTV++ F 
Sbjct: 352 IDSGTVITRLPTAVHTALSKAVAQAMAGAQRAPAFSILDTCFEGQASQ-LRVPTVAMAFA 410

Query: 434 AGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPN 493
            G ++ L  +N LI VD + T C AFAPT S  +IIGN QQQ   V +D+A +R+GF+  
Sbjct: 411 GGASMKLTTRNVLIDVDDS-TTCLAFAPTDST-AIIGNTQQQTFSVIYDVAQSRIGFSAG 468

Query: 494 KC 495
            C
Sbjct: 469 GC 470


>gi|20197342|gb|AAC34482.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
          Length = 353

 Score =  243 bits (621), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 139/351 (39%), Positives = 190/351 (54%), Gaps = 19/351 (5%)

Query: 162 RIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKS 221
            + +G P  ++S ++DTGSD+ W QC+PCTEC+ Q  PIFDP+ SSSYS + C++  C +
Sbjct: 2   ELSIGNPAVKYSAIVDTGSDLIWTQCKPCTECFDQPTPIFDPEKSSSYSKVGCSSGLCNA 61

Query: 222 LDVSACRANR--CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGL-FVG 278
           L  S C  ++  C Y   YGD S T G L TET +F +  S+ GI  GCG +NEG  F  
Sbjct: 62  LPRSNCNEDKDACEYLYTYGDYSSTRGLLATETFTFEDENSISGIGFGCGVENEGDGFSQ 121

Query: 279 SAGLLGLGGGMLSLTKQIKATSLAYCLVD-RDSPASGVLEFNSARGGDA----------- 326
            +GL+GLG G LSL  Q+K T  +YCL    DS AS  L   S   G             
Sbjct: 122 GSGLVGLGRGPLSLISQLKETKFSYCLTSIEDSEASSSLFIGSLASGIVNKTGASLDGEV 181

Query: 327 -VTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQT 385
             T  L+RN    +FYY+ L G +VG + + +  S FE+ E G GG+I+D GT IT L+ 
Sbjct: 182 TKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFELAEDGTGGMIIDSGTTITYLEE 241

Query: 386 QAYNSLRDSFVRLAGNLKPTSGVALFDTCYDF-SGLRSVRVPTVSLHFGAGKALDLPAKN 444
            A+  L++ F          SG    D C+      +++ VP +  HF  G  L+LP +N
Sbjct: 242 TAFKVLKEEFTSRMSLPVDDSGSTGLDLCFKLPDAAKNIAVPKMIFHF-KGADLELPGEN 300

Query: 445 YLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           Y++   S G  C A   +S+ +SI GNVQQQ   V  DL    V F P +C
Sbjct: 301 YMVADSSTGVLCLAMG-SSNGMSIFGNVQQQNFNVLHDLEKETVSFVPTEC 350


>gi|115479485|ref|NP_001063336.1| Os09g0452400 [Oryza sativa Japonica Group]
 gi|51535935|dbj|BAD38017.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
 gi|113631569|dbj|BAF25250.1| Os09g0452400 [Oryza sativa Japonica Group]
 gi|215693279|dbj|BAG88661.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 441

 Score =  243 bits (621), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 133/358 (37%), Positives = 195/358 (54%), Gaps = 24/358 (6%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAA 216
           GEY   +G+G+PPR FS ++DTGSD+ W QC PC  C +Q  P F+P  S+SY+ LPC++
Sbjct: 86  GEYLMDVGIGSPPRYFSAMIDTGSDLIWTQCAPCLLCVEQPTPYFEPAKSTSYASLPCSS 145

Query: 217 PQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGS---VKGIALGCGHDNE 273
             C +L    C  N C+YQ  YGD + + G L  ET +FG + +   V  ++ GCG+ N 
Sbjct: 146 AMCNALYSPLCFQNACVYQAFYGDSASSAGVLANETFTFGTNSTRVAVPRVSFGCGNMNA 205

Query: 274 GLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEF---------NSARGG 324
           G     +G++G G G LSL  Q+ +   +YCL    SPA+  L F         N++  G
Sbjct: 206 GTLFNGSGMVGFGRGALSLVSQLGSPRFSYCLTSFMSPATSRLYFGAYATLNSTNTSSSG 265

Query: 325 DAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEA-GDGGIIVDCGTAITRL 383
              + P I N  + T Y++ +TG SV G  + I PS+F ++E  G GG+I+D GT +T L
Sbjct: 266 PVQSTPFIVNPALPTMYFLNMTGISVAGDLLPIDPSVFAINETDGTGGVIIDSGTTVTFL 325

Query: 384 QTQAYNSLRDSFVRLAG----NLKPTSGVALFDTCYDF--SGLRSVRVPTVSLHFGAGKA 437
              AY  ++ +FV   G    N  P+     FDTC+ +     R V +P + LHF  G  
Sbjct: 326 AQPAYAMVQGAFVAWVGLPRANATPSD---TFDTCFKWPPPPRRMVTLPEMVLHFD-GAD 381

Query: 438 LDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           ++LP +NY++     G  C A  P+    SIIG+ Q Q   + +DL N+ + F P  C
Sbjct: 382 MELPLENYMVMDGGTGNLCLAMLPSDDG-SIIGSFQHQNFHMLYDLENSLLSFVPAPC 438


>gi|125563957|gb|EAZ09337.1| hypothetical protein OsI_31609 [Oryza sativa Indica Group]
 gi|125605916|gb|EAZ44952.1| hypothetical protein OsJ_29595 [Oryza sativa Japonica Group]
          Length = 438

 Score =  243 bits (621), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 133/358 (37%), Positives = 195/358 (54%), Gaps = 24/358 (6%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAA 216
           GEY   +G+G+PPR FS ++DTGSD+ W QC PC  C +Q  P F+P  S+SY+ LPC++
Sbjct: 83  GEYLMDVGIGSPPRYFSAMIDTGSDLIWTQCAPCLLCVEQPTPYFEPAKSTSYASLPCSS 142

Query: 217 PQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGS---VKGIALGCGHDNE 273
             C +L    C  N C+YQ  YGD + + G L  ET +FG + +   V  ++ GCG+ N 
Sbjct: 143 AMCNALYSPLCFQNACVYQAFYGDSASSAGVLANETFTFGTNSTRVAVPRVSFGCGNMNA 202

Query: 274 GLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEF---------NSARGG 324
           G     +G++G G G LSL  Q+ +   +YCL    SPA+  L F         N++  G
Sbjct: 203 GTLFNGSGMVGFGRGALSLVSQLGSPRFSYCLTSFMSPATSRLYFGAYATLNSTNTSSSG 262

Query: 325 DAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEA-GDGGIIVDCGTAITRL 383
              + P I N  + T Y++ +TG SV G  + I PS+F ++E  G GG+I+D GT +T L
Sbjct: 263 PVQSTPFIVNPALPTMYFLNMTGISVAGDLLPIDPSVFAINETDGTGGVIIDSGTTVTFL 322

Query: 384 QTQAYNSLRDSFVRLAG----NLKPTSGVALFDTCYDF--SGLRSVRVPTVSLHFGAGKA 437
              AY  ++ +FV   G    N  P+     FDTC+ +     R V +P + LHF  G  
Sbjct: 323 AQPAYAMVQGAFVAWVGLPRANATPSD---TFDTCFKWPPPPRRMVTLPEMVLHFD-GAD 378

Query: 438 LDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           ++LP +NY++     G  C A  P+    SIIG+ Q Q   + +DL N+ + F P  C
Sbjct: 379 MELPLENYMVMDGGTGNLCLAMLPSDDG-SIIGSFQHQNFHMLYDLENSLLSFVPAPC 435


>gi|302768196|ref|XP_002967518.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
 gi|300165509|gb|EFJ32117.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
          Length = 398

 Score =  243 bits (621), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 143/379 (37%), Positives = 209/379 (55%), Gaps = 38/379 (10%)

Query: 143 DFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFD 202
           D+ +PV SG     G+Y + I +GTP + FS++ DTGSD+ W+QC+PC  C+ Q DPIFD
Sbjct: 28  DYESPVASGG----GDYVTTISLGTPAKVFSVIADTGSDLIWIQCKPCQACFNQKDPIFD 83

Query: 203 PKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSF----GNS 258
           P+ SSSY+ + C    C SL   +C  + C Y   YGDGS T G L +ETV+     G  
Sbjct: 84  PEGSSSYTTMSCGDTLCDSLPRKSCSPD-CDYSYGYGDGSGTRGTLSSETVTLTSTQGEK 142

Query: 259 GSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIK---ATSLAYCLVD-RDSPASG 314
            + K IA GCGH N G F  ++GL+GLG G LS   Q+        +YCLV  RD+P+  
Sbjct: 143 LAAKNIAFGCGHLNRGSFNDASGLVGLGRGNLSFVSQLGDLFGHKFSYCLVPWRDAPSKT 202

Query: 315 VLEF--------NSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDE 366
              F        +S +       P+I N  +++FYYV L   S+ G+A++IP   F++  
Sbjct: 203 SPMFFGDESSSHSSGKKLHYAFTPMIHNPAMESFYYVKLKDISIAGRALRIPAGSFDIKP 262

Query: 367 AGDGGIIVDCGTAITRLQTQAYN----SLRD--SFVRLAGNLKPTSGVALFDTCYDFSGL 420
            G GG+I D GT +T L    Y     +LR   SF ++ G+       A  D CYD SG 
Sbjct: 263 DGSGGMIFDSGTTLTLLPDAPYQIVLRALRSKISFPKIDGS------SAGLDLCYDVSGS 316

Query: 421 RS---VRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTF-CFAFAPTSSALSIIGNVQQQG 476
           ++   +++P +  HF  G    LP +NY I  + AGT  C A   ++  + I GN+ QQ 
Sbjct: 317 KASYKMKIPAMVFHF-EGADYQLPVENYFIAANDAGTIVCLAMVSSNMDIGIYGNMMQQN 375

Query: 477 TRVSFDLANNRVGFTPNKC 495
            RV +D+ ++++G+ P++C
Sbjct: 376 FRVMYDIGSSKIGWAPSQC 394


>gi|413953782|gb|AFW86431.1| hypothetical protein ZEAMMB73_038825 [Zea mays]
          Length = 462

 Score =  243 bits (620), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 145/376 (38%), Positives = 206/376 (54%), Gaps = 22/376 (5%)

Query: 132 LKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCT 191
           + PAEA  +    + P  +G S G+ E+   +G GTP + ++++ DTGSD++W+QC PC+
Sbjct: 97  IPPAEAPAV----TIPDSTGTSLGTLEFVVTVGFGTPAQTYTLMFDTGSDVSWIQCLPCS 152

Query: 192 -ECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVT 250
             CY+Q DPIFDP  S++YS +PC  PQC +          CLY+V YGDGS T G L  
Sbjct: 153 GHCYKQHDPIFDPTKSATYSAVPCGHPQCAAAGGKCSSNGTCLYKVQYGDGSSTAGVLSH 212

Query: 251 ETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSL---TKQIKATSLAYCLVD 307
           ET+S  ++ ++ G A GCG  N G F    GL+GLG G LSL          + +YCL  
Sbjct: 213 ETLSLTSARALPGFAFGCGETNLGDFGDVDGLIGLGRGQLSLSSQAAASFGAAFSYCLPS 272

Query: 308 RDSPASGVLEFNS---ARGGDAVT-APLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFE 363
            ++ + G L   +   A G D V    +I+ +   +FY+V L    VGG  + +PP LF 
Sbjct: 273 YNT-SHGYLTIGTTTPASGSDGVRYTAMIQKQDYPSFYFVDLVSIVVGGFVLPVPPILFT 331

Query: 364 MDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSV 423
            D     G ++D GT +T L  +AY +LRD F       KP      FDTCYDF+G  ++
Sbjct: 332 RD-----GTLLDSGTVLTYLPPEAYTALRDRFKFTMTQYKPAPAYDPFDTCYDFAGQNAI 386

Query: 424 RVPTVSLHFGAGKALDLPAKNYLI-PVDSA-GTFCFAFAPTSSAL--SIIGNVQQQGTRV 479
            +P VS  F  G + DL     LI P D+A  T C AF P  S +  +I+GN QQ+ T +
Sbjct: 387 FMPLVSFKFSDGSSFDLSPFGVLIFPDDTAPATGCLAFVPRPSTMPFTIVGNTQQRNTEM 446

Query: 480 SFDLANNRVGFTPNKC 495
            +D+A  ++GF    C
Sbjct: 447 IYDVAAEKIGFVSGSC 462


>gi|115448353|ref|NP_001047956.1| Os02g0720900 [Oryza sativa Japonica Group]
 gi|45735844|dbj|BAD12879.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
 gi|45735970|dbj|BAD12999.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
 gi|113537487|dbj|BAF09870.1| Os02g0720900 [Oryza sativa Japonica Group]
 gi|125540930|gb|EAY87325.1| hypothetical protein OsI_08729 [Oryza sativa Indica Group]
 gi|215692622|dbj|BAG88042.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 458

 Score =  243 bits (619), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 157/400 (39%), Positives = 219/400 (54%), Gaps = 21/400 (5%)

Query: 106 LERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGV 165
           L  D AR+++L  +L              A+A +     S P+  GAS G G Y +R+G+
Sbjct: 69  LTHDDARISSLAARLAKTPSARATSLDADADAGLAGSLASVPLSPGASVGVGNYVTRMGL 128

Query: 166 GTPPRQFSMVLDTGSDINWLQCRPC-TECYQQSDPIFDPKTSSSYSPLPCAAPQCK---- 220
           GTP  Q+ MV+DTGS + WLQC PC   C++QS P+F+PK+SS+Y+ + C+A QC     
Sbjct: 129 GTPATQYVMVVDTGSSLTWLQCSPCLVSCHRQSGPVFNPKSSSTYASVGCSAQQCSDLPS 188

Query: 221 -SLDVSAC-RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVG 278
            +L+ SAC  +N C+YQ +YGD SF+VG L  +TVSFG++ S+     GCG DNEGLF  
Sbjct: 189 ATLNPSACSSSNVCIYQASYGDSSFSVGYLSKDTVSFGST-SLPNFYYGCGQDNEGLFGR 247

Query: 279 SAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNK 335
           SAGL+GL    LSL  Q+  +   S  YCL    S +SG L   S   G     P++ + 
Sbjct: 248 SAGLIGLARNKLSLLYQLAPSLGYSFTYCL--PSSSSSGYLSLGSYNPGQYSYTPMVSSS 305

Query: 336 KVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSF 395
             D+ Y++ L+G +V G      P             I+D GT ITRL T  Y++L  + 
Sbjct: 306 LDDSLYFIKLSGMTVAGN-----PLSVSSSAYSSLPTIIDSGTVITRLPTSVYSALSKAV 360

Query: 396 VRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTF 455
                     S  ++ DTC+     R V  P V++ F  G AL L A+N L+ VD + T 
Sbjct: 361 AAAMKGTSRASAYSILDTCFKGQASR-VSAPAVTMSFAGGAALKLSAQNLLVDVDDS-TT 418

Query: 456 CFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           C AFAP  SA +IIGN QQQ   V +D+ ++R+GF    C
Sbjct: 419 CLAFAPARSA-AIIGNTQQQTFSVVYDVKSSRIGFAAGGC 457


>gi|302753526|ref|XP_002960187.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
 gi|300171126|gb|EFJ37726.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
          Length = 398

 Score =  243 bits (619), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 144/379 (37%), Positives = 207/379 (54%), Gaps = 38/379 (10%)

Query: 143 DFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFD 202
           D+ +PV SG     G+Y + I +GTP + FS++ DTGSD+ W+QC+PC  C+ Q DPIFD
Sbjct: 28  DYESPVASGG----GDYVTTISLGTPAKVFSVIADTGSDLIWIQCKPCQACFNQKDPIFD 83

Query: 203 PKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSF----GNS 258
           P+ SSSY+ + C    C SL   +C  N C Y   YGDGS T G L +ETV+     G  
Sbjct: 84  PEGSSSYTTMSCGDTLCDSLPRKSCSPN-CDYSYGYGDGSGTRGTLSSETVTLTSTQGEK 142

Query: 259 GSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIK---ATSLAYCLVD-RDSPASG 314
            + K IA GCGH N G F  ++GL+GLG G LS   Q+        +YCLV  RD+P+  
Sbjct: 143 LAAKNIAFGCGHLNRGSFNDASGLVGLGRGNLSFVSQLGDLFGHKFSYCLVPWRDAPSKT 202

Query: 315 VLEF--------NSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDE 366
              F        +S +       P+I N  +++FYYV L   S+ G+A++IP   F++  
Sbjct: 203 SPMFFGDESSSHSSGKKLHYAFTPMIHNPAMESFYYVKLKDISIAGRALRIPAGSFDIKP 262

Query: 367 AGDGGIIVDCGTAITRLQTQAYN----SLRD--SFVRLAGNLKPTSGVALFDTCYDFSGL 420
            G GG+I D GT +T L    Y     +LR   SF  + G+       A  D CYD SG 
Sbjct: 263 DGSGGMIFDSGTTLTLLPDAPYQIVLRALRSKVSFPEIDGS------SAGLDLCYDVSGS 316

Query: 421 RS---VRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTF-CFAFAPTSSALSIIGNVQQQG 476
           ++    ++P +  HF  G    LP +NY I  + AGT  C A   ++  + I GN+ QQ 
Sbjct: 317 KASYKKKIPAMVFHF-EGADHQLPVENYFIAANDAGTIVCLAMVSSNMDIGIYGNMMQQN 375

Query: 477 TRVSFDLANNRVGFTPNKC 495
            RV +D+ ++++G+ P++C
Sbjct: 376 FRVMYDIGSSKIGWAPSQC 394


>gi|357158688|ref|XP_003578209.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 443

 Score =  242 bits (618), Expect = 3e-61,   Method: Compositional matrix adjust.
 Identities = 139/356 (39%), Positives = 193/356 (54%), Gaps = 19/356 (5%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAA 216
           GEY   +G+GTPPR +S +LDTGSD+ W QC PC  C  Q  P FDP  S SY+ LPC +
Sbjct: 87  GEYLMSMGIGTPPRYYSAILDTGSDLIWTQCAPCMLCVDQPTPFFDPAQSPSYAKLPCNS 146

Query: 217 PQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSG---SVKGIALGCGHDNE 273
           P C +L    C  N C+YQ  YGD + T G L  ET +FG +    +V  IA GCG+ N 
Sbjct: 147 PMCNALYYPLCYRNVCVYQYFYGDSANTAGVLSNETFTFGTNDTRVTVPRIAFGCGNLNA 206

Query: 274 GLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEF--------NSARGGD 325
           G     +G++G G G LSL  Q+ +   +YCL    SP    L F         SA  G+
Sbjct: 207 GSLFNGSGMVGFGRGPLSLVSQLGSPRFSYCLTSFMSPVPSRLYFGAYATLNSTSASTGE 266

Query: 326 AV-TAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEA-GDGGIIVDCGTAITRL 383
            V + P I N  + T YY+ +TG SVGG+ + I PS+F +++A G GG+I+D G+ IT L
Sbjct: 267 PVQSTPFIVNPGLPTMYYLNMTGISVGGELLPIDPSVFAINDADGTGGVIIDSGSTITYL 326

Query: 384 QTQAYNSLRDSFVRLAG--NLKPTSGVALFDTCYDF--SGLRSVRVPTVSLHFGAGKALD 439
              AY+ +  +F    G      TS   + DTC+ +     + V +P ++ HF  G  ++
Sbjct: 327 ARAAYDMVHQAFADQVGLPLTNATSLADVLDTCFVWPPPPRKIVTMPELAFHF-EGANME 385

Query: 440 LPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           LP +NY++     G  C A A +    SIIG+ Q Q   V +D  N+ + FTP  C
Sbjct: 386 LPLENYMLIDGDTGNLCLAIAASDDG-SIIGSFQHQNFHVLYDNENSLLSFTPATC 440


>gi|356569916|ref|XP_003553140.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 560

 Score =  241 bits (614), Expect = 9e-61,   Method: Compositional matrix adjust.
 Identities = 140/372 (37%), Positives = 200/372 (53%), Gaps = 26/372 (6%)

Query: 150 SGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSY 209
           SG S GSGEYF  + VGTPP+ FS++LDTGSD+NW+QC PC  C++Q+ P +DPK SSS+
Sbjct: 186 SGVSLGSGEYFMDVFVGTPPKHFSLILDTGSDLNWIQCVPCYACFEQNGPYYDPKDSSSF 245

Query: 210 SPLPCAAPQCKSLDV----SACRA--NRCLYQVAYGDGSFTVGDLVTETVSFGNSGS--- 260
             + C  P+C+ +        C+     C Y   YGD S T GD   ET +   +     
Sbjct: 246 KNITCHDPRCQLVSSPDPPQPCKGETQSCPYFYWYGDSSNTTGDFALETFTVNLTTPEGK 305

Query: 261 -----VKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKA---TSLAYCLVDRDSPA 312
                V+ +  GCGH N GLF G+AGLLGLG G LS   Q+++    S +YCLVDR+S +
Sbjct: 306 PELKIVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFATQLQSLYGHSFSYCLVDRNSNS 365

Query: 313 SGVLEFNSARGGDAVTAPLI--------RNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEM 364
           S   +       + ++ P +        +   VDTFYYV +    VGG+ ++IP   + +
Sbjct: 366 SVSSKLIFGEDKELLSHPNLNFTSFVGGKENPVDTFYYVLIKSIMVGGEVLKIPEETWHL 425

Query: 365 DEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVR 424
              G GG I+D GT +T     AY  ++++F+R                CY+ SG+  + 
Sbjct: 426 SAQGGGGTIIDSGTTLTYFAEPAYEIIKEAFMRKIKGFPLVETFPPLKPCYNVSGVEKME 485

Query: 425 VPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPT-SSALSIIGNVQQQGTRVSFDL 483
           +P  ++ F  G   D P +NY I ++     C A   T  SALSIIGN QQQ   + +DL
Sbjct: 486 LPEFAILFADGAMWDFPVENYFIQIEPEDVVCLAILGTPRSALSIIGNYQQQNFHILYDL 545

Query: 484 ANNRVGFTPNKC 495
             +R+G+ P KC
Sbjct: 546 KKSRLGYAPMKC 557


>gi|168000296|ref|XP_001752852.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162696015|gb|EDQ82356.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 384

 Score =  241 bits (614), Expect = 9e-61,   Method: Compositional matrix adjust.
 Identities = 141/365 (38%), Positives = 197/365 (53%), Gaps = 17/365 (4%)

Query: 142 EDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIF 201
           ++F +PV +G    +GEY   + +G+PP+ F +++DTGSD+NW+QC PC  CYQQ  P F
Sbjct: 26  QEFQSPVKAG----NGEYLMTLTLGSPPQSFDVIVDTGSDLNWVQCLPCRVCYQQPGPKF 81

Query: 202 DPKTSSSYSPLPCAAPQCK--SLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSG 259
           DP  S S+    C    C   +L + AC AN C YQ  YGD S T GDL  ET+S  N  
Sbjct: 82  DPSKSRSFRKAACTDNLCNVSALPLKACAANVCQYQYTYGDQSNTNGDLAFETISLNNGA 141

Query: 260 ---SVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIK---ATSLAYCLVDRDSPAS 313
              SV   A GCG  N G F G+AGL+GLG G LSL  Q+    A   +YCLV  +S ++
Sbjct: 142 GTQSVPNFAFGCGTQNLGTFAGAAGLVGLGQGPLSLNSQLSHTFANKFSYCLVSLNSLSA 201

Query: 314 GVLEFNS-ARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEA-GDGG 371
             L F S A   +     ++ N +  T+YYV L    VGGQ + + PS+F +D++ G GG
Sbjct: 202 SPLTFGSIAAAANIQYTSIVVNARHPTYYYVQLNSIEVGGQPLNLAPSVFAIDQSTGRGG 261

Query: 372 IIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLH 431
            I+D GT IT L   AY+++  ++       +        D C++ +G+ +  VP +   
Sbjct: 262 TIIDSGTTITMLTLPAYSAVLRAYESFVNYPRLDGSAYGLDLCFNIAGVSNPSVPDMVFK 321

Query: 432 FGAGKALDLPAKNYLIPVD-SAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGF 490
           F  G    +  +N  + VD SA T C A    S   SIIGN+QQQ   V +DL   ++GF
Sbjct: 322 F-QGADFQMRGENLFVLVDTSATTLCLAMG-GSQGFSIIGNIQQQNHLVVYDLEAKKIGF 379

Query: 491 TPNKC 495
               C
Sbjct: 380 ATADC 384


>gi|224142007|ref|XP_002324352.1| predicted protein [Populus trichocarpa]
 gi|222865786|gb|EEF02917.1| predicted protein [Populus trichocarpa]
          Length = 460

 Score =  240 bits (613), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 165/434 (38%), Positives = 227/434 (52%), Gaps = 26/434 (5%)

Query: 75  NSSSSFSLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKP 134
           N SS  S  L S E++H  RH     +V      D+     +  + Q     VD    + 
Sbjct: 39  NHSSKVSNSL-SLEVVH--RHGPCIGIVNQEKGADAPSNMEIFLRDQ---NRVDSIHARL 92

Query: 135 AEAQILPEDFST--PVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTE 192
           +   + PE  +T  PV SGAS G+G+Y   +G+GTP ++F+++ DTGSDI W QC PC +
Sbjct: 93  SSRGMFPEKQATTLPVQSGASIGAGDYVVTVGLGTPKKEFTLIFDTGSDITWTQCEPCVK 152

Query: 193 -CYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDV-----SACRANRCLYQVAYGDGSFTVG 246
            CY+Q +P  +P TS+SY  + C++  CK +        +C ++ CLYQV YGDGS+++G
Sbjct: 153 TCYKQKEPRLNPSTSTSYKNISCSSALCKLVASGKKFSQSCSSSTCLYQVQYGDGSYSIG 212

Query: 247 DLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS---LAY 303
              TET++  +S   K    GCG  N GLF G+AGLLGLG   L+L  Q   T     +Y
Sbjct: 213 FFATETLTLSSSNVFKNFLFGCGQQNNGLFGGAAGLLGLGRTKLALPSQTAKTYKKLFSY 272

Query: 304 CLVDRDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFE 363
           CL    S + G L             PL  +     FY + +TG SVGG+ + I  S F 
Sbjct: 273 CL-PASSSSKGYLSLGGQVSKSVKFTPLSADFDSTPFYGLDITGLSVGGRKLSIDESAFS 331

Query: 364 MDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSV 423
                  G ++D GT ITRL   AY+ L  +F  L  +   TSG ++FDTCYDFS   +V
Sbjct: 332 ------AGTVIDSGTVITRLSPTAYSELSSAFQNLMTDYPSTSGYSIFDTCYDFSKYDTV 385

Query: 424 RVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPT--SSALSIIGNVQQQGTRVSF 481
           R+P V + F  G  +D+     L PV+     C AFA     S  SI GNVQQ+  +V +
Sbjct: 386 RIPKVGVTFKGGVEMDIDVSGILYPVNGLKKVCLAFAGNDDDSDTSIFGNVQQRTYQVVY 445

Query: 482 DLANNRVGFTPNKC 495
           D A  RVGF P  C
Sbjct: 446 DGAKGRVGFAPGGC 459


>gi|224142005|ref|XP_002324351.1| predicted protein [Populus trichocarpa]
 gi|222865785|gb|EEF02916.1| predicted protein [Populus trichocarpa]
          Length = 472

 Score =  240 bits (612), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 165/434 (38%), Positives = 227/434 (52%), Gaps = 26/434 (5%)

Query: 75  NSSSSFSLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKP 134
           N SS  S  L S E++H  RH     +V      D+     +  + Q     VD    + 
Sbjct: 51  NHSSKVSNSL-SLEVVH--RHGPCIGIVNQEKGADAPSNMEIFLRDQ---NRVDSIHARL 104

Query: 135 AEAQILPEDFST--PVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTE 192
           +   + PE  +T  PV SGAS G+G+Y   +G+GTP ++F+++ DTGSDI W QC PC +
Sbjct: 105 SSRGMFPEKQATTLPVQSGASIGAGDYVVTVGLGTPKKEFTLIFDTGSDITWTQCEPCVK 164

Query: 193 -CYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDV-----SACRANRCLYQVAYGDGSFTVG 246
            CY+Q +P  +P TS+SY  + C++  CK +        +C ++ CLYQV YGDGS+++G
Sbjct: 165 TCYKQKEPRLNPSTSTSYKNISCSSALCKLVASGKKFSQSCSSSTCLYQVQYGDGSYSIG 224

Query: 247 DLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS---LAY 303
              TET++  +S   K    GCG  N GLF G+AGLLGLG   L+L  Q   T     +Y
Sbjct: 225 FFATETLTLSSSNVFKNFLFGCGQQNNGLFGGAAGLLGLGRTKLALPSQTAKTYKKLFSY 284

Query: 304 CLVDRDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFE 363
           CL    S + G L             PL  +     FY + +TG SVGG+ + I  S F 
Sbjct: 285 CL-PASSSSKGYLSLGGQVSKSVKFTPLSADFDSTPFYGLDITGLSVGGRKLSIDESAFS 343

Query: 364 MDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSV 423
                  G ++D GT ITRL   AY+ L  +F  L  +   TSG ++FDTCYDFS   +V
Sbjct: 344 ------AGTVIDSGTVITRLSPTAYSELSSAFQNLMTDYPSTSGYSIFDTCYDFSKYDTV 397

Query: 424 RVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPT--SSALSIIGNVQQQGTRVSF 481
           R+P V + F  G  +D+     L PV+     C AFA     S  SI GNVQQ+  +V +
Sbjct: 398 RIPKVGVTFKGGVEMDIDVSGILYPVNGLKKVCLAFAGNDDDSDTSIFGNVQQRTYQVVY 457

Query: 482 DLANNRVGFTPNKC 495
           D A  RVGF P  C
Sbjct: 458 DGAKGRVGFAPGGC 471


>gi|125590542|gb|EAZ30892.1| hypothetical protein OsJ_14967 [Oryza sativa Japonica Group]
          Length = 516

 Score =  240 bits (612), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 137/344 (39%), Positives = 191/344 (55%), Gaps = 16/344 (4%)

Query: 165 VGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDV 224
           +GTP   +S ++DTGSD+ W QC+PC +C++QS P+FDP +SS+Y+ +PC++  C  L  
Sbjct: 173 IGTPALAYSAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSSSTYATVPCSSASCSDLPT 232

Query: 225 SAC-RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEG-LFVGSAGL 282
           S C  A++C Y   YGD S T G L TET +   S  + G+  GCG  NEG  F   AGL
Sbjct: 233 SKCTSASKCGYTYTYGDSSSTQGVLATETFTLAKS-KLPGVVFGCGDTNEGDGFSQGAGL 291

Query: 283 LGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARG--------GDAVTAPLIRN 334
           +GLG G LSL  Q+     +YCL   D   +  L   S  G            T PLI+N
Sbjct: 292 VGLGRGPLSLVSQLGLDKFSYCLTSLDDTNNSPLLLGSLAGISEASAAASSVQTTPLIKN 351

Query: 335 KKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDS 394
               +FYYV L   +VG   + +P S F + + G GG+IVD GT+IT L+ Q Y +L+ +
Sbjct: 352 PSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTGGVIVDSGTSITYLEVQGYRALKKA 411

Query: 395 F-VRLAGNLKPTSGVALFDTCYD--FSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDS 451
           F  ++A      SGV L D C+     G+  V VP +  HF  G  LDLPA+NY++    
Sbjct: 412 FAAQMALPAADGSGVGL-DLCFRAPAKGVDQVEVPRLVFHFDGGADLDLPAENYMVLDGG 470

Query: 452 AGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           +G  C      S  LSIIGN QQQ  +  +D+ ++ + F P +C
Sbjct: 471 SGALCLTVM-GSRGLSIIGNFQQQNFQFVYDVGHDTLSFAPVQC 513


>gi|242044886|ref|XP_002460314.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
 gi|241923691|gb|EER96835.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
          Length = 444

 Score =  239 bits (611), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 138/354 (38%), Positives = 190/354 (53%), Gaps = 17/354 (4%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAA 216
           GEY   +G+GTP R +S +LDTGSD+ W QC PC  C  Q  P FDP  SS+Y  L C+A
Sbjct: 90  GEYLMEMGIGTPARFYSAILDTGSDLIWTQCAPCLLCVDQPTPYFDPANSSTYRSLGCSA 149

Query: 217 PQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSG---SVKGIALGCGHDNE 273
           P C +L    C    C+YQ  YGD + T G L  ET +FG +    ++  I+ GCG+ N 
Sbjct: 150 PACNALYYPLCYQKTCVYQYFYGDSASTAGVLANETFTFGTNDTRVTLPRISFGCGNLNA 209

Query: 274 GLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEF------NSARGGDAV 327
           G     +G++G G G LSL  Q+ +   +YCL    SP    L F      NS       
Sbjct: 210 GSLANGSGMVGFGRGSLSLVSQLGSPRFSYCLTSFLSPVRSRLYFGAYATLNSTNASTVQ 269

Query: 328 TAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEM-DEAGDGGIIVDCGTAITRLQTQ 386
           + P I N  + T Y++ +TG SVGG  + I P++  + D  G GG I+D GT IT L   
Sbjct: 270 STPFIINPALPTMYFLNMTGISVGGNRLPIDPAVLAINDTDGTGGTIIDSGTTITYLAEP 329

Query: 387 AYNSLRDSFVRLAGNLKPTSGV---ALFDTCYDF--SGLRSVRVPTVSLHFGAGKALDLP 441
           AY ++R++FV    +  P   V   ++ DTC+ +     +SV +P + LHF  G   +LP
Sbjct: 330 AYYAVREAFVLYLNSTLPLLDVTETSVLDTCFQWPPPPRQSVTLPQLVLHFD-GADWELP 388

Query: 442 AKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            +NY++   S G  C A A TSS  SIIG+ Q Q   V +DL N+ + F P  C
Sbjct: 389 LQNYMLVDPSTGGLCLAMA-TSSDGSIIGSYQHQNFNVLYDLENSLLSFVPAPC 441


>gi|125561848|gb|EAZ07296.1| hypothetical protein OsI_29544 [Oryza sativa Indica Group]
          Length = 448

 Score =  239 bits (610), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 131/366 (35%), Positives = 202/366 (55%), Gaps = 21/366 (5%)

Query: 148 VVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSS 207
           ++  ASQG  EY   + +GTPP +++ ++DTGSD+ W QC PC  C  Q  P F P  S+
Sbjct: 83  ILVAASQG--EYLMDLAIGTPPLRYTAMVDTGSDLIWTQCAPCVLCADQPTPYFRPARSA 140

Query: 208 SYSPLPCAAPQCKSLDVSAC-RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGS----VK 262
           +Y  +PC +P C +L   AC + + C+YQ  YGD + T G L +ET +FG + S    V 
Sbjct: 141 TYRLVPCRSPLCAALPYPACFQRSVCVYQYYYGDEASTAGVLASETFTFGAANSSKVMVS 200

Query: 263 GIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFN--- 319
            +A GCG+ N G    S+G++GLG G LSL  Q+  +  +YCL    SP    L F    
Sbjct: 201 DVAFGCGNINSGQLANSSGMVGLGRGPLSLVSQLGPSRFSYCLTSFLSPEPSRLNFGVFA 260

Query: 320 -------SARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGI 372
                  S+ G    + PL+ N  + + Y++ L G S+G + + I P +F +++ G GG+
Sbjct: 261 TLNGTNASSSGSPVQSTPLVVNAALPSLYFMSLKGISLGQKRLPIDPLVFAINDDGTGGV 320

Query: 373 IVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVAL-FDTCYDFSGLRS--VRVPTVS 429
            +D GT++T LQ  AY+++R   V +   L PT+   +  +TC+ +    S  V VP + 
Sbjct: 321 FIDSGTSLTWLQQDAYDAVRHELVSVLRPLPPTNDTEIGLETCFPWPPPPSVAVTVPDME 380

Query: 430 LHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVG 489
           LHF  G  + +P +NY++   + G  C A   +  A +IIGN QQQ   + +D+AN+ + 
Sbjct: 381 LHFDGGANMTVPPENYMLIDGATGFLCLAMIRSGDA-TIIGNYQQQNMHILYDIANSLLS 439

Query: 490 FTPNKC 495
           F P  C
Sbjct: 440 FVPAPC 445


>gi|242066168|ref|XP_002454373.1| hypothetical protein SORBIDRAFT_04g029630 [Sorghum bicolor]
 gi|241934204|gb|EES07349.1| hypothetical protein SORBIDRAFT_04g029630 [Sorghum bicolor]
          Length = 458

 Score =  239 bits (609), Expect = 3e-60,   Method: Compositional matrix adjust.
 Identities = 160/404 (39%), Positives = 231/404 (57%), Gaps = 27/404 (6%)

Query: 106 LERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDF-STPVVSGASQGSGEYFSRIG 164
           L  D AR+ +L  +L     +      + + +    E   S P+  G S G G Y +R+G
Sbjct: 67  LTHDHARIASLAARLAKTPSSRPTKLRRGSSSSPDAESLASVPLGPGTSVGVGNYVTRMG 126

Query: 165 VGTPPRQFSMVLDTGSDINWLQCRPC-TECYQQSDPIFDPKTSSSYSPLPCAAPQC---- 219
           +GTP + + MV+DTGS + WLQC PC   C++QS P+F+P++SSSY+ + C+APQC    
Sbjct: 127 LGTPAKSYVMVVDTGSSLTWLQCSPCLVSCHRQSGPVFNPRSSSSYASVSCSAPQCDALT 186

Query: 220 -KSLDVSACR-ANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFV 277
             +L+ S C  +N C+YQ +YGD SF+VG L  +TVSFG++ SV     GCG DNEGLF 
Sbjct: 187 TATLNPSTCSTSNVCIYQASYGDSSFSVGYLSKDTVSFGST-SVPNFYYGCGQDNEGLFG 245

Query: 278 GSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRN 334
            SAGL+GL    LSL  Q+  +   S +YCL    S +  +   +   G  + T P+ ++
Sbjct: 246 QSAGLIGLARNKLSLLYQLAPSMGYSFSYCLPTSSSSSGYLSIGSYNPGQYSYT-PMAKS 304

Query: 335 KKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDS 394
              D+ Y++ +TG +V G+ + +  S +          I+D GT ITRL T  Y++L  +
Sbjct: 305 SLDDSLYFIKMTGITVAGKPLSVSASAYSSLP-----TIIDSGTVITRLPTDVYSALSKA 359

Query: 395 FVRLAGNLKPT---SGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDS 451
              +AG +K T   S  ++ DTC+     R +RVP VS+ F  G AL L A N L+ VDS
Sbjct: 360 ---VAGAMKGTPRASAFSILDTCFQGQASR-LRVPQVSMAFAGGAALKLKATNLLVDVDS 415

Query: 452 AGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           A T C AFAP  SA +IIGN QQQ   V +D+ N+++GF    C
Sbjct: 416 ATT-CLAFAPARSA-AIIGNTQQQTFSVVYDVKNSKIGFAAGGC 457


>gi|115476828|ref|NP_001062010.1| Os08g0469000 [Oryza sativa Japonica Group]
 gi|42407407|dbj|BAD09565.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
 gi|113623979|dbj|BAF23924.1| Os08g0469000 [Oryza sativa Japonica Group]
 gi|125603713|gb|EAZ43038.1| hypothetical protein OsJ_27627 [Oryza sativa Japonica Group]
          Length = 448

 Score =  239 bits (609), Expect = 3e-60,   Method: Compositional matrix adjust.
 Identities = 131/366 (35%), Positives = 202/366 (55%), Gaps = 21/366 (5%)

Query: 148 VVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSS 207
           ++  ASQG  EY   + +GTPP +++ ++DTGSD+ W QC PC  C  Q  P F P  S+
Sbjct: 83  ILVAASQG--EYLMDLAIGTPPLRYTAMVDTGSDLIWTQCAPCVLCADQPTPYFRPARSA 140

Query: 208 SYSPLPCAAPQCKSLDVSAC-RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGS----VK 262
           +Y  +PC +P C +L   AC + + C+YQ  YGD + T G L +ET +FG + S    V 
Sbjct: 141 TYRLVPCRSPLCAALPYPACFQRSVCVYQYYYGDEASTAGVLASETFTFGAANSSKVMVS 200

Query: 263 GIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFN--- 319
            +A GCG+ N G    S+G++GLG G LSL  Q+  +  +YCL    SP    L F    
Sbjct: 201 DVAFGCGNINSGQLANSSGMVGLGRGPLSLVSQLGPSRFSYCLTSFLSPEPSRLNFGVFA 260

Query: 320 -------SARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGI 372
                  S+ G    + PL+ N  + + Y++ L G S+G + + I P +F +++ G GG+
Sbjct: 261 TLNGTNASSSGSPVQSTPLVVNAALPSLYFMSLKGISLGQKRLPIDPLVFAINDDGTGGV 320

Query: 373 IVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVAL-FDTCYDFSGLRS--VRVPTVS 429
            +D GT++T LQ  AY+++R   V +   L PT+   +  +TC+ +    S  V VP + 
Sbjct: 321 FIDSGTSLTWLQQDAYDAVRRELVSVLRPLPPTNDTEIGLETCFPWPPPPSVAVTVPDME 380

Query: 430 LHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVG 489
           LHF  G  + +P +NY++   + G  C A   +  A +IIGN QQQ   + +D+AN+ + 
Sbjct: 381 LHFDGGANMTVPPENYMLIDGATGFLCLAMIRSGDA-TIIGNYQQQNMHILYDIANSLLS 439

Query: 490 FTPNKC 495
           F P  C
Sbjct: 440 FVPAPC 445


>gi|224142009|ref|XP_002324353.1| predicted protein [Populus trichocarpa]
 gi|222865787|gb|EEF02918.1| predicted protein [Populus trichocarpa]
          Length = 412

 Score =  238 bits (608), Expect = 4e-60,   Method: Compositional matrix adjust.
 Identities = 152/382 (39%), Positives = 208/382 (54%), Gaps = 20/382 (5%)

Query: 127 VDRHELKPAEAQILPEDFST--PVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINW 184
           VD    + +   + PE  +T  PV SGAS G+G+Y   +G+GTP ++F+++ DTGSDI W
Sbjct: 37  VDSIHARLSSRGMFPEKQATTLPVQSGASIGAGDYVVTVGLGTPKKEFTLIFDTGSDITW 96

Query: 185 LQCRPCTE-CYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDV-----SACRANRCLYQVAY 238
            QC PC + CY+Q +P  +P TS+SY  + C++  CK +        +C ++ CLYQV Y
Sbjct: 97  TQCEPCVKTCYKQKEPRLNPSTSTSYKNISCSSALCKLVASGKKFSQSCSSSTCLYQVQY 156

Query: 239 GDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKA 298
           GDGS+++G   TET++  +S   K    GCG  N GLF G+AGLLGLG   L+L  Q   
Sbjct: 157 GDGSYSIGFFATETLTLSSSNVFKNFLFGCGQQNNGLFGGAAGLLGLGRTKLALPSQTAK 216

Query: 299 TS---LAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAV 355
           T     +YCL    S + G L             PL  +     FY + +TG SVGG+ +
Sbjct: 217 TYKKLFSYCL-PASSSSKGYLSLGGQVSKSVKFTPLSADFDSTPFYGLDITGLSVGGRQL 275

Query: 356 QIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCY 415
            I  S F        G ++D GT ITRL   AY+ L  +F  L  +   TSG ++FDTCY
Sbjct: 276 SIDESAFS------AGTVIDSGTVITRLSPTAYSELSSAFQNLMTDYPSTSGYSIFDTCY 329

Query: 416 DFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPT--SSALSIIGNVQ 473
           DFS   +VR+P V + F  G  +D+     L PV+     C AFA     S  SI GNVQ
Sbjct: 330 DFSKYDTVRIPKVGVTFKGGVEMDIDVSGILYPVNGLKKVCLAFAGNDDDSDTSIFGNVQ 389

Query: 474 QQGTRVSFDLANNRVGFTPNKC 495
           Q+  +V +D A  RVGF P  C
Sbjct: 390 QRTYQVVYDGAKGRVGFAPGGC 411


>gi|226504334|ref|NP_001141706.1| uncharacterized protein LOC100273835 precursor [Zea mays]
 gi|194705620|gb|ACF86894.1| unknown [Zea mays]
 gi|414885968|tpg|DAA61982.1| TPA: hypothetical protein ZEAMMB73_231717 [Zea mays]
          Length = 477

 Score =  238 bits (607), Expect = 6e-60,   Method: Compositional matrix adjust.
 Identities = 142/401 (35%), Positives = 212/401 (52%), Gaps = 25/401 (6%)

Query: 106 LERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGV 165
           L RD  RV+ +  K+           +  A +   P+     V  G    +  YF+ + +
Sbjct: 90  LGRDQDRVDAIRRKVA---------AVTTAASSSKPKGVPLQVGWGKYLDTTNYFTSLRL 140

Query: 166 GTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVS 225
           GTP     + LDTGSD +W+QC+PC +CY+Q + +FDP  SS+YS + C++ +C+ L  S
Sbjct: 141 GTPATDLLVELDTGSDQSWIQCKPCPDCYEQHEALFDPSKSSTYSDITCSSRECQELGSS 200

Query: 226 A---CRAN-RCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAG 281
               C ++ +C Y++ Y D S+TVG+L  +T++   + +V G   GCGH+N G F    G
Sbjct: 201 HKHNCSSDKKCPYEITYADDSYTVGNLARDTLTLSPTDAVPGFVFGCGHNNAGSFGEIDG 260

Query: 282 LLGLGGGMLSLTKQIKA---TSLAYCLVDRDSPASGVLEFNSARGGDAVTAPLIR--NKK 336
           LLGLG G  SL+ Q+ A      +YCL    S A+G L F+ A       A        +
Sbjct: 261 LLGLGRGKASLSSQVAARYGAGFSYCLPSSPS-ATGYLSFSGAAAAAPTNAQFTEMVAGQ 319

Query: 337 VDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFV 396
             +FYY+ LTG +V G+A+++PPS+F    A   G I+D GTA + L   AY +LR S  
Sbjct: 320 HPSFYYLNLTGITVAGRAIKVPPSVF----ATAAGTIIDSGTAFSCLPPSAYAALRSSVR 375

Query: 397 RLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFC 456
              G  K      +FDTCYD +G  +VR+P+V+L F  G  + L     L    +    C
Sbjct: 376 SAMGRYKRAPSSTIFDTCYDLTGHETVRIPSVALVFADGATVHLHPSGVLYTWSNVSQTC 435

Query: 457 FAFAPT--SSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            AF P    ++L ++GN QQ+   V +D+ N +VGF  N C
Sbjct: 436 LAFLPNPDDTSLGVLGNTQQRTLAVIYDVDNQKVGFGANGC 476


>gi|242093566|ref|XP_002437273.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
 gi|241915496|gb|EER88640.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
          Length = 503

 Score =  238 bits (606), Expect = 6e-60,   Method: Compositional matrix adjust.
 Identities = 145/361 (40%), Positives = 194/361 (53%), Gaps = 22/361 (6%)

Query: 147 PVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTE-CYQQSDPIFDPKT 205
           P  SG S  +G Y   I +GTP  +F++V DTGSD  W+QC+PC   CYQQ +P+F P  
Sbjct: 153 PAKSGLSLNTGNYVVPIRLGTPAARFTVVFDTGSDTTWVQCQPCVAYCYQQKEPLFTPTK 212

Query: 206 SSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIA 265
           S++Y+ + C +  C  LD   C    CLY V YGDGS+TVG    +T++ G   +VK   
Sbjct: 213 SATYANISCTSSYCSDLDTRGCSGGHCLYAVQYGDGSYTVGFYAQDTLTLGYD-TVKDFR 271

Query: 266 LGCGHDNEGLFVGSAGLLGLGGGMLSLTKQI---KATSLAYCLVDRDSPASGVLEFNSAR 322
            GCG  N GLF  +AGL+GLG G  S+  Q     +   AYC +   S  +G L+F    
Sbjct: 272 FGCGEKNRGLFGKAAGLMGLGRGKTSVPVQAYDKYSGVFAYC-IPATSSGTGFLDFGPGA 330

Query: 323 GGDA---VTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTA 379
              A   +T  L+ N    TFYYVG+TG  VGG  + IP ++F      D G +VD GT 
Sbjct: 331 PAAANARLTPMLVDNGP--TFYYVGMTGIKVGGHLLSIPATVFS-----DAGALVDSGTV 383

Query: 380 ITRLQTQAYNSLRDSFVRLAGNL--KPTSGVALFDTCYDFSGLR-SVRVPTVSLHFGAGK 436
           ITRL   AY  LR +F +    L  K     ++ DTCYD +G + S+ +P VSL F  G 
Sbjct: 384 ITRLPPSAYEPLRSAFAKGMEGLGYKTAPAFSILDTCYDLTGYQGSIALPAVSLVFQGGA 443

Query: 437 ALDLPAKNYLIPVDSAGTFCFAFAPT--SSALSIIGNVQQQGTRVSFDLANNRVGFTPNK 494
            LD+ A   L   D +   C AFA     + ++I+GN QQ+   V +DL    VGF P  
Sbjct: 444 CLDVDASGILYVADVSQA-CLAFAANDDDTDMTIVGNTQQKTYSVLYDLGKKVVGFAPGA 502

Query: 495 C 495
           C
Sbjct: 503 C 503


>gi|54290717|dbj|BAD62387.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
           sativa Japonica Group]
 gi|215734915|dbj|BAG95637.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 469

 Score =  236 bits (603), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 146/362 (40%), Positives = 203/362 (56%), Gaps = 21/362 (5%)

Query: 145 STPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCT-ECYQQSDPIFDP 203
           S P+  GAS   G Y +R+G+GTP   + MV+DTGS + WLQC PC+  C++Q+ P+FDP
Sbjct: 117 SVPLTPGASVAVGNYVTRLGLGTPATSYVMVVDTGSSLTWLQCSPCSVSCHRQAGPVFDP 176

Query: 204 KTSSSYSPLPCAAPQC-----KSLDVSACR-ANRCLYQVAYGDGSFTVGDLVTETVSFGN 257
           + S +Y+ + C++ +C      +L+ SAC  +N C+YQ +YGD S++VG L  +TVSFG 
Sbjct: 177 RASGTYAAVQCSSSECGELQAATLNPSACSVSNVCIYQASYGDSSYSVGYLSKDTVSFG- 235

Query: 258 SGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASG 314
           SGS  G   GCG DNEGLF  SAGL+GL    LSL  Q+  +   + +YCL    S A+G
Sbjct: 236 SGSFPGFYYGCGQDNEGLFGRSAGLIGLAKNKLSLLYQLAPSLGYAFSYCL-PTSSAAAG 294

Query: 315 VLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIV 374
            L   S   G     P+  +    + Y+V L+G SV G  + +PPS +          I+
Sbjct: 295 YLSIGSYNPGQYSYTPMASSSLDASLYFVTLSGISVAGAPLAVPPSEYRSLP-----TII 349

Query: 375 DCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGV-ALFDTCYDFSGLRSVRVPTVSLHFG 433
           D GT ITRL    Y +L  +      +  P +   ++ DTC+  S    +RVP V + F 
Sbjct: 350 DSGTVITRLPPNVYTALSRAVAAAMASAAPRAPTYSILDTCFRGSA-AGLRVPRVDMAFA 408

Query: 434 AGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPN 493
            G  L L   N LI VD + T C AFAPT    +IIGN QQQ   V +D+A +R+GF   
Sbjct: 409 GGATLALSPGNVLIDVDDS-TTCLAFAPT-GGTAIIGNTQQQTFSVVYDVAQSRIGFAAG 466

Query: 494 KC 495
            C
Sbjct: 467 GC 468


>gi|293332735|ref|NP_001168472.1| uncharacterized protein LOC100382248 [Zea mays]
 gi|223948487|gb|ACN28327.1| unknown [Zea mays]
          Length = 434

 Score =  236 bits (603), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 141/360 (39%), Positives = 198/360 (55%), Gaps = 20/360 (5%)

Query: 147 PVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTE-CYQQSDPIFDPKT 205
           P   G + G+G Y   + +GTP  +F++V DTGSD  W+QC+PC   CY+Q +P+FDP  
Sbjct: 84  PASYGVALGTGNYVVPVRLGTPAERFTVVFDTGSDTTWVQCQPCVAYCYRQKEPLFDPTK 143

Query: 206 SSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIA 265
           S++Y+ + C++  C  L VS C    CLY + YGDGS+T+G    +T++     ++K   
Sbjct: 144 SATYANISCSSSYCSDLYVSGCSGGHCLYGIQYGDGSYTIGFYAQDTLTLAYD-TIKNFR 202

Query: 266 LGCGHDNEGLFVGSAGLLGLGGGMLSLTKQI---KATSLAYCLVDRDSPASGVLEFN-SA 321
            GCG  N GLF  +AGLLGLG G  SL  Q         AYCL    S  +G L+    A
Sbjct: 203 FGCGEKNRGLFGRAAGLLGLGRGKTSLPVQAYDKYGGVFAYCL-PATSAGTGFLDLGPGA 261

Query: 322 RGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAIT 381
              +A   P++ ++   TFYYVG+TG  VGG  + IP S+F        G +VD GT IT
Sbjct: 262 PAANARLTPMLVDRG-PTFYYVGMTGIKVGGHVLPIPGSVFST-----AGTLVDSGTVIT 315

Query: 382 RLQTQAYNSLRDSFVRLAGNLKPTSGVA--LFDTCYDFSGLR--SVRVPTVSLHFGAGKA 437
           RL   AY  LR +F +    L  ++  A  + DTCYD +G +  S+ +P VSL F  G  
Sbjct: 316 RLPPSAYAPLRSAFSKAMQGLGYSAAPAFSILDTCYDLTGHKGGSIALPAVSLVFQGGAC 375

Query: 438 LDLPAKNYLIPVDSAGTFCFAFAPTS--SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           LD+ A   L   D +   C AFAP +  + ++I+GN QQ+   V +D+    VGF P  C
Sbjct: 376 LDVDASGILYVADVS-QACLAFAPNADDTDVAIVGNTQQKTHGVLYDIGKKIVGFAPGAC 434


>gi|242079447|ref|XP_002444492.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
 gi|241940842|gb|EES13987.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
          Length = 441

 Score =  236 bits (602), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 143/427 (33%), Positives = 210/427 (49%), Gaps = 26/427 (6%)

Query: 90  LHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQ---ILPEDFST 146
           L   R ND     L     D+    T  TK QL    + R + + A  Q   + P   + 
Sbjct: 17  LPVARCNDNVGFQLKLTHVDAG---TSYTKPQLLSRAIARSKARVAALQSAAVSPAPVAD 73

Query: 147 PVVSG---ASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDP 203
           P+ +     +  SGEY   + +GTPP  ++ ++DTGSD+ W QC PC  C  Q  P FD 
Sbjct: 74  PITAARVLVTASSGEYLVDLAIGTPPLYYTAIMDTGSDLIWTQCAPCLLCAAQPTPYFDV 133

Query: 204 KTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVK- 262
           K S++Y  LPC + +C +L   +C    C+YQ  YGD + T G L  ET +FG + S K 
Sbjct: 134 KRSATYRALPCRSSRCAALSSPSCFKKMCVYQYYYGDTASTAGVLANETFTFGAASSTKV 193

Query: 263 ---GIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEF- 318
               I+ GCG  N G    S+G++G G G LSL  Q+  +  +YCL    SP    L F 
Sbjct: 194 RAANISFGCGSLNAGELANSSGMVGFGRGPLSLVSQLGPSRFSYCLTSYLSPTPSRLYFG 253

Query: 319 --------NSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDG 370
                   N++ G    + P + N  +   Y++ + G S+G + + I P +F +++ G G
Sbjct: 254 VFANLNSTNTSSGSPVQSTPFVINPALPNMYFLSVKGISLGTKRLPIDPLVFAINDDGTG 313

Query: 371 GIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLR--SVRVPTV 428
           G+I+D GT+IT LQ  AY ++R                   DTC+ +      +V VP  
Sbjct: 314 GVIIDSGTSITWLQQDAYEAVRRGLASTIPLPAMNDTDIGLDTCFQWPPPPNVTVTVPDF 373

Query: 429 SLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRV 488
             HF  G  + LP +NY++   + G  C A APTS   +IIGN QQQ   + +D+AN+ +
Sbjct: 374 VFHFD-GANMTLPPENYMLIASTTGYLCLAMAPTSVG-TIIGNYQQQNLHLLYDIANSFL 431

Query: 489 GFTPNKC 495
            F P  C
Sbjct: 432 SFVPAPC 438


>gi|168059885|ref|XP_001781930.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162666576|gb|EDQ53226.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 355

 Score =  236 bits (602), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 135/359 (37%), Positives = 200/359 (55%), Gaps = 28/359 (7%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAA 216
           GEY + + +GTP R FS+++DTGSD+ W+QC PC  CY Q+D +F P TS+S++ L C  
Sbjct: 1   GEYLATVRLGTPERVFSVIVDTGSDLTWVQCSPCGTCYSQNDSLFIPNTSTSFTKLACGT 60

Query: 217 PQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSF----GNSGSVKGIALGCGHDN 272
             C  L    C    C+Y  +YGDGS + GD V +T++     G    V   A GCGHDN
Sbjct: 61  ELCNGLPYPMCNQTTCVYWYSYGDGSLSTGDFVYDTITMDGINGQKQQVPNFAFGCGHDN 120

Query: 273 EGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASGVLEFNSARGGDAV-- 327
           EG F G+ G+LGLG G LS   Q+K       +YCLVD  +P +   + +    GDA   
Sbjct: 121 EGSFAGADGILGLGQGPLSFPSQLKTVFNGKFSYCLVDWLAPPT---QTSPLLFGDAAVP 177

Query: 328 TAP------LIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAIT 381
           T P      L+ N KV T+YYV L G SVGG+ + I  + F++D  G  G I D GT +T
Sbjct: 178 TFPGVKYISLLTNPKVPTYYYVKLNGISVGGKLLNISSTAFDIDSVGRAGTIFDSGTTVT 237

Query: 382 RLQTQAYN----SLRDSFVRLAGNLKPTSGVALFDTCY-DFSGLRSVRVPTVSLHFGAGK 436
           +L  + +     ++  S +        +SG+   D C   F+  +   VP+++ HF  G 
Sbjct: 238 QLAGEVHQEVLAAMNASTMDYPRKSDDSSGL---DLCLGGFAEGQLPTVPSMTFHFEGGD 294

Query: 437 ALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            ++LP  NY I ++S+ ++CF+   +S  ++IIG++QQQ  +V +D    ++GF P  C
Sbjct: 295 -MELPPSNYFIFLESSQSYCFSMV-SSPDVTIIGSIQQQNFQVYYDTVGRKIGFVPKSC 351


>gi|86438622|emb|CAJ26370.1| chloroplast nucleoid binding protein [Brachypodium sylvaticum]
          Length = 443

 Score =  236 bits (602), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 148/420 (35%), Positives = 215/420 (51%), Gaps = 52/420 (12%)

Query: 106 LERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGV 165
           LE D ARV+++             H +   E  ++ +D S P   G S G+G Y   +G+
Sbjct: 45  LEHDQARVDSI-------------HRMIANETAVVGQDVSLPAERGISVGTGNYVVSVGL 91

Query: 166 GTPPRQFSMVLDTGSDINWLQCRPCTE--CYQQSDPIFDPKTSSSYSPLPCAAPQCKSLD 223
           GTP R  ++V DTGSD++W+QC PC+   CY Q DP+F P +SS++S + C  P+C    
Sbjct: 92  GTPARDLTVVFDTGSDLSWVQCGPCSSGGCYHQQDPLFAPSSSSTFSAVRCGEPECPRAR 151

Query: 224 VSACRA---NRCLYQVAYGDGSFTVGDLVTETVSFG----------NSGSVKGIALGCGH 270
            S   +   +RC Y+V YGD S TVG L  +T++ G          NS  + G   GCG 
Sbjct: 152 QSCSSSPGDDRCPYEVVYGDKSRTVGHLGNDTLTLGTTPSTNASENNSNKLPGFVFGCGE 211

Query: 271 DNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASGVLEFNSARGGDAV 327
           +N GLF  + GL GLG G +SL+ Q         +YCL    S A G L   +     A 
Sbjct: 212 NNTGLFGKADGLFGLGRGKVSLSSQAAGKYGEGFSYCLPSSSSNAHGYLSLGTPAPAPAH 271

Query: 328 T--APLIRNKKVDTFYYVGLTGFSVGGQAVQIP--PSLFEMDEAGDGGIIVDCGTAITRL 383
               P++      +FYYV L G  V G+A+++   P+L+        G+IVD GT ITRL
Sbjct: 272 ARFTPMLNRSNTPSFYYVKLVGIRVAGRAIKVSSRPALWPA------GLIVDSGTVITRL 325

Query: 384 QTQAYNSLRDSFVRLAGN--LKPTSGVALFDTCYDFSGL--RSVRVPTVSLHFGAGK--A 437
             +AY++LR +F+   G    K    +++ DTCYDF+     +V +P V+L F  G   +
Sbjct: 326 APRAYSALRTAFLSAMGKYGYKRAPRLSILDTCYDFTAHANATVSIPAVALVFAGGATIS 385

Query: 438 LDLPAKNYLIPVDSAGTFCFAFAPTSSALS--IIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           +D     Y+  V  A   C AFAP  +  S  I+GN QQ+   V +D+   ++GF    C
Sbjct: 386 VDFSGVLYVAKVAQA---CLAFAPNGNGRSAGILGNTQQRTVAVVYDVGRQKIGFAAKGC 442


>gi|413943688|gb|AFW76337.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
 gi|413943689|gb|AFW76338.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
          Length = 499

 Score =  236 bits (602), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 141/360 (39%), Positives = 198/360 (55%), Gaps = 20/360 (5%)

Query: 147 PVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTE-CYQQSDPIFDPKT 205
           P   G + G+G Y   + +GTP  +F++V DTGSD  W+QC+PC   CY+Q +P+FDP  
Sbjct: 149 PASYGVALGTGNYVVPVRLGTPAERFTVVFDTGSDTTWVQCQPCVAYCYRQKEPLFDPTK 208

Query: 206 SSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIA 265
           S++Y+ + C++  C  L VS C    CLY + YGDGS+T+G    +T++     ++K   
Sbjct: 209 SATYANISCSSSYCSDLYVSGCSGGHCLYGIQYGDGSYTIGFYAQDTLTLAYD-TIKNFR 267

Query: 266 LGCGHDNEGLFVGSAGLLGLGGGMLSLTKQI---KATSLAYCLVDRDSPASGVLEFN-SA 321
            GCG  N GLF  +AGLLGLG G  SL  Q         AYCL    S  +G L+    A
Sbjct: 268 FGCGEKNRGLFGRAAGLLGLGRGKTSLPVQAYDKYGGVFAYCL-PATSAGTGFLDLGPGA 326

Query: 322 RGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAIT 381
              +A   P++ ++   TFYYVG+TG  VGG  + IP S+F        G +VD GT IT
Sbjct: 327 PAANARLTPMLVDRG-PTFYYVGMTGIKVGGHVLPIPGSVFST-----AGTLVDSGTVIT 380

Query: 382 RLQTQAYNSLRDSFVRLAGNLKPTSGVA--LFDTCYDFSGLR--SVRVPTVSLHFGAGKA 437
           RL   AY  LR +F +    L  ++  A  + DTCYD +G +  S+ +P VSL F  G  
Sbjct: 381 RLPPSAYAPLRSAFSKAMQGLGYSAAPAFSILDTCYDLTGHKGGSIALPAVSLVFQGGAC 440

Query: 438 LDLPAKNYLIPVDSAGTFCFAFAPTS--SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           LD+ A   L   D +   C AFAP +  + ++I+GN QQ+   V +D+    VGF P  C
Sbjct: 441 LDVDASGILYVADVS-QACLAFAPNADDTDVAIVGNTQQKTHGVLYDIGKKIVGFAPGAC 499


>gi|359485189|ref|XP_002279141.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 546

 Score =  236 bits (602), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 152/438 (34%), Positives = 226/438 (51%), Gaps = 42/438 (9%)

Query: 99  RSLVLSRLERDSARVNTLITKL-----QLAIYNVDRHELKP----------AEAQILPED 143
            S+ +S++ +D AR+ TL  ++     Q  +  + + + KP          + A +    
Sbjct: 107 ESVGVSKM-KDLARIQTLYKRMTEKKNQNTVSRLKKQQSKPQVAPPAAAPESSASVFSGQ 165

Query: 144 FSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDP 203
               + SG S GSGEYF  + VGTPP+ FS++LDTGSD+NW+QC PC EC++Q+ P +DP
Sbjct: 166 LIATLESGVSLGSGEYFIDVFVGTPPKHFSLILDTGSDLNWIQCVPCYECFEQNGPHYDP 225

Query: 204 KTSSSYSPLPCAAPQCKSLDV----SACRANR--CLYQVAYGDGSFTVGDLVTETVSFG- 256
             SSSY  + C   +C  +        C+A    C Y   YGD S T GD   ET +   
Sbjct: 226 GQSSSYRNIGCHDSRCHLVSSPDPPQPCKAENQTCPYYYWYGDSSNTTGDFALETFTVNL 285

Query: 257 --NSGS-----VKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKA---TSLAYCLV 306
             +SG      V+ +  GCGH N GLF G+AGLLGLG G LS + Q+++    S +YCLV
Sbjct: 286 TMSSGKPELRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLV 345

Query: 307 DRDSPASGVLEFNSARGGDAVTAPLI--------RNKKVDTFYYVGLTGFSVGGQAVQIP 358
           DR+S A+   +       D ++ P +        +   VDTFYYV +    VGG+ V IP
Sbjct: 346 DRNSDANVSSKLIFGEDKDLLSHPELNFTTLVAGKENPVDTFYYVQIKSIVVGGEVVNIP 405

Query: 359 PSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFS 418
              +++   G GG I+D GT ++     AY  ++++F+             + + CY+ +
Sbjct: 406 EEKWQIATDGSGGTIIDSGTTLSYFAEPAYQVIKEAFMAKVKGYPVVKDFPVLEPCYNVT 465

Query: 419 GLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPT-SSALSIIGNVQQQGT 477
           G+    +P   + F  G   + P +NY I ++     C A   T  SALSIIGN QQQ  
Sbjct: 466 GVEQPDLPDFGIVFSDGAVWNFPVENYFIEIEPREVVCLAILGTPPSALSIIGNYQQQNF 525

Query: 478 RVSFDLANNRVGFTPNKC 495
            + +D   +R+GF P KC
Sbjct: 526 HILYDTKKSRLGFAPTKC 543


>gi|358346726|ref|XP_003637416.1| Aspartic proteinase nepenthesin-2, partial [Medicago truncatula]
 gi|355503351|gb|AES84554.1| Aspartic proteinase nepenthesin-2, partial [Medicago truncatula]
          Length = 165

 Score =  236 bits (601), Expect = 3e-59,   Method: Compositional matrix adjust.
 Identities = 111/165 (67%), Positives = 134/165 (81%)

Query: 331 LIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNS 390
           L RN ++DT+YYVGL G SVGG+ + IP + FE+D AG+GGIIVD GTA+TRLQ+  YN 
Sbjct: 1   LRRNPQLDTYYYVGLVGISVGGELLAIPETSFEVDSAGNGGIIVDSGTAVTRLQSDVYNV 60

Query: 391 LRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVD 450
           +RD+FV+   +L  T+ V+LFDTCYD S   SV VPTV+ HFG GK L LPAKNYL+PVD
Sbjct: 61  VRDAFVKGTKDLLATNEVSLFDTCYDLSSKTSVEVPTVAFHFGEGKVLVLPAKNYLVPVD 120

Query: 451 SAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           S GTFCFAFAPT S+LSIIGN+QQQGTRVSFDLAN+ VGF+PN+C
Sbjct: 121 SVGTFCFAFAPTMSSLSIIGNIQQQGTRVSFDLANSLVGFSPNRC 165


>gi|242062704|ref|XP_002452641.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
 gi|241932472|gb|EES05617.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
          Length = 466

 Score =  235 bits (599), Expect = 4e-59,   Method: Compositional matrix adjust.
 Identities = 156/465 (33%), Positives = 237/465 (50%), Gaps = 36/465 (7%)

Query: 42  SALQQTEHILSFEPETLEPFAEESETAAESFPLNSSSSFSLPLHSRE-ILHKTRHNDYRS 100
           +A  +T  +LS    +L+  A  SE  A   P ++S   ++PLH R         N   +
Sbjct: 27  AADHRTHKVLSVG--SLKSAATCSEPKAT--PPSTSGGITVPLHHRHGPCSPVPSNKMPA 82

Query: 101 LVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYF 160
            +  RL+RD  R   +  K   A       +++ ++A  +P         G S  + EY 
Sbjct: 83  SLEERLQRDQLRAAYIKRKFSGA----KGGDVEQSDAATVPTTL------GTSLSTLEYV 132

Query: 161 SRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCK 220
             +G+G+P    +M +DTGSD++W+QC+PC++C+ + D +FDP  SS+YSP  C++  C 
Sbjct: 133 ITVGIGSPAVTQTMSMDTGSDVSWVQCKPCSQCHSEVDSLFDPSASSTYSPFSCSSAACV 192

Query: 221 SLDVS----ACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLF 276
            L  S     C +++C Y V+Y DGS T G   ++T++ G S ++KG   GC     G F
Sbjct: 193 QLSQSQQGNGCSSSQCQYIVSYVDGSSTTGTYSSDTLTLG-SNAIKGFQFGCSQSESGGF 251

Query: 277 VGSA-GLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASGVLEFNSARGGDAVTAPLI 332
                GL+GLGG   SL  Q   T   + +YCL      +SG L   +A     V  P++
Sbjct: 252 SDQTDGLMGLGGDAQSLVSQTAGTFGKAFSYCLPPTPG-SSGFLTLGAASRSGFVKTPML 310

Query: 333 RNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLR 392
           R+ ++ T+Y V L    VGGQ + IP S+F        G ++D GT ITRL   AY++L 
Sbjct: 311 RSTQIPTYYGVLLEAIRVGGQQLNIPTSVFS------AGSVMDSGTVITRLPPTAYSALS 364

Query: 393 DSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSA 452
            +F        P     + DTC+DFSG  SV +P+V+L F  G  ++L     ++ +D+ 
Sbjct: 365 SAFKAGMKKYPPAQPSGILDTCFDFSGQSSVSIPSVALVFSGGAVVNLDFNGIMLELDN- 423

Query: 453 GTFCFAFAPTS--SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
             +C AFA  S  S+L  IGNVQQ+   V +D+    VGF    C
Sbjct: 424 --WCLAFAANSDDSSLGFIGNVQQRTFEVLYDVGGGAVGFRAGAC 466


>gi|413918484|gb|AFW58416.1| hypothetical protein ZEAMMB73_998053 [Zea mays]
          Length = 475

 Score =  234 bits (598), Expect = 5e-59,   Method: Compositional matrix adjust.
 Identities = 153/431 (35%), Positives = 213/431 (49%), Gaps = 29/431 (6%)

Query: 88  EILHKTRHNDYRSL-VLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFST 146
            + H   H +Y  L +L R  R S    + +        +    +   A      +D   
Sbjct: 48  RLTHVDAHGNYSRLQLLQRAARRSHHRMSRLVARATGAASTSSSKAAAAGDGSGGKDLQV 107

Query: 147 PVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTS 206
           PV      G+GE+   + VGTP   ++ ++DTGSD+ W QC+PC EC+ Q+ P+FDP  S
Sbjct: 108 PV----HAGNGEFLMDLSVGTPALPYAAIVDTGSDLVWTQCKPCVECFNQTTPVFDPAAS 163

Query: 207 SSYSPLPCAAPQCKSLDVSACRANRCL--------YQVAYGDGSFTVGDLVTETVSFGNS 258
           S+Y+ LPC++  C  L  S C ++           Y   YGD S T G L TET +    
Sbjct: 164 STYAALPCSSALCADLPTSTCASSSSSSSASSPCGYTYTYGDASSTQGVLATETFTLARQ 223

Query: 259 GSVKGIALGCGHDNEGL-FVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPA----- 312
             V G+A GCG  NEG  F   AGL+GLG G LSL  Q+     +YCL   D  A     
Sbjct: 224 -KVPGVAFGCGDTNEGDGFTQGAGLVGLGRGPLSLVSQLGIDRFSYCLTSLDDAAGRSPL 282

Query: 313 ---SGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGD 369
              S      SA    A T PL++N    +FYYV LTG +VG   + +P S F + + G 
Sbjct: 283 LLGSAAGISASAATAPAQTTPLVKNPSQPSFYYVSLTGLTVGSTRLALPSSAFAIQDDGT 342

Query: 370 GGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYD-----FSGLRSVR 424
           GG+IVD GT+IT L+ +AY +LR +FV         +     D C+            V+
Sbjct: 343 GGVIVDSGTSITYLELRAYRALRKAFVAHMSLPTVDASEIGLDLCFQGPAGAVDQDVQVQ 402

Query: 425 VPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLA 484
           VP + LHF  G  LDLPA+NY++   ++G  C      S  LSIIGN QQQ  +  +D+A
Sbjct: 403 VPKLVLHFDGGADLDLPAENYMVLDSASGALCLTVM-ASRGLSIIGNFQQQNFQFVYDVA 461

Query: 485 NNRVGFTPNKC 495
            + + F P +C
Sbjct: 462 GDTLSFAPAEC 472


>gi|357124468|ref|XP_003563922.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 450

 Score =  234 bits (598), Expect = 6e-59,   Method: Compositional matrix adjust.
 Identities = 150/404 (37%), Positives = 213/404 (52%), Gaps = 36/404 (8%)

Query: 106 LERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGV 165
           +  D+AR+  L ++L       D+  +  +         S P+ SGAS G G Y +R+G+
Sbjct: 68  ITHDAARIAGLASRLA----TKDKDWVAAS---------SVPLASGASVGVGNYITRLGL 114

Query: 166 GTPPRQFSMVLDTGSDINWLQCRPC-TECYQQSDPIFDPKTSSSYSPLPCAAPQCK---- 220
           GTP   + MV+D+GS + WLQC PC   C+ Q+ P++DP+ SS+Y+ +PC+APQC     
Sbjct: 115 GTPTTTYVMVVDSGSSLTWLQCAPCAVSCHPQAGPLYDPRASSTYAAVPCSAPQCAELQA 174

Query: 221 -SLDVSACRANR-CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVG 278
            +L+ S+C  +  C YQ +YGDGSF+ G L  +TVS  +SGS  G   GCG DN GLF  
Sbjct: 175 ATLNPSSCSGSGVCQYQASYGDGSFSFGYLSKDTVSLSSSGSFPGFYYGCGQDNVGLFGR 234

Query: 279 SAGLLGLGGGMLSLTKQIK---ATSLAYCLVDRDSPASGVLEFNS----ARGGDAVTAPL 331
           +AGL+GL    LSL  Q+      S AYCL    + ++G L F S       G      +
Sbjct: 235 AAGLIGLARNKLSLLSQLAPSVGNSFAYCLPTSAAASAGYLSFGSNSDNKNPGKYSYTSM 294

Query: 332 IRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSL 391
           + +    + Y+V L G SV G  + +P S     E G    I+D GT ITRL T  Y +L
Sbjct: 295 VSSSLDASLYFVSLAGMSVAGSPLAVPSS-----EYGSLPTIIDSGTVITRLPTPVYTAL 349

Query: 392 RDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDS 451
             + V  A         ++  TC+    +  + VP V++ F  G  L L   N L+ V+ 
Sbjct: 350 SKA-VGAALAAPSAPAYSILQTCFK-GQVAKLPVPAVNMAFAGGATLRLTPGNVLVDVNE 407

Query: 452 AGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
             T C AFAPT S  +IIGN QQQ   V +D+  +R+GF    C
Sbjct: 408 T-TTCLAFAPTDST-AIIGNTQQQTFSVVYDVKGSRIGFAAGGC 449


>gi|24430421|dbj|BAC22609.1| 41 kD chloroplast nucleoid DNA binding protein (CND41) [Nicotiana
           sylvestris]
          Length = 502

 Score =  234 bits (598), Expect = 6e-59,   Method: Compositional matrix adjust.
 Identities = 138/412 (33%), Positives = 207/412 (50%), Gaps = 30/412 (7%)

Query: 106 LERDSARVNTL---ITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSR 162
           L  D ARV+++   +T     ++     +    +  +     + P  SG   G+G Y   
Sbjct: 98  LAHDQARVDSIQARVTDQSYDLFKKKDKKSSNKKKSVKDSKANLPAQSGLPLGTGNYIVN 157

Query: 163 IGVGTPPRQFSMVLDTGSDINWLQCRPCTE-CYQQSDPIFDPKTSSSYSPLPCAAPQCKS 221
           +G+GTP +  S++ DTGSD+ W QC+PC + CY Q  PIFDP  S +YS + C +  C  
Sbjct: 158 VGLGTPKKDLSLIFDTGSDLTWTQCQPCVKSCYAQQQPIFDPSASKTYSNISCTSTACSG 217

Query: 222 LDVS-----ACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLF 276
           L  +      C ++ C+Y + YGD SFTVG    +T++   +    G   GCG +N GLF
Sbjct: 218 LKSATGNSPGCSSSNCVYGIQYGDSSFTVGFFAKDTLTLTQNDVFDGFMFGCGQNNRGLF 277

Query: 277 VGSAGLLGLGGGMLSLTKQIK---ATSLAYCL-VDRDSPASGVLEFNSARG-------GD 325
             +AGL+GLG   LS+ +Q         +YCL   R S  +G L F +  G        +
Sbjct: 278 GKTAGLIGLGRDPLSIVQQTAQKFGKYFSYCLPTSRGS--NGHLTFGNGNGVKTSKAVKN 335

Query: 326 AVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQT 385
            +T     + +  TFY++ + G SVGG+A+ I P LF+     + G I+D GT ITRL +
Sbjct: 336 GITFTPFASSQGATFYFIDVLGISVGGKALSISPMLFQ-----NAGTIIDSGTVITRLPS 390

Query: 386 QAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNY 445
             Y SL+ +F +          ++L DTCYD S   S+ +P +S +F     +DL     
Sbjct: 391 TVYGSLKSTFKQFMSKYPTAPALSLLDTCYDLSNYTSISIPKISFNFNGNANVDLEPNGI 450

Query: 446 LIPVDSAGTFCFAFAPTS--SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           LI  + A   C AFA       + I GN+QQQ   V +D+A  ++GF    C
Sbjct: 451 LI-TNGASQVCLAFAGNGDDDTIGIFGNIQQQTLEVVYDVAGGQLGFGYKGC 501


>gi|45735840|dbj|BAD12875.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
 gi|45735966|dbj|BAD12995.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
 gi|125583491|gb|EAZ24422.1| hypothetical protein OsJ_08175 [Oryza sativa Japonica Group]
          Length = 475

 Score =  234 bits (597), Expect = 8e-59,   Method: Compositional matrix adjust.
 Identities = 153/408 (37%), Positives = 213/408 (52%), Gaps = 39/408 (9%)

Query: 103 LSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSR 162
           L  L  D  R   +  ++  A       +L  ++A  +P +       G S G+ +Y   
Sbjct: 92  LDTLRADQRRAEYIQRRVSGAAAAAPGMQLAGSKAATVPANL------GFSIGTLQYVVT 145

Query: 163 IGVGTPPRQFSMVLDTGSDINWLQCRPCTE--CYQQSDPIFDPKTSSSYSPLPCAAPQCK 220
           + +GTP    ++ +DTGSD++W+QC+PC    CY Q DP+FDP  SSSYS +PCAA  C 
Sbjct: 146 VSLGTPAVAQTLEVDTGSDVSWVQCKPCPSPPCYSQRDPLFDPTRSSSYSAVPCAAASCS 205

Query: 221 SLDV--SACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVG 278
            L +  + C   +C Y V+YGDGS T G   ++T++   S ++KG   GCGH  +GLF G
Sbjct: 206 QLALYSNGCSGGQCGYVVSYGDGSTTTGVYSSDTLTLTGSNALKGFLFGCGHAQQGLFAG 265

Query: 279 SAGLLGLGGGMLSLTKQIKATS---LAYCLVDRDSPASGVLEFNSARGGDAV----TAPL 331
             GLLGLG    SL  Q  +T     +YCL     P    + + S  G  +     T PL
Sbjct: 266 VDGLLGLGRQGQSLVSQASSTYGGVFSYCL----PPTQNSVGYISLGGPSSTAGFSTTPL 321

Query: 332 IRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSL 391
           +      T+Y V L G SVGGQ + I  S+F        G +VD GT +TRL   AY++L
Sbjct: 322 LTASNDPTYYIVMLAGISVGGQPLSIDASVFAS------GAVVDTGTVVTRLPPTAYSAL 375

Query: 392 RDSF-VRLAGNLKPTS-GVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPV 449
           R +F   +A    P++    + DTCYDF+   +V +PT+S+ FG G A+DL     L   
Sbjct: 376 RSAFRAAMAPYGYPSAPATGILDTCYDFTRYGTVTLPTISIAFGGGAAMDLGTSGILT-- 433

Query: 450 DSAGTFCFAFAPTS--SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
               + C AFAPT   S  SI+GNVQQ+   V FD   + VGF P  C
Sbjct: 434 ----SGCLAFAPTGGDSQASILGNVQQRSFEVRFD--GSTVGFMPASC 475


>gi|2541876|dbj|BAA22813.1| CND41, chloroplast nucleoid DNA binding protein [Nicotiana tabacum]
          Length = 502

 Score =  234 bits (597), Expect = 8e-59,   Method: Compositional matrix adjust.
 Identities = 137/412 (33%), Positives = 210/412 (50%), Gaps = 30/412 (7%)

Query: 106 LERDSARVNTL---ITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSR 162
           L  D ARV+++   IT     ++     +    +  +     + P  SG   G+G Y   
Sbjct: 98  LAHDQARVDSIQARITDQSYDLFKKKDKKSSNKKKSVKDSKANLPAQSGLPLGTGNYIVN 157

Query: 163 IGVGTPPRQFSMVLDTGSDINWLQCRPCTE-CYQQSDPIFDPKTSSSYSPLPCAAPQCKS 221
           +G+GTP +  S++ DTGSD+ W QC+PC + CY Q  PIFDP TS +YS + C +  C S
Sbjct: 158 VGLGTPKKDLSLIFDTGSDLTWTQCQPCVKSCYAQQQPIFDPSTSKTYSNISCTSAACSS 217

Query: 222 LDVS-----ACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLF 276
           L  +      C ++ C+Y + YGD SFT+G    + ++   +    G   GCG +N+GLF
Sbjct: 218 LKSATGNSPGCSSSNCVYGIQYGDSSFTIGFFAKDKLTLTQNDVFDGFMFGCGQNNKGLF 277

Query: 277 VGSAGLLGLGGGMLSLTKQIK---ATSLAYCL-VDRDSPASGVLEFNSARG-------GD 325
             +AGL+GLG   LS+ +Q         +YCL   R S  +G L F +  G        +
Sbjct: 278 GKTAGLIGLGRDPLSIVQQTAQKFGKYFSYCLPTSRGS--NGHLTFGNGNGVKASKAVKN 335

Query: 326 AVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQT 385
            +T     + +   +Y++ + G SVGG+A+ I P LF+     + G I+D GT ITRL +
Sbjct: 336 GITFTPFASSQGTAYYFIDVLGISVGGKALSISPMLFQ-----NAGTIIDSGTVITRLPS 390

Query: 386 QAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNY 445
            AY SL+ +F +          ++L DTCYD S   S+ +P +S +F     ++L     
Sbjct: 391 TAYGSLKSAFKQFMSKYPTAPALSLLDTCYDLSNYTSISIPKISFNFNGNANVELDPNGI 450

Query: 446 LIPVDSAGTFCFAFAPTS--SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           LI  + A   C AFA      ++ I GN+QQQ   V +D+A  ++GF    C
Sbjct: 451 LI-TNGASQVCLAFAGNGDDDSIGIFGNIQQQTLEVVYDVAGGQLGFGYKGC 501


>gi|302806531|ref|XP_002985015.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
 gi|300147225|gb|EFJ13890.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
          Length = 533

 Score =  233 bits (595), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 167/468 (35%), Positives = 240/468 (51%), Gaps = 42/468 (8%)

Query: 66  ETAAESFPLNSSSSFSLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQ---- 121
           +T +    L  S   SL +  +   H     + RSL+L  L+RD  R+ +   ++     
Sbjct: 67  QTPSRRVLLEESMKTSLKMELKHRDHGQPTRNRRSLLLESLKRDITRLQSFQKRVSEKLT 126

Query: 122 --------LAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFS 173
                   L + N    +  P+ +    E  ST V SGA  G+GEYF  + VG PPR F 
Sbjct: 127 ASANPEAYLEMTNSSSTKSPPSPSSSWEEVDST-VESGAELGAGEYFMDVFVGNPPRHFL 185

Query: 174 MVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANR-- 231
           +++DTGSD+ WLQC+PC  C+ QS P+FDP  S+S+  +PC A  C  +    CR N   
Sbjct: 186 LIIDTGSDLTWLQCKPCKACFDQSGPVFDPSQSTSFKIIPCNAAACDLVVHDECRDNSSK 245

Query: 232 -----CLYQVAYGDGSFTVGDLVTETVSFG-----NSGSVKGIALGCGHDNEGLFVGSAG 281
                C Y   YGD S T GDL  E++S       +S  ++ + +GCGH N+GLF G+ G
Sbjct: 246 TSPKTCKYFYWYGDSSRTSGDLALESLSVSLSDHPSSLEIRDMVIGCGHSNKGLFQGAGG 305

Query: 282 LLGLGGGMLSLTKQIKAT----SLAYCLVDRDS--PASGVLEFNS----ARGGDAVT-AP 330
           LLGLG G LS   Q++++    S +YCLVDR +    S  + F +    +R  D +   P
Sbjct: 306 LLGLGQGALSFPSQLRSSPIGQSFSYCLVDRTNNLSVSSAISFGAGFALSRHFDQMRFTP 365

Query: 331 LIR-NKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYN 389
            +R N  V+TFYY+G+ G  +  + + IP   F +   G GG I+D GT +T L   AY 
Sbjct: 366 FVRTNNSVETFYYLGIQGIKIDQELLPIPAERFAIAPNGSGGTIIDSGTTLTYLNRDAYR 425

Query: 390 SLRDSFVRLAGNLKPTSG-VALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLI- 447
           ++  +F  LA    P +    +   CY+ +G  +V  PT+S+ F  G  LDLP +NY I 
Sbjct: 426 AVESAF--LARISYPRADPFDILGICYNATGRTAVPFPTLSIVFQNGAELDLPQENYFIQ 483

Query: 448 PVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           P       C A  PT   +SIIGN QQQ     +D+ + R+GF    C
Sbjct: 484 PDPQEAKHCLAILPT-DGMSIIGNFQQQNIHFLYDVQHARLGFANTDC 530


>gi|145693992|gb|ABP93696.1| unknown protein isoform 1 [Lemna minor]
          Length = 350

 Score =  233 bits (595), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 137/359 (38%), Positives = 192/359 (53%), Gaps = 17/359 (4%)

Query: 144 FSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCT-ECYQQSDPIFD 202
            S P   G   G+  Y   +G GTP +  +++ DTGS++NW+QC+PC   CY Q +P+FD
Sbjct: 1   ISIPARIGLYIGTANYVITVGFGTPKKNQTVIFDTGSNVNWIQCKPCVVSCYPQQEPLFD 60

Query: 203 PKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVK 262
           P  SS+Y  + C +  C  L    C  + C+Y V YGDGS TVG L TET +        
Sbjct: 61  PTLSSTYRNISCTSAACTGLSSRGCSGSTCVYGVTYGDGSSTVGFLATETFTLAAGNVFN 120

Query: 263 GIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSL----AYCLVDRDSPASGVLEF 318
               GCG +N+GLF G+AGL+GLG    SL  Q+ ATSL    +YCL    S A+G L  
Sbjct: 121 NFIFGCGQNNQGLFTGAAGLIGLGRSPYSLNSQL-ATSLGNIFSYCL-PSTSSATGYLNI 178

Query: 319 NSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGT 378
            +       TA ++ N +  T Y++ L G SVGG  + +  ++F+       G I+D GT
Sbjct: 179 GNPLRTPGYTA-MLTNSRAPTLYFIDLIGISVGGTRLALSSTVFQ-----SVGTIIDSGT 232

Query: 379 AITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKAL 438
            ITRL   AY +LR +F          +  ++ DTCYDFS   +V  PT+ LH+  G  +
Sbjct: 233 VITRLPPTAYGALRTAFRAAMTQYTRAAAASILDTCYDFSRTTTVTFPTIKLHY-TGLDV 291

Query: 439 DLPAKNYLIPVDSAGTFCFAFAPT--SSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            +P    +  V S+   C AFA    S+ + IIGNVQQ+   V++D A  R+GF    C
Sbjct: 292 TIPGAG-VFYVISSSQVCLAFAGNSDSTQIGIIGNVQQRTMEVTYDNALKRIGFAAGAC 349


>gi|356532386|ref|XP_003534754.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 463

 Score =  233 bits (593), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 136/375 (36%), Positives = 211/375 (56%), Gaps = 35/375 (9%)

Query: 145 STPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCT-ECYQQSDPIFDP 203
           +TP+ SG S GSG Y+ +IG+GTP + FSM++DTGS ++WLQC+PC   C+ Q DPIF P
Sbjct: 99  TTPLKSGLSIGSGNYYVKIGLGTPAKYFSMIVDTGSSLSWLQCQPCVIYCHVQVDPIFTP 158

Query: 204 KTSSSYSPLPC-----AAPQCKSLDVSACR--ANRCLYQVAYGDGSFTVGDLVTETVSFG 256
            TS +Y  LPC     ++ +  +L+   C      C+Y+ +YGD SF++G L  + ++  
Sbjct: 159 STSKTYKALPCSSSQCSSLKSSTLNAPGCSNATGACVYKASYGDTSFSIGYLSQDVLTLT 218

Query: 257 NSGS-VKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIK---ATSLAYCL-----VD 307
            S +   G   GCG DN+GLF  S+G++GL    +S+  Q+      + +YCL       
Sbjct: 219 PSEAPSSGFVYGCGQDNQGLFGRSSGIIGLANDKISMLGQLSKKYGNAFSYCLPSSFSAP 278

Query: 308 RDSPASGVLEFNSARGGDAVTA------PLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSL 361
             S  SG L      G  ++T+      PL++N+K+ + Y++ LT  +V G+ + +  S 
Sbjct: 279 NSSSLSGFLSI----GASSLTSSPYKFTPLVKNQKIPSLYFLDLTTITVAGKPLGVSASS 334

Query: 362 FEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVR-LAGNLKPTSGVALFDTCYDFSGL 420
           + +        I+D GT ITRL    YN+L+ SFV  ++       G ++ DTC+  S  
Sbjct: 335 YNVPT------IIDSGTVITRLPVAVYNALKKSFVLIMSKKYAQAPGFSILDTCFKGSVK 388

Query: 421 RSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVS 480
               VP + + F  G  L+L A N L+ ++  GT C A A +S+ +SIIGN QQQ  +V+
Sbjct: 389 EMSTVPEIQIIFRGGAGLELKAHNSLVEIEK-GTTCLAIAASSNPISIIGNYQQQTFKVA 447

Query: 481 FDLANNRVGFTPNKC 495
           +D+AN ++GF P  C
Sbjct: 448 YDVANFKIGFAPGGC 462


>gi|125540927|gb|EAY87322.1| hypothetical protein OsI_08726 [Oryza sativa Indica Group]
          Length = 464

 Score =  233 bits (593), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 153/408 (37%), Positives = 213/408 (52%), Gaps = 39/408 (9%)

Query: 103 LSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSR 162
           L  L  D  R   +  ++  A       +L  ++A  +P +       G S G+ +Y   
Sbjct: 81  LDTLRADQRRAEYIQRRVSGAAAAAPGMQLAGSKAATVPANL------GFSIGTLQYVVT 134

Query: 163 IGVGTPPRQFSMVLDTGSDINWLQCRPCTE--CYQQSDPIFDPKTSSSYSPLPCAAPQCK 220
           + +GTP    ++ +DTGSD++W+QC+PC    CY Q DP+FDP  SSSYS +PCAA  C 
Sbjct: 135 VSLGTPAVAQTLEVDTGSDVSWVQCKPCPSPPCYSQRDPLFDPTRSSSYSAVPCAAASCS 194

Query: 221 SLDV--SACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVG 278
            L +  + C   +C Y V+YGDGS T G   ++T++   S ++KG   GCGH  +GLF G
Sbjct: 195 QLALYSNGCSGGQCGYVVSYGDGSTTTGVYSSDTLTLTGSNALKGFLFGCGHAQQGLFAG 254

Query: 279 SAGLLGLGGGMLSLTKQIKATS---LAYCLVDRDSPASGVLEFNSARGGDAV----TAPL 331
             GLLGLG    SL  Q  +T     +YCL     P    + + S  G  +     T PL
Sbjct: 255 VDGLLGLGRQGQSLVSQASSTYGGVFSYCL----PPTQNSVGYISLGGPSSTAGFSTTPL 310

Query: 332 IRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSL 391
           +      T+Y V L G SVGGQ + I  S+F        G +VD GT +TRL   AY++L
Sbjct: 311 LTASNDPTYYIVMLAGISVGGQPLSIDASVFAS------GAVVDTGTVVTRLPPTAYSAL 364

Query: 392 RDSF-VRLAGNLKPTS-GVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPV 449
           R +F   +A    P++    + DTCYDF+   +V +PT+S+ FG G A+DL     L   
Sbjct: 365 RSAFRAAMAPYGYPSAPATGILDTCYDFTRYGTVTLPTISIAFGGGAAMDLGTSGILT-- 422

Query: 450 DSAGTFCFAFAPTS--SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
               + C AFAPT   S  SI+GNVQQ+   V FD   + VGF P  C
Sbjct: 423 ----SGCLAFAPTGGDSQASILGNVQQRSFEVRFD--GSTVGFMPASC 464


>gi|242046218|ref|XP_002460980.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
 gi|241924357|gb|EER97501.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
          Length = 517

 Score =  232 bits (591), Expect = 3e-58,   Method: Compositional matrix adjust.
 Identities = 171/450 (38%), Positives = 227/450 (50%), Gaps = 34/450 (7%)

Query: 76  SSSSFSLPLH-SREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKP 134
           +S S SL LH +R      R    +  VL   ++D+ R+ T+    + A    DR    P
Sbjct: 69  ASLSPSLKLHMNRRAAEGGRTR--KESVLDLADKDAVRIETM--HRRAARSGGDRTPASP 124

Query: 135 AEA--QILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTE 192
           + +  + L E     V SG + GSGEY   + VGTPPR+F M++DTGSD+NWLQC PC +
Sbjct: 125 SSSPRRALSERMVATVESGVAVGSGEYLMDVYVGTPPRRFRMIMDTGSDLNWLQCAPCLD 184

Query: 193 CYQQSDPIFDPKTSSSYSPLPCAAPQCKSL----DVSACR---ANRCLYQVAYGDGSFTV 245
           C+ Q  P+FDP  SSSY  + C   +C  +       ACR    + C Y   YGD S T 
Sbjct: 185 CFDQVGPVFDPAASSSYRNVTCGDQRCGLVAPPEPPRACRRPGEDSCPYYYWYGDQSNTT 244

Query: 246 GDLVTETVSF-----GNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT- 299
           GDL  E+ +      G S  V  +  GCGH N GLF G+AGLLGLG G LS   Q++A  
Sbjct: 245 GDLALESFTVNLTAPGASRRVDDVVFGCGHWNRGLFHGAAGLLGLGRGPLSFASQLRAVY 304

Query: 300 --SLAYCLVDRDSPASGVLEFNSARGGDA--------VTAPLIRNKKVDTFYYVGLTGFS 349
             + +YCLVD  S  +  + F                 TA    +   DTFYYV L G  
Sbjct: 305 GHTFSYCLVDHGSDVASKVVFGEDDALALAAAHPQLNYTAFAPASSPADTFYYVKLKGVL 364

Query: 350 VGGQAVQIPPSLF--EMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKP-TS 406
           VGG+ + I    +     E G GG I+D GT ++     AY  +R +F+   G   P   
Sbjct: 365 VGGELLNISSDTWGVGEGEGGSGGTIIDSGTTLSYFVEPAYQVIRQAFIDRMGRSYPLIP 424

Query: 407 GVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPT-SSA 465
              +   CY+ SG+    VP +SL F  G   D PA+NY I +D  G  C A   T  + 
Sbjct: 425 DFPVLSPCYNVSGVDRPEVPELSLLFADGAVWDFPAENYFIRLDPDGIMCLAVLGTPRTG 484

Query: 466 LSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           +SIIGN QQQ   V +DL NNR+GF P +C
Sbjct: 485 MSIIGNFQQQNFHVVYDLKNNRLGFAPRRC 514


>gi|242092878|ref|XP_002436929.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
 gi|241915152|gb|EER88296.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
          Length = 505

 Score =  231 bits (590), Expect = 5e-58,   Method: Compositional matrix adjust.
 Identities = 139/365 (38%), Positives = 198/365 (54%), Gaps = 20/365 (5%)

Query: 145 STPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCT-ECYQQSDPIFDP 203
           + P  +G S  + E+   +G G+P + +++ +DTGSD++W+QC PC+  CY+Q DP+FDP
Sbjct: 147 TIPDSTGTSLDTLEFVVTVGFGSPAQNYTLSIDTGSDVSWIQCLPCSGHCYKQHDPVFDP 206

Query: 204 KTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKG 263
             S++YS +PC  PQC +       +  CLY+V YGDGS T G L  ET+S  ++  + G
Sbjct: 207 TKSATYSAVPCGHPQCAAAGGKCSNSGTCLYKVTYGDGSSTAGVLSHETLSLSSTRDLPG 266

Query: 264 IALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASGVLEFNS 320
            A GCG  N G F G  GL+GLG G LSL  Q  AT   + +YCL   D+   G L   S
Sbjct: 267 FAFGCGQTNLGEFGGVDGLVGLGRGALSLPSQAAATFGATFSYCLPSYDT-THGYLTMGS 325

Query: 321 ARGG------DAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIV 374
                     D     +I+ +   + Y+V +    +GG  + +PP++F  D     G + 
Sbjct: 326 TTPAASNDDDDVQYTAMIQKEDYPSLYFVEVVSIDIGGYILPVPPTVFTRD-----GTLF 380

Query: 375 DCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGA 434
           D GT +T L  +AY SLRD F       KP      FDTCYDF+G  ++ +P V+  F  
Sbjct: 381 DSGTILTYLPPEAYASLRDRFKFTMTQYKPAPAYDPFDTCYDFTGHNAIFMPAVAFKFSD 440

Query: 435 GKALDL-PAKNYLIPVDSA-GTFCFAFAPTSSAL--SIIGNVQQQGTRVSFDLANNRVGF 490
           G   DL P    + P D+A  T C AF P  S +  +IIGN QQ+GT V +D+A  ++GF
Sbjct: 441 GAVFDLSPVAILIYPDDTAPATGCLAFVPRPSTMPFNIIGNTQQRGTEVIYDVAAEKIGF 500

Query: 491 TPNKC 495
               C
Sbjct: 501 GQFTC 505


>gi|145693994|gb|ABP93697.1| unknown protein isoform 2 [Lemna minor]
          Length = 351

 Score =  231 bits (589), Expect = 6e-58,   Method: Compositional matrix adjust.
 Identities = 139/358 (38%), Positives = 189/358 (52%), Gaps = 16/358 (4%)

Query: 145 STPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPC-TECYQQSDPIFDP 203
           S P   G   GSG Y   +G GTP R  ++V DTGSD+NWLQC+PC   CY Q +P+FDP
Sbjct: 2   SIPARIGLFIGSGNYVITVGFGTPTRTQTVVFDTGSDVNWLQCKPCAVRCYAQQEPLFDP 61

Query: 204 KTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKG 263
             SS+Y  + C  P C  L    C ++ CLY V YGDGS T+G L  +T     +   K 
Sbjct: 62  SLSSTYRNVSCTEPACVGLSTRGCSSSTCLYGVFYGDGSSTIGFLAMDTFMLTPAQKFKN 121

Query: 264 IALGCGHDNEGLFVGSAGLLGLG-GGMLSLTKQIK---ATSLAYCLVDRDSPASGVLEFN 319
              GCG +N GLF G+AGL+GLG     SL  Q+        +YCL    S A+G L   
Sbjct: 122 FIFGCGQNNTGLFQGTAGLVGLGRSSTYSLNSQVAPSLGNVFSYCL-PSTSSATGYLNIG 180

Query: 320 SARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTA 379
           + +     TA ++ + +V T Y++ L G SVGG  + +  ++F+       G I+D GT 
Sbjct: 181 NPQNTPGYTA-MLTDTRVPTLYFIDLIGISVGGTRLSLSSTVFQ-----SVGTIIDSGTV 234

Query: 380 ITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALD 439
           ITRL   AY++L+ +             V + DTCYDFS   SV  P + LHF AG  + 
Sbjct: 235 ITRLPPTAYSALKTAVRAAMTQYTLAPAVTILDTCYDFSRTTSVVYPVIVLHF-AGLDVR 293

Query: 440 LPAKNYLIPVDSAGTFCFAFAPT--SSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           +PA       +S+   C AFA    S+ + IIGNVQQ    V++D    R+GF+   C
Sbjct: 294 IPATGVFFVFNSS-QVCLAFAGNTDSTMIGIIGNVQQLTMEVTYDNELKRIGFSAGAC 350


>gi|302809015|ref|XP_002986201.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
 gi|300146060|gb|EFJ12732.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
          Length = 449

 Score =  231 bits (589), Expect = 6e-58,   Method: Compositional matrix adjust.
 Identities = 161/443 (36%), Positives = 232/443 (52%), Gaps = 42/443 (9%)

Query: 91  HKTRHNDYRSLVLSRLERDSARVNTLITKLQ------------LAIYNVDRHELKPAEAQ 138
           H+   ++ RSL+L  L+RD  R+ +   ++             L + N    +  P+ + 
Sbjct: 8   HRQPTSNRRSLLLESLKRDITRLQSFQKRVSEKLTASANPEAYLEMTNSSSTKSPPSPSS 67

Query: 139 ILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD 198
              E  ST V SGA  G+GEYF  + VG PPR F +++DTGSD+ WLQC+PC  C+ QS 
Sbjct: 68  SWEEVDST-VESGAELGAGEYFMDVFVGNPPRHFLLIIDTGSDLTWLQCKPCKACFDQSG 126

Query: 199 PIFDPKTSSSYSPLPCAAPQCKSLDVSACRANR-------CLYQVAYGDGSFTVGDLVTE 251
           P+FDP  S+S+  +PC A  C  +    CR N        C Y   YGD S T GDL  E
Sbjct: 127 PVFDPSQSTSFKIIPCNAAACDLVVHDECRDNSSKTSPKTCKYFYWYGDSSRTSGDLALE 186

Query: 252 TVSFG-----NSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT----SLA 302
           ++S       +S  ++ + +GCGH N+GLF G+ GLLGLG G LS   Q++++    S +
Sbjct: 187 SLSVSLSDHPSSLEIRDMVIGCGHSNKGLFQGAGGLLGLGQGALSFPSQLRSSPIGQSFS 246

Query: 303 YCLVDRDS--PASGVLEFNS----ARGGDAVT-APLIR-NKKVDTFYYVGLTGFSVGGQA 354
           YCLVDR +    S  + F +    +R  D +   P +R N  V+TFYY+G+ G  +  + 
Sbjct: 247 YCLVDRTNNLSVSSAISFGAGFALSRHFDQMKFTPFVRTNNSVETFYYLGIQGIKIDQEL 306

Query: 355 VQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSG-VALFDT 413
           + IP   F +   G GG I+D GT +T L   AY ++  +F  LA    P +    +   
Sbjct: 307 LPIPAERFAIATNGSGGTIIDSGTTLTYLNRDAYRAVESAF--LARISYPRADPFDILGI 364

Query: 414 CYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLI-PVDSAGTFCFAFAPTSSALSIIGNV 472
           CY+ +G  +V  P +S+ F  G  LDLP +NY I P       C A  PT   +SIIGN 
Sbjct: 365 CYNATGRAAVPFPALSIVFQNGAELDLPQENYFIQPDPQEAKHCLAILPT-DGMSIIGNF 423

Query: 473 QQQGTRVSFDLANNRVGFTPNKC 495
           QQQ     +D+ + R+GF    C
Sbjct: 424 QQQNIHFLYDVQHARLGFANTDC 446


>gi|356557203|ref|XP_003546907.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 470

 Score =  231 bits (589), Expect = 6e-58,   Method: Compositional matrix adjust.
 Identities = 143/412 (34%), Positives = 213/412 (51%), Gaps = 29/412 (7%)

Query: 104 SRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFST--PVVSGASQGSGEYFS 161
           S  ER    V   +    L + ++  H  K   +  + +   T  P+ SG    +  Y  
Sbjct: 65  SESERKGDWVEKQLVLDGLHVRSIQNHIRKRTSSSQIADSSETQVPLTSGIKFQTLNYIV 124

Query: 162 RIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKS 221
            +G+G+  +  S+++DTGSD+ W+QC PC  CY Q+ P+F P TS SY P+ C +  C+S
Sbjct: 125 TMGLGS--QNMSVIVDTGSDLTWVQCEPCRSCYNQNGPLFKPSTSPSYQPILCNSTTCQS 182

Query: 222 LDVSACRAN-----RCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLF 276
           L++ AC ++      C Y V YGDGS+T G+L  E + FG   SV     GCG +N+GLF
Sbjct: 183 LELGACGSDPSTSATCDYVVNYGDGSYTSGELGIEKLGFGGI-SVSNFVFGCGRNNKGLF 241

Query: 277 VGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSP-ASGVLEFNSARGGDAVTAP-- 330
            G++GL+GLG   LS+  Q  AT     +YCL   D   ASG L   +  G      P  
Sbjct: 242 GGASGLMGLGRSELSMISQTNATFGGVFSYCLPSTDQAGASGSLVMGNQSGVFKNVTPIA 301

Query: 331 ---LIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQA 387
              ++ N ++  FY + LTG  VGG ++ +  S F     G+GG+I+D GT I+RL    
Sbjct: 302 YTRMLPNLQLSNFYILNLTGIDVGGVSLHVQASSF-----GNGGVILDSGTVISRLAPSV 356

Query: 388 YNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKN--Y 445
           Y +L+  F+          G ++ DTC++ +G   V +PT+S++F     L++ A    Y
Sbjct: 357 YKALKAKFLEQFSGFPSAPGFSILDTCFNLTGYDQVNIPTISMYFEGNAELNVDATGIFY 416

Query: 446 LIPVDSAGTFCFAFAPTSS--ALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           L+  D A   C A A  S    + IIGN QQ+  RV +D   ++VGF    C
Sbjct: 417 LVKED-ASRVCLALASLSDEYEMGIIGNYQQRNQRVLYDAKLSQVGFAKEPC 467


>gi|224053042|ref|XP_002297678.1| predicted protein [Populus trichocarpa]
 gi|222844936|gb|EEE82483.1| predicted protein [Populus trichocarpa]
          Length = 482

 Score =  231 bits (588), Expect = 9e-58,   Method: Compositional matrix adjust.
 Identities = 134/371 (36%), Positives = 195/371 (52%), Gaps = 28/371 (7%)

Query: 143 DFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFD 202
           D   P+ SG    +  Y   + +G   R+ ++++DTGSD++W+QC+PC  CY Q DP+F+
Sbjct: 119 DAPIPLTSGIRLQTLNYIVTVELGG--RKMTVIVDTGSDLSWVQCQPCKRCYNQQDPVFN 176

Query: 203 PKTSSSYSPLPCAAPQCKSL-----DVSACRAN--RCLYQVAYGDGSFTVGDLVTETVSF 255
           P TS SY  + C++P C+SL     ++  C +N   C Y V YGDGS+T G+L TE +  
Sbjct: 177 PSTSPSYRTVLCSSPTCQSLQSATGNLGVCGSNPPSCNYVVNYGDGSYTRGELGTEHLDL 236

Query: 256 GNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPA 312
           GNS +V     GCG +N+GLF G++GL+GLG   LSL  Q  A      +YCL   ++ A
Sbjct: 237 GNSTAVNNFIFGCGRNNQGLFGGASGLVGLGRSSLSLISQTSAMFGGVFSYCLPITETEA 296

Query: 313 SGVLEFNSARGGDAVTAPLIRNKKVDT----FYYVGLTGFSVGGQAVQIPPSLFEMDEAG 368
           SG L           T P+   + +      FY++ LTG +VG  AVQ P         G
Sbjct: 297 SGSLVMGGNSSVYKNTTPISYTRMIPNPQLPFYFLNLTGITVGSVAVQAP-------SFG 349

Query: 369 DGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTV 428
             G+++D GT ITRL    Y +L+D FV+            + DTC++ SG + V +P +
Sbjct: 350 KDGMMIDSGTVITRLPPSIYQALKDEFVKQFSGFPSAPAFMILDTCFNLSGYQEVEIPNI 409

Query: 429 SLHFGAGKAL--DLPAKNYLIPVDSAGTFCFAFAPTS--SALSIIGNVQQQGTRVSFDLA 484
            +HF     L  D+    Y +  D A   C A A  S  + + IIGN QQ+  RV +D  
Sbjct: 410 KMHFEGNAELNVDVTGVFYFVKTD-ASQVCLAIASLSYENEVGIIGNYQQKNQRVIYDTK 468

Query: 485 NNRVGFTPNKC 495
            + +GF    C
Sbjct: 469 GSMLGFAAEAC 479


>gi|356555807|ref|XP_003546221.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 457

 Score =  230 bits (586), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 136/375 (36%), Positives = 207/375 (55%), Gaps = 27/375 (7%)

Query: 141 PEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCT-ECYQQSDP 199
           P   STP+ SG S GSG Y+ +IGVGTP + FSM++DTGS ++WLQC+PC   C+ Q DP
Sbjct: 89  PSLVSTPLKSGLSIGSGNYYVKIGVGTPAKYFSMIVDTGSSLSWLQCQPCVIYCHVQVDP 148

Query: 200 IFDPKTSSSYSP-----LPCAAPQCKSLDVSACR--ANRCLYQVAYGDGSFTVGDLVTET 252
           IF P  S +Y         C++ +  +L+   C      C+Y+ +YGD SF++G L  + 
Sbjct: 149 IFTPSVSKTYKALSCSSSQCSSLKSSTLNAPGCSNATGACVYKASYGDTSFSIGYLSQDV 208

Query: 253 VSFGNSGS-VKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIK---ATSLAYCL--- 305
           ++   S +   G   GCG DN+GLF  SAG++GL    LS+  Q+      + +YCL   
Sbjct: 209 LTLTPSAAPSSGFVYGCGQDNQGLFGRSAGIIGLANDKLSMLGQLSNKYGNAFSYCLPSS 268

Query: 306 --VDRDSPASGVLEFNSARGGDAVT--APLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSL 361
                +S  SG L   ++    +     PL++N K+ + Y++GLT  +V G+ + +  S 
Sbjct: 269 FSAQPNSSVSGFLSIGASSLSSSPYKFTPLVKNPKIPSLYFLGLTTITVAGKPLGVSASS 328

Query: 362 FEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVR-LAGNLKPTSGVALFDTCYDFSGL 420
           + +        I+D GT ITRL    YN+L+ SFV  ++       G ++ DTC+  S  
Sbjct: 329 YNVPT------IIDSGTVITRLPVAIYNALKKSFVMIMSKKYAQAPGFSILDTCFKGSVK 382

Query: 421 RSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVS 480
               VP + + F  G  L+L   N L+ ++  GT C A A +S+ +SIIGN QQQ   V+
Sbjct: 383 EMSTVPEIRIIFRGGAGLELKVHNSLVEIEK-GTTCLAIAASSNPISIIGNYQQQTFTVA 441

Query: 481 FDLANNRVGFTPNKC 495
           +D+AN+++GF P  C
Sbjct: 442 YDVANSKIGFAPGGC 456


>gi|226503109|ref|NP_001147206.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
 gi|195608496|gb|ACG26078.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
 gi|413921850|gb|AFW61782.1| aspartic proteinase nepenthesin-2 [Zea mays]
          Length = 441

 Score =  229 bits (585), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 141/412 (34%), Positives = 206/412 (50%), Gaps = 24/412 (5%)

Query: 105 RLERDSARVNTLITKLQLAIYNVDRHELKPAEAQ---ILPE--DFSTPVVSGASQGSGEY 159
           +L+       T  TKLQL    + R + + A  Q   +LP   D  T      +  SGEY
Sbjct: 30  QLKLTHVDAGTSYTKLQLLSRAIARSKARVAALQSAAVLPPVVDPITAARVLVTASSGEY 89

Query: 160 FSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQC 219
              + +GTPP  ++ ++DTGSD+ W QC PC  C  Q  P FD K S++Y  LPC + +C
Sbjct: 90  LVDLAIGTPPLYYTAIMDTGSDLIWTQCAPCLLCADQPTPYFDVKKSATYRALPCRSSRC 149

Query: 220 KSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVK----GIALGCGHDNEGL 275
            SL   +C    C+YQ  YGD + T G L  ET +FG + S K     IA GCG  N G 
Sbjct: 150 ASLSSPSCFKKMCVYQYYYGDTASTAGVLANETFTFGAANSTKVRATNIAFGCGSLNAGD 209

Query: 276 FVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEF---------NSARGGDA 326
              S+G++G G G LSL  Q+  +  +YCL    S     L F         N++ G   
Sbjct: 210 LANSSGMVGFGRGPLSLVSQLGPSRFSYCLTSYLSATPSRLYFGVYANLSSTNTSSGSPV 269

Query: 327 VTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQ 386
            + P + N  +   Y++ L   S+G + + I P +F +++ G GG+I+D GT+IT LQ  
Sbjct: 270 QSTPFVINPALPNMYFLSLKAISLGTKLLPIDPLVFAINDDGTGGVIIDSGTSITWLQQD 329

Query: 387 AYNSLRDSFVRLAGNLKPTSGVAL-FDTCYDFSGLR--SVRVPTVSLHFGAGKALDLPAK 443
           AY ++R   V  A  L   +   +  DTC+ +      +V VP +  HF +     LP +
Sbjct: 330 AYEAVRRGLVS-AIPLPAMNDTDIGLDTCFQWPPPPNVTVTVPDLVFHFDSANMTLLP-E 387

Query: 444 NYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           NY++   + G  C   APT    +IIGN QQQ   + +D+ N+ + F P  C
Sbjct: 388 NYMLIASTTGYLCLVMAPTGVG-TIIGNYQQQNLHLLYDIGNSFLSFVPAPC 438


>gi|242095588|ref|XP_002438284.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
 gi|241916507|gb|EER89651.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
          Length = 487

 Score =  229 bits (584), Expect = 3e-57,   Method: Compositional matrix adjust.
 Identities = 158/439 (35%), Positives = 217/439 (49%), Gaps = 66/439 (15%)

Query: 95  HNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQ 154
           H+ Y  +    L RD  RV ++  +L                A+      + P   G + 
Sbjct: 76  HHHYTGI----LRRDRHRVRSIYRRL--------------TAAETTTTTTTIPARLGLAF 117

Query: 155 GSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPC--TECYQQSDPIFDPKTSSSYSPL 212
            S EY   IG+GTPPR F+++ DTGSD+ W+QC PC  + CY Q +P+FDP  SS+Y  +
Sbjct: 118 QSLEYVVTIGIGTPPRNFTVLFDTGSDLTWVQCLPCPDSSCYPQQEPLFDPSKSSTYVDV 177

Query: 213 PCAAPQCK--SLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGN----SGSVKGIAL 266
           PC+AP+C    +  + C A  C Y V YGD S T G L  ET +       + +  G+  
Sbjct: 178 PCSAPECHIGGVQQTRCGATSCEYSVKYGDESETHGSLAEETFTLSPPSPLAPAATGVVF 237

Query: 267 GCGHDNEGLF----VGSAGLLGLGGGMLSLTKQIKAT------SLAYCLVDRDSPASGVL 316
           GC H+   +F    +G AGLLGLG G  S+  Q + +        +YCL  R S ++G L
Sbjct: 238 GCSHEYISVFNDTGMGVAGLLGLGRGDSSILSQTRRSINSGGGVFSYCLPPRGS-STGYL 296

Query: 317 EFNSARGGDAVT---------APLIRN-KKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDE 366
                 GG A            PLI    ++ + Y V L G SV G AV IP S F +  
Sbjct: 297 TIG---GGAAAPQQQYSNLSFTPLITTISQLRSAYVVNLAGVSVNGAAVDIPASAFSL-- 351

Query: 367 AGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLK--PTSGVALFDTCYDFSGLRSVR 424
               G ++D GT +T +   AY  LRD F    G+ K  P   + L DTCYD +G   V 
Sbjct: 352 ----GAVIDSGTVVTHMPAAAYYPLRDEFRLHMGSYKMLPEGSMKLLDTCYDVTGQDVVT 407

Query: 425 VPTVSLHFGAGKALDLPAKNYLIPV---DSAGT----FCFAFAPTSSA-LSIIGNVQQQG 476
            P V+L FG G  +D+ A   L+ +   D +G      C AF PT+SA L I+GN+QQ+ 
Sbjct: 408 APRVALEFGGGARIDVDASGILLVLPAEDGSGQSLTLACLAFLPTNSAGLVIVGNMQQRA 467

Query: 477 TRVSFDLANNRVGFTPNKC 495
             V FD+   R+GF PN C
Sbjct: 468 YNVVFDVDGGRIGFGPNGC 486


>gi|356524287|ref|XP_003530761.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 481

 Score =  229 bits (583), Expect = 3e-57,   Method: Compositional matrix adjust.
 Identities = 156/435 (35%), Positives = 216/435 (49%), Gaps = 41/435 (9%)

Query: 86  SREILHK------TRHNDYRSLVLSR---LERDSARVNTLITKLQLAIYNVDR-HELKPA 135
           S E++HK        HN      +S    +  D+ RV  + ++L     N+ R + +K  
Sbjct: 62  SLEVVHKHGPCSQLNHNGKAKTTISHTDIMNLDNERVKYIQSRLS---KNLGRENSVKEL 118

Query: 136 EAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCT-ECY 194
           ++  LP        SG+  GS  YF  +G+GTP R  S+V DTGSD+ W QC PC   CY
Sbjct: 119 DSTTLPAK------SGSLIGSANYFVVVGLGTPKRDLSLVFDTGSDLTWTQCEPCAGSCY 172

Query: 195 QQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRA------NRCLYQVAYGDGSFTVGDL 248
           +Q D IFDP  SSSY  + C +  C  L  +  ++        C+Y + YGD S +VG L
Sbjct: 173 KQQDAIFDPSKSSSYINITCTSSLCTQLTSAGIKSRCSSSTTACIYGIQYGDKSTSVGFL 232

Query: 249 VTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQ---IKATSLAYCL 305
             E ++   +  V     GCG DNEGLF GSAGL+GLG   +S  +Q   I     +YCL
Sbjct: 233 SQERLTITATDIVDDFLFGCGQDNEGLFSGSAGLIGLGRHPISFVQQTSSIYNKIFSYCL 292

Query: 306 VDRDSPASGVLEF--NSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAV-QIPPSLF 362
               S + G L F  ++A   +    PL      +TFY + + G SVGG  +  +  S F
Sbjct: 293 -PSTSSSLGHLTFGASAATNANLKYTPLSTISGDNTFYGLDIVGISVGGTKLPAVSSSTF 351

Query: 363 EMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRS 422
                  GG I+D GT ITRL   AY +LR +F +        +   LFDTCYDFSG + 
Sbjct: 352 SA-----GGSIIDSGTVITRLAPTAYAALRSAFRQGMEKYPVANEDGLFDTCYDFSGYKE 406

Query: 423 VRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAP--TSSALSIIGNVQQQGTRVS 480
           + VP +   F  G  ++LP    LI   SA   C AFA     + ++I GNVQQ+   V 
Sbjct: 407 ISVPKIDFEFAGGVTVELPLVGILIG-RSAQQVCLAFAANGNDNDITIFGNVQQKTLEVV 465

Query: 481 FDLANNRVGFTPNKC 495
           +D+   R+GF    C
Sbjct: 466 YDVEGGRIGFGAAGC 480


>gi|255568540|ref|XP_002525244.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223535541|gb|EEF37210.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 460

 Score =  228 bits (581), Expect = 5e-57,   Method: Compositional matrix adjust.
 Identities = 133/409 (32%), Positives = 220/409 (53%), Gaps = 30/409 (7%)

Query: 102 VLSRLERDSARVNTLITKLQLAIYNVDRHE----LKPAEAQILPEDFSTPVVSGASQGSG 157
           +LSR E     +++ + K  +   +  RH+    L+P  A I       P+  G S GSG
Sbjct: 66  ILSRDEEHVKFLSSRLRKKDVQGASFSRHKSGHLLEPNSANI-------PLNPGLSIGSG 118

Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPC-TECYQQSDPIFDPKTSSSYSPLPCAA 216
            Y+ ++G+G+PP+ ++M+LDTGS ++WLQC+PC   C+ Q DP+F+P  S++Y PL C++
Sbjct: 119 NYYLKLGLGSPPKYYTMILDTGSSLSWLQCKPCVVYCHSQVDPLFEPSASNTYRPLYCSS 178

Query: 217 PQCKSLDVSA-----CRAN-RCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGH 270
            +C  L  +      C A+  C+Y  +YGD S+++G L  + ++   S ++     GCG 
Sbjct: 179 SECSLLKAATLNDPLCTASGVCVYTASYGDASYSMGYLSRDLLTLTPSQTLPSFTYGCGQ 238

Query: 271 DNEGLFVGSAGLLGLGGGMLSLTKQIK---ATSLAYCLVDRDSPASGVLEFNSARGGDAV 327
           DNEGLF  +AG++GL    LS+  Q+      + +YCL    S   G L           
Sbjct: 239 DNEGLFGKAAGIVGLARDKLSMLAQLSPKYGYAFSYCLPTSTSSGGGFLSIGKISPSSYK 298

Query: 328 TAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQA 387
             P+IRN +  + Y++ L   +V G+ V +  + +++        I+D GT +TRL    
Sbjct: 299 FTPMIRNSQNPSLYFLRLAAITVAGRPVGVAAAGYQVPT------IIDSGTVVTRLPISI 352

Query: 388 YNSLRDSFVR-LAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYL 446
           Y +LR++FV+ ++   +     ++ DTC+  S       P + + F  G  L L A N L
Sbjct: 353 YAALREAFVKIMSRRYEQAPAYSILDTCFKGSLKSMSGAPEIRMIFQGGADLSLRAPNIL 412

Query: 447 IPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           I  D  G  C AFA +S+ ++IIGN QQQ   +++D++ +++GF P  C
Sbjct: 413 IEADK-GIACLAFA-SSNQIAIIGNHQQQTYNIAYDVSASKIGFAPGGC 459


>gi|356523155|ref|XP_003530207.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 412

 Score =  228 bits (581), Expect = 5e-57,   Method: Compositional matrix adjust.
 Identities = 145/417 (34%), Positives = 217/417 (52%), Gaps = 38/417 (9%)

Query: 97  DYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGS 156
           D+   +  +L  D  RV ++  +++     V  H ++ ++ QI       P+ SG +  +
Sbjct: 13  DWNRRLQKQLISDDLRVRSMQNRIRRV---VSSHNVEASQTQI-------PLSSGINLQT 62

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAA 216
             Y   +G+G+     ++++DTGSD+ W+QC PC  CY Q  PIF P TSSSY  + C +
Sbjct: 63  LNYIVTMGLGST--NMTVIIDTGSDLTWVQCEPCMSCYNQQGPIFKPSTSSSYQSVSCNS 120

Query: 217 PQCKSL-----DVSACRAN--RCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCG 269
             C+SL     +  AC +N   C Y V YGDGS+T G+L  E +SFG   SV     GCG
Sbjct: 121 STCQSLQFATGNTGACGSNPSTCNYVVNYGDGSYTNGELGVEQLSFGGV-SVSDFVFGCG 179

Query: 270 HDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASGVLEFNSARGGDA 326
            +N+GLF G +GL+GLG   LSL  Q  AT     +YCL   +S ASG L   +      
Sbjct: 180 RNNKGLFGGVSGLMGLGRSYLSLVSQTNATFGGVFSYCLPTTESGASGSLVMGNESSVFK 239

Query: 327 VTAP-----LIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAIT 381
              P     ++ N ++  FY + LTG  V G A+Q+P         G+GG+++D GT IT
Sbjct: 240 NVTPITYTRMLPNPQLSNFYILNLTGIDVDGVALQVP-------SFGNGGVLIDSGTVIT 292

Query: 382 RLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLP 441
           RL +  Y +L+  F++         G ++ DTC++ +G   V +PT+S+HF     L + 
Sbjct: 293 RLPSSVYKALKALFLKQFTGFPSAPGFSILDTCFNLTGYDEVSIPTISMHFEGNAELKVD 352

Query: 442 AK-NYLIPVDSAGTFCFAFAPTSSAL--SIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           A   + +  + A   C A A  S A   +IIGN QQ+  RV +D   ++VGF    C
Sbjct: 353 ATGTFYVVKEDASQVCLALASLSDAYDTAIIGNYQQRNQRVIYDTKQSKVGFAEESC 409


>gi|293335828|ref|NP_001170221.1| uncharacterized protein LOC100384173 precursor [Zea mays]
 gi|224034427|gb|ACN36289.1| unknown [Zea mays]
          Length = 443

 Score =  228 bits (580), Expect = 8e-57,   Method: Compositional matrix adjust.
 Identities = 153/412 (37%), Positives = 215/412 (52%), Gaps = 38/412 (9%)

Query: 102 VLSR-LERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYF 160
           +LSR L R SARV TL +   LA        +  A   +L  D             GEY 
Sbjct: 49  LLSRALRRSSARVATLQSLAALA----PGDAITAARILVLASD-------------GEYL 91

Query: 161 SRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCK 220
             +G+GTP R +S +LDTGSD+ W QC PC  C  Q  P FDP  S++Y  L CA+P C 
Sbjct: 92  MEMGIGTPTRYYSAILDTGSDLIWTQCAPCLLCVDQPTPYFDPARSATYRSLGCASPACN 151

Query: 221 SLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSG---SVKGIALGCGHDNEGLFV 277
           +L    C    C+YQ  YGD + T G L  ET +FG +    S+ GI+ GCG+ N GL  
Sbjct: 152 ALYYPLCYQKVCVYQYFYGDSASTAGVLANETFTFGTNETRVSLPGISFGCGNLNAGLLA 211

Query: 278 GSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEF--------NSARGGDAVTA 329
             +G++G G G LSL  Q+ +   +YCL    SP    L F         +A      + 
Sbjct: 212 NGSGMVGFGRGSLSLVSQLGSPRFSYCLTSFLSPVPSRLYFGVYATLNSTNASSEPVQST 271

Query: 330 PLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEM-DEAGDGGIIVDCGTAITRLQTQAY 388
           P + N  + T Y++ +TG SVGG  + I P++F + D  G GG I+D GT IT L   AY
Sbjct: 272 PFVVNPALPTMYFLNMTGISVGGYLLPIDPAVFAINDTDGTGGTIIDSGTTITYLAEPAY 331

Query: 389 NSLRDSFV-RLAGNLKPTSGVALFDTCYDF--SGLRSVRVPTVSLHFGAGKALDLPAKNY 445
           +++R +F  ++   L   +  ++ DTC+ +     +SV +P + LHF  G   +LP +NY
Sbjct: 332 DAVRAAFASQITLPLLNVTDASVLDTCFQWPPPPRQSVTLPQLVLHFD-GADWELPLQNY 390

Query: 446 LIPVD--SAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           ++ VD  + G  C A A +SS  SIIG+ Q Q   V +DL N+ + F P  C
Sbjct: 391 ML-VDPSTGGGLCLAMA-SSSDGSIIGSYQHQNFNVLYDLENSLMSFVPAPC 440


>gi|356527091|ref|XP_003532147.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 482

 Score =  226 bits (576), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 148/424 (34%), Positives = 211/424 (49%), Gaps = 36/424 (8%)

Query: 85  HSREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDF 144
           HS +      HND  +L       D+ RV  + ++L   +   +R  +K  ++  LP   
Sbjct: 81  HSGKAEATISHNDIMNL-------DNERVKYIQSRLSKNLGGENR--VKELDSTTLPAK- 130

Query: 145 STPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCT-ECYQQSDPIFDP 203
                SG   GS +Y+  +G+GTP R  S++ DTGS + W QC PC   CY+Q DPIFDP
Sbjct: 131 -----SGRLIGSADYYVVVGLGTPKRDLSLIFDTGSYLTWTQCEPCAGSCYKQQDPIFDP 185

Query: 204 KTSSSYSPLPCAAPQCKSLDVSACRAN---RCLYQVAYGDGSFTVGDLVTETVSFGNSGS 260
             SSSY+ + C +  C     + C ++    C+Y V YGD S + G L  E ++   +  
Sbjct: 186 SKSSSYTNIKCTSSLCTQFRSAGCSSSTDASCIYDVKYGDNSISRGFLSQERLTITATDI 245

Query: 261 VKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQ---IKATSLAYCLVDRDSPAS-GVL 316
           V     GCG DNEGLF G+AGL+GL    +S  +Q   I     +YCL    +P+S G L
Sbjct: 246 VHDFLFGCGQDNEGLFRGTAGLMGLSRHPISFVQQTSSIYNKIFSYCL--PSTPSSLGHL 303

Query: 317 EF--NSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAV-QIPPSLFEMDEAGDGGII 373
            F  ++A   +    P       ++FY + + G SVGG  +  +  S F       GG I
Sbjct: 304 TFGASAATNANLKYTPFSTISGENSFYGLDIVGISVGGTKLPAVSSSTFSA-----GGSI 358

Query: 374 VDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFG 433
           +D GT ITRL   AY +LR +F +         G  L DTCYDFSG + + VP +   F 
Sbjct: 359 IDSGTVITRLPPTAYAALRSAFRQFMMKYPVAYGTRLLDTCYDFSGYKEISVPRIDFEFA 418

Query: 434 AGKALDLPAKNYLIPVDSAGTFCFAFAPTSSA--LSIIGNVQQQGTRVSFDLANNRVGFT 491
            G  ++LP    L   +SA   C AFA   +   ++I GNVQQ+   V +D+   R+GF 
Sbjct: 419 GGVKVELPLVGILYG-ESAQQLCLAFAANGNGNDITIFGNVQQKTLEVVYDVEGGRIGFG 477

Query: 492 PNKC 495
              C
Sbjct: 478 AAGC 481


>gi|356527089|ref|XP_003532146.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 488

 Score =  226 bits (575), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 144/385 (37%), Positives = 199/385 (51%), Gaps = 33/385 (8%)

Query: 124 IYNVDRHELKPAEAQI---LPEDFST--------PVVSGASQGSGEYFSRIGVGTPPRQF 172
           I N D+  +K   ++I   L +D S         P  SG+  GSG YF  +G+GTP R  
Sbjct: 99  ILNQDKERVKYINSRISKNLGQDSSVSELDSVTLPAKSGSLIGSGNYFVVVGLGTPKRDL 158

Query: 173 SMVLDTGSDINWLQCRPCTE-CYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVS-----A 226
           S++ DTGSD+ W QC PC   CY+Q D IFDP  S+SYS + C +  C  L  +      
Sbjct: 159 SLIFDTGSDLTWTQCEPCARSCYKQQDAIFDPSKSTSYSNITCTSTLCTQLSTATGNEPG 218

Query: 227 CRANR--CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLG 284
           C A+   C+Y + YGD SF+VG    E +S   +  V     GCG +N+GLF GSAGL+G
Sbjct: 219 CSASTKACIYGIQYGDSSFSVGYFSRERLSVTATDIVDNFLFGCGQNNQGLFGGSAGLIG 278

Query: 285 LGGGMLSLTKQIKATS---LAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFY 341
           LG   +S  +Q  A      +YCL    S ++G L F +         P     +  +FY
Sbjct: 279 LGRHPISFVQQTAAVYRKIFSYCL-PATSSSTGRLSFGTTTTSYVKYTPFSTISRGSSFY 337

Query: 342 YVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGN 401
            + +TG SVGG  + +  S F       GG I+D GT ITRL   AY +LR +F R   +
Sbjct: 338 GLDITGISVGGAKLPVSSSTFST-----GGAIIDSGTVITRLPPTAYTALRSAF-RQGMS 391

Query: 402 LKPTSG-VALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFA 460
             P++G +++ DTCYD SG     +P +   F  G  + LP +  L  V SA   C AFA
Sbjct: 392 KYPSAGELSILDTCYDLSGYEVFSIPKIDFSFAGGVTVQLPPQGILY-VASAKQVCLAFA 450

Query: 461 PT--SSALSIIGNVQQQGTRVSFDL 483
                S ++I GNVQQ+   V +D+
Sbjct: 451 ANGDDSDVTIYGNVQQKTIEVVYDV 475


>gi|242066140|ref|XP_002454359.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
 gi|241934190|gb|EES07335.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
          Length = 460

 Score =  226 bits (575), Expect = 3e-56,   Method: Compositional matrix adjust.
 Identities = 145/429 (33%), Positives = 219/429 (51%), Gaps = 28/429 (6%)

Query: 75  NSSSSFSLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKP 134
           +SS + ++PLH R              +  RL RD  R   +  K     ++ D  +   
Sbjct: 52  SSSGATTVPLHHRHGPCSPLPTKKMPSLEDRLHRDQLRAAYIKRK-----FSGDVKKDGQ 106

Query: 135 AEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECY 194
               +     + P   G S  + EY   + +G+P +  ++++D+GSD++W+QC+PC +C+
Sbjct: 107 GAGGVEQSHVTVPTTLGTSLNTLEYLITVRLGSPAKTQTVLIDSGSDVSWVQCKPCLQCH 166

Query: 195 QQSDPIFDPKTSSSYSPLPCAAPQCKSL--DVSAC-RANRCLYQVAYGDGSFTVGDLVTE 251
            Q DP+FDP  SS+YSP  C++  C  L  D + C  +++C Y V Y DGS T G   ++
Sbjct: 167 SQVDPLFDPSLSSTYSPFSCSSAACAQLGQDGNGCSSSSQCQYIVRYADGSSTTGTYSSD 226

Query: 252 TVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIK---ATSLAYCLVDR 308
           T++ G S ++     GC H   G    + GL+GLGGG  SL  Q      T+ +YCL   
Sbjct: 227 TLALG-SNTISNFQFGCSHVESGFNDLTDGLMGLGGGAPSLASQTAGTFGTAFSYCLPPT 285

Query: 309 DSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAG 368
            S +SG L   +   G  V  P++R+  V TFY V L    VGG  + IP S+F      
Sbjct: 286 PS-SSGFLTLGAGTSG-FVKTPMLRSSPVPTFYGVRLEAIRVGGTQLSIPTSVFS----- 338

Query: 369 DGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTV 428
             G+++D GT ITRL   AY++L  +F       +P    ++ DTC+DFSG  SVR+P+V
Sbjct: 339 -AGMVMDSGTIITRLPRTAYSALSSAFKAGMKQYRPAPPRSIMDTCFDFSGQSSVRLPSV 397

Query: 429 SLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTS--SALSIIGNVQQQGTRVSFDLANN 486
           +L F  G  ++L A   ++        C AFA  S  S+  I+GNVQQ+   V +D+   
Sbjct: 398 ALVFSGGAVVNLDANGIIL------GNCLAFAANSDDSSPGIVGNVQQRTFEVLYDVGGG 451

Query: 487 RVGFTPNKC 495
            VGF    C
Sbjct: 452 AVGFKAGAC 460


>gi|294461757|gb|ADE76437.1| unknown [Picea sitchensis]
          Length = 325

 Score =  225 bits (574), Expect = 3e-56,   Method: Compositional matrix adjust.
 Identities = 126/334 (37%), Positives = 182/334 (54%), Gaps = 24/334 (7%)

Query: 174 MVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLD--VSACRANR 231
           +++DTGSDI W+QC PC +CY+Q D +F P  S++Y PLPC +  C+ L     +C  + 
Sbjct: 3   LLIDTGSDITWIQCDPCPQCYKQQDSLFQPAGSATYKPLPCNSTMCQQLQSFSHSCLNSS 62

Query: 232 CLYQVAYGDGSFTVGDLVTETVSFGNSG----SVKGIALGCGHDNEGLFVGSAGLLGLGG 287
           C Y V+YGD S T GD   ET++  +      SV   A GCGH N+GLF G+AGL+GLG 
Sbjct: 63  CNYMVSYGDKSTTRGDFALETLTLRSDDTILVSVPNFAFGCGHANKGLFNGAAGLMGLGK 122

Query: 288 GMLSLTKQIKAT---SLAYCLVDRDSPA-SGVLEFNSAR--GGDAVTAPLIRNKKVDTFY 341
             +    Q         +YCL    S   SG+L F  A     D    PL+ +    + Y
Sbjct: 123 SSIGFPAQTSVAFGKVFSYCLPSVSSTIPSGILHFGEAAMLDYDVRFTPLVDSSSGPSQY 182

Query: 342 YVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGN 401
           +V +TG +VG + + I  +           ++VD GT I+R +  AY  LRD+F ++   
Sbjct: 183 FVSMTGINVGDELLPISAT-----------VMVDSGTVISRFEQSAYERLRDAFTQILPG 231

Query: 402 LKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAP 461
           L+    VA FDTC+  S +  + +P ++LHF     L L   + L PVD  G  CFAFAP
Sbjct: 232 LQTAVSVAPFDTCFRVSTVDDINIPLITLHFRDDAELRLSPVHILYPVDD-GVMCFAFAP 290

Query: 462 TSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           +SS  S++GN QQQ  R  +D+  +R+G +  +C
Sbjct: 291 SSSGRSVLGNFQQQNLRFVYDIPKSRLGISAFEC 324


>gi|297828001|ref|XP_002881883.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297327722|gb|EFH58142.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 529

 Score =  225 bits (574), Expect = 3e-56,   Method: Compositional matrix adjust.
 Identities = 153/471 (32%), Positives = 238/471 (50%), Gaps = 44/471 (9%)

Query: 62  AEESETAAESFPLNSSSSFSLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQ 121
           ++E + A E    ++  S  L L  REI  +T+   +   V+    +D  R+ TL  + +
Sbjct: 63  SKEHDPAKE----HTRESVKLHLRRREIKQETKRTTHS--VVDLQIQDLTRIQTLHARFK 116

Query: 122 LAIYNVDRHELKPAEA--------QILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFS 173
            +    +    K   +        ++ P      + SG + GSGEYF  + VGTPP+ FS
Sbjct: 117 KSKKQRNEKVKKKITSDISLVGAPEVSPGKLIATLESGMTLGSGEYFMDVLVGTPPKHFS 176

Query: 174 MVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSA----CRA 229
           ++LDTGSD+NWLQC PC +C+ Q++  +DPKTS+S+  + C  P+C  +        C++
Sbjct: 177 LILDTGSDLNWLQCLPCYDCFHQNEAFYDPKTSASFKNITCNDPRCSLISSPEPPVQCKS 236

Query: 230 NR--CLYQVAYGDGSFTVGDLVTETVSF------GNSG--SVKGIALGCGHDNEGLFVGS 279
           +   C Y   YGD S T GD   ET +       G S    V+ +  GCGH N GLF G+
Sbjct: 237 DNQSCPYFYWYGDRSNTTGDFAVETFTVNLTTTEGRSSEYKVENMMFGCGHWNRGLFSGA 296

Query: 280 AGLLGLGGGMLSLTKQIKA---TSLAYCLVDRDSPASGVLEFNSARGGDAVTAPLI---- 332
           +GLLGLG G LS + Q+++    S +YCLVDR+S  +   +       D +    +    
Sbjct: 297 SGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTNVSSKLIFGEDKDLLNHTNLNFTS 356

Query: 333 ----RNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAY 388
               +   V+TFYY+ +    VGG+A+ IP   + +   G GG I+D GT ++     AY
Sbjct: 357 FVNGKENSVETFYYIQIKSILVGGEALDIPEETWNISPDGAGGTIIDSGTTLSYFAEPAY 416

Query: 389 NSLRDSFV-RLAGNLKPTSGVALFDTCYDFSGLR--SVRVPTVSLHFGAGKALDLPAKNY 445
             +++ F  ++  N        + D C++ SG+   ++ +P + + F  G   + PA+N 
Sbjct: 417 EIIKNKFAEKMKENYLVFRDFPVLDPCFNVSGIEENNIHLPELGIAFADGAVWNFPAENS 476

Query: 446 LIPVDSAGTFCFAFAPT-SSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            I + S    C A   T  S  SIIGN QQQ   + +D   +R+GFTP KC
Sbjct: 477 FIWL-SEDLVCLAILGTPKSTFSIIGNYQQQNFHILYDTKMSRLGFTPTKC 526


>gi|116790042|gb|ABK25480.1| unknown [Picea sitchensis]
          Length = 460

 Score =  225 bits (574), Expect = 4e-56,   Method: Compositional matrix adjust.
 Identities = 145/402 (36%), Positives = 214/402 (53%), Gaps = 34/402 (8%)

Query: 105 RLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIG 164
           R +R   R    + KLQ+++      E+K  EA         PV +G    +GE+  ++ 
Sbjct: 79  RFKRAIKRSQDRLEKLQMSV-----DEVKAVEA---------PVYAG----NGEFLMKMA 120

Query: 165 VGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDV 224
           +GTP   FS +LDTGSD+ W QC+PCT+CY Q  PI+DP  SS+YS +PC++  C++L +
Sbjct: 121 IGTPSLSFSAILDTGSDLTWTQCKPCTDCYPQPTPIYDPSQSSTYSKVPCSSSMCQALPM 180

Query: 225 SACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLG 284
            +C    C Y  +YGD S T G L  E+ +   S S+  IA GCG +NEG      G L 
Sbjct: 181 YSCSGANCEYLYSYGDQSSTQGILSYESFTL-TSQSLPHIAFGCGQENEGGGFSQGGGLV 239

Query: 285 LGGGM-LSLTKQIKAT---SLAYCLVD-RDSPASGVLEF----NSARGGDAVTAPLIRNK 335
             G   LSL  Q+  +     +YCLV   DSP+     F     S       + PL++++
Sbjct: 240 GFGRGPLSLISQLGQSLGNKFSYCLVSITDSPSKTSPLFIGKTASLNAKTVSSTPLVQSR 299

Query: 336 KVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSF 395
              TFYY+ L G SVGGQ + I    F++   G GG+I+D GT +T L+   Y+ ++ + 
Sbjct: 300 SRPTFYYLSLEGISVGGQLLDIADGTFDLQLDGTGGVIIDSGTTVTYLEQSGYDVVKKAV 359

Query: 396 VRLAGNLKPTSGVAL-FDTCYD-FSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAG 453
           +  + NL    G  +  D C++  SG  +   PT++ HF  G   +LP +NY I  DS+G
Sbjct: 360 IS-SINLPQVDGSNIGLDLCFEPQSGSSTSHFPTITFHF-EGADFNLPKENY-IYTDSSG 416

Query: 454 TFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
             C A  P S+ +SI GN+QQQ  ++ +D   N + F P  C
Sbjct: 417 IACLAMLP-SNGMSIFGNIQQQNYQILYDNERNVLSFAPTVC 457


>gi|356526294|ref|XP_003531753.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 414

 Score =  225 bits (573), Expect = 4e-56,   Method: Compositional matrix adjust.
 Identities = 144/418 (34%), Positives = 222/418 (53%), Gaps = 38/418 (9%)

Query: 97  DYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGS 156
           D+   +  +L  D  RV ++  +++        H ++ ++ QI       P+ SG +  +
Sbjct: 13  DWNRRLQKQLILDDLRVRSMQNRIRRV---ASTHNVEASQTQI-------PLSSGINLQT 62

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAA 216
             Y   +G+G+  +  ++++DTGSD+ W+QC PC  CY Q  PIF P TSSSY  + C +
Sbjct: 63  LNYIVTMGLGS--KNMTVIIDTGSDLTWVQCEPCMSCYNQQGPIFKPSTSSSYQSVSCNS 120

Query: 217 PQCKSL-----DVSACRANR---CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGC 268
             C+SL     +  AC ++    C Y V YGDGS+T G+L  E +SFG   SV     GC
Sbjct: 121 STCQSLQFATGNTGACGSSNPSTCNYVVNYGDGSYTNGELGVEALSFGGV-SVSDFVFGC 179

Query: 269 GHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASGVL----EFNSA 321
           G +N+GLF G +GL+GLG   LSL  Q  AT     +YCL   ++ +SG L    E +  
Sbjct: 180 GRNNKGLFGGVSGLMGLGRSYLSLVSQTNATFGGVFSYCLPTTEAGSSGSLVMGNESSVF 239

Query: 322 RGGDAVT-APLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAI 380
           +  + +T   ++ N ++  FY + LTG  VGG A++ P S       G+GGI++D GT I
Sbjct: 240 KNANPITYTRMLSNPQLSNFYILNLTGIDVGGVALKAPLSF------GNGGILIDSGTVI 293

Query: 381 TRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDL 440
           TRL +  Y +L+  F++         G ++ DTC++ +G   V +PT+SL F     L++
Sbjct: 294 TRLPSSVYKALKAEFLKKFTGFPSAPGFSILDTCFNLTGYDEVSIPTISLRFEGNAQLNV 353

Query: 441 PAK-NYLIPVDSAGTFCFAFAPTSSAL--SIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            A   + +  + A   C A A  S A   +IIGN QQ+  RV +D   ++VGF    C
Sbjct: 354 DATGTFYVVKEDASQVCLALASLSDAYDTAIIGNYQQRNQRVIYDTKQSKVGFAEEPC 411


>gi|297817208|ref|XP_002876487.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297322325|gb|EFH52746.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 520

 Score =  225 bits (573), Expect = 4e-56,   Method: Compositional matrix adjust.
 Identities = 150/436 (34%), Positives = 224/436 (51%), Gaps = 46/436 (10%)

Query: 102 VLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVS----------- 150
           VL    RD  R+ TL  ++ LA  N  ++ +   + +   E  +TPV S           
Sbjct: 86  VLELQIRDLTRIQTLHKRV-LAKKN--QNTVSQKQKKKNKEVVTTPVASSVEEQAGQLVA 142

Query: 151 ----GASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTS 206
               G + GSGEYF  + VG+PP+ FS++LDTGSD+NW+QC PC +C+QQ+   +DPK S
Sbjct: 143 TLESGMTLGSGEYFMDVLVGSPPKHFSLILDTGSDLNWIQCLPCHDCFQQNGAFYDPKAS 202

Query: 207 SSYSPLPCAAPQCKSLD----VSACRANR--CLYQVAYGDGSFTVGDLVTE--TVSFGNS 258
           +SY  + C  P+C  +        C+++   C Y   YGD S T GD   E  TV+   S
Sbjct: 203 ASYKNITCNDPRCNLVSPPDPPKPCKSDNQSCPYYYWYGDSSNTTGDFAVETFTVNLTTS 262

Query: 259 G------SVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKA---TSLAYCLVDRD 309
           G      +V+ +  GCGH N GLF G+AGLLGLG G LS + Q+++    S +YCLVDR+
Sbjct: 263 GGSSELYNVENMMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRN 322

Query: 310 SPASGVLEFNSARGGDAVTAPLI--------RNKKVDTFYYVGLTGFSVGGQAVQIPPSL 361
           S  +   +       D ++ P +        +   VDTFYYV +    V G+ + IP   
Sbjct: 323 SDTNVSSKLIFGEDKDLLSHPNLNFTSFVARKENLVDTFYYVQIKSIIVAGEVLNIPEET 382

Query: 362 FEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPT-SGVALFDTCYDFSGL 420
           + +   G GG I+D GT ++     AY  +++     A    P      + D C++ SG+
Sbjct: 383 WNISSDGAGGTIIDSGTTLSYFAEPAYEFIKNKIAEKAKGKYPVYRDFPILDPCFNVSGI 442

Query: 421 RSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPT-SSALSIIGNVQQQGTRV 479
            S+++P + + F  G   + P +N  I ++     C A   T  SA SIIGN QQQ   +
Sbjct: 443 DSIQLPELGIAFADGAVWNFPTENSFIWLNE-DLVCLAILGTPKSAFSIIGNYQQQNFHI 501

Query: 480 SFDLANNRVGFTPNKC 495
            +D   +R+G+ P KC
Sbjct: 502 LYDTKRSRLGYAPTKC 517


>gi|242058093|ref|XP_002458192.1| hypothetical protein SORBIDRAFT_03g028480 [Sorghum bicolor]
 gi|241930167|gb|EES03312.1| hypothetical protein SORBIDRAFT_03g028480 [Sorghum bicolor]
          Length = 468

 Score =  224 bits (572), Expect = 5e-56,   Method: Compositional matrix adjust.
 Identities = 159/470 (33%), Positives = 224/470 (47%), Gaps = 64/470 (13%)

Query: 61  FAEESETAAESFPLNSSSSFSLPLHSREI----LHKTRH---------NDYRSLVLSRLE 107
           F     TA+E  P+ S+S  +L   S  +    +H  RH         +D  S    RL 
Sbjct: 28  FVAVPTTASEPEPVCSTSGVTLDPGSNTVSVPLVH--RHGPCAPTQLSSDKPSSFTDRLR 85

Query: 108 RDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGT 167
           R+ AR   +++++   +   D              D S P   G S  S EY   +G+GT
Sbjct: 86  RNRARSKYIMSRVSKGMMGDD-------------ADVSIPTHLGGSVDSLEYVVTVGLGT 132

Query: 168 PPRQFSMVLDTGSDINWLQCRPC--TECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLD-- 223
           P     +++DTGSD++W+QC+PC  T CY Q DP+FDP  SS+Y+P+PC    C+ L   
Sbjct: 133 PSVSQVLLIDTGSDLSWVQCQPCNSTTCYPQKDPLFDPSKSSTYAPIPCNTDACRDLTDD 192

Query: 224 ------VSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFV 277
                  S   A +C + + YGDGS T G    ET++     +VK    GCGHD +G   
Sbjct: 193 GYGGGCASGDGAAQCGFAITYGDGSQTRGVYSNETLALAPGVAVKDFRFGCGHDQDGAND 252

Query: 278 GSAGLLGLGGGMLSLTKQ---IKATSLAYCLVDRDSPASGVLEFNSARGGDAVT------ 328
              GLLGLGG   SL  Q   +   + +YCL   ++    +           V       
Sbjct: 253 KYDGLLGLGGAPESLVVQTASVYGGAFSYCLPALNNQVGFLALGGGGAPSGGVVNTSGFV 312

Query: 329 -APLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQA 387
             P+IR ++  TFY V +TG +VGG+ + +PPS F       GG+I+D GT +T LQ  A
Sbjct: 313 FTPMIREEE--TFYVVNMTGITVGGEPIDVPPSAFS------GGMIIDSGTVVTELQHTA 364

Query: 388 YNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLI 447
           YN+L+ +F R A    P       DTCYDFSG  +V +P V+L F  G  +DL   N ++
Sbjct: 365 YNALQAAF-RKAMAAYPLVRNGELDTCYDFSGYSNVTLPKVALTFSGGATIDLDVPNGIL 423

Query: 448 PVDSAGTFCFAFAPT--SSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
             D     C AF  +       I+GNV Q+   V +D    RVGF    C
Sbjct: 424 LDD-----CLAFQESGPDDQPGILGNVNQRTLEVLYDAGRGRVGFRAAVC 468


>gi|414589630|tpg|DAA40201.1| TPA: hypothetical protein ZEAMMB73_629620 [Zea mays]
          Length = 443

 Score =  224 bits (572), Expect = 6e-56,   Method: Compositional matrix adjust.
 Identities = 152/412 (36%), Positives = 214/412 (51%), Gaps = 38/412 (9%)

Query: 102 VLSR-LERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYF 160
           +LSR L R SARV TL +   LA        +  A   +L  D             GEY 
Sbjct: 49  LLSRALRRSSARVATLQSLAALA----PGDAITAARILVLASD-------------GEYL 91

Query: 161 SRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCK 220
             +G+GTP R +S +LDTGSD+ W QC PC  C  Q  P FDP  S++Y  L CA+P C 
Sbjct: 92  MEMGIGTPTRYYSAILDTGSDLIWTQCAPCLLCVDQPTPYFDPARSATYRSLGCASPACN 151

Query: 221 SLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSG---SVKGIALGCGHDNEGLFV 277
           +L    C    C+YQ  YGD + T G L  ET +FG +    S+ GI+ GCG+ N G   
Sbjct: 152 ALYYPLCYQKVCVYQYFYGDSASTAGVLANETFTFGTNETRVSLPGISFGCGNLNAGSLA 211

Query: 278 GSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEF--------NSARGGDAVTA 329
             +G++G G G LSL  Q+ +   +YCL    SP    L F         +A      + 
Sbjct: 212 NGSGMVGFGRGSLSLVSQLGSPRFSYCLTSFLSPVPSRLYFGVYATLNSTNASSEPVQST 271

Query: 330 PLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEM-DEAGDGGIIVDCGTAITRLQTQAY 388
           P + N  + T Y++ +TG SVGG  + I P++F + D  G GG I+D GT IT L   AY
Sbjct: 272 PFVVNPALPTMYFLNMTGISVGGYLLPIDPAVFAINDTDGTGGTIIDSGTTITYLAEPAY 331

Query: 389 NSLRDSFV-RLAGNLKPTSGVALFDTCYDF--SGLRSVRVPTVSLHFGAGKALDLPAKNY 445
           +++R +F  ++   L   +  ++ DTC+ +     +SV +P + LHF  G   +LP +NY
Sbjct: 332 DAVRAAFASQITLPLLNVTDASVLDTCFQWPPPPRQSVTLPQLVLHFD-GADWELPLQNY 390

Query: 446 LIPVD--SAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           ++ VD  + G  C A A +SS  SIIG+ Q Q   V +DL N+ + F P  C
Sbjct: 391 ML-VDPSTGGGLCLAMA-SSSDGSIIGSYQHQNFNVLYDLENSLMSFVPAPC 440


>gi|242066142|ref|XP_002454360.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
 gi|241934191|gb|EES07336.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
          Length = 466

 Score =  223 bits (569), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 148/444 (33%), Positives = 228/444 (51%), Gaps = 31/444 (6%)

Query: 59  EPFAEESETAAESFPLNSSSSFSLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLIT 118
           E  A +S T A + PL+       PL ++++            +  RL RD  R      
Sbjct: 47  ESKAVKSSTGAATVPLHHRHGPCSPLPTKKM----------PTLEERLHRDQLRA----A 92

Query: 119 KLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDT 178
            +Q        +  +     +     + P   G S  + EY   + +G+P +  +M++DT
Sbjct: 93  YIQRKFSGGGVNGSRGGAGDVQQSHATVPTTLGTSLDTLEYLITVRLGSPGKSQTMLIDT 152

Query: 179 GSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSL--DVSACRANRCLYQV 236
           GSD++W+QC+PC++C+ Q+DP+FDP +SS+YSP  C++  C  L  + + C +++C Y V
Sbjct: 153 GSDVSWVQCKPCSQCHSQADPLFDPSSSSTYSPFSCSSAACAQLGQEGNGCSSSQCQYTV 212

Query: 237 AYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQI 296
            YGDGS T G   ++T++ G S +V+    GC +   G    + GL+GLGGG  SL  Q 
Sbjct: 213 TYGDGSSTTGTYSSDTLALG-SNAVRKFQFGCSNVESGFNDQTDGLMGLGGGAQSLVSQT 271

Query: 297 KAT---SLAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQ 353
             T   + +YCL    S +SG L   +   G  V  P++R+ +V TFY V +    VGG+
Sbjct: 272 AGTFGAAFSYCL-PATSSSSGFLTLGAGTSG-FVKTPMLRSSQVPTFYGVRIQAIRVGGR 329

Query: 354 AVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDT 413
            + IP S+F        G I+D GT +TRL   AY++L  +F              + DT
Sbjct: 330 QLSIPTSVFS------AGTIMDSGTVLTRLPPTAYSALSSAFKAGMKQYPSAPPSGILDT 383

Query: 414 CYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTS--SALSIIGN 471
           C+DFSG  SV +PTV+L F  G  +D+ +   ++   ++   C AFA  S  S+L IIGN
Sbjct: 384 CFDFSGQSSVSIPTVALVFSGGAVVDIASDGIMLQTSNS-ILCLAFAANSDDSSLGIIGN 442

Query: 472 VQQQGTRVSFDLANNRVGFTPNKC 495
           VQQ+   V +D+    VGF    C
Sbjct: 443 VQQRTFEVLYDVGGGAVGFKAGAC 466


>gi|413953772|gb|AFW86421.1| hypothetical protein ZEAMMB73_098827 [Zea mays]
          Length = 482

 Score =  223 bits (568), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 142/365 (38%), Positives = 190/365 (52%), Gaps = 35/365 (9%)

Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTE-CYQQSDPIFDPKTSSSYSPLPCAA 216
           EY   IG+GTP R F+++ DTGSD+ W+QC+PCT+ CYQQ +P+FDP  SS+Y  +PC  
Sbjct: 125 EYVVTIGIGTPARNFTVLFDTGSDLTWVQCKPCTDSCYQQQEPLFDPSKSSTYVDVPCGT 184

Query: 217 PQCK---SLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSG-SVKGIALGCGHDN 272
           PQCK     D++ C    C Y V YGD S T G+L  E  +   S     G+  GC H+ 
Sbjct: 185 PQCKIGGGQDLT-CGGTTCEYSVKYGDQSVTRGNLAQEAFTLSPSAPPAAGVVFGCSHEY 243

Query: 273 EGLFVGS------AGLLGLGGGMLSLTKQIKATS----LAYCLVDRDSPASGVLEFNSA- 321
                G+      AGLLGLG G  S+  Q +  +     +YCL  R S A G L   +A 
Sbjct: 244 SSGVKGAEEEMSVAGLLGLGRGDSSILSQTRRGNSGDVFSYCLPPRGSSA-GYLTIGAAA 302

Query: 322 --RGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTA 379
             +   + T  +  N ++ + Y V L G SV G A+ I  S F +      G ++D GT 
Sbjct: 303 PPQSNLSFTPLVTDNSQLSSVYVVNLVGISVSGAALPIDASAFYI------GTVIDSGTV 356

Query: 380 ITRLQTQAYNSLRDSFVRLAG--NLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKA 437
           IT +   AY  LRD F R  G   + P   V   DTCYD +G   V  P V+L FG G  
Sbjct: 357 ITHMPAAAYYVLRDEFRRHMGGYTMLPEGHVESLDTCYDVTGHDVVTAPPVALEFGGGAR 416

Query: 438 LDLPAKNYLI--PVDSAGT----FCFAFAPTS-SALSIIGNVQQQGTRVSFDLANNRVGF 490
           +D+ A   L+   VD++G      C AF PT+     IIGN+QQ+   V FD+   R+GF
Sbjct: 417 IDVDASGILLVFAVDASGQSLTLACLAFVPTNLPGFVIIGNMQQRAYNVVFDVEGRRIGF 476

Query: 491 TPNKC 495
             N C
Sbjct: 477 GANGC 481


>gi|357448247|ref|XP_003594399.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355483447|gb|AES64650.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 452

 Score =  222 bits (566), Expect = 3e-55,   Method: Compositional matrix adjust.
 Identities = 133/425 (31%), Positives = 229/425 (53%), Gaps = 36/425 (8%)

Query: 92  KTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSG 151
           K+  N    L      +D  R+    ++L          +   +  ++ P+    P+ SG
Sbjct: 42  KSPPNSTSLLFAYMFAKDEERIRYFHSRL------AKNSDANASSKKVGPKLAGIPLKSG 95

Query: 152 ASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCT-ECYQQSDPIFDPKTSSSYS 210
            S GSG Y+ ++G+G+P + ++M++DTGS  +WLQC+PCT  C+ Q DP+F+P  S +Y 
Sbjct: 96  LSMGSGNYYVKMGLGSPTKYYTMIVDTGSSFSWLQCQPCTIYCHIQEDPVFNPSASKTYK 155

Query: 211 PLPCAAPQCK-----SLDVSAC--RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKG 263
            +PC++ QC      +L+   C  ++N C+Y+ +YGD SF++G L  + ++   S ++  
Sbjct: 156 TVPCSSSQCSSLKSATLNEPTCSKQSNACVYKASYGDSSFSLGYLSQDVLTLTPSQTLSS 215

Query: 264 IALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIK---ATSLAYCLVDR----DSPASGVL 316
              GCG DN+GLF  + G++GL    LS+  Q+      + +YCL       +SP  G L
Sbjct: 216 FVYGCGQDNQGLFGRTDGIIGLANNELSMLSQLSGKYGNAFSYCLPTSFSTPNSPKEGFL 275

Query: 317 EFNSARGGDAVT---APLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGII 373
              ++    + +    PL++N    + Y++ L   +V G+ + +  S +++        I
Sbjct: 276 SIGTSSLTPSSSYKFTPLLKNPNNPSLYFIDLESITVAGRPLGVAASSYKVPT------I 329

Query: 374 VDCGTAITRLQTQAYNSLRDSFVR-LAGNLKPTSGVALFDTCYD--FSGLRSVRVPTVSL 430
           +D GT ITRL T  Y +L++++V  L+   +   G++L DTC+    +G+  V  P + +
Sbjct: 330 IDSGTVITRLPTPVYTTLKNAYVTILSKKYQQAPGISLLDTCFKGSLAGISEV-APDIRI 388

Query: 431 HFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGF 490
            F  G  L L   N L+ +++ G  C A A  SS+++IIGN QQQ  +V++D+ N+RVGF
Sbjct: 389 IFKGGADLQLKGHNSLVELET-GITCLAMA-GSSSIAIIGNYQQQTVKVAYDVGNSRVGF 446

Query: 491 TPNKC 495
            P  C
Sbjct: 447 APGGC 451


>gi|45735845|dbj|BAD12880.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
 gi|45735971|dbj|BAD13000.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
          Length = 333

 Score =  222 bits (566), Expect = 3e-55,   Method: Compositional matrix adjust.
 Identities = 140/343 (40%), Positives = 194/343 (56%), Gaps = 21/343 (6%)

Query: 163 IGVGTPPRQFSMVLDTGSDINWLQCRPC-TECYQQSDPIFDPKTSSSYSPLPCAAPQCK- 220
           +G+GTP  Q+ MV+DTGS + WLQC PC   C++QS P+F+PK+SS+Y+ + C+A QC  
Sbjct: 1   MGLGTPATQYVMVVDTGSSLTWLQCSPCLVSCHRQSGPVFNPKSSSTYASVGCSAQQCSD 60

Query: 221 ----SLDVSAC-RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGL 275
               +L+ SAC  +N C+YQ +YGD SF+VG L  +TVSFG++ S+     GCG DNEGL
Sbjct: 61  LPSATLNPSACSSSNVCIYQASYGDSSFSVGYLSKDTVSFGST-SLPNFYYGCGQDNEGL 119

Query: 276 FVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASGVLEFNSARGGDAVTAPLI 332
           F  SAGL+GL    LSL  Q+  +   S  YCL    S +SG L   S   G     P++
Sbjct: 120 FGRSAGLIGLARNKLSLLYQLAPSLGYSFTYCL--PSSSSSGYLSLGSYNPGQYSYTPMV 177

Query: 333 RNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLR 392
            +   D+ Y++ L+G +V G      P             I+D GT ITRL T  Y++L 
Sbjct: 178 SSSLDDSLYFIKLSGMTVAGN-----PLSVSSSAYSSLPTIIDSGTVITRLPTSVYSALS 232

Query: 393 DSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSA 452
            +           S  ++ DTC+     R V  P V++ F  G AL L A+N L+ VD +
Sbjct: 233 KAVAAAMKGTSRASAYSILDTCFKGQASR-VSAPAVTMSFAGGAALKLSAQNLLVDVDDS 291

Query: 453 GTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            T C AFAP  SA +IIGN QQQ   V +D+ ++R+GF    C
Sbjct: 292 TT-CLAFAPARSA-AIIGNTQQQTFSVVYDVKSSRIGFAAGGC 332


>gi|413953792|gb|AFW86441.1| hypothetical protein ZEAMMB73_342504 [Zea mays]
          Length = 459

 Score =  222 bits (565), Expect = 4e-55,   Method: Compositional matrix adjust.
 Identities = 164/476 (34%), Positives = 238/476 (50%), Gaps = 64/476 (13%)

Query: 45  QQTEHILSFEPETLEPFAEESETAAES---FPLNSSSSFSLPLHSRE--ILHKTRHNDYR 99
            + EH+L   P +   ++E + T + S   +    S++ S+PL  R       TR +D  
Sbjct: 23  NEEEHVLVAVPTSR--YSEPAATCSTSRVRWLDEGSNTVSVPLVHRHGPCAPSTRSSDEP 80

Query: 100 SLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEY 159
           SL   RL R  AR   ++++   +                   + S P   G S  S EY
Sbjct: 81  SLS-ERLRRSRARSKYIMSRASKS-------------------NVSIPTHLGGSVDSLEY 120

Query: 160 FSRIGVGTPPRQFSMVLDTGSDINWLQCRPC--TECYQQSDPIFDPKTSSSYSPLPCAAP 217
              +G+GTP     +++DTGSD++W+QC PC  T CY Q DP+FDP  SS+Y+P+PC   
Sbjct: 121 VVTVGLGTPAVSQVLLIDTGSDLSWVQCAPCNSTTCYPQKDPLFDPSRSSTYAPIPCNTD 180

Query: 218 QCKSLDV----SACRAN-----RCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGC 268
            C+ L      S C +      +C Y + YGDGS T G    ET++     +VK    GC
Sbjct: 181 ACRDLTRDGYGSDCTSGSGGGAQCGYAITYGDGSQTTGVYSNETLTMAPGVTVKDFHFGC 240

Query: 269 GHDNEGLFVGSAGLLGLGGGMLSL---TKQIKATSLAYCLVDRDSPASGVLEF----NSA 321
           GHD +G      GLLGLGG   SL   T  +   + +YCL   +  A G L      N A
Sbjct: 241 GHDQDGPNDKYDGLLGLGGAPESLVVQTSSVYGGAFSYCLPAANDQA-GFLALGAPVNDA 299

Query: 322 RGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAIT 381
            G   V  P++R ++  TFY V +TG +VGG+ + +PPS F       GG+I+D GT +T
Sbjct: 300 SG--FVFTPMVREQQ--TFYVVNMTGITVGGEPIDVPPSAFS------GGMIIDSGTVVT 349

Query: 382 RLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLP 441
            LQ  AY +L+ +F R A    P       DTCY+F+G  +V VP V+L F  G  +DL 
Sbjct: 350 ELQHTAYAALQAAF-RKAMAAYPLLPNGELDTCYNFTGHSNVTVPRVALTFSGGATVDLD 408

Query: 442 AKNYLIPVDSAGTFCFAF--APTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
             + ++ +D+    C AF  A   +   I+GNV Q+   V +D+ + RVGF  + C
Sbjct: 409 VPDGIL-LDN----CLAFQEAGPDNQPGILGNVNQRTLEVLYDVGHGRVGFGADAC 459


>gi|357492389|ref|XP_003616483.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355517818|gb|AES99441.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 434

 Score =  222 bits (565), Expect = 4e-55,   Method: Compositional matrix adjust.
 Identities = 154/441 (34%), Positives = 218/441 (49%), Gaps = 29/441 (6%)

Query: 74  LNSSSSFSLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVD--RHE 131
           +N+ S  +L L S   +    H       +  + RDS +            + VD  R  
Sbjct: 1   MNTLSFLTLSLFSLCFIASFSHALSNGFSVELIHRDSPKSPYYKPTENKYQHFVDAARRS 60

Query: 132 LKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCT 191
           +  A       D STP  S      G Y     VGTPP +   + DTGSDI WLQC PC 
Sbjct: 61  INRANHFFKDSDTSTPE-STVIPDRGGYLMTYSVGTPPTKIYGIADTGSDIVWLQCEPCE 119

Query: 192 ECYQQSDPIFDPKTSSSYSPLPCAAPQCKSL-DVSACRANRCLYQVAYGDGSFTVGDLVT 250
           +CY Q+ PIF+P  SSSY  +PC++  C S+ D S    N C Y+++YGD S + GDL  
Sbjct: 120 QCYNQTTPIFNPSKSSSYKNIPCSSKLCHSVRDTSCSDQNSCQYKISYGDSSHSQGDLSV 179

Query: 251 ETVSF----GNSGSVKGIALGCGHDNEGLFVG-SAGLLGLGGGMLSLTKQIKAT---SLA 302
           +T+S     G+  S   I +GCG DN G F G S+G++GLGGG +SL  Q+ ++     +
Sbjct: 180 DTLSLESTSGSPVSFPKIVIGCGTDNAGTFGGASSGIVGLGGGPVSLITQLGSSIGGKFS 239

Query: 303 YCLV---DRDSPASGVLEFNSA---RGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQ 356
           YCLV   +++S AS +L F  A    G   V+ PLI+   V  FY++ L  FSVG + V+
Sbjct: 240 YCLVPLLNKESNASSILSFGDAAVVSGDGVVSTPLIKKDPV--FYFLTLQAFSVGNKRVE 297

Query: 357 IPPSLFEMDEAGD--GGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTC 414
              S     E GD  G II+D GT +T + +  Y +L  + V L    +       F  C
Sbjct: 298 FGGS----SEGGDDEGNIIIDSGTTLTLIPSDVYTNLESAVVDLVKLDRVDDPNQQFSLC 353

Query: 415 YDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQ 474
           Y          P +++HF  G  ++L + +  +P+ + G  CFAF P+    SI GN+ Q
Sbjct: 354 YSLKS-NEYDFPIITVHF-KGADVELHSISTFVPI-TDGIVCFAFQPSPQLGSIFGNLAQ 410

Query: 475 QGTRVSFDLANNRVGFTPNKC 495
           Q   V +DL    V F P  C
Sbjct: 411 QNLLVGYDLQQKTVSFKPTDC 431


>gi|15231625|ref|NP_191467.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|15983376|gb|AAL11556.1|AF424562_1 AT3g59080/F17J16_130 [Arabidopsis thaliana]
 gi|7529751|emb|CAB86936.1| putative protein [Arabidopsis thaliana]
 gi|20466704|gb|AAM20669.1| putative protein [Arabidopsis thaliana]
 gi|23198236|gb|AAN15645.1| putative protein [Arabidopsis thaliana]
 gi|332646352|gb|AEE79873.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 535

 Score =  222 bits (565), Expect = 4e-55,   Method: Compositional matrix adjust.
 Identities = 148/437 (33%), Positives = 221/437 (50%), Gaps = 47/437 (10%)

Query: 102 VLSRLERDSARVNTLITK-LQLAIYNVDRHELKPAEAQILPEDFSTPVVS---------- 150
           VL    RD  R+ TL  + L+    N    + K  + +++    +TPV S          
Sbjct: 100 VLELQIRDLTRIQTLHKRVLEKNNQNTVSQKQKKNDKEVVT---TTPVASSVEEQAGQLV 156

Query: 151 -----GASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKT 205
                G + GSGEYF  + VG+PP+ FS++LDTGSD+NW+QC PC +C+QQ+   +DPK 
Sbjct: 157 ATLESGMTLGSGEYFMDVLVGSPPKHFSLILDTGSDLNWIQCLPCYDCFQQNGAFYDPKA 216

Query: 206 SSSYSPLPCAAPQCKSLDV----SACRANR--CLYQVAYGDGSFTVGDLVTETVSFG--- 256
           S+SY  + C   +C  +        C+++   C Y   YGD S T GD   ET +     
Sbjct: 217 SASYKNITCNDQRCNLVSSPDPPMPCKSDNQSCPYYYWYGDSSNTTGDFAVETFTVNLTT 276

Query: 257 NSGS-----VKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKA---TSLAYCLVDR 308
           N GS     V+ +  GCGH N GLF G+AGLLGLG G LS + Q+++    S +YCLVDR
Sbjct: 277 NGGSSELYNVENMMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDR 336

Query: 309 DSPASGVLEFNSARGGDAVTAPLI--------RNKKVDTFYYVGLTGFSVGGQAVQIPPS 360
           +S  +   +       D ++ P +        +   VDTFYYV +    V G+ + IP  
Sbjct: 337 NSDTNVSSKLIFGEDKDLLSHPNLNFTSFVAGKENLVDTFYYVQIKSILVAGEVLNIPEE 396

Query: 361 LFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPT-SGVALFDTCYDFSG 419
            + +   G GG I+D GT ++     AY  +++     A    P      + D C++ SG
Sbjct: 397 TWNISSDGAGGTIIDSGTTLSYFAEPAYEFIKNKIAEKAKGKYPVYRDFPILDPCFNVSG 456

Query: 420 LRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPT-SSALSIIGNVQQQGTR 478
           + +V++P + + F  G   + P +N  I ++     C A   T  SA SIIGN QQQ   
Sbjct: 457 IHNVQLPELGIAFADGAVWNFPTENSFIWLNE-DLVCLAMLGTPKSAFSIIGNYQQQNFH 515

Query: 479 VSFDLANNRVGFTPNKC 495
           + +D   +R+G+ P KC
Sbjct: 516 ILYDTKRSRLGYAPTKC 532


>gi|242066176|ref|XP_002454377.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
 gi|241934208|gb|EES07353.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
          Length = 474

 Score =  221 bits (563), Expect = 7e-55,   Method: Compositional matrix adjust.
 Identities = 144/366 (39%), Positives = 198/366 (54%), Gaps = 32/366 (8%)

Query: 145 STPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTE--CYQQSDPIFD 202
           + P   G + G+  Y   + +GTP    ++ +DTGSD++W+QC PC    CY Q DP+FD
Sbjct: 126 TVPANWGFNIGTLNYVVTVSLGTPGVAQTLEVDTGSDLSWVQCTPCAAPACYSQKDPLFD 185

Query: 203 PKTSSSYSPLPCAAPQCKSLDV--SACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGS 260
           P  SSSY+ +PC  P C  L +  S+C A +C Y V+YGDGS T G   ++T++   + +
Sbjct: 186 PAQSSSYAAVPCGGPVCGGLGIYASSCSAAQCGYVVSYGDGSKTTGVYSSDTLTLSPNDA 245

Query: 261 VKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS---LAYCLVDRDSPASGVLE 317
           V+G   GCGH   G F G+ GLLGLG    SL +Q   T     +YCL  R S  +G L 
Sbjct: 246 VRGFFFGCGHAQSG-FTGNDGLLGLGREEASLVEQTAGTYGGVFSYCLPTRPS-TTGYLT 303

Query: 318 FNSARGGDA---VTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIV 374
                G       T  L+ +    T+Y V LTG SVGGQ + +P S+F       GG +V
Sbjct: 304 LGGPSGAAPPGFSTTQLLSSPNAATYYVVMLTGISVGGQQLSVPSSVFA------GGTVV 357

Query: 375 DCGTAITRLQTQAYNSLRDSFVR-LAGNLKPTS-GVALFDTCYDFSGLRSVRVPTVSLHF 432
           D GT ITRL   AY +LR +F   +A    P++    + DTCY+FSG  +V +P V+L F
Sbjct: 358 DTGTVITRLPPTAYAALRSAFRSGMASYGYPSAPATGILDTCYNFSGYGTVTLPNVALTF 417

Query: 433 GAGKALDLPAKNYLIPVDSAGTF-CFAFAPTSS--ALSIIGNVQQQGTRVSFDLANNRVG 489
             G  + L A   L       +F C AFAP+ S   ++I+GNVQQ+   V  D     VG
Sbjct: 418 SGGATVTLGADGIL-------SFGCLAFAPSGSDGGMAILGNVQQRSFEVRID--GTSVG 468

Query: 490 FTPNKC 495
           F P+ C
Sbjct: 469 FKPSSC 474


>gi|217074470|gb|ACJ85595.1| unknown [Medicago truncatula]
 gi|388505166|gb|AFK40649.1| unknown [Medicago truncatula]
          Length = 452

 Score =  221 bits (562), Expect = 8e-55,   Method: Compositional matrix adjust.
 Identities = 133/425 (31%), Positives = 229/425 (53%), Gaps = 36/425 (8%)

Query: 92  KTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSG 151
           K+  N    L      +D  R+    ++L          +   +  ++ P+    P+ SG
Sbjct: 42  KSPPNSTSLLFAYMFAKDEERIRYFHSRL------AKNSDANASFKKVGPKLAGIPLKSG 95

Query: 152 ASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCT-ECYQQSDPIFDPKTSSSYS 210
            S GSG Y+ ++G+G+P + ++M++DTGS  +WLQC+PCT  C+ Q DP+F+P  S +Y 
Sbjct: 96  LSMGSGNYYVKMGLGSPTKYYTMIVDTGSSFSWLQCQPCTIYCHIQEDPVFNPSASKTYK 155

Query: 211 PLPCAAPQCK-----SLDVSAC--RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKG 263
            +PC++ QC      +L+   C  ++N C+Y+ +YGD SF++G L  + ++   S ++  
Sbjct: 156 TVPCSSSQCSSLKSATLNEPTCSKQSNACVYKASYGDSSFSLGYLSQDVLTLTPSQTLSS 215

Query: 264 IALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIK---ATSLAYCLVDR----DSPASGVL 316
              GCG DN+GLF  + G++GL    LS+  Q+      + +YCL       +SP  G L
Sbjct: 216 FVYGCGQDNQGLFGRTDGIIGLANNELSMLSQLSGKYGNAFSYCLPTSFSTPNSPKEGFL 275

Query: 317 EFNSARGGDAVT---APLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGII 373
              ++    + +    PL++N    + Y++ L   +V G+ + +  S +++        I
Sbjct: 276 SIGTSSLTPSSSYKFTPLLKNPNNPSLYFIDLESITVAGRPLGVAASSYKVPT------I 329

Query: 374 VDCGTAITRLQTQAYNSLRDSFVR-LAGNLKPTSGVALFDTCYD--FSGLRSVRVPTVSL 430
           +D GT ITRL T  Y +L++++V  L+   +   G++L DTC+    +G+  V  P + +
Sbjct: 330 IDSGTVITRLPTPVYTTLKNAYVTILSKKYQQAPGISLLDTCFKGSLAGISEV-APDIRI 388

Query: 431 HFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGF 490
            F  G  L L   N L+ +++ G  C A A  SS+++IIGN QQQ  +V++D+ N+RVGF
Sbjct: 389 IFKGGADLQLKGHNSLVELET-GITCLAMA-GSSSIAIIGNYQQQTVKVAYDVGNSRVGF 446

Query: 491 TPNKC 495
            P  C
Sbjct: 447 APGGC 451


>gi|356567196|ref|XP_003551807.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 490

 Score =  221 bits (562), Expect = 8e-55,   Method: Compositional matrix adjust.
 Identities = 150/424 (35%), Positives = 215/424 (50%), Gaps = 43/424 (10%)

Query: 86  SREILHK----TRHNDYRSLVLSR------LERDSARVNTLITKLQLAI-YNVDRHELKP 134
           S E++HK    ++ ND+     S       L +D  RV  + ++L   +  +    EL  
Sbjct: 71  SLEVVHKHGPCSQLNDHDGKAKSTTPHSDILNQDKERVKYINSRLSKNLGQDSSVEELDS 130

Query: 135 AEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTE-C 193
           A         + P  SG+  GSG YF  +G+GTP R  S++ DTGSD+ W QC PC   C
Sbjct: 131 A---------TLPAKSGSLIGSGNYFVVVGLGTPKRDLSLIFDTGSDLTWTQCEPCARSC 181

Query: 194 YQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVS-----ACRANR--CLYQVAYGDGSFTVG 246
           Y+Q D IFDP  S+SYS + C +  C  L  +      C A+   C+Y + YGD SF+VG
Sbjct: 182 YKQQDVIFDPSKSTSYSNITCTSALCTQLSTATGNDPGCSASTKACIYGIQYGDSSFSVG 241

Query: 247 DLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS---LAY 303
               E ++   +  V     GCG +N+GLF GSAGL+GLG   +S  +Q  A      +Y
Sbjct: 242 YFSRERLTVTATDVVDNFLFGCGQNNQGLFGGSAGLIGLGRHPISFVQQTAAKYRKIFSY 301

Query: 304 CLVDRDSPASGVLEFNSARGGDAVT-APLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLF 362
           CL    S ++G L F  A  G  +   P     +  +FY + +T  +VGG  + +  S F
Sbjct: 302 CLPSTSS-STGHLSFGPAATGRYLKYTPFSTISRGSSFYGLDITAIAVGGVKLPVSSSTF 360

Query: 363 EMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSG-VALFDTCYDFSGLR 421
                  GG I+D GT ITRL   AY +LR +F R   +  P++G +++ DTCYD SG +
Sbjct: 361 ST-----GGAIIDSGTVITRLPPTAYGALRSAF-RQGMSKYPSAGELSILDTCYDLSGYK 414

Query: 422 SVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPT--SSALSIIGNVQQQGTRV 479
              +PT+   F  G  + LP +  L  V S    C AFA     S ++I GNVQQ+   V
Sbjct: 415 VFSIPTIEFSFAGGVTVKLPPQGILF-VASTKQVCLAFAANGDDSDVTIYGNVQQRTIEV 473

Query: 480 SFDL 483
            +D+
Sbjct: 474 VYDV 477


>gi|255548662|ref|XP_002515387.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223545331|gb|EEF46836.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 463

 Score =  221 bits (562), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 146/435 (33%), Positives = 221/435 (50%), Gaps = 63/435 (14%)

Query: 86  SREILHK-------TRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQ 138
           S +++HK        + N     ++  L  D +RV+++  KL       D   +K  +A 
Sbjct: 66  SLKVVHKHGPCSQLNQQNGNAPNLVEILLEDQSRVDSIHAKLS------DHSGVKETDAA 119

Query: 139 ILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD 198
            LP        SG S G+G Y   IG+G+P +   ++ DTGSD+ W +C         + 
Sbjct: 120 KLPTK------SGMSLGTGNYIVSIGLGSPKKDLMLIFDTGSDLTWARC--------SAA 165

Query: 199 PIFDPKTSSSYSPLPCAAPQCKSL-----DVSACRANRCLYQVAYGDGSFTVGDLVTETV 253
             FDP  S+SY+ + C+ P C S+     + S C A+ C+Y + YGDGS+++G L  E +
Sbjct: 166 ETFDPTKSTSYANVSCSTPLCSSVISATGNPSRCAASTCVYGIQYGDGSYSIGFLGKERL 225

Query: 254 SFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQI--KATSL-AYCLVDRDS 310
           + G++        GCG D +GLF  +AGLLGLG   LS+  Q   K   L +YCL    S
Sbjct: 226 TIGSTDIFNNFYFGCGQDVDGLFGKAAGLLGLGRDKLSVVSQTAPKYNQLFSYCL--PSS 283

Query: 311 PASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDG 370
            ++G L F S++   A   PL  +    +FY + LTG +VGGQ + IP S+F        
Sbjct: 284 SSTGFLSFGSSQSKSAKFTPL--SSGPSSFYNLDLTGITVGGQKLAIPLSVFST-----A 336

Query: 371 GIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSL 430
           G I+D GT +TRL   AY++LR +F +   +      +++ DTCYDFS  ++++VP + +
Sbjct: 337 GTIIDSGTVVTRLPPAAYSALRSAFRKAMASYPMGKPLSILDTCYDFSKYKTIKVPKIVI 396

Query: 431 HFGAGKALDLPAKNYLIPVDSAGTF--------CFAFAPTSSA--LSIIGNVQQQGTRVS 480
            F  G  +D         VD AG F        C AFA  + A   +I GN QQ+   V 
Sbjct: 397 SFSGGVDVD---------VDQAGIFVANGLKQVCLAFAGNTGARDTAIFGNTQQRNFEVV 447

Query: 481 FDLANNRVGFTPNKC 495
           +D++  +VGF P  C
Sbjct: 448 YDVSGGKVGFAPASC 462


>gi|168005153|ref|XP_001755275.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162693403|gb|EDQ79755.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 429

 Score =  220 bits (561), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 151/407 (37%), Positives = 213/407 (52%), Gaps = 17/407 (4%)

Query: 98  YRSLVLSRLERDSA-RVNTLITKLQLAIYNVDR-HELKPAEAQ-ILPED--FSTPVVSGA 152
           +R+ ++ R  + S  R  TL T  ++ I  V R HE +   A+ +L  D  F TPV SG 
Sbjct: 28  FRAELIYREHQSSPLRSETLKTPSEIFIAAVKRGHERRARLAKHVLAGDQLFETPVASG- 86

Query: 153 SQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPL 212
              +GEY   I  G PP++ + ++DTGSD+NW+QC PC  CY+     FDP  S+SY  L
Sbjct: 87  ---NGEYLIDISYGNPPQKSTAIVDTGSDLNWVQCLPCKSCYETLSAKFDPSKSASYKTL 143

Query: 213 PCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDN 272
            C +  C+ L   +C A+ C Y   YGDGS T G L T+ V+ G +G +  +A GCG+ N
Sbjct: 144 GCGSNFCQDLPFQSCAAS-CQYDYMYGDGSSTSGALSTDDVTIG-TGKIPNVAFGCGNSN 201

Query: 273 EGLFVGSAGLLGLGGGMLSLTKQIKATS---LAYCLVDRDSPASGVLEF-NSARGGDAVT 328
            G F G+ GL+GLG G LSL  Q+  T+    +YCLV   S  +  L   +S   G    
Sbjct: 202 LGTFAGAGGLVGLGKGPLSLVSQLGGTATKKFSYCLVPLGSTKTSPLYIGDSTLAGGVAY 261

Query: 329 APLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAY 388
            P++ N    TFYY  L G SV G+AV  P + F++   G GG+I+D GT +T L   A+
Sbjct: 262 TPMLTNNNYPTFYYAELQGISVEGKAVNYPANTFDIAATGRGGLILDSGTTLTYLDVDAF 321

Query: 389 NSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIP 448
           N +  +        +        + C+  +G+ +   PTV  HF  G  + L   N  I 
Sbjct: 322 NPMVAALKAALPYPEADGSFYGLEYCFSTAGVANPTYPTVVFHFN-GADVALAPDNTFIA 380

Query: 449 VDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           +D  GT C A A +S+  SI GN+QQ    +  DL N R+GF    C
Sbjct: 381 LDFEGTTCLAMA-SSTGFSIFGNIQQLNHVIVHDLVNKRIGFKSANC 426


>gi|219886223|gb|ACL53486.1| unknown [Zea mays]
 gi|238015146|gb|ACR38608.1| unknown [Zea mays]
 gi|413938611|gb|AFW73162.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
 gi|413938612|gb|AFW73163.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
 gi|413938613|gb|AFW73164.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
 gi|413938614|gb|AFW73165.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
          Length = 467

 Score =  220 bits (560), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 154/364 (42%), Positives = 212/364 (58%), Gaps = 25/364 (6%)

Query: 145 STPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPC-TECYQQSDPIFDP 203
           S P+  G S G G Y +R+G+GTP + + MV+DTGS + WLQC PC   C++QS P+F+P
Sbjct: 115 SVPLGPGTSVGVGNYVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVSCHRQSGPVFNP 174

Query: 204 KTSSSYSPLPCAAPQCK-----SLDVSACR-ANRCLYQVAYGDGSFTVGDLVTETVSFGN 257
           K SSSY+ + C+A QC      +L+ ++C  +N C+YQ +YGD SF+VG L  +TVSFG+
Sbjct: 175 KASSSYTSVSCSAQQCSDLTTATLNPASCSTSNVCIYQASYGDSSFSVGYLSKDTVSFGS 234

Query: 258 SGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASG 314
           + SV     GCG DNEGLF  SAGL+GL    LSL  Q+  +   S +YCL    S +SG
Sbjct: 235 T-SVPNFYYGCGQDNEGLFGQSAGLIGLARNKLSLLYQLAPSMGYSFSYCLPTSSSSSSG 293

Query: 315 VLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIV 374
            L   S   G     P+  +   D+ Y++ +TG  V G+     P             I+
Sbjct: 294 YLSIGSYNPGQYSYTPMASSSLDDSLYFIKMTGIKVAGK-----PLSVSSSAYSSLPTII 348

Query: 375 DCGTAITRLQTQAYNSLRDSFVRLAGNLKPT---SGVALFDTCYDFSGLRSVRVPTVSLH 431
           D GT ITRL T  Y++L  +   +AG +K T   S  ++ DTC+     R +RVP V++ 
Sbjct: 349 DSGTVITRLPTGVYSALSKA---VAGAMKGTPRASAFSILDTCFQGQAAR-LRVPEVTMA 404

Query: 432 FGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFT 491
           F  G AL L A+N L+ VDSA T C AFAP  SA +IIGN QQQ   V +D+ N+++GF 
Sbjct: 405 FAGGAALKLAARNLLVDVDSATT-CLAFAPARSA-AIIGNTQQQTFSVVYDVKNSKIGFA 462

Query: 492 PNKC 495
              C
Sbjct: 463 AGGC 466


>gi|212275300|ref|NP_001130675.1| uncharacterized protein LOC100191778 precursor [Zea mays]
 gi|194706308|gb|ACF87238.1| unknown [Zea mays]
          Length = 467

 Score =  220 bits (560), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 154/364 (42%), Positives = 208/364 (57%), Gaps = 25/364 (6%)

Query: 145 STPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPC-TECYQQSDPIFDP 203
           S P+  G S G G Y +R+G+GTP + + MV+DTGS + WLQC PC   C++QS P+F+P
Sbjct: 115 SVPLGPGTSVGVGNYVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVSCHRQSGPVFNP 174

Query: 204 KTSSSYSPLPCAAPQCKSLD------VSACRANRCLYQVAYGDGSFTVGDLVTETVSFGN 257
           K SSSY+ + C+A QC  L        S   +N C+YQ +YGD SF+VG L  +TVSFG+
Sbjct: 175 KASSSYTSVSCSAQQCSDLTTATLSPASCSTSNVCIYQASYGDSSFSVGYLSKDTVSFGS 234

Query: 258 SGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASG 314
           + SV     GCG DNEGLF  SAGL+GL    LSL  Q+  +   S +YCL    S +SG
Sbjct: 235 T-SVPNFYYGCGQDNEGLFGQSAGLIGLARNKLSLLYQLAPSMGYSFSYCLPTSSSSSSG 293

Query: 315 VLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIV 374
            L   S   G     P+  +   D+ Y++ +TG  V G+     P             I+
Sbjct: 294 YLSIGSYNPGQYSYTPMASSSLDDSLYFIKMTGIKVAGK-----PLSVSSSAYSSLPTII 348

Query: 375 DCGTAITRLQTQAYNSLRDSFVRLAGNLKPT---SGVALFDTCYDFSGLRSVRVPTVSLH 431
           D GT ITRL T  Y++L  +   +AG +K T   S  ++ DTC+     R +RVP V++ 
Sbjct: 349 DSGTVITRLPTGVYSALSKA---VAGAMKGTPRASAFSILDTCFQGQAAR-LRVPEVTMA 404

Query: 432 FGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFT 491
           F  G AL L A+N L+ VDSA T C AFAP  SA +IIGN QQQ   V +D+ N+++GF 
Sbjct: 405 FAGGAALKLAARNLLVDVDSATT-CLAFAPARSA-AIIGNTQQQTFSVVYDVKNSKIGFA 462

Query: 492 PNKC 495
              C
Sbjct: 463 AGGC 466


>gi|226530755|ref|NP_001150335.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
 gi|195638492|gb|ACG38714.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
          Length = 461

 Score =  220 bits (560), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 152/448 (33%), Positives = 228/448 (50%), Gaps = 34/448 (7%)

Query: 59  EPFAEESETAAESFPLNSSSSFSLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLIT 118
            P  +   + +++ P +S+ + ++PLH R              +   L RD  R   +  
Sbjct: 37  SPRTDSVCSQSKAVPSSSAGAATVPLHHRHGPCSPLPTKKMPTLEETLHRDQLRAAYIQR 96

Query: 119 KLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDT 178
           K               A   +   D + P   G S  + EY   +G+G+P    +M++DT
Sbjct: 97  KFSGGGG---------AGGDVQRSDATVPTALGTSLNTLEYLITVGLGSPATSQTMLIDT 147

Query: 179 GSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSL--DVSAC-RANRCLYQ 235
           GSD++W+QC+PC++C+ Q+DP+FDP +SS+YSP  C +  C  L  + + C  +++C Y 
Sbjct: 148 GSDVSWVQCKPCSQCHSQADPLFDPSSSSTYSPFSCGSAACAQLGQEGNGCSSSSQCQYI 207

Query: 236 VAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQ 295
           V YGDGS T G   ++T++ G+S +VK    GC +   G    + GL+GLGGG  SL  Q
Sbjct: 208 VTYGDGSSTTGTYSSDTLALGSS-AVKSFQFGCSNVESGFNDQTDGLMGLGGGAQSLVSQ 266

Query: 296 IKAT---SLAYCLVDRDSPASGVLEFNSARGGDA---VTAPLIRNKKVDTFYYVGLTGFS 349
              T   + +YCL    S +SG L   +A G      V  P++R+ +V TFY V L    
Sbjct: 267 TAGTLGRAFSYCLPPTPS-SSGFLTLGAAGGSGTSGFVKTPMLRSSQVPTFYGVRLQAIR 325

Query: 350 VGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVA 409
           VGG+ + IP S+F        G ++D GT ITRL   AY++L  +F        P     
Sbjct: 326 VGGRQLSIPASVFS------AGTVMDSGTVITRLPPTAYSALSSAFKAGMKQYPPAQPSG 379

Query: 410 LFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTS--SALS 467
           + DTC+DFSG  SV +P+V+L F  G  + L A   ++      + C AFA  S  S+L 
Sbjct: 380 ILDTCFDFSGQSSVSIPSVALVFSGGAVVSLDASGIIL------SNCLAFAANSDDSSLG 433

Query: 468 IIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           IIGNVQQ+   V +D+    VGF    C
Sbjct: 434 IIGNVQQRTFEVLYDVGRGVVGFRAGAC 461


>gi|388504358|gb|AFK40245.1| unknown [Medicago truncatula]
          Length = 480

 Score =  220 bits (560), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 142/421 (33%), Positives = 220/421 (52%), Gaps = 38/421 (9%)

Query: 94  RHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGAS 153
           R  ++   +  +L  D  RV ++  +++  +   +  E + +E QI       P+ SG +
Sbjct: 76  RKINWNRKLQKQLIFDDLRVRSMQNRIRAKVSGHNSSE-QSSEIQI-------PLASGIN 127

Query: 154 QGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLP 213
             +  Y   IG+G   +  ++++DTGSD+ W+QC PC  CY Q  P+F+P  SSSY+ L 
Sbjct: 128 LETLNYIVTIGLGN--QNMTVIIDTGSDLTWVQCDPCMSCYSQQGPVFNPSNSSSYNSLL 185

Query: 214 CAAPQCKSL-----DVSACRANR---CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIA 265
           C +  C++L     +  AC +N    C + V+YGDGSFT G+L  E +SFG   SV    
Sbjct: 186 CNSSTCQNLQFTTGNTEACESNNPSSCNHTVSYGDGSFTDGELGVEHLSFGGI-SVSNFV 244

Query: 266 LGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASGVLEFNSAR 322
            GCG +N+GLF G +G++GLG   LS+  Q   T     +YCL   DS ASG L   +  
Sbjct: 245 FGCGRNNKGLFGGVSGIMGLGRSNLSMISQTNTTFGGVFSYCLPTTDSGASGSLVIGNES 304

Query: 323 GGDAVTAP-----LIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEA-GDGGIIVDC 376
                  P     ++ N ++  FY + LTG  VGG A+Q        D + G+GGI++D 
Sbjct: 305 SLFKNLTPIAYTSMVSNPQLSNFYVLNLTGIDVGGVAIQ--------DTSFGNGGILIDS 356

Query: 377 GTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGK 436
           GT ITRL    YN+L+  F++          +++ DTC++ +G+  V +PT+S+HF    
Sbjct: 357 GTVITRLAPSLYNALKAEFLKQFSGYPIAPALSILDTCFNLTGIEEVSIPTLSMHFENNV 416

Query: 437 ALDLPAKNYLIPVDSAGTFCFAFAPTS--SALSIIGNVQQQGTRVSFDLANNRVGFTPNK 494
            L++ A   L         C A A  S  + ++IIGN QQ+  RV +D   +++GF    
Sbjct: 417 DLNVDAVGILYMPKDGSQVCLALASLSDENDMAIIGNYQQRNQRVIYDAKQSKIGFARED 476

Query: 495 C 495
           C
Sbjct: 477 C 477


>gi|223975883|gb|ACN32129.1| unknown [Zea mays]
 gi|223975971|gb|ACN32173.1| unknown [Zea mays]
 gi|224034191|gb|ACN36171.1| unknown [Zea mays]
 gi|413938623|gb|AFW73174.1| aspartic proteinase nepenthesin-1 isoform 1 [Zea mays]
 gi|413938624|gb|AFW73175.1| aspartic proteinase nepenthesin-1 isoform 2 [Zea mays]
          Length = 465

 Score =  219 bits (559), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 164/408 (40%), Positives = 227/408 (55%), Gaps = 30/408 (7%)

Query: 106 LERDSARVNTLITKLQLAIYNVDR--HELKPAEAQILPED---FSTPVVSGASQGSGEYF 160
           L  D AR+ +L  +L     +      E +   +   P+D    S P+  G S G G Y 
Sbjct: 69  LAHDGARIASLAARLAKTPSSRPTLLDESRAGSSSSSPDDESLASVPLGPGTSVGVGNYV 128

Query: 161 SRIGVGTPPRQFSMVLDTGSDINWLQCRPC-TECYQQSDPIFDPKTSSSYSPLPCAAPQC 219
           +R+G+GTP + + MV+DTGS + WLQC PC   C++QS P+F+PK SSSY+ + C+A QC
Sbjct: 129 TRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVSCHRQSGPVFNPKASSSYASVSCSAQQC 188

Query: 220 K-----SLDVSACR-ANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNE 273
                 +L+ ++C  +N C+YQ +YGD SF+VG L  +TVSFG S SV     GCG DNE
Sbjct: 189 SDLTTATLNPASCSTSNVCIYQASYGDSSFSVGYLSKDTVSFG-STSVPNFYYGCGQDNE 247

Query: 274 GLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASGVLEFNSARGGDAVTAP 330
           GLF  SAGL+GL    LSL  Q+  +   S +YCL    S +SG L   S   G     P
Sbjct: 248 GLFGQSAGLIGLARNKLSLLYQLAPSMGYSFSYCLPTSSSSSSGYLSIGSYNPGQYSYTP 307

Query: 331 LIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNS 390
           +  +   D+ Y++ +TG  V G+     P             I+D GT ITRL T  Y++
Sbjct: 308 MASSSLDDSLYFIKMTGIKVAGK-----PLSVSSSAYSSLPTIIDSGTVITRLPTGVYSA 362

Query: 391 LRDSFVRLAGNLKPT---SGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLI 447
           L  +   +AG +K T   S  ++ DTC+     R +RVP V++ F  G AL L A+N L+
Sbjct: 363 LSKA---VAGAMKGTPRASAFSILDTCFQGQAAR-LRVPEVTMAFAGGAALKLAARNLLV 418

Query: 448 PVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            VDSA T C AFAP  SA +IIGN QQQ   V +D+ N+++GF    C
Sbjct: 419 DVDSATT-CLAFAPARSA-AIIGNTQQQTFSVVYDVKNSKIGFAAGGC 464


>gi|222635451|gb|EEE65583.1| hypothetical protein OsJ_21095 [Oryza sativa Japonica Group]
          Length = 441

 Score =  219 bits (559), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 160/440 (36%), Positives = 213/440 (48%), Gaps = 41/440 (9%)

Query: 81  SLPL-HSREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQI 139
           S+PL H       +  +  +  +  RL RD AR N ++TK               A +  
Sbjct: 18  SVPLVHRHGPCAPSAASGGKPSLAERLRRDRARTNYIVTKA------TGGRTAATALSDA 71

Query: 140 LPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPC--TECYQQS 197
                S P   G S  S EY   +G+GTP  Q ++++DTGSD++W+QC+PC   ECY Q 
Sbjct: 72  AGGGTSIPTFLGDSVNSLEYVVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCGAGECYAQK 131

Query: 198 DPIFDPKTSSSYSPLPCAAPQCKSL----------DVSACRANRCLYQVAYGDGSFTVGD 247
           DP+FDP +SSSY+ +PC +  C+ L           VS   A  C Y + YG+ + T G 
Sbjct: 132 DPLFDPSSSSSYASVPCDSDACRKLAAGAYGHGCTGVSGGAAALCEYGIEYGNRATTTGV 191

Query: 248 LVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYC 304
             TET++      V     GCG    G +    GLLGLGG   SL  Q  +      +YC
Sbjct: 192 YSTETLTLKPGVVVADFGFGCGDHQHGPYEKFDGLLGLGGAPESLVSQTSSQFGGPFSYC 251

Query: 305 LVDRDSPASGVLEFNSARGGDAVTA-------PLIRNKKVDTFYYVGLTGFSVGGQAVQI 357
           L    S  +G L   +     + TA       P+ R   V TFY V LTG SVGG  + I
Sbjct: 252 L-PPTSGGAGFLTLGAPPNSSSSTAASGLSFTPMRRLPSVPTFYIVTLTGISVGGAPLAI 310

Query: 358 PPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGN--LKPTSGVALFDTCY 415
           PPS F        G+++D GT IT L   AY +LR +F        L P S   + DTCY
Sbjct: 311 PPSAFSS------GMVIDSGTVITGLPATAYAALRSAFRSAMSEYRLLPPSNGGVLDTCY 364

Query: 416 DFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQ 475
           DF+G  +V VPT+SL F  G  +DL A   ++ VD  G   FA A T +A+ IIGNV Q+
Sbjct: 365 DFTGHANVTVPTISLTFSGGATIDLAAPAGVL-VD--GCLAFAGAGTDNAIGIIGNVNQR 421

Query: 476 GTRVSFDLANNRVGFTPNKC 495
              V +D     VGF    C
Sbjct: 422 TFEVLYDSGKGTVGFRAGAC 441


>gi|15228044|ref|NP_181826.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|20197868|gb|AAM15292.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|20197965|gb|AAD21712.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|330255100|gb|AEC10194.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 527

 Score =  219 bits (559), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 138/387 (35%), Positives = 204/387 (52%), Gaps = 30/387 (7%)

Query: 138 QILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQS 197
           ++ P      + SG + GSGEYF  + VGTPP+ FS++LDTGSD+NWLQC PC +C+ Q+
Sbjct: 139 EVSPGKLIATLESGMTLGSGEYFMDVLVGTPPKHFSLILDTGSDLNWLQCLPCYDCFHQN 198

Query: 198 DPIFDPKTSSSYSPLPCAAPQCKSLDVS----ACRANR--CLYQVAYGDGSFTVGDLVTE 251
              +DPKTS+S+  + C  P+C  +        C ++   C Y   YGD S T GD   E
Sbjct: 199 GMFYDPKTSASFKNITCNDPRCSLISSPDPPVQCESDNQSCPYFYWYGDRSNTTGDFAVE 258

Query: 252 TVSF------GNSGSVK--GIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKA---TS 300
           T +       G S   K   +  GCGH N GLF G++GLLGLG G LS + Q+++    S
Sbjct: 259 TFTVNLTTTEGGSSEYKVGNMMFGCGHWNRGLFSGASGLLGLGRGPLSFSSQLQSLYGHS 318

Query: 301 LAYCLVDRDSPASGVLEFNSARGGDAVTAPLI--------RNKKVDTFYYVGLTGFSVGG 352
            +YCLVDR+S  +   +       D +    +        +   V+TFYY+ +    VGG
Sbjct: 319 FSYCLVDRNSNTNVSSKLIFGEDKDLLNHTNLNFTSFVNGKENSVETFYYIQIKSILVGG 378

Query: 353 QAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFV-RLAGNLKPTSGVALF 411
           +A+ IP   + +   GDGG I+D GT ++     AY  +++ F  ++  N        + 
Sbjct: 379 KALDIPEETWNISSDGDGGTIIDSGTTLSYFAEPAYEIIKNKFAEKMKENYPIFRDFPVL 438

Query: 412 DTCYDFSGLR--SVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPT-SSALSI 468
           D C++ SG+   ++ +P + + F  G   + PA+N  I + S    C A   T  S  SI
Sbjct: 439 DPCFNVSGIEENNIHLPELGIAFVDGTVWNFPAENSFIWL-SEDLVCLAILGTPKSTFSI 497

Query: 469 IGNVQQQGTRVSFDLANNRVGFTPNKC 495
           IGN QQQ   + +D   +R+GFTP KC
Sbjct: 498 IGNYQQQNFHILYDTKRSRLGFTPTKC 524


>gi|388515789|gb|AFK45956.1| unknown [Medicago truncatula]
          Length = 225

 Score =  219 bits (559), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 119/227 (52%), Positives = 155/227 (68%), Gaps = 8/227 (3%)

Query: 275 LFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASGVLEFNSAR---GGDAVT 328
           +FVG+AGLLGLG G +S   Q+      + +YCLV R + +SG LEF       G   V+
Sbjct: 1   MFVGAAGLLGLGSGPMSFVGQLGGQAGGTFSYCLVSRGTESSGSLEFGRESVPVGASWVS 60

Query: 329 APLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAY 388
             LI N +  +FYY+GL+G  VGG  V I   +F ++E G+GG+++D GTA+TRL   AY
Sbjct: 61  --LIHNPRAPSFYYIGLSGLGVGGLRVPISEDIFRLNELGEGGVVMDTGTAVTRLPAAAY 118

Query: 389 NSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIP 448
           N+ RD+FV    NL  TSGV++FDTCYD +G  +VRVPT+S +F  G  L LPA+N+LIP
Sbjct: 119 NAFRDAFVAQTTNLPKTSGVSIFDTCYDLNGFVTVRVPTISFYFLGGPILTLPARNFLIP 178

Query: 449 VDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           VDS GTFCFAFAP+SS LSIIGN+QQ+G  +S D AN  +GF PN C
Sbjct: 179 VDSVGTFCFAFAPSSSGLSIIGNIQQEGIEISVDGANGYIGFGPNIC 225


>gi|195638734|gb|ACG38835.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
          Length = 465

 Score =  219 bits (559), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 164/408 (40%), Positives = 227/408 (55%), Gaps = 30/408 (7%)

Query: 106 LERDSARVNTLITKLQLAIYNVDR--HELKPAEAQILPED---FSTPVVSGASQGSGEYF 160
           L  D AR+ +L  +L     +      E +   +   P+D    S P+  G S G G Y 
Sbjct: 69  LAHDGARIASLAARLAKTPSSRPTLLDESRAGSSSSSPDDESLASVPLGPGTSVGVGNYV 128

Query: 161 SRIGVGTPPRQFSMVLDTGSDINWLQCRPC-TECYQQSDPIFDPKTSSSYSPLPCAAPQC 219
           +R+G+GTP + + MV+DTGS + WLQC PC   C++QS P+F+PK SSSY+ + C+A QC
Sbjct: 129 TRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVSCHRQSGPVFNPKASSSYASVSCSAQQC 188

Query: 220 K-----SLDVSACR-ANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNE 273
                 +L+ ++C  +N C+YQ +YGD SF+VG L  +TVSFG S SV     GCG DNE
Sbjct: 189 SDLTTATLNPASCSTSNVCIYQASYGDSSFSVGYLSKDTVSFG-STSVPNFYYGCGQDNE 247

Query: 274 GLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASGVLEFNSARGGDAVTAP 330
           GLF  SAGL+GL    LSL  Q+  +   S +YCL    S +SG L   S   G     P
Sbjct: 248 GLFGQSAGLIGLARNKLSLLYQLAPSMGYSFSYCLPTSSSSSSGYLSIGSYNPGQYSYTP 307

Query: 331 LIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNS 390
           +  +   D+ Y++ +TG  V G+     P             I+D GT ITRL T  Y++
Sbjct: 308 MASSSLDDSLYFIKMTGIKVAGK-----PLSVSSSAYSSLPTIIDSGTVITRLPTGVYSA 362

Query: 391 LRDSFVRLAGNLKPT---SGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLI 447
           L  +   +AG +K T   S  ++ DTC+     R +RVP V++ F  G AL L A+N L+
Sbjct: 363 LSKA---VAGAMKGTPRASAFSILDTCFQGQAAR-LRVPEVTMAFAGGAALKLAARNLLV 418

Query: 448 PVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            VDSA T C AFAP  SA +IIGN QQQ   V +D+ N+++GF    C
Sbjct: 419 DVDSATT-CLAFAPARSA-AIIGNTQQQTFSVVYDVKNSKIGFAAAGC 464


>gi|54290731|dbj|BAD62401.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
          Length = 521

 Score =  219 bits (559), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 160/440 (36%), Positives = 213/440 (48%), Gaps = 41/440 (9%)

Query: 81  SLPL-HSREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQI 139
           S+PL H       +  +  +  +  RL RD AR N ++TK               A +  
Sbjct: 98  SVPLVHRHGPCAPSAASGGKPSLAERLRRDRARTNYIVTKA------TGGRTAATALSDA 151

Query: 140 LPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPC--TECYQQS 197
                S P   G S  S EY   +G+GTP  Q ++++DTGSD++W+QC+PC   ECY Q 
Sbjct: 152 AGGGTSIPTFLGDSVNSLEYVVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCGAGECYAQK 211

Query: 198 DPIFDPKTSSSYSPLPCAAPQCKSL----------DVSACRANRCLYQVAYGDGSFTVGD 247
           DP+FDP +SSSY+ +PC +  C+ L           VS   A  C Y + YG+ + T G 
Sbjct: 212 DPLFDPSSSSSYASVPCDSDACRKLAAGAYGHGCTGVSGGAAALCEYGIEYGNRATTTGV 271

Query: 248 LVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYC 304
             TET++      V     GCG    G +    GLLGLGG   SL  Q  +      +YC
Sbjct: 272 YSTETLTLKPGVVVADFGFGCGDHQHGPYEKFDGLLGLGGAPESLVSQTSSQFGGPFSYC 331

Query: 305 LVDRDSPASGVLEFNSARGGDAVTA-------PLIRNKKVDTFYYVGLTGFSVGGQAVQI 357
           L    S  +G L   +     + TA       P+ R   V TFY V LTG SVGG  + I
Sbjct: 332 L-PPTSGGAGFLTLGAPPNSSSSTAASGLSFTPMRRLPSVPTFYIVTLTGISVGGAPLAI 390

Query: 358 PPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGN--LKPTSGVALFDTCY 415
           PPS F        G+++D GT IT L   AY +LR +F        L P S   + DTCY
Sbjct: 391 PPSAFSS------GMVIDSGTVITGLPATAYAALRSAFRSAMSEYRLLPPSNGGVLDTCY 444

Query: 416 DFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQ 475
           DF+G  +V VPT+SL F  G  +DL A   ++ VD  G   FA A T +A+ IIGNV Q+
Sbjct: 445 DFTGHANVTVPTISLTFSGGATIDLAAPAGVL-VD--GCLAFAGAGTDNAIGIIGNVNQR 501

Query: 476 GTRVSFDLANNRVGFTPNKC 495
              V +D     VGF    C
Sbjct: 502 TFEVLYDSGKGTVGFRAGAC 521


>gi|118487542|gb|ABK95598.1| unknown [Populus trichocarpa]
          Length = 471

 Score =  219 bits (559), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 129/380 (33%), Positives = 209/380 (55%), Gaps = 26/380 (6%)

Query: 134 PAEAQIL-PEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPC-T 191
           P    +L P   S P+  G S GSG Y+ ++G+GTPP+ ++M+LDTGS ++WLQC+PC  
Sbjct: 99  PKSGHLLEPNSASIPLNPGLSIGSGNYYVKLGLGTPPKYYAMILDTGSSLSWLQCQPCAV 158

Query: 192 ECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACR-------ANRCLYQVAYGDGSFT 244
            C+ Q+DP++DP  S +Y  L CA+ +C  L  +          +N CLY  +YGD SF+
Sbjct: 159 YCHAQADPLYDPSVSKTYKKLSCASVECSRLKAATLNDPLCETDSNACLYTASYGDTSFS 218

Query: 245 VGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SL 301
           +G L  + ++  +S ++     GCG DN+GLF  +AG++GL    LS+  Q+      + 
Sbjct: 219 IGYLSQDLLTLTSSQTLPQFTYGCGQDNQGLFGRAAGIIGLARDKLSMLAQLSTKYGHAF 278

Query: 302 AYCL--VDRDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPP 359
           +YCL   +  S   G L   S         P++ + K  + Y++ LT  +V G+ + +  
Sbjct: 279 SYCLPTANSGSSGGGFLSIGSISPTSYKFTPMLTDSKNPSLYFLRLTAITVSGRPLDLAA 338

Query: 360 SLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVR-LAGNLKPTSGVALFDTCYDFS 418
           +++ +        ++D GT ITRL    Y +LR +FV+ ++         ++ DTC+  S
Sbjct: 339 AMYRVPT------LIDSGTVITRLPMSMYAALRQAFVKIMSTKYAKAPAYSILDTCFKGS 392

Query: 419 GLRSVR-VPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTS--SALSIIGNVQQQ 475
            L+S+  VP + + F  G  L L A + LI  D  G  C AFA +S  + ++IIGN QQQ
Sbjct: 393 -LKSISAVPEIKMIFQGGADLTLRAPSILIEADK-GITCLAFAGSSGTNQIAIIGNRQQQ 450

Query: 476 GTRVSFDLANNRVGFTPNKC 495
              +++D++ +R+GF P  C
Sbjct: 451 TYNIAYDVSTSRIGFAPGSC 470


>gi|125571841|gb|EAZ13356.1| hypothetical protein OsJ_03278 [Oryza sativa Japonica Group]
          Length = 447

 Score =  219 bits (558), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 151/450 (33%), Positives = 209/450 (46%), Gaps = 63/450 (14%)

Query: 79  SFSLPLHSREILHKTRHNDYR-SLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEA 137
           +  +P+  R+ L        R SL+  RL  D+AR  +L+                    
Sbjct: 26  TLHVPVFHRDALFPPPPGAKRGSLLRQRLAADAARYASLVDATGR--------------- 70

Query: 138 QILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQS 197
                   +PV SG    SGEYF+ +GVGTP  +  +V+DTGSD+ WLQC PC  CY Q 
Sbjct: 71  ------LHSPVFSGIPFESGEYFALVGVGTPSTKAMLVIDTGSDLVWLQCSPCRRCYAQR 124

Query: 198 DPIFDPKTSSSYSPLPCAAPQCKSLDVSACRA-----NRCLYQVAYGDGSFTVGDLVTET 252
             +FDP+ SS+Y  +PC++PQC++L    C +       C Y VAYGDGS + GDL T+ 
Sbjct: 125 GQVFDPRRSSTYRRVPCSSPQCRALRFPGCDSGGAAGGGCRYMVAYGDGSSSTGDLATDK 184

Query: 253 VSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPA 312
           ++F N   V  + LGCG DNEGLF  +AGLLG        +++            R +P+
Sbjct: 185 LAFANDTYVNNVTLGCGRDNEGLFDSAAGLLGRRAAARYPSRRR--------WPRRTAPS 236

Query: 313 SGVLEFNSARGGDAVTAPLIRN------------------KKVDTFYYVGLTGFSVGGQA 354
           S        R   A                          +   T+ + G    + G   
Sbjct: 237 SSTASATGRRAQRAARTSCSAARRSRRPRRSPPCCRTRGARACTTWTWPGSASAARGSPG 296

Query: 355 VQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGV---ALF 411
            + P S +       GG++VD GTAI+R    AY +LRD+F   A            ++F
Sbjct: 297 SRTPASRWTRRRG-RGGVVVDSGTAISRFARDAYAALRDAFDARARAAGMRRLAGEHSVF 355

Query: 412 DTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDS----AGTF--CFAFAPTSSA 465
           D CYD  G  +   P + LHF  G  + LP +NY +PVD     A ++  C  F      
Sbjct: 356 DACYDLRGRPAASAPLIVLHFAGGADMALPPENYFLPVDGGRRRAASYRRCLGFEAADDG 415

Query: 466 LSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           LS+IGNVQQQG RV FD+   R+GF P  C
Sbjct: 416 LSVIGNVQQQGFRVVFDVEKERIGFAPKGC 445


>gi|414877221|tpg|DAA54352.1| TPA: hypothetical protein ZEAMMB73_214154 [Zea mays]
          Length = 447

 Score =  219 bits (558), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 142/359 (39%), Positives = 198/359 (55%), Gaps = 21/359 (5%)

Query: 155 GSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPC 214
           G  EY   + +GTPP  F  + DTGSD+ W QC+PC  C+ Q  PI+D   SSS+SP+PC
Sbjct: 89  GQAEYLMELAIGTPPVPFVALADTGSDLTWTQCQPCKLCFPQDTPIYDTAVSSSFSPVPC 148

Query: 215 AAPQCKSLDVSA-CRANR--CLYQVAYGDGSFTVGDLVTETVSF-GNSG-SVKGIALGCG 269
           A+  C  +  S  C A+   C Y+ AYGDG+++ G L TET++F G  G SV GIA GCG
Sbjct: 149 ASATCLPIWSSRNCTASSSPCRYRYAYGDGAYSAGVLGTETLTFPGAPGVSVGGIAFGCG 208

Query: 270 HDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVD--RDSPASGVL-----EFNSAR 322
            DN GL   S G +GLG G LSL  Q+     +YCL D    S  S VL     E  +  
Sbjct: 209 VDNGGLSYNSTGTVGLGRGSLSLVAQLGVGKFSYCLTDFFNTSLGSPVLFGALAELAAPS 268

Query: 323 GGDAV-TAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAIT 381
            G AV + PL+++  V T+YYV L G S+G   + IP   F++ + G GG+IVD GT  T
Sbjct: 269 TGAAVQSTPLVQSPYVPTWYYVSLEGISLGDARLPIPNGTFDLRDDGSGGMIVDSGTTFT 328

Query: 382 RLQTQAYNSLRDSFVRLAGNLK-PTSGVALFDT-CYD-FSGLRSV-RVPTVSLHFGAGKA 437
            L   A+  + D    +AG L+ P    +  D+ C+   +G + +  +P + LHF  G  
Sbjct: 329 FLVESAFRVVVD---HVAGVLRQPVVNASSLDSPCFPAATGEQQLPAMPDMVLHFAGGAD 385

Query: 438 LDLPAKNYLIPVDSAGTFCFAFAPTSSA-LSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           + L   NY+       +FC   A + SA +SI+GN QQQ  ++ FD+   ++ F P  C
Sbjct: 386 MRLHRDNYMSFNQEESSFCLNIAGSPSADVSILGNFQQQNIQMLFDITVGQLSFMPTDC 444


>gi|293333012|ref|NP_001168410.1| uncharacterized protein LOC100382179 precursor [Zea mays]
 gi|223948083|gb|ACN28125.1| unknown [Zea mays]
 gi|413953783|gb|AFW86432.1| hypothetical protein ZEAMMB73_737575 [Zea mays]
          Length = 466

 Score =  219 bits (557), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 158/455 (34%), Positives = 222/455 (48%), Gaps = 52/455 (11%)

Query: 63  EESETAAESFPLNSSSSFSLPLHSRE-----ILHKTRHNDYRSLVLSRLERDSARVNTLI 117
           E SE  +     +S +  +LPL  R      ++ K + +   +L      RD  R   + 
Sbjct: 42  EPSEVCSGQKVTSSKNGATLPLVHRHGPCSPVMSKEKPSHEETL-----GRDQLRAANIH 96

Query: 118 TKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLD 177
            KL  +  N    EL+ +   I       P  SG S G+ EY   + +GTP     M +D
Sbjct: 97  AKLS-SPRNSSAKELQQSGVTI-------PTSSGYSLGTPEYVITVSLGTPAVTQVMSID 148

Query: 178 TGSDINWLQCRPCT--ECYQQSDPIFDPKTSSSYSPLPCAAPQCKSL--DVSACRANRCL 233
           TGSD++W+QC PC    C  Q D +FDP  S++YS   C++ QC  L  + + C  + C 
Sbjct: 149 TGSDVSWVQCAPCAAQSCSSQKDKLFDPAKSATYSAFSCSSAQCAQLGGEGNGCLNSHCQ 208

Query: 234 YQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSA-GLLGLGGGMLSL 292
           Y V Y D S T G   ++T+    S +VK    GC H   G FVG   GL+GLGG   SL
Sbjct: 209 YIVKYVDHSNTTGTYGSDTLGLTTSDAVKNFQFGCSHRANG-FVGQLDGLMGLGGDTESL 267

Query: 293 TKQIKAT---SLAYCLVDRDSPASGVLEFNSARGGDAVT----APLIRNKKVDTFYYVGL 345
             Q  AT   + +YCL    S A G L   +A GG + +     PL+R   V TFY V L
Sbjct: 268 VSQTAATYGKAFSYCLPPSSSSAGGFLTLGAAAGGTSSSRYSRTPLVRFN-VPTFYGVFL 326

Query: 346 TGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPT 405
              +V G  + +P S+F       G  +VD GT IT+L   AY +LR +F +        
Sbjct: 327 QAITVAGTKLNVPASVFS------GASVVDSGTVITQLPPTAYQALRTAFKKEMKAYPSA 380

Query: 406 SGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTF---CFAFAPT 462
           + V + DTC+DFSG+++VRVP V+L F  G  +DL         D +G F   C AF  T
Sbjct: 381 APVGILDTCFDFSGIKTVRVPVVTLTFSRGAVMDL---------DVSGIFYAGCLAFTAT 431

Query: 463 SS--ALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           +      I+GNVQQ+   + FD+  + +GF P  C
Sbjct: 432 AQDGDTGILGNVQQRTFEMLFDVGGSTLGFRPGAC 466


>gi|242095602|ref|XP_002438291.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
 gi|241916514|gb|EER89658.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
          Length = 464

 Score =  219 bits (557), Expect = 4e-54,   Method: Compositional matrix adjust.
 Identities = 146/401 (36%), Positives = 198/401 (49%), Gaps = 31/401 (7%)

Query: 106 LERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGV 165
           L RD  R   +  K+     NV + EL+ +   I       P  SG S G+ EY   + +
Sbjct: 84  LRRDQLRAAYIQAKVSSRYNNVAK-ELQQSAVTI-------PTSSGYSLGTTEYVITVTI 135

Query: 166 GTPPRQFSMVLDTGSDINWLQCRPCT--ECYQQSDPIFDPKTSSSYSPLPCAAPQCKSL- 222
           GTP     M +DTGSD++W+QC PC    C  Q D +FDP  S++YS   C + QC  L 
Sbjct: 136 GTPAVTQVMSIDTGSDVSWVQCAPCAAQSCSSQKDKLFDPAMSATYSAFSCGSAQCAQLG 195

Query: 223 -DVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSA- 280
            + + C  ++C Y V YGDGS T G   ++T+S  +S +VK    GC H   G FVG   
Sbjct: 196 DEGNGCLKSQCQYIVKYGDGSNTAGTYGSDTLSLTSSDAVKSFQFGCSHRAAG-FVGELD 254

Query: 281 GLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASGVLEFNSARGGDAVT---APLIRN 334
           GL+GLGG   SL  Q  AT   + +YCL    S   G L   +A G  +      P++R 
Sbjct: 255 GLMGLGGDTESLVSQTAATYGKAFSYCLPPPSSSGGGFLTLGAAGGASSSRYSHTPMVRF 314

Query: 335 KKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDS 394
             V TFY V L G +V G  + +P S+F       G  +VD GT IT+L   AY +LR +
Sbjct: 315 S-VPTFYGVFLQGITVAGTMLNVPASVFS------GASVVDSGTVITQLPPTAYQALRTA 367

Query: 395 FVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGT 454
           F +        + V   DTC+DFSG  ++ VPTV+L F  G A+DL     L     AG 
Sbjct: 368 FKKEMKAYPSAAPVGSLDTCFDFSGFNTITVPTVTLTFSRGAAMDLDISGILY----AGC 423

Query: 455 FCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
             F          I+GNVQQ+   + FD+    +GF    C
Sbjct: 424 LAFTATAHDGDTGILGNVQQRTFEMLFDVGGRTIGFRSGAC 464


>gi|115467014|ref|NP_001057106.1| Os06g0209100 [Oryza sativa Japonica Group]
 gi|51091210|dbj|BAD35903.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
 gi|113595146|dbj|BAF19020.1| Os06g0209100 [Oryza sativa Japonica Group]
 gi|125554496|gb|EAZ00102.1| hypothetical protein OsI_22105 [Oryza sativa Indica Group]
          Length = 454

 Score =  219 bits (557), Expect = 4e-54,   Method: Compositional matrix adjust.
 Identities = 132/379 (34%), Positives = 188/379 (49%), Gaps = 37/379 (9%)

Query: 148 VVSGASQGSG----EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQ-SDPIFD 202
           V +G   G G    EY   + VGTPPR  ++ LDTGSD+ W QC PC +C++Q + P+ D
Sbjct: 75  VRAGLGAGGGIVTNEYLMHVSVGTPPRPVALTLDTGSDLVWTQCAPCLDCFEQGAAPVLD 134

Query: 203 PKTSSSYSPLPCAAPQCKSLDVSACRA-----NRCLYQVAYGDGSFTVGDLVTETVSFGN 257
           P  SS+++ LPC AP C++L  ++C         C+Y   YGD S TVG L T++ +FG 
Sbjct: 135 PAASSTHAALPCDAPLCRALPFTSCGGRSWGDRSCVYVYHYGDRSLTVGQLATDSFTFGG 194

Query: 258 SGSVKGIA-----LGCGHDNEGLF-VGSAGLLGLGGGMLSLTKQIKATSLAYCLVDR-DS 310
             +  G+A      GCGH N+G+F     G+ G G G  SL  Q+  TS +YC     D+
Sbjct: 195 DDNAGGLAARRVTFGCGHINKGIFQANETGIAGFGRGRWSLPSQLNVTSFSYCFTSMFDT 254

Query: 311 PASGVLEF-----------NSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPP 359
            +S V+             ++A  GD  T  LI+N    + Y+V L G SVGG  V +P 
Sbjct: 255 KSSSVVTLGAAAAELLHTHHAAHTGDVRTTRLIKNPSQPSLYFVPLRGISVGGARVAVPE 314

Query: 360 SLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDF-- 417
           S            I+D G +IT L    Y +++  FV   G     +G A  D C+    
Sbjct: 315 SRLR------SSTIIDSGASITTLPEDVYEAVKAEFVSQVGLPAAAAGSAALDLCFALPV 368

Query: 418 -SGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQG 476
            +  R   VP ++LH   G   +LP  NY+    +A   C      +    +IGN QQQ 
Sbjct: 369 AALWRRPAVPALTLHLDGGADWELPRGNYVFEDYAARVLCVVLDAAAGEQVVIGNYQQQN 428

Query: 477 TRVSFDLANNRVGFTPNKC 495
           T V +DL N+ + F P +C
Sbjct: 429 THVVYDLENDVLSFAPARC 447


>gi|194707866|gb|ACF88017.1| unknown [Zea mays]
          Length = 461

 Score =  218 bits (556), Expect = 4e-54,   Method: Compositional matrix adjust.
 Identities = 151/448 (33%), Positives = 228/448 (50%), Gaps = 34/448 (7%)

Query: 59  EPFAEESETAAESFPLNSSSSFSLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLIT 118
            P  +   + +++ P +S+ + ++PLH R              +   L RD  R   +  
Sbjct: 37  SPRTDSVCSQSKAVPSSSAGAATVPLHHRHGPCSPLPTKKMPTLEETLHRDQLRAAYIQR 96

Query: 119 KLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDT 178
           K               A   +   D + P   G S  + EY   +G+G+P    +M++DT
Sbjct: 97  KFSGGGG---------AGGDVQRSDATVPTALGTSLNTLEYLITVGLGSPATSQTMLIDT 147

Query: 179 GSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSL--DVSAC-RANRCLYQ 235
           GSD++W+QC+PC++C+ Q+DP+FDP +SS+YSP  C +  C  L  + + C  +++C Y 
Sbjct: 148 GSDVSWVQCKPCSQCHSQADPLFDPSSSSTYSPFSCGSADCAQLGQEGNGCSSSSQCQYI 207

Query: 236 VAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQ 295
           V YGDGS T G   ++T++ G+S +V+    GC +   G    + GL+GLGGG  SL  Q
Sbjct: 208 VTYGDGSSTTGTYSSDTLALGSS-AVRSFQFGCSNVESGFNDQTDGLMGLGGGAQSLVSQ 266

Query: 296 IKAT---SLAYCLVDRDSPASGVLEFNSARGGDA---VTAPLIRNKKVDTFYYVGLTGFS 349
              T   + +YCL    S +SG L   +A G      V  P++R+ +V TFY V L    
Sbjct: 267 TAGTLGRAFSYCLPPTPS-SSGFLTLGAAGGSGTSGFVKTPMLRSSQVPTFYGVRLQAIR 325

Query: 350 VGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVA 409
           VGG+ + IP S+F        G ++D GT ITRL   AY++L  +F        P     
Sbjct: 326 VGGRQLSIPASVFS------AGTVMDSGTVITRLPPTAYSALSSAFKAGMKQYPPAQPSG 379

Query: 410 LFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTS--SALS 467
           + DTC+DFSG  SV +P+V+L F  G  + L A   ++      + C AFA  S  S+L 
Sbjct: 380 ILDTCFDFSGQSSVSIPSVALVFSGGAVVSLDASGIIL------SNCLAFAGNSDDSSLG 433

Query: 468 IIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           IIGNVQQ+   V +D+    VGF    C
Sbjct: 434 IIGNVQQRTFEVLYDVGRGVVGFRAGAC 461


>gi|356524289|ref|XP_003530762.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-2-like [Glycine max]
          Length = 392

 Score =  218 bits (556), Expect = 5e-54,   Method: Compositional matrix adjust.
 Identities = 141/407 (34%), Positives = 204/407 (50%), Gaps = 33/407 (8%)

Query: 106 LERDSARVNTLITKLQLAIYNVDRHE-LKPAEAQILPEDFSTPVVSGASQGSGEYFSRIG 164
           +  D+ RV  + ++L     N+ R   +K  ++  LP +      SG+  GS  Y   +G
Sbjct: 1   MNLDNERVKYIQSRLS---KNLGRENTVKDLDSTTLPAE------SGSLIGSANYVVVVG 51

Query: 165 VGTPPRQFSMVLDTGSDINWLQCRPCT-ECYQQSDPIFDPKTSSSYSPLPCAAPQCKSL- 222
           +GTP R  S+V DTGSD+ W QC PC   CY+Q D IFDP  SSSY+ + C +  C  L 
Sbjct: 52  LGTPKRDLSLVFDTGSDLTWTQCEPCAGSCYKQQDAIFDPSKSSSYTNITCTSSLCTQLT 111

Query: 223 ------DVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLF 276
                 + S+     C+Y   YGD S +VG L  E ++   +  V     GCG DNEGLF
Sbjct: 112 SDGIKSECSSSTDASCIYDAKYGDNSTSVGFLSQERLTITATDIVDDFLFGCGQDNEGLF 171

Query: 277 VGSAGLLGLGGGMLSLTKQIKATS---LAYCLVDRDSPASGVLEFNSARGGDA--VTAPL 331
            GSAGL+GLG   +S+ +Q  +      +YCL    S + G L F ++   +A  +  PL
Sbjct: 172 NGSAGLMGLGRHPISIVQQTSSNYNKIFSYCL-PATSSSLGHLTFGASAATNASLIYTPL 230

Query: 332 IRNKKVDTFYYVGLTGFSVGGQAV-QIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNS 390
                 ++FY + +   SVGG  +  +  S F       GG I+D GT ITRL    Y +
Sbjct: 231 STISGDNSFYGLDIVSISVGGTKLPAVSSSTFSA-----GGSIIDSGTVITRLAPTVYAA 285

Query: 391 LRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVD 450
           LR +F R        +   L DTCYD SG + + VP +   F  G  ++L  +  ++ V+
Sbjct: 286 LRSAFRRXMEKYPVANEAGLLDTCYDLSGYKEISVPRIDFEFSGGVTVELXHRG-ILXVE 344

Query: 451 SAGTFCFAFAPTSS--ALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           S    C AFA   S   +++ GNVQQ+   V +D+   R+GF    C
Sbjct: 345 SEQQVCLAFAANGSDNDITVFGNVQQKTLEVVYDVKGGRIGFGAAGC 391


>gi|413952718|gb|AFW85367.1| hypothetical protein ZEAMMB73_231535 [Zea mays]
          Length = 443

 Score =  218 bits (556), Expect = 5e-54,   Method: Compositional matrix adjust.
 Identities = 122/366 (33%), Positives = 185/366 (50%), Gaps = 36/366 (9%)

Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCA 215
           + EY  R+ VGTP R  ++ LDTGSD+ W QC PC +C+ Q  P+ DP  SS+Y+ LPC 
Sbjct: 81  TNEYLVRLAVGTPRRPVALTLDTGSDLVWTQCAPCRDCFDQDLPVLDPAASSTYAALPCG 140

Query: 216 APQCKSLDVSAC------RANRCLYQVAYGDGSFTVGDLVTETVSFGNSG------SVKG 263
           A +C++L  ++C          C+Y   YGD S TVG++ T+  +FG+SG        + 
Sbjct: 141 AARCRALPFTSCGVRTLGNHRSCIYAYHYGDKSLTVGEIATDRFTFGDSGGSGESLHTRR 200

Query: 264 IALGCGHDNEGLFVGS-AGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLE----- 317
           +  GCGH N+G+F  +  G+ G G G  SL  Q+  TS +YC        S ++      
Sbjct: 201 LTFGCGHLNKGVFQSNETGIAGFGRGRWSLPSQLNVTSFSYCFTSMFESKSSLVTLGGSP 260

Query: 318 ---FNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIV 374
              ++ A  G+  T P+++N    + Y++ L G SVG   + +P + F          I+
Sbjct: 261 AALYSHAHSGEVRTTPILKNPSQPSLYFLSLKGISVGKTRLPVPETKFR-------STII 313

Query: 375 DCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGV--ALFDTCYDF---SGLRSVRVPTVS 429
           D G +IT L  + Y +++  F    G   P SGV  +  D C+     +  R   VP+++
Sbjct: 314 DSGASITTLPEEVYEAVKAEFAAQVG--LPPSGVEGSALDLCFALPVTALWRRPAVPSLT 371

Query: 430 LHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVG 489
           LH   G   +LP  NY+     A   C          ++IGN QQQ T V +DL N+R+ 
Sbjct: 372 LHL-EGADWELPRSNYVFEDLGARVMCIVLDAAPGEQTVIGNFQQQNTHVVYDLENDRLS 430

Query: 490 FTPNKC 495
           F P +C
Sbjct: 431 FAPARC 436


>gi|413923784|gb|AFW63716.1| aspartic proteinase nepenthesin-2 [Zea mays]
          Length = 531

 Score =  218 bits (555), Expect = 5e-54,   Method: Compositional matrix adjust.
 Identities = 151/448 (33%), Positives = 228/448 (50%), Gaps = 34/448 (7%)

Query: 59  EPFAEESETAAESFPLNSSSSFSLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLIT 118
            P  +   + +++ P +S+ + ++PLH R              +   L RD  R   +  
Sbjct: 107 SPRTDSVCSQSKAVPSSSAGAATVPLHHRHGPCSPLPTKKMPTLEETLHRDQLRAAYIQR 166

Query: 119 KLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDT 178
           K               A   +   D + P   G S  + EY   +G+G+P    +M++DT
Sbjct: 167 KFSGGGG---------AGGDVQRSDATVPTALGTSLNTLEYLITVGLGSPATSQTMLIDT 217

Query: 179 GSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSL--DVSAC-RANRCLYQ 235
           GSD++W+QC+PC++C+ Q+DP+FDP +SS+YSP  C +  C  L  + + C  +++C Y 
Sbjct: 218 GSDVSWVQCKPCSQCHSQADPLFDPSSSSTYSPFSCGSADCAQLGQEGNGCSSSSQCQYI 277

Query: 236 VAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQ 295
           V YGDGS T G   ++T++ G+S +V+    GC +   G    + GL+GLGGG  SL  Q
Sbjct: 278 VTYGDGSSTTGTYSSDTLALGSS-AVRSFQFGCSNVESGFNDQTDGLMGLGGGAQSLVSQ 336

Query: 296 IKAT---SLAYCLVDRDSPASGVLEFNSARGGDA---VTAPLIRNKKVDTFYYVGLTGFS 349
              T   + +YCL    S +SG L   +A G      V  P++R+ +V TFY V L    
Sbjct: 337 TAGTLGRAFSYCLPPTPS-SSGFLTLGAAGGSGTSGFVKTPMLRSSQVPTFYGVRLQAIR 395

Query: 350 VGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVA 409
           VGG+ + IP S+F        G ++D GT ITRL   AY++L  +F        P     
Sbjct: 396 VGGRQLSIPASVFS------AGTVMDSGTVITRLPPTAYSALSSAFKAGMKQYPPAQPSG 449

Query: 410 LFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTS--SALS 467
           + DTC+DFSG  SV +P+V+L F  G  + L A   ++      + C AFA  S  S+L 
Sbjct: 450 ILDTCFDFSGQSSVSIPSVALVFSGGAVVSLDASGIIL------SNCLAFAGNSDDSSLG 503

Query: 468 IIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           IIGNVQQ+   V +D+    VGF    C
Sbjct: 504 IIGNVQQRTFEVLYDVGRGVVGFRAGAC 531


>gi|357118064|ref|XP_003560779.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 472

 Score =  217 bits (552), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 149/415 (35%), Positives = 211/415 (50%), Gaps = 49/415 (11%)

Query: 105 RLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIG 164
           RL  D AR + ++ K         R  +            S P   G    S EY   +G
Sbjct: 83  RLRSDRARADHILRKAS------GRRMMSEGGGA------SIPTYLGGFVDSLEYVVTLG 130

Query: 165 VGTPPRQFSMVLDTGSDINWLQCRPC--TECYQQSDPIFDPKTSSSYSPLPCAAPQCKSL 222
           +GTP  Q ++++DTGSD++W+QC+PC  ++CY Q DP+FDP  SS+++ +PCA+  CK L
Sbjct: 131 IGTPAVQQTVLIDTGSDLSWVQCKPCNASDCYPQKDPLFDPSKSSTFATIPCASDACKQL 190

Query: 223 DV----SACRAN------RCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDN 272
            V    + C  N      +C Y + YG+G+ T G   TET++ G+S  VK    GCG D 
Sbjct: 191 PVDGYDNGCTNNTSGMPPQCGYAIEYGNGAITEGVYSTETLALGSSAVVKSFRFGCGSDQ 250

Query: 273 EGLFVGSAGLLGLGGG---MLSLTKQIKATSLAYCLVDRDSPASGVLEF------NSARG 323
            G +    GLLGLGG    ++S T  +   + +YCL   +S  +G L        N++  
Sbjct: 251 HGPYDKFDGLLGLGGAPESLVSQTASVYGGAFSYCLPPLNS-GAGFLTLGAPNSTNNSNS 309

Query: 324 GDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRL 383
           G   T     + K+ TFY V LTG SVGG+A+ IPP++F        G IVD GT IT +
Sbjct: 310 GFVFTPMHAFSPKIATFYVVTLTGISVGGKALDIPPAVFAK------GNIVDSGTVITGI 363

Query: 384 QTQAYNSLRDSFVRLAGN--LKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLP 441
            T AY +LR +F        L P +  AL DTCY+F+G  +V VP V+L F  G  +DL 
Sbjct: 364 PTTAYKALRTAFRSAMAEYPLLPPADSAL-DTCYNFTGHGTVTVPKVALTFVGGATVDLD 422

Query: 442 AKNYLIPVDSAGTFCFAFAPTS-SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
             + ++  D     C AFA     +  IIGNV  +   V +D     +GF    C
Sbjct: 423 VPSGVLVED-----CLAFADAGDGSFGIIGNVNTRTIEVLYDSGKGHLGFRAGAC 472


>gi|357492401|ref|XP_003616489.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355517824|gb|AES99447.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 434

 Score =  217 bits (552), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 142/384 (36%), Positives = 196/384 (51%), Gaps = 27/384 (7%)

Query: 129 RHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCR 188
           R  +  A       D STP  S      G Y     VGTPP +   + DTGSDI WLQC 
Sbjct: 58  RRSINRANHFFKDSDTSTPE-STVIPDRGGYLMTYSVGTPPTKIYGIADTGSDIVWLQCE 116

Query: 189 PCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSL-DVSACRANRCLYQVAYGDGSFTVGD 247
           PC +CY Q+ PIF+P  SSSY  +PC +  C S+ D S    N C Y+++YGD S + GD
Sbjct: 117 PCEQCYNQTTPIFNPSKSSSYKNIPCLSKLCHSVRDTSCSDQNSCQYKISYGDSSHSQGD 176

Query: 248 LVTETVSF----GNSGSVKGIALGCGHDNEGLFVG-SAGLLGLGGGMLSLTKQIKAT--- 299
           L  +T+S     G+  S     +GCG DN G F G S+G++GLGGG +SL  Q+ ++   
Sbjct: 177 LSVDTLSLESTSGSPVSFPKTVIGCGTDNAGTFGGASSGIVGLGGGPVSLITQLGSSIGG 236

Query: 300 SLAYCLV---DRDSPASGVLEFNSA---RGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQ 353
             +YCLV   +++S AS +L F  A    G   V+ PLI+   V  FY++ L  FSVG +
Sbjct: 237 KFSYCLVPLLNKESNASSILSFGDAAVVSGDGVVSTPLIKKDPV--FYFLTLQAFSVGNK 294

Query: 354 AVQIPPSLFEMDEAGD--GGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALF 411
            V+   S     E GD  G II+D GT +T + +  Y +L  + V L    +       F
Sbjct: 295 RVEFGGS----SEGGDDEGNIIIDSGTTLTLIPSDVYTNLESAVVDLVKLDRVDDPNQQF 350

Query: 412 DTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGN 471
             CY          P ++ HF  G  ++L + +  +P+ + G  CFAF P+    SI GN
Sbjct: 351 SLCYSLKS-NEYDFPIITAHF-KGADIELHSISTFVPI-TDGIVCFAFQPSPQLGSIFGN 407

Query: 472 VQQQGTRVSFDLANNRVGFTPNKC 495
           + QQ   V +DL    V F P  C
Sbjct: 408 LAQQNLLVGYDLQQKTVSFKPTDC 431


>gi|242092900|ref|XP_002436940.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
 gi|241915163|gb|EER88307.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
          Length = 465

 Score =  217 bits (552), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 162/469 (34%), Positives = 228/469 (48%), Gaps = 44/469 (9%)

Query: 46  QTEHILSFEPE-TLEPFAEESETAAESFPLNSSSSFSLPL-HSREILHKTRHNDYRSLVL 103
             EH     P  + EP A  S ++    P  SS++ S+PL H       ++++D  +   
Sbjct: 22  DNEHGFVVVPRRSYEPKAVCSASSVNLEP--SSATLSVPLVHRYGPCAASQYSDMPTPSF 79

Query: 104 SRLERDS-ARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSR 162
           S   R S AR N + ++    + +       P +A +     + P   G    S EY   
Sbjct: 80  SETLRHSRARTNYIKSRASTGMAST------PDDAAV-----TVPTRLGGFVDSLEYMVT 128

Query: 163 IGVGTPPRQFSMVLDTGSDINWLQCRPC--TECYQQSDPIFDPKTSSSYSPLPCAAPQCK 220
           +G GTP     +++DTGSD++W+QC PC  TECY Q DP+FDP  SS+Y+P+ C A  C 
Sbjct: 129 LGFGTPSVPQVLLMDTGSDVSWVQCAPCNSTECYPQKDPLFDPSKSSTYAPIACGADACN 188

Query: 221 SLD---VSACRA--NRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGL 275
            L     + C +   +C Y+V YGDGS T G    ET++F    +VK    GCGHD  G 
Sbjct: 189 KLGDHYRNGCTSGGTQCGYRVEYGDGSSTRGVYSNETITFAPGITVKDFHFGCGHDQRGP 248

Query: 276 FVGSAGLLGLGGGMLSLTKQ---IKATSLAYCLVDRDSPAS----GVLEFNSARGGDAVT 328
                GLLGLGG   SL  Q   +   + +YCL   +S A     GV    +      V 
Sbjct: 249 SDKFDGLLGLGGAPESLVVQTASVYGGAFSYCLPALNSEAGFLALGVRPSAATNTSAFVF 308

Query: 329 APLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAY 388
            P+       T Y V +TG SVGG+ + IP S F       GG+++D GT +T L   AY
Sbjct: 309 TPMWHLPMDATSYMVNMTGISVGGKPLDIPRSAFR------GGMLIDSGTIVTELPETAY 362

Query: 389 NSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIP 448
           N+L ++ +R A    P      FDTCY+F+G  +V VP V+L F  G  +DL   N ++ 
Sbjct: 363 NAL-NAALRKAFAAYPMVASEDFDTCYNFTGYSNVTVPRVALTFSGGATIDLDVPNGILV 421

Query: 449 VDSAGTFCFAFAPTSS--ALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            D     C AF  +     L IIGNV Q+   V +D  + +VGF    C
Sbjct: 422 KD-----CLAFRESGPDVGLGIIGNVNQRTLEVLYDAGHGKVGFRAGAC 465


>gi|195607464|gb|ACG25562.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
          Length = 478

 Score =  216 bits (550), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 148/371 (39%), Positives = 198/371 (53%), Gaps = 38/371 (10%)

Query: 145 STPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCT---ECYQQSDPIF 201
           + P   G   G+  Y     +GTP    +M +DTGSD++W+QC+PC+    CY Q DP+F
Sbjct: 126 TVPASWGYDIGTLNYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCSAAPSCYSQKDPLF 185

Query: 202 DPKTSSSYSPLPCAAPQCKSLDVSACRANRCL---YQVAYGDGSFTVGDLVTETVSFGNS 258
           DP  SSSY+ +PC  P C  L + A  A       Y V+YGDGS T G   ++T++   S
Sbjct: 186 DPAQSSSYAAVPCGGPVCAGLGIYAASACSAAQCGYVVSYGDGSNTTGVYSSDTLTLSAS 245

Query: 259 GSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS---LAYCLVDRDSPASGV 315
            +V+G   GCGH   GLF G  GLLGLG    SL +Q   T     +YCL  + S A G 
Sbjct: 246 SAVQGFFFGCGHAQSGLFNGVDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPSTA-GY 304

Query: 316 LEFNSARGGDAVTAP------LIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGD 369
           L      GG +  AP      L+ +    T+Y V LTG SVGGQ + +P S F       
Sbjct: 305 LTLG--LGGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFA------ 356

Query: 370 GGIIVDCGTAITRLQTQAYNSLRDSFVR-LAGNLKPTS-GVALFDTCYDFSGLRSVRVPT 427
           GG +VD GT ITRL   AY +LR +F   +A    PT+    + DTCY+F+G  +V +P 
Sbjct: 357 GGTVVDTGTVITRLPPTAYAALRSAFRSGMASYGYPTAPSNGILDTCYNFAGYGTVTLPN 416

Query: 428 VSLHFGAGKALDLPAKNYLIPVDSAGTF-CFAFAPTSS--ALSIIGNVQQQGTRVSFDLA 484
           V+L FG+G  + L A   L       +F C AFAP+ S   ++I+GNVQQ+   V  D  
Sbjct: 417 VALTFGSGATVMLGADGIL-------SFGCLAFAPSGSDGGMAILGNVQQRSFEVRID-- 467

Query: 485 NNRVGFTPNKC 495
              VGF P+ C
Sbjct: 468 GTSVGFKPSSC 478


>gi|125555058|gb|EAZ00664.1| hypothetical protein OsI_22685 [Oryza sativa Indica Group]
          Length = 465

 Score =  216 bits (550), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 167/466 (35%), Positives = 223/466 (47%), Gaps = 46/466 (9%)

Query: 52  SFEPETLEPFAEESETAAESFPLNSSSSFSLPL-HSREILHKTRHNDYRSLVLSRLERDS 110
           SFEPE     A  S ++A S P  +S    +PL H       +  +  +  +  RL RD 
Sbjct: 24  SFEPE-----AACSTSSANSDPNRAS----VPLVHRHGPCAPSAASGGKPSLAERLRRDR 74

Query: 111 ARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPR 170
           AR N ++TK         R         +     S P   G S  S EY   +G+GTP  
Sbjct: 75  ARANYIVTKAAGG-----RTAATAVSDAVGGGGTSIPTFLGDSVDSLEYVVTLGIGTPAV 129

Query: 171 QFSMVLDTGSDINWLQCRPC--TECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSA-- 226
           Q  +++DTGSD++W+QC+PC   ECY Q DP+FDP +SSSY+ +PC +  C+ L   A  
Sbjct: 130 QQIVLIDTGSDLSWVQCKPCGAGECYAQKDPLFDPSSSSSYASVPCDSDACRKLAAGAYG 189

Query: 227 --CR---ANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAG 281
             C    A  C Y + YG+ + T G   TET++      V     GCG    G +    G
Sbjct: 190 HGCTSGAAALCEYGIEYGNRATTTGVYSTETLTLKPGVVVADFGFGCGDHQHGPYEKFDG 249

Query: 282 LLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASGVLEFNSARGGDAVTA-------PL 331
           LLGLGG   SL  Q  +      +YCL    S  +G L   +     + TA       P+
Sbjct: 250 LLGLGGAPESLVSQTSSQFGGPFSYCL-PPTSGGAGFLALGAPNSSSSSTAAAGFLFTPM 308

Query: 332 IRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSL 391
            R   V TFY V LTG SVGG  + +PPS F        G+++D GT IT L   AY +L
Sbjct: 309 RRIPSVPTFYVVTLTGISVGGAPLAVPPSAFSS------GMVIDSGTVITGLPATAYAAL 362

Query: 392 RDSFVRLAGN--LKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPV 449
           R +F        L P S  A+ DTCYDF+G  +V VPT++L F  G  +DL     ++ V
Sbjct: 363 RSAFRSAMSEYRLLPPSNGAVLDTCYDFTGHTNVTVPTIALTFSGGATIDLATPAGVL-V 421

Query: 450 DSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           D  G   FA A T   + IIGNV Q+   V +D     VGF    C
Sbjct: 422 D--GCLAFAGAGTDDTIGIIGNVNQRTFEVLYDSGKGTVGFRAGAC 465


>gi|242041115|ref|XP_002467952.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
 gi|241921806|gb|EER94950.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
          Length = 774

 Score =  216 bits (550), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 132/362 (36%), Positives = 189/362 (52%), Gaps = 28/362 (7%)

Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAP 217
           EY   + +GTPP+   ++LDTGSD+ W QCRPC  C+ ++    DP  SS++  LPC++P
Sbjct: 414 EYLVHLAIGTPPQPVQLILDTGSDLVWTQCRPCPVCFSRALGPLDPSNSSTFDVLPCSSP 473

Query: 218 QCKSLDVSACRANR-----CLYQVAYGDGSFTVGDLVTETVSFGNS-----GSVKGIALG 267
            C +L  S+C  +      C+Y  AY DGS T G L  ET +F  +      +V  +A G
Sbjct: 474 VCDNLTWSSCGKHNWGNQTCVYVYAYADGSITTGHLDAETFTFAAADGTGQATVPDLAFG 533

Query: 268 CGHDNEGLFVGS-AGLLGLGGGMLSLTKQIKATSLAYCL--VDRDSPASGVL----EFNS 320
           CG  N G+F  +  G+ G G G LSL  Q+K  + ++C   +    P+S +L       S
Sbjct: 534 CGLFNNGIFTSNETGIAGFGRGALSLPSQLKVDNFSHCFTAITGSEPSSVLLGLPANLYS 593

Query: 321 ARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAI 380
              G   + PL++N      YY+ L G +VG   + IP S F + + G GG I+D GT +
Sbjct: 594 DADGAVQSTPLVQNFSSLRAYYLSLKGITVGSTRLPIPESTFALKQDGTGGTIIDSGTGM 653

Query: 381 TRLQTQAYNSLRDSF---VRLAGNLKPTSGVALFDTCYDFSGLRSVR--VPTVSLHFGAG 435
           T L   AY  + D+F   VRL  +   +S ++    C+ FS  R  +  VP + LHF  G
Sbjct: 654 TTLPQDAYKLVHDAFTAQVRLPVDNATSSSLSRL--CFSFSVPRRAKPDVPKLVLHF-EG 710

Query: 436 KALDLPAKNYLIPVDSAG--TFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPN 493
             LDLP +NY+   + AG    C A       L+IIGN QQQ   V +DL  N + F P 
Sbjct: 711 ATLDLPRENYMFEFEDAGGSVTCLAIN-AGDDLTIIGNYQQQNLHVLYDLVRNMLSFVPA 769

Query: 494 KC 495
           +C
Sbjct: 770 QC 771


>gi|255548664|ref|XP_002515388.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223545332|gb|EEF46837.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 494

 Score =  216 bits (549), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 148/401 (36%), Positives = 213/401 (53%), Gaps = 27/401 (6%)

Query: 106 LERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGV 165
           L +D +RV+++ +KL     +    ++K   A  LP         G+  GSG YF  +G+
Sbjct: 109 LLQDQSRVDSIHSKLS---KDSGLSDVKATAATTLPAK------DGSIIGSGNYFVTVGL 159

Query: 166 GTPPRQFSMVLDTGSDINWLQCRPCTE-CYQQSDPIFDPKTSSSYSPLPCAAPQCKSL-- 222
           GTP + FS++ DTGSD+ W QC PC + CY Q + IF+P  S+SY+ + C +  C SL  
Sbjct: 160 GTPKKDFSLIFDTGSDLTWTQCEPCVKSCYNQKEAIFNPSQSTSYANISCGSTLCDSLAS 219

Query: 223 ---DVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGS 279
              ++  C ++ C+Y + YGD SF++G    E +S   +        GCG +N+GLF G+
Sbjct: 220 ATGNIFNCASSTCVYGIQYGDSSFSIGFFGKEKLSLTATDVFNDFYFGCGQNNKGLFGGA 279

Query: 280 AGLLGLGGGMLSL---TKQIKATSLAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNKK 336
           AGLLGLG   LSL   T Q      +YCL    S ++G L F  +    A   PL     
Sbjct: 280 AGLLGLGRDKLSLVSQTAQRYNKIFSYCL-PSSSSSTGFLTFGGSTSKSASFTPLATISG 338

Query: 337 VDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFV 396
             +FY + LTG SVGG+ + I PS+F        G I+D GT ITRL   AY++L  +F 
Sbjct: 339 GSSFYGLDLTGISVGGRKLAISPSVFST-----AGTIIDSGTVITRLPPAAYSALSSTFR 393

Query: 397 RLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFC 456
           +L         +++ DTC+DFS   ++ VP + L F  G  +D+  K  +  V+     C
Sbjct: 394 KLMSQYPAAPALSILDTCFDFSNHDTISVPKIGLFFSGGVVVDID-KTGIFYVNDLTQVC 452

Query: 457 FAFAPTSSA--LSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            AFA  S A  ++I GNVQQ+   V +D A  RVGF P  C
Sbjct: 453 LAFAGNSDASDVAIFGNVQQKTLEVVYDGAAGRVGFAPAGC 493


>gi|359497446|ref|XP_003635520.1| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
           vinifera]
          Length = 354

 Score =  216 bits (549), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 128/363 (35%), Positives = 198/363 (54%), Gaps = 24/363 (6%)

Query: 147 PVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPC-TECYQQSDPIFDPKT 205
           P+  GAS GSG Y+ ++G+G+P R +SM++DTGS ++WLQC+PC   C+ Q+DP+FDP  
Sbjct: 1   PLNPGASIGSGNYYVKVGLGSPARYYSMIVDTGSSLSWLQCKPCVVYCHVQADPLFDPSA 60

Query: 206 SSSYSPLPCAAPQCKSL-------DVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNS 258
           S +Y  L C + QC SL        +    +N C+Y  +YGD S+++G L  + ++   S
Sbjct: 61  SKTYKSLSCTSSQCSSLVDATLNNPLCETSSNVCVYTASYGDSSYSMGYLSQDLLTLAPS 120

Query: 259 GSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASGV 315
            ++ G   GCG D+EGLF  +AG+LGLG   LS+  Q+ +    + +YCL  R     G 
Sbjct: 121 QTLPGFVYGCGQDSEGLFGRAAGILGLGRNKLSMLGQVSSKFGYAFSYCLPTRG--GGGF 178

Query: 316 LEFNSAR--GGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGII 373
           L    A   G      P+  +    + Y++ LT  +VGG+A+ +  + + +        I
Sbjct: 179 LSIGKASLAGSAYKFTPMTTDPGNPSLYFLRLTAITVGGRALGVAAAQYRVPT------I 232

Query: 374 VDCGTAITRLQTQAYNSLRDSFVR-LAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHF 432
           +D GT ITRL    Y   + +FV+ ++       G ++ DTC+  +      VP V L F
Sbjct: 233 IDSGTVITRLPMSVYTPFQQAFVKIMSSKYARAPGFSILDTCFKGNLKDMQSVPEVRLIF 292

Query: 433 GAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTP 492
             G  L+L   N L+ VD  G  C AFA  ++ ++IIGN QQQ  +V+ D++  R+GF  
Sbjct: 293 QGGADLNLRPVNVLLQVDE-GLTCLAFA-GNNGVAIIGNHQQQTFKVAHDISTARIGFAT 350

Query: 493 NKC 495
             C
Sbjct: 351 GGC 353


>gi|168025534|ref|XP_001765289.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162683608|gb|EDQ70017.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 372

 Score =  216 bits (549), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 128/355 (36%), Positives = 190/355 (53%), Gaps = 16/355 (4%)

Query: 153 SQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPL 212
           S G GE+   I +GTPP++  +++DTGSD+ W+Q  PC  C++Q+DPIFDP  SS+Y+ +
Sbjct: 19  SAGYGEFLVPIYLGTPPQKAVVIIDTGSDLTWIQSEPCRACFEQADPIFDPSKSSTYNKI 78

Query: 213 PCAAPQCKS-LDVSACR-ANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGH 270
            C++  C   L    C  A  C+Y   YGDGS T G    ET++  ++   + +  G   
Sbjct: 79  ACSSSACADLLGTQTCSAAANCIYAYGYGDGSVTRGYFSKETITATDTAGEE-VKFGASV 137

Query: 271 DNEGLF--VGSAGLLGLGGGMLSLTKQIKA---TSLAYCLVDRDSPAS--GVLEFNSAR- 322
            N G F   G  G+LGLG G +S+  Q+ +      +YCLVD  S  S    + F  A  
Sbjct: 138 YNTGTFGDTGGEGILGLGQGPVSMPSQLGSVLGNKFSYCLVDWLSAGSETSTMYFGDAAV 197

Query: 323 -GGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAIT 381
             G+    P++ N    T+YY+ + G SVGG  + I  S++E+D  G GG I+D GT IT
Sbjct: 198 PSGEVQYTPIVPNADHPTYYYIAVQGISVGGSLLDIDQSVYEIDSGGSGGTIIDSGTTIT 257

Query: 382 RLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLP 441
            LQ + +N+L  ++         TS   L D C++  G  S   P +++H   G  L+LP
Sbjct: 258 YLQQEVFNALVAAYTSQVRYPTTTSATGL-DLCFNTRGTGSPVFPAMTIHLD-GVHLELP 315

Query: 442 AKNYLIPVDSAGTFCFAFAPT-SSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
             N  I +++    C AFA      ++I GN+QQQ   + +DL N R+GF P  C
Sbjct: 316 TANTFISLET-NIICLAFASALDFPIAIFGNIQQQNFDIVYDLDNMRIGFAPADC 369


>gi|116789442|gb|ABK25248.1| unknown [Picea sitchensis]
          Length = 366

 Score =  215 bits (548), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 152/368 (41%), Positives = 208/368 (56%), Gaps = 38/368 (10%)

Query: 3   PIKPFV-LFTITTILFSFCLFTSASSRGLSET---ATTVLDVSSALQQTEHILSFEPETL 58
           P+ PF  L  +  +LF      SA SR +S     A   LDV+S+L++T+          
Sbjct: 10  PLLPFTFLLCVGMLLF----LQSAQSRPISVPEVPAYHALDVASSLRETDTA-------- 57

Query: 59  EPFAEESETAAESFPLNSSSSFSLPLHSREILHKTRHN---DYRSLVLSRLERDSARVNT 115
              A  +E   E+ P  S  S  + +H   +L K   N    Y   +  +L R++ RV  
Sbjct: 58  ---AGGAEYKRETKPRRSPWSVEV-VHRDALLLKNAANATASYERRLKEKLRREAVRVRG 113

Query: 116 LITKLQLAIY----NVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQ 171
           L  +++  +      V+R+E   AE      DF   VVSG  QGSGEYF+RIGVGTP R+
Sbjct: 114 LERQIERTLTLNKDPVNRYE-NVAEVDA---DFGGEVVSGMEQGSGEYFTRIGVGTPTRE 169

Query: 172 FSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANR 231
             MVLDTGSD+ W+QC PC ECY Q+DPIF+P  S+S+S + C +  C  LD   C +  
Sbjct: 170 QYMVLDTGSDVAWIQCEPCRECYSQADPIFNPSYSASFSTVGCDSAVCSQLDAYDCHSGG 229

Query: 232 CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLS 291
           CLY+ +YGDGS++ G   TET++FG + SV  +A+GCGH N GLF+G+AGLLGLG G LS
Sbjct: 230 CLYEASYGDGSYSTGSFATETLTFGTT-SVANVAIGCGHKNVGLFIGAAGLLGLGAGALS 288

Query: 292 LTKQI---KATSLAYCLVDRDSPASGVLEF--NSARGGDAVTAPLIRNKKVDTFYYVGLT 346
              QI      + +YCLVDR+S +SG L+F   S   G   T PL +N  + TFYY+ +T
Sbjct: 289 FPNQIGTQTGHTFSYCLVDRESDSSGPLQFGPKSVPVGSIFT-PLEKNPHLPTFYYLSVT 347

Query: 347 GFSVGGQA 354
             S+   A
Sbjct: 348 AISISAIA 355


>gi|224286173|gb|ACN40797.1| unknown [Picea sitchensis]
          Length = 383

 Score =  215 bits (547), Expect = 4e-53,   Method: Compositional matrix adjust.
 Identities = 144/401 (35%), Positives = 215/401 (53%), Gaps = 32/401 (7%)

Query: 106 LERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGV 165
           ++R   R    + KLQ+    V+ H++K  E  + P+            GSGEY  ++ +
Sbjct: 1   MKRAIQRSQERLEKLQIT-SAVNTHQMKDIETPVTPD-----------IGSGEYLIQMAI 48

Query: 166 GTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVS 225
           GTP    S ++DTGSD+ W +C PCT+C   +  I+DP +SS+YS + C +  C+   + 
Sbjct: 49  GTPALSLSAIMDTGSDLVWTKCNPCTDC--STSSIYDPSSSSTYSKVLCQSSLCQPPSIF 106

Query: 226 ACRAN-RCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLG 284
           +C  +  C Y   YGD S T G L  ET S  +S S+  I  GCGHDN+G F    GL+G
Sbjct: 107 SCNNDGDCEYVYPYGDRSSTSGILSDETFSI-SSQSLPNITFGCGHDNQG-FDKVGGLVG 164

Query: 285 LGGGMLSLTKQI---KATSLAYCLVDR-DSPASGVLEFNSARGGDAVTA---PLIRNKKV 337
            G G LSL  Q+        +YCLV R DS  +  L   +    +A T    PL+++   
Sbjct: 165 FGRGSLSLVSQLGPSMGNKFSYCLVSRTDSSKTSPLFIGNTASLEATTVGSTPLVQSSST 224

Query: 338 DTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVR 397
           +  YY+ L G SVGGQ++ IP   F++   G GG+I+D GT +T LQ  AY++++++ V 
Sbjct: 225 N-HYYLSLEGISVGGQSLAIPTGTFDIQSDGSGGLIIDSGTTLTFLQQTAYDAVKEAMVS 283

Query: 398 LAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCF 457
            + NL    G    D C++  G  +   P+++ HF  G   D+P +NYL P  ++   C 
Sbjct: 284 -SINLPQADGQ--LDLCFNQQGSSNPGFPSMTFHF-KGADYDVPKENYLFPDSTSDIVCL 339

Query: 458 AFAPTSSAL---SIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           A  PT+S L   +I GNVQQQ  ++ +D  NN + F P  C
Sbjct: 340 AMMPTNSNLGNMAIFGNVQQQNYQILYDNENNVLSFAPTAC 380


>gi|293329689|dbj|BAJ04354.1| pollen allergen CPA63 [Cryptomeria japonica]
          Length = 472

 Score =  215 bits (547), Expect = 5e-53,   Method: Compositional matrix adjust.
 Identities = 144/372 (38%), Positives = 190/372 (51%), Gaps = 26/372 (6%)

Query: 142 EDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIF 201
           ED   P+ SG +  S  Y  ++G GTPP+ F  VLDTGS+I W+ C PC+ C  +  P F
Sbjct: 107 EDADIPLASGQAISSSNYIIKLGFGTPPQSFYTVLDTGSNIAWIPCNPCSGCSSKQQP-F 165

Query: 202 DPKTSSSYSPLPCAAPQCKSLDVSACRAN--RCLYQVAYGDGSFTVGDLVTETVSFGNSG 259
           +P  SS+Y+ L CA+ QC+ L V     N   C     YGD S     L +ET+S G S 
Sbjct: 166 EPSKSSTYNYLTCASQQCQLLRVCTKSDNSVNCSLTQRYGDQSEVDEILSSETLSVG-SQ 224

Query: 260 SVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQ---IKATSLAYCLVDRDSPA-SGV 315
            V+    GC +   GL   +  L+G G   LS   Q   +  ++ +YCL    S A +G 
Sbjct: 225 QVENFVFGCSNAARGLIQRTPSLVGFGRNPLSFVSQTATLYDSTFSYCLPSLFSSAFTGS 284

Query: 316 LEFNSARGGDAVTA------PLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGD 369
           L      G +A++A      PL+ N +  +FYYVGL G SVG + V IP     +DE+  
Sbjct: 285 LLL----GKEALSAQGLKFTPLLSNSRYPSFYYVGLNGISVGEELVSIPAGTLSLDESTG 340

Query: 370 GGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDF-SGLRSVRVPTV 428
            G I+D GT ITRL   AYN++RDSF     NL   S   LFDTCY+  SG   V  P +
Sbjct: 341 RGTIIDSGTVITRLVEPAYNAMRDSFRSQLSNLTMASPTDLFDTCYNRPSG--DVEFPLI 398

Query: 429 SLHFGAGKALDLPAKNYLIPVDSAGT-FCFAF----APTSSALSIIGNVQQQGTRVSFDL 483
           +LHF     L LP  N L P +  G+  C AF          LS  GN QQQ  R+  D+
Sbjct: 399 TLHFDDNLDLTLPLDNILYPGNDDGSVLCLAFGLPPGGGDDVLSTFGNYQQQKLRIVHDV 458

Query: 484 ANNRVGFTPNKC 495
           A +R+G     C
Sbjct: 459 AESRLGIASENC 470


>gi|449436215|ref|XP_004135889.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
          Length = 496

 Score =  215 bits (547), Expect = 5e-53,   Method: Compositional matrix adjust.
 Identities = 133/422 (31%), Positives = 221/422 (52%), Gaps = 42/422 (9%)

Query: 97  DYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGS 156
           D+  +  +R+  D+  VN+L +  + AI+    H+L  +++QI       P+ SGA   +
Sbjct: 92  DWEKIFQNRIILDAINVNSLFSHFKSAIFPGQTHQL--SDSQI-------PISSGARLQT 142

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAA 216
             Y   +G+G   +  ++++DTGSD+ W+QC PC  CY Q +P+F+P  SSS+  LPC +
Sbjct: 143 LNYIVTVGIGG--QNSTLIVDTGSDLTWVQCLPCRLCYNQQEPLFNPSNSSSFLSLPCNS 200

Query: 217 PQCKSLDVSA-----C---RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGC 268
           P C +L  +A     C    +  C YQ+ YGDGS++ G+L  E ++ G +  +     GC
Sbjct: 201 PTCVALQPTAGSSGLCSNKNSTSCDYQIDYGDGSYSRGELGFEKLTLGKT-EIDNFIFGC 259

Query: 269 GHDNEGLFVGSAGLLGLGGGMLSLTKQIKA---TSLAYCLVDRDSPASGVL-----EFNS 320
           G +N+GLF G++GL+GL    LSL  Q  +   +  +YCL      +SG L     +F++
Sbjct: 260 GRNNKGLFGGASGLMGLARSELSLVSQTSSLFGSVFSYCLPTTGVGSSGSLTLGGADFSN 319

Query: 321 ARGGDAVT-APLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGI--IVDCG 377
            +    ++   +I+N ++  FY++ LTG S+GG  + +P        + + G+  ++D G
Sbjct: 320 FKNISPISYTRMIQNPQMSNFYFLNLTGISIGGVNLNVP------RLSSNEGVLSLLDSG 373

Query: 378 TAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHF--GAG 435
           T ITRL    Y + +  F +     + T G ++ +TC++ +G   V +PTV   F   A 
Sbjct: 374 TVITRLSPSIYKAFKAEFEKQFSGYRTTPGFSILNTCFNLTGYEEVNIPTVKFIFEGNAE 433

Query: 436 KALDLPAKNYLIPVDSAGTFCFAFAPT--SSALSIIGNVQQQGTRVSFDLANNRVGFTPN 493
             +D+    Y +  D A   C AFA         IIGN QQ+  RV ++   ++VGF   
Sbjct: 434 MIVDVEGVFYFVKSD-ASQICLAFASLGYEDQTMIIGNYQQKNQRVIYNSKESKVGFAGE 492

Query: 494 KC 495
            C
Sbjct: 493 PC 494


>gi|242045118|ref|XP_002460430.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
 gi|241923807|gb|EER96951.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
          Length = 488

 Score =  215 bits (547), Expect = 5e-53,   Method: Compositional matrix adjust.
 Identities = 148/410 (36%), Positives = 207/410 (50%), Gaps = 41/410 (10%)

Query: 106 LERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGV 165
           L RD  RV+ +  K+  +         KP     L  ++      G S  +  Y + + +
Sbjct: 99  LRRDQDRVDAIRRKVTAS-------SNKPKGGVSLLANW------GKSLSTTNYVASLRL 145

Query: 166 GTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSL--- 222
           GTP  +  + LDTGSD +W+QC+PC +CY+Q DP+FDP  SS+YS +PC A +C+ L   
Sbjct: 146 GTPATELVVELDTGSDQSWVQCKPCADCYEQRDPVFDPTASSTYSAVPCGARECQELASS 205

Query: 223 ----DVSACRANRCLYQVAYGDGSFTVGDLVTETVSF------GNSGSVKGIALGCGHDN 272
               + S+     C Y+V+Y D S TVGDL  +T++         + +V G   GCGH N
Sbjct: 206 SSSRNCSSDNNKNCPYEVSYDDDSHTVGDLARDTLTLSPSPSPSPADTVPGFVFGCGHSN 265

Query: 273 EGLFVGSAGLLGLGGGMLSLTKQIKA---TSLAYCLVDRDSPASGVLEFNSARGGDAVTA 329
            G F    GLLGLG G  SL  Q+ A    + +YCL    S A+G L F  A        
Sbjct: 266 AGTFGEVDGLLGLGLGKASLPSQVAARYGAAFSYCLPSSPS-AAGYLSFGGAAARANAQF 324

Query: 330 PLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYN 389
             +   +  T YY+ LTG  V G+A+++P S F    A   G I+D GTA +RL   AY 
Sbjct: 325 TEMVTGQDPTSYYLNLTGIVVAGRAIKVPASAF----ATAAGTIIDSGTAFSRLPPSAYA 380

Query: 390 SLRDSFVRLAGNLK----PTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNY 445
           +LR SF    G  +    P+S   +FDTCYDF+G  +VR+P V L F  G  + L     
Sbjct: 381 ALRSSFRSAMGRYRYKRAPSS--PIFDTCYDFTGHETVRIPAVELVFADGATVHLHPSGV 438

Query: 446 LIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           L   +     C AF P    L I+GN QQ+   V +D+ + R+GF    C
Sbjct: 439 LYTWNDVAQTCLAFVPNHD-LGILGNTQQRTLAVIYDVGSQRIGFGRKGC 487


>gi|413923780|gb|AFW63712.1| hypothetical protein ZEAMMB73_689747 [Zea mays]
          Length = 470

 Score =  214 bits (546), Expect = 6e-53,   Method: Compositional matrix adjust.
 Identities = 156/455 (34%), Positives = 227/455 (49%), Gaps = 50/455 (10%)

Query: 68  AAESFPLNSSSSFSLPLHSRE-----ILHKT-RHNDYRSLVLSRLERDSARVNTLITKLQ 121
           +A SF  +S+ S S P+  ++     +L  T RH     L  S L   S   +TL    +
Sbjct: 39  SAASFAPSSTCSASDPVAPQQNDTFTVLRLTHRHGPCAPLRASSLAAPSV-ADTLRADQR 97

Query: 122 LAIYNVDRHELKPAEAQILPEDF-------STPVVSGASQGSGEYFSRIGVGTPPRQFSM 174
            A      H L+    +  P+ +       + P   G   G+  Y     +GTP    ++
Sbjct: 98  RA-----EHILRRVSGRGAPQLWDYKAAAATVPANWGYDIGTSNYVVTASLGTPGMAQTL 152

Query: 175 VLDTGSDINWLQCRPCT--ECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDV--SACRAN 230
            +DTGSD++W+QC+PC    CY+Q DP+FDP  SSSY+ +PC    C  L +  SAC A 
Sbjct: 153 EVDTGSDLSWVQCKPCAAPSCYRQKDPLFDPAQSSSYAAVPCGRSACAGLGIYASACSAA 212

Query: 231 RCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGH-DNEGLFVGSAGLLGLGGGM 289
           +C Y V+YGDGS T G   ++T++   + +V+G   GCGH  + GLF G  GLLG G   
Sbjct: 213 QCGYVVSYGDGSNTTGVYSSDTLTLAANATVQGFLFGCGHAQSGGLFTGIDGLLGFGREQ 272

Query: 290 LSLTKQIKATS---LAYCLVDRDSPASGVLEFNSARGGDAVTAP------LIRNKKVDTF 340
            SL +Q         +YCL  + S  +G L      GG +  AP      L+ +    T+
Sbjct: 273 PSLVQQTAGAYGGVFSYCLPTKSS-TTGYLTL----GGPSGVAPGFSTTQLLPSPNAPTY 327

Query: 341 YYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAG 400
           Y V LTG SVGGQ + +P S F        G +VD GT ITRL   AY +LR +F     
Sbjct: 328 YVVMLTGISVGGQPLSVPASAFAA------GTVVDTGTVITRLPPAAYAALRSAFRSGMA 381

Query: 401 NLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFA 460
           +      + + DTCY F+G  +V + +V+L F +G  + L A   +    S G   FA +
Sbjct: 382 SYPSAPPIGILDTCYSFAGYGTVNLTSVALTFSSGATMTLGADGIM----SFGCLAFASS 437

Query: 461 PTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            +  +++I+GNVQQ+   V  D   + VGF P+ C
Sbjct: 438 GSDGSMAILGNVQQRSFEVRID--GSSVGFRPSSC 470


>gi|449509162|ref|XP_004163513.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
          Length = 417

 Score =  214 bits (546), Expect = 6e-53,   Method: Compositional matrix adjust.
 Identities = 133/422 (31%), Positives = 221/422 (52%), Gaps = 42/422 (9%)

Query: 97  DYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGS 156
           D+  +  +R+  D+  VN+L +  + AI+    H+L  +++QI       P+ SGA   +
Sbjct: 13  DWEKIFQNRIILDAINVNSLFSHFKSAIFPGQTHQL--SDSQI-------PISSGARLQT 63

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAA 216
             Y   +G+G   +  ++++DTGSD+ W+QC PC  CY Q +P+F+P  SSS+  LPC +
Sbjct: 64  LNYIVTVGIGG--QNSTLIVDTGSDLTWVQCLPCRLCYNQQEPLFNPSNSSSFLSLPCNS 121

Query: 217 PQCKSLDVSA-----C---RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGC 268
           P C +L  +A     C    +  C YQ+ YGDGS++ G+L  E ++ G +  +     GC
Sbjct: 122 PTCVALQPTAGSSGLCSNKNSTSCDYQIDYGDGSYSRGELGFEKLTLGKT-EIDNFIFGC 180

Query: 269 GHDNEGLFVGSAGLLGLGGGMLSLTKQIKA---TSLAYCLVDRDSPASGVL-----EFNS 320
           G +N+GLF G++GL+GL    LSL  Q  +   +  +YCL      +SG L     +F++
Sbjct: 181 GRNNKGLFGGASGLMGLARSELSLVSQTSSLFGSVFSYCLPTTGVGSSGSLTLGGADFSN 240

Query: 321 ARGGDAVT-APLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGI--IVDCG 377
            +    ++   +I+N ++  FY++ LTG S+GG  + +P        + + G+  ++D G
Sbjct: 241 FKNISPISYTRMIQNPQMSNFYFLNLTGISIGGVNLNVP------RLSSNEGVLSLLDSG 294

Query: 378 TAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHF--GAG 435
           T ITRL    Y + +  F +     + T G ++ +TC++ +G   V +PTV   F   A 
Sbjct: 295 TVITRLSPSIYKAFKAEFEKQFSGYRTTPGFSILNTCFNLTGYEEVNIPTVKFIFEGNAE 354

Query: 436 KALDLPAKNYLIPVDSAGTFCFAFAPT--SSALSIIGNVQQQGTRVSFDLANNRVGFTPN 493
             +D+    Y +  D A   C AFA         IIGN QQ+  RV ++   ++VGF   
Sbjct: 355 MIVDVEGVFYFVKSD-ASQICLAFASLGYEDQTMIIGNYQQKNQRVIYNSKESKVGFAGE 413

Query: 494 KC 495
            C
Sbjct: 414 PC 415


>gi|168054484|ref|XP_001779661.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162668975|gb|EDQ55572.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 419

 Score =  214 bits (546), Expect = 7e-53,   Method: Compositional matrix adjust.
 Identities = 127/371 (34%), Positives = 191/371 (51%), Gaps = 20/371 (5%)

Query: 142 EDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIF 201
            DF +PVVSG++ GSG+YF    +GTPP++FS+++D+GSD+ W+QC PC +CY Q  P++
Sbjct: 48  HDFQSPVVSGSTLGSGQYFVDFFLGTPPQKFSLIVDSGSDLLWVQCAPCLQCYAQDTPLY 107

Query: 202 DPKTSSSYSPLPCAAPQC------KSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSF 255
            P  SS+++P+PC +P+C      +           C Y+  Y D S + G    E+ + 
Sbjct: 108 APSNSSTFNPVPCLSPECLLIPATEGFPCDFHYPGACAYEYRYADTSLSKGVFAYESATV 167

Query: 256 GNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQI---KATSLAYCLVDRDSP- 311
            +   +  +A GCG DN+G F  + G+LGLG G LS   Q+        AYCLV+   P 
Sbjct: 168 DDV-RIDKVAFGCGRDNQGSFAAAGGVLGLGQGPLSFGSQVGYAYGNKFAYCLVNYLDPT 226

Query: 312 -ASGVLEFNS---ARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEA 367
             S  L F     +   D    P++ N +  T YYV +    VGG+++ I  S + +D  
Sbjct: 227 SVSSWLIFGDELISTIHDLQFTPIVSNSRNPTLYYVQIEKVMVGGESLPISHSAWSLDFL 286

Query: 368 GDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPT 427
           G+GG I D GT +T     AY ++  +F +     +  S V   D C D +G+     P+
Sbjct: 287 GNGGSIFDSGTTVTYWLPPAYRNILAAFDKNVRYPRAAS-VQGLDLCVDVTGVDQPSFPS 345

Query: 428 VSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFA--PTS-SALSIIGNVQQQGTRVSFDLA 484
            ++  G G        NY + V +    C A A  P+S    + IGN+ QQ   V +D  
Sbjct: 346 FTIVLGGGAVFQPQQGNYFVDV-APNVQCLAMAGLPSSVGGFNTIGNLLQQNFLVQYDRE 404

Query: 485 NNRVGFTPNKC 495
            NR+GF P KC
Sbjct: 405 ENRIGFAPAKC 415


>gi|223950123|gb|ACN29145.1| unknown [Zea mays]
 gi|413923785|gb|AFW63717.1| hypothetical protein ZEAMMB73_445506 [Zea mays]
          Length = 385

 Score =  214 bits (544), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 138/365 (37%), Positives = 202/365 (55%), Gaps = 25/365 (6%)

Query: 142 EDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIF 201
            D + P   G S  + EY   +G+G+P    +M++DTGSD++W+QC+PC++C+ Q+DP+F
Sbjct: 35  SDATVPTALGTSLNTLEYLITVGLGSPATSQTMLIDTGSDVSWVQCKPCSQCHSQADPLF 94

Query: 202 DPKTSSSYSPLPCAAPQCKSL--DVSAC-RANRCLYQVAYGDGSFTVGDLVTETVSFGNS 258
           DP +SS+YSP  C +  C  L  + + C  +++C Y V YGDGS T G   ++T++ G+S
Sbjct: 95  DPSSSSTYSPFSCGSADCAQLGQEGNGCSSSSQCQYIVTYGDGSSTTGTYSSDTLALGSS 154

Query: 259 GSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASGV 315
            +V+    GC +   G    + GL+GLGGG  SL  Q   T   + +YCL    S +SG 
Sbjct: 155 -AVRSFQFGCSNVESGFNDQTDGLMGLGGGAQSLVSQTAGTLGRAFSYCLPPTPS-SSGF 212

Query: 316 LEFNSARGGDA---VTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGI 372
           L   +A G      V  P++R+ +V TFY V L    VGG+ + IP S+F        G 
Sbjct: 213 LTLGAAGGSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVFS------AGT 266

Query: 373 IVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHF 432
           ++D GT ITRL   AY++L  +F        P     + DTC+DFSG  SV +P+V+L F
Sbjct: 267 VMDSGTVITRLPPTAYSALSSAFKAGMKQYPPAQPSGILDTCFDFSGQSSVSIPSVALVF 326

Query: 433 GAGKALDLPAKNYLIPVDSAGTFCFAFAPTS--SALSIIGNVQQQGTRVSFDLANNRVGF 490
             G  + L A   ++      + C AFA  S  S+L IIGNVQQ+   V +D+    VGF
Sbjct: 327 SGGAVVSLDASGIIL------SNCLAFAGNSDDSSLGIIGNVQQRTFEVLYDVGRGVVGF 380

Query: 491 TPNKC 495
               C
Sbjct: 381 RAGAC 385


>gi|357154085|ref|XP_003576664.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 509

 Score =  213 bits (541), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 141/419 (33%), Positives = 208/419 (49%), Gaps = 53/419 (12%)

Query: 106 LERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGV 165
           L++D ARV++++  +               E   +    S P   G S G+G Y   +G+
Sbjct: 114 LDQDQARVDSILGMIT-------------NETSAVGPGVSLPAERGISVGTGNYVVSVGL 160

Query: 166 GTPPRQFSMVLDTGSDINWLQCRPCTE--CYQQSDPIFDPKTSSSYSPLPCAAPQCKSLD 223
           GTP R  ++V DTGSD++W+QC PC+   CY+Q DP+F P  SS++S + C A +C++  
Sbjct: 161 GTPARDLTVVFDTGSDLSWVQCGPCSSGGCYKQQDPLFAPSDSSTFSAVRCGARECRARQ 220

Query: 224 VSACRA----NRCLYQVAYGDGSFTVGDLVTETVSFG----------NSGSVKGIALGCG 269
             +C      +RC Y+V YGD S T G L  +T++ G          N   + G   GCG
Sbjct: 221 --SCGGSPGDDRCPYEVVYGDKSRTQGHLGNDTLTLGTMAPANASAENDNKLPGFVFGCG 278

Query: 270 HDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASGVLEFNS--ARGG 324
            +N GLF  + GL GLG G +SL+ Q         +YCL    S A G L   +      
Sbjct: 279 ENNTGLFGQADGLFGLGRGKVSLSSQAAGKFGEGFSYCLPSSSSSAPGYLSLGTPVPAPA 338

Query: 325 DAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQ 384
            A   P++      +FYYV L G  V G+A+++      +       +IVD GT ITRL 
Sbjct: 339 HAQFTPMLNRTTTPSFYYVKLVGIRVAGRAIRVSSPRVALP------LIVDSGTVITRLA 392

Query: 385 TQAYNSLRDSFVRLAGN--LKPTSGVALFDTCYDFSGL--RSVRVPTVSLHFGAGK--AL 438
            +AY +LR +F+   G    K    +++ DTCYDF+     +V +P V+L F  G   ++
Sbjct: 393 PRAYRALRAAFLSAMGKYGYKRAPRLSILDTCYDFTAHANATVSIPAVALVFAGGATISV 452

Query: 439 DLPAKNYLIPVDSAGTFCFAFAPTSSALS--IIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           D     Y+  V  A   C AFAP     S  I+GN QQ+   V +D+A  ++GF    C
Sbjct: 453 DFSGVLYVAKVAQA---CLAFAPNGDGRSAGILGNTQQRTLAVVYDVARQKIGFAAKGC 508


>gi|224130878|ref|XP_002320947.1| predicted protein [Populus trichocarpa]
 gi|222861720|gb|EEE99262.1| predicted protein [Populus trichocarpa]
          Length = 440

 Score =  212 bits (539), Expect = 4e-52,   Method: Compositional matrix adjust.
 Identities = 136/403 (33%), Positives = 204/403 (50%), Gaps = 30/403 (7%)

Query: 107 ERDSARVNTLITKLQLAIYNVDRHELKP-AEAQILPEDFSTPVVSGASQGSGEYFSRIGV 165
           E D  R+N     L+ +I  V  H   P A A + P+   + V S      GEY   + +
Sbjct: 51  ETDLQRINN---ALRRSISRV--HHFDPIAAASVSPKAAESDVTSN----RGEYLMSLSL 101

Query: 166 GTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVS 225
           GTPP +   + DTGSD+ W QC+PC  CY+Q DP+FDPK+S +Y    C A QC  LD S
Sbjct: 102 GTPPFKIMGIADTGSDLIWTQCKPCERCYKQVDPLFDPKSSKTYRDFSCDARQCSLLDQS 161

Query: 226 ACRANRCLYQVAYGDGSFTVGDLVTETVSF----GNSGSVKGIALGCGHDNEGLFVGS-A 280
            C  N C YQ +YGD S+T+G++ ++T++     G+  S     +GCGH+N+G F    +
Sbjct: 162 TCSGNICQYQYSYGDRSYTMGNVASDTITLDSTTGSPVSFPKTVIGCGHENDGTFSDKGS 221

Query: 281 GLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPA--SGVLEFNS---ARGGDAVTAPLI 332
           G++GLG G LSL  Q+ ++     +YCLV   S A  S  L F S     G    + PL+
Sbjct: 222 GIVGLGAGPLSLISQMGSSVGGKFSYCLVPLSSRAGNSSKLNFGSNAVVSGPGVQSTPLL 281

Query: 333 RNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLR 392
            ++ + +FY++ L   SVG + ++   S       G+G II+D GT +T +    +++L 
Sbjct: 282 SSETMSSFYFLTLEAMSVGNERIKFGDSSL---GTGEGNIIIDSGTTLTIVPDDFFSNLS 338

Query: 393 DSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSA 452
            +        +          CY  S    ++VP ++ HF  G  + L   N  + V S 
Sbjct: 339 TAVGNQVEGRRAEDPSGFLSVCY--SATSDLKVPAITAHF-TGADVKLKPINTFVQV-SD 394

Query: 453 GTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
              C AFA T+S +SI GNV Q    V +++    + F P  C
Sbjct: 395 DVVCLAFASTTSGISIYGNVAQMNFLVEYNIQGKSLSFKPTDC 437


>gi|125543634|gb|EAY89773.1| hypothetical protein OsI_11315 [Oryza sativa Indica Group]
          Length = 474

 Score =  211 bits (537), Expect = 7e-52,   Method: Compositional matrix adjust.
 Identities = 135/364 (37%), Positives = 184/364 (50%), Gaps = 28/364 (7%)

Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAP 217
           EY   + +GTPP+   ++LDTGSD+ W QC PC  C++QS P F+P  S ++S LPC   
Sbjct: 110 EYLVHMAIGTPPQPVQLILDTGSDLTWTQCAPCVSCFRQSLPRFNPSRSMTFSVLPCDLR 169

Query: 218 QCKSLDVSACRANR-----CLYQVAYGDGSFTVGDLVTETVSFGNS------GSVKGIAL 266
            C+ L  S+C         C+Y  AY D S T G L ++T SF ++       SV  +  
Sbjct: 170 ICRDLTWSSCGEQSWGNGICVYAYAYADHSITTGHLDSDTFSFASADHAIGGASVPDLTF 229

Query: 267 GCGHDNEGLFVGS-AGLLGLGGGMLSLTKQIKATSLAYCLV----DRDSPA-SGV---LE 317
           GCG  N G+FV +  G+ G   G LS+  Q+K  + +YC         SP   GV   L 
Sbjct: 230 GCGLFNNGIFVSNETGIAGFSRGALSMPAQLKVDNFSYCFTAITGSEPSPVFLGVPPNLY 289

Query: 318 FNSARGGDAV--TAPLIRNKKVD-TFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIV 374
            ++A GG  V  +  LIR        YY+ L G +VG   + IP S+F + E G GG IV
Sbjct: 290 SDAAGGGHGVVQSTALIRYHSSQLKAYYISLKGVTVGTTRLPIPESVFALKEDGTGGTIV 349

Query: 375 DCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGA 434
           D GT +T L    YN + D+FV         S  +L   C+         VP + LHF  
Sbjct: 350 DSGTGMTMLPEAVYNLVCDAFVAQTKLTVHNSTSSLSQLCFSVPPGAKPDVPALVLHF-E 408

Query: 435 GKALDLPAKNYLIPVDSAGTF---CFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFT 491
           G  LDLP +NY+  ++ AG     C A       LS+IGN QQQ   V +DLAN+ + F 
Sbjct: 409 GATLDLPRENYMFEIEEAGGIRLTCLAIN-AGEDLSVIGNFQQQNMHVLYDLANDMLSFV 467

Query: 492 PNKC 495
           P +C
Sbjct: 468 PARC 471


>gi|115452683|ref|NP_001049942.1| Os03g0317300 [Oryza sativa Japonica Group]
 gi|108707833|gb|ABF95628.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113548413|dbj|BAF11856.1| Os03g0317300 [Oryza sativa Japonica Group]
          Length = 448

 Score =  211 bits (536), Expect = 8e-52,   Method: Compositional matrix adjust.
 Identities = 135/364 (37%), Positives = 184/364 (50%), Gaps = 28/364 (7%)

Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAP 217
           EY   + +GTPP+   ++LDTGSD+ W QC PC  C++QS P F+P  S ++S LPC   
Sbjct: 84  EYLVHMAIGTPPQPVQLILDTGSDLTWTQCAPCVSCFRQSLPRFNPSRSMTFSVLPCDLR 143

Query: 218 QCKSLDVSACRANR-----CLYQVAYGDGSFTVGDLVTETVSFGNS------GSVKGIAL 266
            C+ L  S+C         C+Y  AY D S T G L ++T SF ++       SV  +  
Sbjct: 144 ICRDLTWSSCGEQSWGNGICVYAYAYADHSITTGHLDSDTFSFASADHAIGGASVPDLTF 203

Query: 267 GCGHDNEGLFVGS-AGLLGLGGGMLSLTKQIKATSLAYCLV----DRDSPA-SGV---LE 317
           GCG  N G+FV +  G+ G   G LS+  Q+K  + +YC         SP   GV   L 
Sbjct: 204 GCGLFNNGIFVSNETGIAGFSRGALSMPAQLKVDNFSYCFTAITGSEPSPVFLGVPPNLY 263

Query: 318 FNSARGGDAV--TAPLIRNKKVD-TFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIV 374
            ++A GG  V  +  LIR        YY+ L G +VG   + IP S+F + E G GG IV
Sbjct: 264 SDAAGGGHGVVQSTALIRYHSSQLKAYYISLKGVTVGTTRLPIPESVFALKEDGTGGTIV 323

Query: 375 DCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGA 434
           D GT +T L    YN + D+FV         S  +L   C+         VP + LHF  
Sbjct: 324 DSGTGMTMLPEAVYNLVCDAFVAQTKLTVHNSTSSLSQLCFSVPPGAKPDVPALVLHF-E 382

Query: 435 GKALDLPAKNYLIPVDSAGTF---CFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFT 491
           G  LDLP +NY+  ++ AG     C A       LS+IGN QQQ   V +DLAN+ + F 
Sbjct: 383 GATLDLPRENYMFEIEEAGGIRLTCLAIN-AGEDLSVIGNFQQQNMHVLYDLANDMLSFV 441

Query: 492 PNKC 495
           P +C
Sbjct: 442 PARC 445


>gi|125586057|gb|EAZ26721.1| hypothetical protein OsJ_10629 [Oryza sativa Japonica Group]
          Length = 474

 Score =  211 bits (536), Expect = 9e-52,   Method: Compositional matrix adjust.
 Identities = 135/364 (37%), Positives = 184/364 (50%), Gaps = 28/364 (7%)

Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAP 217
           EY   + +GTPP+   ++LDTGSD+ W QC PC  C++QS P F+P  S ++S LPC   
Sbjct: 110 EYLVHMAIGTPPQPVQLILDTGSDLTWTQCAPCVSCFRQSLPRFNPSRSMTFSVLPCDLR 169

Query: 218 QCKSLDVSACRANR-----CLYQVAYGDGSFTVGDLVTETVSFGNS------GSVKGIAL 266
            C+ L  S+C         C+Y  AY D S T G L ++T SF ++       SV  +  
Sbjct: 170 ICRDLTWSSCGEQSWGNGICVYAYAYADHSITTGHLDSDTFSFASADHAIGGASVPDLTF 229

Query: 267 GCGHDNEGLFVGS-AGLLGLGGGMLSLTKQIKATSLAYCLV----DRDSPA-SGV---LE 317
           GCG  N G+FV +  G+ G   G LS+  Q+K  + +YC         SP   GV   L 
Sbjct: 230 GCGLFNNGIFVSNETGIAGFSRGALSMPAQLKVDNFSYCFTAITGSEPSPVFLGVPPNLY 289

Query: 318 FNSARGGDAV--TAPLIRNKKVD-TFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIV 374
            ++A GG  V  +  LIR        YY+ L G +VG   + IP S+F + E G GG IV
Sbjct: 290 SDAAGGGHGVVQSTALIRYHSSQLKAYYISLKGVTVGTTRLPIPESVFALKEDGTGGTIV 349

Query: 375 DCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGA 434
           D GT +T L    YN + D+FV         S  +L   C+         VP + LHF  
Sbjct: 350 DSGTGMTMLPEAVYNLVCDAFVAQTKLTVHNSTSSLSQLCFSVPPGAKPDVPALVLHF-E 408

Query: 435 GKALDLPAKNYLIPVDSAGTF---CFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFT 491
           G  LDLP +NY+  ++ AG     C A       LS+IGN QQQ   V +DLAN+ + F 
Sbjct: 409 GATLDLPRENYMFEIEEAGGIRLTCLAIN-AGEDLSVIGNFQQQNMHVLYDLANDMLSFV 467

Query: 492 PNKC 495
           P +C
Sbjct: 468 PARC 471


>gi|255543383|ref|XP_002512754.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223547765|gb|EEF49257.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 414

 Score =  210 bits (535), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 137/421 (32%), Positives = 211/421 (50%), Gaps = 40/421 (9%)

Query: 94  RHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGAS 153
           +  D+   +   L  D  RV +L ++++ +I++   + +   ++QI       P+ SG  
Sbjct: 12  KSTDWNKKLQKSLILDDFRVRSLQSRIK-SIFS--GNNIDALDSQI-------PLSSGVR 61

Query: 154 QGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLP 213
             +  Y   + +G   R  ++++DTGSD+ W+QC+PC  CY Q DP+F+P  S SY  + 
Sbjct: 62  LQTLNYIVTVEIGG--RNMTVIVDTGSDLTWVQCQPCRLCYNQQDPLFNPSGSPSYQTIL 119

Query: 214 CAAPQCKSL-----DVSACRAN--RCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIAL 266
           C +  C+SL     ++  C +N   C Y V YGDGS+T GDL  E ++ G +  V     
Sbjct: 120 CNSSTCQSLQYATGNLGVCGSNTPTCNYVVNYGDGSYTRGDLGMEQLNLGTT-HVSNFIF 178

Query: 267 GCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASGVLEFNSARG 323
           GCG +N+GLF G++GL+GLG   LSL  Q  A      +YCL    + ASG L       
Sbjct: 179 GCGRNNKGLFGGASGLMGLGKSDLSLVSQTSAIFEGVFSYCLPTTAADASGSLILGGNSS 238

Query: 324 GDAVTAP-----LIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGT 378
               T P     +I N ++ TFY++ LTG S+GG A+Q P            GI++D GT
Sbjct: 239 VYKNTTPISYTRMIANPQLPTFYFLNLTGISIGGVALQAP-------NYRQSGILIDSGT 291

Query: 379 AITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKAL 438
            ITRL    Y  L+  F++           ++ DTC++ +G   V +PT+ + F     L
Sbjct: 292 VITRLPPPVYRDLKAEFLKQFSGFPSAPPFSILDTCFNLNGYDEVDIPTIRMQFEGNAEL 351

Query: 439 --DLPAKNYLIPVDSAGTFCFAFAPTS--SALSIIGNVQQQGTRVSFDLANNRVGFTPNK 494
             D+    Y +  D A   C A A  S    + IIGN QQ+  RV ++   +++GF    
Sbjct: 352 TVDVTGIFYFVKTD-ASQVCLALASLSFDDEIPIIGNYQQRNQRVIYNTKESKLGFAAEA 410

Query: 495 C 495
           C
Sbjct: 411 C 411


>gi|357487631|ref|XP_003614103.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355515438|gb|AES97061.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 431

 Score =  210 bits (534), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 137/419 (32%), Positives = 210/419 (50%), Gaps = 30/419 (7%)

Query: 95  HNDYRSLVLSRLERDSAR---VNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSG 151
           H+   S     + RDS++         K Q  + N  R  +  A  ++  +  S    S 
Sbjct: 22  HSLRNSFSFELIHRDSSKSPLYKPAQNKFQHVV-NAARRSINRAN-RLFKDSLSNTPEST 79

Query: 152 ASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSP 211
                GEY     VGTPP     V+DTGSDI WLQC+PC +CY+Q+ PIF+P  SSSY  
Sbjct: 80  VYVNGGEYLMTYSVGTPPFNVYGVVDTGSDIVWLQCKPCEQCYKQTTPIFNPSKSSSYKN 139

Query: 212 LPCAAPQCKSLDVSAC-RANRCLYQVAYGDGSFTVGDLVTETVSF----GNSGSVKGIAL 266
           +PC++  C+S+  ++C + N C Y + + D S++ G+L  ET++     G+S S     +
Sbjct: 140 IPCSSNLCQSVRYTSCNKQNSCEYTINFSDQSYSQGELSVETLTLDSTTGHSVSFPKTVI 199

Query: 267 GCGHDNEGLFVG-SAGLLGLGGGMLSLTKQIKAT---SLAYCLVDR--DSPASGVLEFNS 320
           GCGH+N G+F G ++G++GLG G +SLT Q+K++     +YCL+    DS  +  L F  
Sbjct: 200 GCGHNNRGMFQGETSGIVGLGIGPVSLTTQLKSSIGGKFSYCLLPLLVDSNKTSKLNFGD 259

Query: 321 A---RGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFE-MDEAGDGGIIVDC 376
           A    G   V+ P ++ K    FYY+ L  FSVG + ++     FE +D++ +G II+D 
Sbjct: 260 AAVVSGDGVVSTPFVK-KDPQAFYYLTLEAFSVGNKRIE-----FEVLDDSEEGNIILDS 313

Query: 377 GTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGK 436
           GT +T L +  Y +L  +  +L    +      L + CY  +       P ++ HF    
Sbjct: 314 GTTLTLLPSHVYTNLESAVAQLVKLDRVDDPNQLLNLCYSITS-DQYDFPIITAHFKGAD 372

Query: 437 ALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
               P   +    D  G  C AF  + +   I GN+ Q    V +DL  N V F P+ C
Sbjct: 373 IKLNPISTFAHVAD--GVVCLAFTSSQTG-PIFGNLAQLNLLVGYDLQQNIVSFKPSDC 428


>gi|326495920|dbj|BAJ90582.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 476

 Score =  210 bits (534), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 142/357 (39%), Positives = 187/357 (52%), Gaps = 35/357 (9%)

Query: 155 GSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTE--CYQQSDPIFDPKTSSSYSPL 212
           G+ +Y   + +GTP    ++ +DTGSD++W+QC+PC+   C  Q D +FDP  SS+YS +
Sbjct: 139 GTFQYVVTVSLGTPGVSQTVEVDTGSDVSWVQCKPCSAPACNSQRDQLFDPAKSSTYSAV 198

Query: 213 PCAAPQCKSLDV--SACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGH 270
           PC A  C  L +  + C  ++C Y V+YGDGS T G   ++T++     +V     GCGH
Sbjct: 199 PCGADACSELRIYEAGCSGSQCGYVVSYGDGSNTTGVYGSDTLALAPGNTVGTFLFGCGH 258

Query: 271 DNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASGVLEFN--SARGGD 325
              G+F G  GLL LG   +SL  Q         +YCL  + S A+G L     S+  G 
Sbjct: 259 AQAGMFAGIDGLLALGRQSMSLKSQAAGAYGGVFSYCLPSKQS-AAGYLTLGGPSSASGF 317

Query: 326 AVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQT 385
           A T  L+      TFY V LTG SVGGQ V +P S F       GG +VD GT ITRL  
Sbjct: 318 ATTG-LLTAWAAPTFYMVMLTGISVGGQQVAVPASAFA------GGTVVDTGTVITRLPP 370

Query: 386 QAYNSLRDSFVRLAGNLKPTS-----GVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDL 440
            AY +LR +F    G + P          + DTCYDFS    V +PTV+L F  G  L L
Sbjct: 371 TAYAALRSAF---RGAIAPCGYPSAPANGILDTCYDFSRYGVVTLPTVALTFSGGATLAL 427

Query: 441 PAKNYLIPVDSAGTFCFAFAPTS--SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            A   L    S+G  C AFAP       +I+GNVQQ+   V FD   + VGF P  C
Sbjct: 428 EAPGIL----SSG--CLAFAPNGGDGDAAILGNVQQRSFAVRFD--GSTVGFMPGAC 476


>gi|356569424|ref|XP_003552901.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 536

 Score =  209 bits (533), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 124/373 (33%), Positives = 191/373 (51%), Gaps = 27/373 (7%)

Query: 150 SGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSY 209
           SGAS G+GEYF  + VGTPP+   ++LDTGSD++W+QC PC +C++Q+ P ++P  SSSY
Sbjct: 161 SGASLGTGEYFIDMFVGTPPKHVWLILDTGSDLSWIQCDPCYDCFEQNGPHYNPNESSSY 220

Query: 210 SPLPCAAPQCKSLD----VSACRANR--CLYQVAYGDGSFTVGDLVTETVSFGNSG---- 259
             + C  P+C+ +     +  C+     C Y   Y DGS T GD   ET +   +     
Sbjct: 221 RNISCYDPRCQLVSSPDPLQHCKTENQTCPYFYDYADGSNTTGDFALETFTVNLTWPNGK 280

Query: 260 ----SVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPA 312
                V  +  GCGH N+G F G+ GLLGLG G LS   Q+++    S +YCL D  S  
Sbjct: 281 EKFKHVVDVMFGCGHWNKGFFHGAGGLLGLGRGPLSFPSQLQSIYGHSFSYCLTDLFSNT 340

Query: 313 SGVLEFNSARGGDAVTAPLIRNKKV--------DTFYYVGLTGFSVGGQAVQIPPSLFEM 364
           S   +       + +    +   K+        DTFYY+ +    VGG+ + IP   +  
Sbjct: 341 SVSSKLIFGEDKELLNHHNLNFTKLLAGEETPDDTFYYLQIKSIVVGGEVLDIPEKTWHW 400

Query: 365 DEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVR 424
              G GG I+D G+ +T     AY+ ++++F +     +  +   +   CY+ SG   V 
Sbjct: 401 SSEGVGGTIIDSGSTLTFFPDSAYDVIKEAFEKKIKLQQIAADDFIMSPCYNVSGAMQVE 460

Query: 425 VPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAF--APTSSALSIIGNVQQQGTRVSFD 482
           +P   +HF  G   + PA+NY    +     C A    P  S L+IIGN+ QQ   + +D
Sbjct: 461 LPDYGIHFADGAVWNFPAENYFYQYEPDEVICLAILKTPNHSHLTIIGNLLQQNFHILYD 520

Query: 483 LANNRVGFTPNKC 495
           +  +R+G++P +C
Sbjct: 521 VKRSRLGYSPRRC 533


>gi|224072755|ref|XP_002303865.1| predicted protein [Populus trichocarpa]
 gi|222841297|gb|EEE78844.1| predicted protein [Populus trichocarpa]
          Length = 412

 Score =  209 bits (533), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 132/418 (31%), Positives = 209/418 (50%), Gaps = 40/418 (9%)

Query: 97  DYRSLVLSRLERDSARVNTLITKLQLAIY--NVDRHELKPAEAQILPEDFSTPVVSGASQ 154
           D+   +  RL  D+ ++ +L ++++  I   N+D       + QI       P+ SG   
Sbjct: 13  DWNKKLQKRLIMDNFQLRSLQSRIKNIILSGNID----DSVDTQI-------PLTSGIRL 61

Query: 155 GSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPC 214
            S  Y   + +G   R+ ++++DTGSD++W+QC+PC  CY Q DP+F+P  S SY  + C
Sbjct: 62  QSLNYIVTVELGG--RKMTVIVDTGSDLSWVQCQPCNRCYNQQDPVFNPSKSPSYRTVLC 119

Query: 215 AAPQCKSLDVS-----ACRAN--RCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALG 267
            +  C+SL ++      C +N   C Y V YGDGS+T G++  E ++ GN+ +V     G
Sbjct: 120 NSLTCRSLQLATGNSGVCGSNPPTCNYVVNYGDGSYTSGEVGMEHLNLGNT-TVNNFIFG 178

Query: 268 CGHDNEGLFVGSAGLLGLGGGMLSLTKQIK---ATSLAYCLVDRDSPASGVLEFNSARGG 324
           CG  N+GLF G++GL+GLG   LSL  QI        +YCL   ++ ASG L        
Sbjct: 179 CGRKNQGLFGGASGLVGLGRTDLSLISQISPMFGGVFSYCLPTTEAEASGSLVMGGNSSV 238

Query: 325 DAVTAPLIRNKKVDT----FYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAI 380
              T P+   + +      FY++ LTG +VGG  VQ P         G   +I+D GT I
Sbjct: 239 YKNTTPISYTRMIHNPLLPFYFLNLTGITVGGVEVQAP-------SFGKDRMIIDSGTVI 291

Query: 381 TRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDL 440
           +RL    Y +L+  FV+            + D+C++ SG + V++P + ++F     L++
Sbjct: 292 SRLPPSIYQALKAEFVKQFSGYPSAPSFMILDSCFNLSGYQEVKIPDIKMYFEGSAELNV 351

Query: 441 PAKNYLIPVDS-AGTFCFAFA--PTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
                   V + A   C A A  P    + IIGN QQ+  R+ +D   + +GF    C
Sbjct: 352 DVTGVFYSVKTDASQVCLAIASLPYEDEVGIIGNYQQKNQRIIYDTKGSMLGFAEEAC 409


>gi|255548660|ref|XP_002515386.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223545330|gb|EEF46835.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 387

 Score =  209 bits (533), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 142/400 (35%), Positives = 197/400 (49%), Gaps = 25/400 (6%)

Query: 106 LERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGV 165
           L +D  RV ++    + +  N   H  K  +A I       PV SG   G+G Y  ++ +
Sbjct: 2   LLQDQLRVKSM--HARFSNKNAGSH-FKEMQADI-------PVQSGIPLGAGNYLVKMAL 51

Query: 166 GTPPRQFSMVLDTGSDINWLQCRPCT-ECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDV 224
           GTP    S+ LDTGSDI W QC PC   CY+Q+   FDP+ SSSY  + C++  C+ +  
Sbjct: 52  GTPKLSLSLALDTGSDITWTQCEPCVGSCYRQAQTKFDPRKSSSYKNVSCSSSSCRIITD 111

Query: 225 SA----CRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLF--VG 278
           S     C ++ C+Y+V YGDGS++VG   TE ++   S  +     GCG  N G F  + 
Sbjct: 112 SGGARGCVSSTCIYKVQYGDGSYSVGFFATEKLTISPSDVISNFLFGCGQQNAGRFGRIA 171

Query: 279 SAGLLGLGGGMLSLTKQIKATSL-AYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNKKV 337
               LG G   L+L    K  +L  YCL    S ++G L             PL    K 
Sbjct: 172 GLLGLGRGKLSLALQTSEKYNNLFTYCLPSFSSSSTGHLTLGGQVPKSVKFTPLSPAFKN 231

Query: 338 DTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVR 397
             FY + + G SVGG  + I  S+F      + G I+D GT ITRLQ   Y++L   F +
Sbjct: 232 TPFYGIDIKGLSVGGHVLPIDASVFS-----NAGAIIDSGTVITRLQPTVYSALSSKFQQ 286

Query: 398 LAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCF 457
           L  +   T G ++ DTCYDFSG  S+ VP +S  F  G  +D+     L  +++    C 
Sbjct: 287 LMKDYPKTDGFSILDTCYDFSGNESISVPRISFFFKGGVEVDIKFFGILTVINAWDKVCL 346

Query: 458 AFAPTSS--ALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           AFAP        + GN QQQ   V  DLA  R+GF P+ C
Sbjct: 347 AFAPNDDDGDFVVFGNSQQQTYDVVHDLAKGRIGFAPSGC 386


>gi|242050426|ref|XP_002462957.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
 gi|241926334|gb|EER99478.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
          Length = 452

 Score =  209 bits (532), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 137/364 (37%), Positives = 187/364 (51%), Gaps = 23/364 (6%)

Query: 152 ASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCT-ECYQQSDPIFDPKTSSSYS 210
           A  G+G Y   + VGTPP  F  ++DTGSD+ W QC PCT  C+ Q  P++DP  SS++S
Sbjct: 89  AENGAGAYHMILSVGTPPLAFPAIIDTGSDLTWTQCAPCTTACFAQPTPLYDPARSSTFS 148

Query: 211 PLPCAAPQCKSLDVS--ACRANRCLYQVAYGDGSFTVGDLVTETVSFGN-------SGSV 261
            LPCA+P C++L  +  AC A  C+Y   Y  G FT G L  +T++ G+       S S 
Sbjct: 149 KLPCASPLCQALPSAFRACNATGCVYDYRYAVG-FTAGYLAADTLAIGDGDGDGDASSSF 207

Query: 262 KGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCL-VDRDSPASGVL--EF 318
            G+A GC   N G   G++G++GLG   LSL  QI     +YCL  D D+ AS +L    
Sbjct: 208 AGVAFGCSTANGGDMDGASGIVGLGRSALSLLSQIGVGRFSYCLRSDADAGASPILFGAL 267

Query: 319 NSARGGDAVTAPLIRN----KKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIV 374
            +  G    +  L+RN    ++   +YYV LTG +VG   + +  S F    AG GG+IV
Sbjct: 268 ANVTGDKVQSTALLRNPVAARRRAPYYYVNLTGIAVGSTDLPVTSSTFGFTAAGAGGVIV 327

Query: 375 DCGTAITRLQTQAYNSLRDSFV-RLAGNLKPTSGVAL-FDTCYDFSGLRSVRVPTVSLHF 432
           D GT  T L    Y  LR +F+ + AG L   SG    FD C++ +G     VP +   F
Sbjct: 328 DSGTTFTYLAEAGYTMLRQAFLSQTAGLLTRVSGAQFDFDLCFE-AGAADTPVPRLVFRF 386

Query: 433 GAGKALDLPAKNYLIPVDSAGTF-CFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFT 491
             G    +P ++Y   VD  G   C    PT   +S+IGNV Q    V +DL      F 
Sbjct: 387 AGGAEYAVPRQSYFDAVDEGGRVACLLVLPT-RGVSVIGNVMQMDLHVLYDLDGATFSFA 445

Query: 492 PNKC 495
           P  C
Sbjct: 446 PADC 449


>gi|326531454|dbj|BAJ97731.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 476

 Score =  209 bits (532), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 139/356 (39%), Positives = 183/356 (51%), Gaps = 33/356 (9%)

Query: 155 GSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTE--CYQQSDPIFDPKTSSSYSPL 212
           G+ +Y   + +GTP    ++ +DTGSD++W+QC+PC+   C  Q D +FDP  SS+YS +
Sbjct: 139 GTFQYVVTVSLGTPGVSQTVEVDTGSDVSWVQCKPCSAPACNSQRDQLFDPAKSSTYSAV 198

Query: 213 PCAAPQCKSLDV--SACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGH 270
           PC A  C  L +  + C  ++C Y V+YGDGS T G   ++T++     +V     GCGH
Sbjct: 199 PCGADACSELRIYEAGCSGSQCGYVVSYGDGSNTTGVYGSDTLALAPGNTVGTFLFGCGH 258

Query: 271 DNEGLFVGSAGLLGLGGGMLSLTKQIKATS---LAYCLVDRDSPASGVLEFNSARGGDA- 326
              G+F G  GLL LG   +SL  Q         +YCL  + S A+G L           
Sbjct: 259 AQAGMFAGIDGLLALGRQSMSLKSQAAGAYGGVFSYCLPSKQS-AAGYLTLGGPTSASGF 317

Query: 327 VTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQ 386
            T  L+      TFY V LTG SVGGQ V +P S F       GG +VD GT ITRL   
Sbjct: 318 ATTGLLTAWAAPTFYMVMLTGISVGGQQVAVPASAFA------GGTVVDTGTVITRLPPT 371

Query: 387 AYNSLRDSFVRLAGNLKP-----TSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLP 441
           AY +LR +F    G + P          + DTCYDFS    V +PTV+L F  G  L L 
Sbjct: 372 AYAALRSAF---RGAIAPYGYPSAPANGILDTCYDFSRYGVVTLPTVALTFSGGATLALE 428

Query: 442 AKNYLIPVDSAGTFCFAFAPTS--SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           A   L    S+G  C AFAP       +I+GNVQQ+   V FD   + VGF P  C
Sbjct: 429 APGIL----SSG--CLAFAPNGGDGDAAILGNVQQRSFAVRFD--GSTVGFMPGAC 476


>gi|224092218|ref|XP_002309514.1| predicted protein [Populus trichocarpa]
 gi|222855490|gb|EEE93037.1| predicted protein [Populus trichocarpa]
          Length = 474

 Score =  209 bits (531), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 163/468 (34%), Positives = 224/468 (47%), Gaps = 41/468 (8%)

Query: 43  ALQQTEHILSFEPETLEPFAEESETAAESFPLNSSSSFSLPLHSREILHK-----TRHND 97
           A++  EHI  +   TLE     S  A++S   + SS       S ++LHK        ND
Sbjct: 32  AVEANEHIKKY-VHTLEV---NSLLASDS--CDQSSKVIDKASSLQVLHKYGPCMQVLND 85

Query: 98  YRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSG 157
            RS V   L +D  RV+++  +L      +  H +       LP        SG + G+G
Sbjct: 86  -RSHV-EFLLQDQLRVDSIQARLS----KISGHGIFEEMVTKLPAQ------SGIAIGTG 133

Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCT-ECYQQSDPIFDPKTSSSYSPLPCAA 216
            Y   +G+GTP   F++V DTGS I W QC+PC   CY Q +  FDP  S+SY+ + C++
Sbjct: 134 NYVVTVGLGTPKEDFTLVFDTGSGITWTQCQPCLGSCYPQKEQKFDPTKSTSYNNVSCSS 193

Query: 217 PQCKSLDVS--ACRANR--CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDN 272
             C  L  S   C A+   CLYQ+ YGD S++ G   TET++  +S        GCG  N
Sbjct: 194 ASCNLLPTSERGCSASNSTCLYQIIYGDQSYSQGFFATETLTISSSDVFTNFLFGCGQSN 253

Query: 273 EGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASGVLEFNSARGGDAVTA 329
            GLF  +AGLLGL    +SL  Q         +YCL    S ++G L F       A   
Sbjct: 254 NGLFGQAAGLLGLSSSSVSLPSQTAEKYQKQFSYCLPSTPS-STGYLNFGGKVSQTAGFT 312

Query: 330 PLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYN 389
           P+  +    +FY + + G SV G  + I PS+F        G I+D GT ITRL   AY 
Sbjct: 313 PI--SPAFSSFYGIDIVGISVAGSQLPIDPSIFTTS-----GAIIDSGTVITRLPPTAYK 365

Query: 390 SLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPV 449
           +L+++F     N   T+G  L DTCYDFS   +V  P VS+ F  G  +D+ A   L  V
Sbjct: 366 ALKEAFDEKMSNYPKTNGDELLDTCYDFSNYTTVSFPKVSVSFKGGVEVDIDASGILYLV 425

Query: 450 DSAGTFCFAFAPTS--SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           +     C AFA     S   I GN QQ+   V +D A   +GF    C
Sbjct: 426 NGVKMVCLAFAANKDDSEFGIFGNHQQKTYEVVYDGAKGMIGFAAGAC 473


>gi|297842769|ref|XP_002889266.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297335107|gb|EFH65525.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 489

 Score =  209 bits (531), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 138/442 (31%), Positives = 217/442 (49%), Gaps = 49/442 (11%)

Query: 79  SFSLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQ 138
           S +L +  RE L   +  D+   +   L  D+ RV +L  +L++        E   +E Q
Sbjct: 68  STTLEMKHRE-LCSGKTIDWGKKMRRALLLDNIRVQSL--QLRIKAMTSSTTEQSVSETQ 124

Query: 139 ILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD 198
           I       P+ SG    +  Y   + +G   +  S+++DTGSD+ W+QC+PC  CY Q  
Sbjct: 125 I-------PLTSGIKLETLNYIVTVELG--GKNMSLIVDTGSDLTWVQCQPCRSCYNQQG 175

Query: 199 PIFDPKTSSSYSPLPCAAPQCKSLDVSACRANR-------------CLYQVAYGDGSFTV 245
           P++DP  SSSY  + C +  C+  D+ A   N              C Y V+YGDGS+T 
Sbjct: 176 PLYDPSVSSSYKTVFCNSSTCQ--DLVAATGNSGPCGGFNGVVKTTCEYVVSYGDGSYTR 233

Query: 246 GDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLA 302
           GDL +E++  G++  ++ +  GCG +N+GLF G++GL+GLG   +SL  Q   T     +
Sbjct: 234 GDLASESIVLGDT-KLENLVFGCGRNNKGLFGGASGLMGLGRSSVSLVSQTLKTFNGVFS 292

Query: 303 YCLVDRDSPASGVLEFNS-----ARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQI 357
           YCL   +  ASG L F +              PL++N ++ +FY + LTG S+GG  V++
Sbjct: 293 YCLPSLEDGASGTLSFGNDFSVYKNSTSVFYTPLVQNPQLRSFYILNLTGASIGG--VEL 350

Query: 358 PPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDF 417
               F        GI++D GT ITRL    Y +++  F++         G ++ DTC++ 
Sbjct: 351 KTLSFGR------GILIDSGTVITRLPPSIYKAVKTEFLKQFSGFPSAPGYSILDTCFNL 404

Query: 418 SGLRSVRVPTVSLHFGAGKAL--DLPAKNYLIPVDSAGTFCFAFAPTS--SALSIIGNVQ 473
           +    + +PT+ + F     L  D+    Y +  D A   C A A  S  + + IIGN Q
Sbjct: 405 TSYEDISIPTIKMIFEGNAELEVDVTGVFYFVKPD-ASLVCLALASLSYENEVGIIGNYQ 463

Query: 474 QQGTRVSFDLANNRVGFTPNKC 495
           Q+  RV +D    R+G     C
Sbjct: 464 QKNQRVIYDTTQERLGIAGENC 485


>gi|167997964|ref|XP_001751688.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162696786|gb|EDQ83123.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 418

 Score =  208 bits (530), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 128/369 (34%), Positives = 192/369 (52%), Gaps = 20/369 (5%)

Query: 144 FSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDP 203
           F +PVVSG++ GSG+YF    +GTPP++FS+++D+GSD+ W+QC PC +CY Q  P++ P
Sbjct: 49  FQSPVVSGSTLGSGQYFVDFFLGTPPQKFSLIVDSGSDLLWVQCSPCRQCYAQDSPLYVP 108

Query: 204 KTSSSYSPLPCAAPQCKSLDVSA---C---RANRCLYQVAYGDGSFTVGDLVTETVSFGN 257
             SS++SP+PC +  C  +  +    C       C Y+  Y D S + G    E+ +  +
Sbjct: 109 SNSSTFSPVPCLSSDCLLIPATEGFPCDFRYPGACAYEYLYADTSSSKGVFAYESATV-D 167

Query: 258 SGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQI---KATSLAYCLVDRDSP--A 312
              +  +A GCG DN+G F  + G+LGLG G LS   Q+        AYCLV+   P   
Sbjct: 168 GVRIDKVAFGCGSDNQGSFAAAGGVLGLGQGPLSFGSQVGYAYGNKFAYCLVNYLDPTSV 227

Query: 313 SGVLEFNS---ARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGD 369
           S  L F     +   D    P++ N K  T YYV +   +VGG+++ I  S +E+D  G+
Sbjct: 228 SSSLIFGDELISTIHDMQYTPIVSNPKSPTLYYVQIEKVTVGGKSLPISDSAWEIDLLGN 287

Query: 370 GGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVS 429
           GG I D GT +T     AY+ +  +F       +  S V   D C + +G+     P+ +
Sbjct: 288 GGSIFDSGTTLTYWFPSAYSHILAAFDSGVHYPRAES-VQGLDLCVELTGVDQPSFPSFT 346

Query: 430 LHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSAL---SIIGNVQQQGTRVSFDLANN 486
           + F  G      A+NY + V +    C A A  +S L   + IGN+ QQ   V +D   N
Sbjct: 347 IEFDDGAVFQPEAENYFVDV-APNVRCLAMAGLASPLGGFNTIGNLLQQNFFVQYDREEN 405

Query: 487 RVGFTPNKC 495
            +GF P KC
Sbjct: 406 LIGFAPAKC 414


>gi|356537728|ref|XP_003537377.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 543

 Score =  208 bits (529), Expect = 5e-51,   Method: Compositional matrix adjust.
 Identities = 124/386 (32%), Positives = 194/386 (50%), Gaps = 47/386 (12%)

Query: 150 SGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSY 209
           SGAS G+GEYF  + VGTPP+   ++LDTGSD++W+QC PC +C++Q+   + PK SS+Y
Sbjct: 162 SGASLGTGEYFLDMFVGTPPKHVWLILDTGSDLSWIQCDPCYDCFEQNGSHYYPKDSSTY 221

Query: 210 SPLPCAAPQCKSLDVS----ACRANR--CLYQVAYGDGSFTVGDLVTETVSFGNSG---- 259
             + C  P+C+ +  S     C+A    C Y   Y DGS T GD  +ET +   +     
Sbjct: 222 RNISCYDPRCQLVSSSDPLQHCKAENQTCPYFYDYADGSNTTGDFASETFTVNLTWPNGK 281

Query: 260 ----SVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVD--RDS 310
                V  +  GCGH N+G F G++GLLGLG G +S   QI++    S +YCL D   ++
Sbjct: 282 EKFKQVVDVMFGCGHWNKGFFYGASGLLGLGRGPISFPSQIQSIYGHSFSYCLTDLFSNT 341

Query: 311 PASGVLEFNSARGGDAVTAPLIRNKKV-------------DTFYYVGLTGFSVGGQAVQI 357
             S  L F   +        L+ N  +             +TFYY+ +    VGG+ + I
Sbjct: 342 SVSSKLIFGEDK-------ELLNNHNLNFTTLLAGEETPDETFYYLQIKSIMVGGEVLDI 394

Query: 358 PPSLFEMDEA-----GDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFD 412
               +            GG I+D G+ +T     AY+ ++++F +     +  +   +  
Sbjct: 395 SEQTWHWSSEGAAADAGGGTIIDSGSTLTFFPDSAYDIIKEAFEKKIKLQQIAADDFVMS 454

Query: 413 TCYDFSG-LRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAF--APTSSALSII 469
            CY+ SG +  V +P   +HF  G   + PA+NY    +     C A    P  S L+II
Sbjct: 455 PCYNVSGAMMQVELPDFGIHFADGGVWNFPAENYFYQYEPDEVICLAIMKTPNHSHLTII 514

Query: 470 GNVQQQGTRVSFDLANNRVGFTPNKC 495
           GN+ QQ   + +D+  +R+G++P +C
Sbjct: 515 GNLLQQNFHILYDVKRSRLGYSPRRC 540


>gi|242092368|ref|XP_002436674.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
 gi|241914897|gb|EER88041.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
          Length = 461

 Score =  208 bits (529), Expect = 6e-51,   Method: Compositional matrix adjust.
 Identities = 125/375 (33%), Positives = 186/375 (49%), Gaps = 44/375 (11%)

Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCA 215
           + EY   + VGTPPR  ++ LDTGSD+ W QC PC +C+ Q  P+ DP  SS+Y+ LPC 
Sbjct: 89  TNEYLVHLAVGTPPRPVALTLDTGSDLVWTQCAPCRDCFHQGLPLLDPAASSTYAALPCG 148

Query: 216 APQCKSLDVSAC---------RANR-CLYQVAYGDGSFTVGDLVTETVSFGNSGS----- 260
           AP+C++L  ++C           NR C Y   YGD S TVG++ T+  +FG         
Sbjct: 149 APRCRALPFTSCGGGGRSSWGNGNRSCAYIYHYGDKSVTVGEIATDRFTFGGDNGDGDSR 208

Query: 261 --VKGIALGCGHDNEGLFVGS-AGLLGLGGGMLSLTKQIKATSLAYCLVDR--------- 308
              + +  GCGH N+G+F  +  G+ G G G  SL  Q+  T+ +YC             
Sbjct: 209 LPTRRLTFGCGHFNKGVFQSNETGIAGFGRGRWSLPSQLNVTTFSYCFTSMFESKSSLVT 268

Query: 309 --DSPASGVLEFNSAR-GGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMD 365
              +PA+ +L  ++A   G+  T PL++N    + Y++ L G SVG   + +P       
Sbjct: 269 LGGAPAAALLYSHAAHISGEVRTTPLLKNPSQPSLYFLSLKGISVGKTRLAVP------- 321

Query: 366 EAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGV--ALFDTCYDF---SGL 420
           EA     I+D G +IT L    Y +++  F    G L PT  V  +  D C+     +  
Sbjct: 322 EAKLRSTIIDSGASITTLPEAVYEAVKAEFAAQVG-LPPTGVVEGSALDLCFALPVTALW 380

Query: 421 RSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVS 480
           R   VP+++LH   G   +LP  NY+    +A   C          ++IGN QQQ T V 
Sbjct: 381 RRPPVPSLTLHLD-GADWELPRGNYVFEDLAARVMCVVLDAAPGDQTVIGNFQQQNTHVV 439

Query: 481 FDLANNRVGFTPNKC 495
           +DL N+ + F P +C
Sbjct: 440 YDLENDWLSFAPARC 454


>gi|115472517|ref|NP_001059857.1| Os07g0533600 [Oryza sativa Japonica Group]
 gi|22831047|dbj|BAC15910.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|50508280|dbj|BAD32129.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|113611393|dbj|BAF21771.1| Os07g0533600 [Oryza sativa Japonica Group]
 gi|215766673|dbj|BAG98901.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 441

 Score =  208 bits (529), Expect = 6e-51,   Method: Compositional matrix adjust.
 Identities = 122/350 (34%), Positives = 192/350 (54%), Gaps = 16/350 (4%)

Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQC-RPCTECYQQSDPIFDPKTSSSYSPLPCAAP 217
           Y   I +GTPP   + VLDTGSD+ W QC  PC  C+ Q  P++ P  S++Y+ + C +P
Sbjct: 92  YLVDIAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSATYANVSCRSP 151

Query: 218 QCKSLDVSACRANR----CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNE 273
            C++L     R +     C Y  +YGDG+ T G L TET + G+  +V+G+A GCG +N 
Sbjct: 152 MCQALQSPWSRCSPPDTGCAYYFSYGDGTSTDGVLATETFTLGSDTAVRGVAFGCGTENL 211

Query: 274 GLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEF-NSARGGDAV-TAPL 331
           G    S+GL+G+G G LSL  Q+  T  +YC    ++ A+  L   +SAR   A  T P 
Sbjct: 212 GSTDNSSGLVGMGRGPLSLVSQLGVTRFSYCFTPFNATAASPLFLGSSARLSSAAKTTPF 271

Query: 332 IRN-----KKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQ 386
           + +     ++  ++YY+ L G +VG   + I P++F +   GDGG+I+D GT  T L+ +
Sbjct: 272 VPSPSGGARRRSSYYYLSLEGITVGDTLLPIDPAVFRLTPMGDGGVIIDSGTTFTALEER 331

Query: 387 AYNSLRDSFVRLAGNLKPTSGVAL-FDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNY 445
           A+ +L  +       L   SG  L    C+  +   +V VP + LHF  G  ++L  ++Y
Sbjct: 332 AFVALARALASRV-RLPLASGAHLGLSLCFAAASPEAVEVPRLVLHFD-GADMELRRESY 389

Query: 446 LIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           ++   SAG  C     ++  +S++G++QQQ T + +DL    + F P KC
Sbjct: 390 VVEDRSAGVACLGMV-SARGMSVLGSMQQQNTHILYDLERGILSFEPAKC 438


>gi|357158691|ref|XP_003578210.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 459

 Score =  207 bits (528), Expect = 7e-51,   Method: Compositional matrix adjust.
 Identities = 136/366 (37%), Positives = 183/366 (50%), Gaps = 31/366 (8%)

Query: 155 GSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPC 214
           G  EY   + VGTPP+  S +LDTGSD+ W QC PC  C  Q DPIF P  SSSY P+ C
Sbjct: 100 GDLEYLVDLAVGTPPQPVSALLDTGSDLIWTQCAPCASCLPQPDPIFSPGASSSYEPMRC 159

Query: 215 AAPQCKSLDVSAC-RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKG-------IAL 266
           A   C  +   +C R + C Y+ +YGDG+ T G   TE  +F +S S          +  
Sbjct: 160 AGELCNDILHHSCQRPDTCTYRYSYGDGTTTRGVYATERFTFSSSSSGGETTKLSAPLGF 219

Query: 267 GCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGG-- 324
           GCG  N+G     +G++G G   LSL  Q+     +YCL    S     L F S RGG  
Sbjct: 220 GCGTMNKGSLNNGSGIVGFGRAPLSLVSQLAIRRFSYCLTPYASGRKSTLLFGSLRGGVY 279

Query: 325 DAVTAP-----LIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTA 379
           DA TA      L+R+++  TFYYV  TG +VG + ++IP S F +   G GG IVD GTA
Sbjct: 280 DAATATVQTTRLLRSRQNPTFYYVPFTGVTVGARRLRIPISAFALRPDGSGGAIVDSGTA 339

Query: 380 ITRLQTQAYNSLRDSF---VRLA----GNLKPTSGVALFDTCYDFSGLRSVR---VPTVS 429
           +T         +  +F   +RL     G+  P  GV     C+  +  R  R   VP + 
Sbjct: 340 LTLFPAPVLAEVVRAFRSQLRLPFAANGSSGPDDGV-----CFAAAASRVPRPAVVPRMV 394

Query: 430 LHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVG 489
            H   G  LDLP +NY++     G  C   A +  + + IGN  QQ  RV +DL  + + 
Sbjct: 395 FHL-QGADLDLPRRNYVLDDQRKGNLCLLLADSGDSGTTIGNFVQQDMRVLYDLEADTLS 453

Query: 490 FTPNKC 495
           F P +C
Sbjct: 454 FAPAQC 459


>gi|357143847|ref|XP_003573077.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 420

 Score =  207 bits (527), Expect = 9e-51,   Method: Compositional matrix adjust.
 Identities = 122/349 (34%), Positives = 181/349 (51%), Gaps = 12/349 (3%)

Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAP 217
           EY   + +G PP  F  + DTGSD+ W QC+PC  C+ Q  P++DP  SS++SPLPC++ 
Sbjct: 70  EYLMELAIGKPPVPFVALADTGSDLTWTQCQPCKLCFPQDTPVYDPSASSTFSPLPCSSA 129

Query: 218 QCKSLDVSACR-ANRCLYQVAYGDGSFTVGDLVTETVSFGNSG---SVKGIALGCGHDNE 273
            C  +    C  ++ C Y+ AYGDG+++ G L TET++ G S    SV G+A GCG DN 
Sbjct: 130 TCLPIWSRNCTPSSLCRYRYAYGDGAYSAGILGTETLTLGPSSAPVSVGGVAFGCGTDNG 189

Query: 274 GLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVD-----RDSP-ASGVLEFNSARGGDAV 327
           G  + S G +GLG G LSL  Q+     +YCL D      DSP   G L   +       
Sbjct: 190 GDSLNSTGTVGLGRGTLSLLAQLGVGKFSYCLTDFFNSALDSPFLLGTLAELAPGPSTVQ 249

Query: 328 TAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQA 387
           + PL+++ +  + Y+V L G S+G   + IP   F++   G GG+IVD GT  T L    
Sbjct: 250 STPLLQSPQNPSRYFVSLQGISLGDVRLPIPNGTFDLRGDGTGGMIVDSGTTFTILAESG 309

Query: 388 YNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLI 447
           +  +     R+ G   P +  +L   C+         +P + LHF  G  + L   NY+ 
Sbjct: 310 FREVVGRVARVLGQ-PPVNASSLDAPCFPAPAGEPPYMPDLVLHFAGGADMRLYRDNYMS 368

Query: 448 PVDSAGTFCFAFAPTS-SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
             +   +FC   A T+  + S++GN QQQ  ++ FD    ++ F P  C
Sbjct: 369 YNEEDSSFCLNIAGTTPESTSVLGNFQQQNIQMLFDTTVGQLSFLPTDC 417


>gi|125558631|gb|EAZ04167.1| hypothetical protein OsI_26309 [Oryza sativa Indica Group]
          Length = 441

 Score =  207 bits (527), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 122/350 (34%), Positives = 191/350 (54%), Gaps = 16/350 (4%)

Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQC-RPCTECYQQSDPIFDPKTSSSYSPLPCAAP 217
           Y   I +GTPP   + VLDTGSD+ W QC  PC  C+ Q  P++ P  S++Y+ + C +P
Sbjct: 92  YLVDIAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSATYANVSCRSP 151

Query: 218 QCKSLDVSACRANR----CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNE 273
            C++L     R +     C Y  +YGDG+ T G L TET + G+  +V+G+A GCG +N 
Sbjct: 152 MCQALQSPWSRCSPPDTGCAYYFSYGDGTSTDGVLATETFTLGSDTAVRGVAFGCGTENL 211

Query: 274 GLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEF-NSARGGDAV-TAPL 331
           G    S+GL+G+G G LSL  Q+  T  +YC    ++ A+  L   +SAR   A  T P 
Sbjct: 212 GSTDNSSGLVGMGRGPLSLVSQLGVTRFSYCFTPFNATAASPLFLGSSARLSSAAKTTPF 271

Query: 332 IRN-----KKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQ 386
           + +     ++  ++YY+ L G +VG   + I P++F +   GDGG+I+D GT  T L+  
Sbjct: 272 VPSPSGGARRRSSYYYLSLEGITVGDTLLPIDPAVFRLTPMGDGGVIIDSGTTFTALEES 331

Query: 387 AYNSLRDSFVRLAGNLKPTSGVAL-FDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNY 445
           A+ +L  +       L   SG  L    C+  +   +V VP + LHF  G  ++L  ++Y
Sbjct: 332 AFVALARALASRV-RLPLASGAHLGLSLCFAAASPEAVEVPRLVLHFD-GADMELRRESY 389

Query: 446 LIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           ++   SAG  C     ++  +S++G++QQQ T + +DL    + F P KC
Sbjct: 390 VVEDRSAGVACLGMV-SARGMSVLGSMQQQNTHILYDLERGILSFEPAKC 438


>gi|242081123|ref|XP_002445330.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
 gi|241941680|gb|EES14825.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
          Length = 543

 Score =  207 bits (527), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 150/421 (35%), Positives = 207/421 (49%), Gaps = 52/421 (12%)

Query: 106 LERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGV 165
           L  D +R N+     QL I N DR     A A         P+ SG    +  Y + I +
Sbjct: 141 LAADESRANSF----QLRIRN-DR----AAAASTQSGSAEVPLTSGIRFQTLNYVTTIAL 191

Query: 166 G-----TPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQC- 219
           G     +P    ++++DTGSD+ W+QC+PC+ CY Q DP+FDP  S++Y+ + C A  C 
Sbjct: 192 GGGSSGSPAANLTVIVDTGSDLTWVQCKPCSACYAQRDPLFDPAGSATYAAVRCNASACA 251

Query: 220 KSLDVS-----ACRA--NRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDN 272
            SL  +     +C     RC Y +AYGDGSF+ G L T+TV+ G + S+ G   GCG  N
Sbjct: 252 ASLKAATGTPGSCGGGNERCYYALAYGDGSFSRGVLATDTVALGGA-SLDGFVFGCGLSN 310

Query: 273 EGLFVGSAGLLGLGGGMLSLTKQIK---ATSLAYCL-VDRDSPASGVLEFNSARGGDAV- 327
            GLF G+AGL+GLG   LSL  Q         +YCL       ASG L      GGDA  
Sbjct: 311 RGLFGGTAGLMGLGRTELSLVSQTALRYGGVFSYCLPATTSGDASGSLSL----GGDASS 366

Query: 328 ---TAPLIRNKKVDT-----FYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTA 379
              T P+   + +       FY++ +TG +VGG A+            G   +++D GT 
Sbjct: 367 YRNTTPVAYTRMIADPAQPPFYFLNVTGAAVGGTAL-------AAQGLGASNVLIDSGTV 419

Query: 380 ITRLQTQAYNSLRDSFVR-LAGNLKPTS-GVALFDTCYDFSGLRSVRVPTVSLHFGAGKA 437
           ITRL    Y  +R  F R  A    PT+ G ++ DTCYD +G   V+VP ++L    G  
Sbjct: 420 ITRLAPSVYRGVRAEFTRQFAAAGYPTAPGFSILDTCYDLTGHDEVKVPLLTLRLEGGAE 479

Query: 438 LDLPAKNYLIPVDSAGT-FCFAFAPTS--SALSIIGNVQQQGTRVSFDLANNRVGFTPNK 494
           + + A   L  V   G+  C A A  S      IIGN QQ+  RV +D   +R+GF    
Sbjct: 480 VTVDAAGMLFVVRKDGSQVCLAMASLSYEDQTPIIGNYQQKNKRVVYDTVGSRLGFADED 539

Query: 495 C 495
           C
Sbjct: 540 C 540


>gi|414589628|tpg|DAA40199.1| TPA: hypothetical protein ZEAMMB73_627989 [Zea mays]
          Length = 452

 Score =  207 bits (527), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 145/416 (34%), Positives = 197/416 (47%), Gaps = 33/416 (7%)

Query: 101 LVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYF 160
           L+   + R  AR   L      A ++    +  PA   +LP   S         G  EY 
Sbjct: 49  LIRRAMRRSKARAAALSAVRNRARFSGKNEQQTPAG--VLPVRPS---------GDLEYV 97

Query: 161 SRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCK 220
             + +GTPP+  S +LDTGSD+ W QC PC  C  Q DP+F P  S+SY P+ CA   C 
Sbjct: 98  VDLAIGTPPQPVSALLDTGSDLIWTQCAPCASCLSQPDPLFAPGQSASYEPMRCAGTLCS 157

Query: 221 SLDVSAC-RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKG------IALGCGHDNE 273
            +   +C R + C Y+  YGDG+ TVG   TE  +F +SG          +  GCG  N 
Sbjct: 158 DILHHSCERPDTCTYRYNYGDGTMTVGVYATERFTFASSGGGGLTTTTVPLGFGCGSVNV 217

Query: 274 GLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARG---GDAV--- 327
           G     +G++G G   LSL  Q+     +YCL    S     L F S      GDA    
Sbjct: 218 GSLNNGSGIVGFGRNPLSLVSQLSIRRFSYCLTSYASRRQSTLLFGSLSDGVYGDATGRV 277

Query: 328 -TAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQ 386
            T PL+++ +  TFYYV  TG +VG + ++IP S F +   G GG+IVD GTA+T L   
Sbjct: 278 QTTPLLQSPQNPTFYYVHFTGLTVGARRLRIPESAFALRPDGSGGVIVDSGTALTLLPAA 337

Query: 387 AYNSLRDSF---VRL--AGNLKPTSGVA-LFDTCYDFSGLRS-VRVPTVSLHFGAGKALD 439
               +  +F   +RL  A    P  GV  L    +  S   S + VP + LHF  G  LD
Sbjct: 338 VLAEVVRAFRQQLRLPFANGGNPEDGVCFLVPAAWRRSSSTSQMPVPRMVLHF-QGADLD 396

Query: 440 LPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           LP +NY++     G  C   A +    S IGN+ QQ  RV +DL    +   P +C
Sbjct: 397 LPRRNYVLDDHRRGRLCLLLADSGDDGSTIGNLVQQDMRVLYDLEAETLSIAPARC 452


>gi|115452685|ref|NP_001049943.1| Os03g0318400 [Oryza sativa Japonica Group]
 gi|108707841|gb|ABF95636.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113548414|dbj|BAF11857.1| Os03g0318400 [Oryza sativa Japonica Group]
          Length = 434

 Score =  206 bits (525), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 136/358 (37%), Positives = 196/358 (54%), Gaps = 27/358 (7%)

Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAP 217
           EY   + +GTPP+   + LDTGSD+ W QC+PC  C+ Q+ P FDP TSS+ S   C + 
Sbjct: 81  EYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTSCDST 140

Query: 218 QCKSLDVSACRANR------CLYQVAYGDGSFTVGDLVTETVSF-GNSGSVKGIALGCGH 270
            C+ L V++C + +      C+Y  +YGD S T G L  +  +F G   SV G+A GCG 
Sbjct: 141 LCQGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTFVGAGASVPGVAFGCGL 200

Query: 271 DNEGLFVGS-AGLLGLGGGMLSLTKQIKATSLAYCL--VDRDSPASGVLE-----FNSAR 322
            N G+F  +  G+ G G G LSL  Q+K  + ++C   V+   P++ +L+     + S R
Sbjct: 201 FNNGVFKSNETGIAGFGRGPLSLPSQLKVGNFSHCFTAVNGLKPSTVLLDLPADLYKSGR 260

Query: 323 GGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITR 382
           G    T PLI+N    TFYY+ L G +VG   + +P S F +   G GG I+D GTA+T 
Sbjct: 261 GAVQST-PLIQNPANPTFYYLSLKGITVGSTRLPVPESEFALKN-GTGGTIIDSGTAMTS 318

Query: 383 LQTQAYNSLRDSFVRLAGNLK-PTSGVALFDTCYDFSG-LRSV-RVPTVSLHFGAGKALD 439
           L T+ Y  +RD+F   A  +K P       D  +  S  LR+   VP + LHF  G  +D
Sbjct: 319 LPTRVYRLVRDAF---AAQVKLPVVSGNTTDPYFCLSAPLRAKPYVPKLVLHF-EGATMD 374

Query: 440 LPAKNYLIPVDSAGT--FCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           LP +NY+  V+ AG+   C A       ++ IGN QQQ   V +DL N+++ F P +C
Sbjct: 375 LPRENYVFEVEDAGSSILCLAII-EGGEVTTIGNFQQQNMHVLYDLQNSKLSFVPAQC 431


>gi|326516344|dbj|BAJ92327.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 464

 Score =  206 bits (525), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 142/433 (32%), Positives = 213/433 (49%), Gaps = 39/433 (9%)

Query: 89  ILHKTRHNDYRSLVLSRLER-------DSARVNTLITKLQLAIYNVDRHELKPAEAQILP 141
           +L    H  + S   SR E        D+ARV++L  + ++  Y + R     A A  L 
Sbjct: 42  VLELRHHASFSSGGKSRAEEAHAVLASDAARVSSL--QRRIGSYGLIRSS-DAASASKLA 98

Query: 142 EDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIF 201
           +    PV SGA   +  Y + +G+G    + ++++DT S++ W+QC PC  C+ Q +P+F
Sbjct: 99  Q---VPVTSGARLRTLNYVATVGIGG--GEATVIVDTASELTWVQCEPCDACHDQQEPLF 153

Query: 202 DPKTSSSYSPLPCAAPQCKSLDVS------AC--RANRCLYQVAYGDGSFTVGDLVTETV 253
           DP +S SY+ +PC +  C +L V+      AC  +   C Y ++Y DGS++ G L  + +
Sbjct: 154 DPSSSPSYAAVPCNSSSCDALRVATGMSGQACDDQPAACSYTLSYRDGSYSRGVLAHDRL 213

Query: 254 SFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIK---ATSLAYCLVDRDS 310
           S      ++G   GCG  N+G F G++GL+GLG   LSL  Q         +YCL  ++S
Sbjct: 214 SLAGE-DIQGFVFGCGTSNQGPFGGTSGLMGLGRSQLSLISQTMDQFGGVFSYCLPPKES 272

Query: 311 PASGVLEFNSARGGDAVTAPLIRNKKVDT-----FYYVGLTGFSVGGQAVQIPPSLFEMD 365
            +SG L           + P++    V       FY   LTG +VGG+ VQ P       
Sbjct: 273 GSSGSLVLGDDASVYRNSTPIVYTAMVSDPLQGPFYLANLTGITVGGEDVQSP----GFS 328

Query: 366 EAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRV 425
             G G  IVD GT IT L    Y ++R  FV         +  ++ DTC+D +GLR V+V
Sbjct: 329 AGGGGKAIVDSGTIITSLVPSVYAAVRAEFVSQLAEYPQAAPFSILDTCFDLTGLREVQV 388

Query: 426 PTVSLHFGAGKALDLPAKNYLIPV-DSAGTFCFAFAPTSSALS--IIGNVQQQGTRVSFD 482
           P++ L F  G  +++ +K  L  V   A   C A A   S     IIGN QQ+  RV FD
Sbjct: 389 PSLKLVFDGGAEVEVDSKGVLYVVTGDASQVCLALASLKSEYDTPIIGNYQQKNLRVIFD 448

Query: 483 LANNRVGFTPNKC 495
              +++GF    C
Sbjct: 449 TVGSQIGFAQETC 461


>gi|125543640|gb|EAY89779.1| hypothetical protein OsI_11321 [Oryza sativa Indica Group]
          Length = 434

 Score =  206 bits (525), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 136/358 (37%), Positives = 196/358 (54%), Gaps = 27/358 (7%)

Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAP 217
           EY   + +GTPP+   + LDTGSD+ W QC+PC  C+ Q+ P FDP TSS+ S   C + 
Sbjct: 81  EYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTSCDST 140

Query: 218 QCKSLDVSACRANR------CLYQVAYGDGSFTVGDLVTETVSF-GNSGSVKGIALGCGH 270
            C+ L V++C + +      C+Y  +YGD S T G L  +  +F G   SV G+A GCG 
Sbjct: 141 LCQGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTFVGAGASVPGVAFGCGL 200

Query: 271 DNEGLFVGS-AGLLGLGGGMLSLTKQIKATSLAYCL--VDRDSPASGVLE-----FNSAR 322
            N G+F  +  G+ G G G LSL  Q+K  + ++C   V+   P++ +L+     + S R
Sbjct: 201 FNNGVFKSNETGIAGFGRGPLSLPSQLKVGNFSHCFTAVNGLKPSTVLLDLPADLYKSGR 260

Query: 323 GGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITR 382
           G    T PLI+N    TFYY+ L G +VG   + +P S F +   G GG I+D GTA+T 
Sbjct: 261 GAVQST-PLIQNPANPTFYYLSLKGITVGSTRLPVPESEFTLKN-GTGGTIIDSGTAMTS 318

Query: 383 LQTQAYNSLRDSFVRLAGNLK-PTSGVALFDTCYDFSG-LRSV-RVPTVSLHFGAGKALD 439
           L T+ Y  +RD+F   A  +K P       D  +  S  LR+   VP + LHF  G  +D
Sbjct: 319 LPTRVYRLVRDAF---AAQVKLPVVSGNTTDPYFCLSAPLRAKPYVPKLVLHF-EGATMD 374

Query: 440 LPAKNYLIPVDSAGT--FCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           LP +NY+  V+ AG+   C A       ++ IGN QQQ   V +DL N+++ F P +C
Sbjct: 375 LPRENYVFEVEDAGSSILCLAII-EGGEVTTIGNFQQQNMHVLYDLQNSKLSFVPAQC 431


>gi|326490656|dbj|BAJ89995.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 456

 Score =  206 bits (525), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 146/427 (34%), Positives = 218/427 (51%), Gaps = 30/427 (7%)

Query: 76  SSSSFSLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPA 135
           SS   ++PLH R     T  +     +   L RD  R   +  K            +  +
Sbjct: 53  SSGVVTVPLHHRHGPCSTVPSTNAPTLEDMLRRDQLRAAYITRKYS---------GVNGS 103

Query: 136 EAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQ 195
              +   D + P   G S  + EY   +G+G+P    +M++DTGSD++W+QC+PC++C+ 
Sbjct: 104 AGDVEGSDVTVPTTLGTSLDTLEYLITVGMGSPAVAQTMLIDTGSDVSWVQCKPCSQCHS 163

Query: 196 QSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSF 255
           Q+D +FDP +SS+YS   C +  C  L    C +++C Y V YGDGS   G   ++T++ 
Sbjct: 164 QADSLFDPSSSSTYSAFSCTSAACAQLRQRGCSSSQCQYTVKYGDGSTGSGTYSSDTLAL 223

Query: 256 GNSGSVKGIALGCGHDNEGLFV--GSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDS 310
           G+S +V+    GC     G  +   +AGL+GLGGG  SL  Q   T   + +YCL     
Sbjct: 224 GSS-TVENFQFGCSQSESGNLLQDQTAGLMGLGGGAESLATQTAGTFGKAFSYCLPPTPG 282

Query: 311 PASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDG 370
            +SG L   ++  G  V  P++R+ +V ++Y V L    VGG+ + IP S F        
Sbjct: 283 -SSGFLTLGASTSGFVVKTPMLRSTQVPSYYGVLLQAIRVGGRQLNIPASAFS------A 335

Query: 371 GIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSL 430
           G I+D GT ITRL   AY++L  +F        P   + +FDTC+DFSG  SV +PTV+L
Sbjct: 336 GSIMDSGTIITRLPRTAYSALSSAFKAGMKQYPPAQPMGIFDTCFDFSGQSSVSIPTVAL 395

Query: 431 HFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTS--SALSIIGNVQQQGTRVSFDLANNRV 488
            F  G  +DL +   ++        C AFA  S  ++L IIGNVQQ+   V +D+    V
Sbjct: 396 VFSGGAVVDLASDGIIL------GSCLAFAANSDDTSLGIIGNVQQRTFEVLYDVGGGAV 449

Query: 489 GFTPNKC 495
           GF    C
Sbjct: 450 GFKAGAC 456


>gi|302757235|ref|XP_002962041.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
 gi|300170700|gb|EFJ37301.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
          Length = 367

 Score =  206 bits (524), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 130/368 (35%), Positives = 195/368 (52%), Gaps = 29/368 (7%)

Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCA 215
           SG Y   I +G+PP++F+ ++DTGSD+ W+QC+PC++CY QSDPI+DP  SS+++   C+
Sbjct: 1   SGAYTMEIELGSPPKKFNAIVDTGSDLVWIQCKPCSQCYSQSDPIYDPSASSTFAKTSCS 60

Query: 216 APQCKSLDVSAC--RANRCLYQVAYGDGSFTVGDLVTETVSF----GNSGSVKGIALGCG 269
              C+SL  S C   A  C+Y   YGD S T GD   ET++     G+S +      GCG
Sbjct: 61  TSSCQSLPASGCSSSAKTCIYGYQYGDSSSTQGDFALETLTLRSSGGSSKAFPNFQFGCG 120

Query: 270 HDNEGLFVGSAGLLGLGGGMLSLTKQIKA---TSLAYCLVDRDSPASG----VLEFNSAR 322
             N G F G+AG++GLG G +SL+ Q+ +      +YCLVD D  +S     +   +++ 
Sbjct: 121 RLNSGSFGGAAGIVGLGQGKISLSTQLGSAINNKFSYCLVDFDDDSSKTSPLIFGSSAST 180

Query: 323 GGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEM-------------DEAGD 369
           G  A++ P+I N    T+Y+VGL G SVGG+ + +     +               E   
Sbjct: 181 GSGAISTPIIPNSGRSTYYFVGLEGISVGGKQLSLATRAIDFLSVRSKKKLRVRALEVNS 240

Query: 370 GGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVS 429
           GG I D GT +T L    Y+ ++ +F          +  + FD CYD S  ++ + P ++
Sbjct: 241 GGTIFDSGTTLTLLDDAVYSKVKSAFASSVSLPTVDASSSGFDLCYDVSKSKNFKFPALT 300

Query: 430 LHFGAGKALDLPAKNYLIPVDSAGTF-CFAF-APTSSALSIIGNVQQQGTRVSFDLANNR 487
           L F  G     P KNY + VD+A T  C A     S  L IIGN+ QQ   V +D   + 
Sbjct: 301 LAF-KGTKFSPPQKNYFVIVDTAETVACLAMGGSGSLGLGIIGNLMQQNYHVVYDRGTST 359

Query: 488 VGFTPNKC 495
           +  +P +C
Sbjct: 360 ISMSPAQC 367


>gi|168014386|ref|XP_001759733.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162689272|gb|EDQ75645.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 392

 Score =  206 bits (524), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 128/375 (34%), Positives = 189/375 (50%), Gaps = 28/375 (7%)

Query: 144 FSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDP 203
           F TP+VSG + GSG+YF    +GTP ++F +++DTGSD+ ++QC PC  CY+Q  P++ P
Sbjct: 19  FRTPLVSGTTLGSGQYFVDFSLGTPEQKFHLIVDTGSDLAFVQCAPCDLCYEQDGPLYQP 78

Query: 204 KTSSSYSPLPCAAPQCKSLDV---SACRANR--------CLYQVAYGDGSFTVGDLVTET 252
             SS+++P+PC + +C  +     + C ++         C Y+  YGD S TVG    ET
Sbjct: 79  SNSSTFTPVPCDSAECLLIPAPVGAPCSSSYPESPPQGACSYEYRYGDNSSTVGVFAYET 138

Query: 253 VSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIK---ATSLAYCLVDRD 309
            + G    V  +A GCG+ N+G FV + G+LGLG G LS T Q         AYCL    
Sbjct: 139 ATVGGI-RVNHVAFGCGNRNQGSFVSAGGVLGLGQGALSFTSQAGYAFENKFAYCLTSYL 197

Query: 310 SPASGVLEFNSARGGDAVTA--------PLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSL 361
           SP S    F+S   GD + +        PL+ N    + YYV +     GG+ + IP S 
Sbjct: 198 SPTS---VFSSLIFGDDMMSTIHDLQFTPLVSNPLNPSVYYVQIVRICFGGETLLIPDSA 254

Query: 362 FEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLR 421
           +++D  G+GG I D GT +T    QAY  +  +F +     +          C + SG+ 
Sbjct: 255 WKIDSVGNGGTIFDSGTTVTYWSPQAYARIIAAFEKSVPYPRAPPSPQGLPLCVNVSGID 314

Query: 422 SVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSS-ALSIIGNVQQQGTRVS 480
               P+ ++ F  G        NY I V S    C A   +SS   ++IGN+ QQ   V 
Sbjct: 315 HPIYPSFTIEFDQGATYRPNQGNYFIEV-SPNIDCLAMLESSSDGFNVIGNIIQQNYLVQ 373

Query: 481 FDLANNRVGFTPNKC 495
           +D   +R+GF    C
Sbjct: 374 YDREEHRIGFAHANC 388


>gi|242044888|ref|XP_002460315.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
 gi|241923692|gb|EER96836.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
          Length = 456

 Score =  206 bits (524), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 133/366 (36%), Positives = 178/366 (48%), Gaps = 20/366 (5%)

Query: 149 VSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSS 208
           VS    G  EY   + +GTPP+  S +LDTGSD+ W QC PC  C  Q DP+F P  S+S
Sbjct: 92  VSVRPSGDLEYVVDLAIGTPPQPVSALLDTGSDLIWTQCAPCASCLAQPDPLFAPGESAS 151

Query: 209 YSPLPCAAPQCKSLDVSACRA-NRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVK----G 263
           Y P+ CA   C  +    C   + C Y+  YGDG+ T+G   TE  +F +SG  +     
Sbjct: 152 YEPMRCAGQLCSDILHHGCEMPDTCTYRYNYGDGTMTMGVYATERFTFTSSGGDRLMTVP 211

Query: 264 IALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARG 323
           +  GCG  N G     +G++G G   LSL  Q+     +YCL    S     L F S  G
Sbjct: 212 LGFGCGSMNVGSLNNGSGIVGFGRNPLSLVSQLSIRRFSYCLTSYGSGRKSTLLFGSLSG 271

Query: 324 ---GDAV----TAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDC 376
              GDA     T PL+++ +  TFYYV L G +VG + ++IP S F +   G GG+IVD 
Sbjct: 272 GVYGDATGPVQTTPLLQSLQNPTFYYVHLAGLTVGARRLRIPESAFALRPDGSGGVIVDS 331

Query: 377 GTAITRLQTQAYNSLRDSF---VRL--AGNLKPTSGVALF--DTCYDFSGLRSVRVPTVS 429
           GTA+T L       +  +F   +RL  A    P  GV           S    V VP + 
Sbjct: 332 GTALTLLPGAVLAEVVRAFRQQLRLPFANGGNPEDGVCFLVPAAWRRSSSTSQVPVPRMV 391

Query: 430 LHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVG 489
            HF     LDLP +NY++     G  C   A +    S IGN+ QQ  RV +DL    + 
Sbjct: 392 FHFQDAD-LDLPRRNYVLDDHRKGRLCLLLADSGDDGSTIGNLVQQDMRVLYDLEAETLS 450

Query: 490 FTPNKC 495
           F P +C
Sbjct: 451 FAPAQC 456


>gi|357118076|ref|XP_003560785.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 477

 Score =  206 bits (523), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 126/361 (34%), Positives = 181/361 (50%), Gaps = 20/361 (5%)

Query: 147 PVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCT-ECYQQSDPIFDPKT 205
           P  +G +  + E+   +G GTP +  +++LDTGSD++W+QC+PC+  CY+Q DP FDP  
Sbjct: 125 PDHTGTNLDTLEFVVVVGFGTPAQTAAIILDTGSDLSWIQCKPCSGHCYRQHDPDFDPAK 184

Query: 206 SSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIA 265
           SSSY+ +PC  P C +     C    CLY V YGDGS T G L  +T++F +S    G  
Sbjct: 185 SSSYAAVPCGTPVCAAAG-GMCNGTTCLYGVQYGDGSSTTGVLSRDTLTFNSSSKFTGFT 243

Query: 266 LGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS----LAYCLVDRDSPASGVLEFNSA 321
            GCG  N G F G    L   G          A S     +YCL   ++   G L   + 
Sbjct: 244 FGCGEKNIGDF-GEVDGLLGLGRGKLSLPSQAAPSFGGVFSYCLPSYNT-TPGYLNIGAT 301

Query: 322 RGGDAVTA---PLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGT 378
           +    V      +I+  +  +FY++ L   ++GG  + +PPS+F        G ++D GT
Sbjct: 302 KPTSTVPVQYTAMIKKPQYPSFYFIELVSINIGGYILPVPPSVFTKT-----GTLLDSGT 356

Query: 379 AITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKAL 438
            +T L   AY SLRD F       KP       DTCYDF+G  ++ +P VS +F  G   
Sbjct: 357 ILTYLPPPAYTSLRDRFKFTMQGNKPAPPYEPLDTCYDFTGQGAIVIPAVSFNFSDGAVF 416

Query: 439 DLPAKNYLIPVDSAGTF--CFAFA--PTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNK 494
           DL     +I  D A     C AF   P +   SI+GN QQ+   V +D+ + ++GF P  
Sbjct: 417 DLDFYGIMIFPDDAKPLIGCLAFVSRPAAMPFSIVGNTQQRAAEVIYDVPSQKIGFIPIS 476

Query: 495 C 495
           C
Sbjct: 477 C 477


>gi|224142011|ref|XP_002324354.1| predicted protein [Populus trichocarpa]
 gi|222865788|gb|EEF02919.1| predicted protein [Populus trichocarpa]
          Length = 471

 Score =  206 bits (523), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 136/402 (33%), Positives = 202/402 (50%), Gaps = 34/402 (8%)

Query: 106 LERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGV 165
           L RD  RV ++  K          H +  +   +  E   T V +  +   G Y   +G+
Sbjct: 92  LRRDQLRVKSIRAK----------HSMNSSTTGVFNE-MKTRVPT--THFGGGYAVTVGL 138

Query: 166 GTPPRQFSMVLDTGSDINWLQCRPCT-ECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDV 224
           GTP + FS++ DTGSD+ W QC PC+  C+ Q+D  FDP  S+SY  L C++  CKS+  
Sbjct: 139 GTPKKDFSLLFDTGSDLTWTQCEPCSGGCFPQNDEKFDPTKSTSYKNLSCSSEPCKSIGK 198

Query: 225 SACR----ANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSA 280
            + +    +N CLY V YG G +TVG L TET++   S   +   +GCG  N G F G+A
Sbjct: 199 ESAQGCSSSNSCLYGVKYGTG-YTVGFLATETLTITPSDVFENFVIGCGERNGGRFSGTA 257

Query: 281 GLLGLGGGMLSLTKQIKATS---LAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNKKV 337
           GLLGLG   ++L  Q  +T     +YCL    S ++G L F       A   P+    K+
Sbjct: 258 GLLGLGRSPVALPSQTSSTYKNLFSYCL-PASSSSTGHLSFGGGVSQAAKFTPI--TSKI 314

Query: 338 DTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVR 397
              Y + ++G SVGG+ + I PS+F        G I+D GT +T L + A+++L  +F  
Sbjct: 315 PELYGLDVSGISVGGRKLPIDPSVFRT-----AGTIIDSGTTLTYLPSTAHSALSSAFQE 369

Query: 398 LAGNLKPTSGVALFDTCYDFS--GLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTF 455
           +  N   T G +    CYDFS     ++ +P +S+ F  G  +D+      I  +     
Sbjct: 370 MMTNYTLTKGTSGLQPCYDFSKHANDNITIPQISIFFEGGVEVDIDDSGIFIAANGLEEV 429

Query: 456 CFAFAPTS--SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           C AF      + ++I GNVQQ+   V +D+A   VGF P  C
Sbjct: 430 CLAFKDNGNDTDVAIFGNVQQKTYEVVYDVAKGMVGFAPGGC 471


>gi|115479489|ref|NP_001063338.1| Os09g0452800 [Oryza sativa Japonica Group]
 gi|51535938|dbj|BAD38020.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
 gi|113631571|dbj|BAF25252.1| Os09g0452800 [Oryza sativa Japonica Group]
 gi|125605918|gb|EAZ44954.1| hypothetical protein OsJ_29597 [Oryza sativa Japonica Group]
 gi|215740967|dbj|BAG97462.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 453

 Score =  205 bits (522), Expect = 4e-50,   Method: Compositional matrix adjust.
 Identities = 129/369 (34%), Positives = 184/369 (49%), Gaps = 21/369 (5%)

Query: 147 PVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTS 206
           P ++  + G  EY   + VGTPP+  + +LDTGSD+ W QC  CT C +Q DP+F P+ S
Sbjct: 86  PGMAVRASGDLEYVLDLAVGTPPQPITALLDTGSDLIWTQCDTCTACLRQPDPLFSPRMS 145

Query: 207 SSYSPLPCAAPQCKSLDVSAC-RANRCLYQVAYGDGSFTVGDLVTETVSFG-NSGSVKGI 264
           SSY P+ CA   C  +   +C R + C Y+ +YGDG+ T+G   TE  +F  +SG  + +
Sbjct: 146 SSYEPMRCAGQLCGDILHHSCVRPDTCTYRYSYGDGTTTLGYYATERFTFASSSGETQSV 205

Query: 265 AL--GCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSAR 322
            L  GCG  N G    ++G++G G   LSL  Q+     +YCL    S     L+F S  
Sbjct: 206 PLGFGCGTMNVGSLNNASGIVGFGRDPLSLVSQLSIRRFSYCLTPYASSRKSTLQFGSLA 265

Query: 323 G--------GDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIV 374
                    G   T P++++ +  TFYYV  TG +VG + ++IP S F +   G GG+I+
Sbjct: 266 DVGLYDDATGPVQTTPILQSAQNPTFYYVAFTGVTVGARRLRIPASAFALRPDGSGGVII 325

Query: 375 DCGTAITRLQTQAYNSLRDSF---VRL--AGNLKPTSGVALFDTCYDFSGLRSVR---VP 426
           D GTA+T         +  +F   +RL  A    P  GV          G R  R   VP
Sbjct: 326 DSGTALTLFPVAVLAEVVRAFRSQLRLPFANGSSPDDGVCFAAPAVAAGGGRMARQVAVP 385

Query: 427 TVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANN 486
            +  HF  G  LDLP +NY++     G  C     +    + IGN  QQ  RV +DL   
Sbjct: 386 RMVFHF-QGADLDLPRENYVLEDHRRGHLCVLLGDSGDDGATIGNFVQQDMRVVYDLERE 444

Query: 487 RVGFTPNKC 495
            + F P +C
Sbjct: 445 TLSFAPVEC 453


>gi|125563959|gb|EAZ09339.1| hypothetical protein OsI_31611 [Oryza sativa Indica Group]
          Length = 453

 Score =  205 bits (521), Expect = 5e-50,   Method: Compositional matrix adjust.
 Identities = 129/369 (34%), Positives = 184/369 (49%), Gaps = 21/369 (5%)

Query: 147 PVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTS 206
           P ++  + G  EY   + VGTPP+  + +LDTGSD+ W QC  CT C +Q DP+F P+ S
Sbjct: 86  PGMAVRASGDLEYVLDLAVGTPPQPITALLDTGSDLIWTQCDTCTACLRQPDPLFSPRMS 145

Query: 207 SSYSPLPCAAPQCKSLDVSAC-RANRCLYQVAYGDGSFTVGDLVTETVSFG-NSGSVKGI 264
           SSY P+ CA   C  +   +C R + C Y+ +YGDG+ T+G   TE  +F  +SG  + +
Sbjct: 146 SSYEPMRCAGQLCGDILHHSCVRPDTCTYRYSYGDGTTTLGYYATERFTFASSSGETQSV 205

Query: 265 AL--GCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSAR 322
            L  GCG  N G    ++G++G G   LSL  Q+     +YCL    S     L+F S  
Sbjct: 206 PLGFGCGTMNVGSLNNASGIVGFGRDPLSLVSQLSIRRFSYCLTPYASSRKSTLQFGSLA 265

Query: 323 G--------GDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIV 374
                    G   T P++++ +  TFYYV  TG +VG + ++IP S F +   G GG+I+
Sbjct: 266 DVGLYDDATGPVQTTPILQSAQNPTFYYVAFTGVTVGARRLRIPASAFALRPDGSGGVII 325

Query: 375 DCGTAITRLQTQAYNSLRDSF---VRL--AGNLKPTSGVALFDTCYDFSGLRSVR---VP 426
           D GTA+T         +  +F   +RL  A    P  GV          G R  R   VP
Sbjct: 326 DSGTALTLFPAAVLAEVVRAFRSQLRLPFANGSSPDDGVCFAAPAVAAGGGRMARQVAVP 385

Query: 427 TVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANN 486
            +  HF  G  LDLP +NY++     G  C     +    + IGN  QQ  RV +DL   
Sbjct: 386 RMVFHF-QGADLDLPRENYVLEDHRRGHLCVLLGDSGDDGATIGNFVQQDMRVVYDLERE 444

Query: 487 RVGFTPNKC 495
            + F P +C
Sbjct: 445 TLSFAPVEC 453


>gi|297745479|emb|CBI40559.3| unnamed protein product [Vitis vinifera]
          Length = 436

 Score =  205 bits (521), Expect = 5e-50,   Method: Compositional matrix adjust.
 Identities = 131/329 (39%), Positives = 179/329 (54%), Gaps = 62/329 (18%)

Query: 64  ESETAAESFPLNSSS-SFSLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQL 122
           E+ET   + P++ +  + ++ L  R++L      +  +L   RL+RD+ RV  L      
Sbjct: 84  ETETQISTLPVSETDPTMTMHLEHRDVLAFNATPE--ALFNLRLQRDAFRVEALSKMAAA 141

Query: 123 AIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDI 182
           A           A+       FS+ V SG +QGSGEYF+R+GVGTPP+   MVLDTGSD+
Sbjct: 142 AGGRRAGRNGTHAQGG----GFSSSVTSGLAQGSGEYFTRLGVGTPPKYVYMVLDTGSDV 197

Query: 183 NWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANR-CLYQVAYGDG 241
            W+QC PC +CY Q+DP+FDPK S S+S + C +P C  LD   C + + CLYQVAYGDG
Sbjct: 198 VWIQCAPCRKCYSQTDPVFDPKKSGSFSSISCRSPLCLRLDSPGCNSRQSCLYQVAYGDG 257

Query: 242 SFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSL 301
           SFT G+  TET++F  +  V  +ALGCGHDNEGLFVG+AGLLGLG       +Q +    
Sbjct: 258 SFTFGEFSTETLTFRGT-RVPKVALGCGHDNEGLFVGAAGLLGLG-------RQPRL--- 306

Query: 302 AYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSL 361
                  + P  G      AR    +TA L    K+DT                      
Sbjct: 307 -------NRPPVG-----GARVA-GITASLF---KLDT---------------------- 328

Query: 362 FEMDEAGDGGIIVDCGTAITRLQTQAYNS 390
                AG+GG+I+D GT++TRL  +AY +
Sbjct: 329 -----AGNGGVIIDSGTSVTRLTRRAYGT 352


>gi|125540928|gb|EAY87323.1| hypothetical protein OsI_08727 [Oryza sativa Indica Group]
          Length = 463

 Score =  205 bits (521), Expect = 5e-50,   Method: Compositional matrix adjust.
 Identities = 135/402 (33%), Positives = 199/402 (49%), Gaps = 29/402 (7%)

Query: 106 LERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGV 165
           L+RD  R   +  K  +        +L+ ++        S P   G+S  + EY   +G+
Sbjct: 79  LKRDQLRAEHIQRKFAMNAAVDGAGDLQQSKV-----SSSVPTKLGSSLDTLEYVISVGL 133

Query: 166 GTPPRQFSMVLDTGSDINWLQCRPCTE--CYQQSDPIFDPKTSSSYSPLPCAAPQCKSLD 223
           GTP    ++ +DTGSD++W+QC PC    C+ Q+  +FDP  SS+Y  + CAA +C  L+
Sbjct: 134 GTPAVTQTVTIDTGSDVSWVQCNPCPNPPCHAQTGALFDPAKSSTYRAVSCAAAECAQLE 193

Query: 224 V--SACRAN--RCLYQVAYGDGSFTVGDLVTETVSF-GNSGSVKGIALGCGHDNEGLFVG 278
              + C A    C Y V YGDGS T G    +T++  G S +VKG   GC H   G    
Sbjct: 194 QQGNGCGATNYECQYGVQYGDGSTTNGTYSRDTLTLSGASDAVKGFQFGCSHLESGFSDQ 253

Query: 279 SAGLLGLGGGMLSLTKQIKA---TSLAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNK 335
           + GL+GLGGG  SL  Q  A    S +YCL      +  +           VT  ++R+K
Sbjct: 254 TDGLMGLGGGAQSLVSQTAAAYGNSFSYCLPPTSGSSGFLTLGGGGGASGFVTTRMLRSK 313

Query: 336 KVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSF 395
           ++ TFY   L   +VGG+ + + PS+F        G +VD GT ITRL   AY++L  +F
Sbjct: 314 QIPTFYGARLQDIAVGGKQLGLSPSVFAA------GSVVDSGTIITRLPPTAYSALSSAF 367

Query: 396 VRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTF 455
                  +     ++ DTC+DF+G   + +PTV+L F  G A+DL     +         
Sbjct: 368 KAGMKQYRSAPARSILDTCFDFAGQTQISIPTVALVFSGGAAIDLDPNGIMY------GN 421

Query: 456 CFAFAPTS--SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           C AFA T       IIGNVQQ+   V +D+ ++ +GF    C
Sbjct: 422 CLAFAATGDDGTTGIIGNVQQRTFEVLYDVGSSTLGFRSGAC 463


>gi|218192703|gb|EEC75130.1| hypothetical protein OsI_11317 [Oryza sativa Indica Group]
          Length = 440

 Score =  204 bits (520), Expect = 6e-50,   Method: Compositional matrix adjust.
 Identities = 135/383 (35%), Positives = 193/383 (50%), Gaps = 27/383 (7%)

Query: 133 KPAEAQILPEDFSTPVVSGASQGS---GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRP 189
           K    ++L    + PV  GA        EY   + +GTPP+   + LDTGSD+ W QC+P
Sbjct: 62  KARAPRLLSSSATAPVSPGAYDDGVPMTEYLLHLAIGTPPQPVQLTLDTGSDLVWTQCQP 121

Query: 190 CTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLD--VSAC---RANRCLYQVAYGDGSFT 244
           C  C+ QS P +D   SS+++   C + QCK LD  V+ C       C +  +YGD S T
Sbjct: 122 CAVCFNQSLPYYDASRSSTFALPSCDSTQCK-LDPSVTMCVNQTVQTCAFSYSYGDKSAT 180

Query: 245 VGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGS-AGLLGLGGGMLSLTKQIKATSLAY 303
           +G L  ETVSF    SV G+  GCG +N G+F  +  G+ G G G LSL  Q+K  + ++
Sbjct: 181 IGFLDVETVSFVAGASVPGVVFGCGLNNTGIFRSNETGIAGFGRGPLSLPSQLKVGNFSH 240

Query: 304 CL--VDRDSPASGVLE-----FNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQ 356
           C   V    P++ + +     + + R G   T PLI+N    TFYY+ L G +VG   + 
Sbjct: 241 CFTAVSGRKPSTVLFDLPADLYKNGR-GTVQTTPLIKNPAHPTFYYLSLKGITVGSTRLP 299

Query: 357 IPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSF---VRLAGNLKPTSGVALFDT 413
           +P S F +   G GG I+D GTA T L  + Y  + D F   V+L       +G  L   
Sbjct: 300 VPESAFALKN-GTGGTIIDSGTAFTSLPPRVYRLVHDEFAAHVKLPVVPSNETGPLL--- 355

Query: 414 CYDFSGL-RSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNV 472
           C+    L ++  VP + LHF  G  + LP +NY+      G      A     ++IIGN 
Sbjct: 356 CFSAPPLGKAPHVPKLVLHF-EGATMHLPRENYVFEAKDGGNCSICLAIIEGEMTIIGNF 414

Query: 473 QQQGTRVSFDLANNRVGFTPNKC 495
           QQQ   V +DL N+++ F   KC
Sbjct: 415 QQQNMHVLYDLKNSKLSFVRAKC 437


>gi|413921976|gb|AFW61908.1| hypothetical protein ZEAMMB73_608282 [Zea mays]
          Length = 459

 Score =  204 bits (520), Expect = 7e-50,   Method: Compositional matrix adjust.
 Identities = 133/369 (36%), Positives = 192/369 (52%), Gaps = 31/369 (8%)

Query: 155 GSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPC 214
           G  EY   + +GTPP  F  + DTGSD+ W QC+PC  C+ Q  PI+D   S+S+SP+PC
Sbjct: 91  GQAEYLMELAIGTPPVPFVALADTGSDLTWTQCKPCKLCFPQDTPIYDTAASASFSPVPC 150

Query: 215 AAPQC-----KSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSG--------SV 261
           A+  C      S + +A   + C Y+ AY DG+++ G L TET++F  S         SV
Sbjct: 151 ASATCLPIWRSSRNCTATTTSPCRYRYAYDDGAYSAGVLGTETLTFAGSSPGAPGPGVSV 210

Query: 262 KGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGV------ 315
            G+A GCG DN GL   S G +GLG G LSL  Q+     +YCL D  + + G       
Sbjct: 211 GGVAFGCGVDNGGLSYNSTGTVGLGRGSLSLVAQLGVGKFSYCLTDFFNTSLGSPVLFGS 270

Query: 316 ---LEFNSARGGDAV-TAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGG 371
              L   S  GG AV + PL++     + YYV L G S+G   + IP   F++ + G GG
Sbjct: 271 LAELAAPSTIGGAAVQSTPLVQGPYNPSRYYVSLEGISLGDARLPIPNGTFDLRDDGSGG 330

Query: 372 IIVDCGTAITRLQTQAYNSLRDSFVRLAGNL-KPTSGVALFDT-CYDFS-GLRSV-RVPT 427
           +IVD GT  T L   A+  + +    +AG L +P    +  D+ C+  + G + +  +P 
Sbjct: 331 MIVDSGTIFTVLVESAFRVVVN---HVAGVLNQPVVNASSLDSPCFPATAGEQQLPDMPD 387

Query: 428 VSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSAL-SIIGNVQQQGTRVSFDLANN 486
           + LHF  G  + L   NY+     + +FC   A   SA  SI+GN QQQ  ++ FD+   
Sbjct: 388 MLLHFAGGADMRLHRDNYMSFNQESSSFCLNIAGAPSAYGSILGNFQQQNIQMLFDITVG 447

Query: 487 RVGFTPNKC 495
           ++ F P  C
Sbjct: 448 QLSFVPTDC 456


>gi|224142001|ref|XP_002324349.1| predicted protein [Populus trichocarpa]
 gi|222865783|gb|EEF02914.1| predicted protein [Populus trichocarpa]
          Length = 490

 Score =  204 bits (518), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 147/401 (36%), Positives = 216/401 (53%), Gaps = 25/401 (6%)

Query: 106 LERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGV 165
           L +D +RV ++ ++L  +  +  + ++K  ++  +P         G++ GSG Y   +G+
Sbjct: 103 LLQDQSRVKSIHSRLSNSKTSGGK-DVKVTDSTTIPAK------DGSTVGSGNYIVTVGL 155

Query: 166 GTPPRQFSMVLDTGSDINWLQCRPCTE-CYQQSDPIFDPKTSSSYSPLPCAAPQCKSL-- 222
           GTP +  S++ DTGSDI W QC+PC   CY+Q + IFDP  S+SY+ + C++  C SL  
Sbjct: 156 GTPKKDLSLIFDTGSDITWTQCQPCARSCYKQKEQIFDPSQSTSYTNISCSSSICNSLTS 215

Query: 223 ---DVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGS 279
              +   C ++ C+Y + YGD SF+VG   TE ++  ++ +   I  GCG +N+GLF GS
Sbjct: 216 ATGNTPGCASSACVYGIQYGDSSFSVGFFGTEKLTLTSTDAFNNIYFGCGQNNQGLFGGS 275

Query: 280 AGLLGLGGGMLSL---TKQIKATSLAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNKK 336
           AGLLGLG   LS+   T Q      +YCL    S ++G L F  +   +A   PL     
Sbjct: 276 AGLLGLGRDKLSVVSQTAQKYNKIFSYCL-PSSSSSTGFLTFGGSASKNAKFTPLSTISA 334

Query: 337 VDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFV 396
             +FY +  TG SVGG+ + I  S+F        G I+D GT ITRL   AY++LR SF 
Sbjct: 335 GPSFYGLDFTGISVGGKKLAISASVFST-----AGAIIDSGTVITRLPPAAYSALRASFR 389

Query: 397 RLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFC 456
            L      T  +++ DTCYDFS   ++ VP +   F +G  +D+ A   L    S    C
Sbjct: 390 NLMSKYPMTKALSILDTCYDFSSYTTISVPKIGFSFSSGIEVDIDATGILY-ASSLSQVC 448

Query: 457 FAFAPTSSALS--IIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            AFA  S A    I GNVQQ+   V +D +  +VGF P  C
Sbjct: 449 LAFAGNSDATDVFIFGNVQQKTLEVFYDGSAGKVGFAPGGC 489


>gi|225427550|ref|XP_002266461.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
          Length = 439

 Score =  204 bits (518), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 130/356 (36%), Positives = 186/356 (52%), Gaps = 24/356 (6%)

Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCA 215
           +GEY   + +GTPP     ++DTGSD+ W QCRPCT CY+Q  P+FDPK SS+Y    C 
Sbjct: 89  AGEYLMNLYIGTPPVPVIAIVDTGSDLTWTQCRPCTHCYKQVVPLFDPKNSSTYRDSSCG 148

Query: 216 APQCKSL--DVSACRANRCLYQVAYGDGSFTVGDLVTETVSF----GNSGSVKGIALGCG 269
              C +L  D S  +  +C ++ +Y DGSFT G+L +ET++     G   S  G A GCG
Sbjct: 149 TSFCLALGKDRSCSKEKKCTFRYSYADGSFTGGNLASETLTVDSTAGKPVSFPGFAFGCG 208

Query: 270 HDNEGLF-VGSAGLLGLGGGMLSLTKQIKATS---LAYCL--VDRDSPASGVLEFNSA-- 321
           H + G+F   S+G++GLGGG LSL  Q+K+T     +YCL  V  DS  S  + F ++  
Sbjct: 209 HSSGGIFDKSSSGIVGLGGGELSLISQLKSTINGLFSYCLLPVSTDSSISSRINFGASGR 268

Query: 322 -RGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLF-EMDEAGDGGIIVDCGTA 379
             G   V+ PL++ K  DTFYY+ L G SVG +  ++P   + +  E  +G IIVD GT 
Sbjct: 269 VSGYGTVSTPLVQ-KSPDTFYYLTLEGISVGKK--RLPYKGYSKKTEVEEGNIIVDSGTT 325

Query: 380 ITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALD 439
            T L  + Y+ L  S        +      +F  CY+ +    +  P ++ HF       
Sbjct: 326 YTFLPQEFYSKLEKSVANSIKGKRVRDPNGIFSLCYNTTA--EINAPIITAHFKDANVEL 383

Query: 440 LPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            P   ++   +     CF  APTS  + ++GN+ Q    V FDL   RV F    C
Sbjct: 384 QPLNTFMRMQEDL--VCFTVAPTSD-IGVLGNLAQVNFLVGFDLRKKRVSFKAADC 436


>gi|125543638|gb|EAY89777.1| hypothetical protein OsI_11319 [Oryza sativa Indica Group]
          Length = 390

 Score =  204 bits (518), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 137/375 (36%), Positives = 195/375 (52%), Gaps = 29/375 (7%)

Query: 145 STPVVSGASQG---SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIF 201
           S PV  GA      + EY   + +GTPP+   + LDTGSD+ W QC+PC  C+ Q  P F
Sbjct: 18  SAPVSPGAYDNGVPTTEYLVHLAIGTPPQPVQLTLDTGSDLIWTQCKPCVSCFDQPLPYF 77

Query: 202 DPKTSSSYSPLPCAAPQCKSLD--VSAC-RANR----CLYQVAYGDGSFTVGDLVTETVS 254
           D   SS+ + LPC + QCK LD  V+ C + N+    C Y  +YGD S T+G L  +  +
Sbjct: 78  DTSRSSTNALLPCESTQCK-LDPTVTVCVKLNQTVQTCAYYTSYGDNSVTIGLLAADKFT 136

Query: 255 FGNSGSVKGIALGCGHDNEGLF-VGSAGLLGLGGGMLSLTKQIKATSLAYCL--VDRDSP 311
           F    S+ G+  GCG +N G+F     G+ G G G LSL  Q+K  + ++C   +    P
Sbjct: 137 FVAGTSLPGVTFGCGLNNTGVFNSNETGIAGFGRGPLSLPSQLKVGNFSHCFTTITGAIP 196

Query: 312 ASGVLEFNS---ARGGDAV-TAPLI---RNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEM 364
           ++ +L+  +   + G  AV T PLI   +N+   T YY+ L G +VG   + +P S F +
Sbjct: 197 STVLLDLPADLFSNGQGAVQTTPLIQYAKNEANPTLYYLSLKGITVGSTRLPVPESAFAL 256

Query: 365 DEAGDGGIIVDCGTAITRLQTQAYNSLRDSF-VRLAGNLKPTSGVALFDTCYDFSGLRSV 423
              G GG I+D GT+IT L  Q Y  +RD F  ++   + P +    + TC+        
Sbjct: 257 TN-GTGGTIIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGNATGHY-TCFSAPSQAKP 314

Query: 424 RVPTVSLHFGAGKALDLPAKNYLIPV-DSAGT--FCFAFAPTSSALSIIGNVQQQGTRVS 480
            VP + LHF  G  +DLP +NY+  V D AG    C A        +IIGN QQQ   V 
Sbjct: 315 DVPKLVLHF-EGATMDLPRENYVFEVPDDAGNSIICLAIN-KGDETTIIGNFQQQNMHVL 372

Query: 481 FDLANNRVGFTPNKC 495
           +DL NN + F   +C
Sbjct: 373 YDLQNNMLSFVAAQC 387


>gi|302811785|ref|XP_002987581.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
 gi|300144735|gb|EFJ11417.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
          Length = 511

 Score =  203 bits (517), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 133/388 (34%), Positives = 205/388 (52%), Gaps = 40/388 (10%)

Query: 144 FSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDP 203
           F++PVV+   Q   EY+  + VGTP  +  +++DTGSD++W+QC PC +C     P F+P
Sbjct: 125 FTSPVVT-LGQAGLEYYVPLQVGTPAVEVVLIMDTGSDVSWIQCVPCKDCVPALRPPFNP 183

Query: 204 KTSSSYSPLPCAAPQCKSL-----DVSACRANRCLYQVAYGDGSFTVGDLVTETVS---- 254
           + SSS+  LPCA+  C ++        +     CL+ + YGDGS + G L  ET++    
Sbjct: 184 RHSSSFFKLPCASSTCTNVYQGVKPFCSPSGRTCLFSIQYGDGSLSSGLLAMETIAGNTP 243

Query: 255 -FGNSGSVK--GIALGCGH-DNEGLFVGSAGLLGLGGGMLSLTKQIK---ATSLAYCLVD 307
            FG+   VK   I LGC   D EGL  G++GLLG+    +S   Q+    A   ++C  D
Sbjct: 244 NFGDGEPVKLSNITLGCADIDREGLPTGASGLLGMDRRPISFPSQLSSRYARKFSHCFPD 303

Query: 308 RDSP--ASGVLEFNSARGGDAVT-----APLIRNKKVDT----FYYVGLTGFSVGGQAVQ 356
           + +   +SG++ F  +   D ++      PL++N  V +    +YYVGL G SV    + 
Sbjct: 304 KIAHLNSSGLVFFGES---DIISPYLRYTPLVQNPAVPSASLDYYYVGLVGISVDESRLP 360

Query: 357 IPPSLFEMDEA-GDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCY 415
           +    F++D+  G GG I+D GTA T L+  A+ ++R  F+    +L      + F  CY
Sbjct: 361 LSHKNFDIDKVTGSGGTIIDSGTAFTYLKKPAFQAMRREFLARTSHLAKVDDNSGFTPCY 420

Query: 416 DFS----GLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSA---GTFCFAFAPTSS-ALS 467
           + +     L S  +P+++LHF  G  + LP  + LIPV S+    T C AF  +     +
Sbjct: 421 NITSGTAALESTILPSITLHFRGGLDVVLPKNSILIPVSSSEEQTTLCLAFLMSGDIPFN 480

Query: 468 IIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           IIGN QQQ   V +DL   R+G  P +C
Sbjct: 481 IIGNYQQQNLWVEYDLEKLRLGIAPAQC 508


>gi|359473000|ref|XP_002278677.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 458

 Score =  203 bits (517), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 138/389 (35%), Positives = 196/389 (50%), Gaps = 38/389 (9%)

Query: 141 PEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-P 199
           P+   +PVVSGAS GSG+YF  + +GTPP++  +V DTGSD+ W++C  C  C + +   
Sbjct: 71  PQSLKSPVVSGASTGSGQYFVDLRLGTPPQKLLLVADTGSDLVWVKCSACRNCTRHTPGS 130

Query: 200 IFDPKTSSSYSPLPCAAPQCKSLDVSA---CRANR----CLYQVAYGDGSFTVGDLVTET 252
            F  + S+++SP  C    C+ + +     C   R    C Y+ +YGDGS T G    ET
Sbjct: 131 AFLARHSTTFSPNHCYDSACQLVPLPKHHRCNHARLHSPCRYEYSYGDGSKTSGFFSKET 190

Query: 253 VSF----GNSGSVKGIALGCGHDNEGL------FVGSAGLLGLGGGMLSLTKQIK---AT 299
            +     G    +KGIA GC     G       F G+ G++GLG G +SL+ Q+      
Sbjct: 191 TTLNTSSGREAKLKGIAFGCAFRISGPSVSGASFNGAHGVMGLGRGPISLSSQLGHRFGN 250

Query: 300 SLAYCLVDRD---SPASGVL----EFNSARGGDAVT-APLIRNKKVDTFYYVGLTGFSVG 351
             +YCL+D D   SP S +L    + + A G   +   PL  N    TFYY+G+   SV 
Sbjct: 251 KFSYCLMDHDISPSPTSYLLIGSTQNDVAPGKRRMRFTPLHINPLSPTFYYIGIESVSVD 310

Query: 352 GQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSF---VRLAGNLKPTSGV 408
           G  + I PS++ +DE G+GG IVD GT +T L   AY  +       VRL    +PT G 
Sbjct: 311 GIKLPINPSVWALDELGNGGTIVDSGTTLTFLPEPAYLQILTVIKRRVRLPSPAEPTPG- 369

Query: 409 ALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAP--TSSAL 466
             FD C + S +   R+P +S   G       P +NY +  D     C A     T S  
Sbjct: 370 --FDLCVNVSEIEHPRLPKLSFKLGGDSVFSPPPRNYFVDTDE-DVKCLALQAVMTPSGF 426

Query: 467 SIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           S+IGN+ QQG  + FD    R+GF+ + C
Sbjct: 427 SVIGNLMQQGFLLEFDKDRTRLGFSRHGC 455


>gi|224143371|ref|XP_002324933.1| predicted protein [Populus trichocarpa]
 gi|222866367|gb|EEF03498.1| predicted protein [Populus trichocarpa]
          Length = 463

 Score =  203 bits (516), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 125/395 (31%), Positives = 192/395 (48%), Gaps = 26/395 (6%)

Query: 106 LERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGV 165
           L RD  RV+++I   +          +K           S P    +   + +Y   +G+
Sbjct: 89  LRRDKLRVDSIIQARRSMNLTSSVEHMKS----------SVPFYGLSKITASDYIVNVGI 138

Query: 166 GTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVS 225
           GTP ++  ++ DTGS + W QC+PC  CY +  P+FDP  S+S+  LPC++  C+S+   
Sbjct: 139 GTPKKEMPLIFDTGSGLIWTQCKPCKACYPKV-PVFDPTKSASFKGLPCSSKLCQSIR-Q 196

Query: 226 ACRANRCLYQVAYGDGSFTVGDLVTETVSFGN-SGSVKGIALGCGHDNEGLFVGSAGLLG 284
            C + +C Y  AY D S + G L TET+SF +     K I +GC     G  +G +G++G
Sbjct: 197 GCSSPKCTYLTAYVDNSSSTGTLATETISFSHLKYDFKNILIGCSDQVSGESLGESGIMG 256

Query: 285 LGGGMLSLTKQ---IKATSLAYCLVDRDSPAS-GVLEFNSARGGDAVTAPLIRNKKVDTF 340
           L    +SL  Q   I     +YC+    +P S G L F      D   +P+ +     + 
Sbjct: 257 LNRSPISLASQTANIYDKLFSYCI--PSTPGSTGHLTFGGKVPNDVRFSPVSKTAP-SSD 313

Query: 341 YYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAG 400
           Y + +TG SVGG+ + I  S F++         +D G  +TRL  +AY++LR  F  +  
Sbjct: 314 YDIKMTGISVGGRKLLIDASAFKIAST------IDSGAVLTRLPPKAYSALRSVFREMMK 367

Query: 401 NLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFA 460
                      DTCYDFS   +V +P++S+ F  G  +D+     +  V  +  +C AFA
Sbjct: 368 GYPLLDQDDFLDTCYDFSNYSTVAIPSISVFFEGGVEMDIDVSGIMWQVPGSKVYCLAFA 427

Query: 461 PTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
                +SI GN QQ+   V FD A  R+GF P  C
Sbjct: 428 ELDDEVSIFGNFQQKTYTVVFDGAKERIGFAPGGC 462


>gi|326524580|dbj|BAK00673.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 461

 Score =  203 bits (516), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 139/420 (33%), Positives = 198/420 (47%), Gaps = 31/420 (7%)

Query: 99  RSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGE 158
           R L+   ++R  AR       L +A     R   K A+     E    P V     G  E
Sbjct: 50  RELIRRAMQRSKARA----AALSVARSGSGRVPGKSAQQG---EQHQQPGVPVRPSGDLE 102

Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQ 218
           Y   + +GTPP+  S +LDTGSD+ W QC PC  C  Q DP+F P  SSSY P+ C+   
Sbjct: 103 YLIDLAIGTPPQPVSALLDTGSDLIWTQCAPCASCLAQPDPLFAPAASSSYVPMRCSGQL 162

Query: 219 CKSLDVSAC-RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVK---GIALGCGHDNEG 274
           C  +   +C R + C Y+  YGDG+ T+G   TE  +F +S   K    +  GCG  N G
Sbjct: 163 CNDILHHSCQRPDTCTYRYNYGDGTTTLGVYATERFTFASSSGEKLSVPLGFGCGTMNVG 222

Query: 275 LFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSP----------ASGVLEFNSARGG 324
                +G++G G   LSL  Q+     +YCL    S           + GV E + A  G
Sbjct: 223 SLNNGSGIVGFGRDPLSLVSQLSIRRFSYCLTPYTSTRKSTLMFGSLSDGVFEGDDAATG 282

Query: 325 DAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQ 384
              T  L+++++  TFYYV  TG +VG + ++IP S F +   G GG+IVD GTA+T   
Sbjct: 283 QVQTTRLLQSRQNPTFYYVPFTGVTVGTRRLRIPLSAFALRPDGSGGVIVDSGTALTLFP 342

Query: 385 TQAYNSLRDSF---VRL--AGNLKPTSGVA----LFDTCYDFSGLRSVRVPTVSLHFGAG 435
                 +  +F   +RL    +  P  GV     +       S    V VP ++ HF  G
Sbjct: 343 AAVLTEVLRAFRAQLRLPFTSSSSPDDGVCFATPMAAGGRRASAATVVSVPRMAFHF-QG 401

Query: 436 KALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
             L+LP +NY++     G+ C   A +  + + IGN  QQ  RV +DL    + F P +C
Sbjct: 402 ADLELPRRNYVLDDPRRGSLCILLADSGDSGATIGNFVQQDMRVLYDLEAETLSFAPAQC 461


>gi|255566835|ref|XP_002524401.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
 gi|223536362|gb|EEF38012.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
          Length = 455

 Score =  203 bits (516), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 131/388 (33%), Positives = 193/388 (49%), Gaps = 38/388 (9%)

Query: 142 EDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-PI 200
             F +PV+SGAS GSG+YF  + +GTPP+   +V DTGSD+ W++C PC  C  +S    
Sbjct: 69  NSFRSPVISGASSGSGQYFVSLRIGTPPQTLLLVADTGSDLIWVKCSPCRNCSHRSPGSA 128

Query: 201 FDPKTSSSYSPLPCAAPQCKSL---DVSACRANR----CLYQVAYGDGSFTVGDLVTETV 253
           F  + S++YS + C +PQC+ +     + C   R    C YQ  Y D S T G    E +
Sbjct: 129 FFARHSTTYSAIHCYSPQCQLVPHPHPNPCNRTRLHSPCRYQYTYADSSTTTGFFSKEAL 188

Query: 254 SFGNS-GSVK---GIALGCGHDNEGL------FVGSAGLLGLGGGMLSLTKQIK---ATS 300
           +   S G VK   G++ GCG    G       F G+ G++GLG   +S + Q+     + 
Sbjct: 189 TLNTSTGKVKKLNGLSFGCGFRISGPSLTGASFEGAQGVMGLGRAPISFSSQLGRRFGSK 248

Query: 301 LAYCLVDR--DSPASGVLEFNSA------RGGDAVTAPLIRNKKVDTFYYVGLTGFSVGG 352
            +YCL+D     P +  L    A      + G     PL+ N    TFYY+ + G  V G
Sbjct: 249 FSYCLMDYTLSPPPTSFLTIGGAQNVAVSKKGIMSFTPLLINPLSPTFYYIAIKGVYVNG 308

Query: 353 QAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSF---VRLAGNLKPTSGVA 409
             + I PS++ +D+ G+GG I+D GT +T +   AY  +  +F   V+L    +PT G  
Sbjct: 309 VKLPINPSVWSIDDLGNGGTIIDSGTTLTFITEPAYTEILKAFKKRVKLPSPAEPTPG-- 366

Query: 410 LFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSS--ALS 467
            FD C + SG+    +P +S +   G     P +NY I        C A  P S     S
Sbjct: 367 -FDLCMNVSGVTRPALPRMSFNLAGGSVFSPPPRNYFIETGDQ-IKCLAVQPVSQDGGFS 424

Query: 468 IIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           ++GN+ QQG  + FD   +R+GFT   C
Sbjct: 425 VLGNLMQQGFLLEFDRDKSRLGFTRRGC 452


>gi|242063796|ref|XP_002453187.1| hypothetical protein SORBIDRAFT_04g001340 [Sorghum bicolor]
 gi|241933018|gb|EES06163.1| hypothetical protein SORBIDRAFT_04g001340 [Sorghum bicolor]
          Length = 493

 Score =  203 bits (516), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 155/439 (35%), Positives = 224/439 (51%), Gaps = 32/439 (7%)

Query: 81  SLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQIL 140
           ++PLH R        N     +  RL RD  R   +  KL                 Q  
Sbjct: 63  TVPLHHRHGPCSPLPNKKMPTLEERLHRDKLRAAYIHRKLSRGKKQGGGGAGGDVVVQ-Q 121

Query: 141 PEDFSTPVVSGASQGSGEYFSRIGVGTPP-RQFSMVLDTGSDINWLQCRPC-TECYQQSD 198
               + P   G S  + EY   + +G+PP +  +M++DTGSDI+W++C+PC  +C  Q D
Sbjct: 122 SHAMTVPTTLGTSLDTLEYVITVRLGSPPGKSQTMLIDTGSDISWVRCKPCWQQCRPQVD 181

Query: 199 PIFDPKTSSSYSPLPCAAPQCKSL----DVSACRAN-RCLYQVAYGDGSF-TVGDLVTET 252
           P+FDP  SS+YSP  C++  C  L    + + C ++ +C Y   YGDGS  T G   ++T
Sbjct: 182 PLFDPSLSSTYSPFSCSSAACAQLFQEGNANGCSSSGQCQYIAMYGDGSVGTTGTYSSDT 241

Query: 253 VSFG---NSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKA----TSLAYCL 305
           ++ G   N+  V     GC H   G+   +AGL+GLGGG  SL  Q       T+ +YCL
Sbjct: 242 LALGSNSNTVVVSKFRFGCSHAETGITGLTAGLMGLGGGAQSLVSQTAGTFGTTAFSYCL 301

Query: 306 VDRDSPASGVLEFNSARGGDA--VTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFE 363
               S +SG L   +A    A  V  P++R+ +V  FY V L    VGG+ + IP ++F 
Sbjct: 302 PPTPS-SSGFLTLGAAGTSSAGFVKTPMLRSSQVPAFYGVRLEAIRVGGRQLSIPTTVFS 360

Query: 364 MDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKP---TSGVALFDTCYDFSGL 420
                  G+I+D GT +TRL   AY+SL  +F        P   ++G    DTC+D SG 
Sbjct: 361 ------AGMIMDSGTVVTRLPPTAYSSLSSAFKAGMKQYPPAPSSAGGGFLDTCFDMSGQ 414

Query: 421 RSVRVPTVSLHF-GAGKA-LDLPAKNYLIPVDSAGTFCFAFAPTS--SALSIIGNVQQQG 476
            SV +PTV+L F GAG A ++L A   L+ ++++  FC AF  TS   +  IIGNVQQ+ 
Sbjct: 415 SSVSMPTVALVFSGAGGAVVNLDASGILLQMETSSIFCLAFVATSDDGSTGIIGNVQQRT 474

Query: 477 TRVSFDLANNRVGFTPNKC 495
            +V +D+A   VGF    C
Sbjct: 475 FQVLYDVAGGAVGFKAGAC 493


>gi|108707835|gb|ABF95630.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
          Length = 384

 Score =  202 bits (515), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 135/383 (35%), Positives = 192/383 (50%), Gaps = 27/383 (7%)

Query: 133 KPAEAQILPEDFSTPVVSGASQGS---GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRP 189
           K    ++L    + PV  GA        EY   + +GTPP+   + LDTGS + W QC+P
Sbjct: 6   KARAPRLLSSSATAPVSPGAYDDGVPMTEYLLHLAIGTPPQPVQLTLDTGSVLVWTQCQP 65

Query: 190 CTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLD--VSAC---RANRCLYQVAYGDGSFT 244
           C  C+ QS P +D   SS+++   C + QCK LD  V+ C       C Y  +YGD S T
Sbjct: 66  CAVCFNQSLPYYDASRSSTFALPSCDSTQCK-LDPSVTMCVNQTVQTCAYSYSYGDKSAT 124

Query: 245 VGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGS-AGLLGLGGGMLSLTKQIKATSLAY 303
           +G L  ETVSF    SV G+  GCG +N G+F  +  G+ G G G LSL  Q+K  + ++
Sbjct: 125 IGFLDVETVSFVAGASVPGVVFGCGLNNTGIFRSNETGIAGFGRGPLSLPSQLKVGNFSH 184

Query: 304 CL--VDRDSPASGVLE-----FNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQ 356
           C   V    P++ + +     + + R G   T PLI+N    TFYY+ L G +VG   + 
Sbjct: 185 CFTAVSGRKPSTVLFDLPADLYKNGR-GTVQTTPLIKNPAHPTFYYLSLKGITVGSTRLP 243

Query: 357 IPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSF---VRLAGNLKPTSGVALFDT 413
           +P S F +   G GG I+D GTA T L  + Y  + D F   V+L       +G  L   
Sbjct: 244 VPESAFALKN-GTGGTIIDSGTAFTSLPPRVYRLVHDEFAAHVKLPVVPSNETGPLL--- 299

Query: 414 CYDFSGL-RSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNV 472
           C+    L ++  VP + LHF  G  + LP +NY+      G      A     ++IIGN 
Sbjct: 300 CFSAPPLGKAPHVPKLVLHF-EGATMHLPRENYVFEAKDGGNCSICLAIIEGEMTIIGNF 358

Query: 473 QQQGTRVSFDLANNRVGFTPNKC 495
           QQQ   V +DL N+++ F   KC
Sbjct: 359 QQQNMHVLYDLKNSKLSFVRAKC 381


>gi|222624819|gb|EEE58951.1| hypothetical protein OsJ_10630 [Oryza sativa Japonica Group]
          Length = 440

 Score =  202 bits (515), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 135/383 (35%), Positives = 192/383 (50%), Gaps = 27/383 (7%)

Query: 133 KPAEAQILPEDFSTPVVSGASQGS---GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRP 189
           K    ++L    + PV  GA        EY   + +GTPP+   + LDTGS + W QC+P
Sbjct: 62  KARAPRLLSSSATAPVSPGAYDDGVPMTEYLLHLAIGTPPQPVQLTLDTGSVLVWTQCQP 121

Query: 190 CTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLD--VSAC---RANRCLYQVAYGDGSFT 244
           C  C+ QS P +D   SS+++   C + QCK LD  V+ C       C Y  +YGD S T
Sbjct: 122 CAVCFNQSLPYYDASRSSTFALPSCDSTQCK-LDPSVTMCVNQTVQTCAYSYSYGDKSAT 180

Query: 245 VGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGS-AGLLGLGGGMLSLTKQIKATSLAY 303
           +G L  ETVSF    SV G+  GCG +N G+F  +  G+ G G G LSL  Q+K  + ++
Sbjct: 181 IGFLDVETVSFVAGASVPGVVFGCGLNNTGIFRSNETGIAGFGRGPLSLPSQLKVGNFSH 240

Query: 304 CL--VDRDSPASGVLE-----FNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQ 356
           C   V    P++ + +     + + R G   T PLI+N    TFYY+ L G +VG   + 
Sbjct: 241 CFTAVSGRKPSTVLFDLPADLYKNGR-GTVQTTPLIKNPAHPTFYYLSLKGITVGSTRLP 299

Query: 357 IPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSF---VRLAGNLKPTSGVALFDT 413
           +P S F +   G GG I+D GTA T L  + Y  + D F   V+L       +G  L   
Sbjct: 300 VPESAFALKN-GTGGTIIDSGTAFTSLPPRVYRLVHDEFAAHVKLPVVPSNETGPLL--- 355

Query: 414 CYDFSGL-RSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNV 472
           C+    L ++  VP + LHF  G  + LP +NY+      G      A     ++IIGN 
Sbjct: 356 CFSAPPLGKAPHVPKLVLHF-EGATMHLPRENYVFEAKDGGNCSICLAIIEGEMTIIGNF 414

Query: 473 QQQGTRVSFDLANNRVGFTPNKC 495
           QQQ   V +DL N+++ F   KC
Sbjct: 415 QQQNMHVLYDLKNSKLSFVRAKC 437


>gi|223946005|gb|ACN27086.1| unknown [Zea mays]
          Length = 336

 Score =  202 bits (515), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 119/336 (35%), Positives = 172/336 (51%), Gaps = 19/336 (5%)

Query: 176 LDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQ 235
           +DTGSD+ W QC PC  C  Q  P FD K S++Y  LPC + +C SL   +C    C+YQ
Sbjct: 1   MDTGSDLIWTQCAPCLLCADQPTPYFDVKKSATYRALPCRSSRCASLSSPSCFKKMCVYQ 60

Query: 236 VAYGDGSFTVGDLVTETVSFGNSGSVK----GIALGCGHDNEGLFVGSAGLLGLGGGMLS 291
             YGD + T G L  ET +FG + S K     IA GCG  N G    S+G++G G G LS
Sbjct: 61  YYYGDTASTAGVLANETFTFGAANSTKVRATNIAFGCGSLNAGDLANSSGMVGFGRGPLS 120

Query: 292 LTKQIKATSLAYCLVDRDSPASGVLEF---------NSARGGDAVTAPLIRNKKVDTFYY 342
           L  Q+  +  +YCL    S     L F         N++ G    + P + N  +   Y+
Sbjct: 121 LVSQLGPSRFSYCLTSYLSATPSRLYFGVYANLSSTNTSSGSPVQSTPFVINPALPNMYF 180

Query: 343 VGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNL 402
           + L   S+G + + I P +F +++ G GG+I+D GT+IT LQ  AY ++R   V  A  L
Sbjct: 181 LSLKAISLGTKLLPIDPLVFAINDDGTGGVIIDSGTSITWLQQDAYEAVRRGLVS-AIPL 239

Query: 403 KPTSGVAL-FDTCYDFSGLR--SVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAF 459
              +   +  DTC+ +      +V VP +  HF +     LP +NY++   + G  C   
Sbjct: 240 PAMNDTDIGLDTCFQWPPPPNVTVTVPDLVFHFDSANMTLLP-ENYMLIASTTGYLCLVM 298

Query: 460 APTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           APT    +IIGN QQQ   + +D+ N+ + F P  C
Sbjct: 299 APTGVG-TIIGNYQQQNLHLLYDIGNSFLSFVPAPC 333


>gi|413923782|gb|AFW63714.1| hypothetical protein ZEAMMB73_300584 [Zea mays]
          Length = 458

 Score =  202 bits (515), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 145/429 (33%), Positives = 220/429 (51%), Gaps = 32/429 (7%)

Query: 77  SSSFSLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAE 136
           S+  ++PLH R        +     +  RL RD  R   +  K   A       +++ ++
Sbjct: 52  STGVTVPLHHRYDPCSPVPSKKVPTLEERLRRDQLRAAYIKRKFSGA------GDIEQSD 105

Query: 137 AQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQ 196
           A  +P         G S  + EY   +G+G+P    +M +DTGSD++W+QC+PC++C+ +
Sbjct: 106 AATVPTTL------GTSLSTLEYVITVGIGSPAVTQTMSMDTGSDVSWVQCKPCSQCHSE 159

Query: 197 SDPIFDPKTSSSYSPLPCAAPQCKSLDVS----ACRANRCLYQVAYGDGSFTVGDLVTET 252
            D +FDP +SS+YSP  C++  C  L  S     C +++C Y V YGD S T G   ++T
Sbjct: 160 VDSLFDPSSSSTYSPFSCSSAPCAQLSQSQEGNGCMSSQCQYIVNYGDSSSTTGTYSSDT 219

Query: 253 VSFGNSGSVKGIALGCGHDNEGLFVGSA-GLLGLGGGMLSLTKQIK---ATSLAYCLVDR 308
           ++ G+S ++     GC     G F     GL+GLGGG  SL  Q      T+ +YCL   
Sbjct: 220 LTLGSS-AMTDFQFGCSQSESGGFNDQTDGLMGLGGGAQSLASQTAGTFGTAFSYCL-PP 277

Query: 309 DSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAG 368
            S +SG L   +   G  V  P++R+ ++ T+Y V L    VG Q + +P S+F      
Sbjct: 278 TSGSSGFLTLGTGSSG-FVKTPMLRSTQIPTYYVVLLESIKVGSQQLNLPTSVFS----- 331

Query: 369 DGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTV 428
             G ++D GT ITRL   AY++L  +F        P +   + DTC+DFSG  S+ +PTV
Sbjct: 332 -AGSLMDSGTIITRLPPTAYSALSSAFKAGMQQYPPATPSGILDTCFDFSGQSSISIPTV 390

Query: 429 SLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTS--SALSIIGNVQQQGTRVSFDLANN 486
           +L F  G A+DL     ++ + S+   C AF P    S+L IIGNVQQ+   V +D+   
Sbjct: 391 TLVFSGGAAVDLAFDGIMLEISSS-IRCLAFTPNGDDSSLGIIGNVQQRTFEVLYDVGGG 449

Query: 487 RVGFTPNKC 495
            VGF    C
Sbjct: 450 AVGFKAGAC 458


>gi|413944387|gb|AFW77036.1| hypothetical protein ZEAMMB73_461996 [Zea mays]
          Length = 472

 Score =  202 bits (514), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 144/375 (38%), Positives = 199/375 (53%), Gaps = 33/375 (8%)

Query: 142 EDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPC--TECYQQSDP 199
            D S P   GA+  S EY   +G+GTP  Q ++++DTGSD++W+QC+PC  + CY Q DP
Sbjct: 110 SDVSIPTSLGAAVDSLEYVVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCNSSSCYPQKDP 169

Query: 200 IFDPKTSSSYSPLPCAAPQCKSLDVSA----CR----ANRCLYQVAYGDGSFTVGDLVTE 251
           ++DP  SS+Y+P+PC +  CK L   A    C      + C Y + YG+   TVG   TE
Sbjct: 170 LYDPTASSTYAPVPCDSKACKDLVPDAYDHGCTNSSGTSLCQYGIEYGNRDTTVGVYSTE 229

Query: 252 TVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDR 308
           T++     SVK    GCG   +G F    GLLGLGG   SL  Q   T   + +YCL   
Sbjct: 230 TLTLSPQVSVKDFGFGCGLVQQGTFDLFDGLLGLGGAPESLVSQTAETYGGAFSYCLPPG 289

Query: 309 DSPASGVLEFNSARGGDAVTA----PLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEM 364
           +S  +G L   +    +        PL    +  TFY V LTG SVGG+ + IPP++   
Sbjct: 290 NS-TTGFLALGAPTNNNDTAGFLFTPLHSLPEQATFYLVNLTGVSVGGKPLDIPPTVLS- 347

Query: 365 DEAGDGGIIVDCGTAITRLQTQAYNSLRDSF--VRLAGNLKPTSGVALFDTCYDFSGLRS 422
                GG+I+D GT IT L   AY++LR +F     A  L P +   + DTCY+F+G+ +
Sbjct: 348 -----GGMIIDSGTIITGLPDTAYSALRTAFRTAMSAYPLLPPNNDDVLDTCYNFTGIAN 402

Query: 423 VRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSS--ALSIIGNVQQQGTRVS 480
           V VPTV+L F  G  +DL   + ++  D     C AFA  +S   + IIGNV Q+   V 
Sbjct: 403 VTVPTVALTFDGGATIDLDVPSGVLIQD-----CLAFAGGASDGDVGIIGNVNQRTFEVL 457

Query: 481 FDLANNRVGFTPNKC 495
           +D     VGF P  C
Sbjct: 458 YDSGRGHVGFRPGAC 472


>gi|302822373|ref|XP_002992845.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
 gi|300139393|gb|EFJ06135.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
          Length = 510

 Score =  202 bits (514), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 132/388 (34%), Positives = 205/388 (52%), Gaps = 40/388 (10%)

Query: 144 FSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDP 203
           F++PVV+   Q   EY+  + +GTP  +  +++DTGSD++W+QC PC +C     P F+P
Sbjct: 124 FTSPVVT-LGQAGLEYYVPLQLGTPAVEVVLIMDTGSDVSWIQCVPCKDCVPALRPPFNP 182

Query: 204 KTSSSYSPLPCAAPQCKSL-----DVSACRANRCLYQVAYGDGSFTVGDLVTETVS---- 254
           + SSS+  LPCA+  C ++        +     CL+ + YGDGS + G L  ET++    
Sbjct: 183 RHSSSFFKLPCASSTCTNVYQGVKPFCSPSGRTCLFSIQYGDGSLSSGLLAMETIAGNTP 242

Query: 255 -FGNSGSVK--GIALGCGH-DNEGLFVGSAGLLGLGGGMLSLTKQIK---ATSLAYCLVD 307
            FG+   VK   I LGC   D EGL  G++GLLG+    +S   Q+    A   ++C  D
Sbjct: 243 NFGDGEPVKLSNITLGCADIDREGLPTGASGLLGMDRRPISFPSQLSSRYARKFSHCFPD 302

Query: 308 RDSP--ASGVLEFNSARGGDAVT-----APLIRNKKVDT----FYYVGLTGFSVGGQAVQ 356
           + +   +SG++ F  +   D ++      PL++N  V +    +YYVGL G SV    + 
Sbjct: 303 KIAHLNSSGLVFFGES---DIISPYLRYTPLVQNPAVPSASLDYYYVGLVGISVDESRLP 359

Query: 357 IPPSLFEMDEA-GDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCY 415
           +    F++D+  G GG I+D GTA T L+  A+ ++R  F+    +L      + F  CY
Sbjct: 360 LSHKNFDIDKVTGSGGTIIDSGTAFTYLKKPAFQAMRREFLARTSHLAKVDDNSGFTPCY 419

Query: 416 DFS----GLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSA---GTFCFAFAPTSS-ALS 467
           + +     L S  +P+++LHF  G  + LP  + LIPV S+    T C AF  +     +
Sbjct: 420 NITSGTAALESTILPSITLHFRGGLDVVLPKNSILIPVSSSEEQTTLCLAFQMSGDIPFN 479

Query: 468 IIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           IIGN QQQ   V +DL   R+G  P +C
Sbjct: 480 IIGNYQQQNLWVEYDLEKLRLGIAPAQC 507


>gi|115448349|ref|NP_001047954.1| Os02g0720600 [Oryza sativa Japonica Group]
 gi|45735841|dbj|BAD12876.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
           [Oryza sativa Japonica Group]
 gi|45735967|dbj|BAD12996.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
           [Oryza sativa Japonica Group]
 gi|113537485|dbj|BAF09868.1| Os02g0720600 [Oryza sativa Japonica Group]
 gi|125583492|gb|EAZ24423.1| hypothetical protein OsJ_08176 [Oryza sativa Japonica Group]
 gi|215740989|dbj|BAG97484.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 463

 Score =  201 bits (512), Expect = 5e-49,   Method: Compositional matrix adjust.
 Identities = 135/402 (33%), Positives = 199/402 (49%), Gaps = 29/402 (7%)

Query: 106 LERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGV 165
           L+RD  R   +  K  +        +L+ ++        S P   G+S  + EY   +G+
Sbjct: 79  LKRDQLRAEHIQRKFAMNAAVDGAGDLQQSKV-----SSSVPTKLGSSLDTLEYVISVGL 133

Query: 166 GTPPRQFSMVLDTGSDINWLQCRPCTE--CYQQSDPIFDPKTSSSYSPLPCAAPQCKSLD 223
           GTP    ++ +DTGSD++W+QC PC    CY Q+  +FDP  SS+Y  + CAA +C  L+
Sbjct: 134 GTPAVTQTVTIDTGSDVSWVQCNPCPNPPCYAQTGALFDPAKSSTYRAVSCAAAECAQLE 193

Query: 224 V--SACRAN--RCLYQVAYGDGSFTVGDLVTETVSF-GNSGSVKGIALGCGHDNEGLFVG 278
              + C A    C Y V YGDGS T G    +T++  G S +VKG   GC H   G    
Sbjct: 194 QQGNGCGATNYECQYGVQYGDGSTTNGTYSRDTLTLSGASDAVKGFQFGCSHVESGFSDQ 253

Query: 279 SAGLLGLGGGMLSLTKQIKA---TSLAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNK 335
           + GL+GLGGG  SL  Q  A    S +YCL      +  +           VT  ++R++
Sbjct: 254 TDGLMGLGGGAQSLVSQTAAAYGNSFSYCLPPTSGSSGFLTLGGGGGVSGFVTTRMLRSR 313

Query: 336 KVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSF 395
           ++ TFY   L   +VGG+ + + PS+F        G +VD GT ITRL   AY++L  +F
Sbjct: 314 QIPTFYGARLQDIAVGGKQLGLSPSVFAA------GSVVDSGTIITRLPPTAYSALSSAF 367

Query: 396 VRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTF 455
                  +     ++ DTC+DF+G   + +PTV+L F  G A+DL     +         
Sbjct: 368 KAGMKQYRSAPARSILDTCFDFAGQTQISIPTVALVFSGGAAIDLDPNGIMY------GN 421

Query: 456 CFAFAPTS--SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           C AFA T       IIGNVQQ+   V +D+ ++ +GF    C
Sbjct: 422 CLAFAATGDDGTTGIIGNVQQRTFEVLYDVGSSTLGFRSGAC 463


>gi|293335955|ref|NP_001168399.1| uncharacterized protein LOC100382168 precursor [Zea mays]
 gi|223948009|gb|ACN28088.1| unknown [Zea mays]
 gi|413922066|gb|AFW61998.1| hypothetical protein ZEAMMB73_694403 [Zea mays]
          Length = 507

 Score =  201 bits (511), Expect = 7e-49,   Method: Compositional matrix adjust.
 Identities = 133/384 (34%), Positives = 191/384 (49%), Gaps = 46/384 (11%)

Query: 147 PVVSGASQGSGEYFSRIGVG----TPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFD 202
           P+ SG    +  Y + I +G    +P    ++++DTGSD+ W+QC+PC+ CY Q DP+FD
Sbjct: 132 PLTSGIRLQTLNYVTTISLGGSSGSPAANLTVIVDTGSDLTWVQCKPCSACYAQRDPLFD 191

Query: 203 PKTSSSYSPLPCAAPQCK-----------SLDVSACRANRCLYQVAYGDGSFTVGDLVTE 251
           P  S++Y+ + C A  C            S   +   + +C Y +AYGDGSF+ G L T+
Sbjct: 192 PAGSATYAAVRCNASACADSLRAATGTPGSCGSTGAGSEKCYYALAYGDGSFSRGVLATD 251

Query: 252 TVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS---LAYCLVDR 308
           TV+ G + S+ G   GCG  N GLF G+AGL+GLG   LSL  Q  +      +YCL   
Sbjct: 252 TVALGGA-SLGGFVFGCGLSNRGLFGGTAGLMGLGRTELSLVSQTASRYGGVFSYCLPAA 310

Query: 309 DS-PASGVLEFNSARGGDAV------TAPLIRNKKVDT-----FYYVGLTGFSVGGQAVQ 356
            S  ASG L      GGD        T P+   + +       FY++ +TG +VGG A+ 
Sbjct: 311 TSGDASGSLSLG---GGDDAASSYRNTTPVAYTRMIADPAQPPFYFLNVTGAAVGGTALA 367

Query: 357 IPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRL--AGNLKPTSGVALFDTC 414
                      G   +++D GT ITRL    Y ++R  F+R   A       G ++ DTC
Sbjct: 368 -------AQGLGASNVLIDSGTVITRLAPSVYRAVRAEFMRQFGAAGYPAAPGFSILDTC 420

Query: 415 YDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGT-FCFAFAPTS--SALSIIGN 471
           YD +G   V+VP ++L    G  + + A   L  V   G+  C A A  S      IIGN
Sbjct: 421 YDLTGHDEVKVPLLTLRLEGGADVTVDAAGMLFVVRKDGSQVCLAMASLSYEDETPIIGN 480

Query: 472 VQQQGTRVSFDLANNRVGFTPNKC 495
            QQ+  RV +D   +R+GF    C
Sbjct: 481 YQQKNKRVVYDTLGSRLGFADEDC 504


>gi|449515031|ref|XP_004164553.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 430

 Score =  201 bits (511), Expect = 7e-49,   Method: Compositional matrix adjust.
 Identities = 120/353 (33%), Positives = 180/353 (50%), Gaps = 23/353 (6%)

Query: 155 GSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPC 214
           GSGEY   + +GTPP  +  + DTGSD+ W QC PC +CYQQ  PIF+P  S+S+S +PC
Sbjct: 88  GSGEYLMSVSIGTPPVDYLGIADTGSDLTWAQCLPCLKCYQQLRPIFNPLKSTSFSHVPC 147

Query: 215 AAPQCKSLDVSACRAN-RCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNE 273
               C ++D   C     C Y   YGD +++ GDL  E ++ G+S SVK + +GCGH + 
Sbjct: 148 NTQTCHAVDDGHCGVQGVCDYSYTYGDRTYSKGDLGFEKITIGSS-SVKSV-IGCGHASS 205

Query: 274 GLFVGSAGLLGLGGGMLSLTKQIKATS-----LAYCLVDRDSPASGVLEFNS---ARGGD 325
           G F  ++G++GLGGG LSL  Q+  TS      +YCL    S A+G + F       G  
Sbjct: 206 GGFGFASGVIGLGGGQLSLVSQMSQTSGISRRFSYCLPTLLSHANGKINFGENAVVSGPG 265

Query: 326 AVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQT 385
            V+ PLI    V T+YY+ L   S+G +          M  A  G +I+D GT +T L  
Sbjct: 266 VVSTPLISKNTV-TYYYITLEAISIGNER--------HMAFAKQGNVIIDSGTTLTILPK 316

Query: 386 QAYNSLRDSFVRLAGNLKPTSGVALFDTCYD--FSGLRSVRVPTVSLHFGAGKALD-LPA 442
           + Y+ +  S +++    +        D C+D   +   S+ +P ++ HF  G  ++ LP 
Sbjct: 317 ELYDGVVSSLLKVVKAKRVKDPHGSLDLCFDDGINAAASLGIPVITAHFSGGANVNLLPI 376

Query: 443 KNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
             +    D+        A  ++   IIGN+ Q    + +DL   R+ F P  C
Sbjct: 377 NTFRKVADNVNCLTLKAASPTTEFGIIGNLAQANFLIGYDLEAKRLSFKPTVC 429


>gi|296082170|emb|CBI21175.3| unnamed protein product [Vitis vinifera]
          Length = 386

 Score =  201 bits (510), Expect = 9e-49,   Method: Compositional matrix adjust.
 Identities = 134/401 (33%), Positives = 192/401 (47%), Gaps = 71/401 (17%)

Query: 106 LERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGV 165
           L +D +RV ++ ++L   +       LK ++A +       P  S ++ GSG Y   +G+
Sbjct: 45  LAQDESRVASIQSRLAKNL--AGGSNLKASKATL-------PSKSASTLGSGNYVVTVGL 95

Query: 166 GTPPRQFSMVLDTGSDINWLQCRPCT-ECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDV 224
           G+P R  + + DTGSD+ W QC PC   CYQQ + IFDP TS SYS + C +P C+ L+ 
Sbjct: 96  GSPKRDLTFIFDTGSDLTWTQCEPCVGYCYQQREHIFDPSTSLSYSNVSCDSPSCEKLES 155

Query: 225 S-----ACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGS 279
           +      C ++ CLY + YGDGS+++G    E +S  ++        GCG +N GLF G+
Sbjct: 156 ATGNSPGCSSSTCLYGIRYGDGSYSIGFFAREKLSLTSTDVFNNFQFGCGQNNRGLFGGT 215

Query: 280 AGLLGLGGGMLSL---TKQIKATSLAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNKK 336
           AGLLGL    LSL   T Q      +YCL    S ++G L F S  G             
Sbjct: 216 AGLLGLARNPLSLVSQTAQKYGKVFSYCL-PSSSSSTGYLSFGSGDGDS----------- 263

Query: 337 VDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFV 396
                           +AV+  P                      RL    Y+S++  F 
Sbjct: 264 ----------------KAVKFTP----------------------RLPPTVYSSVQKVFR 285

Query: 397 RLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFC 456
            L  +     GV++ DTCYD S  ++V+VP + L+F  G  +DL A   +I V      C
Sbjct: 286 ELMSDYPRVKGVSILDTCYDLSKYKTVKVPKIILYFSGGAEMDL-APEGIIYVLKVSQVC 344

Query: 457 FAFAPTS--SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            AFA  S    ++IIGNVQQ+   V +D A  RVGF P+ C
Sbjct: 345 LAFAGNSDDDEVAIIGNVQQKTIHVVYDDAEGRVGFAPSGC 385


>gi|54290728|dbj|BAD62398.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
          Length = 486

 Score =  201 bits (510), Expect = 9e-49,   Method: Compositional matrix adjust.
 Identities = 138/375 (36%), Positives = 195/375 (52%), Gaps = 26/375 (6%)

Query: 134 PAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCT-- 191
           PA A  +P+       SG    + E+   +G+GTP +  +++ DTGSD++W+QC+PC   
Sbjct: 125 PAPAVTIPDR------SGTYLDTLEFVVAVGLGTPAQPSALIFDTGSDLSWVQCQPCGSS 178

Query: 192 -ECYQQSDPIFDPKTSSSYSPLPCAAPQCKSL-DVSACRANRCLYQVAYGDGSFTVGDLV 249
             C+ Q DP+FDP  SS+Y+ + C  PQC +  D+ +     CLY V YGDGS T G L 
Sbjct: 179 GHCHPQQDPLFDPSKSSTYAAVHCGEPQCAAAGDLCSEDNTTCLYLVRYGDGSSTTGVLS 238

Query: 250 TETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLV 306
            +T++  +S ++ G   GCG  N G F    GLLGLG G LSL  Q  A+     +YCL 
Sbjct: 239 RDTLALTSSRALTGFPFGCGTRNLGDFGRVDGLLGLGRGELSLPSQAAASFGAVFSYCLP 298

Query: 307 DRDSPASGVLEFNSARGGDAVTA---PLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFE 363
             +S  +G L   +    D   A    ++R  +  +FY+V L    +GG  + +PP++F 
Sbjct: 299 SSNS-TTGYLTIGATPATDTGAAQYTAMLRKPQFPSFYFVELVSIDIGGYVLPVPPAVFT 357

Query: 364 MDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSV 423
                 GG ++D GT +T L  QAY  LRD F        P     + D CYDF+G   V
Sbjct: 358 R-----GGTLLDSGTVLTYLPAQAYALLRDRFRLTMERYTPAPPNDVLDACYDFAGESEV 412

Query: 424 RVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSA---LSIIGNVQQQGTRVS 480
            VP VS  FG G   +L     +I +D     C AFA   +    LSIIGN QQ+   V 
Sbjct: 413 VVPAVSFRFGDGAVFELDFFGVMIFLDE-NVGCLAFAAMDTGGLPLSIIGNTQQRSAEVI 471

Query: 481 FDLANNRVGFTPNKC 495
           +D+A  ++GF P  C
Sbjct: 472 YDVAAEKIGFVPASC 486


>gi|218188634|gb|EEC71061.1| hypothetical protein OsI_02803 [Oryza sativa Indica Group]
          Length = 479

 Score =  201 bits (510), Expect = 9e-49,   Method: Compositional matrix adjust.
 Identities = 135/373 (36%), Positives = 191/373 (51%), Gaps = 36/373 (9%)

Query: 145 STPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPC---TECYQQSDPIF 201
           S P   G+S  + EY   +G+G+P     +V+DTGSD++W+QC PC   + C+  +  +F
Sbjct: 121 SVPTTLGSSLDTLEYVISVGLGSPAMTQRVVIDTGSDVSWVQCEPCPAPSPCHAHAGALF 180

Query: 202 DPKTSSSYSPLPCAAPQCKSL----DVSACRA-NRCLYQVAYGDGSFTVGDLVTETVSFG 256
           DP  SS+Y+   C+A  C  L    + + C A +RC Y V YGDGS T G   ++ ++  
Sbjct: 181 DPAASSTYAAFNCSAAACAQLGDSGEANGCDAKSRCQYIVKYGDGSNTTGTYSSDVLTLS 240

Query: 257 NSGSVKGIALGCGHD--NEGLFVGSAGLLGLGGGMLSLTKQIKA---TSLAYCLVDRDSP 311
            S  V+G   GC H     G+   + GL+GLGG   SL  Q  A    S +YCL    +P
Sbjct: 241 GSDVVRGFQFGCSHAELGAGMDDKTDGLIGLGGDAQSLVSQTAARYGKSFSYCL--PATP 298

Query: 312 A-SGVLEFNSARGGDA------VTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEM 364
           A SG L   +   G         T P++R+KKV T+Y+  L   +VGG+ + + PS+F  
Sbjct: 299 ASSGFLTLGAPASGGGGGASRFATTPMLRSKKVPTYYFAALEDIAVGGKKLGLSPSVFAA 358

Query: 365 DEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVR 424
                 G +VD GT ITRL   AY +L  +F            + + DTC++F+GL  V 
Sbjct: 359 ------GSLVDSGTVITRLPPAAYAALSSAFRAGMTRYARAEPLGILDTCFNFTGLDKVS 412

Query: 425 VPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPT--SSALSIIGNVQQQGTRVSFD 482
           +PTV+L F  G  +DL A   +    S G  C AFAPT    A   IGNVQQ+   V +D
Sbjct: 413 IPTVALVFAGGAVVDLDAHGIV----SGG--CLAFAPTRDDKAFGTIGNVQQRTFEVLYD 466

Query: 483 LANNRVGFTPNKC 495
           +     GF    C
Sbjct: 467 VGGGVFGFRAGAC 479


>gi|326515172|dbj|BAK03499.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 494

 Score =  201 bits (510), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 124/343 (36%), Positives = 172/343 (50%), Gaps = 38/343 (11%)

Query: 173 SMVLDTGSDINWLQCRPCT--ECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVS---AC 227
           ++V+DT SDI W+QC PC   +C+ Q DP++DP  SS+++P+PC +P CK L  S    C
Sbjct: 170 TVVVDTSSDIPWVQCLPCPIPQCHLQKDPLYDPAKSSTFAPIPCGSPACKELGSSYGNGC 229

Query: 228 R--ANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVG-SAGLLG 284
               + C Y V YGDG  T G  VT+T++   +  VK    GC H   G F   +AG+L 
Sbjct: 230 SPTTDECKYIVNYGDGKATTGTYVTDTLTMSPTIVVKDFRFGCSHAVRGSFSNQNAGILA 289

Query: 285 LGGGMLSLTKQIK---ATSLAYCLVDRDS--------PASGVLEFNSARGGDAVTAPLIR 333
           LGGG  SL +Q       + +YC+    S        P    L+F+          PLI+
Sbjct: 290 LGGGRGSLLEQTADAYGNAFSYCIPKPSSAGFLSLGGPVEASLKFS--------YTPLIK 341

Query: 334 NKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRD 393
           NK   TFY V L    V G+ + +PP+ F        G ++D G  +T+L  Q Y +LR 
Sbjct: 342 NKHAPTFYIVHLEAIIVAGKQLAVPPTAFAT------GAVMDSGAVVTQLPPQVYAALRA 395

Query: 394 SFVRLAGNLKPTSG-VALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSA 452
           +F        P +  V   DTCYDF+    V+VP VSL F  G  LDL   + ++     
Sbjct: 396 AFRSAMAAYGPLAAPVRNLDTCYDFTRFPDVKVPKVSLVFAGGATLDLEPASIIL----D 451

Query: 453 GTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           G   FA  P   ++  IGNVQQQ   V +D+   +VGF    C
Sbjct: 452 GCLAFAATPGEESVGFIGNVQQQTYEVLYDVGGGKVGFRRGAC 494


>gi|242092898|ref|XP_002436939.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
 gi|241915162|gb|EER88306.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
          Length = 469

 Score =  200 bits (509), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 150/452 (33%), Positives = 220/452 (48%), Gaps = 38/452 (8%)

Query: 64  ESETAAESFPLN---SSSSFSLPL-HSREILHKTRHNDYRSLVLSR-LERDSARVNTLIT 118
           +SET   +  +N   SS++ S+ L H       +++++  +  +S  L R  AR N +++
Sbjct: 36  DSETVCSASKVNLEPSSATVSMSLVHRYGPCAPSQYSNVPTPSISETLRRSRARTNYIMS 95

Query: 119 KLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDT 178
           +   ++           +A +     + P   G    S EY   +G GTP     +++DT
Sbjct: 96  QASKSMGMGMASTPDDDDAAV-----TIPTRLGGFVDSLEYVVTLGFGTPSVPQVLLMDT 150

Query: 179 GSDINWLQCRPC--TECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLD---VSACRA--NR 231
           GSD++W+QC PC  T+CY Q DP+FDP  SS+Y+P+ C    C+ L     + C +   +
Sbjct: 151 GSDVSWVQCTPCNSTKCYPQKDPLFDPSKSSTYAPIACNTDACRKLGDHYHNGCTSGGTQ 210

Query: 232 CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLS 291
           C Y V Y DGS + G    ET++     +V+    GCG D  G      GLLGLGG  +S
Sbjct: 211 CGYSVEYADGSHSRGVYSNETLTLAPGITVEDFHFGCGRDQRGPSDKYDGLLGLGGAPVS 270

Query: 292 L---TKQIKATSLAYCLVDRDSPASGVLEFNSARGGDA---VTAPLIRNKKVDTFYYVGL 345
           L   T  +   + +YCL   +S A G L   S   G+    V  P+       TFY V +
Sbjct: 271 LVVQTSSVYGGAFSYCLPALNSEA-GFLVLGSPPSGNKSAFVFTPMRHLPGYATFYMVTM 329

Query: 346 TGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPT 405
           TG SVGG+ + IP S F       GG+I+D GT  T L   AYN+L ++ +R A    P 
Sbjct: 330 TGISVGGKPLHIPQSAFR------GGMIIDSGTVDTELPETAYNAL-EAALRKALKAYPL 382

Query: 406 SGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPT--S 463
                FDTCY+F+G  ++ VP V+  F  G  +DL   N ++  D     C AF  +   
Sbjct: 383 VPSDDFDTCYNFTGYSNITVPRVAFTFSGGATIDLDVPNGILVND-----CLAFQESGPD 437

Query: 464 SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
             L IIGNV Q+   V +D     VGF    C
Sbjct: 438 DGLGIIGNVNQRTLEVLYDAGRGNVGFRAGAC 469


>gi|357143854|ref|XP_003573079.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 417

 Score =  200 bits (508), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 122/353 (34%), Positives = 184/353 (52%), Gaps = 18/353 (5%)

Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAP 217
           EY   + +GTPP  F  + DTGSD+ W QC+PC  C+ Q  P++DP  SS++SP+PC++ 
Sbjct: 65  EYLMELAIGTPPVPFVALADTGSDLTWTQCQPCKLCFPQDTPVYDPSASSTFSPVPCSSA 124

Query: 218 QC----KSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNS-----GSVKGIALGC 268
            C    +S + S   ++ C Y  +Y DG+++VG L TET++ G+S      SV  +A GC
Sbjct: 125 TCLPTWRSRNCSN-PSSPCRYIYSYSDGAYSVGILGTETLTIGSSVPGQTVSVGSVAFGC 183

Query: 269 GHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVD-----RDSP-ASGVLEFNSAR 322
           G DN G  + S G +GLG G LSL  Q+     +YCL D      DSP   G L   +  
Sbjct: 184 GTDNGGDSLNSTGTVGLGRGTLSLLAQLGVGKFSYCLTDFFNSTMDSPFFLGTLAELAPG 243

Query: 323 GGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITR 382
            G   + PL+++    + Y+V L G S+G   + IP   F++   G+GG++VD GT  T 
Sbjct: 244 PGTVQSTPLLQSPLNPSRYFVNLQGISLGDVRLPIPNGTFDLRADGNGGMMVDSGTTFTI 303

Query: 383 LQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPA 442
           L    +  + D   +L G   P +  +L   C+  S      +P + LHF  G  + L  
Sbjct: 304 LAKSGFREVVDRVAQLLGQ-PPVNASSLDSPCFP-SPDGEPFMPDLVLHFAGGADMRLHR 361

Query: 443 KNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            NY+   +   +FC     + S  S +GN QQQ  ++ FD+   ++ F P  C
Sbjct: 362 DNYMSYNEDDSSFCLNIVGSPSTWSRLGNFQQQNIQMLFDMTVGQLSFLPTDC 414


>gi|414590468|tpg|DAA41039.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
          Length = 469

 Score =  199 bits (507), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 128/367 (34%), Positives = 193/367 (52%), Gaps = 38/367 (10%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPC-TECYQQSDPIFDPKTSSSYSPLPCA 215
           GEY   + +GTPP  ++ V DTGSD+ W QC PC T+C++Q  P+++P +S+++S LPC 
Sbjct: 110 GEYLMTLAIGTPPLPYAAVADTGSDLIWTQCAPCGTQCFEQPAPLYNPASSTTFSVLPCN 169

Query: 216 APQCKSLDVSACRANR----------CLYQVAYGDGSFTVGDLVTETVSFGNSGS----V 261
           +       +S C              C+Y   YG G +T G   +ET +FG+S +    V
Sbjct: 170 S------SLSMCAGALAGAAPPPGCACMYNQTYGTG-WTAGVQGSETFTFGSSAADQARV 222

Query: 262 KGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLV---DRDSPASGVLEF 318
            G+A GC + +   + GSAGL+GLG G LSL  Q+ A   +YCL    D +S ++ +L  
Sbjct: 223 PGVAFGCSNASSSDWNGSAGLVGLGRGSLSLVSQLGAGRFSYCLTPFQDTNSTSTLLLGP 282

Query: 319 NSARGGDAV-TAPLIRNKK---VDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIV 374
           ++A  G  V + P + +     + T+YY+ LTG S+G +A+ I P  F +   G GG+I+
Sbjct: 283 SAALNGTGVRSTPFVASPARAPMSTYYYLNLTGISLGAKALPISPGAFSLKPDGTGGLII 342

Query: 375 DCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSG--VALFDTCYDFSGLRSVR---VPTVS 429
           D GT IT L   AY  +R +   L   L    G      D C+      S     +P+++
Sbjct: 343 DSGTTITSLANAAYQQVRAAVKSLVTTLPTVDGSDSTGLDLCFALPAPTSAPPAVLPSMT 402

Query: 430 LHFGAGKALDLPAKNYLIPVDSAGTFCFAFA-PTSSALSIIGNVQQQGTRVSFDLANNRV 488
           LHF  G  + LPA +Y+I    +G +C A    T  A+S  GN QQQ   + +D+    +
Sbjct: 403 LHFD-GADMVLPADSYMI--SGSGVWCLAMRNQTDGAMSTFGNYQQQNMHILYDVREETL 459

Query: 489 GFTPNKC 495
            F P KC
Sbjct: 460 SFAPAKC 466


>gi|14532550|gb|AAK64003.1| AT3g61820/F15G16_210 [Arabidopsis thaliana]
          Length = 362

 Score =  199 bits (506), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 128/253 (50%), Positives = 159/253 (62%), Gaps = 23/253 (9%)

Query: 105 RLERDSARVNTLITKLQLAIYNVDRHELK--PAEAQILPEDFSTPVVSGASQGSGEYFSR 162
           RL+RDS RV ++ +   LA  +  R+  K  P  A      FS  V+SG SQGSGEYF R
Sbjct: 86  RLQRDSLRVKSITS---LAAVSTGRNATKRTPRTAG----GFSGAVISGLSQGSGEYFMR 138

Query: 163 IGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSL 222
           +GVGTP     MVLDTGSD+ WLQC PC  CY Q+D IFDPK S +++ +PC +  C+ L
Sbjct: 139 LGVGTPATNVYMVLDTGSDVVWLQCSPCKACYNQTDAIFDPKKSKTFATVPCGSRLCRRL 198

Query: 223 DVSA-C---RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVG 278
           D S+ C   R+  CLYQV+YGDGSFT GD  TET++F +   V  + LGCGHDNEGLFVG
Sbjct: 199 DDSSECVTRRSKTCLYQVSYGDGSFTEGDFSTETLTF-HGARVDHVPLGCGHDNEGLFVG 257

Query: 279 SAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASG------VLEFNSARGGDAVTA 329
           +AGLLGLG G LS   Q K       +YCLVDR S  S       ++  N+A    +V  
Sbjct: 258 AAGLLGLGRGGLSFPSQTKNRYNGKFSYCLVDRTSSGSSSKPPSTIVFGNAAVPKTSVFT 317

Query: 330 PLIRNKKVDTFYY 342
           PL+ N K+DTFYY
Sbjct: 318 PLLTNPKLDTFYY 330


>gi|125555056|gb|EAZ00662.1| hypothetical protein OsI_22683 [Oryza sativa Indica Group]
          Length = 491

 Score =  199 bits (506), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 138/375 (36%), Positives = 193/375 (51%), Gaps = 26/375 (6%)

Query: 134 PAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCT-- 191
           PA A  +P+       SG    + E+   +G+GTP +  +++ DTGSD++W+QC+PC   
Sbjct: 130 PAPAVTIPDR------SGTYLDTLEFVVAVGLGTPAQPSALIFDTGSDLSWVQCQPCGSS 183

Query: 192 -ECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANR-CLYQVAYGDGSFTVGDLV 249
             C+ Q DP+FDP  SS+Y+ + C  PQC +        N  CLY V YGDGS T G L 
Sbjct: 184 GHCHPQQDPLFDPSKSSTYAAVHCGEPQCAAAGGLCSEDNTTCLYLVHYGDGSSTTGVLS 243

Query: 250 TETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLV 306
            +T++  +S ++ G   GCG  N G F    GLLGLG G LSL  Q  A+     +YCL 
Sbjct: 244 RDTLALTSSRALAGFPFGCGTRNLGDFGRVDGLLGLGRGELSLPSQAAASFGAVFSYCLP 303

Query: 307 DRDSPASGVLEFNSARGGDAVTA---PLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFE 363
             +S  +G L   +    D   A    ++R  +  +FY+V L    +GG  + +PP++F 
Sbjct: 304 SSNS-TTGYLTIGATPATDTGAAQYTAMLRKPQFPSFYFVELVSIDIGGYILPVPPAVFT 362

Query: 364 MDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSV 423
                 GG ++D GT +T L  QAY  LRD F        P     + D CYDF+G   V
Sbjct: 363 R-----GGTLLDSGTVLTYLPAQAYELLRDRFRLTMERYTPAPPNDVLDACYDFAGESEV 417

Query: 424 RVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSA---LSIIGNVQQQGTRVS 480
            VP VS  FG G   +L     +I +D     C AFA   +    LSIIGN QQ+   V 
Sbjct: 418 IVPAVSFRFGDGAVFELDFFGVMIFLDE-NVGCLAFAAMDAGGLPLSIIGNTQQRSAEVI 476

Query: 481 FDLANNRVGFTPNKC 495
           +D+A  ++GF P  C
Sbjct: 477 YDVAAEKIGFVPASC 491


>gi|413952720|gb|AFW85369.1| hypothetical protein ZEAMMB73_571116 [Zea mays]
          Length = 451

 Score =  199 bits (505), Expect = 4e-48,   Method: Compositional matrix adjust.
 Identities = 121/381 (31%), Positives = 186/381 (48%), Gaps = 33/381 (8%)

Query: 144 FSTPVVSGASQGSGEYFSRIGVGTP-PRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFD 202
           +  PV + A   SGEY     +GTP P++ ++ +DTGSD+ W QC PC  C+ Q  P+FD
Sbjct: 72  YGQPVTATAVPSSGEYLIHFNIGTPRPQRVALTMDTGSDLVWTQCTPCPVCFDQPFPLFD 131

Query: 203 PKTSSSYSPLPCAAPQCK---SLDVSAC--RANRCLYQVAYGDGSFTVGDLVTETVSFGN 257
           P  SS++  + C  P C+    L VSAC  +  RC Y  +YGD S T G +  +T +F +
Sbjct: 132 PSVSSTFRAVACPDPICRPSSGLSVSACALKTFRCFYLCSYGDKSITAGYIFKDTFTFMS 191

Query: 258 SG-------SVKGIALGCGHDNEGLFVGS-AGLLGLGGGMLSLTKQIKATSLAYCLVDRD 309
                    +V G+A GCG  N G+F  + +G+ G G G LSL  Q++    +YCL   D
Sbjct: 192 PNGEGAPPVAVSGLAFGCGDYNTGVFASNESGIAGFGRGPLSLPSQLRVGRFSYCLTSHD 251

Query: 310 -------------SPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQ 356
                        +P +G+   +S   G   + P+I +    TFYY+ L G +VG   + 
Sbjct: 252 ETESNKTSAVFLGTPPNGLRAHSS---GPFRSTPIIHSPSFPTFYYLSLEGITVGKTRLP 308

Query: 357 IPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRL--AGNLKPTSGVALFDTC 414
           +  S+F + + G GG ++D GT +T      +  L++ FV          TS V      
Sbjct: 309 VDSSVFALKKDGSGGTVIDSGTGVTTFPAAVFEQLKNEFVAQLPLPRYDNTSEVGNLLCF 368

Query: 415 YDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQ 474
               G + V VP +  H  A   +DLP +NY+     +G  C         + +IGN QQ
Sbjct: 369 QRPKGGKQVPVPKLIFHL-ASADMDLPRENYIPEDTDSGVMCLMINGAEVDMVLIGNFQQ 427

Query: 475 QGTRVSFDLANNRVGFTPNKC 495
           Q   + +D+ N+++ F   +C
Sbjct: 428 QNMHIVYDVENSKLLFASAQC 448


>gi|297805038|ref|XP_002870403.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297316239|gb|EFH46662.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 440

 Score =  198 bits (504), Expect = 4e-48,   Method: Compositional matrix adjust.
 Identities = 134/362 (37%), Positives = 195/362 (53%), Gaps = 31/362 (8%)

Query: 153 SQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPL 212
           +  SGEY   I +GTPP     + DTGSD+ W QC+PC +CY Q DP+FDPK SS+Y  +
Sbjct: 88  TSNSGEYLMNISLGTPPFPIMAIADTGSDLLWTQCKPCDDCYTQVDPLFDPKASSTYKDV 147

Query: 213 PCAAPQCKSLDVSA---CRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGS----VKGIA 265
            C++ QC +L+  A      N C Y  +YGD S+T G++  +T++ G++ +    +K I 
Sbjct: 148 SCSSSQCTALENQASCSTEDNTCSYSTSYGDRSYTKGNIAVDTLTLGSTDTRPVQLKNII 207

Query: 266 LGCGHDNEGLF-VGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASGVLEFN-- 319
           +GCGH+N G F    +G++GLGGG +SL  Q+  +     +YCLV   S      + N  
Sbjct: 208 IGCGHNNAGTFNKKGSGIVGLGGGAVSLITQLGDSIDGKFSYCLVPLTSENDRTSKINFG 267

Query: 320 ---SARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDC 376
                 G   V+ PLI  K  +TFYY+ L   SVG + VQ P S      +G+G II+D 
Sbjct: 268 TNAVVSGTGVVSTPLIA-KSQETFYYLTLKSISVGSKEVQYPGS---DSGSGEGNIIIDS 323

Query: 377 GTAITRLQTQAYNSLRD---SFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFG 433
           GT +T L T+ Y+ L D   S +       P +G++L   CY  +G   ++VP +++HF 
Sbjct: 324 GTTLTLLPTEFYSELEDAVASSIDAEKKQDPQTGLSL---CYSATG--DLKVPAITMHFD 378

Query: 434 AGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPN 493
            G  ++L   N  + + S    CFAF   S + SI GNV Q    V +D  +  V F P 
Sbjct: 379 -GADVNLKPSNCFVQI-SEDLVCFAFR-GSPSFSIYGNVAQMNFLVGYDTVSKTVSFKPT 435

Query: 494 KC 495
            C
Sbjct: 436 DC 437


>gi|15242803|ref|NP_198319.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|75130158|sp|Q6XBF8.1|CDR1_ARATH RecName: Full=Aspartic proteinase CDR1; AltName: Full=Protein
           CONSTITUTIVE DISEASE RESISTANCE 1; Flags: Precursor
 gi|37935737|gb|AAP72988.1| CDR1 [Arabidopsis thaliana]
 gi|91806924|gb|ABE66189.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|109946613|gb|ABG48485.1| At5g33340 [Arabidopsis thaliana]
 gi|332006513|gb|AED93896.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 437

 Score =  198 bits (503), Expect = 6e-48,   Method: Compositional matrix adjust.
 Identities = 130/362 (35%), Positives = 188/362 (51%), Gaps = 30/362 (8%)

Query: 153 SQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPL 212
           +  SGEY   + +GTPP     + DTGSD+ W QC PC +CY Q DP+FDPKTSS+Y  +
Sbjct: 84  TSNSGEYLMNVSIGTPPFPIMAIADTGSDLLWTQCAPCDDCYTQVDPLFDPKTSSTYKDV 143

Query: 213 PCAAPQCKSLDVSA---CRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGS----VKGIA 265
            C++ QC +L+  A      N C Y ++YGD S+T G++  +T++ G+S +    +K I 
Sbjct: 144 SCSSSQCTALENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDTLTLGSSDTRPMQLKNII 203

Query: 266 LGCGHDNEGLFVGSAGLLGLGGGM-LSLTKQIKAT---SLAYCLVDRDSPASGVLEFN-- 319
           +GCGH+N G F      +   GG  +SL KQ+  +     +YCLV   S      + N  
Sbjct: 204 IGCGHNNAGTFNKKGSGIVGLGGGPVSLIKQLGDSIDGKFSYCLVPLTSKKDQTSKINFG 263

Query: 320 ---SARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDC 376
                 G   V+ PLI     +TFYY+ L   SVG + +Q         E+ +G II+D 
Sbjct: 264 TNAIVSGSGVVSTPLIAKASQETFYYLTLKSISVGSKQIQY---SGSDSESSEGNIIIDS 320

Query: 377 GTAITRLQTQAYNSLRD---SFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFG 433
           GT +T L T+ Y+ L D   S +       P SG++L   CY  +G   ++VP +++HF 
Sbjct: 321 GTTLTLLPTEFYSELEDAVASSIDAEKKQDPQSGLSL---CYSATG--DLKVPVITMHFD 375

Query: 434 AGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPN 493
            G  + L + N  + V S    CFAF   S + SI GNV Q    V +D  +  V F P 
Sbjct: 376 -GADVKLDSSNAFVQV-SEDLVCFAFR-GSPSFSIYGNVAQMNFLVGYDTVSKTVSFKPT 432

Query: 494 KC 495
            C
Sbjct: 433 DC 434


>gi|108707838|gb|ABF95633.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
          Length = 391

 Score =  198 bits (503), Expect = 7e-48,   Method: Compositional matrix adjust.
 Identities = 137/375 (36%), Positives = 196/375 (52%), Gaps = 28/375 (7%)

Query: 145 STPVVSGASQG---SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIF 201
           S PV  GA      + EY   + +GTPP+   + LDTGSD+ W QC+PC  C+ Q+ P F
Sbjct: 18  SAPVSPGAYDNGVPTTEYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYF 77

Query: 202 DPKTSSSYSPLPCAAPQCKSLDVSACRANR------CLYQVAYGDGSFTVGDLVTETVSF 255
           DP TSS+ S   C +  C+ L V++C + +      C+Y  +YGD S T G L  +  +F
Sbjct: 78  DPSTSSTLSLTSCDSTLCQGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTF 137

Query: 256 -GNSGSVKGIALGCGHDNEGLFVGS-AGLLGLGGGMLSLTKQIKATSLAYCL--VDRDSP 311
            G   SV G+A GCG  N G+F  +  G+ G G G LSL  Q+K  + ++C   +    P
Sbjct: 138 VGAGASVPGVAFGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKVGNFSHCFTTITGAIP 197

Query: 312 ASGVLEFNS---ARGGDAV-TAPLI---RNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEM 364
           ++ +L+  +   + G  AV T PLI   +N+   T YY+ L G +VG   + +P S F +
Sbjct: 198 STVLLDLPADLFSNGQGAVQTTPLIQYAKNEANPTLYYLSLKGITVGSTRLPVPESAFAL 257

Query: 365 DEAGDGGIIVDCGTAITRLQTQAYNSLRDSF-VRLAGNLKPTSGVALFDTCYDFSGLRSV 423
              G GG I+D GT+IT L  Q Y  +RD F  ++   + P +    + TC+        
Sbjct: 258 TN-GTGGTIIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGNATGHY-TCFSAPSQAKP 315

Query: 424 RVPTVSLHFGAGKALDLPAKNYLIPV-DSAGT--FCFAFAPTSSALSIIGNVQQQGTRVS 480
            VP + LHF  G  +DLP +NY+  V D AG    C A        +IIGN QQQ   V 
Sbjct: 316 DVPKLVLHF-EGATMDLPRENYVFEVPDDAGNSIICLAIN-KGDETTIIGNFQQQNMHVL 373

Query: 481 FDLANNRVGFTPNKC 495
           +DL NN + F   +C
Sbjct: 374 YDLQNNMLSFVAAQC 388


>gi|242050430|ref|XP_002462959.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
 gi|241926336|gb|EER99480.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
          Length = 448

 Score =  197 bits (502), Expect = 7e-48,   Method: Compositional matrix adjust.
 Identities = 130/363 (35%), Positives = 197/363 (54%), Gaps = 31/363 (8%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCT--ECYQQSDPIFDPKTSSSYSPLPC 214
           GEY   + +GTPP  +  + DTGSD+ W QC PC+  +C+ Q  P+++P +S+++  LPC
Sbjct: 90  GEYLMTLSIGTPPLSYPAIADTGSDLIWTQCAPCSGDQCFAQPAPLYNPASSTTFGVLPC 149

Query: 215 AAPQCKSLDVSACRAN----RCLYQVAYGDGSFTVGDLVTETVSFGNSGS----VKGIAL 266
            +       V A +A      C+Y   YG G +T G   +ET +FG++ +    V GIA 
Sbjct: 150 NSSLSMCAGVLAGKAPPPGCACMYNQTYGTG-WTAGVQGSETFTFGSAAADQARVPGIAF 208

Query: 267 GCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLV---DRDSPASGVLEFNSARG 323
           GC + +   + GSAGL+GLG G LSL  Q+ A   +YCL    D +S ++ +L  ++A  
Sbjct: 209 GCSNASSSDWNGSAGLVGLGRGSLSLVSQLGAGRFSYCLTPFQDTNSTSTLLLGPSAALN 268

Query: 324 GDAV-TAPLIRNKK---VDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTA 379
           G  V + P + +     + T+YY+ LTG S+G +A+ I P  F +   G GG+I+D GT 
Sbjct: 269 GTGVRSTPFVASPAKAPMSTYYYLNLTGISLGAKALSISPDAFSLKADGTGGLIIDSGTT 328

Query: 380 ITRLQTQAYNSLR---DSFVRL-AGNLKPTSGVALFDTCYDFSGLRSV--RVPTVSLHFG 433
           IT L   AY  +R    S V L A +   ++G+   D CY      S    +P+++LHF 
Sbjct: 329 ITSLVNAAYQQVRAAVQSLVTLPAIDGSDSTGL---DLCYALPTPTSAPPAMPSMTLHFD 385

Query: 434 AGKALDLPAKNYLIPVDSAGTFCFAFA-PTSSALSIIGNVQQQGTRVSFDLANNRVGFTP 492
            G  + LPA +Y+I    +G +C A    T  A+S  GN QQQ   + +D+ N  + F P
Sbjct: 386 -GADMVLPADSYMI--SGSGVWCLAMRNQTDGAMSTFGNYQQQNMHILYDVRNEMLSFAP 442

Query: 493 NKC 495
            KC
Sbjct: 443 AKC 445


>gi|79315693|ref|NP_001030891.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332646353|gb|AEE79874.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 499

 Score =  197 bits (502), Expect = 8e-48,   Method: Compositional matrix adjust.
 Identities = 140/431 (32%), Positives = 205/431 (47%), Gaps = 71/431 (16%)

Query: 102 VLSRLERDSARVNTLITK-LQLAIYNVDRHELKPAEAQILPEDFSTPVVS---------- 150
           VL    RD  R+ TL  + L+    N    + K  + +++    +TPV S          
Sbjct: 100 VLELQIRDLTRIQTLHKRVLEKNNQNTVSQKQKKNDKEVVT---TTPVASSVEEQAGQLV 156

Query: 151 -----GASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKT 205
                G + GSGEYF  + VG+PP+ FS++LDTGSD+NW+QC PC +C+QQ+D       
Sbjct: 157 ATLESGMTLGSGEYFMDVLVGSPPKHFSLILDTGSDLNWIQCLPCYDCFQQND------- 209

Query: 206 SSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFG---NSGS-- 260
                                     C Y   YGD S T GD   ET +     N GS  
Sbjct: 210 -----------------------NQSCPYYYWYGDSSNTTGDFAVETFTVNLTTNGGSSE 246

Query: 261 ---VKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKA---TSLAYCLVDRDSPASG 314
              V+ +  GCGH N GLF G+AGLLGLG G LS + Q+++    S +YCLVDR+S  + 
Sbjct: 247 LYNVENMMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTNV 306

Query: 315 VLEFNSARGGDAVTAPLI--------RNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDE 366
             +       D ++ P +        +   VDTFYYV +    V G+ + IP   + +  
Sbjct: 307 SSKLIFGEDKDLLSHPNLNFTSFVAGKENLVDTFYYVQIKSILVAGEVLNIPEETWNISS 366

Query: 367 AGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPT-SGVALFDTCYDFSGLRSVRV 425
            G GG I+D GT ++     AY  +++     A    P      + D C++ SG+ +V++
Sbjct: 367 DGAGGTIIDSGTTLSYFAEPAYEFIKNKIAEKAKGKYPVYRDFPILDPCFNVSGIHNVQL 426

Query: 426 PTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPT-SSALSIIGNVQQQGTRVSFDLA 484
           P + + F  G   + P +N  I ++     C A   T  SA SIIGN QQQ   + +D  
Sbjct: 427 PELGIAFADGAVWNFPTENSFIWLNE-DLVCLAMLGTPKSAFSIIGNYQQQNFHILYDTK 485

Query: 485 NNRVGFTPNKC 495
            +R+G+ P KC
Sbjct: 486 RSRLGYAPTKC 496


>gi|116831531|gb|ABK28718.1| unknown [Arabidopsis thaliana]
          Length = 438

 Score =  197 bits (502), Expect = 8e-48,   Method: Compositional matrix adjust.
 Identities = 130/362 (35%), Positives = 188/362 (51%), Gaps = 30/362 (8%)

Query: 153 SQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPL 212
           +  SGEY   + +GTPP     + DTGSD+ W QC PC +CY Q DP+FDPKTSS+Y  +
Sbjct: 84  TSNSGEYLMNVSIGTPPFPIMAIADTGSDLLWTQCAPCDDCYTQVDPLFDPKTSSTYKDV 143

Query: 213 PCAAPQCKSLDVSA---CRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGS----VKGIA 265
            C++ QC +L+  A      N C Y ++YGD S+T G++  +T++ G+S +    +K I 
Sbjct: 144 SCSSSQCTALENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDTLTLGSSDTRPMQLKNII 203

Query: 266 LGCGHDNEGLFVGSAGLLGLGGGM-LSLTKQIKAT---SLAYCLVDRDSPASGVLEFN-- 319
           +GCGH+N G F      +   GG  +SL KQ+  +     +YCLV   S      + N  
Sbjct: 204 IGCGHNNAGTFNKKGSGIVGLGGGPVSLIKQLGDSIDGKFSYCLVPLTSKKDQTSKINFG 263

Query: 320 ---SARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDC 376
                 G   V+ PLI     +TFYY+ L   SVG + +Q         E+ +G II+D 
Sbjct: 264 TNAIVSGSGVVSTPLIAKASQETFYYLTLKSISVGSKQIQY---SGSDSESSEGNIIIDS 320

Query: 377 GTAITRLQTQAYNSLRD---SFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFG 433
           GT +T L T+ Y+ L D   S +       P SG++L   CY  +G   ++VP +++HF 
Sbjct: 321 GTTLTLLPTEFYSELEDAVASSIDAEKKQDPQSGLSL---CYSATG--DLKVPVITMHFD 375

Query: 434 AGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPN 493
            G  + L + N  + V S    CFAF   S + SI GNV Q    V +D  +  V F P 
Sbjct: 376 -GADVKLDSSNAFVQV-SEDLVCFAFR-GSPSFSIYGNVAQMNFLVGYDTVSKTVSFKPT 432

Query: 494 KC 495
            C
Sbjct: 433 DC 434


>gi|356546370|ref|XP_003541599.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 434

 Score =  197 bits (501), Expect = 9e-48,   Method: Compositional matrix adjust.
 Identities = 131/359 (36%), Positives = 181/359 (50%), Gaps = 23/359 (6%)

Query: 153 SQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPL 212
           +Q  GEY     VG PP Q   ++DTGSD+ WLQC+PC +CY Q+  IFDP  S++Y  L
Sbjct: 80  TQNDGEYLISYSVGIPPFQLYGIIDTGSDMIWLQCKPCEKCYNQTTRIFDPSKSNTYKIL 139

Query: 213 PCAAPQCKSLDVSACRA-NR--CLYQVAYGDGSFTVGDLVTETVSFG--NSGSVK--GIA 265
           P ++  C+S++ ++C + NR  C Y + YGDGS++ GDL  ET++ G  N  SVK     
Sbjct: 140 PFSSTTCQSVEDTSCSSDNRKMCEYTIYYGDGSYSQGDLSVETLTLGSTNGSSVKFRRTV 199

Query: 266 LGCGHDNEGLFVG-SAGLLGLGGGMLSLTKQIKATS------LAYCLVDRDSPASGVLEF 318
           +GCG +N   F G S+G++GLG G +SL  Q++  S       +YCL    S  S  L F
Sbjct: 200 IGCGRNNTVSFEGKSSGIVGLGNGPVSLINQLRRRSSSIGRKFSYCLASM-SNISSKLNF 258

Query: 319 NSAR--GGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDC 376
             A    GD   +  I       FYY+ L  FSVG   ++   S F   E G+  II+D 
Sbjct: 259 GDAAVVSGDGTVSTPIVTHDPKVFYYLTLEAFSVGNNRIEFTSSSFRFGEKGN--IIIDS 316

Query: 377 GTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGK 436
           GT +T L    Y+ L  +   L    +    +     CY  S    +  P +  HF +G 
Sbjct: 317 GTTLTLLPNDIYSKLESAVADLVELDRVKDPLKQLSLCYR-STFDELNAPVIMAHF-SGA 374

Query: 437 ALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            + L A N  I V+  G  C AF  +S    I GN+ QQ   V +DL    V F P  C
Sbjct: 375 DVKLNAVNTFIEVEQ-GVTCLAFI-SSKIGPIFGNMAQQNFLVGYDLQKKIVSFKPTDC 431


>gi|357143850|ref|XP_003573078.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 431

 Score =  197 bits (501), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 119/355 (33%), Positives = 184/355 (51%), Gaps = 19/355 (5%)

Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAP 217
           EY   + +GTPP  F  + DTGSD+ W QC+PC  C+ Q  P++DP  SS++SP+PC++ 
Sbjct: 76  EYLMELAIGTPPVPFVALADTGSDLTWTQCQPCKLCFPQDTPVYDPSASSTFSPVPCSSA 135

Query: 218 QC----KSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNS-----GSVKGIALGC 268
            C    +S + S   ++ C Y  +Y DG+++ G L TET++ G+S      SV  +A GC
Sbjct: 136 TCLPVLRSRNCST-PSSLCRYGYSYSDGAYSAGILGTETLTLGSSVPGQAVSVSDVAFGC 194

Query: 269 GHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVD-----RDSP-ASGVLEFNSAR 322
           G DN G  + S G +GLG G LSL  Q+     +YCL D      DSP   G L   +  
Sbjct: 195 GTDNGGDSLNSTGTVGLGRGTLSLLAQLGVGKFSYCLTDFFNSTLDSPFLLGTLAELAPG 254

Query: 323 GGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITR 382
            G   + PL+++    + Y V L G ++G   + IP   F++     GG++VD GT  + 
Sbjct: 255 PGAVQSTPLLQSPLNPSRYVVSLQGITLGDVRLPIPNKTFDLHANSTGGMVVDSGTTFSI 314

Query: 383 LQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDF-SGLRSVR-VPTVSLHFGAGKALDL 440
           L    +  + D   ++ G   P +  +L   C+   +G R +  +P + LHF  G  + L
Sbjct: 315 LPESGFRVVVDHVAQVLGQ-PPVNASSLDSPCFPAPAGERQLPFMPDLVLHFAGGADMRL 373

Query: 441 PAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
              NY+       +FC     T+S  S++GN QQQ  ++ FD+   ++ F P  C
Sbjct: 374 HRDNYMSYNQEDSSFCLNIVGTTSTWSMLGNFQQQNIQMLFDMTVGQLSFLPTDC 428


>gi|224099307|ref|XP_002311432.1| predicted protein [Populus trichocarpa]
 gi|222851252|gb|EEE88799.1| predicted protein [Populus trichocarpa]
          Length = 458

 Score =  197 bits (501), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 133/392 (33%), Positives = 192/392 (48%), Gaps = 48/392 (12%)

Query: 146 TPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPC-TEC-YQQSDPIFDP 203
           +P++SGAS GSG+YF  I +G+PP+   +V DTGSD+ W++C  C T C        F  
Sbjct: 70  SPLMSGASSGSGQYFVSIRLGSPPQTLLLVADTGSDLTWVRCSACKTNCSIHPPGSTFLA 129

Query: 204 KTSSSYSPLPCAAPQCKSL---DVSACRANR----CLYQVAYGDGSFTVGDLVTETVSF- 255
           + S+++SP  C +  C+ +   + + C   R    C Y+  Y DGS T G    ET +  
Sbjct: 130 RHSTTFSPTHCFSSLCQLVPQPNPNPCNHTRLHSTCRYEYVYSDGSKTSGFFSKETTTLN 189

Query: 256 ---GNSGSVKGIALGCGHDNEG------LFVGSAGLLGLGGGMLSLTKQIK---ATSLAY 303
              G    +K IA GCG    G       F G++G++GLG G +S   Q+      S +Y
Sbjct: 190 TSSGREMKLKSIAFGCGFHASGPSLIGSSFNGASGVMGLGRGPISFASQLGRRFGRSFSY 249

Query: 304 CLVDR--DSPASGVLEFNSARGGDAVTA-----------PLIRNKKVDTFYYVGLTGFSV 350
           CL+D     P +  L       GD V+            PL+ N +  TFYY+ + G  V
Sbjct: 250 CLLDYTLSPPPTSYLMI-----GDVVSTKKDNKSMMSFTPLLINPEAPTFYYISIKGVFV 304

Query: 351 GGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVAL 410
            G  + I PS++ +DE G+GG ++D GT +T L   AY  +  +F R      PT G A 
Sbjct: 305 DGVKLHIDPSVWSLDELGNGGTVIDSGTTLTFLTEPAYREILSAFKREVKLPSPTPGGAS 364

Query: 411 ----FDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPT---S 463
               FD C + +G+   R P +SL  G       P +NY I + S G  C A  P    S
Sbjct: 365 TRSGFDLCVNVTGVSRPRFPRLSLELGGESLYSPPPRNYFIDI-SEGIKCLAIQPVEAES 423

Query: 464 SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
              S+IGN+ QQG  + FD   +R+GF+   C
Sbjct: 424 GRFSVIGNLMQQGFLLEFDRGKSRLGFSRRGC 455


>gi|413938607|gb|AFW73158.1| aspartic proteinase nepenthesin-2 [Zea mays]
          Length = 478

 Score =  196 bits (499), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 145/371 (39%), Positives = 195/371 (52%), Gaps = 38/371 (10%)

Query: 145 STPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCT---ECYQQSDPIF 201
           + P   G   G+  Y     +GTP    +M +DTGSD++W+QC+PC+    CY Q DP+F
Sbjct: 126 TVPASWGYDIGTLNYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCSAAPSCYSQKDPLF 185

Query: 202 DPKTSSSYSPLPCAAPQCKSLDVSACRANRCL---YQVAYGDGSFTVGDLVTETVSFGNS 258
           DP  SSSY+ +PC  P C  L + A  A       Y V+YGDGS T G   ++T++   S
Sbjct: 186 DPAQSSSYAAVPCGGPVCAGLGIYAASACSAAQCGYVVSYGDGSNTTGVYSSDTLTLSAS 245

Query: 259 GSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS---LAYCLVDRDSPASGV 315
            +V+G   GCGH   GLF G  GLLGLG    SL +Q   T     +YCL  + S A G 
Sbjct: 246 SAVQGFFFGCGHAQSGLFNGVDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPSTA-GY 304

Query: 316 LEFNSARGGDAVTAP------LIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGD 369
           L      GG +  AP      L+ +    T+Y V LTG SVGGQ + +P S F       
Sbjct: 305 LTLG--VGGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGGTV-- 360

Query: 370 GGIIVDCGTAITRLQTQAYNSLRDSFVR-LAGNLKPTS-GVALFDTCYDFSGLRSVRVPT 427
               VD GT +TRL   AY +LR +F   +A    PT+    + DTCY+F+G  +V +P 
Sbjct: 361 ----VDTGTVVTRLPPTAYAALRSAFRSGMASYGYPTAPSNGILDTCYNFAGYGTVTLPN 416

Query: 428 VSLHFGAGKALDLPAKNYLIPVDSAGTF-CFAFAPTSS--ALSIIGNVQQQGTRVSFDLA 484
           V+L FG+G  + L A   L       +F C AFAP+ S   ++I+GNVQQ+   V  D  
Sbjct: 417 VALTFGSGATVTLGADGIL-------SFGCLAFAPSGSDGGMAILGNVQQRSFEVRID-- 467

Query: 485 NNRVGFTPNKC 495
              VGF P+ C
Sbjct: 468 GTSVGFKPSSC 478


>gi|226492465|ref|NP_001150925.1| aspartic proteinase nepenthesin-1 [Zea mays]
 gi|195642996|gb|ACG40966.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
          Length = 472

 Score =  196 bits (499), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 128/368 (34%), Positives = 193/368 (52%), Gaps = 39/368 (10%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPC-TECYQQSDPIFDPKTSSSYSPLPCA 215
           GEY   + +GTPP  ++ V DTGSD+ W QC PC T+C++Q  P+++P +S+++S LPC 
Sbjct: 112 GEYLMTLAIGTPPLPYAAVADTGSDLIWTQCAPCGTQCFEQPAPLYNPASSTTFSVLPCN 171

Query: 216 APQCKSLDVSACRANR----------CLYQVAYGDGSFTVGDLVTETVSFGNSGS----V 261
           +       +S C              C+Y   YG G +T G   +ET +FG+S +    V
Sbjct: 172 S------SLSMCAGALAGAAPPPGCACMYYQTYGTG-WTAGVQGSETFTFGSSAADQARV 224

Query: 262 KGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLV---DRDSPASGVLEF 318
            G+A GC + +   + GSAGL+GLG G LSL  Q+ A   +YCL    D +S ++ +L  
Sbjct: 225 PGVAFGCSNASSSDWNGSAGLVGLGRGSLSLVSQLGAGRFSYCLTPFQDTNSTSTLLLGP 284

Query: 319 NSARGGDAV-TAPLIRNKK---VDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIV 374
           ++A  G  V + P + +     + T+YY+ LTG S+G +A+ I P  F +   G GG+I+
Sbjct: 285 SAALNGTGVRSTPFVASPARAPMSTYYYLNLTGISLGAKALPISPGAFSLKPDGTGGLII 344

Query: 375 DCGTAITRLQTQAYNSLRDSFVRLAGNLKPT---SGVALFDTCYDFSGLRSVR---VPTV 428
           D GT IT L   AY  +R +         PT   S     D C+      S     +P++
Sbjct: 345 DSGTTITSLANAAYQQVRAAVKSQLVTTLPTVDGSDSTGLDLCFALPAPTSAPPAVLPSM 404

Query: 429 SLHFGAGKALDLPAKNYLIPVDSAGTFCFAFA-PTSSALSIIGNVQQQGTRVSFDLANNR 487
           +LHF  G  + LPA +Y+I    +G +C A    T  A+S  GN QQQ   + +D+    
Sbjct: 405 TLHFD-GADMVLPADSYMI--SGSGVWCLAMRNQTDGAMSTFGNYQQQNMHILYDVREET 461

Query: 488 VGFTPNKC 495
           + F P KC
Sbjct: 462 LSFAPAKC 469


>gi|413953788|gb|AFW86437.1| hypothetical protein ZEAMMB73_618532 [Zea mays]
          Length = 469

 Score =  196 bits (497), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 128/367 (34%), Positives = 190/367 (51%), Gaps = 28/367 (7%)

Query: 145 STPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPC--TECYQQSDPIFD 202
           S P   G+S  S EY + +G+GTP    +++LDTGS + W+QC+PC  ++CY Q  P+FD
Sbjct: 115 SVPTQLGSSYDSQEYVATVGLGTPAVPQTLILDTGSSLTWVQCKPCNSSQCYPQRLPLFD 174

Query: 203 PKTSSSYSPLPCAAPQCKSL----DVSACRANR---CLYQVAYGDGSFTVGDLVTETVSF 255
           P TSSSYSP+PC + +C++L    D   C ++    C Y++ YG G+   G+  T+ ++ 
Sbjct: 175 PNTSSSYSPVPCDSQECRALAAGIDGDGCTSDGDWGCAYEIHYGSGATPAGEYSTDALTL 234

Query: 256 GNSGSVKGIALGCGHDNE-GLFVGSAGLLGLGGGMLSLTKQIKATS----LAYCLVDRDS 310
           G    VK    GCGH  + G F  + G+LGLG    SL  Q  A       ++CL     
Sbjct: 235 GPGAIVKRFHFGCGHHQQRGKFDMADGVLGLGRLPQSLAWQASARRGGGVFSHCLPPTGV 294

Query: 311 PASGVLEFNSARGGDA-VTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGD 369
            ++G L   +     A V  PL+       FY +  T  SV GQ + IPP++F       
Sbjct: 295 -STGFLALGAPHDTSAFVFTPLLTMDDQPWFYQLMPTAISVAGQLLDIPPAVFRE----- 348

Query: 370 GGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVS 429
            G+I D GT ++ LQ  AY +LR +F            V   DTC++F+G  +V VPTVS
Sbjct: 349 -GVITDSGTVLSALQETAYTALRTAFRSAMAEYPLAPPVGHLDTCFNFTGYDNVTVPTVS 407

Query: 430 LHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALS-IIGNVQQQGTRVSFDLANNRV 488
           L F  G  + L A + ++ +D     C AF  +    + +IG+V Q+   V +D+   +V
Sbjct: 408 LTFRGGATVHLDASSGVL-MDG----CLAFWSSGDEYTGLIGSVSQRTIEVLYDMPGRKV 462

Query: 489 GFTPNKC 495
           GF    C
Sbjct: 463 GFRTGAC 469


>gi|357153697|ref|XP_003576537.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 474

 Score =  195 bits (496), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 123/377 (32%), Positives = 190/377 (50%), Gaps = 35/377 (9%)

Query: 144 FSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDP 203
              P+ SGA+  +  Y + +G+G    + ++V+DT S++ W+QC+PC  C+ Q DP+FDP
Sbjct: 105 LQVPITSGANLRTLNYVATVGLGAA--EATVVVDTASELTWVQCQPCESCHDQQDPLFDP 162

Query: 204 KTSSSYSPLPCAAPQCKSLDV------SACRANR-----CLYQVAYGDGSFTVGDLVTET 252
            +S SY+ +PC +  C +L V      S C  +      C Y ++Y DGS++ G L  + 
Sbjct: 163 SSSPSYAAVPCNSSSCDALRVAMAAGTSPCADDNEQQPACSYALSYRDGSYSRGVLARDK 222

Query: 253 VSFGNSGSVKGIALGCGHDNEGL-FVGSAGLLGLGGGMLSLTKQIK---ATSLAYCLVDR 308
           +       ++G   GCG  N+G  F G++GL+GLG   +SL  Q         +YCL  R
Sbjct: 223 LRLAGQ-DIEGFVFGCGTSNQGAPFGGTSGLMGLGRSHVSLVSQTMDQFGGVFSYCLPMR 281

Query: 309 DSPASGVLEFNSARGGDAVTAPLIRNKKVDT-------FYYVGLTGFSVGGQAVQIPPSL 361
           +S +SG L           + P++    V         FY++ LTG +VGGQ V+ P   
Sbjct: 282 ESGSSGSLVLGDDSSAYRNSTPIVYTAMVSDSGPLQGPFYFLNLTGITVGGQEVESP--W 339

Query: 362 FEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLR 421
           F       G +I+D GT IT L    YN++R  F+            ++ DTC++ +GL+
Sbjct: 340 FSA-----GRVIIDSGTIITTLVPSVYNAVRAEFLSQLAEYPQAPAFSILDTCFNLTGLK 394

Query: 422 SVRVPTVSLHFGAGKALDLPAKNYLIPVDS-AGTFCFAFAPTSSAL--SIIGNVQQQGTR 478
            V+VP++   F     +++ +K  L  V S A   C A A   S    SIIGN QQ+  R
Sbjct: 395 EVQVPSLKFVFEGSVEVEVDSKGVLYFVSSDASQVCLALASLKSEYDTSIIGNYQQKNLR 454

Query: 479 VSFDLANNRVGFTPNKC 495
           V FD   +++GF    C
Sbjct: 455 VIFDTLGSQIGFAQETC 471


>gi|413938615|gb|AFW73166.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
          Length = 386

 Score =  195 bits (496), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 145/371 (39%), Positives = 194/371 (52%), Gaps = 38/371 (10%)

Query: 145 STPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCT---ECYQQSDPIF 201
           + P   G   G+  Y     +GTP    +M +DTGSD++W+QC+PC     CY Q DP+F
Sbjct: 34  TVPASWGYDIGTLNYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCAAAPSCYSQKDPLF 93

Query: 202 DPKTSSSYSPLPCAAPQCKSLDVSACRANRCL---YQVAYGDGSFTVGDLVTETVSFGNS 258
           DP  SSSY+ +PC  P C  L + A  A       Y V+YGDGS T G   ++T++   S
Sbjct: 94  DPAQSSSYAAVPCGGPVCAGLGIYAASACSAAQCGYVVSYGDGSNTTGVYSSDTLTLSAS 153

Query: 259 GSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS---LAYCLVDRDSPASGV 315
            +V+G   GCGH   GLF G  GLLGLG    SL +Q   T     +YCL  + S A G 
Sbjct: 154 SAVQGFFFGCGHAQSGLFNGVDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPSTA-GY 212

Query: 316 LEFNSARGGDAVTAP------LIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGD 369
           L      GG +  AP      L+ +    T+Y V LTG SVGGQ + +P S F       
Sbjct: 213 LTLG--VGGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGGTV-- 268

Query: 370 GGIIVDCGTAITRLQTQAYNSLRDSFVR-LAGNLKPTS-GVALFDTCYDFSGLRSVRVPT 427
               VD GT +TRL   AY +LR +F   +A    PT+    + DTCY+F+G  +V +P 
Sbjct: 269 ----VDTGTVVTRLPPTAYAALRSAFRSGMASYGYPTAPSNGILDTCYNFAGYGTVTLPN 324

Query: 428 VSLHFGAGKALDLPAKNYLIPVDSAGTF-CFAFAPTSS--ALSIIGNVQQQGTRVSFDLA 484
           V+L FG+G  + L A   L       +F C AFAP+ S   ++I+GNVQQ+   V  D  
Sbjct: 325 VALTFGSGATVTLGADGIL-------SFGCLAFAPSGSDGGMAILGNVQQRSFEVRID-- 375

Query: 485 NNRVGFTPNKC 495
              VGF P+ C
Sbjct: 376 GTSVGFKPSSC 386


>gi|226509408|ref|NP_001141440.1| uncharacterized protein LOC100273550 precursor [Zea mays]
 gi|194704586|gb|ACF86377.1| unknown [Zea mays]
 gi|413938617|gb|AFW73168.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
          Length = 478

 Score =  195 bits (496), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 143/361 (39%), Positives = 191/361 (52%), Gaps = 38/361 (10%)

Query: 155 GSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCT---ECYQQSDPIFDPKTSSSYSP 211
           G+  Y     +GTP    +M +DTGSD++W+QC+PC     CY Q DP+FDP  SSSY+ 
Sbjct: 136 GTLNYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCAAAPSCYSQKDPLFDPAQSSSYAA 195

Query: 212 LPCAAPQCKSLDVSACRANRCL---YQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGC 268
           +PC  P C  L + A  A       Y V+YGDGS T G   ++T++   S +V+G   GC
Sbjct: 196 VPCGGPVCAGLGIYAASACSAAQCGYVVSYGDGSNTTGVYSSDTLTLSASSAVQGFFFGC 255

Query: 269 GHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS---LAYCLVDRDSPASGVLEFNSARGGD 325
           GH   GLF G  GLLGLG    SL +Q   T     +YCL  + S A G L      GG 
Sbjct: 256 GHAQSGLFNGVDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPSTA-GYLTLG--VGGP 312

Query: 326 AVTAP------LIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTA 379
           +  AP      L+ +    T+Y V LTG SVGGQ + +P S F           VD GT 
Sbjct: 313 SGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGGTV------VDTGTV 366

Query: 380 ITRLQTQAYNSLRDSFVR-LAGNLKPTS-GVALFDTCYDFSGLRSVRVPTVSLHFGAGKA 437
           +TRL   AY +LR +F   +A    PT+    + DTCY+F+G  +V +P V+L FG+G  
Sbjct: 367 VTRLPPTAYAALRSAFRSGMASYGYPTAPSNGILDTCYNFAGYGTVTLPNVALTFGSGAT 426

Query: 438 LDLPAKNYLIPVDSAGTF-CFAFAPTSS--ALSIIGNVQQQGTRVSFDLANNRVGFTPNK 494
           + L A   L       +F C AFAP+ S   ++I+GNVQQ+   V  D     VGF P+ 
Sbjct: 427 VTLGADGIL-------SFGCLAFAPSGSDGGMAILGNVQQRSFEVRID--GTSVGFKPSS 477

Query: 495 C 495
           C
Sbjct: 478 C 478


>gi|226508080|ref|NP_001150678.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
 gi|195641018|gb|ACG39977.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
          Length = 450

 Score =  195 bits (496), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 134/381 (35%), Positives = 201/381 (52%), Gaps = 44/381 (11%)

Query: 149 VSGASQGS---GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCT-ECYQQSDPIFDPK 204
           VS  +Q S   GEY   + +GTPP  +  + DTGSD+ W QC PCT +C++Q  P+++P 
Sbjct: 77  VSAPTQNSPTAGEYLMALAIGTPPLPYQAIADTGSDLIWTQCAPCTSQCFRQPTPLYNPS 136

Query: 205 TSSSYSPLPCAAPQCKSLDVSACRANR------------CLYQVAYGDGSFTVGDLVTET 252
           +S++++ LPC +       +S C A              C Y V YG G  +V    +ET
Sbjct: 137 SSTTFAVLPCNS------SLSVCAAALAGTGTAPPPGCACTYNVTYGSGWTSVFQ-GSET 189

Query: 253 VSFGNSGS----VKGIALGCGHDNEGLFVGSA-GLLGLGGGMLSLTKQIKATSLAYCLV- 306
            +FG++ +    V GIA GC   + G    SA GL+GLG G LSL  Q+     +YCL  
Sbjct: 190 FTFGSTPAGQSRVPGIAFGCSTASSGFNASSASGLVGLGRGRLSLVSQLGVPKFSYCLTP 249

Query: 307 --DRDSPASGVLEFNSARGGDA--VTAPLIRNKK---VDTFYYVGLTGFSVGGQAVQIPP 359
             D +S ++ +L  +++  G A   + P + +     ++TFYY+ LTG S+G  A+ IPP
Sbjct: 250 YQDTNSTSTLLLGPSASLNGTAGVSSTPFVASPSTAPMNTFYYLNLTGISLGTTALSIPP 309

Query: 360 SLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVAL--FDTCYDF 417
             F ++  G GG+I+D GT IT L   AY  +R + V L   L  T G A    D C+  
Sbjct: 310 DAFLLNADGTGGLIIDSGTTITLLGNTAYQQVRAAVVSLV-TLPTTDGSAATGLDLCFML 368

Query: 418 SGLRSV--RVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFA-PTSSALSIIGNVQQ 474
               S    +P+++LHF  G  + LPA +Y++  DS G +C A    T   ++I+GN QQ
Sbjct: 369 PSSTSAPPAMPSMTLHFN-GADMVLPADSYMMSDDS-GLWCLAMQNQTDGEVNILGNYQQ 426

Query: 475 QGTRVSFDLANNRVGFTPNKC 495
           Q   + +D+    + F P KC
Sbjct: 427 QNMHILYDIGQETLSFAPAKC 447


>gi|414886964|tpg|DAA62978.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
          Length = 452

 Score =  195 bits (495), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 130/371 (35%), Positives = 196/371 (52%), Gaps = 41/371 (11%)

Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCT-ECYQQSDPIFDPKTSSSYSPLPC 214
           +GEY   + +GTPP  +  + DTGSD+ W QC PCT +C++Q  P+++P +S++++ LPC
Sbjct: 89  AGEYLMALAIGTPPLPYQAIADTGSDLIWTQCAPCTSQCFRQPTPLYNPSSSTTFAVLPC 148

Query: 215 AAPQCKSLDVSACRANR------------CLYQVAYGDGSFTVGDLVTETVSFGNS---- 258
            +       +S C A              C Y V YG G  +V    +ET +FG++    
Sbjct: 149 NS------SLSVCAAALAGTGTAPPPGCACTYNVTYGSGWTSVFQ-GSETFTFGSTPAGH 201

Query: 259 GSVKGIALGCGHDNEGLFVGSA-GLLGLGGGMLSLTKQIKATSLAYCLV---DRDSPASG 314
             V GIA GC   + G    SA GL+GLG G LSL  Q+     +YCL    D +S ++ 
Sbjct: 202 ARVPGIAFGCSTASSGFNASSASGLVGLGRGRLSLVSQLGVPKFSYCLTPYQDTNSTSTL 261

Query: 315 VLEFNSARGGDA--VTAPLIRNKK---VDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGD 369
           +L  +++  G A   + P + +     ++TFYY+ LTG S+G  A+ IPP  F ++  G 
Sbjct: 262 LLGPSASLNGTAGVSSTPFVASPSTAPMNTFYYLNLTGISLGTTALSIPPDAFSLNADGT 321

Query: 370 GGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVA--LFDTCYDFSGLRSV--RV 425
           GG+I+D GT IT L   AY  +R + V L   L  T G A    D C+      S    +
Sbjct: 322 GGLIIDSGTTITLLGNTAYQQVRAAVVSLV-TLPTTDGSADTGLDLCFMLPSSTSAPPAM 380

Query: 426 PTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFA-PTSSALSIIGNVQQQGTRVSFDLA 484
           P+++LHF  G  + LPA +Y++  DS G +C A    T   ++I+GN QQQ   + +D+ 
Sbjct: 381 PSMTLHFN-GADMVLPADSYMMSDDS-GLWCLAMQNQTDGEVNILGNYQQQNMHILYDIG 438

Query: 485 NNRVGFTPNKC 495
              + F P KC
Sbjct: 439 QETLSFAPAKC 449


>gi|449515033|ref|XP_004164554.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 430

 Score =  195 bits (495), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 125/354 (35%), Positives = 183/354 (51%), Gaps = 25/354 (7%)

Query: 155 GSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPC 214
           GSGEY   + +GTPP  +  + DTGSD+ W QC PC +CY+QS PIFDP  S+S+S +PC
Sbjct: 88  GSGEYLMSVSIGTPPVDYIGMADTGSDLMWAQCLPCLKCYKQSRPIFDPLKSTSFSHVPC 147

Query: 215 AAPQCKSLDVSACRAN-RCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNE 273
            +  CK++D S C A   C Y   YGD ++T GDL  E ++ G+S SVK + +GCGH++ 
Sbjct: 148 NSQNCKAIDDSHCGAQGVCDYSYTYGDQTYTKGDLGFEKITIGSS-SVKSV-IGCGHESG 205

Query: 274 GLFVGSAGLLGLGGGMLSLTKQIKATS-----LAYCLVDRDSPASGVLEFNS---ARGGD 325
           G F  ++G++GLGGG LSL  Q+  TS      +YCL    S A+G + F       G  
Sbjct: 206 GGFGFASGVIGLGGGQLSLVSQMSQTSGISRRFSYCLPTLLSHANGKINFGQNAVVSGPG 265

Query: 326 AVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQT 385
            V+ PLI    V T+YYV L   S+G +          M  A  G +I+D GT ++ L  
Sbjct: 266 VVSTPLISKNPV-TYYYVTLEAISIGNER--------HMASAKQGNVIIDSGTTLSFLPK 316

Query: 386 QAYNSLRDSFVRLAGNLKPTSGVALFDTCYD--FSGLRSVRVPTVSLHFGAGKALDLPAK 443
           + Y+ +  S +++    +       +D C+D   +   S  +P ++  F  G  ++L   
Sbjct: 317 ELYDGVVSSLLKVVKAKRVKDPGNFWDLCFDDGINVATSSGIPIITAQFSGGANVNLLPV 376

Query: 444 NYLIPVDSAGTFCFAFAPTS--SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           N    V +    C    P S      IIGN+      + +DL   R+ F P  C
Sbjct: 377 NTFQKV-ANNVNCLTLTPASPTDEFGIIGNLALANFLIGYDLEAKRLSFKPTVC 429


>gi|302758750|ref|XP_002962798.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
 gi|300169659|gb|EFJ36261.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
          Length = 427

 Score =  194 bits (494), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 139/430 (32%), Positives = 213/430 (49%), Gaps = 48/430 (11%)

Query: 109 DSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFS-------TPVVSGASQGSGEYFS 161
           D  R+      LQ A      +   P E+    +DF        + +VSG+S GSG+YF 
Sbjct: 2   DRGRIAAFGRVLQEAAQKNSTNSTLPRESLATIQDFQGEDPALFSRLVSGSSIGSGQYFV 61

Query: 162 RIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD---PIFDPKTSSSYSPLPCAAPQ 218
            + VGTP ++F +++DTGSD+ W+QC P       S    P +D  +SSSY  +PC   +
Sbjct: 62  ELRVGTPAKKFPLIVDTGSDLTWIQCNPPNTTANSSSPPAPWYDKSSSSSYREIPCTDDE 121

Query: 219 CKSLDV---SACRANR---CLYQVAYGDGSFTVGDLVTETVSF----------GNSGS-- 260
           C+ L     S+C       C Y   Y D S T G L  ET+S           GN  +  
Sbjct: 122 CQFLPAPIGSSCSITSPSPCDYTYGYSDQSRTTGILAYETISMKSRKRSGKRAGNHKTRR 181

Query: 261 --VKGIALGCGHDNEGL-FVGSAGLLGLGGGMLSLTKQIKATSL----AYCLVD--RDSP 311
             +K +ALGC  ++ G  F+G++G+LGLG G +SL  Q + T+L    +YCLVD  R S 
Sbjct: 182 IRIKNVALGCSRESVGASFLGASGVLGLGQGPISLATQTRHTALGGIFSYCLVDYLRGSN 241

Query: 312 ASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQ-IPPSLFEMDEAGDG 370
           AS  L             P++RN    +FYYV +TG +V G+ V  I  S + +D  G+ 
Sbjct: 242 ASSFLVMGRTHWRKLAHTPIVRNPAAQSFYYVNVTGVAVDGKPVDGIASSDWGIDGDGNK 301

Query: 371 GIIVDCGTAITRLQTQAYNSLRDSF---VRLAGNLKPTSGVALFDTCYDFSGLRSVRVPT 427
           G I D GT ++ L+  AY+ +  +    + L    +   G   F+ CY+ + +    +P 
Sbjct: 302 GTIFDSGTTLSYLREPAYSKVLGALNASIYLPRAQEIPEG---FELCYNVTRMEK-GMPK 357

Query: 428 VSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAP--TSSALSIIGNVQQQGTRVSFDLAN 485
           + + F  G  ++LP  NY++ V +    C A     T++  +I+GN+ QQ   + +DLA 
Sbjct: 358 LGVEFQGGAVMELPWNNYMVLV-AENVQCVALQKVTTTNGSNILGNLLQQDHHIEYDLAK 416

Query: 486 NRVGFTPNKC 495
            R+GF  + C
Sbjct: 417 ARIGFKWSPC 426


>gi|226508202|ref|NP_001141111.1| hypothetical protein precursor [Zea mays]
 gi|194702684|gb|ACF85426.1| unknown [Zea mays]
 gi|414590469|tpg|DAA41040.1| TPA: hypothetical protein ZEAMMB73_571218 [Zea mays]
          Length = 439

 Score =  194 bits (494), Expect = 7e-47,   Method: Compositional matrix adjust.
 Identities = 139/417 (33%), Positives = 214/417 (51%), Gaps = 36/417 (8%)

Query: 103 LSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSR 162
           L+R+  D +   +   +  L   ++ RH  +   A       S PV    +   GE+   
Sbjct: 32  LTRVHADPSVTASQFVRAALH-RDMHRHNARKLAASSSDGTVSAPV--SPTTVPGEFLMT 88

Query: 163 IGVGTPPRQFSMVLDTGSDINWLQCRPCT-ECYQQSDPIFDPKTSSSYSPLPCAAPQCKS 221
           + +GTPP  F  + DTGSD+ W QC PC+ +C+QQ  P+++P +S+++S LPC +    S
Sbjct: 89  LAIGTPPLPFLAIADTGSDLIWTQCAPCSRQCFQQPTPLYNPSSSTTFSALPCNS----S 144

Query: 222 LDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGS-----VKGIALGCGHDNEGLF 276
           L + A  A  C+Y + YG G +T     TET +FG+S       V GIA GC + + G  
Sbjct: 145 LGLCA-PACACMYNMTYGSG-WTYVFQGTETFTFGSSTPADQVRVPGIAFGCSNASSGFN 202

Query: 277 VGSA-GLLGLGGGMLSLTKQIKATSLAYCLV---DRDSPASGVLEFNSARGGDAVTA--P 330
             SA GL+GLG G LSL  Q+ A   +YCL    D +S ++ +L  +++     V +  P
Sbjct: 203 ASSASGLVGLGRGSLSLVSQLGAPKFSYCLTPYQDTNSTSTLLLGPSASLNDTGVVSSTP 262

Query: 331 LIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNS 390
            + +     +YY+ LTG S+G  A+ IPP+ F +   G GG+I+D GT IT L   AY  
Sbjct: 263 FVASPS-SIYYYLNLTGISLGTTALPIPPNAFSLKADGTGGLIIDSGTTITMLGNTAYQQ 321

Query: 391 LRDSFVRLAGNLKPTSGVAL--FDTCYDFSGLRSV--RVPTVSLHFGAGKALDLPAKNYL 446
           +R + + L   L  T G A    D C++     S    +P+++LHF  G  + LPA NY+
Sbjct: 322 VRAAVLSLV-TLPTTDGSAATGLDLCFELPSSTSAPPSMPSMTLHFD-GADMVLPADNYM 379

Query: 447 I----PVDSAGTFCFAFAPTSS----ALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           +    P   +  +C A    +      +SI+GN QQQ   + +D+    + F P KC
Sbjct: 380 MSLSDPDSDSSLWCLAMQNQTDTDGVVVSILGNYQQQNMHILYDVGKETLSFAPAKC 436


>gi|357514995|ref|XP_003627786.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355521808|gb|AET02262.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 436

 Score =  194 bits (493), Expect = 8e-47,   Method: Compositional matrix adjust.
 Identities = 147/449 (32%), Positives = 212/449 (47%), Gaps = 58/449 (12%)

Query: 71  SFPLNSSSSFSLPLHSREIL----HKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYN 126
           SF     + FS+ L  R+ L    +K   N Y+  V      D+AR +           N
Sbjct: 19  SFSHAQKNGFSVELIHRDSLKSPLYKPTQNKYQYFV------DAARRSI----------N 62

Query: 127 VDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQ 186
              H  K + A I P+    P +       GEY     VGTPP +   ++DTGSDI WLQ
Sbjct: 63  RANHFYKYSLANI-PQSTVIPDI-------GEYLMTYSVGTPPFKLYGIVDTGSDIVWLQ 114

Query: 187 CRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACR-ANRCLYQVAYGDGSFTV 245
           C PC ECY Q+ P+F+P  SSSY  +PC +  C+S++ ++C   N C Y   YGD S + 
Sbjct: 115 CEPCQECYNQTTPMFNPSKSSSYKNIPCPSKLCQSMEDTSCNDKNYCEYSTYYGDNSHSG 174

Query: 246 GDLVTETVSF----GNSGSVKGIALGCGHDNEGLFVG-SAGLLGLGGGMLSLTKQIKATS 300
           GDL  +T++     G + S   I +GCG +N   + G S+G++G G G  S   Q+ +++
Sbjct: 175 GDLSVDTLTLESTNGLTVSFPNIVIGCGTNNILSYEGASSGIVGFGSGPASFITQLGSST 234

Query: 301 ---LAYCL------VDRDSPASGVLEFNSAR--GGDAVTAPLIRNKKVDTFYYVGLTGFS 349
               +YCL       +  S A+  L F  A    GD V    I  K  +TFYY+ L  FS
Sbjct: 235 GGKFSYCLTPLFSVTNIQSNATSKLNFGDAATVSGDGVVTTPILKKDPETFYYLTLEAFS 294

Query: 350 VGGQAVQI---PPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTS 406
           VG + V+I   P      +   +G II+D GT +T L    Y+ L  + V L    +   
Sbjct: 295 VGNRRVEIGGVP------NGDNEGNIIIDSGTTLTSLTKDDYSFLESAVVDLVKLERVDD 348

Query: 407 GVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSAL 466
                + CY          P +++HF  G  +DL   +  + V + G FC AF  +S   
Sbjct: 349 PTQTLNLCYSVKA-EGYDFPIITMHF-KGADVDLHPISTFVSV-ADGVFCLAFE-SSQDH 404

Query: 467 SIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           +I GN+ QQ   V +DL    V F P+ C
Sbjct: 405 AIFGNLAQQNLMVGYDLQQKIVSFKPSDC 433


>gi|225427558|ref|XP_002266614.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
           vinifera]
          Length = 444

 Score =  194 bits (493), Expect = 8e-47,   Method: Compositional matrix adjust.
 Identities = 144/440 (32%), Positives = 204/440 (46%), Gaps = 44/440 (10%)

Query: 84  LHSREILH-KTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQIL-- 140
           +H  E  H + + + + +  +SR    S   N   TK Q       R  L+    + +  
Sbjct: 19  IHFSEHSHAEAKIDGFTTDFISRDSPHSPFYNPSETKYQRLQKAFRRSILRGNHFRAMRA 78

Query: 141 -PEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDP 199
            P D  + V+SG     G Y   I +GTPP     + DTGSD+ W QC PC  CY+Q +P
Sbjct: 79  SPNDIQSDVISGG----GAYLMNISLGTPPVPMLGIADTGSDLIWRQCLPCPNCYEQVEP 134

Query: 200 IFDPKTSSSYSPLPCAAPQCKSL--DVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGN 257
           +FDPK S +Y  L C    C+ L    S    N C Y  +YGD S+T GDL ++T++ G+
Sbjct: 135 LFDPKESETYKTLDCDNEFCQDLGQQGSCDDDNTCTYSYSYGDRSYTRGDLSSDTLTIGS 194

Query: 258 S----GSVKGIALGCGHDNEGLF-----VGSAGLLGLGGGMLSLTKQIKATSLAYCLV-- 306
           +     S  GIA GCGHDN G F            G    ++ L+ ++     +YCLV  
Sbjct: 195 TEGDPASFPGIAFGCGHDNGGTFNEKDGGLIGLGGGPLSLVMQLSSEVGG-QFSYCLVPL 253

Query: 307 DRDSPASGVLEFNSA---RGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIP----- 358
             DS  S  + F  +    G   V+ PLI+    DTFYY+ L G SVG + V        
Sbjct: 254 SSDSTVSSKINFGKSGVVSGSGTVSTPLIKGTP-DTFYYLTLEGLSVGSETVAFKGFSEN 312

Query: 359 ---PSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCY 415
              P+  E     +G II+D GT +T L    Y  +  +     G    T    +F  CY
Sbjct: 313 KSSPAAVE-----EGNIIIDSGTTLTLLPQDFYTDVESALTNAIGGQTTTDPNGIFSLCY 367

Query: 416 DFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQ 475
             S + ++ +PT++ HF  G  + LP  N  + V      CF+  P SS L+I GN+ Q 
Sbjct: 368 --SSVNNLEIPTITAHF-TGADVQLPPLNTFVQV-QEDLVCFSMIP-SSNLAIFGNLAQI 422

Query: 476 GTRVSFDLANNRVGFTPNKC 495
              V +DL NN+V F    C
Sbjct: 423 NFLVGYDLKNNKVSFKQTDC 442


>gi|225427554|ref|XP_002266533.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
           vinifera]
          Length = 447

 Score =  194 bits (493), Expect = 8e-47,   Method: Compositional matrix adjust.
 Identities = 135/371 (36%), Positives = 185/371 (49%), Gaps = 34/371 (9%)

Query: 146 TPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKT 205
           +PV+S     +GEY   I +GTPP     + DTGSD+ W QC+PC  CY+Q +PIFDP  
Sbjct: 86  SPVISN----NGEYLMNISLGTPPVSMHGIADTGSDLLWRQCKPCDSCYEQIEPIFDPAK 141

Query: 206 SSSYSPLPCAAPQCKSL-DVSACR-ANRCLYQVAYGDGSFTVGDLVTETVSFGNSG---- 259
           S +Y  L C    C +L     C   N C+Y  +YGDGS T GDL  +T++ G++     
Sbjct: 142 SKTYQILSCEGKSCSNLGGQGGCSDDNTCIYSYSYGDGSHTSGDLAVDTLTIGSTTGRPV 201

Query: 260 SVKGIALGCGHDNEGLF----VGSAGLLGLGGGMLSLTKQIKATSLAYCLV--DRDSPAS 313
           SV  +  GCGH+N G F     G  GL G    M+S  + +     +YCLV    D   S
Sbjct: 202 SVPKVVFGCGHNNGGTFELHGSGLVGLGGGPLSMISQLRPLIGGRFSYCLVPLGNDPSVS 261

Query: 314 GVLEFNS---ARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAV------QIPPSLFEM 364
             + F S     G  AV+ PL  +++ DTFYY+ L   SVG + +      ++   L + 
Sbjct: 262 SKMHFGSRGIVSGAGAVSTPL-ASRQPDTFYYLTLESMSVGSKKLAYKGFSKVGSPLADA 320

Query: 365 DEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVR 424
           DE   G II+D GT +T L    Y +L  + V   G         +F  CY  S L  +R
Sbjct: 321 DE---GNIIIDSGTTLTLLPQDFYGTLESNVVSAIGGKPVRDPNNVFSLCY--SNLSGLR 375

Query: 425 VPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLA 484
           +PT++ HF  G  L+L   N  + V     FCFA  P S  L+I GN+ Q    V +DL 
Sbjct: 376 IPTITAHF-VGADLELKPLNTFVQVQED-LFCFAMIPVSD-LAIFGNLAQMNFLVGYDLK 432

Query: 485 NNRVGFTPNKC 495
           +  V F P  C
Sbjct: 433 SRTVSFKPTDC 443


>gi|212274713|ref|NP_001130791.1| uncharacterized protein LOC100191895 precursor [Zea mays]
 gi|194690124|gb|ACF79146.1| unknown [Zea mays]
 gi|194708040|gb|ACF88104.1| unknown [Zea mays]
 gi|223950469|gb|ACN29318.1| unknown [Zea mays]
 gi|414885521|tpg|DAA61535.1| TPA: hypothetical protein ZEAMMB73_650724 [Zea mays]
          Length = 500

 Score =  194 bits (493), Expect = 8e-47,   Method: Compositional matrix adjust.
 Identities = 137/421 (32%), Positives = 207/421 (49%), Gaps = 47/421 (11%)

Query: 106 LERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGV 165
           L  D+ARV++L  +++   Y +       AE  +       PV SGA   +  Y + +G+
Sbjct: 93  LSTDAARVSSLQGRIEH--YRLTTTS-SSAEVAVTASKAQVPVSSGARLRTLNYVATVGL 149

Query: 166 GTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLD-- 223
           G    + ++++DT S++ W+QC PC  C+ Q  P+FDP +S SY+ +PC +P C +L   
Sbjct: 150 GGG--EATVIVDTASELTWVQCAPCESCHDQQGPLFDPSSSPSYAAVPCDSPSCDALQQQ 207

Query: 224 --------VSACRANR---CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDN 272
                      C A R   C Y ++Y DGS++ G L  + +S      + G   GCG  N
Sbjct: 208 LATGAGAGAPPCDAGRPAACSYALSYRDGSYSRGVLAHDRLSLAGE-VIDGFVFGCGTSN 266

Query: 273 EG-LFVGSAGLLGLGGGMLSLTKQIK---ATSLAYCL-VDRDSPASGVLEFNSARGGDAV 327
           +G  F G++GL+GLG   LSL  Q         +YCL + R+S ASG L           
Sbjct: 267 QGPPFGGTSGLMGLGRSQLSLVSQTVDQFGGVFSYCLPLSRESDASGSLVLGDDPSAYRN 326

Query: 328 TAPLIRNKKVDT--------FYYVGLTGFSVGGQAVQIPPSLFEMDEAG-DGGIIVDCGT 378
           + P++    V          FY V LTG +VGGQ         E++  G     IVD GT
Sbjct: 327 STPVVYTSMVSNSDPLLQGPFYLVNLTGITVGGQ---------EVESTGFSARAIVDSGT 377

Query: 379 AITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKAL 438
            IT L    YN++R  F+          G ++ DTC++ +GL+ V+VP+++L F  G  +
Sbjct: 378 VITSLVPSVYNAVRAEFMSQLAEYPQAPGFSILDTCFNMTGLKEVQVPSLTLVFDGGAEV 437

Query: 439 DLPAKN--YLIPVDSAGTFCFAFAPTSS--ALSIIGNVQQQGTRVSFDLANNRVGFTPNK 494
           ++ +    Y +  DS+   C A A   S    SIIGN QQ+  RV FD + ++VGF    
Sbjct: 438 EVDSGGVLYFVSSDSS-QVCLAVASLKSEDETSIIGNYQQKNLRVVFDTSASQVGFAQET 496

Query: 495 C 495
           C
Sbjct: 497 C 497


>gi|226493786|ref|NP_001142400.1| uncharacterized protein LOC100274575 [Zea mays]
 gi|194708650|gb|ACF88409.1| unknown [Zea mays]
          Length = 392

 Score =  194 bits (493), Expect = 9e-47,   Method: Compositional matrix adjust.
 Identities = 129/371 (34%), Positives = 196/371 (52%), Gaps = 41/371 (11%)

Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCT-ECYQQSDPIFDPKTSSSYSPLPC 214
           +GEY   + +GTPP  +  + DTGSD+ W QC PCT +C++Q  P+++P +S++++ LPC
Sbjct: 29  AGEYLMALAIGTPPLPYQAIADTGSDLIWTQCAPCTSQCFRQPTPLYNPSSSTTFAVLPC 88

Query: 215 AAPQCKSLDVSACRANR------------CLYQVAYGDGSFTVGDLVTETVSFGNS---- 258
            +       +S C A              C Y V YG G  +V    +ET +FG++    
Sbjct: 89  NS------SLSVCAAALAGTGTAPPPGCACTYNVTYGSGWTSVFQ-GSETFTFGSTPAGH 141

Query: 259 GSVKGIALGCGHDNEGLFVGSA-GLLGLGGGMLSLTKQIKATSLAYCLV---DRDSPASG 314
             V GIA GC   + G    SA GL+GLG G LSL  Q+     +YCL    D +S ++ 
Sbjct: 142 ARVPGIAFGCSTASSGFNASSASGLVGLGRGRLSLVSQLGVPKFSYCLTPYQDTNSTSTL 201

Query: 315 VLEFNSARGGDA--VTAPLIRNKK---VDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGD 369
           +L  +++  G A   + P + +     ++TFYY+ LTG S+G  A+ IPP  F ++  G 
Sbjct: 202 LLGPSASLNGTAGVSSTPFVASPSTAPMNTFYYLNLTGISLGTTALSIPPDAFSLNADGT 261

Query: 370 GGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVA--LFDTCYDFSGLRSV--RV 425
           GG+I+D GT IT L   AY  +R + V L   L  T G A    D C+      S    +
Sbjct: 262 GGLIIDSGTTITLLGNTAYQQVRAAVVSLV-TLPTTDGSADTGLDLCFMLPSSTSAPPAM 320

Query: 426 PTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFA-PTSSALSIIGNVQQQGTRVSFDLA 484
           P+++LHF  G  + LPA +Y++  D +G +C A    T   ++I+GN QQQ   + +D+ 
Sbjct: 321 PSMTLHFN-GADMVLPADSYMM-SDDSGLWCLAMQNQTDGEVNILGNYQQQNMHILYDIG 378

Query: 485 NNRVGFTPNKC 495
              + F P KC
Sbjct: 379 QETLSFAPAKC 389


>gi|357122568|ref|XP_003562987.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 455

 Score =  194 bits (492), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 132/400 (33%), Positives = 199/400 (49%), Gaps = 38/400 (9%)

Query: 129 RHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCR 188
           R +L P+ A         P       G GEY   + +GTPP  +  + DTGSD+ W QC 
Sbjct: 58  REQLAPSSAAAAGLTVGAPTQKDLRNG-GEYIMTLSIGTPPLSYRAIADTGSDLIWTQCA 116

Query: 189 PC--------TECYQQSDPIFDPKTSSSYSPLPCAAP--QCKSL-DVSACRANRCLYQVA 237
           PC         +C++QS  +++P +S+++  LPC +P   C ++   S      C+Y   
Sbjct: 117 PCGDTVTDTDNQCFKQSGCLYNPSSSTTFGVLPCNSPLSMCAAMAGPSPPPGCACMYNQT 176

Query: 238 YGDGSFTVGDLVTETVSFGNSGS-----VKGIALGCGHDNEGLFVGSAGLLGLGGGMLSL 292
           YG G +T G    ET +FG+S +     V  IA GC + +   + GSAGL+GLG G +SL
Sbjct: 177 YGTG-WTAGVQSVETFTFGSSSTPPAVRVPNIAFGCSNASSNDWNGSAGLVGLGRGSMSL 235

Query: 293 TKQIKATSLAYCLV---DRDSPASGVLEFNSARG----GDAVTAPLI---RNKKVDTFYY 342
             Q+ A + +YCL    D +S ++ +L  ++A      G   + P +       + T+YY
Sbjct: 236 VSQLGAGAFSYCLTPFQDANSTSTLLLGPSAAAALKGTGPVRSTPFVAGPSKAPMSTYYY 295

Query: 343 VGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDS-----FVR 397
           + LTG SVG  A+ IPP  F +   G GG+I+D GT IT L   AY  +R +       R
Sbjct: 296 LNLTGISVGETALAIPPDAFSLRADGTGGLIIDSGTTITTLVDSAYQQVRAAVRSLLVTR 355

Query: 398 LAGNLKPTSGVALFDTCYDF-SGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFC 456
           L     P     L D C+   +      +P+++LHF  G  + LP +NY+I    +G +C
Sbjct: 356 LPLAHGPDHSTGL-DLCFALKASTPPPAMPSMTLHFEGGADMVLPVENYMI--LGSGVWC 412

Query: 457 FAFA-PTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            A    T  A+S++GN QQQ   V +D+    + F P  C
Sbjct: 413 LAMRNQTVGAMSMVGNYQQQNIHVLYDVRKETLSFAPAVC 452


>gi|224142013|ref|XP_002324355.1| predicted protein [Populus trichocarpa]
 gi|222865789|gb|EEF02920.1| predicted protein [Populus trichocarpa]
          Length = 477

 Score =  193 bits (491), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 123/352 (34%), Positives = 182/352 (51%), Gaps = 25/352 (7%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCT-ECYQQSDPIFDPKTSSSYSPLPCA 215
           G Y   +G+GTP + F++  DTGSD+ W QC PC   C+ Q+ P FDP TS+SY  + C+
Sbjct: 138 GAYVVTVGLGTPKKDFTLSFDTGSDLTWTQCEPCLGGCFPQNQPKFDPTTSTSYKNVSCS 197

Query: 216 APQCK-----SLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGH 270
           +  CK     +     C +N CLY + YG G +T+G L TET++  +S   K    GC  
Sbjct: 198 SEFCKLIAEGNYPAQDCISNTCLYGIQYGSG-YTIGFLATETLAIASSDVFKNFLFGCSE 256

Query: 271 DNEGLFVGSAGLLGLGGGMLSLTKQI--KATSL-AYCLVDRDSPASGVLEFNSARGGDAV 327
           ++ G F G+ GLLGLG   ++L  Q   K  +L +YCL    S ++G L F       A 
Sbjct: 257 ESRGTFNGTTGLLGLGRSPIALPSQTTNKYKNLFSYCLPASPS-STGHLSFGVEVSQAAK 315

Query: 328 TAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQA 387
           + P+  + K+   Y +   G SV G+ + I  S+           I+D GT  T L +  
Sbjct: 316 STPI--SPKLKQLYGLNTVGISVRGRELPINGSISRT--------IIDSGTTFTFLPSPT 365

Query: 388 YNSLRDSFVRLAGNLKPTSGVALFDTCYDFS--GLRSVRVPTVSLHFGAGKALDLPAKNY 445
           Y++L  +F  +  N   T+G + F  CYDFS  G  ++ +P +S+ F  G  +++     
Sbjct: 366 YSALGSAFREMMANYTLTNGTSSFQPCYDFSNIGNGTLTIPGISIFFEGGVEVEIDVSGI 425

Query: 446 LIPVDSAGTFCFAFAPTS--SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           +IPV+     C AFA T   S  +I GN QQ+   V +D+A   VGF P  C
Sbjct: 426 MIPVNGLKEVCLAFADTGSDSDFAIFGNYQQKTYEVIYDVAKGMVGFAPKGC 477


>gi|326490862|dbj|BAJ90098.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 447

 Score =  193 bits (490), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 135/435 (31%), Positives = 198/435 (45%), Gaps = 51/435 (11%)

Query: 96  NDYRSLVLSRLERDSAR---VNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGA 152
           N +  L       DS R    N L+ ++ L        +L P+ +   P   + PV SG+
Sbjct: 26  NHHHGLRADLTHIDSGRGFTRNELLRRMVLRSRARAAKQLCPSRSGT-PVRVTAPVASGS 84

Query: 153 SQ-GSGEYFSRIGVGTP-PRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYS 210
              G  EY    G+GTP P+Q ++ +DTGSD+ W QCRPC +C+ Q  P FD   S +  
Sbjct: 85  HVVGYTEYLIHFGIGTPRPQQVALEVDTGSDVVWTQCRPCFDCFTQPLPRFDTSASDTVH 144

Query: 211 PLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSG----SVKGIAL 266
            + C  P C++L   AC    C YQV YGD S T+G L  ++ +F   G    +V  +  
Sbjct: 145 GVLCTDPICRALRPHACFLGGCTYQVNYGDNSVTIGQLAKDSFTFDGKGGGKVTVPDLVF 204

Query: 267 GCGHDNEGLF-VGSAGLLGLGGGMLSLTKQIKATSLAYCLVD-----------RDSPASG 314
           GCG  N G F     G+ G G G LSL +Q+  +S +YC                +PA G
Sbjct: 205 GCGQYNTGNFHSNETGIAGFGRGPLSLPRQLGVSSFSYCFTTIFESKSTPVFLGGAPADG 264

Query: 315 VLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIV 374
           +    +   G  ++ P + N     +YY+ L G +VG   + +P S F +   G GG I+
Sbjct: 265 LRAHAT---GPILSTPFLPNHP--EYYYLSLKGITVGKTRLAVPESAFVVKADGSGGTII 319

Query: 375 DCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSG--------------L 420
           D GTAIT      + SL ++FV         + V L  T Y+ +G               
Sbjct: 320 DSGTAITAFPRAVFRSLWEAFV---------AQVPLPHTSYNDTGEPTLQCFSTESVPDA 370

Query: 421 RSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVS 480
             V VP ++LH   G   +LP +NY+     +   C          ++IGN QQQ   + 
Sbjct: 371 SKVPVPKMTLHL-EGADWELPRENYMAEYPDSDQLCVVVLAGDDDRTMIGNFQQQNMHIV 429

Query: 481 FDLANNRVGFTPNKC 495
            DLA N++   P +C
Sbjct: 430 HDLAGNKLVIEPAQC 444


>gi|326503794|dbj|BAK02683.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 456

 Score =  193 bits (490), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 146/433 (33%), Positives = 205/433 (47%), Gaps = 48/433 (11%)

Query: 76  SSSSFSLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPA 135
           SSS  ++PL+ R        +     +L  LE D  R   +  KL               
Sbjct: 59  SSSGTTVPLNHRYGPCSPAPSAKVPTILELLEHDQLRAKYIQRKLS-------------G 105

Query: 136 EAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQ 195
              + P D + P   G++  + EY   +G+G+P    +M++DTGSD++W++C        
Sbjct: 106 TDGLQPLDLTVPTTLGSALDTMEYVITVGIGSPAVTQTMMIDTGSDVSWVRCNS-----T 160

Query: 196 QSDPIFDPKTSSSYSPLPCAAPQCKSL--DVSACRANRCLYQVAYGDGSFTVGDLVTETV 253
               +FDP  S++Y+P  C++  C  L  +   C  + C Y+V YGDGS T G   ++T+
Sbjct: 161 DGLTLFDPSKSTTYAPFSCSSAACAQLGNNGDGCSNSGCQYRVQYGDGSNTTGTYSSDTL 220

Query: 254 SFGNSGSVKGIALGCGHDNEGLFVGSA--GLLGLGGGMLSLTKQIKAT---SLAYCLVDR 308
           +   S +V     GC H  E  F G    GL+GLGG   SL  Q  AT   S +YCL   
Sbjct: 221 ALSASDTVTDFHFGCSHHEED-FDGEKIDGLMGLGGDAQSLVSQTAATYGKSFSYCLPPT 279

Query: 309 DSPASGVLEFNSARG--GDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDE 366
           +   SG L F +  G  G  VT P++R  K  T Y V L   SVGG  + I PS+     
Sbjct: 280 NR-TSGFLTFGAPNGTSGGFVTTPMLRWPKAPTLYGVLLQDISVGGTPLGIQPSVLS--- 335

Query: 367 AGDGGIIVDCGTAITRLQTQAYNSL----RDSFVRLAGNLKPTSGVALFDTCYDFSGLRS 422
               G ++D GT IT L  +AY++L    R S  RL    +  + + + DTCYDF+GL +
Sbjct: 336 ---NGSVMDSGTVITWLPRRAYSALSSAFRSSMTRL--RHQRAAPLGILDTCYDFTGLVN 390

Query: 423 VRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFD 482
           V +P VSL    G  +DL     +I        C AFA TS   SIIGNVQQ+   V  D
Sbjct: 391 VSIPAVSLVLDGGAVVDLDGNGIMI------QDCLAFAATSGD-SIIGNVQQRTFEVLHD 443

Query: 483 LANNRVGFTPNKC 495
           +     GF    C
Sbjct: 444 VGQGVFGFRSGAC 456


>gi|15217764|ref|NP_176663.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|5042418|gb|AAD38257.1|AC006193_13 Hypothetical Protein [Arabidopsis thaliana]
 gi|332196174|gb|AEE34295.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 431

 Score =  192 bits (489), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 123/353 (34%), Positives = 187/353 (52%), Gaps = 22/353 (6%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAA 216
           GEY   I +GTPP     + DTGSD+ W QC PC +CYQQ+ P+FDPK SS+Y  + C++
Sbjct: 84  GEYLMNISIGTPPVPILAIADTGSDLIWTQCNPCEDCYQQTSPLFDPKESSTYRKVSCSS 143

Query: 217 PQCKSLDVSACRA--NRCLYQVAYGDGSFTVGDLVTETVSFGNSG----SVKGIALGCGH 270
            QC++L+ ++C    N C Y + YGD S+T GD+  +TV+ G+SG    S++ + +GCGH
Sbjct: 144 SQCRALEDASCSTDENTCSYTITYGDNSYTKGDVAVDTVTMGSSGRRPVSLRNMIIGCGH 203

Query: 271 DNEGLF-VGSAGLLGLGGGMLSLTKQIKAT---SLAYCLV--DRDSPASGVLEF--NSAR 322
           +N G F    +G++GLGGG  SL  Q++ +     +YCLV    ++  +  + F  N   
Sbjct: 204 ENTGTFDPAGSGIIGLGGGSTSLVSQLRKSINGKFSYCLVPFTSETGLTSKINFGTNGIV 263

Query: 323 GGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITR 382
            GD V +  +  K   T+Y++ L   SVG + +Q   ++F     G+G I++D GT +T 
Sbjct: 264 SGDGVVSTSMVKKDPATYYFLNLEAISVGSKKIQFTSTIF---GTGEGNIVIDSGTTLTL 320

Query: 383 LQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPA 442
           L +  Y  L           +      +   CY  S   S +VP +++HF  G  + L  
Sbjct: 321 LPSNFYYELESVVASTIKAERVQDPDGILSLCYRDSS--SFKVPDITVHFKGGD-VKLGN 377

Query: 443 KNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            N  + V S    CFAFA  +  L+I GN+ Q    V +D  +  V F    C
Sbjct: 378 LNTFVAV-SEDVSCFAFA-ANEQLTIFGNLAQMNFLVGYDTVSGTVSFKKTDC 428


>gi|302758122|ref|XP_002962484.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
 gi|300169345|gb|EFJ35947.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
          Length = 395

 Score =  192 bits (489), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 131/384 (34%), Positives = 199/384 (51%), Gaps = 41/384 (10%)

Query: 148 VVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD---PIFDPK 204
           +VSG+S GSG+YF  + VGTP ++F +++DTGSD+ W+QC P       S    P +D  
Sbjct: 16  LVSGSSIGSGQYFVELRVGTPAKKFPLIIDTGSDLTWIQCNPPNTTANSSSPPAPWYDKS 75

Query: 205 TSSSYSPLPCAAPQCKSLDV---SACRANR---CLYQVAYGDGSFTVGDLVTETVSF--- 255
           +SSSY  +PC   +C  L     S+C       C Y   Y D S T G L  ET+S    
Sbjct: 76  SSSSYREIPCTDDECLFLPAPIGSSCSIKSPSPCDYTYGYSDQSRTTGILAYETISMKSR 135

Query: 256 -------GNSGS----VKGIALGCGHDNEGL-FVGSAGLLGLGGGMLSLTKQIKATSL-- 301
                  GN  +    +K +ALGC  ++ G  F+G++G+LGLG G +SL  Q + T+L  
Sbjct: 136 KRSGKRAGNHKTRTIRIKNVALGCSRESVGASFLGASGVLGLGQGPISLATQTRHTALGG 195

Query: 302 --AYCLVD--RDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQ- 356
             +YCLVD  R S AS  L     R       P++RN    +FYYV +TG +V G+ V  
Sbjct: 196 IFSYCLVDYLRGSNASSFLVMGRTRWRKLAHTPIVRNPAAQSFYYVNVTGVAVDGKPVDG 255

Query: 357 IPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSF---VRLAGNLKPTSGVALFDT 413
           I  S + +D  G+ G I D GT ++ L+  AY+ +  +    + L    +   G   F+ 
Sbjct: 256 IASSDWGIDGDGNKGTIFDSGTTLSYLREPAYSKVLGALNASIYLPRAQEIPEG---FEL 312

Query: 414 CYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAP--TSSALSIIGN 471
           CY+ + +    +P + + F  G  ++LP  NY++ V +    C A     T++  +I+GN
Sbjct: 313 CYNVTRMEK-GMPKLGVEFQGGAVMELPWNNYMVLV-AENVQCVALQKVTTTNGSNILGN 370

Query: 472 VQQQGTRVSFDLANNRVGFTPNKC 495
           + QQ   + +DLA  R+GF  + C
Sbjct: 371 LLQQDHHIEYDLAKARIGFKWSPC 394


>gi|115479237|ref|NP_001063212.1| Os09g0423500 [Oryza sativa Japonica Group]
 gi|50725896|dbj|BAD33424.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
           sativa Japonica Group]
 gi|50726136|dbj|BAD33657.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
           sativa Japonica Group]
 gi|113631445|dbj|BAF25126.1| Os09g0423500 [Oryza sativa Japonica Group]
 gi|215767614|dbj|BAG99842.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222641596|gb|EEE69728.1| hypothetical protein OsJ_29412 [Oryza sativa Japonica Group]
          Length = 473

 Score =  192 bits (488), Expect = 4e-46,   Method: Compositional matrix adjust.
 Identities = 120/369 (32%), Positives = 184/369 (49%), Gaps = 33/369 (8%)

Query: 147 PVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTS 206
           PV SGA   +  Y + +G+G    + ++++DT S++ W+QC PC  C+ Q  P+FDP +S
Sbjct: 115 PVTSGARLRTLNYVATVGLGGG--EATVIVDTASELTWVQCAPCASCHDQQGPLFDPASS 172

Query: 207 SSYSPLPCAAPQCKSLDVSACRAN---------RCLYQVAYGDGSFTVGDLVTETVSFGN 257
            SY+ LPC +  C +L V+   A           C Y ++Y DGS++ G L  + +S   
Sbjct: 173 PSYAVLPCNSSSCDALQVATGSAAGACGGGEQPSCSYTLSYRDGSYSQGVLAHDKLSLAG 232

Query: 258 SGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIK---ATSLAYCLVDRDSPASG 314
              + G   GCG  N+G F G++GL+GLG   LSL  Q         +YCL  ++S +SG
Sbjct: 233 E-VIDGFVFGCGTSNQGPFGGTSGLMGLGRSQLSLISQTMDQFGGVFSYCLPLKESESSG 291

Query: 315 VLEFNSARGGDAVTAPLIRNKKVDT-----FYYVGLTGFSVGGQAVQIPPSLFEMDEAGD 369
            L           + P++    V       FY+V LTG ++GGQ V          E+  
Sbjct: 292 SLVLGDDTSVYRNSTPIVYTTMVSDPVQGPFYFVNLTGITIGGQEV----------ESSA 341

Query: 370 GGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVS 429
           G +IVD GT IT L    YN+++  F+          G ++ DTC++ +G R V++P++ 
Sbjct: 342 GKVIVDSGTIITSLVPSVYNAVKAEFLSQFAEYPQAPGFSILDTCFNLTGFREVQIPSLK 401

Query: 430 LHFGAGKALDLPAKNYLIPVDS-AGTFCFAFAPTSSAL--SIIGNVQQQGTRVSFDLANN 486
             F     +++ +   L  V S +   C A A   S    SIIGN QQ+  RV FD   +
Sbjct: 402 FVFEGNVEVEVDSSGVLYFVSSDSSQVCLALASLKSEYETSIIGNYQQKNLRVIFDTLGS 461

Query: 487 RVGFTPNKC 495
           ++GF    C
Sbjct: 462 QIGFAQETC 470


>gi|18412482|ref|NP_565219.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|19699359|gb|AAL91289.1| At1g79720/F19K16_30 [Arabidopsis thaliana]
 gi|26450464|dbj|BAC42346.1| unknown protein [Arabidopsis thaliana]
 gi|115646741|gb|ABJ17101.1| At1g79720 [Arabidopsis thaliana]
 gi|332198170|gb|AEE36291.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 484

 Score =  192 bits (488), Expect = 4e-46,   Method: Compositional matrix adjust.
 Identities = 137/420 (32%), Positives = 209/420 (49%), Gaps = 49/420 (11%)

Query: 99  RSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGE 158
           R+LVL     D+ RV +L  +L++        E   +E QI       P+ SG    S  
Sbjct: 89  RALVL-----DNIRVQSL--QLKIKAMTSSTTEQSVSETQI-------PLTSGIKLESLN 134

Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQ 218
           Y   + +G   +  S+++DTGSD+ W+QC+PC  CY Q  P++DP  SSSY  + C +  
Sbjct: 135 YIVTVELG--GKNMSLIVDTGSDLTWVQCQPCRSCYNQQGPLYDPSVSSSYKTVFCNSST 192

Query: 219 CKSL-----DVSACRANR------CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALG 267
           C+ L     +   C  N       C Y V+YGDGS+T GDL +E++  G++  ++    G
Sbjct: 193 CQDLVAATSNSGPCGGNNGVVKTPCEYVVSYGDGSYTRGDLASESILLGDT-KLENFVFG 251

Query: 268 CGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASGVLEFNS---- 320
           CG +N+GLF GS+GL+GLG   +SL  Q   T     +YCL   +  ASG L F +    
Sbjct: 252 CGRNNKGLFGGSSGLMGLGRSSVSLVSQTLKTFNGVFSYCLPSLEDGASGSLSFGNDSSV 311

Query: 321 -ARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTA 379
                     PL++N ++ +FY + LTG S+GG  V++  S F        GI++D GT 
Sbjct: 312 YTNSTSVSYTPLVQNPQLRSFYILNLTGASIGG--VELKSSSFGR------GILIDSGTV 363

Query: 380 ITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKAL- 438
           ITRL    Y +++  F++         G ++ DTC++ +    + +P + + F     L 
Sbjct: 364 ITRLPPSIYKAVKIEFLKQFSGFPTAPGYSILDTCFNLTSYEDISIPIIKMIFQGNAELE 423

Query: 439 -DLPAKNYLIPVDSAGTFCFAFAPTS--SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            D+    Y +  D A   C A A  S  + + IIGN QQ+  RV +D    R+G     C
Sbjct: 424 VDVTGVFYFVKPD-ASLVCLALASLSYENEVGIIGNYQQKNQRVIYDTTQERLGIVGENC 482


>gi|449451908|ref|XP_004143702.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
 gi|449529900|ref|XP_004171936.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
          Length = 459

 Score =  192 bits (487), Expect = 4e-46,   Method: Compositional matrix adjust.
 Identities = 129/392 (32%), Positives = 189/392 (48%), Gaps = 48/392 (12%)

Query: 144 FSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTEC-YQQSDPIFD 202
             +P++SGAS GSG+YF  I +GTPP+   +V DTGSD+ W++C  C  C +      F 
Sbjct: 73  LKSPLISGASTGSGQYFVDIRLGTPPQSLLLVADTGSDLVWVKCSACRNCSHHPPSSAFL 132

Query: 203 PKTSSSYSPLPCAAPQCKSLDVSA---CRANR----CLYQVAYGDGSFTVGDLVTETVSF 255
           P+ SSS+SP  C  P C+ L  +    C   R    C +  +Y DGS + G    ET + 
Sbjct: 133 PRHSSSFSPFHCFDPHCRLLPHAPHHLCNHTRLHSPCRFLYSYADGSLSSGFFSKETTTL 192

Query: 256 ----GNSGSVKGIALGCGHDNEG------LFVGSAGLLGLGGGMLSLTKQIK---ATSLA 302
               G+   +KG++ GCG    G       F G+ G++GLG G +S + Q+        +
Sbjct: 193 KSLSGSEIHLKGLSFGCGFRISGPSVSGAQFNGARGVMGLGRGSISFSSQLGRRFGNKFS 252

Query: 303 YCLVDR--DSPASGVLEFNSARGGDAVTAPLIRNKKVD-----------TFYYVGLTGFS 349
           YCL+D     P +  L      GG   + PL    K+            TFYY+ +   +
Sbjct: 253 YCLMDYTLSPPPTSFLMI----GGGLHSLPLTNATKISYTPLQINPLSPTFYYITIHSIT 308

Query: 350 VGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSF---VRLAGNLKPTS 406
           + G  + I P+++E+DE G+GG +VD GT +T L   AY  +  S    V+L    + T 
Sbjct: 309 IDGVKLPINPAVWEIDEQGNGGTVVDSGTTLTYLTKTAYEEVLKSVRRRVKLPNAAELTP 368

Query: 407 GVALFDTCYDFSGL-RSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSS- 464
           G   FD C + SG  R   +P +    G G     P +NY +  +  G  C A     S 
Sbjct: 369 G---FDLCVNASGESRRPSLPRLRFRLGGGAVFAPPPRNYFLETEE-GVMCLAIRAVESG 424

Query: 465 -ALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
              S+IGN+ QQG  + FD   +R+GFT   C
Sbjct: 425 NGFSVIGNLMQQGFLLEFDKEESRLGFTRRGC 456


>gi|125563764|gb|EAZ09144.1| hypothetical protein OsI_31414 [Oryza sativa Indica Group]
          Length = 472

 Score =  192 bits (487), Expect = 4e-46,   Method: Compositional matrix adjust.
 Identities = 120/370 (32%), Positives = 185/370 (50%), Gaps = 35/370 (9%)

Query: 147 PVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTS 206
           PV SGA   +  Y + +G+G    + ++++DT S++ W+QC PC  C+ Q  P+FDP +S
Sbjct: 114 PVTSGARLRTLNYVATVGLGGG--EATVIVDTASELTWVQCAPCASCHDQQGPLFDPASS 171

Query: 207 SSYSPLPCAAPQCKSLDVSACRAN---------RCLYQVAYGDGSFTVGDLVTETVSFGN 257
            SY+ LPC +  C +L V+   A           C Y ++Y DGS++ G L  + +S   
Sbjct: 172 PSYAVLPCNSSSCDALQVATGSAAGACGGGEQPSCSYTLSYRDGSYSQGVLAHDKLSLAG 231

Query: 258 SGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIK---ATSLAYCLVDRDSPASG 314
              + G   GCG  N+G F G++GL+GLG   LSL  Q         +YCL  ++S +SG
Sbjct: 232 E-VIDGFVFGCGTSNQGPFGGTSGLMGLGRSQLSLISQTMDQFGGVFSYCLPLKESESSG 290

Query: 315 VLEFNSARGGDAVTAPLIRNKKVDT-----FYYVGLTGFSVGGQAVQIPPSLFEMDEAGD 369
            L           + P++    V       FY+V LTG ++GGQ V          E+  
Sbjct: 291 SLVLGDDTSVYRNSTPIVYTTMVSDPVQGPFYFVNLTGITIGGQEV----------ESSA 340

Query: 370 GGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVS 429
           G +IVD GT IT L    YN+++  F+          G ++ DTC++ +G R V++P++ 
Sbjct: 341 GKVIVDSGTIITSLVPSVYNAVKAEFLSQFAEYPQAPGFSILDTCFNLTGFREVQIPSLK 400

Query: 430 LHFGAGKALDLPAKN--YLIPVDSAGTFCFAFAPTSSAL--SIIGNVQQQGTRVSFDLAN 485
             F     +++ +    Y +  DS+   C A A   S    SIIGN QQ+  RV FD   
Sbjct: 401 FVFEGNVEVEVDSSGVLYFVSSDSS-QVCLALASLKSEYETSIIGNYQQKNLRVIFDTLG 459

Query: 486 NRVGFTPNKC 495
           +++GF    C
Sbjct: 460 SQIGFAQETC 469


>gi|242044724|ref|XP_002460233.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
 gi|241923610|gb|EER96754.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
          Length = 512

 Score =  192 bits (487), Expect = 5e-46,   Method: Compositional matrix adjust.
 Identities = 124/376 (32%), Positives = 191/376 (50%), Gaps = 34/376 (9%)

Query: 147 PVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTS 206
           PV SGA   +  Y + +G+G    + ++++DT S++ W+QC PC  C+ Q DP+FDP +S
Sbjct: 141 PVTSGAKLRTLNYVATVGLGGG--EATVIVDTASELTWVQCAPCESCHDQQDPLFDPSSS 198

Query: 207 SSYSPLPCAAPQCKSLDV---------SACR-----ANRCLYQVAYGDGSFTVGDLVTET 252
            SY+ +PC +  C +L +         +AC+     A  C Y ++Y DGS++ G L  + 
Sbjct: 199 PSYAAVPCNSSSCDALQLATGGTSGGAAACQGQDQSAAACSYTLSYRDGSYSRGVLAHDR 258

Query: 253 VSFGNSGSVKGIALGCGHDNEG-LFVGSAGLLGLGGGMLSLTKQIK---ATSLAYCLVDR 308
           +S      + G   GCG  N+G  F G++GL+GLG   LSL  Q         +YCL  +
Sbjct: 259 LSLAGE-VIDGFVFGCGTSNQGPPFGGTSGLMGLGRSQLSLVSQTMDQFGGVFSYCLPLK 317

Query: 309 DSPASGVLEFNSARGGDAVTAPLIRNKKVDT-----FYYVGLTGFSVGGQAVQIPPSLFE 363
           +S +SG L           + P++    V       FY+V LTG +VGGQ V+       
Sbjct: 318 ESDSSGSLVIGDDSSVYRNSTPIVYASMVSDPLQGPFYFVNLTGITVGGQEVESSGFSSG 377

Query: 364 MDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSV 423
                    I+D GT IT L    YN+++  F+          G ++ DTC++ +GLR V
Sbjct: 378 GGGG---KAIIDSGTVITSLVPSIYNAVKAEFLSQFAEYPQAPGFSILDTCFNMTGLREV 434

Query: 424 RVPTVSLHFGAGKALDLPAKN--YLIPVDSAGTFCFAFAPTSSAL--SIIGNVQQQGTRV 479
           +VP++ L F  G  +++ +    Y +  DS+   C A AP  S    +IIGN QQ+  RV
Sbjct: 435 QVPSLKLVFDGGVEVEVDSGGVLYFVSSDSS-QVCLAMAPLKSEYETNIIGNYQQKNLRV 493

Query: 480 SFDLANNRVGFTPNKC 495
            FD + ++VGF    C
Sbjct: 494 IFDTSGSQVGFAQETC 509


>gi|7715602|gb|AAF68120.1|AC010793_15 F20B17.14 [Arabidopsis thaliana]
 gi|12324588|gb|AAG52249.1|AC011717_17 putative aspartyl protease; 105611-106921 [Arabidopsis thaliana]
          Length = 436

 Score =  191 bits (486), Expect = 5e-46,   Method: Compositional matrix adjust.
 Identities = 137/420 (32%), Positives = 209/420 (49%), Gaps = 49/420 (11%)

Query: 99  RSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGE 158
           R+LVL     D+ RV +L  +L++        E   +E QI       P+ SG    S  
Sbjct: 41  RALVL-----DNIRVQSL--QLKIKAMTSSTTEQSVSETQI-------PLTSGIKLESLN 86

Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQ 218
           Y   + +G   +  S+++DTGSD+ W+QC+PC  CY Q  P++DP  SSSY  + C +  
Sbjct: 87  YIVTVELG--GKNMSLIVDTGSDLTWVQCQPCRSCYNQQGPLYDPSVSSSYKTVFCNSST 144

Query: 219 CKSL-----DVSACRANR------CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALG 267
           C+ L     +   C  N       C Y V+YGDGS+T GDL +E++  G++  ++    G
Sbjct: 145 CQDLVAATSNSGPCGGNNGVVKTPCEYVVSYGDGSYTRGDLASESILLGDT-KLENFVFG 203

Query: 268 CGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASGVLEFNS---- 320
           CG +N+GLF GS+GL+GLG   +SL  Q   T     +YCL   +  ASG L F +    
Sbjct: 204 CGRNNKGLFGGSSGLMGLGRSSVSLVSQTLKTFNGVFSYCLPSLEDGASGSLSFGNDSSV 263

Query: 321 -ARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTA 379
                     PL++N ++ +FY + LTG S+GG  V++  S F        GI++D GT 
Sbjct: 264 YTNSTSVSYTPLVQNPQLRSFYILNLTGASIGG--VELKSSSFGR------GILIDSGTV 315

Query: 380 ITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKAL- 438
           ITRL    Y +++  F++         G ++ DTC++ +    + +P + + F     L 
Sbjct: 316 ITRLPPSIYKAVKIEFLKQFSGFPTAPGYSILDTCFNLTSYEDISIPIIKMIFQGNAELE 375

Query: 439 -DLPAKNYLIPVDSAGTFCFAFAPTS--SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            D+    Y +  D A   C A A  S  + + IIGN QQ+  RV +D    R+G     C
Sbjct: 376 VDVTGVFYFVKPD-ASLVCLALASLSYENEVGIIGNYQQKNQRVIYDTTQERLGIVGENC 434


>gi|21595063|gb|AAM66069.1| putative aspartyl protease [Arabidopsis thaliana]
          Length = 484

 Score =  191 bits (486), Expect = 5e-46,   Method: Compositional matrix adjust.
 Identities = 137/420 (32%), Positives = 209/420 (49%), Gaps = 49/420 (11%)

Query: 99  RSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGE 158
           R+LVL     D+ RV +L  +L++        E   +E QI       P+ SG    S  
Sbjct: 89  RALVL-----DNIRVQSL--QLKIKAMTSSTTEQSVSETQI-------PLTSGIKLESLN 134

Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQ 218
           Y   + +G   +  S+++DTGSD+ W+QC+PC  CY Q  P++DP  SSSY  + C +  
Sbjct: 135 YIVTVELG--GKNMSLIVDTGSDLTWVQCQPCRSCYNQQGPLYDPSVSSSYKTVFCNSST 192

Query: 219 CKSL-----DVSACRANR------CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALG 267
           C+ L     +   C  N       C Y V+YGDGS+T GDL +E++  G++  ++    G
Sbjct: 193 CQDLVAATSNSGPCGGNNGVVKTPCEYVVSYGDGSYTRGDLASESILLGDT-KLENFVFG 251

Query: 268 CGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASGVLEFNS---- 320
           CG +N+GLF GS+GL+GLG   +SL  Q   T     +YCL   +  ASG L F +    
Sbjct: 252 CGRNNKGLFGGSSGLMGLGRSSVSLVSQTLKTFNGVFSYCLPSLEDGASGSLSFGNDSSV 311

Query: 321 -ARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTA 379
                     PL++N ++ +FY + LTG S+GG  V++  S F        GI++D GT 
Sbjct: 312 YTNSTSVSYTPLVQNPQLRSFYILNLTGASIGG--VELKSSSFGR------GILIDSGTV 363

Query: 380 ITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKAL- 438
           ITRL    Y +++  F++         G ++ DTC++ +    + +P + + F     L 
Sbjct: 364 ITRLPPSIYKAVKIEFLKQFSGFPTAPGYSILDTCFNLTSYEDISIPIIKMIFQGNAELE 423

Query: 439 -DLPAKNYLIPVDSAGTFCFAFAPTS--SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            D+    Y +  D A   C A A  S  + + IIGN QQ+  RV +D    R+G     C
Sbjct: 424 VDVTGVFYFVKPD-ASLVCLALASLSYENEVGIIGNYQQKNQRVIYDSTQERLGIVGENC 482


>gi|242092892|ref|XP_002436936.1| hypothetical protein SORBIDRAFT_10g011730 [Sorghum bicolor]
 gi|241915159|gb|EER88303.1| hypothetical protein SORBIDRAFT_10g011730 [Sorghum bicolor]
          Length = 469

 Score =  191 bits (485), Expect = 6e-46,   Method: Compositional matrix adjust.
 Identities = 148/410 (36%), Positives = 205/410 (50%), Gaps = 44/410 (10%)

Query: 106 LERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGV 165
           L RD AR N               H L+ A  + +    S P   GA   S +Y   +G 
Sbjct: 84  LRRDRARRN---------------HILRKASGRRITLGVSIPTSLGAFVDSLQYVVTLGF 128

Query: 166 GTPPRQFSMVLDTGSDINWLQCRPC--TECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLD 223
           GTP     +++DTGSD++W+QC+PC  + CY Q DP+FDP  SS+Y+P+PC +  C+ LD
Sbjct: 129 GTPAVPQVLLIDTGSDLSWVQCQPCNSSTCYPQKDPVFDPSASSTYAPVPCGSEACRDLD 188

Query: 224 V---------SACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGS--VKGIALGCGHDN 272
                     S+  A+ C Y + YG+G  TVG   TET++     +  V   + GCG   
Sbjct: 189 PDSYANGCTNSSSGASLCQYGIQYGNGDTTVGVYSTETLTLSPEAATVVNNFSFGCGLVQ 248

Query: 273 EGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASGVLEFNSARGGDAVTA 329
           +G+F    GLLGLGG   SL  Q   T   + +YCL   +S A  +     A GG+    
Sbjct: 249 KGVFDLFDGLLGLGGAPESLVSQTTGTYGGAFSYCLPAGNSTAGFLALGAPATGGNNTAG 308

Query: 330 PLIRNKKV--DTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQA 387
                 +V   TFY V LTG SVGG+ + I P++F       GG+I+D GT +T L   A
Sbjct: 309 FQFTPLQVVETTFYLVKLTGISVGGKQLDIEPTVFA------GGMIIDSGTIVTGLPETA 362

Query: 388 YNSLRDSF--VRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNY 445
           Y++LR +F     A  L P +     DTCYDF+G  +V VPTV+L F  G  +DL   + 
Sbjct: 363 YSALRTAFRSAMSAYPLLPPNDDEDLDTCYDFTGNTNVTVPTVALTFEGGVTIDLDVPSG 422

Query: 446 LIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           ++ +D  G   F    +     IIGNV Q+   V +D A   VGF    C
Sbjct: 423 VL-LD--GCLAFVAGASDGDTGIIGNVNQRTFEVLYDSARGHVGFRAGAC 469


>gi|326507654|dbj|BAK03220.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 442

 Score =  191 bits (485), Expect = 7e-46,   Method: Compositional matrix adjust.
 Identities = 126/360 (35%), Positives = 192/360 (53%), Gaps = 27/360 (7%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPC-TECYQQSDPIFDPKTSSSYSPLPC- 214
           GEY   + +GTPP  +  + DTGSD+ W QC PC ++C++Q+   ++P +S+++  LPC 
Sbjct: 86  GEYIMTLAIGTPPLSYPAIADTGSDLIWTQCAPCGSQCFKQAGQPYNPSSSTTFGVLPCN 145

Query: 215 -AAPQCKSL-DVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGS----VKGIALGC 268
            +   C +L   S      C+Y   YG G +T G    ET +FG++ +    V GIA GC
Sbjct: 146 SSVSMCAALAGPSPPPGCSCMYNQTYGTG-WTAGIQSVETFTFGSTPADQTRVPGIAFGC 204

Query: 269 GHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLV---DRDSPASGVLEFNSARGGD 325
            + +   + GSAGL+GLG G +SL  Q+ A   +YCL    D +S ++ +L  ++A  G 
Sbjct: 205 SNASSDDWNGSAGLVGLGRGSMSLVSQLGAGMFSYCLTPFQDANSTSTLLLGPSAALNGT 264

Query: 326 AV-TAPLIRNKK---VDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAIT 381
            V T P + +     + T+YY+ LTG S+G  A+ IPP+ F +   G GG+I+D GT IT
Sbjct: 265 GVLTTPFVASPSKAPMSTYYYLNLTGISIGTTALSIPPNAFALRTDGTGGLIIDSGTTIT 324

Query: 382 RLQTQAYNSLR---DSFVRLAGNLKPTSGVALFDTCYDFSGLRSV--RVPTVSLHFGAGK 436
            L   AY  +R   +S V L   +   S     D C+  +   S    +P+++ HF  G 
Sbjct: 325 SLVDAAYQQVRAAIESLVTLP--VADGSDSTGLDLCFALTSETSTPPSMPSMTFHFD-GA 381

Query: 437 ALDLPAKNYLIPVDSAGTFCFAFA-PTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            + LP  NY+I    +G +C A    T  A+S  GN QQQ   + +D+    + F P KC
Sbjct: 382 DMVLPVDNYMI--LGSGVWCLAMRNQTVGAMSTFGNYQQQNVHLLYDIHEETLSFAPAKC 439


>gi|15230868|ref|NP_189198.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|11994761|dbj|BAB03090.1| chloroplast nucleoid DNA binding protein-like; nucellin-like
           protein [Arabidopsis thaliana]
 gi|189339286|gb|ACD89063.1| At3g25700 [Arabidopsis thaliana]
 gi|332643533|gb|AEE77054.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 452

 Score =  191 bits (485), Expect = 7e-46,   Method: Compositional matrix adjust.
 Identities = 130/388 (33%), Positives = 186/388 (47%), Gaps = 47/388 (12%)

Query: 146 TPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDP-IFDPK 204
           +PVVSGA+ GSG+YF  + +G PP+   ++ DTGSD+ W++C  C  C   S   +F P+
Sbjct: 71  SPVVSGAASGSGQYFVDLRIGQPPQSLLLIADTGSDLVWVKCSACRNCSHHSPATVFFPR 130

Query: 205 TSSSYSPLPCAAPQCKSL----DVSACRANR----CLYQVAYGDGSFTVGDLVTETVSF- 255
            SS++SP  C  P C+ +        C   R    C Y+  Y DGS T G    ET S  
Sbjct: 131 HSSTFSPAHCYDPVCRLVPKPDRAPICNHTRIHSTCHYEYGYADGSLTSGLFARETTSLK 190

Query: 256 ---GNSGSVKGIALGCGHDNEGL------FVGSAGLLGLGGGMLSLTKQIK---ATSLAY 303
              G    +K +A GCG    G       F G+ G++GLG G +S   Q+        +Y
Sbjct: 191 TSSGKEARLKSVAFGCGFRISGQSVSGTSFNGANGVMGLGRGPISFASQLGRRFGNKFSY 250

Query: 304 CLVD---RDSPASGVLEFNSARGGDAVT----APLIRNKKVDTFYYVGLTGFSVGGQAVQ 356
           CL+D      P S ++  N   GGD ++     PL+ N    TFYYV L    V G  ++
Sbjct: 251 CLMDYTLSPPPTSYLIIGN---GGDGISKLFFTPLLTNPLSPTFYYVKLKSVFVNGAKLR 307

Query: 357 IPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSL-----RDSFVRLAGNLKPTSGVALF 411
           I PS++E+D++G+GG +VD GT +  L   AY S+     R   + +A  L P      F
Sbjct: 308 IDPSIWEIDDSGNGGTVVDSGTTLAFLAEPAYRSVIAAVRRRVKLPIADALTPG-----F 362

Query: 412 DTCYDFSGLRSVR--VPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSS--ALS 467
           D C + SG+      +P +   F  G     P +NY I  +     C A          S
Sbjct: 363 DLCVNVSGVTKPEKILPRLKFEFSGGAVFVPPPRNYFIETEEQ-IQCLAIQSVDPKVGFS 421

Query: 468 IIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           +IGN+ QQG    FD   +R+GF+   C
Sbjct: 422 VIGNLMQQGFLFEFDRDRSRLGFSRRGC 449


>gi|225427556|ref|XP_002266575.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
          Length = 445

 Score =  191 bits (485), Expect = 8e-46,   Method: Compositional matrix adjust.
 Identities = 139/417 (33%), Positives = 196/417 (47%), Gaps = 40/417 (9%)

Query: 106 LERDSAR---VNTLITKLQLAIYNVDRHELKPAEAQIL---PEDFSTPVVSGASQGSGEY 159
           + RDS R    N   TK Q       R  L+    + +   P D  + V+SG     G Y
Sbjct: 39  ISRDSPRSPFYNPSETKYQRLQKAFRRSILRGNHFRAIRASPNDIQSNVISGG----GSY 94

Query: 160 FSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQC 219
              I +GTPP     + DTGSD+ W QC PC +CY+Q +P+FDPK S +Y  L C    C
Sbjct: 95  LMNISLGTPPVSMLGIADTGSDLIWRQCLPCDDCYKQVEPLFDPKKSKTYKTLGCNNDFC 154

Query: 220 KSL--DVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNS----GSVKGIALGCGHDNE 273
           + L    S    N C    +YGD S+T  DL +ET + G++     S  G+A GCGH N 
Sbjct: 155 QDLGQQGSCGDDNTCTSSYSYGDQSYTRRDLSSETFTIGSTEGDPASFPGLAFGCGHSNG 214

Query: 274 GLF-----VGSAGLLGLGGGMLSLTKQIKATSLAYCLV--DRDSPASGVLEFNSA---RG 323
           G F            G    ++ L+ ++     +YCLV    DS AS  + F  +    G
Sbjct: 215 GTFNEKDSGLIGLGGGPLSLVMQLSSKVGG-QFSYCLVPLSSDSTASSKINFGKSAVVSG 273

Query: 324 GDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDE-----AGDGGIIVDCGT 378
              V+ PLI+    DTFYY+ L G S+G + V      F  ++     A +  II+D GT
Sbjct: 274 SGTVSTPLIKGTP-DTFYYLTLEGMSLGSEKVAFKG--FSKNKSSPAAAEESNIIIDSGT 330

Query: 379 AITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKAL 438
            +T L    Y  +  +  ++ G    T     F  CY  SG++ + +PT++ HF  G  +
Sbjct: 331 TLTLLPRDFYTDMESALTKVIGGQTTTDPRGTFSLCY--SGVKKLEIPTITAHF-IGADV 387

Query: 439 DLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            LP  N  +        CF+  P SS L+I GN+ Q    V +DL NN+V F P  C
Sbjct: 388 QLPPLNTFVQAQE-DLVCFSMIP-SSNLAIFGNLSQMNFLVGYDLKNNKVSFKPTDC 442


>gi|148910443|gb|ABR18297.1| unknown [Picea sitchensis]
          Length = 452

 Score =  191 bits (484), Expect = 9e-46,   Method: Compositional matrix adjust.
 Identities = 125/353 (35%), Positives = 186/353 (52%), Gaps = 27/353 (7%)

Query: 155 GSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPC 214
           GSGEY  ++  GTP +    ++DTGSD+ W+ C+ C  C+  + PIFDP  SSSY P  C
Sbjct: 111 GSGEYIIQVDFGTPKQSMYTLIDTGSDVAWIPCKQCQGCHSTA-PIFDPAKSSSYKPFAC 169

Query: 215 AAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHD--- 271
            +  C+ +  +    ++C ++V+YGDG+   G L ++ ++ G S  +   + GC      
Sbjct: 170 DSQPCQEISGNCGGNSKCQFEVSYGDGTQVDGTLASDAITLG-SQYLPNFSFGCAESLSE 228

Query: 272 --NEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGDAVTA 329
             +    +   G   L     + T ++   + +YCL     P+S     +   G +A  +
Sbjct: 229 DTSPSPGLMGLGGGSLSLLTQAPTAELFGGTFSYCL-----PSSSTSSGSLVLGKEAAVS 283

Query: 330 P-------LIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITR 382
                   LI++  + TFY+V L   SVG   + +P +    + A  GG I+D GT IT 
Sbjct: 284 SSSLKFTTLIKDPSIPTFYFVTLKAISVGNTRISVPGT----NIASGGGTIIDSGTTITH 339

Query: 383 LQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPA 442
           L   AY +LRD+F +   +L+PT  V   DTCYD S   SV VPT++LH      L LP 
Sbjct: 340 LVPSAYTALRDAFRQQLSSLQPTP-VEDMDTCYDLSS-SSVDVPTITLHLDRNVDLVLPK 397

Query: 443 KNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           +N LI  +S G  C AF+ T S  SIIGNVQQQ  R+ FD+ N++VGF   +C
Sbjct: 398 ENILITQES-GLACLAFSSTDSR-SIIGNVQQQNWRIVFDVPNSQVGFAQEQC 448


>gi|297720195|ref|NP_001172459.1| Os01g0608366 [Oryza sativa Japonica Group]
 gi|53792202|dbj|BAD52835.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
           Group]
 gi|255673454|dbj|BAH91189.1| Os01g0608366 [Oryza sativa Japonica Group]
          Length = 452

 Score =  191 bits (484), Expect = 9e-46,   Method: Compositional matrix adjust.
 Identities = 131/360 (36%), Positives = 185/360 (51%), Gaps = 36/360 (10%)

Query: 145 STPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPC---TECYQQSDPIF 201
           S P   G+S  + EY   +G+G+P     +V+DTGSD++W+QC PC   + C+  +  +F
Sbjct: 94  SVPTTLGSSLDTLEYVISVGLGSPAVTQRVVIDTGSDVSWVQCEPCPAPSPCHAHAGALF 153

Query: 202 DPKTSSSYSPLPCAAPQCKSL----DVSACRA-NRCLYQVAYGDGSFTVGDLVTETVSFG 256
           DP  SS+Y+   C+A  C  L    + + C A +RC Y V YGDGS T G   ++ ++  
Sbjct: 154 DPAASSTYAAFNCSAAACAQLGDSGEANGCDAKSRCQYIVKYGDGSNTTGTYSSDVLTLS 213

Query: 257 NSGSVKGIALGCGHDN--EGLFVGSAGLLGLGGGMLSLTKQIKA---TSLAYCLVDRDSP 311
            S  V+G   GC H     G+   + GL+GLGG   S   Q  A    S  YCL    +P
Sbjct: 214 GSDVVRGFQFGCSHAELGAGMDDKTDGLIGLGGDAQSPVSQTAARYGKSFFYCL--PATP 271

Query: 312 A-SGVLEFNSARGGDA------VTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEM 364
           A SG L   +   G         T P++R+KKV T+Y+  L   +VGG+ + + PS+F  
Sbjct: 272 ASSGFLTLGAPASGGGGGASRFATTPMLRSKKVPTYYFAALEDIAVGGKKLGLSPSVFAA 331

Query: 365 DEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVR 424
                 G +VD GT ITRL   AY +L  +F            + + DTC++F+GL  V 
Sbjct: 332 ------GSLVDSGTVITRLPPAAYAALSSAFRAGMTRYARAEPLGILDTCFNFTGLDKVS 385

Query: 425 VPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPT--SSALSIIGNVQQQGTRVSFD 482
           +PTV+L F  G  +DL A   +    S G  C AFAPT    A   IGNVQQ+   V +D
Sbjct: 386 IPTVALVFAGGAVVDLDAHGIV----SGG--CLAFAPTRDDKAFGTIGNVQQRTFEVLYD 439


>gi|356546372|ref|XP_003541600.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 433

 Score =  191 bits (484), Expect = 9e-46,   Method: Compositional matrix adjust.
 Identities = 134/412 (32%), Positives = 202/412 (49%), Gaps = 31/412 (7%)

Query: 103 LSRLERDSAR------VNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGS 156
           +  + RDS+R        T   ++  A++          ++ + P    T V+S      
Sbjct: 31  VEMIHRDSSRSPFFSPTETQFQRVANAVHRSINRANHLNQSFVSPNSPETTVISAL---- 86

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAA 216
           GEY     VGTP  Q   +LDTGSDI WLQC+PC +CY+Q+ PIFD   S +Y  LPC +
Sbjct: 87  GEYLISYSVGTPSLQVFGILDTGSDIIWLQCQPCKKCYEQTTPIFDSSKSQTYKTLPCPS 146

Query: 217 PQCKSLDVSACRANR-CLYQVAYGDGSFTVGDLVTETVSFGNSG----SVKGIALGCGHD 271
             C+S+  + C + + CLY + Y DGS ++GDL  ET++ G++        G  +GCG  
Sbjct: 147 NTCQSVQGTFCSSRKHCLYSIHYVDGSQSLGDLSVETLTLGSTNGSPVQFPGTVIGCGRY 206

Query: 272 NE-GLFVGSAGLLGLGGGMLSLTKQIKATS---LAYCLVDRDSPASGVLEFNSA---RGG 324
           N  G+   ++G++GLG G +SL  Q+  ++    +YCLV   S AS  L F +A    G 
Sbjct: 207 NAIGIEEKNSGIVGLGRGPMSLITQLSPSTGGKFSYCLVPGLSTASSKLNFGNAAVVSGR 266

Query: 325 DAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQ 384
             V+ PL  +K    FY++ L  FSVG   ++           G G II+D GT +T L 
Sbjct: 267 GTVSTPLF-SKNGLVFYFLTLEAFSVGRNRIEFG----SPGSGGKGNIIIDSGTTLTALP 321

Query: 385 TQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLR-SVRVPTVSLHFGAGKALDLPAK 443
              Y+ L  +  +     +      +   CY  +  +    VP ++ HF +G  + L A 
Sbjct: 322 NGVYSKLEAAVAKTVILQRVRDPNQVLGLCYKVTPDKLDASVPVITAHF-SGADVTLNAI 380

Query: 444 NYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           N  + V +    CFAF PT +  ++ GN+ QQ   V +DL  N V F    C
Sbjct: 381 NTFVQV-ADDVVCFAFQPTETG-AVFGNLAQQNLLVGYDLQMNTVSFKHTDC 430


>gi|125558622|gb|EAZ04158.1| hypothetical protein OsI_26300 [Oryza sativa Indica Group]
          Length = 435

 Score =  191 bits (484), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 122/355 (34%), Positives = 177/355 (49%), Gaps = 16/355 (4%)

Query: 154 QGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLP 213
            G G Y   I VGTP   F +V DTGSD+ W QC PCT+C+QQ  P F P +SS++S LP
Sbjct: 81  NGVGGYNMNISVGTPLLTFPVVADTGSDLIWTQCAPCTKCFQQPAPPFQPASSSTFSKLP 140

Query: 214 CAAPQCKSL--DVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHD 271
           C +  C+ L   +  C A  C+Y   YG G +T G L TET+  G++ S   +A GC  +
Sbjct: 141 CTSSFCQFLPNSIRTCNATGCVYNYKYGSG-YTAGYLATETLKVGDA-SFPSVAFGCSTE 198

Query: 272 NEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARG---GDAVT 328
           N G+   ++G+ GLG G LSL  Q+     +YCL    +  +  + F S      G+  +
Sbjct: 199 N-GVGNSTSGIAGLGRGALSLIPQLGVGRFSYCLRSGSAAGASPILFGSLANLTDGNVQS 257

Query: 329 APLIRNKKVD-TFYYVGLTGFSVGGQAVQIPPSLFEMDEAG-DGGIIVDCGTAITRLQTQ 386
            P + N  V  ++YYV LTG +VG   + +  S F   + G  GG IVD GT +T L   
Sbjct: 258 TPFVNNPAVHPSYYYVNLTGITVGETDLPVTTSTFGFTQNGLGGGTIVDSGTTLTYLAKD 317

Query: 387 AYNSLRDSFVRLAGNLKPTSGVALFDTCYDFS-GLRSVRVPTVSLHFGAGKALDLPAKNY 445
            Y  ++ +F+    N+   +G    D C+  + G   + VP++ L F  G    +P    
Sbjct: 318 GYEMVKQAFLSQTANVTTVNGTRGLDLCFKSTGGGGGIAVPSLVLRFDGGAEYAVPTYFA 377

Query: 446 LIPVDSAGTF---CFAFAPTSS--ALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            +  DS G+    C    P      +S+IGNV Q    + +DL      F+P  C
Sbjct: 378 GVETDSQGSVTVACLMMLPAKGDQPMSVIGNVMQMDMHLLYDLDGGIFSFSPADC 432


>gi|357124861|ref|XP_003564115.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 477

 Score =  190 bits (483), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 124/391 (31%), Positives = 180/391 (46%), Gaps = 50/391 (12%)

Query: 150 SGASQG--SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQ-SDPIFDPKTS 206
           +GA  G  + EY   + VGTPPR  ++ LDTGSD+ W QC PC  C+ Q + P+ DP  S
Sbjct: 83  AGAGGGIVTNEYLVHLSVGTPPRPVALTLDTGSDLVWTQCAPCLNCFDQGAIPVLDPAAS 142

Query: 207 SSYSPLPCAAPQCKSLDVSAC-------RANRCLYQVAYGDGSFTVGDLVTETVSFGNSG 259
           S+++ + C AP C++L  ++C           C+Y   YGD S TVG L ++  +FG   
Sbjct: 143 STHAAVRCDAPVCRALPFTSCGRGGSSWGERSCVYVYHYGDKSITVGKLASDRFTFGPGD 202

Query: 260 SVKG-------IALGCGHDNEGLF-VGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSP 311
           +  G       +  GCGH N+G+F     G+ G G G  SL  Q+  TS +YC       
Sbjct: 203 NADGGGVSERRLTFGCGHFNKGIFQANETGIAGFGRGRWSLPSQLGVTSFSYCFTSMFES 262

Query: 312 ASGVLEFNSARG-----GDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDE 366
            S ++    A       G   + PL+R+    + Y++ L   +VG   + IP     + E
Sbjct: 263 TSSLVTLGVAPAELHLTGQVQSTPLLRDPSQPSLYFLSLKAITVGATRIPIPERRQRLRE 322

Query: 367 AGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGV--ALFDTCYDFS------ 418
           A     I+D G +IT L    Y +++  FV   G   P S V  +  D C+         
Sbjct: 323 A---SAIIDSGASITTLPEDVYEAVKAEFVAQVG--LPVSAVEGSALDLCFALPSAAAPK 377

Query: 419 ---GLR--------SVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSA-- 465
              G R         VRVP +  H G G   +LP +NY+     A   C      +    
Sbjct: 378 SAFGWRWRGRGRAMPVRVPRLVFHLGGGADWELPRENYVFEDYGARVMCLVLDAATGGGD 437

Query: 466 -LSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
              +IGN QQQ T V +DL N+ + F P +C
Sbjct: 438 QTVVIGNYQQQNTHVVYDLENDVLSFAPARC 468


>gi|413921849|gb|AFW61781.1| hypothetical protein ZEAMMB73_702843 [Zea mays]
          Length = 379

 Score =  190 bits (483), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 110/310 (35%), Positives = 158/310 (50%), Gaps = 18/310 (5%)

Query: 105 RLERDSARVNTLITKLQLAIYNVDRHELKPAEAQ---ILPE--DFSTPVVSGASQGSGEY 159
           +L+       T  TKLQL    + R + + A  Q   +LP   D  T      +  SGEY
Sbjct: 30  QLKLTHVDAGTSYTKLQLLSRAIARSKARVAALQSAAVLPPVVDPITAARVLVTASSGEY 89

Query: 160 FSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQC 219
              + +GTPP  ++ ++DTGSD+ W QC PC  C  Q  P FD K S++Y  LPC + +C
Sbjct: 90  LVDLAIGTPPLYYTAIMDTGSDLIWTQCAPCLLCADQPTPYFDVKKSATYRALPCRSSRC 149

Query: 220 KSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVK----GIALGCGHDNEGL 275
            SL   +C    C+YQ  YGD + T G L  ET +FG + S K     IA GCG  N G 
Sbjct: 150 ASLSSPSCFKKMCVYQYYYGDTASTAGVLANETFTFGAANSTKVRATNIAFGCGSLNAGD 209

Query: 276 FVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEF---------NSARGGDA 326
              S+G++G G G LSL  Q+  +  +YCL    S     L F         N++ G   
Sbjct: 210 LANSSGMVGFGRGPLSLVSQLGPSRFSYCLTSYLSATPSRLYFGVYANLSSTNTSSGSPV 269

Query: 327 VTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQ 386
            + P + N  +   Y++ L   S+G + + I P +F +++ G GG+I+D GT+IT LQ  
Sbjct: 270 QSTPFVINPALPNMYFLSLKAISLGTKLLPIDPLVFAINDDGTGGVIIDSGTSITWLQQD 329

Query: 387 AYNSLRDSFV 396
           AY ++R   V
Sbjct: 330 AYEAVRRGLV 339


>gi|357137345|ref|XP_003570261.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 458

 Score =  190 bits (482), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 140/432 (32%), Positives = 209/432 (48%), Gaps = 34/432 (7%)

Query: 76  SSSSFSLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPA 135
           SSS  ++PL  R        +     +   L RD  R   +  KL +         ++ +
Sbjct: 49  SSSGTTVPLSHRHGPCSPAPSTVEPTMAELLRRDQLRAKYIQAKLSVN-SGSGTDGVQQS 107

Query: 136 EAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQ 195
            A  LP         G++  +  Y   + +GTP    ++++DTGSD++W+ C        
Sbjct: 108 AAITLPTTL------GSALDTLAYVITVSIGTPAMTQAVMIDTGSDVSWVHCH--ARAGA 159

Query: 196 QSDPIFDPKTSSSYSPLPCAAPQCKSLDV--SACRANR-CLYQVAYGDGSFTVGDLVTET 252
            S   FDP  SS+Y+P  C++  C  L+   + C  N  C Y V YGDGS T G   ++T
Sbjct: 160 GSSLFFDPGKSSTYTPFSCSSAACTRLEGRDNGCSLNSTCQYTVRYGDGSNTTGTYGSDT 219

Query: 253 VSFGNSGSVKGIALGCGHDN---EGLFVGSA-GLLGLGGGMLSLTKQIKAT---SLAYCL 305
           ++  ++  V+    GC   +   EGL      GL+GLGGG  SL  Q  AT   + +YCL
Sbjct: 220 LALNSTEKVENFQFGCSETSDPGEGLDEDQTDGLMGLGGGAPSLVSQTAATYGSAFSYCL 279

Query: 306 VDRDSPASGVLEFNSARGGDA-VTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEM 364
               + +SG L   ++ G    VT P+ R+++  TFY+V L G +VGG  V I P++F  
Sbjct: 280 -PATTRSSGFLTLGASTGTSGFVTTPMFRSRRAPTFYFVILQGINVGGDPVAISPTVFA- 337

Query: 365 DEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVR 424
                 G I+D GT ITRL  +AY++L  +F             ++ DTC+DF+G  +V 
Sbjct: 338 -----AGSIMDSGTIITRLPPRAYSALSAAFRAGMRRYPRARAFSILDTCFDFTGQDNVS 392

Query: 425 VPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSAL-SIIGNVQQQGTRVSFDL 483
           +P V L F  G  +DL A   +         C AFAP +  + SIIGNVQQ+   V  D+
Sbjct: 393 IPAVELVFSGGAVVDLDADGIMY------GSCLAFAPATGGIGSIIGNVQQRTFEVLHDV 446

Query: 484 ANNRVGFTPNKC 495
             + +GF P  C
Sbjct: 447 GQSVLGFRPGAC 458


>gi|224286159|gb|ACN40790.1| unknown [Picea sitchensis]
          Length = 452

 Score =  190 bits (482), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 128/353 (36%), Positives = 186/353 (52%), Gaps = 27/353 (7%)

Query: 155 GSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPC 214
           GSGEY  ++  GTP +    ++DTGSD+ W+ C+ C  C+  + PIFDP  SSSY P  C
Sbjct: 111 GSGEYIIQVDFGTPKQSMYTLIDTGSDVAWIPCKQCQGCHSTA-PIFDPAKSSSYKPFAC 169

Query: 215 AAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHD-NE 273
            +  C+ +  +    ++C ++V YGDG+   G L ++ ++ G S  +   + GC    +E
Sbjct: 170 DSQPCQEISGNCGGNSKCQFEVLYGDGTQVDGTLASDAITLG-SQYLPNFSFGCAESLSE 228

Query: 274 GLFVGSAGLLGLGGGMLSLTKQIKAT----SLAYCLVDRDSPASGVLEFNSARGGDAVTA 329
             +     +   GG +  LT+   A     + +YCL     P+S     +   G +A  +
Sbjct: 229 DTYSSPGLMGLGGGSLSLLTQAPTAELFGGTFSYCL-----PSSSTSSGSLVLGKEAAVS 283

Query: 330 P-------LIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITR 382
                   LI++    TFY+V L   SVG   + +P +    + A  GG I+D GT IT 
Sbjct: 284 SSSLKFTTLIKDPSFPTFYFVTLKAISVGNTRISVPAT----NIASGGGTIIDSGTTITY 339

Query: 383 LQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPA 442
           L   AY  LRD+F +   +L+PT  V   DTCYD S   SV VPT++LH      L LP 
Sbjct: 340 LVPSAYKDLRDAFRQQLSSLQPTP-VEDMDTCYDLSS-SSVDVPTITLHLDRNVDLVLPK 397

Query: 443 KNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           +N LI  +S G  C AF+ T S  SIIGNVQQQ  R+ FD+ N++VGF   +C
Sbjct: 398 ENILITQES-GLSCLAFSSTDSR-SIIGNVQQQNWRIVFDVPNSQVGFAQEQC 448


>gi|413938618|gb|AFW73169.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
          Length = 324

 Score =  189 bits (480), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 139/341 (40%), Positives = 183/341 (53%), Gaps = 36/341 (10%)

Query: 174 MVLDTGSDINWLQCRPCT---ECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRAN 230
           M +DTGSD++W+QC+PC     CY Q DP+FDP  SSSY+ +PC  P C  L + A  A 
Sbjct: 1   MEVDTGSDLSWVQCKPCAAAPSCYSQKDPLFDPAQSSSYAAVPCGGPVCAGLGIYAASAC 60

Query: 231 RCL---YQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGG 287
                 Y V+YGDGS T G   ++T++   S +V+G   GCGH   GLF G  GLLGLG 
Sbjct: 61  SAAQCGYVVSYGDGSNTTGVYSSDTLTLSASSAVQGFFFGCGHAQSGLFNGVDGLLGLGR 120

Query: 288 GMLSLTKQIKATS---LAYCLVDRDSPASGVLEFNSARGGDAVTAP------LIRNKKVD 338
              SL +Q   T     +YCL  + S A G L      GG +  AP      L+ +    
Sbjct: 121 EQPSLVEQTAGTYGGVFSYCLPTKPSTA-GYLTLG--VGGPSGAAPGFSTTQLLPSPNAP 177

Query: 339 TFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVR- 397
           T+Y V LTG SVGGQ + +P S F           VD GT +TRL   AY +LR +F   
Sbjct: 178 TYYVVMLTGISVGGQQLSVPASAFAGGTV------VDTGTVVTRLPPTAYAALRSAFRSG 231

Query: 398 LAGNLKPTS-GVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFC 456
           +A    PT+    + DTCY+F+G  +V +P V+L FG+G  + L A   L    S G  C
Sbjct: 232 MASYGYPTAPSNGILDTCYNFAGYGTVTLPNVALTFGSGATVTLGADGIL----SFG--C 285

Query: 457 FAFAPTSS--ALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            AFAP+ S   ++I+GNVQQ+   V  D     VGF P+ C
Sbjct: 286 LAFAPSGSDGGMAILGNVQQRSFEVRID--GTSVGFKPSSC 324


>gi|115472515|ref|NP_001059856.1| Os07g0532800 [Oryza sativa Japonica Group]
 gi|50508274|dbj|BAD32123.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
 gi|113611392|dbj|BAF21770.1| Os07g0532800 [Oryza sativa Japonica Group]
          Length = 436

 Score =  189 bits (480), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 122/356 (34%), Positives = 177/356 (49%), Gaps = 17/356 (4%)

Query: 154 QGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLP 213
            G G Y   I VGTP   FS+V DTGSD+ W QC PCT+C+QQ  P F P +SS++S LP
Sbjct: 81  NGVGGYNMNISVGTPLLTFSVVADTGSDLIWTQCAPCTKCFQQPAPPFQPASSSTFSKLP 140

Query: 214 CAAPQCKSL--DVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHD 271
           C +  C+ L   +  C A  C+Y   YG G +T G L TET+  G++ S   +A GC  +
Sbjct: 141 CTSSFCQFLPNSIRTCNATGCVYNYKYGSG-YTAGYLATETLKVGDA-SFPSVAFGCSTE 198

Query: 272 NEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARG---GDAVT 328
           N G+   ++G+ GLG G LSL  Q+     +YCL    +  +  + F S      G+  +
Sbjct: 199 N-GVGNSTSGIAGLGRGALSLIPQLGVGRFSYCLRSGSAAGASPILFGSLANLTDGNVQS 257

Query: 329 APLIRNKKVD-TFYYVGLTGFSVGGQAVQIPPSLFEMDEAG-DGGIIVDCGTAITRLQTQ 386
            P + N  V  ++YYV LTG +VG   + +  S F   + G  GG IVD GT +T L   
Sbjct: 258 TPFVNNPAVHPSYYYVNLTGITVGETDLPVTTSTFGFTQNGLGGGTIVDSGTTLTYLAKD 317

Query: 387 AYNSLRDSFVRLAGNLKPTSGVALFDTCYDFS--GLRSVRVPTVSLHFGAGKALDLPAKN 444
            Y  ++ +F+    ++   +G    D C+  +  G   + VP++ L F  G    +P   
Sbjct: 318 GYEMVKQAFLSQTADVTTVNGTRGLDLCFKSTGGGGGGIAVPSLVLRFDGGAEYAVPTYF 377

Query: 445 YLIPVDSAGTF---CFAFAPTS--SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
             +  DS G+    C    P      +S+IGNV Q    + +DL      F P  C
Sbjct: 378 AGVETDSQGSVTVACLMMLPAKGDQPMSVIGNVMQMDMHLLYDLDGGIFSFAPADC 433


>gi|297814776|ref|XP_002875271.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297321109|gb|EFH51530.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 451

 Score =  189 bits (479), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 127/383 (33%), Positives = 182/383 (47%), Gaps = 37/383 (9%)

Query: 146 TPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDP-IFDPK 204
           +PVVSGAS GSG+YF  + +G PP+   ++ DTGSD+ W++C  C  C   S   +F P+
Sbjct: 70  SPVVSGASSGSGQYFVDLRIGQPPQSLLLIADTGSDLVWVKCSACRNCSHHSPATVFFPR 129

Query: 205 TSSSYSPLPCAAPQCKSL----DVSACRANR----CLYQVAYGDGSFTVGDLVTETVSF- 255
            SS++SP  C  P C+ +        C   R    C Y+  Y DGS T G    ET S  
Sbjct: 130 HSSTFSPAHCYDPVCRLVPKPGRAPRCNHTRIHSTCPYEYGYADGSLTSGLFARETTSLK 189

Query: 256 ---GNSGSVKGIALGCGHDNEGL------FVGSAGLLGLGGGMLSLTKQIK---ATSLAY 303
              G    +K +A GCG    G       F G+ G++GLG G +S   Q+        +Y
Sbjct: 190 TSSGKEAKLKSVAFGCGFRISGQSVSGTSFNGANGVMGLGRGPISFASQLGRRFGNKFSY 249

Query: 304 CLVDRDSPASGVLEFNSARGGDAVT----APLIRNKKVDTFYYVGLTGFSVGGQAVQIPP 359
           CL+D               GGDAV+     PL+ N    TFYYV L    V G  ++I P
Sbjct: 250 CLMDYTLSPPPTSYLIIGDGGDAVSKLFFTPLLTNPLSPTFYYVKLKSVFVNGAKLRIDP 309

Query: 360 SLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSF---VRLAGNLKPTSGVALFDTCYD 416
           S++E+D++G+GG ++D GT +  L   AY  +  +    ++L    + T G   FD C +
Sbjct: 310 SIWEIDDSGNGGTVMDSGTTLAFLADPAYRLVIAAVKQRIKLPNADELTPG---FDLCVN 366

Query: 417 FSGLRSVR--VPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSS--ALSIIGNV 472
            SG+      +P +   F  G     P +NY I  +     C A          S+IGN+
Sbjct: 367 VSGVTKPEKILPRLKFEFSGGAVFVPPPRNYFIETEEQ-IQCLAIQSVDPKVGFSVIGNL 425

Query: 473 QQQGTRVSFDLANNRVGFTPNKC 495
            QQG    FD   +R+GF+   C
Sbjct: 426 MQQGFLFEFDRDRSRLGFSRRGC 448


>gi|356557014|ref|XP_003546813.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 435

 Score =  188 bits (478), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 127/404 (31%), Positives = 200/404 (49%), Gaps = 58/404 (14%)

Query: 117 ITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVL 176
           +++LQ   + +D ++L   E+ ++P+              GEY  R  +G+PP +   ++
Sbjct: 62  MSRLQRVSHFLDENKL--PESLLIPDK-------------GEYLMRFYIGSPPVERLAMV 106

Query: 177 DTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSA--C-RANRCL 233
           DTGS + WLQC PC  C+ Q  P+F+P  SS+Y    C +  C  L  S   C +  +C+
Sbjct: 107 DTGSSLIWLQCSPCHNCFPQETPLFEPLKSSTYKYATCDSQPCTLLQPSQRDCGKLGQCI 166

Query: 234 YQVAYGDGSFTVGDLVTETVSFGNSGSVKGIA-----LGCGHDNEGLFVGS---AGLLGL 285
           Y + YGD SF+VG L TET+SFG++G  + ++      GCG DN      S    G+ GL
Sbjct: 167 YGIMYGDKSFSVGILGTETLSFGSTGGAQTVSFPNTIFGCGVDNNFTIYTSNKVMGIAGL 226

Query: 286 GGGMLSLTKQIKAT---SLAYCLVDRDSPASGVLEFNSA---RGGDAVTAPLIRNKKVDT 339
           G G LSL  Q+ A      +YCL+  DS ++  L+F S         V+ PLI    + T
Sbjct: 227 GAGPLSLVSQLGAQIGHKFSYCLLPYDSTSTSKLKFGSEAIITTNGVVSTPLIIKPSLPT 286

Query: 340 FYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLA 399
           +Y++ L   ++G + V    +        DG I++D GT +T L+   YN+         
Sbjct: 287 YYFLNLEAVTIGQKVVSTGQT--------DGNIVIDSGTPLTYLENTFYNN-------FV 331

Query: 400 GNLKPTSGVALFD-------TCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSA 452
            +L+ T GV L         TC  F    ++ +P ++  F  G ++ L  KN LIP+  +
Sbjct: 332 ASLQETLGVKLLQDLPSPLKTC--FPNRANLAIPDIAFQF-TGASVALRPKNVLIPLTDS 388

Query: 453 GTFCFAFAPTSS-ALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
              C A  P+S   +S+ G++ Q   +V +DL   +V F P  C
Sbjct: 389 NILCLAVVPSSGIGISLFGSIAQYDFQVEYDLEGKKVSFAPTDC 432


>gi|302781668|ref|XP_002972608.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
 gi|300160075|gb|EFJ26694.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
          Length = 430

 Score =  188 bits (478), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 136/390 (34%), Positives = 199/390 (51%), Gaps = 43/390 (11%)

Query: 146 TPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCR----PCTECYQQS---D 198
           +P+ SGA  G G+Y   +  GTPP++  ++ DTGSD+ WLQC     P   C +++    
Sbjct: 41  SPMESGAFLGLGQYLVSMAFGTPPQEVLLIADTGSDLIWLQCSTTAAPPAFCPKKACSRR 100

Query: 199 PIFDPKTSSSYSPLPCAAPQCKSLDVSACRANR----------CLYQVAYGDGSFTVGDL 248
           P F    S++ S +PC+A QC  L V A R +           C Y   Y DGS T G L
Sbjct: 101 PAFVASKSATLSVVPCSAAQC--LLVPAPRGHGPSCSPAAPVPCGYAYDYADGSSTTGFL 158

Query: 249 V--TETVSFGNSG--SVKGIALGCGHDNEG-LFVGSAGLLGLGGGMLSLTKQ---IKATS 300
              T T+S G SG  +V+G+A GCG  N+G  F G+ G++GLG G LS   Q   + A +
Sbjct: 159 ARDTATISNGTSGGAAVRGVAFGCGTRNQGGSFSGTGGVIGLGQGQLSFPAQSGSLFAQT 218

Query: 301 LAYCLVD-----RDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAV 355
            +YCL+D     R   +S +      R       PL+ N    TFYYVG+    VG + +
Sbjct: 219 FSYCLLDLEGGRRGRSSSFLFLGRPERRAAFAYTPLVSNPLAPTFYYVGVVAIRVGNRVL 278

Query: 356 QIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALF---D 412
            +P S + +D  G+GG ++D G+ +T L+  AY  L  +F       +  S    F   +
Sbjct: 279 PVPGSEWAIDVLGNGGTVIDSGSTLTYLRLGAYLHLVSAFAASVHLPRIPSSATFFQGLE 338

Query: 413 TCYDFSGLRSVR-----VPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSS--A 465
            CY+ S   S+       P +++ F  G +L+LP  NYL+ V +    C A  PT S  A
Sbjct: 339 LCYNVSSSSSLAPANGGFPRLTIDFAQGLSLELPTGNYLVDV-ADDVKCLAIRPTLSPFA 397

Query: 466 LSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            +++GN+ QQG  V FD A+ R+GF   +C
Sbjct: 398 FNVLGNLMQQGYHVEFDRASARIGFARTEC 427


>gi|50508275|dbj|BAD32124.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
          Length = 451

 Score =  187 bits (476), Expect = 7e-45,   Method: Compositional matrix adjust.
 Identities = 132/365 (36%), Positives = 186/365 (50%), Gaps = 28/365 (7%)

Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCA 215
           +G Y   + +GTPP  FS++ DTGS + W QC PCTEC  +  P F P +SS++S LPCA
Sbjct: 87  AGAYNMNLSIGTPPVTFSVLADTGSSLIWTQCAPCTECAARPAPPFQPASSSTFSKLPCA 146

Query: 216 APQCKSLDVS--ACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNE 273
           +  C+ L      C A  C+Y   YG G FT G L TET+  G + S  G+A GC  +N 
Sbjct: 147 SSLCQFLTSPYLTCNATGCVYYYPYGMG-FTAGYLATETLHVGGA-SFPGVAFGCSTEN- 203

Query: 274 GLFVGSAGLLGLGGGMLSLTKQIKATSLAYCL-VDRDSPASGVLEFNSAR--GGDAVTAP 330
           G+   S+G++GLG   LSL  Q+     +YCL  D D+  S +L  + A+  GG+  + P
Sbjct: 204 GVGNSSSGIVGLGRSPLSLVSQVGVGRFSYCLRSDADAGDSPILFGSLAKVTGGNVQSTP 263

Query: 331 LIRNKKV--DTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGD----GGIIVDCGTAITRLQ 384
           L+ N ++   ++YYV LTG +VG   + +  + F           GG IVD GT +T L 
Sbjct: 264 LLENPEMPSSSYYYVNLTGITVGATDLPVTSTTFGFTRGAGAGLVGGTIVDSGTTLTYLV 323

Query: 385 TQAYNSLRDSFVRLAGNLKPTSGVA----LFDTCYDFS---GLRSVRVPTVSLHFGAGKA 437
            + Y  ++ +F+        T+ V      FD C+D +   G   V VPT+ L F  G  
Sbjct: 324 KEGYAMVKRAFLSQMATANLTTTVNGTRFGFDLCFDATAAGGGSGVPVPTLVLRFAGGAE 383

Query: 438 LDLPAKNY--LIPVDSAG---TFCFAFAPTSSAL--SIIGNVQQQGTRVSFDLANNRVGF 490
             +  ++Y  ++ VDS G     C    P S  L  SIIGNV Q    V +DL      F
Sbjct: 384 YAVRRRSYVGVVAVDSQGRAAVECLLVLPASEKLSISIIGNVMQMDLHVLYDLDGGMFSF 443

Query: 491 TPNKC 495
            P  C
Sbjct: 444 APADC 448


>gi|242050432|ref|XP_002462960.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
 gi|241926337|gb|EER99481.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
          Length = 445

 Score =  187 bits (474), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 139/437 (31%), Positives = 216/437 (49%), Gaps = 61/437 (13%)

Query: 93  TRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGA 152
           TR +   S+  S+  RD+ R             ++ RH  +    Q+     +   VS  
Sbjct: 33  TRIHADPSVTASQFVRDALR------------RDMHRHNAR----QLAASSSNGTTVSAP 76

Query: 153 SQGS---GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCT-ECYQQSDPIFDPKTSSS 208
           +Q S   GEY   + +GTPP  +  + DTGSD+ W QC PC+ +C+QQ  P+++P +S++
Sbjct: 77  TQISPTAGEYLMTLAIGTPPVSYQAIADTGSDLIWTQCAPCSSQCFQQPTPLYNPSSSTT 136

Query: 209 YSPLPCAAPQCKSLDVSACRAN----------RCLYQVAYGDGSFTVGDLVTETVSFG-- 256
           ++ LPC +       +S C A            C+Y + YG G  +V    +ET +FG  
Sbjct: 137 FAVLPCNS------SLSMCAAALAGTTPPPGCTCMYNMTYGSGWTSVYQ-GSETFTFGSS 189

Query: 257 ---NSGSVKGIALGCGHDNEGLFVGSA-GLLGLGGGMLSLTKQIKATSLAYCLV---DRD 309
              N   V GIA GC + + G    SA GL+GLG G LSL  Q+     +YCL    D +
Sbjct: 190 TPANQTGVPGIAFGCSNASGGFNTSSASGLVGLGRGSLSLVSQLGVPKFSYCLTPYQDTN 249

Query: 310 SPASGVLEFNSARG--GDAVTAPLI---RNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEM 364
           S ++ +L  +++    G   + P +    +  + T+YY+ LTG S+G  A+ IP +   +
Sbjct: 250 STSTLLLGPSASLNDTGGVSSTPFVASPSDAPMSTYYYLNLTGISLGTTALSIPTTALSL 309

Query: 365 DEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVAL---FDTCYDFSGLR 421
              G GG I+D GT IT L   AY  +R + V L   L  T G +     D C++     
Sbjct: 310 KADGTGGFIIDSGTTITLLGNTAYQQVRAAVVSLV-TLPTTDGGSAATGLDLCFELPSST 368

Query: 422 SV--RVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFA-PTSSALSIIGNVQQQGTR 478
           S    +P+++LHF  G  + LPA +Y++ +DS   +C A    T   +SI+GN QQQ   
Sbjct: 369 SAPPTMPSMTLHFD-GADMVLPADSYMM-LDS-NLWCLAMQNQTDGGVSILGNYQQQNMH 425

Query: 479 VSFDLANNRVGFTPNKC 495
           + +D+    + F P KC
Sbjct: 426 ILYDVGQETLTFAPAKC 442


>gi|224124882|ref|XP_002329972.1| predicted protein [Populus trichocarpa]
 gi|222871994|gb|EEF09125.1| predicted protein [Populus trichocarpa]
          Length = 332

 Score =  187 bits (474), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 114/339 (33%), Positives = 185/339 (54%), Gaps = 25/339 (7%)

Query: 174 MVLDTGSDINWLQCRPC-TECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSA-----C 227
           M+LDTGS ++WLQC+PC   C+ Q+DP++DP  S +Y  L CA+ +C  L  +      C
Sbjct: 1   MILDTGSSLSWLQCQPCAVYCHAQADPLYDPSVSKTYKKLSCASVECSRLKAATLNDPLC 60

Query: 228 R--ANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGL 285
              +N CLY  +YGD SF++G L  + ++  +S ++     GCG DN+GLF  +AG++GL
Sbjct: 61  ETDSNACLYTASYGDTSFSIGYLSQDLLTLTSSQTLPQFTYGCGQDNQGLFGRAAGIIGL 120

Query: 286 GGGMLSLTKQIKAT---SLAYCL--VDRDSPASGVLEFNSARGGDAVTAPLIRNKKVDTF 340
               LS+  Q+      + +YCL   +  S   G L   S         P++ + K  + 
Sbjct: 121 ARDKLSMLAQLSTKYGHAFSYCLPTANSGSSGGGFLSIGSISPTSYKFTPMLTDSKNPSL 180

Query: 341 YYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVR-LA 399
           Y++ LT  +V G+ + +  +++ +        ++D GT ITRL    Y +LR +FV+ ++
Sbjct: 181 YFLRLTAITVSGRPLDLAAAMYRVPT------LIDSGTVITRLPMSMYAALRQAFVKIMS 234

Query: 400 GNLKPTSGVALFDTCYDFSGLRSVR-VPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFA 458
                    ++ DTC+  S L+S+  VP + + F  G  L L A + LI  D  G  C A
Sbjct: 235 TKYAKAPAYSILDTCFKGS-LKSISAVPEIKMIFQGGADLTLRAPSILIEADK-GITCLA 292

Query: 459 FAPTS--SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           FA +S  + ++IIGN QQQ   +++D++ +R+GF P  C
Sbjct: 293 FAGSSGTNQIAIIGNRQQQTYNIAYDVSTSRIGFAPGSC 331


>gi|242069057|ref|XP_002449805.1| hypothetical protein SORBIDRAFT_05g023600 [Sorghum bicolor]
 gi|241935648|gb|EES08793.1| hypothetical protein SORBIDRAFT_05g023600 [Sorghum bicolor]
          Length = 430

 Score =  186 bits (472), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 125/360 (34%), Positives = 181/360 (50%), Gaps = 30/360 (8%)

Query: 155 GSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPC 214
           G  EY   + +GTPP  F  + DTGSD+ W QC+PC  C+ Q  PI+D  TSSS+SPLPC
Sbjct: 79  GQAEYLMELAIGTPPVPFIALADTGSDLTWTQCKPCKLCFGQDTPIYDTTTSSSFSPLPC 138

Query: 215 AAPQCKSLDVSACR--ANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDN 272
           ++  C  +  S C   +  C Y+ AY DG+++      E        SV GIA GCG DN
Sbjct: 139 SSATCLPIWSSRCSTPSATCRYRYAYDDGAYS-----PECAGI----SVGGIAFGCGVDN 189

Query: 273 EGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVD-RDSPASGVLEFNSARGGDAV---- 327
            GL   S G +GLG G LSL  Q+     +YCL D  ++  S  + F S     A     
Sbjct: 190 GGLSYNSTGTVGLGRGSLSLVAQLGVGKFSYCLTDFFNTSLSSPVFFGSLAELAASSASA 249

Query: 328 ------TAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEM-DEAGDGGIIVDCGTAI 380
                 + PL+++    + YYV L G S+G   + IP   F++ D+ G GG+IVD GT  
Sbjct: 250 DAAVVQSTPLVQSPYNPSRYYVSLEGISLGDARLPIPNGTFDLNDDDGSGGMIVDSGTIF 309

Query: 381 TRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDT-CY--DFSGLRSV-RVPTVSLHFGAGK 436
           T L    +  + D    + G  +P    +  D  C+    +G++ +  +P + LHF  G 
Sbjct: 310 TILVETGFRVVVDHVAGVLG--QPVVNASSLDRPCFPAPAAGVQELPDMPDMVLHFAGGA 367

Query: 437 ALDLPAKNYLIPVDSAGTFCFAFAPTSSAL-SIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            + L   NY+   +   +FC     T SA  S++GN QQQ  ++ FD+   ++ F P  C
Sbjct: 368 DMRLHRDNYMSFNEEESSFCLNIVGTESASGSVLGNFQQQNIQMLFDITVGQLSFMPTDC 427


>gi|326500408|dbj|BAK06293.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 475

 Score =  186 bits (471), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 150/422 (35%), Positives = 202/422 (47%), Gaps = 52/422 (12%)

Query: 102 VLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQIL--PEDFS--TPVVSGASQGSG 157
           +L  L  D  R   +  K      +V    L PA+ ++L    DF+  +P   G+  GS 
Sbjct: 78  LLEMLRWDQVRTEYVRRKASGGAEDV----LNPAKPRVLMSQTDFAVRSPFGVGSGSGSS 133

Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCT--ECYQQSDPIFDPKTSSSYSPLPCA 215
            +    G  T   Q +M +DT  D+ W+QC PC   +CY Q DP+FDP TSS+ + + C 
Sbjct: 134 AWIDADGDPTVVSQQTMAIDTTVDVPWIQCAPCPIPQCYPQRDPLFDPTTSSTAAAVRCR 193

Query: 216 APQCKSLDV--SACRANR-----CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGC 268
           +P C+SL    + C +NR     C Y + Y D   T G  +T+T++   + +V+    GC
Sbjct: 194 SPACRSLGPYGNGC-SNRSANAECRYLIEYSDDRATAGTYMTDTLTISGTTAVRNFRFGC 252

Query: 269 GHDNEGLFVG-SAGLLGLGGGMLSLTKQIK---ATSLAYCLVDRDSPASGVLEFNSARGG 324
            H   G F   +AG + LGGG  SL  Q       + +YC+    + ASG L      GG
Sbjct: 253 SHAVRGRFSDLTAGTMSLGGGAQSLLAQTARSLGNAFSYCV--PQASASGFLSI----GG 306

Query: 325 DA--------VTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDC 376
            A         T PL+R+    + Y V L G  V G+ + IPP  F        G ++D 
Sbjct: 307 PATTNSTTVFATTPLVRSAINPSLYLVRLQGIVVAGRRLGIPPVAFS------AGAVMDS 360

Query: 377 GTAITRLQTQAYNSLRDSFVRLAGNLKPTSGV-ALFDTCYDFSGLRSVRVPTVSLHFGAG 435
              IT+L   AY +LR +F R A    P SG     DTCYDF GL +VRVP VSL FG G
Sbjct: 361 SAVITQLPPTAYRALRRAF-RNAMRAYPRSGATGTLDTCYDFLGLTNVRVPAVSLVFGGG 419

Query: 436 KALDLPAKNYLIPVDSAGTFCFAFAPTSS--ALSIIGNVQQQGTRVSFDLANNRVGFTPN 493
             + L     +I        C AF  TSS  AL  IGNVQQQ   V +D+A   VGF   
Sbjct: 420 AVVVLDPPAVMI------GGCLAFTATSSDLALGFIGNVQQQTHEVLYDVAAGGVGFRRG 473

Query: 494 KC 495
            C
Sbjct: 474 AC 475


>gi|242079449|ref|XP_002444493.1| hypothetical protein SORBIDRAFT_07g022790 [Sorghum bicolor]
 gi|241940843|gb|EES13988.1| hypothetical protein SORBIDRAFT_07g022790 [Sorghum bicolor]
          Length = 449

 Score =  185 bits (470), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 120/362 (33%), Positives = 185/362 (51%), Gaps = 32/362 (8%)

Query: 163 IGVGTPPRQFSMVLDTGSDINWLQC-------RPCTECYQQSDPIFDPKTSSSYSPLPCA 215
           +G+GTPP+  ++++DTGSD+ W QC       R      +Q +P+++P+ SSS++ LPC+
Sbjct: 88  VGIGTPPQPRTLIVDTGSDLIWTQCSMLSRRTRTAASASRQREPLYEPRRSSSFAYLPCS 147

Query: 216 APQCKSLDVS---ACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVK-GIALGCGHD 271
              C+    S     R NRC+Y   YG      G L +ET +FG +  V   +  GCG  
Sbjct: 148 DRLCQEGQFSYKNCARNNRCMYDELYGSAE-AGGVLASETFTFGVNAKVSLPLGFGCGAL 206

Query: 272 NEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARG-------G 324
           + G  VG++GL+GL  G++SL  Q+     +YCL       +  L F +          G
Sbjct: 207 SAGDLVGASGLMGLSPGIMSLVSQLSVPRFSYCLTPFAERKTSPLLFGAMADLRRYRTTG 266

Query: 325 DAVTAPLIRNKKVDT-FYYVGLTGFSVGGQAVQIPP-SLFEMDEAGDGGIIVDCGTAITR 382
              T  ++RN  ++T +YYV L G S+G + + +P  SL  +   G GG IVD G+ ++ 
Sbjct: 267 TVQTTSILRNPAMETAYYYVPLVGLSLGTKRLDVPATSLGMIKPDGSGGTIVDSGSTMSY 326

Query: 383 LQTQAYNSLRDSFVRLAGNLKPTSGVAL----FDTCYDFS---GLRSVRVPTVSLHFGAG 435
           L+  A+ +++ + V  A  L   +G       ++ C+       + +V+ P + LHF  G
Sbjct: 327 LEETAFRAVKKAVVE-AVRLPVANGTDEDYDDYELCFALPTGVAMEAVKTPPLVLHFDGG 385

Query: 436 KALDLPAKNYLIPVDSAGTFCFAF--APTSSALSIIGNVQQQGTRVSFDLANNRVGFTPN 493
            A+ LP  NY      AG  C A   +P    +SIIGNVQQQ   V FD+ N +  F P 
Sbjct: 386 AAMTLPRDNYFQE-PRAGLMCLAVGTSPDGFGVSIIGNVQQQNMHVLFDVRNQKFSFAPT 444

Query: 494 KC 495
           KC
Sbjct: 445 KC 446


>gi|357140068|ref|XP_003571594.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 533

 Score =  185 bits (469), Expect = 6e-44,   Method: Compositional matrix adjust.
 Identities = 126/373 (33%), Positives = 182/373 (48%), Gaps = 32/373 (8%)

Query: 147 PVVSGASQGSGEYFSRIGVGTP-PRQFSMVLDTGSDINWLQCRPC--TECYQQSDPIFDP 203
           P+ SG    +  Y + I +G    +  ++++DTGSD+ W+QC PC  + CY Q DP+FDP
Sbjct: 168 PLGSGIRYQTLNYVTTIALGGGGAKNLTVIVDTGSDLTWVQCEPCPGSSCYAQRDPLFDP 227

Query: 204 KTSSSYSPLPCAAPQCKSLDVSACRA------------NRCLYQVAYGDGSFTVGDLVTE 251
             S +++ +PC +P C +    A  A             RC Y ++YGDGSF+ G L  +
Sbjct: 228 AASPTFAAVPCGSPACAASLKDATGAPGSCARSAGNSEQRCYYALSYGDGSFSRGVLAQD 287

Query: 252 TVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDR 308
           T+  G +  + G   GCG  N GLF G+AGL+GLG   LSL  Q  A      +YCL   
Sbjct: 288 TLGLGTTTKLDGFVFGCGLSNRGLFGGTAGLMGLGRTDLSLVSQTAARFGGVFSYCL-PA 346

Query: 309 DSPASGVLEFN---SARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMD 365
            + ++G L      S+   +     +I +     FY++ +TG +VGG A    P      
Sbjct: 347 TTTSTGSLSLGPGPSSSFPNMAYTRMIADPTQPPFYFINITGAAVGGGAALTAPGF---- 402

Query: 366 EAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRV 425
             G G ++VD GT ITRL    Y ++R  F R         G ++ D CYD +G   V V
Sbjct: 403 --GAGNVLVDSGTVITRLAPSVYKAVRAEFARRF-EYPAAPGFSILDACYDLTGRDEVNV 459

Query: 426 PTVSLHFGAGKALDLPAKNYLIPVDSAGT-FCFAFA--PTSSALSIIGNVQQQGTRVSFD 482
           P ++L    G  + + A   L  V   G+  C A A  P      IIGN QQ+  RV +D
Sbjct: 460 PLLTLTLEGGAQVTVDAAGMLFVVRKDGSQVCLAMASLPYEDQTPIIGNYQQRNKRVVYD 519

Query: 483 LANNRVGFTPNKC 495
              +R+GF    C
Sbjct: 520 TVGSRLGFADEDC 532


>gi|302780575|ref|XP_002972062.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
 gi|300160361|gb|EFJ26979.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
          Length = 429

 Score =  185 bits (469), Expect = 6e-44,   Method: Compositional matrix adjust.
 Identities = 136/390 (34%), Positives = 198/390 (50%), Gaps = 43/390 (11%)

Query: 146 TPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCR----PCTECYQQS---D 198
           +P+ SGA  G G+Y   +  GTPP++  ++ DTGSD+ WLQC     P   C +++    
Sbjct: 40  SPMESGAFLGLGQYLVSMAFGTPPQEVLLIADTGSDLIWLQCSTTAAPPAFCPKKACSRR 99

Query: 199 PIFDPKTSSSYSPLPCAAPQCKSLDVSACRANR----------CLYQVAYGDGSFTVGDL 248
           P F    S++ S +PC+A QC  L V A R +           C Y   Y DGS T G L
Sbjct: 100 PAFVASKSATLSVVPCSAAQC--LLVPAPRGHGPACSPAAPVPCGYAYDYADGSSTTGFL 157

Query: 249 V--TETVSFGNSG--SVKGIALGCGHDNEG-LFVGSAGLLGLGGGMLSLTKQ---IKATS 300
              T T+S G SG  +V+G+A GCG  N+G  F G+ G++GLG G LS   Q   + A +
Sbjct: 158 ARDTATISNGTSGGAAVRGVAFGCGTRNQGGSFSGTGGVIGLGQGQLSFPAQSGSLFAQT 217

Query: 301 LAYCLVD-----RDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAV 355
            +YCL+D     R   +S +      R       PL+ N    TFYYVG+    VG + +
Sbjct: 218 FSYCLLDLEGGRRGRSSSFLFLGRPERRAAFAYTPLVSNPLAPTFYYVGVVAIRVGNRVL 277

Query: 356 QIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALF---D 412
            +P S + +D  G+GG ++D G+ +T L+  AY  L  +F       +  S    F   +
Sbjct: 278 PVPGSEWAIDVLGNGGTVIDSGSTLTYLRLGAYLHLVSAFAASVHLPRIPSSATFFQGLE 337

Query: 413 TCYDFSGLRSVR-----VPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSS--A 465
            CY+ S   S        P +++ F  G +L+LP  NYL+ V +    C A  PT S  A
Sbjct: 338 LCYNVSSSSSSAPANGGFPRLTIDFAQGLSLELPTGNYLVDV-ADDVKCLAIRPTLSPFA 396

Query: 466 LSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            +++GN+ QQG  V FD A+ R+GF   +C
Sbjct: 397 FNVLGNLMQQGYHVEFDRASARIGFARTEC 426


>gi|219362525|ref|NP_001136612.1| uncharacterized protein LOC100216735 [Zea mays]
 gi|194696366|gb|ACF82267.1| unknown [Zea mays]
 gi|413953802|gb|AFW86451.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
          Length = 411

 Score =  184 bits (468), Expect = 7e-44,   Method: Compositional matrix adjust.
 Identities = 132/370 (35%), Positives = 190/370 (51%), Gaps = 36/370 (9%)

Query: 142 EDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCT--ECYQQSDP 199
           +  S P   G S  S EY  R+  GTP     +V+DTGSD++WLQC+PC+  +C+ Q DP
Sbjct: 62  KKVSVPAHLGTSVMSLEYVVRVSFGTPAVPQVVVIDTGSDVSWLQCKPCSSGQCFPQKDP 121

Query: 200 IFDPKTSSSYSPLPCAAPQCKSLDV----SACRANR-CLYQVAYGDGSFTVGDLVTETVS 254
           ++DP  SS+YS +PCA+  CK L      S C + + C + ++Y DG+ TVG    + ++
Sbjct: 122 LYDPSHSSTYSAVPCASDVCKKLAADAYGSGCTSGKQCGFAISYADGTSTVGAYSQDKLT 181

Query: 255 FGNSGSVKGIALGCGHDNE---GLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSP 311
                 V+    GCGH      GLF    G+LGLG    SL  +      +YCL    S 
Sbjct: 182 LAPGAIVQNFYFGCGHGKHAVRGLF---DGVLGLGRLRESLGARYGGV-FSYCLPSVSS- 236

Query: 312 ASGVLEFNSARGGDA-VTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDG 370
             G L   + +     V  P+       TF  V L G +VGG+ + + PS F       G
Sbjct: 237 KPGFLALGAGKNPSGFVFTPMGTVPGQPTFSTVTLAGINVGGKKLDLRPSAFS------G 290

Query: 371 GIIVDCGTAITRLQTQAYNSLRDSFVRL--AGNLKPTSGVALFDTCYDFSGLRSVRVPTV 428
           G+IVD GT IT LQ+ AY +LR +F +   A  L P   +   DTCY+ +G ++V VP +
Sbjct: 291 GMIVDSGTVITGLQSTAYRALRSAFRKAMEAYRLLPNGDL---DTCYNLTGYKNVVVPKI 347

Query: 429 SLHFGAGKALDLPAKNYLIPVDSAGTFCFAFA---PTSSALSIIGNVQQQGTRVSFDLAN 485
           +L F  G  ++L   N ++ V+     C AFA   P  SA  ++GNV Q+   V FD + 
Sbjct: 348 ALTFTGGATINLDVPNGIL-VNG----CLAFAESGPDGSA-GVLGNVNQRAFEVLFDTST 401

Query: 486 NRVGFTPNKC 495
           ++ GF    C
Sbjct: 402 SKFGFRAKAC 411


>gi|194696934|gb|ACF82551.1| unknown [Zea mays]
 gi|413936470|gb|AFW71021.1| hypothetical protein ZEAMMB73_589717 [Zea mays]
 gi|413953801|gb|AFW86450.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
          Length = 445

 Score =  184 bits (468), Expect = 8e-44,   Method: Compositional matrix adjust.
 Identities = 132/370 (35%), Positives = 190/370 (51%), Gaps = 36/370 (9%)

Query: 142 EDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCT--ECYQQSDP 199
           +  S P   G S  S EY  R+  GTP     +V+DTGSD++WLQC+PC+  +C+ Q DP
Sbjct: 96  KKVSVPAHLGTSVMSLEYVVRVSFGTPAVPQVVVIDTGSDVSWLQCKPCSSGQCFPQKDP 155

Query: 200 IFDPKTSSSYSPLPCAAPQCKSLDV----SACRANR-CLYQVAYGDGSFTVGDLVTETVS 254
           ++DP  SS+YS +PCA+  CK L      S C + + C + ++Y DG+ TVG    + ++
Sbjct: 156 LYDPSHSSTYSAVPCASDVCKKLAADAYGSGCTSGKQCGFAISYADGTSTVGAYSQDKLT 215

Query: 255 FGNSGSVKGIALGCGHDNE---GLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSP 311
                 V+    GCGH      GLF    G+LGLG    SL  +      +YCL    S 
Sbjct: 216 LAPGAIVQNFYFGCGHGKHAVRGLF---DGVLGLGRLRESLGARYGGV-FSYCLPSVSS- 270

Query: 312 ASGVLEFNSARGGDA-VTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDG 370
             G L   + +     V  P+       TF  V L G +VGG+ + + PS F       G
Sbjct: 271 KPGFLALGAGKNPSGFVFTPMGTVPGQPTFSTVTLAGINVGGKKLDLRPSAFS------G 324

Query: 371 GIIVDCGTAITRLQTQAYNSLRDSFVRL--AGNLKPTSGVALFDTCYDFSGLRSVRVPTV 428
           G+IVD GT IT LQ+ AY +LR +F +   A  L P   +   DTCY+ +G ++V VP +
Sbjct: 325 GMIVDSGTVITGLQSTAYRALRSAFRKAMEAYRLLPNGDL---DTCYNLTGYKNVVVPKI 381

Query: 429 SLHFGAGKALDLPAKNYLIPVDSAGTFCFAFA---PTSSALSIIGNVQQQGTRVSFDLAN 485
           +L F  G  ++L   N ++ V+     C AFA   P  SA  ++GNV Q+   V FD + 
Sbjct: 382 ALTFTGGATINLDVPNGIL-VNG----CLAFAESGPDGSA-GVLGNVNQRAFEVLFDTST 435

Query: 486 NRVGFTPNKC 495
           ++ GF    C
Sbjct: 436 SKFGFRAKAC 445


>gi|225436202|ref|XP_002271145.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
           vinifera]
          Length = 440

 Score =  184 bits (468), Expect = 8e-44,   Method: Compositional matrix adjust.
 Identities = 134/385 (34%), Positives = 182/385 (47%), Gaps = 31/385 (8%)

Query: 129 RHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCR 188
           R  +  +EA I P     PV    S  +GEY  +I +GTPP     + DTGSD+ W QC 
Sbjct: 65  RRFMSFSEASISPNTPEPPV----SSNNGEYLMKISIGTPPFDVYGIYDTGSDLMWTQCL 120

Query: 189 PCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANR--CLYQVAYGDGSFTVG 246
           PC  CY+Q +P+FDP  S+S+  + C + QC+ LD  +C   +  C +   YGDGS   G
Sbjct: 121 PCLSCYKQKNPMFDPSKSTSFKEVSCESQQCRLLDTVSCSQPQKLCDFSYGYGDGSLAQG 180

Query: 247 DLVTETVSFG-NSG---SVKGIALGCGHDNEGLF-VGSAGLLGLGGGMLSLTKQIKAT-- 299
            + TET++   NSG   S+  I  GCGH+N G F     GL G GG  LSLT QI +T  
Sbjct: 181 VIATETLTLNSNSGQPTSILNIVFGCGHNNSGTFNENEMGLFGTGGRPLSLTSQIMSTLG 240

Query: 300 ---SLAYCLVD-RDSPA--SGVLEFNSAR--GGDAVTAPLIRNKKVDTFYYVGLTGFSVG 351
                + CLV  R  P+  S ++    A   G D V+ PL+  K   T+Y+V L G SVG
Sbjct: 241 SGRKFSQCLVPFRTDPSITSKIIFGPEAEVSGSDVVSTPLV-TKDDPTYYFVTLDGISVG 299

Query: 352 GQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALF 411
            +     P       A  G + +D GT  T L    YN L    V+ A  ++P     L 
Sbjct: 300 DKLF---PFSSSSPMATKGNVFIDAGTPPTLLPRDFYNRLVQG-VKEAIPMEPVQDPDLQ 355

Query: 412 -DTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIG 470
              CY  + L  +  P ++ HF        P   ++ P +  G +CFA  P      I G
Sbjct: 356 PQLCYRSATL--IDGPILTAHFDGADVQLKPLNTFISPKE--GVYCFAMQPIDGDTGIFG 411

Query: 471 NVQQQGTRVSFDLANNRVGFTPNKC 495
           N  Q    + FDL   +V F    C
Sbjct: 412 NFVQMNFLIGFDLDGKKVSFKAVDC 436


>gi|312283333|dbj|BAJ34532.1| unnamed protein product [Thellungiella halophila]
          Length = 428

 Score =  184 bits (468), Expect = 8e-44,   Method: Compositional matrix adjust.
 Identities = 135/367 (36%), Positives = 182/367 (49%), Gaps = 21/367 (5%)

Query: 140 LPEDFSTPVVSG-ASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD 198
           L +  S P+ SG A   S  Y  R  +GTP +   + LDT +D  W+ C  C  C   S 
Sbjct: 71  LAKKPSVPIASGRAIVQSPTYIVRANIGTPAQPMLVALDTSNDAAWVPCSGCVGC--ASS 128

Query: 199 PIFDPKTSSSYSPLPCAAPQCKSLDVSACRANR-CLYQVAYGDGSFTVGDLVTETVSFGN 257
            +FDP  SSS   L C APQCK      C A + C + + YG GS     L  +T++  N
Sbjct: 129 VLFDPSKSSSSRNLQCDAPQCKQAPNPTCTAGKSCGFNMTYG-GSTIEASLTQDTLTLAN 187

Query: 258 SGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSL---TKQIKATSLAYCLVD-RDSPAS 313
              +K    GC     G  + + GL+GLG G LSL   T+ +  ++ +YCL + + S  S
Sbjct: 188 D-VIKSYTFGCISKATGTSLPAQGLMGLGRGPLSLISQTQNLYMSTFSYCLPNSKSSNFS 246

Query: 314 GVLEFNSARGGDAV-TAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGI 372
           G L          + T PL++N +  + YYV L G  VG + V IP S    D +   G 
Sbjct: 247 GSLRLGPKYQPVRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDASTGAGT 306

Query: 373 IVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHF 432
           I D GT  TRL   AY ++R+ F R   N   TS +  FDTCY  SG  SV  P+V+  F
Sbjct: 307 IFDSGTVFTRLVEPAYVAVRNEFRRRIKNANATS-LGGFDTCY--SG--SVVYPSVTFMF 361

Query: 433 GAGKALDLPAKNYLIPVDSAGTFCFAFAP----TSSALSIIGNVQQQGTRVSFDLANNRV 488
            AG  + LP  N LI   S  T C A A      +S L++I ++QQQ  RV  DL N+R+
Sbjct: 362 -AGMNVTLPPDNLLIHSSSGSTSCLAMAAAPNNVNSVLNVIASMQQQNHRVLIDLPNSRL 420

Query: 489 GFTPNKC 495
           G +   C
Sbjct: 421 GISRETC 427


>gi|225427552|ref|XP_002266498.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
          Length = 441

 Score =  184 bits (466), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 125/356 (35%), Positives = 174/356 (48%), Gaps = 22/356 (6%)

Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCA 215
           +GEY   + +GTPP     ++DTGSD+ W QCRPCT CY+Q  P FDPK SS+Y    C 
Sbjct: 89  AGEYIMNLSIGTPPVPVIAIVDTGSDLTWTQCRPCTHCYKQVVPFFDPKNSSTYRDSSCG 148

Query: 216 APQCKSL--DVSACRANRCLYQVAYGDGSFTVGDLVTETVSF----GNSGSVKGIALGCG 269
              C +L  D S     +C +  +Y DGSFT G+L  ET++     G   S  G A GC 
Sbjct: 149 TSFCLALGNDRSCRNGKKCTFMYSYADGSFTGGNLAVETLTVASTAGKPVSFPGFAFGCV 208

Query: 270 HDNEGLF-VGSAGLLGLGGGMLSLTKQIKAT---SLAYCL--VDRDSPASGVLEFNSA-- 321
           H + G+F   S+G++GLG   LS+  Q+K+T     +YCL  V  DS  S  + F  +  
Sbjct: 209 HRSGGIFDEHSSGIVGLGVAELSMISQLKSTINGRFSYCLLPVFTDSSMSSRINFGRSGI 268

Query: 322 -RGGDAVTAPLIRNKKVDTFYY-VGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTA 379
             G   V+ PL+  K  DT+YY + L GFSVG + +       +  E  +G IIVD GT 
Sbjct: 269 VSGAGTVSTPLVM-KGPDTYYYLITLEGFSVGKKRLSY-KGFSKKAEVEEGNIIVDSGTT 326

Query: 380 ITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALD 439
            T L  + Y  L +S        +      +   CY+ + +  +  P ++ HF       
Sbjct: 327 YTYLPLEFYVKLEESVAHSIKGKRVRDPNGISSLCYN-TTVDQIDAPIITAHFKDANVEL 385

Query: 440 LPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            P   +L   +     CF   PTS  + I+GN+ Q    V FDL   RV F    C
Sbjct: 386 QPWNTFLRMQEDL--VCFTVLPTSD-IGILGNLAQVNFLVGFDLRKKRVSFKAADC 438


>gi|293332561|ref|NP_001170100.1| uncharacterized protein LOC100384018 precursor [Zea mays]
 gi|224033441|gb|ACN35796.1| unknown [Zea mays]
 gi|413944035|gb|AFW76684.1| hypothetical protein ZEAMMB73_746438 [Zea mays]
          Length = 456

 Score =  184 bits (466), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 102/301 (33%), Positives = 151/301 (50%), Gaps = 29/301 (9%)

Query: 148 VVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSS 207
           V +     + EY   + VGTPPR  ++ LDTGSD+ W QC PC +C+ Q  P+ DP  SS
Sbjct: 75  VAAAGGIATNEYLVHLAVGTPPRPVALTLDTGSDLVWTQCAPCRDCFDQGIPLLDPAASS 134

Query: 208 SYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKG---- 263
           +Y+ LPC AP+C++L  ++C    C+Y   YGD S TVG + T+  +FG++G   G    
Sbjct: 135 TYAALPCGAPRCRALPFTSCGGRSCVYVYHYGDKSVTVGKIATDRFTFGDNGRRNGDGSL 194

Query: 264 -----IALGCGHDNEGLFVGS-AGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLE 317
                +  GCGH N+G+F  +  G+ G G G  SL  Q+ ATS +YC        S ++ 
Sbjct: 195 PATRRLTFGCGHFNKGVFQSNETGIAGFGRGRWSLPSQLNATSFSYCFTSMFDSKSSIVT 254

Query: 318 --------FNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGD 369
                   ++ A  G+  T PL +N    + Y++ L G SVG   + +P + F       
Sbjct: 255 LGGAPAALYSHAHSGEVRTTPLFKNPSQPSLYFLSLKGISVGKTRLPVPETKFR------ 308

Query: 370 GGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGV--ALFDTCYDFSGLRSVRVPT 427
              I+D G +IT L  + Y +++  F    G   P SGV  +  D C+        R P 
Sbjct: 309 -STIIDSGASITTLPEEVYEAVKAEFAAQVG--LPPSGVEGSALDVCFALPVSALWRRPA 365

Query: 428 V 428
           V
Sbjct: 366 V 366


>gi|242081367|ref|XP_002445452.1| hypothetical protein SORBIDRAFT_07g019450 [Sorghum bicolor]
 gi|241941802|gb|EES14947.1| hypothetical protein SORBIDRAFT_07g019450 [Sorghum bicolor]
          Length = 459

 Score =  183 bits (465), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 114/350 (32%), Positives = 171/350 (48%), Gaps = 21/350 (6%)

Query: 163 IGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCK-- 220
           +GVGTPP+   ++LD GSD+ W QC       +Q +P+FD   SSS+S LPC +  C+  
Sbjct: 111 VGVGTPPQPSKVILDLGSDLLWTQCSLVGPTAKQLEPVFDAARSSSFSVLPCDSKLCEAG 170

Query: 221 SLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFG-NSGSVKGIALGCGHDNEGLFVGS 279
           +     C   +C Y+  YG  + T G L TET +FG + G    +  GCG    G    +
Sbjct: 171 TFTNKTCTDRKCAYENDYGIMTAT-GVLATETFTFGAHHGVSANLTFGCGKLANGTIAEA 229

Query: 280 AGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARG-------GDAVTAPLI 332
           +G+LGL  G LS+ KQ+  T  +YCL       +  + F +          G   T PL+
Sbjct: 230 SGILGLSPGPLSMLKQLAITKFSYCLTPFADRKTSPVMFGAMADLGKYKTTGKVQTIPLL 289

Query: 333 RNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLR 392
           +N   D +YYV + G SVG + + +P     +   G GG ++D  T +  L   A+  L+
Sbjct: 290 KNPVEDIYYYVPMVGMSVGSKRLDVPQETLAIKPDGTGGTVLDSATTLAYLVEPAFTELK 349

Query: 393 DSFVRLAGNLKPTSGVALFD--TCYDFS---GLRSVRVPTVSLHFGAGKALDLPAKNYLI 447
            +   + G   P +  ++ D   C++      +  V+VP + LHF     + LP  NY  
Sbjct: 350 KAV--MEGIKLPVANRSVDDYPVCFELPRGMSMEGVQVPPLVLHFDGDAEMSLPRDNYFQ 407

Query: 448 PVDSAGTFCFAF--APTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
              S G  C A   AP   A ++IGNVQQQ   V +D+ N +  + P KC
Sbjct: 408 E-PSPGMMCLAVMQAPFEGAPNVIGNVQQQNMHVLYDVGNRKFSYAPTKC 456


>gi|115451209|ref|NP_001049205.1| Os03g0186900 [Oryza sativa Japonica Group]
 gi|49532749|dbj|BAD26705.1| Radc1 [Oryza sativa Japonica Group]
 gi|108706569|gb|ABF94364.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113547676|dbj|BAF11119.1| Os03g0186900 [Oryza sativa Japonica Group]
 gi|215692805|dbj|BAG88249.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215767626|dbj|BAG99854.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 438

 Score =  183 bits (464), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 127/378 (33%), Positives = 176/378 (46%), Gaps = 34/378 (8%)

Query: 145 STPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPK 204
           S PV SG  Q    Y  R G+G+P +Q  + LDT +D  W  C PC  C   S  +F P 
Sbjct: 67  SAPVASG--QAPPSYVVRAGLGSPSQQLLLALDTSADATWAHCSPCGTCPSSS--LFAPA 122

Query: 205 TSSSYSPLPCAAPQCKSLDVSACRANR--------------CLYQVAYGDGSFTVGDLVT 250
            SSSY+ LPC++  C      AC A +              C +   + D SF    L +
Sbjct: 123 NSSSYASLPCSSSWCPLFQGQACPAPQGGGDAAPPPATLPTCAFSKPFADASFQAA-LAS 181

Query: 251 ETVSFGNSGSVKGIALGCGHDNEG--LFVGSAGLLGLGGGMLSLTKQIKATS---LAYCL 305
           +T+  G   ++     GC     G    +   GLLGLG G ++L  Q  +      +YCL
Sbjct: 182 DTLRLGKD-AIPNYTFGCVSSVTGPTTNMPRQGLLGLGRGPMALLSQAGSLYNGVFSYCL 240

Query: 306 VD-RDSPASGVLEFNSARGGDAVTA---PLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSL 361
              R    SG L    A GG   +    P++RN    + YYV +TG SVG   V++P   
Sbjct: 241 PSYRSYYFSGSLRLG-AGGGQPRSVRYTPMLRNPHRSSLYYVNVTGLSVGHAWVKVPAGS 299

Query: 362 FEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLR 421
           F  D A   G +VD GT ITR     Y +LR+ F R        + +  FDTC++   + 
Sbjct: 300 FAFDAATGAGTVVDSGTVITRWTAPVYAALREEFRRQVAAPSGYTSLGAFDTCFNTDEVA 359

Query: 422 SVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAP----TSSALSIIGNVQQQGT 477
           +   P V++H   G  L LP +N LI   +    C A A      +S +++I N+QQQ  
Sbjct: 360 AGGAPAVTVHMDGGVDLALPMENTLIHSSATPLACLAMAEAPQNVNSVVNVIANLQQQNI 419

Query: 478 RVSFDLANNRVGFTPNKC 495
           RV FD+AN+RVGF    C
Sbjct: 420 RVVFDVANSRVGFAKESC 437


>gi|356540371|ref|XP_003538663.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 374

 Score =  182 bits (463), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 120/354 (33%), Positives = 171/354 (48%), Gaps = 21/354 (5%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAA 216
           G Y   + +GTPP +   + DTGSD+ W  C PC +CY+Q +PIFDP+ S+SY  + C +
Sbjct: 23  GHYLMEVSIGTPPFKIYGIADTGSDLTWTSCVPCNKCYKQRNPIFDPQKSTSYRNISCDS 82

Query: 217 PQCKSLDVSACRANR-CLYQVAYGDGSFTVGDLVTETVSF----GNSGSVKGIALGCGHD 271
             C  LD   C   + C Y  AY   + T G L  ET++     G S  +KGI  GCGH+
Sbjct: 83  KLCHKLDTGVCSPQKHCNYTYAYASAAITQGVLAQETITLSSTKGESVPLKGIVFGCGHN 142

Query: 272 NEGLFVG-SAGLLGLGGGMLSLTKQIKAT----SLAYCLVDRDSPASGVLEFNSARGGD- 325
           N G F     G++GLGGG +S   QI ++      + CLV   +  S   + +  +G + 
Sbjct: 143 NTGGFNDREMGIIGLGGGPVSFISQIGSSFGGKRFSQCLVPFHTDVSVSSKMSLGKGSEV 202

Query: 326 ----AVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAIT 381
                V+ PL+  K+  T Y+V L G SVG   +    S  +  E G+  + +D GT  T
Sbjct: 203 SGKGVVSTPLVA-KQDKTPYFVTLLGISVGNTYLHFNGSSSQSVEKGN--VFLDSGTPPT 259

Query: 382 RLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLP 441
            L TQ Y+ L    VR    +KP +          +    ++R P ++ HF  G    LP
Sbjct: 260 ILPTQLYDRLVAQ-VRSEVAMKPVTNDLDLGPQLCYRTKNNLRGPVLTAHFEGGDVKLLP 318

Query: 442 AKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            + ++ P D  G FC  F  TSS   + GN  Q    + FDL    V F P  C
Sbjct: 319 TQTFVSPKD--GVFCLGFTNTSSDGGVYGNFAQSNYLIGFDLDRQVVSFKPMDC 370


>gi|356546378|ref|XP_003541603.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 439

 Score =  182 bits (462), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 128/359 (35%), Positives = 182/359 (50%), Gaps = 23/359 (6%)

Query: 152 ASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSP 211
           ASQG  EY  R  VG+PP Q   ++DTGSDI WLQC PC +CY+Q+ PIFDP  S +Y  
Sbjct: 86  ASQG--EYLMRYSVGSPPFQVLGIVDTGSDILWLQCEPCEDCYKQTTPIFDPSKSKTYKT 143

Query: 212 LPCAAPQCKSLDVSACRA-NRCLYQVAYGDGSFTVGDLVTETVSF----GNSGSVKGIAL 266
           LPC++  C+SL  +AC + N C Y + YGDGS + GDL  ET++     G+S       +
Sbjct: 144 LPCSSNTCESLRNTACSSDNVCEYSIDYGDGSHSDGDLSVETLTLGSTDGSSVHFPKTVI 203

Query: 267 GCGHDNEGLFVGSAGLLGLGG----GMLSLTKQIKATSLAYCL--VDRDSPASGVLEFNS 320
           GCGH+N G F      +   G     ++S          +YCL  +  +S +S  L F  
Sbjct: 204 GCGHNNGGTFQEEGSGIVGLGGGPVSLISQLSSSIGGKFSYCLAPIFSESNSSSKLNFGD 263

Query: 321 A---RGGDAVTAPLI-RNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDC 376
           A    G   V+ PL   N +V  FY++ L  FSVG   ++   S      +GDG II+D 
Sbjct: 264 AAVVSGRGTVSTPLDPLNGQV--FYFLTLEAFSVGDNRIEFSGSSSSGSGSGDGNIIIDS 321

Query: 377 GTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGK 436
           GT +T L  + Y +L  +   +    +      L   CY  +    + +P ++ HF  G 
Sbjct: 322 GTTLTLLPQEDYLNLESAVSDVIKLERARDPSKLLSLCYKTTS-DELDLPVITAHF-KGA 379

Query: 437 ALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            ++L   +  +PV+  G  CFAF  +S   +I GN+ QQ   V +DL    V F P  C
Sbjct: 380 DVELNPISTFVPVEK-GVVCFAFI-SSKIGAIFGNLAQQNLLVGYDLVKKTVSFKPTDC 436


>gi|125542690|gb|EAY88829.1| hypothetical protein OsI_10302 [Oryza sativa Indica Group]
          Length = 440

 Score =  182 bits (462), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 126/378 (33%), Positives = 176/378 (46%), Gaps = 34/378 (8%)

Query: 145 STPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPK 204
           S PV SG  Q    Y  R G+G+P +Q  + LDT +D  W  C PC  C   S  +F P 
Sbjct: 69  SAPVASG--QAPPSYVVRAGLGSPSQQLLLALDTSADATWAHCSPCGTCPSSS--LFAPA 124

Query: 205 TSSSYSPLPCAAPQCKSLDVSACRANR--------------CLYQVAYGDGSFTVGDLVT 250
            SSSY+ LPC++  C      AC A +              C +   + D SF    L +
Sbjct: 125 NSSSYASLPCSSSWCPLFQGQACPAPQGGGDAAPPPATLPTCAFSKPFADASFQAA-LAS 183

Query: 251 ETVSFGNSGSVKGIALGCGHDNEG--LFVGSAGLLGLGGGMLSLTKQIKATS---LAYCL 305
           +T+  G   ++     GC     G    +   GLLGLG G ++L  Q  +      +YCL
Sbjct: 184 DTLRLGKD-AIPNYTFGCVSSVTGPTTNMPRQGLLGLGRGPMALLSQAGSLYNGVFSYCL 242

Query: 306 VD-RDSPASGVLEFNSARGGDAVTA---PLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSL 361
              R    SG L    A GG   +    P++RN    + YYV +TG SVG   V++P   
Sbjct: 243 PSYRSYYFSGSLRLG-AGGGQPRSVRYTPMLRNPHRSSLYYVNVTGLSVGRAWVKVPAGS 301

Query: 362 FEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLR 421
           F  D A   G +VD GT ITR     Y +LR+ F R        + +  FDTC++   + 
Sbjct: 302 FAFDAATGAGTVVDSGTVITRWTAPVYAALREEFRRQVAAPSGYTSLGAFDTCFNTDEVA 361

Query: 422 SVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAP----TSSALSIIGNVQQQGT 477
           +   P V++H   G  L LP +N LI   +    C A A      +S +++I N+QQQ  
Sbjct: 362 AGGAPAVTVHMDGGVDLALPMENTLIHSSATPLACLAMAEAPQNVNSVVNVIANLQQQNI 421

Query: 478 RVSFDLANNRVGFTPNKC 495
           RV FD+AN+R+GF    C
Sbjct: 422 RVVFDVANSRIGFAKESC 439


>gi|147811402|emb|CAN61225.1| hypothetical protein VITISV_006732 [Vitis vinifera]
          Length = 440

 Score =  182 bits (462), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 133/385 (34%), Positives = 181/385 (47%), Gaps = 31/385 (8%)

Query: 129 RHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCR 188
           R  +  +EA I P     PV    S  +GEY  +I +GTPP     + DTGSD+ W QC 
Sbjct: 65  RRFMSFSEASISPNTPEPPV----SSNNGEYLMKISIGTPPFDVYGIYDTGSDLMWTQCL 120

Query: 189 PCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANR--CLYQVAYGDGSFTVG 246
           PC  CY+Q +P+FDP  S+S+  + C + QC+ LD  +C   +  C +   YGDGS   G
Sbjct: 121 PCLSCYKQKNPMFDPSKSTSFKEVSCESQQCRLLDTVSCSQPQKLCDFSYGYGDGSLAQG 180

Query: 247 DLVTETVSFG-NSG---SVKGIALGCGHDNEGLF-VGSAGLLGLGGGMLSLTKQIKAT-- 299
            + TET++   NSG   S+  I  GCGH+N G F     GL G GG  LSLT QI +T  
Sbjct: 181 VIATETLTLNSNSGQPXSIXNIVFGCGHNNSGTFNENEMGLFGTGGRPLSLTSQIMSTLG 240

Query: 300 ---SLAYCLVD-RDSPA--SGVLEFNSAR--GGDAVTAPLIRNKKVDTFYYVGLTGFSVG 351
                + CLV  R  P+  S ++    A   G   V+ PL+  K   T+Y+V L G SVG
Sbjct: 241 SGRKFSQCLVPFRTDPSITSKIIFGPEAEVSGSXVVSTPLV-TKDDPTYYFVTLDGISVG 299

Query: 352 GQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALF 411
            +     P       A  G + +D GT  T L    YN L    V+ A  ++P     L 
Sbjct: 300 DKLF---PFSSSSPMATKGNVFIDAGTPPTLLPRDFYNRLVQG-VKEAIPMEPVQDPDLQ 355

Query: 412 -DTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIG 470
              CY  + L  +  P ++ HF        P   ++ P +  G +CFA  P      I G
Sbjct: 356 PQLCYRSATL--IDGPILTAHFDGADVQLKPLNTFISPKE--GVYCFAMQPIDGDTGIFG 411

Query: 471 NVQQQGTRVSFDLANNRVGFTPNKC 495
           N  Q    + FDL   +V F    C
Sbjct: 412 NFVQMNFLIGFDLDGKKVSFKAVDC 436


>gi|357450863|ref|XP_003595708.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355484756|gb|AES65959.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 407

 Score =  182 bits (461), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 124/356 (34%), Positives = 177/356 (49%), Gaps = 28/356 (7%)

Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAP 217
           EY   + +GTPP +     DTGSD+ W QC PCT+CY+Q +P+FDP++SSSY+ + C   
Sbjct: 59  EYLMELSIGTPPIKIYAEADTGSDLVWFQCIPCTKCYKQQNPMFDPRSSSSYTNITCGTE 118

Query: 218 QCKSLDVSACRANR--CLYQVAYGDGSFTVGDLVTETVSF----GNSGSVKGIALGCGHD 271
            C  LD S C  ++  C Y  +Y D S T G L  ET++     G   + +GI  GCGH+
Sbjct: 119 SCNKLDSSLCSTDQKTCNYTYSYADNSITQGVLAQETLTLTSTTGEPVAFQGIIFGCGHN 178

Query: 272 NEGLFVGSAGLLGLGGGMLSLTKQIKAT------SLAYCLVDRDSPASGVLEFNSARGGD 325
           N G      GL+GLG G LSL  QI ++        + CLV  ++  S   + N  +G +
Sbjct: 179 NSGFNDREMGLIGLGRGPLSLISQIGSSLGAGGNMFSQCLVPFNTDPSITSQMNFGKGSE 238

Query: 326 A-----VTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSL-FEMDEAGDGGIIVDCGTA 379
                 V+ PLI   K  T Y+  L G SV  + + +P S    +     G I++D GT 
Sbjct: 239 VLGNGTVSTPLI--SKDGTGYFATLLGISV--EDINLPFSNGSSLGTITKGNILIDSGTT 294

Query: 380 ITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALD 439
           IT L  + Y+ L +  VR    L+P   +  ++ CY      ++  PT+++HF  G  L 
Sbjct: 295 ITYLPEEFYHRLIEQ-VRNKVALEPFR-IDGYELCYQTP--TNLNGPTLTIHFEGGDVLL 350

Query: 440 LPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            PA+ + IPV     FCFA   T+      GN  Q    + FDL    V F    C
Sbjct: 351 TPAQMF-IPVQDD-NFCFAVFDTNEEYVTYGNYAQSNYLIGFDLERQVVSFKATDC 404


>gi|125600538|gb|EAZ40114.1| hypothetical protein OsJ_24557 [Oryza sativa Japonica Group]
          Length = 412

 Score =  182 bits (461), Expect = 5e-43,   Method: Compositional matrix adjust.
 Identities = 114/343 (33%), Positives = 174/343 (50%), Gaps = 31/343 (9%)

Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQC-RPCTECYQQSDPIFDPKTSSSYSPLPCAAP 217
           Y   I +GTPP   + VLDTGSD+ W QC  PC  C+ Q  P++ P  S++Y+ + C +P
Sbjct: 92  YLVDIAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSATYANVSCRSP 151

Query: 218 QCKSLDVSACRANR----CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNE 273
            C++L     R +     C Y  +YGDG+ T G L TET + G+  +V+G+A GCG +N 
Sbjct: 152 MCQALQSPWSRCSPPDTGCAYYFSYGDGTSTDGVLATETFTLGSDTAVRGVAFGCGTENL 211

Query: 274 GLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGDAVTAPLIR 333
           G    S+GL+G+G G LSL  Q+  T        R   +          G    T+P   
Sbjct: 212 GSTDNSSGLVGMGRGPLSLVSQLGVT--------RPRRSCRARAAARGGGAPTTTSP--- 260

Query: 334 NKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRD 393
                      L G +VG   + I P++F +   GDGG+I+D GT  T L+ +A+ +L  
Sbjct: 261 -----------LEGITVGDTLLPIDPAVFRLTPMGDGGVIIDSGTTFTALEERAFVALAR 309

Query: 394 SFVRLAGNLKPTSGVAL-FDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSA 452
           +       L   SG  L    C+  +   +V VP + LHF  G  ++L  ++Y++   SA
Sbjct: 310 ALASRV-RLPLASGAHLGLSLCFAAASPEAVEVPRLVLHFD-GADMELRRESYVVEDRSA 367

Query: 453 GTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           G  C     ++  +S++G++QQQ T + +DL    + F P KC
Sbjct: 368 GVACLGMV-SARGMSVLGSMQQQNTHILYDLERGILSFEPAKC 409


>gi|449462551|ref|XP_004149004.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
 gi|449515029|ref|XP_004164552.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 434

 Score =  181 bits (460), Expect = 5e-43,   Method: Compositional matrix adjust.
 Identities = 138/419 (32%), Positives = 204/419 (48%), Gaps = 40/419 (9%)

Query: 90  LHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVV 149
           LH    + Y SL+     R  +R  TL+T L        R       + I+P+       
Sbjct: 42  LHNPSLSRYDSLI-DAFRRSFSRSATLLTHLTSVSTACIR-------SPIIPD------- 86

Query: 150 SGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSY 209
                 SGE+   I +GTPP     + DTGSD+ W QC PC EC+ QS PIF+P+ SSSY
Sbjct: 87  ------SGEFLMSIFIGTPPVNVIAIADTGSDLTWTQCLPCRECFNQSQPIFNPRRSSSY 140

Query: 210 SPLPCAAPQCKSLDVSACRAN--RCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALG 267
             + CA+  C+SL+   C  +   C Y  +YGD SFT GDL ++ ++ G S  +    +G
Sbjct: 141 RKVSCASDTCRSLESYHCGPDLQSCSYGYSYGDRSFTYGDLASDQITIG-SFKLPKTVIG 199

Query: 268 CGHDNEGLFVG-SAGLLGLGGGMLSLTKQIKATS-----LAYCLVDRDSPA--SGVLEFN 319
           CGH N G F G ++G++GLGGG LSL  Q++  +      +YCL    S A  +G + F 
Sbjct: 200 CGHQNGGTFGGVTSGIIGLGGGSLSLVSQMRTIAGVKPRFSYCLPTFFSNANITGTISFG 259

Query: 320 S---ARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDC 376
                 G   V+ PL+  +  DTFY++ L   SVG +  +    +  M   G+  II+D 
Sbjct: 260 RKAVVSGRQVVSTPLV-PRSPDTFYFLTLEAISVGKKRFKAANGISAMTNHGN--IIIDS 316

Query: 377 GTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGK 436
           GT +T L    Y  +  +  R+    +      + + CY    +  + +P ++ HF  G 
Sbjct: 317 GTTLTLLPRSLYYGVFSTLARVIKAKRVDDPSGILELCYSAGQVDDLNIPIITAHFAGGA 376

Query: 437 ALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            + L   N   PV    T C  FAP ++ ++I GN+ Q    V +DL N R+ F P  C
Sbjct: 377 DVKLLPVNTFAPVADNVT-CLTFAP-ATQVAIFGNLAQINFEVGYDLGNKRLSFEPKLC 433


>gi|3641868|emb|CAA09458.1| hypothetical protein [Cicer arietinum]
          Length = 110

 Score =  181 bits (460), Expect = 5e-43,   Method: Compositional matrix adjust.
 Identities = 85/110 (77%), Positives = 94/110 (85%)

Query: 386 QAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNY 445
           QAY S+RD+F RL  NL+   GVA+FDTCYD S LRSVRVPTVS HFG  +  DLPAKNY
Sbjct: 1   QAYESVRDAFKRLTQNLRSAEGVAIFDTCYDLSSLRSVRVPTVSFHFGNDRVWDLPAKNY 60

Query: 446 LIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           LIPVDS GTFCFAFAPTSS+LSIIGNVQQQGTRVSFD+AN+ VGF+PNKC
Sbjct: 61  LIPVDSDGTFCFAFAPTSSSLSIIGNVQQQGTRVSFDIANSLVGFSPNKC 110


>gi|357118074|ref|XP_003560784.1| PREDICTED: aspartic proteinase nepenthesin-2-like, partial
           [Brachypodium distachyon]
          Length = 452

 Score =  181 bits (460), Expect = 6e-43,   Method: Compositional matrix adjust.
 Identities = 121/376 (32%), Positives = 178/376 (47%), Gaps = 28/376 (7%)

Query: 134 PAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCT-E 192
           PAEA       + P  +G +  + E+   +G G+P +  + + DTGSD++W+QC+PC+  
Sbjct: 91  PAEA----PSATIPDHTGTNLKTPEFVVVVGFGSPAQTSATMFDTGSDLSWIQCQPCSGH 146

Query: 193 CYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTET 252
           CY+Q DP+FDP  SSSY+ +PC   +C +     C    C+Y V YGDGS T G L  ET
Sbjct: 147 CYKQHDPVFDPAKSSSYAVVPCGTTECAAAG-GECNGTTCVYGVEYGDGSSTTGVLARET 205

Query: 253 VSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS----LAYCLVDR 308
           ++F +S    G   GCG  N G F G    L   G          A +     +YCL   
Sbjct: 206 LTFSSSSEFTGFIFGCGETNLGDF-GEVDGLLGLGRGSLSLSSQAAPAFGGIFSYCLPSY 264

Query: 309 D-SPASGVLEFNSARGGDAVTAPLIRNK-KVDTFYYVGLTGFSVGGQAVQIPPSLFEMDE 366
           + +P    +      G   V    + NK    +FY++ L   ++GG  + +PPS F    
Sbjct: 265 NTTPGYLSIGATPVTGQIPVQYTAMVNKPDYPSFYFIELVSINIGGYVLPVPPSEFTKT- 323

Query: 367 AGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVP 426
               G ++D GT +T L   AY +LRD F       KP       DTCYDF+G   + +P
Sbjct: 324 ----GTLLDSGTILTYLPPPAYTALRDRFKFTMQGSKPAPPYDELDTCYDFTGQSGILIP 379

Query: 427 TVSLHFGAGKALDLPAKNYL----IPVDS---AGTFCFAFAPTSSALSIIGNVQQQGTRV 479
            VS +F  G   +L   N+      P D+    G   F   P     S++G+  Q+   V
Sbjct: 380 GVSFNFSDGAVFNL---NFFGIMTFPDDTKPAVGCLAFVSRPADMPFSVVGSTTQRSAEV 436

Query: 480 SFDLANNRVGFTPNKC 495
            +D+   ++GF P  C
Sbjct: 437 IYDVPAQKIGFIPASC 452


>gi|357113696|ref|XP_003558637.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 432

 Score =  181 bits (459), Expect = 8e-43,   Method: Compositional matrix adjust.
 Identities = 123/372 (33%), Positives = 174/372 (46%), Gaps = 26/372 (6%)

Query: 145 STPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPK 204
           S PV SG S  S  Y  R G+G+P +   + LDT +D  W  C PC  C   S  +F P 
Sbjct: 65  SAPVASGQSPPS--YVVRAGLGSPAQPILLALDTSADATWAHCSPCGTC-PSSGSLFAPA 121

Query: 205 TSSSYSPLPCAAPQCKSLDVSACRAN----------RCLYQVAYGDGSFTVGDLVTETVS 254
            S+SY+PLPC++  C  L    C A            C +   + D SF    L ++ + 
Sbjct: 122 NSTSYAPLPCSSTMCTVLQGQPCPAQDPYDSSAPLPMCAFTKPFADASFQ-ASLASDWLH 180

Query: 255 FGNSGSVKGIALGCGHDNEGLFVG--SAGLLGLGGGMLSLTKQIKATS---LAYCLVDRD 309
            G   ++   A GC     G        GLLGLG G ++L  Q+        +YCL    
Sbjct: 181 LGKD-AIPNYAFGCVSAVSGPTANLPKQGLLGLGRGPMALLSQVGNMYNGVFSYCLPSYK 239

Query: 310 SPA-SGVLEFNSARGGDAVT-APLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEA 367
           S   SG L   +A     V   P+++N    + YYV +TG SVG   V++P   F  D A
Sbjct: 240 SYYFSGSLRLGAAGQPRGVRYTPMLKNPNRSSLYYVNVTGLSVGRAPVKVPAGSFAFDPA 299

Query: 368 GDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPT 427
              G +VD GT ITR     Y +LR+ F R        + +  FDTC++   + +   P 
Sbjct: 300 TGAGTVVDSGTVITRWTPPVYAALREEFRRHVAAPSGYTSLGAFDTCFNTDEVAAGVAPA 359

Query: 428 VSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAP----TSSALSIIGNVQQQGTRVSFDL 483
           V++H   G  L LP +N LI   +    C A A      ++ ++++ N+QQQ  RV FD+
Sbjct: 360 VTVHMDGGLDLALPMENTLIHSSATPLACLAMAEAPQNVNAVVNVLANLQQQNLRVVFDV 419

Query: 484 ANNRVGFTPNKC 495
           AN+RVGF    C
Sbjct: 420 ANSRVGFARESC 431


>gi|449454652|ref|XP_004145068.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
 gi|449470630|ref|XP_004153019.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 435

 Score =  181 bits (458), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 127/356 (35%), Positives = 179/356 (50%), Gaps = 23/356 (6%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAA 216
           GEY  ++ VGTPP     V DTGSDI W QC PCT CYQQ  P+F+P  S++Y  + C++
Sbjct: 83  GEYLMKLSVGTPPFPIIAVADTGSDIIWTQCEPCTNCYQQDLPMFNPSKSTTYRKVSCSS 142

Query: 217 PQCK--SLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGN-SGSVKGI---ALGCGH 270
           P C     D S      C Y ++YGD S + GD   +T++ G+ SG V      A+GCGH
Sbjct: 143 PVCSFTGEDNSCSFKPDCTYSISYGDNSHSQGDFAVDTLTMGSTSGRVVAFPRTAIGCGH 202

Query: 271 DNEGLF-VGSAGLLGLGGGMLSLTKQIKAT---SLAYCL--VDRDSPASGVLEFNS---A 321
           DN G F    +G++GLG G  SL KQ+ +      +YCL  +  D   S  L F S    
Sbjct: 203 DNAGSFDANVSGIVGLGLGPASLIKQMGSAVGGKFSYCLTPIGNDDGGSNKLNFGSNANV 262

Query: 322 RGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAIT 381
            G  AV+ P+  + K  +FY + L   SVG        S       G   II+D GT +T
Sbjct: 263 SGSGAVSTPIYISDKFKSFYSLKLKAVSVGRNNTFY--STANSILGGKANIIIDSGTTLT 320

Query: 382 RLQTQAYNSLRDSFVRLAGNLKPTSGVALF-DTCYDFSGLRSVRVPTVSLHFGAGKALDL 440
            L    Y++   + +  + NL+ T     F + C++ +     +VP +++HF  G  L L
Sbjct: 321 LLPVDLYHNFAKA-ISNSINLQRTDDPNQFLEYCFE-TTTDDYKVPFIAMHF-EGANLRL 377

Query: 441 PAKNYLIPVDSAGTFCFAFA-PTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
             +N LI V S    C AFA    + +SI GN+ Q    V +D+ N  + F P  C
Sbjct: 378 QRENVLIRV-SDNVICLAFAGAQDNDISIYGNIAQINFLVGYDVTNMSLSFKPMNC 432


>gi|50508279|dbj|BAD32128.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
           Group]
 gi|125600536|gb|EAZ40112.1| hypothetical protein OsJ_24555 [Oryza sativa Japonica Group]
          Length = 455

 Score =  181 bits (458), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 130/371 (35%), Positives = 184/371 (49%), Gaps = 33/371 (8%)

Query: 154 QGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD--PIFDPKTSSSYSP 211
            G+G Y   I +GTPP  F +++DTGS++ W QC PCT C+ +    P+  P  SS++S 
Sbjct: 86  NGAGAYNMNISLGTPPLDFPVIVDTGSNLIWAQCAPCTRCFPRPTPAPVLQPARSSTFSR 145

Query: 212 LPCAAPQCKSLDVSA----CRAN-RCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIAL 266
           LPC    C+ L  S+    C A   C Y   YG G +T G L TET++ G+ G+   +A 
Sbjct: 146 LPCNGSFCQYLPTSSRPRTCNATAACAYNYTYGSG-YTAGYLATETLTVGD-GTFPKVAF 203

Query: 267 GCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVD--RDSPASGVLEFNSAR-- 322
           GC  +N      S+G++GLG G LSL  Q+     +YCL     D  AS +L  + A+  
Sbjct: 204 GCSTENG--VDNSSGIVGLGRGPLSLVSQLAVGRFSYCLRSDMADGGASPILFGSLAKLT 261

Query: 323 -GGDAVTAPLIRNKKVD--TFYYVGLTGFSVGGQAVQIPPSLFEMDEAG-DGGIIVDCGT 378
            G    + PL++N  +   T YYV LTG +V    + +  S F   + G  GG IVD GT
Sbjct: 262 EGSVVQSTPLLKNPYLQRSTHYYVNLTGIAVDSTELPVTGSTFGFTQTGLGGGTIVDSGT 321

Query: 379 AITRLQTQAYNSLRDSFVRLAGNLK---PTSGVAL-FDTCYDFS---GLRSVRVPTVSLH 431
            +T L    Y  ++ +F     NL    P SG     D CY  S   G ++VRVP ++L 
Sbjct: 322 TLTYLAKDGYAMVKQAFQSQMANLNQTTPASGAPYDLDLCYKPSAGGGGKAVRVPRLALR 381

Query: 432 FGAGKALDLPAKNYL--IPVDSAGTF---CFAFAPTSSAL--SIIGNVQQQGTRVSFDLA 484
           F  G   ++P +NY   +  DS G     C    P +  L  SIIGN+ Q    + +D+ 
Sbjct: 382 FAGGAKYNVPVQNYFAGVEADSQGRVTVACLLVLPATDDLPISIIGNLMQMDMHLLYDID 441

Query: 485 NNRVGFTPNKC 495
                F P  C
Sbjct: 442 GGMFSFAPADC 452


>gi|357457681|ref|XP_003599121.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355488169|gb|AES69372.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 439

 Score =  181 bits (458), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 136/427 (31%), Positives = 202/427 (47%), Gaps = 45/427 (10%)

Query: 99  RSLVLSRLERDSAR---VNTLITKLQLAIYNVDRHELKPAEA-----QILPEDFSTPVVS 150
           R   +  +  DS+R    N   T+LQ  I NV  H +K A        +   D   P + 
Sbjct: 25  RGFSVELIHPDSSRSPFYNIRETQLQ-RISNVVTHSIKRAHYLNHVFSLSHNDLPKPTI- 82

Query: 151 GASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYS 210
                   Y     +GTPP Q   V+DTGSD  W QC+PC  C  Q+ PIF+P  SS+Y 
Sbjct: 83  -IPYAGSYYVMSYSIGTPPFQLYGVVDTGSDGIWFQCKPCKPCLNQTSPIFNPSKSSTYK 141

Query: 211 PLPCAAPQCKSLDVSACRANR---CLYQVAYGDGSFTVGDLVTETVSF----GNSGSVKG 263
            + C++P CK  + + C +NR   C Y++ Y D S + GD+  +T++     G+  S   
Sbjct: 142 NIRCSSPICKRGEKTRCSSNRKRKCEYEITYLDRSGSQGDISKDTLTLNSNDGSPISFPK 201

Query: 264 IALGCGHDN----EGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPA--SG 314
           I +GCGH N    EGL   ++G++G G G  S+  Q+ ++     +YCL    S A  S 
Sbjct: 202 IVIGCGHKNSLTTEGL---ASGIIGFGRGNFSIVSQLGSSIGGKFSYCLASLFSKANISS 258

Query: 315 VLEFNS---ARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGG 371
            L F       G   V+ PLI++  V   Y+  L  FSVG   +++  S    D  G+  
Sbjct: 259 KLYFGDMAVVSGHGVVSTPLIQSFYVGN-YFTNLEAFSVGDHIIKLKDSSLIPDNEGNA- 316

Query: 372 IIVDCGTAITRLQTQAYNSLRD---SFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTV 428
            ++D G+ IT+L    Y+ L     S V+L     PT  ++L   CY  + L+   VP +
Sbjct: 317 -VIDSGSTITQLPNDVYSQLETAVISMVKLKRVKDPTQQLSL---CYK-TTLKKYEVPII 371

Query: 429 SLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRV 488
           + HF  G  + L A N  I ++     CFAF  ++    + GN+ QQ   V +D   N +
Sbjct: 372 TAHF-RGADVKLNAFNTFIQMNHE-VMCFAFNSSAFPWVVYGNIAQQNFLVGYDTLKNII 429

Query: 489 GFTPNKC 495
            F P  C
Sbjct: 430 SFKPTNC 436


>gi|147866052|emb|CAN80962.1| hypothetical protein VITISV_022007 [Vitis vinifera]
          Length = 150

 Score =  181 bits (458), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 85/146 (58%), Positives = 107/146 (73%)

Query: 350 VGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVA 409
           VGG  V I   +F + E GDGG+++D GTA+TRL T AY + RD+F+    NL   +GVA
Sbjct: 5   VGGIRVPISEEVFRLTELGDGGVVMDTGTAVTRLPTLAYQAFRDAFLAQTANLPRATGVA 64

Query: 410 LFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSII 469
           +FDTCYD  G  SVRVPTVS +F  G  L LPA+N+LIP+D AGTFCFAFAP++S LSI+
Sbjct: 65  IFDTCYDLLGFVSVRVPTVSFYFSGGPILTLPARNFLIPMDDAGTFCFAFAPSTSGLSIL 124

Query: 470 GNVQQQGTRVSFDLANNRVGFTPNKC 495
           GN+QQ+G ++SFD AN  VGF PN C
Sbjct: 125 GNIQQEGIQISFDGANGYVGFGPNIC 150


>gi|357486591|ref|XP_003613583.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355514918|gb|AES96541.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 437

 Score =  180 bits (457), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 131/432 (30%), Positives = 201/432 (46%), Gaps = 35/432 (8%)

Query: 87  REILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFST 146
           R  + KT++N +   ++  +   S   NT  +  Q    N+ +H         L   FS 
Sbjct: 15  RVSVSKTQNNGFSVELIHPISSKSPFYNTAESHFQRMSNNM-KHSTN--RVHYLNHVFSF 71

Query: 147 P------VVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPI 200
           P      +V     G G Y     +GTPP Q   V+DT +D  W QC PC  C+  + P+
Sbjct: 72  PPNKVPNIVVSPFMGDG-YIISFLIGTPPFQLYGVMDTANDNIWFQCNPCKPCFNTTSPM 130

Query: 201 FDPKTSSSYSPLPCAAPQCKSLDVSACRANR---CLYQVAYGDGSFTVGDLVTETVSFGN 257
           FDP  SS+Y  +PC++P+CK+++ + C ++    C Y   YG  +++ GDL  +T++  +
Sbjct: 131 FDPSKSSTYKTIPCSSPKCKNVENTHCSSDDKKVCEYSFTYGGEAYSQGDLSIDTLTLNS 190

Query: 258 SG----SVKGIALGCGHDNEGLFVGS-AGLLGLGGGMLSLTKQIKAT---SLAYCLVD-- 307
           +     S K I +GCGH N+G   G  +G +GLG G LS   Q+ ++     +YCLV   
Sbjct: 191 NNDTPISFKNIVIGCGHRNKGPLEGYVSGNIGLGRGPLSFISQLNSSIGGKFSYCLVPLF 250

Query: 308 RDSPASGVLEFNS---ARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEM 364
            +   SG L F       G   V+ P+      +  Y   L   SVG   ++   S  + 
Sbjct: 251 SNEGISGKLHFGDKSVVSGVGTVSTPITAG---EIGYSTTLNALSVGDHIIKFENSTSKN 307

Query: 365 DEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVR 424
           D  G+   I+D GT +T L    Y+ L      +    +  S    F  CY  + L+++ 
Sbjct: 308 DNLGN--TIIDSGTTLTILPENVYSRLESIVTSMVKLERAKSPNQQFKLCYK-ATLKNLD 364

Query: 425 VPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSS-ALSIIGNVQQQGTRVSFDL 483
           VP ++ HF  G  + L + N   P+D     CFAF    +   +IIGN+ QQ   V FDL
Sbjct: 365 VPIITAHFN-GADVHLNSLNTFYPIDHE-VVCFAFVSVGNFPGTIIGNIAQQNFLVGFDL 422

Query: 484 ANNRVGFTPNKC 495
             N + F P  C
Sbjct: 423 QKNIISFKPTDC 434


>gi|222624820|gb|EEE58952.1| hypothetical protein OsJ_10633 [Oryza sativa Japonica Group]
          Length = 415

 Score =  180 bits (456), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 130/367 (35%), Positives = 190/367 (51%), Gaps = 42/367 (11%)

Query: 145 STPVVSGASQG---SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIF 201
           S PV  GA      + EY   + +GTPP+   + LDTGSD+ W QC+PC  C+ Q+ P F
Sbjct: 72  SAPVSPGAYDNGVPTTEYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYF 131

Query: 202 DPKTSSSYSPLPCAAPQCKSLDVSAC-RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGS 260
           DP TSS+ S   C +  C+ L V++  R+++  +                     G   S
Sbjct: 132 DPSTSSTLSLTSCDSTLCQGLPVASLPRSDKFTF--------------------VGAGAS 171

Query: 261 VKGIALGCGHDNEGLFVGS-AGLLGLGGGMLSLTKQIKATSLAYCL--VDRDSPASGVLE 317
           V G+A GCG  N G+F  +  G+ G G G LSL  Q+K  + ++C   +    P++ +L+
Sbjct: 172 VPGVAFGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKVGNFSHCFTTITGAIPSTVLLD 231

Query: 318 FNS---ARGGDAV-TAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGII 373
             +   + G  AV T PLI+N    TFYY+ L G +VG   + +P S F +   G GG I
Sbjct: 232 LPADLFSNGQGAVQTTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFALKN-GTGGTI 290

Query: 374 VDCGTAITRLQTQAYNSLRDSFVRLAGNLK-PTSGVALFDTCYDFSG-LRSV-RVPTVSL 430
           +D GTA+T L T+ Y  +RD+F   A  +K P       D  +  S  LR+   VP + L
Sbjct: 291 IDSGTAMTSLPTRVYRLVRDAF---AAQVKLPVVSGNTTDPYFCLSAPLRAKPYVPKLVL 347

Query: 431 HFGAGKALDLPAKNYLIPVDSAGT--FCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRV 488
           HF  G  +DLP +NY+  V+ AG+   C A       ++ IGN QQQ   V +DL N+++
Sbjct: 348 HF-EGATMDLPRENYVFEVEDAGSSILCLAII-EGGEVTTIGNFQQQNMHVLYDLQNSKL 405

Query: 489 GFTPNKC 495
            F P +C
Sbjct: 406 SFVPAQC 412


>gi|115475621|ref|NP_001061407.1| Os08g0267300 [Oryza sativa Japonica Group]
 gi|37806402|dbj|BAC99940.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
           sativa Japonica Group]
 gi|113623376|dbj|BAF23321.1| Os08g0267300 [Oryza sativa Japonica Group]
          Length = 524

 Score =  180 bits (456), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 136/426 (31%), Positives = 199/426 (46%), Gaps = 47/426 (11%)

Query: 106 LERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGV 165
           L  D AR N+L  + + A     +     A A    E    P+ SG    +  Y + I +
Sbjct: 107 LAADEARANSLQLRNKAAFTQSGKKATAAAAAAAGAE---VPLTSGIRFQTLNYVTTIAL 163

Query: 166 GTPPR------QFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQC 219
           G            ++++DTGSD+ W+QC+PC+ CY Q DP+FDP  S+SY+ +PC A  C
Sbjct: 164 GGGGSSRAGAGNLTVIVDTGSDLTWVQCKPCSVCYAQRDPLFDPSGSASYAAVPCNASAC 223

Query: 220 KSLDVSAC----------------RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKG 263
           ++   +A                 ++ RC Y +AYGDGSF+ G L T+TV+ G + SV G
Sbjct: 224 EASLKAATGVPGSCATVGGGGGGGKSERCYYSLAYGDGSFSRGVLATDTVALGGA-SVDG 282

Query: 264 IALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIK---ATSLAYCLVDRDS-PASGVLEF- 318
              GCG  N GLF G+AGL+GLG   LSL  Q         +YCL    S  A+G L   
Sbjct: 283 FVFGCGLSNRGLFGGTAGLMGLGRTELSLVSQTAPRFGGVFSYCLPAATSGDAAGSLSLG 342

Query: 319 ---NSARGGDAVT-APLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIV 374
              +S R    V+   +I +     FY++ +T          +  +       G   +++
Sbjct: 343 GDTSSYRNATPVSYTRMIADPAQPPFYFMNVT-------GASVGGAAVAAAGLGAANVLL 395

Query: 375 DCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSG--VALFDTCYDFSGLRSVRVPTVSLHF 432
           D GT ITRL    Y ++R  F R  G  +  +    +L D CY+ +G   V+VP ++L  
Sbjct: 396 DSGTVITRLAPSVYRAVRAEFARQFGAERYPAAPPFSLLDACYNLTGHDEVKVPLLTLRL 455

Query: 433 GAGKALDLPAKNYLIPVDSAGT-FCFAFAPTS--SALSIIGNVQQQGTRVSFDLANNRVG 489
             G  + + A   L      G+  C A A  S      IIGN QQ+  RV +D   +R+G
Sbjct: 456 EGGADMTVDAAGMLFMARKDGSQVCLAMASLSFEDQTPIIGNYQQKNKRVVYDTVGSRLG 515

Query: 490 FTPNKC 495
           F    C
Sbjct: 516 FADEDC 521


>gi|125560845|gb|EAZ06293.1| hypothetical protein OsI_28528 [Oryza sativa Indica Group]
          Length = 525

 Score =  180 bits (456), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 120/355 (33%), Positives = 175/355 (49%), Gaps = 38/355 (10%)

Query: 171 QFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSAC--- 227
             ++++DTGSD+ W+QC+PC+ CY Q DP+FDP  S+SY+ +PC A  C++   +A    
Sbjct: 176 NLTVIVDTGSDLTWVQCKPCSVCYAQRDPLFDPSGSASYAAVPCNASACEASLKAATGVP 235

Query: 228 -------------RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEG 274
                        ++ RC Y +AYGDGSF+ G L T+TV+ G + SV G   GCG  N G
Sbjct: 236 GSCATVGGGGGGGKSERCYYSLAYGDGSFSRGVLATDTVALGGA-SVDGFVFGCGLSNRG 294

Query: 275 LFVGSAGLLGLGGGMLSLTKQIK---ATSLAYCLVDRDS-PASGVLEF----NSARGGDA 326
           LF G+AGL+GLG   LSL  Q         +YCL    S  A+G L      +S R    
Sbjct: 295 LFGGTAGLMGLGRTELSLVSQTAPRFGGVFSYCLPAATSGDAAGSLSLGGDTSSYRNATP 354

Query: 327 VT-APLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQT 385
           V+   +I +     FY++ +T          +  +       G   +++D GT ITRL  
Sbjct: 355 VSYTRMIADPAQPPFYFMNVT-------GASVGGAAVAAAGLGAANVLLDSGTVITRLAP 407

Query: 386 QAYNSLRDSFVRLAGNLKPTSG--VALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAK 443
             Y ++R  F R  G  +  +    +L D CY+ +G   V+VP ++L    G  + + A 
Sbjct: 408 SVYRAVRAEFARQFGAERYPAAPPFSLLDACYNLTGHDEVKVPLLTLRLEGGADMTVDAA 467

Query: 444 NYLIPVDSAGT-FCFAFAPTS--SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
             L      G+  C A A  S      IIGN QQ+  RV +D   +R+GF    C
Sbjct: 468 GMLFMARKDGSQVCLAMASLSFEDQTPIIGNYQQKNKRVVYDTVGSRLGFADEDC 522


>gi|449499012|ref|XP_004160696.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 435

 Score =  179 bits (455), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 127/356 (35%), Positives = 179/356 (50%), Gaps = 23/356 (6%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAA 216
           GEY  ++ VGTPP     V DTGSDI W QC PCT CYQQ  P+F+P  S++Y  + C++
Sbjct: 83  GEYLMKLSVGTPPFPIIAVADTGSDIIWTQCVPCTNCYQQDLPMFNPSKSTTYRKVSCSS 142

Query: 217 PQCK--SLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGN-SGSVKGI---ALGCGH 270
           P C     D S      C Y ++YGD S + GD   +T++ G+ SG V      A+GCGH
Sbjct: 143 PVCSFTGEDNSCSFKPDCTYSISYGDNSHSQGDFAVDTLTMGSTSGRVVAFPRTAIGCGH 202

Query: 271 DNEGLF-VGSAGLLGLGGGMLSLTKQIKAT---SLAYCL--VDRDSPASGVLEFNS---A 321
           DN G F    +G++GLG G  SL KQ+ +      +YCL  +  D   S  L F S    
Sbjct: 203 DNAGSFDANVSGIVGLGLGPASLIKQMGSAVGGKFSYCLTPIGNDDGGSNKLNFGSNANV 262

Query: 322 RGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAIT 381
            G  AV+ P+  + K  +FY + L   SVG        S       G   II+D GT +T
Sbjct: 263 SGSGAVSTPIYISDKFKSFYSLKLKAVSVGRNNTFY--STANSILGGKANIIIDSGTTLT 320

Query: 382 RLQTQAYNSLRDSFVRLAGNLKPTSGVALF-DTCYDFSGLRSVRVPTVSLHFGAGKALDL 440
            L    Y++   + +  + NL+ T     F + C++ +     +VP +++HF  G  L L
Sbjct: 321 LLPVDLYHNFAKA-ISNSINLQRTDDPNQFLEYCFE-TTTDDYKVPFIAMHF-EGANLRL 377

Query: 441 PAKNYLIPVDSAGTFCFAFA-PTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
             +N LI V S    C AFA    + +SI GN+ Q    V +D+ N  + F P  C
Sbjct: 378 QRENVLIRV-SDNVICLAFAGAQDNDISIYGNIAQINFLVGYDVTNMSLSFKPMNC 432


>gi|357143657|ref|XP_003573000.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-2-like [Brachypodium distachyon]
          Length = 464

 Score =  179 bits (455), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 143/458 (31%), Positives = 210/458 (45%), Gaps = 61/458 (13%)

Query: 59  EPFAEESETAAESFPLNSSSSFSLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLIT 118
           EP   +S ++  + PLN       P+ S     K +   +  L    L RD  R N +  
Sbjct: 47  EPKVRDSSSSGATVPLNHRHGPCSPVPS----GKKKQPTFTEL----LRRDQLRANYIQR 98

Query: 119 KLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDT 178
           +        D H   P    +   + + P+  G+   + EY   + +G+P    +M +DT
Sbjct: 99  QFS------DEH--YPRTGGLQQSEATVPIALGSLLNTLEYVITVSIGSPAVAXTMFIDT 150

Query: 179 GSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLD---VSACRANRCLYQ 235
           GSD++WL+C+           ++DP TSS+Y+P  C+AP C  L          + C+Y 
Sbjct: 151 GSDVSWLRCK---------SRLYDPGTSSTYAPFSCSAPACAQLGRRGTGCSSGSTCVYS 201

Query: 236 VAYGDGSFTVGDLVTETVSFGNSGS--VKGIALGCGHDNEGLFVGSA-GLLGLGGGMLSL 292
           V YGDGS T G   ++T++   +    + G   GC     G    +  GL+GLGG   S 
Sbjct: 202 VKYGDGSNTTGTYGSDTLTLAGTSEPLISGFQFGCSAVEHGFEEDNTDGLMGLGGDAQSF 261

Query: 293 TKQIKAT---SLAYCLVDRDSPASGVLEFNSARGGDAVTA---PLIRNKKVDTFYYVGLT 346
             Q  AT   + +YCL    + +SG L   +     +      P++R+K+  TFY + L 
Sbjct: 262 VSQTAATYGSAFSYCLPPTWN-SSGFLTLGAPSSSTSAAFSTTPMLRSKQAATFYGLLLR 320

Query: 347 GFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSL----RDSFVRLAGNL 402
           G SVGG+ ++IP S+F        G IVD GT ITRL   AY +L    RD   R     
Sbjct: 321 GISVGGKTLEIPSSVFS------AGSIVDSGTVITRLPPTAYGALSAAFRDGMARY--QY 372

Query: 403 KPTSGVALFDTCYDFSGL---RSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAF 459
           +P +   L DTC+DF+G     +  VP+V+L    G  +DL      I  D     C AF
Sbjct: 373 QPAAPRGLLDTCFDFTGHGEGNNFTVPSVALVLDGGAVVDLHPNG--IVQDG----CLAF 426

Query: 460 APT--SSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           A T       IIGNVQQ+   V +D+  +  GF P  C
Sbjct: 427 AATDDDGRTGIIGNVQQRTFEVLYDVGQSVFGFRPGAC 464


>gi|297605079|ref|NP_001056639.2| Os06g0121800 [Oryza sativa Japonica Group]
 gi|255676668|dbj|BAF18553.2| Os06g0121800 [Oryza sativa Japonica Group]
          Length = 487

 Score =  179 bits (455), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 124/338 (36%), Positives = 163/338 (48%), Gaps = 30/338 (8%)

Query: 174 MVLDTGSDINWLQCRPCT--ECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDV--SACRA 229
           M +DT  D+ W+QC PC   ECY Q + +FDP+ S + + +PC +  C  L    + C  
Sbjct: 164 MSIDTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGELGRYGAGCSN 223

Query: 230 NRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGS-AGLLGLGGG 288
           N+C Y V YGDG  T G  + + ++   S  V     GC H   G F  S +G + LGGG
Sbjct: 224 NQCQYFVDYGDGRATSGTYMVDALTLNPSTVVMNFRFGCSHAVRGNFSASTSGTMSLGGG 283

Query: 289 MLSLTKQIKAT---SLAYCLVDRDSPASGVLEFNSARGGDA----VTAPLIRNKK-VDTF 340
             SL  Q  AT   + +YC+ D  S  SG L       G         PL+RN   + T 
Sbjct: 284 RQSLLSQTAATFGNAFSYCVPDPSS--SGFLSLGGPADGGGAGRFARTPLVRNPSIIPTL 341

Query: 341 YYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVR-LA 399
           Y V L G  VGG+ + +PP +F       GG ++D    IT+L   AY +LR +F   +A
Sbjct: 342 YLVRLRGIEVGGRRLNVPPVVFA------GGAVMDSSVIITQLPPTAYRALRLAFRSAMA 395

Query: 400 GNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAF 459
              +   G A  DTCYDF    SV VP VSL F  G  + L A   ++        C AF
Sbjct: 396 AYPRVAGGRAGLDTCYDFVRFTSVTVPAVSLVFDGGAVVRLDAMGVMV------EGCLAF 449

Query: 460 APTSS--ALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            PT    AL  IGNVQQQ   V +D+    VGF    C
Sbjct: 450 VPTPGDFALGFIGNVQQQTHEVLYDVGGGSVGFRRGAC 487


>gi|218197468|gb|EEC79895.1| hypothetical protein OsI_21423 [Oryza sativa Indica Group]
          Length = 471

 Score =  179 bits (455), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 124/338 (36%), Positives = 163/338 (48%), Gaps = 30/338 (8%)

Query: 174 MVLDTGSDINWLQCRPCT--ECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDV--SACRA 229
           M +DT  D+ W+QC PC   ECY Q + +FDP+ S + + +PC +  C  L    + C  
Sbjct: 148 MSIDTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGELGRYGAGCSN 207

Query: 230 NRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGS-AGLLGLGGG 288
           N+C Y V YGDG  T G  + + ++   S  V     GC H   G F  S +G + LGGG
Sbjct: 208 NQCQYFVDYGDGRATSGTYMVDALTLNPSTVVMNFRFGCSHAVRGNFSASTSGTMSLGGG 267

Query: 289 MLSLTKQIKAT---SLAYCLVDRDSPASGVLEFNSARGGDA----VTAPLIRNKK-VDTF 340
             SL  Q  AT   + +YC+ D  S  SG L       G         PL+RN   + T 
Sbjct: 268 RQSLLSQTAATFGNAFSYCVPDPSS--SGFLSLGGPADGGGAGRFARTPLVRNPSIIPTL 325

Query: 341 YYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVR-LA 399
           Y V L G  VGG+ + +PP +F       GG ++D    IT+L   AY +LR +F   +A
Sbjct: 326 YLVRLRGIEVGGRRLNVPPVVFA------GGAVMDSSVIITQLPPTAYRALRLAFRSAMA 379

Query: 400 GNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAF 459
              +   G A  DTCYDF    SV VP VSL F  G  + L A   ++        C AF
Sbjct: 380 AYPRVAGGRAGLDTCYDFVRFTSVTVPAVSLVFDGGAVVRLDAMGVMV------EGCLAF 433

Query: 460 APTSS--ALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            PT    AL  IGNVQQQ   V +D+    VGF    C
Sbjct: 434 VPTPGDFALGFIGNVQQQTHEVLYDVGGGSVGFRRGAC 471


>gi|449462553|ref|XP_004149005.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 418

 Score =  179 bits (454), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 114/344 (33%), Positives = 171/344 (49%), Gaps = 25/344 (7%)

Query: 165 VGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDV 224
           +GTPP  +  + DTGSD+ W QC PC +CYQQ  PIF+P  S+S+S +PC    C ++D 
Sbjct: 86  IGTPPVDYLGIADTGSDLTWAQCLPCLKCYQQLRPIFNPLKSTSFSHVPCNTQTCHAVDD 145

Query: 225 SACRAN-RCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLL 283
             C     C Y   YGD +++ GDL  E ++ G+S SVK + +GCGH + G F  ++G++
Sbjct: 146 GHCGVQGVCDYSYTYGDRTYSKGDLGFEKITIGSS-SVKSV-IGCGHASSGGFGFASGVI 203

Query: 284 GLGGGMLSLTKQIKATS-----LAYCLVDRDSPASGVLEFNS---ARGGDAVTAPLIRNK 335
           GLGGG LSL  Q+  TS      +YCL    S A+G + F       G   V+ PLI   
Sbjct: 204 GLGGGQLSLVSQMSQTSGISRRFSYCLPTLLSHANGKINFGQNAVVSGPGVVSTPLISKN 263

Query: 336 KVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSF 395
            V T+YY+ L   S+G +          M  A  G +I+D GT ++ L  + Y+ +  S 
Sbjct: 264 TV-TYYYITLEAISIGNER--------HMAFAKQGNVIIDSGTTLSFLPKELYDGVVSSL 314

Query: 396 VRLAGNLKPTSGVALFDTCYD--FSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAG 453
           +++    +       +D C+D   +   S  +P ++  F  G  ++L   N    V +  
Sbjct: 315 LKVVKAKRVKDPGNFWDLCFDDGINVATSSGIPIITAQFSGGANVNLLPVNTFQKV-ANN 373

Query: 454 TFCFAFAPTS--SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
             C    P S      IIGN+      + +DL   R+ F P  C
Sbjct: 374 VNCLTLTPASPTDEFGIIGNLALANFLIGYDLEAKRLSFKPTVC 417


>gi|125558632|gb|EAZ04168.1| hypothetical protein OsI_26310 [Oryza sativa Indica Group]
 gi|125600539|gb|EAZ40115.1| hypothetical protein OsJ_24558 [Oryza sativa Japonica Group]
          Length = 453

 Score =  179 bits (453), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 132/369 (35%), Positives = 192/369 (52%), Gaps = 38/369 (10%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTE-CYQQSDPIFDPKTSSSYSPLPCA 215
           GEY   + +GTPP+ +  + DTGSD+ W QC PC E C++Q  P+++P +S ++  LPC+
Sbjct: 90  GEYIMTLAIGTPPQSYPAIADTGSDLVWTQCAPCGERCFKQPSPLYNPSSSPTFRVLPCS 149

Query: 216 APQCKSLDVSACRAN----------RCLYQVAYGDGSFTVGDLVTETVSFGNSGS----V 261
           +    +L++ A  A            C Y   YG G +T G   +ET +FG+S +    V
Sbjct: 150 S----ALNLCAAEARLAGATPPPGCACRYNQTYGTG-WTSGLQGSETFTFGSSPADQVRV 204

Query: 262 KGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVD-RDSPASGVLEFNS 320
            GIA GC + +   + GSAGL+GLG G LSL  Q+ A   +YCL   +D+ +   L    
Sbjct: 205 PGIAFGCSNASSDDWNGSAGLVGLGRGGLSLVSQLAAGMFSYCLTPFQDTKSKSTLLLGP 264

Query: 321 A------RGGDAVTAPLI---RNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGG 371
           A       G    + P +       + T+YY+ LTG SVG  A+ IPP  F +   G GG
Sbjct: 265 AAAAAALNGTGVRSTPFVPSPSKPPMSTYYYLNLTGISVGAAALPIPPGAFALRADGTGG 324

Query: 372 IIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSG--VALFDTCYDF--SGLRSVRVPT 427
           +I+D GT IT L   AY  +R + VR    L  T G      D C+    S      +P+
Sbjct: 325 LIIDSGTTITSLVDAAYKRVRAA-VRSLVKLPVTDGSNATGLDLCFALPSSSAPPATLPS 383

Query: 428 VSLHFGAGKALDLPAKNYLIPVDSAGTFCFAF-APTSSALSIIGNVQQQGTRVSFDLANN 486
           ++LHFG G  + LP +NY+I +D  G +C A  + T   LS +GN QQQ   + +D+   
Sbjct: 384 MTLHFGGGADMVLPVENYMI-LD-GGMWCLAMRSQTDGELSTLGNYQQQNLHILYDVQKE 441

Query: 487 RVGFTPNKC 495
            + F P KC
Sbjct: 442 TLSFAPAKC 450


>gi|115466068|ref|NP_001056633.1| Os06g0119600 [Oryza sativa Japonica Group]
 gi|55296446|dbj|BAD68569.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|55296924|dbj|BAD68375.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|113594673|dbj|BAF18547.1| Os06g0119600 [Oryza sativa Japonica Group]
 gi|215694767|dbj|BAG89958.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215737752|dbj|BAG96882.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 495

 Score =  179 bits (453), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 112/339 (33%), Positives = 173/339 (51%), Gaps = 28/339 (8%)

Query: 173 SMVLDTGSDINWLQCRPCT--ECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDV--SACR 228
           ++++D+GSD++W+QC+PC    C++Q DP+FDP  S++Y+ +PC +  C  L      C 
Sbjct: 169 TVIIDSGSDVSWVQCKPCPLPMCHRQRDPLFDPAMSTTYAAVPCTSAACAQLGPYRRGCS 228

Query: 229 AN-RCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEG--LFVGSAGLLGL 285
           AN +C + + YGDGS   G    + ++ G    ++G   GC H + G       AG L L
Sbjct: 229 ANAQCQFGINYGDGSTATGTYSFDDLTLGPYDVIRGFRFGCAHADRGSAFDYDVAGSLAL 288

Query: 286 GGGMLSLTKQIK---ATSLAYCLVDRDSPAS----GVLEFNSARGGDAVTAPLIRNKKVD 338
           GGG  SL +Q         +YCL    S       GV    +      V+ PL+ +    
Sbjct: 289 GGGSQSLVQQTATRYGRVFSYCLPPTASSLGFLVLGVPPERAQLIPSFVSTPLLSSSMAP 348

Query: 339 TFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRL 398
           TFY V L    V G+ + +PP++F          ++D  T I+RL   AY +LR +F   
Sbjct: 349 TFYRVLLRAIIVAGRPLAVPPAVFSASS------VIDSSTIISRLPPTAYQALRAAFRSA 402

Query: 399 AGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFA 458
               +    V++ DTCYDF+G+RS+ +P+++L F  G  ++L A   L+     G+ C A
Sbjct: 403 MTMYRAAPPVSILDTCYDFTGVRSITLPSIALVFDGGATVNLDAAGILL-----GS-CLA 456

Query: 459 FAPTSS--ALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           FAPT+S      IGNVQQ+   V +D+    + F    C
Sbjct: 457 FAPTASDRMPGFIGNVQQKTLEVVYDVPAKAMRFRTAAC 495


>gi|115442309|ref|NP_001045434.1| Os01g0954900 [Oryza sativa Japonica Group]
 gi|20161865|dbj|BAB90778.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
 gi|113534965|dbj|BAF07348.1| Os01g0954900 [Oryza sativa Japonica Group]
 gi|215766867|dbj|BAG99095.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 445

 Score =  179 bits (453), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 120/352 (34%), Positives = 173/352 (49%), Gaps = 23/352 (6%)

Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAP 217
            Y +R G+GTP +   + +D  +D  W+ C  C  C   S P F P  SS+Y  +PC +P
Sbjct: 101 NYIARAGLGTPAQTLLVAIDPSNDAAWVPCSACAGC-AASSPSFSPTQSSTYRTVPCGSP 159

Query: 218 QCKSLDVSACRA---NRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEG 274
           QC  +   +C A   + C + + Y   +F    L  ++++  N+  V     GC     G
Sbjct: 160 QCAQVPSPSCPAGVGSSCGFNLTYAASTFQ-AVLGQDSLALENN-VVVSYTFGCLRVVSG 217

Query: 275 LFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVD-RDSPASGVLEFNSARGGDAV-TA 329
             V   GL+G G G LS   Q K T     +YCL + R S  SG L+         + T 
Sbjct: 218 NSVPPQGLIGFGRGPLSFLSQTKDTYGSVFSYCLPNYRSSNFSGTLKLGPIGQPKRIKTT 277

Query: 330 PLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYN 389
           PL+ N    + YYV + G  VG + VQ+P S    +     G I+D GT  TRL    Y 
Sbjct: 278 PLLYNPHRPSLYYVNMIGIRVGSKVVQVPQSALAFNPVTGSGTIIDAGTMFTRLAAPVYA 337

Query: 390 SLRDSFV-RLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIP 448
           ++RD+F  R+   + P  G   FDTCY+     +V VPTV+  F    A+ LP +N +I 
Sbjct: 338 AVRDAFRGRVRTPVAPPLGG--FDTCYNV----TVSVPTVTFMFAGAVAVTLPEENVMIH 391

Query: 449 VDSAGTFCFAFAP-----TSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
             S G  C A A       ++AL+++ ++QQQ  RV FD+AN RVGF+   C
Sbjct: 392 SSSGGVACLAMAAGPSDGVNAALNVLASMQQQNQRVLFDVANGRVGFSRELC 443


>gi|125558629|gb|EAZ04165.1| hypothetical protein OsI_26307 [Oryza sativa Indica Group]
          Length = 455

 Score =  178 bits (452), Expect = 5e-42,   Method: Compositional matrix adjust.
 Identities = 129/371 (34%), Positives = 184/371 (49%), Gaps = 33/371 (8%)

Query: 154 QGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD--PIFDPKTSSSYSP 211
            G+G Y   I +GTPP  F +++DTGS++ W QC PCT C+ +    P+  P  SS++S 
Sbjct: 86  NGAGAYNMNISLGTPPLDFPVIVDTGSNLIWAQCAPCTRCFPRPTPAPVLQPARSSTFSR 145

Query: 212 LPCAAPQCKSLDVSA----CRAN-RCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIAL 266
           LPC    C+ L  S+    C A   C Y   YG G +T G L TET++ G+ G+   +A 
Sbjct: 146 LPCNGSFCQYLPTSSRPRTCNATAACAYNYTYGSG-YTAGYLATETLTVGD-GTFPKVAF 203

Query: 267 GCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVD--RDSPASGVLEFNSARGG 324
           GC  +N      S+G++GLG G LSL  Q+     +YCL     D  AS +L  + A+  
Sbjct: 204 GCSTENG--VDNSSGIVGLGRGPLSLVSQLAVGRFSYCLRSDMADGGASPILFGSLAKLT 261

Query: 325 D---AVTAPLIRNKKVD--TFYYVGLTGFSVGGQAVQIPPSLFEMDEAG-DGGIIVDCGT 378
           +     + PL++N  +   T YYV LTG +V    + +  S F   + G  GG IVD GT
Sbjct: 262 ERSVVQSTPLLKNPYLQRSTHYYVNLTGIAVDSTELPVTGSTFGFTQTGLGGGTIVDSGT 321

Query: 379 AITRLQTQAYNSLRDSFVRLAGNLK---PTSGVAL-FDTCYDFS---GLRSVRVPTVSLH 431
            +T L    Y  ++ +F     NL    P SG     D CY  S   G ++VRVP ++L 
Sbjct: 322 TLTYLAKDGYAMVKQAFQSQMANLNQTTPASGAPYDLDLCYKPSAGGGGKAVRVPRLALR 381

Query: 432 FGAGKALDLPAKNYL--IPVDSAGTF---CFAFAPTSSAL--SIIGNVQQQGTRVSFDLA 484
           F  G   ++P +NY   +  DS G     C    P +  L  SIIGN+ Q    + +D+ 
Sbjct: 382 FAGGAKYNVPVQNYFAGVEADSQGRVTVACLLVLPATDDLPISIIGNLMQMDMHLLYDID 441

Query: 485 NNRVGFTPNKC 495
                F P  C
Sbjct: 442 GGMFSFAPADC 452


>gi|15232503|ref|NP_191008.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|7288018|emb|CAB81805.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
 gi|17979257|gb|AAL49945.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
 gi|21700851|gb|AAM70549.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
 gi|332645705|gb|AEE79226.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 425

 Score =  178 bits (452), Expect = 5e-42,   Method: Compositional matrix adjust.
 Identities = 130/362 (35%), Positives = 179/362 (49%), Gaps = 21/362 (5%)

Query: 145 STPVVSG-ASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDP 203
           S P+ SG A   S  Y  R  +GTP +   + LDT +D  W+ C  C  C   S  +FDP
Sbjct: 73  SVPIASGRAIVQSPTYIVRANIGTPAQPMLVALDTSNDAAWIPCSGCVGC--SSSVLFDP 130

Query: 204 KTSSSYSPLPCAAPQCKSLDVSACRANR-CLYQVAYGDGSFTVGDLVTETVSFGNSGSVK 262
             SSS   L C APQCK     +C  ++ C + + YG GS     L  +T++   S  + 
Sbjct: 131 SKSSSSRTLQCEAPQCKQAPNPSCTVSKSCGFNMTYG-GSTIEAYLTQDTLTLA-SDVIP 188

Query: 263 GIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQ---IKATSLAYCLVD-RDSPASGVLEF 318
               GC +   G  + + GL+GLG G LSL  Q   +  ++ +YCL + + S  SG L  
Sbjct: 189 NYTFGCINKASGTSLPAQGLMGLGRGPLSLISQSQNLYQSTFSYCLPNSKSSNFSGSLRL 248

Query: 319 NSARGGDAV-TAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCG 377
                   + T PL++N +  + YYV L G  VG + V IP S    D A   G I D G
Sbjct: 249 GPKNQPIRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDPATGAGTIFDSG 308

Query: 378 TAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKA 437
           T  TRL   AY ++R+ F R   N   TS +  FDTCY  SG  SV  P+V+  F AG  
Sbjct: 309 TVYTRLVEPAYVAVRNEFRRRVKNANATS-LGGFDTCY--SG--SVVFPSVTFMF-AGMN 362

Query: 438 LDLPAKNYLIPVDSAGTFCFAFAP----TSSALSIIGNVQQQGTRVSFDLANNRVGFTPN 493
           + LP  N LI   +    C A A      +S L++I ++QQQ  RV  D+ N+R+G +  
Sbjct: 363 VTLPPDNLLIHSSAGNLSCLAMAAAPVNVNSVLNVIASMQQQNHRVLIDVPNSRLGISRE 422

Query: 494 KC 495
            C
Sbjct: 423 TC 424


>gi|115472519|ref|NP_001059858.1| Os07g0533800 [Oryza sativa Japonica Group]
 gi|113611394|dbj|BAF21772.1| Os07g0533800 [Oryza sativa Japonica Group]
          Length = 458

 Score =  178 bits (452), Expect = 5e-42,   Method: Compositional matrix adjust.
 Identities = 132/369 (35%), Positives = 192/369 (52%), Gaps = 38/369 (10%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTE-CYQQSDPIFDPKTSSSYSPLPCA 215
           GEY   + +GTPP+ +  + DTGSD+ W QC PC E C++Q  P+++P +S ++  LPC+
Sbjct: 95  GEYIMTLAIGTPPQSYPAIADTGSDLVWTQCAPCGERCFKQPSPLYNPSSSPTFRVLPCS 154

Query: 216 APQCKSLDVSACRAN----------RCLYQVAYGDGSFTVGDLVTETVSFGNSGS----V 261
           +    +L++ A  A            C Y   YG G +T G   +ET +FG+S +    V
Sbjct: 155 S----ALNLCAAEARLAGATPPPGCACRYNQTYGTG-WTSGLQGSETFTFGSSPADQVRV 209

Query: 262 KGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVD-RDSPASGVLEFNS 320
            GIA GC + +   + GSAGL+GLG G LSL  Q+ A   +YCL   +D+ +   L    
Sbjct: 210 PGIAFGCSNASSDDWNGSAGLVGLGRGGLSLVSQLAAGMFSYCLTPFQDTKSKSTLLLGP 269

Query: 321 A------RGGDAVTAPLI---RNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGG 371
           A       G    + P +       + T+YY+ LTG SVG  A+ IPP  F +   G GG
Sbjct: 270 AAAAAALNGTGVRSTPFVPSPSKPPMSTYYYLNLTGISVGPAALPIPPGAFALRADGTGG 329

Query: 372 IIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSG--VALFDTCYDF--SGLRSVRVPT 427
           +I+D GT IT L   AY  +R + VR    L  T G      D C+    S      +P+
Sbjct: 330 LIIDSGTTITSLVDAAYKRVRAA-VRSLVKLPVTDGSNATGLDLCFALPSSSAPPATLPS 388

Query: 428 VSLHFGAGKALDLPAKNYLIPVDSAGTFCFAF-APTSSALSIIGNVQQQGTRVSFDLANN 486
           ++LHFG G  + LP +NY+I +D  G +C A  + T   LS +GN QQQ   + +D+   
Sbjct: 389 MTLHFGGGADMVLPVENYMI-LD-GGMWCLAMRSQTDGELSTLGNYQQQNLHILYDVQKE 446

Query: 487 RVGFTPNKC 495
            + F P KC
Sbjct: 447 TLSFAPAKC 455


>gi|21617933|gb|AAM66983.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
          Length = 425

 Score =  178 bits (452), Expect = 5e-42,   Method: Compositional matrix adjust.
 Identities = 130/362 (35%), Positives = 179/362 (49%), Gaps = 21/362 (5%)

Query: 145 STPVVSG-ASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDP 203
           S P+ SG A   S  Y  R  +GTP +   + LDT +D  W+ C  C  C   S  +FDP
Sbjct: 73  SVPIASGRAIVQSPTYIVRANIGTPAQPMLVALDTSNDAAWIPCSGCVGC--SSSVLFDP 130

Query: 204 KTSSSYSPLPCAAPQCKSLDVSACRANR-CLYQVAYGDGSFTVGDLVTETVSFGNSGSVK 262
             SSS   L C APQCK     +C  ++ C + + YG GS     L  +T++   S  + 
Sbjct: 131 SKSSSSRTLQCEAPQCKQAPNPSCTVSKSCGFNMTYG-GSTIEAYLTQDTLTLA-SDVIP 188

Query: 263 GIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQ---IKATSLAYCLVD-RDSPASGVLEF 318
               GC +   G  + + GL+GLG G LSL  Q   +  ++ +YCL + + S  SG L  
Sbjct: 189 NYTFGCINKASGTSLPAQGLMGLGRGPLSLISQSQNLYQSTFSYCLPNSKSSNFSGSLRL 248

Query: 319 NSARGGDAV-TAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCG 377
                   + T PL++N +  + YYV L G  VG + V IP S    D A   G I D G
Sbjct: 249 GPKNQPIRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDPATGAGTIFDSG 308

Query: 378 TAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKA 437
           T  TRL   AY ++R+ F R   N   TS +  FDTCY  SG  SV  P+V+  F AG  
Sbjct: 309 TVYTRLVEPAYVAVRNEFRRRVKNANATS-LGGFDTCY--SG--SVVFPSVTFMF-AGMN 362

Query: 438 LDLPAKNYLIPVDSAGTFCFAFAP----TSSALSIIGNVQQQGTRVSFDLANNRVGFTPN 493
           + LP  N LI   +    C A A      +S L++I ++QQQ  RV  D+ N+R+G +  
Sbjct: 363 VTLPPDNLLIHSSAGNLSCLAMAAAPVNVNSVLNVIASMQQQNHRVLIDVPNSRLGISRE 422

Query: 494 KC 495
            C
Sbjct: 423 TC 424


>gi|238010910|gb|ACR36490.1| unknown [Zea mays]
 gi|413942664|gb|AFW75313.1| hypothetical protein ZEAMMB73_520329 [Zea mays]
          Length = 481

 Score =  178 bits (452), Expect = 5e-42,   Method: Compositional matrix adjust.
 Identities = 132/414 (31%), Positives = 200/414 (48%), Gaps = 57/414 (13%)

Query: 121 QLAIYNVDRH-ELKPAEAQILP---EDFST--------------PVVSGASQGSGEYFSR 162
           QL +  ++R     P +++++P   EDF T              P+ S A Q      S 
Sbjct: 86  QLRVDGIERRLSDNPHDSKLVPAGGEDFQTNGNLLQVNYGNSGQPMSSEAQQSGVVNASA 145

Query: 163 IGVGT----PPRQFSMVLDTGSDINWLQCRPCT--ECYQQSDPIFDPKTSSSYSPLPCAA 216
            G G+    P    ++VLD+ SD+ W+QC PC    C+ Q D  +DP  S S +P  C++
Sbjct: 146 AGGGSRSKLPGVIQTVVLDSASDVPWVQCVPCPIPPCHPQVDSFYDPSRSPSSAPFSCSS 205

Query: 217 PQCKSLD--VSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEG 274
           P C +L    + C  N+C Y V Y DGS T G  + + ++     +V G   GC H  +G
Sbjct: 206 PTCTALGPYANGCANNQCQYLVRYPDGSSTSGAYIADLLTLDAGNAVSGFKFGCSHAEQG 265

Query: 275 LF-VGSAGLLGLGGG---MLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGDA--VT 328
            F   +AG++ LGGG   +LS T      + +YC+    S  SG       R   +  V 
Sbjct: 266 SFDARAAGIMALGGGPESLLSQTASRYGNAFSYCIPATAS-DSGFFTLGVPRRASSRYVV 324

Query: 329 APLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAY 388
            P++R ++  TFY V L   +VGGQ + + P++F        G ++D  TAITRL   AY
Sbjct: 325 TPMVRFRQAATFYGVLLRTITVGGQRLGVAPAVFA------AGSVLDSRTAITRLPPTAY 378

Query: 389 NSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIP 448
            +LR +F       +        DTCYDF+G+ ++R+P +SL F          +N ++P
Sbjct: 379 QALRSAFRSSMTMYRSAPPKGYLDTCYDFTGVVNIRLPKISLVFD---------RNAVLP 429

Query: 449 VDSAGTF---CFAFAPTSSA----LSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           +D +G     C AF  TS+A      ++G+VQQQ   V +D+    VGF    C
Sbjct: 430 LDPSGILFNDCLAF--TSNADDRMPGVLGSVQQQTIEVLYDVGGGAVGFRQGAC 481


>gi|125529158|gb|EAY77272.1| hypothetical protein OsI_05246 [Oryza sativa Indica Group]
          Length = 426

 Score =  178 bits (451), Expect = 6e-42,   Method: Compositional matrix adjust.
 Identities = 120/352 (34%), Positives = 173/352 (49%), Gaps = 23/352 (6%)

Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAP 217
            Y +R G+GTP +   + +D  +D  W+ C  C  C   S P F P  SS+Y  +PC +P
Sbjct: 82  NYIARAGLGTPAQTLLVAIDPSNDAAWVPCSACAGC-AASSPSFSPTQSSTYRTVPCGSP 140

Query: 218 QCKSLDVSACRA---NRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEG 274
           QC  +   +C A   + C + + Y   +F    L  ++++  N+  V     GC     G
Sbjct: 141 QCAQVPSPSCPAGVGSSCGFNLTYAASTFQ-AVLGQDSLALENN-VVVSYTFGCLRVVSG 198

Query: 275 LFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVD-RDSPASGVLEFNSARGGDAV-TA 329
             V   GL+G G G LS   Q K T     +YCL + R S  SG L+         + T 
Sbjct: 199 NSVPPQGLIGFGRGPLSFLSQTKDTYGSVFSYCLPNYRSSNFSGTLKLGPIGQPKRIKTT 258

Query: 330 PLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYN 389
           PL+ N    + YYV + G  VG + VQ+P S    +     G I+D GT  TRL    Y 
Sbjct: 259 PLLYNPHRPSLYYVNMIGIRVGSKVVQVPQSALAFNPVTGSGTIIDAGTMFTRLAAPVYA 318

Query: 390 SLRDSFV-RLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIP 448
           ++RD+F  R+   + P  G   FDTCY+     +V VPTV+  F    A+ LP +N +I 
Sbjct: 319 AVRDAFRGRVRTPVAPPLGG--FDTCYNV----TVSVPTVTFMFAGAVAVTLPEENVMIH 372

Query: 449 VDSAGTFCFAFAP-----TSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
             S G  C A A       ++AL+++ ++QQQ  RV FD+AN RVGF+   C
Sbjct: 373 SSSGGVACLAMAAGPSDGVNAALNVLASMQQQNQRVLFDVANGRVGFSRELC 424


>gi|357514989|ref|XP_003627783.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355521805|gb|AET02259.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 416

 Score =  178 bits (451), Expect = 6e-42,   Method: Compositional matrix adjust.
 Identities = 122/354 (34%), Positives = 168/354 (47%), Gaps = 40/354 (11%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAA 216
           GEY     VGTPP +   + DTGSDI WLQC PC ECY Q+ P F P  SS+Y  +PC++
Sbjct: 85  GEYLMTYSVGTPPFKLYGIADTGSDIVWLQCEPCKECYNQTTPKFKPSKSSTYKNIPCSS 144

Query: 217 PQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLF 276
             CKS                   G+ +V  L  E+ S G+  S     +GCG DN   F
Sbjct: 145 DLCKSGQ----------------QGNLSVDTLTLES-STGHPISFPKTVIGCGTDNTVSF 187

Query: 277 VG-SAGLLGLGGGMLSLTKQIKAT---SLAYCLVDR--DSPASGVLEF--NSARGGDAVT 328
            G S+G++GLGGG  SL  Q+ ++     +YCL+    +S  +  L F   +   GD V 
Sbjct: 188 EGASSGIVGLGGGPASLITQLGSSIDAKFSYCLLPNPVESNTTSKLNFGDTAVVSGDGVV 247

Query: 329 APLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAG--DGGIIVDCGTAITRLQTQ 386
           +  I  K    FYY+ L  FSVG + ++     FE    G  +G II+D GT +T + T 
Sbjct: 248 STPIVKKDPIVFYYLTLEAFSVGNKRIE-----FEGSSNGGHEGNIIIDSGTTLTVIPTD 302

Query: 387 AYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYL 446
            YN+L  + + L    +      LF+ CY  +       P ++ HF        P   ++
Sbjct: 303 VYNNLESAVLELVKLKRVNDPTRLFNLCYSVTS-DGYDFPIITTHFKGADVKLHPISTFV 361

Query: 447 IPVDSAGTFCFAFAPTSS-----ALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
              D  G  C AFA TS+      +SI GN+ QQ   V +DL    V F P  C
Sbjct: 362 DVAD--GIVCLAFATTSAFIPSDVVSIFGNLAQQNLLVGYDLQQKIVSFKPTDC 413


>gi|326495450|dbj|BAJ85821.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 491

 Score =  178 bits (451), Expect = 7e-42,   Method: Compositional matrix adjust.
 Identities = 120/346 (34%), Positives = 172/346 (49%), Gaps = 37/346 (10%)

Query: 173 SMVLDTGSDINWLQCRPC--TECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRAN 230
           +M +DT  D+ W+QC PC   +CY Q +  FDP+ SS+ +P+ C +  C++L   A   +
Sbjct: 160 TMAIDTTEDVPWIQCLPCLIPQCYPQRNAFFDPRRSSTGAPVRCGSRACRTLGGYANGCS 219

Query: 231 R------CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSA-GLL 283
           +      CLY++ Y D   T+G  +T+T++   S +      GC H   G F   A G +
Sbjct: 220 KPNSTGDCLYRIEYSDHRLTLGTYMTDTLTISPSTTFLNFRFGCSHAVRGKFSAQASGTM 279

Query: 284 GLGGG---MLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGDA-------VTAPLIR 333
            LGGG   +LS T +    + +YC+      A+G L       GD         T PL+R
Sbjct: 280 SLGGGPQSLLSQTARAYGNAFSYCVPGPS--AAGFLSIGGPVNGDDGGGSGAFATTPLVR 337

Query: 334 NKKV--DTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSL 391
           +  V   T Y V L G  V G+ + +PP +F       GG ++D    IT+L   AY +L
Sbjct: 338 SANVINPTIYVVRLQGIEVAGRRLNVPPVVFS------GGTVMDSSAVITQLPPTAYRAL 391

Query: 392 RDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDS 451
           R +F       K  +     DTC+DF G+  V VPTVSL F  G  ++L   + L+  DS
Sbjct: 392 RLAFRNAMRAYKTRAPTGNLDTCFDFVGVSKVTVPTVSLVFDGGAVIELGLLSVLL--DS 449

Query: 452 AGTFCFAFAPTSS--ALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
               C AFAP ++  AL  IGNVQQQ   V +D+A   VGF    C
Sbjct: 450 ----CLAFAPMAADFALGFIGNVQQQTHEVLYDVAGGAVGFRHGAC 491


>gi|22831049|dbj|BAC15912.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|50508281|dbj|BAD32130.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
          Length = 453

 Score =  177 bits (450), Expect = 7e-42,   Method: Compositional matrix adjust.
 Identities = 132/369 (35%), Positives = 193/369 (52%), Gaps = 38/369 (10%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTE-CYQQSDPIFDPKTSSSYSPLPCA 215
           GEY   + +GTPP+ +  + DTGSD+ W QC PC E C++Q  P+++P +S ++  LPC+
Sbjct: 90  GEYIMTLAIGTPPQSYPAIADTGSDLVWTQCAPCGERCFKQPSPLYNPSSSPTFRVLPCS 149

Query: 216 APQCKSLDVSACRAN----------RCLYQVAYGDGSFTVGDLVTETVSFGNSGS----V 261
           +    +L++ A  A            C Y   YG G +T G   +ET +FG+S +    V
Sbjct: 150 S----ALNLCAAEARLAGATPPPGCACRYNQTYGTG-WTSGLQGSETFTFGSSPADQVRV 204

Query: 262 KGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVD-RDSPASGVLEFNS 320
            GIA GC + +   + GSAGL+GLG G LSL  Q+ A   +YCL   +D+ +   L    
Sbjct: 205 PGIAFGCSNASSDDWNGSAGLVGLGRGGLSLVSQLAAGMFSYCLTPFQDTKSKSTLLLGP 264

Query: 321 ARGGDAVTAPLIRN---------KKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGG 371
           A    A+    +R+           + T+YY+ LTG SVG  A+ IPP  F +   G GG
Sbjct: 265 AAAAAALNGTGVRSTPFVPSPSKPPMSTYYYLNLTGISVGPAALPIPPGAFALRADGTGG 324

Query: 372 IIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSG--VALFDTCYDF--SGLRSVRVPT 427
           +I+D GT IT L   AY  +R + VR    L  T G      D C+    S      +P+
Sbjct: 325 LIIDSGTTITSLVDAAYKRVRAA-VRSLVKLPVTDGSNATGLDLCFALPSSSAPPATLPS 383

Query: 428 VSLHFGAGKALDLPAKNYLIPVDSAGTFCFAF-APTSSALSIIGNVQQQGTRVSFDLANN 486
           ++LHFG G  + LP +NY+I +D  G +C A  + T   LS +GN QQQ   + +D+   
Sbjct: 384 MTLHFGGGADMVLPVENYMI-LD-GGMWCLAMRSQTDGELSTLGNYQQQNLHILYDVQKE 441

Query: 487 RVGFTPNKC 495
            + F P KC
Sbjct: 442 TLSFAPAKC 450


>gi|242092902|ref|XP_002436941.1| hypothetical protein SORBIDRAFT_10g011760 [Sorghum bicolor]
 gi|241915164|gb|EER88308.1| hypothetical protein SORBIDRAFT_10g011760 [Sorghum bicolor]
          Length = 445

 Score =  177 bits (450), Expect = 8e-42,   Method: Compositional matrix adjust.
 Identities = 131/365 (35%), Positives = 183/365 (50%), Gaps = 25/365 (6%)

Query: 142 EDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCT--ECYQQSDP 199
           +  S P   G S  S EY + +  GTP     +V+DTGSD+ WLQC+PC+  +C  Q DP
Sbjct: 95  KKVSVPAHLGTSVKSLEYVATVSFGTPAVPQVVVIDTGSDLTWLQCKPCSSGQCSPQKDP 154

Query: 200 IFDPKTSSSYSPLPCAAPQCKSLDV----SACRANR-CLYQVAYGDGSFTVGDLVTETVS 254
           +FDP  SS+YS +PCA+ +CK L      S C   + C + ++Y DG+ TVG    + ++
Sbjct: 155 LFDPSHSSTYSAVPCASGECKKLAADAYGSGCSNGQPCGFAISYVDGTSTVGVYGKDKLT 214

Query: 255 FGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQI-KATSLAYCLVDRDSPAS 313
                 VK    GCGH    L     GLLGLG    SL  Q       +YCL   +S   
Sbjct: 215 LAPGAIVKDFYFGCGHSKSSLPGLFDGLLGLGRLSESLGAQYGGGGGFSYCLPAVNS-KP 273

Query: 314 GVLEFNSARGGDA-VTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGI 372
           G L F + R     V  P+ R     TF  V L G +VGG+ + + PS F       GG+
Sbjct: 274 GFLAFGAGRNPSGFVFTPMGRVPGQPTFSTVTLAGITVGGKKLDLRPSAFS------GGM 327

Query: 373 IVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHF 432
           IVD GT +T LQ+  Y +LR +F       +   G    DTCYD +G ++V VP ++L F
Sbjct: 328 IVDSGTVVTVLQSTVYRALRAAFREAMKAYRLVHGD--LDTCYDLTGYKNVVVPKIALTF 385

Query: 433 GAGKALDLPAKNYLIPVDSAGTFCFAFAPTSS--ALSIIGNVQQQGTRVSFDLANNRVGF 490
             G  ++L   N ++ V+     C AFA T       ++GNV Q+   V FD + ++ GF
Sbjct: 386 SGGATINLDVPNGIL-VNG----CLAFAETGKDGTAGVLGNVNQRTFEVLFDTSASKFGF 440

Query: 491 TPNKC 495
               C
Sbjct: 441 RAKAC 445


>gi|125561849|gb|EAZ07297.1| hypothetical protein OsI_29545 [Oryza sativa Indica Group]
          Length = 451

 Score =  177 bits (450), Expect = 8e-42,   Method: Compositional matrix adjust.
 Identities = 112/356 (31%), Positives = 180/356 (50%), Gaps = 25/356 (7%)

Query: 163 IGVGTPPRQFSMVLDTGSDINWLQCR----PCTECYQQSDPIFDPKTSSSYSPLPCAAPQ 218
           +G+GTPP+   +++DTGSD+ W QC+            S P++DP  SS+++ LPC+   
Sbjct: 95  VGIGTPPQPRKLIVDTGSDLIWTQCKLSSSTAVAARHGSPPVYDPGESSTFAFLPCSDRL 154

Query: 219 CKSLDVS---ACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVK-GIALGCGHDNEG 274
           C+    S       NRC+Y+  YG  +  VG L +ET +FG   +V   +  GCG  + G
Sbjct: 155 CQEGQFSFKNCTSKNRCVYEDVYGSAA-AVGVLASETFTFGARRAVSLRLGFGCGALSAG 213

Query: 275 LFVGSAGLLGLGGGMLSLTKQIKATSLAYCLV----DRDSPA--SGVLEFNSARGGDAV- 327
             +G+ G+LGL    LSL  Q+K    +YCL      + SP     + + +  +    + 
Sbjct: 214 SLIGATGILGLSPESLSLITQLKIQRFSYCLTPFADKKTSPLLFGAMADLSRHKTTRPIQ 273

Query: 328 TAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQA 387
           T  ++ N     +YYV L G S+G + + +P +   M   G GG IVD G+ +  L   A
Sbjct: 274 TTAIVSNPVKTVYYYVPLVGISLGHKRLAVPAASLAMRPDGGGGTIVDSGSTVAYLVEAA 333

Query: 388 YNSLRDSFVRLAGNLKPTSGVALFDTCYDF------SGLRSVRVPTVSLHFGAGKALDLP 441
           + +++++ + +         V  ++ C+        + + +V+VP + LHF  G A+ LP
Sbjct: 334 FEAVKEAVMDVVRLPVANRTVEDYELCFVLPRRTAAAAMEAVQVPPLVLHFDGGAAMVLP 393

Query: 442 AKNYLIPVDSAGTFCFAFAPTS--SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
             NY      AG  C A   T+  S +SIIGNVQQQ   V FD+ +++  F P +C
Sbjct: 394 RDNYFQE-PRAGLMCLAVGKTTDGSGVSIIGNVQQQNMHVLFDVQHHKFSFAPTQC 448


>gi|357118871|ref|XP_003561171.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 506

 Score =  177 bits (450), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 126/347 (36%), Positives = 176/347 (50%), Gaps = 39/347 (11%)

Query: 173 SMVLDTGSDINWLQCRPCTE--CYQQSDPIFDPKTSSSYSPLPCAAPQCKSLD------V 224
           SMV+DT SD+ W+QC PC +  CY QSD ++DP  S   +P PC++PQC+SL        
Sbjct: 175 SMVVDTASDVPWVQCAPCPQPQCYAQSDVLYDPTKSILSAPFPCSSPQCRSLGRYANGCT 234

Query: 225 SACRANRCLYQVAYGDGSFTVGDLVTE--TVSFGNSGSVKGIALGCGHD--NEGLFVG-S 279
            A     C Y+V Y DGS T G  V++  T++    G+V     GC H     G F   +
Sbjct: 235 GAGNTGTCQYRVLYPDGSGTSGTYVSDLLTLNADPKGAVSKFQFGCSHALLRPGSFNNKT 294

Query: 280 AGLLGLGGGMLSLTKQIKAT-----SLAYCLVDRDSPAS----GVLEFNSARGGDAVTAP 330
           AG + LG G  SL+ Q K T       +YCL    S       GV +  ++R   AVT P
Sbjct: 295 AGFMALGRGAQSLSSQTKGTFSKGNVFSYCLPPTGSHKGFLSLGVPQHAASR--YAVT-P 351

Query: 331 LIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNS 390
           ++++K     Y V L G  V GQ + +PP++F  + A      +D  T ITRL   AY +
Sbjct: 352 MLKSKMAPMIYMVRLIGIDVAGQRLPVPPAVFAANAA------MDSRTIITRLPPTAYMA 405

Query: 391 LRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVD 450
           LR +F       +  +     DTCYDF+G+  VR+P V+L F    A++L     ++  D
Sbjct: 406 LRAAFRAQMRAYRAVAPKGQLDTCYDFTGVPMVRLPKVTLVFDRNAAVELDPSGVML--D 463

Query: 451 SAGTFCFAFAPTSSAL--SIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           S    C AFAP ++     IIGNVQQQ   V +++    VGF    C
Sbjct: 464 S----CLAFAPNANDFMPGIIGNVQQQTLEVLYNVDGASVGFRRAAC 506


>gi|297820186|ref|XP_002877976.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297323814|gb|EFH54235.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 425

 Score =  177 bits (450), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 130/362 (35%), Positives = 180/362 (49%), Gaps = 21/362 (5%)

Query: 145 STPVVSGAS-QGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDP 203
           S P+ SG     S  Y  R  +GTP +   + LDT +D  W+ C  C  C   S  +FDP
Sbjct: 73  SVPIASGRGIVQSPTYIVRANIGTPAQAMLVALDTSNDAAWIPCSGCVGC--SSSVLFDP 130

Query: 204 KTSSSYSPLPCAAPQCKSLDVSACRANR-CLYQVAYGDGSFTVGDLVTETVSFGNSGSVK 262
             SSS   L C APQCK     +C  ++ C + + YG GS     L  +T++   +  + 
Sbjct: 131 SKSSSSRTLQCEAPQCKQAPNPSCTVSKSCGFNMTYG-GSAIEAYLTQDTLTLA-TDVIP 188

Query: 263 GIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQ---IKATSLAYCLVD-RDSPASGVLEF 318
               GC +   G  + + GL+GLG G LSL  Q   +  ++ +YCL + + S  SG L  
Sbjct: 189 NYTFGCINKASGTSLPAQGLMGLGRGPLSLISQSQNLYQSTFSYCLPNSKSSNFSGSLRL 248

Query: 319 NSARGGDAV-TAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCG 377
                   + T PL++N +  + YYV L G  VG + V IP S    D A   G I D G
Sbjct: 249 GPKNQPIRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDPATGAGTIFDSG 308

Query: 378 TAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKA 437
           T  TRL   AY ++R+ F R   N   TS +  FDTCY  SG  SV  P+V+  F AG  
Sbjct: 309 TVYTRLVEPAYVAMRNEFRRRVKNANATS-LGGFDTCY--SG--SVVFPSVTFMF-AGMN 362

Query: 438 LDLPAKNYLIPVDSAGTFCFAF--APT--SSALSIIGNVQQQGTRVSFDLANNRVGFTPN 493
           + LP  N LI   +    C A   APT  +S L++I ++QQQ  RV  D+ N+R+G +  
Sbjct: 363 VTLPPDNLLIHSSAGNLSCLAMAAAPTNVNSVLNVIASMQQQNHRVLIDVPNSRLGISRE 422

Query: 494 KC 495
            C
Sbjct: 423 TC 424


>gi|125553832|gb|EAY99437.1| hypothetical protein OsI_21406 [Oryza sativa Indica Group]
          Length = 409

 Score =  177 bits (449), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 110/340 (32%), Positives = 175/340 (51%), Gaps = 29/340 (8%)

Query: 173 SMVLDTGSDINWLQCRPCT--ECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDV--SACR 228
           ++++D+GSD+ W+QC+PC    C+ Q DP+FDP TS++Y+ +PC++  C  L      C 
Sbjct: 82  TVIIDSGSDVPWVQCQPCPLLVCHPQRDPLFDPATSTTYAAVPCSSAACARLGPYRRGCL 141

Query: 229 AN-RCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEG--LFVGSAGLLGL 285
           AN +C + + Y +G+   G   ++ ++ G    V+G   GC H ++G       AG L L
Sbjct: 142 ANSQCQFGITYANGATATGTYSSDDLTLGPYDVVRGFLFGCAHADQGSTFSYDVAGTLAL 201

Query: 286 GGGMLSLTKQIKAT---SLAYCLVDRDSPAS----GVLEFNSARGGDAVTAPLIRNKKVD 338
           GGG  S  +Q  +      +YC+    S       GV    +A     V+ PL+ +  + 
Sbjct: 202 GGGSQSFVQQTASQYSRVFSYCVPPSTSSFGFIMFGVPPQRAALVPTFVSTPLLSSSTMS 261

Query: 339 -TFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVR 397
            TFY V L    V G+ + +PP++F          ++D  T I+R+   AY +LR +F  
Sbjct: 262 PTFYRVLLRSIIVAGRPLPVPPTVFSASS------VIDSATVISRIPPTAYQALRAAFRS 315

Query: 398 LAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCF 457
                +P   V++ DTCYDFSG+RS+ +P+++L F  G  ++L A   L+        C 
Sbjct: 316 AMTMYRPAPPVSILDTCYDFSGVRSITLPSIALVFDGGATVNLDAAGILL------QGCL 369

Query: 458 AFAPTSS--ALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           AFAPT+S      IGNVQQ+   V +D+    + F    C
Sbjct: 370 AFAPTASDRMPGFIGNVQQRTLEVVYDVPGKAIRFRSAAC 409


>gi|317106730|dbj|BAJ53226.1| JHL06P13.6 [Jatropha curcas]
          Length = 445

 Score =  177 bits (449), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 127/359 (35%), Positives = 189/359 (52%), Gaps = 25/359 (6%)

Query: 155 GSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPC 214
           G GEYF RI +GTPP +  ++ DTGSD+ W+QC+PC ECY+Q  PIF+PK SS+Y  + C
Sbjct: 90  GGGEYFMRISIGTPPIEVLVIADTGSDLIWVQCQPCQECYKQKSPIFNPKQSSTYRRVLC 149

Query: 215 AAPQCKSL--DVSACRAN----RCLYQVAYGDGSFTVGDLVTETVSFGNS-GSVKGIALG 267
               C +L  D+ AC A+     C Y  +YGD SFT+G L TE    G++  S++ +A G
Sbjct: 150 ETRYCNALNSDMRACSAHGFFKACGYSYSYGDHSFTMGYLATERFIIGSTNNSIQELAFG 209

Query: 268 CGHDNEGLF-VGSAGLLGLGGGMLSLTKQIKA---TSLAYCLV---DRDSPASGVLEF-- 318
           CG+ N G F    +G++GLGGG LSL  Q+        +YCLV   ++ + + G + F  
Sbjct: 210 CGNSNGGNFDEVGSGIVGLGGGSLSLISQLGTKIDNKFSYCLVPILEKSNFSLGKIVFGD 269

Query: 319 NSARGGD--AVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDC 376
           NS   G    V+ PL+ +K+ +TFYY+ L   SVG + +    S  + +    G II+D 
Sbjct: 270 NSFISGSDTYVSTPLV-SKEPETFYYLTLEAISVGNERLAYENSRNDGN-VEKGNIIIDS 327

Query: 377 GTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGK 436
           GT +T L ++ YN L     +     + +    +F  C  F     + +P +++HF    
Sbjct: 328 GTTLTFLDSKLYNKLELVLEKAVEGERVSDPNGIFSIC--FRDKIGIELPIITVHFTDAD 385

Query: 437 ALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
               P   +    +     CF   P S+ ++I GN+ Q    V +DL  N V F P  C
Sbjct: 386 VELKPINTFAKAEEDL--LCFTMIP-SNGIAIFGNLAQMNFLVGYDLDKNCVSFMPTDC 441


>gi|449454654|ref|XP_004145069.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
 gi|449470632|ref|XP_004153020.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
 gi|449499016|ref|XP_004160697.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 434

 Score =  177 bits (449), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 119/356 (33%), Positives = 174/356 (48%), Gaps = 22/356 (6%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAA 216
           GEY   I VGTPP     V DTGSD+ W QC+PC+ CYQQ+ P+FDP  S++Y  + C++
Sbjct: 81  GEYLVEISVGTPPFSIVAVADTGSDVIWTQCKPCSNCYQQNAPMFDPSKSTTYKNVACSS 140

Query: 217 PQCK-SLDVSACRAN-RCLYQVAYGDGSFTVGDLVTETVSF----GNSGSVKGIALGCGH 270
           P C  S D S+C  +  CLY +AYGD S + G+L  +TV+     G   +     +GCGH
Sbjct: 141 PVCSYSGDGSSCSDDSECLYSIAYGDDSHSQGNLAVDTVTMQSTSGRPVAFPRTVIGCGH 200

Query: 271 DNEGLFVGS-AGLLGLGGGMLSLTKQIKATS---LAYCLVDRDSPA---SGVLEFNS--- 320
           DN G F  + +G++GLG G  SL  Q+   +    +YCL+   + +   S  L F S   
Sbjct: 201 DNAGTFNANVSGIVGLGRGPASLVTQLGPATGGKFSYCLIPIGTGSTNDSTKLNFGSNAN 260

Query: 321 ARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAI 380
             G   V+ P+  + +  TFY + L   SVG      P    ++   G+  II+D GT +
Sbjct: 261 VSGSGTVSTPIYSSAQYKTFYSLKLEAVSVGDTKFNFPEGASKL--GGESNIIIDSGTTL 318

Query: 381 TRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDL 440
           T L +   NS   +  +              D C+  +      +P V++HF  G  + L
Sbjct: 319 TYLPSALLNSFGSAISQSMSLPHAQDPSEFLDYCFA-TTTDDYEMPPVTMHF-EGADVPL 376

Query: 441 PAKNYLIPVDSAGTFCFAFAP-TSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
             +N  + + S  T C AF       + I GN+ Q    V +D+ N  V F P  C
Sbjct: 377 QRENLFVRL-SDDTICLAFGSFPDDNIFIYGNIAQSNFLVGYDIKNLAVSFQPAHC 431


>gi|224126551|ref|XP_002329582.1| predicted protein [Populus trichocarpa]
 gi|222870291|gb|EEF07422.1| predicted protein [Populus trichocarpa]
          Length = 440

 Score =  177 bits (448), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 134/414 (32%), Positives = 202/414 (48%), Gaps = 36/414 (8%)

Query: 106 LERDSAR---VNTLITKLQLAIYNVDR-----HELKPAEAQILPEDFSTPVVSGASQGSG 157
           + RDS +    N+  T LQ     + R     H  +   A + P++  + +++      G
Sbjct: 36  VHRDSPKSPLYNSQQTHLQRWNKAMRRSVSRVHHFQRTAATVSPKEVESEIIANG----G 91

Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAP 217
           EY   + +GTPP +   + DTGSD+ W QC PC +CY+Q  P+FDPK+S +Y  L C   
Sbjct: 92  EYLMSLSLGTPPFEILAIADTGSDLIWTQCTPCDKCYKQIAPLFDPKSSKTYRDLSCDTR 151

Query: 218 QCKSL-DVSACRANR-CLYQVAYGDGSFTVGDLVTETVSF--GNSGSV--KGIALGCGHD 271
           QC++L + S+C + + C Y   YGD SFT G+L  +TV+    N G V      +GCG  
Sbjct: 152 QCQNLGESSSCSSEQLCQYSYYYGDRSFTNGNLAVDTVTLPSTNGGPVYFPKTVIGCGRR 211

Query: 272 NEGLF-VGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASG---VLEF--NSAR 322
           N G F    +G++GLGGG +SL  Q+ ++     +YCLV   S ++G    L F  N+  
Sbjct: 212 NNGTFDKKDSGIIGLGGGPMSLISQMGSSVGGKFSYCLVPFSSESAGNSSKLHFGRNAVV 271

Query: 323 GGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITR 382
            G  V +  + +K  DTFYY+ L   SVG + ++   S F   E     II+D GT++T 
Sbjct: 272 SGSGVQSTPLISKNPDTFYYLTLEAMSVGDKKIEFGGSSFGGSEG---NIIIDSGTSLTL 328

Query: 383 LQTQAYNSLRDSFVRLAGNLKPTSGVA-LFDTCYDFSGLRSVRVPTVSLHFGAGKALDLP 441
                +     +      N + T   + L   CY  +    ++VP ++ HF  G  + L 
Sbjct: 329 FPVNFFTEFATAVENAVINGERTQDASGLLSHCYRPT--PDLKVPVITAHFN-GADVVLQ 385

Query: 442 AKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
             N  I + S    C AF  T S  +I GNV Q    + +D+    V F P  C
Sbjct: 386 TLNTFILI-SDDVLCLAFNSTQSG-AIFGNVAQMNFLIGYDIQGKSVSFKPTDC 437


>gi|218192707|gb|EEC75134.1| hypothetical protein OsI_11325 [Oryza sativa Indica Group]
          Length = 401

 Score =  177 bits (448), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 118/307 (38%), Positives = 169/307 (55%), Gaps = 24/307 (7%)

Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAP 217
           EY   + +GTPP+   + LDTGSD+ W QC+PC  C+ Q+ P FDP TSS+ S   C + 
Sbjct: 81  EYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTSCDST 140

Query: 218 QCKSLDVSACRANR------CLYQVAYGDGSFTVGDLVTETVSF-GNSGSVKGIALGCGH 270
            C+ L V++C + +      C+Y  +YGD S T G L  +  +F G   SV G+A GCG 
Sbjct: 141 LCQGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTFVGAGASVPGVAFGCGL 200

Query: 271 DNEGLFVGS-AGLLGLGGGMLSLTKQIKATSLAYCL--VDRDSPASGVLE-----FNSAR 322
            N G+F  +  G+ G G G LSL  Q+K  + ++C   V+   P++ +L+     + S R
Sbjct: 201 FNNGVFKSNETGIAGFGRGPLSLPSQLKVGNFSHCFTAVNGLKPSTVLLDLPADLYKSGR 260

Query: 323 GGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITR 382
           G    T PLI+N    TFYY+ L G +VG   + +P S F +   G GG I+D GTA+T 
Sbjct: 261 GAVQST-PLIQNPANPTFYYLSLKGITVGSTRLPVPESEFALKN-GTGGTIIDSGTAMTS 318

Query: 383 LQTQAYNSLRDSFVRLAGNLK-PTSGVALFDTCYDFSG-LRSV-RVPTVSLHFGAGKALD 439
           L T+ Y  +RD+F   A  +K P       D  +  S  LR+   VP + LHF  G  +D
Sbjct: 319 LPTRVYRLVRDAF---AAQVKLPVVSGNTTDPYFCLSAPLRAKPYVPKLVLHF-EGATMD 374

Query: 440 LPAKNYL 446
           LP +NY+
Sbjct: 375 LPRENYV 381


>gi|357143328|ref|XP_003572882.1| PREDICTED: uncharacterized protein LOC100846829 [Brachypodium
           distachyon]
          Length = 836

 Score =  176 bits (447), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 144/370 (38%), Positives = 192/370 (51%), Gaps = 32/370 (8%)

Query: 142 EDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQ--QSDP 199
           +  + P   G S G+ +Y   + +GTP    ++ +DTGSD++W+QC PC       Q D 
Sbjct: 483 KSVTIPANIGHSIGTLQYVVTVSLGTPGVAQTVEVDTGSDVSWVQCAPCAAPACYAQKDQ 542

Query: 200 IFDPKTSSSYSPLPCAAPQCKSLDV--SACRA-NRCLYQVAYGDGSFTVGDLVTETVSFG 256
           +FDP  SSSYS +PCAA  C  L      C A ++C Y V+YGDGS T G   ++T++  
Sbjct: 543 LFDPAKSSSYSAVPCAADACSELSTYGHGCAAGSQCGYVVSYGDGSNTTGVYGSDTLTLT 602

Query: 257 NSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS----LAYCLVDRDSPA 312
           ++ +V G   GCGH   GLF G  GLL LG   +SLT Q          +YCL    S +
Sbjct: 603 DADAVTGFLFGCGHAQAGLFAGIDGLLALGRKGMSLTSQTSGAYGGGVFSYCLPPSPS-S 661

Query: 313 SGVLEFN--SARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQ-IPPSLFEMDEAGD 369
           +G L     S+  G A T  L+    V TFY V LTG  VGGQ +  +P S F       
Sbjct: 662 TGFLTLGGPSSASGFATTG-LLTAWDVPTFYMVMLTGIGVGGQQLSGVPASAFA------ 714

Query: 370 GGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNL----KPTSGVALFDTCYDFSGLRSVRV 425
           GG +VD GT ITRL   AY +LR +F            P +G+   DTCY+F+   +V +
Sbjct: 715 GGTVVDTGTVITRLPPTAYAALRAAFRAAMAPYGYPAAPATGI--LDTCYNFTDYGTVTL 772

Query: 426 PTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLAN 485
           PTVSL F  G  L L A  +L    S+G   FA        +I+GNVQQ+   V FD   
Sbjct: 773 PTVSLTFSGGATLKLDAPGFL----SSGCLAFATNSGDGDPAILGNVQQRSFAVRFD--G 826

Query: 486 NRVGFTPNKC 495
           + VGF P+ C
Sbjct: 827 SSVGFMPHSC 836


>gi|357481199|ref|XP_003610885.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355512220|gb|AES93843.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 416

 Score =  176 bits (447), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 123/357 (34%), Positives = 169/357 (47%), Gaps = 27/357 (7%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAA 216
           G++   I +GTPP + + ++DTGSD+ W+QC PC  CY+Q  P+FDP  SS+Y+ + C +
Sbjct: 66  GQHLMEIYIGTPPIKITGLVDTGSDLIWIQCAPCLGCYKQIKPMFDPLKSSTYNNISCDS 125

Query: 217 PQCKSLDVSACR-ANRCLYQVAYGDGSFTVGDLVTETVSF----GNSGSVKGIALGCGHD 271
           P C  LD   C    RC Y   YGD S T G L  +T +F    G   S+     GCGH+
Sbjct: 126 PLCHKLDTGVCSPEKRCNYTYGYGDNSLTKGVLAQDTATFTSNTGKPVSLSRFLFGCGHN 185

Query: 272 NEGLFVG-SAGLLGLGGGMLSLTKQI----KATSLAYCLVD--RDSPASGVLEFNSAR-- 322
           N G F     GL+GLGGG  SL  QI         + CLV    D   S  + F      
Sbjct: 186 NTGGFNDHEMGLIGLGGGPTSLISQIGPLFGGKKFSQCLVPFLTDIKISSRMSFGKGSQV 245

Query: 323 -GGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEA-GDGGIIVDCGTAI 380
            G   VT PL+  +K DT Y+V L G SV         + F M+   G   ++VD GT  
Sbjct: 246 LGNGVVTTPLVPREK-DTSYFVTLLGISVED-------TYFPMNSTIGKANMLVDSGTPP 297

Query: 381 TRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDL 440
             L  Q Y+ +    VR    LKP +      T   +    +++ PT++ HF     L  
Sbjct: 298 ILLPQQLYDKVFAE-VRNKVALKPITDDPSLGTQLCYRTQTNLKGPTLTFHFVGANVLLT 356

Query: 441 PAKNYLIPV-DSAGTFCFA-FAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           P + ++ P   + G FC A +  T+S   + GN  Q    + FDL    V F P  C
Sbjct: 357 PIQTFIPPTPQTKGIFCLAIYNRTNSDPGVYGNFAQSNYLIGFDLDRQVVSFKPTDC 413


>gi|357131735|ref|XP_003567490.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 458

 Score =  176 bits (447), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 140/408 (34%), Positives = 189/408 (46%), Gaps = 43/408 (10%)

Query: 113 VNTLITKLQLAIYNVDRHELKPAEA--QILPEDFSTPVVSGASQGSGEYFSRIGVGTPPR 170
           V TL  K +       RH   P  A  QIL     TP           Y +R  +GTPP+
Sbjct: 66  VATLAAKPKPKPKGHSRHTFVPIAAGRQIL----RTP----------SYVARARLGTPPQ 111

Query: 171 QFSMVLDTGSDINWLQCRPCTECYQ-QSDPIFDPKTSSSYSPLPCAAPQCKSLD--VSAC 227
              + +D  +D  W+ C  C  C    S P FDP  SS+Y P+ C APQC  +     +C
Sbjct: 112 TLLVAIDPSNDAAWVPCSACLGCAPGASSPSFDPTQSSTYRPVRCGAPQCAQVPPATPSC 171

Query: 228 RAN---RCLYQVAYGDGSFTV---GDLVTETVSFGNSGSVKGIALGCGH--DNEGLFVGS 279
            A     C + ++Y   +       D ++ + S G +        GC       G  V  
Sbjct: 172 PAGPGASCAFNLSYASSTLHAVLGQDALSLSDSNGAAVPDDHYTFGCLRVVTGSGGSVPP 231

Query: 280 AGLLGLGGGMLSLTKQIKATS---LAYCLVD-RDSPASGVLEFNSARGGDAV-TAPLIRN 334
            GL+G G G LS   Q KAT     +YCL   + S  SG L    A     + T PL+ N
Sbjct: 232 QGLVGFGRGPLSFLSQTKATYGSIFSYCLPSYKSSNFSGTLRLGPAGQPRRIKTTPLLSN 291

Query: 335 KKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEA-GDGGIIVDCGTAITRLQTQAYNSLRD 393
               + YYV + G  V G+AV IP S   +D A G GG IVD GT  TRL   AY +LR+
Sbjct: 292 PHRPSLYYVAMVGVRVNGKAVPIPASALALDAATGRGGTIVDAGTMFTRLSPPAYAALRN 351

Query: 394 SFVRLAGNLKPTS-GVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSA 452
           +F R  G   P +  +  FDTCY  +G +S  VP V+  F  G  + LP +N +I   S 
Sbjct: 352 AFRR--GVSAPAAPALGGFDTCYYVNGTKS--VPAVAFVFAGGARVTLPEENVVISSTSG 407

Query: 453 GTFCFAFAP-----TSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           G  C A A       ++ L+++ ++QQQ  RV FD+ N RVGF+   C
Sbjct: 408 GVACLAMAAGPSDGVNAGLNVLASMQQQNHRVVFDVGNGRVGFSRELC 455


>gi|255543963|ref|XP_002513044.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223548055|gb|EEF49547.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 431

 Score =  176 bits (447), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 117/359 (32%), Positives = 174/359 (48%), Gaps = 21/359 (5%)

Query: 147 PVVSGAS-QGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKT 205
           P+ SG     S  Y  R  +GTP +   + +DT +D  W+ C  C  C   S  +F+   
Sbjct: 83  PIASGRQIVQSPTYIVRAKIGTPAQTMLLAMDTSNDAAWIPCSGCVGC---SSTVFNNVK 139

Query: 206 SSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIA 265
           S+++  + C APQCK +  S C  + C + + YG  S    +L  + V+   + S+    
Sbjct: 140 STTFKTVGCEAPQCKQVPNSKCGGSACAFNMTYGSSSI-AANLSQDVVTLA-TDSIPSYT 197

Query: 266 LGCGHDNEGLFVGSAGLLGLGGG---MLSLTKQIKATSLAYCLVD-RDSPASGVLEFNSA 321
            GC  +  G  +   GLLGLG G   +LS T+ +  ++ +YCL   R    SG L     
Sbjct: 198 FGCLTEATGSSIPPQGLLGLGRGPMSLLSQTQNLYQSTFSYCLPSFRSLNFSGSLRLGPV 257

Query: 322 RGGDAV-TAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAI 380
                + T PL++N +  + YYV L    VG + V IPPS    +     G I D GT  
Sbjct: 258 GQPKRIKTTPLLKNPRRSSLYYVNLMAIRVGRRVVDIPPSALAFNPTTGAGTIFDSGTVF 317

Query: 381 TRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDL 440
           TRL   AY ++RD+F +  GN   TS +  FDTCY       +  PT++  F +G  + L
Sbjct: 318 TRLVAPAYTAVRDAFRKRVGNATVTS-LGGFDTCYT----SPIVAPTITFMF-SGMNVTL 371

Query: 441 PAKNYLIPVDSAGTFCFAFAP----TSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           P  N LI   ++   C A A      +S L++I N+QQQ  R+ FD+ N+R+G     C
Sbjct: 372 PPDNLLIHSTASSITCLAMAAAPDNVNSVLNVIANMQQQNHRILFDVPNSRLGVAREPC 430


>gi|356555042|ref|XP_003545848.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 431

 Score =  176 bits (446), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 121/354 (34%), Positives = 171/354 (48%), Gaps = 26/354 (7%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAA 216
           G+Y     +GTPP     ++DT SDI W+QC+ C  CY  + P+FDP  S +Y  LPC++
Sbjct: 86  GDYLMSYSLGTPPFPVYGIVDTASDIIWVQCQLCETCYNDTSPMFDPSYSKTYKNLPCSS 145

Query: 217 PQCKSLDVSACRANR---CLYQVAYGDGSFTVGDLVTETVSFGNSGS----VKGIALGCG 269
             CKS+  ++C ++    C + V Y DGS + GDL+ ETV+ G+            +GC 
Sbjct: 146 TTCKSVQGTSCSSDERKICEHTVNYKDGSHSQGDLIVETVTLGSYNDPFVHFPRTVIGCI 205

Query: 270 HDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLV---DRDSPASGVLEFNSAR- 322
             N  +   S G++GLGGG +SL  Q+ ++     +YCL    DR S     L+F  A  
Sbjct: 206 R-NTNVSFDSIGIVGLGGGPVSLVPQLSSSISKKFSYCLAPISDRSSK----LKFGDAAM 260

Query: 323 -GGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAIT 381
             GD   +  I  K    FYY+ L  FSVG   ++   S      +G G II+D GT  T
Sbjct: 261 VSGDGTVSTRIVFKDWKKFYYLTLEAFSVGNNRIEFRSS--SSRSSGKGNIIIDSGTTFT 318

Query: 382 RLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLP 441
            L    Y+ L  +   +    +    +  F  CY  S    V VP ++ HF +G  + L 
Sbjct: 319 VLPDDVYSKLESAVADVVKLERAEDPLKQFSLCYK-STYDKVDVPVITAHF-SGADVKLN 376

Query: 442 AKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           A N  I V S    C AF  + S  +I GN+ QQ   V +DL    V F P  C
Sbjct: 377 ALNTFI-VASHRVVCLAFLSSQSG-AIFGNLAQQNFLVGYDLQRKIVSFKPTDC 428


>gi|125571687|gb|EAZ13202.1| hypothetical protein OsJ_03122 [Oryza sativa Japonica Group]
          Length = 453

 Score =  176 bits (446), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 134/364 (36%), Positives = 183/364 (50%), Gaps = 25/364 (6%)

Query: 154 QGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLP 213
           +GSG+Y    G+GTP    S   DTGSD+ W +C  C  C  +  P + P +SSS + + 
Sbjct: 87  KGSGDYAMSFGIGTPATGLSGEADTGSDLIWTKCGACARCSPRGSPSYYPTSSSSAAFVA 146

Query: 214 CAAPQCKSLDVSACR--------ANRCLYQVAYGDG----SFTVGDLVTETVSFG-NSGS 260
           C    C  L    C         +  C Y  AYG+      +T G L+TET +FG ++ +
Sbjct: 147 CGDRTCGELPRPLCSNVAGGGSGSGNCSYHYAYGNARDTHHYTEGILMTETFTFGDDAAA 206

Query: 261 VKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCL---VDRDSPAS-GVL 316
             GIA GC   +EG F   +GL+GLG G LSL  Q+   +  Y L   +   SP S G L
Sbjct: 207 FPGIAFGCTLRSEGGFGTGSGLVGLGRGKLSLVTQLNVEAFGYRLSSDLSAPSPISFGSL 266

Query: 317 EFNSARGGDA-VTAPLIRNKKVDT--FYYVGLTGFSVGGQAVQIPPSLFEMDEA-GDGGI 372
              +   GD+ ++ PL+ N  V    FYYVGLTG SVGG+ VQIP   F  D + G GG+
Sbjct: 267 ADVTGGNGDSFMSTPLLTNPVVQDLPFYYVGLTGISVGGKLVQIPSGTFSFDRSTGAGGV 326

Query: 373 IVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHF 432
           I D GT +T L   AY  +RD  +   G  KP       D      G  +   P++ LHF
Sbjct: 327 IFDSGTTLTMLPDPAYTLVRDELLSQMGFQKPPPAANDDDLICFTGGSSTTTFPSMVLHF 386

Query: 433 GAGKALDLPAKNYLIPV---DSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANN-RV 488
             G  +DL  +NYL  +   +     C++   +S AL+IIGN+ Q    V FDL+ N R+
Sbjct: 387 DGGADMDLSTENYLPQMQGQNGETARCWSVVKSSQALTIIGNIMQMDFHVVFDLSGNARM 446

Query: 489 GFTP 492
            F P
Sbjct: 447 LFQP 450


>gi|359496797|ref|XP_002277380.2| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
           vinifera]
          Length = 358

 Score =  176 bits (445), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 102/302 (33%), Positives = 160/302 (52%), Gaps = 22/302 (7%)

Query: 106 LERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGV 165
           L  D ARV TL ++L        +  L   + +  P+  S P+  GAS GSG Y+ ++G 
Sbjct: 66  LAWDDARVKTLNSRLTRKDTRFPKSVLTKKDIR-FPKSVSVPLNPGASIGSGNYYVKVGF 124

Query: 166 GTPPRQFSMVLDTGSDINWLQCRPC-TECYQQSDPIFDPKTSSSYSPLPCAAPQCKSL-- 222
           G+P R +SM++DTGS ++WLQC+PC   C+ Q+DP+FDP  S +Y  L C + QC SL  
Sbjct: 125 GSPARYYSMIVDTGSSLSWLQCKPCVVYCHVQADPLFDPSASKTYKSLSCTSSQCSSLVD 184

Query: 223 -----DVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFV 277
                 +    +N C+Y  +YGD S+++G L  + ++   S ++ G   GCG D++GLF 
Sbjct: 185 ATLNNPLCETSSNVCVYTASYGDSSYSMGYLSQDLLTLAPSQTLPGFVYGCGQDSDGLFG 244

Query: 278 GSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASGVLEFNSAR--GGDAVTAPLI 332
            +AG+LGLG   LS+  Q+ +    + +YCL  R     G L    A   G      P+ 
Sbjct: 245 RAAGILGLGRNKLSMLGQVSSKFGYAFSYCLPTRG--GGGFLSIGKASLAGSAYKFTPMT 302

Query: 333 RNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLR 392
            +    + Y++ LT  +VGG+A+ +  + + +        I+D GT ITRL    Y   +
Sbjct: 303 TDPGNPSLYFLRLTAITVGGRALGVAAAQYRVPT------IIDSGTVITRLPMSVYTPFQ 356

Query: 393 DS 394
            +
Sbjct: 357 QA 358


>gi|125527370|gb|EAY75484.1| hypothetical protein OsI_03384 [Oryza sativa Indica Group]
          Length = 453

 Score =  176 bits (445), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 134/364 (36%), Positives = 183/364 (50%), Gaps = 25/364 (6%)

Query: 154 QGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLP 213
           +GSG+Y    G+GTP    S   DTGSD+ W +C  C  C  +  P + P +SSS + + 
Sbjct: 87  KGSGDYAMSFGIGTPATGLSGEADTGSDLIWTKCGACARCSPRGSPSYYPTSSSSAAFVA 146

Query: 214 CAAPQCKSLDVSACR--------ANRCLYQVAYGDG----SFTVGDLVTETVSFG-NSGS 260
           C    C  L    C         +  C Y  AYG+      +T G L+TET +FG ++ +
Sbjct: 147 CGDRTCGELPRPLCSNVAGGGSGSGNCSYHYAYGNARDTHHYTEGILMTETFTFGDDAAA 206

Query: 261 VKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCL---VDRDSPAS-GVL 316
             GIA GC   +EG F   +GL+GLG G LSL  Q+   +  Y L   +   SP S G L
Sbjct: 207 FPGIAFGCTLRSEGGFGTGSGLVGLGRGKLSLVTQLNVEAFGYRLSSDLSAPSPISFGSL 266

Query: 317 EFNSARGGDA-VTAPLIRNKKVDT--FYYVGLTGFSVGGQAVQIPPSLFEMDEA-GDGGI 372
              +   GD+ ++ PL+ N  V    FYYVGLTG SVGG+ VQIP   F  D + G GG+
Sbjct: 267 ADVTGGNGDSFMSTPLLTNPVVQDLPFYYVGLTGISVGGKLVQIPSGTFSFDRSTGAGGV 326

Query: 373 IVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHF 432
           I D GT +T L   AY  +RD  +   G  KP       D      G  +   P++ LHF
Sbjct: 327 IFDSGTTLTMLPDPAYTLVRDELLSQMGFQKPPPAANDDDLICFTGGSSTTTFPSMVLHF 386

Query: 433 GAGKALDLPAKNYLIPV---DSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANN-RV 488
             G  +DL  +NYL  +   +     C++   +S AL+IIGN+ Q    V FDL+ N R+
Sbjct: 387 DGGADMDLSTENYLPQMQGQNGETARCWSVVKSSQALTIIGNIMQMDFHVVFDLSGNARM 446

Query: 489 GFTP 492
            F P
Sbjct: 447 LFQP 450


>gi|224067042|ref|XP_002302336.1| predicted protein [Populus trichocarpa]
 gi|222844062|gb|EEE81609.1| predicted protein [Populus trichocarpa]
          Length = 438

 Score =  175 bits (444), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 132/401 (32%), Positives = 183/401 (45%), Gaps = 27/401 (6%)

Query: 108 RDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQ-GSGEYFSRIGVG 166
           +  + VNT+IT     + + D   LK        +  + P+  G        Y  R+ +G
Sbjct: 51  KQESWVNTVIT-----MASKDPERLKYLSTLADQKTTAVPIAPGQQVLKIANYVVRVKLG 105

Query: 167 TPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSA 226
           TP +Q  MVLDT +D  W+ C  CT C   S   F P  S++   L C+  QC  +   +
Sbjct: 106 TPGQQMFMVLDTSNDAAWVPCSGCTGC---SSTTFLPNASTTLGSLDCSGAQCSQVRGFS 162

Query: 227 CRA---NRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLL 283
           C A   + CL+  +YG  S     LV + ++  N   + G   GC +   G  +   GLL
Sbjct: 163 CPATGSSACLFNQSYGGDSSLTATLVQDAITLAND-VIPGFTFGCINAVSGGSIPPQGLL 221

Query: 284 GLGGGMLSLTKQIKAT---SLAYCLVDRDSPA-SGVLEFNSARGGDAV-TAPLIRNKKVD 338
           GLG G +SL  Q  A      +YCL    S   SG L+        ++ T PL+RN    
Sbjct: 222 GLGRGPISLISQAGAMYSGVFSYCLPSFKSYYFSGSLKLGPVGQPKSIRTTPLLRNPHRP 281

Query: 339 TFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRL 398
           + YYV LTG SVG   V IP      D     G I+D GT ITR     Y ++RD F + 
Sbjct: 282 SLYYVNLTGVSVGRIKVPIPSEQLVFDPNTGAGTIIDSGTVITRFVQPVYFAIRDEFRKQ 341

Query: 399 AGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFA 458
                P S +  FDTC  F+       P ++LHF  G  L LP +N LI   S    C +
Sbjct: 342 VNG--PISSLGAFDTC--FAATNEAEAPAITLHF-EGLNLVLPMENSLIHSSSGSLACLS 396

Query: 459 FAP----TSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            A      +S L++I N+QQQ  R+ FD  N+R+G     C
Sbjct: 397 MAAAPNNVNSVLNVIANLQQQNLRIMFDTTNSRLGIARELC 437


>gi|115458646|ref|NP_001052923.1| Os04g0448500 [Oryza sativa Japonica Group]
 gi|38344830|emb|CAD40872.2| OSJNBa0064H22.11 [Oryza sativa Japonica Group]
 gi|113564494|dbj|BAF14837.1| Os04g0448500 [Oryza sativa Japonica Group]
          Length = 464

 Score =  175 bits (443), Expect = 5e-41,   Method: Compositional matrix adjust.
 Identities = 122/415 (29%), Positives = 196/415 (47%), Gaps = 46/415 (11%)

Query: 118 TKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLD 177
           ++ +LA   + R E   A   ++ E   TP++       GEY  ++G+GTPP +F+  +D
Sbjct: 55  SRYRLAGIGMARGEAASARKAVVAE---TPIMPAG----GEYLVKLGIGTPPYKFTAAID 107

Query: 178 TGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRAN---RCLY 234
           T SD+ W QC+PCT CY Q DP+F+P+ SS+Y+ LPC++  C  LDV  C  +    C Y
Sbjct: 108 TASDLIWTQCQPCTGCYHQVDPMFNPRVSSTYAALPCSSDTCDELDVHRCGHDDDESCQY 167

Query: 235 QVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLF--VGSAGLLGLGGGMLSL 292
              Y   + T G L  + +  G   + +G+A GC   + G      ++G++GLG G LSL
Sbjct: 168 TYTYSGNATTEGTLAVDKLVIGED-AFRGVAFGCSTSSTGGAPPPQASGVVGLGRGPLSL 226

Query: 293 TKQIKATSLAYCLVDRDSPASGVL----EFNSARGG-DAVTAPLIRNKKVDTFYYVGLTG 347
             Q+     AYCL    S   G L    + ++AR   + +  P+ R+ +  ++YY+ L G
Sbjct: 227 VSQLSVRRFAYCLPPPASRIPGKLVLGADADAARNATNRIAVPMRRDPRYPSYYYLNLDG 286

Query: 348 FSVGGQAVQIP-----------------------PSLFEMDEAGDGGIIVDCGTAITRLQ 384
             +G +A+ +P                        +   + +A   G+I+D  + IT L+
Sbjct: 287 LLIGDRAMSLPPTTTTTATATATAPAPAPTPSPNATAVAVGDANRYGMIIDIASTITFLE 346

Query: 385 TQAYNSLRDSF---VRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLP 441
              Y+ L +     +RL      + G+ L     D      V VP V+L F  G+ L L 
Sbjct: 347 ASLYDELVNDLEVEIRLPRGTGSSLGLDLCFILPDGVAFDRVYVPAVALAFD-GRWLRLD 405

Query: 442 AKNYLIPVDSAGTFCFAFAPTSS-ALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
                     +G  C       + ++SI+GN QQQ  +V ++L   RV F  + C
Sbjct: 406 KARLFAEDRESGMMCLMVGRAEAGSVSILGNFQQQNMQVLYNLRRGRVTFVQSPC 460


>gi|356546376|ref|XP_003541602.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 450

 Score =  175 bits (443), Expect = 5e-41,   Method: Compositional matrix adjust.
 Identities = 123/361 (34%), Positives = 182/361 (50%), Gaps = 22/361 (6%)

Query: 152 ASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSP 211
           ASQG  EY     VGTPP +   V+DTGS I W+QC+ C +CY+Q+ PIFDP  S +Y  
Sbjct: 92  ASQG--EYLMSYSVGTPPFEILGVVDTGSGITWMQCQRCEDCYEQTTPIFDPSKSKTYKT 149

Query: 212 LPCAAPQCKS-LDVSACRANR--CLYQVAYGDGSFTVGDLVTETVSFG--NSGSVK--GI 264
           LPC++  C+S +   +C +++  C Y + YGDGS + GDL  ET++ G  N  SV+    
Sbjct: 150 LPCSSNMCQSVISTPSCSSDKIGCKYTIKYGDGSHSQGDLSVETLTLGSTNGSSVQFPNT 209

Query: 265 ALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT----SLAYCLVD--RDSPASGVLEF 318
            +GCGH+N+G F G    +   GG         ++      +YCL      S +S  L F
Sbjct: 210 VIGCGHNNKGTFQGEGSGVVGLGGGPVSLISQLSSSIGGKFSYCLAPMFSQSNSSSKLNF 269

Query: 319 NSA---RGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQ-IPPSLFEMDEAGDGGIIV 374
             A    G  AV+ PL+     + FYY+ L  FSVG + ++ +  S       G+G II+
Sbjct: 270 GDAAVVSGLGAVSTPLVSKTGSEVFYYLTLEAFSVGDKRIEFVGGSSSSGSSNGEGNIII 329

Query: 375 DCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGA 434
           D GT +T L  + Y++L  +        + +        CY  +    + VP ++ HF  
Sbjct: 330 DSGTTLTLLPQEDYSNLESAVADAIQANRVSDPSNFLSLCYQTTPSGQLDVPVITAHF-K 388

Query: 435 GKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNK 494
           G  ++L   +  + V + G  CFAF  +S  +SI GN+ Q    V +DL    V F P  
Sbjct: 389 GADVELNPISTFVQV-AEGVVCFAFH-SSEVVSIFGNLAQLNLLVGYDLMEQTVSFKPTD 446

Query: 495 C 495
           C
Sbjct: 447 C 447


>gi|356525748|ref|XP_003531485.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 436

 Score =  175 bits (443), Expect = 6e-41,   Method: Compositional matrix adjust.
 Identities = 108/353 (30%), Positives = 171/353 (48%), Gaps = 21/353 (5%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAA 216
           GEY  R  +GTPP +   + DT SD+ W+QC PC  C+ Q  P+F+P  SS+++ L C +
Sbjct: 88  GEYLMRFYIGTPPVERLAIADTASDLIWVQCSPCETCFPQDTPLFEPHKSSTFANLSCDS 147

Query: 217 PQCKSLDVSAC--RANRCLYQVAYGDGSFTVGDLVTETVSFGN-SGSVKGIALGCGHDNE 273
             C S ++  C    N CLY   YGDGS T G L TE++ FG+ + +      GCG +N+
Sbjct: 148 QPCTSSNIYYCPLVGNLCLYTNTYGDGSSTKGVLCTESIHFGSQTVTFPKTIFGCGSNND 207

Query: 274 GLFVGS---AGLLGLGGGMLSLTKQIK---ATSLAYCLVDRDSPASGVLEF---NSARGG 324
            +   S    G++GLG G LSL  Q+        +YCL+   S ++  L+F    +  G 
Sbjct: 208 FMHQISNKVTGIVGLGAGPLSLVSQLGDQIGHKFSYCLLPFTSTSTIKLKFGNDTTITGN 267

Query: 325 DAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQ 384
             V+ PLI +    ++Y++ L G ++G + +Q+        +  +G II+D GT +T L+
Sbjct: 268 GVVSTPLIIDPHYPSYYFLHLVGITIGQKMLQV-----RTTDHTNGNIIIDLGTVLTYLE 322

Query: 385 TQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKN 444
              Y++   + +R A  +  T     +   + F    ++  P +   F   K   L  KN
Sbjct: 323 VNFYHNFV-TLLREALGISETKDDIPYPFDFCFPNQANITFPKIVFQFTGAKVF-LSPKN 380

Query: 445 YLIPVDSAGTFCFAFAPT--SSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
                D     C A  P   +   S+ GN+ Q   +V +D    +V F P  C
Sbjct: 381 LFFRFDDLNMICLAVLPDFYAKGFSVFGNLAQVDFQVEYDRKGKKVSFAPADC 433


>gi|242050428|ref|XP_002462958.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
 gi|241926335|gb|EER99479.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
          Length = 460

 Score =  174 bits (442), Expect = 6e-41,   Method: Compositional matrix adjust.
 Identities = 131/449 (29%), Positives = 201/449 (44%), Gaps = 40/449 (8%)

Query: 79  SFSLPLH---SREILHKTRHNDYRS------LVLSRLERDSARVNTLITKLQLAIYNVDR 129
           SF++P H   SR I  +  H D R        V    +R   RVN L+        +  R
Sbjct: 17  SFAVPGHGQPSRGIRLELTHVDARGDFTGSDRVRRAADRSHRRVNGLLAAAPPPAASTLR 76

Query: 130 HELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQC-R 188
            +     A       S          +  Y     +GTPP   S VLDTGSD+ W QC  
Sbjct: 77  SDGGGGGACAATAAASV------HASTATYLVDFAIGTPPLALSAVLDTGSDLIWTQCDA 130

Query: 189 PCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSL-------------DVSACRANRCLYQ 235
           PC  C+ Q  P++ P  S +Y+ + C +  C +L                A     C Y 
Sbjct: 131 PCRRCFPQPAPLYAPARSVTYANVSCGSRLCDALPSLRPSSRCSASASAPAPERGGCTYY 190

Query: 236 VAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQ 295
            +YGDGS T G L TET +FG   +V  +A GCG DN G    S+GL+G+G G LSL  Q
Sbjct: 191 YSYGDGSSTDGVLATETFTFGAGTTVHDLAFGCGTDNLGGTDNSSGLVGMGRGPLSLVSQ 250

Query: 296 IKATSLAYCLV---DRDSPASGVLEFNSARGGDAVTAPLI---RNKKVDTFYYVGLTGFS 349
           +  T  +YC     D  + +   L  +++    A + P +      +  ++YY+ L G +
Sbjct: 251 LGVTKFSYCFTPFNDTTTSSPLFLGSSASLSPAAKSTPFVPSPSGPRRSSYYYLSLEGIT 310

Query: 350 VGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVA 409
           VG   + I P++F +  +G GG+I+D GT  T L+ +A+  L  +          +    
Sbjct: 311 VGDTLLPIDPAVFRLTASGRGGLIIDSGTTFTALEERAFVVLARAVAARVALPLASGAHL 370

Query: 410 LFDTCY---DFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSAL 466
               C+      G  +V VP + LHF  G  ++LP  + ++    AG  C     ++  +
Sbjct: 371 GLSVCFAAPQGRGPEAVDVPRLVLHFD-GADMELPRSSAVVEDRVAGVACLGIV-SARGM 428

Query: 467 SIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           S++G++QQQ   V +D+  + + F P  C
Sbjct: 429 SVLGSMQQQNMHVRYDVGRDVLSFEPANC 457


>gi|296085498|emb|CBI29230.3| unnamed protein product [Vitis vinifera]
          Length = 542

 Score =  174 bits (442), Expect = 7e-41,   Method: Compositional matrix adjust.
 Identities = 120/351 (34%), Positives = 171/351 (48%), Gaps = 44/351 (12%)

Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCA 215
           +GEY   + +GTPP     ++DTGSD+ W QCRPCT CY+Q  P+FDPK SS+Y    C 
Sbjct: 89  AGEYLMNLYIGTPPVPVIAIVDTGSDLTWTQCRPCTHCYKQVVPLFDPKNSSTYRDSSCG 148

Query: 216 APQCKSL--DVSACRANRCLYQVAYGDGSFTVGDLVTETVSF----GNSGSVKGIALGCG 269
              C +L  D S  +  +C ++ +Y DGSFT G+L +ET++     G   S  G A GCG
Sbjct: 149 TSFCLALGKDRSCSKEKKCTFRYSYADGSFTGGNLASETLTVDSTAGKPVSFPGFAFGCG 208

Query: 270 HDNEGLF-VGSAGLLGLGGGMLSLTKQIKATS---LAYCL--VDRDSPASGVLEFNSA-- 321
           H + G+F   S+G++GLGGG LSL  Q+K+T     +YCL  V  DS  S  + F ++  
Sbjct: 209 HSSGGIFDKSSSGIVGLGGGELSLISQLKSTINGLFSYCLLPVSTDSSISSRINFGASGR 268

Query: 322 -RGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAI 380
             G   V+ PL    K          G+S             +  E  +G IIVD GT  
Sbjct: 269 VSGYGTVSTPLRLPYK----------GYS-------------KKTEVEEGNIIVDSGTTY 305

Query: 381 TRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDL 440
           T L  + Y+ L  S        +      +F  CY+ +    +  P ++ HF        
Sbjct: 306 TFLPQEFYSKLEKSVANSIKGKRVRDPNGIFSLCYNTTA--EINAPIITAHFKDANVELQ 363

Query: 441 PAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFT 491
           P   ++   +     CF  APTS  + ++GN+ Q    V FDL   R GF+
Sbjct: 364 PLNTFMRMQEDL--VCFTVAPTSD-IGVLGNLAQVNFLVGFDLRKKR-GFS 410



 Score = 50.4 bits (119), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 37/130 (28%), Positives = 53/130 (40%), Gaps = 4/130 (3%)

Query: 366 EAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRV 425
           E  +G IIVD GT  T L  + Y  L +S        +      +   CY+ + +  +  
Sbjct: 414 EVEEGNIIVDSGTTYTYLPLEFYVKLEESVAHSIKGKRVRDPNGISSLCYN-TTVDQIDA 472

Query: 426 PTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLAN 485
           P ++ HF        P   +L   +     CF   PTS  + I+GN+ Q    V FDL  
Sbjct: 473 PIITAHFKDANVELQPWNTFLRMQEDL--VCFTVLPTSD-IGILGNLAQVNFLVGFDLRK 529

Query: 486 NRVGFTPNKC 495
            RV F    C
Sbjct: 530 KRVSFKAADC 539


>gi|356495754|ref|XP_003516738.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 420

 Score =  174 bits (442), Expect = 7e-41,   Method: Compositional matrix adjust.
 Identities = 116/354 (32%), Positives = 167/354 (47%), Gaps = 22/354 (6%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAA 216
           G Y   + +GTPP +   + DTGSD+ W  C PC  CY+Q +P+FDP+ S++Y  + C +
Sbjct: 70  GHYLMELSIGTPPFKIYGIADTGSDLTWTSCVPCNNCYKQRNPMFDPQKSTTYRNISCDS 129

Query: 217 PQCKSLDVSACR-ANRCLYQVAYGDGSFTVGDLVTETVSF----GNSGSVKGIALGCGHD 271
             C  LD   C    RC Y  AY   + T G L  ET++     G S  +KGI  GCGH+
Sbjct: 130 KLCHKLDTGVCSPQKRCNYTYAYASAAITRGVLAQETITLSSTKGKSVPLKGIVFGCGHN 189

Query: 272 NEGLFVG-SAGLLGLGGGMLSLTKQIKAT----SLAYCLV--DRDSPASGVLEF---NSA 321
           N G F     G++GLGGG +SL  Q+ ++      + CLV    D   S  + F   +  
Sbjct: 190 NTGGFNDHEMGIIGLGGGPVSLISQMGSSFGGKRFSQCLVPFHTDVSVSSKMSFGKGSKV 249

Query: 322 RGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAIT 381
            G   V+ PL+  K+  T Y+V L G SV    +    S   +++   G + +D GT  T
Sbjct: 250 SGKGVVSTPLVA-KQDKTPYFVTLLGISVENTYLHFNGSSQNVEK---GNMFLDSGTPPT 305

Query: 382 RLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLP 441
            L TQ Y+ +    VR    +KP +          +    ++R P ++ HF        P
Sbjct: 306 ILPTQLYDQVVAQ-VRSEVAMKPVTDDPDLGPQLCYRTKNNLRGPVLTAHFEGADVKLSP 364

Query: 442 AKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            + ++ P D  G FC  F  TSS   + GN  Q    + FDL    V F P  C
Sbjct: 365 TQTFISPKD--GVFCLGFTNTSSDGGVYGNFAQSNYLIGFDLDRQVVSFKPKDC 416


>gi|255566010|ref|XP_002523993.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223536720|gb|EEF38361.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 439

 Score =  174 bits (442), Expect = 8e-41,   Method: Compositional matrix adjust.
 Identities = 132/415 (31%), Positives = 191/415 (46%), Gaps = 37/415 (8%)

Query: 106 LERDSAR---VNTLITKLQLAIYNVDR-----HELKPAEAQILPEDFSTPVVSGASQGSG 157
           + RDS +    N   T  Q  +  V R     H   P +   +   F+    S      G
Sbjct: 34  INRDSPKSPFYNPRETPTQRIVSAVRRSMSRVHHFSPTKNSDI---FTDTAQSEMISNQG 90

Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAP 217
           EY  +  +GTP      + DTGSD+ W QC+PC +CY+Q  P+FDPK+SS+Y  + C+  
Sbjct: 91  EYLMKFSLGTPAFDILAIADTGSDLIWTQCKPCDQCYEQDAPLFDPKSSSTYRDISCSTK 150

Query: 218 QCKSLDVSAC---RANR-CLYQVAYGDGSFTVGDLVTETVSFGNSGS----VKGIALGCG 269
           QC  L   A      N+ C Y  +YGD SFT G++  +T++ G++      +    +GCG
Sbjct: 151 QCDLLKEGASCSGEGNKTCHYSYSYGDRSFTSGNVAADTITLGSTSGRPVLLPKAIIGCG 210

Query: 270 HDNEGLFVGSAGLLGLGGGM-LSLTKQIKAT---SLAYCLVDRDSPA--SGVLEFNS--- 320
           H+N G F      +   GG  +SL  Q+ +T     +YCLV   S A  S  L F S   
Sbjct: 211 HNNGGSFTEKGSGIVGLGGGPISLISQLGSTIDGKFSYCLVPLSSNATNSSKLNFGSNGI 270

Query: 321 ARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAI 380
             GG   + PLI +K  DTFY++ L   SVG + ++ P S F   E   G II+D GT +
Sbjct: 271 VSGGGVQSTPLI-SKDPDTFYFLTLEAVSVGSERIKFPGSSFGTSE---GNIIIDSGTTL 326

Query: 381 TRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDL 440
           T      ++ L  +               +   CY       ++ P+++ HF  G  + L
Sbjct: 327 TLFPEDFFSELSSAVQDAVAGTPVEDPSGILSLCYSIDA--DLKFPSITAHFD-GADVKL 383

Query: 441 PAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
              N  + V S    CFAF P +S  +I GN+ Q    V +DL    V F P  C
Sbjct: 384 NPLNTFVQV-SDTVLCFAFNPINSG-AIFGNLAQMNFLVGYDLEGKTVSFKPTDC 436


>gi|125553822|gb|EAY99427.1| hypothetical protein OsI_21398 [Oryza sativa Indica Group]
          Length = 469

 Score =  174 bits (441), Expect = 9e-41,   Method: Compositional matrix adjust.
 Identities = 110/335 (32%), Positives = 157/335 (46%), Gaps = 22/335 (6%)

Query: 173 SMVLDTGSDINWLQCRPCT--ECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSA---C 227
           +MVLDT SD+ W+QC PC    CY Q D ++DP  SSS     C +P C  L   A    
Sbjct: 145 TMVLDTASDVTWVQCSPCPTPPCYPQKDVLYDPTKSSSSGVFSCNSPTCTQLGPYANGCT 204

Query: 228 RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFV---GSAGLLG 284
             N+C Y+V Y DG+ T G  +++ ++   + +V+    GC H  +G F     +AG++ 
Sbjct: 205 NNNQCQYRVRYPDGTSTAGTYISDLLTITPATAVRSFQFGCSHGVQGSFSFGSSAAGIMA 264

Query: 285 LGGGMLSLTKQIKATS---LAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNKKVD-TF 340
           LGGG  SL  Q  AT     ++C           L          V  P+++N  +  TF
Sbjct: 265 LGGGPESLVSQTAATYGRVFSHCFPPPTRRGFFTLGVPRVAAWRYVLTPMLKNPAIPPTF 324

Query: 341 YYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAG 400
           Y V L   +V GQ + +PP++F        G  +D  TAITRL   AY +LR +F     
Sbjct: 325 YMVRLEAIAVAGQRIAVPPTVFA------AGAALDSRTAITRLPPTAYQALRQAFRDRMA 378

Query: 401 NLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFA 460
             +P       DTCYD +G+RS  +P ++L F    A++L     L      G   F   
Sbjct: 379 MYQPAPPKGPLDTCYDMAGVRSFALPRITLVFDKNAAVELDPSGVLF----QGCLAFTAG 434

Query: 461 PTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           P      IIGN+Q Q   V +++    VGF    C
Sbjct: 435 PNDQVPGIIGNIQLQTLEVLYNIPAALVGFRHAAC 469


>gi|115466060|ref|NP_001056629.1| Os06g0118700 [Oryza sativa Japonica Group]
 gi|55296436|dbj|BAD68559.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|113594669|dbj|BAF18543.1| Os06g0118700 [Oryza sativa Japonica Group]
 gi|215767921|dbj|BAH00150.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 494

 Score =  174 bits (441), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 110/335 (32%), Positives = 157/335 (46%), Gaps = 22/335 (6%)

Query: 173 SMVLDTGSDINWLQCRPCT--ECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSA---C 227
           +MVLDT SD+ W+QC PC    CY Q D ++DP  SSS     C +P C  L   A    
Sbjct: 170 TMVLDTASDVTWVQCSPCPTPPCYPQKDVLYDPTKSSSSGVFSCNSPTCTQLGPYANGCT 229

Query: 228 RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFV---GSAGLLG 284
             N+C Y+V Y DG+ T G  +++ ++   + +V+    GC H  +G F     +AG++ 
Sbjct: 230 NNNQCQYRVRYPDGTSTAGTYISDLLTITPATAVRSFQFGCSHGVQGSFSFGSSAAGIMA 289

Query: 285 LGGGMLSLTKQIKATS---LAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNKKVD-TF 340
           LGGG  SL  Q  AT     ++C           L          V  P+++N  +  TF
Sbjct: 290 LGGGPESLVSQTAATYGRVFSHCFPPPTRRGFFTLGVPRVAAWRYVLTPMLKNPAIPPTF 349

Query: 341 YYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAG 400
           Y V L   +V GQ + +PP++F        G  +D  TAITRL   AY +LR +F     
Sbjct: 350 YMVRLEAIAVAGQRIAVPPTVFA------AGAALDSRTAITRLPPTAYQALRQAFRDRMA 403

Query: 401 NLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFA 460
             +P       DTCYD +G+RS  +P ++L F    A++L     L      G   F   
Sbjct: 404 MYQPAPPKGPLDTCYDMAGVRSFALPRITLVFDKNAAVELDPSGVLF----QGCLAFTAG 459

Query: 461 PTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           P      IIGN+Q Q   V +++    VGF    C
Sbjct: 460 PNDQVPGIIGNIQLQTLEVLYNIPAALVGFRHAAC 494


>gi|225446261|ref|XP_002265547.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 440

 Score =  174 bits (440), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 129/390 (33%), Positives = 185/390 (47%), Gaps = 36/390 (9%)

Query: 126 NVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWL 185
           NV+R   + A A I  E  +  V     Q    +     VG PP    + +DTGSD+ W+
Sbjct: 62  NVERRRTRRA-AFITDEIQANMVADDRGQA---FLVNFSVGRPPVPQLVGIDTGSDLLWV 117

Query: 186 QCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQC-KSLDVSACRANRCLYQVAYGDGSFT 244
           QCRPC +C++QS PIFDP  SS+Y  L   +P C  S        N+C+Y  +Y DGS +
Sbjct: 118 QCRPCADCFRQSTPIFDPSKSSTYVDLSYDSPICPNSPQKKYNHLNQCIYNASYADGSTS 177

Query: 245 VGDLVTETVSFGNSG----SVKGIALGCGHDNEGLFVG-SAGLLGLGGGMLSLTKQIKAT 299
            G+L TE + F  S     +V  +  GCGH N G F G  +G+LGL  G  S+  ++  +
Sbjct: 178 SGNLATEDIVFETSDQGTVTVSSVVFGCGHSNRGRFDGQQSGILGLSAGDQSIVSRL-GS 236

Query: 300 SLAYCLVDRDSPASGVLEFNSARGGDAV-----TAPLIRNKKVDTFYYVGLTGFSVGGQA 354
             +YC+ D   P       N    GD V     + P       + FYYV L G SVG   
Sbjct: 237 RFSYCIGDLFDPH---YTHNQLVLGDGVKMEGSSTPF---HTFNGFYYVTLEGISVGETR 290

Query: 355 VQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLA-GNLKPTSGVALFDT 413
           + I P +F+  E+G GG+++D GT  T L    ++ L +   RL  G+ +      ++ T
Sbjct: 291 LDINPEVFQRTESGQGGVVMDSGTTATFLAKDGFDPLSNEIQRLVRGHFQQ----VIYRT 346

Query: 414 -----CYDFSGLRSVR-VPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTS--SA 465
                CY       +R  P ++ HF  G  L L A N L    +   FC A   ++  + 
Sbjct: 347 IPGWLCYKGRVNEDLRGFPELAFHFAEGADLVLDA-NSLFVQKNQDVFCLAVLESNLKNI 405

Query: 466 LSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            S+IG + QQ   V++DL   RV F    C
Sbjct: 406 GSVIGIMAQQHYNVAYDLIGKRVYFQRTDC 435


>gi|116310064|emb|CAH67085.1| H0818E04.2 [Oryza sativa Indica Group]
 gi|116310187|emb|CAH67199.1| OSIGBa0152K17.11 [Oryza sativa Indica Group]
          Length = 464

 Score =  173 bits (439), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 121/415 (29%), Positives = 195/415 (46%), Gaps = 46/415 (11%)

Query: 118 TKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLD 177
           ++ +LA   + R E   A   ++ E   TP++       GEY  ++G+GTPP +F+  +D
Sbjct: 55  SRYRLAGIGMARGEAASARKAVVAE---TPIMPAG----GEYLVKLGIGTPPYKFTAAID 107

Query: 178 TGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRAN---RCLY 234
           T SD+ W QC+PCT CY Q DP+F+P+ SS+Y+ LPC++  C  LDV  C  +    C Y
Sbjct: 108 TASDLIWTQCQPCTGCYHQVDPMFNPRVSSTYAALPCSSDTCDELDVHRCGHDDDESCQY 167

Query: 235 QVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLF--VGSAGLLGLGGGMLSL 292
              Y   + T G L  + +  G   + +G+A GC   + G      ++G++GLG G LSL
Sbjct: 168 TYTYSGNATTEGTLAVDKLVIGED-AFRGVAFGCSTSSTGGAPPPQASGVVGLGRGPLSL 226

Query: 293 TKQIKATSLAYCLVDRDSPASGVL----EFNSARGG-DAVTAPLIRNKKVDTFYYVGLTG 347
             Q+     AYCL    S   G L    + ++AR   + +  P+ R+ +  ++YY+ L G
Sbjct: 227 VSQLSVRRFAYCLPPPASRIPGKLVLGADADAARNATNRIAVPMRRDPRYPSYYYLNLDG 286

Query: 348 FSVGGQAVQIP-----------------------PSLFEMDEAGDGGIIVDCGTAITRLQ 384
             +G + + +P                        +   + +A   G+I+D  + IT L+
Sbjct: 287 LLIGDRTMSLPPTTTTTATATATAPAPAPTPSPNATAVAVGDANRYGMIIDIASTITFLE 346

Query: 385 TQAYNSLRDSF---VRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLP 441
              Y+ L +     +RL      + G+ L     D      V VP V+L F  G+ L L 
Sbjct: 347 ASLYDELVNDLEVEIRLPRGTGSSLGLDLCFILPDGVAFDRVYVPAVALAFD-GRWLRLD 405

Query: 442 AKNYLIPVDSAGTFCFAFAPTSS-ALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
                     +G  C       + ++SI+GN QQQ  +V ++L   RV F  + C
Sbjct: 406 KARLFAEDRESGMMCLMVGRAEAGSVSILGNFQQQNMQVLYNLRRGRVTFVQSPC 460


>gi|218197465|gb|EEC79892.1| hypothetical protein OsI_21411 [Oryza sativa Indica Group]
          Length = 720

 Score =  173 bits (439), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 108/319 (33%), Positives = 166/319 (52%), Gaps = 28/319 (8%)

Query: 173 SMVLDTGSDINWLQCRPCT--ECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDV--SACR 228
           ++++D+GSD++W+QC+PC    C++Q DP+FDP  S++Y+ +PC +  C  L      C 
Sbjct: 169 TVIIDSGSDVSWVQCKPCPLPMCHRQRDPLFDPAMSTTYAAVPCTSAACAQLGPYRRGCS 228

Query: 229 AN-RCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEG--LFVGSAGLLGL 285
           AN +C + + YGDGS   G    + ++ G    ++G   GC H + G       AG L L
Sbjct: 229 ANAQCQFGINYGDGSTATGTYSFDDLTLGPYDVIRGFRFGCAHADRGSAFDYDVAGSLAL 288

Query: 286 GGGMLSLTKQIK---ATSLAYCLVDRDSPAS----GVLEFNSARGGDAVTAPLIRNKKVD 338
           GGG  SL +Q         +YCL    S       GV    +      V+ PL+ +    
Sbjct: 289 GGGSQSLVQQTATRYGRVFSYCLPPTASSLGFLVLGVPPERAQLIPSFVSTPLLSSSMAP 348

Query: 339 TFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRL 398
           TFY V L    V G+ + +PP++F          ++D  T I+RL   AY +LR +F   
Sbjct: 349 TFYRVLLRAIIVAGRPLAVPPAVFSASS------VIDSSTIISRLPPTAYQALRAAFRSA 402

Query: 399 AGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFA 458
               +    V++ DTCYDF+G+RS+ +P+++L F  G  ++L A   L+     G+ C A
Sbjct: 403 MTMYRAAPPVSILDTCYDFTGVRSITLPSIALVFDGGATVNLDAAGILL-----GS-CLA 456

Query: 459 FAPTSS--ALSIIGNVQQQ 475
           FAPT+S      IGNVQQ+
Sbjct: 457 FAPTASDRMPGFIGNVQQK 475



 Score = 97.8 bits (242), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 81/282 (28%), Positives = 130/282 (46%), Gaps = 38/282 (13%)

Query: 218 QCKSLDVSACRAN-RCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLF 276
           Q K+L+   C AN +C + + YGDGS   G    + ++ G              D +GL 
Sbjct: 473 QQKTLE--GCSANAQCQFGINYGDGSTATGTYSFDDLTLGPYDV----------DRQGLP 520

Query: 277 VGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNKK 336
           + +A   G    + S       +SL +  +       GV    +A     V+ PL+ +  
Sbjct: 521 LRTATQYGR---VFSYCIPPSPSSLGFITL-------GVPPQRAALVPTFVSTPLLSSSS 570

Query: 337 VD-TFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSF 395
           +  TFY V L    V G+ + +PP++F          ++   T I+RL   AY +LR +F
Sbjct: 571 MPPTFYRVLLRAIIVAGRPLPVPPTVFSTSS------VIASTTVISRLPPTAYQALRAAF 624

Query: 396 VRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTF 455
            R     +    V++ DTCYDF+G+RS+ +P+++L F  G  ++L A   L+        
Sbjct: 625 RRAMTMYRTAPPVSILDTCYDFTGVRSITLPSIALVFDGGATVNLDAAGILL------QG 678

Query: 456 CFAFAPTSS--ALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           C AFAPT++      IGNVQQ+   V +D+    + F    C
Sbjct: 679 CLAFAPTATDRMPGFIGNVQQRTLEVVYDVPGKAIRFRSAAC 720


>gi|194699094|gb|ACF83631.1| unknown [Zea mays]
 gi|413938606|gb|AFW73157.1| hypothetical protein ZEAMMB73_333672 [Zea mays]
          Length = 452

 Score =  173 bits (439), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 135/368 (36%), Positives = 179/368 (48%), Gaps = 58/368 (15%)

Query: 145 STPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCT---ECYQQSDPIF 201
           + P   G   G+  Y     +GTP    +M +DTGSD++W+QC+PC+    CY Q DP+F
Sbjct: 126 TVPASWGYDIGTLNYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCSAAPSCYSQKDPLF 185

Query: 202 DPKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSV 261
           DP  SSSY+ +PC  P C  L                       G       S    G+V
Sbjct: 186 DPAQSSSYAAVPCGGPVCAGL-----------------------GIYAASACSAAQCGAV 222

Query: 262 KGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS---LAYCLVDRDSPASGVLEF 318
           +G   GCGH   GLF G  GLLGLG    SL +Q   T     +YCL  + S A G L  
Sbjct: 223 QGFFFGCGHAQSGLFNGVDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPSTA-GYLTL 281

Query: 319 NSARGGDAVTAP------LIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGI 372
               GG +  AP      L+ +    T+Y V LTG SVGGQ + +P S F          
Sbjct: 282 G--VGGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGGTV----- 334

Query: 373 IVDCGTAITRLQTQAYNSLRDSFVR-LAGNLKPTS-GVALFDTCYDFSGLRSVRVPTVSL 430
            VD GT +TRL   AY +LR +F   +A    PT+    + DTCY+F+G  +V +P V+L
Sbjct: 335 -VDTGTVVTRLPPTAYAALRSAFRSGMASYGYPTAPSNGILDTCYNFAGYGTVTLPNVAL 393

Query: 431 HFGAGKALDLPAKNYLIPVDSAGTF-CFAFAPTSS--ALSIIGNVQQQGTRVSFDLANNR 487
            FG+G  + L A   L       +F C AFAP+ S   ++I+GNVQQ+   V  D     
Sbjct: 394 TFGSGATVTLGADGIL-------SFGCLAFAPSGSDGGMAILGNVQQRSFEVRID--GTS 444

Query: 488 VGFTPNKC 495
           VGF P+ C
Sbjct: 445 VGFKPSSC 452


>gi|195627138|gb|ACG35399.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
          Length = 431

 Score =  173 bits (438), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 127/370 (34%), Positives = 177/370 (47%), Gaps = 25/370 (6%)

Query: 145 STPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPK 204
           S PV SG  Q    Y  R G+GTP +Q  + LDT +D  W  C PC  C   S   F P 
Sbjct: 67  SAPVASG--QTPPSYVVRAGLGTPVQQLLLALDTSADATWSHCAPCDTCPAGSR--FIPA 122

Query: 205 TSSSYSPLPCAAPQCKSLDVSACRANR--------CLYQVAYGDGSFTVGDLVTETVSFG 256
           +SSSY+ LPCA+  C   +   C AN+        C +   + D SF    L ++T+  G
Sbjct: 123 SSSSYASLPCASDWCPLFEGQPCPANQDASAPLPACAFSKPFADTSFQA-SLGSDTLRLG 181

Query: 257 NSGSVKGIALGCGHDNEG--LFVGSAGLLGLGGGMLSLTKQIKATS---LAYCLVD-RDS 310
              ++ G A GC     G    +   GLLGLG G +SL  Q  +T     +YCL   R  
Sbjct: 182 KD-AIAGYAFGCVGAVAGPTTNLPKQGLLGLGRGPMSLLSQTGSTYNGVFSYCLPSYRSY 240

Query: 311 PASGVLEFNSA-RGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGD 369
             SG L   +A +  +    PL+ N    + YYV +TG SVG   V++P   F  D A  
Sbjct: 241 YFSGSLRLGAAGQPRNVRYTPLLTNPHRPSLYYVNVTGLSVGRTWVKVPAGSFAFDPATG 300

Query: 370 GGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVS 429
            G ++D GT ITR     Y +LR+ F R        + +  FDTC++   + +   P V+
Sbjct: 301 AGTVIDSGTVITRWTAPVYAALREEFRRQVAAPSGYTSLGAFDTCFNTDEVAAGGAPPVT 360

Query: 430 LHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPT----SSALSIIGNVQQQGTRVSFDLAN 485
           LH   G  L LP +N LI   +    C A A      ++ ++++ N+QQQ  RV  D+A 
Sbjct: 361 LHMDGGVDLTLPMENTLIHSSATPLACLAMAEAPQNVNAVVNVVANLQQQNVRVVVDVAG 420

Query: 486 NRVGFTPNKC 495
           +RVGF    C
Sbjct: 421 SRVGFAREPC 430


>gi|359492937|ref|XP_002283889.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 439

 Score =  173 bits (438), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 124/368 (33%), Positives = 171/368 (46%), Gaps = 22/368 (5%)

Query: 141 PEDFSTPVVSGAS-QGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDP 199
           P+  S P+ SG      G Y  R+ +GTP +   MVLDT  D  W+   PC +C   S P
Sbjct: 80  PKATSVPIASGQQVLNIGNYVVRVKLGTPGQLMFMVLDTSRDAAWV---PCADCAGCSSP 136

Query: 200 IFDPKTSSSYSPLPCAAPQCKSLDVSACRAN---RCLYQVAYGDGSFTVGDLVTETVSFG 256
            F P TSS+Y+ L C+ PQC  +   +C       C +   YG  S     L  +++   
Sbjct: 137 TFSPNTSSTYASLQCSVPQCTQVRGLSCPTTGTAACFFNQTYGGDSSFSAMLSQDSLGLA 196

Query: 257 NSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQ---IKATSLAYCLVDRDSPA- 312
              ++   + GC +   G  +   GLLGLG G +SL  Q   + +   +YC     S   
Sbjct: 197 -VDTLPSYSFGCVNAVSGSTLPPQGLLGLGRGPMSLLSQSGSLYSGVFSYCFPSFKSYYF 255

Query: 313 SGVLEFNS-ARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGG 371
           SG L      +  +  T PL+RN    T YYV LTG SVG   V + P L   D     G
Sbjct: 256 SGSLRLGPLGQPKNIRTTPLLRNPHRPTLYYVNLTGVSVGRVLVPVAPELLAFDPNTGAG 315

Query: 372 IIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLH 431
            I+D GT ITR     Y ++RD F +      P + +  FDTC  F+       P V+ H
Sbjct: 316 TIIDSGTVITRFVEPVYAAIRDEFRKQVKG--PFATIGAFDTC--FAATNEDIAPPVTFH 371

Query: 432 FGAGKALDLPAKNYLIPVDSAGTFCFAFAP----TSSALSIIGNVQQQGTRVSFDLANNR 487
           F  G  L LP +N LI   +    C A A      +S L++I N+QQQ  R+ FD+ N+R
Sbjct: 372 F-TGMDLKLPLENTLIHSSAGSLACLAMAAAPNNVNSVLNVIANLQQQNLRIMFDVTNSR 430

Query: 488 VGFTPNKC 495
           +G     C
Sbjct: 431 LGIARELC 438


>gi|296090291|emb|CBI40110.3| unnamed protein product [Vitis vinifera]
          Length = 408

 Score =  173 bits (438), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 129/390 (33%), Positives = 185/390 (47%), Gaps = 36/390 (9%)

Query: 126 NVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWL 185
           NV+R   + A A I  E  +  V     Q    +     VG PP    + +DTGSD+ W+
Sbjct: 30  NVERRRTRRA-AFITDEIQANMVADDRGQA---FLVNFSVGRPPVPQLVGIDTGSDLLWV 85

Query: 186 QCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQC-KSLDVSACRANRCLYQVAYGDGSFT 244
           QCRPC +C++QS PIFDP  SS+Y  L   +P C  S        N+C+Y  +Y DGS +
Sbjct: 86  QCRPCADCFRQSTPIFDPSKSSTYVDLSYDSPICPNSPQKKYNHLNQCIYNASYADGSTS 145

Query: 245 VGDLVTETVSFGNSG----SVKGIALGCGHDNEGLFVG-SAGLLGLGGGMLSLTKQIKAT 299
            G+L TE + F  S     +V  +  GCGH N G F G  +G+LGL  G  S+  ++  +
Sbjct: 146 SGNLATEDIVFETSDQGTVTVSSVVFGCGHSNRGRFDGQQSGILGLSAGDQSIVSRL-GS 204

Query: 300 SLAYCLVDRDSPASGVLEFNSARGGDAV-----TAPLIRNKKVDTFYYVGLTGFSVGGQA 354
             +YC+ D   P       N    GD V     + P       + FYYV L G SVG   
Sbjct: 205 RFSYCIGDLFDPH---YTHNQLVLGDGVKMEGSSTPF---HTFNGFYYVTLEGISVGETR 258

Query: 355 VQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLA-GNLKPTSGVALFDT 413
           + I P +F+  E+G GG+++D GT  T L    ++ L +   RL  G+ +      ++ T
Sbjct: 259 LDINPEVFQRTESGQGGVVMDSGTTATFLAKDGFDPLSNEIQRLVRGHFQQ----VIYRT 314

Query: 414 -----CYDFSGLRSVR-VPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTS--SA 465
                CY       +R  P ++ HF  G  L L A N L    +   FC A   ++  + 
Sbjct: 315 IPGWLCYKGRVNEDLRGFPELAFHFAEGADLVLDA-NSLFVQKNQDVFCLAVLESNLKNI 373

Query: 466 LSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            S+IG + QQ   V++DL   RV F    C
Sbjct: 374 GSVIGIMAQQHYNVAYDLIGKRVYFQRTDC 403


>gi|125595861|gb|EAZ35641.1| hypothetical protein OsJ_19928 [Oryza sativa Japonica Group]
          Length = 629

 Score =  173 bits (438), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 108/319 (33%), Positives = 166/319 (52%), Gaps = 28/319 (8%)

Query: 173 SMVLDTGSDINWLQCRPCT--ECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDV--SACR 228
           ++++D+GSD++W+QC+PC    C++Q DP+FDP  S++Y+ +PC +  C  L      C 
Sbjct: 78  TVIIDSGSDVSWVQCKPCPLPMCHRQRDPLFDPAMSTTYAAVPCTSAACAQLGPYRRGCS 137

Query: 229 AN-RCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEG--LFVGSAGLLGL 285
           AN +C + + YGDGS   G    + ++ G    ++G   GC H + G       AG L L
Sbjct: 138 ANAQCQFGINYGDGSTATGTYSFDDLTLGPYDVIRGFRFGCAHADRGSAFDYDVAGSLAL 197

Query: 286 GGGMLSLTKQIK---ATSLAYCLVDRDSPAS----GVLEFNSARGGDAVTAPLIRNKKVD 338
           GGG  SL +Q         +YCL    S       GV    +      V+ PL+ +    
Sbjct: 198 GGGSQSLVQQTATRYGRVFSYCLPPTASSLGFLVLGVPPERAQLIPSFVSTPLLSSSMAP 257

Query: 339 TFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRL 398
           TFY V L    V G+ + +PP++F          ++D  T I+RL   AY +LR +F   
Sbjct: 258 TFYRVLLRAIIVAGRPLAVPPAVFSASS------VIDSSTIISRLPPTAYQALRAAFRSA 311

Query: 399 AGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFA 458
               +    V++ DTCYDF+G+RS+ +P+++L F  G  ++L A   L+     G+ C A
Sbjct: 312 MTMYRAAPPVSILDTCYDFTGVRSITLPSIALVFDGGATVNLDAAGILL-----GS-CLA 365

Query: 459 FAPTSS--ALSIIGNVQQQ 475
           FAPT+S      IGNVQQ+
Sbjct: 366 FAPTASDRMPGFIGNVQQK 384



 Score = 97.1 bits (240), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 81/282 (28%), Positives = 130/282 (46%), Gaps = 38/282 (13%)

Query: 218 QCKSLDVSACRAN-RCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLF 276
           Q K+L+   C AN +C + + YGDGS   G    + ++ G              D +GL 
Sbjct: 382 QQKTLE--GCSANAQCQFGINYGDGSTATGTYSFDDLTLGPYDV----------DRQGLP 429

Query: 277 VGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNKK 336
           + +A   G    + S       +SL +  +       GV    +A     V+ PL+ +  
Sbjct: 430 LRTATQYGR---VFSYCIPPSPSSLGFITL-------GVPPQRAALVPTFVSTPLLSSSS 479

Query: 337 VD-TFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSF 395
           +  TFY V L    V G+ + +PP++F          ++   T I+RL   AY +LR +F
Sbjct: 480 MPPTFYRVLLRAIIVAGRPLPVPPTVFSTSS------VIASTTVISRLPPTAYQALRAAF 533

Query: 396 VRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTF 455
            R     +    V++ DTCYDF+G+RS+ +P+++L F  G  ++L A   L+        
Sbjct: 534 RRAMTMYRTAPPVSILDTCYDFTGVRSITLPSIALVFDGGATVNLDAAGILL------QG 587

Query: 456 CFAFAPTSS--ALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           C AFAPT++      IGNVQQ+   V +D+    + F    C
Sbjct: 588 CLAFAPTATDRMPGFIGNVQQRTLEVVYDVPGKAIRFRSAAC 629


>gi|358346443|ref|XP_003637277.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355503212|gb|AES84415.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 434

 Score =  173 bits (438), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 129/429 (30%), Positives = 207/429 (48%), Gaps = 49/429 (11%)

Query: 95  HNDYRSLVLSRLERDSAR---VNTLITKLQLAIYNVDRHELKPAEAQILPEDFS---TPV 148
           H   + L +  + RD ++    +  +TK Q A YNV    +         ++FS      
Sbjct: 22  HASKKGLSIEMIHRDFSKSPLYHPTVTKFQRA-YNVVHRSIN--RVNYFTKEFSLNKNQP 78

Query: 149 VSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSS 208
           VS  +   GEY     VGTPP +    +DTGS+I WLQC+PC  C+ Q+ PIF+P  SSS
Sbjct: 79  VSTLTPELGEYLISYSVGTPPFKVYGFMDTGSNIVWLQCQPCNTCFNQTSPIFNPSKSSS 138

Query: 209 YSPLPCAAPQCKSLDVS--ACR--ANRCLYQVAYGDGSFTVGDLVTETVSF----GNSGS 260
           Y  +PC +  CK  + +  +C    + C Y + YG  + + GDL  ++++     G+S  
Sbjct: 139 YKNIPCTSSTCKDTNDTHISCSNGGDVCEYSITYGGDAKSQGDLSNDSLTLDSTSGSSVL 198

Query: 261 VKGIALGCGH-----DNEGLFVGSAGLLGLGGGMLSLTKQIKATSL----AYCLV--DRD 309
              I +GCGH     DN      S+G++G+G G +SL KQ+ ++S+    +YCL+  + D
Sbjct: 199 FPNIVIGCGHINVLQDNS----QSSGVVGMGRGPMSLIKQVGSSSVGSKFSYCLIPYNSD 254

Query: 310 SPASGVLEFNS---ARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDE 366
           S +S  L F       G   V+ P+++    + +Y++ L  FSVG   ++      E   
Sbjct: 255 SNSSSKLIFGEDVVVSGEIVVSTPMVKVNGQENYYFLTLEAFSVGNNRIEYG----ERSN 310

Query: 367 AGDGGIIVDCGTAITRLQTQAYNSLRDSF---VRLAGNLKPTSGVALFDTCYDFSGLRSV 423
           A    I++D GT +T L     + L       V+L     P   ++L   CY+ +G + +
Sbjct: 311 ASTQNILIDSGTPLTMLPNLFLSKLVSYVAQEVKLPRIEPPDHHLSL---CYNTTG-KQL 366

Query: 424 RVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDL 483
            VP ++ HF  G  + L +     P +  G  CF F  +S+ L I GN+ Q    + +DL
Sbjct: 367 NVPDITAHFN-GADVKLNSNGTFFPFED-GIMCFGFI-SSNGLEIFGNIAQNNLLIDYDL 423

Query: 484 ANNRVGFTP 492
               + F P
Sbjct: 424 EKEIISFKP 432


>gi|147786881|emb|CAN62311.1| hypothetical protein VITISV_008929 [Vitis vinifera]
          Length = 408

 Score =  173 bits (438), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 129/390 (33%), Positives = 185/390 (47%), Gaps = 36/390 (9%)

Query: 126 NVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWL 185
           NV+R   + A A I  E  +  V     Q    +     VG PP    + +DTGSD+ W+
Sbjct: 30  NVERRRTRRA-AFIXDEIQANMVADDRGQA---FLVNFSVGRPPVPQLVGIDTGSDLLWV 85

Query: 186 QCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQC-KSLDVSACRANRCLYQVAYGDGSFT 244
           QCRPC +C++QS PIFDP  SS+Y  L   +P C  S        N+C+Y  +Y DGS +
Sbjct: 86  QCRPCADCFRQSTPIFDPSKSSTYVDLSYDSPICPNSPQKKYNHLNQCIYNASYADGSTS 145

Query: 245 VGDLVTETVSFGNSG----SVKGIALGCGHDNEGLFVG-SAGLLGLGGGMLSLTKQIKAT 299
            G+L TE + F  S     +V  +  GCGH N G F G  +G+LGL  G  S+  ++  +
Sbjct: 146 SGNLATEDIVFETSDQGTVTVSSVVFGCGHSNRGRFDGQQSGILGLSAGDQSIVSRL-GS 204

Query: 300 SLAYCLVDRDSPASGVLEFNSARGGDAV-----TAPLIRNKKVDTFYYVGLTGFSVGGQA 354
             +YC+ D   P       N    GD V     + P       + FYYV L G SVG   
Sbjct: 205 RFSYCIGDLFDPH---YTHNQLVLGDGVKMEGSSTPF---HTFNGFYYVTLEGISVGETR 258

Query: 355 VQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLA-GNLKPTSGVALFDT 413
           + I P +F+  E+G GG+++D GT  T L    ++ L +   RL  G+ +      ++ T
Sbjct: 259 LDINPEVFQRTESGQGGVVMDSGTTATFLAKDGFDPLSNEIQRLVRGHFQQ----VIYRT 314

Query: 414 -----CYDFSGLRSVR-VPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTS--SA 465
                CY       +R  P ++ HF  G  L L A N L    +   FC A   ++  + 
Sbjct: 315 IPGWLCYKGRVNEDLRGFPELAFHFAEGADLVLDA-NSLFVQKNQDVFCLAVLESNLKNI 373

Query: 466 LSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            S+IG + QQ   V++DL   RV F    C
Sbjct: 374 GSVIGIMAQQHYNVAYDLIGKRVYFQRTDC 403


>gi|225217039|gb|ACN85323.1| aspartic proteinase nepenthesin-1 precursor [Oryza brachyantha]
          Length = 287

 Score =  172 bits (437), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 114/280 (40%), Positives = 150/280 (53%), Gaps = 19/280 (6%)

Query: 227 CRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLG 286
           C    CLY V YGDGS+T+G    +T++  +  ++KG   GCG  NEGLF  +AGLLGLG
Sbjct: 16  CSGGHCLYGVQYGDGSYTIGFFAMDTLTLSSHDAIKGFRFGCGERNEGLFGEAAGLLGLG 75

Query: 287 GGMLSLTKQI---KATSLAYCLVDRDSPASGVLEF----NSARGGDAVTAPLIRNKKVDT 339
            G  SL  Q         A+C   R S  +G LEF    + A      T P++ +    T
Sbjct: 76  RGKTSLPVQTYDKYGGVFAHCFPARSS-GTGYLEFGPGSSPAVSAKLSTTPMLIDTG-PT 133

Query: 340 FYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFV--R 397
           FYYVG+TG  VGG+ + IP S+F        G IVD GT ITRL   AY+SLR +F    
Sbjct: 134 FYYVGMTGIRVGGKLLPIPQSVFAA-----AGTIVDSGTVITRLPPAAYSSLRSAFAASM 188

Query: 398 LAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCF 457
            A   K    ++L DTCYD +G   V +PTVSL F  G +LD+ A   +I   S    C 
Sbjct: 189 AARGYKRAPALSLLDTCYDLTGASEVAIPTVSLLFQGGVSLDVDASG-IIYAASVSQACL 247

Query: 458 AFAPTSSA--LSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            FA   +A  ++I+GN Q +   V +D+A+  VGF P  C
Sbjct: 248 GFAGNEAADDVAIVGNTQLKTFGVVYDIASKVVGFCPGAC 287


>gi|118486912|gb|ABK95290.1| unknown [Populus trichocarpa]
          Length = 438

 Score =  172 bits (437), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 131/401 (32%), Positives = 183/401 (45%), Gaps = 27/401 (6%)

Query: 108 RDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQ-GSGEYFSRIGVG 166
           +  + VNT+IT     + + D   LK        +  + P+  G        Y  R+ +G
Sbjct: 51  KQESWVNTVIT-----MASKDPERLKYLSTLADQKTTAVPIAPGQQVLKIANYVVRVKLG 105

Query: 167 TPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSA 226
           TP +Q  MVLDT +D  W+   PC+ C   S   F P  S++   L C+  QC  +   +
Sbjct: 106 TPGQQMFMVLDTSNDAAWV---PCSGCTGFSSTTFLPNASTTLGSLDCSGAQCSQVRGFS 162

Query: 227 CRA---NRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLL 283
           C A   + CL+  +YG  S     LV + ++  N   + G   GC +   G  +   GLL
Sbjct: 163 CPATGSSACLFNQSYGGDSSLTATLVQDAITLAND-VIPGFTFGCINAVSGGSIPPQGLL 221

Query: 284 GLGGGMLSLTKQIKAT---SLAYCLVDRDSPA-SGVLEFNSARGGDAV-TAPLIRNKKVD 338
           GLG G +SL  Q  A      +YCL    S   SG L+        ++ T PL+RN    
Sbjct: 222 GLGRGPISLISQAGAMYSGVFSYCLPSFKSYYFSGSLKLGPVGQPKSIRTTPLLRNPHRP 281

Query: 339 TFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRL 398
           + YYV LTG SVG   V IP      D     G I+D GT ITR     Y ++RD F + 
Sbjct: 282 SLYYVNLTGVSVGRIKVPIPSEQLVFDPNTGAGTIIDSGTVITRFVQPVYFAIRDEFRKQ 341

Query: 399 AGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFA 458
                P S +  FDTC  F+       P ++LHF  G  L LP +N LI   S    C +
Sbjct: 342 VNG--PISSLGAFDTC--FAATNEAEAPAITLHF-EGLNLVLPMENSLIHSSSGSLACLS 396

Query: 459 FAP----TSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            A      +S L++I N+QQQ  R+ FD  N+R+G     C
Sbjct: 397 MAAAPNNVNSVLNVIANLQQQNLRIMFDTTNSRLGIARELC 437


>gi|226491934|ref|NP_001140743.1| uncharacterized protein LOC100272818 [Zea mays]
 gi|194700872|gb|ACF84520.1| unknown [Zea mays]
          Length = 351

 Score =  172 bits (436), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 113/340 (33%), Positives = 172/340 (50%), Gaps = 35/340 (10%)

Query: 173 SMVLDTGSDINWLQCRPCT--ECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLD--VSACR 228
           ++VLD+ SD+ W+QC PC    C+ Q D  +DP  S + +   C++P C +L    + C 
Sbjct: 30  TVVLDSASDVPWVQCVPCPIPPCHPQVDSFYDPSRSPTSAAFSCSSPTCTALGPYANGCA 89

Query: 229 ANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLF-VGSAGLLGLGG 287
            N+C Y V Y DGS T G  + + ++     +V G   GC H  +G F   +AG++ LGG
Sbjct: 90  NNQCQYLVRYPDGSSTSGAYIADLLTLDAGNAVSGFKFGCSHAEQGSFDARAAGIMALGG 149

Query: 288 G---MLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGDA--VTAPLIRNKKVDTFYY 342
           G   +LS T      + +YC+    S  SG       R   +  V  P++R ++  TFY 
Sbjct: 150 GPESLLSQTASRYGNAFSYCIPATAS-DSGFFTLGVPRRASSRYVVTPMVRFRQAATFYG 208

Query: 343 VGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNL 402
           V L   +VGGQ + + P++F        G ++D  TAITRL   AY +LR +F       
Sbjct: 209 VLLRTITVGGQRLGVAPAVFA------AGSVLDSRTAITRLPPTAYQALRAAFRSSMTMY 262

Query: 403 KPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTF---CFAF 459
           +        DTCYDF+G+ ++R+P +SL F          +N ++P+D +G     C AF
Sbjct: 263 RSAPPKGYLDTCYDFTGVVNIRLPKISLVFD---------RNAVLPLDPSGILFNDCLAF 313

Query: 460 APTSSA----LSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
             TS+A      ++G+VQQQ   V +D+    VGF    C
Sbjct: 314 --TSNADDRMPGVLGSVQQQTIEVLYDVGGGAVGFRQGAC 351


>gi|357116170|ref|XP_003559856.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 460

 Score =  172 bits (436), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 121/368 (32%), Positives = 177/368 (48%), Gaps = 37/368 (10%)

Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQ 218
           Y  R  +GTPP++  + +DT +D  W+ C  C  C   + P F+P +S+++ P+PC AP 
Sbjct: 94  YLVRASLGTPPQRLLLAVDTSNDAAWVPCAGCHGC-PTTAPSFNPASSATFRPVPCGAPP 152

Query: 219 CKSLDVSACRA-----NRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNE 273
           C      +C +     N C + ++YGD S              N G +KG   GC   + 
Sbjct: 153 CSQAPNPSCTSLAKSKNSCGFSLSYGDSSLDATLSQDNLAVTANGGVIKGYTFGCLTKSN 212

Query: 274 GLFVGS---AGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPA---SGVLEFNSARGGDAV 327
           G    +    GL     G ++ TK I   + +YCL      A   SG L     R G   
Sbjct: 213 GSAAPAQGLLGLGRGPLGFVAQTKGIYEGTFSYCLPSYYRSAANFSGSLTLG--RKGQPA 270

Query: 328 -----TAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITR 382
                T PL+ +    + YYV +TG  +G ++V IPPS    D A   G ++D GT   R
Sbjct: 271 PEKMKTTPLLASPHRPSLYYVAMTGVRIGKKSVPIPPSALAFDAATGAGTVLDSGTMFAR 330

Query: 383 LQTQAYNSLRDSF-VRLAGNLK---------PTSGVALFDTCYDFSGLRSVRVPTVSLHF 432
           L   AY ++RD    R+AG+L+           S +  FDTCY+ S   +V  P V+L F
Sbjct: 331 LAQPAYAAVRDEVRRRVAGSLRRRGGGGASVSVSSLGGFDTCYNVS---TVAWPAVTLVF 387

Query: 433 GAGKALDLPAKNYLIPVDSAGTFCFAFAPT-----SSALSIIGNVQQQGTRVSFDLANNR 487
           G G  + LP +N +I      T C A A +     ++AL++IG++QQQ  RV FD+ N R
Sbjct: 388 GGGMEVRLPEENVVIRSTYGSTSCLAMAASPADGVNAALNVIGSLQQQNHRVLFDVPNAR 447

Query: 488 VGFTPNKC 495
           VGF   +C
Sbjct: 448 VGFARERC 455


>gi|413938616|gb|AFW73167.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
          Length = 452

 Score =  171 bits (434), Expect = 6e-40,   Method: Compositional matrix adjust.
 Identities = 133/358 (37%), Positives = 175/358 (48%), Gaps = 58/358 (16%)

Query: 155 GSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCT---ECYQQSDPIFDPKTSSSYSP 211
           G+  Y     +GTP    +M +DTGSD++W+QC+PC     CY Q DP+FDP  SSSY+ 
Sbjct: 136 GTLNYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCAAAPSCYSQKDPLFDPAQSSSYAA 195

Query: 212 LPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHD 271
           +PC  P C  L                       G       S    G+V+G   GCGH 
Sbjct: 196 VPCGGPVCAGL-----------------------GIYAASACSAAQCGAVQGFFFGCGHA 232

Query: 272 NEGLFVGSAGLLGLGGGMLSLTKQIKATS---LAYCLVDRDSPASGVLEFNSARGGDAVT 328
             GLF G  GLLGLG    SL +Q   T     +YCL  + S A G L      GG +  
Sbjct: 233 QSGLFNGVDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPSTA-GYLTLGV--GGPSGA 289

Query: 329 AP------LIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITR 382
           AP      L+ +    T+Y V LTG SVGGQ + +P S F           VD GT +TR
Sbjct: 290 APGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGGTV------VDTGTVVTR 343

Query: 383 LQTQAYNSLRDSFVR-LAGNLKPTS-GVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDL 440
           L   AY +LR +F   +A    PT+    + DTCY+F+G  +V +P V+L FG+G  + L
Sbjct: 344 LPPTAYAALRSAFRSGMASYGYPTAPSNGILDTCYNFAGYGTVTLPNVALTFGSGATVTL 403

Query: 441 PAKNYLIPVDSAGTF-CFAFAPTSS--ALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            A   L       +F C AFAP+ S   ++I+GNVQQ+   V  D     VGF P+ C
Sbjct: 404 GADGIL-------SFGCLAFAPSGSDGGMAILGNVQQRSFEVRID--GTSVGFKPSSC 452


>gi|222637180|gb|EEE67312.1| hypothetical protein OsJ_24552 [Oryza sativa Japonica Group]
          Length = 420

 Score =  171 bits (434), Expect = 7e-40,   Method: Compositional matrix adjust.
 Identities = 120/356 (33%), Positives = 169/356 (47%), Gaps = 33/356 (9%)

Query: 154 QGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLP 213
            G G Y   I VGTP   FS+V DTGSD+ W QC PCT+C+QQ  P F P +SS++S LP
Sbjct: 81  NGVGGYNMNISVGTPLLTFSVVADTGSDLIWTQCAPCTKCFQQPAPPFQPASSSTFSKLP 140

Query: 214 CAAPQCKSL--DVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHD 271
           C +  C+ L   +  C A  C+Y   YG G +T G L TET+  G++ S   +A GC  +
Sbjct: 141 CTSSFCQFLPNSIRTCNATGCVYNYKYGSG-YTAGYLATETLKVGDA-SFPSVAFGCSTE 198

Query: 272 NEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARG---GDAVT 328
           N GL     G L LG G  S           YCL    +  +  + F S      G+  +
Sbjct: 199 N-GL-----GQLDLGVGRFS-----------YCLRSGSAAGASPILFGSLANLTDGNVQS 241

Query: 329 APLIRNKKVD-TFYYVGLTGFSVGGQAVQIPPSLFEMDEAG-DGGIIVDCGTAITRLQTQ 386
            P + N  V  ++YYV LTG +VG   + +  S F   + G  GG IVD GT +T L   
Sbjct: 242 TPFVNNPAVHPSYYYVNLTGITVGETDLPVTTSTFGFTQNGLGGGTIVDSGTTLTYLAKD 301

Query: 387 AYNSLRDSFVRLAGNLKPTSGVALFDTCYDFS--GLRSVRVPTVSLHFGAGKALDLPAKN 444
            Y  ++ +F+    ++   +G    D C+  +  G   + VP++ L F  G    +P   
Sbjct: 302 GYEMVKQAFLSQTADVTTVNGTRGLDLCFKSTGGGGGGIAVPSLVLRFDGGAEYAVPTYF 361

Query: 445 YLIPVDSAGTF---CFAFAPTS--SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
             +  DS G+    C    P      +S+IGNV Q    + +DL      F P  C
Sbjct: 362 AGVETDSQGSVTVACLMMLPAKGDQPMSVIGNVMQMDMHLLYDLDGGIFSFAPADC 417


>gi|362799904|dbj|BAL41445.1| aspartyl protease 1 [Linum grandiflorum]
          Length = 449

 Score =  171 bits (433), Expect = 8e-40,   Method: Compositional matrix adjust.
 Identities = 121/374 (32%), Positives = 179/374 (47%), Gaps = 40/374 (10%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAA 216
           GEY   + +GTPP     + DTGSD+ WLQ +PC +CY Q  PIFDP  S+++  LPC  
Sbjct: 78  GEYMMNLSIGTPPFPILAIADTGSDLTWLQSKPCDQCYPQKGPIFDPSNSTTFHKLPCTT 137

Query: 217 PQCKSLDVSA---CRANRCLYQVAYGDGSFTVGDLVTETVSFGN-SGSVKGIALGCGHDN 272
             C +LD SA        C Y  +YGD S+T G L ++TV+ GN S  ++ +A GCG  N
Sbjct: 138 APCNALDESARSCTDPTTCGYTYSYGDHSYTTGYLASDTVTVGNASVQIRNVAFGCGTRN 197

Query: 273 EGLFVGSAGLLGLGGGM-LSLTKQIKAT---SLAYCLV---------DRDSPASGVLEF- 318
            G F      +   GG  LS   Q+  T     +YCL+           DSPA+  + F 
Sbjct: 198 GGNFDEQGSGIVGLGGGNLSFVSQLGDTIGKKFSYCLLPLENEISSQPSDSPATSRIVFG 257

Query: 319 -------NSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEM------- 364
                  +S  G    T PL+ NK+  T+YY+ +   +VG + +    S  +        
Sbjct: 258 DNPVFSSSSTNGVVFATTPLV-NKEPSTYYYLTIEAITVGRKKLLYSSSSSKTASYDSGS 316

Query: 365 -DEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGV--ALFDTCYDFSGLR 421
                +G II+D GT +T L+ + Y +L  + V     ++  + V  ++F  C+  SG  
Sbjct: 317 KSSVEEGNIIIDSGTTLTFLEEEFYGALEAALVEEI-KMERVNDVKNSMFSLCFK-SGKE 374

Query: 422 SVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSF 481
            V +P + +HF  G  ++L   N  +  +  G  CF   PT+  + I GN+ Q    V +
Sbjct: 375 EVELPLMKVHFRGGADVELKPVNTFVRAEE-GLVCFTMLPTND-VGIYGNLAQMNFVVGY 432

Query: 482 DLANNRVGFTPNKC 495
           DL    V F P  C
Sbjct: 433 DLGKRTVSFLPADC 446


>gi|147809812|emb|CAN71447.1| hypothetical protein VITISV_040904 [Vitis vinifera]
          Length = 988

 Score =  171 bits (433), Expect = 8e-40,   Method: Compositional matrix adjust.
 Identities = 120/358 (33%), Positives = 175/358 (48%), Gaps = 36/358 (10%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAA 216
           G +   +  GTPP++F ++LDTGS I W QC+ C  C + S   FD   SS+YS   C  
Sbjct: 125 GNFLVDVAFGTPPQKFKLILDTGSSITWTQCKACVHCLKDSHRHFDSLASSTYSFGSC-- 182

Query: 217 PQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLF 276
                  + +   N   Y + YGD S +VG+   +T++   S   +    GCG +NEG F
Sbjct: 183 -------IPSTVGN--TYNMTYGDKSTSVGNYGCDTMTLEPSDVFQKFQFGCGRNNEGDF 233

Query: 277 -VGSAGLLGLGGGMLSLTKQIKA---TSLAYCLVDRDSPASGVL------EFNSARGGDA 326
             G+ G+LGLG G LS   Q  +      +YCL + +S  S +       + +S +    
Sbjct: 234 GSGADGMLGLGQGQLSTVSQTASKFKKVFSYCLPEENSIGSLLFGEKATSQSSSLKFTSL 293

Query: 327 VTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQ 386
           V  P     +   +Y+V L   SVG + + IP S+F        G I+D GT ITRL  +
Sbjct: 294 VNGPGTSGLEESGYYFVKLLDISVGNKRLNIPSSVF-----ASPGTIIDSGTVITRLPQR 348

Query: 387 AYNSLRDSFVRLAGNLKPTSGVA----LFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPA 442
           AY++L+ +F +       ++G      + DTCY+ SG + V +P   LHFG G  + L  
Sbjct: 349 AYSALKAAFKKAMAKYPLSNGRRKENDMLDTCYNLSGRKDVLLPEXVLHFGDGADVRLNG 408

Query: 443 KNYLIPVDSAGTFCFAFAPTSSA-----LSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           K  +   D A   C AFA  S +     L+IIGN QQ    V +D+   R+GF  N C
Sbjct: 409 KRVVWGND-ASRLCLAFAGNSKSTMNPELTIIGNRQQVSLTVLYDIRGRRIGFGGNGC 465


>gi|125555051|gb|EAZ00657.1| hypothetical protein OsI_22678 [Oryza sativa Indica Group]
          Length = 435

 Score =  171 bits (433), Expect = 8e-40,   Method: Compositional matrix adjust.
 Identities = 117/374 (31%), Positives = 180/374 (48%), Gaps = 35/374 (9%)

Query: 146 TPVVSGASQGSG--EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDP 203
           TP+V+  S   G  EY    G G P ++F +  DT   ++ L+C+PC       DP F+P
Sbjct: 73  TPMVAPISVAPGALEYRVLAGYGAPAQRFPVAFDTNFGVSVLRCKPCVGG-APCDPAFEP 131

Query: 204 KTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKG 263
             SSS++ +PC +P+C       C    C + + +G+ +   G LV +T++   S +  G
Sbjct: 132 SRSSSFAAIPCGSPECAV----ECTGASCPFTIQFGNVTVANGTLVRDTLTLPPSATFAG 187

Query: 264 IALGC---GHDNEGLFVGSAGLLGLGGGMLSLTKQI-------KATSLAYCLVDRDSPAS 313
              GC   G D +  F G+ GL+ L     SL  ++        A + +YCL    + +S
Sbjct: 188 FTFGCIEVGADAD-TFDGAVGLIDLSRSSHSLASRVISNGATTSAAAFSYCLPSSSATSS 246

Query: 314 -GVLEFNSAR----GGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAG 368
            G L   ++R    GGD   AP+  N      Y+V L G SVGG+ + +PP++F      
Sbjct: 247 RGFLSIGASRPEYSGGDIKYAPMSSNPNHPNSYFVELVGISVGGEDLPVPPAVFAAH--- 303

Query: 369 DGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTV 428
             G +++  T  T L   AY +LRD+F R            + DTCY+ +GL S+ VPTV
Sbjct: 304 --GTLLEAATEFTFLAPAAYAALRDAFRRDMAPYPAAPPFRVLDTCYNLTGLASLAVPTV 361

Query: 429 SLHFGAGKALDLPAKNYLIPVDSAGTFC-------FAFAPTSSALSIIGNVQQQGTRVSF 481
           +L F  G  L+L  +  +   D +  F         A    +  +S+IG + Q+ T V +
Sbjct: 362 ALRFAGGTELELDVRQMMYFADPSSVFSSVACLAFAAAPLPAFPVSVIGTLAQRSTEVVY 421

Query: 482 DLANNRVGFTPNKC 495
           DL   RVGF P +C
Sbjct: 422 DLRGGRVGFIPGRC 435


>gi|194698750|gb|ACF83459.1| unknown [Zea mays]
 gi|194703964|gb|ACF86066.1| unknown [Zea mays]
 gi|219886221|gb|ACL53485.1| unknown [Zea mays]
 gi|219886359|gb|ACL53554.1| unknown [Zea mays]
 gi|223950085|gb|ACN29126.1| unknown [Zea mays]
 gi|414865218|tpg|DAA43775.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
          Length = 431

 Score =  171 bits (433), Expect = 9e-40,   Method: Compositional matrix adjust.
 Identities = 126/370 (34%), Positives = 176/370 (47%), Gaps = 25/370 (6%)

Query: 145 STPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPK 204
           S PV SG  Q    Y  R G+GTP +Q  + LDT +D  W  C PC  C   S   F P 
Sbjct: 67  SAPVASG--QTPPSYVVRAGLGTPVQQLLLALDTSADATWSHCAPCDTCPAGSR--FIPA 122

Query: 205 TSSSYSPLPCAAPQCKSLDVSACRANR--------CLYQVAYGDGSFTVGDLVTETVSFG 256
           +SSSY+ LPCA+  C   +   C AN+        C +   + D SF    L ++T+  G
Sbjct: 123 SSSSYASLPCASDWCPLFEGQPCPANQDASAPLPACAFSKPFADTSFQA-SLGSDTLRLG 181

Query: 257 NSGSVKGIALGCGHDNEG--LFVGSAGLLGLGGGMLSLTKQIKATS---LAYCLVD-RDS 310
              ++ G A GC     G    +   GLLGLG G +SL  Q  +      +YCL   R  
Sbjct: 182 KD-AIAGYAFGCVGAVAGPTTNLPKQGLLGLGRGPMSLLSQTGSRYNGVFSYCLPSYRSY 240

Query: 311 PASGVLEFNSA-RGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGD 369
             SG L   +A +  +    PL+ N    + YYV +TG SVG   V++P   F  D A  
Sbjct: 241 YFSGSLRLGAAGQPRNVRYTPLLTNPHRPSLYYVNVTGLSVGRTWVKVPAGSFAFDPATG 300

Query: 370 GGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVS 429
            G ++D GT ITR     Y +LR+ F R        + +  FDTC++   + +   P V+
Sbjct: 301 AGTVIDSGTVITRWTAPVYAALREEFRRQVAAPSGYTSLGAFDTCFNTDEVAAGGAPPVT 360

Query: 430 LHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPT----SSALSIIGNVQQQGTRVSFDLAN 485
           LH   G  L LP +N LI   +    C A A      ++ ++++ N+QQQ  RV  D+A 
Sbjct: 361 LHMDGGVDLTLPMENTLIHSSATPLACLAMAEAPQNVNAVVNVVANLQQQNVRVVVDVAG 420

Query: 486 NRVGFTPNKC 495
           +RVGF    C
Sbjct: 421 SRVGFAREPC 430


>gi|212722554|ref|NP_001131154.1| uncharacterized protein LOC100192462 precursor [Zea mays]
 gi|194690728|gb|ACF79448.1| unknown [Zea mays]
          Length = 431

 Score =  171 bits (433), Expect = 9e-40,   Method: Compositional matrix adjust.
 Identities = 126/370 (34%), Positives = 176/370 (47%), Gaps = 25/370 (6%)

Query: 145 STPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPK 204
           S PV SG  Q    Y  R G+GTP +Q  + LDT +D  W  C PC  C   S   F P 
Sbjct: 67  SAPVASG--QTPPSYVVRAGLGTPVQQLLLALDTSADATWSHCAPCDTCPAGSR--FIPA 122

Query: 205 TSSSYSPLPCAAPQCKSLDVSACRANR--------CLYQVAYGDGSFTVGDLVTETVSFG 256
           +SSSY+ LPCA+  C   +   C AN+        C +   + D SF    L ++T+  G
Sbjct: 123 SSSSYASLPCASDWCPLFEGQPCPANQDASAPLPACAFSKPFADTSFQA-SLGSDTLRLG 181

Query: 257 NSGSVKGIALGCGHDNEG--LFVGSAGLLGLGGGMLSLTKQIKATS---LAYCLVD-RDS 310
              ++ G A GC     G    +   GLLGLG G +SL  Q  +      +YCL   R  
Sbjct: 182 KD-AIAGYAFGCVGAVAGPTTNLPKQGLLGLGRGPMSLLSQTGSRYNGVFSYCLPSYRSY 240

Query: 311 PASGVLEFNSA-RGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGD 369
             SG L   +A +  +    PL+ N    + YYV +TG SVG   V++P   F  D A  
Sbjct: 241 YFSGSLRLGAAGQPRNVRYTPLLTNPHRPSLYYVNVTGLSVGRTWVKVPAGSFAFDPATG 300

Query: 370 GGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVS 429
            G ++D GT ITR     Y +LR+ F R        + +  FDTC++   + +   P V+
Sbjct: 301 AGTVIDSGTVITRWTAPVYAALREEFRRQVAAPSGYTSLGAFDTCFNTDEVAAGGAPPVT 360

Query: 430 LHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPT----SSALSIIGNVQQQGTRVSFDLAN 485
           LH   G  L LP +N LI   +    C A A      ++ ++++ N+QQQ  RV  D+A 
Sbjct: 361 LHMDGGVDLTLPMENTLIHSSATPLACLAMAEAPQNVNAVVNVVANLQQQNVRVVVDVAG 420

Query: 486 NRVGFTPNKC 495
           +RVGF    C
Sbjct: 421 SRVGFAREPC 430


>gi|224074591|ref|XP_002304395.1| predicted protein [Populus trichocarpa]
 gi|222841827|gb|EEE79374.1| predicted protein [Populus trichocarpa]
          Length = 443

 Score =  171 bits (432), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 136/437 (31%), Positives = 206/437 (47%), Gaps = 51/437 (11%)

Query: 80  FSLPLHSREI----LHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPA 135
           FSL L  R+     L+   H D+  L  +   R  +RVN   TK           ++   
Sbjct: 34  FSLNLIHRDSPLSPLYNPNHTDFDRL-RNAFSRSISRVNVFKTKAV---------DINSF 83

Query: 136 EAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQ 195
           +  ++P               GEYF ++ +GTP  +  ++ DTGSD+ W+QC PC  CY+
Sbjct: 84  QNDLVPN-------------GGEYFMKMSIGTPLVEVIVIADTGSDLTWVQCLPCDPCYR 130

Query: 196 QSDPIFDPKTSSSYSPLPCAAPQCKSLDVS--ACR--ANRCLYQVAYGDGSFTVGDLVTE 251
           Q  P+FDP  SSSY  + C +  C +LDVS  AC    N C Y  +YGD S+T G+L TE
Sbjct: 131 QKSPLFDPSRSSSYRHMLCGSRFCNALDVSEQACTMDTNICEYHYSYGDKSYTNGNLATE 190

Query: 252 TVSFGNSGS----VKGIALGCGHDNEGLFVG-SAGLLGLGGGMLSLTKQIKAT---SLAY 303
             + G++ S    +  I  GCG  N G F    +G++GLGGG LSL  Q+ +      +Y
Sbjct: 191 KFTIGSTSSRPVHLSPIVFGCGTGNGGTFDELGSGIVGLGGGALSLVSQLSSIIKGKFSY 250

Query: 304 CLV--DRDSPASGVLEFNS---ARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIP 358
           CLV     S  +  ++F +     G   V+ PL+ +K+ DT+YYV L   SVG + +   
Sbjct: 251 CLVPLSEQSNVTSKIKFGTDSVISGPQVVSTPLV-SKQPDTYYYVTLEAISVGNKRLPYT 309

Query: 359 PSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFS 418
             L   +    G +I+D GT +T L ++ +  L           + +    LF  C+  +
Sbjct: 310 NGLLNGN-VEKGNVIIDSGTTLTFLDSEFFTELERVLEETVKAERVSDPRGLFSVCFRSA 368

Query: 419 GLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTR 478
           G   + +P +++HF     + L   N  +  D     CF    +S+ + I GN+ Q    
Sbjct: 369 G--DIDLPVIAVHFNDAD-VKLQPLNTFVKADE-DLLCFTMI-SSNQIGIFGNLAQMDFL 423

Query: 479 VSFDLANNRVGFTPNKC 495
           V +DL    V F P  C
Sbjct: 424 VGYDLEKRTVSFKPTDC 440


>gi|326513755|dbj|BAJ87896.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 442

 Score =  170 bits (431), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 110/369 (29%), Positives = 171/369 (46%), Gaps = 25/369 (6%)

Query: 145 STPVVSGASQGSGEYFSRIGVGTPPRQ-FSMVLDTGSDINWLQCRPCTECYQQSDPIFDP 203
           + PV    +  + EY   + +G P  Q   + LDTGSD+ W QC PC EC+ Q  P FD 
Sbjct: 78  TAPVGRANTDVNSEYLIHLSIGAPRSQPVVLTLDTGSDVVWTQCEPCAECFTQPLPRFDT 137

Query: 204 KTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSF-----GNS 258
             S++   + C+ P C +     C  + C Y   YGDGS + G  + ++ +F     G  
Sbjct: 138 AASNTVRSVACSDPLCNAHSEHGCFLHGCTYVSGYGDGSLSFGHFLRDSFTFDDGKGGGK 197

Query: 259 GSVKGIALGCGHDNEGLFVGS-AGLLGLGGGMLSLTKQIKATSLAYCLVDR----DSPA- 312
            +V  I  GCG  N G F+ +  G+ G G G LSL  Q+K    +YC   R     SP  
Sbjct: 198 VTVPDIGFGCGMYNAGRFLQTETGIAGFGRGPLSLPSQLKVRQFSYCFTTRFEAKSSPVF 257

Query: 313 -SGVLEFNSARGGDAVTAPLIRN---KKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAG 368
             G  +  +   G  ++ P +R+      ++ Y +   G +VG   + +P    E+   G
Sbjct: 258 LGGAGDLKAHATGPILSTPFVRSLPPGTDNSHYVLSFKGVTVGKTRLPVP----EIKADG 313

Query: 369 DGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTV 428
            G   +D GT IT      +  L+ +F+  A  L         D C+ + G ++  +P +
Sbjct: 314 SGATFIDSGTDITTFPDAVFRQLKSAFIAQAA-LPVNKTADEDDICFSWDGKKTAAMPKL 372

Query: 429 SLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSAL--SIIGNVQQQGTRVSFDLANN 486
             H   G   DLP +NY+     +G  C A + TS  +  ++IGN QQQ T + +DLA  
Sbjct: 373 VFHL-EGADWDLPRENYVTEDRESGQVCVAVS-TSGQMDRTLIGNFQQQNTHIVYDLAAG 430

Query: 487 RVGFTPNKC 495
           ++   P +C
Sbjct: 431 KLLLVPAQC 439


>gi|297805036|ref|XP_002870402.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297316238|gb|EFH46661.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 435

 Score =  170 bits (430), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 117/352 (33%), Positives = 179/352 (50%), Gaps = 24/352 (6%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAA 216
           GEY   + +GTPP     V DTGS++ W QC+PC +CY Q DP+FDPK SS+Y  + C++
Sbjct: 92  GEYLMNLSLGTPPSPIMAVADTGSNLIWTQCKPCDDCYTQVDPLFDPKASSTYKDVSCSS 151

Query: 217 PQCKSLDVSA---CRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGS----VKGIALGCG 269
            QC +L+  A        C Y V+Y DGS+T+G    +T++ G++ +    +K I +GCG
Sbjct: 152 SQCTALENQASCSTEDKTCSYLVSYADGSYTMGKFAVDTLTLGSTDNRPVQLKNIIIGCG 211

Query: 270 HDNEGLFVGS-AGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASGVLEFNSA--RG 323
            +N   F    +G++GLGGG +SL KQ+  +     +YCLV  +   S +    +A   G
Sbjct: 212 QNNAVTFRNKSSGVVGLGGGAVSLIKQLGDSIDGKFSYCLVPENDQTSKINFGTNAVVSG 271

Query: 324 GDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRL 383
              V+ PL+  K  DTFYY+ L   SVG + +Q P      D    G +++D GT +T L
Sbjct: 272 PGTVSTPLVV-KSRDTFYYLTLKSISVGSKNMQTP------DSNIKGNMVIDSGTTLTLL 324

Query: 384 QTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAK 443
             + Y  + ++   L    K          CY+ +    + +P +++HF  G  + L   
Sbjct: 325 PVKYYIEIENAVASLINADKSKDERIGSSLCYNATA--DLNIPVITMHF-EGADVKLYPY 381

Query: 444 NYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           N    V +    C AF  +     I GNV Q+   V +D A+  + F P  C
Sbjct: 382 NSFFKV-TEDLVCLAFGMSFYRNGIYGNVAQKNFLVGYDTASKTMSFKPTDC 432


>gi|359476193|ref|XP_003631802.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-2-like [Vitis vinifera]
          Length = 496

 Score =  170 bits (430), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 128/391 (32%), Positives = 191/391 (48%), Gaps = 49/391 (12%)

Query: 108 RDSARVNTLITKL-QLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVG 166
           RD +RV+ + +K  Q A  N+  H       ++  ED             G +   +  G
Sbjct: 126 RDESRVSFINSKFNQYAPENLKDHT---PNNKLFDED-------------GNFLVDVAFG 169

Query: 167 TPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSA 226
           TPP++F+++LDTGS I W QC+PC  C + S   FDP  S +YS   C         + +
Sbjct: 170 TPPQKFTLILDTGSSITWTQCKPCVRCLKASRRHFDPSASLTYSLGSC---------IPS 220

Query: 227 CRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLF-VGSAGLLGL 285
              N   Y + YGD S +VG+   +T++  +S        GCG +NEG F  G+ G+LGL
Sbjct: 221 TVGN--TYNMTYGDKSTSVGNYGCDTMTLEHSDVFPKFQFGCGRNNEGDFGSGADGMLGL 278

Query: 286 GGGMLSLTKQIKA---TSLAYCLVDRDSPASGVL------EFNSARGGDAVTAPLIRNKK 336
           G G LS   Q  +      +YCL + DS  S +       + +S +    V  P     +
Sbjct: 279 GQGQLSTVSQTASKFKKVFSYCLPEEDSIGSLLFGEKATSQSSSLKFTSLVNGPGTSGLE 338

Query: 337 VDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFV 396
              +Y+V L   SVG + + IP S+F        G I+D GT ITRL  +AY++L+ +F 
Sbjct: 339 ESGYYFVKLLDISVGNKRLNIPSSVF-----ASPGTIIDSGTVITRLPQRAYSALKAAFK 393

Query: 397 RLAGNLKPTSGVA----LFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSA 452
           +       ++G      + DTCY+ SG + V +P + LHFG G  + L  K  +I  + A
Sbjct: 394 KAMAKYPLSNGRRKKGDILDTCYNLSGRKDVLLPEIVLHFGEGADVRLNGKR-VIWGNDA 452

Query: 453 GTFCFAFAPTSSALSIIGNVQQQGTRVSFDL 483
              C AFA  +S L+IIGN QQ    V +D+
Sbjct: 453 SRLCLAFA-GNSELTIIGNRQQVSLTVLYDI 482


>gi|302786560|ref|XP_002975051.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
 gi|300157210|gb|EFJ23836.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
          Length = 359

 Score =  170 bits (430), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 119/367 (32%), Positives = 177/367 (48%), Gaps = 39/367 (10%)

Query: 155 GSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTEC--YQQSDPIFDPKTSSSYSPL 212
           G GEY   + +GTPP+    ++DTGSD+ WL+C  C  C      + IF    SSSY  L
Sbjct: 1   GEGEYMMELSIGTPPQLIPAMIDTGSDLVWLKCDNCDHCDLDHHGETIFFSDASSSYKKL 60

Query: 213 PCAAPQCKSLDVSA----CRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGS-------V 261
           PC +  C  +  +     C    C Y+  YGDGS T GD+ ++ +SF + G+        
Sbjct: 61  PCNSTHCSGMSSAGIGPRCEET-CKYKYEYGDGSRTSGDVGSDRISFRSHGAGEDHRSFF 119

Query: 262 KGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIK---ATSLAYCLVDRDSP--ASGVL 316
            G   GCG   +G +  + GL+GLG    SL +Q+        +YCLV  DSP  A   L
Sbjct: 120 DGFLFGCGRKLKGDWNFTQGLIGLGQKSHSLIQQLGDKLGYKFSYCLVSYDSPPSAKSFL 179

Query: 317 EFNSA---RGGDAVTAPLIRNKKVD-TFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGI 372
              S+   RG D V+ P++    +D T YYV L   +VGG    +P  +++ +   +  +
Sbjct: 180 FLGSSAALRGHDVVSTPILHGDHLDQTLYYVDLQSITVGG----VPVVVYDKESGHNTSV 235

Query: 373 --------IVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSG-VALFDTCYDFSGLRSV 423
                   ++D GT  T L    Y ++R S       + PT G  A  D C++ SG  S 
Sbjct: 236 GPFLANKTVIDSGTTYTLLTPPVYEAMRKSIEEQV--ILPTLGNSAGLDLCFNSSGDTSY 293

Query: 424 RVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDL 483
             P+V+ +F     L LP +N +  V S    C +   +   LSIIGN+QQQ   + +DL
Sbjct: 294 GFPSVTFYFANQVQLVLPFEN-IFQVTSRDVVCLSMDSSGGDLSIIGNMQQQNFHILYDL 352

Query: 484 ANNRVGF 490
             +++ F
Sbjct: 353 VASQISF 359


>gi|357450869|ref|XP_003595711.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355484759|gb|AES65962.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 443

 Score =  169 bits (428), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 117/360 (32%), Positives = 176/360 (48%), Gaps = 31/360 (8%)

Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAP 217
           +Y   + +GTPP +    +DTGSD+ WLQC PCT CY+Q +P+FDP++SS+YS +   + 
Sbjct: 58  DYLMELSIGTPPVKTYAQVDTGSDLIWLQCIPCTNCYKQLNPMFDPQSSSTYSNIAYGSE 117

Query: 218 QCKSLDVSACR--ANRCLYQVAYGDGSFTVGDLVTETVSF----GNSGSVKGIALGCGHD 271
            C  L  ++C    N C Y  +Y D S T G L  ET++     G   ++KG+  GCGH+
Sbjct: 118 SCSKLYSTSCSPDQNNCNYTYSYEDDSITEGVLAQETLTLTSTTGKPVALKGVIFGCGHN 177

Query: 272 NEGLFVGS-AGLLGLGGGMLSLTKQIKAT----SLAYCLVDRDSPASGVLEFNSARGGD- 325
           N G+F     G++GLG G LSL  QI ++      + CLV   +  S     +  +G + 
Sbjct: 178 NNGVFNDKEMGIIGLGRGPLSLVSQIGSSFGGKMFSQCLVPFHTNPSITSPMSFGKGSEV 237

Query: 326 ----AVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFE----MDEAGDGGIIVDCG 377
                V+ PL+       FY+V L G SV  + + +P   F     ++    G +++D G
Sbjct: 238 LGNGVVSTPLVSKNTHQAFYFVTLLGISV--EDINLP---FNDGSSLEPITKGNMVIDSG 292

Query: 378 TAITRLQTQAYNSLRDSFV-RLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGK 436
           T  T L    Y+ L +    ++A +  P      +  CY      +++  T++ HF    
Sbjct: 293 TPTTLLPEDFYHRLVEEVRNKVALDPIPIDPTLGYQLCYRTP--TNLKGTTLTAHFEGAD 350

Query: 437 ALDLPAKNYLIPVDSAGTFCFAFAPT-SSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            L  P + + IPV   G FCFAF  T S+   I GN  Q    + FDL    V F    C
Sbjct: 351 VLLTPTQIF-IPVQD-GIFCFAFTSTFSNEYGIYGNHAQSNYLIGFDLEKQLVSFKATDC 408


>gi|242091327|ref|XP_002441496.1| hypothetical protein SORBIDRAFT_09g028060 [Sorghum bicolor]
 gi|241946781|gb|EES19926.1| hypothetical protein SORBIDRAFT_09g028060 [Sorghum bicolor]
          Length = 466

 Score =  169 bits (427), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 126/425 (29%), Positives = 202/425 (47%), Gaps = 43/425 (10%)

Query: 103 LSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSR 162
           LS   RD    +  I + QLA     R       A++    F+ P+ SGA  G+G+YF R
Sbjct: 51  LSDRARDDLHRHAYI-RSQLASSRRGRRA-----AEVGASAFAMPLSSGAYTGTGQYFVR 104

Query: 163 IGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDP----IFDPKTSSSYSPLPCAAPQ 218
             VGTP + F +V DTGSD+ W++CR               +F    S S++P+ C++  
Sbjct: 105 FRVGTPAQPFVLVADTGSDLTWVKCRGAGAAAGTGAGSPARVFRTAASKSWAPIACSSDT 164

Query: 219 CKS---LDVSACR--ANRCLYQVAYGDGSFTVGDLVTETVSFG---------------NS 258
           C S     ++ C   A+ C Y   Y DGS   G + T++ +                   
Sbjct: 165 CTSYVPFSLANCSSPASPCAYDYRYRDGSAARGVVGTDSATIALSSGSGRGGGDSSGGRR 224

Query: 259 GSVKGIALGCGHDNEGL-FVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSP--A 312
             ++G+ LGC    +G  F  S G+L LG   +S   +  A      +YCLVD  +P  A
Sbjct: 225 AKLQGVVLGCAATYDGQSFQSSDGVLSLGNSNISFASRAAARFGGRFSYCLVDHLAPRNA 284

Query: 313 SGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGI 372
           +  L F       A   PL+ ++++  FY V +    V G+A+ IP  ++++D   +GG 
Sbjct: 285 TSYLTFGPGATAPAAQTPLLLDRRMTPFYAVTVDAVYVAGEALDIPADVWDVDR--NGGA 342

Query: 373 IVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHF 432
           I+D GT++T L T AY ++  +  +    L P   +  F+ CY+++   ++ +P + +HF
Sbjct: 343 ILDSGTSLTILATPAYRAVVTALSKHLAGL-PRVTMDPFEYCYNWTDAGALEIPKMEVHF 401

Query: 433 GAGKALDLPAKNYLIPVDSA-GTFCFAFAPTS-SALSIIGNVQQQGTRVSFDLANNRVGF 490
                L+ PAK+Y+I  D+A G  C      S   +S+IGN+ QQ     FDL +  + F
Sbjct: 402 AGSARLEPPAKSYVI--DAAPGVKCIGVQEGSWPGVSVIGNILQQEHLWEFDLRDRWLRF 459

Query: 491 TPNKC 495
              +C
Sbjct: 460 KHTRC 464


>gi|242072067|ref|XP_002451310.1| hypothetical protein SORBIDRAFT_05g027510 [Sorghum bicolor]
 gi|241937153|gb|EES10298.1| hypothetical protein SORBIDRAFT_05g027510 [Sorghum bicolor]
          Length = 509

 Score =  169 bits (427), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 132/378 (34%), Positives = 180/378 (47%), Gaps = 49/378 (12%)

Query: 142 EDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPC--TECYQQSDP 199
           E+ S+     A+ GS     R GV    RQ  M+LDT SD+ W+QC PC  ++CY Q+D 
Sbjct: 157 EELSSAADPAATGGSRRSRLRPGV----RQL-MLLDTASDVAWVQCFPCPASQCYAQTDV 211

Query: 200 IFDPKTSSSYSPLPCAAPQCKSL-------DVSACRANRCLYQVAYGDGSFTVGDLVTET 252
           ++DP  S S     C++P C+ L         S+  A +C Y+V Y DGS T G LV + 
Sbjct: 212 LYDPSKSRSSESFACSSPTCRQLGPYANGCSSSSNSAGQCQYRVRYPDGSTTSGTLVADQ 271

Query: 253 VSFGNSGSVKGIALGCGHDNEGLFVGS--AGLLGLGGGMLSLTKQIK---ATSLAYCLVD 307
           +S   +  V     GC H   G F  S  AG++ LG G+ SL  Q         +YC   
Sbjct: 272 LSLSPTSQVPKFEFGCSHAARGSFSRSKTAGIMALGRGVQSLVSQTSTKYGQVFSYCFPP 331

Query: 308 RDSPAS----GVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFE 363
             S       GV   +S+R   AVT P++   K    Y V L   +V GQ + +PP++F 
Sbjct: 332 TASHKGFFVLGVPRRSSSR--YAVT-PML---KTPMLYQVRLEAIAVAGQRLDVPPTVFA 385

Query: 364 MDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSV 423
                  G  +D  T ITRL   AY +LR +F       +P +     DTCYDF+G+ S+
Sbjct: 386 ------AGAALDSRTVITRLPPTAYQALRSAFRDKMSMYRPAAANGQLDTCYDFTGVSSI 439

Query: 424 RVPTVSLHF---GAGKALDLPAKNYLIPVDSAGTFCFAFAPTSS---ALSIIGNVQQQGT 477
            +PT+SL F   GAG  LD        P       C AFA T+    A  IIG +Q Q  
Sbjct: 440 MLPTISLVFDRTGAGVQLD--------PSGVLFGSCLAFASTAGDDRATGIIGFLQLQTI 491

Query: 478 RVSFDLANNRVGFTPNKC 495
            V +++A   VGF    C
Sbjct: 492 EVLYNVAGGSVGFRRGAC 509


>gi|242079451|ref|XP_002444494.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
 gi|241940844|gb|EES13989.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
          Length = 445

 Score =  168 bits (426), Expect = 6e-39,   Method: Compositional matrix adjust.
 Identities = 109/353 (30%), Positives = 175/353 (49%), Gaps = 23/353 (6%)

Query: 163 IGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCK-- 220
           + +GTPP+  +++LDTGSD+ W QC+       +  P++DP  SSS++  PC    C+  
Sbjct: 93  VSIGTPPQPRTLILDTGSDLIWTQCKLFDTRQHREKPLYDPAKSSSFAAAPCDGRLCETG 152

Query: 221 SLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVK-GIALGCGHDNEGLFVGS 279
           S +   C  N+C+Y   YG  + T G+L +ET +FG    V   +  GCG    G   G+
Sbjct: 153 SFNTKNCSRNKCIYTYNYGSAT-TKGELASETFTFGEHRRVSVSLDFGCGKLTSGSLPGA 211

Query: 280 AGLLGLGGGMLSLTKQIKATSLAYCL---VDRDSPAS----GVLEFNSAR-GGDAVTAPL 331
           +G+LG+    LSL  Q++    +YCL   +DR++ +      + + +  R  G   T  L
Sbjct: 212 SGILGISPDRLSLVSQLQIPRFSYCLTPFLDRNTTSHIFFGAMADLSKYRTTGPIQTTSL 271

Query: 332 IRNKK-VDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNS 390
           + N    + +YYV L G SVG + + +P S F +   G GG  VD G     L +    +
Sbjct: 272 VTNPDGSNYYYYVPLIGISVGTKRLNVPVSSFAIGRDGSGGTFVDSGDTTGMLPSVVMEA 331

Query: 391 LRDSFVRLAG--NLKPTSGVALFDTCYDF------SGLRSVRVPTVSLHFGAGKALDLPA 442
           L+++ V       +  T     ++ C+        +   +V+VP +  HF  G A+ L  
Sbjct: 332 LKEAMVEAVKLPVVNATDHGYEYELCFQLPRNGGGAVETAVQVPPLVYHFDGGAAMLLRR 391

Query: 443 KNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            +Y++ V SAG  C   +  +   +IIGN QQQ   V FD+ N+   F P +C
Sbjct: 392 DSYMVEV-SAGRMCLVISSGARG-AIIGNYQQQNMHVLFDVENHEFSFAPTQC 442


>gi|224072901|ref|XP_002303933.1| predicted protein [Populus trichocarpa]
 gi|222841365|gb|EEE78912.1| predicted protein [Populus trichocarpa]
          Length = 370

 Score =  168 bits (425), Expect = 6e-39,   Method: Compositional matrix adjust.
 Identities = 116/359 (32%), Positives = 171/359 (47%), Gaps = 21/359 (5%)

Query: 147 PVVSGASQ-GSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKT 205
           P+ SG     S  Y  +  VGTPP+   M LD   D  W+ C+ C  C   S  +F+   
Sbjct: 22  PIASGRGVIQSPSYIVKAKVGTPPQTLLMALDNSYDAAWIPCKGCVGC---SSTVFNTVK 78

Query: 206 SSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIA 265
           S+++  L C APQCK +    C  + C +   YG  +  + +L  +T++  +   V   A
Sbjct: 79  STTFKTLGCGAPQCKQVPNPICGGSTCTWNTTYGSSTI-LSNLTRDTIAL-SMDPVPYYA 136

Query: 266 LGCGHDNEGLFVGSAGLLGLGGGMLSL---TKQIKATSLAYCLVD-RDSPASGVLEFNSA 321
            GC     G  V   GLLG G G LS    T+ +  ++ +YCL   R    SG L     
Sbjct: 137 FGCIQKATGSSVPPQGLLGFGRGPLSFLSQTQNLYKSTFSYCLPSFRTLNFSGSLRLGPV 196

Query: 322 RGGDAV-TAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAI 380
                + T PL++N +  + YYV L G  VG + V IP S    +     G I D GT  
Sbjct: 197 GQPPRIKTTPLLKNPRRSSLYYVKLNGIRVGRKIVDIPRSALAFNPTTGAGTIFDSGTVF 256

Query: 381 TRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDL 440
           TRL   AY ++R+ F +  GN    S +  FDTCY       +  PT++  F +G  + +
Sbjct: 257 TRLVAPAYIAVRNEFRKRVGNAT-VSSLGGFDTCYSV----PIVPPTITFMF-SGMNVTM 310

Query: 441 PAKNYLIPVDSAGTFCFAFAP----TSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           P +N LI   +  T C A A      +S L++I ++QQQ  R+ FD+ N+R+G    +C
Sbjct: 311 PPENLLIHSTAGVTSCLAMAAAPDNVNSVLNVIASMQQQNHRILFDVPNSRLGVAREQC 369


>gi|357481205|ref|XP_003610888.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355512223|gb|AES93846.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 413

 Score =  168 bits (425), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 118/356 (33%), Positives = 168/356 (47%), Gaps = 24/356 (6%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAA 216
           G+Y   + +GTPP + S  +DTGSD+ W+QC PC  CY Q +P+FDP  SS+Y+ + C +
Sbjct: 62  GQYLMELYIGTPPIKISGTVDTGSDLIWVQCVPCLGCYNQINPMFDPLKSSTYTNISCDS 121

Query: 217 PQCKSLDVSACR-ANRCLYQVAYGDGSFTVGDLVTETVSF----GNSGSVKGIALGCGHD 271
           P C    +  C    RC Y   Y D S T G L  ETV+     G   S++GI  GCGH+
Sbjct: 122 PLCYKPYIGECSPEKRCDYTYGYADSSLTKGVLAQETVTLTSNTGKPISLQGILFGCGHN 181

Query: 272 NEGLFVG-SAGLLGLGGGMLSLTKQI----KATSLAYCLVD--RDSPASGVLEFNSAR-- 322
           N G F     GL+GLGGG  SL  QI         + CLV    D   S  + F      
Sbjct: 182 NTGNFNDHEMGLIGLGGGPTSLVSQIGPLFGGKKFSQCLVPFLTDITISSQMSFGKGSEV 241

Query: 323 -GGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAIT 381
            G   VT PL++ ++  T YYV L G SV    + +  ++ +      G ++VD GT   
Sbjct: 242 LGEGVVTTPLVQREQDMTSYYVTLLGISVEDTYLPMNSTIEK------GNMLVDSGTPPN 295

Query: 382 RLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLP 441
            L  Q Y+ +    V+    L+P +          +    +++ PT++ HF     L  P
Sbjct: 296 ILPQQLYDRVYVE-VKNKVPLEPITDDPSLGPQLCYRTQTNLKGPTLTYHFEGANLLLTP 354

Query: 442 AKNYLIPV-DSAGTFCFAFAP-TSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            + ++ P  ++ G FC A     +S   I GN  Q    + FDL    V F P  C
Sbjct: 355 IQTFIPPTPETKGVFCLAITNCANSDPGIYGNFAQTNYLIGFDLDRQIVSFKPTDC 410


>gi|297823357|ref|XP_002879561.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297325400|gb|EFH55820.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 447

 Score =  168 bits (425), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 121/377 (32%), Positives = 186/377 (49%), Gaps = 32/377 (8%)

Query: 146 TPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKT 205
           T + SG     GE+F  I +GTPP +   + DTGSD+ W+QC+PC +CY+++ PIFD K 
Sbjct: 72  TDLQSGLIGADGEFFMSITIGTPPMKVFAIADTGSDLTWVQCKPCQQCYKENGPIFDKKK 131

Query: 206 SSSYSPLPCAAPQCKSLDVS--AC--RANRCLYQVAYGDGSFTVGDLVTETVSF----GN 257
           SS+Y   PC +  C +L  S   C    N C Y+ +YGD SF+ GD+ TET+S     G+
Sbjct: 132 SSTYKSEPCDSRNCHALSSSERGCDESKNVCKYRYSYGDQSFSKGDVATETISIDSASGS 191

Query: 258 SGSVKGIALGCGHDNEGLFVGSAGLLGLGGGM-LSLTKQIKAT---SLAYCLVDRDSPAS 313
             S  G   GCG++N G F  +   +   GG  LSL  Q+ ++     +YCL  + +  +
Sbjct: 192 PVSFPGTVFGCGYNNGGTFDETGSGIIGLGGGHLSLISQLGSSISKKFSYCLSHKSATTN 251

Query: 314 GV----LEFNS-----ARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEM 364
           G     L  NS     ++    ++ PL+ +K+  T+YY+ L   SVG + +    S +  
Sbjct: 252 GTSVINLGTNSIPSSLSKDSGVISTPLV-DKEPRTYYYLTLEAISVGKKKIPYTGSSYNP 310

Query: 365 DEAG-----DGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSG-VALFDTCYDFS 418
           ++ G      G II+D GT +T L +  ++    +   L    K  S    L   C+  S
Sbjct: 311 NDGGIFSETSGNIIIDSGTTLTLLDSGFFDKFGAAVEELVTGAKRVSDPQGLLSHCFK-S 369

Query: 419 GLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTR 478
           G   + +P +++HF  G  + L   N  + V S    C +  PT+  ++I GN  Q    
Sbjct: 370 GSAEIGLPEITVHF-TGADVRLSPINAFVKV-SEDMVCLSMVPTTE-VAIYGNFAQMDFL 426

Query: 479 VSFDLANNRVGFTPNKC 495
           V +DL    V F    C
Sbjct: 427 VGYDLETRTVSFQRMDC 443


>gi|125586059|gb|EAZ26723.1| hypothetical protein OsJ_10631 [Oryza sativa Japonica Group]
          Length = 339

 Score =  167 bits (424), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 119/347 (34%), Positives = 174/347 (50%), Gaps = 27/347 (7%)

Query: 165 VGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLP-CAAPQCKSLD 223
           +GTPP    + L+ G+++ W    P  EC++Q+ P F+P T S   P   C +P+     
Sbjct: 1   MGTPPNPVKLKLENGNELIWNHSNPSPECFEQAFPYFEPLTFSRGLPFASCGSPKF---- 56

Query: 224 VSACRANRCLYQVAYGDGSFTVGDLVTETVSF-GNSGSVKGIALGCGHDNEGLFVGS-AG 281
                   C+Y  +YGD S T G L  +  +F G   SV G+A GCG  N G+F  +  G
Sbjct: 57  ---WPNQTCVYTYSYGDKSVTTGFLEVDKFTFVGAGASVPGVAFGCGLFNNGVFKSNETG 113

Query: 282 LLGLGGGMLSLTKQIKATSLAYCL--VDRDSPASGVLEFNS---ARGGDAV-TAPLI--- 332
           + G G G LSL  Q+K  + ++C   +    P++ +L+  +   + G  AV T PLI   
Sbjct: 114 IAGFGRGPLSLPSQLKVGNFSHCFTTITGAIPSTVLLDLPADLFSNGQGAVQTTPLIQYA 173

Query: 333 RNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLR 392
           +N+   T YY+ L G +VG   + +P S F +   G GG I+D GT+IT L  Q Y  +R
Sbjct: 174 KNEANPTLYYLSLKGITVGSTRLPVPESAFALTN-GTGGTIIDSGTSITSLPPQVYQVVR 232

Query: 393 DSF-VRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPV-D 450
           D F  ++   + P +    + TC+         VP + LHF  G  +DLP +NY+  V D
Sbjct: 233 DEFAAQIKLPVVPGNATGHY-TCFSAPSQAKPDVPKLVLHF-EGATMDLPRENYVFEVPD 290

Query: 451 SAGT--FCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            AG    C A        +IIGN QQQ   V +DL NN + F   +C
Sbjct: 291 DAGNSIICLAIN-KGDETTIIGNFQQQNMHVLYDLQNNMLSFVAAQC 336


>gi|302784853|ref|XP_002974198.1| hypothetical protein SELMODRAFT_54030 [Selaginella moellendorffii]
 gi|300157796|gb|EFJ24420.1| hypothetical protein SELMODRAFT_54030 [Selaginella moellendorffii]
          Length = 359

 Score =  167 bits (423), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 117/367 (31%), Positives = 176/367 (47%), Gaps = 39/367 (10%)

Query: 155 GSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTEC--YQQSDPIFDPKTSSSYSPL 212
           G GEY   + +GTPP+    ++DTGSD+ WL+C  C  C      + IF    SSSY  L
Sbjct: 1   GEGEYMMELSIGTPPQLIPAMIDTGSDLVWLKCDNCDHCDLDHHGETIFFSDASSSYKKL 60

Query: 213 PCAAPQCKSLDVSA----CRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGS-------V 261
           PC +  C  +  +     C    C Y+  YGDGS T GD+ ++ +SF + G+        
Sbjct: 61  PCNSTHCSGMSSAGIGPRCEET-CKYKYEYGDGSRTSGDVGSDRISFRSHGAGEDHRSFF 119

Query: 262 KGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIK---ATSLAYCLVDRDSP--ASGVL 316
            G   GC    +G +  + GL+GLG    SL +Q+        +YCLV  DSP  A   L
Sbjct: 120 DGFLFGCARKLKGDWNFTQGLIGLGQKSHSLIQQLGDKLGYKFSYCLVSYDSPPSAKSFL 179

Query: 317 EFNSA---RGGDAVTAPLIRNKKVD-TFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGI 372
              S+   RG D V+ P++    +D T YYV L   ++GG    +P  +++ +   +  +
Sbjct: 180 FLGSSAALRGHDVVSTPILHGDHLDQTLYYVDLQSITIGG----VPVVVYDKESGHNTSV 235

Query: 373 --------IVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSG-VALFDTCYDFSGLRSV 423
                   ++D GT  T L    Y ++R S       + PT G  A  D C++ SG  S 
Sbjct: 236 GPFLANKTVIDSGTTYTLLTPPVYEAMRKSIEEQV--ILPTLGNSAGLDLCFNSSGDTSY 293

Query: 424 RVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDL 483
             P+V+ +F     L LP +N +  V S    C +   +   LSIIGN+QQQ   + +DL
Sbjct: 294 GFPSVTFYFANQVQLVLPFEN-IFQVTSRDVVCLSMDSSGGDLSIIGNMQQQNFHILYDL 352

Query: 484 ANNRVGF 490
             +++ F
Sbjct: 353 VASQISF 359


>gi|222618833|gb|EEE54965.1| hypothetical protein OsJ_02555 [Oryza sativa Japonica Group]
          Length = 393

 Score =  167 bits (423), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 118/350 (33%), Positives = 163/350 (46%), Gaps = 73/350 (20%)

Query: 145 STPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPC---TECYQQSDPIF 201
           S P   G+S  + EY   +G+G+P     +V+DTGSD++W+QC PC   + C+  +  +F
Sbjct: 92  SVPTTLGSSLDTLEYVISVGLGSPAVTQRVVIDTGSDVSWVQCEPCPAPSPCHAHAGALF 151

Query: 202 DPKTSSSYSPLPCAAPQCKSL----DVSACRA-NRCLYQVAYGDGSFTVGDLVTETVSFG 256
           DP  SS+Y+   C+A  C  L    + + C A +RC Y V YGDGS T G          
Sbjct: 152 DPAASSTYAAFNCSAAACAQLGDSGEANGCDAKSRCQYIVKYGDGSNTTG---------- 201

Query: 257 NSGSVKGIALGCGHDN--EGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASG 314
                 G   GC H     G+   + GL+GLGG   SL  Q  A                
Sbjct: 202 -----TGFQFGCSHAELGAGMDDKTDGLIGLGGDAQSLVSQTAA---------------- 240

Query: 315 VLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIV 374
                             R+KKV T+Y+  L   +VGG+ + + PS+F        G +V
Sbjct: 241 ------------------RSKKVPTYYFAALEDIAVGGKKLGLSPSVFAA------GSLV 276

Query: 375 DCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGA 434
           D GT ITRL   AY +L  +F            + + DTC++F+GL  V +PTV+L F  
Sbjct: 277 DSGTVITRLPPAAYAALSSAFRAGMTRYARAEPLGILDTCFNFTGLDKVSIPTVALVFAG 336

Query: 435 GKALDLPAKNYLIPVDSAGTFCFAFAPT--SSALSIIGNVQQQGTRVSFD 482
           G  +DL A   +    S G  C AFAPT    A   IGNVQQ+   V +D
Sbjct: 337 GAVVDLDAHGIV----SGG--CLAFAPTRDDKAFGTIGNVQQRTFEVLYD 380


>gi|115476830|ref|NP_001062011.1| Os08g0469100 [Oryza sativa Japonica Group]
 gi|42407408|dbj|BAD09566.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
 gi|113623980|dbj|BAF23925.1| Os08g0469100 [Oryza sativa Japonica Group]
          Length = 373

 Score =  167 bits (423), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 110/350 (31%), Positives = 176/350 (50%), Gaps = 27/350 (7%)

Query: 169 PRQFSMVLDTGSDINWLQCR----PCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDV 224
           PR+  +++DTGSD+ W QC+            S P++DP  SS+++ LPC+   C+    
Sbjct: 25  PRK--LIVDTGSDLIWTQCKLSSSTAAAARHGSPPVYDPGESSTFAFLPCSDRLCQEGQF 82

Query: 225 S---ACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVK-GIALGCGHDNEGLFVGSA 280
           S       NRC+Y+  YG  +  VG L +ET +FG   +V   +  GCG  + G  +G+ 
Sbjct: 83  SFKNCTSKNRCVYEDVYGSAA-AVGVLASETFTFGARRAVSLRLGFGCGALSAGSLIGAT 141

Query: 281 GLLGLGGGMLSLTKQIKATSLAYCLV----DRDSPA--SGVLEFNSARGGDAVTAPLIRN 334
           G+LGL    LSL  Q+K    +YCL      + SP     + + +  +    +    I +
Sbjct: 142 GILGLSPESLSLITQLKIQRFSYCLTPFADKKTSPLLFGAMADLSRHKTTRPIQTTAIVS 201

Query: 335 KKVDT-FYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRD 393
             V+T +YYV L G S+G + + +P +   M   G GG IVD G+ +  L   A+ ++++
Sbjct: 202 NPVETVYYYVPLVGISLGHKRLAVPAASLAMRPDGGGGTIVDSGSTVAYLVEAAFEAVKE 261

Query: 394 SFVRLAGNLKPTSGVALFDTCYDF------SGLRSVRVPTVSLHFGAGKALDLPAKNYLI 447
           + + +         V  ++ C+        + + +V+VP + LHF  G A+ LP  NY  
Sbjct: 262 AVMDVVRLPVANRTVEDYELCFVLPRRTAAAAMEAVQVPPLVLHFDGGAAMVLPRDNYF- 320

Query: 448 PVDSAGTFCFAFAPTS--SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
               AG  C A   T+  S +SIIGNVQQQ   V FD+ +++  F P +C
Sbjct: 321 QEPRAGLMCLAVGKTTDGSGVSIIGNVQQQNMHVLFDVQHHKFSFAPTQC 370


>gi|125596976|gb|EAZ36756.1| hypothetical protein OsJ_21092 [Oryza sativa Japonica Group]
          Length = 435

 Score =  166 bits (421), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 111/365 (30%), Positives = 175/365 (47%), Gaps = 33/365 (9%)

Query: 153 SQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPL 212
           + G+ EY    G G P ++F +  DT   ++ L+C+PC       DP F+P  SSS++ +
Sbjct: 82  APGALEYRVLAGYGAPAQRFPVAFDTNFGVSVLRCKPCVGG-APCDPAFEPSRSSSFAAI 140

Query: 213 PCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGC---G 269
           PC +P+C       C    C + + +G+ +   G LV +T++   S +  G   GC   G
Sbjct: 141 PCGSPECAV----ECTGASCPFTIQFGNVTVANGTLVRDTLTLPPSATFAGFTFGCIEVG 196

Query: 270 HDNEGLFVGSAGLLGLGGGMLSLTKQI-------KATSLAYCLVDRDSPAS-GVLEFNSA 321
            D +  F G+ GL+ L     SL  ++        A + +YCL    + +S G L   ++
Sbjct: 197 ADAD-TFDGAVGLIDLSRSSHSLASRVISNGATTSAAAFSYCLPSSSATSSRGFLSIGAS 255

Query: 322 R----GGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCG 377
           R    GGD   AP+  N      Y+V L G SVGG+ + +PP++F        G +++  
Sbjct: 256 RPEYSGGDIKYAPMSSNPNHPNSYFVDLVGISVGGEDLPVPPAVFAAH-----GTLLEAA 310

Query: 378 TAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKA 437
           T  T L   AY +LRD+F +            + DTCY+ +GL S+ VP V+L F  G  
Sbjct: 311 TEFTFLAPAAYAALRDAFRKDMAPYPAAPPFRVLDTCYNLTGLASLAVPAVALRFAGGTE 370

Query: 438 LDLPAKNYLIPVDSAGTFC-------FAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGF 490
           L+L  +  +   D +  F         A    +  +S+IG + Q+ T V +DL   RVGF
Sbjct: 371 LELDVRQMMYFADPSSVFSSVACLAFAAAPLPAFPVSVIGTLAQRSTEVVYDLRGGRVGF 430

Query: 491 TPNKC 495
            P +C
Sbjct: 431 IPGRC 435


>gi|357515001|ref|XP_003627789.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355521811|gb|AET02265.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 415

 Score =  166 bits (421), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 120/361 (33%), Positives = 172/361 (47%), Gaps = 34/361 (9%)

Query: 145 STPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPK 204
           STP  S  +   GEY     +GTPP +    +DTGSD+ WLQC PC +CY Q  PIFDP 
Sbjct: 75  STPQ-STVNSDKGEYLMSYSIGTPPFKVFGFVDTGSDLVWLQCEPCKQCYPQITPIFDPS 133

Query: 205 TSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGI 264
            SSSY  +PC +  C S+  ++C             G  +V  L  ++ + G S S    
Sbjct: 134 LSSSYQNIPCLSDTCHSMRTTSCDVR----------GYLSVETLTLDSTT-GYSVSFPKT 182

Query: 265 ALGCGHDNEGLFVG-SAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASGVLEFNS 320
            +GCG+ N G F G S+G++GLG G +SL  Q+  +     +YCL      ++  L F  
Sbjct: 183 MIGCGYRNTGTFHGPSSGIVGLGSGPMSLPSQLGTSIGGKFSYCLGPWLPNSTSKLNFGD 242

Query: 321 A---RGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCG 377
           A    G  A+T P+++ K   + YY+ L  FSVG + ++     +  +E   G I++D G
Sbjct: 243 AAIVYGDGAMTTPIVK-KDAQSGYYLTLEAFSVGNKLIEFGGPTYGGNE---GNILIDSG 298

Query: 378 TAITRLQTQAYNSLRDS---FVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGA 434
           T  T L    Y     +   ++ L     P      F  CY+ +       P ++ HF  
Sbjct: 299 TTFTFLPYDVYYRFESAVAEYINLEHVEDPN---GTFKLCYNVA-YHGFEAPLITAHF-K 353

Query: 435 GKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNK 494
           G  + L   +  I V S G  C AF P+ +A  I GNV QQ   V ++L  N V F P  
Sbjct: 354 GADIKLYYISTFIKV-SDGIACLAFIPSQTA--IFGNVAQQNLLVGYNLVQNTVTFKPVD 410

Query: 495 C 495
           C
Sbjct: 411 C 411


>gi|118482048|gb|ABK92955.1| unknown [Populus trichocarpa]
          Length = 425

 Score =  166 bits (421), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 120/359 (33%), Positives = 170/359 (47%), Gaps = 21/359 (5%)

Query: 147 PVVSGAS-QGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKT 205
           P+ SG     S  Y  +  VGTP + F M LDT +D  W+ C  C  C   S  +F+  T
Sbjct: 77  PIASGRQIVQSPTYIVKANVGTPAQTFLMALDTSNDAAWIPCNGCVGC---SSTVFNSVT 133

Query: 206 SSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIA 265
           S+++  L C APQCK +    C  + C +   YG GS  + +L  +T++  ++  V G  
Sbjct: 134 STTFKTLGCDAPQCKQVPNPTCGGSTCTWNTTYG-GSTILSNLTRDTIAL-STDIVPGYT 191

Query: 266 LGCGHDNEGLFV---GSAGLLGLGGGMLSLTKQIKATSLAYCLVD-RDSPASGVLEFNSA 321
            GC     G  V   G  GL       LS T+ +  ++ +YCL   R    SG L    A
Sbjct: 192 FGCIQKTTGSSVPPQGLLGLGRGPLSFLSQTQDLYKSTFSYCLPSFRTLNFSGTLRLGPA 251

Query: 322 RGGDAV-TAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAI 380
                + T PL++N +  + YYV L G  VG + V IP S    +     G I D GT  
Sbjct: 252 GQPLRIKTTPLLKNPRRSSLYYVNLIGIRVGRKIVDIPASALAFNPTTGAGTIFDSGTVF 311

Query: 381 TRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDL 440
           TRL    Y ++RD F +  GN    S +  FDTCY       +  PT++  F +G  + L
Sbjct: 312 TRLVAPVYTAVRDEFRKRVGNAI-VSSLGGFDTCYT----GPIVAPTMTFMF-SGMNVTL 365

Query: 441 PAKNYLIPVDSAGTFCFAFAP----TSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           P  N LI   +  T C A A      +S L++I N+QQQ  R+ FD+ N+R+G     C
Sbjct: 366 PTDNLLIRSTAGSTSCLAMAAAPDNVNSVLNVIANMQQQNHRILFDVPNSRIGVAREPC 424


>gi|242056193|ref|XP_002457242.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
 gi|241929217|gb|EES02362.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
          Length = 457

 Score =  166 bits (420), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 127/362 (35%), Positives = 170/362 (46%), Gaps = 37/362 (10%)

Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDP----IFDPKTSSSYSPLP 213
           EY   + VGTPP Q   + DTGSD+ W+ C         +D     +F P  SS+YS L 
Sbjct: 102 EYLMYVNVGTPPTQLLAIADTGSDLVWVNCSSSGGGLADADAGGNVVFQPTRSSTYSQLS 161

Query: 214 CAAPQCKSLDVSACRAN-RCLYQVAYGDGSFTVGDLVTETVSF---GNSGSVK--GIALG 267
           C +  C++L  ++C A+  C YQ +YGDGS T+G L TET SF   G  G V+   +  G
Sbjct: 162 CQSNACQALSQASCDADSECQYQYSYGDGSRTIGVLSTETFSFVDGGGKGQVRVPRVNFG 221

Query: 268 CGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS-----LAYCLV-DRDSPASGVLEFNS- 320
           C   + G F  S GL+GLG G  SL  Q+ AT+     L+YCL+   D+ +S  L F S 
Sbjct: 222 CSTASAGTFR-SDGLVGLGAGAFSLVSQLGATTHIDRKLSYCLIPSYDANSSSTLNFGSR 280

Query: 321 --ARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGT 378
                  A + PL+ +  VD++Y V L   +VGGQ V             D  IIVD GT
Sbjct: 281 AVVSEPGAASTPLVPS-DVDSYYTVALESVAVGGQEVATH----------DSRIIVDSGT 329

Query: 379 AITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVR---VPTVSLHFGAG 435
            +T L       L     R     +      L   CYD  G        +P V+L FG G
Sbjct: 330 TLTFLDPALLGPLVTELERRIKLQRVQPPEQLLQLCYDVQGKSETDNFGIPDVTLRFGGG 389

Query: 436 KALDLPAKNYLIPVDSAGTFCFAFAPTSSA--LSIIGNVQQQGTRVSFDLANNRVGFTPN 493
            A+ L  +N    +   GT C    P S +  +SI+GN+ QQ   V +DL    V F   
Sbjct: 390 AAVTLRPENTFSLLQE-GTLCLVLVPVSESQPVSILGNIAQQNFHVGYDLDARTVTFAAA 448

Query: 494 KC 495
            C
Sbjct: 449 DC 450


>gi|20975624|emb|CAD31717.1| putative nucleoid DNA-binding protein [Cicer arietinum]
          Length = 144

 Score =  166 bits (419), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 78/144 (54%), Positives = 102/144 (70%)

Query: 352 GQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALF 411
           G  V I   +F ++E G+GG+++D GTA+TRL T AY++ RD+F+    NL  +S V++F
Sbjct: 1   GVRVPISEDVFRLNELGEGGVVMDTGTAVTRLPTAAYDAFRDAFIGQTTNLPRSSDVSIF 60

Query: 412 DTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGN 471
           DTCYD  G  SVRVPT+S +F  G  L LPA+N+LIPV+  GTFCFAFAP+ S LSIIGN
Sbjct: 61  DTCYDLYGFVSVRVPTISFYFLGGPILTLPARNFLIPVNDVGTFCFAFAPSPSGLSIIGN 120

Query: 472 VQQQGTRVSFDLANNRVGFTPNKC 495
           +QQ+G  +S D  N  VGF PN C
Sbjct: 121 IQQEGIEISVDGVNGFVGFGPNIC 144


>gi|255553149|ref|XP_002517617.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223543249|gb|EEF44781.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 449

 Score =  166 bits (419), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 120/366 (32%), Positives = 180/366 (49%), Gaps = 33/366 (9%)

Query: 155 GSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPC 214
           G GEY  RI +G P  +   + DTGSD+ W+QC+PC  CY+Q+ PIFDP+ SSSY  + C
Sbjct: 89  GGGEYLMRISIGNPQVEILAIADTGSDLIWVQCQPCEMCYKQNSPIFDPRRSSSYRNVLC 148

Query: 215 AAPQCKSLDVSA--CRA----NRCLYQVAYGDGSFTVGDLVTETVSFGNSGS-------- 260
               C  LD  A  C A      C Y  +YGD SF+ G L  E    G++ S        
Sbjct: 149 GNEFCNKLDGEARSCDARGFVKTCGYTYSYGDQSFSDGHLAIERFGIGSTNSNTSAAIAY 208

Query: 261 VKGIALGCGHDNEGLF-VGSAGLLGLGGGMLSLTKQIK---ATSLAYCLVDRDSPASGVL 316
            + +A GCG  N G F    +G++GLGGG +SL  Q+    +   +YCLV     ++   
Sbjct: 209 FQEVAFGCGTKNGGTFDELGSGIIGLGGGSMSLVSQLGPKLSGKFSYCLVPTSEQSNYTS 268

Query: 317 EFN-------SARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGD 369
           + N       S    + V+ PL+  KK +T+YY+ L   SV  +  ++P +     E   
Sbjct: 269 KINFGNDINISGSNYNVVSTPLLP-KKPETYYYLTLEAISVENK--RLPYTNLWNGEVEK 325

Query: 370 GGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVS 429
           G II+D GT +T L ++ +N+L  +        + +    LF+ C  F   +++ +P ++
Sbjct: 326 GNIIIDSGTTLTFLDSEFFNNLDSAVEEAVKGERVSDPHGLFNIC--FKDEKAIELPIIT 383

Query: 430 LHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVG 489
            HF  G  ++L   N    V+     CF   P S+ ++I GN+ Q    V +DL    V 
Sbjct: 384 AHF-TGADVELQPVNTFAKVEE-DLLCFTMIP-SNDIAIFGNLAQMNFLVGYDLEKKAVS 440

Query: 490 FTPNKC 495
           F P  C
Sbjct: 441 FLPTDC 446


>gi|54290724|dbj|BAD62394.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
          Length = 523

 Score =  166 bits (419), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 111/363 (30%), Positives = 174/363 (47%), Gaps = 33/363 (9%)

Query: 155 GSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPC 214
           G+ EY    G G P ++F +  DT   ++ L+C+PC       DP F+P  SSS++ +PC
Sbjct: 172 GALEYRVLAGYGAPAQRFPVAFDTNFGVSVLRCKPCVGG-APCDPAFEPSRSSSFAAIPC 230

Query: 215 AAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGC---GHD 271
            +P+C       C    C + + +G+ +   G LV +T++   S +  G   GC   G D
Sbjct: 231 GSPEC----AVECTGASCPFTIQFGNVTVANGTLVRDTLTLPPSATFAGFTFGCIEVGAD 286

Query: 272 NEGLFVGSAGLLGLGGGMLSLTKQI-------KATSLAYCLVDRDSPAS-GVLEFNSAR- 322
            +  F G+ GL+ L     SL  ++        A + +YCL    + +S G L   ++R 
Sbjct: 287 AD-TFDGAVGLIDLSRSSHSLASRVISNGATTSAAAFSYCLPSSSATSSRGFLSIGASRP 345

Query: 323 ---GGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTA 379
              GGD   AP+  N      Y+V L G SVGG+ + +PP++F        G +++  T 
Sbjct: 346 EYSGGDIKYAPMSSNPNHPNSYFVDLVGISVGGEDLPVPPAVFAAH-----GTLLEAATE 400

Query: 380 ITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALD 439
            T L   AY +LRD+F +            + DTCY+ +GL S+ VP V+L F  G  L+
Sbjct: 401 FTFLAPAAYAALRDAFRKDMAPYPAAPPFRVLDTCYNLTGLASLAVPAVALRFAGGTELE 460

Query: 440 LPAKNYLIPVDSAGTFC-------FAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTP 492
           L  +  +   D +  F         A    +  +S+IG + Q+ T V +DL   RVGF P
Sbjct: 461 LDVRQMMYFADPSSVFSSVACLAFAAAPLPAFPVSVIGTLAQRSTEVVYDLRGGRVGFIP 520

Query: 493 NKC 495
            +C
Sbjct: 521 GRC 523


>gi|326523839|dbj|BAJ96930.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 473

 Score =  166 bits (419), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 122/388 (31%), Positives = 193/388 (49%), Gaps = 37/388 (9%)

Query: 140 LPED--FSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQS 197
           +PE   F+ P+ SGA  G+G+YF +  VGTP + F +V DTGSD+ W++CR        +
Sbjct: 89  MPEASAFAMPLTSGAYTGTGQYFVQFRVGTPAQPFVLVADTGSDLTWVKCRGRRASSPDA 148

Query: 198 DP-----IFDPKTSSSYSPLPCAAPQCKS---LDVSACRANR-----CLYQVAYGDGSFT 244
            P     +F P  S S++P+PC++  CKS     ++ C A       C Y   Y D S  
Sbjct: 149 SPLASPRVFRPANSKSWAPIPCSSDTCKSYVPFSLANCSAGTTPPAPCGYDYRYKDKSSA 208

Query: 245 VGDLVTETVSFGNSGS-------VKGIALGCGHDNEGL-FVGSAGLLGLGGGMLSLTKQI 296
            G + T+  +   SGS       ++ + LGC    +G  F  S G+L LG   +S   + 
Sbjct: 209 RGVVGTDAATIALSGSGSDRKAKLQEVVLGCTTSYDGQSFQSSDGVLSLGNSNISFASRA 268

Query: 297 KAT---SLAYCLVDRDSP--ASGVLEFNSARGGDAVT-APLIRNKKVDTFYYVGLTGFSV 350
            A      +YCLVD  +P  A+  L F       + +  PL+ + +V  FY V +   SV
Sbjct: 269 AARFGGRFSYCLVDHLAPRNATSYLTFGPVGAAHSPSRTPLLLDAQVAPFYAVTVDAVSV 328

Query: 351 GGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVAL 410
            G+A+ IP  ++++ +  +GG I+D GT++T L T AY ++  +  +    + P   +  
Sbjct: 329 AGKALNIPAEVWDVKK--NGGAILDSGTSLTILATPAYKAVVAALSKQLARV-PRVTMDP 385

Query: 411 FDTCYDFSGLRS-VRVPTVSLHFGAGKALDLPAKNYLIPVDSA-GTFCFAFAP-TSSALS 467
           F+ CY+++  R    VP + + F     L  P K+Y+I  D+A G  C          +S
Sbjct: 386 FEYCYNWTATRRPPAVPRLEVRFAGSARLRPPTKSYVI--DAAPGVKCIGLQEGVWPGVS 443

Query: 468 IIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           +IGN+ QQ     FDLAN  + F  ++C
Sbjct: 444 VIGNILQQEHLWEFDLANRWLRFQESRC 471


>gi|224057272|ref|XP_002299201.1| predicted protein [Populus trichocarpa]
 gi|118483775|gb|ABK93780.1| unknown [Populus trichocarpa]
 gi|222846459|gb|EEE84006.1| predicted protein [Populus trichocarpa]
          Length = 425

 Score =  166 bits (419), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 120/359 (33%), Positives = 170/359 (47%), Gaps = 21/359 (5%)

Query: 147 PVVSGAS-QGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKT 205
           P+ SG     S  Y  +  VGTP + F M LDT +D  W+ C  C  C   S  +F+  T
Sbjct: 77  PIASGRQIVQSPTYIVKANVGTPAQTFLMALDTSNDAAWIPCNGCVGC---SSTVFNSVT 133

Query: 206 SSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIA 265
           S+++  L C APQCK +    C  + C +   YG GS  + +L  +T++  ++  V G  
Sbjct: 134 STTFKTLGCDAPQCKQVPNPTCGGSTCTWNTTYG-GSTILSNLTRDTIAL-STDIVPGYT 191

Query: 266 LGCGHDNEGLFV---GSAGLLGLGGGMLSLTKQIKATSLAYCLVD-RDSPASGVLEFNSA 321
            GC     G  V   G  GL       LS T+ +  ++ +YCL   R    SG L    A
Sbjct: 192 FGCIQKTTGSSVPPQGLLGLGRGPLSFLSQTQDLYKSTFSYCLPSFRTLNFSGTLRLGPA 251

Query: 322 RGGDAV-TAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAI 380
                + T PL++N +  + YYV L G  VG + V IP S    +     G I D GT  
Sbjct: 252 GQPLRIKTTPLLKNPRRSSLYYVNLIGIRVGRKIVDIPASALAFNPTTGAGTIFDSGTVF 311

Query: 381 TRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDL 440
           TRL    Y ++RD F +  GN    S +  FDTCY       +  PT++  F +G  + L
Sbjct: 312 TRLVAPVYTAVRDEFRKRVGNAI-VSSLGGFDTCYT----GPIVAPTMTFMF-SGMNVTL 365

Query: 441 PAKNYLIPVDSAGTFCFAFAP----TSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           P  N LI   +  T C A A      +S L++I N+QQQ  R+ FD+ N+R+G     C
Sbjct: 366 PPDNLLIRSTAGSTSCLAMAAAPDNVNSVLNVIANMQQQNHRILFDVPNSRIGVAREPC 424


>gi|449449334|ref|XP_004142420.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
          Length = 441

 Score =  165 bits (418), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 122/354 (34%), Positives = 172/354 (48%), Gaps = 28/354 (7%)

Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCA 215
           S  +  R  +GTP +   + LDT +D  W+ C  C  C   S  +F    SSS+ PLPC 
Sbjct: 100 SPTFVVRAKIGTPAQTLLLALDTSNDAAWIPCSGCIGC--PSTTVFSSDKSSSFRPLPCQ 157

Query: 216 APQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGL 275
           +PQC  +   +C  + C + + YG  S    DLV + ++   + SV     GC     G 
Sbjct: 158 SPQCNQVPNPSCSGSACGFNLTYG-SSTVAADLVQDNLTLA-TDSVPSYTFGCIRKATGS 215

Query: 276 FVGSAGLLGLGGGMLSLTKQIKA---TSLAYCLVDRDSPASGVLEFN-SARGGDAVT--- 328
            V   GLLGLG G LSL  Q ++   ++ +YCL     P+   + F+ S R G       
Sbjct: 216 SVPPQGLLGLGRGPLSLLGQSQSLYQSTFSYCL-----PSFKSVNFSGSLRLGPVAQPIR 270

Query: 329 ---APLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQT 385
               PL+RN +  + YYV L    VG + V IPPS    + A   G ++D GT  TRL  
Sbjct: 271 IKYTPLLRNPRRSSLYYVNLISIRVGRKIVDIPPSALAFNSATGAGTVIDSGTTFTRLVA 330

Query: 386 QAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNY 445
            AY ++RD F R  G     S +  FDTCY       +  PT++  F AG  + LP  N+
Sbjct: 331 PAYTAVRDEFRRRVGRNVTVSSLGGFDTCYTV----PIISPTITFMF-AGMNVTLPPDNF 385

Query: 446 LIPVDSAGTFCFAFAP----TSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           LI   +  T C A A      +S L++I ++QQQ  R+ FD+ N+RVG     C
Sbjct: 386 LIHSTAGSTTCLAMAAAPDNVNSVLNVIASMQQQNHRILFDIPNSRVGVARESC 439


>gi|449527515|ref|XP_004170756.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial [Cucumis
           sativus]
          Length = 364

 Score =  165 bits (418), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 123/349 (35%), Positives = 170/349 (48%), Gaps = 18/349 (5%)

Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCA 215
           S  +  R  +GTP +   + LDT +D  W+ C  C  C   S  +F    SSS+ PLPC 
Sbjct: 23  SPTFVVRAKIGTPAQTLLLALDTSNDAAWIPCSGCIGC--PSTTVFSSDKSSSFRPLPCQ 80

Query: 216 APQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGL 275
           +PQC  +   +C  + C + + YG  S    DLV + ++   + SV     GC     G 
Sbjct: 81  SPQCNQVPNPSCSGSACGFNLTYGS-STVAADLVQDNLTLA-TDSVPSYTFGCIRKATGS 138

Query: 276 FVGSAGLLGLGGGMLSLTKQIKA---TSLAYCLVDRDSPA-SGVLEFNS-ARGGDAVTAP 330
            V   GLLGLG G LSL  Q ++   ++ +YCL    S   SG L     A+       P
Sbjct: 139 SVPPQGLLGLGRGPLSLLGQSQSLYQSTFSYCLPSFKSVNFSGSLRLGPVAQPIRIKYTP 198

Query: 331 LIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNS 390
           L+RN +  + YYV L    VG + V IPPS    + A   G ++D GT  TRL   AY +
Sbjct: 199 LLRNPRRSSLYYVNLISIRVGRKIVDIPPSALAFNSATGAGTVIDSGTTFTRLVAPAYTA 258

Query: 391 LRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVD 450
           +RD F R  G     S +  FDTCY       +  PT++  F AG  + LP  N+LI   
Sbjct: 259 VRDEFRRRVGRNVTVSSLGGFDTCYTV----PIISPTITFMF-AGMNVTLPPDNFLIHST 313

Query: 451 SAGTFCFAFAP----TSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           S  T C A A      +S L++I ++QQQ  R+ FD+ N+RVG     C
Sbjct: 314 SGSTTCLAMAAAPDNVNSVLNVIASMQQQNHRILFDIPNSRVGVARESC 362


>gi|326520291|dbj|BAK07404.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 464

 Score =  165 bits (418), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 129/387 (33%), Positives = 180/387 (46%), Gaps = 38/387 (9%)

Query: 144 FSTPVVSGASQ-GSGEYFSRIGVGTP-PRQFSMVLDTGSDINWLQCRPCTECYQQSDPIF 201
            + PV  G S  GS EY   +G+GTP P++  + LDTGSD+ W QC  CT C+ Q  P+F
Sbjct: 78  LTAPVDHGGSDVGSSEYLIHLGIGTPRPQRVVLHLDTGSDLVWTQCA-CTVCFDQPVPVF 136

Query: 202 DPKTSSSYSPLPCAAPQCKS---LDVSAC--RANRCLYQVAYGDGSFTVGDLVTETVSFG 256
               S ++S +PC+ P C     L +S C  R   C Y   Y D S T G +  +T +F 
Sbjct: 137 RASVSHTFSRVPCSDPLCGHAVYLPLSGCAARDRSCFYAYGYMDHSITTGKMAEDTFTFK 196

Query: 257 ------NSGSVKGIALGCGHDNEGLFV-GSAGLLGLGGGMLSLTKQIKATSLAYCLV--- 306
                  + +V  I  GCG  N GLF    +G+ G G G LSL  Q+K    +YC     
Sbjct: 197 APDRADTAAAVPNIRFGCGMMNYGLFTPNQSGIAGFGTGPLSLPSQLKVRRFSYCFTAME 256

Query: 307 -DRDSPA--SGVLEFNSARGGDAVT----APLIRNKKVDT--FYYVGLTGFSVGGQAVQI 357
             R SP    G  E   A     +     AP      V +  FY++ L G +VG   +  
Sbjct: 257 ESRVSPVILGGEPENIEAHATGPIQSTPFAPGPAGAPVGSQPFYFLSLRGVTVGETRLPF 316

Query: 358 PPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDF 417
             S F +   G GG  +D GTAIT      + SLR++FV     L    G    D    F
Sbjct: 317 NASTFALKGDGSGGTFIDSGTAITFFPQAVFRSLREAFVAQV-PLPVAKGYTDPDNLLCF 375

Query: 418 S---GLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGT-----FCFA-FAPTSSALSI 468
           S     ++  VP + LH   G   +LP +NY++  D  G+      C    +  +S  +I
Sbjct: 376 SVPAKKKAPAVPKLILHL-EGADWELPRENYVLDNDDDGSGAGRKLCVVILSAGNSNGTI 434

Query: 469 IGNVQQQGTRVSFDLANNRVGFTPNKC 495
           IGN QQQ   + +DL +N++ F P +C
Sbjct: 435 IGNFQQQNMHIVYDLESNKMVFAPARC 461


>gi|30686482|ref|NP_850251.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|122215044|sp|Q3EBM5.1|ASPR1_ARATH RecName: Full=Probable aspartic protease At2g35615; Flags:
           Precursor
 gi|330254036|gb|AEC09130.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 447

 Score =  165 bits (417), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 117/377 (31%), Positives = 186/377 (49%), Gaps = 32/377 (8%)

Query: 146 TPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKT 205
           T + SG     GE+F  I +GTPP +   + DTGSD+ W+QC+PC +CY+++ PIFD K 
Sbjct: 72  TDLQSGLIGADGEFFMSITIGTPPIKVFAIADTGSDLTWVQCKPCQQCYKENGPIFDKKK 131

Query: 206 SSSYSPLPCAAPQCKSLDVS--ACRA--NRCLYQVAYGDGSFTVGDLVTETVSF----GN 257
           SS+Y   PC +  C++L  +   C    N C Y+ +YGD SF+ GD+ TETVS     G+
Sbjct: 132 SSTYKSEPCDSRNCQALSSTERGCDESNNICKYRYSYGDQSFSKGDVATETVSIDSASGS 191

Query: 258 SGSVKGIALGCGHDNEGLFVGSAGLLGLGGGM-LSLTKQIKAT---SLAYCLVDRDSPAS 313
             S  G   GCG++N G F  +   +   GG  LSL  Q+ ++     +YCL  + +  +
Sbjct: 192 PVSFPGTVFGCGYNNGGTFDETGSGIIGLGGGHLSLISQLGSSISKKFSYCLSHKSATTN 251

Query: 314 GVLEFN---------SARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEM 364
           G    N          ++    V+ PL+ +K+  T+YY+ L   SVG + +    S +  
Sbjct: 252 GTSVINLGTNSIPSSLSKDSGVVSTPLV-DKEPLTYYYLTLEAISVGKKKIPYTGSSYNP 310

Query: 365 DEAG-----DGGIIVDCGTAITRLQTQAYNSLRDSFVR-LAGNLKPTSGVALFDTCYDFS 418
           ++ G      G II+D GT +T L+   ++    +    + G  + +    L   C+  S
Sbjct: 311 NDDGILSETSGNIIIDSGTTLTLLEAGFFDKFSSAVEESVTGAKRVSDPQGLLSHCFK-S 369

Query: 419 GLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTR 478
           G   + +P +++HF  G  + L   N  + + S    C +  PT+  ++I GN  Q    
Sbjct: 370 GSAEIGLPEITVHF-TGADVRLSPINAFVKL-SEDMVCLSMVPTTE-VAIYGNFAQMDFL 426

Query: 479 VSFDLANNRVGFTPNKC 495
           V +DL    V F    C
Sbjct: 427 VGYDLETRTVSFQHMDC 443


>gi|18391062|ref|NP_563851.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|2160166|gb|AAB60729.1| F21M12.13 gene product [Arabidopsis thaliana]
 gi|21593996|gb|AAM65914.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
 gi|26983826|gb|AAN86165.1| unknown protein [Arabidopsis thaliana]
 gi|332190367|gb|AEE28488.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 449

 Score =  165 bits (417), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 129/391 (32%), Positives = 186/391 (47%), Gaps = 24/391 (6%)

Query: 122 LAIYNVDRHELKPAEAQIL--PEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTG 179
           L + + D H L    + +   P+  S PV SG     G Y  R  +GTPP+   MVLDT 
Sbjct: 65  LHMASSDSHRLTYLSSLVAGKPKPTSVPVASGNQLHIGNYVVRAKLGTPPQLMFMVLDTS 124

Query: 180 SDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSAC-----RANRCLY 234
           +D  WL C  C+ C   +   F+  +SS+YS + C+  QC       C     + + C +
Sbjct: 125 NDAVWLPCSGCSGC-SNASTSFNTNSSSTYSTVSCSTAQCTQARGLTCPSSSPQPSVCSF 183

Query: 235 QVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTK 294
             +YG  S     LV +T++      +   + GC +   G  +   GL+GLG G +SL  
Sbjct: 184 NQSYGGDSSFSASLVQDTLTLA-PDVIPNFSFGCINSASGNSLPPQGLMGLGRGPMSLVS 242

Query: 295 Q---IKATSLAYCLVD-RDSPASGVLEFNSARGGDAVT-APLIRNKKVDTFYYVGLTGFS 349
           Q   + +   +YCL   R    SG L+        ++   PL+RN +  + YYV LTG S
Sbjct: 243 QTTSLYSGVFSYCLPSFRSFYFSGSLKLGLLGQPKSIRYTPLLRNPRRPSLYYVNLTGVS 302

Query: 350 VGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVA 409
           VG   V + P     D     G I+D GT ITR     Y ++RD F R   N+   S + 
Sbjct: 303 VGSVQVPVDPVYLTFDANSGAGTIIDSGTVITRFAQPVYEAIRDEF-RKQVNVSSFSTLG 361

Query: 410 LFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTF-CFAFA----PTSS 464
            FDTC  FS       P ++LH  +   L LP +N LI   SAGT  C + A      ++
Sbjct: 362 AFDTC--FSADNENVAPKITLHMTSLD-LKLPMENTLI-HSSAGTLTCLSMAGIRQNANA 417

Query: 465 ALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            L++I N+QQQ  R+ FD+ N+R+G  P  C
Sbjct: 418 VLNVIANLQQQNLRILFDVPNSRIGIAPEPC 448


>gi|242091325|ref|XP_002441495.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
 gi|241946780|gb|EES19925.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
          Length = 466

 Score =  165 bits (417), Expect = 6e-38,   Method: Compositional matrix adjust.
 Identities = 123/376 (32%), Positives = 186/376 (49%), Gaps = 38/376 (10%)

Query: 145 STPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPK 204
           S P+ SGA  G+G+YF ++ VGTP ++F++V DTGSD+ W++C   +   +    +F PK
Sbjct: 102 SLPMSSGAYSGTGQYFVKLRVGTPVQEFTLVADTGSDLTWVKCAGASPPGR----VFRPK 157

Query: 205 TSSSYSPLPCAAPQCKSLDVSACRAN------RCLYQVAYGDGSFTVGDLV-TETVSF-- 255
           TS S++P+PC++  CK LDV    AN       C Y   Y +GS     +V TE+ +   
Sbjct: 158 TSRSWAPIPCSSDTCK-LDVPFTLANCSSPASPCTYDYRYKEGSAGARGIVGTESATIAL 216

Query: 256 --GNSGSVKGIALGCGHDNEGL-FVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRD 309
             G    +K + LGC   ++G  F  + G+L LG   +S   Q  A    S +YCLVD  
Sbjct: 217 PGGKVAQLKDVVLGCSSSHDGQSFRSADGVLSLGNAKISFATQAAARFGGSFSYCLVDHL 276

Query: 310 SP--ASGVLEFNSARGGDAVTAPLIRNK----KVDTFYYVGLTGFSVGGQAVQIPPSLFE 363
           +P  A+G L F     G     P  + K        FY V +    V G+A+ IP    E
Sbjct: 277 APRNATGYLAFGP---GQVPRTPATQTKLFLDPEMPFYGVKVDAIHVAGKALDIPA---E 330

Query: 364 MDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSV 423
           + +A  GG+I+D G  +T L   AY ++  +  +    + P      F+ CY+++  R  
Sbjct: 331 VWDAKSGGVILDSGNTLTVLAAPAYKAVVAALSKHLDGV-PKVSFPPFEHCYNWTARRPG 389

Query: 424 R---VPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTS-SALSIIGNVQQQGTRV 479
               +P +++ F     L+ PAK+Y+I V   G  C          LS+IGN+ QQ    
Sbjct: 390 APEIIPKLAVQFAGSARLEPPAKSYVIDVKP-GVKCIGVQEGEWPGLSVIGNIMQQEHLW 448

Query: 480 SFDLANNRVGFTPNKC 495
            FDL N +V F  + C
Sbjct: 449 EFDLKNMQVRFKQSNC 464


>gi|242073262|ref|XP_002446567.1| hypothetical protein SORBIDRAFT_06g018180 [Sorghum bicolor]
 gi|241937750|gb|EES10895.1| hypothetical protein SORBIDRAFT_06g018180 [Sorghum bicolor]
          Length = 453

 Score =  164 bits (416), Expect = 7e-38,   Method: Compositional matrix adjust.
 Identities = 120/372 (32%), Positives = 176/372 (47%), Gaps = 41/372 (11%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAA 216
           GEY  ++G+GTP   FS  +DT SD+ WLQC+PC  CY+Q DPIF+P+ SSSY+ +PC++
Sbjct: 86  GEYLVKLGIGTPQHYFSAAIDTASDLVWLQCQPCVSCYRQLDPIFNPRLSSSYAVVPCSS 145

Query: 217 PQCKSLDVSACRANR---CLYQVAYGDGSFTVGDLVTETVSFGNSGSV-KGIALGCGHDN 272
             C  LD   C  +    C Y   Y   + T G L  + ++ G  G+V   + LGC   +
Sbjct: 146 DTCSQLDGHRCDEDDDQACRYNYKYSGNAVTNGTLAIDKLAVG--GNVFHAVVLGCSDSS 203

Query: 273 EGLFVGSA-GLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGDA----- 326
            G     A GL+GL  G LSL  Q+      YCL    S   G L   +  G DA     
Sbjct: 204 VGGPPPQASGLVGLARGPLSLLSQLSVRRFMYCLPPPMSRTPGKLVLGAGAGADAVRNVS 263

Query: 327 --VTAPLIRNKKVDTFYYVGLTGFSVGGQ---AVQIPPS------------LFEMDEAGD 369
             VT  +  + +  ++YY+   G +VG Q    ++ P S                  A  
Sbjct: 264 DRVTVTMSSSTRYPSYYYLNFDGLAVGDQTPGTIRRPTSPPATGGGVGGGGGDGGSGANA 323

Query: 370 GGIIVDCGTAITRLQTQAYNSLRDSF---VRLAGNLKPTSGVALFDTCY---DFSGLRSV 423
            G+IVD  + I+ L+   Y+ L D     +RL     P++ + L D C+   +  G+  V
Sbjct: 324 YGMIVDVASTISFLEASLYDELADDLEEEIRLP-RATPSTRLGL-DLCFILPEGVGIDRV 381

Query: 424 RVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDL 483
            VPTVS+ F  G+ L+L      +  +     C     T S +SI+GN QQQ   V ++L
Sbjct: 382 YVPTVSMSFD-GRWLELERDRLFL--EDGRMMCLMIGRT-SGVSILGNYQQQNMHVLYNL 437

Query: 484 ANNRVGFTPNKC 495
              ++ F    C
Sbjct: 438 RRGKITFAKASC 449


>gi|414587000|tpg|DAA37571.1| TPA: hypothetical protein ZEAMMB73_036171 [Zea mays]
          Length = 459

 Score =  164 bits (416), Expect = 8e-38,   Method: Compositional matrix adjust.
 Identities = 122/373 (32%), Positives = 167/373 (44%), Gaps = 37/373 (9%)

Query: 155 GSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPC 214
           G GEY  ++G GTP   FS  +DT SD+ W+QC+PC  CY+Q DP+F+PK SSSY+ +PC
Sbjct: 88  GGGEYLVKLGTGTPQHFFSAAIDTASDLVWMQCQPCVSCYRQLDPVFNPKLSSSYAVVPC 147

Query: 215 AAPQCKSLDVSACRANR---CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHD 271
            +  C  LD   C  +    C Y   Y     T G L  + ++ G       +  GC   
Sbjct: 148 TSDTCAQLDGHRCHEDDDGACQYTYKYSGHGVTKGTLAIDKLAIGGD-VFHAVVFGCSDS 206

Query: 272 NEGLFVGSA-GLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGG-----D 325
           + G     A GL+GLG G LSL  Q+      YCL    S  SG L   +         D
Sbjct: 207 SVGGPAAQASGLVGLGRGPLSLVSQLSVHRFMYCLPPPMSRTSGKLVLGAGADAVRNMSD 266

Query: 326 AVTAPLIRNKKVDTFYYVGLTGFSVGGQA------VQIPPSLFEMDEAGDG--------- 370
            VT  +  + +  ++YY+ L G +VG Q          PPS       G G         
Sbjct: 267 RVTVTMSSSTRYPSYYYLNLDGLAVGDQTPGTTRNATSPPSGGAGGGGGGGGGGIVGAGG 326

Query: 371 ----GIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVAL-FDTCY---DFSGLRS 422
               G+IVD  + I+ L+T  Y+ L D         + T  + L  D C+   +  G+  
Sbjct: 327 ANAYGMIVDVASTISFLETSLYDELADDLEEEIRLPRATPSLRLGLDLCFILPEGVGMDR 386

Query: 423 VRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFD 482
           V VPTVSL F  G+ L+L        V      C     T S +SI+GN Q Q  RV F+
Sbjct: 387 VYVPTVSLSFD-GRWLELDRDRLF--VTDGRMMCLMIGRT-SGVSILGNFQLQNMRVLFN 442

Query: 483 LANNRVGFTPNKC 495
           L   ++ F    C
Sbjct: 443 LRRGKITFAKASC 455


>gi|356557010|ref|XP_003546811.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 437

 Score =  164 bits (416), Expect = 8e-38,   Method: Compositional matrix adjust.
 Identities = 111/359 (30%), Positives = 171/359 (47%), Gaps = 32/359 (8%)

Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCA 215
           +GEY   + +GTPP +   + DTGSD+ W+QC PC  C+ Q  P+F+P  SS++    C 
Sbjct: 89  NGEYLMTLYIGTPPVERLAIADTGSDLIWVQCSPCQNCFPQDTPLFEPLKSSTFKAATCD 148

Query: 216 APQCKSLDVS--AC-RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIA-----LG 267
           +  C S+  S   C +  +C+Y  +YGD SFTVG + TET+SFG++G  + ++      G
Sbjct: 149 SQPCTSVPPSQRQCGKVGQCIYSYSYGDKSFTVGVVGTETLSFGSTGDAQTVSFPSSIFG 208

Query: 268 CG-------HDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNS 320
           CG       H ++ +        G    +  L  QI     +YCL+   S ++  L+F S
Sbjct: 209 CGVYNNFTFHTSDKVTGLVGLGGGPLSLVSQLGPQI-GYKFSYCLLPFSSNSTSKLKFGS 267

Query: 321 ---ARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCG 377
                    V+ PLI      +FY++ L   ++G + V             DG II+D G
Sbjct: 268 EAIVTTNGVVSTPLIIKPLFPSFYFLNLEAVTIGQKVVPT--------GRTDGNIIIDSG 319

Query: 378 TAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKA 437
           T +T L+   YN+   S   +            F  C+ +   R + +P ++  F  G +
Sbjct: 320 TVLTYLEQTFYNNFVASLQEVLSVESAQDLPFPFKFCFPY---RDMTIPVIAFQF-TGAS 375

Query: 438 LDLPAKNYLIPVDSAGTFCFAFAPTS-SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           + L  KN LI +      C A  P+S S +SI GNV Q   +V +DL   +V F P  C
Sbjct: 376 VALQPKNLLIKLQDRNMLCLAVVPSSLSGISIFGNVAQFDFQVVYDLEGKKVSFAPTDC 434


>gi|359476204|ref|XP_002262813.2| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-2-like [Vitis vinifera]
          Length = 460

 Score =  164 bits (415), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 128/401 (31%), Positives = 187/401 (46%), Gaps = 47/401 (11%)

Query: 108 RDSARVNTLITKL-QLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVG 166
           RD +RV+ + +K  Q    N+  H        +  ED             G +   +  G
Sbjct: 92  RDESRVSFINSKCNQYTSGNLKNHA---HNNNLFDED-------------GNFLVDVAFG 135

Query: 167 TPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSA 226
           TP  +  ++LDTGS I W QC+ C  C Q S+  FD   SS+YS   C          S 
Sbjct: 136 TPXTEIXLILDTGSSITWTQCKACVNCLQDSNRYFDSSASSTYSFGSCIP--------ST 187

Query: 227 CRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLF-VGSAGLLGL 285
              N   Y + YGD S +VG+   +T++   S   +    GCG +N+G F  G  G+LGL
Sbjct: 188 VENN---YNMTYGDDSTSVGNYGCDTMTLEPSDVFQKFQFGCGRNNKGDFGSGVDGMLGL 244

Query: 286 GGGMLSLTKQIKA---TSLAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNK----KVD 338
           G G LS   Q  +      +YCL + DS  S +    +     ++    + N     +  
Sbjct: 245 GQGQLSTVSQTASKFNKVFSYCLPEEDSIGSLLFGEKATSQSSSLKFTSLVNGPGTLQES 304

Query: 339 TFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRL 398
            +Y+V L+  SVG + + IP S+F        G I+D  T ITRL  +AY++L+ +F + 
Sbjct: 305 GYYFVNLSDISVGNERLNIPSSVFASP-----GTIIDSRTVITRLPQRAYSALKAAFKKA 359

Query: 399 AGNLKPTSGVA----LFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGT 454
                 ++G      + DTCY+ SG + V +P + LHFG G  + L   N +   D A  
Sbjct: 360 MAKYPLSNGRRKKGDILDTCYNLSGRKDVLLPEIVLHFGGGADVRLNGTNIVWGSD-ASR 418

Query: 455 FCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            C AFA TS  L+IIGN QQ    V +D+   R+GF  N C
Sbjct: 419 LCLAFAGTSE-LTIIGNRQQLSLTVLYDIQGRRIGFGGNGC 458


>gi|356542355|ref|XP_003539632.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 444

 Score =  164 bits (414), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 126/363 (34%), Positives = 179/363 (49%), Gaps = 29/363 (7%)

Query: 152 ASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSP 211
           ASQG  EY     VGTPP Q   ++DTGSDI WLQC+PC +CY Q+ PIFDP  S +Y  
Sbjct: 89  ASQG--EYLMSYSVGTPPFQILGIVDTGSDIIWLQCQPCEDCYNQTTPIFDPSQSKTYKT 146

Query: 212 LPCAAPQCKSLDVSA-CRAN--RCLYQVAYGDGSFTVGDLVTETVSFGNS--GSVK--GI 264
           LPC++  C+S+  +A C +N   C Y + YGD S + GDL  ET++ G++   SV+    
Sbjct: 147 LPCSSNICQSVQSAASCSSNNDECEYTITYGDNSHSQGDLSVETLTLGSTDGSSVQFPKT 206

Query: 265 ALGCGHDNEGLFVGSAGLLGLGG----GMLSLTKQIKATSLAYCLVD--RDSPASGVLEF 318
            +GCGH+N+G F      +   G     ++S          +YCL      S +S  L F
Sbjct: 207 VIGCGHNNKGTFQREGSGIVGLGGGPVSLISQLSSSIGGKFSYCLAPLFSQSNSSSKLNF 266

Query: 319 NS---ARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVD 375
                  G   V+ P++    +  FY++ L  FSVG   ++   S       G+G II+D
Sbjct: 267 GDEAVVSGRGTVSTPIVPKNGLG-FYFLTLEAFSVGDNRIEF-GSSSFESSGGEGNIIID 324

Query: 376 CGTAITRLQTQAYNSLRDSF---VRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHF 432
            GT +T L    Y +L  +    + L     P+  + L   CY  +    + VP ++ HF
Sbjct: 325 SGTTLTILPEDDYLNLESAVADAIELERVEDPSKFLRL---CYRTTSSDELNVPVITAHF 381

Query: 433 GAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTP 492
             G  ++L   +  I VD  G  CFAF  +S    I GN+ QQ   V +DL    V F P
Sbjct: 382 -KGADVELNPISTFIEVDE-GVVCFAFR-SSKIGPIFGNLAQQNLLVGYDLVKQTVSFKP 438

Query: 493 NKC 495
             C
Sbjct: 439 TDC 441


>gi|357132618|ref|XP_003567926.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 468

 Score =  164 bits (414), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 124/397 (31%), Positives = 197/397 (49%), Gaps = 42/397 (10%)

Query: 131 ELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQC--- 187
           E  PAE+      F+ P+ SGA  G+G+YF R+ VGTP + F +V DTGSD+ W++C   
Sbjct: 80  ETSPAESSA----FAMPLTSGAYTGTGQYFVRLRVGTPAQPFVLVADTGSDLTWVKCSSP 135

Query: 188 --RPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKS---LDVSACRA--NRCLYQVAYGD 240
                +        +F P  S S+SPLPC +  CKS     ++ C +  + C Y   Y D
Sbjct: 136 SSSSSSPAASPPQRVFRPAGSKSWSPLPCDSDTCKSYVPFSLANCSSPPDPCSYDYRYKD 195

Query: 241 GSFTVG--DLVTETVSF-GNSGSVKG----IALGCGHDNEGL-FVGSAGLLGLGGGMLSL 292
            S   G   L + TVS  GN G+ K     + LGC    +G  F  S G+L LG   +S 
Sbjct: 196 NSSARGVVGLDSATVSLSGNDGTRKAKLQEVVLGCTTSYDGQSFKSSDGVLSLGNSNISF 255

Query: 293 TKQIKAT---SLAYCLVDRDSP--ASGVLEFNSARGGDAVTAP-------LIRNKKVDTF 340
             +  +      +YCLVD  +P  A+  L F +        +        L+ + +   F
Sbjct: 256 ASRAASRFGGRFSYCLVDHLAPRNATSFLTFGNGDSSPGDDSSSRRTPLVLLEDARTRPF 315

Query: 341 YYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAG 400
           Y+V +   +V G+ ++I P +++  +  +GG I+D GT++T L T AY+++  +  +   
Sbjct: 316 YFVSVDAVTVAGERLEILPDVWDFRK--NGGAILDSGTSLTILATPAYDAVVKAISKQFA 373

Query: 401 NLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSA-GTFCFAF 459
            + P   +  F+ CY+++G+ S  +P + L F     L  P K+Y+I  D+A G  C   
Sbjct: 374 GV-PRVNMDPFEYCYNWTGV-SAEIPRMELRFAGAATLAPPGKSYVI--DTAPGVKCIGV 429

Query: 460 APTS-SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
              +   +S+IGN+ QQ     FDLAN  + F  ++C
Sbjct: 430 VEGAWPGVSVIGNILQQEHLWEFDLANRWLRFKQSRC 466


>gi|297846526|ref|XP_002891144.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297336986|gb|EFH67403.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 445

 Score =  164 bits (414), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 117/377 (31%), Positives = 184/377 (48%), Gaps = 34/377 (9%)

Query: 146 TPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKT 205
           T + SG     GEYF  I +GTPP +F  + DTGSD+ W+QC+PC +CY+Q+ P+FD K 
Sbjct: 72  TDLQSGLISNGGEYFMSISIGTPPSKFLAIADTGSDLTWVQCKPCQQCYKQNTPLFDKKK 131

Query: 206 SSSYSPLPCAAPQCKSLD--VSACRANR--CLYQVAYGDGSFTVGDLVTETVSFGNSG-- 259
           SS+Y    C +  C +L      C  +R  C Y+ +YGD SFT G++ TET+S  +S   
Sbjct: 132 SSTYKTESCDSITCNALSEHEEGCDESRNACKYRYSYGDESFTKGEVATETISIDSSSGS 191

Query: 260 --SVKGIALGCGHDNEGLFVGSAGLLGLGGGM-LSLTKQIKAT---SLAYCLVDRDSPAS 313
             S  G A GCG++N G F  +   +   GG  LSL  Q+ ++     +YCL    +  +
Sbjct: 192 PVSFPGTAFGCGYNNGGTFEETGSGIIGLGGGPLSLVSQLGSSIGKKFSYCLSHTSATTN 251

Query: 314 GVLEFN---------SARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPS---- 360
           G    N          ++    +T PLI+ K  +T+Y++ L   +VG    ++P +    
Sbjct: 252 GTSVINLGTNSMTSKPSKDSAILTTPLIQ-KDPETYYFLTLEAITVG--KTKLPYTGGGG 308

Query: 361 -LFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVR-LAGNLKPTSGVALFDTCYDFS 418
                     G II+D GT +T L +  Y+         + G  + +    +   C+  S
Sbjct: 309 YSLNRKSKKTGNIIIDSGTTLTLLDSGFYDDFGAVVEESVTGAKRVSDPQGILTHCFK-S 367

Query: 419 GLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTR 478
           G + + +PT+++HF  G  + L   N  + + S    C +  PT+  ++I GN+ Q    
Sbjct: 368 GDKEIGLPTITMHF-TGADVKLSPINSFVKL-SEDIVCLSMIPTTE-VAIYGNMVQMDFL 424

Query: 479 VSFDLANNRVGFTPNKC 495
           V +DL    V F    C
Sbjct: 425 VGYDLETKTVSFQRMDC 441


>gi|224065128|ref|XP_002301682.1| predicted protein [Populus trichocarpa]
 gi|222843408|gb|EEE80955.1| predicted protein [Populus trichocarpa]
          Length = 441

 Score =  163 bits (413), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 119/356 (33%), Positives = 165/356 (46%), Gaps = 31/356 (8%)

Query: 165 VGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCK---- 220
           +GTPP+   M+LDTGS ++W+QC            +FDP  SSS+S LPC  P CK    
Sbjct: 88  IGTPPQTQQMILDTGSQLSWIQCHKKVPRKPPPSSVFDPSLSSSFSVLPCNHPLCKPRIP 147

Query: 221 --SLDVSACRANR-CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFV 277
             +L  S C  NR C Y   Y DG+   G+LV E ++F  S S   + LGC  ++     
Sbjct: 148 DFTLPTS-CDQNRLCHYSYFYADGTLAEGNLVREKITFSRSQSTPPLILGCAEESS---- 202

Query: 278 GSAGLLGLGGGMLSLTKQIKATSLAYCLVDRD-----SP-ASGVLEFNSARGGDAVTAPL 331
            + G+LG+  G LS   Q K T  +YC+  R      +P  S  L  N   GG      L
Sbjct: 203 DAKGILGMNLGRLSFASQAKLTKFSYCVPTRQVRPGFTPTGSFYLGENPNSGGFRYINLL 262

Query: 332 I-----RNKKVDTF-YYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQT 385
                 R   +D   Y V + G  +G Q + IP S F  D +G G  ++D G+  T L  
Sbjct: 263 TFSQSQRMPNLDPLAYTVAMQGIRIGNQKLNIPISAFRPDPSGAGQTMIDSGSEFTYLVD 322

Query: 386 QAYNSLRDSFVRLAGNLKPTSGV--ALFDTCYDFSGLRSVR-VPTVSLHFGAGKALDLPA 442
           +AYN +R+  VRL G       V   + D C++ + +   R +  +   F  G  + +  
Sbjct: 323 EAYNKVREEVVRLVGARLKKGYVYGGVSDMCFNGNAIEIGRLIGNMVFEFDKGVEIVVEK 382

Query: 443 KNYLIPVDSAGTFCFAFAPTS---SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           +  L  V   G  C     +    +A +IIGN  QQ   V FDLAN RVGF    C
Sbjct: 383 ERVLADV-GGGVHCVGIGRSEMLGAASNIIGNFHQQNIWVEFDLANRRVGFGKADC 437


>gi|21450872|gb|AAK44106.2|AF370291_1 unknown protein [Arabidopsis thaliana]
          Length = 375

 Score =  163 bits (412), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 125/370 (33%), Positives = 177/370 (47%), Gaps = 22/370 (5%)

Query: 141 PEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPI 200
           P+  S PV SG     G Y  R  +GTPP+   MVLDT +D  WL C  C+ C   +   
Sbjct: 12  PKPTSVPVASGNQLHIGNYVVRAKLGTPPQLMFMVLDTSNDAVWLPCSGCSGC-SNASTS 70

Query: 201 FDPKTSSSYSPLPCAAPQCKSLDVSAC-----RANRCLYQVAYGDGSFTVGDLVTETVSF 255
           F+  +SS+YS + C+  QC       C     + + C +  +YG  S     LV +T++ 
Sbjct: 71  FNTNSSSTYSTVSCSTAQCTQARGLTCPSSSPQPSVCSFNQSYGGDSSFSASLVQDTLTL 130

Query: 256 GNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQ---IKATSLAYCLVD-RDSP 311
                +   + GC +   G  +   GL+GLG G +SL  Q   + +   +YCL   R   
Sbjct: 131 A-PDVIPNFSFGCINSASGNSLPPQGLMGLGRGPMSLVSQTTSLYSGVFSYCLPSFRSFY 189

Query: 312 ASGVLEFNSARGGDAVT-APLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDG 370
            SG L+        ++   PL+RN +  + YYV LTG SVG   V + P     D     
Sbjct: 190 FSGSLKLGLLGQPKSIRYTPLLRNPRRPSLYYVNLTGVSVGSVQVPVDPVYLTFDANSGA 249

Query: 371 GIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSL 430
           G I+D GT ITR     Y ++RD F R   N+   S +  FDTC  FS       P ++L
Sbjct: 250 GTIIDSGTVITRFAQPVYEAIRDEF-RKQVNVSSFSTLGAFDTC--FSADNENVAPKITL 306

Query: 431 HFGAGKALDLPAKNYLIPVDSAGTF-CFAFA----PTSSALSIIGNVQQQGTRVSFDLAN 485
           H      L LP +N LI   SAGT  C + A      ++ L++I N+QQQ  R+ FD+ N
Sbjct: 307 HM-TSLDLKLPMENTLI-HSSAGTLTCLSMAGIRQNANAVLNVIANLQQQNLRILFDVPN 364

Query: 486 NRVGFTPNKC 495
           +R+G  P  C
Sbjct: 365 SRIGIAPEPC 374


>gi|212722026|ref|NP_001131674.1| uncharacterized protein LOC100193034 precursor [Zea mays]
 gi|194692214|gb|ACF80191.1| unknown [Zea mays]
 gi|413946454|gb|AFW79103.1| hypothetical protein ZEAMMB73_752316 [Zea mays]
          Length = 441

 Score =  163 bits (412), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 123/377 (32%), Positives = 186/377 (49%), Gaps = 40/377 (10%)

Query: 145 STPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDP--IFD 202
           S P+ SGA  G+G+YF ++ VGTP ++F++V DTGS++ W++C         S P  +F 
Sbjct: 77  SLPMSSGAYAGTGQYFVKVLVGTPAQEFTLVADTGSELTWVKC-----AGGASPPGLVFR 131

Query: 203 PKTSSSYSPLPCAAPQCKSLDVSACRAN------RCLYQVAYGDGSF----TVG-DLVTE 251
           P+ S S++P+PC++  CK LDV    AN       C Y   Y +GS      VG D  T 
Sbjct: 132 PEASKSWAPVPCSSDTCK-LDVPFSLANCSSSASPCSYDYRYKEGSAGALGVVGTDSATI 190

Query: 252 TVSFGNSGSVKGIALGCGHDNEGL-FVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVD 307
            +  G    ++ + LGC   ++G  F    G+L LG   +S   +  A    S +YCLVD
Sbjct: 191 ALPGGKVAQLQDVVLGCSSTHDGQSFKSVDGVLSLGNAKISFASRAAARFGGSFSYCLVD 250

Query: 308 RDSP--ASGVLEFNSARGGDAVTAPLIRNK----KVDTFYYVGLTGFSVGGQAVQIPPSL 361
             +P  A+G L F     G     P  + K        FY V +    V GQA+ IP  +
Sbjct: 251 HLAPRNATGYLAFGP---GQVPRTPATQTKLFLDPAMPFYGVKVDAVHVAGQALDIPAEV 307

Query: 362 FEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLR 421
           ++      GG+I+D GT +T L T AY ++  +  +L   + P      F+ CY+++  R
Sbjct: 308 WDPKS---GGVILDSGTTLTVLATPAYKAVVAALTKLLAGV-PKVDFPPFEHCYNWTAPR 363

Query: 422 --SVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTS-SALSIIGNVQQQGTR 478
             +  +P +++ F     L+ PAK+Y+I V   G  C          +S+IGN+ QQ   
Sbjct: 364 PGAPEIPKLAVQFTGCARLEPPAKSYVIDVKP-GVKCIGLQEGEWPGVSVIGNIMQQEHL 422

Query: 479 VSFDLANNRVGFTPNKC 495
             FDL N  V F P+ C
Sbjct: 423 WEFDLKNMEVRFMPSTC 439


>gi|356555248|ref|XP_003545946.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 453

 Score =  162 bits (411), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 120/419 (28%), Positives = 190/419 (45%), Gaps = 57/419 (13%)

Query: 106 LERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGV 165
           ++RD  R   +  +  + + N D    K  E    P +   P+ SG     GEYF+ + V
Sbjct: 62  VKRDKLRRQRMNQRWGV-VSNYDSRR-KGFEMTTTPAEVEMPMHSGRDDALGEYFAEVKV 119

Query: 166 GTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCK----- 220
           G+P ++F +V+DTGS+  WL C                  S S+  + CA+ +CK     
Sbjct: 120 GSPGQRFWLVVDTGSEFTWLNC------------------SKSFEAVTCASRKCKVDLSE 161

Query: 221 --SLDVSACRANRCLYQVAYGDGSFTVG----DLVTETVSFGNSGSVKGIALGCGHDNEG 274
             SL V    ++ CLY ++Y DGS   G    D +T  ++ G  G +  + +GC    + 
Sbjct: 162 LFSLSVCPKPSDPCLYDISYADGSSAKGFFGTDSITVGLTNGKQGKLNNLTIGC---TKS 218

Query: 275 LFVG------SAGLLGLGGGMLSLTKQIK---ATSLAYCLVDRDSPASGVLEFNSARGGD 325
           +  G      + G+LGLG    S   +         +YCLVD  S  S  +  N   GG 
Sbjct: 219 MLNGVNFNEETGGILGLGFAKDSFIDKAANKYGAKFSYCLVDHLSHRS--VSSNLTIGGH 276

Query: 326 AVTAPLIRNKKVDT-----FYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAI 380
                L   ++ +      FY V + G S+GGQ ++IPP +++ +   +GG ++D GT +
Sbjct: 277 HNAKLLGEIRRTELILFPPFYGVNVVGISIGGQMLKIPPQVWDFN--AEGGTLIDSGTTL 334

Query: 381 TRLQTQAYNSLRDSFVRLAGNLKPTSGVAL--FDTCYDFSGLRSVRVPTVSLHFGAGKAL 438
           T L   AY ++ ++  +    +K  +G      + C+D  G     VP +  HF  G   
Sbjct: 335 TSLLLPAYEAVFEALTKSLTKVKRVTGEDFDALEFCFDAEGFDDSVVPRLVFHFAGGARF 394

Query: 439 DLPAKNYLIPVDSAGTFCFAFAPTS--SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           + P K+Y+I V +    C    P       S+IGN+ QQ     FDL+ N VGF P+ C
Sbjct: 395 EPPVKSYIIDV-APLVKCIGIVPIDGIGGASVIGNIMQQNHLWEFDLSTNTVGFAPSTC 452


>gi|115489316|ref|NP_001067145.1| Os12g0583300 [Oryza sativa Japonica Group]
 gi|77556903|gb|ABA99699.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113649652|dbj|BAF30164.1| Os12g0583300 [Oryza sativa Japonica Group]
 gi|125537189|gb|EAY83677.1| hypothetical protein OsI_38901 [Oryza sativa Indica Group]
          Length = 446

 Score =  162 bits (411), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 122/359 (33%), Positives = 178/359 (49%), Gaps = 25/359 (6%)

Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTE--CYQQSDPIFDPKTSSSYSPLPCA 215
           +Y +   +G PP++   ++DTGSD+ W QC  C    C +Q+ P ++   SS+++P+PCA
Sbjct: 89  QYVAEYLIGDPPQRAEALIDTGSDLVWTQCSTCLRKVCARQALPYYNSSASSTFAPVPCA 148

Query: 216 APQCKSLD--VSACR-ANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGC---G 269
           A  C + D  +  C  A  C     YG G    G L TE  +F  SG+ + +A GC    
Sbjct: 149 ARICAANDDIIHFCDLAAGCSVIAGYGAG-VVAGTLGTEAFAF-QSGTAE-LAFGCVTFT 205

Query: 270 HDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVD--RDSPASGVLEFNSARG---- 323
              +G   G++GL+GLG G LSL  Q  AT  +YCL     ++ A+G L   ++      
Sbjct: 206 RIVQGALHGASGLIGLGRGRLSLVSQTGATKFSYCLTPYFHNNGATGHLFVGASASLGGH 265

Query: 324 GDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAG----DGGIIVDCGTA 379
           GD +T   ++  K   FYY+ L G +VG   + IP ++F++ E       GG+I+D G+ 
Sbjct: 266 GDVMTTQFVKGPKGSPFYYLPLIGLTVGETRLPIPATVFDLREVAPGLFSGGVIIDSGSP 325

Query: 380 ITRLQTQAYNSLRDSF-VRLAGNL-KPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKA 437
            T L   AY++L      RL G+L  P         C     +  V VP V  HF  G  
Sbjct: 326 FTSLVHDAYDALASELAARLNGSLVAPPPDADDGALCVARRDVGRV-VPAVVFHFRGGAD 384

Query: 438 LDLPAKNYLIPVDSAGTFCFAFAPTS-SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           + +PA++Y  PVD A       +       S+IGN QQQ  RV +DLAN    F P  C
Sbjct: 385 MAVPAESYWAPVDKAAACMAIASAGPYRRQSVIGNYQQQNMRVLYDLANGDFSFQPADC 443


>gi|125527257|gb|EAY75371.1| hypothetical protein OsI_03267 [Oryza sativa Indica Group]
          Length = 484

 Score =  162 bits (410), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 125/420 (29%), Positives = 200/420 (47%), Gaps = 77/420 (18%)

Query: 144 FSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCR--------------- 188
           F+ P+ SGA  G+G+YF R  VGTP + F +V DTGSD+ W++C                
Sbjct: 72  FAMPLSSGAYTGTGQYFVRFRVGTPAQPFLLVADTGSDLTWVKCHRAAAAASASPRNASS 131

Query: 189 ---PCTECYQQSDPIFDPKTSSSYSPLPCAAPQCK-----SLDVSACRANRCLYQVAYGD 240
              P     +++   F P  S +++P+PC++  C+     SL   A  AN C Y   Y D
Sbjct: 132 LPAPAPASPRRT---FRPDKSRTWAPIPCSSATCRESLPFSLAACATPANPCAYDYRYKD 188

Query: 241 GSFTVGDLVTETVSFGNSG------SVKGIALGCGHDNEGL-FVGSAGLLGLGGGMLSLT 293
           GS   G +  ++ +   SG       ++G+ LGC     G  F+ S G+L LG   +S  
Sbjct: 189 GSAARGTVGVDSATIALSGRAARKAKLRGVVLGCTTSYNGQSFLASDGVLSLGYSNISFA 248

Query: 294 KQIKAT---SLAYCLVDRDSP--ASGVL------EFNSARGGDAVTA------------- 329
            +  +      +YCLVD  +P  A+  L       F+S R  + + +             
Sbjct: 249 SRAASRFGGRFSYCLVDHLAPRNATSYLTFGPNPAFSSRRPSEGIASCKPAPAPTPAPAG 308

Query: 330 -------PLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITR 382
                  PL+ + +   FY V + G SV G+ ++IP +++++++   GG I+D GT++T 
Sbjct: 309 APGARQTPLVLDHRTRPFYAVTVKGVSVAGELLKIPRAVWDVEQG--GGAILDSGTSLTM 366

Query: 383 LQTQAYNSLRDSF-VRLAGNLKPTSGVALFDTCYDFSGLRSVRV----PTVSLHFGAGKA 437
           L   AY ++  +   RLAG   P   +  FD CY+++      V    P +++HF     
Sbjct: 367 LAKPAYRAVVAALSKRLAG--LPRVTMDPFDYCYNWTSPSGSDVAAPLPMLAVHFAGSAR 424

Query: 438 LDLPAKNYLIPVDSA-GTFCFAFAP-TSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           L+ PAK+Y+I  D+A G  C          LS+IGN+ QQ     +DL N R+ F  ++C
Sbjct: 425 LEPPAKSYVI--DAAPGVKCIGLQEGPWPGLSVIGNILQQEHLWEYDLKNRRLRFKRSRC 482


>gi|21717175|gb|AAM76368.1|AC074196_26 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433292|gb|AAP54830.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
          Length = 418

 Score =  162 bits (409), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 108/354 (30%), Positives = 163/354 (46%), Gaps = 36/354 (10%)

Query: 165 VGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDV 224
           +GTPP+  S ++D   ++ W QC  C+ C++Q  P+F P  SS++ P PC    CKS+  
Sbjct: 73  IGTPPQPASAIIDVAGELVWTQCSMCSRCFKQDLPLFVPNASSTFRPEPCGTDACKSIPT 132

Query: 225 SACRANRCLYQVAYGD--GSFTVGDLVTETVSFGNSGSVKGIALGC----GHDNEGLFVG 278
           S C +N C Y+       G  T+G + T+T + G   +   +  GC    G D  G   G
Sbjct: 133 SNCSSNMCTYEGTINSKLGGHTLGIVATDTFAIGT--ATASLGFGCVVASGIDTMG---G 187

Query: 279 SAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNS----ARGGDAVTAPLIRN 334
            +GL+GLG    SL  Q+  T  +YCL   DS  +  L   S    A GG++ T P ++ 
Sbjct: 188 PSGLIGLGRAPSSLVSQMNITKFSYCLTPHDSGKNSRLLLGSSAKLAGGGNSTTTPFVKT 247

Query: 335 KKVD---TFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSL 391
              D    +Y + L G   G  A+ +PPS           ++V     ++ L   AY +L
Sbjct: 248 SPGDDMSQYYPIQLDGIKAGDAAIALPPS--------GNTVLVQTLAPMSFLVDSAYQAL 299

Query: 392 RDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAG-KALDLPAKNYLIPV- 449
           +    +  G     + +  FD C+  +GL +   P +   F  G  AL +P   YLI V 
Sbjct: 300 KKEVTKAVGAAPTATPLQPFDLCFPKAGLSNASAPDLVFTFQQGAAALTVPPPKYLIDVG 359

Query: 450 DSAGTFCFAFAPTS--------SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           +  GT C A   TS          L+I+G++QQ+ T    DL    + F P  C
Sbjct: 360 EEKGTVCMAILSTSWLNTTALDENLNILGSLQQENTHFLLDLEKKTLSFEPADC 413


>gi|357481191|ref|XP_003610881.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355512216|gb|AES93839.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 427

 Score =  161 bits (408), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 116/358 (32%), Positives = 170/358 (47%), Gaps = 22/358 (6%)

Query: 152 ASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSP 211
            +  +G+Y  ++ +G+PP     ++DTGSD+ W QC PC  CY+Q  P+F+P  S +YSP
Sbjct: 75  VTSNNGDYLMKLTLGSPPVDIYGLVDTGSDLVWAQCTPCGGCYRQKSPMFEPLRSKTYSP 134

Query: 212 LPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSF----GNSGSVKGIALG 267
           +PC + QC     S      C Y  +Y D S T G L  E ++F    G+   V  I  G
Sbjct: 135 IPCESEQCSFFGYSCSPQKMCAYSYSYADSSVTKGVLAREAITFSSTDGDPVVVGDIIFG 194

Query: 268 CGHDNEGLF-VGSAGLLGLGGGMLSLTKQI----KATSLAYCLV--DRDSPASGVLEF-- 318
           CGH N G F     G++G+GGG LSL  QI     +   + CLV    D+  SG + F  
Sbjct: 195 CGHSNSGTFNENDMGIIGMGGGPLSLVSQIGTLYGSKRFSQCLVPFHTDAHTSGTINFGE 254

Query: 319 -NSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCG 377
            +   G   VT PL  +++  T Y V L G SVG   V+   S    +    G I++D G
Sbjct: 255 ESDVSGEGVVTTPLA-SEEGQTSYLVTLEGISVGDTFVRFNSS----ETLSKGNIMIDSG 309

Query: 378 TAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKA 437
           T  T +  + Y  L +  +++  +L P        T   +    ++  P ++ HF     
Sbjct: 310 TPATYIPQEFYERLVEE-LKVQSSLLPIEDDPDLGTQLCYRSETNLEGPILTAHFEGADV 368

Query: 438 LDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
             LP + ++ P D  G FCFA A ++    I GN  Q    + FDL    + F P  C
Sbjct: 369 QLLPIQTFIPPKD--GVFCFAMAGSTDGDYIFGNFAQSNILMGFDLDRKTISFKPTDC 424


>gi|224079535|ref|XP_002305886.1| predicted protein [Populus trichocarpa]
 gi|222848850|gb|EEE86397.1| predicted protein [Populus trichocarpa]
          Length = 436

 Score =  161 bits (408), Expect = 7e-37,   Method: Compositional matrix adjust.
 Identities = 123/358 (34%), Positives = 166/358 (46%), Gaps = 35/358 (9%)

Query: 165 VGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCK---- 220
           +GTPP+   M+LDTGS ++W+QC            +FDP  SSS+S LPC  P CK    
Sbjct: 83  IGTPPQSQQMILDTGSQLSWIQCHKKVPRKPPPSTVFDPSLSSSFSVLPCNHPLCKPRIP 142

Query: 221 --SLDVSACRANR-CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFV 277
             +L  S C  NR C Y   Y DG+   G+LV E ++F  S S   + LGC  D      
Sbjct: 143 DFTLPTS-CDLNRLCHYSYFYADGTLAEGNLVREKITFSTSQSTPPLILGCAEDAS---- 197

Query: 278 GSAGLLGLGGGMLSLTKQIKATSLAYCLVDRD-----SP-ASGVLEFNSARGGDAVTAPL 331
              G+LG+  G LS   Q K T  +YC+  R      +P  S  L  N    G    + L
Sbjct: 198 DDKGILGMNLGRLSFASQAKITKFSYCVPTRQVRPGFTPTGSFYLGENPNSAGFQYISLL 257

Query: 332 I-----RNKKVDTF-YYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQT 385
                 R   +D   + V L G  +G + + IP S F  D +G G  ++D G+  T L  
Sbjct: 258 TFSQSQRMPNLDPLAHTVALQGIRIGNKKLNIPVSAFRADPSGAGQSMIDSGSEFTYLVD 317

Query: 386 QAYNSLRDSFVRLAG-NLKP---TSGVALFDTCYDFSGLRSVR-VPTVSLHFGAGKALDL 440
            AYN +R+  VRLAG  LK     SGV+  D C+D + +   R +  +   F  G  + +
Sbjct: 318 VAYNKVREEVVRLAGPRLKKGYVYSGVS--DMCFDGNAMEIGRLIGNMVFEFDKGVEIVI 375

Query: 441 PAKNYLIPVDSAGTFCFAFAPTS---SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
                L  V   G  C     +    +A +IIGN  QQ   V FD+AN RVGF    C
Sbjct: 376 EKGRVLADV-GGGVHCVGIGRSEMLGAASNIIGNFHQQNLWVEFDIANRRVGFGKADC 432


>gi|413951280|gb|AFW83929.1| hypothetical protein ZEAMMB73_279135 [Zea mays]
          Length = 451

 Score =  161 bits (408), Expect = 7e-37,   Method: Compositional matrix adjust.
 Identities = 116/355 (32%), Positives = 164/355 (46%), Gaps = 28/355 (7%)

Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAP 217
            Y +R  +GTP +   + +D  +D  W+ C           P FDP  SS+Y P+ C AP
Sbjct: 106 SYVARARLGTPAQALLVAIDPSNDAAWVPCA--ACAGCARAPSFDPTRSSTYRPVRCGAP 163

Query: 218 QCKSLDVSACRA---NRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEG 274
           QC      +C     + C + ++Y   +F             +  +V     GC H   G
Sbjct: 164 QCSQAPAPSCPGGLGSSCAFNLSYAASTFQALLGQDALALHDDVDAVAAYTFGCLHVVTG 223

Query: 275 LFVGSAGLLGLGGGMLSL---TKQIKATSLAYCLVD-RDSPASGVLEFNSARGGDAV-TA 329
             V   GL+G G G LS    TK +  +  +YCL   + S  SG L    A     + T 
Sbjct: 224 GSVPPQGLVGFGRGPLSFPSQTKDVYGSVFSYCLPSYKSSNFSGTLRLGPAGQPKRIKTT 283

Query: 330 PLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYN 389
           PL+ N    + YYV + G  VGG+ V +P S    D     G IVD GT  TRL    Y 
Sbjct: 284 PLLSNPHRPSLYYVNMVGIRVGGRPVPVPASALAFDPTSGRGTIVDAGTMFTRLSAPVYA 343

Query: 390 SLRDSF---VRLAGNLKPTSG-VALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNY 445
           ++RD F   VR      P +G +  FDTCY+     ++ VPTV+  F    ++ LP +N 
Sbjct: 344 AVRDVFRSRVR-----APVAGPLGGFDTCYNV----TISVPTVTFSFDGRVSVTLPEENV 394

Query: 446 LIPVDSAGTFCFAFAP-----TSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           +I   S G  C A A        +AL+++ ++QQQ  RV FD+AN RVGF+   C
Sbjct: 395 VIRSSSGGIACLAMAAGPPDGVDAALNVLASMQQQNHRVLFDVANGRVGFSRELC 449


>gi|356495752|ref|XP_003516737.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 396

 Score =  161 bits (407), Expect = 8e-37,   Method: Compositional matrix adjust.
 Identities = 123/365 (33%), Positives = 173/365 (47%), Gaps = 21/365 (5%)

Query: 145 STPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPK 204
           S  V +  +  +G+Y  ++ +GTPP     ++DTGSD+ W QC PC  CY+Q  P+F+P 
Sbjct: 36  SNGVFTRVTSNNGDYLMKLTLGTPPVDVYGLVDTGSDLVWAQCTPCQGCYRQKSPMFEPL 95

Query: 205 TSSSYSPLPCAAPQCKSLDVSACRANR-CLYQVAYGDGSFTVGDLVTETVSF----GNSG 259
            S++Y+P+PC + +C SL   +C   + C Y  AY D S T G L  ETV+F    G   
Sbjct: 96  RSNTYTPIPCDSEECNSLFGHSCSPQKLCAYSYAYADSSVTKGVLARETVTFSSTDGEPV 155

Query: 260 SVKGIALGCGHDNEGLF-VGSAGLLGLGGGMLSLTKQI----KATSLAYCLV--DRDSPA 312
            V  I  GCGH N G F     G++GLGGG LSL  Q      +   + CLV    D   
Sbjct: 156 VVGDIVFGCGHSNSGTFNENDMGIIGLGGGPLSLVSQFGNLYGSKRFSQCLVPFHADPHT 215

Query: 313 SGVLEFNSAR--GGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDG 370
            G + F  A    G+ V A  + +++  T Y V L G SVG   V    S  EM     G
Sbjct: 216 LGTISFGDASDVSGEGVAATPLVSEEGQTPYLVTLEGISVGDTFVSFNSS--EM--LSKG 271

Query: 371 GIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSL 430
            I++D GT  T L  + Y+ L    +++  N+ P        T   +    ++  P +  
Sbjct: 272 NIMIDSGTPATYLPQEFYDRLVKE-LKVQSNMLPIDDDPDLGTQLCYRSETNLEGPILIA 330

Query: 431 HFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGF 490
           HF       +P + ++ P D  G FCFA A T+    I GN  Q    + FDL    V F
Sbjct: 331 HFEGADVQLMPIQTFIPPKD--GVFCFAMAGTTDGEYIFGNFAQSNVLIGFDLDRKTVSF 388

Query: 491 TPNKC 495
               C
Sbjct: 389 KATDC 393


>gi|242051593|ref|XP_002454942.1| hypothetical protein SORBIDRAFT_03g001790 [Sorghum bicolor]
 gi|241926917|gb|EES00062.1| hypothetical protein SORBIDRAFT_03g001790 [Sorghum bicolor]
          Length = 431

 Score =  161 bits (407), Expect = 9e-37,   Method: Compositional matrix adjust.
 Identities = 110/389 (28%), Positives = 184/389 (47%), Gaps = 26/389 (6%)

Query: 122 LAIYNVDRHELKPAEAQI------LPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMV 175
           L ++++ R   + ++A++      L  D S P+   + +G   Y   IG+GTPP+  +++
Sbjct: 51  LPVHDMWRRSARASKARVARLEARLTGDMSVPLARISDEG---YTVTIGIGTPPQLHTLI 107

Query: 176 LDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLD--VSACRANRCL 233
            DT SD+ W QC    +  +Q +P+FDP  SSS++ + C++  C   +     C    C 
Sbjct: 108 ADTASDLTWTQCNLFNDTAKQVEPLFDPAKSSSFAFVTCSSKLCTEDNPGTKRCSNKTCR 167

Query: 234 YQVAYGDGSFTVGDLVTE--TVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLS 291
           Y   Y       G L  E  T+S  N         GCG   +G  +G++G+LG+   +LS
Sbjct: 168 YVYPYVSVE-AAGVLAYESFTLSDNNQHICMSFGFGCGALTDGNLLGASGILGMSPAILS 226

Query: 292 LTKQIKATSLAYCLVDRDSPASGVLEFNSAR--GGDAVTAPLIRNKKVDTFYYVGLTGFS 349
           +  Q+     +YCL       S  L F +    G    T P+   K +  +YYV L G S
Sbjct: 227 MVSQLAIPKFSYCLTPYTDRKSSPLFFGAWADLGRYKTTGPI--QKSLTFYYYVPLVGLS 284

Query: 350 VGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVA 409
           +G + + +P + F + +   GG +VD G  + +L   A+ +L+++ +           V 
Sbjct: 285 LGTRRLDVPAATFALKQ---GGTVVDLGCTVGQLAEPAFTALKEAVLHTLNLPLTNRTVK 341

Query: 410 LFDTCYDFS---GLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSAL 466
            +  C+       + +V+ P + L+F  G  + LP  NY     +AG  C A  P    +
Sbjct: 342 DYKVCFALPSGVAMGAVQTPPLVLYFDGGADMVLPRDNYF-QEPTAGLMCLALVP-GGGM 399

Query: 467 SIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           SIIGNVQQQ   + FD+ +++  F P  C
Sbjct: 400 SIIGNVQQQNFHLLFDVHDSKFLFAPTIC 428


>gi|255565759|ref|XP_002523869.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223536957|gb|EEF38595.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 447

 Score =  161 bits (407), Expect = 9e-37,   Method: Compositional matrix adjust.
 Identities = 119/400 (29%), Positives = 173/400 (43%), Gaps = 48/400 (12%)

Query: 135 AEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQC---RPCT 191
           A A  L    +TPV S      G Y   +  GTPP+  S V+DTGS   W  C     C 
Sbjct: 56  ARAHHLKNPQTTPVFS---HSYGGYSISLSFGTPPQTLSFVMDTGSSFVWFPCTLRYLCN 112

Query: 192 EC-YQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVT 250
            C +      F PK SSS   + C  P+C  +  +  R   C       + S     +  
Sbjct: 113 NCSFTSRISPFLPKHSSSSKIIGCKNPKCSWIHQTDLRCTDC------DNNSRNCSQICP 166

Query: 251 ETVSFGNSGSVKGIALGCGHDNEGLFVGS-------------AGLLGLGGGMLSLTKQIK 297
             +    SG+  G+AL       GL V +             AG+ G G G  SL  Q+ 
Sbjct: 167 PYLILYGSGTTGGVALSETLHLHGLIVPNFLVGCSVFSSRQPAGIAGFGRGPSSLPSQLG 226

Query: 298 ATSLAYCLVDR---DSPASGVLEFNSARGGDAVTA-----PLIRNKKVD------TFYYV 343
            T  +YCL+     D+  S  L  +S    D  TA     PL++N KV        +YYV
Sbjct: 227 LTKFSYCLLSHKFDDTQESSSLVLDSQSDSDKKTAALMYTPLVKNPKVQDKPAFSVYYYV 286

Query: 344 GLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLK 403
            L   S+GG++V+IP      D+ G+GG I+D GT  T + T+A+  L + F+    N +
Sbjct: 287 SLRRISIGGRSVKIPYKYLSPDKDGNGGTIIDSGTTFTYMSTEAFEILSNEFISQVKNYE 346

Query: 404 P---TSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFA 460
                  ++    C++ SG + + +P + LHF  G  ++LP +NY   + S    CF   
Sbjct: 347 RALMVEALSGLKPCFNVSGAKELELPQLRLHFKGGADVELPLENYFAFLGSREVACFTVV 406

Query: 461 PTSSALS-----IIGNVQQQGTRVSFDLANNRVGFTPNKC 495
              +  +     I+GN Q Q   V +DL N R+GF    C
Sbjct: 407 TDGAEKASGPGMILGNFQMQNFYVEYDLQNERLGFKKESC 446


>gi|388498308|gb|AFK37220.1| unknown [Lotus japonicus]
          Length = 363

 Score =  160 bits (406), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 110/352 (31%), Positives = 165/352 (46%), Gaps = 45/352 (12%)

Query: 15  ILFSFCLFTSASSRGLSETATTVLDVSSALQQTEHILSFEPETLEPFAEESETAAESFPL 74
           +L S CL  +       E     L +    QQ   +    PE+ +               
Sbjct: 28  LLVSLCLIIANGVSSFEEKKVFNLQILQRKQQLGSLGCLHPESRQ--------------- 72

Query: 75  NSSSSFSLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKP 134
               +  L +  R    K + N +R L  ++L  D   V ++  +L+     V  H ++ 
Sbjct: 73  -EKGAIMLEMKDRSYCSKKKVNWHRKL-HNQLTLDDLHVRSMQNRLRKM---VSSHSVEV 127

Query: 135 AEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECY 194
           ++ QI       P+ SG +  +  Y   + +G   +  ++++DTGSD+ W+QC PC  CY
Sbjct: 128 SQIQI-------PLASGVNFQTLNYIVTMELGG--QDMTVIIDTGSDLTWVQCEPCMSCY 178

Query: 195 QQSDPIFDPKTSSSYSPLPCAAPQCKSLDVS-----ACRAN--RCLYQVAYGDGSFTVGD 247
            Q  P+F P TSSSY  +PC +  C+SL ++     AC +N   C Y V YGDGS+T G+
Sbjct: 179 NQQGPVFKPSTSSSYQSIPCNSSTCQSLQLTTGNAGACESNPSNCSYAVNYGDGSYTNGE 238

Query: 248 LVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYC 304
           L  E +SFG   SV     GCG +N+GLF G +GL+GLG   LSL  Q  +T     +YC
Sbjct: 239 LGAEHLSFGGI-SVSNFVFGCGKNNKGLFGGVSGLMGLGRSNLSLISQTNSTFGGVFSYC 297

Query: 305 LVDRDSPASGVLEFNSARGGDAVTAP-----LIRNKKVDTFYYVGLTGFSVG 351
           L   D+ ASG L   +         P     ++ N ++  FY + LTG  VG
Sbjct: 298 LPPTDAGASGSLAMGNESSVFKNLTPIAYTRMVPNPQLSNFYMLNLTGIDVG 349


>gi|357130715|ref|XP_003566992.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 479

 Score =  160 bits (405), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 123/399 (30%), Positives = 190/399 (47%), Gaps = 55/399 (13%)

Query: 147 PVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDP------- 199
           P+ S A  G G+YF R  VGTP + F +V DTGSD+ W++CRP       ++        
Sbjct: 83  PLTSAAYTGIGQYFVRFRVGTPAQPFLLVADTGSDLTWVKCRPAKAAAASTNSSSSASAS 142

Query: 200 ----IFDPKTSSSYSPLPCAAPQC-KSL--DVSAC--RANRCLYQVAYGDGSFTVGDLVT 250
                F P+ S +++P+PCA+  C KSL   +S C    + C Y   Y DGS   G + T
Sbjct: 143 SPRRAFRPEKSKTWAPIPCASDTCSKSLPFSLSTCPTPGSPCAYDYRYKDGSAARGTVGT 202

Query: 251 ETVSFG------------NSGSVKGIALGCGHDNEGL-FVGSAGLLGLGGGMLSLTKQIK 297
           E+ +                  ++G+ LGC     G  F  S G+L LG   +S      
Sbjct: 203 ESATIALSSSSSSSKNKVKKAKLQGLVLGCTGSYTGPSFEASDGVLSLGYSNVSFASHAA 262

Query: 298 AT---SLAYCLVDRDSP--ASGVLEFN----------SARGGDAVTAPLIRNKKVDTFYY 342
           +      +YCLVD  SP  A+  L F           +A G  A   PL+ + ++  FY 
Sbjct: 263 SRFGGRFSYCLVDHLSPRNATSYLTFGPNSALSGPCPAAAGPGARQTPLVLDSRMRPFYD 322

Query: 343 VGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNL 402
           V +   SV G+ ++IP  ++E+D  G GG+IVD GT++T L   AY ++  +  +     
Sbjct: 323 VSIKAISVDGELLKIPRDVWEVD--GGGGVIVDSGTSLTVLAKPAYRAVVAALGKKLARF 380

Query: 403 KPTSGVALFDTCYDFSGL----RSVRVPTVSLHFGAGKALDLPAKNYLIPVDSA-GTFCF 457
            P   +  F+ CY+++          +P +++HF     L+ P+K+Y+I  D+A G  C 
Sbjct: 381 -PRVAMDPFEYCYNWTSPSRKDEGDDLPKLAVHFAGSARLEPPSKSYVI--DAAPGVKCI 437

Query: 458 AFAP-TSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
                    +S+IGN+ QQ     FDL N R+ F  ++C
Sbjct: 438 GVQEGPWPGISVIGNILQQEHLWEFDLKNRRLRFKRSRC 476


>gi|125524351|gb|EAY72465.1| hypothetical protein OsI_00321 [Oryza sativa Indica Group]
          Length = 343

 Score =  160 bits (405), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 93/211 (44%), Positives = 124/211 (58%), Gaps = 9/211 (4%)

Query: 34  ATTVLDVSSALQQTEHILSFEPETLEPFAEESETAAESFPLNSSSSFSLPLHSREILH-- 91
           AT  LDV+++L +    +S E   L   A  + +       +     +L LHSR+ L   
Sbjct: 35  ATETLDVAASLSRARAAVSAEAVPLHQSAAAAVSTEVVGEEHEEGRLALRLHSRDFLPEE 94

Query: 92  --KTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEA---QILPEDFST 146
             + RH  YRSLVL+RL RDSAR   +  +  +A   V R +L PA     +    +   
Sbjct: 95  QGRQRHASYRSLVLARLRRDSARAAAVSARAAMAADGVSRFDLVPANVTAFEASAAEIQG 154

Query: 147 PVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTS 206
           PVVSG   GSGEYFSR+GVG+P RQ  MVLDTGSD+ W+QC+PC +CYQQSDP+FDP  S
Sbjct: 155 PVVSGVGLGSGEYFSRVGVGSPARQLYMVLDTGSDVTWVQCQPCADCYQQSDPVFDPSLS 214

Query: 207 SSYSPLPCAAPQCKSLDVSACR--ANRCLYQ 235
           +SY+ + C  P+C  LD +ACR     CLY+
Sbjct: 215 TSYASVACDNPRCHDLDAAACRNSTGACLYE 245


>gi|449461377|ref|XP_004148418.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
 gi|449518059|ref|XP_004166061.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
          Length = 436

 Score =  160 bits (405), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 124/388 (31%), Positives = 178/388 (45%), Gaps = 28/388 (7%)

Query: 127 VDRHELKPAEAQILP-----EDFSTPVVSGAS-QGSGEYFSRIGVGTPPRQFSMVLDTGS 180
           +D     PA  + L      +  + P+ SG      G Y  R+ +GTP +   MVLDT +
Sbjct: 57  IDMASKDPARIRYLSSLTAQKTVAAPIASGQQVLNVGNYVVRVQLGTPGQTMYMVLDTSN 116

Query: 181 DINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRAN---RCLYQVA 237
           D  W  C  C  C   S   F  + SS+++ L C+ P+C      +C       CL+   
Sbjct: 117 DAAWAPCSGCIGC--SSTTTFSAQNSSTFATLDCSKPECTQARGLSCPTTGNVDCLFNQT 174

Query: 238 YGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQ-- 295
           YG  S     LV +++  G    +   + GC     G  +   GL+GLG G LSL  Q  
Sbjct: 175 YGGDSTFSATLVQDSLHLG-PNVIPNFSFGCISSASGSSIPPQGLMGLGRGPLSLISQSG 233

Query: 296 -IKATSLAYCLVDRDSPA-SGVLEFNSARGGDAV-TAPLIRNKKVDTFYYVGLTGFSVGG 352
            + +   +YCL    S   SG L+        A+ T PL+ N    + YYV LTG SVG 
Sbjct: 234 SLYSGLFSYCLPSFKSYYFSGSLKLGPVGQPKAIRTTPLLHNPHRPSLYYVNLTGISVGR 293

Query: 353 QAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVR-LAGNLKPTSGVALF 411
             V I P L   D     G I+D GT ITR     Y ++RD F + + G+  P   +  F
Sbjct: 294 VLVPISPELLAFDPNTGAGTIIDSGTVITRFVPAIYTAVRDEFRKQVGGSFSP---LGAF 350

Query: 412 DTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPT----SSALS 467
           DTC  F+    V  P ++LH  +G  L LP +N LI   +    C A A      +S ++
Sbjct: 351 DTC--FATNNEVSAPAITLHL-SGLDLKLPMENSLIHSSAGSLACLAMAAAPNNVNSVVN 407

Query: 468 IIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           +I N+QQQ  R+ FD+ N+++G     C
Sbjct: 408 VIANLQQQNHRILFDINNSKLGIARELC 435


>gi|357500973|ref|XP_003620775.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|357500991|ref|XP_003620784.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355495790|gb|AES76993.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355495799|gb|AES77002.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 438

 Score =  160 bits (405), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 137/447 (30%), Positives = 201/447 (44%), Gaps = 53/447 (11%)

Query: 71  SFPLNSSSSFSLPLHSREI----LHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYN 126
           SF    S+ FS+ L  R+      +K   N Y+ +V   + R   RVN            
Sbjct: 19  SFSQAVSNGFSIELIHRDSSKSPFYKPTQNKYQHVV-DAVHRSINRVN------------ 65

Query: 127 VDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQ 186
              H  K + A       STP  +  S   G+Y     VGTPP +   ++DTGSDI WLQ
Sbjct: 66  ---HSNKNSLA-------STPESTVISY-EGDYIMSYSVGTPPIKSYGIVDTGSDIVWLQ 114

Query: 187 CRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANR-CLYQVAYGDGSFTV 245
           C PC +CY Q+ P F+P  SSSY  + C++  C+S+  ++C   + C Y + YG+ S + 
Sbjct: 115 CEPCEQCYNQTTPKFNPSKSSSYKNISCSSKLCQSVRDTSCNDKKNCEYSINYGNQSHSQ 174

Query: 246 GDLVTETVSF----GNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGM-LSLTKQIKAT- 299
           GDL  ET++     G   S     +GCG +N G F   +  +   GG   SL  Q+  + 
Sbjct: 175 GDLSLETLTLESTTGRPVSFPKTVIGCGTNNIGSFKRVSSGVVGLGGGPASLITQLGPSI 234

Query: 300 --SLAYCLVDRD------SPASGVLEFNS---ARGGDAVTAPLIRNKKVDTFYYVGLTGF 348
               +YCLV         S  S  L F       G + ++ P+++ K    FYY+ +  F
Sbjct: 235 GGKFSYCLVRMSITLKNMSMGSSKLNFGDVAIVSGHNVLSTPIVK-KDHSFFYYLTIEAF 293

Query: 349 SVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGV 408
           SVG + V+   S   ++E   G II+D  T +T + +  Y  L  + V L    +     
Sbjct: 294 SVGDKRVEFAGSSKGVEE---GNIIIDSSTIVTFVPSDVYTKLNSAIVDLVTLERVDDPN 350

Query: 409 ALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSI 468
             F  CY+ S       P ++ HF     L L A N  + V +    CFAFAP++   +I
Sbjct: 351 QQFSLCYNVSSDEEYDFPYMTAHFKGADIL-LYATNTFVEV-ARDVLCFAFAPSNGG-AI 407

Query: 469 IGNVQQQGTRVSFDLANNRVGFTPNKC 495
            G+  QQ   V +DL    V F    C
Sbjct: 408 FGSFSQQDFMVGYDLQQKTVSFKSVDC 434


>gi|242046812|ref|XP_002461152.1| hypothetical protein SORBIDRAFT_02g041760 [Sorghum bicolor]
 gi|241924529|gb|EER97673.1| hypothetical protein SORBIDRAFT_02g041760 [Sorghum bicolor]
          Length = 452

 Score =  160 bits (405), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 117/350 (33%), Positives = 163/350 (46%), Gaps = 23/350 (6%)

Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQ 218
           Y  R  +GTPP+Q  + +DT +D  W+ C  C  C   S P FDP  S+SY  +PC +P 
Sbjct: 110 YVVRARLGTPPQQLLLAVDTSNDAAWIPCAGCAGCPTSSAPPFDPAASTSYRSVPCGSPL 169

Query: 219 CKSLDVSACR--ANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLF 276
           C     +AC      C + + Y D S     L  ++++     +VK    GC     G  
Sbjct: 170 CAQAPNAACPPGGKACGFSLTYADSSLQAA-LSQDSLAVAGD-AVKTYTFGCLQKATGTA 227

Query: 277 V---GSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPA-SGVLEFNSARGGDA---VTA 329
               G  GL       LS T+ +   + +YCL    S   SG L     R G      T 
Sbjct: 228 APPQGLLGLGRGPLSFLSQTRDMYQGTFSYCLPSFKSLNFSGTLRLG--RNGQPPRIKTT 285

Query: 330 PLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYN 389
           PL+ N    + YYV +TG  VG + V IPP     D A   G ++D GT  TRL   AY 
Sbjct: 286 PLLANPHRSSLYYVNMTGIRVGRKVVPIPPPALAFDPATGAGTVLDSGTMFTRLVAPAYV 345

Query: 390 SLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPV 449
           ++RD   R  G   P S +  FDTC++ +   +V  P V+L F  G  + LP +N +I  
Sbjct: 346 AVRDEVRRRVG--APVSSLGGFDTCFNTT---AVAWPPVTLLFD-GMQVTLPEENVVIHS 399

Query: 450 DSAGTFCFAFAP----TSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
                 C A A      ++ L++I ++QQQ  RV FD+ N RVGF   +C
Sbjct: 400 TYGTISCLAMAAAPDGVNTVLNVIASMQQQNHRVLFDVPNGRVGFARERC 449


>gi|300681506|emb|CBH32600.1| pepsin A, putative, expressed [Triticum aestivum]
          Length = 477

 Score =  160 bits (405), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 127/396 (32%), Positives = 192/396 (48%), Gaps = 52/396 (13%)

Query: 147 PVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDP------- 199
           P+ SGA  G G+YF R  VGTP + F +V DTGSD+ W++CR          P       
Sbjct: 85  PLTSGAYTGIGQYFVRFRVGTPAQPFLLVADTGSDLTWVKCRRPASANSSLSPADSGPGP 144

Query: 200 --IFDPKTSSSYSPLPCAAPQC-KSLDVSACR----ANRCLYQVAYGDGSFTVGDLVTET 252
              F P+ S +++P+ CA+  C KSL  S        + C Y   Y DGS   G + TE+
Sbjct: 145 GRAFRPEDSRTWAPISCASDTCTKSLPFSLATCPTPGSPCAYDYRYKDGSAARGTVGTES 204

Query: 253 VSFGNSG------SVKGIALGCGHDNEG-LFVGSAGLLGLGGGMLSLTKQIKAT---SLA 302
            +   SG       +KG+ LGC     G  F  S G+L LG   +S      +      +
Sbjct: 205 ATIALSGREERKAKLKGLVLGCSSSYTGPSFEASDGVLSLGYSGISFASHAASRFGGRFS 264

Query: 303 YCLVDRDSP--ASGVLEF------NSAR---------GGDAVTAPLIRNKKVDTFYYVGL 345
           YCLVD  SP  A+  L F      +S R            A   PL+ ++++  FY V L
Sbjct: 265 YCLVDHLSPRNATSYLTFGPNPAVSSPRASPSSCAAAAPRARQTPLLLDRRMRPFYDVSL 324

Query: 346 TGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPT 405
              SV G+ ++IP +++++ EAG GG+I+D GT++T L   AY ++  +  +    L P 
Sbjct: 325 KAISVAGEFLKIPRAVWDV-EAG-GGVILDSGTSLTVLAKPAYRAVVAALSKGLAGL-PR 381

Query: 406 SGVALFDTCYDFSGLRS----VRVPTVSLHFGAGKALDLPAKNYLIPVDSA-GTFCFAFA 460
             +  F+ CY+++        V VP +++HF     L+ P K+Y+I  D+A G  C    
Sbjct: 382 VTMDPFEYCYNWTSPSGKDADVAVPKMAVHFAGAARLEPPGKSYVI--DAAPGVKCIGLQ 439

Query: 461 P-TSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
                 +S+IGN+ QQ     FD+ N R+ F  ++C
Sbjct: 440 EGPWPGISVIGNILQQEHLWEFDIKNRRLKFQRSRC 475


>gi|147814824|emb|CAN65806.1| hypothetical protein VITISV_015630 [Vitis vinifera]
          Length = 449

 Score =  160 bits (404), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 117/387 (30%), Positives = 176/387 (45%), Gaps = 45/387 (11%)

Query: 146 TPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQC------RPCTECYQ---Q 196
            P+   A  G G+YF    VGTP ++F +V DTGSD+ W+ C      R C+       +
Sbjct: 70  VPMHPAADYGIGQYFVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIR 129

Query: 197 SDPIFDPKTSSSYSPLPCAAPQCK-------SLDVSACRANRCLYQVAYGDGSFTVGDLV 249
              +F    SSS+  +PC    CK       SL         C Y   Y DGS  +G   
Sbjct: 130 HKRVFHANLSSSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTALGFFA 189

Query: 250 TETVSF----GNSGSVKGIALGCGHDNEGL-FVGSAGLLGLGGGMLSLTKQIKAT----- 299
            ETV+     G    +  + +GC    +G  F  + G++GLG    S    IKA      
Sbjct: 190 NETVTVELKEGRKMKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSFA--IKAAEKFGG 247

Query: 300 SLAYCLVDRDSP--ASGVLEFNSARGGDAVTAPLIRNK----KVDTFYYVGLTGFSVGGQ 353
             +YCLVD  S    S  L F S+R  +A+   +   +     V++FY V + G S+GG 
Sbjct: 248 KFSYCLVDHLSHKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIGGA 307

Query: 354 AVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYN----SLRDSFVRLAGNLKPTSGVA 409
            ++IP  ++  D  G GG I+D G+++T L   AY     +LR S ++     K    + 
Sbjct: 308 MLKIPSEVW--DVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFR---KVEMDIG 362

Query: 410 LFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSS-ALSI 468
             + C++ +G     VP +  HF  G   + P K+Y+I   + G  C  F   +    S+
Sbjct: 363 PLEYCFNSTGFEESLVPRLVFHFADGAEFEPPVKSYVISA-ADGVRCLGFVSVAWPGTSV 421

Query: 469 IGNVQQQGTRVSFDLANNRVGFTPNKC 495
           +GN+ QQ     FDL   ++GF P+ C
Sbjct: 422 VGNIMQQNHLWEFDLGLKKLGFAPSSC 448


>gi|357456413|ref|XP_003598487.1| Peptidase A1, putative [Medicago truncatula]
 gi|355487535|gb|AES68738.1| Peptidase A1, putative [Medicago truncatula]
          Length = 414

 Score =  160 bits (404), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 115/364 (31%), Positives = 171/364 (46%), Gaps = 30/364 (8%)

Query: 147 PVVSGASQ-GSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKT 205
           P+ SG     S  Y  R  +GTPP+   + +DT +D  W+   PCT C   +  +F P+ 
Sbjct: 65  PIASGRQIIQSPTYIVRAKIGTPPQTLLLAMDTSNDAAWI---PCTACDGCASTLFAPEK 121

Query: 206 SSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIA 265
           S+++  + CAAP+CK +    C  + C + + YG  S    +LV +T++   +  V    
Sbjct: 122 STTFKNVSCAAPECKQVPNPGCGVSSCNFNLTYGSSSI-AANLVQDTITLA-TDPVPSYT 179

Query: 266 LGCGHDNEGLFV---GSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFN-SA 321
            GC     G      G  GL      +LS T+ +  ++ +YCL     P+   L F+ S 
Sbjct: 180 FGCVSKTTGTSAPPQGLLGLGRGPLSLLSQTQNLYQSTFSYCL-----PSFKSLNFSGSL 234

Query: 322 RGGDAVT------APLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVD 375
           R G           PL++N +  + YYV L    VG + V IPP+    +     G I D
Sbjct: 235 RLGPVAQPKRIKYTPLLKNPRRSSLYYVNLEAIRVGRKVVDIPPAALAFNPTTGAGTIFD 294

Query: 376 CGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAG 435
            GT  TRL    Y ++RD F R  G     + +  FDTCY+      + VPT++  F  G
Sbjct: 295 SGTVFTRLVAPVYVAVRDEFRRRVGPKLTVTSLGGFDTCYNV----PIVVPTITFIF-TG 349

Query: 436 KALDLPAKNYLIPVDSAGTFCFAFA----PTSSALSIIGNVQQQGTRVSFDLANNRVGFT 491
             + LP  N LI   +  T C A A      +S L++I N+QQQ  RV +D+ N+RVG  
Sbjct: 350 MNVTLPQDNILIHSTAGSTTCLAMAGAPDNVNSVLNVIANMQQQNHRVLYDVPNSRVGVA 409

Query: 492 PNKC 495
              C
Sbjct: 410 RELC 413


>gi|356553263|ref|XP_003544977.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 445

 Score =  159 bits (403), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 113/358 (31%), Positives = 167/358 (46%), Gaps = 38/358 (10%)

Query: 165 VGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPI--FDPKTSSSYSPLPCAAPQCKSL 222
           +GTPP+   MVLDTGS ++W+QC      + ++ P   FDP  SSS+  LPC  P CK  
Sbjct: 94  IGTPPQPQQMVLDTGSQLSWIQC------HNKTPPTASFDPSLSSSFYVLPCTHPLCKPR 147

Query: 223 DV-----SACRANR-CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLF 276
                  + C  NR C Y   Y DG++  G+LV E ++F  S +   + LGC  ++    
Sbjct: 148 VPDFTLPTTCDQNRLCHYSYFYADGTYAEGNLVREKLAFSPSQTTPPLILGCSSESRD-- 205

Query: 277 VGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNKK 336
             + G+LG+  G LS   Q K T  +YC+  R    +      S   G+   +   R   
Sbjct: 206 --ARGILGMNLGRLSFPFQAKVTKFSYCVPTRQPANNNNFPTGSFYLGNNPNSARFRYVS 263

Query: 337 VDTF-------------YYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRL 383
           + TF             Y V + G  +GG+ + IPPS+F  +  G G  +VD G+  T L
Sbjct: 264 MLTFPQSQRMPNLDPLAYTVPMQGIRIGGRKLNIPPSVFRPNAGGSGQTMVDSGSEFTFL 323

Query: 384 QTQAYNSLRDSFVRLAGNLKPTSGV--ALFDTCYDFSGLRSVR-VPTVSLHFGAGKALDL 440
              AY+ +R+  +R+ G       V   + D C+D + +   R +  V+  F  G  + +
Sbjct: 324 VDVAYDRVREEIIRVLGPRVKKGYVYGGVADMCFDGNAMEIGRLLGDVAFEFEKGVEIVV 383

Query: 441 PAKNYLIPVDSAGTFCFAFAPTS---SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           P +  L  V   G  C     +    +A +IIGN  QQ   V FDLAN R+GF    C
Sbjct: 384 PKERVLADV-GGGVHCVGIGRSERLGAASNIIGNFHQQNLWVEFDLANRRIGFGVADC 440


>gi|356528627|ref|XP_003532901.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 430

 Score =  159 bits (403), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 125/415 (30%), Positives = 193/415 (46%), Gaps = 38/415 (9%)

Query: 102 VLSRLERDSARVNTLITKLQL----AIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSG 157
           ++ R    S   N+ +T+ +L    A+ ++ R +      QI P    +P+++      G
Sbjct: 30  LIPRHSPISPLYNSQMTQTELVKSAALRSITRSKRVNFIGQISPP--LSPIITPIPD-HG 86

Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAP 217
           EY  R  +GTP  +   + DTGSD++WLQC PC  CY Q  P+FDP  SS+Y  +PC + 
Sbjct: 87  EYLMRFSLGTPSVERLAIFDTGSDLSWLQCTPCKTCYPQEAPLFDPTQSSTYVDVPCESQ 146

Query: 218 QCKSL--DVSACRANR-CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIA------LGC 268
            C     +   C +++ C+Y   YG  SFT+G L  +T+SF ++G  +G A       GC
Sbjct: 147 PCTLFPQNQRECGSSKQCIYLHQYGTDSFTIGRLGYDTISFSSTGMGQGGATFPKSVFGC 206

Query: 269 GHDNEGLF---VGSAGLLGLGGGMLSLTKQIK---ATSLAYCLVDRDSPASGVLEFNS-A 321
              +   F     + G +GLG G LSL  Q+        +YC+V   S ++G L+F S A
Sbjct: 207 AFYSNFTFKISTKANGFVGLGPGPLSLASQLGDQIGHKFSYCMVPFSSTSTGKLKFGSMA 266

Query: 322 RGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAIT 381
              + V+ P + N    ++Y + L G +VG + V        +     G II+D    +T
Sbjct: 267 PTNEVVSTPFMINPSYPSYYVLNLEGITVGQKKV--------LTGQIGGNIIIDSVPILT 318

Query: 382 RLQTQAYNSLRDSFVRLAGNLKPTSGVAL-FDTCYDFSGLRSVRVPTVSLHFGAGKALDL 440
            L+   Y     S V+ A N++        F+ C       ++  P    HF  G  + L
Sbjct: 319 HLEQGIYTDFISS-VKEAINVEVAEDAPTPFEYC--VRNPTNLNFPEFVFHF-TGADVVL 374

Query: 441 PAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
             KN  I +D+    C    P S  +SI GN  Q   +V +DL   +V F P  C
Sbjct: 375 GPKNMFIALDN-NLVCMTVVP-SKGISIFGNWAQVNFQVEYDLGEKKVSFAPTNC 427


>gi|297843774|ref|XP_002889768.1| hypothetical protein ARALYDRAFT_471076 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297335610|gb|EFH66027.1| hypothetical protein ARALYDRAFT_471076 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 449

 Score =  159 bits (403), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 123/366 (33%), Positives = 175/366 (47%), Gaps = 23/366 (6%)

Query: 145 STPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPK 204
           S PV SG     G Y  R  +GTPP+   MVLDT +D  WL C  C+ C   +   F+  
Sbjct: 91  SVPVASGNQLHIGNYVVRARLGTPPQLMFMVLDTSNDAVWLPCSGCSGC-SNASTSFNTN 149

Query: 205 TSSSYSPLPCAAPQCKSLDVSAC-----RANRCLYQVAYGDGSFTVGDLVTETVSFGNSG 259
           +SS+YS + C+  QC       C     + + C +  +YG  S    +LV +T++  +  
Sbjct: 150 SSSTYSTVSCSTTQCTQARGLTCPSSTPQPSICSFNQSYGGDSSFSANLVQDTLTL-SPD 208

Query: 260 SVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQ---IKATSLAYCLVD-RDSPASGV 315
            +   + GC +   G  +   GL+GLG G +SL  Q   + +   +YCL   R    SG 
Sbjct: 209 VIPNFSFGCINSASGNSLPPQGLMGLGRGPMSLVSQTTSLYSGVFSYCLPSFRSFYFSGS 268

Query: 316 LEFNSARGGDAVT-APLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIV 374
           L+        ++   PL+RN +  + YYV LTG SVG   V + P     D     G I+
Sbjct: 269 LKLGLLGQPKSIRYTPLLRNPRRPSLYYVNLTGVSVGSVQVPVDPVYLTFDSNSGAGTII 328

Query: 375 DCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGA 434
           D GT ITR     Y ++RD F +       T G   FDTC  FS       P ++LH   
Sbjct: 329 DSGTVITRFAQPVYEAIRDEFRKQVNGSFSTLGA--FDTC--FSADNENVTPKITLHM-T 383

Query: 435 GKALDLPAKNYLIPVDSAGTF-CFAFA----PTSSALSIIGNVQQQGTRVSFDLANNRVG 489
              L LP +N LI   SAGT  C + A      ++ L++I N+QQQ  R+ FD+ N+R+G
Sbjct: 384 SLDLKLPMENTLIH-SSAGTLTCLSMAGIRQNANAVLNVIANLQQQNLRILFDVPNSRIG 442

Query: 490 FTPNKC 495
             P  C
Sbjct: 443 IAPEPC 448


>gi|242086034|ref|XP_002443442.1| hypothetical protein SORBIDRAFT_08g019550 [Sorghum bicolor]
 gi|241944135|gb|EES17280.1| hypothetical protein SORBIDRAFT_08g019550 [Sorghum bicolor]
          Length = 443

 Score =  159 bits (403), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 120/365 (32%), Positives = 176/365 (48%), Gaps = 36/365 (9%)

Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTE--CYQQSDPIFDPKTSSSYSPLPCA 215
           +Y +   VG PP++   ++DTGS + W QC  C    C +Q  P F+  +S S++P+PC 
Sbjct: 85  QYIAEYMVGDPPQRAEALIDTGSSLIWTQCTACLRKVCVRQDLPYFNASSSGSFAPVPCQ 144

Query: 216 APQCKSLDVSACRAN-RCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNE- 273
              C    +  C  +  C ++V YG G   +G L T+  +F + G+   +A GC      
Sbjct: 145 DKACAGNYLHFCALDGTCTFRVTYGAGGI-IGFLGTDAFTFQSGGAT--LAFGCVSFTRF 201

Query: 274 ---GLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVD--RDSPASGVLEFNSAR-----G 323
               +  G++GL+GLG G LSL  Q  A   +YCL     ++ AS  L   +A      G
Sbjct: 202 AAPDVLHGASGLIGLGRGRLSLASQTGAKRFSYCLTPYFHNNGASSHLFVGAAASLSGGG 261

Query: 324 GDAVTAPLIRNKK---VDTFYYVGLTGFSVGGQAVQIPPSLFEMDEA----GDGGIIVDC 376
           G  ++   + + K     TFYY+ L G +VG   + IP + F++ E      +GG+I+D 
Sbjct: 262 GAVMSMAFVESPKDYPYSTFYYLPLVGITVGETKLAIPSTAFDLQEVEEGFWEGGVIIDS 321

Query: 377 GTAITRLQTQAYNSLRDSFVR-LAGNLKP-----TSGVALFDTCYDFSGLRSVRVPTVSL 430
           G+  T L   AY  L     R L G+L P       G+AL   C     L  V VPT+ L
Sbjct: 322 GSPFTSLVEDAYEPLMGELARQLNGSLVPPPGEDDGGMAL---CVARGDLDRV-VPTLVL 377

Query: 431 HFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGF 490
           HF  G  + LP +NY  P++ + T C A        SIIGN QQQ   + FD+   R+ F
Sbjct: 378 HFSGGADMALPPENYWAPLEKS-TACMAIV-RGYLQSIIGNFQQQNMHILFDVGGGRLSF 435

Query: 491 TPNKC 495
               C
Sbjct: 436 QNADC 440


>gi|356528671|ref|XP_003532923.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 440

 Score =  159 bits (403), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 119/364 (32%), Positives = 169/364 (46%), Gaps = 43/364 (11%)

Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAP 217
           EY  R  +GTPP +   + DTGSD+ W+QC PC +C  Q+ P+FDP+ SS++  +PC + 
Sbjct: 91  EYLMRFYIGTPPVERFAIADTGSDLIWVQCAPCEKCVPQNAPLFDPRKSSTFKTVPCDSQ 150

Query: 218 QCKSLDVS--AC--RANRCLYQVAYGDGSFTVGDLVTETVSFG---NSGSVKGIALGCGH 270
            C  L  S  AC  ++ +C YQ  YGD +   G L  E+++FG   N+     +  GC  
Sbjct: 151 PCTLLPPSQRACVGKSGQCYYQYIYGDHTLVSGILGFESINFGSKNNAIKFPKLTFGCTF 210

Query: 271 DNEGLFVGSA---GLLGLGGGMLSLTKQIK---ATSLAYCLVDRDSPASGVLEFNSARGG 324
            N      S    GL+GLG G LSL  Q+        +YC     S ++  + F    G 
Sbjct: 211 SNNDTVDESKRNMGLVGLGVGPLSLISQLGYQIGRKFSYCFPPLSSNSTSKMRF----GN 266

Query: 325 DA--------VTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDC 376
           DA        V+ PLI      ++YY+ L G S+G + V+   S        DG I++D 
Sbjct: 267 DAIVKQIKGVVSTPLIIKSIGPSYYYLNLEGVSIGNKKVKTSES------QTDGNILIDS 320

Query: 377 GTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVAL----FDTCYDFSGLRSVRVPTVSLHF 432
           GT+ T L+   YN     FV L   +     V +    ++ C++  G R  R P V   F
Sbjct: 321 GTSFTILKQSFYN----KFVALVKEVYGVEAVKIPPLVYNFCFENKGKRK-RFPDVVFLF 375

Query: 433 GAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSA-LSIIGNVQQQGTRVSFDLANNRVGFT 491
             G  + + A N L   +     C    PTS    SI GN  Q G +V +DL    V F 
Sbjct: 376 -TGAKVRVDASN-LFEAEDNNLLCMVALPTSDEDDSIFGNHAQIGYQVEYDLQGGMVSFA 433

Query: 492 PNKC 495
           P  C
Sbjct: 434 PADC 437


>gi|125547728|gb|EAY93550.1| hypothetical protein OsI_15341 [Oryza sativa Indica Group]
          Length = 418

 Score =  159 bits (402), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 111/361 (30%), Positives = 173/361 (47%), Gaps = 55/361 (15%)

Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCR--PCTECYQQSDPIFDPKTSSSYSPLPCA 215
           EY   +  GTPP++  + LDTGSDI W QC+  P + C+ Q+ P+FDP  SSS++ LPC+
Sbjct: 87  EYLVHLAAGTPPQEVQLTLDTGSDITWTQCKRCPASACFNQTLPLFDPSASSSFASLPCS 146

Query: 216 APQCKSLDVSA----CRANRCLYQVAYGDGSFTVGDLVTETVSF------GNSGSVKGIA 265
           +P C++           +  C Y ++YGDGS + G++  E  +F      G+S +V G+ 
Sbjct: 147 SPACETTPPCGGGNDATSRPCNYSISYGDGSVSRGEIGREVFTFASGTGEGSSAAVPGLV 206

Query: 266 LGCGHDNEGLFVGS-AGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGG 324
            GCGH N G+F  +  G+ G G G LSL  Q+K  + ++C        +  +        
Sbjct: 207 FGCGHANRGVFTSNETGIAGFGRGSLSLPSQLKVGNFSHCFTTITGSKTSAVLLGLPGVA 266

Query: 325 DAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQ 384
               +PL R +               G    +  P               + GT+IT L 
Sbjct: 267 PPSASPLGRRR---------------GSYRCRSTPR------------SSNSGTSITSLP 299

Query: 385 TQAYNSLRDSF-VRLAGNLKPTSGVALFDTCYDFSGLRSVR--VPTVSLHFGAGKALDLP 441
            + Y ++R+ F  ++   + P +    F TC+  + LR  +  VPT++LHF  G  + LP
Sbjct: 300 PRTYRAVREEFAAQVKLPVVPGNATDPF-TCFS-APLRGPKPDVPTMALHF-EGATMRLP 356

Query: 442 AKNYLIPV---DSAGT----FCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNK 494
            +NY+  V   D AG      C A         I+GN+QQQ   V +DL N+++ F P +
Sbjct: 357 QENYVFEVVDDDDAGNSSRIICLAV--IEGGEIILGNIQQQNMHVLYDLQNSKLSFVPAQ 414

Query: 495 C 495
           C
Sbjct: 415 C 415


>gi|343198386|gb|AEM05966.1| nodulin 41 [Phaseolus vulgaris]
          Length = 437

 Score =  159 bits (402), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 113/364 (31%), Positives = 171/364 (46%), Gaps = 40/364 (10%)

Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCA 215
           +GEY  R  +GTPP +     DTGSD+ W+QC PC  C+ QS P+F P  SS++ P  C 
Sbjct: 87  NGEYLMRFYIGTPPVERLATADTGSDLIWVQCSPCASCFPQSTPLFQPLKSSTFMPTTCR 146

Query: 216 APQCKSL--DVSAC-RANRCLYQVAYGDG-SFTVGDLVTETVSFGNSGSVKGIA-----L 266
           +  C  L  +   C ++  C+Y   YGD  SF+ G L TET+ F + G V+ +A      
Sbjct: 147 SQPCTLLLPEQKGCGKSGECIYTYKYGDQYSFSEGLLSTETLRFDSQGGVQTVAFPNSFF 206

Query: 267 GCG-HDNEGLF--VGSAGLLGLGGGMLSLTKQIK---ATSLAYCLVDRDSPASGVLEFNS 320
           GCG ++N  +F      G++GLG G LSL  QI        +YCL+   S ++  L+F +
Sbjct: 207 GCGLYNNITVFPSYKLTGIMGLGAGPLSLVSQIGDQIGHKFSYCLLPLGSTSTSKLKFGN 266

Query: 321 AR---GGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCG 377
                G   V+ P+I    + T+Y++ L   +V  + V           + DG +I+D G
Sbjct: 267 ESIITGEGVVSTPMIIKPWLPTYYFLNLEAVTVAQKTVP--------TGSTDGNVIIDSG 318

Query: 378 TAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTC-----YDFSGLRSVRVPTVSLHF 432
           T +T L    Y          A +L+ +  V L         + F    +   P ++  F
Sbjct: 319 TLLTYLGESFY-------YNFAASLQESLAVELVQDVLSPLPFCFPYRDNFVFPEIAFQF 371

Query: 433 GAGKALDLPAKNYLIPVDSAGTFCFAFAPTS-SALSIIGNVQQQGTRVSFDLANNRVGFT 491
             G  + L   N  +  +   T C   AP+S S +SI G+  Q   +V +DL   +V F 
Sbjct: 372 -TGARVSLKPANLFVMTEDRNTVCLMIAPSSVSGISIFGSFSQIDFQVEYDLEGKKVSFQ 430

Query: 492 PNKC 495
           P  C
Sbjct: 431 PTDC 434


>gi|242070719|ref|XP_002450636.1| hypothetical protein SORBIDRAFT_05g008470 [Sorghum bicolor]
 gi|241936479|gb|EES09624.1| hypothetical protein SORBIDRAFT_05g008470 [Sorghum bicolor]
          Length = 410

 Score =  159 bits (401), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 121/349 (34%), Positives = 173/349 (49%), Gaps = 27/349 (7%)

Query: 155 GSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPC 214
           G G Y     +GTPP++ S + DTGSD+ W +C  CT C  Q  P + P  SSS+S LPC
Sbjct: 78  GGGAYDMTFSIGTPPQELSALADTGSDLIWAKCGACTRCVPQGSPSYYPNKSSSFSKLPC 137

Query: 215 AAPQCKSLDVSACRAN--RCLYQVAYGDGS----FTVGDLVTETVSFGNSGSVKGIALGC 268
           +   C  L  S C A    C Y+ +YG  S    +T G L +ET + G S +V GI  GC
Sbjct: 138 SGSLCSDLPSSQCSAGGAECDYKYSYGLASDPHHYTQGYLGSETFTLG-SDAVPGIGFGC 196

Query: 269 GHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGDAV- 327
              +EG +   +GL+GLG G LSL  Q+   + +YCL    +  S +L  + A  G  V 
Sbjct: 197 TTMSEGGYGSGSGLVGLGRGPLSLVSQLNVGAFSYCLTSDAAKTSPLLFGSGALTGAGVQ 256

Query: 328 TAPLIRNKKVDTFYY-VGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQ 386
           + PL+R     T+YY V L   S+G                G  GII D GT +  L   
Sbjct: 257 STPLLRTS---TYYYTVNLESISIGAATTA---------GTGSSGIIFDSGTTVAFLAEP 304

Query: 387 AYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYL 446
           AY   +++ +    NL   SG   ++ C+  SG      P++ LHF  G  +DLP +NY 
Sbjct: 305 AYTLAKEAVLSQTTNLTMASGRDGYEVCFQTSG---AVFPSMVLHFDGGD-MDLPTENYF 360

Query: 447 IPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
             VD + + C+     S +LSI+GN+ Q    + +D+  + + F P  C
Sbjct: 361 GAVDDSVS-CW-IVQKSPSLSIVGNIMQMNYHIRYDVEKSMLSFQPANC 407


>gi|302780643|ref|XP_002972096.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
 gi|300160395|gb|EFJ27013.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
          Length = 393

 Score =  159 bits (401), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 122/356 (34%), Positives = 180/356 (50%), Gaps = 35/356 (9%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAA 216
           G Y   I VGTP ++F  + DTGSD+ W+Q  PCT C      IFDP+ SS++  + C++
Sbjct: 53  GGYVMDISVGTPGKRFRAIADTGSDLVWVQSEPCTGC--SGGTIFDPRQSSTFREMDCSS 110

Query: 217 PQCKSLDVSACR--ANRCLYQVAYGDGSFTVGDLVTETVSFGNS--GSVK--GIALGCGH 270
             C  L  S C   ++ C Y   YG G  T G+   +T+S G +  GS K    A+GCG 
Sbjct: 111 QLCAELPGS-CEPGSSTCSYSYEYGSGE-TEGEFARDTISLGTTSDGSQKFPSFAVGCGM 168

Query: 271 DNEGLFVGSAGLLGLGGGMLSLTKQIKA---TSLAYCLVDRDSPA-SGVLEF--NSARGG 324
            N G F G  GL+GLG G +SLT Q+ A   +  +YCLVD +S + S  L F  ++A  G
Sbjct: 169 VNSG-FDGVDGLVGLGQGPVSLTSQLSAAIDSKFSYCLVDINSQSESSPLLFGPSAALHG 227

Query: 325 DAVTAPLIR--NKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITR 382
             + +  I   +    T+Y + + G +V GQ +  P           G  I+D GT +T 
Sbjct: 228 TGIQSTKITPPSDTYPTYYLLTVNGIAVAGQTMGSP-----------GTTIIDSGTTLTY 276

Query: 383 LQTQAYNSLRDSFVRLAGNLKPTSGVAL-FDTCYDFSGLRSVRVPTVSLHFGAGKALDLP 441
           + +  Y  +      +   L    G ++  D CYD S  R+ + P +++   AG  +  P
Sbjct: 277 VPSGVYGRVLSRMESMV-TLPRVDGSSMGLDLCYDRSSNRNYKFPALTIRL-AGATMTPP 334

Query: 442 AKNYLIPVDSAG-TFCFAFAPTSS-ALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           + NY + VD +G T C A    S   +SIIGNV QQG  + +D  ++ + F   KC
Sbjct: 335 SSNYFLVVDDSGDTVCLAMGSASGLPVSIIGNVMQQGYHILYDRGSSELSFVQAKC 390


>gi|297801286|ref|XP_002868527.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297314363|gb|EFH44786.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 444

 Score =  159 bits (401), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 121/359 (33%), Positives = 175/359 (48%), Gaps = 34/359 (9%)

Query: 165 VGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPI--FDPKTSSSYSPLPCAAPQCK-- 220
           +GTP +   +VLDTGS ++W+QC P         P   FDP  SSS+S LPC+ P CK  
Sbjct: 87  IGTPSQSQELVLDTGSQLSWIQCHPKKIKKPLPPPTTSFDPSLSSSFSDLPCSHPLCKPR 146

Query: 221 ----SLDVSACRANR-CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGL 275
               +L  S C +NR C Y   Y DG+F  G+LV E  +F NS +   + LGC  ++  +
Sbjct: 147 IPDFTLPTS-CDSNRLCHYSYFYADGTFAEGNLVKEKFTFSNSQTTPPLILGCAKESTDV 205

Query: 276 FVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRD------SPASGVLEFN-SARGGDAVT 328
                G+LG+  G LS   Q K +  +YC+  R       S  S  L  N ++RG   V+
Sbjct: 206 ----KGILGMNLGRLSFISQAKISKFSYCIPTRSNRPGLASTGSFYLGENPNSRGFKYVS 261

Query: 329 APLI----RNKKVDTF-YYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRL 383
                   R   +D   Y V L G  +G + + IP S+F  D  G G  +VD G+  T L
Sbjct: 262 LLTFPQSQRMPNLDPLAYTVPLLGIRIGQKRLNIPSSVFRPDAGGSGQTMVDSGSEFTHL 321

Query: 384 QTQAYNSLRDSFVRLAGNLKPTSGV--ALFDTCYDFSGLRSV--RVPTVSLHFGAGKALD 439
              AY+ +++  VRL G+      V  +  D C+D +    +   +  +   FG G  + 
Sbjct: 322 VDVAYDKVKEEIVRLVGSRLKKGYVYGSTADMCFDGNHQMVIGRLIGDLVFEFGRGVEIL 381

Query: 440 LPAKNYLIPVDSAGTFCFAFAPTS---SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           +  +  L+ V   G  C     +S   +A +IIGNV QQ   V FD+AN RVGF+  +C
Sbjct: 382 VEKQRLLVNV-GGGIHCVGIGRSSMLGAASNIIGNVHQQNLWVEFDVANRRVGFSKAEC 439


>gi|357118738|ref|XP_003561107.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 491

 Score =  158 bits (400), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 133/429 (31%), Positives = 200/429 (46%), Gaps = 53/429 (12%)

Query: 100 SLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEY 159
           S V   L  D  R   +  KL+  +  + R  +     Q+  +    P V    QG+G  
Sbjct: 83  SSVAETLRWDQHRAGYIQRKLEDQV-PITRSVIT----QVSHQGVVQPKVGTQGQGTGVQ 137

Query: 160 FSRIGVGTPPRQFS------MVLDTGSDINWLQCRPCT--ECYQQSDPIFDPKTSSSYSP 211
            +   VG  P   S      MV+DT SD+ W+QC PC    C+ Q+D ++DP  SSS + 
Sbjct: 138 PAGEPVGDAPTGGSGGVAQTMVIDTASDVPWVQCAPCPAPHCHAQTDVLYDPSKSSSSAA 197

Query: 212 LPCAAPQCKSLD--VSACR--ANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIA-- 265
            PC++P C++L    + C    ++C Y+V Y DGS + G  +++ ++   +     I+  
Sbjct: 198 FPCSSPACRNLGPYANGCTPAGDQCQYRVQYPDGSASAGTYISDVLTLNPAKPASAISEF 257

Query: 266 -LGCGHD--NEGLFVG-SAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPAS----G 314
             GC H     G F   ++G++ LG G  SL  Q KAT     +YCL      +     G
Sbjct: 258 RFGCSHALLQPGSFSNKTSGIMALGRGAQSLPTQTKATYGDVFSYCLPPTPVHSGFFILG 317

Query: 315 VLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIV 374
           V    ++R   AVT P++R+K     Y V L    V G+ + +PP++F        G ++
Sbjct: 318 VPRVAASR--YAVT-PMLRSKAAPMLYLVRLIAIEVAGKRLPVPPAVFA------AGAVM 368

Query: 375 DCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFS-----GLRSVRVPTVS 429
           D  T +TRL   AY +LR +FV      +  +     DTCYDFS     G   V++P ++
Sbjct: 369 DSRTIVTRLPPTAYMALRAAFVAEMRAYRAAAPKEHLDTCYDFSGAAPGGGGGVKLPKIT 428

Query: 430 LHF-GAGKALDLPAKNYLIPVDSAGTFCFAFAPTS--SALSIIGNVQQQGTRVSFDLANN 486
           L F G   A++L     L+        C AFAP +      IIGNVQQQ   V +++   
Sbjct: 429 LVFDGPNGAVELDPSGVLL------DGCLAFAPNTDDQMTGIIGNVQQQALEVLYNVDGA 482

Query: 487 RVGFTPNKC 495
            VGF    C
Sbjct: 483 TVGFRRGAC 491


>gi|326503602|dbj|BAJ86307.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 461

 Score =  158 bits (400), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 122/365 (33%), Positives = 174/365 (47%), Gaps = 35/365 (9%)

Query: 163 IGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSL 222
           + VGTPP+  +MVLDTGS+++WL C P     + S   F P+ SS+++ +PCA+ QC+S 
Sbjct: 89  LAVGTPPQNVTMVLDTGSELSWLLCAPAGARNKFSAMSFRPRASSTFAAVPCASAQCRSR 148

Query: 223 DV---SAC--RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGC---GHDNEG 274
           D+    AC   ++RC   ++Y DGS + G L T+  + G+   ++  A GC     D+  
Sbjct: 149 DLPSPPACDGASSRCSVSLSYADGSSSDGALATDVFAVGSGPPLRA-AFGCMSSAFDSSP 207

Query: 275 LFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSAR-------GGDAV 327
             V SAGLLG+  G LS   Q      +YC+ DRD   +GVL    +            +
Sbjct: 208 DGVASAGLLGMNRGALSFVSQASTRRFSYCISDRDD--AGVLLLGHSDLPTFLPLNYTPM 265

Query: 328 TAPLIRNKKVDTFYY-VGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQ 386
             P +     D   Y V L G  VGG+ + IP S+   D  G G  +VD GT  T L   
Sbjct: 266 YQPALPLPYFDRVAYSVQLLGIRVGGKHLPIPASVLAPDHTGAGQTMVDSGTQFTFLLGD 325

Query: 387 AYNSLRDSFVRLAGNLKPT------SGVALFDTCYDFSGLRS---VRVPTVSLHF-GAGK 436
           AY++L+  F R A  L P       +    FDTC+     RS    R+P V+L F GA  
Sbjct: 326 AYSALKAEFTRQARPLLPALDDPSFAFQEAFDTCFRVPQGRSPPTARLPGVTLLFNGAEM 385

Query: 437 ALDLPAKNYLIPVDSA---GTFCFAFAPTSSA---LSIIGNVQQQGTRVSFDLANNRVGF 490
           A+      Y +P +     G +C  F           +IG+  Q    V +DL   RVG 
Sbjct: 386 AVAGDRLLYKVPGERRGGDGVWCLTFGNADMVPIMAYVIGHHHQMNVWVEYDLERGRVGL 445

Query: 491 TPNKC 495
            P +C
Sbjct: 446 APVRC 450


>gi|326521034|dbj|BAJ92880.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 448

 Score =  158 bits (400), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 117/350 (33%), Positives = 163/350 (46%), Gaps = 25/350 (7%)

Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQ 218
           Y  R  +GTPP+Q  + +DT +D  W+ C  C  C   +   F+P  S SY  +PC +P 
Sbjct: 108 YVVRARLGTPPQQLLLAVDTSNDAAWIPCSGCAGCPTTTP--FNPAASKSYRAVPCGSPA 165

Query: 219 CKSLDVSACRAN--RCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLF 276
           C      +C  N   C + + Y D S     L  ++++  N   VK    GC     G  
Sbjct: 166 CSRAPNPSCSLNTKSCGFSLTYADSSLEAA-LSQDSLAVAND-VVKSYTFGCLQKATGTA 223

Query: 277 V---GSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPA-SGVLEFNSARGGDAV---TA 329
               G  GL       LS TK +   + +YCL    S   SG L     R G  +   T 
Sbjct: 224 TPPQGLLGLGRGPLSFLSQTKDMYEGTFSYCLPSFKSLNFSGTLRLG--RKGQPLRIKTT 281

Query: 330 PLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYN 389
           PL+ N    + YYV +TG  VG + V IPP+    D A   G ++D GT  TRL   AY 
Sbjct: 282 PLLVNPHRSSLYYVSMTGIRVGKKVVPIPPAALAFDPATGAGTVLDSGTMFTRLVAPAYV 341

Query: 390 SLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPV 449
           ++RD  VR      P S +  FDTCY+     +V+ P V+  F  G  + LPA N +I  
Sbjct: 342 AVRDE-VRRRIRGAPLSSLGGFDTCYN----TTVKWPPVTFMF-TGMQVTLPADNLVIHS 395

Query: 450 DSAGTFCFAFAP----TSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
               T C A A      ++ L++I ++QQQ  R+ FD+ N RVGF   +C
Sbjct: 396 TYGTTSCLAMAAAPDGVNTVLNVIASMQQQNHRILFDVPNGRVGFAREQC 445


>gi|18421660|ref|NP_568551.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|10177438|dbj|BAB10671.1| unnamed protein product [Arabidopsis thaliana]
 gi|15809850|gb|AAL06853.1| AT5g37540/mpa22_p_70 [Arabidopsis thaliana]
 gi|20260182|gb|AAM12989.1| unknown protein [Arabidopsis thaliana]
 gi|23197748|gb|AAN15401.1| unknown protein [Arabidopsis thaliana]
 gi|332006821|gb|AED94204.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 442

 Score =  158 bits (399), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 120/362 (33%), Positives = 173/362 (47%), Gaps = 40/362 (11%)

Query: 165 VGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPI--FDPKTSSSYSPLPCAAPQCK-- 220
           +GTP +   +VLDTGS ++W+QC P         P   FDP  SSS+S LPC+ P CK  
Sbjct: 86  IGTPSQSQELVLDTGSQLSWIQCHPKKIKKPLPPPTTSFDPSLSSSFSDLPCSHPLCKPR 145

Query: 221 ----SLDVSACRANR-CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGL 275
               +L  S C +NR C Y   Y DG+F  G+LV E  +F NS +   + LGC  ++   
Sbjct: 146 IPDFTLPTS-CDSNRLCHYSYFYADGTFAEGNLVKEKFTFSNSQTTPPLILGCAKES--- 201

Query: 276 FVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNK 335
                G+LG+  G LS   Q K +  +YC+  R S   G+    S   GD   +   +  
Sbjct: 202 -TDEKGILGMNLGRLSFISQAKISKFSYCIPTR-SNRPGLASTGSFYLGDNPNSRGFKYV 259

Query: 336 KVDTF-------------YYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITR 382
            + TF             Y V L G  +G + + IP S+F  D  G G  +VD G+  T 
Sbjct: 260 SLLTFPQSQRMPNLDPLAYTVPLQGIRIGQKRLNIPGSVFRPDAGGSGQTMVDSGSEFTH 319

Query: 383 LQTQAYNSLRDSFVRLAGNLKPTSGV--ALFDTCYDFSGLRSVRVPT----VSLHFGAGK 436
           L   AY+ +++  VRL G+      V  +  D C+D  G  S+ +      +   FG G 
Sbjct: 320 LVDVAYDKVKEEIVRLVGSRLKKGYVYGSTADMCFD--GNHSMEIGRLIGDLVFEFGRGV 377

Query: 437 ALDLPAKNYLIPVDSAGTFCFAFAPTS---SALSIIGNVQQQGTRVSFDLANNRVGFTPN 493
            + +  ++ L+ V   G  C     +S   +A +IIGNV QQ   V FD+ N RVGF+  
Sbjct: 378 EILVEKQSLLVNV-GGGIHCVGIGRSSMLGAASNIIGNVHQQNLWVEFDVTNRRVGFSKA 436

Query: 494 KC 495
           +C
Sbjct: 437 EC 438


>gi|238011160|gb|ACR36615.1| unknown [Zea mays]
          Length = 461

 Score =  157 bits (398), Expect = 8e-36,   Method: Compositional matrix adjust.
 Identities = 126/427 (29%), Positives = 198/427 (46%), Gaps = 78/427 (18%)

Query: 142 EDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTEC-------- 193
           E F+ P+ SGA  G+G+YF R  VGTP R F +V DTGSD+ W++CR             
Sbjct: 38  EAFAMPLSSGAYTGTGQYFVRFRVGTPARPFLLVADTGSDLTWVKCRRHAAPAPAPAPAP 97

Query: 194 ---YQQSDP-----------------IFDPKTSSSYSPLPCAAPQCKS---LDVSACR-- 228
              Y    P                 +F P  S +++P+PC++  C +     ++AC   
Sbjct: 98  GYNYGYGAPASNDSSSVSAAASSPARVFRPDRSRTWAPIPCSSDTCTASLPFSLAACPTP 157

Query: 229 ANRCLYQVAYGDGSFTVGDLVTETVSFGNSG----------SVKGIALGCGHDNEGL-FV 277
            + C Y+  Y DGS   G + T++ +   SG           ++G+ LGC     G  F+
Sbjct: 158 GSPCAYEYRYKDGSAARGTVGTDSATIALSGRRAGKKQRRAKLRGVVLGCTTSYTGESFL 217

Query: 278 GSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSP--ASGVLEFN------------- 319
            S G+L LG   +S   +  A      +YCLVD  +P  A+  L F              
Sbjct: 218 ASDGVLSLGYSNVSFASRAAARFGGRFSYCLVDHLAPRNATSYLTFGPNPAVSSASASRT 277

Query: 320 ----SARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVD 375
               SA    A   PL+ + ++  FY V + G SV G+ ++IP  ++++ +   GG I+D
Sbjct: 278 ACAGSAAAPGARQTPLLLDHRMRPFYAVAVNGVSVDGELLRIPRLVWDVQKG--GGAILD 335

Query: 376 CGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGL-----RSVRVPTVSL 430
            GT++T L + AY ++  +  +    L P   +  FD CY+++        +V VP +++
Sbjct: 336 SGTSLTVLVSPAYRAVVAALGKKLVGL-PRVAMDPFDYCYNWTSPLTGEDLAVAVPALAV 394

Query: 431 HFGAGKALDLPAKNYLIPVDSA-GTFCFAFAPTS-SALSIIGNVQQQGTRVSFDLANNRV 488
           HF     L  P K+Y+I  D+A G  C          +S+IGN+ QQ     FDL N R+
Sbjct: 395 HFAGSARLQPPPKSYVI--DAAPGVKCIGLQEGDWPGVSVIGNILQQEHLWEFDLKNRRL 452

Query: 489 GFTPNKC 495
            F  ++C
Sbjct: 453 RFKRSRC 459


>gi|413947545|gb|AFW80194.1| hypothetical protein ZEAMMB73_386053 [Zea mays]
          Length = 456

 Score =  157 bits (398), Expect = 8e-36,   Method: Compositional matrix adjust.
 Identities = 123/363 (33%), Positives = 170/363 (46%), Gaps = 37/363 (10%)

Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDP--IFDPKTSSSYSPLPCA 215
           EY   + VGTPP Q   + DTGSD+ W+ C         SD   +F P  S++YS L C 
Sbjct: 99  EYLMYVNVGTPPAQMLAIADTGSDLVWVNCSSNGGGGGASDGAVVFHPSRSTTYSLLSCQ 158

Query: 216 APQCKSLDVSACRAN-RCLYQVAYGDGSFTVGDLVTETVSFGNSGS-------VKGIALG 267
           +  C++L  ++C A+  C YQ AYGDGS T+G L TET SF  +G        V  ++ G
Sbjct: 159 SAACQALSQASCDADSECQYQYAYGDGSRTIGVLSTETFSFAAAGGGGEGQVRVPRVSFG 218

Query: 268 CGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS-----LAYCLVDRDSPA--SGVLEFNS 320
           C   + G F  S GL+GLG G LSL  Q+ A +      +YCLV   + A  S  L F +
Sbjct: 219 CSTGSAGSFR-SDGLVGLGAGALSLVSQLGAAARIARRFSYCLVPPYAAANSSSTLSFGA 277

Query: 321 ---ARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCG 377
                   A + PL+ + +VD++Y V L   +V GQ         ++  A    IIVD G
Sbjct: 278 RAVVSDPGAASTPLVPS-EVDSYYTVALESVAVAGQ---------DVASANSSRIIVDSG 327

Query: 378 TAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVR---VPTVSLHFGA 434
           T +T L       L     R     +      L   CYD  G        +P V+L FG 
Sbjct: 328 TTLTFLDPALLRPLVAELERRIRLPRAQPPEQLLQLCYDVQGKSQAEDFGIPDVTLRFGG 387

Query: 435 GKALDLPAKNYLIPVDSAGTFCFAFAPTSSA--LSIIGNVQQQGTRVSFDLANNRVGFTP 492
           G ++ L  +N    ++  GT C    P S +  +SI+GN+ QQ   V +DL    V F  
Sbjct: 388 GASVTLRPENTFSLLEE-GTLCLVLVPVSESQPVSILGNIAQQNFHVGYDLDARTVTFAA 446

Query: 493 NKC 495
             C
Sbjct: 447 VDC 449


>gi|110739922|dbj|BAF01866.1| chloroplast nucleoid DNA binding protein like [Arabidopsis
           thaliana]
          Length = 142

 Score =  157 bits (398), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 77/139 (55%), Positives = 99/139 (71%), Gaps = 1/139 (0%)

Query: 357 IPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYD 416
           +  SLF++D+ G+GG+I+D GT++TRL   AY ++RD+F   A  LK     +LFDTC+D
Sbjct: 4   VTASLFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPDFSLFDTCFD 63

Query: 417 FSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQG 476
            S +  V+VPTV LHF  G  + LPA NYLIPVD+ G FCFAFA T   LSIIGN+QQQG
Sbjct: 64  LSNMNEVKVPTVVLHF-RGADVSLPATNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQG 122

Query: 477 TRVSFDLANNRVGFTPNKC 495
            RV +DLA++RVGF P  C
Sbjct: 123 FRVVYDLASSRVGFAPGGC 141


>gi|356539818|ref|XP_003538390.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 457

 Score =  157 bits (397), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 117/359 (32%), Positives = 165/359 (45%), Gaps = 37/359 (10%)

Query: 165 VGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCK---- 220
           +GTPP+   MVLDTGS ++W+QC             FDP  SS++S LPC  P CK    
Sbjct: 103 IGTPPQVQPMVLDTGSQLSWIQCHKKAPAKPPPTASFDPSLSSTFSTLPCTHPVCKPRIP 162

Query: 221 --SLDVSACRANR-CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFV 277
             +L  S C  NR C Y   Y DG++  G+LV E  +F  S     + LGC  ++     
Sbjct: 163 DFTLPTS-CDQNRLCHYSYFYADGTYAEGNLVREKFTFSRSLFTPPLILGCATES----T 217

Query: 278 GSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNKKV 337
              G+LG+  G LS   Q K T  +YC+  R +   G     S   G    +   R  ++
Sbjct: 218 DPRGILGMNRGRLSFASQSKITKFSYCVPTRVT-RPGYTPTGSFYLGHNPNSNTFRYIEM 276

Query: 338 DTF-------------YYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQ 384
            TF             Y V L G  +GG+ + I P++F  D  G G  ++D G+  T L 
Sbjct: 277 LTFARSQRMPNLDPLAYTVALQGIRIGGRKLNISPAVFRADAGGSGQTMLDSGSEFTYLV 336

Query: 385 TQAYNSLRDSFVRLAG-NLKP---TSGVALFDTCYDFSGLRSVR-VPTVSLHFGAGKALD 439
            +AY+ +R   VR  G  +K      GVA  D C+D + +   R +  +   F  G  + 
Sbjct: 337 NEAYDKVRAEVVRAVGPRMKKGYVYGGVA--DMCFDGNAIEIGRLIGDMVFEFEKGVQIV 394

Query: 440 LPAKNYLIPVDSAGTFCFAFAPTS---SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           +P +  L  V+  G  C   A +    +A +IIGN  QQ   V FDL N R+GF    C
Sbjct: 395 VPKERVLATVE-GGVHCIGIANSDKLGAASNIIGNFHQQNLWVEFDLVNRRMGFGTADC 452


>gi|302790323|ref|XP_002976929.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
 gi|300155407|gb|EFJ22039.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
          Length = 373

 Score =  157 bits (397), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 120/370 (32%), Positives = 174/370 (47%), Gaps = 43/370 (11%)

Query: 165 VGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQC----K 220
           +GTPPR+  +++DT S++ W+Q   CT C     P F+P  SSS+   PC +  C    K
Sbjct: 5   IGTPPREVLLLVDTASELTWVQGTSCTNCSPTKVPPFNPGLSSSFISEPCTSSVCLGRSK 64

Query: 221 SLDVSACR--ANRCLYQVAYGDGSFTVGDLVTETVSF----GNSGSVKGIALGCGHDNEG 274
               SAC      C +QVAY DGS   G +  E  S     G + ++  +  GC   +  
Sbjct: 65  LGFQSACNRSTGSCSFQVAYLDGSEAYGVIAREIFSLQSWDGAASTLGDVIFGCASKDLQ 124

Query: 275 LFVG-SAGLLGLGGGMLSLTKQIKATS-------LAYCLVDRDSP--ASGVLEFNSARGG 324
             V  S+G LGL  G  S   QI + S        +YC  +R     +SGV+ F    G 
Sbjct: 125 RPVDFSSGTLGLNRGSFSFPAQIGSRSKSGLSDRFSYCFPNRAEHLNSSGVIIF----GD 180

Query: 325 DAVTAPLIRNKKVDT---------FYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVD 375
             + A   +   ++          FYYVGL G SVGG+ + IP S F++D  G+GG   D
Sbjct: 181 SGIPAHHFQYLSLEQEPPIASIVDFYYVGLQGISVGGELLHIPRSAFKIDRLGNGGTYFD 240

Query: 376 CGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALF-DTCYDFSG--LRSVRVPTVSLHF 432
            GT ++ L   A+ +L ++F R   +L  TSG     + CYD +    R    P V+LHF
Sbjct: 241 SGTTVSFLVEPAHTALVEAFGRRVLHLNRTSGSDFTKELCYDVAAGDARLPTAPLVTLHF 300

Query: 433 GAGKALDLPAKNYLIPV---DSAGTFCFAF----APTSSALSIIGNVQQQGTRVSFDLAN 485
                ++L   +  +P+       T C AF    A     +++IGN QQQ   +  DL  
Sbjct: 301 KNNVDMELREASVWVPLARTPQVVTICLAFVNAGAVAQGGVNVIGNYQQQDYLIEHDLER 360

Query: 486 NRVGFTPNKC 495
           +R+GF P  C
Sbjct: 361 SRIGFAPANC 370


>gi|302781726|ref|XP_002972637.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
 gi|300160104|gb|EFJ26723.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
          Length = 393

 Score =  156 bits (395), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 121/356 (33%), Positives = 179/356 (50%), Gaps = 35/356 (9%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAA 216
           G Y   I VGTP ++F  + DTGSD+ W+Q  PCT C      IFDP+ SS++  + C++
Sbjct: 53  GGYVMDISVGTPGKRFRAIADTGSDLVWVQSEPCTGC--SGGTIFDPRQSSTFREMDCSS 110

Query: 217 PQCKSLDVSACR--ANRCLYQVAYGDGSFTVGDLVTETVSFGNS--GSVK--GIALGCGH 270
             C  L  S C   ++ C Y   YG G  T G+   +T+S G +  GS K    A+GCG 
Sbjct: 111 QLCTELPGS-CEPGSSACSYSYEYGSGE-TEGEFARDTISLGTTSGGSQKFPSFAVGCGM 168

Query: 271 DNEGLFVGSAGLLGLGGGMLSLTKQIKA---TSLAYCLVDRDSPA-SGVLEF--NSARGG 324
            N G F G  GL+GLG G +SLT Q+ A   +  +YCLVD +S + S  L F  ++A  G
Sbjct: 169 VNSG-FDGVDGLVGLGQGPVSLTSQLSAAIDSKFSYCLVDINSQSESSPLLFGPSAALHG 227

Query: 325 DAVTAPLIR--NKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITR 382
             + +  I   +    T+Y + + G +V GQ +  P           G  I+D GT +T 
Sbjct: 228 TGIQSTKITPPSDTYPTYYLLTVNGIAVAGQTMGSP-----------GTTIIDSGTTLTY 276

Query: 383 LQTQAYNSLRDSFVRLAGNLKPTSGVAL-FDTCYDFSGLRSVRVPTVSLHFGAGKALDLP 441
           + +  Y  +      +   L    G ++  D CYD S  R+ + P +++   AG  +  P
Sbjct: 277 VPSGVYGRVLSRMESMV-TLPRVDGSSMGLDLCYDRSSNRNYKFPALTIRL-AGATMTPP 334

Query: 442 AKNYLIPVDSAG-TFCFAFAPTSS-ALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           + NY + VD +G T C A        +SIIGNV QQG  + +D  ++ + F   KC
Sbjct: 335 SSNYFLVVDDSGDTVCLAMGSAGGLPVSIIGNVMQQGYHILYDRGSSELSFVQAKC 390


>gi|15222357|ref|NP_174430.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|12322538|gb|AAG51267.1|AC027135_8 chloroplast nucleoid DNA binding protein, putative [Arabidopsis
           thaliana]
 gi|67633408|gb|AAY78629.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332193236|gb|AEE31357.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 445

 Score =  156 bits (395), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 112/375 (29%), Positives = 183/375 (48%), Gaps = 30/375 (8%)

Query: 146 TPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKT 205
           T + SG     GEYF  I +GTPP +   + DTGSD+ W+QC+PC +CY+Q+ P+FD K 
Sbjct: 72  TDLQSGLISNGGEYFMSISIGTPPSKVFAIADTGSDLTWVQCKPCQQCYKQNSPLFDKKK 131

Query: 206 SSSYSPLPCAAPQCKSL--DVSACRANR--CLYQVAYGDGSFTVGDLVTETVSFGNSGSV 261
           SS+Y    C +  C++L      C  ++  C Y+ +YGD SFT GD+ TET+S  +S   
Sbjct: 132 SSTYKTESCDSKTCQALSEHEEGCDESKDICKYRYSYGDNSFTKGDVATETISIDSSSGS 191

Query: 262 K----GIALGCGHDNEGLFVGSAGLLGLGGGM-LSLTKQIKAT---SLAYCLVDRDSPAS 313
                G   GCG++N G F  +   +   GG  LSL  Q+ ++     +YCL    +  +
Sbjct: 192 SVSFPGTVFGCGYNNGGTFEETGSGIIGLGGGPLSLVSQLGSSIGKKFSYCLSHTAATTN 251

Query: 314 GV---------LEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEM 364
           G          +  N ++    +T PLI+ K  +T+Y++ L   +VG   +      + +
Sbjct: 252 GTSVINLGTNSIPSNPSKDSATLTTPLIQ-KDPETYYFLTLEAVTVGKTKLPYTGGGYGL 310

Query: 365 DEAGD---GGIIVDCGTAITRLQTQAYNSLRDSFVR-LAGNLKPTSGVALFDTCYDFSGL 420
           +       G II+D GT +T L +  Y+    +    + G  + +    L   C+  SG 
Sbjct: 311 NGKSSKRTGNIIIDSGTTLTLLDSGFYDDFGTAVEESVTGAKRVSDPQGLLTHCFK-SGD 369

Query: 421 RSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVS 480
           + + +P +++HF     + L   N  + ++   T C +  PT+  ++I GN+ Q    V 
Sbjct: 370 KEIGLPAITMHF-TNADVKLSPINAFVKLNE-DTVCLSMIPTTE-VAIYGNMVQMDFLVG 426

Query: 481 FDLANNRVGFTPNKC 495
           +DL    V F    C
Sbjct: 427 YDLETKTVSFQRMDC 441


>gi|414878073|tpg|DAA55204.1| TPA: hypothetical protein ZEAMMB73_344109 [Zea mays]
          Length = 440

 Score =  156 bits (395), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 122/382 (31%), Positives = 177/382 (46%), Gaps = 36/382 (9%)

Query: 145 STPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTE--CYQQSDPIFD 202
           S PV    SQ   EY     +G PP+Q   ++DTGS++ W QC  C    C+ Q+   +D
Sbjct: 61  SAPVHWAESQYIAEYL----IGDPPQQAEAIIDTGSNLIWTQCSTCQPAGCFSQNLSFYD 116

Query: 203 PKTSSSYSPLPCAAPQCKSLDVSAC-RANR-CLYQVAYGDGSFTVGDLVTETVSFGNSGS 260
           P  S +  P+ C    C     + C R N+ C    AYG G    G L TE  +F     
Sbjct: 117 PSRSRTARPVACNDTACALGSETRCARDNKACAVLTAYGAGVIG-GVLGTEAFTFQPQSE 175

Query: 261 VKGIALGC---GHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPAS---- 313
              +A GC        G   G++G++GLG G LSL  Q+     +YCL    S ++    
Sbjct: 176 NVSLAFGCIAATRLTPGSLDGASGIIGLGRGNLSLVSQLGDNKFSYCLTPYFSQSTNTSR 235

Query: 314 ---GVLEFNSARGGDAVTAPLIRNKKVD---TFYYVGLTGFSVGGQAVQIPPSLFEMDEA 367
              G     S+ G  A + P ++N  VD   TFYY+ LTG +VG   + +P + F++ + 
Sbjct: 236 LFVGASAGLSSGGAPATSVPFLKNPDVDPFSTFYYLPLTGITVGDAKLAVPEAAFDLRQV 295

Query: 368 GDG---GIIVDCGTAITRLQTQAYNSLRDSFVRLAGN--LKPTSGVALFDTCYDFS-GLR 421
             G   G ++D G+  T L   AY +LRD  V+  G   + P +G    D C   + G  
Sbjct: 296 ATGLWAGTLIDSGSPFTSLVDVAYQALRDELVQQLGASIVPPPAGAEGLDLCAAVAHGDV 355

Query: 422 SVRVPTVSLHFGAGKA-LDLPAKNYLIPVDSAGTFCFAFA---PTSS----ALSIIGNVQ 473
              VP + LHFG+G   + +P +NY  PVD +      F+   P S+      +IIGN  
Sbjct: 356 GKLVPPLVLHFGSGGGDVAVPPENYWGPVDDSTACMVVFSSGGPNSTLPMNETTIIGNYM 415

Query: 474 QQGTRVSFDLANNRVGFTPNKC 495
           QQ   + +DL    + F P  C
Sbjct: 416 QQDMHLLYDLEKGMLSFQPADC 437


>gi|359494621|ref|XP_002265771.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 449

 Score =  156 bits (395), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 116/387 (29%), Positives = 175/387 (45%), Gaps = 45/387 (11%)

Query: 146 TPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQC------RPCTECYQ---Q 196
            P+   A  G G+Y     VGTP ++F +V DTGSD+ W+ C      R C+       +
Sbjct: 70  VPMHPAADYGIGQYSVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIR 129

Query: 197 SDPIFDPKTSSSYSPLPCAAPQCK-------SLDVSACRANRCLYQVAYGDGSFTVGDLV 249
              +F    SSS+  +PC    CK       SL         C Y   Y DGS  +G   
Sbjct: 130 HKRVFHANLSSSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTALGFFA 189

Query: 250 TETVSF----GNSGSVKGIALGCGHDNEGL-FVGSAGLLGLGGGMLSLTKQIKAT----- 299
            ETV+     G    +  + +GC    +G  F  + G++GLG    S    IKA      
Sbjct: 190 NETVTVELKEGRKMKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSFA--IKAAEKFGG 247

Query: 300 SLAYCLVDRDSP--ASGVLEFNSARGGDAVTAPLIRNK----KVDTFYYVGLTGFSVGGQ 353
             +YCLVD  S    S  L F S+R  +A+   +   +     V++FY V + G S+GG 
Sbjct: 248 KFSYCLVDHLSHKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIGGA 307

Query: 354 AVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYN----SLRDSFVRLAGNLKPTSGVA 409
            ++IP  ++  D  G GG I+D G+++T L   AY     +LR S ++     K    + 
Sbjct: 308 MLKIPSEVW--DVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFR---KVEMDIG 362

Query: 410 LFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSS-ALSI 468
             + C++ +G     VP +  HF  G   + P K+Y+I   + G  C  F   +    S+
Sbjct: 363 PLEYCFNSTGFEESLVPRLVFHFADGAEFEPPVKSYVISA-ADGVRCLGFVSVAWPGTSV 421

Query: 469 IGNVQQQGTRVSFDLANNRVGFTPNKC 495
           +GN+ QQ     FDL   ++GF P+ C
Sbjct: 422 VGNIMQQNHLWEFDLGLKKLGFAPSSC 448


>gi|359483137|ref|XP_002272278.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 402

 Score =  156 bits (395), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 100/276 (36%), Positives = 141/276 (51%), Gaps = 21/276 (7%)

Query: 232 CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLS 291
           C Y + YGDGSFT G+L  E + FG    VK    GCG +N+GLF G +GL+GLG   LS
Sbjct: 133 CNYAINYGDGSFTRGELGHEKLKFGTI-LVKDFIFGCGRNNKGLFGGVSGLMGLGRSDLS 191

Query: 292 LTKQ---IKATSLAYCL--VDRDSPASGVLEFNSA--RGGDAVT-APLIRNKKVDTFYYV 343
           L  Q   I     +YCL   +R    S +L  NS+  R    ++ A +I N ++  FY++
Sbjct: 192 LISQTSGIFGGVFSYCLPSTERKGSGSLILGGNSSVYRNSSPISYAKMIENPQLYNFYFI 251

Query: 344 GLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLK 403
            LTG S+GG A+Q P         G   I+VD GT ITRL    Y +L+  F++      
Sbjct: 252 NLTGISIGGVALQAP-------SVGPSRILVDSGTVITRLPPTIYKALKAEFLKQFTGFP 304

Query: 404 PTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKAL--DLPAKNYLIPVDSAGTFCFAFAP 461
           P    ++ DTC++ S  + V +PT+ +HF     L  D+    Y +  D A   C A A 
Sbjct: 305 PAPAFSILDTCFNLSAYQEVDIPTIKMHFEGNAELTVDVTGVFYFVKSD-ASQVCLALAS 363

Query: 462 TS--SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
                 ++I+GN QQ+  RV +D    +VGF    C
Sbjct: 364 LEYQDEVAILGNYQQKNLRVIYDTKETKVGFALETC 399


>gi|125536419|gb|EAY82907.1| hypothetical protein OsI_38120 [Oryza sativa Indica Group]
          Length = 448

 Score =  156 bits (395), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 108/312 (34%), Positives = 153/312 (49%), Gaps = 22/312 (7%)

Query: 153 SQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPL 212
           SQ  G+Y  +  +G PP      +DTGSD+ W++C PC  C     P++DP  S S   L
Sbjct: 81  SQKGGKYIMQFSIGEPPLLIWAEVDTGSDLMWVKCSPCNGCNPPPSPLYDPARSRSSGKL 140

Query: 213 PCAAPQCKSLDVSACRANRCL-------YQVAYGDGS--FTVGDLVTETVSFGNSGSVKG 263
           PC++  C++L      +++C        Y  AYG      T G L TET +FG+      
Sbjct: 141 PCSSQLCQALGRGRIISDQCSDDPPLCGYHYAYGHSGDHSTQGVLGTETFTFGDGYVANN 200

Query: 264 IALGCGHDNEG-LFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNS-- 320
           ++ G     +G  F G+AGL+GLG G LSL  Q+ A   AYCL    +  S +L F S  
Sbjct: 201 VSFGRSDTIDGSQFGGTAGLVGLGRGHLSLVSQLGAGRFAYCLAADPNVYSTIL-FGSLA 259

Query: 321 ---ARGGDAVTAPLIRNKK--VDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVD 375
                 GD  + PL+ N K   DT YYV L G SVGG  + I    F ++  G GG+  D
Sbjct: 260 ALDTSAGDVSSTPLVTNPKPDRDTHYYVNLQGISVGGSRLPIKDGTFAINSDGSGGVFFD 319

Query: 376 CGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSV-RVPTVSLHFGA 434
            G   T L+  AY  +R +       L   +G    DTC+  +  ++V ++P + LHF  
Sbjct: 320 SGAIDTSLKDAAYQVVRQAITSEIQRLGYDAG---DDTCFVAANQQAVAQMPPLVLHFDD 376

Query: 435 GKALDLPAKNYL 446
           G  + L  +NYL
Sbjct: 377 GADMSLNGRNYL 388


>gi|115473845|ref|NP_001060521.1| Os07g0658600 [Oryza sativa Japonica Group]
 gi|22775625|dbj|BAC15479.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
 gi|50510141|dbj|BAD31106.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
 gi|113612057|dbj|BAF22435.1| Os07g0658600 [Oryza sativa Japonica Group]
          Length = 449

 Score =  156 bits (394), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 119/351 (33%), Positives = 160/351 (45%), Gaps = 25/351 (7%)

Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQ 218
           Y  R  +GTP +Q  + +DT +D  W+ C  C  C   S   F+P  S+SY P+PC +PQ
Sbjct: 107 YVVRARLGTPAQQLLLAVDTSNDAAWIPCSGCAGCPTSSP--FNPAASASYRPVPCGSPQ 164

Query: 219 CKSLDVSACRAN--RCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLF 276
           C      +C  N   C + ++Y D S     L  +T++      VK    GC     G  
Sbjct: 165 CVLAPNPSCSPNAKSCGFSLSYADSSLQAA-LSQDTLAVAGD-VVKAYTFGCLQRATGTA 222

Query: 277 V---GSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPA-SGVLEFNSARGGDA---VTA 329
               G  GL       LS TK +   + +YCL    S   SG L     R G      T 
Sbjct: 223 APPQGLLGLGRGPLSFLSQTKDMYGATFSYCLPSFKSLNFSGTLRLG--RNGQPRRIKTT 280

Query: 330 PLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYN 389
           PL+ N    + YYV +TG  VG + V IP S    D A   G ++D GT  TRL    Y 
Sbjct: 281 PLLANPHRSSLYYVNMTGIRVGKKVVSIPASALAFDPATGAGTVLDSGTMFTRLVAPVYL 340

Query: 390 SLRDSFVRLAG-NLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIP 448
           +LRD   R  G      S +  FDTCY+     +V  P V+L F  G  + LP +N +I 
Sbjct: 341 ALRDEVRRRVGAGAAAVSSLGGFDTCYN----TTVAWPPVTLLFD-GMQVTLPEENVVIH 395

Query: 449 VDSAGTFCFAFAP----TSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
                T C A A      ++ L++I ++QQQ  RV FD+ N RVGF    C
Sbjct: 396 TTYGTTSCLAMAAAPDGVNTVLNVIASMQQQNHRVLFDVPNGRVGFARESC 446


>gi|302781476|ref|XP_002972512.1| hypothetical protein SELMODRAFT_441822 [Selaginella moellendorffii]
 gi|300159979|gb|EFJ26598.1| hypothetical protein SELMODRAFT_441822 [Selaginella moellendorffii]
          Length = 496

 Score =  155 bits (393), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 114/373 (30%), Positives = 175/373 (46%), Gaps = 45/373 (12%)

Query: 162 RIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKS 221
           ++G+G+  +  S ++DTGS+   +QC        +S P+FDP  S SY  +PC +  C +
Sbjct: 103 QLGIGSLQKNLSAIIDTGSEAVLVQCGS------RSRPVFDPAASQSYRQVPCISQLCLA 156

Query: 222 LDVSACRANR---------CLYQVAYGDGSFTVGDL------VTETVSFGNSGSVKGIAL 266
           +       +          C Y ++YGD   + GD       +  T S G +   + +A 
Sbjct: 157 VQQQTSNGSSQPCVNSSATCTYSLSYGDSRNSTGDFSQDVIFLNSTNSSGQAVQFRDVAF 216

Query: 267 GCGHDNEGLFV--GSAGLLGLGGGMLSLTKQIK----ATSLAYCLVDR--DSPASGVLEF 318
           GC H  +G  V  GS G++G   G LSL  Q+K     +  +YC   +     A+GV+  
Sbjct: 217 GCAHSPQGFLVDLGSLGIVGFNRGNLSLPSQLKDRLGGSKFSYCFPSQPWQPRATGVIFL 276

Query: 319 NSARGGDAVTA--PLIRNKKV---DTFYYVGLTGFSVGGQAVQIPPSLFEMD-EAGDGGI 372
             +    +     PL+ N         YYVGLT  SV G+ + IP S F++D   GDGG 
Sbjct: 277 GDSGLSKSKVGYTPLLDNPVTPARSQLYYVGLTSISVDGKTLAIPESAFKLDPSTGDGGT 336

Query: 373 IVDCGTAITRLQTQAYNSLRDSFV--RLAGNLKPTSGVALFDTCYDFSGLRSV-RVPTVS 429
           ++D GT  TR+   AY + R++F     +G  K     A FD CY+ S   S+  VP V 
Sbjct: 337 VLDSGTTFTRVVDDAYTAFRNAFAASNRSGLRKKVGAAAGFDDCYNISAGSSLPGVPEVR 396

Query: 430 LHFGAGKALDLPAKNYLIPVDSAG---TFCFAFAPTSSA----LSIIGNVQQQGTRVSFD 482
           L       L+L  ++  +PV +AG   T C A   +  +    ++++GN QQ    V +D
Sbjct: 397 LSLQNNVRLELRFEHLFVPVSAAGNEVTVCLAILSSQKSGFGKINVLGNYQQSNYLVEYD 456

Query: 483 LANNRVGFTPNKC 495
              +RVGF    C
Sbjct: 457 NERSRVGFERADC 469


>gi|115465373|ref|NP_001056286.1| Os05g0557100 [Oryza sativa Japonica Group]
 gi|113579837|dbj|BAF18200.1| Os05g0557100 [Oryza sativa Japonica Group]
 gi|125553268|gb|EAY98977.1| hypothetical protein OsI_20935 [Oryza sativa Indica Group]
          Length = 494

 Score =  155 bits (393), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 120/404 (29%), Positives = 189/404 (46%), Gaps = 58/404 (14%)

Query: 144 FSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDP---- 199
           F+ P+ SGA  G+G+YF R  VGTP + F ++ DTGSD+ W++CR        +      
Sbjct: 95  FAMPLSSGAYTGTGQYFVRFRVGTPAQPFVLIADTGSDLTWVKCRGAASPSHATATASPA 154

Query: 200 -----------IFDPKTSSSYSPLPCAAPQCKS---LDVSACRANR--CLYQVAYGDGSF 243
                      +F P  S ++SP+PC++  CKS     ++ C ++   C Y   Y D S 
Sbjct: 155 AAPSPAVAPPRVFRPGDSKTWSPIPCSSETCKSTIPFSLANCSSSTAACSYDYRYNDNSA 214

Query: 244 TVGDLVTETVSFGNS------------GSVKGIALGC--GHDNEGLFVGSAGLLGLGGGM 289
             G + T++ +   S              ++G+ LGC   H  +G F  S G+L LG   
Sbjct: 215 ARGVVGTDSATVALSGGRGGGGGGDRKAKLQGVVLGCTTAHAGQG-FEASDGVLSLGYSN 273

Query: 290 LSLTKQIKAT---SLAYCLVDRDSPASGVLEFNSARGGDAVTA---------PLIRNKKV 337
           +S   +  +      +YCLVD  +P +         G DA ++         PL+ + +V
Sbjct: 274 ISFASRAASRFGGRFSYCLVDHLAPRNATSYLTFGAGPDAASSSAPAPGSRTPLLLDARV 333

Query: 338 DTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVR 397
             FY V +   SV G A+ IP  ++  D   +GG I+D GT++T L T AY ++  +   
Sbjct: 334 RPFYAVAVDSVSVDGVALDIPAEVW--DVGSNGGTIIDSGTSLTVLATPAYKAVVAALSE 391

Query: 398 LAGNLKPTSGVALFDTCYDFS----GLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSA- 452
               L P   +  FD CY+++    G   + VP +++ F     L+ PAK+Y+I  D+A 
Sbjct: 392 QLAGL-PRVAMDPFDYCYNWTARGDGGGDLAVPKLAVQFAGSARLEPPAKSYVI--DAAP 448

Query: 453 GTFCFAFAPTS-SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           G  C      +   +S+IGN+ QQ     FDL N  + F    C
Sbjct: 449 GVKCIGVQEGAWPGVSVIGNILQQEHLWEFDLNNRWLRFRQTSC 492


>gi|226492150|ref|NP_001146362.1| hypothetical protein precursor [Zea mays]
 gi|219886805|gb|ACL53777.1| unknown [Zea mays]
 gi|414878074|tpg|DAA55205.1| TPA: hypothetical protein ZEAMMB73_415404 [Zea mays]
          Length = 440

 Score =  155 bits (393), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 117/375 (31%), Positives = 172/375 (45%), Gaps = 38/375 (10%)

Query: 155 GSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCT-ECYQQSDPIFDPKTSSSYSPLP 213
           G  +Y +   +G PP++   ++DTGS++ W QC  C   C++Q+ P +DP  S +   + 
Sbjct: 67  GQSQYIAEYLIGDPPQRAEAIIDTGSNLIWTQCSRCRPTCFRQNLPYYDPSRSRAARAVG 126

Query: 214 CAAPQCKSLDVSACRANR--CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGC--- 268
           C    C     + C ++   C     YG G+   G L TE ++F  S +V  +  GC   
Sbjct: 127 CNDAACALGSETQCLSDNKTCAVVTGYGAGNI-AGTLATENLTF-QSETVS-LVFGCIVV 183

Query: 269 GHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLV----DRDSP------ASGVLEF 318
              + G   G++G++GLG G LSL  Q+  T  +YCL     D   P      AS  L  
Sbjct: 184 TKLSPGSLNGASGIIGLGRGKLSLPSQLGDTRFSYCLTPYFEDTIEPSHMVVGASAGLIN 243

Query: 319 NSARGGDAVTAPLIRNKKVD---TFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDG---GI 372
            SA      T P +R+   D   TFYY+ LTG + G   + +P + F++ +   G   G 
Sbjct: 244 GSASSTPVTTVPFVRSPSDDPFSTFYYLPLTGITAGKVKLAVPSAAFDLRQVAPGMWTGT 303

Query: 373 IVDCGTAITRLQTQAYNSLRDSFVRLAGN--LKPTSGVALFDTCYDFSGLRSVRVPTVSL 430
            +D G  +T L   AY +LR    R  G   ++P +G   FD C        + VP + L
Sbjct: 304 FIDSGAPLTSLVDVAYQALRAELARQLGAALVQPLAGTTGFDLCVALKDAERL-VPPLVL 362

Query: 431 HFGAGKA----LDLPAKNYLIPVDSAGTFCFAFAPTS------SALSIIGNVQQQGTRVS 480
           HFG G      L +P  NY  PVDSA      F+         +  ++IGN  QQ   V 
Sbjct: 363 HFGGGSGTGTDLVVPPANYWAPVDSATACMVVFSSVDRKSLPMNETTVIGNYMQQNMHVL 422

Query: 481 FDLANNRVGFTPNKC 495
           +DLA   + F P  C
Sbjct: 423 YDLAGGVLSFQPADC 437


>gi|357515189|ref|XP_003627883.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355521905|gb|AET02359.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 434

 Score =  155 bits (392), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 109/349 (31%), Positives = 161/349 (46%), Gaps = 18/349 (5%)

Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCA 215
           S  Y  +   GTPP+   + LDT SD  W+ C  C  C   S P F P  S+S+  + C 
Sbjct: 94  SPTYIVKAKFGTPPQTLLLALDTSSDAAWIPCSGCVGC-STSKP-FAPIKSTSFRNVSCG 151

Query: 216 APQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGL 275
           +P CK +    C  + C +   YG  S     +V +T++   +  + G   GC +   G 
Sbjct: 152 SPHCKQVPNPTCGGSACAFNFTYGSSSI-AASVVQDTLTLA-TDPIPGYTFGCVNKTTGS 209

Query: 276 FV---GSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPA-SGVLEFNSARGGDAVT-AP 330
                G  GL      +LS ++ +  ++ +YCL    S   SG L          +   P
Sbjct: 210 SAPQQGLLGLGRGPLSLLSQSQNLYKSTFSYCLPSFKSINFSGSLRLGPVYQPKRIKYTP 269

Query: 331 LIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNS 390
           L+RN +  + YYV L    VG + V IPP+    +     G I D GT  TRL    Y +
Sbjct: 270 LLRNPRRSSLYYVNLVAIKVGRKIVDIPPAALAFNPTTGAGTIFDSGTVFTRLAEPVYTA 329

Query: 391 LRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVD 450
           +R+ F R  G   P + +  FDTCY+      + VPT++  F +G  + LP  N +I   
Sbjct: 330 VRNEFRRRVGPKLPVTTLGGFDTCYNV----PIVVPTITFLF-SGMNVTLPPDNIVIHST 384

Query: 451 SAGTFCFAFA----PTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           +  T C A A      +S L++I N+QQQ  RV FD+ N+R+G     C
Sbjct: 385 AGSTTCLAMAGAPDNVNSVLNVIANMQQQNHRVLFDVPNSRIGIARELC 433


>gi|222637611|gb|EEE67743.1| hypothetical protein OsJ_25435 [Oryza sativa Japonica Group]
          Length = 396

 Score =  155 bits (392), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 119/351 (33%), Positives = 160/351 (45%), Gaps = 25/351 (7%)

Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQ 218
           Y  R  +GTP +Q  + +DT +D  W+ C  C  C   S   F+P  S+SY P+PC +PQ
Sbjct: 54  YVVRARLGTPAQQLLLAVDTSNDAAWIPCSGCAGCPTSSP--FNPAASASYRPVPCGSPQ 111

Query: 219 CKSLDVSACRAN--RCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLF 276
           C      +C  N   C + ++Y D S     L  +T++      VK    GC     G  
Sbjct: 112 CVLAPNPSCSPNAKSCGFSLSYADSSLQAA-LSQDTLAVAGD-VVKAYTFGCLQRATGTA 169

Query: 277 V---GSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPA-SGVLEFNSARGGDA---VTA 329
               G  GL       LS TK +   + +YCL    S   SG L     R G      T 
Sbjct: 170 APPQGLLGLGRGPLSFLSQTKDMYGATFSYCLPSFKSLNFSGTLRLG--RNGQPRRIKTT 227

Query: 330 PLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYN 389
           PL+ N    + YYV +TG  VG + V IP S    D A   G ++D GT  TRL    Y 
Sbjct: 228 PLLANPHRSSLYYVNMTGIRVGKKVVSIPASALAFDPATGAGTVLDSGTMFTRLVAPVYL 287

Query: 390 SLRDSFVRLAG-NLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIP 448
           +LRD   R  G      S +  FDTCY+     +V  P V+L F  G  + LP +N +I 
Sbjct: 288 ALRDEVRRRVGAGAAAVSSLGGFDTCYN----TTVAWPPVTLLFD-GMQVTLPEENVVIH 342

Query: 449 VDSAGTFCFAFAP----TSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
                T C A A      ++ L++I ++QQQ  RV FD+ N RVGF    C
Sbjct: 343 TTYGTTSCLAMAAAPDGVNTVLNVIASMQQQNHRVLFDVPNGRVGFARESC 393


>gi|357163818|ref|XP_003579856.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 467

 Score =  155 bits (392), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 115/416 (27%), Positives = 179/416 (43%), Gaps = 56/416 (13%)

Query: 129 RHELKPAEAQILPEDFSTPVVSGAS---QGSGEYFSRIGVGTPPRQFSMVLDTGSDINWL 185
           R  L     ++LP      VV   +      GEY  ++G+GTP   F+  +DT SD+ W 
Sbjct: 55  RDRLASIAPRLLPTSSRNKVVVAEAPVLSAGGEYLVKLGLGTPQHCFTAAIDTASDLIWT 114

Query: 186 QCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSAC-------RANRCLYQVAY 238
           QC+PC +CY+Q DP+F+P  S+SY+ +PC +  C  LD   C         + C Y  +Y
Sbjct: 115 QCQPCVKCYKQLDPVFNPVASTSYAVVPCNSDTCDELDTHRCARDGDSDDEDACQYTYSY 174

Query: 239 GDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGS----AGLLGLGGGMLSLTK 294
           G  + T G L  + ++ G+    +G+  GC   + G   G     +G++GLG G LSL  
Sbjct: 175 GGNATTRGILAVDRLAIGDD-VFRGVVFGCSSSSVG---GPPPQVSGVVGLGRGALSLVS 230

Query: 295 QIKATSLAYCLVDRDSPASGVLEFNS------ARGGDAVTAPLIRNKKVDTFYYVGLTGF 348
           Q+      YCL    S ++G L   +          + V  P+    +  ++YY+ L G 
Sbjct: 231 QLSVRRFMYCLPPPVSRSAGRLVLGADAAATVRNASERVVVPMSTGSRYPSYYYLNLDGI 290

Query: 349 SVGGQAVQIPPSLFEMDEAGDG--------------------------GIIVDCGTAITR 382
           S+G +A+    S   M+    G                          G+I+D  + IT 
Sbjct: 291 SIGDRAMSF-RSRNRMNATTPGTAAGAPASPVSGSGDGDGSGTGPDAYGMIIDIASTITF 349

Query: 383 LQTQAYNSLRDSF---VRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALD 439
           L+   Y  + D     +RL        G+ L     +   +  V  P VSL F  G  L 
Sbjct: 350 LEESLYEEMVDDLEEEIRLPRGSGSDLGLDLCFILPEGVPMSRVYAPPVSLAF-EGVWLR 408

Query: 440 LPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           L  +   +   ++G  C     T   +SI+GN QQQ  +V ++L   R+ F    C
Sbjct: 409 LDKEQMFVEDRASGMMCLMVGKT-DGVSILGNYQQQNMQVMYNLRRGRITFIKTAC 463


>gi|297798978|ref|XP_002867373.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297313209|gb|EFH43632.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 434

 Score =  155 bits (392), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 107/350 (30%), Positives = 167/350 (47%), Gaps = 21/350 (6%)

Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPC-AAP 217
           + + I +G PP    +++DTGSD+ W+QC PC +CY Q+ P F P  SS+Y    C +AP
Sbjct: 88  FLANISIGDPPVPQLLLIDTGSDLTWIQCLPC-KCYPQTIPFFHPSRSSTYRNASCESAP 146

Query: 218 QCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSG----SVKGIALGCGHDNE 273
                     +   C Y + Y D S T G L  E ++F  S     S   I  GCG DN 
Sbjct: 147 HAMPQIFRDEKTGNCRYHLRYRDFSNTRGILAKEKLTFQTSDEGLISKPNIVFGCGQDNS 206

Query: 274 GLFVGSAGLLGLGGGMLSLTKQIKATSLAYC---LVDRDSPASGVLEFNSAR-GGDAVTA 329
           G F   +G+LGLG G  S+  +   +  +YC   L+D   P + ++  N AR  GD    
Sbjct: 207 G-FTQYSGVLGLGPGTFSIVTRNFGSKFSYCFGSLIDPTYPHNFLILGNGARIEGDPTPL 265

Query: 330 PLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYN 389
            + +++     YY+ L   S+G + + I P +F+   +  GG ++D G + T L  +AY 
Sbjct: 266 QIFQDR-----YYLDLQAISLGEKLLDIEPGIFQRYRS-KGGTVIDTGCSPTILAREAYE 319

Query: 390 SLRDSFVRLAGNL--KPTSGVALFDTCYDFS-GLRSVRVPTVSLHFGAGKALDLPAKNYL 446
           +L +    L G +  +        + CY+ +  L     P V+ HF  G  L L  ++  
Sbjct: 320 TLSEEIDFLLGEVLRRVKDWEQYTNHCYEGNLKLDLYGFPVVTFHFAGGAELALDVESLF 379

Query: 447 IPVDSAGTFCFAFA-PTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           +  +S  +FC A    T   +S+IG + QQ   V ++L   +V F    C
Sbjct: 380 VSSESGDSFCLAMTMNTFDDMSVIGAMAQQNYNVGYNLRTMKVYFQRTDC 429


>gi|388495448|gb|AFK35790.1| unknown [Medicago truncatula]
          Length = 434

 Score =  155 bits (391), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 109/349 (31%), Positives = 161/349 (46%), Gaps = 18/349 (5%)

Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCA 215
           S  Y  +   GTPP+   + LDT SD  W+ C  C  C   S P F P  S+S+  + C 
Sbjct: 94  SPTYIVKAKFGTPPQTLLLALDTSSDAAWIPCSGCVGC-STSKP-FAPIKSTSFRNVSCG 151

Query: 216 APQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGL 275
           +P CK +    C  + C +   YG  S     +V +T++   +  + G   GC +   G 
Sbjct: 152 SPHCKQVPNPTCGGSACAFNFTYGSSSI-AASVVQDTLTLA-ADPIPGYTFGCVNKTTGS 209

Query: 276 FV---GSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPA-SGVLEFNSARGGDAVT-AP 330
                G  GL      +LS ++ +  ++ +YCL    S   SG L          +   P
Sbjct: 210 SAPQQGLLGLGRGPLSLLSQSQNLYKSTFSYCLPSFKSINFSGSLRLGPVYQPKRIKYTP 269

Query: 331 LIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNS 390
           L+RN +  + YYV L    VG + V IPP+    +     G I D GT  TRL    Y +
Sbjct: 270 LLRNPRRSSLYYVNLVAIKVGRKIVDIPPAALAFNPTTGAGTIFDSGTVFTRLAEPVYTA 329

Query: 391 LRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVD 450
           +R+ F R  G   P + +  FDTCY+      + VPT++  F +G  + LP  N +I   
Sbjct: 330 VRNEFRRRVGPKLPVTTLGGFDTCYNV----PIVVPTITFLF-SGMNVALPPDNIVIHST 384

Query: 451 SAGTFCFAFA----PTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           +  T C A A      +S L++I N+QQQ  RV FD+ N+R+G     C
Sbjct: 385 AGSTTCLAMAGAPDNVNSVLNVIANMQQQNHRVLFDVPNSRIGIARELC 433


>gi|115466078|ref|NP_001056638.1| Os06g0121500 [Oryza sativa Japonica Group]
 gi|113594678|dbj|BAF18552.1| Os06g0121500 [Oryza sativa Japonica Group]
          Length = 442

 Score =  154 bits (389), Expect = 9e-35,   Method: Compositional matrix adjust.
 Identities = 109/330 (33%), Positives = 144/330 (43%), Gaps = 61/330 (18%)

Query: 174 MVLDTGSDINWLQCRPCT--ECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDV--SACRA 229
           M +DT  D+ W+QC PC   ECY Q + +FDP+ S + + +PC +  C  L    + C  
Sbjct: 166 MSIDTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGELGRYGAGCSN 225

Query: 230 NRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGM 289
           N+C Y V YGDG  T G  + + ++   S  V     GC H   G F  S       G M
Sbjct: 226 NQCQYFVDYGDGRATSGTYMVDALTLNPSTVVMNFRFGCSHAVRGNFSASTS-----GTM 280

Query: 290 LSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNKKV-DTFYYVGLTGF 348
            + T                                    PL+RN  +  T Y V L G 
Sbjct: 281 FART------------------------------------PLVRNPSIIPTLYLVRLRGI 304

Query: 349 SVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVR-LAGNLKPTSG 407
            VGG+ + +PP +F       GG ++D    IT+L   AY +LR +F   +A   +   G
Sbjct: 305 EVGGRRLNVPPVVFA------GGAVMDSSVIITQLPPTAYRALRLAFRSAMAAYPRVAGG 358

Query: 408 VALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSS--A 465
            A  DTCYDF    SV VP VSL F  G  + L A   ++        C AF PT    A
Sbjct: 359 RAGLDTCYDFVRFTSVTVPAVSLVFDGGAVVRLDAMGVMV------EGCLAFVPTPGDFA 412

Query: 466 LSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           L  IGNVQQQ   V +D+    VGF    C
Sbjct: 413 LGFIGNVQQQTHEVLYDVVGGSVGFRRGAC 442


>gi|212275143|ref|NP_001130306.1| uncharacterized protein LOC100191400 precursor [Zea mays]
 gi|194688798|gb|ACF78483.1| unknown [Zea mays]
 gi|194703430|gb|ACF85799.1| unknown [Zea mays]
 gi|194707192|gb|ACF87680.1| unknown [Zea mays]
 gi|223944599|gb|ACN26383.1| unknown [Zea mays]
 gi|223948667|gb|ACN28417.1| unknown [Zea mays]
 gi|414887962|tpg|DAA63976.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
          Length = 450

 Score =  154 bits (389), Expect = 9e-35,   Method: Compositional matrix adjust.
 Identities = 117/350 (33%), Positives = 160/350 (45%), Gaps = 27/350 (7%)

Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQ 218
           Y  R  +GTPP+Q  + +DT +D +W+ C  C  C   S   FDP +S+SY  +PC +P 
Sbjct: 112 YVVRASLGTPPQQLLLAVDTSNDASWIPCAGCAGCPTSSAAPFDPASSASYRTVPCGSPL 171

Query: 219 CKSLDVSACR--ANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLF 276
           C     +AC      C + + Y D S             GN  +VK    GC     G  
Sbjct: 172 CAQAPNAACPPGGKACGFSLTYADSSLQAALSQDSLAVAGN--AVKAYTFGCLQRATGTA 229

Query: 277 V---GSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPA-SGVLEFNSARGGDA---VTA 329
               G  GL       LS TK +   + +YCL    S   SG L     R G      T 
Sbjct: 230 APPQGLLGLGRGPLSFLSQTKDMYEATFSYCLPSFKSLNFSGTLRLG--RNGQPQRIKTT 287

Query: 330 PLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYN 389
           PL+ N    + YYV +TG  VG + V IP      D A   G ++D GT  TRL   AY 
Sbjct: 288 PLLANPHRSSLYYVNMTGIRVGRKVVPIP----AFDPATGAGTVLDSGTMFTRLVAPAYV 343

Query: 390 SLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPV 449
           ++RD   R  G   P S +  FDTC++ +   +V  P V+L F  G  + LP +N +I  
Sbjct: 344 AVRDEVRRRVG--APVSSLGGFDTCFNTT---AVAWPPVTLLFD-GMQVTLPEENVVIHS 397

Query: 450 DSAGTFCFAFAP----TSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
                 C A A      ++ L++I ++QQQ  RV FD+ N RVGF   +C
Sbjct: 398 TYGTISCLAMAAAPDGVNTVLNVIASMQQQNHRVLFDVPNGRVGFARERC 447


>gi|168064205|ref|XP_001784055.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162664441|gb|EDQ51161.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 459

 Score =  154 bits (389), Expect = 9e-35,   Method: Compositional matrix adjust.
 Identities = 120/398 (30%), Positives = 188/398 (47%), Gaps = 50/398 (12%)

Query: 128 DRHELKPAEAQILPEDFSTPVVSGASQG--SGEYFSRIGVGTPPRQFSMVLDTGSDINWL 185
           D+  L+    +ILPE  + P+ SG      +G Y++RI +GTPP+QF + +DTGSD+ W+
Sbjct: 20  DQRRLR----RILPEVVAFPI-SGDDDTFTTGLYYTRIYLGTPPQQFYVHVDTGSDVAWV 74

Query: 186 QCRPCTECYQQSD-----PIFDPKTSSSYSPLPCAAPQCKSLDVSACRAN--RCLYQVAY 238
            C PCT C + S+      IFDP+ S+S + + C   +C     S C  N   C Y   Y
Sbjct: 75  NCVPCTNCKRASNVALPISIFDPEKSTSKTSISCTDEECYLASNSKCSFNSMSCPYSTLY 134

Query: 239 GDGSFTVGDLVTETVSF-----GNSGSVKGIA---LGCGHDNEGLFVGSAGLLGLGGGML 290
           GDGS T G L+ + +SF     GNS +  G A    GCG +  G ++ + GL+G G   +
Sbjct: 135 GDGSSTAGYLINDVLSFNQVPSGNSTATSGTARLTFGCGSNQTGTWL-TDGLVGFGQAEV 193

Query: 291 SLTKQIKATSL-----AYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGL 345
           SL  Q+   ++     A+CL   D+  SG L     R    V  P++  +   + Y V L
Sbjct: 194 SLPSQLSKQNVSVNIFAHCL-QGDNKGSGTLVIGHIREPGLVYTPIVPKQ---SHYNVEL 249

Query: 346 TGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPT 405
               V G  V  P +    D +  GG+I+D GT +T L   AY+  +         ++  
Sbjct: 250 LNIGVSGTNVTTPTAF---DLSNSGGVIMDSGTTLTYLVQPAYDQFQ-------AKVRDC 299

Query: 406 SGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYL---IPVDSAGTFCFAFAPT 462
               +    + F        P V+L+F  G A+ L   +YL   +       +CF++  +
Sbjct: 300 MRSGVLPVAFQFFCTIEGYFPNVTLYFAGGAAMLLSPSSYLYKEMLTTGLSAYCFSWLES 359

Query: 463 SS-----ALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           +S     + +I G+   +   V +D  NNR+G+    C
Sbjct: 360 TSVYGYLSYTIFGDNVLKDQLVVYDNVNNRIGWKNFDC 397


>gi|326501496|dbj|BAK02537.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 476

 Score =  154 bits (389), Expect = 9e-35,   Method: Compositional matrix adjust.
 Identities = 124/401 (30%), Positives = 188/401 (46%), Gaps = 54/401 (13%)

Query: 144 FSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQC-RPCTECYQQSDP--- 199
           F  P+ SGA  G G+YF R  VGTP + F +V DTGSD+ W++C RP     +       
Sbjct: 79  FEMPLTSGAYTGIGQYFVRFRVGTPAQPFLLVADTGSDLTWVKCRRPAANSSESGSGSGR 138

Query: 200 IFDPKTSSSYSPLPCAAPQC-KSLDVSACR----ANRCLYQVAYGDGSFTVGDLVTETVS 254
            F P+ S +++P+ CA+  C KSL  S        + C Y   Y DGS   G + TE+ +
Sbjct: 139 AFRPEDSRTWAPISCASDTCTKSLPFSLATCPTPGSPCAYDYRYKDGSAARGTVGTESAT 198

Query: 255 FGNSG--------SVKGIALGCGHDNEG-LFVGSAGLLGLGGGMLSLTKQIK---ATSLA 302
              SG         +KG+ LGC     G  F  S G+L LG   +S         A   +
Sbjct: 199 IALSGRGREERKAKLKGLVLGCTSSYTGPSFEVSDGVLSLGYSDVSFASHAASRFAGRFS 258

Query: 303 YCLVDRDSP--ASGVLEFN-----------------------SARGGDAVTAPLIRNKKV 337
           YCLVD  SP  A+  L F                              A   PL+ ++++
Sbjct: 259 YCLVDHLSPRNATSYLTFGPNPAVASSSSPSSPAPASCTAAAPRPRPRARQTPLLLDRRM 318

Query: 338 DTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVR 397
             FY V +   SV GQ ++IP +++++D    GG+I+D GT++T L   AY ++  +   
Sbjct: 319 RPFYDVAVKAVSVAGQFLKIPRAVWDVDAG--GGVILDSGTSLTVLAKPAYRAVVAALSE 376

Query: 398 LAGNLKPTSGVALFDTCYDFSGLRS-VRVPTVSLHFGAGKALDLPAKNYLIPVDSA-GTF 455
               L P   +  F+ CY+++     V +P +++HF     L+ P K+Y+I  D+A G  
Sbjct: 377 GLAGL-PRVTMDPFEYCYNWTSPSGDVTLPKMAVHFAGAARLEPPGKSYVI--DAAPGVK 433

Query: 456 CFAFAP-TSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           C          +S+IGN+ QQ     FD+ N R+ F  ++C
Sbjct: 434 CIGLQEGPWPGISVIGNILQQEHLWEFDIKNRRLKFQRSRC 474


>gi|55296886|dbj|BAD68338.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
           sativa Japonica Group]
 gi|55296941|dbj|BAD68392.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
           sativa Japonica Group]
          Length = 424

 Score =  154 bits (389), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 109/330 (33%), Positives = 144/330 (43%), Gaps = 61/330 (18%)

Query: 174 MVLDTGSDINWLQCRPCT--ECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDV--SACRA 229
           M +DT  D+ W+QC PC   ECY Q + +FDP+ S + + +PC +  C  L    + C  
Sbjct: 148 MSIDTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGELGRYGAGCSN 207

Query: 230 NRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGM 289
           N+C Y V YGDG  T G  + + ++   S  V     GC H   G F  S       G M
Sbjct: 208 NQCQYFVDYGDGRATSGTYMVDALTLNPSTVVMNFRFGCSHAVRGNFSASTS-----GTM 262

Query: 290 LSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNKKV-DTFYYVGLTGF 348
            + T                                    PL+RN  +  T Y V L G 
Sbjct: 263 FART------------------------------------PLVRNPSIIPTLYLVRLRGI 286

Query: 349 SVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVR-LAGNLKPTSG 407
            VGG+ + +PP +F       GG ++D    IT+L   AY +LR +F   +A   +   G
Sbjct: 287 EVGGRRLNVPPVVFA------GGAVMDSSVIITQLPPTAYRALRLAFRSAMAAYPRVAGG 340

Query: 408 VALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSS--A 465
            A  DTCYDF    SV VP VSL F  G  + L A   ++        C AF PT    A
Sbjct: 341 RAGLDTCYDFVRFTSVTVPAVSLVFDGGAVVRLDAMGVMV------EGCLAFVPTPGDFA 394

Query: 466 LSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           L  IGNVQQQ   V +D+    VGF    C
Sbjct: 395 LGFIGNVQQQTHEVLYDVGGGSVGFRRGAC 424


>gi|55296937|dbj|BAD68388.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
           sativa Japonica Group]
 gi|218197467|gb|EEC79894.1| hypothetical protein OsI_21421 [Oryza sativa Indica Group]
          Length = 424

 Score =  154 bits (389), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 109/330 (33%), Positives = 144/330 (43%), Gaps = 61/330 (18%)

Query: 174 MVLDTGSDINWLQCRPCT--ECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDV--SACRA 229
           M +DT  D+ W+QC PC   ECY Q + +FDP+ S + + +PC +  C  L    + C  
Sbjct: 148 MSIDTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGELGRYGAGCSN 207

Query: 230 NRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGM 289
           N+C Y V YGDG  T G  + + ++   S  V     GC H   G F  S       G M
Sbjct: 208 NQCQYFVDYGDGRATSGTYMVDALTLNPSTVVMNFRFGCSHAVRGNFSASTS-----GTM 262

Query: 290 LSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNKKV-DTFYYVGLTGF 348
            + T                                    PL+RN  +  T Y V L G 
Sbjct: 263 FART------------------------------------PLVRNPSIIPTLYLVRLRGI 286

Query: 349 SVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVR-LAGNLKPTSG 407
            VGG+ + +PP +F       GG ++D    IT+L   AY +LR +F   +A   +   G
Sbjct: 287 EVGGRRLNVPPVVFA------GGAVMDSSVIITQLPPTAYRALRLAFRSAMAAYPRVAGG 340

Query: 408 VALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSS--A 465
            A  DTCYDF    SV VP VSL F  G  + L A   ++        C AF PT    A
Sbjct: 341 RAGLDTCYDFVRFTSVTVPAVSLVFDGGAVVRLDAMGVMV------EGCLAFVPTPGDFA 394

Query: 466 LSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           L  IGNVQQQ   V +D+    VGF    C
Sbjct: 395 LGFIGNVQQQTHEVLYDVVGGSVGFRRGAC 424


>gi|357460823|ref|XP_003600693.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355489741|gb|AES70944.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 431

 Score =  154 bits (389), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 117/362 (32%), Positives = 171/362 (47%), Gaps = 47/362 (12%)

Query: 165 VGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPI--FDPKTSSSYSPLPCAAPQCK-- 220
           +GTPP+   MVLDTGS ++W+QC      +++  P   FDP  SS++S LPC  P CK  
Sbjct: 81  IGTPPQTQPMVLDTGSQLSWIQC------HKKQPPTASFDPSLSSTFSILPCTHPLCKPR 134

Query: 221 ----SLDVSACRANR-CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGL 275
               +L  S C  NR C Y   Y DG++  G+LV E  +F  S S   + LGC  ++   
Sbjct: 135 IPDFTLPTS-CDQNRLCHYSYFYADGTYAEGNLVREKFTFSRSVSTPPLILGCATES--- 190

Query: 276 FVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDS-----PASGVLEFN--SARGGDAVT 328
                G+LG+  G LS  KQ K T  +YC+  R +     P       N  S++G   V 
Sbjct: 191 -TDPRGILGMNLGRLSFAKQSKITKFSYCVPPRQTRPGFTPTGSFYLGNNPSSKGFKYVG 249

Query: 329 APLIRNKKVDTF----YYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQ 384
                 +++  F    Y + + G  + G+ + I P++F  D  G G  ++D G+  T L 
Sbjct: 250 MMTSSRQRMPNFDPLAYTIPMVGIRIAGKKLNISPAVFRADAGGSGQTMIDSGSEFTYLV 309

Query: 385 TQAYNSLRDSFVRLAG-NLKP---TSGVALFDTCYDFSGLRSVR----VPTVSLHFGAGK 436
           ++AY+ +R   VR  G  LK      GVA  D C+D   +++V     +  +   F  G 
Sbjct: 310 SEAYDKVRAQVVRAVGPRLKKGYVYGGVA--DMCFD--SVKAVEIGRLIGEMVFEFERGV 365

Query: 437 ALDLPAKNYLIPVDSAGTFCFAFAPTS---SALSIIGNVQQQGTRVSFDLANNRVGFTPN 493
            + +P +  L  V   G  C     +    +A +IIGN  QQ   V FDL   RVGF   
Sbjct: 366 EVVIPKERVLADV-GGGVHCVGIGSSDKLGAASNIIGNFHQQNLWVEFDLVRRRVGFGKA 424

Query: 494 KC 495
            C
Sbjct: 425 DC 426


>gi|225449446|ref|XP_002283126.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 436

 Score =  154 bits (389), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 118/369 (31%), Positives = 171/369 (46%), Gaps = 46/369 (12%)

Query: 163 IGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCK-- 220
           + VG+PP+  +MVLDTGS+++WL C+     +     +FDP  SSSYSP+PC +P C+  
Sbjct: 67  LTVGSPPQTVTMVLDTGSELSWLHCKKAPNLHS----VFDPLRSSSYSPIPCTSPTCRTR 122

Query: 221 ----SLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHD----N 272
               S+ VS  +   C   ++Y D S   G+L ++T   GNS ++     GC       N
Sbjct: 123 TRDFSIPVSCDKKKLCHAIISYADASSIEGNLASDTFHIGNS-AIPATIFGCMDSGFSSN 181

Query: 273 EGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGD------- 325
                 + GL+G+  G LS   Q+     +YC+  +DS  SG+L F  +           
Sbjct: 182 SDEDSKTTGLIGMNRGSLSFVTQMGLQKFSYCISGQDS--SGILLFGESSFSWLKALKYT 239

Query: 326 ---AVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITR 382
               ++ PL    +V   Y V L G  V    +Q+P S++  D  G G  +VD GT  T 
Sbjct: 240 PLVQISTPLPYFDRVA--YTVQLEGIKVANSMLQLPKSVYAPDHTGAGQTMVDSGTQFTF 297

Query: 383 LQTQAYNSLRDSFVR-LAGNLKPTSGVAL-----FDTCYDFSGLRSVR--VPTVSLHF-G 433
           L    Y +L++ FVR    +LK             D CY     R     +PTV+L F G
Sbjct: 298 LLGPVYTALKNEFVRQTKASLKVLEDPNFVFQGAMDLCYRVPLTRRTLPPLPTVTLMFRG 357

Query: 434 AGKALDLPAKNYLIP---VDSAGTFCFAFAPTSSALS----IIGNVQQQGTRVSFDLANN 486
           A  ++      Y +P     S   +CF F   S  L     IIG+  QQ   + FDLA +
Sbjct: 358 AEMSVSAERLMYRVPGVIRGSDSVYCFTFG-NSELLGVESYIIGHHHQQNVWMEFDLAKS 416

Query: 487 RVGFTPNKC 495
           RVGF   +C
Sbjct: 417 RVGFAEVRC 425


>gi|296085638|emb|CBI29432.3| unnamed protein product [Vitis vinifera]
          Length = 337

 Score =  154 bits (388), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 81/214 (37%), Positives = 124/214 (57%), Gaps = 12/214 (5%)

Query: 106 LERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGV 165
           L  D ARV TL ++L        +  L   + +  P+  S P+  GAS GSG Y+ ++G 
Sbjct: 66  LAWDDARVKTLNSRLTRKDTRFPKSVLTKKDIR-FPKSVSVPLNPGASIGSGNYYVKVGF 124

Query: 166 GTPPRQFSMVLDTGSDINWLQCRPC-TECYQQSDPIFDPKTSSSYSPLPCAAPQCKSL-- 222
           G+P R +SM++DTGS ++WLQC+PC   C+ Q+DP+FDP  S +Y  L C + QC SL  
Sbjct: 125 GSPARYYSMIVDTGSSLSWLQCKPCVVYCHVQADPLFDPSASKTYKSLSCTSSQCSSLVD 184

Query: 223 -----DVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFV 277
                 +    +N C+Y  +YGD S+++G L  + ++   S ++ G   GCG D++GLF 
Sbjct: 185 ATLNNPLCETSSNVCVYTASYGDSSYSMGYLSQDLLTLAPSQTLPGFVYGCGQDSDGLFG 244

Query: 278 GSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDR 308
            +AG+LGLG   LS+  Q+ +    + +YCL  R
Sbjct: 245 RAAGILGLGRNKLSMLGQVSSKFGYAFSYCLPTR 278


>gi|297803034|ref|XP_002869401.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
 gi|297315237|gb|EFH45660.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
          Length = 438

 Score =  154 bits (388), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 114/360 (31%), Positives = 165/360 (45%), Gaps = 35/360 (9%)

Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQ 218
           +   I +G+PP    + +DT SD+ WLQCRPC  CY QS PIFDP  S ++    C   Q
Sbjct: 85  FLVNISIGSPPVTQLLHMDTASDLLWLQCRPCINCYAQSLPIFDPSRSYTHRNESCRTSQ 144

Query: 219 --CKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFG------NSGSVKGIALGCGH 270
               SL  +A +   C Y + Y DG+ + G L  E + F       +S ++  +  GCGH
Sbjct: 145 YSMPSLRFNA-KTRSCEYSMRYMDGTGSKGILAKEMLMFNTIYDESSSAALHDVVFGCGH 203

Query: 271 DNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPA--SGVLEFNSARGGDAV- 327
           DN G  +   G+LGLG G  SL  +   T  +YC    D P+    VL      G D   
Sbjct: 204 DNYGEPLVGTGILGLGYGEFSLVHRF-GTKFSYCFGSLDDPSYPHNVL----VLGDDGAN 258

Query: 328 ----TAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMD-EAGDGGIIVDCGTAITR 382
               T PL   +  + FYYV +   SV G  + I P +F  + + G GG I+D G ++T 
Sbjct: 259 ILGDTTPL---EIYNGFYYVTIEAISVDGIILPIDPWVFNRNHQTGLGGTIIDTGNSLTS 315

Query: 383 LQTQAYNSLRDSFVRLAGNLKPTSGVALFDT----CYDFSGLRSVR---VPTVSLHFGAG 435
           L  +AY  L++            + V   D     CY+ +  R +     P V+ HF  G
Sbjct: 316 LVEEAYKPLKNKIEDYFEGRFTAADVNQDDMFKVECYNGNLERDLVESGFPIVTFHFSDG 375

Query: 436 KALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
             L L  K+  + + S   FC A  P +  ++ IG   QQ   + +DL   ++ F    C
Sbjct: 376 AELSLDVKSVFMKL-SPNVFCLAVTPGN--MNSIGATAQQSYNIGYDLEAKKISFERIDC 432


>gi|147821993|emb|CAN70318.1| hypothetical protein VITISV_016757 [Vitis vinifera]
          Length = 429

 Score =  154 bits (388), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 118/369 (31%), Positives = 171/369 (46%), Gaps = 46/369 (12%)

Query: 163 IGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCK-- 220
           + VG+PP+  +MVLDTGS+++WL C+     +     +FDP  SSSYSP+PC +P C+  
Sbjct: 60  LTVGSPPQTVTMVLDTGSELSWLHCKKAPNLHS----VFDPLRSSSYSPIPCTSPTCRTR 115

Query: 221 ----SLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHD----N 272
               S+ VS  +   C   ++Y D S   G+L ++T   GNS ++     GC       N
Sbjct: 116 TRDFSIPVSCDKKKLCHAIISYADASSIEGNLASDTFHIGNS-AIPATIFGCMDSGFSSN 174

Query: 273 EGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGD------- 325
                 + GL+G+  G LS   Q+     +YC+  +DS  SG+L F  +           
Sbjct: 175 SDEDSKTTGLIGMNRGSLSFVTQMGLQKFSYCISGQDS--SGILLFGESSFSWLKALKYT 232

Query: 326 ---AVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITR 382
               ++ PL    +V   Y V L G  V    +Q+P S++  D  G G  +VD GT  T 
Sbjct: 233 PLVQISTPLPYFDRVA--YTVQLEGIKVANSMLQLPKSVYAPDHTGAGQTMVDSGTQFTF 290

Query: 383 LQTQAYNSLRDSFVR-LAGNLKPTSGVAL-----FDTCYDFSGLRSVR--VPTVSLHF-G 433
           L    Y +L++ FVR    +LK             D CY     R     +PTV+L F G
Sbjct: 291 LLGPVYTALKNEFVRQTKASLKVLEDPNFVFQGAMDLCYRVPLTRRTLPPLPTVTLMFRG 350

Query: 434 AGKALDLPAKNYLIP---VDSAGTFCFAFAPTSSALS----IIGNVQQQGTRVSFDLANN 486
           A  ++      Y +P     S   +CF F   S  L     IIG+  QQ   + FDLA +
Sbjct: 351 AEMSVSAERLMYRVPGVIRGSDSVYCFTFG-NSELLGVESYIIGHHHQQNVWMEFDLAKS 409

Query: 487 RVGFTPNKC 495
           RVGF   +C
Sbjct: 410 RVGFAEVRC 418


>gi|302821814|ref|XP_002992568.1| hypothetical protein SELMODRAFT_46291 [Selaginella moellendorffii]
 gi|300139637|gb|EFJ06374.1| hypothetical protein SELMODRAFT_46291 [Selaginella moellendorffii]
          Length = 368

 Score =  154 bits (388), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 114/373 (30%), Positives = 178/373 (47%), Gaps = 45/373 (12%)

Query: 162 RIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKS 221
           ++G+G+  +  S ++DTGS+   +QC        +S P+FDP  S SY  +PC +  C +
Sbjct: 2   QLGIGSLQKNLSAIIDTGSEAVLVQCG------SRSRPVFDPAASQSYRQVPCISQLCLA 55

Query: 222 LDVSACRANR---------CLYQVAYGDGSFTVGDLVTETVSFGNSGS------VKGIAL 266
           +       +          C Y ++YGD   + GD   + +   ++ S       + +A 
Sbjct: 56  VQQQTSNGSSQPCVNSSAACTYSLSYGDSRNSTGDFSQDVIFLNSTNSSSQAVQFRDVAF 115

Query: 267 GCGHDNEGLFV--GSAGLLGLGGGMLSLTKQIK----ATSLAYCLVDR--DSPASGVLEF 318
           GC H  +G  V  GS G++G   G LSL  Q+K     +  +YC   +     A+GV+  
Sbjct: 116 GCAHSPQGFLVDLGSLGIVGFNRGNLSLPSQLKDRLGGSKFSYCFPSQPWQPRATGVIFL 175

Query: 319 -NSARGGDAVT-APLIRNKKV---DTFYYVGLTGFSVGGQAVQIPPSLFEMDEA-GDGGI 372
            +S      V+  PL+ N         YYVGLT  SV G+ + IP S F++D + GDGG 
Sbjct: 176 GDSGLSKSKVSYTPLLDNPVTPARSQLYYVGLTSISVDGKTLAIPESAFKLDPSTGDGGT 235

Query: 373 IVDCGTAITRLQTQAYNSLRDSFV--RLAGNLKPTSGVALFDTCYDFSGLRSV-RVPTVS 429
           ++D GT  TR+   AY + R++F     +G  K     A FD CY+ S   S+  VP V 
Sbjct: 236 VLDSGTTFTRVVDDAYTAFRNAFAASNRSGLRKKVGAAAGFDDCYNISAGSSLPGVPEVR 295

Query: 430 LHFGAGKALDLPAKNYLIPVDSAG---TFCFAFAPTSSA----LSIIGNVQQQGTRVSFD 482
           L       L+L  ++  +PV +AG   T C A   +  +    ++++GN QQ    V +D
Sbjct: 296 LSLQNNVRLELRFEHLFVPVSAAGNEVTVCLAILSSQKSGFGKINVLGNYQQSNYLVEYD 355

Query: 483 LANNRVGFTPNKC 495
              +RVGF    C
Sbjct: 356 NERSRVGFERADC 368


>gi|357487593|ref|XP_003614084.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355515419|gb|AES97042.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 412

 Score =  154 bits (388), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 120/404 (29%), Positives = 183/404 (45%), Gaps = 53/404 (13%)

Query: 104 SRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRI 163
           ++++R S+ +N  I +++   Y        P + Q +P          +  G+G Y    
Sbjct: 47  TQIQRISSILNYSINRVR---YLNHVFSFSPNKIQDVPLS--------SFMGAG-YVMSY 94

Query: 164 GVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLD 223
            +GTPP Q   ++DTG+D  W QC+PC  C  Q+ P+F P  SS+Y  +PC +P CK+  
Sbjct: 95  SIGTPPFQLYSLIDTGNDNIWFQCKPCKPCLNQTSPMFHPSKSSTYKTIPCTSPICKN-- 152

Query: 224 VSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGS-AGL 282
                           DG +   D +T   + G   S K I +GCGH N+G   G  +G 
Sbjct: 153 ---------------ADGHYLGVDTLTLNSNNGTPISFKNIVIGCGHRNQGPLEGYVSGN 197

Query: 283 LGLGGGMLSLTKQIKAT---SLAYCLVDRDSP--ASGVLEF---NSARGGDAVTAPLIRN 334
           +GL  G LS   Q+ ++     +YCLV   S    S  L F   ++  G   V+ P+   
Sbjct: 198 IGLARGPLSFISQLNSSIGGKFSYCLVPLFSKENVSSKLHFGDKSTVSGLGTVSTPI--- 254

Query: 335 KKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDS 394
            K +  Y+V L  FSVG   +++  S    D  G+   I+D GT +T L    Y+ L   
Sbjct: 255 -KEENGYFVSLEAFSVGDHIIKLENS----DNRGNS--IIDSGTTMTILPKDVYSRLESV 307

Query: 395 FVRLAGNLKPTSGVALFDTCYDFSGLRSV-RVPTVSLHFGAGKALDLPAKNYLIPVDSAG 453
            + +    +       F+ CY  +    + +V  ++ HF +G  + L A N   P+    
Sbjct: 308 VLDMVKLKRVKDPSQQFNLCYQTTSTTLLTKVLIITAHF-SGSEVHLNALNTFYPITDE- 365

Query: 454 TFCFAFAP--TSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
             CFAF      S+L+I GNV QQ   V FDL    + F P  C
Sbjct: 366 VICFAFVSGGNFSSLAIFGNVVQQNFLVGFDLNKKTISFKPTDC 409


>gi|326529233|dbj|BAK01010.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 441

 Score =  153 bits (387), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 129/413 (31%), Positives = 198/413 (47%), Gaps = 43/413 (10%)

Query: 108 RDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGT 167
            +  R    +++ +LA Y   + +L+ +       D S PV     Q   EY     +G 
Sbjct: 44  EERVRRAVAVSRERLA-YTQQQQQLRASG------DVSAPVHLATRQYIAEYL----IGD 92

Query: 168 PPRQFSMVLDTGSDINWLQC-RPC--TECYQQSDPIFDPKTSSSYSPLPCA--APQCKSL 222
           PP++ + ++DTGS++ W QC   C    C +Q  P ++   SS+++ +PCA  A  C + 
Sbjct: 93  PPQRAAALIDTGSNLIWTQCGTTCGLKACAKQDLPYYNLSRSSTFAAVPCADSAKLCAAN 152

Query: 223 DVSACRAN-RCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGC---GHDNEGLFVG 278
            V  C  +  C +  +YG GS   G L TE  +F  SG+ K +  GC       +G   G
Sbjct: 153 GVHLCGLDGSCTFAASYGAGS-VFGSLGTEAFTF-QSGAAK-LGFGCVSLTRITKGALNG 209

Query: 279 SAGLLGLGGGMLSLTKQIKATSLAYCLVD--RDSPASGVLEFNSAR----GGDAVTA-PL 331
           ++GL+GLG G LSL  Q  AT  +YCL    R+  AS  L   ++     GG AVT+ P 
Sbjct: 210 ASGLIGLGRGRLSLVSQTGATKFSYCLTPYLRNHGASSHLFVGASASLSGGGGAVTSIPF 269

Query: 332 IRNKK---VDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAG----DGGIIVDCGTAITRLQ 384
           +++ +     TFYY+ L G SVG   + IP + FE+         GG+I+D G+ +T L 
Sbjct: 270 VKSPEDYPYSTFYYLPLVGISVGETKLPIPSAAFELRRVAAGYWSGGVIIDTGSPVTSLA 329

Query: 385 TQAYNSLRDSFVRLAGN--LKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPA 442
             AY++L D   R      ++P +   L D C     +  V VP +  HFG G  + + A
Sbjct: 330 EAAYSALSDEVARQLNRSLVQPPADTGL-DLCVARQDVDKV-VPVLVFHFGGGADMAVSA 387

Query: 443 KNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            +Y  PVD + T C          ++IGN QQQ   + +D+    + F    C
Sbjct: 388 GSYWGPVDKS-TACM-LIEEGGYETVIGNFQQQDVHLLYDIGKGELSFQTADC 438


>gi|297810815|ref|XP_002873291.1| hypothetical protein ARALYDRAFT_487523 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297319128|gb|EFH49550.1| hypothetical protein ARALYDRAFT_487523 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 439

 Score =  153 bits (387), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 110/365 (30%), Positives = 171/365 (46%), Gaps = 28/365 (7%)

Query: 147 PVVSGASQ-GSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKT 205
           P+ SG     S  Y  ++ +GTP +   + +DT SD+ W+ C  C  C   S+  F P  
Sbjct: 86  PIASGRQMLQSTTYIVKVLIGTPAQPLLLAMDTSSDVAWIPCSGCVGC--PSNTAFSPAK 143

Query: 206 SSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIA 265
           S+S+  + C+APQCK +   AC A  C + + YG  S    +L  +T+    +  +K   
Sbjct: 144 STSFKNVSCSAPQCKQVPNPACGARACSFNLTYGSSSI-AANLSQDTIRLA-ADPIKAFT 201

Query: 266 LGCGHDNEGLFV-----GSAGLLGLGGGMLSLTKQIKATSLAYCLVD-RDSPASGVLEFN 319
            GC +   G        G  GL      ++S  + +  ++ +YCL   R    SG L   
Sbjct: 202 FGCVNKVAGGGTIPPPQGLLGLGRGPLSLMSQAQSVYKSTFSYCLPSFRSLTFSGSLRLG 261

Query: 320 SARGGDAVT-APLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGT 378
                  V    L+RN +  + YYV L    VG + V +PP+    + +   G I D GT
Sbjct: 262 PTSQPQRVKYTQLLRNPRRSSLYYVNLVAIRVGRKVVDLPPAAIAFNPSTGAGTIFDSGT 321

Query: 379 AITRLQTQAYNSLRDSFVRLAGNLKPTSGVAL----FDTCYDFSGLRSVRVPTVSLHFGA 434
             TRL    Y ++R+ F +    +KP + V      FDTCY  SG   V+VPT++  F  
Sbjct: 322 VYTRLAKPVYEAVRNEFRK---RVKPPTAVVTSLGGFDTCY--SG--QVKVPTITFMF-K 373

Query: 435 GKALDLPAKNYLIPVDSAGTFCFAFAP----TSSALSIIGNVQQQGTRVSFDLANNRVGF 490
           G  + +PA N ++   +  T C A A      +S +++I ++QQQ  RV  D+ N R+G 
Sbjct: 374 GVNMTMPADNLMLHSTAGSTSCLAMASAPENVNSVVNVIASMQQQNHRVLIDVPNGRLGL 433

Query: 491 TPNKC 495
              +C
Sbjct: 434 ARERC 438


>gi|9759559|dbj|BAB11161.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
 gi|21553652|gb|AAM62745.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
 gi|109134179|gb|ABG25087.1| At5g07030 [Arabidopsis thaliana]
          Length = 439

 Score =  153 bits (387), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 111/365 (30%), Positives = 170/365 (46%), Gaps = 28/365 (7%)

Query: 147 PVVSGASQ-GSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKT 205
           P+ SG     S  Y  +  +GTP +   + +DT SD+ W+ C  C  C   S+  F P  
Sbjct: 86  PIASGRQMLQSTTYIVKALIGTPAQPLLLAMDTSSDVAWIPCSGCVGC--PSNTAFSPAK 143

Query: 206 SSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIA 265
           S+S+  + C+APQCK +    C A  C + + YG  S    +L  +T+    +  +K   
Sbjct: 144 STSFKNVSCSAPQCKQVPNPTCGARACSFNLTYGSSSI-AANLSQDTIRLA-ADPIKAFT 201

Query: 266 LGCGHDNEGLFV-----GSAGLLGLGGGMLSLTKQIKATSLAYCLVD-RDSPASGVLEFN 319
            GC +   G        G  GL      ++S  + I  ++ +YCL   R    SG L   
Sbjct: 202 FGCVNKVAGGGTIPPPQGLLGLGRGPLSLMSQAQSIYKSTFSYCLPSFRSLTFSGSLRLG 261

Query: 320 SARGGDAVT-APLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGT 378
                  V    L+RN +  + YYV L    VG + V +PP+    + +   G I D GT
Sbjct: 262 PTSQPQRVKYTQLLRNPRRSSLYYVNLVAIRVGRKVVDLPPAAIAFNPSTGAGTIFDSGT 321

Query: 379 AITRLQTQAYNSLRDSFVRLAGNLKPTSGVAL----FDTCYDFSGLRSVRVPTVSLHFGA 434
             TRL    Y ++R+ F +    +KPT+ V      FDTCY  SG   V+VPT++  F  
Sbjct: 322 VYTRLAKPVYEAVRNEFRK---RVKPTTAVVTSLGGFDTCY--SG--QVKVPTITFMF-K 373

Query: 435 GKALDLPAKNYLIPVDSAGTFCFAFAP----TSSALSIIGNVQQQGTRVSFDLANNRVGF 490
           G  + +PA N ++   +  T C A A      +S +++I ++QQQ  RV  D+ N R+G 
Sbjct: 374 GVNMTMPADNLMLHSTAGSTSCLAMAAAPENVNSVVNVIASMQQQNHRVLIDVPNGRLGL 433

Query: 491 TPNKC 495
              +C
Sbjct: 434 ARERC 438


>gi|297736090|emb|CBI24128.3| unnamed protein product [Vitis vinifera]
          Length = 378

 Score =  153 bits (387), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 115/381 (30%), Positives = 173/381 (45%), Gaps = 45/381 (11%)

Query: 152 ASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQC------RPCTECYQ---QSDPIFD 202
           A  G G+Y     VGTP ++F +V DTGSD+ W+ C      R C+       +   +F 
Sbjct: 5   ADYGIGQYSVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHKRVFH 64

Query: 203 PKTSSSYSPLPCAAPQCK-------SLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSF 255
              SSS+  +PC    CK       SL         C Y   Y DGS  +G    ETV+ 
Sbjct: 65  ANLSSSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTALGFFANETVTV 124

Query: 256 ----GNSGSVKGIALGCGHDNEGL-FVGSAGLLGLGGGMLSLTKQIKAT-----SLAYCL 305
               G    +  + +GC    +G  F  + G++GLG    S    IKA        +YCL
Sbjct: 125 ELKEGRKMKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSFA--IKAAEKFGGKFSYCL 182

Query: 306 VDRDSP--ASGVLEFNSARGGDAVTAPLIRNK----KVDTFYYVGLTGFSVGGQAVQIPP 359
           VD  S    S  L F S+R  +A+   +   +     V++FY V + G S+GG  ++IP 
Sbjct: 183 VDHLSHKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIGGAMLKIPS 242

Query: 360 SLFEMDEAGDGGIIVDCGTAITRLQTQAYN----SLRDSFVRLAGNLKPTSGVALFDTCY 415
            ++  D  G GG I+D G+++T L   AY     +LR S ++     K    +   + C+
Sbjct: 243 EVW--DVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFR---KVEMDIGPLEYCF 297

Query: 416 DFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSS-ALSIIGNVQQ 474
           + +G     VP +  HF  G   + P K+Y+I   + G  C  F   +    S++GN+ Q
Sbjct: 298 NSTGFEESLVPRLVFHFADGAEFEPPVKSYVISA-ADGVRCLGFVSVAWPGTSVVGNIMQ 356

Query: 475 QGTRVSFDLANNRVGFTPNKC 495
           Q     FDL   ++GF P+ C
Sbjct: 357 QNHLWEFDLGLKKLGFAPSSC 377


>gi|115483168|ref|NP_001065177.1| Os10g0538200 [Oryza sativa Japonica Group]
 gi|21717168|gb|AAM76361.1|AC074196_19 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433289|gb|AAP54827.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113639786|dbj|BAF27091.1| Os10g0538200 [Oryza sativa Japonica Group]
 gi|215686408|dbj|BAG87693.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 394

 Score =  153 bits (387), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 106/357 (29%), Positives = 166/357 (46%), Gaps = 35/357 (9%)

Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAP 217
            Y +   +GTPP+  S V+D   ++ W QC+ C+ C++Q  P+FDP  S++Y   PC  P
Sbjct: 50  NYVANFTIGTPPQPASAVIDLAGELVWTQCKQCSRCFEQDTPLFDPTASNTYRAEPCGTP 109

Query: 218 QCKSL--DVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGC----GHD 271
            C+S+  D   C  N C YQ +   G  T G + T+T + G + +   +A GC      D
Sbjct: 110 LCESIPSDSRNCSGNVCAYQASTNAGD-TGGKVGTDTFAVGTAKA--SLAFGCVVASDID 166

Query: 272 NEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNS----ARGGDAV 327
             G   G +G++GLG    SL  Q    + +YCL   D+  +  L   S    A GG A 
Sbjct: 167 TMG---GPSGIVGLGRTPWSLVTQTGVAAFSYCLAPHDAGRNSALFLGSSAKLAGGGKAA 223

Query: 328 TAPLI----RNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRL 383
           + P +        +  +Y V L G   G   + +PPS           +++D  + I+ L
Sbjct: 224 STPFVNISGNGNDLSNYYKVQLEGLKAGDAMIPLPPS--------GSTVLLDTFSPISFL 275

Query: 384 QTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAK 443
              AY +++ +     G     + V  FD C+  SG  S   P +   F  G A+ +PA 
Sbjct: 276 VDGAYQAVKKAVTAAVGAPPMATPVEPFDLCFPKSG-ASGAAPDLVFTFRGGAAMTVPAT 334

Query: 444 NYLIPVDSAGTFCFAFAP-----TSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           NYL+   + GT C A        +++ LS++G++QQ+     FDL    + F P  C
Sbjct: 335 NYLLDYKN-GTVCLAMLSSARLNSTTELSLLGSLQQENIHFLFDLDKETLSFEPADC 390


>gi|388502342|gb|AFK39237.1| unknown [Lotus japonicus]
          Length = 440

 Score =  153 bits (387), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 130/456 (28%), Positives = 205/456 (44%), Gaps = 57/456 (12%)

Query: 60  PFAEESETAAESFPLNSSSSFSLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITK 119
           PF E S+T          SSF++ L   +      +N   S+  S+L R++A  +     
Sbjct: 19  PFTEPSKTP---------SSFTIDLIHHDSPPSPFYN--SSMTRSQLIRNAAMRSISRAN 67

Query: 120 LQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTG 179
                 +   ++LK +     PE    P        +G Y  RI +GTP  +   + DTG
Sbjct: 68  QLSLSLSHSLNQLKESS----PEPIIIP-------NNGNYLMRIYIGTPSVERLAIADTG 116

Query: 180 SDINWLQCRPC--TECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVS--ACR-ANRCLY 234
           SD+ W+QC PC  T+C+ Q+ P++DP  SS+++ LPC +  C  L  S   C     C+Y
Sbjct: 117 SDLTWVQCSPCDNTKCFAQNTPLYDPLNSSTFTLLPCDSQPCTQLPYSQYVCSDYGDCIY 176

Query: 235 QVAYGDGSFTVGDLVTETVSFG--NSGSVKGIALGCGHDNEGLFVG-----SAGLLGLGG 287
              YGD S++ G L ++++            I  GCG  N+  F       + G++GLG 
Sbjct: 177 AYTYGDNSYSYGGLSSDSIRLMLLQLHYNSKICFGCGFQNK--FTADKSGKTTGIVGLGA 234

Query: 288 GMLSLTKQIK---ATSLAYCLVDRDSPASGVLEFNSA---RGGDAVTAPLIRNKKVDTFY 341
           G LSL  Q+        +YCL+   S ++  L+F  A   +G   V+ PLI    +  FY
Sbjct: 235 GPLSLVSQLGDEIGHKFSYCLLPFSSNSNSKLKFGEAAIVQGNGVVSTPLIIKPDL-PFY 293

Query: 342 YVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGN 401
           Y+ L G +VG + V+   +        DG II+D G+ +T L+   YN    S V+    
Sbjct: 294 YLNLEGITVGAKTVKTGQT--------DGNIIIDSGSTLTYLEESFYNEFV-SLVKETVA 344

Query: 402 LKPTSGVAL-FDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFA 460
           ++    +   FD C+ +    S   P V  HF  G  +  P    ++  D+    C    
Sbjct: 345 VEEDQYIPYPFDFCFTYKEGMSTP-PDVVFHFTGGDVVLKPMNTLVLIEDNL--ICSTVV 401

Query: 461 PTS-SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           P+    ++I GN+ Q    V +D+   +V F P  C
Sbjct: 402 PSHFDGIAIFGNLGQIDFHVGYDIQGGKVSFAPTDC 437


>gi|79507883|ref|NP_196320.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332003717|gb|AED91100.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 455

 Score =  153 bits (386), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 111/365 (30%), Positives = 170/365 (46%), Gaps = 28/365 (7%)

Query: 147 PVVSGASQ-GSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKT 205
           P+ SG     S  Y  +  +GTP +   + +DT SD+ W+ C  C  C   S+  F P  
Sbjct: 102 PIASGRQMLQSTTYIVKALIGTPAQPLLLAMDTSSDVAWIPCSGCVGC--PSNTAFSPAK 159

Query: 206 SSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIA 265
           S+S+  + C+APQCK +    C A  C + + YG  S    +L  +T+    +  +K   
Sbjct: 160 STSFKNVSCSAPQCKQVPNPTCGARACSFNLTYGSSSI-AANLSQDTIRLA-ADPIKAFT 217

Query: 266 LGCGHDNEGLFV-----GSAGLLGLGGGMLSLTKQIKATSLAYCLVD-RDSPASGVLEFN 319
            GC +   G        G  GL      ++S  + I  ++ +YCL   R    SG L   
Sbjct: 218 FGCVNKVAGGGTIPPPQGLLGLGRGPLSLMSQAQSIYKSTFSYCLPSFRSLTFSGSLRLG 277

Query: 320 SARGGDAVT-APLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGT 378
                  V    L+RN +  + YYV L    VG + V +PP+    + +   G I D GT
Sbjct: 278 PTSQPQRVKYTQLLRNPRRSSLYYVNLVAIRVGRKVVDLPPAAIAFNPSTGAGTIFDSGT 337

Query: 379 AITRLQTQAYNSLRDSFVRLAGNLKPTSGVAL----FDTCYDFSGLRSVRVPTVSLHFGA 434
             TRL    Y ++R+ F +    +KPT+ V      FDTCY  SG   V+VPT++  F  
Sbjct: 338 VYTRLAKPVYEAVRNEFRK---RVKPTTAVVTSLGGFDTCY--SG--QVKVPTITFMF-K 389

Query: 435 GKALDLPAKNYLIPVDSAGTFCFAFAP----TSSALSIIGNVQQQGTRVSFDLANNRVGF 490
           G  + +PA N ++   +  T C A A      +S +++I ++QQQ  RV  D+ N R+G 
Sbjct: 390 GVNMTMPADNLMLHSTAGSTSCLAMAAAPENVNSVVNVIASMQQQNHRVLIDVPNGRLGL 449

Query: 491 TPNKC 495
              +C
Sbjct: 450 ARERC 454


>gi|115483166|ref|NP_001065176.1| Os10g0537800 [Oryza sativa Japonica Group]
 gi|21717159|gb|AAM76352.1|AC074196_10 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433285|gb|AAP54823.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113639785|dbj|BAF27090.1| Os10g0537800 [Oryza sativa Japonica Group]
 gi|215692411|dbj|BAG87831.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 394

 Score =  153 bits (386), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 106/357 (29%), Positives = 166/357 (46%), Gaps = 35/357 (9%)

Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAP 217
            Y +   +GTPP+  S V+D   ++ W QC+ C  C++Q  P+FDP  S++Y   PC  P
Sbjct: 50  NYVANFTIGTPPQPASAVIDLAGELVWTQCKQCGRCFEQGTPLFDPTASNTYRAEPCGTP 109

Query: 218 QCKSL--DVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGC----GHD 271
            C+S+  DV  C  N C Y+ +   G  T G + T+T + G + +   +A GC      D
Sbjct: 110 LCESIPSDVRNCSGNVCAYEASTNAGD-TGGKVGTDTFAVGTAKA--SLAFGCVVASDID 166

Query: 272 NEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNS----ARGGDAV 327
             G   G +G++GLG    SL  Q    + +YCL   D+  +  L   S    A GG A 
Sbjct: 167 TMG---GPSGIVGLGRTPWSLVTQTGVAAFSYCLAPHDAGKNSALFLGSSAKLAGGGKAA 223

Query: 328 TAPLI----RNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRL 383
           + P +        +  +Y V L G   G   + +PPS           +++D  + I+ L
Sbjct: 224 STPFVNISGNGNDLSNYYKVQLEGLKAGDAMIPLPPS--------GSTVLLDTFSPISFL 275

Query: 384 QTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAK 443
              AY +++ +     G     + V  FD C+  SG  S   P +   F  G A+ +PA 
Sbjct: 276 VDGAYQAVKKAVTVAVGAPPMATPVEPFDLCFPKSG-ASGAAPDLVFTFRGGAAMTVPAT 334

Query: 444 NYLIPVDSAGTFCFAFAP-----TSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           NYL+   + GT C A        +++ LS++G++QQ+     FDL    + F P  C
Sbjct: 335 NYLLDYKN-GTVCLAMLSSARLNSTTELSLLGSLQQENIHFLFDLDKETLSFEPADC 390


>gi|218200805|gb|EEC83232.1| hypothetical protein OsI_28526 [Oryza sativa Indica Group]
          Length = 450

 Score =  153 bits (386), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 120/360 (33%), Positives = 170/360 (47%), Gaps = 68/360 (18%)

Query: 171 QFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCK-SLDVS---- 225
             ++++DTGSD+ W+QC+PC+ CY Q DP+FDP  S+SY+ +PC A  C+ SL  +    
Sbjct: 121 NLTVIVDTGSDLTWVQCKPCSVCYAQRDPLFDPSGSASYAAVPCNASACEASLKAATGVP 180

Query: 226 -AC----------RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEG 274
            +C          ++ RC Y +AYGDGSF+ G L T+TV+ G + SV G   GCG  N G
Sbjct: 181 GSCATVGGGGGGGKSERCYYSLAYGDGSFSRGVLATDTVALGGA-SVDGFVFGCGLSNRG 239

Query: 275 LFV-GSA---------GLLGLGGGMLSL----TKQIKATSLAYCLVDRDSPASGVLEFNS 320
           L   GSA         G  G   G LSL    +    AT ++Y                 
Sbjct: 240 LRRPGSAASSPTASPPGTSGDAAGSLSLGGDTSSYRNATPVSY----------------- 282

Query: 321 ARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAI 380
                     +I +     FY++ +TG SVGG AV            G   +++D GT I
Sbjct: 283 --------TRMIADPAQPPFYFMNVTGASVGGAAVA-------AAGLGAANVLLDSGTVI 327

Query: 381 TRLQTQAYNSLRDSFVRLAGNLKPTSG--VALFDTCYDFSGLRSVRVPTVSLHFGAGKAL 438
           TRL    Y ++R  F R  G  +  +    +L D CY+ +G   V+VP ++L   AG  +
Sbjct: 328 TRLAPSVYRAVRAEFARQFGAERYPAAPPFSLLDACYNLTGHDEVKVPLLTLRLEAGADM 387

Query: 439 DLPAKNYLIPVDSAGT-FCFAFAPTS--SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            + A   L      G+  C A A  S      IIGN QQ+  RV +D   +R+GF    C
Sbjct: 388 TVDAAGMLFMARKDGSQVCLAMASLSFEDQTPIIGNYQQKNKRVVYDTVGSRLGFADEDC 447


>gi|195626958|gb|ACG35309.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
          Length = 450

 Score =  152 bits (385), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 116/350 (33%), Positives = 159/350 (45%), Gaps = 27/350 (7%)

Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQ 218
           Y  R  +GTPP+Q  + +DT +D +W+ C  C  C   S   FDP  S+SY  +PC +P 
Sbjct: 112 YVVRASLGTPPQQLLLAVDTSNDASWIPCAGCAGCPTSSAAPFDPAASASYRTVPCGSPL 171

Query: 219 CKSLDVSACR--ANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLF 276
           C     +AC      C + + Y D S             GN  +VK    GC     G  
Sbjct: 172 CAQAPNAACPPGGKACGFSLTYADSSLQAALSQDSLAVAGN--AVKAYTFGCLQRATGTA 229

Query: 277 V---GSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPA-SGVLEFNSARGGDA---VTA 329
               G  GL       LS TK +   + +YCL    S   SG L     R G      T 
Sbjct: 230 APPQGLLGLGRGPLSFLSQTKDMYEATFSYCLPSFKSLNFSGTLRLG--RNGQPQRIKTT 287

Query: 330 PLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYN 389
           PL+ N    + YYV +TG  VG + V IP      D A   G ++D GT  TRL   AY 
Sbjct: 288 PLLANPHRSSLYYVNMTGVRVGRKVVPIP----AFDPATGAGTVLDSGTMFTRLVAPAYV 343

Query: 390 SLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPV 449
           ++RD   R  G   P S +  FDTC++ +   +V  P ++L F  G  + LP +N +I  
Sbjct: 344 AVRDEVRRRVG--APVSSLGGFDTCFNTT---AVAWPPMTLLFD-GMQVTLPEENVVIHS 397

Query: 450 DSAGTFCFAFAP----TSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
                 C A A      ++ L++I ++QQQ  RV FD+ N RVGF   +C
Sbjct: 398 TYGTISCLAMAAAPDGVNTVLNVIASMQQQNHRVLFDVPNGRVGFARERC 447


>gi|297597434|ref|NP_001043968.2| Os01g0696800 [Oryza sativa Japonica Group]
 gi|255673588|dbj|BAF05882.2| Os01g0696800 [Oryza sativa Japonica Group]
          Length = 334

 Score =  152 bits (385), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 116/319 (36%), Positives = 159/319 (49%), Gaps = 25/319 (7%)

Query: 199 PIFDPKTSSSYSPLPCAAPQCKSLDVSACR--------ANRCLYQVAYGDG----SFTVG 246
           P+  P +SSS + + C    C  L    C         +  C Y  AYG+      +T G
Sbjct: 13  PLLYPTSSSSAAFVACGDRTCGELPRPLCSNVAGGGSGSGNCSYHYAYGNARDTHHYTEG 72

Query: 247 DLVTETVSFGN-SGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCL 305
            L+TET +FG+ + +  GIA GC   +EG F   +GL+GLG G LSL  Q+   +  Y L
Sbjct: 73  ILMTETFTFGDDAAAFPGIAFGCTLRSEGGFGTGSGLVGLGRGKLSLVTQLNVEAFGYRL 132

Query: 306 ---VDRDSPAS-GVLEFNSARGGDA-VTAPLIRNKKVDT--FYYVGLTGFSVGGQAVQIP 358
              +   SP S G L   +   GD+ ++ PL+ N  V    FYYVGLTG SVGG+ VQIP
Sbjct: 133 SSDLSAPSPISFGSLADVTGGNGDSFMSTPLLTNPVVQDLPFYYVGLTGISVGGKLVQIP 192

Query: 359 PSLFEMDEA-GDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDF 417
              F  D + G GG+I D GT +T L   AY  +RD  +   G  KP       D     
Sbjct: 193 SGTFSFDRSTGAGGVIFDSGTTLTMLPDPAYTLVRDELLSQMGFQKPPPAANDDDLICFT 252

Query: 418 SGLRSVRVPTVSLHFGAGKALDLPAKNYLIPV---DSAGTFCFAFAPTSSALSIIGNVQQ 474
            G  +   P++ LHF  G  +DL  +NYL  +   +     C++   +S AL+IIGN+ Q
Sbjct: 253 GGSSTTTFPSMVLHFDGGADMDLSTENYLPQMQGQNGETARCWSVVKSSQALTIIGNIMQ 312

Query: 475 QGTRVSFDLANN-RVGFTP 492
               V FDL+ N R+ F P
Sbjct: 313 MDFHVVFDLSGNARMLFQP 331


>gi|148907752|gb|ABR17002.1| unknown [Picea sitchensis]
          Length = 454

 Score =  152 bits (385), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 117/378 (30%), Positives = 180/378 (47%), Gaps = 50/378 (13%)

Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYS 210
           +G Y++RI +GTPPR F + +DTGSDI W+ C+PC  C   S        FDP+ SS+ S
Sbjct: 38  AGLYYTRIELGTPPRPFYVQIDTGSDILWVNCKPCNACPLTSGLGVALNFFDPRGSSTAS 97

Query: 211 PLPCAAPQCKS---LDVSACRANR-CLYQVAYGDGSFTVGDLVTETVSFG--------NS 258
           PL C   +C S   +  S C  +R C Y   YGDGS T+G  V++   +         N+
Sbjct: 98  PLSCIDSKCVSSNQISESVCTTDRYCGYSFEYGDGSGTLGYYVSDEFDYNQYVNQYVTNN 157

Query: 259 GSVKGIALGCGHDNEGLFV----GSAGLLGLGGGMLSLTKQIKATSLA-----YCLVDRD 309
            S K I  GC ++  G          G+ G G   LS+  Q+ +  LA     +CL   D
Sbjct: 158 ASAK-ITFGCSYNQSGDLTKPDRAVDGIFGFGQNDLSVVSQLNSQGLAPKIFSHCLEGAD 216

Query: 310 SPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGD 369
            P  G+L          V  P++ ++     Y + L G +V GQ + I P +F       
Sbjct: 217 -PGGGILVLGEITEPGMVYTPIVPSQP---HYNLNLQGIAVNGQQLSIDPQVFATTNT-- 270

Query: 370 GGIIVDCGTAITRLQTQAYNSLRDSFV-RLAGNLKP--TSGVALFDTCYDFSGLRSVRVP 426
            G I+DCGT +  L  +AY    ++ +  ++ + +P    G   F T +    +     P
Sbjct: 271 RGTIIDCGTTLAYLAEEAYEPFVNTIIAAVSQSTQPFMLKGNPCFLTVHSIDEI----FP 326

Query: 427 TVSLHFGAGKALDLPAKNYLIPV---DSAGTFCFAF------APTSSALSIIGNVQQQGT 477
           +V+L+F  G  +DL  K+YLI     DS+  +C  +      A  SS ++I+G++  +  
Sbjct: 327 SVTLYF-EGAPMDLKPKDYLIQQLSPDSSPVWCIGWQKSGQQATDSSKMTILGDLVLKDK 385

Query: 478 RVSFDLANNRVGFTPNKC 495
              +DL N R+G+T   C
Sbjct: 386 VFVYDLENQRIGWTSFDC 403


>gi|356507997|ref|XP_003522749.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 440

 Score =  152 bits (384), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 126/363 (34%), Positives = 173/363 (47%), Gaps = 21/363 (5%)

Query: 145 STPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPK 204
           + P+ SG +   G Y  R+ +GTP +   MVLDT +D  ++ C  CT C   SD  F PK
Sbjct: 86  TAPIASGQTFNIGNYVVRVKLGTPGQLLFMVLDTSTDEAFVPCSGCTGC---SDTTFSPK 142

Query: 205 TSSSYSPLPCAAPQC---KSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSV 261
            S+SY PL C+ PQC   + L   A     C +  +Y   SF+   LV +++    +  +
Sbjct: 143 ASTSYGPLDCSVPQCGQVRGLSCPATGTGACSFNQSYAGSSFS-ATLVQDSLRLA-TDVI 200

Query: 262 KGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS---LAYCLVDRDSPA-SGVLE 317
              + GC +   G  V + GLLGLG G LSL  Q  +      +YCL    S   SG L+
Sbjct: 201 PNYSFGCVNAITGASVPAQGLLGLGRGPLSLLSQSGSNYSGIFSYCLPSFKSYYFSGSLK 260

Query: 318 FNSARGGDAV-TAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDC 376
                   ++ T PL+R+    + YYV  TG SVG   V  P      +     G I+D 
Sbjct: 261 LGPVGQPKSIRTTPLLRSPHRPSLYYVNFTGISVGRVLVPFPSEYLGFNPNTGSGTIIDS 320

Query: 377 GTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGK 436
           GT ITR     YN++R+ F +  G    TS +  FDTC  F        P ++LHF  G 
Sbjct: 321 GTVITRFVEPVYNAVREEFRKQVGGTTFTS-IGAFDTC--FVKTYETLAPPITLHF-EGL 376

Query: 437 ALDLPAKNYLIPVDSAGTFCFAFAP----TSSALSIIGNVQQQGTRVSFDLANNRVGFTP 492
            L LP +N LI   +    C A A      +S L++I N QQQ  R+ FD  NN+VG   
Sbjct: 377 DLKLPLENSLIHSSAGSLACLAMAAAPDNVNSVLNVIANFQQQNLRILFDTVNNKVGIAR 436

Query: 493 NKC 495
             C
Sbjct: 437 EVC 439


>gi|296090179|emb|CBI39998.3| unnamed protein product [Vitis vinifera]
          Length = 334

 Score =  152 bits (384), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 119/373 (31%), Positives = 160/373 (42%), Gaps = 58/373 (15%)

Query: 135 AEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECY 194
           +EA I P     PV    S  +GEY  +I +GTPP     + DTGSD+ W QC PC  CY
Sbjct: 4   SEASISPNTPEPPV----SSNNGEYLMKISIGTPPFDVYGIYDTGSDLMWTQCLPCLSCY 59

Query: 195 QQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVS 254
           +Q +P+FDP  S+S+  + C + QC+ LD                               
Sbjct: 60  KQKNPMFDPSKSTSFKEVSCESQQCRLLDTPT---------------------------- 91

Query: 255 FGNSGSVKGIALGCGHDNEGLF-VGSAGLLGLGGGMLSLTKQIKAT-----SLAYCLVD- 307
                S+  I  GCGH+N G F     GL G GG  LSLT QI +T       + CLV  
Sbjct: 92  -----SILNIVFGCGHNNSGTFNENEMGLFGTGGRPLSLTSQIMSTLGSGRKFSQCLVPF 146

Query: 308 RDSPA--SGVLEFNSAR--GGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFE 363
           R  P+  S ++    A   G D V+ PL+  K   T+Y+V L G SVG +     P    
Sbjct: 147 RTDPSITSKIIFGPEAEVSGSDVVSTPLV-TKDDPTYYFVTLDGISVGDKLF---PFSSS 202

Query: 364 MDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALF-DTCYDFSGLRS 422
              A  G + +D GT  T L    YN L    V+ A  ++P     L    CY  + L  
Sbjct: 203 SPMATKGNVFIDAGTPPTLLPRDFYNRLVQG-VKEAIPMEPVQDPDLQPQLCYRSATL-- 259

Query: 423 VRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFD 482
           +  P ++ HF        P   ++ P +  G +CFA  P      I GN  Q    + FD
Sbjct: 260 IDGPILTAHFDGADVQLKPLNTFISPKE--GVYCFAMQPIDGDTGIFGNFVQMNFLIGFD 317

Query: 483 LANNRVGFTPNKC 495
           L   +V F    C
Sbjct: 318 LDGKKVSFKAVDC 330


>gi|125543639|gb|EAY89778.1| hypothetical protein OsI_11320 [Oryza sativa Indica Group]
          Length = 488

 Score =  152 bits (384), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 137/427 (32%), Positives = 191/427 (44%), Gaps = 58/427 (13%)

Query: 115 TLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTP------ 168
           T IT L   +Y V R E     AQI       PVV G + G    FS      P      
Sbjct: 71  TSITSLPPQVYQVVRDEFA---AQI-----KLPVVPGNATGPYTCFSAPSQAKPDVPKLV 122

Query: 169 ----------PRQ---FSMVLDTGSDINWLQCRPCTEC-----YQQSD----PIFDPKTS 206
                     PR+   F +  D G+ I  L      E      +QQ +    P FD  TS
Sbjct: 123 LHFEGATMDLPRENYVFEVPDDAGNSIICLAINKGDETTIIGNFQQQNMHALPYFDRSTS 182

Query: 207 SSYSPLPCAAPQCKSLDVSACRANR------CLYQVAYGDGSFTVGDLVTETVSFGNSGS 260
           S+     C +  C+ L V++C   +      C+Y   Y D S T G L  +  +FG   S
Sbjct: 183 STLLLTSCDSTLCQGLLVASCGNTKFWPNQTCVYTYYYNDKSVTTGLLEVDKFTFGAGAS 242

Query: 261 VKGIALGCGHDNEGLFVGS-AGLLGLGGGMLSLTKQIKATSLAYCL--VDRDSPASGVLE 317
           V G+A GCG  N G+F  +  G+ G G G LSL  Q+K  + ++C   V+    ++ +L+
Sbjct: 243 VPGVAFGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKVGNFSHCFTAVNGLKQSTVLLD 302

Query: 318 -----FNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGI 372
                + + RG    T PLI+N    T YY+ L G +VG   + +P S F +   G GG 
Sbjct: 303 LLADLYKNGRGAVQST-PLIQNSANPTLYYLSLKGITVGSTRLPVPESAFALTN-GTGGT 360

Query: 373 IVDCGTAITRLQTQAYNSLRDSF-VRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLH 431
           I+D GT+IT L  Q Y  +RD F  ++   + P +    + TC+         VP + LH
Sbjct: 361 IIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGNATGPY-TCFSAPSQAKPDVPKLVLH 419

Query: 432 FGAGKALDLPAKNYLIPV-DSAGT--FCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRV 488
           F  G  +DLP +NY+  V D AG    C A        + IGN QQQ   V +DL NN +
Sbjct: 420 F-EGATMDLPRENYVFEVPDDAGNSMICLAINELGDERATIGNFQQQNMHVLYDLQNNML 478

Query: 489 GFTPNKC 495
            F   +C
Sbjct: 479 SFVAAQC 485



 Score = 66.6 bits (161), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 49/136 (36%), Positives = 67/136 (49%), Gaps = 8/136 (5%)

Query: 344 GLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSF-VRLAGNL 402
           G  G +VG   + +P S F +   G GG I+D GT+IT L  Q Y  +RD F  ++   +
Sbjct: 38  GRPGITVGSTRLPVPESAFALTN-GTGGTIIDSGTSITSLPPQVYQVVRDEFAAQIKLPV 96

Query: 403 KPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPV-DSAGT--FCFAF 459
            P +    + TC+         VP + LHF  G  +DLP +NY+  V D AG    C A 
Sbjct: 97  VPGNATGPY-TCFSAPSQAKPDVPKLVLHF-EGATMDLPRENYVFEVPDDAGNSIICLAI 154

Query: 460 APTSSALSIIGNVQQQ 475
                  +IIGN QQQ
Sbjct: 155 N-KGDETTIIGNFQQQ 169


>gi|359806832|ref|NP_001241567.1| uncharacterized protein LOC100819698 precursor [Glycine max]
 gi|255638149|gb|ACU19388.1| unknown [Glycine max]
          Length = 437

 Score =  152 bits (384), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 113/366 (30%), Positives = 168/366 (45%), Gaps = 31/366 (8%)

Query: 147 PVVSGAS-QGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKT 205
           P+ SG     S  Y  +  +GTP +   + +DT +D +W+ C  C  C   +   F P  
Sbjct: 85  PIASGRQITQSPTYIVKAKIGTPAQTLLLAMDTSNDASWVPCTACVGCSTTTP--FAPAK 142

Query: 206 SSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIA 265
           S+++  + C A QCK +    C  + C +   YG  S     LV +TV+   +  V   A
Sbjct: 143 STTFKKVGCGASQCKQVRNPTCDGSACAFNFTYGTSS-VAASLVQDTVTLA-TDPVPAYA 200

Query: 266 LGCGHDNEGLFV---GSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFN-SA 321
            GC     G  V   G  GL      +L+ T+++  ++ +YCL     P+   L F+ S 
Sbjct: 201 FGCIQKVTGSSVPPQGLLGLGRGPLSLLAQTQKLYQSTFSYCL-----PSFKTLNFSGSL 255

Query: 322 RGGDAVT------APLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVD 375
           R G           PL++N +  + YYV L    VG + V IPP     +     G + D
Sbjct: 256 RLGPVAQPKRIKFTPLLKNPRRSSLYYVNLVAIRVGRRIVDIPPEALAFNANTGAGTVFD 315

Query: 376 CGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVAL--FDTCYDFSGLRSVRVPTVSLHFG 433
            GT  TRL   AYN++R+ F R     K  +  +L  FDTCY       +  PT++  F 
Sbjct: 316 SGTVFTRLVEPAYNAVRNEFRRRIAVHKKLTVTSLGGFDTCYT----APIVAPTITFMF- 370

Query: 434 AGKALDLPAKNYLIPVDSAGTFCFAFAP----TSSALSIIGNVQQQGTRVSFDLANNRVG 489
           +G  + LP  N LI   +    C A AP     +S L++I N+QQQ  RV FD+ N+R+G
Sbjct: 371 SGMNVTLPPDNILIHSTAGSVTCLAMAPAPDNVNSVLNVIANMQQQNHRVLFDVPNSRLG 430

Query: 490 FTPNKC 495
                C
Sbjct: 431 VARELC 436


>gi|413925432|gb|AFW65364.1| hypothetical protein ZEAMMB73_378208 [Zea mays]
          Length = 418

 Score =  152 bits (383), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 116/353 (32%), Positives = 175/353 (49%), Gaps = 26/353 (7%)

Query: 155 GSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPC 214
           G G Y     +GTPP+  S + DTGSD+ W +C  C  C  +    + P  SSS+S LPC
Sbjct: 77  GGGAYDMTFSMGTPPQTLSALADTGSDLIWAKCGACKRCAPRGSASYYPTKSSSFSKLPC 136

Query: 215 AAPQCKSLD---VSACRANR-----CLYQVAYGDGS----FTVGDLVTETVSFGNSGSVK 262
           ++  C++L+   ++ C   R     C Y+ +YG  S    +T G + +ET + G S +V+
Sbjct: 137 SSALCRTLESQSLATCGGTRARGAVCSYRYSYGLSSNPHHYTQGYMGSETFTLG-SDAVQ 195

Query: 263 GIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSAR 322
           GI  GC   +EG +   +GL+GLG G LSL +Q+K  + +YCL    S +S +L    A 
Sbjct: 196 GIGFGCTTMSEGGYGSGSGLVGLGRGKLSLVRQLKVGAFSYCLTSDPSTSSPLLFGAGAL 255

Query: 323 GGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITR 382
            G  V +  + N K  TFY V L   S+G  A + P +       G  GII D GT +T 
Sbjct: 256 TGPGVQSTPLVNLKTSTFYTVNLDSISIG--AAKTPGT-------GRHGIIFDSGTTLTF 306

Query: 383 LQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPA 442
           L   AY       +    NL    G   ++ C+  SG      P++ LHF  G  + L  
Sbjct: 307 LAEPAYTLAEAGLLSQTTNLTRVPGTDGYEVCFQTSG--GAVFPSMVLHFDGGD-MALKT 363

Query: 443 KNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           +NY   V+ + + C+    + S +SI+GN+ Q    + +DL  + + F P  C
Sbjct: 364 ENYFGAVNDSVS-CWLVQKSPSEMSIVGNIMQMDYHIRYDLDKSVLSFQPTNC 415


>gi|302143530|emb|CBI22091.3| unnamed protein product [Vitis vinifera]
          Length = 360

 Score =  152 bits (383), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 101/296 (34%), Positives = 148/296 (50%), Gaps = 22/296 (7%)

Query: 222 LDVSACRANR--CLYQVAYGDGSFTVGDLVTETVSFGNSGS--------VKGIALGCGHD 271
           L  + C+A    C Y   YGD S T GD   ET +   + S        V+ +  GCGH 
Sbjct: 62  LVTNPCKAENQTCPYYYWYGDSSNTTGDFALETFTVNLTMSSGKPELRRVENVMFGCGHW 121

Query: 272 NEGLFVGSAGLLGLGGGMLSLTKQIKA---TSLAYCLVDRDSPASGVLEFNSARGGDAVT 328
           N GLF G+AGLLGLG G LS + Q+++    S +YCLVDR+S A+   +       D ++
Sbjct: 122 NRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDANVSSKLIFGEDKDLLS 181

Query: 329 APLI--------RNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAI 380
            P +        +   VDTFYYV +    VGG+ V IP   +++   G GG I+D GT +
Sbjct: 182 HPELNFTTLVAGKENPVDTFYYVQIKSIVVGGEVVNIPEEKWQIATDGSGGTIIDSGTTL 241

Query: 381 TRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDL 440
           +     AY  ++++F+             + + CY+ +G+    +P   + F  G   + 
Sbjct: 242 SYFAEPAYQVIKEAFMAKVKGYPVVKDFPVLEPCYNVTGVEQPDLPDFGIVFSDGAVWNF 301

Query: 441 PAKNYLIPVDSAGTFCFAFAPT-SSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           P +NY I ++     C A   T  SALSIIGN QQQ   + +D   +R+GF P KC
Sbjct: 302 PVENYFIEIEPREVVCLAILGTPPSALSIIGNYQQQNFHILYDTKKSRLGFAPTKC 357


>gi|356515690|ref|XP_003526531.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 439

 Score =  152 bits (383), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 126/363 (34%), Positives = 174/363 (47%), Gaps = 21/363 (5%)

Query: 145 STPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPK 204
           + P+ SG +   G Y  R+ +GTP +   MVLDT +D  ++ C  CT C   SD  F PK
Sbjct: 85  TAPIASGQAFNIGNYVVRVKLGTPGQLLFMVLDTSTDEAFVPCSGCTGC---SDTTFSPK 141

Query: 205 TSSSYSPLPCAAPQCKSLDVSACRAN---RCLYQVAYGDGSFTVGDLVTETVSFGNSGSV 261
            S+SY PL C+ PQC  +   +C A     C +  +Y   SF+   LV + +    +  +
Sbjct: 142 ASTSYGPLDCSVPQCGQVRGLSCPATGTGACSFNQSYAGSSFS-ATLVQDALRLA-TDVI 199

Query: 262 KGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS---LAYCLVDRDSPA-SGVLE 317
              + GC +   G  V + GLLGLG G LSL  Q  +      +YCL    S   SG L+
Sbjct: 200 PYYSFGCVNAITGASVPAQGLLGLGRGPLSLLSQSGSNYSGIFSYCLPSFKSYYFSGSLK 259

Query: 318 FNSARGGDAV-TAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDC 376
                   ++ T PL+R+    + YYV  TG SVG   V  P      +     G I+D 
Sbjct: 260 LGPVGQPKSIRTTPLLRSPHRPSLYYVNFTGISVGRVLVPFPSEYLGFNPNTGSGTIIDS 319

Query: 377 GTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGK 436
           GT ITR     YN++R+ F +  G    TS +  FDTC  F        P ++LHF  G 
Sbjct: 320 GTVITRFVEPVYNAVREEFRKQVGGTTFTS-IGAFDTC--FVKTYETLAPPITLHF-EGL 375

Query: 437 ALDLPAKNYLIPVDSAGTFCFAFAP----TSSALSIIGNVQQQGTRVSFDLANNRVGFTP 492
            L LP +N LI   +    C A A      +S L++I N QQQ  R+ FD+ NN+VG   
Sbjct: 376 DLKLPLENSLIHSSAGSLACLAMAAAPDNVNSVLNVIANFQQQNLRILFDIVNNKVGIAR 435

Query: 493 NKC 495
             C
Sbjct: 436 EVC 438


>gi|242094478|ref|XP_002437729.1| hypothetical protein SORBIDRAFT_10g001440 [Sorghum bicolor]
 gi|241915952|gb|EER89096.1| hypothetical protein SORBIDRAFT_10g001440 [Sorghum bicolor]
          Length = 486

 Score =  152 bits (383), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 120/368 (32%), Positives = 174/368 (47%), Gaps = 41/368 (11%)

Query: 151 GASQGSGEYFSRI-------GVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDP 203
           G SQ S E  S I       G  +PP   ++VLDT  D+ W++C PCT   Q +D  +DP
Sbjct: 137 GTSQTSSEPSSGIHPAAATDGSSSPP--VTVVLDTAGDVPWMRCVPCTFA-QCAD--YDP 191

Query: 204 KTSSSYSPLPCAAPQCKSLD--VSACRAN-RCLYQVAYGDGSFTVGDLVTETVSFGNSGS 260
             SS+YS  PC +  CK L    + C AN +C Y V     SFT     +  V   NSG 
Sbjct: 192 TRSSTYSAFPCNSSACKQLGRYANGCDANGQCQYMVVTAGDSFTTSGTYSSDVLTINSGD 251

Query: 261 -VKGIALGCGHDNEGLFVGSA-GLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASGV 315
            V+G   GC  + +G F   A G++ LG G+ SL  Q  +T   + +YCL   ++   G 
Sbjct: 252 RVEGFRFGCSQNEQGSFENQADGIMALGRGVQSLMAQTSSTYGDAFSYCLPPTET-TKGF 310

Query: 316 LEFNSARGGDA--VTAPLIRNK-----KVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAG 368
            +     G     VT P+++ +        T Y   L   +V G+ + +P  +F      
Sbjct: 311 FQIGVPIGASYRFVTTPMLKERGGASAAAATLYRALLLAITVDGKELNVPAEVFA----- 365

Query: 369 DGGIIVDCGTAITRLQTQAYNSLRDSFV-RLAGNLKPTSGVALFDTCYDFSGLRSVRVPT 427
             G ++D  T ITRL   AY +LR +F  R+   + P       DTCYD +G+R  R+P 
Sbjct: 366 -AGTVMDSRTIITRLPVTAYGALRAAFRNRMRYRVAPPQ--EELDTCYDLTGVRYPRLPR 422

Query: 428 VSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNR 487
           ++L F     +++     L+     G   FA     S+ SI+GNVQQQ  +V  D+   R
Sbjct: 423 IALVFDGNAVVEMDRSGILL----NGCLAFASNDDDSSPSILGNVQQQTIQVLHDVGGGR 478

Query: 488 VGFTPNKC 495
           +GF    C
Sbjct: 479 IGFRSAAC 486


>gi|356508308|ref|XP_003522900.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 439

 Score =  151 bits (382), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 113/369 (30%), Positives = 172/369 (46%), Gaps = 36/369 (9%)

Query: 147 PVVSGASQ-GSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKT 205
           P+ SG     S  Y  R  +G+PP+   + +DT +D  W+   PCT C   +  +F P+ 
Sbjct: 85  PIASGRQIIQSPTYIVRAKIGSPPQTLLLAMDTSNDAAWI---PCTACDGCTSTLFAPEK 141

Query: 206 SSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIA 265
           S+++  + C +PQC  +   +C  + C + + YG  S    ++V +TV+   +  +    
Sbjct: 142 STTFKNVSCGSPQCNQVPNPSCGTSACTFNLTYGSSSI-AANVVQDTVTLA-TDPIPDYT 199

Query: 266 LGCGHDNEGLFV---GSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFN-SA 321
            GC     G      G  GL      +LS T+ +  ++ +YCL     P+   L F+ S 
Sbjct: 200 FGCVAKTTGASAPPQGLLGLGRGPLSLLSQTQNLYQSTFSYCL-----PSFKSLNFSGSL 254

Query: 322 RGGDAVT------APLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVD 375
           R G           PL++N +  + YYV L    VG + V IPP     + A   G + D
Sbjct: 255 RLGPVAQPIRIKYTPLLKNPRRSSLYYVNLVAIRVGRKVVDIPPEALAFNAATGAGTVFD 314

Query: 376 CGTAITRLQTQAYNSLRDSFVR-----LAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSL 430
            GT  TRL   AY ++RD F R        NL  TS +  FDTCY       +  PT++ 
Sbjct: 315 SGTVFTRLVAPAYTAVRDEFQRRVAIAAKANLTVTS-LGGFDTCYTV----PIVAPTITF 369

Query: 431 HFGAGKALDLPAKNYLIPVDSAGTFCFAFAP----TSSALSIIGNVQQQGTRVSFDLANN 486
            F +G  + LP  N LI   +  T C A A      +S L++I N+QQQ  RV +D+ N+
Sbjct: 370 MF-SGMNVTLPEDNILIHSTAGSTTCLAMASAPDNVNSVLNVIANMQQQNHRVLYDVPNS 428

Query: 487 RVGFTPNKC 495
           R+G     C
Sbjct: 429 RLGVARELC 437


>gi|242053991|ref|XP_002456141.1| hypothetical protein SORBIDRAFT_03g031170 [Sorghum bicolor]
 gi|241928116|gb|EES01261.1| hypothetical protein SORBIDRAFT_03g031170 [Sorghum bicolor]
          Length = 519

 Score =  151 bits (382), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 126/433 (29%), Positives = 197/433 (45%), Gaps = 84/433 (19%)

Query: 142 EDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTE-----CYQQ 196
           E F+ P+ SGA  G+G+YF R  VGTP R F +V DTGSD+ W++C           Y  
Sbjct: 90  EAFAMPLSSGAYTGTGQYFVRFRVGTPARPFLLVADTGSDLTWVKCHRHDHDAPAPGYGY 149

Query: 197 SDP----------------------IFDPKTSSSYSPLPCAAPQCKS---LDVSAC--RA 229
           + P                      +F P  S +++P+PC++  C +     ++AC    
Sbjct: 150 AAPASNDSSTSSLSAAAASSSSHARVFRPDRSRTWAPIPCSSDTCTASLPFSLAACPTPG 209

Query: 230 NRCLYQVAYGDGSFTVGDLVTETVSFGNSG----------SVKGIALGCGHDNEG-LFVG 278
           + C Y   Y DGS   G + T++ +   SG           ++G+ LGC     G  F+ 
Sbjct: 210 SPCAYDYRYKDGSAARGTVGTDSATIALSGRGAKKKQRQAKLRGVVLGCTTSYTGDSFLA 269

Query: 279 SAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSP--ASGVLEF--NSARGGDAVT--- 328
           S G+L LG   +S   +  A      +YCLVD  +P  A+  L F  N A      +   
Sbjct: 270 SDGVLSLGYSNISFASRAAARFGGRFSYCLVDHLAPRNATSYLTFGPNPAVSSSPPSKTA 329

Query: 329 -------------------APLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGD 369
                               PL+ + ++  FY V + G SV G+ ++IP  ++  D A  
Sbjct: 330 CAGGGSPAAAPPGPGGARQTPLLLDHRMRPFYAVTVNGISVDGELLRIPRLVW--DVAKG 387

Query: 370 GGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLR-----SVR 424
           GG I+D GT++T L + AY ++  +  +    L P   +  FD CY+++        +V 
Sbjct: 388 GGAILDSGTSLTVLVSPAYRAVVAALNKKLAGL-PRVTMDPFDYCYNWTSPSTGEDLTVA 446

Query: 425 VPTVSLHFGAGKALDLPAKNYLIPVDSA-GTFCFAFAPTS-SALSIIGNVQQQGTRVSFD 482
           +P +++HF     L  PAK+Y+I  D+A G  C          +S+IGN+ QQ     FD
Sbjct: 447 MPELAVHFAGSARLQPPAKSYVI--DAAPGVKCIGLQEGEWPGVSVIGNILQQEHLWEFD 504

Query: 483 LANNRVGFTPNKC 495
           L N R+ F  ++C
Sbjct: 505 LKNRRLRFKRSRC 517


>gi|413951979|gb|AFW84628.1| putative aspartic protease family protein [Zea mays]
          Length = 435

 Score =  151 bits (382), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 119/368 (32%), Positives = 178/368 (48%), Gaps = 43/368 (11%)

Query: 163 IGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSL 222
           + VGTPP+  +MVLDTGS+++WL C         +D  F P+ S++++ +PC + +C S 
Sbjct: 65  LAVGTPPQNVTMVLDTGSELSWLLCATGRAAAAAAD-SFRPRASATFAAVPCGSARCSSR 123

Query: 223 DVSA---CRA--NRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGC---GHDNEG 274
           D+ A   C A   RC   ++Y DGS + G L T+  + G++  ++  A GC    +D+  
Sbjct: 124 DLPAPPSCDAASRRCRVSLSYADGSASDGALATDVFAVGDAPPLRS-AFGCMSAAYDSSP 182

Query: 275 LFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSA---------RGGD 325
             V +AGLLG+  G LS   Q      +YC+ DRD   +GVL    +             
Sbjct: 183 DAVATAGLLGMNRGALSFVTQASTRRFSYCISDRDD--AGVLLLGHSDLPFLPLNYTPLY 240

Query: 326 AVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQT 385
             T PL    +V   Y V L G  VGG+ + IPPS+   D  G G  +VD GT  T L  
Sbjct: 241 QPTPPLPYFDRVA--YSVQLLGIRVGGKPLPIPPSVLAPDHTGAGQTMVDSGTQFTFLLG 298

Query: 386 QAYNSLRDSFVRLAGNLKPT------SGVALFDTCYDFSGLR---SVRVPTVSLHF-GAG 435
            AY++++  F++    L P       +    FDTC+     R   S R+P V+L F GA 
Sbjct: 299 DAYSAVKAEFLKQTKPLLPALEDPSFAFQEAFDTCFRVPKGRPPPSARLPPVTLLFNGAQ 358

Query: 436 KALDLPAKNYLIPVD---SAGTFCFAFA-----PTSSALSIIGNVQQQGTRVSFDLANNR 487
            ++      Y +P +   + G +C  F      P ++   +IG+  Q    V +DL   R
Sbjct: 359 MSVAGDRLLYKVPGERRGADGVWCLTFGNADMVPLTA--YVIGHHHQMNLWVEYDLERGR 416

Query: 488 VGFTPNKC 495
           VG  P KC
Sbjct: 417 VGLAPVKC 424


>gi|356539555|ref|XP_003538263.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 438

 Score =  150 bits (380), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 113/369 (30%), Positives = 173/369 (46%), Gaps = 36/369 (9%)

Query: 147 PVVSGASQ-GSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKT 205
           P+ SG     S  Y  R  +GTPP+   + +DT +D  W+   PCT C   +  +F P+ 
Sbjct: 84  PIASGRQIIQSPTYIVRAKIGTPPQTLLLAIDTSNDAAWI---PCTACDGCTSTLFAPEK 140

Query: 206 SSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIA 265
           S+++  + C +P+C  +   +C  + C + + YG  S    ++V +TV+   +  + G  
Sbjct: 141 STTFKNVSCGSPECNKVPSPSCGTSACTFNLTYGSSSI-AANVVQDTVTLA-TDPIPGYT 198

Query: 266 LGCGHDNEGLFV---GSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFN-SA 321
            GC     G      G  GL      +LS T+ +  ++ +YCL     P+   L F+ S 
Sbjct: 199 FGCVAKTTGPSTPPQGLLGLGRGPLSLLSQTQNLYQSTFSYCL-----PSFKSLNFSGSL 253

Query: 322 RGGDAVT------APLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVD 375
           R G           PL++N +  + YYV L    VG + V IPP+    + A   G + D
Sbjct: 254 RLGPVAQPIRIKYTPLLKNPRRSSLYYVNLFAIRVGRKIVDIPPAALAFNAATGAGTVFD 313

Query: 376 CGTAITRLQTQAYNSLRDSFVR-----LAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSL 430
            GT  TRL    Y ++RD F R        NL  TS +  FDTCY       +  PT++ 
Sbjct: 314 SGTVFTRLVAPVYTAVRDEFRRRVAMAAKANLTVTS-LGGFDTCYTV----PIVAPTITF 368

Query: 431 HFGAGKALDLPAKNYLIPVDSAGTFCFAFAP----TSSALSIIGNVQQQGTRVSFDLANN 486
            F +G  + LP  N LI   +  T C A A      +S L++I N+QQQ  RV +D+ N+
Sbjct: 369 MF-SGMNVTLPQDNILIHSTAGSTSCLAMASAPDNVNSVLNVIANMQQQNHRVLYDVPNS 427

Query: 487 RVGFTPNKC 495
           R+G     C
Sbjct: 428 RLGVARELC 436


>gi|296082172|emb|CBI21177.3| unnamed protein product [Vitis vinifera]
          Length = 376

 Score =  150 bits (380), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 95/296 (32%), Positives = 151/296 (51%), Gaps = 35/296 (11%)

Query: 13  TTILFSFCLFTS--ASSRGLS-ETATTVLDVSSALQQTEHILSFEPETL---EPFAEESE 66
           T  L  F L+++  +S RGL+ +   T L   S L    HI S  P ++    P  ++  
Sbjct: 7   TIFLLKFLLYSALLSSKRGLAFQGRKTALSTPSTLHNV-HITSLMPSSVCSPSPKGDDKR 65

Query: 67  TAAESFPLNSSSSFSLPLHSREILHKTRHNDYRSLVLSR-LERDSARVNTLITKLQLAIY 125
            + E             +H      K   +  RS   ++ L++D +RVN++  + +LA  
Sbjct: 66  ASLEV------------IHKHGPCSKLSQDKGRSPSRTQMLDQDESRVNSI--RSRLAKN 111

Query: 126 NVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWL 185
             D  +LK ++  +       P  SG++ G+G Y   +G+GTP R  + + DTGSD+ W 
Sbjct: 112 PADGGKLKGSKVTL-------PSKSGSTIGTGNYVVTVGLGTPKRDLTFIFDTGSDLTWT 164

Query: 186 QCRPCTE-CYQQSDPIFDPKTSSSYSPLPCAAPQCKSL-----DVSACRANRCLYQVAYG 239
           QC PC   CY Q +PIF+P  S+SY+ + C++P C  L     +  +C A+ C+Y + YG
Sbjct: 165 QCEPCARYCYHQQEPIFNPSKSTSYTNISCSSPTCDELKSGTGNSPSCSASTCVYGIQYG 224

Query: 240 DGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQ 295
           D S++VG    + ++  ++        GCG +N GLFVG AGL+GLG   LSL  +
Sbjct: 225 DQSYSVGFFAQDKLALTSTDVFNNFLFGCGQNNRGLFVGVAGLIGLGRNALSLMSK 280



 Score = 68.2 bits (165), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 36/91 (39%), Positives = 53/91 (58%), Gaps = 7/91 (7%)

Query: 409 ALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKN--YLIPVDSAGTFCFAFAPTSSA- 465
           ++ DTCYDFS   +V VP ++L+F  G  +DL      Y++ +      C AFA  S A 
Sbjct: 288 SILDTCYDFSQYDTVDVPKINLYFSDGAEMDLDPSGIFYILNISQV---CLAFAGNSDAT 344

Query: 466 -LSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            ++I+GNVQQ+   V +D+A  R+GF P  C
Sbjct: 345 DIAILGNVQQKTFDVVYDVAGGRIGFAPGGC 375


>gi|77808087|gb|AAS48510.2| aspartic protease [Fagopyrum esculentum]
 gi|82780908|gb|ABB88696.2| aspartic proteinase-like protein [Fagopyrum esculentum]
          Length = 447

 Score =  150 bits (380), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 112/379 (29%), Positives = 170/379 (44%), Gaps = 48/379 (12%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----------PIFDPKT 205
           G Y     +GTPP++ S+VLDTGS + W  C   T  Y   +           PI+    
Sbjct: 72  GGYSVIFSLGTPPQKVSLVLDTGSSLVWTPCTIPTATYTCQNCTFSGVDPTKIPIYARNK 131

Query: 206 SSSYSPLPCAAPQCKSL---DVSACRANRC-LYQVAYGDGSFTVGDLVTETVSFGNSGSV 261
           SS+   LPC +P+C  +   D++     RC  Y + YG GS T G LV++ +       +
Sbjct: 132 SSTVQSLPCRSPKCNWVFGSDLNCSTTKRCPYYGLEYGLGS-TTGQLVSDVLGLSKLNRI 190

Query: 262 KGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDR---DSPASGVLEF 318
                GC   +        G+ G G G+ S+  Q+  T  +YCLV     D+P SG L  
Sbjct: 191 PDFLFGCSLVSN---RQPEGIAGFGRGLASIPAQLGLTKFSYCLVSHRFDDTPQSGDLVL 247

Query: 319 NSAR-GGDAVT-----APLIRNKKVD---TFYYVGLTGFSVGGQAVQIPPSLFEMDEAGD 369
           +  R   DA       AP  ++  +     +YY+ L+   VGG+ V IPP      + GD
Sbjct: 248 HRGRRHADAAANGVAYAPFTKSPALSPYSEYYYISLSKILVGGKDVPIPPRYLVPSKEGD 307

Query: 370 GGIIVDCGTAITRLQTQAYNSLRDSF------VRLAGNLKPTSGVALFDTCYDFSGLRSV 423
           GG+IVD G+  T ++   ++ +           + A  ++ +SG+     CY+ +G   V
Sbjct: 308 GGMIVDSGSTFTFMERIIFDPVARELEKHMTKYKRAKEIEDSSGLG---PCYNITGQSEV 364

Query: 424 RVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAF-------APTSSALSIIGNVQQQG 476
            VP ++  F  G  +DLP  +Y   V + G  C            T+    I+GN QQQ 
Sbjct: 365 DVPKLTFSFKGGANMDLPLTDYFSLV-TDGVVCMTVLTDPDEPGSTTGPAIILGNYQQQN 423

Query: 477 TRVSFDLANNRVGFTPNKC 495
             + +DL   R GF P +C
Sbjct: 424 FYIEYDLKKQRFGFKPQQC 442


>gi|15226315|ref|NP_180368.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|4510415|gb|AAD21501.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|330252975|gb|AEC08069.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 396

 Score =  150 bits (380), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 104/353 (29%), Positives = 162/353 (45%), Gaps = 42/353 (11%)

Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQ 218
           Y  ++ VGTPP +   ++DTGS+I W QC PC  CY+Q+ PIFDP  SS++         
Sbjct: 65  YLMKLQVGTPPFEIQAIIDTGSEITWTQCLPCVHCYEQNAPIFDPSKSSTFKE------- 117

Query: 219 CKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSF----GNSGSVKGIALGCGHDNEG 274
                   C  + C Y+V Y D ++T+G L TET++     G    +    +GCGH+N  
Sbjct: 118 ------KRCDGHSCPYEVDYFDHTYTMGTLATETITLHSTSGEPFVMPETIIGCGHNNSW 171

Query: 275 LFVGSAGLLGLGGGMLSLTKQIKATS---LAYCLVDRDSPASGVLEF--NSARGGDAVTA 329
                +G++GL  G  SL  Q+       ++YC   +    +  + F  N+   GD V +
Sbjct: 172 FKPSFSGMVGLNWGPSSLITQMGGEYPGLMSYCFSGQ---GTSKINFGANAIVAGDGVVS 228

Query: 330 -PLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAY 388
             +        FYY+ L   SVG   ++   + F    A +G I++D GT +T       
Sbjct: 229 TTMFMTTAKPGFYYLNLDAVSVGNTRIETMGTTF---HALEGNIVIDSGTTLTYFPVSYC 285

Query: 389 NSLRDSFVRLAGNLK---PTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNY 445
           N +R +   +   ++   PT    L   CY+   +     P +++HF  G  L L   N 
Sbjct: 286 NLVRQAVEHVVTAVRAADPTGNDML---CYNSDTID--IFPVITMHFSGGVDLVLDKYNM 340

Query: 446 LIPVDSAGTFCFAF---APTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            +  ++ G FC A    +PT  A  I GN  Q    V +D ++  V F+P  C
Sbjct: 341 YMESNNGGVFCLAIICNSPTQEA--IFGNRAQNNFLVGYDSSSLLVSFSPTNC 391


>gi|242089623|ref|XP_002440644.1| hypothetical protein SORBIDRAFT_09g004500 [Sorghum bicolor]
 gi|241945929|gb|EES19074.1| hypothetical protein SORBIDRAFT_09g004500 [Sorghum bicolor]
          Length = 469

 Score =  150 bits (379), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 113/354 (31%), Positives = 163/354 (46%), Gaps = 29/354 (8%)

Query: 155 GSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPC 214
           G G Y     +GTPP++ + + DTGSD+ W +C             + P  SS+++ LPC
Sbjct: 96  GGGAYDMEFSIGTPPQKLTALADTGSDLIWTKCDAGGGAAWGGSSSYHPNASSTFTRLPC 155

Query: 215 AAPQC---KSLDVSACRAN--RCLYQVAYG---DGSFTVGDLVTETVSFGNSGSVKGIAL 266
           +   C   +S  ++ C A    C Y+ AYG   D  FT G L +ET + G   +V G+  
Sbjct: 156 SDRLCAALRSYSLARCAAGGAECDYKYAYGLGDDPDFTQGFLGSETFTLGGD-AVPGVGF 214

Query: 267 GCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPAS----GVLEFNSAR 322
           GC    EG +   AGL+GLG G LSL  Q+ A +  YCL    S AS    G L   +  
Sbjct: 215 GCTTALEGDYGEGAGLVGLGRGPLSLVSQLDAGTFMYCLTADASKASPLLFGALATMTGA 274

Query: 323 GGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITR 382
           G    +  L+ +    TFY V L   ++G            +        + D GT +T 
Sbjct: 275 GAGVQSTGLLAST---TFYAVNLRSITIGSATTAGVGGPGGV--------VFDSGTTLTY 323

Query: 383 LQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVR-VPTVSLHFGAGKALDLP 441
           L   AY   + +F+    +L P  G   F+ CY+     S R +P + LHF  G  + LP
Sbjct: 324 LAEPAYTEAKAAFLSQTTSLTPVEGRYGFEACYEKP--DSARLIPAMVLHFDGGADMALP 381

Query: 442 AKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
             NY++ VD  G  C+     S +LSIIGN+ Q    V  D+  + + F P  C
Sbjct: 382 VANYVVEVDD-GVVCWV-VQRSPSLSIIGNIMQMNYLVLHDVRKSVLSFQPANC 433


>gi|222619890|gb|EEE56022.1| hypothetical protein OsJ_04800 [Oryza sativa Japonica Group]
          Length = 423

 Score =  150 bits (379), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 110/362 (30%), Positives = 159/362 (43%), Gaps = 65/362 (17%)

Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAP 217
            Y +R G+GTP +   + +D  +D  W+ C  C  C   S P F P  SS+Y  +PC +P
Sbjct: 101 NYIARAGLGTPAQTLLVAIDPSNDAAWVPCSACAGC-AASSPSFSPTQSSTYRTVPCGSP 159

Query: 218 QCKSLDVSACRA---NRCLYQVAY---------GDGSFTVGDLVTETVSFGNSGSVKGIA 265
           QC  +   +C A   + C + + Y         G  S  + + V  + +FG    V G +
Sbjct: 160 QCAQVPSPSCPAGVGSSCGFNLTYAASTFQAVLGQDSLALENNVVVSYTFGCLRVVNGNS 219

Query: 266 LGCGHDNEG------LFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFN 319
                 +        L V   G LG  G      K+IK T                    
Sbjct: 220 RAAAGAHRLRPRAALLLVADQGHLGPIG----QPKRIKTT-------------------- 255

Query: 320 SARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTA 379
                     PL+ N    + YYV + G  VG + VQ+P S    +     G I+D GT 
Sbjct: 256 ----------PLLYNPHRPSLYYVNMIGIRVGSKVVQVPQSALAFNPVTGSGTIIDAGTM 305

Query: 380 ITRLQTQAYNSLRDSFV-RLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKAL 438
            TRL    Y ++RD+F  R+   + P  G   FDTCY+     +V VPTV+  F    A+
Sbjct: 306 FTRLAAPVYAAVRDAFRGRVRTPVAPPLGG--FDTCYNV----TVSVPTVTFMFAGAVAV 359

Query: 439 DLPAKNYLIPVDSAGTFCFAFAP-----TSSALSIIGNVQQQGTRVSFDLANNRVGFTPN 493
            LP +N +I   S G  C A A       ++AL+++ ++QQQ  RV FD+AN RVGF+  
Sbjct: 360 TLPEENVMIHSSSGGVACLAMAAGPSDGVNAALNVLASMQQQNQRVLFDVANGRVGFSRE 419

Query: 494 KC 495
            C
Sbjct: 420 LC 421


>gi|224102847|ref|XP_002312826.1| predicted protein [Populus trichocarpa]
 gi|222849234|gb|EEE86781.1| predicted protein [Populus trichocarpa]
          Length = 445

 Score =  150 bits (379), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 121/393 (30%), Positives = 172/393 (43%), Gaps = 58/393 (14%)

Query: 153 SQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRP---CTECYQQSDPI------FDP 203
           S   G Y   +  GTPP+  S ++DTGSDI W  C     C  C   S         F P
Sbjct: 61  SHSYGGYSVSLSFGTPPQTLSFIMDTGSDIVWFPCTSHYLCKHCSFSSSSPSSRIQPFIP 120

Query: 204 KTSSSYSPLPCAAPQCKSLDVSA------CRANRCL------YQVAYGDGSFTVGDLVTE 251
           K SSS   L C  P+C  +  S       C    CL      Y + YG G+ T G  ++E
Sbjct: 121 KESSSSKLLGCKNPKCSWIHHSNINCDQDCSIKSCLNQTCPPYMIFYGSGT-TGGVALSE 179

Query: 252 TVSFGNSGSVKGIALGCGHDNEGLFVGS--AGLLGLGGGMLSLTKQIKATSLAYCLV--- 306
           T+   +S S     +GC      +F     AG+ G G G+ SL  Q+     +YCL+   
Sbjct: 180 TLHL-HSLSKPNFLVGCS-----VFSSHQPAGIAGFGRGLSSLPSQLGLGKFSYCLLSHR 233

Query: 307 -DRDSPASGVL-----EFNSARGGDA-VTAPLIRNKKVDT------FYYVGLTGFSVGGQ 353
            D D+  S  L     + +S +  +A V  P ++N KVD       +YY+GL   +VGG 
Sbjct: 234 FDDDTKKSSSLVLDMEQLDSDKKTNALVYTPFVKNPKVDNKSSFSVYYYLGLRRITVGGH 293

Query: 354 AVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGN---LKPTSGVAL 410
            V++P       E G+GG+I+D GT  T +  +A+  L D F+R   +   +K       
Sbjct: 294 HVKVPYKYLSPGEDGNGGVIIDSGTTFTFMAREAFEPLSDEFIRQIKDYRRVKEIEDAIG 353

Query: 411 FDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALS--- 467
              C++ S  ++V  P + L+F  G  + LP +NY   V      C        A     
Sbjct: 354 LRPCFNVSDAKTVSFPELRLYFKGGADVALPVENYFAFV-GGEVACLTVVTDGVAGPERV 412

Query: 468 -----IIGNVQQQGTRVSFDLANNRVGFTPNKC 495
                I+GN Q Q   V +DL N R+GF   KC
Sbjct: 413 GGPGMILGNFQMQNFYVEYDLRNERLGFKQEKC 445


>gi|255563741|ref|XP_002522872.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223537956|gb|EEF39570.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 448

 Score =  150 bits (379), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 134/467 (28%), Positives = 206/467 (44%), Gaps = 51/467 (10%)

Query: 55  PETL-EPFAEESETAAESFPLNSSSSFSLPLHSREILHK-TRHNDYRSLVLSRLERDSAR 112
           P++L  PF   +  +   F ++    FSL     EI+H+ +R + +    ++  ER    
Sbjct: 2   PQSLASPFVYLTILSLIHFAISKPDGFSL-----EIVHRYSRESPFYPGNITDYER---- 52

Query: 113 VNTLITKLQLAIYNVDRHELKPAEAQ-ILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQ 171
               IT+L + +  +  H L    +    PE F   +    SQ    Y  ++ +G+P   
Sbjct: 53  ----ITRL-VELSKIRAHNLAITTSSGFSPEAFRLRI----SQDDTCYLVKVIIGSPGVP 103

Query: 172 FSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQC-KSLDVSACRAN 230
             +V DTGS + W QC PCT  ++Q  PIF+   S +Y  LPC    C  + +V  CR +
Sbjct: 104 LYLVPDTGSGLFWTQCEPCTRRFRQLPPIFNSTASRTYRDLPCQHQFCTNNQNVFQCRDD 163

Query: 231 RCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGL-----FVGSAGLLGL 285
           +C+Y++AY  GS T G    + +    +  +     GC  DN+            G++GL
Sbjct: 164 KCVYRIAYAGGSATAGVAAQDILQSAENDRIP-FYFGCSRDNQNFSTFESSGKGGGIIGL 222

Query: 286 GGGMLSLTKQ---IKATSLAYC--LVDRDSP--ASGVLEF-NSARGG--DAVTAPLIRNK 335
               +SL +Q   I     +YC  L D  SP  A+ +L F N  R      ++ P +  +
Sbjct: 223 NMSPVSLLQQMNHITKNRFSYCLNLFDLSSPSHATSLLRFGNDIRKSRRKYLSTPFVSPR 282

Query: 336 KVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSF 395
            +   Y++ L   SV G  +QIPP  F +   G GG I+D GTA+T +   AY  +  +F
Sbjct: 283 GMPN-YFLNLIDVSVAGNRMQIPPGTFALKPDGTGGTIIDSGTAVTYISQTAYFPVITAF 341

Query: 396 VRLAGNLKPTSGVALFD------TCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPV 449
                N     G    +       CY   G      P+++ HF        P   YL  V
Sbjct: 342 ----KNYFDQHGFQRVNIQLSGYICYKQQGHTFHNYPSMAFHFQGADFFVEPEYVYLT-V 396

Query: 450 DSAGTFCFAFAPTS-SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
              G FC A  P S    +IIG + Q  T+  +D AN ++ FTP  C
Sbjct: 397 QDRGAFCVALQPISPQQRTIIGALNQANTQFIYDAANRQLLFTPENC 443


>gi|357491933|ref|XP_003616254.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355517589|gb|AES99212.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 442

 Score =  150 bits (379), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 118/362 (32%), Positives = 167/362 (46%), Gaps = 43/362 (11%)

Query: 165 VGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSP---LPCAAPQCKS 221
           +GTPP+   MVLDTGS ++W+QC       ++  P       S  S    LPC  P CK 
Sbjct: 88  IGTPPQLQQMVLDTGSQLSWIQCHNKKTPQKKQPPTTSSFDPSLSSSFFVLPCNHPLCKP 147

Query: 222 L--DVSA---CRANR-CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGL 275
              D S    C AN  C Y   Y DG++  G+LV E ++F  S +   I LGC   ++  
Sbjct: 148 RVPDFSLPTDCDANSLCHYSYFYADGTYAEGNLVREKIAFSPSQTTPPIILGCATQSDD- 206

Query: 276 FVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDS-PASGVLEFNSARGGDAVTAPLIRN 334
              + G+LG+  G L    Q K T  +YC+  + + PASG         G+   +   R 
Sbjct: 207 ---ARGILGMNLGRLGFPSQAKITKFSYCVPTKQAQPASGSFYL-----GNNPASSSFRY 258

Query: 335 KKVDTF-------------YYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAIT 381
             + TF             Y + L G S+GG+ + IPPS+F+ +  G G  ++D G+  T
Sbjct: 259 VNLLTFGQSQRMPNLDPLAYTLPLQGISIGGKKLNIPPSVFKPNAGGSGQTMIDSGSEFT 318

Query: 382 RLQTQAYNSLRDSFVRLAG-NLKP---TSGVALFDTCYDFSGLRSVR-VPTVSLHFGAGK 436
            L  +AYN +R+  V+  G  +K      GVA  D C+D   +   R V  +   F  G 
Sbjct: 319 YLVDEAYNVIREELVKKVGPKIKKGYMYGGVA--DICFDGDAIEIGRLVGDMVFEFEKGV 376

Query: 437 ALDLPAKNYLIPVDSAGTFCFAFAPTS---SALSIIGNVQQQGTRVSFDLANNRVGFTPN 493
            + +P +  L  VD  G  C     +    +  +IIGN  QQ   V FDLAN RVGF   
Sbjct: 377 QIVIPKERVLATVD-GGVHCLGMGRSERLGAGGNIIGNFHQQNLWVEFDLANRRVGFGEA 435

Query: 494 KC 495
            C
Sbjct: 436 DC 437


>gi|388502484|gb|AFK39308.1| unknown [Medicago truncatula]
          Length = 425

 Score =  150 bits (379), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 111/356 (31%), Positives = 167/356 (46%), Gaps = 30/356 (8%)

Query: 147 PVVSGASQ-GSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKT 205
           P+ SG     S  Y  R  +GTPP+   + +DT +D  W+   PCT C   +  +F P+ 
Sbjct: 80  PIASGRQIIQSPTYIVRAKIGTPPQTLLLAMDTSNDAAWI---PCTACDGCASTLFAPEK 136

Query: 206 SSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIA 265
           S+++  + CAAP+CK +    C  +   + + YG  S    +LV +T++   +  V    
Sbjct: 137 STTFKNVSCAAPECKQVPNPGCGVSSRNFNLTYGSSSI-AANLVQDTITLA-TDPVPSYT 194

Query: 266 LGCGHDNEGLFV---GSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFN-SA 321
            GC     G      G  GL      +LS T+ +  ++ +YCL     P+   L F+ S 
Sbjct: 195 FGCVSKTTGTSAPPQGLLGLGRGPLSLLSQTQNLYQSTFSYCL-----PSFKSLNFSGSL 249

Query: 322 RGGDAVT------APLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVD 375
           R G           PL++N +  + YYV L    VG + V IPP+    +     G I D
Sbjct: 250 RLGPVAQPKRIKYTPLLKNPRRSSLYYVNLEAIRVGRKVVDIPPAALAFNPTTGAGTIFD 309

Query: 376 CGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAG 435
            GT  TRL    Y ++RD F R  G     + +  FDTCY+      + VPT++  F  G
Sbjct: 310 SGTVFTRLVAPVYVAVRDEFRRRVGPKLTVTSLGGFDTCYNV----PIVVPTITFIF-TG 364

Query: 436 KALDLPAKNYLIPVDSAGTFCFAFA----PTSSALSIIGNVQQQGTRVSFDLANNR 487
             + LP  N LI   +  T C A A      +S L++I N+QQQ  RV +D+ N+R
Sbjct: 365 MNVTLPQDNILIHSTAGSTTCLAMAGAPDNVNSVLNVIANMQQQNHRVLYDVPNSR 420


>gi|356537928|ref|XP_003537458.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 445

 Score =  150 bits (379), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 125/374 (33%), Positives = 172/374 (45%), Gaps = 58/374 (15%)

Query: 165 VGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKS--- 221
           VGTPP+  +MVLDTGS+++WL C+      Q  + +F+P  SSSY+P+PC +P CK+   
Sbjct: 76  VGTPPQSVTMVLDTGSELSWLHCKK----QQNINSVFNPHLSSSYTPIPCMSPICKTRTR 131

Query: 222 ---LDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHD----NEG 274
              + VS    N C   V+Y D +   G+L ++T +   SG   GI  G        N  
Sbjct: 132 DFLIPVSCDSNNLCHVTVSYADFTSLEGNLASDTFAISGSGQ-PGIIFGSMDSGFSSNAN 190

Query: 275 LFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARG---GDAVTAPL 331
               + GL+G+  G LS   Q+     +YC+  +D  ASGVL F  A     G     PL
Sbjct: 191 EDSKTTGLMGMNRGSLSFVTQMGFPKFSYCISGKD--ASGVLLFGDATFKWLGPLKYTPL 248

Query: 332 IR-NKKVDTF----YYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQ 386
           ++ N  +  F    Y V L G  VG + +Q+P  +F  D  G G  +VD GT  T L   
Sbjct: 249 VKMNTPLPYFDRVAYTVRLMGIRVGSKPLQVPKEIFAPDHTGAGQTMVDSGTRFTFLLGS 308

Query: 387 AYNSLRDSFVRLAGNLKPTSGV--ALFDTCYDFSG-----LRSVR------VPTVSLHFG 433
            Y +LR+ FV        T GV   L D  + F G      R  R      VP V++ F 
Sbjct: 309 VYTALRNEFV------AQTRGVLTLLEDPNFVFEGAMDLCFRVRRGGVVPAVPAVTMVF- 361

Query: 434 AGKALDLPAKNYLIPVDSAG--------TFCFAFAPTSSALSI----IGNVQQQGTRVSF 481
            G  + +  +  L  V   G         +C  F   S  L I    IG+  QQ   + F
Sbjct: 362 EGAEMSVSGERLLYRVGGDGDVAKGNGDVYCLTFG-NSDLLGIEAYVIGHHHQQNVWMEF 420

Query: 482 DLANNRVGFTPNKC 495
           DL N+RVGF   KC
Sbjct: 421 DLVNSRVGFADTKC 434


>gi|30683732|ref|NP_180371.2| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|28392898|gb|AAO41885.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|56382011|gb|AAV85724.1| At2g28040 [Arabidopsis thaliana]
 gi|330252978|gb|AEC08072.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 395

 Score =  150 bits (379), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 110/352 (31%), Positives = 166/352 (47%), Gaps = 39/352 (11%)

Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAP 217
           EY  ++ +GTPP +   VLDTGS+  W QC PC  CY Q+ PIFDP  SS++  +     
Sbjct: 64  EYLMKLQIGTPPFEIEAVLDTGSEHIWTQCLPCVHCYNQTAPIFDPSKSSTFKEI----- 118

Query: 218 QCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSF----GNSGSVKGIALGCGHDNE 273
           +C + D S      C Y++ YG  S+T G LVTETV+     G    +    +GCG +N 
Sbjct: 119 RCDTHDHS------CPYELVYGGKSYTKGTLVTETVTIHSTSGQPFVMPETIIGCGRNNS 172

Query: 274 GLFVGSAGLLGLGGGMLSLTKQIKATS---LAYCLVDRDSPASGVLEF--NSARGGDAVT 328
           G   G AG++GL  G  SL  Q+       ++YC   + +     + F  N+   GD V 
Sbjct: 173 GFKPGFAGVVGLDRGPKSLITQMGGEYPGLMSYCFAGKGTSK---INFGANAIVAGDGVV 229

Query: 329 APLIRNKKVDT-FYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQA 387
           +  +  K     FYY+ L   SVG   ++   + F    A  G I++D G+ +T      
Sbjct: 230 STTVFVKTAKPGFYYLNLDAVSVGNTRIETVGTPF---HALKGNIVIDSGSTLTYFPESY 286

Query: 388 YNSLRDSFVRLAGNLK-PTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYL 446
            N +R +  ++   ++ P S +     CY +S    +  P +++HF  G  L L   N  
Sbjct: 287 CNLVRKAVEQVVTAVRFPRSDIL----CY-YSKTIDI-FPVITMHFSGGADLVLDKYNMY 340

Query: 447 IPVDSAGTFCFAF---APTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           +  ++ G FC A    +P   A  I GN  Q    V +D ++  V F P  C
Sbjct: 341 VASNTGGVFCLAIICNSPIEEA--IFGNRAQNNFLVGYDSSSLLVSFKPTNC 390


>gi|4063754|gb|AAC98462.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|20197956|gb|AAM15330.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
          Length = 389

 Score =  150 bits (378), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 110/352 (31%), Positives = 166/352 (47%), Gaps = 39/352 (11%)

Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAP 217
           EY  ++ +GTPP +   VLDTGS+  W QC PC  CY Q+ PIFDP  SS++  +     
Sbjct: 58  EYLMKLQIGTPPFEIEAVLDTGSEHIWTQCLPCVHCYNQTAPIFDPSKSSTFKEI----- 112

Query: 218 QCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSF----GNSGSVKGIALGCGHDNE 273
           +C + D S      C Y++ YG  S+T G LVTETV+     G    +    +GCG +N 
Sbjct: 113 RCDTHDHS------CPYELVYGGKSYTKGTLVTETVTIHSTSGQPFVMPETIIGCGRNNS 166

Query: 274 GLFVGSAGLLGLGGGMLSLTKQIKATS---LAYCLVDRDSPASGVLEF--NSARGGDAVT 328
           G   G AG++GL  G  SL  Q+       ++YC   + +     + F  N+   GD V 
Sbjct: 167 GFKPGFAGVVGLDRGPKSLITQMGGEYPGLMSYCFAGKGTSK---INFGANAIVAGDGVV 223

Query: 329 APLIRNKKVDT-FYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQA 387
           +  +  K     FYY+ L   SVG   ++   + F    A  G I++D G+ +T      
Sbjct: 224 STTVFVKTAKPGFYYLNLDAVSVGNTRIETVGTPF---HALKGNIVIDSGSTLTYFPESY 280

Query: 388 YNSLRDSFVRLAGNLK-PTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYL 446
            N +R +  ++   ++ P S +     CY +S    +  P +++HF  G  L L   N  
Sbjct: 281 CNLVRKAVEQVVTAVRFPRSDIL----CY-YSKTIDI-FPVITMHFSGGADLVLDKYNMY 334

Query: 447 IPVDSAGTFCFAF---APTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           +  ++ G FC A    +P   A  I GN  Q    V +D ++  V F P  C
Sbjct: 335 VASNTGGVFCLAIICNSPIEEA--IFGNRAQNNFLVGYDSSSLLVSFKPTNC 384


>gi|255554715|ref|XP_002518395.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
 gi|223542240|gb|EEF43782.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
          Length = 489

 Score =  150 bits (378), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 112/377 (29%), Positives = 176/377 (46%), Gaps = 32/377 (8%)

Query: 147 PVVSGASQGSGEYFSRIGVGTP-PRQFSMVLDTGSDINWLQCRPCTECYQQSDP----IF 201
           P+ SGA  G  +YF  I +GTP P++F +V DTGSD+ W+ C    +   + +P    +F
Sbjct: 107 PIHSGADSGQSQYFVSIRIGTPRPQKFILVTDTGSDLTWMNCEYWCKSCPKPNPHPGRVF 166

Query: 202 DPKTSSSYSPLPCAAPQCK-----SLDVSACRANR--CLYQVAYGDGSFTVGDLVTETVS 254
               SSS+  +PC++  CK        ++ C      CL+   Y +G   +G    ETV+
Sbjct: 167 RANDSSSFRTIPCSSDDCKIELQDYFSLTECPNPNAPCLFDYRYLNGPRAIGVFANETVT 226

Query: 255 FGNSGSVK----GIALGCGHDNEGLFVGSAGLLGLGGGMLSLT---KQIKATSLAYCLVD 307
            G +   K     + +GC            G++GLG    SL     +I     +YCLVD
Sbjct: 227 VGLNDHKKIRLFDVLIGCTESFNETNGFPDGVMGLGYRKHSLALRLAEIFGNKFSYCLVD 286

Query: 308 RDSPASGVLEFNSARGGDAVTAPLIRNKK-----VDTFYYVGLTGFSVGGQAVQIPPSLF 362
             S +S    F S      +  P +++ +     ++ FY V ++G SVGG  + I   ++
Sbjct: 287 HLS-SSNHKNFLSFGDIPEMKLPKMQHTELLLGYINAFYPVNVSGISVGGSMLSISSDIW 345

Query: 363 EMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLK---PTSGVALFDTCYDFSG 419
            +   G GG+IVD GT++T L  +AY+ + D+   +    K   P     L + C++  G
Sbjct: 346 NV--TGVGGMIVDSGTSLTMLAGEAYDKVVDALKPIFDKHKKVVPIELPELNNFCFEDKG 403

Query: 420 LRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTS-SALSIIGNVQQQGTR 478
                VP + +HF  G     P K+Y+I V + G  C           SI+GNV QQ   
Sbjct: 404 FDRAAVPRLLIHFADGAIFKPPVKSYIIDV-AEGIKCLGIIKADFPGSSILGNVMQQNHL 462

Query: 479 VSFDLANNRVGFTPNKC 495
             +DL   ++GF P+ C
Sbjct: 463 WEYDLGRGKLGFGPSSC 479


>gi|125532788|gb|EAY79353.1| hypothetical protein OsI_34482 [Oryza sativa Indica Group]
          Length = 394

 Score =  150 bits (378), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 105/357 (29%), Positives = 165/357 (46%), Gaps = 35/357 (9%)

Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAP 217
            Y +   +GTPP+  S V+D   ++ W QC+ C+ C++Q  P+FDP  S++Y   PC  P
Sbjct: 50  NYVANFTIGTPPQPASAVIDLAGELVWTQCKQCSRCFEQDTPLFDPTASNTYRAEPCGTP 109

Query: 218 QCKSL--DVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGC----GHD 271
            C+S+  D   C  N C YQ +   G  T G + T+T + G + +   +A GC      D
Sbjct: 110 LCESIPSDSRNCSGNVCAYQASTNAGD-TGGKVGTDTFAVGTAKA--SLAFGCVVASDID 166

Query: 272 NEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNS----ARGGDAV 327
             G   G +G++GLG    SL  Q    + +YCL   D+  +  L   S    A GG A 
Sbjct: 167 TMG---GPSGIVGLGRTPWSLVTQTGVAAFSYCLAPHDAGKNSALFLGSSAKLAGGGKAA 223

Query: 328 TAPLI----RNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRL 383
           + P +        +  +Y V L G   G   + +PPS           +++D  + I+ L
Sbjct: 224 STPFVNISGNGNDLSNYYKVQLEGLKAGDAMIPLPPS--------GSTVLLDTFSPISFL 275

Query: 384 QTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAK 443
              AY +++ +     G     + V  FD C+  SG  S   P +   F  G A+ + A 
Sbjct: 276 VDGAYQAVKKAVTVAVGAPPMATPVEPFDLCFPKSG-ASGAAPDLVFTFRGGAAMTVAAS 334

Query: 444 NYLIPVDSAGTFCFAFAP-----TSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           NYL+   + GT C A        +++ LS++G++QQ+     FDL    + F P  C
Sbjct: 335 NYLLDYKN-GTVCLAMLSSARLNSTTELSLLGSLQQENIHFLFDLDKETLSFEPADC 390


>gi|125558627|gb|EAZ04163.1| hypothetical protein OsI_26305 [Oryza sativa Indica Group]
          Length = 404

 Score =  149 bits (377), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 119/358 (33%), Positives = 163/358 (45%), Gaps = 61/358 (17%)

Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCA 215
           +G Y   + +GTPP  FS++ DTGS + W QC PCTEC  +  P F P +SS++S LPCA
Sbjct: 87  AGAYNMNLSIGTPPVTFSVLADTGSSLIWTQCAPCTECAARPAPPFQPASSSTFSKLPCA 146

Query: 216 APQCKSLD--VSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNE 273
           +  C+ L      C A  C+Y   YG G FT G L TET+  G + S  G+  GC  +N 
Sbjct: 147 SSLCQFLTSPYRTCNATGCVYYYPYGMG-FTAGYLATETLHVGGA-SFPGVTFGCSTEN- 203

Query: 274 GLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNS---ARGGDAVTAP 330
           G+   S+G++GLG   LSL  Q+     +YCL          + F S     GG+  + P
Sbjct: 204 GVGNSSSGIVGLGRSPLSLVSQVGVARFSYCLRSNADAGDSPILFGSLAKVTGGNVQSTP 263

Query: 331 LIRNKKV--DTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAY 388
           L+ N ++   ++YYV LTG +VG  A  +P ++                           
Sbjct: 264 LLENPEMPSSSYYYVNLTGITVG--ATDLPMAM--------------------------- 294

Query: 389 NSLRDSFVRLAGNLKPTSGVAL-FDTCYD---FSGLRSVRVPTVSLHFGAGKALDLPAKN 444
                       NL   +G    FD C+D     G   V VPT+ L F  G    +  ++
Sbjct: 295 -----------ANLTTVNGTRFGFDLCFDATAAGGGGGVPVPTLVLRFAGGAEYAVRRRS 343

Query: 445 Y--LIPVDSAG---TFCFAFAPTSSAL--SIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           Y  ++ VDS G     C    P S  L  SIIGNV Q    V +DL      F P  C
Sbjct: 344 YFGVVEVDSQGRAAVECLLVLPASEKLSISIIGNVMQMDLHVLYDLDGGMFSFAPADC 401


>gi|20160862|dbj|BAB89801.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
          Length = 488

 Score =  149 bits (377), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 106/345 (30%), Positives = 168/345 (48%), Gaps = 20/345 (5%)

Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCA 215
           +G Y    G+GTPP+Q S  LD  SD+ W  C             F+P  S++ + +PC 
Sbjct: 97  AGMYVFSYGIGTPPQQVSGALDISSDLVWTACGATAP--------FNPVRSTTVADVPCT 148

Query: 216 APQCKSLDVSACRA--NRCLYQVAYGDGSF-TVGDLVTETVSFGNSGSVKGIALGCGHDN 272
              C+      C A  + C Y   YG G+  T G L TE  +FG++  + G+  GCG  N
Sbjct: 149 DDACQQFAPQTCGAGASECAYTYMYGGGAANTTGLLGTEAFTFGDT-RIDGVVFGCGLKN 207

Query: 273 EGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDS--PASGVLEFNSA--RGGDAVT 328
            G F G +G++GLG G LSL  Q++    +Y     DS    S +L  + A  +    ++
Sbjct: 208 VGDFSGVSGVIGLGRGNLSLVSQLQVDRFSYHFAPDDSVDTQSFILFGDDATPQTSHTLS 267

Query: 329 APLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEM-DEAGDGGIIVDCGTAITRLQTQA 387
             L+ +    + YYV L G  V G+ + IP   F++ ++ G GG+ +     +T L+  A
Sbjct: 268 TRLLASDANPSLYYVELAGIQVDGKDLAIPSGTFDLRNKDGSGGVFLSITDLVTVLEEAA 327

Query: 388 YNSLRDSFVRLAGNLKPTSGVAL-FDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYL 446
           Y  LR +     G L   +G AL  D CY    L   +VP+++L F  G  ++L   NY 
Sbjct: 328 YKPLRQAVASKIG-LPAVNGSALGLDLCYTGESLAKAKVPSMALVFAGGAVMELELGNYF 386

Query: 447 IPVDSAGTFCFAFAPTSSA-LSIIGNVQQQGTRVSFDLANNRVGF 490
               + G  C    P+S+   S++G++ Q GT + +D+  +++ F
Sbjct: 387 YMDSTTGLACLTILPSSAGDGSVLGSLIQVGTHMMYDINGSKLVF 431


>gi|302824729|ref|XP_002994005.1| hypothetical protein SELMODRAFT_431957 [Selaginella moellendorffii]
 gi|300138167|gb|EFJ04945.1| hypothetical protein SELMODRAFT_431957 [Selaginella moellendorffii]
          Length = 462

 Score =  149 bits (376), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 116/404 (28%), Positives = 190/404 (47%), Gaps = 46/404 (11%)

Query: 129 RHELKPAEAQILPEDFSTPVVSGASQGS-----GEYFSRIGVGTPPRQFSMVLDTGSDIN 183
           +  L P  A I  +    P    +S  +     GEY++ I +G+P ++  +++DTGS++ 
Sbjct: 65  QKSLFPYSAHIFQQHTKNPAALRSSTTTLGRKFGEYYTSIKLGSPGQEAILIVDTGSELT 124

Query: 184 WLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQC-----KSLDVSACRANRCLYQVAY 238
           WLQC PC  C    D I+D   S+SY P+ C   Q      +       R ++C +   Y
Sbjct: 125 WLQCLPCKVCAPSVDTIYDAARSASYRPVTCNNSQLCSNSSQGTYAYCARGSQCQFAAFY 184

Query: 239 GDGSFTVGD-----LVTETVSFGNSGSVKGIALGCGH-DNEGLFVGSAGLLGLGGGMLSL 292
           GDGSF+ G      L+ ETV  G   +V+  A GC   D E +  G++G+LGL  G ++L
Sbjct: 185 GDGSFSYGSLSTDTLIMETVVGGKPVTVQDFAFGCAQGDLELVPTGASGILGLNAGKMAL 244

Query: 293 TKQIK---ATSLAYCLVDRDSP--ASGVLEFNSA----RGGDAVTAPLIRNKKVDTFYYV 343
             Q+        ++C  DR S   ++GV+ F +A          +  L  ++    FY+V
Sbjct: 245 PMQLGQRFGWKFSHCFPDRSSHLNSTGVVFFGNAELPHEQVQYTSVALTNSELQRKFYHV 304

Query: 344 GLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVR-LAGNL 402
            L G S+    +   P            +I+D G++ +      ++ LR++F++    +L
Sbjct: 305 ALKGVSINSHELVFLPR--------GSVVILDSGSSFSSFVRPFHSQLREAFLKHRPPSL 356

Query: 403 KPTSGVALFD--TCY-----DFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPV---DSA 452
           K   G +  D  TC+     D   L    +P++SL F  G  + +P+   L+PV    + 
Sbjct: 357 KHLEGDSFGDLGTCFKVSNDDIDELHRT-LPSLSLVFEDGVTIGIPSIGVLLPVARFQNH 415

Query: 453 GTFCFAFAPTS-SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
              CFAF     + +++IGN QQQ   V +D+  +RVGF    C
Sbjct: 416 VKMCFAFEDGGPNPVNVIGNYQQQNLWVEYDIQRSRVGFARASC 459


>gi|297806153|ref|XP_002870960.1| hypothetical protein ARALYDRAFT_908082 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297316797|gb|EFH47219.1| hypothetical protein ARALYDRAFT_908082 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 453

 Score =  149 bits (375), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 121/372 (32%), Positives = 174/372 (46%), Gaps = 55/372 (14%)

Query: 168 PPRQFSMVLDTGSDINWLQCRPCTECYQQSDPI--FDPKTSSSYSPLPCAAPQCKS---- 221
           PP+  SMV+DTGS+++WL+C   +      +P+  FDP  SSSYSP+PC++P C++    
Sbjct: 82  PPQNISMVIDTGSELSWLRCNRSSN----PNPVNNFDPTRSSSYSPIPCSSPTCRTRTRD 137

Query: 222 -LDVSACRANR-CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGC-----GHDNEG 274
            L  ++C +++ C   ++Y D S + G+L  E   FGNS +   +  GC     G D E 
Sbjct: 138 FLIPASCDSDKLCHATLSYADASSSEGNLAAEIFHFGNSTNDSNLIFGCMGSVSGSDPEE 197

Query: 275 LFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVL--EFNSARGGDAVTAPLI 332
               + GLLG+  G LS   Q+     +YC+   D     +L  + N          PLI
Sbjct: 198 D-TKTTGLLGMNRGSLSFISQMGFPKFSYCISGTDDFPGFLLLGDSNFTWLTPLNYTPLI 256

Query: 333 R-NKKVDTF----YYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQA 387
           R +  +  F    Y V LTG  V G+ + IP S+   D  G G  +VD GT  T L    
Sbjct: 257 RISTPLPYFDRVAYTVQLTGIKVNGKLLPIPKSVLLPDHTGAGQTMVDSGTQFTFLLGPV 316

Query: 388 YNSLRDSFVRLAGNLKPTSGV------------ALFDTCYDFSGLRSV-----RVPTVSL 430
           Y +LR  F      L  T+G+               D CY  S  R       R+PTVSL
Sbjct: 317 YTALRSDF------LNQTNGILTVYEDPEFVFQGTMDLCYRISPFRIRTGILHRLPTVSL 370

Query: 431 HF-GAGKALDLPAKNYLIPVDSAGT---FCFAFAPTS---SALSIIGNVQQQGTRVSFDL 483
            F GA  A+      Y +P  +AG    +CF F  +        +IG+  QQ   + FDL
Sbjct: 371 VFEGAEIAVSGQPLLYRVPHLTAGNDSVYCFTFGNSDLMGMEAYVIGHHHQQNMWIEFDL 430

Query: 484 ANNRVGFTPNKC 495
             +R+G  P +C
Sbjct: 431 QRSRIGLAPVQC 442


>gi|15234606|ref|NP_194732.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|4938479|emb|CAB43838.1| putative protein [Arabidopsis thaliana]
 gi|7269903|emb|CAB80996.1| putative protein [Arabidopsis thaliana]
 gi|67633774|gb|AAY78811.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332660310|gb|AEE85710.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 424

 Score =  148 bits (374), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 106/350 (30%), Positives = 165/350 (47%), Gaps = 21/350 (6%)

Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPC-AAP 217
           + + I +G PP    +++DTGSD+ W+ C PC +CY Q+ P F P  SS+Y    C +AP
Sbjct: 78  FLANISIGNPPVPQLLLIDTGSDLTWIHCLPC-KCYPQTIPFFHPSRSSTYRNASCVSAP 136

Query: 218 QCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSG----SVKGIALGCGHDNE 273
                     +   C Y + Y D S T G L  E ++F  S     S + I  GCG DN 
Sbjct: 137 HAMPQIFRDEKTGNCQYHLRYRDFSNTRGILAEEKLTFETSDDGLISKQNIVFGCGQDNS 196

Query: 274 GLFVGSAGLLGLGGGMLSLTKQIKATSLAYC---LVDRDSPASGVLEFNSAR-GGDAVTA 329
           G F   +G+LGLG G  S+  +   +  +YC   L +   P + ++  N A+  GD    
Sbjct: 197 G-FTKYSGVLGLGPGTFSIVTRNFGSKFSYCFGSLTNPTYPHNILILGNGAKIEGDPTPL 255

Query: 330 PLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYN 389
            + +++     YY+ L   S G + + I P  F+   +  GG ++D G + T L  +AY 
Sbjct: 256 QIFQDR-----YYLDLQAISFGEKLLDIEPGTFQRYRS-QGGTVIDTGCSPTILAREAYE 309

Query: 390 SLRDSFVRLAGN-LKPTSGVALFDT-CYDFS-GLRSVRVPTVSLHFGAGKALDLPAKNYL 446
           +L +    L G  L+       + T CY+ +  L     P V+ HF  G  L L  ++  
Sbjct: 310 TLSEEIDFLLGEVLRRVKDWDQYTTPCYEGNLKLDLYGFPVVTFHFAGGAELALDVESLF 369

Query: 447 IPVDSAGTFCFAFA-PTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           +  +S  +FC A    T   +S+IG + QQ   V ++L   +V F    C
Sbjct: 370 VSSESGDSFCLAMTMNTFDDMSVIGAMAQQNYNVGYNLRTMKVYFQRTDC 419


>gi|125528511|gb|EAY76625.1| hypothetical protein OsI_04577 [Oryza sativa Indica Group]
          Length = 492

 Score =  148 bits (374), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 106/349 (30%), Positives = 168/349 (48%), Gaps = 24/349 (6%)

Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCA 215
           +G Y    G+GTPP+Q S  LD  SD+ W  C             F+P  S++ + +PC 
Sbjct: 97  AGMYVFSYGIGTPPQQVSGALDISSDLVWTACGATAP--------FNPVRSTTVADVPCT 148

Query: 216 APQCKSLDVSACRA------NRCLYQVAYGDGSF-TVGDLVTETVSFGNSGSVKGIALGC 268
              C+      C A      + C Y   YG G+  T G L TE  +FG++  + G+  GC
Sbjct: 149 DDACQQFAPQTCGAGAGAGSSECAYTYMYGGGAANTTGLLGTEAFTFGDT-RIDGVVFGC 207

Query: 269 GHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDS--PASGVLEFNSA--RGG 324
           G  N G F G +G++GLG G LSL  Q++    +Y     DS    S +L  + A  +  
Sbjct: 208 GLQNVGDFSGVSGVIGLGRGNLSLVSQLQVDRFSYHFAPDDSVDTQSFILFGDDATPQTS 267

Query: 325 DAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEM-DEAGDGGIIVDCGTAITRL 383
             ++  L+ +    + YYV L G  V G+ + IP   F++ ++ G GG+ +     +T L
Sbjct: 268 HTLSTRLLASDANPSLYYVELAGIQVDGKDLAIPSGTFDLRNKDGSGGVFLSITDLVTVL 327

Query: 384 QTQAYNSLRDSFVRLAGNLKPTSGVAL-FDTCYDFSGLRSVRVPTVSLHFGAGKALDLPA 442
           +  AY  LR +     G L   +G AL  D CY    L   +VP+++L F  G  ++L  
Sbjct: 328 EEAAYKPLRQAVASKIG-LPAVNGSALGLDLCYTGESLAKAKVPSMALVFAGGAVMELEL 386

Query: 443 KNYLIPVDSAGTFCFAFAPTSSA-LSIIGNVQQQGTRVSFDLANNRVGF 490
            NY     + G  C    P+S+   S++G++ Q GT + +D+  +++ F
Sbjct: 387 GNYFYMDSTTGLACLTILPSSAGDGSVLGSLIQVGTHMMYDINGSKLVF 435


>gi|255583547|ref|XP_002532530.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223527742|gb|EEF29846.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 440

 Score =  148 bits (374), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 111/362 (30%), Positives = 167/362 (46%), Gaps = 42/362 (11%)

Query: 165 VGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPI-FDPKTSSSYSPLPCAAPQCKSLD 223
           +GTPP+   MVLDTGS ++W+QC   +   +      FDP  SSS+S LPC  P CK   
Sbjct: 86  IGTPPQTQQMVLDTGSQLSWIQCHKKSVPKKPPPTTSFDPSLSSSFSVLPCNHPLCKPRI 145

Query: 224 V-----SACRANR-CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFV 277
                 + C  NR C Y   Y DG++  G LV E ++F +S S   + LGC   +     
Sbjct: 146 PDFTLPTTCDQNRLCHYSYFYADGTYAEGSLVREKITFSSSQSTPPLILGCAEAS----T 201

Query: 278 GSAGLLGLGGGMLSLTKQIKATSLAYCLVDR---------------DSPASGVLEFNSAR 322
              G+LG+  G  S   Q K +  +YC+  R               ++P SG  ++ +  
Sbjct: 202 DEKGILGMNLGRRSFASQAKISKFSYCVPTRQARAGLSSTGSFYLGNNPNSGRFQYINL- 260

Query: 323 GGDAVTAPLIRNKKVDTF-YYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAIT 381
                  P  R+  +D   Y + + G  +G   + I  +LF  D +G G  I+D G+  T
Sbjct: 261 ---LTFTPSQRSPNLDPLAYTIPMQGIRMGNARLNISATLFRPDPSGAGQTIIDSGSEFT 317

Query: 382 RLQTQAYNSLRDSFVRLAG-NLKP---TSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKA 437
            L  +AYN +R+  VRL G  LK      GV+  D C+D + +   R+   ++ F   K 
Sbjct: 318 YLVDEAYNKVREEVVRLVGPKLKKGYVYGGVS--DMCFDGNPMEIGRL-IGNMVFEFEKG 374

Query: 438 LDLPAKNYLIPVD-SAGTFCFAFAPTS---SALSIIGNVQQQGTRVSFDLANNRVGFTPN 493
           +++    + +  D   G  C     +    +A +IIGN  QQ   V +DLAN R+G    
Sbjct: 375 VEIVIDKWRVLADVGGGVHCIGIGRSEMLGAASNIIGNFHQQNLWVEYDLANRRIGLGKA 434

Query: 494 KC 495
            C
Sbjct: 435 DC 436


>gi|15226358|ref|NP_180389.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|4803959|gb|AAD29831.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|330252998|gb|AEC08092.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 756

 Score =  148 bits (373), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 107/353 (30%), Positives = 160/353 (45%), Gaps = 38/353 (10%)

Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQ 218
           Y  ++ VGTPP +    +DTGSDI W QC PC  CY Q  PIFDP  SS++         
Sbjct: 421 YLMKLQVGTPPFEIVAEIDTGSDIIWTQCMPCPNCYSQFAPIFDPSKSSTFRE------- 473

Query: 219 CKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSF----GNSGSVKGIALGCGHDN-- 272
                   C  N C Y++ Y D +++ G L TETV+     G    +    +GCG DN  
Sbjct: 474 ------QRCNGNSCHYEIIYADKTYSKGILATETVTIPSTSGEPFVMAETKIGCGLDNTN 527

Query: 273 ---EGLFVGSAGLLGLGGGMLSLTKQIKATS---LAYCLVDRDSPASGVLEF--NSARGG 324
               G    S+G++GL  G LSL  Q+       ++YC   +    +  + F  N+   G
Sbjct: 528 LQYSGFASSSSGIVGLNMGPLSLISQMDLPYPGLISYCFSGQ---GTSKINFGTNAIVAG 584

Query: 325 DAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQ 384
           D   A  +  KK + FYY+ L   SV    +    + F    A DG I +D GT +T   
Sbjct: 585 DGTVAADMFIKKDNPFYYLNLDAVSVEDNLIATLGTPF---HAEDGNIFIDSGTTLTYFP 641

Query: 385 TQAYNSLRDSFVRLAGNLK-PTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAK 443
               N +R++  ++   +K P  G      CY +S    +  P +++HF  G  L L   
Sbjct: 642 MSYCNLVREAVEQVVTAVKVPDMGSDNL-LCY-YSDTIDI-FPVITMHFSGGADLVLDKY 698

Query: 444 NYLIPVDSAGTFCFAFAPTSSAL-SIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           N  +   + G FC A      ++ ++ GN  Q    V +D ++N + F+P  C
Sbjct: 699 NMYLETITGGIFCLAIGCNDPSMPAVFGNRAQNNFLVGYDPSSNVISFSPTNC 751



 Score =  141 bits (356), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 106/344 (30%), Positives = 159/344 (46%), Gaps = 46/344 (13%)

Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQ 218
           Y  ++ VGTPP + +  +DTGSD+ W QC PC +CY Q DPIFDP  SS+++        
Sbjct: 82  YLMKLQVGTPPFEIAAEIDTGSDLIWTQCMPCPDCYSQFDPIFDPSKSSTFNE------- 134

Query: 219 CKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSF----GNSGSVKGIALGCG-H--- 270
                   C    C Y++ Y D +++ G L TETV+     G    +    +GCG H   
Sbjct: 135 ------QRCHGKSCHYEIIYEDNTYSKGILATETVTIHSTSGEPFVMAETTIGCGLHNTD 188

Query: 271 -DNEGLFVGSAGLLGLGGGMLSLTKQIKATS---LAYCLVDRDSPASGVLEF--NSARGG 324
            DN G    S+G++GL  G  SL  Q+       ++YC   +    +  + F  N+   G
Sbjct: 189 LDNSGFASSSSGIVGLNMGPRSLISQMDLPYPGLISYCFSGQ---GTSKINFGTNAIVAG 245

Query: 325 DAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQ 384
           D   A  +  KK + FYY+ L   SV    ++   + F    A DG I++D G+ +T   
Sbjct: 246 DGTVAADMFIKKDNPFYYLNLDAVSVEDNRIETLGTPF---HAEDGNIVIDSGSTVTYFP 302

Query: 385 TQAYNSLRDSFVRLAGNLK---PTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLP 441
               N +R +  ++   ++   P+    L   CY FS    +  P +++HF  G  L L 
Sbjct: 303 VSYCNLVRKAVEQVVTAVRVPDPSGNDML---CY-FSETIDI-FPVITMHFSGGADLVLD 357

Query: 442 AKNYLIPVDSAGTFCFAF---APTSSALSIIGNVQQQGTRVSFD 482
             N  +  +S G FC A    +PT  A  I GN  Q    V +D
Sbjct: 358 KYNMYMESNSGGLFCLAIICNSPTQEA--IFGNRAQNNFLVGYD 399


>gi|218189440|gb|EEC71867.1| hypothetical protein OsI_04576 [Oryza sativa Indica Group]
          Length = 508

 Score =  148 bits (373), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 113/372 (30%), Positives = 168/372 (45%), Gaps = 24/372 (6%)

Query: 138 QILPEDFSTPVVSGASQ----GSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTEC 193
           Q +P D       G SQ     +G Y     VGTPP+  + VLD  SD  W+QC  C  C
Sbjct: 72  QAVPADGGENGGGGQSQDPATNTGMYVLSFSVGTPPQVVTGVLDITSDFVWMQCSACATC 131

Query: 194 -----YQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANR--CLYQVAYGDGSF--T 244
                   S P F    SS+   + CA   C+ L    C A+   C Y   YG G+   T
Sbjct: 132 GADAPAATSAPPFYAFLSSTIREVRCANRGCQRLVPQTCSADDSPCGYSYVYGGGAANTT 191

Query: 245 VGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYC 304
            G L  +  +F       G+  GC    EG      G++GLG G LSL  Q++    +Y 
Sbjct: 192 AGLLAVDAFAFATV-RADGVIFGCAVATEGDI---GGVIGLGRGELSLVSQLQIGRFSYY 247

Query: 305 LVDRDSPASG----VLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPS 360
           L   D+   G     L+    R   AV+ PL+ N+   + YYV L G  V G+ + IP  
Sbjct: 248 LAPDDAVDVGSFILFLDDAKPRTSRAVSTPLVANRASRSLYYVELAGIRVDGEDLAIPRG 307

Query: 361 LFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVAL-FDTCYDFSG 419
            F++   G GG+++     +T L   AY  +R +     G L+   G  L  D CY    
Sbjct: 308 TFDLQADGSGGVVLSITIPVTFLDAGAYKVVRQAMASKIG-LRAADGSELGLDLCYTSES 366

Query: 420 LRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSA-LSIIGNVQQQGTR 478
           L + +VP+++L F  G  ++L   NY     + G  C    P+ +   S++G++ Q GT 
Sbjct: 367 LATAKVPSMALVFAGGAVMELEMGNYFYMDSTTGLECLTILPSPAGDGSLLGSLIQVGTH 426

Query: 479 VSFDLANNRVGF 490
           + +D++ +R+ F
Sbjct: 427 MIYDISGSRLVF 438


>gi|307136234|gb|ADN34070.1| aspartic proteinase nepenthesin-1 precursor [Cucumis melo subsp.
           melo]
          Length = 412

 Score =  148 bits (373), Expect = 8e-33,   Method: Compositional matrix adjust.
 Identities = 118/364 (32%), Positives = 172/364 (47%), Gaps = 41/364 (11%)

Query: 165 VGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLD- 223
           VG+PP+Q +MVLDTGS+++WL C+           +F+P +SSSYSP+PC++P C++   
Sbjct: 46  VGSPPQQVTMVLDTGSELSWLHCKKSPNLTS----VFNPLSSSSYSPIPCSSPVCRTRTR 101

Query: 224 -----VSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHD----NEG 274
                V+      C   V+Y D S   G+L ++    G+S ++ G   GC       N  
Sbjct: 102 DLPNPVTCDPKKLCHAIVSYADASSLEGNLASDNFRIGSS-ALPGTLFGCMDSGFSSNSE 160

Query: 275 LFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARG---GDAVTAPL 331
               + GL+G+  G LS   Q+     +YC+  RDS  SGVL F  +     G+    PL
Sbjct: 161 EDAKTTGLMGMNRGSLSFVTQLGLPKFSYCISGRDS--SGVLLFGDSHLSWLGNLTYTPL 218

Query: 332 IR-NKKVDTF----YYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQ 386
           ++ +  +  F    Y V L G  VG + + +P S+F  D  G G  +VD GT  T L   
Sbjct: 219 VQISTPLPYFDRVAYTVQLDGIRVGNKILPLPKSIFAPDHTGAGQTMVDSGTQFTFLLGP 278

Query: 387 AYNSLRDSFV-RLAGNLKPTSGVAL-----FDTCYDF-SGLRSVRVPTVSLHF-GAGKAL 438
            Y +LR+ F+ +  G L P            D CY   +G +   +P VSL F GA   +
Sbjct: 279 VYTALRNEFLEQTKGVLAPLGDPNFVFQGAMDLCYRVPAGGKLPELPAVSLMFRGAEMVV 338

Query: 439 DLPAKNYLIPVDSAG---TFCFAFAPTSSALSI----IGNVQQQGTRVSFDLANNRVGFT 491
                 Y +P    G    +C  F   S  L I    IG+  QQ   + FDL  +RVGF 
Sbjct: 339 GGEVLLYKVPGMMKGKEWVYCLTFG-NSDLLGIEAFVIGHHHQQNVWMEFDLVKSRVGFV 397

Query: 492 PNKC 495
             +C
Sbjct: 398 ETRC 401


>gi|302794412|ref|XP_002978970.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
 gi|300153288|gb|EFJ19927.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
          Length = 462

 Score =  147 bits (372), Expect = 8e-33,   Method: Compositional matrix adjust.
 Identities = 115/404 (28%), Positives = 190/404 (47%), Gaps = 46/404 (11%)

Query: 129 RHELKPAEAQILPEDFSTPVVSGASQGS-----GEYFSRIGVGTPPRQFSMVLDTGSDIN 183
           +  L P  A I  +    P    +S  +     GEY++ I +G+P ++  +++DTGS++ 
Sbjct: 65  QKSLFPYSAHIFQQHTKNPAALRSSTTTLGRKFGEYYTSIKLGSPGQEAILIVDTGSELT 124

Query: 184 WLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQC-----KSLDVSACRANRCLYQVAY 238
           WL+C PC  C    D I+D   S SY P+ C   Q      +       R ++C +   Y
Sbjct: 125 WLKCLPCKVCAPSVDTIYDAARSVSYKPVTCNNSQLCSNSSQGTYAYCARGSQCQFAAFY 184

Query: 239 GDGSFTVGD-----LVTETVSFGNSGSVKGIALGCGH-DNEGLFVGSAGLLGLGGGMLSL 292
           GDGSF+ G      L+ ETV  G   +V+  A GC   D E +  G++G+LGL  G ++L
Sbjct: 185 GDGSFSYGSLSTDTLIMETVVGGKPVTVQDFAFGCAQGDLELVPTGASGILGLNAGKMAL 244

Query: 293 TKQIK---ATSLAYCLVDRDSP--ASGVLEFNSA----RGGDAVTAPLIRNKKVDTFYYV 343
             Q+        ++C  DR S   ++GV+ F +A          +  L  ++    FY+V
Sbjct: 245 PMQLGQRFGWKFSHCFPDRSSHLNSTGVVFFGNAELPHEQVQYTSVALTNSELQRKFYHV 304

Query: 344 GLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVR-LAGNL 402
            L G S+    + + P            +I+D G++ +      ++ LR++F++    +L
Sbjct: 305 ALKGVSINSHELVLLPR--------GSVVILDSGSSFSSFVRPFHSQLREAFLKHRPPSL 356

Query: 403 KPTSGVALFD--TCY-----DFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPV---DSA 452
           K   G +  D  TC+     D   L    +P++SL F  G  + +P+   L+PV    + 
Sbjct: 357 KHLEGDSFGDLGTCFKVSNDDIDELHRT-LPSLSLVFEDGVTIGIPSIGVLLPVARYQNH 415

Query: 453 GTFCFAFAPTS-SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
              CFAF     + +++IGN QQQ   V +D+  +RVGF    C
Sbjct: 416 VKMCFAFEDGGPNPVNVIGNYQQQNLWVEYDIQRSRVGFARASC 459


>gi|15241713|ref|NP_195839.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|75181297|sp|Q9LZL3.1|PCS1L_ARATH RecName: Full=Aspartic proteinase PCS1; AltName: Full=Aspartic
           protease 38; Short=AtASP38; AltName: Full=Protein EMBRYO
           DEFECTIVE 24; AltName: Full=Protein PROMOTION OF CELL
           SURVIVAL 1; Flags: Precursor
 gi|7340693|emb|CAB82992.1| putative protein [Arabidopsis thaliana]
 gi|50897174|gb|AAT85726.1| At5g02190 [Arabidopsis thaliana]
 gi|53828617|gb|AAU94418.1| At5g02190 [Arabidopsis thaliana]
 gi|110742159|dbj|BAE99007.1| hypothetical protein [Arabidopsis thaliana]
 gi|332003059|gb|AED90442.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 453

 Score =  147 bits (372), Expect = 8e-33,   Method: Compositional matrix adjust.
 Identities = 120/372 (32%), Positives = 174/372 (46%), Gaps = 55/372 (14%)

Query: 168 PPRQFSMVLDTGSDINWLQCRPCTECYQQSDPI--FDPKTSSSYSPLPCAAPQCKS---- 221
           PP+  SMV+DTGS+++WL+C   +      +P+  FDP  SSSYSP+PC++P C++    
Sbjct: 82  PPQNISMVIDTGSELSWLRCNRSS----NPNPVNNFDPTRSSSYSPIPCSSPTCRTRTRD 137

Query: 222 -LDVSACRANR-CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGC-----GHDNEG 274
            L  ++C +++ C   ++Y D S + G+L  E   FGNS +   +  GC     G D E 
Sbjct: 138 FLIPASCDSDKLCHATLSYADASSSEGNLAAEIFHFGNSTNDSNLIFGCMGSVSGSDPEE 197

Query: 275 LFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVL--EFNSARGGDAVTAPLI 332
               + GLLG+  G LS   Q+     +YC+   D     +L  + N          PLI
Sbjct: 198 D-TKTTGLLGMNRGSLSFISQMGFPKFSYCISGTDDFPGFLLLGDSNFTWLTPLNYTPLI 256

Query: 333 R-NKKVDTF----YYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQA 387
           R +  +  F    Y V LTG  V G+ + IP S+   D  G G  +VD GT  T L    
Sbjct: 257 RISTPLPYFDRVAYTVQLTGIKVNGKLLPIPKSVLVPDHTGAGQTMVDSGTQFTFLLGPV 316

Query: 388 YNSLRDSFVRLAGNLKPTSGV------------ALFDTCYDFSGLRSV-----RVPTVSL 430
           Y +LR  F      L  T+G+               D CY  S +R       R+PTVSL
Sbjct: 317 YTALRSHF------LNRTNGILTVYEDPDFVFQGTMDLCYRISPVRIRSGILHRLPTVSL 370

Query: 431 HF-GAGKALDLPAKNYLIP---VDSAGTFCFAFAPTS---SALSIIGNVQQQGTRVSFDL 483
            F GA  A+      Y +P   V +   +CF F  +        +IG+  QQ   + FDL
Sbjct: 371 VFEGAEIAVSGQPLLYRVPHLTVGNDSVYCFTFGNSDLMGMEAYVIGHHHQQNMWIEFDL 430

Query: 484 ANNRVGFTPNKC 495
             +R+G  P +C
Sbjct: 431 QRSRIGLAPVEC 442


>gi|356528623|ref|XP_003532899.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 507

 Score =  147 bits (372), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 120/456 (26%), Positives = 191/456 (41%), Gaps = 77/456 (16%)

Query: 106 LERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGV 165
           + RD  R   +  +  ++ Y+  R  L+      +      P+ +G     GEYF+ + V
Sbjct: 62  VNRDGLRRQRMNQRWGVSNYDRRRKGLETTTTTEV----EMPMRAGRDDALGEYFTEVKV 117

Query: 166 GTPPRQFSMVLDTGSDINWLQC-----------------------------------RPC 190
           G+P ++F +  DTGS+  W  C                                   R  
Sbjct: 118 GSPGQRFWLAADTGSEFTWFNCVMRNATTTATTKKTRKNKTKKKHHHHSKRNRTRTTRRT 177

Query: 191 TECYQQSDP---IFDPKTSSSYSPLPCAAPQCK-------SLDVSACRANRCLYQVAYGD 240
            +   +S+P   +F P  S S+  + CA+ +CK       SL +    ++ CLY ++Y D
Sbjct: 178 KKKKAKSNPCKGVFCPHRSKSFQAVTCASQKCKIDLSQLFSLSLCPKPSDPCLYDISYAD 237

Query: 241 GSFTVGDLVTETVSF----GNSGSVKGIALGCGHDNEG---LFVGSAGLLGLGGGMLSLT 293
           GS   G   T+T++     G  G +  + +GC    E        + G+LGLG    S  
Sbjct: 238 GSSAKGFFGTDTITVDLKNGKEGKLNNLTIGCTKSMENGVNFNEDTGGILGLGFAKDSFI 297

Query: 294 KQIK---ATSLAYCLVDRDSP--ASGVLEFNSARGGDAVTAPLIRNKKVDT-----FYYV 343
            +         +YCLVD  S    S  L      GG      L   K+ +      FY V
Sbjct: 298 DKAAYEYGAKFSYCLVDHLSHRNVSSYLTI----GGHHNAKLLGEIKRTELILFPPFYGV 353

Query: 344 GLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLK 403
            + G S+GGQ ++IPP +++ +    GG ++D GT +T L   AY  + ++ ++    +K
Sbjct: 354 NVVGISIGGQMLKIPPQVWDFNS--QGGTLIDSGTTLTALLVPAYEPVFEALIKSLTKVK 411

Query: 404 PTSG--VALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAP 461
             +G      D C+D  G     VP +  HF  G   + P K+Y+I V +    C    P
Sbjct: 412 RVTGEDFGALDFCFDAEGFDDSVVPRLVFHFAGGARFEPPVKSYIIDV-APLVKCIGIVP 470

Query: 462 TSS--ALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
                  S+IGN+ QQ     FDL+ N +GF P+ C
Sbjct: 471 IDGIGGASVIGNIMQQNHLWEFDLSTNTIGFAPSIC 506


>gi|297811181|ref|XP_002873474.1| hypothetical protein ARALYDRAFT_909035 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297319311|gb|EFH49733.1| hypothetical protein ARALYDRAFT_909035 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 293

 Score =  147 bits (371), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 90/234 (38%), Positives = 125/234 (53%), Gaps = 16/234 (6%)

Query: 75  NSSSSFSLPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKP 134
           N+ SS  +        H + + D R      L RD ARV ++ +KL   I +    E+  
Sbjct: 60  NTKSSLRVVHMHGACSHLSSNKDARLDHDEILRRDEARVESIHSKLSKNIAD----EVSK 115

Query: 135 AEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCT-EC 193
           A++  LP        +G   GS  Y   IG+GTP    S++ DTGSD+ W QC PC   C
Sbjct: 116 AKSTKLPAK------NGIILGSPNYIVTIGIGTPKHDISLMFDTGSDLTWTQCEPCLGSC 169

Query: 194 YQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETV 253
           Y Q +P F+P +SSSY  + C++P C + +  +C A+ CLY + YGDGS TVG L  E  
Sbjct: 170 YSQKEPKFNPSSSSSYHNVSCSSPMCGNPE--SCSASNCLYGIGYGDGSVTVGFLAKEKF 227

Query: 254 SFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS---LAYC 304
           +  NS  +  I  GCG +N+G+F+GSAG+LGLG G  S   Q   T     +YC
Sbjct: 228 TLTNSDVLDDIYFGCGENNKGVFIGSAGILGLGPGKFSFPLQTTTTYNNIFSYC 281


>gi|147776733|emb|CAN74676.1| hypothetical protein VITISV_038368 [Vitis vinifera]
          Length = 389

 Score =  147 bits (370), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 96/263 (36%), Positives = 136/263 (51%), Gaps = 21/263 (7%)

Query: 232 CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLS 291
           C Y + YGDGSFT G+L  E + FG    VK    GCG +N+GLF G +GL+GLG   LS
Sbjct: 76  CNYAINYGDGSFTRGELGHEKLKFGTI-LVKDFIFGCGRNNKGLFGGVSGLMGLGRSDLS 134

Query: 292 LTKQ---IKATSLAYCL--VDRDSPASGVLEFNSA--RGGDAVT-APLIRNKKVDTFYYV 343
           L  Q   I     +YCL   +R    S +L  NS+  R    ++ A +I N ++  FY++
Sbjct: 135 LISQTSGIFGGVFSYCLPSTERKGSGSLILGGNSSVYRNSSPISYAKMIENPQLYNFYFI 194

Query: 344 GLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLK 403
            LTG S+GG A+Q P         G   I+VD GT ITRL    Y +L+  F++      
Sbjct: 195 NLTGISIGGVALQAP-------SVGPSRILVDSGTVITRLPPTIYKALKAEFLKQFTGFP 247

Query: 404 PTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKAL--DLPAKNYLIPVDSAGTFCFAFAP 461
           P    ++ DTC++ S  + V +PT+ +HF     L  D+    Y +  D A   C A A 
Sbjct: 248 PAPAFSILDTCFNLSAYQEVDIPTIKMHFEGNAELTVDVTGVFYFVKSD-ASQVCLALAS 306

Query: 462 TS--SALSIIGNVQQQGTRVSFD 482
                 ++I+GN QQ+  RV +D
Sbjct: 307 LEYQDEVAILGNYQQKNLRVIYD 329


>gi|190896584|gb|ACE96805.1| aspartyl protease [Populus tremula]
 gi|190896586|gb|ACE96806.1| aspartyl protease [Populus tremula]
 gi|190896588|gb|ACE96807.1| aspartyl protease [Populus tremula]
 gi|190896590|gb|ACE96808.1| aspartyl protease [Populus tremula]
 gi|190896592|gb|ACE96809.1| aspartyl protease [Populus tremula]
 gi|190896594|gb|ACE96810.1| aspartyl protease [Populus tremula]
 gi|190896596|gb|ACE96811.1| aspartyl protease [Populus tremula]
 gi|190896598|gb|ACE96812.1| aspartyl protease [Populus tremula]
 gi|190896600|gb|ACE96813.1| aspartyl protease [Populus tremula]
 gi|190896602|gb|ACE96814.1| aspartyl protease [Populus tremula]
 gi|190896604|gb|ACE96815.1| aspartyl protease [Populus tremula]
 gi|190896606|gb|ACE96816.1| aspartyl protease [Populus tremula]
 gi|190896610|gb|ACE96818.1| aspartyl protease [Populus tremula]
 gi|190896612|gb|ACE96819.1| aspartyl protease [Populus tremula]
 gi|190896614|gb|ACE96820.1| aspartyl protease [Populus tremula]
 gi|190896616|gb|ACE96821.1| aspartyl protease [Populus tremula]
 gi|190896618|gb|ACE96822.1| aspartyl protease [Populus tremula]
 gi|190896620|gb|ACE96823.1| aspartyl protease [Populus tremula]
 gi|190896622|gb|ACE96824.1| aspartyl protease [Populus tremula]
 gi|190896624|gb|ACE96825.1| aspartyl protease [Populus tremula]
 gi|190896626|gb|ACE96826.1| aspartyl protease [Populus tremula]
 gi|190896628|gb|ACE96827.1| aspartyl protease [Populus tremula]
 gi|190896630|gb|ACE96828.1| aspartyl protease [Populus tremula]
 gi|190896632|gb|ACE96829.1| aspartyl protease [Populus tremula]
 gi|190896634|gb|ACE96830.1| aspartyl protease [Populus tremula]
 gi|190896636|gb|ACE96831.1| aspartyl protease [Populus tremula]
 gi|190896638|gb|ACE96832.1| aspartyl protease [Populus tremula]
 gi|190896640|gb|ACE96833.1| aspartyl protease [Populus tremula]
 gi|190896642|gb|ACE96834.1| aspartyl protease [Populus tremula]
 gi|190896644|gb|ACE96835.1| aspartyl protease [Populus tremula]
 gi|190896646|gb|ACE96836.1| aspartyl protease [Populus tremula]
 gi|190896648|gb|ACE96837.1| aspartyl protease [Populus tremula]
 gi|190896650|gb|ACE96838.1| aspartyl protease [Populus tremula]
 gi|190896652|gb|ACE96839.1| aspartyl protease [Populus tremula]
 gi|190896654|gb|ACE96840.1| aspartyl protease [Populus tremula]
 gi|190896656|gb|ACE96841.1| aspartyl protease [Populus tremula]
 gi|190896658|gb|ACE96842.1| aspartyl protease [Populus tremula]
          Length = 339

 Score =  147 bits (370), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 106/298 (35%), Positives = 140/298 (46%), Gaps = 17/298 (5%)

Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAP 217
            Y  R+ +GTP +Q  MVLDT +D  W+ C  CT C   S   F P  S++   L C+  
Sbjct: 44  NYVVRVKLGTPGQQMFMVLDTSNDAAWVPCSGCTGC---SSTTFLPNASTTLGSLDCSEA 100

Query: 218 QCKSLDVSACRA---NRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEG 274
           QC  +   +C A   + CL+  +YG  S     LV + ++  N   + G   GC +   G
Sbjct: 101 QCSQVRGFSCPATGSSACLFNQSYGGDSSLAATLVQDAITLAND-VIPGFTFGCINAVSG 159

Query: 275 LFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPA-SGVLEFNSARGGDAV-TA 329
             +   GLLGLG G +SL  Q  A      +YCL    S   SG L+        ++ T 
Sbjct: 160 GSIPPQGLLGLGRGPISLISQAGAMYSGVFSYCLPSFKSYYFSGSLKLGPVGQPKSIRTT 219

Query: 330 PLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYN 389
           PL+RN    + YYV LTG SVG   V IP      D     G I+D GT ITR     Y 
Sbjct: 220 PLLRNPHRPSLYYVNLTGVSVGRIKVPIPSEQLVFDPNTGAGTIIDSGTVITRFVQPVYF 279

Query: 390 SLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLI 447
           ++RD F +      P S +  FDTC  F+       P V+LHF  G  L LP +N LI
Sbjct: 280 AIRDEFRKQVNG--PISSLGAFDTC--FAATNEAEAPAVTLHF-EGLNLVLPMENSLI 332


>gi|357465299|ref|XP_003602931.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355491979|gb|AES73182.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 438

 Score =  147 bits (370), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 118/370 (31%), Positives = 167/370 (45%), Gaps = 35/370 (9%)

Query: 145 STPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPK 204
           S P+ SG +   G Y  R+ +GTP +   MVLDT +D  ++    C  C   S   F P 
Sbjct: 84  SAPIASGQAFNIGNYIVRVKIGTPGQLLFMVLDTSTDEAFIPSSGCIGC---SATTFSPN 140

Query: 205 TSSSYSPLPCAAPQCKSLDVSACRAN---RCLYQVAYGDGSFT---VGD---LVTETV-- 253
            S+SY PL C+ PQC  +   +C A     C +  +Y   +++   V D   L T+ +  
Sbjct: 141 ASTSYVPLECSVPQCSQVRGLSCPATGSGACSFNKSYAGSTYSATLVQDSLRLATDVIPS 200

Query: 254 -SFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPA 312
            SFG+  ++ G ++             +        +LS T  + +   +YCL    S  
Sbjct: 201 YSFGSINAISGSSIPAQGLLGLGRGPLS--------LLSQTGSLYSGVFSYCLPSFKSYY 252

Query: 313 -SGVLEFNSARGGDAV-TAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDG 370
            SG L+        ++ T PL+RN +  + Y+V LTG +VG   V  P  L   D     
Sbjct: 253 FSGSLKLGPVGQPKSIRTTPLLRNPRRPSLYFVNLTGITVGKVNVPFPKELLAFDVNTGS 312

Query: 371 GIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSL 430
           G I+D GT ITR     YN++RD F +      P S +  FDTC  F        P ++L
Sbjct: 313 GTIIDSGTVITRFVEPVYNAVRDEFRKQVTG--PFSSLGAFDTC--FVKNYETLAPAITL 368

Query: 431 HFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTS-----SALSIIGNVQQQGTRVSFDLAN 485
           HF     L LP +N LI   S    C A A T      + L++I N QQQ  RV FD  N
Sbjct: 369 HF-TDLDLKLPLENSLIHSSSGSLACLAMASTPKNVNYTVLNVIANYQQQNLRVLFDTVN 427

Query: 486 NRVGFTPNKC 495
           N+VG     C
Sbjct: 428 NKVGIARELC 437


>gi|357159298|ref|XP_003578403.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 442

 Score =  146 bits (369), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 117/361 (32%), Positives = 178/361 (49%), Gaps = 29/361 (8%)

Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPC---TECYQQSDPIFDPKTSSSYSPLPC 214
           +Y +   +G+PP++   ++DTGSD+ W QC        C +Q  P ++   SS++ P+PC
Sbjct: 85  QYIASYLIGSPPQRTEALIDTGSDLIWTQCATTCLPKSCAKQGLPYYNLSQSSTFVPVPC 144

Query: 215 A--APQCKSLDVSACRAN-RCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGC--- 268
           A  A  C +  V  C  +  C +  +YG G   +G L TE+ +F  SG+   +A GC   
Sbjct: 145 ADKAGFCAANGVHLCGLDGSCTFIASYGAGR-VIGSLGTESFAF-ESGTTS-LAFGCVSL 201

Query: 269 GHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVD--RDSPASGVL--EFNSARGG 324
                G    ++GL+GLG G LSL  QI AT  +YCL      S AS  L    +++ GG
Sbjct: 202 TRITSGALNDASGLIGLGRGRLSLVSQIGATRFSYCLTPYFHSSGASSHLFVGASASLGG 261

Query: 325 DAVTAPLIRNKK---VDTFYYVGLTGFSVGGQAV-QIPPSLFEMDEA----GDGGIIVDC 376
              + P +++ K     TFYY+ L G +VG   +  +  + F++ +       GG+I+D 
Sbjct: 262 GGASMPFVKSPKDYPYSTFYYLPLEGITVGKTRLPAVNSTTFQLRQLFKGYWAGGVIIDT 321

Query: 377 GTAITRLQTQAYNSLRDSFVRLAGN--LKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGA 434
           G+ +T+L + AY +L++      GN  L P    +  + C    G + V VP +  HFG 
Sbjct: 322 GSPLTQLASHAYEALKEEVAAQLGNGSLVPAPEDSGLELCVAREGFQKV-VPALVFHFGG 380

Query: 435 GKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNK 494
           G  + +PA +Y  PVD A   C          SIIGN QQQ   + +DL   R  F    
Sbjct: 381 GADMAVPAASYWAPVDKAAA-CMMILEGGYD-SIIGNFQQQDMHLLYDLRRGRFSFQTAD 438

Query: 495 C 495
           C
Sbjct: 439 C 439


>gi|413948408|gb|AFW81057.1| hypothetical protein ZEAMMB73_038743 [Zea mays]
          Length = 469

 Score =  146 bits (369), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 131/427 (30%), Positives = 199/427 (46%), Gaps = 50/427 (11%)

Query: 103 LSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSR 162
           L    RD AR +  I + QLA         +   A +    F+ P+ SGA  G+G+YF R
Sbjct: 57  LGERARDDARRHAYI-RSQLA-------SRRRRAADVGASAFAMPLSSGAYTGTGQYFVR 108

Query: 163 IGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPI--FDPKTSSSYSPLPCAAPQCK 220
             VGTP + F +V DTGSD+ W++CR          P   F    S S++PL C++  C 
Sbjct: 109 FRVGTPAQPFVLVADTGSDLTWVKCRGAAGPPASDPPAREFRASESRSWAPLACSSDTCT 168

Query: 221 S---LDVSACR--ANRCLYQVAYGDGSFTVGDLVTETVSFG--------------NSGSV 261
           S     ++ C   A+ C Y   Y DGS   G + T+  +                    +
Sbjct: 169 SYVPFSLANCSSPASPCAYDYRYKDGSAARGVVGTDAATIALSGSGSEDGSGGGGRRAKL 228

Query: 262 KGIALGCGHDNEGL-FVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSP--ASGV 315
           +G+ LGC    +G  F  S G+L LG   +S   +  A      +YCLVD  +P  AS  
Sbjct: 229 QGVVLGCTATYDGQSFQSSDGVLSLGNSNISFASRAAARFGGRFSYCLVDHLAPRNASSY 288

Query: 316 LEFNSARGGDAVTA---PLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGI 372
           L F     G    A   PL+ +++V  FY V +    V G+A+ IP  ++++     GG 
Sbjct: 289 LTFGPGPEGGGAPAARTPLVLDRRVSPFYAVAVDAVYVAGEALDIPADVWDVGRG--GGA 346

Query: 373 IVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVAL--FDTCYDFSGLRSVRVPTVSL 430
           I+D GT++T L T AY ++      L G L     VA+  F+ CY+++   +  +P + +
Sbjct: 347 ILDSGTSLTVLATPAYRAV---VAALGGRLAALPRVAMDPFEYCYNWTA-GAPEIPKLEV 402

Query: 431 HFGAGKALDLPAKNYLIPVDSA-GTFCFAFAPTS-SALSIIGNVQQQGTRVSFDLANNRV 488
            F     L+ PAK+Y+I  D+A G  C      +   +S+IGN+ QQ     FDL +  +
Sbjct: 403 SFAGSARLEPPAKSYVI--DAAPGVKCIGVQEGAWPGVSVIGNILQQEHLWEFDLRDRWL 460

Query: 489 GFTPNKC 495
            F   +C
Sbjct: 461 RFKHTRC 467


>gi|297826117|ref|XP_002880941.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297326780|gb|EFH57200.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 397

 Score =  146 bits (369), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 109/356 (30%), Positives = 159/356 (44%), Gaps = 43/356 (12%)

Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQ 218
           Y  R+ +GTPP +    +DTGSD+ W QC PC  CY Q  PIFDP  SS++         
Sbjct: 61  YLMRLQLGTPPFEIVAEIDTGSDLIWTQCMPCPNCYTQFAPIFDPSKSSTFKE------- 113

Query: 219 CKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSF----GNSGSVKGIALGCGHDNEG 274
                   C  N C Y++ Y D S++ G L TETV+     G    +   ++GCG +N  
Sbjct: 114 ------KRCHGNSCPYEIIYADESYSTGILATETVTIQSTSGEPFVMAETSIGCGLNNSN 167

Query: 275 LFV-----GSAGLLGLGGGMLSLTKQIKATS---LAYCLVDRDSPASGVLEF--NSARGG 324
           L        S+G++GL  G  SL  Q+       ++YC     S  +  + F  N+   G
Sbjct: 168 LMTPGYAASSSGIVGLNMGPSSLISQMDLPIPGLISYCF---SSQGTSKINFGTNAVVAG 224

Query: 325 DAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQ 384
           D   A  +  KK   FYY+ L   SVG + ++   + F    A DG I +D GT  T L 
Sbjct: 225 DGTVAADMFIKKDQPFYYLNLDAVSVGDKRIETLGTPF---HAQDGNIFIDSGTTYTYLP 281

Query: 385 TQAYNSLRDSFVRLAGNL----KPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDL 440
           T   N +R++             P+S   L   CY++  +     P ++LHF  G  L L
Sbjct: 282 TSYCNLVREAVAASVVAANQVPDPSSENLL---CYNWDTME--IFPVITLHFAGGADLVL 336

Query: 441 PAKNYLIPVDSAGTFCFAFAPTSSAL-SIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
              N  +   + GTFC A      ++ +I GN       V +D +   + F+P  C
Sbjct: 337 DKYNMYVETITGGTFCLAIGCVDPSMPAIFGNRAHNNLLVGYDSSTLVISFSPTNC 392


>gi|15226317|ref|NP_180370.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|4063755|gb|AAC98463.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|20197953|gb|AAM15327.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|330252977|gb|AEC08071.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 392

 Score =  146 bits (369), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 109/353 (30%), Positives = 154/353 (43%), Gaps = 42/353 (11%)

Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQ 218
           Y  ++ VGTPP +    +DTGSD+ W QC PCT CY Q  PIFDP  SS++         
Sbjct: 61  YLMKLQVGTPPFEIEAEIDTGSDLIWTQCMPCTNCYSQYAPIFDPSNSSTFKE------- 113

Query: 219 CKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSF----GNSGSVKGIALGCGHDNEG 274
                   C  N C Y++ Y D +++ G L TETV+     G    +    +GCGH++  
Sbjct: 114 ------KRCNGNSCHYKIIYADTTYSKGTLATETVTIHSTSGEPFVMPETTIGCGHNSSW 167

Query: 275 LFVGSAGLLGLGGGMLSLTKQIKATS---LAYCLVDRDSPASGVLEF--NSARGGDAVTA 329
                +G++GL  G  SL  Q+       ++YC     S  +  + F  N+   GD V +
Sbjct: 168 FKPTFSGMVGLSWGPSSLITQMGGEYPGLMSYCFA---SQGTSKINFGTNAIVAGDGVVS 224

Query: 330 -PLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAY 388
             +         YY+ L   SVG   V+   + F    A +G II+D GT +T       
Sbjct: 225 TTMFLTTAKPGLYYLNLDAVSVGDTHVETMGTTF---HALEGNIIIDSGTTLTYFPVSYC 281

Query: 389 NSLR---DSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNY 445
           N +R   D +V       PT    L   CY    +     P +++HF  G  L L   N 
Sbjct: 282 NLVREAVDHYVTAVRTADPTGNDML---CYYTDTID--IFPVITMHFSGGADLVLDKYNM 336

Query: 446 LIPVDSAGTFCFAFA---PTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            I   + GTFC A     P   A  I GN  Q    V +D ++  V F+P  C
Sbjct: 337 YIETITRGTFCLAIICNNPPQDA--IFGNRAQNNFLVGYDSSSLLVSFSPTNC 387


>gi|413946455|gb|AFW79104.1| hypothetical protein ZEAMMB73_209101 [Zea mays]
          Length = 480

 Score =  146 bits (369), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 130/425 (30%), Positives = 203/425 (47%), Gaps = 44/425 (10%)

Query: 108 RDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGT 167
           RD AR +  I    LA             A      F+ P+ SGA  G+G+YF R  VGT
Sbjct: 61  RDDARRHAYIRSQLLAASRTRGRRAAEVGASASASAFAMPLSSGAYTGTGQYFVRFRVGT 120

Query: 168 PPRQFSMVLDTGSDINWLQCRPCTECYQQS-DPIFDPKTSSSYSPLPCAAPQCKS---LD 223
           P + F +V DTGSD+ W++C    +    +   +F    S S++P+ C++  C S     
Sbjct: 121 PAQPFVLVADTGSDLTWVKCSGAGDGTGDAPRRVFRAAASRSWAPIACSSDTCTSYVPFS 180

Query: 224 VSACR--ANRCLYQVAYGDGSFTVGDLVTETVSFGNSGS-----------VKGIALGCGH 270
           ++ C   A+ C Y   Y DGS   G + T++ +   SGS           ++G+ LGC  
Sbjct: 181 LANCSSPASPCAYDYRYNDGSAARGVVGTDSATIALSGSESRDGGGRRAKLQGVVLGCTA 240

Query: 271 DNEGL-FVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSP--ASGVLEFN--SAR 322
             +G  F  S G+L LG   +S   +  A      +YCLVD  +P  A+  L F      
Sbjct: 241 SYDGQSFQSSDGVLSLGNSNISFASRAAARFGGRFSYCLVDHLAPRNATSYLTFGPPGPE 300

Query: 323 GGDAVTA---------PLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGII 373
           GG A ++         PL+ ++++  FY V +    V G+A+ IP  ++  D A  GG I
Sbjct: 301 GGAAASSSSSSAAARTPLLLDRRMSPFYAVAVDAVHVAGEALDIPADVW--DVARGGGAI 358

Query: 374 VDCGTAITRLQTQAYNSLRDSFV-RLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHF 432
           +D GT++T L T AY ++  +   RLAG   P   +  F+ CY+++   ++ +P + + F
Sbjct: 359 LDSGTSLTVLATPAYRAVVAALSERLAG--LPRVSMDPFEYCYNWTAA-ALEIPGLEVRF 415

Query: 433 GAGKALDLPAKNYLIPVDSA-GTFCFAFAPTS-SALSIIGNVQQQGTRVSFDLANNRVGF 490
                L  PAK+Y+  VD+A G  C      +   +S+IGN+ QQ     FDL +  + F
Sbjct: 416 AGSARLQPPAKSYV--VDAAPGVKCIGVQEGAWPGVSVIGNILQQDHLWEFDLRDRWLRF 473

Query: 491 TPNKC 495
              +C
Sbjct: 474 KHTRC 478


>gi|190896608|gb|ACE96817.1| aspartyl protease [Populus tremula]
          Length = 339

 Score =  146 bits (368), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 106/298 (35%), Positives = 140/298 (46%), Gaps = 17/298 (5%)

Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAP 217
            Y  R+ +GTP +Q  MVLDT +D  W+ C  CT C   S   F P  S++   L C+  
Sbjct: 44  NYVVRVKLGTPGQQMFMVLDTSNDAAWVPCSGCTGC---SSTTFLPNASTTLGSLDCSEA 100

Query: 218 QCKSLDVSACRA---NRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEG 274
           QC  +   +C A   + CL+  +YG  S     LV + ++  N   + G   GC +   G
Sbjct: 101 QCSQVRGFSCPATGSSACLFNQSYGGDSSLAATLVQDAITLAND-VIPGFTFGCINAVSG 159

Query: 275 LFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPA-SGVLEFNSARGGDAV-TA 329
             +   GLLGLG G +SL  Q  A      +YCL    S   SG L+        ++ T 
Sbjct: 160 GSIPPQGLLGLGRGPISLISQAGAMYSGVFSYCLPSFKSYYFSGSLKLGPVGQPKSIRTT 219

Query: 330 PLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYN 389
           PL+RN    + YYV LTG SVG   V IP      D     G I+D GT ITR     Y 
Sbjct: 220 PLLRNPHRPSLYYVNLTGVSVGRIKVPIPSEQLVFDPNTGAGTIIDSGTVITRFVQPVYF 279

Query: 390 SLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLI 447
           ++RD F +      P S +  FDTC  F+       P V+LHF  G  L LP +N LI
Sbjct: 280 AIRDEFRKQVNG--PISSLGAFDTC--FAETNEAEAPAVTLHF-EGLNLVLPMENSLI 332


>gi|42568291|ref|NP_199124.3| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332007527|gb|AED94910.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 631

 Score =  146 bits (368), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 102/368 (27%), Positives = 175/368 (47%), Gaps = 51/368 (13%)

Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCA 215
           +G Y +R+ +GTPP++F++++DTGS + ++ C  C +C +  DP F P+ S+SY  L C 
Sbjct: 73  NGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCSTCKQCGKHQDPKFQPELSTSYQALKC- 131

Query: 216 APQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSG--SVKGIALGCGHDNE 273
            P C   D        C+Y+  Y + S + G L  + +SFGN    S +    GC ++  
Sbjct: 132 NPDCNCDD----EGKLCVYERRYAEMSSSSGVLSEDLISFGNESQLSPQRAVFGCENEET 187

Query: 274 G-LFVGSA-GLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLE--FNSARGGDAVTA 329
           G LF   A G++GLG G LS+  Q         LVD+     GV+E  F+   GG  V  
Sbjct: 188 GDLFSQRADGIMGLGRGKLSVVDQ---------LVDK-----GVIEDVFSLCYGGMEVGG 233

Query: 330 PLIRNKKV---------------DTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIV 374
             +   K+                 +Y + L    V G+++++ P +F     G  G ++
Sbjct: 234 GAMVLGKISPPPGMVFSHSDPFRSPYYNIDLKQMHVAGKSLKLNPKVFN----GKHGTVL 289

Query: 375 DCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSG--VALFDTCYDFSGLRSVRV----PTV 428
           D GT       +A+ +++D+ ++   +LK   G      D C+  +G     +    P +
Sbjct: 290 DSGTTYAYFPKEAFIAIKDAVIKEIPSLKRIHGPDPNYDDVCFSGAGRDVAEIHNFFPEI 349

Query: 429 SLHFGAGKALDLPAKNYLI-PVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNR 487
           ++ FG G+ L L  +NYL       G +C    P   + +++G +  + T V++D  N++
Sbjct: 350 AMEFGNGQKLILSPENYLFRHTKVRGAYCLGIFPDRDSTTLLGGIVVRNTLVTYDRENDK 409

Query: 488 VGFTPNKC 495
           +GF    C
Sbjct: 410 LGFLKTNC 417


>gi|21618176|gb|AAM67226.1| unknown [Arabidopsis thaliana]
          Length = 430

 Score =  145 bits (367), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 116/359 (32%), Positives = 166/359 (46%), Gaps = 38/359 (10%)

Query: 165 VGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCK---- 220
           +GTPP+   MVLDTGS ++W+QC    +   +    FDP  SSS+S LPC+ P CK    
Sbjct: 78  IGTPPQAQQMVLDTGSQLSWIQCH-RKKLPPKPKTSFDPSLSSSFSTLPCSHPLCKPRIP 136

Query: 221 --SLDVSACRANR-CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFV 277
             +L  S C +NR C Y   Y DG+F  G+LV E ++F N+     + LGC  ++     
Sbjct: 137 DFTLPTS-CDSNRLCHYSYFYADGTFAEGNLVKEKITFSNTEITPPLILGCATESS---- 191

Query: 278 GSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNKKV 337
              G+LG+  G LS   Q K +  +YC+  + S   G     S   GD   +   +   +
Sbjct: 192 DDRGILGMNRGRLSFVSQAKISKFSYCIPPK-SNRPGFTPTGSFYLGDNPNSHGFKYVSL 250

Query: 338 DTF-------------YYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQ 384
            TF             Y V + G   G + + I  S+F  D  G G  +VD G+  T L 
Sbjct: 251 LTFPESQRMPNLDPLAYTVPMIGIRFGLKKLNISGSVFRPDAGGSGQTMVDSGSEFTHLV 310

Query: 385 TQAYNSLR-DSFVRLAGNLKP---TSGVALFDTCYDFSGLRSVR-VPTVSLHFGAGKALD 439
             AY+ +R +   R+   LK      G A  D C+D +     R +  +   F  G  + 
Sbjct: 311 DAAYDKVRAEIMTRVGRRLKKGYVYGGTA--DMCFDGNVAMIPRLIGDLVFVFTRGVEIF 368

Query: 440 LPAKNYLIPVDSAGTFCFAFAPTS---SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           +P +  L+ V   G  C     +S   +A +IIGNV QQ   V FD+ N RVGF    C
Sbjct: 369 VPKERVLVNV-GGGIHCVGIGRSSMLGAASNIIGNVHQQNLWVEFDVTNRRVGFAKADC 426


>gi|18408451|ref|NP_564867.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|12322615|gb|AAG51309.1|AC026480_16 unknown protein [Arabidopsis thaliana]
 gi|14334808|gb|AAK59582.1| unknown protein [Arabidopsis thaliana]
 gi|15293195|gb|AAK93708.1| unknown protein [Arabidopsis thaliana]
 gi|332196351|gb|AEE34472.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 430

 Score =  145 bits (367), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 116/359 (32%), Positives = 166/359 (46%), Gaps = 38/359 (10%)

Query: 165 VGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCK---- 220
           +GTPP+   MVLDTGS ++W+QC    +   +    FDP  SSS+S LPC+ P CK    
Sbjct: 78  IGTPPQAQQMVLDTGSQLSWIQCH-RKKLPPKPKTSFDPSLSSSFSTLPCSHPLCKPRIP 136

Query: 221 --SLDVSACRANR-CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFV 277
             +L  S C +NR C Y   Y DG+F  G+LV E ++F N+     + LGC  ++     
Sbjct: 137 DFTLPTS-CDSNRLCHYSYFYADGTFAEGNLVKEKITFSNTEITPPLILGCATESS---- 191

Query: 278 GSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNKKV 337
              G+LG+  G LS   Q K +  +YC+  + S   G     S   GD   +   +   +
Sbjct: 192 DDRGILGMNRGRLSFVSQAKISKFSYCIPPK-SNRPGFTPTGSFYLGDNPNSHGFKYVSL 250

Query: 338 DTF-------------YYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQ 384
            TF             Y V + G   G + + I  S+F  D  G G  +VD G+  T L 
Sbjct: 251 LTFPESQRMPNLDPLAYTVPMIGIRFGLKKLNISGSVFRPDAGGSGQTMVDSGSEFTHLV 310

Query: 385 TQAYNSLR-DSFVRLAGNLKP---TSGVALFDTCYDFSGLRSVR-VPTVSLHFGAGKALD 439
             AY+ +R +   R+   LK      G A  D C+D +     R +  +   F  G  + 
Sbjct: 311 DAAYDKVRAEIMTRVGRRLKKGYVYGGTA--DMCFDGNVAMIPRLIGDLVFVFTRGVEIL 368

Query: 440 LPAKNYLIPVDSAGTFCFAFAPTS---SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           +P +  L+ V   G  C     +S   +A +IIGNV QQ   V FD+ N RVGF    C
Sbjct: 369 VPKERVLVNV-GGGIHCVGIGRSSMLGAASNIIGNVHQQNLWVEFDVTNRRVGFAKADC 426


>gi|297741705|emb|CBI32837.3| unnamed protein product [Vitis vinifera]
          Length = 455

 Score =  145 bits (367), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 124/408 (30%), Positives = 190/408 (46%), Gaps = 46/408 (11%)

Query: 98  YRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQ--ILPEDFSTPVVSGASQG 155
           Y+++    L +D+A  +TL            RH    A  Q  + P DF  P +    + 
Sbjct: 57  YKNVKAESLAKDTALESTL-----------SRHAYLRARQQKALQPADFVPPPLI---RD 102

Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCA 215
              + + + +G PP    +VLDTGSD+ W+QC PC  CY+Q DPI++   S SY+ + C 
Sbjct: 103 KSAFLANLSIGNPPTNVYVVLDTGSDLFWIQCEPCDVCYKQKDPIYNRTKSDSYTEMLCN 162

Query: 216 APQCKSL--DVSACRANRCLYQVAYGDGSFTVGDLVTETVSF----GNSGSVKGIALGCG 269
            P C SL  +     +  CLYQ +Y DGS T G L  E V+F     +      +  GCG
Sbjct: 163 EPPCLSLGREGQCSDSGSCLYQTSYADGSRTSGLLSYEKVAFTSHYSDEDKTAQVGFGCG 222

Query: 270 HDNEGLFVGS--AGLLGLGGGMLSLTKQIKA-----TSLAYCLVDRDSP-ASGVLEFNSA 321
             N      S   G+LGLG G++SL  Q+ A      S AYC  +  +P A G L F  A
Sbjct: 223 LQNLNFVTSSRDGGVLGLGPGLVSLVSQLSAIGKVSKSFAYCFGNLSNPNAGGFLVFGDA 282

Query: 322 RGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQA--VQIPPSLFEMDEAGDGGIIVDCGTA 379
              +    P++    +  FYYV L G  +G +   + I  S FE    G GG+I+D G+ 
Sbjct: 283 TYLNGDMTPMV----IAEFYYVNLLGIGLGVEEPRLDINSSSFERKPDGSGGVIIDSGST 338

Query: 380 ITRLQTQAYNSLRDSFV---RLAGNLKPTSGVALFDTCYDFSGLRSVRV-PTVSLHFGAG 435
           ++    + Y  +R++ V   +   N+ P +       C++    R + + PT+ L+  + 
Sbjct: 339 LSIFPPEVYEVVRNAVVDKLKKGYNISPLTSSP---DCFEGKIGRDLPLFPTLVLYLEST 395

Query: 436 KALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDL 483
             L+     +L   D    FC  F  +   LSIIG + QQ  +  ++L
Sbjct: 396 GILNDRWSIFLQRYDE--LFCLGFT-SGEGLSIIGTLAQQSYKFGYNL 440


>gi|302802500|ref|XP_002983004.1| hypothetical protein SELMODRAFT_13348 [Selaginella moellendorffii]
 gi|300149157|gb|EFJ15813.1| hypothetical protein SELMODRAFT_13348 [Selaginella moellendorffii]
          Length = 332

 Score =  145 bits (367), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 110/358 (30%), Positives = 157/358 (43%), Gaps = 45/358 (12%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCT-ECYQQSDPIFDPKTSSSYSPLPCA 215
           G Y+S I +G+PP+ FS+V+DTGSD+ W++C PC+ +C       FD   S++Y  L CA
Sbjct: 1   GVYYSTITLGSPPKDFSLVMDTGSDLTWVRCDPCSPDC----SSTFDRLASNTYKALTCA 56

Query: 216 APQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVK-----GIALGCGH 270
                             Y   YGDGSFT GDL  +T+    + S +     G   GCG 
Sbjct: 57  DD----------------YSYGYGDGSFTQGDLSVDTLKMAGAASDELEEFPGFVFGCGS 100

Query: 271 DNEGLFVGSAGLLGLGGGMLSLTKQIK---ATSLAYCLVD-------RDSP---ASGVLE 317
             +GL  G  G+L L  G LS   QI        +YCL+        + SP       +E
Sbjct: 101 LLKGLISGEVGILALSPGSLSFPSQIGEKYGNKFSYCLLRQTAQNSLKKSPMVFGEAAVE 160

Query: 318 FNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCG 377
                 G           +   +Y V L G SVG Q + + PS F   +  D   I D G
Sbjct: 161 LKEPGSGKLQELQYTPIGESSIYYTVRLDGISVGNQRLDLSPSAFLNGQ--DKPTIFDSG 218

Query: 378 TAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKA 437
           T +T L     +S++ S   +    +  + +   D C+         +P ++ HF  G  
Sbjct: 219 TTLTMLPPGVCDSIKQSLASMVSGAEFVA-IKGLDACFRVPPSSGQGLPDITFHFNGGAD 277

Query: 438 LDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
                 NY+I + S    C  F PT+  +SI GN+QQQ   V  D+ N R+GF    C
Sbjct: 278 FVTRPSNYVIDLGSLQ--CLIFVPTNE-VSIFGNLQQQDFFVLHDMDNRRIGFKETDC 332


>gi|9757837|dbj|BAB08274.1| unnamed protein product [Arabidopsis thaliana]
          Length = 586

 Score =  145 bits (367), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 102/368 (27%), Positives = 175/368 (47%), Gaps = 51/368 (13%)

Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCA 215
           +G Y +R+ +GTPP++F++++DTGS + ++ C  C +C +  DP F P+ S+SY  L C 
Sbjct: 73  NGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCSTCKQCGKHQDPKFQPELSTSYQALKC- 131

Query: 216 APQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSG--SVKGIALGCGHDNE 273
            P C   D        C+Y+  Y + S + G L  + +SFGN    S +    GC ++  
Sbjct: 132 NPDCNCDD----EGKLCVYERRYAEMSSSSGVLSEDLISFGNESQLSPQRAVFGCENEET 187

Query: 274 G-LFVGSA-GLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLE--FNSARGGDAVTA 329
           G LF   A G++GLG G LS+  Q         LVD+     GV+E  F+   GG  V  
Sbjct: 188 GDLFSQRADGIMGLGRGKLSVVDQ---------LVDK-----GVIEDVFSLCYGGMEVGG 233

Query: 330 PLIRNKKV---------------DTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIV 374
             +   K+                 +Y + L    V G+++++ P +F     G  G ++
Sbjct: 234 GAMVLGKISPPPGMVFSHSDPFRSPYYNIDLKQMHVAGKSLKLNPKVFN----GKHGTVL 289

Query: 375 DCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSG--VALFDTCYDFSGLRSVRV----PTV 428
           D GT       +A+ +++D+ ++   +LK   G      D C+  +G     +    P +
Sbjct: 290 DSGTTYAYFPKEAFIAIKDAVIKEIPSLKRIHGPDPNYDDVCFSGAGRDVAEIHNFFPEI 349

Query: 429 SLHFGAGKALDLPAKNYLI-PVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNR 487
           ++ FG G+ L L  +NYL       G +C    P   + +++G +  + T V++D  N++
Sbjct: 350 AMEFGNGQKLILSPENYLFRHTKVRGAYCLGIFPDRDSTTLLGGIVVRNTLVTYDRENDK 409

Query: 488 VGFTPNKC 495
           +GF    C
Sbjct: 410 LGFLKTNC 417


>gi|297795137|ref|XP_002865453.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297311288|gb|EFH41712.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 665

 Score =  145 bits (366), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 100/367 (27%), Positives = 175/367 (47%), Gaps = 49/367 (13%)

Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCA 215
           +G Y +R+ +GTPP++F++++DTGS + ++ C  C +C +  DP F P+ SSSY  L C 
Sbjct: 77  NGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCSTCKQCGKHQDPKFQPELSSSYKALKC- 135

Query: 216 APQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSV--KGIALGCGHDNE 273
            P C   D        C+Y+  Y + S + G L  + +SFGN   +  +    GC +   
Sbjct: 136 NPDCNCDD----EGKLCVYERRYAEMSSSSGVLSEDLISFGNESQLTPQRAVFGCENVET 191

Query: 274 G-LFVGSA-GLLGLGGGMLSLTKQI-------KATSLAY---------CLVDRDSPASGV 315
           G LF   A G++GLG G LS+  Q+          SL Y          ++ + SP +G+
Sbjct: 192 GDLFSQRADGIMGLGRGKLSVVDQLVDKGVIEDVFSLCYGGMEVGGGAMVLGKISPPAGM 251

Query: 316 LEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVD 375
           +  +S    D   +P         +Y + L    V G+++++ P +F     G  G ++D
Sbjct: 252 VFSHS----DPFRSP---------YYNIDLKQMHVAGKSLKLNPKVFN----GKHGTVLD 294

Query: 376 CGTAITRLQTQAYNSLRDSFVRLAGNLKPTSG--VALFDTCYDFSGLRSVRV----PTVS 429
            GT       +A+ +++D+ ++   +LK   G      D C+  +G     +    P + 
Sbjct: 295 SGTTYAYFPKEAFIAIKDAIIKEIPSLKRIHGPDPNYDDVCFSGAGRDVAEIHNFFPEID 354

Query: 430 LHFGAGKALDLPAKNYLI-PVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRV 488
           + FG G+ L L  +NYL       G +C    P   + +++G +  + T V++D  N+++
Sbjct: 355 MEFGNGQKLILSPENYLFRHTKVRGAYCLGIFPDRDSTTLLGGIVVRNTLVTYDRENDKL 414

Query: 489 GFTPNKC 495
           GF    C
Sbjct: 415 GFLKTNC 421


>gi|147858841|emb|CAN78694.1| hypothetical protein VITISV_037475 [Vitis vinifera]
          Length = 442

 Score =  145 bits (366), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 124/408 (30%), Positives = 189/408 (46%), Gaps = 46/408 (11%)

Query: 98  YRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQ--ILPEDFSTPVVSGASQG 155
           Y+++    L +D+A  +TL            RH    A  Q  + P DF  P +    + 
Sbjct: 44  YKNVKAESLAKDTALESTL-----------SRHAYLRARQQKALQPADFVPPPLI---RD 89

Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCA 215
              + + + +G PP    +VLDTGSD+ W+QC PC  CY+Q DPI++   S SY+ + C 
Sbjct: 90  KSAFLANLSIGNPPTNVYVVLDTGSDLFWIQCEPCDVCYKQKDPIYNRTKSDSYTEMLCN 149

Query: 216 APQCKSL--DVSACRANRCLYQVAYGDGSFTVGDLVTETVSF----GNSGSVKGIALGCG 269
            P C SL  +     +  CLYQ AY DG+ T G L  E V+F     +      +  GCG
Sbjct: 150 EPPCVSLGREGQCSDSGSCLYQTAYADGARTSGLLSYEKVAFTSHYSDEDKTAQVGFGCG 209

Query: 270 HDNEGLFVGS--AGLLGLGGGMLSLTKQIKA-----TSLAYCLVDRDSP-ASGVLEFNSA 321
             N      +   G+LGLG G++SL  Q+ A      S AYC  +  +P A G L F  A
Sbjct: 210 LQNLNFITSNRDGGVLGLGPGLVSLVSQLSAIGKVSKSFAYCFGNISNPNAGGFLVFGDA 269

Query: 322 RGGDAVTAPLIRNKKVDTFYYVGL--TGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTA 379
              +    P++    +  FYYV L   G  VG   + I  S FE    G GG+I+D G+ 
Sbjct: 270 TYLNGDMTPMV----IAEFYYVNLLGIGLGVGEPRLDINSSSFERKPDGSGGVIIDSGST 325

Query: 380 ITRLQTQAYNSLRDSFV---RLAGNLKPTSGVALFDTCYDFSGLRSVRV-PTVSLHFGAG 435
           ++    + Y  +R++ V   +   N+ P +       C++    R + + PT+ L+  + 
Sbjct: 326 LSVFPPEVYEVVRNAVVDKLKKGYNISPLTSSP---DCFEGKIERDLPLFPTLVLYLEST 382

Query: 436 KALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDL 483
             L+     +L   D    FC  F  +   LSIIG + QQ  +  ++L
Sbjct: 383 GILNDRWSIFLQRYDE--LFCLGFT-SGEGLSIIGTLAQQSYKFGYNL 427


>gi|67633548|gb|AAY78698.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 392

 Score =  145 bits (366), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 109/353 (30%), Positives = 154/353 (43%), Gaps = 42/353 (11%)

Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQ 218
           Y  ++ VGTPP +    +DTGSD+ W QC PCT CY Q  PIFDP  SS++         
Sbjct: 61  YLMKLQVGTPPFEIEAEIDTGSDLIWTQCMPCTNCYSQYAPIFDPSNSSTFKE------- 113

Query: 219 CKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSF----GNSGSVKGIALGCGHDNEG 274
                   C  N C Y++ Y D +++ G L TETV+     G    +    +GCGH++  
Sbjct: 114 ------KRCNGNSCHYKIIYADTTYSKGTLATETVTIHSTSGEPFVMPETTIGCGHNSSW 167

Query: 275 LFVGSAGLLGLGGGMLSLTKQIKATS---LAYCLVDRDSPASGVLEF--NSARGGD-AVT 328
                +G++GL  G  SL  Q+       ++YC     S  +  + F  N+   GD  V+
Sbjct: 168 FKPTFSGMVGLSWGPSSLITQMGGEYPGLMSYCFA---SQGTSKINFGTNAIVAGDGVVS 224

Query: 329 APLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAY 388
             +         YY+ L   SVG   V+   + F    A +G II+D GT +T       
Sbjct: 225 TTMFLTTAKPGLYYLNLDAVSVGDTHVETMGTTF---HALEGNIIIDSGTTLTYFPVSYC 281

Query: 389 NSLR---DSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNY 445
           N +R   D +V       PT    L   CY    +     P +++HF  G  L L   N 
Sbjct: 282 NLVREAVDHYVTAVRTADPTGNDML---CYYTDTID--IFPVITMHFSGGADLVLDKYNM 336

Query: 446 LIPVDSAGTFCFAFA---PTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            I   + GTFC A     P   A  I GN  Q    V +D ++  V F+P  C
Sbjct: 337 YIETITRGTFCLAIICNNPPQDA--IFGNRAQNNFLVGYDSSSLLVFFSPTNC 387


>gi|225464832|ref|XP_002272243.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 467

 Score =  145 bits (366), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 115/385 (29%), Positives = 170/385 (44%), Gaps = 52/385 (13%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRP---CTEC-YQQSDP---IFDPKTSSSY 209
           G Y   +  GTPP+   +++DTGSD+ W  C     C  C +  S+P   IF PK+SSS 
Sbjct: 88  GAYSIPLSFGTPPQTLPLIMDTGSDLVWFPCTHRYVCRNCSFSTSNPSSNIFIPKSSSSS 147

Query: 210 SPLPCAAPQCKSLDVSACRANRCL---------------YQVAYGDGSFTVGDLVTETVS 254
             L C  P+C  +  S  ++ RC                Y V YG G  T G +++ET+ 
Sbjct: 148 KVLGCVNPKCGWIHGSKVQS-RCRDCEPTSPNCTQICPPYLVFYGSG-ITGGIMLSETLD 205

Query: 255 FGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDR---DSP 311
               G V    +GC   +       AG+ G G G  SL  Q+     +YCL+ R   D+ 
Sbjct: 206 LPGKG-VPNFIVGCSVLSTSQ---PAGISGFGRGPPSLPSQLGLKKFSYCLLSRRYDDTT 261

Query: 312 ASGVLEFNSARGGDAVTA-----PLIRNKKV------DTFYYVGLTGFSVGGQAVQIPPS 360
            S  L  +        TA     P ++N KV        +YY+GL   +VGG+ V+IP  
Sbjct: 262 ESSSLVLDGESDSGEKTAGLSYTPFVQNPKVAGKHAFSVYYYLGLRHITVGGKHVKIPYK 321

Query: 361 LFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPT--SGVALFDTCYDFS 418
                  GDGG I+D GT  T ++ + +  +   F +   + + T   G+     C++ S
Sbjct: 322 YLIPGADGDGGTIIDSGTTFTYMKGEIFELVAAEFEKQVQSKRATEVEGITGLRPCFNIS 381

Query: 419 GLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALS--------IIG 470
           GL +   P ++L F  G  ++LP  NY+  +      C       +A          I+G
Sbjct: 382 GLNTPSFPELTLKFRGGAEMELPLANYVAFLGGDDVVCLTIVTDGAAGKEFSGGPAIILG 441

Query: 471 NVQQQGTRVSFDLANNRVGFTPNKC 495
           N QQQ   V +DL N R+GF    C
Sbjct: 442 NFQQQNFYVEYDLRNERLGFRQQSC 466


>gi|359476197|ref|XP_003631803.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 414

 Score =  145 bits (365), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 117/371 (31%), Positives = 173/371 (46%), Gaps = 41/371 (11%)

Query: 137 AQILPEDFSTPVVSGASQG---SGEYFSRIGVG--TPPRQFSMVLDTGSD-INWLQCRPC 190
           + +LP++  +    G SQG   + +Y    G G   PP    ++ +   D I W QC+PC
Sbjct: 47  SSLLPKNKCSASARGGSQGLPITQKYGPCSGSGHSQPPSPQEILAEMNPDSITWTQCKPC 106

Query: 191 TECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVT 250
             C + S   FDP  S +YS   C         + +   N   Y + YGD S +VG+   
Sbjct: 107 VRCLKDSHRHFDPSASLTYSLGSC---------IPSTVGNT--YNMTYGDKSTSVGNYGC 155

Query: 251 ETVSFGNSGSVKGIALGCGHDNEGLF-VGSAGLLGLGGGMLSLTKQIKA---TSLAYCLV 306
           +T++   S        GCG +NEG F  G+ G+LGLG G LS   Q  +      +YCL 
Sbjct: 156 DTMTLEPSDVFPKFQFGCGRNNEGDFGSGADGMLGLGQGQLSTVSQTASKFKKVFSYCLP 215

Query: 307 DRDSPASGVL-----EFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSL 361
           + DS  S +        +S +    V  P     +   +Y+V L   SVG + + +P S+
Sbjct: 216 EEDSIGSLLFGEKATSQSSLKFTSLVNGPGTSGLEESGYYFVKLLDISVGNKRLNVPSSV 275

Query: 362 FEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVA----LFDTCYDF 417
           F        G I+D GT IT L  +AY++L  +F +       ++G      + DTCY+ 
Sbjct: 276 F-----ASPGTIIDSGTVITCLPQRAYSALTAAFKKAMAKYPLSNGRRKKGDILDTCYNL 330

Query: 418 SGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTS-----SALSIIGNV 472
           SG + V +P + LHFG G  + L  K  +I  + A   C AFA  S     S L+IIGN 
Sbjct: 331 SGRKDVLLPEIVLHFGEGADVRLNGKR-VIWGNDASRLCLAFAGNSKSTMNSELTIIGNR 389

Query: 473 QQQGTRVSFDL 483
           QQ    V +D+
Sbjct: 390 QQVSLTVLYDI 400


>gi|255585473|ref|XP_002533429.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223526717|gb|EEF28949.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 448

 Score =  145 bits (365), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 109/355 (30%), Positives = 170/355 (47%), Gaps = 28/355 (7%)

Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQ 218
           +     +G P      ++DTGS+I W++C PC  C QQ+ P+ DP  SS+Y+ LPC    
Sbjct: 99  FLVNFSMGQPATPQLAIMDTGSNILWVRCAPCKRCTQQNGPLLDPSKSSTYASLPCTNTM 158

Query: 219 CKSLDVSAC-RANRCLYQVAYGDGSFTVGDLVTETVSFGNS----GSVKGIALGCGHDNE 273
           C     + C R N+C Y ++Y  G  + G L TE + F +S     +V  +  GC H+N 
Sbjct: 159 CHYAPSAYCNRLNQCGYNLSYATGLSSAGVLATEQLIFHSSDEGVNAVPSVVFGCSHEN- 217

Query: 274 GLFVGS--AGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGV--LEFNSARGGDAVTA 329
           G +      G+ GLG G+ S   ++  +  +YCL +   P  G   L F      +  + 
Sbjct: 218 GDYKDRRFTGVFGLGKGITSFVTRM-GSKFSYCLGNIADPHYGYNQLVFGEKANFEGYST 276

Query: 330 PLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYN 389
           PL   K V+  YYV L G SVG + + I  + F M +  +   ++D GTA+T L   A+ 
Sbjct: 277 PL---KVVNGHYYVTLEGISVGEKRLDIDSTAFSM-KGNEKSALIDSGTALTWLAESAFR 332

Query: 390 SLRDSFVR--LAGNLKPTSGVALFDTCYDFSGLRS-VRVPTVSLHFGAGKALDLPAKNYL 446
           +L D+ VR  L G L P    +    CY  +  +  +  P V+ HF  G  LDL  ++  
Sbjct: 333 AL-DNEVRQLLDGVLMPFWRGSF--ACYKGTVSQDLIGFPVVTFHFSGGADLDLDTESMF 389

Query: 447 IPVDSAGTFCFAFAPTSS------ALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
               +    C A    S+      + S+IG + QQ   +++DL +N++ F    C
Sbjct: 390 YQA-TPDILCIAVRQASAYGNDFKSFSVIGLMAQQYYNMAYDLNSNKLFFQRIDC 443


>gi|15234607|ref|NP_194733.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|4938480|emb|CAB43839.1| putative protein [Arabidopsis thaliana]
 gi|7269904|emb|CAB80997.1| putative protein [Arabidopsis thaliana]
 gi|67633776|gb|AAY78812.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332660311|gb|AEE85711.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 427

 Score =  144 bits (364), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 112/353 (31%), Positives = 162/353 (45%), Gaps = 31/353 (8%)

Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQ 218
           +   I +G+PP    + +DT SD+ W+QC PC  CY QS PIFDP  S ++    C   Q
Sbjct: 85  FLVNISIGSPPITQLLHMDTASDLLWIQCLPCINCYAQSLPIFDPSRSYTHRNETCRTSQ 144

Query: 219 --CKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFG------NSGSVKGIALGCGH 270
               SL  +A     C Y + Y D + + G L  E + F       +S ++  +  GCGH
Sbjct: 145 YSMPSLKFNA-NTRSCEYSMRYVDDTGSKGILAREMLLFNTIYDESSSAALHDVVFGCGH 203

Query: 271 DNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPA--SGVLEFNSARGGDAV- 327
           DN G  +   G+LGLG G  SL  +      +YC    D P+    VL      G + + 
Sbjct: 204 DNYGEPLVGTGILGLGYGEFSLVHRF-GKKFSYCFGSLDDPSYPHNVLVLGD-DGANILG 261

Query: 328 -TAPL-IRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMD-EAGDGGIIVDCGTAITRLQ 384
            T PL I N     FYYV +   SV G  + I P +F  + + G GG I+D G ++T L 
Sbjct: 262 DTTPLEIHNG----FYYVTIEAISVDGIILPIDPRVFNRNHQTGLGGTIIDTGNSLTSLV 317

Query: 385 TQAYNSLRDSFVRLAGNLKPTSGVALFDT----CYDFSGLRSVR---VPTVSLHFGAGKA 437
            +AY  L++    +       + V+  D     CY+ +  R +     P V+ HF  G  
Sbjct: 318 EEAYKPLKNRIEDIFEGRFTAADVSQDDMIKMECYNGNFERDLVESGFPIVTFHFSEGAE 377

Query: 438 LDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGF 490
           L L  K+  + + S   FC A  P +  L+ IG   QQ   + +DL    V F
Sbjct: 378 LSLDVKSLFMKL-SPNVFCLAVTPGN--LNSIGATAQQSYNIGYDLEAMEVSF 427


>gi|125548492|gb|EAY94314.1| hypothetical protein OsI_16081 [Oryza sativa Indica Group]
          Length = 417

 Score =  144 bits (363), Expect = 9e-32,   Method: Compositional matrix adjust.
 Identities = 92/307 (29%), Positives = 151/307 (49%), Gaps = 41/307 (13%)

Query: 118 TKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLD 177
           ++ +LA   + R E   A   ++ E   TP++       GEY  ++G+GTPP +F+  +D
Sbjct: 55  SRYRLAGIGMARGEAASARKAVVAE---TPIMPAG----GEYLVKLGIGTPPYKFTAAID 107

Query: 178 TGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRAN---RCLY 234
           T SD+ W QC+PCT CY Q DP+F+P+ SS+Y+ LPC++  C  LDV  C  +    C Y
Sbjct: 108 TASDLIWTQCQPCTGCYHQVDPMFNPRVSSTYAALPCSSDTCDELDVHRCGHDDDESCQY 167

Query: 235 QVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLF--VGSAGLLGLGGGMLSL 292
              Y   + T G L  + +  G   + +G+A GC   + G      ++G++GLG G LSL
Sbjct: 168 TYTYSGNATTEGTLAVDKLVIGED-AFRGVAFGCSTSSTGGAPPPQASGVVGLGRGPLSL 226

Query: 293 TKQIKATSLAYCLVDRDSPASGVL----EFNSARGG-DAVTAPLIRNKKVDTFYYVGLTG 347
             Q+     AYCL    S   G L    + ++AR   + +  P+ R+ +  ++YY+ L G
Sbjct: 227 VSQLSVRRFAYCLPPPASRIPGKLVLGADADAARNATNRIAVPMRRDPRYPSYYYLNLDG 286

Query: 348 FSVGGQAVQIP-----------------------PSLFEMDEAGDGGIIVDCGTAITRLQ 384
             +G + + +P                        +   + +A   G+I+D  + IT L+
Sbjct: 287 LLIGDRTMSLPPTTTTTATATATATAPAPTPSPNATAVAVGDANRYGMIIDIASTITFLE 346

Query: 385 TQAYNSL 391
              Y+ L
Sbjct: 347 ASLYDEL 353


>gi|125528357|gb|EAY76471.1| hypothetical protein OsI_04407 [Oryza sativa Indica Group]
          Length = 441

 Score =  144 bits (363), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 116/367 (31%), Positives = 169/367 (46%), Gaps = 39/367 (10%)

Query: 163 IGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQ--QSDPIFDPKTSSSYSPLPCAAPQCK 220
           + VGTPP+  +MVLDTGS+++WL C P        +S   F P+ S +++ +PC + QC+
Sbjct: 69  LAVGTPPQNVTMVLDTGSELSWLLCAPGGGGGGGGRSALSFRPRASLTFASVPCGSAQCR 128

Query: 221 SLDVS---AC--RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGC---GHDN 272
           S D+    AC   + +C   ++Y DGS + G L TE  + G    ++  A GC     D 
Sbjct: 129 SRDLPSPPACDGASKQCRVSLSYADGSSSDGALATEVFTVGQGPPLRA-AFGCMATAFDT 187

Query: 273 EGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSAR------GGDA 326
               V +AGLLG+  G LS   Q      +YC+ DRD   +GVL    +           
Sbjct: 188 SPDGVATAGLLGMNRGALSFVSQASTRRFSYCISDRDD--AGVLLLGHSDLPFLPLNYTP 245

Query: 327 VTAPLIRNKKVDTFYY-VGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQT 385
           +  P +     D   Y V L G  VGG+ + IP S+   D  G G  +VD GT  T L  
Sbjct: 246 LYQPAMPLPYFDRVAYSVQLLGIRVGGKPLPIPASVLAPDHTGAGQTMVDSGTQFTFLLG 305

Query: 386 QAYNSLRDSFVRLAGNLKPT------SGVALFDTCYDFSGLRS--VRVPTVSLHF-GAGK 436
            AY++L+  F R      P       +    FDTC+     R+   R+P V+L F GA  
Sbjct: 306 DAYSALKAEFSRQTKPWLPALNDPNFAFQEAFDTCFRVPQGRAPPARLPAVTLLFNGAQM 365

Query: 437 ALDLPAKNYLIPVDSA---GTFCFAFA-----PTSSALSIIGNVQQQGTRVSFDLANNRV 488
            +      Y +P +     G +C  F      P ++   +IG+  Q    V +DL   RV
Sbjct: 366 TVAGDRLLYKVPGERRGGDGVWCLTFGNADMVPITA--YVIGHHHQMNVWVEYDLERGRV 423

Query: 489 GFTPNKC 495
           G  P +C
Sbjct: 424 GLAPIRC 430


>gi|356523171|ref|XP_003530215.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 442

 Score =  144 bits (363), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 122/370 (32%), Positives = 177/370 (47%), Gaps = 45/370 (12%)

Query: 163 IGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKS- 221
           I VGTPP+  SMV+DTGS+++WL C   T       P F+P  SSSY+P+ C++P C + 
Sbjct: 70  ITVGTPPQNMSMVIDTGSELSWLHCNTNTTA-TIPYPFFNPNISSSYTPISCSSPTCTTR 128

Query: 222 -----LDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHD----N 272
                +  S    N C   ++Y D S + G+L ++T  FG+S +  GI  GC +     N
Sbjct: 129 TRDFPIPASCDSNNLCHATLSYADASSSEGNLASDTFGFGSSFN-PGIVFGCMNSSYSTN 187

Query: 273 EGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVL---EFNSARGGDAVTA 329
                 + GL+G+  G LSL  Q+K    +YC+   D   SG+L   E N + GG     
Sbjct: 188 SESDSNTTGLMGMNLGSLSLVSQLKIPKFSYCISGSD--FSGILLLGESNFSWGGSLNYT 245

Query: 330 PLIR-NKKVDTF----YYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQ 384
           PL++ +  +  F    Y V L G  +  + + I  +LF  D  G G  + D GT  + L 
Sbjct: 246 PLVQISTPLPYFDRSAYTVRLEGIKISDKLLNISGNLFVPDHTGAGQTMFDLGTQFSYLL 305

Query: 385 TQAYNSLRDSFV-RLAGNLKPTSG------VALFDTCYDFSGLRSV--RVPTVSLHF-GA 434
              YN+LRD F+ +  G L+          +A+ D CY     +S    +P+VSL F GA
Sbjct: 306 GPVYNALRDEFLNQTNGTLRALDDPNFVFQIAM-DLCYRVPVNQSELPELPSVSLVFEGA 364

Query: 435 -----GKALDLPAKNYLIPVDSAGTFCFAFAPTSSALS----IIGNVQQQGTRVSFDLAN 485
                G  L      ++   DS   +CF F   S  L     IIG+  QQ   + FDL  
Sbjct: 365 EMRVFGDQLLYRVPGFVWGNDSV--YCFTFG-NSDLLGVEAFIIGHHHQQSMWMEFDLVE 421

Query: 486 NRVGFTPNKC 495
           +RVG    +C
Sbjct: 422 HRVGLAHARC 431


>gi|115441003|ref|NP_001044781.1| Os01g0844500 [Oryza sativa Japonica Group]
 gi|19571042|dbj|BAB86469.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
           Group]
 gi|20160609|dbj|BAB89555.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
           Group]
 gi|113534312|dbj|BAF06695.1| Os01g0844500 [Oryza sativa Japonica Group]
 gi|125572614|gb|EAZ14129.1| hypothetical protein OsJ_04051 [Oryza sativa Japonica Group]
          Length = 442

 Score =  144 bits (362), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 116/367 (31%), Positives = 169/367 (46%), Gaps = 39/367 (10%)

Query: 163 IGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQ--QSDPIFDPKTSSSYSPLPCAAPQCK 220
           + VGTPP+  +MVLDTGS+++WL C P        +S   F P+ S +++ +PC + QC+
Sbjct: 70  LAVGTPPQNVTMVLDTGSELSWLLCAPGGGGGGGGRSALSFRPRASLTFASVPCDSAQCR 129

Query: 221 SLDVS---AC--RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGC---GHDN 272
           S D+    AC   + +C   ++Y DGS + G L TE  + G    ++  A GC     D 
Sbjct: 130 SRDLPSPPACDGASKQCRVSLSYADGSSSDGALATEVFTVGQGPPLRA-AFGCMATAFDT 188

Query: 273 EGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSAR------GGDA 326
               V +AGLLG+  G LS   Q      +YC+ DRD   +GVL    +           
Sbjct: 189 SPDGVATAGLLGMNRGALSFVSQASTRRFSYCISDRDD--AGVLLLGHSDLPFLPLNYTP 246

Query: 327 VTAPLIRNKKVDTFYY-VGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQT 385
           +  P +     D   Y V L G  VGG+ + IP S+   D  G G  +VD GT  T L  
Sbjct: 247 LYQPAMPLPYFDRVAYSVQLLGIRVGGKPLPIPASVLAPDHTGAGQTMVDSGTQFTFLLG 306

Query: 386 QAYNSLRDSFVRLAGNLKPT------SGVALFDTCYDFSGLRS--VRVPTVSLHF-GAGK 436
            AY++L+  F R      P       +    FDTC+     R+   R+P V+L F GA  
Sbjct: 307 DAYSALKAEFSRQTKPWLPALNDPNFAFQEAFDTCFRVPQGRAPPARLPAVTLLFNGAQM 366

Query: 437 ALDLPAKNYLIPVDSA---GTFCFAFA-----PTSSALSIIGNVQQQGTRVSFDLANNRV 488
            +      Y +P +     G +C  F      P ++   +IG+  Q    V +DL   RV
Sbjct: 367 TVAGDRLLYKVPGERRGGDGVWCLTFGNADMVPITA--YVIGHHHQMNVWVEYDLERGRV 424

Query: 489 GFTPNKC 495
           G  P +C
Sbjct: 425 GLAPIRC 431


>gi|388516465|gb|AFK46294.1| unknown [Medicago truncatula]
          Length = 434

 Score =  144 bits (362), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 116/367 (31%), Positives = 166/367 (45%), Gaps = 35/367 (9%)

Query: 145 STPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPK 204
           S P+ SG +   G Y  R+ +GTP +   MVLDT +D  ++    C  C   S   F P 
Sbjct: 84  SAPIASGQAFNIGNYIVRVKIGTPGQLLFMVLDTSTDEAFIPSSGCIGC---SATTFSPN 140

Query: 205 TSSSYSPLPCAAPQCKSLDVSACRAN---RCLYQVAYGDGSFT---VGD---LVTETV-- 253
            S+SY PL C+ PQC  +   +C A     C +  +Y   +++   V D   L T+ +  
Sbjct: 141 ASTSYVPLECSVPQCSQVRGLSCPATGSGACSFNKSYAGSTYSATLVQDSLRLATDVIPS 200

Query: 254 -SFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPA 312
            SFG+  ++ G ++             +        +LS T  + +   +YCL    S  
Sbjct: 201 YSFGSINAISGSSIPAQGLLGLGRGPLS--------LLSQTGSLYSGVFSYCLPSFKSYY 252

Query: 313 -SGVLEFNSARGGDAV-TAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDG 370
            SG L+        ++ T PL+RN +  + Y+V LTG +VG   V  P  L   D     
Sbjct: 253 FSGSLKLGPVGQPKSIRTTPLLRNPRRPSLYFVNLTGITVGKVNVPFPKELLAFDVNTGS 312

Query: 371 GIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSL 430
           G I+D GT ITR     YN++RD F +      P S +  FDTC  F        P ++L
Sbjct: 313 GTIIDSGTVITRFVEPVYNAVRDEFRKQVTG--PFSSLGAFDTC--FVKNYETLAPAITL 368

Query: 431 HFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTS-----SALSIIGNVQQQGTRVSFDLAN 485
           HF     L LP +N LI   S    C A A T      + L++I N QQQ  RV FD  N
Sbjct: 369 HF-TDLDLKLPLENSLIHSSSGSLACLAMASTPKNVNYTVLNVIANYQQQNLRVLFDTVN 427

Query: 486 NRVGFTP 492
           N+  + P
Sbjct: 428 NKGWYCP 434


>gi|297822477|ref|XP_002879121.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297324960|gb|EFH55380.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 711

 Score =  144 bits (362), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 105/350 (30%), Positives = 151/350 (43%), Gaps = 36/350 (10%)

Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQ 218
           Y  ++ VGTPP +   V+DTGS+I W QC PC  CY+Q+ PIFDP  SS++         
Sbjct: 380 YLMKLQVGTPPFEIEAVIDTGSEITWTQCLPCVHCYKQNAPIFDPSKSSTFKE------- 432

Query: 219 CKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSF----GNSGSVKGIALGCGHDNEG 274
                   C  + C Y+V Y D ++T G L T+TV+     G    +    +GCG +N  
Sbjct: 433 ------KRCHDHSCPYEVDYFDKTYTKGTLATDTVTIHSTSGEPFVMAETIIGCGRNNSW 486

Query: 275 LFVGSAGLLGLGGGMLSLTKQIKATS---LAYCLVDRDSPASGVLEFNSARGGDAVTAPL 331
                 G +GL  G LSL  Q+       ++YC     +            GG  V+  +
Sbjct: 487 FRPSFEGFVGLNWGPLSLITQMGGEYPGLMSYCFAGNGTSKINFGTNAIVGGGGVVSTTM 546

Query: 332 IRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSL 391
                   FYY+ L   SVG   ++   + F    A +G I++D GT +T       N +
Sbjct: 547 FVTTARPGFYYLNLDAVSVGDTRIETLGTPF---HALEGNIVIDSGTTLTYFPESYCNLV 603

Query: 392 RDSFVRLAGNL---KPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIP 448
           R +   +   +    PT    L   CY +S    +  P +++HF  G  L L   N  + 
Sbjct: 604 RQAVEHVVPAVPAADPTGNDLL---CY-YSNTTEI-FPVITMHFSGGADLVLDKYNMFME 658

Query: 449 VDSAGTFCFAFA---PTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
             S G FC A     PT  A  I GN  Q    V +D ++  V F P  C
Sbjct: 659 SYSGGLFCLAIICNNPTQEA--IFGNRAQNNFLVGYDSSSLLVSFKPTNC 706



 Score =  132 bits (331), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 100/335 (29%), Positives = 150/335 (44%), Gaps = 50/335 (14%)

Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAP 217
           EY  ++ +GTPP +   VLDTGS++ W QC PC  CY Q  PIFDP  SS++    C  P
Sbjct: 64  EYLMKLQIGTPPFEVEAVLDTGSELIWTQCLPCLHCYDQKAPIFDPSKSSTFKETRCNTP 123

Query: 218 QCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIA----LGCGHDN- 272
                       + C Y++ Y D S+T G L TETV+  ++  V  +     +GC  +N 
Sbjct: 124 D-----------HSCPYKLVYDDKSYTQGTLATETVTIHSTSGVPFVMPETIIGCSRNNS 172

Query: 273 -EGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGDAVTAPL 331
             G    S+G++GL  G LSL  Q+     AY       P  GV           V+  +
Sbjct: 173 GSGFRPSSSGIVGLSRGSLSLISQMGG---AY-------PGDGV-----------VSTTM 211

Query: 332 IRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSL 391
                    YY+ L   SVG   ++   + F    A +G I++D GT +T       N +
Sbjct: 212 FAKTAKRGQYYLNLDAVSVGDTRIETVGTPF---HALNGNIVIDSGTPLTYFPVSYCNLV 268

Query: 392 RDSFVRLAGN---LKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIP 448
           R +  R+      + P+    L   CY +S    +  P +++HF  G  L L   N  + 
Sbjct: 269 RKAVERVVTADRVVDPSRNDML---CY-YSNTIEI-FPVITVHFSGGADLVLDKYNMYME 323

Query: 449 VDSAGTFCFAFAPTS-SALSIIGNVQQQGTRVSFD 482
           ++  G FC A    + + ++I GN  Q    V +D
Sbjct: 324 LNRGGVFCLAIICNNPTQVAIFGNRAQNNFLVGYD 358


>gi|255538124|ref|XP_002510127.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223550828|gb|EEF52314.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 641

 Score =  143 bits (361), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 112/420 (26%), Positives = 192/420 (45%), Gaps = 38/420 (9%)

Query: 101 LVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQ------ 154
           +VLS  +  +     +I  L L+  N+  H  KP  +           +  A        
Sbjct: 24  VVLSATDIPNHNHRPMIIPLHLSTSNISSHR-KPFTSNYHRRQLHNSDLPNAHMRLYDDL 82

Query: 155 -GSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLP 213
             +G Y +R+ +GTPP++F++++DTGS + ++ C  C +C +  DP F P++SS+Y P+ 
Sbjct: 83  LSNGYYTTRLFIGTPPQEFALIVDTGSTVTYVPCSTCEQCGKHQDPRFQPESSSTYKPMQ 142

Query: 214 CAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSV--KGIALGCGHD 271
           C  P C   D       +C Y+  Y + S + G L  + +SFGN   +  +    GC   
Sbjct: 143 C-NPSCNCDD----EGKQCTYERRYAEMSSSSGLLAEDVLSFGNESELTPQRAIFGCETV 197

Query: 272 NEG-LFVGSA-GLLGLGGGMLSLT-----KQIKATSLAYCLVDRDSPASGVLEFNSARGG 324
             G LF   A G++GLG G LS+      K++   S + C    D     ++  N     
Sbjct: 198 ETGELFSQRADGIMGLGRGPLSVVDQLVIKEVVGNSFSLCYGGMDVVGGAMVLGNIPPPP 257

Query: 325 DAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQ 384
           D V A    +     +Y + L    V G+ +++ P +F+    G  G ++D GT    L 
Sbjct: 258 DMVFAH--SDPYRSAYYNIELKELHVAGKRLKLNPRVFD----GKHGTVLDSGTTYAYLP 311

Query: 385 TQAYNSLRDSFVRLAGNLKPTSG--VALFDTCY-----DFSGLRSVRVPTVSLHFGAGKA 437
            +A+ + +D+ ++    LK   G   +  D C+     D S L  +  P V++ FG G+ 
Sbjct: 312 EEAFVAFKDAIIKEIKFLKQIHGPDPSYNDICFSGAGRDVSQLSKI-FPEVNMVFGNGQK 370

Query: 438 LDLPAKNYLI-PVDSAGTFCFA-FAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           L L  +NYL      +G +C   F       +++G +  + T V++D  N+++GF    C
Sbjct: 371 LSLSPENYLFRHTKVSGAYCLGIFQNGKDPTTLLGGIVVRNTLVTYDRDNDKIGFWKTNC 430


>gi|30682289|ref|NP_187876.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|11994412|dbj|BAB02414.1| chloroplast nucleoid DNA binding protein-like [Arabidopsis
           thaliana]
 gi|332641715|gb|AEE75236.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 461

 Score =  143 bits (361), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 117/373 (31%), Positives = 171/373 (45%), Gaps = 36/373 (9%)

Query: 150 SGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSY 209
           SG   G+ +YF+ I VGTP ++F +V+DTGS++ W+ CR      + +  +F    S S+
Sbjct: 97  SGIDYGTAQYFTEIRVGTPAKKFRVVVDTGSELTWVNCRYRARG-KDNRRVFRADESKSF 155

Query: 210 SPLPCAAPQCK-------SLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSF----GNS 258
             + C    CK       SL      +  C Y   Y DGS   G    ET++     G  
Sbjct: 156 KTVGCLTQTCKVDLMNLFSLTTCPTPSTPCSYDYRYADGSAAQGVFAKETITVGLTNGRM 215

Query: 259 GSVKGIALGCGHDNEGL-FVGSAGLLGLGGGMLSLTKQIKATSL-----AYCLVDR--DS 310
             + G  +GC     G  F G+ G+LGL     S T    ATSL     +YCLVD   + 
Sbjct: 216 ARLPGHLIGCSSSFTGQSFQGADGVLGLAFSDFSFTST--ATSLYGAKFSYCLVDHLSNK 273

Query: 311 PASGVLEFNSARGGDAV---TAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEA 367
             S  L F S+R        T PL    ++  FY + + G S+G   + IP  ++  D  
Sbjct: 274 NVSNYLIFGSSRSTKTAFRRTTPLDLT-RIPPFYAINVIGISLGYDMLDIPSQVW--DAT 330

Query: 368 GDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTS--GVALFDTCYDF-SGLRSVR 424
             GG I+D GT++T L   AY  +     R    LK     GV + + C+ F SG    +
Sbjct: 331 SGGGTILDSGTSLTLLADAAYKQVVTGLARYLVELKRVKPEGVPI-EYCFSFTSGFNVSK 389

Query: 425 VPTVSLHFGAGKALDLPAKNYLIPVDSA-GTFCFAFAPTSS-ALSIIGNVQQQGTRVSFD 482
           +P ++ H   G   +   K+YL  VD+A G  C  F    + A ++IGN+ QQ     FD
Sbjct: 390 LPQLTFHLKGGARFEPHRKSYL--VDAAPGVKCLGFVSAGTPATNVIGNIMQQNYLWEFD 447

Query: 483 LANNRVGFTPNKC 495
           L  + + F P+ C
Sbjct: 448 LMASTLSFAPSAC 460


>gi|356558300|ref|XP_003547445.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 447

 Score =  143 bits (361), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 110/351 (31%), Positives = 155/351 (44%), Gaps = 25/351 (7%)

Query: 160 FSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQC 219
            + I +G PP    +V+DTGSDI W+ C PCT C      +FDP  SS++SPL C  P  
Sbjct: 102 MANISIGQPPIPQLVVMDTGSDILWVMCTPCTNCDNDLGLLFDPSKSSTFSPL-CKTP-- 158

Query: 220 KSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGS----VKGIALGCGHD-NEG 274
              D   CR +   + V Y D S   G    +TV F  +      +  +  GCGH+    
Sbjct: 159 --CDFEGCRCDPIPFTVTYADNSTASGTFGRDTVVFETTDEGTSRISDVLFGCGHNIGHD 216

Query: 275 LFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGD--AVTAPLI 332
              G  G+LGL  G  SL  ++     +YC+ +   P     +     G D    + P  
Sbjct: 217 TDPGHNGILGLNNGPDSLVTKL-GQKFSYCIGNLADPYYNYHQLILGEGADLEGYSTPF- 274

Query: 333 RNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLR 392
             +  + FYYV + G SVG + + I P  FEM E   GG+I+D G+ IT L    +  L 
Sbjct: 275 --EVYNGFYYVTMEGISVGEKRLDIAPETFEMKENRAGGVIIDTGSTITFLVDSVHKLLS 332

Query: 393 DSFVRLAGN--LKPTSGVALFDTCYDFSGLRS-VRVPTVSLHFGAGKALDLPAKNYLIPV 449
                L G    + T   + +  C+  S  R  V  P V+ HF  G  L L + ++   +
Sbjct: 333 KEVRNLLGWSFRQATIEKSPWMQCFYGSISRDLVGFPVVTFHFSDGADLALDSGSFFNQL 392

Query: 450 DSAGTFCFAFAPTS-----SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           +    FC    P S     S  S+IG + QQ   V +DL N  V F    C
Sbjct: 393 ND-NVFCMTVGPVSSLNIKSKPSLIGLLAQQSYNVGYDLVNQFVYFQRIDC 442


>gi|255556768|ref|XP_002519417.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
 gi|223541280|gb|EEF42831.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
          Length = 494

 Score =  143 bits (361), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 118/402 (29%), Positives = 184/402 (45%), Gaps = 45/402 (11%)

Query: 129 RHELKPAEAQILPEDFSTPVVSGASQGS------GEYFSRIGVGTPPRQFSMVLDTGSDI 182
           R  L+ A    L + F   VV  + QGS      G YF+R+ +GTPPR+F++ +DTGSD+
Sbjct: 48  RDHLRHAR---LLQGFVGGVVDFSVQGSSDPYLVGLYFTRVKLGTPPREFNVQIDTGSDV 104

Query: 183 NWLQCRPCTECYQQSD-----PIFDPKTSSSYSPLPCAAPQCKS---LDVSAC--RANRC 232
            W+ C  C+ C Q S        FD  +SS+   +PC+ P C S      + C  ++N+C
Sbjct: 105 LWVTCSSCSNCPQTSGLGIQLNYFDTTSSSTARLVPCSHPICTSQIQTTATQCPPQSNQC 164

Query: 233 LYQVAYGDGSFTVGDLVTETVSF----GNS---GSVKGIALGCGHDNEGLFVGS----AG 281
            Y   YGDGS T G  V++T  F    G S    S   I  GC     G    +     G
Sbjct: 165 SYAFQYGDGSGTSGYYVSDTFYFDAVLGESLIANSSAAIVFGCSTYQSGDLTKTDKAVDG 224

Query: 282 LLGLGGGMLSLTKQIKATSL-----AYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNKK 336
           + G G G LS+  Q+ +  +     ++CL   DS   G+L          V +PL+ ++ 
Sbjct: 225 IFGFGQGELSVISQLSSHGITPRVFSHCLKGEDS-GGGILVLGEILEPGIVYSPLVPSQP 283

Query: 337 VDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFV 396
               Y + L   +V GQ + I P+ F    + + G I+D GT +  L  +AY+    +  
Sbjct: 284 ---HYNLDLQSIAVSGQLLPIDPAAFA--TSSNRGTIIDTGTTLAYLVEEAYDPFVSAIT 338

Query: 397 RLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDS---AG 453
                L  T  +   + CY  S   S   P VS +F  G  + L  + YL+ + +   A 
Sbjct: 339 AAVSQLA-TPTINKGNQCYLVSNSVSEVFPPVSFNFAGGATMLLKPEEYLMYLTNYAGAA 397

Query: 454 TFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            +C  F      ++I+G++  +     +DLA+ R+G+    C
Sbjct: 398 LWCIGFQKIQGGITILGDLVLKDKIFVYDLAHQRIGWANYDC 439


>gi|356570895|ref|XP_003553619.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 470

 Score =  143 bits (361), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 121/390 (31%), Positives = 172/390 (44%), Gaps = 60/390 (15%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRP---CTEC-YQQSDPI----FDPKTSSS 208
           G Y   + +GTPP+    VLDTGS + W  C     C+ C +   DP     F PK SS+
Sbjct: 86  GGYSIDLNLGTPPQTSPFVLDTGSSLVWFPCTSHYLCSHCNFPNIDPTKIPTFIPKNSST 145

Query: 209 YSPLPCAAPQCKSL---DVSACRANRCL-------------YQVAYGDGSFTVGDLVTET 252
              L C  P+C  L   DV + R  +C              Y + YG G+ T G L+ + 
Sbjct: 146 AKLLGCRNPKCGYLFGPDVES-RCPQCKKPGSQNCSLTCPSYIIQYGLGA-TAGFLLLDN 203

Query: 253 VSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDR---D 309
           ++F    +V    +GC   +       +G+ G G G  SL  Q+     +YCLV     D
Sbjct: 204 LNFPGK-TVPQFLVGCSILS---IRQPSGIAGFGRGQESLPSQMNLKRFSYCLVSHRFDD 259

Query: 310 SPASGVLEFNSARGGDAVT-----APLIRNKKVDT----FYYVGLTGFSVGGQAVQIPPS 360
           +P S  L    +  GD  T      P   N   ++    +YYV L    VGG  V+IP  
Sbjct: 260 TPQSSDLVLQISSTGDTKTNGLSYTPFRSNPSNNSVFREYYYVTLRKLIVGGVDVKIPYK 319

Query: 361 LFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAG-------NLKPTSGVALFDT 413
             E    G+GG IVD G+  T ++   YN +   F+R  G       N++  SG++    
Sbjct: 320 FLEPGSDGNGGTIVDSGSTFTFMERPVYNLVAQEFLRQLGKKYSREENVEAQSGLS---P 376

Query: 414 CYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCF-------AFAPTSSAL 466
           C++ SG++++  P  +  F  G  +  P  NY   V  A   CF       A  P ++  
Sbjct: 377 CFNISGVKTISFPEFTFQFKGGAKMSQPLLNYFSFVGDAEVLCFTVVSDGGAGQPKTAGP 436

Query: 467 SII-GNVQQQGTRVSFDLANNRVGFTPNKC 495
           +II GN QQQ   V +DL N R GF P  C
Sbjct: 437 AIILGNYQQQNFYVEYDLENERFGFGPRNC 466


>gi|449516339|ref|XP_004165204.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 456

 Score =  143 bits (361), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 116/363 (31%), Positives = 171/363 (47%), Gaps = 30/363 (8%)

Query: 153 SQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPL 212
           ++GSG +   + +G+PP    +V+DTGS + W+QC PC  C+QQS   FDP  S S+  L
Sbjct: 99  NRGSG-FLVNLSIGSPPVTQLVVVDTGSSLLWVQCLPCINCFQQSTSWFDPLKSVSFKTL 157

Query: 213 PCAAPQCKSLDVSAC-RANRCLYQVAYGDGSFTVGDLVTETVSFG--NSGSVK--GIALG 267
            C  P    ++   C R N+  Y++ Y  G  + G L  E++ F   + G +K   I  G
Sbjct: 158 GCGFPGYNYINGYKCNRFNQAEYKLRYLGGDSSQGILAKESLLFETLDEGKIKKSNITFG 217

Query: 268 CGHDNEGLFVGSA--GLLGLGGG-MLSLTKQIKATSLAYCLVDRDSPASG----VLEFNS 320
           CGH N       A  G+ GLG    +++  Q+     +YC+ D ++P       VL   S
Sbjct: 218 CGHMNIKTNNDDAYNGVFGLGAYPHITMATQL-GNKFSYCIGDINNPLYTHNHLVLGQGS 276

Query: 321 ARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAI 380
              GD+    +         YYV L   SVG + ++I P+ F++   G GG+++D G   
Sbjct: 277 YIEGDSTPLQIHFGH-----YYVTLQSISVGSKTLKIDPNAFKISSDGSGGVLIDSGMTY 331

Query: 381 TRLQTQAYNSLRDSFVRL-AGNLKPTSGVALFD-TCYDFSGLRS---VRVPTVSLHFGAG 435
           T+L    +  L D  V L  G L+       F+  C  F G+ S   V  P V+ HF  G
Sbjct: 332 TKLANGGFELLYDEIVDLMKGLLERIPTQRKFEGLC--FKGVVSRDLVGFPAVTFHFAGG 389

Query: 436 KALDLPAKNYLIPVDSAGTFCFAFAPTSSA---LSIIGNVQQQGTRVSFDLANNRVGFTP 492
             L L + + L        FC A  P++S    LS+IG + QQ   V FDL   +V F  
Sbjct: 390 ADLVLESGS-LFRQHGGDRFCLAILPSNSELLNLSVIGILAQQNYNVGFDLEQMKVFFRR 448

Query: 493 NKC 495
             C
Sbjct: 449 IDC 451


>gi|225440720|ref|XP_002275202.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 479

 Score =  143 bits (360), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 119/419 (28%), Positives = 184/419 (43%), Gaps = 56/419 (13%)

Query: 121 QLAIYNVDR-HELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTG 179
            LA  ++ R H LK  +   L +      +S +    G +   +  GTPP++ S ++DTG
Sbjct: 54  HLATASLSRAHHLKHGKTSPLTQ------ISLSPHSYGGHSIPLSFGTPPQKLSFLVDTG 107

Query: 180 SDINWLQCRP---CTEC-----YQQSDPIFDPKTSSSYSPLPCAAPQCKS-------LDV 224
           S + W  C     CT C       +  PIF+PK SSS   L C  P+C +       L  
Sbjct: 108 SHVVWAPCTTHYTCTNCSFSDAEPKKVPIFNPKLSSSSKILGCRNPKCVNTSSPDVHLGC 167

Query: 225 SACRAN--RCL-----YQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFV 277
             C  N   C      Y + YG G+ + GD + E ++F    ++    +GC     G  V
Sbjct: 168 PPCNGNSKNCSHACPPYSLQYGTGA-SSGDFLLENLNFPGK-TIHEFLVGCTTSAVGE-V 224

Query: 278 GSAGLLGLGGGMLSLTKQIKATSLAYCLVDRD-----SPASGVLEFNSARGGDAVTAPLI 332
            SA L G G  M SL  Q+     AYCL   D     + +  +L+++         AP +
Sbjct: 225 TSAALAGFGRSMFSLPMQMGVKKFAYCLNSHDYDDTRNSSKLILDYSDGETKGLSYAPFL 284

Query: 333 RNK-KVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAY--- 388
           +N      +YY+G+    +G + ++IP         G GG+++D G A   +    +   
Sbjct: 285 KNPPDFPIYYYLGVKDIKIGNKLLRIPSKYLAPGSDGRGGLMIDSGFAYGYMTGPVFKKV 344

Query: 389 -NSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNY-- 445
            N L+    +   +L+  + + +   CY+F+G +S+++P +   F  G  + +P KNY  
Sbjct: 345 TNELKKRMSKYRRSLEAEAEIGV-TPCYNFTGQKSIKIPDLIYQFRGGATMVVPGKNYFV 403

Query: 446 LIPVDS---------AGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           LIP  S         AGT    F P  S   I+GN Q     V FDL N R+GF    C
Sbjct: 404 LIPEISLACFPLTTDAGTNTLEFTPGPSI--ILGNSQHVDYYVEFDLKNERLGFRQQTC 460


>gi|357440289|ref|XP_003590422.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355479470|gb|AES60673.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 498

 Score =  143 bits (360), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 109/372 (29%), Positives = 173/372 (46%), Gaps = 38/372 (10%)

Query: 155 GSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSY 209
           G G Y +++ +GTPPR+F++ +DTGSDI W+ C  C+ C + S        FD   SS+ 
Sbjct: 80  GYGLYTTKVKMGTPPREFTVQIDTGSDILWINCNTCSNCPKSSGLGIELNFFDTVGSSTA 139

Query: 210 SPLPCAAPQCKSLDVSAC-----RANRCLYQVAYGDGSFTVGDLVTETVSF--------- 255
           + +PC+ P C S    A      + N+C Y   Y DGS T G  V++ + F         
Sbjct: 140 ALVPCSDPMCASAIQGAAAQCSPQVNQCSYTFQYEDGSGTSGVYVSDAMYFDMILGQSTP 199

Query: 256 GNSGSVKGIALGCGHDNEGLFVGS----AGLLGLGGGMLSLTKQIKATSL-----AYCLV 306
            N  S   I  GC     G    +     G+LG G G LS+  Q+ +  +     ++CL 
Sbjct: 200 ANVASSATIVFGCSTYQSGDLTKTDKAVDGILGFGPGELSVVSQLSSRGITPKVFSHCL- 258

Query: 307 DRDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDE 366
             D    G+L          V +PL+ ++     Y + L   +V GQ + I P++F   +
Sbjct: 259 KGDGNGGGILVLGEILEPSIVYSPLVPSQP---HYNLNLQSIAVNGQVLSINPAVFATSD 315

Query: 367 AGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVP 426
               G I+D GT ++ L  +AY+ L ++ V  A +   TS ++    CY          P
Sbjct: 316 --KRGTIIDSGTTLSYLVQEAYDPLVNA-VDTAVSQFATSFISKGSQCYLVLTSIDDSFP 372

Query: 427 TVSLHFGAGKALDLPAKNYLIP---VDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDL 483
           TVS +F  G ++DL    YL+     D A  +C  F      ++I+G++  +   V +DL
Sbjct: 373 TVSFNFEGGASMDLKPSQYLLNRGFQDGAKMWCIGFQKVQEGVTILGDLVLKDKIVVYDL 432

Query: 484 ANNRVGFTPNKC 495
           A  ++G+T   C
Sbjct: 433 ARQQIGWTNYDC 444


>gi|17979392|gb|AAL49921.1| unknown protein [Arabidopsis thaliana]
          Length = 439

 Score =  143 bits (360), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 117/373 (31%), Positives = 171/373 (45%), Gaps = 36/373 (9%)

Query: 150 SGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSY 209
           SG   G+ +YF+ I VGTP ++F +V+DTGS++ W+ CR      + +  +F    S S+
Sbjct: 75  SGIDYGTAQYFTEIRVGTPAKKFRVVVDTGSELTWVNCRYRARG-KDNRRVFRADESKSF 133

Query: 210 SPLPCAAPQCK-------SLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSF----GNS 258
             + C    CK       SL      +  C Y   Y DGS   G    ET++     G  
Sbjct: 134 KTVGCLTQTCKVDLMNLFSLTTCPTPSTPCSYDYRYADGSAAQGVFAKETITVGLTNGRM 193

Query: 259 GSVKGIALGCGHDNEGL-FVGSAGLLGLGGGMLSLTKQIKATSL-----AYCLVDR--DS 310
             + G  +GC     G  F G+ G+LGL     S T    ATSL     +YCLVD   + 
Sbjct: 194 ARLPGHLIGCSSSFTGQSFQGADGVLGLAFSDFSFTST--ATSLYGAKFSYCLVDHLSNK 251

Query: 311 PASGVLEFNSARGGDAV---TAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEA 367
             S  L F S+R        T PL    ++  FY + + G S+G   + IP  ++  D  
Sbjct: 252 NVSNYLIFGSSRSTKTAFRRTTPLDLT-RIPPFYAINVIGISLGYDMLDIPSQVW--DAT 308

Query: 368 GDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTS--GVALFDTCYDF-SGLRSVR 424
             GG I+D GT++T L   AY  +     R    LK     GV + + C+ F SG    +
Sbjct: 309 SGGGTILDSGTSLTLLADAAYKQVVTGLARYLVELKRVKPEGVPI-EYCFSFTSGFNVSK 367

Query: 425 VPTVSLHFGAGKALDLPAKNYLIPVDSA-GTFCFAFAPTSS-ALSIIGNVQQQGTRVSFD 482
           +P ++ H   G   +   K+YL  VD+A G  C  F    + A ++IGN+ QQ     FD
Sbjct: 368 LPQLTFHLKGGARFEPHRKSYL--VDAAPGVKCLGFVSAGTPATNVIGNIMQQNYLWEFD 425

Query: 483 LANNRVGFTPNKC 495
           L  + + F P+ C
Sbjct: 426 LMASTLSFAPSAC 438


>gi|356556809|ref|XP_003546713.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-1-like [Glycine max]
          Length = 444

 Score =  143 bits (360), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 110/366 (30%), Positives = 163/366 (44%), Gaps = 32/366 (8%)

Query: 147 PVVSGAS-QGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKT 205
           P+ SG     S  Y  R   GTP +   + +DT +D  W+ C  C  C   +   F P  
Sbjct: 93  PIASGRQITQSPTYIVRAKFGTPAQTLLLAMDTSNDAAWVPCTACVGCSTTTP--FAPPK 150

Query: 206 SSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIA 265
           S+++  + C A QCK +    C  + C +   YG  S     LV +TV+   +  V    
Sbjct: 151 STTFKKVGCGASQCKQVRNPTCDGSACAFNFTYGTSS-VAASLVQDTVTLA-TDPVPAYT 208

Query: 266 LGCGHDNEGLFV---GSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNS-- 320
            GC     G  +   G  GL      +L+ T+++  ++ +YCL     P+   L F+   
Sbjct: 209 FGCIQKATGSSLPPQGLLGLGRGPLSLLAQTQKLYQSTFSYCL-----PSFKTLNFSGHX 263

Query: 321 -----ARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVD 375
                A+  D V  P  +N +  + YYV L    VG + V IPP     +     G + D
Sbjct: 264 DLXPVAQPRDQV-YPSFKNPRRSSLYYVNLVAIRVGRRIVDIPPEALAFNPXTGAGTVFD 322

Query: 376 CGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVAL--FDTCYDFSGLRSVRVPTVSLHFG 433
            GT  TRL   AY ++R+ F R     K  +  +L  FDTCY       +  PT++  F 
Sbjct: 323 SGTVFTRLVEPAYTAVRNEFRRRVSVHKKLTVTSLGGFDTCYTV----PIVAPTITFMF- 377

Query: 434 AGKALDLPAKNYLIPVDSAGTFCFAFAP----TSSALSIIGNVQQQGTRVSFDLANNRVG 489
           +G  + LP  N LI   +    C A AP     +S L++I N+QQQ  RV FD+ N+R+G
Sbjct: 378 SGMNVTLPPDNILIHSTAGSVTCLAMAPAPDNVNSVLNVIANMQQQNHRVLFDVPNSRLG 437

Query: 490 FTPNKC 495
                C
Sbjct: 438 VARELC 443


>gi|297825301|ref|XP_002880533.1| hypothetical protein ARALYDRAFT_481251 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297326372|gb|EFH56792.1| hypothetical protein ARALYDRAFT_481251 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 430

 Score =  143 bits (360), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 101/351 (28%), Positives = 158/351 (45%), Gaps = 26/351 (7%)

Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQS--DPIFDPKTSSSYSPLPCAA 216
           +F    VG PP     ++DTGS + W+QC PC  C       P+F+P  SS++    C  
Sbjct: 68  FFVNFSVGQPPVPQFTIMDTGSSLLWIQCHPCKHCSSNHMIHPVFNPALSSTFVECSCDD 127

Query: 217 PQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSF----GNSGSVKGIALGCGHDN 272
             C+      C +N+C+Y+  Y  G+ + G L  E ++F    GN+   + IA GCGH+N
Sbjct: 128 RFCRYAPNGHCSSNKCVYEQVYISGTGSKGVLAKERLTFTTPNGNTVVTQPIAFGCGHEN 187

Query: 273 -EGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGDAVTAPL 331
            E L     G+LGLG    SL  Q+  +  +YC+ D  +   G  +       D +  P 
Sbjct: 188 GEQLESEFTGILGLGAKPTSLAVQL-GSKFSYCIGDLANKNYGYNQLVLGEDADILGDPT 246

Query: 332 -IRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNS 390
            I  +  +  YY+ L G SVG + + I P +F+       G+I+D GT  T L   AY  
Sbjct: 247 PIEFETENGIYYMNLEGISVGDKQLNIEPVVFKR-RGSRTGVILDTGTLYTWLADIAY-- 303

Query: 391 LRDSFVRLAGNLKPTSGVALFDTCYDFSGLRS---VRVPTVSLHFGAGKALDLPAKNYLI 447
            R+ +  +   L P      F     + G  +   +  P V+ HF  G  L + A +   
Sbjct: 304 -RELYNEIKSILDPKLERFWFRDFLCYHGRVNEELIGFPVVTFHFAGGAELAMEATSMFY 362

Query: 448 PVDSAGT----FCFAFAPTSSA------LSIIGNVQQQGTRVSFDLANNRV 488
           P+  + T    FC +  PT+         + IG + QQ   +++DL    +
Sbjct: 363 PMTESDTYHNVFCMSVRPTTEHGGEYKDFTAIGLMAQQYYNIAYDLKERNI 413


>gi|356540510|ref|XP_003538731.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 417

 Score =  142 bits (359), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 128/401 (31%), Positives = 180/401 (44%), Gaps = 66/401 (16%)

Query: 153 SQGSGEYFSRIGVGTPPRQ-FSMVLDTGSDINWLQCRP--CTEC---YQQSDPIF----- 201
           S    +Y     +G+ P Q  ++ +DTGSD+ W  C P  C  C   +  + P+      
Sbjct: 13  SNRESDYTLSFNLGSHPSQSITLYMDTGSDLVWFPCAPFECILCEGKFNATKPLNITRSH 72

Query: 202 -----DPKTSSSYSPLP----CAAPQC--KSLDVSACRANRCL-YQVAYGDGSFTVGDLV 249
                 P  S+++S +     CA  +C   +++ S C +  C  +  AYGDGSF +  L 
Sbjct: 73  RVSCQSPACSTAHSSVSSHDLCAIARCPLDNIETSDCSSATCPPFYYAYGDGSF-IAHLH 131

Query: 250 TETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS------LAY 303
            +T+S      +K    GC H          G+ G G G+LSL  Q+   S       +Y
Sbjct: 132 RDTLSMSQL-FLKNFTFGCAHT---ALAEPTGVAGFGRGLLSLPAQLATLSPNLGNRFSY 187

Query: 304 CLV----DRD---SPASGVL----EFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGG 352
           CLV    D++    P+  +L    +++S R  + V   ++RN K   FY VGLTG SVG 
Sbjct: 188 CLVSHSFDKERVRKPSPLILGHYDDYSSERV-EFVYTSMLRNPKHSYFYCVGLTGISVGK 246

Query: 353 QAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFD 412
           + +  P  L  +D  GDGG++VD GT  T L    YNS+   F R  G +   +      
Sbjct: 247 RTILAPEMLRRVDRRGDGGVVVDSGTTFTMLPASLYNSVVAEFDRRVGRVHKRASEVEEK 306

Query: 413 T----CYDFSGLRSVRVPTVSLHF-GAGKALDLPAKNYLIPV----DSA----GTFCFAF 459
           T    CY   GL  V VPTV+ HF G    + LP  NY        D A    G      
Sbjct: 307 TGLGPCYFLEGL--VEVPTVTWHFLGNNSNVMLPRMNYFYEFLDGEDEARRKVGCLMLMN 364

Query: 460 APTSSALS-----IIGNVQQQGTRVSFDLANNRVGFTPNKC 495
               + LS     I+GN QQQG  V +DL N RVGF   +C
Sbjct: 365 GGDDTELSGGPGAILGNYQQQGFEVVYDLENQRVGFAKRQC 405


>gi|56784779|dbj|BAD82000.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
           Group]
          Length = 486

 Score =  142 bits (359), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 111/372 (29%), Positives = 168/372 (45%), Gaps = 24/372 (6%)

Query: 138 QILPEDFSTPVVSGASQ----GSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTEC 193
           Q +P D       G SQ     +G Y     VGTPP+  + VLD  SD  W+QC  C  C
Sbjct: 72  QAVPADGGENGGGGQSQDPATNTGMYVLSFSVGTPPQVVTGVLDITSDFVWMQCSACATC 131

Query: 194 -----YQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANR--CLYQVAYGDGSF--T 244
                   S P F    SS+   + CA   C+ L    C A+   C Y   YG G+   T
Sbjct: 132 GADAPAATSAPPFYAFLSSTIREVRCANRGCQRLVPQTCSADDSPCGYSYVYGGGAANTT 191

Query: 245 VGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYC 304
            G L  +  +F       G+  GC    EG      G++GLG G LS   Q++    +Y 
Sbjct: 192 AGLLAVDAFAFATV-RADGVIFGCAVATEGDI---GGVIGLGRGELSPVSQLQIGRFSYY 247

Query: 305 LVDRDSPASG--VLEFNSA--RGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPS 360
           L   D+   G  +L  + A  R   AV+ PL+ ++   + YYV L G  V G+ + IP  
Sbjct: 248 LAPDDAVDVGSFILFLDDAKPRTSRAVSTPLVASRASRSLYYVELAGIRVDGEDLAIPRG 307

Query: 361 LFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVAL-FDTCYDFSG 419
            F++   G GG+++     +T L   AY  +R +       L+   G  L  D CY    
Sbjct: 308 TFDLQADGSGGVVLSITIPVTFLDAGAYKVVRQAMASKI-ELRAADGSELGLDLCYTSES 366

Query: 420 LRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSAL-SIIGNVQQQGTR 478
           L + +VP+++L F  G  ++L   NY     + G  C    P+ +   S++G++ Q GT 
Sbjct: 367 LATAKVPSMALVFAGGAVMELEMGNYFYMDSTTGLECLTILPSPAGDGSLLGSLIQVGTH 426

Query: 479 VSFDLANNRVGF 490
           + +D++ +R+ F
Sbjct: 427 MIYDISGSRLVF 438


>gi|225465837|ref|XP_002264626.1| PREDICTED: aspartic proteinase nepenthesin-1-like isoform 1 [Vitis
           vinifera]
          Length = 437

 Score =  142 bits (358), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 107/346 (30%), Positives = 163/346 (47%), Gaps = 19/346 (5%)

Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQ 218
           Y  R  +GTP +   M +DT SD+ W+ C  C  C   S  +F+   S++Y  L C A Q
Sbjct: 101 YIVRAKIGTPAQTMLMAMDTSSDVAWIPCNGCLGC---SSTLFNSPASTTYKSLGCQAAQ 157

Query: 219 CKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVG 278
           CK +    C    C + + YG GS    +L  +T++   + +V G + GC     G  + 
Sbjct: 158 CKQVPKPTCGGGVCSFNLTYG-GSSLAANLSQDTITLA-TDAVPGYSFGCIQKATGGSLP 215

Query: 279 S---AGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPA-SGVLEFNSARGGDAVT-APLIR 333
           +    GL      +LS T+ +  ++ +YCL    S   SG L          +   PL++
Sbjct: 216 AQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPSFKSLNFSGSLRLGPVGQPKRIKYTPLLK 275

Query: 334 NKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRD 393
           N +  + Y+V L    VG + V +PP  F  + +   G I D GT  TRL T AY ++RD
Sbjct: 276 NPRRPSLYFVNLMAVRVGRRVVDVPPGSFTFNPSTGAGTIFDSGTVFTRLVTPAYIAVRD 335

Query: 394 SFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAG 453
           +F    G     + +  FDTCY       +  PT++  F  G  + LP  N LI   +  
Sbjct: 336 AFRNRVGRNLTVTSLGGFDTCYTV----PIAAPTITFMF-TGMNVTLPPDNLLIHSTAGS 390

Query: 454 TFCFAFAP----TSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           T C A A      +S L++I N+QQQ  R+ +D+ N+R+G     C
Sbjct: 391 TTCLAMAAAPDNVNSVLNVIANLQQQNHRLLYDVPNSRLGVARELC 436


>gi|357125298|ref|XP_003564331.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 524

 Score =  142 bits (358), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 120/372 (32%), Positives = 167/372 (44%), Gaps = 62/372 (16%)

Query: 173 SMVLDTGSDINWLQCRPCTECYQQS--DPIFDPKTSSSYSPLPCAAPQCKSLD------- 223
           +M +DT  DI W+QCRPC         + +FDP  S S + +PC +  C++L        
Sbjct: 166 TMAIDTTIDIPWIQCRPCPPPQCYPQRNALFDPTKSFSAAAVPCGSRACRALGNYGNGCS 225

Query: 224 ------------VSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHD 271
                        S      C Y+VAY DG  + G  +T+ ++     S      GC H 
Sbjct: 226 NNSRRNKKKNKSKSNNSTGDCNYRVAYSDGRVSSGTYMTDILTISPGTSFLNFRFGCSHG 285

Query: 272 NEGLFVG-SAGLLGLGGG---MLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGDA- 326
             G F G ++G + LGGG   +LS T +    + +YC V + S ASG L    A      
Sbjct: 286 VRGSFSGETSGTMSLGGGRQSLLSQTARAYGNAFSYC-VPKPS-ASGFLSLGGAINDGDS 343

Query: 327 --------VTAPLIRNKKV--DTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDC 376
                   VT PL+RN ++   T+Y V L G  V G+ + +PP +F       GG ++D 
Sbjct: 344 DSDSPSSFVTTPLMRNARIVNPTYYVVRLQGIDVAGRRLNVPPVVFS------GGTLMDS 397

Query: 377 GTAITRLQTQAYNSLRDSFVRLAGNLK-----------PTSGVALFDTCYDFSGLRSVRV 425
              +T+L   AY +LR +F       +           P  G  + DTCYDF GL +V V
Sbjct: 398 SAVVTQLPPTAYRALRLAFRNAMRGYRMNTRNGSTSSTPAGGEMILDTCYDFEGLDNVTV 457

Query: 426 PTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSS--ALSIIGNVQQQGTRVSFDL 483
           PTVSL F  G  +DL     ++        C AF PT +   L  IGNVQQQ   V +D+
Sbjct: 458 PTVSLVFFGGAVVDLDPTTAVMMEG-----CLAFVPTPADFDLGFIGNVQQQTHEVLYDV 512

Query: 484 ANNRVGFTPNKC 495
               VGF    C
Sbjct: 513 GARNVGFRRGAC 524


>gi|413944032|gb|AFW76681.1| hypothetical protein ZEAMMB73_606599 [Zea mays]
          Length = 315

 Score =  142 bits (358), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 100/315 (31%), Positives = 150/315 (47%), Gaps = 29/315 (9%)

Query: 206 SSSYSPLPCAAPQCK---SLDVSACRAN--RCLYQVAYGDGSFTVGDLVTETVSF----G 256
           SS++  + C  P C+    + VSAC     +C Y  +YGD S T G +  +T +F    G
Sbjct: 2   SSTFKAVACPDPICRPSSGVSVSACAMENFQCFYLCSYGDRSITAGHIFKDTFTFMSPNG 61

Query: 257 NSGSVKGIALGCGHDNEGLFVGS-AGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGV 315
              +V  +A GCG  N GLFV + +G+ G G G  SL  Q+K    +YCL       S V
Sbjct: 62  VPVAVSELAFGCGDYNTGLFVSNESGIAGFGRGPQSLPSQLKVGRFSYCLTLVTESKSSV 121

Query: 316 LEFNSARGGDAVTA---------PLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDE 366
           +   +    D + A         P+I N  + TFYY+ L G +VG   +    S+F + +
Sbjct: 122 VILGTPPDPDGLRAHTTGPFQSTPIIYNPLIPTFYYLSLEGITVGKTRLPFDKSVFALKK 181

Query: 367 AGDGGIIVDCGTAITRLQTQAYNSLRDSFVRL----AGNLKPTSGVALFDTCYDF-SGLR 421
            G GG ++D GT++T L    +  L++  V        +  P  G  L   C+    G +
Sbjct: 182 DGSGGTVIDSGTSLTTLPEAVFELLQEELVAQFPLPRYDNTPEVGDRL---CFRRPKGGK 238

Query: 422 SVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAF-APTSSALSIIGNVQQQGTRVS 480
            V VP + LH  AG  +DLP  NY +    +G  C        + + +IGN QQQ   V 
Sbjct: 239 QVPVPKLILHL-AGADMDLPRDNYFVEEPDSGVMCLQINGAEDTTMVLIGNFQQQNMHVV 297

Query: 481 FDLANNRVGFTPNKC 495
           +D+ NN++ F P +C
Sbjct: 298 YDVENNKLLFAPAQC 312


>gi|242085924|ref|XP_002443387.1| hypothetical protein SORBIDRAFT_08g018620 [Sorghum bicolor]
 gi|241944080|gb|EES17225.1| hypothetical protein SORBIDRAFT_08g018620 [Sorghum bicolor]
          Length = 460

 Score =  142 bits (357), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 121/389 (31%), Positives = 176/389 (45%), Gaps = 43/389 (11%)

Query: 145 STPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPC--TECYQQSDPIFD 202
           S P+    +Q   EY     +G PP+Q + ++DTGS++ W QC  C    C+ Q    +D
Sbjct: 74  SAPIHWNETQYIAEYL----IGDPPQQAAAIIDTGSNLIWTQCSTCRANGCFGQDLTFYD 129

Query: 203 PKTSSSYSPLPCAAPQCKSLDVSAC--RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGS 260
           P  S +  P+ C    C     + C      C    AYG G+   G L TE  +FG+  S
Sbjct: 130 PSRSRTAKPVACNDTACLLGSETRCARDGKACAVLTAYGAGAIG-GFLGTEVFTFGHGQS 188

Query: 261 VKG---IALGC---GHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASG 314
            +    +A GC        G   G++G++GLG G LSL  Q+     +YCL    S A+ 
Sbjct: 189 SENNVSLAFGCITASRLTPGSLDGASGIIGLGRGKLSLPSQLGDNKFSYCLTPYFSDAAN 248

Query: 315 VLEF-------NSARGGDAVTAPLIRN---KKVDTFYYVGLTGFSVGGQAVQIPPSLFEM 364
                       S  G  A + P ++N      D+FYY+ LTG +VG   + +P + F++
Sbjct: 249 TSTLFVGASAGLSGGGAPATSVPFLKNPDDDPFDSFYYLPLTGITVGTAKLDVPAAAFDL 308

Query: 365 DE---AGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGN--LKPTSGVALFDTCYD--F 417
            E   A  GG ++D G+  T L   AY +LRD  VR  G   + P +G    D C     
Sbjct: 309 REVAPAKWGGTLIDSGSPFTSLIDVAYQALRDELVRQLGASVVPPPAGAEGLDLCVGGVA 368

Query: 418 SGLRSVRVPTVSLHFGAGKA----LDLPAKNYLIPVDSAGTFCFAFA---PTSS----AL 466
            G     VP + LHFG+G      + +P +NY  PVD +      F+   P S+      
Sbjct: 369 PGDAGKLVPPLVLHFGSGGGGGGDVVVPPENYWGPVDDSTACMVVFSSGGPNSTLPLNET 428

Query: 467 SIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           +IIGN  QQ   + +DL    + F P  C
Sbjct: 429 TIIGNYMQQDMHLLYDLGQGVLSFQPADC 457


>gi|147771308|emb|CAN69536.1| hypothetical protein VITISV_043237 [Vitis vinifera]
          Length = 372

 Score =  142 bits (357), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 111/347 (31%), Positives = 167/347 (48%), Gaps = 21/347 (6%)

Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQ 218
           Y  R  +GTP +   M +DT SD+ W+ C  C  C   S  +F+   S++Y  L C A Q
Sbjct: 36  YIVRAKIGTPAQTMLMAMDTSSDVAWIPCNGCLGC---SSTLFNSPASTTYKSLGCQAAQ 92

Query: 219 CKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVG 278
           CK +    C    C + + YG GS    +L  +T++   + +V G + GC     G  + 
Sbjct: 93  CKQVPKPTCGGGVCSFNLTYG-GSSLAANLSQDTITLA-TDAVPGYSFGCIQKATGGSLP 150

Query: 279 S---AGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPA-SGVLEFNSARGGDAVT-APLIR 333
           +    GL      +LS T+ +  ++ +YCL    S   SG L          +   PL++
Sbjct: 151 AQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPSFKSLNFSGSLRLGPVGQPKRIKYTPLLK 210

Query: 334 NKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRD 393
           N +  + Y+V L    VG + V +PP  F  + +   G I D GT  TRL T AY ++RD
Sbjct: 211 NPRRPSLYFVNLMAVRVGRRVVDVPPGSFTFNPSTGAGTIFDSGTVFTRLVTPAYIAVRD 270

Query: 394 SFV-RLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSA 452
           +F  R+  NL  TS +  FDTCY       +  PT++  F  G  + LP  N LI   + 
Sbjct: 271 AFRNRVGRNLTVTS-LGGFDTCYTV----PIAAPTITFMF-TGMNVTLPPDNLLIHSTAG 324

Query: 453 GTFCFAFAP----TSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            T C A A      +S L++I N+QQQ  R+ +D+ N+R+G     C
Sbjct: 325 STTCLAMAAAPDNVNSVLNVIANLQQQNHRLLYDVPNSRLGVARELC 371


>gi|259490398|ref|NP_001159203.1| uncharacterized protein LOC100304289 [Zea mays]
 gi|223942623|gb|ACN25395.1| unknown [Zea mays]
          Length = 378

 Score =  142 bits (357), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 120/383 (31%), Positives = 183/383 (47%), Gaps = 42/383 (10%)

Query: 147 PVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPI--FDPK 204
           P+ SGA  G+G+YF R  VGTP + F +V DTGSD+ W++CR          P   F   
Sbjct: 2   PLSSGAYTGTGQYFVRFRVGTPAQPFVLVADTGSDLTWVKCRGAAGPPASDPPAREFRAS 61

Query: 205 TSSSYSPLPCAAPQCKS---LDVSACR--ANRCLYQVAYGDGSFTVGDLVTETVSFG--- 256
            S S++PL C++  C S     ++ C   A+ C Y   Y DGS   G + T+  +     
Sbjct: 62  ESRSWAPLACSSDTCTSYVPFSLANCSSPASPCAYDYRYKDGSAARGVVGTDAATIALSG 121

Query: 257 -----------NSGSVKGIALGCGHDNEGL-FVGSAGLLGLGGGMLSLTKQIKAT---SL 301
                          ++G+ LGC    +G  F  S G+L LG   +S   +  A      
Sbjct: 122 SGSEDGSGGGGRRAKLQGVVLGCTATYDGQSFQSSDGVLSLGNSNISFASRAAARFGGRF 181

Query: 302 AYCLVDRDSP--ASGVLEFNSARGGDAVTA---PLIRNKKVDTFYYVGLTGFSVGGQAVQ 356
           +YCLVD  +P  AS  L F     G    A   PL+ +++V  FY V +    V G+A+ 
Sbjct: 182 SYCLVDHLAPRNASSYLTFGPGPEGGGAPAARTPLVLDRRVSPFYAVAVDAVYVAGEALD 241

Query: 357 IPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVAL--FDTC 414
           IP  ++++     GG I+D GT++T L T AY ++      L G L     VA+  F+ C
Sbjct: 242 IPADVWDVGRG--GGAILDSGTSLTVLATPAYRAV---VAALGGRLAALPRVAMDPFEYC 296

Query: 415 YDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSA-GTFCFAFAPTS-SALSIIGNV 472
           Y+++   +  +P + + F     L+ PAK+Y+I  D+A G  C      +   +S+IGN+
Sbjct: 297 YNWTA-GAPEIPKLEVSFAGSARLEPPAKSYVI--DAAPGVKCIGVQEGAWPGVSVIGNI 353

Query: 473 QQQGTRVSFDLANNRVGFTPNKC 495
            QQ     FDL +  + F   +C
Sbjct: 354 LQQEHLWEFDLRDRWLRFKHTRC 376


>gi|18405138|ref|NP_565911.1| Eukaryotic aspartyl protease family protein [Arabidopsis thaliana]
 gi|13877759|gb|AAK43957.1|AF370142_1 unknown protein [Arabidopsis thaliana]
 gi|15293231|gb|AAK93726.1| unknown protein [Arabidopsis thaliana]
 gi|20196976|gb|AAB87120.2| expressed protein [Arabidopsis thaliana]
 gi|20197046|gb|AAM14894.1| expressed protein [Arabidopsis thaliana]
 gi|330254616|gb|AEC09710.1| Eukaryotic aspartyl protease family protein [Arabidopsis thaliana]
          Length = 442

 Score =  142 bits (357), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 118/374 (31%), Positives = 173/374 (46%), Gaps = 57/374 (15%)

Query: 163 IGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKS- 221
           + VG PP+  SMVLDTGS+++WL C+           +F+P +SS+YSP+PC++P C++ 
Sbjct: 69  LAVGDPPQNISMVLDTGSELSWLHCKKSPNL----GSVFNPVSSSTYSPVPCSSPICRTR 124

Query: 222 ---LDVSAC---RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHD---- 271
              L + A    + + C   ++Y D +   G+L  ET   G S +  G   GC       
Sbjct: 125 TRDLPIPASCDPKTHLCHVAISYADATSIEGNLAHETFVIG-SVTRPGTLFGCMDSGLSS 183

Query: 272 NEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSAR----GGDAV 327
           N      S GL+G+  G LS   Q+  +  +YC+   DS  SG L    A     G    
Sbjct: 184 NSEEDAKSTGLMGMNRGSLSFVNQLGFSKFSYCISGSDS--SGFLLLGDASYSWLGPIQY 241

Query: 328 TAPLIRNKKVDTF----YYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRL 383
           T  ++++  +  F    Y V L G  VG + + +P S+F  D  G G  +VD GT  T L
Sbjct: 242 TPLVLQSTPLPYFDRVAYTVQLEGIRVGSKILSLPKSVFVPDHTGAGQTMVDSGTQFTFL 301

Query: 384 QTQAYNSLRDSFVRLAGNLKPTSGVALF------DTCY--------DFSGLRSVRVPTVS 429
               Y +L++ F+    ++        F      D CY        +FSGL     P VS
Sbjct: 302 MGPVYTALKNEFITQTKSVLRLVDDPDFVFQGTMDLCYKVGSTTRPNFSGL-----PMVS 356

Query: 430 LHFGAGKALDLPAKNYLIPVDSAGT------FCFAFAPTSSALSI----IGNVQQQGTRV 479
           L F  G  + +  +  L  V+ AG+      +CF F   S  L I    IG+  QQ   +
Sbjct: 357 LMF-RGAEMSVSGQKLLYRVNGAGSEGKEEVYCFTFG-NSDLLGIEAFVIGHHHQQNVWM 414

Query: 480 SFDLANNRVGFTPN 493
            FDLA +RVGF  N
Sbjct: 415 EFDLAKSRVGFAGN 428


>gi|359482097|ref|XP_002271077.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 458

 Score =  141 bits (356), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 122/375 (32%), Positives = 179/375 (47%), Gaps = 58/375 (15%)

Query: 163 IGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQC--K 220
           + VGTPP+  SMVLDTGS+++WL+C   T+ +Q +   FDP  SSSYSP+PC++  C  +
Sbjct: 89  LTVGTPPQNVSMVLDTGSELSWLRCNK-TQTFQTT---FDPNRSSSYSPVPCSSLTCTDR 144

Query: 221 SLDV---SACRANR-CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHD----N 272
           + D    ++C +N+ C   ++Y D S + G+L ++T   GNS  + G   GC       N
Sbjct: 145 TRDFPIPASCDSNQLCHAILSYADASSSEGNLASDTFYIGNS-DMPGTIFGCMDSSFSTN 203

Query: 273 EGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGD------- 325
                 + GL+G+  G LS   Q+     +YC+ D D   SGVL    A           
Sbjct: 204 TEEDSKNTGLMGMNRGSLSFVSQMDFPKFSYCISDSD--FSGVLLLGDANFSWLMPLNYT 261

Query: 326 ---AVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITR 382
               ++ PL    +V   Y V L G  V  + + +P S+F  D  G G  +VD GT  T 
Sbjct: 262 PLIQISTPLPYFDRVA--YTVQLEGIKVSSKLLPLPKSVFVPDHTGAGQTMVDSGTQFTF 319

Query: 383 LQTQAYNSLRDSFVRLAGNLKPTSGV--ALFDTCYDFSGLRSV------------RVPTV 428
           L    Y++LR+ F      L  TS +   L D  Y F G   +             +PTV
Sbjct: 320 LLGPVYSALRNEF------LNQTSQILRVLEDPNYVFQGGMDLCYRVPLSQTSLPWLPTV 373

Query: 429 SLHF-GAGKALDLPAKNYLIPVDSAGT---FCFAFAPTSSALS----IIGNVQQQGTRVS 480
           SL F GA   +      Y +P +  G+   +CF F   S  L+    +IG+  QQ   + 
Sbjct: 374 SLMFRGAEMKVSGDRLLYRVPGEVRGSDSVYCFTFG-NSDLLAVEAYVIGHHHQQNVWME 432

Query: 481 FDLANNRVGFTPNKC 495
           FDL  +R+GF   +C
Sbjct: 433 FDLEKSRIGFAQVQC 447


>gi|125595873|gb|EAZ35653.1| hypothetical protein OsJ_19940 [Oryza sativa Japonica Group]
          Length = 468

 Score =  141 bits (356), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 114/336 (33%), Positives = 149/336 (44%), Gaps = 45/336 (13%)

Query: 174 MVLDTGSDINWLQCRPCT--ECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANR 231
           M +DT  D+ W+QC PC   ECY Q + +FDP+ S + + +PC +  C  L     R  R
Sbjct: 164 MSIDTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGELG----RYGR 219

Query: 232 CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGS-AGLLGLGGGML 290
            L Q                 +        +     C H   G F  S +G + LGGG  
Sbjct: 220 WLLQQP------------VPVLRRLRRRQGQPRGRTC-HAVRGNFSASTSGTMSLGGGRQ 266

Query: 291 SLTKQIKAT---SLAYCLVDRDSPASGVLEFNSARGGDA----VTAPLIRNKKV-DTFYY 342
           SL  Q  AT   + +YC+ D  S  SG L       G         PL+RN  +  T Y 
Sbjct: 267 SLLSQTAATFGNAFSYCVPDPSS--SGFLSLGGPADGGGAGRFARTPLVRNPSIIPTLYL 324

Query: 343 VGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVR-LAGN 401
           V L G  VGG+ + +PP +F       GG ++D    IT+L   AY +LR +F   +A  
Sbjct: 325 VRLRGIEVGGRRLNVPPVVFA------GGAVMDSSVIITQLPPTAYRALRLAFRSAMAAY 378

Query: 402 LKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAP 461
            +   G A  DTCYDF    SV VP VSL F  G  + L A   ++        C AF P
Sbjct: 379 PRVAGGRAGLDTCYDFVRFTSVTVPAVSLVFDGGAVVRLDAMGVMV------EGCLAFVP 432

Query: 462 TSS--ALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           T    AL  IGNVQQQ   V +D+    VGF    C
Sbjct: 433 TPGDFALGFIGNVQQQTHEVLYDVGGGSVGFRRGAC 468


>gi|108707839|gb|ABF95634.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
          Length = 330

 Score =  141 bits (356), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 109/312 (34%), Positives = 150/312 (48%), Gaps = 29/312 (9%)

Query: 199 PIFDPKTSSSYSPLPCAAPQCKSLDVSACRANR------CLYQVAYGDGSFTVGDLVTET 252
           P FD  TSS+     C +  C+ L V++C   +      C+Y   Y D S T G +  + 
Sbjct: 23  PYFDRSTSSTLLLTSCDSTLCQGLLVASCGNTKFWPNQTCVYTYYYNDKSVTTGLIEVDK 82

Query: 253 VSFGNSGSVKGIALGCGHDNEGLFVGS-AGLLGLGGGMLSLTKQIKATSLAYCL------ 305
            +FG   SV G+A GCG  N G+F  +  G+ G G G LSL  Q+K  + ++C       
Sbjct: 83  FTFGAGASVPGVAFGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKVGNFSHCFTAVNGL 142

Query: 306 ----VDRDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSL 361
               V  D PA     + + RG    T PLI+N    TFYY+ L G +VG   + +P S 
Sbjct: 143 KQSTVLLDLPAD---LYKNGRGAVQST-PLIQNSANPTFYYLSLKGITVGSTRLPVPESA 198

Query: 362 FEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSF-VRLAGNLKPTSGVALFDTCYDFSGL 420
           F +   G GG I+D GT+IT L  Q Y  +RD F  ++   + P +    + TC+     
Sbjct: 199 FALTN-GTGGTIIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGNATGPY-TCFSAPSQ 256

Query: 421 RSVRVPTVSLHFGAGKALDLPAKNYLIPV-DSAGT--FCFAFAPTSSALSIIGNVQQQGT 477
               VP + LHF  G  +DLP +NY+  V D AG    C A        +IIGN QQQ  
Sbjct: 257 AKPDVPKLVLHF-EGATMDLPRENYVFEVPDDAGNSIICLAIN-KGDETTIIGNFQQQNM 314

Query: 478 RVSFDLANNRVG 489
            V +DL N   G
Sbjct: 315 HVLYDLQNMHRG 326


>gi|356503843|ref|XP_003520712.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 474

 Score =  141 bits (355), Expect = 9e-31,   Method: Compositional matrix adjust.
 Identities = 115/389 (29%), Positives = 168/389 (43%), Gaps = 58/389 (14%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRP---CTEC-YQQSD----PIFDPKTSSS 208
           G Y   + +GTPP+    VLDTGS + W  C     C+ C +   D    P F PK SS+
Sbjct: 90  GGYSIDLNLGTPPQTSPFVLDTGSSLVWFPCTSRYLCSHCNFPNIDTTKIPTFIPKNSST 149

Query: 209 YSPLPCAAPQCKSL--------------DVSACRANRCLYQVAYGDGSFTVGDLVTETVS 254
              L C  P+C  +              +   C      Y + YG GS T G L+ + ++
Sbjct: 150 AKLLGCRNPKCGYIFGSDVQFRCPQCKPESQNCSLTCPAYIIQYGLGS-TAGFLLLDNLN 208

Query: 255 FGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDR---DSP 311
           F    +V    +GC   +       +G+ G G G  SL  Q+     +YCLV     D+P
Sbjct: 209 FPGK-TVPQFLVGCSILS---IRQPSGIAGFGRGQESLPSQMNLKRFSYCLVSHRFDDTP 264

Query: 312 ASGVLEFNSARGGDAVTA----------PLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSL 361
            S  L    +  GD  T           P   N     +YY+ L    VGG+ V+IP + 
Sbjct: 265 QSSDLVLQISSTGDTKTNGLSYTPFRSNPSTNNPAFKEYYYLTLRKVIVGGKDVKIPYTF 324

Query: 362 FEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVR-------LAGNLKPTSGVALFDTC 414
            E    G+GG IVD G+  T ++   YN +   FV+        A + +  SG++    C
Sbjct: 325 LEPGSDGNGGTIVDSGSTFTFMERPVYNLVAQEFVKQLEKNYSRAEDAETQSGLS---PC 381

Query: 415 YDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCF-------AFAPTSSALS 467
           ++ SG+++V  P ++  F  G  +  P +NY   V  A   C        A  P ++  +
Sbjct: 382 FNISGVKTVTFPELTFKFKGGAKMTQPLQNYFSLVGDAEVVCLTVVSDGGAGPPKTTGPA 441

Query: 468 II-GNVQQQGTRVSFDLANNRVGFTPNKC 495
           II GN QQQ   + +DL N R GF P  C
Sbjct: 442 IILGNYQQQNFYIEYDLENERFGFGPRSC 470


>gi|212274314|ref|NP_001130524.1| uncharacterized protein LOC100191623 [Zea mays]
 gi|194689376|gb|ACF78772.1| unknown [Zea mays]
 gi|224031455|gb|ACN34803.1| unknown [Zea mays]
 gi|238011528|gb|ACR36799.1| unknown [Zea mays]
 gi|238015454|gb|ACR38762.1| unknown [Zea mays]
          Length = 304

 Score =  141 bits (355), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 103/305 (33%), Positives = 141/305 (46%), Gaps = 22/305 (7%)

Query: 212 LPCAAPQCKSLDVSAC-RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKG------I 264
           + CA   C  +   +C R + C Y+  YGDG+ TVG   TE  +F +SG          +
Sbjct: 1   MRCAGTLCSDILHHSCERPDTCTYRYNYGDGTMTVGVYATERFTFASSGGGGLTTTTVPL 60

Query: 265 ALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGG 324
             GCG  N G     +G++G G   LSL  Q+     +YCL    S     L F S   G
Sbjct: 61  GFGCGSVNVGSLNNGSGIVGFGRNPLSLVSQLSIRRFSYCLTSYASRRQSTLLFGSLSDG 120

Query: 325 ---DAV----TAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCG 377
              DA     T PL+++ +  TFYYV  TG +VG + ++IP S F +   G GG+IVD G
Sbjct: 121 VYGDATGRVQTTPLLQSPQNPTFYYVHFTGLTVGARRLRIPESAFALRPDGSGGVIVDSG 180

Query: 378 TAITRLQTQAYNSLRDSF---VRL--AGNLKPTSGVALF--DTCYDFSGLRSVRVPTVSL 430
           TA+T L       +  +F   +RL  A    P  GV           S    + VP + L
Sbjct: 181 TALTLLPAAVLAEVVRAFRQQLRLPFANGGNPEDGVCFLVPAAWRRSSSTSQMPVPRMVL 240

Query: 431 HFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGF 490
           HF  G  LDLP +NY++     G  C   A +    S IGN+ QQ  RV +DL    +  
Sbjct: 241 HF-QGADLDLPRRNYVLDDHRRGRLCLLLADSGDDGSTIGNLVQQDMRVLYDLEAETLSI 299

Query: 491 TPNKC 495
            P +C
Sbjct: 300 APARC 304


>gi|84453222|dbj|BAE71208.1| hypothetical protein [Trifolium pratense]
 gi|84453226|dbj|BAE71210.1| hypothetical protein [Trifolium pratense]
          Length = 437

 Score =  140 bits (354), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 117/369 (31%), Positives = 166/369 (44%), Gaps = 34/369 (9%)

Query: 145 STPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPK 204
           S P+ SG +   G Y  R+ +GTP +   MVLDT +D  ++    C  C   S   F P 
Sbjct: 84  SAPIASGQTFNIGNYVVRVKIGTPGQLLFMVLDTSTDEAFVPSSGCIGC---SATTFYPN 140

Query: 205 TSSSYSPLPCAAPQC---KSLDVSACRANRCLYQVAYGDGSFT---VGD---LVTETV-- 253
            S+S+ PL C+ PQC   + L   A  +  C +  +Y   +F+   V D   L T+ +  
Sbjct: 141 VSTSFVPLDCSVPQCGQVRGLSCPATGSGACSFNQSYAGSTFSATLVQDSLRLATDVIPS 200

Query: 254 -SFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPA 312
            SFG+  ++ G ++             +        +LS +  I +   +YCL    S  
Sbjct: 201 YSFGSINAISGSSVPAQGLLGLGRGPLS--------LLSQSGAIYSGVFSYCLPSFKSYY 252

Query: 313 -SGVLEFNSARGGDAV-TAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDG 370
            SG L+        ++ T PL+ N    + YYV LT  SVG   V +P  L   + +   
Sbjct: 253 FSGSLKLGPVGQPKSIRTTPLLHNPHRPSLYYVNLTAISVGRVYVPLPSELLAFNPSTGA 312

Query: 371 GIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSL 430
           G I+D GT ITR     YN++RD F +      P S +  FDTC  F        P ++L
Sbjct: 313 GTIIDSGTVITRFVEPIYNAVRDEFRKQVTG--PFSSLGAFDTC--FVKNYETLAPAITL 368

Query: 431 HFGAGKALDLPAKNYLIPVDSAGTFCFAFAP----TSSALSIIGNVQQQGTRVSFDLANN 486
           HF     L LP +N LI   S    C A A      +S L++I N QQQ  RV FD  NN
Sbjct: 369 HF-TDLDLKLPLENSLIHSSSGSLACLAMAAAPSNVNSVLNVIANFQQQNLRVLFDTVNN 427

Query: 487 RVGFTPNKC 495
           +VG     C
Sbjct: 428 KVGIARELC 436


>gi|255545932|ref|XP_002514026.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223547112|gb|EEF48609.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 437

 Score =  140 bits (354), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 125/352 (35%), Positives = 171/352 (48%), Gaps = 23/352 (6%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAA 216
           G Y  R+ +GTP +   MVLDT +D  W+ C  CT C   +       TSS+Y  L C+ 
Sbjct: 95  GNYVVRVKLGTPGQFMFMVLDTSNDAAWVPCSGCTGCSSTTF---STNTSSTYGSLDCSM 151

Query: 217 PQCKSLDVSACRA---NRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNE 273
            QC  +   +C A   + C++  +YG  S     LV +++   N   +   A GC +   
Sbjct: 152 AQCTQVRGFSCPATGSSSCVFNQSYGGDSSFSATLVEDSLRLVND-VIPNFAFGCINSIS 210

Query: 274 GLFVGSAGLLGLGGGMLSLTKQ---IKATSLAYCLVDRDSPA-SGVLEFNSARGGDAVT- 328
           G  V   GLLGLG G LSL  Q   + +   +YCL    S   SG L+   A    ++  
Sbjct: 211 GGSVPPQGLLGLGRGPLSLIAQSGSLYSGLFSYCLPSFKSYYFSGSLKLGPAGQPKSIRY 270

Query: 329 APLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAY 388
            PL+RN    + YYV LTG SVG   V I P L   +     G I+D GT ITR     Y
Sbjct: 271 TPLLRNPHRPSLYYVNLTGVSVGRTLVPIAPELLAFNPNTGAGTIIDSGTVITRFVQPIY 330

Query: 389 NSLRDSFVR-LAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLI 447
            ++RD F + +AG   P S +  FDTC  F+       P V+LHF  G  L LP +N LI
Sbjct: 331 TAIRDEFRKQVAG---PFSSLGAFDTC--FAATNEAVAPAVTLHF-TGLNLVLPMENSLI 384

Query: 448 PVDSAGTFCFAFAP----TSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
              +    C A A      +S L++I N+QQQ  R+ FD+ N+R+G     C
Sbjct: 385 HSSAGSLACLAMAAAPNNVNSVLNVIANLQQQNLRLLFDVPNSRLGIARELC 436


>gi|449522369|ref|XP_004168199.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-2-like [Cucumis sativus]
          Length = 457

 Score =  140 bits (353), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 119/410 (29%), Positives = 172/410 (41%), Gaps = 55/410 (13%)

Query: 130 HELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRP 189
           H++K  ++  +   F +P+   +    G Y + +  GTP +   ++ DTGS + W  C  
Sbjct: 58  HQIKTPKSNSV---FKSPL---SPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTS 111

Query: 190 ---CTEC-YQQSDPI----FDPKTSSSYSPLPCAAPQCKSL---DV-SACRA-----NRC 232
              C+EC + + DP     F PK SSS   + C  P+C  +   DV S CR+       C
Sbjct: 112 RYLCSECSFPKIDPTGIPRFVPKLSSSSKLVGCQNPKCSWIFGPDVKSQCRSCNPKTENC 171

Query: 233 L-----YQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGG 287
                 Y V YG GS T G L++ET+ F +   +    +GC   +       +G+ G G 
Sbjct: 172 TQTCPAYVVQYGSGS-TAGLLLSETLDFPDK-XIPNFVVGCSFLS---IHQPSGIAGFGR 226

Query: 288 GMLSLTKQIKATSLAYCLVDR---DSPASGVLEFNSA---RGGDAVTA----PLIRNKKV 337
           G  SL  Q+     AYCL  R   DSP SG L  +S      G   T     P + N   
Sbjct: 227 GSESLPSQMGLKKFAYCLASRKFDDSPHSGQLILDSTGVKSSGLTYTPFRQNPSVSNNAY 286

Query: 338 DTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVR 397
             +YY+ +    VG QAV++P         G+GG I+D G+  T +       +   F +
Sbjct: 287 KEYYYLNIRKIIVGNQAVKVPYKFLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVAREFEK 346

Query: 398 LAGNLKPTSGVALFD---TCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGT 454
              N    + V        C+D S  +SV+ P +   F  G    LP  NY   V S+G 
Sbjct: 347 QLANWTRATDVETLTGLRPCFDISKEKSVKFPELIFQFKGGAKWALPLNNYFALVSSSGV 406

Query: 455 FCFAFA---------PTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            C                    I+G  QQQ   V +DL N R+GF    C
Sbjct: 407 ACLTVVTHQMEDGGGGGGGPSVILGAFQQQNFYVEYDLVNQRLGFRQQTC 456


>gi|449437856|ref|XP_004136706.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
          Length = 457

 Score =  140 bits (353), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 119/410 (29%), Positives = 172/410 (41%), Gaps = 55/410 (13%)

Query: 130 HELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRP 189
           H++K  ++  +   F +P+   +    G Y + +  GTP +   ++ DTGS + W  C  
Sbjct: 58  HQIKTPKSNSV---FKSPL---SPHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTS 111

Query: 190 ---CTEC-YQQSDPI----FDPKTSSSYSPLPCAAPQCKSL---DV-SACRA-----NRC 232
              C+EC + + DP     F PK SSS   + C  P+C  +   DV S CR+       C
Sbjct: 112 RYLCSECSFPKIDPTGIPRFVPKLSSSSKLVGCQNPKCSWIFGPDVKSQCRSCNPKTENC 171

Query: 233 L-----YQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGG 287
                 Y V YG GS T G L++ET+ F +   +    +GC   +       +G+ G G 
Sbjct: 172 TQTCPAYVVQYGSGS-TAGLLLSETLDFPDK-KIPNFVVGCSFLS---IHQPSGIAGFGR 226

Query: 288 GMLSLTKQIKATSLAYCLVDR---DSPASGVLEFNSA---RGGDAVTA----PLIRNKKV 337
           G  SL  Q+     AYCL  R   DSP SG L  +S      G   T     P + N   
Sbjct: 227 GSESLPSQMGLKKFAYCLASRKFDDSPHSGQLILDSTGVKSSGLTYTPFRQNPSVSNNAY 286

Query: 338 DTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVR 397
             +YY+ +    VG QAV++P         G+GG I+D G+  T +       +   F +
Sbjct: 287 KEYYYLNIRKIIVGNQAVKVPYKFLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVAREFEK 346

Query: 398 LAGNLKPTSGVALFD---TCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGT 454
              N    + V        C+D S  +SV+ P +   F  G    LP  NY   V S+G 
Sbjct: 347 QLANWTRATDVETLTGLRPCFDISKEKSVKFPELIFQFKGGAKWALPLNNYFALVSSSGV 406

Query: 455 FCFAFA---------PTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            C                    I+G  QQQ   V +DL N R+GF    C
Sbjct: 407 ACLTVVTHQMEDGGGGGGGPSVILGAFQQQNFYVEYDLVNQRLGFRQQTC 456


>gi|356500756|ref|XP_003519197.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 451

 Score =  140 bits (353), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 129/429 (30%), Positives = 197/429 (45%), Gaps = 39/429 (9%)

Query: 82  LPLHSREILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILP 141
           +P++      K     + ++++    +D  RV   ++ L  ++        KP  A    
Sbjct: 46  IPIYGNCSPFKNYSTSWENIIIDMASKDPERV-VYLSSLDASL------RRKPISA---- 94

Query: 142 EDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIF 201
                P+ SG + G G Y  R+ +G+P + F MVLDT +D  W+ C  CT C   S   +
Sbjct: 95  ----APIASGQAFGIGSYVVRVKLGSPNQLFFMVLDTSTDEAWVPCTGCTGC-SSSSTYY 149

Query: 202 DPKTSSSY-SPLPCAAPQCK----SLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFG 256
            P+ S++Y   + C AP+C     +L      +  C +  +Y   +F+   LV +++  G
Sbjct: 150 SPQASTTYGGAVACYAPRCAQARGALPCPYTGSKACTFNQSYAGSTFS-ATLVQDSLRLG 208

Query: 257 NSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGML---SLTKQIKATSLAYCLVD-RDSPA 312
              ++   A GC +   G  + + GLLGLG G L   S + ++ +   +YCL   + S  
Sbjct: 209 ID-TLPSYAFGCVNSASGWTLPAQGLLGLGRGPLSLPSQSSKLYSGIFSYCLPSFQSSYF 267

Query: 313 SGVLEFN-SARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGG 371
           SG L+   + +     T PL++N +  + YYV LTG +VG   V +P      D     G
Sbjct: 268 SGSLKLGPTGQPRRIRTTPLLQNPRRPSLYYVNLTGVTVGRVKVPLPIEYLAFDPNKGSG 327

Query: 372 IIVDCGTAITRLQTQAYNSLRDSFV-RLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSL 430
            I+D GT ITR     Y+++RD F  ++ G      G   FDTC  F        P + L
Sbjct: 328 TILDSGTVITRFVGPVYSAIRDEFRNQVKGPFFSRGG---FDTC--FVKTYENLTPLIKL 382

Query: 431 HFGAGKALDLPAKNYLIPVDSAGTFCFAFAP----TSSALSIIGNVQQQGTRVSFDLANN 486
            F  G  + LP +N LI     G  C A A      +S L++I N QQQ  RV FD  NN
Sbjct: 383 RF-TGLDVTLPYENTLIHTAYGGMACLAMAAAPNNVNSVLNVIANYQQQNLRVLFDTVNN 441

Query: 487 RVGFTPNKC 495
           RVG     C
Sbjct: 442 RVGIARELC 450


>gi|356537173|ref|XP_003537104.1| PREDICTED: uncharacterized protein LOC100817302 [Glycine max]
          Length = 328

 Score =  140 bits (353), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 67/141 (47%), Positives = 92/141 (65%)

Query: 355 VQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTC 414
           + I   L+ + + GD G ++D G  +TRL T AY + RD+FV    NL    GV++F+TC
Sbjct: 188 LNISEDLYRVTDLGDEGAVMDTGITVTRLPTVAYGAFRDAFVAQTTNLPRAPGVSIFNTC 247

Query: 415 YDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQ 474
           YD +G  +VRVPTV  +F  G+ L +  +N+LIP D  GTF FAFA + SALSIIGN+QQ
Sbjct: 248 YDLNGFVTVRVPTVLFYFSGGQILTILTQNFLIPADDVGTFYFAFAASPSALSIIGNIQQ 307

Query: 475 QGTRVSFDLANNRVGFTPNKC 495
           +G ++S D AN  +GF  N C
Sbjct: 308 EGIQISVDGANGFLGFGRNVC 328


>gi|297827577|ref|XP_002881671.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297327510|gb|EFH57930.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 438

 Score =  140 bits (353), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 116/375 (30%), Positives = 176/375 (46%), Gaps = 59/375 (15%)

Query: 163 IGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKS- 221
           + VG+PP+  SMVLDTGS+++WL C+           +F+P +SS+YSP+PC++P C++ 
Sbjct: 65  LAVGSPPQNISMVLDTGSELSWLHCKKSPNL----GSVFNPVSSSTYSPVPCSSPICRTR 120

Query: 222 ---LDVSAC---RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGC-----GH 270
              L + A    + + C   ++Y D +   G+L  +T   G S +  G   GC       
Sbjct: 121 TRDLPIPASCDPKTHFCHVAISYADATSIEGNLAHDTFVIG-SVTRPGTLFGCMDSGLSS 179

Query: 271 DNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSAR----GGDA 326
           D+E     S GL+G+  G LS   Q+  +  +YC+   DS  SG+L    A     G   
Sbjct: 180 DSEE-DAKSTGLMGMNRGSLSFVNQLGFSKFSYCISGSDS--SGILLLGDASYSWLGPIQ 236

Query: 327 VTAPLIRNKKVDTF----YYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITR 382
            T  +++   +  F    Y V L G  VG + + +P S+F  D  G G  +VD GT  T 
Sbjct: 237 YTPLVLQTTPLPYFDRVAYTVQLEGIRVGSKILSLPKSVFVPDHTGAGQTMVDSGTQFTF 296

Query: 383 LQTQAYNSLRDSFVRLAGNLKPTSGVALF------DTCY--------DFSGLRSVRVPTV 428
           L    Y +L++ F+    ++        F      D CY        +F+GL     P +
Sbjct: 297 LMGPVYTALKNEFIAQTKSVLRIVDDPNFVFQGTMDLCYRVGSSTRPNFTGL-----PVI 351

Query: 429 SLHFGAGKALDLPAKNYLIPVDSAGT------FCFAFAPTSSALSI----IGNVQQQGTR 478
           SL F  G  + +  +  L  V+ AG+      +CF F   S  L I    IG+  QQ   
Sbjct: 352 SLMF-RGAEMSVSGQKLLYRVNGAGSEGKEEVYCFTFG-NSDLLGIEAFVIGHHHQQNVW 409

Query: 479 VSFDLANNRVGFTPN 493
           + FDLA +RVGF  N
Sbjct: 410 MEFDLAKSRVGFAGN 424


>gi|26451756|dbj|BAC42973.1| unknown protein [Arabidopsis thaliana]
          Length = 442

 Score =  140 bits (352), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 116/372 (31%), Positives = 172/372 (46%), Gaps = 53/372 (14%)

Query: 163 IGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKS- 221
           + VG PP+  SMVLDTGS+++WL C+           +F+P +SS+YSP+PC++P C++ 
Sbjct: 69  LAVGDPPQNISMVLDTGSELSWLHCKKSPNL----GSVFNPVSSSTYSPVPCSSPICRTR 124

Query: 222 ---LDVSAC---RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHD---- 271
              L + A    + + C   ++Y D +   G+L  ET   G S +  G   GC       
Sbjct: 125 TRDLPIPASCDPKTHLCHVAISYADATSIEGNLAHETFVIG-SVTRPGTLFGCMDSGLSS 183

Query: 272 NEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVL--EFNSARGGDAVTA 329
           N      S GL+G+  G LS   Q+  +  +YC+   DS    +L     S  G    T 
Sbjct: 184 NSEEDAKSTGLMGMNRGSLSFVNQLGFSKFSYCISGSDSSVFLLLGDASYSWLGPIQYTP 243

Query: 330 PLIRNKKVDTF----YYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQT 385
            ++++  +  F    Y V L G  VG + + +P S+F  D  G G  +VD GT  T L  
Sbjct: 244 LVLQSTPLPYFDRVAYTVQLEGIRVGSKILSLPKSVFVPDHTGAGQTMVDSGTQFTFLMG 303

Query: 386 QAYNSLRDSFVRLAGNLKPTSGVALF------DTCY--------DFSGLRSVRVPTVSLH 431
             Y +L++ F+    ++        F      D CY        +FSGL     P VSL 
Sbjct: 304 PVYTALKNEFITQTKSVLRLVDDPDFVFQGTMDLCYKVGSTTRPNFSGL-----PMVSLM 358

Query: 432 FGAGKALDLPAKNYLIPVDSAGT------FCFAFAPTSSALSI----IGNVQQQGTRVSF 481
           F  G  + +  +  L  V+ AG+      +CF F   S  L I    IG+  QQ   + F
Sbjct: 359 F-RGAEMSVSGQKLLYRVNGAGSEGKEEVYCFTFG-NSDLLGIEAFVIGHHHQQNVWMEF 416

Query: 482 DLANNRVGFTPN 493
           DLA +RVGF  N
Sbjct: 417 DLAKSRVGFAGN 428


>gi|115434870|ref|NP_001042193.1| Os01g0178600 [Oryza sativa Japonica Group]
 gi|55296112|dbj|BAD67831.1| putative CDR1 [Oryza sativa Japonica Group]
 gi|55296252|dbj|BAD67993.1| putative CDR1 [Oryza sativa Japonica Group]
 gi|113531724|dbj|BAF04107.1| Os01g0178600 [Oryza sativa Japonica Group]
 gi|125569253|gb|EAZ10768.1| hypothetical protein OsJ_00604 [Oryza sativa Japonica Group]
          Length = 454

 Score =  140 bits (352), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 124/374 (33%), Positives = 166/374 (44%), Gaps = 39/374 (10%)

Query: 148 VVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPC---TECYQQSDPIFDPK 204
           VVS     S EY   + +G+PPR    + DTGSD+ W++C+     T         FDP 
Sbjct: 90  VVSKVVSRSFEYLMTVNLGSPPRSMLAIADTGSDLVWVKCKKGNNDTSSAAAPTTQFDPS 149

Query: 205 TSSSYSPLPCAAPQCKSLDVSAC-RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGS--- 260
            SS+Y  + C    C++L  + C   + C Y  AYGDGS T G L TET +F + GS   
Sbjct: 150 RSSTYGRVSCQTDACEALGRATCDDGSNCAYLYAYGDGSNTTGVLSTETFTFDDGGSGRS 209

Query: 261 -----VKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSL----AYCLVDRDSP 311
                V G+  GC     G F     +   GG +  +T+   ATSL    +YCLV     
Sbjct: 210 PRQVRVGGVKFGCSTATAGSFPADGLVGLGGGAVSLVTQLGGATSLGRRFSYCLVPHSVN 269

Query: 312 ASGVLEFNS---ARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAG 368
           AS  L F +        A + PL+    VDT+Y V L    VG + V           A 
Sbjct: 270 ASSALNFGALADVTEPGAASTPLVAG-DVDTYYTVVLDSVKVGNKTVA---------SAA 319

Query: 369 DGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPT-SGVALFDTCYDFSGLRSVR--- 424
              IIVD GT +T L       + D   R    L P  S   L   CY+ +G R V    
Sbjct: 320 SSRIIVDSGTTLTFLDPSLLGPIVDELSRRI-TLPPVQSPDGLLQLCYNVAG-REVEAGE 377

Query: 425 -VPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSA--LSIIGNVQQQGTRVSF 481
            +P ++L FG G A+ L  +N  + V   GT C A   T+    +SI+GN+ QQ   V +
Sbjct: 378 SIPDLTLEFGGGAAVALKPENAFVAVQE-GTLCLAIVATTEQQPVSILGNLAQQNIHVGY 436

Query: 482 DLANNRVGFTPNKC 495
           DL    V F    C
Sbjct: 437 DLDAGTVTFAGADC 450


>gi|224136884|ref|XP_002326969.1| predicted protein [Populus trichocarpa]
 gi|222835284|gb|EEE73719.1| predicted protein [Populus trichocarpa]
          Length = 626

 Score =  140 bits (352), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 97/357 (27%), Positives = 171/357 (47%), Gaps = 28/357 (7%)

Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCA 215
           +G Y +R+ +GTPP++F++++DTGS + ++ C  C +C +  DP F P  SS+Y P+ C 
Sbjct: 74  NGYYTTRLFIGTPPQEFALIVDTGSTVTYVPCSSCEQCGKHQDPRFQPDLSSTYRPVKC- 132

Query: 216 APQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVK--GIALGCGHDNE 273
            P C   D       +C Y+  Y + S + G +  + VSFGN   +K      GC +   
Sbjct: 133 NPSCNCDD----EGKQCTYERRYAEMSSSSGVIAEDVVSFGNESELKPQRAVFGCENVET 188

Query: 274 G-LFVGSA-GLLGLGGGMLSLTKQ-----IKATSLAYCLVDRDSPASGVLEFNSARGGDA 326
           G L+   A G++GLG G LS+  Q     +   S + C    D     ++    +   + 
Sbjct: 189 GDLYSQRADGIMGLGRGRLSVVDQLVDKGVIGDSFSLCYGGMDVGGGAMVLGQISPPPNM 248

Query: 327 VTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQ 386
           V +    N     +Y + L    V G+ +++ P +F+       G ++D GT        
Sbjct: 249 VFSH--SNPYRSPYYNIELKELHVAGKPLKLKPKVFDEKH----GTVLDSGTTYAYFPEA 302

Query: 387 AYNSLRDSFVRLAGNLK--PTSGVALFDTCYDFSGLR----SVRVPTVSLHFGAGKALDL 440
           A+++L+D+ ++   +LK  P       D C+  +G      S   P V++ FG+G+ L L
Sbjct: 303 AFHALKDAIMKEIRHLKQIPGPDPNYHDICFSGAGREVSHLSKVFPEVNMVFGSGQKLSL 362

Query: 441 PAKNYLI-PVDSAGTFCFA-FAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
             +NYL      +G +C   F   +   +++G +  + T V++D  N+++GF    C
Sbjct: 363 SPENYLFRHTKVSGAYCLGIFQNGNDLTTLLGGIVVRNTLVTYDRENDKIGFWKTNC 419


>gi|242041951|ref|XP_002468370.1| hypothetical protein SORBIDRAFT_01g044790 [Sorghum bicolor]
 gi|241922224|gb|EER95368.1| hypothetical protein SORBIDRAFT_01g044790 [Sorghum bicolor]
          Length = 408

 Score =  140 bits (352), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 116/357 (32%), Positives = 157/357 (43%), Gaps = 22/357 (6%)

Query: 145 STPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPK 204
           S PV SG  Q    Y  R G+GTP +Q  + LDT +D  W  C PC  C   S   F P 
Sbjct: 67  SAPVASG--QTPPSYVVRAGLGTPVQQLLLALDTSADATWSHCAPCDTCPAGSR--FIPA 122

Query: 205 TSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGI 264
           +SSSY+ LPCA+  C      A              G+     L+        SG +   
Sbjct: 123 SSSSYASLPCASDWCPLFRRPAVPGEPGRV------GAAADVRLLQAASRTPRSGVLA-- 174

Query: 265 ALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVD-RDSPASGVLEFNSA-R 322
           A  CG          +G + L    LS T        +YCL   R    SG L   +A +
Sbjct: 175 ATRCGWARTPSPATRSGPMSL----LSQTGSRYNGVFSYCLPSYRSYYFSGSLRLGAAGQ 230

Query: 323 GGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITR 382
             +    PL+ N    + YYV +TG SVG   V+ P   F  D +   G ++D GT ITR
Sbjct: 231 PRNVRYTPLLTNPHRPSLYYVNVTGLSVGRALVKAPAGSFAFDPSTGAGTVIDSGTVITR 290

Query: 383 LQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPA 442
                Y +LRD F R        + +  FDTC++   + +   P V+LH G G  L LP 
Sbjct: 291 WTAPVYAALRDEFRRQVAAPSGYTSLGAFDTCFNTDEVAAGGAPPVTLHMGGGVDLTLPM 350

Query: 443 KNYLIPVDSAGTFCFAFAPT----SSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           +N LI   +    C A A      +S ++++ N+QQQ  RV  D+A +RVGF    C
Sbjct: 351 ENTLIHSSATPLACLAMAEAPQNVNSVVNVVANLQQQNVRVVVDVAGSRVGFAREPC 407


>gi|359476191|ref|XP_003631801.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 439

 Score =  139 bits (351), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 118/406 (29%), Positives = 179/406 (44%), Gaps = 78/406 (19%)

Query: 108 RDSARVNTLITKL-QLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVG 166
           RD +RV+ + +K  Q A  N+  H       ++  ED             G +   +  G
Sbjct: 92  RDESRVSFINSKFNQYAPENLKDHT---PNNKLFDED-------------GNFLVDVAFG 135

Query: 167 TPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSA 226
           TPP+ F+++LDTGS I W QC+ CT                                   
Sbjct: 136 TPPQNFTLILDTGSSITWTQCKACT----------------------------------- 160

Query: 227 CRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLF-VGSAGLLGL 285
              N   Y + YGD S +VG+   +T++   S   +    G G +N+G F  G  G+LGL
Sbjct: 161 VENN---YNMTYGDDSTSVGNYGCDTMTLEPSDVFQKFQFGRGRNNKGDFGSGVDGMLGL 217

Query: 286 GGGMLSLTKQIKAT---SLAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNK----KVD 338
           G G LS   Q  +      +YCL + DS  S +    +     ++    + N     +  
Sbjct: 218 GQGQLSTVSQTASKFNKVFSYCLPEEDSIGSLLFGEKATSQSSSLKFTSLVNGPGTLQES 277

Query: 339 TFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRL 398
            +Y+V L+  SVG + + IP S+F        G I+D  T ITRL  +AY++L+ +F + 
Sbjct: 278 GYYFVNLSDISVGNERLNIPSSVFASP-----GTIIDSRTVITRLPQRAYSALKAAFKKA 332

Query: 399 AGNLKPTSGVA----LFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGT 454
                 ++G      + DTCY+ SG + V +P + LHFG G  + L   N +   D +  
Sbjct: 333 MAKYPLSNGRRKKGDILDTCYNLSGRKDVLLPEIVLHFGGGADVRLNGTNIVWGSDES-R 391

Query: 455 FCFAFAPTSSA-----LSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            C AFA  S +     L+IIGN QQ    V +D+   R+GF  N C
Sbjct: 392 LCLAFAGNSKSTMNPELTIIGNRQQLSLTVLYDIQGGRIGFRSNGC 437


>gi|51038078|gb|AAT93881.1| hypothetical protein [Oryza sativa Japonica Group]
          Length = 481

 Score =  139 bits (351), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 122/408 (29%), Positives = 177/408 (43%), Gaps = 70/408 (17%)

Query: 155 GSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPC----------TECYQQSDPIFDPK 204
           G  +Y +  G+G PP+    V+DTGSD+ W QC  C            C+ Q+ P ++  
Sbjct: 74  GKTQYIASYGIGDPPQPAEAVVDTGSDLVWTQCSTCRLPAAAAAGGGGCFPQNLPYYNFS 133

Query: 205 TSSSYSPLPC---------AAPQCKSLDVSACRA-NRCLYQVAYGDGSFTVGDLVTETVS 254
            S +   +PC          AP+            + C+   +YG G   +G L T+  +
Sbjct: 134 LSRTARAVPCDDDDGALCGVAPETAGCARGGGSGDDACVVAASYGAG-VALGVLGTDAFT 192

Query: 255 FGNSGSVKGIALGCGHDNE---GLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVD--RD 309
           F +S SV  +A GC        G   G++G++GLG G LSL  Q+ AT  +YCL    RD
Sbjct: 193 FPSSSSVT-LAFGCVSQTRISPGALNGASGIIGLGRGALSLVSQLNATEFSYCLTPYFRD 251

Query: 310 SPASGVLEFNSARGGD--------------AVTAPLIRNKK---VDTFYYVGLTGFSVGG 352
           + +   L                         T P  +N K     TFYY+ L G + G 
Sbjct: 252 TVSPSHLFVGDGELAGLSAAAGGGGGGGAPVTTVPFAKNPKDSPFSTFYYLPLVGLAAGN 311

Query: 353 QAVQIPPSLFEMDEAGD----GGIIVDCGTAITRLQTQAYNSLRDSFVRL---AGNLKPT 405
             V +P   F++ EA      GG ++D G+  TRL   A+ +L     R    +G+L P 
Sbjct: 312 ATVALPAGAFDLREAAPKVWAGGALIDSGSPFTRLVDPAHRALTKELARQLRGSGSLVPP 371

Query: 406 S---GVAL---FDTCYDFSGLRSVRVPTVSLHF----GAGKALDLPAKNYLIPVDSAGTF 455
               G AL    +   D   L +  VP + L F    G G+ L +PA+ Y   V+ A T+
Sbjct: 372 PAKLGGALELCVEAGDDGDSLAAAAVPPLVLRFDDGVGGGRELVIPAEKYWARVE-ASTW 430

Query: 456 CFAFAPTSSA--------LSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           C A   ++S          +IIGN  QQ  RV +DLAN  + F P  C
Sbjct: 431 CMAVVSSASGNATLPTNETTIIGNFMQQDMRVLYDLANGLLSFQPANC 478


>gi|449440014|ref|XP_004137780.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
 gi|449483406|ref|XP_004156582.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
          Length = 449

 Score =  139 bits (351), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 116/369 (31%), Positives = 176/369 (47%), Gaps = 43/369 (11%)

Query: 163 IGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQC--K 220
           + VGTPP+  +MV+DTGS+++WL C   ++    S   F+P  SSSYSP+PC++  C  +
Sbjct: 77  LTVGTPPQNVTMVIDTGSELSWLHCN-TSQNSSSSSSTFNPVWSSSYSPIPCSSSTCTDQ 135

Query: 221 SLDVS---ACRANR-CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHD----N 272
           + D     +C +N+ C   ++Y D S + G+L T+T   G+SG +  +  GC       N
Sbjct: 136 TRDFPIRPSCDSNQFCHATLSYADASSSEGNLATDTFYIGSSG-IPNVVFGCMDSIFSSN 194

Query: 273 EGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGDAVTAPLI 332
                 + GL+G+  G LS   Q+     +YC+ + D   SG+L    A    +  APL 
Sbjct: 195 SEEDSKNTGLMGMNRGSLSFVSQMGFPKFSYCISEYD--FSGLLLLGDANF--SWLAPLN 250

Query: 333 RNKKVD----------TFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITR 382
               ++            Y V L G  V  + + IP S+FE D  G G  +VD GT  T 
Sbjct: 251 YTPLIEMSTPLPYFDRVAYTVQLEGIKVAHKLLPIPESVFEPDHTGAGQTMVDSGTQFTF 310

Query: 383 LQTQAYNSLRDSFV-RLAGNLKPTSGVAL-----FDTCYDF--SGLRSVRVPTVSLHF-G 433
           L   AY +LRD F+ + AG+L+             D CY    +  R   +P+V+L F G
Sbjct: 311 LLGPAYTALRDHFLNKTAGSLRVYEDSNFVFQGAMDLCYRVPTNQTRLPPLPSVTLVFRG 370

Query: 434 AGKALDLPAKNYLIPVDSAGT---FCFAFAPTSSALS----IIGNVQQQGTRVSFDLANN 486
           A   +      Y +P +  G     CF F   S  L     +IG++ QQ   + FDL  +
Sbjct: 371 AEMTVTGDRILYRVPGERRGNDSIHCFTFG-NSDLLGVEAFVIGHLHQQNVWMEFDLKKS 429

Query: 487 RVGFTPNKC 495
           R+G    +C
Sbjct: 430 RIGLAEIRC 438


>gi|242059211|ref|XP_002458751.1| hypothetical protein SORBIDRAFT_03g039590 [Sorghum bicolor]
 gi|241930726|gb|EES03871.1| hypothetical protein SORBIDRAFT_03g039590 [Sorghum bicolor]
          Length = 444

 Score =  139 bits (351), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 116/372 (31%), Positives = 177/372 (47%), Gaps = 44/372 (11%)

Query: 163 IGVGTPPRQFSMVLDTGSDINWLQC---RPCTECYQQSDPI---FDPKTSSSYSPLPCAA 216
           + VGTPP+  +MVLDTGS+++WL C   R  +     +  +   F P+ S++++ +PC +
Sbjct: 67  LAVGTPPQNVTMVLDTGSELSWLLCATGRQGSAAAGAAAAMGESFRPRASATFAAVPCGS 126

Query: 217 PQCKSLDVSA---C--RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGC--- 268
            QC S D+ A   C   + +C   ++Y DGS + G L T+  + G +  ++  A GC   
Sbjct: 127 TQCSSRDLPAPPSCDGASRQCHVSLSYADGSASDGALATDVFAVGEAPPLRS-AFGCMST 185

Query: 269 GHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSAR------ 322
            +D+    V +AGLLG+  G LS   Q      +YC+ DRD   +GVL    +       
Sbjct: 186 AYDSSPDGVATAGLLGMNRGTLSFVTQASTRRFSYCISDRDD--AGVLLLGHSDLPFLPL 243

Query: 323 GGDAVTAPLIRNKKVDTFYY-VGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAIT 381
               +  P +     D   Y V L G  VGG+A+ IP S+   D  G G  +VD GT  T
Sbjct: 244 NYTPLYQPTLPLPYFDRVAYSVQLLGIRVGGKALPIPASVLAPDHTGAGQTMVDSGTQFT 303

Query: 382 RLQTQAYNSLRDSFVRLAGNL-----KPTSGV-ALFDTCYDFSGLR---SVRVPTVSLHF 432
            L   AY++L+  F++    L      P+       DTC+     R   S R+P V+L F
Sbjct: 304 FLLGDAYSALKAEFLKQTKPLLRALDDPSFAFQEALDTCFRVPAGRPPPSARLPPVTLLF 363

Query: 433 -GAGKALDLPAKNYLIPVD---SAGTFCFAFA-----PTSSALSIIGNVQQQGTRVSFDL 483
            GA  ++      Y +P +   + G +C  F      P ++   +IG+  Q    V +DL
Sbjct: 364 NGAEMSVAGDRLLYKVPGEHRGADGVWCLTFGNADMVPLTA--YVIGHHHQMNLWVEYDL 421

Query: 484 ANNRVGFTPNKC 495
              RVG  P KC
Sbjct: 422 ERGRVGLAPVKC 433


>gi|356532672|ref|XP_003534895.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 449

 Score =  139 bits (350), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 111/352 (31%), Positives = 156/352 (44%), Gaps = 26/352 (7%)

Query: 160 FSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQC 219
            + I +G PP    +V+DTGSDI W+ C PCT C      +FDP  SS++SPL C  P  
Sbjct: 102 MANISIGQPPIPQLVVMDTGSDILWVMCTPCTNCDNHLGLLFDPSMSSTFSPL-CKTP-- 158

Query: 220 KSLDVSAC-RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGS----VKGIALGCGHD-NE 273
              D   C R +   + V Y D S   G    +TV F  +      +  +  GCGH+  +
Sbjct: 159 --CDFKGCSRCDPIPFTVTYADNSTASGMFGRDTVVFETTDEGTSRIPDVLFGCGHNIGQ 216

Query: 274 GLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGD--AVTAPL 331
               G  G+LGL  G  SL  +I     +YC+ D   P     +     G D    + P 
Sbjct: 217 DTDPGHNGILGLNNGPDSLATKI-GQKFSYCIGDLADPYYNYHQLILGEGADLEGYSTPF 275

Query: 332 IRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSL 391
              +  + FYYV + G SVG + + I P  FEM +   GG+I+D G+ IT L    +  L
Sbjct: 276 ---EVHNGFYYVTMEGISVGEKRLDIAPETFEMKKNRTGGVIIDTGSTITFLVDSVHRLL 332

Query: 392 RDSFVRLAGN--LKPTSGVALFDTCYDFSGLRS-VRVPTVSLHFGAGKALDLPAKNYLIP 448
                 L G    + T   + +  C+  S  R  V  P V+ HF  G  L L + ++   
Sbjct: 333 SKEVRNLLGWSFRQTTIEKSPWMQCFYGSISRDLVGFPVVTFHFADGADLALDSGSFFNQ 392

Query: 449 VDSAGTFCFAFAPTS-----SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           ++    FC    P S     S  S+IG + QQ   V +DL N  V F    C
Sbjct: 393 LND-NVFCMTVGPVSSLNLKSKPSLIGLLAQQSYSVGYDLVNQFVYFQRIDC 443


>gi|357118398|ref|XP_003560942.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 478

 Score =  139 bits (350), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 120/381 (31%), Positives = 173/381 (45%), Gaps = 47/381 (12%)

Query: 158 EYFSRIGVGTP-PRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAA 216
           EY   + +GTP P++ ++ LDTGSD+ W QC  C  C+ Q  P FD   S +   +PC+ 
Sbjct: 99  EYLIHLSIGTPRPQRVALTLDTGSDLVWTQC-ACHVCFAQPFPTFDALASQTTLAVPCSD 157

Query: 217 PQCKS--LDVSAC--RANRCLYQVAYGDGSFTVGDLVTETVSF----GNSGS-------V 261
           P C S    +S C    N C Y   Y D S T G +V +T +F    GN+GS       V
Sbjct: 158 PICTSGKYPLSGCTFNDNTCFYLYDYADKSITSGRIVEDTFTFRSPQGNNGSKAHAGVAV 217

Query: 262 KGIALGCGHDNEGLFVGS-AGLLGLGGGMLSLTKQIKATSLAYCLVD----RDSP----- 311
             +  GCG  N+G+F  + +G+ G   G +SL  Q+K    ++C       R SP     
Sbjct: 218 PNVRFGCGQYNKGIFKSNESGIAGFSRGPMSLPSQLKVARFSHCFTAIADARTSPVFLGG 277

Query: 312 ASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGG 371
           A G     +   G   + P   +    + YY+ L G +VG   + +    F     G G 
Sbjct: 278 APGPDNLGAHATGPVQSTPFANSNG--SLYYLTLKGITVGKTRLPLNALAFAGKGTGSGS 335

Query: 372 I--IVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPT-- 427
              I+D GT I  L    Y SLR +FV            A  ++   F   RS  +P   
Sbjct: 336 GGTIIDSGTGIRTLPGPMYRSLRAAFVARVKLPVANESAADAESTLCFEAARSASLPPEA 395

Query: 428 -------VSLHFGAGKALDLPAKNYLIPV----DSAGT-FCFAF-APTSSALSIIGNVQQ 474
                  V LH  AG   DLP ++Y++ +    D +G+  C    +   S L+IIGN QQ
Sbjct: 396 PAPALPKVVLHV-AGADWDLPRESYVLDLLEDEDGSGSGLCLVMNSAGDSDLTIIGNFQQ 454

Query: 475 QGTRVSFDLANNRVGFTPNKC 495
           Q   V++DL  N++ F P +C
Sbjct: 455 QNMHVAYDLEKNKLVFVPARC 475


>gi|222637181|gb|EEE67313.1| hypothetical protein OsJ_24553 [Oryza sativa Japonica Group]
          Length = 414

 Score =  139 bits (349), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 112/332 (33%), Positives = 161/332 (48%), Gaps = 27/332 (8%)

Query: 188 RPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVS--ACRANRCLYQVAYGDGSFTV 245
           R   EC  +  P F P +SS++S LPCA+  C+ L      C A  C+Y   YG G FT 
Sbjct: 83  RAVHECAARPAPPFQPASSSTFSKLPCASSLCQFLTSPYLTCNATGCVYYYPYGMG-FTA 141

Query: 246 GDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCL 305
           G L TET+  G + S  G+A GC  +N G+   S+G++GLG   LSL  Q+     +YCL
Sbjct: 142 GYLATETLHVGGA-SFPGVAFGCSTEN-GVGNSSSGIVGLGRSPLSLVSQVGVGRFSYCL 199

Query: 306 -VDRDSPASGVLEFNSARGGDAVTAP-LIRNKKV--DTFYYVGLTGFSVGGQAVQIPPSL 361
             D D+  S +L  + A+     ++P ++ N ++   ++YYV LTG +VG   + +  + 
Sbjct: 200 RSDADAGDSPILFGSLAKVTGGKSSPAILENPEMPSSSYYYVNLTGITVGATDLPVTSTT 259

Query: 362 FEMDEAGD----GGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVA----LFDT 413
           F           GG IVD GT +T L  + Y  ++ +F+        T+ V      FD 
Sbjct: 260 FGFTRGAGAGLVGGTIVDSGTTLTYLVKEGYAMVKRAFLSQMATANLTTTVNGTRFGFDL 319

Query: 414 CYDFS---GLRSVRVPTVSLHFGAGKALDLPAKNY--LIPVDSAG---TFCFAFAPTSSA 465
           C+D +   G   V VPT+ L F  G    +  ++Y  ++ VDS G     C    P S  
Sbjct: 320 CFDANAAGGGSGVPVPTLVLRFAGGAEYAVRRRSYVGVVEVDSQGRAAVECLLVLPASEK 379

Query: 466 L--SIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           L  SIIGNV Q    V +DL      F P  C
Sbjct: 380 LSISIIGNVMQMDLHVLYDLDGGMFSFAPADC 411


>gi|357491945|ref|XP_003616260.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355517595|gb|AES99218.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 441

 Score =  139 bits (349), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 117/364 (32%), Positives = 168/364 (46%), Gaps = 45/364 (12%)

Query: 165 VGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPI---FDPKTSSSYSPLPCAAPQCKS 221
           +GTPP+   MVLDTGS ++W+ C       ++  P    FDP  SSS+  LPC  P CK 
Sbjct: 75  IGTPPQLQQMVLDTGSQVSWIHCDNKKGPQKKQPPTTSSFDPSLSSSFFALPCNHPLCKP 134

Query: 222 L--DVSA---CRANR-CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGL 275
              D+S    C ANR C Y  +Y DG+   G+LV E ++   S +   I LGC + ++  
Sbjct: 135 QVPDISLPTDCDANRLCHYSFSYTDGTVVEGNLVRENIALSPSLTTPPIILGCANQSDD- 193

Query: 276 FVGSAGLLGLGGGMLSLTKQIKATSLAYCL-VDRDSPASGVLEFNSARGGDAVTAPLIRN 334
              + G+LG+  G LS   Q K T  +Y + V +  P SG L       G+   +   R 
Sbjct: 194 ---ARGILGMNLGRLSFPNQAKITKFSYFVPVKQTQPGSGSLYL-----GNNPNSSCFRY 245

Query: 335 KKVDTF---------------YYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTA 379
            K+ TF               + + + G S+GG+ + IPPS+F+ D  G G  I+D G+ 
Sbjct: 246 VKLLTFSKSQSQRMPNLDPLAFTLPMQGISIGGKKLNIPPSVFKPDTTGFGQTIIDSGSE 305

Query: 380 ITRLQTQAYNSLRDSFVRLAGNLKPTS----GVALFDTCYDFSGLRSVR-VPTVSLHFGA 434
            + +  +AYN +R+  V+  G+         GVA  D C+D       R V  +   F  
Sbjct: 306 FSYMVDKAYNVIRNELVKKVGSKIKKDYIYGGVA--DICFDGDATEIGRLVGDMVFEFEK 363

Query: 435 GKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQ---QQGTRVSFDLANNRVGFT 491
           G  + +P +  LI VD  G  CF              +    QQ   V FDLA +RVGF 
Sbjct: 364 GVEIVIPKERVLIEVD-GGVHCFGIGRAEGLGGGGNIIGNFYQQNLWVEFDLAKHRVGFR 422

Query: 492 PNKC 495
              C
Sbjct: 423 GANC 426


>gi|226531872|ref|NP_001147022.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
 gi|195606574|gb|ACG25117.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
          Length = 491

 Score =  139 bits (349), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 122/421 (28%), Positives = 177/421 (42%), Gaps = 71/421 (16%)

Query: 130 HELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWL---- 185
           H   PA A + P  +            G Y     +GTPP+   ++LDTGS + W+    
Sbjct: 82  HPSVPATAALYPHSY------------GGYAFTASLGTPPQPLPVLLDTGSHLTWVPCTS 129

Query: 186 --QCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANRCL---------- 233
             +CR C+     + P+F PK SSS   + C  P C+ +  +A  A +C           
Sbjct: 130 SYECRNCSSPSASAVPVFHPKNSSSSRLVGCRNPSCQWVHSAANLATKCRRAPCSPGAAN 189

Query: 234 -----------YQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGL 282
                      Y V YG GS T G L+ +T+      +V G  LGC      +    +GL
Sbjct: 190 CPAAASNVCPPYAVVYGSGS-TAGLLIADTLR-APGRAVPGFVLGC--SLVSVHQPPSGL 245

Query: 283 LGLGGGMLSLTKQIKATSLAYCLVDR----DSPASGVLEFNSARGGDAVT-APLIRNKKV 337
            G G G  S+  Q+     +YCL+ R    ++  SG L      GG+ +   PL+++   
Sbjct: 246 AGFGRGAPSVPAQLGLPKFSYCLLSRRFDDNAAVSGSLVLGGTGGGEGMQYVPLVKSAAG 305

Query: 338 D-----TFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLR 392
           D      +YY+ L G +VGG+AV++P   F  + AG GG IVD GT  T L    +  + 
Sbjct: 306 DKLPYGVYYYLALRGVTVGGKAVRLPARAFAGNAAGSGGTIVDSGTTFTYLDPTVFQPVA 365

Query: 393 DSFVRLAGNLKPTS-----GVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLI 447
           D+ V   G     S     G+ L        G RS+ +P +S HF  G  + LP +NY +
Sbjct: 366 DAVVAAVGGRYKRSKDAEDGLGLHPCFALPQGARSMALPELSFHFEGGAVMQLPVENYFV 425

Query: 448 PVDSAGT--FCFAF-----------APTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNK 494
                     C A               S    I+G+ QQQ   V +DL   R+GF    
Sbjct: 426 VAGRGAVEAICLAVVTDFGGGSGAGNEGSGPAIILGSFQQQNYLVEYDLEKERLGFRRQS 485

Query: 495 C 495
           C
Sbjct: 486 C 486


>gi|449493359|ref|XP_004159266.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
          Length = 511

 Score =  138 bits (348), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 116/397 (29%), Positives = 166/397 (41%), Gaps = 58/397 (14%)

Query: 149 VSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSS 208
           VS   +  G Y   +  GTPP+  S + DTGS + W  C     C + S P  DP T S 
Sbjct: 122 VSLFPRSYGAYSVSLAFGTPPQNLSFIFDTGSSLVWFPCTAGYRCSRCSFPYVDPATISK 181

Query: 209 YSP--------LPCAAPQCKSLD----VSACR-----ANRCL-----YQVAYGDGSFTVG 246
           + P        + C  P+C  +      S CR     + +C      Y + YG G+ T G
Sbjct: 182 FVPKLSSSVKVVGCRNPKCAWIFGPNLKSRCRNCNSKSRKCSDSCPGYGLQYGSGA-TAG 240

Query: 247 DLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLV 306
            L++ET+   N   V    +GC   +       AG+ G G G  SL  Q++    ++CLV
Sbjct: 241 ILLSETLDLENK-RVPDFLVGCSVMSVH---QPAGIAGFGRGPESLPSQMRLKRFSHCLV 296

Query: 307 DR---DSPASGVLEFNSARGGDA------VTAPLIRNKKVDT-----FYYVGLTGFSVGG 352
            R   DSP S  L  +S    D       + AP   N  V       +YY+ L    +GG
Sbjct: 297 SRGFDDSPVSSPLVLDSGSESDESKTKSFIYAPFRENPSVSNAAFREYYYLSLRRILIGG 356

Query: 353 QAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVR------LAGNLKPTS 406
           + V+ P      D  G+GG I+D G+  T L    + ++ D   +       A +++  S
Sbjct: 357 KPVKFPYKYLVPDSTGNGGAIIDSGSTFTFLDKPIFEAIADELEKQLVKYPRAKDVEAQS 416

Query: 407 GVALFDTCYDF-SGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSA 465
           G+     C++      S   P V L F  G  L L A+NYL  V   G  C       + 
Sbjct: 417 GL---RPCFNIPKEEESAEFPDVVLKFKGGGKLSLAAENYLAMVTDEGVVCLTMMTDEAV 473

Query: 466 LS-------IIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           +        I+G  QQQ   V +DLA  R+GF   KC
Sbjct: 474 VGGGGGPAIILGAFQQQNVLVEYDLAKQRIGFRKQKC 510


>gi|224140036|ref|XP_002323393.1| predicted protein [Populus trichocarpa]
 gi|222868023|gb|EEF05154.1| predicted protein [Populus trichocarpa]
          Length = 459

 Score =  138 bits (348), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 116/386 (30%), Positives = 165/386 (42%), Gaps = 55/386 (14%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRP---CTEC-----YQQSDPIFDPKTSSS 208
           G Y   +  GTPP+    V+DTGS + W  C     C+EC      +   P F PK SSS
Sbjct: 81  GGYSISLNFGTPPQTTKFVMDTGSSLVWFPCTSRYLCSECNFPNIKKTGIPTFLPKLSSS 140

Query: 209 YSPLPCAAPQCKSL----DVSACR-----ANRCL-----YQVAYGDGSFTVGDLVTETVS 254
              + C  P+C  +      S C+     A  C      Y + YG GS T G L++ET+ 
Sbjct: 141 SKLIGCKNPRCSMIFGPEIQSKCQECDSTAQNCTQTCPPYVIQYGSGS-TAGLLLSETLD 199

Query: 255 FGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDR---DSP 311
           F N  ++    +GC   +        G+ G G    SL  Q+     +YCLV     D+P
Sbjct: 200 FPNKKTIPDFLVGCSIFS---IKQPEGIAGFGRSPESLPSQLGLKKFSYCLVSHAFDDTP 256

Query: 312 ASGVLEFNSARG-GDAVTA-----PLIRNKKV--DTFYYVGLTGFSVGGQAVQIPPSLFE 363
            S  L  ++  G G   TA     P ++N       +YYV L    +G   V++P     
Sbjct: 257 TSSDLVLDTGSGSGVTKTAGLSHTPFLKNPTTAFRDYYYVLLRNIVIGDTHVKVPYKFLV 316

Query: 364 MDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVR------LAGNLKPTSGVALFDTCYDF 417
               G+GG IVD GT  T ++   Y  +   F +      +A  ++  +G+     CY+ 
Sbjct: 317 PGTDGNGGTIVDSGTTFTFMENPVYELVAKEFEKQMAHYTVATEIQNLTGLR---PCYNI 373

Query: 418 SGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALS--------II 469
           SG +S+ VP +   F  G  + LP  NY   VDS G  C      + A          I+
Sbjct: 374 SGEKSLSVPDLIFQFKGGAKMALPLSNYFSIVDS-GVICLTIVSDNVAGPGLGGGPAIIL 432

Query: 470 GNVQQQGTRVSFDLANNRVGFTPNKC 495
           GN QQ+   V FDL N + GF    C
Sbjct: 433 GNYQQRNFYVEFDLENEKFGFKQQSC 458


>gi|224091907|ref|XP_002309394.1| predicted protein [Populus trichocarpa]
 gi|222855370|gb|EEE92917.1| predicted protein [Populus trichocarpa]
          Length = 469

 Score =  137 bits (346), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 113/385 (29%), Positives = 156/385 (40%), Gaps = 53/385 (13%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRP---CTEC-----YQQSDPIFDPKTSSS 208
           G Y   +  GTPP+    V+DTGS + W  C     C+ C          P F PK SSS
Sbjct: 90  GGYSISLNFGTPPQTTKFVMDTGSSLVWFPCTSRYLCSRCDFPNIEVTGIPTFIPKQSSS 149

Query: 209 YSPLPCAAPQCKSL--------------DVSACRANRCLYQVAYGDGSFTVGDLVTETVS 254
            + + C   +C  L                  C  +   Y + YG GS T G L++ET+ 
Sbjct: 150 SNLIGCKNHKCSWLFGPKVQSKCQECDPTTQNCTQSCPPYVIQYGLGS-TAGLLLSETLD 208

Query: 255 FGNSGSVKGIALGCGHDNEGLFV--GSAGLLGLGGGMLSLTKQIKATSLAYCLVDR---D 309
           F +  ++ G  +GC      LF      G+ G G    SL  Q+     +YCLV     D
Sbjct: 209 FPHKKTIPGFLVGCS-----LFSIRQPEGIAGFGRSPESLPSQLGLKKFSYCLVSHAFDD 263

Query: 310 SPASGVLEFNSARGGDAVTAPLIRNKKVDT--------FYYVGLTGFSVGGQAVQIPPSL 361
           +PAS  L  ++  G D    P +               +YYV L    +G   V++P   
Sbjct: 264 TPASSDLVLDTGSGSDDTKTPGLSYTPFQKNPTAAFRDYYYVLLRNIVIGDTHVKVPYKF 323

Query: 362 FEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVAL---FDTCYDFS 418
                 G+GG IVD GT  T ++   Y  +   F +   +    + V        C++ S
Sbjct: 324 LVPGSDGNGGTIVDSGTTFTFMEKPVYELVAKEFEKQVAHYTVATEVQNQTGLRPCFNIS 383

Query: 419 GLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALS--------IIG 470
           G +SV VP    HF  G  + LP  NY   VDS G  C      + + S        I+G
Sbjct: 384 GEKSVSVPEFIFHFKGGAKMALPLANYFSFVDS-GVICLTIVSDNMSGSGIGGGPAIILG 442

Query: 471 NVQQQGTRVSFDLANNRVGFTPNKC 495
           N QQ+   V FDL N R GF    C
Sbjct: 443 NYQQRNFHVEFDLKNERFGFKQQNC 467


>gi|298204765|emb|CBI25263.3| unnamed protein product [Vitis vinifera]
          Length = 359

 Score =  137 bits (346), Expect = 9e-30,   Method: Compositional matrix adjust.
 Identities = 88/268 (32%), Positives = 123/268 (45%), Gaps = 48/268 (17%)

Query: 232 CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLS 291
           C Y + YGDGSFT G+L  E + FG +  VK    GCG +N+GLF G +GL+GLG   LS
Sbjct: 133 CNYAINYGDGSFTRGELGHEKLKFG-TILVKDFIFGCGRNNKGLFGGVSGLMGLGRSDLS 191

Query: 292 LTKQIKATSLAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVG 351
           L  Q                                      N ++  FY++ LTG S+G
Sbjct: 192 LISQTS-----------------------------------ENPQLYNFYFINLTGISIG 216

Query: 352 GQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALF 411
           G A+Q P         G   I+VD GT ITRL    Y +L+  F++      P    ++ 
Sbjct: 217 GVALQAP-------SVGPSRILVDSGTVITRLPPTIYKALKAEFLKQFTGFPPAPAFSIL 269

Query: 412 DTCYDFSGLRSVRVPTVSLHFGAGKAL--DLPAKNYLIPVDSAGTFCFAFA--PTSSALS 467
           DTC++ S  + V +PT+ +HF     L  D+    Y +  D A   C A A       ++
Sbjct: 270 DTCFNLSAYQEVDIPTIKMHFEGNAELTVDVTGVFYFVKSD-ASQVCLALASLEYQDEVA 328

Query: 468 IIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           I+GN QQ+  RV +D    +VGF    C
Sbjct: 329 ILGNYQQKNLRVIYDTKETKVGFALETC 356


>gi|359488213|ref|XP_002263620.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 434

 Score =  137 bits (345), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 112/358 (31%), Positives = 165/358 (46%), Gaps = 38/358 (10%)

Query: 165 VGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPI-FDPKTSSSYSPLPCAAPQCK--- 220
           +GTPP+   MVLDTGS ++W+QC+       ++ P  FDP  SSS+S LPC    CK   
Sbjct: 84  IGTPPQTQQMVLDTGSQLSWIQCK----VPPKTPPTAFDPLLSSSFSVLPCNHSLCKPRV 139

Query: 221 ---SLDVSACRANR-CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLF 276
              +L  S C  NR C Y   Y DG++  G+LV E  +F +S +   + LGC  D+    
Sbjct: 140 PDYTLPTS-CDQNRLCHYSYFYADGTYAEGNLVREKFTFSSSQTTPPLILGCATDSSD-- 196

Query: 277 VGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPA------SGVLEFNSARGGDAVTAP 330
             + G+LG+  G LS +   K +  +YC+  R S +      S  L  N +  G      
Sbjct: 197 --TQGILGMNLGRLSFSSLAKISKFSYCVPPRRSQSGSSPTGSFYLGPNPSSAGFKYVNL 254

Query: 331 LI-----RNKKVDTF-YYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQ 384
           +      R   +D   Y + + G  + G+ + I  S F  D +G G  ++D GT  T L 
Sbjct: 255 MTYRQSQRMPNLDPLAYTLPMLGIRINGKKLNISTSAFRADPSGAGQTLIDSGTWFTFLV 314

Query: 385 TQAYNSLRDSFVRLAGNLKPTSGVAL---FDTCYDFSGLRSVR-VPTVSLHFGAGKALDL 440
            +AY+ +++  V+LAG  K   G       D C+D   +   R +  ++  F  G  + +
Sbjct: 315 DEAYSKVKEEIVKLAGP-KLKKGYVYGGSLDMCFDGDAMVIGRMIGNMAFEFENGVEIVV 373

Query: 441 PAKNYLIPVDSAGTFCFAFAPTS---SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
             +  L  V   G  C     +     A +IIGN  QQ   V FDL   RVGF    C
Sbjct: 374 EREKMLADV-GGGVQCLGIGRSDLLGVASNIIGNFHQQDLWVEFDLVGRRVGFGRTDC 430


>gi|296085499|emb|CBI29231.3| unnamed protein product [Vitis vinifera]
          Length = 308

 Score =  137 bits (345), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 109/362 (30%), Positives = 153/362 (42%), Gaps = 78/362 (21%)

Query: 141 PEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPI 200
           P D  + V+SG     G Y   I +GTPP     + DTGSD+ W QC PC +CY+Q +P+
Sbjct: 15  PNDIQSNVISGG----GSYLMNISLGTPPVSMLGIADTGSDLIWRQCLPCDDCYKQVEPL 70

Query: 201 FDPKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGS 260
           FDPK S +Y  L   + +                       +FT+G   TE    G+  S
Sbjct: 71  FDPKKSKTYKTLGYLSSE-----------------------TFTIGS--TE----GDPAS 101

Query: 261 VKGIALGCGHDNEGLF-----VGSAGLLGLGGGMLSLTKQIKATSLAYCLV--DRDSPAS 313
             G+A GCGH N G F            G    ++ L+ ++     +YCLV    DS AS
Sbjct: 102 FPGLAFGCGHSNGGTFNEKDSGLIGLGGGPLSLVMQLSSKVGG-QFSYCLVPLSSDSTAS 160

Query: 314 GVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGII 373
             + F    G  AV                      V G     P +      A +  II
Sbjct: 161 SKINF----GKSAV----------------------VSGSGTSSPAA------AEESNII 188

Query: 374 VDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFG 433
           +D GT +T L    Y  +  +  ++ G    T     F  CY  SG++ + +PT++ HF 
Sbjct: 189 IDSGTTLTLLPRDFYTDMESALTKVIGGQTTTDPRGTFSLCY--SGVKKLEIPTITAHF- 245

Query: 434 AGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPN 493
            G  + LP  N  +        CF+  P SS L+I GN+ Q    V +DL NN+V F P 
Sbjct: 246 IGADVQLPPLNTFVQAQE-DLVCFSMIP-SSNLAIFGNLSQMNFLVGYDLKNNKVSFKPT 303

Query: 494 KC 495
            C
Sbjct: 304 DC 305


>gi|326499093|dbj|BAK06037.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 471

 Score =  137 bits (345), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 131/379 (34%), Positives = 181/379 (47%), Gaps = 60/379 (15%)

Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCR-----PCTECYQQSDP-----IFDPKTSS 207
           EY   + +GTPP +   + DTGSD+ WL C      P     + +D       FDP  S+
Sbjct: 99  EYLMAVNIGTPPTRMVAIADTGSDLIWLNCSYGGDGPGLAAARDADAQPPGVQFDPSKST 158

Query: 208 SYSPLPCAAPQCKSLDVSACRAN-RCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKG--- 263
           ++  + C +  C  L  ++C A+ +C Y  +YGDGS T G L TET +F ++   +G   
Sbjct: 159 TFRLVDCDSVACSELPEASCGADSKCRYSYSYGDGSHTSGVLSTETFTFADAPGARGDGT 218

Query: 264 ------IALGCGHDNEGLFVGSA---GLLGLGGGMLSLTKQIKA-TSL----AYCLVDRD 309
                 +  GC       FVGS+   GL+GLGGG LSL  Q+ A TSL    +YCLV   
Sbjct: 219 TTRVANVNFGCST----TFVGSSVGDGLVGLGGGDLSLVSQLGADTSLGRRFSYCLVPYS 274

Query: 310 SPASGVLEFN---SARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDE 366
             AS  L F    +     AVT PLI + +V  +Y V L    VG +  + P      D 
Sbjct: 275 VKASSALNFGPRAAVTDPGAVTTPLIPS-QVKAYYIVELRSVKVGNKTFEAP------DR 327

Query: 367 AGDGGIIVDCGTAITRLQTQAYNSLRDSFVR-LAGNLK---PTSGVALFDTCYDFSGLR- 421
           +    +IVD GT +T L      +L D  V+ L G +K     S   L   C+D SG+R 
Sbjct: 328 S---PLIVDSGTTLTFLP----EALVDPLVKELTGRIKLPPAQSPERLLPLCFDVSGVRE 380

Query: 422 ---SVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSAL--SIIGNVQQQG 476
              +  +P V++  G G A+ L A+N  + V   GT C A +  S     SIIGN+ QQ 
Sbjct: 381 GQVAAMIPDVTVGLGGGAAVTLKAENTFVEVQE-GTLCLAVSAMSEQFPASIIGNIAQQN 439

Query: 477 TRVSFDLANNRVGFTPNKC 495
             V +DL    V F P  C
Sbjct: 440 MHVGYDLDKGTVTFAPAAC 458


>gi|125536523|gb|EAY83011.1| hypothetical protein OsI_38231 [Oryza sativa Indica Group]
          Length = 469

 Score =  137 bits (345), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 115/358 (32%), Positives = 169/358 (47%), Gaps = 35/358 (9%)

Query: 162 RIGVGTPPRQ-FSMVLDTGSDINWLQCRPCTECYQQSDP---IFDPKTSSSYSPLPCAAP 217
            I VGTP  Q  S ++D  S   W QC PC        P    F P  S+++SPLPC++ 
Sbjct: 91  NITVGTPVAQTVSGLVDITSYFVWAQCAPCAAAAGCLPPPATAFRPNGSATFSPLPCSSD 150

Query: 218 QCKSLDVSACRAN----------RC-LYQVAYG-DGSFTVGDLVTETVSFGNSGSVKGIA 265
            C  +    C             RC  Y + YG   + T G L T+T +FG + +V G+ 
Sbjct: 151 MCLPVLRETCGRAGAAANATAGARCDSYSLTYGGSAANTSGYLATDTFTFGAT-AVPGVV 209

Query: 266 LGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLV----DRDSPASGVLEFNSA 321
            GC   + G F G++G++G+G G LSL  Q++    +Y L+      D  A  V+ F   
Sbjct: 210 FGCSDASYGDFAGASGVIGIGRGNLSLISQLQFGKFSYQLLAPEATDDGSADSVIRF--- 266

Query: 322 RGGDAV-------TAPLIRNKKVDTFYYVGLTGFSVGGQAVQ-IPPSLFEMDEAGDGGII 373
            G DAV       + PL+ +     FYYV LTG  V G  +  IP   F++   G GG+I
Sbjct: 267 -GDDAVPKTKRGQSTPLLSSTLYPDFYYVNLTGVRVDGNRLDAIPAGTFDLRANGTGGVI 325

Query: 374 VDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVAL-FDTCYDFSGLRSVRVPTVSLHF 432
           +   T +T L+  AY+ +R +     G        AL  D CY+ S +  V+VP ++L F
Sbjct: 326 LSSTTPVTYLEQAAYDVVRAAVASRIGLPAVNGSAALELDLCYNASSMAKVKVPKLTLVF 385

Query: 433 GAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGF 490
             G  +DL A NY    +  G  C    P+    S++G + Q GT + +D+   R+ F
Sbjct: 386 DGGADMDLSAANYFYIDNDTGLECLTMLPSQGG-SVLGTLLQTGTNMIYDVDAGRLTF 442


>gi|242117573|dbj|BAH80056.1| hypothetical protein [Oryza sativa Indica Group]
          Length = 469

 Score =  137 bits (344), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 115/357 (32%), Positives = 169/357 (47%), Gaps = 35/357 (9%)

Query: 163 IGVGTPPRQ-FSMVLDTGSDINWLQCRPCTECYQQSDP---IFDPKTSSSYSPLPCAAPQ 218
           I VGTP  Q  S ++D  S   W QC PC        P    F P  S+++SPLPC++  
Sbjct: 92  ITVGTPVAQTVSGLVDITSYFVWAQCAPCAAAAGCLPPPATAFRPNGSATFSPLPCSSDM 151

Query: 219 CKSLDVSACRAN----------RC-LYQVAYG-DGSFTVGDLVTETVSFGNSGSVKGIAL 266
           C  +    C             RC  Y + YG   + T G L T+T +FG + +V G+  
Sbjct: 152 CLPVLRETCGRAGAAANATAGARCDSYSLTYGGSAANTSGYLATDTFTFGAT-AVPGVVF 210

Query: 267 GCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLV----DRDSPASGVLEFNSAR 322
           GC   + G F G++G++G+G G LSL  Q++    +Y L+      D  A  V+ F    
Sbjct: 211 GCSDASYGDFAGASGVIGIGRGNLSLISQLQFGKFSYQLLAPEATDDGSADSVIRF---- 266

Query: 323 GGDAV-------TAPLIRNKKVDTFYYVGLTGFSVGGQAVQ-IPPSLFEMDEAGDGGIIV 374
           G DAV       + PL+ +     FYYV LTG  V G  +  IP   F++   G GG+I+
Sbjct: 267 GDDAVPKTKRGRSTPLLSSTLYPDFYYVNLTGVRVDGNRLDAIPAGTFDLRANGTGGVIL 326

Query: 375 DCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVAL-FDTCYDFSGLRSVRVPTVSLHFG 433
              T +T L+  AY+ +R +     G        AL  D CY+ S +  V+VP ++L F 
Sbjct: 327 SSTTPVTYLEQAAYDVVRAAVASRIGLPAVNGSAALELDLCYNASSMAKVKVPKLTLVFD 386

Query: 434 AGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGF 490
            G  +DL A NY    +  G  C    P+    S++G + Q GT + +D+   R+ F
Sbjct: 387 GGADMDLSAANYFYIDNDTGLECLTMLPSQGG-SVLGTLLQTGTNMIYDVDAGRLTF 442


>gi|224109494|ref|XP_002315215.1| predicted protein [Populus trichocarpa]
 gi|222864255|gb|EEF01386.1| predicted protein [Populus trichocarpa]
          Length = 444

 Score =  137 bits (344), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 114/373 (30%), Positives = 169/373 (45%), Gaps = 54/373 (14%)

Query: 163 IGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDP----IFDPKTSSSYSPLPCAAPQ 218
           + +GTPP+  +MVLDTGS+++WL+C+         +P    IF+P  S +Y+ +PC++  
Sbjct: 71  LTIGTPPQNITMVLDTGSELSWLRCK--------KEPNFTSIFNPLASKTYTKIPCSSQT 122

Query: 219 CK------SLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGC---- 268
           CK      +L V+   A  C + ++Y D S   G L  ET  FG S +      GC    
Sbjct: 123 CKTRTSDLTLPVTCDPAKLCHFIISYADASSVEGHLAFETFRFG-SLTRPATVFGCMDSG 181

Query: 269 GHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGD--- 325
              N      + GL+G+  G LS   Q+     +YC+   DS  +G L    AR      
Sbjct: 182 SSSNTEEDAKTTGLMGMNRGSLSFVNQMGFRKFSYCISGLDS--TGFLLLGEARYSWLKP 239

Query: 326 -------AVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGT 378
                   ++ PL    +V   Y V L G  V  + + +P S+F  D  G G  +VD GT
Sbjct: 240 LNYTPLVQISTPLPYFDRVA--YSVQLEGIKVNNKVLPLPKSVFVPDHTGAGQTMVDSGT 297

Query: 379 AITRLQTQAYNSLRDSF-VRLAGNLKPTSG-----VALFDTCYDFSGLRSV--RVPTVSL 430
             T L    Y++LR  F ++ AG L+  +          D CY      S    +P V L
Sbjct: 298 QFTFLLGPVYSALRKEFLLQTAGVLRVLNEPQYVFQGAMDLCYLIDSTSSTLPNLPVVKL 357

Query: 431 HF-GAGKALDLPAKNYLIPVDSAG---TFCFAFAPTSSALSI----IGNVQQQGTRVSFD 482
            F GA  ++      Y +P +  G    +CF F   S  L I    IG+ QQQ   + +D
Sbjct: 358 MFRGAEMSVSGQRLLYRVPGEVRGKDSVWCFTFG-NSDELGISSFLIGHHQQQNVWMEYD 416

Query: 483 LANNRVGFTPNKC 495
           L N+R+GF   +C
Sbjct: 417 LENSRIGFAELRC 429


>gi|449441618|ref|XP_004138579.1| PREDICTED: uncharacterized protein LOC101220661 [Cucumis sativus]
          Length = 2819

 Score =  136 bits (343), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 116/364 (31%), Positives = 170/364 (46%), Gaps = 45/364 (12%)

Query: 165  VGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLD- 223
            VG+PP+Q +MVLDTGS+++WL C+           +F+P +SSSYSP+PC++P C++   
Sbjct: 1006 VGSPPQQVTMVLDTGSELSWLHCKKSPNL----TSVFNPLSSSSYSPIPCSSPICRTRTR 1061

Query: 224  -----VSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHD----NEG 274
                 V+      C   V+Y D S   G+L ++    G+S ++ G   GC       N  
Sbjct: 1062 DLPNPVTCDPKKLCHAIVSYADASSLEGNLASDNFRIGSS-ALPGTLFGCMDSGFSSNSE 1120

Query: 275  LFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNS---ARGGDAVTAPL 331
                + GL+G+  G LS   Q+     +YC+  RDS  SGVL F     +  G+    PL
Sbjct: 1121 EDAKTTGLMGMNRGSLSFVTQLGLPKFSYCISGRDS--SGVLLFGDLHLSWLGNLTYTPL 1178

Query: 332  IR-NKKVDTF----YYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQ 386
            ++ +  +  F    Y V L G  VG + + +P S+F  D  G G  +VD GT  T L   
Sbjct: 1179 VQISTPLPYFDRVAYTVQLDGIRVGNKILPLPKSIFAPDHTGAGQTMVDSGTQFTFLLGP 1238

Query: 387  AYNSLRDSFV-RLAGNLKPTSGVAL-----FDTCYDF-SGLRSVRVPTVSLHF-GAGKAL 438
             Y +LR+ F+ +  G L P            D CY   +G +   +P+VSL F GA   +
Sbjct: 1239 VYTALRNEFLEQTKGVLAPLGDPNFVFQGAMDLCYSVAAGGKLPTLPSVSLMFRGAEMVV 1298

Query: 439  DLPAKNYLIPVDSAG---TFCFAFAPTSSALSI----IGNVQQQGTRVSFDLANNRVGFT 491
                  Y +P    G    +C  F   S  L I    IG+  QQ   + FDL    V F 
Sbjct: 1299 GGEVLLYRVPEMMKGNEWVYCLTFG-NSDLLGIEAFVIGHHHQQNVWMEFDL----VAFA 1353

Query: 492  PNKC 495
             + C
Sbjct: 1354 ADLC 1357


>gi|225440729|ref|XP_002275391.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
 gi|147789748|emb|CAN67404.1| hypothetical protein VITISV_025615 [Vitis vinifera]
          Length = 450

 Score =  136 bits (343), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 107/375 (28%), Positives = 161/375 (42%), Gaps = 48/375 (12%)

Query: 163 IGVGTPPRQFSMVLDTGSDINWLQC---RPCTEC-YQQSD----PIFDPKTSSSYSPLPC 214
           +  GTPP++ S ++DTGSD+ W  C     CT C +  +D    PIFDPK SSS   L C
Sbjct: 82  LSFGTPPQKLSFLVDTGSDVVWAPCTTDYTCTNCSFSAADPKKVPIFDPKLSSSSKILDC 141

Query: 215 AAPQCKS-------LDVSACRANR------CLYQVAYGDGSFTVGDLVTETVSFGNSGSV 261
             P+C S       L    C  N       C Y   YG G+ + G  + E + F    ++
Sbjct: 142 RNPKCVSTYFPYVHLGCPRCNGNSKHCSYACPYSTQYGTGA-SSGYFLLENLKFPRK-TI 199

Query: 262 KGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDR---DSPASG--VL 316
           +   LGC   +    + S  L G G  M SL  Q+     AYCL      D+  SG  +L
Sbjct: 200 RNFLLGCT-TSAARELSSDALAGFGRSMFSLPIQMGVKKFAYCLNSHDYDDTRNSGKLIL 258

Query: 317 EFNSARGGDAVTAPLIRNKKVDTFYY-VGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVD 375
           ++   +       P +++     FYY +G+    +G + ++IP         G  G+I+D
Sbjct: 259 DYRDGKTKGLSYTPFLKSPPASAFYYHLGVKDIKIGNKLLRIPSKYLAPGSDGRSGVIID 318

Query: 376 CGTAITRLQTQ-----AYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSL 430
            G       T        N L+    +   +L+  +   L   CY+F+G +S+++P +  
Sbjct: 319 SGYGGAGYMTGPVFKIVTNELKKQMSKYRRSLEAETQTGL-TPCYNFTGHKSIKIPPLIY 377

Query: 431 HFGAGKALDLPAKNYL----------IPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVS 480
            F  G  + +P KNY             +D+ GT      P  S   I+GN Q     V 
Sbjct: 378 QFRGGANMVVPGKNYFGISPQESLACFLMDTNGTNALEITPDPSI--ILGNSQHVDYYVE 435

Query: 481 FDLANNRVGFTPNKC 495
           +DL N+R GF    C
Sbjct: 436 YDLKNDRFGFRRQTC 450


>gi|297805182|ref|XP_002870475.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297316311|gb|EFH46734.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 480

 Score =  136 bits (343), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 100/368 (27%), Positives = 166/368 (45%), Gaps = 39/368 (10%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYSP 211
           G YF++I +G+PP+++ + +DTGSDI W+ C PC +C  ++D      ++D K SS+   
Sbjct: 75  GLYFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCPKCPVKTDLGIPLSLYDSKASSTSKN 134

Query: 212 LPCAAPQCK-SLDVSACRANR-CLYQVAYGDGSFTVGDLVTETVSFGN-SGSVKG----- 263
           + C    C   +    C A + C Y V YGDGS + GD V + ++    +G+++      
Sbjct: 135 VGCEDAFCSFIMQSETCGAKKPCSYHVVYGDGSTSDGDFVKDNITLDQVTGNLRTAPLAQ 194

Query: 264 -IALGCGHDNEGLFVGSA----GLLGLGGGMLSLTKQIKATS-----LAYCLVDRDSPAS 313
            +  GCG +  G    +     G++G G    S+  Q+ A        ++CL + +    
Sbjct: 195 EVVFGCGKNQSGQLGQTESAVDGIMGFGQSNTSVISQLAAGGSVKRIFSHCLDNMN--GG 252

Query: 314 GVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGII 373
           G+            T PL+ N+     Y V L G  V G+ + +PPSL   +  GDGG I
Sbjct: 253 GIFAIGEVESPVVKTTPLVPNQ---VHYNVILKGMDVDGEPIDLPPSLASTN--GDGGTI 307

Query: 374 VDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFG 433
           +D GT +  L    YNSL +     A        V     C+ F+       P V+LHF 
Sbjct: 308 IDSGTTLAYLPQNLYNSLIEKIT--AKQQVKLHMVQETFACFSFTSNTDKAFPVVNLHFE 365

Query: 434 AGKALDLPAKNYLIPVDSAGTFCFAF------APTSSALSIIGNVQQQGTRVSFDLANNR 487
               L +   +YL  +     +CF +          + + ++G++      V +DL N  
Sbjct: 366 DSLKLSVYPHDYLFSL-REDMYCFGWQSGGMTTQDGADVILLGDLVLSNKLVVYDLENEV 424

Query: 488 VGFTPNKC 495
           +G+  + C
Sbjct: 425 IGWADHNC 432


>gi|356532674|ref|XP_003534896.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 446

 Score =  136 bits (342), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 115/356 (32%), Positives = 159/356 (44%), Gaps = 40/356 (11%)

Query: 162 RIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKS 221
            + +G P     +V+DTGSDI W+ C PCT C      +FDP  SS++SPL C  P    
Sbjct: 104 NLSIGQPSIPQLVVMDTGSDILWIMCNPCTNCDNHLGLLFDPSMSSTFSPL-CKTP---- 158

Query: 222 LDVSACRANRCLYQVAYGD-----GSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLF 276
                C+ +   + ++Y D     G+F    LV ET   G S  +  + +GCGH N G  
Sbjct: 159 CGFKGCKCDPIPFTISYVDNSSASGTFGRDILVFETTDEGTS-QISDVIIGCGH-NIGFN 216

Query: 277 V--GSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGD--AVTAPLI 332
              G  G+LGL  G  SL  QI     +YC+ +   P     +     G D    + P  
Sbjct: 217 SDPGYNGILGLNNGPNSLATQI-GRKFSYCIGNLADPYYNYNQLRLGEGADLEGYSTPF- 274

Query: 333 RNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLR 392
             +    FYYV + G SVG + + I    FEM   G GG+I+D GT IT L   A+  L 
Sbjct: 275 --EVYHGFYYVTMEGISVGEKRLDIALETFEMKRNGTGGVILDSGTTITYLVDSAHKLLY 332

Query: 393 DSFVRLAGNLKPTSGVALFDT-----CYDFSGLRS---VRVPTVSLHFGAGKALDLPAKN 444
           +    L   LK +    +F+      CY   G+ S   V  P V+ HF  G  L L   +
Sbjct: 333 NEVRNL---LKWSFRQVIFENAPWKLCY--YGIISRDLVGFPVVTFHFVDGADLALDTGS 387

Query: 445 YLIPVDSAGTFCFAFAP-----TSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           +    D    FC   +P     T+ + S+IG + QQ   V +DL N  V F    C
Sbjct: 388 FFSQRDD--IFCMTVSPASILNTTISPSVIGLLAQQSYNVGYDLVNQFVYFQRIDC 441


>gi|413922067|gb|AFW61999.1| hypothetical protein ZEAMMB73_694403, partial [Zea mays]
          Length = 328

 Score =  136 bits (342), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 88/239 (36%), Positives = 126/239 (52%), Gaps = 34/239 (14%)

Query: 147 PVVSGASQGSGEYFSRIGVG----TPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFD 202
           P+ SG    +  Y + I +G    +P    ++++DTGSD+ W+QC+PC+ CY Q DP+FD
Sbjct: 80  PLTSGIRLQTLNYVTTISLGGSSGSPAANLTVIVDTGSDLTWVQCKPCSACYAQRDPLFD 139

Query: 203 PKTSSSYSPLPCAAPQCK-----------SLDVSACRANRCLYQVAYGDGSFTVGDLVTE 251
           P  S++Y+ + C A  C            S   +   + +C Y +AYGDGSF+ G L T+
Sbjct: 140 PAGSATYAAVRCNASACADSLRAATGTPGSCGSTGAGSEKCYYALAYGDGSFSRGVLATD 199

Query: 252 TVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDR 308
           TV+ G + S+ G   GCG  N GLF G+AGL+GLG   LSL  Q  +      +YCL   
Sbjct: 200 TVALGGA-SLGGFVFGCGLSNRGLFGGTAGLMGLGRTELSLVSQTASRYGGVFSYCLPAA 258

Query: 309 DS-PASGVLEFNSARGGDAV------TAP-----LIRNKKVDTFYYVGLTGFSVGGQAV 355
            S  ASG L      GGD        T P     +I +     FY++ +TG +VGG A+
Sbjct: 259 TSGDASGSLSLG---GGDDAASSYRNTTPVAYTRMIADPAQPPFYFLNVTGAAVGGTAL 314


>gi|21717166|gb|AAM76359.1|AC074196_17 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433306|gb|AAP54835.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|125575546|gb|EAZ16830.1| hypothetical protein OsJ_32301 [Oryza sativa Japonica Group]
          Length = 373

 Score =  136 bits (342), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 100/356 (28%), Positives = 161/356 (45%), Gaps = 33/356 (9%)

Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQ 218
           Y + + +GTPP+  S ++    +  W QC PC  C++Q  P+F+   SS+Y P PC    
Sbjct: 28  YMANLTIGTPPQPASAIIHLAGEFVWTQCSPCRRCFKQDLPLFNRSASSTYRPEPCGTAL 87

Query: 219 CKSLDVSACRAN-RCLYQV--AYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHD-NEG 274
           C+S+  S C  +  C Y+V   +GD S   G   T+T + G   +   +A GC  D N  
Sbjct: 88  CESVPASTCSGDGVCSYEVETMFGDTSGIGG---TDTFAIGT--ATASLAFGCAMDSNIK 142

Query: 275 LFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPA--SGVLEFNSAR---GGDAVTA 329
             +G++G++GLG    SL  Q+ AT+ +YCL    +    S +L   SA+   G  A T 
Sbjct: 143 QLLGASGVVGLGRTPWSLVGQMNATAFSYCLAPHGAAGKKSALLLGASAKLAGGKSAATT 202

Query: 330 PLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYN 389
           PL+      + Y + L G   G   +  PP+           ++VD    ++ L   A+ 
Sbjct: 203 PLVNTSDDSSDYMIHLEGIKFGDVIIAPPPN--------GSVVLVDTIFGVSFLVDAAFQ 254

Query: 390 SLRDSFVRLAGNLKPTSGVALFDTCYDFSGL-----RSVRVPTVSLHFGAGKALDLPAKN 444
           +++ +     G     +    FD C+  +        S+ +P V L F    AL +P   
Sbjct: 255 AIKKAVTVAVGAAPMATPTKPFDLCFPKAAAAAGANSSLPLPDVVLTFQGAAALTVPPSK 314

Query: 445 YLIPVDSAGTFCFAFAPT-----SSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           Y+    + GT C A   +     ++ LSI+G + Q+     FDL    + F P  C
Sbjct: 315 YMYDAGN-GTVCLAMMSSAMLNLTTELSILGRLHQENIHFLFDLDKETLSFEPADC 369


>gi|449457263|ref|XP_004146368.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 469

 Score =  136 bits (342), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 116/376 (30%), Positives = 170/376 (45%), Gaps = 43/376 (11%)

Query: 153 SQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPL 212
           ++GSG +   + +G+PP    +V+DTGS + W+QC PC  C+QQS   FDP  S S+  L
Sbjct: 99  NRGSG-FLVNLSIGSPPVTQLVVVDTGSSLLWVQCLPCINCFQQSTSWFDPLKSVSFKTL 157

Query: 213 PCAAPQCKSLDVSAC-RANRCLYQVAYGDGSFTVGDLVTETVSFG--NSGSV-------- 261
            C  P    ++   C R N+  Y++ Y  G  + G L  E++ F   + G V        
Sbjct: 158 GCGFPGYNYINGYKCNRFNQAEYKLRYLGGDSSQGILAKESLLFETLDEGRVFQYNAIST 217

Query: 262 -------KGIALGCGHDNEGLFVGSA--GLLGLGGG-MLSLTKQIKATSLAYCLVDRDSP 311
                    I  GCGH N       A  G+ GLG    +++  Q+     +YC+ D ++P
Sbjct: 218 QISKIKKSNITFGCGHMNIKTNNDDAYNGVFGLGAYPHITMATQL-GNKFSYCIGDINNP 276

Query: 312 ASG----VLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEA 367
                  VL   S   GD+    +         YYV L   SVG + ++I P+ F++   
Sbjct: 277 LYTHNHLVLGQGSYIEGDSTPLQIHFGH-----YYVTLQSISVGSKTLKIDPNAFKISSD 331

Query: 368 GDGGIIVDCGTAITRLQTQAYNSLRDSFVRL-AGNLKPTSGVALFD-TCYDFSGLRS--- 422
           G GG+++D G   T+L    +  L D  V L  G L+       F+  C  F G+ S   
Sbjct: 332 GSGGVLIDSGMTYTKLANGGFELLYDEIVDLMKGLLERIPTQRKFEGLC--FKGVVSRDL 389

Query: 423 VRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSA---LSIIGNVQQQGTRV 479
           V  P V+ HF  G  L L + + L        FC A  P++S    LS+IG + QQ   V
Sbjct: 390 VGFPAVTFHFAGGADLVLESGS-LFRQHGGDRFCLAILPSNSELLNLSVIGILAQQNYNV 448

Query: 480 SFDLANNRVGFTPNKC 495
            FDL   +V F    C
Sbjct: 449 GFDLEQMKVFFRRIDC 464


>gi|224101015|ref|XP_002312106.1| predicted protein [Populus trichocarpa]
 gi|222851926|gb|EEE89473.1| predicted protein [Populus trichocarpa]
          Length = 440

 Score =  136 bits (342), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 115/368 (31%), Positives = 170/368 (46%), Gaps = 50/368 (13%)

Query: 166 GTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKS---- 221
           GTP +  +MVLDTGS+++WL C+         + IF+P  S +Y+ +PC++P C++    
Sbjct: 74  GTPLQNITMVLDTGSELSWLHCKK----EPNFNSIFNPLASKTYTKIPCSSPTCETRTRD 129

Query: 222 --LDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGS 279
             L VS   A  C + ++Y D S   G+L  ET   G   SV G A   G  + G    S
Sbjct: 130 LPLPVSCDPAKLCHFIISYADASSVEGNLAFETFRVG---SVTGPATVFGCMDSGFSSNS 186

Query: 280 ------AGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGD-------- 325
                  GL+G+  G LS   Q+     +YC+ DRDS  SGVL    A            
Sbjct: 187 EEDAKTTGLMGMNRGSLSFVNQMGFRKFSYCISDRDS--SGVLLLGEASFSWLKPLNYTP 244

Query: 326 --AVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRL 383
              ++ PL    +V   Y V L G  V  + + +P S+F  D  G G  +VD GT  T L
Sbjct: 245 LVEMSTPLPYFDRVA--YSVQLEGIRVSDKVLSLPKSVFVPDHTGAGQTMVDSGTQFTFL 302

Query: 384 QTQAYNSLRDSF-VRLAGNLKPTSG-----VALFDTCYDFSGLRSV--RVPTVSLHF-GA 434
               Y++L+  F ++  G L+  +          D CY     R+    +P V+L F GA
Sbjct: 303 LGPVYSALKQEFLLQTKGVLRVLNEPRYVFQGAMDLCYLIEPTRAALPNLPVVNLMFRGA 362

Query: 435 GKALDLPAKNYLIPVDSAG---TFCFAFAPTSSALSI----IGNVQQQGTRVSFDLANNR 487
             ++      Y +P +  G    +CF F   S +L I    IG+ QQQ   + +DL  +R
Sbjct: 363 EMSVSGQRLLYRVPGEVRGKDSVWCFTFG-NSDSLGIESFVIGHHQQQNVWMEYDLEKSR 421

Query: 488 VGFTPNKC 495
           +GF   +C
Sbjct: 422 IGFAEVRC 429


>gi|30692930|ref|NP_198475.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|66792626|gb|AAY56415.1| At5g36260 [Arabidopsis thaliana]
 gi|332006680|gb|AED94063.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 482

 Score =  135 bits (341), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 102/368 (27%), Positives = 166/368 (45%), Gaps = 39/368 (10%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYSP 211
           G YF++I +G+PP+++ + +DTGSDI W+ C PC +C  ++D      ++D KTSS+   
Sbjct: 76  GLYFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCPKCPVKTDLGIPLSLYDSKTSSTSKN 135

Query: 212 LPCAAPQCK-SLDVSACRANR-CLYQVAYGDGSFTVGDLVTETVSF----GNSGS---VK 262
           + C    C   +    C A + C Y V YGDGS + GD + + ++     GN  +    +
Sbjct: 136 VGCEDDFCSFIMQSETCGAKKPCSYHVVYGDGSTSDGDFIKDNITLEQVTGNLRTAPLAQ 195

Query: 263 GIALGCGHDNEGLF--VGSA--GLLGLGGGMLSLTKQIKATS-----LAYCLVDRDSPAS 313
            +  GCG +  G      SA  G++G G    S+  Q+ A        ++CL + +    
Sbjct: 196 EVVFGCGKNQSGQLGQTDSAVDGIMGFGQSNTSIISQLAAGGSTKRIFSHCLDNMN--GG 253

Query: 314 GVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGII 373
           G+            T P++ N+     Y V L G  V G  + +PPSL   +  GDGG I
Sbjct: 254 GIFAVGEVESPVVKTTPIVPNQ---VHYNVILKGMDVDGDPIDLPPSLASTN--GDGGTI 308

Query: 374 VDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFG 433
           +D GT +  L    YNSL +     A        V     C+ F+       P V+LHF 
Sbjct: 309 IDSGTTLAYLPQNLYNSLIEKIT--AKQQVKLHMVQETFACFSFTSNTDKAFPVVNLHFE 366

Query: 434 AGKALDLPAKNYLIPVDSAGTFCFAF------APTSSALSIIGNVQQQGTRVSFDLANNR 487
               L +   +YL  +     +CF +          + + ++G++      V +DL N  
Sbjct: 367 DSLKLSVYPHDYLFSL-REDMYCFGWQSGGMTTQDGADVILLGDLVLSNKLVVYDLENEV 425

Query: 488 VGFTPNKC 495
           +G+  + C
Sbjct: 426 IGWADHNC 433


>gi|449463971|ref|XP_004149703.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 641

 Score =  135 bits (341), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 106/369 (28%), Positives = 174/369 (47%), Gaps = 52/369 (14%)

Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCA 215
           +G Y +R+ +GTPP+QF++++DTGS + ++ C  C +C +  DP FDP++SS+Y P+ C 
Sbjct: 80  NGYYTTRLWIGTPPQQFALIVDTGSTVTYVPCSTCEQCGRHQDPKFDPESSSTYKPIKCN 139

Query: 216 AP-QCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSV--KGIALGCGHDN 272
               C S  V      +C+Y+  Y + S + G L  + +SFGN   +  +    GC +  
Sbjct: 140 IDCICDSDGV------QCVYERQYAEMSTSSGVLGEDVISFGNQSELIPQRAVFGCENME 193

Query: 273 EG-LFVGSA-GLLGLGGGMLSLTKQIKAT-----SLAYCLVDRD-----------SPASG 314
            G LF   A G++GLG G LSL  Q+        S + C    D           SP S 
Sbjct: 194 TGDLFSQRADGIMGLGTGDLSLVDQLVEKGAINDSFSLCYGGMDIGGGAMVLGGISPPSD 253

Query: 315 VLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIV 374
           ++   S    D V +P         +Y V L    V G+ + +   +F+    G  G ++
Sbjct: 254 MIFTYS----DPVRSP---------YYNVDLKEIHVAGKKLPLSSGIFD----GRYGAVL 296

Query: 375 DCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSG--VALFDTCYDFSGLR----SVRVPTV 428
           D GT    L  +A+++ +D+ +    +LK   G      D C+  +G      S + PTV
Sbjct: 297 DSGTTYAYLPAEAFSAFKDAIMDEIHSLKKIDGPDPNFKDICFSGAGSDAAELSNKFPTV 356

Query: 429 SLHFGAGKALDLPAKNYLIPVDSA-GTFCFA-FAPTSSALSIIGNVQQQGTRVSFDLANN 486
            + F  G+ L L  +NY        G +C   F   +   +++G +  + T V +D AN+
Sbjct: 357 DMVFENGQKLSLTPENYFFRHSKVHGAYCLGIFENGNDQTTLLGGIVVRNTLVMYDRANS 416

Query: 487 RVGFTPNKC 495
           ++GF    C
Sbjct: 417 KIGFWKTNC 425


>gi|9759039|dbj|BAB09366.1| aspartyl protease-like [Arabidopsis thaliana]
          Length = 478

 Score =  135 bits (341), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 102/368 (27%), Positives = 166/368 (45%), Gaps = 39/368 (10%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYSP 211
           G YF++I +G+PP+++ + +DTGSDI W+ C PC +C  ++D      ++D KTSS+   
Sbjct: 72  GLYFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCPKCPVKTDLGIPLSLYDSKTSSTSKN 131

Query: 212 LPCAAPQCK-SLDVSACRANR-CLYQVAYGDGSFTVGDLVTETVSF----GNSGS---VK 262
           + C    C   +    C A + C Y V YGDGS + GD + + ++     GN  +    +
Sbjct: 132 VGCEDDFCSFIMQSETCGAKKPCSYHVVYGDGSTSDGDFIKDNITLEQVTGNLRTAPLAQ 191

Query: 263 GIALGCGHDNEGLF--VGSA--GLLGLGGGMLSLTKQIKATS-----LAYCLVDRDSPAS 313
            +  GCG +  G      SA  G++G G    S+  Q+ A        ++CL + +    
Sbjct: 192 EVVFGCGKNQSGQLGQTDSAVDGIMGFGQSNTSIISQLAAGGSTKRIFSHCLDNMN--GG 249

Query: 314 GVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGII 373
           G+            T P++ N+     Y V L G  V G  + +PPSL   +  GDGG I
Sbjct: 250 GIFAVGEVESPVVKTTPIVPNQ---VHYNVILKGMDVDGDPIDLPPSLASTN--GDGGTI 304

Query: 374 VDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFG 433
           +D GT +  L    YNSL +     A        V     C+ F+       P V+LHF 
Sbjct: 305 IDSGTTLAYLPQNLYNSLIEKIT--AKQQVKLHMVQETFACFSFTSNTDKAFPVVNLHFE 362

Query: 434 AGKALDLPAKNYLIPVDSAGTFCFAF------APTSSALSIIGNVQQQGTRVSFDLANNR 487
               L +   +YL  +     +CF +          + + ++G++      V +DL N  
Sbjct: 363 DSLKLSVYPHDYLFSL-REDMYCFGWQSGGMTTQDGADVILLGDLVLSNKLVVYDLENEV 421

Query: 488 VGFTPNKC 495
           +G+  + C
Sbjct: 422 IGWADHNC 429


>gi|449508297|ref|XP_004163275.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 641

 Score =  135 bits (341), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 106/369 (28%), Positives = 174/369 (47%), Gaps = 52/369 (14%)

Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCA 215
           +G Y +R+ +GTPP+QF++++DTGS + ++ C  C +C +  DP FDP++SS+Y P+ C 
Sbjct: 80  NGYYTTRLWIGTPPQQFALIVDTGSTVTYVPCSTCEQCGRHQDPKFDPESSSTYKPIKCN 139

Query: 216 AP-QCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSV--KGIALGCGHDN 272
               C S  V      +C+Y+  Y + S + G L  + +SFGN   +  +    GC +  
Sbjct: 140 IDCICDSDGV------QCVYERQYAEMSTSSGVLGEDVISFGNQSELIPQRAVFGCENME 193

Query: 273 EG-LFVGSA-GLLGLGGGMLSLTKQIKAT-----SLAYCLVDRD-----------SPASG 314
            G LF   A G++GLG G LSL  Q+        S + C    D           SP S 
Sbjct: 194 TGDLFSQRADGIMGLGTGDLSLVDQLVEKGAINDSFSLCYGGMDIGGGAMVLGGISPPSD 253

Query: 315 VLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIV 374
           ++   S    D V +P         +Y V L    V G+ + +   +F+    G  G ++
Sbjct: 254 MIFTYS----DPVRSP---------YYNVDLKEIHVAGKKLPLSSGIFD----GRYGAVL 296

Query: 375 DCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSG--VALFDTCYDFSGLR----SVRVPTV 428
           D GT    L  +A+++ +D+ +    +LK   G      D C+  +G      S + PTV
Sbjct: 297 DSGTTYAYLPAEAFSAFKDAIMDEIHSLKKIDGPDPNFKDICFSGAGSDAAELSNKFPTV 356

Query: 429 SLHFGAGKALDLPAKNYLIPVDSA-GTFCFA-FAPTSSALSIIGNVQQQGTRVSFDLANN 486
            + F  G+ L L  +NY        G +C   F   +   +++G +  + T V +D AN+
Sbjct: 357 DMVFENGQKLSLTPENYFFRHSKVHGAYCLGIFENGNDQTTLLGGIVVRNTLVMYDRANS 416

Query: 487 RVGFTPNKC 495
           ++GF    C
Sbjct: 417 KIGFWKTNC 425


>gi|356546036|ref|XP_003541438.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 486

 Score =  135 bits (341), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 110/368 (29%), Positives = 173/368 (47%), Gaps = 35/368 (9%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYSP 211
           G Y++++ +GTPP++F++ +DTGSDI W+ C  C+ C Q S        FD   SS+ + 
Sbjct: 76  GLYYTKVKMGTPPKEFNVQIDTGSDILWVNCNTCSNCPQSSQLGIELNFFDTVGSSTAAL 135

Query: 212 LPCAAPQCKSLDVSAC-----RANRCLYQVAYGDGSFTVGDLVTETVSF----GNSGSVK 262
           +PC+ P C S    A      R N+C Y   YGDGS T G  V++ + F    G   +V 
Sbjct: 136 IPCSDPICTSRVQGAAAECSPRVNQCSYTFQYGDGSGTSGYYVSDAMYFSLIMGQPPAVN 195

Query: 263 G---IALGCGHDNEGLFVGS----AGLLGLGGGMLSLTKQIKATSL-----AYCLVDRDS 310
               I  GC     G    +     G+ G G G LS+  Q+ +  +     ++CL   D 
Sbjct: 196 SSATIVFGCSISQSGDLTKTDKAVDGIFGFGPGPLSVVSQLSSRGITPKVFSHCL-KGDG 254

Query: 311 PASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDG 370
              GVL          V +PL+ ++     Y + L   +V GQ + I P++F +     G
Sbjct: 255 DGGGVLVLGEILEPSIVYSPLVPSQP---HYNLNLQSIAVNGQLLPINPAVFSISN-NRG 310

Query: 371 GIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSL 430
           G IVDCGT +  L  +AY+ L  + +  A +       +  + CY  S       P+VSL
Sbjct: 311 GTIVDCGTTLAYLIQEAYDPLVTA-INTAVSQSARQTNSKGNQCYLVSTSIGDIFPSVSL 369

Query: 431 HFGAGKALDLPAKNYLIP---VDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNR 487
           +F  G ++ L  + YL+    +D A  +C  F       SI+G++  +   V +D+A  R
Sbjct: 370 NFEGGASMVLKPEQYLMHNGYLDGAEMWCIGFQKFQEGASILGDLVLKDKIVVYDIAQQR 429

Query: 488 VGFTPNKC 495
           +G+    C
Sbjct: 430 IGWANYDC 437


>gi|56784900|dbj|BAD82194.1| aspartic proteinase nepenthesin I-like [Oryza sativa Japonica
           Group]
          Length = 260

 Score =  135 bits (340), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 99/257 (38%), Positives = 135/257 (52%), Gaps = 13/257 (5%)

Query: 249 VTETVSFGN-SGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCL-- 305
           +TET +FG+ + +  GIA GC   +EG F   +GL+GLG G LSL  Q+   +  Y L  
Sbjct: 1   MTETFTFGDDAAAFPGIAFGCTLRSEGGFGTGSGLVGLGRGKLSLVTQLNVEAFGYRLSS 60

Query: 306 -VDRDSPAS-GVLEFNSARGGDA-VTAPLIRNKKVDT--FYYVGLTGFSVGGQAVQIPPS 360
            +   SP S G L   +   GD+ ++ PL+ N  V    FYYVGLTG SVGG+ VQIP  
Sbjct: 61  DLSAPSPISFGSLADVTGGNGDSFMSTPLLTNPVVQDLPFYYVGLTGISVGGKLVQIPSG 120

Query: 361 LFEMDEA-GDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSG 419
            F  D + G GG+I D GT +T L   AY  +RD  +   G  KP       D      G
Sbjct: 121 TFSFDRSTGAGGVIFDSGTTLTMLPDPAYTLVRDELLSQMGFQKPPPAANDDDLICFTGG 180

Query: 420 LRSVRVPTVSLHFGAGKALDLPAKNYLIPV---DSAGTFCFAFAPTSSALSIIGNVQQQG 476
             +   P++ LHF  G  +DL  +NYL  +   +     C++   +S AL+IIGN+ Q  
Sbjct: 181 SSTTTFPSMVLHFDGGADMDLSTENYLPQMQGQNGETARCWSVVKSSQALTIIGNIMQMD 240

Query: 477 TRVSFDLANN-RVGFTP 492
             V FDL+ N R+ F P
Sbjct: 241 FHVVFDLSGNARMLFQP 257


>gi|357127507|ref|XP_003565421.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
           distachyon]
          Length = 438

 Score =  135 bits (340), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 114/366 (31%), Positives = 157/366 (42%), Gaps = 48/366 (13%)

Query: 148 VVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSS 207
           VVS     + EY   + V TPP +   + DTGS + WL+C+          P      SS
Sbjct: 65  VVSPMVPQNFEYLMALDVSTPPVRMLALADTGSSLVWLKCK---------LPAAHTPASS 115

Query: 208 SYSPLPCAAPQCKSL-DVSACRA-----NRCLYQVAYGDGSFTVGDLVTETVSFGNSGSV 261
           SY+ LPC A  CK+L D ++CRA     N C+Y+ A+ DGS T G +  +  +F      
Sbjct: 116 SYARLPCDAFACKALGDAASCRATGSGNNICVYRYAFADGSCTAGPVTVDAFTFSTR--- 172

Query: 262 KGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS-----LAYCLV--DRDSPASG 314
             +  GC    EGL V   GL+GL  G +SL  Q+ A +      +YCLV        S 
Sbjct: 173 --LDFGCATRTEGLSVPDDGLVGLANGPISLVSQLSAKTPFAHKFSYCLVPYSSSETVSS 230

Query: 315 VLEFNS----ARGGDAVTAPLI--RNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAG 368
            L F S    +    A T PL+  RNK   +FY + L    V G+ V +        +  
Sbjct: 231 SLNFGSHAIVSSSPGAATTPLVAGRNK---SFYTIALDSIKVAGKPVPL--------QTT 279

Query: 369 DGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLR----SVR 424
              +IVD GT +T L     + L  +        +  S   L+  CYD            
Sbjct: 280 TTKLIVDSGTMLTYLPKAVLDPLVAALTAAIKLPRVKSPETLYAVCYDVRRRAPEDVGKS 339

Query: 425 VPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLA 484
           +P V+L  G G  + LP  N  +  +   T C A   +     I+GNV QQ   V FDL 
Sbjct: 340 IPDVTLVLGGGGEVRLPWGNTFVVENKGTTVCLALVESHLPEFILGNVAQQNLHVGFDLE 399

Query: 485 NNRVGF 490
              V F
Sbjct: 400 RRTVSF 405


>gi|356540369|ref|XP_003538662.1| PREDICTED: LOW QUALITY PROTEIN: probable aspartic protease
           At2g35615-like [Glycine max]
          Length = 364

 Score =  135 bits (339), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 112/371 (30%), Positives = 160/371 (43%), Gaps = 42/371 (11%)

Query: 144 FSTPVVSGASQG--------SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQ 195
           +  P  S AS G        +G+Y  ++ +GTPP     ++DT SD+ W QC PC  CY+
Sbjct: 8   YQVPKKSYASNGPFTRVTSNNGDYLMKLTLGTPPVDVYGLVDTDSDLVWAQCTPCQGCYK 67

Query: 196 QSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANR-CLYQVAYGDGSFTVGDLVTETVS 254
           Q +P+FDP              +C S    +C   + C Y  AY D S T G L  E  +
Sbjct: 68  QKNPMFDP------------LKECNSFFDHSCSPEKACDYVYAYADDSATKGMLAKEIAT 115

Query: 255 FGNSGS---VKGIALGCGHDNEGLF-----VGSAGLLGLGGGMLSLTKQIKATSLAYCLV 306
           F ++     V+ I  GCGH+N G+F            G    +  +     +   + CLV
Sbjct: 116 FSSTDGKPIVESIIFGCGHNNTGVFNENDMGLIGLGGGPLSLVSQMGNLYGSKRFSQCLV 175

Query: 307 --DRDSPASGVLEFNSA---RGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSL 361
               D   SG +    A    G   VT PL+ +++  T Y V L G SVG   V    S 
Sbjct: 176 PFHADPHTSGTISLGEASDVSGEGVVTTPLV-SEEGQTPYLVTLEGISVGDTFVPFNSS- 233

Query: 362 FEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLR 421
            EM     G I++D GT  T L  + Y+ L +  +++  NL P        T   +    
Sbjct: 234 -EM--LSKGNIMIDSGTPETYLPQEFYDRLVEE-LKVQINLPPIHVDPDLGTQLCYKSET 289

Query: 422 SVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSF 481
           ++  P ++ HF       LP + ++ P D  G FCFA   T+  L I GN  Q    + F
Sbjct: 290 NLEGPILTAHFEGADVKLLPLQTFIPPKD--GVFCFAMTGTTDGLYIFGNFAQSNVLIGF 347

Query: 482 DLANNRVGFTP 492
           DL    V F P
Sbjct: 348 DLDKRIVFFKP 358


>gi|53791672|dbj|BAD53242.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
           Group]
          Length = 504

 Score =  135 bits (339), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 112/373 (30%), Positives = 176/373 (47%), Gaps = 44/373 (11%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYSP 211
           G YF+R+ +G+PP+++ + +DTGSDI W+ C PCT C   S        F+P TSS+ S 
Sbjct: 89  GLYFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSK 148

Query: 212 LPCAAPQCKS---LDVSACRANR---CLYQVAYGDGSFTVGDLVTETVSF----GN---S 258
           +PC+  +C +      + C+ +    C Y   YGDGS T G  V++T+ F    GN   +
Sbjct: 149 IPCSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDTVMGNEQTA 208

Query: 259 GSVKGIALGCGHDNEGLFVGS----AGLLGLGGGMLSLTKQIKATSL-----AYCLVDRD 309
            S   I  GC +   G    +     G+ G G   LS+  Q+ +  +     ++CL   D
Sbjct: 209 NSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCLKGSD 268

Query: 310 SPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGD 369
           +   G+L          V  PL+ ++     Y + L    V GQ + I  SLF       
Sbjct: 269 N-GGGILVLGEIVEPGLVYTPLVPSQP---HYNLNLESIVVNGQKLPIDSSLFTTSNT-- 322

Query: 370 GGIIVDCGTAITRLQTQAYNSLRDSFVR-LAGNLKPT--SGVALFDTCYDFSGLRSVRVP 426
            G IVD GT +  L   AY    D FV  +   + P+  S V+  + C+  S       P
Sbjct: 323 QGTIVDSGTTLAYLADGAY----DPFVNAITAAVSPSVRSLVSKGNQCFVTSSSVDSSFP 378

Query: 427 TVSLHFGAGKALDLPAKNYLI---PVDSAGTFCFAFAPTS-SALSIIGNVQQQGTRVSFD 482
           TVSL+F  G A+ +  +NYL+    +D+   +C  +       ++I+G++  +     +D
Sbjct: 379 TVSLYFMGGVAMTVKPENYLLQQASIDNNVLWCIGWQRNQGQQITILGDLVLKDKIFVYD 438

Query: 483 LANNRVGFTPNKC 495
           LAN R+G+T   C
Sbjct: 439 LANMRMGWTDYDC 451


>gi|297828736|ref|XP_002882250.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297328090|gb|EFH58509.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 488

 Score =  135 bits (339), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 108/367 (29%), Positives = 165/367 (44%), Gaps = 38/367 (10%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPI----FDPKTSSSYSPL 212
           G YF++IG+GTP R F + +DTGSDI W+ C  C  C ++SD +    +D   SS+   +
Sbjct: 83  GLYFAKIGLGTPSRDFHVQVDTGSDILWVNCAGCIRCPRKSDLVELTPYDADASSTAKSV 142

Query: 213 PCAAPQCKSLDV-SACRA-NRCLYQVAYGDGSFTVGDLVTETVSF----GN--SGSVKG- 263
            C+   C  ++  S C + + C Y + YGDGS T G LV + V      GN  +GS  G 
Sbjct: 143 SCSDNFCSYVNQRSECHSGSTCQYVILYGDGSSTNGYLVRDVVHLDLVTGNRQTGSTNGT 202

Query: 264 IALGCGHDNEGLFVGSA----GLLGLGGGMLSLTKQIKAT-----SLAYCLVDRDSPASG 314
           I  GCG    G    S     G++G G    S   Q+ +      S A+CL + +    G
Sbjct: 203 IIFGCGSKQSGQLGESQAAVDGIMGFGQSNSSFISQLASQGKVKRSFAHCLDNNN--GGG 260

Query: 315 VLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIV 374
           +            T P++        Y V L    VG   +Q+    F  D   D G+I+
Sbjct: 261 IFAIGEVVSPKVKTTPMLSKS---AHYSVNLNAIEVGNSVLQLSSDAF--DSGDDKGVII 315

Query: 375 DCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGA 434
           D GT +  L    YN L +  +     L   +    F TC+ +   R  R PTV+  F  
Sbjct: 316 DSGTTLVYLPDAVYNPLMNQILASHQELNLHTVQDSF-TCFHYID-RLDRFPTVTFQFDK 373

Query: 435 GKALDLPAKNYLIPVDSAGTFCFAF------APTSSALSIIGNVQQQGTRVSFDLANNRV 488
             +L +  + YL  V    T+CF +          ++L+I+G++      V +D+ N  +
Sbjct: 374 SVSLAVYPQEYLFQV-REDTWCFGWQNGGLQTKGGASLTILGDMALSNKLVVYDIENQVI 432

Query: 489 GFTPNKC 495
           G+T + C
Sbjct: 433 GWTNHNC 439


>gi|356499109|ref|XP_003518386.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 428

 Score =  135 bits (339), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 121/373 (32%), Positives = 165/373 (44%), Gaps = 59/373 (15%)

Query: 163 IGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQC--- 219
           + VG+PP+  +MVLDTGS+++WL C+         +  F+P  SSSY+P PC +  C   
Sbjct: 64  LTVGSPPQNVTMVLDTGSELSWLHCKKLPNL----NSTFNPLLSSSYTPTPCNSSICTTR 119

Query: 220 -KSLDVSA-CRANR--CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGL 275
            + L + A C  N   C   V+Y D S   G L  ET S   +    G   GC  D+ G 
Sbjct: 120 TRDLTIPASCDPNNKLCHVIVSYADASSAEGTLAAETFSLAGAAQ-PGTLFGC-MDSAGY 177

Query: 276 FV------GSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGDAVT- 328
                    + GL+G+  G LSL  Q+     +YC+   D  A GVL      G DA + 
Sbjct: 178 TSDINEDSKTTGLMGMNRGSLSLVTQMSLPKFSYCISGED--ALGVLLLGD--GTDAPSP 233

Query: 329 ---APLIRNKKVDTF-----YYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAI 380
               PL+       +     Y V L G  V  + +Q+P S+F  D  G G  +VD GT  
Sbjct: 234 LQYTPLVTATTSSPYFNRVAYTVQLEGIKVSEKLLQLPKSVFVPDHTGAGQTMVDSGTQF 293

Query: 381 TRLQTQAYNSLRDSFVRLAGNLKPTSGV------------ALFDTCYDFSGLRSVRVPTV 428
           T L    Y+SL+D F      L+ T GV               D CY  +      VP V
Sbjct: 294 TFLLGSVYSSLKDEF------LEQTKGVLTRIEDPNFVFEGAMDLCYH-APASFAAVPAV 346

Query: 429 SLHFGAGKALDLPAKNYLIPVD--SAGTFCFAFAPTSSALSI----IGNVQQQGTRVSFD 482
           +L F +G  + +  +  L  V   S   +CF F   S  L I    IG+  QQ   + FD
Sbjct: 347 TLVF-SGAEMRVSGERLLYRVSKGSDWVYCFTFG-NSDLLGIEAYVIGHHHQQNVWMEFD 404

Query: 483 LANNRVGFTPNKC 495
           L  +RVGFT   C
Sbjct: 405 LLKSRVGFTQTTC 417


>gi|222640709|gb|EEE68841.1| hypothetical protein OsJ_27628 [Oryza sativa Japonica Group]
          Length = 375

 Score =  135 bits (339), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 99/343 (28%), Positives = 159/343 (46%), Gaps = 38/343 (11%)

Query: 169 PRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACR 228
           PR+  +++DTGSD+ W QC+  +     +             PL   AP         C 
Sbjct: 52  PRK--LIVDTGSDLIWTQCKLSSSTAAAA--------RHGSPPLSRTAPARTGAFTRTCT 101

Query: 229 ANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVK-GIALGCGHDNEGLFVGSAGLLGLGG 287
           A+           +  VG L +ET +FG   +V   +  GCG  + G  +G+ G+LGL  
Sbjct: 102 AS-----------AAAVGVLASETFTFGARRAVSLRLGFGCGALSAGSLIGATGILGLSP 150

Query: 288 GMLSLTKQIKATSLAYCLV----DRDSPA--SGVLEFNSARGGDAVTAPLIRNKKVDT-F 340
             LSL  Q+K    +YCL      + SP     + + +  +    +    I +  V+T +
Sbjct: 151 ESLSLITQLKIQRFSYCLTPFADKKTSPLLFGAMADLSRHKTTRPIQTTAIVSNPVETVY 210

Query: 341 YYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAG 400
           YYV L G S+G + + +P +   M   G GG IVD G+ +  L   A+ +++++ + +  
Sbjct: 211 YYVPLVGISLGHKRLAVPAASLAMRPDGGGGTIVDSGSTVAYLVEAAFEAVKEAVMDVVR 270

Query: 401 NLKPTSGVALFDTCYDF------SGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGT 454
                  V  ++ C+        + + +V+VP + LHF  G A+ LP  NY      AG 
Sbjct: 271 LPVANRTVEDYELCFVLPRRTAAAAMEAVQVPPLVLHFDGGAAMVLPRDNYFQE-PRAGL 329

Query: 455 FCFAFAPTS--SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            C A   T+  S +SIIGNVQQQ   V FD+ +++  F P +C
Sbjct: 330 MCLAVGKTTDGSGVSIIGNVQQQNMHVLFDVQHHKFSFAPTQC 372


>gi|356553832|ref|XP_003545255.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 427

 Score =  135 bits (339), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 120/376 (31%), Positives = 164/376 (43%), Gaps = 65/376 (17%)

Query: 163 IGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQC--- 219
           + +G+PP+  +MVLDTGS+++WL C+         +  F+P  SSSY+P PC +  C   
Sbjct: 63  LTIGSPPQNVTMVLDTGSELSWLHCKKLPNL----NSTFNPLLSSSYTPTPCNSSVCMTR 118

Query: 220 -KSLDVSA-CRANR--CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGL 275
            + L + A C  N   C   V+Y D S   G L  ET S   +    G   GC  D+ G 
Sbjct: 119 TRDLTIPASCDPNNKLCHVIVSYADASSAEGTLAAETFSLAGAAQ-PGTLFGC-MDSAGY 176

Query: 276 F------VGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGDAVTA 329
                    + GL+G+  G LSL  Q+     +YC+   D  A GVL       GD  +A
Sbjct: 177 TSDINEDAKTTGLMGMNRGSLSLVTQMVLPKFSYCISGED--AFGVLLL-----GDGPSA 229

Query: 330 P-------LIRNKKVDTF-----YYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCG 377
           P       L+       +     Y V L G  V  + +Q+P S+F  D  G G  +VD G
Sbjct: 230 PSPLQYTPLVTATTSSPYFDRVAYTVQLEGIKVSEKLLQLPKSVFVPDHTGAGQTMVDSG 289

Query: 378 TAITRLQTQAYNSLRDSFVRLAGNLKPTSGV------------ALFDTCYDFSGLRSVRV 425
           T  T L    YNSL+D F      L+ T GV               D CY      +  V
Sbjct: 290 TQFTFLLGPVYNSLKDEF------LEQTKGVLTRIEDPNFVFEGAMDLCYHAPASLAA-V 342

Query: 426 PTVSLHFGAGKALDLPAKNYLIPVDSA--GTFCFAFAPTSSALSI----IGNVQQQGTRV 479
           P V+L F +G  + +  +  L  V       +CF F   S  L I    IG+  QQ   +
Sbjct: 343 PAVTLVF-SGAEMRVSGERLLYRVSKGRDWVYCFTFG-NSDLLGIEAYVIGHHHQQNVWM 400

Query: 480 SFDLANNRVGFTPNKC 495
            FDL  +RVGFT   C
Sbjct: 401 EFDLVKSRVGFTETTC 416


>gi|218189149|gb|EEC71576.1| hypothetical protein OsI_03949 [Oryza sativa Indica Group]
          Length = 504

 Score =  135 bits (339), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 112/373 (30%), Positives = 176/373 (47%), Gaps = 44/373 (11%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYSP 211
           G YF+R+ +G+PP+++ + +DTGSDI W+ C PCT C   S        F+P TSS+ S 
Sbjct: 89  GLYFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSK 148

Query: 212 LPCAAPQCKS---LDVSACRANR---CLYQVAYGDGSFTVGDLVTETVSF----GN---S 258
           +PC+  +C +      + C+ +    C Y   YGDGS T G  V++T+ F    GN   +
Sbjct: 149 IPCSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDSVMGNEQTA 208

Query: 259 GSVKGIALGCGHDNEGLFVGS----AGLLGLGGGMLSLTKQIKATSL-----AYCLVDRD 309
            S   I  GC +   G    +     G+ G G   LS+  Q+ +  +     ++CL   D
Sbjct: 209 NSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCLKGSD 268

Query: 310 SPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGD 369
           +   G+L          V  PL+ ++     Y + L    V GQ + I  SLF       
Sbjct: 269 N-GGGILVLGEIVEPGLVYTPLVPSQP---HYNLNLESIVVNGQKLPIDSSLFTTSNT-- 322

Query: 370 GGIIVDCGTAITRLQTQAYNSLRDSFVR-LAGNLKPT--SGVALFDTCYDFSGLRSVRVP 426
            G IVD GT +  L   AY    D FV  +   + P+  S V+  + C+  S       P
Sbjct: 323 QGTIVDSGTTLAYLADGAY----DPFVNAITAAVSPSVRSLVSKGNQCFVTSSSVDSSFP 378

Query: 427 TVSLHFGAGKALDLPAKNYLI---PVDSAGTFCFAFAPTS-SALSIIGNVQQQGTRVSFD 482
           TVSL+F  G A+ +  +NYL+    +D+   +C  +       ++I+G++  +     +D
Sbjct: 379 TVSLYFMGGVAMTVKPENYLLQQASIDNNVLWCIGWQRNQGQQITILGDLVLKDKIFVYD 438

Query: 483 LANNRVGFTPNKC 495
           LAN R+G+T   C
Sbjct: 439 LANMRMGWTDYDC 451


>gi|225458774|ref|XP_002283258.1| PREDICTED: aspartic proteinase-like protein 2 [Vitis vinifera]
 gi|302142232|emb|CBI19435.3| unnamed protein product [Vitis vinifera]
          Length = 659

 Score =  135 bits (339), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 100/362 (27%), Positives = 169/362 (46%), Gaps = 39/362 (10%)

Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCA 215
           +G Y +R+ +GTPP++F++++DTGS + ++ C  C  C +  DP F P  SS+Y P+ C 
Sbjct: 85  NGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCSDCEHCGKHQDPRFQPDESSTYHPVKCN 144

Query: 216 AP-QCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSV--KGIALGCGHDN 272
               C    V+      C+Y+  Y + S + G L  + +SFGN   V  +    GC +  
Sbjct: 145 MDCNCDHDGVN------CVYERRYAEMSSSSGVLGEDIISFGNQSEVVPQRAVFGCENVE 198

Query: 273 EG-LFVGSA-GLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGDAVT-- 328
            G L+   A G++GLG G LS+  Q+   ++   + D  S   G +      GG A+   
Sbjct: 199 TGDLYSQRADGIMGLGRGQLSIVDQLVDKNV---INDSFSLCYGGMHV----GGGAMVLG 251

Query: 329 ----APLIRNKKVD----TFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAI 380
                P +   + D     +Y + L    V G+ +++ PS F+       G ++D GT  
Sbjct: 252 GIPPPPDMVFSRSDPYRSPYYNIELKEIHVAGKPLKLSPSTFDRKH----GTVLDSGTTY 307

Query: 381 TRLQTQAYNSLRDSFVRLAGNLKPTSG--VALFDTCYDFSGLR----SVRVPTVSLHFGA 434
             L  +A+ + RD+ ++ + NLK   G      D C+  +G      S   P V + F  
Sbjct: 308 AYLPEEAFVAFRDAIIKKSHNLKQIHGPDPNYNDICFSGAGRDVSQLSKAFPEVDMVFSN 367

Query: 435 GKALDLPAKNYLIP-VDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPN 493
           G+ L L  +NYL       G +C        + +++G +  + T V++D  N ++GF   
Sbjct: 368 GQKLSLTPENYLFQHTKVHGAYCLGIFRNGDSTTLLGGIIVRNTLVTYDRENEKIGFWKT 427

Query: 494 KC 495
            C
Sbjct: 428 NC 429


>gi|449456068|ref|XP_004145772.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
 gi|449496218|ref|XP_004160076.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 500

 Score =  134 bits (338), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 116/373 (31%), Positives = 175/373 (46%), Gaps = 45/373 (12%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTEC-----YQQSDPIFDPKTSSSYSP 211
           G Y++R+ +G PP+ F + +DTGSD+ W+ C  C  C      Q     FDP +S++ S 
Sbjct: 81  GLYYTRVQLGNPPKDFYVQIDTGSDVLWVSCNSCNGCPATSGLQIPLNFFDPGSSTTASL 140

Query: 212 LPCAAPQC----KSLDVSAC--RANRCLYQVAYGDGSFTVGDLVTETVSF-------GNS 258
           + C+   C    +S D SAC  ++N+C Y   YGDGS T G  V + +           S
Sbjct: 141 VSCSDQICALGVQSSD-SACFGQSNQCAYVFQYGDGSGTSGYYVMDMIHLDVVIDSSVTS 199

Query: 259 GSVKGIALGCGHDNEGLFVGS----AGLLGLGGGMLSLTKQIKATSLA-----YCLVDRD 309
            S   +  GC     G    S     G+ G G   LS+  Q+ +  +A     +CL   D
Sbjct: 200 NSSASVVFGCSTSQTGDLTKSDRAVDGIFGFGQQDLSVISQLSSRGIAPKVFSHCLKGDD 259

Query: 310 SPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGD 369
           S   G+L        + V  PL+ ++     Y + L   SV GQ + I P++F    +  
Sbjct: 260 S-GGGILVLGEIVEPNVVYTPLVPSQP---HYNLNLQSISVNGQVLPISPAVFATSSS-- 313

Query: 370 GGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNL--KPTSGVALF-DTCYDFSGLRSVRVP 426
            G I+D GT +  L  +AYN    +FV    N+  + T  V L  + CY  S   S   P
Sbjct: 314 QGTIIDSGTTLAYLAEEAYN----AFVVAVTNIVSQSTQSVVLKGNRCYVTSSSVSDIFP 369

Query: 427 TVSLHFGAGKALDLPAKNYLIPVDSAG---TFCFAFAPT-SSALSIIGNVQQQGTRVSFD 482
            VSL+F  G +L L A++YLI  +S G    +C  F       ++I+G++  +     +D
Sbjct: 370 QVSLNFAGGASLVLGAQDYLIQQNSVGGTTVWCIGFQKIPGQGITILGDLVLKDKIFIYD 429

Query: 483 LANNRVGFTPNKC 495
           LAN R+G+T   C
Sbjct: 430 LANQRIGWTNYDC 442


>gi|18400416|ref|NP_565559.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|20197296|gb|AAM15014.1| predicted protein [Arabidopsis thaliana]
 gi|330252412|gb|AEC07506.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 458

 Score =  134 bits (338), Expect = 8e-29,   Method: Compositional matrix adjust.
 Identities = 100/351 (28%), Positives = 155/351 (44%), Gaps = 26/351 (7%)

Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECY--QQSDPIFDPKTSSSYSPLPCAA 216
           +     VG PP     ++DTGS + W+QC+PC  C       P+F+P  SS++    C  
Sbjct: 96  FLVNFSVGQPPVPQLTIMDTGSSLLWIQCQPCKHCSSDHMIHPVFNPALSSTFVECSCDD 155

Query: 217 PQCKSLDVSAC-RANRCLYQVAYGDGSFTVGDLVTETVSF----GNSGSVKGIALGCGHD 271
             C+      C  +N+C+Y+  Y  G+ + G L  E ++F    GN+   + IA GCG++
Sbjct: 156 RFCRYAPNGHCGSSNKCVYEQVYISGTGSKGVLAKERLTFTTPNGNTVVTQPIAFGCGYE 215

Query: 272 N-EGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGDAVTAP 330
           N E L     G+LGLG    SL  Q+  +  +YC+ D  +   G  +       D +  P
Sbjct: 216 NGEQLESHFTGILGLGAKPTSLAVQL-GSKFSYCIGDLANKNYGYNQLVLGEDADILGDP 274

Query: 331 L-IRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYN 389
             I  +  ++ YY+ L G SVG   + I P +F+       G+I+D GT  T L   AY 
Sbjct: 275 TPIEFETENSIYYMNLEGISVGDTQLNIEPVVFKR-RGPRTGVILDSGTLYTWLADIAY- 332

Query: 390 SLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRS---VRVPTVSLHFGAGKALDLPAKNYL 446
             R+ +  +   L P      F     + G  S   +  P V+ HF  G  L + A +  
Sbjct: 333 --RELYNEIKSILDPKLERFWFRDFLCYHGRVSEELIGFPVVTFHFAGGAELAMEATSMF 390

Query: 447 IPVDSAGT---FCFAFAPTS------SALSIIGNVQQQGTRVSFDLANNRV 488
            P+    T   FC +  PT          + IG + QQ   + +DL    +
Sbjct: 391 YPLSEPNTFNVFCMSVKPTKEHGGEYKEFTAIGLMAQQYYNIGYDLKEKNI 441


>gi|223973231|gb|ACN30803.1| unknown [Zea mays]
          Length = 459

 Score =  134 bits (337), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 123/421 (29%), Positives = 179/421 (42%), Gaps = 71/421 (16%)

Query: 130 HELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWL---- 185
           H   PA A + P  +            G Y     +GTPP+   ++LDTGS + W+    
Sbjct: 50  HPSVPATAALYPHSY------------GGYAFTASLGTPPQPLPVLLDTGSHLTWVPCTS 97

Query: 186 --QCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANRCL---------- 233
             +CR C+     + P+F PK SSS   + C  P C+ +  +A  A +C           
Sbjct: 98  SYECRNCSSPSASAVPVFHPKNSSSSRLVGCRNPSCQWVHSAANLATKCRRAPCSPGAAN 157

Query: 234 -----------YQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGL 282
                      Y V YG GS T G L+ +T+      +V G  LGC   +  +    +GL
Sbjct: 158 CPAAASNVCPPYAVVYGSGS-TAGLLIADTLR-APGRAVPGFVLGCSLVS--VHQPPSGL 213

Query: 283 LGLGGGMLSLTKQIKATSLAYCLVDR----DSPASGVLEFNSARGGDAVT-APLIRNKKV 337
            G G G  S+  Q+     +YCL+ R    ++  SG L      GG+ +   PL+++   
Sbjct: 214 AGFGRGAPSVPAQLGLPKFSYCLLSRRFDDNAAVSGSLVLGGTGGGEGMQYVPLVKSAAG 273

Query: 338 D-----TFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLR 392
           D      +YY+ L G +VGG+AV++P   F  + AG GG IVD GT  T L    +  + 
Sbjct: 274 DKLPYGVYYYLALRGVTVGGKAVRLPARAFAANAAGSGGTIVDSGTTFTYLDPTVFQPVA 333

Query: 393 DSFVRLAGNLKPTSGVAL----FDTCYDF-SGLRSVRVPTVSLHFGAGKALDLPAKNYLI 447
           D+ V   G     S  A        C+    G RS+ +P +S HF  G  + LP +NY +
Sbjct: 334 DAVVAAVGGRYKRSKDAEDELGLHPCFALPQGARSMALPELSFHFEGGAVMQLPVENYFV 393

Query: 448 PVDSAGT--FCFAFAPTSSALS-----------IIGNVQQQGTRVSFDLANNRVGFTPNK 494
                     C A     S  S           I+G+ QQQ   V +DL   R+GF    
Sbjct: 394 VAGRGAVEAICLAVVTDFSGGSGAGNEGSGPAIILGSFQQQNYLVEYDLEKERLGFRRQS 453

Query: 495 C 495
           C
Sbjct: 454 C 454


>gi|326511104|dbj|BAK03251.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 631

 Score =  134 bits (337), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 104/359 (28%), Positives = 165/359 (45%), Gaps = 32/359 (8%)

Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCA 215
           +G Y +R+ +GTPP++F++++D+GS + ++ C  C +C    DP F P  SS+YSP+ C 
Sbjct: 85  NGYYTTRLHIGTPPQEFALIVDSGSTVTYVPCASCEQCGNHQDPRFQPDLSSTYSPVKCN 144

Query: 216 AP-QCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVK--GIALGCGHDN 272
               C S        N+C Y+  Y + S + G L  + VSFG    +K      GC +  
Sbjct: 145 VDCTCDS------DKNQCTYERQYAEMSSSSGVLGEDIVSFGTESELKPQRAVFGCENSE 198

Query: 273 EG-LFVGSA-GLLGLGGGMLSLTKQ-----IKATSLAYCLVDRD-SPASGVLEFNSARGG 324
            G LF   A G++GLG G LS+  Q     +   S + C    D    + VL    A  G
Sbjct: 199 TGDLFSQHADGIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGGMDIGGGAMVLGAMPAPPG 258

Query: 325 DAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQ 384
              T     N     +Y + L    V G+A+++ P +F+    G  G ++D GT    L 
Sbjct: 259 MIYTH---SNAVRSPYYNIELKEMHVAGKALRVDPRIFD----GKHGTVLDSGTTYAYLP 311

Query: 385 TQAYNSLRDSFVRLAGNLKPTSG--VALFDTCYDFSGLRSVRV----PTVSLHFGAGKAL 438
            QA+ + +D+       LK   G      D C+  +G    ++    P V + FG G+ L
Sbjct: 312 EQAFVAFKDAVSSQVHPLKKIRGPDSNYKDICFAGAGRNVSQLSEVFPKVDMVFGNGQKL 371

Query: 439 DLPAKNYLIPVDSA-GTFCFA-FAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            L  +NYL       G +C   F       +++G +  + T V++D  N ++GF    C
Sbjct: 372 SLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDRHNEKIGFWKTNC 430


>gi|358345197|ref|XP_003636668.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355502603|gb|AES83806.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 294

 Score =  134 bits (337), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 77/209 (36%), Positives = 113/209 (54%), Gaps = 16/209 (7%)

Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAP 217
           +Y   + +GTPP +     DTGSD+ WLQC PCT CY+Q +P+FD ++SS++S + C + 
Sbjct: 58  DYLMELSIGTPPVKIYAQADTGSDLIWLQCIPCTNCYKQLNPMFDSQSSSTFSNIACGSE 117

Query: 218 QCKSLDVSACRANR--CLYQVAYGDGSFTVGDLVTETVSF----GNSGSVKGIALGCGHD 271
            C  L  ++C  ++  C Y  +Y DGS T G L  ET++     G   + KG+  GCGH+
Sbjct: 118 SCSKLYSTSCSPDQINCKYNYSYVDGSETQGVLAQETLTLTSTTGEPVAFKGVIFGCGHN 177

Query: 272 NEGLFV-GSAGLLGLGGGMLSLTKQIKAT----SLAYCLV--DRDSPASGVLEFNSAR-- 322
           N G F     G++GLG G LSL  QI ++      + CLV  + +   S  + F      
Sbjct: 178 NNGAFNDKEMGIIGLGRGPLSLVSQIGSSLGGNMFSQCLVPFNTNPSISSPMSFGKGSEV 237

Query: 323 -GGDAVTAPLIRNKKVDTFYYVGLTGFSV 350
            G   V+ PL+      +FY+V L G SV
Sbjct: 238 LGNGVVSTPLVSKTTYQSFYFVTLLGISV 266


>gi|326501422|dbj|BAK02500.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 631

 Score =  134 bits (336), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 104/359 (28%), Positives = 165/359 (45%), Gaps = 32/359 (8%)

Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCA 215
           +G Y +R+ +GTPP++F++++D+GS + ++ C  C +C    DP F P  SS+YSP+ C 
Sbjct: 85  NGYYTTRLHIGTPPQEFALIVDSGSTVTYVPCASCEQCGNHQDPRFQPDLSSTYSPVKCN 144

Query: 216 AP-QCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVK--GIALGCGHDN 272
               C S        N+C Y+  Y + S + G L  + VSFG    +K      GC +  
Sbjct: 145 VDCTCDS------DKNQCTYERQYAEMSSSSGVLGEDIVSFGTESELKPQRAVFGCENSE 198

Query: 273 EG-LFVGSA-GLLGLGGGMLSLTKQ-----IKATSLAYCLVDRD-SPASGVLEFNSARGG 324
            G LF   A G++GLG G LS+  Q     +   S + C    D    + VL    A  G
Sbjct: 199 TGDLFSQHADGIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGGMDIGGGAMVLGAMPAPPG 258

Query: 325 DAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQ 384
              T     N     +Y + L    V G+A+++ P +F+    G  G ++D GT    L 
Sbjct: 259 MIYTH---SNAVRSPYYNIELKEMHVAGKALRVDPRIFD----GKHGTVLDSGTTYAYLP 311

Query: 385 TQAYNSLRDSFVRLAGNLKPTSG--VALFDTCYDFSGLRSVRV----PTVSLHFGAGKAL 438
            QA+ + +D+       LK   G      D C+  +G    ++    P V + FG G+ L
Sbjct: 312 EQAFVAFKDAVSSQVHPLKKIRGPDPNYKDICFAGAGRNVSQLSEVFPKVDMVFGNGQKL 371

Query: 439 DLPAKNYLIPVDSA-GTFCFA-FAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            L  +NYL       G +C   F       +++G +  + T V++D  N ++GF    C
Sbjct: 372 SLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDRHNEKIGFWKTNC 430


>gi|15232960|ref|NP_186923.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|6728988|gb|AAF26986.1|AC018363_31 putative aspartyl protease [Arabidopsis thaliana]
 gi|21593593|gb|AAM65560.1| putative aspartyl protease [Arabidopsis thaliana]
 gi|332640332|gb|AEE73853.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 488

 Score =  134 bits (336), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 106/367 (28%), Positives = 167/367 (45%), Gaps = 38/367 (10%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPI----FDPKTSSSYSPL 212
           G YF++IG+GTP R F + +DTGSDI W+ C  C  C ++SD +    +D   SS+   +
Sbjct: 83  GLYFAKIGLGTPSRDFHVQVDTGSDILWVNCAGCIRCPRKSDLVELTPYDVDASSTAKSV 142

Query: 213 PCAAPQCKSLDV-SACRA-NRCLYQVAYGDGSFTVGDLVTETVSF----GN--SGSVKG- 263
            C+   C  ++  S C + + C Y + YGDGS T G LV + V      GN  +GS  G 
Sbjct: 143 SCSDNFCSYVNQRSECHSGSTCQYVIMYGDGSSTNGYLVKDVVHLDLVTGNRQTGSTNGT 202

Query: 264 IALGCGHDNEGLFVGSA----GLLGLGGGMLSLTKQIKAT-----SLAYCLVDRDSPASG 314
           I  GCG    G    S     G++G G    S   Q+ +      S A+CL + +    G
Sbjct: 203 IIFGCGSKQSGQLGESQAAVDGIMGFGQSNSSFISQLASQGKVKRSFAHCLDNNN--GGG 260

Query: 315 VLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIV 374
           +            T P++        Y V L    VG   +++  + F  D   D G+I+
Sbjct: 261 IFAIGEVVSPKVKTTPMLSKS---AHYSVNLNAIEVGNSVLELSSNAF--DSGDDKGVII 315

Query: 375 DCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGA 434
           D GT +  L    YN L +  +     L   +    F TC+ ++  +  R PTV+  F  
Sbjct: 316 DSGTTLVYLPDAVYNPLLNEILASHPELTLHTVQESF-TCFHYTD-KLDRFPTVTFQFDK 373

Query: 435 GKALDLPAKNYLIPVDSAGTFCFAF------APTSSALSIIGNVQQQGTRVSFDLANNRV 488
             +L +  + YL  V    T+CF +          ++L+I+G++      V +D+ N  +
Sbjct: 374 SVSLAVYPREYLFQV-REDTWCFGWQNGGLQTKGGASLTILGDMALSNKLVVYDIENQVI 432

Query: 489 GFTPNKC 495
           G+T + C
Sbjct: 433 GWTNHNC 439


>gi|357482719|ref|XP_003611646.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355512981|gb|AES94604.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 640

 Score =  134 bits (336), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 97/359 (27%), Positives = 168/359 (46%), Gaps = 32/359 (8%)

Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCA 215
           +G Y +R+ +GTPP++F++++DTGS + ++ C  C  C +  DP F P  S +Y P+ C 
Sbjct: 86  NGYYTTRLWIGTPPQRFALIVDTGSTVTYVPCSTCEHCGRHQDPKFQPDLSETYQPVKC- 144

Query: 216 APQCKSLDVSACRA--NRCLYQVAYGDGSFTVGDLVTETVSFGNSGSV--KGIALGCGHD 271
            P C       C    N+C+Y   Y + S + G L  + VSFGN   +  +    GC +D
Sbjct: 145 TPDCN------CDGDTNQCMYDRQYAEMSSSSGVLGEDVVSFGNLSELAPQRAVFGCEND 198

Query: 272 NEG-LFVGSA-GLLGLGGGMLSLT-----KQIKATSLAYCLVDRDSPASGVLEFNSARGG 324
             G L+   A G++GLG G LS+      K++ + S + C    D     ++    +   
Sbjct: 199 ETGDLYSQRADGIMGLGRGDLSIMDQLVDKKVISDSFSLCYGGMDVGGGAMILGGISPPE 258

Query: 325 DAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQ 384
           D V      +     +Y + L    V G+ +Q+ P +F+    G  G ++D GT    L 
Sbjct: 259 DMVFTH--SDPDRSPYYNINLKEMHVAGKKLQLNPKVFD----GKHGTVLDSGTTYAYLP 312

Query: 385 TQAYNSLRDSFVRLAGNLKPTSG--VALFDTCYDFSGLRSVRV----PTVSLHFGAGKAL 438
             A+ + + + ++   +LK  +G      D C+  +G+   ++    P V + F  G  L
Sbjct: 313 ETAFLAFKRAIMKERNSLKQINGPDPNYKDICFTGAGIDVSQLAKSFPVVDMVFENGHKL 372

Query: 439 DLPAKNYLIPVDSA-GTFCF-AFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            L  +NYL       G +C   F+      +++G +  + T V +D  N+++GF    C
Sbjct: 373 SLSPENYLFRHSKVRGAYCLGVFSNGRDPTTLLGGIFVRNTLVMYDRENSKIGFWKTNC 431


>gi|222619345|gb|EEE55477.1| hypothetical protein OsJ_03658 [Oryza sativa Japonica Group]
          Length = 530

 Score =  133 bits (335), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 111/371 (29%), Positives = 175/371 (47%), Gaps = 44/371 (11%)

Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYSPLP 213
           YF+R+ +G+PP+++ + +DTGSDI W+ C PCT C   S        F+P TSS+ S +P
Sbjct: 117 YFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSKIP 176

Query: 214 CAAPQCKS---LDVSACRANR---CLYQVAYGDGSFTVGDLVTETVSF----GN---SGS 260
           C+  +C +      + C+ +    C Y   YGDGS T G  V++T+ F    GN   + S
Sbjct: 177 CSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDTVMGNEQTANS 236

Query: 261 VKGIALGCGHDNEGLFVGS----AGLLGLGGGMLSLTKQIKATSL-----AYCLVDRDSP 311
              I  GC +   G    +     G+ G G   LS+  Q+ +  +     ++CL   D+ 
Sbjct: 237 SASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCLKGSDN- 295

Query: 312 ASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGG 371
             G+L          V  PL+ ++     Y + L    V GQ + I  SLF        G
Sbjct: 296 GGGILVLGEIVEPGLVYTPLVPSQP---HYNLNLESIVVNGQKLPIDSSLFTTSNT--QG 350

Query: 372 IIVDCGTAITRLQTQAYNSLRDSFVR-LAGNLKPT--SGVALFDTCYDFSGLRSVRVPTV 428
            IVD GT +  L   AY    D FV  +   + P+  S V+  + C+  S       PTV
Sbjct: 351 TIVDSGTTLAYLADGAY----DPFVNAITAAVSPSVRSLVSKGNQCFVTSSSVDSSFPTV 406

Query: 429 SLHFGAGKALDLPAKNYLI---PVDSAGTFCFAFAPTS-SALSIIGNVQQQGTRVSFDLA 484
           SL+F  G A+ +  +NYL+    +D+   +C  +       ++I+G++  +     +DLA
Sbjct: 407 SLYFMGGVAMTVKPENYLLQQASIDNNVLWCIGWQRNQGQQITILGDLVLKDKIFVYDLA 466

Query: 485 NNRVGFTPNKC 495
           N R+G+T   C
Sbjct: 467 NMRMGWTDYDC 477


>gi|225465839|ref|XP_002264668.1| PREDICTED: aspartic proteinase nepenthesin-1-like isoform 2 [Vitis
           vinifera]
          Length = 451

 Score =  133 bits (335), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 107/360 (29%), Positives = 163/360 (45%), Gaps = 33/360 (9%)

Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQ 218
           Y  R  +GTP +   M +DT SD+ W+ C  C  C   S  +F+   S++Y  L C A Q
Sbjct: 101 YIVRAKIGTPAQTMLMAMDTSSDVAWIPCNGCLGC---SSTLFNSPASTTYKSLGCQAAQ 157

Query: 219 CKSL--------------DVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGI 264
           CK +                  C    C + + YG GS    +L  +T++   + +V G 
Sbjct: 158 CKQVLHLLSPLLTSPSVVPKPTCGGGVCSFNLTYG-GSSLAANLSQDTITLA-TDAVPGY 215

Query: 265 ALGCGHDNEGLFVGS---AGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPA-SGVLEFNS 320
           + GC     G  + +    GL      +LS T+ +  ++ +YCL    S   SG L    
Sbjct: 216 SFGCIQKATGGSLPAQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPSFKSLNFSGSLRLGP 275

Query: 321 ARGGDAVT-APLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTA 379
                 +   PL++N +  + Y+V L    VG + V +PP  F  + +   G I D GT 
Sbjct: 276 VGQPKRIKYTPLLKNPRRPSLYFVNLMAVRVGRRVVDVPPGSFTFNPSTGAGTIFDSGTV 335

Query: 380 ITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALD 439
            TRL T AY ++RD+F    G     + +  FDTCY       +  PT++  F  G  + 
Sbjct: 336 FTRLVTPAYIAVRDAFRNRVGRNLTVTSLGGFDTCYTV----PIAAPTITFMF-TGMNVT 390

Query: 440 LPAKNYLIPVDSAGTFCFAFAP----TSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           LP  N LI   +  T C A A      +S L++I N+QQQ  R+ +D+ N+R+G     C
Sbjct: 391 LPPDNLLIHSTAGSTTCLAMAAAPDNVNSVLNVIANLQQQNHRLLYDVPNSRLGVARELC 450


>gi|125532796|gb|EAY79361.1| hypothetical protein OsI_34489 [Oryza sativa Indica Group]
          Length = 405

 Score =  133 bits (335), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 102/358 (28%), Positives = 160/358 (44%), Gaps = 30/358 (8%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAA 216
           G Y +   +GTPP+  S V+D   ++ W QC PC  C++Q  P+FDP  SS++  LPC +
Sbjct: 55  GLYVANFTIGTPPQPVSAVVDLTGELVWTQCTPCQPCFEQDLPLFDPTKSSTFRGLPCGS 114

Query: 217 PQCKSLDVSA--CRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEG 274
             C+S+  S+  C ++ C+Y+     G  T G   T+T + G +    G       D   
Sbjct: 115 HLCESIPESSRNCTSDVCIYEAPTKAGD-TGGMAGTDTFAIGAAKETLGFGCVVMTDKRL 173

Query: 275 LFVGS-AGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPA--SGVLEFNSARGGDAVTAPL 331
             +G  +G++GLG    SL  Q+  T+ +YCL  + S A   G      A G ++ T  +
Sbjct: 174 KTIGGPSGIVGLGRTPWSLVTQMNVTAFSYCLAGKSSGALFLGATAKQLAGGKNSSTPFV 233

Query: 332 IR------NKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQT 385
           I+      +   + +Y V L G   GG  +Q   S           +++D  +  + L  
Sbjct: 234 IKTSAGSSDNGSNPYYMVKLAGIKAGGAPLQAASS-------SGSTVLLDTVSRASYLAD 286

Query: 386 QAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNY 445
            AY +L+ +     G     S    +D C  FS   +   P +   F  G AL +P  NY
Sbjct: 287 GAYKALKKALTAAVGVQPVASPPKPYDLC--FSKAVAGDAPELVFTFDGGAALTVPPANY 344

Query: 446 LIPVDSAGTFCFAFAPTSS--------ALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           L+     GT C     ++S          SI+G++QQ+   V FDL    + F P  C
Sbjct: 345 LL-ASGNGTVCLTIGSSASLNLTGELEGASILGSLQQENVHVLFDLKEETLSFKPADC 401


>gi|255547548|ref|XP_002514831.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
 gi|223545882|gb|EEF47385.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
          Length = 488

 Score =  133 bits (335), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 102/371 (27%), Positives = 166/371 (44%), Gaps = 40/371 (10%)

Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYS 210
           +G YF++IG+G PP+ + + +DTGSDI W+ C  C +C  +SD      ++DP++S+S +
Sbjct: 79  AGLYFAKIGLGNPPKDYYVQVDTGSDILWVNCANCDKCPTKSDLGVKLTLYDPQSSTSAT 138

Query: 211 PLPCAAPQCKSLD---VSACRANR-CLYQVAYGDGSFTVGDLVTETVSF----GN--SGS 260
            + C    C +     +  C  +  C Y V YGDGS T G  V + + F    GN  + S
Sbjct: 139 RIYCDDDFCAATYNGVLQGCTKDLPCQYSVVYGDGSSTAGFFVKDNLQFDRVTGNLQTSS 198

Query: 261 VKG-IALGCGHDNEGLFVGSA----GLLGLGGGMLSLTKQIKATS-----LAYCLVDRDS 310
             G +  GCG    G    S+    G+LG G    S+  Q+ A        A+CL   + 
Sbjct: 199 ANGSVIFGCGAKQSGELGTSSEALDGILGFGQANSSMISQLAAAGKVKRVFAHCL--DNV 256

Query: 311 PASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDG 370
              G+            T P++ N+     Y V +    VGG  +++P  +F  D     
Sbjct: 257 KGGGIFAIGEVVSPKVNTTPMVPNQP---HYNVVMKEIEVGGNVLELPTDIF--DTGDRR 311

Query: 371 GIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSL 430
           G I+D GT +  L    Y S+    V     LK  +    F TC+ ++G  +   P V  
Sbjct: 312 GTIIDSGTTLAYLPEVVYESMMTKIVSEQPGLKLHTVEEQF-TCFQYTGNVNEGFPVVKF 370

Query: 431 HFGAGKALDLPAKNYLIPVDSAGTFCFAF------APTSSALSIIGNVQQQGTRVSFDLA 484
           HF    +L +   +YL  +     +CF +      +     ++++G++      V +DL 
Sbjct: 371 HFNGSLSLTVNPHDYLFQIHEE-VWCFGWQNSGMQSKDGRDMTLLGDLVLSNKLVLYDLE 429

Query: 485 NNRVGFTPNKC 495
           N  +G+T   C
Sbjct: 430 NQAIGWTDYNC 440


>gi|356498711|ref|XP_003518193.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 466

 Score =  133 bits (334), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 127/422 (30%), Positives = 175/422 (41%), Gaps = 56/422 (13%)

Query: 119 KLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDT 178
           KL ++      H LK  +     +   TPV     +  G Y   +  GTP + F  VLDT
Sbjct: 52  KLAVSTSITRAHHLKNHKPN---KSLETPV---HPKTYGGYSIDLEFGTPSQTFPFVLDT 105

Query: 179 GSDINWLQCRP---CTECYQQSD-PIFDPKTSSSYSPLPCAAPQCKSL---DVSA--CRA 229
           GS + WL C     C++C   S+ P F PK SSS   + C  P+C  +   DV +  CR 
Sbjct: 106 GSTLVWLPCSSHYLCSKCNSFSNTPKFIPKNSSSSKFVGCTNPKCAWVFGPDVKSHCCRQ 165

Query: 230 -----NRC-----LYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGS 279
                N C      Y V YG GS T G L++E ++F  +       LGC   +       
Sbjct: 166 DKAAFNNCSQTCPAYTVQYGLGS-TAGFLLSENLNFP-TKKYSDFLLGCSVVS---VYQP 220

Query: 280 AGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASG------VLEFNSARGGD---AVTAP 330
           AG+ G G G  SL  Q+  T  +YCL+      S       VLE  S+R G        P
Sbjct: 221 AGIAGFGRGEESLPSQMNLTRFSYCLLSHQFDDSATITSNLVLETASSRDGKTNGVSYTP 280

Query: 331 LIRN---KKVDTF---YYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQ 384
            ++N   KK   F   YY+ L    VG + V++P  L E +  GDGG IVD G+  T ++
Sbjct: 281 FLKNPTTKKNPAFGAYYYITLKRIVVGEKRVRVPRRLLEPNVDGDGGFIVDSGSTFTFME 340

Query: 385 TQAYNSLRDSFVRLAGNLKPTSGVALF--DTCYDFS-GLRSVRVPTVSLHFGAGKALDLP 441
              ++ +   F +     +       F    C+  + G  +   P +   F  G  + LP
Sbjct: 341 RPIFDLVAQEFAKQVSYTRAREAEKQFGLSPCFVLAGGAETASFPELRFEFRGGAKMRLP 400

Query: 442 AKNYLIPVDSAGTFCFAFAPTSSALS--------IIGNVQQQGTRVSFDLANNRVGFTPN 493
             NY   V      C        A S        I+GN QQQ   V +DL N R GF   
Sbjct: 401 VANYFSLVGKGDVACLTIVSDDVAGSGGTVGPAVILGNYQQQNFYVEYDLENERFGFRSQ 460

Query: 494 KC 495
            C
Sbjct: 461 SC 462


>gi|356505293|ref|XP_003521426.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 499

 Score =  133 bits (334), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 107/370 (28%), Positives = 167/370 (45%), Gaps = 39/370 (10%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYSP 211
           G YF+++ +G+P + F + +DTGSDI W+ C  C+ C   S        FD   SS+ + 
Sbjct: 81  GLYFTKVKLGSPAKDFYVQIDTGSDILWINCITCSNCPHSSGLGIELDFFDTAGSSTAAL 140

Query: 212 LPCAAPQCK---SLDVSAC--RANRCLYQVAYGDGSFTVGDLVTETVSFGN--------S 258
           + CA P C        S C  +AN+C Y   YGDGS T G  V++T+ F          +
Sbjct: 141 VSCADPICSYAVQTATSGCSSQANQCSYTFQYGDGSGTTGYYVSDTMYFDTVLLGQSMVA 200

Query: 259 GSVKGIALGCGHDNEGLFVGS----AGLLGLGGGMLSLTKQIKATSL-----AYCLVDRD 309
            S   I  GC     G    +     G+ G G G LS+  Q+ +  +     ++CL   +
Sbjct: 201 NSSSTIVFGCSTYQSGDLTKTDKAVDGIFGFGPGALSVISQLSSRGVTPKVFSHCLKGGE 260

Query: 310 SPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGD 369
           +   GVL          V +PL+ +      Y + L   +V GQ + I  ++F      +
Sbjct: 261 N-GGGVLVLGEILEPSIVYSPLVPSLP---HYNLNLQSIAVNGQLLPIDSNVFA--TTNN 314

Query: 370 GGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNL-KPTSGVALFDTCYDFSGLRSVRVPTV 428
            G IVD GT +  L  +AYN   D+         KP   ++  + CY  S       P V
Sbjct: 315 QGTIVDSGTTLAYLVQEAYNPFVDAITAAVSQFSKPI--ISKGNQCYLVSNSVGDIFPQV 372

Query: 429 SLHFGAGKALDLPAKNYLIP---VDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLAN 485
           SL+F  G ++ L  ++YL+    +DSA  +C  F       +I+G++  +     +DLAN
Sbjct: 373 SLNFMGGASMVLNPEHYLMHYGFLDSAAMWCIGFQKVERGFTILGDLVLKDKIFVYDLAN 432

Query: 486 NRVGFTPNKC 495
            R+G+    C
Sbjct: 433 QRIGWADYNC 442


>gi|302769978|ref|XP_002968408.1| hypothetical protein SELMODRAFT_89951 [Selaginella moellendorffii]
 gi|300164052|gb|EFJ30662.1| hypothetical protein SELMODRAFT_89951 [Selaginella moellendorffii]
          Length = 492

 Score =  133 bits (334), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 103/357 (28%), Positives = 163/357 (45%), Gaps = 31/357 (8%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAA 216
           G Y SR+ +GTPP +FS+++DTGS + ++ C  CT C    DP F P  SSSY PL C +
Sbjct: 33  GYYTSRVKIGTPPHEFSLIVDTGSTVTYVPCSSCTHCGNHQDPRFSPALSSSYKPLECGS 92

Query: 217 PQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKG--IALGCGHDNEG 274
            +C +     C  +R  YQ  Y + S + G L  + + F NS  + G  +  GC     G
Sbjct: 93  -ECST---GFCDGSR-KYQRQYAEKSTSSGVLGKDVIGFSNSSDLGGQRLVFGCETAETG 147

Query: 275 -LFVGSA-GLLGLGGGMLSLTKQI-------KATSLAYCLVDRDSPASGVLEFNSARGGD 325
            L+  +A G++GLG G LS+  Q+          SL Y  +D    A  +  F   +  D
Sbjct: 148 DLYDQTADGIIGLGRGPLSIIDQLVEKNAMEDVFSLCYGGMDEGGGAMILGGFQPPK--D 205

Query: 326 AVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQT 385
            V      +     +Y + L G  VGG  +++ P +F+    G  G ++D GT       
Sbjct: 206 MVFT--ASDPHRSPYYNLMLKGIRVGGSPLRLKPEVFD----GKYGTVLDSGTTYAYFPG 259

Query: 386 QAYNSLRDSFVRLAGNLKPTSG--VALFDTCYDFSGLR----SVRVPTVSLHFGAGKALD 439
            A+ + + +     G+LK   G      D CY  +G      S   P+V   FG G+++ 
Sbjct: 260 AAFQAFKSAVKEQVGSLKEVPGPDEKFKDICYAGAGTNVSNLSQFFPSVDFVFGDGQSVT 319

Query: 440 LPAKNYLI-PVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           L  +NYL      +G +C          +++G +  +   V+++     +GF   KC
Sbjct: 320 LSPENYLFRHTKISGAYCLGVFENGDPTTLLGGIIVRNMLVTYNRGKASIGFLKTKC 376


>gi|255587337|ref|XP_002534234.1| pepsin A, putative [Ricinus communis]
 gi|223525662|gb|EEF28148.1| pepsin A, putative [Ricinus communis]
          Length = 468

 Score =  133 bits (334), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 122/429 (28%), Positives = 177/429 (41%), Gaps = 65/429 (15%)

Query: 121 QLAIYNVDR-HELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTG 179
            LA  ++ R H LK  +      +FS       S+  G Y   + +GTP +   +++DTG
Sbjct: 50  HLATTSISRAHHLKSPKT-----NFSLIKTPLFSRSYGGYSMSLSLGTPSQTVKLIMDTG 104

Query: 180 SDINWLQCRP---CTEC-YQQSD----PIFDPKTSSSYSPLPCAAPQCK----SLDVSAC 227
           S + W  C     C  C +  +D    P F P+ SSS   + C  P+C     S   S C
Sbjct: 105 SSLVWFPCTSRYVCASCNFPNTDITKIPKFMPRLSSSSKLIGCKNPKCAWVFGSSVQSKC 164

Query: 228 -----RANRCL-----YQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFV 277
                +A  C      Y + YG GS T G L++ET++F N  ++     GC   +     
Sbjct: 165 HNCNPQAQNCTQACPPYIIQYGLGS-TAGLLLSETINFPNK-TISDFLAGCSLLSTR--- 219

Query: 278 GSAGLLGLGGGMLSLTKQIKATSLAYCLVDR---DSPASG--VLEFNSARGGDAVTA--- 329
              G+ G G    SL  Q+     +YCLV R   DSP S   +L+   +      T    
Sbjct: 220 QPEGIAGFGRSQESLPLQLGLKKFSYCLVSRRFDDSPVSSDLILDMGPSTSDSKTTGLSY 279

Query: 330 -PLIRN------KKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITR 382
            P  +N           +YYV L    VG   V++P S       G+GG IVD G+  T 
Sbjct: 280 TPFQKNLASQSNPAFQEYYYVMLRKIIVGKTHVKVPYSFLVPGSDGNGGTIVDSGSTFTF 339

Query: 383 LQTQAYNSLRDSFVRLAGNLKPTSGVALFD---TCYDFSGLRSVRVPTVSLHFGAGKALD 439
           ++   +  L   F +   N    + V        C+D SG +SV +P ++  F  G  + 
Sbjct: 340 VEGHVFELLAKEFEKQMANYTVATNVQKLTGLRPCFDISGEKSVVIPDLTFQFKGGAKMQ 399

Query: 440 LPAKNYLIPVDSAGTFCFAFAPTSSA-------------LSIIGNVQQQGTRVSFDLANN 486
           LP  NY   VD  G  C      ++A               I+GN QQQ   + +DL N+
Sbjct: 400 LPLSNYFAFVD-MGVVCLTIVSDNAAALGGDGGVRSSGPAIILGNFQQQNFYIEYDLEND 458

Query: 487 RVGFTPNKC 495
           R GF    C
Sbjct: 459 RFGFKEQSC 467


>gi|357122155|ref|XP_003562781.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
           distachyon]
          Length = 629

 Score =  132 bits (333), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 103/358 (28%), Positives = 166/358 (46%), Gaps = 30/358 (8%)

Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCA 215
           +G Y +R+ +GTPP++F++++D+GS + ++ C  C +C    DP F P  SS+YSP+ C+
Sbjct: 82  NGYYTTRLYIGTPPQEFALIVDSGSTVTYVPCASCEQCGNHQDPRFQPDLSSTYSPVKCS 141

Query: 216 APQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVK--GIALGCGHDNE 273
           A      D S     +C Y+  Y + S + G L  + VSFG    +K      GC +   
Sbjct: 142 ADCTCDSDKS-----QCTYERQYAEMSSSSGVLGEDIVSFGTESELKPQRAVFGCENSET 196

Query: 274 G-LFVGSA-GLLGLGGGMLSLTKQ-----IKATSLAYCLVDRDSPASGVLEFNSARGGDA 326
           G LF   A G++GLG G LS+  Q     +   S + C    D     ++        D 
Sbjct: 197 GDLFSQHADGIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGGMDIGGGAMVLGAMPAPPDM 256

Query: 327 VTAPLIRNKKVDTFYY-VGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQT 385
           V +   R+  V + YY + L    V G+A+++ P +F+       G ++D GT    L  
Sbjct: 257 VFS---RSDPVRSPYYNIELKEIHVAGKALRLDPRIFDSKH----GTVLDSGTTYAYLPE 309

Query: 386 QAYNSLRDSFVRLAGNLKPTSG--VALFDTCYDFSGLR----SVRVPTVSLHFGAGKALD 439
           QA+ + +D+       LK   G      D C+  +G      S   P V + FG G+ L 
Sbjct: 310 QAFVAFKDAVTSKVRPLKKIRGPDPNYKDICFAGAGRNVSQLSQAFPDVDMVFGDGQKLS 369

Query: 440 LPAKNYLIPVDSA-GTFCFA-FAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           L  +NYL       G +C   F       +++G +  + T V++D  N ++GF    C
Sbjct: 370 LSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDRHNEKIGFWKTNC 427


>gi|449445943|ref|XP_004140731.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
          Length = 430

 Score =  132 bits (333), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 106/366 (28%), Positives = 165/366 (45%), Gaps = 46/366 (12%)

Query: 165 VGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLP-------CAAP 217
           +GTPP+   +VLDTGS ++W+QC    +  ++  P+  PKT+S    L        C  P
Sbjct: 72  IGTPPQPTDLVLDTGSQLSWIQCHD-KKVKKRLPPLPKPKTASFDPSLSSSFSLLPCNHP 130

Query: 218 QCK------SLDVSACRANR-CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGH 270
            CK      +L  S C  NR C Y   Y DG+   G+LV E  +F  S S   + LGC  
Sbjct: 131 ICKPRIPDFTLPTS-CDQNRLCHYSYFYADGTLAEGNLVREKFTFSKSLSTPPVILGCAQ 189

Query: 271 DNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDR------------DSPASGVLEF 318
            +      + G+LG+  G LS   Q K +  +YC+  R            D+P S   ++
Sbjct: 190 AS----TENRGILGMNHGRLSFISQAKISKFSYCVPSRTGSNPTGLFYLGDNPNSSKFKY 245

Query: 319 NSARGGDAVTAPLIRNK-KVDTF-YYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDC 376
            +      +T P  ++   +D   Y + +    + G+ + IPP+ F+ D  G G  ++D 
Sbjct: 246 VTM-----LTFPESQSSPNLDPLAYTLPMKAIKIAGKRLNIPPAAFKPDAGGSGQTMIDS 300

Query: 377 GTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVA--LFDTCYDFSGLRSV--RVPTVSLHF 432
           G+ +T L  +AY  +++  VRL G +     V   + D C+D      V  R+  +S  F
Sbjct: 301 GSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYADVADMCFDAGVTAEVGRRIGGISFEF 360

Query: 433 GAGKALDLPAKNYLIPVDSAGTFCFAFAPTSS---ALSIIGNVQQQGTRVSFDLANNRVG 489
             G  + +     ++     G  C     +       +IIG V QQ   V +DLAN RVG
Sbjct: 361 DNGVEIFVGRGEGVLTEVEKGVKCVGIGRSERLGIGSNIIGTVHQQNMWVEYDLANKRVG 420

Query: 490 FTPNKC 495
           F   +C
Sbjct: 421 FGGAEC 426


>gi|22165126|gb|AAM93742.1| putative chloroplast nucleoid DNA binding protein [Oryza sativa
           Japonica Group]
 gi|31433307|gb|AAP54836.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
 gi|125575547|gb|EAZ16831.1| hypothetical protein OsJ_32302 [Oryza sativa Japonica Group]
          Length = 405

 Score =  132 bits (332), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 101/358 (28%), Positives = 159/358 (44%), Gaps = 30/358 (8%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAA 216
           G Y +   +GTPP+  S V+D   ++ W QC PC  C++Q  P+FDP  SS++  LPC +
Sbjct: 55  GLYVANFTIGTPPQPVSAVVDLTGELVWTQCTPCQPCFEQDLPLFDPTKSSTFRGLPCGS 114

Query: 217 PQCKSLDVSA--CRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEG 274
             C+S+  S+  C ++ C+Y+     G  T G   T+T + G +    G       D   
Sbjct: 115 HLCESIPESSRNCTSDVCIYEAPTKAGD-TGGKAGTDTFAIGAAKETLGFGCVVMTDKRL 173

Query: 275 LFVGS-AGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPA--SGVLEFNSARGGDAVTAPL 331
             +G  +G++GLG    SL  Q+  T+ +YCL  + S A   G      A G ++ T  +
Sbjct: 174 KTIGGPSGIVGLGRTPWSLVTQMNVTAFSYCLAGKSSGALFLGATAKQLAGGKNSSTPFV 233

Query: 332 IR------NKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQT 385
           I+      +   + +Y V L G   GG  +Q   S           +++D  +  + L  
Sbjct: 234 IKTSAGSSDNGSNPYYMVKLAGIKTGGAPLQAASS-------SGSTVLLDTVSRASYLAD 286

Query: 386 QAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNY 445
            AY +L+ +     G     S    +D C  F    +   P +   F  G AL +P  NY
Sbjct: 287 GAYKALKKALTAAVGVQPVASPPKPYDLC--FPKAVAGDAPELVFTFDGGAALTVPPANY 344

Query: 446 LIPVDSAGTFCFAFAPTSS--------ALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           L+     GT C     ++S          SI+G++QQ+   V FDL    + F P  C
Sbjct: 345 LL-ASGNGTVCLTIGSSASLNLTGELEGASILGSLQQENVHVLFDLKEETLSFKPADC 401


>gi|380719867|gb|AFD63134.1| aspartyl protease [Vitis quinquangularis]
          Length = 458

 Score =  132 bits (332), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 119/419 (28%), Positives = 181/419 (43%), Gaps = 58/419 (13%)

Query: 121 QLAIYNVDR-HELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTG 179
            LA  ++ R H LK  +A  L +    P   GA      +   +  GTPP++ S ++DTG
Sbjct: 54  HLATASMSRSHHLKHGKASPLIQTSLFPHSYGA------HTIPLSFGTPPQKLSFLMDTG 107

Query: 180 SDINWLQC---RPCTEC---YQQSDPIFDPKTSSSYSPLPCAAPQCK-------SLDVSA 226
           S + W  C     CT C     +  PIF+P+ SSS   L C  P+C         L    
Sbjct: 108 SHVVWAPCTTHYTCTNCSFSNPKKVPIFNPELSSSDKILGCRDPKCADTSSPBVHLGXPR 167

Query: 227 CRAN--RC-----LYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGC--GHDNEGLFV 277
           C  N  +C      Y + YG G+ + G  + E + F    ++    +GC    D E    
Sbjct: 168 CNGNSKKCSHACPQYTLQYGTGAAS-GFFLLENLDFPGK-TIHKFLVGCTTSADREP--- 222

Query: 278 GSAGLLGLGGGMLSLTKQIKATSLAYCLVDRD---SPASG--VLEFNSARGGDAVTAPLI 332
            S  L G G  M SL  Q+     AYCL   D   +  SG  +L+++         AP  
Sbjct: 223 SSDALAGFGRTMFSLPMQMGVKKFAYCLNSHDYDDTRNSGKLILDYSDGETQGLSYAPFX 282

Query: 333 RNK-KVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAY--- 388
           +N      +YY+G+    +G + ++IP           GG+++D G A + +    +   
Sbjct: 283 KNPPDYPIYYYLGVKDMKIGNKVLRIPGKYLTPGSDSRGGVVIDSGFAYSYMTLPVFKIV 342

Query: 389 -NSLRD--SFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNY 445
            N L+   S  R +  L+  +GV     CY+F+G +S+++P +   F  G  + +P  NY
Sbjct: 343 TNELKKQMSKYRRSLELEAQTGVT---PCYNFTGHKSIKIPDLIYQFTGGANMVVPGMNY 399

Query: 446 LIPVDSAGTFCFAF---APTSS------ALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            +    A   CF     +PTS+         I+GN QQ    V FDL N R+GF    C
Sbjct: 400 FLLFSEASLGCFPVTTDSPTSNLEFTPGPSIILGNYQQVDHYVEFDLKNERLGFRQQTC 458


>gi|356543524|ref|XP_003540210.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 493

 Score =  132 bits (332), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 108/372 (29%), Positives = 172/372 (46%), Gaps = 43/372 (11%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYSP 211
           G Y++++ +GTPP +F++ +DTGSD+ W+ C  C  C Q S        FDP +SS+ S 
Sbjct: 76  GLYYTKVQLGTPPVEFNVQIDTGSDVLWVSCNSCNGCPQTSGLQIQLNFFDPGSSSTSSM 135

Query: 212 LPCAAPQC----KSLDVS-ACRANRCLYQVAYGDGSFTVGDLVTETVSFG-------NSG 259
           + C+  +C    +S D + + + N+C Y   YGDGS T G  V++ +           + 
Sbjct: 136 IACSDQRCNNGKQSSDATCSSQNNQCSYTFQYGDGSGTSGYYVSDMMHLNTIFEGSMTTN 195

Query: 260 SVKGIALGCGHDNEGLFVGS----AGLLGLGGGMLSLTKQIKATSLA-----YCLVDRDS 310
           S   +  GC +   G    S     G+ G G   +S+  Q+ +  +A     +CL   DS
Sbjct: 196 STAPVVFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRIFSHCL-KGDS 254

Query: 311 PASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDG 370
              G+L        + V   L+  +     Y + L   SV GQ +QI  S+F    +   
Sbjct: 255 SGGGILVLGEIVEPNIVYTSLVPAQP---HYNLNLQSISVNGQTLQIDSSVFATSNS--R 309

Query: 371 GIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTS---GVALFDTCYDFSGLRSVRVPT 427
           G IVD GT +  L  +AY    D FV       P S    V+  + CY  +   +   P 
Sbjct: 310 GTIVDSGTTLAYLAEEAY----DPFVSAITAAIPQSVRTVVSRGNQCYLITSSVTDVFPQ 365

Query: 428 VSLHFGAGKALDLPAKNYLIPVDSAG---TFCFAFAPTS-SALSIIGNVQQQGTRVSFDL 483
           VSL+F  G ++ L  ++YLI  +S G    +C  F       ++I+G++  +   V +DL
Sbjct: 366 VSLNFAGGASMILRPQDYLIQQNSIGGAAVWCIGFQKIQGQGITILGDLVLKDKIVVYDL 425

Query: 484 ANNRVGFTPNKC 495
           A  R+G+    C
Sbjct: 426 AGQRIGWANYDC 437


>gi|414869114|tpg|DAA47671.1| TPA: hypothetical protein ZEAMMB73_872184 [Zea mays]
          Length = 492

 Score =  132 bits (332), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 106/367 (28%), Positives = 167/367 (45%), Gaps = 29/367 (7%)

Query: 147 PVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTS 206
           P+      G+ +Y   +G GTP +QF M LDT   ++ + C+PC       DP FD   S
Sbjct: 137 PIDGSPDAGALDYTVNVGYGTPEQQFPMFLDTIFGVSLVLCKPCAPGSTSCDPAFDTSQS 196

Query: 207 SSYSPLPCAAPQCKSLDVSACRANR-CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIA 265
           ++++ +PC +P C S   + C A   C + + + +G+F+      + ++   S +V+   
Sbjct: 197 TTFTHVPCDSPDCPS--TANCSAGSVCPFNLFFVEGTFS-----QDVLTVAPSVAVQDFT 249

Query: 266 LGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDR-DSPASGVLEFNSA 321
             C        +   G L L     SL  ++  +   + +YC+    DSP    L  ++ 
Sbjct: 250 FVCLDAGASDGMPEVGTLDLSRDRNSLPSRLAGSASAAFSYCMPQYPDSPGFLSLGDDAT 309

Query: 322 RGGDAVT--APLIRNKKVD--TFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCG 377
             GD  T  APL+ +   D    Y++ + G S+G   + IP   F      +   IV+ G
Sbjct: 310 VRGDNCTAHAPLLSSDDPDLANMYFIDVVGMSLGDVDLPIPSGTF----GNNASTIVEAG 365

Query: 378 TAITRLQTQAYNSLRDSFVR-LAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGK 436
           T  T L   AY  LRD+F + +A   +   G   FDTCY+F+GL+ + VP V   FG G 
Sbjct: 366 TTFTMLAPDAYTPLRDAFRQAMAQYNRSVPGFYDFDTCYNFTGLQELTVPLVEFKFGNGD 425

Query: 437 ALDLPAKNYL-IPVDSAGTF---CFAFAPTSSAL----SIIGNVQQQGTRVSFDLANNRV 488
           +L +     L   + S G F   C AF+          ++IG      T V +D+A   V
Sbjct: 426 SLLIDGDQMLYYDIPSEGPFTVTCLAFSTLDVDDDDVSAVIGAYSLATTEVVYDVAGGTV 485

Query: 489 GFTPNKC 495
           GF P  C
Sbjct: 486 GFIPESC 492


>gi|356495496|ref|XP_003516613.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 645

 Score =  132 bits (332), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 103/363 (28%), Positives = 170/363 (46%), Gaps = 40/363 (11%)

Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCA 215
           +G Y +R+ +GTPP++F++++DTGS + ++ C  C  C    DP F P+ S +Y P+ C 
Sbjct: 90  NGYYTARLWIGTPPQRFALIVDTGSTVTYVPCSTCRHCGSHQDPKFRPEDSETYQPVKCT 149

Query: 216 APQCKSLDVSACRANR--CLYQVAYGDGSFTVGDLVTETVSFGNSG--SVKGIALGCGHD 271
             QC       C  +R  C Y+  Y + S + G L  + VSFGN    S +    GC +D
Sbjct: 150 W-QCN------CDNDRKQCTYERRYAEMSTSSGALGEDVVSFGNQTELSPQRAIFGCEND 202

Query: 272 NEGLFVG--SAGLLGLGGGMLSLT-----KQIKATSLAYCLVDRDSPASGVLEFNSARGG 324
             G      + G++GLG G LS+      K++ + S + C         GV       GG
Sbjct: 203 ETGDIYNQRADGIMGLGRGDLSIMDQLVEKKVISDSFSLCYG-----GMGVGGGAMVLGG 257

Query: 325 DAVTAPLI--RNKKVDTFYY-VGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAIT 381
            +  A ++  R+  V + YY + L    V G+ + + P +F+    G  G ++D GT   
Sbjct: 258 ISPPADMVFTRSDPVRSPYYNIDLKEIHVAGKRLHLNPKVFD----GKHGTVLDSGTTYA 313

Query: 382 RLQTQAYNSLRDSFVRLAGNLKPTSG--VALFDTCY-----DFSGLRSVRVPTVSLHFGA 434
            L   A+ + + + ++   +LK  SG      D C+     D S + S   P V + FG 
Sbjct: 314 YLPESAFLAFKHAIMKETHSLKRISGPDPRYNDICFSGAEIDVSQI-SKSFPVVEMVFGN 372

Query: 435 GKALDLPAKNYLIPVDSA-GTFCF-AFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTP 492
           G  L L  +NYL       G +C   F+  +   +++G +  + T V +D  + ++GF  
Sbjct: 373 GHKLSLSPENYLFRHSKVRGAYCLGVFSNGNDPTTLLGGIVVRNTLVMYDREHTKIGFWK 432

Query: 493 NKC 495
             C
Sbjct: 433 TNC 435


>gi|356541713|ref|XP_003539318.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 640

 Score =  132 bits (331), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 104/360 (28%), Positives = 168/360 (46%), Gaps = 34/360 (9%)

Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCA 215
           +G Y +R+ +GTPP++F++++DTGS + ++ C  C  C    DP F P+ S +Y P+ C 
Sbjct: 90  NGYYTTRLWIGTPPQRFALIVDTGSTVTYVPCSTCKHCGSHQDPKFRPEASETYQPVKCT 149

Query: 216 APQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSG--SVKGIALGCGHDNE 273
             QC   D       +C Y+  Y + S + G L  + VSFGN    S +    GC +D  
Sbjct: 150 W-QCNCDD----DRKQCTYERRYAEMSTSSGVLGEDVVSFGNQSELSPQRAIFGCENDET 204

Query: 274 GLFVG--SAGLLGLGGGMLSLTKQI---KATSLAYCLVDRDSPASGVLEFNSARGGDAVT 328
           G      + G++GLG G LS+  Q+   K  S A+ L        GV       GG +  
Sbjct: 205 GDIYNQRADGIMGLGRGDLSIMDQLVEKKVISDAFSLC---YGGMGVGGGAMVLGGISPP 261

Query: 329 APLI--RNKKVDTFYY-VGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQT 385
           A ++   +  V + YY + L    V G+ + + P +F+    G  G ++D GT    L  
Sbjct: 262 ADMVFTHSDPVRSPYYNIDLKEIHVAGKRLHLNPKVFD----GKHGTVLDSGTTYAYLPE 317

Query: 386 QAYNSLRDSFVRLAGNLKPTSG--VALFDTCYDFSGLR------SVRVPTVSLHFGAGKA 437
            A+ + + + ++   +LK  SG      D C  FSG        S   P V + FG G  
Sbjct: 318 SAFLAFKHAIMKETHSLKRISGPDPHYNDIC--FSGAEINVSQLSKSFPVVEMVFGNGHK 375

Query: 438 LDLPAKNYLIPVDSA-GTFCF-AFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           L L  +NYL       G +C   F+  +   +++G +  + T V +D  ++++GF    C
Sbjct: 376 LSLSPENYLFRHSKVRGAYCLGVFSNGNDPTTLLGGIVVRNTLVMYDREHSKIGFWKTNC 435


>gi|302764208|ref|XP_002965525.1| hypothetical protein SELMODRAFT_406966 [Selaginella moellendorffii]
 gi|300166339|gb|EFJ32945.1| hypothetical protein SELMODRAFT_406966 [Selaginella moellendorffii]
          Length = 464

 Score =  132 bits (331), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 118/428 (27%), Positives = 181/428 (42%), Gaps = 55/428 (12%)

Query: 88  EILHKTRHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFS-T 146
           E+LH+   +D+         R  A  + L++       + D   L  AE + +  D + T
Sbjct: 65  ELLHEVVTHDF--------ARARALASRLVSSNSPNRSSSDHRHL--AEEEEVEHDLAQT 114

Query: 147 PVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCT-ECYQQSDPIFDPKT 205
           PV   +    G Y+S I +G+PP+ FS+V+DTGSD+ W++C PC+ +C       FD   
Sbjct: 115 PV---SFTNGGVYYSSITLGSPPKDFSLVMDTGSDLTWVRCDPCSPDC----SSTFDRLA 167

Query: 206 SSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVK--- 262
           S++Y  L CA       D+      R   ++      F  G  + +T+    + S +   
Sbjct: 168 SNTYKALTCAD------DLRLPVLLRLWRRL------FHSGRSLRDTLKMAGAASDELEE 215

Query: 263 --GIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQI---KATSLAYCLVD-------RDS 310
             G   GCG   +GL  G  G+L L  G LS   QI        +YCL+        + S
Sbjct: 216 FPGFVFGCGSLLKGLISGEVGILALSPGSLSFPSQIGEKYGNKFSYCLLRQTAQNSLKKS 275

Query: 311 P---ASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEA 367
           P       +E      G           +   +Y V L G SVG Q + + PS F     
Sbjct: 276 PMVFGEAAVELKEPGSGKPQELQYTPIGESSIYYTVRLDGISVGNQRLDLSPSTFL--NG 333

Query: 368 GDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPT 427
            D   I D GT +T L +   +S++ S   +    +  + +   D C+         +P 
Sbjct: 334 QDKPTIFDSGTTLTMLPSGVCDSIKQSLASMVSGAEFVA-IKGLDACFRVPPSSGQGLPD 392

Query: 428 VSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNR 487
           ++ HF  G        NY+I + S    C  F PT+  +SI GN+QQQ   V  D+ N R
Sbjct: 393 ITFHFNGGADFVTRPSNYVIDLGSLQ--CLIFVPTNE-VSIFGNLQQQDFFVLHDMDNRR 449

Query: 488 VGFTPNKC 495
           +GF    C
Sbjct: 450 IGFKETDC 457


>gi|356537161|ref|XP_003537098.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 601

 Score =  132 bits (331), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 139/486 (28%), Positives = 201/486 (41%), Gaps = 74/486 (15%)

Query: 71  SFPLNSSSS-FSLPLHSREILHKTRHNDY---------RSLVLSRLERDSARVNTLITKL 120
           SFP N+SSS +SLPL    + H T H+ Y             L      S+  +   T L
Sbjct: 124 SFPQNASSSHYSLPL-LFPLHHITIHHHYFIHPHPQHHHHPPLFTHHPSSSNSHPFHT-L 181

Query: 121 QLAI-YNVDR-HELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDT 178
           QLA+  ++ R H LK       P    T V     +  G Y   +  GTPP+ F  VLDT
Sbjct: 182 QLAVSTSITRAHHLKNHNN---PSSLKTLV---HPKTYGGYSIDLKFGTPPQTFPFVLDT 235

Query: 179 GSDINWLQCRP---CTECYQQSD---PIFDPKTSSSYSPLPCAAPQCKSL---DVSA--C 227
           GS + WL C     C++C   S+   P F PK S S   + C  P+C  +   DV++  C
Sbjct: 236 GSSLVWLPCYSHYLCSKCNSFSNNNTPKFIPKDSFSSKFVGCRNPKCAWVFGSDVTSHCC 295

Query: 228 R--------ANRC-----LYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEG 274
           +         N C      Y V YG GS T G L++E ++F  + +V    +GC   +  
Sbjct: 296 KLAKAAFSNNNNCSQTCPAYTVQYGLGS-TAGFLLSENLNFP-AKNVSDFLVGCSVVS-- 351

Query: 275 LFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDR---DSPASGVLEFNSARGGDA----- 326
                 G+ G G G  SL  Q+  T  +YCL+     +SP +  L   +   G+      
Sbjct: 352 -VYQPGGIAGFGRGEESLPAQMNLTRFSYCLLSHQFDESPENSDLVMEATNSGEGKKTNG 410

Query: 327 ------VTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAI 380
                 +  P  +      +YY+ L    VG + V++P  + E D  GDGG IVD G+ +
Sbjct: 411 VSYTAFLKNPSTKKPAFGAYYYITLRKIVVGEKRVRVPRRMLEPDVNGDGGFIVDSGSTL 470

Query: 381 TRLQTQAYNSLRDSFVRLAGNLKPTSGVALF--DTCYDFS-GLRSVRVPTVSLHFGAGKA 437
           T ++   ++ + + FV+     +       F    C+  + G  +   P +   F  G  
Sbjct: 471 TFMERPIFDLVAEEFVKQVNYTRARELEKQFGLSPCFVLAGGAETASFPEMRFEFRGGAK 530

Query: 438 LDLPAKNYLIPVDSAGTFCFAFAPTSSA--------LSIIGNVQQQGTRVSFDLANNRVG 489
           + LP  NY   V      C        A          I+GN QQQ   V  DL N R G
Sbjct: 531 MRLPVANYFSRVGKGDVACLTIVSDDVAGQGGAVGPAVILGNYQQQNFYVECDLENERFG 590

Query: 490 FTPNKC 495
           F    C
Sbjct: 591 FRSQSC 596


>gi|21717162|gb|AAM76355.1|AC074196_13 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433286|gb|AAP54824.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
          Length = 397

 Score =  131 bits (330), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 100/352 (28%), Positives = 156/352 (44%), Gaps = 39/352 (11%)

Query: 165 VGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDV 224
           +GTPP+  S  +D   ++ W QC  C  C++Q  P+F P  SS++ P PC    CKS+  
Sbjct: 60  IGTPPQAASAFIDLTGELVWTQCSQCIHCFKQDLPVFVPNASSTFKPEPCGTDVCKSIPT 119

Query: 225 SACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGC----GHDNEGLFVGSA 280
             C ++ C Y    G G  TVG + T+T + G +     +  GC      D  G   G +
Sbjct: 120 PKCASDVCAYDGVTGLGGHTVGIVATDTFAIGTAAPAS-LGFGCVVASDIDTMG---GPS 175

Query: 281 GLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSA---RGGDAVTAPLIR---N 334
           G +GLG    SL  Q+K T  +YCL   D+  +  L   ++    GG A T P ++   N
Sbjct: 176 GFIGLGRTPWSLVAQMKLTRFSYCLAPHDTGKNSRLFLGASAKLAGGGAWT-PFVKTSPN 234

Query: 335 KKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITR---LQTQAYNSL 391
             +  +Y + L     G   + +P         G   ++V   TA+ R   L    Y   
Sbjct: 235 DGMSQYYPIELEEIKAGDATITMP--------RGRNTVLVQ--TAVVRVSLLVDSVYQEF 284

Query: 392 RDSFVRLAGNLKPTSGV-ALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVD 450
           + + +   G     + V A F+ C+  +G+     P +   F AG AL +P  NYL  V 
Sbjct: 285 KKAVMASVGAAPTATPVGAPFEVCFPKAGVSG--APDLVFTFQAGAALTVPPANYLFDVG 342

Query: 451 SAGTFCFAFAPTS-------SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           +  T C +    +         L+I+G+ QQ+   + FDL  + + F P  C
Sbjct: 343 N-DTVCLSVMSIALLNITALDGLNILGSFQQENVHLLFDLDKDMLSFEPADC 393


>gi|297819968|ref|XP_002877867.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297323705|gb|EFH54126.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 469

 Score =  131 bits (330), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 115/391 (29%), Positives = 167/391 (42%), Gaps = 63/391 (16%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----------PIFDPKT 205
           G Y   +  GTP +    V DTGS + W    PCT  Y  SD           P F PK 
Sbjct: 88  GGYSVSLSFGTPSQTIPFVFDTGSSLVWF---PCTSRYLCSDCNFSGLDPTQIPRFIPKN 144

Query: 206 SSSYSPLPCAAPQCK-----SLDVSACRAN--RCL-----YQVAYGDGSFTVGDLVTETV 253
           SSS   + C  P+C+     ++    C  N   C      Y + YG GS T G L++E +
Sbjct: 145 SSSSRVIGCQNPKCQFLFGANVQCRGCDPNTRNCTVPCPPYILQYGLGS-TAGILISEKL 203

Query: 254 SFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDR---DS 310
            F +  +V    +GC   +       AG+ G G G  SL  Q+K  S ++CLV R   D+
Sbjct: 204 DFPDL-TVPDFVVGCSVISTRT---PAGIAGFGRGPESLPSQMKLKSFSHCLVSRRFDDT 259

Query: 311 PASGVLEFNSARGGDAVT-------APLIRNKKVDT-----FYYVGLTGFSVGGQAVQIP 358
             +  L  ++  G  + +        P  +N  V       +YY+ L    VG + V+IP
Sbjct: 260 NVTTDLGLDTGSGHKSGSKTPGLSYTPFRKNPNVSNTAFLEYYYLNLRRIYVGSKHVKIP 319

Query: 359 PSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGN------LKPTSGVALFD 412
                    G+GG IVD G+  T ++   +  + + F     N      L+  SG+A   
Sbjct: 320 YKFLAPGTNGNGGSIVDSGSTFTFMERPVFELVAEEFATQMSNYTREKDLEKVSGIA--- 376

Query: 413 TCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSA------- 465
            C++ SG   V VP +   F  G  ++LP  NY   V +A T C      ++        
Sbjct: 377 PCFNISGKGDVTVPELIFEFKGGAKMELPLSNYFSFVGNADTVCLTVVSDNTVNPGGGTG 436

Query: 466 -LSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
              I+G+ QQQ   V +DL N+R GF   KC
Sbjct: 437 PAIILGSFQQQNYLVEYDLENDRFGFAKKKC 467


>gi|356539352|ref|XP_003538162.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 489

 Score =  131 bits (330), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 111/373 (29%), Positives = 179/373 (47%), Gaps = 44/373 (11%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYSP 211
           G Y++++ +GTPPR+  + +DTGSD+ W+ C  C  C Q S        FDP +SS+ S 
Sbjct: 75  GLYYTKVKLGTPPRELYVQIDTGSDVLWVSCGSCNGCPQTSGLQIQLNYFDPGSSSTSSL 134

Query: 212 LPCAAPQCKS---LDVSAC--RANRCLYQVAYGDGSFTVGDLVTETVSFGN-------SG 259
           + C   +C+S      ++C  R N+C Y   YGDGS T G  V++ + F +       + 
Sbjct: 135 ISCLDRRCRSGVQTSDASCSGRNNQCTYTFQYGDGSGTSGYYVSDLMHFASIFEGTLTTN 194

Query: 260 SVKGIALGCGHDNEGLFVGSA----GLLGLGGGMLSLTKQIKATSLA-----YCLVDRDS 310
           S   +  GC     G    S     G+ G G   +S+  Q+ +  +A     +CL   D+
Sbjct: 195 SSASVVFGCSILQTGDLTKSERAVDGIFGFGQQGMSVISQLSSQGIAPRVFSHCL-KGDN 253

Query: 311 PASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDG 370
              GVL        + V +PL+ ++     Y + L   SV GQ V+I PS+F    + + 
Sbjct: 254 SGGGVLVLGEIVEPNIVYSPLVPSQP---HYNLNLQSISVNGQIVRIAPSVFA--TSNNR 308

Query: 371 GIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALF---DTCYDFSGLRSVRV-P 426
           G IVD GT +  L  +AYN     FV     + P S  ++    + CY  +   +V + P
Sbjct: 309 GTIVDSGTTLAYLAEEAYN----PFVIAIAAVIPQSVRSVLSRGNQCYLITTSSNVDIFP 364

Query: 427 TVSLHFGAGKALDLPAKNYLIPVDSAG---TFCFAFAPTS-SALSIIGNVQQQGTRVSFD 482
            VSL+F  G +L L  ++YL+  +  G    +C  F   S  +++I+G++  +     +D
Sbjct: 365 QVSLNFAGGASLVLRPQDYLMQQNFIGEGSVWCIGFQKISGQSITILGDLVLKDKIFVYD 424

Query: 483 LANNRVGFTPNKC 495
           LA  R+G+    C
Sbjct: 425 LAGQRIGWANYDC 437


>gi|449485448|ref|XP_004157171.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
          Length = 430

 Score =  131 bits (330), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 105/366 (28%), Positives = 165/366 (45%), Gaps = 46/366 (12%)

Query: 165 VGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLP-------CAAP 217
           +GTPP+   +VLDTGS ++W+QC    +  ++  P+  PKT+S    L        C  P
Sbjct: 72  IGTPPQPTDLVLDTGSQLSWIQCHD-KKIKKRLPPLPKPKTTSFDPSLSSSFSLLPCNHP 130

Query: 218 QCK------SLDVSACRANR-CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGH 270
            CK      +L  S C  NR C Y   Y DG+   G+LV E  +F  S S   + LGC  
Sbjct: 131 ICKPRIPDFTLPTS-CDQNRLCHYSYFYADGTLAEGNLVREKFTFSKSLSTPPVILGCAQ 189

Query: 271 DNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDR------------DSPASGVLEF 318
            +      + G+LG+  G LS   Q K +  +YC+  R            D+P S   ++
Sbjct: 190 AS----TENRGILGMNRGRLSFISQAKISKFSYCVPSRTGSNPTGLFYLGDNPNSSKFKY 245

Query: 319 NSARGGDAVTAPLIRNK-KVDTF-YYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDC 376
            +      +T P  ++   +D   Y + +    + G+ + +PP+ F+ D  G G  ++D 
Sbjct: 246 VTM-----LTFPESQSSPNLDPLAYTLPMKAIKIAGKRLNVPPAAFKPDAGGSGQTMIDS 300

Query: 377 GTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVA--LFDTCYDFSGLRSV--RVPTVSLHF 432
           G+ +T L  +AY  +++  VRL G +     V   + D C+D      V  R+  +S  F
Sbjct: 301 GSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYADVADMCFDAGVTAEVGRRIGGISFEF 360

Query: 433 GAGKALDLPAKNYLIPVDSAGTFCFAFAPTSS---ALSIIGNVQQQGTRVSFDLANNRVG 489
             G  + +     ++     G  C     +       +IIG V QQ   V +DLAN RVG
Sbjct: 361 DNGVEIFVGRGEGVLTEVEKGVKCVGIGRSERLGIGSNIIGTVHQQNMWVEYDLANKRVG 420

Query: 490 FTPNKC 495
           F   +C
Sbjct: 421 FGGAEC 426


>gi|18409620|ref|NP_566966.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|13430562|gb|AAK25903.1|AF360193_1 unknown protein [Arabidopsis thaliana]
 gi|4886277|emb|CAB43423.1| putative protein [Arabidopsis thaliana]
 gi|14532764|gb|AAK64083.1| unknown protein [Arabidopsis thaliana]
 gi|15450892|gb|AAK96717.1| Unknown protein [Arabidopsis thaliana]
 gi|30387567|gb|AAP31949.1| At3g52500 [Arabidopsis thaliana]
 gi|332645431|gb|AEE78952.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 469

 Score =  131 bits (330), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 113/394 (28%), Positives = 170/394 (43%), Gaps = 59/394 (14%)

Query: 152 ASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRP---CTEC-YQQSDPI----FDP 203
           +++  G Y   +  GTP +    V DTGS + WL C     C+ C +   DP     F P
Sbjct: 83  SAKSYGGYSVSLSFGTPSQTIPFVFDTGSSLVWLPCTSRYLCSGCDFSGLDPTLIPRFIP 142

Query: 204 KTSSSYSPLPCAAPQCKSL------------DVSACRANRCLYQVAYGDGSFTVGDLVTE 251
           K SSS   + C +P+C+ L            +   C      Y + YG GS T G L+TE
Sbjct: 143 KNSSSSKIIGCQSPKCQFLYGPNVQCRGCDPNTRNCTVGCPPYILQYGLGS-TAGVLITE 201

Query: 252 TVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDR--- 308
            + F +  +V    +GC   +       AG+ G G G +SL  Q+     ++CLV R   
Sbjct: 202 KLDFPDL-TVPDFVVGCSIIST---RQPAGIAGFGRGPVSLPSQMNLKRFSHCLVSRRFD 257

Query: 309 DSPASGVLEFNSARGGDAVTA------------PLIRNKKVDTFYYVGLTGFSVGGQAVQ 356
           D+  +  L+ ++  G ++ +             P + NK    +YY+ L    VG + V+
Sbjct: 258 DTNVTTDLDLDTGSGHNSGSKTPGLTYTPFRKNPNVSNKAFLEYYYLNLRRIYVGRKHVK 317

Query: 357 IPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGN------LKPTSGVAL 410
           IP         GDGG IVD G+  T ++   +  + + F     N      L+  +G+  
Sbjct: 318 IPYKYLAPGTNGDGGSIVDSGSTFTFMERPVFELVAEEFASQMSNYTREKDLEKETGLG- 376

Query: 411 FDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAP--------- 461
              C++ SG   V VP +   F  G  L+LP  NY   V +  T C              
Sbjct: 377 --PCFNISGKGDVTVPELIFEFKGGAKLELPLSNYFTFVGNTDTVCLTVVSDKTVNPSGG 434

Query: 462 TSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           T  A+ I+G+ QQQ   V +DL N+R GF   KC
Sbjct: 435 TGPAI-ILGSFQQQNYLVEYDLENDRFGFAKKKC 467


>gi|168014188|ref|XP_001759635.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162689174|gb|EDQ75547.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 485

 Score =  131 bits (329), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 105/364 (28%), Positives = 175/364 (48%), Gaps = 38/364 (10%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTEC-YQQS--DPIFDPKTSSSYSPLP 213
           G Y SR+ +GTP ++F++++DTGS + ++ C  CT C + Q+  DP F P  SSSY  + 
Sbjct: 97  GYYTSRVFIGTPAQEFALIVDTGSTVTYVPCSSCTHCGHHQACFDPRFKPDNSSSYQTVS 156

Query: 214 CAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVK--GIALGCGHD 271
           C +P C +    A R ++C Y+  Y + S + G L  + + FGN   ++   +  GC   
Sbjct: 157 CNSPDCITKMCDA-RVHQCKYERVYAEMSSSKGVLGKDLLGFGNGSRLQPHPLLFGCETA 215

Query: 272 NEG-LFVGSA-GLLGLGGGMLSLTKQIKAT-------SLAYCLVDRDSPASGVLEFNSAR 322
             G L++  A G++GLG G LS+  Q+  T       SL Y  +D +   S VL      
Sbjct: 216 ETGDLYLQHADGIMGLGRGPLSIVDQLVGTGAMEDSFSLCYGGMD-EGGGSMVL------ 268

Query: 323 GGDAVTAPLIRNKKVD----TFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGT 378
            G     P +   K D     +Y + L+   V G ++ +P  +F     G  G ++D GT
Sbjct: 269 -GAIPPPPAMVFAKSDPNRSNYYNLELSEIQVQGVSLNVPSEVFN----GRLGTVLDSGT 323

Query: 379 AITRLQTQAYNSLRDSFVRLAGNLKPTSG--VALFDTCYDFSGLRS----VRVPTVSLHF 432
               L  +A+++ +D+  +  G+L+   G   +  D C+  +G  S       P V   F
Sbjct: 324 TYAYLPDKAFDAFKDAITQQLGSLQAVPGPDPSYPDVCFAGAGSDSKALGKHFPPVDFVF 383

Query: 433 GAGKALDLPAKNYLIP-VDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFT 491
              + + L  +NYL       G +C  F     A +++G +  + T V++D AN+++GF 
Sbjct: 384 SGNQKVFLAPENYLFKHTKVPGAYCLGFFKNQDATTLLGGIVVRNTLVTYDRANHQIGFF 443

Query: 492 PNKC 495
              C
Sbjct: 444 KTNC 447


>gi|356564743|ref|XP_003550608.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 490

 Score =  131 bits (329), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 107/372 (28%), Positives = 173/372 (46%), Gaps = 43/372 (11%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYSP 211
           G Y++++ +GTPP +F++ +DTGSD+ W+ C  C+ C Q S        FDP +SS+ S 
Sbjct: 73  GLYYTKVQLGTPPVEFNVQIDTGSDVLWVSCNSCSGCPQTSGLQIQLNFFDPGSSSTSSM 132

Query: 212 LPCAAPQC----KSLDVS-ACRANRCLYQVAYGDGSFTVGDLVTETVSFG-------NSG 259
           + C+  +C    +S D + + + N+C Y   YGDGS T G  V++ +           + 
Sbjct: 133 IACSDQRCNNGIQSSDATCSSQNNQCSYTFQYGDGSGTSGYYVSDMMHLNTIFEGSVTTN 192

Query: 260 SVKGIALGCGHDNEGLFVGS----AGLLGLGGGMLSLTKQIKATSLA-----YCLVDRDS 310
           S   +  GC +   G    S     G+ G G   +S+  Q+ +  +A     +CL   DS
Sbjct: 193 STAPVVFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRVFSHCL-KGDS 251

Query: 311 PASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDG 370
              G+L        + V   L+  +     Y + L   +V GQ +QI  S+F    +   
Sbjct: 252 SGGGILVLGEIVEPNIVYTSLVPAQP---HYNLNLQSIAVNGQTLQIDSSVFATSNS--R 306

Query: 371 GIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTS---GVALFDTCYDFSGLRSVRVPT 427
           G IVD GT +  L  +AY    D FV       P S    V+  + CY  +   +   P 
Sbjct: 307 GTIVDSGTTLAYLAEEAY----DPFVSAITASIPQSVHTVVSRGNQCYLITSSVTEVFPQ 362

Query: 428 VSLHFGAGKALDLPAKNYLIPVDSAG---TFCFAFAPTS-SALSIIGNVQQQGTRVSFDL 483
           VSL+F  G ++ L  ++YLI  +S G    +C  F       ++I+G++  +   V +DL
Sbjct: 363 VSLNFAGGASMILRPQDYLIQQNSIGGAAVWCIGFQKIQGQGITILGDLVLKDKIVVYDL 422

Query: 484 ANNRVGFTPNKC 495
           A  R+G+    C
Sbjct: 423 AGQRIGWANYDC 434


>gi|388518257|gb|AFK47190.1| unknown [Lotus japonicus]
          Length = 478

 Score =  131 bits (329), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 103/375 (27%), Positives = 165/375 (44%), Gaps = 46/375 (12%)

Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYS 210
           +G YF++IG+G+P + + + +DTGSDI W+ C  CT C ++SD      ++DPK S +  
Sbjct: 66  TGLYFTKIGLGSPSKDYYVQVDTGSDILWVNCVECTRCPRKSDIGIGLTLYDPKRSKTSE 125

Query: 211 PLPCAAPQCKSL---DVSACRA-NRCLYQVAYGDGSFTVGDLVTETVSF----GNSGSV- 261
            + C    C S     +  C+A N C Y ++YGDGS T G  V + ++F    GN  +  
Sbjct: 126 FVSCEHNFCSSTYEGRILGCKAENPCPYSISYGDGSATTGYYVQDYLTFNRVNGNPHTAT 185

Query: 262 --KGIALGCGHDNEGLFVGSA-----GLLGLGGGMLSLTKQIKATS-----LAYCLVDRD 309
               I  GCG    G F  S+     G++G G    S+  Q+ A+       ++CL    
Sbjct: 186 QNSSIIFGCGAAQSGTFASSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSHCL--DT 243

Query: 310 SPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGD 369
           +   G+            T PL+ N      Y V L    V G  +Q+P   F  D    
Sbjct: 244 NVGGGIFSIGEVVEPKVKTTPLVPNM---AHYNVILKNIEVDGDILQLPSDTF--DSENG 298

Query: 370 GGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFD---TCYDFSGLRSVRVP 426
            G ++D GT +  L    Y+ L    +     LK    V L +   +C+ ++G      P
Sbjct: 299 KGTVIDSGTTLAYLPRIVYDQLMSKVLAKQPRLK----VYLVEEQYSCFQYTGNVDSGFP 354

Query: 427 TVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSA------LSIIGNVQQQGTRVS 480
            V LHF    +L +   +YL        +C  +  ++S       ++++G+       V 
Sbjct: 355 IVKLHFEDSLSLTVYPHDYLFNYKGDSYWCIGWQKSASETKNGKDMTLLGDFVLSNKLVV 414

Query: 481 FDLANNRVGFTPNKC 495
           +DL N  +G+T   C
Sbjct: 415 YDLENMTIGWTDYNC 429


>gi|294463081|gb|ADE77078.1| unknown [Picea sitchensis]
          Length = 370

 Score =  131 bits (329), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 109/376 (28%), Positives = 165/376 (43%), Gaps = 66/376 (17%)

Query: 176 LDTGSDINWLQC---RPCTECYQQS--DPIFDPKTSSSYSPLPCAAPQCKSL-------- 222
           +DTGSD+ W+ C     C  C + S  + +F P+ SSS   + CA   CK+L        
Sbjct: 1   MDTGSDLVWVPCTRNYSCINCPEDSASNGVFLPRMSSSLHLVTCADSNCKTLYGNNTELL 60

Query: 223 ------DVSACRANRCLYQVAYGDGSFTVGDLVTETVSF-----GNSGSVKGIALGCGHD 271
                  +  C      Y + YG GS T G L+TET++        + ++   A+GC   
Sbjct: 61  CQSCAGSLKNCSETCPPYGIQYGRGS-TAGLLLTETLNLPLENGEGARAITHFAVGCS-- 117

Query: 272 NEGLFVGS---AGLLGLGGGMLSLTKQ----IKATSLAYCL----VDRDSPASGVLEFNS 320
                V S   +G+ G G G LS+  Q    I     AYCL     D ++  S ++  + 
Sbjct: 118 ----IVSSQQPSGIAGFGRGALSMPSQLGEHIGKDRFAYCLQSHRFDEENKKSLMVLGDK 173

Query: 321 ARGGDAVT--APLIRNKKV------DTFYYVGLTGFSVGGQAV-QIPPSLFEMDEAGDGG 371
           A   +      P + N +         +YY+GL G S+GG+ + Q+P  L   D  G+GG
Sbjct: 174 ALPNNIPLNYTPFLTNSRAPPSSQYGVYYYIGLRGVSIGGKRLKQLPSKLLRFDTKGNGG 233

Query: 372 IIVDCGTAITRLQTQAYNSLRDSFV-----RLAGNLKPTSGVALFDTCYDFSGLRSVRVP 426
            I+D GT  T    + +  +   F      R AG ++  +G+ L   CYD +GL ++ +P
Sbjct: 234 TIIDSGTTFTVFSDEIFKHIAAGFASQIGYRRAGEVEDKTGMGL---CYDVTGLENIVLP 290

Query: 427 TVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALS-------IIGNVQQQGTRV 479
             + HF  G  + LP  NY     S  + C     +   L        I+GN QQQ   +
Sbjct: 291 EFAFHFKGGSDMVLPVANYFSYFSSFDSICLTMISSRGLLEVDSGPAVILGNDQQQDFYL 350

Query: 480 SFDLANNRVGFTPNKC 495
            +D   NR+GFT   C
Sbjct: 351 LYDREKNRLGFTQQTC 366


>gi|449442641|ref|XP_004139089.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 478

 Score =  130 bits (328), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 107/378 (28%), Positives = 170/378 (44%), Gaps = 42/378 (11%)

Query: 150 SGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPK 204
           +G    +G Y++RIG+G+PP  F + +DTGSDI W+ C  C+ C ++SD      +++PK
Sbjct: 64  NGHPAETGLYYARIGIGSPPNDFHVQVDTGSDILWVNCVGCSNCPKKSDIGVDLQLYNPK 123

Query: 205 TSSSYSPLPCAAPQCKSL---DVSACRAN-RCLYQVAYGDGSFTVGDLVTETVSF----G 256
           +SS+ + + C  P C +     +  C+ +  C Y+V YGDGS T G  V + +      G
Sbjct: 124 SSSTSTLITCDQPFCSATYDAPIPGCKPDLLCQYKVIYGDGSATAGYFVNDYIQLQRAVG 183

Query: 257 NSGSVK---GIALGCGHDNEGLFVGSA----GLLGLGGGMLSLTKQIKATS-----LAYC 304
           N  + +    I  GCG    G    S+    G+LG G    S+  Q+ AT       A+C
Sbjct: 184 NHKTSETNGSIVFGCGAKQSGELGSSSEALDGILGFGQANSSMISQLAATGKVKKIFAHC 243

Query: 305 LVDRDS-PASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFE 363
           L   DS    G+            T P++ N+     Y V L G  VG  A+ +P  LFE
Sbjct: 244 L---DSISGGGIFAIGEVVEPKLKTTPVVPNQ---AHYNVVLNGVKVGDTALDLPLGLFE 297

Query: 364 MDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSV 423
                  G I+D GT +  L    Y  L +  +    +LK  +    F TC+ F      
Sbjct: 298 TSYK--RGAIIDSGTTLAYLPDSIYLPLMEKILGAQPDLKLRTVDDQF-TCFVFDKNVDD 354

Query: 424 RVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAF------APTSSALSIIGNVQQQGT 477
             PTV+  F     L +    YL  +     +C  +      +   + ++++G++  Q  
Sbjct: 355 GFPTVTFKFEESLILTIYPHEYLFQIRDD-VWCVGWQNSGAQSKDGNEVTLLGDLVLQNK 413

Query: 478 RVSFDLANNRVGFTPNKC 495
            V ++L N  +G+T   C
Sbjct: 414 LVYYNLENQTIGWTEYNC 431


>gi|293333354|ref|NP_001169607.1| uncharacterized protein LOC100383488 [Zea mays]
 gi|224030351|gb|ACN34251.1| unknown [Zea mays]
          Length = 342

 Score =  130 bits (328), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 107/343 (31%), Positives = 148/343 (43%), Gaps = 37/343 (10%)

Query: 185 LQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANR---CLYQVAYGDG 241
           +QC+PC  CY+Q DP+F+PK SSSY+ +PC +  C  LD   C  +    C Y   Y   
Sbjct: 1   MQCQPCVSCYRQLDPVFNPKLSSSYAVVPCTSDTCAQLDGHRCHEDDDGACQYTYKYSGH 60

Query: 242 SFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSA-GLLGLGGGMLSLTKQIKATS 300
             T G L  + ++ G       +  GC   + G     A GL+GLG G LSL  Q+    
Sbjct: 61  GVTKGTLAIDKLAIGGD-VFHAVVFGCSDSSVGGPAAQASGLVGLGRGPLSLVSQLSVHR 119

Query: 301 LAYCLVDRDSPASGVLEFNSARGG-----DAVTAPLIRNKKVDTFYYVGLTGFSVGGQA- 354
             YCL    S  SG L   +         D VT  +  + +  ++YY+ L G +VG Q  
Sbjct: 120 FMYCLPPPMSRTSGKLVLGAGADAVRNMSDRVTVTMSSSTRYPSYYYLNLDGLAVGDQTP 179

Query: 355 -----VQIPPSLFEMDEAGDG-------------GIIVDCGTAITRLQTQAYNSLRDSFV 396
                   PPS       G G             G+IVD  + I+ L+T  Y+ L D   
Sbjct: 180 GTTRNATSPPSGGAGGGGGGGGGGIVGAGGANAYGMIVDVASTISFLETSLYDELADDLE 239

Query: 397 RLAGNLKPTSGVAL-FDTCY---DFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSA 452
                 + T  + L  D C+   +  G+  V VPTVSL F  G+ L+L        V   
Sbjct: 240 EEIRLPRATPSLRLGLDLCFILPEGVGMDRVYVPTVSLSFD-GRWLELDRDRLF--VTDG 296

Query: 453 GTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
              C     T S +SI+GN Q Q  RV F+L   ++ F    C
Sbjct: 297 RMMCLMIGRT-SGVSILGNFQLQNMRVLFNLRRGKITFAKASC 338


>gi|357476865|ref|XP_003608718.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355509773|gb|AES90915.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 482

 Score =  130 bits (328), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 129/461 (27%), Positives = 190/461 (41%), Gaps = 98/461 (21%)

Query: 116 LITKLQLAIYNVDRHELKPAEAQILPE----------DFSTPVVSGASQGSGEYFSRIGV 165
           L   L +  +N   H LK      L              S P+  G+     +Y     +
Sbjct: 27  LTHSLSMIEFNTTHHLLKSTSTHSLSRFHRHKHHHHNQLSLPLSPGS-----DYTLSFNL 81

Query: 166 GTPPRQFSMVLDTGSDINWLQCRP--CTECYQQ----SDPIFDPKTSSSYS-PLPCAAPQ 218
           G   +  ++ +DTGSD+ W  C P  C  C  +    SDP   P T+ S+S P+ C +  
Sbjct: 82  GPHSQPITLYMDTGSDLVWFPCTPFNCILCELKPKLTSDP--SPPTNISHSTPISCNSHA 139

Query: 219 CK--------------------SLDVSACRANRCL-YQVAYGDGSFTVGDLVTETVSFGN 257
           C                     S++   C +  C  +  AYGDGS  +  L  +T+S  +
Sbjct: 140 CSVAHSSTPSSDLCTMAHCPLDSIETKDCGSFHCPPFYYAYGDGSL-IASLYRDTLSL-S 197

Query: 258 SGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS------LAYCLVDRD-- 309
           +  +     GC H     F    G+ G G G+LSL  Q+   S       +YCLV     
Sbjct: 198 TLQLTNFTFGCAHTT---FSEPTGVAGFGRGLLSLPAQLATHSPQLGNRFSYCLVSHSFR 254

Query: 310 -----SPASGVL-EFNSAR--GGDAVT----APLIRNKKVDTFYYVGLTGFSVGGQAVQI 357
                 P+  +L  +N  +   GD V       ++ N K   FY VGL G SVG + V  
Sbjct: 255 SERIRKPSPLILGRYNDEKQSNGDEVVEFVYTSMLENPKHSYFYTVGLKGISVGKKTVPA 314

Query: 358 PPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGN-------LKPTSGVAL 410
           P  L  +++ GDGG++VD GT  T L  + YNS+ + F R A         ++  +G++ 
Sbjct: 315 PKILRRVNKKGDGGVVVDSGTTFTMLPEKFYNSVVEGFDRRARKSNRRAPEIEQKTGLS- 373

Query: 411 FDTCYDFSGLRSVRVPTVSLHF-GAGKALDLPAKNYLIPV----------DSAGTFCFAF 459
              CY  +   +  VP V+L F G   ++ LP KNY              +  G   F  
Sbjct: 374 --PCYYLN--TAAIVPAVTLRFVGMNSSVVLPRKNYFYEFMDGGDGVRRKERVGCLMFMN 429

Query: 460 APTSSALS-----IIGNVQQQGTRVSFDLANNRVGFTPNKC 495
               + +S     ++GN QQQG  V +DL   RVGF   KC
Sbjct: 430 GGDEAEMSGGPGGVLGNYQQQGFEVEYDLEKKRVGFARRKC 470


>gi|225440722|ref|XP_002275223.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
 gi|147841923|emb|CAN65212.1| hypothetical protein VITISV_039022 [Vitis vinifera]
          Length = 458

 Score =  130 bits (327), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 122/459 (26%), Positives = 192/459 (41%), Gaps = 61/459 (13%)

Query: 84  LHSREILHKTRHNDYRSLV---LSRLERDSARVNTLITKLQLAIYNVDR-HELKPAEAQI 139
           L SR +L  +  N+  + +   L+     +     L+    LA  ++ R H LK  +A  
Sbjct: 14  LFSRLVLASSSKNNIPATITIPLTPTFTKNPSTEPLLFLQHLATASMSRSHHLKHGKASP 73

Query: 140 LPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQC---RPCTEC--- 193
           L +    P         G +   +  GTPP++ S ++DTGS + W  C     CT C   
Sbjct: 74  LIQTSLFP------HSHGGHTIPLSFGTPPQKLSFLVDTGSHVVWAPCTTHYTCTNCSFS 127

Query: 194 YQQSDPIFDPKTSSSYSPLPCAAPQCKS-------LDVSACRAN--RC-----LYQVAYG 239
             +  PIF+P+ SSS   L C  P+C +       L    C  N  +C      Y + YG
Sbjct: 128 NPKKVPIFNPELSSSDKILGCRDPKCANTSSPDVHLGCPRCNGNSKKCSHACPQYTLQYG 187

Query: 240 DGSFTVGDLVTETVSFGNSGSVKGIALGC--GHDNEGLFVGSAGLLGLGGGMLSLTKQIK 297
            G+ + G  + E + F    ++    +GC    D E     S  L G G  M SL  Q+ 
Sbjct: 188 TGAAS-GFFLLENLDFPGK-TIHKFLVGCTTSADRE---PSSDALAGFGRTMFSLPMQMG 242

Query: 298 ATSLAYCLVDRD---SPASG--VLEFNSARGGDAVTAPLIRNK-KVDTFYYVGLTGFSVG 351
               AYCL   D   +  SG  +L+++         AP ++N      +YY+G+    +G
Sbjct: 243 VKKFAYCLNSHDYDDTRNSGKLILDYSDGETQGLSYAPFLKNPPDYPFYYYLGVKDMKIG 302

Query: 352 GQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAY----NSLRDSFVRLAGNLKPTSG 407
            + ++IP           GG+++D G A   +    +    N L+    +   +L+  + 
Sbjct: 303 NKLLRIPGKYLTPGSDSRGGVMIDSGFAYGYMTLPVFKIVTNELKKQMSKYRRSLEAETQ 362

Query: 408 VALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCF---------- 457
             L   CY+F+G +S+++P +   F  G  + +P  NY +    A   CF          
Sbjct: 363 SGL-TPCYNFTGHKSIKIPDLIYQFTGGANMVVPGMNYFLLFSEASLGCFPVTTDSPTNN 421

Query: 458 -AFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
             F P  S   I+GN QQ    V FDL N R+GF    C
Sbjct: 422 LEFTPGPSI--ILGNYQQVDHYVEFDLKNERLGFRQQTC 458


>gi|21717157|gb|AAM76350.1|AC074196_8 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433294|gb|AAP54832.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
          Length = 396

 Score =  130 bits (327), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 101/353 (28%), Positives = 157/353 (44%), Gaps = 27/353 (7%)

Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQC-RPCTECYQQSDPIFDPKTSSSYSPLPCAAP 217
           Y   + +GTPP+  S ++D G ++ W QC + C  C++Q  P+FD   SS++ P PC A 
Sbjct: 51  YVVNLTIGTPPQPVSAIIDIGGELVWTQCAQHCRRCFKQDLPLFDTNASSTFRPEPCGAA 110

Query: 218 QCKSLDVSACRANRCLYQVAYGDGSF--TVGDLVTETVSFGNSGSVKGIALGCGHDNE-G 274
            C+S+   +C  +           SF  TVG + T+ V+ G + + + +A GC   +E  
Sbjct: 111 VCESIPTRSCAGDGGGACGYEASTSFGRTVGRIGTDAVAIGTAATAR-LAFGCAVASEMD 169

Query: 275 LFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSA-----RGGDAVTA 329
              GS+G +GLG   LSL  Q+ AT+ +YCL   D+  S  L   ++      G  A T 
Sbjct: 170 TMWGSSGSVGLGRTNLSLAAQMNATAFSYCLAPPDTGKSSALFLGASAKLAGAGKGAGTT 229

Query: 330 PLIR-----NKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQ 384
           P ++     N  +   Y + L     G   + +P S           I V   T +T L 
Sbjct: 230 PFVKTSTPPNSGLSRSYLLRLEAIRAGNATIAMPQS--------GNTITVSTATPVTALV 281

Query: 385 TQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKN 444
              Y  LR +     G       V  +D C+  +   S   P + L F  G  + +P  +
Sbjct: 282 DSVYRDLRKAVADAVGAAPVPPPVQNYDLCFPKAS-ASGGAPDLVLAFQGGAEMTVPVSS 340

Query: 445 YLIPVDSAGTFCFAF--APTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           YL    +  T C A   +P    +SI+G++QQ    + FDL    + F P  C
Sbjct: 341 YLFDAGN-DTACVAILGSPALGGVSILGSLQQVNIHLLFDLDKETLSFEPADC 392


>gi|359476195|ref|XP_002268758.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
 gi|296082174|emb|CBI21179.3| unnamed protein product [Vitis vinifera]
          Length = 460

 Score =  130 bits (326), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 117/396 (29%), Positives = 177/396 (44%), Gaps = 48/396 (12%)

Query: 124 IYNVDRHELKPAEAQIL-------PEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVL 176
           I+  DR  ++   A+I         +D  +P         G +   +G GTP ++F++++
Sbjct: 87  IFLQDRSRVRSINAKIFGQYSTQESKDGWSPESMDTLNEDGLFLVNVGFGTPQQKFNLII 146

Query: 177 DTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQV 236
           DTGSD  W+QC  C+     +   F+P  SSSYS   C      S D +        Y +
Sbjct: 147 DTGSDTTWIQCNSCSLGNCHNKKTFNPSLSSSYSNRSCIP----STDTN--------YTM 194

Query: 237 AYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGG----MLSL 292
            Y D S++ G  V + V+       K    GCG    G F  ++G+LGL  G    ++S 
Sbjct: 195 KYEDNSYSKGVFVCDEVTLKPDVFPK-FQFGCGDSGGGEFGTASGVLGLAKGEQYSLISQ 253

Query: 293 TKQIKATSLAYCLVDRDSPASGVLEFNSARGGDAVTA-PLIR-----NKKVDTFYYVGLT 346
           T        +YC   ++     +L      G  A++A P ++     N      Y+V L 
Sbjct: 254 TASKFKKKFSYCFPPKEHTLGSLL-----FGEKAISASPSLKFTQLLNPPSGLGYFVELI 308

Query: 347 GFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVR---LAGNLK 403
           G SV  + + +  SLF        G I+D GT ITRL T AY +LR +F +      ++ 
Sbjct: 309 GISVAKKRLNVSSSLF-----ASPGTIIDSGTVITRLPTAAYEALRTAFQQEMLHCPSIS 363

Query: 404 PTSGVALFDTCYDFSGL--RSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAP 461
           P     L DTCY+  G   R++++P + LHF     + L     L         C AFA 
Sbjct: 364 PPPQEKLLDTCYNLKGCGGRNIKLPEIVLHFVGEVDVSLHPSGILWANGDLTQACLAFAR 423

Query: 462 TS--SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            S  S ++IIGN QQ   +V +D+   R+GF  N C
Sbjct: 424 KSNPSHVTIIGNRQQVSLKVVYDIEGGRLGFG-NDC 458


>gi|10177232|dbj|BAB10606.1| protease-like protein [Arabidopsis thaliana]
          Length = 539

 Score =  130 bits (326), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 108/371 (29%), Positives = 174/371 (46%), Gaps = 41/371 (11%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYSP 211
           G Y++++ +GTPPR F + +DTGSD+ W+ C  C  C Q S        FDP +S + SP
Sbjct: 79  GLYYTKLRLGTPPRDFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTASP 138

Query: 212 LPCAAPQC----KSLDVSAC--RANRCLYQVAYGDGSFTVGDLVTETVSF----GNS--- 258
           + C+  +C    +S D S C  + N C Y   YGDGS T G  V++ + F    G+S   
Sbjct: 139 ISCSDQRCSWGIQSSD-SGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLVP 197

Query: 259 GSVKGIALGCGHDNEGLFVGS----AGLLGLGGGMLSLTKQIKATSLA-----YCLVDRD 309
            S   +  GC     G  V S     G+ G G   +S+  Q+ +  +A     +CL   +
Sbjct: 198 NSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCLKGEN 257

Query: 310 SPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGD 369
               G+L        + V  PL+ ++     Y V L   SV GQA+ I PS+F       
Sbjct: 258 G-GGGILVLGEIVEPNMVFTPLVPSQP---HYNVNLLSISVNGQALPINPSVFSTSNG-- 311

Query: 370 GGIIVDCGTAITRLQTQAYNSLRDSFVR-LAGNLKPTSGVALFDTCYDFSGLRSVRVPTV 428
            G I+D GT +  L   AY    ++    ++ +++P   V+  + CY  +       P V
Sbjct: 312 QGTIIDTGTTLAYLSEAAYVPFVEAITNAVSQSVRPV--VSKGNQCYVITTSVGDIFPPV 369

Query: 429 SLHFGAGKALDLPAKNYLIPVDSAG---TFCFAFAPT-SSALSIIGNVQQQGTRVSFDLA 484
           SL+F  G ++ L  ++YLI  ++ G    +C  F    +  ++I+G++  +     +DL 
Sbjct: 370 SLNFAGGASMFLNPQDYLIQQNNVGGTAVWCIGFQRIQNQGITILGDLVLKDKIFVYDLV 429

Query: 485 NNRVGFTPNKC 495
             R+G+    C
Sbjct: 430 GQRIGWANYDC 440


>gi|297812425|ref|XP_002874096.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297319933|gb|EFH50355.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 493

 Score =  130 bits (326), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 108/371 (29%), Positives = 175/371 (47%), Gaps = 41/371 (11%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYSP 211
           G Y+++I +G+PPR F + +DTGSD+ W+ C  C  C Q S        FDP +S + +P
Sbjct: 79  GLYYTKIRLGSPPRDFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTATP 138

Query: 212 LPCAAPQC----KSLDVSAC--RANRCLYQVAYGDGSFTVGDLVTETVSF----GNS--- 258
           + C+  +C    +S D S C  + N C Y   YGDGS T G  V++ + F    G+S   
Sbjct: 139 VSCSDQRCSWGIQSSD-SGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLVP 197

Query: 259 GSVKGIALGCGHDNEGLFVGS----AGLLGLGGGMLSLTKQIKATSLA-----YCLVDRD 309
            S   +  GC     G  V S     G+ G G   +S+  Q+ +  LA     +CL   +
Sbjct: 198 NSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGLAPRVFSHCLKGEN 257

Query: 310 SPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGD 369
               G+L        + V  PL+ ++     Y V L   SV GQA+ I PS+F       
Sbjct: 258 G-GGGILVLGEIVEPNMVFTPLVPSQP---HYNVNLLSISVNGQALPINPSVFSTSNG-- 311

Query: 370 GGIIVDCGTAITRLQTQAYNSLRDSFVR-LAGNLKPTSGVALFDTCYDFSGLRSVRVPTV 428
            G I+D GT +  L   AY    ++    ++ +++P   V+  + CY  +   +   P V
Sbjct: 312 QGTIIDTGTTLAYLSEAAYVPFVEAITNAVSQSVRPV--VSKGNQCYVIATSVADIFPPV 369

Query: 429 SLHFGAGKALDLPAKNYLIPVDSAG---TFCFAFAPT-SSALSIIGNVQQQGTRVSFDLA 484
           SL+F  G ++ L  ++YLI  ++ G    +C  F    +  ++I+G++  +     +DL 
Sbjct: 370 SLNFAGGASMFLNPQDYLIQQNNVGGTAVWCIGFQRIQNQGITILGDLVLKDKIFVYDLV 429

Query: 485 NNRVGFTPNKC 495
             R+G+    C
Sbjct: 430 GQRIGWANYDC 440


>gi|255571588|ref|XP_002526740.1| aspartic-type endopeptidase, putative [Ricinus communis]
 gi|223533929|gb|EEF35654.1| aspartic-type endopeptidase, putative [Ricinus communis]
          Length = 471

 Score =  130 bits (326), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 102/363 (28%), Positives = 167/363 (46%), Gaps = 35/363 (9%)

Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQCRP--CTECYQQSDPIFDPKTSSSYSPLPCAA 216
           Y  +  +G+PP +   + DTGS+I W+QC    CT CY+Q  P+F+P  SS+Y+   C  
Sbjct: 108 YVMKFNIGSPPVETYAIPDTGSNIVWIQCGSPICTNCYKQKIPLFNPTKSSTYAIRLCGH 167

Query: 217 PQCKSL-----DVSACRAN--RCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKG-----I 264
            +CK       +   C+++   C Y ++Y D SF+ G + T+ ++F    +  G     +
Sbjct: 168 RECKQALWGLGEYLGCKSSVQVCRYHISYEDHSFSEGTISTDIITFPEHIAEFGNYSLRM 227

Query: 265 ALGCGHDNEGL------FVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRD-SPASGVLE 317
             GCG++N            + G++GLG  M SL  Q+     +YC+   D    +G +E
Sbjct: 228 FFGCGYNNSETPGQDPNSFTAPGVVGLGNEMASLVGQLTLGQFSYCISTPDVQKPNGTIE 287

Query: 318 FNSARGGDAVT----APLIRNKKVDTFYYVGLTGFSVGGQAVQ-IPPSLFEMDEAGDGGI 372
               R G A +    +  + N     + +  + G  V    V+  P  +F+  E G GG+
Sbjct: 288 I---RFGLAASISGHSTALANNLEGWYIFQNVDGIYVDDTKVKGYPEWVFQFAEGGIGGL 344

Query: 373 IVDCGTAITRLQTQAYNSLRDSF---VRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVS 429
           I+D GT  T L   A ++L       + LA + +  S  + +  CY+ +      VP + 
Sbjct: 345 IMDSGTTYTELYFSALDALIGELKEQIELAPDTQDHSN-SNYSLCYNAANFLLTYVPAIE 403

Query: 430 LHFGAGKALDLPAKNYLIPVDSAG-TFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRV 488
           L F   K    P       +D+    +C A   T S +SIIG  Q +  ++ +DL  N V
Sbjct: 404 LKFTDNKEAYFPFTLRNAWIDNGNDQYCLAMFGT-SGISIIGIYQHRDIKIGYDLKYNLV 462

Query: 489 GFT 491
            FT
Sbjct: 463 SFT 465


>gi|30688682|ref|NP_197676.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|110736370|dbj|BAF00154.1| protease-like protein [Arabidopsis thaliana]
 gi|332005704|gb|AED93087.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 493

 Score =  130 bits (326), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 108/371 (29%), Positives = 174/371 (46%), Gaps = 41/371 (11%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYSP 211
           G Y++++ +GTPPR F + +DTGSD+ W+ C  C  C Q S        FDP +S + SP
Sbjct: 79  GLYYTKLRLGTPPRDFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTASP 138

Query: 212 LPCAAPQC----KSLDVSAC--RANRCLYQVAYGDGSFTVGDLVTETVSF----GNS--- 258
           + C+  +C    +S D S C  + N C Y   YGDGS T G  V++ + F    G+S   
Sbjct: 139 ISCSDQRCSWGIQSSD-SGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLVP 197

Query: 259 GSVKGIALGCGHDNEGLFVGS----AGLLGLGGGMLSLTKQIKATSLA-----YCLVDRD 309
            S   +  GC     G  V S     G+ G G   +S+  Q+ +  +A     +CL   +
Sbjct: 198 NSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCLKGEN 257

Query: 310 SPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGD 369
               G+L        + V  PL+ ++     Y V L   SV GQA+ I PS+F       
Sbjct: 258 G-GGGILVLGEIVEPNMVFTPLVPSQP---HYNVNLLSISVNGQALPINPSVFSTSNG-- 311

Query: 370 GGIIVDCGTAITRLQTQAYNSLRDSFVR-LAGNLKPTSGVALFDTCYDFSGLRSVRVPTV 428
            G I+D GT +  L   AY    ++    ++ +++P   V+  + CY  +       P V
Sbjct: 312 QGTIIDTGTTLAYLSEAAYVPFVEAITNAVSQSVRPV--VSKGNQCYVITTSVGDIFPPV 369

Query: 429 SLHFGAGKALDLPAKNYLIPVDSAG---TFCFAFAPT-SSALSIIGNVQQQGTRVSFDLA 484
           SL+F  G ++ L  ++YLI  ++ G    +C  F    +  ++I+G++  +     +DL 
Sbjct: 370 SLNFAGGASMFLNPQDYLIQQNNVGGTAVWCIGFQRIQNQGITILGDLVLKDKIFVYDLV 429

Query: 485 NNRVGFTPNKC 495
             R+G+    C
Sbjct: 430 GQRIGWANYDC 440


>gi|449476186|ref|XP_004154665.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
           2-like [Cucumis sativus]
          Length = 478

 Score =  129 bits (325), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 106/378 (28%), Positives = 170/378 (44%), Gaps = 42/378 (11%)

Query: 150 SGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPK 204
           +G    +G Y++RIG+G+PP  F + +DTGSDI W+ C  C+ C ++SD      +++PK
Sbjct: 64  NGHPAETGLYYARIGIGSPPNDFHVQVDTGSDILWVNCVGCSNCPKKSDIGVDLQLYNPK 123

Query: 205 TSSSYSPLPCAAPQCKSL---DVSACRAN-RCLYQVAYGDGSFTVGDLVTETVSF----G 256
           +SS+ + + C  P C +     +  C+ +  C Y+V YGDGS T G  V + +      G
Sbjct: 124 SSSTSTLITCDQPFCSATYDAPIPGCKPDLLCQYKVIYGDGSATAGYFVNDYIQLQRAVG 183

Query: 257 NSGSVK---GIALGCGHDNEGLFVGSA----GLLGLGGGMLSLTKQIKATS-----LAYC 304
           N  + +    I  GCG    G    S+    G+LG G    S+  Q+ AT       A+C
Sbjct: 184 NHKTSETNGSIVFGCGAKQSGELGSSSEALDGILGFGQANSSMISQLAATGKVKKIFAHC 243

Query: 305 LVDRDS-PASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFE 363
           L   DS    G+              P++ N+     Y V L G  VG  A+ +P  LFE
Sbjct: 244 L---DSISGGGIFAIGEVVEPKLXNTPVVPNQ---AHYNVVLNGVKVGDTALDLPLGLFE 297

Query: 364 MDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSV 423
              +   G I+D GT +  L    Y  L +  +    +LK  +    F TC+ F      
Sbjct: 298 T--SYKRGAIIDSGTTLAYLPESIYLPLMEKILGAQPDLKLRTVDDQF-TCFVFDKNVDD 354

Query: 424 RVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAF------APTSSALSIIGNVQQQGT 477
             PTV+  F     L +    YL  +     +C  +      +   + ++++G++  Q  
Sbjct: 355 GFPTVTFKFEESLILTIYPHEYLFQIRDD-VWCVGWQNSGAQSKDGNEVTLLGDLVLQNK 413

Query: 478 RVSFDLANNRVGFTPNKC 495
            V ++L N  +G+T   C
Sbjct: 414 LVYYNLENQTIGWTEYNC 431


>gi|125532792|gb|EAY79357.1| hypothetical protein OsI_34486 [Oryza sativa Indica Group]
          Length = 396

 Score =  129 bits (325), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 102/349 (29%), Positives = 165/349 (47%), Gaps = 19/349 (5%)

Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQC-RPCTECYQQSDPIFDPKTSSSYSPLPCAAP 217
           Y   + +GTPP+  S ++D G ++ W QC + C  C++Q  P+FD   SS++ P PC A 
Sbjct: 51  YVVNLTIGTPPQPVSAIIDIGGELVWTQCAQHCRRCFKQDLPLFDTNASSTFRPEPCGAA 110

Query: 218 QCKSLDVSACRANRCLYQVAYGDGSF--TVGDLVTETVSFGNSGSVKGIALGCGHDNE-G 274
            C+S+   +C  +           SF  TVG + T+ V+ G + + + +A GC   +E  
Sbjct: 111 VCESIPTRSCAGDGGGACGYEASTSFGRTVGRIGTDAVAIGTAATAR-LAFGCAVASEMD 169

Query: 275 LFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSA-----RGGDAVTA 329
              GS+G +GLG   LSL  Q+ AT+ +YCL   D+  S  L   ++      G  A T 
Sbjct: 170 TMWGSSGSVGLGRTNLSLAAQMNATAFSYCLAPPDTGKSSALFLGASAKLAGAGKGAGTT 229

Query: 330 PLIRNKKVDTFYYVGLT-GFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAY 388
           P +   K  T  + GL+  + +  +A++   +   M ++G+  I+V   T +T L    Y
Sbjct: 230 PFV---KTSTPPHSGLSRSYLLRLEAIRAGNATIAMPQSGN-TIMVSTATPVTALVDSVY 285

Query: 389 NSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIP 448
             LR +     G       V  +D C+  +   S   P + L F  G  + +P  +YL  
Sbjct: 286 RDLRKAVADAVGAAPVPPPVQNYDLCFPKAS-ASGGAPDLVLAFQGGAEMTVPVSSYLFD 344

Query: 449 VDSAGTFCFAF--APTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
             +  T C A   +P    +SI+G++QQ    + FDL    + F P  C
Sbjct: 345 AGN-DTACVAILGSPALGGVSILGSLQQVNIHLLFDLDKETLSFEPADC 392


>gi|168042409|ref|XP_001773681.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162675069|gb|EDQ61569.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 399

 Score =  129 bits (325), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 108/375 (28%), Positives = 168/375 (44%), Gaps = 47/375 (12%)

Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD------PIFDPKTSSSY 209
           +G Y+++I +GTPP  + + +DTGSD+ WL C PCT C  ++         +DP  SS+ 
Sbjct: 34  TGLYYTKIYLGTPPVGYYVQVDTGSDVTWLNCAPCTSCVTETQLPSIKLTTYDPSRSSTD 93

Query: 210 SPLPCAAPQC----KSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSF---GNSGSVK 262
             L C    C     S +VS   A  C Y   YGDGS T G  + + ++F    N+  V 
Sbjct: 94  GALSCRDSNCGAALGSNEVSCTSAGYCAYSTTYGDGSSTQGYFIQDVMTFQEIHNNTQVN 153

Query: 263 GIA---LGCGHDNEGLFVGSA----GLLGLGGGMLSLTKQIKA-----TSLAYCLVDRDS 310
           G A    GCG    G  + S+    GL+G G   +S+  Q+ +        A+CL   D+
Sbjct: 154 GTASVYFGCGTTQSGNLLMSSRALDGLIGFGQAAVSIPSQLASMGKVGNRFAHCL-QGDN 212

Query: 311 PASGVLEFNSARGGDAVTAPLI-RNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGD 369
              G +   S    +    P++ RN      Y VG+   +V G+ V  P S F+      
Sbjct: 213 QGGGTIVIGSVSEPNISYTPIVSRNH-----YAVGMQNIAVNGRNVTTPAS-FDTTSTSA 266

Query: 370 GGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLR-SVRVPTV 428
           GG+I+D GT +  L   AY      FV      + +S  +    C   +        PTV
Sbjct: 267 GGVIMDSGTTLAYLVDPAYT----QFVNAVSTFE-SSMFSSHSQCLQLAWCSLQADFPTV 321

Query: 429 SLHFGAGKALDLPAKNYLI--PVDSA-GTFCFAFAPTSS-----ALSIIGNVQQQGTRVS 480
            L F AG  ++L  +NYL   P+ +    +C  +  +++     + SI+G++  +   V 
Sbjct: 322 KLFFDAGAVMNLTPRNYLYSQPLQNGQAAYCMGWQKSTTKAGYLSYSILGDIVLKDHLVV 381

Query: 481 FDLANNRVGFTPNKC 495
           +D  N  VG+    C
Sbjct: 382 YDNDNRVVGWKSFDC 396


>gi|359476754|ref|XP_002277058.2| PREDICTED: aspartic proteinase-like protein 2-like isoform 1 [Vitis
           vinifera]
          Length = 561

 Score =  129 bits (323), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 102/377 (27%), Positives = 170/377 (45%), Gaps = 41/377 (10%)

Query: 150 SGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPK 204
           +G    +G YF++IG+GTP + + + +DTGSDI W+ C  C  C  +SD      ++D K
Sbjct: 146 NGHPSEAGLYFAKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRCPTKSDLGVDLTLYDMK 205

Query: 205 TSSSYSPLPCAAPQCKSLD--VSACRAN-RCLYQVAYGDGSFTVGDLVTETVSFGN-SGS 260
            S++   + C    C   D  +  C+   +CLY V YGDGS T G  V + V +   SG+
Sbjct: 206 ASTTSDAVGCDDNFCSLYDGPLPGCKPGLQCLYSVLYGDGSSTTGYFVQDFVQYNRISGN 265

Query: 261 VK------GIALGCGHDNEGLFVGSA----GLLGLGGGMLSLTKQIKATS-----LAYCL 305
            +       +  GCG+   G    S+    G+LG G    S+  Q+ ++       ++CL
Sbjct: 266 FQTTPTNGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCL 325

Query: 306 VDRDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMD 365
            + D    G+              PL++N+     Y V +    VGG  + +P   F   
Sbjct: 326 DNVD--GGGIFAIGEVVEPKVNITPLVQNQ---AHYNVVMKEIEVGGDPLDVPSDAF--- 377

Query: 366 EAGD-GGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVR 424
           E+GD  G I+D GT +     + Y  L +  +    +L+  +    F TC+D++G     
Sbjct: 378 ESGDRKGTIIDSGTTLAYFPQEVYVPLIEKILSQQPDLRLHTVEQAF-TCFDYTGNVDDG 436

Query: 425 VPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSA------LSIIGNVQQQGTR 478
            PTV+LHF    +L +    YL  V     +C  +  + +       L+++G++      
Sbjct: 437 FPTVTLHFDKSISLTVYPHEYLFQVKEF-EWCIGWQNSGAQTKDGKDLTLLGDLVLSNKL 495

Query: 479 VSFDLANNRVGFTPNKC 495
           V +DL    +G+    C
Sbjct: 496 VVYDLEKQGIGWVEYNC 512


>gi|242050744|ref|XP_002463116.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
 gi|241926493|gb|EER99637.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
          Length = 632

 Score =  129 bits (323), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 103/371 (27%), Positives = 173/371 (46%), Gaps = 56/371 (15%)

Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCA 215
           +G Y +R+ +GTPP++F++++D+GS + ++ C  C +C    DP F P  SSSYSP+ C 
Sbjct: 86  NGYYTTRLYIGTPPQEFALIVDSGSTVTYVPCASCEQCGNHQDPRFQPDLSSSYSPVKC- 144

Query: 216 APQCKSLDVSACRANR--CLYQVAYGDGSFTVGDLVTETVSFGNSGSVK--GIALGCGHD 271
                ++D + C +++  C Y+  Y + S + G L  + VSFG    +K      GC + 
Sbjct: 145 -----NVDCT-CDSDKKQCTYERQYAEMSSSSGVLGEDIVSFGRESELKPQRAVFGCENS 198

Query: 272 NEG-LFVGSA-GLLGLGGGMLSLTKQ-----IKATSLAYCLVDRD----------SPASG 314
             G LF   A G++GLG G LS+  Q     + + S + C    D           PA  
Sbjct: 199 ETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDIGGGAMVLGGVPAPS 258

Query: 315 VLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIV 374
            + F+ +   D + +P         +Y + L    V G+A+++   +F        G ++
Sbjct: 259 DMVFSHS---DPLRSP---------YYNIELKEIHVAGKALRVDSRVFNSKH----GTVL 302

Query: 375 DCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSG--VALFDTCYDFSGLRSVRV----PTV 428
           D GT    L  QA+ + +D+      +LK   G      D C+  +G    ++    P V
Sbjct: 303 DSGTTYAYLPEQAFVAFKDAVTSKVHSLKKIRGPDPNYKDICFAGAGRNVSKLHEVFPDV 362

Query: 429 SLHFGAGKALDLPAKNYLI---PVDSAGTFCFA-FAPTSSALSIIGNVQQQGTRVSFDLA 484
            + FG G+ L L  +NYL     VD  G +C   F       +++G +  + T V++D  
Sbjct: 363 DMVFGNGQKLSLTPENYLFRHSKVD--GAYCLGVFQNGKDPTTLLGGIIVRNTLVTYDRH 420

Query: 485 NNRVGFTPNKC 495
           N ++GF    C
Sbjct: 421 NEKIGFWKTNC 431


>gi|297829808|ref|XP_002882786.1| hypothetical protein ARALYDRAFT_478632 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297328626|gb|EFH59045.1| hypothetical protein ARALYDRAFT_478632 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 449

 Score =  129 bits (323), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 106/379 (27%), Positives = 166/379 (43%), Gaps = 42/379 (11%)

Query: 150 SGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQ-------QSDPIFD 202
           SG   G+ +YF+ + VGTP ++F +V+DTGS++ W+ CR     Y+       ++  +F 
Sbjct: 79  SGIDYGTAQYFTEVRVGTPAKKFRVVVDTGSELTWVNCR-----YRGRGKGKVKNRRVFR 133

Query: 203 PKTSSSYSPLPCAAPQCK-------SLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSF 255
            + S S+  + C    CK       SL      +  C Y   Y DGS   G    ET++ 
Sbjct: 134 AEESKSFKTVGCFTQTCKVDLMNLFSLSTCPTPSTPCSYDYRYADGSAAQGVFAKETITV 193

Query: 256 ----GNSGSVKGIALGCGHDNEGLFVGSA-GLLGLGGGMLSLTK---QIKATSLAYCLVD 307
               G    ++G+ +GC     G     A G+LGL     S T     +    L+YCLVD
Sbjct: 194 GLTNGRKARLRGLLVGCSSSFSGQSFQGADGVLGLAFSDFSFTSTATSLFGAKLSYCLVD 253

Query: 308 R--DSPASGVLEFNSARGGDAVTAPLIRNKKVDT-----FYYVGLTGFSVGGQAVQIPPS 360
              +   S  L F  +    +      R   +D      FY + + G S+G   + IP  
Sbjct: 254 HLSNKNISNYLIFGYSSSSTSTKTAPGRTTPLDLTLIPPFYAINIIGISIGDDMLDIPTQ 313

Query: 361 LFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVAL-FDTCY-DFS 418
           ++  D    GG I+D GT++T L   AY  +     R    LK      +  + C+   S
Sbjct: 314 VW--DATTGGGTILDSGTSLTLLAEAAYKPVVTGLARYLVELKRVKPEGIPIEYCFSSTS 371

Query: 419 GLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSA-GTFCFAFAPTSS-ALSIIGNVQQQG 476
           G    ++P ++ H   G   +   K+YL  VD+A G  C  F    + A +++GN+ QQ 
Sbjct: 372 GFNESKLPQLTFHLKGGARFEPHRKSYL--VDAAPGVKCLGFMSAGTPATNVVGNIMQQN 429

Query: 477 TRVSFDLANNRVGFTPNKC 495
               FDL  + + F P+ C
Sbjct: 430 YLWEFDLMASTLSFAPSTC 448


>gi|224068901|ref|XP_002326227.1| predicted protein [Populus trichocarpa]
 gi|222833420|gb|EEE71897.1| predicted protein [Populus trichocarpa]
          Length = 477

 Score =  129 bits (323), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 116/413 (28%), Positives = 187/413 (45%), Gaps = 54/413 (13%)

Query: 125 YNVDRHELKPAE----AQILPEDFSTPVVSGASQGS------GEYFSRIGVGTPPRQFSM 174
           + ++ H+L+  +    A++L + F   VV  + QGS      G YF+++ +G+PPR+F++
Sbjct: 23  HGLELHQLRARDRLRHARLL-QGFVGGVVDFSVQGSSDPYLVGLYFTKVKLGSPPREFNV 81

Query: 175 VLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYSPLPCAAPQCKS---LDVSA 226
            +DTGSD+ W+ C  C  C + S        FD  +SS+   + C+ P C S      + 
Sbjct: 82  QIDTGSDVLWVCCNSCNNCPRTSGLGIQLNFFDSSSSSTAGQVRCSDPICTSAVQTTATQ 141

Query: 227 C--RANRCLYQVAYGDGSFTVGDLVTETVSF----GNS---GSVKGIALGCGHDNEGLFV 277
           C  + ++C Y   YGDGS T G  V++T+ F    G S    S   I  GC     G   
Sbjct: 142 CSSQTDQCSYTFQYGDGSGTSGYYVSDTLYFDAILGQSLIDNSSALIVFGCSAYQSGDLT 201

Query: 278 GS----AGLLGLGGGMLSLTKQIKATSL-----AYCLVDRDSPASGVLEFNSARGGDAVT 328
            +     G+ G G G LS+  Q+    +     ++CL   D    G+L          V 
Sbjct: 202 KTDKAVDGIFGFGQGELSVISQLSTRGITPRVFSHCL-KGDGSGGGILVLGEILEPGIVY 260

Query: 329 APLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAY 388
           +PL+ ++     Y + L   +V GQ + I P+ F    +   G IVD GT +  L  +AY
Sbjct: 261 SPLVPSQP---HYNLNLLSIAVNGQLLPIDPAAFATSNS--QGTIVDSGTTLAYLVAEAY 315

Query: 389 NSLRDSFVRLAGNLKPTSGVALF---DTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNY 445
               D FV     +   S   +    + CY  S   S   P  S +F  G ++ L  ++Y
Sbjct: 316 ----DPFVSAVNAIVSPSVTPITSKGNQCYLVSTSVSQMFPLASFNFAGGASMVLKPEDY 371

Query: 446 LIPVDSAG---TFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           LIP  S+G    +C  F      ++I+G++  +     +DL   R+G+    C
Sbjct: 372 LIPFGSSGGSAMWCIGFQKV-QGVTILGDLVLKDKIFVYDLVRQRIGWANYDC 423


>gi|356570798|ref|XP_003553571.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 500

 Score =  129 bits (323), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 104/370 (28%), Positives = 166/370 (44%), Gaps = 39/370 (10%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYSP 211
           G YF+++ +G+P ++F + +DTGSDI W+ C  C+ C   S        FD   SS+ + 
Sbjct: 81  GLYFTKVKLGSPAKEFYVQIDTGSDILWINCITCSNCPHSSGLGIELDFFDTAGSSTAAL 140

Query: 212 LPCAAPQCK---SLDVSAC--RANRCLYQVAYGDGSFTVGDLVTETVSFGN--------S 258
           + C  P C        S C  +AN+C Y   YGDGS T G  V++T+ F          +
Sbjct: 141 VSCGDPICSYAVQTATSECSSQANQCSYTFQYGDGSGTTGYYVSDTMYFDTVLLGQSVVA 200

Query: 259 GSVKGIALGCGHDNEGLFVGS----AGLLGLGGGMLSLTKQIKATSL-----AYCLVDRD 309
            S   I  GC     G    +     G+ G G G LS+  Q+ +  +     ++CL   +
Sbjct: 201 NSSSTIIFGCSTYQSGDLTKTDKAVDGIFGFGPGALSVISQLSSRGVTPKVFSHCLKGGE 260

Query: 310 SPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGD 369
           +   GVL          V +PL+ ++     Y + L   +V GQ + I  ++F      +
Sbjct: 261 N-GGGVLVLGEILEPSIVYSPLVPSQP---HYNLNLQSIAVNGQLLPIDSNVFATTN--N 314

Query: 370 GGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNL-KPTSGVALFDTCYDFSGLRSVRVPTV 428
            G IVD GT +  L  +AYN    +         KP   ++  + CY  S       P V
Sbjct: 315 QGTIVDSGTTLAYLVQEAYNPFVKAITAAVSQFSKPI--ISKGNQCYLVSNSVGDIFPQV 372

Query: 429 SLHFGAGKALDLPAKNYLIP---VDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLAN 485
           SL+F  G ++ L  ++YL+    +D A  +C  F       +I+G++  +     +DLAN
Sbjct: 373 SLNFMGGASMVLNPEHYLMHYGFLDGAAMWCIGFQKVEQGFTILGDLVLKDKIFVYDLAN 432

Query: 486 NRVGFTPNKC 495
            R+G+    C
Sbjct: 433 QRIGWADYDC 442


>gi|224081804|ref|XP_002306494.1| predicted protein [Populus trichocarpa]
 gi|222855943|gb|EEE93490.1| predicted protein [Populus trichocarpa]
          Length = 564

 Score =  129 bits (323), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 103/369 (27%), Positives = 172/369 (46%), Gaps = 52/369 (14%)

Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCA 215
           +G Y +R+ +GTPP++F++++DTGS + ++ C  C +C +  DP F P  SS+Y  + C 
Sbjct: 10  NGYYTTRLWIGTPPQRFALIVDTGSSVTYVPCSSCEQCGRHQDPKFQPDLSSTYQSVKCN 69

Query: 216 APQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSV--KGIALGCGHDNE 273
              C   D       +C+Y+  Y + S + G L  + +SFGN  ++  +    GC +   
Sbjct: 70  I-DCNCDD----EKQQCVYERQYAEMSTSSGVLGEDIISFGNLSALAPQRAVFGCENMET 124

Query: 274 G-LFVGSA-GLLGLGGGMLSLTKQI-------KATSLAY---------CLVDRDSPASGV 315
           G L+   A G++G+G G LS+   +        + SL Y          ++   SP S +
Sbjct: 125 GDLYSQHADGIMGMGRGDLSIVDHLVDKGVINDSFSLCYGGMGIGGGAMVLGGISPPSNM 184

Query: 316 LEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVD 375
           +   S    D V +P         +Y + L    V G+ + + P++F+    G  G I+D
Sbjct: 185 VFSQS----DPVRSP---------YYNIDLKEIHVAGKPLPLNPTVFD----GKHGTILD 227

Query: 376 CGTAITRLQTQAYNSLRDSFVRLAGNLKPTSG--VALFDTCY-----DFSGLRSVRVPTV 428
            GT    L   A+ S +D+ ++   +LKP  G      D C+     D S L S   P V
Sbjct: 228 SGTTYAYLPEAAFVSFKDAIMKELHSLKPIRGPDPNYNDICFSGAGSDISQLSS-SFPAV 286

Query: 429 SLHFGAGKALDLPAKNYLIPVDSA-GTFCFA-FAPTSSALSIIGNVQQQGTRVSFDLANN 486
            + FG G+ L L  +NYL       G +C   F       +++G +  + T V +D  N+
Sbjct: 287 EMVFGNGQKLLLSPENYLFRHSKVHGAYCLGIFQNGKDPTTLLGGIVVRNTLVLYDRENS 346

Query: 487 RVGFTPNKC 495
           ++GF    C
Sbjct: 347 KIGFWKTNC 355


>gi|302817726|ref|XP_002990538.1| hypothetical protein SELMODRAFT_131679 [Selaginella moellendorffii]
 gi|300141706|gb|EFJ08415.1| hypothetical protein SELMODRAFT_131679 [Selaginella moellendorffii]
          Length = 434

 Score =  129 bits (323), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 118/392 (30%), Positives = 171/392 (43%), Gaps = 50/392 (12%)

Query: 131 ELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPC 190
           +LK +   +  E  + P ++G       YF+++ +GTPPR +++ +DTGSD+ W+ C PC
Sbjct: 14  KLKSSAVSLPVEGVADPYIAGL------YFTQVQLGTPPRTYNLQVDTGSDLLWVNCHPC 67

Query: 191 TECYQQSD---PI--FDPKTSSSYSPLPCAAPQC---KSLDVSACR-ANRCLYQVAYGDG 241
             C   SD   PI  +D K S+S S +PC+ P C     +  S C   N+C Y   YGDG
Sbjct: 68  IGCPAFSDLKIPIVPYDVKASASSSKVPCSDPSCTLITQISESGCNDQNQCGYSFQYGDG 127

Query: 242 SFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSA----GLLGLGGGMLSLTKQIK 297
           S T+G LV + + +  + +   +  GCG    G    S     G++G G   LS   Q+ 
Sbjct: 128 SGTLGYLVEDVLHYMVNATAT-VIFGCGFKQSGDLSTSERALDGIIGFGASDLSFNSQLA 186

Query: 298 ATS-----LAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGG 352
                    A+CL D      G+L   +    D    PL+      + Y V L   SV  
Sbjct: 187 KQGKTPNVFAHCL-DGGERGGGILVLGNVIEPDIQYTPLVPYM---SHYNVVLQSISVNN 242

Query: 353 QAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFD 412
             + I P LF  D     G I D GT +  L  +AY +   + V L         VA F 
Sbjct: 243 ANLTIDPKLFSNDVM--QGTIFDSGTTLAYLPDEAYQAFTQA-VSLV--------VAPFL 291

Query: 413 TC-YDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGT---FCFAFAPTSSALS- 467
            C    S       P V L+F  G ++ L    YLI   SA     +C  +    SA S 
Sbjct: 292 LCDTRLSRFIYKLFPNVVLYF-EGASMTLTPAEYLIRQASAANAPIWCMGWQSMGSAESE 350

Query: 468 ----IIGNVQQQGTRVSFDLANNRVGFTPNKC 495
               I G++  +   V +DL   R+G+ P  C
Sbjct: 351 LQYTIFGDLVLKNKLVVYDLERGRIGWRPFDC 382


>gi|356513697|ref|XP_003525547.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 252

 Score =  129 bits (323), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 84/231 (36%), Positives = 125/231 (54%), Gaps = 24/231 (10%)

Query: 97  DYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGS 156
           D+   +  +L  D  RV ++  +++        H ++ ++ QI       P+ SG +  +
Sbjct: 13  DWNRRLQKQLILDDLRVRSMQNRIRRV---ASTHNVEASQTQI-------PLSSGINLQT 62

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAA 216
             Y   +G+G+  +  ++++DT SD+ W+QC PC  CY Q  PIF P TSSSY  + C +
Sbjct: 63  LNYIVTMGLGS--KNMTVIIDTRSDLTWVQCEPCMSCYNQQGPIFKPSTSSSYQSVSCNS 120

Query: 217 PQCKSL-----DVSACRANR---CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGC 268
             C+SL     +  AC ++    C Y V YGDGS+T GDL  E +SFG   SV     GC
Sbjct: 121 STCQSLQFATGNTGACGSSNPSTCNYVVNYGDGSYTNGDLGVEALSFGGV-SVSDFVFGC 179

Query: 269 GHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASGVL 316
           G +N+GLF G +GL+GLG   LSL  Q  AT     +YCL   ++ +SG L
Sbjct: 180 GRNNKGLFGGVSGLMGLGRSYLSLVSQTNATFGGVFSYCLPTTEAGSSGSL 230


>gi|242074844|ref|XP_002447358.1| hypothetical protein SORBIDRAFT_06g033560 [Sorghum bicolor]
 gi|241938541|gb|EES11686.1| hypothetical protein SORBIDRAFT_06g033560 [Sorghum bicolor]
          Length = 497

 Score =  129 bits (323), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 123/424 (29%), Positives = 179/424 (42%), Gaps = 74/424 (17%)

Query: 130 HELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWL---- 185
           H+  PA A + P  +            G Y     +GTPP+   ++LDTGS + W+    
Sbjct: 86  HKSIPATAALYPHSY------------GGYAFTASLGTPPQPLPVLLDTGSQLTWVPCTS 133

Query: 186 --QCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQC----KSLDVSACRA---------- 229
              CR C+  +  + P+F PK SSS   + C  P C     +  V+ CRA          
Sbjct: 134 NYDCRNCSSPFAAAVPVFHPKNSSSSRLVGCRNPSCLWVHSAEHVAKCRAPCSRGANCTP 193

Query: 230 --NRCL-YQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLG 286
             N C  Y V YG GS T G L+ +T+      +V G  LGC      +    +GL G G
Sbjct: 194 ASNVCPPYAVVYGSGS-TAGLLIADTLR-APGRAVSGFVLGC--SLVSVHQPPSGLAGFG 249

Query: 287 GGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGD---AVTAPLIRNKKVD----- 338
            G  S+  Q+  +  +YCL+ R    +  +  +   GGD       PL+++   D     
Sbjct: 250 RGAPSVPAQLGLSKFSYCLLSRRFDDNAAVSGSLVLGGDNDGMQYVPLVKSAAGDKQPYA 309

Query: 339 TFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRL 398
            +YY+ L+G +VGG+AV++P   F  + AG GG IVD GT  T L    +  + D+ V  
Sbjct: 310 VYYYLALSGVTVGGKAVRLPARAFAANAAGSGGAIVDSGTTFTYLDPTVFQPVADAVVAA 369

Query: 399 AGNLKPTS-----GVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSA- 452
            G     S     G+ L        G +S+ +P +SLHF  G  + LP +NY +    A 
Sbjct: 370 VGGRYKRSKDVEEGLGLHPCFALPQGAKSMALPELSLHFKGGAVMQLPLENYFVVAGRAP 429

Query: 453 -----------GTFCFAFA----------PTSSALSIIGNVQQQGTRVSFDLANNRVGFT 491
                         C A                   I+G+ QQQ   V +DL   R+GF 
Sbjct: 430 VPGAGAGAGAAEAICLAVVTDFGGSGAGDEGGGPAIILGSFQQQNYLVEYDLEKERLGFR 489

Query: 492 PNKC 495
              C
Sbjct: 490 RQPC 493


>gi|296087864|emb|CBI35120.3| unnamed protein product [Vitis vinifera]
          Length = 320

 Score =  129 bits (323), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 101/329 (30%), Positives = 155/329 (47%), Gaps = 19/329 (5%)

Query: 176 LDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQ 235
           +DT SD+ W+ C  C  C   S  +F+   S++Y  L C A QCK +    C    C + 
Sbjct: 1   MDTSSDVAWIPCNGCLGC---SSTLFNSPASTTYKSLGCQAAQCKQVPKPTCGGGVCSFN 57

Query: 236 VAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGS---AGLLGLGGGMLSL 292
           + YG GS    +L  +T++   + +V G + GC     G  + +    GL      +LS 
Sbjct: 58  LTYG-GSSLAANLSQDTITLA-TDAVPGYSFGCIQKATGGSLPAQGLLGLGRGPLSLLSQ 115

Query: 293 TKQIKATSLAYCLVDRDSPA-SGVLEFNSARGGDAVT-APLIRNKKVDTFYYVGLTGFSV 350
           T+ +  ++ +YCL    S   SG L          +   PL++N +  + Y+V L    V
Sbjct: 116 TQNLYQSTFSYCLPSFKSLNFSGSLRLGPVGQPKRIKYTPLLKNPRRPSLYFVNLMAVRV 175

Query: 351 GGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVAL 410
           G + V +PP  F  + +   G I D GT  TRL T AY ++RD+F    G     + +  
Sbjct: 176 GRRVVDVPPGSFTFNPSTGAGTIFDSGTVFTRLVTPAYIAVRDAFRNRVGRNLTVTSLGG 235

Query: 411 FDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAP----TSSAL 466
           FDTCY       +  PT++  F  G  + LP  N LI   +  T C A A      +S L
Sbjct: 236 FDTCYTV----PIAAPTITFMF-TGMNVTLPPDNLLIHSTAGSTTCLAMAAAPDNVNSVL 290

Query: 467 SIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           ++I N+QQQ  R+ +D+ N+R+G     C
Sbjct: 291 NVIANLQQQNHRLLYDVPNSRLGVARELC 319


>gi|218184943|gb|EEC67370.1| hypothetical protein OsI_34481 [Oryza sativa Indica Group]
          Length = 367

 Score =  128 bits (322), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 98/352 (27%), Positives = 155/352 (44%), Gaps = 39/352 (11%)

Query: 165 VGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDV 224
           +GTPP+  S  +D   ++ W QC  C  C++Q  P+F P  SS++ P PC    CKS+  
Sbjct: 30  IGTPPQAASAFIDLTGELVWTQCSQCIHCFKQDLPVFVPNASSTFKPEPCGTDVCKSIPT 89

Query: 225 SACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGC----GHDNEGLFVGSA 280
             C ++ C +    G G  TVG + T+T + G +     +  GC      D  G   G +
Sbjct: 90  PKCASDVCAFDGVTGLGGHTVGIVATDTFAIGTAAPAS-LGFGCVVASDIDTMG---GPS 145

Query: 281 GLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSA---RGGDAVTAPLIR---N 334
           G +GLG    SL  Q+K T  +YCL   D+  +  L   ++    GG A T P ++   N
Sbjct: 146 GFIGLGRTPWSLVAQMKLTRFSYCLAPHDTGKNSRLFLGASAKLAGGGAWT-PFVKTSPN 204

Query: 335 KKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITR---LQTQAYNSL 391
             +  +Y + L     G   + +P         G   ++V   TA+ R   L    Y   
Sbjct: 205 DGMSQYYPIELEEIKAGDATITMP--------RGRNTVLVQ--TAVVRVSLLVDSVYQEF 254

Query: 392 RDSFVRLAGNLKPTSGVAL-FDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVD 450
           + + +   G     + V   F+ C+  +G+     P +   F AG AL +P  NYL  V 
Sbjct: 255 KKAVMASVGAAPTATPVGEPFEVCFPKAGVSG--APDLVFTFQAGAALTVPPANYLFDVG 312

Query: 451 SAGTFCFAFAPTS-------SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           +  T C +    +         L+I+G+ QQ+   + FDL  + + F P  C
Sbjct: 313 N-DTVCLSVMSIALLNITALDGLNILGSFQQENVHLLFDLDKDMLSFEPADC 363


>gi|326515330|dbj|BAK03578.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 445

 Score =  128 bits (322), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 108/368 (29%), Positives = 168/368 (45%), Gaps = 48/368 (13%)

Query: 163 IGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSL 222
           IG G     + +VLDT S + W++C  C    +Q  P+FDP  SSSY PL   +P C++ 
Sbjct: 80  IGTGRGKSTYFLVLDTASSLPWMRCAHCLPVQRQRSPVFDPSDSSSYRPLHPTSPLCRAP 139

Query: 223 DVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGN-SGSVKGIALGC-----GHDNEGLF 276
           +      ++C + +  G+    VG   T+T+  GN +  +  +A GC     G D +G F
Sbjct: 140 NPVLPAGDKCSFHLP-GEAHGYVG---TDTIILGNPTLPIHSVAFGCAQSTEGFDTKGTF 195

Query: 277 VGSAGLLGLGGGMLSLTKQIK---ATSLAYCLVDR-DSPA-SGVLEFNS----------A 321
              AG LG+G    SL  QIK    +  +YCL+    SP  +G + F +           
Sbjct: 196 ---AGTLGMGKLPTSLIMQIKDRVGSRFSYCLIGLGHSPGRNGFIRFGADIPDPTLLVHH 252

Query: 322 RGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQ-IPPSLFEMDEAGDGGIIVDCGTAI 380
           R     T P + +   D+ YYV L G S+ G  +  I  ++FE    G GG  VD GT +
Sbjct: 253 RIKILPTPPHLPHGVADSAYYVKLLGISLNGTPIPGIRQAMFERRSDGSGGCFVDAGTQV 312

Query: 381 TRLQTQAYNSLRDSFVRLAGNL------KPTSGVALFDTCY-DFSGLRSVRVPTVSLHFG 433
           T L   AY  + ++   +           P      F  C+ +  G+ S  +P ++L F 
Sbjct: 313 THLVPAAYAVVEEAVAHMVQQWGYKRVRDPN-----FSLCFREHPGIWS-HIPKLTLDFE 366

Query: 434 AGKA-----LDLPAKNYLIPVDSAGTFCFAFAPTS-SALSIIGNVQQQGTRVSFDLANNR 487
              +     L++ ++N  + VD+    CF    TS  + +++G +QQ  TR  FDL  N 
Sbjct: 367 GPASRTVAHLEIVSRNLFLKVDNQPLVCFGVYRTSRGSPTVVGAMQQVDTRFIFDLHANT 426

Query: 488 VGFTPNKC 495
           + F    C
Sbjct: 427 ITFHRESC 434


>gi|356498789|ref|XP_003518231.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 446

 Score =  128 bits (322), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 122/410 (29%), Positives = 168/410 (40%), Gaps = 58/410 (14%)

Query: 122 LAIYNVDRHELKPAEAQILPEDFSTPVVSGA--SQGSGEYFSRIGVGTPPRQFSMVLDTG 179
           L+ YN           +IL + FS   +S    S     +     +G PP     V+DTG
Sbjct: 54  LSPYNSKDTIWDHYSHKILKQTFSNDYISNLVPSPRYVVFLMNFSIGEPPIPQLAVMDTG 113

Query: 180 SDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAY- 238
           S + W+ C PC+ C QQS PIFDP  SS+YS L C+  +C   DV       C Y V Y 
Sbjct: 114 SSLTWVMCHPCSSCSQQSVPIFDPSKSSTYSNLSCS--ECNKCDVV---NGECPYSVEYV 168

Query: 239 GDGS----FTVGDLVTETVSFGNSGSVKGIALGCGHD-----NEGLFVGSAGLLGLGGGM 289
           G GS    +    L  ET+   +   V  +  GCG       N   + G  G+ GLG G 
Sbjct: 169 GSGSSQGIYAREQLTLETID-ESIIKVPSLIFGCGRKFSISSNGYPYQGINGVFGLGSGR 227

Query: 290 LSLTKQIKATSLAYCLVDRDSPASG----VLEFNSARGGDAVTAPLIRNKKVDTFYYVGL 345
            SL         +YC+ +  +        VL   +   GD+ T  +I     +  YYV L
Sbjct: 228 FSLLPSF-GKKFSYCIGNLRNTNYKFNRLVLGDKANMQGDSTTLNVI-----NGLYYVNL 281

Query: 346 TGFSVGGQAVQIPPSLFEMD-EAGDGGIIVDCGTAITRLQTQAY-------NSLRDSFVR 397
              S+GG+ + I P+LFE      + G+I+D G   T L    +        +L +  + 
Sbjct: 282 EAISIGGRKLDIDPTLFERSITDNNSGVIIDSGADHTWLTKYGFEVLSFEVENLLEGVLV 341

Query: 398 LAGNLKPTSGVALFDTCY------DFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDS 451
           LA   K       +  CY      D SG      P V+ HF  G  LDL   +  I   +
Sbjct: 342 LAQQDKHNP----YTLCYSGVVSQDLSGF-----PLVTFHFAEGAVLDLDVTSMFIQT-T 391

Query: 452 AGTFCFAFAPTS------SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
              FC A  P +       + S IG + QQ   V +DL   RV F    C
Sbjct: 392 ENEFCMAMLPGNYFGDDYESFSSIGMLAQQNYNVGYDLNRMRVYFQRIDC 441


>gi|356542694|ref|XP_003539801.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 489

 Score =  128 bits (321), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 111/373 (29%), Positives = 179/373 (47%), Gaps = 44/373 (11%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYSP 211
           G Y++++ +GTPPR+F + +DTGSD+ W+ C  C  C Q S        FDP++SS+ S 
Sbjct: 75  GLYYTKVKLGTPPREFYVQIDTGSDVLWVSCGSCNGCPQTSGLQIQLNYFDPRSSSTSSL 134

Query: 212 LPCAAPQCKS----LDVS-ACRANRCLYQVAYGDGSFTVGDLVTETVSFG-------NSG 259
           + C+  +C+S     D S + + N+C Y   YGDGS T G  V++ + F         + 
Sbjct: 135 ISCSDRRCRSGVQTSDASCSSQNNQCTYTFQYGDGSGTSGYYVSDLMHFAGIFEGTLTTN 194

Query: 260 SVKGIALGCGHDNEGLFVGSA----GLLGLGGGMLSLTKQIKATSLA-----YCLVDRDS 310
           S   +  GC     G    S     G+ G G   +S+  Q+    +A     +CL   D+
Sbjct: 195 SSASVVFGCSILQTGDLTKSERAVDGIFGFGQQGMSVISQLSLQGIAPRVFSHCL-KGDN 253

Query: 311 PASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDG 370
              GVL        + V +PL++++     Y + L   SV GQ V I P++F    + + 
Sbjct: 254 SGGGVLVLGEIVEPNIVYSPLVQSQP---HYNLNLQSISVNGQIVPIAPAVFA--TSNNR 308

Query: 371 GIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALF---DTCYDFSGLRSVRV-P 426
           G IVD GT +  L  +AYN     FV     L P S  ++    + CY  +   +V + P
Sbjct: 309 GTIVDSGTTLAYLAEEAYN----PFVNAITALVPQSVRSVLSRGNQCYLITTSSNVDIFP 364

Query: 427 TVSLHFGAGKALDLPAKNYLIPVDSAG---TFCFAFAPT-SSALSIIGNVQQQGTRVSFD 482
            VSL+F  G +L L  ++YL+  +  G    +C  F      +++I+G++  +     +D
Sbjct: 365 QVSLNFAGGASLVLRPQDYLMQQNYIGEGSVWCIGFQRIPGQSITILGDLVLKDKIFVYD 424

Query: 483 LANNRVGFTPNKC 495
           LA  R+G+    C
Sbjct: 425 LAGQRIGWANYDC 437


>gi|414866064|tpg|DAA44621.1| TPA: putative aspartic protease family protein [Zea mays]
          Length = 454

 Score =  128 bits (321), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 117/387 (30%), Positives = 166/387 (42%), Gaps = 65/387 (16%)

Query: 163 IGVGTPPRQFSMVLDTGSDINWLQC---RPCTECYQQSDPIFDPKTSSSYSPLPCAAPQC 219
           + VG PP+  +MVLDTGS+++WL+C   R  +    Q+   F+   SS+Y+   C++P+C
Sbjct: 66  VAVGAPPQNVTMVLDTGSELSWLRCNGSRVPSTPPPQAPAAFNGSASSTYAAAHCSSPEC 125

Query: 220 ----KSLDV----SACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGC--- 268
               + L V    +   +N C   ++Y D S   G L  +T   G +  V+ +  GC   
Sbjct: 126 QWRGRDLPVPPFCAGPPSNSCRVSLSYADASSADGILAADTFLLGGAPPVRAL-FGCVTS 184

Query: 269 --------GHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNS 320
                     D+E     + GLLG+  G LS   Q      AYC+   D P   VL    
Sbjct: 185 YSSATATNSSDSEA----ATGLLGMNRGSLSFVTQTATLRFAYCIAPGDGPGLLVL---- 236

Query: 321 ARGGDAVT-------APLIR-NKKVDTF----YYVGLTGFSVGGQAVQIPPSLFEMDEAG 368
             GGD           PLI+ ++ +  F    Y V L G  VG   + IP S+   D  G
Sbjct: 237 --GGDGAALAPQLNYTPLIQISRPLPYFDRVAYSVQLEGIRVGAALLPIPKSVLAPDHTG 294

Query: 369 DGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVA------LFDTCYDFSGLR- 421
            G  +VD GT  T L   AY  L+  F+     L    G +       FD C+  S  R 
Sbjct: 295 AGQTMVDSGTQFTFLLADAYAPLKGEFLNQTSALLAPLGESDFVFQGAFDACFRASEARV 354

Query: 422 ---SVRVPTVSLHF-GAGKALDLPAKNYLIPVDSAG------TFCFAFAPTSSA---LSI 468
              S  +P V L   GA  A+      Y +P +  G       +C  F  +  A     +
Sbjct: 355 AAASQMLPEVGLVLRGAEVAVGGEKLLYRVPGERRGEGGAEAVWCLTFGNSDMAGMSAYV 414

Query: 469 IGNVQQQGTRVSFDLANNRVGFTPNKC 495
           IG+  QQ   V +DL N RVGF P +C
Sbjct: 415 IGHHHQQNVWVEYDLQNGRVGFAPARC 441


>gi|125575541|gb|EAZ16825.1| hypothetical protein OsJ_32297 [Oryza sativa Japonica Group]
          Length = 416

 Score =  128 bits (321), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 102/354 (28%), Positives = 152/354 (42%), Gaps = 52/354 (14%)

Query: 165 VGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDV 224
           +GTPP+  S ++D           PC+           P  SS++ P PC    CKS+  
Sbjct: 73  IGTPPQPASAIIDVAGP------APCSF----------PNASSTFRPEPCGTDACKSIPT 116

Query: 225 SACRANRCLYQVAYGD--GSFTVGDLVTETVSFGNSGSVKGIALGC----GHDNEGLFVG 278
           S C +N C Y+       G  T+G + T+T + G   +   +  GC    G D  G   G
Sbjct: 117 SNCSSNMCTYEGTINSKLGGHTLGIVATDTFAIGT--ATASLGFGCVVASGIDTMG---G 171

Query: 279 SAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNS----ARGGDAVTAPLIRN 334
            +GL+GLG    SL  Q+  T  +YCL   DS  +  L   S    A GG++ T P ++ 
Sbjct: 172 PSGLIGLGRAPSSLVSQMNITKFSYCLTPHDSGKNSRLLLGSSAKLAGGGNSTTTPFVKT 231

Query: 335 KKVD---TFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSL 391
              D    +Y + L G   G  A+ +PPS           ++V     ++ L   AY +L
Sbjct: 232 SPGDDMSQYYPIQLDGIKAGDAAIALPPS--------GNTVLVQTLAPMSFLVDSAYQAL 283

Query: 392 RDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAG-KALDLPAKNYLIPV- 449
           +    +  G     + +  FD C+  +GL +   P +   F  G  AL +P   YLI V 
Sbjct: 284 KKEVTKAVGAAPTATPLQPFDLCFPKAGLSNASAPDLVFTFQQGAAALTVPPPKYLIDVG 343

Query: 450 DSAGTFCFAFAPTS--------SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           +  GT C A   TS          L+I+G++QQ+ T    DL    + F P  C
Sbjct: 344 EEKGTVCMAILSTSWLNTTALDENLNILGSLQQENTHFLLDLEKKTLSFEPADC 397


>gi|297735249|emb|CBI17611.3| unnamed protein product [Vitis vinifera]
          Length = 480

 Score =  128 bits (321), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 102/377 (27%), Positives = 170/377 (45%), Gaps = 41/377 (10%)

Query: 150 SGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPK 204
           +G    +G YF++IG+GTP + + + +DTGSDI W+ C  C  C  +SD      ++D K
Sbjct: 65  NGHPSEAGLYFAKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRCPTKSDLGVDLTLYDMK 124

Query: 205 TSSSYSPLPCAAPQCKSLD--VSACRAN-RCLYQVAYGDGSFTVGDLVTETVSFGN-SGS 260
            S++   + C    C   D  +  C+   +CLY V YGDGS T G  V + V +   SG+
Sbjct: 125 ASTTSDAVGCDDNFCSLYDGPLPGCKPGLQCLYSVLYGDGSSTTGYFVQDFVQYNRISGN 184

Query: 261 VK------GIALGCGHDNEGLFVGSA----GLLGLGGGMLSLTKQIKATS-----LAYCL 305
            +       +  GCG+   G    S+    G+LG G    S+  Q+ ++       ++CL
Sbjct: 185 FQTTPTNGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCL 244

Query: 306 VDRDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMD 365
            + D    G+              PL++N+     Y V +    VGG  + +P   F   
Sbjct: 245 DNVD--GGGIFAIGEVVEPKVNITPLVQNQ---AHYNVVMKEIEVGGDPLDVPSDAF--- 296

Query: 366 EAGD-GGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVR 424
           E+GD  G I+D GT +     + Y  L +  +    +L+  +    F TC+D++G     
Sbjct: 297 ESGDRKGTIIDSGTTLAYFPQEVYVPLIEKILSQQPDLRLHTVEQAF-TCFDYTGNVDDG 355

Query: 425 VPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSA------LSIIGNVQQQGTR 478
            PTV+LHF    +L +    YL  V     +C  +  + +       L+++G++      
Sbjct: 356 FPTVTLHFDKSISLTVYPHEYLFQVKEF-EWCIGWQNSGAQTKDGKDLTLLGDLVLSNKL 414

Query: 479 VSFDLANNRVGFTPNKC 495
           V +DL    +G+    C
Sbjct: 415 VVYDLEKQGIGWVEYNC 431


>gi|308081797|ref|NP_001182920.1| uncharacterized protein LOC100501208 [Zea mays]
 gi|238008190|gb|ACR35130.1| unknown [Zea mays]
 gi|413922182|gb|AFW62114.1| hypothetical protein ZEAMMB73_927324 [Zea mays]
          Length = 269

 Score =  127 bits (320), Expect = 9e-27,   Method: Compositional matrix adjust.
 Identities = 79/267 (29%), Positives = 127/267 (47%), Gaps = 18/267 (6%)

Query: 244 TVGDLVTETVSFGNSGSVKG-IALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLA 302
           + G L TET +FG   +    +  GCG    G   G++G++G+  G LS+ KQ+  T  +
Sbjct: 3   STGVLATETFTFGAHQNFSANLTFGCGKLTNGTIAGASGIMGVSPGPLSVLKQLSITKFS 62

Query: 303 YCLVDRDSPASGVLEFNSARG-------GDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAV 355
           YCL       +  + F +          G   T PL++N   D +YYV + G S+G + +
Sbjct: 63  YCLTPFTDHKTSPVMFGAMADLGKYKTTGKVQTIPLLKNPVEDIYYYVPMVGISIGSKRL 122

Query: 356 QIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFD--T 413
            +P ++  +   G GG ++D  T +  L   A+  L+ + +   G   P +  ++ D   
Sbjct: 123 DVPEAILALRPDGTGGTVLDSATTLAYLVEPAFKELKKAVME--GMKLPAANRSIDDYPV 180

Query: 414 CYDFS---GLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAF--APTSSALSI 468
           C++      +  V+VP + LHF     + LP  +Y     S G  C A   AP   A ++
Sbjct: 181 CFELPRGMSMEGVQVPPLVLHFAGDAEMSLPRDSYF-QEPSPGMMCLAVMQAPFEGAPNV 239

Query: 469 IGNVQQQGTRVSFDLANNRVGFTPNKC 495
           IGNVQQQ   V +DL N +  + P KC
Sbjct: 240 IGNVQQQNMHVLYDLGNRKFSYAPTKC 266


>gi|302803839|ref|XP_002983672.1| hypothetical protein SELMODRAFT_118648 [Selaginella moellendorffii]
 gi|300148509|gb|EFJ15168.1| hypothetical protein SELMODRAFT_118648 [Selaginella moellendorffii]
          Length = 388

 Score =  127 bits (320), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 118/392 (30%), Positives = 170/392 (43%), Gaps = 50/392 (12%)

Query: 131 ELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPC 190
           +LK +   +  E  + P ++G       YF+++ +GTPPR +++ +DTGSD+ W+ C PC
Sbjct: 14  KLKSSAVSLPVEGVADPYIAGL------YFTQVQLGTPPRTYNLQVDTGSDLLWVNCHPC 67

Query: 191 TECYQQSD---PI--FDPKTSSSYSPLPCAAPQC---KSLDVSACR-ANRCLYQVAYGDG 241
             C   SD   PI  +D K S+S S +PC+ P C     +  S C   N+C Y   YGDG
Sbjct: 68  IGCPAFSDLKIPIVPYDVKASASSSKVPCSDPSCTLITQISESGCNDQNQCGYSFQYGDG 127

Query: 242 SFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSA----GLLGLGGGMLSLTKQIK 297
           S T+G LV + + +  + +   +  GCG    G    S     G++G G   LS   Q+ 
Sbjct: 128 SGTLGYLVEDVLHYMVNATAT-VIFGCGFKQSGDLSTSERALDGIIGFGASDLSFNSQLA 186

Query: 298 ATS-----LAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGG 352
                    A+CL D      G+L   +    D    PL+        Y V L   SV  
Sbjct: 187 KQGKTPNVFAHCL-DGGERGGGILVLGNVIEPDIQYTPLVPYMY---HYNVVLQSISVNN 242

Query: 353 QAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFD 412
             + I P LF  D     G I D GT +  L  +AY +   + V L         VA F 
Sbjct: 243 ANLTIDPKLFSNDVM--QGTIFDSGTTLAYLPDEAYQAFTQA-VSLV--------VAPFL 291

Query: 413 TC-YDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGT---FCFAFAPTSSALS- 467
            C    S       P V L+F  G ++ L    YLI   SA     +C  +    SA S 
Sbjct: 292 LCDTRLSRFIYKLFPNVVLYF-EGASMTLTPAEYLIRQASAANAPIWCMGWQSMGSAESE 350

Query: 468 ----IIGNVQQQGTRVSFDLANNRVGFTPNKC 495
               I G++  +   V +DL   R+G+ P  C
Sbjct: 351 LQYTIFGDLVLKNKLVVYDLERGRIGWRPFDC 382


>gi|4415912|gb|AAD20143.1| putative protease [Arabidopsis thaliana]
          Length = 469

 Score =  127 bits (320), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 108/371 (29%), Positives = 167/371 (45%), Gaps = 43/371 (11%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYSP 211
           G YF+++ +G+PP +F++ +DTGSDI W+ C  C+ C   S        FD   S +   
Sbjct: 98  GLYFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSLTAGS 157

Query: 212 LPCAAPQCKSL---DVSAC-RANRCLYQVAYGDGSFTVGDLVTETVSF----GNS---GS 260
           + C+ P C S+     + C   N+C Y   YGDGS T G  +T+T  F    G S    S
Sbjct: 158 VTCSDPICSSVFQTTAAQCSENNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVANS 217

Query: 261 VKGIALGCGHDNEGLFVGS----AGLLGLGGGMLSLTKQIKATSL-----AYCLVDRDSP 311
              I  GC     G    S     G+ G G G LS+  Q+ +  +     ++CL   D  
Sbjct: 218 SAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCL-KGDGS 276

Query: 312 ASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGG 371
             GV           V +PL+ ++     Y + L    V GQ + +  ++FE       G
Sbjct: 277 GGGVFVLGEILVPGMVYSPLVPSQP---HYNLNLLSIGVNGQMLPLDAAVFEASNT--RG 331

Query: 372 IIVDCGTAITRLQTQAY----NSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPT 427
            IVD GT +T L  +AY    N++ +S  +L      T  ++  + CY  S   S   P+
Sbjct: 332 TIVDTGTTLTYLVKEAYDLFLNAISNSVSQLV-----TPIISNGEQCYLVSTSISDMFPS 386

Query: 428 VSLHFGAGKALDLPAKNYLIP---VDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLA 484
           VSL+F  G ++ L  ++YL      D A  +C  F       +I+G++  +     +DLA
Sbjct: 387 VSLNFAGGASMMLRPQDYLFHYGIYDGASMWCIGFQKAPEEQTILGDLVLKDKVFVYDLA 446

Query: 485 NNRVGFTPNKC 495
             R+G+    C
Sbjct: 447 RQRIGWASYDC 457


>gi|42571079|ref|NP_973613.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|110737616|dbj|BAF00749.1| putative protease [Arabidopsis thaliana]
 gi|330254187|gb|AEC09281.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 507

 Score =  127 bits (320), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 108/371 (29%), Positives = 168/371 (45%), Gaps = 43/371 (11%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYSP 211
           G YF+++ +G+PP +F++ +DTGSDI W+ C  C+ C   S        FD   S +   
Sbjct: 98  GLYFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSLTAGS 157

Query: 212 LPCAAPQCKSL---DVSAC-RANRCLYQVAYGDGSFTVGDLVTETVSF----GNS---GS 260
           + C+ P C S+     + C   N+C Y   YGDGS T G  +T+T  F    G S    S
Sbjct: 158 VTCSDPICSSVFQTTAAQCSENNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVANS 217

Query: 261 VKGIALGCGHDNEGLFVGS----AGLLGLGGGMLSLTKQIKATSL-----AYCLVDRDSP 311
              I  GC     G    S     G+ G G G LS+  Q+ +  +     ++CL   D  
Sbjct: 218 SAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCL-KGDGS 276

Query: 312 ASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGG 371
             GV           V +PL+ ++     Y + L    V GQ + +  ++FE   +   G
Sbjct: 277 GGGVFVLGEILVPGMVYSPLVPSQP---HYNLNLLSIGVNGQMLPLDAAVFE--ASNTRG 331

Query: 372 IIVDCGTAITRLQTQAY----NSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPT 427
            IVD GT +T L  +AY    N++ +S  +L      T  ++  + CY  S   S   P+
Sbjct: 332 TIVDTGTTLTYLVKEAYDLFLNAISNSVSQLV-----TPIISNGEQCYLVSTSISDMFPS 386

Query: 428 VSLHFGAGKALDLPAKNYLIP---VDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLA 484
           VSL+F  G ++ L  ++YL      D A  +C  F       +I+G++  +     +DLA
Sbjct: 387 VSLNFAGGASMMLRPQDYLFHYGIYDGASMWCIGFQKAPEEQTILGDLVLKDKVFVYDLA 446

Query: 485 NNRVGFTPNKC 495
             R+G+    C
Sbjct: 447 RQRIGWASYDC 457


>gi|326529727|dbj|BAK04810.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 488

 Score =  127 bits (319), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 134/460 (29%), Positives = 199/460 (43%), Gaps = 90/460 (19%)

Query: 103 LSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSR 162
           LSRL R S     L    +L  ++  +    P  A + P  +            G Y   
Sbjct: 47  LSRLARAS-----LARASRLRGHHQGQAASSPVRAALYPHSY------------GGYAFS 89

Query: 163 IGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD--------PIFDPKTSSSYSPLPC 214
           + +GTPP+   ++LDTGS + W+   PCT  YQ  +        P+F PK+SSS   + C
Sbjct: 90  LSLGTPPQPLPVLLDTGSHLTWV---PCTSNYQCQNCSAAAGSFPVFHPKSSSSSLLVSC 146

Query: 215 AAPQCKSL-----------DVSACR----------ANRC-LYQVAYGDGSFTVGDLVTET 252
           ++P C  +           D + CR           N C  Y V YG GS T G LV++T
Sbjct: 147 SSPSCLWIHSKSHLSDCARDSAPCRPSTANCSATATNVCPPYLVVYGSGS-TAGLLVSDT 205

Query: 253 VSFGNSGSV-KGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDR--- 308
           +     G+  +  A+GC      +    +GL G G G  S+  Q+     +YCL+ R   
Sbjct: 206 LRLSPRGAASRNFAVGC--SLASVHQPPSGLAGFGRGAPSVPAQLGVNKFSYCLLSRRFD 263

Query: 309 -DSPASGVLEFNSARGGDAVT----APLIRN----KKVDTFYYVGLTGFSVGGQAVQIPP 359
            D+  SG L   ++  G A      APL++N         +YY+ LTG +VGG++V +P 
Sbjct: 264 DDAAISGELVLGASSAGKAKAMMQYAPLLKNAGARPPYSVYYYLSLTGIAVGGKSVALPA 323

Query: 360 -SLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNL----KPTSGVALFDTC 414
            +L  +   G GG I+D GT  T L    +  +  + V   G      K   G      C
Sbjct: 324 RALAPVSGGGGGGAIIDSGTTFTYLDPTVFKPVAAAMVAAVGGRYNRSKDVEGALGLRPC 383

Query: 415 YDF-SGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAG-----TFCFAFAPTSSALS- 467
           +   +G R++ +P +SLHF  G  + LP +NY +    A        C A     S+ S 
Sbjct: 384 FALPAGARTMDLPELSLHFSGGAEMRLPIENYFLAAGPASGVAPEAICLAVVSDVSSASG 443

Query: 468 ------------IIGNVQQQGTRVSFDLANNRVGFTPNKC 495
                       I+G+ QQQ  +V +DL  NR+GF    C
Sbjct: 444 GAGVSGGGGPAIILGSFQQQNYQVEYDLEKNRLGFRQQPC 483


>gi|388508518|gb|AFK42325.1| unknown [Lotus japonicus]
          Length = 204

 Score =  127 bits (319), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 72/204 (35%), Positives = 106/204 (51%), Gaps = 7/204 (3%)

Query: 296 IKATSLAYCLVDRDSPASGVLEFNSARGG--DAVTAPLIRNKKVDTFYYVGLTGFSVGGQ 353
           +K    +YCL   D   + VL   S      DA++ PL+ N    +FYY+ L G  VGG 
Sbjct: 1   MKEAKFSYCLTSMDDSKASVLLLGSLAKATKDAISTPLLTNPSQPSFYYLSLEGIPVGGT 60

Query: 354 AVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAG-NLKPTSGVALFD 412
            + I  S+F++ + G GG+I+D GT IT L+   +++L+  F+  +   L  +S   L D
Sbjct: 61  QLSIEQSIFDVSDDGSGGVIIDSGTTITYLEKSVFDTLKKEFISQSNLQLDKSSSTGL-D 119

Query: 413 TCYDF-SGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGN 471
            C+   S    V VP +  HF  G  L+LPA++Y+I     G  C A    S+ +SI GN
Sbjct: 120 VCFSLPSETTQVEVPKLVFHFKGGD-LELPAESYMIADSKLGVACLAMG-ASNGMSIFGN 177

Query: 472 VQQQGTRVSFDLANNRVGFTPNKC 495
           VQQQ   V+ DL    + F P +C
Sbjct: 178 VQQQNILVNHDLEKETISFVPTQC 201


>gi|359474399|ref|XP_003631454.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 485

 Score =  127 bits (319), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 116/401 (28%), Positives = 164/401 (40%), Gaps = 81/401 (20%)

Query: 168 PPRQFSMVLDTGSDINWLQCRP--CTECYQQSDPI----FDPKTSSSYSPLPCAAPQCKS 221
           PP+  S+ +DTGSD+ W  C P  C  C  + D        P   +S + + C +P C +
Sbjct: 83  PPQPISLYMDTGSDLVWFPCAPFECILCEGKYDTAATGGLSPPNITSSASVSCKSPACSA 142

Query: 222 LDVSA-----CRANRCLYQV----------------AYGDGSFTVGDLVTETVSFGNSGS 260
              S      C   RC  ++                AYGDGS  V  L  +++S   S  
Sbjct: 143 AHTSLSSSDLCAMARCPLELIETSDCSSFSCPPFYYAYGDGSL-VARLYRDSLSMPASSP 201

Query: 261 V--KGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS------LAYCLVD----- 307
           +       GC H   G  VG AG    G G+LSL  Q+ + S       +YCLV      
Sbjct: 202 LVLHNFTFGCAHTALGEPVGVAGF---GRGVLSLPAQLASFSPHLGNQFSYCLVSHSFDA 258

Query: 308 ----RDSP-ASGVLEFNSARG-------GDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAV 355
               R SP   G    +  +        G+ V   ++ N K   FY VGL G +VG + +
Sbjct: 259 DRVRRPSPLILGRYSLDDEKKKRVGHDRGEFVYTAMLDNPKHPYFYCVGLEGITVGNRKI 318

Query: 356 QIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSF-------VRLAGNLKPTSGV 408
            +P  L  +D  G+GG++VD GT  T L    Y SL   F        + A  ++  +G+
Sbjct: 319 PVPEILKRVDRRGNGGMVVDSGTTFTMLPAGLYESLVTEFNHRMGRVYKRATQIEERTGL 378

Query: 409 ALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPV--------DSAGTFCFAF- 459
                CY +S   + +VP V+LHF     + LP  NY                  C    
Sbjct: 379 G---PCY-YSDDSAAKVPAVALHFVGNSTVILPRNNYYYEFFDGRDGQKKKRKVGCLMLM 434

Query: 460 -----APTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
                A +    + +GN QQQG  V +DL  +RVGF   KC
Sbjct: 435 NGGDEAESGGPAATLGNYQQQGFEVVYDLEKHRVGFARRKC 475


>gi|220702733|gb|ACL81165.1| aspartyl protease [Mirabilis jalapa]
          Length = 499

 Score =  127 bits (319), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 114/391 (29%), Positives = 155/391 (39%), Gaps = 69/391 (17%)

Query: 170 RQFSMVLDTGSDINWLQCRP--CTECYQQSDP-IFDPKTSSSYSPLPCAAPQCKS----- 221
           +  S+ +DTGSDI W  C P  C  C  + +P    P   S  S + C +  C +     
Sbjct: 103 QTLSVYMDTGSDIVWFPCSPFECILCEGKFEPGTLTPLNVSKSSLISCKSRACSTAHNSP 162

Query: 222 ---------------LDVSACRANRC-LYQVAYGDGSFTVG----DLVTETVSFGNSGSV 261
                          ++ S C    C  +  AYGDGS        +L+  + S     S+
Sbjct: 163 STSDLCAIAKCPLDEIETSDCSNYHCPSFYYAYGDGSLIAKLHKHNLIMPSTS-NKPFSL 221

Query: 262 KGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS------LAYCLVDR------- 308
           K    GC H   G  +G AG    G G LSL  Q+   S       +YCLV         
Sbjct: 222 KDFTFGCAHSALGEPIGVAGF---GFGSLSLPAQLANLSPDLGNQFSYCLVSHSFDSTKL 278

Query: 309 DSPASGVLEFNSARGGDAVT----APLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEM 364
             P+  +L     R  D +T     P++ N K   FY V +   SVG   V+ P +L  +
Sbjct: 279 HHPSPLILGKVKERDFDEITQFVYTPMLDNPKHPYFYSVSMEAISVGSSRVRAPNALIRI 338

Query: 365 DEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNL----KPTSGVALFDTCYDFSGL 420
           D  G+GG++VD GT  T L T  YNS+     R  G +      T        CY   G 
Sbjct: 339 DRDGNGGVVVDSGTTYTMLPTGFYNSVATELDRRVGRVFKRASETESKTGLSPCYYLEGN 398

Query: 421 RSVR----VPTVSLHFGAGKALDLPAKNYLIPV---------DSAGTFCFAFAPTSSA-- 465
              R    VP ++ HFG   ++ LP +NY                G          S   
Sbjct: 399 GVERLGLVVPRLAFHFGGNYSVVLPRRNYFYEFLDGEDEKKGRKVGCLMLMDGGDESEGG 458

Query: 466 -LSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
             + +GN QQQG +V +DL   RVGF P KC
Sbjct: 459 PGATLGNYQQQGFQVVYDLEERRVGFAPRKC 489


>gi|16209647|gb|AAL14384.1| AT3g52500/F22O6_120 [Arabidopsis thaliana]
          Length = 469

 Score =  127 bits (319), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 114/397 (28%), Positives = 168/397 (42%), Gaps = 65/397 (16%)

Query: 152 ASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----------PI 200
           +++  G Y   +  GTP +    V DTGS    L C PCT  Y  S            P 
Sbjct: 83  SAKSYGGYSVSLSFGTPSQTIPFVFDTGSS---LVCLPCTSRYLCSGCDFSGLDPTLIPR 139

Query: 201 FDPKTSSSYSPLPCAAPQCKSL------------DVSACRANRCLYQVAYGDGSFTVGDL 248
           F PK SSS   + C +P+C+ L            +   C      Y + YG GS T G L
Sbjct: 140 FIPKNSSSSKIIGCQSPKCQFLYGPNVQCRGCDPNTRNCTVGCPPYILQYGLGS-TAGVL 198

Query: 249 VTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDR 308
           +TE + F +  +V    +GC   +       AG+ G G G +SL  Q+     ++CLV R
Sbjct: 199 ITEKLDFPDL-TVPDFVVGCSIIST---RQPAGIAGFGRGPVSLPSQMNLKRFSHCLVSR 254

Query: 309 ---DSPASGVLEFNSARGGDAVTA------------PLIRNKKVDTFYYVGLTGFSVGGQ 353
              D+  +  L+ ++  G ++ +             P + NK    +YY+ L    VG +
Sbjct: 255 RFDDTNVTTDLDLDTGSGHNSGSKTPGLTYTPFRKNPNVSNKAFLEYYYLNLRRIYVGRK 314

Query: 354 AVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGN------LKPTSG 407
            V+IP         GDGG IVD G+  T ++   +  + + F     N      L+  +G
Sbjct: 315 HVKIPYKYLAPGTNGDGGSIVDSGSTFTFMERPVFELVAEEFASQMSNYTREKDLEKETG 374

Query: 408 VALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAP------ 461
           +     C++ SG   V VP +   F  G  L+LP  NY   V +  T C           
Sbjct: 375 LG---PCFNISGKGDVTVPELIFEFKGGAKLELPLSNYFTFVGNTDTVCLTVVSDKTVNP 431

Query: 462 ---TSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
              T  A+ I+G+ QQQ   V +DL N+R GF   KC
Sbjct: 432 SGGTGPAI-ILGSFQQQNYLVEYDLENDRFGFAKKKC 467


>gi|242095592|ref|XP_002438286.1| hypothetical protein SORBIDRAFT_10g011130 [Sorghum bicolor]
 gi|241916509|gb|EER89653.1| hypothetical protein SORBIDRAFT_10g011130 [Sorghum bicolor]
          Length = 495

 Score =  127 bits (319), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 99/372 (26%), Positives = 165/372 (44%), Gaps = 44/372 (11%)

Query: 155 GSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPC------TECYQQSDPIFDPKTSSS 208
           G  EY    G GTP +Q  +  D  S ++ ++C+PC       E     D  FDP  SSS
Sbjct: 134 GVFEYTVLAGYGTPAQQLPLFFDV-SGMSNMRCKPCFSGSSGGETTTTCDVAFDPSMSSS 192

Query: 209 YSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGC 268
           +  + C +P C     SA     C + +      F  G +V +T++   S + +  A+GC
Sbjct: 193 FRSVLCGSPDCGGHSCSA--GGSCTFTLQNSTFVFGNGTIVMDTLTLSPSATFENFAVGC 250

Query: 269 GHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT-----------SLAYCL-VDRDSP----- 311
              +  LF      + +G   LSL++   AT           + +YCL  D D+      
Sbjct: 251 MQLDNDLFTDG---VAVGNIDLSLSRHSLATRVLNSSPPGMAAFSYCLPADTDTHGFLTI 307

Query: 312 ASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGG 371
           A  + +++   G   V  PL+ N     FYYV L   ++ G+ + IPP+LF  +     G
Sbjct: 308 APALSDYSDHAGVKYV--PLVTNPTGPNFYYVDLVAIAINGEDLPIPPALFTGN-----G 360

Query: 372 IIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLH 431
            ++D  +A T L    Y +LRD F +     +P       DTCY+F+   ++ +P ++L 
Sbjct: 361 TMIDSQSAFTYLNPPIYAALRDEFRKAMLQYQPVPAFGGLDTCYNFTLAENIYLPDITLR 420

Query: 432 FGAGKALDLPAKNYLIPVDSA-------GTFCFAFAPTSS-ALSIIGNVQQQGTRVSFDL 483
           F  G+ +DL  + ++             G   FA AP  +   + +G+  Q+   + +D+
Sbjct: 421 FSNGETMDLDDRQFMYFFREHLTDGFPFGCLAFAAAPDQNFPWNYLGSQVQRTKEIVYDV 480

Query: 484 ANNRVGFTPNKC 495
               V F P++C
Sbjct: 481 RGGMVAFVPSRC 492


>gi|293334661|ref|NP_001168795.1| uncharacterized protein LOC100382594 precursor [Zea mays]
 gi|223973065|gb|ACN30720.1| unknown [Zea mays]
          Length = 631

 Score =  127 bits (319), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 98/362 (27%), Positives = 169/362 (46%), Gaps = 38/362 (10%)

Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCA 215
           +G Y +R+ +GTPP++F++++D+GS + ++ C  C +C    DP F P  SSSYSP+ C 
Sbjct: 85  NGYYTTRLYIGTPPQEFALIVDSGSTVTYVPCSSCEQCGNHQDPRFQPDLSSSYSPVKC- 143

Query: 216 APQCKSLDVSACRANR--CLYQVAYGDGSFTVGDLVTETVSFGNSGSVK--GIALGCGHD 271
                ++D + C +++  C Y+  Y + S + G L  + VSFG    +K      GC + 
Sbjct: 144 -----NVDCT-CDSDKKQCTYERQYAEMSSSSGVLGEDIVSFGRESELKPQHAIFGCENS 197

Query: 272 NEG-LFVGSA-GLLGLGGGMLSLTKQ-----IKATSLAYCLVDRDSPASGVLEFNSARGG 324
             G LF   A G++GLG G LS+  Q     + + S + C    D     ++      GG
Sbjct: 198 ETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDIGGGAMV-----LGG 252

Query: 325 DAVTAPLI---RNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAIT 381
                 +I    +     +Y + L    V G+A+++   +F        G ++D GT   
Sbjct: 253 MLAPPDMIFSNSDPLRSPYYNIELKEIHVAGKALRVESRIFNSKH----GTVLDSGTTYA 308

Query: 382 RLQTQAYNSLRDSFVRLAGNLKPTSG--VALFDTCYDFSGLRSVRV----PTVSLHFGAG 435
            L  QA+ + +++      +LK   G   +  D C+  +G    ++    P V + FG G
Sbjct: 309 YLPEQAFVAFKEAVTSKVHSLKKIRGPDPSYKDICFAGAGRNVSKLHEVFPDVDMVFGNG 368

Query: 436 KALDLPAKNYLIPVDSA-GTFCFA-FAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPN 493
           + L L  +NYL       G +C   F       +++G +  + T V++D  N ++GF   
Sbjct: 369 QKLSLTPENYLFRHSKVDGAYCLGVFQNGKDPTTLLGGIIVRNTLVTYDRHNEKIGFWKT 428

Query: 494 KC 495
            C
Sbjct: 429 NC 430


>gi|357476337|ref|XP_003608454.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355509509|gb|AES90651.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 683

 Score =  127 bits (318), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 99/360 (27%), Positives = 170/360 (47%), Gaps = 34/360 (9%)

Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCA 215
           +G Y +R+ +GTPP+ F++++DTGS + ++ C  C +C +  DP F P  SS+Y P+ C 
Sbjct: 78  NGYYTTRLWIGTPPQMFALIVDTGSTVTYVPCSTCEQCGRHQDPKFQPDLSSTYQPVKC- 136

Query: 216 APQCKSLDVSACRANR--CLYQVAYGDGSFTVGDLVTETVSFGNSGSV--KGIALGCGHD 271
                +LD + C  +R  C+Y+  Y + S + G L  + VSFGN   +  +    GC + 
Sbjct: 137 -----TLDCN-CDNDRMQCVYERQYAEMSTSSGVLGEDVVSFGNQSELAPQRAVFGCENV 190

Query: 272 NEG-LFVGSA-GLLGLGGGMLSLT-----KQIKATSLAYCLVDRDSPASGVLEFNSARGG 324
             G L+   A G++GLG G LS+      K + + S + C    D     ++    +   
Sbjct: 191 ETGDLYSQHADGIMGLGRGDLSIMDQLVDKNVVSDSFSLCYGGMDVGGGAMVLGGISPPS 250

Query: 325 DAVTAPLIRNKKVDTFYY-VGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRL 383
           D V A   ++  V + YY + L    V G+ + + PS+F+    G  G ++D GT    L
Sbjct: 251 DMVFA---QSDPVRSPYYNIDLKEIHVAGKRLPLNPSVFD----GKHGSVLDSGTTYAYL 303

Query: 384 QTQAYNSLRDSFVRLAGNLKPTSG--VALFDTCYDFSGLR----SVRVPTVSLHFGAGKA 437
             +A+ + +++ V+   +    SG      D C+  +G+     S   P V + FG G  
Sbjct: 304 PEEAFLAFKEAIVKELQSFSQISGPDPNYNDLCFSGAGIDVSQLSKTFPVVDMIFGNGHK 363

Query: 438 LDLPAKNYLIPVDSA-GTFCFA-FAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
             L  +NY+       G +C   F       +++G +  + T V +D    ++GF    C
Sbjct: 364 YSLSPENYMFRHSKVRGAYCLGIFQNGKDPTTLLGGIVVRNTLVLYDREQTKIGFWKTNC 423


>gi|255565531|ref|XP_002523756.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
 gi|223537060|gb|EEF38696.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
          Length = 507

 Score =  127 bits (318), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 115/418 (27%), Positives = 191/418 (45%), Gaps = 52/418 (12%)

Query: 123 AIYNVDRHELKPAEA----QILPEDFSTPVVSGASQGS------GEYFSRIGVGTPPRQF 172
           A + ++  +LK  ++    +IL    S  VV    QG+      G YF+R+ +G+PP+ F
Sbjct: 38  ASHKLELSQLKERDSFRHRRILQSTTSGGVVDFPVQGTFNPFLVGLYFTRVQLGSPPKDF 97

Query: 173 SMVLDTGSDINWLQCRPCTEC-----YQQSDPIFDPKTSSSYSPLPCAAPQC-----KSL 222
            + +DTGSD+ W+ C  C  C      Q     FDP +S++ + + C+  +C      S 
Sbjct: 98  YVQIDTGSDVLWVSCSSCNGCPVTSGLQIPLTFFDPGSSTTAALVSCSDQRCTAGIQSSD 157

Query: 223 DVSACRANRCLYQVAYGDGS----FTVGDLVTETVSFGNSGSVKGI--------ALGCGH 270
            + + R N+C Y   YGDGS    + V DL+       +SG +  I        +  C  
Sbjct: 158 SLCSSRTNQCGYTFQYGDGSGTSGYYVADLMHLDTLLLSSGELSQICQTYDSSVSFMCST 217

Query: 271 DNEGLFVGS----AGLLGLGGGMLSLTKQIKATSL-----AYCLVDRDSPASGVLEFNSA 321
              G    S     G+ G G   +S+  Q+ +  +     ++CL   DS   GVL     
Sbjct: 218 LQTGDLTKSDRAVDGIFGFGQQEMSVISQLASQGITPRVFSHCLKGDDS-GGGVLVLGEI 276

Query: 322 RGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAIT 381
              + V  PL+ ++     Y + L   SV GQ + I PS+F    + + G IVD GT + 
Sbjct: 277 VEPNIVYTPLVPSQP---HYNLYLQSISVAGQTLAIDPSVF--GASSNQGTIVDSGTTLA 331

Query: 382 RLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLP 441
            L   AY+    +   +  +L   + ++  + CY  +   +   P VSL+F  G +L L 
Sbjct: 332 YLAEGAYDPFVSAITSVV-SLNARTYLSKGNQCYLVTSSVNDVFPQVSLNFAGGASLILN 390

Query: 442 AKNYLIPVDSAG---TFCFAFAPT-SSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            ++YL+  +S G    +C  F  T    ++I+G++  +     +D+AN RVG+T   C
Sbjct: 391 PQDYLLQQNSVGGAAVWCVGFQKTPGQQITILGDLVLKDKIFVYDIANQRVGWTNYDC 448


>gi|359476756|ref|XP_002277082.2| PREDICTED: aspartic proteinase-like protein 2-like isoform 2 [Vitis
           vinifera]
          Length = 560

 Score =  127 bits (318), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 101/377 (26%), Positives = 170/377 (45%), Gaps = 42/377 (11%)

Query: 150 SGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPK 204
           +G    +G YF++IG+GTP + + + +DTGSDI W+ C  C  C  +SD      ++D K
Sbjct: 146 NGHPSEAGLYFAKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRCPTKSDLGVDLTLYDMK 205

Query: 205 TSSSYSPLPCAAPQCKSLD--VSACRAN-RCLYQVAYGDGSFTVGDLVTETVSFGN-SGS 260
            S++   + C    C   D  +  C+   +CLY V YGDGS T G  V + V +   SG+
Sbjct: 206 ASTTSDAVGCDDNFCSLYDGPLPGCKPGLQCLYSVLYGDGSSTTGYFVQDFVQYNRISGN 265

Query: 261 VK------GIALGCGHDNEGLFVGSA----GLLGLGGGMLSLTKQIKATS-----LAYCL 305
            +       +  GCG+   G    S+    G+LG G    S+  Q+ ++       ++CL
Sbjct: 266 FQTTPTNGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCL 325

Query: 306 VDRDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMD 365
            + D    G+              PL++N+     Y V +    VGG  + +P   F   
Sbjct: 326 DNVD--GGGIFAIGEVVEPKVNITPLVQNQ---AHYNVVMKEIEVGGDPLDVPSDAF--- 377

Query: 366 EAGD-GGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVR 424
           E+GD  G I+D GT +     + Y  L +  +    +L+  +    F TC+D++G     
Sbjct: 378 ESGDRKGTIIDSGTTLAYFPQEVYVPLIEKILSQQPDLRLHTVEQAF-TCFDYTGNVDDG 436

Query: 425 VPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSA------LSIIGNVQQQGTR 478
            PTV+LHF    +L +    YL   +    +C  +  + +       L+++G++      
Sbjct: 437 FPTVTLHFDKSISLTVYPHEYLFQHEFE--WCIGWQNSGAQTKDGKDLTLLGDLVLSNKL 494

Query: 479 VSFDLANNRVGFTPNKC 495
           V +DL    +G+    C
Sbjct: 495 VVYDLEKQGIGWVEYNC 511


>gi|115473125|ref|NP_001060161.1| Os07g0592200 [Oryza sativa Japonica Group]
 gi|29027762|dbj|BAC65898.1| putative CND41, chloroplast nucleoid DNA binding protein [Oryza
           sativa Japonica Group]
 gi|113611697|dbj|BAF22075.1| Os07g0592200 [Oryza sativa Japonica Group]
          Length = 631

 Score =  127 bits (318), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 99/362 (27%), Positives = 166/362 (45%), Gaps = 38/362 (10%)

Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCA 215
           +G Y +R+ +GTP ++F++++D+GS + ++ C  C +C    DP F P  SS+YSP+ C 
Sbjct: 88  NGYYTTRLYIGTPSQEFALIVDSGSTVTYVPCATCEQCGNHQDPRFQPDLSSTYSPVKC- 146

Query: 216 APQCKSLDVSACRANR--CLYQVAYGDGSFTVGDLVTETVSFGNSGSVK--GIALGCGHD 271
                ++D + C   R  C Y+  Y + S + G L  + +SFG    +K      GC + 
Sbjct: 147 -----NVDCT-CDNERSQCTYERQYAEMSSSSGVLGEDIMSFGKESELKPQRAVFGCENT 200

Query: 272 NEG-LFVGSA-GLLGLGGGMLSLTKQ-----IKATSLAYCLVDRDSPASGVLEFNSARGG 324
             G LF   A G++GLG G LS+  Q     + + S + C    D    G +      GG
Sbjct: 201 ETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDV-GGGTMVL----GG 255

Query: 325 DAVTAPLI---RNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAIT 381
                 ++    N     +Y + L    V G+A+++ P +F        G ++D GT   
Sbjct: 256 MPAPPDMVFSHSNPVRSPYYNIELKEIHVAGKALRLDPKIFNSKH----GTVLDSGTTYA 311

Query: 382 RLQTQAYNSLRDSFVRLAGNLKPTSG--VALFDTCYDFSGLRSVRV----PTVSLHFGAG 435
            L  QA+ + +D+      +LK   G      D C+  +G    ++    P V + FG G
Sbjct: 312 YLPEQAFVAFKDAVTNKVNSLKKIRGPDPNYKDICFAGAGRNVSQLSEVFPDVDMVFGNG 371

Query: 436 KALDLPAKNYLIPVDSA-GTFCFA-FAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPN 493
           + L L  +NYL       G +C   F       +++G +  + T V++D  N ++GF   
Sbjct: 372 QKLSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDRHNEKIGFWKT 431

Query: 494 KC 495
            C
Sbjct: 432 NC 433


>gi|224090744|ref|XP_002309070.1| predicted protein [Populus trichocarpa]
 gi|222855046|gb|EEE92593.1| predicted protein [Populus trichocarpa]
          Length = 404

 Score =  127 bits (318), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 120/373 (32%), Positives = 168/373 (45%), Gaps = 54/373 (14%)

Query: 163 IGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKS- 221
           + VGTPP+  SMV+DTGS+++WL C   T  Y  +   FDP  S+SY  +PC++P C + 
Sbjct: 35  LTVGTPPQNVSMVIDTGSELSWLHCNK-TLSYPTT---FDPTRSTSYQTIPCSSPTCTNR 90

Query: 222 -----LDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHD----N 272
                +  S    N C   ++Y D S + G+L ++    G+S  + G+  GC       N
Sbjct: 91  TQDFPIPASCDSNNLCHATLSYADASSSDGNLASDVFHIGSS-DISGLVFGCMDSVFSSN 149

Query: 273 EGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVL---EFNSARGGDAVTA 329
                 S GL+G+  G LS   Q+     +YC+   D   SG+L   E N          
Sbjct: 150 SDEDSKSTGLMGMNRGSLSFVSQLGFPKFSYCISGTD--FSGLLLLGESNLTWSVPLNYT 207

Query: 330 PLIR-NKKVDTF----YYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQ 384
           PLI+ +  +  F    Y V L G  V  + + IP S FE D  G G  +VD GT  T L 
Sbjct: 208 PLIQISTPLPYFDRVAYTVQLEGIKVLDKLLPIPKSTFEPDHTGAGQTMVDSGTQFTFLL 267

Query: 385 TQAYNSLRDSFVRLAGNLKPTSGV--ALFDTCYDFSGLR--------SVRV----PTVSL 430
              YN+LR +F      L  TS V   L D  + F G          S RV    PTV+L
Sbjct: 268 GPVYNALRSAF------LNQTSSVLRVLEDPDFVFQGAMDLCYLVPLSQRVLPLLPTVTL 321

Query: 431 HF-GAGKALDLPAKNYLIPVDSAG---TFCFAFAPTSSALS----IIGNVQQQGTRVSFD 482
            F GA   +      Y +P +  G     C +F   S  L     +IG+  QQ   + FD
Sbjct: 322 VFRGAEMTVSGDRVLYRVPGELRGNDSVHCLSFG-NSDLLGVEAYVIGHHHQQNVWMEFD 380

Query: 483 LANNRVGFTPNKC 495
           L  +R+G    +C
Sbjct: 381 LEKSRIGLAQVRC 393


>gi|356514298|ref|XP_003525843.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
          Length = 663

 Score =  127 bits (318), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 97/360 (26%), Positives = 170/360 (47%), Gaps = 34/360 (9%)

Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCA 215
           +G Y +R+ +GTPP+ F++++DTGS + ++ C  C +C +  DP F P++SS+Y P+ C 
Sbjct: 109 NGYYTTRLWIGTPPQMFALIVDTGSTVTYVPCSTCEQCGRHQDPKFQPESSSTYQPVKC- 167

Query: 216 APQCKSLDVSACRANR--CLYQVAYGDGSFTVGDLVTETVSFGNSGSV--KGIALGCGHD 271
                ++D + C  +R  C+Y+  Y + S + G L  + +SFGN   +  +    GC + 
Sbjct: 168 -----TIDCN-CDGDRMQCVYERQYAEMSTSSGVLGEDVISFGNQSELAPQRAVFGCENV 221

Query: 272 NEG-LFVGSA-GLLGLGGGMLSLT-----KQIKATSLAYCLVDRDSPASGVLEFNSARGG 324
             G L+   A G++GLG G LS+      K++ + S + C    D     ++    +   
Sbjct: 222 ETGDLYSQHADGIMGLGRGDLSIMDQLVDKKVISDSFSLCYGGMDVGGGAMVLGGISPPS 281

Query: 325 DAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQ 384
           D   A    +     +Y + L    V G+ + +  ++F+    G  G ++D GT    L 
Sbjct: 282 DMTFA--YSDPDRSPYYNIDLKEMHVAGKRLPLNANVFD----GKHGTVLDSGTTYAYLP 335

Query: 385 TQAYNSLRDSFVRLAGNLKPTSG--VALFDTCY-----DFSGLRSVRVPTVSLHFGAGKA 437
             A+ + +D+ V+   +LK  SG      D C+     D S L S   P V + FG G  
Sbjct: 336 EAAFLAFKDAIVKELQSLKQISGPDPNYNDICFSGAGNDVSQL-SKSFPVVDMVFGNGHK 394

Query: 438 LDLPAKNYLIPVDSA-GTFCFA-FAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
             L  +NY+       G +C   F   +   +++G +  + T V +D    ++GF    C
Sbjct: 395 YSLSPENYMFRHSKVRGAYCLGIFQNGNDQTTLLGGIIVRNTLVMYDREQTKIGFWKTNC 454


>gi|255581545|ref|XP_002531578.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223528808|gb|EEF30814.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 442

 Score =  127 bits (318), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 111/369 (30%), Positives = 166/369 (44%), Gaps = 46/369 (12%)

Query: 163 IGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCK-- 220
           + VG+PP+  +MVLDTGS+++WL C+      Q  + +F+P +S +YS +PC +P CK  
Sbjct: 73  LTVGSPPQNVTMVLDTGSELSWLHCKKT----QFLNSVFNPLSSKTYSKVPCLSPTCKTR 128

Query: 221 ----SLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGH----DN 272
               ++ VS      C   V+Y D +   G+L  ET   G S +      GC       N
Sbjct: 129 TRDLTIPVSCDATKLCHVIVSYADATSIEGNLAFETFRLG-SLTKPATIFGCMDSGFSSN 187

Query: 273 EGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGD------- 325
                 + GL+G+  G LS   Q+     +YC+   DS  +GVL   +A           
Sbjct: 188 SEEDSKTTGLIGMNRGSLSFVNQMGYPKFSYCISGFDS--AGVLLLGNASFPWLKPLSYT 245

Query: 326 ---AVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITR 382
               ++ PL    +V   Y V L G  V  + + +P S+F  D  G G  +VD GT  T 
Sbjct: 246 PLVQISTPLPYFDRVA--YTVQLEGIKVKNKVLSLPKSVFVPDHTGAGQTMVDSGTQFTF 303

Query: 383 LQTQAYNSLRDSFV-RLAGNLKPTSGVAL-----FDTCY--DFSGLRSVRVPTVSLHF-G 433
           L    Y +L++ F+ +  G LK  +          D CY  D S      +P VSL F G
Sbjct: 304 LLGPVYTALKNEFLSQTRGILKVLNDDNFVFQGAMDLCYLLDSSRPNLQNLPVVSLMFQG 363

Query: 434 AGKALDLPAKNYLIPVDSAG---TFCFAFAPTSSALS----IIGNVQQQGTRVSFDLANN 486
           A  ++      Y +P +  G    +CF F   S  L     +IG+  QQ   + FDL  +
Sbjct: 364 AEMSVSGERLLYRVPGEVRGRDSVWCFTFG-NSDLLGVEAFVIGHHHQQNVWMEFDLEKS 422

Query: 487 RVGFTPNKC 495
           R+G    +C
Sbjct: 423 RIGLADVRC 431


>gi|302756119|ref|XP_002961483.1| hypothetical protein SELMODRAFT_76765 [Selaginella moellendorffii]
 gi|300170142|gb|EFJ36743.1| hypothetical protein SELMODRAFT_76765 [Selaginella moellendorffii]
          Length = 388

 Score =  127 bits (318), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 102/375 (27%), Positives = 172/375 (45%), Gaps = 46/375 (12%)

Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYSPLP 213
           YF+++G+G P + + + +DTGSD+ W+ CRPC+ C ++S       ++DP+ SS+ S + 
Sbjct: 2   YFTQVGLGNPVKHYIVQVDTGSDVLWVNCRPCSGCPRKSALNIPLTMYDPRESSTTSLVS 61

Query: 214 CAAPQC---KSLDVSACR--ANRCLYQVAYGDGSFTVGDLVTETVSF------GNSGSVK 262
           C+ P C   +    + C    N C Y  +YGDGS + G  V + + +      G + +  
Sbjct: 62  CSDPLCVRGRRFAEAQCSQATNNCEYIFSYGDGSTSEGYYVRDAMQYNVISSNGLANTTS 121

Query: 263 GIALGCGHDNEGLFVGSA----GLLGLGGGMLSLTKQIKATS-----LAYCLVDRDSPAS 313
            +  GC     G    S     G++G G   LS+  Q+ A        ++CL        
Sbjct: 122 QVLFGCSIRQTGDLSTSQQAVDGIIGFGQLELSVPNQLAAQQNIPRVFSHCLEGEKRGGG 181

Query: 314 GVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGII 373
            ++    A  G   T PL+ +      Y V L G SV     ++P    +     D G+I
Sbjct: 182 ILVIGGIAEPGMTYT-PLVPDS---VHYNVVLRGISVNSN--RLPIDAEDFSSTNDTGVI 235

Query: 374 VDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFG 433
           +D GT +    + AYN    + +R A +  P     +   C+  SG  S   P V+L+F 
Sbjct: 236 MDSGTTLAYFPSGAYNVFVQA-IREATSATPVRVQGMDTQCFLVSGRLSDLFPNVTLNF- 293

Query: 434 AGKALDLPAKNYLI-----PVDSAGTFCFAFAPTSSA--------LSIIGNVQQQGTRVS 480
            G A++L   NYL+     P  +   +C  +  +SS+        L+I+G++  +   V 
Sbjct: 294 EGGAMELQPDNYLMWGGTAPTGTTDVWCIGWQSSSSSAGPKDGSQLTILGDIVLKDKLVV 353

Query: 481 FDLANNRVGFTPNKC 495
           +DL N+R+G+    C
Sbjct: 354 YDLDNSRIGWMSYNC 368


>gi|413952263|gb|AFW84912.1| aspartic proteinase nepenthesin-2 [Zea mays]
          Length = 509

 Score =  127 bits (318), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 110/375 (29%), Positives = 174/375 (46%), Gaps = 46/375 (12%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYSP 211
           G YF+R+ +G P ++F + +DTGSDI W+ C PCT C   S        F+P +SS+ S 
Sbjct: 89  GLYFTRVKLGNPAKEFFVQIDTGSDILWVTCSPCTGCPTSSGLNIQLESFNPDSSSTASR 148

Query: 212 LPCAAPQCKS--------LDVSACRANRCLYQVAYGDGSFTVGDLVTETVSF----GN-- 257
           + C+  +C +           S  +++ C Y   YGDGS T G  V++T+ F    GN  
Sbjct: 149 ITCSDDRCTAGFQTGEAICQTSNSQSSPCGYTFTYGDGSGTSGYYVSDTMFFETVMGNEQ 208

Query: 258 -SGSVKGIALGCGHDNEGLFVGS----AGLLGLGGGMLSLTKQIKATSL-----AYCLVD 307
            + S   I  GC +   G    +     G+ G G   LS+  Q+ +  +     ++CL  
Sbjct: 209 TANSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCLKG 268

Query: 308 RDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEA 367
            D+   G+L          V  PL+ ++     Y + L   +V GQ + I  SLF     
Sbjct: 269 SDN-GGGILVLGEIVEPGLVYTPLVPSQP---HYNLNLESIAVNGQKLPIDSSLFTTSNT 324

Query: 368 GDGGIIVDCGTAITRLQTQAYNSLRDSFVR-LAGNLKPT--SGVALFDTCYDFSGLRSVR 424
              G IVD GT +  L   AY    D FV  +A  + P+  S V+    C+  S      
Sbjct: 325 --QGTIVDSGTTLAYLADGAY----DPFVSAIAAAVSPSVRSLVSKGSQCFITSSSVDSS 378

Query: 425 VPTVSLHFGAGKALDLPAKNYLI---PVDSAGTFCFAFAPTS-SALSIIGNVQQQGTRVS 480
            PTV+L+F  G A+ +  +NYL+    VD++  +C  +       ++I+G++  +     
Sbjct: 379 FPTVTLYFMGGVAMSVKPENYLLQQASVDNSVLWCIGWQRNQGQEITILGDLVLKDKIFV 438

Query: 481 FDLANNRVGFTPNKC 495
           +DLAN R+G+    C
Sbjct: 439 YDLANMRMGWADYDC 453


>gi|413944596|gb|AFW77245.1| hypothetical protein ZEAMMB73_545774 [Zea mays]
 gi|414876929|tpg|DAA54060.1| TPA: hypothetical protein ZEAMMB73_875469 [Zea mays]
          Length = 459

 Score =  127 bits (318), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 116/357 (32%), Positives = 174/357 (48%), Gaps = 36/357 (10%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCR-PCT-ECYQQSDPIFDPKTSSSYSPLPC 214
           G Y     +GTPP++ + + DTGSD+ W +C   CT  C  Q  P + P  SS+++ LPC
Sbjct: 89  GAYDMEFSMGTPPQKLTALADTGSDLIWAKCGGACTTSCEPQGSPSYLPNASSTFAKLPC 148

Query: 215 AAPQC---KSLDVSACRAN--RCLYQVAYG----DGSFTVGDLVTETVSFGNSGSVKGIA 265
           +   C   +S  V+ C A    C Y+ +YG    D  +T G L  ET + G + +V  + 
Sbjct: 149 SDRLCSLLRSDSVAWCAAAGAECDYRYSYGLGDDDHHYTQGFLARETFTLG-ADAVPSVR 207

Query: 266 LGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVL--EFNSARG 323
            GC   +EG +   +GL+GLG G LSL  Q+ A++  YCL    S AS +L     S  G
Sbjct: 208 FGCTTASEGGYGSGSGLVGLGRGPLSLVSQLNASTFMYCLTSDASKASPLLFGSLASLTG 267

Query: 324 GDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRL 383
               +  L+ +    TFY V L   S+G       P + E +     G++ D GT +T L
Sbjct: 268 AQVQSTGLLAST---TFYAVNLRSISIGSATT---PGVGEPE-----GVVFDSGTTLTYL 316

Query: 384 QTQAYNSLRDSFVRLAG--NLKPTSGVALFDTCYDFSG---LRSVRVPTVSLHFGAGKAL 438
              AY+  + +F+       ++ T G   F+ C+       L +  VPT+ LHF  G  +
Sbjct: 317 AEPAYSEAKAAFLSQTSLDQVEDTDG---FEACFQKPANGRLSNAAVPTMVLHFD-GADM 372

Query: 439 DLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            LP  NY++ V+  G  C+     S +LSIIGN+ Q    V  D+  + + F P  C
Sbjct: 373 ALPVANYVVEVED-GVVCW-IVQRSPSLSIIGNIMQVNYLVLHDVHRSVLSFQPANC 427


>gi|296087361|emb|CBI33735.3| unnamed protein product [Vitis vinifera]
          Length = 633

 Score =  126 bits (317), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 101/361 (27%), Positives = 161/361 (44%), Gaps = 38/361 (10%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAA 216
           G Y +RI +GTPP+ F++++DTGS + ++ C  C +C +  DP F P  SS+Y PL C+ 
Sbjct: 90  GYYTTRIWIGTPPQTFALIVDTGSTLTYVPCSTCEQCGKHQDPNFQPDWSSTYQPLKCSM 149

Query: 217 P-QCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVK--GIALGCGHDNE 273
              C S          C+Y   Y + S + G L  + VSFG    +K      GC +   
Sbjct: 150 ECTCDS------EMMHCVYDRQYAEMSSSSGVLGEDIVSFGKQSELKPQRTVFGCENVET 203

Query: 274 GLFVG--SAGLLGLGGGMLSLTKQ-----IKATSLAYCLVDRDSPASGVLEFNSARGGDA 326
           G      + G++GLG G LS+  Q     +   S + C    D     ++      GG +
Sbjct: 204 GDIYSQRADGIMGLGRGDLSIVDQLVEKGVIGNSFSLCYGGMDVGGGAMV-----LGGIS 258

Query: 327 VTAPLI---RNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRL 383
             A ++    +     +Y + L    + G+ + I P +F+    G  G I+D GT    L
Sbjct: 259 PPAGMVFTHSDPARSAYYNIDLKEIHIAGKQLPINPMVFD----GKYGTILDSGTTYAYL 314

Query: 384 QTQAYNSLRDSFVRLAGNLKPTSG--VALFDTCY-----DFSGLRSVRVPTVSLHFGAGK 436
              A+ + +D+ ++   +LK   G      D C+     D S L S   P V L F  G 
Sbjct: 315 PEPAFKAFKDAIMKELNSLKLIQGPDRNYNDICFSGVGSDVSQL-SKTFPAVDLVFSNGN 373

Query: 437 ALDLPAKNYLIPVDSA-GTFCFA-FAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNK 494
            L L  +NYL     A G +C   F   +   +++G +  + T V +D  + ++GF    
Sbjct: 374 RLSLSPENYLFQHSKAHGAYCLGIFQNENDQTTLLGGIIVRNTLVMYDREHLKIGFWKTN 433

Query: 495 C 495
           C
Sbjct: 434 C 434


>gi|226508052|ref|NP_001150337.1| LOC100283967 precursor [Zea mays]
 gi|195638522|gb|ACG38729.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
          Length = 507

 Score =  126 bits (317), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 110/375 (29%), Positives = 174/375 (46%), Gaps = 46/375 (12%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYSP 211
           G YF+R+ +G P ++F + +DTGSDI W+ C PCT C   S        F+P +SS+ S 
Sbjct: 87  GLYFTRVKLGNPAKEFFVQIDTGSDILWVTCSPCTGCPTSSGLNIQLESFNPDSSSTASR 146

Query: 212 LPCAAPQCKS--------LDVSACRANRCLYQVAYGDGSFTVGDLVTETVSF----GN-- 257
           + C+  +C +           S  +++ C Y   YGDGS T G  V++T+ F    GN  
Sbjct: 147 ITCSDDRCTAGFQTGEAICQTSNSQSSPCGYTFTYGDGSGTSGYYVSDTMFFETVMGNEQ 206

Query: 258 -SGSVKGIALGCGHDNEGLFVGS----AGLLGLGGGMLSLTKQIKATSL-----AYCLVD 307
            + S   I  GC +   G    +     G+ G G   LS+  Q+ +  +     ++CL  
Sbjct: 207 TANSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCLKG 266

Query: 308 RDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEA 367
            D+   G+L          V  PL+ ++     Y + L   +V GQ + I  SLF     
Sbjct: 267 SDN-GGGILVLGEIVEPGLVYTPLVPSQP---HYNLNLESIAVNGQKLPIDSSLFTTSNT 322

Query: 368 GDGGIIVDCGTAITRLQTQAYNSLRDSFVR-LAGNLKPT--SGVALFDTCYDFSGLRSVR 424
              G IVD GT +  L   AY    D FV  +A  + P+  S V+    C+  S      
Sbjct: 323 --QGTIVDSGTTLAYLADGAY----DPFVSAIAAAVSPSVRSLVSKGSQCFITSSSVDSS 376

Query: 425 VPTVSLHFGAGKALDLPAKNYLI---PVDSAGTFCFAFAPTS-SALSIIGNVQQQGTRVS 480
            PTV+L+F  G A+ +  +NYL+    VD++  +C  +       ++I+G++  +     
Sbjct: 377 FPTVTLYFMGGVAMSVKPENYLLQQASVDNSVLWCIGWQRNQGQEITILGDLVLKDKIFV 436

Query: 481 FDLANNRVGFTPNKC 495
           +DLAN R+G+    C
Sbjct: 437 YDLANMRMGWADYDC 451


>gi|357117301|ref|XP_003560410.1| PREDICTED: uncharacterized protein LOC100833752 [Brachypodium
           distachyon]
          Length = 473

 Score =  126 bits (317), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 111/385 (28%), Positives = 165/385 (42%), Gaps = 67/385 (17%)

Query: 159 YFSRIGVGTPP--RQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLP--- 213
           Y   +GVGT      + + +D  +  +W+QC PC  C  Q +P+FDP  S ++ P+    
Sbjct: 101 YAVAVGVGTEHGYENYELEMDMAAGFSWMQCAPCHPCLPQLNPVFDPAKSPTFRPVSGHN 160

Query: 214 ---CAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSF----GNSGSVKGIAL 266
              C  P     D       RC + +AY +G+   G L  +T SF     N   + GI  
Sbjct: 161 AVLCRPPYHPLQD------GRCGFGIAYRNGASAAGYLARDTFSFPTGDNNFQHLPGIVF 214

Query: 267 GCGH-----DNEGLFVGSAGLLGLGGG-----MLSLTKQIKATS---LAYCLVDRDSPAS 313
           GC +     D  G     AG+LG+G G     +    +Q+        +YC +   + A 
Sbjct: 215 GCANRIARFDTHGAL---AGVLGMGMGAEGKPLTGFMRQLYHNGGGRFSYCPIVPGTTAY 271

Query: 314 GVLEFNS----------ARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIP---PS 360
             L F +           R   AV AP   ++     YYV L G SVG  A+++P   P 
Sbjct: 272 SFLRFGNDIPSQPPAGVHRQSMAVLAPTTTSEA----YYVKLAGISVG--ALRVPGVTPE 325

Query: 361 LFEMDEAGDGGIIVDCGTAITRLQTQAY----NSLRDSFVRLAGNLKPTSGVALFDTCYD 416
           +FE D+ G GG  +D GT +T +   AY     ++R    R       + G  L   C  
Sbjct: 326 MFERDQHGRGGCAIDIGTKMTAIVQTAYAHVEAAVRGHLQRNRARFVQSPGHHL---CVH 382

Query: 417 FSGLRSVRVPTVSLHFGAGKALDL-PAKNYLI---PVDSAGTFCFAFAPTSSALSIIGNV 472
            +     R+P+++LHF  G  L + P   +L+   P       C    P +  +++IG +
Sbjct: 383 RTPAIEERLPSMTLHFVGGPWLRVKPQHLFLVVGSPTGGGEYLCLGLVPDAE-MTVIGAM 441

Query: 473 QQQGTRVSFDLANN--RVGFTPNKC 495
           QQ  TR  FDL NN   V F P  C
Sbjct: 442 QQIDTRFIFDLHNNIPIVSFNPEDC 466


>gi|224104765|ref|XP_002313558.1| predicted protein [Populus trichocarpa]
 gi|222849966|gb|EEE87513.1| predicted protein [Populus trichocarpa]
          Length = 468

 Score =  126 bits (317), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 107/370 (28%), Positives = 170/370 (45%), Gaps = 39/370 (10%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD---PI--FDPKTSSSYSP 211
           G Y++R+ +GTPPR F + +DTGSD+ W+ C  C  C   S    P+  FDP +S + S 
Sbjct: 50  GLYYTRLQLGTPPRDFYVQIDTGSDVLWVSCGSCNGCPVNSGLHIPLNFFDPGSSPTASL 109

Query: 212 LPCAAPQC-----KSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGN--SGSVKG- 263
           + C+  +C      S  V + + N C Y   YGDGS T G  V++ + F     GSV   
Sbjct: 110 ISCSDQRCSLGLQSSDSVCSAQNNLCGYNFQYGDGSGTSGYYVSDLLHFDTVLGGSVMNN 169

Query: 264 ----IALGCGHDNEGLFVGS----AGLLGLGGGMLSLTKQ-----IKATSLAYCLVDRDS 310
               I  GC     G    S     G+ G G   +S+  Q     I   + ++CL   DS
Sbjct: 170 SSAPIVFGCSALQTGDLTKSDRAVDGIFGFGQQDMSVVSQLASQGISPRAFSHCLKGDDS 229

Query: 311 PASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDG 370
              G+L        + V  PL+ ++     Y + +   SV GQ + I PS+F    +   
Sbjct: 230 -GGGILVLGEIVEPNIVYTPLVPSQP---HYNLNMQSISVNGQTLAIDPSVFGTSSS--Q 283

Query: 371 GIIVDCGTAITRLQTQAYNSLRDSFVRLAG-NLKPTSGVALFDTCYDFSGLRSVRVPTVS 429
           G I+D GT +  L   AY+    +   +   +++P   ++  + CY  S   +   P VS
Sbjct: 284 GTIIDSGTTLAYLAEAAYDPFISAITSIVSPSVRPY--LSKGNHCYLISSSINDIFPQVS 341

Query: 430 LHFGAGKALDLPAKNYLIPVDSAG---TFCFAFAPTS-SALSIIGNVQQQGTRVSFDLAN 485
           L+F  G ++ L  ++YLI   S G    +C  F       ++I+G++  +     +D+AN
Sbjct: 342 LNFAGGASMILIPQDYLIQQSSIGGAALWCIGFQKIQGQGITILGDLVLKDKIFVYDIAN 401

Query: 486 NRVGFTPNKC 495
            R+G+    C
Sbjct: 402 QRIGWANYDC 411


>gi|194707632|gb|ACF87900.1| unknown [Zea mays]
          Length = 423

 Score =  126 bits (317), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 110/375 (29%), Positives = 174/375 (46%), Gaps = 46/375 (12%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYSP 211
           G YF+R+ +G P ++F + +DTGSDI W+ C PCT C   S        F+P +SS+ S 
Sbjct: 3   GLYFTRVKLGNPAKEFFVQIDTGSDILWVTCSPCTGCPTSSGLNIQLESFNPDSSSTASR 62

Query: 212 LPCAAPQCKS--------LDVSACRANRCLYQVAYGDGSFTVGDLVTETVSF----GN-- 257
           + C+  +C +           S  +++ C Y   YGDGS T G  V++T+ F    GN  
Sbjct: 63  ITCSDDRCTAGFQTGEAICQTSNSQSSPCGYTFTYGDGSGTSGYYVSDTMFFETVMGNEQ 122

Query: 258 -SGSVKGIALGCGHDNEGLFVGS----AGLLGLGGGMLSLTKQIKATSL-----AYCLVD 307
            + S   I  GC +   G    +     G+ G G   LS+  Q+ +  +     ++CL  
Sbjct: 123 TANSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCLKG 182

Query: 308 RDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEA 367
            D+   G+L          V  PL+ ++     Y + L   +V GQ + I  SLF     
Sbjct: 183 SDN-GGGILVLGEIVEPGLVYTPLVPSQP---HYNLNLESIAVNGQKLPIDSSLFTTSNT 238

Query: 368 GDGGIIVDCGTAITRLQTQAYNSLRDSFVR-LAGNLKPT--SGVALFDTCYDFSGLRSVR 424
              G IVD GT +  L   AY    D FV  +A  + P+  S V+    C+  S      
Sbjct: 239 --QGTIVDSGTTLAYLADGAY----DPFVSAIAAAVSPSVRSLVSKGSQCFITSSSVDSS 292

Query: 425 VPTVSLHFGAGKALDLPAKNYLI---PVDSAGTFCFAFAPTS-SALSIIGNVQQQGTRVS 480
            PTV+L+F  G A+ +  +NYL+    VD++  +C  +       ++I+G++  +     
Sbjct: 293 FPTVTLYFMGGVAMSVKPENYLLQQASVDNSVLWCIGWQRNQGQEITILGDLVLKDKIFV 352

Query: 481 FDLANNRVGFTPNKC 495
           +DLAN R+G+    C
Sbjct: 353 YDLANMRMGWADYDC 367


>gi|225438908|ref|XP_002279194.1| PREDICTED: aspartic proteinase-like protein 2-like [Vitis vinifera]
          Length = 634

 Score =  126 bits (317), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 101/361 (27%), Positives = 161/361 (44%), Gaps = 38/361 (10%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAA 216
           G Y +RI +GTPP+ F++++DTGS + ++ C  C +C +  DP F P  SS+Y PL C+ 
Sbjct: 90  GYYTTRIWIGTPPQTFALIVDTGSTLTYVPCSTCEQCGKHQDPNFQPDWSSTYQPLKCSM 149

Query: 217 P-QCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVK--GIALGCGHDNE 273
              C S          C+Y   Y + S + G L  + VSFG    +K      GC +   
Sbjct: 150 ECTCDS------EMMHCVYDRQYAEMSSSSGVLGEDIVSFGKQSELKPQRTVFGCENVET 203

Query: 274 GLFVG--SAGLLGLGGGMLSLTKQ-----IKATSLAYCLVDRDSPASGVLEFNSARGGDA 326
           G      + G++GLG G LS+  Q     +   S + C    D     ++      GG +
Sbjct: 204 GDIYSQRADGIMGLGRGDLSIVDQLVEKGVIGNSFSLCYGGMDVGGGAMV-----LGGIS 258

Query: 327 VTAPLI---RNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRL 383
             A ++    +     +Y + L    + G+ + I P +F+    G  G I+D GT    L
Sbjct: 259 PPAGMVFTHSDPARSAYYNIDLKEIHIAGKQLPINPMVFD----GKYGTILDSGTTYAYL 314

Query: 384 QTQAYNSLRDSFVRLAGNLKPTSG--VALFDTCY-----DFSGLRSVRVPTVSLHFGAGK 436
              A+ + +D+ ++   +LK   G      D C+     D S L S   P V L F  G 
Sbjct: 315 PEPAFKAFKDAIMKELNSLKLIQGPDRNYNDICFSGVGSDVSQL-SKTFPAVDLVFSNGN 373

Query: 437 ALDLPAKNYLIPVDSA-GTFCFA-FAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNK 494
            L L  +NYL     A G +C   F   +   +++G +  + T V +D  + ++GF    
Sbjct: 374 RLSLSPENYLFQHSKAHGAYCLGIFQNENDQTTLLGGIIVRNTLVMYDREHLKIGFWKTN 433

Query: 495 C 495
           C
Sbjct: 434 C 434


>gi|218187618|gb|EEC70045.1| hypothetical protein OsI_00635 [Oryza sativa Indica Group]
          Length = 570

 Score =  126 bits (317), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 115/368 (31%), Positives = 158/368 (42%), Gaps = 56/368 (15%)

Query: 148 VVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPC---TECYQQSDPIFDPK 204
           VVS     S EY   + +G+PPR    + DTGSD+ W++C+     T         FDP 
Sbjct: 90  VVSKVVSRSFEYLMTVNLGSPPRSMLAIADTGSDLVWVKCKKGNNDTSSAAAPTTQFDPS 149

Query: 205 TSSSYSPLPCAAPQCKSLDVSAC-RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGS--- 260
            SS+Y  + C    C++L  + C   + C Y  AYGDGS T G L TET +F + G+   
Sbjct: 150 RSSTYGRVSCQTDACEALGRATCDDGSNCAYLYAYGDGSNTTGVLSTETFTFDDGGAGRS 209

Query: 261 -----VKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSL----AYCLVDRDSP 311
                + G+  GC     G F     +   GG +  +T+   ATSL    +YCLV     
Sbjct: 210 PRQVRIGGVKFGCSTATAGSFPADGLVGLGGGAVSLVTQLGGATSLGRRFSYCLVPHSVN 269

Query: 312 ASGVLEFNS---ARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAG 368
           AS  L F +        A + PL+ NK V +                           A 
Sbjct: 270 ASSALNFGALADVTEPGAASTPLVGNKTVAS---------------------------AA 302

Query: 369 DGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPT-SGVALFDTCYDFSGLRSVR--- 424
              IIVD GT +T L       + D   R    L P  S   L   CY+ +G R V    
Sbjct: 303 SSRIIVDSGTTLTFLDPSLLGPIVDELSRRI-TLPPVQSPDGLLQLCYNVAG-REVEAGE 360

Query: 425 -VPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSA--LSIIGNVQQQGTRVSF 481
            +P ++L FG G A+ L  +N  + V   GT C A   T+    +SI+GN+ QQ   V +
Sbjct: 361 SIPDLTLEFGGGAAVALKPENAFVAVQE-GTLCLAIVATTEQQPVSILGNLAQQNIHVGY 419

Query: 482 DLANNRVG 489
           DL    VG
Sbjct: 420 DLDAGTVG 427



 Score = 58.5 bits (140), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 45/131 (34%), Positives = 60/131 (45%), Gaps = 10/131 (7%)

Query: 372 IIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPT-SGVALFDTCYDFSGLRSVR----VP 426
           IIVD GT +T L       + D   R    L P  S   L   CY+ +G R V     +P
Sbjct: 439 IIVDSGTTLTFLDPSLLGPIVDELSRRI-TLPPVQSPDGLLQLCYNVAG-REVEAGESIP 496

Query: 427 TVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSA--LSIIGNVQQQGTRVSFDLA 484
            ++L FG G A+ L  +N  + V   GT C A   T+    +SI+GN+ QQ   V +DL 
Sbjct: 497 DLTLEFGGGAAVALKPENAFVAVQE-GTLCLAIVATTEQQPVSILGNLAQQNIHVGYDLD 555

Query: 485 NNRVGFTPNKC 495
              V F    C
Sbjct: 556 AGTVTFAVADC 566


>gi|326513976|dbj|BAJ92138.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 342

 Score =  126 bits (317), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 82/252 (32%), Positives = 132/252 (52%), Gaps = 22/252 (8%)

Query: 262 KGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLV----DRDSPA----- 312
           + +  GCG  + G  VG++GL+GL  G +SL  Q+     +YCL      + SP      
Sbjct: 92  RALGFGCGALSAGSLVGASGLMGLSPGTMSLISQLSVPRFSYCLTPFAERKTSPMLFGAM 151

Query: 313 SGVLEFNSARGGDAVTAPLIRNKKVDTFYY-VGLTGFSVGGQAVQIPPSLFEMDEAGDGG 371
           + + ++N+   G   T  ++RN  +DTFYY V L G S+G + +++P +   ++  G GG
Sbjct: 152 ADLRKYNTT--GPIQTTAILRNPAMDTFYYYVPLVGLSLGTKRLRVPAASLAINPDGTGG 209

Query: 372 IIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSG-VALFDTCYDFS---GLRSVRVPT 427
            IVD G+ +  L  +A+++++ + +  A  L   +G V  ++ C+       + +V+ P 
Sbjct: 210 TIVDSGSTMAHLAGKAFDAVKKAVLE-AVKLPVFNGTVEDYELCFAVPSGVAMAAVKTPP 268

Query: 428 VSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPT----SSALSIIGNVQQQGTRVSFDL 483
           + LHF  G A+ LP  NY      AG  C A A +     + +SIIGNVQQQ   V FD+
Sbjct: 269 LVLHFDGGAAMALPRDNYFQ-EPRAGLMCLAVARSPEDLGAPISIIGNVQQQNMHVLFDV 327

Query: 484 ANNRVGFTPNKC 495
            N +  F P KC
Sbjct: 328 HNQKFSFAPTKC 339


>gi|222631382|gb|EEE63514.1| hypothetical protein OsJ_18330 [Oryza sativa Japonica Group]
          Length = 464

 Score =  126 bits (317), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 117/389 (30%), Positives = 168/389 (43%), Gaps = 72/389 (18%)

Query: 175 VLDTGSDINWLQCRPC----------TECYQQSDPIFDPKTSSSYSPLPC---------A 215
           V+DTGSD+ W QC  C            C+ Q+ P ++   S +   +PC          
Sbjct: 77  VVDTGSDLVWTQCSTCRLPAVAAAGGGGCFPQNLPYYNFSLSRTARAVPCDDDDGALCGV 136

Query: 216 APQCKSLDVSACRAN-RCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNE- 273
           AP+           +  C+   +YG G   +G L T+  +F +S SV  +A GC      
Sbjct: 137 APETAGCARGGGSGDDACVVAASYGAG-VALGVLGTDAFTFPSSSSVT-LAFGCVSQTRI 194

Query: 274 --GLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVD--RDSPASGVLEFNSARGGD---- 325
             G   G++G++GLG G LSL  Q+ AT  +YCL    RD+ +   L             
Sbjct: 195 SPGALNGASGIIGLGRGALSLVSQLNATEFSYCLTPYFRDTVSPSHLFVGDGELAGLRAA 254

Query: 326 ----------AVTAPLIRNKK---VDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGD--- 369
                       T P  +N K     TFYY+ L G + G   V +P   F++ EA     
Sbjct: 255 AGGGGGGGAPVTTVPFAKNPKDSPFSTFYYLPLVGLAAGNATVALPAGAFDLREAAPKVW 314

Query: 370 -GGIIVDCGTAITRLQTQAYNSLRDSFVRL---AGNLKPTS---GVALFDTCY----DFS 418
            GG ++D G+  TRL   A+ +L     R    +G+L P     G AL + C     D  
Sbjct: 315 AGGALIDSGSPFTRLVDPAHRALTKELARQLRGSGSLVPPPAKLGGAL-ELCVEAGDDGD 373

Query: 419 GLRSVRVPTVSLHF----GAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSA--------L 466
            L +  VP + L F    G G+ L +PA+ Y   V+ A T+C A   ++S          
Sbjct: 374 SLAAAAVPPLVLRFDDGVGGGRELVIPAEKYWARVE-ASTWCMAVVSSASGNATLPTNET 432

Query: 467 SIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           +IIGN  QQ  RV +DLAN  + F P  C
Sbjct: 433 TIIGNFMQQDMRVLYDLANGLLSFQPANC 461


>gi|449458942|ref|XP_004147205.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
 gi|449505000|ref|XP_004162350.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
          Length = 480

 Score =  126 bits (317), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 122/411 (29%), Positives = 166/411 (40%), Gaps = 87/411 (21%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRP--CTECYQQSDPIFDPKTSSSYSPLP- 213
           G+Y     +G+   + S+ +DTGSD+ W  C P  C  C  +      PK  S   PLP 
Sbjct: 74  GDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGK------PKIQS---PLPK 124

Query: 214 ---------------------------CAAPQC--KSLDVSACRANRCL-YQVAYGDGSF 243
                                      CA  +C  +S+++S C +  C  +  AYGDGS 
Sbjct: 125 IANNKSVSCSAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAYGDGSL 184

Query: 244 TVGDLVTETVSFGNSG-----SVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKA 298
            V  L  +++S          +V+    GC H   G  VG AG    G G+LS+  Q+  
Sbjct: 185 -VARLYRDSLSLPTPAPSPPINVRNFTFGCAHTTLGEPVGVAGF---GRGVLSMPSQLAT 240

Query: 299 TS------LAYCLV------DR-DSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGL 345
            S       +YCLV      DR   P+  +L        + +   L+ N K   FY VGL
Sbjct: 241 FSPQLGNRFSYCLVSHSFAADRVRRPSPLILGRYYTGETEFIYTSLLENPKHPYFYSVGL 300

Query: 346 TGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPT 405
            G SVG   +  P  L ++DE G GG++VD GT  T L    Y S+   F    G +   
Sbjct: 301 AGISVGNIRIPAPEFLTKVDEGGSGGVVVDSGTTFTMLPAGLYESVVAEFENRTGKVANR 360

Query: 406 SGVALFDT----CYDFSGLRSVRVPTVSLHF-GAGKALDLPAKNYLIPVDSAGTFCFAFA 460
           +     +T    CY +    SV VP V LHF G    + LP KNY       G       
Sbjct: 361 ARRIEENTGLSPCYYYE--NSVGVPRVVLHFVGEKSNVVLPRKNYFYEFLDGGDGVVGRK 418

Query: 461 PTSSALSI----------------IGNVQQQGTRVSFDLANNRVGFTPNKC 495
                L +                +GN QQQG  V +DL  NRVGF   +C
Sbjct: 419 RKVGCLMLMNGGDEAELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQC 469


>gi|21717154|gb|AAM76347.1|AC074196_5 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433293|gb|AAP54831.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
 gi|125532791|gb|EAY79356.1| hypothetical protein OsI_34485 [Oryza sativa Indica Group]
          Length = 397

 Score =  126 bits (317), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 100/356 (28%), Positives = 152/356 (42%), Gaps = 36/356 (10%)

Query: 165 VGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDV 224
           +GTPP+  S ++D   ++ W QC  C+ C++Q  P+F P  SS++ P PC    CKS   
Sbjct: 49  IGTPPQPASAIIDVAGELVWTQCSRCSRCFKQDLPLFIPNASSTFRPEPCGTDACKSTPT 108

Query: 225 SACRANRCLYQVAYG---DGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNE-GLFVGSA 280
           S C  + C Y+       D   T+G + TET + G   +   +A GC   ++     G++
Sbjct: 109 SNCSGDVCTYESTTNIRLDRHTTLGIVGTETFAIGT--ATASLAFGCVVASDIDTMDGTS 166

Query: 281 GLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNS----ARGGDAVTAPLIRNKK 336
           G +GLG    SL  Q+K T  +YCL  R +  S  L   S    A G    TAP I+   
Sbjct: 167 GFIGLGRTPRSLVAQMKLTKFSYCLSPRGTGKSSRLFLGSSAKLAGGESTSTAPFIKTSP 226

Query: 337 VDT---FYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIV-DCGTAITRLQTQAYNSLR 392
            D    +Y + L     G   +           A  GGI+V    +  + L   AY + +
Sbjct: 227 DDDSHHYYLLSLDAIRAGNTTIAT---------AQSGGILVMHTVSPFSLLVDSAYRAFK 277

Query: 393 DSFVRLAGN---LKPTSGVALFDTCY-DFSGLRSVRVPTVSLHF-GAGKALDLPAKNYLI 447
            +     G        +    FD C+   +G      P +   F G G AL +P   YLI
Sbjct: 278 KAVTEAVGGAAAPPMATPPQPFDLCFKKAAGFSRATAPDLVFTFQGGGAALTVPPAKYLI 337

Query: 448 PV-DSAGTFCFAFAPTS-------SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            V +   T C A    +         +S++G++QQ+     +DL    + F P  C
Sbjct: 338 DVGEEKDTACAAILSMARLNRTGLEGVSVLGSLQQENVHFLYDLKKETLSFEPADC 393


>gi|297827153|ref|XP_002881459.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297327298|gb|EFH57718.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 507

 Score =  126 bits (317), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 107/367 (29%), Positives = 163/367 (44%), Gaps = 35/367 (9%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYSP 211
           G YF+++ +G+PP +F++ +DTGSDI W+ C  C+ C   S        FD   S +   
Sbjct: 98  GLYFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSFTAGS 157

Query: 212 LPCAAPQCKSL---DVSAC-RANRCLYQVAYGDGSFTVGDLVTETVSF----GNS---GS 260
           + C+ P C S+     + C   N+C Y   YGDGS T G  +T+T  F    G S    S
Sbjct: 158 VTCSDPICSSVFQTTAAQCSENNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVANS 217

Query: 261 VKGIALGCGHDNEGLFVGS----AGLLGLGGGMLSLTKQIKATSL-----AYCLVDRDSP 311
              I  GC     G    S     G+ G G G LS+  Q+ +  +     ++CL   D  
Sbjct: 218 SAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCL-KGDGS 276

Query: 312 ASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGG 371
             GV           V +PL+ ++     Y + L    V GQ + I  ++FE       G
Sbjct: 277 GGGVFVLGEILVPGMVYSPLLPSQP---HYNLNLLSIGVNGQILPIDAAVFEASNT--RG 331

Query: 372 IIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLH 431
            IVD GT +T L  +AY+   ++       L  T  ++  + CY  S   S   P VSL+
Sbjct: 332 TIVDTGTTLTYLVKEAYDPFLNAISNSVSQLV-TLIISNGEQCYLVSTSISDMFPPVSLN 390

Query: 432 FGAGKALDLPAKNYLIP---VDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRV 488
           F  G ++ L  ++YL      D A  +C  F       +I+G++  +     +DLA  R+
Sbjct: 391 FAGGASMMLRPQDYLFHYGFYDGASMWCIGFQKAPEEQTILGDLVLKDKVFVYDLARQRI 450

Query: 489 GFTPNKC 495
           G+    C
Sbjct: 451 GWANYDC 457


>gi|302776054|ref|XP_002971323.1| hypothetical protein SELMODRAFT_63598 [Selaginella moellendorffii]
 gi|300161305|gb|EFJ27921.1| hypothetical protein SELMODRAFT_63598 [Selaginella moellendorffii]
          Length = 395

 Score =  126 bits (316), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 103/377 (27%), Positives = 174/377 (46%), Gaps = 46/377 (12%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYSP 211
           G YF+++G+G P + + + +DTGSD+ W+ CRPC+ C ++S       ++DP+ SS+ S 
Sbjct: 27  GLYFTQVGLGNPVKHYIVQVDTGSDVLWVNCRPCSGCPRKSALNIPLTMYDPRESSTTSL 86

Query: 212 LPCAAPQC---KSLDVSACR--ANRCLYQVAYGDGSFTVGDLVTETVSF------GNSGS 260
           + C+ P C   +    + C    N C Y  +YGDGS + G  V + + +      G + +
Sbjct: 87  VSCSDPLCVRGRRFAEAQCSQTTNNCEYIFSYGDGSTSEGYYVRDAMQYNVISSNGLANT 146

Query: 261 VKGIALGCGHDNEGLFVGSA----GLLGLGGGMLSLTKQIKATS-----LAYCLVDRDSP 311
              +  GC     G    S     G++G G   LS+  Q+ A        ++CL      
Sbjct: 147 TSQVLFGCSIRQTGDLSTSQQAVDGIIGFGQLELSVPNQLAAQQNIPRVFSHCLEGEKRG 206

Query: 312 ASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGG 371
              ++    A  G   T PL+ +      Y V L G SV   + ++P    +     D G
Sbjct: 207 GGILVIGGIAEPGMTYT-PLVPDS---VHYNVVLRGISV--NSNRLPIDAEDFSSTNDTG 260

Query: 372 IIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLH 431
           +I+D GT +    + AYN    + +R A +  P     +   C+  SG  S   P V+L+
Sbjct: 261 VIMDSGTTLAYFPSGAYNVFVQA-IREATSATPVRVQGMDTQCFLVSGRLSDLFPNVTLN 319

Query: 432 FGAGKALDLPAKNYLI-----PVDSAGTFCFAFAPTSSA--------LSIIGNVQQQGTR 478
           F  G A++L   NYL+     P  +   +C  +  +SS+        L+I+G++  +   
Sbjct: 320 F-EGGAMELQPDNYLMWGGTAPTGTTDVWCIGWQSSSSSAGPKDGSQLTILGDIVLKDKL 378

Query: 479 VSFDLANNRVGFTPNKC 495
           V +DL N+R+G+    C
Sbjct: 379 VVYDLDNSRIGWMSYNC 395


>gi|449511696|ref|XP_004164029.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 639

 Score =  126 bits (316), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 94/362 (25%), Positives = 167/362 (46%), Gaps = 38/362 (10%)

Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCA 215
           +G Y +R+ +G+PP++F++++DTGS + ++ C  C +C    DP F P+ SS+Y P+ C 
Sbjct: 86  NGYYTTRLWIGSPPQEFALIVDTGSTVTYVPCSNCVQCGNHQDPRFQPELSSTYQPVKCN 145

Query: 216 APQCKSLDVSACRAN--RCLYQVAYGDGSFTVGDLVTETVSFGNSGSV--KGIALGCGHD 271
           A  C       C  N  +C Y+  Y + S + G L  + +SFG    +  +    GC   
Sbjct: 146 A-DCN------CDENGVQCTYERRYAEMSTSSGVLAEDVMSFGKESELVPQRAVFGCETM 198

Query: 272 NEG-LFVGSA-GLLGLGGGMLSLTKQ-----IKATSLAYCLVDRDSPASGVLEFNSARGG 324
             G L+   A G++GLG G LS+  Q     + + S + C    D     ++      GG
Sbjct: 199 ESGDLYTQRADGIMGLGRGTLSVMDQLVGKGVVSNSFSLCYGGMDVGGGAMV-----LGG 253

Query: 325 DAVTAPLI---RNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAIT 381
            +    ++    +     +Y + L    V G+ +++ P  F+    G  G I+D GT   
Sbjct: 254 ISSPPGMVFSHSDPSRSPYYNIELKEIHVAGKPLKLNPRTFD----GKYGAILDSGTTYA 309

Query: 382 RLQTQAYNSLRDSFVRLAGNLKPTSG--VALFDTCYDFSGLRSVRVPT----VSLHFGAG 435
               +AY + +D+ ++    LK  SG      D C+  +G     +P     V + F  G
Sbjct: 310 YFPEKAYYAFKDAIMKKISFLKQISGPDPNFKDICFSGAGRDVTELPKVFPEVDMVFANG 369

Query: 436 KALDLPAKNYLI-PVDSAGTFCFA-FAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPN 493
           + + L  +NYL      +G +C   F   +   +++G +  + T V+++  N+ +GF   
Sbjct: 370 QKISLSPENYLFRHTKVSGAYCLGIFKNGNDQTTLLGGIIVRNTLVTYNRENSTIGFWKT 429

Query: 494 KC 495
            C
Sbjct: 430 NC 431


>gi|42569679|ref|NP_181205.2| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|330254186|gb|AEC09280.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 512

 Score =  126 bits (316), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 107/369 (28%), Positives = 167/369 (45%), Gaps = 43/369 (11%)

Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYSPLP 213
           YF+++ +G+PP +F++ +DTGSDI W+ C  C+ C   S        FD   S +   + 
Sbjct: 105 YFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSLTAGSVT 164

Query: 214 CAAPQCKSL---DVSAC-RANRCLYQVAYGDGSFTVGDLVTETVSF----GNS---GSVK 262
           C+ P C S+     + C   N+C Y   YGDGS T G  +T+T  F    G S    S  
Sbjct: 165 CSDPICSSVFQTTAAQCSENNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVANSSA 224

Query: 263 GIALGCGHDNEGLFVGS----AGLLGLGGGMLSLTKQIKATSL-----AYCLVDRDSPAS 313
            I  GC     G    S     G+ G G G LS+  Q+ +  +     ++CL   D    
Sbjct: 225 PIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCL-KGDGSGG 283

Query: 314 GVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGII 373
           GV           V +PL+ ++     Y + L    V GQ + +  ++FE   +   G I
Sbjct: 284 GVFVLGEILVPGMVYSPLVPSQP---HYNLNLLSIGVNGQMLPLDAAVFE--ASNTRGTI 338

Query: 374 VDCGTAITRLQTQAY----NSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVS 429
           VD GT +T L  +AY    N++ +S  +L      T  ++  + CY  S   S   P+VS
Sbjct: 339 VDTGTTLTYLVKEAYDLFLNAISNSVSQLV-----TPIISNGEQCYLVSTSISDMFPSVS 393

Query: 430 LHFGAGKALDLPAKNYLIP---VDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANN 486
           L+F  G ++ L  ++YL      D A  +C  F       +I+G++  +     +DLA  
Sbjct: 394 LNFAGGASMMLRPQDYLFHYGIYDGASMWCIGFQKAPEEQTILGDLVLKDKVFVYDLARQ 453

Query: 487 RVGFTPNKC 495
           R+G+    C
Sbjct: 454 RIGWASYDC 462


>gi|242041431|ref|XP_002468110.1| hypothetical protein SORBIDRAFT_01g039750 [Sorghum bicolor]
 gi|241921964|gb|EER95108.1| hypothetical protein SORBIDRAFT_01g039750 [Sorghum bicolor]
          Length = 467

 Score =  125 bits (315), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 118/396 (29%), Positives = 168/396 (42%), Gaps = 65/396 (16%)

Query: 163 IGVGTPPRQFSMVLDTGSDINWLQCR----PCTECYQQSDPIFDPKTSSSYSPLPCAA-P 217
           + VG PP+  +MVLDTGS+++WL C     P T    Q+   F+   SS+Y+   C++ P
Sbjct: 63  VAVGAPPQNVTMVLDTGSELSWLLCNGSRVPSTPPQPQAPAAFNGSASSTYAAAHCSSSP 122

Query: 218 QC----KSLDV----SACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGI----- 264
           +C    + L V    +   +N C   ++Y D S   G L  +T   G +  V+ +     
Sbjct: 123 ECQWRGRDLPVPPFCAGPPSNSCRVSLSYADASSADGVLAADTFLLGGAPPVRALFGCIT 182

Query: 265 -------ALGCGHDNEGLFVGSA----GLLGLGGGMLSLTKQIKATSLAYCLVDRDSPAS 313
                  A G G+ N+     S+    GLLG+  G LS   Q      AYC+   D P  
Sbjct: 183 SYSSSSTADGNGNGNDASATNSSEAATGLLGMNRGSLSFVTQTGTLRFAYCIAPGDGP-- 240

Query: 314 GVLEFNSARGGDAVTA-------PLIRNKKVDTF-----YYVGLTGFSVGGQAVQIPPSL 361
           G+L       G A++A       PLI   +   +     Y V L G  VG   + IP S+
Sbjct: 241 GLLVLGGDGDGAALSAAPQLNYTPLIEMSQPLPYFDRVAYSVQLEGIRVGAALLPIPKSV 300

Query: 362 FEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSG------VALFDTCY 415
              D  G G  +VD GT  T L   AY  L+  F+     L    G         FD C+
Sbjct: 301 LAPDHTGAGQTMVDSGTQFTFLLADAYAPLKGEFLNQTSALLAPLGEPDFVFQGAFDACF 360

Query: 416 DFSGLR------SVRVPTVSLHF-GAGKALDLPAKNYLIPVDSAG------TFCFAFAPT 462
             S  R      S  +P V L   GA  A+      Y++P +  G       +C  F  +
Sbjct: 361 RASEARVAAATASQLLPEVGLVLRGAEVAVGGEKLLYMVPGERRGEGGSEAVWCLTFGNS 420

Query: 463 SSA---LSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
             A     +IG+  QQ   V +DL N+RVGF P +C
Sbjct: 421 DMAGMSAYVIGHHHQQNVWVEYDLQNSRVGFAPARC 456


>gi|297734873|emb|CBI17107.3| unnamed protein product [Vitis vinifera]
          Length = 484

 Score =  125 bits (315), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 105/370 (28%), Positives = 180/370 (48%), Gaps = 40/370 (10%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD---PI--FDPKTSSSYSP 211
           G YF+R+ +G+PP++F + +DTGSD+ W+ C  C  C Q S    P+  FDP +SS+ S 
Sbjct: 66  GLYFTRVLLGSPPKEFYVQIDTGSDVLWVSCGSCNGCPQSSGLHIPLNFFDPGSSSTASL 125

Query: 212 LPCAAPQCKSLDVSACRA------NRCLYQVAYGDGSFTVGDLVTETVSF----GNS--G 259
           + C+  +C SL V +  A      N+C+Y   YGDGS T G  V++ ++F    G+S   
Sbjct: 126 ISCSDQRC-SLGVQSSDAGCSSQGNQCIYTFQYGDGSGTSGYYVSDLLNFDAIVGSSVTN 184

Query: 260 SVKGIALGCGHDNEGLFVGS----AGLLGLGGGMLSLTKQIKATSL-----AYCLVDRDS 310
           S   I  GC     G    S     G+ G G   +S+  Q+ +  +     ++CL     
Sbjct: 185 SSASIVFGCSISQTGDLTKSDRAVDGIFGFGQQDMSVISQMSSQGITPKVFSHCLKGDGG 244

Query: 311 PASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDG 370
               ++        D V +PL+ ++     Y + L   SV G+++ I P +F    + + 
Sbjct: 245 GGGILVLGEIVE-EDIVYSPLVPSQP---HYNLNLQSISVNGKSLAIDPEVFAT--STNR 298

Query: 371 GIIVDCGTAITRLQTQAYNSLRDSFVR-LAGNLKPTSGVALFDTCYDFSGLRSVRVPTVS 429
           G IVD GT +  L  +AY+    +    ++ +++P   ++    CY  +       PTVS
Sbjct: 299 GTIVDSGTTLAYLAEEAYDPFVSAITEAVSQSVRPL--LSKGTQCYLITSSVKGIFPTVS 356

Query: 430 LHFGAGKALDLPAKNYLIPVDSAG---TFCFAFAPTS-SALSIIGNVQQQGTRVSFDLAN 485
           L+F  G +++L  ++YL+  +S G    +C  F       ++I+G++  +     +DLA 
Sbjct: 357 LNFAGGVSMNLKPEDYLLQQNSIGDAAVWCIGFQKIQGQGITILGDLVLKDKIFVYDLAG 416

Query: 486 NRVGFTPNKC 495
            R+G+    C
Sbjct: 417 QRIGWANYDC 426


>gi|225436397|ref|XP_002272121.1| PREDICTED: aspartic proteinase-like protein 2 isoform 2 [Vitis
           vinifera]
          Length = 499

 Score =  125 bits (315), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 105/370 (28%), Positives = 180/370 (48%), Gaps = 40/370 (10%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD---PI--FDPKTSSSYSP 211
           G YF+R+ +G+PP++F + +DTGSD+ W+ C  C  C Q S    P+  FDP +SS+ S 
Sbjct: 81  GLYFTRVLLGSPPKEFYVQIDTGSDVLWVSCGSCNGCPQSSGLHIPLNFFDPGSSSTASL 140

Query: 212 LPCAAPQCKSLDVSACRA------NRCLYQVAYGDGSFTVGDLVTETVSF----GNS--G 259
           + C+  +C SL V +  A      N+C+Y   YGDGS T G  V++ ++F    G+S   
Sbjct: 141 ISCSDQRC-SLGVQSSDAGCSSQGNQCIYTFQYGDGSGTSGYYVSDLLNFDAIVGSSVTN 199

Query: 260 SVKGIALGCGHDNEGLFVGS----AGLLGLGGGMLSLTKQIKATSL-----AYCLVDRDS 310
           S   I  GC     G    S     G+ G G   +S+  Q+ +  +     ++CL     
Sbjct: 200 SSASIVFGCSISQTGDLTKSDRAVDGIFGFGQQDMSVISQMSSQGITPKVFSHCLKGDGG 259

Query: 311 PASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDG 370
               ++        D V +PL+ ++     Y + L   SV G+++ I P +F    + + 
Sbjct: 260 GGGILVLGEIVE-EDIVYSPLVPSQP---HYNLNLQSISVNGKSLAIDPEVFA--TSTNR 313

Query: 371 GIIVDCGTAITRLQTQAYNSLRDSFVR-LAGNLKPTSGVALFDTCYDFSGLRSVRVPTVS 429
           G IVD GT +  L  +AY+    +    ++ +++P   ++    CY  +       PTVS
Sbjct: 314 GTIVDSGTTLAYLAEEAYDPFVSAITEAVSQSVRPL--LSKGTQCYLITSSVKGIFPTVS 371

Query: 430 LHFGAGKALDLPAKNYLIPVDSAG---TFCFAFAPTS-SALSIIGNVQQQGTRVSFDLAN 485
           L+F  G +++L  ++YL+  +S G    +C  F       ++I+G++  +     +DLA 
Sbjct: 372 LNFAGGVSMNLKPEDYLLQQNSIGDAAVWCIGFQKIQGQGITILGDLVLKDKIFVYDLAG 431

Query: 486 NRVGFTPNKC 495
            R+G+    C
Sbjct: 432 QRIGWANYDC 441


>gi|224091849|ref|XP_002309371.1| predicted protein [Populus trichocarpa]
 gi|222855347|gb|EEE92894.1| predicted protein [Populus trichocarpa]
          Length = 438

 Score =  125 bits (314), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 104/349 (29%), Positives = 153/349 (43%), Gaps = 29/349 (8%)

Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQ-SDPIFDPKTSSSYSPLPCAAP 217
           +     +G PP     ++DTGS + W+QC PC  C QQ   P+FDP  SS+Y  L C   
Sbjct: 102 FLVNFSMGQPPVPQLAIMDTGSSLLWIQCAPCKSCSQQIIGPMFDPSISSTYDSLSCKNI 161

Query: 218 QCKSLDVSAC-RANRCLYQVAYGDGSFTVGDLVTETVSFGNS----GSVKGIALGCGHDN 272
            C+      C  +++C+Y   Y +G  +VG + TE + FG+S     +V  +  GC H N
Sbjct: 162 ICRYAPSGECDSSSQCVYNQTYVEGLPSVGVIATEQLIFGSSDEGRNAVNNVLFGCSHRN 221

Query: 273 EGLFVGS--AGLLGLGGGMLSLTKQIKATSLAYCLVDRDSP--ASGVLEFNSARGGDAVT 328
            G +      G+ GLG G+ S+  Q+  +  +YC+ +   P  +   L  +     +  +
Sbjct: 222 -GNYKDRRFTGVFGLGSGITSVVNQM-GSKFSYCIGNIADPDYSYNQLVLSEGVNMEGYS 279

Query: 329 APLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAY 388
            PL     VD  Y V L G SVG   + I PS F+  E     +I+D GTA T L    Y
Sbjct: 280 TPL---DVVDGHYQVILEGISVGETRLVIDPSAFKRTEK-QRRVIIDSGTAPTWLAENEY 335

Query: 389 NSLRDSFVRLAGN-LKPTSGVALFDTCYDFS-GLRSVRVPTVSLHFGAGKALDLPAKNYL 446
            +L      L    L P    +    CY    G   V  P V+ HF  G  L        
Sbjct: 336 RALEREVRNLLDRFLTPFMRESFL--CYKGKVGQDLVGFPAVTFHFAEGADL-------- 385

Query: 447 IPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
             VD+       +       S+IG + QQ   V++DL  +++ F    C
Sbjct: 386 -VVDTEMRQASVYGKDFKDFSVIGLMAQQYYNVAYDLNKHKLFFQRIDC 433


>gi|449440161|ref|XP_004137853.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
 gi|449521209|ref|XP_004167622.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 492

 Score =  125 bits (314), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 108/369 (29%), Positives = 172/369 (46%), Gaps = 38/369 (10%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYSP 211
           G YF+++ +GTPP +F++ +DTGSDI W+ C  C  C + S        FD  +SSS S 
Sbjct: 77  GLYFTKVKLGTPPMEFTVQIDTGSDILWVNCNSCNGCPRSSGLGIQLNFFDASSSSSSSL 136

Query: 212 LPCAAPQCKS---LDVSAC--RANRCLYQVAYGDGSFTVGDLVTETVSF----GNS---G 259
           + C+ P C S      + C  ++N+C Y   YGDGS T G  V+E++ F    G S    
Sbjct: 137 VSCSDPICNSAFQTTATQCLTQSNQCSYTFQYGDGSGTSGYYVSESMYFDMVMGQSMIAN 196

Query: 260 SVKGIALGCGHDNEGLFVGS----AGLLGLGGGMLSLTKQIKATSL-----AYCLVDRDS 310
           S   +  GC     G    S     G+ G G G LS+  Q+ A  +     ++CL   + 
Sbjct: 197 SSASVVFGCSTYQSGDLTKSDHAIDGIFGFGPGDLSVISQLSARGITPKVFSHCL-KGEG 255

Query: 311 PASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDG 370
              G+L          V +PL+ ++     Y   L   SV GQ + I PS+F    + + 
Sbjct: 256 NGGGILVLGEVLEPGIVYSPLVPSQPHYNLY---LQSISVNGQTLPIDPSVFA--TSINR 310

Query: 371 GIIVDCGTAITRLQTQAYNSLRDSFV-RLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVS 429
           G I+D GT +  L  +AY     +    ++ ++ PT  ++  + CY  S       P VS
Sbjct: 311 GTIIDSGTTLAYLVEEAYTPFVSAITAAVSQSVTPT--ISKGNQCYLVSTSVGEIFPLVS 368

Query: 430 LHFGAGKALDLPAKNYLIPV---DSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANN 486
           L+F    ++ L  + YL+ +   D A  +C  F      ++I+G++  +     +DLA  
Sbjct: 369 LNFAGSASMVLKPEEYLMHLGFYDGAALWCIGFQKVQEGVTILGDLVMKDKIFVYDLARQ 428

Query: 487 RVGFTPNKC 495
           R+G+    C
Sbjct: 429 RIGWASYDC 437


>gi|147866226|emb|CAN79938.1| hypothetical protein VITISV_027777 [Vitis vinifera]
          Length = 454

 Score =  125 bits (314), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 103/372 (27%), Positives = 156/372 (41%), Gaps = 39/372 (10%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRP---CTEC-YQQSDP---IFDPKTSSSY 209
           G Y   +  GTPP+   +++DTGSD+ W  C     C  C +  S+P   IF PK+SSS 
Sbjct: 88  GAYSIPLSFGTPPQTLPLIMDTGSDLVWFPCTHRYVCRNCSFSTSNPSSNIFIPKSSSSS 147

Query: 210 SPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNS--GSVKGIALG 267
             L C  P+C  +  S  ++ RC            +       + F +          L 
Sbjct: 148 KVLGCVNPKCGWIHGSKVQS-RCRDCEPTSPNCTQICPPYLNFLRFWDHRRSQFHRRMLC 206

Query: 268 CGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDR---DSPASGVLEFNSARGG 324
             H +         + G G G  SL  Q+     +YCL+ R   D+  S  L  +     
Sbjct: 207 PLHQST-----RREISGFGRGPPSLPSQLGLKKFSYCLLSRRYDDTTESSSLVLDGESDS 261

Query: 325 DAVTA-----PLIRNKKV------DTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGII 373
              TA     P ++N KV        +YY+GL   +VGG+ V+IP         GDGG I
Sbjct: 262 GEKTAGLSYTPFVQNPKVAGKHAFSVYYYLGLRHITVGGKHVKIPYKYLIPGADGDGGTI 321

Query: 374 VDCGTAITRLQTQAYNSLRDSFVRLAGNLKPT--SGVALFDTCYDFSGLRSVRVPTVSLH 431
           +D GT  T ++ + +  +   F +   + + T   G+     C++ SGL +   P ++L 
Sbjct: 322 IDSGTTFTYMKGEIFELVAAEFEKQVQSKRATEVEGITGLRPCFNISGLNTPSFPELTLK 381

Query: 432 FGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALS--------IIGNVQQQGTRVSFDL 483
           F  G  ++LP  NY+  +      C       +A          I+GN QQQ   V +DL
Sbjct: 382 FRGGAEMELPLANYVAFLGGDDVVCLTIVTDGAAGKEFSGGPAIILGNFQQQNFYVEYDL 441

Query: 484 ANNRVGFTPNKC 495
            N R+GF    C
Sbjct: 442 RNERLGFRQQSC 453


>gi|125575542|gb|EAZ16826.1| hypothetical protein OsJ_32298 [Oryza sativa Japonica Group]
          Length = 396

 Score =  125 bits (314), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 99/355 (27%), Positives = 152/355 (42%), Gaps = 35/355 (9%)

Query: 165 VGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDV 224
           +GTPP+  S ++D   ++ W QC  C+ C++Q  P+F P  SS++ P PC    CKS   
Sbjct: 49  IGTPPQPASAIIDVAGELVWTQCSRCSRCFKQDLPLFIPNASSTFRPEPCGTDACKSTPT 108

Query: 225 SACRANRCLYQVAYG---DGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNE-GLFVGSA 280
           S C  + C Y+       D   T+G + TET + G   +   +A GC   ++     G++
Sbjct: 109 SNCSGDVCTYESTTNIRLDRHTTLGIVGTETFAIGT--ATASLAFGCVVASDIDTMDGTS 166

Query: 281 GLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNS----ARGGDAVTAPLIRNKK 336
           G +GLG    SL  Q+K T  +YCL  R +  S  L   S    A G    TAP I+   
Sbjct: 167 GFIGLGRTPRSLVAQMKLTKFSYCLSPRGTGKSSRLFLGSSAKLAGGESTSTAPFIKTSP 226

Query: 337 VDT---FYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIV-DCGTAITRLQTQAYNSLR 392
            D    +Y + L     G   +           A  GGI+V    +  + L   AY + +
Sbjct: 227 DDDSHHYYLLSLDAIRAGNTTIAT---------AQSGGILVMHTVSPFSLLVDSAYRAFK 277

Query: 393 DSFVRLAGNL--KPTSGVAL-FDTCY-DFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIP 448
            +     G    +P +     FD C+   +G      P +   F    AL +P   YLI 
Sbjct: 278 KAVTEAVGGAAEQPMATPPQPFDLCFKKAAGFSRATAPDLVFTFQGAAALTVPPAKYLID 337

Query: 449 V-DSAGTFCFAFAPTS-------SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           V +   T C A    +         +S++G++QQ+     +DL    + F P  C
Sbjct: 338 VGEEKDTACAAILSMAWLNRTGLEGVSVLGSLQQEDVHFLYDLKKETLSFEPADC 392


>gi|449447285|ref|XP_004141399.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 609

 Score =  125 bits (313), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 94/362 (25%), Positives = 167/362 (46%), Gaps = 38/362 (10%)

Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCA 215
           +G Y +R+ +G+PP++F++++DTGS + ++ C  C +C    DP F P+ SS+Y P+ C 
Sbjct: 86  NGYYTTRLWIGSPPQEFALIVDTGSTVTYVPCSNCVQCGNHQDPRFQPELSSTYQPVKCN 145

Query: 216 APQCKSLDVSACRAN--RCLYQVAYGDGSFTVGDLVTETVSFGNSGSV--KGIALGCGHD 271
           A  C       C  N  +C Y+  Y + S + G L  + +SFG    +  +    GC   
Sbjct: 146 A-DCN------CDENGVQCTYERRYAEMSTSSGVLAEDVMSFGKESELVPQRAVFGCETM 198

Query: 272 NEG-LFVGSA-GLLGLGGGMLSLTKQ-----IKATSLAYCLVDRDSPASGVLEFNSARGG 324
             G L+   A G++GLG G LS+  Q     + + S + C    D     ++      GG
Sbjct: 199 ESGDLYTQRADGIMGLGRGTLSVMDQLVGKGVVSNSFSLCYGGMDVGGGAMV-----LGG 253

Query: 325 DAVTAPLI---RNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAIT 381
            +    ++    +     +Y + L    V G+ +++ P  F+    G  G I+D GT   
Sbjct: 254 ISSPPGMVFSHSDPSRSPYYNIELKEIHVAGKPLKLNPRTFD----GKYGAILDSGTTYA 309

Query: 382 RLQTQAYNSLRDSFVRLAGNLKPTSG--VALFDTCYDFSGLRSVRVPT----VSLHFGAG 435
               +AY + +D+ ++    LK  SG      D C+  +G     +P     V + F  G
Sbjct: 310 YFPEKAYYAFKDAIMKKISFLKQISGPDPNFKDICFSGAGRDVTELPKVFPEVDMVFANG 369

Query: 436 KALDLPAKNYLI-PVDSAGTFCFA-FAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPN 493
           + + L  +NYL      +G +C   F   +   +++G +  + T V+++  N+ +GF   
Sbjct: 370 QKISLSPENYLFRHTKVSGAYCLGIFKNGNDQTTLLGGIIVRNTLVTYNRENSTIGFWKT 429

Query: 494 KC 495
            C
Sbjct: 430 NC 431


>gi|125602787|gb|EAZ42112.1| hypothetical protein OsJ_26672 [Oryza sativa Japonica Group]
          Length = 477

 Score =  124 bits (312), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 75/203 (36%), Positives = 107/203 (52%), Gaps = 26/203 (12%)

Query: 106 LERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGV 165
           L  D AR N+L  + + A     +     A A    E    P+ SG    +  Y + I +
Sbjct: 107 LAADEARANSLQLRNKAAFTQSGKKATAAAAAAAGAE---VPLTSGIRFQTLNYVTTIAL 163

Query: 166 GTPPR------QFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQC 219
           G            ++++DTGSD+ W+QC+PC+ CY Q DP+FDP  S+SY+ +PC A  C
Sbjct: 164 GGGGSSRAGAGNLTVIVDTGSDLTWVQCKPCSVCYAQRDPLFDPSGSASYAAVPCNASAC 223

Query: 220 KSLDVSAC----------------RANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKG 263
           ++   +A                 ++ RC Y +AYGDGSF+ G L T+TV+ G + SV G
Sbjct: 224 EASLKAATGVPGSCATVGGGGGGGKSERCYYSLAYGDGSFSRGVLATDTVALGGA-SVDG 282

Query: 264 IALGCGHDNEGLFVGSAGLLGLG 286
              GCG  N GLF G+AGL+GLG
Sbjct: 283 FVFGCGLSNRGLFGGTAGLMGLG 305



 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 42/129 (32%), Positives = 62/129 (48%), Gaps = 5/129 (3%)

Query: 372 IIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSG--VALFDTCYDFSGLRSVRVPTVS 429
           +++D GT ITRL    Y ++R  F R  G  +  +    +L D CY+ +G   V+VP ++
Sbjct: 346 VLLDSGTVITRLAPSVYRAVRAEFARQFGAERYPAAPPFSLLDACYNLTGHDEVKVPLLT 405

Query: 430 LHFGAGKALDLPAKNYLIPVDSAGT-FCFAFAPTS--SALSIIGNVQQQGTRVSFDLANN 486
           L    G  + + A   L      G+  C A A  S      IIGN QQ+  RV +D   +
Sbjct: 406 LRLEGGADMTVDAAGMLFMARKDGSQVCLAMASLSFEDQTPIIGNYQQKNKRVVYDTVGS 465

Query: 487 RVGFTPNKC 495
           R+GF    C
Sbjct: 466 RLGFADEDC 474


>gi|357482031|ref|XP_003611301.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355512636|gb|AES94259.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 481

 Score =  124 bits (312), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 118/399 (29%), Positives = 168/399 (42%), Gaps = 85/399 (21%)

Query: 168 PPRQFSMVLDTGSDINWLQCRP--CTECY---QQSDPIFDPKTSSSYS---PLP------ 213
           PP+  ++ +DTGSD+ W  C P  C  C    Q + P    K + S S   P        
Sbjct: 85  PPQLITLYMDTGSDLVWFPCSPFECILCEGKPQTTKPANITKQTHSVSCQSPACSAAHAS 144

Query: 214 ------CAAPQC--KSLDVSACRANRCL-YQVAYGDGSFTVGDLVTETVSFGNSGSVKGI 264
                 CA  +C    ++ S C +  C  +  AYGDGSF V +L  +T+S  +S  ++  
Sbjct: 145 MSSSNLCAISRCPLDYIETSDCSSFSCPPFYYAYGDGSF-VANLYQQTLSL-SSLHLQNF 202

Query: 265 ALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS------LAYCLV------DR-DSP 311
             GC H          G+ G G G+LSL  Q+   S       +YCLV      DR   P
Sbjct: 203 TFGCAHT---ALAEPTGVAGFGRGILSLPAQLSTLSPHLGNRFSYCLVSHSFDGDRLRRP 259

Query: 312 ASGVLEFNSARGGDAVTAP------------LIRNKKVDTFYYVGLTGFSVGGQAVQIPP 359
           +  +L     R  D +T              ++ N K   +Y VGL G SVG + V  P 
Sbjct: 260 SPLIL----GRHNDTITGAGDGESVEFVYTSMLSNPKHPYYYCVGLAGISVGKRTVPAPE 315

Query: 360 SLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDS-------FVRLAGNLKPTSGVALFD 412
            L  +DE G+GG++VD GT  T L    YN++ +        F + A  ++  +G+    
Sbjct: 316 ILKRVDEKGNGGMVVDSGTTFTMLPESFYNAVVNEFDKRVNRFHKRASEIETKTGLG--- 372

Query: 413 TCYDFSGLRSVRVPTVSLHF-GAGKALDLPAKNYLIPVDSAG--------TFCFAFAPTS 463
            CY  +GL   ++P + LHF G    + LP KNY       G          C       
Sbjct: 373 PCYYLNGLS--QIPVLKLHFVGNNSDVVLPRKNYFYEFMDGGDGIRRKGKVGCMMLMNGE 430

Query: 464 SALSI-------IGNVQQQGTRVSFDLANNRVGFTPNKC 495
               +       +GN QQQG  V +DL   RVGF   +C
Sbjct: 431 DETELDGGPGATLGNYQQQGFEVVYDLEKERVGFAKKEC 469


>gi|297840891|ref|XP_002888327.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297334168|gb|EFH64586.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 473

 Score =  124 bits (312), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 97/370 (26%), Positives = 167/370 (45%), Gaps = 46/370 (12%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYSP 211
           G YF++I +G+PP+++ + +DTGSDI W+ C+PC EC  +++      +FD   SS+   
Sbjct: 72  GLYFTKIKLGSPPKEYHVQVDTGSDILWVNCKPCPECPSKTNLNFHLSLFDVNASSTSKK 131

Query: 212 LPCAAPQCKSLDVS-ACR-ANRCLYQVAYGDGSFTVGDLVTETVSFGN-SGSVKG----- 263
           + C    C  +  S +C+ A  C Y + Y D S + G+ + + ++    +G ++      
Sbjct: 132 VGCDDDFCSFISQSDSCQPAVGCSYHIVYADESTSEGNFIRDKLTLEQVTGDLQTGPLGQ 191

Query: 264 -IALGCGHDNEGLFVGS----AGLLGLGGGMLSLTKQIKATS-----LAYCLVDRDSPAS 313
            +  GCG D  G    S     G++G G    S+  Q+ AT       ++CL   +    
Sbjct: 192 EVVFGCGSDQSGQLGKSDSAVDGVMGFGQSNTSVLSQLAATGDAKRVFSHCL--DNVKGG 249

Query: 314 GVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGII 373
           G+            T P++ N+     Y V L G  V G A+ +PPS+       +GG I
Sbjct: 250 GIFAVGVVDSPKVKTTPMVPNQ---MHYNVMLMGMDVDGTALDLPPSIMR-----NGGTI 301

Query: 374 VDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDT--CYDFSGLRSVRVPTVSLH 431
           VD GT +       Y+SL ++ +      +P     + DT  C+ FS    V  P VS  
Sbjct: 302 VDSGTTLAYFPKVLYDSLIETILA----RQPVKLHIVEDTFQCFSFSENVDVAFPPVSFE 357

Query: 432 FGAGKALDLPAKNYLIPVDSAGTFCFAFAP------TSSALSIIGNVQQQGTRVSFDLAN 485
           F     L +   +YL  ++    +CF +          + + ++G++      V +DL N
Sbjct: 358 FEDSVKLTVYPHDYLFTLEKE-LYCFGWQAGGLTTGERTEVILLGDLVLSNKLVVYDLEN 416

Query: 486 NRVGFTPNKC 495
             +G+  + C
Sbjct: 417 EVIGWADHNC 426


>gi|18390865|ref|NP_563808.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|11993877|gb|AAG42922.1|AF329505_1 unknown protein [Arabidopsis thaliana]
 gi|20260142|gb|AAM12969.1| unknown protein [Arabidopsis thaliana]
 gi|22136092|gb|AAM91124.1| unknown protein [Arabidopsis thaliana]
 gi|332190140|gb|AEE28261.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 492

 Score =  124 bits (312), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 111/369 (30%), Positives = 173/369 (46%), Gaps = 40/369 (10%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYSP 211
           G Y++++ +GTPPR+F++ +DTGSD+ W+ C  C  C + S+       FDP  SSS S 
Sbjct: 82  GLYYTKVKLGTPPREFNVQIDTGSDVLWVSCTSCNGCPKTSELQIQLSFFDPGVSSSASL 141

Query: 212 LPCAAPQCKS--LDVSACRANR-CLYQVAYGDGSFTVGDLVTETVSFGNS-------GSV 261
           + C+  +C S     S C  N  C Y   YGDGS T G  +++ +SF           S 
Sbjct: 142 VSCSDRRCYSNFQTESGCSPNNLCSYSFKYGDGSGTSGYYISDFMSFDTVITSTLAINSS 201

Query: 262 KGIALGCGHDNEGLFV----GSAGLLGLGGGMLSLTKQIKATSLA-----YCLVDRDSPA 312
                GC +   G          G+ GLG G LS+  Q+    LA     +CL   D   
Sbjct: 202 APFVFGCSNLQSGDLQRPRRAVDGIFGLGQGSLSVISQLAVQGLAPRVFSHCL-KGDKSG 260

Query: 313 SGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGI 372
            G++     +  D V  PL+ ++     Y V L   +V GQ + I PS+F +   GDG I
Sbjct: 261 GGIMVLGQIKRPDTVYTPLVPSQP---HYNVNLQSIAVNGQILPIDPSVFTI-ATGDGTI 316

Query: 373 IVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVAL-FDT--CYDFSGLRSVRVPTVS 429
           I D GT +  L  +AY+     F++   N     G  + +++  C++ +       P VS
Sbjct: 317 I-DTGTTLAYLPDEAYS----PFIQAVANAVSQYGRPITYESYQCFEITAGDVDVFPQVS 371

Query: 430 LHFGAGKALDLPAKNYLIPVDSAGT--FCFAFAPTSS-ALSIIGNVQQQGTRVSFDLANN 486
           L F  G ++ L  + YL    S+G+  +C  F   S   ++I+G++  +   V +DL   
Sbjct: 372 LSFAGGASMVLGPRAYLQIFSSSGSSIWCIGFQRMSHRRITILGDLVLKDKVVVYDLVRQ 431

Query: 487 RVGFTPNKC 495
           R+G+    C
Sbjct: 432 RIGWAEYDC 440


>gi|242057809|ref|XP_002458050.1| hypothetical protein SORBIDRAFT_03g026140 [Sorghum bicolor]
 gi|241930025|gb|EES03170.1| hypothetical protein SORBIDRAFT_03g026140 [Sorghum bicolor]
          Length = 489

 Score =  124 bits (312), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 106/372 (28%), Positives = 164/372 (44%), Gaps = 43/372 (11%)

Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYS 210
           +G YF+ I +GTPP+++ + +DTGSDI W+ C  C +C ++S        +DPK SSS S
Sbjct: 81  TGLYFTEIKLGTPPKRYYVQVDTGSDILWVNCISCEKCPRKSGLGLDLTFYDPKASSSGS 140

Query: 211 PLPCAAPQCKSL---DVSACRANR-CLYQVAYGDGSFTVGDLVTETVSF----GNSGSVK 262
            + C    C +     +  C AN  C Y V YGDGS T G  VT+ + F    G+  +  
Sbjct: 141 TVSCDQGFCAATYGGKLPGCTANVPCEYSVMYGDGSSTTGFFVTDALQFDQVTGDGQTQP 200

Query: 263 G---IALGCGHDNEGLFVGSA-----GLLGLGGGMLSLTKQIKATS-----LAYCLVDRD 309
           G   +  GCG   +G  +GS+     G+LG G    S+  Q+ A        A+CL    
Sbjct: 201 GNATVTFGCGA-QQGGDLGSSNQALDGILGFGQANTSMLSQLAAAGKVKKIFAHCL--DT 257

Query: 310 SPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGD 369
               G+    +       T PL+ +      Y V L    VGG  +Q+P  +FE  E   
Sbjct: 258 IKGGGIFAIGNVVQPKVKTTPLVADMP---HYNVNLKSIDVGGTTLQLPAHVFETGER-- 312

Query: 370 GGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVS 429
            G I+D GT +T L    +  +  +      ++     V  F  C+ + G      PT++
Sbjct: 313 KGTIIDSGTTLTYLPELVFKEVMAAIFNKHQDIV-FHNVQDF-MCFQYPGSVDDGFPTIT 370

Query: 430 LHFGAGKALDLPAKNYLIPVDSAGTFCFAF------APTSSALSIIGNVQQQGTRVSFDL 483
            HF    AL +    Y  P +    +C  F      +     + ++G++      V +DL
Sbjct: 371 FHFEDDLALHVYPHEYFFP-NGNDMYCVGFQNGALQSKDGKDIVLMGDLVLSNKLVIYDL 429

Query: 484 ANNRVGFTPNKC 495
            N  +G+T   C
Sbjct: 430 ENQVIGWTDYNC 441


>gi|359482287|ref|XP_002263129.2| PREDICTED: aspartic proteinase-like protein 2 isoform 2 [Vitis
           vinifera]
 gi|297740017|emb|CBI30199.3| unnamed protein product [Vitis vinifera]
          Length = 502

 Score =  124 bits (311), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 108/369 (29%), Positives = 177/369 (47%), Gaps = 37/369 (10%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYSP 211
           G YF+++ +G+PPR+F++ +DTGSDI W+ C  C +C + S        FDP +SS+ S 
Sbjct: 84  GLYFTKVKLGSPPREFNVQIDTGSDILWVTCNSCNDCPRTSGLGIELSFFDPSSSSTTSL 143

Query: 212 LPCAAPQCKSL---DVSAC--RANRCLYQVAYGDGSFTVGDLVTETVSF----GNS---G 259
           + C+ P C SL     + C  ++N+C Y   YGDGS T G  V++ + F    G+S    
Sbjct: 144 VSCSHPICTSLVQTTAAECSPQSNQCSYSFHYGDGSGTTGYYVSDMLYFDTVLGDSLIAN 203

Query: 260 SVKGIALGCGHDNEGLF--VGSA--GLLGLGGGMLSLTKQIKATSL-----AYCLVDRDS 310
           S   I  GC     G    V  A  G+ G G   LS+  Q+ +  +     ++CL   + 
Sbjct: 204 SSASIVFGCSTYQSGDLTKVDKAIDGIFGFGQQDLSVVSQLSSLGITPKVFSHCL-KGEG 262

Query: 311 PASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDG 370
              G L        + + +PL+ ++   + Y + L   SV GQ + I P++F    + + 
Sbjct: 263 DGGGKLVLGEILEPNIIYSPLVPSQ---SHYNLNLQSISVNGQLLPIDPAVFA--TSNNQ 317

Query: 371 GIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSL 430
           G IVD GT +T L   AY+    +      +   T  ++  + CY  S       P VSL
Sbjct: 318 GTIVDSGTTLTYLVETAYDPFVSAITATVSS-STTPVLSKGNQCYLVSTSVDEIFPPVSL 376

Query: 431 HFGAGKALDLPAKNYLIPV---DSAGTFCFAFAPTSS-ALSIIGNVQQQGTRVSFDLANN 486
           +F  G ++ L    YL+ +   D A  +C  F   +   ++I+G++  +     +DLA+ 
Sbjct: 377 NFAGGASMVLKPGEYLMHLGFSDGAAMWCIGFQKVAEPGITILGDLVLKDKIFVYDLAHQ 436

Query: 487 RVGFTPNKC 495
           R+G+    C
Sbjct: 437 RIGWANYDC 445


>gi|226530102|ref|NP_001152414.1| PCS1 precursor [Zea mays]
 gi|195656033|gb|ACG47484.1| PCS1 [Zea mays]
          Length = 452

 Score =  124 bits (311), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 116/387 (29%), Positives = 164/387 (42%), Gaps = 65/387 (16%)

Query: 163 IGVGTPPRQFSMVLDTGSDINWLQC---RPCTECYQQSDPIFDPKTSSSYSPLPCAAPQC 219
           + VG PP+  +MVLDTGS+++WL+C   R  +    Q+   F+   SS+Y+   C++P+C
Sbjct: 64  VAVGAPPQNVTMVLDTGSELSWLRCNGSRVPSTPPPQAPAAFNGSASSTYAAAHCSSPEC 123

Query: 220 ----KSLDV----SACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGC--- 268
               + L V    +   +  C   ++Y D S   G L  +T   G +  V  +  GC   
Sbjct: 124 QWRGRDLPVPPFCAGPPSXSCRVSLSYADASSADGILAADTFLLGGAPPVXAL-FGCVTS 182

Query: 269 --------GHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNS 320
                     D+E     + GLLG+  G LS   Q      AYC+   D P   VL    
Sbjct: 183 YSSATATNSSDSEA----ATGLLGMNRGSLSFVTQTATLRFAYCIAPGDGPGLLVL---- 234

Query: 321 ARGGDAVT-------APLIR-NKKVDTF----YYVGLTGFSVGGQAVQIPPSLFEMDEAG 368
             GGD           PLI+ ++ +  F    Y V L G  VG   + IP S+   D  G
Sbjct: 235 --GGDGAALAPQLNYTPLIQISRPLPYFDRVAYSVQLEGIRVGAALLPIPKSVLAPDHTG 292

Query: 369 DGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVA------LFDTCYDFSGLR- 421
            G  +VD GT  T L   AY  L+  F+     L    G +       FD C+  S  R 
Sbjct: 293 AGQTMVDSGTQFTFLLADAYAPLKGEFLNQTSALLAPLGESDFVFQGAFDACFRASEARV 352

Query: 422 ---SVRVPTVSLHF-GAGKALDLPAKNYLIPVDSAG------TFCFAFAPTSSA---LSI 468
              S  +P V L   GA  A+      Y +P +  G       +C  F  +  A     +
Sbjct: 353 AAASXMLPEVGLVLRGAEVAVGGEKLLYRVPGERRGEGGAEAVWCLTFGNSDMAGMSAYV 412

Query: 469 IGNVQQQGTRVSFDLANNRVGFTPNKC 495
           IG+  QQ   V +DL N RVGF P +C
Sbjct: 413 IGHHHQQNVWVEYDLQNGRVGFAPARC 439


>gi|356565521|ref|XP_003550988.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 634

 Score =  124 bits (311), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 94/359 (26%), Positives = 170/359 (47%), Gaps = 32/359 (8%)

Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCA 215
           +G Y +R+ +GTPP+ F++++DTGS + ++ C  C +C +  DP F P++SS+Y P+ C 
Sbjct: 81  NGYYTTRLWIGTPPQMFALIVDTGSTVTYVPCSTCEQCGRHQDPKFQPESSSTYQPVKC- 139

Query: 216 APQCKSLDVSACRANR--CLYQVAYGDGSFTVGDLVTETVSFGNSGSV--KGIALGCGHD 271
                ++D + C ++R  C+Y+  Y + S + G L  + +SFGN   +  +    GC + 
Sbjct: 140 -----TIDCN-CDSDRMQCVYERQYAEMSTSSGVLGEDLISFGNQSELAPQRAVFGCENV 193

Query: 272 NEG-LFVGSA-GLLGLGGGMLSLT-----KQIKATSLAYCLVDRDSPASGVLEFNSARGG 324
             G L+   A G++GLG G LS+      K + + S + C    D     ++    +   
Sbjct: 194 ETGDLYSQHADGIMGLGRGDLSIMDQLVDKNVISDSFSLCYGGMDVGGGAMVLGGISPPS 253

Query: 325 DAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQ 384
           D   A    +     +Y + L    V G+ + +  ++F+    G  G ++D GT    L 
Sbjct: 254 DMAFA--YSDPVRSPYYNIDLKEIHVAGKRLPLNANVFD----GKHGTVLDSGTTYAYLP 307

Query: 385 TQAYNSLRDSFVRLAGNLKPTSG--VALFDTCYDFSGLR----SVRVPTVSLHFGAGKAL 438
             A+ + +D+ V+   +LK  SG      D C+  +G+     S   P V + F  G+  
Sbjct: 308 EAAFLAFKDAIVKELQSLKKISGPDPNYNDICFSGAGIDVSQLSKSFPVVDMVFENGQKY 367

Query: 439 DLPAKNYLIPVDSA-GTFCF-AFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            L  +NY+       G +C   F   +   +++G +  + T V +D    ++GF    C
Sbjct: 368 TLSPENYMFRHSKVRGAYCLGVFQNGNDQTTLLGGIIVRNTLVVYDREQTKIGFWKTNC 426


>gi|159464048|ref|XP_001690254.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
 gi|158284242|gb|EDP09992.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
          Length = 485

 Score =  124 bits (311), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 96/356 (26%), Positives = 160/356 (44%), Gaps = 28/356 (7%)

Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQ 218
           +++ + +GTP R FS+++DTGS I ++ C+ C+ C + +   FDP  S++   L C  P 
Sbjct: 13  FYTTLKLGTPERTFSVIIDTGSTITYIPCKDCSHCGKHTAEWFDPDKSTTAKKLACGDPL 72

Query: 219 CKSLDVS-ACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGC--GHDNEGL 275
           C     S  C  +RC Y   Y + S + G ++ +T  F +S S   +  GC  G   E  
Sbjct: 73  CNCGTPSCTCNNDRCYYSRTYAERSSSEGWMIEDTFGFPDSDSPVRLVFGCENGETGEIY 132

Query: 276 FVGSAGLLGLGGGMLSLTKQI---KATSLAYCLVDRDSPASGVL---EFNSARGGDAVTA 329
              + G++G+G    +   Q+   K     + L     P  G+L   +     G + V  
Sbjct: 133 RQMADGIMGMGNNHNAFQSQLVQRKVIEDVFSLC-FGYPKDGILLLGDVTLPEGANTVYT 191

Query: 330 PLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYN 389
           PL+ +  +  +Y V + G +V GQ +    S+F+       G ++D GT  T L T A+ 
Sbjct: 192 PLLTHLHLH-YYNVKMDGITVNGQTLAFDASVFDRGY----GTVLDSGTTFTYLPTDAFK 246

Query: 390 SLRDS---FVRLAGNLKPTSGV--ALFDTCY-----DFSGLRSVRVPTVSLHFGAGKALD 439
           ++  +   +V   G L+ T G      D C+      F  L     P     FG G  L 
Sbjct: 247 AMAKAVGDYVEKKG-LQSTPGADPQYNDICWKGAPDQFKDLDKY-FPPAEFVFGGGAKLT 304

Query: 440 LPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           LP   YL  +     +C       ++ +++G V  +   V++D  N++VGFT   C
Sbjct: 305 LPPLRYLF-LSKPAEYCLGIFDNGNSGALVGGVSVRDVVVTYDRRNSKVGFTTMAC 359


>gi|356563324|ref|XP_003549914.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 480

 Score =  124 bits (311), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 131/454 (28%), Positives = 189/454 (41%), Gaps = 88/454 (19%)

Query: 116 LITKLQLAIYNVDRHELKPAE---AQILPEDFSTPVVSGASQGSGEYFS-RIGVGTPPRQ 171
           L   L  A +N   H LK      A+      S P+    S GS    S  +G     + 
Sbjct: 29  LTHTLSKAQFNSTHHLLKSTSTRSAKRFRRQLSLPL----SPGSDYTLSFNLGPQAQAQP 84

Query: 172 FSMVLDTGSDINWLQCRP--CTECY-QQSDPIFDPKT---------------SSSYSPLP 213
            ++ +DTGSD+ W  C P  C  C  + ++P   P T               S++++  P
Sbjct: 85  ITLYMDTGSDLVWFPCAPFKCILCEGKPNEPNASPPTNITQSVAVSCKSPACSAAHNLAP 144

Query: 214 ----CAAPQC--KSLDVSACRANRCL-YQVAYGDGSFTVGDLVTETVSFGNSGSVKGIAL 266
               CAA +C  +S++ S C   +C  +  AYGDGS  +  L  +T+S  +S  ++    
Sbjct: 145 PSDLCAAARCPLESIETSDCANFKCPPFYYAYGDGSL-IARLYRDTLSL-SSLFLRNFTF 202

Query: 267 GCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS------LAYCLVDRD-------SPAS 313
           GC H          G+ G G G+LSL  Q+   S       +YCLV           P+ 
Sbjct: 203 GCAHTT---LAEPTGVAGFGRGLLSLPAQLATLSPQLGNRFSYCLVSHSFDSERVRKPSP 259

Query: 314 GVL------EFNSARGGDA--VTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMD 365
            +L      E     GG A  V   ++ N K   FY V L G +VG + +  P  L  ++
Sbjct: 260 LILGRYEEKEKEKIGGGVAEFVYTSMLENPKHPYFYTVSLIGIAVGKRTIPAPEMLRRVN 319

Query: 366 EAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAG-------NLKPTSGVALFDTCYDFS 418
             GDGG++VD GT  T L    YNS+ D F R  G        ++  +G+A    CY  +
Sbjct: 320 NRGDGGVVVDSGTTFTMLPAGFYNSVVDEFDRRVGRDNKRARKIEEKTGLA---PCYYLN 376

Query: 419 GLRSVRVPTVSLHFGAGK--ALDLPAKNYLIPVD----------SAGTFCFAFAPTSSAL 466
            +    VP ++L F  GK  ++ LP KNY                 G          + L
Sbjct: 377 SV--ADVPALTLRFAGGKNSSVVLPRKNYFYEFSDGSDGAKGKRKVGCLMLMNGGDEADL 434

Query: 467 S-----IIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           S      +GN QQQG  V +DL   RVGF   +C
Sbjct: 435 SGGPGATLGNYQQQGFEVEYDLEEKRVGFARRQC 468


>gi|357127503|ref|XP_003565419.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
           distachyon]
          Length = 486

 Score =  124 bits (311), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 117/378 (30%), Positives = 165/378 (43%), Gaps = 51/378 (13%)

Query: 158 EYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDP---IFDPKTSSSYSPLPC 214
           EY   I VGTPP +   + DTGSD+ W++C+        + P    F P  SS+Y  + C
Sbjct: 109 EYLMAIEVGTPPVRVLAIADTGSDLVWVKCKGKDNDNNSTAPPSVYFVPSASSTYGRVGC 168

Query: 215 AAPQCKSLDVSA-CRAN-RCLYQVAYGDGSFTVGDLVTETVSFGNSG------------- 259
               C++L  +A C  +  C Y  +YGDGS   G L TET +F                 
Sbjct: 169 DTKACRALSSAASCSPDGSCEYLYSYGDGSRASGQLSTETFTFSTIADSSKTNSHGNNNN 228

Query: 260 --------SVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS-----LAYCLV 306
                    +  +  GC     G F     L+GLGGG +SL  Q+ AT+      +YCL 
Sbjct: 229 NSSSHGQVEIAKLDFGCSTTTTGTFRADG-LVGLGGGPVSLASQLGATTSLGRKFSYCLA 287

Query: 307 D-RDSPASGVLEFNS---ARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLF 362
              ++ AS  L F S        A + PLI   +V+T+Y + L   +V G   + P +  
Sbjct: 288 PYANTNASSALNFGSRAVVSEPGAASTPLITG-EVETYYTIALDSINVAG--TKRPTT-- 342

Query: 363 EMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLR- 421
               A    IIVD GT +T L +     L     R     +  S   + D CYD SG+R 
Sbjct: 343 ----AAQAHIIVDSGTTLTYLDSALLTPLVKDLTRRIKLPRAESPEKILDLCYDISGVRG 398

Query: 422 --SVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSS--ALSIIGNVQQQGT 477
             ++ +P V+L  G G  + L   N  + V   G  C A   TS   ++SI+GN+ QQ  
Sbjct: 399 EDALGIPDVTLVLGGGGEVTLKPDNTFVVVQE-GVLCLALVATSERQSVSILGNIAQQNL 457

Query: 478 RVSFDLANNRVGFTPNKC 495
            V +DL    V F    C
Sbjct: 458 HVGYDLEKGTVTFAAADC 475


>gi|255581508|ref|XP_002531560.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223528821|gb|EEF30826.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 407

 Score =  124 bits (310), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 115/367 (31%), Positives = 169/367 (46%), Gaps = 39/367 (10%)

Query: 163 IGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQC--K 220
           + VGTPP+  SMV+DTGS+++WL C   T         F+   S SY P+PC++  C  +
Sbjct: 35  LTVGTPPQNVSMVIDTGSELSWLYCNKTTTTTSYPT-TFNQTRSISYRPIPCSSSTCTNQ 93

Query: 221 SLDVS---ACRANR-CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHD----N 272
           + D S   +C +N  C   ++Y D S + G+L ++T   G S  + G+  GC       N
Sbjct: 94  TRDFSIPASCDSNSLCHATLSYADASSSEGNLASDTFHMGAS-DIPGMVFGCMDSVFSSN 152

Query: 273 EGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGDAVT---A 329
                 + GL+G+  G LS   Q+     +YC+   D   SG+L    +    AV     
Sbjct: 153 SDEDSKNTGLMGMNRGSLSFVSQMGFPKFSYCISGTD--FSGMLLLGESNFTWAVPLNYT 210

Query: 330 PLIR-NKKVDTF----YYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQ 384
           PL++ +  +  F    Y V L G  V  + + IP S+FE D  G G  +VD GT  T L 
Sbjct: 211 PLVQISTPLPYFDRIAYTVQLEGIKVSDRLLPIPKSVFEPDHTGAGQTMVDSGTQFTFLL 270

Query: 385 TQAYNSLRDSFV-RLAGNLKPTSGVAL-----FDTCYDFSGLRSV--RVPTVSLHF-GAG 435
             AY +LR  F+ +  G L+             D CY     + V  R+PTVSL F GA 
Sbjct: 271 GPAYTALRSEFLNQTTGFLRVLEDPDFVFQGAMDLCYRVPISQRVLPRLPTVSLVFNGAE 330

Query: 436 KALDLPAKNYLIPVDSAGT---FCFAFAPTSSALS----IIGNVQQQGTRVSFDLANNRV 488
             +      Y +P +  G     C +F   S  L     +IG+  QQ   + FDL  +R+
Sbjct: 331 MTVADERVLYRVPGEIRGNDSVHCLSFG-NSDLLGVEAYVIGHHHQQNVWMEFDLERSRI 389

Query: 489 GFTPNKC 495
           G    +C
Sbjct: 390 GLAQVRC 396


>gi|255549236|ref|XP_002515672.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223545215|gb|EEF46724.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 492

 Score =  123 bits (309), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 107/379 (28%), Positives = 163/379 (43%), Gaps = 44/379 (11%)

Query: 150 SGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPK 204
           SG     G Y++++G+GTP + + + +DTGSDI W+ C  C EC + S       +++ K
Sbjct: 77  SGRPDTVGLYYAKVGIGTPSKDYYVQVDTGSDIMWVNCIQCRECPRTSSLGMELTLYNIK 136

Query: 205 TSSSYSPLPCAAPQCKSLD---VSACRANR-CLYQVAYGDGSFTVGDLVTETVSFGN-SG 259
            S S   +PC    C  ++   +S C AN  C Y   YGDGS T G  V + V +   SG
Sbjct: 137 DSVSGKLVPCDEEFCYEVNGGPLSGCTANMSCPYLEIYGDGSSTAGYFVKDVVQYDRVSG 196

Query: 260 SVK------GIALGCGHDNEGLFVGSA-----GLLGLGGGMLSLTKQIKATS-----LAY 303
            ++       +  GCG    G    ++     G+LG G    S+  Q+ AT       A+
Sbjct: 197 DLQTTSSNGSVIFGCGARQSGDLGPTSEEALDGILGFGKSNSSMISQLAATRKVKKIFAH 256

Query: 304 CLVDRDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFE 363
           CL   +    G+              PLI N+     Y V +T   VG   + +P   F 
Sbjct: 257 CLDGIN--GGGIFAIGHVVQPKVNMTPLIPNQP---HYNVNMTAVQVGEDFLHLPTEEF- 310

Query: 364 MDEAGD-GGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRS 422
             EAGD  G I+D GT +  L    Y  L    +    +LK    V    TC+ +SG   
Sbjct: 311 --EAGDRKGAIIDSGTTLAYLPEIVYEPLVSKIISQQPDLK-VHIVRDEYTCFQYSGSVD 367

Query: 423 VRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSA------LSIIGNVQQQG 476
              P V+ HF     L +    YL P +  G +C  +  +         ++++G++    
Sbjct: 368 DGFPNVTFHFENSVFLKVHPHEYLFPFE--GLWCIGWQNSGMQSRDRRNMTLLGDLVLSN 425

Query: 477 TRVSFDLANNRVGFTPNKC 495
             V +DL N  +G+T   C
Sbjct: 426 KLVLYDLENQAIGWTEYNC 444


>gi|125552105|gb|EAY97814.1| hypothetical protein OsI_19735 [Oryza sativa Indica Group]
          Length = 424

 Score =  123 bits (309), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 113/389 (29%), Positives = 165/389 (42%), Gaps = 89/389 (22%)

Query: 155 GSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPC----------TECYQQSDPIFDPK 204
           G  +Y +  G+G PP+    V+DTGSD+ W QC  C            C+ Q+ P ++  
Sbjct: 74  GKTQYIASYGIGDPPQPAEAVVDTGSDLVWTQCSTCRLPAAAAAGGGGCFPQNLPYYNFS 133

Query: 205 TSSSYSPLPC---------AAPQCKSLDVSACRAN-RCLYQVAYGDGSFTVGDLVTETVS 254
            S +   +PC          AP+           +  C+   +YG G   +G L T+  +
Sbjct: 134 LSRTARAVPCDDDDGALCGVAPETAGCARGGGSGDDACVVAASYGAG-VALGVLGTDAFT 192

Query: 255 FGNSGSVKGIALGCGHDNE---GLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSP 311
           F +S SV  +A GC        G   G++G++GLG G LSL               +DSP
Sbjct: 193 FPSSSSVT-LAFGCVSQTRISPGALTGASGIIGLGRGALSLNP-------------KDSP 238

Query: 312 ASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGD-- 369
            S                         TFYY+ L G + G   V +P   F++ EA    
Sbjct: 239 FS-------------------------TFYYLPLVGLAAGNATVALPAGAFDLREAAPKV 273

Query: 370 --GGIIVDCGTAITRLQTQAYNSLRDSFVRL---AGNLKPTS---GVAL---FDTCYDFS 418
             GG ++D G+  TRL   A+ +L     R    +G+L P     G AL    +   D  
Sbjct: 274 WAGGALIDSGSPFTRLVDPAHRALTKELARQLRGSGSLVPPPAKLGGALELCVEAGDDGD 333

Query: 419 GLRSVRVPTVSLHF----GAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSA--------L 466
            L +  VP++ L F    G G+ L +PA+ Y   V+ A T+C A   ++S          
Sbjct: 334 SLAAAAVPSLVLRFDDGVGGGRELVIPAEKYWARVE-ASTWCMAVVSSASGNATLPTNET 392

Query: 467 SIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           +IIGN  QQ  RV +DLAN  + F P  C
Sbjct: 393 TIIGNFMQQDMRVLYDLANGLLSFQPANC 421


>gi|255566006|ref|XP_002523991.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223536718|gb|EEF38359.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 455

 Score =  123 bits (309), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 109/368 (29%), Positives = 164/368 (44%), Gaps = 36/368 (9%)

Query: 155 GSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPC 214
           G G Y  ++ +GTPP +    +DTGS++ W+ C  C +C+ QS  IF+P  SS+Y   PC
Sbjct: 94  GDGNYLMKLLIGTPPTEIHAAIDTGSNVIWIPCINCKDCFNQSSSIFNPLASSTYQDAPC 153

Query: 215 AAPQCKSLDVSACRANRCLY------QVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGC 268
            + QC++   S    N CLY      Q+   +G   V D +T T S G    +      C
Sbjct: 154 DSYQCETTSSSCQSDNVCLYSCDEKHQLNCPNGRIAV-DTMTLTSSDGRPFPLPYSDFVC 212

Query: 269 GHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS---LAYCLVDRDS--PAS---GVLEFNS 320
           G+     F G  G++GLG G LSLT ++   S    +YCL D  S  P+    G+  F S
Sbjct: 213 GNSIYKTFAG-VGVIGLGRGALSLTSKLYHLSDGKFSYCLADYYSKQPSKINFGLQSFIS 271

Query: 321 ARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGD---GGIIVDCG 377
               + V+  L  ++     YYV L G SVG +       L+ +D+      G +++D G
Sbjct: 272 DDDLEVVSTTLGHHRHSGN-YYVTLEGISVGEKRQD----LYYVDDPFAPPVGNMLIDSG 326

Query: 378 TAITRLQTQAYNSLRDSF-VRLAGNLKPTSGVALFDTCYD--------FSGLRSVRVPTV 428
           T  T L    Y+ L  +    +  N +     + F    D        F     ++ P +
Sbjct: 327 TMFTLLPKDFYDYLWSTVSYAIPENPQNHPHNSRFPFSMDNTLKLSPCFWYYPELKFPKI 386

Query: 429 SLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALS-IIGNVQQQGTRVSFDLANNR 487
           ++HF     ++L   N  I V +    CFAFA T    S + G+ QQ    + +DL    
Sbjct: 387 TIHFTDAD-VELSDDNSFIRV-AEDVVCFAFAATQPGQSTVYGSWQQMNFILGYDLKRGT 444

Query: 488 VGFTPNKC 495
           V F    C
Sbjct: 445 VSFKRTDC 452


>gi|297849132|ref|XP_002892447.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297338289|gb|EFH68706.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 493

 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 111/369 (30%), Positives = 172/369 (46%), Gaps = 40/369 (10%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYSP 211
           G Y++++ +GTPPR+F++ +DTGSD+ W+ C  C  C + S+       FDP  SSS S 
Sbjct: 82  GLYYTKVKLGTPPREFNVQIDTGSDVLWVSCTSCNGCPKTSELQIQLSFFDPGVSSSASL 141

Query: 212 LPCAAPQCKS--LDVSACRANR-CLYQVAYGDGSFTVGDLVTETVSFGNS-------GSV 261
           + C+  +C S     S C  N  C Y   YGDGS T G  +++ +SF           S 
Sbjct: 142 VSCSDRRCYSNFQTESGCSPNNLCSYSFKYGDGSGTSGFYISDFMSFDTVITSTLAINSS 201

Query: 262 KGIALGCGHDNEGLFV----GSAGLLGLGGGMLSLTKQIKATSLA-----YCLVDRDSPA 312
                GC +   G          G+ GLG G LS+  Q+    LA     +CL   D   
Sbjct: 202 APFVFGCSNLQTGDLQRPRRAVDGIFGLGQGSLSVISQLAVQGLAPRVFSHCL-KGDKSG 260

Query: 313 SGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGI 372
            G++     +  D V  PL+ ++     Y V L   +V GQ + I PS+F +   GDG I
Sbjct: 261 GGIMVLGQIKRPDTVYTPLVPSQP---HYNVNLQSIAVNGQILPIDPSVFTI-ATGDGTI 316

Query: 373 IVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVAL-FDT--CYDFSGLRSVRVPTVS 429
           I D GT +  L  +AY+     F++   N     G  + +++  C++ +       P VS
Sbjct: 317 I-DTGTTLAYLPDEAYS----PFIQAIANAVSQYGRPITYESYQCFEITAGDVDVFPEVS 371

Query: 430 LHFGAGKALDLPAKNYLIPVDSAGT--FCFAFAPTSS-ALSIIGNVQQQGTRVSFDLANN 486
           L F  G ++ L    YL    S+G+  +C  F   S   ++I+G++  +   V +DL   
Sbjct: 372 LSFAGGASMVLRPHAYLQIFSSSGSSIWCIGFQRMSHRRITILGDLVLKDKVVVYDLVRQ 431

Query: 487 RVGFTPNKC 495
           R+G+    C
Sbjct: 432 RIGWAEYDC 440


>gi|449467979|ref|XP_004151699.1| PREDICTED: probable aspartic protease At2g35615-like, partial
           [Cucumis sativus]
          Length = 209

 Score =  123 bits (308), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 60/123 (48%), Positives = 82/123 (66%), Gaps = 3/123 (2%)

Query: 155 GSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPC 214
           GSGEY   + +GTPP  +  + DTGSD+ W QC PC +CY+QS PIFDP  S+S+S +PC
Sbjct: 88  GSGEYLMSVSIGTPPVDYIGMADTGSDLMWAQCLPCLKCYKQSRPIFDPLKSTSFSHVPC 147

Query: 215 AAPQCKSLDVSACRANR-CLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNE 273
            +  CK++D S C A   C Y   YGD ++T GDL  E ++ G+S SVK + +GCGH++ 
Sbjct: 148 NSQNCKAIDDSHCGAQGVCDYSYTYGDQTYTKGDLGFEKITIGSS-SVKSV-IGCGHESG 205

Query: 274 GLF 276
           G F
Sbjct: 206 GGF 208


>gi|297819684|ref|XP_002877725.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297323563|gb|EFH53984.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 633

 Score =  122 bits (307), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 98/361 (27%), Positives = 168/361 (46%), Gaps = 35/361 (9%)

Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCA 215
           +G Y +R+ +GTPP+ F++++D+GS + ++ C  C +C +  DP F P+ SS+Y P+ C 
Sbjct: 91  NGYYTTRLWIGTPPQMFALIVDSGSTVTYVPCSDCEQCGKHQDPKFQPELSSTYQPVKCN 150

Query: 216 APQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSV--KGIALGCGHDNE 273
              C   D       +C+Y+  Y + S + G L  + +SFGN   +  +    GC     
Sbjct: 151 M-DCNCDD----DKEQCVYEREYAEHSSSKGVLGEDLISFGNESQLTPQRAVFGCETVET 205

Query: 274 G-LFVGSA-GLLGLGGGMLSLTKQIK-----ATSLAYCLVDRDSPASGVLEFNSARGGDA 326
           G L+   A G++GLG G LSL  Q+      + S   C    D     ++      G D 
Sbjct: 206 GDLYSQRADGIIGLGQGDLSLVDQLVDKGLISNSFGLCYGGMDVGGGSMI----LGGFDY 261

Query: 327 VTAPLIRNKKVDT--FYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQ 384
            +  +  +   D   +Y + LTG  V G+ + +   +F+    G+ G ++D GT    L 
Sbjct: 262 PSDMIFTDSDPDRSPYYNIDLTGIRVAGKKLSLNSRVFD----GEHGAVLDSGTTYAYLP 317

Query: 385 TQAYNSLRDSFVRLAGNLKPTSG--VALFDTCY------DFSGLRSVRVPTVSLHFGAGK 436
             A+ +  ++ +R    LK   G      DTC+      D S L  +  P+V + F +G+
Sbjct: 318 DAAFAAFEEAVMREVSPLKQIDGPDPNFKDTCFLVAASNDVSELSKI-FPSVEMIFKSGQ 376

Query: 437 ALDLPAKNYLIPVDSA-GTFCFAFAPT-SSALSIIGNVQQQGTRVSFDLANNRVGFTPNK 494
           +  L  +NY+       G +C    P      +++G +  + T V +D  N++VGF    
Sbjct: 377 SWLLSPENYMFRHSKVHGAYCLGVFPNGKDHTTLLGGIVVRNTLVVYDRENSKVGFWRTN 436

Query: 495 C 495
           C
Sbjct: 437 C 437


>gi|326512066|dbj|BAJ96014.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 485

 Score =  122 bits (307), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 111/406 (27%), Positives = 176/406 (43%), Gaps = 47/406 (11%)

Query: 121 QLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGS 180
            L  ++  RH    A A  LP        +G    +G YF++IG+GTP + + + +DTGS
Sbjct: 48  NLRAHDARRHGRSLAAAVDLPLG-----GNGLPTETGLYFTQIGIGTPAKSYYVQVDTGS 102

Query: 181 DINWLQCRPCTECYQQSD-----PIFDPKTSSSYSPLPCAAPQCKSLD---VSACR-ANR 231
           DI W+ C  C  C ++S       ++DP  SSS + + C    C +     + +C  A  
Sbjct: 103 DILWVNCVFCDTCPRKSGLGIELTLYDPSGSSSGTGVTCGQDFCVATHGGVIPSCVPAAP 162

Query: 232 CLYQVAYGDGSFTVGDLVTETVSF----GNSGSV---KGIALGCGHDNEGLFVGSA---- 280
           C Y ++YGDGS T G  VT+ + +    GNS +      I  GCG    G    S+    
Sbjct: 163 CQYSISYGDGSSTTGFFVTDFLQYNQVSGNSQTTLANTSITFGCGAKIGGDLGSSSQALD 222

Query: 281 GLLGLGGGMLSLTKQIKATS-----LAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNK 335
           G+LG G    S+  Q+ A        A+CL   +    G+            T PL+   
Sbjct: 223 GILGFGQSNSSMLSQLAAAGKVRKVFAHCLDTIN--GGGIFAIGDVVQPKVSTTPLVPGM 280

Query: 336 KVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSF 395
                Y V L    VGG  +Q+P ++F++ E+   G I+D GT +  L    YN++    
Sbjct: 281 P---HYNVNLEAIDVGGVKLQLPTNIFDIGES--KGTIIDSGTTLAYLPGVVYNAIMSKV 335

Query: 396 VRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTF 455
               G++ P      F  C+ +SG      P ++ HF  G  L++   +YL    +   +
Sbjct: 336 FAQYGDM-PLKNDQDFQ-CFRYSGSVDDGFPIITFHFEGGLPLNIHPHDYLF--QNGELY 391

Query: 456 CFAF------APTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           C  F            + ++G++      V +DL N  +G+T   C
Sbjct: 392 CMGFQTGGLQTKDGKDMVLLGDLAFSNRLVLYDLENQVIGWTDYNC 437


>gi|356513737|ref|XP_003525567.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial [Glycine
           max]
          Length = 455

 Score =  122 bits (307), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 117/392 (29%), Positives = 168/392 (42%), Gaps = 81/392 (20%)

Query: 173 SMVLDTGSDINWLQCRP--CTECYQQSDPIFDPKTSSSYS-PLPCAAPQC---------- 219
           ++ +DTGSD+ W  C P  C  C  +  P   P  +++ S  + C +P C          
Sbjct: 64  TLYMDTGSDLVWFPCAPFKCILC--EGKPNASPPVNTTRSVAVSCKSPACSAAHNLASPS 121

Query: 220 ----------KSLDVSACRANRCL-YQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGC 268
                     +S++ S C   +C  +  AYGDGS  +  L  +T+S  +S  ++    GC
Sbjct: 122 DLCAAARCPLESIETSDCANFKCPPFYYAYGDGSL-IARLYRDTLSL-SSLFLRNFTFGC 179

Query: 269 GHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS------LAYCLVDRD-------SPASGV 315
            +          G+ G G G+LSL  Q+   S       +YCLV           P+  +
Sbjct: 180 AYTT---LAEPTGVAGFGRGLLSLPAQLATLSPQLGNRFSYCLVSHSFDSERVRKPSPLI 236

Query: 316 L-------EFNSARGGDA--VTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDE 366
           L       E     GG A  V  P++ N K   FY VGL G SVG + V  P  L  ++ 
Sbjct: 237 LGRYEEEEEEEKVGGGVAEFVYTPMLENPKHPYFYTVGLIGISVGKRIVPAPEMLRRVNN 296

Query: 367 AGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAG-------NLKPTSGVALFDTCYDFSG 419
            GDGG++VD GT  T L    YNS+ D F R  G        ++  +G+A    CY  + 
Sbjct: 297 RGDGGVVVDSGTTFTMLPAGFYNSVVDEFDRGVGRVNERARKIEEKTGLA---PCYYLNS 353

Query: 420 LRSVRVPTVSLHFGAGK-ALDLPAKNYLIPV----DSA------GTFCFAFAPTSSALS- 467
           +    VP ++L F  G  ++ LP KNY        D+A      G          + LS 
Sbjct: 354 V--AEVPVLTLRFAGGNSSVVLPRKNYFYEFLDGRDAAKGKRRVGCLMLMNGGDEAELSG 411

Query: 468 ----IIGNVQQQGTRVSFDLANNRVGFTPNKC 495
                +GN QQQG  V +DL   RVGF   +C
Sbjct: 412 GPGATLGNYQQQGFEVEYDLEEKRVGFARRQC 443


>gi|168022164|ref|XP_001763610.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162685103|gb|EDQ71500.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 308

 Score =  122 bits (307), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 97/296 (32%), Positives = 142/296 (47%), Gaps = 32/296 (10%)

Query: 120 LQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQ--GSGEYFSRIGVGTPPRQFSMVLD 177
           + L  Y+  R   +    ++LPE  S P+ SG +     G Y++RI +GTPP+QF + +D
Sbjct: 1   MSLDHYHTLRKHDQRRLRRMLPEVVSFPI-SGDNDIFAMGLYYTRISLGTPPQQFYVDVD 59

Query: 178 TGSDINWLQCRPCTECYQQSD-PI----FDPKTSSSYSPLPCAAPQCKSLDVS-ACRANR 231
           TGS++ W++C PCT C    D P+    FDP+ S++   + C   +C  L+    C   R
Sbjct: 60  TGSNVAWVKCAPCTGCEHSGDVPVPMSTFDPRKSTTKISISCTDAECGVLNKKLQCSPER 119

Query: 232 --CLYQVAYGDGSFTVGDLVTETVSFG-----NSGSVKGIA---LGCGHDNEGLFVGSAG 281
             C Y + YGDGS T G  + +  +F      NS +  G A    GCG    G +    G
Sbjct: 120 LSCPYSLLYGDGSSTAGYYLNDVFTFNQVPSDNSTAKSGTARLVFGCGGTQTGSW-SVDG 178

Query: 282 LLGLGGGMLSLTKQ-----IKATSLAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNKK 336
           LLG G   +SL  Q     I     A+CL   D    G L   + R  D V  P++  + 
Sbjct: 179 LLGFGPTTVSLPNQLAQQNISVNIFAHCL-QGDVSGRGSLVIGTIREPDLVYTPMVFGED 237

Query: 337 VDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLR 392
               Y V L    + G+ V  P S F+++    GG+I+D GT +T L   AY+  R
Sbjct: 238 ---HYNVQLLNIGISGRNVTTPAS-FDLEYT--GGVIIDSGTTLTYLVQPAYDEFR 287


>gi|413944378|gb|AFW77027.1| hypothetical protein ZEAMMB73_570500 [Zea mays]
          Length = 484

 Score =  122 bits (306), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 111/370 (30%), Positives = 157/370 (42%), Gaps = 48/370 (12%)

Query: 155 GSGEYFSRIGVGTPPRQFSMVLDTGSD-INWLQCRPCTECYQQSDPIFDPKTSSSYSPLP 213
           G+ EY    G GTP ++  +  DT +     LQC PC      +D  FDP  SSS S +P
Sbjct: 134 GAFEYHVVAGFGTPMQKLPVGFDTTTTGATLLQCTPCG---SGADHAFDPSASSSVSQVP 190

Query: 214 CAAPQCK--------SLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIA 265
           C +P C         S  +S    N  L    +   + T+    + TV       ++GIA
Sbjct: 191 CGSPDCPFHGCSGRPSCTLSVSFNNTLLGNATFFTDTLTLTPSSSATVDKFRFACLEGIA 250

Query: 266 LGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS------LAYCLVDRDSPAS----GV 315
            G   D      GSAG+L L     SL  ++ A+S       +YCL     PAS    G 
Sbjct: 251 PGPAED------GSAGILDLSRNSHSLPSRLVASSPPHAVAFSYCL-----PASTADVGF 299

Query: 316 LEFNSAR----GGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGG 371
           L   + +    G      PL  +      Y V L G  +GG  + IPP+    D+     
Sbjct: 300 LSLGATKPELLGRKVSYTPLRGSPSNGNLYVVDLVGLGLGGPDLPIPPAAIAGDD----- 354

Query: 372 IIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLH 431
            I++  T  T L+ Q Y  LRDSF +          +   DTCY+F+GL +  VP V+L 
Sbjct: 355 TILELHTTFTYLKPQVYKVLRDSFRKSMSEYPAAPPLGSLDTCYNFTGLDAFSVPAVTLK 414

Query: 432 FGAGKALDLPAKNYLIPVDSAGTF---CFAFAPTSSAL---SIIGNVQQQGTRVSFDLAN 485
           F  G  +DL     +   D    F   C AF          ++IG++ Q  T V +D+  
Sbjct: 415 FAGGADVDLWMDEMMYFTDPDNHFSIGCLAFVAQDDDCDGGTVIGSMAQMSTEVVYDVRG 474

Query: 486 NRVGFTPNKC 495
            +VGF P +C
Sbjct: 475 GKVGFVPYRC 484


>gi|15229663|ref|NP_190574.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|6522926|emb|CAB62113.1| putative protein [Arabidopsis thaliana]
 gi|53828539|gb|AAU94379.1| At3g50050 [Arabidopsis thaliana]
 gi|55733749|gb|AAV59271.1| At3g50050 [Arabidopsis thaliana]
 gi|332645100|gb|AEE78621.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 632

 Score =  122 bits (306), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 97/362 (26%), Positives = 170/362 (46%), Gaps = 37/362 (10%)

Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCA 215
           +G Y +R+ +GTPP+ F++++D+GS + ++ C  C +C +  DP F P+ SS+Y P+ C 
Sbjct: 90  NGYYTTRLWIGTPPQMFALIVDSGSTVTYVPCSDCEQCGKHQDPKFQPEMSSTYQPVKC- 148

Query: 216 APQCKSLDVSACRANR--CLYQVAYGDGSFTVGDLVTETVSFGNSGSV--KGIALGCGHD 271
                ++D + C  +R  C+Y+  Y + S + G L  + +SFGN   +  +    GC   
Sbjct: 149 -----NMDCN-CDDDREQCVYEREYAEHSSSKGVLGEDLISFGNESQLTPQRAVFGCETV 202

Query: 272 NEG-LFVGSA-GLLGLGGGMLSLTKQIK-----ATSLAYCLVDRDSPASGVLEFNSARGG 324
             G L+   A G++GLG G LSL  Q+      + S   C    D     ++      G 
Sbjct: 203 ETGDLYSQRADGIIGLGQGDLSLVDQLVDKGLISNSFGLCYGGMDVGGGSMI----LGGF 258

Query: 325 DAVTAPLIRNKKVDT--FYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITR 382
           D  +  +  +   D   +Y + LTG  V G+ + +   +F+    G+ G ++D GT    
Sbjct: 259 DYPSDMVFTDSDPDRSPYYNIDLTGIRVAGKQLSLHSRVFD----GEHGAVLDSGTTYAY 314

Query: 383 LQTQAYNSLRDSFVRLAGNLKPTSG--VALFDTCYDFSGLRSVR-----VPTVSLHFGAG 435
           L   A+ +  ++ +R    LK   G      DTC+  +    V       P+V + F +G
Sbjct: 315 LPDAAFAAFEEAVMREVSTLKQIDGPDPNFKDTCFQVAASNYVSELSKIFPSVEMVFKSG 374

Query: 436 KALDLPAKNYLIPVDSA-GTFCFAFAPT-SSALSIIGNVQQQGTRVSFDLANNRVGFTPN 493
           ++  L  +NY+       G +C    P      +++G +  + T V +D  N++VGF   
Sbjct: 375 QSWLLSPENYMFRHSKVHGAYCLGVFPNGKDHTTLLGGIVVRNTLVVYDRENSKVGFWRT 434

Query: 494 KC 495
            C
Sbjct: 435 NC 436


>gi|357164972|ref|XP_003580227.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 492

 Score =  122 bits (306), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 119/389 (30%), Positives = 154/389 (39%), Gaps = 71/389 (18%)

Query: 173 SMVLDTGSDINWLQCRP--CTECYQQSDPIFDPKTSSSYSP------LPCAAPQCKSLDV 224
           S+ LDTGSD+ W  C P  C  C  +  P  +  +S+   P      +PCA+P C +   
Sbjct: 99  SLFLDTGSDLVWFPCAPFTCMLCEGKPTPPGNNNSSNPLPPPTDSRRIPCASPFCSAAHS 158

Query: 225 SA-----CRANRC------------------LYQVAYGDGSFTVGDLVTETVSFGNSGSV 261
           SA     C A RC                  LY  AYGDGS  V  L    V    S +V
Sbjct: 159 SAPPADLCAAARCPLDDIETGSCAASHACPPLY-YAYGDGSL-VARLRRGRVGIAASVAV 216

Query: 262 KGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLA----YCLVDR----DSP-A 312
           +     C H   G  VG AG    G G LSL  Q+   +L+    YCLV      D P  
Sbjct: 217 ENFTFACAHTALGEPVGVAGF---GRGPLSLPAQLAPAALSGRFSYCLVAHSFRADRPIR 273

Query: 313 SGVLEFNSARGGDA------VTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDE 366
              L    + G D       V  PL+ N K   FY V L   SVGG  +   P L  +  
Sbjct: 274 PSPLILGRSPGEDPASETGIVYTPLLHNPKHPYFYSVALEAVSVGGTRIPARPELGRVGR 333

Query: 367 AGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDT-----CYDFSGLR 421
           AGDGG++VD GT  T L  + Y  + + F R     +     A  D      CY +    
Sbjct: 334 AGDGGMVVDSGTTFTMLPNETYARVAEEFGRAMAAARFERAEAAEDQTGLAPCYYYDHDA 393

Query: 422 SV-------RVPTVSLHFGAGKALDLPAKNYLIPVDSA---GTFCFAFA-----PTSSAL 466
           S         VP +++HF     + LP +NY +   S       C               
Sbjct: 394 SAAEEGSARAVPPLAMHFRGEATVVLPRRNYFMGFRSEERRRVGCLMLMNGGEDDGGGPA 453

Query: 467 SIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
             +GN QQQG  V +D+   RVGF   +C
Sbjct: 454 GTLGNFQQQGFEVVYDVDAGRVGFARRRC 482


>gi|357120129|ref|XP_003561782.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 452

 Score =  122 bits (306), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 111/383 (28%), Positives = 157/383 (40%), Gaps = 61/383 (15%)

Query: 163 IGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSL 222
           + VGTPP+  +MVLDTGS+++WL C        + D  FD   SSSY+P+PC++P C  L
Sbjct: 67  VAVGTPPQNVTMVLDTGSELSWLLCN-----GSRHDAPFDASASSSYAPVPCSSPACTWL 121

Query: 223 --DVSA---CRANRCLYQVAYGDGSFTVGDLVTETVSFGNS--GSVKGIALGCGHDNEGL 275
             D+     C ++ C   ++Y D S   G L  +T   G+S   ++ G         +  
Sbjct: 122 GRDLPVRPFCDSSACRVSLSYADASSADGLLAADTFLLGSSPMPALFGCITSYSSSTDPS 181

Query: 276 FVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGDAVTAPLIR-- 333
                GLLG+  G LS   Q      AYC+     P  G+L      GG+    PL    
Sbjct: 182 ETPPTGLLGMNRGGLSFVTQTATRRFAYCIAAGQGP--GILLL----GGNDTETPLTSPP 235

Query: 334 ------------NKKVDTF----YYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCG 377
                       ++ +  F    Y V L G  VG   + IP  L   D  G G  +VD G
Sbjct: 236 QQQLNYTPLVEISQPLPYFDRAAYTVQLEGIRVGSALLAIPKHLLTPDHTGAGQTMVDSG 295

Query: 378 TAITRLQTQAYNSLRDSFVR-----LAGNLKPTSGVAL-----FDTCYDFSGLRSVR--- 424
           T  T L   AY +L+  F       L G L P           FD C+  +  R      
Sbjct: 296 TRFTFLLPDAYAALKAEFANQLTRSLDGGLAPLGEPGFVFQGAFDACFRGTEARVSAAAA 355

Query: 425 ---VPTVSLHFGAGKALDLPAKNYLIPV------DSAGTFCFAFAPTSSA---LSIIGNV 472
              +P V L     + +   A+  L  V      +  G +C  F  +  A     +IG+ 
Sbjct: 356 GGLLPEVGLVLRGAEVVVAGAEKLLYRVPGERRGEGEGVWCLTFGSSDMAGVSAYVIGHH 415

Query: 473 QQQGTRVSFDLANNRVGFTPNKC 495
            QQ   V +DL N R+GF   +C
Sbjct: 416 HQQDVWVEYDLRNARLGFAAARC 438


>gi|224128838|ref|XP_002320434.1| predicted protein [Populus trichocarpa]
 gi|222861207|gb|EEE98749.1| predicted protein [Populus trichocarpa]
          Length = 485

 Score =  122 bits (305), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 102/372 (27%), Positives = 159/372 (42%), Gaps = 44/372 (11%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYSP 211
           G Y+++IG+GTP + + + +DTGSDI W+ C  C EC + S       +++   S +   
Sbjct: 76  GLYYAKIGIGTPTKDYYVQVDTGSDIMWVNCIQCRECPKTSSLGIDLTLYNINESDTGKL 135

Query: 212 LPCAAPQCKSLD---VSACRANR-CLYQVAYGDGSFTVGDLVTETVSFGN-SGSVK---- 262
           +PC    C  ++   +  C AN  C Y   YGDGS T G  V + V +   SG +K    
Sbjct: 136 VPCDQEFCYEINGGQLPGCTANMSCPYLEIYGDGSSTAGYFVKDVVQYARVSGDLKTTAA 195

Query: 263 --GIALGCGHDNEGLFVGSA-----GLLGLGGGMLSLTKQIKATS-----LAYCLVDRDS 310
              +  GCG    G    S      G+LG G    S+  Q+  T       A+CL    +
Sbjct: 196 NGSVIFGCGARQSGDLGSSNEEALDGILGFGKSNSSMISQLAVTGKVKKIFAHCL--DGT 253

Query: 311 PASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGD- 369
              G+              PLI N+     Y V +T   VG + + +P  +F   EAGD 
Sbjct: 254 NGGGIFVIGHVVQPKVNMTPLIPNQP---HYNVNMTAVQVGHEFLSLPTDVF---EAGDR 307

Query: 370 GGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVS 429
            G I+D GT +  L    Y  L    +    +LK  +    + TC+ +S       P V+
Sbjct: 308 KGAIIDSGTTLAYLPEMVYKPLVSKIISQQPDLKVHTVRDEY-TCFQYSDSLDDGFPNVT 366

Query: 430 LHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTS------SALSIIGNVQQQGTRVSFDL 483
            HF     L +    YL P +  G +C  +  +         ++++G++      V +DL
Sbjct: 367 FHFENSVILKVYPHEYLFPFE--GLWCIGWQNSGVQSRDRRNMTLLGDLVLSNKLVLYDL 424

Query: 484 ANNRVGFTPNKC 495
            N  +G+T   C
Sbjct: 425 ENQAIGWTEYNC 436


>gi|222628951|gb|EEE61083.1| hypothetical protein OsJ_14969 [Oryza sativa Japonica Group]
          Length = 367

 Score =  122 bits (305), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 66/184 (35%), Positives = 102/184 (55%), Gaps = 13/184 (7%)

Query: 118 TKLQLAIYNVDRHELKPAEAQILPEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLD 177
           ++ +LA   + R E   A   ++ E   TP++       GEY  ++G+GTPP +F+  +D
Sbjct: 55  SRYRLAGIGMARGEAASARKAVVAE---TPIMPAG----GEYLVKLGIGTPPYKFTAAID 107

Query: 178 TGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRAN---RCLY 234
           T SD+ W QC+PCT CY Q DP+F+P+ SS+Y+ LPC++  C  LDV  C  +    C Y
Sbjct: 108 TASDLIWTQCQPCTGCYHQVDPMFNPRVSSTYAALPCSSDTCDELDVHRCGHDDDESCQY 167

Query: 235 QVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLF--VGSAGLLGLGGGMLSL 292
              Y   + T G L  + +  G   + +G+A GC   + G      ++G++GLG G LSL
Sbjct: 168 TYTYSGNATTEGTLAVDKLVIGED-AFRGVAFGCSTSSTGGAPPPQASGVVGLGRGPLSL 226

Query: 293 TKQI 296
             Q+
Sbjct: 227 VSQL 230



 Score = 49.7 bits (117), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 37/129 (28%), Positives = 57/129 (44%), Gaps = 5/129 (3%)

Query: 371 GIIVDCGTAITRLQTQAYNSLRDSF---VRLAGNLKPTSGVALFDTCYDFSGLRSVRVPT 427
           G+I+D  + IT L+   Y+ L +     +RL      + G+ L     D      V VP 
Sbjct: 236 GMIIDIASTITFLEASLYDELVNDLEVEIRLPRGTGSSLGLDLCFILPDGVAFDRVYVPA 295

Query: 428 VSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSS-ALSIIGNVQQQGTRVSFDLANN 486
           V+L F  G+ L L           +G  C       + ++SI+GN QQQ  +V ++L   
Sbjct: 296 VALAFD-GRWLRLDKARLFAEDRESGMMCLMVGRAEAGSVSILGNFQQQNMQVLYNLRRG 354

Query: 487 RVGFTPNKC 495
           RV F  + C
Sbjct: 355 RVTFVQSPC 363


>gi|293332531|ref|NP_001169558.1| uncharacterized protein LOC100383437 precursor [Zea mays]
 gi|224030089|gb|ACN34120.1| unknown [Zea mays]
 gi|413925069|gb|AFW65001.1| hypothetical protein ZEAMMB73_160528 [Zea mays]
          Length = 491

 Score =  122 bits (305), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 106/381 (27%), Positives = 163/381 (42%), Gaps = 61/381 (16%)

Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYS 210
           +G Y++ I +GTPP+ + + +DTGSDI W+ C  C +C  +S       ++DPK SS+ S
Sbjct: 83  TGLYYTEIKLGTPPKHYYVQVDTGSDILWVNCITCEQCPHKSGLGLDLTLYDPKASSTGS 142

Query: 211 PLPCAAPQCKSL---DVSACRANR-CLYQVAYGDGSFTVGDLVTETVSFGN---SGSVK- 262
            + C    C +     +  C AN  C Y V YGDGS T+G  VT+ + F      G  + 
Sbjct: 143 MVMCDQAFCAATFGGKLPKCGANVPCEYSVTYGDGSSTIGSFVTDALQFDQVTRDGQTQP 202

Query: 263 ---GIALGCGHDNEGLFVGSA-----GLLGLGGGMLSLTKQIKATS-----LAYCLVDRD 309
               +  GCG   +G  +GS+     G+LG G    S+  Q+          A+CL    
Sbjct: 203 ANASVIFGCGA-QQGGDLGSSNQALDGILGFGEANTSMLSQLTTAGKVKKIFAHCL--DT 259

Query: 310 SPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGD 369
               G+            T PL+ +K     Y V L    VGG  +Q+P  +FE  E   
Sbjct: 260 IKGGGIFSIGDVVQPKVKTTPLVADKP---HYNVNLKTIDVGGTTLQLPAHIFEPGEK-- 314

Query: 370 GGIIVDCGTAITRL--------QTQAYNSLRD-SFVRLAGNLKPTSGVALFDTCYDFSGL 420
            G I+D GT +T L            +N  +D +F  + G L           C+ + G 
Sbjct: 315 KGTIIDSGTTLTYLPELVFKEVMLAVFNKHQDITFHDVQGFL-----------CFQYPGS 363

Query: 421 RSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSA------LSIIGNVQQ 474
                PT++ HF    AL +    Y    +    +C  F   +S       + ++G++  
Sbjct: 364 VDDGFPTITFHFEDDLALHVYPHEYFF-ANGNDVYCVGFQNGASQSKDGKDIVLMGDLVL 422

Query: 475 QGTRVSFDLANNRVGFTPNKC 495
               V +DL N  +G+T   C
Sbjct: 423 SNKLVIYDLENRVIGWTDYNC 443


>gi|326525377|dbj|BAK07958.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 463

 Score =  121 bits (304), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 101/359 (28%), Positives = 149/359 (41%), Gaps = 42/359 (11%)

Query: 170 RQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRA 229
           + + + LD G  ++W+QC PC  C  Q  P+FDP  S ++S +P                
Sbjct: 109 QNYQLALDMGGGLSWMQCLPCRHCLLQMSPVFDPTKSPTFSNIPAHNTVWCRPPYQPLAN 168

Query: 230 NRCLYQVAYGDGSFTVGDLVTETVSF--GNSGSV--KGIALGCGHDNEGLF--VGSAGLL 283
             C + +AY D +   G L  +T SF  GN   V    I  GC H  E        AG+L
Sbjct: 169 GACGFDIAYRDNTHASGYLARDTFSFPAGNDDFVPLSAIVFGCAHQTEHFKNQRAVAGIL 228

Query: 284 GLGGG-----MLSLTKQI---KATSLAYCLVDRDSPASGVLEF----------NSARGGD 325
           GLG G       + TKQ+        +YC           L F          N  R   
Sbjct: 229 GLGMGPAGKPPTAFTKQVLPAHGGRFSYCPFVPGMSMYSYLRFGSDIPSHPPPNVHRQST 288

Query: 326 AVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQ-IPPSLFEMDEAGDGGIIVDCGTAITRLQ 384
            V AP   ++     Y+V L G SVG   +  + P++F  +  G GG +VD GT +T   
Sbjct: 289 PVLAPAHNSEA----YFVKLAGVSVGANRLSGVTPAMFRRNAHGAGGCVVDIGTRMTAFI 344

Query: 385 TQAY----NSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDL 440
             AY    +++R    R   ++    G    +TC          +P+++LHF  G  L +
Sbjct: 345 HSAYVHIDHAVRQHLQRRGAHIVVVRG----NTCVQQPAPHHDVLPSMTLHFENGAWLRV 400

Query: 441 PAKNYLIPVDSAGTF--CFAFAPTSSALSIIGNVQQQGTRVSFDLANN--RVGFTPNKC 495
             ++  +P    G    CF F  +S+ L++IG  QQ   R  FDL +    + F P  C
Sbjct: 401 MPEHVFMPFVVGGHHYQCFGFV-SSTDLTVIGARQQVNHRFIFDLHDTIPIMSFNPEDC 458


>gi|297848856|ref|XP_002892309.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297338151|gb|EFH68568.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 484

 Score =  121 bits (304), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 102/375 (27%), Positives = 166/375 (44%), Gaps = 50/375 (13%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYSP 211
           G Y+++IG+GTP + + + +DTGSDI W+ C  C +C ++S       +++   S S   
Sbjct: 78  GLYYAKIGIGTPAKSYYVQVDTGSDIMWVNCIQCKQCPRRSTLGIELTLYNIDESDSGKL 137

Query: 212 LPCAAPQCKSLD---VSACRANR-CLYQVAYGDGSFTVGDLVTETVSFGN-SGSVK---- 262
           + C    C  +    +S C+AN  C Y   YGDGS T G  V + V + + +G +K    
Sbjct: 138 VSCDDDFCYQISGGPLSGCKANMSCPYLEIYGDGSSTAGYFVKDVVQYDSVAGDLKTQTA 197

Query: 263 --GIALGCGHDNEGLFVGSA-----GLLGLGGGMLSLTKQIKATS-----LAYCLVDRDS 310
              +  GCG    G    S      G+LG G    S+  Q+ ++       A+CL  R+ 
Sbjct: 198 NGSVIFGCGARQSGDLDSSNEEALDGILGFGKANSSMISQLASSGRVKKIFAHCLDGRN- 256

Query: 311 PASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGD- 369
              G+              PL+ N+     Y V +T   VG + + IP  LF   + GD 
Sbjct: 257 -GGGIFAIGRVVQPKVNMTPLVPNQP---HYNVNMTAVQVGQEFLNIPADLF---QPGDR 309

Query: 370 GGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFD---TCYDFSGLRSVRVP 426
            G I+D GT +  L    Y  L    V+   + +P   V + D    C+ +SG      P
Sbjct: 310 KGAIIDSGTTLAYLPEIIYEPL----VKKITSQEPALKVHIVDKDYKCFQYSGRVDEGFP 365

Query: 427 TVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSA------LSIIGNVQQQGTRVS 480
            V+ HF     L +   +YL P +  G +C  +  ++        ++++G++      V 
Sbjct: 366 NVTFHFENSVFLRVYPHDYLFPYE--GMWCIGWQNSAMQSRDRRNMTLLGDLVLSNKLVL 423

Query: 481 FDLANNRVGFTPNKC 495
           +DL N  +G+T   C
Sbjct: 424 YDLENQLIGWTEYNC 438


>gi|242079765|ref|XP_002444651.1| hypothetical protein SORBIDRAFT_07g025440 [Sorghum bicolor]
 gi|241941001|gb|EES14146.1| hypothetical protein SORBIDRAFT_07g025440 [Sorghum bicolor]
          Length = 493

 Score =  121 bits (304), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 100/375 (26%), Positives = 157/375 (41%), Gaps = 49/375 (13%)

Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYS 210
           +G Y++ + +GTPP++F + +DTGSDI W+ C  C +C  +S       ++DPK SS+ S
Sbjct: 85  TGLYYTEVRLGTPPKRFYVQVDTGSDILWVNCITCDQCPHKSGLGLDLTLYDPKASSTGS 144

Query: 211 PLPCAAPQCKSL---DVSACRANR-CLYQVAYGDGSFTVGDLVTETVSFGN---SGSVK- 262
            + C    C       +  C AN  C Y V YGDGS TVG  V + + F      G  + 
Sbjct: 145 TVMCDQGFCADTFGGRLPKCSANVPCEYSVTYGDGSSTVGSFVNDALQFDQVTGDGQTQP 204

Query: 263 ---GIALGCGHDNEGLFVGSA----GLLGLGGGMLSLTKQIKATS-----LAYCLVDRDS 310
               +  GCG    G    S+    G+LG G    S+  Q+          A+CL     
Sbjct: 205 ANASVIFGCGAQQGGDLGSSSQALDGILGFGEANTSMLSQLATAGKVKKIFAHCL--DTI 262

Query: 311 PASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDG 370
              G+            T PL+ +K     Y V L    VGG  +++P  +F+  E    
Sbjct: 263 KGGGIFAIGDVVQPKVKTTPLVADKP---HYNVNLKTIDVGGTTLELPADIFKPGEK--R 317

Query: 371 GIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDT----CYDFSGLRSVRVP 426
           G I+D GT +T L    +  +      +         +   D     C+++SG      P
Sbjct: 318 GTIIDSGTTLTYLPELVFKKV------MLAVFNKHQDITFHDVQDFLCFEYSGSVDDGFP 371

Query: 427 TVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAF------APTSSALSIIGNVQQQGTRVS 480
           T++ HF    AL +    Y  P +    +C  F      +     + ++G++      V 
Sbjct: 372 TLTFHFEDDLALHVYPHEYFFP-NGNDVYCVGFQNGALQSKDGKDIVLMGDLVLSNKLVV 430

Query: 481 FDLANNRVGFTPNKC 495
           +DL N  +G+T   C
Sbjct: 431 YDLENRVIGWTDYNC 445


>gi|168030587|ref|XP_001767804.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162680886|gb|EDQ67318.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 399

 Score =  121 bits (304), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 102/370 (27%), Positives = 171/370 (46%), Gaps = 42/370 (11%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTEC--YQQS---------DPIFDPKT 205
           G Y SR+ +GTPP +F++++DTGS + ++ C  CT C  +Q S         DP F P+ 
Sbjct: 38  GYYTSRVFIGTPPNEFALIVDTGSTVTYVPCSSCTHCGHHQASFSTHRLFCRDPRFKPEN 97

Query: 206 SSSYSPLPCAAPQCKSLDVSACRAN--RCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKG 263
           SSSY  + C +  C +     C +N  +C Y+  Y + S + G L  + + FG +  ++ 
Sbjct: 98  SSSYQKIGCRSSDCIT---GLCDSNSHQCKYERMYAEMSTSKGVLGKDLLDFGPASRLQS 154

Query: 264 --IALGCGHDNEG-LFVGSA-GLLGLGGGMLSLTKQIKAT-------SLAYCLVDRDSPA 312
             ++ GC     G L++  A G++GLG G LS+  Q+          SL Y  +D +   
Sbjct: 155 QLLSFGCETAESGDLYLQVADGIMGLGRGPLSIVDQLVGNGAIEDSFSLCYGGMD-EGGG 213

Query: 313 SGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGI 372
           S VL    A  G         + +   +Y + LT   V G ++++  ++F     G  G 
Sbjct: 214 SMVLGAIPAPSGMVFAKS---DPRRSNYYNLELTEIQVQGASLKLDSNVFN----GKFGT 266

Query: 373 IVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSG--VALFDTCYDFSGLRSVRV----P 426
           I+D GT    L  +A+ +  D+ V   G+L+   G      D CY  +G  +  +    P
Sbjct: 267 ILDSGTTYAYLPDRAFEAFTDAVVAQLGSLQAVDGPDPNYPDICYAGAGTDTKELGKHFP 326

Query: 427 TVSLHFGAGKALDLPAKNYLIP-VDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLAN 485
            V   F   + + L  +NYL       G +C  F     A +++G +  +   V++D  N
Sbjct: 327 LVDFVFAENQKVSLAPENYLFKHTKVPGAYCLGFFKNQDATTLLGGIIVRNMLVTYDRYN 386

Query: 486 NRVGFTPNKC 495
           +++GF    C
Sbjct: 387 HQIGFLKTNC 396


>gi|356537015|ref|XP_003537027.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 476

 Score =  121 bits (303), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 107/377 (28%), Positives = 165/377 (43%), Gaps = 35/377 (9%)

Query: 148 VVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFD 202
           VV  + QG+ +  S    G     F++ +DTGSDI W+ C  C+ C Q S        FD
Sbjct: 57  VVDFSVQGTSDPNSVGMYGXXXXXFNVQIDTGSDILWVNCNTCSNCPQSSQLGIELNFFD 116

Query: 203 PKTSSSYSPLPCAAPQCKSLDVSAC-----RANRCLYQVAYGDGSFTVGDLVTETVSFG- 256
              SS+ + +PC+   C S    A      R N+C Y   YGDGS T G  V++ + F  
Sbjct: 117 TVGSSTAALIPCSDLICTSGVQGAAAECSPRVNQCSYTFQYGDGSGTSGYYVSDAMYFNL 176

Query: 257 ------NSGSVKGIALGCGHDNEGLFVGS----AGLLGLGGGMLSLTKQIKATSL----- 301
                    S   I  GC     G    +     G+ G G G LS+  Q+ +  +     
Sbjct: 177 IMGQPPAVNSTATIVFGCSISQSGDLTKTDKAVDGIFGFGPGPLSVVSQLSSQGITPKVF 236

Query: 302 AYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSL 361
           ++CL   D    G+L          V +PL+ ++     Y + L   +V GQ + I P++
Sbjct: 237 SHCL-KGDGNGGGILVLGEILEPSIVYSPLVPSQP---HYNLNLQSIAVNGQPLPINPAV 292

Query: 362 FEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLR 421
           F +     GG IVDCGT +  L  +AY+ L  + +  A +       +  + CY  S   
Sbjct: 293 FSISN-NRGGTIVDCGTTLAYLIQEAYDPLVTA-INTAVSQSARQTNSKGNQCYLVSTSI 350

Query: 422 SVRVPTVSLHFGAGKALDLPAKNYLIP---VDSAGTFCFAFAPTSSALSIIGNVQQQGTR 478
               P VSL+F  G ++ L  + YL+    +D A  +C  F       SI+G++  +   
Sbjct: 351 GDIFPLVSLNFEGGASMVLKPEQYLMHNGYLDGAEMWCVGFQKLQEGASILGDLVLKDKI 410

Query: 479 VSFDLANNRVGFTPNKC 495
           V +D+A  R+G+    C
Sbjct: 411 VVYDIAQQRIGWANYDC 427


>gi|356568907|ref|XP_003552649.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 490

 Score =  121 bits (303), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 107/381 (28%), Positives = 166/381 (43%), Gaps = 48/381 (12%)

Query: 150 SGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPK 204
           SG     G Y+++IG+GTPP+ + + +DTGSDI W+ C  C EC  +S+      ++D K
Sbjct: 76  SGRPDAVGLYYAKIGIGTPPKNYYLQVDTGSDIMWVNCIQCKECPTRSNLGMDLTLYDIK 135

Query: 205 TSSSYSPLPCAAPQCKSLD---VSACRAN-RCLYQVAYGDGSFTVGDLVTETVSFGN-SG 259
            SSS   +PC    CK ++   ++ C AN  C Y   YGDGS T G  V + V +   SG
Sbjct: 136 ESSSGKFVPCDQEFCKEINGGLLTGCTANISCPYLEIYGDGSSTAGYFVKDIVLYDQVSG 195

Query: 260 SVK------GIALGCGHDNEGLFVGS-----AGLLGLGGGMLSLTKQIKATS-----LAY 303
            +K       I  GCG    G    S      G+LG G    S+  Q+ ++       A+
Sbjct: 196 DLKTDSANGSIVFGCGARQSGDLSSSNEEALGGILGFGKANSSMISQLASSGKVKKMFAH 255

Query: 304 CLVDRDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFE 363
           CL        G+              PL+ ++     Y V +T   VG   + +     +
Sbjct: 256 CL--NGVNGGGIFAIGHVVQPKVNMTPLLPDQP---HYSVNMTAVQVGHAFLSLST---D 307

Query: 364 MDEAGD-GGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFD--TCYDFSGL 420
               GD  G I+D GT +  L    Y  L    +    +LK  +   L D  TC+ +S  
Sbjct: 308 TSTQGDRKGTIIDSGTTLAYLPEGIYEPLVYKIISQHPDLKVRT---LHDEYTCFQYSES 364

Query: 421 RSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPT------SSALSIIGNVQQ 474
                P V+ +F  G +L +   +YL P  S   +C  +  +      S  ++++G++  
Sbjct: 365 VDDGFPAVTFYFENGLSLKVYPHDYLFP--SGDFWCIGWQNSGTQSRDSKNMTLLGDLVL 422

Query: 475 QGTRVSFDLANNRVGFTPNKC 495
               V +DL N  +G+T   C
Sbjct: 423 SNKLVFYDLENQVIGWTEYNC 443


>gi|115459640|ref|NP_001053420.1| Os04g0535200 [Oryza sativa Japonica Group]
 gi|113564991|dbj|BAF15334.1| Os04g0535200 [Oryza sativa Japonica Group]
 gi|116310090|emb|CAH67110.1| H0502G05.1 [Oryza sativa Indica Group]
 gi|116310464|emb|CAH67468.1| OSIGBa0159I10.13 [Oryza sativa Indica Group]
 gi|215715343|dbj|BAG95094.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215765807|dbj|BAG87504.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215767550|dbj|BAG99778.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|218195278|gb|EEC77705.1| hypothetical protein OsI_16781 [Oryza sativa Indica Group]
          Length = 492

 Score =  120 bits (302), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 125/410 (30%), Positives = 165/410 (40%), Gaps = 86/410 (20%)

Query: 158 EYFSRIGVGTP--PRQFSMVLDTGSDINWLQCRP--CTECYQQSDPIFDPKTSSSYSPLP 213
           +Y   + VG P      S+ LDTGSD+ W  C P  C  C  ++ P       +  SPLP
Sbjct: 87  DYTLSLSVGPPSTASSVSLFLDTGSDLVWFPCAPFTCMLCEGKATP-----GGNHSSPLP 141

Query: 214 ---------CAAPQCKSLDVSA-----CRANRC-----------------LYQVAYGDGS 242
                    CA+P C +   SA     C A RC                 LY  AYGDGS
Sbjct: 142 PPIDSRRISCASPLCSAAHSSAPTSDLCAAARCPLDAIETDSCASHACPPLY-YAYGDGS 200

Query: 243 FTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT--- 299
             V +L    V    S +V+     C H      VG AG    G G LSL  Q+  +   
Sbjct: 201 L-VANLRRGRVGLAASMAVENFTFACAHTALAEPVGVAGF---GRGPLSLPAQLAPSLSG 256

Query: 300 SLAYCLVD---------RDSPASGVLEFNSARGG----DAVTAPLIRNKKVDTFYYVGLT 346
             +YCLV          R SP       ++A  G    D V  PL+ N K   FY V L 
Sbjct: 257 RFSYCLVAHSFRADRLIRSSPLILGRSTDAAAIGASETDFVYTPLLHNPKHPYFYSVALE 316

Query: 347 GFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAG------ 400
             SVGG+ +Q  P L ++D  G+GG++VD GT  T L +  +  + D F R         
Sbjct: 317 AVSVGGKRIQAQPELGDVDRDGNGGMVVDSGTTFTMLPSDTFARVADEFARAMAAARFTR 376

Query: 401 --NLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSA---GTF 455
               +  +G+A    CY +S      VP V+LHF     + LP +NY +   S       
Sbjct: 377 AEGAEAQTGLA---PCYHYSPSDRA-VPPVALHFRGNATVALPRRNYFMGFKSEEGRSVG 432

Query: 456 CFAFAPT----------SSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           C                      +GN QQQG  V +D+   RVGF   +C
Sbjct: 433 CLMLMNVGGNNDDGEDGGGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRC 482


>gi|38605896|emb|CAD41523.2| OSJNBb0020O11.8 [Oryza sativa Japonica Group]
          Length = 519

 Score =  120 bits (302), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 125/410 (30%), Positives = 165/410 (40%), Gaps = 86/410 (20%)

Query: 158 EYFSRIGVGTP--PRQFSMVLDTGSDINWLQCRP--CTECYQQSDPIFDPKTSSSYSPLP 213
           +Y   + VG P      S+ LDTGSD+ W  C P  C  C  ++ P       +  SPLP
Sbjct: 87  DYTLSLSVGPPSTASSVSLFLDTGSDLVWFPCAPFTCMLCEGKATP-----GGNHSSPLP 141

Query: 214 ---------CAAPQCKSLDVSA-----CRANRC-----------------LYQVAYGDGS 242
                    CA+P C +   SA     C A RC                 LY  AYGDGS
Sbjct: 142 PPIDSRRISCASPLCSAAHSSAPTSDLCAAARCPLDAIETDSCASHACPPLY-YAYGDGS 200

Query: 243 FTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKAT--- 299
             V +L    V    S +V+     C H      VG AG    G G LSL  Q+  +   
Sbjct: 201 L-VANLRRGRVGLAASMAVENFTFACAHTALAEPVGVAGF---GRGPLSLPAQLAPSLSG 256

Query: 300 SLAYCLVD---------RDSPASGVLEFNSARGG----DAVTAPLIRNKKVDTFYYVGLT 346
             +YCLV          R SP       ++A  G    D V  PL+ N K   FY V L 
Sbjct: 257 RFSYCLVAHSFRADRLIRSSPLILGRSTDAAAIGASETDFVYTPLLHNPKHPYFYSVALE 316

Query: 347 GFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAG------ 400
             SVGG+ +Q  P L ++D  G+GG++VD GT  T L +  +  + D F R         
Sbjct: 317 AVSVGGKRIQAQPELGDVDRDGNGGMVVDSGTTFTMLPSDTFARVADEFARAMAAARFTR 376

Query: 401 --NLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSA---GTF 455
               +  +G+A    CY +S      VP V+LHF     + LP +NY +   S       
Sbjct: 377 AEGAEAQTGLA---PCYHYSPSDRA-VPPVALHFRGNATVALPRRNYFMGFKSEEGRSVG 432

Query: 456 CFAFAPT----------SSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           C                      +GN QQQG  V +D+   RVGF   +C
Sbjct: 433 CLMLMNVGGNNDDGEDGGGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRC 482


>gi|115484725|ref|NP_001067506.1| Os11g0215400 [Oryza sativa Japonica Group]
 gi|77549255|gb|ABA92052.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113644728|dbj|BAF27869.1| Os11g0215400 [Oryza sativa Japonica Group]
          Length = 428

 Score =  120 bits (302), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 100/359 (27%), Positives = 162/359 (45%), Gaps = 39/359 (10%)

Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSS--------SYS 210
           Y   +G+GTP +   + +DTGS  +W+ C  C  C+         ++++        S  
Sbjct: 82  YVISVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNPRTFLQSRSTTCAKVSCGTSMC 140

Query: 211 PLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGH 270
            L  + P C+  +        C ++V+Y DGS + G L  +T++F +   + G + GC  
Sbjct: 141 LLGGSDPHCQDSE----NYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPGFSFGCNM 196

Query: 271 DNEGL--FVGSAGLLGLGGGMLSLTKQIKAT--SLAYCLVDRD------SPASGVLEFNS 320
           D+ G   F    GLLG+G G +S+ KQ   T    +YCL  +       S  +G      
Sbjct: 197 DSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDCFSYCLPLQKSERGFFSKTTGYFSLGK 256

Query: 321 -ARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTA 379
            A   D     ++  KK    ++V LT  SV G+ + + PS+F        G++ D G+ 
Sbjct: 257 VATRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSVFSRK-----GVVFDSGSE 311

Query: 380 ITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDT---CYDFSGLRSVRVPTVSLHFGAGK 436
           ++ +  +A + L      L   LK   G A  ++   CYD   +    +P +SLHF  G 
Sbjct: 312 LSYIPDRALSVLSQRIRELL--LK--RGAAEEESERNCYDMRSVDEGDMPAISLHFDDGA 367

Query: 437 ALDLPAKNYLIP--VDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPN 493
             DL +    +   V     +C AFAPT S +SIIG++ Q    V +DL    +G  P+
Sbjct: 368 RFDLGSHGVFVERSVQEQDVWCLAFAPTES-VSIIGSLMQTSKEVVYDLKRQLIGIGPS 425


>gi|302757345|ref|XP_002962096.1| hypothetical protein SELMODRAFT_403622 [Selaginella moellendorffii]
 gi|300170755|gb|EFJ37356.1| hypothetical protein SELMODRAFT_403622 [Selaginella moellendorffii]
          Length = 506

 Score =  120 bits (301), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 102/381 (26%), Positives = 174/381 (45%), Gaps = 48/381 (12%)

Query: 148 VVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPI------- 200
           +++G+S     Y+++IGVG P +  + ++DTGSDI W +C+ C  C  + + I       
Sbjct: 77  MLNGSSTSDATYYAQIGVGHPVQFLNAIVDTGSDILWFKCKLCQGCSSKKNVIVCSSIIM 136

Query: 201 ------FDPKTSSSYSPLPCAAPQCKSLDVSACRANR--CLYQVAYGDGSFTVGDLVTET 252
                 +DP+ S + SP  C+ P C   +  +CR N   C Y ++Y D S + G    + 
Sbjct: 137 QGPITLYDPELSITASPATCSDPLCS--EGGSCRGNNNSCAYDISYEDTSSSTGIYFRDV 194

Query: 253 VSFGNSGSVK-GIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAY-----CLV 306
           V  G+  S+   + LGC     GL+    G++G G   +S+  Q+ A + +Y     CL 
Sbjct: 195 VHLGHKASLNTTMFLGCATSISGLW-PVDGIMGFGRSKVSVPNQLAAQAGSYNIFYHCLS 253

Query: 307 DRDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDE 366
                   ++   +    + V  P++ N   D  Y V L   SV  +A+ I  S FE + 
Sbjct: 254 GEKEGGGILVLGKNDEFPEMVYTPMLAN---DIVYNVKLVSLSVNSKALPIEASEFEYNA 310

Query: 367 -AGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPT-----SGVALFDTCYDFSGL 420
             G+GG I+D GT+     ++A      +  +    + PT     SG   F +  D + +
Sbjct: 311 TVGNGGTIIDSGTSSATFPSKALALFVKAVSKFTTAI-PTAPLESSGSPCFISISDRNSV 369

Query: 421 RSVRVPTVSLHFGAGKALDLPAKNYLIPVDS-----------AGTFCFAFAPTSSALSII 469
             V  P V+L F  G  ++L A NYL  V S               C +++  +S  +I+
Sbjct: 370 E-VDFPNVTLKFDGGATMELTAHNYLEAVVSRKLSESTHFQGVRLVCISWSVGNS--TIL 426

Query: 470 GNVQQQGTRVSFDLANNRVGF 490
           G+   +   V +D+  +R+G+
Sbjct: 427 GDAILKDKVVVYDMEKSRIGW 447


>gi|255576064|ref|XP_002528927.1| pepsin A, putative [Ricinus communis]
 gi|223531629|gb|EEF33456.1| pepsin A, putative [Ricinus communis]
          Length = 493

 Score =  120 bits (301), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 117/399 (29%), Positives = 164/399 (41%), Gaps = 76/399 (19%)

Query: 167 TPPRQFSMVLDTGSDINWLQCRP--CTECYQQSDPIF----DPKTSSS------------ 208
            PP+  S+ LDTGSD+ W  C+P  C  C  +++        P+ SS+            
Sbjct: 91  NPPQHVSLYLDTGSDLVWFPCKPFECILCEGKAENTTASTPPPRLSSTARSVHCKSSACS 150

Query: 209 --YSPLP----CAAPQC--KSLDVSACRANRC-LYQVAYGDGSFTVGDLVTETVSF---G 256
             +S LP    CA   C  +S++ S C +  C  +  AYGDGS  V  L  +++      
Sbjct: 151 AAHSNLPTSDLCAIADCPLESIETSDCHSFSCPSFYYAYGDGSL-VARLYHDSIKLPLAT 209

Query: 257 NSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKA------TSLAYCLV---- 306
            S S+     GC H      VG AG    G G+LSL  Q+ +         +YCLV    
Sbjct: 210 PSLSLHNFTFGCAHTALAEPVGVAGF---GRGVLSLPAQLASFAPQLGNRFSYCLVSHSF 266

Query: 307 --DR-DSPASGVLEFNSARGGDA-------VTAPLIRNKKVDTFYYVGLTGFSVGGQAVQ 356
             DR   P+  +L  +  +           V   ++ N K   FY VGL G S+G + + 
Sbjct: 267 NSDRLRLPSPLILGHSDDKEKRVNKDDVQFVYTSMLDNPKHPYFYCVGLEGISIGKKKIP 326

Query: 357 IPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNL----KPTSGVALFD 412
            P  L  +D  G GG++VD GT  T L    YNS+   F    G +    K         
Sbjct: 327 APEFLKRVDREGSGGVVVDSGTTFTMLPASLYNSVVAEFDNRVGRVYERAKEVEDKTGLG 386

Query: 413 TCYDFSGLRSVRVPTVSLHF-GAGKALDLPAKNYLIPVDSAG--------TFCFAFAP-- 461
            CY +  +  V +P++ LHF G   ++ LP KNY       G          C       
Sbjct: 387 PCYYYDTV--VNIPSLVLHFVGNESSVVLPKKNYFYDFLDGGDGVRRKRRVGCLMLMNGG 444

Query: 462 -----TSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
                T    + +GN QQ G  V +DL   RVGF   KC
Sbjct: 445 EEAELTGGPGATLGNYQQHGFEVVYDLEQRRVGFARRKC 483


>gi|357125326|ref|XP_003564345.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
           distachyon]
          Length = 506

 Score =  120 bits (301), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 103/371 (27%), Positives = 173/371 (46%), Gaps = 40/371 (10%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYSP 211
           G YF+R+ +G P +++ + +DTGSDI W+ C PCT C   S        F+P +SS+ S 
Sbjct: 87  GLYFTRVKLGNPAKEYFVQIDTGSDILWVACSPCTGCPTSSGLNIQLEFFNPDSSSTSSR 146

Query: 212 LPCAAPQCKS---LDVSACRANR-----CLYQVAYGDGSFTVGDLVTETVSF----GN-- 257
           +PC+  +C +      + C+++      C Y   YGDGS T G  V++T+ F    GN  
Sbjct: 147 IPCSDDRCTAALQTGEAVCQSSDSPSSPCGYTFTYGDGSGTSGFYVSDTMYFDTVMGNEQ 206

Query: 258 -SGSVKGIALGCGHDNEGLFVGS----AGLLGLGGGMLSLTKQ-----IKATSLAYCLVD 307
            + S   +  GC +   G  + +     G+ G G   LS+  Q     +   + ++CL  
Sbjct: 207 TANSSASVVFGCSNSQSGDLMKTDRAVDGIFGFGQHQLSVVSQLYSLGVSPKTFSHCLKG 266

Query: 308 RDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEA 367
            D+   G+L          V  PL+ ++     Y + L   +V GQ + I  SLF     
Sbjct: 267 SDN-GGGILVLGEIVEPGLVFTPLVPSQP---HYNLNLESIAVSGQKLPIDSSLFATSNT 322

Query: 368 GDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPT 427
              G IVD GT +  L   AY+   ++ +  A +    S V+    C+  +       PT
Sbjct: 323 --QGTIVDSGTTLVYLVDGAYDPFINA-IAAAVSPSVRSVVSKGIQCFVTTSSVDSSFPT 379

Query: 428 VSLHFGAGKALDLPAKNYLI---PVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLA 484
            +L+F  G ++ +  +NYL+    VD+   +C  +   S  ++I+G++  +     +DLA
Sbjct: 380 ATLYFKGGVSMTVKPENYLLQQGSVDNNVLWCIGWQ-RSQGITILGDLVLKDKIFVYDLA 438

Query: 485 NNRVGFTPNKC 495
           N R+G+    C
Sbjct: 439 NMRMGWADYDC 449


>gi|54290725|dbj|BAD62395.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
          Length = 500

 Score =  120 bits (301), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 102/372 (27%), Positives = 159/372 (42%), Gaps = 40/372 (10%)

Query: 153 SQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQC---RPCTECYQQSDPIFDPKTSSSY 209
           + G  +Y   +G GTP +Q +M  DTG  I+ ++C   RP   C   +   FDP  SS++
Sbjct: 140 APGFHDYTVVVGYGTPAQQLAMAFDTGLGISLVRCAACRPGAPCDGLAS--FDPSRSSTF 197

Query: 210 SPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCG 269
           +P+PC +P C+S   S    +  L         F  G +  + ++   S SV     GC 
Sbjct: 198 APVPCGSPDCRSGCSSGSTPSCPLTSF-----PFLSGAVAQDVLTLTPSASVDDFTFGCV 252

Query: 270 HDNEGLFVGSAGLLGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASGVLEFNSARGGDA 326
             + G  +G+AGLL L     S+  ++ A    + +YCL    + + G L    A     
Sbjct: 253 EGSSGEPLGAAGLLDLSRDSRSVASRLAADAGGTFSYCLPLSTTSSHGFLAIGEADVPHN 312

Query: 327 VT------APLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAI 380
            T      APL+ +      Y + L G S+GG+ + IPP       A    +++D     
Sbjct: 313 RTARVTAVAPLVYDPAFPNHYVIDLAGVSLGGRDIPIPPHAATASAA----MVLDTALPY 368

Query: 381 TRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLR-SVRVPTVSLHF-----GA 434
           T ++   Y  LRD+F R          +   DTCY+F+G+R  V +P V L F     G 
Sbjct: 369 TYMKPSMYAPLRDAFRRAMARYPRAPAMGDLDTCYNFTGVRHEVLIPLVHLTFRGIGGGG 428

Query: 435 GKALDLPAKNYLIPVDSAGTF----CFAFAPTSS-------ALSIIGNVQQQGTRVSFDL 483
           G  +     + +  +   G F    C AFA   S          ++G + Q    V  D+
Sbjct: 429 GGQVLGLGADQMFYMSEPGNFFSVTCLAFAALPSDGDAEAPLAMVMGTLAQSSMEVVHDV 488

Query: 484 ANNRVGFTPNKC 495
              ++GF P  C
Sbjct: 489 PGGKIGFIPGSC 500


>gi|77555282|gb|ABA98078.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
          Length = 409

 Score =  120 bits (300), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 86/265 (32%), Positives = 131/265 (49%), Gaps = 19/265 (7%)

Query: 239 GDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKA 298
           G  + T G L T+T +FG + +V G+  GC   + G F G++G++G+G G LSL  Q++ 
Sbjct: 124 GSAANTSGYLATDTFTFGAT-AVPGVVFGCSDASYGDFAGASGVIGIGRGNLSLISQLQF 182

Query: 299 TSLAYCLV----DRDSPASGVLEFNSARGGDAV-------TAPLIRNKKVDTFYYVGLTG 347
              +Y L+      D  A  V+ F    G DAV       + PL+ +     FYYV LTG
Sbjct: 183 GKFSYQLLAPEATDDGSADSVIRF----GDDAVPKTKRGRSTPLLSSTLYPDFYYVNLTG 238

Query: 348 FSVGGQAVQ-IPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTS 406
             V G  +  IP   F++   G GG+I+   T +T L+  AY+ +R +     G      
Sbjct: 239 VRVDGNRLDAIPAGTFDLRANGTGGVILSSTTPVTYLEQAAYDVVRAAVASRIGLPAVNG 298

Query: 407 GVAL-FDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSA 465
             AL  D CY+ S +  V+VP ++L F  G  +DL A NY    +  G  C    P+   
Sbjct: 299 SAALELDLCYNASSMAKVKVPKLTLVFDGGADMDLSAANYFYIDNDTGLECLTMLPSQGG 358

Query: 466 LSIIGNVQQQGTRVSFDLANNRVGF 490
            S++G + Q GT + +D+   R+ F
Sbjct: 359 -SVLGTLLQTGTNMIYDVDAGRLTF 382


>gi|90399145|emb|CAJ86169.1| H0913C04.10 [Oryza sativa Indica Group]
 gi|125550292|gb|EAY96114.1| hypothetical protein OsI_17992 [Oryza sativa Indica Group]
          Length = 491

 Score =  120 bits (300), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 119/415 (28%), Positives = 178/415 (42%), Gaps = 90/415 (21%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQ--------QSDP--IFDPKTS 206
           G Y   + +GTPP+   ++LDTGS ++W+   PCT  YQ         + P  +F PK S
Sbjct: 87  GGYAFTVSLGTPPQPLPVLLDTGSHLSWV---PCTSSYQCRNCSSLSAASPLHVFHPKNS 143

Query: 207 SSYSPLPCAAPQCKSL----DVSACRA-----------------NRC-LYQVAYGDGSFT 244
           SS   + C  P C  +     +S CRA                 N C  Y V YG GS T
Sbjct: 144 SSSRLIGCRNPSCLWIHSPDHLSDCRAASSCPGANCTPRNANANNVCPPYLVVYGSGS-T 202

Query: 245 VGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYC 304
            G L+++T+      +V+   +GC      +    +GL G G G  S+  Q+  T  +YC
Sbjct: 203 AGLLISDTLRTPGR-AVRNFVIGC--SLASVHQPPSGLAGFGRGAPSVPSQLGLTKFSYC 259

Query: 305 LVDRDSPASGVLEFNSARGGDAVT--------------APLIRNKKV----DTFYYVGLT 346
           L+ R        + N+A  G+ +               APL R+         +YY+ LT
Sbjct: 260 LLSRR------FDDNAAVSGELILGGAGGKDGGVGMQYAPLARSASARPPYSVYYYLALT 313

Query: 347 GFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTS 406
             +VGG++VQ+P   F +     GG IVD GT  +      +  +  + V   G     S
Sbjct: 314 AITVGGKSVQLPERAF-VAGGAGGGAIVDSGTTFSYFDRTVFEPVAAAVVAAVGGRYSRS 372

Query: 407 GVAL----FDTCYDF-SGLRSVRVPTVSLHFGAGKALDLPAKNYLI---PVDSAG----- 453
            V         C+    G +++ +P +SLHF  G  ++LP +NY +   P  S G     
Sbjct: 373 KVVEEGLGLSPCFAMPPGTKTMELPEMSLHFKGGSVMNLPVENYFVVAGPAPSGGAPAMA 432

Query: 454 -TFCFAF---APTSSALS---------IIGNVQQQGTRVSFDLANNRVGFTPNKC 495
              C A     PTSS  +         I+G+ QQQ   + +DL   R+GF   +C
Sbjct: 433 EAICLAVVSDVPTSSGGAGVSSGGPAIILGSFQQQNYYIEYDLEKERLGFRRQQC 487


>gi|18390579|ref|NP_563751.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|332189782|gb|AEE27903.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 485

 Score =  120 bits (300), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 102/375 (27%), Positives = 165/375 (44%), Gaps = 50/375 (13%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYSP 211
           G Y+++IG+GTP + + + +DTGSDI W+ C  C +C ++S       +++   S S   
Sbjct: 78  GLYYAKIGIGTPAKSYYVQVDTGSDIMWVNCIQCKQCPRRSTLGIELTLYNIDESDSGKL 137

Query: 212 LPCAAPQCKSLD---VSACRANR-CLYQVAYGDGSFTVGDLVTETVSFGN-SGSVK---- 262
           + C    C  +    +S C+AN  C Y   YGDGS T G  V + V + + +G +K    
Sbjct: 138 VSCDDDFCYQISGGPLSGCKANMSCPYLEIYGDGSSTAGYFVKDVVQYDSVAGDLKTQTA 197

Query: 263 --GIALGCGHDNEGLFVGSA-----GLLGLGGGMLSLTKQIKATS-----LAYCLVDRDS 310
              +  GCG    G    S      G+LG G    S+  Q+ ++       A+CL  R+ 
Sbjct: 198 NGSVIFGCGARQSGDLDSSNEEALDGILGFGKANSSMISQLASSGRVKKIFAHCLDGRN- 256

Query: 311 PASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGD- 369
              G+              PL+ N+     Y V +T   VG + + IP  LF   + GD 
Sbjct: 257 -GGGIFAIGRVVQPKVNMTPLVPNQP---HYNVNMTAVQVGQEFLTIPADLF---QPGDR 309

Query: 370 GGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFD---TCYDFSGLRSVRVP 426
            G I+D GT +  L    Y  L    V+   + +P   V + D    C+ +SG      P
Sbjct: 310 KGAIIDSGTTLAYLPEIIYEPL----VKKITSQEPALKVHIVDKDYKCFQYSGRVDEGFP 365

Query: 427 TVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSA------LSIIGNVQQQGTRVS 480
            V+ HF     L +   +YL P    G +C  +  ++        ++++G++      V 
Sbjct: 366 NVTFHFENSVFLRVYPHDYLFP--HEGMWCIGWQNSAMQSRDRRNMTLLGDLVLSNKLVL 423

Query: 481 FDLANNRVGFTPNKC 495
           +DL N  +G+T   C
Sbjct: 424 YDLENQLIGWTEYNC 438


>gi|218199944|gb|EEC82371.1| hypothetical protein OsI_26705 [Oryza sativa Indica Group]
          Length = 642

 Score =  119 bits (299), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 99/372 (26%), Positives = 167/372 (44%), Gaps = 48/372 (12%)

Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTEC----------YQQSDPIFDPKT 205
           +G Y +R+ +GTP ++F++++D+GS + ++ C  C +C           +  DP F P  
Sbjct: 89  NGYYTTRLYIGTPSQEFALIVDSGSTVTYVPCATCEQCGNHQSESPNIIEAHDPRFQPDL 148

Query: 206 SSSYSPLPCAAPQCKSLDVSACRANR--CLYQVAYGDGSFTVGDLVTETVSFGNSGSVK- 262
           SS+YSP+ C      ++D + C   R  C Y+  Y + S + G L  + +SFG    +K 
Sbjct: 149 SSTYSPVKC------NVDCT-CDNERSQCTYERQYAEMSSSSGVLGEDIMSFGKESELKP 201

Query: 263 -GIALGCGHDNEG-LFVGSA-GLLGLGGGMLSLTKQ-----IKATSLAYCLVDRDSPASG 314
                GC +   G LF   A G++GLG G LS+  Q     + + S + C    D    G
Sbjct: 202 QRAVFGCENTETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDV-GGG 260

Query: 315 VLEFNSARGGDAVTAPLI---RNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGG 371
            +      GG      ++    N     +Y + L    V G+A+++ P +F        G
Sbjct: 261 TMVL----GGMPAPPDMVFSHSNPVRSPYYNIELKEIHVAGKALRLDPKIFNSKH----G 312

Query: 372 IIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSG--VALFDTCYDFSGLRSVRV---- 425
            ++D GT    L  QA+ + +D+      +LK   G      D C+  +G    ++    
Sbjct: 313 TVLDSGTTYAYLPEQAFVAFKDAVTNKVNSLKKIRGPDPNYKDICFAGAGRNVSQLSEVF 372

Query: 426 PTVSLHFGAGKALDLPAKNYLIPVDSA-GTFCFA-FAPTSSALSIIGNVQQQGTRVSFDL 483
           P V + FG G+ L L  +NYL       G +C   F       +++G +  + T V++D 
Sbjct: 373 PDVDMVFGNGQKLSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDR 432

Query: 484 ANNRVGFTPNKC 495
            N ++GF    C
Sbjct: 433 HNEKIGFWKTNC 444


>gi|222637379|gb|EEE67511.1| hypothetical protein OsJ_24961 [Oryza sativa Japonica Group]
          Length = 641

 Score =  119 bits (299), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 99/372 (26%), Positives = 167/372 (44%), Gaps = 48/372 (12%)

Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTEC----------YQQSDPIFDPKT 205
           +G Y +R+ +GTP ++F++++D+GS + ++ C  C +C           +  DP F P  
Sbjct: 88  NGYYTTRLYIGTPSQEFALIVDSGSTVTYVPCATCEQCGNHQSESPNIIEAHDPRFQPDL 147

Query: 206 SSSYSPLPCAAPQCKSLDVSACRANR--CLYQVAYGDGSFTVGDLVTETVSFGNSGSVK- 262
           SS+YSP+ C      ++D + C   R  C Y+  Y + S + G L  + +SFG    +K 
Sbjct: 148 SSTYSPVKC------NVDCT-CDNERSQCTYERQYAEMSSSSGVLGEDIMSFGKESELKP 200

Query: 263 -GIALGCGHDNEG-LFVGSA-GLLGLGGGMLSLTKQ-----IKATSLAYCLVDRDSPASG 314
                GC +   G LF   A G++GLG G LS+  Q     + + S + C    D    G
Sbjct: 201 QRAVFGCENTETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDV-GGG 259

Query: 315 VLEFNSARGGDAVTAPLI---RNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGG 371
            +      GG      ++    N     +Y + L    V G+A+++ P +F        G
Sbjct: 260 TMVL----GGMPAPPDMVFSHSNPVRSPYYNIELKEIHVAGKALRLDPKIFNSKH----G 311

Query: 372 IIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSG--VALFDTCYDFSGLRSVRV---- 425
            ++D GT    L  QA+ + +D+      +LK   G      D C+  +G    ++    
Sbjct: 312 TVLDSGTTYAYLPEQAFVAFKDAVTNKVNSLKKIRGPDPNYKDICFAGAGRNVSQLSEVF 371

Query: 426 PTVSLHFGAGKALDLPAKNYLIPVDSA-GTFCFA-FAPTSSALSIIGNVQQQGTRVSFDL 483
           P V + FG G+ L L  +NYL       G +C   F       +++G +  + T V++D 
Sbjct: 372 PDVDMVFGNGQKLSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDR 431

Query: 484 ANNRVGFTPNKC 495
            N ++GF    C
Sbjct: 432 HNEKIGFWKTNC 443


>gi|225440731|ref|XP_002280866.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 469

 Score =  119 bits (299), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 111/387 (28%), Positives = 160/387 (41%), Gaps = 54/387 (13%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRP---CTEC-YQQSDP----IFDPKTSSS 208
           G Y   +  GTP +  S V+DTGS + W  C     CT C +   DP     F PK SSS
Sbjct: 88  GGYSVSLSFGTPSQTLSFVMDTGSSLVWFPCTSRYVCTRCSFPNIDPAKIPTFIPKLSSS 147

Query: 209 YSPLPCAAPQCKSLDVSACRANRC---------------LYQVAYGDGSFTVGDLVTETV 253
              + C  P+C  +  S  R  RC                Y + YG G+ TVG L+ E++
Sbjct: 148 AKIVGCLNPKCGFVMDSEVRT-RCPGCDQNSANCTKACPTYAIQYGLGT-TVGLLLLESL 205

Query: 254 SFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDR---DS 310
            F    +     +GC   +       +G+ G G G  SL KQ+     +YCL+     DS
Sbjct: 206 VFAER-TEPDFVVGCSILSSR---QPSGIAGFGRGPSSLPKQMGLKKFSYCLLSHRFDDS 261

Query: 311 PASGVLEF-------NSARGGDAVTA----PLIRNKKVDTFYYVGLTGFSVGGQAVQIPP 359
           P S  +         +   GG + T     P+  N     +YYV L    VG + V++P 
Sbjct: 262 PKSSKMTLYVGPDSKDDKTGGLSYTPFRKNPVSSNSAFKEYYYVTLRHIIVGDKRVKVPY 321

Query: 360 SLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVAL---FDTCYD 416
           S       G+GG IVD G+  T ++   + ++   F R   N    + V        C++
Sbjct: 322 SFMVAGSDGNGGTIVDSGSTFTFMEKPVFEAVATEFDRQMANYTRAADVEALSGLKPCFN 381

Query: 417 FSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALS--------I 468
            SG+ SV +P++   F  G  ++LP  NY   V      C       +  S        I
Sbjct: 382 LSGVGSVALPSLVFQFKGGAKMELPVANYFSLVGDLSVLCLTIVSNEAVGSTLSSGPSII 441

Query: 469 IGNVQQQGTRVSFDLANNRVGFTPNKC 495
           +GN Q Q     +DL N R GF   +C
Sbjct: 442 LGNYQSQNFYTEYDLENERFGFRRQRC 468


>gi|224058947|ref|XP_002299658.1| predicted protein [Populus trichocarpa]
 gi|222846916|gb|EEE84463.1| predicted protein [Populus trichocarpa]
          Length = 451

 Score =  119 bits (299), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 105/370 (28%), Positives = 169/370 (45%), Gaps = 43/370 (11%)

Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD---PI--FDPKTSSSYSPLP 213
           Y++R+ +G+PPR F + +DTGSD+ W+ C  C  C   S    P+  FDP +S + S + 
Sbjct: 90  YYTRLQLGSPPRDFYVQIDTGSDVLWVSCSSCNGCPVSSGLHIPLNFFDPGSSPTASLIS 149

Query: 214 CAAPQC-----KSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGN--SGSVKG--- 263
           C+  +C      S  V A + N+C Y   YGDGS T G  V++ + F     GSV     
Sbjct: 150 CSDQRCSLGLQSSDSVCAAQNNQCGYTFQYGDGSGTSGYYVSDLLHFDTILGGSVMKNSS 209

Query: 264 --IALGCGHDNEGLFV----GSAGLLGLGGGMLSLTKQIKATSL-----AYCLVDRDSPA 312
             I  GC     G          G+ G G   +S+  Q+ +  +     ++CL   DS  
Sbjct: 210 APIVFGCSTLQTGDLTKPDRAVDGIFGFGQQDMSVISQLASQGITPRVFSHCLKGDDS-G 268

Query: 313 SGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGI 372
            G+L        + V  PL+ ++     Y + L    V GQ + I PS+F    + + G 
Sbjct: 269 GGILVLGEIVEPNIVYTPLVPSQP---HYNLNLQSIYVNGQTLAIDPSVFA--TSSNQGT 323

Query: 373 IVDCGTAITRLQTQAYNSLRDSFVR-LAGNLKPTSGVALF--DTCYDFSGLRSVRVPTVS 429
           I+D GT +  L   AY    D F+  +   + P+    L   + CY  S   +   P VS
Sbjct: 324 IIDSGTTLAYLTEAAY----DPFISAITSTVSPSVSPYLSKGNQCYLTSSSINDVFPQVS 379

Query: 430 LHFGAGKALDLPAKNYLI---PVDSAGTFCFAFAPTS-SALSIIGNVQQQGTRVSFDLAN 485
           L+F  G ++ L  ++YLI    ++ A  +C  F       ++I+G++  +     +D+A 
Sbjct: 380 LNFAGGTSMILIPQDYLIQQSSINGAALWCVGFQKIQGQEITILGDLVLKDKIFVYDIAG 439

Query: 486 NRVGFTPNKC 495
            R+G+    C
Sbjct: 440 QRIGWANYDC 449


>gi|224029721|gb|ACN33936.1| unknown [Zea mays]
 gi|413946782|gb|AFW79431.1| pepsin A [Zea mays]
          Length = 534

 Score =  119 bits (299), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 100/384 (26%), Positives = 164/384 (42%), Gaps = 45/384 (11%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTE--------------------CYQQ 196
           G Y   + +GTP   +++VLDT +D+ W+ CR                          + 
Sbjct: 123 GMYLVSVRIGTPALPYNLVLDTATDLTWINCRLRRRKGKHYGRQSTGQTMSMGGEGAKEA 182

Query: 197 SDPIFDPKTSSSYSPLPCAAPQCKSLDVSAC----RANRCLYQVAYGDGSFTVG----DL 248
           S   + P  SSS+  + C+  +C  L  + C    +A  C Y     DG+ T+G    + 
Sbjct: 183 SKNWYRPAKSSSWRRIRCSQKECAVLPYNTCQSPSKAESCSYFQKTQDGTVTIGIYGKEK 242

Query: 249 VTETVSFGNSGSVKGIALGCGHDNEGLFVGSA-GLLGLGGGMLSLTKQIK---ATSLAYC 304
            T TVS G    + G+ LGC     G  V +  G+L LG G +S             ++C
Sbjct: 243 ATVTVSDGRMAKLPGLILGCSVLEAGGSVDAHDGVLSLGNGDMSFAVHAAKRFGQRFSFC 302

Query: 305 LVDRDSP--ASGVLEFN---SARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPP 359
           L+  +S   AS  L F    +  G   +   ++ N  V   Y   +TG  VGG+ + IP 
Sbjct: 303 LLSANSSRDASSYLTFGPNPAVMGPGTMETDILYNVDVKPAYGAQVTGVLVGGERLDIPD 362

Query: 360 SLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYD--F 417
            +++ +    GG+I+D  T++T L  +AY  +  +  R   +L     +  F+ CY   F
Sbjct: 363 EVWDAERFVGGGVILDTSTSVTSLVPEAYAPVTAALDRHLSHLPRVYELEGFEYCYKWTF 422

Query: 418 SG-----LRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAP-TSSALSIIGN 471
           +G       +V +P+ ++    G  L+  AK+ ++P    G  C AF         I+GN
Sbjct: 423 TGDGVDPAHNVTIPSFTVEMAGGARLEPEAKSVVMPEVEPGVACLAFRKLLRGGPGILGN 482

Query: 472 VQQQGTRVSFDLANNRVGFTPNKC 495
           V  Q      D  + ++ F  +KC
Sbjct: 483 VFMQEYIWEIDHGDGKIRFRKDKC 506


>gi|255634819|gb|ACU17770.1| unknown [Glycine max]
          Length = 354

 Score =  119 bits (299), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 98/335 (29%), Positives = 155/335 (46%), Gaps = 42/335 (12%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYSP 211
           G Y++++ +GTPP +F++ +DTGSD+ W+ C  C+ C Q S        FDP +SS+ S 
Sbjct: 23  GLYYTKVQLGTPPVEFNVQIDTGSDVLWVSCNSCSGCPQTSGLQIQLNFFDPGSSSTSSM 82

Query: 212 LPCAAPQC----KSLDVS-ACRANRCLYQVAYGDGSFTVGDLVTETVSFG-------NSG 259
           + C+  +C    +S D + + + N+C Y   YGDGS T G  V++ +           + 
Sbjct: 83  IACSDQRCNNGIQSSDATCSSQNNQCSYTFQYGDGSGTSGYYVSDMMHLNTIFEGSVTTN 142

Query: 260 SVKGIALGCGHDNEGLFVGS----AGLLGLGGGMLSLTKQIKATSLA-----YCLVDRDS 310
           S   +  GC +   G    S     G+ G G   +S+  Q+ +  +A     +CL   DS
Sbjct: 143 STAPVVFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRVFSHCL-KGDS 201

Query: 311 PASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDG 370
              G+L        + V   L+  +     Y + L   +V GQ +QI  S+F    +   
Sbjct: 202 SGGGILVLGEIVEPNIVYTSLVPAQP---HYNLNLQSIAVNGQTLQIDSSVFATSNS--R 256

Query: 371 GIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTS---GVALFDTCYDFSGLRSVRVPT 427
           G IVD GT +  L  +AY    D FV       P S    V+  + CY  +   +   P 
Sbjct: 257 GTIVDSGTTLAYLAEEAY----DPFVSAITASIPQSVHTAVSRGNQCYLITSSVTEVFPQ 312

Query: 428 VSLHFGAGKALDLPAKNYLIPVDSAG---TFCFAF 459
           VSL+F  G ++ L  ++YLI  +S G    +C  F
Sbjct: 313 VSLNFAGGASMILRPQDYLIQQNSIGGAAVWCIGF 347


>gi|224140237|ref|XP_002323490.1| predicted protein [Populus trichocarpa]
 gi|222868120|gb|EEF05251.1| predicted protein [Populus trichocarpa]
          Length = 478

 Score =  119 bits (298), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 115/403 (28%), Positives = 182/403 (45%), Gaps = 47/403 (11%)

Query: 129 RHELKPAEAQILPEDFSTPVVSGASQGS------GEYFSRIGVGTPPRQFSMVLDTGSDI 182
           R  L+ A    L + F   VV  + QGS      G YF+++ +G+PPR+F++ +DTGSD+
Sbjct: 33  RDRLRHAR---LLQGFVGGVVDFSVQGSPDPYLVGLYFTKVKLGSPPREFNVQIDTGSDV 89

Query: 183 NWLQCRPCTECYQQSD-----PIFDPKTSSSYSPLPCAAPQCKS---LDVSAC--RANRC 232
            W+ C  C  C + S        FD  +SS+   + C+ P C S     V+ C  + N+C
Sbjct: 90  LWVCCNSCNNCPRTSGLGIQLNFFDSSSSSTAGLVHCSDPICTSAVQTTVTQCSPQTNQC 149

Query: 233 LYQVAYGDGSFTVGDLVTETVSF----GNSGSVKGIAL---GCGHDNEGLFVGS----AG 281
            Y   Y DGS T G  V++T+ F    G S  V   AL   GC     G    +     G
Sbjct: 150 SYTFQYEDGSGTSGYYVSDTLYFDAILGESLVVNSSALIVFGCSTFQSGDLTMTDKAVDG 209

Query: 282 LLGLGGGMLSLTKQIKATSL-----AYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNKK 336
           + G G G LS+  Q+    +     ++CL         ++       G  V +PL+ ++ 
Sbjct: 210 IFGFGQGELSVISQLSTHGITPRVFSHCLKGEGIGGGILVLGEILEPG-MVYSPLVPSQP 268

Query: 337 VDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFV 396
               Y + L   +V G+ + I PS+F    +   G IVD GT +  L  +AY+    S V
Sbjct: 269 ---HYNLNLQSIAVNGKLLPIDPSVFATSNS--QGTIVDSGTTLAYLVAEAYDPFV-SAV 322

Query: 397 RLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVD-SAG-- 453
            +  +   T  ++  + CY  S   S   P  S +F  G ++ L  ++YLIP   S G  
Sbjct: 323 NVIVSPSVTPIISKGNQCYLVSTSVSQMFPLASFNFAGGASMVLKPEDYLIPFGPSQGGS 382

Query: 454 -TFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
             +C  F      ++I+G++  +     +DL   R+G+    C
Sbjct: 383 VMWCIGFQKV-QGVTILGDLVLKDKIFVYDLVRQRIGWANYDC 424


>gi|226492334|ref|NP_001147965.1| pepsin A precursor [Zea mays]
 gi|195614874|gb|ACG29267.1| pepsin A [Zea mays]
          Length = 538

 Score =  119 bits (298), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 115/456 (25%), Positives = 190/456 (41%), Gaps = 58/456 (12%)

Query: 94  RHNDYRSLVLSRLERDSARVNTLITKLQLAIYNVDRHELKPAEAQILPE------DFSTP 147
           R N +R++    L R    +  +         +  R + K  E+  LPE       F  P
Sbjct: 57  RRNYFRAMEAKDLFRHQQMIKMMGNGSGTGSASSRRRQAK--ESSKLPEVMSATSMFELP 114

Query: 148 VVSGASQGS-GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCR------------------ 188
           + S  +    G Y   +  GTP   +++VLDT +D+ W+ CR                  
Sbjct: 115 MRSALNIAHVGMYLVSVRFGTPALPYNLVLDTANDLTWINCRLRRRKGKHYGRTMSVGAG 174

Query: 189 ----PCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSAC----RANRCLYQVAYGD 240
                  E  +++   + P  SSS+  + C+  +C  L  + C    +A  C Y     D
Sbjct: 175 DDGAAAKEARRKN--WYRPAKSSSWRRIRCSQKECALLPYNTCQSPSKAESCSYYQQMQD 232

Query: 241 GSFTVG----DLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSA-GLLGLGGGMLSLTKQ 295
           G+ T+G    +  T TVS G    + G+ LGC     G  V +  G+L LG G +S    
Sbjct: 233 GTLTMGIYGKEKATVTVSDGRMAKLPGLILGCSVLEAGGSVDAHDGVLSLGNGEMSFAVH 292

Query: 296 IK---ATSLAYCLVDRDSP--ASGVLEFN---SARGGDAVTAPLIRNKKVDTFYYVGLTG 347
                    ++CL+  +S   AS  L F    +  G   +   ++ N  V   Y   +TG
Sbjct: 293 AAKRFGQRFSFCLLSANSSRDASSYLTFGPNPAVMGPGTMETDIVYNVDVKPAYGPLVTG 352

Query: 348 FSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSG 407
             VGG+ + IP  +++ ++   GG+I+D  T++T L  +AY ++  +  R   +L     
Sbjct: 353 IFVGGERLDIPQEIWDAEKVVGGGVILDTSTSVTSLVPEAYAAVTSALDRHLSHLPRVYE 412

Query: 408 VALFDTCYD--FSG-----LRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFA 460
           +  F+ CY   F+G       +V VP +++    G  L+  AK+ ++P    G  C AF 
Sbjct: 413 LDGFEYCYRWTFAGDGVDLTHNVTVPRLTVEMAGGARLEPEAKSVVMPEVVPGVACLAFR 472

Query: 461 PT-SSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
                   I+GNV  Q      D    ++ F  +KC
Sbjct: 473 KLPRGGPGILGNVLMQEYIWEIDHGKGKMRFRKDKC 508


>gi|222629809|gb|EEE61941.1| hypothetical protein OsJ_16693 [Oryza sativa Japonica Group]
          Length = 648

 Score =  119 bits (298), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 118/412 (28%), Positives = 178/412 (43%), Gaps = 84/412 (20%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWL------QCRPCTECYQQSD-PIFDPKTSSSY 209
           G Y   + +GTPP+   ++LDTGS ++W+      QCR C+     S   +F PK SSS 
Sbjct: 87  GGYAFTVSLGTPPQPLPVLLDTGSHLSWVPCTSSYQCRNCSSLSAASPLHVFHPKNSSSS 146

Query: 210 SPLPCAAPQCKSL----DVSACRA-----------------NRC-LYQVAYGDGSFTVGD 247
             + C  P C  +     +S CRA                 N C  Y V YG GS T G 
Sbjct: 147 RLIGCRNPSCLWIHSPDHLSDCRAASSCPGANCTPRNANANNVCPPYLVVYGSGS-TAGL 205

Query: 248 LVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVD 307
           L+++T+      +V+   +GC   +  +    +GL G G G  S+  Q+  T  +YCL+ 
Sbjct: 206 LISDTLRTPGR-AVRNFVIGCSLAS--VHQPPSGLAGFGRGAPSVPSQLGLTKFSYCLLS 262

Query: 308 RDSPASGVLEFNSARGGDAVT--------------APLIRNKKV----DTFYYVGLTGFS 349
           R        + N+A  G+ +               APL R+         +YY+ LT  +
Sbjct: 263 RR------FDDNAAVSGELILGGAGGKDGGVGMQYAPLARSASARPPYSVYYYLALTAIT 316

Query: 350 VGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVA 409
           VGG++VQ+P   F +     GG IVD GT  +      +  +  + V   G     S V 
Sbjct: 317 VGGKSVQLPERAF-VAGGAGGGAIVDSGTTFSYFDRTVFEPVAAAVVAAVGGRYSRSKVV 375

Query: 410 L----FDTCYDF-SGLRSVRVPTVSLHFGAGKALDLPAKNYLI---PVDSAG------TF 455
                   C+    G +++ +P +SLHF  G  ++LP +NY +   P  S G        
Sbjct: 376 EEGLGLSPCFAMPPGTKTMELPEMSLHFKGGSVMNLPVENYFVVAGPAPSGGAPAMAEAI 435

Query: 456 CFAF---APTSSALS---------IIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           C A     PTSS  +         I+G+ QQQ   + +DL   R+GF   +C
Sbjct: 436 CLAVVSDVPTSSGGAGVSSGGPAIILGSFQQQNYYIEYDLEKERLGFRRQQC 487


>gi|357148754|ref|XP_003574882.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
           distachyon]
          Length = 488

 Score =  119 bits (298), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 103/378 (27%), Positives = 166/378 (43%), Gaps = 55/378 (14%)

Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYS 210
           +G Y++ I +GTPP+Q+ + +DTGSDI W+ C  C +C ++SD      ++DPK SSS S
Sbjct: 80  TGLYYTEIEIGTPPKQYHVQVDTGSDILWVNCISCNKCPRKSDLGIDLRLYDPKGSSSGS 139

Query: 211 PLPCAAPQCKSL---DVSACRAN-RCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKG--- 263
            + C    C +     +  C  N  C Y V YGDGS T G  V++++ + N  S  G   
Sbjct: 140 TVSCDQKFCAATYGGKLPGCAKNIPCEYSVMYGDGSSTTGYFVSDSLQY-NQVSGDGQTR 198

Query: 264 -----IALGCGHDNEGLFVGSA-----GLLGLGGGMLSLTKQIKATS-----LAYCLVDR 308
                +  GCG   +G  +GS      G++G G    S+  Q+ A        ++CL   
Sbjct: 199 HANASVIFGCGA-QQGGDLGSTNQALDGIIGFGQSNTSMLSQLAAAGEVKKIFSHCL--D 255

Query: 309 DSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAG 368
                G+            + PL+ +      Y V L   +VGG  +Q+P  +FE  E  
Sbjct: 256 TIKGGGIFAIGDVVQPKVKSTPLVPDMP---HYNVNLESINVGGTTLQLPSHMFETGEK- 311

Query: 369 DGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVR---- 424
             G I+D GT +T L    Y   +D    +      T+    F +  DF  ++  +    
Sbjct: 312 -KGTIIDSGTTLTYLPELVY---KDVLAAVFAKHPDTT----FHSVQDFLCIQYFQSVDD 363

Query: 425 -VPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAF------APTSSALSIIGNVQQQGT 477
             P ++ HF     L++   +Y    +    +CF F      +     + ++G++     
Sbjct: 364 GFPKITFHFEDDLGLNVYPHDYFFQ-NGDNLYCFGFQNGGLQSKDGKDMVLLGDLVLSNK 422

Query: 478 RVSFDLANNRVGFTPNKC 495
            V +DL N  VG+T   C
Sbjct: 423 VVVYDLENQVVGWTDYNC 440


>gi|51536458|gb|AAU05467.1| At5g22850 [Arabidopsis thaliana]
 gi|55733777|gb|AAV59285.1| At5g22850 [Arabidopsis thaliana]
          Length = 426

 Score =  119 bits (297), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 101/330 (30%), Positives = 156/330 (47%), Gaps = 38/330 (11%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYSP 211
           G Y++++ +GTPPR F + +DTGSD+ W+ C  C  C Q S        FDP +S + SP
Sbjct: 79  GLYYTKLRLGTPPRDFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTASP 138

Query: 212 LPCAAPQC----KSLDVSAC--RANRCLYQVAYGDGSFTVGDLVTETVSF----GNS--- 258
           + C+  +C    +S D S C  + N C Y   YGDGS T G  V++ + F    G+S   
Sbjct: 139 ISCSDQRCSWGIQSSD-SGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLVP 197

Query: 259 GSVKGIALGCGHDNEGLFVGS----AGLLGLGGGMLSLTKQIKATSLA-----YCLVDRD 309
            S   +  GC     G  V S     G+ G G   +S+  Q+ +  +A     +CL   +
Sbjct: 198 NSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCLKGEN 257

Query: 310 SPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGD 369
               G+L        + V  PL+ ++     Y V L   SV GQA+ I PS+F       
Sbjct: 258 G-GGGILVLGEIVEPNMVFTPLVPSQP---HYNVNLLSISVNGQALPINPSVFSTSNG-- 311

Query: 370 GGIIVDCGTAITRLQTQAYNSLRDSFVR-LAGNLKPTSGVALFDTCYDFSGLRSVRVPTV 428
            G I+D GT +  L   AY    ++    ++ +++P   V+  + CY  +       P V
Sbjct: 312 QGTIIDTGTTLAYLSEAAYVPFVEAITNAVSQSVRPV--VSKGNQCYVITTSVGDIFPPV 369

Query: 429 SLHFGAGKALDLPAKNYLIPVDS-AGTFCF 457
           SL+F  G ++ L  ++YLI  ++ A   CF
Sbjct: 370 SLNFAGGASMFLNPQDYLIQQNNVASALCF 399


>gi|125606590|gb|EAZ45626.1| hypothetical protein OsJ_30294 [Oryza sativa Japonica Group]
          Length = 431

 Score =  119 bits (297), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 102/353 (28%), Positives = 161/353 (45%), Gaps = 41/353 (11%)

Query: 163 IGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSL 222
           +G+GTP    ++V DT SD+ W QC+PC  C  Q+  ++DP  + +Y+ L  ++      
Sbjct: 92  LGIGTPAMNVTLVFDTTSDLLWTQCQPCLSCVAQAGDMYDPNKTETYANLTSSS------ 145

Query: 223 DVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSA-- 280
                      Y   Y   SFT G   TET + GN  +V  I  GCG  N+G +   A  
Sbjct: 146 -----------YNYTYSKQSFTSGYFATETFALGNV-TVANITFGCGTRNQGYYDNVAGV 193

Query: 281 -GLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSA-------RGGDAVTAPLI 332
            G+   G G +SL  Q+     +YC     +P S  +    +           A + P++
Sbjct: 194 FGVGRGGRGGVSLLNQLGIDRFSYCFSSSGAPGSSAVFLGGSPELATNATTTPAASTPMV 253

Query: 333 RNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLR 392
            +  + + Y+V L G +VG   V +  +     E G   +++D  + +T L    Y  +R
Sbjct: 254 ADPVLKSGYFVKLVGVTVGATLVDVAGA--SSAEGGGRALVIDSTSPVTVLDEATYGPVR 311

Query: 393 DSFV-RLA----GNLKPTSGVALFDTCYDFSGLRSVRVP---TVSLHFGAGKA-LDLPAK 443
            + V +LA     N   ++GV L D C++ +   +   P   T++LHF  G A L LP  
Sbjct: 312 RALVAQLAPLKEANANASAGVGL-DLCFELAAGGATPTPPNVTMTLHFDGGAADLVLPPA 370

Query: 444 NYLIPVDSAGTFCFAFAPTSS-ALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           +YL    + G  C    P+SS  + ++G+     T V +DLA N V F P  C
Sbjct: 371 SYLAKDSAGGLICLTMTPSSSNGVPVLGSWALLDTLVLYDLAKNVVSFQPLDC 423


>gi|242089103|ref|XP_002440384.1| hypothetical protein SORBIDRAFT_09g030880 [Sorghum bicolor]
 gi|241945669|gb|EES18814.1| hypothetical protein SORBIDRAFT_09g030880 [Sorghum bicolor]
          Length = 555

 Score =  119 bits (297), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 106/393 (26%), Positives = 163/393 (41%), Gaps = 57/393 (14%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCR--------------------------PC 190
           G Y   +  GTP   +++VLDT +D+ W+ CR                            
Sbjct: 138 GMYLVSVRFGTPALPYNLVLDTANDLTWINCRLRRRKGKHYGRQSSKTMSVGGDDDVVAA 197

Query: 191 TECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRA----NRCLYQVAYGDGSFTVG 246
               +     + P  SSS+  + C+  QC  L  + C++      C Y     DG+ T+G
Sbjct: 198 LAKKEARKNWYRPAKSSSWRRIRCSEQQCAHLPYNTCQSPSKLESCSYYQKTQDGTVTIG 257

Query: 247 ----DLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSA-GLLGLGGGMLSLTKQIKAT-- 299
               +  T TVS G    + G+ LGC     G  V +  G+L LG G +S    I A   
Sbjct: 258 IYGNEKATVTVSDGRMAKLPGLVLGCSVLEAGASVDAHDGVLSLGNGHMSFA--IHAVLR 315

Query: 300 ---SLAYCLVDRDSP--ASGVLEFN---SARGGDAVTAPLIRNKKVDTFYYVGLTGFSVG 351
                ++CL+  +S   AS  L F    +  G   +   ++ N  V   Y   +T   VG
Sbjct: 316 FGGRFSFCLLSANSSRDASSYLTFGPNPAVMGPGTMETEILYNVDVKAAYGPRVTAVLVG 375

Query: 352 GQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALF 411
           G+ + IP  ++ +D+    G+I+D  T++T L  +AY  L  +  R   +L P    A F
Sbjct: 376 GERLDIPDDVWNIDKGLGSGVILDTSTSVTSLVPEAYEPLVAALDRHLAHL-PRESFAGF 434

Query: 412 DTCYD--FSG-----LRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFA--PT 462
           + CY   F+G       +V +P V++    G  L+  AK+ ++P    G  C AF   P 
Sbjct: 435 EYCYRWTFTGDGVDPAHNVTIPKVTVEMTGGARLEPEAKSVVMPEVGHGVACLAFRKLPW 494

Query: 463 SSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
                IIGNV  Q      D +     F  +KC
Sbjct: 495 GGGPCIIGNVLMQEYIWEIDHSKATFRFRKDKC 527


>gi|238007638|gb|ACR34854.1| unknown [Zea mays]
 gi|413948713|gb|AFW81362.1| pepsin A [Zea mays]
          Length = 538

 Score =  119 bits (297), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 101/386 (26%), Positives = 166/386 (43%), Gaps = 49/386 (12%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCR----------------------PCTECY 194
           G Y   +  GTP   +++VLDT +D+ W+ CR                         E  
Sbjct: 125 GMYLVSVRFGTPALPYNLVLDTANDLTWINCRLRRRKGKHYGRTMSVGAGDDGAAAKEAR 184

Query: 195 QQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSAC----RANRCLYQVAYGDGSFTVG---- 246
           +++   + P  SSS+  + C+  +C  L  + C    +A  C Y     DG+ T+G    
Sbjct: 185 RKN--WYRPAKSSSWRRIRCSQKECALLPYNTCQSPSKAESCSYYQQMQDGTLTMGIYGK 242

Query: 247 DLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSA-GLLGLGGGMLSLTKQIK---ATSLA 302
           +  T TVS G    + G+ LGC     G  V +  G+L LG G +S             +
Sbjct: 243 EKATVTVSDGRMAKLPGLILGCSVLEAGGSVDAHDGVLSLGNGEMSFAVHAAKRFGQRFS 302

Query: 303 YCLVDRDSP--ASGVLEFN---SARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQI 357
           +CL+  +S   AS  L F    +  G   +   ++ N  V   Y   +TG  VGG+ + I
Sbjct: 303 FCLLSANSSRDASSYLTFGPNPAVMGPGTMETDIVYNVDVKPAYGPLVTGIFVGGERLDI 362

Query: 358 PPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYD- 416
           P  +++ ++   GG+I+D  T++T L  +AY ++  +  R   +L     +  F+ CY  
Sbjct: 363 PQEIWDAEKVVGGGVILDTSTSVTSLVPEAYAAVTSALDRHLSHLPRVYELDGFEYCYRW 422

Query: 417 -FSG-----LRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPT-SSALSII 469
            F+G       +V VP +++    G  L+  AK+ ++P    G  C AF         I+
Sbjct: 423 TFAGDGVDLAHNVTVPRLTVEMAGGARLEPEAKSVVMPEVVPGVACLAFRKLPRGGPGIL 482

Query: 470 GNVQQQGTRVSFDLANNRVGFTPNKC 495
           GNV  Q      D    ++ F  +KC
Sbjct: 483 GNVLMQEYIWEIDHGKGKMRFRKDKC 508


>gi|226491620|ref|NP_001149154.1| pepsin A precursor [Zea mays]
 gi|195625132|gb|ACG34396.1| pepsin A [Zea mays]
          Length = 537

 Score =  119 bits (297), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 100/388 (25%), Positives = 164/388 (42%), Gaps = 49/388 (12%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCR------------------------PCTE 192
           G Y   + +GTP   +++VLDT +D+ W+ CR                            
Sbjct: 122 GMYLVSVRIGTPALPYNLVLDTATDLTWINCRLRRRKGKHYGRQSMGQTMSVGGEGATAA 181

Query: 193 CYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSAC----RANRCLYQVAYGDGSFTVG-- 246
             + S   + P  SSS+  + C+  +C  L  + C    +A  C Y     DG+ T+G  
Sbjct: 182 KKEASKNWYRPAKSSSWRRIRCSQKECAVLPYNTCQSPSKAESCSYFQKTQDGTVTIGIY 241

Query: 247 --DLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSA-GLLGLGGGMLSLTKQIK---ATS 300
             +  T TVS G    + G+ LGC     G  V +  G+L LG G +S            
Sbjct: 242 GKEKATVTVSDGRMAKLPGLILGCSVLEAGGSVDAHDGVLSLGNGDMSFAVHAAKRFGQR 301

Query: 301 LAYCLVDRDSP--ASGVLEFN---SARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAV 355
            ++CL+  +S   AS  L F    +  G   +   ++ N  V   Y   +TG  VGG+ +
Sbjct: 302 FSFCLLSANSSRDASSYLTFGPNPAVMGPGTMETDILYNVDVKPAYGAKVTGVLVGGERL 361

Query: 356 QIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCY 415
            IP  +++ +    GG+I+D  T++T L  +AY  +  +  R   +L     +  F+ CY
Sbjct: 362 DIPDEVWDAERFVGGGVILDTSTSVTSLVPEAYAPVTAALDRHLSHLPRVYELEGFEYCY 421

Query: 416 D--FSG-----LRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAP-TSSALS 467
              F+G       +V +P+ ++    G  L+  AK+ ++P    G  C AF         
Sbjct: 422 KWTFTGDGVXPAHNVTIPSFTVEMAGGARLEPEAKSVVMPEVEPGVACLAFRKLLRGGPG 481

Query: 468 IIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           I+GNV  Q      D  + ++ F  +KC
Sbjct: 482 ILGNVFMQEYIWEIDHGDGKIRFRKDKC 509


>gi|21717173|gb|AAM76366.1|AC074196_24 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433291|gb|AAP54829.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
          Length = 413

 Score =  119 bits (297), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 97/362 (26%), Positives = 154/362 (42%), Gaps = 39/362 (10%)

Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQ 218
           Y +   +GTPP+  S ++D   ++ W QC  C  C++Q  P+F P  SS++ P PC    
Sbjct: 62  YVANFTIGTPPQPASAIVDVAGELVWTQCSACRRCFKQDLPVFVPNASSTFKPEPCGTAV 121

Query: 219 CKSLDVSACRANRCLYQ----VAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNE- 273
           C+S+   +C  + C Y+       G+   T G   T+T + G + +V+ +A GC   ++ 
Sbjct: 122 CESIPTRSCSGDVCSYKGPPTQLRGN---TSGFAATDTFAIGTA-TVR-LAFGCVVASDI 176

Query: 274 GLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNS----ARGGDAVTA 329
               G +G +GLG    SL  Q+K T  +YCL  R++  S  L   S    A G    TA
Sbjct: 177 DTMDGPSGFIGLGRTPWSLVAQMKLTRFSYCLSPRNTGKSSRLFLGSSAKLAGGESTSTA 236

Query: 330 PLIRNKKVDT---FYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIV-DCGTAITRLQT 385
           P I+    D    +Y + L     G   +           A  GGI+V    +  + L  
Sbjct: 237 PFIKTSPDDDSHHYYLLSLDAIRAGNTTIAT---------AQSGGILVMHTVSPFSLLVD 287

Query: 386 QAYNSLRDSFVRLAGN---LKPTSGVALFDTCY-DFSGLRSVRVPTVSLHFGAGKALDLP 441
            AY + + +     G        +    FD C+   +G      P +   F    AL +P
Sbjct: 288 SAYRAFKKAVTEAVGGAAAPPMATPPQPFDLCFKKAAGFSRATAPDLVFTFQGAAALTVP 347

Query: 442 AKNYLIPV-DSAGTFCFAFAPTS-------SALSIIGNVQQQGTRVSFDLANNRVGFTPN 493
              YLI V +   T C A    +         +S++G++QQ+     +DL    + F P 
Sbjct: 348 PAKYLIDVGEEKDTACAAILSMAWLNRTGLEGVSVLGSLQQEDVHFLYDLKKETLSFEPA 407

Query: 494 KC 495
            C
Sbjct: 408 DC 409


>gi|296089645|emb|CBI39464.3| unnamed protein product [Vitis vinifera]
          Length = 477

 Score =  119 bits (297), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 98/376 (26%), Positives = 161/376 (42%), Gaps = 40/376 (10%)

Query: 150 SGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPK 204
           +G  +  G Y+++IG+GTP R + + +DTGSDI W+ C  C EC ++S       ++D K
Sbjct: 89  TGRPEAVGLYYAKIGIGTPARDYYVQVDTGSDIMWVNCIQCNECPKKSSLGMELTLYDIK 148

Query: 205 TSSSYSPLPCAAPQCKSLD---VSACRANR-CLYQVAYGDGSFTVGDLVTETVSFGN-SG 259
            S +   + C    C +++    S C AN  C Y   Y DGS + G  V + V +   SG
Sbjct: 149 ESLTGKLVSCDQDFCYAINGGPPSYCIANMSCSYTEIYADGSSSFGYFVRDIVQYDQVSG 208

Query: 260 SVK------GIALGCGHDNEGLFVGSA---GLLGLGGGMLSLTKQIKATS-----LAYCL 305
            ++       +  GC     G         G+LG G    S+  Q+ ++       A+CL
Sbjct: 209 DLETTSANGSVIFGCSATQSGDLSSEEALDGILGFGKSNTSMISQLASSGKVRKMFAHCL 268

Query: 306 VDRDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMD 365
              +    G+            T PL+ N+   T Y V +    VGG  + +P  +F++ 
Sbjct: 269 DGLN--GGGIFAIGHIVQPKVNTTPLVPNQ---THYNVNMKAVEVGGYFLNLPTDVFDVG 323

Query: 366 EAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRV 425
           +    G I+D GT +  L    Y+ L         +LK  +    F TC+ +S       
Sbjct: 324 DK--KGTIIDSGTTLAYLPEVVYDQLLSKIFSWQSDLKVHTIHDQF-TCFQYSESLDDGF 380

Query: 426 PTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSA------LSIIGNVQQQGTRV 479
           P V+ HF     L +    YL   D  G +C  +  +         ++++G++      V
Sbjct: 381 PAVTFHFENSLYLKVHPHEYLFSYD--GLWCIGWQNSGMQSRDRRNITLLGDLALSNKLV 438

Query: 480 SFDLANNRVGFTPNKC 495
            +DL N  +G+T   C
Sbjct: 439 LYDLENQVIGWTEYNC 454


>gi|255567949|ref|XP_002524952.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223535787|gb|EEF37449.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 394

 Score =  118 bits (296), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 91/311 (29%), Positives = 153/311 (49%), Gaps = 36/311 (11%)

Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCA 215
           +G Y +RI +GTPP+ F++++DTGS + ++ C  C +C +  DP F+P+ SS+Y P+ C 
Sbjct: 87  NGYYTTRIWIGTPPQTFALIVDTGSTVTYVPCSTCEQCGRHQDPKFEPELSSTYQPVSC- 145

Query: 216 APQCKSLDVSACRANR--CLYQVAYGDGSFTVGDLVTETVSFGNSGSV--KGIALGCGHD 271
                ++D + C   R  C+Y+  Y + S + G L  + +SFGN   +  +    GC + 
Sbjct: 146 -----NIDCT-CDNERKQCVYERQYAEMSSSSGVLGEDIISFGNQSELVPQRAIFGCENQ 199

Query: 272 NEG-LFVGSA-GLLGLGGGMLSLTKQI-------KATSLAYCLVDRDSPASGVLEFNSAR 322
             G L+   A G++GLG G LS+  Q+        + SL Y  +D    A  +   +   
Sbjct: 200 ETGDLYSQRADGIMGLGRGDLSIVDQLVEKGVISDSFSLCYGGMDIGGGAMILGGISPPS 259

Query: 323 GGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITR 382
           G     +  +R++    +Y + L    V G+ + + PS+F+    G  G ++D GT    
Sbjct: 260 GMVFAESDPVRSQ----YYNIDLKAIHVAGKQLHLDPSIFD----GKHGTVLDSGTTYAY 311

Query: 383 LQTQAYNSLRDSFVRLAGNLKPTSG--VALFDTCY-----DFSGLRSVRVPTVSLHFGAG 435
           L   A+ + +D+ ++   +LK   G      D C+     D S L S   P V + F  G
Sbjct: 312 LPEAAFTAFKDAMMKELTSLKQIHGPDPNYNDICFSGAESDVSQL-SNTFPAVEMVFSNG 370

Query: 436 KALDLPAKNYL 446
           + L L  +NYL
Sbjct: 371 QKLSLSPENYL 381


>gi|222629275|gb|EEE61407.1| hypothetical protein OsJ_15596 [Oryza sativa Japonica Group]
          Length = 466

 Score =  118 bits (296), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 123/394 (31%), Positives = 161/394 (40%), Gaps = 80/394 (20%)

Query: 158 EYFSRIGVGTP--PRQFSMVLDTGSDINWLQCRP--CTECYQQSDPIFDPKTSSSYSPLP 213
           +Y   + VG P      S+ LDTGSD+ W  C P  C  C  ++ P       +  SPLP
Sbjct: 87  DYTLSLSVGPPSTASSVSLFLDTGSDLVWFPCAPFTCMLCEGKATP-----GGNHSSPLP 141

Query: 214 ---------CAAPQCKSLDVSA-----CRANRC-----------------LYQVAYGDGS 242
                    CA+P C +   SA     C A RC                 LY  AYGDGS
Sbjct: 142 PPIDSRRISCASPLCSAAHSSAPTSDLCAAARCPLDAIETDSCASHACPPLY-YAYGDGS 200

Query: 243 FTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLA 302
             V +L    V    S +V+     C H      VG AG    G G LSL  Q+ A SL+
Sbjct: 201 L-VANLRRGRVGLAASMAVENFTFACAHTALAEPVGVAGF---GRGPLSLPAQL-APSLS 255

Query: 303 YCLVDRDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLF 362
                 D+ A G  E       D V  PL+ N K   FY V L   SVGG+ +Q  P L 
Sbjct: 256 G---STDAAAIGASET------DFVYTPLLHNPKHPYFYSVALEAVSVGGKRIQAQPELG 306

Query: 363 EMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAG--------NLKPTSGVALFDTC 414
           ++D  G+GG++VD GT  T L +  +  + D F R             +  +G+A    C
Sbjct: 307 DVDRDGNGGMVVDSGTTFTMLPSDTFARVADEFARAMAAARFTRAEGAEAQTGLA---PC 363

Query: 415 YDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSA---GTFCFAFAPT--------- 462
           Y +S      VP V+LHF     + LP +NY +   S       C               
Sbjct: 364 YHYSPSDRA-VPPVALHFRGNATVALPRRNYFMGFKSEEGRSVGCLMLMNVGGNNDDGED 422

Query: 463 -SSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
                  +GN QQQG  V +D+   RVGF   +C
Sbjct: 423 GGGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRC 456


>gi|242094480|ref|XP_002437730.1| hypothetical protein SORBIDRAFT_10g001450 [Sorghum bicolor]
 gi|241915953|gb|EER89097.1| hypothetical protein SORBIDRAFT_10g001450 [Sorghum bicolor]
          Length = 507

 Score =  118 bits (296), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 107/342 (31%), Positives = 152/342 (44%), Gaps = 47/342 (13%)

Query: 173 SMVLDTGSDINWLQCRPCTECYQQSDPI--FDPKTSSSYSPLPCAAPQCKSLDV---SAC 227
           ++VLDT SD+ W+QC P             +DP  SS+Y  L C +  C  L      AC
Sbjct: 125 TVVLDTASDVPWVQCHPLASSATTDSSSSSYDPARSSTYYALACNSAACTELGRLYRGAC 184

Query: 228 RANRCLYQVAYGDGSF------TVGDLVTETVSFGNSGSVKGIALGCGHDN-----EG-L 275
             N+C Y+V             T G  + +  +    G+      GC H       EG +
Sbjct: 185 VNNQCQYRVPIPSSPASSSSSGTYGSDLLKLTADPADGASMSFKFGCSHGEAKQGGEGSI 244

Query: 276 FVGSAGLLGLGGGMLSLTKQIKA---TSLAYCLVDRDS--PASGVLEFN----SARGGDA 326
              +AG++ LGGG  SL  Q  A   ++ +YC+   +S  P   VL       S  GG A
Sbjct: 245 DNATAGIMALGGGPESLVSQNAAMYGSAFSYCIPATESRRPGFFVLGGGVGDLSGAGGYA 304

Query: 327 VTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQ 386
           VT P++R  +V T Y V L   +V GQ + + PS+F        G ++D  TAITRL   
Sbjct: 305 VT-PMLRYARVPTLYRVRLLAIAVDGQQLNVTPSVFA------SGSVLDSRTAITRLPPT 357

Query: 387 AYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYL 446
           AY +LR++F       +        DTCYDF+G   V VP V+L         L   N +
Sbjct: 358 AYQALREAFRSRMAMYREAPPQGNLDTCYDFAGAFLVMVPRVAL---------LLDGNAV 408

Query: 447 IPVDSAGTF---CFAFAPTSS--ALSIIGNVQQQGTRVSFDL 483
           + +D  G     C  F   +      I+GNVQQQ   V +++
Sbjct: 409 VALDRQGILFHDCLVFTSNTDDRMPGILGNVQQQTMEVLYNV 450


>gi|357492303|ref|XP_003616440.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355517775|gb|AES99398.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 521

 Score =  118 bits (296), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 101/358 (28%), Positives = 162/358 (45%), Gaps = 55/358 (15%)

Query: 163 IGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKS- 221
           + VG+PP++ +MVLDTGS+++WL C+         + IF+P  SSSY+P PC +P C + 
Sbjct: 40  LTVGSPPQRVTMVLDTGSELSWLHCKKLPNL----NFIFNPLVSSSYTPTPCTSPICTTQ 95

Query: 222 ----LDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFG--NSGSVKGIALGCGHDNEGL 275
               ++  +C AN+  + +     +F VG      + FG  ++G+  G       D +  
Sbjct: 96  TRDLINPVSCDANKLCHII-----TFFVGGPAQRGMVFGCMDTGTSSG-------DEDS- 142

Query: 276 FVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLE--FNSARGGDAVTAPLIR 333
              + GL+G+  G LS + Q++    +YC+ ++DS    VLE   N  R G     PL++
Sbjct: 143 --KTTGLMGMDLGSLSFSNQMRLPKFSYCISNKDSTGVLVLENIANPPRLGPLHYTPLVK 200

Query: 334 NKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRD 393
                 ++      F           S F  D  G G  +VD  T  T L+   Y +L++
Sbjct: 201 KTTPLPYFNRNCCLFQK---------SAFLPDHTGAGQTMVDSATQFTFLRQPVYTALKN 251

Query: 394 SFVRLAGNLKPTSGVALF------DTCYDFSGLRSVRV-PTVSLHFGAGKALDLPAKNYL 446
            F     N+    G   F      D C+      ++ V P V+L F  G  L +  +  L
Sbjct: 252 EFAIQTKNILTPLGDPKFVFQGVMDLCFRVPIGSTLPVLPVVTLMFD-GAELRVTGERLL 310

Query: 447 IPVDSAGT-----FCFAFAPTSSALS----IIGNVQQQGTRVSFDLANNRVGFTPNKC 495
             V +        +CF F   S  L     IIG+  Q+   + +DLAN+R+GF+   C
Sbjct: 311 YKVSNVAKSNSWIYCFTFG-NSDLLGIEAFIIGHHHQRNVWMEYDLANSRIGFSDTNC 367


>gi|357507803|ref|XP_003624190.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355499205|gb|AES80408.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 476

 Score =  118 bits (296), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 101/378 (26%), Positives = 165/378 (43%), Gaps = 42/378 (11%)

Query: 150 SGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPK 204
           +G    +G Y++++G+G+P ++F + +DTGSDI W+ C  CT C ++S       ++DP 
Sbjct: 63  NGLPSSTGLYYTKVGLGSPAKEFYVQVDTGSDILWVNCAGCTACPKKSGLGMDLTLYDPN 122

Query: 205 TSSSYSPLPCAAPQCK---SLDVSACRANR-CLYQVAYGDGSFTVGDLVTETVSFGN-SG 259
            S + + +PC    C    S  +S C+ +  C Y + YGDGS T G  V ++++F   SG
Sbjct: 123 GSKTSNAVPCGDGFCTDTYSGPISGCKQDMSCPYSITYGDGSTTSGSFVNDSLTFDEVSG 182

Query: 260 SVK------GIALGCGHDNEGLFVGSA-----GLLGLGGGMLSLTKQIKATS-----LAY 303
           ++        +  GCG    G    ++     G++G G    S+  Q+ A+       ++
Sbjct: 183 NLHTKPDNSSVIFGCGAKQSGSLSSNSDEALDGIIGFGQANSSVLSQLAASGKVKRIFSH 242

Query: 304 CLVDRDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFE 363
           CL        G+            T PL+        Y V L    V G+ + +P  LF 
Sbjct: 243 CL--DSHHGGGIFSIGQVMEPKFNTTPLVPRM---AHYNVILKDMDVDGEPILLPLYLF- 296

Query: 364 MDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSV 423
            D     G I+D GT +  L    YN L    +     LK       F TC+ +S     
Sbjct: 297 -DSGSGRGTIIDSGTTLAYLPLSIYNQLLPKVLGRQPGLKLMIVEDQF-TCFHYSDKLDE 354

Query: 424 RVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSA------LSIIGNVQQQGT 477
             P V  HF  G +L +   +YL  +     +C  +  +S+       L +IG++     
Sbjct: 355 GFPVVKFHF-EGLSLTVHPHDYLF-LYKEDIYCIGWQKSSTQTKEGRDLILIGDLVLSNK 412

Query: 478 RVSFDLANNRVGFTPNKC 495
            V +DL N  +G+T   C
Sbjct: 413 LVVYDLENMVIGWTNFNC 430


>gi|115457772|ref|NP_001052486.1| Os04g0334700 [Oryza sativa Japonica Group]
 gi|113564057|dbj|BAF14400.1| Os04g0334700 [Oryza sativa Japonica Group]
          Length = 482

 Score =  118 bits (296), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 100/401 (24%), Positives = 169/401 (42%), Gaps = 44/401 (10%)

Query: 122 LAIYNVDRHELKPAEAQILP-EDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGS 180
           L  ++ +RH  +   A  LP   F+ P       G+G Y++ IG+GTP  ++ + LDTGS
Sbjct: 51  LQTHDENRHRRRNLMAAELPLGGFNIPY------GTGLYYTDIGIGTPAVKYYVQLDTGS 104

Query: 181 DINWLQCRPCTECYQQSDPI-----FDPKTSSSYSPLPCAAPQCKSLDVSACRAN-RCLY 234
              W+    C +C  +SD +     +DP++S S   + C    C S     C    RC Y
Sbjct: 105 KAFWVNGISCKQCPHESDILRKLTFYDPRSSVSSKEVKCDDTICTSR--PPCNMTLRCPY 162

Query: 235 QVAYGDGSFTVGDLVTETVS----FGNSG---SVKGIALGCGHDNEGLFVGSA----GLL 283
              Y DG  T+G L T+ +     +GN     +   +  GCG    G    SA    G++
Sbjct: 163 ITGYADGGLTMGILFTDLLHYHQLYGNGQTQPTSTSVTFGCGLQQSGSLNNSAVAIDGII 222

Query: 284 GLGGGMLSLTKQIKATS-----LAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNKKVD 338
           G G    +   Q+ A        ++CL    +   G+            T P+++N +V 
Sbjct: 223 GFGNSNQTALSQLAAAGKTKKIFSHCL--DSTNGGGIFAIGEVVEPKVKTTPIVKNNEV- 279

Query: 339 TFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRL 398
            ++ V L   +V G  +Q+P ++F   +    G  +D G+ +  L    Y+ L       
Sbjct: 280 -YHLVNLKSINVAGTTLQLPANIFGTTKT--KGTFIDSGSTLVYLPEIIYSEL--ILAVF 334

Query: 399 AGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFA 458
           A +   T G      C+ F G    + P ++ HF     LD+   +YL+  +    +CF 
Sbjct: 335 AKHPDITMGAMYNFQCFHFLGSVDDKFPKITFHFENDLTLDVYPYDYLLEYE-GNQYCFG 393

Query: 459 FAPTS----SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           F          + I+G++      V +D+    +G+T + C
Sbjct: 394 FQDAGIHGYKDMIILGDMVISNKVVVYDMEKQAIGWTEHNC 434


>gi|359478045|ref|XP_002267046.2| PREDICTED: aspartic proteinase-like protein 2-like [Vitis vinifera]
          Length = 502

 Score =  118 bits (296), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 98/376 (26%), Positives = 161/376 (42%), Gaps = 40/376 (10%)

Query: 150 SGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPK 204
           +G  +  G Y+++IG+GTP R + + +DTGSDI W+ C  C EC ++S       ++D K
Sbjct: 89  TGRPEAVGLYYAKIGIGTPARDYYVQVDTGSDIMWVNCIQCNECPKKSSLGMELTLYDIK 148

Query: 205 TSSSYSPLPCAAPQCKSLD---VSACRANR-CLYQVAYGDGSFTVGDLVTETVSFGN-SG 259
            S +   + C    C +++    S C AN  C Y   Y DGS + G  V + V +   SG
Sbjct: 149 ESLTGKLVSCDQDFCYAINGGPPSYCIANMSCSYTEIYADGSSSFGYFVRDIVQYDQVSG 208

Query: 260 SVK------GIALGCGHDNEGLFVGSA---GLLGLGGGMLSLTKQIKATS-----LAYCL 305
            ++       +  GC     G         G+LG G    S+  Q+ ++       A+CL
Sbjct: 209 DLETTSANGSVIFGCSATQSGDLSSEEALDGILGFGKSNTSMISQLASSGKVRKMFAHCL 268

Query: 306 VDRDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMD 365
              +    G+            T PL+ N+   T Y V +    VGG  + +P  +F++ 
Sbjct: 269 DGLN--GGGIFAIGHIVQPKVNTTPLVPNQ---THYNVNMKAVEVGGYFLNLPTDVFDVG 323

Query: 366 EAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRV 425
           +    G I+D GT +  L    Y+ L         +LK  +    F TC+ +S       
Sbjct: 324 DK--KGTIIDSGTTLAYLPEVVYDQLLSKIFSWQSDLKVHTIHDQF-TCFQYSESLDDGF 380

Query: 426 PTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSA------LSIIGNVQQQGTRV 479
           P V+ HF     L +    YL   D  G +C  +  +         ++++G++      V
Sbjct: 381 PAVTFHFENSLYLKVHPHEYLFSYD--GLWCIGWQNSGMQSRDRRNITLLGDLALSNKLV 438

Query: 480 SFDLANNRVGFTPNKC 495
            +DL N  +G+T   C
Sbjct: 439 LYDLENQVIGWTEYNC 454


>gi|356523724|ref|XP_003530485.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 488

 Score =  118 bits (296), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 109/386 (28%), Positives = 165/386 (42%), Gaps = 58/386 (15%)

Query: 150 SGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPK 204
           SG     G Y+++IG+GTPP+ + + +DTGSDI W+ C  C EC  +S       ++D K
Sbjct: 74  SGRPDAVGLYYAKIGIGTPPKNYYLQVDTGSDIMWVNCIQCKECPTRSSLGMDLTLYDIK 133

Query: 205 TSSSYSPLPCAAPQCKSLD---VSACRAN-RCLYQVAYGDGSFTVGDLVTETVSFGN-SG 259
            SSS   +PC    CK ++   ++ C AN  C Y   YGDGS T G  V + V +   SG
Sbjct: 134 ESSSGKLVPCDQEFCKEINGGLLTGCTANISCPYLEIYGDGSSTAGYFVKDIVLYDQVSG 193

Query: 260 SVK------GIALGCGHDNEGLFVGSA-----GLLGLGGGMLSLTKQIKATS-----LAY 303
            +K       I  GCG    G    S      G+LG G    S+  Q+ ++       A+
Sbjct: 194 DLKTDSANGSIVFGCGARQSGDLSSSNEEALDGILGFGKANSSMISQLASSGKVKKMFAH 253

Query: 304 CLVDRDSPASGVLEFNSARGGDAVTAPLIRNKKVD-TFYYVGLTGFSVGGQAVQIPPSLF 362
           CL             N   GG       +   KV+ T        +SV   AVQ+  +  
Sbjct: 254 CL-------------NGVNGGGIFAIGHVVQPKVNMTPLLPDQPHYSVNMTAVQVGHTFL 300

Query: 363 EM--DEAGDG---GIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFD--TCY 415
            +  D +  G   G I+D GT +  L    Y  L    +    +LK  +   L D  TC+
Sbjct: 301 SLSTDTSAQGDRKGTIIDSGTTLAYLPEGIYEPLVYKMISQHPDLKVQT---LHDEYTCF 357

Query: 416 DFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPT------SSALSII 469
            +S       P V+  F  G +L +   +YL P  S   +C  +  +      S  ++++
Sbjct: 358 QYSESVDDGFPAVTFFFENGLSLKVYPHDYLFP--SVNFWCIGWQNSGTQSRDSKNMTLL 415

Query: 470 GNVQQQGTRVSFDLANNRVGFTPNKC 495
           G++      V +DL N  +G+    C
Sbjct: 416 GDLVLSNKLVFYDLENQAIGWAEYNC 441


>gi|125572774|gb|EAZ14289.1| hypothetical protein OsJ_04213 [Oryza sativa Japonica Group]
          Length = 492

 Score =  118 bits (295), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 104/370 (28%), Positives = 154/370 (41%), Gaps = 38/370 (10%)

Query: 138 QILPEDFSTPVVSGASQ----GSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTEC 193
           Q +P D       G SQ     +G Y     VGTPP+  + VLD  SD  W+QC  C  C
Sbjct: 72  QAVPADGGENGGGGQSQDPATNTGMYVLSFSVGTPPQVVTGVLDITSDFVWMQCSACATC 131

Query: 194 YQQSDPIFDPKTSSSYSPLPCAAPQCKSL----DVSACRANRCLYQVAYGDGSF--TVGD 247
                         + +P   +AP   +     D  A     C Y   YG G+   T G 
Sbjct: 132 -------------GADAPAATSAPPFYAFLSFHDTRAPTTPPCGYSYVYGGGAANTTAGL 178

Query: 248 LVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVD 307
           L  +  +F       G+  GC    EG      G++GLG G LS   Q++    +Y L  
Sbjct: 179 LAVDAFAFATV-RADGVIFGCAVATEGDI---GGVIGLGRGELSPVSQLQIGRFSYYLAP 234

Query: 308 RDSPASG----VLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFE 363
            D+   G     L+    R   AV+ PL+ ++   + YYV L G  V G+ + IP   F+
Sbjct: 235 DDAVDVGSFILFLDDAKPRTSRAVSTPLVASRASRSLYYVELAGIRVDGEDLAIPRGTFD 294

Query: 364 MDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVAL-FDTCYDFSGLRS 422
           +   G GG+++     +T L   AY  +R +       L+   G  L  D CY    L +
Sbjct: 295 LQADGSGGVVLSITIPVTFLDAGAYKVVRQAMASKI-ELRAADGSELGLDLCYTSESLAT 353

Query: 423 VRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSA-LSIIGNVQQQGTRVSF 481
            +VP+++L F  G  ++L   NY     + G  C    P+ +   S++G++ Q    VS 
Sbjct: 354 AKVPSMALVFAGGAVMELEMGNYFYMDSTTGLECLTILPSPAGDGSLLGSLIQ----VSL 409

Query: 482 DLANNRVGFT 491
                R  FT
Sbjct: 410 LSCRRRADFT 419


>gi|388505490|gb|AFK40811.1| unknown [Medicago truncatula]
          Length = 193

 Score =  118 bits (295), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 65/170 (38%), Positives = 89/170 (52%), Gaps = 3/170 (1%)

Query: 327 VTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQ 386
           VT PLI N    +FYY+ L   SVG   + I  S FE+ + G GG+I+D GT IT ++  
Sbjct: 23  VTTPLITNPLQPSFYYISLEVISVGDTKLSIEQSTFEVSDDGSGGVIIDSGTTITYIEEN 82

Query: 387 AYNSLRDSFVRLAGNLKPTSGVALFDTCYDF-SGLRSVRVPTVSLHFGAGKALDLPAKNY 445
           A++SL+  F          SG    D C+   SG   V +P +  HF  G  L+LP +NY
Sbjct: 83  AFDSLKKEFTSQTKLPVDKSGSTGLDVCFSLPSGKTEVEIPKLVFHFKGGD-LELPGENY 141

Query: 446 LIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           +I   S G  C A    S+ +SI GN+QQQ   V+ DL    + F P +C
Sbjct: 142 MIADSSLGVACLAMG-ASNGMSIFGNIQQQNILVNHDLQKETITFIPTQC 190


>gi|242091057|ref|XP_002441361.1| hypothetical protein SORBIDRAFT_09g025220 [Sorghum bicolor]
 gi|241946646|gb|EES19791.1| hypothetical protein SORBIDRAFT_09g025220 [Sorghum bicolor]
          Length = 439

 Score =  118 bits (295), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 118/416 (28%), Positives = 171/416 (41%), Gaps = 84/416 (20%)

Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQC-----RPCTECYQ--QSDPIFDPKTSSSYSP 211
           Y   + +GTPP+ F + LDTGSD+ W+ C       C +C    +  P F P  S+S + 
Sbjct: 25  YLLSLNLGTPPQVFQVYLDTGSDLTWVPCGSSSSYQCLDCGSSVKPTPTFLPSESTSNTR 84

Query: 212 LPCAAPQCKSLDVSACRANRCL--------------------YQVAYGDGSFTVGDLVTE 251
             C +  C  +  S  R + C                     +   YG G+  +G L  +
Sbjct: 85  DLCGSRFCVDVHSSDNRFDPCAAAGCAIPAFTGGQCPRPCPPFSYTYGGGALVLGSLSRD 144

Query: 252 TVSFGNSGSVKGIALGCGHDNEGL------FVGSA-----GLLGLGGGMLSLTKQIK--A 298
           +V+    GS  G   G G             VGS+     G+ G G G LSL  Q+    
Sbjct: 145 SVTL--HGSTHGSGAGAGPLPVAFPGFGFGCVGSSIREPLGIAGFGRGALSLPSQLGFLG 202

Query: 299 TSLAYCLV--------DRDSP-ASGVLEFNSAR-GGDAVTAPLIRNKKVDTFYYVGLTGF 348
              ++C +        +  SP   G L  +SA   G  V  P++ +     FYYVGL G 
Sbjct: 203 KGFSHCFLGFRFARNPNFTSPLVMGDLALSSASTDGGFVFTPMLTSATYPNFYYVGLEGV 262

Query: 349 SVG----GQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAG---- 400
            +G    G A+  PPSL  +D  G+GG++VD GT  T+L    Y S+  S +  A     
Sbjct: 263 VLGDDDGGSAMAAPPSLSGIDAQGNGGVLVDTGTTYTQLPDPFYASVLASLISAAPPYER 322

Query: 401 --NLKPTSGVALFDTCYDFSGLRSV----RVPTVSLHFGAGKALDLPAKNYLIPV----D 450
             +L+  +G   FD C+     R+      +P ++LH   G  L LP  +   PV    D
Sbjct: 323 SRDLEARTG---FDLCFKVPCARAPCADDELPPITLHLAGGARLALPKLSSYYPVTAIRD 379

Query: 451 SAGTFCFAF-----------APTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           S    C  F                  +++G+ Q Q   V +DLA  RVGF P  C
Sbjct: 380 SVVVKCLLFQRMEMEDDGDGTSGGGPAAVLGSFQMQNVEVVYDLAAGRVGFRPRDC 435


>gi|296086208|emb|CBI31649.3| unnamed protein product [Vitis vinifera]
          Length = 761

 Score =  118 bits (295), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 106/359 (29%), Positives = 149/359 (41%), Gaps = 87/359 (24%)

Query: 163 IGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSL 222
           + VG+PP+  +MVLDTGS+++WL C+     +     +FDP  SSSYSP+PC +P     
Sbjct: 379 LTVGSPPQTVTMVLDTGSELSWLHCKKAPNLHS----VFDPLRSSSYSPIPCTSP----- 429

Query: 223 DVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGL 282
               CR                     T T S                        + GL
Sbjct: 430 ---TCR---------------------TRTHS-----------------------KTTGL 442

Query: 283 LGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGD----------AVTAPLI 332
           +G+  G LS   Q+     +YC+  +DS  SG+L F  +               ++ PL 
Sbjct: 443 IGMNRGSLSFVTQMGLQKFSYCISGQDS--SGILLFGESSFSWLKALKYTPLVQISTPLP 500

Query: 333 RNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLR 392
              +V   Y V L G  V    +Q+P S++  D  G G  +VD GT  T L    Y +L+
Sbjct: 501 YFDRVA--YTVQLEGIKVANSMLQLPKSVYAPDHTGAGQTMVDSGTQFTFLLGPVYTALK 558

Query: 393 DSFVR-LAGNLKPTSGVAL-----FDTCYDFSGLRSVR--VPTVSLHF-GAGKALDLPAK 443
           + FVR    +LK             D CY     R     +PTV+L F GA  ++     
Sbjct: 559 NEFVRQTKASLKVLEDPNFVFQGAMDLCYRVPLTRRTLPPLPTVTLMFRGAEMSVSAERL 618

Query: 444 NYLIP---VDSAGTFCFAFAPTSSALS----IIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            Y +P     S   +CF F   S  L     IIG+  QQ   + FDLA +RVGF   +C
Sbjct: 619 MYRVPGVIRGSDSVYCFTFG-NSELLGVESYIIGHHHQQNVWMEFDLAKSRVGFAEVRC 676


>gi|147789749|emb|CAN67405.1| hypothetical protein VITISV_025616 [Vitis vinifera]
          Length = 609

 Score =  118 bits (295), Expect = 9e-24,   Method: Compositional matrix adjust.
 Identities = 111/387 (28%), Positives = 159/387 (41%), Gaps = 54/387 (13%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRP---CTEC-YQQSDP----IFDPKTSSS 208
           G Y   +  GTP +  S V+DTGS + W  C     CT C +   DP     F PK SSS
Sbjct: 88  GGYSVSLSFGTPSQTLSFVMDTGSSLVWFPCTSRYVCTRCSFPNIDPAKIPTFIPKLSSS 147

Query: 209 YSPLPCAAPQCKSLDVSACRANRC---------------LYQVAYGDGSFTVGDLVTETV 253
              + C  P+C  +  S  R  RC                Y + YG G+ TVG L+ E++
Sbjct: 148 AKIVGCLNPKCGFVMDSEVR-TRCPGCDQNSANCTKACPTYAIQYGLGT-TVGLLLLESL 205

Query: 254 SFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDR---DS 310
            F    +     +GC   +       +G+ G G G  SL KQ+     +YCL+     DS
Sbjct: 206 VFAER-TEPDFVVGCSILSSR---QPSGIAGFGRGPSSLPKQMGLKKFSYCLLSHRFDDS 261

Query: 311 PASGVLEF-------NSARGGDAVTA----PLIRNKKVDTFYYVGLTGFSVGGQAVQIPP 359
           P S  +         +   GG + T     P+  N     +YYV L    VG + V+ P 
Sbjct: 262 PKSSKMTLYVGPDSKDDKTGGLSYTPFRKNPVSSNSAFKEYYYVTLRHIIVGDKRVKXPY 321

Query: 360 SLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVAL---FDTCYD 416
           S       G+GG IVD G+  T ++   + ++   F R   N    + V        C++
Sbjct: 322 SFMVAGSDGNGGTIVDSGSTFTFMEKPVFEAVATEFDRQMANYTRAADVEALSGLKPCFN 381

Query: 417 FSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALS--------I 468
            SG+ SV +P++   F  G  ++LP  NY   V      C       +  S        I
Sbjct: 382 LSGVGSVALPSLVFQFKGGAKMELPVANYFSLVGDLSVLCLTIVSNEAVGSTLSSGPSII 441

Query: 469 IGNVQQQGTRVSFDLANNRVGFTPNKC 495
           +GN Q Q     +DL N R GF   +C
Sbjct: 442 LGNYQSQNFYTEYDLENERFGFRRQRC 468


>gi|414881575|tpg|DAA58706.1| TPA: hypothetical protein ZEAMMB73_168363 [Zea mays]
          Length = 506

 Score =  117 bits (294), Expect = 9e-24,   Method: Compositional matrix adjust.
 Identities = 108/387 (27%), Positives = 163/387 (42%), Gaps = 57/387 (14%)

Query: 156 SGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYS 210
           +G YF+ I +GTPP+++ + +DTGSDI W+ C  C++C ++S        +DPK SSS S
Sbjct: 84  TGLYFTEIKLGTPPKRYYVQVDTGSDILWVNCISCSKCPRKSGLGLDLTFYDPKASSSGS 143

Query: 211 PLPCAAPQCKSL---DVSACRANR-CLYQVAYGDGSFTVGDLVTETVSF----GNSGSVK 262
            + C    C +     +  C AN  C Y V YGDGS T G  +T+ + F    G+  +  
Sbjct: 144 TVSCDQGFCAATYGGKLPGCTANVPCEYSVMYGDGSSTTGFFITDALQFDQVTGDGQTQP 203

Query: 263 G---IALGCGHDNEGLFVGSA----GLLGLGGGMLSLTKQIKATS-----LAYCLVDRDS 310
           G   I  GCG    G    S     G+LG G    S+  Q+ A        A+CL   D+
Sbjct: 204 GNATITFGCGAQQGGDLGNSNQALDGILGFGQANTSMLSQLAAAGKAKKIFAHCL---DT 260

Query: 311 PASG-------------VLEFNSARGGDAVTAPLIRNKKV---DTFYYVGLTGFSVGGQA 354
              G                F  A G   +  PL     +      Y V L    VGG  
Sbjct: 261 IKGGGIFAIGNVVQPKCYFVFFFAHG--LLNIPLFLLVMILLSRPHYNVNLKSIDVGGTT 318

Query: 355 VQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTC 414
           +Q+P  +FE  E    G I+D GT +T L    +  + D       ++   +       C
Sbjct: 319 LQLPAHVFETGEK--KGTIIDSGTTLTYLPELVFKQVMDVVFSKHRDIAFHNLQDFL--C 374

Query: 415 YDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAF------APTSSALSI 468
           + +SG      PT++ HF    AL +    Y  P +    +C  F      +     + +
Sbjct: 375 FQYSGSVDDGFPTITFHFEDDLALHVYPHEYFFP-NGNDIYCVGFQNGALQSKDGKDIVL 433

Query: 469 IGNVQQQGTRVSFDLANNRVGFTPNKC 495
           +G++      V +DL N  +G+T   C
Sbjct: 434 MGDLVLSNKLVVYDLENQVIGWTDYNC 460


>gi|218186522|gb|EEC68949.1| hypothetical protein OsI_37668 [Oryza sativa Indica Group]
          Length = 421

 Score =  117 bits (294), Expect = 9e-24,   Method: Compositional matrix adjust.
 Identities = 109/377 (28%), Positives = 172/377 (45%), Gaps = 60/377 (15%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCR-PCTECYQQSDPIFDPKTSSSYSPLPCA 215
           G Y+  + +G PPR + + +DTGSD+ WLQC  PC  C +   P++ P  +     +PC 
Sbjct: 56  GLYYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVSCSKVPHPLYRPTKNKL---VPCV 112

Query: 216 APQCKSLD-----VSACRA--NRCLYQVAYGDGSFTVGDLVTET--VSFGNSGSVK-GIA 265
              C +L         C +   +C Y++ Y D   ++G LVT++  +   NS  V+ G+A
Sbjct: 113 DQMCAALHGGLTGRHKCDSPKQQCDYEIKYADQGSSLGVLVTDSFALRLANSSIVRPGLA 172

Query: 266 LGCGHDNEGLFVGSA-------GLLGLGGGMLSLTKQIKATSL-----AYCLVDRDSPAS 313
            GCG+D +   VGS+       G+LGLG G +SL  Q+K   +      +CL  R     
Sbjct: 173 FGCGYDQQ---VGSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCLSTR---GG 226

Query: 314 GVLEFNS--ARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGG 371
           G L F         A  AP+ R+   + +Y  G      GG+ + + P    M+      
Sbjct: 227 GFLFFGDDIVPYSRATWAPMARSTSRN-YYSPGSANLYFGGRPLGVRP----ME------ 275

Query: 372 IIVDCGTAITRLQTQAYNSLRDSFV-RLAGNLKPTSGVALFDTCYD----FSGLRSVR-- 424
           ++ D G++ T    Q Y +L D+    L+ NLK     +L   C+     F  +  V+  
Sbjct: 276 VVFDSGSSFTYFSAQPYQALVDAIKGDLSKNLKEVPDHSL-PLCWKGKKPFKSVLDVKKE 334

Query: 425 VPTVSLHFGAGKA--LDLPAKNYLIPVDSAGTFCFAFAPTSSA----LSIIGNVQQQGTR 478
             TV L F  GK   +++P +NYLI V   G  C      S      L+I+G++  Q   
Sbjct: 335 FKTVVLSFSNGKKALMEIPPENYLI-VTKYGNACLGILNGSEVGLKDLNIVGDITMQDQM 393

Query: 479 VSFDLANNRVGFTPNKC 495
           V +D    ++G+    C
Sbjct: 394 VIYDNERGQIGWIRAPC 410


>gi|357510893|ref|XP_003625735.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355500750|gb|AES81953.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 535

 Score =  117 bits (294), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 111/418 (26%), Positives = 170/418 (40%), Gaps = 87/418 (20%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYSP 211
           G YF+++ +G+P ++F + +DTGSDI WL C  C  C + S        FD  +SS+ + 
Sbjct: 69  GLYFTKVKMGSPAKEFYVQIDTGSDILWLNCNTCNNCPKSSGLGIDLNYFDTASSSTAAL 128

Query: 212 LPCAAPQCK---SLDVSAC--RANRCLYQVAYGDGSFTVG---------DLVTETVSFGN 257
           + C+ P C        S C  +AN+C Y   YGDGS T G         D++     F N
Sbjct: 129 VSCSDPVCSYAVQTATSQCSSQANQCSYTFQYGDGSGTSGYYVYDAMYFDVIMGQSVFSN 188

Query: 258 SGSVKGIALGCGHDNEGLFVGSA----GLLGLGGGMLSLTKQIKATSLA-----YCLVDR 308
           S S   +  GC     G    +     G+ G G G LS+  Q+ +  +A     +CL  +
Sbjct: 189 SSST--VVFGCSTYQSGDLARTEKAVDGIFGFGPGALSVVSQVSSQGMAPKVFSHCLKGQ 246

Query: 309 DSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAG 368
            S   G+L        + V  PL+    +   Y + L   +V GQ + I   +F      
Sbjct: 247 GS-GGGILVLGEILEPNIVYTPLV---PLQPHYNLNLQSIAVNGQILPIDQDVFA--TGN 300

Query: 369 DGGIIVDCGTAITRLQTQAYNSLRDS---------FVRLAGNLKPTSG------------ 407
           + G IVD GT +  L  +AY+   ++         F     N+K   G            
Sbjct: 301 NRGTIVDSGTTLAYLVQEAYDPFLNAGSPCHFFTHFNEPTNNIKYEDGNNNHQSRVKRHY 360

Query: 408 -------------------VALFDTCYDFSGLRSVRVPT--------VSLHFGAGKALDL 440
                              V+ F       G +   VPT        VSL+F  G ++ L
Sbjct: 361 YDEVTLRLVLKHSAIITTTVSQFSKPIISKGNQCYLVPTSLGDIFPLVSLNFMGGASMVL 420

Query: 441 PAKNYLIP---VDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
             + YLI    +D A  +C  F       +I+G++  +     +DLAN R+G+T   C
Sbjct: 421 KPEQYLIHYGFLDGAAMWCIGFQKVQKGYTILGDLVLKDKIFVYDLANQRIGWTDYDC 478


>gi|367066697|gb|AEX12632.1| hypothetical protein 2_5918_01 [Pinus taeda]
 gi|367066699|gb|AEX12633.1| hypothetical protein 2_5918_01 [Pinus taeda]
 gi|367066701|gb|AEX12634.1| hypothetical protein 2_5918_01 [Pinus taeda]
 gi|367066703|gb|AEX12635.1| hypothetical protein 2_5918_01 [Pinus taeda]
 gi|367066705|gb|AEX12636.1| hypothetical protein 2_5918_01 [Pinus taeda]
 gi|367066707|gb|AEX12637.1| hypothetical protein 2_5918_01 [Pinus taeda]
 gi|367066709|gb|AEX12638.1| hypothetical protein 2_5918_01 [Pinus taeda]
 gi|367066711|gb|AEX12639.1| hypothetical protein 2_5918_01 [Pinus taeda]
 gi|367066713|gb|AEX12640.1| hypothetical protein 2_5918_01 [Pinus taeda]
 gi|367066715|gb|AEX12641.1| hypothetical protein 2_5918_01 [Pinus taeda]
 gi|367066717|gb|AEX12642.1| hypothetical protein 2_5918_01 [Pinus taeda]
          Length = 137

 Score =  117 bits (293), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 59/133 (44%), Positives = 80/133 (60%), Gaps = 5/133 (3%)

Query: 142 EDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIF 201
           +D   PV    S G+GE+  ++ +G P   +S +LDTGSD+ W QC PC++CY+Q  PI+
Sbjct: 8   KDVQAPV----SAGNGEFLMQLAIGKPSLAYSAILDTGSDLTWTQCMPCSDCYKQPTPIY 63

Query: 202 DPKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSV 261
           DP  SS+Y  + C +  C +L  SAC +  C Y   YGD S T G L  ET +  +S S+
Sbjct: 64  DPSLSSTYGTVSCKSSLCLALPASACISATCEYLYTYGDYSSTQGILSYETFTL-SSQSI 122

Query: 262 KGIALGCGHDNEG 274
             IA GCG DNEG
Sbjct: 123 PHIAFGCGQDNEG 135


>gi|115487628|ref|NP_001066301.1| Os12g0177500 [Oryza sativa Japonica Group]
 gi|108862256|gb|ABA96613.2| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113648808|dbj|BAF29320.1| Os12g0177500 [Oryza sativa Japonica Group]
 gi|215693997|dbj|BAG89196.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 421

 Score =  117 bits (293), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 109/377 (28%), Positives = 172/377 (45%), Gaps = 60/377 (15%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCR-PCTECYQQSDPIFDPKTSSSYSPLPCA 215
           G Y+  + +G PPR + + +DTGSD+ WLQC  PC  C +   P++ P  +     +PC 
Sbjct: 56  GLYYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVSCSKVPHPLYRPTKNKL---VPCV 112

Query: 216 APQCKSLD-----VSACRA--NRCLYQVAYGDGSFTVGDLVTET--VSFGNSGSVK-GIA 265
              C +L         C +   +C Y++ Y D   ++G LVT++  +   NS  V+ G+A
Sbjct: 113 DQMCAALHGGLTGRHKCDSPKQQCDYEIKYADQGSSLGVLVTDSFALRLANSSIVRPGLA 172

Query: 266 LGCGHDNEGLFVGSA-------GLLGLGGGMLSLTKQIKATSL-----AYCLVDRDSPAS 313
            GCG+D +   VGS+       G+LGLG G +SL  Q+K   +      +CL  R     
Sbjct: 173 FGCGYDQQ---VGSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCLSTR---GG 226

Query: 314 GVLEFNS--ARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGG 371
           G L F         A  AP+ R+   + +Y  G      GG+ + + P    M+      
Sbjct: 227 GFLFFGDDIVPYSRATWAPMARSTSRN-YYSPGSANLYFGGRPLGVRP----ME------ 275

Query: 372 IIVDCGTAITRLQTQAYNSLRDSFV-RLAGNLKPTSGVALFDTCYD----FSGLRSVR-- 424
           ++ D G++ T    Q Y +L D+    L+ NLK     +L   C+     F  +  V+  
Sbjct: 276 VVFDSGSSFTYFSAQPYQALVDAIKGDLSKNLKEVPDHSL-PLCWKGKKPFKSVLDVKKE 334

Query: 425 VPTVSLHFGAGKA--LDLPAKNYLIPVDSAGTFCFAFAPTSSA----LSIIGNVQQQGTR 478
             TV L F  GK   +++P +NYLI V   G  C      S      L+I+G++  Q   
Sbjct: 335 FRTVVLSFSNGKKALMEIPPENYLI-VTKYGNACLGILNGSEVGLKDLNIVGDITMQDQM 393

Query: 479 VSFDLANNRVGFTPNKC 495
           V +D    ++G+    C
Sbjct: 394 VIYDNERGQIGWIRAPC 410


>gi|115452187|ref|NP_001049694.1| Os03g0271900 [Oryza sativa Japonica Group]
 gi|29893618|gb|AAP06872.1| hypothetical protein [Oryza sativa Japonica Group]
 gi|108707424|gb|ABF95219.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|108707425|gb|ABF95220.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113548165|dbj|BAF11608.1| Os03g0271900 [Oryza sativa Japonica Group]
 gi|215715205|dbj|BAG94956.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215737033|dbj|BAG95962.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215740994|dbj|BAG97489.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 447

 Score =  117 bits (293), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 118/385 (30%), Positives = 160/385 (41%), Gaps = 59/385 (15%)

Query: 163 IGVGTPPRQFSMVLDTGSDINWLQCR-----PCTECYQQSDPIFDPKTSSSYSPLPCAAP 217
           + VGTPP+  +MVLDTGS+++WL C      P T       P F+   SSSY  +PC + 
Sbjct: 59  VAVGTPPQNVTMVLDTGSELSWLLCNGSYAPPLT-------PAFNASGSSSYGAVPCPST 111

Query: 218 QC----KSLDVSA-CR---ANRCLYQVAYGDGSFTVGDLVTET--VSFGNSGSVKGIALG 267
            C    + L V   C    +N C   ++Y D S   G L T+T  ++ G      G   G
Sbjct: 112 ACEWRGRDLPVPPFCDTPPSNACRVSLSYADASSADGVLATDTFLLTGGAPPVAVGAYFG 171

Query: 268 C----------GHDNEGLFVGSA--GLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGV 315
           C            +  G  V  A  GLLG+  G LS   Q      AYC+   + P   +
Sbjct: 172 CITSYSSTTATNSNGTGTDVSEAATGLLGMNRGTLSFVTQTGTRRFAYCIAPGEGPGVLL 231

Query: 316 LEFNSARGGDAVTAPLIR-NKKVDTF----YYVGLTGFSVGGQAVQIPPSLFEMDEAGDG 370
           L  +          PLI  ++ +  F    Y V L G  VG   + IP S+   D  G G
Sbjct: 232 LGDDGGVAPPLNYTPLIEISQPLPYFDRVAYSVQLEGIRVGCALLPIPKSVLTPDHTGAG 291

Query: 371 GIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSG------VALFDTCYDFSGLR--- 421
             +VD GT  T L   AY +L+  F   A  L    G         FD C+     R   
Sbjct: 292 QTMVDSGTQFTFLLADAYAALKAEFTSQARLLLAPLGEPGFVFQGAFDACFRGPEARVAA 351

Query: 422 -SVRVPTVSLHF-GAGKALDLPAKNYLIPVDSAG------TFCFAFAPTSSA---LSIIG 470
            S  +P V L   GA  A+      Y++P +  G       +C  F  +  A     +IG
Sbjct: 352 ASGLLPEVGLVLRGAEVAVSGEKLLYMVPGERRGEGGAEAVWCLTFGNSDMAGMSAYVIG 411

Query: 471 NVQQQGTRVSFDLANNRVGFTPNKC 495
           +  QQ   V +DL N RVGF P +C
Sbjct: 412 HHHQQNVWVEYDLQNGRVGFAPARC 436


>gi|125543284|gb|EAY89423.1| hypothetical protein OsI_10930 [Oryza sativa Indica Group]
          Length = 447

 Score =  117 bits (293), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 118/385 (30%), Positives = 160/385 (41%), Gaps = 59/385 (15%)

Query: 163 IGVGTPPRQFSMVLDTGSDINWLQCR-----PCTECYQQSDPIFDPKTSSSYSPLPCAAP 217
           + VGTPP+  +MVLDTGS+++WL C      P T       P F+   SSSY  +PC + 
Sbjct: 59  VAVGTPPQNVTMVLDTGSELSWLLCNGSYAPPLT-------PAFNASGSSSYGAVPCPST 111

Query: 218 QC----KSLDVSA-CR---ANRCLYQVAYGDGSFTVGDLVTET--VSFGNSGSVKGIALG 267
            C    + L V   C    +N C   ++Y D S   G L T+T  ++ G      G   G
Sbjct: 112 ACEWRGRDLPVPPFCDTPPSNACRVSLSYADASSADGVLATDTFLLTGGAPPVAVGAYFG 171

Query: 268 C----------GHDNEGLFVGSA--GLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGV 315
           C            +  G  V  A  GLLG+  G LS   Q      AYC+   + P   +
Sbjct: 172 CITSYSSTTATNSNGTGTDVSEAATGLLGMNRGTLSFVTQTGTRRFAYCIAPGEGPGVLL 231

Query: 316 LEFNSARGGDAVTAPLIR-NKKVDTF----YYVGLTGFSVGGQAVQIPPSLFEMDEAGDG 370
           L  +          PLI  ++ +  F    Y V L G  VG   + IP S+   D  G G
Sbjct: 232 LGDDGGVAPPLNYTPLIEISQPLPYFDRVAYSVQLEGIRVGCALLPIPKSVLTPDHTGAG 291

Query: 371 GIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSG------VALFDTCYDFSGLR--- 421
             +VD GT  T L   AY +L+  F   A  L    G         FD C+     R   
Sbjct: 292 QTMVDSGTQFTFLLADAYAALKAEFTSQARLLLAPLGEPGFVFQGAFDACFRGPEARVAA 351

Query: 422 -SVRVPTVSLHF-GAGKALDLPAKNYLIPVDSAG------TFCFAFAPTSSA---LSIIG 470
            S  +P V L   GA  A+      Y++P +  G       +C  F  +  A     +IG
Sbjct: 352 ASGLLPVVGLVLRGAEVAVSGEKLLYMVPGERRGEGGAEAVWCLTFGNSDMAGMSAYVIG 411

Query: 471 NVQQQGTRVSFDLANNRVGFTPNKC 495
           +  QQ   V +DL N RVGF P +C
Sbjct: 412 HHHQQNVWVEYDLQNGRVGFAPARC 436


>gi|108862257|gb|ABA96612.2| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
          Length = 451

 Score =  117 bits (293), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 109/377 (28%), Positives = 172/377 (45%), Gaps = 60/377 (15%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCR-PCTECYQQSDPIFDPKTSSSYSPLPCA 215
           G Y+  + +G PPR + + +DTGSD+ WLQC  PC  C +   P++ P  +     +PC 
Sbjct: 56  GLYYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVSCSKVPHPLYRPTKNKL---VPCV 112

Query: 216 APQCKSLD-----VSACRA--NRCLYQVAYGDGSFTVGDLVTET--VSFGNSGSVK-GIA 265
              C +L         C +   +C Y++ Y D   ++G LVT++  +   NS  V+ G+A
Sbjct: 113 DQMCAALHGGLTGRHKCDSPKQQCDYEIKYADQGSSLGVLVTDSFALRLANSSIVRPGLA 172

Query: 266 LGCGHDNEGLFVGSA-------GLLGLGGGMLSLTKQIKATSL-----AYCLVDRDSPAS 313
            GCG+D +   VGS+       G+LGLG G +SL  Q+K   +      +CL  R     
Sbjct: 173 FGCGYDQQ---VGSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCLSTR---GG 226

Query: 314 GVLEFNS--ARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGG 371
           G L F         A  AP+ R+   + +Y  G      GG+ + + P    M+      
Sbjct: 227 GFLFFGDDIVPYSRATWAPMARSTSRN-YYSPGSANLYFGGRPLGVRP----ME------ 275

Query: 372 IIVDCGTAITRLQTQAYNSLRDSFV-RLAGNLKPTSGVALFDTCYD----FSGLRSVR-- 424
           ++ D G++ T    Q Y +L D+    L+ NLK     +L   C+     F  +  V+  
Sbjct: 276 VVFDSGSSFTYFSAQPYQALVDAIKGDLSKNLKEVPDHSL-PLCWKGKKPFKSVLDVKKE 334

Query: 425 VPTVSLHFGAGKA--LDLPAKNYLIPVDSAGTFCFAFAPTSSA----LSIIGNVQQQGTR 478
             TV L F  GK   +++P +NYLI V   G  C      S      L+I+G++  Q   
Sbjct: 335 FRTVVLSFSNGKKALMEIPPENYLI-VTKYGNACLGILNGSEVGLKDLNIVGDITMQDQM 393

Query: 479 VSFDLANNRVGFTPNKC 495
           V +D    ++G+    C
Sbjct: 394 VIYDNERGQIGWIRAPC 410


>gi|242092874|ref|XP_002436927.1| hypothetical protein SORBIDRAFT_10g011140 [Sorghum bicolor]
 gi|241915150|gb|EER88294.1| hypothetical protein SORBIDRAFT_10g011140 [Sorghum bicolor]
          Length = 484

 Score =  117 bits (292), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 101/360 (28%), Positives = 151/360 (41%), Gaps = 35/360 (9%)

Query: 155 GSGEYFSRIGVGTPPRQFSMVLDTGSD-INWLQCRPCTE---CYQQSDPIFDPKTSSSYS 210
           G+ EY    G GTP +QF++  DT +     LQC+PC     C+      FDP  SSS +
Sbjct: 141 GAFEYHVTAGFGTPVQQFTVGFDTTTTGATQLQCKPCAADEPCHHA----FDPSASSSIA 196

Query: 211 PLPCAAPQC---KSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALG 267
            +PC +P C   K     +C  +  +     G+ +F    L     +  +      +  G
Sbjct: 197 HVPCGSPDCPFNKGCSGHSCTLSVSINNTLLGNATFFTDKLTLTPWNIVDDFRFVCLEAG 256

Query: 268 CGHDNEGLFVGSAGLLGLGGGMLSLTKQI-----KATSLAYCLVDRDSPASGVLEFNSAR 322
              D++     S G+L L     SL  +       A + +YCL    S   G L   + +
Sbjct: 257 FRPDDD-----STGILDLSRNSHSLASRAAPSSPDAVAFSYCLPSYPSDV-GFLSLGATK 310

Query: 323 ----GGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGT 378
               G      PL  N+     Y V L G  +GG  + +P +      AG GG I++  T
Sbjct: 311 PELLGRKVSYTPLRSNRHNGNLYVVELVGLGLGGVDLPVPRAAI----AG-GGTILELHT 365

Query: 379 AITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKAL 438
             T L+ + Y +LRD F +              DTCY+F+ L S  VP V+L F  G   
Sbjct: 366 TFTYLKPKVYAALRDEFRKSMSQYPVAPPQGSLDTCYNFTALSSYSVPAVTLKFDGGAEF 425

Query: 439 DLPAKNYLIPVDSAGTF---CFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           DL     +   +    F   C AF       ++IG++ Q  T V +D+   +VGF P +C
Sbjct: 426 DLWIDEMMYFPEPGSYFSVGCLAFVAQDGG-AVIGSMAQMSTEVVYDVRGGKVGFVPYRC 484


>gi|224114179|ref|XP_002332420.1| predicted protein [Populus trichocarpa]
 gi|222832373|gb|EEE70850.1| predicted protein [Populus trichocarpa]
          Length = 449

 Score =  116 bits (291), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 92/372 (24%), Positives = 154/372 (41%), Gaps = 41/372 (11%)

Query: 159 YFSRIGVG--------TPPRQFSMVLDTGSDINWLQCRPCTE----CYQQSDPIFDPKTS 206
           + +++GVG        T  + +   +DTG++++W+QC  C      C+   DP +    S
Sbjct: 80  FLAQVGVGSFQEKSHRTHFKTYYFQIDTGNELSWIQCEGCQNKGNMCFPHKDPPYTSSQS 139

Query: 207 SSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSF----GNSGSVK 262
            SY P+ C   Q    + + C+   C Y V YG GS+T G+L  ET +F    G   ++K
Sbjct: 140 KSYKPVSCN--QHSFCEPNQCKEGLCAYNVTYGPGSYTSGNLANETFTFYSNHGKHTALK 197

Query: 263 GIALGCGHDNEGLFVG-------SAGLLGLGGGMLSLTKQIKATS---LAYCLVDRDSPA 312
            I+ GC  D+  +           +G+LG+G G  S   Q+ + S    +YC+   ++  
Sbjct: 198 SISFGCSTDSRNMIYAFLLDKNPVSGVLGMGWGPRSFLAQLGSISHGKFSYCITANNTHN 257

Query: 313 SGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGI 372
           + +           +    I   K    Y+V L G SV G  + I  +   + + G  G 
Sbjct: 258 TYLRFGKHVVKSKNLQTTKIMQVKPSAAYHVNLLGISVNGVKLNITKTDLAVRKDGSRGC 317

Query: 373 IVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALF-------DTCYD-FSGLRSVR 424
           I+D GT  T L    +++L  +   L+ +L     +  +       D CY+  S      
Sbjct: 318 IIDAGTLATLLVKPIFDTLHTA---LSNHLSSNQNLKRWVIHKLHKDLCYEQLSDAGRKN 374

Query: 425 VPTVSLHFGAGKALDLPAKNYLI-PVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDL 483
           +P V+ H         P   +L    +    FC +     S  +IIG  QQ   +  +D 
Sbjct: 375 LPVVTFHLENADLEVKPEAIFLFREFEGKNVFCLSMLSDDSK-TIIGAYQQMKQKFVYDT 433

Query: 484 ANNRVGFTPNKC 495
               + F P  C
Sbjct: 434 KARVLSFGPEDC 445


>gi|218184944|gb|EEC67371.1| hypothetical protein OsI_34484 [Oryza sativa Indica Group]
          Length = 396

 Score =  116 bits (291), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 96/362 (26%), Positives = 156/362 (43%), Gaps = 39/362 (10%)

Query: 159 YFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQ 218
           Y +   +GTPP+  S ++D   ++ W QC  C  C++Q  P+F P  SS++ P PC    
Sbjct: 45  YVANFTIGTPPQPASAIVDVAGELVWTQCSACRRCFKQDLPVFVPNASSTFKPEPCGTAV 104

Query: 219 CKSLDVSACRANRCLYQ----VAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNE- 273
           C+S+   +C  + C Y+       G+   T G   T+T + G + +V+ +A GC   ++ 
Sbjct: 105 CESIPTRSCSGDVCSYKGPPTQLRGN---TSGFAATDTFAIGTA-TVR-LAFGCVVASDI 159

Query: 274 GLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSA---RGGDAV-TA 329
               G +G +GLG    SL  Q+K T  +YCL  R++  S  L   S+    G ++  TA
Sbjct: 160 DTMDGPSGFIGLGRTPWSLVAQMKLTRFSYCLSPRNTGKSSRLFLGSSAKLAGSESTSTA 219

Query: 330 PLIRNKKVD---TFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIV-DCGTAITRLQT 385
           P I+    D    +Y + L     G   +           A  GGI+V    +  + L  
Sbjct: 220 PFIKTSPDDDGSNYYLLSLDAIRAGNTTIAT---------AQSGGILVMHTVSPFSLLVD 270

Query: 386 QAYNSLRDSFVRLAGN---LKPTSGVALFDTCY-DFSGLRSVRVPTVSLHFGAGKALDLP 441
            AY + + +     G        +    FD C+   +G      P +   F    AL +P
Sbjct: 271 SAYKAFKKAVTEAVGGAAAPPMATPPQPFDLCFKKAAGFSRATAPDLVFTFQGAAALTVP 330

Query: 442 AKNYLIPV-DSAGTFCFAFAPTS-------SALSIIGNVQQQGTRVSFDLANNRVGFTPN 493
              YLI V +   T C A    +         +S++G++QQ+     +DL    + F P 
Sbjct: 331 PAKYLIDVGEEKDTACAAILSMAWLNRTGLEGVSVLGSLQQEDVHFLYDLKKETLSFEPA 390

Query: 494 KC 495
            C
Sbjct: 391 DC 392


>gi|367066719|gb|AEX12643.1| hypothetical protein 2_5918_01 [Pinus radiata]
          Length = 137

 Score =  116 bits (291), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 59/133 (44%), Positives = 80/133 (60%), Gaps = 5/133 (3%)

Query: 142 EDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIF 201
           +D   PV    S G+GE+  ++ +G P   +S +LDTGSD+ W QC PC++CY+Q  PI+
Sbjct: 8   KDVQAPV----SAGNGEFLMQLAIGKPSLAYSAILDTGSDLTWTQCIPCSDCYKQPTPIY 63

Query: 202 DPKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSV 261
           DP  SS+Y  + C +  C +L  SAC +  C Y   YGD S T G L  ET +  +S S+
Sbjct: 64  DPSLSSTYGTVSCKSSLCLALPASACISATCEYLYTYGDYSSTQGILSYETFTL-SSQSI 122

Query: 262 KGIALGCGHDNEG 274
             IA GCG DNEG
Sbjct: 123 PHIAFGCGQDNEG 135


>gi|194701538|gb|ACF84853.1| unknown [Zea mays]
 gi|194703714|gb|ACF85941.1| unknown [Zea mays]
          Length = 208

 Score =  116 bits (291), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 83/221 (37%), Positives = 113/221 (51%), Gaps = 21/221 (9%)

Query: 283 LGLGGGMLSLTKQIKAT---SLAYCLVDRDSPASGVLEFNSARGGDA---VTAPLIRNKK 336
           +GLGGG  SL  Q   T   + +YCL    S +SG L   +A G      V  P++R+ +
Sbjct: 1   MGLGGGAQSLVSQTAGTLGRAFSYCLPPTPS-SSGFLTLGAAGGSGTSGFVKTPMLRSSQ 59

Query: 337 VDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFV 396
           V TFY V L    VGG+ + IP S+F        G ++D GT ITRL   AY++L  +F 
Sbjct: 60  VPTFYGVRLQAIRVGGRQLSIPASVFS------AGTVMDSGTVITRLPPTAYSALSSAFK 113

Query: 397 RLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFC 456
                  P     + DTC+DFSG  SV +P+V+L F  G  + L A   ++      + C
Sbjct: 114 AGMKQYPPAQPSGILDTCFDFSGQSSVSIPSVALVFSGGAVVSLDASGIIL------SNC 167

Query: 457 FAFAPTS--SALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            AFA  S  S+L IIGNVQQ+   V +D+    VGF    C
Sbjct: 168 LAFAGNSDDSSLGIIGNVQQRTFEVLYDVGRGVVGFRAGAC 208


>gi|194708432|gb|ACF88300.1| unknown [Zea mays]
          Length = 452

 Score =  116 bits (291), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 109/373 (29%), Positives = 159/373 (42%), Gaps = 59/373 (15%)

Query: 178 TGSDINWL------QCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANR 231
           +GS + W+      +CR C+     + P+F PK SSS   + C  P C+ +  +A  A +
Sbjct: 79  SGSHLTWVPCTSSYECRNCSSPSASAVPVFHPKNSSSSRLVGCRNPSCQWVHSAANLATK 138

Query: 232 CL---------------------YQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGH 270
           C                      Y V YG GS T G L+ +T+      +V G  LGC  
Sbjct: 139 CRRAPCSPGAANCPAAASNVCPPYAVVYGSGS-TAGLLIADTLR-APGRAVPGFVLGC-- 194

Query: 271 DNEGLFVGSAGLLGLGGGMLSLTKQIKATSLAYCLVDR----DSPASGVLEFNSARGGDA 326
               +    +GL G G G  S+  Q+     +YCL+ R    ++  SG L      GG+ 
Sbjct: 195 SLVSVHQPPSGLAGFGRGAPSVPAQLGLPKFSYCLLSRRFDDNAAVSGSLVLGGTGGGEG 254

Query: 327 VT-APLIRNKKVD-----TFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAI 380
           +   PL+++   D      +YY+ L G +VGG+AV++P   F  + AG GG IVD GT  
Sbjct: 255 MQYVPLVKSAAGDKLPYGVYYYLALRGVTVGGKAVRLPARAFAANAAGSGGTIVDSGTTF 314

Query: 381 TRLQTQAYNSLRDSFVRLAGNLKPTSGVAL----FDTCYDF-SGLRSVRVPTVSLHFGAG 435
           T L    +  + D+ V   G     S  A        C+    G RS+ +P +S HF  G
Sbjct: 315 TYLDPTVFQPVADAVVAAVGGRYKRSKDAEDELGLHPCFALPQGARSMALPELSFHFEGG 374

Query: 436 KALDLPAKNYLIPVDSAGT--FCFAFAPTSSALS-----------IIGNVQQQGTRVSFD 482
             + LP +NY +          C A     S  S           I+G+ QQQ   V +D
Sbjct: 375 AVMQLPVENYFVVAGRGAVEAICLAVVTDFSGGSGAGNEGSGPAIILGSFQQQNYLVEYD 434

Query: 483 LANNRVGFTPNKC 495
           L   R+GF    C
Sbjct: 435 LEKERLGFRRQSC 447


>gi|224074147|ref|XP_002304273.1| predicted protein [Populus trichocarpa]
 gi|222841705|gb|EEE79252.1| predicted protein [Populus trichocarpa]
          Length = 496

 Score =  116 bits (291), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 110/398 (27%), Positives = 159/398 (39%), Gaps = 80/398 (20%)

Query: 173 SMVLDTGSDINWLQCRP--CTECYQQSD-----PIFDPKTSSSYSPLPCAAPQC------ 219
           S+ LDTGSD+ W  C+P  C  C  +++         PK S + +P+ C +  C      
Sbjct: 94  SLYLDTGSDLVWFPCQPFECILCEGKAENASLASTPPPKLSKTATPVSCKSSACSAVHSN 153

Query: 220 --------------KSLDVSACRANRC-LYQVAYGDGSFTVGDLVTETVSFGNSGSVKGI 264
                         +S+++S CR + C  +  AYGDGS  +  L  +++    S     I
Sbjct: 154 LPSSDLCAISNCPLESIEISDCRKHSCPQFYYAYGDGSL-IARLYRDSIRLPLSNQTNLI 212

Query: 265 ----ALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS------LAYCLV----DRDS 310
                 GC H      +G AG    G G+LSL  Q+   S       +YCLV    D D 
Sbjct: 213 FNNFTFGCAHTTLAEPIGVAGF---GRGVLSLPAQLATLSPQLGNQFSYCLVSHSFDSDR 269

Query: 311 ---PASGVL----------EFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQI 357
              P+  +L            N  +    V   ++ N +   FY VGL G S+G + +  
Sbjct: 270 VRRPSPLILGRYDHDEKERRVNGVKKPSFVYTSMLDNPRHPYFYCVGLEGISIGRKKIPA 329

Query: 358 PPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDT---- 413
           P  L ++D  G GG++VD GT  T L    Y+ +   F    G +   + V   +T    
Sbjct: 330 PDFLRKVDRKGSGGVVVDSGTTFTMLPASLYDFVVAEFENRVGRVNERASVIEENTGLSP 389

Query: 414 CYDFSGLRSVRVPTVSLHF-GAGKALDLPAKNYLIPV--------DSAGTFCFAFAPTSS 464
           CY F          V LHF G G ++ LP +NY                  C        
Sbjct: 390 CYYFDNNVVNVP-RVVLHFVGNGSSVVLPRRNYFYEFLDGGHGKGKKRKVGCLMLMNGGD 448

Query: 465 ALSI-------IGNVQQQGTRVSFDLANNRVGFTPNKC 495
              +       +GN QQQG  V +DL N RVGF   +C
Sbjct: 449 EAELSGGPGATLGNYQQQGFEVVYDLENRRVGFARRQC 486


>gi|326524806|dbj|BAK04339.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 460

 Score =  116 bits (290), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 113/392 (28%), Positives = 168/392 (42%), Gaps = 66/392 (16%)

Query: 153 SQGSGEY--FSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYS 210
           +Q  G Y   + +G G   R + + LD  +++ W+QC+P  E + Q  P F+P  S S+ 
Sbjct: 78  TQVGGMYSVVTSVGTGAGRRTYVLALDMTTNLLWMQCKPVQEPFTQLPPPFEPAKSPSFR 137

Query: 211 PLPCAAPQCKSLDVSACRANR------CLYQVAYGDGSFTV-GDLVTETVSFGNSGS--- 260
            LP     C    + A R +R      C +     DGS    G L  ET++F  SG    
Sbjct: 138 RLPGNNAFC----LPAPRGHRRTVQDPCKFHSIRLDGSADARGVLSNETLAFAASGQQQT 193

Query: 261 -VKGIALGCGHDNEGLFVGS----AGLLGLGGGMLSLTK--------QIKATSLAYCL-- 305
            V G+ +GC H+++G    S    AG+LGLG    SL           ++    +YCL  
Sbjct: 194 EVTGVVIGCTHNSKGFNFNSHGVLAGVLGLGRQAPSLIWTLGQHRHGTVQVHRFSYCLPS 253

Query: 306 -------------VDRDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGG 352
                         D D P +  +        D+ T+   R       Y+V LTG SV G
Sbjct: 254 HGSSSSDHHTFLRFDDDVPNTQHMVSTKIMYMDSTTSRDFRA------YFVSLTGISVAG 307

Query: 353 QAVQIPPSLFEMDEAGD---GGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVA 409
           + +Q    LF+    G     G   D GT    +   AYN L+D+ VR   +LKP  G+ 
Sbjct: 308 KPLQDVKELFKRHVHGQVWTSGCAFDAGTPTMVMIMPAYNKLKDAVVR---HLKPL-GLQ 363

Query: 410 L----FDTCYDFSGLRSVRVPTVSLHFGAGKA-LDLPAKNYLIPVDSAGTFCFAFAPTSS 464
           +    +  C+  +      +PTV L F   +A L LP +   + V      C A    S 
Sbjct: 364 IVSGQYHLCFRATSQLWQHLPTVMLQFAETEARLVLPPQRLFVAVGY--DICLAVV-RSY 420

Query: 465 ALSIIGNVQQQGTRVSFDLANNRVGFTP-NKC 495
            ++IIG +QQ   R  +D+ + R+ F P N C
Sbjct: 421 DITIIGAMQQVDKRFVYDVRHGRIYFVPENAC 452


>gi|359476206|ref|XP_002262837.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 462

 Score =  116 bits (290), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 114/393 (29%), Positives = 178/393 (45%), Gaps = 49/393 (12%)

Query: 124 IYNVDRHELKPAEAQIL-------PEDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVL 176
           I+  DR  ++   A+IL        +D  +P    +    G +   +G G P +  ++++
Sbjct: 87  IFLQDRSRVRSINARILGQYSTEESKDGGSPESMHSLNEDGFFLVNVGFGKPQQNLNLII 146

Query: 177 DTGSDINWLQCRPCT--ECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANRCLY 234
           DTGSD  W++C  C+   C+ +  P F+P  SSSYS   C         + + + N   Y
Sbjct: 147 DTGSDTTWIRCNSCSLGNCHNKKIPTFNPSLSSSYSNRSC---------IPSTKTN---Y 194

Query: 235 QVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEGLFVGSAGLLGLGGG----ML 290
            + Y D S++ G  V + V+       K    GCG    G F  ++G+LGL  G    ++
Sbjct: 195 TMNYEDNSYSKGVFVCDEVTLKPDVFPK-FQFGCGDSGGGDFGSASGVLGLAQGEQYSLI 253

Query: 291 SLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGDAVTA-PLIR-----NKKVDTFYYVG 344
           S T        +YC    ++   G L F    G  A++A P ++     N    + Y+V 
Sbjct: 254 SQTASKFKKKFSYCFPHNEN-TRGSLLF----GEKAISASPSLKFTRLLNPSSGSVYFVE 308

Query: 345 LTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVR---LAGN 401
           L G SV  + + +  SLF        G I+D GT IT L T AY +LR +F +      +
Sbjct: 309 LIGISVAKKRLNVSSSLF-----ASPGTIIDSGTVITHLPTAAYEALRTAFQQEMLHCPS 363

Query: 402 LKPTSGVALFDTCYDFSGL--RSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAF 459
           + P       DTCY+  G   R++++P + LHF     + L     L         C AF
Sbjct: 364 VSPPPQEKPLDTCYNLKGCGGRNIKLPEIVLHFVGEVDVSLHPSGILWANGDLTQACLAF 423

Query: 460 APTS--SALSIIGNVQQQGTRVSFDLANNRVGF 490
           A  S  S ++IIGN QQ   +V +D+   R+GF
Sbjct: 424 ARKSHPSHVTIIGNRQQVSLKVVYDIEGGRLGF 456


>gi|449442281|ref|XP_004138910.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
 gi|449506266|ref|XP_004162699.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 482

 Score =  115 bits (289), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 100/388 (25%), Positives = 160/388 (41%), Gaps = 62/388 (15%)

Query: 150 SGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIF-------- 201
           +G    SG YF++IG+GTP + + + +DTGSDI W+ C  CT C ++SD           
Sbjct: 65  NGHPSESGLYFAKIGLGTPVQDYYVQVDTGSDILWVNCAGCTNCPKKSDLGIELSLYSPS 124

Query: 202 ------------DPKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLV 249
                       D  TS+   P+P   P+             C Y+VAYGDGS T G  V
Sbjct: 125 SSSTSNRVTCNQDFCTSTYDGPIPGCTPEL-----------LCEYRVAYGDGSSTAGYFV 173

Query: 250 TETV-------SFGNSGSVKGIALGCGHDNEGLFVGSA----GLLGLGGGMLSLTKQIKA 298
            + V       +F  + +   I  GCG    G    ++    G+LG G    S+  Q+ +
Sbjct: 174 RDHVVLDRVTGNFQTTSTNGSIVFGCGAQQSGQLGATSAALDGILGFGQANSSMISQLAS 233

Query: 299 TS-----LAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQ 353
           +       A+CL + +    G+            T PL+  +     Y V +    V  +
Sbjct: 234 SGKVKRVFAHCLDNIN--GGGIFAIGEVVQPKVRTTPLVPQQ---AHYNVFMKAIEVDNE 288

Query: 354 AVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDT 413
            + +P  +F+ D     G I+D GT +       Y  L          LK  +    F T
Sbjct: 289 VLNLPTDVFDTDLR--KGTIIDSGTTLAYFPDVIYEPLISKIFARQSTLKLHTVEEQF-T 345

Query: 414 CYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSA------LS 467
           C+++ G      PTV+ HF    +L +    YL  +DS   +C  +  + +       + 
Sbjct: 346 CFEYDGNVDDGFPTVTFHFEDSLSLTVYPHEYLFDIDS-NKWCVGWQNSGAQSRDGKDMI 404

Query: 468 IIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           ++G++  Q   V +DL N  +G+T   C
Sbjct: 405 LLGDLVLQNRLVMYDLENQTIGWTEYNC 432


>gi|356535252|ref|XP_003536162.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 475

 Score =  115 bits (289), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 95/379 (25%), Positives = 165/379 (43%), Gaps = 43/379 (11%)

Query: 150 SGASQGSGEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPK 204
           +G    +G YF+++G+G+PP+ + + +DTGSDI W+ C  C+ C ++SD      ++DPK
Sbjct: 61  NGLPTETGLYFTKLGLGSPPKDYYVQVDTGSDILWVNCVKCSRCPRKSDLGIDLTLYDPK 120

Query: 205 TSSSYSPLPCAAPQCKSL---DVSACRANR-CLYQVAYGDGSFTVGDLVTETVSFGNSG- 259
            S +   + C    C +     +  C++   C Y + YGDGS T G  V + +++ +   
Sbjct: 121 GSETSELISCDQEFCSATYDGPIPGCKSEIPCPYSITYGDGSATTGYYVQDYLTYNHVND 180

Query: 260 ------SVKGIALGCGHDNEGLFVGSA-----GLLGLGGGMLSLTKQIKATS-----LAY 303
                     I  GCG    G    S+     G++G G    S+  Q+ A+       ++
Sbjct: 181 NLRTAPQNSSIIFGCGAVQSGTLSSSSEEALDGIIGFGQSNSSVLSQLAASGKVKKIFSH 240

Query: 304 CLVDRDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFE 363
           CL   +    G+            T PL+        Y V L    V    +Q+P  +F 
Sbjct: 241 CL--DNIRGGGIFAIGEVVEPKVSTTPLVPRM---AHYNVVLKSIEVDTDILQLPSDIF- 294

Query: 364 MDEAGDG-GIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRS 422
             ++G+G G I+D GT +  L    Y+ L    +     LK       F +C+ ++G   
Sbjct: 295 --DSGNGKGTIIDSGTTLAYLPAIVYDELIPKVMARQPRLKLYLVEQQF-SCFQYTGNVD 351

Query: 423 VRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSA------LSIIGNVQQQG 476
              P V LHF    +L +   +YL      G +C  +  + +       ++++G++    
Sbjct: 352 RGFPVVKLHFEDSLSLTVYPHDYLFQFKD-GIWCIGWQKSVAQTKNGKDMTLLGDLVLSN 410

Query: 477 TRVSFDLANNRVGFTPNKC 495
             V +DL N  +G+T   C
Sbjct: 411 KLVIYDLENMAIGWTDYNC 429


>gi|21717169|gb|AAM76362.1|AC074196_20 putative nucleoid DNA binding protein, 3'-partial [Oryza sativa
           Japonica Group]
          Length = 377

 Score =  115 bits (289), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 91/328 (27%), Positives = 149/328 (45%), Gaps = 23/328 (7%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAA 216
           G Y +   +GTPP+  S V+D   ++ W QC PC  C++Q  P+FDP  SS++  LPC +
Sbjct: 55  GLYVANFTIGTPPQPVSAVVDLTGELVWTQCTPCQPCFEQDLPLFDPTKSSTFRGLPCGS 114

Query: 217 PQCKSLDVSA--CRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKGIALGCGHDNEG 274
             C+S+  S+  C ++ C+Y+     G  T G   T+T + G +    G       D   
Sbjct: 115 HLCESIPESSRNCTSDVCIYEAPTKAGD-TGGKAGTDTFAIGAAKETLGFGCVVMTDKRL 173

Query: 275 LFVGS-AGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPA--SGVLEFNSARGGDAVTAPL 331
             +G  +G++GLG    SL  Q+  T+ +YCL  + S A   G      A G ++ T  +
Sbjct: 174 KTIGGPSGIVGLGRTPWSLVTQMNVTAFSYCLAGKSSGALFLGATAKQLAGGKNSSTPFV 233

Query: 332 IR------NKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQT 385
           I+      +   + +Y V L G   GG  +Q   S           +++D  +  + L  
Sbjct: 234 IKTSAGSSDNGSNPYYMVKLAGIKTGGAPLQAASSSGST-------VLLDTVSRASYLAD 286

Query: 386 QAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNY 445
            AY +L+ +     G     S    +D C  F    +   P +   F  G AL +P  NY
Sbjct: 287 GAYKALKKALTAAVGVQPVASPPKPYDLC--FPKAVAGDAPELVFTFDGGAALTVPPANY 344

Query: 446 LIPVDSAGTFCFAFAPTSSALSIIGNVQ 473
           L+     GT C     +S++L++ G ++
Sbjct: 345 LL-ASGNGTVCLTIG-SSASLNLTGELE 370


>gi|343172998|gb|AEL99202.1| aspartyl protease family protein, partial [Silene latifolia]
          Length = 584

 Score =  115 bits (288), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 97/353 (27%), Positives = 154/353 (43%), Gaps = 38/353 (10%)

Query: 165 VGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDV 224
           +GTPP++F++++DTGS + ++ C  C +C    DP F P  S +Y P+ C  P C     
Sbjct: 2   IGTPPQEFALIVDTGSTVTYVPCNSCDQCGNHQDPKFQPDLSDTYHPVKC-NPDC----T 56

Query: 225 SACRANRCLYQVAYGDGSFTVGDLVTETVSFGNSGSVKG--IALGCGHDNEG-LFVGSA- 280
                ++C Y+  Y + S + G L  + VSFGN   +K      GC +   G LF   A 
Sbjct: 57  CDTENDQCTYERQYAEMSSSSGILGEDLVSFGNMSELKPQRAVFGCENAETGDLFSQHAD 116

Query: 281 GLLGLGGGMLSLTKQIKATSLAYCLVDRDSPASGVLEFNSARGGDAVTAPLI-------- 332
           G++GLG G LS+  Q+    +   + D  S   G +E     GG A+    I        
Sbjct: 117 GIMGLGRGDLSIVDQLVEKGV---INDSFSLCYGGMEV----GGGAMVLGQISPPSDMVF 169

Query: 333 --RNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNS 390
              +     +Y + L G  V G+ + I P +F+    G  G I+D GT    L   A+  
Sbjct: 170 SHSDPDRSPYYNIELRGLHVAGKKLDINPQVFD----GKHGTILDSGTTYAYLPEAAFLP 225

Query: 391 LRDSFVRLAGNLKPTSG--VALFDTCYDFSGLRSVRV----PTVSLHFGAGKALDLPAKN 444
              +       LK   G      D C+  +G     +    P+V + F  G+   L  +N
Sbjct: 226 FIQAITSELHGLKQIRGPDPNYNDVCFSGAGSEIPELYKTFPSVDMVFDNGEKYSLSPEN 285

Query: 445 YLIPVDSA-GTFCFA-FAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           YL       G +C   F       +++G +  + T V++D  +++VGF    C
Sbjct: 286 YLFKHSKVHGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDREHSKVGFWKTNC 338


>gi|302789618|ref|XP_002976577.1| hypothetical protein SELMODRAFT_416663 [Selaginella moellendorffii]
 gi|300155615|gb|EFJ22246.1| hypothetical protein SELMODRAFT_416663 [Selaginella moellendorffii]
          Length = 437

 Score =  115 bits (288), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 103/364 (28%), Positives = 164/364 (45%), Gaps = 34/364 (9%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYSP 211
           G Y++ IG+G P ++  +++DTGSDI W++C PC  C  + D      I++   SS+ S 
Sbjct: 81  GLYYTEIGLGNPVQKLKVIVDTGSDILWVKCSPCRSCLSKQDIIPPLSIYNLSASSTSSV 140

Query: 212 LPCAAPQCKSLDVSACRANR---CLYQVAYGDGSFTVGDLVTETVSF---GNSGSVKGIA 265
             C+ P C   +V   R+     C Y  +Y D S +VG  V + + +   G + +   I 
Sbjct: 141 SSCSDPLCTGEEVVCSRSGNNSACAYVSSYQDKSASVGAYVRDDMHYVLHGGNATTSRIF 200

Query: 266 LGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS-----LAYCLVDRDSPASGVLEFNS 320
            GC  +  G +    G++G G    ++  QI          ++CL   +    G+LEF  
Sbjct: 201 FGCATNITGSWP-VDGIMGFGLISKTVPNQIATQRNMSRVFSHCL-GGEKHGGGILEFGE 258

Query: 321 A-RGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEM--DEAGDGGIIVDCG 377
           A    + V  PL+    V T Y V L   SV  + + I P  F    +   + G+I+D G
Sbjct: 259 APNTTEMVFTPLL---NVTTHYNVDLLSISVNSKVLPIDPKEFSYVRNSTNNTGVIIDSG 315

Query: 378 TAITRLQTQAYNSLRDSFVRL-AGNLKPT-SGVALFDTCYDFSGL-RSVRVPTVSLHFGA 434
           T    L T+A   L      L    L P   G+  F   Y  SGL      P V+L F  
Sbjct: 316 TTFVLLTTKANRMLFQEIKSLTTAKLGPKLEGLECF---YLKSGLTMETSFPNVTLTFSG 372

Query: 435 GKALDLPAKNYLIPVD---SAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFT 491
           G  + L   NYL+  +       +C+A++ ++  L+I G +  +   V +D+ N R+G+ 
Sbjct: 373 GSTMKLKPDNYLVMAEYKKKRNGYCYAWS-SADGLTIFGEIVLKDKLVFYDVENRRIGWK 431

Query: 492 PNKC 495
              C
Sbjct: 432 GQNC 435


>gi|222613193|gb|EEE51325.1| hypothetical protein OsJ_32293 [Oryza sativa Japonica Group]
          Length = 371

 Score =  115 bits (288), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 90/328 (27%), Positives = 142/328 (43%), Gaps = 32/328 (9%)

Query: 186 QCRPCTECYQQSDPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTV 245
            C  C  C++Q  P+F P  SS++ P PC    CKS+    C ++ C Y    G G  TV
Sbjct: 54  NCSQCIHCFKQDLPVFVPNASSTFKPEPCGTDVCKSIPTPKCASDVCAYDGVTGLGGHTV 113

Query: 246 GDLVTETVSFGNSGSVKGIALGCGHDNEGL-FVGSAGLLGLGGGMLSLTKQIKATSLAYC 304
           G + T+T + G +   +  A G         + G +G +GLG    SL  Q+K T  +YC
Sbjct: 114 GIVATDTFAIGTAAPARPPASGASWRATSTPWAGPSGFIGLGRTPWSLVAQMKLTRFSYC 173

Query: 305 LVDRDSPASGVLEFNSA---RGGDAVTAPLIR---NKKVDTFYYVGLTGFSVGGQAVQIP 358
           L   D+  +  L   ++    GG A T P ++   N  +  +Y + L     G   + +P
Sbjct: 174 LAPHDTGKNSRLFLGASAKLAGGGAWT-PFVKTSPNDGMSQYYPIELEEIKAGDATITMP 232

Query: 359 PSLFEMDEAGDGGIIVDCGTAITR---LQTQAYNSLRDSFVRLAGNLKPTSGV-ALFDTC 414
                    G   ++V   TA+ R   L    Y   + + +   G     + V A F+ C
Sbjct: 233 --------RGRNTVLVQ--TAVVRVSLLVDSVYQEFKKAVMASVGAAPTATPVGAPFEVC 282

Query: 415 YDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTS-------SALS 467
           +  +G+     P +   F AG AL +P  NYL  V +  T C +    +         L+
Sbjct: 283 FPKAGVSG--APDLVFTFQAGAALTVPPANYLFDVGN-DTVCLSVMSIALLNITALDGLN 339

Query: 468 IIGNVQQQGTRVSFDLANNRVGFTPNKC 495
           I+G+ QQ+   + FDL  + + F P  C
Sbjct: 340 ILGSFQQENVHLLFDLDKDMLSFEPADC 367


>gi|449433371|ref|XP_004134471.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
 gi|449495479|ref|XP_004159853.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 424

 Score =  115 bits (288), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 101/357 (28%), Positives = 154/357 (43%), Gaps = 26/357 (7%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPC-TECYQQSDPI---FDPKTSSSYSPL 212
           GEY     +G P  Q    LDT + + W+QC  C ++C  +   +   F    S +Y   
Sbjct: 73  GEYLMSFNIGNPSSQVMGFLDTSNGLIWVQCSNCNSQCEPEKRGLTTKFLSSKSFTYEME 132

Query: 213 PCAAPQCKSLD-VSACRAN--RCLYQVAYGDGSFTVGDLVTETVSFGNSG----SVKGIA 265
           PC +  C SL     C ++   C Y++ YGD   T G L +++  F  S      V  + 
Sbjct: 133 PCGSNFCNSLTGFQTCNSSDKWCKYRLVYGDNKATSGILSSDSFGFDTSDGMLVDVGFLN 192

Query: 266 LGCGHDNEGLFVGS----AGLLGLGGGMLSLTKQIKATSLAYCLVDRDSPAS-GVLEFNS 320
            GC   +E    G      G +GL    LSL  Q+     +YCLV  ++  S   + F S
Sbjct: 193 FGC---SEAPLTGDEQSYTGNVGLNQTPLSLISQLGIKKFSYCLVPFNNLGSTSKMYFGS 249

Query: 321 ARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAI 380
                    PL+        YYV + G S+G         +F++ E  DG II D G   
Sbjct: 250 LPVTSGGQTPLLYPNS--DAYYVKVLGISIGNDEPHFD-GVFDVYEVRDGWII-DTGITY 305

Query: 381 TRLQTQAYNSLRDSFVRLAG-NLKPTSGVALFDTCYDFSGLRSVR-VPTVSLHFGAGKAL 438
           + L+T A++SL   F+ L     +       F+ C++      +   P V++HF  G  L
Sbjct: 306 SSLETDAFDSLLAKFLTLKDFPQRKDDPKERFELCFELQNANDLESFPDVTVHFD-GADL 364

Query: 439 DLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGNVQQQGTRVSFDLANNRVGFTPNKC 495
            L  ++  + ++  G FC A   + S +SI+GN Q Q   V +DL    + F P  C
Sbjct: 365 ILNVESTFVKIEDDGIFCLALLRSGSPVSILGNFQLQNYHVGYDLEAQVISFAPVDC 421


>gi|224138580|ref|XP_002326638.1| predicted protein [Populus trichocarpa]
 gi|222833960|gb|EEE72437.1| predicted protein [Populus trichocarpa]
          Length = 496

 Score =  115 bits (288), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 113/400 (28%), Positives = 158/400 (39%), Gaps = 86/400 (21%)

Query: 174 MVLDTGSDINWLQCRP--CTECYQQSDPIF-----DPKTSSSYSPLPCAAPQC------- 219
           + LDTGSD+ W  C+P  C  C  +++         PK S + +P+ C +  C       
Sbjct: 95  LYLDTGSDLVWFPCQPFECILCEGKAENTSLASTPPPKLSKTATPVSCKSSACSAAHSNL 154

Query: 220 -------------KSLDVSACRANRC-LYQVAYGDGSFTVGDLVTETVSFGNSGS----V 261
                        +S++ S C+ + C  +  AYGDGS  +  L  +++S   S      V
Sbjct: 155 PSSDLCAISNCPLESIETSDCQKHSCPQFYYAYGDGSL-IARLYRDSISLPLSNPTNLIV 213

Query: 262 KGIALGCGHDNEGLFVGSAGLLGLGGGMLSLTKQIKATS------LAYCLV--------- 306
                GC H      +G AG    G G+LSL  Q+   S       +YCLV         
Sbjct: 214 NNFTFGCAHTALAEPIGVAGF---GRGVLSLPAQLATLSPQLGNQFSYCLVSHSFDSDRL 270

Query: 307 -----------DRDSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAV 355
                      D D     V   N  R    V   ++ N +   FY VGL G S+G + +
Sbjct: 271 RRPSPLILGRYDHDEKERRVNGVNKPR---FVYTSMLDNLEHPYFYCVGLEGISIGRKKI 327

Query: 356 QIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDT-- 413
             P  L ++D  G GG++VD GT  T L    Y S+   F    G +   + V   DT  
Sbjct: 328 PAPGFLRKVDGEGSGGLVVDSGTTFTMLPASLYGSVVAEFENRVGRVNERARVIEEDTGL 387

Query: 414 --CYDFSGLRSVRVPTVSLHF-GAGKALDLPAKNYLIPV----------DSAGTFCFAFA 460
             CY F         +V LHF G G ++ LP +NY                 G       
Sbjct: 388 SPCYYFDNNVVNVP-SVVLHFVGNGSSVVLPRRNYFYEFLDGGDGKGKKRKVGCLMLMNG 446

Query: 461 PTSSALS-----IIGNVQQQGTRVSFDLANNRVGFTPNKC 495
              + LS      +GN QQQG  V +DL N RVGF   +C
Sbjct: 447 GEEAELSGGPGATLGNYQQQGFEVVYDLENKRVGFARRQC 486


>gi|357502759|ref|XP_003621668.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355496683|gb|AES77886.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 481

 Score =  115 bits (288), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 100/375 (26%), Positives = 164/375 (43%), Gaps = 48/375 (12%)

Query: 157 GEYFSRIGVGTPPRQFSMVLDTGSDINWLQCRPCTECYQQSD-----PIFDPKTSSSYSP 211
           G Y+++IG+GTP + + + +DTG+D+ W+ C  C EC  +S+      +++ K SSS   
Sbjct: 71  GLYYAKIGIGTPSKDYYLQVDTGTDMMWVNCIQCKECPTRSNLGMDLTLYNIKESSSGKL 130

Query: 212 LPCAAPQCKSLD---VSACRA---NRCLYQVAYGDGSFTVGDLVTETVSFGN-SGSVK-- 262
           +PC    CK ++   ++ C +   + C Y   YGDGS T G  V + V F   SG +K  
Sbjct: 131 VPCDQELCKEINGGLLTGCTSKTNDSCPYLEIYGDGSSTAGYFVKDVVLFDQVSGDLKTA 190

Query: 263 ----GIALGCGHDNEGLFVGSA-----GLLGLGGGMLSLTKQIKATS-----LAYCLVDR 308
                +  GCG    G    S      G+LG G    S+  Q+ ++       A+CL   
Sbjct: 191 SANGSVIFGCGARQSGDLSYSNEEALDGILGFGKANYSMISQLSSSGKVKKMFAHCL--N 248

Query: 309 DSPASGVLEFNSARGGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSLFEMDEAG 368
                G+            T PL+ ++     Y V +T   VG   + +     E  ++ 
Sbjct: 249 GVNGGGIFAIGHVVQPTVNTTPLLPDQP---HYSVNMTAIQVGHTFLNLSTDASEQRDS- 304

Query: 369 DGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFD--TCYDFSGLRSVRVP 426
             G I+D GT +  L    Y  L    +    NLK  +   L D  TC+ +SG      P
Sbjct: 305 -KGTIIDSGTTLAYLPDGIYQPLVYKILSQQPNLKVQT---LHDEYTCFQYSGSVDDGFP 360

Query: 427 TVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPT------SSALSIIGNVQQQGTRVS 480
            V+ +F  G +L +   +YL    S   +C  +  +      S  ++++G++      V 
Sbjct: 361 NVTFYFENGLSLKVYPHDYLFL--SENLWCIGWQNSGAQSRDSKNMTLLGDLVLSNKLVF 418

Query: 481 FDLANNRVGFTPNKC 495
           +DL N  +G+T   C
Sbjct: 419 YDLENQVIGWTEYNC 433


>gi|300078619|gb|ADJ67210.1| aspartic proteinase nepenthesin-1 precursor [Jatropha curcas]
          Length = 84

 Score =  115 bits (288), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 56/84 (66%), Positives = 64/84 (76%), Gaps = 1/84 (1%)

Query: 412 DTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFAFAPTSSALSIIGN 471
           DTC+D SG   V+VPTV+LHF  G  + LPA NYLIPVDS G+FCFAFA T S LSIIGN
Sbjct: 1   DTCFDLSGKTEVKVPTVALHF-RGADVSLPASNYLIPVDSDGSFCFAFAGTMSGLSIIGN 59

Query: 472 VQQQGTRVSFDLANNRVGFTPNKC 495
           +QQQG RV +DLA +RVGF P  C
Sbjct: 60  IQQQGFRVVYDLAGSRVGFAPRGC 83


>gi|218194598|gb|EEC77025.1| hypothetical protein OsI_15381 [Oryza sativa Indica Group]
          Length = 422

 Score =  115 bits (287), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 99/400 (24%), Positives = 168/400 (42%), Gaps = 44/400 (11%)

Query: 122 LAIYNVDRHELKPAEAQILP-EDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGS 180
           L  ++ +RH  +   A  LP   F+ P       G+G Y++ IG+GTP  ++ + LDTGS
Sbjct: 27  LQTHDENRHRRRNLMAAELPLGGFNIPY------GTGLYYTDIGIGTPAVKYYVQLDTGS 80

Query: 181 DINWLQCRPCTECYQQSDPI-----FDPKTSSSYSPLPCAAPQCKSLDVSACRAN-RCLY 234
              W+    C +C  +SD +     +DP++S S   + C    C S     C    RC Y
Sbjct: 81  KAFWVNGISCKQCPHESDILRKLTFYDPRSSVSSKEVKCDDTICTSR--PPCNMTLRCPY 138

Query: 235 QVAYGDGSFTVGDLVTETVS----FGNSG---SVKGIALGCGHDNEGLFVGSA----GLL 283
              Y DG  T+G L T+ +     +GN     +   +  GCG    G    SA    G++
Sbjct: 139 ITGYADGGLTMGILFTDLLHYHQLYGNGQTQPTSTSVTFGCGLQQSGSLNNSAVAIDGII 198

Query: 284 GLGGGMLSLTKQIKATS-----LAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNKKVD 338
           G G    +   Q+ A        ++CL    +   G+            T P+++N +V 
Sbjct: 199 GFGNSNQTALSQLAAAGKTKKIFSHCL--DSTNGGGIFAIGEVVEPKVKTTPIVKNNEV- 255

Query: 339 TFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRL 398
            ++ V L   +V G  +Q+P ++F   +    G  +D G+ +  L    Y+ L       
Sbjct: 256 -YHLVNLKSINVAGTTLQLPANIFGTTKT--KGTFIDSGSTLVYLPEIIYSEL--ILAVF 310

Query: 399 AGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFA 458
           A +   T G      C+ F G    + P ++ HF     LD+   +YL+  +    +CF 
Sbjct: 311 AKHPDITMGAMYNFQCFHFLGSVDDKFPKITFHFENDLTLDVYPYDYLLEYE-GNQYCFG 369

Query: 459 FAPTS----SALSIIGNVQQQGTRVSFDLANNRVGFTPNK 494
           F          + I+G++      V +D+    +G+T + 
Sbjct: 370 FQDAGIHGYKDMIILGDMVISNKVVVYDMEKQAIGWTEHN 409


>gi|125555046|gb|EAZ00652.1| hypothetical protein OsI_22673 [Oryza sativa Indica Group]
          Length = 340

 Score =  115 bits (287), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 80/265 (30%), Positives = 123/265 (46%), Gaps = 26/265 (9%)

Query: 198 DPIFDPKTSSSYSPLPCAAPQCKSLDVSACRANRCLYQVAYGDGSFTVGDLVTETVSFGN 257
           D  FDP  SSS++ +PC +P+C       C    C + + +G+ +   G LV +T++   
Sbjct: 30  DVAFDPSRSSSFAAIPCGSPEC----AVECTGASCPFTIQFGNVTVANGTLVRDTLTLSP 85

Query: 258 SGSVKGIALGC---GHDNEGLFVGSAGLLGLGGGMLSLTKQI--------KATSLAYCLV 306
           S +  G   GC   G D +  F G+ GL+ L     SL  ++           + +YCL 
Sbjct: 86  SATFAGFTFGCIEVGADAD-TFDGAVGLIDLSRSSHSLASRVISNGATTTTTAAFSYCLP 144

Query: 307 DRDSPAS-GVLEFNSAR----GGDAVTAPLIRNKKVDTFYYVGLTGFSVGGQAVQIPPSL 361
              S  S G L   ++R    GGD   AP+  N      Y+V L G SVGG+ + +PP++
Sbjct: 145 SLSSTRSRGFLSIGASRPEYSGGDIKYAPMSSNPNHPNSYFVDLVGISVGGEDLPVPPAV 204

Query: 362 FEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRLAGNLKPTSGVALFDTCYDFSGLR 421
                    G +++  T  T L   AY +LRD+F              + DTCY+ +GL 
Sbjct: 205 LAAH-----GTLLEAATEFTFLAPAAYAALRDAFRNDMAQYPAAPPFRVLDTCYNLTGLA 259

Query: 422 SVRVPTVSLHFGAGKALDLPAKNYL 446
           S+ VP V+L F  G  L+L  +  +
Sbjct: 260 SLAVPAVALRFAGGTELELDVRQTM 284


>gi|215766660|dbj|BAG98888.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 433

 Score =  115 bits (287), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 99/400 (24%), Positives = 168/400 (42%), Gaps = 44/400 (11%)

Query: 122 LAIYNVDRHELKPAEAQILP-EDFSTPVVSGASQGSGEYFSRIGVGTPPRQFSMVLDTGS 180
           L  ++ +RH  +   A  LP   F+ P       G+G Y++ IG+GTP  ++ + LDTGS
Sbjct: 51  LQTHDENRHRRRNLMAAELPLGGFNIPY------GTGLYYTDIGIGTPAVKYYVQLDTGS 104

Query: 181 DINWLQCRPCTECYQQSDPI-----FDPKTSSSYSPLPCAAPQCKSLDVSACRAN-RCLY 234
              W+    C +C  +SD +     +DP++S S   + C    C S     C    RC Y
Sbjct: 105 KAFWVNGISCKQCPHESDILRKLTFYDPRSSVSSKEVKCDDTICTSR--PPCNMTLRCPY 162

Query: 235 QVAYGDGSFTVGDLVTETVS----FGNSG---SVKGIALGCGHDNEGLFVGSA----GLL 283
              Y DG  T+G L T+ +     +GN     +   +  GCG    G    SA    G++
Sbjct: 163 ITGYADGGLTMGILFTDLLHYHQLYGNGQTQPTSTSVTFGCGLQQSGSLNNSAVAIDGII 222

Query: 284 GLGGGMLSLTKQIKATS-----LAYCLVDRDSPASGVLEFNSARGGDAVTAPLIRNKKVD 338
           G G    +   Q+ A        ++CL    +   G+            T P+++N +V 
Sbjct: 223 GFGNSNQTALSQLAAAGKTKKIFSHCL--DSTNGGGIFAIGEVVEPKVKTTPIVKNNEV- 279

Query: 339 TFYYVGLTGFSVGGQAVQIPPSLFEMDEAGDGGIIVDCGTAITRLQTQAYNSLRDSFVRL 398
            ++ V L   +V G  +Q+P ++F   +    G  +D G+ +  L    Y+ L       
Sbjct: 280 -YHLVNLKSINVAGTTLQLPANIFGTTKT--KGTFIDSGSTLVYLPEIIYSEL--ILAVF 334

Query: 399 AGNLKPTSGVALFDTCYDFSGLRSVRVPTVSLHFGAGKALDLPAKNYLIPVDSAGTFCFA 458
           A +   T G      C+ F G    + P ++ HF     LD+   +YL+  +    +CF 
Sbjct: 335 AKHPDITMGAMYNFQCFHFLGSVDDKFPKITFHFENDLTLDVYPYDYLLEYE-GNQYCFG 393

Query: 459 FAPTS----SALSIIGNVQQQGTRVSFDLANNRVGFTPNK 494
           F          + I+G++      V +D+    +G+T + 
Sbjct: 394 FQDAGIHGYKDMIILGDMVISNKVVVYDMEKQAIGWTEHN 433


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.318    0.134    0.395 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 7,721,407,498
Number of Sequences: 23463169
Number of extensions: 337237611
Number of successful extensions: 836930
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 2548
Number of HSP's successfully gapped in prelim test: 1930
Number of HSP's that attempted gapping in prelim test: 825596
Number of HSP's gapped (non-prelim): 5502
length of query: 495
length of database: 8,064,228,071
effective HSP length: 147
effective length of query: 348
effective length of database: 8,910,109,524
effective search space: 3100718114352
effective search space used: 3100718114352
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 79 (35.0 bits)