BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 037264
         (249 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|224111722|ref|XP_002315953.1| predicted protein [Populus trichocarpa]
 gi|222864993|gb|EEF02124.1| predicted protein [Populus trichocarpa]
          Length = 484

 Score =  396 bits (1017), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 201/249 (80%), Positives = 231/249 (92%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
           GDFVTET+TLGSA VDN+AIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINA++FSYCLV
Sbjct: 236 GDFVTETITLGSAPVDNVAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINATSFSYCLV 295

Query: 61  DRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDE 120
           DRDS+S STLEF+S+LPPNAV+APLLRNH LDTFYY+GLTG+SVGG+L+ I E+AF+IDE
Sbjct: 296 DRDSESASTLEFNSTLPPNAVSAPLLRNHHLDTFYYVGLTGLSVGGELVSIPESAFQIDE 355

Query: 121 SGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVP 180
           SGNGG+IVDSGTA+TRLQT+ YN+LRDAFV+ TR L  T+G+ALFDTCYD SS+ +VEVP
Sbjct: 356 SGNGGVIVDSGTAITRLQTDVYNSLRDAFVKRTRDLPSTNGIALFDTCYDLSSKGNVEVP 415

Query: 181 TVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNS 240
           TVSFHFP+GK LPLPAKNYL+P+DS GTFCFAFAPT+SSLSIIGNVQQQGTRV ++L N 
Sbjct: 416 TVSFHFPDGKELPLPAKNYLVPLDSEGTFCFAFAPTASSLSIIGNVQQQGTRVVYDLVNH 475

Query: 241 LIGFTPNKC 249
           L+GF PNKC
Sbjct: 476 LVGFVPNKC 484


>gi|255576511|ref|XP_002529147.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223531426|gb|EEF33260.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 479

 Score =  390 bits (1002), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 199/249 (79%), Positives = 225/249 (90%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
           GDFVTET+TLGSASVDN+AIGCGHNNEGLF+GAAGLLGLGGG LSFPSQINAS+FSYCLV
Sbjct: 231 GDFVTETITLGSASVDNVAIGCGHNNEGLFIGAAGLLGLGGGKLSFPSQINASSFSYCLV 290

Query: 61  DRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDE 120
           DRDSDS STLEF+S+L P+A+TAPLLRN ELDTFYY+G+TG+SVGG+LL I E+ F++DE
Sbjct: 291 DRDSDSASTLEFNSALLPHAITAPLLRNRELDTFYYVGMTGLSVGGELLSIPESMFEMDE 350

Query: 121 SGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVP 180
           SGNGGII+DSGTAVTRLQT  YNALRDAFV+GT+ L  T  VALFDTCYD S ++SVEVP
Sbjct: 351 SGNGGIIIDSGTAVTRLQTAAYNALRDAFVKGTKDLPVTSEVALFDTCYDLSRKTSVEVP 410

Query: 181 TVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNS 240
           TV+FH   GKVLPLPA NYLIPVDS+GTFCFAFAPTSS+LSIIGNVQQQGTRV F+L NS
Sbjct: 411 TVTFHLAGGKVLPLPATNYLIPVDSDGTFCFAFAPTSSALSIIGNVQQQGTRVGFDLANS 470

Query: 241 LIGFTPNKC 249
           L+GF P +C
Sbjct: 471 LVGFEPRQC 479


>gi|225423917|ref|XP_002281973.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
          Length = 491

 Score =  377 bits (967), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 197/250 (78%), Positives = 229/250 (91%), Gaps = 1/250 (0%)

Query: 1   GDFVTETVTL-GSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
           GDF TET+TL GSAS++N+AIGCGH+NEGLFVGAAGLLGLGGGSLSFPSQINAS+FSYCL
Sbjct: 242 GDFATETITLDGSASLNNVAIGCGHDNEGLFVGAAGLLGLGGGSLSFPSQINASSFSYCL 301

Query: 60  VDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKID 119
           V+RD+DS STLEF+S +P ++VTAPLLRN++LDTFYYLG+TGI VGG +L I  ++F++D
Sbjct: 302 VNRDTDSASTLEFNSPIPSHSVTAPLLRNNQLDTFYYLGMTGIGVGGQMLSIPRSSFEVD 361

Query: 120 ESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEV 179
           ESGNGGIIVDSGTAVTRLQ++ YN+LRD+FVRGT+ L  T GVALFDTCYD SSRSSVEV
Sbjct: 362 ESGNGGIIVDSGTAVTRLQSDVYNSLRDSFVRGTQHLPSTSGVALFDTCYDLSSRSSVEV 421

Query: 180 PTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRN 239
           PTVSFHFP+GK L LPAKNYLIPVDS GTFCFAFAPT+S+LSIIGNVQQQGTRVS++L N
Sbjct: 422 PTVSFHFPDGKYLALPAKNYLIPVDSAGTFCFAFAPTTSALSIIGNVQQQGTRVSYDLSN 481

Query: 240 SLIGFTPNKC 249
           SL+GF+PN C
Sbjct: 482 SLVGFSPNGC 491


>gi|449434646|ref|XP_004135107.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 486

 Score =  370 bits (951), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 196/249 (78%), Positives = 217/249 (87%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
           GDFVTETVTLGS S+ NIAIGCGHNNEGLF+GAAGLLGLGGGSLSFPSQ+NAS+FSYCLV
Sbjct: 238 GDFVTETVTLGSTSLGNIAIGCGHNNEGLFIGAAGLLGLGGGSLSFPSQLNASSFSYCLV 297

Query: 61  DRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDE 120
           DRDSDSTSTL+F+S + P+AVTAPL RN  LDTF+YLGLTG+SVGG +LPI ET+F++ E
Sbjct: 298 DRDSDSTSTLDFNSPITPDAVTAPLHRNPNLDTFFYLGLTGMSVGGAVLPIPETSFQMSE 357

Query: 121 SGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVP 180
            GNGGIIVDSGTAVTRLQT  YN LRDAFV+ T  L    GVALFDTCYD SS+S VEVP
Sbjct: 358 DGNGGIIVDSGTAVTRLQTTVYNVLRDAFVKSTHDLQTARGVALFDTCYDLSSKSRVEVP 417

Query: 181 TVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNS 240
           TVSFHF  G  LPLPAKNYLIPVDS GTFCFAFAPT S+LSI+GN QQQGTRV F+L NS
Sbjct: 418 TVSFHFANGNELPLPAKNYLIPVDSEGTFCFAFAPTDSTLSILGNAQQQGTRVGFDLANS 477

Query: 241 LIGFTPNKC 249
           L+GF+PNKC
Sbjct: 478 LVGFSPNKC 486


>gi|449530542|ref|XP_004172253.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
           CELL 1-like [Cucumis sativus]
          Length = 486

 Score =  370 bits (950), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 196/249 (78%), Positives = 217/249 (87%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
           GDFVTETVTLGS S+ NIAIGCGHNNEGLF+GAAGLLGLGGGSLSFPSQ+NAS+FSYCLV
Sbjct: 238 GDFVTETVTLGSTSLGNIAIGCGHNNEGLFIGAAGLLGLGGGSLSFPSQLNASSFSYCLV 297

Query: 61  DRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDE 120
           DRDSDSTSTL+F+S + P+AVTAPL RN  LDTF+YLGLTG+SVGG +LPI ET+F++ E
Sbjct: 298 DRDSDSTSTLDFNSPITPDAVTAPLHRNPNLDTFFYLGLTGMSVGGAVLPIPETSFQMSE 357

Query: 121 SGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVP 180
            GNGGIIVDSGTAVTRLQT  YN LRDAFV+ T  L    GVALFDTCYD SS+S VEVP
Sbjct: 358 DGNGGIIVDSGTAVTRLQTTVYNVLRDAFVKSTHDLQTARGVALFDTCYDLSSKSRVEVP 417

Query: 181 TVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNS 240
           TVSFHF  G  LPLPAKNYLIPVDS GTFCFAFAPT S+LSI+GN QQQGTRV F+L NS
Sbjct: 418 TVSFHFANGNELPLPAKNYLIPVDSEGTFCFAFAPTDSTLSILGNAQQQGTRVGFDLANS 477

Query: 241 LIGFTPNKC 249
           L+GF+PNKC
Sbjct: 478 LVGFSPNKC 486


>gi|15222611|ref|NP_173922.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|12321511|gb|AAG50814.1|AC079281_16 hypothetical protein [Arabidopsis thaliana]
 gi|20466516|gb|AAM20575.1| unknown protein [Arabidopsis thaliana]
 gi|23198172|gb|AAN15613.1| unknown protein [Arabidopsis thaliana]
 gi|110736960|dbj|BAF00436.1| hypothetical protein [Arabidopsis thaliana]
 gi|332192515|gb|AEE30636.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 483

 Score =  363 bits (932), Expect = 3e-98,   Method: Compositional matrix adjust.
 Identities = 183/249 (73%), Positives = 222/249 (89%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
           GDF TET+T+GS  V N+A+GCGH+NEGLFVGAAGLLGLGGG L+ PSQ+N ++FSYCLV
Sbjct: 235 GDFATETLTIGSTLVQNVAVGCGHSNEGLFVGAAGLLGLGGGLLALPSQLNTTSFSYCLV 294

Query: 61  DRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDE 120
           DRDSDS ST++F +SL P+AV APLLRNH+LDTFYYLGLTGISVGG+LL I +++F++DE
Sbjct: 295 DRDSDSASTVDFGTSLSPDAVVAPLLRNHQLDTFYYLGLTGISVGGELLQIPQSSFEMDE 354

Query: 121 SGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVP 180
           SG+GGII+DSGTAVTRLQTE YN+LRD+FV+GT  L    GVA+FDTCY+ S++++VEVP
Sbjct: 355 SGSGGIIIDSGTAVTRLQTEIYNSLRDSFVKGTLDLEKAAGVAMFDTCYNLSAKTTVEVP 414

Query: 181 TVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNS 240
           TV+FHFP GK+L LPAKNY+IPVDS GTFC AFAPT+SSL+IIGNVQQQGTRV+F+L NS
Sbjct: 415 TVAFHFPGGKMLALPAKNYMIPVDSVGTFCLAFAPTASSLAIIGNVQQQGTRVTFDLANS 474

Query: 241 LIGFTPNKC 249
           LIGF+ NKC
Sbjct: 475 LIGFSSNKC 483


>gi|297845610|ref|XP_002890686.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297336528|gb|EFH66945.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 486

 Score =  362 bits (929), Expect = 7e-98,   Method: Compositional matrix adjust.
 Identities = 182/249 (73%), Positives = 222/249 (89%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
           GDF TET+T+GS  V N+A+GCGH+NEGLFVGAAGLLGLGGG L+ PSQ+N ++FSYCLV
Sbjct: 238 GDFATETLTIGSTLVQNVAVGCGHSNEGLFVGAAGLLGLGGGLLALPSQLNTTSFSYCLV 297

Query: 61  DRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDE 120
           DRDSDS ST+EF +SLPP+AV APLLRNH+LDTFYYLGLTGISVGG+LL I +++F++DE
Sbjct: 298 DRDSDSASTVEFGTSLPPDAVVAPLLRNHQLDTFYYLGLTGISVGGELLQIPQSSFEMDE 357

Query: 121 SGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVP 180
           SG+GGII+DSGTAVTRLQT  YN+LRD+F++GT  L    GVA+FDTCY+ S+++++EVP
Sbjct: 358 SGSGGIIIDSGTAVTRLQTGIYNSLRDSFLKGTSDLEKAAGVAMFDTCYNLSAKTTIEVP 417

Query: 181 TVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNS 240
           TV+FHFP GK+L LPAKNY+IPVDS GTFC AFAPT+SSL+IIGNVQQQGTRV+F+L NS
Sbjct: 418 TVAFHFPGGKMLALPAKNYMIPVDSVGTFCLAFAPTASSLAIIGNVQQQGTRVTFDLANS 477

Query: 241 LIGFTPNKC 249
           LIGF+ NKC
Sbjct: 478 LIGFSSNKC 486


>gi|356498306|ref|XP_003517994.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 484

 Score =  358 bits (918), Expect = 1e-96,   Method: Compositional matrix adjust.
 Identities = 186/249 (74%), Positives = 221/249 (88%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
           G+F TETVTLGSA+V+N+AIGCGHNNEGLFVGAAGLLGLGGG LSFP+Q+NA++FSYCLV
Sbjct: 236 GEFATETVTLGSAAVENVAIGCGHNNEGLFVGAAGLLGLGGGKLSFPAQVNATSFSYCLV 295

Query: 61  DRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDE 120
           +RDSD+ STLEF+S LP NA TAPL+RN ELDTFYYLGL GISVGG+ LPI E++F++D 
Sbjct: 296 NRDSDAVSTLEFNSPLPRNAATAPLMRNPELDTFYYLGLKGISVGGEALPIPESSFEVDA 355

Query: 121 SGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVP 180
            G GGII+DSGTAVTRL++E Y+ALRDAFV+G + +   +GV+LFDTCYD SSR SVE+P
Sbjct: 356 IGGGGIIIDSGTAVTRLRSEVYDALRDAFVKGAKGIPKANGVSLFDTCYDLSSRESVEIP 415

Query: 181 TVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNS 240
           TVSF FPEG+ LPLPA+NYLIPVDS GTFCFAFAPT+SSLSIIGNVQQQGTRV F++ NS
Sbjct: 416 TVSFRFPEGRELPLPARNYLIPVDSVGTFCFAFAPTTSSLSIIGNVQQQGTRVGFDIANS 475

Query: 241 LIGFTPNKC 249
           L+GF+ + C
Sbjct: 476 LVGFSVDSC 484


>gi|356502456|ref|XP_003520035.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 484

 Score =  357 bits (915), Expect = 3e-96,   Method: Compositional matrix adjust.
 Identities = 185/249 (74%), Positives = 220/249 (88%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
           G+F TETVTLG+A+V+N+AIGCGHNNEGLFVGAAGLLGLGGG LSFP+Q+NA++FSYCLV
Sbjct: 236 GEFATETVTLGTAAVENVAIGCGHNNEGLFVGAAGLLGLGGGKLSFPAQVNATSFSYCLV 295

Query: 61  DRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDE 120
           +RDSD+ STLEF+S LP N VTAPL RN ELDTFYYLGL GISVGG+ LPI E+ F++D 
Sbjct: 296 NRDSDAVSTLEFNSPLPRNVVTAPLRRNPELDTFYYLGLKGISVGGEALPIPESIFEVDA 355

Query: 121 SGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVP 180
            G GGII+DSGTAVTRL++E Y+ALRDAFV+G + +   +GV+LFDTCYD SSR SV+VP
Sbjct: 356 IGGGGIIIDSGTAVTRLRSEVYDALRDAFVKGAKGIPKANGVSLFDTCYDLSSRESVQVP 415

Query: 181 TVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNS 240
           TVSFHFPEG+ LPLPA+NYLIPVDS GTFCFAFAPT+SSLSI+GNVQQQGTRV F++ NS
Sbjct: 416 TVSFHFPEGRELPLPARNYLIPVDSVGTFCFAFAPTTSSLSIMGNVQQQGTRVGFDIANS 475

Query: 241 LIGFTPNKC 249
           L+GF+ + C
Sbjct: 476 LVGFSADSC 484


>gi|118484458|gb|ABK94105.1| unknown [Populus trichocarpa]
          Length = 499

 Score =  328 bits (840), Expect = 2e-87,   Method: Compositional matrix adjust.
 Identities = 171/251 (68%), Positives = 212/251 (84%), Gaps = 2/251 (0%)

Query: 1   GDFVTETVTLG-SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
           GDF TE+V+ G S SV N+A+GCGH+NEGLFVGAAGLLGLGGG LS  +Q+ A++FSYCL
Sbjct: 248 GDFATESVSFGNSGSVKNVALGCGHDNEGLFVGAAGLLGLGGGPLSLTNQLKATSFSYCL 307

Query: 60  VDRDSDSTSTLEFDSS-LPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKI 118
           V+RDS  +STL+F+S+ L  ++VTAPL++N ++DTFYY+GL+G+SVGG ++ I E+ F++
Sbjct: 308 VNRDSAGSSTLDFNSAQLGVDSVTAPLMKNRKIDTFYYVGLSGMSVGGQMVSIPESTFRL 367

Query: 119 DESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVE 178
           DESGNGGIIVD GTA+TRLQT+ YN LRDAFVR T+ L  T  VALFDTCYD S ++SV 
Sbjct: 368 DESGNGGIIVDCGTAITRLQTQAYNPLRDAFVRMTQNLKLTSAVALFDTCYDLSGQASVR 427

Query: 179 VPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLR 238
           VPTVSFHF +GK   LPA NYLIPVDS GT+CFAFAPT+SSLSIIGNVQQQGTRV+F+L 
Sbjct: 428 VPTVSFHFADGKSWNLPAANYLIPVDSAGTYCFAFAPTTSSLSIIGNVQQQGTRVTFDLA 487

Query: 239 NSLIGFTPNKC 249
           N+ +GF+PNKC
Sbjct: 488 NNRMGFSPNKC 498


>gi|224126221|ref|XP_002329620.1| predicted protein [Populus trichocarpa]
 gi|222870359|gb|EEF07490.1| predicted protein [Populus trichocarpa]
          Length = 357

 Score =  327 bits (838), Expect = 3e-87,   Method: Compositional matrix adjust.
 Identities = 171/251 (68%), Positives = 212/251 (84%), Gaps = 2/251 (0%)

Query: 1   GDFVTETVTLG-SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
           GDF TE+V+ G S SV N+A+GCGH+NEGLFVGAAGLLGLGGG LS  +Q+ A++FSYCL
Sbjct: 107 GDFATESVSFGNSGSVKNVALGCGHDNEGLFVGAAGLLGLGGGPLSLTNQLKATSFSYCL 166

Query: 60  VDRDSDSTSTLEFDSS-LPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKI 118
           V+RDS  +STL+F+S+ L  ++VTAPL++N ++DTFYY+GL+G+SVGG ++ I E+ F++
Sbjct: 167 VNRDSAGSSTLDFNSAQLGVDSVTAPLMKNRKIDTFYYVGLSGMSVGGQMVSIPESTFRL 226

Query: 119 DESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVE 178
           DESGNGGIIVD GTA+TRLQT+ YN LRDAFVR T+ L  T  VALFDTCYD S ++SV 
Sbjct: 227 DESGNGGIIVDCGTAITRLQTQAYNPLRDAFVRMTQNLKLTSAVALFDTCYDLSGQASVR 286

Query: 179 VPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLR 238
           VPTVSFHF +GK   LPA NYLIPVDS GT+CFAFAPT+SSLSIIGNVQQQGTRV+F+L 
Sbjct: 287 VPTVSFHFADGKSWNLPAANYLIPVDSAGTYCFAFAPTTSSLSIIGNVQQQGTRVTFDLA 346

Query: 239 NSLIGFTPNKC 249
           N+ +GF+PNKC
Sbjct: 347 NNRMGFSPNKC 357


>gi|356531224|ref|XP_003534178.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 492

 Score =  327 bits (837), Expect = 3e-87,   Method: Compositional matrix adjust.
 Identities = 167/249 (67%), Positives = 207/249 (83%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
           G++VTETV+ G+ SV+ +AIGCGH+NEGLFVG+AGLLGLGGG LS  SQI A++FSYCLV
Sbjct: 244 GEYVTETVSFGAGSVNRVAIGCGHDNEGLFVGSAGLLGLGGGPLSLTSQIKATSFSYCLV 303

Query: 61  DRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDE 120
           DRDS  +STLEF+S  P ++V APLL+N +++TFYY+ LTG+SVGG+++ +    F +D+
Sbjct: 304 DRDSGKSSTLEFNSPRPGDSVVAPLLKNQKVNTFYYVELTGVSVGGEIVTVPPETFAVDQ 363

Query: 121 SGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVP 180
           SG GG+IVDSGTA+TRL+T+ YN++RDAF R T  L P +GVALFDTCYD SS  SV VP
Sbjct: 364 SGAGGVIVDSGTAITRLRTQAYNSVRDAFKRKTSNLRPAEGVALFDTCYDLSSLQSVRVP 423

Query: 181 TVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNS 240
           TVSFHF   +   LPAKNYLIPVD  GT+CFAFAPT+SS+SIIGNVQQQGTRVSF+L NS
Sbjct: 424 TVSFHFSGDRAWALPAKNYLIPVDGAGTYCFAFAPTTSSMSIIGNVQQQGTRVSFDLANS 483

Query: 241 LIGFTPNKC 249
           L+GF+PNKC
Sbjct: 484 LVGFSPNKC 492


>gi|116786826|gb|ABK24255.1| unknown [Picea sitchensis]
 gi|148906052|gb|ABR16185.1| unknown [Picea sitchensis]
          Length = 485

 Score =  325 bits (834), Expect = 8e-87,   Method: Compositional matrix adjust.
 Identities = 176/253 (69%), Positives = 207/253 (81%), Gaps = 4/253 (1%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQI---NASTFSY 57
           G+F TET+TLG A + N+AIGCGH+NEGLFVGAAGLLGLGGGSLSFPSQ+   N   FSY
Sbjct: 233 GNFATETLTLGGAPLQNVAIGCGHDNEGLFVGAAGLLGLGGGSLSFPSQLTDENGKIFSY 292

Query: 58  CLVDRDSDSTSTLEFDSSLPPN-AVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
           CLVDRDS+S+STL+F  +  PN AV AP+L+N  LDTFYY+ L+GISVGG +L IS++ F
Sbjct: 293 CLVDRDSESSSTLQFGRAAVPNGAVLAPMLKNSRLDTFYYVSLSGISVGGKMLSISDSVF 352

Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSS 176
            ID SGNGG+IVDSGTAVTRLQT  Y++LRDAF  GT+ L  TDGV+LFDTCYD SS+ S
Sbjct: 353 GIDASGNGGVIVDSGTAVTRLQTAAYDSLRDAFRAGTKNLPSTDGVSLFDTCYDLSSKES 412

Query: 177 VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFN 236
           V+VPTV FHF  G  + LPAKNYL+PVDS GTFCFAFAPTSSSLSI+GN+QQQG RVSF+
Sbjct: 413 VDVPTVVFHFSGGGSMSLPAKNYLVPVDSMGTFCFAFAPTSSSLSIVGNIQQQGIRVSFD 472

Query: 237 LRNSLIGFTPNKC 249
             N+ +GF  NKC
Sbjct: 473 RANNQVGFAVNKC 485


>gi|255541796|ref|XP_002511962.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223549142|gb|EEF50631.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 495

 Score =  321 bits (823), Expect = 2e-85,   Method: Compositional matrix adjust.
 Identities = 169/250 (67%), Positives = 213/250 (85%), Gaps = 1/250 (0%)

Query: 1   GDFVTETVTLG-SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
           GDFVTET++ G S +V++IA+GCGH+NEGLFVGAAGLLGLGGG LS  SQ+ A++FSYCL
Sbjct: 246 GDFVTETMSFGGSGTVNSIALGCGHDNEGLFVGAAGLLGLGGGPLSLTSQLKATSFSYCL 305

Query: 60  VDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKID 119
           V+RDS ++STL+F+S+   ++V APLL++ ++DTFYY+GL+G+SVGG+LL I +  FK+D
Sbjct: 306 VNRDSAASSTLDFNSAPVGDSVIAPLLKSSKIDTFYYVGLSGMSVGGELLRIPQEVFKLD 365

Query: 120 ESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEV 179
           +SG+GG+IVD GTA+TRLQ+E YN+LRD+FV  +R L  T GVALFDTCYD S +SSV+V
Sbjct: 366 DSGDGGVIVDCGTAITRLQSEAYNSLRDSFVSMSRHLRSTSGVALFDTCYDLSGQSSVKV 425

Query: 180 PTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRN 239
           PTVSFHF  GK   LPA NYLIPVDS GT+CFAFAPT+SSLSIIGNVQQQGTRVSF+L N
Sbjct: 426 PTVSFHFDGGKSWDLPAANYLIPVDSAGTYCFAFAPTTSSLSIIGNVQQQGTRVSFDLAN 485

Query: 240 SLIGFTPNKC 249
           + +GF+ NKC
Sbjct: 486 NRVGFSTNKC 495


>gi|225455876|ref|XP_002275164.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
          Length = 496

 Score =  318 bits (815), Expect = 1e-84,   Method: Compositional matrix adjust.
 Identities = 169/250 (67%), Positives = 204/250 (81%), Gaps = 1/250 (0%)

Query: 1   GDFVTETVTLG-SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
           GDF TETV+ G S SVD +AIGCGH+NEGLFVGAAGL+GLGGG LS  SQI AS+FSYCL
Sbjct: 247 GDFATETVSFGNSGSVDKVAIGCGHDNEGLFVGAAGLIGLGGGPLSLTSQIKASSFSYCL 306

Query: 60  VDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKID 119
           V+RDS  +STLEF+S+ P ++VTAP+ +N ++DTFYY+G+TG+SVGG+ L I  + F++D
Sbjct: 307 VNRDSVDSSTLEFNSAKPSDSVTAPIFKNSKVDTFYYVGITGMSVGGEKLAIPPSIFEVD 366

Query: 120 ESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEV 179
            SG GGIIVD GTAVTRLQT+ YNALRD FV+ T+ L  T G ALFDTCY+ SSR+SV V
Sbjct: 367 GSGKGGIIVDCGTAVTRLQTQAYNALRDTFVKLTKDLPSTSGFALFDTCYNLSSRTSVRV 426

Query: 180 PTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRN 239
           PTV+F F  GK LPLP  NYLIPVDS GTFC AFAPT++SLSIIGNVQQQGTRV+++L N
Sbjct: 427 PTVAFLFDGGKSLPLPPSNYLIPVDSAGTFCLAFAPTTASLSIIGNVQQQGTRVTYDLAN 486

Query: 240 SLIGFTPNKC 249
           S + F+  KC
Sbjct: 487 SQVSFSSRKC 496


>gi|115434442|ref|NP_001041979.1| Os01g0140100 [Oryza sativa Japonica Group]
 gi|12328547|dbj|BAB21205.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
 gi|113531510|dbj|BAF03893.1| Os01g0140100 [Oryza sativa Japonica Group]
 gi|125568961|gb|EAZ10476.1| hypothetical protein OsJ_00309 [Oryza sativa Japonica Group]
 gi|215697206|dbj|BAG91200.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 504

 Score =  313 bits (802), Expect = 4e-83,   Method: Compositional matrix adjust.
 Identities = 170/250 (68%), Positives = 201/250 (80%), Gaps = 2/250 (0%)

Query: 1   GDFVTETVTLG-SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
           GDF TET+TLG SA V ++AIGCGH+NEGLFVGAAGLL LGGG LSFPSQI+A+TFSYCL
Sbjct: 256 GDFATETLTLGDSAPVSSVAIGCGHDNEGLFVGAAGLLALGGGPLSFPSQISATTFSYCL 315

Query: 60  VDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKID 119
           VDRDS S+STL+F  +     VTAPL+R+    TFYY+GL+G+SVGG +L I  +AF +D
Sbjct: 316 VDRDSPSSSTLQFGDAADAE-VTAPLIRSPRTSTFYYVGLSGLSVGGQILSIPPSAFAMD 374

Query: 120 ESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEV 179
            +G GG+IVDSGTAVTRLQ+  Y ALRDAFVRGT++L  T GV+LFDTCYD S R+SVEV
Sbjct: 375 STGAGGVIVDSGTAVTRLQSSAYAALRDAFVRGTQSLPRTSGVSLFDTCYDLSDRTSVEV 434

Query: 180 PTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRN 239
           P VS  F  G  L LPAKNYLIPVD  GT+C AFAPT++++SIIGNVQQQGTRVSF+   
Sbjct: 435 PAVSLRFAGGGELRLPAKNYLIPVDGAGTYCLAFAPTNAAVSIIGNVQQQGTRVSFDTAK 494

Query: 240 SLIGFTPNKC 249
           S +GFT NKC
Sbjct: 495 STVGFTTNKC 504


>gi|449439383|ref|XP_004137465.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 491

 Score =  313 bits (801), Expect = 6e-83,   Method: Compositional matrix adjust.
 Identities = 172/250 (68%), Positives = 208/250 (83%), Gaps = 1/250 (0%)

Query: 1   GDFVTETVTLG-SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
           G+FV ET+T G S  ++N+A+GCGH+NEGLFVG+AGLLGLGGGSLS  SQ+ AS+FSYCL
Sbjct: 242 GEFVIETLTFGNSGMINNVAVGCGHDNEGLFVGSAGLLGLGGGSLSLTSQMKASSFSYCL 301

Query: 60  VDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKID 119
           VDRDS S+S LEF+S+ P ++V APLL++ ++DTFYY+GLTG+SVGG LL I    F++D
Sbjct: 302 VDRDSSSSSDLEFNSAAPSDSVNAPLLKSGKVDTFYYVGLTGMSVGGQLLSIPPNLFQMD 361

Query: 120 ESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEV 179
           +SG GGIIVDSGTA+TRLQT+ YN LRDAFV  T  L  T+G ALFDTCYD SS+S V +
Sbjct: 362 DSGYGGIIVDSGTAITRLQTQAYNTLRDAFVSRTPYLKKTNGFALFDTCYDLSSQSRVTI 421

Query: 180 PTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRN 239
           PTVSF F  GK L LP KNYLIPVDS GTFCFAFAPT+SSLSIIGNVQQQGTRV ++L N
Sbjct: 422 PTVSFEFAGGKSLQLPPKNYLIPVDSVGTFCFAFAPTTSSLSIIGNVQQQGTRVHYDLAN 481

Query: 240 SLIGFTPNKC 249
           S++GF+P+KC
Sbjct: 482 SVVGFSPHKC 491


>gi|125524353|gb|EAY72467.1| hypothetical protein OsI_00323 [Oryza sativa Indica Group]
          Length = 500

 Score =  312 bits (800), Expect = 6e-83,   Method: Compositional matrix adjust.
 Identities = 171/250 (68%), Positives = 201/250 (80%), Gaps = 2/250 (0%)

Query: 1   GDFVTETVTLG-SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
           GDF TET+TLG SA V ++AIGCGH+NEGLFVGAAGLL LGGG LSFPSQI+A+TFSYCL
Sbjct: 252 GDFATETLTLGDSAPVSSVAIGCGHDNEGLFVGAAGLLALGGGPLSFPSQISATTFSYCL 311

Query: 60  VDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKID 119
           VDRDS S+STL+F  +     VTAPL+R+    TFYY+GL+GISVGG +L I  +AF +D
Sbjct: 312 VDRDSPSSSTLQFGDAADAE-VTAPLIRSPRTSTFYYVGLSGISVGGQILSIPPSAFAMD 370

Query: 120 ESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEV 179
            +G GG+IVDSGTAVTRLQ+  Y ALRDAFVRGT++L  T GV+LFDTCYD S R+SVEV
Sbjct: 371 GTGAGGVIVDSGTAVTRLQSSAYAALRDAFVRGTQSLPRTSGVSLFDTCYDLSDRTSVEV 430

Query: 180 PTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRN 239
           P VS  F  G  L LPAKNYLIPVD  GT+C AFAPT++++SIIGNVQQQGTRVSF+   
Sbjct: 431 PAVSLRFAGGGELRLPAKNYLIPVDGAGTYCLAFAPTNAAVSIIGNVQQQGTRVSFDTAK 490

Query: 240 SLIGFTPNKC 249
           S +GFT NKC
Sbjct: 491 STVGFTSNKC 500


>gi|449486856|ref|XP_004157423.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 491

 Score =  312 bits (800), Expect = 7e-83,   Method: Compositional matrix adjust.
 Identities = 171/250 (68%), Positives = 208/250 (83%), Gaps = 1/250 (0%)

Query: 1   GDFVTETVTLG-SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
           G+FVTET+T G S  ++++A+GCGH+NEGLFVG+AGLLGLGGG LS  SQ+ AS+FSYCL
Sbjct: 242 GEFVTETLTFGNSGMINDVAVGCGHDNEGLFVGSAGLLGLGGGPLSLTSQMKASSFSYCL 301

Query: 60  VDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKID 119
           VDRDS S+S LEF+S+ P ++V APLL++ ++DTFYY+GLTG+SVGG LL I    F++D
Sbjct: 302 VDRDSSSSSDLEFNSAAPSDSVNAPLLKSGKVDTFYYVGLTGMSVGGQLLSIPPNLFQMD 361

Query: 120 ESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEV 179
           +SG GGIIVDSGTA+TRLQT+ YN LRDAFV  T  L  T+G ALFDTCYD SS+S V +
Sbjct: 362 DSGYGGIIVDSGTAITRLQTQAYNTLRDAFVSRTPYLKKTNGFALFDTCYDLSSQSRVTI 421

Query: 180 PTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRN 239
           PTVSF F  GK L LP KNYLIPVDS GTFCFAFAPT+SSLSIIGNVQQQGTRV ++L N
Sbjct: 422 PTVSFEFAGGKSLQLPPKNYLIPVDSVGTFCFAFAPTTSSLSIIGNVQQQGTRVHYDLAN 481

Query: 240 SLIGFTPNKC 249
           S++GF+P+KC
Sbjct: 482 SVVGFSPHKC 491


>gi|242056497|ref|XP_002457394.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
 gi|241929369|gb|EES02514.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
          Length = 509

 Score =  310 bits (794), Expect = 4e-82,   Method: Compositional matrix adjust.
 Identities = 171/252 (67%), Positives = 199/252 (78%), Gaps = 3/252 (1%)

Query: 1   GDFVTETVTLG-SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
           GDF TET+TLG S  V N+AIGCGH+NEGLFVGAAGLL LGGG LSFPSQI+ASTFSYCL
Sbjct: 258 GDFATETLTLGDSTPVTNVAIGCGHDNEGLFVGAAGLLALGGGPLSFPSQISASTFSYCL 317

Query: 60  VDRDSDSTSTLEFDSS-LPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKI 118
           VDRDS + STL+F +     + VTAPL+R+    TFYY+ L+GISVGG  L I  +AF +
Sbjct: 318 VDRDSPAASTLQFGADGAEADTVTAPLVRSPRTGTFYYVALSGISVGGQALSIPSSAFAM 377

Query: 119 DE-SGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSV 177
           D  SG+GG+IVDSGTAVTRLQ+  Y ALRDAFVRGT +L  T GV+LFDTCYD S R+SV
Sbjct: 378 DATSGSGGVIVDSGTAVTRLQSSAYAALRDAFVRGTPSLPRTSGVSLFDTCYDLSDRTSV 437

Query: 178 EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNL 237
           EVP VS  F  G  L LPAKNYLIPVD  GT+C AFAPT++++SIIGNVQQQGTRVSF+ 
Sbjct: 438 EVPAVSLRFEGGGALRLPAKNYLIPVDGAGTYCLAFAPTNAAVSIIGNVQQQGTRVSFDT 497

Query: 238 RNSLIGFTPNKC 249
              ++GFTPNKC
Sbjct: 498 AKGVVGFTPNKC 509


>gi|414876414|tpg|DAA53545.1| TPA: hypothetical protein ZEAMMB73_483039 [Zea mays]
          Length = 506

 Score =  306 bits (785), Expect = 4e-81,   Method: Compositional matrix adjust.
 Identities = 170/252 (67%), Positives = 197/252 (78%), Gaps = 3/252 (1%)

Query: 1   GDFVTETVTLG-SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
           GDF TET+TLG S  V N+AIGCGH+NEGLFVGAAGLL LGGG LSFPSQI+ASTFSYCL
Sbjct: 255 GDFATETLTLGDSTPVGNVAIGCGHDNEGLFVGAAGLLALGGGPLSFPSQISASTFSYCL 314

Query: 60  VDRDSDSTSTLEF-DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKI 118
           VDRDS + STL+F D +     VTAPL+R+    TFYY+ L+GISVGG  L I  +AF +
Sbjct: 315 VDRDSPAASTLQFGDGAAEAGTVTAPLVRSPRTSTFYYVALSGISVGGQPLSIPASAFAM 374

Query: 119 DE-SGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSV 177
           D  SG+GG+IVDSGTAVTRLQ+  Y ALRDAFV+G  +L  T GV+LFDTCYD S R+SV
Sbjct: 375 DATSGSGGVIVDSGTAVTRLQSAAYAALRDAFVQGAPSLPRTSGVSLFDTCYDLSDRTSV 434

Query: 178 EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNL 237
           EVP VS  F  G  L LPAKNYLIPVD  GT+C AFAPT++++SIIGNVQQQGTRVSF+ 
Sbjct: 435 EVPAVSLRFEGGGALRLPAKNYLIPVDGAGTYCLAFAPTNAAVSIIGNVQQQGTRVSFDT 494

Query: 238 RNSLIGFTPNKC 249
               +GFTPNKC
Sbjct: 495 ARGAVGFTPNKC 506


>gi|293336306|ref|NP_001168599.1| uncharacterized protein LOC100382383 [Zea mays]
 gi|223949441|gb|ACN28804.1| unknown [Zea mays]
          Length = 326

 Score =  306 bits (784), Expect = 5e-81,   Method: Compositional matrix adjust.
 Identities = 170/252 (67%), Positives = 197/252 (78%), Gaps = 3/252 (1%)

Query: 1   GDFVTETVTLG-SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
           GDF TET+TLG S  V N+AIGCGH+NEGLFVGAAGLL LGGG LSFPSQI+ASTFSYCL
Sbjct: 75  GDFATETLTLGDSTPVGNVAIGCGHDNEGLFVGAAGLLALGGGPLSFPSQISASTFSYCL 134

Query: 60  VDRDSDSTSTLEF-DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKI 118
           VDRDS + STL+F D +     VTAPL+R+    TFYY+ L+GISVGG  L I  +AF +
Sbjct: 135 VDRDSPAASTLQFGDGAAEAGTVTAPLVRSPRTSTFYYVALSGISVGGQPLSIPASAFAM 194

Query: 119 DE-SGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSV 177
           D  SG+GG+IVDSGTAVTRLQ+  Y ALRDAFV+G  +L  T GV+LFDTCYD S R+SV
Sbjct: 195 DATSGSGGVIVDSGTAVTRLQSAAYAALRDAFVQGAPSLPRTSGVSLFDTCYDLSDRTSV 254

Query: 178 EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNL 237
           EVP VS  F  G  L LPAKNYLIPVD  GT+C AFAPT++++SIIGNVQQQGTRVSF+ 
Sbjct: 255 EVPAVSLRFEGGGALRLPAKNYLIPVDGAGTYCLAFAPTNAAVSIIGNVQQQGTRVSFDT 314

Query: 238 RNSLIGFTPNKC 249
               +GFTPNKC
Sbjct: 315 ARGAVGFTPNKC 326


>gi|326517992|dbj|BAK07248.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 500

 Score =  306 bits (783), Expect = 7e-81,   Method: Compositional matrix adjust.
 Identities = 166/250 (66%), Positives = 201/250 (80%), Gaps = 2/250 (0%)

Query: 1   GDFVTETVTLG-SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
           GDF TET+TLG SA V N+AIGCGH+NEGLFVGAAGLL LGGG LSFPSQI+A+TFSYCL
Sbjct: 252 GDFATETLTLGDSAPVSNVAIGCGHDNEGLFVGAAGLLALGGGPLSFPSQISATTFSYCL 311

Query: 60  VDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKID 119
           VDRDS S+STL+F  S  P AVTAPL+R+   +TFYY+ L+GISVGG+ L I  +AF +D
Sbjct: 312 VDRDSPSSSTLQFGDSEQP-AVTAPLIRSPRTNTFYYVALSGISVGGEALSIPSSAFAMD 370

Query: 120 ESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEV 179
           ++G+GG+IVDSGTAVTRLQ+  Y ALR+AFV+GT++L    GV+LFDTCYD + RSSV+V
Sbjct: 371 DAGSGGVIVDSGTAVTRLQSGAYGALREAFVQGTQSLPRASGVSLFDTCYDLAGRSSVQV 430

Query: 180 PTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRN 239
           P V+  F  G  L LPAKNYLIPVD+ GT+C AFA TS  +SIIGNVQQQG RVSF+   
Sbjct: 431 PAVALWFEGGGELKLPAKNYLIPVDAAGTYCLAFAGTSGPVSIIGNVQQQGVRVSFDTAK 490

Query: 240 SLIGFTPNKC 249
           + +GFT +KC
Sbjct: 491 NTVGFTADKC 500


>gi|297834758|ref|XP_002885261.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297331101|gb|EFH61520.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 500

 Score =  300 bits (769), Expect = 3e-79,   Method: Compositional matrix adjust.
 Identities = 161/252 (63%), Positives = 203/252 (80%), Gaps = 3/252 (1%)

Query: 1   GDFVTETVTLG-SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
           G+  T+TVT G S  ++++A+GCGH+NEGLF GAAGLLGLGGG+LS  +Q+ A++FSYCL
Sbjct: 249 GELATDTVTFGNSGKINDVALGCGHDNEGLFTGAAGLLGLGGGALSITNQMKATSFSYCL 308

Query: 60  VDRDSDSTSTLEFDS-SLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKI 118
           VDRDS  +S+L+F+S  L     TAPLLRN ++DTFYY+GL+G SVGG  + + +  F +
Sbjct: 309 VDRDSGKSSSLDFNSVQLGSGDATAPLLRNQKIDTFYYVGLSGFSVGGQKVMMPDAIFDV 368

Query: 119 DESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSP-TDGVALFDTCYDFSSRSSV 177
           D SG+GG+I+D GTAVTRLQT+ YN+LRDAF++ T  L   T  ++LFDTCYDFSS SSV
Sbjct: 369 DASGSGGVILDCGTAVTRLQTQAYNSLRDAFLKLTTNLKKGTSSISLFDTCYDFSSLSSV 428

Query: 178 EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNL 237
           +VPTV+FHF  GK L LPAKNYLIPVD NGTFCFAFAPTSSSLSIIGNVQQQGTR++++L
Sbjct: 429 KVPTVAFHFTGGKSLDLPAKNYLIPVDDNGTFCFAFAPTSSSLSIIGNVQQQGTRITYDL 488

Query: 238 RNSLIGFTPNKC 249
            N +IG + NKC
Sbjct: 489 ANKIIGLSGNKC 500


>gi|357127293|ref|XP_003565317.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial
           [Brachypodium distachyon]
          Length = 540

 Score =  300 bits (768), Expect = 4e-79,   Method: Compositional matrix adjust.
 Identities = 166/253 (65%), Positives = 198/253 (78%), Gaps = 5/253 (1%)

Query: 1   GDFVTETVTLG---SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSY 57
           GDF TET+TLG   SA+V ++AIGCGH+NEGLFVGAAGLL LGGG LSFPSQI+A+ FSY
Sbjct: 289 GDFATETLTLGGDGSAAVHDVAIGCGHDNEGLFVGAAGLLALGGGPLSFPSQISATEFSY 348

Query: 58  CLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP-ISETAF 116
           CLVDRDS S STL+F +S   + VTAPL+R+   +TFYY+ L GISVGG+ L  I   AF
Sbjct: 349 CLVDRDSPSASTLQFGAS-DSSTVTAPLMRSPRSNTFYYVALNGISVGGETLSDIPPAAF 407

Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSS 176
            +DE G+GG+IVDSGTAVTRLQ+  Y+ALRDAFVRGT+AL    GV+LFDTCYD + RSS
Sbjct: 408 AMDEQGSGGVIVDSGTAVTRLQSSAYSALRDAFVRGTQALPRASGVSLFDTCYDLAGRSS 467

Query: 177 VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFN 236
           V+VP VS  F  G  L LPAKNYLIPVD  GT+C AFA T  ++SI+GNVQQQG RVSF+
Sbjct: 468 VQVPAVSLRFEGGGELKLPAKNYLIPVDGAGTYCLAFAATGGAVSIVGNVQQQGIRVSFD 527

Query: 237 LRNSLIGFTPNKC 249
              + +GF+PNKC
Sbjct: 528 TAKNTVGFSPNKC 540


>gi|312281631|dbj|BAJ33681.1| unnamed protein product [Thellungiella halophila]
          Length = 502

 Score =  297 bits (760), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 161/252 (63%), Positives = 202/252 (80%), Gaps = 3/252 (1%)

Query: 1   GDFVTETVTLG-SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
           G++ T+TVT G S  V+++A+GCGH+NEGLF GAAGLLGLGGG+LS  +QI A +FSYCL
Sbjct: 251 GNYATDTVTFGESGKVNDVALGCGHDNEGLFTGAAGLLGLGGGALSMTNQIKAKSFSYCL 310

Query: 60  VDRDSDSTSTLEFDS-SLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKI 118
           VDRDS  +S+L+F+S  +     TAPLLRN ++DTFYY+GL+G SVGG  + I  + F++
Sbjct: 311 VDRDSAKSSSLDFNSVQIGAGDATAPLLRNSKMDTFYYVGLSGFSVGGQQVSIPSSLFEV 370

Query: 119 DESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSP-TDGVALFDTCYDFSSRSSV 177
           D SG GG+I+D GTAVTRLQT+ YN+LRDAFV+ T      T  ++LFDTCYDFSS S+V
Sbjct: 371 DASGAGGVILDCGTAVTRLQTQAYNSLRDAFVKLTTDFKKGTSPISLFDTCYDFSSLSTV 430

Query: 178 EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNL 237
           +VPTV+FHF  GK L LPAKNYLIP+D  GTFCFAFAPTSSSLSIIGNVQQQGTR++++L
Sbjct: 431 KVPTVTFHFTGGKSLNLPAKNYLIPIDDAGTFCFAFAPTSSSLSIIGNVQQQGTRITYDL 490

Query: 238 RNSLIGFTPNKC 249
            N+LIG + NKC
Sbjct: 491 ANNLIGLSANKC 502


>gi|19424106|gb|AAL87345.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
           thaliana]
          Length = 500

 Score =  295 bits (755), Expect = 1e-77,   Method: Compositional matrix adjust.
 Identities = 158/252 (62%), Positives = 203/252 (80%), Gaps = 3/252 (1%)

Query: 1   GDFVTETVTLG-SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
           G+  T+TVT G S  ++N+A+GCGH+NEGLF GAAGLLGLGGG LS  +Q+ A++FSYCL
Sbjct: 249 GELATDTVTFGNSGKINNVALGCGHDNEGLFTGAAGLLGLGGGVLSITNQMKATSFSYCL 308

Query: 60  VDRDSDSTSTLEFDS-SLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKI 118
           VDRDS  +S+L+F+S  L     TAPLLRN ++DTFYY+GL+G SVGG+ + + +  F +
Sbjct: 309 VDRDSGKSSSLDFNSVQLGGGDATAPLLRNKKIDTFYYVGLSGFSVGGEKVVLPDAIFDV 368

Query: 119 DESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSP-TDGVALFDTCYDFSSRSSV 177
           D SG+GG+I+D GTAVTRLQT+ YN+LRDAF++ T  L   +  ++LFDTCYDFSS S+V
Sbjct: 369 DASGSGGVILDCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSSISLFDTCYDFSSLSTV 428

Query: 178 EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNL 237
           +VPTV+FHF  GK L LPAKNYLIPVD +GTFCFAFAPTSSSLSIIGNVQQQGTR++++L
Sbjct: 429 KVPTVAFHFTGGKSLDLPAKNYLIPVDDSGTFCFAFAPTSSSLSIIGNVQQQGTRITYDL 488

Query: 238 RNSLIGFTPNKC 249
             ++IG + NKC
Sbjct: 489 SKNVIGLSGNKC 500


>gi|15229656|ref|NP_188478.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|75273882|sp|Q9LS40.1|ASPG1_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 1;
           Short=AtASPG1; Flags: Precursor
 gi|11994113|dbj|BAB01116.1| CND41, chloroplast nucleoid DNA binding protein-like [Arabidopsis
           thaliana]
 gi|23297732|gb|AAN13013.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
           thaliana]
 gi|332642583|gb|AEE76104.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 500

 Score =  295 bits (755), Expect = 1e-77,   Method: Compositional matrix adjust.
 Identities = 158/252 (62%), Positives = 203/252 (80%), Gaps = 3/252 (1%)

Query: 1   GDFVTETVTLG-SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
           G+  T+TVT G S  ++N+A+GCGH+NEGLF GAAGLLGLGGG LS  +Q+ A++FSYCL
Sbjct: 249 GELATDTVTFGNSGKINNVALGCGHDNEGLFTGAAGLLGLGGGVLSITNQMKATSFSYCL 308

Query: 60  VDRDSDSTSTLEFDS-SLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKI 118
           VDRDS  +S+L+F+S  L     TAPLLRN ++DTFYY+GL+G SVGG+ + + +  F +
Sbjct: 309 VDRDSGKSSSLDFNSVQLGGGDATAPLLRNKKIDTFYYVGLSGFSVGGEKVVLPDAIFDV 368

Query: 119 DESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSP-TDGVALFDTCYDFSSRSSV 177
           D SG+GG+I+D GTAVTRLQT+ YN+LRDAF++ T  L   +  ++LFDTCYDFSS S+V
Sbjct: 369 DASGSGGVILDCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSSISLFDTCYDFSSLSTV 428

Query: 178 EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNL 237
           +VPTV+FHF  GK L LPAKNYLIPVD +GTFCFAFAPTSSSLSIIGNVQQQGTR++++L
Sbjct: 429 KVPTVAFHFTGGKSLDLPAKNYLIPVDDSGTFCFAFAPTSSSLSIIGNVQQQGTRITYDL 488

Query: 238 RNSLIGFTPNKC 249
             ++IG + NKC
Sbjct: 489 SKNVIGLSGNKC 500


>gi|449440933|ref|XP_004138238.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 487

 Score =  281 bits (718), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 147/250 (58%), Positives = 199/250 (79%), Gaps = 1/250 (0%)

Query: 1   GDFVTETVTLG-SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
           G+  TET++ G S S+ N+ IGCGH+NEGLF G AGL+GLGGG++S  SQ+ AS+FSYCL
Sbjct: 238 GELATETLSFGNSNSIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKASSFSYCL 297

Query: 60  VDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKID 119
           V+ DSDS+STLEF+S++P +++T+PL++N    ++ Y+ + GISVGG  LPIS T F+ID
Sbjct: 298 VNLDSDSSSTLEFNSNMPSDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEID 357

Query: 120 ESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEV 179
           ESG GGIIVDSGT ++RL ++ Y +LR+AFV+ T +LSP  G+++FDTCY+FS +S+VEV
Sbjct: 358 ESGLGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFSGQSNVEV 417

Query: 180 PTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRN 239
           PT++F   EG  L LPA+NYLI +D+ GT+C AF  T SSLSIIG+ QQQG RVS++L N
Sbjct: 418 PTIAFVLSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDLTN 477

Query: 240 SLIGFTPNKC 249
           SL+GF+ NKC
Sbjct: 478 SLVGFSTNKC 487


>gi|116787367|gb|ABK24480.1| unknown [Picea sitchensis]
          Length = 496

 Score =  280 bits (717), Expect = 3e-73,   Method: Compositional matrix adjust.
 Identities = 155/255 (60%), Positives = 191/255 (74%), Gaps = 6/255 (2%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAST---FSY 57
           G + TET+T G+ S+ N+AIGCGH+N GLFVGAAGLLGLG GSLSFP+Q+   T   FSY
Sbjct: 241 GSYATETLTFGTTSIQNVAIGCGHDNVGLFVGAAGLLGLGAGSLSFPAQLGTQTGRAFSY 300

Query: 58  CLVDRDSDSTSTLEFD-SSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLL-PISETA 115
           CLVDRDS+S+ TLEF   S+P  ++  PL+ N  L TFYYL +  ISVGG +L  +   A
Sbjct: 301 CLVDRDSESSGTLEFGPESVPIGSIFTPLVANPFLPTFYYLSMVAISVGGVILDSVPSEA 360

Query: 116 FKIDES-GNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSR 174
           F+IDE+ G GGII+DSGTAVTRLQT  Y+ALRDAF+ GT+ L   DG+++FDTCYD S+ 
Sbjct: 361 FRIDETTGRGGIIIDSGTAVTRLQTSAYDALRDAFIAGTQHLPRADGISIFDTCYDLSAL 420

Query: 175 SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVS 234
            SV +P V FHF  G    LPAKN LIP+DS GTFCFAFAP  S+LSI+GN+QQQG RVS
Sbjct: 421 QSVSIPAVGFHFSNGAGFILPAKNCLIPMDSMGTFCFAFAPADSNLSIMGNIQQQGIRVS 480

Query: 235 FNLRNSLIGFTPNKC 249
           F+  NSL+GF  ++C
Sbjct: 481 FDSANSLVGFAIDQC 495


>gi|148908573|gb|ABR17396.1| unknown [Picea sitchensis]
          Length = 350

 Score =  279 bits (714), Expect = 6e-73,   Method: Compositional matrix adjust.
 Identities = 155/255 (60%), Positives = 191/255 (74%), Gaps = 6/255 (2%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAST---FSY 57
           G + TET+T G+ S+ N+AIGCGH+N GLFVGAAGLLGLG GSLSFP+Q+   T   FSY
Sbjct: 95  GSYATETLTFGTTSIQNVAIGCGHDNVGLFVGAAGLLGLGAGSLSFPAQLGTQTGRAFSY 154

Query: 58  CLVDRDSDSTSTLEFD-SSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLL-PISETA 115
           CLVDRDS+S+ TLEF   S+P  ++  PL+ N  L TFYYL +  ISVGG +L  +   A
Sbjct: 155 CLVDRDSESSGTLEFGPESVPIGSIFTPLVANPFLPTFYYLSMVAISVGGVILDSVPSEA 214

Query: 116 FKIDES-GNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSR 174
           F+IDE+ G GGII+DSGTAVTRLQT  Y+ALRDAF+ GT+ L   DG+++FDTCYD S+ 
Sbjct: 215 FRIDETTGRGGIIIDSGTAVTRLQTSAYDALRDAFIAGTQHLPRADGISIFDTCYDLSAL 274

Query: 175 SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVS 234
            SV +P V FHF  G    LPAKN LIP+DS GTFCFAFAP  S+LSI+GN+QQQG RVS
Sbjct: 275 QSVSIPAVGFHFSNGAGFILPAKNCLIPMDSMGTFCFAFAPADSNLSIMGNIQQQGIRVS 334

Query: 235 FNLRNSLIGFTPNKC 249
           F+  NSL+GF  ++C
Sbjct: 335 FDSANSLVGFAIDQC 349


>gi|449527149|ref|XP_004170575.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 487

 Score =  279 bits (714), Expect = 6e-73,   Method: Compositional matrix adjust.
 Identities = 146/250 (58%), Positives = 198/250 (79%), Gaps = 1/250 (0%)

Query: 1   GDFVTETVTLG-SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
           G+  TET++ G S S+ N+ IGCGH+NEGLF G AGL+GLGGG++S  SQ+ AS+FSYCL
Sbjct: 238 GELATETLSFGNSNSIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLKASSFSYCL 297

Query: 60  VDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKID 119
           V+ DSDS+STLEF+S +P +++T+PL++N    ++ Y+ + GISVGG  LPIS T F+ID
Sbjct: 298 VNLDSDSSSTLEFNSYMPSDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEID 357

Query: 120 ESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEV 179
           ESG GGIIVDSGT ++RL ++ Y +LR+AFV+ T +LSP  G+++FDTCY+FS +S+VEV
Sbjct: 358 ESGLGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFSGQSNVEV 417

Query: 180 PTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRN 239
           PT++F   EG  L LPA+NYLI +D+ GT+C AF  T SSLSIIG+ QQQG RVS++L N
Sbjct: 418 PTIAFVLSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDLTN 477

Query: 240 SLIGFTPNKC 249
           S++GF+ NKC
Sbjct: 478 SIVGFSTNKC 487


>gi|148906646|gb|ABR16474.1| unknown [Picea sitchensis]
          Length = 538

 Score =  275 bits (702), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 159/255 (62%), Positives = 185/255 (72%), Gaps = 6/255 (2%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAST---FSY 57
           G F TE +T G+ SV N+AIGCGH+N GLFVGAAGLLGLG G LSFPSQ+   T   FSY
Sbjct: 284 GSFATEMLTFGTTSVRNVAIGCGHDNAGLFVGAAGLLGLGAGLLSFPSQLGTQTGRAFSY 343

Query: 58  CLVDRDSDSTSTLEF-DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLL-PISETA 115
           CLVDR S+S+ TLEF   S+P  ++  PLL N  L TFYY+ L  ISVGG LL  +    
Sbjct: 344 CLVDRFSESSGTLEFGPESVPLGSILTPLLTNPSLPTFYYVPLISISVGGALLDSVPPDV 403

Query: 116 FKIDE-SGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSR 174
           F+IDE SG GG IVDSGTAVTRLQT  Y+A+RDAFV GTR L   +GV++FDTCYD S  
Sbjct: 404 FRIDETSGRGGFIVDSGTAVTRLQTPVYDAVRDAFVAGTRQLPKAEGVSIFDTCYDLSGL 463

Query: 175 SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVS 234
             V VPTV FHF  G  L LPAKNY+IP+D  GTFCFAFAP +S LSI+GN+QQQG RVS
Sbjct: 464 PLVNVPTVVFHFSNGASLILPAKNYMIPMDFMGTFCFAFAPATSDLSIMGNIQQQGIRVS 523

Query: 235 FNLRNSLIGFTPNKC 249
           F+  NSL+GF   +C
Sbjct: 524 FDTANSLVGFALRQC 538


>gi|449440931|ref|XP_004138237.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 523

 Score =  271 bits (692), Expect = 3e-70,   Method: Compositional matrix adjust.
 Identities = 146/250 (58%), Positives = 193/250 (77%), Gaps = 1/250 (0%)

Query: 1   GDFVTETVTL-GSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
           G+  TET +   S S+ N+ IGCGH+NEGLFVGA GL+GLGGG++S  SQ+ A++FSYCL
Sbjct: 274 GELATETFSFRHSNSIPNLPIGCGHDNEGLFVGADGLIGLGGGAISLSSQLEATSFSYCL 333

Query: 60  VDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKID 119
           VD DS+S+STL+F++  P +++T+PL++N    TF Y+ + G+SVGG  LPIS ++F+ID
Sbjct: 334 VDLDSESSSTLDFNADQPSDSLTSPLVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFEID 393

Query: 120 ESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEV 179
           ESG+GGIIVDSGT +T + ++ Y+ LRDAFV  T+ L P  GV+ FDTCYD SS+S+VEV
Sbjct: 394 ESGSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPAPGVSPFDTCYDLSSQSNVEV 453

Query: 180 PTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRN 239
           PT++F  P    L LPAKN LI VDS GTFC AF P++  LSIIGNVQQQG RVS++L N
Sbjct: 454 PTIAFILPGENSLQLPAKNCLIQVDSAGTFCLAFLPSTFPLSIIGNVQQQGIRVSYDLAN 513

Query: 240 SLIGFTPNKC 249
           SL+GF+ +KC
Sbjct: 514 SLVGFSTDKC 523


>gi|449527151|ref|XP_004170576.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 523

 Score =  269 bits (687), Expect = 8e-70,   Method: Compositional matrix adjust.
 Identities = 146/250 (58%), Positives = 193/250 (77%), Gaps = 1/250 (0%)

Query: 1   GDFVTETVTL-GSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
           G+  TET +   S S+ N+ IGCGH+NEGLFVGAAGL+GLGGG++S  SQ+ A++FSYCL
Sbjct: 274 GELATETFSFRHSNSIPNLPIGCGHDNEGLFVGAAGLIGLGGGAISLSSQLEATSFSYCL 333

Query: 60  VDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKID 119
           VD DS+S+STL+F++  P +++T+PL++N    TF Y+ + G+SVGG  LPIS ++F+ID
Sbjct: 334 VDLDSESSSTLDFNADQPSDSLTSPLVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFEID 393

Query: 120 ESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEV 179
           ESG+GGIIVDSGT +T + ++ Y+ LRDAFV  T+ L P  GV+ FDTCYD SS+S+VEV
Sbjct: 394 ESGSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPAPGVSPFDTCYDLSSQSNVEV 453

Query: 180 PTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRN 239
           PT++F  P    L LPAKN L  VDS GTFC AF P++  LSIIGNVQQQG RVS++L N
Sbjct: 454 PTIAFILPGENSLQLPAKNCLFQVDSAGTFCLAFLPSTFPLSIIGNVQQQGIRVSYDLAN 513

Query: 240 SLIGFTPNKC 249
           SL+GF+ +KC
Sbjct: 514 SLVGFSTDKC 523


>gi|148907930|gb|ABR17085.1| unknown [Picea sitchensis]
          Length = 498

 Score =  269 bits (687), Expect = 9e-70,   Method: Compositional matrix adjust.
 Identities = 156/255 (61%), Positives = 186/255 (72%), Gaps = 6/255 (2%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQI---NASTFSY 57
           G F TET+T G+ SV N+AIGCGH N GLF+GAAGLLGLG G+LSFP+QI      TFSY
Sbjct: 244 GSFATETLTFGTTSVANVAIGCGHKNVGLFIGAAGLLGLGAGALSFPNQIGTQTGHTFSY 303

Query: 58  CLVDRDSDSTSTLEF-DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLL-PISETA 115
           CLVDR+SDS+  L+F   S+P  ++  PL +N  L TFYYL +T ISVGG LL  I    
Sbjct: 304 CLVDRESDSSGPLQFGPKSVPVGSIFTPLEKNPHLPTFYYLSVTAISVGGALLDSIPPEV 363

Query: 116 FKIDE-SGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSR 174
           F+IDE SG+GG I+DSGT VTRL T  Y+A+RDAFV GT  L  TD V++FDTCYD S  
Sbjct: 364 FRIDETSGHGGFIIDSGTVVTRLVTSAYDAVRDAFVAGTGQLPRTDAVSIFDTCYDLSGL 423

Query: 175 SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVS 234
             V VPTV FHF  G  L LPAKNYLIP+D+ GTFCFAFAP +SS+SI+GN QQQ  RVS
Sbjct: 424 QFVSVPTVGFHFSNGASLILPAKNYLIPMDTVGTFCFAFAPAASSVSIMGNTQQQHIRVS 483

Query: 235 FNLRNSLIGFTPNKC 249
           F+  NSL+GF  ++C
Sbjct: 484 FDSANSLVGFAFDQC 498


>gi|358346726|ref|XP_003637416.1| Aspartic proteinase nepenthesin-2, partial [Medicago truncatula]
 gi|355503351|gb|AES84554.1| Aspartic proteinase nepenthesin-2, partial [Medicago truncatula]
          Length = 165

 Score =  261 bits (668), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 123/165 (74%), Positives = 147/165 (89%)

Query: 85  LLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNA 144
           L RN +LDT+YY+GL GISVGG+LL I ET+F++D +GNGGIIVDSGTAVTRLQ++ YN 
Sbjct: 1   LRRNPQLDTYYYVGLVGISVGGELLAIPETSFEVDSAGNGGIIVDSGTAVTRLQSDVYNV 60

Query: 145 LRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVD 204
           +RDAFV+GT+ L  T+ V+LFDTCYD SS++SVEVPTV+FHF EGKVL LPAKNYL+PVD
Sbjct: 61  VRDAFVKGTKDLLATNEVSLFDTCYDLSSKTSVEVPTVAFHFGEGKVLVLPAKNYLVPVD 120

Query: 205 SNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           S GTFCFAFAPT SSLSIIGN+QQQGTRVSF+L NSL+GF+PN+C
Sbjct: 121 SVGTFCFAFAPTMSSLSIIGNIQQQGTRVSFDLANSLVGFSPNRC 165


>gi|449441139|ref|XP_004138341.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
 gi|449477464|ref|XP_004155031.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 336

 Score =  256 bits (653), Expect = 7e-66,   Method: Compositional matrix adjust.
 Identities = 143/250 (57%), Positives = 186/250 (74%), Gaps = 1/250 (0%)

Query: 1   GDFVTETVT-LGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
           G+  TET+T + S S+ NI+IGCGH+NEGLFVGA GL+GLGGG++S  SQ+ AS+FSYCL
Sbjct: 87  GELATETLTFVHSNSIPNISIGCGHDNEGLFVGADGLIGLGGGAISISSQLKASSFSYCL 146

Query: 60  VDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKID 119
           VD DS S STL+F++  P +++ +PL++N    +F Y+ + G+SVGG  LPIS + F+ID
Sbjct: 147 VDIDSPSFSTLDFNTDPPSDSLISPLVKNDRFPSFRYVKVIGMSVGGKPLPISSSRFEID 206

Query: 120 ESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEV 179
           ESG GGIIVDSGT +T+L ++ Y  LR+AF+  T  L P   ++ FDTCYD SS+S+VEV
Sbjct: 207 ESGLGGIIVDSGTTITQLPSDVYEVLREAFLGLTTNLPPAPEISPFDTCYDLSSQSNVEV 266

Query: 180 PTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRN 239
           PT++F  P    L LPAKN LI VDS GTFC AF   +  LSIIGN QQQG RVS++L N
Sbjct: 267 PTIAFILPGENSLQLPAKNCLIQVDSAGTFCLAFVSATFPLSIIGNFQQQGIRVSYDLTN 326

Query: 240 SLIGFTPNKC 249
           SL+GF+ NKC
Sbjct: 327 SLVGFSTNKC 336


>gi|356575389|ref|XP_003555824.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 473

 Score =  254 bits (649), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 135/253 (53%), Positives = 173/253 (68%), Gaps = 4/253 (1%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
           G    ET+T G   + N+AIGCGH+N+G+FVGAAGLLGLG G +SF  Q+      TFSY
Sbjct: 221 GTLALETLTFGRTLIRNVAIGCGHHNQGMFVGAAGLLGLGSGPMSFVGQLGGQAGGTFSY 280

Query: 58  CLVDRDSDSTSTLEF-DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
           CLV R   S+  L+F   ++P  A   PL+ N    +FYY+GL+G+ VGG  +PISE  F
Sbjct: 281 CLVSRGIQSSGLLQFGREAVPVGAAWVPLIHNPRAQSFYYVGLSGLGVGGLRVPISEDVF 340

Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSS 176
           K+ E G+GG+++D+GTAVTRL T  Y A RDAF+  T  L    GV++FDTCYD     S
Sbjct: 341 KLSELGDGGVVMDTGTAVTRLPTAAYEAFRDAFIAQTTNLPRASGVSIFDTCYDLFGFVS 400

Query: 177 VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFN 236
           V VPTVSF+F  G +L LPA+N+LIPVD  G+FCFAFAP+SS LSIIGN+QQ+G  +S +
Sbjct: 401 VRVPTVSFYFSGGPILTLPARNFLIPVDDVGSFCFAFAPSSSGLSIIGNIQQEGIEISVD 460

Query: 237 LRNSLIGFTPNKC 249
             N  +GF PN C
Sbjct: 461 GANGFVGFGPNVC 473


>gi|225463766|ref|XP_002267930.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 479

 Score =  253 bits (645), Expect = 6e-65,   Method: Compositional matrix adjust.
 Identities = 139/253 (54%), Positives = 177/253 (69%), Gaps = 4/253 (1%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAST---FSY 57
           G    ET+T G   V ++AIGCGH N G+FVGAAGLLGLGGGS+SF  Q+   T   FSY
Sbjct: 227 GTLALETLTFGRTMVRSVAIGCGHRNRGMFVGAAGLLGLGGGSMSFVGQLGGQTGGAFSY 286

Query: 58  CLVDRDSDSTSTLEF-DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
           CLV R +DS+ +L F   +LP  A   PL+RN    +FYY+GL G+ VGG  +PISE  F
Sbjct: 287 CLVSRGTDSSGSLVFGREALPAGAAWVPLVRNPRAPSFYYIGLAGLGVGGIRVPISEEVF 346

Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSS 176
           ++ E G+GG+++D+GTAVTRL T  Y A RDAF+  T  L    GVA+FDTCYD     S
Sbjct: 347 RLTELGDGGVVMDTGTAVTRLPTLAYQAFRDAFLAQTANLPRATGVAIFDTCYDLLGFVS 406

Query: 177 VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFN 236
           V VPTVSF+F  G +L LPA+N+LIP+D  GTFCFAFAP++S LSI+GN+QQ+G ++SF+
Sbjct: 407 VRVPTVSFYFSGGPILTLPARNFLIPMDDAGTFCFAFAPSTSGLSILGNIQQEGIQISFD 466

Query: 237 LRNSLIGFTPNKC 249
             N  +GF PN C
Sbjct: 467 GANGYVGFGPNIC 479


>gi|302776610|ref|XP_002971459.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
 gi|300160591|gb|EFJ27208.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
          Length = 357

 Score =  251 bits (642), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 142/255 (55%), Positives = 182/255 (71%), Gaps = 6/255 (2%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
           GD  +++ ++       +  GCGH+NEGLFVGAAGLLGLG G LSFPSQ+++  FSYCLV
Sbjct: 103 GDLASDSFSVSRGRTSPVVFGCGHDNEGLFVGAAGLLGLGAGKLSFPSQLSSRKFSYCLV 162

Query: 61  DRDS--DSTSTLEF-DSSLPPNAVTA--PLLRNHELDTFYYLGLTGISVGGDLLPISETA 115
            RD+   ++S L F DS+LP +A  A   LL+N +LDTFYY GL+GIS+GG LL I  TA
Sbjct: 163 SRDNGVRASSALLFGDSALPTSASFAYTQLLKNPKLDTFYYAGLSGISIGGTLLSIPSTA 222

Query: 116 FKIDES-GNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSR 174
           FK+  S G GG+I+DSGT+VTRL T  Y  +RDAF   T+ L      +LFDTCYDFS+ 
Sbjct: 223 FKLSSSTGRGGVIIDSGTSVTRLPTYAYTVMRDAFRSATQKLPRAADFSLFDTCYDFSAL 282

Query: 175 SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVS 234
           +SV +PTVSFHF  G  + LP  NYL+PVD++GTFCFAF+ TS  LSIIGN+QQQ  RV+
Sbjct: 283 TSVTIPTVSFHFEGGASVQLPPSNYLVPVDTSGTFCFAFSKTSLDLSIIGNIQQQTMRVA 342

Query: 235 FNLRNSLIGFTPNKC 249
            +L +S +GF P +C
Sbjct: 343 IDLDSSRVGFAPRQC 357


>gi|125592062|gb|EAZ32412.1| hypothetical protein OsJ_16623 [Oryza sativa Japonica Group]
          Length = 473

 Score =  251 bits (641), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 126/254 (49%), Positives = 170/254 (66%), Gaps = 6/254 (2%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
           G+   ET+TLG  +V  +AIGCGH N GLFVGAAGLLGLG G++S   Q+  +    FSY
Sbjct: 221 GELALETLTLGGTAVQGVAIGCGHRNSGLFVGAAGLLGLGWGAMSLVGQLGGAAGGVFSY 280

Query: 58  CLVDRDSDSTSTLEFD--SSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETA 115
           CL  R +    +L      ++P  AV  PL+RN++  +FYY+GLTGI VGG+ LP+ ++ 
Sbjct: 281 CLASRGAGGAGSLVLGRTEAVPVGAVWVPLVRNNQASSFYYVGLTGIGVGGERLPLQDSL 340

Query: 116 FKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRS 175
           F++ E G GG+++D+GTAVTRL  E Y ALR AF     AL  +  V+L DTCYD S  +
Sbjct: 341 FQLTEDGAGGVVMDTGTAVTRLPREAYAALRGAFDGAMGALPRSPAVSLLDTCYDLSGYA 400

Query: 176 SVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSF 235
           SV VPTVSF+F +G VL LPA+N L+ V     FC AFAP+SS +SI+GN+QQ+G +++ 
Sbjct: 401 SVRVPTVSFYFDQGAVLTLPARNLLVEV-GGAVFCLAFAPSSSGISILGNIQQEGIQITV 459

Query: 236 NLRNSLIGFTPNKC 249
           +  N  +GF PN C
Sbjct: 460 DSANGYVGFGPNTC 473


>gi|302765224|ref|XP_002966033.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
 gi|300166847|gb|EFJ33453.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
          Length = 357

 Score =  250 bits (639), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 142/255 (55%), Positives = 181/255 (70%), Gaps = 6/255 (2%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
           GD  +++  +       +  GCGH+NEGLFVGAAGLLGLG G LSFPSQ+++  FSYCLV
Sbjct: 103 GDLASDSFLVSRGRTSPVVFGCGHDNEGLFVGAAGLLGLGAGKLSFPSQLSSRKFSYCLV 162

Query: 61  DRDS--DSTSTLEF-DSSLPPNAVTA--PLLRNHELDTFYYLGLTGISVGGDLLPISETA 115
            RD+   ++S L F DS+LP +A  A   LL+N +LDTFYY GL+GIS+GG LL I  TA
Sbjct: 163 SRDNGVRASSALLFGDSALPTSASFAYTQLLKNPKLDTFYYAGLSGISIGGTLLSIPSTA 222

Query: 116 FKIDES-GNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSR 174
           FK+  S G GG+I+DSGT+VTRL T  Y  +RDAF   T+ L      +LFDTCYDFS+ 
Sbjct: 223 FKLSSSTGRGGVIIDSGTSVTRLPTYAYTVMRDAFRSATQKLPRAADFSLFDTCYDFSAL 282

Query: 175 SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVS 234
           +SV +PTVSFHF  G  + LP  NYL+PVD++GTFCFAF+ TS  LSIIGN+QQQ  RV+
Sbjct: 283 TSVTIPTVSFHFEGGASVQLPPSNYLVPVDTSGTFCFAFSKTSLDLSIIGNIQQQTMRVA 342

Query: 235 FNLRNSLIGFTPNKC 249
            +L +S +GF P +C
Sbjct: 343 IDLDSSRVGFAPRQC 357


>gi|449443786|ref|XP_004139658.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
 gi|449475416|ref|XP_004154449.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 453

 Score =  250 bits (639), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 149/255 (58%), Positives = 179/255 (70%), Gaps = 7/255 (2%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
           GDF TET+T     +  +A+GCGH+NEGLFVGAAGLLGLG G LSFPSQ        FSY
Sbjct: 199 GDFATETLTFRGNKIAKVALGCGHHNEGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSY 258

Query: 58  CLVDRDSDST-STLEF-DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGG-DLLPISET 114
           CLVDR + S  S++ F D+++   A   PL+RN +LDTFYY+GL GISVGG  +  +S +
Sbjct: 259 CLVDRSASSKPSSMVFGDAAISRLARFTPLIRNPKLDTFYYVGLIGISVGGVRVRGVSPS 318

Query: 115 AFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSR 174
            FK+D +GNGG+I+DSGT+VTRL    Y ALRDAF  G R L      +LFDTCYD S +
Sbjct: 319 LFKLDSAGNGGVIIDSGTSVTRLTRPAYTALRDAFRVGARHLKRGPEFSLFDTCYDLSGQ 378

Query: 175 SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVS 234
           SSV+VPTV  HF  G  + LPA NYLIPVD NG+FCFAFA T S LSIIGN+QQQG RV 
Sbjct: 379 SSVKVPTVVLHF-RGADMALPATNYLIPVDENGSFCFAFAGTISGLSIIGNIQQQGFRVV 437

Query: 235 FNLRNSLIGFTPNKC 249
           ++L  S IGF P  C
Sbjct: 438 YDLAGSRIGFAPRGC 452


>gi|90399033|emb|CAJ86229.1| H0402C08.5 [Oryza sativa Indica Group]
 gi|125550227|gb|EAY96049.1| hypothetical protein OsI_17922 [Oryza sativa Indica Group]
          Length = 473

 Score =  250 bits (639), Expect = 4e-64,   Method: Compositional matrix adjust.
 Identities = 126/254 (49%), Positives = 169/254 (66%), Gaps = 6/254 (2%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
           G+   ET+TLG  +V  +AIGCGH N GLFVGAAGLLGLG G++S   Q+  +    FSY
Sbjct: 221 GELALETLTLGGTAVQGVAIGCGHRNSGLFVGAAGLLGLGWGAMSLIGQLGGAAGGVFSY 280

Query: 58  CLVDRDSDSTSTLEFD--SSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETA 115
           CL  R +    +L      ++P  AV  PL+RN++  +FYY+GLTGI VGG+ LP+ +  
Sbjct: 281 CLASRGAGGAGSLVLGRTEAVPVGAVWVPLVRNNQASSFYYVGLTGIGVGGERLPLQDGL 340

Query: 116 FKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRS 175
           F++ E G GG+++D+GTAVTRL  E Y ALR AF     AL  +  V+L DTCYD S  +
Sbjct: 341 FQLTEDGAGGVVMDTGTAVTRLPREAYAALRGAFDGAMGALPRSPAVSLLDTCYDLSGYA 400

Query: 176 SVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSF 235
           SV VPTVSF+F +G VL LPA+N L+ V     FC AFAP+SS +SI+GN+QQ+G +++ 
Sbjct: 401 SVRVPTVSFYFDQGAVLTLPARNLLVEV-GGAVFCLAFAPSSSGISILGNIQQEGIQITV 459

Query: 236 NLRNSLIGFTPNKC 249
           +  N  +GF PN C
Sbjct: 460 DSANGYVGFGPNTC 473


>gi|312282359|dbj|BAJ34045.1| unnamed protein product [Thellungiella halophila]
          Length = 484

 Score =  249 bits (636), Expect = 8e-64,   Method: Compositional matrix adjust.
 Identities = 150/259 (57%), Positives = 182/259 (70%), Gaps = 11/259 (4%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
           GDF TET+T   A VD++A+GCGH+NEGLFVGAAGLLGLG G LSFPSQ        FSY
Sbjct: 227 GDFSTETLTFHGARVDHVALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQTKNRYNGKFSY 286

Query: 58  CLVDRDSDSTS-----TLEF-DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP- 110
           CLVDR S  +S     T+ F + ++P  AV  PLL N +LDTFYYL L GISVGG  +P 
Sbjct: 287 CLVDRTSSGSSSKPPSTIVFGNGAVPKTAVFTPLLTNPKLDTFYYLQLLGISVGGSRVPG 346

Query: 111 ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYD 170
           +SE+ FK+D +GNGG+I+DSGT+VTRL    Y ALRDAF  G   L      +LFDTC+D
Sbjct: 347 VSESQFKLDATGNGGVIIDSGTSVTRLTQSAYVALRDAFRLGATRLKRAPSYSLFDTCFD 406

Query: 171 FSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQG 230
            S  ++V+VPTV FHF  G+V  LPA NYLIPV++ G FCFAFA T  SLSIIGN+QQQG
Sbjct: 407 LSGMTTVKVPTVVFHFTGGEV-SLPASNYLIPVNNQGRFCFAFAGTMGSLSIIGNIQQQG 465

Query: 231 TRVSFNLRNSLIGFTPNKC 249
            RV+++L  S +GF    C
Sbjct: 466 FRVAYDLVGSRVGFLSRAC 484


>gi|255544139|ref|XP_002513132.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223548143|gb|EEF49635.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 481

 Score =  248 bits (634), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 137/253 (54%), Positives = 176/253 (69%), Gaps = 4/253 (1%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAST---FSY 57
           G    ET+T G   V N+AIGCGH N G+FVGAAGLLGLGGGS+S   Q+   T   FSY
Sbjct: 229 GTLALETLTFGRTVVRNVAIGCGHRNRGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSY 288

Query: 58  CLVDRDSDSTSTLEF-DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
           CLV R +DS  +LEF   ++P  A   PL+RN    +FYY+ L+G+ VGG  +PISE  F
Sbjct: 289 CLVSRGTDSAGSLEFGRGAMPVGAAWIPLIRNPRAPSFYYIRLSGVGVGGMKVPISEDVF 348

Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSS 176
           +++E GNGG+++D+GTAVTR+ T  Y A RDAF+  T  L    GV++FDTCY+ +   S
Sbjct: 349 QLNEMGNGGVVMDTGTAVTRIPTVAYVAFRDAFIGQTGNLPRASGVSIFDTCYNLNGFVS 408

Query: 177 VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFN 236
           V VPTVSF+F  G +L LPA+N+LIPVD  GTFCFAFA + S LSIIGN+QQ+G ++SF+
Sbjct: 409 VRVPTVSFYFAGGPILTLPARNFLIPVDDVGTFCFAFAASPSGLSIIGNIQQEGIQISFD 468

Query: 237 LRNSLIGFTPNKC 249
             N  +GF PN C
Sbjct: 469 GANGFVGFGPNVC 481


>gi|297821064|ref|XP_002878415.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297324253|gb|EFH54674.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 486

 Score =  248 bits (632), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 148/259 (57%), Positives = 181/259 (69%), Gaps = 11/259 (4%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
           GDF TET+T   A VD++ +GCGH+NEGLFVGAAGLLGLG G LSFPSQ  +     FSY
Sbjct: 229 GDFSTETLTFHGARVDHVPLGCGHDNEGLFVGAAGLLGLGRGGLSFPSQTKSRYNGKFSY 288

Query: 58  CLVDR-----DSDSTSTLEF-DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP- 110
           CLVDR      S   ST+ F + ++P  +V  PLL N +LDTFYYL L GISVGG  +P 
Sbjct: 289 CLVDRTSSGSSSKPPSTIVFGNDAVPKTSVFTPLLTNPKLDTFYYLQLLGISVGGSRVPG 348

Query: 111 ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYD 170
           +SE+ FK+D +GNGG+I+DSGT+VTRL    Y ALRDAF  G   L      +LFDTC+D
Sbjct: 349 VSESQFKLDATGNGGVIIDSGTSVTRLTQSAYVALRDAFRLGATKLKRAPSYSLFDTCFD 408

Query: 171 FSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQG 230
            S  ++V+VPTV FHF  G+V  LPA NYLIPV++ G FCFAFA T  SLSIIGN+QQQG
Sbjct: 409 LSGMTTVKVPTVVFHFGGGEV-SLPASNYLIPVNTEGRFCFAFAGTMGSLSIIGNIQQQG 467

Query: 231 TRVSFNLRNSLIGFTPNKC 249
            RV+++L  S +GF    C
Sbjct: 468 FRVAYDLVGSRVGFLSRAC 486


>gi|414584783|tpg|DAA35354.1| TPA: hypothetical protein ZEAMMB73_186928 [Zea mays]
          Length = 464

 Score =  246 bits (629), Expect = 4e-63,   Method: Compositional matrix adjust.
 Identities = 128/252 (50%), Positives = 165/252 (65%), Gaps = 5/252 (1%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAST---FSY 57
           G    ET+TLG  +V+ +AIGCGH N GLFVGAAGLLGLG G +S   Q+  +    FSY
Sbjct: 215 GALALETLTLGGTAVEGVAIGCGHRNRGLFVGAAGLLGLGWGPMSLVGQLGGAAGGAFSY 274

Query: 58  CLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFK 117
           CL  R + S   L    ++P  AV  PL+RN +  +FYY+GL+GI VG + LP+ E  F+
Sbjct: 275 CLASRGAGSL-VLGRSEAVPEGAVWVPLVRNPQAPSFYYVGLSGIGVGDERLPLQEDLFQ 333

Query: 118 IDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSV 177
           + E G GG+++D+GTAVTRL  E Y ALRDAFV    AL    GV+L DTCYD S  +SV
Sbjct: 334 LTEDGAGGVVMDTGTAVTRLPQEAYAALRDAFVAAVGALPRAPGVSLLDTCYDLSGYTSV 393

Query: 178 EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNL 237
            VPTVSF+F     L LPA+N L+ VD  G +C AFAP+SS  SI+GN+QQ+G +++ + 
Sbjct: 394 RVPTVSFYFDGAATLTLPARNLLLEVD-GGIYCLAFAPSSSGPSILGNIQQEGIQITVDS 452

Query: 238 RNSLIGFTPNKC 249
            N  IGF P  C
Sbjct: 453 ANGYIGFGPTTC 464


>gi|15228618|ref|NP_191741.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|6850873|emb|CAB71112.1| putative protein [Arabidopsis thaliana]
 gi|332646739|gb|AEE80260.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 483

 Score =  246 bits (629), Expect = 4e-63,   Method: Compositional matrix adjust.
 Identities = 148/259 (57%), Positives = 182/259 (70%), Gaps = 11/259 (4%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
           GDF TET+T   A VD++ +GCGH+NEGLFVGAAGLLGLG G LSFPSQ        FSY
Sbjct: 226 GDFSTETLTFHGARVDHVPLGCGHDNEGLFVGAAGLLGLGRGGLSFPSQTKNRYNGKFSY 285

Query: 58  CLVDRDSDSTS-----TLEF-DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP- 110
           CLVDR S  +S     T+ F ++++P  +V  PLL N +LDTFYYL L GISVGG  +P 
Sbjct: 286 CLVDRTSSGSSSKPPSTIVFGNAAVPKTSVFTPLLTNPKLDTFYYLQLLGISVGGSRVPG 345

Query: 111 ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYD 170
           +SE+ FK+D +GNGG+I+DSGT+VTRL    Y ALRDAF  G   L      +LFDTC+D
Sbjct: 346 VSESQFKLDATGNGGVIIDSGTSVTRLTQPAYVALRDAFRLGATKLKRAPSYSLFDTCFD 405

Query: 171 FSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQG 230
            S  ++V+VPTV FHF  G+V  LPA NYLIPV++ G FCFAFA T  SLSIIGN+QQQG
Sbjct: 406 LSGMTTVKVPTVVFHFGGGEV-SLPASNYLIPVNTEGRFCFAFAGTMGSLSIIGNIQQQG 464

Query: 231 TRVSFNLRNSLIGFTPNKC 249
            RV+++L  S +GF    C
Sbjct: 465 FRVAYDLVGSRVGFLSRAC 483


>gi|242077672|ref|XP_002448772.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
 gi|241939955|gb|EES13100.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
          Length = 471

 Score =  244 bits (624), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 127/260 (48%), Positives = 167/260 (64%), Gaps = 12/260 (4%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAST---FSY 57
           G    ET+TLG  +V+ +AIGCGH N GLFVGAAGLLGLG G +S   Q+  +    FSY
Sbjct: 213 GTLALETLTLGGTAVEGVAIGCGHRNRGLFVGAAGLLGLGWGPMSLVGQLGGAAGGAFSY 272

Query: 58  CLVDR--------DSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLL 109
           CL  R        D+  +  L    ++P  AV  PL+RN +  +FYY+G++GI VG + L
Sbjct: 273 CLASRGGSGSGAADAAGSLVLGRSEAVPEGAVWVPLVRNPQAPSFYYVGVSGIGVGDERL 332

Query: 110 PISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCY 169
           P+ +  F++ E G GG+++D+GTAVTRL  E Y ALRDAFV    AL    GV+L DTCY
Sbjct: 333 PLQDGLFQLTEDGGGGVVMDTGTAVTRLPQEAYAALRDAFVGAVGALPRAPGVSLLDTCY 392

Query: 170 DFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQ 229
           D S  +SV VPTVSF+F     L LPA+N L+ VD  G +C AFAP+SS LSI+GN+QQ+
Sbjct: 393 DLSGYTSVRVPTVSFYFDGAATLTLPARNLLLEVD-GGIYCLAFAPSSSGLSILGNIQQE 451

Query: 230 GTRVSFNLRNSLIGFTPNKC 249
           G +++ +  N  IGF P  C
Sbjct: 452 GIQITVDSANGYIGFGPATC 471


>gi|116787333|gb|ABK24467.1| unknown [Picea sitchensis]
          Length = 497

 Score =  243 bits (619), Expect = 6e-62,   Method: Compositional matrix adjust.
 Identities = 142/255 (55%), Positives = 177/255 (69%), Gaps = 6/255 (2%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINA---STFSY 57
           GDF TET+T     +  +A+GCGH+NEGLF+GAAGLLGLG GSLSFPSQ  A     FSY
Sbjct: 241 GDFSTETLTFRGQVIRRVALGCGHDNEGLFIGAAGLLGLGRGSLSFPSQTGAQFSKRFSY 300

Query: 58  CLVDRDSDST-STLEF-DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLL-PISET 114
           CLVDR +  T S+L F  +++P +A+  PLL N +LDTFYY+ L GISVGG  L  I  +
Sbjct: 301 CLVDRSASGTASSLIFGKAAIPKSAIFTPLLSNPKLDTFYYVELVGISVGGRRLTSIPAS 360

Query: 115 AFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSR 174
            F++D +GNGG+I+DSGT+VTRL    Y+ +RDAF  GT  L    G +LFDTCYD S  
Sbjct: 361 VFRMDATGNGGVIIDSGTSVTRLVDSAYSTMRDAFRVGTGNLKSAGGFSLFDTCYDLSGL 420

Query: 175 SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVS 234
            +V+VPT+ FHF  G  + LPA NYLIPVDS+ TFCFAFA  +  LSIIGN+QQQG RV 
Sbjct: 421 KTVKVPTLVFHFQGGAHISLPATNYLIPVDSSATFCFAFAGNTGGLSIIGNIQQQGYRVV 480

Query: 235 FNLRNSLIGFTPNKC 249
           F+   + +GF    C
Sbjct: 481 FDSLANRVGFKAGSC 495


>gi|224080963|ref|XP_002306246.1| predicted protein [Populus trichocarpa]
 gi|222855695|gb|EEE93242.1| predicted protein [Populus trichocarpa]
          Length = 382

 Score =  242 bits (618), Expect = 8e-62,   Method: Compositional matrix adjust.
 Identities = 136/253 (53%), Positives = 176/253 (69%), Gaps = 4/253 (1%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN---ASTFSY 57
           G    ET+TLG   V N+AIGCGH N+G+FVGAAGLLGLGGGS+SF  Q++    + FSY
Sbjct: 130 GTLALETLTLGRTVVQNVAIGCGHMNQGMFVGAAGLLGLGGGSMSFVGQLSRERGNAFSY 189

Query: 58  CLVDRDSDSTSTLEFDS-SLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
           CLV R ++S   LEF S ++P  A   PL+RN    ++YY+GL+G+ VG   +PISE  F
Sbjct: 190 CLVSRVTNSNGFLEFGSEAMPVGAAWIPLIRNPHSPSYYYIGLSGLGVGDMKVPISEDIF 249

Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSS 176
           ++ E GNGG+++D+GTAVTR  T  Y A RDAF+  T  L    GV++FDTCY+     S
Sbjct: 250 ELTELGNGGVVMDTGTAVTRFPTVAYEAFRDAFIDQTGNLPRASGVSIFDTCYNLFGFLS 309

Query: 177 VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFN 236
           V VPTVSF+F  G +L LPA N+LIPVD  GTFCFAFAP+ S LSI+GN+QQ+G ++S +
Sbjct: 310 VRVPTVSFYFSGGPILTLPANNFLIPVDDAGTFCFAFAPSPSGLSILGNIQQEGIQISVD 369

Query: 237 LRNSLIGFTPNKC 249
             N  +GF PN C
Sbjct: 370 GANEFVGFGPNVC 382


>gi|449529638|ref|XP_004171805.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like, partial
           [Cucumis sativus]
          Length = 384

 Score =  242 bits (617), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 142/255 (55%), Positives = 179/255 (70%), Gaps = 7/255 (2%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
           G+FVTET+T     V+ +A+GCGH+NEGLFVGAAGLLGLG G LSFPSQ   +    FSY
Sbjct: 130 GEFVTETLTFRRTKVEQVALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQAGRTFNQKFSY 189

Query: 58  CLVDRDSDS--TSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP-ISET 114
           CLVDR + S  +S +  +S++   A   PLL N  LDTFYY+ L GISVGG  +  I+ +
Sbjct: 190 CLVDRSASSKPSSVVFGNSAVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVSGITAS 249

Query: 115 AFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSR 174
            FK+D +GNGG+I+D GT+VTRL    Y ALRDAF  G  +L      +LFDTCYD S +
Sbjct: 250 HFKLDRTGNGGVIIDCGTSVTRLNKPAYIALRDAFRAGASSLKSAPEFSLFDTCYDLSGK 309

Query: 175 SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVS 234
           ++V+VPTV  HF  G  + LPA NYLIPVD +G FCFAFA T+S LSIIGN+QQQG RV 
Sbjct: 310 TTVKVPTVVLHF-RGADVSLPASNYLIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVV 368

Query: 235 FNLRNSLIGFTPNKC 249
           ++L +S +GF+P  C
Sbjct: 369 YDLASSRVGFSPRGC 383


>gi|449432044|ref|XP_004133810.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 471

 Score =  241 bits (616), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 142/255 (55%), Positives = 179/255 (70%), Gaps = 7/255 (2%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
           G+FVTET+T     V+ +A+GCGH+NEGLFVGAAGLLGLG G LSFPSQ   +    FSY
Sbjct: 217 GEFVTETLTFRRTKVEQVALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQAGRTFNQKFSY 276

Query: 58  CLVDRDSDS--TSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP-ISET 114
           CLVDR + S  +S +  +S++   A   PLL N  LDTFYY+ L GISVGG  +  I+ +
Sbjct: 277 CLVDRSASSKPSSVVFGNSAVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVSGITAS 336

Query: 115 AFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSR 174
            FK+D +GNGG+I+D GT+VTRL    Y ALRDAF  G  +L      +LFDTCYD S +
Sbjct: 337 HFKLDRTGNGGVIIDCGTSVTRLNKPAYIALRDAFRAGASSLKSAPEFSLFDTCYDLSGK 396

Query: 175 SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVS 234
           ++V+VPTV  HF  G  + LPA NYLIPVD +G FCFAFA T+S LSIIGN+QQQG RV 
Sbjct: 397 TTVKVPTVVLHF-RGADVSLPASNYLIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVV 455

Query: 235 FNLRNSLIGFTPNKC 249
           ++L +S +GF+P  C
Sbjct: 456 YDLASSRVGFSPRGC 470


>gi|224130548|ref|XP_002320868.1| predicted protein [Populus trichocarpa]
 gi|222861641|gb|EEE99183.1| predicted protein [Populus trichocarpa]
          Length = 488

 Score =  241 bits (614), Expect = 3e-61,   Method: Compositional matrix adjust.
 Identities = 144/255 (56%), Positives = 178/255 (69%), Gaps = 7/255 (2%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINA---STFSY 57
           G+F TET+T     V  + +GCGH+NEGLFVGAAGLLGLG G LSFPSQI     S FSY
Sbjct: 234 GEFSTETLTFRGTRVGRVVLGCGHDNEGLFVGAAGLLGLGRGRLSFPSQIGRRFNSKFSY 293

Query: 58  CLVDRDSDST-STLEF-DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP-ISET 114
           CL DR + S  S++ F DS++       PLL N +LDTFYY+ L GISVGG  +  IS +
Sbjct: 294 CLGDRSASSRPSSIVFGDSAISRTTRFTPLLSNPKLDTFYYVELLGISVGGTRVSGISAS 353

Query: 115 AFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSR 174
            FK+D +GNGG+I+DSGT+VTRL    Y ALRDAF+ G   L      +LFDTC+D S +
Sbjct: 354 LFKLDSTGNGGVIIDSGTSVTRLTRAAYVALRDAFLVGASNLKRAPEFSLFDTCFDLSGK 413

Query: 175 SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVS 234
           + V+VPTV  HF  G  +PLPA NYLIPVD++G+FCFAFA T+S LSIIGN+QQQG RV 
Sbjct: 414 TEVKVPTVVLHF-RGADVPLPASNYLIPVDNSGSFCFAFAGTASGLSIIGNIQQQGFRVV 472

Query: 235 FNLRNSLIGFTPNKC 249
           ++L  S +GF P  C
Sbjct: 473 YDLATSRVGFAPRGC 487


>gi|326506192|dbj|BAJ86414.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326516358|dbj|BAJ92334.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 492

 Score =  240 bits (612), Expect = 5e-61,   Method: Compositional matrix adjust.
 Identities = 147/266 (55%), Positives = 181/266 (68%), Gaps = 19/266 (7%)

Query: 1   GDFVTETVTL-GSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN---ASTFS 56
           GDF TET+T  G A V  +A+GCGH+NEGLFV AAGLLGLG GSLSFP+QI+     +FS
Sbjct: 229 GDFATETLTFAGGARVARVALGCGHDNEGLFVAAAGLLGLGRGSLSFPTQISRRYGRSFS 288

Query: 57  YCLVDRDSDS-----TSTLEFDSSLPPNAVTA---PLLRNHELDTFYYLGLTGISVGGDL 108
           YCLVDR S +     +ST+ F S    + V +   P+++N  ++TFYY+ L GISVGG  
Sbjct: 289 YCLVDRTSSANTASRSSTVTFGSGAVGSTVASSFTPMVKNPRMETFYYVQLIGISVGGAR 348

Query: 109 LP-ISETAFKID-ESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRA---LSPTDGVA 163
           +P ++ +  ++D  SG GG+IVDSGT+VTRL    Y+ALRDAF RG  A   LSP  G +
Sbjct: 349 VPGVANSDLRLDPSSGRGGVIVDSGTSVTRLARPAYSALRDAF-RGAAAGLRLSP-GGFS 406

Query: 164 LFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSII 223
           LFDTCYD S R  V+VPTVS HF  G    LP +NYLIPVDS GTFCFAFA T   +SII
Sbjct: 407 LFDTCYDLSGRKVVKVPTVSMHFAGGAEAALPPENYLIPVDSKGTFCFAFAGTDGGVSII 466

Query: 224 GNVQQQGTRVSFNLRNSLIGFTPNKC 249
           GN+QQQG RV F+     + FTP  C
Sbjct: 467 GNIQQQGFRVVFDGDGQRVAFTPKGC 492


>gi|449458736|ref|XP_004147103.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
 gi|449518669|ref|XP_004166359.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
          Length = 482

 Score =  240 bits (612), Expect = 5e-61,   Method: Compositional matrix adjust.
 Identities = 133/253 (52%), Positives = 171/253 (67%), Gaps = 4/253 (1%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAST---FSY 57
           G    ET+T+G   + ++AIGCGH N+G+F+GAAGLLGLGGGS+SF  Q+   T   FSY
Sbjct: 230 GTLALETLTVGQVMIRDVAIGCGHTNQGMFIGAAGLLGLGGGSMSFIGQLGGQTGGAFSY 289

Query: 58  CLVDRDSDSTSTLEF-DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
           CLV R + ST  LEF   +LP  A    L+RN    +FYY+GL GI VGG  + + E  F
Sbjct: 290 CLVSRGTGSTGALEFGRGALPVGATWISLIRNPRAPSFYYIGLAGIGVGGVRVSVPEETF 349

Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSS 176
           ++ E G  G+++D+GTAVTR  T  Y A RD+F   T  L    GV++FDTCYD +   S
Sbjct: 350 QLTEYGTNGVVMDTGTAVTRFPTAAYVAFRDSFTAQTSNLPRAPGVSIFDTCYDLNGFES 409

Query: 177 VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFN 236
           V VPTVSF+F +G VL LPA+N+LIPVD  GTFC AFAP+ S LSIIGN+QQ+G ++SF+
Sbjct: 410 VRVPTVSFYFSDGPVLTLPARNFLIPVDGGGTFCLAFAPSPSGLSIIGNIQQEGIQISFD 469

Query: 237 LRNSLIGFTPNKC 249
             N  +GF PN C
Sbjct: 470 GANGFVGFGPNIC 482


>gi|224067990|ref|XP_002302634.1| predicted protein [Populus trichocarpa]
 gi|222844360|gb|EEE81907.1| predicted protein [Populus trichocarpa]
          Length = 490

 Score =  239 bits (610), Expect = 8e-61,   Method: Compositional matrix adjust.
 Identities = 141/255 (55%), Positives = 178/255 (69%), Gaps = 7/255 (2%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN---ASTFSY 57
           G+F TET+T     V  +A+GCGH+NEGLF+GAAGLLGLG G LSFPSQI    +  FSY
Sbjct: 236 GEFSTETLTFRGTRVGRVALGCGHDNEGLFIGAAGLLGLGRGRLSFPSQIGRRFSRKFSY 295

Query: 58  CLVDRDSDST-STLEF-DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP-ISET 114
           CLVDR + S  S + F DS++   A   PL+ N +LDTFYY+ L G+SVGG  +P I+ +
Sbjct: 296 CLVDRSASSKPSYMVFGDSAISRTARFTPLVSNPKLDTFYYVELLGVSVGGTRVPGITAS 355

Query: 115 AFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSR 174
            FK+D +GNGG+I+DSGT+VTRL    Y ALRDAF  G   L      +LFDTC+D S +
Sbjct: 356 LFKLDSTGNGGVIIDSGTSVTRLTRPAYVALRDAFRVGASNLKRAPEFSLFDTCFDLSGK 415

Query: 175 SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVS 234
           + V+VPTV  HF  G  + LPA NYLIPVD++G+FCFAFA T S LSI+GN+QQQG RV 
Sbjct: 416 TEVKVPTVVLHF-RGADVSLPASNYLIPVDNSGSFCFAFAGTMSGLSIVGNIQQQGFRVV 474

Query: 235 FNLRNSLIGFTPNKC 249
           ++L  S +GF P  C
Sbjct: 475 YDLAASRVGFAPRGC 489


>gi|224093400|ref|XP_002309912.1| predicted protein [Populus trichocarpa]
 gi|222852815|gb|EEE90362.1| predicted protein [Populus trichocarpa]
          Length = 382

 Score =  238 bits (608), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 131/253 (51%), Positives = 175/253 (69%), Gaps = 4/253 (1%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAST---FSY 57
           G    ET+T G   V N+AIGCGH+N G+FVGAAGLLGLGGGS+SF  Q++  T   FSY
Sbjct: 130 GTLALETLTFGRTVVRNVAIGCGHSNRGMFVGAAGLLGLGGGSMSFMGQLSGQTGNAFSY 189

Query: 58  CLVDRDSDSTSTLEFDS-SLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
           CLV R +++   LEF S ++P  A   PL+RN    +FYY+ L G+ VG   +P+SE  F
Sbjct: 190 CLVSRGTNTNGFLEFGSEAMPVGAAWIPLVRNPRAPSFYYIRLLGLGVGDTRVPVSEDVF 249

Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSS 176
           +++E G+GG+++D+GTAVTR  T  Y A R+AF+  T+ L    GV++FDTCY+     S
Sbjct: 250 QLNELGSGGVVMDTGTAVTRFPTVAYEAFRNAFIEQTQNLPRASGVSIFDTCYNLFGFLS 309

Query: 177 VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFN 236
           V VPTVSF+F  G +L +PA N+LIPVD  GTFCFAFAP+ S LSI+GN+QQ+G ++S +
Sbjct: 310 VRVPTVSFYFSGGPILTIPANNFLIPVDDAGTFCFAFAPSPSGLSILGNIQQEGIQISVD 369

Query: 237 LRNSLIGFTPNKC 249
             N  +GF PN C
Sbjct: 370 EANEFVGFGPNIC 382


>gi|168008816|ref|XP_001757102.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162691600|gb|EDQ77961.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 406

 Score =  238 bits (607), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 144/262 (54%), Positives = 186/262 (70%), Gaps = 14/262 (5%)

Query: 1   GDFVTETVTLGSAS------VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQI---N 51
           G+F T+ V+L S S      ++ I +GCGH+NEG FVGAAGLLGLG G LSFP+Q+   N
Sbjct: 145 GEFGTDDVSLNSTSGVGQVVLNKIPLGCGHDNEGYFVGAAGLLGLGKGPLSFPNQVDPQN 204

Query: 52  ASTFSYCLVDRDSDST--STLEF-DSSLPP-NAVTAPLLRNHELDTFYYLGLTGISVGGD 107
              FSYCL DR++DST  S+L F ++++PP  A   P   N  + TFYYL +TGISVGG 
Sbjct: 205 GGRFSYCLTDRETDSTEGSSLVFGEAAVPPAGARFTPQDSNMRVPTFYYLKMTGISVGGT 264

Query: 108 LLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDT 167
           +L I  +AF++D  GNGG+I+DSGT+VTRLQ   Y +LRDAF  GT  L+PT G +LFDT
Sbjct: 265 ILTIPTSAFQLDSLGNGGVIIDSGTSVTRLQNAAYASLRDAFRAGTSDLAPTAGFSLFDT 324

Query: 168 CYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQ 227
           CYD S  +SV+VPTV+ HF  G  L LPA NYLIPVD++ TFC AFA T+   SIIGN+Q
Sbjct: 325 CYDLSGLASVDVPTVTLHFQGGTDLKLPASNYLIPVDNSNTFCLAFAGTTGP-SIIGNIQ 383

Query: 228 QQGTRVSFNLRNSLIGFTPNKC 249
           QQG RV ++  ++ +GF P++C
Sbjct: 384 QQGFRVIYDNLHNQVGFVPSQC 405


>gi|255564685|ref|XP_002523337.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223537425|gb|EEF39053.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 469

 Score =  238 bits (607), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 141/255 (55%), Positives = 175/255 (68%), Gaps = 7/255 (2%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
           GDF TET+T     V  +A+GCGH+NEGLFVGAAGLLGLG G LSFPSQ        FSY
Sbjct: 215 GDFSTETLTFRRTRVARVALGCGHDNEGLFVGAAGLLGLGRGRLSFPSQTGRRFNHKFSY 274

Query: 58  CLVDRDSDST-STLEF-DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP-ISET 114
           CLVDR + S  S++ F DS++   A   PL+ N +LDTFYY+ L GISVGG  +P I+ +
Sbjct: 275 CLVDRSASSKPSSMVFGDSAVSRTARFTPLVSNPKLDTFYYVELLGISVGGTRVPGITAS 334

Query: 115 AFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSR 174
            FK+D++GNGG+I+DSGT+VTRL    Y A RDAF  G   L      +LFDTC+D S +
Sbjct: 335 LFKLDQTGNGGVIIDSGTSVTRLTRPAYIAFRDAFRAGASNLKRAPQFSLFDTCFDLSGK 394

Query: 175 SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVS 234
           + V+VPTV  HF  G  + LPA NYLIPVD++G FC AFA T   LSIIGN+QQQG RV 
Sbjct: 395 TEVKVPTVVLHF-RGADVSLPASNYLIPVDTSGNFCLAFAGTMGGLSIIGNIQQQGFRVV 453

Query: 235 FNLRNSLIGFTPNKC 249
           ++L  S +GF P+ C
Sbjct: 454 YDLAGSRVGFAPHGC 468


>gi|357135412|ref|XP_003569303.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 494

 Score =  238 bits (607), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 147/266 (55%), Positives = 180/266 (67%), Gaps = 19/266 (7%)

Query: 1   GDFVTETVTL-GSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN---ASTFS 56
           GDF TET+T  G A V  IA+GCGH+NEGLFV AAGLLGLG GSLSFP+QI+     +FS
Sbjct: 231 GDFATETLTFAGGARVARIALGCGHDNEGLFVAAAGLLGLGRGSLSFPAQISRRYGRSFS 290

Query: 57  YCLVDRDSDS-----TSTLEFDSSLPPNAVTA---PLLRNHELDTFYYLGLTGISVGGDL 108
           YCLVDR S +     +ST+ F S    + V A   P+++N  ++TFYY+ L GISVGG  
Sbjct: 291 YCLVDRTSSANPASHSSTVTFGSGAVGSTVAASFTPMVKNPRMETFYYVQLVGISVGGAR 350

Query: 109 LP-ISETAFKID-ESGNGGIIVDSGTAVTRLQTETYNALRDAF---VRGTRALSPTDGVA 163
           +  ++++  ++D  SG GG+IVDSGT+VTRL    Y+ALRDAF     G R LSP  G +
Sbjct: 351 VSGVADSDLRLDPSSGRGGVIVDSGTSVTRLARPAYSALRDAFRAAAAGLR-LSP-GGFS 408

Query: 164 LFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSII 223
           LFDTCYD S R  V+VPTVS HF  G    LP +NYLIPVDS GTFCFAFA T   +SII
Sbjct: 409 LFDTCYDLSGRKVVKVPTVSMHFAGGAEAALPPENYLIPVDSKGTFCFAFAGTDGGVSII 468

Query: 224 GNVQQQGTRVSFNLRNSLIGFTPNKC 249
           GN+QQQG RV F+     +GF P  C
Sbjct: 469 GNIQQQGFRVVFDGDGQRVGFVPKGC 494


>gi|357521081|ref|XP_003630829.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355524851|gb|AET05305.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 526

 Score =  238 bits (606), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 121/244 (49%), Positives = 166/244 (68%), Gaps = 2/244 (0%)

Query: 1   GDFVTETVTLGSA-SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
           G  + ETV+  S+  VD +++GC + N+G FVG+ G  GLG GSLSFPS+INAS+ SYCL
Sbjct: 275 GVLINETVSFESSGWVDRVSLGCSNKNQGPFVGSDGTFGLGRGSLSFPSRINASSMSYCL 334

Query: 60  VD-RDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKI 118
           V+ +D  S+STLEF+S     +V A LL+N + +  YY+GL GI VGG+ + +  + F I
Sbjct: 335 VESKDGYSSSTLEFNSPPCSGSVKAKLLQNPKAENLYYVGLKGIKVGGEKIDVPNSTFTI 394

Query: 119 DESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVE 178
           D  GNGG+IV S + +T L+ +TYN +RDAFV  T+ L        FDTCY+ SS ++VE
Sbjct: 395 DPYGNGGMIVSSSSLITMLENDTYNVVRDAFVAKTQHLERLKAFLQFDTCYNLSSNNTVE 454

Query: 179 VPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLR 238
           +P + F   +GK   LP ++YL  VD NGTFCFAFAP+  S SI+G +QQ GTRV+F+L 
Sbjct: 455 LPILEFEVNDGKSWLLPKESYLYAVDKNGTFCFAFAPSKGSFSILGTLQQYGTRVTFDLV 514

Query: 239 NSLI 242
           NS +
Sbjct: 515 NSFV 518


>gi|225470916|ref|XP_002263964.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
 gi|147788999|emb|CAN64659.1| hypothetical protein VITISV_009613 [Vitis vinifera]
          Length = 489

 Score =  237 bits (605), Expect = 3e-60,   Method: Compositional matrix adjust.
 Identities = 142/255 (55%), Positives = 174/255 (68%), Gaps = 7/255 (2%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN---ASTFSY 57
           G+F TET+T     V  +A+GCGH+NEGLFVGAAGLLGLG G LSFP+Q        FSY
Sbjct: 235 GEFSTETLTFRGTRVPKVALGCGHDNEGLFVGAAGLLGLGRGRLSFPTQTGLRFGRKFSY 294

Query: 58  CLVDRDSDS--TSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP-ISET 114
           CLVDR + S  +S +   S++   AV  PL+ N +LDTFYYL LTGISVGG  +  I+ +
Sbjct: 295 CLVDRSASSKPSSVVFGQSAVSRTAVFTPLITNPKLDTFYYLELTGISVGGARVAGITAS 354

Query: 115 AFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSR 174
            FK+D +GNGG+I+DSGT+VTRL    Y +LRDAF  G   L      +LFDTC+D S +
Sbjct: 355 LFKLDTAGNGGVIIDSGTSVTRLTRRAYVSLRDAFRAGAADLKRAPDYSLFDTCFDLSGK 414

Query: 175 SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVS 234
           + V+VPTV  HF  G  + LPA NYLIPVD+NG FCFAFA T S LSIIGN+QQQG RV 
Sbjct: 415 TEVKVPTVVMHF-RGADVSLPATNYLIPVDTNGVFCFAFAGTMSGLSIIGNIQQQGFRVV 473

Query: 235 FNLRNSLIGFTPNKC 249
           F++  S IGF    C
Sbjct: 474 FDVAASRIGFAARGC 488


>gi|21594980|gb|AAM66061.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
           thaliana]
          Length = 485

 Score =  237 bits (605), Expect = 3e-60,   Method: Compositional matrix adjust.
 Identities = 140/255 (54%), Positives = 175/255 (68%), Gaps = 7/255 (2%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINA---STFSY 57
           GDF TET+T     V  +A+GCGH+NEGLFVGAAGLLGLG G LSFP Q        FSY
Sbjct: 231 GDFSTETLTFRRNRVKGVALGCGHDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSY 290

Query: 58  CLVDRDSDS--TSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP-ISET 114
           CLVDR + S  +S +  ++++   A   PLL N +LDTFYY+GL GISVGG  +P ++ +
Sbjct: 291 CLVDRSASSKPSSVVFGNAAVSRIARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTAS 350

Query: 115 AFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSR 174
            FK+D+ GNGG+I+DSGT+VTRL    Y A+RDAF  G + L      +LFDTC+D S+ 
Sbjct: 351 LFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPNFSLFDTCFDLSNM 410

Query: 175 SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVS 234
           + V+VPTV  HF    V  LPA NYLIPVD+NG FCFAFA T   LSIIGN+QQQG RV 
Sbjct: 411 NEVKVPTVVLHFRRADV-SLPATNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVV 469

Query: 235 FNLRNSLIGFTPNKC 249
           ++L +S +GF P  C
Sbjct: 470 YDLASSRVGFAPGGC 484


>gi|356573235|ref|XP_003554768.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 472

 Score =  237 bits (605), Expect = 3e-60,   Method: Compositional matrix adjust.
 Identities = 142/255 (55%), Positives = 176/255 (69%), Gaps = 7/255 (2%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINA---STFSY 57
           GDF TET+T     V  +A+GCGH+NEGLF+GAAGLLGLG G LSFP Q        FSY
Sbjct: 218 GDFSTETLTFRRTRVTRVALGCGHDNEGLFIGAAGLLGLGRGRLSFPVQTGRRFNQKFSY 277

Query: 58  CLVDRDSDST-STLEF-DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGD-LLPISET 114
           CLVDR + +  S++ F DS++   A   PL++N +LDTFYYL L GISVGG  +  +S +
Sbjct: 278 CLVDRSASAKPSSVVFGDSAVSRTARFTPLIKNPKLDTFYYLELLGISVGGSPVRGLSAS 337

Query: 115 AFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSR 174
            F++D +GNGG+I+DSGT+VTRL    Y ALRDAF  G   L      +LFDTC+D S  
Sbjct: 338 LFRLDAAGNGGVIIDSGTSVTRLTRPAYIALRDAFRVGASHLKRAAEFSLFDTCFDLSGL 397

Query: 175 SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVS 234
           + V+VPTV  HF  G  + LPA NYLIPVD++G+FCFAFA T S LSIIGN+QQQG RVS
Sbjct: 398 TEVKVPTVVLHF-RGADVSLPATNYLIPVDNSGSFCFAFAGTMSGLSIIGNIQQQGFRVS 456

Query: 235 FNLRNSLIGFTPNKC 249
           F+L  S +GF P  C
Sbjct: 457 FDLAGSRVGFAPRGC 471


>gi|356522504|ref|XP_003529886.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 473

 Score =  237 bits (604), Expect = 4e-60,   Method: Compositional matrix adjust.
 Identities = 143/255 (56%), Positives = 178/255 (69%), Gaps = 7/255 (2%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINA---STFSY 57
           GDF TET+T   A+V  +AIGCGH+NEGLFVGAAGLLGLG G LSFP+Q      + FSY
Sbjct: 219 GDFSTETLTFRRAAVPRVAIGCGHDNEGLFVGAAGLLGLGRGGLSFPTQTGTRFNNKFSY 278

Query: 58  CLVDRDSDST-STLEF-DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGG-DLLPISET 114
           CL DR + +  S++ F DS++   A   PL++N +LDTFYY+ L GISVGG  +  IS +
Sbjct: 279 CLTDRTASAKPSSIVFGDSAVSRTARFTPLVKNPKLDTFYYVELLGISVGGAPVRGISAS 338

Query: 115 AFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSR 174
            F++D +GNGG+I+DSGT+VTRL    Y +LRDAF  G   L      +LFDTCYD S  
Sbjct: 339 FFRLDSTGNGGVIIDSGTSVTRLTRPAYVSLRDAFRVGASHLKRAPEFSLFDTCYDLSGL 398

Query: 175 SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVS 234
           S V+VPTV  HF  G  + LPA NYL+PVD++G+FCFAFA T S LSIIGN+QQQG RV 
Sbjct: 399 SEVKVPTVVLHF-RGADVSLPAANYLVPVDNSGSFCFAFAGTMSGLSIIGNIQQQGFRVV 457

Query: 235 FNLRNSLIGFTPNKC 249
           F+L  S +GF P  C
Sbjct: 458 FDLAGSRVGFAPRGC 472


>gi|15223368|ref|NP_171637.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|9665144|gb|AAF97328.1|AC023628_9 Unknown protein [Arabidopsis thaliana]
 gi|22135930|gb|AAM91547.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
           thaliana]
 gi|30387595|gb|AAP31963.1| At1g01300 [Arabidopsis thaliana]
 gi|332189147|gb|AEE27268.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 485

 Score =  237 bits (604), Expect = 4e-60,   Method: Compositional matrix adjust.
 Identities = 140/255 (54%), Positives = 176/255 (69%), Gaps = 7/255 (2%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINA---STFSY 57
           GDF TET+T     V  +A+GCGH+NEGLFVGAAGLLGLG G LSFP Q        FSY
Sbjct: 231 GDFSTETLTFRRNRVKGVALGCGHDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSY 290

Query: 58  CLVDRDSDS--TSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP-ISET 114
           CLVDR + S  +S +  ++++   A   PLL N +LDTFYY+GL GISVGG  +P ++ +
Sbjct: 291 CLVDRSASSKPSSVVFGNAAVSRIARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTAS 350

Query: 115 AFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSR 174
            FK+D+ GNGG+I+DSGT+VTRL    Y A+RDAF  G + L      +LFDTC+D S+ 
Sbjct: 351 LFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPDFSLFDTCFDLSNM 410

Query: 175 SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVS 234
           + V+VPTV  HF  G  + LPA NYLIPVD+NG FCFAFA T   LSIIGN+QQQG RV 
Sbjct: 411 NEVKVPTVVLHF-RGADVSLPATNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVV 469

Query: 235 FNLRNSLIGFTPNKC 249
           ++L +S +GF P  C
Sbjct: 470 YDLASSRVGFAPGGC 484


>gi|242053503|ref|XP_002455897.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
 gi|241927872|gb|EES01017.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
          Length = 493

 Score =  236 bits (603), Expect = 4e-60,   Method: Compositional matrix adjust.
 Identities = 149/267 (55%), Positives = 180/267 (67%), Gaps = 20/267 (7%)

Query: 1   GDFVTETVTL-GSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN---ASTFS 56
           GDF TET+T  G A V  +A+GCGH+NEGLFV AAGLLGLG GSLSFP+QI+     +FS
Sbjct: 229 GDFATETLTFAGGARVARVALGCGHDNEGLFVAAAGLLGLGRGSLSFPTQISRRYGKSFS 288

Query: 57  YCLVDRDSDSTSTLEFDSSL------PPNAVTA---PLLRNHELDTFYYLGLTGISVGGD 107
           YCLVDR S S+S     S        PP+A  A   P++RN  ++TFYY+ L GISVGG 
Sbjct: 289 YCLVDRTSSSSSGAASRSRSSTVTFGPPSASAASFTPMVRNPRMETFYYVQLVGISVGGA 348

Query: 108 LLP-ISETAFKIDES-GNGGIIVDSGTAVTRLQTETYNALRDAF---VRGTRALSPTDGV 162
            +P ++E+  ++D S G GG+IVDSGT+VTRL   +Y+ALRDAF     G R LSP  G 
Sbjct: 349 RVPGVAESDLRLDPSTGRGGVIVDSGTSVTRLARPSYSALRDAFRAAAAGLR-LSP-GGF 406

Query: 163 ALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSI 222
           +LFDTCYD   R  V+VPTVS HF  G    LP +NYLIPVDS GTFCFAFA T   +SI
Sbjct: 407 SLFDTCYDLGGRKVVKVPTVSMHFAGGAEAALPPENYLIPVDSRGTFCFAFAGTDGGVSI 466

Query: 223 IGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           IGN+QQQG RV F+     +GF P  C
Sbjct: 467 IGNIQQQGFRVVFDGDGQRVGFAPKGC 493


>gi|297603570|ref|NP_001054261.2| Os04g0677100 [Oryza sativa Japonica Group]
 gi|255675885|dbj|BAF16175.2| Os04g0677100 [Oryza sativa Japonica Group]
          Length = 464

 Score =  236 bits (602), Expect = 6e-60,   Method: Compositional matrix adjust.
 Identities = 121/252 (48%), Positives = 161/252 (63%), Gaps = 11/252 (4%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
           G+   ET+TLG  +V  +AIGCGH N GLFVGAAGLLGLG G++S   Q+  +    FSY
Sbjct: 221 GELALETLTLGGTAVQGVAIGCGHRNSGLFVGAAGLLGLGWGAMSLVGQLGGAAGGVFSY 280

Query: 58  CLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFK 117
           CL  R +    +L           T  + R     +FYY+GLTGI VGG+ LP+ ++ F+
Sbjct: 281 CLASRGAGGAGSLVLGR-------TEAVPRGRRASSFYYVGLTGIGVGGERLPLQDSLFQ 333

Query: 118 IDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSV 177
           + E G GG+++D+GTAVTRL  E Y ALR AF     AL  +  V+L DTCYD S  +SV
Sbjct: 334 LTEDGAGGVVMDTGTAVTRLPREAYAALRGAFDGAMGALPRSPAVSLLDTCYDLSGYASV 393

Query: 178 EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNL 237
            VPTVSF+F +G VL LPA+N L+ V     FC AFAP+SS +SI+GN+QQ+G +++ + 
Sbjct: 394 RVPTVSFYFDQGAVLTLPARNLLVEV-GGAVFCLAFAPSSSGISILGNIQQEGIQITVDS 452

Query: 238 RNSLIGFTPNKC 249
            N  +GF PN C
Sbjct: 453 ANGYVGFGPNTC 464


>gi|297848386|ref|XP_002892074.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297337916|gb|EFH68333.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 485

 Score =  236 bits (602), Expect = 7e-60,   Method: Compositional matrix adjust.
 Identities = 140/255 (54%), Positives = 176/255 (69%), Gaps = 7/255 (2%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINA---STFSY 57
           GDF TET+T     V  +A+GCGH+NEGLFVGAAGLLGLG G LSFP Q        FSY
Sbjct: 231 GDFSTETLTFRRNRVKGVALGCGHDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSY 290

Query: 58  CLVDRDSDS--TSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP-ISET 114
           CLVDR + S  +S +  ++++   A   PLL N +LDTFYY+ L GISVGG  +P ++ +
Sbjct: 291 CLVDRSASSKPSSVVFGNAAVSRIARFTPLLSNPKLDTFYYVELLGISVGGTRVPGVAAS 350

Query: 115 AFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSR 174
            FK+D+ GNGG+I+DSGT+VTRL    Y A+RDAF  G +AL      +LFDTC+D S+ 
Sbjct: 351 LFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKALKRAPDFSLFDTCFDLSNM 410

Query: 175 SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVS 234
           + V+VPTV  HF  G  + LPA NYLIPVD+NG FCFAFA T   LSIIGN+QQQG RV 
Sbjct: 411 NEVKVPTVVLHF-RGADVSLPATNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVV 469

Query: 235 FNLRNSLIGFTPNKC 249
           ++L +S +GF P  C
Sbjct: 470 YDLASSRVGFAPGGC 484


>gi|297742733|emb|CBI35367.3| unnamed protein product [Vitis vinifera]
          Length = 521

 Score =  234 bits (596), Expect = 3e-59,   Method: Compositional matrix adjust.
 Identities = 132/252 (52%), Positives = 167/252 (66%), Gaps = 21/252 (8%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAST---FSY 57
           G    ET+T G   V ++AIGCGH N G+FVGAAGLLGLGGGS+SF  Q+   T   FSY
Sbjct: 288 GTLALETLTFGRTMVRSVAIGCGHRNRGMFVGAAGLLGLGGGSMSFVGQLGGQTGGAFSY 347

Query: 58  CLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFK 117
           CLV                  +A   PL+RN    +FYY+GL G+ VGG  +PISE  F+
Sbjct: 348 CLV------------------SAAWVPLVRNPRAPSFYYIGLAGLGVGGIRVPISEEVFR 389

Query: 118 IDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSV 177
           + E G+GG+++D+GTAVTRL T  Y A RDAF+  T  L    GVA+FDTCYD     SV
Sbjct: 390 LTELGDGGVVMDTGTAVTRLPTLAYQAFRDAFLAQTANLPRATGVAIFDTCYDLLGFVSV 449

Query: 178 EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNL 237
            VPTVSF+F  G +L LPA+N+LIP+D  GTFCFAFAP++S LSI+GN+QQ+G ++SF+ 
Sbjct: 450 RVPTVSFYFSGGPILTLPARNFLIPMDDAGTFCFAFAPSTSGLSILGNIQQEGIQISFDG 509

Query: 238 RNSLIGFTPNKC 249
            N  +GF PN C
Sbjct: 510 ANGYVGFGPNIC 521


>gi|414881704|tpg|DAA58835.1| TPA: hypothetical protein ZEAMMB73_701358 [Zea mays]
          Length = 485

 Score =  234 bits (596), Expect = 3e-59,   Method: Compositional matrix adjust.
 Identities = 146/269 (54%), Positives = 178/269 (66%), Gaps = 21/269 (7%)

Query: 1   GDFVTETVTL-GSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN---ASTFS 56
           GDFVTET+T  G A V  +A+GCGH+NEGLFV AAGLLGLG G LSFP+QI+     +FS
Sbjct: 218 GDFVTETLTFAGGARVARVALGCGHDNEGLFVAAAGLLGLGRGGLSFPTQISRRYGRSFS 277

Query: 57  YCLVDR---------DSDSTSTLEFD--SSLPPNAVTAPLLRNHELDTFYYLGLTGISVG 105
           YCLVDR          S  +ST+ F   S    +A   P++RN  ++TFYY+ L GISVG
Sbjct: 278 YCLVDRTSSGAGAAPGSHRSSTVSFGAGSVGASSASFTPMVRNPRMETFYYVQLVGISVG 337

Query: 106 GDLLP-ISETAFKIDES-GNGGIIVDSGTAVTRLQTETYNALRDAF---VRGTRALSPTD 160
           G  +P ++E+  ++D S G GG+IVDSGT+VTRL   +Y+ALRDAF     G   LSP  
Sbjct: 338 GARVPGVAESDLRLDPSTGRGGVIVDSGTSVTRLARASYSALRDAFRAAAAGGLRLSP-G 396

Query: 161 GVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSL 220
           G +LFDTCYD   R  V+VPTVS HF  G    LP +NYLIPVDS GTFCFAFA T   +
Sbjct: 397 GFSLFDTCYDLGGRRVVKVPTVSMHFAGGAEAALPPENYLIPVDSRGTFCFAFAGTDGGV 456

Query: 221 SIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           SIIGN+QQQG RV F+     +GF P  C
Sbjct: 457 SIIGNIQQQGFRVVFDGDGQRVGFAPKGC 485


>gi|308044575|ref|NP_001183392.1| uncharacterized protein LOC100501808 [Zea mays]
 gi|238011188|gb|ACR36629.1| unknown [Zea mays]
          Length = 342

 Score =  233 bits (595), Expect = 4e-59,   Method: Compositional matrix adjust.
 Identities = 146/269 (54%), Positives = 178/269 (66%), Gaps = 21/269 (7%)

Query: 1   GDFVTETVTL-GSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN---ASTFS 56
           GDFVTET+T  G A V  +A+GCGH+NEGLFV AAGLLGLG G LSFP+QI+     +FS
Sbjct: 75  GDFVTETLTFAGGARVARVALGCGHDNEGLFVAAAGLLGLGRGGLSFPTQISRRYGRSFS 134

Query: 57  YCLVDR---------DSDSTSTLEFD--SSLPPNAVTAPLLRNHELDTFYYLGLTGISVG 105
           YCLVDR          S  +ST+ F   S    +A   P++RN  ++TFYY+ L GISVG
Sbjct: 135 YCLVDRTSSGAGAAPGSHRSSTVSFGAGSVGASSASFTPMVRNPRMETFYYVQLVGISVG 194

Query: 106 GDLLP-ISETAFKIDES-GNGGIIVDSGTAVTRLQTETYNALRDAF---VRGTRALSPTD 160
           G  +P ++E+  ++D S G GG+IVDSGT+VTRL   +Y+ALRDAF     G   LSP  
Sbjct: 195 GARVPGVAESDLRLDPSTGRGGVIVDSGTSVTRLARASYSALRDAFRAAAAGGLRLSP-G 253

Query: 161 GVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSL 220
           G +LFDTCYD   R  V+VPTVS HF  G    LP +NYLIPVDS GTFCFAFA T   +
Sbjct: 254 GFSLFDTCYDLGGRRVVKVPTVSMHFAGGAEAALPPENYLIPVDSRGTFCFAFAGTDGGV 313

Query: 221 SIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           SIIGN+QQQG RV F+     +GF P  C
Sbjct: 314 SIIGNIQQQGFRVVFDGDGQRVGFAPKGC 342


>gi|168013126|ref|XP_001759252.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162689565|gb|EDQ75936.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 385

 Score =  233 bits (593), Expect = 7e-59,   Method: Compositional matrix adjust.
 Identities = 142/262 (54%), Positives = 183/262 (69%), Gaps = 14/262 (5%)

Query: 1   GDFVTETVTLGSAS------VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAST 54
           G+F T+ V+L S S      ++ I +GCGH+NEG FVGAAGLLGLG G LSFP+QIN+  
Sbjct: 124 GEFATDAVSLNSTSGGGQVVLNKIPLGCGHDNEGYFVGAAGLLGLGKGPLSFPNQINSEN 183

Query: 55  ---FSYCLVDRDSDST--STLEF-DSSLPPNAVT-APLLRNHELDTFYYLGLTGISVGGD 107
              FSYCL  RD+DST  S+L F D+++PP  V   P   N  + TFYYL +TGISVGG 
Sbjct: 184 GGRFSYCLTGRDTDSTERSSLIFGDAAVPPAGVRFTPQASNLRVSTFYYLKMTGISVGGS 243

Query: 108 LLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDT 167
           +L I  +AF++D  GNGG+I+DSGT+VTRLQ   Y +LR+AF  GT  L  T   +LFDT
Sbjct: 244 ILTIPTSAFQLDSLGNGGVIIDSGTSVTRLQNAAYASLREAFRAGTSDLVLTTEFSLFDT 303

Query: 168 CYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQ 227
           CY+ S  SSV+VPTV+ HF  G  L LPA NYL+PVD++ TFC AFA T+   SIIGN+Q
Sbjct: 304 CYNLSDLSSVDVPTVTLHFQGGADLKLPASNYLVPVDNSSTFCLAFAGTTGP-SIIGNIQ 362

Query: 228 QQGTRVSFNLRNSLIGFTPNKC 249
           QQG RV ++  ++ +GF P++C
Sbjct: 363 QQGFRVIYDNLHNQVGFVPSQC 384


>gi|356504173|ref|XP_003520873.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 461

 Score =  232 bits (592), Expect = 9e-59,   Method: Compositional matrix adjust.
 Identities = 139/255 (54%), Positives = 174/255 (68%), Gaps = 7/255 (2%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
           GDF TET+T     V  +A+GCGH+NEGLF GAAGLLGLG G LSFP Q        FSY
Sbjct: 207 GDFSTETLTFRRNRVTRVALGCGHDNEGLFTGAAGLLGLGRGRLSFPVQTGRRFNHKFSY 266

Query: 58  CLVDRDSDS--TSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGG-DLLPISET 114
           CLVDR + +  +S +  DS++   A   PL++N +LDTFYYL L GISVGG  +  +S +
Sbjct: 267 CLVDRSASAKPSSVIFGDSAVSRTAHFTPLIKNPKLDTFYYLELLGISVGGAPVRGLSAS 326

Query: 115 AFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSR 174
            F++D +GNGG+I+DSGT+VTRL    Y ALRDAF  G   L      +LFDTC+D S  
Sbjct: 327 LFRLDAAGNGGVIIDSGTSVTRLTRPAYIALRDAFRIGASHLKRAPEFSLFDTCFDLSGL 386

Query: 175 SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVS 234
           + V+VPTV  HF  G  + LPA NYLIPVD++G+FCFAFA T S LSIIGN+QQQG R+S
Sbjct: 387 TEVKVPTVVLHF-RGADVSLPATNYLIPVDNSGSFCFAFAGTMSGLSIIGNIQQQGFRIS 445

Query: 235 FNLRNSLIGFTPNKC 249
           ++L  S +GF P  C
Sbjct: 446 YDLTGSRVGFAPRGC 460


>gi|356536463|ref|XP_003536757.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 475

 Score =  231 bits (589), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 136/253 (53%), Positives = 174/253 (68%), Gaps = 4/253 (1%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAST---FSY 57
           G    ET+T G   + N+AIGCGH+N+G+FVGAAGLLGLGGG +SF  Q+   T   FSY
Sbjct: 223 GTLALETITFGRTLIRNVAIGCGHHNQGMFVGAAGLLGLGGGPMSFVGQLGGQTGGAFSY 282

Query: 58  CLVDRDSDSTSTLEF-DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
           CLV R  +S+  LEF   ++P  A   PL+ N    +FYY+GL+G+ VGG  + ISE  F
Sbjct: 283 CLVSRGIESSGLLEFGREAMPVGAAWVPLIHNPRAQSFYYIGLSGLGVGGLRVSISEDVF 342

Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSS 176
           K+ E G+GG+++D+GTAVTRL T  Y A RD F+  T  L    GV++FDTCYD     S
Sbjct: 343 KLSELGDGGVVMDTGTAVTRLPTVAYEAFRDGFIAQTTNLPRASGVSIFDTCYDLFGFVS 402

Query: 177 VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFN 236
           V VPTVSF+F  G +L LPA+N+LIPVD  GTFCFAFAP+SS LSIIGN+QQ+G ++S +
Sbjct: 403 VRVPTVSFYFSGGPILTLPARNFLIPVDDVGTFCFAFAPSSSGLSIIGNIQQEGIQISVD 462

Query: 237 LRNSLIGFTPNKC 249
             N  +GF PN C
Sbjct: 463 GANGFVGFGPNVC 475


>gi|357439021|ref|XP_003589787.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355478835|gb|AES60038.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 456

 Score =  231 bits (588), Expect = 3e-58,   Method: Compositional matrix adjust.
 Identities = 128/252 (50%), Positives = 169/252 (67%), Gaps = 15/252 (5%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAST---FSY 57
           G    ET+T+G   + + AIGCGH NEG+FVGAAGLLGLGGG +SF  Q+ A T   F Y
Sbjct: 217 GTLALETITIGRTVIQDTAIGCGHWNEGMFVGAAGLLGLGGGPMSFVGQLGAQTGGAFGY 276

Query: 58  CLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFK 117
           CLV R            ++P  A+  PL+ N    +FYY+ L+G++VGG  +PISE  F+
Sbjct: 277 CLVSR------------AMPVGAMWVPLIHNPFYPSFYYVSLSGLAVGGIRVPISEQIFQ 324

Query: 118 IDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSV 177
           + + G GG+++D+GTA+TRL T  YNA RDAF+  T  L    GV++FDTCYD +   +V
Sbjct: 325 LTDIGTGGVVMDTGTAITRLPTVAYNAFRDAFIAQTTNLPRAPGVSIFDTCYDLNGFVTV 384

Query: 178 EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNL 237
            VPTVSF+F  G++L  PA+N+LIP D  GTFCFAFAP+ S LSIIGN+QQ+G +VS + 
Sbjct: 385 RVPTVSFYFSGGQILTFPARNFLIPADDVGTFCFAFAPSPSGLSIIGNIQQEGIQVSIDG 444

Query: 238 RNSLIGFTPNKC 249
            N  +GF PN C
Sbjct: 445 TNGFVGFGPNVC 456


>gi|38344196|emb|CAE05761.2| OSJNBa0064G10.12 [Oryza sativa Japonica Group]
          Length = 451

 Score =  229 bits (584), Expect = 7e-58,   Method: Compositional matrix adjust.
 Identities = 119/252 (47%), Positives = 158/252 (62%), Gaps = 24/252 (9%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
           G+   ET+TLG  +V  +AIGCGH N GLFVGAAGLLGLG G++S   Q+  +    FSY
Sbjct: 221 GELALETLTLGGTAVQGVAIGCGHRNSGLFVGAAGLLGLGWGAMSLVGQLGGAAGGVFSY 280

Query: 58  CLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFK 117
           CL  R +    +L                      +FYY+GLTGI VGG+ LP+ ++ F+
Sbjct: 281 CLASRGAGGAGSLA--------------------SSFYYVGLTGIGVGGERLPLQDSLFQ 320

Query: 118 IDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSV 177
           + E G GG+++D+GTAVTRL  E Y ALR AF     AL  +  V+L DTCYD S  +SV
Sbjct: 321 LTEDGAGGVVMDTGTAVTRLPREAYAALRGAFDGAMGALPRSPAVSLLDTCYDLSGYASV 380

Query: 178 EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNL 237
            VPTVSF+F +G VL LPA+N L+ V     FC AFAP+SS +SI+GN+QQ+G +++ + 
Sbjct: 381 RVPTVSFYFDQGAVLTLPARNLLVEV-GGAVFCLAFAPSSSGISILGNIQQEGIQITVDS 439

Query: 238 RNSLIGFTPNKC 249
            N  +GF PN C
Sbjct: 440 ANGYVGFGPNTC 451


>gi|388515789|gb|AFK45956.1| unknown [Medicago truncatula]
          Length = 225

 Score =  228 bits (581), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 124/225 (55%), Positives = 159/225 (70%), Gaps = 4/225 (1%)

Query: 29  LFVGAAGLLGLGGGSLSFPSQINAS---TFSYCLVDRDSDSTSTLEF-DSSLPPNAVTAP 84
           +FVGAAGLLGLG G +SF  Q+      TFSYCLV R ++S+ +LEF   S+P  A    
Sbjct: 1   MFVGAAGLLGLGSGPMSFVGQLGGQAGGTFSYCLVSRGTESSGSLEFGRESVPVGASWVS 60

Query: 85  LLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNA 144
           L+ N    +FYY+GL+G+ VGG  +PISE  F+++E G GG+++D+GTAVTRL    YNA
Sbjct: 61  LIHNPRAPSFYYIGLSGLGVGGLRVPISEDIFRLNELGEGGVVMDTGTAVTRLPAAAYNA 120

Query: 145 LRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVD 204
            RDAFV  T  L  T GV++FDTCYD +   +V VPT+SF+F  G +L LPA+N+LIPVD
Sbjct: 121 FRDAFVAQTTNLPKTSGVSIFDTCYDLNGFVTVRVPTISFYFLGGPILTLPARNFLIPVD 180

Query: 205 SNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           S GTFCFAFAP+SS LSIIGN+QQ+G  +S +  N  IGF PN C
Sbjct: 181 SVGTFCFAFAPSSSGLSIIGNIQQEGIEISVDGANGYIGFGPNIC 225


>gi|168064509|ref|XP_001784204.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162664276|gb|EDQ51002.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 367

 Score =  227 bits (579), Expect = 3e-57,   Method: Compositional matrix adjust.
 Identities = 128/266 (48%), Positives = 167/266 (62%), Gaps = 18/266 (6%)

Query: 1   GDFVTETVTL------GSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAST 54
           G+ VT+ V L      G   + NI +GCGH+NEG F  AAG+LGLG G LSFP+ ++AST
Sbjct: 103 GELVTDNVVLDDAFGPGQVVLTNIPLGCGHDNEGTFGTAAGILGLGRGPLSFPNNLDAST 162

Query: 55  ---FSYCLVDRDSD--STSTLEFDSSLPPNAVTA-----PLLRNHELDTFYYLGLTGISV 104
              FSYCL DR+SD    STL F  +  P+  T      P LRN  + T+YY+ +TGISV
Sbjct: 163 RNIFSYCLPDRESDPNHKSTLVFGDAAIPHTATGSVKFIPQLRNPRVATYYYVQITGISV 222

Query: 105 GGDLLP-ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVA 163
           GG+LL  I  + F++D  GNGG I DSGT +TRL+   Y A+RDAF   T  L+      
Sbjct: 223 GGNLLTNIPASVFQLDSHGNGGTIFDSGTTITRLEARAYTAVRDAFRAATMHLTSAADFK 282

Query: 164 LFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSII 223
           +FDTCYDF+  +S+ VPTV+FHF     + LP  NY++PV +N  FCFAFA  S   S+I
Sbjct: 283 IFDTCYDFTGMNSISVPTVTFHFQGDVDMRLPPSNYIVPVSNNNIFCFAFA-ASMGPSVI 341

Query: 224 GNVQQQGTRVSFNLRNSLIGFTPNKC 249
           GNVQQQ  RV ++  +  IG  P++C
Sbjct: 342 GNVQQQSFRVIYDNVHKQIGLLPDQC 367


>gi|302809855|ref|XP_002986620.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
 gi|300145803|gb|EFJ12477.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
          Length = 423

 Score =  227 bits (578), Expect = 4e-57,   Method: Compositional matrix adjust.
 Identities = 136/259 (52%), Positives = 177/259 (68%), Gaps = 14/259 (5%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN---ASTFSY 57
           G+F TET++ GS +V+++AIGCGHNN+GLF GAAGLLGLG G LSFPSQ+     S FSY
Sbjct: 168 GEFSTETLSFGSNAVNSVAIGCGHNNQGLFTGAAGLLGLGKGLLSFPSQVGQLYGSVFSY 227

Query: 58  CLVDRDSDSTSTLEF-DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
           CL  R+S  +  L F + ++  NA    LL N +LDTFYY+ + GI VGG  + I   + 
Sbjct: 228 CLPTRESTGSVPLIFGNQAVASNAQFTTLLTNPKLDTFYYVEMVGIKVGGTSVSIPAGSL 287

Query: 117 KIDES-GNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTD-----GVALFDTCYD 170
            +D S GNGG+I+DSGTAVTRL T  YN +RDAF    RA  P+D     G +LFDTCYD
Sbjct: 288 SLDSSTGNGGVILDSGTAVTRLVTSAYNPMRDAF----RAGMPSDAKMTSGFSLFDTCYD 343

Query: 171 FSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQG 230
            S RSS+ +P VSF F  G  + LPA+N ++PVD++GT+C AFAP S + SIIGN+QQQ 
Sbjct: 344 LSGRSSIMLPAVSFVFNGGATMALPAQNIMVPVDNSGTYCLAFAPNSENFSIIGNIQQQS 403

Query: 231 TRVSFNLRNSLIGFTPNKC 249
            R+SF+   + +G   N+C
Sbjct: 404 FRMSFDSTGNRVGIGANQC 422


>gi|302774174|ref|XP_002970504.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
 gi|300162020|gb|EFJ28634.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
          Length = 483

 Score =  226 bits (577), Expect = 6e-57,   Method: Compositional matrix adjust.
 Identities = 134/262 (51%), Positives = 178/262 (67%), Gaps = 13/262 (4%)

Query: 1   GDFVTETVTLGSAS-VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQI--------N 51
           GDF ++  TLG+ S   ++A GCG +NEGLF GAAGLLGLG G LSFPSQI         
Sbjct: 221 GDFSSDLFTLGTGSKAMSVAFGCGFDNEGLFAGAAGLLGLGAGKLSFPSQIFASSTNSST 280

Query: 52  ASTFSYCLVDRD---SDSTSTLEF-DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGD 107
           A++FSYCLVDR    + S+S+L F  +++P  A  +PLL+N +LDTFYY  + G+SVGG 
Sbjct: 281 ANSFSYCLVDRSNPMTRSSSSLIFGAAAIPSTAALSPLLKNPKLDTFYYAAMIGVSVGGA 340

Query: 108 LLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDT 167
            LPIS  + ++ +SG+GG+I+DSGT+VTR  T  Y  +RDAF   T  L      +LFDT
Sbjct: 341 QLPISLKSLQLSQSGSGGVIIDSGTSVTRFPTSVYATIRDAFRNATTNLPSAPRYSLFDT 400

Query: 168 CYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQ 227
           CY+FS ++SV+VP +  HF  G  L LP  NYLIP+++ G+FC AFAPTS  L IIGN+Q
Sbjct: 401 CYNFSGKASVDVPALVLHFENGADLQLPPTNYLIPINTAGSFCLAFAPTSMELGIIGNIQ 460

Query: 228 QQGTRVSFNLRNSLIGFTPNKC 249
           QQ  R+ F+L+ S + F P +C
Sbjct: 461 QQSFRIGFDLQKSHLAFAPQQC 482


>gi|302763741|ref|XP_002965292.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
 gi|300167525|gb|EFJ34130.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
          Length = 423

 Score =  226 bits (576), Expect = 6e-57,   Method: Compositional matrix adjust.
 Identities = 136/259 (52%), Positives = 177/259 (68%), Gaps = 14/259 (5%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN---ASTFSY 57
           G+F TET++ GS +V+++AIGCGHNN+GLF GAAGLLGLG G LSFPSQ+     S FSY
Sbjct: 168 GEFSTETLSFGSNAVNSVAIGCGHNNQGLFTGAAGLLGLGKGLLSFPSQVGQLYGSVFSY 227

Query: 58  CLVDRDSDSTSTLEF-DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
           CL  R+S  +  L F + ++  NA    LL N +LDTFYY+ + GI VGG  + I   + 
Sbjct: 228 CLPTRESTGSVPLIFGNQAVASNAQFTTLLTNPKLDTFYYVEMVGIKVGGTSVNIPAGSL 287

Query: 117 KIDES-GNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTD-----GVALFDTCYD 170
            +D S GNGG+I+DSGTAVTRL T  YN +RDAF    RA  P+D     G +LFDTCYD
Sbjct: 288 SLDSSTGNGGVILDSGTAVTRLVTSAYNPMRDAF----RAGMPSDAKMTSGFSLFDTCYD 343

Query: 171 FSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQG 230
            S RSS+ +P VSF F  G  + LPA+N ++PVD++GT+C AFAP S + SIIGN+QQQ 
Sbjct: 344 LSGRSSIMLPAVSFVFNGGATMALPAQNIMVPVDNSGTYCLAFAPNSENFSIIGNIQQQS 403

Query: 231 TRVSFNLRNSLIGFTPNKC 249
            R+SF+   + +G   N+C
Sbjct: 404 FRMSFDSTGNRVGIGANQC 422


>gi|302793638|ref|XP_002978584.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
 gi|300153933|gb|EFJ20570.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
          Length = 407

 Score =  225 bits (574), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 134/262 (51%), Positives = 178/262 (67%), Gaps = 13/262 (4%)

Query: 1   GDFVTETVTLGSAS-VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQI--------N 51
           GDF ++  TLG+ S   ++A GCG +NEGLF GAAGLLGLG G LSFPSQI         
Sbjct: 146 GDFSSDLFTLGTGSKAMSVAFGCGFDNEGLFAGAAGLLGLGAGKLSFPSQIFASSTNSST 205

Query: 52  ASTFSYCLVDRD---SDSTSTLEFD-SSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGD 107
           A++FSYCLVDR    + S+S+L F  +++P  A  +PLL+N +LDTFYY  + G+SVGG 
Sbjct: 206 ANSFSYCLVDRSNPMTRSSSSLIFGVAAIPSTAALSPLLKNPKLDTFYYAAMIGVSVGGA 265

Query: 108 LLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDT 167
            LPIS  + ++ +SG+GG+I+DSGT+VTR  T  Y  +RDAF   T  L      +LFDT
Sbjct: 266 QLPISLKSLQLSQSGSGGVIIDSGTSVTRFPTSVYATIRDAFRNATINLPSAPRYSLFDT 325

Query: 168 CYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQ 227
           CY+FS ++SV+VP +  HF  G  L LP  NYLIP+++ G+FC AFAPTS  L IIGN+Q
Sbjct: 326 CYNFSGKASVDVPALVLHFENGADLQLPPTNYLIPINTAGSFCLAFAPTSMELGIIGNIQ 385

Query: 228 QQGTRVSFNLRNSLIGFTPNKC 249
           QQ  R+ F+L+ S + F P +C
Sbjct: 386 QQSFRIGFDLQKSHLAFAPQQC 407


>gi|115438214|ref|NP_001043485.1| Os01g0598600 [Oryza sativa Japonica Group]
 gi|15290061|dbj|BAB63755.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
           Group]
 gi|113533016|dbj|BAF05399.1| Os01g0598600 [Oryza sativa Japonica Group]
 gi|125526702|gb|EAY74816.1| hypothetical protein OsI_02707 [Oryza sativa Indica Group]
          Length = 500

 Score =  225 bits (573), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 147/267 (55%), Positives = 179/267 (67%), Gaps = 20/267 (7%)

Query: 1   GDFVTETVTLGS-ASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN---ASTFS 56
           GDF TET+T  S A V  +A+GCGH+NEGLFV AAGLLGLG GSLSFPSQI+     +FS
Sbjct: 236 GDFATETLTFASGARVPRVALGCGHDNEGLFVAAAGLLGLGRGSLSFPSQISRRFGRSFS 295

Query: 57  YCLVDRDSDSTST------LEFDS-SLPPNAVTA--PLLRNHELDTFYYLGLTGISVGGD 107
           YCLVDR S S S       + F S ++ P+A  +  P+++N  ++TFYY+ L GISVGG 
Sbjct: 296 YCLVDRTSSSASATSRSSTVTFGSGAVGPSAAASFTPMVKNPRMETFYYVQLMGISVGGA 355

Query: 108 LLP-ISETAFKIDES-GNGGIIVDSGTAVTRLQTETYNALRDAF---VRGTRALSPTDGV 162
            +P ++ +  ++D S G GG+IVDSGT+VTRL    Y ALRDAF     G R LSP  G 
Sbjct: 356 RVPGVAVSDLRLDPSTGRGGVIVDSGTSVTRLARPAYAALRDAFRAAAAGLR-LSP-GGF 413

Query: 163 ALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSI 222
           +LFDTCYD S    V+VPTVS HF  G    LP +NYLIPVDS GTFCFAFA T   +SI
Sbjct: 414 SLFDTCYDLSGLKVVKVPTVSMHFAGGAEAALPPENYLIPVDSRGTFCFAFAGTDGGVSI 473

Query: 223 IGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           IGN+QQQG RV F+     +GF P  C
Sbjct: 474 IGNIQQQGFRVVFDGDGQRLGFVPKGC 500


>gi|297834938|ref|XP_002885351.1| pepsin A [Arabidopsis lyrata subsp. lyrata]
 gi|297331191|gb|EFH61610.1| pepsin A [Arabidopsis lyrata subsp. lyrata]
          Length = 471

 Score =  224 bits (571), Expect = 3e-56,   Method: Compositional matrix adjust.
 Identities = 133/253 (52%), Positives = 174/253 (68%), Gaps = 4/253 (1%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAST---FSY 57
           G    ET+T     V N+A+GCGH N G+F+GAAGLLG+GGGS+SF  Q++  T   F Y
Sbjct: 219 GTLALETLTFAKTVVRNVAMGCGHRNRGMFIGAAGLLGIGGGSMSFVGQLSGQTGGAFGY 278

Query: 58  CLVDRDSDSTSTLEF-DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
           CLV R +DST +L F   +LP  A   PL+RN    +FYY+GL G+ VGG  +P+ +  F
Sbjct: 279 CLVSRGTDSTGSLVFGREALPVGASWVPLVRNPRAPSFYYVGLKGLGVGGVRIPLPDGVF 338

Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSS 176
            + E+G+GG+++D+GTAVTRL T  Y A RD F   T  L    GV++FDTCYD S   S
Sbjct: 339 DLTETGDGGVVMDTGTAVTRLPTGAYAAFRDGFKSQTANLPRASGVSIFDTCYDLSGFVS 398

Query: 177 VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFN 236
           V VPTVSF+F EG VL LPA+N+L+PVD +GT+CFAFA + + LSIIGN+QQ+G +VSF+
Sbjct: 399 VRVPTVSFYFTEGPVLTLPARNFLMPVDDSGTYCFAFAASPTGLSIIGNIQQEGIQVSFD 458

Query: 237 LRNSLIGFTPNKC 249
             N  +GF PN C
Sbjct: 459 GANGFVGFGPNVC 471


>gi|79407941|ref|NP_188636.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|75273243|sp|Q9LHE3.1|ASPG2_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 2;
           Short=AtASPG2; Flags: Precursor
 gi|11994777|dbj|BAB03167.1| nucleoid chloroplast DNA-binding protein-like [Arabidopsis
           thaliana]
 gi|28392860|gb|AAO41867.1| unknown protein [Arabidopsis thaliana]
 gi|332642798|gb|AEE76319.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 470

 Score =  224 bits (571), Expect = 3e-56,   Method: Compositional matrix adjust.
 Identities = 133/253 (52%), Positives = 174/253 (68%), Gaps = 4/253 (1%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAST---FSY 57
           G    ET+T     V N+A+GCGH N G+F+GAAGLLG+GGGS+SF  Q++  T   F Y
Sbjct: 218 GTLALETLTFAKTVVRNVAMGCGHRNRGMFIGAAGLLGIGGGSMSFVGQLSGQTGGAFGY 277

Query: 58  CLVDRDSDSTSTLEF-DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
           CLV R +DST +L F   +LP  A   PL+RN    +FYY+GL G+ VGG  +P+ +  F
Sbjct: 278 CLVSRGTDSTGSLVFGREALPVGASWVPLVRNPRAPSFYYVGLKGLGVGGVRIPLPDGVF 337

Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSS 176
            + E+G+GG+++D+GTAVTRL T  Y A RD F   T  L    GV++FDTCYD S   S
Sbjct: 338 DLTETGDGGVVMDTGTAVTRLPTAAYVAFRDGFKSQTANLPRASGVSIFDTCYDLSGFVS 397

Query: 177 VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFN 236
           V VPTVSF+F EG VL LPA+N+L+PVD +GT+CFAFA + + LSIIGN+QQ+G +VSF+
Sbjct: 398 VRVPTVSFYFTEGPVLTLPARNFLMPVDDSGTYCFAFAASPTGLSIIGNIQQEGIQVSFD 457

Query: 237 LRNSLIGFTPNKC 249
             N  +GF PN C
Sbjct: 458 GANGFVGFGPNVC 470


>gi|449464952|ref|XP_004150193.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
 gi|449526850|ref|XP_004170426.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
          Length = 476

 Score =  221 bits (564), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 132/253 (52%), Positives = 174/253 (68%), Gaps = 4/253 (1%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAST---FSY 57
           G    ET+T G   + NIAIGCGH N G+F+GAAGLLGLGGG++SF  Q+   T   FSY
Sbjct: 224 GTLALETLTFGRVLIRNIAIGCGHMNRGMFIGAAGLLGLGGGAMSFVGQLGGQTGGAFSY 283

Query: 58  CLVDRDSDSTSTLEF-DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
           CLV R ++ST TLEF   ++P  A   PL+RN    +FYY+GL+G+ VGG  +PI E  F
Sbjct: 284 CLVSRGTESTGTLEFGRGAMPVGAAWVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIF 343

Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSS 176
           ++ + G GG+++D+GTAVTRL    Y A RD F+  T  L  +D V++FDTCY+ +   S
Sbjct: 344 ELTDLGYGGVVMDTGTAVTRLPAPAYEAFRDTFIGQTANLPRSDRVSIFDTCYNLNGFVS 403

Query: 177 VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFN 236
           V VPTVSF+F  G +L LPA+N+LIPVD  GTFCFAFA ++S LSIIGN+QQ+G ++S +
Sbjct: 404 VRVPTVSFYFSGGPILTLPARNFLIPVDGEGTFCFAFAASASGLSIIGNIQQEGIQISID 463

Query: 237 LRNSLIGFTPNKC 249
             N  +GF P  C
Sbjct: 464 GSNGFVGFGPTIC 476


>gi|414584780|tpg|DAA35351.1| TPA: hypothetical protein ZEAMMB73_696016 [Zea mays]
          Length = 524

 Score =  216 bits (550), Expect = 7e-54,   Method: Compositional matrix adjust.
 Identities = 121/264 (45%), Positives = 154/264 (58%), Gaps = 16/264 (6%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
           G    ET+TLG  +V+ + IGCGH N GLFVGAAGL+GLG G +S   Q+       FSY
Sbjct: 261 GALALETLTLGGTAVEGVVIGCGHRNRGLFVGAAGLMGLGWGPMSLVGQLGGEVGGAFSY 320

Query: 58  CLVDR---------DSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDL 108
           CL  R         D      L    ++P  AV  PL+RN    +FYY+GL+GI VG + 
Sbjct: 321 CLASRGGYGSGAADDDAGWLVLGRSEAVPEGAVWVPLVRNPRAPSFYYVGLSGIEVGDER 380

Query: 109 LPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSP-TDGV--ALF 165
           LP+    F++ E G G +++D+GT VTRL  E Y ALRDAFV       P   GV  ++ 
Sbjct: 381 LPLQAGLFQLTEDGAGDVVMDTGTTVTRLPQEAYAALRDAFVGALAGAVPRAQGVSSSVL 440

Query: 166 DTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGN 225
           DTCYD S  +SV VPTVSF F     L L A+N L+ VD  G +C AFAP+SS LSI+GN
Sbjct: 441 DTCYDLSGYASVRVPTVSFCFDGDARLILAARNVLLEVD-MGIYCLAFAPSSSGLSIMGN 499

Query: 226 VQQQGTRVSFNLRNSLIGFTPNKC 249
            QQ G +++ +  N  IGF P  C
Sbjct: 500 TQQAGIQITVDSANGYIGFGPANC 523


>gi|302756591|ref|XP_002961719.1| hypothetical protein SELMODRAFT_64161 [Selaginella moellendorffii]
 gi|300170378|gb|EFJ36979.1| hypothetical protein SELMODRAFT_64161 [Selaginella moellendorffii]
          Length = 357

 Score =  214 bits (544), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 132/259 (50%), Positives = 168/259 (64%), Gaps = 10/259 (3%)

Query: 1   GDFVTETVTLG---SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---T 54
           GD   E+  LG   S ++ NIA GCGH+N GLF G AGLLG+GGG+LSF SQI AS    
Sbjct: 99  GDLGIESFYLGPNSSTAMRNIAFGCGHSNSGLFRGEAGLLGMGGGTLSFFSQIAASIGPA 158

Query: 55  FSYCLVDRDSD---STSTLEF-DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP 110
           FSYCLVDR S     +S L F  +++P  A   PLL+N  ++TFYY  LTGISVGG  LP
Sbjct: 159 FSYCLVDRYSQLQSRSSPLIFGRTAIPFAARFTPLLKNPRINTFYYAVLTGISVGGTPLP 218

Query: 111 ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYD 170
           I    F +  +G GG I+DSGT+VTR+    Y  LRDA+   +R L P  GV L DTC++
Sbjct: 219 IPPAQFALTGNGTGGAILDSGTSVTRVVPPAYAVLRDAYRAASRNLPPAPGVYLLDTCFN 278

Query: 171 FSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQG 230
           F    +V++P++  HF  G  + LP  N LIPVD +GTFC AFAP+S  +S+IGNVQQQ 
Sbjct: 279 FQGLPTVQIPSLVLHFDNGVDMVLPGGNILIPVDRSGTFCLAFAPSSMPISVIGNVQQQT 338

Query: 231 TRVSFNLRNSLIGFTPNKC 249
            R+ F+L+ SLI   P +C
Sbjct: 339 FRIGFDLQRSLIAIAPREC 357


>gi|302762735|ref|XP_002964789.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
 gi|300167022|gb|EFJ33627.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
          Length = 390

 Score =  214 bits (544), Expect = 4e-53,   Method: Compositional matrix adjust.
 Identities = 132/259 (50%), Positives = 167/259 (64%), Gaps = 10/259 (3%)

Query: 1   GDFVTETVTLG---SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---T 54
           GD   E+  LG   S ++ NIA GCGH+N GLF G AGLLG+GGG+LSF SQI AS    
Sbjct: 132 GDLGIESFYLGPNSSTAMRNIAFGCGHSNSGLFRGEAGLLGMGGGTLSFFSQIAASIGPA 191

Query: 55  FSYCLVDRDSD---STSTLEF-DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP 110
           FSYCLVDR S     +S L F  +++P  A   PLL+N  +DTFYY  LTGISVGG  LP
Sbjct: 192 FSYCLVDRYSQLQSRSSPLIFGRTAIPFAARFTPLLKNPRIDTFYYAILTGISVGGTALP 251

Query: 111 ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYD 170
           I    F +  +G GG I+DSGT+VTR+    Y  LRDA+   +R L P  GV L DTC++
Sbjct: 252 IPPAQFALTGNGTGGAILDSGTSVTRVVPAAYAVLRDAYRAASRNLPPAPGVYLLDTCFN 311

Query: 171 FSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQG 230
           F    +V++P++  HF     + LP  N LIPVD +GTFC AFAP+S  +S+IGNVQQQ 
Sbjct: 312 FQGLPTVQIPSLVLHFDNDVDMVLPGGNILIPVDRSGTFCLAFAPSSMPISVIGNVQQQT 371

Query: 231 TRVSFNLRNSLIGFTPNKC 249
            R+ F+L+ SLI   P +C
Sbjct: 372 FRIGFDLQRSLIAIAPREC 390


>gi|302817380|ref|XP_002990366.1| hypothetical protein SELMODRAFT_43971 [Selaginella moellendorffii]
 gi|300141928|gb|EFJ08635.1| hypothetical protein SELMODRAFT_43971 [Selaginella moellendorffii]
          Length = 420

 Score =  212 bits (539), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 126/253 (49%), Positives = 159/253 (62%), Gaps = 5/253 (1%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN---ASTFSY 57
           GDF TET++ G  +V ++A+GCG NN+GLF GAAGLLGLG G LSFPSQ     AS FSY
Sbjct: 169 GDFSTETLSFGEHAVRSVAMGCGRNNQGLFHGAAGLLGLGRGPLSFPSQTGTSYASVFSY 228

Query: 58  CLVDRDSDSTSTLEF-DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
           CL  R+S   ++L F  S++P  A    LL N  LDT+YY+GL  I V G  + I   AF
Sbjct: 229 CLPRRESAIAASLVFGPSAVPEKARFTKLLPNRRLDTYYYVGLARIRVAGSPVNIPPDAF 288

Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSS 176
            +   G GG+IVDSGTA++RL T  Y ALRDAF R         G++LFDTCYD SS  +
Sbjct: 289 AMGSRGTGGVIVDSGTAISRLTTPAYTALRDAF-RSLVTFPSAPGISLFDTCYDLSSMKT 347

Query: 177 VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFN 236
             +P V   F  G  +PLPA   L+ VD  GT+C AFAP   + SIIGNVQQQ  R+S +
Sbjct: 348 ATLPAVVLDFDGGASMPLPADGILVNVDDEGTYCLAFAPEEEAFSIIGNVQQQTFRISID 407

Query: 237 LRNSLIGFTPNKC 249
            +   +G  P++C
Sbjct: 408 NQKEQMGIAPDQC 420


>gi|302795261|ref|XP_002979394.1| hypothetical protein SELMODRAFT_53966 [Selaginella moellendorffii]
 gi|300153162|gb|EFJ19802.1| hypothetical protein SELMODRAFT_53966 [Selaginella moellendorffii]
          Length = 353

 Score =  211 bits (537), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 126/253 (49%), Positives = 159/253 (62%), Gaps = 5/253 (1%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN---ASTFSY 57
           GDF TET++ G  +V ++A+GCG NN+GLF GAAGLLGLG G LSFPSQ     AS FSY
Sbjct: 102 GDFSTETLSFGEHAVRSVAMGCGRNNQGLFHGAAGLLGLGRGPLSFPSQTGTSYASVFSY 161

Query: 58  CLVDRDSDSTSTLEF-DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
           CL  R+S   ++L F  S++P  A    LL N  LDT+YY+GL  I V G  + I   AF
Sbjct: 162 CLPRRESAIAASLVFGPSAVPEKARFTKLLPNRRLDTYYYVGLARIRVAGSPVNIPPDAF 221

Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSS 176
            +   G GG+IVDSGTA++RL T  Y ALRDAF R         G++LFDTCYD SS  +
Sbjct: 222 AMGSRGTGGVIVDSGTAISRLTTPAYTALRDAF-RSLVTFPSAPGISLFDTCYDLSSMKT 280

Query: 177 VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFN 236
             +P V   F  G  +PLPA   L+ VD  GT+C AFAP   + SIIGNVQQQ  R+S +
Sbjct: 281 ATLPAVVLDFDGGASMPLPADGILVNVDDEGTYCLAFAPEEEAFSIIGNVQQQTFRISID 340

Query: 237 LRNSLIGFTPNKC 249
            +   +G  P++C
Sbjct: 341 NQKEQMGIAPDQC 353


>gi|115465771|ref|NP_001056485.1| Os05g0590000 [Oryza sativa Japonica Group]
 gi|49328116|gb|AAT58814.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
 gi|113580036|dbj|BAF18399.1| Os05g0590000 [Oryza sativa Japonica Group]
          Length = 481

 Score =  211 bits (537), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 134/266 (50%), Positives = 170/266 (63%), Gaps = 18/266 (6%)

Query: 1   GDFVTETVTLGS-ASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFS 56
           GDF +ET+T    A V  +AIGCGH+NEGLF+ A+GLLGLG G LSFPSQI  S   +FS
Sbjct: 217 GDFASETLTFARGARVQRVAIGCGHDNEGLFIAASGLLGLGRGRLSFPSQIARSFGRSFS 276

Query: 57  YCLVDRDSD------STSTLEFDSSLPPNAVTA---PLLRNHELDTFYYLGLTGISVGGD 107
           YCLVDR S        +ST+ F +     A  A   P+ RN  + TFYY+ L G SVGG 
Sbjct: 277 YCLVDRTSSVRPSSTRSSTVTFGAGAVAAAAGASFTPMGRNPRMATFYYVHLLGFSVGGA 336

Query: 108 LLP-ISETAFKIDES-GNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRAL--SPTDGVA 163
            +  +S++  +++ + G GG+I+DSGT+VTRL    Y A+RDAF      L  SP  G +
Sbjct: 337 RVKGVSQSDLRLNPTTGRGGVILDSGTSVTRLARPVYEAVRDAFRAAAVGLRVSP-GGFS 395

Query: 164 LFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSII 223
           LFDTCY+ S R  V+VPTVS H   G  + LP +NYLIPVD++GTFCFA A T   +SII
Sbjct: 396 LFDTCYNLSGRRVVKVPTVSMHLAGGASVALPPENYLIPVDTSGTFCFAMAGTDGGVSII 455

Query: 224 GNVQQQGTRVSFNLRNSLIGFTPNKC 249
           GN+QQQG RV F+     +GF P  C
Sbjct: 456 GNIQQQGFRVVFDGDAQRVGFVPKSC 481


>gi|47777372|gb|AAT38006.1| unknow protein [Oryza sativa Japonica Group]
          Length = 475

 Score =  211 bits (537), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 134/266 (50%), Positives = 170/266 (63%), Gaps = 18/266 (6%)

Query: 1   GDFVTETVTLGS-ASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFS 56
           GDF +ET+T    A V  +AIGCGH+NEGLF+ A+GLLGLG G LSFPSQI  S   +FS
Sbjct: 211 GDFASETLTFARGARVQRVAIGCGHDNEGLFIAASGLLGLGRGRLSFPSQIARSFGRSFS 270

Query: 57  YCLVDRDSD------STSTLEFDSSLPPNAVTA---PLLRNHELDTFYYLGLTGISVGGD 107
           YCLVDR S        +ST+ F +     A  A   P+ RN  + TFYY+ L G SVGG 
Sbjct: 271 YCLVDRTSSVRPSSTRSSTVTFGAGAVAAAAGASFTPMGRNPRMATFYYVHLLGFSVGGA 330

Query: 108 LLP-ISETAFKIDES-GNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRAL--SPTDGVA 163
            +  +S++  +++ + G GG+I+DSGT+VTRL    Y A+RDAF      L  SP  G +
Sbjct: 331 RVKGVSQSDLRLNPTTGRGGVILDSGTSVTRLARPVYEAVRDAFRAAAVGLRVSP-GGFS 389

Query: 164 LFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSII 223
           LFDTCY+ S R  V+VPTVS H   G  + LP +NYLIPVD++GTFCFA A T   +SII
Sbjct: 390 LFDTCYNLSGRRVVKVPTVSMHLAGGASVALPPENYLIPVDTSGTFCFAMAGTDGGVSII 449

Query: 224 GNVQQQGTRVSFNLRNSLIGFTPNKC 249
           GN+QQQG RV F+     +GF P  C
Sbjct: 450 GNIQQQGFRVVFDGDAQRVGFVPKSC 475


>gi|326530426|dbj|BAJ97639.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 479

 Score =  210 bits (534), Expect = 5e-52,   Method: Compositional matrix adjust.
 Identities = 125/260 (48%), Positives = 169/260 (65%), Gaps = 14/260 (5%)

Query: 1   GDFVTETVTLG-SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAST---FS 56
           G    ET+T G S  V  +AIGCGH N GLFVGAAGLLGLG G +S   Q+  +    FS
Sbjct: 223 GVLAMETLTFGDSTPVQGVAIGCGHRNRGLFVGAAGLLGLGWGPMSLVGQLGGAAGGAFS 282

Query: 57  YCLVDRDSDS-TSTLEF--DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISE 113
           YCL  R +D+   +L F  D ++P  AV  PLLRN +  +FYY+GLTG+ VGG+ LP+ +
Sbjct: 283 YCLASRGADAGAGSLVFGRDDAMPVGAVWVPLLRNAQQPSFYYVGLTGLGVGGERLPLQD 342

Query: 114 TAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAF---VRGTRALSPTDGVALFDTCYD 170
             F + E G GG+++D+GTAVTRL  + Y ALRDAF   + G    +P  GV+L DTCYD
Sbjct: 343 GLFDLTEDGGGGVVMDTGTAVTRLPPDAYAALRDAFASTIGGDLPRAP--GVSLLDTCYD 400

Query: 171 FSSRSSVEVPTVSFHF-PEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQ 229
            S  +SV VPTV+ +F  +G  L LPA+N L+ +   G +C AFA ++S LSI+GN+QQQ
Sbjct: 401 LSGYASVRVPTVALYFGRDGAALTLPARNLLVEM-GGGVYCLAFAASASGLSILGNIQQQ 459

Query: 230 GTRVSFNLRNSLIGFTPNKC 249
           G +++ +  N  +GF P+ C
Sbjct: 460 GIQITVDSANGYVGFGPSTC 479


>gi|125553531|gb|EAY99240.1| hypothetical protein OsI_21202 [Oryza sativa Indica Group]
          Length = 475

 Score =  210 bits (534), Expect = 5e-52,   Method: Compositional matrix adjust.
 Identities = 133/266 (50%), Positives = 170/266 (63%), Gaps = 18/266 (6%)

Query: 1   GDFVTETVTLGS-ASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFS 56
           GDF +ET+T    A V  +AIGCGH+NEGLF+ A+GLLGLG G LSFP+QI  S   +FS
Sbjct: 211 GDFASETLTFARGARVQRVAIGCGHDNEGLFIAASGLLGLGRGRLSFPTQIARSFGRSFS 270

Query: 57  YCLVDRDSD------STSTLEFDSSLPPNAVTA---PLLRNHELDTFYYLGLTGISVGGD 107
           YCLVDR S        +ST+ F +     A  A   P+ RN  + TFYY+ L G SVGG 
Sbjct: 271 YCLVDRTSSVRPSSTRSSTVTFGAGAVAAAAGASFTPMGRNPRMATFYYVHLLGFSVGGA 330

Query: 108 LLP-ISETAFKIDES-GNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRAL--SPTDGVA 163
            +  +S++  +++ + G GG+I+DSGT+VTRL    Y A+RDAF      L  SP  G +
Sbjct: 331 RVKGVSQSDLRLNPTTGRGGVILDSGTSVTRLARPVYEAVRDAFRAAAVGLRVSP-GGFS 389

Query: 164 LFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSII 223
           LFDTCY+ S R  V+VPTVS H   G  + LP +NYLIPVD++GTFCFA A T   +SII
Sbjct: 390 LFDTCYNLSGRRVVKVPTVSMHLAGGASVALPPENYLIPVDTSGTFCFAMAGTDGGVSII 449

Query: 224 GNVQQQGTRVSFNLRNSLIGFTPNKC 249
           GN+QQQG RV F+     +GF P  C
Sbjct: 450 GNIQQQGFRVVFDGDAQRVGFVPKSC 475


>gi|242041575|ref|XP_002468182.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
 gi|241922036|gb|EER95180.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
          Length = 490

 Score =  208 bits (530), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 133/269 (49%), Positives = 168/269 (62%), Gaps = 26/269 (9%)

Query: 1   GDFVTETVTL-GSASVDNIAIGCGHNNEGLF-VGAAGLLGLGGGSLSFPSQINAS-TFSY 57
           GDF+ ET+T  G   +  I+IGCGH+N+GLF   AAG+LGLG G +SFP+QI+ + TFSY
Sbjct: 228 GDFIEETLTFAGGVRLPRISIGCGHDNKGLFGAPAAGILGLGRGLMSFPNQIDHNGTFSY 287

Query: 58  CLVDRDSDS---TSTLEFDSSL----PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP 110
           CLVD  S     +STL F +      PP + T P + N  + TFYY+ LTGISVGG  +P
Sbjct: 288 CLVDFLSGPGSLSSTLTFGAGAVDTSPPVSFT-PTVLNLNMPTFYYVRLTGISVGGVRVP 346

Query: 111 -ISETAFKIDE-SGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGV------ 162
            ++E   ++D  +G GG+IVDSGTAVTRL    Y A RDAF    RA++   G       
Sbjct: 347 GVTERDLQLDPYTGRGGVIVDSGTAVTRLARPAYTAFRDAF----RAVAVDLGQVSIGGP 402

Query: 163 -ALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS-SSL 220
              FDTCY    R   +VPTVS HF     + L  KNYLIPVDS GT CFAFA T   S+
Sbjct: 403 SGFFDTCYTVGGRGMKKVPTVSMHFAGSVEVKLQPKNYLIPVDSMGTVCFAFAATGDHSV 462

Query: 221 SIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           SIIGN+QQQG R+ +++    +GF PN C
Sbjct: 463 SIIGNIQQQGFRIVYDI-GGRVGFAPNSC 490


>gi|116787398|gb|ABK24493.1| unknown [Picea sitchensis]
          Length = 479

 Score =  201 bits (512), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 122/258 (47%), Positives = 151/258 (58%), Gaps = 15/258 (5%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
           GDF  ET+TLGS S  + A GCGH N GLF G+AGLLGLG  +LSFPSQ  +     FSY
Sbjct: 226 GDFSQETLTLGSDSFPSFAFGCGHTNTGLFKGSAGLLGLGRTALSFPSQTKSKYGGQFSY 285

Query: 58  CLVDRDSDSTSTLEF---DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISET 114
           CL D  S STST  F     S+P  A   PL+ N    +FY++GL GISVGG+ L I   
Sbjct: 286 CLPDFVS-STSTGSFSVGQGSIPATATFVPLVSNSNYPSFYFVGLNGISVGGERLSIPPA 344

Query: 115 AFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSR 174
                  G GG IVDSGT +TRL  + Y+AL+ +F   TR L      ++ DTCYD SS 
Sbjct: 345 VL-----GRGGTIVDSGTVITRLVPQAYDALKTSFRSKTRNLPSAKPFSILDTCYDLSSY 399

Query: 175 SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGT-FCFAFAPTSSSLS--IIGNVQQQGT 231
           S V +PT++FHF     + + A   L  + S+G+  C AFA  S S+S  IIGN QQQ  
Sbjct: 400 SQVRIPTITFHFQNNADVAVSAVGILFTIQSDGSQVCLAFASASQSISTNIIGNFQQQRM 459

Query: 232 RVSFNLRNSLIGFTPNKC 249
           RV+F+     IGF P  C
Sbjct: 460 RVAFDTGAGRIGFAPGSC 477


>gi|222632750|gb|EEE64882.1| hypothetical protein OsJ_19741 [Oryza sativa Japonica Group]
          Length = 456

 Score =  199 bits (506), Expect = 8e-49,   Method: Compositional matrix adjust.
 Identities = 127/257 (49%), Positives = 160/257 (62%), Gaps = 20/257 (7%)

Query: 1   GDFVTETVTLGS-ASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFS 56
           GDF +ET+T    A V  +AIGCGH+NEGLF+ A+GLLGLG G LSFPSQI  S   +FS
Sbjct: 212 GDFASETLTFARGARVQRVAIGCGHDNEGLFIAASGLLGLGRGRLSFPSQIARSFGRSFS 271

Query: 57  YCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP-ISETA 115
           YCLVDR S   +         P            + TFYY+ L G SVGG  +  +S++ 
Sbjct: 272 YCLVDRTSSRRARPSRRWGGTP-----------RMATFYYVHLLGFSVGGARVKGVSQSD 320

Query: 116 FKIDES-GNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRAL--SPTDGVALFDTCYDFS 172
            +++ + G GG+I+DSGT+VTRL    Y A+RDAF      L  SP  G +LFDTCY+ S
Sbjct: 321 LRLNPTTGRGGVILDSGTSVTRLARPVYEAVRDAFRAAAVGLRVSP-GGFSLFDTCYNLS 379

Query: 173 SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTR 232
            R  V+VPTVS H   G  + LP +NYLIPVD++GTFCFA A T   +SIIGN+QQQG R
Sbjct: 380 GRRVVKVPTVSMHLAGGASVALPPENYLIPVDTSGTFCFAMAGTDGGVSIIGNIQQQGFR 439

Query: 233 VSFNLRNSLIGFTPNKC 249
           V F+     +GF P  C
Sbjct: 440 VVFDGDAQRVGFVPKSC 456


>gi|148905906|gb|ABR16115.1| unknown [Picea sitchensis]
          Length = 482

 Score =  194 bits (492), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 116/257 (45%), Positives = 146/257 (56%), Gaps = 13/257 (5%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
           GDF  ET+TLGS S  N A GCGH N GLF G++GLLGLG  SLSFPSQ  +     F+Y
Sbjct: 229 GDFSQETLTLGSDSFQNFAFGCGHTNTGLFKGSSGLLGLGQNSLSFPSQSKSKYGGQFAY 288

Query: 58  CLVDRDSDSTSTLEF--DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETA 115
           CL D  S +++        S+P +AV  PL+ N    TFY++GL GISVGGD L I    
Sbjct: 289 CLPDFGSSTSTGSFSVGKGSIPASAVFTPLVSNFMYPTFYFVGLNGISVGGDRLSIPPAV 348

Query: 116 FKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRS 175
                 G G  IVDSGT +TRL  + YNAL+ +F   TR L      ++ DTCYD S  S
Sbjct: 349 L-----GRGSTIVDSGTVITRLLPQAYNALKTSFRSKTRDLPSAKPFSILDTCYDLSRHS 403

Query: 176 SVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGT-FCFAFAPTSS--SLSIIGNVQQQGTR 232
            V +PT++FHF     + +     L+PV + G+  C AFA  S     +IIGN QQQ  R
Sbjct: 404 QVRIPTITFHFQNNADVAVSDVGILVPVQNGGSQVCLAFASASQMDGFNIIGNFQQQRMR 463

Query: 233 VSFNLRNSLIGFTPNKC 249
           V+F+     IGF    C
Sbjct: 464 VAFDTGAGRIGFASGSC 480


>gi|357166728|ref|XP_003580821.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 479

 Score =  193 bits (491), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 116/265 (43%), Positives = 155/265 (58%), Gaps = 16/265 (6%)

Query: 1   GDFVTETVTL-GSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAST---FS 56
           G    ET+TL G   V  +A+GCGH N GLF  AAGLLGLG G +S   Q+  +    FS
Sbjct: 215 GVLALETLTLDGGTEVQGVAMGCGHENRGLFAEAAGLLGLGWGPMSLVGQLGGAAGGAFS 274

Query: 57  YCLVDRDSDSTS-----TLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPI 111
           YCL    S   S      L  + + P  AV  PL+RN +  +FYY+G+ G+ V G+ L +
Sbjct: 275 YCLAGYYSGEGSGSGSLVLGREDAAPTGAVWVPLVRNPDAPSFYYVGVNGLGVAGERLQL 334

Query: 112 SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSP-TDGVALFDTCYD 170
            +  F + + G GG+++D+GTAVTRL  E Y ALR AF       +P   GV+LFDTCYD
Sbjct: 335 QDGLFDLGDDGGGGVVMDTGTAVTRLPAEAYAALRGAFAGAFEEGAPRAPGVSLFDTCYD 394

Query: 171 FSSRSSVEVPTVSFHF------PEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIG 224
            S  +SV VPTV+ +F       E   L LPA+N L+PVD  GT+C AFA  +S  SI+G
Sbjct: 395 LSGYASVRVPTVALYFGGGGQGQEAASLTLPARNLLVPVDDGGTYCLAFAAVASGPSILG 454

Query: 225 NVQQQGTRVSFNLRNSLIGFTPNKC 249
           N+QQQG  ++ +  +  +GF P  C
Sbjct: 455 NIQQQGIEITVDSASGYVGFGPATC 479


>gi|242084330|ref|XP_002442590.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
 gi|241943283|gb|EES16428.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
          Length = 494

 Score =  191 bits (484), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 131/269 (48%), Positives = 164/269 (60%), Gaps = 22/269 (8%)

Query: 1   GDFVTETVTL-GSASVDNIAIGCGHNNEGLF-VGAAGLLGLGGGSLSFPSQI-----NAS 53
           GD V ET+T  G      ++IGCGH+N+GLF   AAG+LGLG G +S P QI     NAS
Sbjct: 228 GDLVEETLTFAGGVRQAYLSIGCGHDNKGLFGAPAAGILGLGRGQISIPHQIAFLGYNAS 287

Query: 54  TFSYCLVDRDS---DSTSTLEFDSSL----PPNAVTAPLLRNHELDTFYYLGLTGISVGG 106
            FSYCLVD  S     +STL F +      PP + T P + N  + TFYY+ L G+SVGG
Sbjct: 288 -FSYCLVDFISGPGSPSSTLTFGAGAVDTSPPASFT-PTVLNQNMPTFYYVRLIGVSVGG 345

Query: 107 DLLP-ISETAFKIDE-SGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSP--TDGV 162
             +P ++E   ++D  +G GG+I+DSGT VTRL    Y A RDAF     +L    T G 
Sbjct: 346 VRVPGVTERDLQLDPYTGRGGVILDSGTTVTRLARPAYVAFRDAFRAAATSLGQVSTGGP 405

Query: 163 A-LFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS-SSL 220
           + LFDTCY    R+ V+VP VS HF  G  + L  KNYLIPVDS GT CFAFA T   S+
Sbjct: 406 SGLFDTCYTVGGRAGVKVPAVSMHFAGGVEVSLQPKNYLIPVDSRGTVCFAFAGTGDRSV 465

Query: 221 SIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           S+IGN+ QQG RV ++L    +GF PN C
Sbjct: 466 SVIGNILQQGFRVVYDLAGQRVGFAPNNC 494


>gi|242084336|ref|XP_002442593.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
 gi|241943286|gb|EES16431.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
          Length = 482

 Score =  185 bits (470), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 126/258 (48%), Positives = 156/258 (60%), Gaps = 21/258 (8%)

Query: 1   GDFVTETVTLG-SASVDNIAIGCGHNNEGLFVG-AAGLLGLGGGSLSFPSQINAS---TF 55
           GDF  ET+T      V  +AIGCG +N+GLF   AAG+LGLG GSLSFPSQI      +F
Sbjct: 220 GDFGVETLTFPPGVRVPGVAIGCGSDNQGLFPAPAAGILGLGRGSLSFPSQIAGRYGRSF 279

Query: 56  SYCLVDRDSD-STSTLEFDSSLPPNAVTAP------LLRNHELDTFYYLGLTGISVGG-D 107
           SYCL  + +   +STL F S       T        +L N  + TFYY+GL GISVGG  
Sbjct: 280 SYCLAGQGTGGRSSTLTFGSGASATTTTTTPPSFTPMLTNSRMYTFYYVGLVGISVGGVR 339

Query: 108 LLPISETAFKIDES-GNGGIIVDSGTAVTRLQTETYNALRDAF-VRGTRAL---SPTDGV 162
           +  ++E+  ++D S G+GG+IVDSGTAVTRL    Y A RDAF V   + L   SP    
Sbjct: 340 VRGVTESDLRLDPSTGHGGVIVDSGTAVTRLSGPAYAAFRDAFRVAAVKELGWPSPGGPF 399

Query: 163 ALFDTCY-DFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSN-GTFCFAFAPTS-SS 219
           A FDTCY     R   +VP VS HF  G  + LP +NYLIPVDSN GT CFAFA +    
Sbjct: 400 AFFDTCYSSVRGRVMKKVPAVSMHFAGGVEVKLPPQNYLIPVDSNKGTMCFAFAGSGDRG 459

Query: 220 LSIIGNVQQQGTRVSFNL 237
           +SIIGN+Q QG RV +++
Sbjct: 460 VSIIGNIQLQGFRVVYDV 477


>gi|414880708|tpg|DAA57839.1| TPA: hypothetical protein ZEAMMB73_997765 [Zea mays]
          Length = 452

 Score =  181 bits (459), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 114/271 (42%), Positives = 152/271 (56%), Gaps = 25/271 (9%)

Query: 1   GDFVTETVTL-GSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFS 56
           GD  T+T+ L     V N+ +GCGH+NEGL   AAGLLG G G LSFP+Q+  +    FS
Sbjct: 182 GDLATDTLVLPDDTRVHNVTLGCGHDNEGLLASAAGLLGAGRGQLSFPTQLAPAYGHVFS 241

Query: 57  YCLVDRDS---DSTSTLEFDSS--LPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP- 110
           YCL DR S   +S+S L F  +  LP  A T PL  N    + YY+ + G SVGG+ +  
Sbjct: 242 YCLGDRMSRARNSSSYLVFGRTPELPSTAFT-PLRTNPRRPSLYYVDMVGFSVGGERVAG 300

Query: 111 ISETAFKIDES-GNGGIIVDSGTAVTRLQTETYNALRDAFVR-----GTRALSPTDGVAL 164
            S  +  ++ + G GG++VDSGTA++R   + Y A+RDAFV      G R L   +  ++
Sbjct: 301 FSNASLALNPATGRGGVVVDSGTAISRFTRDAYAAVRDAFVSHAAAAGMRRLR--NKFSV 358

Query: 165 FDTCYDFSSR---SSVEVPTVSFHFPEGKVLPLPAKNYLIPV---DSNGTFCFAFAPTSS 218
           FDTCYD       + V VP++  HF     + LP  NYLIPV   D    FC        
Sbjct: 359 FDTCYDVHGNGPGTGVRVPSIVLHFAAAADMALPQANYLIPVVGGDRRTYFCLGLQAADD 418

Query: 219 SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
            L+++GNVQQQG  V F++    IGFTPN C
Sbjct: 419 GLNVLGNVQQQGFGVVFDVERGRIGFTPNGC 449


>gi|242086412|ref|XP_002443631.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
 gi|241944324|gb|EES17469.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
          Length = 507

 Score =  181 bits (458), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 128/273 (46%), Positives = 161/273 (58%), Gaps = 26/273 (9%)

Query: 1   GDFVTETVTL-GSASVDNIAIGCGHNNEGLF-VGAAGLLGLGGGSLSFPSQI-----NAS 53
           GD V ET+T  G      ++IGCGH+N+GLF   AAG+LGL  G +S P QI     NAS
Sbjct: 237 GDLVEETLTFAGGVRQAYLSIGCGHDNKGLFGAPAAGILGLSRGQISIPHQIAFLGYNAS 296

Query: 54  TFSYCLVDRDS---DSTSTLEFDSSL----PPNAVTAPLLRNHELDTFYYLGLTGISVGG 106
            FSYCLVD  S     +STL F +      PP + T P + N  + TFYY+ L G+SVGG
Sbjct: 297 -FSYCLVDFISGPGSPSSTLTFGAGAVDTSPPASFT-PTVLNQNMPTFYYVRLIGVSVGG 354

Query: 107 DLLP-ISETAFKIDE-SGNGGIIVDSGTAVTRLQTETYNALRDAFVR---GTRALSPTDG 161
             +P ++E   ++D  +G+GG+I+DSGT VTRL    Y A RDAF     G   +S    
Sbjct: 355 VRVPGVTERDLQLDPYTGHGGVILDSGTTVTRLARPAYTAFRDAFRAAATGLGQVSTGGP 414

Query: 162 VALFDTCYDFSSRSS----VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS 217
             LFDTCY    R+     V+VP VS HF  G  L L  KNYLI VDS GT CFAFA T 
Sbjct: 415 SGLFDTCYTVGGRAGLRHCVKVPAVSMHFAGGVELSLQPKNYLITVDSRGTVCFAFAGTG 474

Query: 218 -SSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
             S+S+IGN+ QQG RV +++    +GF PN C
Sbjct: 475 DRSVSVIGNILQQGFRVVYDIGGQRVGFAPNSC 507


>gi|242084332|ref|XP_002442591.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
 gi|241943284|gb|EES16429.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
          Length = 493

 Score =  180 bits (456), Expect = 6e-43,   Method: Compositional matrix adjust.
 Identities = 133/273 (48%), Positives = 169/273 (61%), Gaps = 28/273 (10%)

Query: 1   GDFVTETVTL-GSASVDNIAIGCGHNNEGLFVG-AAGLLGLGGGSLSFPSQI-----NAS 53
           GDF+ ET+T  G   V +++IGCGH+N+GLF   AAG+LGLG G +S PSQI     N +
Sbjct: 225 GDFIEETLTFAGGVQVPHMSIGCGHDNKGLFAAPAAGILGLGRGQISCPSQIAALGYNVT 284

Query: 54  TFSYCLVD-------RDSDSTSTLEFDSSL--PPNAVTAPLLRNHELDTFYYLGLTGISV 104
           +FSYCL D       R   ST T+   ++   PP + T P ++N  + TFYY+ L G+SV
Sbjct: 285 SFSYCLADFFLSSPGRSVSSTLTIGDGAAAGSPPPSFT-PTVQNLNMATFYYVRLVGVSV 343

Query: 105 GGDLLPI-SETAFKIDE-SGNGGIIVDSGTAVTRLQTETYNALRDAFVR-----GTRALS 157
           GG  +P  +E   K+D  +G GG+I+DSGTAVTRL    Y A RDAF       G  ++ 
Sbjct: 344 GGVRVPGVTEDDLKLDPYTGRGGVILDSGTAVTRLARRAYIAFRDAFRAAAVDLGQVSIG 403

Query: 158 PTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT- 216
              G   FDTCY    R+ ++VPTVS HF  G  L LP KNYLIPVDS GT CFAFA T 
Sbjct: 404 GPSG--FFDTCYTMGGRA-MKVPTVSMHFAGGVELTLPPKNYLIPVDSMGTVCFAFAGTG 460

Query: 217 SSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
             S+SIIGN+QQQG RV +N+    +GF PN C
Sbjct: 461 DRSVSIIGNIQQQGFRVVYNIGGGRVGFAPNSC 493


>gi|224060469|ref|XP_002300215.1| predicted protein [Populus trichocarpa]
 gi|222847473|gb|EEE85020.1| predicted protein [Populus trichocarpa]
          Length = 439

 Score =  177 bits (448), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 108/256 (42%), Positives = 141/256 (55%), Gaps = 9/256 (3%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGL-FVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
           G   +ET+T G ASV N+A GCG +NEG  F   AGL+GLG G LS  SQ+    FSYCL
Sbjct: 183 GILASETLTFGKASVPNVAFGCGADNEGSGFSQGAGLVGLGRGPLSLVSQLKEPKFSYCL 242

Query: 60  VDRDSDSTSTLEFDSSLPPNA-----VTAPLLRNHELDTFYYLGLTGISVGGDLLPISET 114
              D   TSTL   S    NA      T PL+ +    +FYYL L GISVG   LPI ++
Sbjct: 243 TTVDDTKTSTLLMGSLASVNASSSAIKTTPLIHSPAHPSFYYLSLEGISVGDTRLPIKKS 302

Query: 115 AFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSR 174
            F + + G+GG+I+DSGT +T L+   +N +   F         + G    D C+   S 
Sbjct: 303 TFSLQDDGSGGLIIDSGTTITYLEESAFNLVAKEFTAKINLPVDSSGSTGLDVCFTLPSG 362

Query: 175 SS-VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRV 233
           S+ +EVP + FHF +G  L LPA+NY+I   S G  C A   +SS +SI GNVQQQ   V
Sbjct: 363 STNIEVPKLVFHF-DGADLELPAENYMIGDSSMGVACLAMG-SSSGMSIFGNVQQQNMLV 420

Query: 234 SFNLRNSLIGFTPNKC 249
             +L    + F P +C
Sbjct: 421 LHDLEKETLSFLPTQC 436


>gi|255563827|ref|XP_002522914.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223537841|gb|EEF39457.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 442

 Score =  177 bits (448), Expect = 5e-42,   Method: Compositional matrix adjust.
 Identities = 104/256 (40%), Positives = 137/256 (53%), Gaps = 9/256 (3%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGL-FVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
           G   TET T G  S+ N+  GCG +NEG  F   +GL+GLG G LS  SQ+  + FSYCL
Sbjct: 186 GTMATETFTFGKVSIPNVGFGCGEDNEGDGFTQGSGLVGLGRGPLSLVSQLKEAKFSYCL 245

Query: 60  VDRDSDSTSTLEFDSSLPPNAVTA-----PLLRNHELDTFYYLGLTGISVGGDLLPISET 114
              D   TSTL   S    N  +A     PL++N    +FYYL L GISVGG  LPI E+
Sbjct: 246 TSIDDTKTSTLLMGSLASVNGTSAAIRTTPLIQNPLQPSFYYLSLEGISVGGTRLPIKES 305

Query: 115 AFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDF-SS 173
            F++ + G GG+I+DSGT +T L+   ++ ++  F           G    + CY+  S 
Sbjct: 306 TFQLQDDGTGGLIIDSGTTITYLEESAFDLVKKEFTSQMGLPVDNSGATGLELCYNLPSD 365

Query: 174 RSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRV 233
            S +EVP +  HF  G  L LP +NY+I   S G  C A   +S  +SI GNVQQQ   V
Sbjct: 366 TSELEVPKLVLHF-TGADLELPGENYMIADSSMGVICLAMG-SSGGMSIFGNVQQQNMFV 423

Query: 234 SFNLRNSLIGFTPNKC 249
           S +L    + F P  C
Sbjct: 424 SHDLEKETLSFLPTNC 439


>gi|356516413|ref|XP_003526889.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 453

 Score =  176 bits (446), Expect = 9e-42,   Method: Compositional matrix adjust.
 Identities = 112/260 (43%), Positives = 143/260 (55%), Gaps = 14/260 (5%)

Query: 1   GDFVTETVTLGSA----SVDNIAIGCGHNNEGL-FVGAAGLLGLGGGSLSFPSQINASTF 55
           G   TET T G +    SV NI  GCG +NEG  F  A+GL+GLG G LS  SQ+    F
Sbjct: 194 GVLATETFTFGKSKNKVSVHNIGFGCGEDNEGDGFEQASGLVGLGRGPLSLVSQLKEQRF 253

Query: 56  SYCLVDRDSDSTSTLEFDS----SLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPI 111
           SYCL   D    S L   S          VT PLL+N    +FYYL L  ISVG   L I
Sbjct: 254 SYCLTPIDDTKESVLLLGSLGKVKDAKEVVTTPLLKNPLQPSFYYLSLEAISVGDTRLSI 313

Query: 112 SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTR-ALSPTDGVALFDTCYD 170
            ++ F++ + GNGG+I+DSGT +T +Q + Y AL+  F+  T+ AL  T    L D C+ 
Sbjct: 314 EKSTFEVGDDGNGGVIIDSGTTITYVQQKAYEALKKEFISQTKLALDKTSSTGL-DLCFS 372

Query: 171 FSSRSS-VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQ 229
             S S+ VE+P + FHF +G  L LPA+NY+I   + G  C A    SS +SI GNVQQQ
Sbjct: 373 LPSGSTQVEIPKLVFHF-KGGDLELPAENYMIGDSNLGVACLAMG-ASSGMSIFGNVQQQ 430

Query: 230 GTRVSFNLRNSLIGFTPNKC 249
              V+ +L    I F P  C
Sbjct: 431 NILVNHDLEKETISFVPTSC 450


>gi|357130848|ref|XP_003567056.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 448

 Score =  176 bits (445), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 106/251 (42%), Positives = 145/251 (57%), Gaps = 17/251 (6%)

Query: 14  SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSYCLVD--RDSDSTS 68
           SV N+ +GCGH+NEGLF  AAGLLG+  G+ SF +Q+  S    F+YCL D  R   S+S
Sbjct: 200 SVGNVTLGCGHDNEGLFGSAAGLLGVARGNNSFATQVADSYGRYFAYCLGDRTRSGSSSS 259

Query: 69  TLEFDSSL--PPNAVTAPLLRNHELDTFYYLGLTGISVGGD-LLPISETAFKIDES-GNG 124
            L F  +   PP++V  PL  N    + YY+ + G SVGG+ +   S  +  +D + G G
Sbjct: 260 YLVFGRTAPEPPSSVFTPLRSNPRRPSLYYVDMVGFSVGGEPVTGFSNASLSLDPATGRG 319

Query: 125 GIIVDSGTAVTRLQTETYNALRDAF-----VRGTRALSPTDGVALFDTCYDFSSRSSVEV 179
           G++VDSGT++TR   + Y ALRDAF       G R +    G+++FD CYD    +  + 
Sbjct: 320 GVVVDSGTSITRFARDAYGALRDAFDARAAKVGMRKVG--RGISVFDACYDLRGVAVADA 377

Query: 180 PTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAF-APTSSSLSIIGNVQQQGTRVSFNLR 238
           P V  HF  G  + LP +NYL+P +S    CFA  A     LS+IGNV QQ  RV F++ 
Sbjct: 378 PGVVLHFAGGADVALPPENYLVPEESGRYHCFALEAAGHDGLSVIGNVLQQRFRVVFDVE 437

Query: 239 NSLIGFTPNKC 249
           N  +GF PN C
Sbjct: 438 NERVGFEPNGC 448


>gi|168041176|ref|XP_001773068.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162675615|gb|EDQ62108.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 365

 Score =  175 bits (444), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 111/264 (42%), Positives = 149/264 (56%), Gaps = 17/264 (6%)

Query: 1   GDFVTETVTLG-----SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINA--- 52
           GDFV +T+T+         V N A GCGH+NEG F GA G+LGLG G LSF SQ+ +   
Sbjct: 100 GDFVYDTITMDGINGQKQQVPNFAFGCGHDNEGSFAGADGILGLGQGPLSFHSQLKSVYN 159

Query: 53  STFSYCLVDRDSDSTST---LEFDSSLP--PNAVTAPLLRNHELDTFYYLGLTGISVGGD 107
             FSYCLVD  +  T T   L  D+++P  P+    P+L N ++ T+YY+ L GISVG +
Sbjct: 160 GKFSYCLVDWLAPPTQTSPLLFGDAAVPILPDVKYLPILANPKVPTYYYVKLNGISVGDN 219

Query: 108 LLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALS-PTDGVALFD 166
           LL IS T F ID  G  G I DSGT VT+L    Y  +  A    T A S   D ++  D
Sbjct: 220 LLNISSTVFDIDSVGGAGTIFDSGTTVTQLAEAAYKEVLAAMNASTMAYSRKIDDISRLD 279

Query: 167 TCYD-FSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGN 225
            C   F       VP ++FHF EG  + LP  NY I ++S+ ++CFA   +S  ++IIG+
Sbjct: 280 LCLSGFPKDQLPTVPAMTFHF-EGGDMVLPPSNYFIYLESSQSYCFAMT-SSPDVNIIGS 337

Query: 226 VQQQGTRVSFNLRNSLIGFTPNKC 249
           VQQQ  +V ++     +GF P  C
Sbjct: 338 VQQQNFQVYYDTAGRKLGFVPKDC 361


>gi|225438315|ref|XP_002272802.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 436

 Score =  175 bits (443), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 105/253 (41%), Positives = 136/253 (53%), Gaps = 6/253 (2%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGL-FVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
           G   TET   G ASV  I  GCG +N+G  F   AGL+GLG G LS  SQ+    FSYCL
Sbjct: 183 GVLATETFAFGDASVSKIGFGCGEDNDGSGFSQGAGLVGLGRGPLSLISQLGEPKFSYCL 242

Query: 60  VDRDSDS--TSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFK 117
              D     +S L    +   NA+T PL++N    +FYYL L GISVG  LLPI ++ F 
Sbjct: 243 TSMDDSKGISSLLVGSEATMKNAITTPLIQNPSQPSFYYLSLEGISVGDTLLPIEKSTFS 302

Query: 118 IDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDF-SSRSS 176
           I   G+GG+I+DSGT +T L+   + AL+  F+   +      G    D C+      S+
Sbjct: 303 IQNDGSGGLIIDSGTTITYLEDSAFAALKKEFISQLKLDVDESGSTGLDLCFTLPPDAST 362

Query: 177 VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFN 236
           V+VP + FHF EG  L LPA+NY+I     G  C     +SS +SI GN QQQ   V  +
Sbjct: 363 VDVPQLVFHF-EGADLKLPAENYIIADSGLGVICLTMG-SSSGMSIFGNFQQQNIVVLHD 420

Query: 237 LRNSLIGFTPNKC 249
           L    I F P +C
Sbjct: 421 LEKETISFAPAQC 433


>gi|356508918|ref|XP_003523200.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 453

 Score =  174 bits (440), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 110/260 (42%), Positives = 143/260 (55%), Gaps = 14/260 (5%)

Query: 1   GDFVTETVTLGSA----SVDNIAIGCGHNNEGL-FVGAAGLLGLGGGSLSFPSQINASTF 55
           G   TET T G +    SV NI  GCG +NEG  F  A+GL+GLG G LS  SQ+    F
Sbjct: 194 GVLATETFTFGKSKNKVSVHNIGFGCGEDNEGDGFEQASGLVGLGRGPLSLVSQLKEPRF 253

Query: 56  SYCLVDRDSDSTSTLEFDS----SLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPI 111
           SYCL   D    S L   S          VT PLL+N    +FYYL L GISVG   L I
Sbjct: 254 SYCLTPMDDTKESILLLGSLGKVKDAKEVVTTPLLKNPLQPSFYYLSLEGISVGDTRLSI 313

Query: 112 SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTR-ALSPTDGVALFDTCYD 170
            ++ F++ + GNGG+I+DSGT +T ++ + + AL+  F+  T+  L  T    L D C+ 
Sbjct: 314 EKSTFEVGDDGNGGVIIDSGTTITYIEQKAFEALKKEFISQTKLPLDKTSSTGL-DLCFS 372

Query: 171 FSSRSS-VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQ 229
             S S+ VE+P + FHF +G  L LPA+NY+I   + G  C A    SS +SI GNVQQQ
Sbjct: 373 LPSGSTQVEIPKIVFHF-KGGDLELPAENYMIGDSNLGVACLAMG-ASSGMSIFGNVQQQ 430

Query: 230 GTRVSFNLRNSLIGFTPNKC 249
              V+ +L    I F P  C
Sbjct: 431 NILVNHDLEKETISFVPTSC 450


>gi|147866052|emb|CAN80962.1| hypothetical protein VITISV_022007 [Vitis vinifera]
          Length = 150

 Score =  174 bits (440), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 83/146 (56%), Positives = 107/146 (73%)

Query: 104 VGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVA 163
           VGG  +PISE  F++ E G+GG+++D+GTAVTRL T  Y A RDAF+  T  L    GVA
Sbjct: 5   VGGIRVPISEEVFRLTELGDGGVVMDTGTAVTRLPTLAYQAFRDAFLAQTANLPRATGVA 64

Query: 164 LFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSII 223
           +FDTCYD     SV VPTVSF+F  G +L LPA+N+LIP+D  GTFCFAFAP++S LSI+
Sbjct: 65  IFDTCYDLLGFVSVRVPTVSFYFSGGPILTLPARNFLIPMDDAGTFCFAFAPSTSGLSIL 124

Query: 224 GNVQQQGTRVSFNLRNSLIGFTPNKC 249
           GN+QQ+G ++SF+  N  +GF PN C
Sbjct: 125 GNIQQEGIQISFDGANGYVGFGPNIC 150


>gi|165292434|dbj|BAF98915.1| aspartic proteinase nepenthesin I [Nepenthes alata]
          Length = 437

 Score =  174 bits (440), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 108/258 (41%), Positives = 154/258 (59%), Gaps = 14/258 (5%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVG-AAGLLGLGGGSLSFPSQINASTFSYCL 59
           G   TET+T GS S+ NI  GCG NN+G   G  AGL+G+G G LS PSQ++ + FSYC+
Sbjct: 182 GSMGTETLTFGSVSIPNITFGCGENNQGFGQGNGAGLVGMGRGPLSLPSQLDVTKFSYCM 241

Query: 60  VDRDSDSTSTLEFDSSLPPNAVTA-----PLLRNHELDTFYYLGLTGISVGGDLLPISET 114
               S ++STL   S    N+VTA      L+++ ++ TFYY+ L G+SVG   LPI  +
Sbjct: 242 TPIGSSNSSTLLLGSL--ANSVTAGSPNTTLIQSSQIPTFYYITLNGLSVGSTPLPIDPS 299

Query: 115 AFKID-ESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVAL-FDTCYDF- 171
            FK++  +G GGII+DSGT +T      Y A+R AF+     LS  +G +  FD C+   
Sbjct: 300 VFKLNSNNGTGGIIIDSGTTLTYFVDNAYQAVRQAFISQMN-LSVVNGSSSGFDLCFQMP 358

Query: 172 SSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGT 231
           S +S++++PT   HF +G  L LP++NY I   SNG  C A   +S  +SI GN+QQQ  
Sbjct: 359 SDQSNLQIPTFVMHF-DGGDLVLPSENYFIS-PSNGLICLAMGSSSQGMSIFGNIQQQNL 416

Query: 232 RVSFNLRNSLIGFTPNKC 249
            V ++  NS++ F   +C
Sbjct: 417 LVVYDTGNSVVSFLSAQC 434


>gi|409179878|gb|AFV26024.1| aspartic proteinase nepenthesin 1 [Nepenthes mirabilis]
          Length = 437

 Score =  172 bits (437), Expect = 8e-41,   Method: Compositional matrix adjust.
 Identities = 108/258 (41%), Positives = 153/258 (59%), Gaps = 14/258 (5%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVG-AAGLLGLGGGSLSFPSQINASTFSYCL 59
           G   TET+T GS S+ NI  GCG NN+G   G  AGL+G+G G LS PSQ++ + FSYC+
Sbjct: 182 GSMGTETLTFGSVSIPNITFGCGENNQGFGQGNGAGLVGMGRGPLSLPSQLDVTKFSYCM 241

Query: 60  VDRDSDSTSTLEFDSSLPPNAVTA-----PLLRNHELDTFYYLGLTGISVGGDLLPISET 114
               S ++STL   S    N+VTA      L+ + ++ TFYY+ L G+SVG   LPI  +
Sbjct: 242 TPIGSSTSSTLLLGSL--ANSVTAGSPNTTLIESSQIPTFYYITLNGLSVGSTPLPIDPS 299

Query: 115 AFKID-ESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVAL-FDTCYDF- 171
            FK++  +G GGII+DSGT +T      Y A+R AF+     LS  +G +  FD C+   
Sbjct: 300 VFKLNSNNGTGGIIIDSGTTLTYFADNAYQAVRQAFISQMN-LSVVNGSSSGFDLCFQMP 358

Query: 172 SSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGT 231
           S +S++++PT   HF +G  L LP++NY I   SNG  C A   +S  +SI GN+QQQ  
Sbjct: 359 SDQSNLQIPTFVMHF-DGGDLVLPSENYFIS-PSNGLICLAMGSSSQGMSIFGNIQQQNL 416

Query: 232 RVSFNLRNSLIGFTPNKC 249
            V ++  NS++ F   +C
Sbjct: 417 LVVYDTGNSVVSFLFAQC 434


>gi|168059885|ref|XP_001781930.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162666576|gb|EDQ53226.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 355

 Score =  172 bits (437), Expect = 9e-41,   Method: Compositional matrix adjust.
 Identities = 111/264 (42%), Positives = 149/264 (56%), Gaps = 17/264 (6%)

Query: 1   GDFVTETVTL-----GSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINA--- 52
           GDFV +T+T+         V N A GCGH+NEG F GA G+LGLG G LSFPSQ+     
Sbjct: 90  GDFVYDTITMDGINGQKQQVPNFAFGCGHDNEGSFAGADGILGLGQGPLSFPSQLKTVFN 149

Query: 53  STFSYCLVDRDSDSTST---LEFDSSLP--PNAVTAPLLRNHELDTFYYLGLTGISVGGD 107
             FSYCLVD  +  T T   L  D+++P  P      LL N ++ T+YY+ L GISVGG 
Sbjct: 150 GKFSYCLVDWLAPPTQTSPLLFGDAAVPTFPGVKYISLLTNPKVPTYYYVKLNGISVGGK 209

Query: 108 LLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRAL-SPTDGVALFD 166
           LL IS TAF ID  G  G I DSGT VT+L  E +  +  A    T      +D  +  D
Sbjct: 210 LLNISSTAFDIDSVGRAGTIFDSGTTVTQLAGEVHQEVLAAMNASTMDYPRKSDDSSGLD 269

Query: 167 TCY-DFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGN 225
            C   F+      VP+++FHF EG  + LP  NY I ++S+ ++CF+   +S  ++IIG+
Sbjct: 270 LCLGGFAEGQLPTVPSMTFHF-EGGDMELPPSNYFIFLESSQSYCFSMV-SSPDVTIIGS 327

Query: 226 VQQQGTRVSFNLRNSLIGFTPNKC 249
           +QQQ  +V ++     IGF P  C
Sbjct: 328 IQQQNFQVYYDTVGRKIGFVPKSC 351


>gi|242058537|ref|XP_002458414.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
 gi|241930389|gb|EES03534.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
          Length = 448

 Score =  171 bits (433), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 106/270 (39%), Positives = 147/270 (54%), Gaps = 21/270 (7%)

Query: 1   GDFVTETVTL-GSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFS 56
           GD  T+ +       V N+ +GCGH+N GL   AAGLLG+G G LSFP+Q+  +    FS
Sbjct: 178 GDLATDRLVFPDDTHVHNVTLGCGHDNVGLLESAAGLLGVGRGQLSFPTQLAPAYGHVFS 237

Query: 57  YCLVDRDS---DSTSTLEF-DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP-I 111
           YCL DR S   + +S L F  +  PP+    PL  N    + YY+ + G SVGG+ +   
Sbjct: 238 YCLGDRLSRAQNGSSYLVFGRTPEPPSTAFTPLRTNPRRPSLYYVDMVGFSVGGERVTGF 297

Query: 112 SETAFKIDES-GNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVA----LFD 166
           S  +  ++ + G GGI+VDSGTA++R   + Y A+RDAF     A      +A    +FD
Sbjct: 298 SNASLALNPATGRGGIVVDSGTAISRFARDAYAAVRDAFDSHAAAAGTMRKLATKFSVFD 357

Query: 167 TCYDF----SSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPV---DSNGTFCFAFAPTSSS 219
            CYD     +  ++V VP++  HF  G  + LP  NYLIPV   D    FC         
Sbjct: 358 ACYDLRGNGAPAAAVRVPSIVLHFAGGADMALPQANYLIPVQGGDRRTYFCLGLQAADDG 417

Query: 220 LSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           L+++GNVQQQG  + F++    IGFTPN C
Sbjct: 418 LNVLGNVQQQGFGLVFDVERGRIGFTPNGC 447


>gi|224126751|ref|XP_002329464.1| predicted protein [Populus trichocarpa]
 gi|222870144|gb|EEF07275.1| predicted protein [Populus trichocarpa]
          Length = 439

 Score =  169 bits (429), Expect = 7e-40,   Method: Compositional matrix adjust.
 Identities = 102/256 (39%), Positives = 137/256 (53%), Gaps = 9/256 (3%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGL-FVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
           G   +ET+T G  SV  +A GCG +NEG  F   +GL+GLG G LS  SQ+    FSYCL
Sbjct: 183 GMLASETLTFGKVSVPEVAFGCGEDNEGSGFSQGSGLVGLGRGPLSLVSQLKEPKFSYCL 242

Query: 60  VDRDSDSTSTLEFDSSLPPNA-----VTAPLLRNHELDTFYYLGLTGISVGGDLLPISET 114
              D    STL   S     A      T PL++N    +FYYL L GISVG   LPI ++
Sbjct: 243 TSVDDTKASTLLMGSLASVKASDSEIKTTPLIQNSAQPSFYYLSLEGISVGDTSLPIKKS 302

Query: 115 AFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSR 174
            F + E G+GG+I+DSGT +T L+   ++ +   F           G    + C+   S 
Sbjct: 303 TFSLQEDGSGGLIIDSGTTITYLEQSAFDLVAKEFTSQINLPVDNSGSTGLEVCFTLPSG 362

Query: 175 SS-VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRV 233
           S+ +EVP + FHF +G  L LPA+NY+I   S G  C A   +SS +SI GN+QQQ   V
Sbjct: 363 STDIEVPKLVFHF-DGADLELPAENYMIADASMGVACLAMG-SSSGMSIFGNIQQQNMLV 420

Query: 234 SFNLRNSLIGFTPNKC 249
             +L    + F P +C
Sbjct: 421 LHDLEKETLSFLPTQC 436


>gi|3641868|emb|CAA09458.1| hypothetical protein [Cicer arietinum]
          Length = 110

 Score =  169 bits (427), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 80/110 (72%), Positives = 92/110 (83%)

Query: 140 ETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNY 199
           + Y ++RDAF R T+ L   +GVA+FDTCYD SS  SV VPTVSFHF   +V  LPAKNY
Sbjct: 1   QAYESVRDAFKRLTQNLRSAEGVAIFDTCYDLSSLRSVRVPTVSFHFGNDRVWDLPAKNY 60

Query: 200 LIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           LIPVDS+GTFCFAFAPTSSSLSIIGNVQQQGTRVSF++ NSL+GF+PNKC
Sbjct: 61  LIPVDSDGTFCFAFAPTSSSLSIIGNVQQQGTRVSFDIANSLVGFSPNKC 110


>gi|147862576|emb|CAN79341.1| hypothetical protein VITISV_006338 [Vitis vinifera]
          Length = 436

 Score =  168 bits (425), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 104/253 (41%), Positives = 135/253 (53%), Gaps = 6/253 (2%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEG-LFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
           G   TET T G ASV  I  GCG +N G  +   AGL+GLG G LS  SQ+    FSYCL
Sbjct: 183 GVLATETFTFGDASVSKIGFGCGEDNRGRAYSQGAGLVGLGRGPLSLISQLGVPKFSYCL 242

Query: 60  VD-RDSDSTSTLEFDSSLP-PNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFK 117
               DS   STL   S     +A+  PL++N    +FYYL L GISVG  LLPI ++ F 
Sbjct: 243 TSIDDSKGISTLLVGSEATVKSAIPTPLIQNPSRPSFYYLSLEGISVGDTLLPIEKSTFS 302

Query: 118 IDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSS- 176
           I + G+GG+I+DSGT +T L+   + AL+  F+   +      G    + C+      S 
Sbjct: 303 IQDDGSGGLIIDSGTTITYLKDNAFAALKKEFISQMKLDVDASGSTELELCFTLPPDGSP 362

Query: 177 VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFN 236
           VEVP + FHF EG  L LP +NY+I   +    C     +SS +SI GN QQQ   V  +
Sbjct: 363 VEVPQLVFHF-EGVDLKLPKENYIIEDSALRVICLTMG-SSSGMSIFGNFQQQNIVVLHD 420

Query: 237 LRNSLIGFTPNKC 249
           L    I F P +C
Sbjct: 421 LEKETISFAPAQC 433


>gi|294461757|gb|ADE76437.1| unknown [Picea sitchensis]
          Length = 325

 Score =  168 bits (425), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 104/260 (40%), Positives = 145/260 (55%), Gaps = 23/260 (8%)

Query: 1   GDFVTETVTLGS-----ASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS-- 53
           GDF  ET+TL S      SV N A GCGH N+GLF GAAGL+GLG  S+ FP+Q + +  
Sbjct: 77  GDFALETLTLRSDDTILVSVPNFAFGCGHANKGLFNGAAGLMGLGKSSIGFPAQTSVAFG 136

Query: 54  -TFSYCLVDRDSDSTS-TLEFDSS--LPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLL 109
             FSYCL    S   S  L F  +  L  +    PL+ +    + Y++ +TGI+VG +LL
Sbjct: 137 KVFSYCLPSVSSTIPSGILHFGEAAMLDYDVRFTPLVDSSSGPSQYFVSMTGINVGDELL 196

Query: 110 PISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCY 169
           PIS T           ++VDSGT ++R +   Y  LRDAF +    L     VA FDTC+
Sbjct: 197 PISAT-----------VMVDSGTVISRFEQSAYERLRDAFTQILPGLQTAVSVAPFDTCF 245

Query: 170 DFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQ 229
             S+   + +P ++ HF +   L L   + L PVD +G  CFAFAP+SS  S++GN QQQ
Sbjct: 246 RVSTVDDINIPLITLHFRDDAELRLSPVHILYPVD-DGVMCFAFAPSSSGRSVLGNFQQQ 304

Query: 230 GTRVSFNLRNSLIGFTPNKC 249
             R  +++  S +G +  +C
Sbjct: 305 NLRFVYDIPKSRLGISAFEC 324


>gi|225437854|ref|XP_002264056.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
          Length = 436

 Score =  166 bits (421), Expect = 6e-39,   Method: Compositional matrix adjust.
 Identities = 103/253 (40%), Positives = 135/253 (53%), Gaps = 6/253 (2%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEG-LFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
           G   TET T G ASV  I  GCG +N G  +   AGL+GLG G LS  SQ+    FSYCL
Sbjct: 183 GVLATETFTFGDASVSKIGFGCGEDNRGRAYSQGAGLVGLGRGPLSLISQLGVPKFSYCL 242

Query: 60  VD-RDSDSTSTLEFDSSLP-PNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFK 117
               DS   STL   S     +A+  PL++N    +FYYL L GISVG  LLPI ++ F 
Sbjct: 243 TSIDDSKGISTLLVGSEATVKSAIPTPLIQNPSRPSFYYLSLEGISVGDTLLPIEKSTFS 302

Query: 118 IDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSS- 176
           I + G+GG+I+DSGT +T L+   + AL+  F+   +      G    + C+      S 
Sbjct: 303 IQDDGSGGLIIDSGTTITYLKDSAFAALKKEFISQMKLDVDASGSTELELCFTLPPDGSP 362

Query: 177 VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFN 236
           V+VP + FHF EG  L LP +NY+I   +    C     +SS +SI GN QQQ   V  +
Sbjct: 363 VDVPQLVFHF-EGVDLKLPKENYIIEDSALRVICLTMG-SSSGMSIFGNFQQQNIVVLHD 420

Query: 237 LRNSLIGFTPNKC 249
           L    I F P +C
Sbjct: 421 LEKETISFAPAQC 433


>gi|61214232|sp|Q766C2.1|NEP2_NEPGR RecName: Full=Aspartic proteinase nepenthesin-2; AltName:
           Full=Nepenthesin-II; Flags: Precursor
 gi|41016423|dbj|BAD07475.1| aspartic proteinase nepenthesin II [Nepenthes gracilis]
          Length = 438

 Score =  166 bits (421), Expect = 6e-39,   Method: Compositional matrix adjust.
 Identities = 101/255 (39%), Positives = 146/255 (57%), Gaps = 8/255 (3%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVG-AAGLLGLGGGSLSFPSQINASTFSYCL 59
           G   TET T  ++SV NIA GCG +N+G   G  AGL+G+G G LS PSQ+    FSYC+
Sbjct: 183 GYMATETFTFETSSVPNIAFGCGEDNQGFGQGNGAGLIGMGWGPLSLPSQLGVGQFSYCM 242

Query: 60  VDRDSDSTSTLEFDSS---LPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
               S S STL   S+   +P  + +  L+ +    T+YY+ L GI+VGGD L I  + F
Sbjct: 243 TSYGSSSPSTLALGSAASGVPEGSPSTTLIHSSLNPTYYYITLQGITVGGDNLGIPSSTF 302

Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSR-S 175
           ++ + G GG+I+DSGT +T L  + YNA+  AF       +  +  +   TC+   S  S
Sbjct: 303 QLQDDGTGGMIIDSGTTLTYLPQDAYNAVAQAFTDQINLPTVDESSSGLSTCFQQPSDGS 362

Query: 176 SVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS-SLSIIGNVQQQGTRVS 234
           +V+VP +S  F +G VL L  +N LI   + G  C A   +S   +SI GN+QQQ T+V 
Sbjct: 363 TVQVPEISMQF-DGGVLNLGEQNILI-SPAEGVICLAMGSSSQLGISIFGNIQQQETQVL 420

Query: 235 FNLRNSLIGFTPNKC 249
           ++L+N  + F P +C
Sbjct: 421 YDLQNLAVSFVPTQC 435


>gi|61214233|sp|Q766C3.1|NEP1_NEPGR RecName: Full=Aspartic proteinase nepenthesin-1; AltName:
           Full=Nepenthesin-I; Flags: Precursor
 gi|41016421|dbj|BAD07474.1| aspartic proteinase nepenthesin I [Nepenthes gracilis]
          Length = 437

 Score =  166 bits (421), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 102/257 (39%), Positives = 146/257 (56%), Gaps = 12/257 (4%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVG-AAGLLGLGGGSLSFPSQINASTFSYCL 59
           G   TET+T GS S+ NI  GCG NN+G   G  AGL+G+G G LS PSQ++ + FSYC+
Sbjct: 182 GSMGTETLTFGSVSIPNITFGCGENNQGFGQGNGAGLVGMGRGPLSLPSQLDVTKFSYCM 241

Query: 60  VDRDSDSTSTLEFDSSLPPNAVTA-----PLLRNHELDTFYYLGLTGISVGGDLLPISET 114
               S + S L   S    N+VTA      L+++ ++ TFYY+ L G+SVG   LPI  +
Sbjct: 242 TPIGSSTPSNLLLGSL--ANSVTAGSPNTTLIQSSQIPTFYYITLNGLSVGSTRLPIDPS 299

Query: 115 AFKID-ESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDF-S 172
           AF ++  +G GGII+DSGT +T      Y ++R  F+            + FD C+   S
Sbjct: 300 AFALNSNNGTGGIIIDSGTTLTYFVNNAYQSVRQEFISQINLPVVNGSSSGFDLCFQTPS 359

Query: 173 SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTR 232
             S++++PT   HF +G  L LP++NY I   SNG  C A   +S  +SI GN+QQQ   
Sbjct: 360 DPSNLQIPTFVMHF-DGGDLELPSENYFIS-PSNGLICLAMGSSSQGMSIFGNIQQQNML 417

Query: 233 VSFNLRNSLIGFTPNKC 249
           V ++  NS++ F   +C
Sbjct: 418 VVYDTGNSVVSFASAQC 434


>gi|409179880|gb|AFV26025.1| aspartic proteinase nepenthesin 2 [Nepenthes mirabilis]
          Length = 437

 Score =  164 bits (416), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 105/256 (41%), Positives = 148/256 (57%), Gaps = 10/256 (3%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVG-AAGLLGLGGGSLSFPSQINASTFSYCL 59
           G   TET T  ++SV NIA GCG +N+G   G  AGL+G+G G LS PSQ+    FSYC+
Sbjct: 182 GYMATETFTFETSSVPNIAFGCGEDNQGFGQGNGAGLIGMGWGPLSLPSQLGVGQFSYCM 241

Query: 60  VDRDSDSTSTLEFDSS---LPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
               S S STL   S+   +P  + +  L+ +    T+YY+ L GI+VGGD L I  + F
Sbjct: 242 TSSGSSSPSTLALGSAASGVPEGSPSTTLIHSSLNPTYYYITLQGITVGGDNLGIPSSTF 301

Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTD-GVALFDTCYDFSSR- 174
           ++ + G GG+I+DSGT +T L  + YNA+  AF      LSP D   +   TC+   S  
Sbjct: 302 QLQDDGTGGMIIDSGTTLTYLPQDAYNAVAQAFTDQIN-LSPVDESSSGLSTCFQLPSDG 360

Query: 175 SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSS-LSIIGNVQQQGTRV 233
           S+V+VP +S  F +G VL L  +N LI   + G  C A   +S   +SI GN+QQQ T+V
Sbjct: 361 STVQVPEISMQF-DGGVLNLGEENVLIS-PAEGVICLAMGSSSQQGISIFGNIQQQETQV 418

Query: 234 SFNLRNSLIGFTPNKC 249
            ++L+N  + F P +C
Sbjct: 419 LYDLQNLAVSFVPTQC 434


>gi|18461217|dbj|BAB84414.1| chloroplast nucleoid DNA-binding protein cnd41-like [Oryza sativa
           Japonica Group]
          Length = 446

 Score =  164 bits (415), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 106/267 (39%), Positives = 145/267 (54%), Gaps = 18/267 (6%)

Query: 1   GDFVTETVTLGSAS-VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN---ASTFS 56
           GD  T+ +   + + V+N+ +GCG +NEGLF  AAGLLG+G G +S  +Q+     S F 
Sbjct: 178 GDLATDKLAFANDTYVNNVTLGCGRDNEGLFDSAAGLLGVGRGKISISTQVAPAYGSVFE 237

Query: 57  YCLVDRDSDST--STLEFDSS-LPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP-IS 112
           YCL DR S ST  S L F  +  PP+     LL N    + YY+ + G SVGG+ +   S
Sbjct: 238 YCLGDRTSRSTRSSYLVFGRTPEPPSTAFTALLSNPRRPSLYYVDMAGFSVGGERVTGFS 297

Query: 113 ETAFKID-ESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGV---ALFDTC 168
             +  +D  +G GG++VDSGTA++R   + Y ALRDAF    RA          ++FD C
Sbjct: 298 NASLALDTATGRGGVVVDSGTAISRFARDAYAALRDAFDARARAAGMRRLAGEHSVFDAC 357

Query: 169 YDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVD------SNGTFCFAFAPTSSSLSI 222
           YD   R +   P +  HF  G  + LP +NY +PVD      ++   C  F      LS+
Sbjct: 358 YDLRGRPAASAPLIVLHFAGGADMALPPENYFLPVDGGRRRAASYRRCLGFEAADDGLSV 417

Query: 223 IGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           IGNVQQQG RV F++    IGF P  C
Sbjct: 418 IGNVQQQGFRVVFDVEKERIGFAPKGC 444


>gi|226492391|ref|NP_001140482.1| uncharacterized protein LOC100272542 precursor [Zea mays]
 gi|224030447|gb|ACN34299.1| unknown [Zea mays]
 gi|414887506|tpg|DAA63520.1| TPA: hypothetical protein ZEAMMB73_432695 [Zea mays]
          Length = 512

 Score =  163 bits (413), Expect = 6e-38,   Method: Compositional matrix adjust.
 Identities = 113/268 (42%), Positives = 148/268 (55%), Gaps = 19/268 (7%)

Query: 1   GDFVTETVTL------GSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINA-- 52
           GD   E+ T+       S+ VD +  GCGH N GLF GAAGLLGLG G LSF SQ+ A  
Sbjct: 242 GDLALESFTVNLTAPGASSRVDGVVFGCGHRNRGLFHGAAGLLGLGRGPLSFASQLRAVY 301

Query: 53  --STFSYCLVDRDSDSTSTLEF--DSSL-----PPNAVTAPLLRNHELDTFYYLGLTGIS 103
              TFSYCLVD  SD  S + F  D +L     P    TA    +   DTFYY+ LTG+ 
Sbjct: 302 GGHTFSYCLVDHGSDVASKVVFGEDDALALAAHPRLKYTAFAPASSPADTFYYVRLTGVL 361

Query: 104 VGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFV-RGTRALSPTDGV 162
           VGG+LL IS   +   E G+GG I+DSGT ++      Y  +R AF+ R + +  P    
Sbjct: 362 VGGELLNISSDTWDASEGGSGGTIIDSGTTLSYFVEPAYQVIRRAFIDRMSGSYPPVPDF 421

Query: 163 ALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT-SSSLS 221
            +   CY+ S     EVP +S  F +G V   PA+NY I +D +G  C A   T  + +S
Sbjct: 422 PVLSPCYNVSGVERPEVPELSLLFADGAVWDFPAENYFIRLDPDGIMCLAVLGTPRTGMS 481

Query: 222 IIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           IIGN QQQ   V+++L N+ +GF P +C
Sbjct: 482 IIGNFQQQNFHVAYDLHNNRLGFAPRRC 509


>gi|20975624|emb|CAD31717.1| putative nucleoid DNA-binding protein [Cicer arietinum]
          Length = 144

 Score =  162 bits (411), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 78/144 (54%), Positives = 103/144 (71%)

Query: 106 GDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALF 165
           G  +PISE  F+++E G GG+++D+GTAVTRL T  Y+A RDAF+  T  L  +  V++F
Sbjct: 1   GVRVPISEDVFRLNELGEGGVVMDTGTAVTRLPTAAYDAFRDAFIGQTTNLPRSSDVSIF 60

Query: 166 DTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGN 225
           DTCYD     SV VPT+SF+F  G +L LPA+N+LIPV+  GTFCFAFAP+ S LSIIGN
Sbjct: 61  DTCYDLYGFVSVRVPTISFYFLGGPILTLPARNFLIPVNDVGTFCFAFAPSPSGLSIIGN 120

Query: 226 VQQQGTRVSFNLRNSLIGFTPNKC 249
           +QQ+G  +S +  N  +GF PN C
Sbjct: 121 IQQEGIEISVDGVNGFVGFGPNIC 144


>gi|242089069|ref|XP_002440367.1| hypothetical protein SORBIDRAFT_09g030430 [Sorghum bicolor]
 gi|241945652|gb|EES18797.1| hypothetical protein SORBIDRAFT_09g030430 [Sorghum bicolor]
          Length = 462

 Score =  161 bits (407), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 110/256 (42%), Positives = 139/256 (54%), Gaps = 34/256 (13%)

Query: 1   GDFVTETVTLGS-ASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN---ASTFS 56
           GD  TET+     A V  +A+GCGH+NEGLFV AAGLLGLG G LS P+Q        FS
Sbjct: 234 GDLATETLWFARGARVPRVAVGCGHDNEGLFVAAAGLLGLGRGRLSLPTQTARRYGRRFS 293

Query: 57  YCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
           YC    D D  + +               +  H         + G  V G    + E + 
Sbjct: 294 YCFQGSDLDHRTIIR-------------TVHQH---------VGGARVRG----VGERSL 327

Query: 117 KIDES-GNGGIIVDSGTAVTRLQTETYNALRDAF--VRGTRALSPTDGVALFDTCYDFSS 173
           ++D S G GG+I+DSGT+VTRL    Y A+R+AF    G   L+P  G +LFDTCYD   
Sbjct: 328 RLDPSTGRGGVILDSGTSVTRLARPVYVAVREAFRAAAGGLRLAP-GGFSLFDTCYDLRG 386

Query: 174 RSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRV 233
           R  V+VPTVS H   G  + LP +NYLIPVD+ GTFC A A T   +SI+GN+QQQG RV
Sbjct: 387 RRVVKVPTVSVHLAGGAEVALPPENYLIPVDTRGTFCLALAGTDGGVSIVGNIQQQGFRV 446

Query: 234 SFNLRNSLIGFTPNKC 249
            F+     +   P  C
Sbjct: 447 VFDGDRQRVALVPKSC 462


>gi|125527523|gb|EAY75637.1| hypothetical protein OsI_03542 [Oryza sativa Indica Group]
          Length = 446

 Score =  160 bits (405), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 104/267 (38%), Positives = 144/267 (53%), Gaps = 18/267 (6%)

Query: 1   GDFVTETVTLGSAS-VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN---ASTFS 56
           G+  T+ +   + + V+N+ +GCG +NEGLF  AAGLLG+  G +S  +Q+     S F 
Sbjct: 178 GELATDKLAFANDTYVNNVTLGCGRDNEGLFDSAAGLLGVARGKISISTQVAPAYGSVFE 237

Query: 57  YCLVDRDSDST--STLEFDSS-LPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP-IS 112
           YCL DR S ST  S L F  +  PP+     LL N    + YY+ + G SVGG+ +   S
Sbjct: 238 YCLGDRTSRSTRSSYLVFGRTPEPPSTAFTALLSNPRRPSLYYVDMAGFSVGGERVTGFS 297

Query: 113 ETAFKID-ESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGV---ALFDTC 168
             +  +D  +G GG++VDSGTA++R   + Y ALRDAF    RA          ++FD C
Sbjct: 298 NASLALDTATGRGGVVVDSGTAISRFARDAYAALRDAFDARARAAGMRRLAGEHSVFDAC 357

Query: 169 YDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVD------SNGTFCFAFAPTSSSLSI 222
           YD   R +   P +  HF  G  + LP +NY +PVD      ++   C  F      LS+
Sbjct: 358 YDLRGRPAASAPLIVLHFAGGADMALPPENYFLPVDGGRRRAASYRRCLGFEAADDGLSV 417

Query: 223 IGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           IGNVQQQG RV F++    IGF P  C
Sbjct: 418 IGNVQQQGFRVVFDVEKERIGFAPKGC 444


>gi|224085379|ref|XP_002307559.1| predicted protein [Populus trichocarpa]
 gi|222857008|gb|EEE94555.1| predicted protein [Populus trichocarpa]
          Length = 455

 Score =  159 bits (403), Expect = 8e-37,   Method: Compositional matrix adjust.
 Identities = 101/270 (37%), Positives = 144/270 (53%), Gaps = 21/270 (7%)

Query: 1   GDFVTETVTLGSAS---------VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN 51
           GDF TET T+   S         V+N+  GCGH N GLF GA+GLLGLG G LSF SQ+ 
Sbjct: 183 GDFATETFTVNLTSPTGKSEFKRVENVMFGCGHWNRGLFHGASGLLGLGRGPLSFSSQLQ 242

Query: 52  A---STFSYCLVDRDSDSTSTLEFDSSLPPNAVTAPLL--------RNHELDTFYYLGLT 100
           +    +FSYCLVDR+SD+  + +       + +  P L        + + +DTFYY+ + 
Sbjct: 243 SLYGHSFSYCLVDRNSDTNVSSKLIFGEDKDLLNHPELNFTTLVGGKENPVDTFYYVQIK 302

Query: 101 GISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTD 160
            I VGG++L I E+ + +   G GG IVDSGT ++      Y  ++DAFV+  +      
Sbjct: 303 SIMVGGEVLNIPESTWNMTSDGVGGTIVDSGTTLSYFTEPAYQIIKDAFVKKVKGYPIVQ 362

Query: 161 GVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT-SSS 219
              + D CY+ S    +++P     F +G V   P +NY I +D     C A   T  S+
Sbjct: 363 DFPILDPCYNVSGVEKIDLPDFGILFADGAVWNFPVENYFIRLDPEEVVCLAILGTPRSA 422

Query: 220 LSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           LSIIGN QQQ   V ++ + S +G+ P  C
Sbjct: 423 LSIIGNYQQQNFHVLYDTKKSRLGYAPMNC 452


>gi|223949775|gb|ACN28971.1| unknown [Zea mays]
 gi|414590177|tpg|DAA40748.1| TPA: hypothetical protein ZEAMMB73_257146 [Zea mays]
          Length = 510

 Score =  159 bits (403), Expect = 9e-37,   Method: Compositional matrix adjust.
 Identities = 109/265 (41%), Positives = 145/265 (54%), Gaps = 16/265 (6%)

Query: 1   GDFVTETVTL------GSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINA-- 52
           GD   E+ T+       S  VD +  GCGH N GLF GAAGLLGLG G LSF SQ+ A  
Sbjct: 243 GDLALESFTVNLTAPGASRRVDGVVFGCGHRNRGLFHGAAGLLGLGRGPLSFASQLRAVY 302

Query: 53  -STFSYCLVDRDSDSTSTLEFDS-----SLPPNAVTAPLLRNHELDTFYYLGLTGISVGG 106
             TFSYCLV+  SD+ S + F       + P    TA    +   DTFYY+ L G+ VGG
Sbjct: 303 GHTFSYCLVEHGSDAGSKVVFGEDYLVLAHPQLKYTAFAPTSSPADTFYYVKLKGVLVGG 362

Query: 107 DLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSP-TDGVALF 165
           DLL IS   + + + G+GG I+DSGT ++      Y  +R AFV     L P      + 
Sbjct: 363 DLLNISSDTWDVGKDGSGGTIIDSGTTLSYFVEPAYQVIRQAFVDLMSRLYPLIPDFPVL 422

Query: 166 DTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT-SSSLSIIG 224
           + CY+ S     EVP +S  F +G V   PA+NY + +D +G  C A   T  + +SIIG
Sbjct: 423 NPCYNVSGVERPEVPELSLLFADGAVWDFPAENYFVRLDPDGIMCLAVRGTPRTGMSIIG 482

Query: 225 NVQQQGTRVSFNLRNSLIGFTPNKC 249
           N QQQ   V ++L+N+ +GF P +C
Sbjct: 483 NFQQQNFHVVYDLQNNRLGFAPRRC 507


>gi|168040957|ref|XP_001772959.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162675692|gb|EDQ62184.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 351

 Score =  159 bits (402), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 102/257 (39%), Positives = 144/257 (56%), Gaps = 10/257 (3%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
           GDF  ETVTL  +++  I  GCGHN EG F GA GL+GLG G LS PSQ+N+S    FSY
Sbjct: 96  GDFAFETVTLNGSTLARIGFGCGHNQEGTFAGADGLIGLGQGPLSLPSQLNSSFTHIFSY 155

Query: 58  CLVDRDSDST-STLEF-DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETA 115
           CLVD+ +  T S + F +++    A   PLL+N +  ++YY+G+  ISVG   +P   +A
Sbjct: 156 CLVDQSTTGTFSPITFGNAAENSRASFTPLLQNEDNPSYYYVGVESISVGNRRVPTPPSA 215

Query: 116 FKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFS--S 173
           F+ID +G GG+I+DSGT +T  +   +  +     R              + CYD S  S
Sbjct: 216 FRIDANGVGGVILDSGTTITYWRLAAFIPILAELRRQISYPEADPTPYGLNLCYDISSVS 275

Query: 174 RSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNG-TFCFAFAPTSSSLSIIGNVQQQGTR 232
            SS+ +P+++ H        +P  N  + VD+ G T C A + TS   SIIGNVQQQ   
Sbjct: 276 ASSLTLPSMTVHLTNVD-FEIPVSNLWVLVDNFGETVCTAMS-TSDQFSIIGNVQQQNNL 333

Query: 233 VSFNLRNSLIGFTPNKC 249
           +  ++ NS +GF    C
Sbjct: 334 IVTDVANSRVGFLATDC 350


>gi|125571060|gb|EAZ12575.1| hypothetical protein OsJ_02481 [Oryza sativa Japonica Group]
          Length = 501

 Score =  159 bits (401), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 123/280 (43%), Positives = 142/280 (50%), Gaps = 45/280 (16%)

Query: 1   GDFVTETVTLGS-ASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN---ASTFS 56
           GDF TET+T  S A V  +A+GCGH+NEGLFV AAGLLGLG GSLSFPSQI+     +FS
Sbjct: 236 GDFATETLTFASGARVPRVALGCGHDNEGLFVAAAGLLGLGRGSLSFPSQISRRFGRSFS 295

Query: 57  YCLVD---------------------RDSDSTSTLEFDSSLPPNAVTAPLLR---NHELD 92
           YCLVD                     R +     L  D   P +     LLR    H+  
Sbjct: 296 YCLVDRTSSSASATSRSSTVTFGSGARGALGRRVLHPDGEEPQDGDV--LLRAAHGHQRR 353

Query: 93  TFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGT---AVTRLQTETYNALRDAF 149
                G   +    D             +G GG+IVDSG    A  R       A R   
Sbjct: 354 RRARPGRGRVRPPPD-----------PSTGRGGVIVDSGRPSPAWARAGRTPPCATRSRA 402

Query: 150 VRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTF 209
                 LSP  G +LFDTCYD S    V+VPTVS HF  G    LP +NYLIPVDS GTF
Sbjct: 403 AAAGLRLSP-GGFSLFDTCYDLSGLKVVKVPTVSMHFAGGAEAALPPENYLIPVDSRGTF 461

Query: 210 CFAFAPTSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           CFAFA T   +SIIGN+QQQG RV F+     +GF P  C
Sbjct: 462 CFAFAGTDGGVSIIGNIQQQGFRVVFDGDGQRLGFVPKGC 501


>gi|125561848|gb|EAZ07296.1| hypothetical protein OsI_29544 [Oryza sativa Indica Group]
          Length = 448

 Score =  158 bits (400), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 96/267 (35%), Positives = 139/267 (52%), Gaps = 19/267 (7%)

Query: 1   GDFVTETVTLGSAS-----VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTF 55
           G   +ET T G+A+     V ++A GCG+ N G    ++G++GLG G LS  SQ+  S F
Sbjct: 180 GVLASETFTFGAANSSKVMVSDVAFGCGNINSGQLANSSGMVGLGRGPLSLVSQLGPSRF 239

Query: 56  SYCLVDRDSDSTSTLEFDSSLPPNAVTA----------PLLRNHELDTFYYLGLTGISVG 105
           SYCL    S   S L F      N   A          PL+ N  L + Y++ L GIS+G
Sbjct: 240 SYCLTSFLSPEPSRLNFGVFATLNGTNASSSGSPVQSTPLVVNAALPSLYFMSLKGISLG 299

Query: 106 GDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVAL- 164
              LPI    F I++ G GG+ +DSGT++T LQ + Y+A+R   V   R L PT+   + 
Sbjct: 300 QKRLPIDPLVFAINDDGTGGVFIDSGTSLTWLQQDAYDAVRHELVSVLRPLPPTNDTEIG 359

Query: 165 FDTCYDFSSRSSVE--VPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSI 222
            +TC+ +    SV   VP +  HF  G  + +P +NY++   + G  C A    S   +I
Sbjct: 360 LETCFPWPPPPSVAVTVPDMELHFDGGANMTVPPENYMLIDGATGFLCLAMI-RSGDATI 418

Query: 223 IGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           IGN QQQ   + +++ NSL+ F P  C
Sbjct: 419 IGNYQQQNMHILYDIANSLLSFVPAPC 445


>gi|326494754|dbj|BAJ94496.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326514480|dbj|BAJ96227.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 449

 Score =  158 bits (400), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 99/261 (37%), Positives = 140/261 (53%), Gaps = 15/261 (5%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGL-FVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
           G    ET TL    + ++A GCG  NEG  F   AGL+GLG G LS  SQ+  + FSYCL
Sbjct: 189 GVLAAETFTLAKTKLPDVAFGCGDTNEGDGFTQGAGLVGLGRGPLSLVSQLGLNKFSYCL 248

Query: 60  VDRDSDSTSTLEFDS--------SLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPI 111
              D  S S L   S        +   +  T PL+RN    +FYY+ L G++VG   + +
Sbjct: 249 TSLDDTSKSPLLLGSLATISESAAAASSVQTTPLIRNPSQPSFYYVNLKGLTVGSTHITL 308

Query: 112 SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVAL-FDTCYD 170
             +AF + + G GG+IVDSGT++T L+ + Y AL+ AF    + L   DG  +  DTC++
Sbjct: 309 PSSAFAVQDDGTGGVIVDSGTSITYLELQGYRALKKAFAAQMK-LPAADGSGIGLDTCFE 367

Query: 171 --FSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQ 228
              S    VEVP + FH  +G  L LPA+NY++    +G  C      S  LSIIGN QQ
Sbjct: 368 APASGVDQVEVPKLVFHL-DGADLDLPAENYMVLDSGSGALCLTVM-GSRGLSIIGNFQQ 425

Query: 229 QGTRVSFNLRNSLIGFTPNKC 249
           Q  +  +++  + + F P +C
Sbjct: 426 QNIQFVYDVGENTLSFAPVQC 446


>gi|115476828|ref|NP_001062010.1| Os08g0469000 [Oryza sativa Japonica Group]
 gi|42407407|dbj|BAD09565.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
 gi|113623979|dbj|BAF23924.1| Os08g0469000 [Oryza sativa Japonica Group]
 gi|125603713|gb|EAZ43038.1| hypothetical protein OsJ_27627 [Oryza sativa Japonica Group]
          Length = 448

 Score =  158 bits (399), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 96/267 (35%), Positives = 139/267 (52%), Gaps = 19/267 (7%)

Query: 1   GDFVTETVTLGSAS-----VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTF 55
           G   +ET T G+A+     V ++A GCG+ N G    ++G++GLG G LS  SQ+  S F
Sbjct: 180 GVLASETFTFGAANSSKVMVSDVAFGCGNINSGQLANSSGMVGLGRGPLSLVSQLGPSRF 239

Query: 56  SYCLVDRDSDSTSTLEFDSSLPPNAVTA----------PLLRNHELDTFYYLGLTGISVG 105
           SYCL    S   S L F      N   A          PL+ N  L + Y++ L GIS+G
Sbjct: 240 SYCLTSFLSPEPSRLNFGVFATLNGTNASSSGSPVQSTPLVVNAALPSLYFMSLKGISLG 299

Query: 106 GDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVAL- 164
              LPI    F I++ G GG+ +DSGT++T LQ + Y+A+R   V   R L PT+   + 
Sbjct: 300 QKRLPIDPLVFAINDDGTGGVFIDSGTSLTWLQQDAYDAVRRELVSVLRPLPPTNDTEIG 359

Query: 165 FDTCYDFSSRSSVE--VPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSI 222
            +TC+ +    SV   VP +  HF  G  + +P +NY++   + G  C A    S   +I
Sbjct: 360 LETCFPWPPPPSVAVTVPDMELHFDGGANMTVPPENYMLIDGATGFLCLAMI-RSGDATI 418

Query: 223 IGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           IGN QQQ   + +++ NSL+ F P  C
Sbjct: 419 IGNYQQQNMHILYDIANSLLSFVPAPC 445


>gi|357167693|ref|XP_003581287.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 468

 Score =  157 bits (397), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 97/261 (37%), Positives = 135/261 (51%), Gaps = 14/261 (5%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGL-FVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
           G    ET TL    +  +A GCG  NEG  F   AGL+GLG G LS  SQ+    FSYCL
Sbjct: 207 GVLAAETFTLAKTKLPGVAFGCGDTNEGDGFTQGAGLVGLGRGPLSLVSQLGLGKFSYCL 266

Query: 60  VDRDSDSTSTLEF--------DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPI 111
              D  S S L          D++      T PL++N    +FYY+ L  ++VG   +P+
Sbjct: 267 TSLDDTSKSPLLLGSLAAISTDTASAAAIQTTPLIKNPSQPSFYYVTLKALTVGSTRIPL 326

Query: 112 SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVAL-FDTCYD 170
             +AF + + G GG+IVDSGT++T L+ + Y  L+ AF    + L   DG A+  D C+ 
Sbjct: 327 PGSAFAVQDDGTGGVIVDSGTSITYLELQGYRPLKKAFAAQMK-LPVADGSAVGLDLCFK 385

Query: 171 --FSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQ 228
              S    VEVP +  HF  G  L LPA+NY++   ++G  C      S  LSIIGN QQ
Sbjct: 386 APASGVDDVEVPKLVLHFDGGADLDLPAENYMVLDSASGALCLTVM-GSRGLSIIGNFQQ 444

Query: 229 QGTRVSFNLRNSLIGFTPNKC 249
           Q  +  +++    + F P +C
Sbjct: 445 QNIQFVYDVDKDTLSFAPVQC 465


>gi|224286173|gb|ACN40797.1| unknown [Picea sitchensis]
          Length = 383

 Score =  157 bits (397), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 102/254 (40%), Positives = 144/254 (56%), Gaps = 16/254 (6%)

Query: 6   ETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSYCLVDR 62
           ET ++ S S+ NI  GCGH+N+G F    GL+G G GSLS  SQ+  S    FSYCLV R
Sbjct: 133 ETFSISSQSLPNITFGCGHDNQG-FDKVGGLVGFGRGSLSLVSQLGPSMGNKFSYCLVSR 191

Query: 63  -DSDSTSTLEFDSSLPPNAVTA---PLLRNHELDTFYYLGLTGISVGGDLLPISETAFKI 118
            DS  TS L   ++    A T    PL+++   +  YYL L GISVGG  L I    F I
Sbjct: 192 TDSSKTSPLFIGNTASLEATTVGSTPLVQSSSTN-HYYLSLEGISVGGQSLAIPTGTFDI 250

Query: 119 DESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVE 178
              G+GG+I+DSGT +T LQ   Y+A+++A V     L   DG    D C++    S+  
Sbjct: 251 QSDGSGGLIIDSGTTLTFLQQTAYDAVKEAMVSSIN-LPQADG--QLDLCFNQQGSSNPG 307

Query: 179 VPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSL---SIIGNVQQQGTRVSF 235
            P+++FHF +G    +P +NYL P  ++   C A  PT+S+L   +I GNVQQQ  ++ +
Sbjct: 308 FPSMTFHF-KGADYDVPKENYLFPDSTSDIVCLAMMPTNSNLGNMAIFGNVQQQNYQILY 366

Query: 236 NLRNSLIGFTPNKC 249
           +  N+++ F P  C
Sbjct: 367 DNENNVLSFAPTAC 380


>gi|242073260|ref|XP_002446566.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
 gi|241937749|gb|EES10894.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
          Length = 452

 Score =  157 bits (397), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 102/264 (38%), Positives = 141/264 (53%), Gaps = 17/264 (6%)

Query: 1   GDFVTETVTLGS--ASVDNIAIGCGHNNEGL-FVGAAGLLGLGGGSLSFPSQINASTFSY 57
           G   +ET TLG     +  +A GCG  NEG  F   AGL+GLG G LS  SQ+    FSY
Sbjct: 188 GVLASETFTLGKEKKKLPGVAFGCGDTNEGDGFTQGAGLVGLGRGPLSLVSQLGLDKFSY 247

Query: 58  CLVD-RDSDSTSTLEFDSSLPPNAV--------TAPLLRNHELDTFYYLGLTGISVGGDL 108
           CL    D D  S L    S    +         T PL++N    +FYY+ LTG++VG   
Sbjct: 248 CLTSLDDGDGKSPLLLGGSAAAISESAATAPVQTTPLVKNPSQPSFYYVSLTGLTVGSTR 307

Query: 109 LPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVAL-FDT 167
           + +  +AF I + G GG+IVDSGT++T L+ + Y AL+ AFV    AL   DG  +  D 
Sbjct: 308 ITLPASAFAIQDDGTGGVIVDSGTSITYLELQGYRALKKAFV-AQMALPTVDGSEIGLDL 366

Query: 168 CYDFSSR--SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGN 225
           C+   ++    V+VP +  HF  G  L LPA+NY++   ++G  C   AP S  LSIIGN
Sbjct: 367 CFQGPAKGVDEVQVPKLVLHFDGGADLDLPAENYMVLDSASGALCLTVAP-SRGLSIIGN 425

Query: 226 VQQQGTRVSFNLRNSLIGFTPNKC 249
            QQQ  +  +++    + F P +C
Sbjct: 426 FQQQNFQFVYDVAGDTLSFAPVQC 449


>gi|449523529|ref|XP_004168776.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
          Length = 461

 Score =  157 bits (396), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 96/263 (36%), Positives = 142/263 (53%), Gaps = 16/263 (6%)

Query: 1   GDFVTETVTLGSASVDNIAI-----GCGHNNEGL-FVGAAGLLGLGGGSLSFPSQINAST 54
           G    ET T G ++ D I+I     GCG++N G  F   AGL+GLG G LS  SQ+    
Sbjct: 198 GVLAFETFTFGDSTEDQISIPGLGFGCGNDNNGDGFSQGAGLVGLGRGPLSLVSQLKEQK 257

Query: 55  FSYCLVDRDSDSTSTLEFDS--SLPPNA-----VTAPLLRNHELDTFYYLGLTGISVGGD 107
           F+YCL   D    S+L   S  ++ P        T PL++N    +FYYL L GISVGG 
Sbjct: 258 FAYCLTAIDDSKPSSLLLGSLANITPKTSKDEMKTTPLIKNPSQPSFYYLSLQGISVGGT 317

Query: 108 LLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDT 167
            L I ++ F++ + G+GG+I+DSGT +T ++   + +L++ F+          G    D 
Sbjct: 318 QLSIPKSTFELHDDGSGGVIIDSGTTITYVENSAFTSLKNEFIAQMNLPVDDSGTGGLDL 377

Query: 168 CYDF-SSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNV 226
           C++  +  + VEVP ++FHF +G  L LP +NY+I     G  C A   +S  +SI GN+
Sbjct: 378 CFNLPAGTNQVEVPKLTFHF-KGADLELPGENYMIGDSKAGLLCLAIG-SSRGMSIFGNL 435

Query: 227 QQQGTRVSFNLRNSLIGFTPNKC 249
           QQQ   V  +L+   + F P +C
Sbjct: 436 QQQNFMVVHDLQEETLSFLPTQC 458


>gi|449453902|ref|XP_004144695.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-1-like, partial [Cucumis sativus]
          Length = 716

 Score =  156 bits (395), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 95/258 (36%), Positives = 141/258 (54%), Gaps = 16/258 (6%)

Query: 6   ETVTLGSASVDNIAI-----GCGHNNEGL-FVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
           ET T G ++ D I+I     GCG++N G  F   AGL+GLG G LS  SQ+    F+YCL
Sbjct: 458 ETFTFGDSTEDQISIPGLGFGCGNDNNGDGFSQGAGLVGLGRGPLSLVSQLKEQKFAYCL 517

Query: 60  VDRDSDSTSTLEFDS--SLPPNA-----VTAPLLRNHELDTFYYLGLTGISVGGDLLPIS 112
              D    S+L   S  ++ P        T PL++N    +FYYL L GISVGG  L I 
Sbjct: 518 TAIDDSKPSSLLLGSLANITPKTSKDEMKTTPLIKNPSQPSFYYLSLQGISVGGTQLSIP 577

Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDF- 171
           ++ F++ + G+GG+I+DSGT +T ++   + +L++ F+          G    D C++  
Sbjct: 578 KSTFELHDDGSGGVIIDSGTTITYVENSAFTSLKNEFIAQMNLPVDDSGTGGLDLCFNLP 637

Query: 172 SSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGT 231
           +  + VEVP ++FHF +G  L LP +NY+I     G  C A   +S  +SI GN+QQQ  
Sbjct: 638 AGTNQVEVPKLTFHF-KGADLELPGENYMIGDSKAGLLCLAIG-SSRGMSIFGNLQQQNF 695

Query: 232 RVSFNLRNSLIGFTPNKC 249
            V  +L+   + F P +C
Sbjct: 696 MVVHDLQEETLSFLPTQC 713


>gi|110739922|dbj|BAF01866.1| chloroplast nucleoid DNA binding protein like [Arabidopsis
           thaliana]
          Length = 142

 Score =  156 bits (395), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 76/139 (54%), Positives = 98/139 (70%), Gaps = 1/139 (0%)

Query: 111 ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYD 170
           ++ + FK+D+ GNGG+I+DSGT+VTRL    Y A+RDAF  G + L      +LFDTC+D
Sbjct: 4   VTASLFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPDFSLFDTCFD 63

Query: 171 FSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQG 230
            S+ + V+VPTV  HF  G  + LPA NYLIPVD+NG FCFAFA T   LSIIGN+QQQG
Sbjct: 64  LSNMNEVKVPTVVLHF-RGADVSLPATNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQG 122

Query: 231 TRVSFNLRNSLIGFTPNKC 249
            RV ++L +S +GF P  C
Sbjct: 123 FRVVYDLASSRVGFAPGGC 141


>gi|388504358|gb|AFK40245.1| unknown [Medicago truncatula]
          Length = 480

 Score =  156 bits (394), Expect = 9e-36,   Method: Compositional matrix adjust.
 Identities = 97/260 (37%), Positives = 139/260 (53%), Gaps = 19/260 (7%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
           G+   E ++ G  SV N   GCG NN+GLF G +G++GLG  +LS  SQ N +    FSY
Sbjct: 226 GELGVEHLSFGGISVSNFVFGCGRNNKGLFGGVSGIMGLGRSNLSMISQTNTTFGGVFSY 285

Query: 58  CLVDRDSDSTSTL------EFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPI 111
           CL   DS ++ +L          +L P A T+ ++ N +L  FY L LTGI VGG  + I
Sbjct: 286 CLPTTDSGASGSLVIGNESSLFKNLTPIAYTS-MVSNPQLSNFYVLNLTGIDVGG--VAI 342

Query: 112 SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDF 171
            +T+F     GNGGI++DSGT +TRL    YNAL+  F++          +++ DTC++ 
Sbjct: 343 QDTSF-----GNGGILIDSGTVITRLAPSLYNALKAEFLKQFSGYPIAPALSILDTCFNL 397

Query: 172 SSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS--SSLSIIGNVQQQ 229
           +    V +PT+S HF     L + A   L         C A A  S  + ++IIGN QQ+
Sbjct: 398 TGIEEVSIPTLSMHFENNVDLNVDAVGILYMPKDGSQVCLALASLSDENDMAIIGNYQQR 457

Query: 230 GTRVSFNLRNSLIGFTPNKC 249
             RV ++ + S IGF    C
Sbjct: 458 NQRVIYDAKQSKIGFAREDC 477


>gi|255558694|ref|XP_002520371.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
 gi|223540418|gb|EEF41987.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
          Length = 557

 Score =  156 bits (394), Expect = 9e-36,   Method: Compositional matrix adjust.
 Identities = 107/270 (39%), Positives = 146/270 (54%), Gaps = 21/270 (7%)

Query: 1   GDFVTETVTLGSAS---------VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN 51
           GDF  ET T+   S         V+N+  GCGH N GLF GAAGLLGLG G LSF SQ+ 
Sbjct: 285 GDFALETFTVNLTSPAGKSEFKRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQ 344

Query: 52  A---STFSYCLVDRDSDS--TSTLEF--DSSL--PPNAVTAPLLRNHE--LDTFYYLGLT 100
           +    +FSYCLVDR+SD+  +S L F  D  L   P      L+   E  +DTFYY+ + 
Sbjct: 345 SLYGHSFSYCLVDRNSDTNVSSKLIFGEDKDLLNHPEVNFTSLVAGKENPVDTFYYVQIK 404

Query: 101 GISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTD 160
            I VGG++L I E  + +   G GG IVDSGT ++     +Y  ++DAFV+  +      
Sbjct: 405 SIMVGGEVLKIPEETWHLSPEGAGGTIVDSGTTLSYFAEPSYEIIKDAFVKKVKGYPVIK 464

Query: 161 GVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT-SSS 219
              + D CY+ S    +E+P     F +G V   P +NY I ++     C A   T  S+
Sbjct: 465 DFPILDPCYNVSGVEKMELPEFRILFEDGAVWNFPVENYFIKLEPEEIVCLAILGTPRSA 524

Query: 220 LSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           LSIIGN QQQ   + ++ + S +G+ P KC
Sbjct: 525 LSIIGNYQQQNFHILYDTKKSRLGYAPMKC 554


>gi|115458644|ref|NP_001052922.1| Os04g0448300 [Oryza sativa Japonica Group]
 gi|113564493|dbj|BAF14836.1| Os04g0448300 [Oryza sativa Japonica Group]
 gi|215766465|dbj|BAG98773.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215767943|dbj|BAH00172.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 454

 Score =  155 bits (392), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 97/261 (37%), Positives = 137/261 (52%), Gaps = 14/261 (5%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGL-FVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
           G   TET TL  + +  +  GCG  NEG  F   AGL+GLG G LS  SQ+    FSYCL
Sbjct: 193 GVLATETFTLAKSKLPGVVFGCGDTNEGDGFSQGAGLVGLGRGPLSLVSQLGLDKFSYCL 252

Query: 60  VDRDSDSTSTLEFDS--------SLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPI 111
              D  + S L   S        +   +  T PL++N    +FYY+ L  I+VG   + +
Sbjct: 253 TSLDDTNNSPLLLGSLAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISL 312

Query: 112 SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVAL-FDTCYD 170
             +AF + + G GG+IVDSGT++T L+ + Y AL+ AF     AL   DG  +  D C+ 
Sbjct: 313 PSSAFAVQDDGTGGVIVDSGTSITYLEVQGYRALKKAFA-AQMALPAADGSGVGLDLCFR 371

Query: 171 FSSR--SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQ 228
             ++    VEVP + FHF  G  L LPA+NY++    +G  C      S  LSIIGN QQ
Sbjct: 372 APAKGVDQVEVPRLVFHFDGGADLDLPAENYMVLDGGSGALCLTVM-GSRGLSIIGNFQQ 430

Query: 229 QGTRVSFNLRNSLIGFTPNKC 249
           Q  +  +++ +  + F P +C
Sbjct: 431 QNFQFVYDVGHDTLSFAPVQC 451


>gi|38344829|emb|CAD40873.2| OSJNBa0064H22.10 [Oryza sativa Japonica Group]
 gi|116310063|emb|CAH67084.1| H0818E04.1 [Oryza sativa Indica Group]
 gi|116310186|emb|CAH67198.1| OSIGBa0152K17.10 [Oryza sativa Indica Group]
          Length = 444

 Score =  155 bits (392), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 97/261 (37%), Positives = 137/261 (52%), Gaps = 14/261 (5%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGL-FVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
           G   TET TL  + +  +  GCG  NEG  F   AGL+GLG G LS  SQ+    FSYCL
Sbjct: 183 GVLATETFTLAKSKLPGVVFGCGDTNEGDGFSQGAGLVGLGRGPLSLVSQLGLDKFSYCL 242

Query: 60  VDRDSDSTSTLEFDS--------SLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPI 111
              D  + S L   S        +   +  T PL++N    +FYY+ L  I+VG   + +
Sbjct: 243 TSLDDTNNSPLLLGSLAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISL 302

Query: 112 SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVAL-FDTCYD 170
             +AF + + G GG+IVDSGT++T L+ + Y AL+ AF     AL   DG  +  D C+ 
Sbjct: 303 PSSAFAVQDDGTGGVIVDSGTSITYLEVQGYRALKKAFA-AQMALPAADGSGVGLDLCFR 361

Query: 171 FSSR--SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQ 228
             ++    VEVP + FHF  G  L LPA+NY++    +G  C      S  LSIIGN QQ
Sbjct: 362 APAKGVDQVEVPRLVFHFDGGADLDLPAENYMVLDGGSGALCLTVM-GSRGLSIIGNFQQ 420

Query: 229 QGTRVSFNLRNSLIGFTPNKC 249
           Q  +  +++ +  + F P +C
Sbjct: 421 QNFQFVYDVGHDTLSFAPVQC 441


>gi|125548488|gb|EAY94310.1| hypothetical protein OsI_16079 [Oryza sativa Indica Group]
          Length = 423

 Score =  155 bits (392), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 97/261 (37%), Positives = 137/261 (52%), Gaps = 14/261 (5%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGL-FVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
           G   TET TL  + +  +  GCG  NEG  F   AGL+GLG G LS  SQ+    FSYCL
Sbjct: 162 GVLATETFTLAKSKLPGVVFGCGDTNEGDGFSQGAGLVGLGRGPLSLVSQLGLDKFSYCL 221

Query: 60  VDRDSDSTSTLEFDS--------SLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPI 111
              D  + S L   S        +   +  T PL++N    +FYY+ L  I+VG   + +
Sbjct: 222 TSLDDTNNSPLLLGSLAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISL 281

Query: 112 SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVAL-FDTCYD 170
             +AF + + G GG+IVDSGT++T L+ + Y AL+ AF     AL   DG  +  D C+ 
Sbjct: 282 PSSAFAVQDDGTGGVIVDSGTSITYLEVQGYRALKKAFA-AQMALPAADGSGVGLDLCFR 340

Query: 171 FSSR--SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQ 228
             ++    VEVP + FHF  G  L LPA+NY++    +G  C      S  LSIIGN QQ
Sbjct: 341 APAKGVDQVEVPRLVFHFDGGADLDLPAENYMVLDGGSGALCLTVM-GSRGLSIIGNFQQ 399

Query: 229 QGTRVSFNLRNSLIGFTPNKC 249
           Q  +  +++ +  + F P +C
Sbjct: 400 QNFQFVYDVGHDTLSFAPVQC 420


>gi|125563957|gb|EAZ09337.1| hypothetical protein OsI_31609 [Oryza sativa Indica Group]
 gi|125605916|gb|EAZ44952.1| hypothetical protein OsJ_29595 [Oryza sativa Japonica Group]
          Length = 438

 Score =  155 bits (391), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 101/269 (37%), Positives = 142/269 (52%), Gaps = 25/269 (9%)

Query: 1   GDFVTETVTLGSAS----VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFS 56
           G    ET T G+ S    V  ++ GCG+ N G     +G++G G G+LS  SQ+ +  FS
Sbjct: 172 GVLANETFTFGTNSTRVAVPRVSFGCGNMNAGTLFNGSGMVGFGRGALSLVSQLGSPRFS 231

Query: 57  YCLVDRDSDSTSTLEFDSSLPPNAV---------TAPLLRNHELDTFYYLGLTGISVGGD 107
           YCL    S +TS L F +    N+          + P + N  L T Y+L +TGISV GD
Sbjct: 232 YCLTSFMSPATSRLYFGAYATLNSTNTSSSGPVQSTPFIVNPALPTMYFLNMTGISVAGD 291

Query: 108 LLPISETAFKIDES-GNGGIIVDSGTAVTRLQTETYNALRDAFVRGT---RA-LSPTDGV 162
           LLPI  + F I+E+ G GG+I+DSGT VT L    Y  ++ AFV      RA  +P+D  
Sbjct: 292 LLPIDPSVFAINETDGTGGVIIDSGTTVTFLAQPAYAMVQGAFVAWVGLPRANATPSD-- 349

Query: 163 ALFDTCYDF--SSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSL 220
             FDTC+ +    R  V +P +  HF +G  + LP +NY++     G  C A  P+    
Sbjct: 350 -TFDTCFKWPPPPRRMVTLPEMVLHF-DGADMELPLENYMVMDGGTGNLCLAMLPSDDG- 406

Query: 221 SIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           SIIG+ Q Q   + ++L NSL+ F P  C
Sbjct: 407 SIIGSFQHQNFHMLYDLENSLLSFVPAPC 435


>gi|125590542|gb|EAZ30892.1| hypothetical protein OsJ_14967 [Oryza sativa Japonica Group]
          Length = 516

 Score =  154 bits (390), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 97/261 (37%), Positives = 137/261 (52%), Gaps = 14/261 (5%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGL-FVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
           G   TET TL  + +  +  GCG  NEG  F   AGL+GLG G LS  SQ+    FSYCL
Sbjct: 255 GVLATETFTLAKSKLPGVVFGCGDTNEGDGFSQGAGLVGLGRGPLSLVSQLGLDKFSYCL 314

Query: 60  VDRDSDSTSTLEFDS--------SLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPI 111
              D  + S L   S        +   +  T PL++N    +FYY+ L  I+VG   + +
Sbjct: 315 TSLDDTNNSPLLLGSLAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISL 374

Query: 112 SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVAL-FDTCYD 170
             +AF + + G GG+IVDSGT++T L+ + Y AL+ AF     AL   DG  +  D C+ 
Sbjct: 375 PSSAFAVQDDGTGGVIVDSGTSITYLEVQGYRALKKAFA-AQMALPAADGSGVGLDLCFR 433

Query: 171 FSSR--SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQ 228
             ++    VEVP + FHF  G  L LPA+NY++    +G  C      S  LSIIGN QQ
Sbjct: 434 APAKGVDQVEVPRLVFHFDGGADLDLPAENYMVLDGGSGALCLTVM-GSRGLSIIGNFQQ 492

Query: 229 QGTRVSFNLRNSLIGFTPNKC 249
           Q  +  +++ +  + F P +C
Sbjct: 493 QNFQFVYDVGHDTLSFAPVQC 513


>gi|224092220|ref|XP_002309515.1| predicted protein [Populus trichocarpa]
 gi|222855491|gb|EEE93038.1| predicted protein [Populus trichocarpa]
          Length = 473

 Score =  154 bits (390), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 98/255 (38%), Positives = 131/255 (51%), Gaps = 12/255 (4%)

Query: 1   GDFVTETVTLGSASV-DNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN---ASTFS 56
           G F TET+TL S++V  N   GCG  N GLF GAAGLLGLG   LS PSQ        FS
Sbjct: 224 GFFATETLTLSSSNVFKNFLFGCGQQNSGLFRGAAGLLGLGRTKLSLPSQTAQKYKKLFS 283

Query: 57  YCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
           YCL    S S   L F   +       PL  + +   FY L +T +SVGG+ L I  + F
Sbjct: 284 YCL-PASSSSKGYLSFGGQVSKTVKFTPLSEDFKSTPFYGLDITELSVGGNKLSIDASIF 342

Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSS 176
                   G ++DSGT +TRL +  Y+AL  AF +       TDG ++FDTCYDFS   +
Sbjct: 343 -----STSGTVIDSGTVITRLPSTAYSALSSAFQKLMTDYPSTDGYSIFDTCYDFSKNET 397

Query: 177 VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSL--SIIGNVQQQGTRVS 234
           +++P V   F  G  + +     L PV+     C AFA     +  +I GN QQ+  +V 
Sbjct: 398 IKIPKVGVSFKGGVEMDIDVSGILYPVNGLKKVCLAFAGNGDDVKAAIFGNTQQKTYQVV 457

Query: 235 FNLRNSLIGFTPNKC 249
           ++     +GF P+ C
Sbjct: 458 YDDAKGRVGFAPSGC 472


>gi|115479485|ref|NP_001063336.1| Os09g0452400 [Oryza sativa Japonica Group]
 gi|51535935|dbj|BAD38017.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
 gi|113631569|dbj|BAF25250.1| Os09g0452400 [Oryza sativa Japonica Group]
 gi|215693279|dbj|BAG88661.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 441

 Score =  154 bits (390), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 101/269 (37%), Positives = 142/269 (52%), Gaps = 25/269 (9%)

Query: 1   GDFVTETVTLGSAS----VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFS 56
           G    ET T G+ S    V  ++ GCG+ N G     +G++G G G+LS  SQ+ +  FS
Sbjct: 175 GVLANETFTFGTNSTRVAVPRVSFGCGNMNAGTLFNGSGMVGFGRGALSLVSQLGSPRFS 234

Query: 57  YCLVDRDSDSTSTLEFDSSLPPNAV---------TAPLLRNHELDTFYYLGLTGISVGGD 107
           YCL    S +TS L F +    N+          + P + N  L T Y+L +TGISV GD
Sbjct: 235 YCLTSFMSPATSRLYFGAYATLNSTNTSSSGPVQSTPFIVNPALPTMYFLNMTGISVAGD 294

Query: 108 LLPISETAFKIDES-GNGGIIVDSGTAVTRLQTETYNALRDAFVRGT---RA-LSPTDGV 162
           LLPI  + F I+E+ G GG+I+DSGT VT L    Y  ++ AFV      RA  +P+D  
Sbjct: 295 LLPIDPSVFAINETDGTGGVIIDSGTTVTFLAQPAYAMVQGAFVAWVGLPRANATPSD-- 352

Query: 163 ALFDTCYDF--SSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSL 220
             FDTC+ +    R  V +P +  HF +G  + LP +NY++     G  C A  P+    
Sbjct: 353 -TFDTCFKWPPPPRRMVTLPEMVLHF-DGADMELPLENYMVMDGGTGNLCLAMLPSDDG- 409

Query: 221 SIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           SIIG+ Q Q   + ++L NSL+ F P  C
Sbjct: 410 SIIGSFQHQNFHMLYDLENSLLSFVPAPC 438


>gi|115468912|ref|NP_001058055.1| Os06g0610800 [Oryza sativa Japonica Group]
 gi|51091964|dbj|BAD35493.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
           sativa Japonica Group]
 gi|113596095|dbj|BAF19969.1| Os06g0610800 [Oryza sativa Japonica Group]
 gi|218198535|gb|EEC80962.1| hypothetical protein OsI_23679 [Oryza sativa Indica Group]
          Length = 519

 Score =  154 bits (390), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 102/257 (39%), Positives = 136/257 (52%), Gaps = 16/257 (6%)

Query: 1   GDFVTETVTLGSA-SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFS 56
           G F  +T+TL S  +V     GCG  N+GLF  AAGLLGLG G  S P Q        F+
Sbjct: 271 GFFAMDTLTLSSYDAVKGFRFGCGERNDGLFGEAAGLLGLGRGKTSLPVQTYGKYGGVFA 330

Query: 57  YCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
           +CL  R S  T  L+F +  PP   T P+L  +   TFYY+G+TGI VGG LLPI+ + F
Sbjct: 331 HCLPAR-STGTGYLDFGAGSPPATTTTPMLTGNG-PTFYYVGMTGIRVGGRLLPIAPSVF 388

Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRD--AFVRGTRALSPTDGVALFDTCYDFSSR 174
                   G IVDSGT +TRL    Y++LR   A     R       V+L DTCYDF+  
Sbjct: 389 AA-----AGTIVDSGTVITRLPPAAYSSLRSAFAAAMAARGYRKAAAVSLLDTCYDFTGM 443

Query: 175 SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFA--PTSSSLSIIGNVQQQGTR 232
           S V +PTVS  F  G  L + A   +  V ++   C AFA       + I+GN Q +   
Sbjct: 444 SQVAIPTVSLLFQGGAALDVDASGIMYTVSAS-QVCLAFAGNEDGGDVGIVGNTQLKTFG 502

Query: 233 VSFNLRNSLIGFTPNKC 249
           V++++   ++GF+P  C
Sbjct: 503 VAYDIGKKVVGFSPGAC 519


>gi|225216879|gb|ACN85177.1| aspartic proteinase nepenthesin-1 precursor [Oryza nivara]
 gi|225216896|gb|ACN85193.1| aspartic proteinase nepenthesin-1 precursor [Oryza rufipogon]
          Length = 515

 Score =  154 bits (389), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 102/257 (39%), Positives = 136/257 (52%), Gaps = 16/257 (6%)

Query: 1   GDFVTETVTLGSA-SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFS 56
           G F  +T+TL S  +V     GCG  N+GLF  AAGLLGLG G  S P Q        F+
Sbjct: 267 GFFAMDTLTLSSYDAVKGFRFGCGERNDGLFGEAAGLLGLGRGKTSLPVQTYGKYGGVFA 326

Query: 57  YCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
           +CL  R S  T  L+F +  PP   T P+L  +   TFYY+G+TGI VGG LLPI+ + F
Sbjct: 327 HCLPAR-STGTGYLDFGAGSPPATTTTPMLTGNG-PTFYYVGMTGIRVGGRLLPIAPSVF 384

Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRD--AFVRGTRALSPTDGVALFDTCYDFSSR 174
                   G IVDSGT +TRL    Y++LR   A     R       V+L DTCYDF+  
Sbjct: 385 AA-----AGTIVDSGTVITRLPPAAYSSLRSAFAAAMAARGYRKAAAVSLLDTCYDFTGM 439

Query: 175 SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFA--PTSSSLSIIGNVQQQGTR 232
           S V +PTVS  F  G  L + A   +  V ++   C AFA       + I+GN Q +   
Sbjct: 440 SQVAIPTVSLLFQGGAALDVDASGIMYTVSAS-QVCLAFAGNEDGGDVGIVGNTQLKTFG 498

Query: 233 VSFNLRNSLIGFTPNKC 249
           V++++   ++GF+P  C
Sbjct: 499 VAYDIGKKVVGFSPGAC 515


>gi|413918484|gb|AFW58416.1| hypothetical protein ZEAMMB73_998053 [Zea mays]
          Length = 475

 Score =  154 bits (389), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 99/264 (37%), Positives = 135/264 (51%), Gaps = 17/264 (6%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGL-FVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
           G   TET TL    V  +A GCG  NEG  F   AGL+GLG G LS  SQ+    FSYCL
Sbjct: 211 GVLATETFTLARQKVPGVAFGCGDTNEGDGFTQGAGLVGLGRGPLSLVSQLGIDRFSYCL 270

Query: 60  VDRDSDS--------TSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPI 111
              D  +        ++     S+    A T PL++N    +FYY+ LTG++VG   L +
Sbjct: 271 TSLDDAAGRSPLLLGSAAGISASAATAPAQTTPLVKNPSQPSFYYVSLTGLTVGSTRLAL 330

Query: 112 SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVAL-FDTCYD 170
             +AF I + G GG+IVDSGT++T L+   Y ALR AFV    +L   D   +  D C+ 
Sbjct: 331 PSSAFAIQDDGTGGVIVDSGTSITYLELRAYRALRKAFV-AHMSLPTVDASEIGLDLCFQ 389

Query: 171 -----FSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGN 225
                      V+VP +  HF  G  L LPA+NY++   ++G  C      S  LSIIGN
Sbjct: 390 GPAGAVDQDVQVQVPKLVLHFDGGADLDLPAENYMVLDSASGALCLTVM-ASRGLSIIGN 448

Query: 226 VQQQGTRVSFNLRNSLIGFTPNKC 249
            QQQ  +  +++    + F P +C
Sbjct: 449 FQQQNFQFVYDVAGDTLSFAPAEC 472


>gi|225216914|gb|ACN85210.1| aspartic proteinase nepenthesin-1 precursor [Oryza glaberrima]
          Length = 516

 Score =  154 bits (389), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 102/257 (39%), Positives = 136/257 (52%), Gaps = 16/257 (6%)

Query: 1   GDFVTETVTLGSA-SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFS 56
           G F  +T+TL S  +V     GCG  N+GLF  AAGLLGLG G  S P Q        F+
Sbjct: 268 GFFAMDTLTLSSYDAVKGFRFGCGERNDGLFGEAAGLLGLGRGKTSLPVQTYGKYGGVFA 327

Query: 57  YCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
           +CL  R S  T  L+F +  PP   T P+L  +   TFYY+G+TGI VGG LLPI+ + F
Sbjct: 328 HCLPPR-STGTGYLDFGAGSPPATTTTPMLTGNG-PTFYYVGMTGIRVGGRLLPIAPSVF 385

Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRD--AFVRGTRALSPTDGVALFDTCYDFSSR 174
                   G IVDSGT +TRL    Y++LR   A     R       V+L DTCYDF+  
Sbjct: 386 AA-----AGTIVDSGTVITRLPPAAYSSLRSAFAAAMAARGYRKAAAVSLLDTCYDFTGM 440

Query: 175 SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFA--PTSSSLSIIGNVQQQGTR 232
           S V +PTVS  F  G  L + A   +  V ++   C AFA       + I+GN Q +   
Sbjct: 441 SQVAIPTVSLLFQGGAALDVDASGIMYTVSAS-QVCLAFAGNEDGGDVGIVGNTQLKTFG 499

Query: 233 VSFNLRNSLIGFTPNKC 249
           V++++   ++GF+P  C
Sbjct: 500 VAYDIGKKVVGFSPGAC 516


>gi|115445765|ref|NP_001046662.1| Os02g0314600 [Oryza sativa Japonica Group]
 gi|46391044|dbj|BAD15987.1| putative chloroplast nucleoid DNA binding protein [Oryza sativa
           Japonica Group]
 gi|113536193|dbj|BAF08576.1| Os02g0314600 [Oryza sativa Japonica Group]
 gi|125581836|gb|EAZ22767.1| hypothetical protein OsJ_06441 [Oryza sativa Japonica Group]
 gi|215697168|dbj|BAG91162.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 514

 Score =  154 bits (388), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 108/266 (40%), Positives = 144/266 (54%), Gaps = 17/266 (6%)

Query: 1   GDFVTETVTL------GSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINA-- 52
           GD   E  T+       S  VD++  GCGH+N GLF GAAGLLGLG G+LSF SQ+ A  
Sbjct: 246 GDLALEAFTVNLTAPGASRRVDDVVFGCGHSNRGLFHGAAGLLGLGRGALSFASQLRAVY 305

Query: 53  -STFSYCLVDRDSDSTSTLEF--DSSL----PPNAVTAPLLRNHELDTFYYLGLTGISVG 105
              FSYCLVD  S   S + F  D +L      N            DTFYY+ L G+ VG
Sbjct: 306 GHAFSYCLVDHGSSVGSKIVFGDDDALLGHPRLNYTAFAPSAAAAADTFYYVQLKGVLVG 365

Query: 106 GDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFV-RGTRALSPTDGVAL 164
           G+ L IS + + + + G+GG I+DSGT ++      Y  +R AFV R  +A        +
Sbjct: 366 GEKLNISPSTWDVGKDGSGGTIIDSGTTLSYFAEPAYEVIRRAFVERMDKAYPLVADFPV 425

Query: 165 FDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT-SSSLSII 223
              CY+ S    VEVP  S  F +G V   PA+NY + +D +G  C A   T  S++SII
Sbjct: 426 LSPCYNVSGVERVEVPEFSLLFADGAVWDFPAENYFVRLDPDGIMCLAVLGTPRSAMSII 485

Query: 224 GNVQQQGTRVSFNLRNSLIGFTPNKC 249
           GN QQQ   V ++L+N+ +GF P +C
Sbjct: 486 GNFQQQNFHVLYDLQNNRLGFAPRRC 511


>gi|125539168|gb|EAY85563.1| hypothetical protein OsI_06935 [Oryza sativa Indica Group]
          Length = 514

 Score =  154 bits (388), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 108/266 (40%), Positives = 144/266 (54%), Gaps = 17/266 (6%)

Query: 1   GDFVTETVTL------GSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINA-- 52
           GD   E  T+       S  VD++  GCGH+N GLF GAAGLLGLG G+LSF SQ+ A  
Sbjct: 246 GDLALEAFTVNLTAPGASRRVDDVVFGCGHSNRGLFHGAAGLLGLGRGALSFASQLRAVY 305

Query: 53  -STFSYCLVDRDSDSTSTLEF--DSSL----PPNAVTAPLLRNHELDTFYYLGLTGISVG 105
              FSYCLVD  S   S + F  D +L      N            DTFYY+ L G+ VG
Sbjct: 306 GHAFSYCLVDHGSSVGSKIVFGDDDALLGHPRLNYTAFAPSAAAAADTFYYVQLKGVLVG 365

Query: 106 GDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFV-RGTRALSPTDGVAL 164
           G+ L IS + + + + G+GG I+DSGT ++      Y  +R AFV R  +A        +
Sbjct: 366 GEKLNISPSTWDVGKDGSGGTIIDSGTTLSYFAEPAYEVIRRAFVERMDKAYPLVADFPV 425

Query: 165 FDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT-SSSLSII 223
              CY+ S    VEVP  S  F +G V   PA+NY + +D +G  C A   T  S++SII
Sbjct: 426 LSPCYNVSGVERVEVPEFSLLFADGAVWDFPAENYFVRLDPDGIMCLAVLGTPRSAMSII 485

Query: 224 GNVQQQGTRVSFNLRNSLIGFTPNKC 249
           GN QQQ   V ++L+N+ +GF P +C
Sbjct: 486 GNFQQQNFHVLYDLQNNRLGFAPRRC 511


>gi|242044886|ref|XP_002460314.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
 gi|241923691|gb|EER96835.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
          Length = 444

 Score =  154 bits (388), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 103/265 (38%), Positives = 137/265 (51%), Gaps = 18/265 (6%)

Query: 1   GDFVTETVTLGS----ASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFS 56
           G    ET T G+     ++  I+ GCG+ N G     +G++G G GSLS  SQ+ +  FS
Sbjct: 179 GVLANETFTFGTNDTRVTLPRISFGCGNLNAGSLANGSGMVGFGRGSLSLVSQLGSPRFS 238

Query: 57  YCLVDRDSDSTSTLEFDSSLPPNAVTA------PLLRNHELDTFYYLGLTGISVGGDLLP 110
           YCL    S   S L F +    N+  A      P + N  L T Y+L +TGISVGG+ LP
Sbjct: 239 YCLTSFLSPVRSRLYFGAYATLNSTNASTVQSTPFIINPALPTMYFLNMTGISVGGNRLP 298

Query: 111 ISETAFKI-DESGNGGIIVDSGTAVTRLQTETYNALRDAFV---RGTRALSPTDGVALFD 166
           I      I D  G GG I+DSGT +T L    Y A+R+AFV     T  L      ++ D
Sbjct: 299 IDPAVLAINDTDGTGGTIIDSGTTITYLAEPAYYAVREAFVLYLNSTLPLLDVTETSVLD 358

Query: 167 TCYDF--SSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIG 224
           TC+ +    R SV +P +  HF +G    LP +NY++   S G  C A A TSS  SIIG
Sbjct: 359 TCFQWPPPPRQSVTLPQLVLHF-DGADWELPLQNYMLVDPSTGGLCLAMA-TSSDGSIIG 416

Query: 225 NVQQQGTRVSFNLRNSLIGFTPNKC 249
           + Q Q   V ++L NSL+ F P  C
Sbjct: 417 SYQHQNFNVLYDLENSLLSFVPAPC 441


>gi|168025534|ref|XP_001765289.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162683608|gb|EDQ70017.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 372

 Score =  153 bits (386), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 99/259 (38%), Positives = 142/259 (54%), Gaps = 13/259 (5%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLF--VGAAGLLGLGGGSLSFPSQINA---STF 55
           G F  ET+T    + + +  G    N G F   G  G+LGLG G +S PSQ+ +   + F
Sbjct: 114 GYFSKETITATDTAGEEVKFGASVYNTGTFGDTGGEGILGLGQGPVSMPSQLGSVLGNKF 173

Query: 56  SYCLVDRDS--DSTSTLEF-DSSLPPNAVT-APLLRNHELDTFYYLGLTGISVGGDLLPI 111
           SYCLVD  S    TST+ F D+++P   V   P++ N +  T+YY+ + GISVGG LL I
Sbjct: 174 SYCLVDWLSAGSETSTMYFGDAAVPSGEVQYTPIVPNADHPTYYYIAVQGISVGGSLLDI 233

Query: 112 SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDF 171
            ++ ++ID  G+GG I+DSGT +T LQ E +NAL  A+    R  + T    L D C++ 
Sbjct: 234 DQSVYEIDSGGSGGTIIDSGTTITYLQQEVFNALVAAYTSQVRYPTTTSATGL-DLCFNT 292

Query: 172 SSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS-SLSIIGNVQQQG 230
               S   P ++ H  +G  L LP  N  I +++N   C AFA      ++I GN+QQQ 
Sbjct: 293 RGTGSPVFPAMTIHL-DGVHLELPTANTFISLETN-IICLAFASALDFPIAIFGNIQQQN 350

Query: 231 TRVSFNLRNSLIGFTPNKC 249
             + ++L N  IGF P  C
Sbjct: 351 FDIVYDLDNMRIGFAPADC 369


>gi|168043550|ref|XP_001774247.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162674374|gb|EDQ60883.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 419

 Score =  152 bits (385), Expect = 8e-35,   Method: Compositional matrix adjust.
 Identities = 97/240 (40%), Positives = 129/240 (53%), Gaps = 8/240 (3%)

Query: 15  VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ---INASTFSYCLVDRDSDSTSTLE 71
           + N+A GCGH N G F GAAG++GLG G LS  SQ   I +  FSYCLV   S  TS + 
Sbjct: 180 IPNVAFGCGHTNLGSFAGAAGIVGLGQGPLSLISQASSITSKKFSYCLVPLGSTKTSPML 239

Query: 72  F-DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDS 130
             DS+         LL N    TFYY  LTGISV G  +      F ID SG GG I+DS
Sbjct: 240 IGDSAAAGGVAYTALLTNTANPTFYYADLTGISVSGKAVTYPVGTFSIDASGQGGFILDS 299

Query: 131 GTAVTRLQTETYNALRDAFVRGTRALSPTDGVAL-FDTCYDFSSRSSVEVPTVSFHFPEG 189
           GT +T L+T  +NAL  A ++        DG     D C+  +  ++   PT++FHF +G
Sbjct: 300 GTTLTYLETGAFNALVAA-LKAEVPFPEADGSLYGLDYCFSTAGVANPTYPTMTFHF-KG 357

Query: 190 KVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
               LP +N  + +D+ G+ C A A  S+  SI+GN+QQQ   +  +L N  +GF    C
Sbjct: 358 ADYELPPENVFVALDTGGSICLAMA-ASTGFSIMGNIQQQNHLIVHDLVNQRVGFKEANC 416


>gi|242079447|ref|XP_002444492.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
 gi|241940842|gb|EES13987.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
          Length = 441

 Score =  152 bits (385), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 95/265 (35%), Positives = 134/265 (50%), Gaps = 18/265 (6%)

Query: 1   GDFVTETVTLGSAS-----VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTF 55
           G    ET T G+AS       NI+ GCG  N G    ++G++G G G LS  SQ+  S F
Sbjct: 176 GVLANETFTFGAASSTKVRAANISFGCGSLNAGELANSSGMVGFGRGPLSLVSQLGPSRF 235

Query: 56  SYCLVDRDSDSTSTLEF---------DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGG 106
           SYCL    S + S L F         ++S      + P + N  L   Y+L + GIS+G 
Sbjct: 236 SYCLTSYLSPTPSRLYFGVFANLNSTNTSSGSPVQSTPFVINPALPNMYFLSVKGISLGT 295

Query: 107 DLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFD 166
             LPI    F I++ G GG+I+DSGT++T LQ + Y A+R          +  D     D
Sbjct: 296 KRLPIDPLVFAINDDGTGGVIIDSGTSITWLQQDAYEAVRRGLASTIPLPAMNDTDIGLD 355

Query: 167 TCYDF--SSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIG 224
           TC+ +      +V VP   FHF +G  + LP +NY++   + G  C A APTS   +IIG
Sbjct: 356 TCFQWPPPPNVTVTVPDFVFHF-DGANMTLPPENYMLIASTTGYLCLAMAPTSVG-TIIG 413

Query: 225 NVQQQGTRVSFNLRNSLIGFTPNKC 249
           N QQQ   + +++ NS + F P  C
Sbjct: 414 NYQQQNLHLLYDIANSFLSFVPAPC 438


>gi|225216988|gb|ACN85278.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
          Length = 518

 Score =  152 bits (385), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 103/260 (39%), Positives = 136/260 (52%), Gaps = 19/260 (7%)

Query: 1   GDFVTETVTLGSA-SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQI---NASTFS 56
           G F  +T+TL S  +V     GCG  NEGLF  AAGLLGLG G  S P Q        F+
Sbjct: 267 GFFAMDTLTLSSYDAVKGFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFA 326

Query: 57  YCLVDRDSDSTSTLEFDSSLPPNA---VTAPLLRNHELDTFYYLGLTGISVGGDLLPISE 113
           +CL  R S  T  L+F    P  A   +T P+L ++   TFYY+G+TGI VGG LL I +
Sbjct: 327 HCLPAR-SSGTGYLDFGPGSPAAAGARLTTPMLTDNG-PTFYYVGMTGIRVGGQLLSIPQ 384

Query: 114 TAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFV--RGTRALSPTDGVALFDTCYDF 171
           + F        G IVDSGT +TRL    Y++LR AFV     R       V+L DTCYDF
Sbjct: 385 SVFA-----TAGTIVDSGTVITRLPPPAYSSLRSAFVSAMAARGYKKAPAVSLLDTCYDF 439

Query: 172 SSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS--SSLSIIGNVQQQ 229
           +  S V +PTVS  F  G +L + A   +    S    C  FA       + I+GN Q +
Sbjct: 440 TGMSQVAIPTVSLLFQGGAILDVDASGIMYAA-SVSQVCLGFAANEDGGDVGIVGNTQLK 498

Query: 230 GTRVSFNLRNSLIGFTPNKC 249
              V++++   ++GF+P  C
Sbjct: 499 TFGVAYDIGKKVVGFSPGAC 518


>gi|30678047|ref|NP_565298.2| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|30102688|gb|AAP21262.1| At2g03200 [Arabidopsis thaliana]
 gi|110736021|dbj|BAE99983.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|330250580|gb|AEC05674.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 461

 Score =  152 bits (384), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 101/268 (37%), Positives = 141/268 (52%), Gaps = 24/268 (8%)

Query: 1   GDFVTETVTLGSA-SVDNIAIGCGHNNEGL-FVGAAGLLGLGGGSLSFPSQINASTFSYC 58
           G   TET T     S+  I  GCG  NEG  F   +GL+GLG G LS  SQ+  + FSYC
Sbjct: 196 GLLATETFTFEDENSISGIGFGCGVENEGDGFSQGSGLVGLGRGPLSLISQLKETKFSYC 255

Query: 59  LVD-RDSDSTSTLEFDSSLPPNAV-------------TAPLLRNHELDTFYYLGLTGISV 104
           L    DS+++S+L F  SL    V             T  LLRN +  +FYYL L GI+V
Sbjct: 256 LTSIEDSEASSSL-FIGSLASGIVNKTGASLDGEVTKTMSLLRNPDQPSFYYLELQGITV 314

Query: 105 GGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTD--GV 162
           G   L + ++ F++ E G GG+I+DSGT +T L+   +  L++ F   +R   P D  G 
Sbjct: 315 GAKRLSVEKSTFELAEDGTGGMIIDSGTTITYLEETAFKVLKEEFT--SRMSLPVDDSGS 372

Query: 163 ALFDTCYDF-SSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLS 221
              D C+    +  ++ VP + FHF +G  L LP +NY++   S G  C A   +S+ +S
Sbjct: 373 TGLDLCFKLPDAAKNIAVPKMIFHF-KGADLELPGENYMVADSSTGVLCLAMG-SSNGMS 430

Query: 222 IIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           I GNVQQQ   V  +L    + F P +C
Sbjct: 431 IFGNVQQQNFNVLHDLEKETVSFVPTEC 458


>gi|20197342|gb|AAC34482.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
          Length = 353

 Score =  152 bits (384), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 101/268 (37%), Positives = 141/268 (52%), Gaps = 24/268 (8%)

Query: 1   GDFVTETVTLGSA-SVDNIAIGCGHNNEGL-FVGAAGLLGLGGGSLSFPSQINASTFSYC 58
           G   TET T     S+  I  GCG  NEG  F   +GL+GLG G LS  SQ+  + FSYC
Sbjct: 88  GLLATETFTFEDENSISGIGFGCGVENEGDGFSQGSGLVGLGRGPLSLISQLKETKFSYC 147

Query: 59  LVD-RDSDSTSTLEFDSSLPPNAV-------------TAPLLRNHELDTFYYLGLTGISV 104
           L    DS+++S+L F  SL    V             T  LLRN +  +FYYL L GI+V
Sbjct: 148 LTSIEDSEASSSL-FIGSLASGIVNKTGASLDGEVTKTMSLLRNPDQPSFYYLELQGITV 206

Query: 105 GGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTD--GV 162
           G   L + ++ F++ E G GG+I+DSGT +T L+   +  L++ F   +R   P D  G 
Sbjct: 207 GAKRLSVEKSTFELAEDGTGGMIIDSGTTITYLEETAFKVLKEEFT--SRMSLPVDDSGS 264

Query: 163 ALFDTCYDF-SSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLS 221
              D C+    +  ++ VP + FHF +G  L LP +NY++   S G  C A   +S+ +S
Sbjct: 265 TGLDLCFKLPDAAKNIAVPKMIFHF-KGADLELPGENYMVADSSTGVLCLAMG-SSNGMS 322

Query: 222 IIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           I GNVQQQ   V  +L    + F P +C
Sbjct: 323 IFGNVQQQNFNVLHDLEKETVSFVPTEC 350


>gi|297817972|ref|XP_002876869.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297322707|gb|EFH53128.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 462

 Score =  152 bits (383), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 101/268 (37%), Positives = 142/268 (52%), Gaps = 24/268 (8%)

Query: 1   GDFVTETVTLGSA-SVDNIAIGCGHNNEGL-FVGAAGLLGLGGGSLSFPSQINASTFSYC 58
           G   TET T     S+  I  GCG  NEG  F   +GL+GLG G LS  SQ+  + FSYC
Sbjct: 197 GLLATETFTFEDENSISGIGFGCGVENEGDGFSQGSGLVGLGRGPLSLISQLKETKFSYC 256

Query: 59  LVD-RDSDSTSTLEFDSSLPPNAV-------------TAPLLRNHELDTFYYLGLTGISV 104
           L    DS+++S+L F  SL    V             T  LLRN +  +FYYL L GI+V
Sbjct: 257 LTSIEDSEASSSL-FIGSLASGIVNKTGANLDGEVTKTMSLLRNPDQPSFYYLELQGITV 315

Query: 105 GGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTD--GV 162
           G   L + ++ F++ E G GG+I+DSGT +T L+   +  L++ F   +R   P D  G 
Sbjct: 316 GAKRLSVEKSTFELSEDGTGGMIIDSGTTITYLEETAFKVLKEEFT--SRMSLPVDDSGS 373

Query: 163 ALFDTCYDF-SSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLS 221
              D C+   ++  ++ VP + FHF +G  L LP +NY++   S G  C A   +S+ +S
Sbjct: 374 TGLDLCFKLPNAAKNIAVPKLIFHF-KGADLELPGENYMVADSSTGVLCLAMG-SSNGMS 431

Query: 222 IIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           I GNVQQQ   V  +L    + F P +C
Sbjct: 432 IFGNVQQQNFNVLHDLEKETVTFVPTEC 459


>gi|242045564|ref|XP_002460653.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
 gi|241924030|gb|EER97174.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
          Length = 525

 Score =  152 bits (383), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 110/273 (40%), Positives = 147/273 (53%), Gaps = 24/273 (8%)

Query: 1   GDFVTETVTL------GSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINA-- 52
           GD   E+ T+       S  VD +  GCGH N GLF GAAGLLGLG G LSF SQ+ A  
Sbjct: 250 GDLALESFTVNLTAPGASRRVDGVVFGCGHRNRGLFHGAAGLLGLGRGPLSFASQLRAVY 309

Query: 53  -STFSYCLVDRDSDSTSTLEFDSSLPPNAVTA-PLLR----------NHELDTFYYLGLT 100
             TFSYCLVD  SD  S + F       A+ A P L+          +   DTFYY+ L 
Sbjct: 310 GHTFSYCLVDHGSDVGSKVVFGEDDDALALAAHPQLKYTAFAPASSSSSPADTFYYVKLK 369

Query: 101 GISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFV-RGTRALSPT 159
           G+ VGG+LL IS   + + + G+GG I+DSGT ++      Y  +R AF+ R +R+    
Sbjct: 370 GVLVGGELLNISSDTWDVGKDGSGGTIIDSGTTLSYFVEPAYQVIRHAFMDRMSRSYPLV 429

Query: 160 DGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNG--TFCFAFAPT- 216
               +   CY+ S     EVP +S  F +G V   PA+NY I +D +G    C A   T 
Sbjct: 430 PEFPVLSPCYNVSGVERPEVPELSLLFADGAVWDFPAENYFIRLDPDGGSIMCLAVLGTP 489

Query: 217 SSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
            + +SIIGN QQQ   V ++L+N+ +GF P +C
Sbjct: 490 RTGMSIIGNFQQQNFHVVYDLQNNRLGFAPRRC 522


>gi|356569916|ref|XP_003553140.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 560

 Score =  151 bits (382), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 97/270 (35%), Positives = 145/270 (53%), Gaps = 21/270 (7%)

Query: 1   GDFVTETVTLGSAS---------VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN 51
           GDF  ET T+   +         V+N+  GCGH N GLF GAAGLLGLG G LSF +Q+ 
Sbjct: 288 GDFALETFTVNLTTPEGKPELKIVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFATQLQ 347

Query: 52  A---STFSYCLVDRDSDSTSTLEFDSSLPPNAVTAPLL--------RNHELDTFYYLGLT 100
           +    +FSYCLVDR+S+S+ + +         ++ P L        + + +DTFYY+ + 
Sbjct: 348 SLYGHSFSYCLVDRNSNSSVSSKLIFGEDKELLSHPNLNFTSFVGGKENPVDTFYYVLIK 407

Query: 101 GISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTD 160
            I VGG++L I E  + +   G GG I+DSGT +T      Y  +++AF+R  +     +
Sbjct: 408 SIMVGGEVLKIPEETWHLSAQGGGGTIIDSGTTLTYFAEPAYEIIKEAFMRKIKGFPLVE 467

Query: 161 GVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT-SSS 219
                  CY+ S    +E+P  +  F +G +   P +NY I ++     C A   T  S+
Sbjct: 468 TFPPLKPCYNVSGVEKMELPEFAILFADGAMWDFPVENYFIQIEPEDVVCLAILGTPRSA 527

Query: 220 LSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           LSIIGN QQQ   + ++L+ S +G+ P KC
Sbjct: 528 LSIIGNYQQQNFHILYDLKKSRLGYAPMKC 557


>gi|449519146|ref|XP_004166596.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 752

 Score =  151 bits (381), Expect = 3e-34,   Method: Composition-based stats.
 Identities = 100/271 (36%), Positives = 142/271 (52%), Gaps = 22/271 (8%)

Query: 1   GDFVTETVTLGSAS----------VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQI 50
           GDF  ET T+   S          V+N+  GCGH N GLF GAAGLLGLG G LSF SQ+
Sbjct: 289 GDFALETFTVNLTSSTTGKSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQL 348

Query: 51  NA---STFSYCLVDRDSDSTSTLEFDSSLPPNAVTAPLL--------RNHELDTFYYLGL 99
            +    +FSYCLVDRDSD++ + +       + +T P L        + + +DTFYYL +
Sbjct: 349 QSLYGHSFSYCLVDRDSDTSVSSKLIFGEDKDLLTHPELNFTSLIAGKENPVDTFYYLQI 408

Query: 100 TGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPT 159
             I VGG+ L I E  + +   G GG I+DSGT ++      Y  +++AF+R  +     
Sbjct: 409 KSIFVGGEKLQIPEENWNLSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKLV 468

Query: 160 DGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT-SS 218
           +   +   CY+ S    +  P     F +G V   P +NY I +      C A   T  S
Sbjct: 469 EDFPILHPCYNVSGTDELNFPEFLIQFADGAVWNFPVENYFIRIQQLDIVCLAMLGTPKS 528

Query: 219 SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           +LSIIGN QQQ   + ++ +NS +G+ P +C
Sbjct: 529 ALSIIGNYQQQNFHILYDTKNSRLGYAPMRC 559


>gi|357143680|ref|XP_003573011.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 510

 Score =  151 bits (381), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 109/264 (41%), Positives = 142/264 (53%), Gaps = 15/264 (5%)

Query: 1   GDFVTETVTLG-----SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINA--- 52
           GD   E  T+      S  VD + +GCGH N GLF GAAGLLGLG G LSF SQ+ A   
Sbjct: 244 GDLALEAFTVNLTASSSRRVDGVVLGCGHRNRGLFHGAAGLLGLGRGPLSFASQLRAVYG 303

Query: 53  STFSYCLVDRDSDSTSTLEF--DSSL--PPNAVTAPLLRNHELDTFYYLGLTGISVGGDL 108
             FSYCLVD  S   S + F  D+ L   P         +   +TFYY+ L GI VGG++
Sbjct: 304 HAFSYCLVDHGSAVGSKIVFGDDNVLLSHPQLNYTAFAPSAAENTFYYVQLKGILVGGEM 363

Query: 109 LPISETAFKI-DESGNGGIIVDSGTAVTRLQTETYNALRDAFV-RGTRALSPTDGVALFD 166
           L I    + +  E G+GG I+DSGT ++      Y A+R AFV R  +A        +  
Sbjct: 364 LDIPSNTWGVSKEDGSGGTIIDSGTTLSYFPEPAYKAIRQAFVDRMDKAYPLIADFPVLS 423

Query: 167 TCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT-SSSLSIIGN 225
            CY+ S    VEVP  S  F +G V   PA+NY I +D+ G  C A   T  S++SIIGN
Sbjct: 424 PCYNVSGVERVEVPEFSLLFADGAVWDFPAENYFIRLDTEGIMCLAVLGTPRSAMSIIGN 483

Query: 226 VQQQGTRVSFNLRNSLIGFTPNKC 249
            QQQ   V ++L ++ +GF P +C
Sbjct: 484 YQQQNFHVLYDLHHNRLGFAPRRC 507


>gi|449453872|ref|XP_004144680.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
           CELL 1-like [Cucumis sativus]
          Length = 757

 Score =  151 bits (381), Expect = 3e-34,   Method: Composition-based stats.
 Identities = 100/271 (36%), Positives = 142/271 (52%), Gaps = 22/271 (8%)

Query: 1   GDFVTETVTLGSAS----------VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQI 50
           GDF  ET T+   S          V+N+  GCGH N GLF GAAGLLGLG G LSF SQ+
Sbjct: 289 GDFALETFTVNLTSSTTGKSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQL 348

Query: 51  NA---STFSYCLVDRDSDSTSTLEFDSSLPPNAVTAPLL--------RNHELDTFYYLGL 99
            +    +FSYCLVDRDSD++ + +       + +T P L        + + +DTFYYL +
Sbjct: 349 QSLYGHSFSYCLVDRDSDTSVSSKLIFGEDKDLLTHPELNFTSLIAGKENPVDTFYYLQI 408

Query: 100 TGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPT 159
             I VGG+ L I E  + +   G GG I+DSGT ++      Y  +++AF+R  +     
Sbjct: 409 KSIFVGGEKLQIPEENWNLSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKLV 468

Query: 160 DGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT-SS 218
           +   +   CY+ S    +  P     F +G V   P +NY I +      C A   T  S
Sbjct: 469 EDFPILHPCYNVSGTDELNFPEFLIQFADGAVWNFPVENYFIRIQQLDIVCLAMLGTPKS 528

Query: 219 SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           +LSIIGN QQQ   + ++ +NS +G+ P +C
Sbjct: 529 ALSIIGNYQQQNFHILYDTKNSRLGYAPMRC 559


>gi|357158688|ref|XP_003578209.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 443

 Score =  150 bits (380), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 101/267 (37%), Positives = 137/267 (51%), Gaps = 20/267 (7%)

Query: 1   GDFVTETVTLGS----ASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFS 56
           G    ET T G+     +V  IA GCG+ N G     +G++G G G LS  SQ+ +  FS
Sbjct: 176 GVLSNETFTFGTNDTRVTVPRIAFGCGNLNAGSLFNGSGMVGFGRGPLSLVSQLGSPRFS 235

Query: 57  YCLVDRDSDSTSTLEFDSSLPPNAVTA---------PLLRNHELDTFYYLGLTGISVGGD 107
           YCL    S   S L F +    N+ +A         P + N  L T YYL +TGISVGG+
Sbjct: 236 YCLTSFMSPVPSRLYFGAYATLNSTSASTGEPVQSTPFIVNPGLPTMYYLNMTGISVGGE 295

Query: 108 LLPISETAFKI-DESGNGGIIVDSGTAVTRLQTETYNALRDAFVR--GTRALSPTDGVAL 164
           LLPI  + F I D  G GG+I+DSG+ +T L    Y+ +  AF    G    + T    +
Sbjct: 296 LLPIDPSVFAINDADGTGGVIIDSGSTITYLARAAYDMVHQAFADQVGLPLTNATSLADV 355

Query: 165 FDTCYDF--SSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSI 222
            DTC+ +    R  V +P ++FHF EG  + LP +NY++     G  C A A  S   SI
Sbjct: 356 LDTCFVWPPPPRKIVTMPELAFHF-EGANMELPLENYMLIDGDTGNLCLAIA-ASDDGSI 413

Query: 223 IGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           IG+ Q Q   V ++  NSL+ FTP  C
Sbjct: 414 IGSFQHQNFHVLYDNENSLLSFTPATC 440


>gi|449466304|ref|XP_004150866.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
 gi|449485213|ref|XP_004157102.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 473

 Score =  150 bits (379), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 97/256 (37%), Positives = 135/256 (52%), Gaps = 14/256 (5%)

Query: 1   GDFVTETVTLGSASV-DNIAIGCGHNNEGLFVGAAGLLGLGGGSLSF---PSQINASTFS 56
           G F  ET+TL S  V +N   GCG NN GLF  AAGL+GLG   +S     +Q     FS
Sbjct: 225 GYFAKETLTLTSTDVIENFLFGCGQNNRGLFGSAAGLIGLGQDKISIVKQTAQKYGQVFS 284

Query: 57  YCLVDRDSDSTSTLEFDSSLPPNAVT-APLLRNHELDTFYYLGLTGISVGGDLLPISETA 115
           YCL  + S ST  L F       A+   P+ + H +  FY + + G+ VGG  +PIS + 
Sbjct: 285 YCL-PKTSSSTGYLTFGGGGGGGALKYTPITKAHGVANFYGVDIVGMKVGGTQIPISSSV 343

Query: 116 FKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRS 175
           F        G I+DSGT +TRL  + Y+AL+ AF +G         +++ DTCYD S  S
Sbjct: 344 FS-----TSGAIIDSGTVITRLPPDAYSALKSAFEKGMAKYPKAPELSILDTCYDLSKYS 398

Query: 176 SVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS--SSLSIIGNVQQQGTRV 233
           ++++P V F F  G+ L L     +    S    C AFA     S+++IIGNVQQ+  +V
Sbjct: 399 TIQIPKVGFVFKGGEELDLDGIGIMYGA-STSQVCLAFAGNQDPSTVAIIGNVQQKTLQV 457

Query: 234 SFNLRNSLIGFTPNKC 249
            +++    IGF  N C
Sbjct: 458 VYDVGGGKIGFGYNGC 473


>gi|356523155|ref|XP_003530207.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 412

 Score =  150 bits (378), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 93/261 (35%), Positives = 138/261 (52%), Gaps = 20/261 (7%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
           G+   E ++ G  SV +   GCG NN+GLF G +GL+GLG   LS  SQ NA+    FSY
Sbjct: 157 GELGVEQLSFGGVSVSDFVFGCGRNNKGLFGGVSGLMGLGRSYLSLVSQTNATFGGVFSY 216

Query: 58  CLVDRDSDSTSTLEF--DSSLPPNAV---TAPLLRNHELDTFYYLGLTGISVGGDLLPIS 112
           CL   +S ++ +L    +SS+  N        +L N +L  FY L LTGI V G      
Sbjct: 217 CLPTTESGASGSLVMGNESSVFKNVTPITYTRMLPNPQLSNFYILNLTGIDVDG------ 270

Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFS 172
             A ++   GNGG+++DSGT +TRL +  Y AL+  F++         G ++ DTC++ +
Sbjct: 271 -VALQVPSFGNGGVLIDSGTVITRLPSSVYKALKALFLKQFTGFPSAPGFSILDTCFNLT 329

Query: 173 SRSSVEVPTVSFHFPEGKVLPLPAKN--YLIPVDSNGTFCFAFAPTSSSL--SIIGNVQQ 228
               V +PT+S HF     L + A    Y++  D++   C A A  S +   +IIGN QQ
Sbjct: 330 GYDEVSIPTISMHFEGNAELKVDATGTFYVVKEDAS-QVCLALASLSDAYDTAIIGNYQQ 388

Query: 229 QGTRVSFNLRNSLIGFTPNKC 249
           +  RV ++ + S +GF    C
Sbjct: 389 RNQRVIYDTKQSKVGFAEESC 409


>gi|242050430|ref|XP_002462959.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
 gi|241926336|gb|EER99480.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
          Length = 448

 Score =  149 bits (377), Expect = 8e-34,   Method: Compositional matrix adjust.
 Identities = 100/262 (38%), Positives = 142/262 (54%), Gaps = 21/262 (8%)

Query: 5   TETVTLGSASVDN-----IAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
           +ET T GSA+ D      IA GC + +   + G+AGL+GLG GSLS  SQ+ A  FSYCL
Sbjct: 188 SETFTFGSAAADQARVPGIAFGCSNASSSDWNGSAGLVGLGRGSLSLVSQLGAGRFSYCL 247

Query: 60  VD-RDSDSTSTLEFDSSLPPNAV---TAPLLR---NHELDTFYYLGLTGISVGGDLLPIS 112
              +D++STSTL    S   N     + P +       + T+YYL LTGIS+G   L IS
Sbjct: 248 TPFQDTNSTSTLLLGPSAALNGTGVRSTPFVASPAKAPMSTYYYLNLTGISLGAKALSIS 307

Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDG--VALFDTCYD 170
             AF +   G GG+I+DSGT +T L    Y  +R A V+    L   DG      D CY 
Sbjct: 308 PDAFSLKADGTGGLIIDSGTTITSLVNAAYQQVRAA-VQSLVTLPAIDGSDSTGLDLCYA 366

Query: 171 FSSRSSV--EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFA-PTSSSLSIIGNVQ 227
             + +S    +P+++ HF +G  + LPA +Y+I    +G +C A    T  ++S  GN Q
Sbjct: 367 LPTPTSAPPAMPSMTLHF-DGADMVLPADSYMI--SGSGVWCLAMRNQTDGAMSTFGNYQ 423

Query: 228 QQGTRVSFNLRNSLIGFTPNKC 249
           QQ   + +++RN ++ F P KC
Sbjct: 424 QQNMHILYDVRNEMLSFAPAKC 445


>gi|356557203|ref|XP_003546907.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 470

 Score =  149 bits (376), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 99/263 (37%), Positives = 138/263 (52%), Gaps = 21/263 (7%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
           G+   E +  G  SV N   GCG NN+GLF GA+GL+GLG   LS  SQ NA+    FSY
Sbjct: 212 GELGIEKLGFGGISVSNFVFGCGRNNKGLFGGASGLMGLGRSELSMISQTNATFGGVFSY 271

Query: 58  CL--VDRDSDSTSTLEFDSS-----LPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP 110
           CL   D+   S S +  + S     + P A T  +L N +L  FY L LTGI VGG  L 
Sbjct: 272 CLPSTDQAGASGSLVMGNQSGVFKNVTPIAYTR-MLPNLQLSNFYILNLTGIDVGGVSLH 330

Query: 111 ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYD 170
           +  ++F     GNGG+I+DSGT ++RL    Y AL+  F+          G ++ DTC++
Sbjct: 331 VQASSF-----GNGGVILDSGTVISRLAPSVYKALKAKFLEQFSGFPSAPGFSILDTCFN 385

Query: 171 FSSRSSVEVPTVSFHFPEGKVLPLPAKN--YLIPVDSNGTFCFAFAPTSS--SLSIIGNV 226
            +    V +PT+S +F     L + A    YL+  D++   C A A  S    + IIGN 
Sbjct: 386 LTGYDQVNIPTISMYFEGNAELNVDATGIFYLVKEDAS-RVCLALASLSDEYEMGIIGNY 444

Query: 227 QQQGTRVSFNLRNSLIGFTPNKC 249
           QQ+  RV ++ + S +GF    C
Sbjct: 445 QQRNQRVLYDAKLSQVGFAKEPC 467


>gi|226503109|ref|NP_001147206.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
 gi|195608496|gb|ACG26078.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
 gi|413921850|gb|AFW61782.1| aspartic proteinase nepenthesin-2 [Zea mays]
          Length = 441

 Score =  149 bits (376), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 94/265 (35%), Positives = 132/265 (49%), Gaps = 18/265 (6%)

Query: 1   GDFVTETVTLGSAS-----VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTF 55
           G    ET T G+A+       NIA GCG  N G    ++G++G G G LS  SQ+  S F
Sbjct: 176 GVLANETFTFGAANSTKVRATNIAFGCGSLNAGDLANSSGMVGFGRGPLSLVSQLGPSRF 235

Query: 56  SYCLVDRDSDSTSTLEF---------DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGG 106
           SYCL    S + S L F         ++S      + P + N  L   Y+L L  IS+G 
Sbjct: 236 SYCLTSYLSATPSRLYFGVYANLSSTNTSSGSPVQSTPFVINPALPNMYFLSLKAISLGT 295

Query: 107 DLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFD 166
            LLPI    F I++ G GG+I+DSGT++T LQ + Y A+R   V      +  D     D
Sbjct: 296 KLLPIDPLVFAINDDGTGGVIIDSGTSITWLQQDAYEAVRRGLVSAIPLPAMNDTDIGLD 355

Query: 167 TCYDF--SSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIG 224
           TC+ +      +V VP + FHF    +  LP +NY++   + G  C   APT    +IIG
Sbjct: 356 TCFQWPPPPNVTVTVPDLVFHFDSANMTLLP-ENYMLIASTTGYLCLVMAPTGVG-TIIG 413

Query: 225 NVQQQGTRVSFNLRNSLIGFTPNKC 249
           N QQQ   + +++ NS + F P  C
Sbjct: 414 NYQQQNLHLLYDIGNSFLSFVPAPC 438


>gi|223946005|gb|ACN27086.1| unknown [Zea mays]
          Length = 336

 Score =  149 bits (376), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 94/265 (35%), Positives = 132/265 (49%), Gaps = 18/265 (6%)

Query: 1   GDFVTETVTLGSAS-----VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTF 55
           G    ET T G+A+       NIA GCG  N G    ++G++G G G LS  SQ+  S F
Sbjct: 71  GVLANETFTFGAANSTKVRATNIAFGCGSLNAGDLANSSGMVGFGRGPLSLVSQLGPSRF 130

Query: 56  SYCLVDRDSDSTSTLEF---------DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGG 106
           SYCL    S + S L F         ++S      + P + N  L   Y+L L  IS+G 
Sbjct: 131 SYCLTSYLSATPSRLYFGVYANLSSTNTSSGSPVQSTPFVINPALPNMYFLSLKAISLGT 190

Query: 107 DLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFD 166
            LLPI    F I++ G GG+I+DSGT++T LQ + Y A+R   V      +  D     D
Sbjct: 191 KLLPIDPLVFAINDDGTGGVIIDSGTSITWLQQDAYEAVRRGLVSAIPLPAMNDTDIGLD 250

Query: 167 TCYDF--SSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIG 224
           TC+ +      +V VP + FHF    +  LP +NY++   + G  C   APT    +IIG
Sbjct: 251 TCFQWPPPPNVTVTVPDLVFHFDSANMTLLP-ENYMLIASTTGYLCLVMAPTGVG-TIIG 308

Query: 225 NVQQQGTRVSFNLRNSLIGFTPNKC 249
           N QQQ   + +++ NS + F P  C
Sbjct: 309 NYQQQNLHLLYDIGNSFLSFVPAPC 333


>gi|356526294|ref|XP_003531753.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 414

 Score =  149 bits (375), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 97/263 (36%), Positives = 140/263 (53%), Gaps = 23/263 (8%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
           G+   E ++ G  SV +   GCG NN+GLF G +GL+GLG   LS  SQ NA+    FSY
Sbjct: 158 GELGVEALSFGGVSVSDFVFGCGRNNKGLFGGVSGLMGLGRSYLSLVSQTNATFGGVFSY 217

Query: 58  CLVDRDSDSTSTLEF--DSSLPPNA---VTAPLLRNHELDTFYYLGLTGISVGGDLL--P 110
           CL   ++ S+ +L    +SS+  NA       +L N +L  FY L LTGI VGG  L  P
Sbjct: 218 CLPTTEAGSSGSLVMGNESSVFKNANPITYTRMLSNPQLSNFYILNLTGIDVGGVALKAP 277

Query: 111 ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYD 170
           +S         GNGGI++DSGT +TRL +  Y AL+  F++         G ++ DTC++
Sbjct: 278 LS--------FGNGGILIDSGTVITRLPSSVYKALKAEFLKKFTGFPSAPGFSILDTCFN 329

Query: 171 FSSRSSVEVPTVSFHFPEGKVLPLPAKN--YLIPVDSNGTFCFAFAPTSSSL--SIIGNV 226
            +    V +PT+S  F     L + A    Y++  D++   C A A  S +   +IIGN 
Sbjct: 330 LTGYDEVSIPTISLRFEGNAQLNVDATGTFYVVKEDAS-QVCLALASLSDAYDTAIIGNY 388

Query: 227 QQQGTRVSFNLRNSLIGFTPNKC 249
           QQ+  RV ++ + S +GF    C
Sbjct: 389 QQRNQRVIYDTKQSKVGFAEEPC 411


>gi|225216995|gb|ACN85284.1| aspartic proteinase nepenthesin-1 precursor [Oryza australiensis]
          Length = 519

 Score =  149 bits (375), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 102/260 (39%), Positives = 134/260 (51%), Gaps = 19/260 (7%)

Query: 1   GDFVTETVTLGSA-SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQI---NASTFS 56
           G F  +T+TL S  +V     GCG  NEGLF  AAGLLGLG G  S P Q        F+
Sbjct: 268 GFFAMDTLTLSSYDAVKGFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFA 327

Query: 57  YCLVDRDSDSTSTLEFDSSLPPNA---VTAPLLRNHELDTFYYLGLTGISVGGDLLPISE 113
           +CL  R S  T  L+F    P  A   +T P+L ++   TFYY+G+TGI VGG LL I +
Sbjct: 328 HCLPAR-SSGTGYLDFGPGSPAAAGARLTTPMLTDNG-PTFYYVGMTGIRVGGQLLSIPQ 385

Query: 114 TAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAF--VRGTRALSPTDGVALFDTCYDF 171
           + F        G IVDSGT +TRL    Y++LR AF      R       V+L DTCYDF
Sbjct: 386 SVFT-----TAGTIVDSGTVITRLPPAAYSSLRSAFASAMAARGYKKAPAVSLLDTCYDF 440

Query: 172 SSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS--SSLSIIGNVQQQ 229
           +  S V +PTVS  F  G  L + A   +    S    C  FA       + I+GN Q +
Sbjct: 441 TGMSQVAIPTVSLLFQGGARLDVDASGIMYAA-SVSQVCLGFAANEDGGDVGIVGNTQLK 499

Query: 230 GTRVSFNLRNSLIGFTPNKC 249
              V++++   ++GF+P  C
Sbjct: 500 TFGVAYDIGKKVVGFSPGAC 519


>gi|302768196|ref|XP_002967518.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
 gi|300165509|gb|EFJ32117.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
          Length = 398

 Score =  149 bits (375), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 103/271 (38%), Positives = 146/271 (53%), Gaps = 24/271 (8%)

Query: 1   GDFVTETVTLGSA-----SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN---A 52
           G   +ETVTL S      +  NIA GCGH N G F  A+GL+GLG G+LSF SQ+     
Sbjct: 126 GTLSSETVTLTSTQGEKLAAKNIAFGCGHLNRGSFNDASGLVGLGRGNLSFVSQLGDLFG 185

Query: 53  STFSYCLVD-RDSDSTSTLEF--------DSSLPPNAVTAPLLRNHELDTFYYLGLTGIS 103
             FSYCLV  RD+ S ++  F         S    +    P++ N  +++FYY+ L  IS
Sbjct: 186 HKFSYCLVPWRDAPSKTSPMFFGDESSSHSSGKKLHYAFTPMIHNPAMESFYYVKLKDIS 245

Query: 104 VGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDG-V 162
           + G  L I   +F I   G+GG+I DSGT +T L    Y  +  A +R   +    DG  
Sbjct: 246 IAGRALRIPAGSFDIKPDGSGGMIFDSGTTLTLLPDAPYQIVLRA-LRSKISFPKIDGSS 304

Query: 163 ALFDTCYDFS-SRSS--VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTF-CFAFAPTSS 218
           A  D CYD S S++S  +++P + FHF EG    LP +NY I  +  GT  C A   ++ 
Sbjct: 305 AGLDLCYDVSGSKASYKMKIPAMVFHF-EGADYQLPVENYFIAANDAGTIVCLAMVSSNM 363

Query: 219 SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
            + I GN+ QQ  RV +++ +S IG+ P++C
Sbjct: 364 DIGIYGNMMQQNFRVMYDIGSSKIGWAPSQC 394


>gi|357494221|ref|XP_003617399.1| 60S ribosomal protein L18a [Medicago truncatula]
 gi|355518734|gb|AET00358.1| 60S ribosomal protein L18a [Medicago truncatula]
          Length = 749

 Score =  148 bits (374), Expect = 2e-33,   Method: Composition-based stats.
 Identities = 103/270 (38%), Positives = 148/270 (54%), Gaps = 22/270 (8%)

Query: 1   GDFVTETVTLGSAS---------VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN 51
           GDF  ET T+   +         V+N+  GCGH N GLF GAAGLLGLG G LSF SQ+ 
Sbjct: 285 GDFALETFTVNLTTPNGKSEQKHVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFASQLQ 344

Query: 52  A---STFSYCLVDRDSDST--STLEF--DSSL--PPNAVTAPLLRNHE--LDTFYYLGLT 100
           +    +FSYCLVDR+SD++  S L F  D  L   PN      +   E  +DTFYY+G+ 
Sbjct: 345 SIYGHSFSYCLVDRNSDTSVSSKLIFGEDKELLSHPNLNFTSFVGGEENSVDTFYYVGIK 404

Query: 101 GISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTD 160
            I V G++L I E  + + + G GG I+DSGT +T      Y  +++AF++  +     +
Sbjct: 405 SIMVDGEVLKIPEETWHLSKEGGGGTIIDSGTTLTYFAEPAYEIIKEAFMKKIKGYELVE 464

Query: 161 GVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT-SSS 219
           G      CY+ S    +E+P     F +G +   P +NY I ++ +   C A   T  S+
Sbjct: 465 GFPPLKPCYNVSGIEKMELPDFGILFSDGAMWDFPVENYFIQIEPD-LVCLAILGTPKSA 523

Query: 220 LSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           LSIIGN QQQ   + ++++ S +G+ P KC
Sbjct: 524 LSIIGNYQQQNFHILYDMKKSRLGYAPMKC 553


>gi|145693992|gb|ABP93696.1| unknown protein isoform 1 [Lemna minor]
          Length = 350

 Score =  148 bits (373), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 102/255 (40%), Positives = 134/255 (52%), Gaps = 15/255 (5%)

Query: 1   GDFVTETVTLGSASV-DNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFS 56
           G   TET TL + +V +N   GCG NN+GLF GAAGL+GLG    S  SQ+  S    FS
Sbjct: 104 GFLATETFTLAAGNVFNNFIFGCGQNNQGLFTGAAGLIGLGRSPYSLNSQLATSLGNIFS 163

Query: 57  YCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
           YCL    S +T  L   + L     TA +L N    T Y++ L GISVGG  L +S T F
Sbjct: 164 YCL-PSTSSATGYLNIGNPLRTPGYTA-MLTNSRAPTLYFIDLIGISVGGTRLALSSTVF 221

Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSS 176
           +     + G I+DSGT +TRL    Y ALR AF       +     ++ DTCYDFS  ++
Sbjct: 222 Q-----SVGTIIDSGTVITRLPPTAYGALRTAFRAAMTQYTRAAAASILDTCYDFSRTTT 276

Query: 177 VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSS--LSIIGNVQQQGTRVS 234
           V  PT+  H+  G  + +P    +  V S+   C AFA  S S  + IIGNVQQ+   V+
Sbjct: 277 VTFPTIKLHY-TGLDVTIPGAG-VFYVISSSQVCLAFAGNSDSTQIGIIGNVQQRTMEVT 334

Query: 235 FNLRNSLIGFTPNKC 249
           ++     IGF    C
Sbjct: 335 YDNALKRIGFAAGAC 349


>gi|22326716|ref|NP_196638.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|18700103|gb|AAL77663.1| AT5g10770/T30N20_40 [Arabidopsis thaliana]
 gi|24111269|gb|AAN46758.1| At5g10770/T30N20_40 [Arabidopsis thaliana]
 gi|332004211|gb|AED91594.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 474

 Score =  147 bits (372), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 102/258 (39%), Positives = 134/258 (51%), Gaps = 18/258 (6%)

Query: 1   GDFVTETVTLGSASV-DNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFS 56
           G    E  TL ++ V D +  GCG NN+GLF G AGLLGLG   LSFPSQ   +    FS
Sbjct: 225 GFLAKEKFTLTNSDVFDGVYFGCGENNQGLFTGVAGLLGLGRDKLSFPSQTATAYNKIFS 284

Query: 57  YCLVDRDSDSTSTLEFDSSLPPNAVT-APLLRNHELDTFYYLGLTGISVGGDLLPISETA 115
           YCL    S  T  L F S+    +V   P+    +  +FY L +  I+VGG  LPI  T 
Sbjct: 285 YCLPSSAS-YTGHLTFGSAGISRSVKFTPISTITDGTSFYGLNIVAITVGGQKLPIPSTV 343

Query: 116 FKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRS 175
           F        G ++DSGT +TRL  + Y ALR +F         T GV++ DTC+D S   
Sbjct: 344 FSTP-----GALIDSGTVITRLPPKAYAALRSSFKAKMSKYPTTSGVSILDTCFDLSGFK 398

Query: 176 SVEVPTVSFHFPEGKVLPLPAKN--YLIPVDSNGTFCFAFAPTS--SSLSIIGNVQQQGT 231
           +V +P V+F F  G V+ L +K   Y+  +      C AFA  S  S+ +I GNVQQQ  
Sbjct: 399 TVTIPKVAFSFSGGAVVELGSKGIFYVFKISQ---VCLAFAGNSDDSNAAIFGNVQQQTL 455

Query: 232 RVSFNLRNSLIGFTPNKC 249
            V ++     +GF PN C
Sbjct: 456 EVVYDGAGGRVGFAPNGC 473


>gi|8979711|emb|CAB96832.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
           thaliana]
          Length = 446

 Score =  147 bits (372), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 102/258 (39%), Positives = 134/258 (51%), Gaps = 18/258 (6%)

Query: 1   GDFVTETVTLGSASV-DNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFS 56
           G    E  TL ++ V D +  GCG NN+GLF G AGLLGLG   LSFPSQ   +    FS
Sbjct: 197 GFLAKEKFTLTNSDVFDGVYFGCGENNQGLFTGVAGLLGLGRDKLSFPSQTATAYNKIFS 256

Query: 57  YCLVDRDSDSTSTLEFDSSLPPNAVT-APLLRNHELDTFYYLGLTGISVGGDLLPISETA 115
           YCL    S  T  L F S+    +V   P+    +  +FY L +  I+VGG  LPI  T 
Sbjct: 257 YCLPSSAS-YTGHLTFGSAGISRSVKFTPISTITDGTSFYGLNIVAITVGGQKLPIPSTV 315

Query: 116 FKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRS 175
           F        G ++DSGT +TRL  + Y ALR +F         T GV++ DTC+D S   
Sbjct: 316 FSTP-----GALIDSGTVITRLPPKAYAALRSSFKAKMSKYPTTSGVSILDTCFDLSGFK 370

Query: 176 SVEVPTVSFHFPEGKVLPLPAKN--YLIPVDSNGTFCFAFAPTS--SSLSIIGNVQQQGT 231
           +V +P V+F F  G V+ L +K   Y+  +      C AFA  S  S+ +I GNVQQQ  
Sbjct: 371 TVTIPKVAFSFSGGAVVELGSKGIFYVFKISQ---VCLAFAGNSDDSNAAIFGNVQQQTL 427

Query: 232 RVSFNLRNSLIGFTPNKC 249
            V ++     +GF PN C
Sbjct: 428 EVVYDGAGGRVGFAPNGC 445


>gi|302753526|ref|XP_002960187.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
 gi|300171126|gb|EFJ37726.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
          Length = 398

 Score =  147 bits (372), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 103/271 (38%), Positives = 145/271 (53%), Gaps = 24/271 (8%)

Query: 1   GDFVTETVTLGSA-----SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN---A 52
           G   +ETVTL S      +  NIA GCGH N G F  A+GL+GLG G+LSF SQ+     
Sbjct: 126 GTLSSETVTLTSTQGEKLAAKNIAFGCGHLNRGSFNDASGLVGLGRGNLSFVSQLGDLFG 185

Query: 53  STFSYCLVD-RDSDSTSTLEF--------DSSLPPNAVTAPLLRNHELDTFYYLGLTGIS 103
             FSYCLV  RD+ S ++  F         S    +    P++ N  +++FYY+ L  IS
Sbjct: 186 HKFSYCLVPWRDAPSKTSPMFFGDESSSHSSGKKLHYAFTPMIHNPAMESFYYVKLKDIS 245

Query: 104 VGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDG-V 162
           + G  L I   +F I   G+GG+I DSGT +T L    Y  +  A +R   +    DG  
Sbjct: 246 IAGRALRIPAGSFDIKPDGSGGMIFDSGTTLTLLPDAPYQIVLRA-LRSKVSFPEIDGSS 304

Query: 163 ALFDTCYDFS-SRSSV--EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTF-CFAFAPTSS 218
           A  D CYD S S++S   ++P + FHF EG    LP +NY I  +  GT  C A   ++ 
Sbjct: 305 AGLDLCYDVSGSKASYKKKIPAMVFHF-EGADHQLPVENYFIAANDAGTIVCLAMVSSNM 363

Query: 219 SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
            + I GN+ QQ  RV +++ +S IG+ P++C
Sbjct: 364 DIGIYGNMMQQNFRVMYDIGSSKIGWAPSQC 394


>gi|225216930|gb|ACN85225.1| aspartic proteinase nepenthesin-1 precursor [Oryza punctata]
 gi|225216938|gb|ACN85232.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
          Length = 516

 Score =  147 bits (372), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 99/257 (38%), Positives = 131/257 (50%), Gaps = 15/257 (5%)

Query: 1   GDFVTETVTLGSA-SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQI---NASTFS 56
           G F  +T+TL S  +V     GCG  NEGLF  AAGLLGLG G  S P Q        F+
Sbjct: 267 GFFAMDTLTLSSYDAVKGFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFA 326

Query: 57  YCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
           +CL  R S  T  L+F +  P   +T   +      TFYY+GLTGI VGG LL I ++ F
Sbjct: 327 HCLPAR-STGTGYLDFGAGSPAARLTTTPMLVDNGPTFYYVGLTGIRVGGRLLYIPQSVF 385

Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRG--TRALSPTDGVALFDTCYDFSSR 174
                   G IVDSGT +TRL    Y++LR AF      R       V+L DTCYDF+  
Sbjct: 386 -----ATAGTIVDSGTVITRLPPAAYSSLRSAFAAAMSARGYKKAPAVSLLDTCYDFAGM 440

Query: 175 SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS--SLSIIGNVQQQGTR 232
           S V +PTVS  F  G  L + A   +    ++   C AFA       + I+GN Q +   
Sbjct: 441 SQVAIPTVSLLFQGGARLDVDASGIMYAASAS-QVCLAFAANEDGGDVGIVGNTQLKTFG 499

Query: 233 VSFNLRNSLIGFTPNKC 249
           V++++   ++ F+P  C
Sbjct: 500 VAYDIGKKVVSFSPGAC 516


>gi|356537728|ref|XP_003537377.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 543

 Score =  147 bits (370), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 96/284 (33%), Positives = 141/284 (49%), Gaps = 42/284 (14%)

Query: 1   GDFVTETVTLGS---------ASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN 51
           GDF +ET T+             V ++  GCGH N+G F GA+GLLGLG G +SFPSQI 
Sbjct: 264 GDFASETFTVNLTWPNGKEKFKQVVDVMFGCGHWNKGFFYGASGLLGLGRGPISFPSQIQ 323

Query: 52  A---STFSYCLVDRDSDS--TSTLEFDSSLPPNAVTAPLLRNHEL-------------DT 93
           +    +FSYCL D  S++  +S L F            LL NH L             +T
Sbjct: 324 SIYGHSFSYCLTDLFSNTSVSSKLIFGED-------KELLNNHNLNFTTLLAGEETPDET 376

Query: 94  FYYLGLTGISVGGDLLPISETAFKIDES-----GNGGIIVDSGTAVTRLQTETYNALRDA 148
           FYYL +  I VGG++L ISE  +            GG I+DSG+ +T      Y+ +++A
Sbjct: 377 FYYLQIKSIMVGGEVLDISEQTWHWSSEGAAADAGGGTIIDSGSTLTFFPDSAYDIIKEA 436

Query: 149 FVRGTRALSPTDGVALFDTCYDFS-SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNG 207
           F +  +         +   CY+ S +   VE+P    HF +G V   PA+NY    + + 
Sbjct: 437 FEKKIKLQQIAADDFVMSPCYNVSGAMMQVELPDFGIHFADGGVWNFPAENYFYQYEPDE 496

Query: 208 TFCFAF--APTSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
             C A    P  S L+IIGN+ QQ   + ++++ S +G++P +C
Sbjct: 497 VICLAIMKTPNHSHLTIIGNLLQQNFHILYDVKRSRLGYSPRRC 540


>gi|302143530|emb|CBI22091.3| unnamed protein product [Vitis vinifera]
          Length = 360

 Score =  146 bits (369), Expect = 8e-33,   Method: Compositional matrix adjust.
 Identities = 95/270 (35%), Positives = 143/270 (52%), Gaps = 21/270 (7%)

Query: 1   GDFVTETVTLGSA---------SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN 51
           GDF  ET T+             V+N+  GCGH N GLF GAAGLLGLG G LSF SQ+ 
Sbjct: 88  GDFALETFTVNLTMSSGKPELRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQ 147

Query: 52  A---STFSYCLVDRDSDSTSTLEFDSSLPPNAVTAPLL--------RNHELDTFYYLGLT 100
           +    +FSYCLVDR+SD+  + +       + ++ P L        + + +DTFYY+ + 
Sbjct: 148 SLYGHSFSYCLVDRNSDANVSSKLIFGEDKDLLSHPELNFTTLVAGKENPVDTFYYVQIK 207

Query: 101 GISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTD 160
            I VGG+++ I E  ++I   G+GG I+DSGT ++      Y  +++AF+   +      
Sbjct: 208 SIVVGGEVVNIPEEKWQIATDGSGGTIIDSGTTLSYFAEPAYQVIKEAFMAKVKGYPVVK 267

Query: 161 GVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT-SSS 219
              + + CY+ +     ++P     F +G V   P +NY I ++     C A   T  S+
Sbjct: 268 DFPVLEPCYNVTGVEQPDLPDFGIVFSDGAVWNFPVENYFIEIEPREVVCLAILGTPPSA 327

Query: 220 LSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           LSIIGN QQQ   + ++ + S +GF P KC
Sbjct: 328 LSIIGNYQQQNFHILYDTKKSRLGFAPTKC 357


>gi|356553775|ref|XP_003545228.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 559

 Score =  146 bits (368), Expect = 8e-33,   Method: Compositional matrix adjust.
 Identities = 94/269 (34%), Positives = 142/269 (52%), Gaps = 20/269 (7%)

Query: 1   GDFVTETVTLGSAS---------VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN 51
           GDF  ET T+   +         V+N+  GCGH N GLF GAAGLLGLG G LSF SQ+ 
Sbjct: 288 GDFALETFTVNLTTPNGKSELKHVENVMFGCGHWNRGLFHGAAGLLGLGKGPLSFASQMQ 347

Query: 52  A---STFSYCLVDRDSDSTSTLEFDSSLPPNAVTAPLL--------RNHELDTFYYLGLT 100
           +    +FSYCLVDR+S+++ + +         ++ P L        ++  +DTFYY+ + 
Sbjct: 348 SLYGQSFSYCLVDRNSNASVSSKLIFGEDKELLSHPNLNFTSFGGGKDGSVDTFYYVQIN 407

Query: 101 GISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTD 160
            + V  ++L I E  + +   G GG I+DSGT +T      Y  +++AFVR  +     +
Sbjct: 408 SVMVDDEVLKIPEETWHLSSEGAGGTIIDSGTTLTYFAEPAYEIIKEAFVRKIKGYELVE 467

Query: 161 GVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSL 220
           G+     CY+ S    +E+P     F +G V   P +NY I +D +           S+L
Sbjct: 468 GLPPLKPCYNVSGIEKMELPDFGILFADGAVWNFPVENYFIQIDPDVVCLAILGNPRSAL 527

Query: 221 SIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           SIIGN QQQ   + ++++ S +G+ P KC
Sbjct: 528 SIIGNYQQQNFHILYDMKKSRLGYAPMKC 556


>gi|297811185|ref|XP_002873476.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297319313|gb|EFH49735.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 475

 Score =  146 bits (368), Expect = 9e-33,   Method: Compositional matrix adjust.
 Identities = 101/256 (39%), Positives = 132/256 (51%), Gaps = 14/256 (5%)

Query: 1   GDFVTETVTLGSASV-DNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFS 56
           G    +  TL S+ V D +  GCG NN+GLF G AGLLGLG   LSFPSQ   +    FS
Sbjct: 226 GFLAKDKFTLTSSDVFDGVYFGCGENNQGLFTGVAGLLGLGRDKLSFPSQTATAYNKIFS 285

Query: 57  YCLVDRDSDSTSTLEFDSSLPPNAVT-APLLRNHELDTFYYLGLTGISVGGDLLPISETA 115
           YCL    S  T  L F S+    +V   P+    +  +FY L +  I+VGG  LPI  T 
Sbjct: 286 YCLPSSAS-YTGHLTFGSAGISRSVKFTPISTITDGTSFYGLNIVAITVGGQKLPIPSTV 344

Query: 116 FKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRS 175
           F        G ++DSGT +TRL  + Y ALR +F         T GV++ DTC+D S   
Sbjct: 345 FSTP-----GALIDSGTVITRLPPKAYAALRSSFKAKMSKYPTTSGVSILDTCFDLSGFK 399

Query: 176 SVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS--SSLSIIGNVQQQGTRV 233
           +V +P V+F F  G V+ L +K        +   C AFA  S  S+ +I GNVQQQ   V
Sbjct: 400 TVTIPKVAFSFSGGAVVELGSKGIFYAFKIS-QVCLAFAGNSDDSNAAIFGNVQQQTLEV 458

Query: 234 SFNLRNSLIGFTPNKC 249
            ++     +GF PN C
Sbjct: 459 VYDGAGGRVGFAPNGC 474


>gi|293332735|ref|NP_001168472.1| uncharacterized protein LOC100382248 [Zea mays]
 gi|223948487|gb|ACN28327.1| unknown [Zea mays]
          Length = 434

 Score =  145 bits (367), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 97/259 (37%), Positives = 132/259 (50%), Gaps = 18/259 (6%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQI---NASTFSY 57
           G +  +T+TL   ++ N   GCG  N GLF  AAGLLGLG G  S P Q        F+Y
Sbjct: 184 GFYAQDTLTLAYDTIKNFRFGCGEKNRGLFGRAAGLLGLGRGKTSLPVQAYDKYGGVFAY 243

Query: 58  CLVDRDSDSTSTLEFDSSLP-PNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
           CL    S  T  L+     P  NA   P+L +    TFYY+G+TGI VGG +LPI  + F
Sbjct: 244 CL-PATSAGTGFLDLGPGAPAANARLTPMLVDRG-PTFYYVGMTGIKVGGHVLPIPGSVF 301

Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRAL--SPTDGVALFDTCYDFSSR 174
                   G +VDSGT +TRL    Y  LR AF +  + L  S     ++ DTCYD +  
Sbjct: 302 S-----TAGTLVDSGTVITRLPPSAYAPLRSAFSKAMQGLGYSAAPAFSILDTCYDLTGH 356

Query: 175 S--SVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS--SSLSIIGNVQQQG 230
              S+ +P VS  F  G  L + A   L   D +   C AFAP +  + ++I+GN QQ+ 
Sbjct: 357 KGGSIALPAVSLVFQGGACLDVDASGILYVADVSQA-CLAFAPNADDTDVAIVGNTQQKT 415

Query: 231 TRVSFNLRNSLIGFTPNKC 249
             V +++   ++GF P  C
Sbjct: 416 HGVLYDIGKKIVGFAPGAC 434


>gi|413943688|gb|AFW76337.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
 gi|413943689|gb|AFW76338.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
          Length = 499

 Score =  145 bits (367), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 97/259 (37%), Positives = 132/259 (50%), Gaps = 18/259 (6%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQI---NASTFSY 57
           G +  +T+TL   ++ N   GCG  N GLF  AAGLLGLG G  S P Q        F+Y
Sbjct: 249 GFYAQDTLTLAYDTIKNFRFGCGEKNRGLFGRAAGLLGLGRGKTSLPVQAYDKYGGVFAY 308

Query: 58  CLVDRDSDSTSTLEFDSSLP-PNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
           CL    S  T  L+     P  NA   P+L +    TFYY+G+TGI VGG +LPI  + F
Sbjct: 309 CL-PATSAGTGFLDLGPGAPAANARLTPMLVDRG-PTFYYVGMTGIKVGGHVLPIPGSVF 366

Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRAL--SPTDGVALFDTCYDFSSR 174
                   G +VDSGT +TRL    Y  LR AF +  + L  S     ++ DTCYD +  
Sbjct: 367 S-----TAGTLVDSGTVITRLPPSAYAPLRSAFSKAMQGLGYSAAPAFSILDTCYDLTGH 421

Query: 175 S--SVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS--SSLSIIGNVQQQG 230
              S+ +P VS  F  G  L + A   L   D +   C AFAP +  + ++I+GN QQ+ 
Sbjct: 422 KGGSIALPAVSLVFQGGACLDVDASGILYVADVSQA-CLAFAPNADDTDVAIVGNTQQKT 480

Query: 231 TRVSFNLRNSLIGFTPNKC 249
             V +++   ++GF P  C
Sbjct: 481 HGVLYDIGKKIVGFAPGAC 499


>gi|414590468|tpg|DAA41039.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
          Length = 469

 Score =  145 bits (367), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 96/263 (36%), Positives = 140/263 (53%), Gaps = 21/263 (7%)

Query: 5   TETVTLGSASVDN-----IAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
           +ET T GS++ D      +A GC + +   + G+AGL+GLG GSLS  SQ+ A  FSYCL
Sbjct: 207 SETFTFGSSAADQARVPGVAFGCSNASSSDWNGSAGLVGLGRGSLSLVSQLGAGRFSYCL 266

Query: 60  VD-RDSDSTSTLEFDSSLPPNAV---TAPLLRN---HELDTFYYLGLTGISVGGDLLPIS 112
              +D++STSTL    S   N     + P + +     + T+YYL LTGIS+G   LPIS
Sbjct: 267 TPFQDTNSTSTLLLGPSAALNGTGVRSTPFVASPARAPMSTYYYLNLTGISLGAKALPIS 326

Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDG--VALFDTCYD 170
             AF +   G GG+I+DSGT +T L    Y  +R A       L   DG      D C+ 
Sbjct: 327 PGAFSLKPDGTGGLIIDSGTTITSLANAAYQQVRAAVKSLVTTLPTVDGSDSTGLDLCFA 386

Query: 171 FSSRSSVE---VPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFA-PTSSSLSIIGNV 226
             + +S     +P+++ HF +G  + LPA +Y+I    +G +C A    T  ++S  GN 
Sbjct: 387 LPAPTSAPPAVLPSMTLHF-DGADMVLPADSYMI--SGSGVWCLAMRNQTDGAMSTFGNY 443

Query: 227 QQQGTRVSFNLRNSLIGFTPNKC 249
           QQQ   + +++R   + F P KC
Sbjct: 444 QQQNMHILYDVREETLSFAPAKC 466


>gi|225217039|gb|ACN85323.1| aspartic proteinase nepenthesin-1 precursor [Oryza brachyantha]
          Length = 287

 Score =  145 bits (367), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 102/261 (39%), Positives = 134/261 (51%), Gaps = 20/261 (7%)

Query: 1   GDFVTETVTLGSA-SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQI---NASTFS 56
           G F  +T+TL S  ++     GCG  NEGLF  AAGLLGLG G  S P Q        F+
Sbjct: 35  GFFAMDTLTLSSHDAIKGFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFA 94

Query: 57  YCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHEL----DTFYYLGLTGISVGGDLLPIS 112
           +C   R S  T  LEF     P AV+A L     L     TFYY+G+TGI VGG LLPI 
Sbjct: 95  HCFPAR-SSGTGYLEFGPGSSP-AVSAKLSTTPMLIDTGPTFYYVGMTGIRVGGKLLPIP 152

Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFV--RGTRALSPTDGVALFDTCYD 170
           ++ F        G IVDSGT +TRL    Y++LR AF      R       ++L DTCYD
Sbjct: 153 QSVFA-----AAGTIVDSGTVITRLPPAAYSSLRSAFAASMAARGYKRAPALSLLDTCYD 207

Query: 171 FSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSS--LSIIGNVQQ 228
            +  S V +PTVS  F  G  L + A   +I   S    C  FA   ++  ++I+GN Q 
Sbjct: 208 LTGASEVAIPTVSLLFQGGVSLDVDASG-IIYAASVSQACLGFAGNEAADDVAIVGNTQL 266

Query: 229 QGTRVSFNLRNSLIGFTPNKC 249
           +   V +++ + ++GF P  C
Sbjct: 267 KTFGVVYDIASKVVGFCPGAC 287


>gi|359485189|ref|XP_002279141.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 546

 Score =  145 bits (366), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 95/270 (35%), Positives = 143/270 (52%), Gaps = 21/270 (7%)

Query: 1   GDFVTETVTLGSA---------SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN 51
           GDF  ET T+             V+N+  GCGH N GLF GAAGLLGLG G LSF SQ+ 
Sbjct: 274 GDFALETFTVNLTMSSGKPELRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQ 333

Query: 52  A---STFSYCLVDRDSDSTSTLEFDSSLPPNAVTAPLL--------RNHELDTFYYLGLT 100
           +    +FSYCLVDR+SD+  + +       + ++ P L        + + +DTFYY+ + 
Sbjct: 334 SLYGHSFSYCLVDRNSDANVSSKLIFGEDKDLLSHPELNFTTLVAGKENPVDTFYYVQIK 393

Query: 101 GISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTD 160
            I VGG+++ I E  ++I   G+GG I+DSGT ++      Y  +++AF+   +      
Sbjct: 394 SIVVGGEVVNIPEEKWQIATDGSGGTIIDSGTTLSYFAEPAYQVIKEAFMAKVKGYPVVK 453

Query: 161 GVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT-SSS 219
              + + CY+ +     ++P     F +G V   P +NY I ++     C A   T  S+
Sbjct: 454 DFPVLEPCYNVTGVEQPDLPDFGIVFSDGAVWNFPVENYFIEIEPREVVCLAILGTPPSA 513

Query: 220 LSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           LSIIGN QQQ   + ++ + S +GF P KC
Sbjct: 514 LSIIGNYQQQNFHILYDTKKSRLGFAPTKC 543


>gi|326520736|dbj|BAJ92731.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 499

 Score =  144 bits (364), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 96/257 (37%), Positives = 130/257 (50%), Gaps = 16/257 (6%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQI---NASTFSY 57
           G F  +T+T+   ++     GCG  N GLF   AGL+GLG G  S   Q        F+Y
Sbjct: 251 GFFAQDTLTIAHDAIKGFRFGCGEKNNGLFGKTAGLMGLGRGKTSLTVQAYNKYGGAFAY 310

Query: 58  CLVDRDSDSTSTLEFD-SSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
           CL    +  T  L+F   S   NA   P+L + +  TFYY+G+TGI VGG  +P++E+ F
Sbjct: 311 CLPAL-TTGTGYLDFGPGSAGNNARLTPMLTD-KGQTFYYVGMTGIRVGGQQVPVAESVF 368

Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAF--VRGTRALSPTDGVALFDTCYDFSSR 174
                   G +VDSGT +TRL    Y AL  AF  V   R      G ++ DTCYDF+  
Sbjct: 369 S-----TAGTLVDSGTVITRLPATAYTALSSAFDKVMLARGYKKAPGYSILDTCYDFTGL 423

Query: 175 SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS--SSLSIIGNVQQQGTR 232
           S VE+PTVS  F  G  L +     +  + S    C AFA      S++I+GN QQ+   
Sbjct: 424 SDVELPTVSLVFQGGACLDVDVSGIVYAI-SEAQVCLAFASNGDDESVAIVGNTQQKTYG 482

Query: 233 VSFNLRNSLIGFTPNKC 249
           V ++L    +GF P  C
Sbjct: 483 VLYDLGKKTVGFAPGSC 499


>gi|326491519|dbj|BAJ94237.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326524456|dbj|BAK00611.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 499

 Score =  144 bits (364), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 96/257 (37%), Positives = 130/257 (50%), Gaps = 16/257 (6%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQI---NASTFSY 57
           G F  +T+T+   ++     GCG  N GLF   AGL+GLG G  S   Q        F+Y
Sbjct: 251 GFFAQDTLTIAHDAIKGFRFGCGEKNNGLFGKTAGLMGLGRGKTSLTVQAYNKYGGAFAY 310

Query: 58  CLVDRDSDSTSTLEFD-SSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
           CL    +  T  L+F   S   NA   P+L + +  TFYY+G+TGI VGG  +P++E+ F
Sbjct: 311 CLPAL-TTGTGYLDFGPGSAGNNARLTPMLTD-KGQTFYYVGMTGIRVGGQQVPVAESVF 368

Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAF--VRGTRALSPTDGVALFDTCYDFSSR 174
                   G +VDSGT +TRL    Y AL  AF  V   R      G ++ DTCYDF+  
Sbjct: 369 S-----TAGTLVDSGTVITRLPATAYTALSSAFDKVMLARGYKKAPGYSILDTCYDFTGL 423

Query: 175 SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS--SSLSIIGNVQQQGTR 232
           S VE+PTVS  F  G  L +     +  + S    C AFA      S++I+GN QQ+   
Sbjct: 424 SDVELPTVSLVFQGGACLDVDVSGIVYAI-SEAQVCLAFASNGDDESVAIVGNTQQKTYG 482

Query: 233 VSFNLRNSLIGFTPNKC 249
           V ++L    +GF P  C
Sbjct: 483 VLYDLGKKTVGFAPGSC 499


>gi|356524287|ref|XP_003530761.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 481

 Score =  144 bits (363), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 100/243 (41%), Positives = 124/243 (51%), Gaps = 15/243 (6%)

Query: 15  VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ---INASTFSYCLVDRDSDSTSTLE 71
           VD+   GCG +NEGLF G+AGL+GLG   +SF  Q   I    FSYCL    S S   L 
Sbjct: 245 VDDFLFGCGQDNEGLFSGSAGLIGLGRHPISFVQQTSSIYNKIFSYCL-PSTSSSLGHLT 303

Query: 72  FDSSLPPNA--VTAPLLRNHELDTFYYLGLTGISVGGDLLP-ISETAFKIDESGNGGIIV 128
           F +S   NA     PL      +TFY L + GISVGG  LP +S + F       GG I+
Sbjct: 304 FGASAATNANLKYTPLSTISGDNTFYGLDIVGISVGGTKLPAVSSSTFSA-----GGSII 358

Query: 129 DSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPE 188
           DSGT +TRL    Y ALR AF +G       +   LFDTCYDFS    + VP + F F  
Sbjct: 359 DSGTVITRLAPTAYAALRSAFRQGMEKYPVANEDGLFDTCYDFSGYKEISVPKIDFEFAG 418

Query: 189 GKVLPLPAKNYLIPVDSNGTFCFAFAP--TSSSLSIIGNVQQQGTRVSFNLRNSLIGFTP 246
           G  + LP    LI   S    C AFA     + ++I GNVQQ+   V +++    IGF  
Sbjct: 419 GVTVELPLVGILIG-RSAQQVCLAFAANGNDNDITIFGNVQQKTLEVVYDVEGGRIGFGA 477

Query: 247 NKC 249
             C
Sbjct: 478 AGC 480


>gi|168000296|ref|XP_001752852.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162696015|gb|EDQ82356.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 384

 Score =  144 bits (363), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 104/263 (39%), Positives = 141/263 (53%), Gaps = 20/263 (7%)

Query: 1   GDFVTETVTL----GSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN---AS 53
           GD   ET++L    G+ SV N A GCG  N G F GAAGL+GLG G LS  SQ++   A+
Sbjct: 128 GDLAFETISLNNGAGTQSVPNFAFGCGTQNLGTFAGAAGLVGLGQGPLSLNSQLSHTFAN 187

Query: 54  TFSYCLVDRDSDSTSTLEFDS-SLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPIS 112
            FSYCLV  +S S S L F S +   N     ++ N    T+YY+ L  I VGG  L ++
Sbjct: 188 KFSYCLVSLNSLSASPLTFGSIAAAANIQYTSIVVNARHPTYYYVQLNSIEVGGQPLNLA 247

Query: 113 ETAFKIDES-GNGGIIVDSGTAVTRLQTETYNAL---RDAFVRGTRALSPTDGVAL-FDT 167
            + F ID+S G GG I+DSGT +T L    Y+A+    ++FV   R     DG A   D 
Sbjct: 248 PSVFAIDQSTGRGGTIIDSGTTITMLTLPAYSAVLRAYESFVNYPR----LDGSAYGLDL 303

Query: 168 CYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNG-TFCFAFAPTSSSLSIIGNV 226
           C++ +  S+  VP + F F +G    +  +N  + VD++  T C A    S   SIIGN+
Sbjct: 304 CFNIAGVSNPSVPDMVFKF-QGADFQMRGENLFVLVDTSATTLCLAMG-GSQGFSIIGNI 361

Query: 227 QQQGTRVSFNLRNSLIGFTPNKC 249
           QQQ   V ++L    IGF    C
Sbjct: 362 QQQNHLVVYDLEAKKIGFATADC 384


>gi|225217022|gb|ACN85307.1| aspartic proteinase nepenthesin-1 precursor [Oryza ridleyi]
          Length = 525

 Score =  144 bits (362), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 101/260 (38%), Positives = 132/260 (50%), Gaps = 19/260 (7%)

Query: 1   GDFVTETVTLGSA-SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQI---NASTFS 56
           G F  +T+TL S  ++     GCG  NEGLF  AAGLLGLG G  S P Q        F+
Sbjct: 274 GFFAMDTLTLSSYDAIKGFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQAYDKYGGVFA 333

Query: 57  YCLVDRDSDSTSTLEFDSSLPP---NAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISE 113
           +C   R S  T  L+F     P     +T P+L ++ L TFYY+GLTGI VGG LL I  
Sbjct: 334 HCFPAR-SSGTGYLDFGPGSSPAVSTKLTTPMLVDNGL-TFYYVGLTGIRVGGKLLSIPP 391

Query: 114 TAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR--GTRALSPTDGVALFDTCYDF 171
           + F        G IVDSGT +TRL    Y++LR AF      R       ++L DTCYDF
Sbjct: 392 SVFT-----TAGTIVDSGTVITRLPPAAYSSLRSAFASAIAARGYKKAPALSLLDTCYDF 446

Query: 172 SSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS--SLSIIGNVQQQ 229
           +  S V +PTVS  F  G  L + A   +I   S    C  FA       + I+GN Q +
Sbjct: 447 TGMSQVAIPTVSLLFQGGASLDVDASG-IIYAASVSQACLGFAANEEDDDVGIVGNTQLK 505

Query: 230 GTRVSFNLRNSLIGFTPNKC 249
              V +++   ++GF+P  C
Sbjct: 506 TFGVVYDIGKKVVGFSPGAC 525


>gi|226492465|ref|NP_001150925.1| aspartic proteinase nepenthesin-1 [Zea mays]
 gi|195642996|gb|ACG40966.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
          Length = 472

 Score =  144 bits (362), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 97/265 (36%), Positives = 141/265 (53%), Gaps = 24/265 (9%)

Query: 5   TETVTLGSASVDN-----IAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
           +ET T GS++ D      +A GC + +   + G+AGL+GLG GSLS  SQ+ A  FSYCL
Sbjct: 209 SETFTFGSSAADQARVPGVAFGCSNASSSDWNGSAGLVGLGRGSLSLVSQLGAGRFSYCL 268

Query: 60  VD-RDSDSTSTLEFDSSLPPNAV---TAPLLRN---HELDTFYYLGLTGISVGGDLLPIS 112
              +D++STSTL    S   N     + P + +     + T+YYL LTGIS+G   LPIS
Sbjct: 269 TPFQDTNSTSTLLLGPSAALNGTGVRSTPFVASPARAPMSTYYYLNLTGISLGAKALPIS 328

Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPT----DGVALFDTC 168
             AF +   G GG+I+DSGT +T L    Y  +R A         PT    D   L D C
Sbjct: 329 PGAFSLKPDGTGGLIIDSGTTITSLANAAYQQVRAAVKSQLVTTLPTVDGSDSTGL-DLC 387

Query: 169 YDFSSRSSVE---VPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFA-PTSSSLSIIG 224
           +   + +S     +P+++ HF +G  + LPA +Y+I    +G +C A    T  ++S  G
Sbjct: 388 FALPAPTSAPPAVLPSMTLHF-DGADMVLPADSYMI--SGSGVWCLAMRNQTDGAMSTFG 444

Query: 225 NVQQQGTRVSFNLRNSLIGFTPNKC 249
           N QQQ   + +++R   + F P KC
Sbjct: 445 NYQQQNMHILYDVREETLSFAPAKC 469


>gi|297737850|emb|CBI27051.3| unnamed protein product [Vitis vinifera]
          Length = 256

 Score =  144 bits (362), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 86/115 (74%), Positives = 103/115 (89%), Gaps = 1/115 (0%)

Query: 1   GDFVTETVTL-GSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
           GDF TET+TL GSAS++N+AIGCGH+NEGLFVGAAGLLGLGGGSLSFPSQINAS+FSYCL
Sbjct: 140 GDFATETITLDGSASLNNVAIGCGHDNEGLFVGAAGLLGLGGGSLSFPSQINASSFSYCL 199

Query: 60  VDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISET 114
           V+RD+DS STLEF+S +P ++VTAPLLRN++LDTFYYLG+TGI     +L I+ T
Sbjct: 200 VNRDTDSASTLEFNSPIPSHSVTAPLLRNNQLDTFYYLGMTGIGESYKILQITCT 254


>gi|356499344|ref|XP_003518501.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 561

 Score =  143 bits (361), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 93/269 (34%), Positives = 140/269 (52%), Gaps = 20/269 (7%)

Query: 1   GDFVTETVTLGSAS---------VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN 51
           GDF  ET T+   +         V+N+  GCGH N GLF GAAGLLGLG G LSF SQ+ 
Sbjct: 290 GDFALETFTVNLTTPNGTSELKHVENVMFGCGHWNRGLFHGAAGLLGLGKGPLSFASQMQ 349

Query: 52  A---STFSYCLVDRDSDSTSTLEFDSSLPPNAVTAPLL--------RNHELDTFYYLGLT 100
           +    +FSYCLVDR+S+++ + +         ++ P L        ++  +DTFYY+ + 
Sbjct: 350 SLYGQSFSYCLVDRNSNASVSSKLIFGEDKELLSHPNLNFTSFGGGKDGSVDTFYYVQIK 409

Query: 101 GISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTD 160
            + V  ++L I E  + +   G GG I+DSGT +T      Y  +++AFVR  +     +
Sbjct: 410 SVMVDDEVLKIPEETWHLSSEGAGGTIIDSGTTLTYFAEPAYEIIKEAFVRKIKGYQLVE 469

Query: 161 GVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSL 220
           G+     CY+ S    +E+P     F +  V   P +NY I +D             S+L
Sbjct: 470 GLPPLKPCYNVSGIEKMELPDFGILFADEAVWNFPVENYFIWIDPEVVCLAILGNPRSAL 529

Query: 221 SIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           SIIGN QQQ   + ++++ S +G+ P KC
Sbjct: 530 SIIGNYQQQNFHILYDMKKSRLGYAPMKC 558


>gi|293329689|dbj|BAJ04354.1| pollen allergen CPA63 [Cryptomeria japonica]
          Length = 472

 Score =  143 bits (360), Expect = 8e-32,   Method: Compositional matrix adjust.
 Identities = 101/259 (38%), Positives = 132/259 (50%), Gaps = 18/259 (6%)

Query: 5   TETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ---INASTFSYCLVD 61
           +ET+++GS  V+N   GC +   GL      L+G G   LSF SQ   +  STFSYCL  
Sbjct: 216 SETLSVGSQQVENFVFGCSNAARGLIQRTPSLVGFGRNPLSFVSQTATLYDSTFSYCLPS 275

Query: 62  RDSDS--TSTLEFDSSLPPNAVT-APLLRNHELDTFYYLGLTGISVGGDLLPISETAFKI 118
             S +   S L    +L    +   PLL N    +FYY+GL GISVG +L+ I      +
Sbjct: 276 LFSSAFTGSLLLGKEALSAQGLKFTPLLSNSRYPSFYYVGLNGISVGEELVSIPAGTLSL 335

Query: 119 DESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRAL---SPTDGVALFDTCYDFSSRS 175
           DES   G I+DSGT +TRL    YNA+RD+F      L   SPTD   LFDTCY+  S  
Sbjct: 336 DESTGRGTIIDSGTVITRLVEPAYNAMRDSFRSQLSNLTMASPTD---LFDTCYNRPS-G 391

Query: 176 SVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGT-FCFAFAPT----SSSLSIIGNVQQQG 230
            VE P ++ HF +   L LP  N L P + +G+  C AF          LS  GN QQQ 
Sbjct: 392 DVEFPLITLHFDDNLDLTLPLDNILYPGNDDGSVLCLAFGLPPGGGDDVLSTFGNYQQQK 451

Query: 231 TRVSFNLRNSLIGFTPNKC 249
            R+  ++  S +G     C
Sbjct: 452 LRIVHDVAESRLGIASENC 470


>gi|242066168|ref|XP_002454373.1| hypothetical protein SORBIDRAFT_04g029630 [Sorghum bicolor]
 gi|241934204|gb|EES07349.1| hypothetical protein SORBIDRAFT_04g029630 [Sorghum bicolor]
          Length = 458

 Score =  142 bits (357), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 96/252 (38%), Positives = 132/252 (52%), Gaps = 12/252 (4%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
           G    +TV+ GS SV N   GCG +NEGLF  +AGL+GL    LS   Q+  S   +FSY
Sbjct: 215 GYLSKDTVSFGSTSVPNFYYGCGQDNEGLFGQSAGLIGLARNKLSLLYQLAPSMGYSFSY 274

Query: 58  CLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFK 117
           CL    S S        + P      P+ ++   D+ Y++ +TGI+V G  L +S +A+ 
Sbjct: 275 CLPTSSSSSGYLSIGSYN-PGQYSYTPMAKSSLDDSLYFIKMTGITVAGKPLSVSASAYS 333

Query: 118 IDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSV 177
              +     I+DSGT +TRL T+ Y+AL  A     +        ++ DTC+     S +
Sbjct: 334 SLPT-----IIDSGTVITRLPTDVYSALSKAVAGAMKGTPRASAFSILDTCFQ-GQASRL 387

Query: 178 EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNL 237
            VP VS  F  G  L L A N L+ VDS  T C AFAP  S+ +IIGN QQQ   V +++
Sbjct: 388 RVPQVSMAFAGGAALKLKATNLLVDVDS-ATTCLAFAPARSA-AIIGNTQQQTFSVVYDV 445

Query: 238 RNSLIGFTPNKC 249
           +NS IGF    C
Sbjct: 446 KNSKIGFAAGGC 457


>gi|225217008|gb|ACN85295.1| aspartic proteinase nepenthesin-1 precursor [Oryza coarctata]
          Length = 500

 Score =  142 bits (357), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 103/261 (39%), Positives = 133/261 (50%), Gaps = 21/261 (8%)

Query: 1   GDFVTETVTLGSA-SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQI---NASTFS 56
           G F  +T+TL S  ++     GCG  NEGL+  AAGLLGLG G  S P Q        F+
Sbjct: 249 GFFAMDTLTLSSYDAIKGFRFGCGERNEGLYGEAAGLLGLGRGKTSLPVQAYDKYGGVFA 308

Query: 57  YCLVDRDSDSTSTLEFD-SSLPPNAVTAPLLRNHELD---TFYYLGLTGISVGGDLLPIS 112
           +C   R S  T  L+F   SLP  AV+A L     +D   TFYY+GLTGI VGG LL I 
Sbjct: 309 HCFPAR-SSGTGYLDFGPGSLP--AVSAKLTTPMLVDNGPTFYYVGLTGIRVGGKLLSIP 365

Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGT--RALSPTDGVALFDTCYD 170
           ++ F        G IVDSGT +TRL    Y++LR AF      R       ++L DTCYD
Sbjct: 366 QSVFTTS-----GTIVDSGTVITRLPPAAYSSLRSAFASAMAERGYKKAPALSLLDTCYD 420

Query: 171 FSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFA--PTSSSLSIIGNVQQ 228
           F+  S V +PTVS  F  G  L + A   +I   S    C  FA       + I+GN Q 
Sbjct: 421 FTGMSEVAIPTVSLLFQGGASLDVHASG-IIYAASVSQACLGFAGNKEDDDVGIVGNTQL 479

Query: 229 QGTRVSFNLRNSLIGFTPNKC 249
           +   V +++   ++GF P  C
Sbjct: 480 KTFGVVYDIGKKVVGFCPGAC 500


>gi|242046218|ref|XP_002460980.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
 gi|241924357|gb|EER97501.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
          Length = 517

 Score =  142 bits (357), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 110/270 (40%), Positives = 143/270 (52%), Gaps = 21/270 (7%)

Query: 1   GDFVTETVTL------GSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINA-- 52
           GD   E+ T+       S  VD++  GCGH N GLF GAAGLLGLG G LSF SQ+ A  
Sbjct: 245 GDLALESFTVNLTAPGASRRVDDVVFGCGHWNRGLFHGAAGLLGLGRGPLSFASQLRAVY 304

Query: 53  -STFSYCLVDRDSDSTSTLEFDSSL--------PPNAVTAPLLRNHELDTFYYLGLTGIS 103
             TFSYCLVD  SD  S + F            P    TA    +   DTFYY+ L G+ 
Sbjct: 305 GHTFSYCLVDHGSDVASKVVFGEDDALALAAAHPQLNYTAFAPASSPADTFYYVKLKGVL 364

Query: 104 VGGDLLPISETAF--KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFV-RGTRALSPTD 160
           VGG+LL IS   +     E G+GG I+DSGT ++      Y  +R AF+ R  R+     
Sbjct: 365 VGGELLNISSDTWGVGEGEGGSGGTIIDSGTTLSYFVEPAYQVIRQAFIDRMGRSYPLIP 424

Query: 161 GVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSS- 219
              +   CY+ S     EVP +S  F +G V   PA+NY I +D +G  C A   T  + 
Sbjct: 425 DFPVLSPCYNVSGVDRPEVPELSLLFADGAVWDFPAENYFIRLDPDGIMCLAVLGTPRTG 484

Query: 220 LSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           +SIIGN QQQ   V ++L+N+ +GF P +C
Sbjct: 485 MSIIGNFQQQNFHVVYDLKNNRLGFAPRRC 514


>gi|356527089|ref|XP_003532146.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 488

 Score =  141 bits (356), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 91/228 (39%), Positives = 117/228 (51%), Gaps = 12/228 (5%)

Query: 15  VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINA---STFSYCLVDRDSDSTSTLE 71
           VDN   GCG NN+GLF G+AGL+GLG   +SF  Q  A     FSYCL    S ST  L 
Sbjct: 255 VDNFLFGCGQNNQGLFGGSAGLIGLGRHPISFVQQTAAVYRKIFSYCL-PATSSSTGRLS 313

Query: 72  FDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSG 131
           F ++        P        +FY L +TGISVGG  LP+S + F       GG I+DSG
Sbjct: 314 FGTTTTSYVKYTPFSTISRGSSFYGLDITGISVGGAKLPVSSSTFS-----TGGAIIDSG 368

Query: 132 TAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKV 191
           T +TRL    Y ALR AF +G         +++ DTCYD S      +P + F F  G  
Sbjct: 369 TVITRLPPTAYTALRSAFRQGMSKYPSAGELSILDTCYDLSGYEVFSIPKIDFSFAGGVT 428

Query: 192 LPLPAKNYLIPVDSNGTFCFAFAPT--SSSLSIIGNVQQQGTRVSFNL 237
           + LP +  L  V S    C AFA     S ++I GNVQQ+   V +++
Sbjct: 429 VQLPPQGILY-VASAKQVCLAFAANGDDSDVTIYGNVQQKTIEVVYDV 475


>gi|293335828|ref|NP_001170221.1| uncharacterized protein LOC100384173 precursor [Zea mays]
 gi|224034427|gb|ACN36289.1| unknown [Zea mays]
          Length = 443

 Score =  141 bits (355), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 106/267 (39%), Positives = 140/267 (52%), Gaps = 21/267 (7%)

Query: 1   GDFVTETVTLGS----ASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFS 56
           G    ET T G+     S+  I+ GCG+ N GL    +G++G G GSLS  SQ+ +  FS
Sbjct: 177 GVLANETFTFGTNETRVSLPGISFGCGNLNAGLLANGSGMVGFGRGSLSLVSQLGSPRFS 236

Query: 57  YCLVDRDSDSTSTLEFDSSLPPNAVTA--------PLLRNHELDTFYYLGLTGISVGGDL 108
           YCL    S   S L F      N+  A        P + N  L T Y+L +TGISVGG L
Sbjct: 237 YCLTSFLSPVPSRLYFGVYATLNSTNASSEPVQSTPFVVNPALPTMYFLNMTGISVGGYL 296

Query: 109 LPISETAFKI-DESGNGGIIVDSGTAVTRLQTETYNALRDAFV-RGTRALSPTDGVALFD 166
           LPI    F I D  G GG I+DSGT +T L    Y+A+R AF  + T  L      ++ D
Sbjct: 297 LPIDPAVFAINDTDGTGGTIIDSGTTITYLAEPAYDAVRAAFASQITLPLLNVTDASVLD 356

Query: 167 TCYDF--SSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVD--SNGTFCFAFAPTSSSLSI 222
           TC+ +    R SV +P +  HF +G    LP +NY++ VD  + G  C A A +SS  SI
Sbjct: 357 TCFQWPPPPRQSVTLPQLVLHF-DGADWELPLQNYML-VDPSTGGGLCLAMA-SSSDGSI 413

Query: 223 IGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           IG+ Q Q   V ++L NSL+ F P  C
Sbjct: 414 IGSYQHQNFNVLYDLENSLMSFVPAPC 440


>gi|115452683|ref|NP_001049942.1| Os03g0317300 [Oryza sativa Japonica Group]
 gi|108707833|gb|ABF95628.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113548413|dbj|BAF11856.1| Os03g0317300 [Oryza sativa Japonica Group]
          Length = 448

 Score =  141 bits (355), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 90/264 (34%), Positives = 126/264 (47%), Gaps = 21/264 (7%)

Query: 3   FVTETVTLGSASVDNIAIGCGHNNEGLFVG-AAGLLGLGGGSLSFPSQINASTFSYCLVD 61
           F +    +G ASV ++  GCG  N G+FV    G+ G   G+LS P+Q+    FSYC   
Sbjct: 186 FASADHAIGGASVPDLTFGCGLFNNGIFVSNETGIAGFSRGALSMPAQLKVDNFSYCFTA 245

Query: 62  RDSDSTSTLEFDSSLPPN------------AVTAPLLRNHELD-TFYYLGLTGISVGGDL 108
                 S +     +PPN              +  L+R H      YY+ L G++VG   
Sbjct: 246 ITGSEPSPVFL--GVPPNLYSDAAGGGHGVVQSTALIRYHSSQLKAYYISLKGVTVGTTR 303

Query: 109 LPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTC 168
           LPI E+ F + E G GG IVDSGT +T L    YN + DAFV  T+        +L   C
Sbjct: 304 LPIPESVFALKEDGTGGTIVDSGTGMTMLPEAVYNLVCDAFVAQTKLTVHNSTSSLSQLC 363

Query: 169 YDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTF---CFAFAPTSSSLSIIGN 225
           +     +  +VP +  HF EG  L LP +NY+  ++  G     C A       LS+IGN
Sbjct: 364 FSVPPGAKPDVPALVLHF-EGATLDLPRENYMFEIEEAGGIRLTCLAIN-AGEDLSVIGN 421

Query: 226 VQQQGTRVSFNLRNSLIGFTPNKC 249
            QQQ   V ++L N ++ F P +C
Sbjct: 422 FQQQNMHVLYDLANDMLSFVPARC 445


>gi|242093566|ref|XP_002437273.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
 gi|241915496|gb|EER88640.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
          Length = 503

 Score =  140 bits (354), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 97/259 (37%), Positives = 133/259 (51%), Gaps = 18/259 (6%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQI---NASTFSY 57
           G +  +T+TLG  +V +   GCG  N GLF  AAGL+GLG G  S P Q     +  F+Y
Sbjct: 253 GFYAQDTLTLGYDTVKDFRFGCGEKNRGLFGKAAGLMGLGRGKTSVPVQAYDKYSGVFAY 312

Query: 58  CLVDRDSDSTSTLEFDSSLPPNAVT--APLLRNHELDTFYYLGLTGISVGGDLLPISETA 115
           C +   S  T  L+F    P  A     P+L ++   TFYY+G+TGI VGG LL I  T 
Sbjct: 313 C-IPATSSGTGFLDFGPGAPAAANARLTPMLVDNG-PTFYYVGMTGIKVGGHLLSIPATV 370

Query: 116 FKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRAL--SPTDGVALFDTCYDFSS 173
           F      + G +VDSGT +TRL    Y  LR AF +G   L        ++ DTCYD + 
Sbjct: 371 FS-----DAGALVDSGTVITRLPPSAYEPLRSAFAKGMEGLGYKTAPAFSILDTCYDLTG 425

Query: 174 -RSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT--SSSLSIIGNVQQQG 230
            + S+ +P VS  F  G  L + A   L   D +   C AFA     + ++I+GN QQ+ 
Sbjct: 426 YQGSIALPAVSLVFQGGACLDVDASGILYVADVSQA-CLAFAANDDDTDMTIVGNTQQKT 484

Query: 231 TRVSFNLRNSLIGFTPNKC 249
             V ++L   ++GF P  C
Sbjct: 485 YSVLYDLGKKVVGFAPGAC 503


>gi|125586057|gb|EAZ26721.1| hypothetical protein OsJ_10629 [Oryza sativa Japonica Group]
          Length = 474

 Score =  140 bits (354), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 90/264 (34%), Positives = 126/264 (47%), Gaps = 21/264 (7%)

Query: 3   FVTETVTLGSASVDNIAIGCGHNNEGLFVG-AAGLLGLGGGSLSFPSQINASTFSYCLVD 61
           F +    +G ASV ++  GCG  N G+FV    G+ G   G+LS P+Q+    FSYC   
Sbjct: 212 FASADHAIGGASVPDLTFGCGLFNNGIFVSNETGIAGFSRGALSMPAQLKVDNFSYCFTA 271

Query: 62  RDSDSTSTLEFDSSLPPN------------AVTAPLLRNHELD-TFYYLGLTGISVGGDL 108
                 S +     +PPN              +  L+R H      YY+ L G++VG   
Sbjct: 272 ITGSEPSPVFL--GVPPNLYSDAAGGGHGVVQSTALIRYHSSQLKAYYISLKGVTVGTTR 329

Query: 109 LPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTC 168
           LPI E+ F + E G GG IVDSGT +T L    YN + DAFV  T+        +L   C
Sbjct: 330 LPIPESVFALKEDGTGGTIVDSGTGMTMLPEAVYNLVCDAFVAQTKLTVHNSTSSLSQLC 389

Query: 169 YDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTF---CFAFAPTSSSLSIIGN 225
           +     +  +VP +  HF EG  L LP +NY+  ++  G     C A       LS+IGN
Sbjct: 390 FSVPPGAKPDVPALVLHF-EGATLDLPRENYMFEIEEAGGIRLTCLAIN-AGEDLSVIGN 447

Query: 226 VQQQGTRVSFNLRNSLIGFTPNKC 249
            QQQ   V ++L N ++ F P +C
Sbjct: 448 FQQQNMHVLYDLANDMLSFVPARC 471


>gi|125543634|gb|EAY89773.1| hypothetical protein OsI_11315 [Oryza sativa Indica Group]
          Length = 474

 Score =  140 bits (354), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 90/264 (34%), Positives = 126/264 (47%), Gaps = 21/264 (7%)

Query: 3   FVTETVTLGSASVDNIAIGCGHNNEGLFVG-AAGLLGLGGGSLSFPSQINASTFSYCLVD 61
           F +    +G ASV ++  GCG  N G+FV    G+ G   G+LS P+Q+    FSYC   
Sbjct: 212 FASADHAIGGASVPDLTFGCGLFNNGIFVSNETGIAGFSRGALSMPAQLKVDNFSYCFTA 271

Query: 62  RDSDSTSTLEFDSSLPPN------------AVTAPLLRNHELD-TFYYLGLTGISVGGDL 108
                 S +     +PPN              +  L+R H      YY+ L G++VG   
Sbjct: 272 ITGSEPSPVFL--GVPPNLYSDAAGGGHGVVQSTALIRYHSSQLKAYYISLKGVTVGTTR 329

Query: 109 LPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTC 168
           LPI E+ F + E G GG IVDSGT +T L    YN + DAFV  T+        +L   C
Sbjct: 330 LPIPESVFALKEDGTGGTIVDSGTGMTMLPEAVYNLVCDAFVAQTKLTVHNSTSSLSQLC 389

Query: 169 YDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTF---CFAFAPTSSSLSIIGN 225
           +     +  +VP +  HF EG  L LP +NY+  ++  G     C A       LS+IGN
Sbjct: 390 FSVPPGAKPDVPALVLHF-EGATLDLPRENYMFEIEEAGGIRLTCLAIN-AGEDLSVIGN 447

Query: 226 VQQQGTRVSFNLRNSLIGFTPNKC 249
            QQQ   V ++L N ++ F P +C
Sbjct: 448 FQQQNMHVLYDLANDMLSFVPARC 471


>gi|125543639|gb|EAY89778.1| hypothetical protein OsI_11320 [Oryza sativa Indica Group]
          Length = 488

 Score =  140 bits (354), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 90/248 (36%), Positives = 132/248 (53%), Gaps = 14/248 (5%)

Query: 13  ASVDNIAIGCGHNNEGLFV-GAAGLLGLGGGSLSFPSQINASTFSYCL--VDRDSDSTST 69
           ASV  +A GCG  N G+F     G+ G G G LS PSQ+    FS+C   V+    ST  
Sbjct: 241 ASVPGVAFGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKVGNFSHCFTAVNGLKQSTVL 300

Query: 70  LEFDSSLPPNAVTA----PLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGG 125
           L+  + L  N   A    PL++N    T YYL L GI+VG   LP+ E+AF +  +G GG
Sbjct: 301 LDLLADLYKNGRGAVQSTPLIQNSANPTLYYLSLKGITVGSTRLPVPESAFAL-TNGTGG 359

Query: 126 IIVDSGTAVTRLQTETYNALRDAFVRGTR-ALSPTDGVALFDTCYDFSSRSSVEVPTVSF 184
            I+DSGT++T L  + Y  +RD F    +  + P +    + TC+   S++  +VP +  
Sbjct: 360 TIIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGNATGPY-TCFSAPSQAKPDVPKLVL 418

Query: 185 HFPEGKVLPLPAKNYL--IPVDS-NGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNSL 241
           HF EG  + LP +NY+  +P D+ N   C A        + IGN QQQ   V ++L+N++
Sbjct: 419 HF-EGATMDLPRENYVFEVPDDAGNSMICLAINELGDERATIGNFQQQNMHVLYDLQNNM 477

Query: 242 IGFTPNKC 249
           + F   +C
Sbjct: 478 LSFVAAQC 485



 Score = 79.0 bits (193), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 50/136 (36%), Positives = 76/136 (55%), Gaps = 8/136 (5%)

Query: 98  GLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTR-AL 156
           G  GI+VG   LP+ E+AF +  +G GG I+DSGT++T L  + Y  +RD F    +  +
Sbjct: 38  GRPGITVGSTRLPVPESAFAL-TNGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQIKLPV 96

Query: 157 SPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYL--IPVDS-NGTFCFAF 213
            P +    + TC+   S++  +VP +  HF EG  + LP +NY+  +P D+ N   C A 
Sbjct: 97  VPGNATGPY-TCFSAPSQAKPDVPKLVLHF-EGATMDLPRENYVFEVPDDAGNSIICLAI 154

Query: 214 APTSSSLSIIGNVQQQ 229
                + +IIGN QQQ
Sbjct: 155 NKGDET-TIIGNFQQQ 169


>gi|115472517|ref|NP_001059857.1| Os07g0533600 [Oryza sativa Japonica Group]
 gi|22831047|dbj|BAC15910.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|50508280|dbj|BAD32129.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|113611393|dbj|BAF21771.1| Os07g0533600 [Oryza sativa Japonica Group]
 gi|215766673|dbj|BAG98901.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 441

 Score =  140 bits (354), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 91/258 (35%), Positives = 136/258 (52%), Gaps = 12/258 (4%)

Query: 1   GDFVTETVTLGS-ASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
           G   TET TLGS  +V  +A GCG  N G    ++GL+G+G G LS  SQ+  + FSYC 
Sbjct: 184 GVLATETFTLGSDTAVRGVAFGCGTENLGSTDNSSGLVGMGRGPLSLVSQLGVTRFSYCF 243

Query: 60  VDRDSDSTSTLEFDSS--LPPNAVTAPLLRN-----HELDTFYYLGLTGISVGGDLLPIS 112
              ++ + S L   SS  L   A T P + +         ++YYL L GI+VG  LLPI 
Sbjct: 244 TPFNATAASPLFLGSSARLSSAAKTTPFVPSPSGGARRRSSYYYLSLEGITVGDTLLPID 303

Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVAL-FDTCYDF 171
              F++   G+GG+I+DSGT  T L+   + AL  A     R L    G  L    C+  
Sbjct: 304 PAVFRLTPMGDGGVIIDSGTTFTALEERAFVALARALASRVR-LPLASGAHLGLSLCFAA 362

Query: 172 SSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGT 231
           +S  +VEVP +  HF +G  + L  ++Y++   S G  C     ++  +S++G++QQQ T
Sbjct: 363 ASPEAVEVPRLVLHF-DGADMELRRESYVVEDRSAGVACLGMV-SARGMSVLGSMQQQNT 420

Query: 232 RVSFNLRNSLIGFTPNKC 249
            + ++L   ++ F P KC
Sbjct: 421 HILYDLERGILSFEPAKC 438


>gi|225217056|gb|ACN85339.1| aspartic proteinase nepenthesin-1 precursor [Oryza granulata]
          Length = 521

 Score =  140 bits (354), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 99/260 (38%), Positives = 129/260 (49%), Gaps = 19/260 (7%)

Query: 1   GDFVTETVTLGSA-SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQI---NASTFS 56
           G F  +T+TL S  +V     GCG  NEGLF  AAGLLGLG G  S P Q        F+
Sbjct: 270 GFFAMDTLTLSSYDAVKGFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFA 329

Query: 57  YCLVDRDSDSTSTLEFDSSLPPNA---VTAPLLRNHELDTFYYLGLTGISVGGDLLPISE 113
           +CL  R S  T  L+F    P       T P+L ++   TFYY+G+TGI VGG LL I +
Sbjct: 330 HCLPAR-SSGTGYLDFGPGSPAAVGARQTTPMLTDNG-PTFYYVGMTGIRVGGQLLSIPQ 387

Query: 114 TAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAF--VRGTRALSPTDGVALFDTCYDF 171
           + F        G IVDSGT +TRL    Y++LR AF      R       ++L DTCYDF
Sbjct: 388 SVFS-----TAGTIVDSGTVITRLPPAAYSSLRSAFASAMAARGYKKAPALSLLDTCYDF 442

Query: 172 SSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS--SSLSIIGNVQQQ 229
           +  S V +P VS  F  G  L + A   +    S    C  FA       + I+GN Q +
Sbjct: 443 TGMSEVAIPKVSLLFQGGAYLDVNASGIMYAA-SLSQVCLGFAANEDDDDVGIVGNTQLK 501

Query: 230 GTRVSFNLRNSLIGFTPNKC 249
              V +++    +GF+P  C
Sbjct: 502 TFGVVYDIGKKTVGFSPGAC 521


>gi|224053042|ref|XP_002297678.1| predicted protein [Populus trichocarpa]
 gi|222844936|gb|EEE82483.1| predicted protein [Populus trichocarpa]
          Length = 482

 Score =  140 bits (354), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 94/262 (35%), Positives = 137/262 (52%), Gaps = 22/262 (8%)

Query: 1   GDFVTETVTLG-SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINA---STFS 56
           G+  TE + LG S +V+N   GCG NN+GLF GA+GL+GLG  SLS  SQ +A     FS
Sbjct: 227 GELGTEHLDLGNSTAVNNFIFGCGRNNQGLFGGASGLVGLGRSSLSLISQTSAMFGGVFS 286

Query: 57  YCLVDRDSDSTSTLEF--DSSLPPNAVTAPLLR---NHELDTFYYLGLTGISVGGDLLPI 111
           YCL   +++++ +L    +SS+  N       R   N +L  FY+L LTGI+VG      
Sbjct: 287 YCLPITETEASGSLVMGGNSSVYKNTTPISYTRMIPNPQL-PFYFLNLTGITVG------ 339

Query: 112 SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDF 171
              A +    G  G+++DSGT +TRL    Y AL+D FV+            + DTC++ 
Sbjct: 340 -SVAVQAPSFGKDGMMIDSGTVITRLPPSIYQALKDEFVKQFSGFPSAPAFMILDTCFNL 398

Query: 172 SSRSSVEVPTVSFHFPEGKVLPLPAKN--YLIPVDSNGTFCFAFAPTS--SSLSIIGNVQ 227
           S    VE+P +  HF     L +      Y +  D++   C A A  S  + + IIGN Q
Sbjct: 399 SGYQEVEIPNIKMHFEGNAELNVDVTGVFYFVKTDAS-QVCLAIASLSYENEVGIIGNYQ 457

Query: 228 QQGTRVSFNLRNSLIGFTPNKC 249
           Q+  RV ++ + S++GF    C
Sbjct: 458 QKNQRVIYDTKGSMLGFAAEAC 479


>gi|302141829|emb|CBI19032.3| unnamed protein product [Vitis vinifera]
          Length = 382

 Score =  140 bits (353), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 96/244 (39%), Positives = 134/244 (54%), Gaps = 10/244 (4%)

Query: 14  SVDNIAIGCGHNNEGLFV-GAAGLLGLGGGSLSFPSQINASTFSYCLVDRDSDSTSTLEF 72
           S+  I  GCG NN    +   AGLLGLG G LS  SQ+    FSYCL     + TS+L F
Sbjct: 138 SIPRIGFGCGVNNRATGMDQTAGLLGLGRGVLSLVSQLGTQKFSYCLTSIHENKTSSLLF 197

Query: 73  DSSL-----PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGII 127
            S       P      PL++N  L ++YYL L GI+VG  LLPI E AF++ + G+GG+I
Sbjct: 198 GSLAYSNFNPGKIPRTPLIQNPFLPSYYYLALKGITVGYTLLPIPEFAFQLGKDGSGGMI 257

Query: 128 VDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSS--VEVPTVSFH 185
           +DSGT +T LQ + ++ L++AF+  T            D C+    +++  V+VP + FH
Sbjct: 258 LDSGTTITYLQEDAFDVLKNAFISQTELQVANSSTTGLDLCFHLPVKNAAEVKVPKLIFH 317

Query: 186 FPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNSLIGFT 245
           F +G  L LP +NY++     G  C A   T  SLSI GN+QQQ   V  +L+ S +   
Sbjct: 318 F-KGLDLALPVENYMVSDPEMGLICLAIDAT-GSLSIFGNIQQQNMLVLHDLKKSTLSLV 375

Query: 246 PNKC 249
           P +C
Sbjct: 376 PTQC 379


>gi|326532056|dbj|BAK01404.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 506

 Score =  140 bits (353), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 105/264 (39%), Positives = 137/264 (51%), Gaps = 20/264 (7%)

Query: 1   GDFVTETVTL-----GSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN---- 51
           GD   E  T+     G+  VD +A GCGH N GLF GAAGLLGLG G LSF SQ+     
Sbjct: 245 GDLALEAFTVNLTQSGTRRVDGVAFGCGHRNRGLFHGAAGLLGLGRGPLSFASQLRGVYG 304

Query: 52  ASTFSYCLVDRDSDSTSTLEF--DSSL--PPNAVTAPLLRNHELDTFYYLGLTGISVGGD 107
              FSYCLV+  S + S + F  D +L   P           + DTFYYL L  I VGG+
Sbjct: 305 GHAFSYCLVEHGSAAGSKIIFGHDDALLAHPQLNYTAFAPTTDADTFYYLQLKSILVGGE 364

Query: 108 LLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFV-RGTRALSPTDGVALFD 166
            + IS      D    GG I+DSGT ++      Y A+R AF+ R + +     G  +  
Sbjct: 365 AVNISS-----DTLSAGGTIIDSGTTLSYFPEPAYQAIRQAFIDRMSPSYPLILGFPVLS 419

Query: 167 TCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT-SSSLSIIGN 225
            CY+ S    VEVP +S  F +G     PA+NY I ++  G  C A   T  S +SIIGN
Sbjct: 420 PCYNVSGAEKVEVPELSLVFADGAAWEFPAENYFIRLEPEGIMCLAVLGTPRSGMSIIGN 479

Query: 226 VQQQGTRVSFNLRNSLIGFTPNKC 249
            QQQ   V ++L ++ +GF P +C
Sbjct: 480 YQQQNFHVLYDLEHNRLGFAPRRC 503


>gi|167997964|ref|XP_001751688.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162696786|gb|EDQ83123.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 418

 Score =  140 bits (353), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 95/261 (36%), Positives = 137/261 (52%), Gaps = 15/261 (5%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN---ASTFSY 57
           G F  E+ T+    +D +A GCG +N+G F  A G+LGLG G LSF SQ+     + F+Y
Sbjct: 157 GVFAYESATVDGVRIDKVAFGCGSDNQGSFAAAGGVLGLGQGPLSFGSQVGYAYGNKFAY 216

Query: 58  CLV---DRDSDSTSTL---EFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPI 111
           CLV   D  S S+S +   E  S++     T P++ N +  T YY+ +  ++VGG  LPI
Sbjct: 217 CLVNYLDPTSVSSSLIFGDELISTIHDMQYT-PIVSNPKSPTLYYVQIEKVTVGGKSLPI 275

Query: 112 SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDF 171
           S++A++ID  GNGG I DSGT +T      Y+ +  AF  G       + V   D C + 
Sbjct: 276 SDSAWEIDLLGNGGSIFDSGTTLTYWFPSAYSHILAAFDSGVH-YPRAESVQGLDLCVEL 334

Query: 172 SSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSL---SIIGNVQQ 228
           +       P+ +  F +G V    A+NY + V  N   C A A  +S L   + IGN+ Q
Sbjct: 335 TGVDQPSFPSFTIEFDDGAVFQPEAENYFVDVAPN-VRCLAMAGLASPLGGFNTIGNLLQ 393

Query: 229 QGTRVSFNLRNSLIGFTPNKC 249
           Q   V ++   +LIGF P KC
Sbjct: 394 QNFFVQYDREENLIGFAPAKC 414


>gi|125558631|gb|EAZ04167.1| hypothetical protein OsI_26309 [Oryza sativa Indica Group]
          Length = 441

 Score =  140 bits (353), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 91/258 (35%), Positives = 136/258 (52%), Gaps = 12/258 (4%)

Query: 1   GDFVTETVTLGS-ASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
           G   TET TLGS  +V  +A GCG  N G    ++GL+G+G G LS  SQ+  + FSYC 
Sbjct: 184 GVLATETFTLGSDTAVRGVAFGCGTENLGSTDNSSGLVGMGRGPLSLVSQLGVTRFSYCF 243

Query: 60  VDRDSDSTSTLEFDSS--LPPNAVTAPLLRN-----HELDTFYYLGLTGISVGGDLLPIS 112
              ++ + S L   SS  L   A T P + +         ++YYL L GI+VG  LLPI 
Sbjct: 244 TPFNATAASPLFLGSSARLSSAAKTTPFVPSPSGGARRRSSYYYLSLEGITVGDTLLPID 303

Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVAL-FDTCYDF 171
              F++   G+GG+I+DSGT  T L+   + AL  A     R L    G  L    C+  
Sbjct: 304 PAVFRLTPMGDGGVIIDSGTTFTALEESAFVALARALASRVR-LPLASGAHLGLSLCFAA 362

Query: 172 SSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGT 231
           +S  +VEVP +  HF +G  + L  ++Y++   S G  C     ++  +S++G++QQQ T
Sbjct: 363 ASPEAVEVPRLVLHF-DGADMELRRESYVVEDRSAGVACLGMV-SARGMSVLGSMQQQNT 420

Query: 232 RVSFNLRNSLIGFTPNKC 249
            + ++L   ++ F P KC
Sbjct: 421 HILYDLERGILSFEPAKC 438


>gi|225216949|gb|ACN85242.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
          Length = 519

 Score =  140 bits (352), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 100/260 (38%), Positives = 133/260 (51%), Gaps = 19/260 (7%)

Query: 1   GDFVTETVTLGSA-SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQI---NASTFS 56
           G F  +T+TL S  +V     GCG  NEGLF  AAGLLGLG G  S P Q        F+
Sbjct: 268 GFFAMDTLTLSSYDAVKGFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFA 327

Query: 57  YCLVDRDSDSTSTLEFDSSLPPNA---VTAPLLRNHELDTFYYLGLTGISVGGDLLPISE 113
           +CL  R S  T  L+F +     A   +T P+L  +   TFYY+G+TGI VGG LL I +
Sbjct: 328 HCLPAR-STGTGYLDFGAGSLAAARARLTTPMLTENG-PTFYYVGMTGIRVGGQLLSIPQ 385

Query: 114 TAFKIDESGNGGIIVDSGTAVTRLQTETYNALR--DAFVRGTRALSPTDGVALFDTCYDF 171
           + F        G IVDSGT +TRL    Y++LR   A     R       V+L DTCYDF
Sbjct: 386 SVF-----ATAGTIVDSGTVITRLPPAAYSSLRYAFAAAMAARGYKKAPAVSLLDTCYDF 440

Query: 172 SSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS--SSLSIIGNVQQQ 229
           +  S V +PTVS  F  G  L + A   +    ++   C AFA       + I+GN Q +
Sbjct: 441 TGMSQVAIPTVSLLFQGGARLDVDASGIMYAASAS-QVCLAFAANEDGGDVGIVGNTQLK 499

Query: 230 GTRVSFNLRNSLIGFTPNKC 249
              V++++   ++GF P  C
Sbjct: 500 TFGVAYDIGKKVVGFYPGAC 519


>gi|302806531|ref|XP_002985015.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
 gi|300147225|gb|EFJ13890.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
          Length = 533

 Score =  140 bits (352), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 107/273 (39%), Positives = 152/273 (55%), Gaps = 31/273 (11%)

Query: 1   GDFVTETVTLG------SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS- 53
           GD   E++++       S  + ++ IGCGH+N+GLF GA GLLGLG G+LSFPSQ+ +S 
Sbjct: 265 GDLALESLSVSLSDHPSSLEIRDMVIGCGHSNKGLFQGAGGLLGLGQGALSFPSQLRSSP 324

Query: 54  ---TFSYCLVDRDSDSTSTLEFDSSLPPNAVTA-----------PLLR-NHELDTFYYLG 98
              +FSYCLVDR    T+ L   S++   A  A           P +R N+ ++TFYYLG
Sbjct: 325 IGQSFSYCLVDR----TNNLSVSSAISFGAGFALSRHFDQMRFTPFVRTNNSVETFYYLG 380

Query: 99  LTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSP 158
           + GI +  +LLPI    F I  +G+GG I+DSGT +T L  + Y A+  AF+   R   P
Sbjct: 381 IQGIKIDQELLPIPAERFAIAPNGSGGTIIDSGTTLTYLNRDAYRAVESAFL--ARISYP 438

Query: 159 -TDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLI-PVDSNGTFCFAFAPT 216
             D   +   CY+ + R++V  PT+S  F  G  L LP +NY I P       C A  PT
Sbjct: 439 RADPFDILGICYNATGRTAVPFPTLSIVFQNGAELDLPQENYFIQPDPQEAKHCLAILPT 498

Query: 217 SSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
              +SIIGN QQQ     ++++++ +GF    C
Sbjct: 499 -DGMSIIGNFQQQNIHFLYDVQHARLGFANTDC 530


>gi|255543383|ref|XP_002512754.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223547765|gb|EEF49257.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 414

 Score =  139 bits (351), Expect = 9e-31,   Method: Compositional matrix adjust.
 Identities = 89/261 (34%), Positives = 129/261 (49%), Gaps = 20/261 (7%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINA---STFSY 57
           GD   E + LG+  V N   GCG NN+GLF GA+GL+GLG   LS  SQ +A     FSY
Sbjct: 159 GDLGMEQLNLGTTHVSNFIFGCGRNNKGLFGGASGLMGLGKSDLSLVSQTSAIFEGVFSY 218

Query: 58  CLVDRDSDSTSTLEFDSSLPPNAVTAP-----LLRNHELDTFYYLGLTGISVGGDLLPIS 112
           CL    +D++ +L    +      T P     ++ N +L TFY+L LTGIS+GG      
Sbjct: 219 CLPTTAADASGSLILGGNSSVYKNTTPISYTRMIANPQLPTFYFLNLTGISIGG------ 272

Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFS 172
             A +       GI++DSGT +TRL    Y  L+  F++           ++ DTC++ +
Sbjct: 273 -VALQAPNYRQSGILIDSGTVITRLPPPVYRDLKAEFLKQFSGFPSAPPFSILDTCFNLN 331

Query: 173 SRSSVEVPTVSFHFPEGKVLPLPAKN--YLIPVDSNGTFCFAFAPTS--SSLSIIGNVQQ 228
               V++PT+   F     L +      Y +  D++   C A A  S    + IIGN QQ
Sbjct: 332 GYDEVDIPTIRMQFEGNAELTVDVTGIFYFVKTDAS-QVCLALASLSFDDEIPIIGNYQQ 390

Query: 229 QGTRVSFNLRNSLIGFTPNKC 249
           +  RV +N + S +GF    C
Sbjct: 391 RNQRVIYNTKESKLGFAAEAC 411


>gi|225216960|gb|ACN85252.1| aspartic proteinase nepenthesin-1 precursor [Oryza officinalis]
          Length = 519

 Score =  139 bits (350), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 100/260 (38%), Positives = 134/260 (51%), Gaps = 19/260 (7%)

Query: 1   GDFVTETVTLGSA-SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQI---NASTFS 56
           G F  +T+TL S  +V     GCG  NEGLF  AAGLLGLG G  S P Q        F+
Sbjct: 268 GFFAMDTLTLSSYDAVKGFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFA 327

Query: 57  YCLVDRDSDSTSTLEFDSSLPPNA---VTAPLLRNHELDTFYYLGLTGISVGGDLLPISE 113
           +CL  R S  T  L+F +     A   +T P+L ++   TFYY+G+TGI VGG LL I +
Sbjct: 328 HCLPAR-STGTGYLDFGAGSLAAASARLTTPMLTDNG-PTFYYVGMTGIRVGGQLLSIPQ 385

Query: 114 TAFKIDESGNGGIIVDSGTAVTRLQTETYNALR--DAFVRGTRALSPTDGVALFDTCYDF 171
           + F        G IVDSGT +TRL    Y++LR   A     R       V+L DTCYDF
Sbjct: 386 SVF-----ATAGTIVDSGTVITRLPPAAYSSLRYAFAAAMAARGYKKAPAVSLLDTCYDF 440

Query: 172 SSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS--SSLSIIGNVQQQ 229
           +  S V +PTVS  F  G  L + A   +    ++   C AFA       + I+GN Q +
Sbjct: 441 TGMSQVAIPTVSLLFQGGARLDVDASGIMYAASAS-QVCLAFAANEDGGDVGIVGNTQLK 499

Query: 230 GTRVSFNLRNSLIGFTPNKC 249
              V++++   ++GF P  C
Sbjct: 500 TFGVAYDIGKKVVGFYPGAC 519


>gi|225216973|gb|ACN85264.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
          Length = 517

 Score =  139 bits (350), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 101/260 (38%), Positives = 135/260 (51%), Gaps = 19/260 (7%)

Query: 1   GDFVTETVTLGSA-SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQI---NASTFS 56
           G F  +T+TL S  +V     GCG  NEGLF  AAGLLGLG G  S P Q        F+
Sbjct: 266 GFFAMDTLTLSSYDAVKGFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFA 325

Query: 57  YCLVDRDSDSTSTLEFDSSLPPNA---VTAPLLRNHELDTFYYLGLTGISVGGDLLPISE 113
           +CL  R S  T  L+F +  P  A   +T P+L ++   TFYY+G+TGI VGG LL I +
Sbjct: 326 HCLPAR-STGTGYLDFGAGSPAAASARLTTPMLTDNG-PTFYYIGMTGIRVGGQLLSIPQ 383

Query: 114 TAFKIDESGNGGIIVDSGTAVTRLQTETYNALR--DAFVRGTRALSPTDGVALFDTCYDF 171
           + F        G IVDSGT +TRL    Y++LR   A     R       V+L DTCYDF
Sbjct: 384 SVFA-----TAGTIVDSGTVITRLPPPAYSSLRYAFAAAMAARGYKKAPAVSLLDTCYDF 438

Query: 172 SSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS--SSLSIIGNVQQQ 229
           +  S V +PTVS  F  G  L + A   +    ++   C AFA       + I+GN Q +
Sbjct: 439 TGMSQVAIPTVSLLFQGGARLDVDASGIMYAASAS-QVCLAFAANEDGGDVGIVGNTQLK 497

Query: 230 GTRVSFNLRNSLIGFTPNKC 249
              V++++   ++GF P  C
Sbjct: 498 TFGVAYDIGKKVVGFYPGVC 517


>gi|222624819|gb|EEE58951.1| hypothetical protein OsJ_10630 [Oryza sativa Japonica Group]
          Length = 440

 Score =  139 bits (350), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 92/256 (35%), Positives = 126/256 (49%), Gaps = 17/256 (6%)

Query: 6   ETVT-LGSASVDNIAIGCGHNNEGLFVG-AAGLLGLGGGSLSFPSQINASTFSYCLVDRD 63
           ETV+ +  ASV  +  GCG NN G+F     G+ G G G LS PSQ+    FS+C     
Sbjct: 187 ETVSFVAGASVPGVVFGCGLNNTGIFRSNETGIAGFGRGPLSLPSQLKVGNFSHCFTAVS 246

Query: 64  SDSTSTLEFDSSLPPN--------AVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETA 115
               ST+ FD  LP +          T PL++N    TFYYL L GI+VG   LP+ E+A
Sbjct: 247 GRKPSTVLFD--LPADLYKNGRGTVQTTPLIKNPAHPTFYYLSLKGITVGSTRLPVPESA 304

Query: 116 FKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTR-ALSPTDGVALFDTCYDFSSR 174
           F + ++G GG I+DSGTA T L    Y  + D F    +  + P++       C+     
Sbjct: 305 FAL-KNGTGGTIIDSGTAFTSLPPRVYRLVHDEFAAHVKLPVVPSNETGPL-LCFSAPPL 362

Query: 175 SSV-EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRV 233
                VP +  HF EG  + LP +NY+      G      A     ++IIGN QQQ   V
Sbjct: 363 GKAPHVPKLVLHF-EGATMHLPRENYVFEAKDGGNCSICLAIIEGEMTIIGNFQQQNMHV 421

Query: 234 SFNLRNSLIGFTPNKC 249
            ++L+NS + F   KC
Sbjct: 422 LYDLKNSKLSFVRAKC 437


>gi|108707835|gb|ABF95630.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
          Length = 384

 Score =  139 bits (350), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 92/256 (35%), Positives = 126/256 (49%), Gaps = 17/256 (6%)

Query: 6   ETVT-LGSASVDNIAIGCGHNNEGLFVG-AAGLLGLGGGSLSFPSQINASTFSYCLVDRD 63
           ETV+ +  ASV  +  GCG NN G+F     G+ G G G LS PSQ+    FS+C     
Sbjct: 131 ETVSFVAGASVPGVVFGCGLNNTGIFRSNETGIAGFGRGPLSLPSQLKVGNFSHCFTAVS 190

Query: 64  SDSTSTLEFDSSLPPN--------AVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETA 115
               ST+ FD  LP +          T PL++N    TFYYL L GI+VG   LP+ E+A
Sbjct: 191 GRKPSTVLFD--LPADLYKNGRGTVQTTPLIKNPAHPTFYYLSLKGITVGSTRLPVPESA 248

Query: 116 FKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTR-ALSPTDGVALFDTCYDFSSR 174
           F + ++G GG I+DSGTA T L    Y  + D F    +  + P++       C+     
Sbjct: 249 FAL-KNGTGGTIIDSGTAFTSLPPRVYRLVHDEFAAHVKLPVVPSNETGPL-LCFSAPPL 306

Query: 175 SSV-EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRV 233
                VP +  HF EG  + LP +NY+      G      A     ++IIGN QQQ   V
Sbjct: 307 GKAPHVPKLVLHF-EGATMHLPRENYVFEAKDGGNCSICLAIIEGEMTIIGNFQQQNMHV 365

Query: 234 SFNLRNSLIGFTPNKC 249
            ++L+NS + F   KC
Sbjct: 366 LYDLKNSKLSFVRAKC 381


>gi|218192703|gb|EEC75130.1| hypothetical protein OsI_11317 [Oryza sativa Indica Group]
          Length = 440

 Score =  139 bits (350), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 92/256 (35%), Positives = 126/256 (49%), Gaps = 17/256 (6%)

Query: 6   ETVT-LGSASVDNIAIGCGHNNEGLFVG-AAGLLGLGGGSLSFPSQINASTFSYCLVDRD 63
           ETV+ +  ASV  +  GCG NN G+F     G+ G G G LS PSQ+    FS+C     
Sbjct: 187 ETVSFVAGASVPGVVFGCGLNNTGIFRSNETGIAGFGRGPLSLPSQLKVGNFSHCFTAVS 246

Query: 64  SDSTSTLEFDSSLPPN--------AVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETA 115
               ST+ FD  LP +          T PL++N    TFYYL L GI+VG   LP+ E+A
Sbjct: 247 GRKPSTVLFD--LPADLYKNGRGTVQTTPLIKNPAHPTFYYLSLKGITVGSTRLPVPESA 304

Query: 116 FKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTR-ALSPTDGVALFDTCYDFSSR 174
           F + ++G GG I+DSGTA T L    Y  + D F    +  + P++       C+     
Sbjct: 305 FAL-KNGTGGTIIDSGTAFTSLPPRVYRLVHDEFAAHVKLPVVPSNETGPL-LCFSAPPL 362

Query: 175 SSV-EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRV 233
                VP +  HF EG  + LP +NY+      G      A     ++IIGN QQQ   V
Sbjct: 363 GKAPHVPKLVLHF-EGATMHLPRENYVFEAKDGGNCSICLAIIEGEMTIIGNFQQQNMHV 421

Query: 234 SFNLRNSLIGFTPNKC 249
            ++L+NS + F   KC
Sbjct: 422 LYDLKNSKLSFVRAKC 437


>gi|413951280|gb|AFW83929.1| hypothetical protein ZEAMMB73_279135 [Zea mays]
          Length = 451

 Score =  139 bits (349), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 95/250 (38%), Positives = 133/250 (53%), Gaps = 22/250 (8%)

Query: 15  VDNIA---IGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ---INASTFSYCLVD-RDSDST 67
           VD +A    GC H   G  V   GL+G G G LSFPSQ   +  S FSYCL   + S+ +
Sbjct: 207 VDAVAAYTFGCLHVVTGGSVPPQGLVGFGRGPLSFPSQTKDVYGSVFSYCLPSYKSSNFS 266

Query: 68  STLEFDSSLPPNAV-TAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGI 126
            TL    +  P  + T PLL N    + YY+ + GI VGG  +P+  +A   D +   G 
Sbjct: 267 GTLRLGPAGQPKRIKTTPLLSNPHRPSLYYVNMVGIRVGGRPVPVPASALAFDPTSGRGT 326

Query: 127 IVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDG-VALFDTCYDFSSRSSVEVPTVSFH 185
           IVD+GT  TRL    Y A+RD F    RA  P  G +  FDTCY+     ++ VPTV+F 
Sbjct: 327 IVDAGTMFTRLSAPVYAAVRDVFRSRVRA--PVAGPLGGFDTCYNV----TISVPTVTFS 380

Query: 186 FPEGKV-LPLPAKNYLIPVDSNGTFCFAFAP-----TSSSLSIIGNVQQQGTRVSFNLRN 239
           F +G+V + LP +N +I   S G  C A A        ++L+++ ++QQQ  RV F++ N
Sbjct: 381 F-DGRVSVTLPEENVVIRSSSGGIACLAMAAGPPDGVDAALNVLASMQQQNHRVLFDVAN 439

Query: 240 SLIGFTPNKC 249
             +GF+   C
Sbjct: 440 GRVGFSRELC 449


>gi|297842769|ref|XP_002889266.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297335107|gb|EFH65525.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 489

 Score =  139 bits (349), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 87/261 (33%), Positives = 137/261 (52%), Gaps = 21/261 (8%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
           GD  +E++ LG   ++N+  GCG NN+GLF GA+GL+GLG  S+S  SQ   +    FSY
Sbjct: 234 GDLASESIVLGDTKLENLVFGCGRNNKGLFGGASGLMGLGRSSVSLVSQTLKTFNGVFSY 293

Query: 58  CLVDRDSDSTSTLEF--DSSLPPNAVTA---PLLRNHELDTFYYLGLTGISVGGDLLPIS 112
           CL   +  ++ TL F  D S+  N+ +    PL++N +L +FY L LTG S+GG  + + 
Sbjct: 294 CLPSLEDGASGTLSFGNDFSVYKNSTSVFYTPLVQNPQLRSFYILNLTGASIGG--VELK 351

Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFS 172
             +F        GI++DSGT +TRL    Y A++  F++         G ++ DTC++ +
Sbjct: 352 TLSF------GRGILIDSGTVITRLPPSIYKAVKTEFLKQFSGFPSAPGYSILDTCFNLT 405

Query: 173 SRSSVEVPTVSFHFPEGKVLPLPAKN--YLIPVDSNGTFCFAFAPTS--SSLSIIGNVQQ 228
           S   + +PT+   F     L +      Y +  D++   C A A  S  + + IIGN QQ
Sbjct: 406 SYEDISIPTIKMIFEGNAELEVDVTGVFYFVKPDAS-LVCLALASLSYENEVGIIGNYQQ 464

Query: 229 QGTRVSFNLRNSLIGFTPNKC 249
           +  RV ++     +G     C
Sbjct: 465 KNQRVIYDTTQERLGIAGENC 485


>gi|242041115|ref|XP_002467952.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
 gi|241921806|gb|EER94950.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
          Length = 774

 Score =  139 bits (349), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 91/255 (35%), Positives = 127/255 (49%), Gaps = 22/255 (8%)

Query: 11  GSASVDNIAIGCGHNNEGLFV-GAAGLLGLGGGSLSFPSQINASTFSYCLVDRDSDSTST 69
           G A+V ++A GCG  N G+F     G+ G G G+LS PSQ+    FS+C         S+
Sbjct: 523 GQATVPDLAFGCGLFNNGIFTSNETGIAGFGRGALSLPSQLKVDNFSHCFTAITGSEPSS 582

Query: 70  LEFDSSLPPN--------AVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDES 121
           +     LP N          + PL++N      YYL L GI+VG   LPI E+ F + + 
Sbjct: 583 VLL--GLPANLYSDADGAVQSTPLVQNFSSLRAYYLSLKGITVGSTRLPIPESTFALKQD 640

Query: 122 GNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGV---ALFDTCYDFS--SRSS 176
           G GG I+DSGT +T L  + Y  + DAF    R   P D     +L   C+ FS   R+ 
Sbjct: 641 GTGGTIIDSGTGMTTLPQDAYKLVHDAFTAQVRL--PVDNATSSSLSRLCFSFSVPRRAK 698

Query: 177 VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNG--TFCFAFAPTSSSLSIIGNVQQQGTRVS 234
            +VP +  HF EG  L LP +NY+   +  G    C A       L+IIGN QQQ   V 
Sbjct: 699 PDVPKLVLHF-EGATLDLPRENYMFEFEDAGGSVTCLAIN-AGDDLTIIGNYQQQNLHVL 756

Query: 235 FNLRNSLIGFTPNKC 249
           ++L  +++ F P +C
Sbjct: 757 YDLVRNMLSFVPAQC 771


>gi|326507654|dbj|BAK03220.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 442

 Score =  139 bits (349), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 96/261 (36%), Positives = 136/261 (52%), Gaps = 21/261 (8%)

Query: 6   ETVTLGSASVDN-----IAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
           ET T GS   D      IA GC + +   + G+AGL+GLG GS+S  SQ+ A  FSYCL 
Sbjct: 183 ETFTFGSTPADQTRVPGIAFGCSNASSDDWNGSAGLVGLGRGSMSLVSQLGAGMFSYCLT 242

Query: 61  D-RDSDSTSTLEFDSSLPPN---AVTAPLL---RNHELDTFYYLGLTGISVGGDLLPISE 113
             +D++STSTL    S   N    +T P +       + T+YYL LTGIS+G   L I  
Sbjct: 243 PFQDANSTSTLLLGPSAALNGTGVLTTPFVASPSKAPMSTYYYLNLTGISIGTTALSIPP 302

Query: 114 TAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDG--VALFDTCYDF 171
            AF +   G GG+I+DSGT +T L    Y  +R A +     L   DG      D C+  
Sbjct: 303 NAFALRTDGTGGLIIDSGTTITSLVDAAYQQVRAA-IESLVTLPVADGSDSTGLDLCFAL 361

Query: 172 SSRSSV--EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFA-PTSSSLSIIGNVQQ 228
           +S +S    +P+++FHF +G  + LP  NY+I    +G +C A    T  ++S  GN QQ
Sbjct: 362 TSETSTPPSMPSMTFHF-DGADMVLPVDNYMI--LGSGVWCLAMRNQTVGAMSTFGNYQQ 418

Query: 229 QGTRVSFNLRNSLIGFTPNKC 249
           Q   + +++    + F P KC
Sbjct: 419 QNVHLLYDIHEETLSFAPAKC 439


>gi|222624820|gb|EEE58952.1| hypothetical protein OsJ_10633 [Oryza sativa Japonica Group]
          Length = 415

 Score =  138 bits (348), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 91/248 (36%), Positives = 125/248 (50%), Gaps = 16/248 (6%)

Query: 13  ASVDNIAIGCGHNNEGLFVG-AAGLLGLGGGSLSFPSQINASTFSYCLVDRDSDSTSTLE 71
           ASV  +A GCG  N G+F     G+ G G G LS PSQ+    FS+C         ST+ 
Sbjct: 170 ASVPGVAFGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKVGNFSHCFTTITGAIPSTVL 229

Query: 72  FDSSLPPN--------AVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGN 123
            D  LP +          T PL++N    TFYYL L GI+VG   LP+ E+ F + ++G 
Sbjct: 230 LD--LPADLFSNGQGAVQTTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFAL-KNGT 286

Query: 124 GGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVS 183
           GG I+DSGTA+T L T  Y  +RDAF    +    +        C     R+   VP + 
Sbjct: 287 GGTIIDSGTAMTSLPTRVYRLVRDAFAAQVKLPVVSGNTTDPYFCLSAPLRAKPYVPKLV 346

Query: 184 FHFPEGKVLPLPAKNYLIPVDSNGT--FCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNSL 241
            HF EG  + LP +NY+  V+  G+   C A       ++ IGN QQQ   V ++L+NS 
Sbjct: 347 LHF-EGATMDLPRENYVFEVEDAGSSILCLAII-EGGEVTTIGNFQQQNMHVLYDLQNSK 404

Query: 242 IGFTPNKC 249
           + F P +C
Sbjct: 405 LSFVPAQC 412


>gi|357122568|ref|XP_003562987.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 455

 Score =  138 bits (348), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 100/271 (36%), Positives = 139/271 (51%), Gaps = 34/271 (12%)

Query: 6   ETVTLGSAS------VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
           ET T GS+S      V NIA GC + +   + G+AGL+GLG GS+S  SQ+ A  FSYCL
Sbjct: 189 ETFTFGSSSTPPAVRVPNIAFGCSNASSNDWNGSAGLVGLGRGSMSLVSQLGAGAFSYCL 248

Query: 60  VD-RDSDSTSTLEFDSSLPPNAVTA----------PLL---RNHELDTFYYLGLTGISVG 105
              +D++STSTL     L P+A  A          P +       + T+YYL LTGISVG
Sbjct: 249 TPFQDANSTSTLL----LGPSAAAALKGTGPVRSTPFVAGPSKAPMSTYYYLNLTGISVG 304

Query: 106 GDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDA-----FVRGTRALSPTD 160
              L I   AF +   G GG+I+DSGT +T L    Y  +R A       R   A  P  
Sbjct: 305 ETALAIPPDAFSLRADGTGGLIIDSGTTITTLVDSAYQQVRAAVRSLLVTRLPLAHGPDH 364

Query: 161 GVALFDTCYDF-SSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFA-PTSS 218
              L D C+   +S     +P+++ HF  G  + LP +NY+I    +G +C A    T  
Sbjct: 365 STGL-DLCFALKASTPPPAMPSMTLHFEGGADMVLPVENYMI--LGSGVWCLAMRNQTVG 421

Query: 219 SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           ++S++GN QQQ   V +++R   + F P  C
Sbjct: 422 AMSMVGNYQQQNIHVLYDVRKETLSFAPAVC 452


>gi|302781668|ref|XP_002972608.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
 gi|300160075|gb|EFJ26694.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
          Length = 430

 Score =  138 bits (348), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 98/260 (37%), Positives = 137/260 (52%), Gaps = 20/260 (7%)

Query: 9   TLGSASVDNIAIGCGHNNEG-LFVGAAGLLGLGGGSLSFPSQ---INASTFSYCLVD--- 61
           T G A+V  +A GCG  N+G  F G  G++GLG G LSFP+Q   + A TFSYCL+D   
Sbjct: 169 TSGGAAVRGVAFGCGTRNQGGSFSGTGGVIGLGQGQLSFPAQSGSLFAQTFSYCLLDLEG 228

Query: 62  -RDSDSTSTLEFDSSLPPNAVT-APLLRNHELDTFYYLGLTGISVGGDLLPISETAFKID 119
            R   S+S L         A    PL+ N    TFYY+G+  I VG  +LP+  + + ID
Sbjct: 229 GRRGRSSSFLFLGRPERRAAFAYTPLVSNPLAPTFYYVGVVAIRVGNRVLPVPGSEWAID 288

Query: 120 ESGNGGIIVDSGTAVTRLQTETYNALRDAF---VRGTRALSPTDGVALFDTCYDFSSRSS 176
             GNGG ++DSG+ +T L+   Y  L  AF   V   R  S        + CY+ SS SS
Sbjct: 289 VLGNGGTVIDSGSTLTYLRLGAYLHLVSAFAASVHLPRIPSSATFFQGLELCYNVSSSSS 348

Query: 177 VE-----VPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS--SLSIIGNVQQQ 229
           +       P ++  F +G  L LP  NYL+ V ++   C A  PT S  + +++GN+ QQ
Sbjct: 349 LAPANGGFPRLTIDFAQGLSLELPTGNYLVDV-ADDVKCLAIRPTLSPFAFNVLGNLMQQ 407

Query: 230 GTRVSFNLRNSLIGFTPNKC 249
           G  V F+  ++ IGF   +C
Sbjct: 408 GYHVEFDRASARIGFARTEC 427


>gi|302809015|ref|XP_002986201.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
 gi|300146060|gb|EFJ12732.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
          Length = 449

 Score =  138 bits (347), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 106/273 (38%), Positives = 151/273 (55%), Gaps = 31/273 (11%)

Query: 1   GDFVTETVTLG------SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS- 53
           GD   E++++       S  + ++ IGCGH+N+GLF GA GLLGLG G+LSFPSQ+ +S 
Sbjct: 181 GDLALESLSVSLSDHPSSLEIRDMVIGCGHSNKGLFQGAGGLLGLGQGALSFPSQLRSSP 240

Query: 54  ---TFSYCLVDRDSDSTSTLEFDSSLPPNAVTA-----------PLLR-NHELDTFYYLG 98
              +FSYCLVDR    T+ L   S++   A  A           P +R N+ ++TFYYLG
Sbjct: 241 IGQSFSYCLVDR----TNNLSVSSAISFGAGFALSRHFDQMKFTPFVRTNNSVETFYYLG 296

Query: 99  LTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSP 158
           + GI +  +LLPI    F I  +G+GG I+DSGT +T L  + Y A+  AF+   R   P
Sbjct: 297 IQGIKIDQELLPIPAERFAIATNGSGGTIIDSGTTLTYLNRDAYRAVESAFL--ARISYP 354

Query: 159 -TDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVD-SNGTFCFAFAPT 216
             D   +   CY+ + R++V  P +S  F  G  L LP +NY I  D      C A  PT
Sbjct: 355 RADPFDILGICYNATGRAAVPFPALSIVFQNGAELDLPQENYFIQPDPQEAKHCLAILPT 414

Query: 217 SSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
              +SIIGN QQQ     ++++++ +GF    C
Sbjct: 415 -DGMSIIGNFQQQNIHFLYDVQHARLGFANTDC 446


>gi|125543640|gb|EAY89779.1| hypothetical protein OsI_11321 [Oryza sativa Indica Group]
          Length = 434

 Score =  138 bits (347), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 90/248 (36%), Positives = 126/248 (50%), Gaps = 16/248 (6%)

Query: 13  ASVDNIAIGCGHNNEGLFVG-AAGLLGLGGGSLSFPSQINASTFSYCLVDRDSDSTSTLE 71
           ASV  +A GCG  N G+F     G+ G G G LS PSQ+    FS+C    +    ST+ 
Sbjct: 189 ASVPGVAFGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKVGNFSHCFTAVNGLKPSTVL 248

Query: 72  FDSSLPPN--------AVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGN 123
            D  LP +          + PL++N    TFYYL L GI+VG   LP+ E+ F + ++G 
Sbjct: 249 LD--LPADLYKSGRGAVQSTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFTL-KNGT 305

Query: 124 GGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVS 183
           GG I+DSGTA+T L T  Y  +RDAF    +    +        C     R+   VP + 
Sbjct: 306 GGTIIDSGTAMTSLPTRVYRLVRDAFAAQVKLPVVSGNTTDPYFCLSAPLRAKPYVPKLV 365

Query: 184 FHFPEGKVLPLPAKNYLIPVDSNGT--FCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNSL 241
            HF EG  + LP +NY+  V+  G+   C A       ++ IGN QQQ   V ++L+NS 
Sbjct: 366 LHF-EGATMDLPRENYVFEVEDAGSSILCLAII-EGGEVTTIGNFQQQNMHVLYDLQNSK 423

Query: 242 IGFTPNKC 249
           + F P +C
Sbjct: 424 LSFVPAQC 431


>gi|115452685|ref|NP_001049943.1| Os03g0318400 [Oryza sativa Japonica Group]
 gi|108707841|gb|ABF95636.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113548414|dbj|BAF11857.1| Os03g0318400 [Oryza sativa Japonica Group]
          Length = 434

 Score =  138 bits (347), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 90/248 (36%), Positives = 126/248 (50%), Gaps = 16/248 (6%)

Query: 13  ASVDNIAIGCGHNNEGLFV-GAAGLLGLGGGSLSFPSQINASTFSYCLVDRDSDSTSTLE 71
           ASV  +A GCG  N G+F     G+ G G G LS PSQ+    FS+C    +    ST+ 
Sbjct: 189 ASVPGVAFGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKVGNFSHCFTAVNGLKPSTVL 248

Query: 72  FDSSLPPN--------AVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGN 123
            D  LP +          + PL++N    TFYYL L GI+VG   LP+ E+ F + ++G 
Sbjct: 249 LD--LPADLYKSGRGAVQSTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFAL-KNGT 305

Query: 124 GGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVS 183
           GG I+DSGTA+T L T  Y  +RDAF    +    +        C     R+   VP + 
Sbjct: 306 GGTIIDSGTAMTSLPTRVYRLVRDAFAAQVKLPVVSGNTTDPYFCLSAPLRAKPYVPKLV 365

Query: 184 FHFPEGKVLPLPAKNYLIPVDSNGT--FCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNSL 241
            HF EG  + LP +NY+  V+  G+   C A       ++ IGN QQQ   V ++L+NS 
Sbjct: 366 LHF-EGATMDLPRENYVFEVEDAGSSILCLAII-EGGEVTTIGNFQQQNMHVLYDLQNSK 423

Query: 242 IGFTPNKC 249
           + F P +C
Sbjct: 424 LSFVPAQC 431


>gi|414589630|tpg|DAA40201.1| TPA: hypothetical protein ZEAMMB73_629620 [Zea mays]
          Length = 443

 Score =  138 bits (347), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 105/267 (39%), Positives = 139/267 (52%), Gaps = 21/267 (7%)

Query: 1   GDFVTETVTLGS----ASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFS 56
           G    ET T G+     S+  I+ GCG+ N G     +G++G G GSLS  SQ+ +  FS
Sbjct: 177 GVLANETFTFGTNETRVSLPGISFGCGNLNAGSLANGSGMVGFGRGSLSLVSQLGSPRFS 236

Query: 57  YCLVDRDSDSTSTLEFDSSLPPNAVTA--------PLLRNHELDTFYYLGLTGISVGGDL 108
           YCL    S   S L F      N+  A        P + N  L T Y+L +TGISVGG L
Sbjct: 237 YCLTSFLSPVPSRLYFGVYATLNSTNASSEPVQSTPFVVNPALPTMYFLNMTGISVGGYL 296

Query: 109 LPISETAFKI-DESGNGGIIVDSGTAVTRLQTETYNALRDAFV-RGTRALSPTDGVALFD 166
           LPI    F I D  G GG I+DSGT +T L    Y+A+R AF  + T  L      ++ D
Sbjct: 297 LPIDPAVFAINDTDGTGGTIIDSGTTITYLAEPAYDAVRAAFASQITLPLLNVTDASVLD 356

Query: 167 TCYDF--SSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVD--SNGTFCFAFAPTSSSLSI 222
           TC+ +    R SV +P +  HF +G    LP +NY++ VD  + G  C A A +SS  SI
Sbjct: 357 TCFQWPPPPRQSVTLPQLVLHF-DGADWELPLQNYML-VDPSTGGGLCLAMA-SSSDGSI 413

Query: 223 IGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           IG+ Q Q   V ++L NSL+ F P  C
Sbjct: 414 IGSYQHQNFNVLYDLENSLMSFVPAPC 440


>gi|326506682|dbj|BAJ91382.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326525815|dbj|BAJ88954.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 473

 Score =  137 bits (346), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 93/252 (36%), Positives = 135/252 (53%), Gaps = 13/252 (5%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
           G    +TV+ GS SV N   GCG +NEGLF  +AGL+GL    LS   Q+  +   +FSY
Sbjct: 231 GYLSKDTVSFGSNSVPNFYYGCGQDNEGLFGRSAGLMGLARNKLSLLYQLAPTLGYSFSY 290

Query: 58  CLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFK 117
           CL    S    ++   +  P      P++ +   D+ Y++ L+G++V G  L +S +   
Sbjct: 291 CLPSSSSSGYLSIGSYN--PGQYSYTPMVSSTLDDSLYFIKLSGMTVAGKPLAVSSS--- 345

Query: 118 IDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSV 177
             E  +   I+DSGT +TRL T  Y+AL  A     +     D  ++ DTC+     SS+
Sbjct: 346 --EYSSLPTIIDSGTVITRLPTTVYDALSKAVAGAMKGTKRADAYSILDTCF-VGQASSL 402

Query: 178 EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNL 237
            VP VS  F  G  L L A+N L+ VDS+ T C AFAP  S+ +IIGN QQQ   V +++
Sbjct: 403 RVPAVSMAFSGGAALKLSAQNLLVDVDSSTT-CLAFAPARSA-AIIGNTQQQTFSVVYDV 460

Query: 238 RNSLIGFTPNKC 249
           +++ IGF    C
Sbjct: 461 KSNRIGFAAGGC 472


>gi|356569424|ref|XP_003552901.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 536

 Score =  137 bits (346), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 96/278 (34%), Positives = 138/278 (49%), Gaps = 36/278 (12%)

Query: 1   GDFVTETVTLGS---------ASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN 51
           GDF  ET T+             V ++  GCGH N+G F GA GLLGLG G LSFPSQ+ 
Sbjct: 263 GDFALETFTVNLTWPNGKEKFKHVVDVMFGCGHWNKGFFHGAGGLLGLGRGPLSFPSQLQ 322

Query: 52  A---STFSYCLVDRDSDST--STLEFDSSLPPNAVTAPLLRNHEL-------------DT 93
           +    +FSYCL D  S+++  S L F            LL +H L             DT
Sbjct: 323 SIYGHSFSYCLTDLFSNTSVSSKLIFGED-------KELLNHHNLNFTKLLAGEETPDDT 375

Query: 94  FYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGT 153
           FYYL +  I VGG++L I E  +     G GG I+DSG+ +T      Y+ +++AF +  
Sbjct: 376 FYYLQIKSIVVGGEVLDIPEKTWHWSSEGVGGTIIDSGSTLTFFPDSAYDVIKEAFEKKI 435

Query: 154 RALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAF 213
           +         +   CY+ S    VE+P    HF +G V   PA+NY    + +   C A 
Sbjct: 436 KLQQIAADDFIMSPCYNVSGAMQVELPDYGIHFADGAVWNFPAENYFYQYEPDEVICLAI 495

Query: 214 --APTSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
              P  S L+IIGN+ QQ   + ++++ S +G++P +C
Sbjct: 496 LKTPNHSHLTIIGNLLQQNFHILYDVKRSRLGYSPRRC 533


>gi|356524289|ref|XP_003530762.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-2-like [Glycine max]
          Length = 392

 Score =  137 bits (346), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 92/243 (37%), Positives = 125/243 (51%), Gaps = 15/243 (6%)

Query: 15  VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSYCLVDRDSDSTSTLE 71
           VD+   GCG +NEGLF G+AGL+GLG   +S   Q +++    FSYCL    S S   L 
Sbjct: 156 VDDFLFGCGQDNEGLFNGSAGLMGLGRHPISIVQQTSSNYNKIFSYCL-PATSSSLGHLT 214

Query: 72  FDSSLPPNA--VTAPLLRNHELDTFYYLGLTGISVGGDLLP-ISETAFKIDESGNGGIIV 128
           F +S   NA  +  PL      ++FY L +  ISVGG  LP +S + F       GG I+
Sbjct: 215 FGASAATNASLIYTPLSTISGDNSFYGLDIVSISVGGTKLPAVSSSTFSA-----GGSII 269

Query: 129 DSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPE 188
           DSGT +TRL    Y ALR AF R        +   L DTCYD S    + VP + F F  
Sbjct: 270 DSGTVITRLAPTVYAALRSAFRRXMEKYPVANEAGLLDTCYDLSGYKEISVPRIDFEFSG 329

Query: 189 GKVLPLPAKNYLIPVDSNGTFCFAFAPTSS--SLSIIGNVQQQGTRVSFNLRNSLIGFTP 246
           G  + L  +  ++ V+S    C AFA   S   +++ GNVQQ+   V ++++   IGF  
Sbjct: 330 GVTVELXHRG-ILXVESEQQVCLAFAANGSDNDITVFGNVQQKTLEVVYDVKGGRIGFGA 388

Query: 247 NKC 249
             C
Sbjct: 389 AGC 391


>gi|302822373|ref|XP_002992845.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
 gi|300139393|gb|EFJ06135.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
          Length = 510

 Score =  137 bits (345), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 93/257 (36%), Positives = 139/257 (54%), Gaps = 21/257 (8%)

Query: 14  SVDNIAIGCGH-NNEGLFVGAAGLLGLGGGSLSFPSQIN---ASTFSYCLVDRDS--DST 67
            + NI +GC   + EGL  GA+GLLG+    +SFPSQ++   A  FS+C  D+ +  +S+
Sbjct: 251 KLSNITLGCADIDREGLPTGASGLLGMDRRPISFPSQLSSRYARKFSHCFPDKIAHLNSS 310

Query: 68  STLEFDSS--LPPNAVTAPLLRNHELDT----FYYLGLTGISVGGDLLPISETAFKIDE- 120
             + F  S  + P     PL++N  + +    +YY+GL GISV    LP+S   F ID+ 
Sbjct: 311 GLVFFGESDIISPYLRYTPLVQNPAVPSASLDYYYVGLVGISVDESRLPLSHKNFDIDKV 370

Query: 121 SGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSS----RSS 176
           +G+GG I+DSGTA T L+   + A+R  F+  T  L+  D  + F  CY+ +S      S
Sbjct: 371 TGSGGTIIDSGTAFTYLKKPAFQAMRREFLARTSHLAKVDDNSGFTPCYNITSGTAALES 430

Query: 177 VEVPTVSFHFPEGKVLPLPAKNYLIPVDS---NGTFCFAFAPTSS-SLSIIGNVQQQGTR 232
             +P+++ HF  G  + LP  + LIPV S     T C AF  +     +IIGN QQQ   
Sbjct: 431 TILPSITLHFRGGLDVVLPKNSILIPVSSSEEQTTLCLAFQMSGDIPFNIIGNYQQQNLW 490

Query: 233 VSFNLRNSLIGFTPNKC 249
           V ++L    +G  P +C
Sbjct: 491 VEYDLEKLRLGIAPAQC 507


>gi|302811785|ref|XP_002987581.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
 gi|300144735|gb|EFJ11417.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
          Length = 511

 Score =  137 bits (344), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 93/257 (36%), Positives = 139/257 (54%), Gaps = 21/257 (8%)

Query: 14  SVDNIAIGCGH-NNEGLFVGAAGLLGLGGGSLSFPSQIN---ASTFSYCLVDRDS--DST 67
            + NI +GC   + EGL  GA+GLLG+    +SFPSQ++   A  FS+C  D+ +  +S+
Sbjct: 252 KLSNITLGCADIDREGLPTGASGLLGMDRRPISFPSQLSSRYARKFSHCFPDKIAHLNSS 311

Query: 68  STLEFDSS--LPPNAVTAPLLRNHELDT----FYYLGLTGISVGGDLLPISETAFKIDE- 120
             + F  S  + P     PL++N  + +    +YY+GL GISV    LP+S   F ID+ 
Sbjct: 312 GLVFFGESDIISPYLRYTPLVQNPAVPSASLDYYYVGLVGISVDESRLPLSHKNFDIDKV 371

Query: 121 SGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSS----RSS 176
           +G+GG I+DSGTA T L+   + A+R  F+  T  L+  D  + F  CY+ +S      S
Sbjct: 372 TGSGGTIIDSGTAFTYLKKPAFQAMRREFLARTSHLAKVDDNSGFTPCYNITSGTAALES 431

Query: 177 VEVPTVSFHFPEGKVLPLPAKNYLIPVDS---NGTFCFAFAPTSS-SLSIIGNVQQQGTR 232
             +P+++ HF  G  + LP  + LIPV S     T C AF  +     +IIGN QQQ   
Sbjct: 432 TILPSITLHFRGGLDVVLPKNSILIPVSSSEEQTTLCLAFLMSGDIPFNIIGNYQQQNLW 491

Query: 233 VSFNLRNSLIGFTPNKC 249
           V ++L    +G  P +C
Sbjct: 492 VEYDLEKLRLGIAPAQC 508


>gi|356567196|ref|XP_003551807.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 490

 Score =  136 bits (343), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 97/249 (38%), Positives = 125/249 (50%), Gaps = 24/249 (9%)

Query: 1   GDFVTETVTLGSASV-DNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFS 56
           G F  E +T+ +  V DN   GCG NN+GLF G+AGL+GLG   +SF  Q  A     FS
Sbjct: 241 GYFSRERLTVTATDVVDNFLFGCGQNNQGLFGGSAGLIGLGRHPISFVQQTAAKYRKIFS 300

Query: 57  YCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDT------FYYLGLTGISVGGDLLP 110
           YCL    S ST  L F       A T   L+     T      FY L +T I+VGG  LP
Sbjct: 301 YCL-PSTSSSTGHLSFGP-----AATGRYLKYTPFSTISRGSSFYGLDITAIAVGGVKLP 354

Query: 111 ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYD 170
           +S + F       GG I+DSGT +TRL    Y ALR AF +G         +++ DTCYD
Sbjct: 355 VSSSTFS-----TGGAIIDSGTVITRLPPTAYGALRSAFRQGMSKYPSAGELSILDTCYD 409

Query: 171 FSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT--SSSLSIIGNVQQ 228
            S      +PT+ F F  G  + LP +  L  V S    C AFA     S ++I GNVQQ
Sbjct: 410 LSGYKVFSIPTIEFSFAGGVTVKLPPQGILF-VASTKQVCLAFAANGDDSDVTIYGNVQQ 468

Query: 229 QGTRVSFNL 237
           +   V +++
Sbjct: 469 RTIEVVYDV 477


>gi|359483137|ref|XP_002272278.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 402

 Score =  136 bits (342), Expect = 9e-30,   Method: Compositional matrix adjust.
 Identities = 86/260 (33%), Positives = 130/260 (50%), Gaps = 18/260 (6%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ---INASTFSY 57
           G+   E +  G+  V +   GCG NN+GLF G +GL+GLG   LS  SQ   I    FSY
Sbjct: 147 GELGHEKLKFGTILVKDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLISQTSGIFGGVFSY 206

Query: 58  CL--VDRDSDSTSTLEFDSSLPPNAV---TAPLLRNHELDTFYYLGLTGISVGGDLLPIS 112
           CL   +R    +  L  +SS+  N+     A ++ N +L  FY++ LTGIS+GG      
Sbjct: 207 CLPSTERKGSGSLILGGNSSVYRNSSPISYAKMIENPQLYNFYFINLTGISIGG------ 260

Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFS 172
             A +    G   I+VDSGT +TRL    Y AL+  F++      P    ++ DTC++ S
Sbjct: 261 -VALQAPSVGPSRILVDSGTVITRLPPTIYKALKAEFLKQFTGFPPAPAFSILDTCFNLS 319

Query: 173 SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGT-FCFAFA--PTSSSLSIIGNVQQQ 229
           +   V++PT+  HF     L +        V S+ +  C A A       ++I+GN QQ+
Sbjct: 320 AYQEVDIPTIKMHFEGNAELTVDVTGVFYFVKSDASQVCLALASLEYQDEVAILGNYQQK 379

Query: 230 GTRVSFNLRNSLIGFTPNKC 249
             RV ++ + + +GF    C
Sbjct: 380 NLRVIYDTKETKVGFALETC 399


>gi|226530755|ref|NP_001150335.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
 gi|195638492|gb|ACG38714.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
          Length = 461

 Score =  136 bits (342), Expect = 9e-30,   Method: Compositional matrix adjust.
 Identities = 94/256 (36%), Positives = 133/256 (51%), Gaps = 19/256 (7%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
           G + ++T+ LGS++V +   GC +   G      GL+GLGGG+ S  SQ   +    FSY
Sbjct: 218 GTYSSDTLALGSSAVKSFQFGCSNVESGFNDQTDGLMGLGGGAQSLVSQTAGTLGRAFSY 277

Query: 58  CLVDRDSDS--TSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETA 115
           CL    S S   +      S     V  P+LR+ ++ TFY + L  I VGG  L I  + 
Sbjct: 278 CLPPTPSSSGFLTLGAAGGSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASV 337

Query: 116 FKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRS 175
           F      + G ++DSGT +TRL    Y+AL  AF  G +   P     + DTC+DFS +S
Sbjct: 338 F------SAGTVMDSGTVITRLPPTAYSALSSAFKAGMKQYPPAQPSGILDTCFDFSGQS 391

Query: 176 SVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS--SSLSIIGNVQQQGTRV 233
           SV +P+V+  F  G V+ L A   ++   SN   C AFA  S  SSL IIGNVQQ+   V
Sbjct: 392 SVSIPSVALVFSGGAVVSLDASGIIL---SN---CLAFAANSDDSSLGIIGNVQQRTFEV 445

Query: 234 SFNLRNSLIGFTPNKC 249
            +++   ++GF    C
Sbjct: 446 LYDVGRGVVGFRAGAC 461


>gi|116790042|gb|ABK25480.1| unknown [Picea sitchensis]
          Length = 460

 Score =  135 bits (340), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 104/255 (40%), Positives = 139/255 (54%), Gaps = 15/255 (5%)

Query: 6   ETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGS-LSFPSQINAS---TFSYCLVD 61
           E+ TL S S+ +IA GCG  NEG      G L   G   LS  SQ+  S    FSYCLV 
Sbjct: 207 ESFTLTSQSLPHIAFGCGQENEGGGFSQGGGLVGFGRGPLSLISQLGQSLGNKFSYCLVS 266

Query: 62  -RDSDS-TSTLEFDSSLPPNAVTA---PLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
             DS S TS L    +   NA T    PL+++    TFYYL L GISVGG LL I++  F
Sbjct: 267 ITDSPSKTSPLFIGKTASLNAKTVSSTPLVQSRSRPTFYYLSLEGISVGGQLLDIADGTF 326

Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVAL-FDTCYDFSSRS 175
            +   G GG+I+DSGT VT L+   Y+ ++ A +     L   DG  +  D C++  S S
Sbjct: 327 DLQLDGTGGVIIDSGTTVTYLEQSGYDVVKKAVISSIN-LPQVDGSNIGLDLCFEPQSGS 385

Query: 176 SV-EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVS 234
           S    PT++FHF EG    LP +NY I  DS+G  C A  P S+ +SI GN+QQQ  ++ 
Sbjct: 386 STSHFPTITFHF-EGADFNLPKENY-IYTDSSGIACLAMLP-SNGMSIFGNIQQQNYQIL 442

Query: 235 FNLRNSLIGFTPNKC 249
           ++   +++ F P  C
Sbjct: 443 YDNERNVLSFAPTVC 457


>gi|225430555|ref|XP_002285593.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
          Length = 481

 Score =  135 bits (340), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 94/256 (36%), Positives = 130/256 (50%), Gaps = 13/256 (5%)

Query: 1   GDFVTETVTLGSASV-DNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN---ASTFS 56
           G F  + + L S  V +N   GCG NN GLFVG AGL+GLG  +LS  SQ        FS
Sbjct: 231 GFFAQDKLALTSTDVFNNFLFGCGQNNRGLFVGVAGLIGLGRNALSLVSQTAQKYGKLFS 290

Query: 57  YCLVDRDSDSTSTLEFDSSLPPNAVTA--PLLRNHELDTFYYLGLTGISVGGDLLPISET 114
           YCL    S ST  L F S    +      P L N +  +FY+L L  ISVGG  L  S +
Sbjct: 291 YCL-PSTSSSTGYLTFGSGGGTSKAVKFTPSLVNSQGPSFYFLNLIAISVGGRKLSTSAS 349

Query: 115 AFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSR 174
            F        G I+DSGT ++RL    Y+ LR +F +           ++ DTCYDFS  
Sbjct: 350 VFS-----TAGTIIDSGTVISRLPPTAYSDLRASFQQQMSKYPKAAPASILDTCYDFSQY 404

Query: 175 SSVEVPTVSFHFPEGKVLPL-PAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRV 233
            +V+VP ++ +F +G  + L P+  + I   S     FA    ++ ++I+GNVQQ+   V
Sbjct: 405 DTVDVPKINLYFSDGAEMDLDPSGIFYILNISQVCLAFAGNSDATDIAILGNVQQKTFDV 464

Query: 234 SFNLRNSLIGFTPNKC 249
            +++    IGF P  C
Sbjct: 465 VYDVAGGRIGFAPGGC 480


>gi|223950123|gb|ACN29145.1| unknown [Zea mays]
 gi|413923785|gb|AFW63717.1| hypothetical protein ZEAMMB73_445506 [Zea mays]
          Length = 385

 Score =  135 bits (340), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 94/256 (36%), Positives = 133/256 (51%), Gaps = 19/256 (7%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
           G + ++T+ LGS++V +   GC +   G      GL+GLGGG+ S  SQ   +    FSY
Sbjct: 142 GTYSSDTLALGSSAVRSFQFGCSNVESGFNDQTDGLMGLGGGAQSLVSQTAGTLGRAFSY 201

Query: 58  CLVDRDSDS--TSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETA 115
           CL    S S   +      S     V  P+LR+ ++ TFY + L  I VGG  L I  + 
Sbjct: 202 CLPPTPSSSGFLTLGAAGGSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASV 261

Query: 116 FKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRS 175
           F      + G ++DSGT +TRL    Y+AL  AF  G +   P     + DTC+DFS +S
Sbjct: 262 F------SAGTVMDSGTVITRLPPTAYSALSSAFKAGMKQYPPAQPSGILDTCFDFSGQS 315

Query: 176 SVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS--SSLSIIGNVQQQGTRV 233
           SV +P+V+  F  G V+ L A   ++   SN   C AFA  S  SSL IIGNVQQ+   V
Sbjct: 316 SVSIPSVALVFSGGAVVSLDASGIIL---SN---CLAFAGNSDDSSLGIIGNVQQRTFEV 369

Query: 234 SFNLRNSLIGFTPNKC 249
            +++   ++GF    C
Sbjct: 370 LYDVGRGVVGFRAGAC 385


>gi|413923784|gb|AFW63716.1| aspartic proteinase nepenthesin-2 [Zea mays]
          Length = 531

 Score =  135 bits (340), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 94/256 (36%), Positives = 133/256 (51%), Gaps = 19/256 (7%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
           G + ++T+ LGS++V +   GC +   G      GL+GLGGG+ S  SQ   +    FSY
Sbjct: 288 GTYSSDTLALGSSAVRSFQFGCSNVESGFNDQTDGLMGLGGGAQSLVSQTAGTLGRAFSY 347

Query: 58  CLVDRDSDS--TSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETA 115
           CL    S S   +      S     V  P+LR+ ++ TFY + L  I VGG  L I  + 
Sbjct: 348 CLPPTPSSSGFLTLGAAGGSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASV 407

Query: 116 FKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRS 175
           F      + G ++DSGT +TRL    Y+AL  AF  G +   P     + DTC+DFS +S
Sbjct: 408 F------SAGTVMDSGTVITRLPPTAYSALSSAFKAGMKQYPPAQPSGILDTCFDFSGQS 461

Query: 176 SVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS--SSLSIIGNVQQQGTRV 233
           SV +P+V+  F  G V+ L A   ++   SN   C AFA  S  SSL IIGNVQQ+   V
Sbjct: 462 SVSIPSVALVFSGGAVVSLDASGIIL---SN---CLAFAGNSDDSSLGIIGNVQQRTFEV 515

Query: 234 SFNLRNSLIGFTPNKC 249
            +++   ++GF    C
Sbjct: 516 LYDVGRGVVGFRAGAC 531


>gi|356555807|ref|XP_003546221.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 457

 Score =  135 bits (339), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 89/255 (34%), Positives = 136/255 (53%), Gaps = 20/255 (7%)

Query: 7   TVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN---ASTFSYCL---- 59
           T+T  +A       GCG +N+GLF  +AG++GL    LS   Q++    + FSYCL    
Sbjct: 210 TLTPSAAPSSGFVYGCGQDNQGLFGRSAGIIGLANDKLSMLGQLSNKYGNAFSYCLPSSF 269

Query: 60  -VDRDSDSTSTLEFDSSLP---PNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETA 115
               +S  +  L   +S     P   T PL++N ++ + Y+LGLT I+V G  L +S ++
Sbjct: 270 SAQPNSSVSGFLSIGASSLSSSPYKFT-PLVKNPKIPSLYFLGLTTITVAGKPLGVSASS 328

Query: 116 FKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR-GTRALSPTDGVALFDTCYDFSSR 174
           + +        I+DSGT +TRL    YNAL+ +FV   ++  +   G ++ DTC+  S +
Sbjct: 329 YNVPT------IIDSGTVITRLPVAIYNALKKSFVMIMSKKYAQAPGFSILDTCFKGSVK 382

Query: 175 SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVS 234
               VP +   F  G  L L   N L+ ++  GT C A A +S+ +SIIGN QQQ   V+
Sbjct: 383 EMSTVPEIRIIFRGGAGLELKVHNSLVEIE-KGTTCLAIAASSNPISIIGNYQQQTFTVA 441

Query: 235 FNLRNSLIGFTPNKC 249
           +++ NS IGF P  C
Sbjct: 442 YDVANSKIGFAPGGC 456


>gi|194707866|gb|ACF88017.1| unknown [Zea mays]
          Length = 461

 Score =  135 bits (339), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 94/256 (36%), Positives = 133/256 (51%), Gaps = 19/256 (7%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
           G + ++T+ LGS++V +   GC +   G      GL+GLGGG+ S  SQ   +    FSY
Sbjct: 218 GTYSSDTLALGSSAVRSFQFGCSNVESGFNDQTDGLMGLGGGAQSLVSQTAGTLGRAFSY 277

Query: 58  CLVDRDSDS--TSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETA 115
           CL    S S   +      S     V  P+LR+ ++ TFY + L  I VGG  L I  + 
Sbjct: 278 CLPPTPSSSGFLTLGAAGGSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASV 337

Query: 116 FKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRS 175
           F      + G ++DSGT +TRL    Y+AL  AF  G +   P     + DTC+DFS +S
Sbjct: 338 F------SAGTVMDSGTVITRLPPTAYSALSSAFKAGMKQYPPAQPSGILDTCFDFSGQS 391

Query: 176 SVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS--SSLSIIGNVQQQGTRV 233
           SV +P+V+  F  G V+ L A   ++   SN   C AFA  S  SSL IIGNVQQ+   V
Sbjct: 392 SVSIPSVALVFSGGAVVSLDASGIIL---SN---CLAFAGNSDDSSLGIIGNVQQRTFEV 445

Query: 234 SFNLRNSLIGFTPNKC 249
            +++   ++GF    C
Sbjct: 446 LYDVGRGVVGFRAGAC 461


>gi|108707839|gb|ABF95634.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
          Length = 330

 Score =  135 bits (339), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 90/238 (37%), Positives = 128/238 (53%), Gaps = 15/238 (6%)

Query: 13  ASVDNIAIGCGHNNEGLFV-GAAGLLGLGGGSLSFPSQINASTFSYCL--VDRDSDSTST 69
           ASV  +A GCG  N G+F     G+ G G G LS PSQ+    FS+C   V+    ST  
Sbjct: 89  ASVPGVAFGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKVGNFSHCFTAVNGLKQSTVL 148

Query: 70  LEFDSSLPPNAVTA----PLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGG 125
           L+  + L  N   A    PL++N    TFYYL L GI+VG   LP+ E+AF +  +G GG
Sbjct: 149 LDLPADLYKNGRGAVQSTPLIQNSANPTFYYLSLKGITVGSTRLPVPESAFAL-TNGTGG 207

Query: 126 IIVDSGTAVTRLQTETYNALRDAFVRGTR-ALSPTDGVALFDTCYDFSSRSSVEVPTVSF 184
            I+DSGT++T L  + Y  +RD F    +  + P +    + TC+   S++  +VP +  
Sbjct: 208 TIIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGNATGPY-TCFSAPSQAKPDVPKLVL 266

Query: 185 HFPEGKVLPLPAKNYL--IPVDS-NGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRN 239
           HF EG  + LP +NY+  +P D+ N   C A        +IIGN QQQ   V ++L+N
Sbjct: 267 HF-EGATMDLPRENYVFEVPDDAGNSIICLAIN-KGDETTIIGNFQQQNMHVLYDLQN 322


>gi|242045120|ref|XP_002460431.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
 gi|241923808|gb|EER96952.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
          Length = 481

 Score =  134 bits (338), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 93/263 (35%), Positives = 132/263 (50%), Gaps = 21/263 (7%)

Query: 1   GDFVTETVTLGSA-------SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINA- 52
           G+   +T+TLG +        +     GCG ++ GLF  A GL GLG   +S  SQ  A 
Sbjct: 225 GNLARDTLTLGPSSSSSSSDQLQEFVFGCGDDDTGLFGKADGLFGLGRDRVSLASQAAAK 284

Query: 53  --STFSYCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP 110
             + FSYCL    S +   L   S+ PPNA    ++   +  +FYYL L GI V G  + 
Sbjct: 285 YGAGFSYCLPS-SSTAEGYLSLGSAAPPNARFTAMVTRSDTPSFYYLNLVGIKVAGRTVR 343

Query: 111 ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVAL--FDTC 168
           +S   F+       G ++DSGT +TRL +  Y ALR +F    R  S     AL   DTC
Sbjct: 344 VSPAVFRTP-----GTVIDSGTVITRLPSRAYAALRSSFAGLMRRYSYKRAPALSILDTC 398

Query: 169 YDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT--SSSLSIIGNV 226
           YDF+ R+ V++P+V+  F  G  L L     L  V +    C AFA     +S++I+GN+
Sbjct: 399 YDFTGRNKVQIPSVALLFDGGATLNLGFGEVLY-VANKSQACLAFASNGDDTSIAILGNM 457

Query: 227 QQQGTRVSFNLRNSLIGFTPNKC 249
           QQ+   V +++ N  IGF    C
Sbjct: 458 QQKTFAVVYDVANQKIGFGAKGC 480


>gi|414885970|tpg|DAA61984.1| TPA: hypothetical protein ZEAMMB73_915310 [Zea mays]
          Length = 523

 Score =  134 bits (338), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 88/256 (34%), Positives = 130/256 (50%), Gaps = 13/256 (5%)

Query: 1   GDFVTETVTLGSAS--VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINA---STF 55
           G+   +T+TLG +S  +     GCG ++ GLF  A GL GLG   +S  SQ  A   + F
Sbjct: 273 GNLARDTLTLGPSSDQLQGFVFGCGDDDTGLFGRADGLFGLGRDRVSLASQAAARYGAGF 332

Query: 56  SYCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETA 115
           SYCL              ++ PP+A    ++   +  +FYYL L GI V G  + ++   
Sbjct: 333 SYCLPSSWRAEGYLSLGSAAAPPHAQFTAMVTRSDTPSFYYLDLVGIKVAGRTVRVAPAV 392

Query: 116 FKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRS 175
           FK       G ++DSGT +TRL +  Y+ALR +F    R       +++ DTCYDF+ R+
Sbjct: 393 FKAP-----GTVIDSGTVITRLPSRAYSALRSSFAGFMRRYKRAPALSILDTCYDFTGRT 447

Query: 176 SVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT--SSSLSIIGNVQQQGTRV 233
            V++P+V+  F  G  L L     L  V +    C AFA     +S+ I+GN+QQ+   V
Sbjct: 448 KVQIPSVALLFDGGATLNLGFGGVLY-VANRSQACLAFASNGDDTSVGILGNMQQKTFAV 506

Query: 234 SFNLRNSLIGFTPNKC 249
            ++L N  IGF    C
Sbjct: 507 VYDLANQKIGFGAKGC 522


>gi|242066142|ref|XP_002454360.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
 gi|241934191|gb|EES07336.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
          Length = 466

 Score =  134 bits (338), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 94/255 (36%), Positives = 131/255 (51%), Gaps = 16/255 (6%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINA---STFSY 57
           G + ++T+ LGS +V     GC +   G      GL+GLGGG+ S  SQ      + FSY
Sbjct: 222 GTYSSDTLALGSNAVRKFQFGCSNVESGFNDQTDGLMGLGGGAQSLVSQTAGTFGAAFSY 281

Query: 58  CLVDRDSDST-STLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
           CL    S S   TL   +S     V  P+LR+ ++ TFY + +  I VGG  L I  + F
Sbjct: 282 CLPATSSSSGFLTLGAGTS---GFVKTPMLRSSQVPTFYGVRIQAIRVGGRQLSIPTSVF 338

Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSS 176
                 + G I+DSGT +TRL    Y+AL  AF  G +         + DTC+DFS +SS
Sbjct: 339 ------SAGTIMDSGTVLTRLPPTAYSALSSAFKAGMKQYPSAPPSGILDTCFDFSGQSS 392

Query: 177 VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS--SSLSIIGNVQQQGTRVS 234
           V +PTV+  F  G V+ + +   ++   SN   C AFA  S  SSL IIGNVQQ+   V 
Sbjct: 393 VSIPTVALVFSGGAVVDIASDGIMLQT-SNSILCLAFAANSDDSSLGIIGNVQQRTFEVL 451

Query: 235 FNLRNSLIGFTPNKC 249
           +++    +GF    C
Sbjct: 452 YDVGGGAVGFKAGAC 466


>gi|226508202|ref|NP_001141111.1| hypothetical protein precursor [Zea mays]
 gi|194702684|gb|ACF85426.1| unknown [Zea mays]
 gi|414590469|tpg|DAA41040.1| TPA: hypothetical protein ZEAMMB73_571218 [Zea mays]
          Length = 439

 Score =  134 bits (337), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 99/268 (36%), Positives = 140/268 (52%), Gaps = 25/268 (9%)

Query: 5   TETVTLGSAS------VDNIAIGCGHNNEGLFVGAA-GLLGLGGGSLSFPSQINASTFSY 57
           TET T GS++      V  IA GC + + G    +A GL+GLG GSLS  SQ+ A  FSY
Sbjct: 171 TETFTFGSSTPADQVRVPGIAFGCSNASSGFNASSASGLVGLGRGSLSLVSQLGAPKFSY 230

Query: 58  CLVD-RDSDSTSTLEFDSSLPPN---AVTAPLLRNHELDTFYYLGLTGISVGGDLLPISE 113
           CL   +D++STSTL    S   N    V++          +YYL LTGIS+G   LPI  
Sbjct: 231 CLTPYQDTNSTSTLLLGPSASLNDTGVVSSTPFVASPSSIYYYLNLTGISLGTTALPIPP 290

Query: 114 TAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVAL--FDTCYDF 171
            AF +   G GG+I+DSGT +T L    Y  +R A V     L  TDG A    D C++ 
Sbjct: 291 NAFSLKADGTGGLIIDSGTTITMLGNTAYQQVRAA-VLSLVTLPTTDGSAATGLDLCFEL 349

Query: 172 SSRSSV--EVPTVSFHFPEGKVLPLPAKNYLI----PVDSNGTFCFAFAPTSSS----LS 221
            S +S    +P+++ HF +G  + LPA NY++    P   +  +C A    + +    +S
Sbjct: 350 PSSTSAPPSMPSMTLHF-DGADMVLPADNYMMSLSDPDSDSSLWCLAMQNQTDTDGVVVS 408

Query: 222 IIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           I+GN QQQ   + +++    + F P KC
Sbjct: 409 ILGNYQQQNMHILYDVGKETLSFAPAKC 436


>gi|108707838|gb|ABF95633.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
          Length = 391

 Score =  134 bits (337), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 90/253 (35%), Positives = 131/253 (51%), Gaps = 22/253 (8%)

Query: 13  ASVDNIAIGCGHNNEGLFV-GAAGLLGLGGGSLSFPSQINASTFSYCLVDRDSDSTSTLE 71
           ASV  +A GCG  N G+F     G+ G G G LS PSQ+    FS+C         ST+ 
Sbjct: 142 ASVPGVAFGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKVGNFSHCFTTITGAIPSTVL 201

Query: 72  FDSSLPPN--------AVTAPLL---RNHELDTFYYLGLTGISVGGDLLPISETAFKIDE 120
            D  LP +          T PL+   +N    T YYL L GI+VG   LP+ E+AF +  
Sbjct: 202 LD--LPADLFSNGQGAVQTTPLIQYAKNEANPTLYYLSLKGITVGSTRLPVPESAFAL-T 258

Query: 121 SGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTR-ALSPTDGVALFDTCYDFSSRSSVEV 179
           +G GG I+DSGT++T L  + Y  +RD F    +  + P +    + TC+   S++  +V
Sbjct: 259 NGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGNATGHY-TCFSAPSQAKPDV 317

Query: 180 PTVSFHFPEGKVLPLPAKNYL--IPVDS-NGTFCFAFAPTSSSLSIIGNVQQQGTRVSFN 236
           P +  HF EG  + LP +NY+  +P D+ N   C A        +IIGN QQQ   V ++
Sbjct: 318 PKLVLHF-EGATMDLPRENYVFEVPDDAGNSIICLAIN-KGDETTIIGNFQQQNMHVLYD 375

Query: 237 LRNSLIGFTPNKC 249
           L+N+++ F   +C
Sbjct: 376 LQNNMLSFVAAQC 388


>gi|449436215|ref|XP_004135889.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
          Length = 496

 Score =  134 bits (337), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 89/264 (33%), Positives = 132/264 (50%), Gaps = 22/264 (8%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINA---STFSY 57
           G+   E +TLG   +DN   GCG NN+GLF GA+GL+GL    LS  SQ ++   S FSY
Sbjct: 238 GELGFEKLTLGKTEIDNFIFGCGRNNKGLFGGASGLMGLARSELSLVSQTSSLFGSVFSY 297

Query: 58  CLVDRDSDSTSTLEFD-------SSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP 110
           CL      S+ +L           ++ P + T  +++N ++  FY+L LTGIS+GG  L 
Sbjct: 298 CLPTTGVGSSGSLTLGGADFSNFKNISPISYTR-MIQNPQMSNFYFLNLTGISIGGVNLN 356

Query: 111 ISETAFKIDESGNGGI--IVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTC 168
           +         S N G+  ++DSGT +TRL    Y A +  F +       T G ++ +TC
Sbjct: 357 VPRL------SSNEGVLSLLDSGTVITRLSPSIYKAFKAEFEKQFSGYRTTPGFSILNTC 410

Query: 169 YDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGT-FCFAFAPT--SSSLSIIGN 225
           ++ +    V +PTV F F     + +  +     V S+ +  C AFA         IIGN
Sbjct: 411 FNLTGYEEVNIPTVKFIFEGNAEMIVDVEGVFYFVKSDASQICLAFASLGYEDQTMIIGN 470

Query: 226 VQQQGTRVSFNLRNSLIGFTPNKC 249
            QQ+  RV +N + S +GF    C
Sbjct: 471 YQQKNQRVIYNSKESKVGFAGEPC 494


>gi|449509162|ref|XP_004163513.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
          Length = 417

 Score =  134 bits (336), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 89/264 (33%), Positives = 132/264 (50%), Gaps = 22/264 (8%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINA---STFSY 57
           G+   E +TLG   +DN   GCG NN+GLF GA+GL+GL    LS  SQ ++   S FSY
Sbjct: 159 GELGFEKLTLGKTEIDNFIFGCGRNNKGLFGGASGLMGLARSELSLVSQTSSLFGSVFSY 218

Query: 58  CLVDRDSDSTSTLEFD-------SSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP 110
           CL      S+ +L           ++ P + T  +++N ++  FY+L LTGIS+GG  L 
Sbjct: 219 CLPTTGVGSSGSLTLGGADFSNFKNISPISYTR-MIQNPQMSNFYFLNLTGISIGGVNLN 277

Query: 111 ISETAFKIDESGNGGI--IVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTC 168
           +         S N G+  ++DSGT +TRL    Y A +  F +       T G ++ +TC
Sbjct: 278 VPRL------SSNEGVLSLLDSGTVITRLSPSIYKAFKAEFEKQFSGYRTTPGFSILNTC 331

Query: 169 YDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGT-FCFAFAPT--SSSLSIIGN 225
           ++ +    V +PTV F F     + +  +     V S+ +  C AFA         IIGN
Sbjct: 332 FNLTGYEEVNIPTVKFIFEGNAEMIVDVEGVFYFVKSDASQICLAFASLGYEDQTMIIGN 391

Query: 226 VQQQGTRVSFNLRNSLIGFTPNKC 249
            QQ+  RV +N + S +GF    C
Sbjct: 392 YQQKNQRVIYNSKESKVGFAGEPC 415


>gi|125586059|gb|EAZ26723.1| hypothetical protein OsJ_10631 [Oryza sativa Japonica Group]
          Length = 339

 Score =  134 bits (336), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 90/253 (35%), Positives = 131/253 (51%), Gaps = 22/253 (8%)

Query: 13  ASVDNIAIGCGHNNEGLFV-GAAGLLGLGGGSLSFPSQINASTFSYCLVDRDSDSTSTLE 71
           ASV  +A GCG  N G+F     G+ G G G LS PSQ+    FS+C         ST+ 
Sbjct: 90  ASVPGVAFGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKVGNFSHCFTTITGAIPSTVL 149

Query: 72  FDSSLPPN--------AVTAPLL---RNHELDTFYYLGLTGISVGGDLLPISETAFKIDE 120
            D  LP +          T PL+   +N    T YYL L GI+VG   LP+ E+AF +  
Sbjct: 150 LD--LPADLFSNGQGAVQTTPLIQYAKNEANPTLYYLSLKGITVGSTRLPVPESAFAL-T 206

Query: 121 SGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTR-ALSPTDGVALFDTCYDFSSRSSVEV 179
           +G GG I+DSGT++T L  + Y  +RD F    +  + P +    + TC+   S++  +V
Sbjct: 207 NGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGNATGHY-TCFSAPSQAKPDV 265

Query: 180 PTVSFHFPEGKVLPLPAKNYL--IPVDS-NGTFCFAFAPTSSSLSIIGNVQQQGTRVSFN 236
           P +  HF EG  + LP +NY+  +P D+ N   C A        +IIGN QQQ   V ++
Sbjct: 266 PKLVLHF-EGATMDLPRENYVFEVPDDAGNSIICLAIN-KGDETTIIGNFQQQNMHVLYD 323

Query: 237 LRNSLIGFTPNKC 249
           L+N+++ F   +C
Sbjct: 324 LQNNMLSFVAAQC 336


>gi|357137788|ref|XP_003570481.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 455

 Score =  133 bits (335), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 90/253 (35%), Positives = 131/253 (51%), Gaps = 13/253 (5%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
           G    +TV+ G+ SV N   GCG +NEGLF  +AGL+GL    LS   Q+  +   +FSY
Sbjct: 211 GYLSKDTVSFGANSVPNFYYGCGQDNEGLFGRSAGLMGLARNKLSLLYQLAPTLGYSFSY 270

Query: 58  CLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFK 117
           CL    + S+  L   S  P      P++ N   D+ Y++ L+G++V G  L +S + + 
Sbjct: 271 CL--PSTSSSGYLSIGSYNPGGYSYTPMVSNTLDDSLYFISLSGMTVAGKPLAVSSSEYT 328

Query: 118 IDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTR-ALSPTDGVALFDTCYDFSSRSS 176
              +     I+DSGT +TRL T  Y AL  A     + +       ++ DTC++  +   
Sbjct: 329 SLPT-----IIDSGTVITRLPTSVYTALSKAVAAAMKGSTKRAAAYSILDTCFEGQASKL 383

Query: 177 VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFN 236
             VP VS  F  G  L L A N L+ VD   T C AFAP  S+ +IIGN QQQ   V ++
Sbjct: 384 RAVPAVSMAFSGGATLKLSAGNLLVDVD-GATTCLAFAPARSA-AIIGNTQQQTFSVVYD 441

Query: 237 LRNSLIGFTPNKC 249
           ++++ IGF    C
Sbjct: 442 VKSNRIGFAAAGC 454


>gi|297817208|ref|XP_002876487.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297322325|gb|EFH52746.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 520

 Score =  133 bits (335), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 101/271 (37%), Positives = 145/271 (53%), Gaps = 23/271 (8%)

Query: 1   GDFVTETVTL------GSA---SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN 51
           GDF  ET T+      GS+   +V+N+  GCGH N GLF GAAGLLGLG G LSF SQ+ 
Sbjct: 248 GDFAVETFTVNLTTSGGSSELYNVENMMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQ 307

Query: 52  A---STFSYCLVDRDSDS--TSTLEFDS-----SLPPNAVTAPLLRNHEL-DTFYYLGLT 100
           +    +FSYCLVDR+SD+  +S L F       S P    T+ + R   L DTFYY+ + 
Sbjct: 308 SLYGHSFSYCLVDRNSDTNVSSKLIFGEDKDLLSHPNLNFTSFVARKENLVDTFYYVQIK 367

Query: 101 GISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPT- 159
            I V G++L I E  + I   G GG I+DSGT ++      Y  +++      +   P  
Sbjct: 368 SIIVAGEVLNIPEETWNISSDGAGGTIIDSGTTLSYFAEPAYEFIKNKIAEKAKGKYPVY 427

Query: 160 DGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT-SS 218
               + D C++ S   S+++P +   F +G V   P +N  I ++ +   C A   T  S
Sbjct: 428 RDFPILDPCFNVSGIDSIQLPELGIAFADGAVWNFPTENSFIWLNED-LVCLAILGTPKS 486

Query: 219 SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           + SIIGN QQQ   + ++ + S +G+ P KC
Sbjct: 487 AFSIIGNYQQQNFHILYDTKRSRLGYAPTKC 517


>gi|125543638|gb|EAY89777.1| hypothetical protein OsI_11319 [Oryza sativa Indica Group]
          Length = 390

 Score =  133 bits (334), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 88/252 (34%), Positives = 130/252 (51%), Gaps = 22/252 (8%)

Query: 14  SVDNIAIGCGHNNEGLF-VGAAGLLGLGGGSLSFPSQINASTFSYCLVDRDSDSTSTLEF 72
           S+  +  GCG NN G+F     G+ G G G LS PSQ+    FS+C         ST+  
Sbjct: 142 SLPGVTFGCGLNNTGVFNSNETGIAGFGRGPLSLPSQLKVGNFSHCFTTITGAIPSTVLL 201

Query: 73  DSSLPPN--------AVTAPLL---RNHELDTFYYLGLTGISVGGDLLPISETAFKIDES 121
           D  LP +          T PL+   +N    T YYL L GI+VG   LP+ E+AF +  +
Sbjct: 202 D--LPADLFSNGQGAVQTTPLIQYAKNEANPTLYYLSLKGITVGSTRLPVPESAFAL-TN 258

Query: 122 GNGGIIVDSGTAVTRLQTETYNALRDAFVRGTR-ALSPTDGVALFDTCYDFSSRSSVEVP 180
           G GG I+DSGT++T L  + Y  +RD F    +  + P +    + TC+   S++  +VP
Sbjct: 259 GTGGTIIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGNATGHY-TCFSAPSQAKPDVP 317

Query: 181 TVSFHFPEGKVLPLPAKNYL--IPVDS-NGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNL 237
            +  HF EG  + LP +NY+  +P D+ N   C A        +IIGN QQQ   V ++L
Sbjct: 318 KLVLHF-EGATMDLPRENYVFEVPDDAGNSIICLAIN-KGDETTIIGNFQQQNMHVLYDL 375

Query: 238 RNSLIGFTPNKC 249
           +N+++ F   +C
Sbjct: 376 QNNMLSFVAAQC 387


>gi|357131735|ref|XP_003567490.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 458

 Score =  133 bits (334), Expect = 8e-29,   Method: Compositional matrix adjust.
 Identities = 92/247 (37%), Positives = 127/247 (51%), Gaps = 16/247 (6%)

Query: 16  DNIAIGCGH--NNEGLFVGAAGLLGLGGGSLSFPSQINA---STFSYCLVD-RDSDSTST 69
           D+   GC       G  V   GL+G G G LSF SQ  A   S FSYCL   + S+ + T
Sbjct: 212 DHYTFGCLRVVTGSGGSVPPQGLVGFGRGPLSFLSQTKATYGSIFSYCLPSYKSSNFSGT 271

Query: 70  LEFDSSLPPNAV-TAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDES-GNGGII 127
           L    +  P  + T PLL N    + YY+ + G+ V G  +PI  +A  +D + G GG I
Sbjct: 272 LRLGPAGQPRRIKTTPLLSNPHRPSLYYVAMVGVRVNGKAVPIPASALALDAATGRGGTI 331

Query: 128 VDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFP 187
           VD+GT  TRL    Y ALR+AF RG  A +    +  FDTCY  +   S  VP V+F F 
Sbjct: 332 VDAGTMFTRLSPPAYAALRNAFRRGVSAPA-APALGGFDTCYYVNGTKS--VPAVAFVFA 388

Query: 188 EGKVLPLPAKNYLIPVDSNGTFCFAFAP-----TSSSLSIIGNVQQQGTRVSFNLRNSLI 242
            G  + LP +N +I   S G  C A A       ++ L+++ ++QQQ  RV F++ N  +
Sbjct: 389 GGARVTLPEENVVISSTSGGVACLAMAAGPSDGVNAGLNVLASMQQQNHRVVFDVGNGRV 448

Query: 243 GFTPNKC 249
           GF+   C
Sbjct: 449 GFSRELC 455


>gi|413944392|gb|AFW77041.1| hypothetical protein ZEAMMB73_800604 [Zea mays]
          Length = 476

 Score =  132 bits (333), Expect = 9e-29,   Method: Compositional matrix adjust.
 Identities = 92/255 (36%), Positives = 129/255 (50%), Gaps = 17/255 (6%)

Query: 6   ETVTLGSA-SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSYCLVD 61
           ET++L S  ++   A GCG  N G F    GL+GLG G LS  SQ  AS   TFSYCL  
Sbjct: 228 ETLSLTSTRALPGFAFGCGQTNLGDFGDVDGLIGLGRGQLSLSSQAAASFGGTFSYCL-P 286

Query: 62  RDSDSTSTLEFDSSLPP---NAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKI 118
            D+ +   L    + P    +     +++  +  +FY++ L  I +GG +LP+  T F  
Sbjct: 287 SDNTTHGYLTIGPTTPASNDDVQYTAMVQKQDYPSFYFVELVSIDIGGYILPVPPTLFTD 346

Query: 119 DESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVE 178
           D     G  +DSGT +T L  E Y ALRD F        P      FDTCYDF+ +S++ 
Sbjct: 347 D-----GTFLDSGTILTYLPPEAYTALRDRFKFTMTQYKPAPAYDPFDTCYDFTGQSAIF 401

Query: 179 VPTVSFHFPEGKVLPLPAKNYLI-PVDSN---GTFCFAFAPTSSSLSIIGNVQQQGTRVS 234
           +P VSF F +G V  L     LI P D+    G   F   P++   +I+GN+QQ+ T V 
Sbjct: 402 IPAVSFKFSDGSVFDLSFFGILIFPDDTAPAIGCLGFVARPSAMPFTIVGNMQQRNTEVI 461

Query: 235 FNLRNSLIGFTPNKC 249
           +++    IGF    C
Sbjct: 462 YDVAAEKIGFASASC 476


>gi|414886964|tpg|DAA62978.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
          Length = 452

 Score =  132 bits (333), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 98/264 (37%), Positives = 138/264 (52%), Gaps = 22/264 (8%)

Query: 5   TETVTLGS-----ASVDNIAIGCGHNNEGLFVGAA-GLLGLGGGSLSFPSQINASTFSYC 58
           +ET T GS     A V  IA GC   + G    +A GL+GLG G LS  SQ+    FSYC
Sbjct: 189 SETFTFGSTPAGHARVPGIAFGCSTASSGFNASSASGLVGLGRGRLSLVSQLGVPKFSYC 248

Query: 59  LVD-RDSDSTSTLEFDSSLPPNAV----TAPLLRNHE---LDTFYYLGLTGISVGGDLLP 110
           L   +D++STSTL    S   N      + P + +     ++TFYYL LTGIS+G   L 
Sbjct: 249 LTPYQDTNSTSTLLLGPSASLNGTAGVSSTPFVASPSTAPMNTFYYLNLTGISLGTTALS 308

Query: 111 ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVAL--FDTC 168
           I   AF ++  G GG+I+DSGT +T L    Y  +R A V     L  TDG A    D C
Sbjct: 309 IPPDAFSLNADGTGGLIIDSGTTITLLGNTAYQQVRAAVVS-LVTLPTTDGSADTGLDLC 367

Query: 169 YDFSSRSSV--EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFA-PTSSSLSIIGN 225
           +   S +S    +P+++ HF  G  + LPA +Y++  D +G +C A    T   ++I+GN
Sbjct: 368 FMLPSSTSAPPAMPSMTLHF-NGADMVLPADSYMM-SDDSGLWCLAMQNQTDGEVNILGN 425

Query: 226 VQQQGTRVSFNLRNSLIGFTPNKC 249
            QQQ   + +++    + F P KC
Sbjct: 426 YQQQNMHILYDIGQETLSFAPAKC 449


>gi|242066140|ref|XP_002454359.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
 gi|241934190|gb|EES07335.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
          Length = 460

 Score =  132 bits (333), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 95/258 (36%), Positives = 132/258 (51%), Gaps = 27/258 (10%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
           G + ++T+ LGS ++ N   GC H   G      GL+GLGGG+ S  SQ   +    FSY
Sbjct: 221 GTYSSDTLALGSNTISNFQFGCSHVESGFNDLTDGLMGLGGGAPSLASQTAGTFGTAFSY 280

Query: 58  CLVDRDSDST-STLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
           CL    S S   TL   +S     V  P+LR+  + TFY + L  I VGG  L I  + F
Sbjct: 281 CLPPTPSSSGFLTLGAGTS---GFVKTPMLRSSPVPTFYGVRLEAIRVGGTQLSIPTSVF 337

Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSS 176
                 + G+++DSGT +TRL    Y+AL  AF  G +   P    ++ DTC+DFS +SS
Sbjct: 338 ------SAGMVMDSGTIITRLPRTAYSALSSAFKAGMKQYRPAPPRSIMDTCFDFSGQSS 391

Query: 177 VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTF---CFAFAPTS--SSLSIIGNVQQQGT 231
           V +P+V+  F  G V+ L         D+NG     C AFA  S  SS  I+GNVQQ+  
Sbjct: 392 VRLPSVALVFSGGAVVNL---------DANGIILGNCLAFAANSDDSSPGIVGNVQQRTF 442

Query: 232 RVSFNLRNSLIGFTPNKC 249
            V +++    +GF    C
Sbjct: 443 EVLYDVGGGAVGFKAGAC 460


>gi|356537173|ref|XP_003537104.1| PREDICTED: uncharacterized protein LOC100817302 [Glycine max]
          Length = 328

 Score =  132 bits (333), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 66/141 (46%), Positives = 93/141 (65%)

Query: 109 LPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTC 168
           L ISE  +++ + G+ G ++D+G  VTRL T  Y A RDAFV  T  L    GV++F+TC
Sbjct: 188 LNISEDLYRVTDLGDEGAVMDTGITVTRLPTVAYGAFRDAFVAQTTNLPRAPGVSIFNTC 247

Query: 169 YDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQ 228
           YD +   +V VPTV F+F  G++L +  +N+LIP D  GTF FAFA + S+LSIIGN+QQ
Sbjct: 248 YDLNGFVTVRVPTVLFYFSGGQILTILTQNFLIPADDVGTFYFAFAASPSALSIIGNIQQ 307

Query: 229 QGTRVSFNLRNSLIGFTPNKC 249
           +G ++S +  N  +GF  N C
Sbjct: 308 EGIQISVDGANGFLGFGRNVC 328


>gi|226493786|ref|NP_001142400.1| uncharacterized protein LOC100274575 [Zea mays]
 gi|194708650|gb|ACF88409.1| unknown [Zea mays]
          Length = 392

 Score =  132 bits (333), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 98/264 (37%), Positives = 138/264 (52%), Gaps = 22/264 (8%)

Query: 5   TETVTLGS-----ASVDNIAIGCGHNNEGLFVGAA-GLLGLGGGSLSFPSQINASTFSYC 58
           +ET T GS     A V  IA GC   + G    +A GL+GLG G LS  SQ+    FSYC
Sbjct: 129 SETFTFGSTPAGHARVPGIAFGCSTASSGFNASSASGLVGLGRGRLSLVSQLGVPKFSYC 188

Query: 59  LVD-RDSDSTSTLEFDSSLPPNAV----TAPLLRNHE---LDTFYYLGLTGISVGGDLLP 110
           L   +D++STSTL    S   N      + P + +     ++TFYYL LTGIS+G   L 
Sbjct: 189 LTPYQDTNSTSTLLLGPSASLNGTAGVSSTPFVASPSTAPMNTFYYLNLTGISLGTTALS 248

Query: 111 ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVAL--FDTC 168
           I   AF ++  G GG+I+DSGT +T L    Y  +R A V     L  TDG A    D C
Sbjct: 249 IPPDAFSLNADGTGGLIIDSGTTITLLGNTAYQQVRAAVVS-LVTLPTTDGSADTGLDLC 307

Query: 169 YDFSSRSSV--EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFA-PTSSSLSIIGN 225
           +   S +S    +P+++ HF  G  + LPA +Y++  D +G +C A    T   ++I+GN
Sbjct: 308 FMLPSSTSAPPAMPSMTLHF-NGADMVLPADSYMM-SDDSGLWCLAMQNQTDGEVNILGN 365

Query: 226 VQQQGTRVSFNLRNSLIGFTPNKC 249
            QQQ   + +++    + F P KC
Sbjct: 366 YQQQNMHILYDIGQETLSFAPAKC 389


>gi|255568540|ref|XP_002525244.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223535541|gb|EEF37210.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 460

 Score =  132 bits (333), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 85/242 (35%), Positives = 125/242 (51%), Gaps = 12/242 (4%)

Query: 12  SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSYCLVDRDSDSTS 68
           S ++ +   GCG +NEGLF  AAG++GL    LS  +Q++      FSYCL    S    
Sbjct: 226 SQTLPSFTYGCGQDNEGLFGKAAGIVGLARDKLSMLAQLSPKYGYAFSYCLPTSTSSGGG 285

Query: 69  TLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIV 128
            L      P +    P++RN +  + Y+L L  I+V G  + ++   +++        I+
Sbjct: 286 FLSIGKISPSSYKFTPMIRNSQNPSLYFLRLAAITVAGRPVGVAAAGYQVPT------II 339

Query: 129 DSGTAVTRLQTETYNALRDAFVR-GTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFP 187
           DSGT VTRL    Y ALR+AFV+  +R        ++ DTC+  S +S    P +   F 
Sbjct: 340 DSGTVVTRLPISIYAALREAFVKIMSRRYEQAPAYSILDTCFKGSLKSMSGAPEIRMIFQ 399

Query: 188 EGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPN 247
            G  L L A N LI  D  G  C AFA +S+ ++IIGN QQQ   +++++  S IGF P 
Sbjct: 400 GGADLSLRAPNILIEAD-KGIACLAFA-SSNQIAIIGNHQQQTYNIAYDVSASKIGFAPG 457

Query: 248 KC 249
            C
Sbjct: 458 GC 459


>gi|242044888|ref|XP_002460315.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
 gi|241923692|gb|EER96836.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
          Length = 456

 Score =  132 bits (332), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 95/270 (35%), Positives = 130/270 (48%), Gaps = 24/270 (8%)

Query: 1   GDFVTETVTLGSASVDNI-----AIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTF 55
           G + TE  T  S+  D +       GCG  N G     +G++G G   LS  SQ++   F
Sbjct: 190 GVYATERFTFTSSGGDRLMTVPLGFGCGSMNVGSLNNGSGIVGFGRNPLSLVSQLSIRRF 249

Query: 56  SYCLVDRDSDSTSTLEF---------DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGG 106
           SYCL    S   STL F         D++ P    T PLL++ +  TFYY+ L G++VG 
Sbjct: 250 SYCLTSYGSGRKSTLLFGSLSGGVYGDATGP--VQTTPLLQSLQNPTFYYVHLAGLTVGA 307

Query: 107 DLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTR-----ALSPTDG 161
             L I E+AF +   G+GG+IVDSGTA+T L       +  AF +  R       +P DG
Sbjct: 308 RRLRIPESAFALRPDGSGGVIVDSGTALTLLPGAVLAEVVRAFRQQLRLPFANGGNPEDG 367

Query: 162 VALF--DTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSS 219
           V           SS S V VP + FHF +   L LP +NY++     G  C   A +   
Sbjct: 368 VCFLVPAAWRRSSSTSQVPVPRMVFHFQDAD-LDLPRRNYVLDDHRKGRLCLLLADSGDD 426

Query: 220 LSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
            S IGN+ QQ  RV ++L    + F P +C
Sbjct: 427 GSTIGNLVQQDMRVLYDLEAETLSFAPAQC 456


>gi|79315693|ref|NP_001030891.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332646353|gb|AEE79874.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 499

 Score =  132 bits (332), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 101/271 (37%), Positives = 145/271 (53%), Gaps = 23/271 (8%)

Query: 1   GDFVTETVTL------GSA---SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN 51
           GDF  ET T+      GS+   +V+N+  GCGH N GLF GAAGLLGLG G LSF SQ+ 
Sbjct: 227 GDFAVETFTVNLTTNGGSSELYNVENMMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQ 286

Query: 52  A---STFSYCLVDRDSDS--TSTLEF--DSSL--PPNAVTAPLLRNHE--LDTFYYLGLT 100
           +    +FSYCLVDR+SD+  +S L F  D  L   PN      +   E  +DTFYY+ + 
Sbjct: 287 SLYGHSFSYCLVDRNSDTNVSSKLIFGEDKDLLSHPNLNFTSFVAGKENLVDTFYYVQIK 346

Query: 101 GISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPT- 159
            I V G++L I E  + I   G GG I+DSGT ++      Y  +++      +   P  
Sbjct: 347 SILVAGEVLNIPEETWNISSDGAGGTIIDSGTTLSYFAEPAYEFIKNKIAEKAKGKYPVY 406

Query: 160 DGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT-SS 218
               + D C++ S   +V++P +   F +G V   P +N  I ++ +   C A   T  S
Sbjct: 407 RDFPILDPCFNVSGIHNVQLPELGIAFADGAVWNFPTENSFIWLNED-LVCLAMLGTPKS 465

Query: 219 SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           + SIIGN QQQ   + ++ + S +G+ P KC
Sbjct: 466 AFSIIGNYQQQNFHILYDTKRSRLGYAPTKC 496


>gi|242062704|ref|XP_002452641.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
 gi|241932472|gb|EES05617.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
          Length = 466

 Score =  132 bits (332), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 89/255 (34%), Positives = 132/255 (51%), Gaps = 16/255 (6%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAA-GLLGLGGGSLSFPSQINAS---TFS 56
           G + ++T+TLGS ++     GC  +  G F     GL+GLGG + S  SQ   +    FS
Sbjct: 222 GTYSSDTLTLGSNAIKGFQFGCSQSESGGFSDQTDGLMGLGGDAQSLVSQTAGTFGKAFS 281

Query: 57  YCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
           YCL      S+  L   ++     V  P+LR+ ++ T+Y + L  I VGG  L I  + F
Sbjct: 282 YCLPPTPG-SSGFLTLGAASRSGFVKTPMLRSTQIPTYYGVLLEAIRVGGQQLNIPTSVF 340

Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSS 176
                 + G ++DSGT +TRL    Y+AL  AF  G +   P     + DTC+DFS +SS
Sbjct: 341 ------SAGSVMDSGTVITRLPPTAYSALSSAFKAGMKKYPPAQPSGILDTCFDFSGQSS 394

Query: 177 VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS--SSLSIIGNVQQQGTRVS 234
           V +P+V+  F  G V+ L     ++ +D+   +C AFA  S  SSL  IGNVQQ+   V 
Sbjct: 395 VSIPSVALVFSGGAVVNLDFNGIMLELDN---WCLAFAANSDDSSLGFIGNVQQRTFEVL 451

Query: 235 FNLRNSLIGFTPNKC 249
           +++    +GF    C
Sbjct: 452 YDVGGGAVGFRAGAC 466


>gi|388508518|gb|AFK42325.1| unknown [Lotus japonicus]
          Length = 204

 Score =  132 bits (331), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 74/203 (36%), Positives = 106/203 (52%), Gaps = 5/203 (2%)

Query: 50  INASTFSYCLVDRDSDSTSTLEFDS--SLPPNAVTAPLLRNHELDTFYYLGLTGISVGGD 107
           +  + FSYCL   D    S L   S      +A++ PLL N    +FYYL L GI VGG 
Sbjct: 1   MKEAKFSYCLTSMDDSKASVLLLGSLAKATKDAISTPLLTNPSQPSFYYLSLEGIPVGGT 60

Query: 108 LLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDT 167
            L I ++ F + + G+GG+I+DSGT +T L+   ++ L+  F+  +            D 
Sbjct: 61  QLSIEQSIFDVSDDGSGGVIIDSGTTITYLEKSVFDTLKKEFISQSNLQLDKSSSTGLDV 120

Query: 168 CYDFSSRSS-VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNV 226
           C+   S ++ VEVP + FHF  G  L LPA++Y+I     G  C A    S+ +SI GNV
Sbjct: 121 CFSLPSETTQVEVPKLVFHFKGGD-LELPAESYMIADSKLGVACLAMG-ASNGMSIFGNV 178

Query: 227 QQQGTRVSFNLRNSLIGFTPNKC 249
           QQQ   V+ +L    I F P +C
Sbjct: 179 QQQNILVNHDLEKETISFVPTQC 201


>gi|15231625|ref|NP_191467.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|15983376|gb|AAL11556.1|AF424562_1 AT3g59080/F17J16_130 [Arabidopsis thaliana]
 gi|7529751|emb|CAB86936.1| putative protein [Arabidopsis thaliana]
 gi|20466704|gb|AAM20669.1| putative protein [Arabidopsis thaliana]
 gi|23198236|gb|AAN15645.1| putative protein [Arabidopsis thaliana]
 gi|332646352|gb|AEE79873.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 535

 Score =  132 bits (331), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 101/271 (37%), Positives = 145/271 (53%), Gaps = 23/271 (8%)

Query: 1   GDFVTETVTL------GSA---SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN 51
           GDF  ET T+      GS+   +V+N+  GCGH N GLF GAAGLLGLG G LSF SQ+ 
Sbjct: 263 GDFAVETFTVNLTTNGGSSELYNVENMMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQ 322

Query: 52  A---STFSYCLVDRDSDS--TSTLEF--DSSL--PPNAVTAPLLRNHE--LDTFYYLGLT 100
           +    +FSYCLVDR+SD+  +S L F  D  L   PN      +   E  +DTFYY+ + 
Sbjct: 323 SLYGHSFSYCLVDRNSDTNVSSKLIFGEDKDLLSHPNLNFTSFVAGKENLVDTFYYVQIK 382

Query: 101 GISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPT- 159
            I V G++L I E  + I   G GG I+DSGT ++      Y  +++      +   P  
Sbjct: 383 SILVAGEVLNIPEETWNISSDGAGGTIIDSGTTLSYFAEPAYEFIKNKIAEKAKGKYPVY 442

Query: 160 DGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT-SS 218
               + D C++ S   +V++P +   F +G V   P +N  I ++ +   C A   T  S
Sbjct: 443 RDFPILDPCFNVSGIHNVQLPELGIAFADGAVWNFPTENSFIWLNED-LVCLAMLGTPKS 501

Query: 219 SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           + SIIGN QQQ   + ++ + S +G+ P KC
Sbjct: 502 AFSIIGNYQQQNFHILYDTKRSRLGYAPTKC 532


>gi|302757235|ref|XP_002962041.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
 gi|300170700|gb|EFJ37301.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
          Length = 367

 Score =  132 bits (331), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 101/276 (36%), Positives = 138/276 (50%), Gaps = 28/276 (10%)

Query: 1   GDFVTETVTL-----GSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS-- 53
           GDF  ET+TL      S +  N   GCG  N G F GAAG++GLG G +S  +Q+ ++  
Sbjct: 93  GDFALETLTLRSSGGSSKAFPNFQFGCGRLNSGSFGGAAGIVGLGQGKISLSTQLGSAIN 152

Query: 54  -TFSYCLVDRDSDS--TSTLEFDSSLP--PNAVTAPLLRNHELDTFYYLGLTGISVGGDL 108
             FSYCLVD D DS  TS L F SS      A++ P++ N    T+Y++GL GISVGG  
Sbjct: 153 NKFSYCLVDFDDDSSKTSPLIFGSSASTGSGAISTPIIPNSGRSTYYFVGLEGISVGGKQ 212

Query: 109 LPISETAF-------------KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRA 155
           L ++  A              +  E  +GG I DSGT +T L    Y+ ++ AF      
Sbjct: 213 LSLATRAIDFLSVRSKKKLRVRALEVNSGGTIFDSGTTLTLLDDAVYSKVKSAFASSVSL 272

Query: 156 LSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTF-CFAF- 213
            +     + FD CYD S   + + P ++  F   K  P P KNY + VD+  T  C A  
Sbjct: 273 PTVDASSSGFDLCYDVSKSKNFKFPALTLAFKGTKFSP-PQKNYFVIVDTAETVACLAMG 331

Query: 214 APTSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
              S  L IIGN+ QQ   V ++   S I  +P +C
Sbjct: 332 GSGSLGLGIIGNLMQQNYHVVYDRGTSTISMSPAQC 367


>gi|226508080|ref|NP_001150678.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
 gi|195641018|gb|ACG39977.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
          Length = 450

 Score =  132 bits (331), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 97/264 (36%), Positives = 138/264 (52%), Gaps = 22/264 (8%)

Query: 5   TETVTLGS-----ASVDNIAIGCGHNNEGLFVGAA-GLLGLGGGSLSFPSQINASTFSYC 58
           +ET T GS     + V  IA GC   + G    +A GL+GLG G LS  SQ+    FSYC
Sbjct: 187 SETFTFGSTPAGQSRVPGIAFGCSTASSGFNASSASGLVGLGRGRLSLVSQLGVPKFSYC 246

Query: 59  LVD-RDSDSTSTLEFDSSLPPNAV----TAPLLRNHE---LDTFYYLGLTGISVGGDLLP 110
           L   +D++STSTL    S   N      + P + +     ++TFYYL LTGIS+G   L 
Sbjct: 247 LTPYQDTNSTSTLLLGPSASLNGTAGVSSTPFVASPSTAPMNTFYYLNLTGISLGTTALS 306

Query: 111 ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVAL--FDTC 168
           I   AF ++  G GG+I+DSGT +T L    Y  +R A V     L  TDG A    D C
Sbjct: 307 IPPDAFLLNADGTGGLIIDSGTTITLLGNTAYQQVRAAVVS-LVTLPTTDGSAATGLDLC 365

Query: 169 YDFSSRSSV--EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFA-PTSSSLSIIGN 225
           +   S +S    +P+++ HF  G  + LPA +Y++  D +G +C A    T   ++I+GN
Sbjct: 366 FMLPSSTSAPPAMPSMTLHF-NGADMVLPADSYMM-SDDSGLWCLAMQNQTDGEVNILGN 423

Query: 226 VQQQGTRVSFNLRNSLIGFTPNKC 249
            QQQ   + +++    + F P KC
Sbjct: 424 YQQQNMHILYDIGQETLSFAPAKC 447


>gi|356532386|ref|XP_003534754.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 463

 Score =  131 bits (330), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 85/254 (33%), Positives = 139/254 (54%), Gaps = 18/254 (7%)

Query: 7   TVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSYCL-VDR 62
           T+T   A       GCG +N+GLF  ++G++GL    +S   Q++      FSYCL    
Sbjct: 216 TLTPSEAPSSGFVYGCGQDNQGLFGRSSGIIGLANDKISMLGQLSKKYGNAFSYCLPSSF 275

Query: 63  DSDSTSTLEFDSSLPPNAVTA------PLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
            + ++S+L    S+  +++T+      PL++N ++ + Y+L LT I+V G  L +S +++
Sbjct: 276 SAPNSSSLSGFLSIGASSLTSSPYKFTPLVKNQKIPSLYFLDLTTITVAGKPLGVSASSY 335

Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR-GTRALSPTDGVALFDTCYDFSSRS 175
            +        I+DSGT +TRL    YNAL+ +FV   ++  +   G ++ DTC+  S + 
Sbjct: 336 NVPT------IIDSGTVITRLPVAVYNALKKSFVLIMSKKYAQAPGFSILDTCFKGSVKE 389

Query: 176 SVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSF 235
              VP +   F  G  L L A N L+ ++  GT C A A +S+ +SIIGN QQQ  +V++
Sbjct: 390 MSTVPEIQIIFRGGAGLELKAHNSLVEIE-KGTTCLAIAASSNPISIIGNYQQQTFKVAY 448

Query: 236 NLRNSLIGFTPNKC 249
           ++ N  IGF P  C
Sbjct: 449 DVANFKIGFAPGGC 462


>gi|302780575|ref|XP_002972062.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
 gi|300160361|gb|EFJ26979.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
          Length = 429

 Score =  131 bits (330), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 98/260 (37%), Positives = 136/260 (52%), Gaps = 20/260 (7%)

Query: 9   TLGSASVDNIAIGCGHNNEG-LFVGAAGLLGLGGGSLSFPSQ---INASTFSYCLVD--- 61
           T G A+V  +A GCG  N+G  F G  G++GLG G LSFP+Q   + A TFSYCL+D   
Sbjct: 168 TSGGAAVRGVAFGCGTRNQGGSFSGTGGVIGLGQGQLSFPAQSGSLFAQTFSYCLLDLEG 227

Query: 62  -RDSDSTSTLEFDSSLPPNAVT-APLLRNHELDTFYYLGLTGISVGGDLLPISETAFKID 119
            R   S+S L         A    PL+ N    TFYY+G+  I VG  +LP+  + + ID
Sbjct: 228 GRRGRSSSFLFLGRPERRAAFAYTPLVSNPLAPTFYYVGVVAIRVGNRVLPVPGSEWAID 287

Query: 120 ESGNGGIIVDSGTAVTRLQTETYNALRDAF---VRGTRALSPTDGVALFDTCYDFSSRSS 176
             GNGG ++DSG+ +T L+   Y  L  AF   V   R  S        + CY+ SS SS
Sbjct: 288 VLGNGGTVIDSGSTLTYLRLGAYLHLVSAFAASVHLPRIPSSATFFQGLELCYNVSSSSS 347

Query: 177 VE-----VPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS--SLSIIGNVQQQ 229
                   P ++  F +G  L LP  NYL+ V ++   C A  PT S  + +++GN+ QQ
Sbjct: 348 SAPANGGFPRLTIDFAQGLSLELPTGNYLVDV-ADDVKCLAIRPTLSPFAFNVLGNLMQQ 406

Query: 230 GTRVSFNLRNSLIGFTPNKC 249
           G  V F+  ++ IGF   +C
Sbjct: 407 GYHVEFDRASARIGFARTEC 426


>gi|224099307|ref|XP_002311432.1| predicted protein [Populus trichocarpa]
 gi|222851252|gb|EEE88799.1| predicted protein [Populus trichocarpa]
          Length = 458

 Score =  131 bits (329), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 98/281 (34%), Positives = 136/281 (48%), Gaps = 36/281 (12%)

Query: 1   GDFVTETVTLGSAS-----VDNIAIGCGHNNEGL------FVGAAGLLGLGGGSLSFPSQ 49
           G F  ET TL ++S     + +IA GCG +  G       F GA+G++GLG G +SF SQ
Sbjct: 179 GFFSKETTTLNTSSGREMKLKSIAFGCGFHASGPSLIGSSFNGASGVMGLGRGPISFASQ 238

Query: 50  IN---ASTFSYCLVDRD-----------SDSTSTLEFDSSLPPNAVTAPLLRNHELDTFY 95
           +      +FSYCL+D              D  ST + + S+       PLL N E  TFY
Sbjct: 239 LGRRFGRSFSYCLLDYTLSPPPTSYLMIGDVVSTKKDNKSM---MSFTPLLINPEAPTFY 295

Query: 96  YLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRA 155
           Y+ + G+ V G  L I  + + +DE GNGG ++DSGT +T L    Y  +  AF R  + 
Sbjct: 296 YISIKGVFVDGVKLHIDPSVWSLDELGNGGTVIDSGTTLTFLTEPAYREILSAFKREVKL 355

Query: 156 LSPTDGVAL----FDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCF 211
            SPT G A     FD C + +  S    P +S       +   P +NY I + S G  C 
Sbjct: 356 PSPTPGGASTRSGFDLCVNVTGVSRPRFPRLSLELGGESLYSPPPRNYFIDI-SEGIKCL 414

Query: 212 AFAPT---SSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           A  P    S   S+IGN+ QQG  + F+   S +GF+   C
Sbjct: 415 AIQPVEAESGRFSVIGNLMQQGFLLEFDRGKSRLGFSRRGC 455


>gi|224072755|ref|XP_002303865.1| predicted protein [Populus trichocarpa]
 gi|222841297|gb|EEE78844.1| predicted protein [Populus trichocarpa]
          Length = 412

 Score =  131 bits (329), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 83/259 (32%), Positives = 133/259 (51%), Gaps = 17/259 (6%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN---ASTFSY 57
           G+   E + LG+ +V+N   GCG  N+GLF GA+GL+GLG   LS  SQI+      FSY
Sbjct: 158 GEVGMEHLNLGNTTVNNFIFGCGRKNQGLFGGASGLVGLGRTDLSLISQISPMFGGVFSY 217

Query: 58  CLVDRDSDSTSTLEF--DSSLPPNAVTAPLLR--NHELDTFYYLGLTGISVGGDLLPISE 113
           CL   +++++ +L    +SS+  N       R  ++ L  FY+L LTGI+VGG  + +  
Sbjct: 218 CLPTTEAEASGSLVMGGNSSVYKNTTPISYTRMIHNPLLPFYFLNLTGITVGG--VEVQA 275

Query: 114 TAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSS 173
            +F  D      +I+DSGT ++RL    Y AL+  FV+            + D+C++ S 
Sbjct: 276 PSFGKDR-----MIIDSGTVISRLPPSIYQALKAEFVKQFSGYPSAPSFMILDSCFNLSG 330

Query: 174 RSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGT-FCFAFA--PTSSSLSIIGNVQQQG 230
              V++P +  +F     L +        V ++ +  C A A  P    + IIGN QQ+ 
Sbjct: 331 YQEVKIPDIKMYFEGSAELNVDVTGVFYSVKTDASQVCLAIASLPYEDEVGIIGNYQQKN 390

Query: 231 TRVSFNLRNSLIGFTPNKC 249
            R+ ++ + S++GF    C
Sbjct: 391 QRIIYDTKGSMLGFAEEAC 409


>gi|225430551|ref|XP_002283470.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 490

 Score =  131 bits (329), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 95/257 (36%), Positives = 127/257 (49%), Gaps = 15/257 (5%)

Query: 1   GDFVTETVTLGSASV-DNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN---ASTFS 56
           G F  E ++L S  V +N   GCG NN GLF G AGLLGL    LS  SQ        FS
Sbjct: 240 GFFAREKLSLTSTDVFNNFQFGCGQNNRGLFGGTAGLLGLARNPLSLVSQTAQKYGKVFS 299

Query: 57  YCLVDRDSDSTSTLEFDSSLPPNAVTA--PLLRNHELDTFYYLGLTGISVGGDLLPISET 114
           YCL    S ST  L F S    +      P   N +  +FY+L + GISVG   LPI ++
Sbjct: 300 YCL-PSSSSSTGYLSFGSGDGDSKAVKFTPSEVNSDYPSFYFLDMVGISVGERKLPIPKS 358

Query: 115 AFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSR 174
            F        G I+DSGT ++RL    Y++++  F           GV++ DTCYD S  
Sbjct: 359 VFS-----TAGTIIDSGTVISRLPPTVYSSVQKVFRELMSDYPRVKGVSILDTCYDLSKY 413

Query: 175 SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS--SSLSIIGNVQQQGTR 232
            +V+VP +  +F  G  + L A   +I V      C AFA  S    ++IIGNVQQ+   
Sbjct: 414 KTVKVPKIILYFSGGAEMDL-APEGIIYVLKVSQVCLAFAGNSDDDEVAIIGNVQQKTIH 472

Query: 233 VSFNLRNSLIGFTPNKC 249
           V ++     +GF P+ C
Sbjct: 473 VVYDDAEGRVGFAPSGC 489


>gi|357137345|ref|XP_003570261.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 458

 Score =  131 bits (329), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 97/259 (37%), Positives = 134/259 (51%), Gaps = 23/259 (8%)

Query: 1   GDFVTETVTLGSA-SVDNIAIGCGHNN---EGLFVGAA-GLLGLGGGSLSFPSQINA--- 52
           G + ++T+ L S   V+N   GC   +   EGL      GL+GLGGG+ S  SQ  A   
Sbjct: 213 GTYGSDTLALNSTEKVENFQFGCSETSDPGEGLDEDQTDGLMGLGGGAPSLVSQTAATYG 272

Query: 53  STFSYCLVDRDSDSTSTLEFDSSLPPNA-VTAPLLRNHELDTFYYLGLTGISVGGDLLPI 111
           S FSYCL    + S+  L   +S   +  VT P+ R+    TFY++ L GI+VGGD + I
Sbjct: 273 SAFSYCL-PATTRSSGFLTLGASTGTSGFVTTPMFRSRRAPTFYFVILQGINVGGDPVAI 331

Query: 112 SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDF 171
           S T F        G I+DSGT +TRL    Y+AL  AF  G R        ++ DTC+DF
Sbjct: 332 SPTVFA------AGSIMDSGTIITRLPPRAYSALSAAFRAGMRRYPRARAFSILDTCFDF 385

Query: 172 SSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSL-SIIGNVQQQG 230
           + + +V +P V   F  G V+ L A   +         C AFAP +  + SIIGNVQQ+ 
Sbjct: 386 TGQDNVSIPAVELVFSGGAVVDLDADGIMY------GSCLAFAPATGGIGSIIGNVQQRT 439

Query: 231 TRVSFNLRNSLIGFTPNKC 249
             V  ++  S++GF P  C
Sbjct: 440 FEVLHDVGQSVLGFRPGAC 458


>gi|326495920|dbj|BAJ90582.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 476

 Score =  130 bits (328), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 101/247 (40%), Positives = 125/247 (50%), Gaps = 29/247 (11%)

Query: 14  SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSYCLVDRDSDSTS-T 69
           +V     GCGH   G+F G  GLL LG  S+S  SQ   +    FSYCL  + S +   T
Sbjct: 248 TVGTFLFGCGHAQAGMFAGIDGLLALGRQSMSLKSQAAGAYGGVFSYCLPSKQSAAGYLT 307

Query: 70  LEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVD 129
           L   SS    A T  LL      TFY + LTGISVGG  + +  +AF       GG +VD
Sbjct: 308 LGGPSSASGFATTG-LLTAWAAPTFYMVMLTGISVGGQQVAVPASAFA------GGTVVD 360

Query: 130 SGTAVTRLQTETYNALRDAFVRGTRA-----LSPTDGVALFDTCYDFSSRSSVEVPTVSF 184
           +GT +TRL    Y ALR AF RG  A      +P +G+   DTCYDFS    V +PTV+ 
Sbjct: 361 TGTVITRLPPTAYAALRSAF-RGAIAPCGYPSAPANGI--LDTCYDFSRYGVVTLPTVAL 417

Query: 185 HFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS--SSLSIIGNVQQQGTRVSFNLRNSLI 242
            F  G  L L A   L    S+G  C AFAP       +I+GNVQQ+   V F+   S +
Sbjct: 418 TFSGGATLALEAPGIL----SSG--CLAFAPNGGDGDAAILGNVQQRSFAVRFD--GSTV 469

Query: 243 GFTPNKC 249
           GF P  C
Sbjct: 470 GFMPGAC 476


>gi|255548660|ref|XP_002515386.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223545330|gb|EEF46835.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 387

 Score =  130 bits (327), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 96/255 (37%), Positives = 126/255 (49%), Gaps = 11/255 (4%)

Query: 1   GDFVTETVTLGSASV-DNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFS 56
           G F TE +T+  + V  N   GCG  N G F   AGLLGLG G LS   Q +      F+
Sbjct: 137 GFFATEKLTISPSDVISNFLFGCGQQNAGRFGRIAGLLGLGRGKLSLALQTSEKYNNLFT 196

Query: 57  YCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
           YCL    S ST  L     +P +    PL    +   FY + + G+SVGG +LPI  + F
Sbjct: 197 YCLPSFSSSSTGHLTLGGQVPKSVKFTPLSPAFKNTPFYGIDIKGLSVGGHVLPIDASVF 256

Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSS 176
                 N G I+DSGT +TRLQ   Y+AL   F +  +    TDG ++ DTCYDFS   S
Sbjct: 257 S-----NAGAIIDSGTVITRLQPTVYSALSSKFQQLMKDYPKTDGFSILDTCYDFSGNES 311

Query: 177 VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS--SLSIIGNVQQQGTRVS 234
           + VP +SF F  G  + +     L  +++    C AFAP        + GN QQQ   V 
Sbjct: 312 ISVPRISFFFKGGVEVDIKFFGILTVINAWDKVCLAFAPNDDDGDFVVFGNSQQQTYDVV 371

Query: 235 FNLRNSLIGFTPNKC 249
            +L    IGF P+ C
Sbjct: 372 HDLAKGRIGFAPSGC 386


>gi|168054484|ref|XP_001779661.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162668975|gb|EDQ55572.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 419

 Score =  130 bits (327), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 96/264 (36%), Positives = 130/264 (49%), Gaps = 21/264 (7%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN---ASTFSY 57
           G F  E+ T+    +D +A GCG +N+G F  A G+LGLG G LSF SQ+     + F+Y
Sbjct: 158 GVFAYESATVDDVRIDKVAFGCGRDNQGSFAAAGGVLGLGQGPLSFGSQVGYAYGNKFAY 217

Query: 58  CLVDR-DSDSTSTL-----EFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPI 111
           CLV+  D  S S+      E  S++     T P++ N    T YY+ +  + VGG+ LPI
Sbjct: 218 CLVNYLDPTSVSSWLIFGDELISTIHDLQFT-PIVSNSRNPTLYYVQIEKVMVGGESLPI 276

Query: 112 SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAF---VRGTRALSPTDGVALFDTC 168
           S +A+ +D  GNGG I DSGT VT      Y  +  AF   VR  RA S    V   D C
Sbjct: 277 SHSAWSLDFLGNGGSIFDSGTTVTYWLPPAYRNILAAFDKNVRYPRAAS----VQGLDLC 332

Query: 169 YDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSL---SIIGN 225
            D +       P+ +     G V      NY + V  N   C A A   SS+   + IGN
Sbjct: 333 VDVTGVDQPSFPSFTIVLGGGAVFQPQQGNYFVDVAPN-VQCLAMAGLPSSVGGFNTIGN 391

Query: 226 VQQQGTRVSFNLRNSLIGFTPNKC 249
           + QQ   V ++   + IGF P KC
Sbjct: 392 LLQQNFLVQYDREENRIGFAPAKC 415


>gi|242050432|ref|XP_002462960.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
 gi|241926337|gb|EER99481.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
          Length = 445

 Score =  130 bits (326), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 98/266 (36%), Positives = 141/266 (53%), Gaps = 25/266 (9%)

Query: 5   TETVTLGSAS------VDNIAIGCGHNNEGLFVGAA-GLLGLGGGSLSFPSQINASTFSY 57
           +ET T GS++      V  IA GC + + G    +A GL+GLG GSLS  SQ+    FSY
Sbjct: 181 SETFTFGSSTPANQTGVPGIAFGCSNASGGFNTSSASGLVGLGRGSLSLVSQLGVPKFSY 240

Query: 58  CLVD-RDSDSTSTLEFDSSLPPN----AVTAPLL---RNHELDTFYYLGLTGISVGGDLL 109
           CL   +D++STSTL    S   N      + P +    +  + T+YYL LTGIS+G   L
Sbjct: 241 CLTPYQDTNSTSTLLLGPSASLNDTGGVSSTPFVASPSDAPMSTYYYLNLTGISLGTTAL 300

Query: 110 PISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVAL---FD 166
            I  TA  +   G GG I+DSGT +T L    Y  +R A V     L  TDG +     D
Sbjct: 301 SIPTTALSLKADGTGGFIIDSGTTITLLGNTAYQQVRAAVVS-LVTLPTTDGGSAATGLD 359

Query: 167 TCYDFSSRSSVE--VPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFA-PTSSSLSII 223
            C++  S +S    +P+++ HF +G  + LPA +Y++ +DSN  +C A    T   +SI+
Sbjct: 360 LCFELPSSTSAPPTMPSMTLHF-DGADMVLPADSYMM-LDSN-LWCLAMQNQTDGGVSIL 416

Query: 224 GNVQQQGTRVSFNLRNSLIGFTPNKC 249
           GN QQQ   + +++    + F P KC
Sbjct: 417 GNYQQQNMHILYDVGQETLTFAPAKC 442


>gi|45735845|dbj|BAD12880.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
 gi|45735971|dbj|BAD13000.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
          Length = 333

 Score =  129 bits (325), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 91/252 (36%), Positives = 134/252 (53%), Gaps = 13/252 (5%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
           G    +TV+ GS S+ N   GCG +NEGLF  +AGL+GL    LS   Q+  S   +F+Y
Sbjct: 91  GYLSKDTVSFGSTSLPNFYYGCGQDNEGLFGRSAGLIGLARNKLSLLYQLAPSLGYSFTY 150

Query: 58  CLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFK 117
           CL    S    +L   +  P      P++ +   D+ Y++ L+G++V G+ L +S +A+ 
Sbjct: 151 CLPSSSSSGYLSLGSYN--PGQYSYTPMVSSSLDDSLYFIKLSGMTVAGNPLSVSSSAYS 208

Query: 118 IDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSV 177
              +     I+DSGT +TRL T  Y+AL  A     +  S     ++ DTC+     S V
Sbjct: 209 SLPT-----IIDSGTVITRLPTSVYSALSKAVAAAMKGTSRASAYSILDTCFK-GQASRV 262

Query: 178 EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNL 237
             P V+  F  G  L L A+N L+ VD + T C AFAP  S+ +IIGN QQQ   V +++
Sbjct: 263 SAPAVTMSFAGGAALKLSAQNLLVDVD-DSTTCLAFAPARSA-AIIGNTQQQTFSVVYDV 320

Query: 238 RNSLIGFTPNKC 249
           ++S IGF    C
Sbjct: 321 KSSRIGFAAGGC 332


>gi|147776733|emb|CAN74676.1| hypothetical protein VITISV_038368 [Vitis vinifera]
          Length = 389

 Score =  129 bits (325), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 83/251 (33%), Positives = 126/251 (50%), Gaps = 18/251 (7%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ---INASTFSY 57
           G+   E +  G+  V +   GCG NN+GLF G +GL+GLG   LS  SQ   I    FSY
Sbjct: 90  GELGHEKLKFGTILVKDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLISQTSGIFGGVFSY 149

Query: 58  CL--VDRDSDSTSTLEFDSSLPPNAV---TAPLLRNHELDTFYYLGLTGISVGGDLLPIS 112
           CL   +R    +  L  +SS+  N+     A ++ N +L  FY++ LTGIS+GG      
Sbjct: 150 CLPSTERKGSGSLILGGNSSVYRNSSPISYAKMIENPQLYNFYFINLTGISIGG------ 203

Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFS 172
             A +    G   I+VDSGT +TRL    Y AL+  F++      P    ++ DTC++ S
Sbjct: 204 -VALQAPSVGPSRILVDSGTVITRLPPTIYKALKAEFLKQFTGFPPAPAFSILDTCFNLS 262

Query: 173 SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGT-FCFAFA--PTSSSLSIIGNVQQQ 229
           +   V++PT+  HF     L +        V S+ +  C A A       ++I+GN QQ+
Sbjct: 263 AYQEVDIPTIKMHFEGNAELTVDVTGVFYFVKSDASQVCLALASLEYQDEVAILGNYQQK 322

Query: 230 GTRVSFNLRNS 240
             RV ++ + +
Sbjct: 323 NLRVIYDTKET 333


>gi|326531454|dbj|BAJ97731.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 476

 Score =  129 bits (325), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 100/247 (40%), Positives = 125/247 (50%), Gaps = 29/247 (11%)

Query: 14  SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSYCLVDRDSDSTS-T 69
           +V     GCGH   G+F G  GLL LG  S+S  SQ   +    FSYCL  + S +   T
Sbjct: 248 TVGTFLFGCGHAQAGMFAGIDGLLALGRQSMSLKSQAAGAYGGVFSYCLPSKQSAAGYLT 307

Query: 70  LEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVD 129
           L   +S    A T  LL      TFY + LTGISVGG  + +  +AF       GG +VD
Sbjct: 308 LGGPTSASGFATTG-LLTAWAAPTFYMVMLTGISVGGQQVAVPASAFA------GGTVVD 360

Query: 130 SGTAVTRLQTETYNALRDAFVRGTRA-----LSPTDGVALFDTCYDFSSRSSVEVPTVSF 184
           +GT +TRL    Y ALR AF RG  A      +P +G+   DTCYDFS    V +PTV+ 
Sbjct: 361 TGTVITRLPPTAYAALRSAF-RGAIAPYGYPSAPANGI--LDTCYDFSRYGVVTLPTVAL 417

Query: 185 HFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS--SSLSIIGNVQQQGTRVSFNLRNSLI 242
            F  G  L L A   L    S+G  C AFAP       +I+GNVQQ+   V F+   S +
Sbjct: 418 TFSGGATLALEAPGIL----SSG--CLAFAPNGGDGDAAILGNVQQRSFAVRFD--GSTV 469

Query: 243 GFTPNKC 249
           GF P  C
Sbjct: 470 GFMPGAC 476


>gi|356527091|ref|XP_003532147.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 482

 Score =  129 bits (325), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 94/243 (38%), Positives = 118/243 (48%), Gaps = 15/243 (6%)

Query: 15  VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ---INASTFSYCLVDRDSDSTSTLE 71
           V +   GCG +NEGLF G AGL+GL    +SF  Q   I    FSYCL    S S   L 
Sbjct: 246 VHDFLFGCGQDNEGLFRGTAGLMGLSRHPISFVQQTSSIYNKIFSYCLPSTPS-SLGHLT 304

Query: 72  FDSSLPPNA--VTAPLLRNHELDTFYYLGLTGISVGGDLLP-ISETAFKIDESGNGGIIV 128
           F +S   NA     P       ++FY L + GISVGG  LP +S + F       GG I+
Sbjct: 305 FGASAATNANLKYTPFSTISGENSFYGLDIVGISVGGTKLPAVSSSTFSA-----GGSII 359

Query: 129 DSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPE 188
           DSGT +TRL    Y ALR AF +         G  L DTCYDFS    + VP + F F  
Sbjct: 360 DSGTVITRLPPTAYAALRSAFRQFMMKYPVAYGTRLLDTCYDFSGYKEISVPRIDFEFAG 419

Query: 189 GKVLPLPAKNYLIPVDSNGTFCFAFAP--TSSSLSIIGNVQQQGTRVSFNLRNSLIGFTP 246
           G  + LP    L   +S    C AFA     + ++I GNVQQ+   V +++    IGF  
Sbjct: 420 GVKVELPLVGILYG-ESAQQLCLAFAANGNGNDITIFGNVQQKTLEVVYDVEGGRIGFGA 478

Query: 247 NKC 249
             C
Sbjct: 479 AGC 481


>gi|312283333|dbj|BAJ34532.1| unnamed protein product [Thellungiella halophila]
          Length = 428

 Score =  129 bits (325), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 93/258 (36%), Positives = 129/258 (50%), Gaps = 15/258 (5%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ---INASTFSY 57
                +T+TL +  + +   GC     G  + A GL+GLG G LS  SQ   +  STFSY
Sbjct: 176 ASLTQDTLTLANDVIKSYTFGCISKATGTSLPAQGLMGLGRGPLSLISQTQNLYMSTFSY 235

Query: 58  CLVD-RDSDSTSTLEFDSSLPPNAV-TAPLLRNHELDTFYYLGLTGISVGGDLLPISETA 115
           CL + + S+ + +L       P  + T PLL+N    + YY+ L GI VG  ++ I  +A
Sbjct: 236 CLPNSKSSNFSGSLRLGPKYQPVRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSA 295

Query: 116 FKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRS 175
              D S   G I DSGT  TRL    Y A+R+ F R  +  + T  +  FDTCY      
Sbjct: 296 LAFDASTGAGTIFDSGTVFTRLVEPAYVAVRNEFRRRIKNANATS-LGGFDTCYS----G 350

Query: 176 SVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP----TSSSLSIIGNVQQQGT 231
           SV  P+V+F F  G  + LP  N LI   S  T C A A      +S L++I ++QQQ  
Sbjct: 351 SVVYPSVTFMF-AGMNVTLPPDNLLIHSSSGSTSCLAMAAAPNNVNSVLNVIASMQQQNH 409

Query: 232 RVSFNLRNSLIGFTPNKC 249
           RV  +L NS +G +   C
Sbjct: 410 RVLIDLPNSRLGISRETC 427


>gi|118486912|gb|ABK95290.1| unknown [Populus trichocarpa]
          Length = 438

 Score =  129 bits (325), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 91/261 (34%), Positives = 124/261 (47%), Gaps = 24/261 (9%)

Query: 3   FVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINA---STFSYCL 59
            V + +TL +  +     GC +   G  +   GLLGLG G +S  SQ  A     FSYCL
Sbjct: 187 LVQDAITLANDVIPGFTFGCINAVSGGSIPPQGLLGLGRGPISLISQAGAMYSGVFSYCL 246

Query: 60  VDRDSDSTSTLEFDSSL-------PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPIS 112
                 S  +  F  SL       P +  T PLLRN    + YY+ LTG+SVG   +PI 
Sbjct: 247 -----PSFKSYYFSGSLKLGPVGQPKSIRTTPLLRNPHRPSLYYVNLTGVSVGRIKVPIP 301

Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFS 172
                 D +   G I+DSGT +TR     Y A+RD F +      P   +  FDTC  F+
Sbjct: 302 SEQLVFDPNTGAGTIIDSGTVITRFVQPVYFAIRDEFRKQVNG--PISSLGAFDTC--FA 357

Query: 173 SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP----TSSSLSIIGNVQQ 228
           + +  E P ++ HF EG  L LP +N LI   S    C + A      +S L++I N+QQ
Sbjct: 358 ATNEAEAPAITLHF-EGLNLVLPMENSLIHSSSGSLACLSMAAAPNNVNSVLNVIANLQQ 416

Query: 229 QGTRVSFNLRNSLIGFTPNKC 249
           Q  R+ F+  NS +G     C
Sbjct: 417 QNLRIMFDTTNSRLGIARELC 437


>gi|224067042|ref|XP_002302336.1| predicted protein [Populus trichocarpa]
 gi|222844062|gb|EEE81609.1| predicted protein [Populus trichocarpa]
          Length = 438

 Score =  129 bits (324), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 91/261 (34%), Positives = 124/261 (47%), Gaps = 24/261 (9%)

Query: 3   FVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINA---STFSYCL 59
            V + +TL +  +     GC +   G  +   GLLGLG G +S  SQ  A     FSYCL
Sbjct: 187 LVQDAITLANDVIPGFTFGCINAVSGGSIPPQGLLGLGRGPISLISQAGAMYSGVFSYCL 246

Query: 60  VDRDSDSTSTLEFDSSL-------PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPIS 112
                 S  +  F  SL       P +  T PLLRN    + YY+ LTG+SVG   +PI 
Sbjct: 247 -----PSFKSYYFSGSLKLGPVGQPKSIRTTPLLRNPHRPSLYYVNLTGVSVGRIKVPIP 301

Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFS 172
                 D +   G I+DSGT +TR     Y A+RD F +      P   +  FDTC  F+
Sbjct: 302 SEQLVFDPNTGAGTIIDSGTVITRFVQPVYFAIRDEFRKQVNG--PISSLGAFDTC--FA 357

Query: 173 SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP----TSSSLSIIGNVQQ 228
           + +  E P ++ HF EG  L LP +N LI   S    C + A      +S L++I N+QQ
Sbjct: 358 ATNEAEAPAITLHF-EGLNLVLPMENSLIHSSSGSLACLSMAAAPNNVNSVLNVIANLQQ 416

Query: 229 QGTRVSFNLRNSLIGFTPNKC 249
           Q  R+ F+  NS +G     C
Sbjct: 417 QNLRIMFDTTNSRLGIARELC 437


>gi|115448353|ref|NP_001047956.1| Os02g0720900 [Oryza sativa Japonica Group]
 gi|45735844|dbj|BAD12879.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
 gi|45735970|dbj|BAD12999.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
 gi|113537487|dbj|BAF09870.1| Os02g0720900 [Oryza sativa Japonica Group]
 gi|125540930|gb|EAY87325.1| hypothetical protein OsI_08729 [Oryza sativa Indica Group]
 gi|215692622|dbj|BAG88042.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 458

 Score =  129 bits (324), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 91/252 (36%), Positives = 134/252 (53%), Gaps = 13/252 (5%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
           G    +TV+ GS S+ N   GCG +NEGLF  +AGL+GL    LS   Q+  S   +F+Y
Sbjct: 216 GYLSKDTVSFGSTSLPNFYYGCGQDNEGLFGRSAGLIGLARNKLSLLYQLAPSLGYSFTY 275

Query: 58  CLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFK 117
           CL    S    +L   +  P      P++ +   D+ Y++ L+G++V G+ L +S +A+ 
Sbjct: 276 CLPSSSSSGYLSLGSYN--PGQYSYTPMVSSSLDDSLYFIKLSGMTVAGNPLSVSSSAYS 333

Query: 118 IDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSV 177
              +     I+DSGT +TRL T  Y+AL  A     +  S     ++ DTC+     S V
Sbjct: 334 SLPT-----IIDSGTVITRLPTSVYSALSKAVAAAMKGTSRASAYSILDTCFK-GQASRV 387

Query: 178 EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNL 237
             P V+  F  G  L L A+N L+ VD + T C AFAP  S+ +IIGN QQQ   V +++
Sbjct: 388 SAPAVTMSFAGGAALKLSAQNLLVDVD-DSTTCLAFAPARSA-AIIGNTQQQTFSVVYDV 445

Query: 238 RNSLIGFTPNKC 249
           ++S IGF    C
Sbjct: 446 KSSRIGFAAGGC 457


>gi|242095586|ref|XP_002438283.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
 gi|241916506|gb|EER89650.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
          Length = 470

 Score =  129 bits (324), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 93/252 (36%), Positives = 125/252 (49%), Gaps = 13/252 (5%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
           G    +TV+ GS S  N   GCG +NEGLF  +AGL+GL    LS   Q+  S   +FSY
Sbjct: 228 GYLSRDTVSFGSGSYPNFYYGCGQDNEGLFGRSAGLIGLARNKLSLLYQLAPSLGYSFSY 287

Query: 58  CLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFK 117
           CL      ST  L        +    P+  +    + Y++ L+G+SVGG  L +S     
Sbjct: 288 CL--PTPASTGYLSIGPYTSGHYSYTPMASSSLDASLYFVTLSGMSVGGSPLAVSPA--- 342

Query: 118 IDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSV 177
             E  +   I+DSGT +TRL T  Y AL  A       +      ++ DTC+     S +
Sbjct: 343 --EYSSLPTIIDSGTVITRLPTAVYTALSKAVAAAMVGVQSAPAFSILDTCFQ-GQASQL 399

Query: 178 EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNL 237
            VP V+  F  G  L L  +N LI VD + T C AFAPT S+ +IIGN QQQ   V +++
Sbjct: 400 RVPAVAMAFAGGATLKLATQNVLIDVD-DSTTCLAFAPTDST-TIIGNTQQQTFSVVYDV 457

Query: 238 RNSLIGFTPNKC 249
             S IGF    C
Sbjct: 458 AQSRIGFAAGGC 469


>gi|255548662|ref|XP_002515387.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223545331|gb|EEF46836.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 463

 Score =  129 bits (323), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 96/258 (37%), Positives = 127/258 (49%), Gaps = 32/258 (12%)

Query: 6   ETVTLGSASV-DNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSYCLVD 61
           E +T+GS  + +N   GCG + +GLF  AAGLLGLG   LS  SQ        FSYCL  
Sbjct: 223 ERLTIGSTDIFNNFYFGCGQDVDGLFGKAAGLLGLGRDKLSVVSQTAPKYNQLFSYCL-- 280

Query: 62  RDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDES 121
             S ST  L F SS   +A   PL  +    +FY L LTGI+VGG  L I  + F     
Sbjct: 281 PSSSSTGFLSFGSSQSKSAKFTPL--SSGPSSFYNLDLTGITVGGQKLAIPLSVFS---- 334

Query: 122 GNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPT 181
              G I+DSGT VTRL    Y+ALR AF +   +      +++ DTCYDFS   +++VP 
Sbjct: 335 -TAGTIIDSGTVVTRLPPAAYSALRSAFRKAMASYPMGKPLSILDTCYDFSKYKTIKVPK 393

Query: 182 VSFHFPEGKVLPLPAKNYLIPVDSNGTF--------CFAFAPTSSS--LSIIGNVQQQGT 231
           +   F  G           + VD  G F        C AFA  + +   +I GN QQ+  
Sbjct: 394 IVISFSGG---------VDVDVDQAGIFVANGLKQVCLAFAGNTGARDTAIFGNTQQRNF 444

Query: 232 RVSFNLRNSLIGFTPNKC 249
            V +++    +GF P  C
Sbjct: 445 EVVYDVSGGKVGFAPASC 462


>gi|45735840|dbj|BAD12875.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
 gi|45735966|dbj|BAD12995.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
 gi|125583491|gb|EAZ24422.1| hypothetical protein OsJ_08175 [Oryza sativa Japonica Group]
          Length = 475

 Score =  128 bits (322), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 98/264 (37%), Positives = 134/264 (50%), Gaps = 36/264 (13%)

Query: 1   GDFVTETVTL-GSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFS 56
           G + ++T+TL GS ++     GCGH  +GLF G  GLLGLG    S  SQ +++    FS
Sbjct: 233 GVYSSDTLTLTGSNALKGFLFGCGHAQQGLFAGVDGLLGLGRQGQSLVSQASSTYGGVFS 292

Query: 57  YCLVDRDSDSTSTLEFDSSLPPNAV----TAPLLRNHELDTFYYLGLTGISVGGDLLPIS 112
           YCL      + +++ + S   P++     T PLL      T+Y + L GISVGG  L I 
Sbjct: 293 YCL----PPTQNSVGYISLGGPSSTAGFSTTPLLTASNDPTYYIVMLAGISVGGQPLSID 348

Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSP-----TDGVALFDT 167
            + F        G +VD+GT VTRL    Y+ALR AF     A++P          + DT
Sbjct: 349 ASVFA------SGAVVDTGTVVTRLPPTAYSALRSAF---RAAMAPYGYPSAPATGILDT 399

Query: 168 CYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS--SSLSIIGN 225
           CYDF+   +V +PT+S  F  G  + L     L       + C AFAPT   S  SI+GN
Sbjct: 400 CYDFTRYGTVTLPTISIAFGGGAAMDLGTSGILT------SGCLAFAPTGGDSQASILGN 453

Query: 226 VQQQGTRVSFNLRNSLIGFTPNKC 249
           VQQ+   V F+   S +GF P  C
Sbjct: 454 VQQRSFEVRFD--GSTVGFMPASC 475


>gi|125540927|gb|EAY87322.1| hypothetical protein OsI_08726 [Oryza sativa Indica Group]
          Length = 464

 Score =  128 bits (322), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 98/264 (37%), Positives = 134/264 (50%), Gaps = 36/264 (13%)

Query: 1   GDFVTETVTL-GSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFS 56
           G + ++T+TL GS ++     GCGH  +GLF G  GLLGLG    S  SQ +++    FS
Sbjct: 222 GVYSSDTLTLTGSNALKGFLFGCGHAQQGLFAGVDGLLGLGRQGQSLVSQASSTYGGVFS 281

Query: 57  YCLVDRDSDSTSTLEFDSSLPPNAV----TAPLLRNHELDTFYYLGLTGISVGGDLLPIS 112
           YCL      + +++ + S   P++     T PLL      T+Y + L GISVGG  L I 
Sbjct: 282 YCL----PPTQNSVGYISLGGPSSTAGFSTTPLLTASNDPTYYIVMLAGISVGGQPLSID 337

Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSP-----TDGVALFDT 167
            + F        G +VD+GT VTRL    Y+ALR AF     A++P          + DT
Sbjct: 338 ASVFA------SGAVVDTGTVVTRLPPTAYSALRSAF---RAAMAPYGYPSAPATGILDT 388

Query: 168 CYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS--SSLSIIGN 225
           CYDF+   +V +PT+S  F  G  + L     L       + C AFAPT   S  SI+GN
Sbjct: 389 CYDFTRYGTVTLPTISIAFGGGAAMDLGTSGILT------SGCLAFAPTGGDSQASILGN 442

Query: 226 VQQQGTRVSFNLRNSLIGFTPNKC 249
           VQQ+   V F+   S +GF P  C
Sbjct: 443 VQQRSFEVRFD--GSTVGFMPASC 464


>gi|326524580|dbj|BAK00673.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 461

 Score =  128 bits (321), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 93/272 (34%), Positives = 135/272 (49%), Gaps = 24/272 (8%)

Query: 1   GDFVTETVTLGSASVDNIAI----GCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFS 56
           G + TE  T  S+S + +++    GCG  N G     +G++G G   LS  SQ++   FS
Sbjct: 191 GVYATERFTFASSSGEKLSVPLGFGCGTMNVGSLNNGSGIVGFGRDPLSLVSQLSIRRFS 250

Query: 57  YCLVDRDSDSTSTLEF----------DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGG 106
           YCL    S   STL F          D +      T  LL++ +  TFYY+  TG++VG 
Sbjct: 251 YCLTPYTSTRKSTLMFGSLSDGVFEGDDAATGQVQTTRLLQSRQNPTFYYVPFTGVTVGT 310

Query: 107 DLLPISETAFKIDESGNGGIIVDSGTAVT----RLQTETYNALRDAF-VRGTRALSPTDG 161
             L I  +AF +   G+GG+IVDSGTA+T     + TE   A R    +  T + SP DG
Sbjct: 311 RRLRIPLSAFALRPDGSGGVIVDSGTALTLFPAAVLTEVLRAFRAQLRLPFTSSSSPDDG 370

Query: 162 VA----LFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS 217
           V     +       S+ + V VP ++FHF +G  L LP +NY++     G+ C   A + 
Sbjct: 371 VCFATPMAAGGRRASAATVVSVPRMAFHF-QGADLELPRRNYVLDDPRRGSLCILLADSG 429

Query: 218 SSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
            S + IGN  QQ  RV ++L    + F P +C
Sbjct: 430 DSGATIGNFVQQDMRVLYDLEAETLSFAPAQC 461


>gi|326513976|dbj|BAJ92138.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 342

 Score =  128 bits (321), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 84/247 (34%), Positives = 123/247 (49%), Gaps = 16/247 (6%)

Query: 18  IAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLVDRDSDSTSTLEFD--SS 75
           +  GCG  + G  VGA+GL+GL  G++S  SQ++   FSYCL       TS + F   + 
Sbjct: 94  LGFGCGALSAGSLVGASGLMGLSPGTMSLISQLSVPRFSYCLTPFAERKTSPMLFGAMAD 153

Query: 76  LPPNAVTAP-----LLRNHELDTF-YYLGLTGISVGGDLLPISETAFKIDESGNGGIIVD 129
           L     T P     +LRN  +DTF YY+ L G+S+G   L +   +  I+  G GG IVD
Sbjct: 154 LRKYNTTGPIQTTAILRNPAMDTFYYYVPLVGLSLGTKRLRVPAASLAINPDGTGGTIVD 213

Query: 130 SGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSS---RSSVEVPTVSFHF 186
           SG+ +  L  + ++A++ A +   +       V  ++ C+   S    ++V+ P +  HF
Sbjct: 214 SGSTMAHLAGKAFDAVKKAVLEAVKLPVFNGTVEDYELCFAVPSGVAMAAVKTPPLVLHF 273

Query: 187 PEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSL----SIIGNVQQQGTRVSFNLRNSLI 242
             G  + LP  NY     + G  C A A +   L    SIIGNVQQQ   V F++ N   
Sbjct: 274 DGGAAMALPRDNYFQEPRA-GLMCLAVARSPEDLGAPISIIGNVQQQNMHVLFDVHNQKF 332

Query: 243 GFTPNKC 249
            F P KC
Sbjct: 333 SFAPTKC 339


>gi|195607464|gb|ACG25562.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
          Length = 478

 Score =  128 bits (321), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 99/265 (37%), Positives = 137/265 (51%), Gaps = 35/265 (13%)

Query: 1   GDFVTETVTL-GSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFS 56
           G + ++T+TL  S++V     GCGH   GLF G  GLLGLG    S   Q   +    FS
Sbjct: 233 GVYSSDTLTLSASSAVQGFFFGCGHAQSGLFNGVDGLLGLGREQPSLVEQTAGTYGGVFS 292

Query: 57  YCLVDRDSDS---TSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISE 113
           YCL  + S +   T  L   S   P   T  LL +    T+Y + LTGISVGG  L +  
Sbjct: 293 YCLPTKPSTAGYLTLGLGGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPA 352

Query: 114 TAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRAL----SPTDGVALFDTCY 169
           +AF       GG +VD+GT +TRL    Y ALR AF  G  +     +P++G+   DTCY
Sbjct: 353 SAFA------GGTVVDTGTVITRLPPTAYAALRSAFRSGMASYGYPTAPSNGI--LDTCY 404

Query: 170 DFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTF-CFAFAPTSS--SLSIIGNV 226
           +F+   +V +P V+  F  G  + L A   L       +F C AFAP+ S   ++I+GNV
Sbjct: 405 NFAGYGTVTLPNVALTFGSGATVMLGADGIL-------SFGCLAFAPSGSDGGMAILGNV 457

Query: 227 QQQGTRVSFNLR--NSLIGFTPNKC 249
           QQ+    SF +R   + +GF P+ C
Sbjct: 458 QQR----SFEVRIDGTSVGFKPSSC 478


>gi|195625122|gb|ACG34391.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
          Length = 471

 Score =  128 bits (321), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 92/253 (36%), Positives = 132/253 (52%), Gaps = 14/253 (5%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
           G   T+TV+ GS    +   GCG +NEGLF  +AGL+GL    LS   Q+  S   +FSY
Sbjct: 228 GSLSTDTVSFGSTRYPSFYYGCGQDNEGLFGRSAGLIGLARNKLSLLYQLAPSLGYSFSY 287

Query: 58  CLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELD-TFYYLGLTGISVGGDLLPISETAF 116
           CL    + ST  L        +  +   + +  LD + Y++ L+G+SVGG  L +S +  
Sbjct: 288 CL--PTAASTGYLSIGPYNTGHYYSYTPMASSSLDASLYFITLSGMSVGGSPLAVSPS-- 343

Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSS 176
              E  +   I+DSGT +TRL T  + AL  A  +           ++ DTC++    S 
Sbjct: 344 ---EYSSLPTIIDSGTVITRLPTAVHTALSKAVAQAMAGAQRAPAFSILDTCFE-GQASQ 399

Query: 177 VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFN 236
           + VPTV+  F  G  + L  +N LI VD + T C AFAPT S+ +IIGN QQQ   V ++
Sbjct: 400 LRVPTVAMAFAGGASMKLTTRNVLIDVD-DSTTCLAFAPTDST-AIIGNTQQQTFSVIYD 457

Query: 237 LRNSLIGFTPNKC 249
           +  S IGF+   C
Sbjct: 458 VAQSRIGFSAGGC 470


>gi|125529158|gb|EAY77272.1| hypothetical protein OsI_05246 [Oryza sativa Indica Group]
          Length = 426

 Score =  127 bits (320), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 85/239 (35%), Positives = 122/239 (51%), Gaps = 15/239 (6%)

Query: 21  GCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN---ASTFSYCLVD-RDSDSTSTLEFDSSL 76
           GC     G  V   GL+G G G LSF SQ      S FSYCL + R S+ + TL+     
Sbjct: 191 GCLRVVSGNSVPPQGLIGFGRGPLSFLSQTKDTYGSVFSYCLPNYRSSNFSGTLKLGPIG 250

Query: 77  PPNAV-TAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVT 135
            P  + T PLL N    + YY+ + GI VG  ++ + ++A   +     G I+D+GT  T
Sbjct: 251 QPKRIKTTPLLYNPHRPSLYYVNMIGIRVGSKVVQVPQSALAFNPVTGSGTIIDAGTMFT 310

Query: 136 RLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLP 195
           RL    Y A+RDAF RG         +  FDTCY+     +V VPTV+F F     + LP
Sbjct: 311 RLAAPVYAAVRDAF-RGRVRTPVAPPLGGFDTCYNV----TVSVPTVTFMFAGAVAVTLP 365

Query: 196 AKNYLIPVDSNGTFCFAFAP-----TSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
            +N +I   S G  C A A       +++L+++ ++QQQ  RV F++ N  +GF+   C
Sbjct: 366 EENVMIHSSSGGVACLAMAAGPSDGVNAALNVLASMQQQNQRVLFDVANGRVGFSRELC 424


>gi|115442309|ref|NP_001045434.1| Os01g0954900 [Oryza sativa Japonica Group]
 gi|20161865|dbj|BAB90778.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
 gi|113534965|dbj|BAF07348.1| Os01g0954900 [Oryza sativa Japonica Group]
 gi|215766867|dbj|BAG99095.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 445

 Score =  127 bits (320), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 85/239 (35%), Positives = 122/239 (51%), Gaps = 15/239 (6%)

Query: 21  GCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN---ASTFSYCLVD-RDSDSTSTLEFDSSL 76
           GC     G  V   GL+G G G LSF SQ      S FSYCL + R S+ + TL+     
Sbjct: 210 GCLRVVSGNSVPPQGLIGFGRGPLSFLSQTKDTYGSVFSYCLPNYRSSNFSGTLKLGPIG 269

Query: 77  PPNAV-TAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVT 135
            P  + T PLL N    + YY+ + GI VG  ++ + ++A   +     G I+D+GT  T
Sbjct: 270 QPKRIKTTPLLYNPHRPSLYYVNMIGIRVGSKVVQVPQSALAFNPVTGSGTIIDAGTMFT 329

Query: 136 RLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLP 195
           RL    Y A+RDAF RG         +  FDTCY+     +V VPTV+F F     + LP
Sbjct: 330 RLAAPVYAAVRDAF-RGRVRTPVAPPLGGFDTCYNV----TVSVPTVTFMFAGAVAVTLP 384

Query: 196 AKNYLIPVDSNGTFCFAFAP-----TSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
            +N +I   S G  C A A       +++L+++ ++QQQ  RV F++ N  +GF+   C
Sbjct: 385 EENVMIHSSSGGVACLAMAAGPSDGVNAALNVLASMQQQNQRVLFDVANGRVGFSRELC 443


>gi|297820186|ref|XP_002877976.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297323814|gb|EFH54235.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 425

 Score =  127 bits (320), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 92/256 (35%), Positives = 131/256 (51%), Gaps = 15/256 (5%)

Query: 3   FVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ---INASTFSYCL 59
              +T+TL +  + N   GC +   G  + A GL+GLG G LS  SQ   +  STFSYCL
Sbjct: 175 LTQDTLTLATDVIPNYTFGCINKASGTSLPAQGLMGLGRGPLSLISQSQNLYQSTFSYCL 234

Query: 60  VD-RDSDSTSTLEF-DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFK 117
            + + S+ + +L     + P    T PLL+N    + YY+ L GI VG  ++ I  +A  
Sbjct: 235 PNSKSSNFSGSLRLGPKNQPIRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALA 294

Query: 118 IDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSV 177
            D +   G I DSGT  TRL    Y A+R+ F R  +  + T  +  FDTCY      SV
Sbjct: 295 FDPATGAGTIFDSGTVYTRLVEPAYVAMRNEFRRRVKNANATS-LGGFDTCYS----GSV 349

Query: 178 EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAF--APT--SSSLSIIGNVQQQGTRV 233
             P+V+F F  G  + LP  N LI   +    C A   APT  +S L++I ++QQQ  RV
Sbjct: 350 VFPSVTFMF-AGMNVTLPPDNLLIHSSAGNLSCLAMAAAPTNVNSVLNVIASMQQQNHRV 408

Query: 234 SFNLRNSLIGFTPNKC 249
             ++ NS +G +   C
Sbjct: 409 LIDVPNSRLGISRETC 424


>gi|359492937|ref|XP_002283889.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 439

 Score =  127 bits (319), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 89/258 (34%), Positives = 128/258 (49%), Gaps = 24/258 (9%)

Query: 6   ETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ---INASTFSYCLVDR 62
           +++ L   ++ + + GC +   G  +   GLLGLG G +S  SQ   + +  FSYC    
Sbjct: 191 DSLGLAVDTLPSYSFGCVNAVSGSTLPPQGLLGLGRGPMSLLSQSGSLYSGVFSYCF--- 247

Query: 63  DSDSTSTLEFDSSL-------PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETA 115
              S  +  F  SL       P N  T PLLRN    T YY+ LTG+SVG  L+P++   
Sbjct: 248 --PSFKSYYFSGSLRLGPLGQPKNIRTTPLLRNPHRPTLYYVNLTGVSVGRVLVPVAPEL 305

Query: 116 FKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRS 175
              D +   G I+DSGT +TR     Y A+RD F +  +   P   +  FDTC  F++ +
Sbjct: 306 LAFDPNTGAGTIIDSGTVITRFVEPVYAAIRDEFRKQVKG--PFATIGAFDTC--FAATN 361

Query: 176 SVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP----TSSSLSIIGNVQQQGT 231
               P V+FHF  G  L LP +N LI   +    C A A      +S L++I N+QQQ  
Sbjct: 362 EDIAPPVTFHF-TGMDLKLPLENTLIHSSAGSLACLAMAAAPNNVNSVLNVIANLQQQNL 420

Query: 232 RVSFNLRNSLIGFTPNKC 249
           R+ F++ NS +G     C
Sbjct: 421 RIMFDVTNSRLGIARELC 438


>gi|226494448|ref|NP_001141341.1| uncharacterized protein LOC100273432 precursor [Zea mays]
 gi|194704078|gb|ACF86123.1| unknown [Zea mays]
 gi|413953775|gb|AFW86424.1| aspartic proteinase nepenthesin-2 [Zea mays]
          Length = 471

 Score =  127 bits (319), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 93/253 (36%), Positives = 132/253 (52%), Gaps = 14/253 (5%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
           G   T+TV+ GS S  +   GCG +NEGLF  +AGL+GL    LS   Q+  S   +FSY
Sbjct: 228 GYLSTDTVSFGSTSYPSFYYGCGQDNEGLFGRSAGLIGLARNKLSLLYQLAPSLGYSFSY 287

Query: 58  CLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELD-TFYYLGLTGISVGGDLLPISETAF 116
           CL    + ST  L        +  +   + +  LD + Y++ L+G+SVGG  L +S +  
Sbjct: 288 CL--PTAASTGYLSIGPYNTGHYYSYTPMASSSLDASLYFITLSGMSVGGSPLAVSPS-- 343

Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSS 176
              E  +   I+DSGT +TRL T  + AL  A  +           ++ DTC++    S 
Sbjct: 344 ---EYSSLPTIIDSGTVITRLPTAVHTALSKAVAQAMAGAQRAPAFSILDTCFE-GQASQ 399

Query: 177 VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFN 236
           + VPTV   F  G  + L  +N LI VD + T C AFAPT S+ +IIGN QQQ   V ++
Sbjct: 400 LRVPTVVMAFAGGASMKLTTRNVLIDVD-DSTTCLAFAPTDST-AIIGNTQQQTFSVIYD 457

Query: 237 LRNSLIGFTPNKC 249
           +  S IGF+   C
Sbjct: 458 VAQSRIGFSAGGC 470


>gi|414589628|tpg|DAA40199.1| TPA: hypothetical protein ZEAMMB73_627989 [Zea mays]
          Length = 452

 Score =  127 bits (319), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 91/270 (33%), Positives = 125/270 (46%), Gaps = 22/270 (8%)

Query: 1   GDFVTETVTLGSASVDN-------IAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS 53
           G + TE  T  S+           +  GCG  N G     +G++G G   LS  SQ++  
Sbjct: 184 GVYATERFTFASSGGGGLTTTTVPLGFGCGSVNVGSLNNGSGIVGFGRNPLSLVSQLSIR 243

Query: 54  TFSYCLVDRDSDSTSTLEFDS-------SLPPNAVTAPLLRNHELDTFYYLGLTGISVGG 106
            FSYCL    S   STL F S              T PLL++ +  TFYY+  TG++VG 
Sbjct: 244 RFSYCLTSYASRRQSTLLFGSLSDGVYGDATGRVQTTPLLQSPQNPTFYYVHFTGLTVGA 303

Query: 107 DLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRAL-----SPTDG 161
             L I E+AF +   G+GG+IVDSGTA+T L       +  AF +  R       +P DG
Sbjct: 304 RRLRIPESAFALRPDGSGGVIVDSGTALTLLPAAVLAEVVRAFRQQLRLPFANGGNPEDG 363

Query: 162 VALF--DTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSS 219
           V           SS S + VP +  HF +G  L LP +NY++     G  C   A +   
Sbjct: 364 VCFLVPAAWRRSSSTSQMPVPRMVLHF-QGADLDLPRRNYVLDDHRRGRLCLLLADSGDD 422

Query: 220 LSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
            S IGN+ QQ  RV ++L    +   P +C
Sbjct: 423 GSTIGNLVQQDMRVLYDLEAETLSIAPARC 452


>gi|212274314|ref|NP_001130524.1| uncharacterized protein LOC100191623 [Zea mays]
 gi|194689376|gb|ACF78772.1| unknown [Zea mays]
 gi|224031455|gb|ACN34803.1| unknown [Zea mays]
 gi|238011528|gb|ACR36799.1| unknown [Zea mays]
 gi|238015454|gb|ACR38762.1| unknown [Zea mays]
          Length = 304

 Score =  127 bits (319), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 91/270 (33%), Positives = 125/270 (46%), Gaps = 22/270 (8%)

Query: 1   GDFVTETVTLGSASVDN-------IAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS 53
           G + TE  T  S+           +  GCG  N G     +G++G G   LS  SQ++  
Sbjct: 36  GVYATERFTFASSGGGGLTTTTVPLGFGCGSVNVGSLNNGSGIVGFGRNPLSLVSQLSIR 95

Query: 54  TFSYCLVDRDSDSTSTLEFDS-------SLPPNAVTAPLLRNHELDTFYYLGLTGISVGG 106
            FSYCL    S   STL F S              T PLL++ +  TFYY+  TG++VG 
Sbjct: 96  RFSYCLTSYASRRQSTLLFGSLSDGVYGDATGRVQTTPLLQSPQNPTFYYVHFTGLTVGA 155

Query: 107 DLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTR-----ALSPTDG 161
             L I E+AF +   G+GG+IVDSGTA+T L       +  AF +  R       +P DG
Sbjct: 156 RRLRIPESAFALRPDGSGGVIVDSGTALTLLPAAVLAEVVRAFRQQLRLPFANGGNPEDG 215

Query: 162 VALF--DTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSS 219
           V           SS S + VP +  HF +G  L LP +NY++     G  C   A +   
Sbjct: 216 VCFLVPAAWRRSSSTSQMPVPRMVLHF-QGADLDLPRRNYVLDDHRRGRLCLLLADSGDD 274

Query: 220 LSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
            S IGN+ QQ  RV ++L    +   P +C
Sbjct: 275 GSTIGNLVQQDMRVLYDLEAETLSIAPARC 304


>gi|15232503|ref|NP_191008.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|7288018|emb|CAB81805.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
 gi|17979257|gb|AAL49945.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
 gi|21700851|gb|AAM70549.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
 gi|332645705|gb|AEE79226.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 425

 Score =  127 bits (319), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 91/256 (35%), Positives = 129/256 (50%), Gaps = 15/256 (5%)

Query: 3   FVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ---INASTFSYCL 59
              +T+TL S  + N   GC +   G  + A GL+GLG G LS  SQ   +  STFSYCL
Sbjct: 175 LTQDTLTLASDVIPNYTFGCINKASGTSLPAQGLMGLGRGPLSLISQSQNLYQSTFSYCL 234

Query: 60  VD-RDSDSTSTLEF-DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFK 117
            + + S+ + +L     + P    T PLL+N    + YY+ L GI VG  ++ I  +A  
Sbjct: 235 PNSKSSNFSGSLRLGPKNQPIRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALA 294

Query: 118 IDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSV 177
            D +   G I DSGT  TRL    Y A+R+ F R  +  + T  +  FDTCY      SV
Sbjct: 295 FDPATGAGTIFDSGTVYTRLVEPAYVAVRNEFRRRVKNANATS-LGGFDTCYS----GSV 349

Query: 178 EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP----TSSSLSIIGNVQQQGTRV 233
             P+V+F F  G  + LP  N LI   +    C A A      +S L++I ++QQQ  RV
Sbjct: 350 VFPSVTFMF-AGMNVTLPPDNLLIHSSAGNLSCLAMAAAPVNVNSVLNVIASMQQQNHRV 408

Query: 234 SFNLRNSLIGFTPNKC 249
             ++ NS +G +   C
Sbjct: 409 LIDVPNSRLGISRETC 424


>gi|224072901|ref|XP_002303933.1| predicted protein [Populus trichocarpa]
 gi|222841365|gb|EEE78912.1| predicted protein [Populus trichocarpa]
          Length = 370

 Score =  127 bits (318), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 94/264 (35%), Positives = 127/264 (48%), Gaps = 29/264 (10%)

Query: 2   DFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ---INASTFSYC 58
           +   +T+ L    V   A GC     G  V   GLLG G G LSF SQ   +  STFSYC
Sbjct: 119 NLTRDTIALSMDPVPYYAFGCIQKATGSSVPPQGLLGFGRGPLSFLSQTQNLYKSTFSYC 178

Query: 59  LVDRDSDSTSTLEFDSSL-------PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPI 111
           L      S  TL F  SL       PP   T PLL+N    + YY+ L GI VG  ++ I
Sbjct: 179 L-----PSFRTLNFSGSLRLGPVGQPPRIKTTPLLKNPRRSSLYYVKLNGIRVGRKIVDI 233

Query: 112 SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR--GTRALSPTDGVALFDTCY 169
             +A   + +   G I DSGT  TRL    Y A+R+ F +  G   +S   G   FDTCY
Sbjct: 234 PRSALAFNPTTGAGTIFDSGTVFTRLVAPAYIAVRNEFRKRVGNATVSSLGG---FDTCY 290

Query: 170 DFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP----TSSSLSIIGN 225
                  +  PT++F F  G  + +P +N LI   +  T C A A      +S L++I +
Sbjct: 291 SV----PIVPPTITFMF-SGMNVTMPPENLLIHSTAGVTSCLAMAAAPDNVNSVLNVIAS 345

Query: 226 VQQQGTRVSFNLRNSLIGFTPNKC 249
           +QQQ  R+ F++ NS +G    +C
Sbjct: 346 MQQQNHRILFDVPNSRLGVAREQC 369


>gi|21617933|gb|AAM66983.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
          Length = 425

 Score =  127 bits (318), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 91/256 (35%), Positives = 129/256 (50%), Gaps = 15/256 (5%)

Query: 3   FVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ---INASTFSYCL 59
              +T+TL S  + N   GC +   G  + A GL+GLG G LS  SQ   +  STFSYCL
Sbjct: 175 LTQDTLTLASDVIPNYTFGCINKASGTSLPAQGLMGLGRGPLSLISQSQNLYQSTFSYCL 234

Query: 60  VD-RDSDSTSTLEF-DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFK 117
            + + S+ + +L     + P    T PLL+N    + YY+ L GI VG  ++ I  +A  
Sbjct: 235 PNSKSSNFSGSLRLGPKNQPIRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALA 294

Query: 118 IDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSV 177
            D +   G I DSGT  TRL    Y A+R+ F R  +  + T  +  FDTCY      SV
Sbjct: 295 FDPATGAGTIFDSGTVYTRLVEPAYVAVRNEFRRRVKNANATS-LGGFDTCYS----GSV 349

Query: 178 EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP----TSSSLSIIGNVQQQGTRV 233
             P+V+F F  G  + LP  N LI   +    C A A      +S L++I ++QQQ  RV
Sbjct: 350 VFPSVTFMF-AGMNVTLPPDNLLIHSSAGNLSCLAMAAAPVNVNSVLNVIASMQQQNHRV 408

Query: 234 SFNLRNSLIGFTPNKC 249
             ++ NS +G +   C
Sbjct: 409 LIDVPNSRLGISRETC 424


>gi|255566835|ref|XP_002524401.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
 gi|223536362|gb|EEF38012.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
          Length = 455

 Score =  127 bits (318), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 89/273 (32%), Positives = 130/273 (47%), Gaps = 25/273 (9%)

Query: 1   GDFVTETVTLGSAS-----VDNIAIGCGHNNEGL------FVGAAGLLGLGGGSLSFPSQ 49
           G F  E +TL +++     ++ ++ GCG    G       F GA G++GLG   +SF SQ
Sbjct: 181 GFFSKEALTLNTSTGKVKKLNGLSFGCGFRISGPSLTGASFEGAQGVMGLGRAPISFSSQ 240

Query: 50  IN---ASTFSYCLVDRDSDSTSTLEFDSSLPPNAVTA--------PLLRNHELDTFYYLG 98
           +     S FSYCL+D       T         N   +        PLL N    TFYY+ 
Sbjct: 241 LGRRFGSKFSYCLMDYTLSPPPTSFLTIGGAQNVAVSKKGIMSFTPLLINPLSPTFYYIA 300

Query: 99  LTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSP 158
           + G+ V G  LPI+ + + ID+ GNGG I+DSGT +T +    Y  +  AF +  +  SP
Sbjct: 301 IKGVYVNGVKLPINPSVWSIDDLGNGGTIIDSGTTLTFITEPAYTEILKAFKKRVKLPSP 360

Query: 159 TDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS 218
            +    FD C + S  +   +P +SF+   G V   P +NY I    +   C A  P S 
Sbjct: 361 AEPTPGFDLCMNVSGVTRPALPRMSFNLAGGSVFSPPPRNYFIET-GDQIKCLAVQPVSQ 419

Query: 219 S--LSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
               S++GN+ QQG  + F+   S +GFT   C
Sbjct: 420 DGGFSVLGNLMQQGFLLEFDRDKSRLGFTRRGC 452


>gi|145693994|gb|ABP93697.1| unknown protein isoform 2 [Lemna minor]
          Length = 351

 Score =  127 bits (318), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 94/242 (38%), Positives = 125/242 (51%), Gaps = 19/242 (7%)

Query: 16  DNIAIGCGHNNEGLFVGAAGLLGLGGGS-LSFPSQINAS---TFSYCLVDRDSDSTSTLE 71
            N   GCG NN GLF G AGL+GLG  S  S  SQ+  S    FSYCL    S S++T  
Sbjct: 120 KNFIFGCGQNNTGLFQGTAGLVGLGRSSTYSLNSQVAPSLGNVFSYCL---PSTSSATGY 176

Query: 72  FDSSLPPN--AVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVD 129
            +   P N    TA +L +  + T Y++ L GISVGG  L +S T F+     + G I+D
Sbjct: 177 LNIGNPQNTPGYTA-MLTDTRVPTLYFIDLIGISVGGTRLSLSSTVFQ-----SVGTIID 230

Query: 130 SGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEG 189
           SGT +TRL    Y+AL+ A        +    V + DTCYDFS  +SV  P +  HF  G
Sbjct: 231 SGTVITRLPPTAYSALKTAVRAAMTQYTLAPAVTILDTCYDFSRTTSVVYPVIVLHF-AG 289

Query: 190 KVLPLPAKNYLIPVDSNGTFCFAFAPTSSS--LSIIGNVQQQGTRVSFNLRNSLIGFTPN 247
             + +PA       +S+   C AFA  + S  + IIGNVQQ    V+++     IGF+  
Sbjct: 290 LDVRIPATGVFFVFNSS-QVCLAFAGNTDSTMIGIIGNVQQLTMEVTYDNELKRIGFSAG 348

Query: 248 KC 249
            C
Sbjct: 349 AC 350


>gi|115479489|ref|NP_001063338.1| Os09g0452800 [Oryza sativa Japonica Group]
 gi|51535938|dbj|BAD38020.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
 gi|113631571|dbj|BAF25252.1| Os09g0452800 [Oryza sativa Japonica Group]
 gi|125605918|gb|EAZ44954.1| hypothetical protein OsJ_29597 [Oryza sativa Japonica Group]
 gi|215740967|dbj|BAG97462.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 453

 Score =  126 bits (317), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 90/269 (33%), Positives = 127/269 (47%), Gaps = 21/269 (7%)

Query: 1   GDFVTETVTLGSASVDNIAI----GCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFS 56
           G + TE  T  S+S +  ++    GCG  N G    A+G++G G   LS  SQ++   FS
Sbjct: 186 GYYATERFTFASSSGETQSVPLGFGCGTMNVGSLNNASGIVGFGRDPLSLVSQLSIRRFS 245

Query: 57  YCLVDRDSDSTSTLEF----DSSLPPNAV----TAPLLRNHELDTFYYLGLTGISVGGDL 108
           YCL    S   STL+F    D  L  +A     T P+L++ +  TFYY+  TG++VG   
Sbjct: 246 YCLTPYASSRKSTLQFGSLADVGLYDDATGPVQTTPILQSAQNPTFYYVAFTGVTVGARR 305

Query: 109 LPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTR-----ALSPTDGVA 163
           L I  +AF +   G+GG+I+DSGTA+T         +  AF    R       SP DGV 
Sbjct: 306 LRIPASAFALRPDGSGGVIIDSGTALTLFPVAVLAEVVRAFRSQLRLPFANGSSPDDGVC 365

Query: 164 LFDTCYDFSSR---SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSL 220
                           V VP + FHF +G  L LP +NY++     G  C     +    
Sbjct: 366 FAAPAVAAGGGRMARQVAVPRMVFHF-QGADLDLPRENYVLEDHRRGHLCVLLGDSGDDG 424

Query: 221 SIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           + IGN  QQ  RV ++L    + F P +C
Sbjct: 425 ATIGNFVQQDMRVVYDLERETLSFAPVEC 453


>gi|413937239|gb|AFW71790.1| hypothetical protein ZEAMMB73_638381 [Zea mays]
          Length = 537

 Score =  126 bits (317), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 93/250 (37%), Positives = 130/250 (52%), Gaps = 22/250 (8%)

Query: 15  VDNIA---IGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ---INASTFSYCLVD-RDSDST 67
           VD +A    GC     G  V   GL+G G G LSFPSQ   +    FSYCL   + S+ +
Sbjct: 293 VDVVAAYTFGCLRVVTGGSVPPQGLVGFGCGPLSFPSQNKDVYGFVFSYCLPSYKSSNFS 352

Query: 68  STLEFDSSLPPNAVTA-PLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGI 126
           STL    +  P  +   PLL N    + YY+ + GI VGG  + +  +A   D +   G 
Sbjct: 353 STLRLGPAGQPKRIKMTPLLSNPHRPSLYYVNMVGIHVGGRPMLVPASALAFDPASGRGT 412

Query: 127 IVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDG-VALFDTCYDFSSRSSVEVPTVSFH 185
           IVD+GT  TRL    Y A+RD F    RA  P  G +  FDTCY+     ++ VPTV+F 
Sbjct: 413 IVDAGTMFTRLSAPVYAAVRDVFRSRVRA--PVTGPLGGFDTCYNV----TISVPTVTFS 466

Query: 186 FPEGKV-LPLPAKNYLIPVDSNGTFCFAFAPTSSS-----LSIIGNVQQQGTRVSFNLRN 239
           F +G+V + LP +N +I   S+G  C A A   S      L+++ ++QQQ  RV F++ N
Sbjct: 467 F-DGRVSVTLPEENVVIRSSSDGIACLAMAAGPSDGVDAVLNVLASMQQQNHRVLFDVAN 525

Query: 240 SLIGFTPNKC 249
             +GF+   C
Sbjct: 526 GRVGFSRELC 535


>gi|308081797|ref|NP_001182920.1| uncharacterized protein LOC100501208 [Zea mays]
 gi|238008190|gb|ACR35130.1| unknown [Zea mays]
 gi|413922182|gb|AFW62114.1| hypothetical protein ZEAMMB73_927324 [Zea mays]
          Length = 269

 Score =  126 bits (317), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 80/263 (30%), Positives = 120/263 (45%), Gaps = 15/263 (5%)

Query: 1   GDFVTETVTLGSAS--VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYC 58
           G   TET T G+      N+  GCG    G   GA+G++G+  G LS   Q++ + FSYC
Sbjct: 5   GVLATETFTFGAHQNFSANLTFGCGKLTNGTIAGASGIMGVSPGPLSVLKQLSITKFSYC 64

Query: 59  LVDRDSDSTSTLEFDS-------SLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPI 111
           L       TS + F +              T PLL+N   D +YY+ + GIS+G   L +
Sbjct: 65  LTPFTDHKTSPVMFGAMADLGKYKTTGKVQTIPLLKNPVEDIYYYVPMVGISIGSKRLDV 124

Query: 112 SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDF 171
            E    +   G GG ++DS T +  L    +  L+ A + G +  +    +  +  C++ 
Sbjct: 125 PEAILALRPDGTGGTVLDSATTLAYLVEPAFKELKKAVMEGMKLPAANRSIDDYPVCFEL 184

Query: 172 S---SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAF--APTSSSLSIIGNV 226
               S   V+VP +  HF     + LP  +Y     S G  C A   AP   + ++IGNV
Sbjct: 185 PRGMSMEGVQVPPLVLHFAGDAEMSLPRDSYF-QEPSPGMMCLAVMQAPFEGAPNVIGNV 243

Query: 227 QQQGTRVSFNLRNSLIGFTPNKC 249
           QQQ   V ++L N    + P KC
Sbjct: 244 QQQNMHVLYDLGNRKFSYAPTKC 266


>gi|125542690|gb|EAY88829.1| hypothetical protein OsI_10302 [Oryza sativa Indica Group]
          Length = 440

 Score =  126 bits (317), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 87/264 (32%), Positives = 126/264 (47%), Gaps = 22/264 (8%)

Query: 3   FVTETVTLGSASVDNIAIGCGHNNEG--LFVGAAGLLGLGGGSLSFPSQINA---STFSY 57
             ++T+ LG  ++ N   GC  +  G    +   GLLGLG G ++  SQ  +     FSY
Sbjct: 181 LASDTLRLGKDAIPNYTFGCVSSVTGPTTNMPRQGLLGLGRGPMALLSQAGSLYNGVFSY 240

Query: 58  CLVDRDSDSTSTLEFDSSL--------PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLL 109
           CL      S  +  F  SL        P +    P+LRN    + YY+ +TG+SVG   +
Sbjct: 241 CL-----PSYRSYYFSGSLRLGAGGGQPRSVRYTPMLRNPHRSSLYYVNVTGLSVGRAWV 295

Query: 110 PISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCY 169
            +   +F  D +   G +VDSGT +TR     Y ALR+ F R   A S    +  FDTC+
Sbjct: 296 KVPAGSFAFDAATGAGTVVDSGTVITRWTAPVYAALREEFRRQVAAPSGYTSLGAFDTCF 355

Query: 170 DFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT----SSSLSIIGN 225
           +    ++   P V+ H   G  L LP +N LI   +    C A A      +S +++I N
Sbjct: 356 NTDEVAAGGAPAVTVHMDGGVDLALPMENTLIHSSATPLACLAMAEAPQNVNSVVNVIAN 415

Query: 226 VQQQGTRVSFNLRNSLIGFTPNKC 249
           +QQQ  RV F++ NS IGF    C
Sbjct: 416 LQQQNIRVVFDVANSRIGFAKESC 439


>gi|413937238|gb|AFW71789.1| hypothetical protein ZEAMMB73_638381 [Zea mays]
          Length = 598

 Score =  126 bits (317), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 93/250 (37%), Positives = 130/250 (52%), Gaps = 22/250 (8%)

Query: 15  VDNIA---IGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ---INASTFSYCLVD-RDSDST 67
           VD +A    GC     G  V   GL+G G G LSFPSQ   +    FSYCL   + S+ +
Sbjct: 354 VDVVAAYTFGCLRVVTGGSVPPQGLVGFGCGPLSFPSQNKDVYGFVFSYCLPSYKSSNFS 413

Query: 68  STLEFDSSLPPNAVTA-PLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGI 126
           STL    +  P  +   PLL N    + YY+ + GI VGG  + +  +A   D +   G 
Sbjct: 414 STLRLGPAGQPKRIKMTPLLSNPHRPSLYYVNMVGIHVGGRPMLVPASALAFDPASGRGT 473

Query: 127 IVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDG-VALFDTCYDFSSRSSVEVPTVSFH 185
           IVD+GT  TRL    Y A+RD F    RA  P  G +  FDTCY+     ++ VPTV+F 
Sbjct: 474 IVDAGTMFTRLSAPVYAAVRDVFRSRVRA--PVTGPLGGFDTCYNV----TISVPTVTFS 527

Query: 186 FPEGKV-LPLPAKNYLIPVDSNGTFCFAFAPTSSS-----LSIIGNVQQQGTRVSFNLRN 239
           F +G+V + LP +N +I   S+G  C A A   S      L+++ ++QQQ  RV F++ N
Sbjct: 528 F-DGRVSVTLPEENVVIRSSSDGIACLAMAAGPSDGVDAVLNVLASMQQQNHRVLFDVAN 586

Query: 240 SLIGFTPNKC 249
             +GF+   C
Sbjct: 587 GRVGFSRELC 596


>gi|226504334|ref|NP_001141706.1| uncharacterized protein LOC100273835 precursor [Zea mays]
 gi|194705620|gb|ACF86894.1| unknown [Zea mays]
 gi|414885968|tpg|DAA61982.1| TPA: hypothetical protein ZEAMMB73_231717 [Zea mays]
          Length = 477

 Score =  126 bits (317), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 94/262 (35%), Positives = 130/262 (49%), Gaps = 23/262 (8%)

Query: 1   GDFVTETVTLG-SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINA---STFS 56
           G+   +T+TL  + +V     GCGHNN G F    GLLGLG G  S  SQ+ A   + FS
Sbjct: 225 GNLARDTLTLSPTDAVPGFVFGCGHNNAGSFGEIDGLLGLGRGKASLSSQVAARYGAGFS 284

Query: 57  YCLVDRDSDSTSTLEFD---SSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISE 113
           YCL    S +T  L F    ++ P NA    ++      +FYYL LTGI+V G  + +  
Sbjct: 285 YCLPSSPS-ATGYLSFSGAAAAAPTNAQFTEMVAGQH-PSFYYLNLTGITVAGRAIKVPP 342

Query: 114 TAFKIDESGNGGIIVDSGTAVTRLQTETYNALRD----AFVRGTRALSPTDGVALFDTCY 169
           + F        G I+DSGTA + L    Y ALR     A  R  RA S T    +FDTCY
Sbjct: 343 SVFAT----AAGTIIDSGTAFSCLPPSAYAALRSSVRSAMGRYKRAPSST----IFDTCY 394

Query: 170 DFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT--SSSLSIIGNVQ 227
           D +   +V +P+V+  F +G  + L     L    +    C AF P    +SL ++GN Q
Sbjct: 395 DLTGHETVRIPSVALVFADGATVHLHPSGVLYTWSNVSQTCLAFLPNPDDTSLGVLGNTQ 454

Query: 228 QQGTRVSFNLRNSLIGFTPNKC 249
           Q+   V +++ N  +GF  N C
Sbjct: 455 QRTLAVIYDVDNQKVGFGANGC 476


>gi|224143371|ref|XP_002324933.1| predicted protein [Populus trichocarpa]
 gi|222866367|gb|EEF03498.1| predicted protein [Populus trichocarpa]
          Length = 463

 Score =  126 bits (317), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 86/254 (33%), Positives = 122/254 (48%), Gaps = 13/254 (5%)

Query: 1   GDFVTETVTLGSASVD--NIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ---INASTF 55
           G   TET++      D  NI IGC     G  +G +G++GL    +S  SQ   I    F
Sbjct: 217 GTLATETISFSHLKYDFKNILIGCSDQVSGESLGESGIMGLNRSPISLASQTANIYDKLF 276

Query: 56  SYCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETA 115
           SYC +     ST  L F   +P +   +P+ +     + Y + +TGISVGG  L I  +A
Sbjct: 277 SYC-IPSTPGSTGHLTFGGKVPNDVRFSPVSKTAP-SSDYDIKMTGISVGGRKLLIDASA 334

Query: 116 FKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRS 175
           FKI  +      +DSG  +TRL  + Y+ALR  F    +     D     DTCYDFS+ S
Sbjct: 335 FKIAST------IDSGAVLTRLPPKAYSALRSVFREMMKGYPLLDQDDFLDTCYDFSNYS 388

Query: 176 SVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSF 235
           +V +P++S  F  G  + +     +  V  +  +C AFA     +SI GN QQ+   V F
Sbjct: 389 TVAIPSISVFFEGGVEMDIDVSGIMWQVPGSKVYCLAFAELDDEVSIFGNFQQKTYTVVF 448

Query: 236 NLRNSLIGFTPNKC 249
           +     IGF P  C
Sbjct: 449 DGAKERIGFAPGGC 462


>gi|15228044|ref|NP_181826.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|20197868|gb|AAM15292.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|20197965|gb|AAD21712.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|330255100|gb|AEC10194.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 527

 Score =  126 bits (317), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 101/273 (36%), Positives = 145/273 (53%), Gaps = 25/273 (9%)

Query: 1   GDFVTETVTL-------GSA--SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN 51
           GDF  ET T+       GS+   V N+  GCGH N GLF GA+GLLGLG G LSF SQ+ 
Sbjct: 253 GDFAVETFTVNLTTTEGGSSEYKVGNMMFGCGHWNRGLFSGASGLLGLGRGPLSFSSQLQ 312

Query: 52  A---STFSYCLVDRDSDS--TSTLEF--DSSL----PPNAVTAPLLRNHELDTFYYLGLT 100
           +    +FSYCLVDR+S++  +S L F  D  L      N  +    + + ++TFYY+ + 
Sbjct: 313 SLYGHSFSYCLVDRNSNTNVSSKLIFGEDKDLLNHTNLNFTSFVNGKENSVETFYYIQIK 372

Query: 101 GISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPT- 159
            I VGG  L I E  + I   G+GG I+DSGT ++      Y  +++ F    +   P  
Sbjct: 373 SILVGGKALDIPEETWNISSDGDGGTIIDSGTTLSYFAEPAYEIIKNKFAEKMKENYPIF 432

Query: 160 DGVALFDTCYDFS--SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT- 216
               + D C++ S    +++ +P +   F +G V   PA+N  I + S    C A   T 
Sbjct: 433 RDFPVLDPCFNVSGIEENNIHLPELGIAFVDGTVWNFPAENSFIWL-SEDLVCLAILGTP 491

Query: 217 SSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
            S+ SIIGN QQQ   + ++ + S +GFTP KC
Sbjct: 492 KSTFSIIGNYQQQNFHILYDTKRSRLGFTPTKC 524


>gi|125563959|gb|EAZ09339.1| hypothetical protein OsI_31611 [Oryza sativa Indica Group]
          Length = 453

 Score =  126 bits (317), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 90/269 (33%), Positives = 127/269 (47%), Gaps = 21/269 (7%)

Query: 1   GDFVTETVTLGSASVDNIAI----GCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFS 56
           G + TE  T  S+S +  ++    GCG  N G    A+G++G G   LS  SQ++   FS
Sbjct: 186 GYYATERFTFASSSGETQSVPLGFGCGTMNVGSLNNASGIVGFGRDPLSLVSQLSIRRFS 245

Query: 57  YCLVDRDSDSTSTLEF----DSSLPPNAV----TAPLLRNHELDTFYYLGLTGISVGGDL 108
           YCL    S   STL+F    D  L  +A     T P+L++ +  TFYY+  TG++VG   
Sbjct: 246 YCLTPYASSRKSTLQFGSLADVGLYDDATGPVQTTPILQSAQNPTFYYVAFTGVTVGARR 305

Query: 109 LPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTR-----ALSPTDGVA 163
           L I  +AF +   G+GG+I+DSGTA+T         +  AF    R       SP DGV 
Sbjct: 306 LRIPASAFALRPDGSGGVIIDSGTALTLFPAAVLAEVVRAFRSQLRLPFANGSSPDDGVC 365

Query: 164 LFDTCYDFSSR---SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSL 220
                           V VP + FHF +G  L LP +NY++     G  C     +    
Sbjct: 366 FAAPAVAAGGGRMARQVAVPRMVFHF-QGADLDLPRENYVLEDHRRGHLCVLLGDSGDDG 424

Query: 221 SIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           + IGN  QQ  RV ++L    + F P +C
Sbjct: 425 ATIGNFVQQDMRVVYDLERETLSFAPVEC 453


>gi|115451209|ref|NP_001049205.1| Os03g0186900 [Oryza sativa Japonica Group]
 gi|49532749|dbj|BAD26705.1| Radc1 [Oryza sativa Japonica Group]
 gi|108706569|gb|ABF94364.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113547676|dbj|BAF11119.1| Os03g0186900 [Oryza sativa Japonica Group]
 gi|215692805|dbj|BAG88249.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215767626|dbj|BAG99854.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 438

 Score =  126 bits (316), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 86/264 (32%), Positives = 126/264 (47%), Gaps = 22/264 (8%)

Query: 3   FVTETVTLGSASVDNIAIGCGHNNEG--LFVGAAGLLGLGGGSLSFPSQINA---STFSY 57
             ++T+ LG  ++ N   GC  +  G    +   GLLGLG G ++  SQ  +     FSY
Sbjct: 179 LASDTLRLGKDAIPNYTFGCVSSVTGPTTNMPRQGLLGLGRGPMALLSQAGSLYNGVFSY 238

Query: 58  CLVDRDSDSTSTLEFDSSL--------PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLL 109
           CL      S  +  F  SL        P +    P+LRN    + YY+ +TG+SVG   +
Sbjct: 239 CL-----PSYRSYYFSGSLRLGAGGGQPRSVRYTPMLRNPHRSSLYYVNVTGLSVGHAWV 293

Query: 110 PISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCY 169
            +   +F  D +   G +VDSGT +TR     Y ALR+ F R   A S    +  FDTC+
Sbjct: 294 KVPAGSFAFDAATGAGTVVDSGTVITRWTAPVYAALREEFRRQVAAPSGYTSLGAFDTCF 353

Query: 170 DFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT----SSSLSIIGN 225
           +    ++   P V+ H   G  L LP +N LI   +    C A A      +S +++I N
Sbjct: 354 NTDEVAAGGAPAVTVHMDGGVDLALPMENTLIHSSATPLACLAMAEAPQNVNSVVNVIAN 413

Query: 226 VQQQGTRVSFNLRNSLIGFTPNKC 249
           +QQQ  RV F++ NS +GF    C
Sbjct: 414 LQQQNIRVVFDVANSRVGFAKESC 437


>gi|224142009|ref|XP_002324353.1| predicted protein [Populus trichocarpa]
 gi|222865787|gb|EEF02918.1| predicted protein [Populus trichocarpa]
          Length = 412

 Score =  126 bits (316), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 101/255 (39%), Positives = 130/255 (50%), Gaps = 13/255 (5%)

Query: 1   GDFVTETVTLGSASV-DNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFS 56
           G F TET+TL S++V  N   GCG  N GLF GAAGLLGLG   L+ PSQ   +    FS
Sbjct: 164 GFFATETLTLSSSNVFKNFLFGCGQQNNGLFGGAAGLLGLGRTKLALPSQTAKTYKKLFS 223

Query: 57  YCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
           YCL    S S   L     +  +    PL  + +   FY L +TG+SVGG  L I E+AF
Sbjct: 224 YCL-PASSSSKGYLSLGGQVSKSVKFTPLSADFDSTPFYGLDITGLSVGGRQLSIDESAF 282

Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSS 176
                 + G ++DSGT +TRL    Y+ L  AF         T G ++FDTCYDFS   +
Sbjct: 283 ------SAGTVIDSGTVITRLSPTAYSELSSAFQNLMTDYPSTSGYSIFDTCYDFSKYDT 336

Query: 177 VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFA--PTSSSLSIIGNVQQQGTRVS 234
           V +P V   F  G  + +     L PV+     C AFA     S  SI GNVQQ+  +V 
Sbjct: 337 VRIPKVGVTFKGGVEMDIDVSGILYPVNGLKKVCLAFAGNDDDSDTSIFGNVQQRTYQVV 396

Query: 235 FNLRNSLIGFTPNKC 249
           ++     +GF P  C
Sbjct: 397 YDGAKGRVGFAPGGC 411


>gi|359473000|ref|XP_002278677.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 458

 Score =  126 bits (316), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 93/273 (34%), Positives = 126/273 (46%), Gaps = 25/273 (9%)

Query: 1   GDFVTETVTLGS-----ASVDNIAIGCGHNNEGL------FVGAAGLLGLGGGSLSFPSQ 49
           G F  ET TL +     A +  IA GC     G       F GA G++GLG G +S  SQ
Sbjct: 184 GFFSKETTTLNTSSGREAKLKGIAFGCAFRISGPSVSGASFNGAHGVMGLGRGPISLSSQ 243

Query: 50  IN---ASTFSYCLVDRDSDSTSTLEFDSSLPPNAVT--------APLLRNHELDTFYYLG 98
           +     + FSYCL+D D   + T         N V          PL  N    TFYY+G
Sbjct: 244 LGHRFGNKFSYCLMDHDISPSPTSYLLIGSTQNDVAPGKRRMRFTPLHINPLSPTFYYIG 303

Query: 99  LTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSP 158
           +  +SV G  LPI+ + + +DE GNGG IVDSGT +T L    Y  +     R  R  SP
Sbjct: 304 IESVSVDGIKLPINPSVWALDELGNGGTIVDSGTTLTFLPEPAYLQILTVIKRRVRLPSP 363

Query: 159 TDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP--T 216
            +    FD C + S      +P +SF      V   P +NY +  D +   C A     T
Sbjct: 364 AEPTPGFDLCVNVSEIEHPRLPKLSFKLGGDSVFSPPPRNYFVDTDED-VKCLALQAVMT 422

Query: 217 SSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
            S  S+IGN+ QQG  + F+   + +GF+ + C
Sbjct: 423 PSGFSVIGNLMQQGFLLEFDKDRTRLGFSRHGC 455


>gi|224142007|ref|XP_002324352.1| predicted protein [Populus trichocarpa]
 gi|222865786|gb|EEF02917.1| predicted protein [Populus trichocarpa]
          Length = 460

 Score =  125 bits (315), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 101/255 (39%), Positives = 130/255 (50%), Gaps = 13/255 (5%)

Query: 1   GDFVTETVTLGSASV-DNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFS 56
           G F TET+TL S++V  N   GCG  N GLF GAAGLLGLG   L+ PSQ   +    FS
Sbjct: 212 GFFATETLTLSSSNVFKNFLFGCGQQNNGLFGGAAGLLGLGRTKLALPSQTAKTYKKLFS 271

Query: 57  YCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
           YCL    S S   L     +  +    PL  + +   FY L +TG+SVGG  L I E+AF
Sbjct: 272 YCL-PASSSSKGYLSLGGQVSKSVKFTPLSADFDSTPFYGLDITGLSVGGRKLSIDESAF 330

Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSS 176
                 + G ++DSGT +TRL    Y+ L  AF         T G ++FDTCYDFS   +
Sbjct: 331 ------SAGTVIDSGTVITRLSPTAYSELSSAFQNLMTDYPSTSGYSIFDTCYDFSKYDT 384

Query: 177 VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFA--PTSSSLSIIGNVQQQGTRVS 234
           V +P V   F  G  + +     L PV+     C AFA     S  SI GNVQQ+  +V 
Sbjct: 385 VRIPKVGVTFKGGVEMDIDVSGILYPVNGLKKVCLAFAGNDDDSDTSIFGNVQQRTYQVV 444

Query: 235 FNLRNSLIGFTPNKC 249
           ++     +GF P  C
Sbjct: 445 YDGAKGRVGFAPGGC 459


>gi|115479815|ref|NP_001063501.1| Os09g0482200 [Oryza sativa Japonica Group]
 gi|50725878|dbj|BAD33407.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|50725881|dbj|BAD33410.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|113631734|dbj|BAF25415.1| Os09g0482200 [Oryza sativa Japonica Group]
 gi|125606112|gb|EAZ45148.1| hypothetical protein OsJ_29786 [Oryza sativa Japonica Group]
          Length = 485

 Score =  125 bits (315), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 93/256 (36%), Positives = 125/256 (48%), Gaps = 15/256 (5%)

Query: 1   GDFVTETVTL-GSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAST---FS 56
           G+ V +T+TL  S ++     GCG  N GLF    GL GLG   +S PSQ   S    F+
Sbjct: 237 GNLVRDTLTLSASDTLPGFVFGCGDQNAGLFGQVDGLFGLGREKVSLPSQGAPSYGPGFT 296

Query: 57  YCLVDRDSDSTSTLEFDSSLPPNA-VTAPLLRNHELDTFYYLGLTGISVGGDLLPISETA 115
           YCL    S     L    + P NA  TA  L +    +FYY+ L GI VGG  + I  TA
Sbjct: 297 YCLPS-SSSGRGYLSLGGAPPANAQFTA--LADGATPSFYYIDLVGIKVGGRAIRIPATA 353

Query: 116 FKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRS 175
           F          ++DSGT +TRL    Y  LR AF R          +++ DTCYDF+   
Sbjct: 354 FAAAGG----TVIDSGTVITRLPPRAYAPLRAAFARSMAQYKKAPALSILDTCYDFTGHR 409

Query: 176 SVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS--SSLSIIGNVQQQGTRV 233
           + ++PTV   F  G  + L     L  V      C AFAP +  SS++I+GN QQ+   V
Sbjct: 410 TAQIPTVELAFAGGATVSLDFTGVLY-VSKVSQACLAFAPNADDSSIAILGNTQQKTFAV 468

Query: 234 SFNLRNSLIGFTPNKC 249
           ++++ N  IGF    C
Sbjct: 469 AYDVANQRIGFGAKGC 484


>gi|242081123|ref|XP_002445330.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
 gi|241941680|gb|EES14825.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
          Length = 543

 Score =  125 bits (315), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 92/263 (34%), Positives = 124/263 (47%), Gaps = 21/263 (7%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN---ASTFSY 57
           G   T+TV LG AS+D    GCG +N GLF G AGL+GLG   LS  SQ        FSY
Sbjct: 285 GVLATDTVALGGASLDGFVFGCGLSNRGLFGGTAGLMGLGRTELSLVSQTALRYGGVFSY 344

Query: 58  CLVDRDS-DSTSTLEFDSSLPPNAVTAP-----LLRNHELDTFYYLGLTGISVGGDLLPI 111
           CL    S D++ +L           T P     ++ +     FY+L +TG +VGG     
Sbjct: 345 CLPATTSGDASGSLSLGGDASSYRNTTPVAYTRMIADPAQPPFYFLNVTGAAVGG----- 399

Query: 112 SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALS-PTD-GVALFDTCY 169
             TA      G   +++DSGT +TRL    Y  +R  F R   A   PT  G ++ DTCY
Sbjct: 400 --TALAAQGLGASNVLIDSGTVITRLAPSVYRGVRAEFTRQFAAAGYPTAPGFSILDTCY 457

Query: 170 DFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGT-FCFAFAPTS--SSLSIIGNV 226
           D +    V+VP ++     G  + + A   L  V  +G+  C A A  S      IIGN 
Sbjct: 458 DLTGHDEVKVPLLTLRLEGGAEVTVDAAGMLFVVRKDGSQVCLAMASLSYEDQTPIIGNY 517

Query: 227 QQQGTRVSFNLRNSLIGFTPNKC 249
           QQ+  RV ++   S +GF    C
Sbjct: 518 QQKNKRVVYDTVGSRLGFADEDC 540


>gi|224142005|ref|XP_002324351.1| predicted protein [Populus trichocarpa]
 gi|222865785|gb|EEF02916.1| predicted protein [Populus trichocarpa]
          Length = 472

 Score =  125 bits (315), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 101/255 (39%), Positives = 130/255 (50%), Gaps = 13/255 (5%)

Query: 1   GDFVTETVTLGSASV-DNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFS 56
           G F TET+TL S++V  N   GCG  N GLF GAAGLLGLG   L+ PSQ   +    FS
Sbjct: 224 GFFATETLTLSSSNVFKNFLFGCGQQNNGLFGGAAGLLGLGRTKLALPSQTAKTYKKLFS 283

Query: 57  YCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
           YCL    S S   L     +  +    PL  + +   FY L +TG+SVGG  L I E+AF
Sbjct: 284 YCL-PASSSSKGYLSLGGQVSKSVKFTPLSADFDSTPFYGLDITGLSVGGRKLSIDESAF 342

Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSS 176
                 + G ++DSGT +TRL    Y+ L  AF         T G ++FDTCYDFS   +
Sbjct: 343 ------SAGTVIDSGTVITRLSPTAYSELSSAFQNLMTDYPSTSGYSIFDTCYDFSKYDT 396

Query: 177 VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFA--PTSSSLSIIGNVQQQGTRVS 234
           V +P V   F  G  + +     L PV+     C AFA     S  SI GNVQQ+  +V 
Sbjct: 397 VRIPKVGVTFKGGVEMDIDVSGILYPVNGLKKVCLAFAGNDDDSDTSIFGNVQQRTYQVV 456

Query: 235 FNLRNSLIGFTPNKC 249
           ++     +GF P  C
Sbjct: 457 YDGAKGRVGFAPGGC 471


>gi|168005153|ref|XP_001755275.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162693403|gb|EDQ79755.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 429

 Score =  125 bits (315), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 97/254 (38%), Positives = 132/254 (51%), Gaps = 8/254 (3%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAST---FSY 57
           G   T+ VT+G+  + N+A GCG++N G F GA GL+GLG G LS  SQ+  +    FSY
Sbjct: 176 GALSTDDVTIGTGKIPNVAFGCGNSNLGTFAGAGGLVGLGKGPLSLVSQLGGTATKKFSY 235

Query: 58  CLVDRDSDSTSTLEF-DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
           CLV   S  TS L   DS+L       P+L N+   TFYY  L GISV G  +      F
Sbjct: 236 CLVPLGSTKTSPLYIGDSTLAGGVAYTPMLTNNNYPTFYYAELQGISVEGKAVNYPANTF 295

Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVAL-FDTCYDFSSRS 175
            I  +G GG+I+DSGT +T L  + +N +  A ++        DG     + C+  +  +
Sbjct: 296 DIAATGRGGLILDSGTTLTYLDVDAFNPMVAA-LKAALPYPEADGSFYGLEYCFSTAGVA 354

Query: 176 SVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSF 235
           +   PTV FHF  G  + L   N  I +D  GT C A A +S+  SI GN+QQ    +  
Sbjct: 355 NPTYPTVVFHF-NGADVALAPDNTFIALDFEGTTCLAMA-SSTGFSIFGNIQQLNHVIVH 412

Query: 236 NLRNSLIGFTPNKC 249
           +L N  IGF    C
Sbjct: 413 DLVNKRIGFKSANC 426


>gi|297828001|ref|XP_002881883.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297327722|gb|EFH58142.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 529

 Score =  125 bits (315), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 101/274 (36%), Positives = 144/274 (52%), Gaps = 27/274 (9%)

Query: 1   GDFVTETVTLG---------SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN 51
           GDF  ET T+             V+N+  GCGH N GLF GA+GLLGLG G LSF SQ+ 
Sbjct: 255 GDFAVETFTVNLTTTEGRSSEYKVENMMFGCGHWNRGLFSGASGLLGLGRGPLSFSSQLQ 314

Query: 52  A---STFSYCLVDRDSDS--TSTLEF--DSSL----PPNAVTAPLLRNHELDTFYYLGLT 100
           +    +FSYCLVDR+SD+  +S L F  D  L      N  +    + + ++TFYY+ + 
Sbjct: 315 SLYGHSFSYCLVDRNSDTNVSSKLIFGEDKDLLNHTNLNFTSFVNGKENSVETFYYIQIK 374

Query: 101 GISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRA--LSP 158
            I VGG+ L I E  + I   G GG I+DSGT ++      Y  +++ F    +   L  
Sbjct: 375 SILVGGEALDIPEETWNISPDGAGGTIIDSGTTLSYFAEPAYEIIKNKFAEKMKENYLVF 434

Query: 159 TDGVALFDTCYDFS--SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT 216
            D   + D C++ S    +++ +P +   F +G V   PA+N  I + S    C A   T
Sbjct: 435 RD-FPVLDPCFNVSGIEENNIHLPELGIAFADGAVWNFPAENSFIWL-SEDLVCLAILGT 492

Query: 217 -SSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
             S+ SIIGN QQQ   + ++ + S +GFTP KC
Sbjct: 493 PKSTFSIIGNYQQQNFHILYDTKMSRLGFTPTKC 526


>gi|125564143|gb|EAZ09523.1| hypothetical protein OsI_31798 [Oryza sativa Indica Group]
          Length = 485

 Score =  125 bits (314), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 93/256 (36%), Positives = 125/256 (48%), Gaps = 15/256 (5%)

Query: 1   GDFVTETVTL-GSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAST---FS 56
           G+ V +T+TL  S ++     GCG  N GLF    GL GLG   +S PSQ   S    F+
Sbjct: 237 GNLVRDTLTLSASDTLPGFVFGCGDQNAGLFGQVDGLFGLGREKVSLPSQGAPSYGPGFT 296

Query: 57  YCLVDRDSDSTSTLEFDSSLPPNA-VTAPLLRNHELDTFYYLGLTGISVGGDLLPISETA 115
           YCL    S     L    + P NA  TA  L +    +FYY+ L GI VGG  + I  TA
Sbjct: 297 YCLPS-SSSGRGYLSLGGAPPANAQFTA--LADGATPSFYYIDLVGIKVGGRAIRIPATA 353

Query: 116 FKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRS 175
           F          ++DSGT +TRL    Y  LR AF R          +++ DTCYDF+   
Sbjct: 354 FAAAGG----TVIDSGTVITRLPPRAYAPLRAAFARSMAQYKKAPALSILDTCYDFTGHR 409

Query: 176 SVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS--SSLSIIGNVQQQGTRV 233
           + ++PTV   F  G  + L     L  V      C AFAP +  SS++I+GN QQ+   V
Sbjct: 410 TAQIPTVELAFAGGATVSLDFTGVLY-VSKVSQACLAFAPNADDSSIAILGNTQQKTFAV 468

Query: 234 SFNLRNSLIGFTPNKC 249
           ++++ N  IGF    C
Sbjct: 469 TYDVANQRIGFGAKGC 484


>gi|326512608|dbj|BAJ99659.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 484

 Score =  125 bits (314), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 87/257 (33%), Positives = 121/257 (47%), Gaps = 15/257 (5%)

Query: 1   GDFVTETVTLGSASV-DNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINA---STFS 56
           G    +T+TL  + V      GCG  + GLF  A GL+GLG   +S  SQ  +   + FS
Sbjct: 234 GALARDTLTLTQSDVLPGFVFGCGEQDTGLFGRADGLVGLGREKVSLSSQAASKYGAGFS 293

Query: 57  YCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
           YCL    S +   L      P NA    +   H+  +FYY+ L G+ V G  + +S   F
Sbjct: 294 YCLPSSPS-AAGYLSLGGPAPANARFTAMETRHDSPSFYYVRLVGVKVAGRTVRVSPIVF 352

Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR--GTRALSPTDGVALFDTCYDFSSR 174
                   G ++DSGT +TRL    Y ALR AF R  G         +++ DTCYDF+  
Sbjct: 353 SA-----AGTVIDSGTVITRLPPRVYAALRSAFARSMGRYGYKRAPALSILDTCYDFTGH 407

Query: 175 SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT--SSSLSIIGNVQQQGTR 232
           ++V +P+V+  F  G  + L     L  V      C AFAP    +   IIGN QQ+   
Sbjct: 408 TTVRIPSVALVFAGGAAVGLDFSGVLY-VAKVSQACLAFAPNGDGADAGIIGNTQQKTLA 466

Query: 233 VSFNLRNSLIGFTPNKC 249
           V +++    IGF  N C
Sbjct: 467 VVYDVARQKIGFGANGC 483


>gi|413952720|gb|AFW85369.1| hypothetical protein ZEAMMB73_571116 [Zea mays]
          Length = 451

 Score =  125 bits (313), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 82/251 (32%), Positives = 119/251 (47%), Gaps = 18/251 (7%)

Query: 14  SVDNIAIGCGHNNEGLFVG-AAGLLGLGGGSLSFPSQINASTFSYCLVDRD-SDSTSTLE 71
           +V  +A GCG  N G+F    +G+ G G G LS PSQ+    FSYCL   D ++S  T  
Sbjct: 201 AVSGLAFGCGDYNTGVFASNESGIAGFGRGPLSLPSQLRVGRFSYCLTSHDETESNKTSA 260

Query: 72  FDSSLPPNAVTA---------PLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESG 122
                PPN + A         P++ +    TFYYL L GI+VG   LP+  + F + + G
Sbjct: 261 VFLGTPPNGLRAHSSGPFRSTPIIHSPSFPTFYYLSLEGITVGKTRLPVDSSVFALKKDG 320

Query: 123 NGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSR---SSVEV 179
           +GG ++DSGT VT      +  L++ FV     L   D  +       F        V V
Sbjct: 321 SGGTVIDSGTGVTTFPAAVFEQLKNEFV-AQLPLPRYDNTSEVGNLLCFQRPKGGKQVPV 379

Query: 180 PTVSFHFPEGKVLPLPAKNYLIPVDSN-GTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLR 238
           P + FH      + LP +NY IP D++ G  C         + +IGN QQQ   + +++ 
Sbjct: 380 PKLIFHLASAD-MDLPRENY-IPEDTDSGVMCLMINGAEVDMVLIGNFQQQNMHIVYDVE 437

Query: 239 NSLIGFTPNKC 249
           NS + F   +C
Sbjct: 438 NSKLLFASAQC 448


>gi|224142011|ref|XP_002324354.1| predicted protein [Populus trichocarpa]
 gi|222865788|gb|EEF02919.1| predicted protein [Populus trichocarpa]
          Length = 471

 Score =  125 bits (313), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 87/257 (33%), Positives = 128/257 (49%), Gaps = 16/257 (6%)

Query: 1   GDFVTETVTLGSASV-DNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFS 56
           G   TET+T+  + V +N  IGCG  N G F G AGLLGLG   ++ PSQ +++    FS
Sbjct: 223 GFLATETLTITPSDVFENFVIGCGERNGGRFSGTAGLLGLGRSPVALPSQTSSTYKNLFS 282

Query: 57  YCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
           YCL    S ST  L F   +   A   P+    ++   Y L ++GISVGG  LPI  + F
Sbjct: 283 YCL-PASSSSTGHLSFGGGVSQAAKFTPI--TSKIPELYGLDVSGISVGGRKLPIDPSVF 339

Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRS- 175
           +       G I+DSGT +T L +  ++AL  AF       + T G +    CYDFS  + 
Sbjct: 340 R-----TAGTIIDSGTTLTYLPSTAHSALSSAFQEMMTNYTLTKGTSGLQPCYDFSKHAN 394

Query: 176 -SVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP--TSSSLSIIGNVQQQGTR 232
            ++ +P +S  F  G  + +      I  +     C AF      + ++I GNVQQ+   
Sbjct: 395 DNITIPQISIFFEGGVEVDIDDSGIFIAANGLEEVCLAFKDNGNDTDVAIFGNVQQKTYE 454

Query: 233 VSFNLRNSLIGFTPNKC 249
           V +++   ++GF P  C
Sbjct: 455 VVYDVAKGMVGFAPGGC 471


>gi|293335955|ref|NP_001168399.1| uncharacterized protein LOC100382168 precursor [Zea mays]
 gi|223948009|gb|ACN28088.1| unknown [Zea mays]
 gi|413922066|gb|AFW61998.1| hypothetical protein ZEAMMB73_694403 [Zea mays]
          Length = 507

 Score =  124 bits (312), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 91/266 (34%), Positives = 126/266 (47%), Gaps = 24/266 (9%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
           G   T+TV LG AS+     GCG +N GLF G AGL+GLG   LS  SQ  +     FSY
Sbjct: 246 GVLATDTVALGGASLGGFVFGCGLSNRGLFGGTAGLMGLGRTELSLVSQTASRYGGVFSY 305

Query: 58  CLVDRDS-DSTSTLEF---DSSLPPNAVTAP-----LLRNHELDTFYYLGLTGISVGGDL 108
           CL    S D++ +L     D +      T P     ++ +     FY+L +TG +VGG  
Sbjct: 306 CLPAATSGDASGSLSLGGGDDAASSYRNTTPVAYTRMIADPAQPPFYFLNVTGAAVGG-- 363

Query: 109 LPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR--GTRALSPTDGVALFD 166
                TA      G   +++DSGT +TRL    Y A+R  F+R  G        G ++ D
Sbjct: 364 -----TALAAQGLGASNVLIDSGTVITRLAPSVYRAVRAEFMRQFGAAGYPAAPGFSILD 418

Query: 167 TCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGT-FCFAFAPTS--SSLSII 223
           TCYD +    V+VP ++     G  + + A   L  V  +G+  C A A  S      II
Sbjct: 419 TCYDLTGHDEVKVPLLTLRLEGGADVTVDAAGMLFVVRKDGSQVCLAMASLSYEDETPII 478

Query: 224 GNVQQQGTRVSFNLRNSLIGFTPNKC 249
           GN QQ+  RV ++   S +GF    C
Sbjct: 479 GNYQQKNKRVVYDTLGSRLGFADEDC 504


>gi|212275300|ref|NP_001130675.1| uncharacterized protein LOC100191778 precursor [Zea mays]
 gi|194706308|gb|ACF87238.1| unknown [Zea mays]
          Length = 467

 Score =  124 bits (312), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 96/252 (38%), Positives = 132/252 (52%), Gaps = 11/252 (4%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
           G    +TV+ GS SV N   GCG +NEGLF  +AGL+GL    LS   Q+  S   +FSY
Sbjct: 223 GYLSKDTVSFGSTSVPNFYYGCGQDNEGLFGQSAGLIGLARNKLSLLYQLAPSMGYSFSY 282

Query: 58  CLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFK 117
           CL    S S+  L   S  P      P+  +   D+ Y++ +TGI V G  L +S +A+ 
Sbjct: 283 CLPTSSSSSSGYLSIGSYNPGQYSYTPMASSSLDDSLYFIKMTGIKVAGKPLSVSSSAYS 342

Query: 118 IDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSV 177
              +     I+DSGT +TRL T  Y+AL  A     +        ++ DTC+     + +
Sbjct: 343 SLPT-----IIDSGTVITRLPTGVYSALSKAVAGAMKGTPRASAFSILDTCFQ-GQAARL 396

Query: 178 EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNL 237
            VP V+  F  G  L L A+N L+ VDS  T C AFAP  S+ +IIGN QQQ   V +++
Sbjct: 397 RVPEVTMAFAGGAALKLAARNLLVDVDS-ATTCLAFAPARSA-AIIGNTQQQTFSVVYDV 454

Query: 238 RNSLIGFTPNKC 249
           +NS IGF    C
Sbjct: 455 KNSKIGFAAGGC 466


>gi|125540928|gb|EAY87323.1| hypothetical protein OsI_08727 [Oryza sativa Indica Group]
          Length = 463

 Score =  124 bits (312), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 89/259 (34%), Positives = 124/259 (47%), Gaps = 25/259 (9%)

Query: 1   GDFVTETVTLGSAS--VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TF 55
           G +  +T+TL  AS  V     GC H   G      GL+GLGGG+ S  SQ  A+   +F
Sbjct: 220 GTYSRDTLTLSGASDAVKGFQFGCSHLESGFSDQTDGLMGLGGGAQSLVSQTAAAYGNSF 279

Query: 56  SYCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETA 115
           SYCL      S              VT  +LR+ ++ TFY   L  I+VGG  L +S + 
Sbjct: 280 SYCLPPTSGSSGFLTLGGGGGASGFVTTRMLRSKQIPTFYGARLQDIAVGGKQLGLSPSV 339

Query: 116 FKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRS 175
           F        G +VDSGT +TRL    Y+AL  AF  G +        ++ DTC+DF+ ++
Sbjct: 340 FA------AGSVVDSGTIITRLPPTAYSALSSAFKAGMKQYRSAPARSILDTCFDFAGQT 393

Query: 176 SVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTF---CFAFAPT--SSSLSIIGNVQQQG 230
            + +PTV+  F  G  + L         D NG     C AFA T    +  IIGNVQQ+ 
Sbjct: 394 QISIPTVALVFSGGAAIDL---------DPNGIMYGNCLAFAATGDDGTTGIIGNVQQRT 444

Query: 231 TRVSFNLRNSLIGFTPNKC 249
             V +++ +S +GF    C
Sbjct: 445 FEVLYDVGSSTLGFRSGAC 463


>gi|255543963|ref|XP_002513044.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223548055|gb|EEF49547.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 431

 Score =  124 bits (311), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 92/265 (34%), Positives = 127/265 (47%), Gaps = 29/265 (10%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ---INASTFSY 57
            +   + VTL + S+ +   GC     G  +   GLLGLG G +S  SQ   +  STFSY
Sbjct: 179 ANLSQDVVTLATDSIPSYTFGCLTEATGSSIPPQGLLGLGRGPMSLLSQTQNLYQSTFSY 238

Query: 58  CLVDRDSDSTSTLEFDSSL-------PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP 110
           CL      S  +L F  SL       P    T PLL+N    + YY+ L  I VG  ++ 
Sbjct: 239 CL-----PSFRSLNFSGSLRLGPVGQPKRIKTTPLLKNPRRSSLYYVNLMAIRVGRRVVD 293

Query: 111 ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR--GTRALSPTDGVALFDTC 168
           I  +A   + +   G I DSGT  TRL    Y A+RDAF +  G   ++   G   FDTC
Sbjct: 294 IPPSALAFNPTTGAGTIFDSGTVFTRLVAPAYTAVRDAFRKRVGNATVTSLGG---FDTC 350

Query: 169 YDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP----TSSSLSIIG 224
           Y     S +  PT++F F  G  + LP  N LI   ++   C A A      +S L++I 
Sbjct: 351 YT----SPIVAPTITFMF-SGMNVTLPPDNLLIHSTASSITCLAMAAAPDNVNSVLNVIA 405

Query: 225 NVQQQGTRVSFNLRNSLIGFTPNKC 249
           N+QQQ  R+ F++ NS +G     C
Sbjct: 406 NMQQQNHRILFDVPNSRLGVAREPC 430


>gi|357123876|ref|XP_003563633.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 503

 Score =  124 bits (311), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 102/261 (39%), Positives = 129/261 (49%), Gaps = 20/261 (7%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
           G F  +T+ +   ++     GCG  N GLF   AGLLGLG G  S   Q       +FSY
Sbjct: 251 GFFAKDTLAVAQDAIKGFKFGCGEKNRGLFGQTAGLLGLGRGPTSITVQAYEKYGGSFSY 310

Query: 58  CLVDRDSDSTSTLEF----DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLL-PIS 112
           CL    S +T  LEF     SS   NA T P+L + +  TFYY+GLTGI VGG  L  I 
Sbjct: 311 CL-PASSAATGYLEFGPLSPSSSGSNAKTTPMLTD-KGPTFYYVGLTGIRVGGKQLGAIP 368

Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVA--LFDTCYD 170
           E+ F      N G +VDSGT +TRL    Y AL  AF     A       A  + DTCYD
Sbjct: 369 ESVFS-----NSGTLVDSGTVITRLPDTAYAALSSAFAAAMAASGYKKAAAYSILDTCYD 423

Query: 171 FSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS--SSLSIIGNVQQ 228
           F+  S V +PTVS  F  G  L L A   +  + S    C  FA      S+ I+GN QQ
Sbjct: 424 FTGLSQVSLPTVSLVFQGGACLDLDASGIVYAI-SQSQVCLGFASNGDDESVGIVGNTQQ 482

Query: 229 QGTRVSFNLRNSLIGFTPNKC 249
           +   V +++   ++GF P  C
Sbjct: 483 RTYGVLYDVSKKVVGFAPGAC 503


>gi|413923782|gb|AFW63714.1| hypothetical protein ZEAMMB73_300584 [Zea mays]
          Length = 458

 Score =  124 bits (311), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 88/249 (35%), Positives = 127/249 (51%), Gaps = 17/249 (6%)

Query: 8   VTLGSASVDNIAIGCGHNNEGLFVGAA-GLLGLGGGSLSFPSQINAS---TFSYCLVDRD 63
           +TLGS+++ +   GC  +  G F     GL+GLGGG+ S  SQ   +    FSYCL    
Sbjct: 220 LTLGSSAMTDFQFGCSQSESGGFNDQTDGLMGLGGGAQSLASQTAGTFGTAFSYCLPPTS 279

Query: 64  SDST-STLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESG 122
             S   TL   SS     V  P+LR+ ++ T+Y + L  I VG   L +  + F      
Sbjct: 280 GSSGFLTLGTGSS---GFVKTPMLRSTQIPTYYVVLLESIKVGSQQLNLPTSVF------ 330

Query: 123 NGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPTV 182
           + G ++DSGT +TRL    Y+AL  AF  G +   P     + DTC+DFS +SS+ +PTV
Sbjct: 331 SAGSLMDSGTIITRLPPTAYSALSSAFKAGMQQYPPATPSGILDTCFDFSGQSSISIPTV 390

Query: 183 SFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT--SSSLSIIGNVQQQGTRVSFNLRNS 240
           +  F  G  + L     ++ + S+   C AF P    SSL IIGNVQQ+   V +++   
Sbjct: 391 TLVFSGGAAVDLAFDGIMLEISSS-IRCLAFTPNGDDSSLGIIGNVQQRTFEVLYDVGGG 449

Query: 241 LIGFTPNKC 249
            +GF    C
Sbjct: 450 AVGFKAGAC 458


>gi|219886223|gb|ACL53486.1| unknown [Zea mays]
 gi|238015146|gb|ACR38608.1| unknown [Zea mays]
 gi|413938611|gb|AFW73162.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
 gi|413938612|gb|AFW73163.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
 gi|413938613|gb|AFW73164.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
 gi|413938614|gb|AFW73165.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
          Length = 467

 Score =  124 bits (311), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 96/252 (38%), Positives = 132/252 (52%), Gaps = 11/252 (4%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
           G    +TV+ GS SV N   GCG +NEGLF  +AGL+GL    LS   Q+  S   +FSY
Sbjct: 223 GYLSKDTVSFGSTSVPNFYYGCGQDNEGLFGQSAGLIGLARNKLSLLYQLAPSMGYSFSY 282

Query: 58  CLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFK 117
           CL    S S+  L   S  P      P+  +   D+ Y++ +TGI V G  L +S +A+ 
Sbjct: 283 CLPTSSSSSSGYLSIGSYNPGQYSYTPMASSSLDDSLYFIKMTGIKVAGKPLSVSSSAYS 342

Query: 118 IDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSV 177
              +     I+DSGT +TRL T  Y+AL  A     +        ++ DTC+     + +
Sbjct: 343 SLPT-----IIDSGTVITRLPTGVYSALSKAVAGAMKGTPRASAFSILDTCFQ-GQAARL 396

Query: 178 EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNL 237
            VP V+  F  G  L L A+N L+ VDS  T C AFAP  S+ +IIGN QQQ   V +++
Sbjct: 397 RVPEVTMAFAGGAALKLAARNLLVDVDS-ATTCLAFAPARSA-AIIGNTQQQTFSVVYDV 454

Query: 238 RNSLIGFTPNKC 249
           +NS IGF    C
Sbjct: 455 KNSKIGFAAGGC 466


>gi|223975883|gb|ACN32129.1| unknown [Zea mays]
 gi|223975971|gb|ACN32173.1| unknown [Zea mays]
 gi|224034191|gb|ACN36171.1| unknown [Zea mays]
 gi|413938623|gb|AFW73174.1| aspartic proteinase nepenthesin-1 isoform 1 [Zea mays]
 gi|413938624|gb|AFW73175.1| aspartic proteinase nepenthesin-1 isoform 2 [Zea mays]
          Length = 465

 Score =  124 bits (311), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 96/252 (38%), Positives = 132/252 (52%), Gaps = 11/252 (4%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
           G    +TV+ GS SV N   GCG +NEGLF  +AGL+GL    LS   Q+  S   +FSY
Sbjct: 221 GYLSKDTVSFGSTSVPNFYYGCGQDNEGLFGQSAGLIGLARNKLSLLYQLAPSMGYSFSY 280

Query: 58  CLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFK 117
           CL    S S+  L   S  P      P+  +   D+ Y++ +TGI V G  L +S +A+ 
Sbjct: 281 CLPTSSSSSSGYLSIGSYNPGQYSYTPMASSSLDDSLYFIKMTGIKVAGKPLSVSSSAYS 340

Query: 118 IDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSV 177
              +     I+DSGT +TRL T  Y+AL  A     +        ++ DTC+     + +
Sbjct: 341 SLPT-----IIDSGTVITRLPTGVYSALSKAVAGAMKGTPRASAFSILDTCFQ-GQAARL 394

Query: 178 EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNL 237
            VP V+  F  G  L L A+N L+ VDS  T C AFAP  S+ +IIGN QQQ   V +++
Sbjct: 395 RVPEVTMAFAGGAALKLAARNLLVDVDS-ATTCLAFAPARSA-AIIGNTQQQTFSVVYDV 452

Query: 238 RNSLIGFTPNKC 249
           +NS IGF    C
Sbjct: 453 KNSKIGFAAGGC 464


>gi|24430421|dbj|BAC22609.1| 41 kD chloroplast nucleoid DNA binding protein (CND41) [Nicotiana
           sylvestris]
          Length = 502

 Score =  124 bits (311), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 90/261 (34%), Positives = 124/261 (47%), Gaps = 18/261 (6%)

Query: 1   GDFVTETVTLGSASV-DNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN---ASTFS 56
           G F  +T+TL    V D    GCG NN GLF   AGL+GLG   LS   Q        FS
Sbjct: 247 GFFAKDTLTLTQNDVFDGFMFGCGQNNRGLFGKTAGLIGLGRDPLSIVQQTAQKFGKYFS 306

Query: 57  YCL-VDRDSDSTSTLE-----FDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP 110
           YCL   R S+   T         S    N +T     + +  TFY++ + GISVGG  L 
Sbjct: 307 YCLPTSRGSNGHLTFGNGNGVKTSKAVKNGITFTPFASSQGATFYFIDVLGISVGGKALS 366

Query: 111 ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYD 170
           IS   F+     N G I+DSGT +TRL +  Y +L+  F +          ++L DTCYD
Sbjct: 367 ISPMLFQ-----NAGTIIDSGTVITRLPSTVYGSLKSTFKQFMSKYPTAPALSLLDTCYD 421

Query: 171 FSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT--SSSLSIIGNVQQ 228
            S+ +S+ +P +SF+F     + L     LI  +     C AFA      ++ I GN+QQ
Sbjct: 422 LSNYTSISIPKISFNFNGNANVDLEPNGILI-TNGASQVCLAFAGNGDDDTIGIFGNIQQ 480

Query: 229 QGTRVSFNLRNSLIGFTPNKC 249
           Q   V +++    +GF    C
Sbjct: 481 QTLEVVYDVAGGQLGFGYKGC 501


>gi|18391062|ref|NP_563851.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|2160166|gb|AAB60729.1| F21M12.13 gene product [Arabidopsis thaliana]
 gi|21593996|gb|AAM65914.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
 gi|26983826|gb|AAN86165.1| unknown protein [Arabidopsis thaliana]
 gi|332190367|gb|AEE28488.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 449

 Score =  124 bits (311), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 87/263 (33%), Positives = 126/263 (47%), Gaps = 23/263 (8%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ---INASTFSY 57
              V +T+TL    + N + GC ++  G  +   GL+GLG G +S  SQ   + +  FSY
Sbjct: 195 ASLVQDTLTLAPDVIPNFSFGCINSASGNSLPPQGLMGLGRGPMSLVSQTTSLYSGVFSY 254

Query: 58  CLVDRDSDSTSTLEFDSSL-------PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP 110
           CL      S  +  F  SL       P +    PLLRN    + YY+ LTG+SVG   +P
Sbjct: 255 CL-----PSFRSFYFSGSLKLGLLGQPKSIRYTPLLRNPRRPSLYYVNLTGVSVGSVQVP 309

Query: 111 ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYD 170
           +       D +   G I+DSGT +TR     Y A+RD F R    +S    +  FDTC  
Sbjct: 310 VDPVYLTFDANSGAGTIIDSGTVITRFAQPVYEAIRDEF-RKQVNVSSFSTLGAFDTC-- 366

Query: 171 FSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFA----PTSSSLSIIGNV 226
           FS+ +    P ++ H      L LP +N LI   +    C + A      ++ L++I N+
Sbjct: 367 FSADNENVAPKITLHMTSLD-LKLPMENTLIHSSAGTLTCLSMAGIRQNANAVLNVIANL 425

Query: 227 QQQGTRVSFNLRNSLIGFTPNKC 249
           QQQ  R+ F++ NS IG  P  C
Sbjct: 426 QQQNLRILFDVPNSRIGIAPEPC 448


>gi|195638734|gb|ACG38835.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
          Length = 465

 Score =  124 bits (311), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 96/252 (38%), Positives = 132/252 (52%), Gaps = 11/252 (4%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
           G    +TV+ GS SV N   GCG +NEGLF  +AGL+GL    LS   Q+  S   +FSY
Sbjct: 221 GYLSKDTVSFGSTSVPNFYYGCGQDNEGLFGQSAGLIGLARNKLSLLYQLAPSMGYSFSY 280

Query: 58  CLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFK 117
           CL    S S+  L   S  P      P+  +   D+ Y++ +TGI V G  L +S +A+ 
Sbjct: 281 CLPTSSSSSSGYLSIGSYNPGQYSYTPMASSSLDDSLYFIKMTGIKVAGKPLSVSSSAYS 340

Query: 118 IDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSV 177
              +     I+DSGT +TRL T  Y+AL  A     +        ++ DTC+     + +
Sbjct: 341 SLPT-----IIDSGTVITRLPTGVYSALSKAVAGAMKGTPRASAFSILDTCFQ-GQAARL 394

Query: 178 EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNL 237
            VP V+  F  G  L L A+N L+ VDS  T C AFAP  S+ +IIGN QQQ   V +++
Sbjct: 395 RVPEVTMAFAGGAALKLAARNLLVDVDS-ATTCLAFAPARSA-AIIGNTQQQTFSVVYDV 452

Query: 238 RNSLIGFTPNKC 249
           +NS IGF    C
Sbjct: 453 KNSKIGFAAAGC 464


>gi|449461377|ref|XP_004148418.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
 gi|449518059|ref|XP_004166061.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
          Length = 436

 Score =  124 bits (311), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 91/257 (35%), Positives = 130/257 (50%), Gaps = 16/257 (6%)

Query: 3   FVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ---INASTFSYCL 59
            V +++ LG   + N + GC  +  G  +   GL+GLG G LS  SQ   + +  FSYCL
Sbjct: 185 LVQDSLHLGPNVIPNFSFGCISSASGSSIPPQGLMGLGRGPLSLISQSGSLYSGLFSYCL 244

Query: 60  VDRDSDSTS-TLEFDSSLPPNAV-TAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFK 117
               S   S +L+      P A+ T PLL N    + YY+ LTGISVG  L+PIS     
Sbjct: 245 PSFKSYYFSGSLKLGPVGQPKAIRTTPLLHNPHRPSLYYVNLTGISVGRVLVPISPELLA 304

Query: 118 IDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGT-RALSPTDGVALFDTCYDFSSRSS 176
            D +   G I+DSGT +TR     Y A+RD F +    + SP   +  FDTC  F++ + 
Sbjct: 305 FDPNTGAGTIIDSGTVITRFVPAIYTAVRDEFRKQVGGSFSP---LGAFDTC--FATNNE 359

Query: 177 VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT----SSSLSIIGNVQQQGTR 232
           V  P ++ H   G  L LP +N LI   +    C A A      +S +++I N+QQQ  R
Sbjct: 360 VSAPAITLHL-SGLDLKLPMENSLIHSSAGSLACLAMAAAPNNVNSVVNVIANLQQQNHR 418

Query: 233 VSFNLRNSLIGFTPNKC 249
           + F++ NS +G     C
Sbjct: 419 ILFDINNSKLGIARELC 435


>gi|21450872|gb|AAK44106.2|AF370291_1 unknown protein [Arabidopsis thaliana]
          Length = 375

 Score =  124 bits (311), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 87/263 (33%), Positives = 126/263 (47%), Gaps = 23/263 (8%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ---INASTFSY 57
              V +T+TL    + N + GC ++  G  +   GL+GLG G +S  SQ   + +  FSY
Sbjct: 121 ASLVQDTLTLAPDVIPNFSFGCINSASGNSLPPQGLMGLGRGPMSLVSQTTSLYSGVFSY 180

Query: 58  CLVDRDSDSTSTLEFDSSL-------PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP 110
           CL      S  +  F  SL       P +    PLLRN    + YY+ LTG+SVG   +P
Sbjct: 181 CL-----PSFRSFYFSGSLKLGLLGQPKSIRYTPLLRNPRRPSLYYVNLTGVSVGSVQVP 235

Query: 111 ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYD 170
           +       D +   G I+DSGT +TR     Y A+RD F R    +S    +  FDTC  
Sbjct: 236 VDPVYLTFDANSGAGTIIDSGTVITRFAQPVYEAIRDEF-RKQVNVSSFSTLGAFDTC-- 292

Query: 171 FSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFA----PTSSSLSIIGNV 226
           FS+ +    P ++ H      L LP +N LI   +    C + A      ++ L++I N+
Sbjct: 293 FSADNENVAPKITLHMTSLD-LKLPMENTLIHSSAGTLTCLSMAGIRQNANAVLNVIANL 351

Query: 227 QQQGTRVSFNLRNSLIGFTPNKC 249
           QQQ  R+ F++ NS IG  P  C
Sbjct: 352 QQQNLRILFDVPNSRIGIAPEPC 374


>gi|222640709|gb|EEE68841.1| hypothetical protein OsJ_27628 [Oryza sativa Japonica Group]
          Length = 375

 Score =  124 bits (310), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 85/266 (31%), Positives = 127/266 (47%), Gaps = 18/266 (6%)

Query: 1   GDFVTETVTLGSASVDNIAIG--CGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYC 58
           G   +ET T G+    ++ +G  CG  + G  +GA G+LGL   SLS  +Q+    FSYC
Sbjct: 108 GVLASETFTFGARRAVSLRLGFGCGALSAGSLIGATGILGLSPESLSLITQLKIQRFSYC 167

Query: 59  LVDRDSDSTSTLEFDS--SLPPNAVTAPL-----LRNHELDTFYYLGLTGISVGGDLLPI 111
           L       TS L F +   L  +  T P+     + N     +YY+ L GIS+G   L +
Sbjct: 168 LTPFADKKTSPLLFGAMADLSRHKTTRPIQTTAIVSNPVETVYYYVPLVGISLGHKRLAV 227

Query: 112 SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDF 171
              +  +   G GG IVDSG+ V  L    + A+++A +   R       V  ++ C+  
Sbjct: 228 PAASLAMRPDGGGGTIVDSGSTVAYLVEAAFEAVKEAVMDVVRLPVANRTVEDYELCFVL 287

Query: 172 SSRS------SVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS--SSLSII 223
             R+      +V+VP +  HF  G  + LP  NY     + G  C A   T+  S +SII
Sbjct: 288 PRRTAAAAMEAVQVPPLVLHFDGGAAMVLPRDNYFQEPRA-GLMCLAVGKTTDGSGVSII 346

Query: 224 GNVQQQGTRVSFNLRNSLIGFTPNKC 249
           GNVQQQ   V F++++    F P +C
Sbjct: 347 GNVQQQNMHVLFDVQHHKFSFAPTQC 372


>gi|242092878|ref|XP_002436929.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
 gi|241915152|gb|EER88296.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
          Length = 505

 Score =  124 bits (310), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 93/258 (36%), Positives = 133/258 (51%), Gaps = 20/258 (7%)

Query: 6   ETVTLGSAS-VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINA---STFSYCLVD 61
           ET++L S   +   A GCG  N G F G  GL+GLG G+LS PSQ  A   +TFSYCL  
Sbjct: 254 ETLSLSSTRDLPGFAFGCGQTNLGEFGGVDGLVGLGRGALSLPSQAAATFGATFSYCLPS 313

Query: 62  RDSDSTSTLEFDSSLPP------NAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETA 115
            D+ +   L   S+ P       +     +++  +  + Y++ +  I +GG +LP+  T 
Sbjct: 314 YDT-THGYLTMGSTTPAASNDDDDVQYTAMIQKEDYPSLYFVEVVSIDIGGYILPVPPTV 372

Query: 116 FKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRS 175
           F  D     G + DSGT +T L  E Y +LRD F        P      FDTCYDF+  +
Sbjct: 373 FTRD-----GTLFDSGTILTYLPPEAYASLRDRFKFTMTQYKPAPAYDPFDTCYDFTGHN 427

Query: 176 SVEVPTVSFHFPEGKVLPL-PAKNYLIPVDSN-GTFCFAFAPTSSSL--SIIGNVQQQGT 231
           ++ +P V+F F +G V  L P    + P D+   T C AF P  S++  +IIGN QQ+GT
Sbjct: 428 AIFMPAVAFKFSDGAVFDLSPVAILIYPDDTAPATGCLAFVPRPSTMPFNIIGNTQQRGT 487

Query: 232 RVSFNLRNSLIGFTPNKC 249
            V +++    IGF    C
Sbjct: 488 EVIYDVAAEKIGFGQFTC 505


>gi|302821814|ref|XP_002992568.1| hypothetical protein SELMODRAFT_46291 [Selaginella moellendorffii]
 gi|300139637|gb|EFJ06374.1| hypothetical protein SELMODRAFT_46291 [Selaginella moellendorffii]
          Length = 368

 Score =  124 bits (310), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 97/280 (34%), Positives = 137/280 (48%), Gaps = 31/280 (11%)

Query: 1   GDFVTETVTLGS-------ASVDNIAIGCGHNNEGLFV--GAAGLLGLGGGSLSFPSQIN 51
           GDF  + + L S           ++A GC H+ +G  V  G+ G++G   G+LS PSQ+ 
Sbjct: 89  GDFSQDVIFLNSTNSSSQAVQFRDVAFGCAHSPQGFLVDLGSLGIVGFNRGNLSLPSQLK 148

Query: 52  ----ASTFSYCLVDRDSDSTST---LEFDSSLPPNAVT-APLLRNH---ELDTFYYLGLT 100
                S FSYC   +     +T      DS L  + V+  PLL N         YY+GLT
Sbjct: 149 DRLGGSKFSYCFPSQPWQPRATGVIFLGDSGLSKSKVSYTPLLDNPVTPARSQLYYVGLT 208

Query: 101 GISVGGDLLPISETAFKIDES-GNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRA-LSP 158
            ISV G  L I E+AFK+D S G+GG ++DSGT  TR+  + Y A R+AF    R+ L  
Sbjct: 209 SISVDGKTLAIPESAFKLDPSTGDGGTVLDSGTTFTRVVDDAYTAFRNAFAASNRSGLRK 268

Query: 159 TDGVAL-FDTCYDFSSRSSVE-VPTVSFHFPEGKVLPLPAKNYLIPVDSNG---TFCFAF 213
             G A  FD CY+ S+ SS+  VP V         L L  ++  +PV + G   T C A 
Sbjct: 269 KVGAAAGFDDCYNISAGSSLPGVPEVRLSLQNNVRLELRFEHLFVPVSAAGNEVTVCLAI 328

Query: 214 APTSSS----LSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
             +  S    ++++GN QQ    V ++   S +GF    C
Sbjct: 329 LSSQKSGFGKINVLGNYQQSNYLVEYDNERSRVGFERADC 368


>gi|148910443|gb|ABR18297.1| unknown [Picea sitchensis]
          Length = 452

 Score =  124 bits (310), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 93/266 (34%), Positives = 135/266 (50%), Gaps = 35/266 (13%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHN-----NEGLFVGAAGLLGLGGGSLSFPSQINASTF 55
           G   ++ +TLGS  + N + GC  +     +    +   G   L   + +  +++   TF
Sbjct: 201 GTLASDAITLGSQYLPNFSFGCAESLSEDTSPSPGLMGLGGGSLSLLTQAPTAELFGGTF 260

Query: 56  SYCLV------------DRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGIS 103
           SYCL                + S+S+L+F +          L+++  + TFY++ L  IS
Sbjct: 261 SYCLPSSSTSSGSLVLGKEAAVSSSSLKFTT----------LIKDPSIPTFYFVTLKAIS 310

Query: 104 VGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVA 163
           VG   + +  T      +  GG I+DSGT +T L    Y ALRDAF +   +L PT  V 
Sbjct: 311 VGNTRISVPGTNI----ASGGGTIIDSGTTITHLVPSAYTALRDAFRQQLSSLQPTP-VE 365

Query: 164 LFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSII 223
             DTCYD SS SSV+VPT++ H      L LP +N LI  +S G  C AF+ T S  SII
Sbjct: 366 DMDTCYDLSS-SSVDVPTITLHLDRNVDLVLPKENILITQES-GLACLAFSSTDSR-SII 422

Query: 224 GNVQQQGTRVSFNLRNSLIGFTPNKC 249
           GNVQQQ  R+ F++ NS +GF   +C
Sbjct: 423 GNVQQQNWRIVFDVPNSQVGFAQEQC 448


>gi|297843774|ref|XP_002889768.1| hypothetical protein ARALYDRAFT_471076 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297335610|gb|EFH66027.1| hypothetical protein ARALYDRAFT_471076 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 449

 Score =  124 bits (310), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 87/263 (33%), Positives = 126/263 (47%), Gaps = 24/263 (9%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ---INASTFSY 57
            + V +T+TL    + N + GC ++  G  +   GL+GLG G +S  SQ   + +  FSY
Sbjct: 196 ANLVQDTLTLSPDVIPNFSFGCINSASGNSLPPQGLMGLGRGPMSLVSQTTSLYSGVFSY 255

Query: 58  CLVDRDSDSTSTLEFDSSL-------PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP 110
           CL      S  +  F  SL       P +    PLLRN    + YY+ LTG+SVG   +P
Sbjct: 256 CL-----PSFRSFYFSGSLKLGLLGQPKSIRYTPLLRNPRRPSLYYVNLTGVSVGSVQVP 310

Query: 111 ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYD 170
           +       D +   G I+DSGT +TR     Y A+RD F +       T G   FDTC  
Sbjct: 311 VDPVYLTFDSNSGAGTIIDSGTVITRFAQPVYEAIRDEFRKQVNGSFSTLGA--FDTC-- 366

Query: 171 FSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFA----PTSSSLSIIGNV 226
           FS+ +    P ++ H      L LP +N LI   +    C + A      ++ L++I N+
Sbjct: 367 FSADNENVTPKITLHMTSLD-LKLPMENTLIHSSAGTLTCLSMAGIRQNANAVLNVIANL 425

Query: 227 QQQGTRVSFNLRNSLIGFTPNKC 249
           QQQ  R+ F++ NS IG  P  C
Sbjct: 426 QQQNLRILFDVPNSRIGIAPEPC 448


>gi|125561849|gb|EAZ07297.1| hypothetical protein OsI_29545 [Oryza sativa Indica Group]
          Length = 451

 Score =  124 bits (310), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 85/266 (31%), Positives = 127/266 (47%), Gaps = 18/266 (6%)

Query: 1   GDFVTETVTLGSASVDNIAIG--CGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYC 58
           G   +ET T G+    ++ +G  CG  + G  +GA G+LGL   SLS  +Q+    FSYC
Sbjct: 184 GVLASETFTFGARRAVSLRLGFGCGALSAGSLIGATGILGLSPESLSLITQLKIQRFSYC 243

Query: 59  LVDRDSDSTSTLEFDS--SLPPNAVTAPL-----LRNHELDTFYYLGLTGISVGGDLLPI 111
           L       TS L F +   L  +  T P+     + N     +YY+ L GIS+G   L +
Sbjct: 244 LTPFADKKTSPLLFGAMADLSRHKTTRPIQTTAIVSNPVKTVYYYVPLVGISLGHKRLAV 303

Query: 112 SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDF 171
              +  +   G GG IVDSG+ V  L    + A+++A +   R       V  ++ C+  
Sbjct: 304 PAASLAMRPDGGGGTIVDSGSTVAYLVEAAFEAVKEAVMDVVRLPVANRTVEDYELCFVL 363

Query: 172 SSRS------SVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS--SSLSII 223
             R+      +V+VP +  HF  G  + LP  NY     + G  C A   T+  S +SII
Sbjct: 364 PRRTAAAAMEAVQVPPLVLHFDGGAAMVLPRDNYFQEPRA-GLMCLAVGKTTDGSGVSII 422

Query: 224 GNVQQQGTRVSFNLRNSLIGFTPNKC 249
           GNVQQQ   V F++++    F P +C
Sbjct: 423 GNVQQQNMHVLFDVQHHKFSFAPTQC 448


>gi|115476830|ref|NP_001062011.1| Os08g0469100 [Oryza sativa Japonica Group]
 gi|42407408|dbj|BAD09566.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
 gi|113623980|dbj|BAF23925.1| Os08g0469100 [Oryza sativa Japonica Group]
          Length = 373

 Score =  123 bits (309), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 85/266 (31%), Positives = 127/266 (47%), Gaps = 18/266 (6%)

Query: 1   GDFVTETVTLGSASVDNIAIG--CGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYC 58
           G   +ET T G+    ++ +G  CG  + G  +GA G+LGL   SLS  +Q+    FSYC
Sbjct: 106 GVLASETFTFGARRAVSLRLGFGCGALSAGSLIGATGILGLSPESLSLITQLKIQRFSYC 165

Query: 59  LVDRDSDSTSTLEFDS--SLPPNAVTAPL-----LRNHELDTFYYLGLTGISVGGDLLPI 111
           L       TS L F +   L  +  T P+     + N     +YY+ L GIS+G   L +
Sbjct: 166 LTPFADKKTSPLLFGAMADLSRHKTTRPIQTTAIVSNPVETVYYYVPLVGISLGHKRLAV 225

Query: 112 SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDF 171
              +  +   G GG IVDSG+ V  L    + A+++A +   R       V  ++ C+  
Sbjct: 226 PAASLAMRPDGGGGTIVDSGSTVAYLVEAAFEAVKEAVMDVVRLPVANRTVEDYELCFVL 285

Query: 172 SSRS------SVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS--SSLSII 223
             R+      +V+VP +  HF  G  + LP  NY     + G  C A   T+  S +SII
Sbjct: 286 PRRTAAAAMEAVQVPPLVLHFDGGAAMVLPRDNYFQEPRA-GLMCLAVGKTTDGSGVSII 344

Query: 224 GNVQQQGTRVSFNLRNSLIGFTPNKC 249
           GNVQQQ   V F++++    F P +C
Sbjct: 345 GNVQQQNMHVLFDVQHHKFSFAPTQC 370


>gi|413923780|gb|AFW63712.1| hypothetical protein ZEAMMB73_689747 [Zea mays]
          Length = 470

 Score =  123 bits (309), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 92/257 (35%), Positives = 129/257 (50%), Gaps = 22/257 (8%)

Query: 1   GDFVTETVTLGS-ASVDNIAIGCGHNNEG-LFVGAAGLLGLGGGSLSFPSQINAS---TF 55
           G + ++T+TL + A+V     GCGH   G LF G  GLLG G    S   Q   +    F
Sbjct: 228 GVYSSDTLTLAANATVQGFLFGCGHAQSGGLFTGIDGLLGFGREQPSLVQQTAGAYGGVF 287

Query: 56  SYCLVDRDSDSTS-TLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISET 114
           SYCL  + S +   TL   S + P   T  LL +    T+Y + LTGISVGG  L +  +
Sbjct: 288 SYCLPTKSSTTGYLTLGGPSGVAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQPLSVPAS 347

Query: 115 AFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSR 174
           AF        G +VD+GT +TRL    Y ALR AF  G  +      + + DTCY F+  
Sbjct: 348 AFA------AGTVVDTGTVITRLPPAAYAALRSAFRSGMASYPSAPPIGILDTCYSFAGY 401

Query: 175 SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVS 234
            +V + +V+  F  G  + L A   +    S G   FA + +  S++I+GNVQQ+    S
Sbjct: 402 GTVNLTSVALTFSSGATMTLGADGIM----SFGCLAFASSGSDGSMAILGNVQQR----S 453

Query: 235 FNLR--NSLIGFTPNKC 249
           F +R   S +GF P+ C
Sbjct: 454 FEVRIDGSSVGFRPSSC 470


>gi|302781476|ref|XP_002972512.1| hypothetical protein SELMODRAFT_441822 [Selaginella moellendorffii]
 gi|300159979|gb|EFJ26598.1| hypothetical protein SELMODRAFT_441822 [Selaginella moellendorffii]
          Length = 496

 Score =  123 bits (309), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 97/280 (34%), Positives = 136/280 (48%), Gaps = 31/280 (11%)

Query: 1   GDFVTETVTLGS-------ASVDNIAIGCGHNNEGLFV--GAAGLLGLGGGSLSFPSQIN 51
           GDF  + + L S           ++A GC H+ +G  V  G+ G++G   G+LS PSQ+ 
Sbjct: 190 GDFSQDVIFLNSTNSSGQAVQFRDVAFGCAHSPQGFLVDLGSLGIVGFNRGNLSLPSQLK 249

Query: 52  ----ASTFSYCLVDRDSDSTST---LEFDSSLPPNAV-TAPLLRNH---ELDTFYYLGLT 100
                S FSYC   +     +T      DS L  + V   PLL N         YY+GLT
Sbjct: 250 DRLGGSKFSYCFPSQPWQPRATGVIFLGDSGLSKSKVGYTPLLDNPVTPARSQLYYVGLT 309

Query: 101 GISVGGDLLPISETAFKIDES-GNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRA-LSP 158
            ISV G  L I E+AFK+D S G+GG ++DSGT  TR+  + Y A R+AF    R+ L  
Sbjct: 310 SISVDGKTLAIPESAFKLDPSTGDGGTVLDSGTTFTRVVDDAYTAFRNAFAASNRSGLRK 369

Query: 159 TDGVAL-FDTCYDFSSRSSVE-VPTVSFHFPEGKVLPLPAKNYLIPVDSNG---TFCFAF 213
             G A  FD CY+ S+ SS+  VP V         L L  ++  +PV + G   T C A 
Sbjct: 370 KVGAAAGFDDCYNISAGSSLPGVPEVRLSLQNNVRLELRFEHLFVPVSAAGNEVTVCLAI 429

Query: 214 APTSSS----LSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
             +  S    ++++GN QQ    V ++   S +GF    C
Sbjct: 430 LSSQKSGFGKINVLGNYQQSNYLVEYDNERSRVGFERADC 469


>gi|2541876|dbj|BAA22813.1| CND41, chloroplast nucleoid DNA binding protein [Nicotiana tabacum]
          Length = 502

 Score =  123 bits (308), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 89/261 (34%), Positives = 124/261 (47%), Gaps = 18/261 (6%)

Query: 1   GDFVTETVTLGSASV-DNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN---ASTFS 56
           G F  + +TL    V D    GCG NN+GLF   AGL+GLG   LS   Q        FS
Sbjct: 247 GFFAKDKLTLTQNDVFDGFMFGCGQNNKGLFGKTAGLIGLGRDPLSIVQQTAQKFGKYFS 306

Query: 57  YCL-VDRDSDSTSTLE-----FDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP 110
           YCL   R S+   T         S    N +T     + +   +Y++ + GISVGG  L 
Sbjct: 307 YCLPTSRGSNGHLTFGNGNGVKASKAVKNGITFTPFASSQGTAYYFIDVLGISVGGKALS 366

Query: 111 ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYD 170
           IS   F+     N G I+DSGT +TRL +  Y +L+ AF +          ++L DTCYD
Sbjct: 367 ISPMLFQ-----NAGTIIDSGTVITRLPSTAYGSLKSAFKQFMSKYPTAPALSLLDTCYD 421

Query: 171 FSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT--SSSLSIIGNVQQ 228
            S+ +S+ +P +SF+F     + L     LI  +     C AFA      S+ I GN+QQ
Sbjct: 422 LSNYTSISIPKISFNFNGNANVELDPNGILI-TNGASQVCLAFAGNGDDDSIGIFGNIQQ 480

Query: 229 QGTRVSFNLRNSLIGFTPNKC 249
           Q   V +++    +GF    C
Sbjct: 481 QTLEVVYDVAGGQLGFGYKGC 501


>gi|242081367|ref|XP_002445452.1| hypothetical protein SORBIDRAFT_07g019450 [Sorghum bicolor]
 gi|241941802|gb|EES14947.1| hypothetical protein SORBIDRAFT_07g019450 [Sorghum bicolor]
          Length = 459

 Score =  123 bits (308), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 82/263 (31%), Positives = 117/263 (44%), Gaps = 15/263 (5%)

Query: 1   GDFVTETVTLGSAS--VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYC 58
           G   TET T G+      N+  GCG    G    A+G+LGL  G LS   Q+  + FSYC
Sbjct: 195 GVLATETFTFGAHHGVSANLTFGCGKLANGTIAEASGILGLSPGPLSMLKQLAITKFSYC 254

Query: 59  LVDRDSDSTSTLEFDS-------SLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPI 111
           L       TS + F +              T PLL+N   D +YY+ + G+SVG   L +
Sbjct: 255 LTPFADRKTSPVMFGAMADLGKYKTTGKVQTIPLLKNPVEDIYYYVPMVGMSVGSKRLDV 314

Query: 112 SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDF 171
            +    I   G GG ++DS T +  L    +  L+ A + G +       V  +  C++ 
Sbjct: 315 PQETLAIKPDGTGGTVLDSATTLAYLVEPAFTELKKAVMEGIKLPVANRSVDDYPVCFEL 374

Query: 172 S---SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAF--APTSSSLSIIGNV 226
               S   V+VP +  HF     + LP  NY     S G  C A   AP   + ++IGNV
Sbjct: 375 PRGMSMEGVQVPPLVLHFDGDAEMSLPRDNYFQE-PSPGMMCLAVMQAPFEGAPNVIGNV 433

Query: 227 QQQGTRVSFNLRNSLIGFTPNKC 249
           QQQ   V +++ N    + P KC
Sbjct: 434 QQQNMHVLYDVGNRKFSYAPTKC 456


>gi|357448247|ref|XP_003594399.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355483447|gb|AES64650.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 452

 Score =  123 bits (308), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 87/250 (34%), Positives = 132/250 (52%), Gaps = 20/250 (8%)

Query: 12  SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSYCLVDRDSDSTS 68
           S ++ +   GCG +N+GLF    G++GL    LS  SQ++      FSYCL    S   S
Sbjct: 210 SQTLSSFVYGCGQDNQGLFGRTDGIIGLANNELSMLSQLSGKYGNAFSYCLPTSFSTPNS 269

Query: 69  TLEF-----DSSLPPNAVTA--PLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDES 121
             E       SSL P++     PLL+N    + Y++ L  I+V G  L ++ +++K+   
Sbjct: 270 PKEGFLSIGTSSLTPSSSYKFTPLLKNPNNPSLYFIDLESITVAGRPLGVAASSYKVPT- 328

Query: 122 GNGGIIVDSGTAVTRLQTETYNALRDAFVR-GTRALSPTDGVALFDTCYDFSSRSSVEV- 179
                I+DSGT +TRL T  Y  L++A+V   ++      G++L DTC+  S     EV 
Sbjct: 329 -----IIDSGTVITRLPTPVYTTLKNAYVTILSKKYQQAPGISLLDTCFKGSLAGISEVA 383

Query: 180 PTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRN 239
           P +   F  G  L L   N L+ +++ G  C A A  SSS++IIGN QQQ  +V++++ N
Sbjct: 384 PDIRIIFKGGADLQLKGHNSLVELET-GITCLAMA-GSSSIAIIGNYQQQTVKVAYDVGN 441

Query: 240 SLIGFTPNKC 249
           S +GF P  C
Sbjct: 442 SRVGFAPGGC 451


>gi|217074470|gb|ACJ85595.1| unknown [Medicago truncatula]
 gi|388505166|gb|AFK40649.1| unknown [Medicago truncatula]
          Length = 452

 Score =  122 bits (307), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 87/250 (34%), Positives = 132/250 (52%), Gaps = 20/250 (8%)

Query: 12  SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSYCLVDRDSDSTS 68
           S ++ +   GCG +N+GLF    G++GL    LS  SQ++      FSYCL    S   S
Sbjct: 210 SQTLSSFVYGCGQDNQGLFGRTDGIIGLANNELSMLSQLSGKYGNAFSYCLPTSFSTPNS 269

Query: 69  TLEF-----DSSLPPNAVTA--PLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDES 121
             E       SSL P++     PLL+N    + Y++ L  I+V G  L ++ +++K+   
Sbjct: 270 PKEGFLSIGTSSLTPSSSYKFTPLLKNPNNPSLYFIDLESITVAGRPLGVAASSYKVPT- 328

Query: 122 GNGGIIVDSGTAVTRLQTETYNALRDAFVR-GTRALSPTDGVALFDTCYDFSSRSSVEV- 179
                I+DSGT +TRL T  Y  L++A+V   ++      G++L DTC+  S     EV 
Sbjct: 329 -----IIDSGTVITRLPTPVYTTLKNAYVTILSKKYQQAPGISLLDTCFKGSLAGISEVA 383

Query: 180 PTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRN 239
           P +   F  G  L L   N L+ +++ G  C A A  SSS++IIGN QQQ  +V++++ N
Sbjct: 384 PDIRIIFKGGADLQLKGHNSLVELET-GITCLAMA-GSSSIAIIGNYQQQTVKVAYDVGN 441

Query: 240 SLIGFTPNKC 249
           S +GF P  C
Sbjct: 442 SRVGFAPGGC 451


>gi|224130878|ref|XP_002320947.1| predicted protein [Populus trichocarpa]
 gi|222861720|gb|EEE99262.1| predicted protein [Populus trichocarpa]
          Length = 440

 Score =  122 bits (306), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 92/266 (34%), Positives = 138/266 (51%), Gaps = 27/266 (10%)

Query: 1   GDFVTETVTLGS-----ASVDNIAIGCGHNNEGLFV-GAAGLLGLGGGSLSFPSQINAST 54
           G+  ++T+TL S      S     IGCGH N+G F    +G++GLG G LS  SQ+ +S 
Sbjct: 182 GNVASDTITLDSTTGSPVSFPKTVIGCGHENDGTFSDKGSGIVGLGAGPLSLISQMGSSV 241

Query: 55  ---FSYCLVDRDSDS--TSTLEFDSSLP---PNAVTAPLLRNHELDTFYYLGLTGISVGG 106
              FSYCLV   S +  +S L F S+     P   + PLL +  + +FY+L L  +SVG 
Sbjct: 242 GGKFSYCLVPLSSRAGNSSKLNFGSNAVVSGPGVQSTPLLSSETMSSFYFLTLEAMSVGN 301

Query: 107 DLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAF---VRGTRALSPTDGVA 163
           + +   +++     +G G II+DSGT +T +  + ++ L  A    V G RA  P+    
Sbjct: 302 ERIKFGDSSLG---TGEGNIIIDSGTTLTIVPDDFFSNLSTAVGNQVEGRRAEDPS---G 355

Query: 164 LFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSII 223
               CY  S+ S ++VP ++ HF  G  + L   N  + V S+   C AFA T+S +SI 
Sbjct: 356 FLSVCY--SATSDLKVPAITAHF-TGADVKLKPINTFVQV-SDDVVCLAFASTTSGISIY 411

Query: 224 GNVQQQGTRVSFNLRNSLIGFTPNKC 249
           GNV Q    V +N++   + F P  C
Sbjct: 412 GNVAQMNFLVEYNIQGKSLSFKPTDC 437


>gi|125558632|gb|EAZ04168.1| hypothetical protein OsI_26310 [Oryza sativa Indica Group]
 gi|125600539|gb|EAZ40115.1| hypothetical protein OsJ_24558 [Oryza sativa Japonica Group]
          Length = 453

 Score =  122 bits (306), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 99/265 (37%), Positives = 138/265 (52%), Gaps = 23/265 (8%)

Query: 5   TETVTLGSASVDN-----IAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
           +ET T GS+  D      IA GC + +   + G+AGL+GLG G LS  SQ+ A  FSYCL
Sbjct: 189 SETFTFGSSPADQVRVPGIAFGCSNASSDDWNGSAGLVGLGRGGLSLVSQLAAGMFSYCL 248

Query: 60  VD-RDSDSTSTLEFDSSLPPNAVTAPLLR---------NHELDTFYYLGLTGISVGGDLL 109
              +D+ S STL    +    A+    +R            + T+YYL LTGISVG   L
Sbjct: 249 TPFQDTKSKSTLLLGPAAAAAALNGTGVRSTPFVPSPSKPPMSTYYYLNLTGISVGAAAL 308

Query: 110 PISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDG--VALFDT 167
           PI   AF +   G GG+I+DSGT +T L    Y  +R A VR    L  TDG      D 
Sbjct: 309 PIPPGAFALRADGTGGLIIDSGTTITSLVDAAYKRVRAA-VRSLVKLPVTDGSNATGLDL 367

Query: 168 CYDFSSRSS--VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAF-APTSSSLSIIG 224
           C+   S S+    +P+++ HF  G  + LP +NY+I +D  G +C A  + T   LS +G
Sbjct: 368 CFALPSSSAPPATLPSMTLHFGGGADMVLPVENYMI-LD-GGMWCLAMRSQTDGELSTLG 425

Query: 225 NVQQQGTRVSFNLRNSLIGFTPNKC 249
           N QQQ   + ++++   + F P KC
Sbjct: 426 NYQQQNLHILYDVQKETLSFAPAKC 450


>gi|413952718|gb|AFW85367.1| hypothetical protein ZEAMMB73_231535 [Zea mays]
          Length = 443

 Score =  122 bits (306), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 90/277 (32%), Positives = 125/277 (45%), Gaps = 45/277 (16%)

Query: 1   GDFVTETVTLG-------SASVDNIAIGCGHNNEGLFVG-AAGLLGLGGGSLSFPSQINA 52
           G+  T+  T G       S     +  GCGH N+G+F     G+ G G G  S PSQ+N 
Sbjct: 177 GEIATDRFTFGDSGGSGESLHTRRLTFGCGHLNKGVFQSNETGIAGFGRGRWSLPSQLNV 236

Query: 53  STFSYCLVDRDSDSTSTLEFDSSL-----PPNAV----------TAPLLRNHELDTFYYL 97
           ++FSYC        TS  E  SSL      P A+          T P+L+N    + Y+L
Sbjct: 237 TSFSYCF-------TSMFESKSSLVTLGGSPAALYSHAHSGEVRTTPILKNPSQPSLYFL 289

Query: 98  GLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALS 157
            L GISVG   LP+ ET F+         I+DSG ++T L  E Y A++  F      L 
Sbjct: 290 SLKGISVGKTRLPVPETKFR-------STIIDSGASITTLPEEVYEAVKAEFA-AQVGLP 341

Query: 158 PT--DGVALFDTCYDF---SSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFA 212
           P+  +G AL D C+     +      VP+++ H  EG    LP  NY+         C  
Sbjct: 342 PSGVEGSAL-DLCFALPVTALWRRPAVPSLTLHL-EGADWELPRSNYVFEDLGARVMCIV 399

Query: 213 FAPTSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
                   ++IGN QQQ T V ++L N  + F P +C
Sbjct: 400 LDAAPGEQTVIGNFQQQNTHVVYDLENDRLSFAPARC 436


>gi|54290731|dbj|BAD62401.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
          Length = 521

 Score =  122 bits (306), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 97/265 (36%), Positives = 134/265 (50%), Gaps = 29/265 (10%)

Query: 1   GDFVTETVTLGSASV-DNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFS 56
           G + TET+TL    V  +   GCG +  G +    GLLGLGG   S  SQ ++     FS
Sbjct: 270 GVYSTETLTLKPGVVVADFGFGCGDHQHGPYEKFDGLLGLGGAPESLVSQTSSQFGGPFS 329

Query: 57  YCLVDRDSDSTSTLEFDSSLPPNAVTA---------PLLRNHELDTFYYLGLTGISVGGD 107
           YCL    S     L   +  PPN+ ++         P+ R   + TFY + LTGISVGG 
Sbjct: 330 YCL-PPTSGGAGFLTLGA--PPNSSSSTAASGLSFTPMRRLPSVPTFYIVTLTGISVGGA 386

Query: 108 LLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGT---RALSPTDGVAL 164
            L I  +AF      + G+++DSGT +T L    Y ALR AF       R L P++G  +
Sbjct: 387 PLAIPPSAF------SSGMVIDSGTVITGLPATAYAALRSAFRSAMSEYRLLPPSNG-GV 439

Query: 165 FDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIG 224
            DTCYDF+  ++V VPT+S  F  G  + L A   ++ VD  G   FA A T +++ IIG
Sbjct: 440 LDTCYDFTGHANVTVPTISLTFSGGATIDLAAPAGVL-VD--GCLAFAGAGTDNAIGIIG 496

Query: 225 NVQQQGTRVSFNLRNSLIGFTPNKC 249
           NV Q+   V ++     +GF    C
Sbjct: 497 NVNQRTFEVLYDSGKGTVGFRAGAC 521


>gi|168014386|ref|XP_001759733.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162689272|gb|EDQ75645.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 392

 Score =  122 bits (306), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 91/261 (34%), Positives = 128/261 (49%), Gaps = 16/261 (6%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN---ASTFSY 57
           G F  ET T+G   V+++A GCG+ N+G FV A G+LGLG G+LSF SQ      + F+Y
Sbjct: 132 GVFAYETATVGGIRVNHVAFGCGNRNQGSFVSAGGVLGLGQGALSFTSQAGYAFENKFAY 191

Query: 58  CLVDRDSDST--STLEFDSSLPP---NAVTAPLLRNHELDTFYYLGLTGISVGGDLLPIS 112
           CL    S ++  S+L F   +     +    PL+ N    + YY+ +  I  GG+ L I 
Sbjct: 192 CLTSYLSPTSVFSSLIFGDDMMSTIHDLQFTPLVSNPLNPSVYYVQIVRICFGGETLLIP 251

Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGT---RALSPTDGVALFDTCY 169
           ++A+KID  GNGG I DSGT VT    + Y  +  AF +     RA     G+ L   C 
Sbjct: 252 DSAWKIDSVGNGGTIFDSGTTVTYWSPQAYARIIAAFEKSVPYPRAPPSPQGLPL---CV 308

Query: 170 DFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSS-LSIIGNVQQ 228
           + S       P+ +  F +G        NY I V  N   C A   +SS   ++IGN+ Q
Sbjct: 309 NVSGIDHPIYPSFTIEFDQGATYRPNQGNYFIEVSPN-IDCLAMLESSSDGFNVIGNIIQ 367

Query: 229 QGTRVSFNLRNSLIGFTPNKC 249
           Q   V ++     IGF    C
Sbjct: 368 QNYLVQYDREEHRIGFAHANC 388


>gi|24417232|gb|AAN60226.1| unknown [Arabidopsis thaliana]
          Length = 464

 Score =  122 bits (306), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 96/253 (37%), Positives = 130/253 (51%), Gaps = 10/253 (3%)

Query: 1   GDFVTETVTLGSASV-DNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFS 56
           G    E  TL ++ V +++  GCG NN+GLF G AGLLGLG G LS P+Q   +    FS
Sbjct: 218 GFLAKEKFTLTNSDVLEDVYFGCGENNQGLFDGVAGLLGLGPGKLSLPAQTTTTYNNIFS 277

Query: 57  YCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
           YCL    S+ST  L F S+    +V    + +      Y + + GISVG   L I+  +F
Sbjct: 278 YCLPSFTSNSTGHLTFGSAGISESVKFTPISSFPSAFNYGIDIIGISVGDKELAITPNSF 337

Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSS 176
             +     G I+DSGT  TRL T+ Y  LR  F     +   T G  LFDTCYDF+   +
Sbjct: 338 STE-----GAIIDSGTVFTRLPTKVYAELRSVFKEKMSSYKSTSGYGLFDTCYDFTGLDT 392

Query: 177 VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFN 236
           V  PT++F F  G V+ L      +P+  +   C AFA      +I GNVQQ    V ++
Sbjct: 393 VTYPTIAFSFAGGTVVELDGSGISLPIKIS-QVCLAFAGNDDLPAIFGNVQQTTLDVVYD 451

Query: 237 LRNSLIGFTPNKC 249
           +    +GF PN C
Sbjct: 452 VAGGRVGFAPNGC 464


>gi|224092218|ref|XP_002309514.1| predicted protein [Populus trichocarpa]
 gi|222855490|gb|EEE93037.1| predicted protein [Populus trichocarpa]
          Length = 474

 Score =  122 bits (305), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 99/255 (38%), Positives = 126/255 (49%), Gaps = 14/255 (5%)

Query: 1   GDFVTETVTLGSASV-DNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN---ASTFS 56
           G F TET+T+ S+ V  N   GCG +N GLF  AAGLLGL   S+S PSQ        FS
Sbjct: 227 GFFATETLTISSSDVFTNFLFGCGQSNNGLFGQAAGLLGLSSSSVSLPSQTAEKYQKQFS 286

Query: 57  YCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
           YCL    S ST  L F   +   A   P+  +    +FY + + GISV G  LPI  + F
Sbjct: 287 YCLPSTPS-STGYLNFGGKVSQTAGFTPI--SPAFSSFYGIDIVGISVAGSQLPIDPSIF 343

Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSS 176
                   G I+DSGT +TRL    Y AL++AF         T+G  L DTCYDFS+ ++
Sbjct: 344 T-----TSGAIIDSGTVITRLPPTAYKALKEAFDEKMSNYPKTNGDELLDTCYDFSNYTT 398

Query: 177 VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS--SSLSIIGNVQQQGTRVS 234
           V  P VS  F  G  + + A   L  V+     C AFA     S   I GN QQ+   V 
Sbjct: 399 VSFPKVSVSFKGGVEVDIDASGILYLVNGVKMVCLAFAANKDDSEFGIFGNHQQKTYEVV 458

Query: 235 FNLRNSLIGFTPNKC 249
           ++    +IGF    C
Sbjct: 459 YDGAKGMIGFAAGAC 473


>gi|357113696|ref|XP_003558637.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 432

 Score =  122 bits (305), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 83/260 (31%), Positives = 125/260 (48%), Gaps = 11/260 (4%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEG--LFVGAAGLLGLGGGSLSFPSQINA---STF 55
               ++ + LG  ++ N A GC     G    +   GLLGLG G ++  SQ+       F
Sbjct: 172 ASLASDWLHLGKDAIPNYAFGCVSAVSGPTANLPKQGLLGLGRGPMALLSQVGNMYNGVF 231

Query: 56  SYCLVDRDSDSTS-TLEFDSSLPPNAVT-APLLRNHELDTFYYLGLTGISVGGDLLPISE 113
           SYCL    S   S +L   ++  P  V   P+L+N    + YY+ +TG+SVG   + +  
Sbjct: 232 SYCLPSYKSYYFSGSLRLGAAGQPRGVRYTPMLKNPNRSSLYYVNVTGLSVGRAPVKVPA 291

Query: 114 TAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSS 173
            +F  D +   G +VDSGT +TR     Y ALR+ F R   A S    +  FDTC++   
Sbjct: 292 GSFAFDPATGAGTVVDSGTVITRWTPPVYAALREEFRRHVAAPSGYTSLGAFDTCFNTDE 351

Query: 174 RSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSS----LSIIGNVQQQ 229
            ++   P V+ H   G  L LP +N LI   +    C A A    +    ++++ N+QQQ
Sbjct: 352 VAAGVAPAVTVHMDGGLDLALPMENTLIHSSATPLACLAMAEAPQNVNAVVNVLANLQQQ 411

Query: 230 GTRVSFNLRNSLIGFTPNKC 249
             RV F++ NS +GF    C
Sbjct: 412 NLRVVFDVANSRVGFARESC 431


>gi|242079449|ref|XP_002444493.1| hypothetical protein SORBIDRAFT_07g022790 [Sorghum bicolor]
 gi|241940843|gb|EES13988.1| hypothetical protein SORBIDRAFT_07g022790 [Sorghum bicolor]
          Length = 449

 Score =  122 bits (305), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 90/268 (33%), Positives = 131/268 (48%), Gaps = 20/268 (7%)

Query: 1   GDFVTETVTLG-SASVD-NIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYC 58
           G   +ET T G +A V   +  GCG  + G  VGA+GL+GL  G +S  SQ++   FSYC
Sbjct: 180 GVLASETFTFGVNAKVSLPLGFGCGALSAGDLVGASGLMGLSPGIMSLVSQLSVPRFSYC 239

Query: 59  LVDRDSDSTSTLEFDS-------SLPPNAVTAPLLRNHELDT-FYYLGLTGISVGGDLLP 110
           L       TS L F +              T  +LRN  ++T +YY+ L G+S+G   L 
Sbjct: 240 LTPFAERKTSPLLFGAMADLRRYRTTGTVQTTSILRNPAMETAYYYVPLVGLSLGTKRLD 299

Query: 111 ISETAF-KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTR---ALSPTDGVALFD 166
           +  T+   I   G+GG IVDSG+ ++ L+   + A++ A V   R   A    +    ++
Sbjct: 300 VPATSLGMIKPDGSGGTIVDSGSTMSYLEETAFRAVKKAVVEAVRLPVANGTDEDYDDYE 359

Query: 167 TCYDFS---SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAF--APTSSSLS 221
            C+      +  +V+ P +  HF  G  + LP  NY     + G  C A   +P    +S
Sbjct: 360 LCFALPTGVAMEAVKTPPLVLHFDGGAAMTLPRDNYFQEPRA-GLMCLAVGTSPDGFGVS 418

Query: 222 IIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           IIGNVQQQ   V F++RN    F P KC
Sbjct: 419 IIGNVQQQNMHVLFDVRNQKFSFAPTKC 446


>gi|222635451|gb|EEE65583.1| hypothetical protein OsJ_21095 [Oryza sativa Japonica Group]
          Length = 441

 Score =  122 bits (305), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 97/265 (36%), Positives = 134/265 (50%), Gaps = 29/265 (10%)

Query: 1   GDFVTETVTLGSASV-DNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFS 56
           G + TET+TL    V  +   GCG +  G +    GLLGLGG   S  SQ ++     FS
Sbjct: 190 GVYSTETLTLKPGVVVADFGFGCGDHQHGPYEKFDGLLGLGGAPESLVSQTSSQFGGPFS 249

Query: 57  YCLVDRDSDSTSTLEFDSSLPPNAVTA---------PLLRNHELDTFYYLGLTGISVGGD 107
           YCL    S     L   +  PPN+ ++         P+ R   + TFY + LTGISVGG 
Sbjct: 250 YCL-PPTSGGAGFLTLGA--PPNSSSSTAASGLSFTPMRRLPSVPTFYIVTLTGISVGGA 306

Query: 108 LLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGT---RALSPTDGVAL 164
            L I  +AF      + G+++DSGT +T L    Y ALR AF       R L P++G  +
Sbjct: 307 PLAIPPSAF------SSGMVIDSGTVITGLPATAYAALRSAFRSAMSEYRLLPPSNG-GV 359

Query: 165 FDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIG 224
            DTCYDF+  ++V VPT+S  F  G  + L A   ++ VD  G   FA A T +++ IIG
Sbjct: 360 LDTCYDFTGHANVTVPTISLTFSGGATIDLAAPAGVL-VD--GCLAFAGAGTDNAIGIIG 416

Query: 225 NVQQQGTRVSFNLRNSLIGFTPNKC 249
           NV Q+   V ++     +GF    C
Sbjct: 417 NVNQRTFEVLYDSGKGTVGFRAGAC 441


>gi|86438622|emb|CAJ26370.1| chloroplast nucleoid binding protein [Brachypodium sylvaticum]
          Length = 443

 Score =  121 bits (304), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 92/271 (33%), Positives = 127/271 (46%), Gaps = 27/271 (9%)

Query: 1   GDFVTETVTLGS-----ASVDN------IAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ 49
           G    +T+TLG+     AS +N         GCG NN GLF  A GL GLG G +S  SQ
Sbjct: 177 GHLGNDTLTLGTTPSTNASENNSNKLPGFVFGCGENNTGLFGKADGLFGLGRGKVSLSSQ 236

Query: 50  INAST---FSYCLVDRDSDSTSTLEF--DSSLPPNAVTAPLLRNHELDTFYYLGLTGISV 104
                   FSYCL    S++   L     +  P +A   P+L      +FYY+ L GI V
Sbjct: 237 AAGKYGEGFSYCLPSSSSNAHGYLSLGTPAPAPAHARFTPMLNRSNTPSFYYVKLVGIRV 296

Query: 105 GGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFV--RGTRALSPTDGV 162
            G  + +S            G+IVDSGT +TRL    Y+ALR AF+   G         +
Sbjct: 297 AGRAIKVSSRPALWP----AGLIVDSGTVITRLAPRAYSALRTAFLSAMGKYGYKRAPRL 352

Query: 163 ALFDTCYDFSSR--SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS-- 218
           ++ DTCYDF++   ++V +P V+  F  G  + +     L  V      C AFAP  +  
Sbjct: 353 SILDTCYDFTAHANATVSIPAVALVFAGGATISVDFSGVLY-VAKVAQACLAFAPNGNGR 411

Query: 219 SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           S  I+GN QQ+   V +++    IGF    C
Sbjct: 412 SAGILGNTQQRTVAVVYDVGRQKIGFAAKGC 442


>gi|449451908|ref|XP_004143702.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
 gi|449529900|ref|XP_004171936.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
          Length = 459

 Score =  121 bits (304), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 95/275 (34%), Positives = 129/275 (46%), Gaps = 27/275 (9%)

Query: 1   GDFVTETVTLGSAS-----VDNIAIGCGHNNEG------LFVGAAGLLGLGGGSLSFPSQ 49
           G F  ET TL S S     +  ++ GCG    G       F GA G++GLG GS+SF SQ
Sbjct: 183 GFFSKETTTLKSLSGSEIHLKGLSFGCGFRISGPSVSGAQFNGARGVMGLGRGSISFSSQ 242

Query: 50  IN---ASTFSYCLVDR--DSDSTSTLEFD---SSLPPNAVTA----PLLRNHELDTFYYL 97
           +     + FSYCL+D       TS L       SLP    T     PL  N    TFYY+
Sbjct: 243 LGRRFGNKFSYCLMDYTLSPPPTSFLMIGGGLHSLPLTNATKISYTPLQINPLSPTFYYI 302

Query: 98  GLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALS 157
            +  I++ G  LPI+   ++IDE GNGG +VDSGT +T L    Y  +  +  R  +  +
Sbjct: 303 TIHSITIDGVKLPINPAVWEIDEQGNGGTVVDSGTTLTYLTKTAYEEVLKSVRRRVKLPN 362

Query: 158 PTDGVALFDTCYDFSSRSSV-EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT 216
             +    FD C + S  S    +P + F    G V   P +NY +  +  G  C A    
Sbjct: 363 AAELTPGFDLCVNASGESRRPSLPRLRFRLGGGAVFAPPPRNYFLETE-EGVMCLAIRAV 421

Query: 217 SS--SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
            S    S+IGN+ QQG  + F+   S +GFT   C
Sbjct: 422 ESGNGFSVIGNLMQQGFLLEFDKEESRLGFTRRGC 456


>gi|224286159|gb|ACN40790.1| unknown [Picea sitchensis]
          Length = 452

 Score =  121 bits (304), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 93/266 (34%), Positives = 135/266 (50%), Gaps = 35/266 (13%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHN-NEGLF----VGAAGLLGLGGGSLSFPSQINASTF 55
           G   ++ +TLGS  + N + GC  + +E  +    +   G   L   + +  +++   TF
Sbjct: 201 GTLASDAITLGSQYLPNFSFGCAESLSEDTYSSPGLMGLGGGSLSLLTQAPTAELFGGTF 260

Query: 56  SYCLV------------DRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGIS 103
           SYCL                + S+S+L+F +          L+++    TFY++ L  IS
Sbjct: 261 SYCLPSSSTSSGSLVLGKEAAVSSSSLKFTT----------LIKDPSFPTFYFVTLKAIS 310

Query: 104 VGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVA 163
           VG   + +  T      +  GG I+DSGT +T L    Y  LRDAF +   +L PT  V 
Sbjct: 311 VGNTRISVPATNI----ASGGGTIIDSGTTITYLVPSAYKDLRDAFRQQLSSLQPTP-VE 365

Query: 164 LFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSII 223
             DTCYD SS SSV+VPT++ H      L LP +N LI  +S G  C AF+ T S  SII
Sbjct: 366 DMDTCYDLSS-SSVDVPTITLHLDRNVDLVLPKENILITQES-GLSCLAFSSTDSR-SII 422

Query: 224 GNVQQQGTRVSFNLRNSLIGFTPNKC 249
           GNVQQQ  R+ F++ NS +GF   +C
Sbjct: 423 GNVQQQNWRIVFDVPNSQVGFAQEQC 448


>gi|22831049|dbj|BAC15912.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|50508281|dbj|BAD32130.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
          Length = 453

 Score =  121 bits (303), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 99/265 (37%), Positives = 138/265 (52%), Gaps = 23/265 (8%)

Query: 5   TETVTLGSASVDN-----IAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
           +ET T GS+  D      IA GC + +   + G+AGL+GLG G LS  SQ+ A  FSYCL
Sbjct: 189 SETFTFGSSPADQVRVPGIAFGCSNASSDDWNGSAGLVGLGRGGLSLVSQLAAGMFSYCL 248

Query: 60  VD-RDSDSTSTLEFDSSLPPNAVTAPLLR---------NHELDTFYYLGLTGISVGGDLL 109
              +D+ S STL    +    A+    +R            + T+YYL LTGISVG   L
Sbjct: 249 TPFQDTKSKSTLLLGPAAAAAALNGTGVRSTPFVPSPSKPPMSTYYYLNLTGISVGPAAL 308

Query: 110 PISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDG--VALFDT 167
           PI   AF +   G GG+I+DSGT +T L    Y  +R A VR    L  TDG      D 
Sbjct: 309 PIPPGAFALRADGTGGLIIDSGTTITSLVDAAYKRVRAA-VRSLVKLPVTDGSNATGLDL 367

Query: 168 CYDFSSRSS--VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAF-APTSSSLSIIG 224
           C+   S S+    +P+++ HF  G  + LP +NY+I +D  G +C A  + T   LS +G
Sbjct: 368 CFALPSSSAPPATLPSMTLHFGGGADMVLPVENYMI-LD-GGMWCLAMRSQTDGELSTLG 425

Query: 225 NVQQQGTRVSFNLRNSLIGFTPNKC 249
           N QQQ   + ++++   + F P KC
Sbjct: 426 NYQQQNLHILYDVQKETLSFAPAKC 450


>gi|414871328|tpg|DAA49885.1| TPA: hypothetical protein ZEAMMB73_545054 [Zea mays]
          Length = 565

 Score =  121 bits (303), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 93/250 (37%), Positives = 131/250 (52%), Gaps = 22/250 (8%)

Query: 15  VDNIA---IGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ---INASTFSYCLVD-RDSDST 67
           VD IA    GC     G  V + GL+G   G LSFPSQ   +  S FSYCL   + S+ +
Sbjct: 321 VDAIAAYTFGCLCVVTGGSVPSQGLVGFNRGPLSFPSQNKNVYGSVFSYCLPSYKSSNFS 380

Query: 68  STLEFDSSLPPNAV-TAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGI 126
            TL    +  P  + T PLL N    + YY+ + GI VGG  + +  +A   D +   G 
Sbjct: 381 GTLRLGPAGQPKRIKTTPLLSNPHRPSLYYVNMVGIRVGGRPVAVPASALAFDPASGHGT 440

Query: 127 IVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDG-VALFDTCYDFSSRSSVEVPTVSFH 185
           IVD+GT  TRL    Y A+ D F    RA  P  G +  FDTCY+     ++ VPTV+F 
Sbjct: 441 IVDAGTMFTRLSAPVYAAVCDVFRSRVRA--PVAGPLGGFDTCYNV----TISVPTVTFL 494

Query: 186 FPEGKV-LPLPAKNYLIPVDSNGTFCFAFA--PTSS---SLSIIGNVQQQGTRVSFNLRN 239
           F +G+V + LP +N +I    +G  C A A  P+ S    L+++ ++QQQ  RV F++ N
Sbjct: 495 F-DGRVSVTLPEENVVIRSSLDGIACLAMAAGPSDSVDAVLNVMASMQQQNHRVLFDVAN 553

Query: 240 SLIGFTPNKC 249
             +GF+   C
Sbjct: 554 GRVGFSRELC 563


>gi|115472519|ref|NP_001059858.1| Os07g0533800 [Oryza sativa Japonica Group]
 gi|113611394|dbj|BAF21772.1| Os07g0533800 [Oryza sativa Japonica Group]
          Length = 458

 Score =  121 bits (303), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 99/265 (37%), Positives = 138/265 (52%), Gaps = 23/265 (8%)

Query: 5   TETVTLGSASVDN-----IAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
           +ET T GS+  D      IA GC + +   + G+AGL+GLG G LS  SQ+ A  FSYCL
Sbjct: 194 SETFTFGSSPADQVRVPGIAFGCSNASSDDWNGSAGLVGLGRGGLSLVSQLAAGMFSYCL 253

Query: 60  VD-RDSDSTSTLEFDSSLPPNAVTAPLLR---------NHELDTFYYLGLTGISVGGDLL 109
              +D+ S STL    +    A+    +R            + T+YYL LTGISVG   L
Sbjct: 254 TPFQDTKSKSTLLLGPAAAAAALNGTGVRSTPFVPSPSKPPMSTYYYLNLTGISVGPAAL 313

Query: 110 PISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDG--VALFDT 167
           PI   AF +   G GG+I+DSGT +T L    Y  +R A VR    L  TDG      D 
Sbjct: 314 PIPPGAFALRADGTGGLIIDSGTTITSLVDAAYKRVRAA-VRSLVKLPVTDGSNATGLDL 372

Query: 168 CYDFSSRSS--VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAF-APTSSSLSIIG 224
           C+   S S+    +P+++ HF  G  + LP +NY+I +D  G +C A  + T   LS +G
Sbjct: 373 CFALPSSSAPPATLPSMTLHFGGGADMVLPVENYMI-LD-GGMWCLAMRSQTDGELSTLG 430

Query: 225 NVQQQGTRVSFNLRNSLIGFTPNKC 249
           N QQQ   + ++++   + F P KC
Sbjct: 431 NYQQQNLHILYDVQKETLSFAPAKC 455


>gi|115448349|ref|NP_001047954.1| Os02g0720600 [Oryza sativa Japonica Group]
 gi|45735841|dbj|BAD12876.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
           [Oryza sativa Japonica Group]
 gi|45735967|dbj|BAD12996.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
           [Oryza sativa Japonica Group]
 gi|113537485|dbj|BAF09868.1| Os02g0720600 [Oryza sativa Japonica Group]
 gi|125583492|gb|EAZ24423.1| hypothetical protein OsJ_08176 [Oryza sativa Japonica Group]
 gi|215740989|dbj|BAG97484.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 463

 Score =  121 bits (303), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 89/259 (34%), Positives = 124/259 (47%), Gaps = 25/259 (9%)

Query: 1   GDFVTETVTLGSAS--VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TF 55
           G +  +T+TL  AS  V     GC H   G      GL+GLGGG+ S  SQ  A+   +F
Sbjct: 220 GTYSRDTLTLSGASDAVKGFQFGCSHVESGFSDQTDGLMGLGGGAQSLVSQTAAAYGNSF 279

Query: 56  SYCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETA 115
           SYCL      S              VT  +LR+ ++ TFY   L  I+VGG  L +S + 
Sbjct: 280 SYCLPPTSGSSGFLTLGGGGGVSGFVTTRMLRSRQIPTFYGARLQDIAVGGKQLGLSPSV 339

Query: 116 FKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRS 175
           F        G +VDSGT +TRL    Y+AL  AF  G +        ++ DTC+DF+ ++
Sbjct: 340 FA------AGSVVDSGTIITRLPPTAYSALSSAFKAGMKQYRSAPARSILDTCFDFAGQT 393

Query: 176 SVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTF---CFAFAPT--SSSLSIIGNVQQQG 230
            + +PTV+  F  G  + L         D NG     C AFA T    +  IIGNVQQ+ 
Sbjct: 394 QISIPTVALVFSGGAAIDL---------DPNGIMYGNCLAFAATGDDGTTGIIGNVQQRT 444

Query: 231 TRVSFNLRNSLIGFTPNKC 249
             V +++ +S +GF    C
Sbjct: 445 FEVLYDVGSSTLGFRSGAC 463


>gi|224124882|ref|XP_002329972.1| predicted protein [Populus trichocarpa]
 gi|222871994|gb|EEF09125.1| predicted protein [Populus trichocarpa]
          Length = 332

 Score =  120 bits (302), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 82/246 (33%), Positives = 126/246 (51%), Gaps = 15/246 (6%)

Query: 12  SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSYCLVDRDSDSTS 68
           S ++     GCG +N+GLF  AAG++GL    LS  +Q++      FSYCL   +S S+ 
Sbjct: 93  SQTLPQFTYGCGQDNQGLFGRAAGIIGLARDKLSMLAQLSTKYGHAFSYCLPTANSGSSG 152

Query: 69  TLEFDSSL--PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGI 126
                     P +    P+L + +  + Y+L LT I+V G  L ++   +++        
Sbjct: 153 GGFLSIGSISPTSYKFTPMLTDSKNPSLYFLRLTAITVSGRPLDLAAAMYRVPT------ 206

Query: 127 IVDSGTAVTRLQTETYNALRDAFVR-GTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFH 185
           ++DSGT +TRL    Y ALR AFV+  +   +     ++ DTC+  S +S   VP +   
Sbjct: 207 LIDSGTVITRLPMSMYAALRQAFVKIMSTKYAKAPAYSILDTCFKGSLKSISAVPEIKMI 266

Query: 186 FPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS--SSLSIIGNVQQQGTRVSFNLRNSLIG 243
           F  G  L L A + LI  D  G  C AFA +S  + ++IIGN QQQ   +++++  S IG
Sbjct: 267 FQGGADLTLRAPSILIEAD-KGITCLAFAGSSGTNQIAIIGNRQQQTYNIAYDVSTSRIG 325

Query: 244 FTPNKC 249
           F P  C
Sbjct: 326 FAPGSC 331


>gi|242050426|ref|XP_002462957.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
 gi|241926334|gb|EER99478.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
          Length = 452

 Score =  120 bits (302), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 87/248 (35%), Positives = 121/248 (48%), Gaps = 12/248 (4%)

Query: 12  SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV-DRDSDSTSTL 70
           S+S   +A GC   N G   GA+G++GLG  +LS  SQI    FSYCL  D D+ ++  L
Sbjct: 204 SSSFAGVAFGCSTANGGDMDGASGIVGLGRSALSLLSQIGVGRFSYCLRSDADAGASPIL 263

Query: 71  --EFDSSLPPNAVTAPLLRN----HELDTFYYLGLTGISVGGDLLPISETAFKIDESGNG 124
                +       +  LLRN         +YY+ LTGI+VG   LP++ + F    +G G
Sbjct: 264 FGALANVTGDKVQSTALLRNPVAARRRAPYYYVNLTGIAVGSTDLPVTSSTFGFTAAGAG 323

Query: 125 GIIVDSGTAVTRLQTETYNALRDAFVRGTRA-LSPTDGVAL-FDTCYDFSSRSSVEVPTV 182
           G+IVDSGT  T L    Y  LR AF+  T   L+   G    FD C++ +  +   VP +
Sbjct: 324 GVIVDSGTTFTYLAEAGYTMLRQAFLSQTAGLLTRVSGAQFDFDLCFE-AGAADTPVPRL 382

Query: 183 SFHFPEGKVLPLPAKNYLIPVDSNGTF-CFAFAPTSSSLSIIGNVQQQGTRVSFNLRNSL 241
            F F  G    +P ++Y   VD  G   C    PT   +S+IGNV Q    V ++L  + 
Sbjct: 383 VFRFAGGAEYAVPRQSYFDAVDEGGRVACLLVLPT-RGVSVIGNVMQMDLHVLYDLDGAT 441

Query: 242 IGFTPNKC 249
             F P  C
Sbjct: 442 FSFAPADC 449


>gi|118487542|gb|ABK95598.1| unknown [Populus trichocarpa]
          Length = 471

 Score =  120 bits (301), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 85/256 (33%), Positives = 130/256 (50%), Gaps = 18/256 (7%)

Query: 2   DFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSYC 58
           D +T T    S ++     GCG +N+GLF  AAG++GL    LS  +Q++      FSYC
Sbjct: 225 DLLTLT---SSQTLPQFTYGCGQDNQGLFGRAAGIIGLARDKLSMLAQLSTKYGHAFSYC 281

Query: 59  LVDRDSDSTSTLEFDSSL--PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
           L   +S S+           P +    P+L + +  + Y+L LT I+V G  L ++   +
Sbjct: 282 LPTANSGSSGGGFLSIGSISPTSYKFTPMLTDSKNPSLYFLRLTAITVSGRPLDLAAAMY 341

Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR-GTRALSPTDGVALFDTCYDFSSRS 175
           ++        ++DSGT +TRL    Y ALR AFV+  +   +     ++ DTC+  S +S
Sbjct: 342 RVPT------LIDSGTVITRLPMSMYAALRQAFVKIMSTKYAKAPAYSILDTCFKGSLKS 395

Query: 176 SVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS--SSLSIIGNVQQQGTRV 233
              VP +   F  G  L L A + LI  D  G  C AFA +S  + ++IIGN QQQ   +
Sbjct: 396 ISAVPEIKMIFQGGADLTLRAPSILIEAD-KGITCLAFAGSSGTNQIAIIGNRQQQTYNI 454

Query: 234 SFNLRNSLIGFTPNKC 249
           ++++  S IGF P  C
Sbjct: 455 AYDVSTSRIGFAPGSC 470


>gi|413953772|gb|AFW86421.1| hypothetical protein ZEAMMB73_098827 [Zea mays]
          Length = 482

 Score =  120 bits (301), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 97/273 (35%), Positives = 126/273 (46%), Gaps = 31/273 (11%)

Query: 1   GDFVTETVTLGSAS--VDNIAIGCGHNNEGLFVGA------AGLLGLGGGSLSFPSQI-- 50
           G+   E  TL  ++     +  GC H       GA      AGLLGLG G  S  SQ   
Sbjct: 216 GNLAQEAFTLSPSAPPAAGVVFGCSHEYSSGVKGAEEEMSVAGLLGLGRGDSSILSQTRR 275

Query: 51  --NASTFSYCLVDRDSDSTSTLEFDSSLPP--NAVTAPLLR-NHELDTFYYLGLTGISVG 105
             +   FSYCL  R S S   L   ++ PP  N    PL+  N +L + Y + L GISV 
Sbjct: 276 GNSGDVFSYCLPPRGS-SAGYLTIGAAAPPQSNLSFTPLVTDNSQLSSVYVVNLVGISVS 334

Query: 106 GDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR--GTRALSPTDGVA 163
           G  LPI  +AF I      G ++DSGT +T +    Y  LRD F R  G   + P   V 
Sbjct: 335 GAALPIDASAFYI------GTVIDSGTVITHMPAAAYYVLRDEFRRHMGGYTMLPEGHVE 388

Query: 164 LFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLI--PVDSNGT----FCFAFAPTS 217
             DTCYD +    V  P V+  F  G  + + A   L+   VD++G      C AF PT+
Sbjct: 389 SLDTCYDVTGHDVVTAPPVALEFGGGARIDVDASGILLVFAVDASGQSLTLACLAFVPTN 448

Query: 218 -SSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
                IIGN+QQ+   V F++    IGF  N C
Sbjct: 449 LPGFVIIGNMQQRAYNVVFDVEGRRIGFGANGC 481


>gi|242066176|ref|XP_002454377.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
 gi|241934208|gb|EES07353.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
          Length = 474

 Score =  120 bits (300), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 98/265 (36%), Positives = 133/265 (50%), Gaps = 37/265 (13%)

Query: 1   GDFVTETVTLG-SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFS 56
           G + ++T+TL  + +V     GCGH   G F G  GLLGLG    S   Q   +    FS
Sbjct: 231 GVYSSDTLTLSPNDAVRGFFFGCGHAQSG-FTGNDGLLGLGREEASLVEQTAGTYGGVFS 289

Query: 57  YCLVDRDSDSTSTLEF---DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISE 113
           YCL  R S +T  L       + PP   T  LL +    T+Y + LTGISVGG  L +  
Sbjct: 290 YCLPTRPS-TTGYLTLGGPSGAAPPGFSTTQLLSSPNAATYYVVMLTGISVGGQQLSVPS 348

Query: 114 TAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRAL----SPTDGVALFDTCY 169
           + F       GG +VD+GT +TRL    Y ALR AF  G  +     +P  G+   DTCY
Sbjct: 349 SVFA------GGTVVDTGTVITRLPPTAYAALRSAFRSGMASYGYPSAPATGI--LDTCY 400

Query: 170 DFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTF-CFAFAPTSS--SLSIIGNV 226
           +FS   +V +P V+  F  G  + L A   L       +F C AFAP+ S   ++I+GNV
Sbjct: 401 NFSGYGTVTLPNVALTFSGGATVTLGADGIL-------SFGCLAFAPSGSDGGMAILGNV 453

Query: 227 QQQGTRVSFNLR--NSLIGFTPNKC 249
           QQ+    SF +R   + +GF P+ C
Sbjct: 454 QQR----SFEVRIDGTSVGFKPSSC 474


>gi|50508275|dbj|BAD32124.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
          Length = 451

 Score =  120 bits (300), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 97/273 (35%), Positives = 136/273 (49%), Gaps = 26/273 (9%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
           G   TET+ +G AS   +A GC   N G+   ++G++GLG   LS  SQ+    FSYCL 
Sbjct: 178 GYLATETLHVGGASFPGVAFGCSTEN-GVGNSSSGIVGLGRSPLSLVSQVGVGRFSYCL- 235

Query: 61  DRDSDSTSTLEFDSSLPP----NAVTAPLLRNHEL--DTFYYLGLTGISVGGDLLPISET 114
             D+D+  +     SL      N  + PLL N E+   ++YY+ LTGI+VG   LP++ T
Sbjct: 236 RSDADAGDSPILFGSLAKVTGGNVQSTPLLENPEMPSSSYYYVNLTGITVGATDLPVTST 295

Query: 115 AFKIDESGN----GGIIVDSGTAVTRLQTETYNALRDAFV--RGTRALSPT-DGVAL-FD 166
            F           GG IVDSGT +T L  E Y  ++ AF+    T  L+ T +G    FD
Sbjct: 296 TFGFTRGAGAGLVGGTIVDSGTTLTYLVKEGYAMVKRAFLSQMATANLTTTVNGTRFGFD 355

Query: 167 TCYDFSSR---SSVEVPTVSFHFPEGKVLPLPAKNY--LIPVDSNGTF---CFAFAPTSS 218
            C+D ++    S V VPT+   F  G    +  ++Y  ++ VDS G     C    P S 
Sbjct: 356 LCFDATAAGGGSGVPVPTLVLRFAGGAEYAVRRRSYVGVVAVDSQGRAAVECLLVLPASE 415

Query: 219 --SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
             S+SIIGNV Q    V ++L   +  F P  C
Sbjct: 416 KLSISIIGNVMQMDLHVLYDLDGGMFSFAPADC 448


>gi|388505490|gb|AFK40811.1| unknown [Medicago truncatula]
          Length = 193

 Score =  120 bits (300), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 63/172 (36%), Positives = 96/172 (55%), Gaps = 3/172 (1%)

Query: 79  NAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQ 138
             VT PL+ N    +FYY+ L  ISVG   L I ++ F++ + G+GG+I+DSGT +T ++
Sbjct: 21  KQVTTPLITNPLQPSFYYISLEVISVGDTKLSIEQSTFEVSDDGSGGVIIDSGTTITYIE 80

Query: 139 TETYNALRDAFVRGTRALSPTDGVALFDTCYDF-SSRSSVEVPTVSFHFPEGKVLPLPAK 197
              +++L+  F   T+      G    D C+   S ++ VE+P + FHF  G  L LP +
Sbjct: 81  ENAFDSLKKEFTSQTKLPVDKSGSTGLDVCFSLPSGKTEVEIPKLVFHFKGGD-LELPGE 139

Query: 198 NYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           NY+I   S G  C A    S+ +SI GN+QQQ   V+ +L+   I F P +C
Sbjct: 140 NYMIADSSLGVACLAMG-ASNGMSIFGNIQQQNILVNHDLQKETITFIPTQC 190


>gi|225427550|ref|XP_002266461.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
          Length = 439

 Score =  120 bits (300), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 90/263 (34%), Positives = 132/263 (50%), Gaps = 21/263 (7%)

Query: 1   GDFVTETVTLGS-----ASVDNIAIGCGHNNEGLF-VGAAGLLGLGGGSLSFPSQINAST 54
           G+  +ET+T+ S      S    A GCGH++ G+F   ++G++GLGGG LS  SQ+ ++ 
Sbjct: 181 GNLASETLTVDSTAGKPVSFPGFAFGCGHSSGGIFDKSSSGIVGLGGGELSLISQLKSTI 240

Query: 55  ---FSYCL--VDRDSDSTSTLEFDSSLPPNA---VTAPLLRNHELDTFYYLGLTGISVGG 106
              FSYCL  V  DS  +S + F +S   +    V+ PL++    DTFYYL L GISVG 
Sbjct: 241 NGLFSYCLLPVSTDSSISSRINFGASGRVSGYGTVSTPLVQKSP-DTFYYLTLEGISVGK 299

Query: 107 DLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFD 166
             LP    + K  E   G IIVDSGT  T L  E Y+ L  +     +     D   +F 
Sbjct: 300 KRLPYKGYS-KKTEVEEGNIIVDSGTTYTFLPQEFYSKLEKSVANSIKGKRVRDPNGIFS 358

Query: 167 TCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNV 226
            CY+  + + +  P ++ HF +  V   P   ++   +     CF  APT S + ++GN+
Sbjct: 359 LCYN--TTAEINAPIITAHFKDANVELQPLNTFMRMQED--LVCFTVAPT-SDIGVLGNL 413

Query: 227 QQQGTRVSFNLRNSLIGFTPNKC 249
            Q    V F+LR   + F    C
Sbjct: 414 AQVNFLVGFDLRKKRVSFKAADC 436


>gi|15238250|ref|NP_196637.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|8979710|emb|CAB96831.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
           thaliana]
 gi|18176136|gb|AAL59990.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
 gi|22136986|gb|AAM91722.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
 gi|110740988|dbj|BAE98588.1| nucleoid DNA-binding protein cnd41 - like protein [Arabidopsis
           thaliana]
 gi|332004210|gb|AED91593.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 464

 Score =  119 bits (299), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 95/253 (37%), Positives = 129/253 (50%), Gaps = 10/253 (3%)

Query: 1   GDFVTETVTLGSASV-DNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFS 56
           G    E  TL ++ V +++  GCG NN+GLF G AGLLGLG G LS P+Q   +    FS
Sbjct: 218 GFLAKEKFTLTNSDVLEDVYFGCGENNQGLFDGVAGLLGLGPGKLSLPAQTTTTYNNIFS 277

Query: 57  YCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
           YCL    S+ST  L F S+    +V    + +      Y + + GISVG   L I+  +F
Sbjct: 278 YCLPSFTSNSTGHLTFGSAGISESVKFTPISSFPSAFNYGIDIIGISVGDKELAITPNSF 337

Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSS 176
             +     G I+DSGT  TRL T+ Y  LR  F     +   T G  LFDTCYDF+   +
Sbjct: 338 STE-----GAIIDSGTVFTRLPTKVYAELRSVFKEKMSSYKSTSGYGLFDTCYDFTGLDT 392

Query: 177 VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFN 236
           V  PT++F F    V+ L      +P+  +   C AFA      +I GNVQQ    V ++
Sbjct: 393 VTYPTIAFSFAGSTVVELDGSGISLPIKIS-QVCLAFAGNDDLPAIFGNVQQTTLDVVYD 451

Query: 237 LRNSLIGFTPNKC 249
           +    +GF PN C
Sbjct: 452 VAGGRVGFAPNGC 464


>gi|224142013|ref|XP_002324355.1| predicted protein [Populus trichocarpa]
 gi|222865789|gb|EEF02920.1| predicted protein [Populus trichocarpa]
          Length = 477

 Score =  119 bits (299), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 91/260 (35%), Positives = 125/260 (48%), Gaps = 25/260 (9%)

Query: 1   GDFVTETVTLGSASV-DNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN---ASTFS 56
           G   TET+ + S+ V  N   GC   + G F G  GLLGLG   ++ PSQ      + FS
Sbjct: 232 GFLATETLAIASSDVFKNFLFGCSEESRGTFNGTTGLLGLGRSPIALPSQTTNKYKNLFS 291

Query: 57  YCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
           YCL    S ST  L F   +   A + P+  + +L   Y L   GISV G  LPI     
Sbjct: 292 YCLPASPS-STGHLSFGVEVSQAAKSTPI--SPKLKQLYGLNTVGISVRGRELPI----- 343

Query: 117 KIDESGNGGI---IVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSS 173
                 NG I   I+DSGT  T L + TY+AL  AF       + T+G + F  CYDFS+
Sbjct: 344 ------NGSISRTIIDSGTTFTFLPSPTYSALGSAFREMMANYTLTNGTSSFQPCYDFSN 397

Query: 174 --RSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT--SSSLSIIGNVQQQ 229
               ++ +P +S  F  G  + +     +IPV+     C AFA T   S  +I GN QQ+
Sbjct: 398 IGNGTLTIPGISIFFEGGVEVEIDVSGIMIPVNGLKEVCLAFADTGSDSDFAIFGNYQQK 457

Query: 230 GTRVSFNLRNSLIGFTPNKC 249
              V +++   ++GF P  C
Sbjct: 458 TYEVIYDVAKGMVGFAPKGC 477


>gi|357492389|ref|XP_003616483.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355517818|gb|AES99441.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 434

 Score =  119 bits (299), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 88/264 (33%), Positives = 141/264 (53%), Gaps = 22/264 (8%)

Query: 1   GDFVTETVTLGS-----ASVDNIAIGCGHNNEGLFVGAA-GLLGLGGGSLSFPSQINAS- 53
           GD   +T++L S      S   I IGCG +N G F GA+ G++GLGGG +S  +Q+ +S 
Sbjct: 175 GDLSVDTLSLESTSGSPVSFPKIVIGCGTDNAGTFGGASSGIVGLGGGPVSLITQLGSSI 234

Query: 54  --TFSYCLV---DRDSDSTSTLEF-DSSLPPN--AVTAPLLRNHELDTFYYLGLTGISVG 105
              FSYCLV   +++S+++S L F D+++      V+ PL++   +  FY+L L   SVG
Sbjct: 235 GGKFSYCLVPLLNKESNASSILSFGDAAVVSGDGVVSTPLIKKDPV--FYFLTLQAFSVG 292

Query: 106 GDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALF 165
              +    ++   D+ GN  II+DSGT +T + ++ Y  L  A V   +     D    F
Sbjct: 293 NKRVEFGGSSEGGDDEGN--IIIDSGTTLTLIPSDVYTNLESAVVDLVKLDRVDDPNQQF 350

Query: 166 DTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGN 225
             CY   S +  + P ++ HF +G  + L + +  +P+ ++G  CFAF P+    SI GN
Sbjct: 351 SLCYSLKS-NEYDFPIITVHF-KGADVELHSISTFVPI-TDGIVCFAFQPSPQLGSIFGN 407

Query: 226 VQQQGTRVSFNLRNSLIGFTPNKC 249
           + QQ   V ++L+   + F P  C
Sbjct: 408 LAQQNLLVGYDLQQKTVSFKPTDC 431


>gi|7715602|gb|AAF68120.1|AC010793_15 F20B17.14 [Arabidopsis thaliana]
 gi|12324588|gb|AAG52249.1|AC011717_17 putative aspartyl protease; 105611-106921 [Arabidopsis thaliana]
          Length = 436

 Score =  119 bits (299), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 85/261 (32%), Positives = 137/261 (52%), Gaps = 21/261 (8%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
           GD  +E++ LG   ++N   GCG NN+GLF G++GL+GLG  S+S  SQ   +    FSY
Sbjct: 183 GDLASESILLGDTKLENFVFGCGRNNKGLFGGSSGLMGLGRSSVSLVSQTLKTFNGVFSY 242

Query: 58  CLVDRDSDSTSTLEF--DSSLPPNAVTA---PLLRNHELDTFYYLGLTGISVGGDLLPIS 112
           CL   +  ++ +L F  DSS+  N+ +    PL++N +L +FY L LTG S+GG  + + 
Sbjct: 243 CLPSLEDGASGSLSFGNDSSVYTNSTSVSYTPLVQNPQLRSFYILNLTGASIGG--VELK 300

Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFS 172
            ++F        GI++DSGT +TRL    Y A++  F++         G ++ DTC++ +
Sbjct: 301 SSSF------GRGILIDSGTVITRLPPSIYKAVKIEFLKQFSGFPTAPGYSILDTCFNLT 354

Query: 173 SRSSVEVPTVSFHFPEGKVLPLPAKN--YLIPVDSNGTFCFAFAPTS--SSLSIIGNVQQ 228
           S   + +P +   F     L +      Y +  D++   C A A  S  + + IIGN QQ
Sbjct: 355 SYEDISIPIIKMIFQGNAELEVDVTGVFYFVKPDAS-LVCLALASLSYENEVGIIGNYQQ 413

Query: 229 QGTRVSFNLRNSLIGFTPNKC 249
           +  RV ++     +G     C
Sbjct: 414 KNQRVIYDTTQERLGIVGENC 434


>gi|18412482|ref|NP_565219.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|19699359|gb|AAL91289.1| At1g79720/F19K16_30 [Arabidopsis thaliana]
 gi|26450464|dbj|BAC42346.1| unknown protein [Arabidopsis thaliana]
 gi|115646741|gb|ABJ17101.1| At1g79720 [Arabidopsis thaliana]
 gi|332198170|gb|AEE36291.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 484

 Score =  119 bits (298), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 85/261 (32%), Positives = 137/261 (52%), Gaps = 21/261 (8%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
           GD  +E++ LG   ++N   GCG NN+GLF G++GL+GLG  S+S  SQ   +    FSY
Sbjct: 231 GDLASESILLGDTKLENFVFGCGRNNKGLFGGSSGLMGLGRSSVSLVSQTLKTFNGVFSY 290

Query: 58  CLVDRDSDSTSTLEF--DSSLPPNAVTA---PLLRNHELDTFYYLGLTGISVGGDLLPIS 112
           CL   +  ++ +L F  DSS+  N+ +    PL++N +L +FY L LTG S+GG  + + 
Sbjct: 291 CLPSLEDGASGSLSFGNDSSVYTNSTSVSYTPLVQNPQLRSFYILNLTGASIGG--VELK 348

Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFS 172
            ++F        GI++DSGT +TRL    Y A++  F++         G ++ DTC++ +
Sbjct: 349 SSSF------GRGILIDSGTVITRLPPSIYKAVKIEFLKQFSGFPTAPGYSILDTCFNLT 402

Query: 173 SRSSVEVPTVSFHFPEGKVLPLPAKN--YLIPVDSNGTFCFAFAPTS--SSLSIIGNVQQ 228
           S   + +P +   F     L +      Y +  D++   C A A  S  + + IIGN QQ
Sbjct: 403 SYEDISIPIIKMIFQGNAELEVDVTGVFYFVKPDAS-LVCLALASLSYENEVGIIGNYQQ 461

Query: 229 QGTRVSFNLRNSLIGFTPNKC 249
           +  RV ++     +G     C
Sbjct: 462 KNQRVIYDTTQERLGIVGENC 482


>gi|115479237|ref|NP_001063212.1| Os09g0423500 [Oryza sativa Japonica Group]
 gi|50725896|dbj|BAD33424.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
           sativa Japonica Group]
 gi|50726136|dbj|BAD33657.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
           sativa Japonica Group]
 gi|113631445|dbj|BAF25126.1| Os09g0423500 [Oryza sativa Japonica Group]
 gi|215767614|dbj|BAG99842.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222641596|gb|EEE69728.1| hypothetical protein OsJ_29412 [Oryza sativa Japonica Group]
          Length = 473

 Score =  119 bits (297), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 84/260 (32%), Positives = 129/260 (49%), Gaps = 21/260 (8%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN---ASTFSY 57
           G    + ++L    +D    GCG +N+G F G +GL+GLG   LS  SQ        FSY
Sbjct: 221 GVLAHDKLSLAGEVIDGFVFGCGTSNQGPFGGTSGLMGLGRSQLSLISQTMDQFGGVFSY 280

Query: 58  CLVDRDSDSTSTLEF--DSSLPPNA---VTAPLLRNHELDTFYYLGLTGISVGGDLLPIS 112
           CL  ++S+S+ +L    D+S+  N+   V   ++ +     FY++ LTGI++GG  +   
Sbjct: 281 CLPLKESESSGSLVLGDDTSVYRNSTPIVYTTMVSDPVQGPFYFVNLTGITIGGQEV--- 337

Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFS 172
                  ES  G +IVDSGT +T L    YNA++  F+          G ++ DTC++ +
Sbjct: 338 -------ESSAGKVIVDSGTIITSLVPSVYNAVKAEFLSQFAEYPQAPGFSILDTCFNLT 390

Query: 173 SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGT-FCFAFAPTSSSL--SIIGNVQQQ 229
               V++P++ F F     + + +   L  V S+ +  C A A   S    SIIGN QQ+
Sbjct: 391 GFREVQIPSLKFVFEGNVEVEVDSSGVLYFVSSDSSQVCLALASLKSEYETSIIGNYQQK 450

Query: 230 GTRVSFNLRNSLIGFTPNKC 249
             RV F+   S IGF    C
Sbjct: 451 NLRVIFDTLGSQIGFAQETC 470


>gi|326490656|dbj|BAJ89995.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 456

 Score =  119 bits (297), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 94/256 (36%), Positives = 137/256 (53%), Gaps = 20/256 (7%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEG--LFVGAAGLLGLGGGSLSFPSQINAS---TF 55
           G + ++T+ LGS++V+N   GC  +  G  L    AGL+GLGGG+ S  +Q   +    F
Sbjct: 214 GTYSSDTLALGSSTVENFQFGCSQSESGNLLQDQTAGLMGLGGGAESLATQTAGTFGKAF 273

Query: 56  SYCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETA 115
           SYCL      S+  L   +S     V  P+LR+ ++ ++Y + L  I VGG  L I  +A
Sbjct: 274 SYCL-PPTPGSSGFLTLGASTSGFVVKTPMLRSTQVPSYYGVLLQAIRVGGRQLNIPASA 332

Query: 116 FKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRS 175
           F      + G I+DSGT +TRL    Y+AL  AF  G +   P   + +FDTC+DFS +S
Sbjct: 333 F------SAGSIMDSGTIITRLPRTAYSALSSAFKAGMKQYPPAQPMGIFDTCFDFSGQS 386

Query: 176 SVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS--SSLSIIGNVQQQGTRV 233
           SV +PTV+  F  G V+ L +   ++        C AFA  S  +SL IIGNVQQ+   V
Sbjct: 387 SVSIPTVALVFSGGAVVDLASDGIIL------GSCLAFAANSDDTSLGIIGNVQQRTFEV 440

Query: 234 SFNLRNSLIGFTPNKC 249
            +++    +GF    C
Sbjct: 441 LYDVGGGAVGFKAGAC 456


>gi|413944032|gb|AFW76681.1| hypothetical protein ZEAMMB73_606599 [Zea mays]
          Length = 315

 Score =  119 bits (297), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 78/257 (30%), Positives = 121/257 (47%), Gaps = 28/257 (10%)

Query: 13  ASVDNIAIGCGHNNEGLFV-GAAGLLGLGGGSLSFPSQINASTFSYCLVDRDSDSTSTLE 71
            +V  +A GCG  N GLFV   +G+ G G G  S PSQ+    FSYCL       +S + 
Sbjct: 64  VAVSELAFGCGDYNTGLFVSNESGIAGFGRGPQSLPSQLKVGRFSYCLTLVTESKSSVVI 123

Query: 72  FDSSLPPNAVTA---------PLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESG 122
             +   P+ + A         P++ N  + TFYYL L GI+VG   LP  ++ F + + G
Sbjct: 124 LGTPPDPDGLRAHTTGPFQSTPIIYNPLIPTFYYLSLEGITVGKTRLPFDKSVFALKKDG 183

Query: 123 NGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSR-------- 174
           +GG ++DSGT++T L    +  L++  V    A  P   +  +D   +   R        
Sbjct: 184 SGGTVIDSGTSLTTLPEAVFELLQEELV----AQFP---LPRYDNTPEVGDRLCFRRPKG 236

Query: 175 -SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAF-APTSSSLSIIGNVQQQGTR 232
              V VP +  H   G  + LP  NY +    +G  C        +++ +IGN QQQ   
Sbjct: 237 GKQVPVPKLILHL-AGADMDLPRDNYFVEEPDSGVMCLQINGAEDTTMVLIGNFQQQNMH 295

Query: 233 VSFNLRNSLIGFTPNKC 249
           V +++ N+ + F P +C
Sbjct: 296 VVYDVENNKLLFAPAQC 312


>gi|357158691|ref|XP_003578210.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 459

 Score =  119 bits (297), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 85/246 (34%), Positives = 115/246 (46%), Gaps = 17/246 (6%)

Query: 18  IAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLVDRDSDSTSTLEFDS--- 74
           +  GCG  N+G     +G++G G   LS  SQ+    FSYCL    S   STL F S   
Sbjct: 217 LGFGCGTMNKGSLNNGSGIVGFGRAPLSLVSQLAIRRFSYCLTPYASGRKSTLLFGSLRG 276

Query: 75  ----SLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDS 130
               +      T  LLR+ +  TFYY+  TG++VG   L I  +AF +   G+GG IVDS
Sbjct: 277 GVYDAATATVQTTRLLRSRQNPTFYYVPFTGVTVGARRLRIPISAFALRPDGSGGAIVDS 336

Query: 131 GTAVTRLQTETYNALRDAFVRGTR-------ALSPTDGVALFDTCYDFSSRSSVEVPTVS 183
           GTA+T         +  AF    R       +  P DGV  F        R +V VP + 
Sbjct: 337 GTALTLFPAPVLAEVVRAFRSQLRLPFAANGSSGPDDGVC-FAAAASRVPRPAV-VPRMV 394

Query: 184 FHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNSLIG 243
           FH  +G  L LP +NY++     G  C   A +  S + IGN  QQ  RV ++L    + 
Sbjct: 395 FHL-QGADLDLPRRNYVLDDQRKGNLCLLLADSGDSGTTIGNFVQQDMRVLYDLEADTLS 453

Query: 244 FTPNKC 249
           F P +C
Sbjct: 454 FAPAQC 459


>gi|302758122|ref|XP_002962484.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
 gi|300169345|gb|EFJ35947.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
          Length = 395

 Score =  119 bits (297), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 84/262 (32%), Positives = 136/262 (51%), Gaps = 23/262 (8%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGL-FVGAAGLLGLGGGSLSFPSQINAST----F 55
           G+  T T+      + N+A+GC   + G  F+GA+G+LGLG G +S  +Q   +     F
Sbjct: 143 GNHKTRTI-----RIKNVALGCSRESVGASFLGASGVLGLGQGPISLATQTRHTALGGIF 197

Query: 56  SYCLVD--RDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP-IS 112
           SYCLVD  R S+++S L    +        P++RN    +FYY+ +TG++V G  +  I+
Sbjct: 198 SYCLVDYLRGSNASSFLVMGRTRWRKLAHTPIVRNPAAQSFYYVNVTGVAVDGKPVDGIA 257

Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAF---VRGTRALSPTDGVALFDTCY 169
            + + ID  GN G I DSGT ++ L+   Y+ +  A    +   RA    +G   F+ CY
Sbjct: 258 SSDWGIDGDGNKGTIFDSGTTLSYLREPAYSKVLGALNASIYLPRAQEIPEG---FELCY 314

Query: 170 DFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP--TSSSLSIIGNVQ 227
           +  +R    +P +   F  G V+ LP  NY++ V  N   C A     T++  +I+GN+ 
Sbjct: 315 NV-TRMEKGMPKLGVEFQGGAVMELPWNNYMVLVAEN-VQCVALQKVTTTNGSNILGNLL 372

Query: 228 QQGTRVSFNLRNSLIGFTPNKC 249
           QQ   + ++L  + IGF  + C
Sbjct: 373 QQDHHIEYDLAKARIGFKWSPC 394


>gi|194701538|gb|ACF84853.1| unknown [Zea mays]
 gi|194703714|gb|ACF85941.1| unknown [Zea mays]
          Length = 208

 Score =  119 bits (297), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 83/220 (37%), Positives = 114/220 (51%), Gaps = 19/220 (8%)

Query: 37  LGLGGGSLSFPSQINAS---TFSYCLVDRDSDS--TSTLEFDSSLPPNAVTAPLLRNHEL 91
           +GLGGG+ S  SQ   +    FSYCL    S S   +      S     V  P+LR+ ++
Sbjct: 1   MGLGGGAQSLVSQTAGTLGRAFSYCLPPTPSSSGFLTLGAAGGSGTSGFVKTPMLRSSQV 60

Query: 92  DTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR 151
            TFY + L  I VGG  L I  + F      + G ++DSGT +TRL    Y+AL  AF  
Sbjct: 61  PTFYGVRLQAIRVGGRQLSIPASVF------SAGTVMDSGTVITRLPPTAYSALSSAFKA 114

Query: 152 GTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCF 211
           G +   P     + DTC+DFS +SSV +P+V+  F  G V+ L A   ++   SN   C 
Sbjct: 115 GMKQYPPAQPSGILDTCFDFSGQSSVSIPSVALVFSGGAVVSLDASGIIL---SN---CL 168

Query: 212 AFAPTS--SSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           AFA  S  SSL IIGNVQQ+   V +++   ++GF    C
Sbjct: 169 AFAGNSDDSSLGIIGNVQQRTFEVLYDVGRGVVGFRAGAC 208


>gi|255565759|ref|XP_002523869.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223536957|gb|EEF38595.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 447

 Score =  119 bits (297), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 82/269 (30%), Positives = 127/269 (47%), Gaps = 27/269 (10%)

Query: 4   VTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLVDR- 62
           ++ET+ L    V N  +GC   +       AG+ G G G  S PSQ+  + FSYCL+   
Sbjct: 182 LSETLHLHGLIVPNFLVGCSVFSSR---QPAGIAGFGRGPSSLPSQLGLTKFSYCLLSHK 238

Query: 63  --DSDSTSTLEFDSSLPPNAVTA-----PLLRNHELD------TFYYLGLTGISVGGDLL 109
             D+  +S+L  DS    +  TA     PL++N ++        +YY+ L  IS+GG  +
Sbjct: 239 FDDTQESSSLVLDSQSDSDKKTAALMYTPLVKNPKVQDKPAFSVYYYVSLRRISIGGRSV 298

Query: 110 PISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGT----RALSPTDGVALF 165
            I       D+ GNGG I+DSGT  T + TE +  L + F+       RAL   + ++  
Sbjct: 299 KIPYKYLSPDKDGNGGTIIDSGTTFTYMSTEAFEILSNEFISQVKNYERALM-VEALSGL 357

Query: 166 DTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLS---- 221
             C++ S    +E+P +  HF  G  + LP +NY   + S    CF      +  +    
Sbjct: 358 KPCFNVSGAKELELPQLRLHFKGGADVELPLENYFAFLGSREVACFTVVTDGAEKASGPG 417

Query: 222 -IIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
            I+GN Q Q   V ++L+N  +GF    C
Sbjct: 418 MILGNFQMQNFYVEYDLQNERLGFKKESC 446


>gi|21595063|gb|AAM66069.1| putative aspartyl protease [Arabidopsis thaliana]
          Length = 484

 Score =  119 bits (297), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 85/261 (32%), Positives = 137/261 (52%), Gaps = 21/261 (8%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
           GD  +E++ LG   ++N   GCG NN+GLF G++GL+GLG  S+S  SQ   +    FSY
Sbjct: 231 GDLASESILLGDTKLENFVFGCGRNNKGLFGGSSGLMGLGRSSVSLVSQTLKTFNGVFSY 290

Query: 58  CLVDRDSDSTSTLEF--DSSLPPNAVTA---PLLRNHELDTFYYLGLTGISVGGDLLPIS 112
           CL   +  ++ +L F  DSS+  N+ +    PL++N +L +FY L LTG S+GG  + + 
Sbjct: 291 CLPSLEDGASGSLSFGNDSSVYTNSTSVSYTPLVQNPQLRSFYILNLTGASIGG--VELK 348

Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFS 172
            ++F        GI++DSGT +TRL    Y A++  F++         G ++ DTC++ +
Sbjct: 349 SSSF------GRGILIDSGTVITRLPPSIYKAVKIEFLKQFSGFPTAPGYSILDTCFNLT 402

Query: 173 SRSSVEVPTVSFHFPEGKVLPLPAKN--YLIPVDSNGTFCFAFAPTS--SSLSIIGNVQQ 228
           S   + +P +   F     L +      Y +  D++   C A A  S  + + IIGN QQ
Sbjct: 403 SYEDISIPIIKMIFQGNAELEVDVTGVFYFVKPDAS-LVCLALASLSYENEVGIIGNYQQ 461

Query: 229 QGTRVSFNLRNSLIGFTPNKC 249
           +  RV ++     +G     C
Sbjct: 462 KNQRVIYDSTQERLGIVGENC 482


>gi|125563764|gb|EAZ09144.1| hypothetical protein OsI_31414 [Oryza sativa Indica Group]
          Length = 472

 Score =  119 bits (297), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 84/260 (32%), Positives = 129/260 (49%), Gaps = 21/260 (8%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN---ASTFSY 57
           G    + ++L    +D    GCG +N+G F G +GL+GLG   LS  SQ        FSY
Sbjct: 220 GVLAHDKLSLAGEVIDGFVFGCGTSNQGPFGGTSGLMGLGRSQLSLISQTMDQFGGVFSY 279

Query: 58  CLVDRDSDSTSTLEF--DSSLPPNA---VTAPLLRNHELDTFYYLGLTGISVGGDLLPIS 112
           CL  ++S+S+ +L    D+S+  N+   V   ++ +     FY++ LTGI++GG  +   
Sbjct: 280 CLPLKESESSGSLVLGDDTSVYRNSTPIVYTTMVSDPVQGPFYFVNLTGITIGGQEV--- 336

Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFS 172
                  ES  G +IVDSGT +T L    YNA++  F+          G ++ DTC++ +
Sbjct: 337 -------ESSAGKVIVDSGTIITSLVPSVYNAVKAEFLSQFAEYPQAPGFSILDTCFNLT 389

Query: 173 SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGT-FCFAFAPTSSSL--SIIGNVQQQ 229
               V++P++ F F     + + +   L  V S+ +  C A A   S    SIIGN QQ+
Sbjct: 390 GFREVQIPSLKFVFEGNVEVEVDSSGVLYFVSSDSSQVCLALASLKSEYETSIIGNYQQK 449

Query: 230 GTRVSFNLRNSLIGFTPNKC 249
             RV F+   S IGF    C
Sbjct: 450 NLRVIFDTLGSQIGFAQETC 469


>gi|357154085|ref|XP_003576664.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 509

 Score =  118 bits (296), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 91/271 (33%), Positives = 125/271 (46%), Gaps = 29/271 (10%)

Query: 1   GDFVTETVTLGS-----ASVDN------IAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ 49
           G    +T+TLG+     AS +N         GCG NN GLF  A GL GLG G +S  SQ
Sbjct: 245 GHLGNDTLTLGTMAPANASAENDNKLPGFVFGCGENNTGLFGQADGLFGLGRGKVSLSSQ 304

Query: 50  INAS---TFSYCLVDRDSDSTSTLEFDSSL--PPNAVTAPLLRNHELDTFYYLGLTGISV 104
                   FSYCL    S +   L   + +  P +A   P+L      +FYY+ L GI V
Sbjct: 305 AAGKFGEGFSYCLPSSSSSAPGYLSLGTPVPAPAHAQFTPMLNRTTTPSFYYVKLVGIRV 364

Query: 105 GGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFV--RGTRALSPTDGV 162
            G  + +S     +       +IVDSGT +TRL    Y ALR AF+   G         +
Sbjct: 365 AGRAIRVSSPRVALP------LIVDSGTVITRLAPRAYRALRAAFLSAMGKYGYKRAPRL 418

Query: 163 ALFDTCYDFSSR--SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT--SS 218
           ++ DTCYDF++   ++V +P V+  F  G  + +     L  V      C AFAP     
Sbjct: 419 SILDTCYDFTAHANATVSIPAVALVFAGGATISVDFSGVLY-VAKVAQACLAFAPNGDGR 477

Query: 219 SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           S  I+GN QQ+   V +++    IGF    C
Sbjct: 478 SAGILGNTQQRTLAVVYDVARQKIGFAAKGC 508


>gi|302758750|ref|XP_002962798.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
 gi|300169659|gb|EFJ36261.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
          Length = 427

 Score =  118 bits (296), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 81/248 (32%), Positives = 131/248 (52%), Gaps = 18/248 (7%)

Query: 15  VDNIAIGCGHNNEGL-FVGAAGLLGLGGGSLSFPSQINAST----FSYCLVD--RDSDST 67
           + N+A+GC   + G  F+GA+G+LGLG G +S  +Q   +     FSYCLVD  R S+++
Sbjct: 184 IKNVALGCSRESVGASFLGASGVLGLGQGPISLATQTRHTALGGIFSYCLVDYLRGSNAS 243

Query: 68  STLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP-ISETAFKIDESGNGGI 126
           S L    +        P++RN    +FYY+ +TG++V G  +  I+ + + ID  GN G 
Sbjct: 244 SFLVMGRTHWRKLAHTPIVRNPAAQSFYYVNVTGVAVDGKPVDGIASSDWGIDGDGNKGT 303

Query: 127 IVDSGTAVTRLQTETYNALRDAF---VRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVS 183
           I DSGT ++ L+   Y+ +  A    +   RA    +G   F+ CY+  +R    +P + 
Sbjct: 304 IFDSGTTLSYLREPAYSKVLGALNASIYLPRAQEIPEG---FELCYNV-TRMEKGMPKLG 359

Query: 184 FHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP--TSSSLSIIGNVQQQGTRVSFNLRNSL 241
             F  G V+ LP  NY++ V  N   C A     T++  +I+GN+ QQ   + ++L  + 
Sbjct: 360 VEFQGGAVMELPWNNYMVLVAEN-VQCVALQKVTTTNGSNILGNLLQQDHHIEYDLAKAR 418

Query: 242 IGFTPNKC 249
           IGF  + C
Sbjct: 419 IGFKWSPC 426


>gi|413953782|gb|AFW86431.1| hypothetical protein ZEAMMB73_038825 [Zea mays]
          Length = 462

 Score =  117 bits (294), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 91/257 (35%), Positives = 131/257 (50%), Gaps = 20/257 (7%)

Query: 6   ETVTLGSA-SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFS---YCLVD 61
           ET++L SA ++   A GCG  N G F    GL+GLG G LS  SQ  AS  +   YCL  
Sbjct: 213 ETLSLTSARALPGFAFGCGETNLGDFGDVDGLIGLGRGQLSLSSQAAASFGAAFSYCLPS 272

Query: 62  RDSDSTSTLEFDSSLPPNA-----VTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
            ++ S   L   ++ P +       TA +++  +  +FY++ L  I VGG +LP+    F
Sbjct: 273 YNT-SHGYLTIGTTTPASGSDGVRYTA-MIQKQDYPSFYFVDLVSIVVGGFVLPVPPILF 330

Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSS 176
             D     G ++DSGT +T L  E Y ALRD F        P      FDTCYDF+ +++
Sbjct: 331 TRD-----GTLLDSGTVLTYLPPEAYTALRDRFKFTMTQYKPAPAYDPFDTCYDFAGQNA 385

Query: 177 VEVPTVSFHFPEGKVLPLPAKNYLIPVD--SNGTFCFAFAPTSSSL--SIIGNVQQQGTR 232
           + +P VSF F +G    L     LI  D  +  T C AF P  S++  +I+GN QQ+ T 
Sbjct: 386 IFMPLVSFKFSDGSSFDLSPFGVLIFPDDTAPATGCLAFVPRPSTMPFTIVGNTQQRNTE 445

Query: 233 VSFNLRNSLIGFTPNKC 249
           + +++    IGF    C
Sbjct: 446 MIYDVAAEKIGFVSGSC 462


>gi|224142001|ref|XP_002324349.1| predicted protein [Populus trichocarpa]
 gi|222865783|gb|EEF02914.1| predicted protein [Populus trichocarpa]
          Length = 490

 Score =  117 bits (294), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 105/255 (41%), Positives = 131/255 (51%), Gaps = 13/255 (5%)

Query: 1   GDFVTETVTLGSA-SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFS 56
           G F TE +TL S  + +NI  GCG NN+GLF G+AGLLGLG   LS  SQ        FS
Sbjct: 242 GFFGTEKLTLTSTDAFNNIYFGCGQNNQGLFGGSAGLLGLGRDKLSVVSQTAQKYNKIFS 301

Query: 57  YCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
           YCL    S ST  L F  S   NA   PL       +FY L  TGISVGG  L IS + F
Sbjct: 302 YCL-PSSSSSTGFLTFGGSASKNAKFTPLSTISAGPSFYGLDFTGISVGGKKLAISASVF 360

Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSS 176
                   G I+DSGT +TRL    Y+ALR +F         T  +++ DTCYDFSS ++
Sbjct: 361 S-----TAGAIIDSGTVITRLPPAAYSALRASFRNLMSKYPMTKALSILDTCYDFSSYTT 415

Query: 177 VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSS--LSIIGNVQQQGTRVS 234
           + VP + F F  G  + + A   L    S    C AFA  S +  + I GNVQQ+   V 
Sbjct: 416 ISVPKIGFSFSSGIEVDIDATGILY-ASSLSQVCLAFAGNSDATDVFIFGNVQQKTLEVF 474

Query: 235 FNLRNSLIGFTPNKC 249
           ++     +GF P  C
Sbjct: 475 YDGSAGKVGFAPGGC 489


>gi|326490862|dbj|BAJ90098.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 447

 Score =  117 bits (294), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 84/258 (32%), Positives = 118/258 (45%), Gaps = 27/258 (10%)

Query: 11  GSASVDNIAIGCGHNNEGLF-VGAAGLLGLGGGSLSFPSQINASTFSYCLVDRDSDSTST 69
           G  +V ++  GCG  N G F     G+ G G G LS P Q+  S+FSYC      +S ST
Sbjct: 195 GKVTVPDLVFGCGQYNTGNFHSNETGIAGFGRGPLSLPRQLGVSSFSYCFTT-IFESKST 253

Query: 70  LEFDSSLPPNAVTA---------PLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDE 120
             F    P + + A         P L NH    +YYL L GI+VG   L + E+AF +  
Sbjct: 254 PVFLGGAPADGLRAHATGPILSTPFLPNHP--EYYYLSLKGITVGKTRLAVPESAFVVKA 311

Query: 121 SGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDT------CYDFSS- 173
            G+GG I+DSGTA+T      + +L +AFV    A  P    +  DT      C+   S 
Sbjct: 312 DGSGGTIIDSGTAITAFPRAVFRSLWEAFV----AQVPLPHTSYNDTGEPTLQCFSTESV 367

Query: 174 --RSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGT 231
              S V VP ++ H  EG    LP +NY+     +   C          ++IGN QQQ  
Sbjct: 368 PDASKVPVPKMTLHL-EGADWELPRENYMAEYPDSDQLCVVVLAGDDDRTMIGNFQQQNM 426

Query: 232 RVSFNLRNSLIGFTPNKC 249
            +  +L  + +   P +C
Sbjct: 427 HIVHDLAGNKLVIEPAQC 444


>gi|302790323|ref|XP_002976929.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
 gi|300155407|gb|EFJ22039.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
          Length = 373

 Score =  117 bits (292), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 89/264 (33%), Positives = 129/264 (48%), Gaps = 25/264 (9%)

Query: 11  GSASVDNIAIGCGHNNEGLFVG-AAGLLGLGGGSLSFPSQINAST-------FSYCLVDR 62
            ++++ ++  GC   +    V  ++G LGL  GS SFP+QI + +       FSYC  +R
Sbjct: 107 AASTLGDVIFGCASKDLQRPVDFSSGTLGLNRGSFSFPAQIGSRSKSGLSDRFSYCFPNR 166

Query: 63  DS--DSTSTLEF-DSSLPPNAVTAPLLRNH----ELDTFYYLGLTGISVGGDLLPISETA 115
               +S+  + F DS +P +      L        +  FYY+GL GISVGG+LL I  +A
Sbjct: 167 AEHLNSSGVIIFGDSGIPAHHFQYLSLEQEPPIASIVDFYYVGLQGISVGGELLHIPRSA 226

Query: 116 FKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVAL-FDTCYDFSSR 174
           FKID  GNGG   DSGT V+ L    + AL +AF R    L+ T G     + CYD ++ 
Sbjct: 227 FKIDRLGNGGTYFDSGTTVSFLVEPAHTALVEAFGRRVLHLNRTSGSDFTKELCYDVAAG 286

Query: 175 SSV--EVPTVSFHFPEGKVLPLPAKNYLIPVDSNG---TFCFAF----APTSSSLSIIGN 225
            +     P V+ HF     + L   +  +P+       T C AF    A     +++IGN
Sbjct: 287 DARLPTAPLVTLHFKNNVDMELREASVWVPLARTPQVVTICLAFVNAGAVAQGGVNVIGN 346

Query: 226 VQQQGTRVSFNLRNSLIGFTPNKC 249
            QQQ   +  +L  S IGF P  C
Sbjct: 347 YQQQDYLIEHDLERSRIGFAPANC 370


>gi|125555058|gb|EAZ00664.1| hypothetical protein OsI_22685 [Oryza sativa Indica Group]
          Length = 465

 Score =  117 bits (292), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 92/263 (34%), Positives = 130/263 (49%), Gaps = 25/263 (9%)

Query: 1   GDFVTETVTLG-SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFS 56
           G + TET+TL     V +   GCG +  G +    GLLGLGG   S  SQ ++     FS
Sbjct: 214 GVYSTETLTLKPGVVVADFGFGCGDHQHGPYEKFDGLLGLGGAPESLVSQTSSQFGGPFS 273

Query: 57  YCLVDRDSDSTSTLEFDSSLPPNAVTA-------PLLRNHELDTFYYLGLTGISVGGDLL 109
           YCL    S     L   +    ++ TA       P+ R   + TFY + LTGISVGG  L
Sbjct: 274 YCL-PPTSGGAGFLALGAPNSSSSSTAAAGFLFTPMRRIPSVPTFYVVTLTGISVGGAPL 332

Query: 110 PISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGT---RALSPTDGVALFD 166
            +  +AF      + G+++DSGT +T L    Y ALR AF       R L P++G A+ D
Sbjct: 333 AVPPSAF------SSGMVIDSGTVITGLPATAYAALRSAFRSAMSEYRLLPPSNG-AVLD 385

Query: 167 TCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNV 226
           TCYDF+  ++V VPT++  F  G  + L     ++    +G   FA A T  ++ IIGNV
Sbjct: 386 TCYDFTGHTNVTVPTIALTFSGGATIDLATPAGVL---VDGCLAFAGAGTDDTIGIIGNV 442

Query: 227 QQQGTRVSFNLRNSLIGFTPNKC 249
            Q+   V ++     +GF    C
Sbjct: 443 NQRTFEVLYDSGKGTVGFRAGAC 465


>gi|357492401|ref|XP_003616489.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355517824|gb|AES99447.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 434

 Score =  117 bits (292), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 87/264 (32%), Positives = 142/264 (53%), Gaps = 22/264 (8%)

Query: 1   GDFVTETVTLGSASVDNIA-----IGCGHNNEGLFVGAA-GLLGLGGGSLSFPSQINAS- 53
           GD   +T++L S S   ++     IGCG +N G F GA+ G++GLGGG +S  +Q+ +S 
Sbjct: 175 GDLSVDTLSLESTSGSPVSFPKTVIGCGTDNAGTFGGASSGIVGLGGGPVSLITQLGSSI 234

Query: 54  --TFSYCLV---DRDSDSTSTLEF-DSSLPPN--AVTAPLLRNHELDTFYYLGLTGISVG 105
              FSYCLV   +++S+++S L F D+++      V+ PL++   +  FY+L L   SVG
Sbjct: 235 GGKFSYCLVPLLNKESNASSILSFGDAAVVSGDGVVSTPLIKKDPV--FYFLTLQAFSVG 292

Query: 106 GDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALF 165
              +    ++   D+ GN  II+DSGT +T + ++ Y  L  A V   +     D    F
Sbjct: 293 NKRVEFGGSSEGGDDEGN--IIIDSGTTLTLIPSDVYTNLESAVVDLVKLDRVDDPNQQF 350

Query: 166 DTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGN 225
             CY   S +  + P ++ HF +G  + L + +  +P+ ++G  CFAF P+    SI GN
Sbjct: 351 SLCYSLKS-NEYDFPIITAHF-KGADIELHSISTFVPI-TDGIVCFAFQPSPQLGSIFGN 407

Query: 226 VQQQGTRVSFNLRNSLIGFTPNKC 249
           + QQ   V ++L+   + F P  C
Sbjct: 408 LAQQNLLVGYDLQQKTVSFKPTDC 431


>gi|224102847|ref|XP_002312826.1| predicted protein [Populus trichocarpa]
 gi|222849234|gb|EEE86781.1| predicted protein [Populus trichocarpa]
          Length = 445

 Score =  116 bits (291), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 87/274 (31%), Positives = 125/274 (45%), Gaps = 33/274 (12%)

Query: 4   VTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV--- 60
           ++ET+ L S S  N  +GC   +       AG+ G G G  S PSQ+    FSYCL+   
Sbjct: 177 LSETLHLHSLSKPNFLVGCSVFSSH---QPAGIAGFGRGLSSLPSQLGLGKFSYCLLSHR 233

Query: 61  -DRDSDSTSTL-----EFDSSLPPNA-VTAPLLRNHELD------TFYYLGLTGISVGGD 107
            D D+  +S+L     + DS    NA V  P ++N ++D       +YYLGL  I+VGG 
Sbjct: 234 FDDDTKKSSSLVLDMEQLDSDKKTNALVYTPFVKNPKVDNKSSFSVYYYLGLRRITVGGH 293

Query: 108 LLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGT----RALSPTDGVA 163
            + +        E GNGG+I+DSGT  T +  E +  L D F+R      R     D + 
Sbjct: 294 HVKVPYKYLSPGEDGNGGVIIDSGTTFTFMAREAFEPLSDEFIRQIKDYRRVKEIEDAIG 353

Query: 164 LFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLS-- 221
           L   C++ S   +V  P +  +F  G  + LP +NY   V      C        +    
Sbjct: 354 L-RPCFNVSDAKTVSFPELRLYFKGGADVALPVENYFAFVGGE-VACLTVVTDGVAGPER 411

Query: 222 ------IIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
                 I+GN Q Q   V ++LRN  +GF   KC
Sbjct: 412 VGGPGMILGNFQMQNFYVEYDLRNERLGFKQEKC 445


>gi|242050428|ref|XP_002462958.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
 gi|241926335|gb|EER99479.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
          Length = 460

 Score =  116 bits (290), Expect = 9e-24,   Method: Compositional matrix adjust.
 Identities = 80/259 (30%), Positives = 128/259 (49%), Gaps = 12/259 (4%)

Query: 1   GDFVTETVTLGSAS-VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
           G   TET T G+ + V ++A GCG +N G    ++GL+G+G G LS  SQ+  + FSYC 
Sbjct: 201 GVLATETFTFGAGTTVHDLAFGCGTDNLGGTDNSSGLVGMGRGPLSLVSQLGVTKFSYCF 260

Query: 60  VDRDSDSTSTLEF---DSSLPPNAVTAPLL---RNHELDTFYYLGLTGISVGGDLLPISE 113
              +  +TS+  F    +SL P A + P +         ++YYL L GI+VG  LLPI  
Sbjct: 261 TPFNDTTTSSPLFLGSSASLSPAAKSTPFVPSPSGPRRSSYYYLSLEGITVGDTLLPIDP 320

Query: 114 TAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCY---D 170
             F++  SG GG+I+DSGT  T L+   +  L  A          +        C+    
Sbjct: 321 AVFRLTASGRGGLIIDSGTTFTALEERAFVVLARAVAARVALPLASGAHLGLSVCFAAPQ 380

Query: 171 FSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQG 230
                +V+VP +  HF +G  + LP  + ++     G  C     ++  +S++G++QQQ 
Sbjct: 381 GRGPEAVDVPRLVLHF-DGADMELPRSSAVVEDRVAGVACLGIV-SARGMSVLGSMQQQN 438

Query: 231 TRVSFNLRNSLIGFTPNKC 249
             V +++   ++ F P  C
Sbjct: 439 MHVRYDVGRDVLSFEPANC 457


>gi|242063796|ref|XP_002453187.1| hypothetical protein SORBIDRAFT_04g001340 [Sorghum bicolor]
 gi|241933018|gb|EES06163.1| hypothetical protein SORBIDRAFT_04g001340 [Sorghum bicolor]
          Length = 493

 Score =  116 bits (290), Expect = 9e-24,   Method: Compositional matrix adjust.
 Identities = 98/265 (36%), Positives = 135/265 (50%), Gaps = 22/265 (8%)

Query: 1   GDFVTETVTLGSAS----VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ----INA 52
           G + ++T+ LGS S    V     GC H   G+    AGL+GLGGG+ S  SQ       
Sbjct: 235 GTYSSDTLALGSNSNTVVVSKFRFGCSHAETGITGLTAGLMGLGGGAQSLVSQTAGTFGT 294

Query: 53  STFSYCLVDRDSDST-STLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPI 111
           + FSYCL    S S   TL    +     V  P+LR+ ++  FY + L  I VGG  L I
Sbjct: 295 TAFSYCLPPTPSSSGFLTLGAAGTSSAGFVKTPMLRSSQVPAFYGVRLEAIRVGGRQLSI 354

Query: 112 SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSP---TDGVALFDTC 168
             T F      + G+I+DSGT VTRL    Y++L  AF  G +   P   + G    DTC
Sbjct: 355 PTTVF------SAGMIMDSGTVVTRLPPTAYSSLSSAFKAGMKQYPPAPSSAGGGFLDTC 408

Query: 169 YDFSSRSSVEVPTVSFHF--PEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS--SLSIIG 224
           +D S +SSV +PTV+  F    G V+ L A   L+ ++++  FC AF  TS   S  IIG
Sbjct: 409 FDMSGQSSVSMPTVALVFSGAGGAVVNLDASGILLQMETSSIFCLAFVATSDDGSTGIIG 468

Query: 225 NVQQQGTRVSFNLRNSLIGFTPNKC 249
           NVQQ+  +V +++    +GF    C
Sbjct: 469 NVQQRTFQVLYDVAGGAVGFKAGAC 493


>gi|298204765|emb|CBI25263.3| unnamed protein product [Vitis vinifera]
          Length = 359

 Score =  116 bits (290), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 74/252 (29%), Positives = 112/252 (44%), Gaps = 45/252 (17%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
           G+   E +  G+  V +   GCG NN+GLF G +GL+GLG   LS  SQ +         
Sbjct: 147 GELGHEKLKFGTILVKDFIFGCGRNNKGLFGGVSGLMGLGRSDLSLISQTS--------- 197

Query: 61  DRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDE 120
                                      N +L  FY++ LTGIS+GG        A +   
Sbjct: 198 --------------------------ENPQLYNFYFINLTGISIGG-------VALQAPS 224

Query: 121 SGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVP 180
            G   I+VDSGT +TRL    Y AL+  F++      P    ++ DTC++ S+   V++P
Sbjct: 225 VGPSRILVDSGTVITRLPPTIYKALKAEFLKQFTGFPPAPAFSILDTCFNLSAYQEVDIP 284

Query: 181 TVSFHFPEGKVLPLPAKNYLIPVDSNGT-FCFAFA--PTSSSLSIIGNVQQQGTRVSFNL 237
           T+  HF     L +        V S+ +  C A A       ++I+GN QQ+  RV ++ 
Sbjct: 285 TIKMHFEGNAELTVDVTGVFYFVKSDASQVCLALASLEYQDEVAILGNYQQKNLRVIYDT 344

Query: 238 RNSLIGFTPNKC 249
           + + +GF    C
Sbjct: 345 KETKVGFALETC 356


>gi|147809812|emb|CAN71447.1| hypothetical protein VITISV_040904 [Vitis vinifera]
          Length = 988

 Score =  115 bits (289), Expect = 1e-23,   Method: Composition-based stats.
 Identities = 96/274 (35%), Positives = 132/274 (48%), Gaps = 36/274 (13%)

Query: 1   GDFVTETVTLGSASV-DNIAIGCGHNNEGLF-VGAAGLLGLGGGSLSFPSQINAS---TF 55
           G++  +T+TL  + V      GCG NNEG F  GA G+LGLG G LS  SQ  +     F
Sbjct: 203 GNYGCDTMTLEPSDVFQKFQFGCGRNNEGDFGSGADGMLGLGQGQLSTVSQTASKFKKVF 262

Query: 56  SYCLVDRDS-----------DSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISV 104
           SYCL + +S             +S+L+F S      V  P     E   +Y++ L  ISV
Sbjct: 263 SYCLPEENSIGSLLFGEKATSQSSSLKFTS-----LVNGPGTSGLEESGYYFVKLLDISV 317

Query: 105 GGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVA- 163
           G   L I  + F      + G I+DSGT +TRL    Y+AL+ AF +       ++G   
Sbjct: 318 GNKRLNIPSSVF-----ASPGTIIDSGTVITRLPQRAYSALKAAFKKAMAKYPLSNGRRK 372

Query: 164 ---LFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSS- 219
              + DTCY+ S R  V +P    HF +G  + L  K  +   D++   C AFA  S S 
Sbjct: 373 ENDMLDTCYNLSGRKDVLLPEXVLHFGDGADVRLNGKRVVWGNDAS-RLCLAFAGNSKST 431

Query: 220 ----LSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
               L+IIGN QQ    V +++R   IGF  N C
Sbjct: 432 MNPELTIIGNRQQVSLTVLYDIRGRRIGFGGNGC 465


>gi|326515172|dbj|BAK03499.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 494

 Score =  115 bits (288), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 89/257 (34%), Positives = 120/257 (46%), Gaps = 20/257 (7%)

Query: 1   GDFVTETVTLG-SASVDNIAIGCGHNNEGLFVGA-AGLLGLGGGSLSFPSQIN---ASTF 55
           G +VT+T+T+  +  V +   GC H   G F    AG+L LGGG  S   Q      + F
Sbjct: 250 GTYVTDTLTMSPTIVVKDFRFGCSHAVRGSFSNQNAGILALGGGRGSLLEQTADAYGNAF 309

Query: 56  SYCLVDRDSDSTSTL--EFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISE 113
           SYC+    S    +L    ++SL       PL++N    TFY + L  I V G  L +  
Sbjct: 310 SYCIPKPSSAGFLSLGGPVEASL--KFSYTPLIKNKHAPTFYIVHLEAIIVAGKQLAVPP 367

Query: 114 TAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDG-VALFDTCYDFS 172
           TAF        G ++DSG  VT+L  + Y ALR AF     A  P    V   DTCYDF+
Sbjct: 368 TAFAT------GAVMDSGAVVTQLPPQVYAALRAAFRSAMAAYGPLAAPVRNLDTCYDFT 421

Query: 173 SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTR 232
               V+VP VS  F  G  L L   + ++    +G   FA  P   S+  IGNVQQQ   
Sbjct: 422 RFPDVKVPKVSLVFAGGATLDLEPASIIL----DGCLAFAATPGEESVGFIGNVQQQTYE 477

Query: 233 VSFNLRNSLIGFTPNKC 249
           V +++    +GF    C
Sbjct: 478 VLYDVGGGKVGFRRGAC 494


>gi|242092898|ref|XP_002436939.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
 gi|241915162|gb|EER88306.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
          Length = 469

 Score =  115 bits (288), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 93/262 (35%), Positives = 129/262 (49%), Gaps = 30/262 (11%)

Query: 1   GDFVTETVTLGSA-SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ---INASTFS 56
           G +  ET+TL    +V++   GCG +  G      GLLGLGG  +S   Q   +    FS
Sbjct: 225 GVYSNETLTLAPGITVEDFHFGCGRDQRGPSDKYDGLLGLGGAPVSLVVQTSSVYGGAFS 284

Query: 57  YCLVDRDSDSTSTLEFDSSLPPNA-----VTAPLLRNHELDTFYYLGLTGISVGGDLLPI 111
           YCL   +S++   L   S  PP+      V  P+       TFY + +TGISVGG  L I
Sbjct: 285 YCLPALNSEA-GFLVLGS--PPSGNKSAFVFTPMRHLPGYATFYMVTMTGISVGGKPLHI 341

Query: 112 SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRA--LSPTDGVALFDTCY 169
            ++AF+      GG+I+DSGT  T L    YNAL  A  +  +A  L P+D    FDTCY
Sbjct: 342 PQSAFR------GGMIIDSGTVDTELPETAYNALEAALRKALKAYPLVPSDD---FDTCY 392

Query: 170 DFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT--SSSLSIIGNVQ 227
           +F+  S++ VP V+F F  G  + L   N ++  D     C AF  +     L IIGNV 
Sbjct: 393 NFTGYSNITVPRVAFTFSGGATIDLDVPNGILVND-----CLAFQESGPDDGLGIIGNVN 447

Query: 228 QQGTRVSFNLRNSLIGFTPNKC 249
           Q+   V ++     +GF    C
Sbjct: 448 QRTLEVLYDAGRGNVGFRAGAC 469


>gi|414877221|tpg|DAA54352.1| TPA: hypothetical protein ZEAMMB73_214154 [Zea mays]
          Length = 447

 Score =  115 bits (287), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 85/259 (32%), Positives = 128/259 (49%), Gaps = 15/259 (5%)

Query: 5   TETVTLGSA---SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLVD 61
           TET+T   A   SV  IA GCG +N GL   + G +GLG GSLS  +Q+    FSYCL D
Sbjct: 187 TETLTFPGAPGVSVGGIAFGCGVDNGGLSYNSTGTVGLGRGSLSLVAQLGVGKFSYCLTD 246

Query: 62  RDSDSTSTLEFDSSLPPNAV--------TAPLLRNHELDTFYYLGLTGISVGGDLLPISE 113
             + S  +     +L   A         + PL+++  + T+YY+ L GIS+G   LPI  
Sbjct: 247 FFNTSLGSPVLFGALAELAAPSTGAAVQSTPLVQSPYVPTWYYVSLEGISLGDARLPIPN 306

Query: 114 TAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSS 173
             F + + G+GG+IVDSGT  T L    +  + D  V G       +  +L   C+  ++
Sbjct: 307 GTFDLRDDGSGGMIVDSGTTFTFLVESAFRVVVD-HVAGVLRQPVVNASSLDSPCFPAAT 365

Query: 174 --RSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT-SSSLSIIGNVQQQG 230
             +    +P +  HF  G  + L   NY+       +FC   A + S+ +SI+GN QQQ 
Sbjct: 366 GEQQLPAMPDMVLHFAGGADMRLHRDNYMSFNQEESSFCLNIAGSPSADVSILGNFQQQN 425

Query: 231 TRVSFNLRNSLIGFTPNKC 249
            ++ F++    + F P  C
Sbjct: 426 IQMLFDITVGQLSFMPTDC 444


>gi|242089623|ref|XP_002440644.1| hypothetical protein SORBIDRAFT_09g004500 [Sorghum bicolor]
 gi|241945929|gb|EES19074.1| hypothetical protein SORBIDRAFT_09g004500 [Sorghum bicolor]
          Length = 469

 Score =  115 bits (287), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 87/247 (35%), Positives = 124/247 (50%), Gaps = 14/247 (5%)

Query: 5   TETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLVDRDS 64
           +ET TLG  +V  +  GC    EG +   AGL+GLG G LS  SQ++A TF YCL   D+
Sbjct: 199 SETFTLGGDAVPGVGFGCTTALEGDYGEGAGLVGLGRGPLSLVSQLDAGTFMYCLT-ADA 257

Query: 65  DSTSTLEFDSSLPPNAVTAPLLRNHEL--DTFYYLGLTGISVGGDLLPISETAFKIDESG 122
              S L F +        A +     L   TFY + L  I++G        +A      G
Sbjct: 258 SKASPLLFGALATMTGAGAGVQSTGLLASTTFYAVNLRSITIG--------SATTAGVGG 309

Query: 123 NGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPTV 182
            GG++ DSGT +T L    Y   + AF+  T +L+P +G   F+ CY+    S+  +P +
Sbjct: 310 PGGVVFDSGTTLTYLAEPAYTEAKAAFLSQTTSLTPVEGRYGFEACYE-KPDSARLIPAM 368

Query: 183 SFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNSLI 242
             HF  G  + LP  NY++ VD +G  C+     S SLSIIGN+ Q    V  ++R S++
Sbjct: 369 VLHFDGGADMALPVANYVVEVD-DGVVCWV-VQRSPSLSIIGNIMQMNYLVLHDVRKSVL 426

Query: 243 GFTPNKC 249
            F P  C
Sbjct: 427 SFQPANC 433


>gi|357143847|ref|XP_003573077.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 420

 Score =  114 bits (286), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 87/258 (33%), Positives = 127/258 (49%), Gaps = 16/258 (6%)

Query: 5   TETVTLGSAS----VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
           TET+TLG +S    V  +A GCG +N G  + + G +GLG G+LS  +Q+    FSYCL 
Sbjct: 163 TETLTLGPSSAPVSVGGVAFGCGTDNGGDSLNSTGTVGLGRGTLSLLAQLGVGKFSYCLT 222

Query: 61  D---RDSDSTSTLEFDSSLPPNAVTA---PLLRNHELDTFYYLGLTGISVGGDLLPISET 114
           D      DS   L   + L P   T    PLL++ +  + Y++ L GIS+G   LPI   
Sbjct: 223 DFFNSALDSPFLLGTLAELAPGPSTVQSTPLLQSPQNPSRYFVSLQGISLGDVRLPIPNG 282

Query: 115 AFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRAL--SPTDGVALFDTCYDFS 172
            F +   G GG+IVDSGT  T L    +   R+   R  R L   P +  +L   C+   
Sbjct: 283 TFDLRGDGTGGMIVDSGTTFTILAESGF---REVVGRVARVLGQPPVNASSLDAPCFPAP 339

Query: 173 SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS-SSLSIIGNVQQQGT 231
           +     +P +  HF  G  + L   NY+   + + +FC   A T+  S S++GN QQQ  
Sbjct: 340 AGEPPYMPDLVLHFAGGADMRLYRDNYMSYNEEDSSFCLNIAGTTPESTSVLGNFQQQNI 399

Query: 232 RVSFNLRNSLIGFTPNKC 249
           ++ F+     + F P  C
Sbjct: 400 QMLFDTTVGQLSFLPTDC 417


>gi|357124861|ref|XP_003564115.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 477

 Score =  114 bits (286), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 78/266 (29%), Positives = 119/266 (44%), Gaps = 31/266 (11%)

Query: 11  GSASVDNIAIGCGHNNEGLF-VGAAGLLGLGGGSLSFPSQINASTFSYCLVDRDSDSTST 69
           G  S   +  GCGH N+G+F     G+ G G G  S PSQ+  ++FSYC       ++S 
Sbjct: 207 GGVSERRLTFGCGHFNKGIFQANETGIAGFGRGRWSLPSQLGVTSFSYCFTSMFESTSSL 266

Query: 70  LEF-----DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNG 124
           +       +  L     + PLLR+    + Y+L L  I+VG   +PI E   ++ E+   
Sbjct: 267 VTLGVAPAELHLTGQVQSTPLLRDPSQPSLYFLSLKAITVGATRIPIPERRQRLREAS-- 324

Query: 125 GIIVDSGTAVTRLQTETYNALRDAFVRGT-RALSPTDGVALFDTCYDFSSRSS------- 176
             I+DSG ++T L  + Y A++  FV      +S  +G AL D C+   S ++       
Sbjct: 325 -AIIDSGASITTLPEDVYEAVKAEFVAQVGLPVSAVEGSAL-DLCFALPSAAAPKSAFGW 382

Query: 177 ----------VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS---SSLSII 223
                     V VP + FH   G    LP +NY+         C      +       +I
Sbjct: 383 RWRGRGRAMPVRVPRLVFHLGGGADWELPRENYVFEDYGARVMCLVLDAATGGGDQTVVI 442

Query: 224 GNVQQQGTRVSFNLRNSLIGFTPNKC 249
           GN QQQ T V ++L N ++ F P +C
Sbjct: 443 GNYQQQNTHVVYDLENDVLSFAPARC 468


>gi|242091327|ref|XP_002441496.1| hypothetical protein SORBIDRAFT_09g028060 [Sorghum bicolor]
 gi|241946781|gb|EES19926.1| hypothetical protein SORBIDRAFT_09g028060 [Sorghum bicolor]
          Length = 466

 Score =  114 bits (286), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 78/244 (31%), Positives = 121/244 (49%), Gaps = 11/244 (4%)

Query: 13  ASVDNIAIGCGHNNEGL-FVGAAGLLGLGGGSLSFPSQINAS---TFSYCLVDR--DSDS 66
           A +  + +GC    +G  F  + G+L LG  ++SF S+  A     FSYCLVD     ++
Sbjct: 225 AKLQGVVLGCAATYDGQSFQSSDGVLSLGNSNISFASRAAARFGGRFSYCLVDHLAPRNA 284

Query: 67  TSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGI 126
           TS L F       A   PLL +  +  FY + +  + V G+ L I    + +D   NGG 
Sbjct: 285 TSYLTFGPGATAPAAQTPLLLDRRMTPFYAVTVDAVYVAGEALDIPADVWDVDR--NGGA 342

Query: 127 IVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHF 186
           I+DSGT++T L T  Y A+  A  +    L P   +  F+ CY+++   ++E+P +  HF
Sbjct: 343 ILDSGTSLTILATPAYRAVVTALSKHLAGL-PRVTMDPFEYCYNWTDAGALEIPKMEVHF 401

Query: 187 PEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS-SSLSIIGNVQQQGTRVSFNLRNSLIGFT 245
                L  PAK+Y+I   + G  C      S   +S+IGN+ QQ     F+LR+  + F 
Sbjct: 402 AGSARLEPPAKSYVIDA-APGVKCIGVQEGSWPGVSVIGNILQQEHLWEFDLRDRWLRFK 460

Query: 246 PNKC 249
             +C
Sbjct: 461 HTRC 464


>gi|242044724|ref|XP_002460233.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
 gi|241923610|gb|EER96754.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
          Length = 512

 Score =  114 bits (285), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 87/261 (33%), Positives = 132/261 (50%), Gaps = 15/261 (5%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGL-FVGAAGLLGLGGGSLSFPSQIN---ASTFS 56
           G    + ++L    +D    GCG +N+G  F G +GL+GLG   LS  SQ        FS
Sbjct: 252 GVLAHDRLSLAGEVIDGFVFGCGTSNQGPPFGGTSGLMGLGRSQLSLVSQTMDQFGGVFS 311

Query: 57  YCLVDRDSDSTSTLEF--DSSLPPNA---VTAPLLRNHELDTFYYLGLTGISVGGDLLPI 111
           YCL  ++SDS+ +L    DSS+  N+   V A ++ +     FY++ LTGI+VGG  +  
Sbjct: 312 YCLPLKESDSSGSLVIGDDSSVYRNSTPIVYASMVSDPLQGPFYFVNLTGITVGGQEV-- 369

Query: 112 SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDF 171
            E++      G G  I+DSGT +T L    YNA++  F+          G ++ DTC++ 
Sbjct: 370 -ESSGFSSGGGGGKAIIDSGTVITSLVPSIYNAVKAEFLSQFAEYPQAPGFSILDTCFNM 428

Query: 172 SSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGT-FCFAFAPTSSSL--SIIGNVQQ 228
           +    V+VP++   F  G  + + +   L  V S+ +  C A AP  S    +IIGN QQ
Sbjct: 429 TGLREVQVPSLKLVFDGGVEVEVDSGGVLYFVSSDSSQVCLAMAPLKSEYETNIIGNYQQ 488

Query: 229 QGTRVSFNLRNSLIGFTPNKC 249
           +  RV F+   S +GF    C
Sbjct: 489 KNLRVIFDTSGSQVGFAQETC 509


>gi|222637181|gb|EEE67313.1| hypothetical protein OsJ_24553 [Oryza sativa Japonica Group]
          Length = 414

 Score =  114 bits (284), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 93/271 (34%), Positives = 132/271 (48%), Gaps = 23/271 (8%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
           G   TET+ +G AS   +A GC   N G+   ++G++GLG   LS  SQ+    FSYCL 
Sbjct: 142 GYLATETLHVGGASFPGVAFGCSTEN-GVGNSSSGIVGLGRSPLSLVSQVGVGRFSYCLR 200

Query: 61  DRDSDSTSTLEFDS--SLPPNAVTAPLLRNHEL--DTFYYLGLTGISVGGDLLPISETAF 116
                  S + F S   +     +  +L N E+   ++YY+ LTGI+VG   LP++ T F
Sbjct: 201 SDADAGDSPILFGSLAKVTGGKSSPAILENPEMPSSSYYYVNLTGITVGATDLPVTSTTF 260

Query: 117 KIDESGN----GGIIVDSGTAVTRLQTETYNALRDAFV--RGTRALSPT-DGVAL-FDTC 168
                      GG IVDSGT +T L  E Y  ++ AF+    T  L+ T +G    FD C
Sbjct: 261 GFTRGAGAGLVGGTIVDSGTTLTYLVKEGYAMVKRAFLSQMATANLTTTVNGTRFGFDLC 320

Query: 169 YDFSSR---SSVEVPTVSFHFPEGKVLPLPAKNY--LIPVDSNGTF---CFAFAPTSS-- 218
           +D ++    S V VPT+   F  G    +  ++Y  ++ VDS G     C    P S   
Sbjct: 321 FDANAAGGGSGVPVPTLVLRFAGGAEYAVRRRSYVGVVEVDSQGRAAVECLLVLPASEKL 380

Query: 219 SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           S+SIIGNV Q    V ++L   +  F P  C
Sbjct: 381 SISIIGNVMQMDLHVLYDLDGGMFSFAPADC 411


>gi|293333012|ref|NP_001168410.1| uncharacterized protein LOC100382179 precursor [Zea mays]
 gi|223948083|gb|ACN28125.1| unknown [Zea mays]
 gi|413953783|gb|AFW86432.1| hypothetical protein ZEAMMB73_737575 [Zea mays]
          Length = 466

 Score =  114 bits (284), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 85/250 (34%), Positives = 119/250 (47%), Gaps = 28/250 (11%)

Query: 12  SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSYCLVDRDSDSTS 68
           S +V N   GC H   G      GL+GLGG + S  SQ  A+    FSYCL    S +  
Sbjct: 233 SDAVKNFQFGCSHRANGFVGQLDGLMGLGGDTESLVSQTAATYGKAFSYCLPPSSSSAGG 292

Query: 69  TLEFDSSLPPNAVT----APLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNG 124
            L   ++    + +     PL+R   + TFY + L  I+V G  L +  + F      +G
Sbjct: 293 FLTLGAAAGGTSSSRYSRTPLVR-FNVPTFYGVFLQAITVAGTKLNVPASVF------SG 345

Query: 125 GIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVSF 184
             +VDSGT +T+L    Y ALR AF +  +A      V + DTC+DFS   +V VP V+ 
Sbjct: 346 ASVVDSGTVITQLPPTAYQALRTAFKKEMKAYPSAAPVGILDTCFDFSGIKTVRVPVVTL 405

Query: 185 HFPEGKVLPLPAKNYLIPVDSNGTF---CFAFAPTSS--SLSIIGNVQQQGTRVSFNLRN 239
            F  G V+ L         D +G F   C AF  T+      I+GNVQQ+   + F++  
Sbjct: 406 TFSRGAVMDL---------DVSGIFYAGCLAFTATAQDGDTGILGNVQQRTFEMLFDVGG 456

Query: 240 SLIGFTPNKC 249
           S +GF P  C
Sbjct: 457 STLGFRPGAC 466


>gi|297597434|ref|NP_001043968.2| Os01g0696800 [Oryza sativa Japonica Group]
 gi|255673588|dbj|BAF05882.2| Os01g0696800 [Oryza sativa Japonica Group]
          Length = 334

 Score =  113 bits (283), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 92/261 (35%), Positives = 126/261 (48%), Gaps = 16/261 (6%)

Query: 1   GDFVTETVTLG--SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYC 58
           G  +TET T G  +A+   IA GC   +EG F   +GL+GLG G LS  +Q+N   F Y 
Sbjct: 72  GILMTETFTFGDDAAAFPGIAFGCTLRSEGGFGTGSGLVGLGRGKLSLVTQLNVEAFGYR 131

Query: 59  LVDRDSDSTSTLEFDSSLPPNA------VTAPLLRNHELD--TFYYLGLTGISVGGDLLP 110
           L   D  + S + F S            ++ PLL N  +    FYY+GLTGISVGG L+ 
Sbjct: 132 L-SSDLSAPSPISFGSLADVTGGNGDSFMSTPLLTNPVVQDLPFYYVGLTGISVGGKLVQ 190

Query: 111 ISETAFKIDES-GNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCY 169
           I    F  D S G GG+I DSGT +T L    Y  +RD  +       P       D   
Sbjct: 191 IPSGTFSFDRSTGAGGVIFDSGTTLTMLPDPAYTLVRDELLSQMGFQKPPPAANDDDLIC 250

Query: 170 DFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVD-SNG--TFCFAFAPTSSSLSIIGNV 226
                S+   P++  HF  G  + L  +NYL  +   NG    C++   +S +L+IIGN+
Sbjct: 251 FTGGSSTTTFPSMVLHFDGGADMDLSTENYLPQMQGQNGETARCWSVVKSSQALTIIGNI 310

Query: 227 QQQGTRVSFNLR-NSLIGFTP 246
            Q    V F+L  N+ + F P
Sbjct: 311 MQMDFHVVFDLSGNARMLFQP 331


>gi|242079451|ref|XP_002444494.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
 gi|241940844|gb|EES13989.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
          Length = 445

 Score =  113 bits (283), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 80/268 (29%), Positives = 129/268 (48%), Gaps = 21/268 (7%)

Query: 1   GDFVTETVTLGSASVDNIAI--GCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYC 58
           G+  +ET T G     ++++  GCG    G   GA+G+LG+    LS  SQ+    FSYC
Sbjct: 177 GELASETFTFGEHRRVSVSLDFGCGKLTSGSLPGASGILGISPDRLSLVSQLQIPRFSYC 236

Query: 59  L---VDRDSDSTSTLEFDSSLPPNAVTAPLLRNHEL------DTFYYLGLTGISVGGDLL 109
           L   +DR++ S       + L     T P+     +      + +YY+ L GISVG   L
Sbjct: 237 LTPFLDRNTTSHIFFGAMADLSKYRTTGPIQTTSLVTNPDGSNYYYYVPLIGISVGTKRL 296

Query: 110 PISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTR--ALSPTDGVALFDT 167
            +  ++F I   G+GG  VDSG     L +    AL++A V   +   ++ TD    ++ 
Sbjct: 297 NVPVSSFAIGRDGSGGTFVDSGDTTGMLPSVVMEALKEAMVEAVKLPVVNATDHGYEYEL 356

Query: 168 CYDF------SSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLS 221
           C+        +  ++V+VP + +HF  G  + L   +Y++ V S G  C   + + +  +
Sbjct: 357 CFQLPRNGGGAVETAVQVPPLVYHFDGGAAMLLRRDSYMVEV-SAGRMCLVIS-SGARGA 414

Query: 222 IIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           IIGN QQQ   V F++ N    F P +C
Sbjct: 415 IIGNYQQQNMHVLFDVENHEFSFAPTQC 442


>gi|449515031|ref|XP_004164553.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 430

 Score =  113 bits (283), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 80/260 (30%), Positives = 131/260 (50%), Gaps = 21/260 (8%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAST-----F 55
           GD   E +T+GS+SV ++ IGCGH + G F  A+G++GLGGG LS  SQ++ ++     F
Sbjct: 180 GDLGFEKITIGSSSVKSV-IGCGHASSGGFGFASGVIGLGGGQLSLVSQMSQTSGISRRF 238

Query: 56  SYCLVDRDSDSTSTLEFDSSLP---PNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPIS 112
           SYCL    S +   + F  +     P  V+ PL+  + + T+YY+ L  IS+G +     
Sbjct: 239 SYCLPTLLSHANGKINFGENAVVSGPGVVSTPLISKNTV-TYYYITLEAISIGNE----R 293

Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYD-- 170
             AF    +  G +I+DSGT +T L  E Y+ +  + ++  +A    D     D C+D  
Sbjct: 294 HMAF----AKQGNVIIDSGTTLTILPKELYDGVVSSLLKVVKAKRVKDPHGSLDLCFDDG 349

Query: 171 FSSRSSVEVPTVSFHFPEG-KVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQ 229
            ++ +S+ +P ++ HF  G  V  LP   +    D+        A  ++   IIGN+ Q 
Sbjct: 350 INAAASLGIPVITAHFSGGANVNLLPINTFRKVADNVNCLTLKAASPTTEFGIIGNLAQA 409

Query: 230 GTRVSFNLRNSLIGFTPNKC 249
              + ++L    + F P  C
Sbjct: 410 NFLIGYDLEAKRLSFKPTVC 429


>gi|357143854|ref|XP_003573079.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 417

 Score =  113 bits (283), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 81/257 (31%), Positives = 127/257 (49%), Gaps = 14/257 (5%)

Query: 5   TETVTLGSA------SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYC 58
           TET+T+GS+      SV ++A GCG +N G  + + G +GLG G+LS  +Q+    FSYC
Sbjct: 160 TETLTIGSSVPGQTVSVGSVAFGCGTDNGGDSLNSTGTVGLGRGTLSLLAQLGVGKFSYC 219

Query: 59  LVDRDSDSTSTLEFDSSL------PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPIS 112
           L D  + +  +  F  +L      P    + PLL++    + Y++ L GIS+G   LPI 
Sbjct: 220 LTDFFNSTMDSPFFLGTLAELAPGPGTVQSTPLLQSPLNPSRYFVNLQGISLGDVRLPIP 279

Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFS 172
              F +   GNGG++VDSGT  T L    +  + D  V       P +  +L   C+  S
Sbjct: 280 NGTFDLRADGNGGMMVDSGTTFTILAKSGFREVVDR-VAQLLGQPPVNASSLDSPCFP-S 337

Query: 173 SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTR 232
                 +P +  HF  G  + L   NY+   + + +FC     + S+ S +GN QQQ  +
Sbjct: 338 PDGEPFMPDLVLHFAGGADMRLHRDNYMSYNEDDSSFCLNIVGSPSTWSRLGNFQQQNIQ 397

Query: 233 VSFNLRNSLIGFTPNKC 249
           + F++    + F P  C
Sbjct: 398 MLFDMTVGQLSFLPTDC 414


>gi|125558622|gb|EAZ04158.1| hypothetical protein OsI_26300 [Oryza sativa Indica Group]
          Length = 435

 Score =  113 bits (282), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 80/260 (30%), Positives = 126/260 (48%), Gaps = 12/260 (4%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
           G   TET+ +G AS  ++A GC   N G+    +G+ GLG G+LS   Q+    FSYCL 
Sbjct: 174 GYLATETLKVGDASFPSVAFGCSTEN-GVGNSTSGIAGLGRGALSLIPQLGVGRFSYCLR 232

Query: 61  DRDSDSTSTLEFDSSL---PPNAVTAPLLRNHELD-TFYYLGLTGISVGGDLLPISETAF 116
              +   S + F S       N  + P + N  +  ++YY+ LTGI+VG   LP++ + F
Sbjct: 233 SGSAAGASPILFGSLANLTDGNVQSTPFVNNPAVHPSYYYVNLTGITVGETDLPVTTSTF 292

Query: 117 KIDESG-NGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFS-SR 174
              ++G  GG IVDSGT +T L  + Y  ++ AF+  T  ++  +G    D C+  +   
Sbjct: 293 GFTQNGLGGGTIVDSGTTLTYLAKDGYEMVKQAFLSQTANVTTVNGTRGLDLCFKSTGGG 352

Query: 175 SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTF---CFAFAPTS--SSLSIIGNVQQQ 229
             + VP++   F  G    +P     +  DS G+    C    P      +S+IGNV Q 
Sbjct: 353 GGIAVPSLVLRFDGGAEYAVPTYFAGVETDSQGSVTVACLMMLPAKGDQPMSVIGNVMQM 412

Query: 230 GTRVSFNLRNSLIGFTPNKC 249
              + ++L   +  F+P  C
Sbjct: 413 DMHLLYDLDGGIFSFSPADC 432


>gi|125555056|gb|EAZ00662.1| hypothetical protein OsI_22683 [Oryza sativa Indica Group]
          Length = 491

 Score =  113 bits (282), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 92/247 (37%), Positives = 122/247 (49%), Gaps = 16/247 (6%)

Query: 12  SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSYCLVDRDSDSTS 68
           S ++     GCG  N G F    GLLGLG G LS PSQ  AS    FSYCL   +S +T 
Sbjct: 252 SRALAGFPFGCGTRNLGDFGRVDGLLGLGRGELSLPSQAAASFGAVFSYCLPSSNS-TTG 310

Query: 69  TLEFDSSLPPN---AVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGG 125
            L   ++   +   A    +LR  +  +FY++ L  I +GG +LP+    F       GG
Sbjct: 311 YLTIGATPATDTGAAQYTAMLRKPQFPSFYFVELVSIDIGGYILPVPPAVFT-----RGG 365

Query: 126 IIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFH 185
            ++DSGT +T L  + Y  LRD F       +P     + D CYDF+  S V VP VSF 
Sbjct: 366 TLLDSGTVLTYLPAQAYELLRDRFRLTMERYTPAPPNDVLDACYDFAGESEVIVPAVSFR 425

Query: 186 FPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS---SLSIIGNVQQQGTRVSFNLRNSLI 242
           F +G V  L     +I +D N   C AFA   +    LSIIGN QQ+   V +++    I
Sbjct: 426 FGDGAVFELDFFGVMIFLDEN-VGCLAFAAMDAGGLPLSIIGNTQQRSAEVIYDVAAEKI 484

Query: 243 GFTPNKC 249
           GF P  C
Sbjct: 485 GFVPASC 491


>gi|125571687|gb|EAZ13202.1| hypothetical protein OsJ_03122 [Oryza sativa Japonica Group]
          Length = 453

 Score =  113 bits (282), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 92/261 (35%), Positives = 126/261 (48%), Gaps = 16/261 (6%)

Query: 1   GDFVTETVTLG--SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYC 58
           G  +TET T G  +A+   IA GC   +EG F   +GL+GLG G LS  +Q+N   F Y 
Sbjct: 191 GILMTETFTFGDDAAAFPGIAFGCTLRSEGGFGTGSGLVGLGRGKLSLVTQLNVEAFGYR 250

Query: 59  LVDRDSDSTSTLEFDSSLPPNA------VTAPLLRNHELD--TFYYLGLTGISVGGDLLP 110
           L   D  + S + F S            ++ PLL N  +    FYY+GLTGISVGG L+ 
Sbjct: 251 L-SSDLSAPSPISFGSLADVTGGNGDSFMSTPLLTNPVVQDLPFYYVGLTGISVGGKLVQ 309

Query: 111 ISETAFKIDES-GNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCY 169
           I    F  D S G GG+I DSGT +T L    Y  +RD  +       P       D   
Sbjct: 310 IPSGTFSFDRSTGAGGVIFDSGTTLTMLPDPAYTLVRDELLSQMGFQKPPPAANDDDLIC 369

Query: 170 DFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVD-SNG--TFCFAFAPTSSSLSIIGNV 226
                S+   P++  HF  G  + L  +NYL  +   NG    C++   +S +L+IIGN+
Sbjct: 370 FTGGSSTTTFPSMVLHFDGGADMDLSTENYLPQMQGQNGETARCWSVVKSSQALTIIGNI 429

Query: 227 QQQGTRVSFNLR-NSLIGFTP 246
            Q    V F+L  N+ + F P
Sbjct: 430 MQMDFHVVFDLSGNARMLFQP 450


>gi|125527370|gb|EAY75484.1| hypothetical protein OsI_03384 [Oryza sativa Indica Group]
          Length = 453

 Score =  113 bits (282), Expect = 9e-23,   Method: Compositional matrix adjust.
 Identities = 92/261 (35%), Positives = 126/261 (48%), Gaps = 16/261 (6%)

Query: 1   GDFVTETVTLG--SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYC 58
           G  +TET T G  +A+   IA GC   +EG F   +GL+GLG G LS  +Q+N   F Y 
Sbjct: 191 GILMTETFTFGDDAAAFPGIAFGCTLRSEGGFGTGSGLVGLGRGKLSLVTQLNVEAFGYR 250

Query: 59  LVDRDSDSTSTLEFDSSLPPNA------VTAPLLRNHELD--TFYYLGLTGISVGGDLLP 110
           L   D  + S + F S            ++ PLL N  +    FYY+GLTGISVGG L+ 
Sbjct: 251 L-SSDLSAPSPISFGSLADVTGGNGDSFMSTPLLTNPVVQDLPFYYVGLTGISVGGKLVQ 309

Query: 111 ISETAFKIDES-GNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCY 169
           I    F  D S G GG+I DSGT +T L    Y  +RD  +       P       D   
Sbjct: 310 IPSGTFSFDRSTGAGGVIFDSGTTLTMLPDPAYTLVRDELLSQMGFQKPPPAANDDDLIC 369

Query: 170 DFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVD-SNG--TFCFAFAPTSSSLSIIGNV 226
                S+   P++  HF  G  + L  +NYL  +   NG    C++   +S +L+IIGN+
Sbjct: 370 FTGGSSTTTFPSMVLHFDGGADMDLSTENYLPQMQGQNGETARCWSVVKSSQALTIIGNI 429

Query: 227 QQQGTRVSFNLR-NSLIGFTP 246
            Q    V F+L  N+ + F P
Sbjct: 430 MQMDFHVVFDLSGNARMLFQP 450


>gi|77555282|gb|ABA98078.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
          Length = 409

 Score =  112 bits (281), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 81/247 (32%), Positives = 125/247 (50%), Gaps = 12/247 (4%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
           G   T+T T G+ +V  +  GC   + G F GA+G++G+G G+LS  SQ+    FSY L+
Sbjct: 131 GYLATDTFTFGATAVPGVVFGCSDASYGDFAGASGVIGIGRGNLSLISQLQFGKFSYQLL 190

Query: 61  ----DRDSDSTSTLEF-DSSLP--PNAVTAPLLRNHELDTFYYLGLTGISVGGDLL-PIS 112
                 D  + S + F D ++P      + PLL +     FYY+ LTG+ V G+ L  I 
Sbjct: 191 APEATDDGSADSVIRFGDDAVPKTKRGRSTPLLSSTLYPDFYYVNLTGVRVDGNRLDAIP 250

Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVAL--FDTCYD 170
              F +  +G GG+I+ S T VT L+   Y+ +R A V     L   +G A    D CY+
Sbjct: 251 AGTFDLRANGTGGVILSSTTPVTYLEQAAYDVVRAA-VASRIGLPAVNGSAALELDLCYN 309

Query: 171 FSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQG 230
            SS + V+VP ++  F  G  + L A NY    +  G  C    P+    S++G + Q G
Sbjct: 310 ASSMAKVKVPKLTLVFDGGADMDLSAANYFYIDNDTGLECLTMLPSQGG-SVLGTLLQTG 368

Query: 231 TRVSFNL 237
           T + +++
Sbjct: 369 TNMIYDV 375


>gi|56784900|dbj|BAD82194.1| aspartic proteinase nepenthesin I-like [Oryza sativa Japonica
           Group]
          Length = 260

 Score =  112 bits (281), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 91/258 (35%), Positives = 125/258 (48%), Gaps = 16/258 (6%)

Query: 4   VTETVTLG--SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLVD 61
           +TET T G  +A+   IA GC   +EG F   +GL+GLG G LS  +Q+N   F Y L  
Sbjct: 1   MTETFTFGDDAAAFPGIAFGCTLRSEGGFGTGSGLVGLGRGKLSLVTQLNVEAFGYRL-S 59

Query: 62  RDSDSTSTLEFDSSLPPNA------VTAPLLRNHELDT--FYYLGLTGISVGGDLLPISE 113
            D  + S + F S            ++ PLL N  +    FYY+GLTGISVGG L+ I  
Sbjct: 60  SDLSAPSPISFGSLADVTGGNGDSFMSTPLLTNPVVQDLPFYYVGLTGISVGGKLVQIPS 119

Query: 114 TAFKIDES-GNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFS 172
             F  D S G GG+I DSGT +T L    Y  +RD  +       P       D      
Sbjct: 120 GTFSFDRSTGAGGVIFDSGTTLTMLPDPAYTLVRDELLSQMGFQKPPPAANDDDLICFTG 179

Query: 173 SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVD-SNG--TFCFAFAPTSSSLSIIGNVQQQ 229
             S+   P++  HF  G  + L  +NYL  +   NG    C++   +S +L+IIGN+ Q 
Sbjct: 180 GSSTTTFPSMVLHFDGGADMDLSTENYLPQMQGQNGETARCWSVVKSSQALTIIGNIMQM 239

Query: 230 GTRVSFNLR-NSLIGFTP 246
              V F+L  N+ + F P
Sbjct: 240 DFHVVFDLSGNARMLFQP 257


>gi|115472515|ref|NP_001059856.1| Os07g0532800 [Oryza sativa Japonica Group]
 gi|50508274|dbj|BAD32123.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
 gi|113611392|dbj|BAF21770.1| Os07g0532800 [Oryza sativa Japonica Group]
          Length = 436

 Score =  112 bits (281), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 80/261 (30%), Positives = 124/261 (47%), Gaps = 13/261 (4%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
           G   TET+ +G AS  ++A GC   N G+    +G+ GLG G+LS   Q+    FSYCL 
Sbjct: 174 GYLATETLKVGDASFPSVAFGCSTEN-GVGNSTSGIAGLGRGALSLIPQLGVGRFSYCLR 232

Query: 61  DRDSDSTSTLEFDSSL---PPNAVTAPLLRNHELD-TFYYLGLTGISVGGDLLPISETAF 116
              +   S + F S       N  + P + N  +  ++YY+ LTGI+VG   LP++ + F
Sbjct: 233 SGSAAGASPILFGSLANLTDGNVQSTPFVNNPAVHPSYYYVNLTGITVGETDLPVTTSTF 292

Query: 117 KIDESG-NGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYD--FSS 173
              ++G  GG IVDSGT +T L  + Y  ++ AF+  T  ++  +G    D C+      
Sbjct: 293 GFTQNGLGGGTIVDSGTTLTYLAKDGYEMVKQAFLSQTADVTTVNGTRGLDLCFKSTGGG 352

Query: 174 RSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTF---CFAFAPTS--SSLSIIGNVQQ 228
              + VP++   F  G    +P     +  DS G+    C    P      +S+IGNV Q
Sbjct: 353 GGGIAVPSLVLRFDGGAEYAVPTYFAGVETDSQGSVTVACLMMLPAKGDQPMSVIGNVMQ 412

Query: 229 QGTRVSFNLRNSLIGFTPNKC 249
               + ++L   +  F P  C
Sbjct: 413 MDMHLLYDLDGGIFSFAPADC 433


>gi|54290717|dbj|BAD62387.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
           sativa Japonica Group]
 gi|215734915|dbj|BAG95637.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 469

 Score =  112 bits (281), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 90/253 (35%), Positives = 123/253 (48%), Gaps = 13/253 (5%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
           G    +TV+ GS S      GCG +NEGLF  +AGL+GL    LS   Q+  S    FSY
Sbjct: 225 GYLSKDTVSFGSGSFPGFYYGCGQDNEGLFGRSAGLIGLAKNKLSLLYQLAPSLGYAFSY 284

Query: 58  CLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFK 117
           CL    S +   L   S  P      P+  +    + Y++ L+GISV G  L +  + ++
Sbjct: 285 CL-PTSSAAAGYLSIGSYNPGQYSYTPMASSSLDASLYFVTLSGISVAGAPLAVPPSEYR 343

Query: 118 IDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGV-ALFDTCYDFSSRSS 176
              +     I+DSGT +TRL    Y AL  A      + +P     ++ DTC+   S + 
Sbjct: 344 SLPT-----IIDSGTVITRLPPNVYTALSRAVAAAMASAAPRAPTYSILDTCFR-GSAAG 397

Query: 177 VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFN 236
           + VP V   F  G  L L   N LI VD + T C AFAPT  + +IIGN QQQ   V ++
Sbjct: 398 LRVPRVDMAFAGGATLALSPGNVLIDVD-DSTTCLAFAPTGGT-AIIGNTQQQTFSVVYD 455

Query: 237 LRNSLIGFTPNKC 249
           +  S IGF    C
Sbjct: 456 VAQSRIGFAAGGC 468


>gi|125600538|gb|EAZ40114.1| hypothetical protein OsJ_24557 [Oryza sativa Japonica Group]
          Length = 412

 Score =  112 bits (281), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 81/251 (32%), Positives = 119/251 (47%), Gaps = 27/251 (10%)

Query: 1   GDFVTETVTLGS-ASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
           G   TET TLGS  +V  +A GCG  N G    ++GL+G+G G LS  SQ+         
Sbjct: 184 GVLATETFTLGSDTAVRGVAFGCGTENLGSTDNSSGLVGMGRGPLSLVSQLG-------- 235

Query: 60  VDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKID 119
           V R   S               T+PL               GI+VG  LLPI    F++ 
Sbjct: 236 VTRPRRSCRARAAARGGGAPTTTSPL--------------EGITVGDTLLPIDPAVFRLT 281

Query: 120 ESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVAL-FDTCYDFSSRSSVE 178
             G+GG+I+DSGT  T L+   + AL  A     R L    G  L    C+  +S  +VE
Sbjct: 282 PMGDGGVIIDSGTTFTALEERAFVALARALASRVR-LPLASGAHLGLSLCFAAASPEAVE 340

Query: 179 VPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLR 238
           VP +  HF +G  + L  ++Y++   S G  C     ++  +S++G++QQQ T + ++L 
Sbjct: 341 VPRLVLHF-DGADMELRRESYVVEDRSAGVACLGMV-SARGMSVLGSMQQQNTHILYDLE 398

Query: 239 NSLIGFTPNKC 249
             ++ F P KC
Sbjct: 399 RGILSFEPAKC 409


>gi|125536523|gb|EAY83011.1| hypothetical protein OsI_38231 [Oryza sativa Indica Group]
          Length = 469

 Score =  112 bits (281), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 82/254 (32%), Positives = 127/254 (50%), Gaps = 12/254 (4%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
           G   T+T T G+ +V  +  GC   + G F GA+G++G+G G+LS  SQ+    FSY L+
Sbjct: 191 GYLATDTFTFGATAVPGVVFGCSDASYGDFAGASGVIGIGRGNLSLISQLQFGKFSYQLL 250

Query: 61  ----DRDSDSTSTLEF-DSSLP--PNAVTAPLLRNHELDTFYYLGLTGISVGGDLL-PIS 112
                 D  + S + F D ++P      + PLL +     FYY+ LTG+ V G+ L  I 
Sbjct: 251 APEATDDGSADSVIRFGDDAVPKTKRGQSTPLLSSTLYPDFYYVNLTGVRVDGNRLDAIP 310

Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVAL--FDTCYD 170
              F +  +G GG+I+ S T VT L+   Y+ +R A V     L   +G A    D CY+
Sbjct: 311 AGTFDLRANGTGGVILSSTTPVTYLEQAAYDVVRAA-VASRIGLPAVNGSAALELDLCYN 369

Query: 171 FSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQG 230
            SS + V+VP ++  F  G  + L A NY    +  G  C    P+    S++G + Q G
Sbjct: 370 ASSMAKVKVPKLTLVFDGGADMDLSAANYFYIDNDTGLECLTMLPSQGG-SVLGTLLQTG 428

Query: 231 TRVSFNLRNSLIGF 244
           T + +++    + F
Sbjct: 429 TNMIYDVDAGRLTF 442


>gi|242058093|ref|XP_002458192.1| hypothetical protein SORBIDRAFT_03g028480 [Sorghum bicolor]
 gi|241930167|gb|EES03312.1| hypothetical protein SORBIDRAFT_03g028480 [Sorghum bicolor]
          Length = 468

 Score =  112 bits (280), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 88/262 (33%), Positives = 122/262 (46%), Gaps = 27/262 (10%)

Query: 1   GDFVTETVTLG-SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ---INASTFS 56
           G +  ET+ L    +V +   GCGH+ +G      GLLGLGG   S   Q   +    FS
Sbjct: 221 GVYSNETLALAPGVAVKDFRFGCGHDQDGANDKYDGLLGLGGAPESLVVQTASVYGGAFS 280

Query: 57  YCLVDRDSDSTSTLEFDSSLPPNAVT-------APLLRNHELDTFYYLGLTGISVGGDLL 109
           YCL   ++            P   V         P++R  E  TFY + +TGI+VGG+ +
Sbjct: 281 YCLPALNNQVGFLALGGGGAPSGGVVNTSGFVFTPMIREEE--TFYVVNMTGITVGGEPI 338

Query: 110 PISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCY 169
            +  +AF      +GG+I+DSGT VT LQ   YNAL+ AF R   A  P       DTCY
Sbjct: 339 DVPPSAF------SGGMIIDSGTVVTELQHTAYNALQAAF-RKAMAAYPLVRNGELDTCY 391

Query: 170 DFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT--SSSLSIIGNVQ 227
           DFS  S+V +P V+  F  G  + L   N ++  D     C AF  +       I+GNV 
Sbjct: 392 DFSGYSNVTLPKVALTFSGGATIDLDVPNGILLDD-----CLAFQESGPDDQPGILGNVN 446

Query: 228 QQGTRVSFNLRNSLIGFTPNKC 249
           Q+   V ++     +GF    C
Sbjct: 447 QRTLEVLYDAGRGRVGFRAAVC 468


>gi|125553832|gb|EAY99437.1| hypothetical protein OsI_21406 [Oryza sativa Indica Group]
          Length = 409

 Score =  112 bits (280), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 88/266 (33%), Positives = 123/266 (46%), Gaps = 33/266 (12%)

Query: 1   GDFVTETVTLGSASV-DNIAIGCGHNNEG--LFVGAAGLLGLGGGSLSFPSQINAS---T 54
           G + ++ +TLG   V      GC H ++G       AG L LGGGS SF  Q  +     
Sbjct: 160 GTYSSDDLTLGPYDVVRGFLFGCAHADQGSTFSYDVAGTLALGGGSQSFVQQTASQYSRV 219

Query: 55  FSYCLVDRDSDSTSTLEF--------DSSLPPNAVTAPLLRNHELD-TFYYLGLTGISVG 105
           FSYC+      STS+  F         ++L P  V+ PLL +  +  TFY + L  I V 
Sbjct: 220 FSYCV----PPSTSSFGFIMFGVPPQRAALVPTFVSTPLLSSSTMSPTFYRVLLRSIIVA 275

Query: 106 GDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALF 165
           G  LP+  T F          ++DS T ++R+    Y ALR AF        P   V++ 
Sbjct: 276 GRPLPVPPTVFSASS------VIDSATVISRIPPTAYQALRAAFRSAMTMYRPAPPVSIL 329

Query: 166 DTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSL--SII 223
           DTCYDFS   S+ +P+++  F  G  + L A   L+        C AFAPT+S      I
Sbjct: 330 DTCYDFSGVRSITLPSIALVFDGGATVNLDAAGILL------QGCLAFAPTASDRMPGFI 383

Query: 224 GNVQQQGTRVSFNLRNSLIGFTPNKC 249
           GNVQQ+   V +++    I F    C
Sbjct: 384 GNVQQRTLEVVYDVPGKAIRFRSAAC 409


>gi|357514995|ref|XP_003627786.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355521808|gb|AET02262.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 436

 Score =  112 bits (280), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 89/267 (33%), Positives = 130/267 (48%), Gaps = 26/267 (9%)

Query: 1   GDFVTETVTLGS-----ASVDNIAIGCGHNNEGLFVGAA-GLLGLGGGSLSFPSQINAST 54
           GD   +T+TL S      S  NI IGCG NN   + GA+ G++G G G  SF +Q+ +ST
Sbjct: 175 GDLSVDTLTLESTNGLTVSFPNIVIGCGTNNILSYEGASSGIVGFGSGPASFITQLGSST 234

Query: 55  ---FSYCLV------DRDSDSTSTLEFDSSLPPN---AVTAPLLRNHELDTFYYLGLTGI 102
              FSYCL       +  S++TS L F  +   +    VT P+L+  + +TFYYL L   
Sbjct: 235 GGKFSYCLTPLFSVTNIQSNATSKLNFGDAATVSGDGVVTTPILKK-DPETFYYLTLEAF 293

Query: 103 SVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGV 162
           SVG   + I       +E   G II+DSGT +T L  + Y+ L  A V   +     D  
Sbjct: 294 SVGNRRVEIGGVPNGDNE---GNIIIDSGTTLTSLTKDDYSFLESAVVDLVKLERVDDPT 350

Query: 163 ALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSI 222
              + CY   +    + P ++ HF    V   P   ++   D  G FC AF  +S   +I
Sbjct: 351 QTLNLCYSVKAE-GYDFPIITMHFKGADVDLHPISTFVSVAD--GVFCLAFE-SSQDHAI 406

Query: 223 IGNVQQQGTRVSFNLRNSLIGFTPNKC 249
            GN+ QQ   V ++L+  ++ F P+ C
Sbjct: 407 FGNLAQQNLMVGYDLQQKIVSFKPSDC 433


>gi|242117573|dbj|BAH80056.1| hypothetical protein [Oryza sativa Indica Group]
          Length = 469

 Score =  112 bits (280), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 82/254 (32%), Positives = 127/254 (50%), Gaps = 12/254 (4%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
           G   T+T T G+ +V  +  GC   + G F GA+G++G+G G+LS  SQ+    FSY L+
Sbjct: 191 GYLATDTFTFGATAVPGVVFGCSDASYGDFAGASGVIGIGRGNLSLISQLQFGKFSYQLL 250

Query: 61  ----DRDSDSTSTLEF-DSSLP--PNAVTAPLLRNHELDTFYYLGLTGISVGGDLL-PIS 112
                 D  + S + F D ++P      + PLL +     FYY+ LTG+ V G+ L  I 
Sbjct: 251 APEATDDGSADSVIRFGDDAVPKTKRGRSTPLLSSTLYPDFYYVNLTGVRVDGNRLDAIP 310

Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVAL--FDTCYD 170
              F +  +G GG+I+ S T VT L+   Y+ +R A V     L   +G A    D CY+
Sbjct: 311 AGTFDLRANGTGGVILSSTTPVTYLEQAAYDVVRAA-VASRIGLPAVNGSAALELDLCYN 369

Query: 171 FSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQG 230
            SS + V+VP ++  F  G  + L A NY    +  G  C    P+    S++G + Q G
Sbjct: 370 ASSMAKVKVPKLTLVFDGGADMDLSAANYFYIDNDTGLECLTMLPSQGG-SVLGTLLQTG 428

Query: 231 TRVSFNLRNSLIGF 244
           T + +++    + F
Sbjct: 429 TNMIYDVDAGRLTF 442


>gi|115467014|ref|NP_001057106.1| Os06g0209100 [Oryza sativa Japonica Group]
 gi|51091210|dbj|BAD35903.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
 gi|113595146|dbj|BAF19020.1| Os06g0209100 [Oryza sativa Japonica Group]
 gi|125554496|gb|EAZ00102.1| hypothetical protein OsI_22105 [Oryza sativa Indica Group]
          Length = 454

 Score =  112 bits (280), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 74/255 (29%), Positives = 115/255 (45%), Gaps = 22/255 (8%)

Query: 11  GSASVDNIAIGCGHNNEGLF-VGAAGLLGLGGGSLSFPSQINASTFSYC---LVDRDSDS 66
           G  +   +  GCGH N+G+F     G+ G G G  S PSQ+N ++FSYC   + D  S S
Sbjct: 199 GGLAARRVTFGCGHINKGIFQANETGIAGFGRGRWSLPSQLNVTSFSYCFTSMFDTKSSS 258

Query: 67  TSTL---------EFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFK 117
             TL            ++   +  T  L++N    + Y++ L GISVGG  + + E+  +
Sbjct: 259 VVTLGAAAAELLHTHHAAHTGDVRTTRLIKNPSQPSLYFVPLRGISVGGARVAVPESRLR 318

Query: 118 IDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSV 177
                    I+DSG ++T L  + Y A++  FV      +   G A  D C+     +  
Sbjct: 319 ------SSTIIDSGASITTLPEDVYEAVKAEFVSQVGLPAAAAGSAALDLCFALPVAALW 372

Query: 178 E---VPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVS 234
               VP ++ H   G    LP  NY+    +    C      +    +IGN QQQ T V 
Sbjct: 373 RRPAVPALTLHLDGGADWELPRGNYVFEDYAARVLCVVLDAAAGEQVVIGNYQQQNTHVV 432

Query: 235 FNLRNSLIGFTPNKC 249
           ++L N ++ F P +C
Sbjct: 433 YDLENDVLSFAPARC 447


>gi|326516344|dbj|BAJ92327.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 464

 Score =  112 bits (280), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 85/260 (32%), Positives = 122/260 (46%), Gaps = 15/260 (5%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN---ASTFSY 57
           G    + ++L    +     GCG +N+G F G +GL+GLG   LS  SQ        FSY
Sbjct: 206 GVLAHDRLSLAGEDIQGFVFGCGTSNQGPFGGTSGLMGLGRSQLSLISQTMDQFGGVFSY 265

Query: 58  CLVDRDSDSTSTLEF--DSSLPPNA---VTAPLLRNHELDTFYYLGLTGISVGGDLLPIS 112
           CL  ++S S+ +L    D+S+  N+   V   ++ +     FY   LTGI+VGG+   + 
Sbjct: 266 CLPPKESGSSGSLVLGDDASVYRNSTPIVYTAMVSDPLQGPFYLANLTGITVGGE--DVQ 323

Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFS 172
              F     G G  IVDSGT +T L    Y A+R  FV            ++ DTC+D +
Sbjct: 324 SPGFS--AGGGGKAIVDSGTIITSLVPSVYAAVRAEFVSQLAEYPQAAPFSILDTCFDLT 381

Query: 173 SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGT-FCFAFAPTSSSLS--IIGNVQQQ 229
               V+VP++   F  G  + + +K  L  V  + +  C A A   S     IIGN QQ+
Sbjct: 382 GLREVQVPSLKLVFDGGAEVEVDSKGVLYVVTGDASQVCLALASLKSEYDTPIIGNYQQK 441

Query: 230 GTRVSFNLRNSLIGFTPNKC 249
             RV F+   S IGF    C
Sbjct: 442 NLRVIFDTVGSQIGFAQETC 461


>gi|242095588|ref|XP_002438284.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
 gi|241916507|gb|EER89651.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
          Length = 487

 Score =  112 bits (280), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 93/280 (33%), Positives = 130/280 (46%), Gaps = 37/280 (13%)

Query: 1   GDFVTETVTLGSAS-----VDNIAIGCGHNNEGLF----VGAAGLLGLGGGSLSFPSQIN 51
           G    ET TL   S        +  GC H    +F    +G AGLLGLG G  S  SQ  
Sbjct: 213 GSLAEETFTLSPPSPLAPAATGVVFGCSHEYISVFNDTGMGVAGLLGLGRGDSSILSQTR 272

Query: 52  AS------TFSYCLVDRDSDSTS-TLEFDSSLPPNAVT----APLLRN-HELDTFYYLGL 99
            S       FSYCL  R S +   T+   ++ P    +     PL+    +L + Y + L
Sbjct: 273 RSINSGGGVFSYCLPPRGSSTGYLTIGGGAAAPQQQYSNLSFTPLITTISQLRSAYVVNL 332

Query: 100 TGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFV--RGTRALS 157
            G+SV G  + I  +AF +      G ++DSGT VT +    Y  LRD F    G+  + 
Sbjct: 333 AGVSVNGAAVDIPASAFSL------GAVIDSGTVVTHMPAAAYYPLRDEFRLHMGSYKML 386

Query: 158 PTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPV---DSNGT----FC 210
           P   + L DTCYD + +  V  P V+  F  G  + + A   L+ +   D +G      C
Sbjct: 387 PEGSMKLLDTCYDVTGQDVVTAPRVALEFGGGARIDVDASGILLVLPAEDGSGQSLTLAC 446

Query: 211 FAFAPTSSS-LSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
            AF PT+S+ L I+GN+QQ+   V F++    IGF PN C
Sbjct: 447 LAFLPTNSAGLVIVGNMQQRAYNVVFDVDGGRIGFGPNGC 486


>gi|359476197|ref|XP_003631803.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 414

 Score =  112 bits (280), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 91/262 (34%), Positives = 126/262 (48%), Gaps = 35/262 (13%)

Query: 1   GDFVTETVTLGSASV-DNIAIGCGHNNEGLF-VGAAGLLGLGGGSLSFPSQINAS---TF 55
           G++  +T+TL  + V      GCG NNEG F  GA G+LGLG G LS  SQ  +     F
Sbjct: 151 GNYGCDTMTLEPSDVFPKFQFGCGRNNEGDFGSGADGMLGLGQGQLSTVSQTASKFKKVF 210

Query: 56  SYCLVDRDS----------DSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVG 105
           SYCL + DS           S S+L+F S      V  P     E   +Y++ L  ISVG
Sbjct: 211 SYCLPEEDSIGSLLFGEKATSQSSLKFTS-----LVNGPGTSGLEESGYYFVKLLDISVG 265

Query: 106 GDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVA-- 163
              L +  + F      + G I+DSGT +T L    Y+AL  AF +       ++G    
Sbjct: 266 NKRLNVPSSVF-----ASPGTIIDSGTVITCLPQRAYSALTAAFKKAMAKYPLSNGRRKK 320

Query: 164 --LFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSS-- 219
             + DTCY+ S R  V +P +  HF EG  + L  K  +   D++   C AFA  S S  
Sbjct: 321 GDILDTCYNLSGRKDVLLPEIVLHFGEGADVRLNGKRVIWGNDAS-RLCLAFAGNSKSTM 379

Query: 220 ---LSIIGNVQQQGTRVSFNLR 238
              L+IIGN QQ    V ++++
Sbjct: 380 NSELTIIGNRQQVSLTVLYDIQ 401


>gi|54290728|dbj|BAD62398.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
          Length = 486

 Score =  112 bits (279), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 92/247 (37%), Positives = 122/247 (49%), Gaps = 16/247 (6%)

Query: 12  SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSYCLVDRDSDSTS 68
           S ++     GCG  N G F    GLLGLG G LS PSQ  AS    FSYCL   +S +T 
Sbjct: 247 SRALTGFPFGCGTRNLGDFGRVDGLLGLGRGELSLPSQAAASFGAVFSYCLPSSNS-TTG 305

Query: 69  TLEFDSSLPPN---AVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGG 125
            L   ++   +   A    +LR  +  +FY++ L  I +GG +LP+    F       GG
Sbjct: 306 YLTIGATPATDTGAAQYTAMLRKPQFPSFYFVELVSIDIGGYVLPVPPAVFT-----RGG 360

Query: 126 IIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFH 185
            ++DSGT +T L  + Y  LRD F       +P     + D CYDF+  S V VP VSF 
Sbjct: 361 TLLDSGTVLTYLPAQAYALLRDRFRLTMERYTPAPPNDVLDACYDFAGESEVVVPAVSFR 420

Query: 186 FPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS---SLSIIGNVQQQGTRVSFNLRNSLI 242
           F +G V  L     +I +D N   C AFA   +    LSIIGN QQ+   V +++    I
Sbjct: 421 FGDGAVFELDFFGVMIFLDEN-VGCLAFAAMDTGGLPLSIIGNTQQRSAEVIYDVAAEKI 479

Query: 243 GFTPNKC 249
           GF P  C
Sbjct: 480 GFVPASC 486


>gi|326520291|dbj|BAK07404.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 464

 Score =  112 bits (279), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 83/264 (31%), Positives = 124/264 (46%), Gaps = 32/264 (12%)

Query: 12  SASVDNIAIGCGHNNEGLFV-GAAGLLGLGGGSLSFPSQINASTFSYCLVDRDSDSTS-- 68
           +A+V NI  GCG  N GLF    +G+ G G G LS PSQ+    FSYC    +    S  
Sbjct: 204 AAAVPNIRFGCGMMNYGLFTPNQSGIAGFGTGPLSLPSQLKVRRFSYCFTAMEESRVSPV 263

Query: 69  -------TLEFDSSLP-------PNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISET 114
                   +E  ++ P       P    AP+        FY+L L G++VG   LP + +
Sbjct: 264 ILGGEPENIEAHATGPIQSTPFAPGPAGAPVGSQ----PFYFLSLRGVTVGETRLPFNAS 319

Query: 115 AFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFS-- 172
            F +   G+GG  +DSGTA+T      + +LR+AFV     L    G    D    FS  
Sbjct: 320 TFALKGDGSGGTFIDSGTAITFFPQAVFRSLREAFV-AQVPLPVAKGYTDPDNLLCFSVP 378

Query: 173 -SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGT-----FCFA-FAPTSSSLSIIGN 225
             + +  VP +  H  EG    LP +NY++  D +G+      C    +  +S+ +IIGN
Sbjct: 379 AKKKAPAVPKLILHL-EGADWELPRENYVLDNDDDGSGAGRKLCVVILSAGNSNGTIIGN 437

Query: 226 VQQQGTRVSFNLRNSLIGFTPNKC 249
            QQQ   + ++L ++ + F P +C
Sbjct: 438 FQQQNMHIVYDLESNKMVFAPARC 461


>gi|359497446|ref|XP_003635520.1| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
           vinifera]
          Length = 354

 Score =  112 bits (279), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 81/243 (33%), Positives = 124/243 (51%), Gaps = 14/243 (5%)

Query: 12  SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSYCLVDRDSDSTS 68
           S ++     GCG ++EGLF  AAG+LGLG   LS   Q+++     FSYCL  R      
Sbjct: 120 SQTLPGFVYGCGQDSEGLFGRAAGILGLGRNKLSMLGQVSSKFGYAFSYCLPTRGGGGFL 179

Query: 69  TLEFDSSLPPNAVT-APLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGII 127
           ++   +SL  +A    P+  +    + Y+L LT I+VGG  L ++   +++        I
Sbjct: 180 SIG-KASLAGSAYKFTPMTTDPGNPSLYFLRLTAITVGGRALGVAAAQYRVPT------I 232

Query: 128 VDSGTAVTRLQTETYNALRDAFVR-GTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHF 186
           +DSGT +TRL    Y   + AFV+  +   +   G ++ DTC+  + +    VP V   F
Sbjct: 233 IDSGTVITRLPMSVYTPFQQAFVKIMSSKYARAPGFSILDTCFKGNLKDMQSVPEVRLIF 292

Query: 187 PEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNSLIGFTP 246
             G  L L   N L+ VD  G  C AFA  ++ ++IIGN QQQ  +V+ ++  + IGF  
Sbjct: 293 QGGADLNLRPVNVLLQVD-EGLTCLAFA-GNNGVAIIGNHQQQTFKVAHDISTARIGFAT 350

Query: 247 NKC 249
             C
Sbjct: 351 GGC 353


>gi|147776519|emb|CAN74010.1| hypothetical protein VITISV_003547 [Vitis vinifera]
          Length = 429

 Score =  112 bits (279), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 67/177 (37%), Positives = 93/177 (52%), Gaps = 9/177 (5%)

Query: 77  PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTR 136
           P N  T PLLRN    T YY+ LTG+SVG  L+P++      D +   G I+DSGT +TR
Sbjct: 257 PKNIRTTPLLRNPHRPTLYYVNLTGVSVGRVLVPVAPELLAFDPNTGAGTIIDSGTVITR 316

Query: 137 LQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPA 196
                Y A+RD F +  +   P   +  FDTC  F++ +    P V+FHF  G  L LP 
Sbjct: 317 FVEPVYAAIRDEFRKQVKG--PFATIGAFDTC--FAATNEDIAPPVTFHF-TGMDLKLPL 371

Query: 197 KNYLIPVDSNGTFCFAFAP----TSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           +N LI   +    C A A      +S L++I N+QQQ  R+ F++ NS +G     C
Sbjct: 372 ENTLIHSSAGSLACLAMAAAPNNVNSVLNVIANLQQQNLRIMFDVTNSRLGIARELC 428


>gi|357514989|ref|XP_003627783.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355521805|gb|AET02259.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 416

 Score =  112 bits (279), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 90/272 (33%), Positives = 137/272 (50%), Gaps = 34/272 (12%)

Query: 1   GDFVTETVTLGSA-----SVDNIAIGCGHNNEGLFVGAA-GLLGLGGGSLSFPSQINAS- 53
           G+   +T+TL S+     S     IGCG +N   F GA+ G++GLGGG  S  +Q+ +S 
Sbjct: 153 GNLSVDTLTLESSTGHPISFPKTVIGCGTDNTVSFEGASSGIVGLGGGPASLITQLGSSI 212

Query: 54  --TFSYCLVDR--DSDSTSTLEF-DSSLPPN--AVTAPLLRNHELDTFYYLGLTGISVGG 106
              FSYCL+    +S++TS L F D+++      V+ P+++   +  FYYL L   SVG 
Sbjct: 213 DAKFSYCLLPNPVESNTTSKLNFGDTAVVSGDGVVSTPIVKKDPI-VFYYLTLEAFSVGN 271

Query: 107 DLLPISETAFKIDESGNGG----IIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGV 162
             +       + + S NGG    II+DSGT +T + T+ YN L  A +   +     D  
Sbjct: 272 KRI-------EFEGSSNGGHEGNIIIDSGTTLTVIPTDVYNNLESAVLELVKLKRVNDPT 324

Query: 163 ALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS---- 218
            LF+ CY  +S    + P ++ HF    V   P   ++   D  G  C AFA TS+    
Sbjct: 325 RLFNLCYSVTS-DGYDFPIITTHFKGADVKLHPISTFVDVAD--GIVCLAFATTSAFIPS 381

Query: 219 -SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
             +SI GN+ QQ   V ++L+  ++ F P  C
Sbjct: 382 DVVSIFGNLAQQNLLVGYDLQQKIVSFKPTDC 413


>gi|218188634|gb|EEC71061.1| hypothetical protein OsI_02803 [Oryza sativa Indica Group]
          Length = 479

 Score =  112 bits (279), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 86/262 (32%), Positives = 124/262 (47%), Gaps = 25/262 (9%)

Query: 1   GDFVTETVTL-GSASVDNIAIGCGHNN--EGLFVGAAGLLGLGGGSLSFPSQINA---ST 54
           G + ++ +TL GS  V     GC H     G+     GL+GLGG + S  SQ  A    +
Sbjct: 230 GTYSSDVLTLSGSDVVRGFQFGCSHAELGAGMDDKTDGLIGLGGDAQSLVSQTAARYGKS 289

Query: 55  FSYCLVDRDSDS-----TSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLL 109
           FSYCL    + S      +             T P+LR+ ++ T+Y+  L  I+VGG  L
Sbjct: 290 FSYCLPATPASSGFLTLGAPASGGGGGASRFATTPMLRSKKVPTYYFAALEDIAVGGKKL 349

Query: 110 PISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCY 169
            +S + F        G +VDSGT +TRL    Y AL  AF  G    +  + + + DTC+
Sbjct: 350 GLSPSVFA------AGSLVDSGTVITRLPPAAYAALSSAFRAGMTRYARAEPLGILDTCF 403

Query: 170 DFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS--SSLSIIGNVQ 227
           +F+    V +PTV+  F  G V+ L A   +    S G  C AFAPT    +   IGNVQ
Sbjct: 404 NFTGLDKVSIPTVALVFAGGAVVDLDAHGIV----SGG--CLAFAPTRDDKAFGTIGNVQ 457

Query: 228 QQGTRVSFNLRNSLIGFTPNKC 249
           Q+   V +++   + GF    C
Sbjct: 458 QRTFEVLYDVGGGVFGFRAGAC 479


>gi|242092368|ref|XP_002436674.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
 gi|241914897|gb|EER88041.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
          Length = 461

 Score =  111 bits (278), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 88/283 (31%), Positives = 123/283 (43%), Gaps = 51/283 (18%)

Query: 1   GDFVTETVTLGSASVD--------NIAIGCGHNNEGLFVG-AAGLLGLGGGSLSFPSQIN 51
           G+  T+  T G  + D         +  GCGH N+G+F     G+ G G G  S PSQ+N
Sbjct: 189 GEIATDRFTFGGDNGDGDSRLPTRRLTFGCGHFNKGVFQSNETGIAGFGRGRWSLPSQLN 248

Query: 52  ASTFSYCLVDRDSDSTSTLEFDSSL------PPNAV-------------TAPLLRNHELD 92
            +TFSYC        TS  E  SSL      P  A+             T PLL+N    
Sbjct: 249 VTTFSYCF-------TSMFESKSSLVTLGGAPAAALLYSHAAHISGEVRTTPLLKNPSQP 301

Query: 93  TFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRG 152
           + Y+L L GISVG   L + E   +         I+DSG ++T L    Y A++  F   
Sbjct: 302 SLYFLSLKGISVGKTRLAVPEAKLR-------STIIDSGASITTLPEAVYEAVKAEFA-A 353

Query: 153 TRALSPT---DGVALFDTCYDFSSRSSVE---VPTVSFHFPEGKVLPLPAKNYLIPVDSN 206
              L PT   +G AL D C+     +      VP+++ H  +G    LP  NY+    + 
Sbjct: 354 QVGLPPTGVVEGSAL-DLCFALPVTALWRRPPVPSLTLHL-DGADWELPRGNYVFEDLAA 411

Query: 207 GTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
              C          ++IGN QQQ T V ++L N  + F P +C
Sbjct: 412 RVMCVVLDAAPGDQTVIGNFQQQNTHVVYDLENDWLSFAPARC 454


>gi|326500408|dbj|BAK06293.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 475

 Score =  111 bits (278), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 87/258 (33%), Positives = 123/258 (47%), Gaps = 21/258 (8%)

Query: 1   GDFVTETVTL-GSASVDNIAIGCGHNNEGLFVG-AAGLLGLGGGSLSFPSQINAS---TF 55
           G ++T+T+T+ G+ +V N   GC H   G F    AG + LGGG+ S  +Q   S    F
Sbjct: 230 GTYMTDTLTISGTTAVRNFRFGCSHAVRGRFSDLTAGTMSLGGGAQSLLAQTARSLGNAF 289

Query: 56  SYCLVDRDSDSTSTLEFDSSLPPNAV--TAPLLRNHELDTFYYLGLTGISVGGDLLPISE 113
           SYC+    +    ++   ++     V  T PL+R+    + Y + L GI V G  L I  
Sbjct: 290 SYCVPQASASGFLSIGGPATTNSTTVFATTPLVRSAINPSLYLVRLQGIVVAGRRLGIPP 349

Query: 114 TAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSS 173
            AF      + G ++DS   +T+L    Y ALR AF    RA   +      DTCYDF  
Sbjct: 350 VAF------SAGAVMDSSAVITQLPPTAYRALRRAFRNAMRAYPRSGATGTLDTCYDFLG 403

Query: 174 RSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSI--IGNVQQQGT 231
            ++V VP VS  F  G V+ L     +I        C AF  TSS L++  IGNVQQQ  
Sbjct: 404 LTNVRVPAVSLVFGGGAVVVLDPPAVMI------GGCLAFTATSSDLALGFIGNVQQQTH 457

Query: 232 RVSFNLRNSLIGFTPNKC 249
            V +++    +GF    C
Sbjct: 458 EVLYDVAAGGVGFRRGAC 475


>gi|218192707|gb|EEC75134.1| hypothetical protein OsI_11325 [Oryza sativa Indica Group]
          Length = 401

 Score =  111 bits (278), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 73/197 (37%), Positives = 100/197 (50%), Gaps = 13/197 (6%)

Query: 13  ASVDNIAIGCGHNNEGLFVG-AAGLLGLGGGSLSFPSQINASTFSYCLVDRDSDSTSTLE 71
           ASV  +A GCG  N G+F     G+ G G G LS PSQ+    FS+C    +    ST+ 
Sbjct: 189 ASVPGVAFGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKVGNFSHCFTAVNGLKPSTVL 248

Query: 72  FDSSLPPN--------AVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGN 123
            D  LP +          + PL++N    TFYYL L GI+VG   LP+ E+ F + ++G 
Sbjct: 249 LD--LPADLYKSGRGAVQSTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFAL-KNGT 305

Query: 124 GGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVS 183
           GG I+DSGTA+T L T  Y  +RDAF    +    +        C     R+   VP + 
Sbjct: 306 GGTIIDSGTAMTSLPTRVYRLVRDAFAAQVKLPVVSGNTTDPYFCLSAPLRAKPYVPKLV 365

Query: 184 FHFPEGKVLPLPAKNYL 200
            HF EG  + LP +NY+
Sbjct: 366 LHF-EGATMDLPRENYV 381


>gi|56542455|gb|AAV92892.1| Avr9/Cf-9 rapidly elicited protein 36, partial [Nicotiana tabacum]
          Length = 191

 Score =  111 bits (277), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 52/164 (31%), Positives = 86/164 (52%), Gaps = 1/164 (0%)

Query: 87  RNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALR 146
           + + L+TFYY+ +  + VGG++L I E  + +   G GG I+DSGT ++      Y  ++
Sbjct: 25  KENHLETFYYVQIKSVIVGGEVLNIPEETWNLSTEGVGGTIIDSGTTLSYFAEPAYEIIK 84

Query: 147 DAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSN 206
            AFV   +     D   +   CY+ S    +E+P+    F +G +   P +NY I ++  
Sbjct: 85  QAFVNKVKRYPILDDFPILKPCYNVSGVEKLELPSFGIVFGDGAIWTFPVENYFIKLEPE 144

Query: 207 GTFCFAFAPT-SSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
              C A   T  S++SIIGN QQQ   + ++ + S +GF P +C
Sbjct: 145 DIVCLAILGTPHSAMSIIGNYQQQNFHILYDTKRSRLGFAPRRC 188


>gi|125558629|gb|EAZ04165.1| hypothetical protein OsI_26307 [Oryza sativa Indica Group]
          Length = 455

 Score =  111 bits (277), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 90/271 (33%), Positives = 130/271 (47%), Gaps = 24/271 (8%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
           G   TET+T+G  +   +A GC   N      ++G++GLG G LS  SQ+    FSYCL 
Sbjct: 184 GYLATETLTVGDGTFPKVAFGCSTENG--VDNSSGIVGLGRGPLSLVSQLAVGRFSYCLR 241

Query: 61  DRDSD-STSTLEFDS--SLPPNAV--TAPLLRNHELD--TFYYLGLTGISVGGDLLPISE 113
              +D   S + F S   L   +V  + PLL+N  L   T YY+ LTGI+V    LP++ 
Sbjct: 242 SDMADGGASPILFGSLAKLTERSVVQSTPLLKNPYLQRSTHYYVNLTGIAVDSTELPVTG 301

Query: 114 TAFKIDESG-NGGIIVDSGTAVTRLQTETYNALRDAFVRGTRAL---SPTDGVAL-FDTC 168
           + F   ++G  GG IVDSGT +T L  + Y  ++ AF      L   +P  G     D C
Sbjct: 302 STFGFTQTGLGGGTIVDSGTTLTYLAKDGYAMVKQAFQSQMANLNQTTPASGAPYDLDLC 361

Query: 169 YDFSS---RSSVEVPTVSFHFPEGKVLPLPAKNYL--IPVDSNGTF---CFAFAPTSSSL 220
           Y  S+     +V VP ++  F  G    +P +NY   +  DS G     C    P +  L
Sbjct: 362 YKPSAGGGGKAVRVPRLALRFAGGAKYNVPVQNYFAGVEADSQGRVTVACLLVLPATDDL 421

Query: 221 --SIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
             SIIGN+ Q    + +++   +  F P  C
Sbjct: 422 PISIIGNLMQMDMHLLYDIDGGMFSFAPADC 452


>gi|357487631|ref|XP_003614103.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355515438|gb|AES97061.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 431

 Score =  111 bits (277), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 85/263 (32%), Positives = 129/263 (49%), Gaps = 23/263 (8%)

Query: 1   GDFVTETVTL-----GSASVDNIAIGCGHNNEGLFVG-AAGLLGLGGGSLSFPSQINAS- 53
           G+   ET+TL      S S     IGCGHNN G+F G  +G++GLG G +S  +Q+ +S 
Sbjct: 175 GELSVETLTLDSTTGHSVSFPKTVIGCGHNNRGMFQGETSGIVGLGIGPVSLTTQLKSSI 234

Query: 54  --TFSYCLVDR--DSDSTSTLEF-DSSLPPN--AVTAPLLRNHELDTFYYLGLTGISVGG 106
              FSYCL+    DS+ TS L F D+++      V+ P ++  +   FYYL L   SVG 
Sbjct: 235 GGKFSYCLLPLLVDSNKTSKLNFGDAAVVSGDGVVSTPFVKK-DPQAFYYLTLEAFSVGN 293

Query: 107 DLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFD 166
             +        +D+S  G II+DSGT +T L +  Y  L  A  +  +     D   L +
Sbjct: 294 KRIEFEV----LDDSEEGNIILDSGTTLTLLPSHVYTNLESAVAQLVKLDRVDDPNQLLN 349

Query: 167 TCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNV 226
            CY  +S    + P ++ HF    +   P   +    D  G  C AF  +S +  I GN+
Sbjct: 350 LCYSITS-DQYDFPIITAHFKGADIKLNPISTFAHVAD--GVVCLAFT-SSQTGPIFGNL 405

Query: 227 QQQGTRVSFNLRNSLIGFTPNKC 249
            Q    V ++L+ +++ F P+ C
Sbjct: 406 AQLNLLVGYDLQQNIVSFKPSDC 428


>gi|357124468|ref|XP_003563922.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 450

 Score =  111 bits (277), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 92/257 (35%), Positives = 128/257 (49%), Gaps = 17/257 (6%)

Query: 1   GDFVTETVTLGSA-SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFS 56
           G    +TV+L S+ S      GCG +N GLF  AAGL+GL    LS  SQ+  S   +F+
Sbjct: 202 GYLSKDTVSLSSSGSFPGFYYGCGQDNVGLFGRAAGLIGLARNKLSLLSQLAPSVGNSFA 261

Query: 57  YCLVDRDSDSTSTLEFDS---SLPPNAVTAPLLRNHELD-TFYYLGLTGISVGGDLLPIS 112
           YCL    + S   L F S   +  P   +   + +  LD + Y++ L G+SV G  L + 
Sbjct: 262 YCLPTSAAASAGYLSFGSNSDNKNPGKYSYTSMVSSSLDASLYFVSLAGMSVAGSPLAVP 321

Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFS 172
            +     E G+   I+DSGT +TRL T  Y AL  A V    A       ++  TC+   
Sbjct: 322 SS-----EYGSLPTIIDSGTVITRLPTPVYTALSKA-VGAALAAPSAPAYSILQTCFK-G 374

Query: 173 SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTR 232
             + + VP V+  F  G  L L   N L+ V+   T C AFAPT S+ +IIGN QQQ   
Sbjct: 375 QVAKLPVPAVNMAFAGGATLRLTPGNVLVDVNET-TTCLAFAPTDST-AIIGNTQQQTFS 432

Query: 233 VSFNLRNSLIGFTPNKC 249
           V ++++ S IGF    C
Sbjct: 433 VVYDVKGSRIGFAAGGC 449


>gi|50508279|dbj|BAD32128.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
           Group]
 gi|125600536|gb|EAZ40112.1| hypothetical protein OsJ_24555 [Oryza sativa Japonica Group]
          Length = 455

 Score =  110 bits (276), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 90/271 (33%), Positives = 130/271 (47%), Gaps = 24/271 (8%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
           G   TET+T+G  +   +A GC   N      ++G++GLG G LS  SQ+    FSYCL 
Sbjct: 184 GYLATETLTVGDGTFPKVAFGCSTENG--VDNSSGIVGLGRGPLSLVSQLAVGRFSYCLR 241

Query: 61  DRDSD-STSTLEFDS--SLPPNAV--TAPLLRNHELD--TFYYLGLTGISVGGDLLPISE 113
              +D   S + F S   L   +V  + PLL+N  L   T YY+ LTGI+V    LP++ 
Sbjct: 242 SDMADGGASPILFGSLAKLTEGSVVQSTPLLKNPYLQRSTHYYVNLTGIAVDSTELPVTG 301

Query: 114 TAFKIDESG-NGGIIVDSGTAVTRLQTETYNALRDAFVRGTRAL---SPTDGVAL-FDTC 168
           + F   ++G  GG IVDSGT +T L  + Y  ++ AF      L   +P  G     D C
Sbjct: 302 STFGFTQTGLGGGTIVDSGTTLTYLAKDGYAMVKQAFQSQMANLNQTTPASGAPYDLDLC 361

Query: 169 YDFSS---RSSVEVPTVSFHFPEGKVLPLPAKNYL--IPVDSNGTF---CFAFAPTSSSL 220
           Y  S+     +V VP ++  F  G    +P +NY   +  DS G     C    P +  L
Sbjct: 362 YKPSAGGGGKAVRVPRLALRFAGGAKYNVPVQNYFAGVEADSQGRVTVACLLVLPATDDL 421

Query: 221 --SIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
             SIIGN+ Q    + +++   +  F P  C
Sbjct: 422 PISIIGNLMQMDMHLLYDIDGGMFSFAPADC 452


>gi|15230868|ref|NP_189198.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|11994761|dbj|BAB03090.1| chloroplast nucleoid DNA binding protein-like; nucellin-like
           protein [Arabidopsis thaliana]
 gi|189339286|gb|ACD89063.1| At3g25700 [Arabidopsis thaliana]
 gi|332643533|gb|AEE77054.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 452

 Score =  110 bits (276), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 92/286 (32%), Positives = 130/286 (45%), Gaps = 53/286 (18%)

Query: 1   GDFVTETVTLGSAS-----VDNIAIGCGHNNEGL------FVGAAGLLGLGGGSLSFPSQ 49
           G F  ET +L ++S     + ++A GCG    G       F GA G++GLG G +SF SQ
Sbjct: 180 GLFARETTSLKTSSGKEARLKSVAFGCGFRISGQSVSGTSFNGANGVMGLGRGPISFASQ 239

Query: 50  IN---ASTFSYCLVDRDSDSTSTLEFDSSLPPNAVT--------------APLLRNHELD 92
           +     + FSYCL+D          +  S PP +                 PLL N    
Sbjct: 240 LGRRFGNKFSYCLMD----------YTLSPPPTSYLIIGNGGDGISKLFFTPLLTNPLSP 289

Query: 93  TFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRG 152
           TFYY+ L  + V G  L I  + ++ID+SGNGG +VDSGT +  L    Y ++  A  R 
Sbjct: 290 TFYYVKLKSVFVNGAKLRIDPSIWEIDDSGNGGTVVDSGTTLAFLAEPAYRSVIAAVRRR 349

Query: 153 TR-----ALSPTDGVALFDTCYDFSSRSSVE--VPTVSFHFPEGKVLPLPAKNYLIPVDS 205
            +     AL+P      FD C + S  +  E  +P + F F  G V   P +NY I  + 
Sbjct: 350 VKLPIADALTPG-----FDLCVNVSGVTKPEKILPRLKFEFSGGAVFVPPPRNYFIETEE 404

Query: 206 NGTFCFAFAPTSSSL--SIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
               C A       +  S+IGN+ QQG    F+   S +GF+   C
Sbjct: 405 Q-IQCLAIQSVDPKVGFSVIGNLMQQGFLFEFDRDRSRLGFSRRGC 449


>gi|357143850|ref|XP_003573078.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 431

 Score =  110 bits (276), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 81/259 (31%), Positives = 126/259 (48%), Gaps = 15/259 (5%)

Query: 5   TETVTLGSA------SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYC 58
           TET+TLGS+      SV ++A GCG +N G  + + G +GLG G+LS  +Q+    FSYC
Sbjct: 171 TETLTLGSSVPGQAVSVSDVAFGCGTDNGGDSLNSTGTVGLGRGTLSLLAQLGVGKFSYC 230

Query: 59  LVD---RDSDSTSTLEFDSSLPPN---AVTAPLLRNHELDTFYYLGLTGISVGGDLLPIS 112
           L D      DS   L   + L P      + PLL++    + Y + L GI++G   LPI 
Sbjct: 231 LTDFFNSTLDSPFLLGTLAELAPGPGAVQSTPLLQSPLNPSRYVVSLQGITLGDVRLPIP 290

Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFS 172
              F +  +  GG++VDSGT  + L    +  + D  V       P +  +L   C+   
Sbjct: 291 NKTFDLHANSTGGMVVDSGTTFSILPESGFRVVVD-HVAQVLGQPPVNASSLDSPCFPAP 349

Query: 173 S--RSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQG 230
           +  R    +P +  HF  G  + L   NY+     + +FC     T+S+ S++GN QQQ 
Sbjct: 350 AGERQLPFMPDLVLHFAGGADMRLHRDNYMSYNQEDSSFCLNIVGTTSTWSMLGNFQQQN 409

Query: 231 TRVSFNLRNSLIGFTPNKC 249
            ++ F++    + F P  C
Sbjct: 410 IQMLFDMTVGQLSFLPTDC 428


>gi|357153697|ref|XP_003576537.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 474

 Score =  110 bits (275), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 88/263 (33%), Positives = 125/263 (47%), Gaps = 21/263 (7%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGL-FVGAAGLLGLGGGSLSFPSQIN---ASTFS 56
           G    + + L    ++    GCG +N+G  F G +GL+GLG   +S  SQ        FS
Sbjct: 216 GVLARDKLRLAGQDIEGFVFGCGTSNQGAPFGGTSGLMGLGRSHVSLVSQTMDQFGGVFS 275

Query: 57  YCLVDRDSDSTSTLEF--DSSL----PPNAVTAPLLRNHELD-TFYYLGLTGISVGGDLL 109
           YCL  R+S S+ +L    DSS      P   TA +  +  L   FY+L LTGI+VGG   
Sbjct: 276 YCLPMRESGSSGSLVLGDDSSAYRNSTPIVYTAMVSDSGPLQGPFYFLNLTGITVGGQ-- 333

Query: 110 PISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCY 169
            +    F       G +I+DSGT +T L    YNA+R  F+            ++ DTC+
Sbjct: 334 EVESPWFSA-----GRVIIDSGTIITTLVPSVYNAVRAEFLSQLAEYPQAPAFSILDTCF 388

Query: 170 DFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGT-FCFAFAPTSSSL--SIIGNV 226
           + +    V+VP++ F F     + + +K  L  V S+ +  C A A   S    SIIGN 
Sbjct: 389 NLTGLKEVQVPSLKFVFEGSVEVEVDSKGVLYFVSSDASQVCLALASLKSEYDTSIIGNY 448

Query: 227 QQQGTRVSFNLRNSLIGFTPNKC 249
           QQ+  RV F+   S IGF    C
Sbjct: 449 QQKNLRVIFDTLGSQIGFAQETC 471


>gi|414878073|tpg|DAA55204.1| TPA: hypothetical protein ZEAMMB73_344109 [Zea mays]
          Length = 440

 Score =  110 bits (274), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 88/260 (33%), Positives = 125/260 (48%), Gaps = 27/260 (10%)

Query: 17  NIAIGC---GHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLVDRDSDSTSTLEF- 72
           ++A GC        G   GA+G++GLG G+LS  SQ+  + FSYCL    S ST+T    
Sbjct: 178 SLAFGCIAATRLTPGSLDGASGIIGLGRGNLSLVSQLGDNKFSYCLTPYFSQSTNTSRLF 237

Query: 73  ------DSSLPPNAVTAPLLRNHELD---TFYYLGLTGISVGGDLLPISETAFKIDESGN 123
                  SS    A + P L+N ++D   TFYYL LTGI+VG   L + E AF + +   
Sbjct: 238 VGASAGLSSGGAPATSVPFLKNPDVDPFSTFYYLPLTGITVGDAKLAVPEAAFDLRQVAT 297

Query: 124 G---GIIVDSGTAVTRLQTETYNALRDAFVR--GTRALSPTDGVALFDTCYDFSSRSSVE 178
           G   G ++DSG+  T L    Y ALRD  V+  G   + P  G    D C   +     +
Sbjct: 298 GLWAGTLIDSGSPFTSLVDVAYQALRDELVQQLGASIVPPPAGAEGLDLCAAVAHGDVGK 357

Query: 179 -VPTVSFHFPE-GKVLPLPAKNYLIPVDSNGTFCFAFA---PTSS----SLSIIGNVQQQ 229
            VP +  HF   G  + +P +NY  PVD +      F+   P S+      +IIGN  QQ
Sbjct: 358 LVPPLVLHFGSGGGDVAVPPENYWGPVDDSTACMVVFSSGGPNSTLPMNETTIIGNYMQQ 417

Query: 230 GTRVSFNLRNSLIGFTPNKC 249
              + ++L   ++ F P  C
Sbjct: 418 DMHLLYDLEKGMLSFQPADC 437


>gi|195627138|gb|ACG35399.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
          Length = 431

 Score =  110 bits (274), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 83/261 (31%), Positives = 121/261 (46%), Gaps = 21/261 (8%)

Query: 5   TETVTLGSASVDNIAIGCGHNNEG--LFVGAAGLLGLGGGSLSFPSQINAS---TFSYCL 59
           ++T+ LG  ++   A GC     G    +   GLLGLG G +S  SQ  ++    FSYCL
Sbjct: 175 SDTLRLGKDAIAGYAFGCVGAVAGPTTNLPKQGLLGLGRGPMSLLSQTGSTYNGVFSYCL 234

Query: 60  VDRDSDSTSTLEFDSSL-------PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPIS 112
                 S  +  F  SL       P N    PLL N    + YY+ +TG+SVG   + + 
Sbjct: 235 -----PSYRSYYFSGSLRLGAAGQPRNVRYTPLLTNPHRPSLYYVNVTGLSVGRTWVKVP 289

Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFS 172
             +F  D +   G ++DSGT +TR     Y ALR+ F R   A S    +  FDTC++  
Sbjct: 290 AGSFAFDPATGAGTVIDSGTVITRWTAPVYAALREEFRRQVAAPSGYTSLGAFDTCFNTD 349

Query: 173 SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS----SLSIIGNVQQ 228
             ++   P V+ H   G  L LP +N LI   +    C A A         ++++ N+QQ
Sbjct: 350 EVAAGGAPPVTLHMDGGVDLTLPMENTLIHSSATPLACLAMAEAPQNVNAVVNVVANLQQ 409

Query: 229 QGTRVSFNLRNSLIGFTPNKC 249
           Q  RV  ++  S +GF    C
Sbjct: 410 QNVRVVVDVAGSRVGFAREPC 430


>gi|356557014|ref|XP_003546813.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 435

 Score =  110 bits (274), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 87/268 (32%), Positives = 131/268 (48%), Gaps = 41/268 (15%)

Query: 5   TETVTLGS------ASVDNIAIGCG-HNNEGLFVG--AAGLLGLGGGSLSFPSQINAS-- 53
           TET++ GS       S  N   GCG  NN  ++      G+ GLG G LS  SQ+ A   
Sbjct: 183 TETLSFGSTGGAQTVSFPNTIFGCGVDNNFTIYTSNKVMGIAGLGAGPLSLVSQLGAQIG 242

Query: 54  -TFSYCLVDRDSDSTSTLEFDSS--LPPN-AVTAPLLRNHELDTFYYLGLTGISVGGDLL 109
             FSYCL+  DS STS L+F S   +  N  V+ PL+    L T+Y+L L  +++G  ++
Sbjct: 243 HKFSYCLLPYDSTSTSKLKFGSEAIITTNGVVSTPLIIKPSLPTYYFLNLEAVTIGQKVV 302

Query: 110 PISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFD--- 166
              +T        +G I++DSGT +T L+   YN           +L  T GV L     
Sbjct: 303 STGQT--------DGNIVIDSGTPLTYLENTFYNNF-------VASLQETLGVKLLQDLP 347

Query: 167 ----TCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS-SLS 221
               TC  F +R+++ +P ++F F  G  + L  KN LIP+  +   C A  P+S   +S
Sbjct: 348 SPLKTC--FPNRANLAIPDIAFQF-TGASVALRPKNVLIPLTDSNILCLAVVPSSGIGIS 404

Query: 222 IIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           + G++ Q   +V ++L    + F P  C
Sbjct: 405 LFGSIAQYDFQVEYDLEGKKVSFAPTDC 432


>gi|20160862|dbj|BAB89801.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
          Length = 488

 Score =  110 bits (274), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 80/247 (32%), Positives = 121/247 (48%), Gaps = 8/247 (3%)

Query: 5   TETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLVDRDS 64
           TE  T G   +D +  GCG  N G F G +G++GLG G+LS  SQ+    FSY     DS
Sbjct: 186 TEAFTFGDTRIDGVVFGCGLKNVGDFSGVSGVIGLGRGNLSLVSQLQVDRFSYHFAPDDS 245

Query: 65  -DSTSTLEFDSSLPP---NAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKI-D 119
            D+ S + F     P   + ++  LL +    + YY+ L GI V G  L I    F + +
Sbjct: 246 VDTQSFILFGDDATPQTSHTLSTRLLASDANPSLYYVELAGIQVDGKDLAIPSGTFDLRN 305

Query: 120 ESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVAL-FDTCYDFSSRSSVE 178
           + G+GG+ +     VT L+   Y  LR A V     L   +G AL  D CY   S +  +
Sbjct: 306 KDGSGGVFLSITDLVTVLEEAAYKPLRQA-VASKIGLPAVNGSALGLDLCYTGESLAKAK 364

Query: 179 VPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS-SLSIIGNVQQQGTRVSFNL 237
           VP+++  F  G V+ L   NY     + G  C    P+S+   S++G++ Q GT + +++
Sbjct: 365 VPSMALVFAGGAVMELELGNYFYMDSTTGLACLTILPSSAGDGSVLGSLIQVGTHMMYDI 424

Query: 238 RNSLIGF 244
             S + F
Sbjct: 425 NGSKLVF 431


>gi|125528511|gb|EAY76625.1| hypothetical protein OsI_04577 [Oryza sativa Indica Group]
          Length = 492

 Score =  110 bits (274), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 80/247 (32%), Positives = 121/247 (48%), Gaps = 8/247 (3%)

Query: 5   TETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLVDRDS 64
           TE  T G   +D +  GCG  N G F G +G++GLG G+LS  SQ+    FSY     DS
Sbjct: 190 TEAFTFGDTRIDGVVFGCGLQNVGDFSGVSGVIGLGRGNLSLVSQLQVDRFSYHFAPDDS 249

Query: 65  -DSTSTLEFDSSLPP---NAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKI-D 119
            D+ S + F     P   + ++  LL +    + YY+ L GI V G  L I    F + +
Sbjct: 250 VDTQSFILFGDDATPQTSHTLSTRLLASDANPSLYYVELAGIQVDGKDLAIPSGTFDLRN 309

Query: 120 ESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVAL-FDTCYDFSSRSSVE 178
           + G+GG+ +     VT L+   Y  LR A V     L   +G AL  D CY   S +  +
Sbjct: 310 KDGSGGVFLSITDLVTVLEEAAYKPLRQA-VASKIGLPAVNGSALGLDLCYTGESLAKAK 368

Query: 179 VPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS-SLSIIGNVQQQGTRVSFNL 237
           VP+++  F  G V+ L   NY     + G  C    P+S+   S++G++ Q GT + +++
Sbjct: 369 VPSMALVFAGGAVMELELGNYFYMDSTTGLACLTILPSSAGDGSVLGSLIQVGTHMMYDI 428

Query: 238 RNSLIGF 244
             S + F
Sbjct: 429 NGSKLVF 435


>gi|413953792|gb|AFW86441.1| hypothetical protein ZEAMMB73_342504 [Zea mays]
          Length = 459

 Score =  110 bits (274), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 87/257 (33%), Positives = 131/257 (50%), Gaps = 23/257 (8%)

Query: 1   GDFVTETVTLG-SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ---INASTFS 56
           G +  ET+T+    +V +   GCGH+ +G      GLLGLGG   S   Q   +    FS
Sbjct: 218 GVYSNETLTMAPGVTVKDFHFGCGHDQDGPNDKYDGLLGLGGAPESLVVQTSSVYGGAFS 277

Query: 57  YCLVDRDSDSTSTLEFDSSLPPNA--VTAPLLRNHELDTFYYLGLTGISVGGDLLPISET 114
           YCL   + D    L   + +   +  V  P++R  E  TFY + +TGI+VGG+ + +  +
Sbjct: 278 YCLPAAN-DQAGFLALGAPVNDASGFVFTPMVR--EQQTFYVVNMTGITVGGEPIDVPPS 334

Query: 115 AFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSR 174
           AF      +GG+I+DSGT VT LQ   Y AL+ AF R   A  P       DTCY+F+  
Sbjct: 335 AF------SGGMIIDSGTVVTELQHTAYAALQAAF-RKAMAAYPLLPNGELDTCYNFTGH 387

Query: 175 SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAF--APTSSSLSIIGNVQQQGTR 232
           S+V VP V+  F  G  + L   + ++ +D+    C AF  A   +   I+GNV Q+   
Sbjct: 388 SNVTVPRVALTFSGGATVDLDVPDGIL-LDN----CLAFQEAGPDNQPGILGNVNQRTLE 442

Query: 233 VSFNLRNSLIGFTPNKC 249
           V +++ +  +GF  + C
Sbjct: 443 VLYDVGHGRVGFGADAC 459


>gi|297814776|ref|XP_002875271.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297321109|gb|EFH51530.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 451

 Score =  109 bits (273), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 87/271 (32%), Positives = 126/271 (46%), Gaps = 23/271 (8%)

Query: 1   GDFVTETVTLGS-----ASVDNIAIGCGHNNEGL------FVGAAGLLGLGGGSLSFPSQ 49
           G F  ET +L +     A + ++A GCG    G       F GA G++GLG G +SF SQ
Sbjct: 179 GLFARETTSLKTSSGKEAKLKSVAFGCGFRISGQSVSGTSFNGANGVMGLGRGPISFASQ 238

Query: 50  IN---ASTFSYCLVDRDSDSTSTLEFDSSLPPNAVTA----PLLRNHELDTFYYLGLTGI 102
           +     + FSYCL+D       T         +AV+     PLL N    TFYY+ L  +
Sbjct: 239 LGRRFGNKFSYCLMDYTLSPPPTSYLIIGDGGDAVSKLFFTPLLTNPLSPTFYYVKLKSV 298

Query: 103 SVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGV 162
            V G  L I  + ++ID+SGNGG ++DSGT +  L    Y  +  A  +  +  +  +  
Sbjct: 299 FVNGAKLRIDPSIWEIDDSGNGGTVMDSGTTLAFLADPAYRLVIAAVKQRIKLPNADELT 358

Query: 163 ALFDTCYDFSSRSSVE--VPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSL 220
             FD C + S  +  E  +P + F F  G V   P +NY I  +     C A       +
Sbjct: 359 PGFDLCVNVSGVTKPEKILPRLKFEFSGGAVFVPPPRNYFIETEEQ-IQCLAIQSVDPKV 417

Query: 221 --SIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
             S+IGN+ QQG    F+   S +GF+   C
Sbjct: 418 GFSVIGNLMQQGFLFEFDRDRSRLGFSRRGC 448


>gi|326521034|dbj|BAJ92880.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 448

 Score =  109 bits (273), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 73/207 (35%), Positives = 104/207 (50%), Gaps = 22/207 (10%)

Query: 54  TFSYCLVDRDSDSTSTLEFDSSL-------PPNAVTAPLLRNHELDTFYYLGLTGISVGG 106
           TFSYCL      S  +L F  +L       P    T PLL N    + YY+ +TGI VG 
Sbjct: 250 TFSYCL-----PSFKSLNFSGTLRLGRKGQPLRIKTTPLLVNPHRSSLYYVSMTGIRVGK 304

Query: 107 DLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFD 166
            ++PI   A   D +   G ++DSGT  TRL    Y A+RD   R  R  +P   +  FD
Sbjct: 305 KVVPIPPAALAFDPATGAGTVLDSGTMFTRLVAPAYVAVRDEVRRRIRG-APLSSLGGFD 363

Query: 167 TCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT----SSSLSI 222
           TCY+    ++V+ P V+F F  G  + LPA N +I      T C A A      ++ L++
Sbjct: 364 TCYN----TTVKWPPVTFMF-TGMQVTLPADNLVIHSTYGTTSCLAMAAAPDGVNTVLNV 418

Query: 223 IGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           I ++QQQ  R+ F++ N  +GF   +C
Sbjct: 419 IASMQQQNHRILFDVPNGRVGFAREQC 445


>gi|242092900|ref|XP_002436940.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
 gi|241915163|gb|EER88307.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
          Length = 465

 Score =  109 bits (273), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 91/267 (34%), Positives = 122/267 (45%), Gaps = 38/267 (14%)

Query: 1   GDFVTETVTLGSA-SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ---INASTFS 56
           G +  ET+T     +V +   GCGH+  G      GLLGLGG   S   Q   +    FS
Sbjct: 219 GVYSNETITFAPGITVKDFHFGCGHDQRGPSDKFDGLLGLGGAPESLVVQTASVYGGAFS 278

Query: 57  YCLVD------------RDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISV 104
           YCL              R S +T+T  F        V  P+       T Y + +TGISV
Sbjct: 279 YCLPALNSEAGFLALGVRPSAATNTSAF--------VFTPMWHLPMDATSYMVNMTGISV 330

Query: 105 GGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVAL 164
           GG  L I  +AF+      GG+++DSGT VT L    YNAL +A +R   A  P      
Sbjct: 331 GGKPLDIPRSAFR------GGMLIDSGTIVTELPETAYNAL-NAALRKAFAAYPMVASED 383

Query: 165 FDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSS--LSI 222
           FDTCY+F+  S+V VP V+  F  G  + L   N ++  D     C AF  +     L I
Sbjct: 384 FDTCYNFTGYSNVTVPRVALTFSGGATIDLDVPNGILVKD-----CLAFRESGPDVGLGI 438

Query: 223 IGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           IGNV Q+   V ++  +  +GF    C
Sbjct: 439 IGNVNQRTLEVLYDAGHGKVGFRAGAC 465


>gi|356540371|ref|XP_003538663.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 374

 Score =  109 bits (273), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 88/263 (33%), Positives = 126/263 (47%), Gaps = 19/263 (7%)

Query: 1   GDFVTETVTLGSAS-----VDNIAIGCGHNNEGLFVG-AAGLLGLGGGSLSFPSQINAS- 53
           G    ET+TL S       +  I  GCGHNN G F     G++GLGGG +SF SQI +S 
Sbjct: 113 GVLAQETITLSSTKGESVPLKGIVFGCGHNNTGGFNDREMGIIGLGGGPVSFISQIGSSF 172

Query: 54  ---TFSYCLVDRDSD----STSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGG 106
               FS CLV   +D    S  +L   S +    V +  L   +  T Y++ L GISVG 
Sbjct: 173 GGKRFSQCLVPFHTDVSVSSKMSLGKGSEVSGKGVVSTPLVAKQDKTPYFVTLLGISVGN 232

Query: 107 DLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFD 166
             L  + ++ +  E GN  + +DSGT  T L T+ Y+ L  A VR   A+ P        
Sbjct: 233 TYLHFNGSSSQSVEKGN--VFLDSGTPPTILPTQLYDRLV-AQVRSEVAMKPVTNDLDLG 289

Query: 167 TCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNV 226
               + +++++  P ++ HF  G V  LP + ++ P D  G FC  F  TSS   + GN 
Sbjct: 290 PQLCYRTKNNLRGPVLTAHFEGGDVKLLPTQTFVSPKD--GVFCLGFTNTSSDGGVYGNF 347

Query: 227 QQQGTRVSFNLRNSLIGFTPNKC 249
            Q    + F+L   ++ F P  C
Sbjct: 348 AQSNYLIGFDLDRQVVSFKPMDC 370


>gi|194698750|gb|ACF83459.1| unknown [Zea mays]
 gi|194703964|gb|ACF86066.1| unknown [Zea mays]
 gi|219886221|gb|ACL53485.1| unknown [Zea mays]
 gi|219886359|gb|ACL53554.1| unknown [Zea mays]
 gi|223950085|gb|ACN29126.1| unknown [Zea mays]
 gi|414865218|tpg|DAA43775.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
          Length = 431

 Score =  109 bits (273), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 83/261 (31%), Positives = 120/261 (45%), Gaps = 21/261 (8%)

Query: 5   TETVTLGSASVDNIAIGCGHNNEG--LFVGAAGLLGLGGGSLSFPSQINAS---TFSYCL 59
           ++T+ LG  ++   A GC     G    +   GLLGLG G +S  SQ  +     FSYCL
Sbjct: 175 SDTLRLGKDAIAGYAFGCVGAVAGPTTNLPKQGLLGLGRGPMSLLSQTGSRYNGVFSYCL 234

Query: 60  VDRDSDSTSTLEFDSSL-------PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPIS 112
                 S  +  F  SL       P N    PLL N    + YY+ +TG+SVG   + + 
Sbjct: 235 -----PSYRSYYFSGSLRLGAAGQPRNVRYTPLLTNPHRPSLYYVNVTGLSVGRTWVKVP 289

Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFS 172
             +F  D +   G ++DSGT +TR     Y ALR+ F R   A S    +  FDTC++  
Sbjct: 290 AGSFAFDPATGAGTVIDSGTVITRWTAPVYAALREEFRRQVAAPSGYTSLGAFDTCFNTD 349

Query: 173 SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS----SLSIIGNVQQ 228
             ++   P V+ H   G  L LP +N LI   +    C A A         ++++ N+QQ
Sbjct: 350 EVAAGGAPPVTLHMDGGVDLTLPMENTLIHSSATPLACLAMAEAPQNVNAVVNVVANLQQ 409

Query: 229 QGTRVSFNLRNSLIGFTPNKC 249
           Q  RV  ++  S +GF    C
Sbjct: 410 QNVRVVVDVAGSRVGFAREPC 430


>gi|212722554|ref|NP_001131154.1| uncharacterized protein LOC100192462 precursor [Zea mays]
 gi|194690728|gb|ACF79448.1| unknown [Zea mays]
          Length = 431

 Score =  109 bits (273), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 83/261 (31%), Positives = 120/261 (45%), Gaps = 21/261 (8%)

Query: 5   TETVTLGSASVDNIAIGCGHNNEG--LFVGAAGLLGLGGGSLSFPSQINAS---TFSYCL 59
           ++T+ LG  ++   A GC     G    +   GLLGLG G +S  SQ  +     FSYCL
Sbjct: 175 SDTLRLGKDAIAGYAFGCVGAVAGPTTNLPKQGLLGLGRGPMSLLSQTGSRYNGVFSYCL 234

Query: 60  VDRDSDSTSTLEFDSSL-------PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPIS 112
                 S  +  F  SL       P N    PLL N    + YY+ +TG+SVG   + + 
Sbjct: 235 -----PSYRSYYFSGSLRLGAAGQPRNVRYTPLLTNPHRPSLYYVNVTGLSVGRTWVKVP 289

Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFS 172
             +F  D +   G ++DSGT +TR     Y ALR+ F R   A S    +  FDTC++  
Sbjct: 290 AGSFAFDPATGAGTVIDSGTVITRWTAPVYAALREEFRRQVAAPSGYTSLGAFDTCFNTD 349

Query: 173 SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS----SLSIIGNVQQ 228
             ++   P V+ H   G  L LP +N LI   +    C A A         ++++ N+QQ
Sbjct: 350 EVAAGGAPPVTLHMDGGVDLTLPMENTLIHSSATPLACLAMAEAPQNVNAVVNVVANLQQ 409

Query: 229 QGTRVSFNLRNSLIGFTPNKC 249
           Q  RV  ++  S +GF    C
Sbjct: 410 QNVRVVVDVAGSRVGFAREPC 430


>gi|357140068|ref|XP_003571594.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 533

 Score =  109 bits (273), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 91/260 (35%), Positives = 126/260 (48%), Gaps = 20/260 (7%)

Query: 1   GDFVTETVTLGSAS-VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFS 56
           G    +T+ LG+ + +D    GCG +N GLF G AGL+GLG   LS  SQ  A     FS
Sbjct: 282 GVLAQDTLGLGTTTKLDGFVFGCGLSNRGLFGGTAGLMGLGRTDLSLVSQTAARFGGVFS 341

Query: 57  YCLVDRDSDSTSTLEF---DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISE 113
           YCL    + ST +L      SS  PN     ++ +     FY++ +TG +VGG    ++ 
Sbjct: 342 YCL-PATTTSTGSLSLGPGPSSSFPNMAYTRMIADPTQPPFYFINITGAAVGGGAA-LTA 399

Query: 114 TAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSP-TDGVALFDTCYDFS 172
             F     G G ++VDSGT +TRL    Y A+R  F R  R   P   G ++ D CYD +
Sbjct: 400 PGF-----GAGNVLVDSGTVITRLAPSVYKAVRAEFAR--RFEYPAAPGFSILDACYDLT 452

Query: 173 SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGT-FCFAFA--PTSSSLSIIGNVQQQ 229
            R  V VP ++     G  + + A   L  V  +G+  C A A  P      IIGN QQ+
Sbjct: 453 GRDEVNVPLLTLTLEGGAQVTVDAAGMLFVVRKDGSQVCLAMASLPYEDQTPIIGNYQQR 512

Query: 230 GTRVSFNLRNSLIGFTPNKC 249
             RV ++   S +GF    C
Sbjct: 513 NKRVVYDTVGSRLGFADEDC 532


>gi|413921849|gb|AFW61781.1| hypothetical protein ZEAMMB73_702843 [Zea mays]
          Length = 379

 Score =  108 bits (271), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 70/199 (35%), Positives = 96/199 (48%), Gaps = 14/199 (7%)

Query: 1   GDFVTETVTLGSAS-----VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTF 55
           G    ET T G+A+       NIA GCG  N G    ++G++G G G LS  SQ+  S F
Sbjct: 176 GVLANETFTFGAANSTKVRATNIAFGCGSLNAGDLANSSGMVGFGRGPLSLVSQLGPSRF 235

Query: 56  SYCLVDRDSDSTSTLEF---------DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGG 106
           SYCL    S + S L F         ++S      + P + N  L   Y+L L  IS+G 
Sbjct: 236 SYCLTSYLSATPSRLYFGVYANLSSTNTSSGSPVQSTPFVINPALPNMYFLSLKAISLGT 295

Query: 107 DLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFD 166
            LLPI    F I++ G GG+I+DSGT++T LQ + Y A+R   V      +  D     D
Sbjct: 296 KLLPIDPLVFAINDDGTGGVIIDSGTSITWLQQDAYEAVRRGLVSAIPLTAMNDTDIGLD 355

Query: 167 TCYDFSSRSSVEVPTVSFH 185
           TC+ +    +V V    F 
Sbjct: 356 TCFQWPPPPNVTVTVPDFR 374


>gi|118482048|gb|ABK92955.1| unknown [Populus trichocarpa]
          Length = 425

 Score =  108 bits (271), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 96/264 (36%), Positives = 125/264 (47%), Gaps = 29/264 (10%)

Query: 2   DFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ---INASTFSYC 58
           +   +T+ L +  V     GC     G  V   GLLGLG G LSF SQ   +  STFSYC
Sbjct: 174 NLTRDTIALSTDIVPGYTFGCIQKTTGSSVPPQGLLGLGRGPLSFLSQTQDLYKSTFSYC 233

Query: 59  LVDRDSDSTSTLEFDSSL-------PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPI 111
           L      S  TL F  +L       P    T PLL+N    + YY+ L GI VG  ++ I
Sbjct: 234 L-----PSFRTLNFSGTLRLGPAGQPLRIKTTPLLKNPRRSSLYYVNLIGIRVGRKIVDI 288

Query: 112 SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR--GTRALSPTDGVALFDTCY 169
             +A   + +   G I DSGT  TRL    Y A+RD F +  G   +S   G   FDTCY
Sbjct: 289 PASALAFNPTTGAGTIFDSGTVFTRLVAPVYTAVRDEFRKRVGNAIVSSLGG---FDTCY 345

Query: 170 DFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP----TSSSLSIIGN 225
                  +  PT++F F  G  + LP  N LI   +  T C A A      +S L++I N
Sbjct: 346 T----GPIVAPTMTFMF-SGMNVTLPTDNLLIRSTAGSTSCLAMAAAPDNVNSVLNVIAN 400

Query: 226 VQQQGTRVSFNLRNSLIGFTPNKC 249
           +QQQ  R+ F++ NS IG     C
Sbjct: 401 MQQQNHRILFDVPNSRIGVAREPC 424


>gi|224057272|ref|XP_002299201.1| predicted protein [Populus trichocarpa]
 gi|118483775|gb|ABK93780.1| unknown [Populus trichocarpa]
 gi|222846459|gb|EEE84006.1| predicted protein [Populus trichocarpa]
          Length = 425

 Score =  108 bits (271), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 96/264 (36%), Positives = 125/264 (47%), Gaps = 29/264 (10%)

Query: 2   DFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ---INASTFSYC 58
           +   +T+ L +  V     GC     G  V   GLLGLG G LSF SQ   +  STFSYC
Sbjct: 174 NLTRDTIALSTDIVPGYTFGCIQKTTGSSVPPQGLLGLGRGPLSFLSQTQDLYKSTFSYC 233

Query: 59  LVDRDSDSTSTLEFDSSL-------PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPI 111
           L      S  TL F  +L       P    T PLL+N    + YY+ L GI VG  ++ I
Sbjct: 234 L-----PSFRTLNFSGTLRLGPAGQPLRIKTTPLLKNPRRSSLYYVNLIGIRVGRKIVDI 288

Query: 112 SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR--GTRALSPTDGVALFDTCY 169
             +A   + +   G I DSGT  TRL    Y A+RD F +  G   +S   G   FDTCY
Sbjct: 289 PASALAFNPTTGAGTIFDSGTVFTRLVAPVYTAVRDEFRKRVGNAIVSSLGG---FDTCY 345

Query: 170 DFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP----TSSSLSIIGN 225
                  +  PT++F F  G  + LP  N LI   +  T C A A      +S L++I N
Sbjct: 346 T----GPIVAPTMTFMF-SGMNVTLPPDNLLIRSTAGSTSCLAMAAAPDNVNSVLNVIAN 400

Query: 226 VQQQGTRVSFNLRNSLIGFTPNKC 249
           +QQQ  R+ F++ NS IG     C
Sbjct: 401 MQQQNHRILFDVPNSRIGVAREPC 424


>gi|356546370|ref|XP_003541599.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 434

 Score =  108 bits (271), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 90/267 (33%), Positives = 135/267 (50%), Gaps = 29/267 (10%)

Query: 1   GDFVTETVTLGSASVDNI-----AIGCGHNNEGLFVG-AAGLLGLGGGSLSFPSQINAST 54
           GD   ET+TLGS +  ++      IGCG NN   F G ++G++GLG G +S  +Q+   +
Sbjct: 176 GDLSVETLTLGSTNGSSVKFRRTVIGCGRNNTVSFEGKSSGIVGLGNGPVSLINQLRRRS 235

Query: 55  ------FSYCLVDRDSDSTSTLEF-DSSLPPN--AVTAPLLRNHELDTFYYLGLTGISVG 105
                 FSYCL    S+ +S L F D+++      V+ P++  H+   FYYL L   SVG
Sbjct: 236 SSIGRKFSYCLASM-SNISSKLNFGDAAVVSGDGTVSTPIV-THDPKVFYYLTLEAFSVG 293

Query: 106 GDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDA---FVRGTRALSPTDGV 162
            + +  + ++F+  E GN  II+DSGT +T L  + Y+ L  A    V   R   P   +
Sbjct: 294 NNRIEFTSSSFRFGEKGN--IIIDSGTTLTLLPNDIYSKLESAVADLVELDRVKDPLKQL 351

Query: 163 ALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSI 222
           +L   CY  S+   +  P +  HF  G  + L A N  I V+  G  C AF  +S    I
Sbjct: 352 SL---CYR-STFDELNAPVIMAHF-SGADVKLNAVNTFIEVE-QGVTCLAFI-SSKIGPI 404

Query: 223 IGNVQQQGTRVSFNLRNSLIGFTPNKC 249
            GN+ QQ   V ++L+  ++ F P  C
Sbjct: 405 FGNMAQQNFLVGYDLQKKIVSFKPTDC 431


>gi|359806832|ref|NP_001241567.1| uncharacterized protein LOC100819698 precursor [Glycine max]
 gi|255638149|gb|ACU19388.1| unknown [Glycine max]
          Length = 437

 Score =  108 bits (270), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 95/265 (35%), Positives = 123/265 (46%), Gaps = 26/265 (9%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ---INASTFSY 57
              V +TVTL +  V   A GC     G  V   GLLGLG G LS  +Q   +  STFSY
Sbjct: 182 ASLVQDTVTLATDPVPAYAFGCIQKVTGSSVPPQGLLGLGRGPLSLLAQTQKLYQSTFSY 241

Query: 58  CLVDRDSDSTSTLEFDSSL-------PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP 110
           CL      S  TL F  SL       P      PLL+N    + YY+ L  I VG  ++ 
Sbjct: 242 CL-----PSFKTLNFSGSLRLGPVAQPKRIKFTPLLKNPRRSSLYYVNLVAIRVGRRIVD 296

Query: 111 ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVAL--FDTC 168
           I   A   + +   G + DSGT  TRL    YNA+R+ F R           +L  FDTC
Sbjct: 297 IPPEALAFNANTGAGTVFDSGTVFTRLVEPAYNAVRNEFRRRIAVHKKLTVTSLGGFDTC 356

Query: 169 YDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP----TSSSLSIIG 224
           Y     + +  PT++F F  G  + LP  N LI   +    C A AP     +S L++I 
Sbjct: 357 YT----APIVAPTITFMF-SGMNVTLPPDNILIHSTAGSVTCLAMAPAPDNVNSVLNVIA 411

Query: 225 NVQQQGTRVSFNLRNSLIGFTPNKC 249
           N+QQQ  RV F++ NS +G     C
Sbjct: 412 NMQQQNHRVLFDVPNSRLGVARELC 436


>gi|297605079|ref|NP_001056639.2| Os06g0121800 [Oryza sativa Japonica Group]
 gi|255676668|dbj|BAF18553.2| Os06g0121800 [Oryza sativa Japonica Group]
          Length = 487

 Score =  108 bits (270), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 93/261 (35%), Positives = 121/261 (46%), Gaps = 25/261 (9%)

Query: 1   GDFVTETVTLG-SASVDNIAIGCGHNNEGLFVGA-AGLLGLGGGSLSFPSQINAS---TF 55
           G ++ + +TL  S  V N   GC H   G F  + +G + LGGG  S  SQ  A+    F
Sbjct: 240 GTYMVDALTLNPSTVVMNFRFGCSHAVRGNFSASTSGTMSLGGGRQSLLSQTAATFGNAF 299

Query: 56  SYCLVDRDSDSTSTLEFDSSLPPNAVTA--PLLRNHEL-DTFYYLGLTGISVGGDLLPIS 112
           SYC+ D  S    +L   +        A  PL+RN  +  T Y + L GI VGG  L + 
Sbjct: 300 SYCVPDPSSSGFLSLGGPADGGGAGRFARTPLVRNPSIIPTLYLVRLRGIEVGGRRLNVP 359

Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSP--TDGVALFDTCYD 170
              F       GG ++DS   +T+L    Y ALR AF R   A  P    G A  DTCYD
Sbjct: 360 PVVFA------GGAVMDSSVIITQLPPTAYRALRLAF-RSAMAAYPRVAGGRAGLDTCYD 412

Query: 171 FSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS--SLSIIGNVQQ 228
           F   +SV VP VS  F  G V+ L A   ++        C AF PT    +L  IGNVQQ
Sbjct: 413 FVRFTSVTVPAVSLVFDGGAVVRLDAMGVMV------EGCLAFVPTPGDFALGFIGNVQQ 466

Query: 229 QGTRVSFNLRNSLIGFTPNKC 249
           Q   V +++    +GF    C
Sbjct: 467 QTHEVLYDVGGGSVGFRRGAC 487


>gi|413938615|gb|AFW73166.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
          Length = 386

 Score =  108 bits (270), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 98/264 (37%), Positives = 134/264 (50%), Gaps = 33/264 (12%)

Query: 1   GDFVTETVTL-GSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFS 56
           G + ++T+TL  S++V     GCGH   GLF G  GLLGLG    S   Q   +    FS
Sbjct: 141 GVYSSDTLTLSASSAVQGFFFGCGHAQSGLFNGVDGLLGLGREQPSLVEQTAGTYGGVFS 200

Query: 57  YCLVDRDSDS---TSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISE 113
           YCL  + S +   T  +   S   P   T  LL +    T+Y + LTGISVGG  L +  
Sbjct: 201 YCLPTKPSTAGYLTLGVGGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPA 260

Query: 114 TAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRAL----SPTDGVALFDTCY 169
           +AF           VD+GT VTRL    Y ALR AF  G  +     +P++G+   DTCY
Sbjct: 261 SAFAGGTV------VDTGTVVTRLPPTAYAALRSAFRSGMASYGYPTAPSNGI--LDTCY 312

Query: 170 DFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS--SLSIIGNVQ 227
           +F+   +V +P V+  F  G  + L A   L    S G  C AFAP+ S   ++I+GNVQ
Sbjct: 313 NFAGYGTVTLPNVALTFGSGATVTLGADGIL----SFG--CLAFAPSGSDGGMAILGNVQ 366

Query: 228 QQGTRVSFNLR--NSLIGFTPNKC 249
           Q+    SF +R   + +GF P+ C
Sbjct: 367 QR----SFEVRIDGTSVGFKPSSC 386


>gi|300078619|gb|ADJ67210.1| aspartic proteinase nepenthesin-1 precursor [Jatropha curcas]
          Length = 84

 Score =  108 bits (269), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 51/84 (60%), Positives = 63/84 (75%), Gaps = 1/84 (1%)

Query: 166 DTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGN 225
           DTC+D S ++ V+VPTV+ HF  G  + LPA NYLIPVDS+G+FCFAFA T S LSIIGN
Sbjct: 1   DTCFDLSGKTEVKVPTVALHF-RGADVSLPASNYLIPVDSDGSFCFAFAGTMSGLSIIGN 59

Query: 226 VQQQGTRVSFNLRNSLIGFTPNKC 249
           +QQQG RV ++L  S +GF P  C
Sbjct: 60  IQQQGFRVVYDLAGSRVGFAPRGC 83


>gi|218197468|gb|EEC79895.1| hypothetical protein OsI_21423 [Oryza sativa Indica Group]
          Length = 471

 Score =  108 bits (269), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 93/261 (35%), Positives = 121/261 (46%), Gaps = 25/261 (9%)

Query: 1   GDFVTETVTLG-SASVDNIAIGCGHNNEGLFVGA-AGLLGLGGGSLSFPSQINAS---TF 55
           G ++ + +TL  S  V N   GC H   G F  + +G + LGGG  S  SQ  A+    F
Sbjct: 224 GTYMVDALTLNPSTVVMNFRFGCSHAVRGNFSASTSGTMSLGGGRQSLLSQTAATFGNAF 283

Query: 56  SYCLVDRDSDSTSTLEFDSSLPPNAVTA--PLLRNHEL-DTFYYLGLTGISVGGDLLPIS 112
           SYC+ D  S    +L   +        A  PL+RN  +  T Y + L GI VGG  L + 
Sbjct: 284 SYCVPDPSSSGFLSLGGPADGGGAGRFARTPLVRNPSIIPTLYLVRLRGIEVGGRRLNVP 343

Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSP--TDGVALFDTCYD 170
              F       GG ++DS   +T+L    Y ALR AF R   A  P    G A  DTCYD
Sbjct: 344 PVVFA------GGAVMDSSVIITQLPPTAYRALRLAF-RSAMAAYPRVAGGRAGLDTCYD 396

Query: 171 FSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS--SLSIIGNVQQ 228
           F   +SV VP VS  F  G V+ L A   ++        C AF PT    +L  IGNVQQ
Sbjct: 397 FVRFTSVTVPAVSLVFDGGAVVRLDAMGVMV------EGCLAFVPTPGDFALGFIGNVQQ 450

Query: 229 QGTRVSFNLRNSLIGFTPNKC 249
           Q   V +++    +GF    C
Sbjct: 451 QTHEVLYDVGGGSVGFRRGAC 471


>gi|242045118|ref|XP_002460430.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
 gi|241923807|gb|EER96951.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
          Length = 488

 Score =  108 bits (269), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 91/262 (34%), Positives = 124/262 (47%), Gaps = 20/262 (7%)

Query: 1   GDFVTETVTLGSA-------SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINA- 52
           GD   +T+TL  +       +V     GCGH+N G F    GLLGLG G  S PSQ+ A 
Sbjct: 233 GDLARDTLTLSPSPSPSPADTVPGFVFGCGHSNAGTFGEVDGLLGLGLGKASLPSQVAAR 292

Query: 53  --STFSYCLVDRDSDSTSTLEF-DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLL 109
             + FSYCL    S +   L F  ++   NA    ++   +  T YYL LTGI V G  +
Sbjct: 293 YGAAFSYCLPSSPS-AAGYLSFGGAAARANAQFTEMVTGQD-PTSYYLNLTGIVVAGRAI 350

Query: 110 PISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAF--VRGTRALSPTDGVALFDT 167
            +  +AF        G I+DSGTA +RL    Y ALR +F    G           +FDT
Sbjct: 351 KVPASAFAT----AAGTIIDSGTAFSRLPPSAYAALRSSFRSAMGRYRYKRAPSSPIFDT 406

Query: 168 CYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQ 227
           CYDF+   +V +P V   F +G  + L     L   +     C AF P +  L I+GN Q
Sbjct: 407 CYDFTGHETVRIPAVELVFADGATVHLHPSGVLYTWNDVAQTCLAFVP-NHDLGILGNTQ 465

Query: 228 QQGTRVSFNLRNSLIGFTPNKC 249
           Q+   V +++ +  IGF    C
Sbjct: 466 QRTLAVIYDVGSQRIGFGRKGC 487


>gi|413938618|gb|AFW73169.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
          Length = 324

 Score =  108 bits (269), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 98/264 (37%), Positives = 134/264 (50%), Gaps = 33/264 (12%)

Query: 1   GDFVTETVTL-GSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFS 56
           G + ++T+TL  S++V     GCGH   GLF G  GLLGLG    S   Q   +    FS
Sbjct: 79  GVYSSDTLTLSASSAVQGFFFGCGHAQSGLFNGVDGLLGLGREQPSLVEQTAGTYGGVFS 138

Query: 57  YCLVDRDSDS---TSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISE 113
           YCL  + S +   T  +   S   P   T  LL +    T+Y + LTGISVGG  L +  
Sbjct: 139 YCLPTKPSTAGYLTLGVGGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPA 198

Query: 114 TAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRAL----SPTDGVALFDTCY 169
           +AF           VD+GT VTRL    Y ALR AF  G  +     +P++G+   DTCY
Sbjct: 199 SAFAGGTV------VDTGTVVTRLPPTAYAALRSAFRSGMASYGYPTAPSNGI--LDTCY 250

Query: 170 DFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS--SLSIIGNVQ 227
           +F+   +V +P V+  F  G  + L A   L    S G  C AFAP+ S   ++I+GNVQ
Sbjct: 251 NFAGYGTVTLPNVALTFGSGATVTLGADGIL----SFG--CLAFAPSGSDGGMAILGNVQ 304

Query: 228 QQGTRVSFNLR--NSLIGFTPNKC 249
           Q+    SF +R   + +GF P+ C
Sbjct: 305 QR----SFEVRIDGTSVGFKPSSC 324


>gi|413938607|gb|AFW73158.1| aspartic proteinase nepenthesin-2 [Zea mays]
          Length = 478

 Score =  108 bits (269), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 97/265 (36%), Positives = 134/265 (50%), Gaps = 35/265 (13%)

Query: 1   GDFVTETVTL-GSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFS 56
           G + ++T+TL  S++V     GCGH   GLF G  GLLGLG    S   Q   +    FS
Sbjct: 233 GVYSSDTLTLSASSAVQGFFFGCGHAQSGLFNGVDGLLGLGREQPSLVEQTAGTYGGVFS 292

Query: 57  YCLVDRDSDS---TSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISE 113
           YCL  + S +   T  +   S   P   T  LL +    T+Y + LTGISVGG  L +  
Sbjct: 293 YCLPTKPSTAGYLTLGVGGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPA 352

Query: 114 TAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRAL----SPTDGVALFDTCY 169
           +AF           VD+GT VTRL    Y ALR AF  G  +     +P++G+   DTCY
Sbjct: 353 SAFAGGTV------VDTGTVVTRLPPTAYAALRSAFRSGMASYGYPTAPSNGI--LDTCY 404

Query: 170 DFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTF-CFAFAPTSS--SLSIIGNV 226
           +F+   +V +P V+  F  G  + L A   L       +F C AFAP+ S   ++I+GNV
Sbjct: 405 NFAGYGTVTLPNVALTFGSGATVTLGADGIL-------SFGCLAFAPSGSDGGMAILGNV 457

Query: 227 QQQGTRVSFNLR--NSLIGFTPNKC 249
           QQ+    SF +R   + +GF P+ C
Sbjct: 458 QQR----SFEVRIDGTSVGFKPSSC 478


>gi|226509408|ref|NP_001141440.1| uncharacterized protein LOC100273550 precursor [Zea mays]
 gi|194704586|gb|ACF86377.1| unknown [Zea mays]
 gi|413938617|gb|AFW73168.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
          Length = 478

 Score =  108 bits (269), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 97/265 (36%), Positives = 134/265 (50%), Gaps = 35/265 (13%)

Query: 1   GDFVTETVTL-GSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFS 56
           G + ++T+TL  S++V     GCGH   GLF G  GLLGLG    S   Q   +    FS
Sbjct: 233 GVYSSDTLTLSASSAVQGFFFGCGHAQSGLFNGVDGLLGLGREQPSLVEQTAGTYGGVFS 292

Query: 57  YCLVDRDSDS---TSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISE 113
           YCL  + S +   T  +   S   P   T  LL +    T+Y + LTGISVGG  L +  
Sbjct: 293 YCLPTKPSTAGYLTLGVGGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPA 352

Query: 114 TAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRAL----SPTDGVALFDTCY 169
           +AF           VD+GT VTRL    Y ALR AF  G  +     +P++G+   DTCY
Sbjct: 353 SAFAGGTV------VDTGTVVTRLPPTAYAALRSAFRSGMASYGYPTAPSNGI--LDTCY 404

Query: 170 DFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTF-CFAFAPTSS--SLSIIGNV 226
           +F+   +V +P V+  F  G  + L A   L       +F C AFAP+ S   ++I+GNV
Sbjct: 405 NFAGYGTVTLPNVALTFGSGATVTLGADGIL-------SFGCLAFAPSGSDGGMAILGNV 457

Query: 227 QQQGTRVSFNLR--NSLIGFTPNKC 249
           QQ+    SF +R   + +GF P+ C
Sbjct: 458 QQR----SFEVRIDGTSVGFKPSSC 478


>gi|388495448|gb|AFK35790.1| unknown [Medicago truncatula]
          Length = 434

 Score =  107 bits (268), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 92/260 (35%), Positives = 122/260 (46%), Gaps = 24/260 (9%)

Query: 4   VTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ---INASTFSYCLV 60
           V +T+TL +  +     GC +   G      GLLGLG G LS  SQ   +  STFSYCL 
Sbjct: 184 VQDTLTLAADPIPGYTFGCVNKTTGSSAPQQGLLGLGRGPLSLLSQSQNLYKSTFSYCL- 242

Query: 61  DRDSDSTSTLEFDSSL-------PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISE 113
                S  ++ F  SL       P      PLLRN    + YY+ L  I VG  ++ I  
Sbjct: 243 ----PSFKSINFSGSLRLGPVYQPKRIKYTPLLRNPRRSSLYYVNLVAIKVGRKIVDIPP 298

Query: 114 TAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSS 173
            A   + +   G I DSGT  TRL    Y A+R+ F R      P   +  FDTCY+   
Sbjct: 299 AALAFNPTTGAGTIFDSGTVFTRLAEPVYTAVRNEFRRRVGPKLPVTTLGGFDTCYNV-- 356

Query: 174 RSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFA----PTSSSLSIIGNVQQQ 229
              + VPT++F F  G  + LP  N +I   +  T C A A      +S L++I N+QQQ
Sbjct: 357 --PIVVPTITFLF-SGMNVALPPDNIVIHSTAGSTTCLAMAGAPDNVNSVLNVIANMQQQ 413

Query: 230 GTRVSFNLRNSLIGFTPNKC 249
             RV F++ NS IG     C
Sbjct: 414 NHRVLFDVPNSRIGIARELC 433


>gi|300078594|gb|ADJ67200.1| aspartic proteinase nepenthesin-1 precursor [Jatropha curcas]
          Length = 84

 Score =  107 bits (268), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 51/84 (60%), Positives = 63/84 (75%), Gaps = 1/84 (1%)

Query: 166 DTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGN 225
           DTC+D S ++ V+VPTV+ HF  G  + LPA NYLIPVDS+G+FCFAFA T S LSIIGN
Sbjct: 1   DTCFDLSGKTEVKVPTVALHF-RGVDVSLPASNYLIPVDSDGSFCFAFAGTMSGLSIIGN 59

Query: 226 VQQQGTRVSFNLRNSLIGFTPNKC 249
           +QQQG RV ++L  S +GF P  C
Sbjct: 60  IQQQGFRVVYDLAGSRVGFAPRGC 83


>gi|225464832|ref|XP_002272243.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 467

 Score =  107 bits (268), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 79/273 (28%), Positives = 120/273 (43%), Gaps = 27/273 (9%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
           G  ++ET+ L    V N  +GC   +       AG+ G G G  S PSQ+    FSYCL+
Sbjct: 197 GIMLSETLDLPGKGVPNFIVGCSVLSTSQ---PAGISGFGRGPPSLPSQLGLKKFSYCLL 253

Query: 61  DRDSDST---STLEFDSSLPPNAVTA-----PLLRN------HELDTFYYLGLTGISVGG 106
            R  D T   S+L  D        TA     P ++N      H    +YYLGL  I+VGG
Sbjct: 254 SRRYDDTTESSSLVLDGESDSGEKTAGLSYTPFVQNPKVAGKHAFSVYYYLGLRHITVGG 313

Query: 107 DLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPT--DGVAL 164
             + I          G+GG I+DSGT  T ++ E +  +   F +  ++   T  +G+  
Sbjct: 314 KHVKIPYKYLIPGADGDGGTIIDSGTTFTYMKGEIFELVAAEFEKQVQSKRATEVEGITG 373

Query: 165 FDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLS--- 221
              C++ S  ++   P ++  F  G  + LP  NY+  +  +   C       ++     
Sbjct: 374 LRPCFNISGLNTPSFPELTLKFRGGAEMELPLANYVAFLGGDDVVCLTIVTDGAAGKEFS 433

Query: 222 -----IIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
                I+GN QQQ   V ++LRN  +GF    C
Sbjct: 434 GGPAIILGNFQQQNFYVEYDLRNERLGFRQQSC 466


>gi|225427558|ref|XP_002266614.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
           vinifera]
          Length = 444

 Score =  107 bits (268), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 91/267 (34%), Positives = 130/267 (48%), Gaps = 25/267 (9%)

Query: 1   GDFVTETVTLGS-----ASVDNIAIGCGHNNEGLF-----VGAAGLLGLGGGSLSFPSQI 50
           GD  ++T+T+GS     AS   IA GCGH+N G F            G     +   S++
Sbjct: 183 GDLSSDTLTIGSTEGDPASFPGIAFGCGHDNGGTFNEKDGGLIGLGGGPLSLVMQLSSEV 242

Query: 51  NASTFSYCLVDRDSDST--STLEFDSS---LPPNAVTAPLLRNHELDTFYYLGLTGISVG 105
               FSYCLV   SDST  S + F  S        V+ PL++    DTFYYL L G+SVG
Sbjct: 243 GGQ-FSYCLVPLSSDSTVSSKINFGKSGVVSGSGTVSTPLIKGTP-DTFYYLTLEGLSVG 300

Query: 106 GDLLPI---SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGV 162
            + +     SE          G II+DSGT +T L  + Y  +  A        + TD  
Sbjct: 301 SETVAFKGFSENKSSPAAVEEGNIIIDSGTTLTLLPQDFYTDVESALTNAIGGQTTTDPN 360

Query: 163 ALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSI 222
            +F  CY  SS +++E+PT++ HF  G  + LP  N  + V  +   CF+  P SS+L+I
Sbjct: 361 GIFSLCY--SSVNNLEIPTITAHF-TGADVQLPPLNTFVQVQED-LVCFSMIP-SSNLAI 415

Query: 223 IGNVQQQGTRVSFNLRNSLIGFTPNKC 249
            GN+ Q    V ++L+N+ + F    C
Sbjct: 416 FGNLAQINFLVGYDLKNNKVSFKQTDC 442


>gi|357515189|ref|XP_003627883.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355521905|gb|AET02359.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 434

 Score =  107 bits (268), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 92/260 (35%), Positives = 122/260 (46%), Gaps = 24/260 (9%)

Query: 4   VTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ---INASTFSYCLV 60
           V +T+TL +  +     GC +   G      GLLGLG G LS  SQ   +  STFSYCL 
Sbjct: 184 VQDTLTLATDPIPGYTFGCVNKTTGSSAPQQGLLGLGRGPLSLLSQSQNLYKSTFSYCL- 242

Query: 61  DRDSDSTSTLEFDSSL-------PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISE 113
                S  ++ F  SL       P      PLLRN    + YY+ L  I VG  ++ I  
Sbjct: 243 ----PSFKSINFSGSLRLGPVYQPKRIKYTPLLRNPRRSSLYYVNLVAIKVGRKIVDIPP 298

Query: 114 TAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSS 173
            A   + +   G I DSGT  TRL    Y A+R+ F R      P   +  FDTCY+   
Sbjct: 299 AALAFNPTTGAGTIFDSGTVFTRLAEPVYTAVRNEFRRRVGPKLPVTTLGGFDTCYNV-- 356

Query: 174 RSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFA----PTSSSLSIIGNVQQQ 229
              + VPT++F F  G  + LP  N +I   +  T C A A      +S L++I N+QQQ
Sbjct: 357 --PIVVPTITFLF-SGMNVTLPPDNIVIHSTAGSTTCLAMAGAPDNVNSVLNVIANMQQQ 413

Query: 230 GTRVSFNLRNSLIGFTPNKC 249
             RV F++ NS IG     C
Sbjct: 414 NHRVLFDVPNSRIGIARELC 433


>gi|356555042|ref|XP_003545848.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 431

 Score =  107 bits (268), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 88/259 (33%), Positives = 129/259 (49%), Gaps = 18/259 (6%)

Query: 1   GDFVTETVTLGS-----ASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS-- 53
           GD + ETVTLGS            IGC  N    F  + G++GLGGG +S   Q+++S  
Sbjct: 178 GDLIVETVTLGSYNDPFVHFPRTVIGCIRNTNVSF-DSIGIVGLGGGPVSLVPQLSSSIS 236

Query: 54  -TFSYCLVDRDSDSTSTLEF-DSSLPP-NAVTAPLLRNHELDTFYYLGLTGISVGGDLLP 110
             FSYCL    SD +S L+F D+++   +   +  +   +   FYYL L   SVG + + 
Sbjct: 237 KKFSYCLAPI-SDRSSKLKFGDAAMVSGDGTVSTRIVFKDWKKFYYLTLEAFSVGNNRIE 295

Query: 111 ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYD 170
              ++ +   SG G II+DSGT  T L  + Y+ L  A     +     D +  F  CY 
Sbjct: 296 FRSSSSR--SSGKGNIIIDSGTTFTVLPDDVYSKLESAVADVVKLERAEDPLKQFSLCYK 353

Query: 171 FSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQG 230
            S+   V+VP ++ HF  G  + L A N  I V S+   C AF  +S S +I GN+ QQ 
Sbjct: 354 -STYDKVDVPVITAHF-SGADVKLNALNTFI-VASHRVVCLAFL-SSQSGAIFGNLAQQN 409

Query: 231 TRVSFNLRNSLIGFTPNKC 249
             V ++L+  ++ F P  C
Sbjct: 410 FLVGYDLQRKIVSFKPTDC 428


>gi|357486591|ref|XP_003613583.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355514918|gb|AES96541.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 437

 Score =  107 bits (267), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 94/267 (35%), Positives = 134/267 (50%), Gaps = 29/267 (10%)

Query: 1   GDFVTETVTLGS-----ASVDNIAIGCGHNNEGLFVG-AAGLLGLGGGSLSFPSQINAS- 53
           GD   +T+TL S      S  NI IGCGH N+G   G  +G +GLG G LSF SQ+N+S 
Sbjct: 179 GDLSIDTLTLNSNNDTPISFKNIVIGCGHRNKGPLEGYVSGNIGLGRGPLSFISQLNSSI 238

Query: 54  --TFSYCLVDRDSDS--TSTLEF-DSSLPPNA--VTAPLLRNHELDTFYYLGLTGISVGG 106
              FSYCLV   S+   +  L F D S+      V+ P+      +  Y   L  +SVG 
Sbjct: 239 GGKFSYCLVPLFSNEGISGKLHFGDKSVVSGVGTVSTPITAG---EIGYSTTLNALSVGD 295

Query: 107 DLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRD---AFVRGTRALSPTDGVA 163
            ++    +  K D  GN   I+DSGT +T L    Y+ L     + V+  RA SP     
Sbjct: 296 HIIKFENSTSKNDNLGN--TIIDSGTTLTILPENVYSRLESIVTSMVKLERAKSPNQQ-- 351

Query: 164 LFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS-SLSI 222
            F  CY  ++  +++VP ++ HF  G  + L + N   P+D +   CFAF    +   +I
Sbjct: 352 -FKLCYK-ATLKNLDVPIITAHF-NGADVHLNSLNTFYPID-HEVVCFAFVSVGNFPGTI 407

Query: 223 IGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           IGN+ QQ   V F+L+ ++I F P  C
Sbjct: 408 IGNIAQQNFLVGFDLQKNIISFKPTDC 434


>gi|297805038|ref|XP_002870403.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297316239|gb|EFH46662.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 440

 Score =  107 bits (267), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 97/265 (36%), Positives = 137/265 (51%), Gaps = 27/265 (10%)

Query: 1   GDFVTETVTLGS-----ASVDNIAIGCGHNNEGLF-VGAAGLLGLGGGSLSFPSQINAS- 53
           G+   +T+TLGS       + NI IGCGHNN G F    +G++GLGGG++S  +Q+  S 
Sbjct: 184 GNIAVDTLTLGSTDTRPVQLKNIIIGCGHNNAGTFNKKGSGIVGLGGGAVSLITQLGDSI 243

Query: 54  --TFSYCLV--DRDSDSTSTLEFDSSLPPN---AVTAPLLRNHELDTFYYLGLTGISVGG 106
              FSYCLV    ++D TS + F ++   +    V+ PL+   + +TFYYL L  ISVG 
Sbjct: 244 DGKFSYCLVPLTSENDRTSKINFGTNAVVSGTGVVSTPLIAKSQ-ETFYYLTLKSISVGS 302

Query: 107 DLL--PISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVAL 164
             +  P S++      SG G II+DSGT +T L TE Y+ L DA      A    D    
Sbjct: 303 KEVQYPGSDSG-----SGEGNIIIDSGTTLTLLPTEFYSELEDAVASSIDAEKKQDPQTG 357

Query: 165 FDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIG 224
              CY  S+   ++VP ++ HF +G  + L   N  + + S    CFAF   S S SI G
Sbjct: 358 LSLCY--SATGDLKVPAITMHF-DGADVNLKPSNCFVQI-SEDLVCFAFR-GSPSFSIYG 412

Query: 225 NVQQQGTRVSFNLRNSLIGFTPNKC 249
           NV Q    V ++  +  + F P  C
Sbjct: 413 NVAQMNFLVGYDTVSKTVSFKPTDC 437


>gi|326513755|dbj|BAJ87896.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 442

 Score =  107 bits (267), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 78/268 (29%), Positives = 123/268 (45%), Gaps = 27/268 (10%)

Query: 1   GDFVTETVTL------GSASVDNIAIGCGHNNEGLFVGA-AGLLGLGGGSLSFPSQINAS 53
           G F+ ++ T       G  +V +I  GCG  N G F+    G+ G G G LS PSQ+   
Sbjct: 180 GHFLRDSFTFDDGKGGGKVTVPDIGFGCGMYNAGRFLQTETGIAGFGRGPLSLPSQLKVR 239

Query: 54  TFSYCLVDRDSDSTSTL------EFDSSLPPNAVTAPLLRNHELDT---FYYLGLTGISV 104
            FSYC   R    +S +      +  +      ++ P +R+    T    Y L   G++V
Sbjct: 240 QFSYCFTTRFEAKSSPVFLGGAGDLKAHATGPILSTPFVRSLPPGTDNSHYVLSFKGVTV 299

Query: 105 GGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVA- 163
           G   LP+ E    I   G+G   +DSGT +T      +  L+ AF+   +A  P +  A 
Sbjct: 300 GKTRLPVPE----IKADGSGATFIDSGTDITTFPDAVFRQLKSAFI--AQAALPVNKTAD 353

Query: 164 LFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSL--S 221
             D C+ +  + +  +P + FH  EG    LP +NY+     +G  C A + TS  +  +
Sbjct: 354 EDDICFSWDGKKTAAMPKLVFHL-EGADWDLPRENYVTEDRESGQVCVAVS-TSGQMDRT 411

Query: 222 IIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           +IGN QQQ T + ++L    +   P +C
Sbjct: 412 LIGNFQQQNTHIVYDLAAGKLLLVPAQC 439


>gi|225440720|ref|XP_002275202.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 479

 Score =  107 bits (267), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 74/270 (27%), Positives = 116/270 (42%), Gaps = 25/270 (9%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
           GDF+ E +     ++    +GC  +  G  V +A L G G    S P Q+    F+YCL 
Sbjct: 195 GDFLLENLNFPGKTIHEFLVGCTTSAVGE-VTSAALAGFGRSMFSLPMQMGVKKFAYCLN 253

Query: 61  DRDSDSTST-----LEFDSSLPPNAVTAPLLRNH-ELDTFYYLGLTGISVGGDLLPISET 114
             D D T       L++          AP L+N  +   +YYLG+  I +G  LL I   
Sbjct: 254 SHDYDDTRNSSKLILDYSDGETKGLSYAPFLKNPPDFPIYYYLGVKDIKIGNKLLRIPSK 313

Query: 115 AFKIDESGNGGIIVDSGTAVTRLQTETY----NALRDAFVRGTRALSPTDGVALFDTCYD 170
                  G GG+++DSG A   +    +    N L+    +  R+L     + +   CY+
Sbjct: 314 YLAPGSDGRGGLMIDSGFAYGYMTGPVFKKVTNELKKRMSKYRRSLEAEAEIGV-TPCYN 372

Query: 171 FSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCF-----------AFAPTSSS 219
           F+ + S+++P + + F  G  + +P KNY + +      CF            F P  S 
Sbjct: 373 FTGQKSIKIPDLIYQFRGGATMVVPGKNYFVLIPEISLACFPLTTDAGTNTLEFTPGPS- 431

Query: 220 LSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
             I+GN Q     V F+L+N  +GF    C
Sbjct: 432 -IILGNSQHVDYYVEFDLKNERLGFRQQTC 460


>gi|190896584|gb|ACE96805.1| aspartyl protease [Populus tremula]
 gi|190896586|gb|ACE96806.1| aspartyl protease [Populus tremula]
 gi|190896588|gb|ACE96807.1| aspartyl protease [Populus tremula]
 gi|190896590|gb|ACE96808.1| aspartyl protease [Populus tremula]
 gi|190896592|gb|ACE96809.1| aspartyl protease [Populus tremula]
 gi|190896594|gb|ACE96810.1| aspartyl protease [Populus tremula]
 gi|190896596|gb|ACE96811.1| aspartyl protease [Populus tremula]
 gi|190896598|gb|ACE96812.1| aspartyl protease [Populus tremula]
 gi|190896600|gb|ACE96813.1| aspartyl protease [Populus tremula]
 gi|190896602|gb|ACE96814.1| aspartyl protease [Populus tremula]
 gi|190896604|gb|ACE96815.1| aspartyl protease [Populus tremula]
 gi|190896606|gb|ACE96816.1| aspartyl protease [Populus tremula]
 gi|190896610|gb|ACE96818.1| aspartyl protease [Populus tremula]
 gi|190896612|gb|ACE96819.1| aspartyl protease [Populus tremula]
 gi|190896614|gb|ACE96820.1| aspartyl protease [Populus tremula]
 gi|190896616|gb|ACE96821.1| aspartyl protease [Populus tremula]
 gi|190896618|gb|ACE96822.1| aspartyl protease [Populus tremula]
 gi|190896620|gb|ACE96823.1| aspartyl protease [Populus tremula]
 gi|190896622|gb|ACE96824.1| aspartyl protease [Populus tremula]
 gi|190896624|gb|ACE96825.1| aspartyl protease [Populus tremula]
 gi|190896626|gb|ACE96826.1| aspartyl protease [Populus tremula]
 gi|190896628|gb|ACE96827.1| aspartyl protease [Populus tremula]
 gi|190896630|gb|ACE96828.1| aspartyl protease [Populus tremula]
 gi|190896632|gb|ACE96829.1| aspartyl protease [Populus tremula]
 gi|190896634|gb|ACE96830.1| aspartyl protease [Populus tremula]
 gi|190896636|gb|ACE96831.1| aspartyl protease [Populus tremula]
 gi|190896638|gb|ACE96832.1| aspartyl protease [Populus tremula]
 gi|190896640|gb|ACE96833.1| aspartyl protease [Populus tremula]
 gi|190896642|gb|ACE96834.1| aspartyl protease [Populus tremula]
 gi|190896644|gb|ACE96835.1| aspartyl protease [Populus tremula]
 gi|190896646|gb|ACE96836.1| aspartyl protease [Populus tremula]
 gi|190896648|gb|ACE96837.1| aspartyl protease [Populus tremula]
 gi|190896650|gb|ACE96838.1| aspartyl protease [Populus tremula]
 gi|190896652|gb|ACE96839.1| aspartyl protease [Populus tremula]
 gi|190896654|gb|ACE96840.1| aspartyl protease [Populus tremula]
 gi|190896656|gb|ACE96841.1| aspartyl protease [Populus tremula]
 gi|190896658|gb|ACE96842.1| aspartyl protease [Populus tremula]
          Length = 339

 Score =  107 bits (267), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 76/209 (36%), Positives = 100/209 (47%), Gaps = 20/209 (9%)

Query: 3   FVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINA---STFSYCL 59
            V + +TL +  +     GC +   G  +   GLLGLG G +S  SQ  A     FSYCL
Sbjct: 134 LVQDAITLANDVIPGFTFGCINAVSGGSIPPQGLLGLGRGPISLISQAGAMYSGVFSYCL 193

Query: 60  VDRDSDSTSTLEFDSSL-------PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPIS 112
                 S  +  F  SL       P +  T PLLRN    + YY+ LTG+SVG   +PI 
Sbjct: 194 -----PSFKSYYFSGSLKLGPVGQPKSIRTTPLLRNPHRPSLYYVNLTGVSVGRIKVPIP 248

Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFS 172
                 D +   G I+DSGT +TR     Y A+RD F +      P   +  FDTC  F+
Sbjct: 249 SEQLVFDPNTGAGTIIDSGTVITRFVQPVYFAIRDEFRKQVNG--PISSLGAFDTC--FA 304

Query: 173 SRSSVEVPTVSFHFPEGKVLPLPAKNYLI 201
           + +  E P V+ HF EG  L LP +N LI
Sbjct: 305 ATNEAEAPAVTLHF-EGLNLVLPMENSLI 332


>gi|449527515|ref|XP_004170756.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial [Cucumis
           sativus]
          Length = 364

 Score =  107 bits (267), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 91/263 (34%), Positives = 123/263 (46%), Gaps = 24/263 (9%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ---INASTFSY 57
            D V + +TL + SV +   GC     G  V   GLLGLG G LS   Q   +  STFSY
Sbjct: 110 ADLVQDNLTLATDSVPSYTFGCIRKATGSSVPPQGLLGLGRGPLSLLGQSQSLYQSTFSY 169

Query: 58  CLVDRDSDSTSTLEFDSSL-------PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP 110
           CL      S  ++ F  SL       P      PLLRN    + YY+ L  I VG  ++ 
Sbjct: 170 CL-----PSFKSVNFSGSLRLGPVAQPIRIKYTPLLRNPRRSSLYYVNLISIRVGRKIVD 224

Query: 111 ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYD 170
           I  +A   + +   G ++DSGT  TRL    Y A+RD F R          +  FDTCY 
Sbjct: 225 IPPSALAFNSATGAGTVIDSGTTFTRLVAPAYTAVRDEFRRRVGRNVTVSSLGGFDTCYT 284

Query: 171 FSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP----TSSSLSIIGNV 226
                 +  PT++F F  G  + LP  N+LI   S  T C A A      +S L++I ++
Sbjct: 285 V----PIISPTITFMF-AGMNVTLPPDNFLIHSTSGSTTCLAMAAAPDNVNSVLNVIASM 339

Query: 227 QQQGTRVSFNLRNSLIGFTPNKC 249
           QQQ  R+ F++ NS +G     C
Sbjct: 340 QQQNHRILFDIPNSRVGVARESC 362


>gi|242085924|ref|XP_002443387.1| hypothetical protein SORBIDRAFT_08g018620 [Sorghum bicolor]
 gi|241944080|gb|EES17225.1| hypothetical protein SORBIDRAFT_08g018620 [Sorghum bicolor]
          Length = 460

 Score =  107 bits (267), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 96/280 (34%), Positives = 131/280 (46%), Gaps = 35/280 (12%)

Query: 5   TETVTLG--SASVDNI--AIGC---GHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSY 57
           TE  T G   +S +N+  A GC        G   GA+G++GLG G LS PSQ+  + FSY
Sbjct: 178 TEVFTFGHGQSSENNVSLAFGCITASRLTPGSLDGASGIIGLGRGKLSLPSQLGDNKFSY 237

Query: 58  CLVDRDSDS--TSTL-----EFDSSLPPNAVTAPLLRNHE---LDTFYYLGLTGISVGGD 107
           CL    SD+  TSTL        S     A + P L+N +    D+FYYL LTGI+VG  
Sbjct: 238 CLTPYFSDAANTSTLFVGASAGLSGGGAPATSVPFLKNPDDDPFDSFYYLPLTGITVGTA 297

Query: 108 LLPISETAFKIDE---SGNGGIIVDSGTAVTRLQTETYNALRDAFVR--GTRALSPTDGV 162
            L +   AF + E   +  GG ++DSG+  T L    Y ALRD  VR  G   + P  G 
Sbjct: 298 KLDVPAAAFDLREVAPAKWGGTLIDSGSPFTSLIDVAYQALRDELVRQLGASVVPPPAGA 357

Query: 163 ALFDTCYD--FSSRSSVEVPTVSFHFPEGKV----LPLPAKNYLIPVDSNGTFCFAFA-- 214
              D C        +   VP +  HF  G      + +P +NY  PVD +      F+  
Sbjct: 358 EGLDLCVGGVAPGDAGKLVPPLVLHFGSGGGGGGDVVVPPENYWGPVDDSTACMVVFSSG 417

Query: 215 -PTSS----SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
            P S+      +IIGN  QQ   + ++L   ++ F P  C
Sbjct: 418 GPNSTLPLNETTIIGNYMQQDMHLLYDLGQGVLSFQPADC 457


>gi|255545932|ref|XP_002514026.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223547112|gb|EEF48609.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 437

 Score =  107 bits (267), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 71/201 (35%), Positives = 103/201 (51%), Gaps = 11/201 (5%)

Query: 55  FSYCLVDRDSDSTS-TLEFDSSLPPNAVT-APLLRNHELDTFYYLGLTGISVGGDLLPIS 112
           FSYCL    S   S +L+   +  P ++   PLLRN    + YY+ LTG+SVG  L+PI+
Sbjct: 241 FSYCLPSFKSYYFSGSLKLGPAGQPKSIRYTPLLRNPHRPSLYYVNLTGVSVGRTLVPIA 300

Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFS 172
                 + +   G I+DSGT +TR     Y A+RD F +  +   P   +  FDTC  F+
Sbjct: 301 PELLAFNPNTGAGTIIDSGTVITRFVQPIYTAIRDEFRK--QVAGPFSSLGAFDTC--FA 356

Query: 173 SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP----TSSSLSIIGNVQQ 228
           + +    P V+ HF  G  L LP +N LI   +    C A A      +S L++I N+QQ
Sbjct: 357 ATNEAVAPAVTLHF-TGLNLVLPMENSLIHSSAGSLACLAMAAAPNNVNSVLNVIANLQQ 415

Query: 229 QGTRVSFNLRNSLIGFTPNKC 249
           Q  R+ F++ NS +G     C
Sbjct: 416 QNLRLLFDVPNSRLGIARELC 436


>gi|212274713|ref|NP_001130791.1| uncharacterized protein LOC100191895 precursor [Zea mays]
 gi|194690124|gb|ACF79146.1| unknown [Zea mays]
 gi|194708040|gb|ACF88104.1| unknown [Zea mays]
 gi|223950469|gb|ACN29318.1| unknown [Zea mays]
 gi|414885521|tpg|DAA61535.1| TPA: hypothetical protein ZEAMMB73_650724 [Zea mays]
          Length = 500

 Score =  107 bits (266), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 88/269 (32%), Positives = 128/269 (47%), Gaps = 32/269 (11%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGL-FVGAAGLLGLGGGSLSFPSQIN---ASTFS 56
           G    + ++L    +D    GCG +N+G  F G +GL+GLG   LS  SQ        FS
Sbjct: 241 GVLAHDRLSLAGEVIDGFVFGCGTSNQGPPFGGTSGLMGLGRSQLSLVSQTVDQFGGVFS 300

Query: 57  YCL-VDRDSDSTSTLEF--DSSLPPNAV----------TAPLLRNHELDTFYYLGLTGIS 103
           YCL + R+SD++ +L    D S   N+           + PLL+      FY + LTGI+
Sbjct: 301 YCLPLSRESDASGSLVLGDDPSAYRNSTPVVYTSMVSNSDPLLQG----PFYLVNLTGIT 356

Query: 104 VGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVA 163
           VGG    +  T F      +   IVDSGT +T L    YNA+R  F+          G +
Sbjct: 357 VGGQ--EVESTGF------SARAIVDSGTVITSLVPSVYNAVRAEFMSQLAEYPQAPGFS 408

Query: 164 LFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGT-FCFAFAPTSS--SL 220
           + DTC++ +    V+VP+++  F  G  + + +   L  V S+ +  C A A   S    
Sbjct: 409 ILDTCFNMTGLKEVQVPSLTLVFDGGAEVEVDSGGVLYFVSSDSSQVCLAVASLKSEDET 468

Query: 221 SIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           SIIGN QQ+  RV F+   S +GF    C
Sbjct: 469 SIIGNYQQKNLRVVFDTSASQVGFAQETC 497


>gi|190896608|gb|ACE96817.1| aspartyl protease [Populus tremula]
          Length = 339

 Score =  107 bits (266), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 76/209 (36%), Positives = 99/209 (47%), Gaps = 20/209 (9%)

Query: 3   FVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINA---STFSYCL 59
            V + +TL +  +     GC +   G  +   GLLGLG G +S  SQ  A     FSYCL
Sbjct: 134 LVQDAITLANDVIPGFTFGCINAVSGGSIPPQGLLGLGRGPISLISQAGAMYSGVFSYCL 193

Query: 60  VDRDSDSTSTLEFDSSL-------PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPIS 112
                 S  +  F  SL       P +  T PLLRN    + YY+ LTG+SVG   +PI 
Sbjct: 194 -----PSFKSYYFSGSLKLGPVGQPKSIRTTPLLRNPHRPSLYYVNLTGVSVGRIKVPIP 248

Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFS 172
                 D +   G I+DSGT +TR     Y A+RD F +      P   +  FDTC  F+
Sbjct: 249 SEQLVFDPNTGAGTIIDSGTVITRFVQPVYFAIRDEFRKQVNG--PISSLGAFDTC--FA 304

Query: 173 SRSSVEVPTVSFHFPEGKVLPLPAKNYLI 201
             +  E P V+ HF EG  L LP +N LI
Sbjct: 305 ETNEAEAPAVTLHF-EGLNLVLPMENSLI 332


>gi|226491934|ref|NP_001140743.1| uncharacterized protein LOC100272818 [Zea mays]
 gi|194700872|gb|ACF84520.1| unknown [Zea mays]
          Length = 351

 Score =  106 bits (265), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 80/255 (31%), Positives = 114/255 (44%), Gaps = 16/255 (6%)

Query: 1   GDFVTETVTLGSA-SVDNIAIGCGHNNEGLF-VGAAGLLGLGGGSLSFPSQINA---STF 55
           G ++ + +TL +  +V     GC H  +G F   AAG++ LGGG  S  SQ  +   + F
Sbjct: 107 GAYIADLLTLDAGNAVSGFKFGCSHAEQGSFDARAAGIMALGGGPESLLSQTASRYGNAF 166

Query: 56  SYCLVDRDSDSTS-TLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISET 114
           SYC+    SDS   TL          V  P++R  +  TFY + L  I+VGG  L ++  
Sbjct: 167 SYCIPATASDSGFFTLGVPRRASSRYVVTPMVRFRQAATFYGVLLRTITVGGQRLGVAPA 226

Query: 115 AFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSR 174
            F        G ++DS TA+TRL    Y ALR AF                DTCYDF+  
Sbjct: 227 VFA------AGSVLDSRTAITRLPPTAYQALRAAFRSSMTMYRSAPPKGYLDTCYDFTGV 280

Query: 175 SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVS 234
            ++ +P +S  F    VLPL     L     N    F          ++G+VQQQ   V 
Sbjct: 281 VNIRLPKISLVFDRNAVLPLDPSGILF----NDCLAFTSNADDRMPGVLGSVQQQTIEVL 336

Query: 235 FNLRNSLIGFTPNKC 249
           +++    +GF    C
Sbjct: 337 YDVGGGAVGFRQGAC 351


>gi|357118076|ref|XP_003560785.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 477

 Score =  106 bits (265), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 87/256 (33%), Positives = 128/256 (50%), Gaps = 19/256 (7%)

Query: 6   ETVTLGSAS-VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSYCLVD 61
           +T+T  S+S       GCG  N G F    GLLGLG G LS PSQ   S    FSYCL  
Sbjct: 229 DTLTFNSSSKFTGFTFGCGEKNIGDFGEVDGLLGLGRGKLSLPSQAAPSFGGVFSYCLPS 288

Query: 62  RDSDSTSTLEFDSSLP----PNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFK 117
            ++ +   L   ++ P    P   TA +++  +  +FY++ L  I++GG +LP+  + F 
Sbjct: 289 YNT-TPGYLNIGATKPTSTVPVQYTA-MIKKPQYPSFYFIELVSINIGGYILPVPPSVFT 346

Query: 118 IDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSV 177
                  G ++DSGT +T L    Y +LRD F    +   P       DTCYDF+ + ++
Sbjct: 347 -----KTGTLLDSGTILTYLPPPAYTSLRDRFKFTMQGNKPAPPYEPLDTCYDFTGQGAI 401

Query: 178 EVPTVSFHFPEGKVLPLPAKNYLI-PVDSN---GTFCFAFAPTSSSLSIIGNVQQQGTRV 233
            +P VSF+F +G V  L     +I P D+    G   F   P +   SI+GN QQ+   V
Sbjct: 402 VIPAVSFNFSDGAVFDLDFYGIMIFPDDAKPLIGCLAFVSRPAAMPFSIVGNTQQRAAEV 461

Query: 234 SFNLRNSLIGFTPNKC 249
            +++ +  IGF P  C
Sbjct: 462 IYDVPSQKIGFIPISC 477


>gi|242046812|ref|XP_002461152.1| hypothetical protein SORBIDRAFT_02g041760 [Sorghum bicolor]
 gi|241924529|gb|EER97673.1| hypothetical protein SORBIDRAFT_02g041760 [Sorghum bicolor]
          Length = 452

 Score =  106 bits (265), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 71/207 (34%), Positives = 103/207 (49%), Gaps = 22/207 (10%)

Query: 54  TFSYCLVDRDSDSTSTLEFDSSL-------PPNAVTAPLLRNHELDTFYYLGLTGISVGG 106
           TFSYCL      S  +L F  +L       PP   T PLL N    + YY+ +TGI VG 
Sbjct: 254 TFSYCL-----PSFKSLNFSGTLRLGRNGQPPRIKTTPLLANPHRSSLYYVNMTGIRVGR 308

Query: 107 DLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFD 166
            ++PI   A   D +   G ++DSGT  TRL    Y A+RD   R  R  +P   +  FD
Sbjct: 309 KVVPIPPPALAFDPATGAGTVLDSGTMFTRLVAPAYVAVRDEVRR--RVGAPVSSLGGFD 366

Query: 167 TCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT----SSSLSI 222
           TC++    ++V  P V+  F +G  + LP +N +I        C A A      ++ L++
Sbjct: 367 TCFN---TTAVAWPPVTLLF-DGMQVTLPEENVVIHSTYGTISCLAMAAAPDGVNTVLNV 422

Query: 223 IGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           I ++QQQ  RV F++ N  +GF   +C
Sbjct: 423 IASMQQQNHRVLFDVPNGRVGFARERC 449


>gi|242095602|ref|XP_002438291.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
 gi|241916514|gb|EER89658.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
          Length = 464

 Score =  106 bits (265), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 83/258 (32%), Positives = 124/258 (48%), Gaps = 22/258 (8%)

Query: 1   GDFVTETVTLGSA-SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFS 56
           G + ++T++L S+ +V +   GC H   G      GL+GLGG + S  SQ  A+    FS
Sbjct: 220 GTYGSDTLSLTSSDAVKSFQFGCSHRAAGFVGELDGLMGLGGDTESLVSQTAATYGKAFS 279

Query: 57  YCLVDRDSDSTSTLEFDSSLPPNAVT---APLLRNHELDTFYYLGLTGISVGGDLLPISE 113
           YCL    S     L   ++   ++      P++R   + TFY + L GI+V G +L +  
Sbjct: 280 YCLPPPSSSGGGFLTLGAAGGASSSRYSHTPMVR-FSVPTFYGVFLQGITVAGTMLNVPA 338

Query: 114 TAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSS 173
           + F      +G  +VDSGT +T+L    Y ALR AF +  +A      V   DTC+DFS 
Sbjct: 339 SVF------SGASVVDSGTVITQLPPTAYQALRTAFKKEMKAYPSAAPVGSLDTCFDFSG 392

Query: 174 RSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS--SLSIIGNVQQQGT 231
            +++ VPTV+  F  G  + L     L         C AF  T+      I+GNVQQ+  
Sbjct: 393 FNTITVPTVTLTFSRGAAMDLDISGILY------AGCLAFTATAHDGDTGILGNVQQRTF 446

Query: 232 RVSFNLRNSLIGFTPNKC 249
            + F++    IGF    C
Sbjct: 447 EMLFDVGGRTIGFRSGAC 464


>gi|238010910|gb|ACR36490.1| unknown [Zea mays]
 gi|413942664|gb|AFW75313.1| hypothetical protein ZEAMMB73_520329 [Zea mays]
          Length = 481

 Score =  106 bits (265), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 80/255 (31%), Positives = 114/255 (44%), Gaps = 16/255 (6%)

Query: 1   GDFVTETVTLGSA-SVDNIAIGCGHNNEGLF-VGAAGLLGLGGGSLSFPSQINA---STF 55
           G ++ + +TL +  +V     GC H  +G F   AAG++ LGGG  S  SQ  +   + F
Sbjct: 237 GAYIADLLTLDAGNAVSGFKFGCSHAEQGSFDARAAGIMALGGGPESLLSQTASRYGNAF 296

Query: 56  SYCLVDRDSDSTS-TLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISET 114
           SYC+    SDS   TL          V  P++R  +  TFY + L  I+VGG  L ++  
Sbjct: 297 SYCIPATASDSGFFTLGVPRRASSRYVVTPMVRFRQAATFYGVLLRTITVGGQRLGVAPA 356

Query: 115 AFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSR 174
            F        G ++DS TA+TRL    Y ALR AF                DTCYDF+  
Sbjct: 357 VFA------AGSVLDSRTAITRLPPTAYQALRSAFRSSMTMYRSAPPKGYLDTCYDFTGV 410

Query: 175 SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVS 234
            ++ +P +S  F    VLPL     L     N    F          ++G+VQQQ   V 
Sbjct: 411 VNIRLPKISLVFDRNAVLPLDPSGILF----NDCLAFTSNADDRMPGVLGSVQQQTIEVL 466

Query: 235 FNLRNSLIGFTPNKC 249
           +++    +GF    C
Sbjct: 467 YDVGGGAVGFRQGAC 481


>gi|449449334|ref|XP_004142420.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
          Length = 441

 Score =  106 bits (264), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 90/263 (34%), Positives = 123/263 (46%), Gaps = 24/263 (9%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ---INASTFSY 57
            D V + +TL + SV +   GC     G  V   GLLGLG G LS   Q   +  STFSY
Sbjct: 187 ADLVQDNLTLATDSVPSYTFGCIRKATGSSVPPQGLLGLGRGPLSLLGQSQSLYQSTFSY 246

Query: 58  CLVDRDSDSTSTLEFDSSL-------PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP 110
           CL      S  ++ F  SL       P      PLLRN    + YY+ L  I VG  ++ 
Sbjct: 247 CL-----PSFKSVNFSGSLRLGPVAQPIRIKYTPLLRNPRRSSLYYVNLISIRVGRKIVD 301

Query: 111 ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYD 170
           I  +A   + +   G ++DSGT  TRL    Y A+RD F R          +  FDTCY 
Sbjct: 302 IPPSALAFNSATGAGTVIDSGTTFTRLVAPAYTAVRDEFRRRVGRNVTVSSLGGFDTCYT 361

Query: 171 FSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP----TSSSLSIIGNV 226
                 +  PT++F F  G  + LP  N+LI   +  T C A A      +S L++I ++
Sbjct: 362 V----PIISPTITFMF-AGMNVTLPPDNFLIHSTAGSTTCLAMAAAPDNVNSVLNVIASM 416

Query: 227 QQQGTRVSFNLRNSLIGFTPNKC 249
           QQQ  R+ F++ NS +G     C
Sbjct: 417 QQQNHRILFDIPNSRVGVARESC 439


>gi|388520263|gb|AFK48193.1| unknown [Lotus japonicus]
          Length = 157

 Score =  106 bits (264), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 62/162 (38%), Positives = 94/162 (58%), Gaps = 10/162 (6%)

Query: 91  LDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFV 150
           L T Y L LT I+VGG  L ++ +++K+        I+DSGT +TRL    Y AL+++FV
Sbjct: 2   LPTLYGLDLTAITVGGKPLGLAASSYKVPT------IIDSGTVITRLPMPVYTALKNSFV 55

Query: 151 R-GTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTF 209
           R  ++  +   G+++ DTC+  + +   EVP +   F  G  LPL A N LI +D  G  
Sbjct: 56  RIMSKKYAQAPGISILDTCFKGNVKEMSEVPEIQMIFGGGADLPLKAHNTLIELD-KGVT 114

Query: 210 CFAFAPTSSS--LSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           C A A +S +  ++IIGN QQQ  +V++++ NS IGF    C
Sbjct: 115 CLAIAGSSENNPIAIIGNYQQQTFKVAYDVANSKIGFAAGGC 156


>gi|449454654|ref|XP_004145069.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
 gi|449470632|ref|XP_004153020.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
 gi|449499016|ref|XP_004160697.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 434

 Score =  106 bits (264), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 84/265 (31%), Positives = 129/265 (48%), Gaps = 21/265 (7%)

Query: 1   GDFVTETVTLGSASVDNIA-----IGCGHNNEGLF-VGAAGLLGLGGGSLSFPSQINAST 54
           G+   +TVT+ S S   +A     IGCGH+N G F    +G++GLG G  S  +Q+  +T
Sbjct: 172 GNLAVDTVTMQSTSGRPVAFPRTVIGCGHDNAGTFNANVSGIVGLGRGPASLVTQLGPAT 231

Query: 55  ---FSYCLVDRDSDST---STLEFDSSLPPNA---VTAPLLRNHELDTFYYLGLTGISVG 105
              FSYCL+   + ST   + L F S+   +    V+ P+  + +  TFY L L  +SVG
Sbjct: 232 GGKFSYCLIPIGTGSTNDSTKLNFGSNANVSGSGTVSTPIYSSAQYKTFYSLKLEAVSVG 291

Query: 106 GDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALF 165
                  E A K+   G   II+DSGT +T L +   N+   A  +        D     
Sbjct: 292 DTKFNFPEGASKL--GGESNIIIDSGTTLTYLPSALLNSFGSAISQSMSLPHAQDPSEFL 349

Query: 166 DTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP-TSSSLSIIG 224
           D C+  ++    E+P V+ HF EG  +PL  +N  + + S+ T C AF      ++ I G
Sbjct: 350 DYCFA-TTTDDYEMPPVTMHF-EGADVPLQRENLFVRL-SDDTICLAFGSFPDDNIFIYG 406

Query: 225 NVQQQGTRVSFNLRNSLIGFTPNKC 249
           N+ Q    V ++++N  + F P  C
Sbjct: 407 NIAQSNFLVGYDIKNLAVSFQPAHC 431


>gi|357515001|ref|XP_003627789.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355521811|gb|AET02265.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 415

 Score =  106 bits (264), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 89/261 (34%), Positives = 125/261 (47%), Gaps = 21/261 (8%)

Query: 1   GDFVTETVTLGS-----ASVDNIAIGCGHNNEGLFVG-AAGLLGLGGGSLSFPSQINAS- 53
           G    ET+TL S      S     IGCG+ N G F G ++G++GLG G +S PSQ+  S 
Sbjct: 160 GYLSVETLTLDSTTGYSVSFPKTMIGCGYRNTGTFHGPSSGIVGLGSGPMSLPSQLGTSI 219

Query: 54  --TFSYCLVDRDSDSTSTLEF-DSSLP--PNAVTAPLLRNHELDTFYYLGLTGISVGGDL 108
              FSYCL     +STS L F D+++     A+T P+++  +  + YYL L   SVG  L
Sbjct: 220 GGKFSYCLGPWLPNSTSKLNFGDAAIVYGDGAMTTPIVKK-DAQSGYYLTLEAFSVGNKL 278

Query: 109 LPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTC 168
           +      +  +E   G I++DSGT  T L  + Y     A           D    F  C
Sbjct: 279 IEFGGPTYGGNE---GNILIDSGTTFTFLPYDVYYRFESAVAEYINLEHVEDPNGTFKLC 335

Query: 169 YDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQ 228
           Y+ +     E P ++ HF +G  + L   +  I V S+G  C AF P  S  +I GNV Q
Sbjct: 336 YNVAYHG-FEAPLITAHF-KGADIKLYYISTFIKV-SDGIACLAFIP--SQTAIFGNVAQ 390

Query: 229 QGTRVSFNLRNSLIGFTPNKC 249
           Q   V +NL  + + F P  C
Sbjct: 391 QNLLVGYNLVQNTVTFKPVDC 411


>gi|115475621|ref|NP_001061407.1| Os08g0267300 [Oryza sativa Japonica Group]
 gi|37806402|dbj|BAC99940.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
           sativa Japonica Group]
 gi|113623376|dbj|BAF23321.1| Os08g0267300 [Oryza sativa Japonica Group]
          Length = 524

 Score =  105 bits (263), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 88/263 (33%), Positives = 118/263 (44%), Gaps = 21/263 (7%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN---ASTFSY 57
           G   T+TV LG ASVD    GCG +N GLF G AGL+GLG   LS  SQ        FSY
Sbjct: 266 GVLATDTVALGGASVDGFVFGCGLSNRGLFGGTAGLMGLGRTELSLVSQTAPRFGGVFSY 325

Query: 58  CL---VDRDSDSTSTLEFDSSLPPNAVTAPLLR---NHELDTFYYLGLTGISVGGDLLPI 111
           CL      D+  + +L  D+S   NA      R   +     FY++ +TG SV       
Sbjct: 326 CLPAATSGDAAGSLSLGGDTSSYRNATPVSYTRMIADPAQPPFYFMNVTGASV------- 378

Query: 112 SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR--GTRALSPTDGVALFDTCY 169
              A      G   +++DSGT +TRL    Y A+R  F R  G          +L D CY
Sbjct: 379 GGAAVAAAGLGAANVLLDSGTVITRLAPSVYRAVRAEFARQFGAERYPAAPPFSLLDACY 438

Query: 170 DFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGT-FCFAFAPTS--SSLSIIGNV 226
           + +    V+VP ++     G  + + A   L     +G+  C A A  S      IIGN 
Sbjct: 439 NLTGHDEVKVPLLTLRLEGGADMTVDAAGMLFMARKDGSQVCLAMASLSFEDQTPIIGNY 498

Query: 227 QQQGTRVSFNLRNSLIGFTPNKC 249
           QQ+  RV ++   S +GF    C
Sbjct: 499 QQKNKRVVYDTVGSRLGFADEDC 521


>gi|125560845|gb|EAZ06293.1| hypothetical protein OsI_28528 [Oryza sativa Indica Group]
          Length = 525

 Score =  105 bits (263), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 88/263 (33%), Positives = 118/263 (44%), Gaps = 21/263 (7%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN---ASTFSY 57
           G   T+TV LG ASVD    GCG +N GLF G AGL+GLG   LS  SQ        FSY
Sbjct: 267 GVLATDTVALGGASVDGFVFGCGLSNRGLFGGTAGLMGLGRTELSLVSQTAPRFGGVFSY 326

Query: 58  CL---VDRDSDSTSTLEFDSSLPPNAVTAPLLR---NHELDTFYYLGLTGISVGGDLLPI 111
           CL      D+  + +L  D+S   NA      R   +     FY++ +TG SV       
Sbjct: 327 CLPAATSGDAAGSLSLGGDTSSYRNATPVSYTRMIADPAQPPFYFMNVTGASV------- 379

Query: 112 SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR--GTRALSPTDGVALFDTCY 169
              A      G   +++DSGT +TRL    Y A+R  F R  G          +L D CY
Sbjct: 380 GGAAVAAAGLGAANVLLDSGTVITRLAPSVYRAVRAEFARQFGAERYPAAPPFSLLDACY 439

Query: 170 DFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGT-FCFAFAPTS--SSLSIIGNV 226
           + +    V+VP ++     G  + + A   L     +G+  C A A  S      IIGN 
Sbjct: 440 NLTGHDEVKVPLLTLRLEGGADMTVDAAGMLFMARKDGSQVCLAMASLSFEDQTPIIGNY 499

Query: 227 QQQGTRVSFNLRNSLIGFTPNKC 249
           QQ+  RV ++   S +GF    C
Sbjct: 500 QQKNKRVVYDTVGSRLGFADEDC 522


>gi|359476193|ref|XP_003631802.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-2-like [Vitis vinifera]
          Length = 496

 Score =  105 bits (263), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 91/258 (35%), Positives = 128/258 (49%), Gaps = 32/258 (12%)

Query: 1   GDFVTETVTLGSASV-DNIAIGCGHNNEGLF-VGAAGLLGLGGGSLSFPSQINAS---TF 55
           G++  +T+TL  + V      GCG NNEG F  GA G+LGLG G LS  SQ  +     F
Sbjct: 238 GNYGCDTMTLEHSDVFPKFQFGCGRNNEGDFGSGADGMLGLGQGQLSTVSQTASKFKKVF 297

Query: 56  SYCLVDRDS-----------DSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISV 104
           SYCL + DS             +S+L+F S      V  P     E   +Y++ L  ISV
Sbjct: 298 SYCLPEEDSIGSLLFGEKATSQSSSLKFTS-----LVNGPGTSGLEESGYYFVKLLDISV 352

Query: 105 GGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVA- 163
           G   L I  + F      + G I+DSGT +TRL    Y+AL+ AF +       ++G   
Sbjct: 353 GNKRLNIPSSVF-----ASPGTIIDSGTVITRLPQRAYSALKAAFKKAMAKYPLSNGRRK 407

Query: 164 ---LFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSL 220
              + DTCY+ S R  V +P +  HF EG  + L  K  +   D++   C AFA  +S L
Sbjct: 408 KGDILDTCYNLSGRKDVLLPEIVLHFGEGADVRLNGKRVIWGNDAS-RLCLAFA-GNSEL 465

Query: 221 SIIGNVQQQGTRVSFNLR 238
           +IIGN QQ    V ++++
Sbjct: 466 TIIGNRQQVSLTVLYDIQ 483


>gi|357143328|ref|XP_003572882.1| PREDICTED: uncharacterized protein LOC100846829 [Brachypodium
           distachyon]
          Length = 836

 Score =  105 bits (262), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 99/260 (38%), Positives = 130/260 (50%), Gaps = 26/260 (10%)

Query: 1   GDFVTETVTLGSA-SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS----TF 55
           G + ++T+TL  A +V     GCGH   GLF G  GLL LG   +S  SQ + +     F
Sbjct: 592 GVYGSDTLTLTDADAVTGFLFGCGHAQAGLFAGIDGLLALGRKGMSLTSQTSGAYGGGVF 651

Query: 56  SYCLVDRDSDST-STLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP-ISE 113
           SYCL    S +   TL   SS    A T  LL   ++ TFY + LTGI VGG  L  +  
Sbjct: 652 SYCLPPSPSSTGFLTLGGPSSASGFATTG-LLTAWDVPTFYMVMLTGIGVGGQQLSGVPA 710

Query: 114 TAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRAL----SPTDGVALFDTCY 169
           +AF       GG +VD+GT +TRL    Y ALR AF           +P  G+   DTCY
Sbjct: 711 SAFA------GGTVVDTGTVITRLPPTAYAALRAAFRAAMAPYGYPAAPATGI--LDTCY 762

Query: 170 DFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQ 229
           +F+   +V +PTVS  F  G  L L A  +L    S+G   FA        +I+GNVQQ+
Sbjct: 763 NFTDYGTVTLPTVSLTFSGGATLKLDAPGFL----SSGCLAFATNSGDGDPAILGNVQQR 818

Query: 230 GTRVSFNLRNSLIGFTPNKC 249
              V F+   S +GF P+ C
Sbjct: 819 SFAVRFD--GSSVGFMPHSC 836


>gi|413950927|gb|AFW83576.1| aspartic proteinase nepenthesin-2 [Zea mays]
          Length = 316

 Score =  105 bits (262), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 82/266 (30%), Positives = 128/266 (48%), Gaps = 33/266 (12%)

Query: 13  ASVDNIAIGCGHNNEGL-FVGAAGLLGLGGGSLSFPSQINAS---TFSYCLVDR--DSDS 66
           A +  + +GC  +  G  F+ + G+L LG  ++SF S+  A     FSYCLVD     ++
Sbjct: 53  AKLRGVVLGCTTSYTGESFLASDGVLSLGYSNVSFASRAAARFGGRFSYCLVDHLAPRNA 112

Query: 67  TSTLEFD-----------------SSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLL 109
           TS L F                  S+  P A   PLL +H +  FY + + G+SV G+LL
Sbjct: 113 TSYLTFGPNPAVSSASASRTACAGSAAAPGARQTPLLLDHRMRPFYAVAVNGVSVDGELL 172

Query: 110 PISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCY 169
            I    + + +   GG I+DSGT++T L +  Y A+  A  +    L P   +  FD CY
Sbjct: 173 RIPRLVWDVQK--GGGAILDSGTSLTVLVSPAYRAVVAALGKKLVGL-PRVAMDPFDYCY 229

Query: 170 DFSS-----RSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP-TSSSLSII 223
           +++S       +V VP ++ HF     L  P K+Y+I   + G  C          +S+I
Sbjct: 230 NWTSPLTGEDLAVAVPALAVHFAGSARLQPPPKSYVIDA-APGVKCIGLQEGDWPGVSVI 288

Query: 224 GNVQQQGTRVSFNLRNSLIGFTPNKC 249
           GN+ QQ     F+L+N  + F  ++C
Sbjct: 289 GNILQQEHLWEFDLKNRRLRFKRSRC 314


>gi|226492150|ref|NP_001146362.1| hypothetical protein precursor [Zea mays]
 gi|219886805|gb|ACL53777.1| unknown [Zea mays]
 gi|414878074|tpg|DAA55205.1| TPA: hypothetical protein ZEAMMB73_415404 [Zea mays]
          Length = 440

 Score =  105 bits (261), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 89/282 (31%), Positives = 120/282 (42%), Gaps = 37/282 (13%)

Query: 1   GDFVTETVTLGSASVDNIAIGC---GHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSY 57
           G   TE +T  S +V ++  GC      + G   GA+G++GLG G LS PSQ+  + FSY
Sbjct: 160 GTLATENLTFQSETV-SLVFGCIVVTKLSPGSLNGASGIIGLGRGKLSLPSQLGDTRFSY 218

Query: 58  CLVDRDSDSTSTLEF------------DSSLPPNAVTAPLLRNHELD---TFYYLGLTGI 102
           CL     D+                   SS P    T P +R+   D   TFYYL LTGI
Sbjct: 219 CLTPYFEDTIEPSHMVVGASAGLINGSASSTP--VTTVPFVRSPSDDPFSTFYYLPLTGI 276

Query: 103 SVGGDLLPISETAFKIDESGNG---GIIVDSGTAVTRLQTETYNALRDAFVR--GTRALS 157
           + G   L +   AF + +   G   G  +DSG  +T L    Y ALR    R  G   + 
Sbjct: 277 TAGKVKLAVPSAAFDLRQVAPGMWTGTFIDSGAPLTSLVDVAYQALRAELARQLGAALVQ 336

Query: 158 PTDGVALFDTCYDFSSRSSVEVPTVSFHF----PEGKVLPLPAKNYLIPVDSNGTFCFAF 213
           P  G   FD C        + VP +  HF      G  L +P  NY  PVDS       F
Sbjct: 337 PLAGTTGFDLCVALKDAERL-VPPLVLHFGGGSGTGTDLVVPPANYWAPVDSATACMVVF 395

Query: 214 APTS------SSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           +         +  ++IGN  QQ   V ++L   ++ F P  C
Sbjct: 396 SSVDRKSLPMNETTVIGNYMQQNMHVLYDLAGGVLSFQPADC 437


>gi|357456413|ref|XP_003598487.1| Peptidase A1, putative [Medicago truncatula]
 gi|355487535|gb|AES68738.1| Peptidase A1, putative [Medicago truncatula]
          Length = 414

 Score =  105 bits (261), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 92/263 (34%), Positives = 122/263 (46%), Gaps = 24/263 (9%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ---INASTFSY 57
            + V +T+TL +  V +   GC     G      GLLGLG G LS  SQ   +  STFSY
Sbjct: 161 ANLVQDTITLATDPVPSYTFGCVSKTTGTSAPPQGLLGLGRGPLSLLSQTQNLYQSTFSY 220

Query: 58  CLVDRDSDSTSTLEFDSSL-------PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP 110
           CL      S  +L F  SL       P      PLL+N    + YY+ L  I VG  ++ 
Sbjct: 221 CL-----PSFKSLNFSGSLRLGPVAQPKRIKYTPLLKNPRRSSLYYVNLEAIRVGRKVVD 275

Query: 111 ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYD 170
           I   A   + +   G I DSGT  TRL    Y A+RD F R          +  FDTCY+
Sbjct: 276 IPPAALAFNPTTGAGTIFDSGTVFTRLVAPVYVAVRDEFRRRVGPKLTVTSLGGFDTCYN 335

Query: 171 FSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFA----PTSSSLSIIGNV 226
                 + VPT++F F  G  + LP  N LI   +  T C A A      +S L++I N+
Sbjct: 336 V----PIVVPTITFIF-TGMNVTLPQDNILIHSTAGSTTCLAMAGAPDNVNSVLNVIANM 390

Query: 227 QQQGTRVSFNLRNSLIGFTPNKC 249
           QQQ  RV +++ NS +G     C
Sbjct: 391 QQQNHRVLYDVPNSRVGVARELC 413


>gi|125553822|gb|EAY99427.1| hypothetical protein OsI_21398 [Oryza sativa Indica Group]
          Length = 469

 Score =  105 bits (261), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 81/262 (30%), Positives = 115/262 (43%), Gaps = 28/262 (10%)

Query: 1   GDFVTETVTLGSA-SVDNIAIGCGHNNEGLFV---GAAGLLGLGGGSLSFPSQINAS--- 53
           G ++++ +T+  A +V +   GC H  +G F     AAG++ LGGG  S  SQ  A+   
Sbjct: 223 GTYISDLLTITPATAVRSFQFGCSHGVQGSFSFGSSAAGIMALGGGPESLVSQTAATYGR 282

Query: 54  TFSYCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELD-TFYYLGLTGISVGGDLLPIS 112
            FS+C          TL          V  P+L+N  +  TFY + L  I+V G  + + 
Sbjct: 283 VFSHCFPPPTRRGFFTLGVPRVAAWRYVLTPMLKNPAIPPTFYMVRLEAIAVAGQRIAVP 342

Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFS 172
            T F        G  +DS TA+TRL    Y ALR AF        P       DTCYD +
Sbjct: 343 PTVFA------AGAALDSRTAITRLPPTAYQALRQAFRDRMAMYQPAPPKGPLDTCYDMA 396

Query: 173 SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTF---CFAF--APTSSSLSIIGNVQ 227
              S  +P ++  F          KN  + +D +G     C AF   P      IIGN+Q
Sbjct: 397 GVRSFALPRITLVF---------DKNAAVELDPSGVLFQGCLAFTAGPNDQVPGIIGNIQ 447

Query: 228 QQGTRVSFNLRNSLIGFTPNKC 249
            Q   V +N+  +L+GF    C
Sbjct: 448 LQTLEVLYNIPAALVGFRHAAC 469


>gi|116831531|gb|ABK28718.1| unknown [Arabidopsis thaliana]
          Length = 438

 Score =  105 bits (261), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 97/263 (36%), Positives = 134/263 (50%), Gaps = 22/263 (8%)

Query: 1   GDFVTETVTLGSA-----SVDNIAIGCGHNNEGLF-VGAAGLLGLGGGSLSFPSQINAS- 53
           G+   +T+TLGS+      + NI IGCGHNN G F    +G++GLGGG +S   Q+  S 
Sbjct: 180 GNIAVDTLTLGSSDTRPMQLKNIIIGCGHNNAGTFNKKGSGIVGLGGGPVSLIKQLGDSI 239

Query: 54  --TFSYCLVDRDS--DSTSTLEFDSSLPPN---AVTAPLLRNHELDTFYYLGLTGISVGG 106
              FSYCLV   S  D TS + F ++   +    V+ PL+     +TFYYL L  ISVG 
Sbjct: 240 DGKFSYCLVPLTSKKDQTSKINFGTNAIVSGSGVVSTPLIAKASQETFYYLTLKSISVGS 299

Query: 107 DLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFD 166
             +   + +    ES  G II+DSGT +T L TE Y+ L DA      A    D  +   
Sbjct: 300 KQI---QYSGSDSESSEGNIIIDSGTTLTLLPTEFYSELEDAVASSIDAEKKQDPQSGLS 356

Query: 167 TCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNV 226
            CY  S+   ++VP ++ HF +G  + L + N  + V S    CFAF   S S SI GNV
Sbjct: 357 LCY--SATGDLKVPVITMHF-DGADVKLDSSNAFVQV-SEDLVCFAFR-GSPSFSIYGNV 411

Query: 227 QQQGTRVSFNLRNSLIGFTPNKC 249
            Q    V ++  +  + F P  C
Sbjct: 412 AQMNFLVGYDTVSKTVSFKPTDC 434


>gi|115466060|ref|NP_001056629.1| Os06g0118700 [Oryza sativa Japonica Group]
 gi|55296436|dbj|BAD68559.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|113594669|dbj|BAF18543.1| Os06g0118700 [Oryza sativa Japonica Group]
 gi|215767921|dbj|BAH00150.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 494

 Score =  105 bits (261), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 81/262 (30%), Positives = 115/262 (43%), Gaps = 28/262 (10%)

Query: 1   GDFVTETVTLGSA-SVDNIAIGCGHNNEGLFV---GAAGLLGLGGGSLSFPSQINAS--- 53
           G ++++ +T+  A +V +   GC H  +G F     AAG++ LGGG  S  SQ  A+   
Sbjct: 248 GTYISDLLTITPATAVRSFQFGCSHGVQGSFSFGSSAAGIMALGGGPESLVSQTAATYGR 307

Query: 54  TFSYCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELD-TFYYLGLTGISVGGDLLPIS 112
            FS+C          TL          V  P+L+N  +  TFY + L  I+V G  + + 
Sbjct: 308 VFSHCFPPPTRRGFFTLGVPRVAAWRYVLTPMLKNPAIPPTFYMVRLEAIAVAGQRIAVP 367

Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFS 172
            T F        G  +DS TA+TRL    Y ALR AF        P       DTCYD +
Sbjct: 368 PTVFA------AGAALDSRTAITRLPPTAYQALRQAFRDRMAMYQPAPPKGPLDTCYDMA 421

Query: 173 SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTF---CFAF--APTSSSLSIIGNVQ 227
              S  +P ++  F          KN  + +D +G     C AF   P      IIGN+Q
Sbjct: 422 GVRSFALPRITLVF---------DKNAAVELDPSGVLFQGCLAFTAGPNDQVPGIIGNIQ 472

Query: 228 QQGTRVSFNLRNSLIGFTPNKC 249
            Q   V +N+  +L+GF    C
Sbjct: 473 LQTLEVLYNIPAALVGFRHAAC 494


>gi|15242803|ref|NP_198319.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|75130158|sp|Q6XBF8.1|CDR1_ARATH RecName: Full=Aspartic proteinase CDR1; AltName: Full=Protein
           CONSTITUTIVE DISEASE RESISTANCE 1; Flags: Precursor
 gi|37935737|gb|AAP72988.1| CDR1 [Arabidopsis thaliana]
 gi|91806924|gb|ABE66189.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|109946613|gb|ABG48485.1| At5g33340 [Arabidopsis thaliana]
 gi|332006513|gb|AED93896.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 437

 Score =  105 bits (261), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 97/263 (36%), Positives = 134/263 (50%), Gaps = 22/263 (8%)

Query: 1   GDFVTETVTLGSA-----SVDNIAIGCGHNNEGLF-VGAAGLLGLGGGSLSFPSQINAS- 53
           G+   +T+TLGS+      + NI IGCGHNN G F    +G++GLGGG +S   Q+  S 
Sbjct: 180 GNIAVDTLTLGSSDTRPMQLKNIIIGCGHNNAGTFNKKGSGIVGLGGGPVSLIKQLGDSI 239

Query: 54  --TFSYCLVDRDS--DSTSTLEFDSSLPPN---AVTAPLLRNHELDTFYYLGLTGISVGG 106
              FSYCLV   S  D TS + F ++   +    V+ PL+     +TFYYL L  ISVG 
Sbjct: 240 DGKFSYCLVPLTSKKDQTSKINFGTNAIVSGSGVVSTPLIAKASQETFYYLTLKSISVGS 299

Query: 107 DLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFD 166
             +   + +    ES  G II+DSGT +T L TE Y+ L DA      A    D  +   
Sbjct: 300 KQI---QYSGSDSESSEGNIIIDSGTTLTLLPTEFYSELEDAVASSIDAEKKQDPQSGLS 356

Query: 167 TCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNV 226
            CY  S+   ++VP ++ HF +G  + L + N  + V S    CFAF   S S SI GNV
Sbjct: 357 LCY--SATGDLKVPVITMHF-DGADVKLDSSNAFVQV-SEDLVCFAFR-GSPSFSIYGNV 411

Query: 227 QQQGTRVSFNLRNSLIGFTPNKC 249
            Q    V ++  +  + F P  C
Sbjct: 412 AQMNFLVGYDTVSKTVSFKPTDC 434


>gi|449499012|ref|XP_004160696.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 435

 Score =  105 bits (261), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 85/267 (31%), Positives = 128/267 (47%), Gaps = 26/267 (9%)

Query: 1   GDFVTETVTLGSAS-----VDNIAIGCGHNNEGLF-VGAAGLLGLGGGSLSFPSQINAST 54
           GDF  +T+T+GS S         AIGCGH+N G F    +G++GLG G  S   Q+ ++ 
Sbjct: 174 GDFAVDTLTMGSTSGRVVAFPRTAIGCGHDNAGSFDANVSGIVGLGLGPASLIKQMGSAV 233

Query: 55  ---FSYCL--VDRDSDSTSTLEFDSSLPPN---AVTAPLLRNHELDTFYYLGLTGISVGG 106
              FSYCL  +  D   ++ L F S+   +   AV+ P+  + +  +FY L L  +SVG 
Sbjct: 234 GGKFSYCLTPIGNDDGGSNKLNFGSNANVSGSGAVSTPIYISDKFKSFYSLKLKAVSVGR 293

Query: 107 DLLPISETAFKIDES---GNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVA 163
           +      T +    S   G   II+DSGT +T L  + Y+    A           D   
Sbjct: 294 N-----NTFYSTANSILGGKANIIIDSGTTLTLLPVDLYHNFAKAISNSINLQRTDDPNQ 348

Query: 164 LFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFA-PTSSSLSI 222
             + C++ ++    +VP ++ HF EG  L L  +N LI V  N   C AFA    + +SI
Sbjct: 349 FLEYCFE-TTTDDYKVPFIAMHF-EGANLRLQRENVLIRVSDN-VICLAFAGAQDNDISI 405

Query: 223 IGNVQQQGTRVSFNLRNSLIGFTPNKC 249
            GN+ Q    V +++ N  + F P  C
Sbjct: 406 YGNIAQINFLVGYDVTNMSLSFKPMNC 432


>gi|326529233|dbj|BAK01010.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 441

 Score =  105 bits (261), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 91/269 (33%), Positives = 129/269 (47%), Gaps = 26/269 (9%)

Query: 1   GDFVTETVTLGSASVDNIAIGC---GHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSY 57
           G   TE  T  S +   +  GC       +G   GA+GL+GLG G LS  SQ  A+ FSY
Sbjct: 176 GSLGTEAFTFQSGAA-KLGFGCVSLTRITKGALNGASGLIGLGRGRLSLVSQTGATKFSY 234

Query: 58  CLVD--RDSDSTSTLEFDSSLPPN----AVTA-PLLRNHE---LDTFYYLGLTGISVGGD 107
           CL    R+  ++S L   +S   +    AVT+ P +++ E     TFYYL L GISVG  
Sbjct: 235 CLTPYLRNHGASSHLFVGASASLSGGGGAVTSIPFVKSPEDYPYSTFYYLPLVGISVGET 294

Query: 108 LLPISETAFKIDESG----NGGIIVDSGTAVTRLQTETYNALRDAFVRG-TRALSPTDGV 162
            LPI   AF++        +GG+I+D+G+ VT L    Y+AL D   R   R+L      
Sbjct: 295 KLPIPSAAFELRRVAAGYWSGGVIIDTGSPVTSLAEAAYSALSDEVARQLNRSLVQPPAD 354

Query: 163 ALFDTCYDFSSRSSVE--VPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSL 220
              D C    +R  V+  VP + FHF  G  + + A +Y  PVD + T C          
Sbjct: 355 TGLDLCV---ARQDVDKVVPVLVFHFGGGADMAVSAGSYWGPVDKS-TACMLIEEGGYE- 409

Query: 221 SIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           ++IGN QQQ   + +++    + F    C
Sbjct: 410 TVIGNFQQQDVHLLYDIGKGELSFQTADC 438


>gi|449462553|ref|XP_004149005.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 418

 Score =  105 bits (261), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 79/261 (30%), Positives = 129/261 (49%), Gaps = 23/261 (8%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAST-----F 55
           GD   E +T+GS+SV ++ IGCGH + G F  A+G++GLGGG LS  SQ++ ++     F
Sbjct: 168 GDLGFEKITIGSSSVKSV-IGCGHASSGGFGFASGVIGLGGGQLSLVSQMSQTSGISRRF 226

Query: 56  SYCLVDRDSDSTSTLEFDSSLP---PNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPIS 112
           SYCL    S +   + F  +     P  V+ PL+  + + T+YY+ L  IS+G +     
Sbjct: 227 SYCLPTLLSHANGKINFGQNAVVSGPGVVSTPLISKNTV-TYYYITLEAISIGNE----R 281

Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYD-- 170
             AF    +  G +I+DSGT ++ L  E Y+ +  + ++  +A    D    +D C+D  
Sbjct: 282 HMAF----AKQGNVIIDSGTTLSFLPKELYDGVVSSLLKVVKAKRVKDPGNFWDLCFDDG 337

Query: 171 FSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSS--LSIIGNVQQ 228
            +  +S  +P ++  F  G  + L   N    V +N   C    P S +    IIGN+  
Sbjct: 338 INVATSSGIPIITAQFSGGANVNLLPVNTFQKV-ANNVNCLTLTPASPTDEFGIIGNLAL 396

Query: 229 QGTRVSFNLRNSLIGFTPNKC 249
               + ++L    + F P  C
Sbjct: 397 ANFLIGYDLEAKRLSFKPTVC 417


>gi|115489316|ref|NP_001067145.1| Os12g0583300 [Oryza sativa Japonica Group]
 gi|77556903|gb|ABA99699.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113649652|dbj|BAF30164.1| Os12g0583300 [Oryza sativa Japonica Group]
 gi|125537189|gb|EAY83677.1| hypothetical protein OsI_38901 [Oryza sativa Indica Group]
          Length = 446

 Score =  105 bits (261), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 89/268 (33%), Positives = 125/268 (46%), Gaps = 24/268 (8%)

Query: 1   GDFVTETVTLGSASVDNIAIGC---GHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSY 57
           G   TE     S + + +A GC       +G   GA+GL+GLG G LS  SQ  A+ FSY
Sbjct: 181 GTLGTEAFAFQSGTAE-LAFGCVTFTRIVQGALHGASGLIGLGRGRLSLVSQTGATKFSY 239

Query: 58  CLVD--RDSDSTSTLEFDSSLP----PNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPI 111
           CL     ++ +T  L   +S       + +T   ++  +   FYYL L G++VG   LPI
Sbjct: 240 CLTPYFHNNGATGHLFVGASASLGGHGDVMTTQFVKGPKGSPFYYLPLIGLTVGETRLPI 299

Query: 112 SETAFKIDESG----NGGIIVDSGTAVTRLQTETYNALRD---AFVRGTRALSPTDGVAL 164
             T F + E      +GG+I+DSG+  T L  + Y+AL     A + G+    P D    
Sbjct: 300 PATVFDLREVAPGLFSGGVIIDSGSPFTSLVHDAYDALASELAARLNGSLVAPPPDA--- 356

Query: 165 FDTCYDFSSRSSV--EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP-TSSSLS 221
            D      +R  V   VP V FHF  G  + +PA++Y  PVD         +       S
Sbjct: 357 -DDGALCVARRDVGRVVPAVVFHFRGGADMAVPAESYWAPVDKAAACMAIASAGPYRRQS 415

Query: 222 IIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           +IGN QQQ  RV ++L N    F P  C
Sbjct: 416 VIGNYQQQNMRVLYDLANGDFSFQPADC 443


>gi|222619890|gb|EEE56022.1| hypothetical protein OsJ_04800 [Oryza sativa Japonica Group]
          Length = 423

 Score =  105 bits (261), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 61/178 (34%), Positives = 92/178 (51%), Gaps = 10/178 (5%)

Query: 77  PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTR 136
           P    T PLL N    + YY+ + GI VG  ++ + ++A   +     G I+D+GT  TR
Sbjct: 249 PKRIKTTPLLYNPHRPSLYYVNMIGIRVGSKVVQVPQSALAFNPVTGSGTIIDAGTMFTR 308

Query: 137 LQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPA 196
           L    Y A+RDAF RG         +  FDTCY+     +V VPTV+F F     + LP 
Sbjct: 309 LAAPVYAAVRDAF-RGRVRTPVAPPLGGFDTCYNV----TVSVPTVTFMFAGAVAVTLPE 363

Query: 197 KNYLIPVDSNGTFCFAFAP-----TSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           +N +I   S G  C A A       +++L+++ ++QQQ  RV F++ N  +GF+   C
Sbjct: 364 ENVMIHSSSGGVACLAMAAGPSDGVNAALNVLASMQQQNQRVLFDVANGRVGFSRELC 421


>gi|302780643|ref|XP_002972096.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
 gi|300160395|gb|EFJ27013.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
          Length = 393

 Score =  104 bits (260), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 89/268 (33%), Positives = 132/268 (49%), Gaps = 36/268 (13%)

Query: 1   GDFVTETVTLGSAS-----VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINA--- 52
           G+F  +T++LG+ S       + A+GCG  N G F G  GL+GLG G +S  SQ++A   
Sbjct: 140 GEFARDTISLGTTSDGSQKFPSFAVGCGMVNSG-FDGVDGLVGLGQGPVSLTSQLSAAID 198

Query: 53  STFSYCLVDRDSDSTST-LEFDSS-------LPPNAVTAPLLRNHELDTFYYLGLTGISV 104
           S FSYCLVD +S S S+ L F  S       +    +T P   +    T+Y L + GI+V
Sbjct: 199 SKFSYCLVDINSQSESSPLLFGPSAALHGTGIQSTKITPP---SDTYPTYYLLTVNGIAV 255

Query: 105 GGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVAL 164
            G  +              G  I+DSGT +T + +  Y  +  + +     L   DG ++
Sbjct: 256 AGQTM-----------GSPGTTIIDSGTTLTYVPSGVYGRVL-SRMESMVTLPRVDGSSM 303

Query: 165 -FDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNG-TFCFAFAPTSS-SLS 221
             D CYD SS  + + P ++       + P P+ NY + VD +G T C A    S   +S
Sbjct: 304 GLDLCYDRSSNRNYKFPALTIRLAGATMTP-PSSNYFLVVDDSGDTVCLAMGSASGLPVS 362

Query: 222 IIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           IIGNV QQG  + ++  +S + F   KC
Sbjct: 363 IIGNVMQQGYHILYDRGSSELSFVQAKC 390


>gi|449454652|ref|XP_004145068.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
 gi|449470630|ref|XP_004153019.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 435

 Score =  104 bits (260), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 85/267 (31%), Positives = 128/267 (47%), Gaps = 26/267 (9%)

Query: 1   GDFVTETVTLGSAS-----VDNIAIGCGHNNEGLF-VGAAGLLGLGGGSLSFPSQINAST 54
           GDF  +T+T+GS S         AIGCGH+N G F    +G++GLG G  S   Q+ ++ 
Sbjct: 174 GDFAVDTLTMGSTSGRVVAFPRTAIGCGHDNAGSFDANVSGIVGLGLGPASLIKQMGSAV 233

Query: 55  ---FSYCL--VDRDSDSTSTLEFDSSLPPN---AVTAPLLRNHELDTFYYLGLTGISVGG 106
              FSYCL  +  D   ++ L F S+   +   AV+ P+  + +  +FY L L  +SVG 
Sbjct: 234 GGKFSYCLTPIGNDDGGSNKLNFGSNANVSGSGAVSTPIYISDKFKSFYSLKLKAVSVGR 293

Query: 107 DLLPISETAFKIDES---GNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVA 163
           +      T +    S   G   II+DSGT +T L  + Y+    A           D   
Sbjct: 294 N-----NTFYSTANSILGGKANIIIDSGTTLTLLPVDLYHNFAKAISNSINLQRTDDPNQ 348

Query: 164 LFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFA-PTSSSLSI 222
             + C++ ++    +VP ++ HF EG  L L  +N LI V  N   C AFA    + +SI
Sbjct: 349 FLEYCFE-TTTDDYKVPFIAMHF-EGANLRLQRENVLIRVSDN-VICLAFAGAQDNDISI 405

Query: 223 IGNVQQQGTRVSFNLRNSLIGFTPNKC 249
            GN+ Q    V +++ N  + F P  C
Sbjct: 406 YGNIAQINFLVGYDVTNMSLSFKPMNC 432


>gi|255548664|ref|XP_002515388.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223545332|gb|EEF46837.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 494

 Score =  104 bits (260), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 98/255 (38%), Positives = 129/255 (50%), Gaps = 13/255 (5%)

Query: 1   GDFVTETVTLGSASV-DNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINA---STFS 56
           G F  E ++L +  V ++   GCG NN+GLF GAAGLLGLG   LS  SQ        FS
Sbjct: 246 GFFGKEKLSLTATDVFNDFYFGCGQNNKGLFGGAAGLLGLGRDKLSLVSQTAQRYNKIFS 305

Query: 57  YCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
           YCL    S ST  L F  S   +A   PL       +FY L LTGISVGG  L IS + F
Sbjct: 306 YCL-PSSSSSTGFLTFGGSTSKSASFTPLATISGGSSFYGLDLTGISVGGRKLAISPSVF 364

Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSS 176
                   G I+DSGT +TRL    Y+AL   F +          +++ DTC+DFS+  +
Sbjct: 365 S-----TAGTIIDSGTVITRLPPAAYSALSSTFRKLMSQYPAAPALSILDTCFDFSNHDT 419

Query: 177 VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS--SSLSIIGNVQQQGTRVS 234
           + VP +   F  G V+ +  K  +  V+     C AFA  S  S ++I GNVQQ+   V 
Sbjct: 420 ISVPKIGLFFSGGVVVDID-KTGIFYVNDLTQVCLAFAGNSDASDVAIFGNVQQKTLEVV 478

Query: 235 FNLRNSLIGFTPNKC 249
           ++     +GF P  C
Sbjct: 479 YDGAAGRVGFAPAGC 493


>gi|238011160|gb|ACR36615.1| unknown [Zea mays]
          Length = 461

 Score =  104 bits (260), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 82/266 (30%), Positives = 128/266 (48%), Gaps = 33/266 (12%)

Query: 13  ASVDNIAIGCGHNNEGL-FVGAAGLLGLGGGSLSFPSQINAS---TFSYCLVDR--DSDS 66
           A +  + +GC  +  G  F+ + G+L LG  ++SF S+  A     FSYCLVD     ++
Sbjct: 198 AKLRGVVLGCTTSYTGESFLASDGVLSLGYSNVSFASRAAARFGGRFSYCLVDHLAPRNA 257

Query: 67  TSTLEFD-----------------SSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLL 109
           TS L F                  S+  P A   PLL +H +  FY + + G+SV G+LL
Sbjct: 258 TSYLTFGPNPAVSSASASRTACAGSAAAPGARQTPLLLDHRMRPFYAVAVNGVSVDGELL 317

Query: 110 PISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCY 169
            I    + + +   GG I+DSGT++T L +  Y A+  A  +    L P   +  FD CY
Sbjct: 318 RIPRLVWDVQK--GGGAILDSGTSLTVLVSPAYRAVVAALGKKLVGL-PRVAMDPFDYCY 374

Query: 170 DFSS-----RSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP-TSSSLSII 223
           +++S       +V VP ++ HF     L  P K+Y+I   + G  C          +S+I
Sbjct: 375 NWTSPLTGEDLAVAVPALAVHFAGSARLQPPPKSYVIDA-APGVKCIGLQEGDWPGVSVI 433

Query: 224 GNVQQQGTRVSFNLRNSLIGFTPNKC 249
           GN+ QQ     F+L+N  + F  ++C
Sbjct: 434 GNILQQEHLWEFDLKNRRLRFKRSRC 459


>gi|357450863|ref|XP_003595708.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355484756|gb|AES65959.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 407

 Score =  104 bits (260), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 92/267 (34%), Positives = 127/267 (47%), Gaps = 29/267 (10%)

Query: 1   GDFVTETVTLGSASVDNIAI-----GCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS-- 53
           G    ET+TL S + + +A      GCGHNN G      GL+GLG G LS  SQI +S  
Sbjct: 149 GVLAQETLTLTSTTGEPVAFQGIIFGCGHNNSGFNDREMGLIGLGRGPLSLISQIGSSLG 208

Query: 54  ----TFSYCLVDRDSDS--TSTLEF---DSSLPPNAVTAPLLRNHELDTFYYLGLTGISV 104
                FS CLV  ++D   TS + F      L    V+ PL+      T Y+  L GISV
Sbjct: 209 AGGNMFSQCLVPFNTDPSITSQMNFGKGSEVLGNGTVSTPLISKD--GTGYFATLLGISV 266

Query: 105 GGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSP--TDGV 162
               LP S  +  +     G I++DSGT +T L  E Y+ L +  VR   AL P   DG 
Sbjct: 267 EDINLPFSNGS-SLGTITKGNILIDSGTTITYLPEEFYHRLIEQ-VRNKVALEPFRIDG- 323

Query: 163 ALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSI 222
             ++ CY   + +++  PT++ HF  G VL  PA+ ++   D N  FCFA   T+     
Sbjct: 324 --YELCYQ--TPTNLNGPTLTIHFEGGDVLLTPAQMFIPVQDDN--FCFAVFDTNEEYVT 377

Query: 223 IGNVQQQGTRVSFNLRNSLIGFTPNKC 249
            GN  Q    + F+L   ++ F    C
Sbjct: 378 YGNYAQSNYLIGFDLERQVVSFKATDC 404


>gi|326526699|dbj|BAK00738.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 182

 Score =  104 bits (259), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 62/166 (37%), Positives = 93/166 (56%), Gaps = 8/166 (4%)

Query: 84  PLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYN 143
           P++ +   D+ Y++ L+G++V G  L +S +     E  +   I+DSGT +TRL T  Y+
Sbjct: 24  PMVSSTLDDSLYFIKLSGMTVAGKPLAVSSS-----EYSSLPTIIDSGTVITRLPTTVYD 78

Query: 144 ALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPV 203
           AL  A     +     D  ++ DTC+     SS+ VP VS  F  G  L L A+N L+ V
Sbjct: 79  ALSKAVAGAMKGTKRADAYSILDTCF-VGQASSLRVPAVSMAFSGGAALKLSAQNLLVDV 137

Query: 204 DSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           DS+ T C AFAP  S+ +IIGN QQQ   V ++++++ IGF    C
Sbjct: 138 DSSTT-CLAFAPARSA-AIIGNTQQQTFSVVYDVKSNRIGFAAGGC 181


>gi|125536419|gb|EAY82907.1| hypothetical protein OsI_38120 [Oryza sativa Indica Group]
          Length = 448

 Score =  103 bits (258), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 76/206 (36%), Positives = 109/206 (52%), Gaps = 14/206 (6%)

Query: 5   TETVTLGSASV-DNIAIGCGHNNEG-LFVGAAGLLGLGGGSLSFPSQINASTFSYCLVDR 62
           TET T G   V +N++ G     +G  F G AGL+GLG G LS  SQ+ A  F+YCL   
Sbjct: 187 TETFTFGDGYVANNVSFGRSDTIDGSQFGGTAGLVGLGRGHLSLVSQLGAGRFAYCLA-A 245

Query: 63  DSDSTSTLEFDS-----SLPPNAVTAPLLRNH--ELDTFYYLGLTGISVGGDLLPISETA 115
           D +  ST+ F S     +   +  + PL+ N   + DT YY+ L GISVGG  LPI +  
Sbjct: 246 DPNVYSTILFGSLAALDTSAGDVSSTPLVTNPKPDRDTHYYVNLQGISVGGSRLPIKDGT 305

Query: 116 FKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRS 175
           F I+  G+GG+  DSG   T L+   Y  +R A     + L    G    DTC+  +++ 
Sbjct: 306 FAINSDGSGGVFFDSGAIDTSLKDAAYQVVRQAITSEIQRLGYDAG---DDTCFVAANQQ 362

Query: 176 SV-EVPTVSFHFPEGKVLPLPAKNYL 200
           +V ++P +  HF +G  + L  +NYL
Sbjct: 363 AVAQMPPLVLHFDDGADMSLNGRNYL 388


>gi|413921976|gb|AFW61908.1| hypothetical protein ZEAMMB73_608282 [Zea mays]
          Length = 459

 Score =  103 bits (258), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 78/249 (31%), Positives = 119/249 (47%), Gaps = 14/249 (5%)

Query: 14  SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLVDRDSDSTSTLEFD 73
           SV  +A GCG +N GL   + G +GLG GSLS  +Q+    FSYCL D  + S  +    
Sbjct: 209 SVGGVAFGCGVDNGGLSYNSTGTVGLGRGSLSLVAQLGVGKFSYCLTDFFNTSLGSPVLF 268

Query: 74  SSLPPNAV----------TAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGN 123
            SL   A           + PL++     + YY+ L GIS+G   LPI    F + + G+
Sbjct: 269 GSLAELAAPSTIGGAAVQSTPLVQGPYNPSRYYVSLEGISLGDARLPIPNGTFDLRDDGS 328

Query: 124 GGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSS--RSSVEVPT 181
           GG+IVDSGT  T L    +  + +  V G       +  +L   C+  ++  +   ++P 
Sbjct: 329 GGMIVDSGTIFTVLVESAFRVVVN-HVAGVLNQPVVNASSLDSPCFPATAGEQQLPDMPD 387

Query: 182 VSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSL-SIIGNVQQQGTRVSFNLRNS 240
           +  HF  G  + L   NY+     + +FC   A   S+  SI+GN QQQ  ++ F++   
Sbjct: 388 MLLHFAGGADMRLHRDNYMSFNQESSSFCLNIAGAPSAYGSILGNFQQQNIQMLFDITVG 447

Query: 241 LIGFTPNKC 249
            + F P  C
Sbjct: 448 QLSFVPTDC 456


>gi|147786881|emb|CAN62311.1| hypothetical protein VITISV_008929 [Vitis vinifera]
          Length = 408

 Score =  103 bits (257), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 84/259 (32%), Positives = 122/259 (47%), Gaps = 19/259 (7%)

Query: 2   DFVTETVTLGSASVDNIAIGCGHNNEGLFVGA-AGLLGLGGGSLSFPSQINASTFSYCLV 60
           D V ET   G+ +V ++  GCGH+N G F G  +G+LGL  G  S  S++  S FSYC+ 
Sbjct: 153 DIVFETSDQGTVTVSSVVFGCGHSNRGRFDGQQSGILGLSAGDQSIVSRL-GSRFSYCIG 211

Query: 61  DR-DSDST-STLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKI 118
           D  D   T + L     +     + P    H  + FYY+ L GISVG   L I+   F+ 
Sbjct: 212 DLFDPHYTHNQLVLGDGVKMEGSSTPF---HTFNGFYYVTLEGISVGETRLDINPEVFQR 268

Query: 119 DESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDT-----CYDFSS 173
            ESG GG+++DSGT  T L  + ++ L +   R  R         ++ T     CY    
Sbjct: 269 TESGQGGVVMDSGTTATFLAKDGFDPLSNEIQRLVRGHFQQ---VIYRTIPGWLCYKGRV 325

Query: 174 RSSVE-VPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSL--SIIGNVQQQG 230
              +   P ++FHF EG  L L A N L    +   FC A   ++     S+IG + QQ 
Sbjct: 326 NEDLRGFPELAFHFAEGADLVLDA-NSLFVQKNQDVFCLAVLESNLKNIGSVIGIMAQQH 384

Query: 231 TRVSFNLRNSLIGFTPNKC 249
             V+++L    + F    C
Sbjct: 385 YNVAYDLIGKRVYFQRTDC 403


>gi|294463081|gb|ADE77078.1| unknown [Picea sitchensis]
          Length = 370

 Score =  103 bits (257), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 80/281 (28%), Positives = 122/281 (43%), Gaps = 35/281 (12%)

Query: 1   GDFVTETVTL------GSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ----I 50
           G  +TET+ L      G+ ++ + A+GC   +       +G+ G G G+LS PSQ    I
Sbjct: 89  GLLLTETLNLPLENGEGARAITHFAVGCSIVSS---QQPSGIAGFGRGALSMPSQLGEHI 145

Query: 51  NASTFSYCL----VDRDSDSTSTLEFDSSLPPNAVT--APLLRNH------ELDTFYYLG 98
               F+YCL     D ++  +  +  D +LP N      P L N       +   +YY+G
Sbjct: 146 GKDRFAYCLQSHRFDEENKKSLMVLGDKALPNNIPLNYTPFLTNSRAPPSSQYGVYYYIG 205

Query: 99  LTGISVGGDLLP-ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR--GTRA 155
           L G+S+GG  L  +     + D  GNGG I+DSGT  T    E +  +   F    G R 
Sbjct: 206 LRGVSIGGKRLKQLPSKLLRFDTKGNGGTIIDSGTTFTVFSDEIFKHIAAGFASQIGYRR 265

Query: 156 LSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP 215
               +       CYD +   ++ +P  +FHF  G  + LP  NY     S  + C     
Sbjct: 266 AGEVEDKTGMGLCYDVTGLENIVLPEFAFHFKGGSDMVLPVANYFSYFSSFDSICLTMIS 325

Query: 216 TSSSLS-------IIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           +   L        I+GN QQQ   + ++   + +GFT   C
Sbjct: 326 SRGLLEVDSGPAVILGNDQQQDFYLLYDREKNRLGFTQQTC 366


>gi|357450869|ref|XP_003595711.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355484759|gb|AES65962.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 443

 Score =  103 bits (257), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 90/267 (33%), Positives = 132/267 (49%), Gaps = 24/267 (8%)

Query: 1   GDFVTETVTLGSASVDNIAI-----GCGHNNEGLFV-GAAGLLGLGGGSLSFPSQINAS- 53
           G    ET+TL S +   +A+     GCGHNN G+F     G++GLG G LS  SQI +S 
Sbjct: 148 GVLAQETLTLTSTTGKPVALKGVIFGCGHNNNGVFNDKEMGIIGLGRGPLSLVSQIGSSF 207

Query: 54  ---TFSYCLVDRDSDS--TSTLEF---DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVG 105
               FS CLV   ++   TS + F      L    V+ PL+  +    FY++ L GISV 
Sbjct: 208 GGKMFSQCLVPFHTNPSITSPMSFGKGSEVLGNGVVSTPLVSKNTHQAFYFVTLLGISVE 267

Query: 106 GDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALS--PTDGVA 163
              LP ++ +  ++    G +++DSGT  T L  + Y+ L +  VR   AL   P D   
Sbjct: 268 DINLPFNDGS-SLEPITKGNMVIDSGTPTTLLPEDFYHRLVEE-VRNKVALDPIPIDPTL 325

Query: 164 LFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT-SSSLSI 222
            +  CY   + ++++  T++ HF    VL  P + + IPV  +G FCFAF  T S+   I
Sbjct: 326 GYQLCY--RTPTNLKGTTLTAHFEGADVLLTPTQIF-IPVQ-DGIFCFAFTSTFSNEYGI 381

Query: 223 IGNVQQQGTRVSFNLRNSLIGFTPNKC 249
            GN  Q    + F+L   L+ F    C
Sbjct: 382 YGNHAQSNYLIGFDLEKQLVSFKATDC 408


>gi|296090291|emb|CBI40110.3| unnamed protein product [Vitis vinifera]
          Length = 408

 Score =  103 bits (257), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 84/259 (32%), Positives = 122/259 (47%), Gaps = 19/259 (7%)

Query: 2   DFVTETVTLGSASVDNIAIGCGHNNEGLFVGA-AGLLGLGGGSLSFPSQINASTFSYCLV 60
           D V ET   G+ +V ++  GCGH+N G F G  +G+LGL  G  S  S++  S FSYC+ 
Sbjct: 153 DIVFETSDQGTVTVSSVVFGCGHSNRGRFDGQQSGILGLSAGDQSIVSRL-GSRFSYCIG 211

Query: 61  DR-DSDST-STLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKI 118
           D  D   T + L     +     + P    H  + FYY+ L GISVG   L I+   F+ 
Sbjct: 212 DLFDPHYTHNQLVLGDGVKMEGSSTPF---HTFNGFYYVTLEGISVGETRLDINPEVFQR 268

Query: 119 DESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDT-----CYDFSS 173
            ESG GG+++DSGT  T L  + ++ L +   R  R         ++ T     CY    
Sbjct: 269 TESGQGGVVMDSGTTATFLAKDGFDPLSNEIQRLVRGHFQQ---VIYRTIPGWLCYKGRV 325

Query: 174 RSSVE-VPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSL--SIIGNVQQQG 230
              +   P ++FHF EG  L L A N L    +   FC A   ++     S+IG + QQ 
Sbjct: 326 NEDLRGFPELAFHFAEGADLVLDA-NSLFVQKNQDVFCLAVLESNLKNIGSVIGIMAQQH 384

Query: 231 TRVSFNLRNSLIGFTPNKC 249
             V+++L    + F    C
Sbjct: 385 YNVAYDLIGKRVYFQRTDC 403


>gi|413938616|gb|AFW73167.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
          Length = 452

 Score =  103 bits (257), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 92/251 (36%), Positives = 124/251 (49%), Gaps = 34/251 (13%)

Query: 14  SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSYCLVDRDSDS---T 67
           +V     GCGH   GLF G  GLLGLG    S   Q   +    FSYCL  + S +   T
Sbjct: 221 AVQGFFFGCGHAQSGLFNGVDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPSTAGYLT 280

Query: 68  STLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGII 127
             +   S   P   T  LL +    T+Y + LTGISVGG  L +  +AF           
Sbjct: 281 LGVGGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGGTV------ 334

Query: 128 VDSGTAVTRLQTETYNALRDAFVRGTRAL----SPTDGVALFDTCYDFSSRSSVEVPTVS 183
           VD+GT VTRL    Y ALR AF  G  +     +P++G+   DTCY+F+   +V +P V+
Sbjct: 335 VDTGTVVTRLPPTAYAALRSAFRSGMASYGYPTAPSNGI--LDTCYNFAGYGTVTLPNVA 392

Query: 184 FHFPEGKVLPLPAKNYLIPVDSNGTF-CFAFAPTSS--SLSIIGNVQQQGTRVSFNLR-- 238
             F  G  + L A   L       +F C AFAP+ S   ++I+GNVQQ+    SF +R  
Sbjct: 393 LTFGSGATVTLGADGIL-------SFGCLAFAPSGSDGGMAILGNVQQR----SFEVRID 441

Query: 239 NSLIGFTPNKC 249
            + +GF P+ C
Sbjct: 442 GTSVGFKPSSC 452


>gi|242053991|ref|XP_002456141.1| hypothetical protein SORBIDRAFT_03g031170 [Sorghum bicolor]
 gi|241928116|gb|EES01261.1| hypothetical protein SORBIDRAFT_03g031170 [Sorghum bicolor]
          Length = 519

 Score =  103 bits (257), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 86/274 (31%), Positives = 131/274 (47%), Gaps = 40/274 (14%)

Query: 12  SASVDNIAIGCGHNNEG-LFVGAAGLLGLGGGSLSFPSQINAS---TFSYCLVDR--DSD 65
            A +  + +GC  +  G  F+ + G+L LG  ++SF S+  A     FSYCLVD     +
Sbjct: 248 QAKLRGVVLGCTTSYTGDSFLASDGVLSLGYSNISFASRAAARFGGRFSYCLVDHLAPRN 307

Query: 66  STSTLEFD-----SSLPPNAVTA-------------------PLLRNHELDTFYYLGLTG 101
           +TS L F      SS PP+                       PLL +H +  FY + + G
Sbjct: 308 ATSYLTFGPNPAVSSSPPSKTACAGGGSPAAAPPGPGGARQTPLLLDHRMRPFYAVTVNG 367

Query: 102 ISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDG 161
           ISV G+LL I    +  D +  GG I+DSGT++T L +  Y A+  A  +    L P   
Sbjct: 368 ISVDGELLRIPRLVW--DVAKGGGAILDSGTSLTVLVSPAYRAVVAALNKKLAGL-PRVT 424

Query: 162 VALFDTCYDFSSRS-----SVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP- 215
           +  FD CY+++S S     +V +P ++ HF     L  PAK+Y+I   + G  C      
Sbjct: 425 MDPFDYCYNWTSPSTGEDLTVAMPELAVHFAGSARLQPPAKSYVIDA-APGVKCIGLQEG 483

Query: 216 TSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
               +S+IGN+ QQ     F+L+N  + F  ++C
Sbjct: 484 EWPGVSVIGNILQQEHLWEFDLKNRRLRFKRSRC 517


>gi|326495450|dbj|BAJ85821.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 491

 Score =  103 bits (257), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 92/268 (34%), Positives = 123/268 (45%), Gaps = 36/268 (13%)

Query: 1   GDFVTETVTLG-SASVDNIAIGCGHNNEGLF-VGAAGLLGLGGGSLSFPSQINAS---TF 55
           G ++T+T+T+  S +  N   GC H   G F   A+G + LGGG  S  SQ   +    F
Sbjct: 241 GTYMTDTLTISPSTTFLNFRFGCSHAVRGKFSAQASGTMSLGGGPQSLLSQTARAYGNAF 300

Query: 56  SYCLVDRDSDSTSTLEFDS-SLPPNA---------VTAPLLRNHEL--DTFYYLGLTGIS 103
           SYC+        S   F S   P N           T PL+R+  +   T Y + L GI 
Sbjct: 301 SYCV-----PGPSAAGFLSIGGPVNGDDGGGSGAFATTPLVRSANVINPTIYVVRLQGIE 355

Query: 104 VGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVA 163
           V G  L +    F      +GG ++DS   +T+L    Y ALR AF    RA        
Sbjct: 356 VAGRRLNVPPVVF------SGGTVMDSSAVITQLPPTAYRALRLAFRNAMRAYKTRAPTG 409

Query: 164 LFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS--SLS 221
             DTC+DF   S V VPTVS  F  G V+ L   + L+  DS    C AFAP ++  +L 
Sbjct: 410 NLDTCFDFVGVSKVTVPTVSLVFDGGAVIELGLLSVLL--DS----CLAFAPMAADFALG 463

Query: 222 IIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
            IGNVQQQ   V +++    +GF    C
Sbjct: 464 FIGNVQQQTHEVLYDVAGGAVGFRHGAC 491


>gi|225446261|ref|XP_002265547.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 440

 Score =  103 bits (257), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 84/259 (32%), Positives = 122/259 (47%), Gaps = 19/259 (7%)

Query: 2   DFVTETVTLGSASVDNIAIGCGHNNEGLFVGA-AGLLGLGGGSLSFPSQINASTFSYCLV 60
           D V ET   G+ +V ++  GCGH+N G F G  +G+LGL  G  S  S++  S FSYC+ 
Sbjct: 185 DIVFETSDQGTVTVSSVVFGCGHSNRGRFDGQQSGILGLSAGDQSIVSRL-GSRFSYCIG 243

Query: 61  DR-DSDST-STLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKI 118
           D  D   T + L     +     + P    H  + FYY+ L GISVG   L I+   F+ 
Sbjct: 244 DLFDPHYTHNQLVLGDGVKMEGSSTPF---HTFNGFYYVTLEGISVGETRLDINPEVFQR 300

Query: 119 DESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDT-----CYDFSS 173
            ESG GG+++DSGT  T L  + ++ L +   R  R         ++ T     CY    
Sbjct: 301 TESGQGGVVMDSGTTATFLAKDGFDPLSNEIQRLVRGHFQQ---VIYRTIPGWLCYKGRV 357

Query: 174 RSSVE-VPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSL--SIIGNVQQQG 230
              +   P ++FHF EG  L L A N L    +   FC A   ++     S+IG + QQ 
Sbjct: 358 NEDLRGFPELAFHFAEGADLVLDA-NSLFVQKNQDVFCLAVLESNLKNIGSVIGIMAQQH 416

Query: 231 TRVSFNLRNSLIGFTPNKC 249
             V+++L    + F    C
Sbjct: 417 YNVAYDLIGKRVYFQRTDC 435


>gi|357481205|ref|XP_003610888.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355512223|gb|AES93846.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 413

 Score =  103 bits (257), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 91/268 (33%), Positives = 129/268 (48%), Gaps = 28/268 (10%)

Query: 1   GDFVTETVTLGS-----ASVDNIAIGCGHNNEGLFVG-AAGLLGLGGGSLSFPSQIN--- 51
           G    ETVTL S      S+  I  GCGHNN G F     GL+GLGGG  S  SQI    
Sbjct: 152 GVLAQETVTLTSNTGKPISLQGILFGCGHNNTGNFNDHEMGLIGLGGGPTSLVSQIGPLF 211

Query: 52  -ASTFSYCLVDRDSDST--STLEF---DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVG 105
               FS CLV   +D T  S + F      L    VT PL++  +  T YY+ L GISV 
Sbjct: 212 GGKKFSQCLVPFLTDITISSQMSFGKGSEVLGEGVVTTPLVQREQDMTSYYVTLLGISVE 271

Query: 106 GDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSP-TDGVAL 164
              LP++ T  K      G ++VDSGT    L  + Y+ +    V+    L P TD  +L
Sbjct: 272 DTYLPMNSTIEK------GNMLVDSGTPPNILPQQLYDRVY-VEVKNKVPLEPITDDPSL 324

Query: 165 -FDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPV-DSNGTFCFAFAPTSSS-LS 221
               CY   ++++++ PT+++HF    +L  P + ++ P  ++ G FC A    ++S   
Sbjct: 325 GPQLCY--RTQTNLKGPTLTYHFEGANLLLTPIQTFIPPTPETKGVFCLAITNCANSDPG 382

Query: 222 IIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           I GN  Q    + F+L   ++ F P  C
Sbjct: 383 IYGNFAQTNYLIGFDLDRQIVSFKPTDC 410


>gi|242069057|ref|XP_002449805.1| hypothetical protein SORBIDRAFT_05g023600 [Sorghum bicolor]
 gi|241935648|gb|EES08793.1| hypothetical protein SORBIDRAFT_05g023600 [Sorghum bicolor]
          Length = 430

 Score =  103 bits (256), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 82/252 (32%), Positives = 121/252 (48%), Gaps = 17/252 (6%)

Query: 14  SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLVDRDSDSTSTLEF- 72
           SV  IA GCG +N GL   + G +GLG GSLS  +Q+    FSYCL D  + S S+  F 
Sbjct: 177 SVGGIAFGCGVDNGGLSYNSTGTVGLGRGSLSLVAQLGVGKFSYCLTDFFNTSLSSPVFF 236

Query: 73  ----------DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKI-DES 121
                      S+      + PL+++    + YY+ L GIS+G   LPI    F + D+ 
Sbjct: 237 GSLAELAASSASADAAVVQSTPLVQSPYNPSRYYVSLEGISLGDARLPIPNGTFDLNDDD 296

Query: 122 GNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPT 181
           G+GG+IVDSGT  T L    +  + D  V G       +  +L   C+   +    E+P 
Sbjct: 297 GSGGMIVDSGTIFTILVETGFRVVVD-HVAGVLGQPVVNASSLDRPCFPAPAAGVQELPD 355

Query: 182 VS---FHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT-SSSLSIIGNVQQQGTRVSFNL 237
           +     HF  G  + L   NY+   +   +FC     T S+S S++GN QQQ  ++ F++
Sbjct: 356 MPDMVLHFAGGADMRLHRDNYMSFNEEESSFCLNIVGTESASGSVLGNFQQQNIQMLFDI 415

Query: 238 RNSLIGFTPNKC 249
               + F P  C
Sbjct: 416 TVGQLSFMPTDC 427


>gi|21668075|gb|AAM74221.1|AF518565_1 putative chloroplast nucleoid DNA-binding protein [Brassica
           oleracea]
          Length = 165

 Score =  103 bits (256), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 62/159 (38%), Positives = 86/159 (54%), Gaps = 8/159 (5%)

Query: 93  TFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRG 152
           +FY L + GISVGG  L I +T F        G ++DSGT ++RL  + Y ALR AF   
Sbjct: 12  SFYGLDIVGISVGGQKLAIPQTVFSTP-----GALIDSGTVISRLPPKAYAALRGAFKAK 66

Query: 153 TRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFA 212
                 T  V++ DTC+D +   +V +PTVSF+F  G V+ L +K  L     +   C A
Sbjct: 67  MSQYKNTSAVSILDTCFDLTGFKTVTIPTVSFYFNGGAVVELGSKGVLYAFKMS-QVCLA 125

Query: 213 FAPTS--SSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           FA  S  ++ +I GNVQQQ   V ++     +GF PN C
Sbjct: 126 FAGNSDDNNAAIFGNVQQQTLEVVYDGAAGRVGFAPNGC 164


>gi|356525748|ref|XP_003531485.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 436

 Score =  103 bits (256), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 81/264 (30%), Positives = 125/264 (47%), Gaps = 24/264 (9%)

Query: 1   GDFVTETVTLGSASVD--NIAIGCGHNNEGLFV---GAAGLLGLGGGSLSFPSQIN---A 52
           G   TE++  GS +V       GCG NN+ +        G++GLG G LS  SQ+     
Sbjct: 179 GVLCTESIHFGSQTVTFPKTIFGCGSNNDFMHQISNKVTGIVGLGAGPLSLVSQLGDQIG 238

Query: 53  STFSYCLVDRDSDSTSTLEF--DSSLPPN-AVTAPLLRNHELDTFYYLGLTGISVGGDLL 109
             FSYCL+   S ST  L+F  D+++  N  V+ PL+ +    ++Y+L L GI++G  +L
Sbjct: 239 HKFSYCLLPFTSTSTIKLKFGNDTTITGNGVVSTPLIIDPHYPSYYFLHLVGITIGQKML 298

Query: 110 PISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPT--DGVALFDT 167
            +  T     +  NG II+D GT +T L+   Y+      +R    +S T  D    FD 
Sbjct: 299 QVRTT-----DHTNGNIIIDLGTVLTYLEVNFYHNFV-TLLREALGISETKDDIPYPFDF 352

Query: 168 CYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP--TSSSLSIIGN 225
           C  F +++++  P + F F   KV  L  KN     D     C A  P   +   S+ GN
Sbjct: 353 C--FPNQANITFPKIVFQFTGAKVF-LSPKNLFFRFDDLNMICLAVLPDFYAKGFSVFGN 409

Query: 226 VQQQGTRVSFNLRNSLIGFTPNKC 249
           + Q   +V ++ +   + F P  C
Sbjct: 410 LAQVDFQVEYDRKGKKVSFAPADC 433


>gi|357457681|ref|XP_003599121.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355488169|gb|AES69372.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 439

 Score =  103 bits (256), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 85/269 (31%), Positives = 135/269 (50%), Gaps = 32/269 (11%)

Query: 1   GDFVTETVTLGS-----ASVDNIAIGCGHNN----EGLFVGAAGLLGLGGGSLSFPSQIN 51
           GD   +T+TL S      S   I IGCGH N    EGL   A+G++G G G+ S  SQ+ 
Sbjct: 180 GDISKDTLTLNSNDGSPISFPKIVIGCGHKNSLTTEGL---ASGIIGFGRGNFSIVSQLG 236

Query: 52  AS---TFSYCLVDRDSDS--TSTLEF-DSSLPPN--AVTAPLLRNHELDTFYYLGLTGIS 103
           +S    FSYCL    S +  +S L F D ++      V+ PL+++  +   Y+  L   S
Sbjct: 237 SSIGGKFSYCLASLFSKANISSKLYFGDMAVVSGHGVVSTPLIQSFYVGN-YFTNLEAFS 295

Query: 104 VGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDA---FVRGTRALSPTD 160
           VG  ++ + +++   D  GN   ++DSG+ +T+L  + Y+ L  A    V+  R   PT 
Sbjct: 296 VGDHIIKLKDSSLIPDNEGNA--VIDSGSTITQLPNDVYSQLETAVISMVKLKRVKDPTQ 353

Query: 161 GVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSL 220
            ++L   CY  ++    EVP ++ HF  G  + L A N  I ++ +   CFAF  ++   
Sbjct: 354 QLSL---CYK-TTLKKYEVPIITAHF-RGADVKLNAFNTFIQMN-HEVMCFAFNSSAFPW 407

Query: 221 SIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
            + GN+ QQ   V ++   ++I F P  C
Sbjct: 408 VVYGNIAQQNFLVGYDTLKNIISFKPTNC 436


>gi|359476204|ref|XP_002262813.2| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-2-like [Vitis vinifera]
          Length = 460

 Score =  103 bits (256), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 92/269 (34%), Positives = 132/269 (49%), Gaps = 34/269 (12%)

Query: 1   GDFVTETVTLGSASV-DNIAIGCGHNNEGLF-VGAAGLLGLGGGSLSFPSQINAS---TF 55
           G++  +T+TL  + V      GCG NN+G F  G  G+LGLG G LS  SQ  +     F
Sbjct: 204 GNYGCDTMTLEPSDVFQKFQFGCGRNNKGDFGSGVDGMLGLGQGQLSTVSQTASKFNKVF 263

Query: 56  SYCLVDRDS-----------DSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISV 104
           SYCL + DS             +S+L+F S      V  P     +   +Y++ L+ ISV
Sbjct: 264 SYCLPEEDSIGSLLFGEKATSQSSSLKFTS-----LVNGP--GTLQESGYYFVNLSDISV 316

Query: 105 GGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVA- 163
           G + L I  + F      + G I+DS T +TRL    Y+AL+ AF +       ++G   
Sbjct: 317 GNERLNIPSSVF-----ASPGTIIDSRTVITRLPQRAYSALKAAFKKAMAKYPLSNGRRK 371

Query: 164 ---LFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSL 220
              + DTCY+ S R  V +P +  HF  G  + L   N +   D++   C AFA T S L
Sbjct: 372 KGDILDTCYNLSGRKDVLLPEIVLHFGGGADVRLNGTNIVWGSDAS-RLCLAFAGT-SEL 429

Query: 221 SIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           +IIGN QQ    V ++++   IGF  N C
Sbjct: 430 TIIGNRQQLSLTVLYDIQGRRIGFGGNGC 458


>gi|194699094|gb|ACF83631.1| unknown [Zea mays]
 gi|413938606|gb|AFW73157.1| hypothetical protein ZEAMMB73_333672 [Zea mays]
          Length = 452

 Score =  103 bits (256), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 92/251 (36%), Positives = 124/251 (49%), Gaps = 34/251 (13%)

Query: 14  SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSYCLVDRDSDS---T 67
           +V     GCGH   GLF G  GLLGLG    S   Q   +    FSYCL  + S +   T
Sbjct: 221 AVQGFFFGCGHAQSGLFNGVDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPSTAGYLT 280

Query: 68  STLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGII 127
             +   S   P   T  LL +    T+Y + LTGISVGG  L +  +AF           
Sbjct: 281 LGVGGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGGTV------ 334

Query: 128 VDSGTAVTRLQTETYNALRDAFVRGTRAL----SPTDGVALFDTCYDFSSRSSVEVPTVS 183
           VD+GT VTRL    Y ALR AF  G  +     +P++G+   DTCY+F+   +V +P V+
Sbjct: 335 VDTGTVVTRLPPTAYAALRSAFRSGMASYGYPTAPSNGI--LDTCYNFAGYGTVTLPNVA 392

Query: 184 FHFPEGKVLPLPAKNYLIPVDSNGTF-CFAFAPTSS--SLSIIGNVQQQGTRVSFNLR-- 238
             F  G  + L A   L       +F C AFAP+ S   ++I+GNVQQ+    SF +R  
Sbjct: 393 LTFGSGATVTLGADGIL-------SFGCLAFAPSGSDGGMAILGNVQQR----SFEVRID 441

Query: 239 NSLIGFTPNKC 249
            + +GF P+ C
Sbjct: 442 GTSVGFKPSSC 452


>gi|357118738|ref|XP_003561107.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 491

 Score =  103 bits (256), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 89/278 (32%), Positives = 123/278 (44%), Gaps = 51/278 (18%)

Query: 1   GDFVTETVTLGSA----SVDNIAIGCGHN--NEGLFVGA-AGLLGLGGGSLSFPSQINAS 53
           G ++++ +TL  A    ++     GC H     G F    +G++ LG G+ S P+Q  A+
Sbjct: 236 GTYISDVLTLNPAKPASAISEFRFGCSHALLQPGSFSNKTSGIMALGRGAQSLPTQTKAT 295

Query: 54  ---TFSYCLVDRDSDSTSTLEFDSSLPPNAVT----APLLRNHELDTFYYLGLTGISVGG 106
               FSYCL      S     F   +P  A +     P+LR+      Y + L  I V G
Sbjct: 296 YGDVFSYCLPPTPVHSGF---FILGVPRVAASRYAVTPMLRSKAAPMLYLVRLIAIEVAG 352

Query: 107 DLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFV---RGTRALSPTDGVA 163
             LP+    F        G ++DS T VTRL    Y ALR AFV   R  RA +P +   
Sbjct: 353 KRLPVPPAVFA------AGAVMDSRTIVTRLPPTAYMALRAAFVAEMRAYRAAAPKEH-- 404

Query: 164 LFDTCYDFS-----SRSSVEVPTVSFHF--PEGKVLPLPAKNYLIPVDSNGTF---CFAF 213
             DTCYDFS         V++P ++  F  P G V           +D +G     C AF
Sbjct: 405 -LDTCYDFSGAAPGGGGGVKLPKITLVFDGPNGAVE----------LDPSGVLLDGCLAF 453

Query: 214 APTS--SSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           AP +      IIGNVQQQ   V +N+  + +GF    C
Sbjct: 454 APNTDDQMTGIIGNVQQQALEVLYNVDGATVGFRRGAC 491


>gi|255554715|ref|XP_002518395.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
 gi|223542240|gb|EEF43782.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
          Length = 489

 Score =  102 bits (255), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 85/267 (31%), Positives = 127/267 (47%), Gaps = 23/267 (8%)

Query: 1   GDFVTETVTLG-----SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFP---SQINA 52
           G F  ETVT+G        + ++ IGC  +         G++GLG    S     ++I  
Sbjct: 218 GVFANETVTVGLNDHKKIRLFDVLIGCTESFNETNGFPDGVMGLGYRKHSLALRLAEIFG 277

Query: 53  STFSYCLVDRDSDSTST--LEF----DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGG 106
           + FSYCLVD  S S     L F    +  LP    T  LL    ++ FY + ++GISVGG
Sbjct: 278 NKFSYCLVDHLSSSNHKNFLSFGDIPEMKLPKMQHTELLLG--YINAFYPVNVSGISVGG 335

Query: 107 DLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAF---VRGTRALSPTDGVA 163
            +L IS   + +  +G GG+IVDSGT++T L  E Y+ + DA        + + P +   
Sbjct: 336 SMLSISSDIWNV--TGVGGMIVDSGTSLTMLAGEAYDKVVDALKPIFDKHKKVVPIELPE 393

Query: 164 LFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS-SSLSI 222
           L + C++        VP +  HF +G +   P K+Y+I V + G  C           SI
Sbjct: 394 LNNFCFEDKGFDRAAVPRLLIHFADGAIFKPPVKSYIIDV-AEGIKCLGIIKADFPGSSI 452

Query: 223 IGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           +GNV QQ     ++L    +GF P+ C
Sbjct: 453 LGNVMQQNHLWEYDLGRGKLGFGPSSC 479


>gi|115473845|ref|NP_001060521.1| Os07g0658600 [Oryza sativa Japonica Group]
 gi|22775625|dbj|BAC15479.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
 gi|50510141|dbj|BAD31106.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
 gi|113612057|dbj|BAF22435.1| Os07g0658600 [Oryza sativa Japonica Group]
          Length = 449

 Score =  102 bits (255), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 75/216 (34%), Positives = 108/216 (50%), Gaps = 18/216 (8%)

Query: 44  LSFPSQINASTFSYCLVDRDS-DSTSTLEFDSSLPPNAV-TAPLLRNHELDTFYYLGLTG 101
           LS    +  +TFSYCL    S + + TL    +  P  + T PLL N    + YY+ +TG
Sbjct: 239 LSQTKDMYGATFSYCLPSFKSLNFSGTLRLGRNGQPRRIKTTPLLANPHRSSLYYVNMTG 298

Query: 102 ISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR----GTRALS 157
           I VG  ++ I  +A   D +   G ++DSGT  TRL    Y ALRD   R    G  A+S
Sbjct: 299 IRVGKKVVSIPASALAFDPATGAGTVLDSGTMFTRLVAPVYLALRDEVRRRVGAGAAAVS 358

Query: 158 PTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT- 216
              G   FDTCY+    ++V  P V+  F +G  + LP +N +I      T C A A   
Sbjct: 359 SLGG---FDTCYN----TTVAWPPVTLLF-DGMQVTLPEENVVIHTTYGTTSCLAMAAAP 410

Query: 217 ---SSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
              ++ L++I ++QQQ  RV F++ N  +GF    C
Sbjct: 411 DGVNTVLNVIASMQQQNHRVLFDVPNGRVGFARESC 446


>gi|357118064|ref|XP_003560779.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 472

 Score =  102 bits (255), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 96/267 (35%), Positives = 126/267 (47%), Gaps = 36/267 (13%)

Query: 1   GDFVTETVTLGS-ASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ---INASTFS 56
           G + TET+ LGS A V +   GCG +  G +    GLLGLGG   S  SQ   +    FS
Sbjct: 224 GVYSTETLALGSSAVVKSFRFGCGSDQHGPYDKFDGLLGLGGAPESLVSQTASVYGGAFS 283

Query: 57  YCLVDRDSDSTSTLEFDSSLPPNA--------VTAPLLR-NHELDTFYYLGLTGISVGGD 107
           YCL   +S +     F +   PN+        V  P+   + ++ TFY + LTGISVGG 
Sbjct: 284 YCLPPLNSGA----GFLTLGAPNSTNNSNSGFVFTPMHAFSPKIATFYVVTLTGISVGGK 339

Query: 108 LLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRA----LSPTDGVA 163
            L I    F     GN   IVDSGT +T + T  Y ALR AF R   A    L P D  +
Sbjct: 340 ALDIPPAVF---AKGN---IVDSGTVITGIPTTAYKALRTAF-RSAMAEYPLLPPAD--S 390

Query: 164 LFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT-SSSLSI 222
             DTCY+F+   +V VP V+  F  G  + L   + ++  D     C AFA     S  I
Sbjct: 391 ALDTCYNFTGHGTVTVPKVALTFVGGATVDLDVPSGVLVED-----CLAFADAGDGSFGI 445

Query: 223 IGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           IGNV  +   V ++     +GF    C
Sbjct: 446 IGNVNTRTIEVLYDSGKGHLGFRAGAC 472


>gi|302781726|ref|XP_002972637.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
 gi|300160104|gb|EFJ26723.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
          Length = 393

 Score =  102 bits (255), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 88/268 (32%), Positives = 131/268 (48%), Gaps = 36/268 (13%)

Query: 1   GDFVTETVTLGSAS-----VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINA--- 52
           G+F  +T++LG+ S       + A+GCG  N G F G  GL+GLG G +S  SQ++A   
Sbjct: 140 GEFARDTISLGTTSGGSQKFPSFAVGCGMVNSG-FDGVDGLVGLGQGPVSLTSQLSAAID 198

Query: 53  STFSYCLVDRDSDSTST-LEFDSS-------LPPNAVTAPLLRNHELDTFYYLGLTGISV 104
           S FSYCLVD +S S S+ L F  S       +    +T P   +    T+Y L + GI+V
Sbjct: 199 SKFSYCLVDINSQSESSPLLFGPSAALHGTGIQSTKITPP---SDTYPTYYLLTVNGIAV 255

Query: 105 GGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVAL 164
            G  +              G  I+DSGT +T + +  Y  +  + +     L   DG ++
Sbjct: 256 AGQTM-----------GSPGTTIIDSGTTLTYVPSGVYGRVL-SRMESMVTLPRVDGSSM 303

Query: 165 -FDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNG-TFCFAFAPTSS-SLS 221
             D CYD SS  + + P ++       + P P+ NY + VD +G T C A        +S
Sbjct: 304 GLDLCYDRSSNRNYKFPALTIRLAGATMTP-PSSNYFLVVDDSGDTVCLAMGSAGGLPVS 362

Query: 222 IIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           IIGNV QQG  + ++  +S + F   KC
Sbjct: 363 IIGNVMQQGYHILYDRGSSELSFVQAKC 390


>gi|51038078|gb|AAT93881.1| hypothetical protein [Oryza sativa Japonica Group]
          Length = 481

 Score =  102 bits (255), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 89/292 (30%), Positives = 132/292 (45%), Gaps = 48/292 (16%)

Query: 5   TETVTLGSASVDNIAIGCGHNNE---GLFVGAAGLLGLGGGSLSFPSQINASTFSYCLVD 61
           T+  T  S+S   +A GC        G   GA+G++GLG G+LS  SQ+NA+ FSYCL  
Sbjct: 188 TDAFTFPSSSSVTLAFGCVSQTRISPGALNGASGIIGLGRGALSLVSQLNATEFSYCLTP 247

Query: 62  --RDSDSTSTL--------------EFDSSLPPNAVTAPLLRNHE---LDTFYYLGLTGI 102
             RD+ S S L                         T P  +N +     TFYYL L G+
Sbjct: 248 YFRDTVSPSHLFVGDGELAGLSAAAGGGGGGGAPVTTVPFAKNPKDSPFSTFYYLPLVGL 307

Query: 103 SVGGDLLPISETAFKIDESG----NGGIIVDSGTAVTRLQTETYNALRDAF---VRGTRA 155
           + G   + +   AF + E+      GG ++DSG+  TRL    + AL       +RG+ +
Sbjct: 308 AAGNATVALPAGAFDLREAAPKVWAGGALIDSGSPFTRLVDPAHRALTKELARQLRGSGS 367

Query: 156 LSPTDGV--ALFDTCY----DFSSRSSVEVPTVSFHFPE----GKVLPLPAKNYLIPVDS 205
           L P         + C     D  S ++  VP +   F +    G+ L +PA+ Y   V++
Sbjct: 368 LVPPPAKLGGALELCVEAGDDGDSLAAAAVPPLVLRFDDGVGGGRELVIPAEKYWARVEA 427

Query: 206 NGTFCFAFAPTSS--------SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           + T+C A   ++S          +IIGN  QQ  RV ++L N L+ F P  C
Sbjct: 428 S-TWCMAVVSSASGNATLPTNETTIIGNFMQQDMRVLYDLANGLLSFQPANC 478


>gi|357130715|ref|XP_003566992.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 479

 Score =  102 bits (255), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 82/261 (31%), Positives = 131/261 (50%), Gaps = 25/261 (9%)

Query: 10  LGSASVDNIAIGCGHNNEGL-FVGAAGLLGLGGGSLSFPSQINAS---TFSYCLVDRDS- 64
           +  A +  + +GC  +  G  F  + G+L LG  ++SF S   +     FSYCLVD  S 
Sbjct: 220 VKKAKLQGLVLGCTGSYTGPSFEASDGVLSLGYSNVSFASHAASRFGGRFSYCLVDHLSP 279

Query: 65  -DSTSTLEF--DSSLP--------PNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISE 113
            ++TS L F  +S+L         P A   PL+ +  +  FY + +  ISV G+LL I  
Sbjct: 280 RNATSYLTFGPNSALSGPCPAAAGPGARQTPLVLDSRMRPFYDVSIKAISVDGELLKIPR 339

Query: 114 TAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSS 173
             +++D  G GG+IVDSGT++T L    Y A+  A  +   A  P   +  F+ CY+++S
Sbjct: 340 DVWEVD--GGGGVIVDSGTSLTVLAKPAYRAVVAALGKKL-ARFPRVAMDPFEYCYNWTS 396

Query: 174 RSSV----EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP-TSSSLSIIGNVQQ 228
            S      ++P ++ HF     L  P+K+Y+I   + G  C          +S+IGN+ Q
Sbjct: 397 PSRKDEGDDLPKLAVHFAGSARLEPPSKSYVIDA-APGVKCIGVQEGPWPGISVIGNILQ 455

Query: 229 QGTRVSFNLRNSLIGFTPNKC 249
           Q     F+L+N  + F  ++C
Sbjct: 456 QEHLWEFDLKNRRLRFKRSRC 476


>gi|357125298|ref|XP_003564331.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 524

 Score =  102 bits (255), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 93/276 (33%), Positives = 128/276 (46%), Gaps = 38/276 (13%)

Query: 1   GDFVTETVTLG-SASVDNIAIGCGHNNEGLFVG-AAGLLGLGGGSLSFPSQINAS---TF 55
           G ++T+ +T+    S  N   GC H   G F G  +G + LGGG  S  SQ   +    F
Sbjct: 260 GTYMTDILTISPGTSFLNFRFGCSHGVRGSFSGETSGTMSLGGGRQSLLSQTARAYGNAF 319

Query: 56  SYCLVDRDSDSTSTL-------EFDSSLPPNAVTAPLLRNHEL--DTFYYLGLTGISVGG 106
           SYC+    +    +L       + DS  P + VT PL+RN  +   T+Y + L GI V G
Sbjct: 320 SYCVPKPSASGFLSLGGAINDGDSDSDSPSSFVTTPLMRNARIVNPTYYVVRLQGIDVAG 379

Query: 107 DLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAF---VRGTR--------A 155
             L +    F      +GG ++DS   VT+L    Y ALR AF   +RG R        +
Sbjct: 380 RRLNVPPVVF------SGGTLMDSSAVVTQLPPTAYRALRLAFRNAMRGYRMNTRNGSTS 433

Query: 156 LSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP 215
            +P  G  + DTCYDF    +V VPTVS  F  G V+ L     ++        C AF P
Sbjct: 434 STPAGGEMILDTCYDFEGLDNVTVPTVSLVFFGGAVVDLDPTTAVMMEG-----CLAFVP 488

Query: 216 TSS--SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           T +   L  IGNVQQQ   V +++    +GF    C
Sbjct: 489 TPADFDLGFIGNVQQQTHEVLYDVGARNVGFRRGAC 524


>gi|225427554|ref|XP_002266533.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
           vinifera]
          Length = 447

 Score =  102 bits (254), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 96/266 (36%), Positives = 132/266 (49%), Gaps = 23/266 (8%)

Query: 1   GDFVTETVTLGS-----ASVDNIAIGCGHNNEGLF-VGAAGLLGLGGGSLSFPSQIN--- 51
           GD   +T+T+GS      SV  +  GCGHNN G F +  +GL+GLGGG LS  SQ+    
Sbjct: 184 GDLAVDTLTIGSTTGRPVSVPKVVFGCGHNNGGTFELHGSGLVGLGGGPLSMISQLRPLI 243

Query: 52  ASTFSYCLVD--RDSDSTSTLEFDSS---LPPNAVTAPLLRNHELDTFYYLGLTGISVGG 106
              FSYCLV    D   +S + F S        AV+ PL  + + DTFYYL L  +SVG 
Sbjct: 244 GGRFSYCLVPLGNDPSVSSKMHFGSRGIVSGAGAVSTPL-ASRQPDTFYYLTLESMSVGS 302

Query: 107 DLLP---ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVA 163
             L     S+    + ++  G II+DSGT +T L  + Y  L    V         D   
Sbjct: 303 KKLAYKGFSKVGSPLADADEGNIIIDSGTTLTLLPQDFYGTLESNVVSAIGGKPVRDPNN 362

Query: 164 LFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSII 223
           +F  CY  S+ S + +PT++ HF  G  L L   N  + V  +  FCFA  P  S L+I 
Sbjct: 363 VFSLCY--SNLSGLRIPTITAHF-VGADLELKPLNTFVQVQED-LFCFAMIPV-SDLAIF 417

Query: 224 GNVQQQGTRVSFNLRNSLIGFTPNKC 249
           GN+ Q    V ++L++  + F P  C
Sbjct: 418 GNLAQMNFLVGYDLKSRTVSFKPTDC 443


>gi|222631382|gb|EEE63514.1| hypothetical protein OsJ_18330 [Oryza sativa Japonica Group]
          Length = 464

 Score =  102 bits (254), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 89/292 (30%), Positives = 132/292 (45%), Gaps = 48/292 (16%)

Query: 5   TETVTLGSASVDNIAIGCGHNNE---GLFVGAAGLLGLGGGSLSFPSQINASTFSYCLVD 61
           T+  T  S+S   +A GC        G   GA+G++GLG G+LS  SQ+NA+ FSYCL  
Sbjct: 171 TDAFTFPSSSSVTLAFGCVSQTRISPGALNGASGIIGLGRGALSLVSQLNATEFSYCLTP 230

Query: 62  --RDSDSTSTL--------------EFDSSLPPNAVTAPLLRNHE---LDTFYYLGLTGI 102
             RD+ S S L                         T P  +N +     TFYYL L G+
Sbjct: 231 YFRDTVSPSHLFVGDGELAGLRAAAGGGGGGGAPVTTVPFAKNPKDSPFSTFYYLPLVGL 290

Query: 103 SVGGDLLPISETAFKIDESG----NGGIIVDSGTAVTRLQTETYNALRDAF---VRGTRA 155
           + G   + +   AF + E+      GG ++DSG+  TRL    + AL       +RG+ +
Sbjct: 291 AAGNATVALPAGAFDLREAAPKVWAGGALIDSGSPFTRLVDPAHRALTKELARQLRGSGS 350

Query: 156 LSPTDGV--ALFDTCY----DFSSRSSVEVPTVSFHFPE----GKVLPLPAKNYLIPVDS 205
           L P         + C     D  S ++  VP +   F +    G+ L +PA+ Y   V++
Sbjct: 351 LVPPPAKLGGALELCVEAGDDGDSLAAAAVPPLVLRFDDGVGGGRELVIPAEKYWARVEA 410

Query: 206 NGTFCFAFAPTSS--------SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           + T+C A   ++S          +IIGN  QQ  RV ++L N L+ F P  C
Sbjct: 411 S-TWCMAVVSSASGNATLPTNETTIIGNFMQQDMRVLYDLANGLLSFQPANC 461


>gi|222637611|gb|EEE67743.1| hypothetical protein OsJ_25435 [Oryza sativa Japonica Group]
          Length = 396

 Score =  102 bits (254), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 75/216 (34%), Positives = 108/216 (50%), Gaps = 18/216 (8%)

Query: 44  LSFPSQINASTFSYCLVDRDS-DSTSTLEFDSSLPPNAV-TAPLLRNHELDTFYYLGLTG 101
           LS    +  +TFSYCL    S + + TL    +  P  + T PLL N    + YY+ +TG
Sbjct: 186 LSQTKDMYGATFSYCLPSFKSLNFSGTLRLGRNGQPRRIKTTPLLANPHRSSLYYVNMTG 245

Query: 102 ISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR----GTRALS 157
           I VG  ++ I  +A   D +   G ++DSGT  TRL    Y ALRD   R    G  A+S
Sbjct: 246 IRVGKKVVSIPASALAFDPATGAGTVLDSGTMFTRLVAPVYLALRDEVRRRVGAGAAAVS 305

Query: 158 PTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT- 216
              G   FDTCY+    ++V  P V+  F +G  + LP +N +I      T C A A   
Sbjct: 306 SLGG---FDTCYN----TTVAWPPVTLLF-DGMQVTLPEENVVIHTTYGTTSCLAMAAAP 357

Query: 217 ---SSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
              ++ L++I ++QQQ  RV F++ N  +GF    C
Sbjct: 358 DGVNTVLNVIASMQQQNHRVLFDVPNGRVGFARESC 393


>gi|110740049|dbj|BAF01928.1| nucleoid DNA-binding protein cnd41 - like protein [Arabidopsis
           thaliana]
          Length = 183

 Score =  102 bits (254), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 62/161 (38%), Positives = 85/161 (52%), Gaps = 12/161 (7%)

Query: 93  TFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRG 152
           +FY L +  I+VGG  LPI  T F        G ++DSGT +TRL  + Y ALR +F   
Sbjct: 30  SFYGLNIVAITVGGQKLPIPSTVFSTP-----GALIDSGTVITRLPPKAYAALRSSFKAK 84

Query: 153 TRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKN--YLIPVDSNGTFC 210
                 T GV++ DTC+D S   +V +P V+F F  G V+ L +K   Y+  +      C
Sbjct: 85  MSKYPTTSGVSILDTCFDLSGFKTVTIPKVAFSFSGGAVVELGSKGIFYVFKISQ---VC 141

Query: 211 FAFAPTS--SSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
            AFA  S  S+ +I GNVQQQ   V ++     +GF PN C
Sbjct: 142 LAFAGNSDDSNAAIFGNVQQQTLEVVYDGAGGRVGFAPNGC 182


>gi|147814824|emb|CAN65806.1| hypothetical protein VITISV_015630 [Vitis vinifera]
          Length = 449

 Score =  102 bits (254), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 82/269 (30%), Positives = 128/269 (47%), Gaps = 26/269 (9%)

Query: 1   GDFVTETVTL-----GSASVDNIAIGCGHNNEGL-FVGAAGLLGLGGGSLSFP---SQIN 51
           G F  ETVT+         + N+ IGC  + +G  F  A G++GLG    SF    ++  
Sbjct: 186 GFFANETVTVELKEGRKMKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSFAIKAAEKF 245

Query: 52  ASTFSYCLVDRDSDS--TSTLEFDSSLPPNA----VTAPLLRNHELDTFYYLGLTGISVG 105
              FSYCLVD  S    ++ L F SS    A    +T   L    +++FY + + GIS+G
Sbjct: 246 GGKFSYCLVDHLSHKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIG 305

Query: 106 GDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYN----ALRDAFVRGTRALSPTDG 161
           G +L I    +  D  G GG I+DSG+++T L    Y     ALR + ++  +       
Sbjct: 306 GAMLKIPSEVW--DVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKV---EMD 360

Query: 162 VALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS-SL 220
           +   + C++ +      VP + FHF +G     P K+Y+I   ++G  C  F   +    
Sbjct: 361 IGPLEYCFNSTGFEESLVPRLVFHFADGAEFEPPVKSYVISA-ADGVRCLGFVSVAWPGT 419

Query: 221 SIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           S++GN+ QQ     F+L    +GF P+ C
Sbjct: 420 SVVGNIMQQNHLWEFDLGLKKLGFAPSSC 448


>gi|297736090|emb|CBI24128.3| unnamed protein product [Vitis vinifera]
          Length = 378

 Score =  102 bits (254), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 82/269 (30%), Positives = 128/269 (47%), Gaps = 26/269 (9%)

Query: 1   GDFVTETVTL-----GSASVDNIAIGCGHNNEGL-FVGAAGLLGLGGGSLSFP---SQIN 51
           G F  ETVT+         + N+ IGC  + +G  F  A G++GLG    SF    ++  
Sbjct: 115 GFFANETVTVELKEGRKMKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSFAIKAAEKF 174

Query: 52  ASTFSYCLVDRDSDS--TSTLEFDSSLPPNA----VTAPLLRNHELDTFYYLGLTGISVG 105
              FSYCLVD  S    ++ L F SS    A    +T   L    +++FY + + GIS+G
Sbjct: 175 GGKFSYCLVDHLSHKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIG 234

Query: 106 GDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYN----ALRDAFVRGTRALSPTDG 161
           G +L I    +  D  G GG I+DSG+++T L    Y     ALR + ++  +       
Sbjct: 235 GAMLKIPSEVW--DVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKV---EMD 289

Query: 162 VALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS-SL 220
           +   + C++ +      VP + FHF +G     P K+Y+I   ++G  C  F   +    
Sbjct: 290 IGPLEYCFNSTGFEESLVPRLVFHFADGAEFEPPVKSYVISA-ADGVRCLGFVSVAWPGT 348

Query: 221 SIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           S++GN+ QQ     F+L    +GF P+ C
Sbjct: 349 SVVGNIMQQNHLWEFDLGLKKLGFAPSSC 377


>gi|359494621|ref|XP_002265771.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 449

 Score =  102 bits (253), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 82/269 (30%), Positives = 128/269 (47%), Gaps = 26/269 (9%)

Query: 1   GDFVTETVTL-----GSASVDNIAIGCGHNNEGL-FVGAAGLLGLGGGSLSFP---SQIN 51
           G F  ETVT+         + N+ IGC  + +G  F  A G++GLG    SF    ++  
Sbjct: 186 GFFANETVTVELKEGRKMKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSFAIKAAEKF 245

Query: 52  ASTFSYCLVDRDSDS--TSTLEFDSSLPPNA----VTAPLLRNHELDTFYYLGLTGISVG 105
              FSYCLVD  S    ++ L F SS    A    +T   L    +++FY + + GIS+G
Sbjct: 246 GGKFSYCLVDHLSHKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIG 305

Query: 106 GDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYN----ALRDAFVRGTRALSPTDG 161
           G +L I    +  D  G GG I+DSG+++T L    Y     ALR + ++  +       
Sbjct: 306 GAMLKIPSEVW--DVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKV---EMD 360

Query: 162 VALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS-SL 220
           +   + C++ +      VP + FHF +G     P K+Y+I   ++G  C  F   +    
Sbjct: 361 IGPLEYCFNSTGFEESLVPRLVFHFADGAEFEPPVKSYVISA-ADGVRCLGFVSVAWPGT 419

Query: 221 SIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           S++GN+ QQ     F+L    +GF P+ C
Sbjct: 420 SVVGNIMQQNHLWEFDLGLKKLGFAPSSC 448


>gi|388502484|gb|AFK39308.1| unknown [Medicago truncatula]
          Length = 425

 Score =  101 bits (252), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 90/254 (35%), Positives = 119/254 (46%), Gaps = 24/254 (9%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ---INASTFSY 57
            + V +T+TL +  V +   GC     G      GLLGLG G LS  SQ   +  STFSY
Sbjct: 176 ANLVQDTITLATDPVPSYTFGCVSKTTGTSAPPQGLLGLGRGPLSLLSQTQNLYQSTFSY 235

Query: 58  CLVDRDSDSTSTLEFDSSL-------PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP 110
           CL      S  +L F  SL       P      PLL+N    + YY+ L  I VG  ++ 
Sbjct: 236 CL-----PSFKSLNFSGSLRLGPVAQPKRIKYTPLLKNPRRSSLYYVNLEAIRVGRKVVD 290

Query: 111 ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYD 170
           I   A   + +   G I DSGT  TRL    Y A+RD F R          +  FDTCY+
Sbjct: 291 IPPAALAFNPTTGAGTIFDSGTVFTRLVAPVYVAVRDEFRRRVGPKLTVTSLGGFDTCYN 350

Query: 171 FSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFA----PTSSSLSIIGNV 226
                 + VPT++F F  G  + LP  N LI   +  T C A A      +S L++I N+
Sbjct: 351 V----PIVVPTITFIF-TGMNVTLPQDNILIHSTAGSTTCLAMAGAPDNVNSVLNVIANM 405

Query: 227 QQQGTRVSFNLRNS 240
           QQQ  RV +++ NS
Sbjct: 406 QQQNHRVLYDVPNS 419


>gi|326503794|dbj|BAK02683.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 456

 Score =  101 bits (252), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 94/266 (35%), Positives = 129/266 (48%), Gaps = 39/266 (14%)

Query: 1   GDFVTETVTL-GSASVDNIAIGCGHNNEGLFVGAA--GLLGLGGGSLSFPSQINAS---T 54
           G + ++T+ L  S +V +   GC H+ E  F G    GL+GLGG + S  SQ  A+   +
Sbjct: 213 GTYSSDTLALSASDTVTDFHFGCSHHEED-FDGEKIDGLMGLGGDAQSLVSQTAATYGKS 271

Query: 55  FSYCLVDRDSDSTSTLEFDSSLPPNA-----VTAPLLRNHELDTFYYLGLTGISVGGDLL 109
           FSYCL   +  S   L F +   PN      VT P+LR  +  T Y + L  ISVGG  L
Sbjct: 272 FSYCLPPTNRTS-GFLTFGA---PNGTSGGFVTTPMLRWPKAPTLYGVLLQDISVGGTPL 327

Query: 110 PISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAF------VRGTRALSPTDGVA 163
            I  +        + G ++DSGT +T L    Y+AL  AF      +R  RA +P   + 
Sbjct: 328 GIQPSVL------SNGSVMDSGTVITWLPRRAYSALSSAFRSSMTRLRHQRA-AP---LG 377

Query: 164 LFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSII 223
           + DTCYDF+   +V +P VS     G V+ L     +I        C AFA TS   SII
Sbjct: 378 ILDTCYDFTGLVNVSIPAVSLVLDGGAVVDLDGNGIMI------QDCLAFAATSGD-SII 430

Query: 224 GNVQQQGTRVSFNLRNSLIGFTPNKC 249
           GNVQQ+   V  ++   + GF    C
Sbjct: 431 GNVQQRTFEVLHDVGQGVFGFRSGAC 456


>gi|225427556|ref|XP_002266575.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
          Length = 445

 Score =  101 bits (252), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 87/268 (32%), Positives = 128/268 (47%), Gaps = 29/268 (10%)

Query: 2   DFVTETVTLGS-----ASVDNIAIGCGHNNEGLF-----VGAAGLLGLGGGSLSFPSQIN 51
           D  +ET T+GS     AS   +A GCGH+N G F            G     +   S++ 
Sbjct: 184 DLSSETFTIGSTEGDPASFPGLAFGCGHSNGGTFNEKDSGLIGLGGGPLSLVMQLSSKVG 243

Query: 52  ASTFSYCLVDRDSDST--STLEFDSSLPPNA---VTAPLLRNHELDTFYYLGLTGISVGG 106
              FSYCLV   SDST  S + F  S   +    V+ PL++    DTFYYL L G+S+G 
Sbjct: 244 GQ-FSYCLVPLSSDSTASSKINFGKSAVVSGSGTVSTPLIKGTP-DTFYYLTLEGMSLGS 301

Query: 107 DLLPISETAFKIDESG-----NGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDG 161
           +   ++   F  ++S         II+DSGT +T L  + Y  +  A  +     + TD 
Sbjct: 302 E--KVAFKGFSKNKSSPAAAEESNIIIDSGTTLTLLPRDFYTDMESALTKVIGGQTTTDP 359

Query: 162 VALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLS 221
              F  CY  S    +E+PT++ HF  G  + LP  N  +    +   CF+  P SS+L+
Sbjct: 360 RGTFSLCY--SGVKKLEIPTITAHFI-GADVQLPPLNTFVQAQED-LVCFSMIP-SSNLA 414

Query: 222 IIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           I GN+ Q    V ++L+N+ + F P  C
Sbjct: 415 IFGNLSQMNFLVGYDLKNNKVSFKPTDC 442


>gi|449493359|ref|XP_004159266.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
          Length = 511

 Score =  101 bits (252), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 83/275 (30%), Positives = 116/275 (42%), Gaps = 30/275 (10%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
           G  ++ET+ L +  V +  +GC   +       AG+ G G G  S PSQ+    FS+CLV
Sbjct: 240 GILLSETLDLENKRVPDFLVGCSVMSVH---QPAGIAGFGRGPESLPSQMRLKRFSHCLV 296

Query: 61  DR---DSDSTSTL------EFDSSLPPNAVTAPLLRNHELDT-----FYYLGLTGISVGG 106
            R   DS  +S L      E D S   + + AP   N  +       +YYL L  I +GG
Sbjct: 297 SRGFDDSPVSSPLVLDSGSESDESKTKSFIYAPFRENPSVSNAAFREYYYLSLRRILIGG 356

Query: 107 DLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRD----AFVRGTRALSPTDGV 162
             +         D +GNGG I+DSG+  T L    + A+ D      V+  RA    +  
Sbjct: 357 KPVKFPYKYLVPDSTGNGGAIIDSGSTFTFLDKPIFEAIADELEKQLVKYPRAKD-VEAQ 415

Query: 163 ALFDTCYDF-SSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLS 221
           +    C++      S E P V   F  G  L L A+NYL  V   G  C       + + 
Sbjct: 416 SGLRPCFNIPKEEESAEFPDVVLKFKGGGKLSLAAENYLAMVTDEGVVCLTMMTDEAVVG 475

Query: 222 -------IIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
                  I+G  QQQ   V ++L    IGF   KC
Sbjct: 476 GGGGPAIILGAFQQQNVLVEYDLAKQRIGFRKQKC 510


>gi|357143657|ref|XP_003573000.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-2-like [Brachypodium distachyon]
          Length = 464

 Score =  101 bits (252), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 93/263 (35%), Positives = 121/263 (46%), Gaps = 24/263 (9%)

Query: 1   GDFVTETVTLGSAS---VDNIAIGCGHNNEGLFV-GAAGLLGLGGGSLSFPSQINA---S 53
           G + ++T+TL   S   +     GC     G       GL+GLGG + SF SQ  A   S
Sbjct: 212 GTYGSDTLTLAGTSEPLISGFQFGCSAVEHGFEEDNTDGLMGLGGDAQSFVSQTAATYGS 271

Query: 54  TFSYCLV-DRDSDSTSTL-EFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPI 111
            FSYCL    +S    TL    SS      T P+LR+ +  TFY L L GISVGG  L I
Sbjct: 272 AFSYCLPPTWNSSGFLTLGAPSSSTSAAFSTTPMLRSKQAATFYGLLLRGISVGGKTLEI 331

Query: 112 SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTR--ALSPTDGVALFDTCY 169
             + F      + G IVDSGT +TRL    Y AL  AF  G       P     L DTC+
Sbjct: 332 PSSVF------SAGSIVDSGTVITRLPPTAYGALSAAFRDGMARYQYQPAAPRGLLDTCF 385

Query: 170 DFSSR---SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNV 226
           DF+     ++  VP+V+     G V+ L        +  +G   FA         IIGNV
Sbjct: 386 DFTGHGEGNNFTVPSVALVLDGGAVVDLHPNG----IVQDGCLAFAATDDDGRTGIIGNV 441

Query: 227 QQQGTRVSFNLRNSLIGFTPNKC 249
           QQ+   V +++  S+ GF P  C
Sbjct: 442 QQRTFEVLYDVGQSVFGFRPGAC 464


>gi|56202144|dbj|BAD73477.1| chloroplast nucleoid DNA binding protein-like [Oryza sativa
           Japonica Group]
 gi|125571574|gb|EAZ13089.1| hypothetical protein OsJ_03009 [Oryza sativa Japonica Group]
          Length = 316

 Score =  101 bits (252), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 83/275 (30%), Positives = 130/275 (47%), Gaps = 41/275 (14%)

Query: 12  SASVDNIAIGCGHNNEGL-FVGAAGLLGLGGGSLSFPSQINAS---TFSYCLVDR--DSD 65
            A +  + +GC  +  G  F+ + G+L LG  ++SF S+  +     FSYCLVD     +
Sbjct: 44  KAKLRGVVLGCTTSYNGQSFLASDGVLSLGYSNISFASRAASRFGGRFSYCLVDHLAPRN 103

Query: 66  STSTLEFD-----SSLPPNAVTA---------------------PLLRNHELDTFYYLGL 99
           +TS L F      SS  P+  TA                     PL+ +H    FY + +
Sbjct: 104 ATSYLTFGPNPAFSSRRPSEGTASCKPAPAPTPAPAGAPGARQTPLVLDHRTRPFYAVTV 163

Query: 100 TGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPT 159
            G+SV G+LL I    + +++   GG I+DSGT++T L    Y A+  A  +    L P 
Sbjct: 164 KGVSVAGELLKIPRAVWDVEQ--GGGAILDSGTSLTMLAKPAYRAVVAALSKRLAGL-PR 220

Query: 160 DGVALFDTCYDFSSRSSVEV----PTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP 215
             +  FD CY+++S S  +V    P ++ HF     L  PAK+Y+I   + G  C     
Sbjct: 221 VTMDPFDYCYNWTSPSGSDVAAPLPMLAVHFAGSARLEPPAKSYVIDA-APGVKCIGLQE 279

Query: 216 -TSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
                LS+IGN+ QQ     ++L+N  + F  ++C
Sbjct: 280 GPWPGLSVIGNILQQEHLWEYDLKNRRLRFKRSRC 314


>gi|356508308|ref|XP_003522900.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 439

 Score =  101 bits (252), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 91/267 (34%), Positives = 122/267 (45%), Gaps = 28/267 (10%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ---INASTFSY 57
            + V +TVTL +  + +   GC     G      GLLGLG G LS  SQ   +  STFSY
Sbjct: 181 ANVVQDTVTLATDPIPDYTFGCVAKTTGASAPPQGLLGLGRGPLSLLSQTQNLYQSTFSY 240

Query: 58  CLVDRDSDSTSTLEFDSSL-------PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP 110
           CL      S  +L F  SL       P      PLL+N    + YY+ L  I VG  ++ 
Sbjct: 241 CL-----PSFKSLNFSGSLRLGPVAQPIRIKYTPLLKNPRRSSLYYVNLVAIRVGRKVVD 295

Query: 111 ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR----GTRALSPTDGVALFD 166
           I   A   + +   G + DSGT  TRL    Y A+RD F R      +A      +  FD
Sbjct: 296 IPPEALAFNAATGAGTVFDSGTVFTRLVAPAYTAVRDEFQRRVAIAAKANLTVTSLGGFD 355

Query: 167 TCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP----TSSSLSI 222
           TCY       +  PT++F F  G  + LP  N LI   +  T C A A      +S L++
Sbjct: 356 TCYTV----PIVAPTITFMF-SGMNVTLPEDNILIHSTAGSTTCLAMASAPDNVNSVLNV 410

Query: 223 IGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           I N+QQQ  RV +++ NS +G     C
Sbjct: 411 IANMQQQNHRVLYDVPNSRLGVARELC 437


>gi|147811402|emb|CAN61225.1| hypothetical protein VITISV_006732 [Vitis vinifera]
          Length = 440

 Score =  101 bits (251), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 85/266 (31%), Positives = 121/266 (45%), Gaps = 26/266 (9%)

Query: 1   GDFVTETVTLGS-----ASVDNIAIGCGHNNEGLF-VGAAGLLGLGGGSLSFPSQINAS- 53
           G   TET+TL S      S+ NI  GCGHNN G F     GL G GG  LS  SQI ++ 
Sbjct: 180 GVIATETLTLNSNSGQPXSIXNIVFGCGHNNSGTFNENEMGLFGTGGRPLSLTSQIMSTL 239

Query: 54  ----TFSYCLVDRDSDS--TSTLEF--DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVG 105
                FS CLV   +D   TS + F  ++ +  + V +  L   +  T+Y++ L GISVG
Sbjct: 240 GSGRKFSQCLVPFRTDPSITSKIIFGPEAEVSGSXVVSTPLVTKDDPTYYFVTLDGISVG 299

Query: 106 GDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALF 165
             L P S ++     +  G + +D+GT  T L  + YN L    V+G +   P + V   
Sbjct: 300 DKLFPFSSSS---PMATKGNVFIDAGTPPTLLPRDFYNRL----VQGVKEAIPMEPVQDP 352

Query: 166 DTCYDFSSRSS--VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSII 223
           D       RS+  ++ P ++ HF    V   P   ++ P    G +CFA  P      I 
Sbjct: 353 DLQPQLCYRSATLIDGPILTAHFDGADVQLKPLNTFISP--KEGVYCFAMQPIDGDTGIF 410

Query: 224 GNVQQQGTRVSFNLRNSLIGFTPNKC 249
           GN  Q    + F+L    + F    C
Sbjct: 411 GNFVQMNFLIGFDLDGKKVSFKAVDC 436


>gi|414589629|tpg|DAA40200.1| TPA: hypothetical protein ZEAMMB73_727364, partial [Zea mays]
          Length = 201

 Score =  101 bits (251), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 63/175 (36%), Positives = 89/175 (50%), Gaps = 8/175 (4%)

Query: 82  TAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTET 141
           T PLL++ +  TFYY+  TG++VG   L I E+AF +   G+GG+IVDSGTA+T L    
Sbjct: 28  TTPLLQSPQNPTFYYVHFTGLTVGARRLRIPESAFALRPDGSGGVIVDSGTALTLLPAAV 87

Query: 142 YNALRDAFVRGTRAL-----SPTDGVALF--DTCYDFSSRSSVEVPTVSFHFPEGKVLPL 194
              +  AF +  R       +P DGV           SS S + VP +  HF +G  L L
Sbjct: 88  LAEVVRAFRQQLRLPFANGGNPEDGVCFLVPAAWRRSSSTSQMPVPRMVLHF-QGADLDL 146

Query: 195 PAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           P +NY++     G  C   A +    S IGN+ QQ  RV ++L    +   P +C
Sbjct: 147 PRRNYVLDDHRRGRLCLLLADSGDDGSTIGNLVQQDMRVLYDLEAETLSIAPARC 201


>gi|357159298|ref|XP_003578403.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 442

 Score =  101 bits (251), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 87/266 (32%), Positives = 127/266 (47%), Gaps = 21/266 (7%)

Query: 1   GDFVTETVTLGSASVDNIAIGC---GHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSY 57
           G   TE+    S +  ++A GC        G    A+GL+GLG G LS  SQI A+ FSY
Sbjct: 178 GSLGTESFAFESGTT-SLAFGCVSLTRITSGALNDASGLIGLGRGRLSLVSQIGATRFSY 236

Query: 58  CLVD--RDSDSTSTL--EFDSSLPPNAVTAPLL---RNHELDTFYYLGLTGISVGGDLLP 110
           CL      S ++S L     +SL     + P +   +++   TFYYL L GI+VG   LP
Sbjct: 237 CLTPYFHSSGASSHLFVGASASLGGGGASMPFVKSPKDYPYSTFYYLPLEGITVGKTRLP 296

Query: 111 -ISETAFKIDE----SGNGGIIVDSGTAVTRLQTETYNALRD--AFVRGTRALSPTDGVA 163
            ++ T F++ +       GG+I+D+G+ +T+L +  Y AL++  A   G  +L P    +
Sbjct: 297 AVNSTTFQLRQLFKGYWAGGVIIDTGSPLTQLASHAYEALKEEVAAQLGNGSLVPAPEDS 356

Query: 164 LFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSII 223
             + C        V VP + FHF  G  + +PA +Y  PVD     C          SII
Sbjct: 357 GLELCVAREGFQKV-VPALVFHFGGGADMAVPAASYWAPVDKAAA-CMMILEGGYD-SII 413

Query: 224 GNVQQQGTRVSFNLRNSLIGFTPNKC 249
           GN QQQ   + ++LR     F    C
Sbjct: 414 GNFQQQDMHLLYDLRRGRFSFQTADC 439


>gi|356540510|ref|XP_003538731.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 417

 Score =  101 bits (251), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 84/278 (30%), Positives = 117/278 (42%), Gaps = 39/278 (14%)

Query: 6   ETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN------ASTFSYCL 59
           +T+++    + N   GC H          G+ G G G LS P+Q+        + FSYCL
Sbjct: 133 DTLSMSQLFLKNFTFGCAHT---ALAEPTGVAGFGRGLLSLPAQLATLSPNLGNRFSYCL 189

Query: 60  VDRDSDSTSTLE--------FD--SSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLL 109
           V    D     +        +D  SS     V   +LRN +   FY +GLTGISVG   +
Sbjct: 190 VSHSFDKERVRKPSPLILGHYDDYSSERVEFVYTSMLRNPKHSYFYCVGLTGISVGKRTI 249

Query: 110 PISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGT----RALSPTDGVALF 165
              E   ++D  G+GG++VDSGT  T L    YN++   F R      +  S  +     
Sbjct: 250 LAPEMLRRVDRRGDGGVVVDSGTTFTMLPASLYNSVVAEFDRRVGRVHKRASEVEEKTGL 309

Query: 166 DTCYDFSSRSSVEVPTVSFHF-PEGKVLPLPAKNYLIPV--------DSNGTFCFAFAPT 216
             CY       VEVPTV++HF      + LP  NY               G         
Sbjct: 310 GPCYFLEGL--VEVPTVTWHFLGNNSNVMLPRMNYFYEFLDGEDEARRKVGCLMLMNGGD 367

Query: 217 SSSLS-----IIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
            + LS     I+GN QQQG  V ++L N  +GF   +C
Sbjct: 368 DTELSGGPGAILGNYQQQGFEVVYDLENQRVGFAKRQC 405


>gi|357487593|ref|XP_003614084.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355515419|gb|AES97042.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 412

 Score =  100 bits (250), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 89/261 (34%), Positives = 127/261 (48%), Gaps = 29/261 (11%)

Query: 6   ETVTLGS-----ASVDNIAIGCGHNNEGLFVG-AAGLLGLGGGSLSFPSQINAS---TFS 56
           +T+TL S      S  NI IGCGH N+G   G  +G +GL  G LSF SQ+N+S    FS
Sbjct: 161 DTLTLNSNNGTPISFKNIVIGCGHRNQGPLEGYVSGNIGLARGPLSFISQLNSSIGGKFS 220

Query: 57  YCLVD--RDSDSTSTLEF-DSSLPPN--AVTAPLLRNHELDTFYYLGLTGISVGGDLLPI 111
           YCLV      + +S L F D S       V+ P+    + +  Y++ L   SVG  ++ +
Sbjct: 221 YCLVPLFSKENVSSKLHFGDKSTVSGLGTVSTPI----KEENGYFVSLEAFSVGDHIIKL 276

Query: 112 SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDF 171
             +    D  GN   I+DSGT +T L  + Y+ L    +   +     D    F+ CY  
Sbjct: 277 ENS----DNRGNS--IIDSGTTMTILPKDVYSRLESVVLDMVKLKRVKDPSQQFNLCYQT 330

Query: 172 SSRSSV-EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP--TSSSLSIIGNVQQ 228
           +S + + +V  ++ HF  G  + L A N   P+ ++   CFAF      SSL+I GNV Q
Sbjct: 331 TSTTLLTKVLIITAHF-SGSEVHLNALNTFYPI-TDEVICFAFVSGGNFSSLAIFGNVVQ 388

Query: 229 QGTRVSFNLRNSLIGFTPNKC 249
           Q   V F+L    I F P  C
Sbjct: 389 QNFLVGFDLNKKTISFKPTDC 409


>gi|297720195|ref|NP_001172459.1| Os01g0608366 [Oryza sativa Japonica Group]
 gi|53792202|dbj|BAD52835.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
           Group]
 gi|255673454|dbj|BAH91189.1| Os01g0608366 [Oryza sativa Japonica Group]
          Length = 452

 Score =  100 bits (250), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 82/249 (32%), Positives = 118/249 (47%), Gaps = 25/249 (10%)

Query: 1   GDFVTETVTL-GSASVDNIAIGCGHNN--EGLFVGAAGLLGLGGGSLSFPSQINA---ST 54
           G + ++ +TL GS  V     GC H     G+     GL+GLGG + S  SQ  A    +
Sbjct: 203 GTYSSDVLTLSGSDVVRGFQFGCSHAELGAGMDDKTDGLIGLGGDAQSPVSQTAARYGKS 262

Query: 55  FSYCLVDRDSDS-----TSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLL 109
           F YCL    + S      +             T P+LR+ ++ T+Y+  L  I+VGG  L
Sbjct: 263 FFYCLPATPASSGFLTLGAPASGGGGGASRFATTPMLRSKKVPTYYFAALEDIAVGGKKL 322

Query: 110 PISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCY 169
            +S + F        G +VDSGT +TRL    Y AL  AF  G    +  + + + DTC+
Sbjct: 323 GLSPSVFA------AGSLVDSGTVITRLPPAAYAALSSAFRAGMTRYARAEPLGILDTCF 376

Query: 170 DFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS--SSLSIIGNVQ 227
           +F+    V +PTV+  F  G V+ L A   +    S G  C AFAPT    +   IGNVQ
Sbjct: 377 NFTGLDKVSIPTVALVFAGGAVVDLDAHGIV----SGG--CLAFAPTRDDKAFGTIGNVQ 430

Query: 228 QQGTRVSFN 236
           Q+   V ++
Sbjct: 431 QRTFEVLYD 439


>gi|357116170|ref|XP_003559856.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 460

 Score =  100 bits (250), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 72/224 (32%), Positives = 104/224 (46%), Gaps = 30/224 (13%)

Query: 50  INASTFSYCLVDRDSDSTSTLEFDSSL---------PPNAVTAPLLRNHELDTFYYLGLT 100
           I   TFSYCL    S   S   F  SL         P    T PLL +    + YY+ +T
Sbjct: 238 IYEGTFSYCL---PSYYRSAANFSGSLTLGRKGQPAPEKMKTTPLLASPHRPSLYYVAMT 294

Query: 101 GISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR--------- 151
           G+ +G   +PI  +A   D +   G ++DSGT   RL    Y A+RD   R         
Sbjct: 295 GVRIGKKSVPIPPSALAFDAATGAGTVLDSGTMFARLAQPAYAAVRDEVRRRVAGSLRRR 354

Query: 152 -GTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFC 210
            G  A      +  FDTCY+    S+V  P V+  F  G  + LP +N +I      T C
Sbjct: 355 GGGGASVSVSSLGGFDTCYNV---STVAWPAVTLVFGGGMEVRLPEENVVIRSTYGSTSC 411

Query: 211 FAFAPT-----SSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
            A A +     +++L++IG++QQQ  RV F++ N+ +GF   +C
Sbjct: 412 LAMAASPADGVNAALNVIGSLQQQNHRVLFDVPNARVGFARERC 455


>gi|220702733|gb|ACL81165.1| aspartyl protease [Mirabilis jalapa]
          Length = 499

 Score =  100 bits (250), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 81/273 (29%), Positives = 111/273 (40%), Gaps = 40/273 (14%)

Query: 14  SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN------ASTFSYCLVDRDSDST 67
           S+ +   GC H+  G  +G AG    G GSLS P+Q+        + FSYCLV    DST
Sbjct: 220 SLKDFTFGCAHSALGEPIGVAGF---GFGSLSLPAQLANLSPDLGNQFSYCLVSHSFDST 276

Query: 68  S-----------TLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
                         E D       V  P+L N +   FY + +  ISVG   +       
Sbjct: 277 KLHHPSPLILGKVKERDFDEITQFVYTPMLDNPKHPYFYSVSMEAISVGSSRVRAPNALI 336

Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGT----RALSPTDGVALFDTCYDFS 172
           +ID  GNGG++VDSGT  T L T  YN++     R      +  S T+       CY   
Sbjct: 337 RIDRDGNGGVVVDSGTTYTMLPTGFYNSVATELDRRVGRVFKRASETESKTGLSPCYYLE 396

Query: 173 ----SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPV-------DSNGTFCFAFAPTSSSL- 220
                R  + VP ++FHF     + LP +NY                 C           
Sbjct: 397 GNGVERLGLVVPRLAFHFGGNYSVVLPRRNYFYEFLDGEDEKKGRKVGCLMLMDGGDESE 456

Query: 221 ----SIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
               + +GN QQQG +V ++L    +GF P KC
Sbjct: 457 GGPGATLGNYQQQGFQVVYDLEERRVGFAPRKC 489


>gi|357118871|ref|XP_003561171.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 506

 Score =  100 bits (250), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 83/269 (30%), Positives = 118/269 (43%), Gaps = 38/269 (14%)

Query: 1   GDFVTETVTLGS---ASVDNIAIGCGHN--NEGLFVGA-AGLLGLGGGSLSFPSQINAS- 53
           G +V++ +TL +    +V     GC H     G F    AG + LG G+ S  SQ   + 
Sbjct: 256 GTYVSDLLTLNADPKGAVSKFQFGCSHALLRPGSFNNKTAGFMALGRGAQSLSSQTKGTF 315

Query: 54  ----TFSYCLVDRDSDST-STLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDL 108
                FSYCL    S     +L             P+L++      Y + L GI V G  
Sbjct: 316 SKGNVFSYCLPPTGSHKGFLSLGVPQHAASRYAVTPMLKSKMAPMIYMVRLIGIDVAGQR 375

Query: 109 LPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAF---VRGTRALSPTDGVALF 165
           LP+    F  + +      +DS T +TRL    Y ALR AF   +R  RA++P       
Sbjct: 376 LPVPPAVFAANAA------MDSRTIITRLPPTAYMALRAAFRAQMRAYRAVAPK---GQL 426

Query: 166 DTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTF---CFAFAPTSSSL-- 220
           DTCYDF+    V +P V+  F          +N  + +D +G     C AFAP ++    
Sbjct: 427 DTCYDFTGVPMVRLPKVTLVF---------DRNAAVELDPSGVMLDSCLAFAPNANDFMP 477

Query: 221 SIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
            IIGNVQQQ   V +N+  + +GF    C
Sbjct: 478 GIIGNVQQQTLEVLYNVDGASVGFRRAAC 506


>gi|225436202|ref|XP_002271145.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
           vinifera]
          Length = 440

 Score =  100 bits (249), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 86/267 (32%), Positives = 120/267 (44%), Gaps = 28/267 (10%)

Query: 1   GDFVTETVTLGS-----ASVDNIAIGCGHNNEGLF-VGAAGLLGLGGGSLSFPSQINAS- 53
           G   TET+TL S      S+ NI  GCGHNN G F     GL G GG  LS  SQI ++ 
Sbjct: 180 GVIATETLTLNSNSGQPTSILNIVFGCGHNNSGTFNENEMGLFGTGGRPLSLTSQIMSTL 239

Query: 54  ----TFSYCLVDRDSDS--TSTLEFDSSLP---PNAVTAPLLRNHELDTFYYLGLTGISV 104
                FS CLV   +D   TS + F         + V+ PL+   +  T+Y++ L GISV
Sbjct: 240 GSGRKFSQCLVPFRTDPSITSKIIFGPEAEVSGSDVVSTPLVTKDD-PTYYFVTLDGISV 298

Query: 105 GGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVAL 164
           G  L P S ++     +  G + +D+GT  T L  + YN L    V+G +   P + V  
Sbjct: 299 GDKLFPFSSSS---PMATKGNVFIDAGTPPTLLPRDFYNRL----VQGVKEAIPMEPVQD 351

Query: 165 FDTCYDFSSRSS--VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSI 222
            D       RS+  ++ P ++ HF    V   P   ++ P    G +CFA  P      I
Sbjct: 352 PDLQPQLCYRSATLIDGPILTAHFDGADVQLKPLNTFISP--KEGVYCFAMQPIDGDTGI 409

Query: 223 IGNVQQQGTRVSFNLRNSLIGFTPNKC 249
            GN  Q    + F+L    + F    C
Sbjct: 410 FGNFVQMNFLIGFDLDGKKVSFKAVDC 436


>gi|116789442|gb|ABK25248.1| unknown [Picea sitchensis]
          Length = 366

 Score =  100 bits (248), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 64/108 (59%), Positives = 79/108 (73%), Gaps = 4/108 (3%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQI---NASTFSY 57
           G F TET+T G+ SV N+AIGCGH N GLF+GAAGLLGLG G+LSFP+QI      TFSY
Sbjct: 244 GSFATETLTFGTTSVANVAIGCGHKNVGLFIGAAGLLGLGAGALSFPNQIGTQTGHTFSY 303

Query: 58  CLVDRDSDSTSTLEF-DSSLPPNAVTAPLLRNHELDTFYYLGLTGISV 104
           CLVDR+SDS+  L+F   S+P  ++  PL +N  L TFYYL +T IS+
Sbjct: 304 CLVDRESDSSGPLQFGPKSVPVGSIFTPLEKNPHLPTFYYLSVTAISI 351


>gi|125527257|gb|EAY75371.1| hypothetical protein OsI_03267 [Oryza sativa Indica Group]
          Length = 484

 Score =  100 bits (248), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 80/275 (29%), Positives = 128/275 (46%), Gaps = 41/275 (14%)

Query: 12  SASVDNIAIGCGHNNEGL-FVGAAGLLGLGGGSLSFPSQINAS---TFSYCLVDR--DSD 65
            A +  + +GC  +  G  F+ + G+L LG  ++SF S+  +     FSYCLVD     +
Sbjct: 212 KAKLRGVVLGCTTSYNGQSFLASDGVLSLGYSNISFASRAASRFGGRFSYCLVDHLAPRN 271

Query: 66  STSTL------EFDSSLPPNAVTA--------------------PLLRNHELDTFYYLGL 99
           +TS L       F S  P   + +                    PL+ +H    FY + +
Sbjct: 272 ATSYLTFGPNPAFSSRRPSEGIASCKPAPAPTPAPAGAPGARQTPLVLDHRTRPFYAVTV 331

Query: 100 TGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPT 159
            G+SV G+LL I    + +++   GG I+DSGT++T L    Y A+  A  +    L P 
Sbjct: 332 KGVSVAGELLKIPRAVWDVEQ--GGGAILDSGTSLTMLAKPAYRAVVAALSKRLAGL-PR 388

Query: 160 DGVALFDTCYDFSSRSSVEV----PTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP 215
             +  FD CY+++S S  +V    P ++ HF     L  PAK+Y+I   + G  C     
Sbjct: 389 VTMDPFDYCYNWTSPSGSDVAAPLPMLAVHFAGSARLEPPAKSYVIDA-APGVKCIGLQE 447

Query: 216 -TSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
                LS+IGN+ QQ     ++L+N  + F  ++C
Sbjct: 448 GPWPGLSVIGNILQQEHLWEYDLKNRRLRFKRSRC 482


>gi|84453222|dbj|BAE71208.1| hypothetical protein [Trifolium pratense]
 gi|84453226|dbj|BAE71210.1| hypothetical protein [Trifolium pratense]
          Length = 437

 Score =  100 bits (248), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 72/211 (34%), Positives = 96/211 (45%), Gaps = 21/211 (9%)

Query: 50  INASTFSYCLVDRDSDSTSTLEFDSSL-------PPNAVTAPLLRNHELDTFYYLGLTGI 102
           I +  FSYCL      S  +  F  SL       P +  T PLL N    + YY+ LT I
Sbjct: 236 IYSGVFSYCL-----PSFKSYYFSGSLKLGPVGQPKSIRTTPLLHNPHRPSLYYVNLTAI 290

Query: 103 SVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGV 162
           SVG   +P+       + S   G I+DSGT +TR     YNA+RD F +  +   P   +
Sbjct: 291 SVGRVYVPLPSELLAFNPSTGAGTIIDSGTVITRFVEPIYNAVRDEFRK--QVTGPFSSL 348

Query: 163 ALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSS--- 219
             FDTC  F        P ++ HF +   L LP +N LI   S    C A A   S+   
Sbjct: 349 GAFDTC--FVKNYETLAPAITLHFTDLD-LKLPLENSLIHSSSGSLACLAMAAAPSNVNS 405

Query: 220 -LSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
            L++I N QQQ  RV F+  N+ +G     C
Sbjct: 406 VLNVIANFQQQNLRVLFDTVNNKVGIARELC 436


>gi|413923981|gb|AFW63913.1| hypothetical protein ZEAMMB73_837345 [Zea mays]
          Length = 414

 Score =  100 bits (248), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 65/178 (36%), Positives = 93/178 (52%), Gaps = 19/178 (10%)

Query: 54  TFSYCLVDRDSDSTSTLEFDS-----SLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDL 108
           TFSY LV+ DSD+ S + F       + P    TA    +   DTFYY+ L G+ VGG+L
Sbjct: 5   TFSYRLVEHDSDAVSKVVFREDDLVLAHPELKYTAFTPTSSPADTFYYVKLKGVLVGGEL 64

Query: 109 LPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTD-GVALFDT 167
           L IS   + + + G+GG I+DSGT ++      Y A+            P+D G+   + 
Sbjct: 65  LKISSDTWDVGKDGSGGTIIDSGTTLSYFVEPVYQAV------------PSDPGLLGAEP 112

Query: 168 CYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSS-LSIIG 224
           CY+ S     EVP +S  FP+G V   PA+NY + +D +   C A   TS + +SIIG
Sbjct: 113 CYNVSGMERPEVPELSLLFPDGAVWDFPAENYFVRLDPDDIMCLAVLGTSRTGMSIIG 170


>gi|300681506|emb|CBH32600.1| pepsin A, putative, expressed [Triticum aestivum]
          Length = 477

 Score = 99.8 bits (247), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 81/268 (30%), Positives = 123/268 (45%), Gaps = 38/268 (14%)

Query: 12  SASVDNIAIGCGHNNEGL-FVGAAGLLGLGGGSLSFPSQINAS---TFSYCLVDRDS--D 65
            A +  + +GC  +  G  F  + G+L LG   +SF S   +     FSYCLVD  S  +
Sbjct: 216 KAKLKGLVLGCSSSYTGPSFEASDGVLSLGYSGISFASHAASRFGGRFSYCLVDHLSPRN 275

Query: 66  STSTLEFDSSLPPN-------------------AVTAPLLRNHELDTFYYLGLTGISVGG 106
           +TS L F     PN                   A   PLL +  +  FY + L  ISV G
Sbjct: 276 ATSYLTFG----PNPAVSSPRASPSSCAAAAPRARQTPLLLDRRMRPFYDVSLKAISVAG 331

Query: 107 DLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFD 166
           + L I    + ++    GG+I+DSGT++T L    Y A+  A  +G   L P   +  F+
Sbjct: 332 EFLKIPRAVWDVE--AGGGVILDSGTSLTVLAKPAYRAVVAALSKGLAGL-PRVTMDPFE 388

Query: 167 TCYDFSSRS----SVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP-TSSSLS 221
            CY+++S S     V VP ++ HF     L  P K+Y+I   + G  C          +S
Sbjct: 389 YCYNWTSPSGKDADVAVPKMAVHFAGAARLEPPGKSYVIDA-APGVKCIGLQEGPWPGIS 447

Query: 222 IIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           +IGN+ QQ     F+++N  + F  ++C
Sbjct: 448 VIGNILQQEHLWEFDIKNRRLKFQRSRC 475


>gi|357481199|ref|XP_003610885.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355512220|gb|AES93843.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 416

 Score = 99.8 bits (247), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 82/249 (32%), Positives = 116/249 (46%), Gaps = 20/249 (8%)

Query: 13  ASVDNIAIGCGHNNEGLFV-GAAGLLGLGGGSLSFPSQIN----ASTFSYCLVD--RDSD 65
            S+     GCGHNN G F     GL+GLGGG  S  SQI        FS CLV    D  
Sbjct: 173 VSLSRFLFGCGHNNTGGFNDHEMGLIGLGGGPTSLISQIGPLFGGKKFSQCLVPFLTDIK 232

Query: 66  STSTLEF---DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESG 122
            +S + F      L    VT PL+   E DT Y++ L GISV     P++ T       G
Sbjct: 233 ISSRMSFGKGSQVLGNGVVTTPLVP-REKDTSYFVTLLGISVEDTYFPMNSTI------G 285

Query: 123 NGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPTV 182
              ++VDSGT    L  + Y+ +  A VR   AL P        T   + ++++++ PT+
Sbjct: 286 KANMLVDSGTPPILLPQQLYDKVF-AEVRNKVALKPITDDPSLGTQLCYRTQTNLKGPTL 344

Query: 183 SFHFPEGKVLPLPAKNYLIPV-DSNGTFCFA-FAPTSSSLSIIGNVQQQGTRVSFNLRNS 240
           +FHF    VL  P + ++ P   + G FC A +  T+S   + GN  Q    + F+L   
Sbjct: 345 TFHFVGANVLLTPIQTFIPPTPQTKGIFCLAIYNRTNSDPGVYGNFAQSNYLIGFDLDRQ 404

Query: 241 LIGFTPNKC 249
           ++ F P  C
Sbjct: 405 VVSFKPTDC 413


>gi|125555051|gb|EAZ00657.1| hypothetical protein OsI_22678 [Oryza sativa Indica Group]
          Length = 435

 Score = 99.8 bits (247), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 82/271 (30%), Positives = 122/271 (45%), Gaps = 27/271 (9%)

Query: 1   GDFVTETVTLG-SASVDNIAIGCGH--NNEGLFVGAAGLLGLGGGSLSFPSQI------- 50
           G  V +T+TL  SA+      GC     +   F GA GL+ L   S S  S++       
Sbjct: 170 GTLVRDTLTLPPSATFAGFTFGCIEVGADADTFDGAVGLIDLSRSSHSLASRVISNGATT 229

Query: 51  NASTFSYCLVDRDSDSTST-LEFDSSLPP----NAVTAPLLRNHELDTFYYLGLTGISVG 105
           +A+ FSYCL    + S+   L   +S P     +   AP+  N      Y++ L GISVG
Sbjct: 230 SAAAFSYCLPSSSATSSRGFLSIGASRPEYSGGDIKYAPMSSNPNHPNSYFVELVGISVG 289

Query: 106 GDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALF 165
           G+ LP+    F        G ++++ T  T L    Y ALRDAF R            + 
Sbjct: 290 GEDLPVPPAVFAAH-----GTLLEAATEFTFLAPAAYAALRDAFRRDMAPYPAAPPFRVL 344

Query: 166 DTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFC-------FAFAPTSS 218
           DTCY+ +  +S+ VPTV+  F  G  L L  +  +   D +  F         A    + 
Sbjct: 345 DTCYNLTGLASLAVPTVALRFAGGTELELDVRQMMYFADPSSVFSSVACLAFAAAPLPAF 404

Query: 219 SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
            +S+IG + Q+ T V ++LR   +GF P +C
Sbjct: 405 PVSVIGTLAQRSTEVVYDLRGGRVGFIPGRC 435


>gi|357465299|ref|XP_003602931.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355491979|gb|AES73182.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 438

 Score = 99.8 bits (247), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 71/207 (34%), Positives = 96/207 (46%), Gaps = 22/207 (10%)

Query: 55  FSYCLVDRDSDSTSTLEFDSSL-------PPNAVTAPLLRNHELDTFYYLGLTGISVGGD 107
           FSYCL      S  +  F  SL       P +  T PLLRN    + Y++ LTGI+VG  
Sbjct: 241 FSYCL-----PSFKSYYFSGSLKLGPVGQPKSIRTTPLLRNPRRPSLYFVNLTGITVGKV 295

Query: 108 LLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDT 167
            +P  +     D +   G I+DSGT +TR     YNA+RD F +  +   P   +  FDT
Sbjct: 296 NVPFPKELLAFDVNTGSGTIIDSGTVITRFVEPVYNAVRDEFRK--QVTGPFSSLGAFDT 353

Query: 168 CYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSS-----LSI 222
           C  F        P ++ HF +   L LP +N LI   S    C A A T  +     L++
Sbjct: 354 C--FVKNYETLAPAITLHFTDLD-LKLPLENSLIHSSSGSLACLAMASTPKNVNYTVLNV 410

Query: 223 IGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           I N QQQ  RV F+  N+ +G     C
Sbjct: 411 IANYQQQNLRVLFDTVNNKVGIARELC 437


>gi|125579874|gb|EAZ21020.1| hypothetical protein OsJ_36669 [Oryza sativa Japonica Group]
          Length = 382

 Score = 99.8 bits (247), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 79/232 (34%), Positives = 111/232 (47%), Gaps = 20/232 (8%)

Query: 34  AGLLGLGGGSLSFPSQINASTFSYCLVD--RDSDSTSTLEFDSSLP----PNAVTAPLLR 87
           +GL+GLG G LS  SQ  A+ FSYCL     ++ +T  L   +S       + +T   ++
Sbjct: 152 SGLMGLGRGRLSLVSQTGATKFSYCLTPYFHNNGATGHLFVGASASLGGHGDVMTTQFVK 211

Query: 88  NHELDTFYYLGLTGISVGGDLLPISETAFKIDESG----NGGIIVDSGTAVTRLQTETYN 143
             +   FYYL L G++VG   LPI  T F + E      +GG+I+DSG+  T L  + Y+
Sbjct: 212 GPKGSPFYYLPLIGLTVGETRLPIPATVFDLREVAPGLFSGGVIIDSGSPFTSLVHDAYD 271

Query: 144 ALRD---AFVRGTRALSPTDGVALFDTCYDFSSRSSV--EVPTVSFHFPEGKVLPLPAKN 198
           AL     A + G+    P D     D      +R  V   VP V FHF  G  + +PA++
Sbjct: 272 ALASELAARLNGSLVAPPPDA----DDGALCVARRDVGRVVPAVVFHFRGGADMAVPAES 327

Query: 199 YLIPVDSNGTFCFAFAP-TSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           Y  PVD         +       S+IGN QQQ  RV ++L N    F P  C
Sbjct: 328 YWAPVDKAAACMAIASAGPYRRQSVIGNYQQQNMRVLYDLANGDFSFQPADC 379


>gi|9759559|dbj|BAB11161.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
 gi|21553652|gb|AAM62745.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
 gi|109134179|gb|ABG25087.1| At5g07030 [Arabidopsis thaliana]
          Length = 439

 Score = 99.8 bits (247), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 72/221 (32%), Positives = 103/221 (46%), Gaps = 28/221 (12%)

Query: 44  LSFPSQINASTFSYCLVDRDSDSTSTLEFDSSL-------PPNAVTAPLLRNHELDTFYY 96
           +S    I  STFSYCL      S  +L F  SL       P       LLRN    + YY
Sbjct: 231 MSQAQSIYKSTFSYCL-----PSFRSLTFSGSLRLGPTSQPQRVKYTQLLRNPRRSSLYY 285

Query: 97  LGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRAL 156
           + L  I VG  ++ +   A   + S   G I DSGT  TRL    Y A+R+ F +  +  
Sbjct: 286 VNLVAIRVGRKVVDLPPAAIAFNPSTGAGTIFDSGTVYTRLAKPVYEAVRNEFRKRVK-- 343

Query: 157 SPTDGVAL----FDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFA 212
            PT  V      FDTCY       V+VPT++F F +G  + +PA N ++   +  T C A
Sbjct: 344 -PTTAVVTSLGGFDTCYS----GQVKVPTITFMF-KGVNMTMPADNLMLHSTAGSTSCLA 397

Query: 213 FAP----TSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
            A      +S +++I ++QQQ  RV  ++ N  +G    +C
Sbjct: 398 MAAAPENVNSVVNVIASMQQQNHRVLIDVPNGRLGLARERC 438


>gi|413953788|gb|AFW86437.1| hypothetical protein ZEAMMB73_618532 [Zea mays]
          Length = 469

 Score = 99.4 bits (246), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 87/257 (33%), Positives = 121/257 (47%), Gaps = 20/257 (7%)

Query: 1   GDFVTETVTLG-SASVDNIAIGCGHNNE-GLFVGAAGLLGLGGGSLSFPSQINAS----T 54
           G++ T+ +TLG  A V     GCGH+ + G F  A G+LGLG    S   Q +A      
Sbjct: 225 GEYSTDALTLGPGAIVKRFHFGCGHHQQRGKFDMADGVLGLGRLPQSLAWQASARRGGGV 284

Query: 55  FSYCLVDRDSDSTSTLEFDSSLPPNA-VTAPLLRNHELDTFYYLGLTGISVGGDLLPISE 113
           FS+CL      ST  L   +    +A V  PLL   +   FY L  T ISV G LL I  
Sbjct: 285 FSHCLPPTGV-STGFLALGAPHDTSAFVFTPLLTMDDQPWFYQLMPTAISVAGQLLDIPP 343

Query: 114 TAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSS 173
             F+       G+I DSGT ++ LQ   Y ALR AF            V   DTC++F+ 
Sbjct: 344 AVFR------EGVITDSGTVLSALQETAYTALRTAFRSAMAEYPLAPPVGHLDTCFNFTG 397

Query: 174 RSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLS-IIGNVQQQGTR 232
             +V VPTVS  F  G  + L A + ++ +D     C AF  +    + +IG+V Q+   
Sbjct: 398 YDNVTVPTVSLTFRGGATVHLDASSGVL-MDG----CLAFWSSGDEYTGLIGSVSQRTIE 452

Query: 233 VSFNLRNSLIGFTPNKC 249
           V +++    +GF    C
Sbjct: 453 VLYDMPGRKVGFRTGAC 469


>gi|125571841|gb|EAZ13356.1| hypothetical protein OsJ_03278 [Oryza sativa Japonica Group]
          Length = 447

 Score = 99.4 bits (246), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 82/271 (30%), Positives = 119/271 (43%), Gaps = 25/271 (9%)

Query: 1   GDFVTETVTLGSAS-VDNIAIGCGHNNEGLFVGAAGLLGLGGGSL-----SFPSQINAST 54
           GD  T+ +   + + V+N+ +GCG +NEGLF  AAGLLG    +       +P +   S+
Sbjct: 178 GDLATDKLAFANDTYVNNVTLGCGRDNEGLFDSAAGLLGRRAAARYPSRRRWPRRTAPSS 237

Query: 55  FSYCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRN-----HELDTFYYLGLTGISVG--GD 107
            +     R +   +     ++                      T+ + G    + G  G 
Sbjct: 238 STASATGRRAQRAARTSCSAARRSRRPRRSPPCCRTRGARACTTWTWPGSASAARGSPGS 297

Query: 108 LLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGV---AL 164
             P S    +    G    +VDSGTA++R   + Y ALRDAF    RA          ++
Sbjct: 298 RTPASRWTRRRGRGGV---VVDSGTAISRFARDAYAALRDAFDARARAAGMRRLAGEHSV 354

Query: 165 FDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVD------SNGTFCFAFAPTSS 218
           FD CYD   R +   P +  HF  G  + LP +NY +PVD      ++   C  F     
Sbjct: 355 FDACYDLRGRPAASAPLIVLHFAGGADMALPPENYFLPVDGGRRRAASYRRCLGFEAADD 414

Query: 219 SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
            LS+IGNVQQQG RV F++    IGF P  C
Sbjct: 415 GLSVIGNVQQQGFRVVFDVEKERIGFAPKGC 445


>gi|356539555|ref|XP_003538263.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 438

 Score = 99.4 bits (246), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 91/267 (34%), Positives = 121/267 (45%), Gaps = 28/267 (10%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ---INASTFSY 57
            + V +TVTL +  +     GC     G      GLLGLG G LS  SQ   +  STFSY
Sbjct: 180 ANVVQDTVTLATDPIPGYTFGCVAKTTGPSTPPQGLLGLGRGPLSLLSQTQNLYQSTFSY 239

Query: 58  CLVDRDSDSTSTLEFDSSL-------PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP 110
           CL      S  +L F  SL       P      PLL+N    + YY+ L  I VG  ++ 
Sbjct: 240 CL-----PSFKSLNFSGSLRLGPVAQPIRIKYTPLLKNPRRSSLYYVNLFAIRVGRKIVD 294

Query: 111 ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR----GTRALSPTDGVALFD 166
           I   A   + +   G + DSGT  TRL    Y A+RD F R      +A      +  FD
Sbjct: 295 IPPAALAFNAATGAGTVFDSGTVFTRLVAPVYTAVRDEFRRRVAMAAKANLTVTSLGGFD 354

Query: 167 TCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP----TSSSLSI 222
           TCY       +  PT++F F  G  + LP  N LI   +  T C A A      +S L++
Sbjct: 355 TCYTV----PIVAPTITFMF-SGMNVTLPQDNILIHSTAGSTSCLAMASAPDNVNSVLNV 409

Query: 223 IGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           I N+QQQ  RV +++ NS +G     C
Sbjct: 410 IANMQQQNHRVLYDVPNSRLGVARELC 436


>gi|79507883|ref|NP_196320.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332003717|gb|AED91100.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 455

 Score = 99.4 bits (246), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 72/221 (32%), Positives = 103/221 (46%), Gaps = 28/221 (12%)

Query: 44  LSFPSQINASTFSYCLVDRDSDSTSTLEFDSSL-------PPNAVTAPLLRNHELDTFYY 96
           +S    I  STFSYCL      S  +L F  SL       P       LLRN    + YY
Sbjct: 247 MSQAQSIYKSTFSYCL-----PSFRSLTFSGSLRLGPTSQPQRVKYTQLLRNPRRSSLYY 301

Query: 97  LGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRAL 156
           + L  I VG  ++ +   A   + S   G I DSGT  TRL    Y A+R+ F +  +  
Sbjct: 302 VNLVAIRVGRKVVDLPPAAIAFNPSTGAGTIFDSGTVYTRLAKPVYEAVRNEFRKRVK-- 359

Query: 157 SPTDGVAL----FDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFA 212
            PT  V      FDTCY       V+VPT++F F +G  + +PA N ++   +  T C A
Sbjct: 360 -PTTAVVTSLGGFDTCYS----GQVKVPTITFMF-KGVNMTMPADNLMLHSTAGSTSCLA 413

Query: 213 FAP----TSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
            A      +S +++I ++QQQ  RV  ++ N  +G    +C
Sbjct: 414 MAAAPENVNSVVNVIASMQQQNHRVLIDVPNGRLGLARERC 454


>gi|242051593|ref|XP_002454942.1| hypothetical protein SORBIDRAFT_03g001790 [Sorghum bicolor]
 gi|241926917|gb|EES00062.1| hypothetical protein SORBIDRAFT_03g001790 [Sorghum bicolor]
          Length = 431

 Score = 99.4 bits (246), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 72/234 (30%), Positives = 109/234 (46%), Gaps = 12/234 (5%)

Query: 21  GCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLVDRDSDSTSTLEFD--SSLPP 78
           GCG   +G  +GA+G+LG+    LS  SQ+    FSYCL       +S L F   + L  
Sbjct: 202 GCGALTDGNLLGASGILGMSPAILSMVSQLAIPKFSYCLTPYTDRKSSPLFFGAWADLGR 261

Query: 79  NAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQ 138
              T P+ ++  L  +YY+ L G+S+G   L +    F + +   GG +VD G  V +L 
Sbjct: 262 YKTTGPIQKS--LTFYYYVPLVGLSLGTRRLDVPAATFALKQ---GGTVVDLGCTVGQLA 316

Query: 139 TETYNALRDAFVRGTRALSPTDGVALFDTCYDFSS---RSSVEVPTVSFHFPEGKVLPLP 195
              + AL++A +           V  +  C+   S     +V+ P +  +F  G  + LP
Sbjct: 317 EPAFTALKEAVLHTLNLPLTNRTVKDYKVCFALPSGVAMGAVQTPPLVLYFDGGADMVLP 376

Query: 196 AKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
             NY     + G  C A  P    +SIIGNVQQQ   + F++ +S   F P  C
Sbjct: 377 RDNYF-QEPTAGLMCLALVP-GGGMSIIGNVQQQNFHLLFDVHDSKFLFAPTIC 428


>gi|218189440|gb|EEC71867.1| hypothetical protein OsI_04576 [Oryza sativa Indica Group]
          Length = 508

 Score = 99.0 bits (245), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 74/250 (29%), Positives = 112/250 (44%), Gaps = 10/250 (4%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
           G    +     +   D +  GC    EG      G++GLG G LS  SQ+    FSY L 
Sbjct: 193 GLLAVDAFAFATVRADGVIFGCAVATEGDI---GGVIGLGRGELSLVSQLQIGRFSYYLA 249

Query: 61  DRDS-DSTSTLEFDSSLPPN---AVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
             D+ D  S + F     P    AV+ PL+ N    + YY+ L GI V G+ L I    F
Sbjct: 250 PDDAVDVGSFILFLDDAKPRTSRAVSTPLVANRASRSLYYVELAGIRVDGEDLAIPRGTF 309

Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVAL-FDTCYDFSSRS 175
            +   G+GG+++     VT L    Y  +R A       L   DG  L  D CY   S +
Sbjct: 310 DLQADGSGGVVLSITIPVTFLDAGAYKVVRQAMASKI-GLRAADGSELGLDLCYTSESLA 368

Query: 176 SVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT-SSSLSIIGNVQQQGTRVS 234
           + +VP+++  F  G V+ L   NY     + G  C    P+ +   S++G++ Q GT + 
Sbjct: 369 TAKVPSMALVFAGGAVMELEMGNYFYMDSTTGLECLTILPSPAGDGSLLGSLIQVGTHMI 428

Query: 235 FNLRNSLIGF 244
           +++  S + F
Sbjct: 429 YDISGSRLVF 438


>gi|115465373|ref|NP_001056286.1| Os05g0557100 [Oryza sativa Japonica Group]
 gi|113579837|dbj|BAF18200.1| Os05g0557100 [Oryza sativa Japonica Group]
 gi|125553268|gb|EAY98977.1| hypothetical protein OsI_20935 [Oryza sativa Indica Group]
          Length = 494

 Score = 99.0 bits (245), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 81/257 (31%), Positives = 120/257 (46%), Gaps = 24/257 (9%)

Query: 12  SASVDNIAIGC--GHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSYCLVDR--DS 64
            A +  + +GC   H  +G F  + G+L LG  ++SF S+  +     FSYCLVD     
Sbjct: 241 KAKLQGVVLGCTTAHAGQG-FEASDGVLSLGYSNISFASRAASRFGGRFSYCLVDHLAPR 299

Query: 65  DSTSTLEF-------DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFK 117
           ++TS L F        SS P      PLL +  +  FY + +  +SV G  L I    + 
Sbjct: 300 NATSYLTFGAGPDAASSSAPAPGSRTPLLLDARVRPFYAVAVDSVSVDGVALDIPAEVW- 358

Query: 118 IDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSR--- 174
            D   NGG I+DSGT++T L T  Y A+  A       L P   +  FD CY++++R   
Sbjct: 359 -DVGSNGGTIIDSGTSLTVLATPAYKAVVAALSEQLAGL-PRVAMDPFDYCYNWTARGDG 416

Query: 175 -SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS-SSLSIIGNVQQQGTR 232
              + VP ++  F     L  PAK+Y+I   + G  C      +   +S+IGN+ QQ   
Sbjct: 417 GGDLAVPKLAVQFAGSARLEPPAKSYVIDA-APGVKCIGVQEGAWPGVSVIGNILQQEHL 475

Query: 233 VSFNLRNSLIGFTPNKC 249
             F+L N  + F    C
Sbjct: 476 WEFDLNNRWLRFRQTSC 492


>gi|18409620|ref|NP_566966.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|13430562|gb|AAK25903.1|AF360193_1 unknown protein [Arabidopsis thaliana]
 gi|4886277|emb|CAB43423.1| putative protein [Arabidopsis thaliana]
 gi|14532764|gb|AAK64083.1| unknown protein [Arabidopsis thaliana]
 gi|15450892|gb|AAK96717.1| Unknown protein [Arabidopsis thaliana]
 gi|30387567|gb|AAP31949.1| At3g52500 [Arabidopsis thaliana]
 gi|332645431|gb|AEE78952.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 469

 Score = 98.6 bits (244), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 78/278 (28%), Positives = 117/278 (42%), Gaps = 35/278 (12%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
           G  +TE +     +V +  +GC   +       AG+ G G G +S PSQ+N   FS+CLV
Sbjct: 196 GVLITEKLDFPDLTVPDFVVGCSIIST---RQPAGIAGFGRGPVSLPSQMNLKRFSHCLV 252

Query: 61  DR---DSDSTSTLEFDS-------SLPPNAVTAPLLRNHELDT-----FYYLGLTGISVG 105
            R   D++ T+ L+ D+       S  P     P  +N  +       +YYL L  I VG
Sbjct: 253 SRRFDDTNVTTDLDLDTGSGHNSGSKTPGLTYTPFRKNPNVSNKAFLEYYYLNLRRIYVG 312

Query: 106 GDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGT------RALSPT 159
              + I         +G+GG IVDSG+  T ++   +  + + F          + L   
Sbjct: 313 RKHVKIPYKYLAPGTNGDGGSIVDSGSTFTFMERPVFELVAEEFASQMSNYTREKDLEKE 372

Query: 160 DGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS- 218
            G+     C++ S +  V VP + F F  G  L LP  NY   V +  T C       + 
Sbjct: 373 TGLG---PCFNISGKGDVTVPELIFEFKGGAKLELPLSNYFTFVGNTDTVCLTVVSDKTV 429

Query: 219 -------SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
                     I+G+ QQQ   V ++L N   GF   KC
Sbjct: 430 NPSGGTGPAIILGSFQQQNYLVEYDLENDRFGFAKKKC 467


>gi|16209647|gb|AAL14384.1| AT3g52500/F22O6_120 [Arabidopsis thaliana]
          Length = 469

 Score = 98.6 bits (244), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 78/278 (28%), Positives = 117/278 (42%), Gaps = 35/278 (12%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
           G  +TE +     +V +  +GC   +       AG+ G G G +S PSQ+N   FS+CLV
Sbjct: 196 GVLITEKLDFPDLTVPDFVVGCSIIST---RQPAGIAGFGRGPVSLPSQMNLKRFSHCLV 252

Query: 61  DR---DSDSTSTLEFDS-------SLPPNAVTAPLLRNHELDT-----FYYLGLTGISVG 105
            R   D++ T+ L+ D+       S  P     P  +N  +       +YYL L  I VG
Sbjct: 253 SRRFDDTNVTTDLDLDTGSGHNSGSKTPGLTYTPFRKNPNVSNKAFLEYYYLNLRRIYVG 312

Query: 106 GDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGT------RALSPT 159
              + I         +G+GG IVDSG+  T ++   +  + + F          + L   
Sbjct: 313 RKHVKIPYKYLAPGTNGDGGSIVDSGSTFTFMERPVFELVAEEFASQMSNYTREKDLEKE 372

Query: 160 DGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS- 218
            G+     C++ S +  V VP + F F  G  L LP  NY   V +  T C       + 
Sbjct: 373 TGLG---PCFNISGKGDVTVPELIFEFKGGAKLELPLSNYFTFVGNTDTVCLTVVSDKTV 429

Query: 219 -------SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
                     I+G+ QQQ   V ++L N   GF   KC
Sbjct: 430 NPSGGTGPAIILGSFQQQNYLVEYDLENDRFGFAKKKC 467


>gi|356546372|ref|XP_003541600.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 433

 Score = 98.6 bits (244), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 83/262 (31%), Positives = 126/262 (48%), Gaps = 21/262 (8%)

Query: 1   GDFVTETVTLGSASVDNI-----AIGCGHNNE-GLFVGAAGLLGLGGGSLSFPSQINAST 54
           GD   ET+TLGS +   +      IGCG  N  G+    +G++GLG G +S  +Q++ ST
Sbjct: 177 GDLSVETLTLGSTNGSPVQFPGTVIGCGRYNAIGIEEKNSGIVGLGRGPMSLITQLSPST 236

Query: 55  ---FSYCLVDRDSDSTSTLEFDSSLPPNA---VTAPLLRNHELDTFYYLGLTGISVGGDL 108
              FSYCLV   S ++S L F ++   +    V+ PL   + L  FY+L L   SVG + 
Sbjct: 237 GGKFSYCLVPGLSTASSKLNFGNAAVVSGRGTVSTPLFSKNGL-VFYFLTLEAFSVGRNR 295

Query: 109 LPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTC 168
           +            G G II+DSGT +T L    Y+ L  A  +        D   +   C
Sbjct: 296 IEFGSPG----SGGKGNIIIDSGTTLTALPNGVYSKLEAAVAKTVILQRVRDPNQVLGLC 351

Query: 169 YDFS-SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQ 227
           Y  +  +    VP ++ HF  G  + L A N  + V ++   CFAF PT +  ++ GN+ 
Sbjct: 352 YKVTPDKLDASVPVITAHF-SGADVTLNAINTFVQV-ADDVVCFAFQPTETG-AVFGNLA 408

Query: 228 QQGTRVSFNLRNSLIGFTPNKC 249
           QQ   V ++L+ + + F    C
Sbjct: 409 QQNLLVGYDLQMNTVSFKHTDC 430


>gi|356515690|ref|XP_003526531.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 439

 Score = 98.6 bits (244), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 70/206 (33%), Positives = 97/206 (47%), Gaps = 20/206 (9%)

Query: 55  FSYCLVDRDSDSTSTLEFDSSL-------PPNAVTAPLLRNHELDTFYYLGLTGISVGGD 107
           FSYCL      S  +  F  SL       P +  T PLLR+    + YY+  TGISVG  
Sbjct: 242 FSYCL-----PSFKSYYFSGSLKLGPVGQPKSIRTTPLLRSPHRPSLYYVNFTGISVGRV 296

Query: 108 LLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDT 167
           L+P        + +   G I+DSGT +TR     YNA+R+ F +     + T  +  FDT
Sbjct: 297 LVPFPSEYLGFNPNTGSGTIIDSGTVITRFVEPVYNAVREEFRKQVGGTTFTS-IGAFDT 355

Query: 168 CYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP----TSSSLSII 223
           C  F        P ++ HF EG  L LP +N LI   +    C A A      +S L++I
Sbjct: 356 C--FVKTYETLAPPITLHF-EGLDLKLPLENSLIHSSAGSLACLAMAAAPDNVNSVLNVI 412

Query: 224 GNVQQQGTRVSFNLRNSLIGFTPNKC 249
            N QQQ  R+ F++ N+ +G     C
Sbjct: 413 ANFQQQNLRILFDIVNNKVGIAREVC 438


>gi|115466068|ref|NP_001056633.1| Os06g0119600 [Oryza sativa Japonica Group]
 gi|55296446|dbj|BAD68569.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|55296924|dbj|BAD68375.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|113594673|dbj|BAF18547.1| Os06g0119600 [Oryza sativa Japonica Group]
 gi|215694767|dbj|BAG89958.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215737752|dbj|BAG96882.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 495

 Score = 98.2 bits (243), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 84/265 (31%), Positives = 117/265 (44%), Gaps = 32/265 (12%)

Query: 1   GDFVTETVTLGSASV-DNIAIGCGHNNEG--LFVGAAGLLGLGGGSLSFPSQINAS---T 54
           G +  + +TLG   V      GC H + G       AG L LGGGS S   Q        
Sbjct: 247 GTYSFDDLTLGPYDVIRGFRFGCAHADRGSAFDYDVAGSLALGGGSQSLVQQTATRYGRV 306

Query: 55  FSYCLVDRDSDSTSTLEF--------DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGG 106
           FSYCL      + S+L F         + L P+ V+ PLL +    TFY + L  I V G
Sbjct: 307 FSYCL----PPTASSLGFLVLGVPPERAQLIPSFVSTPLLSSSMAPTFYRVLLRAIIVAG 362

Query: 107 DLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFD 166
             L +    F          ++DS T ++RL    Y ALR AF            V++ D
Sbjct: 363 RPLAVPPAVFSASS------VIDSSTIISRLPPTAYQALRAAFRSAMTMYRAAPPVSILD 416

Query: 167 TCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSL--SIIG 224
           TCYDF+   S+ +P+++  F  G  + L A   L+     G+ C AFAPT+S      IG
Sbjct: 417 TCYDFTGVRSITLPSIALVFDGGATVNLDAAGILL-----GS-CLAFAPTASDRMPGFIG 470

Query: 225 NVQQQGTRVSFNLRNSLIGFTPNKC 249
           NVQQ+   V +++    + F    C
Sbjct: 471 NVQQKTLEVVYDVPAKAMRFRTAAC 495


>gi|413936472|gb|AFW71023.1| hypothetical protein ZEAMMB73_589717 [Zea mays]
          Length = 289

 Score = 98.2 bits (243), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 85/255 (33%), Positives = 115/255 (45%), Gaps = 22/255 (8%)

Query: 1   GDFVTETVTLG-SASVDNIAIGCGHNN---EGLFVGAAGLLGLGGGSLSFPSQINASTFS 56
           G +  + +TL   A V N   GCGH      GLF    G+LGLG    S  ++     FS
Sbjct: 51  GAYSQDKLTLAPGAIVQNFYFGCGHGKHAVRGLF---DGVLGLGRLRESLGARYGG-VFS 106

Query: 57  YCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
           YCL    S            P   V  P+       TF  + L GI+VGG  L +  +AF
Sbjct: 107 YCLPSVSSKPGFLALGAGKNPSGFVFTPMGTVPGQPTFSTVTLAGINVGGKKLDLRPSAF 166

Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRA--LSPTDGVALFDTCYDFSSR 174
                 +GG+IVDSGT +T LQ+  Y ALR AF +   A  L P   +   DTCY+ +  
Sbjct: 167 ------SGGMIVDSGTVITGLQSTAYRALRSAFRKAMEAYRLLPNGDL---DTCYNLTGY 217

Query: 175 SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVS 234
            +V VP ++  F  G  + L   N ++    NG   FA +    S  ++GNV Q+   V 
Sbjct: 218 KNVVVPKIALTFTGGATINLDVPNGIL---VNGCLAFAESGPDGSAGVLGNVNQRAFEVL 274

Query: 235 FNLRNSLIGFTPNKC 249
           F+   S  GF    C
Sbjct: 275 FDTSTSKFGFRAKAC 289


>gi|356507997|ref|XP_003522749.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 440

 Score = 98.2 bits (243), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 89/261 (34%), Positives = 124/261 (47%), Gaps = 23/261 (8%)

Query: 3   FVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSYCL 59
            V +++ L +  + N + GC +   G  V A GLLGLG G LS  SQ  ++    FSYCL
Sbjct: 188 LVQDSLRLATDVIPNYSFGCVNAITGASVPAQGLLGLGRGPLSLLSQSGSNYSGIFSYCL 247

Query: 60  VDRDSDSTSTLEFDSSL-------PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPIS 112
                 S  +  F  SL       P +  T PLLR+    + YY+  TGISVG  L+P  
Sbjct: 248 -----PSFKSYYFSGSLKLGPVGQPKSIRTTPLLRSPHRPSLYYVNFTGISVGRVLVPFP 302

Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFS 172
                 + +   G I+DSGT +TR     YNA+R+ F +     + T  +  FDTC  F 
Sbjct: 303 SEYLGFNPNTGSGTIIDSGTVITRFVEPVYNAVREEFRKQVGGTTFTS-IGAFDTC--FV 359

Query: 173 SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP----TSSSLSIIGNVQQ 228
                  P ++ HF EG  L LP +N LI   +    C A A      +S L++I N QQ
Sbjct: 360 KTYETLAPPITLHF-EGLDLKLPLENSLIHSSAGSLACLAMAAAPDNVNSVLNVIANFQQ 418

Query: 229 QGTRVSFNLRNSLIGFTPNKC 249
           Q  R+ F+  N+ +G     C
Sbjct: 419 QNLRILFDTVNNKVGIAREVC 439


>gi|297810815|ref|XP_002873291.1| hypothetical protein ARALYDRAFT_487523 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297319128|gb|EFH49550.1| hypothetical protein ARALYDRAFT_487523 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 439

 Score = 98.2 bits (243), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 72/220 (32%), Positives = 103/220 (46%), Gaps = 26/220 (11%)

Query: 44  LSFPSQINASTFSYCLVDRDSDSTSTLEFDSSL-------PPNAVTAPLLRNHELDTFYY 96
           +S    +  STFSYCL      S  +L F  SL       P       LLRN    + YY
Sbjct: 231 MSQAQSVYKSTFSYCL-----PSFRSLTFSGSLRLGPTSQPQRVKYTQLLRNPRRSSLYY 285

Query: 97  LGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRAL 156
           + L  I VG  ++ +   A   + S   G I DSGT  TRL    Y A+R+ F +  R  
Sbjct: 286 VNLVAIRVGRKVVDLPPAAIAFNPSTGAGTIFDSGTVYTRLAKPVYEAVRNEFRK--RVK 343

Query: 157 SPTDGVAL---FDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAF 213
            PT  V     FDTCY       V+VPT++F F +G  + +PA N ++   +  T C A 
Sbjct: 344 PPTAVVTSLGGFDTCYS----GQVKVPTITFMF-KGVNMTMPADNLMLHSTAGSTSCLAM 398

Query: 214 AP----TSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           A      +S +++I ++QQQ  RV  ++ N  +G    +C
Sbjct: 399 ASAPENVNSVVNVIASMQQQNHRVLIDVPNGRLGLARERC 438


>gi|147866226|emb|CAN79938.1| hypothetical protein VITISV_027777 [Vitis vinifera]
          Length = 454

 Score = 98.2 bits (243), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 69/236 (29%), Positives = 104/236 (44%), Gaps = 24/236 (10%)

Query: 38  GLGGGSLSFPSQINASTFSYCLVDRDSDST---STLEFDSSLPPNAVTA-----PLLRN- 88
           G G G  S PSQ+    FSYCL+ R  D T   S+L  D        TA     P ++N 
Sbjct: 218 GFGRGPPSLPSQLGLKKFSYCLLSRRYDDTTESSSLVLDGESDSGEKTAGLSYTPFVQNP 277

Query: 89  -----HELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYN 143
                H    +YYLGL  I+VGG  + I          G+GG I+DSGT  T ++ E + 
Sbjct: 278 KVAGKHAFSVYYYLGLRHITVGGKHVKIPYKYLIPGADGDGGTIIDSGTTFTYMKGEIFE 337

Query: 144 ALRDAFVRGTRALSPT--DGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLI 201
            +   F +  ++   T  +G+     C++ S  ++   P ++  F  G  + LP  NY+ 
Sbjct: 338 LVAAEFEKQVQSKRATEVEGITGLRPCFNISGLNTPSFPELTLKFRGGAEMELPLANYVA 397

Query: 202 PVDSNGTFCFAFAPTSSSLS--------IIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
            +  +   C       ++          I+GN QQQ   V ++LRN  +GF    C
Sbjct: 398 FLGGDDVVCLTIVTDGAAGKEFSGGPAIILGNFQQQNFYVEYDLRNERLGFRQQSC 453


>gi|125595873|gb|EAZ35653.1| hypothetical protein OsJ_19940 [Oryza sativa Japonica Group]
          Length = 468

 Score = 98.2 bits (243), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 85/237 (35%), Positives = 109/237 (45%), Gaps = 24/237 (10%)

Query: 24  HNNEGLFVGA-AGLLGLGGGSLSFPSQINAS---TFSYCLVDRDSDSTSTLEFDSSLPPN 79
           H   G F  + +G + LGGG  S  SQ  A+    FSYC+ D  S    +L   +     
Sbjct: 245 HAVRGNFSASTSGTMSLGGGRQSLLSQTAATFGNAFSYCVPDPSSSGFLSLGGPADGGGA 304

Query: 80  AVTA--PLLRNHEL-DTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTR 136
              A  PL+RN  +  T Y + L GI VGG  L +    F       GG ++DS   +T+
Sbjct: 305 GRFARTPLVRNPSIIPTLYLVRLRGIEVGGRRLNVPPVVFA------GGAVMDSSVIITQ 358

Query: 137 LQTETYNALRDAFVRGTRALSP--TDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPL 194
           L    Y ALR AF R   A  P    G A  DTCYDF   +SV VP VS  F  G V+ L
Sbjct: 359 LPPTAYRALRLAF-RSAMAAYPRVAGGRAGLDTCYDFVRFTSVTVPAVSLVFDGGAVVRL 417

Query: 195 PAKNYLIPVDSNGTFCFAFAPTSS--SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
            A   ++        C AF PT    +L  IGNVQQQ   V +++    +GF    C
Sbjct: 418 DAMGVMV------EGCLAFVPTPGDFALGFIGNVQQQTHEVLYDVGGGSVGFRRGAC 468


>gi|115448347|ref|NP_001047953.1| Os02g0720500 [Oryza sativa Japonica Group]
 gi|113537484|dbj|BAF09867.1| Os02g0720500, partial [Oryza sativa Japonica Group]
          Length = 172

 Score = 97.8 bits (242), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 68/175 (38%), Positives = 87/175 (49%), Gaps = 24/175 (13%)

Query: 82  TAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTET 141
           T PLL      T+Y + L GISVGG  L I  + F        G +VD+GT VTRL    
Sbjct: 15  TTPLLTASNDPTYYIVMLAGISVGGQPLSIDASVFA------SGAVVDTGTVVTRLPPTA 68

Query: 142 YNALRDAFVRGTRALSP-----TDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPA 196
           Y+ALR AF     A++P          + DTCYDF+   +V +PT+S  F  G  + L  
Sbjct: 69  YSALRSAF---RAAMAPYGYPSAPATGILDTCYDFTRYGTVTLPTISIAFGGGAAMDLGT 125

Query: 197 KNYLIPVDSNGTFCFAFAPTS--SSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
              L       + C AFAPT   S  SI+GNVQQ+   V F+   S +GF P  C
Sbjct: 126 SGILT------SGCLAFAPTGGDSQASILGNVQQRSFEVRFD--GSTVGFMPASC 172


>gi|242041951|ref|XP_002468370.1| hypothetical protein SORBIDRAFT_01g044790 [Sorghum bicolor]
 gi|241922224|gb|EER95368.1| hypothetical protein SORBIDRAFT_01g044790 [Sorghum bicolor]
          Length = 408

 Score = 97.8 bits (242), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 73/222 (32%), Positives = 101/222 (45%), Gaps = 19/222 (8%)

Query: 42  GSLSFPSQINAS---TFSYCLVDRDSDSTSTLEFDSSL-------PPNAVTAPLLRNHEL 91
           G +S  SQ  +     FSYCL      S  +  F  SL       P N    PLL N   
Sbjct: 191 GPMSLLSQTGSRYNGVFSYCL-----PSYRSYYFSGSLRLGAAGQPRNVRYTPLLTNPHR 245

Query: 92  DTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR 151
            + YY+ +TG+SVG  L+     +F  D S   G ++DSGT +TR     Y ALRD F R
Sbjct: 246 PSLYYVNVTGLSVGRALVKAPAGSFAFDPSTGAGTVIDSGTVITRWTAPVYAALRDEFRR 305

Query: 152 GTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCF 211
              A S    +  FDTC++    ++   P V+ H   G  L LP +N LI   +    C 
Sbjct: 306 QVAAPSGYTSLGAFDTCFNTDEVAAGGAPPVTLHMGGGVDLTLPMENTLIHSSATPLACL 365

Query: 212 AFAPTSS----SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           A A         ++++ N+QQQ  RV  ++  S +GF    C
Sbjct: 366 AMAEAPQNVNSVVNVVANLQQQNVRVVVDVAGSRVGFAREPC 407


>gi|356513737|ref|XP_003525567.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial [Glycine
           max]
          Length = 455

 Score = 97.8 bits (242), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 84/289 (29%), Positives = 121/289 (41%), Gaps = 53/289 (18%)

Query: 6   ETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN------ASTFSYCL 59
           +T++L S  + N   GC +          G+ G G G LS P+Q+        + FSYCL
Sbjct: 163 DTLSLSSLFLRNFTFGCAYTT---LAEPTGVAGFGRGLLSLPAQLATLSPQLGNRFSYCL 219

Query: 60  VDRDSDSTSTLEFDSSL----------------PPNAVTAPLLRNHELDTFYYLGLTGIS 103
           V    DS    +    +                    V  P+L N +   FY +GL GIS
Sbjct: 220 VSHSFDSERVRKPSPLILGRYEEEEEEEKVGGGVAEFVYTPMLENPKHPYFYTVGLIGIS 279

Query: 104 VGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGT-------RAL 156
           VG  ++P  E   +++  G+GG++VDSGT  T L    YN++ D F RG        R +
Sbjct: 280 VGKRIVPAPEMLRRVNNRGDGGVVVDSGTTFTMLPAGFYNSVVDEFDRGVGRVNERARKI 339

Query: 157 SPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGK-VLPLPAKNYLIP-VDSN-------- 206
               G+A    CY  +  S  EVP ++  F  G   + LP KNY    +D          
Sbjct: 340 EEKTGLA---PCYYLN--SVAEVPVLTLRFAGGNSSVVLPRKNYFYEFLDGRDAAKGKRR 394

Query: 207 -GTFCFAFAPTSSSLS-----IIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
            G          + LS      +GN QQQG  V ++L    +GF   +C
Sbjct: 395 VGCLMLMNGGDEAELSGGPGATLGNYQQQGFEVEYDLEEKRVGFARRQC 443


>gi|242059939|ref|XP_002459115.1| hypothetical protein SORBIDRAFT_03g046190 [Sorghum bicolor]
 gi|241931090|gb|EES04235.1| hypothetical protein SORBIDRAFT_03g046190 [Sorghum bicolor]
          Length = 153

 Score = 97.4 bits (241), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 60/157 (38%), Positives = 85/157 (54%), Gaps = 12/157 (7%)

Query: 99  LTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSP 158
           + GI VGG  +P+  +A   D +   G IVD+GT  TRL    Y A+RDAF R  RA  P
Sbjct: 1   MVGIRVGGKPVPVPASALAFDPTSGRGTIVDAGTMFTRLSAPVYAAVRDAFRRRVRA--P 58

Query: 159 TDG-VALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP-- 215
             G +  FDTCY+     +V VPTV+F F     + LP +N +I   S G  C A A   
Sbjct: 59  VAGPLGGFDTCYNV----TVSVPTVTFVFDGPVSVTLPEENVVIRSSSGGIACLAMAAGP 114

Query: 216 ---TSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
                ++L+++ ++QQQ  RV F++ N  +GF+   C
Sbjct: 115 PDGVDAALNVLASMQQQNHRVLFDVANGRVGFSRELC 151


>gi|219362525|ref|NP_001136612.1| uncharacterized protein LOC100216735 [Zea mays]
 gi|194696366|gb|ACF82267.1| unknown [Zea mays]
 gi|413953802|gb|AFW86451.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
          Length = 411

 Score = 97.4 bits (241), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 85/255 (33%), Positives = 115/255 (45%), Gaps = 22/255 (8%)

Query: 1   GDFVTETVTLG-SASVDNIAIGCGHNNE---GLFVGAAGLLGLGGGSLSFPSQINASTFS 56
           G +  + +TL   A V N   GCGH      GLF    G+LGLG    S  ++     FS
Sbjct: 173 GAYSQDKLTLAPGAIVQNFYFGCGHGKHAVRGLF---DGVLGLGRLRESLGARYGG-VFS 228

Query: 57  YCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
           YCL    S            P   V  P+       TF  + L GI+VGG  L +  +AF
Sbjct: 229 YCLPSVSSKPGFLALGAGKNPSGFVFTPMGTVPGQPTFSTVTLAGINVGGKKLDLRPSAF 288

Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRA--LSPTDGVALFDTCYDFSSR 174
                 +GG+IVDSGT +T LQ+  Y ALR AF +   A  L P   +   DTCY+ +  
Sbjct: 289 ------SGGMIVDSGTVITGLQSTAYRALRSAFRKAMEAYRLLPNGDL---DTCYNLTGY 339

Query: 175 SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVS 234
            +V VP ++  F  G  + L   N ++    NG   FA +    S  ++GNV Q+   V 
Sbjct: 340 KNVVVPKIALTFTGGATINLDVPNGIL---VNGCLAFAESGPDGSAGVLGNVNQRAFEVL 396

Query: 235 FNLRNSLIGFTPNKC 249
           F+   S  GF    C
Sbjct: 397 FDTSTSKFGFRAKAC 411


>gi|359474399|ref|XP_003631454.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 485

 Score = 97.4 bits (241), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 81/279 (29%), Positives = 113/279 (40%), Gaps = 51/279 (18%)

Query: 15  VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINA------STFSYCLV------DR 62
           + N   GC H   G  VG AG    G G LS P+Q+ +      + FSYCLV      DR
Sbjct: 204 LHNFTFGCAHTALGEPVGVAGF---GRGVLSLPAQLASFSPHLGNQFSYCLVSHSFDADR 260

Query: 63  --------------DSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDL 108
                         D +    +  D       V   +L N +   FY +GL GI+VG   
Sbjct: 261 VRRPSPLILGRYSLDDEKKKRVGHDRG---EFVYTAMLDNPKHPYFYCVGLEGITVGNRK 317

Query: 109 LPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFV----RGTRALSPTDGVAL 164
           +P+ E   ++D  GNGG++VDSGT  T L    Y +L   F     R  +  +  +    
Sbjct: 318 IPVPEILKRVDRRGNGGMVVDSGTTFTMLPAGLYESLVTEFNHRMGRVYKRATQIEERTG 377

Query: 165 FDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPV--------DSNGTFCFAF--- 213
              CY +S  S+ +VP V+ HF     + LP  NY                  C      
Sbjct: 378 LGPCY-YSDDSAAKVPAVALHFVGNSTVILPRNNYYYEFFDGRDGQKKKRKVGCLMLMNG 436

Query: 214 ---APTSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
              A +    + +GN QQQG  V ++L    +GF   KC
Sbjct: 437 GDEAESGGPAATLGNYQQQGFEVVYDLEKHRVGFARRKC 475


>gi|168008086|ref|XP_001756738.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162691976|gb|EDQ78335.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 174

 Score = 97.1 bits (240), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 56/170 (32%), Positives = 86/170 (50%), Gaps = 8/170 (4%)

Query: 84  PLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYN 143
           PLL++  ++TFY++ L  ++V G  LPIS    K++  GNGG I+D  T  TR     + 
Sbjct: 6   PLLKHPLVETFYFVNLVAVAVNGAKLPISSKVLKMNSEGNGGAILDMSTRFTRFPNSAF- 64

Query: 144 ALRDAFVRGTRALS--PTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLI 201
              D  V+  +AL   PT  V  F  CY   +  ++ +PTV+  F  G  + LP +N  +
Sbjct: 65  ---DHLVKALKALIRLPTMVVPRFQLCYSTVNTGTLIIPTVTLIFENGVRMRLPMENTFV 121

Query: 202 PVDSNG-TFCFAFAP-TSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
            V   G   C A  P    + ++IG+ QQQ   +  +   S +GF P +C
Sbjct: 122 SVTEQGDVMCLAMVPGNPGTATVIGSAQQQNFLIVIDREASRLGFAPLQC 171


>gi|125561847|gb|EAZ07295.1| hypothetical protein OsI_29543 [Oryza sativa Indica Group]
          Length = 205

 Score = 97.1 bits (240), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 60/166 (36%), Positives = 85/166 (51%), Gaps = 17/166 (10%)

Query: 36  LLGLGGGSLSFPSQINASTFSYCLVDRDSDSTSTLEF------------DSSLPPNAVTA 83
           ++GLG G LS  SQ+  S FSYCL    S   S L F             S LP    + 
Sbjct: 1   MVGLGRGLLSLVSQLGPSRFSYCLTSFLSPEPSRLNFGVFATLNGTNASSSGLP--VQST 58

Query: 84  PLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYN 143
           PL+ N  L + Y++ L GIS+G   LPI    F I++ G GG+ +DSGT++T LQ + Y+
Sbjct: 59  PLVVNAALPSLYFMSLKGISLGQKRLPIDPLVFAINDDGTGGVFIDSGTSLTWLQQDVYD 118

Query: 144 ALRDAFVRGTRALSPTDGVAL-FDTCYDFSSRSSV--EVPTVSFHF 186
           A+R   V   R L P +   +  +TC+ +    +V   VP +  HF
Sbjct: 119 AVRRELVSVLRPLPPANDTEIGLETCFPWPPPPTVTMTVPDMELHF 164


>gi|359476191|ref|XP_003631801.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 439

 Score = 97.1 bits (240), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 91/274 (33%), Positives = 130/274 (47%), Gaps = 38/274 (13%)

Query: 1   GDFVTETVTLGSASV-DNIAIGCGHNNEGLF-VGAAGLLGLGGGSLSFPSQINAS---TF 55
           G++  +T+TL  + V      G G NN+G F  G  G+LGLG G LS  SQ  +     F
Sbjct: 177 GNYGCDTMTLEPSDVFQKFQFGRGRNNKGDFGSGVDGMLGLGQGQLSTVSQTASKFNKVF 236

Query: 56  SYCLVDRDS-----------DSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISV 104
           SYCL + DS             +S+L+F S      V  P     +   +Y++ L+ ISV
Sbjct: 237 SYCLPEEDSIGSLLFGEKATSQSSSLKFTS-----LVNGP--GTLQESGYYFVNLSDISV 289

Query: 105 GGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVA- 163
           G + L I  + F      + G I+DS T +TRL    Y+AL+ AF +       ++G   
Sbjct: 290 GNERLNIPSSVF-----ASPGTIIDSRTVITRLPQRAYSALKAAFKKAMAKYPLSNGRRK 344

Query: 164 ---LFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSS- 219
              + DTCY+ S R  V +P +  HF  G  + L   N +   D +   C AFA  S S 
Sbjct: 345 KGDILDTCYNLSGRKDVLLPEIVLHFGGGADVRLNGTNIVWGSDES-RLCLAFAGNSKST 403

Query: 220 ----LSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
               L+IIGN QQ    V ++++   IGF  N C
Sbjct: 404 MNPELTIIGNRQQLSLTVLYDIQGGRIGFRSNGC 437


>gi|42407406|dbj|BAD09564.1| nucleoid DNA-binding protein-like [Oryza sativa Japonica Group]
          Length = 205

 Score = 97.1 bits (240), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 60/166 (36%), Positives = 85/166 (51%), Gaps = 17/166 (10%)

Query: 36  LLGLGGGSLSFPSQINASTFSYCLVDRDSDSTSTLEF------------DSSLPPNAVTA 83
           ++GLG G LS  SQ+  S FSYCL    S   S L F             S LP    + 
Sbjct: 1   MVGLGRGLLSLVSQLGPSRFSYCLTSFLSPEPSRLNFGVFATLNGTNASSSGLP--VQST 58

Query: 84  PLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYN 143
           PL+ N  L + Y++ L GIS+G   LPI    F I++ G GG+ +DSGT++T LQ + Y+
Sbjct: 59  PLVVNAALPSLYFMSLKGISLGQKRLPIDPLVFAINDDGTGGVFIDSGTSLTWLQQDVYD 118

Query: 144 ALRDAFVRGTRALSPTDGVAL-FDTCYDFSSRSSV--EVPTVSFHF 186
           A+R   V   R L P +   +  +TC+ +    +V   VP +  HF
Sbjct: 119 AVRRELVSVLRPLPPANDTEIGLETCFPWPPPPTVTMTVPDMELHF 164


>gi|194696934|gb|ACF82551.1| unknown [Zea mays]
 gi|413936470|gb|AFW71021.1| hypothetical protein ZEAMMB73_589717 [Zea mays]
 gi|413953801|gb|AFW86450.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
          Length = 445

 Score = 97.1 bits (240), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 82/252 (32%), Positives = 113/252 (44%), Gaps = 16/252 (6%)

Query: 1   GDFVTETVTLG-SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
           G +  + +TL   A V N   GCGH    +     G+LGLG    S  ++     FSYCL
Sbjct: 207 GAYSQDKLTLAPGAIVQNFYFGCGHGKHAVRGLFDGVLGLGRLRESLGARYGG-VFSYCL 265

Query: 60  VDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKID 119
               S            P   V  P+       TF  + L GI+VGG  L +  +AF   
Sbjct: 266 PSVSSKPGFLALGAGKNPSGFVFTPMGTVPGQPTFSTVTLAGINVGGKKLDLRPSAF--- 322

Query: 120 ESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRA--LSPTDGVALFDTCYDFSSRSSV 177
              +GG+IVDSGT +T LQ+  Y ALR AF +   A  L P   +   DTCY+ +   +V
Sbjct: 323 ---SGGMIVDSGTVITGLQSTAYRALRSAFRKAMEAYRLLPNGDL---DTCYNLTGYKNV 376

Query: 178 EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNL 237
            VP ++  F  G  + L   N ++    NG   FA +    S  ++GNV Q+   V F+ 
Sbjct: 377 VVPKIALTFTGGATINLDVPNGIL---VNGCLAFAESGPDGSAGVLGNVNQRAFEVLFDT 433

Query: 238 RNSLIGFTPNKC 249
             S  GF    C
Sbjct: 434 STSKFGFRAKAC 445


>gi|388516465|gb|AFK46294.1| unknown [Medicago truncatula]
          Length = 434

 Score = 97.1 bits (240), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 70/204 (34%), Positives = 95/204 (46%), Gaps = 22/204 (10%)

Query: 55  FSYCLVDRDSDSTSTLEFDSSL-------PPNAVTAPLLRNHELDTFYYLGLTGISVGGD 107
           FSYCL      S  +  F  SL       P +  T PLLRN    + Y++ LTGI+VG  
Sbjct: 241 FSYCL-----PSFKSYYFSGSLKLGPVGQPKSIRTTPLLRNPRRPSLYFVNLTGITVGKV 295

Query: 108 LLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDT 167
            +P  +     D +   G I+DSGT +TR     YNA+RD F +  +   P   +  FDT
Sbjct: 296 NVPFPKELLAFDVNTGSGTIIDSGTVITRFVEPVYNAVRDEFRK--QVTGPFSSLGAFDT 353

Query: 168 CYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSS-----LSI 222
           C  F        P ++ HF +   L LP +N LI   S    C A A T  +     L++
Sbjct: 354 C--FVKNYETLAPAITLHFTDLD-LKLPLENSLIHSSSGSLACLAMASTPKNVNYTVLNV 410

Query: 223 IGNVQQQGTRVSFNLRNSLIGFTP 246
           I N QQQ  RV F+  N+   + P
Sbjct: 411 IANYQQQNLRVLFDTVNNKGWYCP 434


>gi|413944387|gb|AFW77036.1| hypothetical protein ZEAMMB73_461996 [Zea mays]
          Length = 472

 Score = 97.1 bits (240), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 96/261 (36%), Positives = 128/261 (49%), Gaps = 24/261 (9%)

Query: 1   GDFVTETVTLG-SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFS 56
           G + TET+TL    SV +   GCG   +G F    GLLGLGG   S  SQ   +    FS
Sbjct: 224 GVYSTETLTLSPQVSVKDFGFGCGLVQQGTFDLFDGLLGLGGAPESLVSQTAETYGGAFS 283

Query: 57  YCLVDRDSDSTSTLEFDSSLPPNAVTA----PLLRNHELDTFYYLGLTGISVGGDLLPIS 112
           YCL   +S +T  L   +    N        PL    E  TFY + LTG+SVGG  L I 
Sbjct: 284 YCLPPGNS-TTGFLALGAPTNNNDTAGFLFTPLHSLPEQATFYLVNLTGVSVGGKPLDIP 342

Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRA--LSPTDGVALFDTCYD 170
            T        +GG+I+DSGT +T L    Y+ALR AF     A  L P +   + DTCY+
Sbjct: 343 PTVL------SGGMIIDSGTIITGLPDTAYSALRTAFRTAMSAYPLLPPNNDDVLDTCYN 396

Query: 171 FSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS--SLSIIGNVQQ 228
           F+  ++V VPTV+  F  G  + L   + ++  D     C AFA  +S   + IIGNV Q
Sbjct: 397 FTGIANVTVPTVALTFDGGATIDLDVPSGVLIQD-----CLAFAGGASDGDVGIIGNVNQ 451

Query: 229 QGTRVSFNLRNSLIGFTPNKC 249
           +   V ++     +GF P  C
Sbjct: 452 RTFEVLYDSGRGHVGFRPGAC 472


>gi|242086034|ref|XP_002443442.1| hypothetical protein SORBIDRAFT_08g019550 [Sorghum bicolor]
 gi|241944135|gb|EES17280.1| hypothetical protein SORBIDRAFT_08g019550 [Sorghum bicolor]
          Length = 443

 Score = 97.1 bits (240), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 76/235 (32%), Positives = 108/235 (45%), Gaps = 20/235 (8%)

Query: 32  GAAGLLGLGGGSLSFPSQINASTFSYCLVD--RDSDSTSTLEFDSSLPPNAVTAPLL--- 86
           GA+GL+GLG G LS  SQ  A  FSYCL     ++ ++S L   ++   +     ++   
Sbjct: 209 GASGLIGLGRGRLSLASQTGAKRFSYCLTPYFHNNGASSHLFVGAAASLSGGGGAVMSMA 268

Query: 87  -----RNHELDTFYYLGLTGISVGGDLLPISETAFKIDES----GNGGIIVDSGTAVTRL 137
                +++   TFYYL L GI+VG   L I  TAF + E       GG+I+DSG+  T L
Sbjct: 269 FVESPKDYPYSTFYYLPLVGITVGETKLAIPSTAFDLQEVEEGFWEGGVIIDSGSPFTSL 328

Query: 138 QTETYNALRDAFVR---GTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPL 194
             + Y  L     R   G+    P +       C        V VPT+  HF  G  + L
Sbjct: 329 VEDAYEPLMGELARQLNGSLVPPPGEDDGGMALCVARGDLDRV-VPTLVLHFSGGADMAL 387

Query: 195 PAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           P +NY  P++ + T C A        SIIGN QQQ   + F++    + F    C
Sbjct: 388 PPENYWAPLEKS-TACMAIV-RGYLQSIIGNFQQQNMHILFDVGGGRLSFQNADC 440


>gi|357476865|ref|XP_003608718.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355509773|gb|AES90915.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 482

 Score = 96.7 bits (239), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 78/291 (26%), Positives = 121/291 (41%), Gaps = 59/291 (20%)

Query: 6   ETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAST------FSYCL 59
           +T++L +  + N   GC H     F    G+ G G G LS P+Q+   +      FSYCL
Sbjct: 192 DTLSLSTLQLTNFTFGCAHTT---FSEPTGVAGFGRGLLSLPAQLATHSPQLGNRFSYCL 248

Query: 60  V----------------------DRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYL 97
           V                      ++ S+    +EF        V   +L N +   FY +
Sbjct: 249 VSHSFRSERIRKPSPLILGRYNDEKQSNGDEVVEF--------VYTSMLENPKHSYFYTV 300

Query: 98  GLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR----GT 153
           GL GISVG   +P  +   ++++ G+GG++VDSGT  T L  + YN++ + F R      
Sbjct: 301 GLKGISVGKKTVPAPKILRRVNKKGDGGVVVDSGTTFTMLPEKFYNSVVEGFDRRARKSN 360

Query: 154 RALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPV---------- 203
           R     +       CY  ++ + V   T+ F      V+ LP KNY              
Sbjct: 361 RRAPEIEQKTGLSPCYYLNTAAIVPAVTLRFVGMNSSVV-LPRKNYFYEFMDGGDGVRRK 419

Query: 204 DSNGTFCFAFAPTSSSLS-----IIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           +  G   F      + +S     ++GN QQQG  V ++L    +GF   KC
Sbjct: 420 ERVGCLMFMNGGDEAEMSGGPGGVLGNYQQQGFEVEYDLEKKRVGFARRKC 470


>gi|212275143|ref|NP_001130306.1| uncharacterized protein LOC100191400 precursor [Zea mays]
 gi|194688798|gb|ACF78483.1| unknown [Zea mays]
 gi|194703430|gb|ACF85799.1| unknown [Zea mays]
 gi|194707192|gb|ACF87680.1| unknown [Zea mays]
 gi|223944599|gb|ACN26383.1| unknown [Zea mays]
 gi|223948667|gb|ACN28417.1| unknown [Zea mays]
 gi|414887962|tpg|DAA63976.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
          Length = 450

 Score = 96.7 bits (239), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 70/212 (33%), Positives = 106/212 (50%), Gaps = 16/212 (7%)

Query: 44  LSFPSQINASTFSYCLVDRDS-DSTSTLEFDSSLPPNAV-TAPLLRNHELDTFYYLGLTG 101
           LS    +  +TFSYCL    S + + TL    +  P  + T PLL N    + YY+ +TG
Sbjct: 246 LSQTKDMYEATFSYCLPSFKSLNFSGTLRLGRNGQPQRIKTTPLLANPHRSSLYYVNMTG 305

Query: 102 ISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDG 161
           I VG  ++PI       D +   G ++DSGT  TRL    Y A+RD   R  R  +P   
Sbjct: 306 IRVGRKVVPIPA----FDPATGAGTVLDSGTMFTRLVAPAYVAVRDEVRR--RVGAPVSS 359

Query: 162 VALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT----S 217
           +  FDTC++    ++V  P V+  F +G  + LP +N +I        C A A      +
Sbjct: 360 LGGFDTCFN---TTAVAWPPVTLLF-DGMQVTLPEENVVIHSTYGTISCLAMAAAPDGVN 415

Query: 218 SSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           + L++I ++QQQ  RV F++ N  +GF   +C
Sbjct: 416 TVLNVIASMQQQNHRVLFDVPNGRVGFARERC 447


>gi|357118734|ref|XP_003561105.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 404

 Score = 96.7 bits (239), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 78/238 (32%), Positives = 102/238 (42%), Gaps = 20/238 (8%)

Query: 21  GCGHNNEGLFVGA-AGLLGLGGGSLSFPSQINAS---TFSYCLVDRDSD---STSTLEFD 73
           GC H+  G F G  +G + LGGG  S  SQ  ++    FSYC+    +    S       
Sbjct: 178 GCSHSVRGRFSGQTSGTMSLGGGRQSLRSQTASAYGDAFSYCVPQPSASGFLSLGGAIGS 237

Query: 74  SSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTA 133
           S       + PL+      TFY + L GI V G  L +    F      + G ++DS   
Sbjct: 238 SGSGSGFASTPLVATAN-PTFYVVRLQGIDVAGRRLNVPPAVF------SAGTLMDSSAV 290

Query: 134 VTRLQTETYNALRDAFVRGTRALS--PTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKV 191
           VT+L    Y ALR AF    R     P  G  + DTCYDF    +V VP VS  F  G V
Sbjct: 291 VTQLPPTAYRALRRAFRNAMRRYRRVPAGGKQILDTCYDFEGLGNVTVPAVSLVFSGGAV 350

Query: 192 LPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           + L     ++     G   F   P  S L  IGNVQQQ   V +++    +GF    C
Sbjct: 351 VRLEPMAVMM----EGCLAFVPTPADSDLGFIGNVQQQTHEVLYDVGARNVGFRRGAC 404


>gi|449515033|ref|XP_004164554.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 430

 Score = 96.7 bits (239), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 77/261 (29%), Positives = 128/261 (49%), Gaps = 23/261 (8%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAST-----F 55
           GD   E +T+GS+SV ++ IGCGH + G F  A+G++GLGGG LS  SQ++ ++     F
Sbjct: 180 GDLGFEKITIGSSSVKSV-IGCGHESGGGFGFASGVIGLGGGQLSLVSQMSQTSGISRRF 238

Query: 56  SYCLVDRDSDSTSTLEFDSSLP---PNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPIS 112
           SYCL    S +   + F  +     P  V+ PL+  + + T+YY+ L  IS+G +     
Sbjct: 239 SYCLPTLLSHANGKINFGQNAVVSGPGVVSTPLISKNPV-TYYYVTLEAISIGNER---- 293

Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYD-- 170
                +  +  G +I+DSGT ++ L  E Y+ +  + ++  +A    D    +D C+D  
Sbjct: 294 ----HMASAKQGNVIIDSGTTLSFLPKELYDGVVSSLLKVVKAKRVKDPGNFWDLCFDDG 349

Query: 171 FSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSS--LSIIGNVQQ 228
            +  +S  +P ++  F  G  + L   N    V +N   C    P S +    IIGN+  
Sbjct: 350 INVATSSGIPIITAQFSGGANVNLLPVNTFQKV-ANNVNCLTLTPASPTDEFGIIGNLAL 408

Query: 229 QGTRVSFNLRNSLIGFTPNKC 249
               + ++L    + F P  C
Sbjct: 409 ANFLIGYDLEAKRLSFKPTVC 429


>gi|125596976|gb|EAZ36756.1| hypothetical protein OsJ_21092 [Oryza sativa Japonica Group]
          Length = 435

 Score = 96.7 bits (239), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 80/271 (29%), Positives = 121/271 (44%), Gaps = 27/271 (9%)

Query: 1   GDFVTETVTLG-SASVDNIAIGCGH--NNEGLFVGAAGLLGLGGGSLSFPSQI------- 50
           G  V +T+TL  SA+      GC     +   F GA GL+ L   S S  S++       
Sbjct: 170 GTLVRDTLTLPPSATFAGFTFGCIEVGADADTFDGAVGLIDLSRSSHSLASRVISNGATT 229

Query: 51  NASTFSYCLVDRDSDSTST-LEFDSSLPP----NAVTAPLLRNHELDTFYYLGLTGISVG 105
           +A+ FSYCL    + S+   L   +S P     +   AP+  N      Y++ L GISVG
Sbjct: 230 SAAAFSYCLPSSSATSSRGFLSIGASRPEYSGGDIKYAPMSSNPNHPNSYFVDLVGISVG 289

Query: 106 GDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALF 165
           G+ LP+    F        G ++++ T  T L    Y ALRDAF +            + 
Sbjct: 290 GEDLPVPPAVFAAH-----GTLLEAATEFTFLAPAAYAALRDAFRKDMAPYPAAPPFRVL 344

Query: 166 DTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFC-------FAFAPTSS 218
           DTCY+ +  +S+ VP V+  F  G  L L  +  +   D +  F         A    + 
Sbjct: 345 DTCYNLTGLASLAVPAVALRFAGGTELELDVRQMMYFADPSSVFSSVACLAFAAAPLPAF 404

Query: 219 SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
            +S+IG + Q+ T V ++LR   +GF P +C
Sbjct: 405 PVSVIGTLAQRSTEVVYDLRGGRVGFIPGRC 435


>gi|54290724|dbj|BAD62394.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
          Length = 523

 Score = 96.7 bits (239), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 80/271 (29%), Positives = 121/271 (44%), Gaps = 27/271 (9%)

Query: 1   GDFVTETVTLG-SASVDNIAIGCGH--NNEGLFVGAAGLLGLGGGSLSFPSQI------- 50
           G  V +T+TL  SA+      GC     +   F GA GL+ L   S S  S++       
Sbjct: 258 GTLVRDTLTLPPSATFAGFTFGCIEVGADADTFDGAVGLIDLSRSSHSLASRVISNGATT 317

Query: 51  NASTFSYCLVDRDSDSTST-LEFDSSLPP----NAVTAPLLRNHELDTFYYLGLTGISVG 105
           +A+ FSYCL    + S+   L   +S P     +   AP+  N      Y++ L GISVG
Sbjct: 318 SAAAFSYCLPSSSATSSRGFLSIGASRPEYSGGDIKYAPMSSNPNHPNSYFVDLVGISVG 377

Query: 106 GDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALF 165
           G+ LP+    F        G ++++ T  T L    Y ALRDAF +            + 
Sbjct: 378 GEDLPVPPAVFAAH-----GTLLEAATEFTFLAPAAYAALRDAFRKDMAPYPAAPPFRVL 432

Query: 166 DTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFC-------FAFAPTSS 218
           DTCY+ +  +S+ VP V+  F  G  L L  +  +   D +  F         A    + 
Sbjct: 433 DTCYNLTGLASLAVPAVALRFAGGTELELDVRQMMYFADPSSVFSSVACLAFAAAPLPAF 492

Query: 219 SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
            +S+IG + Q+ T V ++LR   +GF P +C
Sbjct: 493 PVSVIGTLAQRSTEVVYDLRGGRVGFIPGRC 523


>gi|242095592|ref|XP_002438286.1| hypothetical protein SORBIDRAFT_10g011130 [Sorghum bicolor]
 gi|241916509|gb|EER89653.1| hypothetical protein SORBIDRAFT_10g011130 [Sorghum bicolor]
          Length = 495

 Score = 96.7 bits (239), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 76/271 (28%), Positives = 126/271 (46%), Gaps = 28/271 (10%)

Query: 1   GDFVTETVTLG-SASVDNIAIGCGHNNEGLFVG--AAGLLGLGGGSLSFPSQI------N 51
           G  V +T+TL  SA+ +N A+GC   +  LF    A G + L     S  +++       
Sbjct: 228 GTIVMDTLTLSPSATFENFAVGCMQLDNDLFTDGVAVGNIDLSLSRHSLATRVLNSSPPG 287

Query: 52  ASTFSYCLVDRDSDSTSTLEFDSSLPPNAVTA-----PLLRNHELDTFYYLGLTGISVGG 106
            + FSYCL   D+D+   L    +L   +  A     PL+ N     FYY+ L  I++ G
Sbjct: 288 MAAFSYCL-PADTDTHGFLTIAPALSDYSDHAGVKYVPLVTNPTGPNFYYVDLVAIAING 346

Query: 107 DLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFD 166
           + LPI    F    +GNG +I DS +A T L    Y ALRD F +      P       D
Sbjct: 347 EDLPIPPALF----TGNGTMI-DSQSAFTYLNPPIYAALRDEFRKAMLQYQPVPAFGGLD 401

Query: 167 TCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSN-------GTFCFAFAPTSS- 218
           TCY+F+   ++ +P ++  F  G+ + L  + ++     +       G   FA AP  + 
Sbjct: 402 TCYNFTLAENIYLPDITLRFSNGETMDLDDRQFMYFFREHLTDGFPFGCLAFAAAPDQNF 461

Query: 219 SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
             + +G+  Q+   + +++R  ++ F P++C
Sbjct: 462 PWNYLGSQVQRTKEIVYDVRGGMVAFVPSRC 492


>gi|224030719|gb|ACN34435.1| unknown [Zea mays]
          Length = 216

 Score = 96.7 bits (239), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 66/206 (32%), Positives = 95/206 (46%), Gaps = 16/206 (7%)

Query: 55  FSYCLVDRDSDSTSTLEFDSSL-------PPNAVTAPLLRNHELDTFYYLGLTGISVGGD 107
           FSYCL      S  +  F  SL       P N    PLL N    + YY+ +TG+SVG  
Sbjct: 15  FSYCL-----PSYRSYYFSGSLRLGAAGQPRNVRHTPLLTNPHRPSLYYVNVTGLSVGRT 69

Query: 108 LLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDT 167
            + +   +F  D +   G ++DSGT +TR     Y ALR+ F R   A S    +  FDT
Sbjct: 70  WVKVPAGSFAFDPATGAGTVIDSGTVITRWTAPVYAALREEFRRQVAAPSGYTSLGAFDT 129

Query: 168 CYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS----SLSII 223
           C++    ++   P V+ H   G  L LP +N LI   +    C A A         ++++
Sbjct: 130 CFNTDEVAAGGAPPVTLHMDGGVDLTLPMENTLIHSSATPLACLAMAEAPQNVNAVVNVV 189

Query: 224 GNVQQQGTRVSFNLRNSLIGFTPNKC 249
            N+QQQ  RV  ++  S +GF    C
Sbjct: 190 ANLQQQNVRVVVDVAGSRVGFAREPC 215


>gi|194707292|gb|ACF87730.1| unknown [Zea mays]
          Length = 216

 Score = 96.3 bits (238), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 66/206 (32%), Positives = 95/206 (46%), Gaps = 16/206 (7%)

Query: 55  FSYCLVDRDSDSTSTLEFDSSL-------PPNAVTAPLLRNHELDTFYYLGLTGISVGGD 107
           FSYCL      S  +  F  SL       P N    PLL N    + YY+ +TG+SVG  
Sbjct: 15  FSYCL-----PSYRSYYFSGSLRLGAAGQPRNVRYTPLLTNPHRPSLYYVNVTGLSVGRT 69

Query: 108 LLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDT 167
            + +   +F  D +   G ++DSGT +TR     Y ALR+ F R   A S    +  FDT
Sbjct: 70  WVKVPAGSFAFDPATGAGTVIDSGTVITRWTAPVYAALREEFRRQVAAPSGYTSLGAFDT 129

Query: 168 CYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS----SLSII 223
           C++    ++   P V+ H   G  L LP +N LI   +    C A A         ++++
Sbjct: 130 CFNTDEVAAGGAPPVTLHMDGGVDLTLPMENTLIHSSATPLACLAMAEAPQNVNAVVNVV 189

Query: 224 GNVQQQGTRVSFNLRNSLIGFTPNKC 249
            N+QQQ  RV  ++  S +GF    C
Sbjct: 190 ANLQQQNVRVVVDVAGSRVGFAREPC 215


>gi|356495754|ref|XP_003516738.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 420

 Score = 96.3 bits (238), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 85/266 (31%), Positives = 127/266 (47%), Gaps = 26/266 (9%)

Query: 1   GDFVTETVTLGSAS-----VDNIAIGCGHNNEGLFV-GAAGLLGLGGGSLSFPSQINAS- 53
           G    ET+TL S       +  I  GCGHNN G F     G++GLGGG +S  SQ+ +S 
Sbjct: 160 GVLAQETITLSSTKGKSVPLKGIVFGCGHNNTGGFNDHEMGIIGLGGGPVSLISQMGSSF 219

Query: 54  ---TFSYCLVDRDSDST--STLEFDSSLP---PNAVTAPLLRNHELDTFYYLGLTGISVG 105
               FS CLV   +D +  S + F           V+ PL+   +  T Y++ L GISV 
Sbjct: 220 GGKRFSQCLVPFHTDVSVSSKMSFGKGSKVSGKGVVSTPLVAKQD-KTPYFVTLLGISVE 278

Query: 106 GDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSP-TDGVAL 164
              L  + ++  +++   G + +DSGT  T L T+ Y+ +  A VR   A+ P TD   L
Sbjct: 279 NTYLHFNGSSQNVEK---GNMFLDSGTPPTILPTQLYDQVV-AQVRSEVAMKPVTDDPDL 334

Query: 165 F-DTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSII 223
               CY   +++++  P ++ HF    V   P + ++ P D  G FC  F  TSS   + 
Sbjct: 335 GPQLCY--RTKNNLRGPVLTAHFEGADVKLSPTQTFISPKD--GVFCLGFTNTSSDGGVY 390

Query: 224 GNVQQQGTRVSFNLRNSLIGFTPNKC 249
           GN  Q    + F+L   ++ F P  C
Sbjct: 391 GNFAQSNYLIGFDLDRQVVSFKPKDC 416


>gi|242044812|ref|XP_002460277.1| hypothetical protein SORBIDRAFT_02g025885 [Sorghum bicolor]
 gi|241923654|gb|EER96798.1| hypothetical protein SORBIDRAFT_02g025885 [Sorghum bicolor]
          Length = 369

 Score = 96.3 bits (238), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 59/172 (34%), Positives = 89/172 (51%), Gaps = 10/172 (5%)

Query: 82  TAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTET 141
           T PLL N    + YY+ +TGI VG  ++PI   A   D +   G ++DSGT  TRL    
Sbjct: 201 TTPLLANPHRSSLYYVNMTGIRVGRKVVPIPPPALAFDPATGAGTVLDSGTMFTRLVAPA 260

Query: 142 YNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLI 201
           Y A+RD   R  R  +P   +  FDTC++    ++V  P V+  F +G  + LP +N +I
Sbjct: 261 YVAVRDEVRR--RVGAPVSSLGGFDTCFNT---TAVAWPPVTLLF-DGMQVTLPEENVVI 314

Query: 202 PVDSNGTFCFAFAPT----SSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
                   C A A      ++ L++I ++QQQ  RV F++ N  +GF   +C
Sbjct: 315 HSTYGTISCLAMAAAPDGVNTVLNVIASMQQQNHRVLFDVPNGRVGFARERC 366


>gi|224074591|ref|XP_002304395.1| predicted protein [Populus trichocarpa]
 gi|222841827|gb|EEE79374.1| predicted protein [Populus trichocarpa]
          Length = 443

 Score = 96.3 bits (238), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 89/264 (33%), Positives = 133/264 (50%), Gaps = 23/264 (8%)

Query: 1   GDFVTETVTLGSAS-----VDNIAIGCGHNNEGLFVG-AAGLLGLGGGSLSFPSQINA-- 52
           G+  TE  T+GS S     +  I  GCG  N G F    +G++GLGGG+LS  SQ+++  
Sbjct: 185 GNLATEKFTIGSTSSRPVHLSPIVFGCGTGNGGTFDELGSGIVGLGGGALSLVSQLSSII 244

Query: 53  -STFSYCLV--DRDSDSTSTLEF--DSSLP-PNAVTAPLLRNHELDTFYYLGLTGISVGG 106
              FSYCLV     S+ TS ++F  DS +  P  V+ PL+ + + DT+YY+ L  ISVG 
Sbjct: 245 KGKFSYCLVPLSEQSNVTSKIKFGTDSVISGPQVVSTPLV-SKQPDTYYYVTLEAISVGN 303

Query: 107 DLLPISETAFKID-ESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALF 165
             LP +      + E GN  +I+DSGT +T L +E +  L        +A   +D   LF
Sbjct: 304 KRLPYTNGLLNGNVEKGN--VIIDSGTTLTFLDSEFFTELERVLEETVKAERVSDPRGLF 361

Query: 166 DTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGN 225
             C  F S   +++P ++ HF +  V   P  N  +  D +   CF    +S+ + I GN
Sbjct: 362 SVC--FRSAGDIDLPVIAVHFNDADVKLQPL-NTFVKADED-LLCFTMI-SSNQIGIFGN 416

Query: 226 VQQQGTRVSFNLRNSLIGFTPNKC 249
           + Q    V ++L    + F P  C
Sbjct: 417 LAQMDFLVGYDLEKRTVSFKPTDC 440


>gi|356555248|ref|XP_003545946.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 453

 Score = 96.3 bits (238), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 79/267 (29%), Positives = 117/267 (43%), Gaps = 21/267 (7%)

Query: 1   GDFVTETVTLG-----SASVDNIAIGCGH---NNEGLFVGAAGLLGLGGGSLSF---PSQ 49
           G F T+++T+G        ++N+ IGC     N         G+LGLG    SF    + 
Sbjct: 189 GFFGTDSITVGLTNGKQGKLNNLTIGCTKSMLNGVNFNEETGGILGLGFAKDSFIDKAAN 248

Query: 50  INASTFSYCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHEL---DTFYYLGLTGISVGG 106
              + FSYCLVD  S  + +         NA     +R  EL     FY + + GIS+GG
Sbjct: 249 KYGAKFSYCLVDHLSHRSVSSNLTIGGHHNAKLLGEIRRTELILFPPFYGVNVVGISIGG 308

Query: 107 DLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVAL-- 164
            +L I    +  D +  GG ++DSGT +T L    Y A+ +A  +    +    G     
Sbjct: 309 QMLKIPPQVW--DFNAEGGTLIDSGTTLTSLLLPAYEAVFEALTKSLTKVKRVTGEDFDA 366

Query: 165 FDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS--SLSI 222
            + C+D        VP + FHF  G     P K+Y+I V +    C    P       S+
Sbjct: 367 LEFCFDAEGFDDSVVPRLVFHFAGGARFEPPVKSYIIDV-APLVKCIGIVPIDGIGGASV 425

Query: 223 IGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           IGN+ QQ     F+L  + +GF P+ C
Sbjct: 426 IGNIMQQNHLWEFDLSTNTVGFAPSTC 452


>gi|222637180|gb|EEE67312.1| hypothetical protein OsJ_24552 [Oryza sativa Japonica Group]
          Length = 420

 Score = 95.9 bits (237), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 78/261 (29%), Positives = 117/261 (44%), Gaps = 29/261 (11%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
           G   TET+ +G AS  ++A GC   N GL     G L LG G            FSYCL 
Sbjct: 174 GYLATETLKVGDASFPSVAFGCSTEN-GL-----GQLDLGVGR-----------FSYCLR 216

Query: 61  DRDSDSTSTLEFDSSL---PPNAVTAPLLRNHELD-TFYYLGLTGISVGGDLLPISETAF 116
              +   S + F S       N  + P + N  +  ++YY+ LTGI+VG   LP++ + F
Sbjct: 217 SGSAAGASPILFGSLANLTDGNVQSTPFVNNPAVHPSYYYVNLTGITVGETDLPVTTSTF 276

Query: 117 KIDESG-NGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYD--FSS 173
              ++G  GG IVDSGT +T L  + Y  ++ AF+  T  ++  +G    D C+      
Sbjct: 277 GFTQNGLGGGTIVDSGTTLTYLAKDGYEMVKQAFLSQTADVTTVNGTRGLDLCFKSTGGG 336

Query: 174 RSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTF---CFAFAPTS--SSLSIIGNVQQ 228
              + VP++   F  G    +P     +  DS G+    C    P      +S+IGNV Q
Sbjct: 337 GGGIAVPSLVLRFDGGAEYAVPTYFAGVETDSQGSVTVACLMMLPAKGDQPMSVIGNVMQ 396

Query: 229 QGTRVSFNLRNSLIGFTPNKC 249
               + ++L   +  F P  C
Sbjct: 397 MDMHLLYDLDGGIFSFAPADC 417


>gi|56784779|dbj|BAD82000.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
           Group]
          Length = 486

 Score = 95.9 bits (237), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 73/250 (29%), Positives = 112/250 (44%), Gaps = 10/250 (4%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
           G    +     +   D +  GC    EG      G++GLG G LS  SQ+    FSY L 
Sbjct: 193 GLLAVDAFAFATVRADGVIFGCAVATEG---DIGGVIGLGRGELSPVSQLQIGRFSYYLA 249

Query: 61  DRDS-DSTSTLEFDSSLPPN---AVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
             D+ D  S + F     P    AV+ PL+ +    + YY+ L GI V G+ L I    F
Sbjct: 250 PDDAVDVGSFILFLDDAKPRTSRAVSTPLVASRASRSLYYVELAGIRVDGEDLAIPRGTF 309

Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVAL-FDTCYDFSSRS 175
            +   G+GG+++     VT L    Y  +R A       L   DG  L  D CY   S +
Sbjct: 310 DLQADGSGGVVLSITIPVTFLDAGAYKVVRQAMASKIE-LRAADGSELGLDLCYTSESLA 368

Query: 176 SVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT-SSSLSIIGNVQQQGTRVS 234
           + +VP+++  F  G V+ L   NY     + G  C    P+ +   S++G++ Q GT + 
Sbjct: 369 TAKVPSMALVFAGGAVMELEMGNYFYMDSTTGLECLTILPSPAGDGSLLGSLIQVGTHMI 428

Query: 235 FNLRNSLIGF 244
           +++  S + F
Sbjct: 429 YDISGSRLVF 438


>gi|356500756|ref|XP_003519197.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 451

 Score = 95.9 bits (237), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 87/256 (33%), Positives = 124/256 (48%), Gaps = 14/256 (5%)

Query: 3   FVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ---INASTFSYCL 59
            V +++ LG  ++ + A GC ++  G  + A GLLGLG G LS PSQ   + +  FSYCL
Sbjct: 200 LVQDSLRLGIDTLPSYAFGCVNSASGWTLPAQGLLGLGRGPLSLPSQSSKLYSGIFSYCL 259

Query: 60  VDRDSDSTS-TLEFDSSLPPNAV-TAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFK 117
               S   S +L+   +  P  + T PLL+N    + YY+ LTG++VG   +P+      
Sbjct: 260 PSFQSSYFSGSLKLGPTGQPRRIRTTPLLQNPRRPSLYYVNLTGVTVGRVKVPLPIEYLA 319

Query: 118 IDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSV 177
            D +   G I+DSGT +TR     Y+A+RD F    +   P      FDTC  F      
Sbjct: 320 FDPNKGSGTILDSGTVITRFVGPVYSAIRDEFRNQVKG--PFFSRGGFDTC--FVKTYEN 375

Query: 178 EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP----TSSSLSIIGNVQQQGTRV 233
             P +   F  G  + LP +N LI     G  C A A      +S L++I N QQQ  RV
Sbjct: 376 LTPLIKLRF-TGLDVTLPYENTLIHTAYGGMACLAMAAAPNNVNSVLNVIANYQQQNLRV 434

Query: 234 SFNLRNSLIGFTPNKC 249
            F+  N+ +G     C
Sbjct: 435 LFDTVNNRVGIARELC 450


>gi|296090179|emb|CBI39998.3| unnamed protein product [Vitis vinifera]
          Length = 334

 Score = 95.9 bits (237), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 79/249 (31%), Positives = 112/249 (44%), Gaps = 23/249 (9%)

Query: 14  SVDNIAIGCGHNNEGLF-VGAAGLLGLGGGSLSFPSQINAS-----TFSYCLVDRDSDS- 66
           S+ NI  GCGHNN G F     GL G GG  LS  SQI ++      FS CLV   +D  
Sbjct: 92  SILNIVFGCGHNNSGTFNENEMGLFGTGGRPLSLTSQIMSTLGSGRKFSQCLVPFRTDPS 151

Query: 67  -TSTLEFDSSLP---PNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESG 122
            TS + F         + V+ PL+   +  T+Y++ L GISVG  L P S ++     + 
Sbjct: 152 ITSKIIFGPEAEVSGSDVVSTPLVTKDD-PTYYFVTLDGISVGDKLFPFSSSS---PMAT 207

Query: 123 NGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSS--VEVP 180
            G + +D+GT  T L  + YN L    V+G +   P + V   D       RS+  ++ P
Sbjct: 208 KGNVFIDAGTPPTLLPRDFYNRL----VQGVKEAIPMEPVQDPDLQPQLCYRSATLIDGP 263

Query: 181 TVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNS 240
            ++ HF    V   P   ++ P    G +CFA  P      I GN  Q    + F+L   
Sbjct: 264 ILTAHFDGADVQLKPLNTFISP--KEGVYCFAMQPIDGDTGIFGNFVQMNFLIGFDLDGK 321

Query: 241 LIGFTPNKC 249
            + F    C
Sbjct: 322 KVSFKAVDC 330


>gi|356503843|ref|XP_003520712.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 474

 Score = 95.9 bits (237), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 75/242 (30%), Positives = 105/242 (43%), Gaps = 27/242 (11%)

Query: 34  AGLLGLGGGSLSFPSQINASTFSYCLVDRDSDSTS-----TLEFDSS--LPPNAVTA--- 83
           +G+ G G G  S PSQ+N   FSYCLV    D T       L+  S+     N ++    
Sbjct: 230 SGIAGFGRGQESLPSQMNLKRFSYCLVSHRFDDTPQSSDLVLQISSTGDTKTNGLSYTPF 289

Query: 84  ---PLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTE 140
              P   N     +YYL L  + VGG  + I  T  +    GNGG IVDSG+  T ++  
Sbjct: 290 RSNPSTNNPAFKEYYYLTLRKVIVGGKDVKIPYTFLEPGSDGNGGTIVDSGSTFTFMERP 349

Query: 141 TYNALRDAFVRG-----TRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLP 195
            YN +   FV+      +RA        L   C++ S   +V  P ++F F  G  +  P
Sbjct: 350 VYNLVAQEFVKQLEKNYSRAEDAETQSGL-SPCFNISGVKTVTFPELTFKFKGGAKMTQP 408

Query: 196 AKNYLIPVDSNGTFCF-------AFAPTSSSLSII-GNVQQQGTRVSFNLRNSLIGFTPN 247
            +NY   V      C        A  P ++  +II GN QQQ   + ++L N   GF P 
Sbjct: 409 LQNYFSLVGDAEVVCLTVVSDGGAGPPKTTGPAIILGNYQQQNFYIEYDLENERFGFGPR 468

Query: 248 KC 249
            C
Sbjct: 469 SC 470


>gi|125528357|gb|EAY76471.1| hypothetical protein OsI_04407 [Oryza sativa Indica Group]
          Length = 441

 Score = 95.5 bits (236), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 89/284 (31%), Positives = 119/284 (41%), Gaps = 47/284 (16%)

Query: 1   GDFVTETVTLGSASVDNIAIGC---GHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSY 57
           G   TE  T+G       A GC     +     V  AGLLG+  G+LSF SQ +   FSY
Sbjct: 159 GALATEVFTVGQGPPLRAAFGCMATAFDTSPDGVATAGLLGMNRGALSFVSQASTRRFSY 218

Query: 58  CLVDRDSDSTSTLEFDSSLP--PNAVTAPLLRNHELDTF----YYLGLTGISVGGDLLPI 111
           C+ DRD D+   L   S LP  P   T        L  F    Y + L GI VGG  LPI
Sbjct: 219 CISDRD-DAGVLLLGHSDLPFLPLNYTPLYQPAMPLPYFDRVAYSVQLLGIRVGGKPLPI 277

Query: 112 SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDF 171
             +    D +G G  +VDSGT  T L  + Y+AL+  F R T+   P    AL D  + F
Sbjct: 278 PASVLAPDHTGAGQTMVDSGTQFTFLLGDAYSALKAEFSRQTKPWLP----ALNDPNFAF 333

Query: 172 SSRSSVEVPTVSFHFPEGKVLP--LPAKN----------------YLIPVDS---NGTFC 210
                 E     F  P+G+  P  LPA                  Y +P +    +G +C
Sbjct: 334 Q-----EAFDTCFRVPQGRAPPARLPAVTLLFNGAQMTVAGDRLLYKVPGERRGGDGVWC 388

Query: 211 FAF-----APTSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
             F      P ++   +IG+  Q    V ++L    +G  P +C
Sbjct: 389 LTFGNADMVPITA--YVIGHHHQMNVWVEYDLERGRVGLAPIRC 430


>gi|326503602|dbj|BAJ86307.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 461

 Score = 95.5 bits (236), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 85/277 (30%), Positives = 120/277 (43%), Gaps = 31/277 (11%)

Query: 1   GDFVTETVTLGSASVDNIAIGC---GHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSY 57
           G   T+   +GS      A GC     ++    V +AGLLG+  G+LSF SQ +   FSY
Sbjct: 177 GALATDVFAVGSGPPLRAAFGCMSSAFDSSPDGVASAGLLGMNRGALSFVSQASTRRFSY 236

Query: 58  CLVDRDSDSTSTL---EFDSSLPPN--AVTAPLLRNHELDTFYY-LGLTGISVGGDLLPI 111
           C+ DRD      L   +  + LP N   +  P L     D   Y + L GI VGG  LPI
Sbjct: 237 CISDRDDAGVLLLGHSDLPTFLPLNYTPMYQPALPLPYFDRVAYSVQLLGIRVGGKHLPI 296

Query: 112 SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPT-DGVAL-----F 165
             +    D +G G  +VDSGT  T L  + Y+AL+  F R  R L P  D  +      F
Sbjct: 297 PASVLAPDHTGAGQTMVDSGTQFTFLLGDAYSALKAEFTRQARPLLPALDDPSFAFQEAF 356

Query: 166 DTCYDF---SSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVD-----SNGTFCFAF---- 213
           DTC+      S  +  +P V+  F  G  + +     L  V       +G +C  F    
Sbjct: 357 DTCFRVPQGRSPPTARLPGVTLLF-NGAEMAVAGDRLLYKVPGERRGGDGVWCLTFGNAD 415

Query: 214 -APTSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
             P  +   +IG+  Q    V ++L    +G  P +C
Sbjct: 416 MVPIMA--YVIGHHHQMNVWVEYDLERGRVGLAPVRC 450


>gi|297819968|ref|XP_002877867.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297323705|gb|EFH54126.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 469

 Score = 95.5 bits (236), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 77/278 (27%), Positives = 117/278 (42%), Gaps = 35/278 (12%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
           G  ++E +     +V +  +GC   +       AG+ G G G  S PSQ+   +FS+CLV
Sbjct: 196 GILISEKLDFPDLTVPDFVVGCSVISTRT---PAGIAGFGRGPESLPSQMKLKSFSHCLV 252

Query: 61  DR---DSDSTSTLEFDS-------SLPPNAVTAPLLRNHELDT-----FYYLGLTGISVG 105
            R   D++ T+ L  D+       S  P     P  +N  +       +YYL L  I VG
Sbjct: 253 SRRFDDTNVTTDLGLDTGSGHKSGSKTPGLSYTPFRKNPNVSNTAFLEYYYLNLRRIYVG 312

Query: 106 GDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGT------RALSPT 159
              + I         +GNGG IVDSG+  T ++   +  + + F          + L   
Sbjct: 313 SKHVKIPYKFLAPGTNGNGGSIVDSGSTFTFMERPVFELVAEEFATQMSNYTREKDLEKV 372

Query: 160 DGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS- 218
            G+A    C++ S +  V VP + F F  G  + LP  NY   V +  T C      ++ 
Sbjct: 373 SGIA---PCFNISGKGDVTVPELIFEFKGGAKMELPLSNYFSFVGNADTVCLTVVSDNTV 429

Query: 219 -------SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
                     I+G+ QQQ   V ++L N   GF   KC
Sbjct: 430 NPGGGTGPAIILGSFQQQNYLVEYDLENDRFGFAKKKC 467


>gi|115441003|ref|NP_001044781.1| Os01g0844500 [Oryza sativa Japonica Group]
 gi|19571042|dbj|BAB86469.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
           Group]
 gi|20160609|dbj|BAB89555.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
           Group]
 gi|113534312|dbj|BAF06695.1| Os01g0844500 [Oryza sativa Japonica Group]
 gi|125572614|gb|EAZ14129.1| hypothetical protein OsJ_04051 [Oryza sativa Japonica Group]
          Length = 442

 Score = 95.5 bits (236), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 89/284 (31%), Positives = 119/284 (41%), Gaps = 47/284 (16%)

Query: 1   GDFVTETVTLGSASVDNIAIGC---GHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSY 57
           G   TE  T+G       A GC     +     V  AGLLG+  G+LSF SQ +   FSY
Sbjct: 160 GALATEVFTVGQGPPLRAAFGCMATAFDTSPDGVATAGLLGMNRGALSFVSQASTRRFSY 219

Query: 58  CLVDRDSDSTSTLEFDSSLP--PNAVTAPLLRNHELDTF----YYLGLTGISVGGDLLPI 111
           C+ DRD D+   L   S LP  P   T        L  F    Y + L GI VGG  LPI
Sbjct: 220 CISDRD-DAGVLLLGHSDLPFLPLNYTPLYQPAMPLPYFDRVAYSVQLLGIRVGGKPLPI 278

Query: 112 SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDF 171
             +    D +G G  +VDSGT  T L  + Y+AL+  F R T+   P    AL D  + F
Sbjct: 279 PASVLAPDHTGAGQTMVDSGTQFTFLLGDAYSALKAEFSRQTKPWLP----ALNDPNFAF 334

Query: 172 SSRSSVEVPTVSFHFPEGKVLP--LPAKN----------------YLIPVDS---NGTFC 210
                 E     F  P+G+  P  LPA                  Y +P +    +G +C
Sbjct: 335 Q-----EAFDTCFRVPQGRAPPARLPAVTLLFNGAQMTVAGDRLLYKVPGERRGGDGVWC 389

Query: 211 FAF-----APTSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
             F      P ++   +IG+  Q    V ++L    +G  P +C
Sbjct: 390 LTFGNADMVPITA--YVIGHHHQMNVWVEYDLERGRVGLAPIRC 431


>gi|15217764|ref|NP_176663.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|5042418|gb|AAD38257.1|AC006193_13 Hypothetical Protein [Arabidopsis thaliana]
 gi|332196174|gb|AEE34295.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 431

 Score = 95.5 bits (236), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 86/262 (32%), Positives = 125/262 (47%), Gaps = 21/262 (8%)

Query: 1   GDFVTETVTLGSA-----SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSL-SFPSQINAS- 53
           GD   +TVT+GS+     S+ N+ IGCGH N G F  A   +   GG   S  SQ+  S 
Sbjct: 175 GDVAVDTVTMGSSGRRPVSLRNMIIGCGHENTGTFDPAGSGIIGLGGGSTSLVSQLRKSI 234

Query: 54  --TFSYCLVDRDSDS--TSTLEFDSS--LPPNAVTAPLLRNHELDTFYYLGLTGISVGGD 107
              FSYCLV   S++  TS + F ++  +  + V +  +   +  T+Y+L L  ISVG  
Sbjct: 235 NGKFSYCLVPFTSETGLTSKINFGTNGIVSGDGVVSTSMVKKDPATYYFLNLEAISVGSK 294

Query: 108 LLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDT 167
            +  + T F    +G G I++DSGT +T L +  Y  L        +A    D   +   
Sbjct: 295 KIQFTSTIFG---TGEGNIVIDSGTTLTLLPSNFYYELESVVASTIKAERVQDPDGILSL 351

Query: 168 CYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQ 227
           CY  S  SS +VP ++ HF  G V  L   N  + V S    CFAFA  +  L+I GN+ 
Sbjct: 352 CYRDS--SSFKVPDITVHFKGGDV-KLGNLNTFVAV-SEDVSCFAFA-ANEQLTIFGNLA 406

Query: 228 QQGTRVSFNLRNSLIGFTPNKC 249
           Q    V ++  +  + F    C
Sbjct: 407 QMNFLVGYDTVSGTVSFKKTDC 428


>gi|413951979|gb|AFW84628.1| putative aspartic protease family protein [Zea mays]
          Length = 435

 Score = 95.5 bits (236), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 83/275 (30%), Positives = 119/275 (43%), Gaps = 28/275 (10%)

Query: 1   GDFVTETVTLGSASVDNIAIGC---GHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSY 57
           G   T+   +G A     A GC    +++    V  AGLLG+  G+LSF +Q +   FSY
Sbjct: 152 GALATDVFAVGDAPPLRSAFGCMSAAYDSSPDAVATAGLLGMNRGALSFVTQASTRRFSY 211

Query: 58  CLVDRDSDSTSTLEFDSSLP--PNAVTAPLLRNHELDTF----YYLGLTGISVGGDLLPI 111
           C+ DRD D+   L   S LP  P   T        L  F    Y + L GI VGG  LPI
Sbjct: 212 CISDRD-DAGVLLLGHSDLPFLPLNYTPLYQPTPPLPYFDRVAYSVQLLGIRVGGKPLPI 270

Query: 112 SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPT------DGVALF 165
             +    D +G G  +VDSGT  T L  + Y+A++  F++ T+ L P            F
Sbjct: 271 PPSVLAPDHTGAGQTMVDSGTQFTFLLGDAYSAVKAEFLKQTKPLLPALEDPSFAFQEAF 330

Query: 166 DTCYDFSS---RSSVEVPTVSFHFPEGKVLPLPAKNYLIPVD-----SNGTFCFAFAPTS 217
           DTC+         S  +P V+  F  G  + +     L  V      ++G +C  F    
Sbjct: 331 DTCFRVPKGRPPPSARLPPVTLLF-NGAQMSVAGDRLLYKVPGERRGADGVWCLTFGNAD 389

Query: 218 S---SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
               +  +IG+  Q    V ++L    +G  P KC
Sbjct: 390 MVPLTAYVIGHHHQMNLWVEYDLERGRVGLAPVKC 424


>gi|356532674|ref|XP_003534896.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 446

 Score = 95.5 bits (236), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 82/262 (31%), Positives = 119/262 (45%), Gaps = 25/262 (9%)

Query: 3   FVTETVTLGSASVDNIAIGCGHN---NEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
            V ET   G++ + ++ IGCGHN   N     G  G+LGL  G  S  +QI    FSYC+
Sbjct: 190 LVFETTDEGTSQISDVIIGCGHNIGFNSD--PGYNGILGLNNGPNSLATQI-GRKFSYCI 246

Query: 60  --VDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFK 117
             +     + + L           + P    H    FYY+ + GISVG   L I+   F+
Sbjct: 247 GNLADPYYNYNQLRLGEGADLEGYSTPFEVYH---GFYYVTMEGISVGEKRLDIALETFE 303

Query: 118 IDESGNGGIIVDSGTAVTRL----QTETYNALRDAFVRGTRALSPTDGVALFDTC-YDFS 172
           +  +G GG+I+DSGT +T L        YN +R+      R +   +  A +  C Y   
Sbjct: 304 MKRNGTGGVILDSGTTITYLVDSAHKLLYNEVRNLLKWSFRQVIFEN--APWKLCYYGII 361

Query: 173 SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP-----TSSSLSIIGNVQ 227
           SR  V  P V+FHF +G  L L   ++    D    FC   +P     T+ S S+IG + 
Sbjct: 362 SRDLVGFPVVTFHFVDGADLALDTGSFFSQRDD--IFCMTVSPASILNTTISPSVIGLLA 419

Query: 228 QQGTRVSFNLRNSLIGFTPNKC 249
           QQ   V ++L N  + F    C
Sbjct: 420 QQSYNVGYDLVNQFVYFQRIDC 441


>gi|356546376|ref|XP_003541602.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 450

 Score = 95.5 bits (236), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 87/264 (32%), Positives = 124/264 (46%), Gaps = 18/264 (6%)

Query: 1   GDFVTETVTLGS---ASVD--NIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS-- 53
           GD   ET+TLGS   +SV   N  IGCGHNN+G F G    +   GG         +S  
Sbjct: 187 GDLSVETLTLGSTNGSSVQFPNTVIGCGHNNKGTFQGEGSGVVGLGGGPVSLISQLSSSI 246

Query: 54  --TFSYCLVDR--DSDSTSTLEF-DSSLPPN--AVTAPLLRNHELDTFYYLGLTGISVGG 106
              FSYCL      S+S+S L F D+++     AV+ PL+     + FYYL L   SVG 
Sbjct: 247 GGKFSYCLAPMFSQSNSSSKLNFGDAAVVSGLGAVSTPLVSKTGSEVFYYLTLEAFSVGD 306

Query: 107 DLLP-ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALF 165
             +  +  ++     +G G II+DSGT +T L  E Y+ L  A     +A   +D     
Sbjct: 307 KRIEFVGGSSSSGSSNGEGNIIIDSGTTLTLLPQEDYSNLESAVADAIQANRVSDPSNFL 366

Query: 166 DTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGN 225
             CY  +    ++VP ++ HF    V   P   ++   +  G  CFAF  +S  +SI GN
Sbjct: 367 SLCYQTTPSGQLDVPVITAHFKGADVELNPISTFVQVAE--GVVCFAFH-SSEVVSIFGN 423

Query: 226 VQQQGTRVSFNLRNSLIGFTPNKC 249
           + Q    V ++L    + F P  C
Sbjct: 424 LAQLNLLVGYDLMEQTVSFKPTDC 447


>gi|242070719|ref|XP_002450636.1| hypothetical protein SORBIDRAFT_05g008470 [Sorghum bicolor]
 gi|241936479|gb|EES09624.1| hypothetical protein SORBIDRAFT_05g008470 [Sorghum bicolor]
          Length = 410

 Score = 95.1 bits (235), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 87/248 (35%), Positives = 123/248 (49%), Gaps = 22/248 (8%)

Query: 5   TETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLVDRDS 64
           +ET TLGS +V  I  GC   +EG +   +GL+GLG G LS  SQ+N   FSYCL   D+
Sbjct: 179 SETFTLGSDAVPGIGFGCTTMSEGGYGSGSGLVGLGRGPLSLVSQLNVGAFSYCLTS-DA 237

Query: 65  DSTSTLEFDSSLPPNA--VTAPLLRNHELDTFYY-LGLTGISVGGDLLPISETAFKIDES 121
             TS L F S     A   + PLLR     T+YY + L  IS+G         A     +
Sbjct: 238 AKTSPLLFGSGALTGAGVQSTPLLRT---STYYYTVNLESISIG---------AATTAGT 285

Query: 122 GNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPT 181
           G+ GII DSGT V  L    Y   ++A +  T  L+   G   ++ C+     S    P+
Sbjct: 286 GSSGIIFDSGTTVAFLAEPAYTLAKEAVLSQTTNLTMASGRDGYEVCFQ---TSGAVFPS 342

Query: 182 VSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNSL 241
           +  HF +G  + LP +NY   VD +   C+     S SLSI+GN+ Q    + +++  S+
Sbjct: 343 MVLHF-DGGDMDLPTENYFGAVD-DSVSCW-IVQKSPSLSIVGNIMQMNYHIRYDVEKSM 399

Query: 242 IGFTPNKC 249
           + F P  C
Sbjct: 400 LSFQPANC 407


>gi|356556809|ref|XP_003546713.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-1-like [Glycine max]
          Length = 444

 Score = 95.1 bits (235), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 88/264 (33%), Positives = 114/264 (43%), Gaps = 25/264 (9%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINA---STFSY 57
              V +TVTL +  V     GC     G  +   GLLGLG G LS  +Q      STFSY
Sbjct: 190 ASLVQDTVTLATDPVPAYTFGCIQKATGSSLPPQGLLGLGRGPLSLLAQTQKLYQSTFSY 249

Query: 58  CLVDRDSDSTSTLEFDSSL------PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPI 111
           CL      S  TL F           P     P  +N    + YY+ L  I VG  ++ I
Sbjct: 250 CL-----PSFKTLNFSGHXDLXPVAQPRDQVYPSFKNPRRSSLYYVNLVAIRVGRRIVDI 304

Query: 112 SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVAL--FDTCY 169
              A   +     G + DSGT  TRL    Y A+R+ F R           +L  FDTCY
Sbjct: 305 PPEALAFNPXTGAGTVFDSGTVFTRLVEPAYTAVRNEFRRRVSVHKKLTVTSLGGFDTCY 364

Query: 170 DFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP----TSSSLSIIGN 225
                  +  PT++F F  G  + LP  N LI   +    C A AP     +S L++I N
Sbjct: 365 TV----PIVAPTITFMF-SGMNVTLPPDNILIHSTAGSVTCLAMAPAPDNVNSVLNVIAN 419

Query: 226 VQQQGTRVSFNLRNSLIGFTPNKC 249
           +QQQ  RV F++ NS +G     C
Sbjct: 420 MQQQNHRVLFDVPNSRLGVARELC 443


>gi|195626958|gb|ACG35309.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
          Length = 450

 Score = 95.1 bits (235), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 68/212 (32%), Positives = 106/212 (50%), Gaps = 16/212 (7%)

Query: 44  LSFPSQINASTFSYCLVDRDS-DSTSTLEFDSSLPPNAV-TAPLLRNHELDTFYYLGLTG 101
           LS    +  +TFSYCL    S + + TL    +  P  + T PLL N    + YY+ +TG
Sbjct: 246 LSQTKDMYEATFSYCLPSFKSLNFSGTLRLGRNGQPQRIKTTPLLANPHRSSLYYVNMTG 305

Query: 102 ISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDG 161
           + VG  ++PI       D +   G ++DSGT  TRL    Y A+RD   R  R  +P   
Sbjct: 306 VRVGRKVVPIPA----FDPATGAGTVLDSGTMFTRLVAPAYVAVRDEVRR--RVGAPVSS 359

Query: 162 VALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT----S 217
           +  FDTC++    ++V  P ++  F +G  + LP +N +I        C A A      +
Sbjct: 360 LGGFDTCFN---TTAVAWPPMTLLF-DGMQVTLPEENVVIHSTYGTISCLAMAAAPDGVN 415

Query: 218 SSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           + L++I ++QQQ  RV F++ N  +GF   +C
Sbjct: 416 TVLNVIASMQQQNHRVLFDVPNGRVGFARERC 447


>gi|449462551|ref|XP_004149004.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
 gi|449515029|ref|XP_004164552.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 434

 Score = 95.1 bits (235), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 79/260 (30%), Positives = 114/260 (43%), Gaps = 16/260 (6%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFP-SQIN-----AST 54
           GD  ++ +T+GS  +    IGCGH N G F G    +   GG      SQ+         
Sbjct: 179 GDLASDQITIGSFKLPKTVIGCGHQNGGTFGGVTSGIIGLGGGSLSLVSQMRTIAGVKPR 238

Query: 55  FSYCLVD--RDSDSTSTLEFDSSLP---PNAVTAPLLRNHELDTFYYLGLTGISVGGDLL 109
           FSYCL     +++ T T+ F           V+ PL+     DTFY+L L  ISVG    
Sbjct: 239 FSYCLPTFFSNANITGTISFGRKAVVSGRQVVSTPLVPRSP-DTFYFLTLEAISVGKKRF 297

Query: 110 PISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCY 169
             +     +   GN  II+DSGT +T L    Y  +     R  +A    D   + + CY
Sbjct: 298 KAANGISAMTNHGN--IIIDSGTTLTLLPRSLYYGVFSTLARVIKAKRVDDPSGILELCY 355

Query: 170 DFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQ 229
                  + +P ++ HF  G  + L   N   PV  N T C  FAP ++ ++I GN+ Q 
Sbjct: 356 SAGQVDDLNIPIITAHFAGGADVKLLPVNTFAPVADNVT-CLTFAP-ATQVAIFGNLAQI 413

Query: 230 GTRVSFNLRNSLIGFTPNKC 249
              V ++L N  + F P  C
Sbjct: 414 NFEVGYDLGNKRLSFEPKLC 433


>gi|356528623|ref|XP_003532899.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 507

 Score = 95.1 bits (235), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 79/267 (29%), Positives = 116/267 (43%), Gaps = 21/267 (7%)

Query: 1   GDFVTETVTL-----GSASVDNIAIGCGHNNEG---LFVGAAGLLGLGGGSLSFPSQIN- 51
           G F T+T+T+         ++N+ IGC  + E          G+LGLG    SF  +   
Sbjct: 243 GFFGTDTITVDLKNGKEGKLNNLTIGCTKSMENGVNFNEDTGGILGLGFAKDSFIDKAAY 302

Query: 52  --ASTFSYCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHEL---DTFYYLGLTGISVGG 106
              + FSYCLVD  S    +         NA     ++  EL     FY + + GIS+GG
Sbjct: 303 EYGAKFSYCLVDHLSHRNVSSYLTIGGHHNAKLLGEIKRTELILFPPFYGVNVVGISIGG 362

Query: 107 DLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDG--VAL 164
            +L I    +  D +  GG ++DSGT +T L    Y  + +A ++    +    G     
Sbjct: 363 QMLKIPPQVW--DFNSQGGTLIDSGTTLTALLVPAYEPVFEALIKSLTKVKRVTGEDFGA 420

Query: 165 FDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS--SLSI 222
            D C+D        VP + FHF  G     P K+Y+I V +    C    P       S+
Sbjct: 421 LDFCFDAEGFDDSVVPRLVFHFAGGARFEPPVKSYIIDV-APLVKCIGIVPIDGIGGASV 479

Query: 223 IGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           IGN+ QQ     F+L  + IGF P+ C
Sbjct: 480 IGNIMQQNHLWEFDLSTNTIGFAPSIC 506


>gi|297803034|ref|XP_002869401.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
 gi|297315237|gb|EFH45660.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
          Length = 438

 Score = 94.7 bits (234), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 78/258 (30%), Positives = 118/258 (45%), Gaps = 36/258 (13%)

Query: 12  SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLVDRDSDSTSTLE 71
           SA++ ++  GCGH+N G  +   G+LGLG G  S   +   + FSYC    D        
Sbjct: 191 SAALHDVVFGCGHDNYGEPLVGTGILGLGYGEFSLVHRF-GTKFSYCFGSLD-------- 241

Query: 72  FDSSLPPNAV------------TAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF-KI 118
            D S P N +            T PL      + FYY+ +  ISV G +LPI    F + 
Sbjct: 242 -DPSYPHNVLVLGDDGANILGDTTPL---EIYNGFYYVTIEAISVDGIILPIDPWVFNRN 297

Query: 119 DESGNGGIIVDSGTAVTRLQTETYNALRDA---FVRGTRALSPTDGVALFDT-CYDFS-S 173
            ++G GG I+D+G ++T L  E Y  L++    +  G    +  +   +F   CY+ +  
Sbjct: 298 HQTGLGGTIIDTGNSLTSLVEEAYKPLKNKIEDYFEGRFTAADVNQDDMFKVECYNGNLE 357

Query: 174 RSSVE--VPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGT 231
           R  VE   P V+FHF +G  L L  K+  + +  N  FC A  P   +++ IG   QQ  
Sbjct: 358 RDLVESGFPIVTFHFSDGAELSLDVKSVFMKLSPN-VFCLAVTP--GNMNSIGATAQQSY 414

Query: 232 RVSFNLRNSLIGFTPNKC 249
            + ++L    I F    C
Sbjct: 415 NIGYDLEAKKISFERIDC 432


>gi|449440014|ref|XP_004137780.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
 gi|449483406|ref|XP_004156582.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
          Length = 449

 Score = 94.4 bits (233), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 85/276 (30%), Positives = 124/276 (44%), Gaps = 29/276 (10%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHN----NEGLFVGAAGLLGLGGGSLSFPSQINASTFS 56
           G+  T+T  +GS+ + N+  GC  +    N        GL+G+  GSLSF SQ+    FS
Sbjct: 165 GNLATDTFYIGSSGIPNVVFGCMDSIFSSNSEEDSKNTGLMGMNRGSLSFVSQMGFPKFS 224

Query: 57  YCLVDRDSDSTSTL---EFDSSLPPNAVTAPLLRNHELDTF----YYLGLTGISVGGDLL 109
           YC+ + D      L    F S L P   T  +  +  L  F    Y + L GI V   LL
Sbjct: 225 YCISEYDFSGLLLLGDANF-SWLAPLNYTPLIEMSTPLPYFDRVAYTVQLEGIKVAHKLL 283

Query: 110 PISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGT----RALSPTDGV--A 163
           PI E+ F+ D +G G  +VDSGT  T L    Y ALRD F+  T    R    ++ V   
Sbjct: 284 PIPESVFEPDHTGAGQTMVDSGTQFTFLLGPAYTALRDHFLNKTAGSLRVYEDSNFVFQG 343

Query: 164 LFDTCYDFSSRSSV--EVPTVSFHFPEGKVLPLPAKN--YLIPVDSNGT---FCFAFAPT 216
             D CY   +  +    +P+V+  F  G  + +      Y +P +  G     CF F  +
Sbjct: 344 AMDLCYRVPTNQTRLPPLPSVTLVF-RGAEMTVTGDRILYRVPGERRGNDSIHCFTFGNS 402

Query: 217 S---SSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
                   +IG++ QQ   + F+L+ S IG    +C
Sbjct: 403 DLLGVEAFVIGHLHQQNVWMEFDLKKSRIGLAEIRC 438


>gi|326523839|dbj|BAJ96930.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 473

 Score = 94.4 bits (233), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 76/249 (30%), Positives = 122/249 (48%), Gaps = 17/249 (6%)

Query: 12  SASVDNIAIGCGHNNEGL-FVGAAGLLGLGGGSLSFPSQINAS---TFSYCLVDR--DSD 65
            A +  + +GC  + +G  F  + G+L LG  ++SF S+  A     FSYCLVD     +
Sbjct: 229 KAKLQEVVLGCTTSYDGQSFQSSDGVLSLGNSNISFASRAAARFGGRFSYCLVDHLAPRN 288

Query: 66  STSTLEFD---SSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESG 122
           +TS L F    ++  P+    PLL + ++  FY + +  +SV G  L I    + + +  
Sbjct: 289 ATSYLTFGPVGAAHSPS--RTPLLLDAQVAPFYAVTVDAVSVAGKALNIPAEVWDVKK-- 344

Query: 123 NGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDF-SSRSSVEVPT 181
           NGG I+DSGT++T L T  Y A+  A  +   A  P   +  F+ CY++ ++R    VP 
Sbjct: 345 NGGAILDSGTSLTILATPAYKAVVAALSKQL-ARVPRVTMDPFEYCYNWTATRRPPAVPR 403

Query: 182 VSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP-TSSSLSIIGNVQQQGTRVSFNLRNS 240
           +   F     L  P K+Y+I   + G  C          +S+IGN+ QQ     F+L N 
Sbjct: 404 LEVRFAGSARLRPPTKSYVIDA-APGVKCIGLQEGVWPGVSVIGNILQQEHLWEFDLANR 462

Query: 241 LIGFTPNKC 249
            + F  ++C
Sbjct: 463 WLRFQESRC 471


>gi|449458942|ref|XP_004147205.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
 gi|449505000|ref|XP_004162350.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
          Length = 480

 Score = 94.4 bits (233), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 87/272 (31%), Positives = 113/272 (41%), Gaps = 43/272 (15%)

Query: 14  SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN------ASTFSYCLVDRDSDST 67
           +V N   GC H   G  VG AG    G G LS PSQ+        + FSYCLV   S + 
Sbjct: 205 NVRNFTFGCAHTTLGEPVGVAGF---GRGVLSMPSQLATFSPQLGNRFSYCLVSH-SFAA 260

Query: 68  STLEFDSSL--------PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKID 119
             +   S L            +   LL N +   FY +GL GISVG   +P  E   K+D
Sbjct: 261 DRVRRPSPLILGRYYTGETEFIYTSLLENPKHPYFYSVGLAGISVGNIRIPAPEFLTKVD 320

Query: 120 ESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGT-----RALSPTDGVALFDTCYDFSSR 174
           E G+GG++VDSGT  T L    Y ++   F   T     RA    +   L   CY +   
Sbjct: 321 EGGSGGVVVDSGTTFTMLPAGLYESVVAEFENRTGKVANRARRIEENTGL-SPCYYY--E 377

Query: 175 SSVEVPTVSFHFP-EGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSI----------- 222
           +SV VP V  HF  E   + LP KNY       G            L +           
Sbjct: 378 NSVGVPRVVLHFVGEKSNVVLPRKNYFYEFLDGGDGVVGRKRKVGCLMLMNGGDEAELAG 437

Query: 223 -----IGNVQQQGTRVSFNLRNSLIGFTPNKC 249
                +GN QQQG  V ++L  + +GF   +C
Sbjct: 438 GPGATLGNYQQQGFEVVYDLEKNRVGFARRQC 469


>gi|302786560|ref|XP_002975051.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
 gi|300157210|gb|EFJ23836.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
          Length = 359

 Score = 94.4 bits (233), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 83/248 (33%), Positives = 116/248 (46%), Gaps = 27/248 (10%)

Query: 16  DNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSYCLVDRDS--DSTSTL 70
           D    GCG   +G +    GL+GLG  S S   Q+       FSYCLV  DS   + S L
Sbjct: 120 DGFLFGCGRKLKGDWNFTQGLIGLGQKSHSLIQQLGDKLGYKFSYCLVSYDSPPSAKSFL 179

Query: 71  EFDSSLP---PNAVTAPLLRNHELD-TFYYLGLTGISVGGDLLPISETAFKIDESGNG-- 124
              SS      + V+ P+L    LD T YY+ L  I+VGG  +P+        ESG+   
Sbjct: 180 FLGSSAALRGHDVVSTPILHGDHLDQTLYYVDLQSITVGG--VPV---VVYDKESGHNTS 234

Query: 125 -------GIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDG-VALFDTCYDFSSRSS 176
                    ++DSGT  T L    Y A+R +     + + PT G  A  D C++ S  +S
Sbjct: 235 VGPFLANKTVIDSGTTYTLLTPPVYEAMRKSIEE--QVILPTLGNSAGLDLCFNSSGDTS 292

Query: 177 VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFN 236
              P+V+F+F     L LP +N +  V S    C +   +   LSIIGN+QQQ   + ++
Sbjct: 293 YGFPSVTFYFANQVQLVLPFEN-IFQVTSRDVVCLSMDSSGGDLSIIGNMQQQNFHILYD 351

Query: 237 LRNSLIGF 244
           L  S I F
Sbjct: 352 LVASQISF 359


>gi|125595861|gb|EAZ35641.1| hypothetical protein OsJ_19928 [Oryza sativa Japonica Group]
          Length = 629

 Score = 94.4 bits (233), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 81/245 (33%), Positives = 110/245 (44%), Gaps = 32/245 (13%)

Query: 1   GDFVTETVTLGSASV-DNIAIGCGHNNEG--LFVGAAGLLGLGGGSLSFPSQINAS---T 54
           G +  + +TLG   V      GC H + G       AG L LGGGS S   Q        
Sbjct: 156 GTYSFDDLTLGPYDVIRGFRFGCAHADRGSAFDYDVAGSLALGGGSQSLVQQTATRYGRV 215

Query: 55  FSYCLVDRDSDSTSTLEF--------DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGG 106
           FSYCL      + S+L F         + L P+ V+ PLL +    TFY + L  I V G
Sbjct: 216 FSYCL----PPTASSLGFLVLGVPPERAQLIPSFVSTPLLSSSMAPTFYRVLLRAIIVAG 271

Query: 107 DLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFD 166
             L +    F          ++DS T ++RL    Y ALR AF            V++ D
Sbjct: 272 RPLAVPPAVFSASS------VIDSSTIISRLPPTAYQALRAAFRSAMTMYRAAPPVSILD 325

Query: 167 TCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSL--SIIG 224
           TCYDF+   S+ +P+++  F  G  + L A   L+     G+ C AFAPT+S      IG
Sbjct: 326 TCYDFTGVRSITLPSIALVFDGGATVNLDAAGILL-----GS-CLAFAPTASDRMPGFIG 379

Query: 225 NVQQQ 229
           NVQQ+
Sbjct: 380 NVQQK 384



 Score = 87.0 bits (214), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 67/206 (32%), Positives = 96/206 (46%), Gaps = 27/206 (13%)

Query: 55  FSYCLVDRDSDSTSTLEF--------DSSLPPNAVTAPLLRNHELD-TFYYLGLTGISVG 105
           FSYC+      S S+L F         ++L P  V+ PLL +  +  TFY + L  I V 
Sbjct: 440 FSYCI----PPSPSSLGFITLGVPPQRAALVPTFVSTPLLSSSSMPPTFYRVLLRAIIVA 495

Query: 106 GDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALF 165
           G  LP+  T F          ++ S T ++RL    Y ALR AF R          V++ 
Sbjct: 496 GRPLPVPPTVFSTSS------VIASTTVISRLPPTAYQALRAAFRRAMTMYRTAPPVSIL 549

Query: 166 DTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSL--SII 223
           DTCYDF+   S+ +P+++  F  G  + L A   L+     G  C AFAPT++      I
Sbjct: 550 DTCYDFTGVRSITLPSIALVFDGGATVNLDAAGILL----QG--CLAFAPTATDRMPGFI 603

Query: 224 GNVQQQGTRVSFNLRNSLIGFTPNKC 249
           GNVQQ+   V +++    I F    C
Sbjct: 604 GNVQQRTLEVVYDVPGKAIRFRSAAC 629


>gi|17979392|gb|AAL49921.1| unknown protein [Arabidopsis thaliana]
          Length = 439

 Score = 94.4 bits (233), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 85/269 (31%), Positives = 127/269 (47%), Gaps = 27/269 (10%)

Query: 1   GDFVTETVTLG-----SASVDNIAIGCGHNNEGL-FVGAAGLLGLGGGSLSFPS---QIN 51
           G F  ET+T+G      A +    IGC  +  G  F GA G+LGL     SF S    + 
Sbjct: 177 GVFAKETITVGLTNGRMARLPGHLIGCSSSFTGQSFQGADGVLGLAFSDFSFTSTATSLY 236

Query: 52  ASTFSYCLVDRDSDS--TSTLEFDSSLPPNAVTAPLLRNHELDT-----FYYLGLTGISV 104
            + FSYCLVD  S+   ++ L F SS    +      R   LD      FY + + GIS+
Sbjct: 237 GAKFSYCLVDHLSNKNVSNYLIFGSS---RSTKTAFRRTTPLDLTRIPPFYAINVIGISL 293

Query: 105 GGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPT--DGV 162
           G D+L I    +  D +  GG I+DSGT++T L    Y  +     R    L     +GV
Sbjct: 294 GYDMLDIPSQVW--DATSGGGTILDSGTSLTLLADAAYKQVVTGLARYLVELKRVKPEGV 351

Query: 163 ALFDTCYDFSSRSSV-EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS-SL 220
            + + C+ F+S  +V ++P ++FH   G       K+YL+   + G  C  F    + + 
Sbjct: 352 PI-EYCFSFTSGFNVSKLPQLTFHLKGGARFEPHRKSYLVDA-APGVKCLGFVSAGTPAT 409

Query: 221 SIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           ++IGN+ QQ     F+L  S + F P+ C
Sbjct: 410 NVIGNIMQQNYLWEFDLMASTLSFAPSAC 438


>gi|255566010|ref|XP_002523993.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223536720|gb|EEF38361.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 439

 Score = 94.4 bits (233), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 83/262 (31%), Positives = 127/262 (48%), Gaps = 21/262 (8%)

Query: 1   GDFVTETVTLGSAS-----VDNIAIGCGHNNEGLFVGAAGLLGLGGGS-LSFPSQINAS- 53
           G+   +T+TLGS S     +    IGCGHNN G F      +   GG  +S  SQ+ ++ 
Sbjct: 183 GNVAADTITLGSTSGRPVLLPKAIIGCGHNNGGSFTEKGSGIVGLGGGPISLISQLGSTI 242

Query: 54  --TFSYCLVDRDSDST--STLEFDSS--LPPNAVTAPLLRNHELDTFYYLGLTGISVGGD 107
              FSYCLV   S++T  S L F S+  +    V +  L + + DTFY+L L  +SVG +
Sbjct: 243 DGKFSYCLVPLSSNATNSSKLNFGSNGIVSGGGVQSTPLISKDPDTFYFLTLEAVSVGSE 302

Query: 108 LLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDT 167
            +    ++F   E   G II+DSGT +T    + ++ L  A           D   +   
Sbjct: 303 RIKFPGSSFGTSE---GNIIIDSGTTLTLFPEDFFSELSSAVQDAVAGTPVEDPSGILSL 359

Query: 168 CYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQ 227
           CY  S  + ++ P+++ HF +G  + L   N  + V S+   CFAF P +S  +I GN+ 
Sbjct: 360 CY--SIDADLKFPSITAHF-DGADVKLNPLNTFVQV-SDTVLCFAFNPINSG-AIFGNLA 414

Query: 228 QQGTRVSFNLRNSLIGFTPNKC 249
           Q    V ++L    + F P  C
Sbjct: 415 QMNFLVGYDLEGKTVSFKPTDC 436


>gi|225427552|ref|XP_002266498.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
          Length = 441

 Score = 94.4 bits (233), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 86/265 (32%), Positives = 122/265 (46%), Gaps = 23/265 (8%)

Query: 1   GDFVTETVTLGS-----ASVDNIAIGCGHNNEGLF-VGAAGLLGLGGGSLSFPSQINAST 54
           G+   ET+T+ S      S    A GC H + G+F   ++G++GLG   LS  SQ+ ++ 
Sbjct: 181 GNLAVETLTVASTAGKPVSFPGFAFGCVHRSGGIFDEHSSGIVGLGVAELSMISQLKSTI 240

Query: 55  ---FSYCL--VDRDSDSTSTLEFDSS---LPPNAVTAPLLRNHELDTFYYL-GLTGISVG 105
              FSYCL  V  DS  +S + F  S        V+ PL+     DT+YYL  L G SVG
Sbjct: 241 NGRFSYCLLPVFTDSSMSSRINFGRSGIVSGAGTVSTPLVMKGP-DTYYYLITLEGFSVG 299

Query: 106 GDLLPISETAF-KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVAL 164
              L  S   F K  E   G IIVDSGT  T L  E Y  L ++     +     D   +
Sbjct: 300 KKRL--SYKGFSKKAEVEEGNIIVDSGTTYTYLPLEFYVKLEESVAHSIKGKRVRDPNGI 357

Query: 165 FDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIG 224
              CY+ ++   ++ P ++ HF +  V   P   +L   +     CF   PT S + I+G
Sbjct: 358 SSLCYN-TTVDQIDAPIITAHFKDANVELQPWNTFLRMQED--LVCFTVLPT-SDIGILG 413

Query: 225 NVQQQGTRVSFNLRNSLIGFTPNKC 249
           N+ Q    V F+LR   + F    C
Sbjct: 414 NLAQVNFLVGFDLRKKRVSFKAADC 438


>gi|77808087|gb|AAS48510.2| aspartic protease [Fagopyrum esculentum]
 gi|82780908|gb|ABB88696.2| aspartic proteinase-like protein [Fagopyrum esculentum]
          Length = 447

 Score = 94.4 bits (233), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 70/238 (29%), Positives = 107/238 (44%), Gaps = 25/238 (10%)

Query: 35  GLLGLGGGSLSFPSQINASTFSYCLVDRDSDSTSTL--------EFDSSLPPNAVT-APL 85
           G+ G G G  S P+Q+  + FSYCLV    D T              +    N V  AP 
Sbjct: 207 GIAGFGRGLASIPAQLGLTKFSYCLVSHRFDDTPQSGDLVLHRGRRHADAAANGVAYAPF 266

Query: 86  LRNHELD---TFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETY 142
            ++  L     +YY+ L+ I VGG  +PI        + G+GG+IVDSG+  T ++   +
Sbjct: 267 TKSPALSPYSEYYYISLSKILVGGKDVPIPPRYLVPSKEGDGGMIVDSGSTFTFMERIIF 326

Query: 143 N----ALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKN 198
           +     L     +  RA    D   L   CY+ + +S V+VP ++F F  G  + LP  +
Sbjct: 327 DPVARELEKHMTKYKRAKEIEDSSGL-GPCYNITGQSEVDVPKLTFSFKGGANMDLPLTD 385

Query: 199 YLIPVDSNGTFCFAF-------APTSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           Y   V ++G  C            T+    I+GN QQQ   + ++L+    GF P +C
Sbjct: 386 YFSLV-TDGVVCMTVLTDPDEPGSTTGPAIILGNYQQQNFYIEYDLKKQRFGFKPQQC 442


>gi|218197465|gb|EEC79892.1| hypothetical protein OsI_21411 [Oryza sativa Indica Group]
          Length = 720

 Score = 94.4 bits (233), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 81/245 (33%), Positives = 110/245 (44%), Gaps = 32/245 (13%)

Query: 1   GDFVTETVTLGSASV-DNIAIGCGHNNEG--LFVGAAGLLGLGGGSLSFPSQINAS---T 54
           G +  + +TLG   V      GC H + G       AG L LGGGS S   Q        
Sbjct: 247 GTYSFDDLTLGPYDVIRGFRFGCAHADRGSAFDYDVAGSLALGGGSQSLVQQTATRYGRV 306

Query: 55  FSYCLVDRDSDSTSTLEF--------DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGG 106
           FSYCL      + S+L F         + L P+ V+ PLL +    TFY + L  I V G
Sbjct: 307 FSYCL----PPTASSLGFLVLGVPPERAQLIPSFVSTPLLSSSMAPTFYRVLLRAIIVAG 362

Query: 107 DLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFD 166
             L +    F          ++DS T ++RL    Y ALR AF            V++ D
Sbjct: 363 RPLAVPPAVFSASS------VIDSSTIISRLPPTAYQALRAAFRSAMTMYRAAPPVSILD 416

Query: 167 TCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSL--SIIG 224
           TCYDF+   S+ +P+++  F  G  + L A   L+     G+ C AFAPT+S      IG
Sbjct: 417 TCYDFTGVRSITLPSIALVFDGGATVNLDAAGILL-----GS-CLAFAPTASDRMPGFIG 470

Query: 225 NVQQQ 229
           NVQQ+
Sbjct: 471 NVQQK 475



 Score = 87.0 bits (214), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 67/206 (32%), Positives = 96/206 (46%), Gaps = 27/206 (13%)

Query: 55  FSYCLVDRDSDSTSTLEF--------DSSLPPNAVTAPLLRNHELD-TFYYLGLTGISVG 105
           FSYC+      S S+L F         ++L P  V+ PLL +  +  TFY + L  I V 
Sbjct: 531 FSYCI----PPSPSSLGFITLGVPPQRAALVPTFVSTPLLSSSSMPPTFYRVLLRAIIVA 586

Query: 106 GDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALF 165
           G  LP+  T F          ++ S T ++RL    Y ALR AF R          V++ 
Sbjct: 587 GRPLPVPPTVFSTSS------VIASTTVISRLPPTAYQALRAAFRRAMTMYRTAPPVSIL 640

Query: 166 DTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSL--SII 223
           DTCYDF+   S+ +P+++  F  G  + L A   L+     G  C AFAPT++      I
Sbjct: 641 DTCYDFTGVRSITLPSIALVFDGGATVNLDAAGILL----QG--CLAFAPTATDRMPGFI 694

Query: 224 GNVQQQGTRVSFNLRNSLIGFTPNKC 249
           GNVQQ+   V +++    I F    C
Sbjct: 695 GNVQQRTLEVVYDVPGKAIRFRSAAC 720


>gi|357482031|ref|XP_003611301.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355512636|gb|AES94259.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 481

 Score = 94.4 bits (233), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 81/298 (27%), Positives = 120/298 (40%), Gaps = 62/298 (20%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN------AST 54
            +   +T++L S  + N   GC H          G+ G G G LS P+Q++       + 
Sbjct: 185 ANLYQQTLSLSSLHLQNFTFGCAHT---ALAEPTGVAGFGRGILSLPAQLSTLSPHLGNR 241

Query: 55  FSYCLVD-----------------RDSDSTS------TLEFDSSLPPNAVTAPLLRNHEL 91
           FSYCLV                  R +D+ +      ++EF        V   +L N + 
Sbjct: 242 FSYCLVSHSFDGDRLRRPSPLILGRHNDTITGAGDGESVEF--------VYTSMLSNPKH 293

Query: 92  DTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAF-- 149
             +Y +GL GISVG   +P  E   ++DE GNGG++VDSGT  T L    YNA+ + F  
Sbjct: 294 PYYYCVGLAGISVGKRTVPAPEILKRVDEKGNGGMVVDSGTTFTMLPESFYNAVVNEFDK 353

Query: 150 --VRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFP-EGKVLPLPAKNYLIPVDSN 206
              R  +  S  +       CY  +  S  ++P +  HF      + LP KNY       
Sbjct: 354 RVNRFHKRASEIETKTGLGPCYYLNGLS--QIPVLKLHFVGNNSDVVLPRKNYFYEFMDG 411

Query: 207 G--------TFCFAFAPTSSSLSI-------IGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           G          C           +       +GN QQQG  V ++L    +GF   +C
Sbjct: 412 GDGIRRKGKVGCMMLMNGEDETELDGGPGATLGNYQQQGFEVVYDLEKERVGFAKKEC 469


>gi|30682289|ref|NP_187876.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|11994412|dbj|BAB02414.1| chloroplast nucleoid DNA binding protein-like [Arabidopsis
           thaliana]
 gi|332641715|gb|AEE75236.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 461

 Score = 94.4 bits (233), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 85/269 (31%), Positives = 127/269 (47%), Gaps = 27/269 (10%)

Query: 1   GDFVTETVTLG-----SASVDNIAIGCGHNNEGL-FVGAAGLLGLGGGSLSFPS---QIN 51
           G F  ET+T+G      A +    IGC  +  G  F GA G+LGL     SF S    + 
Sbjct: 199 GVFAKETITVGLTNGRMARLPGHLIGCSSSFTGQSFQGADGVLGLAFSDFSFTSTATSLY 258

Query: 52  ASTFSYCLVDRDSDS--TSTLEFDSSLPPNAVTAPLLRNHELDT-----FYYLGLTGISV 104
            + FSYCLVD  S+   ++ L F SS    +      R   LD      FY + + GIS+
Sbjct: 259 GAKFSYCLVDHLSNKNVSNYLIFGSS---RSTKTAFRRTTPLDLTRIPPFYAINVIGISL 315

Query: 105 GGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPT--DGV 162
           G D+L I    +  D +  GG I+DSGT++T L    Y  +     R    L     +GV
Sbjct: 316 GYDMLDIPSQVW--DATSGGGTILDSGTSLTLLADAAYKQVVTGLARYLVELKRVKPEGV 373

Query: 163 ALFDTCYDFSSRSSV-EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS-SL 220
            + + C+ F+S  +V ++P ++FH   G       K+YL+   + G  C  F    + + 
Sbjct: 374 PI-EYCFSFTSGFNVSKLPQLTFHLKGGARFEPHRKSYLVDA-APGVKCLGFVSAGTPAT 431

Query: 221 SIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           ++IGN+ QQ     F+L  S + F P+ C
Sbjct: 432 NVIGNIMQQNYLWEFDLMASTLSFAPSAC 460


>gi|449437856|ref|XP_004136706.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
          Length = 457

 Score = 94.0 bits (232), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 78/274 (28%), Positives = 112/274 (40%), Gaps = 31/274 (11%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
           G  ++ET+      + N  +GC   +       +G+ G G GS S PSQ+    F+YCL 
Sbjct: 189 GLLLSETLDFPDKKIPNFVVGCSFLS---IHQPSGIAGFGRGSESLPSQMGLKKFAYCLA 245

Query: 61  DR---DSDSTSTLEFDSS-LPPNAVTA------PLLRNHELDTFYYLGLTGISVGGDLLP 110
            R   DS  +  L  DS+ +  + +T       P + N+    +YYL +  I VG   + 
Sbjct: 246 SRKFDDSPHSGQLILDSTGVKSSGLTYTPFRQNPSVSNNAYKEYYYLNIRKIIVGNQAVK 305

Query: 111 ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR----GTRALSPTDGVAL-- 164
           +          GNGG I+DSG+  T +       +   F +     TRA   TD   L  
Sbjct: 306 VPYKFLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVAREFEKQLANWTRA---TDVETLTG 362

Query: 165 FDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFA---------P 215
              C+D S   SV+ P + F F  G    LP  NY   V S+G  C              
Sbjct: 363 LRPCFDISKEKSVKFPELIFQFKGGAKWALPLNNYFALVSSSGVACLTVVTHQMEDGGGG 422

Query: 216 TSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
                 I+G  QQQ   V ++L N  +GF    C
Sbjct: 423 GGGPSVILGAFQQQNFYVEYDLVNQRLGFRQQTC 456


>gi|224126551|ref|XP_002329582.1| predicted protein [Populus trichocarpa]
 gi|222870291|gb|EEF07422.1| predicted protein [Populus trichocarpa]
          Length = 440

 Score = 94.0 bits (232), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 87/267 (32%), Positives = 127/267 (47%), Gaps = 29/267 (10%)

Query: 1   GDFVTETVTL-----GSASVDNIAIGCGHNNEGLFVGA-AGLLGLGGGSLSFPSQINAST 54
           G+   +TVTL     G        IGCG  N G F    +G++GLGGG +S  SQ+ +S 
Sbjct: 182 GNLAVDTVTLPSTNGGPVYFPKTVIGCGRRNNGTFDKKDSGIIGLGGGPMSLISQMGSSV 241

Query: 55  ---FSYCLVDRDSDS---TSTLEF--DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGG 106
              FSYCLV   S+S   +S L F  ++ +  + V +  L +   DTFYYL L  +SVG 
Sbjct: 242 GGKFSYCLVPFSSESAGNSSKLHFGRNAVVSGSGVQSTPLISKNPDTFYYLTLEAMSVGD 301

Query: 107 DLLPISETAFKIDESGNGGIIVDSGTAVTRLQ----TETYNALRDAFVRGTRALSPTDGV 162
             +    ++F   E     II+DSGT++T       TE   A+ +A + G R     D  
Sbjct: 302 KKIEFGGSSFGGSEG---NIIIDSGTSLTLFPVNFFTEFATAVENAVINGERT---QDAS 355

Query: 163 ALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSI 222
            L   CY       ++VP ++ HF    V+ L   N  I + S+   C AF  T S  +I
Sbjct: 356 GLLSHCY--RPTPDLKVPVITAHFNGADVV-LQTLNTFILI-SDDVLCLAFNSTQSG-AI 410

Query: 223 IGNVQQQGTRVSFNLRNSLIGFTPNKC 249
            GNV Q    + ++++   + F P  C
Sbjct: 411 FGNVAQMNFLIGYDIQGKSVSFKPTDC 437


>gi|449522369|ref|XP_004168199.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-2-like [Cucumis sativus]
          Length = 457

 Score = 94.0 bits (232), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 78/274 (28%), Positives = 112/274 (40%), Gaps = 31/274 (11%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
           G  ++ET+      + N  +GC   +       +G+ G G GS S PSQ+    F+YCL 
Sbjct: 189 GLLLSETLDFPDKXIPNFVVGCSFLS---IHQPSGIAGFGRGSESLPSQMGLKKFAYCLA 245

Query: 61  DR---DSDSTSTLEFDSS-LPPNAVTA------PLLRNHELDTFYYLGLTGISVGGDLLP 110
            R   DS  +  L  DS+ +  + +T       P + N+    +YYL +  I VG   + 
Sbjct: 246 SRKFDDSPHSGQLILDSTGVKSSGLTYTPFRQNPSVSNNAYKEYYYLNIRKIIVGNQAVK 305

Query: 111 ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR----GTRALSPTDGVAL-- 164
           +          GNGG I+DSG+  T +       +   F +     TRA   TD   L  
Sbjct: 306 VPYKFLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVAREFEKQLANWTRA---TDVETLTG 362

Query: 165 FDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFA---------P 215
              C+D S   SV+ P + F F  G    LP  NY   V S+G  C              
Sbjct: 363 LRPCFDISKEKSVKFPELIFQFKGGAKWALPLNNYFALVSSSGVACLTVVTHQMEDGGGG 422

Query: 216 TSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
                 I+G  QQQ   V ++L N  +GF    C
Sbjct: 423 GGGPSVILGAFQQQNFYVEYDLVNQRLGFRQQTC 456


>gi|222635172|gb|EEE65304.1| hypothetical protein OsJ_20543 [Oryza sativa Japonica Group]
          Length = 274

 Score = 94.0 bits (232), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 66/252 (26%), Positives = 102/252 (40%), Gaps = 60/252 (23%)

Query: 11  GSASVDNIAIGCGHNNEGLF-VGAAGLLGLGGGSLSFPSQINASTFSYC---LVDRDSDS 66
           G  +   +  GCGH N+G+F     G+ G G G  S PSQ+N ++FSYC   + D  S S
Sbjct: 63  GGLAARRVTFGCGHINKGIFQANETGIAGFGRGRWSLPSQLNVTSFSYCFTSMFDTKSSS 122

Query: 67  TSTL---------EFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFK 117
             TL            ++   +  T  L++N    + Y++ L GISVGG  + + E+  +
Sbjct: 123 VVTLGAAAAELLHTHHAAHTGDVRTTRLIKNPSQPSLYFVPLRGISVGGARVAVPESRLR 182

Query: 118 IDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSV 177
                    I+DSG ++T L  + Y A++  FV                           
Sbjct: 183 ------SSTIIDSGASITTLPEDVYEAVKAEFVS-------------------------- 210

Query: 178 EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNL 237
                           LP  NY+    +    C      +    +IGN QQQ T V ++L
Sbjct: 211 ---------------QLPRGNYVFEDYAARVLCVVLDAAAGEQVVIGNYQQQNTHVVYDL 255

Query: 238 RNSLIGFTPNKC 249
            N ++ F P +C
Sbjct: 256 ENDVLSFAPARC 267


>gi|242092902|ref|XP_002436941.1| hypothetical protein SORBIDRAFT_10g011760 [Sorghum bicolor]
 gi|241915164|gb|EER88308.1| hypothetical protein SORBIDRAFT_10g011760 [Sorghum bicolor]
          Length = 445

 Score = 93.6 bits (231), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 85/239 (35%), Positives = 112/239 (46%), Gaps = 14/239 (5%)

Query: 13  ASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAST-FSYCLVDRDSDSTSTLE 71
           A V +   GCGH+   L     GLLGLG  S S  +Q      FSYCL   +S     L 
Sbjct: 219 AIVKDFYFGCGHSKSSLPGLFDGLLGLGRLSESLGAQYGGGGGFSYCLPAVNSKP-GFLA 277

Query: 72  FDSSLPPNA-VTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDS 130
           F +   P+  V  P+ R     TF  + L GI+VGG  L +  +AF      +GG+IVDS
Sbjct: 278 FGAGRNPSGFVFTPMGRVPGQPTFSTVTLAGITVGGKKLDLRPSAF------SGGMIVDS 331

Query: 131 GTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGK 190
           GT VT LQ+  Y ALR AF    +A     G    DTCYD +   +V VP ++  F  G 
Sbjct: 332 GTVVTVLQSTVYRALRAAFREAMKAYRLVHGD--LDTCYDLTGYKNVVVPKIALTFSGGA 389

Query: 191 VLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
            + L   N ++    NG   FA      +  ++GNV Q+   V F+   S  GF    C
Sbjct: 390 TINLDVPNGIL---VNGCLAFAETGKDGTAGVLGNVNQRTFEVLFDTSASKFGFRAKAC 445


>gi|15234607|ref|NP_194733.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|4938480|emb|CAB43839.1| putative protein [Arabidopsis thaliana]
 gi|7269904|emb|CAB80997.1| putative protein [Arabidopsis thaliana]
 gi|67633776|gb|AAY78812.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332660311|gb|AEE85711.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 427

 Score = 93.6 bits (231), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 83/255 (32%), Positives = 120/255 (47%), Gaps = 40/255 (15%)

Query: 12  SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLVDRDSDSTSTLE 71
           SA++ ++  GCGH+N G  +   G+LGLG G  S   +     FSYC    D        
Sbjct: 191 SAALHDVVFGCGHDNYGEPLVGTGILGLGYGEFSLVHRF-GKKFSYCFGSLD-------- 241

Query: 72  FDSSLPPNAV------------TAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF-KI 118
            D S P N +            T PL  +   + FYY+ +  ISV G +LPI    F + 
Sbjct: 242 -DPSYPHNVLVLGDDGANILGDTTPLEIH---NGFYYVTIEAISVDGIILPIDPRVFNRN 297

Query: 119 DESGNGGIIVDSGTAVTRLQTETY----NALRDAFV-RGTRA-LSPTDGVALFDTCYDFS 172
            ++G GG I+D+G ++T L  E Y    N + D F  R T A +S  D + +   CY+ +
Sbjct: 298 HQTGLGGTIIDTGNSLTSLVEEAYKPLKNRIEDIFEGRFTAADVSQDDMIKM--ECYNGN 355

Query: 173 -SRSSVE--VPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQ 229
             R  VE   P V+FHF EG  L L  K+  + +  N  FC A  P   +L+ IG   QQ
Sbjct: 356 FERDLVESGFPIVTFHFSEGAELSLDVKSLFMKLSPN-VFCLAVTP--GNLNSIGATAQQ 412

Query: 230 GTRVSFNLRNSLIGF 244
              + ++L    + F
Sbjct: 413 SYNIGYDLEAMEVSF 427


>gi|359482097|ref|XP_002271077.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 458

 Score = 93.6 bits (231), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 86/277 (31%), Positives = 127/277 (45%), Gaps = 31/277 (11%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHN----NEGLFVGAAGLLGLGGGSLSFPSQINASTFS 56
           G+  ++T  +G++ +     GC  +    N        GL+G+  GSLSF SQ++   FS
Sbjct: 174 GNLASDTFYIGNSDMPGTIFGCMDSSFSTNTEEDSKNTGLMGMNRGSLSFVSQMDFPKFS 233

Query: 57  YCLVDRDSDSTSTL---EFDSSLPPNAVTAPLLR-NHELDTF----YYLGLTGISVGGDL 108
           YC+ D D      L    F   +P N    PL++ +  L  F    Y + L GI V   L
Sbjct: 234 YCISDSDFSGVLLLGDANFSWLMPLNY--TPLIQISTPLPYFDRVAYTVQLEGIKVSSKL 291

Query: 109 LPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGT----RALSPTDGVAL 164
           LP+ ++ F  D +G G  +VDSGT  T L    Y+ALR+ F+  T    R L   + V  
Sbjct: 292 LPLPKSVFVPDHTGAGQTMVDSGTQFTFLLGPVYSALRNEFLNQTSQILRVLEDPNYVFQ 351

Query: 165 --FDTCYD--FSSRSSVEVPTVSFHFPEGKVLPLPAKN--YLIPVDSNGT---FCFAFAP 215
              D CY    S  S   +PTVS  F  G  + +      Y +P +  G+   +CF F  
Sbjct: 352 GGMDLCYRVPLSQTSLPWLPTVSLMF-RGAEMKVSGDRLLYRVPGEVRGSDSVYCFTFGN 410

Query: 216 T---SSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           +   +    +IG+  QQ   + F+L  S IGF   +C
Sbjct: 411 SDLLAVEAYVIGHHHQQNVWMEFDLEKSRIGFAQVQC 447


>gi|224091907|ref|XP_002309394.1| predicted protein [Populus trichocarpa]
 gi|222855370|gb|EEE92917.1| predicted protein [Populus trichocarpa]
          Length = 469

 Score = 93.6 bits (231), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 72/237 (30%), Positives = 98/237 (41%), Gaps = 23/237 (9%)

Query: 35  GLLGLGGGSLSFPSQINASTFSYCLVDR---DSDSTSTLEFDS------SLPPNAVTAPL 85
           G+ G G    S PSQ+    FSYCLV     D+ ++S L  D+      +  P     P 
Sbjct: 232 GIAGFGRSPESLPSQLGLKKFSYCLVSHAFDDTPASSDLVLDTGSGSDDTKTPGLSYTPF 291

Query: 86  LRN--HELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYN 143
            +N       +YY+ L  I +G   + +          GNGG IVDSGT  T ++   Y 
Sbjct: 292 QKNPTAAFRDYYYVLLRNIVIGDTHVKVPYKFLVPGSDGNGGTIVDSGTTFTFMEKPVYE 351

Query: 144 ALRDAFVRGTRALSPTDGVAL---FDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYL 200
            +   F +     +    V        C++ S   SV VP   FHF  G  + LP  NY 
Sbjct: 352 LVAKEFEKQVAHYTVATEVQNQTGLRPCFNISGEKSVSVPEFIFHFKGGAKMALPLANYF 411

Query: 201 IPVDSNGTFCFAFAPTSSSLS--------IIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
             VDS G  C      + S S        I+GN QQ+   V F+L+N   GF    C
Sbjct: 412 SFVDS-GVICLTIVSDNMSGSGIGGGPAIILGNYQQRNFHVEFDLKNERFGFKQQNC 467


>gi|362799904|dbj|BAL41445.1| aspartyl protease 1 [Linum grandiflorum]
          Length = 449

 Score = 93.6 bits (231), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 99/281 (35%), Positives = 139/281 (49%), Gaps = 36/281 (12%)

Query: 1   GDFVTETVTLGSASVD--NIAIGCGHNNEGLF-VGAAGLLGLGGGSLSFPSQIN---AST 54
           G   ++TVT+G+ASV   N+A GCG  N G F    +G++GLGGG+LSF SQ+       
Sbjct: 170 GYLASDTVTVGNASVQIRNVAFGCGTRNGGNFDEQGSGIVGLGGGNLSFVSQLGDTIGKK 229

Query: 55  FSYCLVD---------RDSDSTSTLEFD-----SSLPPNAV---TAPLLRNHELDTFYYL 97
           FSYCL+           DS +TS + F      SS   N V   T PL+ N E  T+YYL
Sbjct: 230 FSYCLLPLENEISSQPSDSPATSRIVFGDNPVFSSSSTNGVVFATTPLV-NKEPSTYYYL 288

Query: 98  GLTGISVGGDLLPISETAFKID--ESGN------GGIIVDSGTAVTRLQTETYNALRDAF 149
            +  I+VG   L  S ++ K    +SG+      G II+DSGT +T L+ E Y AL  A 
Sbjct: 289 TIEAITVGRKKLLYSSSSSKTASYDSGSKSSVEEGNIIIDSGTTLTFLEEEFYGALEAAL 348

Query: 150 VRGTRALSPTD-GVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGT 208
           V   +     D   ++F  C+  S +  VE+P +  HF  G  + L   N  +  +  G 
Sbjct: 349 VEEIKMERVNDVKNSMFSLCFK-SGKEEVELPLMKVHFRGGADVELKPVNTFVRAE-EGL 406

Query: 209 FCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
            CF   PT + + I GN+ Q    V ++L    + F P  C
Sbjct: 407 VCFTMLPT-NDVGIYGNLAQMNFVVGYDLGKRTVSFLPADC 446


>gi|15241713|ref|NP_195839.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|75181297|sp|Q9LZL3.1|PCS1L_ARATH RecName: Full=Aspartic proteinase PCS1; AltName: Full=Aspartic
           protease 38; Short=AtASP38; AltName: Full=Protein EMBRYO
           DEFECTIVE 24; AltName: Full=Protein PROMOTION OF CELL
           SURVIVAL 1; Flags: Precursor
 gi|7340693|emb|CAB82992.1| putative protein [Arabidopsis thaliana]
 gi|50897174|gb|AAT85726.1| At5g02190 [Arabidopsis thaliana]
 gi|53828617|gb|AAU94418.1| At5g02190 [Arabidopsis thaliana]
 gi|110742159|dbj|BAE99007.1| hypothetical protein [Arabidopsis thaliana]
 gi|332003059|gb|AED90442.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 453

 Score = 93.2 bits (230), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 92/280 (32%), Positives = 127/280 (45%), Gaps = 32/280 (11%)

Query: 1   GDFVTETVTLGSASVD-NIAIGCGHNNEG----LFVGAAGLLGLGGGSLSFPSQINASTF 55
           G+   E    G+++ D N+  GC  +  G          GLLG+  GSLSF SQ+    F
Sbjct: 164 GNLAAEIFHFGNSTNDSNLIFGCMGSVSGSDPEEDTKTTGLLGMNRGSLSFISQMGFPKF 223

Query: 56  SYCLVDRDSDSTSTLEFDSS---LPPNAVTAPLLR-NHELDTF----YYLGLTGISVGGD 107
           SYC+   D      L  DS+   L P   T PL+R +  L  F    Y + LTGI V G 
Sbjct: 224 SYCISGTDDFPGFLLLGDSNFTWLTPLNYT-PLIRISTPLPYFDRVAYTVQLTGIKVNGK 282

Query: 108 LLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRAL----SPTDGV- 162
           LLPI ++    D +G G  +VDSGT  T L    Y ALR  F+  T  +       D V 
Sbjct: 283 LLPIPKSVLVPDHTGAGQTMVDSGTQFTFLLGPVYTALRSHFLNRTNGILTVYEDPDFVF 342

Query: 163 -ALFDTCYDFSS---RSSV--EVPTVSFHFPEGKVL----PLPAKNYLIPVDSNGTFCFA 212
               D CY  S    RS +   +PTVS  F   ++     PL  +   + V ++  +CF 
Sbjct: 343 QGTMDLCYRISPVRIRSGILHRLPTVSLVFEGAEIAVSGQPLLYRVPHLTVGNDSVYCFT 402

Query: 213 FAPTS---SSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           F  +        +IG+  QQ   + F+L+ S IG  P +C
Sbjct: 403 FGNSDLMGMEAYVIGHHHQQNMWIEFDLQRSRIGLAPVEC 442


>gi|326490700|dbj|BAJ90017.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326493830|dbj|BAJ85377.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 459

 Score = 93.2 bits (230), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 82/257 (31%), Positives = 123/257 (47%), Gaps = 22/257 (8%)

Query: 1   GDFVTETVTLGSASVDNIAIGC-GHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
           G    ETV +GS  V    +GC   N+ G  VG  G  G   G+LS  SQ++ S FSY L
Sbjct: 169 GFLANETVAVGS-FVGAAILGCSAANSTGPLVGEVGSFGFNRGALSLVSQLSVSKFSYYL 227

Query: 60  VDRD---SDSTSTLEF-DSSLPPN---AVTAPLLRNHELDTFYYLGLTGISVGGDLLP-I 111
              +   SDS S +   D+++P       + PLLR+      YY+ L+ I V G  L  I
Sbjct: 228 APDEAGSSDSESVVLLGDAAVPQTRGGGRSTPLLRSTAFPDVYYVKLSAIQVDGQALSGI 287

Query: 112 SETAFKIDESG-NGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVA----LFD 166
              AF +   G +GG+++ +   +TRLQ + YNA+R A V    A    +G A    +FD
Sbjct: 288 PAGAFDLAADGSSGGVVMGTLYPITRLQEDAYNAVRQALVSKINA-QEVNGSAFAGGVFD 346

Query: 167 TCYDFSSRSSVEVPTVSFHFPEGKV---LPLPAKNYLIPVDSNGTFCFAFAPTSSSL--- 220
            CYD  S +++  P ++  F  G     L L   +Y    +  G  CF   P        
Sbjct: 347 LCYDAQSVATLTFPKITLVFDGGNAPATLELTTVHYFFKDNVTGLQCFTMLPMPVGTPFG 406

Query: 221 SIIGNVQQQGTRVSFNL 237
           S++G++ Q GT + +++
Sbjct: 407 SVLGSMVQAGTNMIYDV 423


>gi|212722026|ref|NP_001131674.1| uncharacterized protein LOC100193034 precursor [Zea mays]
 gi|194692214|gb|ACF80191.1| unknown [Zea mays]
 gi|413946454|gb|AFW79103.1| hypothetical protein ZEAMMB73_752316 [Zea mays]
          Length = 441

 Score = 93.2 bits (230), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 73/247 (29%), Positives = 117/247 (47%), Gaps = 15/247 (6%)

Query: 13  ASVDNIAIGCGHNNEGL-FVGAAGLLGLGGGSLSFPSQINAS---TFSYCLVDR--DSDS 66
           A + ++ +GC   ++G  F    G+L LG   +SF S+  A    +FSYCLVD     ++
Sbjct: 198 AQLQDVVLGCSSTHDGQSFKSVDGVLSLGNAKISFASRAAARFGGSFSYCLVDHLAPRNA 257

Query: 67  TSTLEF-DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGG 125
           T  L F    +P    T   L       FY + +  + V G  L I     ++ +  +GG
Sbjct: 258 TGYLAFGPGQVPRTPATQTKLFLDPAMPFYGVKVDAVHVAGQALDIPA---EVWDPKSGG 314

Query: 126 IIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRS--SVEVPTVS 183
           +I+DSGT +T L T  Y A+  A  +    +   D    F+ CY++++    + E+P ++
Sbjct: 315 VILDSGTTLTVLATPAYKAVVAALTKLLAGVPKVD-FPPFEHCYNWTAPRPGAPEIPKLA 373

Query: 184 FHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP-TSSSLSIIGNVQQQGTRVSFNLRNSLI 242
             F     L  PAK+Y+I V   G  C          +S+IGN+ QQ     F+L+N  +
Sbjct: 374 VQFTGCARLEPPAKSYVIDVKP-GVKCIGLQEGEWPGVSVIGNIMQQEHLWEFDLKNMEV 432

Query: 243 GFTPNKC 249
            F P+ C
Sbjct: 433 RFMPSTC 439


>gi|413947545|gb|AFW80194.1| hypothetical protein ZEAMMB73_386053 [Zea mays]
          Length = 456

 Score = 93.2 bits (230), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 77/250 (30%), Positives = 115/250 (46%), Gaps = 27/250 (10%)

Query: 15  VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAST-----FSYCLVD--RDSDST 67
           V  ++ GC   + G F  + GL+GLG G+LS  SQ+ A+      FSYCLV     ++S+
Sbjct: 212 VPRVSFGCSTGSAGSF-RSDGLVGLGAGALSLVSQLGAAARIARRFSYCLVPPYAAANSS 270

Query: 68  STLEFDSSL---PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNG 124
           STL F +      P A + PL+ + E+D++Y + L  ++V G           +  + + 
Sbjct: 271 STLSFGARAVVSDPGAASTPLVPS-EVDSYYTVALESVAVAGQ---------DVASANSS 320

Query: 125 GIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVE---VPT 181
            IIVDSGT +T L       L     R  R         L   CYD   +S  E   +P 
Sbjct: 321 RIIVDSGTTLTFLDPALLRPLVAELERRIRLPRAQPPEQLLQLCYDVQGKSQAEDFGIPD 380

Query: 182 VSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSS--LSIIGNVQQQGTRVSFNLRN 239
           V+  F  G  + L  +N    ++  GT C    P S S  +SI+GN+ QQ   V ++L  
Sbjct: 381 VTLRFGGGASVTLRPENTFSLLE-EGTLCLVLVPVSESQPVSILGNIAQQNFHVGYDLDA 439

Query: 240 SLIGFTPNKC 249
             + F    C
Sbjct: 440 RTVTFAAVDC 449


>gi|242056193|ref|XP_002457242.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
 gi|241929217|gb|EES02362.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
          Length = 457

 Score = 93.2 bits (230), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 76/253 (30%), Positives = 115/253 (45%), Gaps = 27/253 (10%)

Query: 11  GSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAST-----FSYCLV-DRDS 64
           G   V  +  GC   + G F  + GL+GLG G+ S  SQ+ A+T      SYCL+   D+
Sbjct: 211 GQVRVPRVNFGCSTASAGTFR-SDGLVGLGAGAFSLVSQLGATTHIDRKLSYCLIPSYDA 269

Query: 65  DSTSTLEFDSSL---PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDES 121
           +S+STL F S      P A + PL+ + ++D++Y + L  ++VGG  +   ++       
Sbjct: 270 NSSSTLNFGSRAVVSEPGAASTPLVPS-DVDSYYTVALESVAVGGQEVATHDSR------ 322

Query: 122 GNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVE--- 178
               IIVDSGT +T L       L     R  +         L   CYD   +S  +   
Sbjct: 323 ----IIVDSGTTLTFLDPALLGPLVTELERRIKLQRVQPPEQLLQLCYDVQGKSETDNFG 378

Query: 179 VPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSS--LSIIGNVQQQGTRVSFN 236
           +P V+  F  G  + L  +N    +   GT C    P S S  +SI+GN+ QQ   V ++
Sbjct: 379 IPDVTLRFGGGAAVTLRPEN-TFSLLQEGTLCLVLVPVSESQPVSILGNIAQQNFHVGYD 437

Query: 237 LRNSLIGFTPNKC 249
           L    + F    C
Sbjct: 438 LDARTVTFAAADC 450


>gi|356570895|ref|XP_003553619.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 470

 Score = 93.2 bits (230), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 73/240 (30%), Positives = 103/240 (42%), Gaps = 24/240 (10%)

Query: 34  AGLLGLGGGSLSFPSQINASTFSYCLVDRDSDSTS-----TLEFDSS--LPPNAVTAPLL 86
           +G+ G G G  S PSQ+N   FSYCLV    D T       L+  S+     N ++    
Sbjct: 227 SGIAGFGRGQESLPSQMNLKRFSYCLVSHRFDDTPQSSDLVLQISSTGDTKTNGLSYTPF 286

Query: 87  R-----NHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTET 141
           R     N     +YY+ L  + VGG  + I     +    GNGG IVDSG+  T ++   
Sbjct: 287 RSNPSNNSVFREYYYVTLRKLIVGGVDVKIPYKFLEPGSDGNGGTIVDSGSTFTFMERPV 346

Query: 142 YNALRDAFVRGT-RALSPTDGVAL---FDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAK 197
           YN +   F+R   +  S  + V        C++ S   ++  P  +F F  G  +  P  
Sbjct: 347 YNLVAQEFLRQLGKKYSREENVEAQSGLSPCFNISGVKTISFPEFTFQFKGGAKMSQPLL 406

Query: 198 NYLIPVDSNGTFCF-------AFAPTSSSLSII-GNVQQQGTRVSFNLRNSLIGFTPNKC 249
           NY   V      CF       A  P ++  +II GN QQQ   V ++L N   GF P  C
Sbjct: 407 NYFSFVGDAEVLCFTVVSDGGAGQPKTAGPAIILGNYQQQNFYVEYDLENERFGFGPRNC 466


>gi|242091325|ref|XP_002441495.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
 gi|241946780|gb|EES19925.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
          Length = 466

 Score = 92.8 bits (229), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 74/249 (29%), Positives = 118/249 (47%), Gaps = 18/249 (7%)

Query: 13  ASVDNIAIGCGHNNEGL-FVGAAGLLGLGGGSLSFPSQINAS---TFSYCLVDR--DSDS 66
           A + ++ +GC  +++G  F  A G+L LG   +SF +Q  A    +FSYCLVD     ++
Sbjct: 222 AQLKDVVLGCSSSHDGQSFRSADGVLSLGNAKISFATQAAARFGGSFSYCLVDHLAPRNA 281

Query: 67  TSTLEFDSSLPPN--AVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNG 124
           T  L F     P   A    L  + E+  FY + +  I V G  L I    +   ++ +G
Sbjct: 282 TGYLAFGPGQVPRTPATQTKLFLDPEM-PFYGVKVDAIHVAGKALDIPAEVW---DAKSG 337

Query: 125 GIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSR---SSVEVPT 181
           G+I+DSG  +T L    Y A+  A  +    + P      F+ CY++++R   +   +P 
Sbjct: 338 GVILDSGNTLTVLAAPAYKAVVAALSKHLDGV-PKVSFPPFEHCYNWTARRPGAPEIIPK 396

Query: 182 VSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP-TSSSLSIIGNVQQQGTRVSFNLRNS 240
           ++  F     L  PAK+Y+I V   G  C          LS+IGN+ QQ     F+L+N 
Sbjct: 397 LAVQFAGSARLEPPAKSYVIDVKP-GVKCIGVQEGEWPGLSVIGNIMQQEHLWEFDLKNM 455

Query: 241 LIGFTPNKC 249
            + F  + C
Sbjct: 456 QVRFKQSNC 464


>gi|413944596|gb|AFW77245.1| hypothetical protein ZEAMMB73_545774 [Zea mays]
 gi|414876929|tpg|DAA54060.1| TPA: hypothetical protein ZEAMMB73_875469 [Zea mays]
          Length = 459

 Score = 92.8 bits (229), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 89/258 (34%), Positives = 131/258 (50%), Gaps = 28/258 (10%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
           G    ET TLG+ +V ++  GC   +EG +   +GL+GLG G LS  SQ+NASTF YCL 
Sbjct: 189 GFLARETFTLGADAVPSVRFGCTTASEGGYGSGSGLVGLGRGPLSLVSQLNASTFMYCLT 248

Query: 61  DRDSDSTSTLEFDSSLPPNAVTAPLLRNHEL---DTFYYLGLTGISVGGDLLP-ISETAF 116
             D+   S L F S     ++T   +++  L    TFY + L  IS+G    P + E   
Sbjct: 249 S-DASKASPLLFGSL---ASLTGAQVQSTGLLASTTFYAVNLRSISIGSATTPGVGEPE- 303

Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTR--ALSPTDGVALFDTCYDFSSR 174
                   G++ DSGT +T L    Y+  + AF+  T    +  TDG   F+ C+   + 
Sbjct: 304 --------GVVFDSGTTLTYLAEPAYSEAKAAFLSQTSLDQVEDTDG---FEACFQKPAN 352

Query: 175 ---SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGT 231
              S+  VPT+  HF +G  + LP  NY++ V+ +G  C+     S SLSIIGN+ Q   
Sbjct: 353 GRLSNAAVPTMVLHF-DGADMALPVANYVVEVE-DGVVCW-IVQRSPSLSIIGNIMQVNY 409

Query: 232 RVSFNLRNSLIGFTPNKC 249
            V  ++  S++ F P  C
Sbjct: 410 LVLHDVHRSVLSFQPANC 427


>gi|147821993|emb|CAN70318.1| hypothetical protein VITISV_016757 [Vitis vinifera]
          Length = 429

 Score = 92.8 bits (229), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 81/279 (29%), Positives = 128/279 (45%), Gaps = 35/279 (12%)

Query: 1   GDFVTETVTLGSASVDNIAIGCG----HNNEGLFVGAAGLLGLGGGSLSFPSQINASTFS 56
           G+  ++T  +G++++     GC      +N        GL+G+  GSLSF +Q+    FS
Sbjct: 145 GNLASDTFHIGNSAIPATIFGCMDSGFSSNSDEDSKTTGLIGMNRGSLSFVTQMGLQKFS 204

Query: 57  YCLVDRDSD--------STSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDL 108
           YC+  +DS         S S L+     P   ++ PL     +   Y + L GI V   +
Sbjct: 205 YCISGQDSSGILLFGESSFSWLKALKYTPLVQISTPLPYFDRVA--YTVQLEGIKVANSM 262

Query: 109 LPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRA----LSPTDGV-- 162
           L + ++ +  D +G G  +VDSGT  T L    Y AL++ FVR T+A    L   + V  
Sbjct: 263 LQLPKSVYAPDHTGAGQTMVDSGTQFTFLLGPVYTALKNEFVRQTKASLKVLEDPNFVFQ 322

Query: 163 ALFDTCYD--FSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVD-----SNGTFCFAFAP 215
              D CY    + R+   +PTV+  F  G  + + A+  +  V      S+  +CF F  
Sbjct: 323 GAMDLCYRVPLTRRTLPPLPTVTLMF-RGAEMSVSAERLMYRVPGVIRGSDSVYCFTFG- 380

Query: 216 TSSSL-----SIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
            +S L      IIG+  QQ   + F+L  S +GF   +C
Sbjct: 381 -NSELLGVESYIIGHHHQQNVWMEFDLAKSRVGFAEVRC 418


>gi|326501496|dbj|BAK02537.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 476

 Score = 92.8 bits (229), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 80/273 (29%), Positives = 121/273 (44%), Gaps = 43/273 (15%)

Query: 12  SASVDNIAIGCGHNNEG-LFVGAAGLLGLGGGSLSFPSQIN---ASTFSYCLVDRDS--D 65
            A +  + +GC  +  G  F  + G+L LG   +SF S      A  FSYCLVD  S  +
Sbjct: 210 KAKLKGLVLGCTSSYTGPSFEVSDGVLSLGYSDVSFASHAASRFAGRFSYCLVDHLSPRN 269

Query: 66  STSTLEFDSSLPPNAVTA---------------------------PLLRNHELDTFYYLG 98
           +TS L F     PN   A                           PLL +  +  FY + 
Sbjct: 270 ATSYLTFG----PNPAVASSSSPSSPAPASCTAAAPRPRPRARQTPLLLDRRMRPFYDVA 325

Query: 99  LTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSP 158
           +  +SV G  L I    + +D    GG+I+DSGT++T L    Y A+  A   G   L P
Sbjct: 326 VKAVSVAGQFLKIPRAVWDVD--AGGGVILDSGTSLTVLAKPAYRAVVAALSEGLAGL-P 382

Query: 159 TDGVALFDTCYDFSSRS-SVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP-T 216
              +  F+ CY+++S S  V +P ++ HF     L  P K+Y+I   + G  C       
Sbjct: 383 RVTMDPFEYCYNWTSPSGDVTLPKMAVHFAGAARLEPPGKSYVIDA-APGVKCIGLQEGP 441

Query: 217 SSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
              +S+IGN+ QQ     F+++N  + F  ++C
Sbjct: 442 WPGISVIGNILQQEHLWEFDIKNRRLKFQRSRC 474


>gi|225449446|ref|XP_002283126.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 436

 Score = 92.4 bits (228), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 81/279 (29%), Positives = 128/279 (45%), Gaps = 35/279 (12%)

Query: 1   GDFVTETVTLGSASVDNIAIGCG----HNNEGLFVGAAGLLGLGGGSLSFPSQINASTFS 56
           G+  ++T  +G++++     GC      +N        GL+G+  GSLSF +Q+    FS
Sbjct: 152 GNLASDTFHIGNSAIPATIFGCMDSGFSSNSDEDSKTTGLIGMNRGSLSFVTQMGLQKFS 211

Query: 57  YCLVDRDSD--------STSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDL 108
           YC+  +DS         S S L+     P   ++ PL     +   Y + L GI V   +
Sbjct: 212 YCISGQDSSGILLFGESSFSWLKALKYTPLVQISTPLPYFDRVA--YTVQLEGIKVANSM 269

Query: 109 LPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRA----LSPTDGV-- 162
           L + ++ +  D +G G  +VDSGT  T L    Y AL++ FVR T+A    L   + V  
Sbjct: 270 LQLPKSVYAPDHTGAGQTMVDSGTQFTFLLGPVYTALKNEFVRQTKASLKVLEDPNFVFQ 329

Query: 163 ALFDTCYD--FSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVD-----SNGTFCFAFAP 215
              D CY    + R+   +PTV+  F  G  + + A+  +  V      S+  +CF F  
Sbjct: 330 GAMDLCYRVPLTRRTLPPLPTVTLMF-RGAEMSVSAERLMYRVPGVIRGSDSVYCFTFG- 387

Query: 216 TSSSL-----SIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
            +S L      IIG+  QQ   + F+L  S +GF   +C
Sbjct: 388 -NSELLGVESYIIGHHHQQNVWMEFDLAKSRVGFAEVRC 425


>gi|242059211|ref|XP_002458751.1| hypothetical protein SORBIDRAFT_03g039590 [Sorghum bicolor]
 gi|241930726|gb|EES03871.1| hypothetical protein SORBIDRAFT_03g039590 [Sorghum bicolor]
          Length = 444

 Score = 92.4 bits (228), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 83/275 (30%), Positives = 121/275 (44%), Gaps = 28/275 (10%)

Query: 1   GDFVTETVTLGSASVDNIAIGC---GHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSY 57
           G   T+   +G A     A GC    +++    V  AGLLG+  G+LSF +Q +   FSY
Sbjct: 161 GALATDVFAVGEAPPLRSAFGCMSTAYDSSPDGVATAGLLGMNRGTLSFVTQASTRRFSY 220

Query: 58  CLVDRDSDSTSTLEFDSSLP--P---NAVTAPLLRNHELDTFYY-LGLTGISVGGDLLPI 111
           C+ DRD D+   L   S LP  P     +  P L     D   Y + L GI VGG  LPI
Sbjct: 221 CISDRD-DAGVLLLGHSDLPFLPLNYTPLYQPTLPLPYFDRVAYSVQLLGIRVGGKALPI 279

Query: 112 SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRAL-----SPTDGVA-LF 165
             +    D +G G  +VDSGT  T L  + Y+AL+  F++ T+ L      P+       
Sbjct: 280 PASVLAPDHTGAGQTMVDSGTQFTFLLGDAYSALKAEFLKQTKPLLRALDDPSFAFQEAL 339

Query: 166 DTCYDFSS---RSSVEVPTVSFHFPEGKVLPLPAKNYLIPVD-----SNGTFCFAFAPTS 217
           DTC+   +     S  +P V+  F  G  + +     L  V      ++G +C  F    
Sbjct: 340 DTCFRVPAGRPPPSARLPPVTLLF-NGAEMSVAGDRLLYKVPGEHRGADGVWCLTFGNAD 398

Query: 218 S---SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
               +  +IG+  Q    V ++L    +G  P KC
Sbjct: 399 MVPLTAYVIGHHHQMNLWVEYDLERGRVGLAPVKC 433


>gi|255553149|ref|XP_002517617.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223543249|gb|EEF44781.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 449

 Score = 92.4 bits (228), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 84/248 (33%), Positives = 120/248 (48%), Gaps = 19/248 (7%)

Query: 13  ASVDNIAIGCGHNNEGLF-VGAAGLLGLGGGSLSFPSQIN---ASTFSYCLV--DRDSDS 66
           A    +A GCG  N G F    +G++GLGGGS+S  SQ+    +  FSYCLV     S+ 
Sbjct: 207 AYFQEVAFGCGTKNGGTFDELGSGIIGLGGGSMSLVSQLGPKLSGKFSYCLVPTSEQSNY 266

Query: 67  TSTLEFDSSLP-----PNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDES 121
           TS + F + +       N V+ PLL     +T+YYL L  ISV    LP   T     E 
Sbjct: 267 TSKINFGNDINISGSNYNVVSTPLLPKKP-ETYYYLTLEAISVENKRLPY--TNLWNGEV 323

Query: 122 GNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPT 181
             G II+DSGT +T L +E +N L  A     +    +D   LF+ C  F    ++E+P 
Sbjct: 324 EKGNIIIDSGTTLTFLDSEFFNNLDSAVEEAVKGERVSDPHGLFNIC--FKDEKAIELPI 381

Query: 182 VSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNSL 241
           ++ HF  G  + L   N    V+ +   CF   P S+ ++I GN+ Q    V ++L    
Sbjct: 382 ITAHF-TGADVELQPVNTFAKVEED-LLCFTMIP-SNDIAIFGNLAQMNFLVGYDLEKKA 438

Query: 242 IGFTPNKC 249
           + F P  C
Sbjct: 439 VSFLPTDC 446


>gi|361068719|gb|AEW08671.1| Pinus taeda anonymous locus CL1136Contig1_03 genomic sequence
 gi|376338612|gb|AFB33836.1| hypothetical protein CL1136Contig1_03, partial [Pinus mugo]
 gi|376338614|gb|AFB33837.1| hypothetical protein CL1136Contig1_03, partial [Pinus mugo]
 gi|376338616|gb|AFB33838.1| hypothetical protein CL1136Contig1_03, partial [Pinus mugo]
 gi|383135631|gb|AFG48834.1| Pinus taeda anonymous locus CL1136Contig1_03 genomic sequence
          Length = 70

 Score = 92.4 bits (228), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 44/69 (63%), Positives = 51/69 (73%)

Query: 161 GVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSL 220
           G +LFDTCYD S   +V+VPTV FHF     + LPA NYLIPVDS+ TFCFAFA  +  L
Sbjct: 2   GFSLFDTCYDLSGLKTVKVPTVVFHFQGRADVSLPATNYLIPVDSSATFCFAFAGNTGGL 61

Query: 221 SIIGNVQQQ 229
           SIIGN+QQQ
Sbjct: 62  SIIGNIQQQ 70


>gi|297740344|emb|CBI30526.3| unnamed protein product [Vitis vinifera]
          Length = 379

 Score = 92.0 bits (227), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 80/239 (33%), Positives = 113/239 (47%), Gaps = 27/239 (11%)

Query: 35  GLLGLGGGSLSFPSQINASTFSYCLVDRDSDSTSTL---EFDSSLPPNAVTAPLLR-NHE 90
           GL+G+  GSLSF SQ++   FSYC+ D D      L    F   +P N    PL++ +  
Sbjct: 133 GLMGMNRGSLSFVSQMDFPKFSYCISDSDFSGVLLLGDANFSWLMPLNY--TPLIQISTP 190

Query: 91  LDTF----YYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALR 146
           L  F    Y + L GI V   LLP+ ++ F  D +G G  +VDSGT  T L    Y+ALR
Sbjct: 191 LPYFDRVAYTVQLEGIKVSSKLLPLPKSVFVPDHTGAGQTMVDSGTQFTFLLGPVYSALR 250

Query: 147 DAFVRGT----RALSPTDGVAL--FDTCYD--FSSRSSVEVPTVSFHFPEGKVLPLPAKN 198
           + F+  T    R L   + V     D CY    S  S   +PTVS  F  G  + +    
Sbjct: 251 NEFLNQTSQILRVLEDPNYVFQGGMDLCYRVPLSQTSLPWLPTVSLMF-RGAEMKVSGDR 309

Query: 199 --YLIPVDSNGT---FCFAFAPT---SSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
             Y +P +  G+   +CF F  +   +    +IG+  QQ   + F+L  S IGF   +C
Sbjct: 310 LLYRVPGEVRGSDSVYCFTFGNSDLLAVEAYVIGHHHQQNVWMEFDLEKSRIGFAQVQC 368


>gi|225440729|ref|XP_002275391.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
 gi|147789748|emb|CAN67404.1| hypothetical protein VITISV_025615 [Vitis vinifera]
          Length = 450

 Score = 92.0 bits (227), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 72/273 (26%), Positives = 110/273 (40%), Gaps = 31/273 (11%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
           G F+ E +     ++ N  +GC   +    + +  L G G    S P Q+    F+YCL 
Sbjct: 185 GYFLLENLKFPRKTIRNFLLGCT-TSAARELSSDALAGFGRSMFSLPIQMGVKKFAYCLN 243

Query: 61  DRDSDSTST-----LEFDSSLPPNAVTAPLLRNHELDTFYY-LGLTGISVGGDLLPISET 114
             D D T       L++           P L++     FYY LG+  I +G  LL I   
Sbjct: 244 SHDYDDTRNSGKLILDYRDGKTKGLSYTPFLKSPPASAFYYHLGVKDIKIGNKLLRIPSK 303

Query: 115 AFKIDESGNGGIIVDSGTA--------VTRLQTETYNALRDAFVRGTRALSPTDGVALFD 166
                  G  G+I+DSG          V ++ T   N L+    +  R+L       L  
Sbjct: 304 YLAPGSDGRSGVIIDSGYGGAGYMTGPVFKIVT---NELKKQMSKYRRSLEAETQTGL-T 359

Query: 167 TCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYL----------IPVDSNGTFCFAFAPT 216
            CY+F+   S+++P + + F  G  + +P KNY             +D+NGT      P 
Sbjct: 360 PCYNFTGHKSIKIPPLIYQFRGGANMVVPGKNYFGISPQESLACFLMDTNGTNALEITPD 419

Query: 217 SSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
            S   I+GN Q     V ++L+N   GF    C
Sbjct: 420 PS--IILGNSQHVDYYVEYDLKNDRFGFRRQTC 450


>gi|125547728|gb|EAY93550.1| hypothetical protein OsI_15341 [Oryza sativa Indica Group]
          Length = 418

 Score = 92.0 bits (227), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 80/259 (30%), Positives = 112/259 (43%), Gaps = 45/259 (17%)

Query: 3   FVTETVTLGSASVDNIAIGCGHNNEGLFVG-AAGLLGLGGGSLSFPSQINASTFSYCLVD 61
           F + T    SA+V  +  GCGH N G+F     G+ G G GSLS PSQ+    FS+C   
Sbjct: 190 FASGTGEGSSAAVPGLVFGCGHANRGVFTSNETGIAGFGRGSLSLPSQLKVGNFSHCFTT 249

Query: 62  RDSDSTST--LEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKID 119
                TS   L      PP+A  +PL R                        S       
Sbjct: 250 ITGSKTSAVLLGLPGVAPPSA--SPLGRRRG---------------------SYRCRSTP 286

Query: 120 ESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTR-ALSPTDGVALFDTCYDFSSR-SSV 177
            S N      SGT++T L   TY A+R+ F    +  + P +    F TC+    R    
Sbjct: 287 RSSN------SGTSITSLPPRTYRAVREEFAAQVKLPVVPGNATDPF-TCFSAPLRGPKP 339

Query: 178 EVPTVSFHFPEGKVLPLPAKNYLIPV-------DSNGTFCFAFAPTSSSLSIIGNVQQQG 230
           +VPT++ HF EG  + LP +NY+  V       +S+   C A         I+GN+QQQ 
Sbjct: 340 DVPTMALHF-EGATMRLPQENYVFEVVDDDDAGNSSRIICLAV--IEGGEIILGNIQQQN 396

Query: 231 TRVSFNLRNSLIGFTPNKC 249
             V ++L+NS + F P +C
Sbjct: 397 MHVLYDLQNSKLSFVPAQC 415


>gi|225465839|ref|XP_002264668.1| PREDICTED: aspartic proteinase nepenthesin-1-like isoform 2 [Vitis
           vinifera]
          Length = 451

 Score = 92.0 bits (227), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 93/264 (35%), Positives = 129/264 (48%), Gaps = 26/264 (9%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ---INASTFSY 57
            +   +T+TL + +V   + GC     G  + A GLLGLG G LS  SQ   +  STFSY
Sbjct: 198 ANLSQDTITLATDAVPGYSFGCIQKATGGSLPAQGLLGLGRGPLSLLSQTQNLYQSTFSY 257

Query: 58  CLVDRDSDSTSTLEFDSSL-------PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP 110
           CL      S  +L F  SL       P      PLL+N    + Y++ L  + VG  ++ 
Sbjct: 258 CL-----PSFKSLNFSGSLRLGPVGQPKRIKYTPLLKNPRRPSLYFVNLMAVRVGRRVVD 312

Query: 111 ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFV-RGTRALSPTDGVALFDTCY 169
           +   +F  + S   G I DSGT  TRL T  Y A+RDAF  R  R L+ T  +  FDTCY
Sbjct: 313 VPPGSFTFNPSTGAGTIFDSGTVFTRLVTPAYIAVRDAFRNRVGRNLTVTS-LGGFDTCY 371

Query: 170 DFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP----TSSSLSIIGN 225
                  +  PT++F F  G  + LP  N LI   +  T C A A      +S L++I N
Sbjct: 372 TV----PIAAPTITFMF-TGMNVTLPPDNLLIHSTAGSTTCLAMAAAPDNVNSVLNVIAN 426

Query: 226 VQQQGTRVSFNLRNSLIGFTPNKC 249
           +QQQ  R+ +++ NS +G     C
Sbjct: 427 LQQQNHRLLYDVPNSRLGVARELC 450


>gi|302784853|ref|XP_002974198.1| hypothetical protein SELMODRAFT_54030 [Selaginella moellendorffii]
 gi|300157796|gb|EFJ24420.1| hypothetical protein SELMODRAFT_54030 [Selaginella moellendorffii]
          Length = 359

 Score = 92.0 bits (227), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 81/248 (32%), Positives = 115/248 (46%), Gaps = 27/248 (10%)

Query: 16  DNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSYCLVDRDS--DSTSTL 70
           D    GC    +G +    GL+GLG  S S   Q+       FSYCLV  DS   + S L
Sbjct: 120 DGFLFGCARKLKGDWNFTQGLIGLGQKSHSLIQQLGDKLGYKFSYCLVSYDSPPSAKSFL 179

Query: 71  EFDSSLP---PNAVTAPLLRNHELD-TFYYLGLTGISVGGDLLPISETAFKIDESGNG-- 124
              SS      + V+ P+L    LD T YY+ L  I++GG  +P+        ESG+   
Sbjct: 180 FLGSSAALRGHDVVSTPILHGDHLDQTLYYVDLQSITIGG--VPV---VVYDKESGHNTS 234

Query: 125 -------GIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDG-VALFDTCYDFSSRSS 176
                    ++DSGT  T L    Y A+R +     + + PT G  A  D C++ S  +S
Sbjct: 235 VGPFLANKTVIDSGTTYTLLTPPVYEAMRKSIEE--QVILPTLGNSAGLDLCFNSSGDTS 292

Query: 177 VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFN 236
              P+V+F+F     L LP +N +  V S    C +   +   LSIIGN+QQQ   + ++
Sbjct: 293 YGFPSVTFYFANQVQLVLPFEN-IFQVTSRDVVCLSMDSSGGDLSIIGNMQQQNFHILYD 351

Query: 237 LRNSLIGF 244
           L  S I F
Sbjct: 352 LVASQISF 359


>gi|226531872|ref|NP_001147022.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
 gi|195606574|gb|ACG25117.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
          Length = 491

 Score = 92.0 bits (227), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 81/279 (29%), Positives = 123/279 (44%), Gaps = 34/279 (12%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
           G  + +T+     +V    +GC  +   +    +GL G G G+ S P+Q+    FSYCL+
Sbjct: 212 GLLIADTLRAPGRAVPGFVLGC--SLVSVHQPPSGLAGFGRGAPSVPAQLGLPKFSYCLL 269

Query: 61  DRDSDSTSTLEFDSSLPPNAVTA-----PLLRNHELD-----TFYYLGLTGISVGGDLLP 110
            R  D  + +     L            PL+++   D      +YYL L G++VGG  + 
Sbjct: 270 SRRFDDNAAVSGSLVLGGTGGGEGMQYVPLVKSAAGDKLPYGVYYYLALRGVTVGGKAVR 329

Query: 111 ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFV-----RGTRALSPTDGVALF 165
           +   AF  + +G+GG IVDSGT  T L    +  + DA V     R  R+    DG+ L 
Sbjct: 330 LPARAFAGNAAGSGGTIVDSGTTFTYLDPTVFQPVADAVVAAVGGRYKRSKDAEDGLGL- 388

Query: 166 DTCYDFSSRS-SVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNG---TFCFAF-------- 213
             C+     + S+ +P +SFHF  G V+ LP +NY + V   G     C A         
Sbjct: 389 HPCFALPQGARSMALPELSFHFEGGAVMQLPVENYFV-VAGRGAVEAICLAVVTDFGGGS 447

Query: 214 ---APTSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
                 S    I+G+ QQQ   V ++L    +GF    C
Sbjct: 448 GAGNEGSGPAIILGSFQQQNYLVEYDLEKERLGFRRQSC 486


>gi|302794412|ref|XP_002978970.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
 gi|300153288|gb|EFJ19927.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
          Length = 462

 Score = 92.0 bits (227), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 81/272 (29%), Positives = 135/272 (49%), Gaps = 36/272 (13%)

Query: 3   FVTETVTLGS-ASVDNIAIGCGHNN-EGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
            + ETV  G   +V + A GC   + E +  GA+G+LGL  G ++ P Q+       FS+
Sbjct: 199 LIMETVVGGKPVTVQDFAFGCAQGDLELVPTGASGILGLNAGKMALPMQLGQRFGWKFSH 258

Query: 58  CLVDRDS--DSTSTLEF-DSSLPPNAV--TAPLLRNHELD-TFYYLGLTGISVGGD---L 108
           C  DR S  +ST  + F ++ LP   V  T+  L N EL   FY++ L G+S+      L
Sbjct: 259 CFPDRSSHLNSTGVVFFGNAELPHEQVQYTSVALTNSELQRKFYHVALKGVSINSHELVL 318

Query: 109 LPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTR-ALSPTDGVALFD- 166
           LP               +I+DSG++ +      ++ LR+AF++    +L   +G +  D 
Sbjct: 319 LPRGSV-----------VILDSGSSFSSFVRPFHSQLREAFLKHRPPSLKHLEGDSFGDL 367

Query: 167 -TCYDFSSRSSVEV----PTVSFHFPEGKVLPLPAKNYLIPV---DSNGTFCFAFAPTSS 218
            TC+  S+    E+    P++S  F +G  + +P+   L+PV    ++   CFAF     
Sbjct: 368 GTCFKVSNDDIDELHRTLPSLSLVFEDGVTIGIPSIGVLLPVARYQNHVKMCFAFEDGGP 427

Query: 219 S-LSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           + +++IGN QQQ   V ++++ S +GF    C
Sbjct: 428 NPVNVIGNYQQQNLWVEYDIQRSRVGFARASC 459


>gi|147771308|emb|CAN69536.1| hypothetical protein VITISV_043237 [Vitis vinifera]
          Length = 372

 Score = 91.7 bits (226), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 93/259 (35%), Positives = 128/259 (49%), Gaps = 26/259 (10%)

Query: 6   ETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ---INASTFSYCLVDR 62
           +T+TL + +V   + GC     G  + A GLLGLG G LS  SQ   +  STFSYCL   
Sbjct: 124 DTITLATDAVPGYSFGCIQKATGGSLPAQGLLGLGRGPLSLLSQTQNLYQSTFSYCL--- 180

Query: 63  DSDSTSTLEFDSSL-------PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETA 115
              S  +L F  SL       P      PLL+N    + Y++ L  + VG  ++ +   +
Sbjct: 181 --PSFKSLNFSGSLRLGPVGQPKRIKYTPLLKNPRRPSLYFVNLMAVRVGRRVVDVPPGS 238

Query: 116 FKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFV-RGTRALSPTDGVALFDTCYDFSSR 174
           F  + S   G I DSGT  TRL T  Y A+RDAF  R  R L+ T  +  FDTCY     
Sbjct: 239 FTFNPSTGAGTIFDSGTVFTRLVTPAYIAVRDAFRNRVGRNLTVTS-LGGFDTCYTV--- 294

Query: 175 SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP----TSSSLSIIGNVQQQG 230
             +  PT++F F  G  + LP  N LI   +  T C A A      +S L++I N+QQQ 
Sbjct: 295 -PIAAPTITFMF-TGMNVTLPPDNLLIHSTAGSTTCLAMAAAPDNVNSVLNVIANLQQQN 352

Query: 231 TRVSFNLRNSLIGFTPNKC 249
            R+ +++ NS +G     C
Sbjct: 353 HRLLYDVPNSRLGVARELC 371


>gi|125575541|gb|EAZ16825.1| hypothetical protein OsJ_32297 [Oryza sativa Japonica Group]
          Length = 416

 Score = 91.7 bits (226), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 78/270 (28%), Positives = 121/270 (44%), Gaps = 33/270 (12%)

Query: 1   GDFVTETVTLGSASVDNIAIGC----GHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFS 56
           G   T+T  +G+A+  ++  GC    G +  G   G +GL+GLG    S  SQ+N + FS
Sbjct: 140 GIVATDTFAIGTATA-SLGFGCVVASGIDTMG---GPSGLIGLGRAPSSLVSQMNITKFS 195

Query: 57  YCLVDRDSDSTSTLEFDSSLP----PNAVTAPLLRNH---ELDTFYYLGLTGISVGGDLL 109
           YCL   DS   S L   SS       N+ T P ++     ++  +Y + L GI  G    
Sbjct: 196 YCLTPHDSGKNSRLLLGSSAKLAGGGNSTTTPFVKTSPGDDMSQYYPIQLDGIKAG---- 251

Query: 110 PISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCY 169
              + A  +  SGN  ++V +   ++ L    Y AL+    +   A      +  FD C+
Sbjct: 252 ---DAAIALPPSGN-TVLVQTLAPMSFLVDSAYQALKKEVTKAVGAAPTATPLQPFDLCF 307

Query: 170 DFSSRSSVEVPTVSFHFPEG-KVLPLPAKNYLIPV-DSNGTFCFAFAPTS--------SS 219
             +  S+   P + F F +G   L +P   YLI V +  GT C A   TS         +
Sbjct: 308 PKAGLSNASAPDLVFTFQQGAAALTVPPPKYLIDVGEEKGTVCMAILSTSWLNTTALDEN 367

Query: 220 LSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           L+I+G++QQ+ T    +L    + F P  C
Sbjct: 368 LNILGSLQQENTHFLLDLEKKTLSFEPADC 397


>gi|225465837|ref|XP_002264626.1| PREDICTED: aspartic proteinase nepenthesin-1-like isoform 1 [Vitis
           vinifera]
          Length = 437

 Score = 91.7 bits (226), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 93/264 (35%), Positives = 129/264 (48%), Gaps = 26/264 (9%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ---INASTFSY 57
            +   +T+TL + +V   + GC     G  + A GLLGLG G LS  SQ   +  STFSY
Sbjct: 184 ANLSQDTITLATDAVPGYSFGCIQKATGGSLPAQGLLGLGRGPLSLLSQTQNLYQSTFSY 243

Query: 58  CLVDRDSDSTSTLEFDSSL-------PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP 110
           CL      S  +L F  SL       P      PLL+N    + Y++ L  + VG  ++ 
Sbjct: 244 CL-----PSFKSLNFSGSLRLGPVGQPKRIKYTPLLKNPRRPSLYFVNLMAVRVGRRVVD 298

Query: 111 ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFV-RGTRALSPTDGVALFDTCY 169
           +   +F  + S   G I DSGT  TRL T  Y A+RDAF  R  R L+ T  +  FDTCY
Sbjct: 299 VPPGSFTFNPSTGAGTIFDSGTVFTRLVTPAYIAVRDAFRNRVGRNLTVTS-LGGFDTCY 357

Query: 170 DFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP----TSSSLSIIGN 225
                  +  PT++F F  G  + LP  N LI   +  T C A A      +S L++I N
Sbjct: 358 TV----PIAAPTITFMF-TGMNVTLPPDNLLIHSTAGSTTCLAMAAAPDNVNSVLNVIAN 412

Query: 226 VQQQGTRVSFNLRNSLIGFTPNKC 249
           +QQQ  R+ +++ NS +G     C
Sbjct: 413 LQQQNHRLLYDVPNSRLGVARELC 436


>gi|357164972|ref|XP_003580227.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 492

 Score = 91.7 bits (226), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 77/273 (28%), Positives = 110/273 (40%), Gaps = 38/273 (13%)

Query: 12  SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAST----FSYCLVDRDSDST 67
           S +V+N    C H   G  VG AG    G G LS P+Q+  +     FSYCLV     + 
Sbjct: 213 SVAVENFTFACAHTALGEPVGVAGF---GRGPLSLPAQLAPAALSGRFSYCLVAHSFRAD 269

Query: 68  STLE-----------FDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
             +             D +     V  PLL N +   FY + L  +SVGG  +P      
Sbjct: 270 RPIRPSPLILGRSPGEDPASETGIVYTPLLHNPKHPYFYSVALEAVSVGGTRIPARPELG 329

Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDT-----CYDF 171
           ++  +G+GG++VDSGT  T L  ETY  + + F R   A       A  D      CY +
Sbjct: 330 RVGRAGDGGMVVDSGTTFTMLPNETYARVAEEFGRAMAAARFERAEAAEDQTGLAPCYYY 389

Query: 172 SSRSSV-------EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTF---CFAFAPTSS--- 218
              +S         VP ++ HF     + LP +NY +   S       C           
Sbjct: 390 DHDASAAEEGSARAVPPLAMHFRGEATVVLPRRNYFMGFRSEERRRVGCLMLMNGGEDDG 449

Query: 219 --SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
                 +GN QQQG  V +++    +GF   +C
Sbjct: 450 GGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRC 482


>gi|21717175|gb|AAM76368.1|AC074196_26 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433292|gb|AAP54830.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
          Length = 418

 Score = 91.7 bits (226), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 78/270 (28%), Positives = 121/270 (44%), Gaps = 33/270 (12%)

Query: 1   GDFVTETVTLGSASVDNIAIGC----GHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFS 56
           G   T+T  +G+A+  ++  GC    G +  G   G +GL+GLG    S  SQ+N + FS
Sbjct: 156 GIVATDTFAIGTATA-SLGFGCVVASGIDTMG---GPSGLIGLGRAPSSLVSQMNITKFS 211

Query: 57  YCLVDRDSDSTSTLEFDSSLP----PNAVTAPLLRNH---ELDTFYYLGLTGISVGGDLL 109
           YCL   DS   S L   SS       N+ T P ++     ++  +Y + L GI  G    
Sbjct: 212 YCLTPHDSGKNSRLLLGSSAKLAGGGNSTTTPFVKTSPGDDMSQYYPIQLDGIKAG---- 267

Query: 110 PISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCY 169
              + A  +  SGN  ++V +   ++ L    Y AL+    +   A      +  FD C+
Sbjct: 268 ---DAAIALPPSGN-TVLVQTLAPMSFLVDSAYQALKKEVTKAVGAAPTATPLQPFDLCF 323

Query: 170 DFSSRSSVEVPTVSFHFPEG-KVLPLPAKNYLIPV-DSNGTFCFAFAPTS--------SS 219
             +  S+   P + F F +G   L +P   YLI V +  GT C A   TS         +
Sbjct: 324 PKAGLSNASAPDLVFTFQQGAAALTVPPPKYLIDVGEEKGTVCMAILSTSWLNTTALDEN 383

Query: 220 LSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           L+I+G++QQ+ T    +L    + F P  C
Sbjct: 384 LNILGSLQQENTHFLLDLEKKTLSFEPADC 413


>gi|357491933|ref|XP_003616254.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355517589|gb|AES99212.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 442

 Score = 91.7 bits (226), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 80/268 (29%), Positives = 120/268 (44%), Gaps = 27/268 (10%)

Query: 1   GDFVTETVTLG-SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
           G+ V E +    S +   I +GC   ++     A G+LG+  G L FPSQ   + FSYC+
Sbjct: 178 GNLVREKIAFSPSQTTPPIILGCATQSDD----ARGILGMNLGRLGFPSQAKITKFSYCV 233

Query: 60  VDRDSDSTSTLEFDSSLPP-------NAVT-APLLRNHELDTFYY-LGLTGISVGGDLLP 110
             + +   S   +  + P        N +T     R   LD   Y L L GIS+GG  L 
Sbjct: 234 PTKQAQPASGSFYLGNNPASSSFRYVNLLTFGQSQRMPNLDPLAYTLPLQGISIGGKKLN 293

Query: 111 ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR--GTRALSPTDGVALFDTC 168
           I  + FK +  G+G  ++DSG+  T L  E YN +R+  V+  G +         + D C
Sbjct: 294 IPPSVFKPNAGGSGQTMIDSGSEFTYLVDEAYNVIREELVKKVGPKIKKGYMYGGVADIC 353

Query: 169 YDFSSRSSVE----VPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS---SSLS 221
           +D     ++E    V  + F F +G  + +P +  L  VD  G  C     +    +  +
Sbjct: 354 FD---GDAIEIGRLVGDMVFEFEKGVQIVIPKERVLATVDG-GVHCLGMGRSERLGAGGN 409

Query: 222 IIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           IIGN  QQ   V F+L N  +GF    C
Sbjct: 410 IIGNFHQQNLWVEFDLANRRVGFGEADC 437


>gi|359476195|ref|XP_002268758.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
 gi|296082174|emb|CBI21179.3| unnamed protein product [Vitis vinifera]
          Length = 460

 Score = 91.7 bits (226), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 84/257 (32%), Positives = 116/257 (45%), Gaps = 19/257 (7%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGG-SLSFPSQINAS---TFS 56
           G FV + VTL          GCG +  G F  A+G+LGL  G   S  SQ  +     FS
Sbjct: 204 GVFVCDEVTLKPDVFPKFQFGCGDSGGGEFGTASGVLGLAKGEQYSLISQTASKFKKKFS 263

Query: 57  YCLVDRDSDSTSTL--EFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISET 114
           YC   ++    S L  E   S  P+     LL N      Y++ L GISV    L +S +
Sbjct: 264 YCFPPKEHTLGSLLFGEKAISASPSLKFTQLL-NPPSGLGYFVELIGISVAKKRLNVSSS 322

Query: 115 AFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR---GTRALSPTDGVALFDTCYDF 171
            F      + G I+DSGT +TRL T  Y ALR AF +      ++SP     L DTCY+ 
Sbjct: 323 LF-----ASPGTIIDSGTVITRLPTAAYEALRTAFQQEMLHCPSISPPPQEKLLDTCYNL 377

Query: 172 S--SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS--SSLSIIGNVQ 227
                 ++++P +  HF     + L     L         C AFA  S  S ++IIGN Q
Sbjct: 378 KGCGGRNIKLPEIVLHFVGEVDVSLHPSGILWANGDLTQACLAFARKSNPSHVTIIGNRQ 437

Query: 228 QQGTRVSFNLRNSLIGF 244
           Q   +V +++    +GF
Sbjct: 438 QVSLKVVYDIEGGRLGF 454


>gi|296087864|emb|CBI35120.3| unnamed protein product [Vitis vinifera]
          Length = 320

 Score = 91.3 bits (225), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 93/259 (35%), Positives = 128/259 (49%), Gaps = 26/259 (10%)

Query: 6   ETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQ---INASTFSYCLVDR 62
           +T+TL + +V   + GC     G  + A GLLGLG G LS  SQ   +  STFSYCL   
Sbjct: 72  DTITLATDAVPGYSFGCIQKATGGSLPAQGLLGLGRGPLSLLSQTQNLYQSTFSYCL--- 128

Query: 63  DSDSTSTLEFDSSL-------PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETA 115
              S  +L F  SL       P      PLL+N    + Y++ L  + VG  ++ +   +
Sbjct: 129 --PSFKSLNFSGSLRLGPVGQPKRIKYTPLLKNPRRPSLYFVNLMAVRVGRRVVDVPPGS 186

Query: 116 FKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFV-RGTRALSPTDGVALFDTCYDFSSR 174
           F  + S   G I DSGT  TRL T  Y A+RDAF  R  R L+ T  +  FDTCY     
Sbjct: 187 FTFNPSTGAGTIFDSGTVFTRLVTPAYIAVRDAFRNRVGRNLTVTS-LGGFDTCYTV--- 242

Query: 175 SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP----TSSSLSIIGNVQQQG 230
             +  PT++F F  G  + LP  N LI   +  T C A A      +S L++I N+QQQ 
Sbjct: 243 -PIAAPTITFMF-TGMNVTLPPDNLLIHSTAGSTTCLAMAAAPDNVNSVLNVIANLQQQN 300

Query: 231 TRVSFNLRNSLIGFTPNKC 249
            R+ +++ NS +G     C
Sbjct: 301 HRLLYDVPNSRLGVARELC 319


>gi|356563324|ref|XP_003549914.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 480

 Score = 91.3 bits (225), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 80/289 (27%), Positives = 117/289 (40%), Gaps = 53/289 (18%)

Query: 6   ETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN------ASTFSYCL 59
           +T++L S  + N   GC H          G+ G G G LS P+Q+        + FSYCL
Sbjct: 188 DTLSLSSLFLRNFTFGCAHTT---LAEPTGVAGFGRGLLSLPAQLATLSPQLGNRFSYCL 244

Query: 60  VDRDSDSTSTLE--------FDSSLPPNA-------VTAPLLRNHELDTFYYLGLTGISV 104
           V    DS    +        ++              V   +L N +   FY + L GI+V
Sbjct: 245 VSHSFDSERVRKPSPLILGRYEEKEKEKIGGGVAEFVYTSMLENPKHPYFYTVSLIGIAV 304

Query: 105 GGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGT-------RALS 157
           G   +P  E   +++  G+GG++VDSGT  T L    YN++ D F R         R + 
Sbjct: 305 GKRTIPAPEMLRRVNNRGDGGVVVDSGTTFTMLPAGFYNSVVDEFDRRVGRDNKRARKIE 364

Query: 158 PTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGK--VLPLPAKNYLIPVDSN--------- 206
              G+A    CY  +  S  +VP ++  F  GK   + LP KNY                
Sbjct: 365 EKTGLA---PCYYLN--SVADVPALTLRFAGGKNSSVVLPRKNYFYEFSDGSDGAKGKRK 419

Query: 207 -GTFCFAFAPTSSSLS-----IIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
            G          + LS      +GN QQQG  V ++L    +GF   +C
Sbjct: 420 VGCLMLMNGGDEADLSGGPGATLGNYQQQGFEVEYDLEEKRVGFARRQC 468


>gi|302802500|ref|XP_002983004.1| hypothetical protein SELMODRAFT_13348 [Selaginella moellendorffii]
 gi|300149157|gb|EFJ15813.1| hypothetical protein SELMODRAFT_13348 [Selaginella moellendorffii]
          Length = 332

 Score = 91.3 bits (225), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 84/270 (31%), Positives = 124/270 (45%), Gaps = 29/270 (10%)

Query: 1   GDFVTETVTLGSASVDNI------AIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN--- 51
           GD   +T+ +  A+ D +        GCG   +GL  G  G+L L  GSLSFPSQI    
Sbjct: 71  GDLSVDTLKMAGAASDELEEFPGFVFGCGSLLKGLISGEVGILALSPGSLSFPSQIGEKY 130

Query: 52  ASTFSYCLVD---RDSDSTSTLEFDSSL----PPNAVTAPLLRNH---ELDTFYYLGLTG 101
            + FSYCL+    ++S   S + F  +      P +     L+     E   +Y + L G
Sbjct: 131 GNKFSYCLLRQTAQNSLKKSPMVFGEAAVELKEPGSGKLQELQYTPIGESSIYYTVRLDG 190

Query: 102 ISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDG 161
           ISVG   L +S +AF   +  +   I DSGT +T L     ++++ +       +S  + 
Sbjct: 191 ISVGNQRLDLSPSAFLNGQ--DKPTIFDSGTTLTMLPPGVCDSIKQSLA---SMVSGAEF 245

Query: 162 VAL--FDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSS 219
           VA+   D C+     S   +P ++FHF  G        NY+I  D     C  F PT + 
Sbjct: 246 VAIKGLDACFRVPPSSGQGLPDITFHFNGGADFVTRPSNYVI--DLGSLQCLIFVPT-NE 302

Query: 220 LSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           +SI GN+QQQ   V  ++ N  IGF    C
Sbjct: 303 VSIFGNLQQQDFFVLHDMDNRRIGFKETDC 332


>gi|300681439|emb|CBH32531.1| hypothetical protein TAA_ctg0091b.00060.1 [Triticum aestivum]
          Length = 426

 Score = 91.3 bits (225), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 77/257 (29%), Positives = 121/257 (47%), Gaps = 18/257 (7%)

Query: 6   ETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLVDRDS- 64
           E   +G+        GC   +     G +G+LG   G  S  SQ+  S FSY ++  D+ 
Sbjct: 158 EVTAVGTHITGRALFGCSLASTVPLDGESGVLGFSRGPYSLLSQLKISRFSYFMLPDDAD 217

Query: 65  --DSTSTLEF-DSSLPP--NAVTAPLLRNHELDTFYYLGLTGISVGGDLLP-ISETAFKI 118
             DS S L   D ++P   ++ + PLLRN      YY+ LTGI V    L  I    F +
Sbjct: 218 KPDSESVLLLGDDAVPQTNSSRSTPLLRNEAYPDLYYVKLTGIKVDDKSLSGIPAGTFDL 277

Query: 119 DESG-NGGIIVDSGTAVTRLQTETYNALRDAF---VRGTRALSPTDGVALFDTCYDFSSR 174
             +G +GG+++ + + +T LQ   YNAL  A    ++        D VA    CY+  S 
Sbjct: 278 AANGCSGGVVMSTLSPITYLQPAAYNALTRALASKIKSQPVRPKADDVADLRLCYNIQSV 337

Query: 175 SSVEVPTVS--FHFPEGKVLP--LPAKNYLIPVDSNGTFCFAFAPT---SSSLSIIGNVQ 227
           +++  P ++  FH  +G+  P  L   +Y I  +S G  C    PT   S   S++G++ 
Sbjct: 338 ANLTFPKITLVFHGVDGRPAPMELTTAHYFIRENSTGLQCLTMLPTPAGSPVSSVLGSLL 397

Query: 228 QQGTRVSFNLRNSLIGF 244
           Q GT + ++LR   + F
Sbjct: 398 QTGTHMIYDLRGGSLTF 414


>gi|242092892|ref|XP_002436936.1| hypothetical protein SORBIDRAFT_10g011730 [Sorghum bicolor]
 gi|241915159|gb|EER88303.1| hypothetical protein SORBIDRAFT_10g011730 [Sorghum bicolor]
          Length = 469

 Score = 91.3 bits (225), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 95/272 (34%), Positives = 126/272 (46%), Gaps = 45/272 (16%)

Query: 1   GDFVTETVTL---GSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---T 54
           G + TET+TL    +  V+N + GCG   +G+F    GLLGLGG   S  SQ   +    
Sbjct: 220 GVYSTETLTLSPEAATVVNNFSFGCGLVQKGVFDLFDGLLGLGGAPESLVSQTTGTYGGA 279

Query: 55  FSYCLVDRDS--------------DSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLT 100
           FSYCL   +S              ++T+  +F           PL       TFY + LT
Sbjct: 280 FSYCLPAGNSTAGFLALGAPATGGNNTAGFQF----------TPLQVVET--TFYLVKLT 327

Query: 101 GISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRA---LS 157
           GISVGG  L I  T F       GG+I+DSGT VT L    Y+ALR AF     A   L 
Sbjct: 328 GISVGGKQLDIEPTVFA------GGMIIDSGTIVTGLPETAYSALRTAFRSAMSAYPLLP 381

Query: 158 PTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS 217
           P D   L DTCYDF+  ++V VPTV+  F  G  + L   + ++    +G   F    + 
Sbjct: 382 PNDDEDL-DTCYDFTGNTNVTVPTVALTFEGGVTIDLDVPSGVL---LDGCLAFVAGASD 437

Query: 218 SSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
               IIGNV Q+   V ++     +GF    C
Sbjct: 438 GDTGIIGNVNQRTFEVLYDSARGHVGFRAGAC 469


>gi|326523515|dbj|BAJ92928.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 459

 Score = 91.3 bits (225), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 81/257 (31%), Positives = 123/257 (47%), Gaps = 22/257 (8%)

Query: 1   GDFVTETVTLGSASVDNIAIGC-GHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
           G    ETV +GS  V    +GC   N+ G  VG  G  G   G+LS  SQ++ S FSY L
Sbjct: 169 GFLANETVAVGS-FVGAAILGCSAANSTGPLVGEVGSFGFNRGALSLVSQLSVSKFSYYL 227

Query: 60  VDRD---SDSTSTLEF-DSSLPPN---AVTAPLLRNHELDTFYYLGLTGISVGGDLLP-I 111
              +   SDS S +   D+++P       + PLLR+      +Y+ L+ I V G  L  I
Sbjct: 228 APDEAGSSDSESVVLLGDAAVPQTRGGGRSTPLLRSTAFPDVHYVKLSAIQVDGQALSGI 287

Query: 112 SETAFKIDESG-NGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVA----LFD 166
              AF +   G +GG+++ +   +TRLQ + YNA+R A V    A    +G A    +FD
Sbjct: 288 PAGAFDLAADGSSGGVVMGTLYPITRLQEDAYNAVRQALVSKINA-QEVNGSAFAGGVFD 346

Query: 167 TCYDFSSRSSVEVPTVSFHFPEGKV---LPLPAKNYLIPVDSNGTFCFAFAPTSSSL--- 220
            CYD  S +++  P ++  F  G     L L   +Y    +  G  CF   P        
Sbjct: 347 LCYDAQSVATLTFPKITLVFDGGNAPATLELTTVHYFFKDNVTGLQCFTMLPMPVGTPFG 406

Query: 221 SIIGNVQQQGTRVSFNL 237
           S++G++ Q GT + +++
Sbjct: 407 SVLGSMVQAGTNMIYDV 423


>gi|242041431|ref|XP_002468110.1| hypothetical protein SORBIDRAFT_01g039750 [Sorghum bicolor]
 gi|241921964|gb|EER95108.1| hypothetical protein SORBIDRAFT_01g039750 [Sorghum bicolor]
          Length = 467

 Score = 90.9 bits (224), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 78/250 (31%), Positives = 110/250 (44%), Gaps = 34/250 (13%)

Query: 33  AAGLLGLGGGSLSFPSQINASTFSYCLVDRDSDSTSTLEFDS-----SLPPNAVTAPLLR 87
           A GLLG+  GSLSF +Q     F+YC+   D      L  D      S  P     PL+ 
Sbjct: 208 ATGLLGMNRGSLSFVTQTGTLRFAYCIAPGDGPGLLVLGGDGDGAALSAAPQLNYTPLIE 267

Query: 88  -NHELDTF----YYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETY 142
            +  L  F    Y + L GI VG  LLPI ++    D +G G  +VDSGT  T L  + Y
Sbjct: 268 MSQPLPYFDRVAYSVQLEGIRVGAALLPIPKSVLAPDHTGAGQTMVDSGTQFTFLLADAY 327

Query: 143 NALRDAFVRGTRALSPTDG------VALFDTCYDFSSRSSVEVPTVSFHFPE------GK 190
             L+  F+  T AL    G         FD C+  +S + V   T S   PE      G 
Sbjct: 328 APLKGEFLNQTSALLAPLGEPDFVFQGAFDACFR-ASEARVAAATASQLLPEVGLVLRGA 386

Query: 191 VLPLPAKN--YLIPVDSNG------TFCFAFAPTS---SSLSIIGNVQQQGTRVSFNLRN 239
            + +  +   Y++P +  G       +C  F  +     S  +IG+  QQ   V ++L+N
Sbjct: 387 EVAVGGEKLLYMVPGERRGEGGSEAVWCLTFGNSDMAGMSAYVIGHHHQQNVWVEYDLQN 446

Query: 240 SLIGFTPNKC 249
           S +GF P +C
Sbjct: 447 SRVGFAPARC 456


>gi|15234606|ref|NP_194732.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|4938479|emb|CAB43838.1| putative protein [Arabidopsis thaliana]
 gi|7269903|emb|CAB80996.1| putative protein [Arabidopsis thaliana]
 gi|67633774|gb|AAY78811.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332660310|gb|AEE85710.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 424

 Score = 90.9 bits (224), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 77/250 (30%), Positives = 109/250 (43%), Gaps = 11/250 (4%)

Query: 6   ETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL--VDRD 63
           ET   G  S  NI  GCG +N G F   +G+LGLG G+ S  ++   S FSYC   +   
Sbjct: 175 ETSDDGLISKQNIVFGCGQDNSG-FTKYSGVLGLGPGTFSIVTRNFGSKFSYCFGSLTNP 233

Query: 64  SDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGN 123
           +   + L   +         PL         YYL L  IS G  LL I    F+   S  
Sbjct: 234 TYPHNILILGNGAKIEGDPTPL---QIFQDRYYLDLQAISFGEKLLDIEPGTFQRYRS-Q 289

Query: 124 GGIIVDSGTAVTRLQTETYNALRDA--FVRGTRALSPTDGVALFDTCYDFSSRSSVE-VP 180
           GG ++D+G + T L  E Y  L +   F+ G       D       CY+ + +  +   P
Sbjct: 290 GGTVIDTGCSPTILAREAYETLSEEIDFLLGEVLRRVKDWDQYTTPCYEGNLKLDLYGFP 349

Query: 181 TVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP-TSSSLSIIGNVQQQGTRVSFNLRN 239
            V+FHF  G  L L  ++  +  +S  +FC A    T   +S+IG + QQ   V +NLR 
Sbjct: 350 VVTFHFAGGAELALDVESLFVSSESGDSFCLAMTMNTFDDMSVIGAMAQQNYNVGYNLRT 409

Query: 240 SLIGFTPNKC 249
             + F    C
Sbjct: 410 MKVYFQRTDC 419


>gi|194702702|gb|ACF85435.1| unknown [Zea mays]
 gi|414885969|tpg|DAA61983.1| TPA: hypothetical protein ZEAMMB73_231717 [Zea mays]
          Length = 163

 Score = 90.9 bits (224), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 58/163 (35%), Positives = 83/163 (50%), Gaps = 14/163 (8%)

Query: 93  TFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRD----A 148
           +FYYL LTGI+V G  + +  + F    +   G I+DSGTA + L    Y ALR     A
Sbjct: 8   SFYYLNLTGITVAGRAIKVPPSVF----ATAAGTIIDSGTAFSCLPPSAYAALRSSVRSA 63

Query: 149 FVRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGT 208
             R  RA S T    +FDTCYD +   +V +P+V+  F +G  + L     L    +   
Sbjct: 64  MGRYKRAPSST----IFDTCYDLTGHETVRIPSVALVFADGATVHLHPSGVLYTWSNVSQ 119

Query: 209 FCFAFAPT--SSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
            C AF P    +SL ++GN QQ+   V +++ N  +GF  N C
Sbjct: 120 TCLAFLPNPDDTSLGVLGNTQQRTLAVIYDVDNQKVGFGANGC 162


>gi|297806153|ref|XP_002870960.1| hypothetical protein ARALYDRAFT_908082 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297316797|gb|EFH47219.1| hypothetical protein ARALYDRAFT_908082 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 453

 Score = 90.9 bits (224), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 90/280 (32%), Positives = 127/280 (45%), Gaps = 32/280 (11%)

Query: 1   GDFVTETVTLGSASVD-NIAIGCGHNNEG----LFVGAAGLLGLGGGSLSFPSQINASTF 55
           G+   E    G+++ D N+  GC  +  G          GLLG+  GSLSF SQ+    F
Sbjct: 164 GNLAAEIFHFGNSTNDSNLIFGCMGSVSGSDPEEDTKTTGLLGMNRGSLSFISQMGFPKF 223

Query: 56  SYCLVDRDSDSTSTLEFDSS---LPPNAVTAPLLR-NHELDTF----YYLGLTGISVGGD 107
           SYC+   D      L  DS+   L P   T PL+R +  L  F    Y + LTGI V G 
Sbjct: 224 SYCISGTDDFPGFLLLGDSNFTWLTPLNYT-PLIRISTPLPYFDRVAYTVQLTGIKVNGK 282

Query: 108 LLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPT--DGVALF 165
           LLPI ++    D +G G  +VDSGT  T L    Y ALR  F+  T  +     D   +F
Sbjct: 283 LLPIPKSVLLPDHTGAGQTMVDSGTQFTFLLGPVYTALRSDFLNQTNGILTVYEDPEFVF 342

Query: 166 ----DTCYD---FSSRSSV--EVPTVSFHFPEGKVL----PLPAKNYLIPVDSNGTFCFA 212
               D CY    F  R+ +   +PTVS  F   ++     PL  +   +   ++  +CF 
Sbjct: 343 QGTMDLCYRISPFRIRTGILHRLPTVSLVFEGAEIAVSGQPLLYRVPHLTAGNDSVYCFT 402

Query: 213 FAPTS---SSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           F  +        +IG+  QQ   + F+L+ S IG  P +C
Sbjct: 403 FGNSDLMGMEAYVIGHHHQQNMWIEFDLQRSRIGLAPVQC 442


>gi|302824729|ref|XP_002994005.1| hypothetical protein SELMODRAFT_431957 [Selaginella moellendorffii]
 gi|300138167|gb|EFJ04945.1| hypothetical protein SELMODRAFT_431957 [Selaginella moellendorffii]
          Length = 462

 Score = 90.5 bits (223), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 80/272 (29%), Positives = 134/272 (49%), Gaps = 36/272 (13%)

Query: 3   FVTETVTLGS-ASVDNIAIGCGHNN-EGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
            + ETV  G   +V + A GC   + E +  GA+G+LGL  G ++ P Q+       FS+
Sbjct: 199 LIMETVVGGKPVTVQDFAFGCAQGDLELVPTGASGILGLNAGKMALPMQLGQRFGWKFSH 258

Query: 58  CLVDRDS--DSTSTLEF-DSSLPPNAV--TAPLLRNHELD-TFYYLGLTGISVGGD---L 108
           C  DR S  +ST  + F ++ LP   V  T+  L N EL   FY++ L G+S+       
Sbjct: 259 CFPDRSSHLNSTGVVFFGNAELPHEQVQYTSVALTNSELQRKFYHVALKGVSINSHELVF 318

Query: 109 LPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTR-ALSPTDGVALFD- 166
           LP               +I+DSG++ +      ++ LR+AF++    +L   +G +  D 
Sbjct: 319 LPRGSV-----------VILDSGSSFSSFVRPFHSQLREAFLKHRPPSLKHLEGDSFGDL 367

Query: 167 -TCYDFSSRSSVEV----PTVSFHFPEGKVLPLPAKNYLIPV---DSNGTFCFAFAPTSS 218
            TC+  S+    E+    P++S  F +G  + +P+   L+PV    ++   CFAF     
Sbjct: 368 GTCFKVSNDDIDELHRTLPSLSLVFEDGVTIGIPSIGVLLPVARFQNHVKMCFAFEDGGP 427

Query: 219 S-LSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           + +++IGN QQQ   V ++++ S +GF    C
Sbjct: 428 NPVNVIGNYQQQNLWVEYDIQRSRVGFARASC 459


>gi|343198386|gb|AEM05966.1| nodulin 41 [Phaseolus vulgaris]
          Length = 437

 Score = 90.5 bits (223), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 73/243 (30%), Positives = 111/243 (45%), Gaps = 21/243 (8%)

Query: 17  NIAIGCG-HNNEGLF--VGAAGLLGLGGGSLSFPSQIN---ASTFSYCLVDRDSDSTSTL 70
           N   GCG +NN  +F      G++GLG G LS  SQI       FSYCL+   S STS L
Sbjct: 203 NSFFGCGLYNNITVFPSYKLTGIMGLGAGPLSLVSQIGDQIGHKFSYCLLPLGSTSTSKL 262

Query: 71  EFDSS---LPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGII 127
           +F +         V+ P++    L T+Y+L L  ++V    +P   T        +G +I
Sbjct: 263 KFGNESIITGEGVVSTPMIIKPWLPTYYFLNLEAVTVAQKTVPTGST--------DGNVI 314

Query: 128 VDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFP 187
           +DSGT +T L    Y     +           D ++    C+ +  R +   P ++F F 
Sbjct: 315 IDSGTLLTYLGESFYYNFAASLQESLAVELVQDVLSPLPFCFPY--RDNFVFPEIAFQFT 372

Query: 188 EGKVLPLPAKNYLIPVDSNGTFCFAFAPTS-SSLSIIGNVQQQGTRVSFNLRNSLIGFTP 246
             +V   PA  +++  D N T C   AP+S S +SI G+  Q   +V ++L    + F P
Sbjct: 373 GARVSLKPANLFVMTEDRN-TVCLMIAPSSVSGISIFGSFSQIDFQVEYDLEGKKVSFQP 431

Query: 247 NKC 249
             C
Sbjct: 432 TDC 434


>gi|225440722|ref|XP_002275223.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
 gi|147841923|emb|CAN65212.1| hypothetical protein VITISV_039022 [Vitis vinifera]
          Length = 458

 Score = 90.5 bits (223), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 69/270 (25%), Positives = 109/270 (40%), Gaps = 25/270 (9%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
           G F+ E +     ++    +GC  + +     +  L G G    S P Q+    F+YCL 
Sbjct: 193 GFFLLENLDFPGKTIHKFLVGCTTSADRE-PSSDALAGFGRTMFSLPMQMGVKKFAYCLN 251

Query: 61  DRDSDSTST-----LEFDSSLPPNAVTAPLLRNH-ELDTFYYLGLTGISVGGDLLPISET 114
             D D T       L++          AP L+N  +   +YYLG+  + +G  LL I   
Sbjct: 252 SHDYDDTRNSGKLILDYSDGETQGLSYAPFLKNPPDYPFYYYLGVKDMKIGNKLLRIPGK 311

Query: 115 AFKIDESGNGGIIVDSGTAVTRLQTETY----NALRDAFVRGTRALSPTDGVALFDTCYD 170
                    GG+++DSG A   +    +    N L+    +  R+L       L   CY+
Sbjct: 312 YLTPGSDSRGGVMIDSGFAYGYMTLPVFKIVTNELKKQMSKYRRSLEAETQSGL-TPCYN 370

Query: 171 FSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCF-----------AFAPTSSS 219
           F+   S+++P + + F  G  + +P  NY +        CF            F P  S 
Sbjct: 371 FTGHKSIKIPDLIYQFTGGANMVVPGMNYFLLFSEASLGCFPVTTDSPTNNLEFTPGPS- 429

Query: 220 LSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
             I+GN QQ    V F+L+N  +GF    C
Sbjct: 430 -IILGNYQQVDHYVEFDLKNERLGFRQQTC 458


>gi|413925432|gb|AFW65364.1| hypothetical protein ZEAMMB73_378208 [Zea mays]
          Length = 418

 Score = 90.5 bits (223), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 79/249 (31%), Positives = 116/249 (46%), Gaps = 13/249 (5%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
           G   +ET TLGS +V  I  GC   +EG +   +GL+GLG G LS   Q+    FSYCL 
Sbjct: 180 GYMGSETFTLGSDAVQGIGFGCTTMSEGGYGSGSGLVGLGRGKLSLVRQLKVGAFSYCLT 239

Query: 61  DRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDE 120
              S S+  L    +L    V +  L N +  TFY + L  IS+G         A K   
Sbjct: 240 SDPSTSSPLLFGAGALTGPGVQSTPLVNLKTSTFYTVNLDSISIG---------AAKTPG 290

Query: 121 SGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVP 180
           +G  GII DSGT +T L    Y       +  T  L+   G   ++ C  F +      P
Sbjct: 291 TGRHGIIFDSGTTLTFLAEPAYTLAEAGLLSQTTNLTRVPGTDGYEVC--FQTSGGAVFP 348

Query: 181 TVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNS 240
           ++  HF +G  + L  +NY   V+ +   C+    + S +SI+GN+ Q    + ++L  S
Sbjct: 349 SMVLHF-DGGDMALKTENYFGAVN-DSVSCWLVQKSPSEMSIVGNIMQMDYHIRYDLDKS 406

Query: 241 LIGFTPNKC 249
           ++ F P  C
Sbjct: 407 VLSFQPTNC 415


>gi|357132618|ref|XP_003567926.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 468

 Score = 90.5 bits (223), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 72/252 (28%), Positives = 119/252 (47%), Gaps = 19/252 (7%)

Query: 12  SASVDNIAIGCGHNNEGL-FVGAAGLLGLGGGSLSFPSQINAS---TFSYCLVDR--DSD 65
            A +  + +GC  + +G  F  + G+L LG  ++SF S+  +     FSYCLVD     +
Sbjct: 220 KAKLQEVVLGCTTSYDGQSFKSSDGVLSLGNSNISFASRAASRFGGRFSYCLVDHLAPRN 279

Query: 66  STSTLEFDSSLPPNAVTAP-------LLRNHELDTFYYLGLTGISVGGDLLPISETAFKI 118
           +TS L F +        +        LL +     FY++ +  ++V G+ L I    +  
Sbjct: 280 ATSFLTFGNGDSSPGDDSSSRRTPLVLLEDARTRPFYFVSVDAVTVAGERLEILPDVW-- 337

Query: 119 DESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVE 178
           D   NGG I+DSGT++T L T  Y+A+  A  +    + P   +  F+ CY+++   S E
Sbjct: 338 DFRKNGGAILDSGTSLTILATPAYDAVVKAISKQFAGV-PRVNMDPFEYCYNWTG-VSAE 395

Query: 179 VPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP-TSSSLSIIGNVQQQGTRVSFNL 237
           +P +   F     L  P K+Y+I   + G  C          +S+IGN+ QQ     F+L
Sbjct: 396 IPRMELRFAGAATLAPPGKSYVIDT-APGVKCIGVVEGAWPGVSVIGNILQQEHLWEFDL 454

Query: 238 RNSLIGFTPNKC 249
            N  + F  ++C
Sbjct: 455 ANRWLRFKQSRC 466


>gi|357127507|ref|XP_003565421.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
           distachyon]
          Length = 438

 Score = 90.5 bits (223), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 81/263 (30%), Positives = 110/263 (41%), Gaps = 30/263 (11%)

Query: 3   FVTETVTLGSASVD------NIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAST-- 54
           F   + T G  +VD       +  GC    EGL V   GL+GL  G +S  SQ++A T  
Sbjct: 152 FADGSCTAGPVTVDAFTFSTRLDFGCATRTEGLSVPDDGLVGLANGPISLVSQLSAKTPF 211

Query: 55  ---FSYCLV--DRDSDSTSTLEFDS----SLPPNAVTAPLLRNHELDTFYYLGLTGISVG 105
              FSYCLV        +S+L F S    S  P A T PL+      +FY + L  I V 
Sbjct: 212 AHKFSYCLVPYSSSETVSSSLNFGSHAIVSSSPGAATTPLVAGRN-KSFYTIALDSIKVA 270

Query: 106 GDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALF 165
           G  +P+  T  K+        IVDSGT +T L     + L  A     +         L+
Sbjct: 271 GKPVPLQTTTTKL--------IVDSGTMLTYLPKAVLDPLVAALTAAIKLPRVKSPETLY 322

Query: 166 DTCYDFSSRSSVEV----PTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLS 221
             CYD   R+  +V    P V+     G  + LP  N  +  +   T C A   +     
Sbjct: 323 AVCYDVRRRAPEDVGKSIPDVTLVLGGGGEVRLPWGNTFVVENKGTTVCLALVESHLPEF 382

Query: 222 IIGNVQQQGTRVSFNLRNSLIGF 244
           I+GNV QQ   V F+L    + F
Sbjct: 383 ILGNVAQQNLHVGFDLERRTVSF 405


>gi|26451756|dbj|BAC42973.1| unknown protein [Arabidopsis thaliana]
          Length = 442

 Score = 90.1 bits (222), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 89/280 (31%), Positives = 127/280 (45%), Gaps = 34/280 (12%)

Query: 1   GDFVTETVTLGSASVDNIAIGCG----HNNEGLFVGAAGLLGLGGGSLSFPSQINASTFS 56
           G+   ET  +GS +      GC      +N      + GL+G+  GSLSF +Q+  S FS
Sbjct: 155 GNLAHETFVIGSVTRPGTLFGCMDSGLSSNSEEDAKSTGLMGMNRGSLSFVNQLGFSKFS 214

Query: 57  YCLVDRDSDSTSTLEFDSS---LPPNAVTAPLLRNHELDTF----YYLGLTGISVGGDLL 109
           YC+   DS S   L  D+S   L P   T  +L++  L  F    Y + L GI VG  +L
Sbjct: 215 YCISGSDS-SVFLLLGDASYSWLGPIQYTPLVLQSTPLPYFDRVAYTVQLEGIRVGSKIL 273

Query: 110 PISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGT----RALSPTDGV--A 163
            + ++ F  D +G G  +VDSGT  T L    Y AL++ F+  T    R +   D V   
Sbjct: 274 SLPKSVFVPDHTGAGQTMVDSGTQFTFLMGPVYTALKNEFITQTKSVLRLVDDPDFVFQG 333

Query: 164 LFDTCYDFSSRSSVE---VPTVSFHFPEGKVLPLPAKNYLIPVDSNGT------FCFAFA 214
             D CY   S +      +P VS  F  G  + +  +  L  V+  G+      +CF F 
Sbjct: 334 TMDLCYKVGSTTRPNFSGLPMVSLMF-RGAEMSVSGQKLLYRVNGAGSEGKEEVYCFTFG 392

Query: 215 PTSSSLSI----IGNVQQQGTRVSFNLRNSLIGFTPN-KC 249
             S  L I    IG+  QQ   + F+L  S +GF  N +C
Sbjct: 393 -NSDLLGIEAFVIGHHHQQNVWMEFDLAKSRVGFAGNVRC 431


>gi|222617032|gb|EEE53164.1| hypothetical protein OsJ_35998 [Oryza sativa Japonica Group]
          Length = 384

 Score = 90.1 bits (222), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 71/244 (29%), Positives = 111/244 (45%), Gaps = 33/244 (13%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
           G   T+T T G+ +V  +  GC   + G F GA+G++G+G G+LS  SQ+    FSY L+
Sbjct: 133 GYLATDTFTFGATAVPGVVFGCSDASYGDFAGASGVIGIGRGNLSLISQLQFGKFSYQLL 192

Query: 61  ----DRDSDSTSTLEF-DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETA 115
                 D  + S + F D ++P         +   LD                  I    
Sbjct: 193 APEATDDGSADSVIRFGDDAVPKT-------KRGRLDA-----------------IPAGT 228

Query: 116 FKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVAL--FDTCYDFSS 173
           F +  +G GG+I+ S T VT L+   Y+ +R A V     L   +G A    D CY+ SS
Sbjct: 229 FDLRANGTGGVILSSTTPVTYLEQAAYDVVRAA-VASRIGLPAVNGSAALELDLCYNASS 287

Query: 174 RSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRV 233
            + V+VP ++  F  G  + L A NY    +  G  C    P+    S++G + Q GT +
Sbjct: 288 MAKVKVPKLTLVFDGGADMDLSAANYFYIDNDTGLECLTMLPSQGG-SVLGTLLQTGTNM 346

Query: 234 SFNL 237
            +++
Sbjct: 347 IYDV 350


>gi|297798978|ref|XP_002867373.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297313209|gb|EFH43632.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 434

 Score = 90.1 bits (222), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 78/251 (31%), Positives = 113/251 (45%), Gaps = 13/251 (5%)

Query: 6   ETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYC---LVDR 62
           +T   G  S  NI  GCG +N G F   +G+LGLG G+ S  ++   S FSYC   L+D 
Sbjct: 185 QTSDEGLISKPNIVFGCGQDNSG-FTQYSGVLGLGPGTFSIVTRNFGSKFSYCFGSLID- 242

Query: 63  DSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESG 122
            +   + L   +         PL         YYL L  IS+G  LL I    F+   S 
Sbjct: 243 PTYPHNFLILGNGARIEGDPTPL---QIFQDRYYLDLQAISLGEKLLDIEPGIFQRYRS- 298

Query: 123 NGGIIVDSGTAVTRLQTETYNALRDA--FVRGTRALSPTDGVALFDTCYDFSSRSSVE-V 179
            GG ++D+G + T L  E Y  L +   F+ G       D     + CY+ + +  +   
Sbjct: 299 KGGTVIDTGCSPTILAREAYETLSEEIDFLLGEVLRRVKDWEQYTNHCYEGNLKLDLYGF 358

Query: 180 PTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP-TSSSLSIIGNVQQQGTRVSFNLR 238
           P V+FHF  G  L L  ++  +  +S  +FC A    T   +S+IG + QQ   V +NLR
Sbjct: 359 PVVTFHFAGGAELALDVESLFVSSESGDSFCLAMTMNTFDDMSVIGAMAQQNYNVGYNLR 418

Query: 239 NSLIGFTPNKC 249
              + F    C
Sbjct: 419 TMKVYFQRTDC 429


>gi|307136234|gb|ADN34070.1| aspartic proteinase nepenthesin-1 precursor [Cucumis melo subsp.
           melo]
          Length = 412

 Score = 90.1 bits (222), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 86/275 (31%), Positives = 125/275 (45%), Gaps = 28/275 (10%)

Query: 1   GDFVTETVTLGSASVDNIAIGCG----HNNEGLFVGAAGLLGLGGGSLSFPSQINASTFS 56
           G+  ++   +GS+++     GC      +N        GL+G+  GSLSF +Q+    FS
Sbjct: 129 GNLASDNFRIGSSALPGTLFGCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVTQLGLPKFS 188

Query: 57  YCLVDRDSDSTSTLEFDSSLPP--NAVTAPLLR-NHELDTF----YYLGLTGISVGGDLL 109
           YC+  RDS S   L  DS L    N    PL++ +  L  F    Y + L GI VG  +L
Sbjct: 189 YCISGRDS-SGVLLFGDSHLSWLGNLTYTPLVQISTPLPYFDRVAYTVQLDGIRVGNKIL 247

Query: 110 PISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALF---- 165
           P+ ++ F  D +G G  +VDSGT  T L    Y ALR+ F+  T+ +    G   F    
Sbjct: 248 PLPKSIFAPDHTGAGQTMVDSGTQFTFLLGPVYTALRNEFLEQTKGVLAPLGDPNFVFQG 307

Query: 166 --DTCYDFSSRSSV-EVPTVSFHFPEGK-VLPLPAKNYLIPVDSNG---TFCFAFAPTSS 218
             D CY   +   + E+P VS  F   + V+      Y +P    G    +C  F   S 
Sbjct: 308 AMDLCYRVPAGGKLPELPAVSLMFRGAEMVVGGEVLLYKVPGMMKGKEWVYCLTFG-NSD 366

Query: 219 SLSI----IGNVQQQGTRVSFNLRNSLIGFTPNKC 249
            L I    IG+  QQ   + F+L  S +GF   +C
Sbjct: 367 LLGIEAFVIGHHHQQNVWMEFDLVKSRVGFVETRC 401


>gi|297605070|ref|NP_001056627.2| Os06g0118000 [Oryza sativa Japonica Group]
 gi|55296430|dbj|BAD68553.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
           Group]
 gi|215692556|dbj|BAG87976.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|255676664|dbj|BAF18541.2| Os06g0118000 [Oryza sativa Japonica Group]
          Length = 175

 Score = 89.7 bits (221), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 61/179 (34%), Positives = 88/179 (49%), Gaps = 15/179 (8%)

Query: 74  SSLPPNAVTAPLLRNHELD-TFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGT 132
           ++L P  V+ PLL +  +  TFY + L  I V G  LP+  T F      +   ++DS T
Sbjct: 9   AALVPTFVSTPLLSSSTMSPTFYRVLLRSIIVAGRPLPVPPTVF------SASSVIDSAT 62

Query: 133 AVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVL 192
            ++R+    Y ALR AF        P   V++ DTCYDFS   S+ +P+++  F  G  +
Sbjct: 63  VISRIPPTAYQALRAAFRSAMTMYRPAPPVSILDTCYDFSGVRSITLPSIALVFDGGATV 122

Query: 193 PLPAKNYLIPVDSNGTFCFAFAPTSSSLS--IIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
            L A   L+     G  C AFAPT+S      IGNVQQ+   V +++    I F    C
Sbjct: 123 NLDAAGILL----QG--CLAFAPTASDRMPGFIGNVQQRTLEVVYDVPGKAIRFRSAAC 175


>gi|224109494|ref|XP_002315215.1| predicted protein [Populus trichocarpa]
 gi|222864255|gb|EEF01386.1| predicted protein [Populus trichocarpa]
          Length = 444

 Score = 89.7 bits (221), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 84/277 (30%), Positives = 120/277 (43%), Gaps = 31/277 (11%)

Query: 1   GDFVTETVTLGSASVDNIAIGC----GHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFS 56
           G    ET   GS +      GC      +N        GL+G+  GSLSF +Q+    FS
Sbjct: 156 GHLAFETFRFGSLTRPATVFGCMDSGSSSNTEEDAKTTGLMGMNRGSLSFVNQMGFRKFS 215

Query: 57  YCLVDRDSD--------STSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDL 108
           YC+   DS           S L+  +  P   ++ PL     +   Y + L GI V   +
Sbjct: 216 YCISGLDSTGFLLLGEARYSWLKPLNYTPLVQISTPLPYFDRVA--YSVQLEGIKVNNKV 273

Query: 109 LPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGT----RALSPTDGV-- 162
           LP+ ++ F  D +G G  +VDSGT  T L    Y+ALR  F+  T    R L+    V  
Sbjct: 274 LPLPKSVFVPDHTGAGQTMVDSGTQFTFLLGPVYSALRKEFLLQTAGVLRVLNEPQYVFQ 333

Query: 163 ALFDTCYDFSSRSSV--EVPTVSFHFPEGKVLPLPAKN--YLIPVDSNG---TFCFAFAP 215
              D CY   S SS    +P V   F  G  + +  +   Y +P +  G    +CF F  
Sbjct: 334 GAMDLCYLIDSTSSTLPNLPVVKLMF-RGAEMSVSGQRLLYRVPGEVRGKDSVWCFTFGN 392

Query: 216 TSS---SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           +     S  +IG+ QQQ   + ++L NS IGF   +C
Sbjct: 393 SDELGISSFLIGHHQQQNVWMEYDLENSRIGFAELRC 429


>gi|388502342|gb|AFK39237.1| unknown [Lotus japonicus]
          Length = 440

 Score = 89.7 bits (221), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 77/246 (31%), Positives = 115/246 (46%), Gaps = 28/246 (11%)

Query: 17  NIAIGCGHNNEGLFVG-----AAGLLGLGGGSLSFPSQIN---ASTFSYCLVDRDSDSTS 68
            I  GCG  N+  F         G++GLG G LS  SQ+       FSYCL+   S+S S
Sbjct: 207 KICFGCGFQNK--FTADKSGKTTGIVGLGAGPLSLVSQLGDEIGHKFSYCLLPFSSNSNS 264

Query: 69  TLEFDSSLPPN---AVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGG 125
            L+F  +        V+ PL+   +L  FYYL L GI+VG   +   +T        +G 
Sbjct: 265 KLKFGEAAIVQGNGVVSTPLIIKPDL-PFYYLNLEGITVGAKTVKTGQT--------DGN 315

Query: 126 IIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVAL-FDTCYDFSSRSSVEVPTVSF 184
           II+DSG+ +T L+   YN    + V+ T A+     +   FD C+ +    S   P V F
Sbjct: 316 IIIDSGSTLTYLEESFYNEFV-SLVKETVAVEEDQYIPYPFDFCFTYKEGMSTP-PDVVF 373

Query: 185 HFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS-SSLSIIGNVQQQGTRVSFNLRNSLIG 243
           HF  G V+ L   N L+ ++ N   C    P+    ++I GN+ Q    V ++++   + 
Sbjct: 374 HFTGGDVV-LKPMNTLVLIEDN-LICSTVVPSHFDGIAIFGNLGQIDFHVGYDIQGGKVS 431

Query: 244 FTPNKC 249
           F P  C
Sbjct: 432 FAPTDC 437


>gi|302764208|ref|XP_002965525.1| hypothetical protein SELMODRAFT_406966 [Selaginella moellendorffii]
 gi|300166339|gb|EFJ32945.1| hypothetical protein SELMODRAFT_406966 [Selaginella moellendorffii]
          Length = 464

 Score = 89.7 bits (221), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 81/273 (29%), Positives = 122/273 (44%), Gaps = 35/273 (12%)

Query: 1   GDFVTETVTLGSASVDNI------AIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS- 53
           G  + +T+ +  A+ D +        GCG   +GL  G  G+L L  GSLSFPSQI    
Sbjct: 196 GRSLRDTLKMAGAASDELEEFPGFVFGCGSLLKGLISGEVGILALSPGSLSFPSQIGEKY 255

Query: 54  --TFSYCLVDRDSDST----------STLEFD---SSLPPNAVTAPLLRNHELDTFYYLG 98
              FSYCL+ + + ++          + +E     S  P      P+    E   +Y + 
Sbjct: 256 GNKFSYCLLRQTAQNSLKKSPMVFGEAAVELKEPGSGKPQELQYTPI---GESSIYYTVR 312

Query: 99  LTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSP 158
           L GISVG   L +S + F      +   I DSGT +T L +   ++++ +       +S 
Sbjct: 313 LDGISVGNQRLDLSPSTFL--NGQDKPTIFDSGTTLTMLPSGVCDSIKQSLA---SMVSG 367

Query: 159 TDGVAL--FDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT 216
            + VA+   D C+     S   +P ++FHF  G        NY+I  D     C  F PT
Sbjct: 368 AEFVAIKGLDACFRVPPSSGQGLPDITFHFNGGADFVTRPSNYVI--DLGSLQCLIFVPT 425

Query: 217 SSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
            + +SI GN+QQQ   V  ++ N  IGF    C
Sbjct: 426 -NEVSIFGNLQQQDFFVLHDMDNRRIGFKETDC 457


>gi|224090744|ref|XP_002309070.1| predicted protein [Populus trichocarpa]
 gi|222855046|gb|EEE92593.1| predicted protein [Populus trichocarpa]
          Length = 404

 Score = 89.4 bits (220), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 86/277 (31%), Positives = 123/277 (44%), Gaps = 31/277 (11%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHN----NEGLFVGAAGLLGLGGGSLSFPSQINASTFS 56
           G+  ++   +GS+ +  +  GC  +    N      + GL+G+  GSLSF SQ+    FS
Sbjct: 120 GNLASDVFHIGSSDISGLVFGCMDSVFSSNSDEDSKSTGLMGMNRGSLSFVSQLGFPKFS 179

Query: 57  YCLVDRDSDSTSTL---EFDSSLPPNAVTAPLLR-NHELDTF----YYLGLTGISVGGDL 108
           YC+   D      L       S+P N    PL++ +  L  F    Y + L GI V   L
Sbjct: 180 YCISGTDFSGLLLLGESNLTWSVPLNY--TPLIQISTPLPYFDRVAYTVQLEGIKVLDKL 237

Query: 109 LPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGT----RALSPTDGV-- 162
           LPI ++ F+ D +G G  +VDSGT  T L    YNALR AF+  T    R L   D V  
Sbjct: 238 LPIPKSTFEPDHTGAGQTMVDSGTQFTFLLGPVYNALRSAFLNQTSSVLRVLEDPDFVFQ 297

Query: 163 ALFDTCY--DFSSRSSVEVPTVSFHFPEGKVLPLPAKN--YLIPVDSNG---TFCFAFAP 215
              D CY    S R    +PTV+  F  G  + +      Y +P +  G     C +F  
Sbjct: 298 GAMDLCYLVPLSQRVLPLLPTVTLVF-RGAEMTVSGDRVLYRVPGELRGNDSVHCLSFGN 356

Query: 216 TS---SSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           +        +IG+  QQ   + F+L  S IG    +C
Sbjct: 357 SDLLGVEAYVIGHHHQQNVWMEFDLEKSRIGLAQVRC 393


>gi|356557010|ref|XP_003546811.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 437

 Score = 89.4 bits (220), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 62/199 (31%), Positives = 93/199 (46%), Gaps = 16/199 (8%)

Query: 55  FSYCLVDRDSDSTSTLEFDSSL---PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPI 111
           FSYCL+   S+STS L+F S         V+ PL+      +FY+L L  +++G  ++P 
Sbjct: 248 FSYCLLPFSSNSTSKLKFGSEAIVTTNGVVSTPLIIKPLFPSFYFLNLEAVTIGQKVVPT 307

Query: 112 SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDF 171
             T        +G II+DSGT +T L+   YN    +        S  D    F  C+ +
Sbjct: 308 GRT--------DGNIIIDSGTVLTYLEQTFYNNFVASLQEVLSVESAQDLPFPFKFCFPY 359

Query: 172 SSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS-SSLSIIGNVQQQG 230
              +   +P ++F F  G  + L  KN LI +      C A  P+S S +SI GNV Q  
Sbjct: 360 RDMT---IPVIAFQF-TGASVALQPKNLLIKLQDRNMLCLAVVPSSLSGISIFGNVAQFD 415

Query: 231 TRVSFNLRNSLIGFTPNKC 249
            +V ++L    + F P  C
Sbjct: 416 FQVVYDLEGKKVSFAPTDC 434


>gi|449444520|ref|XP_004140022.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
          Length = 229

 Score = 89.4 bits (220), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 67/232 (28%), Positives = 105/232 (45%), Gaps = 25/232 (10%)

Query: 37  LGLGGGSLSFPSQINAST--FSYCLVDRDSDSTSTLEF--------------DSSLPPNA 80
           LG    SL++ +  NA+   FSYCLVD  +D  +   F               + LP   
Sbjct: 3   LGTSSYSLTYKAAENANGGGFSYCLVDHLTDQRAISYFVLGIPTPSTSASTSSAKLPAKM 62

Query: 81  VTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTE 140
               L       +FY + L GIS  G +L I    + I+  G  G I+DSGT++T L   
Sbjct: 63  TYTKLYVGDPYSSFYGVDLIGISANGIMLNIPSRVWDINSGG--GTIIDSGTSLTILAAP 120

Query: 141 TYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYL 200
            ++ + +A     +     + +  FD C++ S  +    P + FHF +G V   P K+Y+
Sbjct: 121 AFDMVMEALTPRLKKFQQLE-IEPFDFCFNNSQYTHEMAPKLRFHFGDGTVFEPPTKSYI 179

Query: 201 IPVDSNGTF--CFAFAPTS-SSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           + V   G F  C  F      + +IIGN+ QQ     F+ +   +GF P++C
Sbjct: 180 VSV---GKFISCIGFVSMPFPANNIIGNILQQNHLWQFDFQKRRVGFAPSEC 228


>gi|255581508|ref|XP_002531560.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223528821|gb|EEF30826.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 407

 Score = 89.4 bits (220), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 83/277 (29%), Positives = 121/277 (43%), Gaps = 31/277 (11%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHN----NEGLFVGAAGLLGLGGGSLSFPSQINASTFS 56
           G+  ++T  +G++ +  +  GC  +    N        GL+G+  GSLSF SQ+    FS
Sbjct: 123 GNLASDTFHMGASDIPGMVFGCMDSVFSSNSDEDSKNTGLMGMNRGSLSFVSQMGFPKFS 182

Query: 57  YCLVDRDSDSTSTL---EFDSSLPPN-----AVTAPLLRNHELDTFYYLGLTGISVGGDL 108
           YC+   D      L    F  ++P N      ++ PL     +   Y + L GI V   L
Sbjct: 183 YCISGTDFSGMLLLGESNFTWAVPLNYTPLVQISTPLPYFDRIA--YTVQLEGIKVSDRL 240

Query: 109 LPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGT----RALSPTDGV-- 162
           LPI ++ F+ D +G G  +VDSGT  T L    Y ALR  F+  T    R L   D V  
Sbjct: 241 LPIPKSVFEPDHTGAGQTMVDSGTQFTFLLGPAYTALRSEFLNQTTGFLRVLEDPDFVFQ 300

Query: 163 ALFDTCYD--FSSRSSVEVPTVSFHFPEGKVLPLPAKN--YLIPVDSNG---TFCFAFAP 215
              D CY    S R    +PTVS  F  G  + +  +   Y +P +  G     C +F  
Sbjct: 301 GAMDLCYRVPISQRVLPRLPTVSLVF-NGAEMTVADERVLYRVPGEIRGNDSVHCLSFGN 359

Query: 216 TS---SSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           +        +IG+  QQ   + F+L  S IG    +C
Sbjct: 360 SDLLGVEAYVIGHHHQQNVWMEFDLERSRIGLAQVRC 396


>gi|226530102|ref|NP_001152414.1| PCS1 precursor [Zea mays]
 gi|195656033|gb|ACG47484.1| PCS1 [Zea mays]
          Length = 452

 Score = 89.4 bits (220), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 76/243 (31%), Positives = 111/243 (45%), Gaps = 26/243 (10%)

Query: 33  AAGLLGLGGGSLSFPSQINASTFSYCLVDRDSDSTSTLEFD-SSLPPNAVTAPLLR-NHE 90
           A GLLG+  GSLSF +Q     F+YC+   D      L  D ++L P     PL++ +  
Sbjct: 197 ATGLLGMNRGSLSFVTQTATLRFAYCIAPGDGPGLLVLGGDGAALAPQLNYTPLIQISRP 256

Query: 91  LDTF----YYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALR 146
           L  F    Y + L GI VG  LLPI ++    D +G G  +VDSGT  T L  + Y  L+
Sbjct: 257 LPYFDRVAYSVQLEGIRVGAALLPIPKSVLAPDHTGAGQTMVDSGTQFTFLLADAYAPLK 316

Query: 147 DAFVRGTRALSPTDGVA--LFDTCYDFSSRSS-VEVPTVSFHFPEGKVLPLPAKN----- 198
             F+  T AL    G +  +F   +D   R+S   V   S   PE  ++   A+      
Sbjct: 317 GEFLNQTSALLAPLGESDFVFQGAFDACFRASEARVAAASXMLPEVGLVLRGAEVAVGGE 376

Query: 199 ---YLIPVDSNG------TFCFAFAPTS---SSLSIIGNVQQQGTRVSFNLRNSLIGFTP 246
              Y +P +  G       +C  F  +     S  +IG+  QQ   V ++L+N  +GF P
Sbjct: 377 KLLYRVPGERRGEGGAEAVWCLTFGNSDMAGMSAYVIGHHHQQNVWVEYDLQNGRVGFAP 436

Query: 247 NKC 249
            +C
Sbjct: 437 ARC 439


>gi|18405138|ref|NP_565911.1| Eukaryotic aspartyl protease family protein [Arabidopsis thaliana]
 gi|13877759|gb|AAK43957.1|AF370142_1 unknown protein [Arabidopsis thaliana]
 gi|15293231|gb|AAK93726.1| unknown protein [Arabidopsis thaliana]
 gi|20196976|gb|AAB87120.2| expressed protein [Arabidopsis thaliana]
 gi|20197046|gb|AAM14894.1| expressed protein [Arabidopsis thaliana]
 gi|330254616|gb|AEC09710.1| Eukaryotic aspartyl protease family protein [Arabidopsis thaliana]
          Length = 442

 Score = 89.4 bits (220), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 89/280 (31%), Positives = 127/280 (45%), Gaps = 34/280 (12%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHN----NEGLFVGAAGLLGLGGGSLSFPSQINASTFS 56
           G+   ET  +GS +      GC  +    N      + GL+G+  GSLSF +Q+  S FS
Sbjct: 155 GNLAHETFVIGSVTRPGTLFGCMDSGLSSNSEEDAKSTGLMGMNRGSLSFVNQLGFSKFS 214

Query: 57  YCLVDRDSDSTSTLEFDSS---LPPNAVTAPLLRNHELDTF----YYLGLTGISVGGDLL 109
           YC+   DS S   L  D+S   L P   T  +L++  L  F    Y + L GI VG  +L
Sbjct: 215 YCISGSDS-SGFLLLGDASYSWLGPIQYTPLVLQSTPLPYFDRVAYTVQLEGIRVGSKIL 273

Query: 110 PISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGT----RALSPTDGV--A 163
            + ++ F  D +G G  +VDSGT  T L    Y AL++ F+  T    R +   D V   
Sbjct: 274 SLPKSVFVPDHTGAGQTMVDSGTQFTFLMGPVYTALKNEFITQTKSVLRLVDDPDFVFQG 333

Query: 164 LFDTCYDFSSRSSVE---VPTVSFHFPEGKVLPLPAKNYLIPVDSNGT------FCFAFA 214
             D CY   S +      +P VS  F  G  + +  +  L  V+  G+      +CF F 
Sbjct: 334 TMDLCYKVGSTTRPNFSGLPMVSLMF-RGAEMSVSGQKLLYRVNGAGSEGKEEVYCFTFG 392

Query: 215 PTSSSLSI----IGNVQQQGTRVSFNLRNSLIGFTPN-KC 249
             S  L I    IG+  QQ   + F+L  S +GF  N +C
Sbjct: 393 -NSDLLGIEAFVIGHHHQQNVWMEFDLAKSRVGFAGNVRC 431


>gi|296086208|emb|CBI31649.3| unnamed protein product [Vitis vinifera]
          Length = 761

 Score = 89.0 bits (219), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 75/241 (31%), Positives = 113/241 (46%), Gaps = 31/241 (12%)

Query: 35  GLLGLGGGSLSFPSQINASTFSYCLVDRDSD--------STSTLEFDSSLPPNAVTAPLL 86
           GL+G+  GSLSF +Q+    FSYC+  +DS         S S L+     P   ++ PL 
Sbjct: 441 GLIGMNRGSLSFVTQMGLQKFSYCISGQDSSGILLFGESSFSWLKALKYTPLVQISTPLP 500

Query: 87  RNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALR 146
               +   Y + L GI V   +L + ++ +  D +G G  +VDSGT  T L    Y AL+
Sbjct: 501 YFDRVA--YTVQLEGIKVANSMLQLPKSVYAPDHTGAGQTMVDSGTQFTFLLGPVYTALK 558

Query: 147 DAFVRGTRA----LSPTDGV--ALFDTCYD--FSSRSSVEVPTVSFHFPEGKVLPLPAKN 198
           + FVR T+A    L   + V     D CY    + R+   +PTV+  F  G  + + A+ 
Sbjct: 559 NEFVRQTKASLKVLEDPNFVFQGAMDLCYRVPLTRRTLPPLPTVTLMF-RGAEMSVSAER 617

Query: 199 YLIPVD-----SNGTFCFAFAPTSSSL-----SIIGNVQQQGTRVSFNLRNSLIGFTPNK 248
            +  V      S+  +CF F   +S L      IIG+  QQ   + F+L  S +GF   +
Sbjct: 618 LMYRVPGVIRGSDSVYCFTFG--NSELLGVESYIIGHHHQQNVWMEFDLAKSRVGFAEVR 675

Query: 249 C 249
           C
Sbjct: 676 C 676


>gi|356532672|ref|XP_003534895.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 449

 Score = 89.0 bits (219), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 77/262 (29%), Positives = 117/262 (44%), Gaps = 26/262 (9%)

Query: 4   VTETVTLGSASVDNIAIGCGHN-NEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLVDR 62
           V ET   G++ + ++  GCGHN  +    G  G+LGL  G  S  ++I    FSYC+ D 
Sbjct: 192 VFETTDEGTSRIPDVLFGCGHNIGQDTDPGHNGILGLNNGPDSLATKI-GQKFSYCIGDL 250

Query: 63  DSD--STSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDE 120
                +   L           + P   +   + FYY+ + GISVG   L I+   F++ +
Sbjct: 251 ADPYYNYHQLILGEGADLEGYSTPFEVH---NGFYYVTMEGISVGEKRLDIAPETFEMKK 307

Query: 121 SGNGGIIVDSGTAVT--------RLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFS 172
           +  GG+I+D+G+ +T         L  E  N L  +F + T   SP          Y   
Sbjct: 308 NRTGGVIIDTGSTITFLVDSVHRLLSKEVRNLLGWSFRQTTIEKSP-----WMQCFYGSI 362

Query: 173 SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS-----SSLSIIGNVQ 227
           SR  V  P V+FHF +G  L L + ++   ++ N  FC    P S     S  S+IG + 
Sbjct: 363 SRDLVGFPVVTFHFADGADLALDSGSFFNQLNDN-VFCMTVGPVSSLNLKSKPSLIGLLA 421

Query: 228 QQGTRVSFNLRNSLIGFTPNKC 249
           QQ   V ++L N  + F    C
Sbjct: 422 QQSYSVGYDLVNQFVYFQRIDC 443


>gi|222624328|gb|EEE58460.1| hypothetical protein OsJ_09701 [Oryza sativa Japonica Group]
          Length = 360

 Score = 89.0 bits (219), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 73/254 (28%), Positives = 107/254 (42%), Gaps = 42/254 (16%)

Query: 3   FVTETVTLGSASVDNIAIGCGHNNEG--LFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
             ++T+ LG  ++ N   GC  +  G    +   GLLGLG G ++  SQ  +        
Sbjct: 141 LASDTLRLGKDAIPNYTFGCVSSVTGPTTNMPRQGLLGLGRGPMALLSQAGS-------- 192

Query: 61  DRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYL-GLTGISVGGDLLPISETAFKID 119
                      ++  LP        L   EL     L   +G   G         +F  D
Sbjct: 193 ----------LYNGRLP--------LLPPELQVILLLRACSGFPAG---------SFAFD 225

Query: 120 ESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEV 179
            +   G +VDSGT +TR     Y ALR+ F R   A S    +  FDTC++    ++   
Sbjct: 226 AATGAGTVVDSGTVITRWTAPVYAALREEFRRQVAAPSGYTSLGAFDTCFNTDEVAAGGA 285

Query: 180 PTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP----TSSSLSIIGNVQQQGTRVSF 235
           P V+ H   G  L LP +N LI   +    C A A      +S +++I N+QQQ  RV F
Sbjct: 286 PAVTVHMDGGVDLALPMENTLIHSSATPLACLAMAEAPQNVNSVVNVIANLQQQNIRVVF 345

Query: 236 NLRNSLIGFTPNKC 249
           ++ NS +GF    C
Sbjct: 346 DVANSRVGFAKESC 359


>gi|380719867|gb|AFD63134.1| aspartyl protease [Vitis quinquangularis]
          Length = 458

 Score = 89.0 bits (219), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 66/268 (24%), Positives = 111/268 (41%), Gaps = 21/268 (7%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
           G F+ E +     ++    +GC  + +     +  L G G    S P Q+    F+YCL 
Sbjct: 193 GFFLLENLDFPGKTIHKFLVGCTTSADRE-PSSDALAGFGRTMFSLPMQMGVKKFAYCLN 251

Query: 61  DRDSDSTST-----LEFDSSLPPNAVTAPLLRNH-ELDTFYYLGLTGISVGGDLLPISET 114
             D D T       L++          AP  +N  +   +YYLG+  + +G  +L I   
Sbjct: 252 SHDYDDTRNSGKLILDYSDGETQGLSYAPFXKNPPDYPIYYYLGVKDMKIGNKVLRIPGK 311

Query: 115 AFKIDESGNGGIIVDSGTAVTRLQTETY----NALRDAFVRGTRALSPTDGVALFDTCYD 170
                    GG+++DSG A + +    +    N L+    +  R+L   +       CY+
Sbjct: 312 YLTPGSDSRGGVVIDSGFAYSYMTLPVFKIVTNELKKQMSKYRRSLE-LEAQTGVTPCYN 370

Query: 171 FSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAF---APTSS------SLS 221
           F+   S+++P + + F  G  + +P  NY +        CF     +PTS+         
Sbjct: 371 FTGHKSIKIPDLIYQFTGGANMVVPGMNYFLLFSEASLGCFPVTTDSPTSNLEFTPGPSI 430

Query: 222 IIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           I+GN QQ    V F+L+N  +GF    C
Sbjct: 431 ILGNYQQVDHYVEFDLKNERLGFRQQTC 458


>gi|357118074|ref|XP_003560784.1| PREDICTED: aspartic proteinase nepenthesin-2-like, partial
           [Brachypodium distachyon]
          Length = 452

 Score = 88.6 bits (218), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 86/267 (32%), Positives = 126/267 (47%), Gaps = 31/267 (11%)

Query: 1   GDFVTETVTLGSAS-VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFS 56
           G    ET+T  S+S       GCG  N G F    GLLGLG GSLS  SQ   +    FS
Sbjct: 199 GVLARETLTFSSSSEFTGFIFGCGETNLGDFGEVDGLLGLGRGSLSLSSQAAPAFGGIFS 258

Query: 57  YCLVDRDSD----STSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPIS 112
           YCL   ++     S         +P       ++   +  +FY++ L  I++GG +LP+ 
Sbjct: 259 YCLPSYNTTPGYLSIGATPVTGQIPVQYTA--MVNKPDYPSFYFIELVSINIGGYVLPVP 316

Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAF---VRGTRALSPTDGVALFDTCY 169
            + F        G ++DSGT +T L    Y ALRD F   ++G++   P D +   DTCY
Sbjct: 317 PSEFT-----KTGTLLDSGTILTYLPPPAYTALRDRFKFTMQGSKPAPPYDEL---DTCY 368

Query: 170 DFSSRSSVEVPTVSFHFPEGKVLPLPAKNYL----IPVDSN---GTFCFAFAPTSSSLSI 222
           DF+ +S + +P VSF+F +G V  L   N+      P D+    G   F   P     S+
Sbjct: 369 DFTGQSGILIPGVSFNFSDGAVFNL---NFFGIMTFPDDTKPAVGCLAFVSRPADMPFSV 425

Query: 223 IGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           +G+  Q+   V +++    IGF P  C
Sbjct: 426 VGSTTQRSAEVIYDVPAQKIGFIPASC 452


>gi|414866064|tpg|DAA44621.1| TPA: putative aspartic protease family protein [Zea mays]
          Length = 454

 Score = 88.6 bits (218), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 76/243 (31%), Positives = 111/243 (45%), Gaps = 26/243 (10%)

Query: 33  AAGLLGLGGGSLSFPSQINASTFSYCLVDRDSDSTSTLEFD-SSLPPNAVTAPLLR-NHE 90
           A GLLG+  GSLSF +Q     F+YC+   D      L  D ++L P     PL++ +  
Sbjct: 199 ATGLLGMNRGSLSFVTQTATLRFAYCIAPGDGPGLLVLGGDGAALAPQLNYTPLIQISRP 258

Query: 91  LDTF----YYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALR 146
           L  F    Y + L GI VG  LLPI ++    D +G G  +VDSGT  T L  + Y  L+
Sbjct: 259 LPYFDRVAYSVQLEGIRVGAALLPIPKSVLAPDHTGAGQTMVDSGTQFTFLLADAYAPLK 318

Query: 147 DAFVRGTRALSPTDGVA--LFDTCYDFSSRSS-VEVPTVSFHFPEGKVLPLPAKN----- 198
             F+  T AL    G +  +F   +D   R+S   V   S   PE  ++   A+      
Sbjct: 319 GEFLNQTSALLAPLGESDFVFQGAFDACFRASEARVAAASQMLPEVGLVLRGAEVAVGGE 378

Query: 199 ---YLIPVDSNG------TFCFAFAPTS---SSLSIIGNVQQQGTRVSFNLRNSLIGFTP 246
              Y +P +  G       +C  F  +     S  +IG+  QQ   V ++L+N  +GF P
Sbjct: 379 KLLYRVPGERRGEGGAEAVWCLTFGNSDMAGMSAYVIGHHHQQNVWVEYDLQNGRVGFAP 438

Query: 247 NKC 249
            +C
Sbjct: 439 ARC 441


>gi|125572774|gb|EAZ14289.1| hypothetical protein OsJ_04213 [Oryza sativa Japonica Group]
          Length = 492

 Score = 88.6 bits (218), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 69/234 (29%), Positives = 103/234 (44%), Gaps = 10/234 (4%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
           G    +     +   D +  GC    EG      G++GLG G LS  SQ+    FSY L 
Sbjct: 177 GLLAVDAFAFATVRADGVIFGCAVATEGDI---GGVIGLGRGELSPVSQLQIGRFSYYLA 233

Query: 61  DRDS-DSTSTLEFDSSLPPN---AVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
             D+ D  S + F     P    AV+ PL+ +    + YY+ L GI V G+ L I    F
Sbjct: 234 PDDAVDVGSFILFLDDAKPRTSRAVSTPLVASRASRSLYYVELAGIRVDGEDLAIPRGTF 293

Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVAL-FDTCYDFSSRS 175
            +   G+GG+++     VT L    Y  +R A       L   DG  L  D CY   S +
Sbjct: 294 DLQADGSGGVVLSITIPVTFLDAGAYKVVRQAMASKIE-LRAADGSELGLDLCYTSESLA 352

Query: 176 SVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT-SSSLSIIGNVQQ 228
           + +VP+++  F  G V+ L   NY     + G  C    P+ +   S++G++ Q
Sbjct: 353 TAKVPSMALVFAGGAVMELEMGNYFYMDSTTGLECLTILPSPAGDGSLLGSLIQ 406


>gi|357481195|ref|XP_003610883.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355512218|gb|AES93841.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 315

 Score = 88.2 bits (217), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 79/254 (31%), Positives = 119/254 (46%), Gaps = 29/254 (11%)

Query: 13  ASVDNIAIGCGHNNEGLFV-GAAGLLGLGGGSLSFPSQIN----ASTFSYCLVD--RDSD 65
            S+     GCGHNN G F     GL+GLGGG  S  SQI        FS CLV    D  
Sbjct: 71  VSLSRFLFGCGHNNTGGFNDHEMGLIGLGGGPTSLISQIGPLFGGKKFSQCLVPFLTDIK 130

Query: 66  STSTLEFDSS---LPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESG 122
            +S + F      L    VT PL++  +  T Y++ L GISV    LP++ T  K     
Sbjct: 131 ISSRMSFGKGSQVLGDGVVTTPLVQREQDMTSYFVTLLGISVEDTYLPMNSTIEK----- 185

Query: 123 NGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALS-----PTDGVALFDTCYDFSSRSSV 177
            G ++VDSGT    L  + Y+ +    V+    L      P+ G  L   CY   +++++
Sbjct: 186 -GNMLVDSGTPPNILPQQLYDRVY-VEVKNNVPLELITNDPSLGPQL---CY--RTQTNL 238

Query: 178 EVPTVSFHFPEGKVLPLPAKNYLIPV-DSNGTFCFAFAP-TSSSLSIIGNVQQQGTRVSF 235
           + PT+++HF    +L  P + ++ P  ++ G FC A    T+S+  + GN  Q    + F
Sbjct: 239 KGPTLTYHFEGANLLLTPIQTFIPPTPETKGVFCLAINNYTNSNGGVYGNFAQSNYLIGF 298

Query: 236 NLRNSLIGFTPNKC 249
           +L   ++ F    C
Sbjct: 299 DLDRQVVSFKATDC 312


>gi|413946455|gb|AFW79104.1| hypothetical protein ZEAMMB73_209101 [Zea mays]
          Length = 480

 Score = 88.2 bits (217), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 75/255 (29%), Positives = 122/255 (47%), Gaps = 23/255 (9%)

Query: 13  ASVDNIAIGCGHNNEGL-FVGAAGLLGLGGGSLSFPSQINAS---TFSYCLVDR--DSDS 66
           A +  + +GC  + +G  F  + G+L LG  ++SF S+  A     FSYCLVD     ++
Sbjct: 229 AKLQGVVLGCTASYDGQSFQSSDGVLSLGNSNISFASRAAARFGGRFSYCLVDHLAPRNA 288

Query: 67  TSTLEFDSSLPPNAVTA-----------PLLRNHELDTFYYLGLTGISVGGDLLPISETA 115
           TS L F    P     A           PLL +  +  FY + +  + V G+ L I    
Sbjct: 289 TSYLTFGPPGPEGGAAASSSSSSAAARTPLLLDRRMSPFYAVAVDAVHVAGEALDIPADV 348

Query: 116 FKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRS 175
           +  D +  GG I+DSGT++T L T  Y A+  A       L P   +  F+ CY++++ +
Sbjct: 349 W--DVARGGGAILDSGTSLTVLATPAYRAVVAALSERLAGL-PRVSMDPFEYCYNWTA-A 404

Query: 176 SVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS-SSLSIIGNVQQQGTRVS 234
           ++E+P +   F     L  PAK+Y++   + G  C      +   +S+IGN+ QQ     
Sbjct: 405 ALEIPGLEVRFAGSARLQPPAKSYVVDA-APGVKCIGVQEGAWPGVSVIGNILQQDHLWE 463

Query: 235 FNLRNSLIGFTPNKC 249
           F+LR+  + F   +C
Sbjct: 464 FDLRDRWLRFKHTRC 478


>gi|361068717|gb|AEW08670.1| Pinus taeda anonymous locus CL1136Contig1_03 genomic sequence
          Length = 70

 Score = 88.2 bits (217), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 41/69 (59%), Positives = 51/69 (73%)

Query: 161 GVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSL 220
           G +LFDTCYD S   +V+VPT+ FHF     + LPA NYLIPVD++  FCF+FA  +S L
Sbjct: 2   GFSLFDTCYDLSVLKTVKVPTLVFHFQGRADVSLPATNYLIPVDTSAIFCFSFAGNTSGL 61

Query: 221 SIIGNVQQQ 229
           SIIGN+QQQ
Sbjct: 62  SIIGNIQQQ 70


>gi|224101015|ref|XP_002312106.1| predicted protein [Populus trichocarpa]
 gi|222851926|gb|EEE89473.1| predicted protein [Populus trichocarpa]
          Length = 440

 Score = 88.2 bits (217), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 87/277 (31%), Positives = 126/277 (45%), Gaps = 31/277 (11%)

Query: 1   GDFVTETVTLGSASVDNIAIGCG----HNNEGLFVGAAGLLGLGGGSLSFPSQINASTFS 56
           G+   ET  +GS +      GC      +N        GL+G+  GSLSF +Q+    FS
Sbjct: 156 GNLAFETFRVGSVTGPATVFGCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVNQMGFRKFS 215

Query: 57  YCLVDRDSDSTSTL---EFDSSLPPNAVTAPLLRNHELDTF----YYLGLTGISVGGDLL 109
           YC+ DRDS     L    F S L P   T  +  +  L  F    Y + L GI V   +L
Sbjct: 216 YCISDRDSSGVLLLGEASF-SWLKPLNYTPLVEMSTPLPYFDRVAYSVQLEGIRVSDKVL 274

Query: 110 PISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGT----RALSPTDGV--A 163
            + ++ F  D +G G  +VDSGT  T L    Y+AL+  F+  T    R L+    V   
Sbjct: 275 SLPKSVFVPDHTGAGQTMVDSGTQFTFLLGPVYSALKQEFLLQTKGVLRVLNEPRYVFQG 334

Query: 164 LFDTCYDFS-SRSSV-EVPTVSFHFPEGKVLPLPAKN--YLIPVDSNG---TFCFAFAPT 216
             D CY    +R+++  +P V+  F  G  + +  +   Y +P +  G    +CF F   
Sbjct: 335 AMDLCYLIEPTRAALPNLPVVNLMF-RGAEMSVSGQRLLYRVPGEVRGKDSVWCFTFG-N 392

Query: 217 SSSLSI----IGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           S SL I    IG+ QQQ   + ++L  S IGF   +C
Sbjct: 393 SDSLGIESFVIGHHQQQNVWMEYDLEKSRIGFAEVRC 429


>gi|449433371|ref|XP_004134471.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
 gi|449495479|ref|XP_004159853.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 424

 Score = 87.8 bits (216), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 68/218 (31%), Positives = 109/218 (50%), Gaps = 8/218 (3%)

Query: 35  GLLGLGGGSLSFPSQINASTFSYCLVDRDS-DSTSTLEFDSSLPPNAVTAPLLRNHELDT 93
           G +GL    LS  SQ+    FSYCLV  ++  STS + F S    +    PLL  +    
Sbjct: 209 GNVGLNQTPLSLISQLGIKKFSYCLVPFNNLGSTSKMYFGSLPVTSGGQTPLLYPNS--D 266

Query: 94  FYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR-G 152
            YY+ + GIS+G D  P  +  F + E  +G II D+G   + L+T+ +++L   F+   
Sbjct: 267 AYYVKVLGISIGNDE-PHFDGVFDVYEVRDGWII-DTGITYSSLETDAFDSLLAKFLTLK 324

Query: 153 TRALSPTDGVALFDTCYDFSSRSSVE-VPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCF 211
                  D    F+ C++  + + +E  P V+ HF +G  L L  ++  + ++ +G FC 
Sbjct: 325 DFPQRKDDPKERFELCFELQNANDLESFPDVTVHF-DGADLILNVESTFVKIEDDGIFCL 383

Query: 212 AFAPTSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           A   + S +SI+GN Q Q   V ++L   +I F P  C
Sbjct: 384 ALLRSGSPVSILGNFQLQNYHVGYDLEAQVISFAPVDC 421


>gi|125602787|gb|EAZ42112.1| hypothetical protein OsJ_26672 [Oryza sativa Japonica Group]
          Length = 477

 Score = 87.8 bits (216), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 77/256 (30%), Positives = 103/256 (40%), Gaps = 54/256 (21%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLG--GGSLSFPSQINASTFSYC 58
           G   T+TV LG ASVD    GCG +N GLF G AGL+GLG  G     P           
Sbjct: 266 GVLATDTVALGGASVDGFVFGCGLSNRGLFGGTAGLMGLGPDGALAGLP----------- 314

Query: 59  LVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKI 118
                         D + PP               FY++ +TG SV          A   
Sbjct: 315 --------------DGAPPP---------------FYFMNVTGASV-------GGAAVAA 338

Query: 119 DESGNGGIIVDSGTAVTRLQTETYNALRDAFVR--GTRALSPTDGVALFDTCYDFSSRSS 176
              G   +++DSGT +TRL    Y A+R  F R  G          +L D CY+ +    
Sbjct: 339 AGLGAANVLLDSGTVITRLAPSVYRAVRAEFARQFGAERYPAAPPFSLLDACYNLTGHDE 398

Query: 177 VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGT-FCFAFAPTS--SSLSIIGNVQQQGTRV 233
           V+VP ++     G  + + A   L     +G+  C A A  S      IIGN QQ+  RV
Sbjct: 399 VKVPLLTLRLEGGADMTVDAAGMLFMARKDGSQVCLAMASLSFEDQTPIIGNYQQKNKRV 458

Query: 234 SFNLRNSLIGFTPNKC 249
            ++   S +GF    C
Sbjct: 459 VYDTVGSRLGFADEDC 474


>gi|356558300|ref|XP_003547445.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 447

 Score = 87.8 bits (216), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 76/262 (29%), Positives = 115/262 (43%), Gaps = 26/262 (9%)

Query: 4   VTETVTLGSASVDNIAIGCGHN-NEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLVDR 62
           V ET   G++ + ++  GCGHN       G  G+LGL  G  S  +++    FSYC+ + 
Sbjct: 191 VFETTDEGTSRISDVLFGCGHNIGHDTDPGHNGILGLNNGPDSLVTKL-GQKFSYCIGNL 249

Query: 63  DSD--STSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDE 120
                +   L           + P       + FYY+ + GISVG   L I+   F++ E
Sbjct: 250 ADPYYNYHQLILGEGADLEGYSTPF---EVYNGFYYVTMEGISVGEKRLDIAPETFEMKE 306

Query: 121 SGNGGIIVDSGTAVT--------RLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFS 172
           +  GG+I+D+G+ +T         L  E  N L  +F + T   SP          Y   
Sbjct: 307 NRAGGVIIDTGSTITFLVDSVHKLLSKEVRNLLGWSFRQATIEKSP-----WMQCFYGSI 361

Query: 173 SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS-----SSLSIIGNVQ 227
           SR  V  P V+FHF +G  L L + ++   ++ N  FC    P S     S  S+IG + 
Sbjct: 362 SRDLVGFPVVTFHFSDGADLALDSGSFFNQLNDN-VFCMTVGPVSSLNIKSKPSLIGLLA 420

Query: 228 QQGTRVSFNLRNSLIGFTPNKC 249
           QQ   V ++L N  + F    C
Sbjct: 421 QQSYNVGYDLVNQFVYFQRIDC 442


>gi|115459640|ref|NP_001053420.1| Os04g0535200 [Oryza sativa Japonica Group]
 gi|113564991|dbj|BAF15334.1| Os04g0535200 [Oryza sativa Japonica Group]
 gi|116310090|emb|CAH67110.1| H0502G05.1 [Oryza sativa Indica Group]
 gi|116310464|emb|CAH67468.1| OSIGBa0159I10.13 [Oryza sativa Indica Group]
 gi|215715343|dbj|BAG95094.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215765807|dbj|BAG87504.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215767550|dbj|BAG99778.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|218195278|gb|EEC77705.1| hypothetical protein OsI_16781 [Oryza sativa Indica Group]
          Length = 492

 Score = 87.8 bits (216), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 80/272 (29%), Positives = 111/272 (40%), Gaps = 38/272 (13%)

Query: 12  SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSYCLVDRDSDS-- 66
           S +V+N    C H      VG AG    G G LS P+Q+  S    FSYCLV     +  
Sbjct: 215 SMAVENFTFACAHTALAEPVGVAGF---GRGPLSLPAQLAPSLSGRFSYCLVAHSFRADR 271

Query: 67  ---TSTLEFDSSLPPNAVTA--------PLLRNHELDTFYYLGLTGISVGGDLLPISETA 115
              +S L    S    A+ A        PLL N +   FY + L  +SVGG  +      
Sbjct: 272 LIRSSPLILGRSTDAAAIGASETDFVYTPLLHNPKHPYFYSVALEAVSVGGKRIQAQPEL 331

Query: 116 FKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSP-----TDGVALFDTCYD 170
             +D  GNGG++VDSGT  T L ++T+  + D F R   A         +       CY 
Sbjct: 332 GDVDRDGNGGMVVDSGTTFTMLPSDTFARVADEFARAMAAARFTRAEGAEAQTGLAPCYH 391

Query: 171 FSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDS---NGTFCFAFAPTSSS-------- 219
           +S  S   VP V+ HF     + LP +NY +   S       C        +        
Sbjct: 392 YSP-SDRAVPPVALHFRGNATVALPRRNYFMGFKSEEGRSVGCLMLMNVGGNNDDGEDGG 450

Query: 220 --LSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
                +GN QQQG  V +++    +GF   +C
Sbjct: 451 GPAGTLGNFQQQGFEVVYDVDAGRVGFARRRC 482


>gi|358346443|ref|XP_003637277.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355503212|gb|AES84415.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 434

 Score = 87.4 bits (215), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 76/264 (28%), Positives = 130/264 (49%), Gaps = 29/264 (10%)

Query: 1   GDFVTETVTLGSAS-----VDNIAIGCGHNNE-GLFVGAAGLLGLGGGSLSFPSQINAST 54
           GD   +++TL S S       NI IGCGH N       ++G++G+G G +S   Q+ +S+
Sbjct: 180 GDLSNDSLTLDSTSGSSVLFPNIVIGCGHINVLQDNSQSSGVVGMGRGPMSLIKQVGSSS 239

Query: 55  ----FSYCLV--DRDSDSTSTLEFDSSLPPN---AVTAPLLRNHELDTFYYLGLTGISVG 105
               FSYCL+  + DS+S+S L F   +  +    V+ P+++ +  + +Y+L L   SVG
Sbjct: 240 VGSKFSYCLIPYNSDSNSSSKLIFGEDVVVSGEIVVSTPMVKVNGQENYYFLTLEAFSVG 299

Query: 106 GDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAF---VRGTRALSPTDGV 162
            + +   E +     +    I++DSGT +T L     + L       V+  R   P   +
Sbjct: 300 NNRIEYGERS----NASTQNILIDSGTPLTMLPNLFLSKLVSYVAQEVKLPRIEPPDHHL 355

Query: 163 ALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSI 222
           +L   CY+ + +  + VP ++ HF  G  + L +     P + +G  CF F  +S+ L I
Sbjct: 356 SL---CYNTTGKQ-LNVPDITAHF-NGADVKLNSNGTFFPFE-DGIMCFGFI-SSNGLEI 408

Query: 223 IGNVQQQGTRVSFNLRNSLIGFTP 246
            GN+ Q    + ++L   +I F P
Sbjct: 409 FGNIAQNNLLIDYDLEKEIISFKP 432


>gi|317106730|dbj|BAJ53226.1| JHL06P13.6 [Jatropha curcas]
          Length = 445

 Score = 87.4 bits (215), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 85/262 (32%), Positives = 130/262 (49%), Gaps = 20/262 (7%)

Query: 1   GDFVTETVTLGSA--SVDNIAIGCGHNNEGLF-VGAAGLLGLGGGSLSFPSQINA---ST 54
           G   TE   +GS   S+  +A GCG++N G F    +G++GLGGGSLS  SQ+     + 
Sbjct: 187 GYLATERFIIGSTNNSIQELAFGCGNSNGGNFDEVGSGIVGLGGGSLSLISQLGTKIDNK 246

Query: 55  FSYCLV---DRDSDSTSTLEF-DSSLPPNA---VTAPLLRNHELDTFYYLGLTGISVGGD 107
           FSYCLV   ++ + S   + F D+S    +   V+ PL+ + E +TFYYL L  ISVG +
Sbjct: 247 FSYCLVPILEKSNFSLGKIVFGDNSFISGSDTYVSTPLV-SKEPETFYYLTLEAISVGNE 305

Query: 108 LLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDT 167
            L   E +        G II+DSGT +T L ++ YN L     +       +D   +F  
Sbjct: 306 RLAY-ENSRNDGNVEKGNIIIDSGTTLTFLDSKLYNKLELVLEKAVEGERVSDPNGIFSI 364

Query: 168 CYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQ 227
           C  F  +  +E+P ++ HF +  V   P   +    +     CF   P S+ ++I GN+ 
Sbjct: 365 C--FRDKIGIELPIITVHFTDADVELKPINTFAKAEED--LLCFTMIP-SNGIAIFGNLA 419

Query: 228 QQGTRVSFNLRNSLIGFTPNKC 249
           Q    V ++L  + + F P  C
Sbjct: 420 QMNFLVGYDLDKNCVSFMPTDC 441


>gi|38605896|emb|CAD41523.2| OSJNBb0020O11.8 [Oryza sativa Japonica Group]
          Length = 519

 Score = 87.4 bits (215), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 80/273 (29%), Positives = 111/273 (40%), Gaps = 38/273 (13%)

Query: 11  GSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSYCLVDRDSDS- 66
            S +V+N    C H      VG AG    G G LS P+Q+  S    FSYCLV     + 
Sbjct: 214 ASMAVENFTFACAHTALAEPVGVAGF---GRGPLSLPAQLAPSLSGRFSYCLVAHSFRAD 270

Query: 67  ----TSTLEFDSSLPPNAVTA--------PLLRNHELDTFYYLGLTGISVGGDLLPISET 114
               +S L    S    A+ A        PLL N +   FY + L  +SVGG  +     
Sbjct: 271 RLIRSSPLILGRSTDAAAIGASETDFVYTPLLHNPKHPYFYSVALEAVSVGGKRIQAQPE 330

Query: 115 AFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSP-----TDGVALFDTCY 169
              +D  GNGG++VDSGT  T L ++T+  + D F R   A         +       CY
Sbjct: 331 LGDVDRDGNGGMVVDSGTTFTMLPSDTFARVADEFARAMAAARFTRAEGAEAQTGLAPCY 390

Query: 170 DFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDS---NGTFCFAFAPTSSS------- 219
            +S  S   VP V+ HF     + LP +NY +   S       C        +       
Sbjct: 391 HYSP-SDRAVPPVALHFRGNATVALPRRNYFMGFKSEEGRSVGCLMLMNVGGNNDDGEDG 449

Query: 220 ---LSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
                 +GN QQQG  V +++    +GF   +C
Sbjct: 450 GGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRC 482


>gi|21717157|gb|AAM76350.1|AC074196_8 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433294|gb|AAP54832.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
          Length = 396

 Score = 87.4 bits (215), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 78/262 (29%), Positives = 114/262 (43%), Gaps = 23/262 (8%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNE-GLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
           G   T+ V +G+A+   +A GC   +E     G++G +GLG  +LS  +Q+NA+ FSYCL
Sbjct: 141 GRIGTDAVAIGTAATARLAFGCAVASEMDTMWGSSGSVGLGRTNLSLAAQMNATAFSYCL 200

Query: 60  VDRDSDSTSTLEFDSSLP-----PNAVTAPLLR-----NHELDTFYYLGLTGISVGGDLL 109
              D+  +S L   +S         A T P ++     N  L   Y L L  I  G    
Sbjct: 201 APPDTGKSSALFLGASAKLAGAGKGAGTTPFVKTSTPPNSGLSRSYLLRLEAIRAG---- 256

Query: 110 PISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCY 169
                   + +SGN  I V + T VT L    Y  LR A      A      V  +D C+
Sbjct: 257 ---NATIAMPQSGN-TITVSTATPVTALVDSVYRDLRKAVADAVGAAPVPPPVQNYDLCF 312

Query: 170 DFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAF--APTSSSLSIIGNVQ 227
             +S S    P +   F  G  + +P  +YL     N T C A   +P    +SI+G++Q
Sbjct: 313 PKASASG-GAPDLVLAFQGGAEMTVPVSSYLFDA-GNDTACVAILGSPALGGVSILGSLQ 370

Query: 228 QQGTRVSFNLRNSLIGFTPNKC 249
           Q    + F+L    + F P  C
Sbjct: 371 QVNIHLLFDLDKETLSFEPADC 392


>gi|55296937|dbj|BAD68388.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
           sativa Japonica Group]
 gi|218197467|gb|EEC79894.1| hypothetical protein OsI_21421 [Oryza sativa Indica Group]
          Length = 424

 Score = 87.4 bits (215), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 65/171 (38%), Positives = 82/171 (47%), Gaps = 18/171 (10%)

Query: 84  PLLRNHEL-DTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETY 142
           PL+RN  +  T Y + L GI VGG  L +    F       GG ++DS   +T+L    Y
Sbjct: 267 PLVRNPSIIPTLYLVRLRGIEVGGRRLNVPPVVFA------GGAVMDSSVIITQLPPTAY 320

Query: 143 NALRDAFVRGTRALSP--TDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYL 200
            ALR AF R   A  P    G A  DTCYDF   +SV VP VS  F  G V+ L A   +
Sbjct: 321 RALRLAF-RSAMAAYPRVAGGRAGLDTCYDFVRFTSVTVPAVSLVFDGGAVVRLDAMGVM 379

Query: 201 IPVDSNGTFCFAFAPTSS--SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           +        C AF PT    +L  IGNVQQQ   V +++    +GF    C
Sbjct: 380 V------EGCLAFVPTPGDFALGFIGNVQQQTHEVLYDVVGGSVGFRRGAC 424


>gi|55296886|dbj|BAD68338.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
           sativa Japonica Group]
 gi|55296941|dbj|BAD68392.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
           sativa Japonica Group]
          Length = 424

 Score = 87.4 bits (215), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 65/171 (38%), Positives = 82/171 (47%), Gaps = 18/171 (10%)

Query: 84  PLLRNHEL-DTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETY 142
           PL+RN  +  T Y + L GI VGG  L +    F       GG ++DS   +T+L    Y
Sbjct: 267 PLVRNPSIIPTLYLVRLRGIEVGGRRLNVPPVVFA------GGAVMDSSVIITQLPPTAY 320

Query: 143 NALRDAFVRGTRALSP--TDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYL 200
            ALR AF R   A  P    G A  DTCYDF   +SV VP VS  F  G V+ L A   +
Sbjct: 321 RALRLAF-RSAMAAYPRVAGGRAGLDTCYDFVRFTSVTVPAVSLVFDGGAVVRLDAMGVM 379

Query: 201 IPVDSNGTFCFAFAPTSS--SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           +        C AF PT    +L  IGNVQQQ   V +++    +GF    C
Sbjct: 380 V------EGCLAFVPTPGDFALGFIGNVQQQTHEVLYDVGGGSVGFRRGAC 424


>gi|357444933|ref|XP_003592744.1| hypothetical protein MTR_1g115080, partial [Medicago truncatula]
 gi|355481792|gb|AES62995.1| hypothetical protein MTR_1g115080, partial [Medicago truncatula]
          Length = 65

 Score = 87.4 bits (215), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 41/65 (63%), Positives = 50/65 (76%)

Query: 185 HFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNSLIGF 244
           +F  G +L LPA+N+LIPVDS GTFCFAFAP+SS LSIIGN+QQ+G  +S +  N  IGF
Sbjct: 1   YFLGGPILTLPARNFLIPVDSVGTFCFAFAPSSSGLSIIGNIQQEGIEISVDGANGYIGF 60

Query: 245 TPNKC 249
            PN C
Sbjct: 61  GPNIC 65


>gi|224063191|ref|XP_002301033.1| predicted protein [Populus trichocarpa]
 gi|222842759|gb|EEE80306.1| predicted protein [Populus trichocarpa]
          Length = 536

 Score = 87.4 bits (215), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 70/242 (28%), Positives = 108/242 (44%), Gaps = 22/242 (9%)

Query: 17  NIAIGCGHNNEG-LFVGAA--GLLGLGGGSLSFPSQINAS-----TFSYCLVDRDSDSTS 68
           ++ +GCG    G  F GAA  G++GLG G +S PS +  +      FS C  + DS    
Sbjct: 230 SVVLGCGRKQGGSFFDGAAPDGVMGLGPGDISVPSLLAKAGLIQNCFSLCFDENDS---G 286

Query: 69  TLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIV 128
            + F      +  + P L        Y++G+    VG   L    + FK         +V
Sbjct: 287 RILFGDRGHASQQSTPFLPIQGTYVAYFVGVESYCVGNSCL--KRSGFKA--------LV 336

Query: 129 DSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPE 188
           DSG++ T L +E YN L   F +   A   +    L+D CY+ SS+   ++P +   FP 
Sbjct: 337 DSGSSFTYLPSEVYNELVSEFDKQVNAKRISFQDGLWDYCYNASSQELHDIPAIQLKFPR 396

Query: 189 GKVLPLPAKNYLIPVDSNGT-FCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPN 247
            +   +    Y IP     T FC +  PT  S  IIG     G R+ F++ N  +G++ +
Sbjct: 397 NQNFVVHNPTYSIPHHQGFTMFCLSLQPTDGSYGIIGQNFMIGYRMVFDIENLKLGWSNS 456

Query: 248 KC 249
            C
Sbjct: 457 SC 458


>gi|383128174|gb|AFG44740.1| Pinus taeda anonymous locus CL117Contig1_03 genomic sequence
          Length = 103

 Score = 87.4 bits (215), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 46/102 (45%), Positives = 61/102 (59%), Gaps = 5/102 (4%)

Query: 85  LLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNA 144
           L+ N    ++Y++ L GISVGG  L I+         G GG IVDSGT +TRL  + YNA
Sbjct: 1   LVSNSIYTSYYFVVLNGISVGGQRLSITPAVL-----GKGGTIVDSGTIITRLVPQAYNA 55

Query: 145 LRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHF 186
           L+ +F   T+ L   +  ++ DTCYD SS S V VP V+FHF
Sbjct: 56  LKTSFRSQTQNLPSAEPYSILDTCYDLSSYSQVRVPIVTFHF 97


>gi|115466078|ref|NP_001056638.1| Os06g0121500 [Oryza sativa Japonica Group]
 gi|113594678|dbj|BAF18552.1| Os06g0121500 [Oryza sativa Japonica Group]
          Length = 442

 Score = 87.4 bits (215), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 65/171 (38%), Positives = 82/171 (47%), Gaps = 18/171 (10%)

Query: 84  PLLRNHEL-DTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETY 142
           PL+RN  +  T Y + L GI VGG  L +    F       GG ++DS   +T+L    Y
Sbjct: 285 PLVRNPSIIPTLYLVRLRGIEVGGRRLNVPPVVFA------GGAVMDSSVIITQLPPTAY 338

Query: 143 NALRDAFVRGTRALSP--TDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYL 200
            ALR AF R   A  P    G A  DTCYDF   +SV VP VS  F  G V+ L A   +
Sbjct: 339 RALRLAF-RSAMAAYPRVAGGRAGLDTCYDFVRFTSVTVPAVSLVFDGGAVVRLDAMGVM 397

Query: 201 IPVDSNGTFCFAFAPTSS--SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           +        C AF PT    +L  IGNVQQQ   V +++    +GF    C
Sbjct: 398 V------EGCLAFVPTPGDFALGFIGNVQQQTHEVLYDVVGGSVGFRRGAC 442


>gi|293332561|ref|NP_001170100.1| uncharacterized protein LOC100384018 precursor [Zea mays]
 gi|224033441|gb|ACN35796.1| unknown [Zea mays]
 gi|413944035|gb|AFW76684.1| hypothetical protein ZEAMMB73_746438 [Zea mays]
          Length = 456

 Score = 87.0 bits (214), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 60/182 (32%), Positives = 86/182 (47%), Gaps = 24/182 (13%)

Query: 14  SVDNIAIGCGHNNEGLFVG-AAGLLGLGGGSLSFPSQINASTFSYCLVDRDSDSTSTLEF 72
           +   +  GCGH N+G+F     G+ G G G  S PSQ+NA++FSYC        +S +  
Sbjct: 196 ATRRLTFGCGHFNKGVFQSNETGIAGFGRGRWSLPSQLNATSFSYCFTSMFDSKSSIVTL 255

Query: 73  DSSLPPNAV----------TAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESG 122
             +  P A+          T PL +N    + Y+L L GISVG   LP+ ET F+     
Sbjct: 256 GGA--PAALYSHAHSGEVRTTPLFKNPSQPSLYFLSLKGISVGKTRLPVPETKFR----- 308

Query: 123 NGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPT--DGVALFDTCYDFSSRSSVEVP 180
               I+DSG ++T L  E Y A++  F      L P+  +G AL D C+     +    P
Sbjct: 309 --STIIDSGASITTLPEEVYEAVKAEFA-AQVGLPPSGVEGSAL-DVCFALPVSALWRRP 364

Query: 181 TV 182
            V
Sbjct: 365 AV 366


>gi|242094478|ref|XP_002437729.1| hypothetical protein SORBIDRAFT_10g001440 [Sorghum bicolor]
 gi|241915952|gb|EER89096.1| hypothetical protein SORBIDRAFT_10g001440 [Sorghum bicolor]
          Length = 486

 Score = 87.0 bits (214), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 80/264 (30%), Positives = 116/264 (43%), Gaps = 30/264 (11%)

Query: 1   GDFVTETVTLGSA-SVDNIAIGCGHNNEGLFVGAA-GLLGLGGGSLSFPSQINAS---TF 55
           G + ++ +T+ S   V+    GC  N +G F   A G++ LG G  S  +Q +++    F
Sbjct: 238 GTYSSDVLTINSGDRVEGFRFGCSQNEQGSFENQADGIMALGRGVQSLMAQTSSTYGDAF 297

Query: 56  SYCLVDRDSDSTSTLEFDSSLPPNA----VTAPLLRNH-----ELDTFYYLGLTGISVGG 106
           SYCL   +   T+   F   +P  A    VT P+L+          T Y   L  I+V G
Sbjct: 298 SYCLPPTE---TTKGFFQIGVPIGASYRFVTTPMLKERGGASAAAATLYRALLLAITVDG 354

Query: 107 DLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTR-ALSPTDGVALF 165
             L +    F        G ++DS T +TRL    Y ALR AF    R  ++P       
Sbjct: 355 KELNVPAEVFA------AGTVMDSRTIITRLPVTAYGALRAAFRNRMRYRVAPPQ--EEL 406

Query: 166 DTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGN 225
           DTCYD +      +P ++  F    V+ +     L+    NG   FA     SS SI+GN
Sbjct: 407 DTCYDLTGVRYPRLPRIALVFDGNAVVEMDRSGILL----NGCLAFASNDDDSSPSILGN 462

Query: 226 VQQQGTRVSFNLRNSLIGFTPNKC 249
           VQQQ  +V  ++    IGF    C
Sbjct: 463 VQQQTIQVLHDVGGGRIGFRSAAC 486


>gi|383128168|gb|AFG44737.1| Pinus taeda anonymous locus CL117Contig1_03 genomic sequence
 gi|383128170|gb|AFG44738.1| Pinus taeda anonymous locus CL117Contig1_03 genomic sequence
 gi|383128172|gb|AFG44739.1| Pinus taeda anonymous locus CL117Contig1_03 genomic sequence
 gi|383128176|gb|AFG44741.1| Pinus taeda anonymous locus CL117Contig1_03 genomic sequence
 gi|383128178|gb|AFG44742.1| Pinus taeda anonymous locus CL117Contig1_03 genomic sequence
 gi|383128180|gb|AFG44743.1| Pinus taeda anonymous locus CL117Contig1_03 genomic sequence
 gi|383128182|gb|AFG44744.1| Pinus taeda anonymous locus CL117Contig1_03 genomic sequence
 gi|383128184|gb|AFG44745.1| Pinus taeda anonymous locus CL117Contig1_03 genomic sequence
 gi|383128186|gb|AFG44746.1| Pinus taeda anonymous locus CL117Contig1_03 genomic sequence
 gi|383128188|gb|AFG44747.1| Pinus taeda anonymous locus CL117Contig1_03 genomic sequence
 gi|383128190|gb|AFG44748.1| Pinus taeda anonymous locus CL117Contig1_03 genomic sequence
 gi|383128192|gb|AFG44749.1| Pinus taeda anonymous locus CL117Contig1_03 genomic sequence
 gi|383128194|gb|AFG44750.1| Pinus taeda anonymous locus CL117Contig1_03 genomic sequence
 gi|383128196|gb|AFG44751.1| Pinus taeda anonymous locus CL117Contig1_03 genomic sequence
 gi|383128198|gb|AFG44752.1| Pinus taeda anonymous locus CL117Contig1_03 genomic sequence
 gi|383128200|gb|AFG44753.1| Pinus taeda anonymous locus CL117Contig1_03 genomic sequence
          Length = 103

 Score = 87.0 bits (214), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 46/102 (45%), Positives = 61/102 (59%), Gaps = 5/102 (4%)

Query: 85  LLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNA 144
           L+ N    ++Y++ L GISVGG  L I+         G GG IVDSGT +TRL  + YNA
Sbjct: 1   LVSNSIYTSYYFVVLNGISVGGQRLSITPAVL-----GRGGTIVDSGTIITRLVPQAYNA 55

Query: 145 LRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHF 186
           L+ +F   T+ L   +  ++ DTCYD SS S V VP V+FHF
Sbjct: 56  LKTSFRSQTQNLPSAEPYSILDTCYDLSSYSQVRVPIVTFHF 97


>gi|125532792|gb|EAY79357.1| hypothetical protein OsI_34486 [Oryza sativa Indica Group]
          Length = 396

 Score = 87.0 bits (214), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 77/262 (29%), Positives = 115/262 (43%), Gaps = 23/262 (8%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNE-GLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
           G   T+ V +G+A+   +A GC   +E     G++G +GLG  +LS  +Q+NA+ FSYCL
Sbjct: 141 GRIGTDAVAIGTAATARLAFGCAVASEMDTMWGSSGSVGLGRTNLSLAAQMNATAFSYCL 200

Query: 60  VDRDSDSTSTLEFDSSLP-----PNAVTAPLLR-----NHELDTFYYLGLTGISVGGDLL 109
              D+  +S L   +S         A T P ++     +  L   Y L L  I  G    
Sbjct: 201 APPDTGKSSALFLGASAKLAGAGKGAGTTPFVKTSTPPHSGLSRSYLLRLEAIRAG---- 256

Query: 110 PISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCY 169
                   + +SGN  I+V + T VT L    Y  LR A      A      V  +D C+
Sbjct: 257 ---NATIAMPQSGN-TIMVSTATPVTALVDSVYRDLRKAVADAVGAAPVPPPVQNYDLCF 312

Query: 170 DFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAF--APTSSSLSIIGNVQ 227
             +S S    P +   F  G  + +P  +YL     N T C A   +P    +SI+G++Q
Sbjct: 313 PKASASG-GAPDLVLAFQGGAEMTVPVSSYLFDA-GNDTACVAILGSPALGGVSILGSLQ 370

Query: 228 QQGTRVSFNLRNSLIGFTPNKC 249
           Q    + F+L    + F P  C
Sbjct: 371 QVNIHLLFDLDKETLSFEPADC 392


>gi|295830689|gb|ADG39013.1| AT5G10770-like protein [Neslia paniculata]
          Length = 159

 Score = 87.0 bits (214), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 60/161 (37%), Positives = 81/161 (50%), Gaps = 10/161 (6%)

Query: 44  LSFPSQINAS---TFSYCLVDRDSDSTSTLEFDSSLPPNAVT-APLLRNHELDTFYYLGL 99
           LSFPSQ   +    FSYCL    +  T  L F S+    +V   P+    +  +FY L +
Sbjct: 1   LSFPSQTATAYNKIFSYCL-PSSASYTGHLTFGSAGISRSVKFTPISTITDGTSFYGLSI 59

Query: 100 TGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPT 159
             I+VGG  LPI  T F        G ++DSGT +TRL  + Y ALR  F         T
Sbjct: 60  VAITVGGQKLPIPSTVFSTP-----GALIDSGTVITRLPPKAYAALRSEFKAKMSKYPTT 114

Query: 160 DGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYL 200
            GV++ DTC+D S   +V +P V+F F  G V+ L +K  L
Sbjct: 115 SGVSILDTCFDLSGFKTVTIPKVAFSFSGGAVVELGSKGIL 155


>gi|357118398|ref|XP_003560942.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 478

 Score = 86.7 bits (213), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 75/260 (28%), Positives = 123/260 (47%), Gaps = 24/260 (9%)

Query: 14  SVDNIAIGCGHNNEGLFVG-AAGLLGLGGGSLSFPSQINASTFSYCLVDRDSDSTSTLEF 72
           +V N+  GCG  N+G+F    +G+ G   G +S PSQ+  + FS+C        TS +  
Sbjct: 216 AVPNVRFGCGQYNKGIFKSNESGIAGFSRGPMSLPSQLKVARFSHCFTAIADARTSPVFL 275

Query: 73  DSSLPPNAV----TAPLLRN---HELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGG 125
             +  P+ +    T P+      +   + YYL L GI+VG   LP++  AF    +G+G 
Sbjct: 276 GGAPGPDNLGAHATGPVQSTPFANSNGSLYYLTLKGITVGKTRLPLNALAFAGKGTGSGS 335

Query: 126 I--IVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVS 183
              I+DSGT +  L    Y +LR AFV   +     +  A  ++   F +  S  +P  +
Sbjct: 336 GGTIIDSGTGIRTLPGPMYRSLRAAFVARVKLPVANESAADAESTLCFEAARSASLPPEA 395

Query: 184 FHFPEGKVL--------PLPAKNYLIPV--DSNGT---FCFAF-APTSSSLSIIGNVQQQ 229
                 KV+         LP ++Y++ +  D +G+    C    +   S L+IIGN QQQ
Sbjct: 396 PAPALPKVVLHVAGADWDLPRESYVLDLLEDEDGSGSGLCLVMNSAGDSDLTIIGNFQQQ 455

Query: 230 GTRVSFNLRNSLIGFTPNKC 249
              V+++L  + + F P +C
Sbjct: 456 NMHVAYDLEKNKLVFVPARC 475


>gi|242089103|ref|XP_002440384.1| hypothetical protein SORBIDRAFT_09g030880 [Sorghum bicolor]
 gi|241945669|gb|EES18814.1| hypothetical protein SORBIDRAFT_09g030880 [Sorghum bicolor]
          Length = 555

 Score = 86.7 bits (213), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 80/266 (30%), Positives = 122/266 (45%), Gaps = 24/266 (9%)

Query: 5   TETVTLGS-ASVDNIAIGCGHNNEGLFVGAA-GLLGLGGGSLSFPSQINA-----STFSY 57
           T TV+ G  A +  + +GC     G  V A  G+L LG G +SF   I+A       FS+
Sbjct: 265 TVTVSDGRMAKLPGLVLGCSVLEAGASVDAHDGVLSLGNGHMSF--AIHAVLRFGGRFSF 322

Query: 58  CLVDRDS--DSTSTLEFD---SSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPIS 112
           CL+  +S  D++S L F    + + P  +   +L N ++   Y   +T + VGG+ L I 
Sbjct: 323 CLLSANSSRDASSYLTFGPNPAVMGPGTMETEILYNVDVKAAYGPRVTAVLVGGERLDIP 382

Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFS 172
           +  + ID+    G+I+D+ T+VT L  E Y  L  A  R    L P +  A F+ CY ++
Sbjct: 383 DDVWNIDKGLGSGVILDTSTSVTSLVPEAYEPLVAALDRHLAHL-PRESFAGFEYCYRWT 441

Query: 173 -------SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFA--PTSSSLSII 223
                     +V +P V+     G  L   AK+ ++P   +G  C AF   P      II
Sbjct: 442 FTGDGVDPAHNVTIPKVTVEMTGGARLEPEAKSVVMPEVGHGVACLAFRKLPWGGGPCII 501

Query: 224 GNVQQQGTRVSFNLRNSLIGFTPNKC 249
           GNV  Q      +   +   F  +KC
Sbjct: 502 GNVLMQEYIWEIDHSKATFRFRKDKC 527


>gi|15226315|ref|NP_180368.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|4510415|gb|AAD21501.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|330252975|gb|AEC08069.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 396

 Score = 86.7 bits (213), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 80/269 (29%), Positives = 114/269 (42%), Gaps = 36/269 (13%)

Query: 1   GDFVTETVTLGSAS-----VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS-- 53
           G   TET+TL S S     +    IGCGHNN       +G++GL  G  S  +Q+     
Sbjct: 139 GTLATETITLHSTSGEPFVMPETIIGCGHNNSWFKPSFSGMVGLNWGPSSLITQMGGEYP 198

Query: 54  -TFSYCLVDRDSDSTSTLEFDSSLPPNAVTA-------PLLRNHELDTFYYLGLTGISVG 105
              SYC        TS + F +    NA+ A        +        FYYL L  +SVG
Sbjct: 199 GLMSYCF---SGQGTSKINFGA----NAIVAGDGVVSTTMFMTTAKPGFYYLNLDAVSVG 251

Query: 106 GDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDA---FVRGTRALSPTDGV 162
              +    T F   E   G I++DSGT +T       N +R A    V   RA  PT   
Sbjct: 252 NTRIETMGTTFHALE---GNIVIDSGTTLTYFPVSYCNLVRQAVEHVVTAVRAADPTGND 308

Query: 163 ALFDTCYDFSSRSSVEV-PTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS-SSL 220
            L   CY+     ++++ P ++ HF  G  L L   N  +  ++ G FC A    S +  
Sbjct: 309 ML---CYN---SDTIDIFPVITMHFSGGVDLVLDKYNMYMESNNGGVFCLAIICNSPTQE 362

Query: 221 SIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           +I GN  Q    V ++  + L+ F+P  C
Sbjct: 363 AIFGNRAQNNFLVGYDSSSLLVSFSPTNC 391


>gi|222624645|gb|EEE58777.1| hypothetical protein OsJ_10300 [Oryza sativa Japonica Group]
          Length = 431

 Score = 86.7 bits (213), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 72/245 (29%), Positives = 104/245 (42%), Gaps = 31/245 (12%)

Query: 33  AAGLLGLGGGSLSFPSQINASTFSYCLVDRDSDSTSTLEFDSSLPPNAVTAPLLR-NHEL 91
           A GLLG+  G+LSF +Q     F+YC+   +      L  D  + P     PL+  +  L
Sbjct: 179 ATGLLGMNRGTLSFVTQTGTRRFAYCIAPGEGPGVLLLGDDGGVAPPLNYTPLIEISQPL 238

Query: 92  DTF----YYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRD 147
             F    Y + L GI VG  LLPI ++    D +G G  +VDSGT  T L  + Y AL+ 
Sbjct: 239 PYFDRVAYSVQLEGIRVGCALLPIPKSVLTPDHTGAGQTMVDSGTQFTFLLADAYAALKA 298

Query: 148 AFVRGTRALSPTDG------VALFDTCYDFSSRSSVEVPTVSFHFPE------GKVLPLP 195
            F    R L    G         FD C+         V   S   PE      G  + + 
Sbjct: 299 EFTSQARLLLAPLGEPGFVFQGAFDACF---RGPEARVAAASGLLPEVGLVLRGAEVAVS 355

Query: 196 AKN--YLIPVDSNG------TFCFAFAPT---SSSLSIIGNVQQQGTRVSFNLRNSLIGF 244
            +   Y++P +  G       +C  F  +     S  +IG+  QQ   V ++L+N  +GF
Sbjct: 356 GEKLLYMVPGERRGEGGAEAVWCLTFGNSDMAGMSAYVIGHHHQQNVWVEYDLQNGRVGF 415

Query: 245 TPNKC 249
            P +C
Sbjct: 416 APARC 420


>gi|115461432|ref|NP_001054316.1| Os04g0685200 [Oryza sativa Japonica Group]
 gi|113565887|dbj|BAF16230.1| Os04g0685200, partial [Oryza sativa Japonica Group]
          Length = 330

 Score = 86.7 bits (213), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 84/288 (29%), Positives = 128/288 (44%), Gaps = 43/288 (14%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
           G  +++T+     +V N  IGC   +  +    +GL G G G+ S PSQ+  + FSYCL+
Sbjct: 43  GLLISDTLRTPGRAVRNFVIGCSLAS--VHQPPSGLAGFGRGAPSVPSQLGLTKFSYCLL 100

Query: 61  DRDSDSTSTLEFDSSLPPNAVT--------APLLRNHE----LDTFYYLGLTGISVGGDL 108
            R  D  + +  +  L              APL R+         +YYL LT I+VGG  
Sbjct: 101 SRRFDDNAAVSGELILGGAGGKDGGVGMQYAPLARSASARPPYSVYYYLALTAITVGGKS 160

Query: 109 LPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFV-----RGTRALSPTDGVA 163
           + + E AF +     GG IVDSGT  +      +  +  A V     R +R+    +G+ 
Sbjct: 161 VQLPERAF-VAGGAGGGAIVDSGTTFSYFDRTVFEPVAAAVVAAVGGRYSRSKVVEEGLG 219

Query: 164 LFDTCYDFSS-RSSVEVPTVSFHFPEGKVLPLPAKNYLI---PVDSNG------TFCFAF 213
           L   C+       ++E+P +S HF  G V+ LP +NY +   P  S G        C A 
Sbjct: 220 L-SPCFAMPPGTKTMELPEMSLHFKGGSVMNLPVENYFVVAGPAPSGGAPAMAEAICLAV 278

Query: 214 ---APTSSSLS---------IIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
               PTSS  +         I+G+ QQQ   + ++L    +GF   +C
Sbjct: 279 VSDVPTSSGGAGVSSGGPAIILGSFQQQNYYIEYDLEKERLGFRRQQC 326


>gi|255576064|ref|XP_002528927.1| pepsin A, putative [Ricinus communis]
 gi|223531629|gb|EEF33456.1| pepsin A, putative [Ricinus communis]
          Length = 493

 Score = 86.7 bits (213), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 83/281 (29%), Positives = 112/281 (39%), Gaps = 51/281 (18%)

Query: 12  SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINA------STFSYCLVDRDSD 65
           S S+ N   GC H      VG AG    G G LS P+Q+ +      + FSYCLV    +
Sbjct: 211 SLSLHNFTFGCAHTALAEPVGVAGF---GRGVLSLPAQLASFAPQLGNRFSYCLVSHSFN 267

Query: 66  STSTLEFDSSL---------------PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP 110
           S   L   S L                   V   +L N +   FY +GL GIS+G   +P
Sbjct: 268 S-DRLRLPSPLILGHSDDKEKRVNKDDVQFVYTSMLDNPKHPYFYCVGLEGISIGKKKIP 326

Query: 111 ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGT-----RALSPTDGVALF 165
             E   ++D  G+GG++VDSGT  T L    YN++   F         RA    D   L 
Sbjct: 327 APEFLKRVDREGSGGVVVDSGTTFTMLPASLYNSVVAEFDNRVGRVYERAKEVEDKTGL- 385

Query: 166 DTCYDFSSRSSVEVPTVSFHF--PEGKVLPLPAKNYLIPVDSNG--------TFCFAFAP 215
             CY +   + V +P++  HF   E  V+ LP KNY       G          C     
Sbjct: 386 GPCYYYD--TVVNIPSLVLHFVGNESSVV-LPKKNYFYDFLDGGDGVRRKRRVGCLMLMN 442

Query: 216 TSSSLSI-------IGNVQQQGTRVSFNLRNSLIGFTPNKC 249
                 +       +GN QQ G  V ++L    +GF   KC
Sbjct: 443 GGEEAELTGGPGATLGNYQQHGFEVVYDLEQRRVGFARRKC 483


>gi|115452187|ref|NP_001049694.1| Os03g0271900 [Oryza sativa Japonica Group]
 gi|29893618|gb|AAP06872.1| hypothetical protein [Oryza sativa Japonica Group]
 gi|108707424|gb|ABF95219.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|108707425|gb|ABF95220.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113548165|dbj|BAF11608.1| Os03g0271900 [Oryza sativa Japonica Group]
 gi|215715205|dbj|BAG94956.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215737033|dbj|BAG95962.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215740994|dbj|BAG97489.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 447

 Score = 86.3 bits (212), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 72/245 (29%), Positives = 104/245 (42%), Gaps = 31/245 (12%)

Query: 33  AAGLLGLGGGSLSFPSQINASTFSYCLVDRDSDSTSTLEFDSSLPPNAVTAPLLR-NHEL 91
           A GLLG+  G+LSF +Q     F+YC+   +      L  D  + P     PL+  +  L
Sbjct: 195 ATGLLGMNRGTLSFVTQTGTRRFAYCIAPGEGPGVLLLGDDGGVAPPLNYTPLIEISQPL 254

Query: 92  DTF----YYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRD 147
             F    Y + L GI VG  LLPI ++    D +G G  +VDSGT  T L  + Y AL+ 
Sbjct: 255 PYFDRVAYSVQLEGIRVGCALLPIPKSVLTPDHTGAGQTMVDSGTQFTFLLADAYAALKA 314

Query: 148 AFVRGTRALSPTDG------VALFDTCYDFSSRSSVEVPTVSFHFPE------GKVLPLP 195
            F    R L    G         FD C+         V   S   PE      G  + + 
Sbjct: 315 EFTSQARLLLAPLGEPGFVFQGAFDACF---RGPEARVAAASGLLPEVGLVLRGAEVAVS 371

Query: 196 AKN--YLIPVDSNG------TFCFAFAPT---SSSLSIIGNVQQQGTRVSFNLRNSLIGF 244
            +   Y++P +  G       +C  F  +     S  +IG+  QQ   V ++L+N  +GF
Sbjct: 372 GEKLLYMVPGERRGEGGAEAVWCLTFGNSDMAGMSAYVIGHHHQQNVWVEYDLQNGRVGF 431

Query: 245 TPNKC 249
            P +C
Sbjct: 432 APARC 436


>gi|224140036|ref|XP_002323393.1| predicted protein [Populus trichocarpa]
 gi|222868023|gb|EEF05154.1| predicted protein [Populus trichocarpa]
          Length = 459

 Score = 86.3 bits (212), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 68/237 (28%), Positives = 96/237 (40%), Gaps = 23/237 (9%)

Query: 35  GLLGLGGGSLSFPSQINASTFSYCLVDR---DSDSTSTLEFDS------SLPPNAVTAPL 85
           G+ G G    S PSQ+    FSYCLV     D+ ++S L  D+      +        P 
Sbjct: 223 GIAGFGRSPESLPSQLGLKKFSYCLVSHAFDDTPTSSDLVLDTGSGSGVTKTAGLSHTPF 282

Query: 86  LRN--HELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYN 143
           L+N       +YY+ L  I +G   + +          GNGG IVDSGT  T ++   Y 
Sbjct: 283 LKNPTTAFRDYYYVLLRNIVIGDTHVKVPYKFLVPGTDGNGGTIVDSGTTFTFMENPVYE 342

Query: 144 ALRDAFVRGT---RALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYL 200
            +   F +        +    +     CY+ S   S+ VP + F F  G  + LP  NY 
Sbjct: 343 LVAKEFEKQMAHYTVATEIQNLTGLRPCYNISGEKSLSVPDLIFQFKGGAKMALPLSNYF 402

Query: 201 IPVDSNGTFCFAFAPTSSSLS--------IIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
             VDS G  C      + +          I+GN QQ+   V F+L N   GF    C
Sbjct: 403 SIVDS-GVICLTIVSDNVAGPGLGGGPAIILGNYQQRNFYVEFDLENEKFGFKQQSC 458


>gi|225440731|ref|XP_002280866.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 469

 Score = 86.3 bits (212), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 67/242 (27%), Positives = 102/242 (42%), Gaps = 27/242 (11%)

Query: 34  AGLLGLGGGSLSFPSQINASTFSYCLV-DRDSDSTSTLEFDSSLPPNAV----------- 81
           +G+ G G G  S P Q+    FSYCL+  R  DS  + +    + P++            
Sbjct: 228 SGIAGFGRGPSSLPKQMGLKKFSYCLLSHRFDDSPKSSKMTLYVGPDSKDDKTGGLSYTP 287

Query: 82  --TAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQT 139
               P+  N     +YY+ L  I VG   + +  +       GNGG IVDSG+  T ++ 
Sbjct: 288 FRKNPVSSNSAFKEYYYVTLRHIIVGDKRVKVPYSFMVAGSDGNGGTIVDSGSTFTFMEK 347

Query: 140 ETYNALRDAFVRG----TRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLP 195
             + A+   F R     TRA +  + ++    C++ S   SV +P++ F F  G  + LP
Sbjct: 348 PVFEAVATEFDRQMANYTRA-ADVEALSGLKPCFNLSGVGSVALPSLVFQFKGGAKMELP 406

Query: 196 AKNYLIPVDSNGTFCFAFAPTS---SSLS-----IIGNVQQQGTRVSFNLRNSLIGFTPN 247
             NY   V      C          S+LS     I+GN Q Q     ++L N   GF   
Sbjct: 407 VANYFSLVGDLSVLCLTIVSNEAVGSTLSSGPSIILGNYQSQNFYTEYDLENERFGFRRQ 466

Query: 248 KC 249
           +C
Sbjct: 467 RC 468


>gi|90399145|emb|CAJ86169.1| H0913C04.10 [Oryza sativa Indica Group]
 gi|125550292|gb|EAY96114.1| hypothetical protein OsI_17992 [Oryza sativa Indica Group]
          Length = 491

 Score = 86.3 bits (212), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 84/288 (29%), Positives = 128/288 (44%), Gaps = 43/288 (14%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
           G  +++T+     +V N  IGC  +   +    +GL G G G+ S PSQ+  + FSYCL+
Sbjct: 204 GLLISDTLRTPGRAVRNFVIGC--SLASVHQPPSGLAGFGRGAPSVPSQLGLTKFSYCLL 261

Query: 61  DRDSDSTSTLEFDSSLPPNAVT--------APLLRNHE----LDTFYYLGLTGISVGGDL 108
            R  D  + +  +  L              APL R+         +YYL LT I+VGG  
Sbjct: 262 SRRFDDNAAVSGELILGGAGGKDGGVGMQYAPLARSASARPPYSVYYYLALTAITVGGKS 321

Query: 109 LPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFV-----RGTRALSPTDGVA 163
           + + E AF +     GG IVDSGT  +      +  +  A V     R +R+    +G+ 
Sbjct: 322 VQLPERAF-VAGGAGGGAIVDSGTTFSYFDRTVFEPVAAAVVAAVGGRYSRSKVVEEGLG 380

Query: 164 LFDTCYDFSS-RSSVEVPTVSFHFPEGKVLPLPAKNYLI---PVDSNG------TFCFAF 213
           L   C+       ++E+P +S HF  G V+ LP +NY +   P  S G        C A 
Sbjct: 381 L-SPCFAMPPGTKTMELPEMSLHFKGGSVMNLPVENYFVVAGPAPSGGAPAMAEAICLAV 439

Query: 214 ---APTSSSLS---------IIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
               PTSS  +         I+G+ QQQ   + ++L    +GF   +C
Sbjct: 440 VSDVPTSSGGAGVSSGGPAIILGSFQQQNYYIEYDLEKERLGFRRQQC 487


>gi|356528627|ref|XP_003532901.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 430

 Score = 86.3 bits (212), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 73/246 (29%), Positives = 103/246 (41%), Gaps = 20/246 (8%)

Query: 11  GSASVDNIAIGCGHNNEGLF---VGAAGLLGLGGGSLSFPSQIN---ASTFSYCLVDRDS 64
           G A+      GC   +   F     A G +GLG G LS  SQ+       FSYC+V   S
Sbjct: 195 GGATFPKSVFGCAFYSNFTFKISTKANGFVGLGPGPLSLASQLGDQIGHKFSYCMVPFSS 254

Query: 65  DSTSTLEFDSSLPPN-AVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGN 123
            ST  L+F S  P N  V+ P + N    ++Y L L GI+VG   +   +          
Sbjct: 255 TSTGKLKFGSMAPTNEVVSTPFMINPSYPSYYVLNLEGITVGQKKVLTGQIG-------- 306

Query: 124 GGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVS 183
           G II+DS   +T L+   Y     +           D    F+ C    + +++  P   
Sbjct: 307 GNIIIDSVPILTHLEQGIYTDFISSVKEAINVEVAEDAPTPFEYC--VRNPTNLNFPEFV 364

Query: 184 FHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNSLIG 243
           FHF    V+ L  KN  I +D+N   C    P S  +SI GN  Q   +V ++L    + 
Sbjct: 365 FHFTGADVV-LGPKNMFIALDNN-LVCMTVVP-SKGISIFGNWAQVNFQVEYDLGEKKVS 421

Query: 244 FTPNKC 249
           F P  C
Sbjct: 422 FAPTNC 427


>gi|32488713|emb|CAE03456.1| OSJNBa0088H09.14 [Oryza sativa Japonica Group]
          Length = 490

 Score = 85.9 bits (211), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 84/288 (29%), Positives = 128/288 (44%), Gaps = 43/288 (14%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
           G  +++T+     +V N  IGC  +   +    +GL G G G+ S PSQ+  + FSYCL+
Sbjct: 203 GLLISDTLRTPGRAVRNFVIGC--SLASVHQPPSGLAGFGRGAPSVPSQLGLTKFSYCLL 260

Query: 61  DRDSDSTSTLEFDSSLPPNAVT--------APLLRNHE----LDTFYYLGLTGISVGGDL 108
            R  D  + +  +  L              APL R+         +YYL LT I+VGG  
Sbjct: 261 SRRFDDNAAVSGELILGGAGGKDGGVGMQYAPLARSASARPPYSVYYYLALTAITVGGKS 320

Query: 109 LPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFV-----RGTRALSPTDGVA 163
           + + E AF +     GG IVDSGT  +      +  +  A V     R +R+    +G+ 
Sbjct: 321 VQLPERAF-VAGGAGGGAIVDSGTTFSYFDRTVFEPVAAAVVAAVGGRYSRSKVVEEGLG 379

Query: 164 LFDTCYDFSS-RSSVEVPTVSFHFPEGKVLPLPAKNYLI---PVDSNG------TFCFAF 213
           L   C+       ++E+P +S HF  G V+ LP +NY +   P  S G        C A 
Sbjct: 380 L-SPCFAMPPGTKTMELPEMSLHFKGGSVMNLPVENYFVVAGPAPSGGAPAMAEAICLAV 438

Query: 214 ---APTSSSLS---------IIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
               PTSS  +         I+G+ QQQ   + ++L    +GF   +C
Sbjct: 439 VSDVPTSSGGAGVSSGGPAIILGSFQQQNYYIEYDLEKERLGFRRQQC 486


>gi|255587337|ref|XP_002534234.1| pepsin A, putative [Ricinus communis]
 gi|223525662|gb|EEF28148.1| pepsin A, putative [Ricinus communis]
          Length = 468

 Score = 85.9 bits (211), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 74/280 (26%), Positives = 106/280 (37%), Gaps = 35/280 (12%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
           G  ++ET+   + ++ +   GC   +        G+ G G    S P Q+    FSYCLV
Sbjct: 192 GLLLSETINFPNKTISDFLAGCSLLSTR---QPEGIAGFGRSQESLPLQLGLKKFSYCLV 248

Query: 61  DR---DSDSTSTLEFDS------------SLPPNAVTAPLLRNHELDTFYYLGLTGISVG 105
            R   DS  +S L  D             S  P         N     +YY+ L  I VG
Sbjct: 249 SRRFDDSPVSSDLILDMGPSTSDSKTTGLSYTPFQKNLASQSNPAFQEYYYVMLRKIIVG 308

Query: 106 GDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALF 165
              + +  +       GNGG IVDSG+  T ++   +  L   F +     +    V   
Sbjct: 309 KTHVKVPYSFLVPGSDGNGGTIVDSGSTFTFVEGHVFELLAKEFEKQMANYTVATNVQKL 368

Query: 166 ---DTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP------- 215
                C+D S   SV +P ++F F  G  + LP  NY   VD  G  C            
Sbjct: 369 TGLRPCFDISGEKSVVIPDLTFQFKGGAKMQLPLSNYFAFVDM-GVVCLTIVSDNAAALG 427

Query: 216 ------TSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
                 +S    I+GN QQQ   + ++L N   GF    C
Sbjct: 428 GDGGVRSSGPAIILGNFQQQNFYIEYDLENDRFGFKEQSC 467


>gi|125543284|gb|EAY89423.1| hypothetical protein OsI_10930 [Oryza sativa Indica Group]
          Length = 447

 Score = 85.9 bits (211), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 72/242 (29%), Positives = 105/242 (43%), Gaps = 25/242 (10%)

Query: 33  AAGLLGLGGGSLSFPSQINASTFSYCLVDRDSDSTSTLEFDSSLPPNAVTAPLLR-NHEL 91
           A GLLG+  G+LSF +Q     F+YC+   +      L  D  + P     PL+  +  L
Sbjct: 195 ATGLLGMNRGTLSFVTQTGTRRFAYCIAPGEGPGVLLLGDDGGVAPPLNYTPLIEISQPL 254

Query: 92  DTF----YYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRD 147
             F    Y + L GI VG  LLPI ++    D +G G  +VDSGT  T L  + Y AL+ 
Sbjct: 255 PYFDRVAYSVQLEGIRVGCALLPIPKSVLTPDHTGAGQTMVDSGTQFTFLLADAYAALKA 314

Query: 148 AFVRGTRALSPTDG------VALFDTCYDFS----SRSSVEVPTVSFHFPEGKVLPLPAK 197
            F    R L    G         FD C+       + +S  +P V       +V     K
Sbjct: 315 EFTSQARLLLAPLGEPGFVFQGAFDACFRGPEARVAAASGLLPVVGLVLRGAEVAVSGEK 374

Query: 198 -NYLIPVDSNG------TFCFAFAPT---SSSLSIIGNVQQQGTRVSFNLRNSLIGFTPN 247
             Y++P +  G       +C  F  +     S  +IG+  QQ   V ++L+N  +GF P 
Sbjct: 375 LLYMVPGERRGEGGAEAVWCLTFGNSDMAGMSAYVIGHHHQQNVWVEYDLQNGRVGFAPA 434

Query: 248 KC 249
           +C
Sbjct: 435 RC 436


>gi|297805036|ref|XP_002870402.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297316238|gb|EFH46661.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 435

 Score = 85.9 bits (211), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 79/261 (30%), Positives = 117/261 (44%), Gaps = 24/261 (9%)

Query: 1   GDFVTETVTLGS-----ASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLS-FPSQINAS- 53
           G F  +T+TLGS       + NI IGCG NN   F   +  +   GG       Q+  S 
Sbjct: 184 GKFAVDTLTLGSTDNRPVQLKNIIIGCGQNNAVTFRNKSSGVVGLGGGAVSLIKQLGDSI 243

Query: 54  --TFSYCLVDRDSDSTSTLEFDSSLP---PNAVTAPLLRNHELDTFYYLGLTGISVGGDL 108
              FSYCLV  ++D TS + F ++     P  V+ PL+     DTFYYL L  ISVG   
Sbjct: 244 DGKFSYCLVP-ENDQTSKINFGTNAVVSGPGTVSTPLVVKSR-DTFYYLTLKSISVGSKN 301

Query: 109 LPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTC 168
           +   ++  K      G +++DSGT +T L  + Y  + +A      A    D       C
Sbjct: 302 MQTPDSNIK------GNMVIDSGTTLTLLPVKYYIEIENAVASLINADKSKDERIGSSLC 355

Query: 169 YDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQ 228
           Y+  + + + +P ++ HF EG  + L   N    V +    C AF  +     I GNV Q
Sbjct: 356 YN--ATADLNIPVITMHF-EGADVKLYPYNSFFKV-TEDLVCLAFGMSFYRNGIYGNVAQ 411

Query: 229 QGTRVSFNLRNSLIGFTPNKC 249
           +   V ++  +  + F P  C
Sbjct: 412 KNFLVGYDTASKTMSFKPTDC 432


>gi|295830681|gb|ADG39009.1| AT5G10770-like protein [Capsella grandiflora]
 gi|295830683|gb|ADG39010.1| AT5G10770-like protein [Capsella grandiflora]
 gi|295830685|gb|ADG39011.1| AT5G10770-like protein [Capsella grandiflora]
 gi|295830687|gb|ADG39012.1| AT5G10770-like protein [Capsella grandiflora]
          Length = 159

 Score = 85.9 bits (211), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 58/158 (36%), Positives = 81/158 (51%), Gaps = 10/158 (6%)

Query: 44  LSFPSQINAS---TFSYCLVDRDSDSTSTLEFDSSLPPNAVT-APLLRNHELDTFYYLGL 99
           LSFPSQ   +    FSYCL    +  T  L F S+    +V   P+    + ++FY L +
Sbjct: 1   LSFPSQTATAYNKIFSYCL-PSSASYTGHLTFGSAGISRSVKFTPIATISDGNSFYGLNI 59

Query: 100 TGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPT 159
            GI+VGG  L I  T F        G ++DSGT +TRL  + Y ALR +F          
Sbjct: 60  VGITVGGQKLAIPSTVFSTP-----GALIDSGTVITRLPPKAYAALRSSFKAQMSKYPTA 114

Query: 160 DGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAK 197
            GV++ DTC+D S   +V +P V+F F  G V+ L +K
Sbjct: 115 SGVSILDTCFDLSGFKTVTIPKVAFSFSGGAVVELGSK 152


>gi|295830679|gb|ADG39008.1| AT5G10770-like protein [Capsella grandiflora]
          Length = 159

 Score = 85.9 bits (211), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 58/158 (36%), Positives = 81/158 (51%), Gaps = 10/158 (6%)

Query: 44  LSFPSQINAS---TFSYCLVDRDSDSTSTLEFDSSLPPNAVT-APLLRNHELDTFYYLGL 99
           LSFPSQ   +    FSYCL    +  T  L F S+    +V   P+    + ++FY L +
Sbjct: 1   LSFPSQTATAYNKIFSYCL-PSSASYTGHLTFGSAGISRSVKFTPIXTISDGNSFYGLNI 59

Query: 100 TGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPT 159
            GI+VGG  L I  T F        G ++DSGT +TRL  + Y ALR +F          
Sbjct: 60  VGITVGGQKLAIPSTVFSTP-----GALIDSGTVITRLPPKAYAALRSSFKAQMSKYPTA 114

Query: 160 DGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAK 197
            GV++ DTC+D S   +V +P V+F F  G V+ L +K
Sbjct: 115 SGVSILDTCFDLSGFKTVTIPKVAFSFSGGAVVELGSK 152


>gi|194689804|gb|ACF78986.1| unknown [Zea mays]
          Length = 158

 Score = 85.9 bits (211), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 60/158 (37%), Positives = 86/158 (54%), Gaps = 8/158 (5%)

Query: 92  DTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR 151
           D+ Y++ +TGI V G  L +S +A+    +     I+DSGT +TRL T  Y+AL  A   
Sbjct: 8   DSLYFIKMTGIKVAGKPLSVSSSAYSSLPT-----IIDSGTVITRLPTGVYSALSKAVAG 62

Query: 152 GTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCF 211
             +        ++ DTC+     + + VP V+  F  G  L L A+N L+ VDS  T C 
Sbjct: 63  AMKGTPRASAFSILDTCFQ-GQAARLRVPEVTMAFAGGAALKLAARNLLVDVDS-ATTCL 120

Query: 212 AFAPTSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           AFAP  S+ +IIGN QQQ   V ++++NS IGF    C
Sbjct: 121 AFAPARSA-AIIGNTQQQTFSVVYDVKNSKIGFAAGGC 157


>gi|297827577|ref|XP_002881671.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297327510|gb|EFH57930.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 438

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 85/281 (30%), Positives = 124/281 (44%), Gaps = 36/281 (12%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLF------VGAAGLLGLGGGSLSFPSQINAST 54
           G+   +T  +GS +      GC   + GL         + GL+G+  GSLSF +Q+  S 
Sbjct: 151 GNLAHDTFVIGSVTRPGTLFGC--MDSGLSSDSEEDAKSTGLMGMNRGSLSFVNQLGFSK 208

Query: 55  FSYCLVDRDSDSTSTL--EFDSSLPPNAVTAPLLRNHELDTF----YYLGLTGISVGGDL 108
           FSYC+   DS     L     S L P   T  +L+   L  F    Y + L GI VG  +
Sbjct: 209 FSYCISGSDSSGILLLGDASYSWLGPIQYTPLVLQTTPLPYFDRVAYTVQLEGIRVGSKI 268

Query: 109 LPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRA-LSPTDG-----V 162
           L + ++ F  D +G G  +VDSGT  T L    Y AL++ F+  T++ L   D       
Sbjct: 269 LSLPKSVFVPDHTGAGQTMVDSGTQFTFLMGPVYTALKNEFIAQTKSVLRIVDDPNFVFQ 328

Query: 163 ALFDTCYDFSSRSS---VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGT------FCFAF 213
              D CY   S +      +P +S  F  G  + +  +  L  V+  G+      +CF F
Sbjct: 329 GTMDLCYRVGSSTRPNFTGLPVISLMF-RGAEMSVSGQKLLYRVNGAGSEGKEEVYCFTF 387

Query: 214 APTSSSLSI----IGNVQQQGTRVSFNLRNSLIGFTPN-KC 249
              S  L I    IG+  QQ   + F+L  S +GF  N +C
Sbjct: 388 G-NSDLLGIEAFVIGHHHQQNVWMEFDLAKSRVGFAGNVRC 427


>gi|222629809|gb|EEE61941.1| hypothetical protein OsJ_16693 [Oryza sativa Japonica Group]
          Length = 648

 Score = 85.5 bits (210), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 84/288 (29%), Positives = 128/288 (44%), Gaps = 43/288 (14%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
           G  +++T+     +V N  IGC  +   +    +GL G G G+ S PSQ+  + FSYCL+
Sbjct: 204 GLLISDTLRTPGRAVRNFVIGC--SLASVHQPPSGLAGFGRGAPSVPSQLGLTKFSYCLL 261

Query: 61  DRDSDSTSTLEFDSSLPPNAVT--------APLLRNHE----LDTFYYLGLTGISVGGDL 108
            R  D  + +  +  L              APL R+         +YYL LT I+VGG  
Sbjct: 262 SRRFDDNAAVSGELILGGAGGKDGGVGMQYAPLARSASARPPYSVYYYLALTAITVGGKS 321

Query: 109 LPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFV-----RGTRALSPTDGVA 163
           + + E AF +     GG IVDSGT  +      +  +  A V     R +R+    +G+ 
Sbjct: 322 VQLPERAF-VAGGAGGGAIVDSGTTFSYFDRTVFEPVAAAVVAAVGGRYSRSKVVEEGLG 380

Query: 164 LFDTCYDFSS-RSSVEVPTVSFHFPEGKVLPLPAKNYLI---PVDSNGTFCFAFA----- 214
           L   C+       ++E+P +S HF  G V+ LP +NY +   P  S G    A A     
Sbjct: 381 L-SPCFAMPPGTKTMELPEMSLHFKGGSVMNLPVENYFVVAGPAPSGGAPAMAEAICLAV 439

Query: 215 ----PTSSSLS---------IIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
               PTSS  +         I+G+ QQQ   + ++L    +GF   +C
Sbjct: 440 VSDVPTSSGGAGVSSGGPAIILGSFQQQNYYIEYDLEKERLGFRRQQC 487


>gi|356546378|ref|XP_003541603.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 439

 Score = 85.5 bits (210), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 87/265 (32%), Positives = 126/265 (47%), Gaps = 23/265 (8%)

Query: 1   GDFVTETVTLGSASVDNI-----AIGCGHNNEGLFVGAAGLLGLGGGS----LSFPSQIN 51
           GD   ET+TLGS    ++      IGCGHNN G F      +   GG     +S  S   
Sbjct: 179 GDLSVETLTLGSTDGSSVHFPKTVIGCGHNNGGTFQEEGSGIVGLGGGPVSLISQLSSSI 238

Query: 52  ASTFSYCL--VDRDSDSTSTLEF-DSSLPPN--AVTAPL--LRNHELDTFYYLGLTGISV 104
              FSYCL  +  +S+S+S L F D+++      V+ PL  L       FY+L L   SV
Sbjct: 239 GGKFSYCLAPIFSESNSSSKLNFGDAAVVSGRGTVSTPLDPLNGQ---VFYFLTLEAFSV 295

Query: 105 GGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVAL 164
           G + +  S ++     SG+G II+DSGT +T L  E Y  L  A     +     D   L
Sbjct: 296 GDNRIEFSGSSSSGSGSGDGNIIIDSGTTLTLLPQEDYLNLESAVSDVIKLERARDPSKL 355

Query: 165 FDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIG 224
              CY  +S   +++P ++ HF +G  + L   +  +PV+  G  CFAF  +S   +I G
Sbjct: 356 LSLCYKTTS-DELDLPVITAHF-KGADVELNPISTFVPVE-KGVVCFAFI-SSKIGAIFG 411

Query: 225 NVQQQGTRVSFNLRNSLIGFTPNKC 249
           N+ QQ   V ++L    + F P  C
Sbjct: 412 NLAQQNLLVGYDLVKKTVSFKPTDC 436


>gi|296085498|emb|CBI29230.3| unnamed protein product [Vitis vinifera]
          Length = 542

 Score = 85.5 bits (210), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 77/254 (30%), Positives = 115/254 (45%), Gaps = 42/254 (16%)

Query: 1   GDFVTETVTLGS-----ASVDNIAIGCGHNNEGLF-VGAAGLLGLGGGSLSFPSQINAST 54
           G+  +ET+T+ S      S    A GCGH++ G+F   ++G++GLGGG LS  SQ+ ++ 
Sbjct: 181 GNLASETLTVDSTAGKPVSFPGFAFGCGHSSGGIFDKSSSGIVGLGGGELSLISQLKSTI 240

Query: 55  ---FSYCL--VDRDSDSTSTLEFDSSLPPNA---VTAPLLRNHELDTFYYLGLTGISVGG 106
              FSYCL  V  DS  +S + F +S   +    V+ PL           L   G S   
Sbjct: 241 NGLFSYCLLPVSTDSSISSRINFGASGRVSGYGTVSTPL----------RLPYKGYS--- 287

Query: 107 DLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFD 166
                     K  E   G IIVDSGT  T L  E Y+ L  +     +     D   +F 
Sbjct: 288 ----------KKTEVEEGNIIVDSGTTYTFLPQEFYSKLEKSVANSIKGKRVRDPNGIFS 337

Query: 167 TCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNV 226
            CY+  + + +  P ++ HF +  V   P   ++   +     CF  APT S + ++GN+
Sbjct: 338 LCYN--TTAEINAPIITAHFKDANVELQPLNTFMRMQED--LVCFTVAPT-SDIGVLGNL 392

Query: 227 QQQGTRVSFNLRNS 240
            Q    V F+LR  
Sbjct: 393 AQVNFLVGFDLRKK 406



 Score = 57.4 bits (137), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 39/133 (29%), Positives = 58/133 (43%), Gaps = 4/133 (3%)

Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSS 176
           K  E   G IIVDSGT  T L  E Y  L ++     +     D   +   CY+ ++   
Sbjct: 411 KKAEVEEGNIIVDSGTTYTYLPLEFYVKLEESVAHSIKGKRVRDPNGISSLCYN-TTVDQ 469

Query: 177 VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFN 236
           ++ P ++ HF +  V   P   +L   +     CF   PT S + I+GN+ Q    V F+
Sbjct: 470 IDAPIITAHFKDANVELQPWNTFLRMQED--LVCFTVLPT-SDIGILGNLAQVNFLVGFD 526

Query: 237 LRNSLIGFTPNKC 249
           LR   + F    C
Sbjct: 527 LRKKRVSFKAADC 539


>gi|356539818|ref|XP_003538390.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 457

 Score = 85.5 bits (210), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 82/274 (29%), Positives = 120/274 (43%), Gaps = 36/274 (13%)

Query: 1   GDFVTETVTLG-SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
           G+ V E  T   S     + +GC   +        G+LG+  G LSF SQ   + FSYC+
Sbjct: 190 GNLVREKFTFSRSLFTPPLILGCATES----TDPRGILGMNRGRLSFASQSKITKFSYCV 245

Query: 60  VDRDSDS--TSTLEFDSSLPPNAVTAPLLRNHELDTF-------------YYLGLTGISV 104
             R +    T T  F     PN+ T    R  E+ TF             Y + L GI +
Sbjct: 246 PTRVTRPGYTPTGSFYLGHNPNSNT---FRYIEMLTFARSQRMPNLDPLAYTVALQGIRI 302

Query: 105 GGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR--GTRALSPTDGV 162
           GG  L IS   F+ D  G+G  ++DSG+  T L  E Y+ +R   VR  G R        
Sbjct: 303 GGRKLNISPAVFRADAGGSGQTMLDSGSEFTYLVNEAYDKVRAEVVRAVGPRMKKGYVYG 362

Query: 163 ALFDTCYDFSSRSSVEVPT----VSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS- 217
            + D C+D    +++E+      + F F +G  + +P +  L  V+  G  C   A +  
Sbjct: 363 GVADMCFD---GNAIEIGRLIGDMVFEFEKGVQIVVPKERVLATVEG-GVHCIGIANSDK 418

Query: 218 --SSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
             ++ +IIGN  QQ   V F+L N  +GF    C
Sbjct: 419 LGAASNIIGNFHQQNLWVEFDLVNRRMGFGTADC 452


>gi|345292859|gb|AEN82921.1| AT5G10770-like protein, partial [Capsella rubella]
 gi|345292861|gb|AEN82922.1| AT5G10770-like protein, partial [Capsella rubella]
 gi|345292863|gb|AEN82923.1| AT5G10770-like protein, partial [Capsella rubella]
 gi|345292865|gb|AEN82924.1| AT5G10770-like protein, partial [Capsella rubella]
 gi|345292867|gb|AEN82925.1| AT5G10770-like protein, partial [Capsella rubella]
 gi|345292869|gb|AEN82926.1| AT5G10770-like protein, partial [Capsella rubella]
 gi|345292871|gb|AEN82927.1| AT5G10770-like protein, partial [Capsella rubella]
 gi|345292873|gb|AEN82928.1| AT5G10770-like protein, partial [Capsella rubella]
          Length = 161

 Score = 85.5 bits (210), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 58/158 (36%), Positives = 81/158 (51%), Gaps = 10/158 (6%)

Query: 44  LSFPSQINAS---TFSYCLVDRDSDSTSTLEFDSSLPPNAVT-APLLRNHELDTFYYLGL 99
           LSFPSQ   +    FSYCL    +  T  L F S+    +V   P+    + ++FY L +
Sbjct: 1   LSFPSQTATAYNKIFSYCL-PSSASYTGHLTFGSAGISRSVKFTPISTISDGNSFYGLNI 59

Query: 100 TGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPT 159
            GI+VGG  L I  T F        G ++DSGT +TRL  + Y ALR +F          
Sbjct: 60  VGITVGGQKLAIPSTVFSTP-----GALIDSGTVITRLPPKAYAALRSSFKAQMSKYPTA 114

Query: 160 DGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAK 197
            GV++ DTC+D S   +V +P V+F F  G V+ L +K
Sbjct: 115 SGVSILDTCFDLSGFKTVTIPKVAFSFSGGAVVELGSK 152


>gi|242076594|ref|XP_002448233.1| hypothetical protein SORBIDRAFT_06g023740 [Sorghum bicolor]
 gi|241939416|gb|EES12561.1| hypothetical protein SORBIDRAFT_06g023740 [Sorghum bicolor]
          Length = 508

 Score = 85.1 bits (209), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 80/280 (28%), Positives = 113/280 (40%), Gaps = 50/280 (17%)

Query: 12  SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN---ASTFSYCLVDRDSDSTS 68
           S +VDN    C H   G  VG AG    G G LS P Q+    +  FSYCLV     +  
Sbjct: 227 SVAVDNFTFACAHTALGEPVGVAGF---GRGPLSLPGQLAPQLSGRFSYCLVSHSFRADR 283

Query: 69  TLEFDSSL---PPNA-------VTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKI 118
            +     +    P+A       V  PLL N +   FY + L  +SVG   +       ++
Sbjct: 284 LIRPSPLILGRSPDAAAETGGFVYTPLLHNPKHPYFYSVALEAVSVGATRIQARPELARV 343

Query: 119 DESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGT---------RALSPTDGVALFDTCY 169
           D +GNGG++VDSGT  T L  ETY  + +AF R           RA   T        CY
Sbjct: 344 DRAGNGGMVVDSGTTFTMLPNETYARVAEAFARAMAAAGFARAERAEEQTG----LTPCY 399

Query: 170 DFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDS----------NGTFCFAF------ 213
            +++ S   VP ++ HF     + LP +NY +   S          +   C         
Sbjct: 400 HYAA-SDRGVPPLALHFRGNATVALPRRNYFMGFKSEEEAGGAGRKDDVGCLMLMNGGDV 458

Query: 214 ----APTSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
                        +GN QQQG  V +++    +GF   +C
Sbjct: 459 SGEDGGDDGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRC 498


>gi|449516339|ref|XP_004165204.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 456

 Score = 85.1 bits (209), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 80/261 (30%), Positives = 116/261 (44%), Gaps = 28/261 (10%)

Query: 6   ETVTLGSASVDNIAIGCGH-----NNEGLFVGAAGLLGLGGG-SLSFPSQINASTFSYCL 59
           ET+  G     NI  GCGH     NN+  +    G+ GLG    ++  +Q+  + FSYC+
Sbjct: 202 ETLDEGKIKKSNITFGCGHMNIKTNNDDAY---NGVFGLGAYPHITMATQL-GNKFSYCI 257

Query: 60  VDRD----SDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETA 115
            D +    + +   L   S +  ++    +   H     YY+ L  ISVG   L I   A
Sbjct: 258 GDINNPLYTHNHLVLGQGSYIEGDSTPLQIHFGH-----YYVTLQSISVGSKTLKIDPNA 312

Query: 116 FKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFV---RGTRALSPTDGVALFDTCYD-F 171
           FKI   G+GG+++DSG   T+L    +  L D  V   +G     PT        C+   
Sbjct: 313 FKISSDGSGGVLIDSGMTYTKLANGGFELLYDEIVDLMKGLLERIPTQR-KFEGLCFKGV 371

Query: 172 SSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS---SLSIIGNVQQ 228
            SR  V  P V+FHF  G  L L + + L        FC A  P++S   +LS+IG + Q
Sbjct: 372 VSRDLVGFPAVTFHFAGGADLVLESGS-LFRQHGGDRFCLAILPSNSELLNLSVIGILAQ 430

Query: 229 QGTRVSFNLRNSLIGFTPNKC 249
           Q   V F+L    + F    C
Sbjct: 431 QNYNVGFDLEQMKVFFRRIDC 451


>gi|357481191|ref|XP_003610881.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355512216|gb|AES93839.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 427

 Score = 84.7 bits (208), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 86/265 (32%), Positives = 123/265 (46%), Gaps = 25/265 (9%)

Query: 1   GDFVTETVTLGSASVD-----NIAIGCGHNNEGLF-VGAAGLLGLGGGSLSFPSQIN--- 51
           G    E +T  S   D     +I  GCGH+N G F     G++G+GGG LS  SQI    
Sbjct: 169 GVLAREAITFSSTDGDPVVVGDIIFGCGHSNSGTFNENDMGIIGMGGGPLSLVSQIGTLY 228

Query: 52  -ASTFSYCLV--DRDSDSTSTLEF--DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGG 106
            +  FS CLV    D+ ++ T+ F  +S +    V    L + E  T Y + L GISVG 
Sbjct: 229 GSKRFSQCLVPFHTDAHTSGTINFGEESDVSGEGVVTTPLASEEGQTSYLVTLEGISVGD 288

Query: 107 DLLPI--SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVAL 164
             +    SET  K      G I++DSGT  T +  E Y  L +  ++   +L P +    
Sbjct: 289 TFVRFNSSETLSK------GNIMIDSGTPATYIPQEFYERLVEE-LKVQSSLLPIEDDPD 341

Query: 165 FDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIG 224
             T   + S +++E P ++ HF    V  LP + ++ P D  G FCFA A ++    I G
Sbjct: 342 LGTQLCYRSETNLEGPILTAHFEGADVQLLPIQTFIPPKD--GVFCFAMAGSTDGDYIFG 399

Query: 225 NVQQQGTRVSFNLRNSLIGFTPNKC 249
           N  Q    + F+L    I F P  C
Sbjct: 400 NFAQSNILMGFDLDRKTISFKPTDC 424


>gi|147789749|emb|CAN67405.1| hypothetical protein VITISV_025616 [Vitis vinifera]
          Length = 609

 Score = 84.7 bits (208), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 67/242 (27%), Positives = 101/242 (41%), Gaps = 27/242 (11%)

Query: 34  AGLLGLGGGSLSFPSQINASTFSYCLV-DRDSDSTSTLEFDSSLPPNAV----------- 81
           +G+ G G G  S P Q+    FSYCL+  R  DS  + +    + P++            
Sbjct: 228 SGIAGFGRGPSSLPKQMGLKKFSYCLLSHRFDDSPKSSKMTLYVGPDSKDDKTGGLSYTP 287

Query: 82  --TAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQT 139
               P+  N     +YY+ L  I VG   +    +       GNGG IVDSG+  T ++ 
Sbjct: 288 FRKNPVSSNSAFKEYYYVTLRHIIVGDKRVKXPYSFMVAGSDGNGGTIVDSGSTFTFMEK 347

Query: 140 ETYNALRDAFVRG----TRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLP 195
             + A+   F R     TRA +  + ++    C++ S   SV +P++ F F  G  + LP
Sbjct: 348 PVFEAVATEFDRQMANYTRA-ADVEALSGLKPCFNLSGVGSVALPSLVFQFKGGAKMELP 406

Query: 196 AKNYLIPVDSNGTFCFAFAPTS---SSLS-----IIGNVQQQGTRVSFNLRNSLIGFTPN 247
             NY   V      C          S+LS     I+GN Q Q     ++L N   GF   
Sbjct: 407 VANYFSLVGDLSVLCLTIVSNEAVGSTLSSGPSIILGNYQSQNFYTEYDLENERFGFRRQ 466

Query: 248 KC 249
           +C
Sbjct: 467 RC 468


>gi|356505293|ref|XP_003521426.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 499

 Score = 84.7 bits (208), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 76/260 (29%), Positives = 112/260 (43%), Gaps = 23/260 (8%)

Query: 6   ETVTLGSASVDN----IAIGCGHNNEGLFV----GAAGLLGLGGGSLSFPSQINA----- 52
           +TV LG + V N    I  GC     G          G+ G G G+LS  SQ+++     
Sbjct: 190 DTVLLGQSMVANSSSTIVFGCSTYQSGDLTKTDKAVDGIFGFGPGALSVISQLSSRGVTP 249

Query: 53  STFSYCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPIS 112
             FS+CL   + +    L     L P+ V +PL+ +      Y L L  I+V G LLPI 
Sbjct: 250 KVFSHCLKGGE-NGGGVLVLGEILEPSIVYSPLVPSLP---HYNLNLQSIAVNGQLLPID 305

Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFS 172
              F    + N G IVDSGT +  L  E YN   DA        S    ++  + CY  S
Sbjct: 306 SNVFA--TTNNQGTIVDSGTTLAYLVQEAYNPFVDAITAAVSQFSKPI-ISKGNQCYLVS 362

Query: 173 SRSSVEVPTVSFHFPEGKVLPLPAKNYLIP---VDSNGTFCFAFAPTSSSLSIIGNVQQQ 229
           +      P VS +F  G  + L  ++YL+    +DS   +C  F       +I+G++  +
Sbjct: 363 NSVGDIFPQVSLNFMGGASMVLNPEHYLMHYGFLDSAAMWCIGFQKVERGFTILGDLVLK 422

Query: 230 GTRVSFNLRNSLIGFTPNKC 249
                ++L N  IG+    C
Sbjct: 423 DKIFVYDLANQRIGWADYNC 442


>gi|194708432|gb|ACF88300.1| unknown [Zea mays]
          Length = 452

 Score = 84.7 bits (208), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 81/279 (29%), Positives = 123/279 (44%), Gaps = 34/279 (12%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
           G  + +T+     +V    +GC  +   +    +GL G G G+ S P+Q+    FSYCL+
Sbjct: 173 GLLIADTLRAPGRAVPGFVLGC--SLVSVHQPPSGLAGFGRGAPSVPAQLGLPKFSYCLL 230

Query: 61  DRDSDSTSTLEFDSSLPPNAVTA-----PLLRNHELDT-----FYYLGLTGISVGGDLLP 110
            R  D  + +     L            PL+++   D      +YYL L G++VGG  + 
Sbjct: 231 SRRFDDNAAVSGSLVLGGTGGGEGMQYVPLVKSAAGDKLPYGVYYYLALRGVTVGGKAVR 290

Query: 111 ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFV-----RGTRALSPTDGVALF 165
           +   AF  + +G+GG IVDSGT  T L    +  + DA V     R  R+    D + L 
Sbjct: 291 LPARAFAANAAGSGGTIVDSGTTFTYLDPTVFQPVADAVVAAVGGRYKRSKDAEDELGL- 349

Query: 166 DTCYDFSSRS-SVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNG---TFCFAFAPTSSSLS 221
             C+     + S+ +P +SFHF  G V+ LP +NY + V   G     C A     S  S
Sbjct: 350 HPCFALPQGARSMALPELSFHFEGGAVMQLPVENYFV-VAGRGAVEAICLAVVTDFSGGS 408

Query: 222 -----------IIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
                      I+G+ QQQ   V ++L    +GF    C
Sbjct: 409 GAGNEGSGPAIILGSFQQQNYLVEYDLEKERLGFRRQSC 447


>gi|316927704|gb|ADU58605.1| xyloglucan-specific endoglucanase inhibitor 4 [Solanum tuberosum]
          Length = 440

 Score = 84.7 bits (208), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 77/271 (28%), Positives = 118/271 (43%), Gaps = 42/271 (15%)

Query: 14  SVDNIAIGCGHNN--EGLFVGAAGLLGLGGGSLSFPSQI-NA----STFSYCLVDRDSDS 66
           S + +   C  ++  EGL  G  G+LGLG G + FP+Q+ NA      F+ CL    +  
Sbjct: 154 STNGVVFDCAPHSLLEGLAKGVKGILGLGNGYVGFPTQLANAFSVPRKFAICLTSSTTSR 213

Query: 67  TSTLEFDSS---LP-----PNAVTAPLLRNH----------ELDTFYYLGLTGISVGGDL 108
                 DS    LP        V  PLL+N           E  T Y++G+T I + G++
Sbjct: 214 GVIFFGDSPYVFLPGMDVSKRLVYTPLLKNPVSTSGSYFEGEPSTDYFIGVTSIKINGNV 273

Query: 109 LPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTC 168
           +PI+ T   I + G GG  + +    T+L+T  YNAL  AFV+    +     VA F  C
Sbjct: 274 VPINTTLLNITKDGKGGTKISTVDPYTKLETSIYNALTKAFVKSLAKVPRVKPVAPFKVC 333

Query: 169 YDFSSRSSVE----VPTVSFHFPEGKV---LPLPAKNYLIPVDSNGTFCFA-------FA 214
           Y+ +S  S      VP +              +   N ++ ++ N   C         F 
Sbjct: 334 YNRTSLGSTRVGRGVPPIELVLGNKNATTSWTIWGVNSMVAMN-NDVLCLGFLDGGVEFE 392

Query: 215 PTSSSLSIIGNVQQQGTRVSFNLRNSLIGFT 245
           PT+S   +IG  Q +   + F++ N  +GFT
Sbjct: 393 PTTS--IVIGAHQIEDNLLQFDIANKRLGFT 421


>gi|413944378|gb|AFW77027.1| hypothetical protein ZEAMMB73_570500 [Zea mays]
          Length = 484

 Score = 84.3 bits (207), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 73/259 (28%), Positives = 112/259 (43%), Gaps = 30/259 (11%)

Query: 12  SASVDNIAIGCGHNNEGLFVG-----AAGLLGLGGGSLSFPSQINAST------FSYCLV 60
           SA+VD     C    EG+  G     +AG+L L   S S PS++ AS+      FSYCL 
Sbjct: 235 SATVDKFRFAC---LEGIAPGPAEDGSAGILDLSRNSHSLPSRLVASSPPHAVAFSYCLP 291

Query: 61  DRDSDSTSTLEFDSSLPP----NAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF 116
              +D    L   ++ P          PL  +      Y + L G+ +GG  LPI   A 
Sbjct: 292 ASTAD-VGFLSLGATKPELLGRKVSYTPLRGSPSNGNLYVVDLVGLGLGGPDLPIPPAAI 350

Query: 117 KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSS 176
             D++     I++  T  T L+ + Y  LRD+F +          +   DTCY+F+   +
Sbjct: 351 AGDDT-----ILELHTTFTYLKPQVYKVLRDSFRKSMSEYPAAPPLGSLDTCYNFTGLDA 405

Query: 177 VEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTF---CFAFAPTSSSL---SIIGNVQQQG 230
             VP V+  F  G  + L     +   D +  F   C AF          ++IG++ Q  
Sbjct: 406 FSVPAVTLKFAGGADVDLWMDEMMYFTDPDNHFSIGCLAFVAQDDDCDGGTVIGSMAQMS 465

Query: 231 TRVSFNLRNSLIGFTPNKC 249
           T V +++R   +GF P +C
Sbjct: 466 TEVVYDVRGGKVGFVPYRC 484


>gi|223973231|gb|ACN30803.1| unknown [Zea mays]
          Length = 459

 Score = 84.3 bits (207), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 81/279 (29%), Positives = 123/279 (44%), Gaps = 34/279 (12%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
           G  + +T+     +V    +GC  +   +    +GL G G G+ S P+Q+    FSYCL+
Sbjct: 180 GLLIADTLRAPGRAVPGFVLGC--SLVSVHQPPSGLAGFGRGAPSVPAQLGLPKFSYCLL 237

Query: 61  DRDSDSTSTLEFDSSLPPNAVTA-----PLLRNHELDT-----FYYLGLTGISVGGDLLP 110
            R  D  + +     L            PL+++   D      +YYL L G++VGG  + 
Sbjct: 238 SRRFDDNAAVSGSLVLGGTGGGEGMQYVPLVKSAAGDKLPYGVYYYLALRGVTVGGKAVR 297

Query: 111 ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFV-----RGTRALSPTDGVALF 165
           +   AF  + +G+GG IVDSGT  T L    +  + DA V     R  R+    D + L 
Sbjct: 298 LPARAFAANAAGSGGTIVDSGTTFTYLDPTVFQPVADAVVAAVGGRYKRSKDAEDELGL- 356

Query: 166 DTCYDFSSRS-SVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNG---TFCFAFAPTSSSLS 221
             C+     + S+ +P +SFHF  G V+ LP +NY + V   G     C A     S  S
Sbjct: 357 HPCFALPQGARSMALPELSFHFEGGAVMQLPVENYFV-VAGRGAVEAICLAVVTDFSGGS 415

Query: 222 -----------IIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
                      I+G+ QQQ   V ++L    +GF    C
Sbjct: 416 GAGNEGSGPAIILGSFQQQNYLVEYDLEKERLGFRRQSC 454


>gi|358346736|ref|XP_003637421.1| Aspartic proteinase nepenthesin-1, partial [Medicago truncatula]
 gi|355503356|gb|AES84559.1| Aspartic proteinase nepenthesin-1, partial [Medicago truncatula]
          Length = 280

 Score = 84.3 bits (207), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 48/62 (77%), Positives = 57/62 (91%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
           GDFVTETVT+G   V N+A+GCGHNNEGLFVGAAGL+GLGGG LSFP+Q+N+++FSYCLV
Sbjct: 219 GDFVTETVTIGVNKVKNVALGCGHNNEGLFVGAAGLIGLGGGPLSFPAQLNSTSFSYCLV 278

Query: 61  DR 62
           DR
Sbjct: 279 DR 280


>gi|54290725|dbj|BAD62395.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
          Length = 500

 Score = 84.3 bits (207), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 79/276 (28%), Positives = 117/276 (42%), Gaps = 31/276 (11%)

Query: 1   GDFVTETVTL-GSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFS 56
           G    + +TL  SASVD+   GC   + G  +GAAGLL L   S S  S++ A    TFS
Sbjct: 229 GAVAQDVLTLTPSASVDDFTFGCVEGSSGEPLGAAGLLDLSRDSRSVASRLAADAGGTFS 288

Query: 57  YCLVDRDSDSTSTLEF-DSSLPPN-----AVTAPLLRNHELDTFYYLGLTGISVGGDLLP 110
           YCL    + S   L   ++ +P N        APL+ +      Y + L G+S+GG  +P
Sbjct: 289 YCLPLSTTSSHGFLAIGEADVPHNRTARVTAVAPLVYDPAFPNHYVIDLAGVSLGGRDIP 348

Query: 111 ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYD 170
           I   A     + +  +++D+    T ++   Y  LRDAF R          +   DTCY+
Sbjct: 349 IPPHA----ATASAAMVLDTALPYTYMKPSMYAPLRDAFRRAMARYPRAPAMGDLDTCYN 404

Query: 171 FSS-RSSVEVPTVSFHF-----PEGKVLPLPAKNYLIPVDSNGTF----CFAFAPTSSS- 219
           F+  R  V +P V   F       G  +     + +  +   G F    C AFA   S  
Sbjct: 405 FTGVRHEVLIPLVHLTFRGIGGGGGGQVLGLGADQMFYMSEPGNFFSVTCLAFAALPSDG 464

Query: 220 ------LSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
                   ++G + Q    V  ++    IGF P  C
Sbjct: 465 DAEAPLAMVMGTLAQSSMEVVHDVPGGKIGFIPGSC 500


>gi|224068901|ref|XP_002326227.1| predicted protein [Populus trichocarpa]
 gi|222833420|gb|EEE71897.1| predicted protein [Populus trichocarpa]
          Length = 477

 Score = 84.3 bits (207), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 84/272 (30%), Positives = 117/272 (43%), Gaps = 34/272 (12%)

Query: 1   GDFVTETV----TLGSASVDN----IAIGCGHNNEGLFV----GAAGLLGLGGGSLSFPS 48
           G +V++T+     LG + +DN    I  GC     G          G+ G G G LS  S
Sbjct: 163 GYYVSDTLYFDAILGQSLIDNSSALIVFGCSAYQSGDLTKTDKAVDGIFGFGQGELSVIS 222

Query: 49  Q-----INASTFSYCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGIS 103
           Q     I    FS+CL   D      L     L P  V +PL+ +      Y L L  I+
Sbjct: 223 QLSTRGITPRVFSHCL-KGDGSGGGILVLGEILEPGIVYSPLVPSQP---HYNLNLLSIA 278

Query: 104 VGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRAL---SPTD 160
           V G LLPI   AF    S + G IVDSGT +  L  E Y    D FV    A+   S T 
Sbjct: 279 VNGQLLPIDPAAFA--TSNSQGTIVDSGTTLAYLVAEAY----DPFVSAVNAIVSPSVTP 332

Query: 161 GVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNG---TFCFAFAPTS 217
             +  + CY  S+  S   P  SF+F  G  + L  ++YLIP  S+G    +C  F    
Sbjct: 333 ITSKGNQCYLVSTSVSQMFPLASFNFAGGASMVLKPEDYLIPFGSSGGSAMWCIGFQKV- 391

Query: 218 SSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
             ++I+G++  +     ++L    IG+    C
Sbjct: 392 QGVTILGDLVLKDKIFVYDLVRQRIGWANYDC 423


>gi|449456068|ref|XP_004145772.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
 gi|449496218|ref|XP_004160076.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 500

 Score = 84.0 bits (206), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 75/224 (33%), Positives = 103/224 (45%), Gaps = 16/224 (7%)

Query: 35  GLLGLGGGSLSFPSQINA-----STFSYCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNH 89
           G+ G G   LS  SQ+++       FS+CL   DS     L     + PN V  PL+ + 
Sbjct: 226 GIFGFGQQDLSVISQLSSRGIAPKVFSHCLKGDDSGG-GILVLGEIVEPNVVYTPLVPSQ 284

Query: 90  ELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAF 149
                Y L L  ISV G +LPIS   F    S + G I+DSGT +  L  E YNA   A 
Sbjct: 285 P---HYNLNLQSISVNGQVLPISPAVFA--TSSSQGTIIDSGTTLAYLAEEAYNAFVVA- 338

Query: 150 VRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNG-- 207
           V    + S    V   + CY  SS  S   P VS +F  G  L L A++YLI  +S G  
Sbjct: 339 VTNIVSQSTQSVVLKGNRCYVTSSSVSDIFPQVSLNFAGGASLVLGAQDYLIQQNSVGGT 398

Query: 208 -TFCFAFAPT-SSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
             +C  F       ++I+G++  +     ++L N  IG+T   C
Sbjct: 399 TVWCIGFQKIPGQGITILGDLVLKDKIFIYDLANQRIGWTNYDC 442


>gi|326489434|dbj|BAK01698.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 429

 Score = 84.0 bits (206), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 76/252 (30%), Positives = 108/252 (42%), Gaps = 37/252 (14%)

Query: 27  EGLFVGAAGLLGLGGGSLSFPSQIN-----ASTFSYCLVDRDSDSTST------------ 69
           E L  GAAG+ G     LS P+Q       A+ F+ CL    SD  +             
Sbjct: 161 ESLPAGAAGVAGFSRLPLSLPTQFASLLKVANEFALCLPSGGSDGVAVFGGGPFQLLAAP 220

Query: 70  -LEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKID-ESGNGGII 127
            +E    L  N +  PLL+ H  +  YY  +TGI+V   L+P     F +D  SG GG +
Sbjct: 221 PVELAGRLRENPL--PLLK-HPYNGGYYFNITGIAVNQQLVPTPPGVFDLDASSGTGGAV 277

Query: 128 VDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSS----RSSVEVPTVS 183
             + T  T L+ + Y  LR+AF   T  ++  D V  FD CY  S+    R    V  + 
Sbjct: 278 FSTVTPYTALRWDIYWPLRNAFDAATSGIARADKVEPFDLCYQASALTVTRVGYGVANIE 337

Query: 184 FHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLS----------IIGNVQQQGTRV 233
                G+   LP  + L+ V+ N T CFAF   +SS S          I+G  Q +   +
Sbjct: 338 LMLDGGRNWTLPGASSLVQVN-NQTVCFAFVQMASSSSMPAALDSPAVILGGHQMENNLL 396

Query: 234 SFNLRNSLIGFT 245
            F+L      F+
Sbjct: 397 MFDLVKETFAFS 408


>gi|224065128|ref|XP_002301682.1| predicted protein [Populus trichocarpa]
 gi|222843408|gb|EEE80955.1| predicted protein [Populus trichocarpa]
          Length = 441

 Score = 83.6 bits (205), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 79/271 (29%), Positives = 118/271 (43%), Gaps = 30/271 (11%)

Query: 1   GDFVTETVTLG-SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
           G+ V E +T   S S   + +GC   +      A G+LG+  G LSF SQ   + FSYC+
Sbjct: 175 GNLVREKITFSRSQSTPPLILGCAEESSD----AKGILGMNLGRLSFASQAKLTKFSYCV 230

Query: 60  VDRDSDS--TSTLEFDSSLPPNA---------VTAPLLRNHELDTFYY-LGLTGISVGGD 107
             R      T T  F     PN+           +   R   LD   Y + + GI +G  
Sbjct: 231 PTRQVRPGFTPTGSFYLGENPNSGGFRYINLLTFSQSQRMPNLDPLAYTVAMQGIRIGNQ 290

Query: 108 LLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR--GTRALSPTDGVALF 165
            L I  +AF+ D SG G  ++DSG+  T L  E YN +R+  VR  G R         + 
Sbjct: 291 KLNIPISAFRPDPSGAGQTMIDSGSEFTYLVDEAYNKVREEVVRLVGARLKKGYVYGGVS 350

Query: 166 DTCYDFSSRSSVE----VPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS---S 218
           D C++    +++E    +  + F F +G  + +  +  L  V   G  C     +    +
Sbjct: 351 DMCFN---GNAIEIGRLIGNMVFEFDKGVEIVVEKERVLADV-GGGVHCVGIGRSEMLGA 406

Query: 219 SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           + +IIGN  QQ   V F+L N  +GF    C
Sbjct: 407 ASNIIGNFHQQNIWVEFDLANRRVGFGKADC 437


>gi|222618833|gb|EEE54965.1| hypothetical protein OsJ_02555 [Oryza sativa Japonica Group]
          Length = 393

 Score = 83.6 bits (205), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 68/220 (30%), Positives = 96/220 (43%), Gaps = 50/220 (22%)

Query: 21  GCGHNN--EGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLVDRDSDSTSTLEFDSSLPP 78
           GC H     G+     GL+GLGG + S  SQ  A                          
Sbjct: 207 GCSHAELGAGMDDKTDGLIGLGGDAQSLVSQTAA-------------------------- 240

Query: 79  NAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQ 138
                   R+ ++ T+Y+  L  I+VGG  L +S + F        G +VDSGT +TRL 
Sbjct: 241 --------RSKKVPTYYFAALEDIAVGGKKLGLSPSVFA------AGSLVDSGTVITRLP 286

Query: 139 TETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKN 198
              Y AL  AF  G    +  + + + DTC++F+    V +PTV+  F  G V+ L A  
Sbjct: 287 PAAYAALSSAFRAGMTRYARAEPLGILDTCFNFTGLDKVSIPTVALVFAGGAVVDLDAHG 346

Query: 199 YLIPVDSNGTFCFAFAPT--SSSLSIIGNVQQQGTRVSFN 236
            +    S G  C AFAPT    +   IGNVQQ+   V ++
Sbjct: 347 IV----SGG--CLAFAPTRDDKAFGTIGNVQQRTFEVLYD 380


>gi|326515330|dbj|BAK03578.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 445

 Score = 83.6 bits (205), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 78/278 (28%), Positives = 118/278 (42%), Gaps = 39/278 (14%)

Query: 5   TETVTLGSAS--VDNIAIGCGHNNEGLFVGA--AGLLGLGGGSLSFPSQIN---ASTFSY 57
           T+T+ LG+ +  + ++A GC  + EG       AG LG+G    S   QI     S FSY
Sbjct: 163 TDTIILGNPTLPIHSVAFGCAQSTEGFDTKGTFAGTLGMGKLPTSLIMQIKDRVGSRFSY 222

Query: 58  CLVD--RDSDSTSTLEFDSSLPPNAV----------TAPLLRNHELDTFYYLGLTGISVG 105
           CL+           + F + +P   +          T P L +   D+ YY+ L GIS+ 
Sbjct: 223 CLIGLGHSPGRNGFIRFGADIPDPTLLVHHRIKILPTPPHLPHGVADSAYYVKLLGISLN 282

Query: 106 GDLLP-ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGT------RALSP 158
           G  +P I +  F+    G+GG  VD+GT VT L    Y  + +A           R   P
Sbjct: 283 GTPIPGIRQAMFERRSDGSGGCFVDAGTQVTHLVPAAYAVVEEAVAHMVQQWGYKRVRDP 342

Query: 159 TDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKV------LPLPAKNYLIPVDSNGTFCFA 212
                 F  C+         +P ++  F EG        L + ++N  + VD+    CF 
Sbjct: 343 N-----FSLCFREHPGIWSHIPKLTLDF-EGPASRTVAHLEIVSRNLFLKVDNQPLVCFG 396

Query: 213 FAPTSS-SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
              TS  S +++G +QQ  TR  F+L  + I F    C
Sbjct: 397 VYRTSRGSPTVVGAMQQVDTRFIFDLHANTITFHRESC 434


>gi|302142046|emb|CBI19249.3| unnamed protein product [Vitis vinifera]
          Length = 191

 Score = 83.6 bits (205), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 52/150 (34%), Positives = 76/150 (50%), Gaps = 9/150 (6%)

Query: 104 VGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVA 163
           VG  L+P++      D +   G I+DSGT +TR     Y A+RD F +  +   P   + 
Sbjct: 46  VGRVLVPVAPELLAFDPNTGAGTIIDSGTVITRFVEPVYAAIRDEFRKQVKG--PFATIG 103

Query: 164 LFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP----TSSS 219
            FDTC  F++ +    P V+FHF  G  L LP +N LI   +    C A A      +S 
Sbjct: 104 AFDTC--FAATNEDIAPPVTFHF-TGMDLKLPLENTLIHSSAGSLACLAMAAAPNNVNSV 160

Query: 220 LSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           L++I N+QQQ  R+ F++ NS +G     C
Sbjct: 161 LNVIANLQQQNLRIMFDVTNSRLGIARELC 190


>gi|356498711|ref|XP_003518193.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 466

 Score = 83.6 bits (205), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 74/276 (26%), Positives = 111/276 (40%), Gaps = 31/276 (11%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
           G  ++E +   +    +  +GC   +       AG+ G G G  S PSQ+N + FSYCL+
Sbjct: 191 GFLLSENLNFPTKKYSDFLLGCSVVS---VYQPAGIAGFGRGEESLPSQMNLTRFSYCLL 247

Query: 61  DRDSDSTST------LEFDSSL--PPNAVT-APLL------RNHELDTFYYLGLTGISVG 105
               D ++T      LE  SS     N V+  P L      +N     +YY+ L  I VG
Sbjct: 248 SHQFDDSATITSNLVLETASSRDGKTNGVSYTPFLKNPTTKKNPAFGAYYYITLKRIVVG 307

Query: 106 GDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRG---TRALSPTDGV 162
              + +     + +  G+GG IVDSG+  T ++   ++ +   F +    TRA       
Sbjct: 308 EKRVRVPRRLLEPNVDGDGGFIVDSGSTFTFMERPIFDLVAQEFAKQVSYTRAREAEKQF 367

Query: 163 ALFDTCYDFS-SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP------ 215
            L   C+  +    +   P + F F  G  + LP  NY   V      C           
Sbjct: 368 GL-SPCFVLAGGAETASFPELRFEFRGGAKMRLPVANYFSLVGKGDVACLTIVSDDVAGS 426

Query: 216 --TSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
             T     I+GN QQQ   V ++L N   GF    C
Sbjct: 427 GGTVGPAVILGNYQQQNFYVEYDLENERFGFRSQSC 462


>gi|224035171|gb|ACN36661.1| unknown [Zea mays]
          Length = 378

 Score = 83.2 bits (204), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 80/286 (27%), Positives = 115/286 (40%), Gaps = 62/286 (21%)

Query: 12  SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN---ASTFSYCLVDRDSDSTS 68
           + +VDN    C H   G  VG AG    G G LS P Q++   +  FSYCLV      + 
Sbjct: 97  AVAVDNFTFACAHTALGEPVGVAGF---GRGPLSLPGQLSPQLSGRFSYCLV------SH 147

Query: 69  TLEFDSSLPPNA-------------------VTAPLLRNHELDTFYYLGLTGISVGGDLL 109
           +   D  + P+                    V  PLL N +   FY + L  +SVG   +
Sbjct: 148 SFRADRLIRPSPLILGRSPDDAAAAAETDGFVYTPLLHNPKHPYFYSVALEAVSVGAARI 207

Query: 110 PISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGT---------RALSPTD 160
                  ++D +GNGG++VDSGT  T L  E Y  + +AF R           RA   T 
Sbjct: 208 QARPELARVDRAGNGGMVVDSGTTFTMLPNEMYARVAEAFARAMAAAGFARAERAEEQTG 267

Query: 161 GVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSN----GTF-----CF 211
                  CY +++ S   VP ++ HF     + LP +NY +   S     GT      C 
Sbjct: 268 ----LTPCYRYAA-SDRGVPPLALHFRGNATVALPRRNYFMGFKSEDAGAGTRKDDVGCL 322

Query: 212 AFAPTSSS--------LSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
                  +           +GN QQQG  V +++    +GF   +C
Sbjct: 323 MLMNGGDASGEEGDGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRC 368


>gi|242094480|ref|XP_002437730.1| hypothetical protein SORBIDRAFT_10g001450 [Sorghum bicolor]
 gi|241915953|gb|EER89097.1| hypothetical protein SORBIDRAFT_10g001450 [Sorghum bicolor]
          Length = 507

 Score = 83.2 bits (204), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 79/246 (32%), Positives = 106/246 (43%), Gaps = 40/246 (16%)

Query: 21  GCGHNN-----EGLFVGA-AGLLGLGGGSLSFPSQ---INASTFSYCLVDRDSDS----- 66
           GC H       EG    A AG++ LGGG  S  SQ   +  S FSYC+   +S       
Sbjct: 230 GCSHGEAKQGGEGSIDNATAGIMALGGGPESLVSQNAAMYGSAFSYCIPATESRRPGFFV 289

Query: 67  TSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGI 126
                 D S        P+LR   + T Y + L  I+V G  L ++ + F        G 
Sbjct: 290 LGGGVGDLSGAGGYAVTPMLRYARVPTLYRVRLLAIAVDGQQLNVTPSVFA------SGS 343

Query: 127 IVDSGTAVTRLQTETYNALRDAFVRGTRAL---SPTDGVALFDTCYDFSSRSSVEVPTVS 183
           ++DS TA+TRL    Y ALR+AF R   A+   +P  G    DTCYDF+    V VP V+
Sbjct: 344 VLDSRTAITRLPPTAYQALREAF-RSRMAMYREAPPQGN--LDTCYDFAGAFLVMVPRVA 400

Query: 184 FHFPEGKVLPLPAKNYLIPVDSNGTF---CFAFAPTSSSL--SIIGNVQQQGTRVSFNLR 238
                     L   N ++ +D  G     C  F   +      I+GNVQQQ   V +N+ 
Sbjct: 401 L---------LLDGNAVVALDRQGILFHDCLVFTSNTDDRMPGILGNVQQQTMEVLYNVG 451

Query: 239 NSLIGF 244
             LI  
Sbjct: 452 GVLISM 457


>gi|125564663|gb|EAZ10043.1| hypothetical protein OsI_32347 [Oryza sativa Indica Group]
          Length = 330

 Score = 83.2 bits (204), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 77/266 (28%), Positives = 127/266 (47%), Gaps = 20/266 (7%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
           G F TET  LG+ +V NI  GCG  N+G +   AG+ G+G G +S  +Q+    FSYC  
Sbjct: 60  GYFATETFALGNVTVANITFGCGTRNQGYYDNVAGVFGVGRGGVSLLNQLGIDRFSYCFS 119

Query: 61  DRDSDSTSTLEFDSS-------LPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISE 113
              +  +S +    S           A + P++ +  L + Y++ L G++VG   + ++ 
Sbjct: 120 SSGAPGSSAVFLGGSPELATNATTTPAASTPMVADPVLKSGYFVKLVGVTVGATRVDVAG 179

Query: 114 TAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTD-----GVALFDTC 168
            +    E G   +++DS + VT L   TY  +R A V     L   +     GV L D C
Sbjct: 180 ASSA--EGGGRALVIDSTSPVTVLDEATYGPVRRALVAQLAPLKEANANASAGVGL-DLC 236

Query: 169 YDFSSRSSVEVP---TVSFHFPEGKV-LPLPAKNYLIPVDSNGTFCFAFAPTSSS-LSII 223
           ++ ++  +   P   T++ HF  G   L LP  NYL    + G  C    P+SS+ + ++
Sbjct: 237 FELAAGGATPTPPNVTMTLHFDGGAADLVLPPANYLAKDSAGGLICLTMTPSSSNGVPVL 296

Query: 224 GNVQQQGTRVSFNLRNSLIGFTPNKC 249
           G+     T V ++L  +++ F P  C
Sbjct: 297 GSSALLDTLVLYDLAKNVVSFQPLDC 322


>gi|356537928|ref|XP_003537458.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 445

 Score = 83.2 bits (204), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 74/242 (30%), Positives = 106/242 (43%), Gaps = 31/242 (12%)

Query: 35  GLLGLGGGSLSFPSQINASTFSYCLVDRDSDST-----STLEFDSSLPPNAVTAPLLRNH 89
           GL+G+  GSLSF +Q+    FSYC+  +D+        +T ++   L P   T  +  N 
Sbjct: 197 GLMGMNRGSLSFVTQMGFPKFSYCISGKDASGVLLFGDATFKW---LGPLKYTPLVKMNT 253

Query: 90  ELDTF----YYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNAL 145
            L  F    Y + L GI VG   L + +  F  D +G G  +VDSGT  T L    Y AL
Sbjct: 254 PLPYFDRVAYTVRLMGIRVGSKPLQVPKEIFAPDHTGAGQTMVDSGTRFTFLLGSVYTAL 313

Query: 146 RDAFVRGTRALSP--TDGVALFDTCYDFSSRSSV-----EVPTVSFHFPEGKVLPLPAKN 198
           R+ FV  TR +     D   +F+   D   R         VP V+  F EG  + +  + 
Sbjct: 314 RNEFVAQTRGVLTLLEDPNFVFEGAMDLCFRVRRGGVVPAVPAVTMVF-EGAEMSVSGER 372

Query: 199 YLIPVDSNG--------TFCFAFAPTS---SSLSIIGNVQQQGTRVSFNLRNSLIGFTPN 247
            L  V  +G         +C  F  +        +IG+  QQ   + F+L NS +GF   
Sbjct: 373 LLYRVGGDGDVAKGNGDVYCLTFGNSDLLGIEAYVIGHHHQQNVWMEFDLVNSRVGFADT 432

Query: 248 KC 249
           KC
Sbjct: 433 KC 434


>gi|356540369|ref|XP_003538662.1| PREDICTED: LOW QUALITY PROTEIN: probable aspartic protease
           At2g35615-like [Glycine max]
          Length = 364

 Score = 83.2 bits (204), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 79/254 (31%), Positives = 115/254 (45%), Gaps = 16/254 (6%)

Query: 2   DFVTETVTLGSASVDNIAIGCGHNNEGLF----VGAAGLLGLGGGSLSFPSQINAST-FS 56
           +  T + T G   V++I  GCGHNN G+F    +G  GL G     +S    +  S  FS
Sbjct: 112 EIATFSSTDGKPIVESIIFGCGHNNTGVFNENDMGLIGLGGGPLSLVSQMGNLYGSKRFS 171

Query: 57  YCLV----DRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPIS 112
            CLV    D  +  T +L   S +    V    L + E  T Y + L GISVG   +P +
Sbjct: 172 QCLVPFHADPHTSGTISLGEASDVSGEGVVTTPLVSEEGQTPYLVTLEGISVGDTFVPFN 231

Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFS 172
            +         G I++DSGT  T L  E Y+ L +  ++    L P        T   + 
Sbjct: 232 SSEML----SKGNIMIDSGTPETYLPQEFYDRLVEE-LKVQINLPPIHVDPDLGTQLCYK 286

Query: 173 SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTR 232
           S +++E P ++ HF    V  LP + ++ P D  G FCFA   T+  L I GN  Q    
Sbjct: 287 SETNLEGPILTAHFEGADVKLLPLQTFIPPKD--GVFCFAMTGTTDGLYIFGNFAQSNVL 344

Query: 233 VSFNLRNSLIGFTP 246
           + F+L   ++ F P
Sbjct: 345 IGFDLDKRIVFFKP 358


>gi|125555054|gb|EAZ00660.1| hypothetical protein OsI_22681 [Oryza sativa Indica Group]
          Length = 337

 Score = 83.2 bits (204), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 78/278 (28%), Positives = 116/278 (41%), Gaps = 38/278 (13%)

Query: 1   GDFVTETVTL-GSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINA---STFS 56
           G    + +TL  SASVD+   GC   + G  +GAAGLL L   S S  S++ A    TFS
Sbjct: 69  GAVAQDVLTLTPSASVDDFTFGCVEGSSGEPLGAAGLLDLSRDSRSLASRLAAGAGGTFS 128

Query: 57  YCLVDRDSDSTSTLEF-DSSLPPN-----AVTAPLLRNHELDTFYYLGLTGISVGGDLLP 110
           YCL    + S   L   ++ +P N        APL+ +      Y + L G+S+GG  +P
Sbjct: 129 YCLPLSTTSSHGFLVIGEADVPHNRSARVTAVAPLVYDPAFPNHYVIDLAGVSLGGRDIP 188

Query: 111 ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYD 170
           I   A          +++D+    T ++   Y  LRDAF R          +   DTCY+
Sbjct: 189 IPPHA---------AMVLDTALPYTYMKPSMYAPLRDAFRRAMARYPRAPAMGDLDTCYN 239

Query: 171 FSS-RSSVEVPTVSFHF-------PEGKVLPLPAKNYLIPVDSNGTF----CFAFAPTSS 218
           F+  R  V +P V   F            +     + ++ +   G F    C AFA   S
Sbjct: 240 FTGVRHEVLIPLVHLTFRGISGGGGGEGQVLGLGADQMLYMSEPGNFFSVTCLAFAALPS 299

Query: 219 S-------LSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
                     ++G + Q    V  +++   IGF P  C
Sbjct: 300 DGDAAAPLAMVMGTLAQSSMEVVHDVQGGKIGFIPGSC 337


>gi|15226317|ref|NP_180370.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|4063755|gb|AAC98463.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|20197953|gb|AAM15327.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|330252977|gb|AEC08071.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 392

 Score = 83.2 bits (204), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 80/269 (29%), Positives = 113/269 (42%), Gaps = 36/269 (13%)

Query: 1   GDFVTETVTLGSAS-----VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS-- 53
           G   TETVT+ S S     +    IGCGHN+       +G++GL  G  S  +Q+     
Sbjct: 135 GTLATETVTIHSTSGEPFVMPETTIGCGHNSSWFKPTFSGMVGLSWGPSSLITQMGGEYP 194

Query: 54  -TFSYCLVDRDSDSTSTLEFDSSLPPNAVTA-------PLLRNHELDTFYYLGLTGISVG 105
              SYC     S  TS + F +    NA+ A        +         YYL L  +SVG
Sbjct: 195 GLMSYCFA---SQGTSKINFGT----NAIVAGDGVVSTTMFLTTAKPGLYYLNLDAVSVG 247

Query: 106 GDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDA---FVRGTRALSPTDGV 162
              +    T F   E   G II+DSGT +T       N +R+A   +V   R   PT   
Sbjct: 248 DTHVETMGTTFHALE---GNIIIDSGTTLTYFPVSYCNLVREAVDHYVTAVRTADPTGND 304

Query: 163 ALFDTCYDFSSRSSVEV-PTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSL- 220
            L   CY      ++++ P ++ HF  G  L L   N  I   + GTFC A    +    
Sbjct: 305 ML---CY---YTDTIDIFPVITMHFSGGADLVLDKYNMYIETITRGTFCLAIICNNPPQD 358

Query: 221 SIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           +I GN  Q    V ++  + L+ F+P  C
Sbjct: 359 AIFGNRAQNNFLVGYDSSSLLVSFSPTNC 387


>gi|357119741|ref|XP_003561592.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 410

 Score = 83.2 bits (204), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 76/251 (30%), Positives = 111/251 (44%), Gaps = 26/251 (10%)

Query: 14  SVDNIAIGCGH-----NNEGLFVGAAGLLGLGGGSLSFPSQINAST---FSYCLVDRDSD 65
           SV  I  GC H     +N+G     +G+L L    LSF + +   +   FSYCL    + 
Sbjct: 170 SVPGIMFGCAHSVTGFHNDGTL---SGVLSLSHSPLSFLTLLGGRSSGRFSYCLPKPTTH 226

Query: 66  ST-STLEFDS---SLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDES 121
           +  S L F +   SLPP+A T  L+  H     Y+L + GIS+G   L I    F    +
Sbjct: 227 NPDSFLRFGADVPSLPPHAHTTTLV--HAGVPGYHLNIVGISLGNKRLHIDRHVF----A 280

Query: 122 GNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSP--TDGVALFDTCYDFSSRS-SVE 178
             GG  ++    +TR+    Y A+  A V   + L      G+     C+D   RS  V+
Sbjct: 281 AGGGCSINPAVTITRIMELAYLAVEHALVAHMKELGSGRVKGMPGRSLCFDHMDRSVRVQ 340

Query: 179 VPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLR 238
           +P +SFHF +G  L   A+  L  V      CF         ++IG  QQ  TR +F++ 
Sbjct: 341 LPGMSFHFEDGAELRFAAEQ-LFDVRVMAA-CFLVVGRGHHQTVIGAAQQVDTRFTFDIA 398

Query: 239 NSLIGFTPNKC 249
              + F P  C
Sbjct: 399 AGRLAFVPETC 409


>gi|297828736|ref|XP_002882250.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297328090|gb|EFH58509.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 488

 Score = 83.2 bits (204), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 71/263 (26%), Positives = 116/263 (44%), Gaps = 26/263 (9%)

Query: 2   DFVTETVTLGSASVDNIAIGCGHNNEGLF----VGAAGLLGLGGGSLSFPSQINAS---- 53
           D VT     GS +   I  GCG    G          G++G G  + SF SQ+ +     
Sbjct: 188 DLVTGNRQTGSTN-GTIIFGCGSKQSGQLGESQAAVDGIMGFGQSNSSFISQLASQGKVK 246

Query: 54  -TFSYCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPIS 112
            +F++CL   +++          + P   T P+L        Y + L  I VG  +L +S
Sbjct: 247 RSFAHCL--DNNNGGGIFAIGEVVSPKVKTTPMLSK---SAHYSVNLNAIEVGNSVLQLS 301

Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFS 172
             AF  D   + G+I+DSGT +  L    YN L +  +   + L+       F TC+ + 
Sbjct: 302 SDAF--DSGDDKGVIIDSGTTLVYLPDAVYNPLMNQILASHQELNLHTVQDSF-TCFHYI 358

Query: 173 SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFA------PTSSSLSIIGNV 226
            R     PTV+F F +   L +  + YL  V  + T+CF +          +SL+I+G++
Sbjct: 359 DRLD-RFPTVTFQFDKSVSLAVYPQEYLFQVRED-TWCFGWQNGGLQTKGGASLTILGDM 416

Query: 227 QQQGTRVSFNLRNSLIGFTPNKC 249
                 V +++ N +IG+T + C
Sbjct: 417 ALSNKLVVYDIENQVIGWTNHNC 439


>gi|357162717|ref|XP_003579500.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 488

 Score = 82.8 bits (203), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 79/271 (29%), Positives = 112/271 (41%), Gaps = 43/271 (15%)

Query: 17  NIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLVDRDSDSTSTLEFDSSL 76
           N AIGC  +   +    +GL G G G+ S PSQ+    FSYCL+ R  D  S +  +  L
Sbjct: 219 NFAIGC--SIVSVHQPPSGLAGFGRGAPSVPSQLKVPKFSYCLLSRRFDDNSAVSGELVL 276

Query: 77  PPNAVTA----------PLLRNH----ELDTFYYLGLTGISVGGDLLPISETAFKIDESG 122
               V A          PLL N         +YYL LTGISVGG  + +   AF +  SG
Sbjct: 277 GDAMVPAGKKKTTMQYVPLLNNAASKPPYSVYYYLALTGISVGGKPVNLPSRAF-VPSSG 335

Query: 123 NGGIIVDSGTAVTRLQTETYNALRDAFV-----RGTRALSPTDGVALFDTCYDFSSRS-- 175
            GG I+DSGT  T L    +  +  A       R  R+    D + L   C+        
Sbjct: 336 -GGAIIDSGTTFTYLDPTVFKPVAAAMESAVGGRYNRSRPVEDALGL-RPCFALPPGPGG 393

Query: 176 SVEVPTVSFHFPEGKVLPLPAKNYL-----------------IPVDSNGTFCFAFAPTSS 218
           ++E+P +   F  G V+ LP +NY                  + V S+          + 
Sbjct: 394 AMELPDLELKFKGGAVMRLPVENYFVAAGPAGGPAAGPVAICLAVVSDLPASGGDGAAAG 453

Query: 219 SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
              I+G+ QQQ   + ++L    +GF    C
Sbjct: 454 PAIILGSFQQQNYHIEYDLGKERLGFRQQPC 484


>gi|242092874|ref|XP_002436927.1| hypothetical protein SORBIDRAFT_10g011140 [Sorghum bicolor]
 gi|241915150|gb|EER88294.1| hypothetical protein SORBIDRAFT_10g011140 [Sorghum bicolor]
          Length = 484

 Score = 82.8 bits (203), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 63/215 (29%), Positives = 91/215 (42%), Gaps = 14/215 (6%)

Query: 42  GSLSFPSQINASTFSYCLVDRDSDSTSTLEFDSSLPP----NAVTAPLLRNHELDTFYYL 97
            S + PS  +A  FSYCL    SD    L   ++ P          PL  N      Y +
Sbjct: 277 ASRAAPSSPDAVAFSYCLPSYPSD-VGFLSLGATKPELLGRKVSYTPLRSNRHNGNLYVV 335

Query: 98  GLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALS 157
            L G+ +GG  LP+   A        GG I++  T  T L+ + Y ALRD F +      
Sbjct: 336 ELVGLGLGGVDLPVPRAAI-----AGGGTILELHTTFTYLKPKVYAALRDEFRKSMSQYP 390

Query: 158 PTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTF---CFAFA 214
                   DTCY+F++ SS  VP V+  F  G    L     +   +    F   C AF 
Sbjct: 391 VAPPQGSLDTCYNFTALSSYSVPAVTLKFDGGAEFDLWIDEMMYFPEPGSYFSVGCLAFV 450

Query: 215 PTSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
                 ++IG++ Q  T V +++R   +GF P +C
Sbjct: 451 AQDGG-AVIGSMAQMSTEVVYDVRGGKVGFVPYRC 484


>gi|223950045|gb|ACN29106.1| unknown [Zea mays]
          Length = 392

 Score = 82.8 bits (203), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 71/257 (27%), Positives = 109/257 (42%), Gaps = 24/257 (9%)

Query: 11  GSASVDNIAIGCGHNNEGLFVGAA--GLLGLGGGSLSFPSQINASTFSYCLVDRDSDSTS 68
           GS S + +AIGC  +    F   +  G+ GLG  + S P Q+N S FSYCL         
Sbjct: 138 GSQSFEEVAIGCSTSATLKFKDPSIKGVFGLGRSATSLPRQLNFSKFSYCLSSYQKPDLP 197

Query: 69  TLEFDSSLP----------PNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKI 118
           +    ++ P              T  L  N +  T Y++ L GIS+GG  LP   T    
Sbjct: 198 SYLLLTAAPDMATGAVGGAAAVATTALQPNSDYKTRYFVDLQGISIGGTRLPAVST---- 253

Query: 119 DESGNGGIIVDSGTAVTRLQTETYNAL---RDAFVRGTRALSPTDGVALFDTCYDFSSRS 175
            +SG G + VD+GT+ TRL+   +  L    D  ++  + +    G      CY   S +
Sbjct: 254 -KSG-GNMFVDTGTSFTRLEGTVFAKLVTELDRIMKERKYVKEQPGRNNGQICYSPPSTA 311

Query: 176 SVE---VPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTR 232
           + E   +P +  HF +   + LP  +YL    S        +     +S++GN Q Q T 
Sbjct: 312 ADESSKLPDMVLHFADSANMVLPWDSYLWKTTSKLCLAIDKSNIKGGISVLGNFQMQNTH 371

Query: 233 VSFNLRNSLIGFTPNKC 249
           +  +  N  + F    C
Sbjct: 372 MLLDTGNEKLSFVRADC 388


>gi|414586111|tpg|DAA36682.1| TPA: pepsin A [Zea mays]
          Length = 503

 Score = 82.8 bits (203), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 80/286 (27%), Positives = 115/286 (40%), Gaps = 62/286 (21%)

Query: 12  SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN---ASTFSYCLVDRDSDSTS 68
           + +VDN    C H   G  VG AG    G G LS P Q++   +  FSYCLV      + 
Sbjct: 222 AVAVDNFTFACAHTALGEPVGVAGF---GRGPLSLPGQLSPQLSGRFSYCLV------SH 272

Query: 69  TLEFDSSLPPNA-------------------VTAPLLRNHELDTFYYLGLTGISVGGDLL 109
           +   D  + P+                    V  PLL N +   FY + L  +SVG   +
Sbjct: 273 SFRADRLIRPSPLILGRSPDDAAAAAETDGFVYTPLLHNPKHPYFYSVALEAVSVGAARI 332

Query: 110 PISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGT---------RALSPTD 160
                  ++D +GNGG++VDSGT  T L  E Y  + +AF R           RA   T 
Sbjct: 333 QARPELARVDRAGNGGMVVDSGTTFTMLPNEMYARVAEAFARAMAAAGFARAERAEEQTG 392

Query: 161 GVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSN----GTF-----CF 211
                  CY +++ S   VP ++ HF     + LP +NY +   S     GT      C 
Sbjct: 393 ----LTPCYRYAA-SDRGVPPLALHFRGNATVALPRRNYFMGFKSEDAGAGTRKDDVGCL 447

Query: 212 AFAPTSSS--------LSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
                  +           +GN QQQG  V +++    +GF   +C
Sbjct: 448 MLMNGGDASGEEGDGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRC 493


>gi|297823357|ref|XP_002879561.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297325400|gb|EFH55820.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 447

 Score = 82.8 bits (203), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 84/275 (30%), Positives = 139/275 (50%), Gaps = 33/275 (12%)

Query: 1   GDFVTETVTLGSASVDNIAI-----GCGHNNEGLF-VGAAGLLGLGGGSLSFPSQINAS- 53
           GD  TET+++ SAS   ++      GCG+NN G F    +G++GLGGG LS  SQ+ +S 
Sbjct: 176 GDVATETISIDSASGSPVSFPGTVFGCGYNNGGTFDETGSGIIGLGGGHLSLISQLGSSI 235

Query: 54  --TFSYCLVDRDS--DSTSTLEFDSSLPPNA-------VTAPLLRNHELDTFYYLGLTGI 102
              FSYCL  + +  + TS +   ++  P++       ++ PL+ + E  T+YYL L  I
Sbjct: 236 SKKFSYCLSHKSATTNGTSVINLGTNSIPSSLSKDSGVISTPLV-DKEPRTYYYLTLEAI 294

Query: 103 SVGGDLLPISETAFKIDESG-----NGGIIVDSGTAVTRLQT---ETYNALRDAFVRGTR 154
           SVG   +P + +++  ++ G     +G II+DSGT +T L +   + + A  +  V G +
Sbjct: 295 SVGKKKIPYTGSSYNPNDGGIFSETSGNIIIDSGTTLTLLDSGFFDKFGAAVEELVTGAK 354

Query: 155 ALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFA 214
            +S   G  L   C+  S  + + +P ++ HF  G  + L   N  + V S    C +  
Sbjct: 355 RVSDPQG--LLSHCFK-SGSAEIGLPEITVHF-TGADVRLSPINAFVKV-SEDMVCLSMV 409

Query: 215 PTSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           PT + ++I GN  Q    V ++L    + F    C
Sbjct: 410 PT-TEVAIYGNFAQMDFLVGYDLETRTVSFQRMDC 443


>gi|255583547|ref|XP_002532530.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223527742|gb|EEF29846.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 440

 Score = 82.8 bits (203), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 79/268 (29%), Positives = 115/268 (42%), Gaps = 24/268 (8%)

Query: 1   GDFVTETVTLGSA-SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
           G  V E +T  S+ S   + +GC   +        G+LG+  G  SF SQ   S FSYC+
Sbjct: 174 GSLVREKITFSSSQSTPPLILGCAEAS----TDEKGILGMNLGRRSFASQAKISKFSYCV 229

Query: 60  VDRDSDS--TSTLEFDSSLPPNA---------VTAPLLRNHELDTFYY-LGLTGISVGGD 107
             R + +  +ST  F     PN+            P  R+  LD   Y + + GI +G  
Sbjct: 230 PTRQARAGLSSTGSFYLGNNPNSGRFQYINLLTFTPSQRSPNLDPLAYTIPMQGIRMGNA 289

Query: 108 LLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR--GTRALSPTDGVALF 165
            L IS T F+ D SG G  I+DSG+  T L  E YN +R+  VR  G +         + 
Sbjct: 290 RLNISATLFRPDPSGAGQTIIDSGSEFTYLVDEAYNKVREEVVRLVGPKLKKGYVYGGVS 349

Query: 166 DTCYDFSSRSSVE-VPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS---SSLS 221
           D C+D +       +  + F F +G  + +     L  V   G  C     +    ++ +
Sbjct: 350 DMCFDGNPMEIGRLIGNMVFEFEKGVEIVIDKWRVLADV-GGGVHCIGIGRSEMLGAASN 408

Query: 222 IIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           IIGN  QQ   V ++L N  IG     C
Sbjct: 409 IIGNFHQQNLWVEYDLANRRIGLGKADC 436


>gi|376338606|gb|AFB33833.1| hypothetical protein CL1136Contig1_03, partial [Larix decidua]
 gi|376338608|gb|AFB33834.1| hypothetical protein CL1136Contig1_03, partial [Larix decidua]
 gi|376338610|gb|AFB33835.1| hypothetical protein CL1136Contig1_03, partial [Larix decidua]
          Length = 71

 Score = 82.8 bits (203), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 41/70 (58%), Positives = 49/70 (70%), Gaps = 1/70 (1%)

Query: 161 GVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDS-NGTFCFAFAPTSSS 219
           G +LFDTCYD S   +V+VPT+ FHF     + LPA NYLI VDS +  FCFAFA  +  
Sbjct: 2   GFSLFDTCYDLSGLKTVKVPTLDFHFKGRADVSLPATNYLILVDSASAVFCFAFAGNTGG 61

Query: 220 LSIIGNVQQQ 229
           LSIIGN+QQQ
Sbjct: 62  LSIIGNIQQQ 71


>gi|356528671|ref|XP_003532923.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 440

 Score = 82.8 bits (203), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 72/226 (31%), Positives = 105/226 (46%), Gaps = 23/226 (10%)

Query: 35  GLLGLGGGSLSFPSQIN---ASTFSYCLVDRDSDSTSTLEF--DSSLP--PNAVTAPLLR 87
           GL+GLG G LS  SQ+       FSYC     S+STS + F  D+ +      V+ PL+ 
Sbjct: 224 GLVGLGVGPLSLISQLGYQIGRKFSYCFPPLSSNSTSKMRFGNDAIVKQIKGVVSTPLII 283

Query: 88  NHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRD 147
                ++YYL L G+S+G   +  SE+        +G I++DSGT+ T L+   YN    
Sbjct: 284 KSIGPSYYYLNLEGVSIGNKKVKTSES------QTDGNILIDSGTSFTILKQSFYNK--- 334

Query: 148 AFVRGTRALSPTDGVALFDTCYDFSSRSS---VEVPTVSFHFPEGKVLPLPAKNYLIPVD 204
            FV   + +   + V +    Y+F   +       P V F F   KV  + A N L   +
Sbjct: 335 -FVALVKEVYGVEAVKIPPLVYNFCFENKGKRKRFPDVVFLFTGAKVR-VDASN-LFEAE 391

Query: 205 SNGTFCFAFAPTSSSL-SIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
            N   C    PTS    SI GN  Q G +V ++L+  ++ F P  C
Sbjct: 392 DNNLLCMVALPTSDEDDSIFGNHAQIGYQVEYDLQGGMVSFAPADC 437


>gi|226494967|ref|NP_001141737.1| uncharacterized protein LOC100273869 [Zea mays]
 gi|194705750|gb|ACF86959.1| unknown [Zea mays]
 gi|195645950|gb|ACG42443.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
          Length = 163

 Score = 82.8 bits (203), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 53/165 (32%), Positives = 85/165 (51%), Gaps = 10/165 (6%)

Query: 91  LDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFV 150
           +  FY + + G+SV G+LL I    + +++   GG I+DSGT++T L +  Y A+  A  
Sbjct: 1   MRPFYAVAVNGVSVDGELLRIPRRVWDVEK--GGGAILDSGTSLTVLVSPAYRAVVAALS 58

Query: 151 RGTRALSPTDGVALFDTCYDFSSRS-----SVEVPTVSFHFPEGKVLPLPAKNYLIPVDS 205
           R    L P   +  FD CY+++S S     +V VP ++ HF     L  P K+Y+I   +
Sbjct: 59  RKLAGL-PRVAMDPFDYCYNWTSPSTGEDLAVAVPELALHFAGSARLQPPPKSYVIDA-A 116

Query: 206 NGTFCFAFAP-TSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
            G  C          +S+IGN+ QQ     F+L+N  + F  ++C
Sbjct: 117 PGVKCIGLQEGDWPGVSVIGNIMQQEHLWEFDLKNRRLRFKRSRC 161


>gi|20466302|gb|AAM20468.1| putative aspartyl protease [Arabidopsis thaliana]
 gi|23198124|gb|AAN15589.1| putative aspartyl protease [Arabidopsis thaliana]
          Length = 320

 Score = 82.4 bits (202), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 70/263 (26%), Positives = 116/263 (44%), Gaps = 26/263 (9%)

Query: 2   DFVTETVTLGSASVDNIAIGCGHNNEGLF----VGAAGLLGLGGGSLSFPSQINAS---- 53
           D VT     GS +   I  GCG    G          G++G G  + SF SQ+ +     
Sbjct: 20  DLVTGNRQTGSTN-GTIIFGCGSKQSGQLGESQAAVDGIMGFGQSNSSFISQLASQGKVK 78

Query: 54  -TFSYCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPIS 112
            +F++CL   +++          + P   T P+L        Y + L  I VG  +L +S
Sbjct: 79  RSFAHCL--DNNNGGGIFAIGEVVSPKVKTTPMLSK---SAHYSVNLNAIEVGNSVLELS 133

Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFS 172
             AF  D   + G+I+DSGT +  L    YN L +  +     L+       F TC+ ++
Sbjct: 134 SNAF--DSGDDKGVIIDSGTTLVYLPDAVYNPLLNEILASHPELTLHTVQESF-TCFHYT 190

Query: 173 SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFA------PTSSSLSIIGNV 226
            +     PTV+F F +   L +  + YL  V  + T+CF +          +SL+I+G++
Sbjct: 191 DKLD-RFPTVTFQFDKSVSLAVYPREYLFQVRED-TWCFGWQNGGLQTKGGASLTILGDM 248

Query: 227 QQQGTRVSFNLRNSLIGFTPNKC 249
                 V +++ N +IG+T + C
Sbjct: 249 ALSNKLVVYDIENQVIGWTNHNC 271


>gi|413923876|gb|AFW63808.1| hypothetical protein ZEAMMB73_793799 [Zea mays]
          Length = 415

 Score = 82.4 bits (202), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 71/257 (27%), Positives = 109/257 (42%), Gaps = 24/257 (9%)

Query: 11  GSASVDNIAIGCGHNNEGLFVGAA--GLLGLGGGSLSFPSQINASTFSYCLVDRDSDSTS 68
           GS S + +AIGC  +    F   +  G+ GLG  + S P Q+N S FSYCL         
Sbjct: 161 GSQSFEEVAIGCSTSATLKFKDPSIKGVFGLGRSATSLPRQLNFSKFSYCLSSYQKPDLP 220

Query: 69  TLEFDSSLP----------PNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKI 118
           +    ++ P              T  L  N +  T Y++ L GIS+GG  LP   T    
Sbjct: 221 SYLLLTAAPDMATGAVGGAAAVATTALQPNSDYKTRYFVDLQGISIGGTRLPAVST---- 276

Query: 119 DESGNGGIIVDSGTAVTRLQTETYNAL---RDAFVRGTRALSPTDGVALFDTCYDFSSRS 175
            +SG G + VD+GT+ TRL+   +  L    D  ++  + +    G      CY   S +
Sbjct: 277 -KSG-GNMFVDTGTSFTRLEGTVFAKLVTELDRIMKERKYVKEQPGRNNGQICYSPPSTA 334

Query: 176 SVE---VPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTR 232
           + E   +P +  HF +   + LP  +YL    S        +     +S++GN Q Q T 
Sbjct: 335 ADESSKLPDMVLHFADSANMVLPWDSYLWKTTSKLCLAIDKSNIKGGISVLGNFQMQNTH 394

Query: 233 VSFNLRNSLIGFTPNKC 249
           +  +  N  + F    C
Sbjct: 395 MLLDTGNEKLSFVRADC 411


>gi|255581545|ref|XP_002531578.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223528808|gb|EEF30814.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 442

 Score = 82.4 bits (202), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 83/276 (30%), Positives = 125/276 (45%), Gaps = 29/276 (10%)

Query: 1   GDFVTETVTLGSASVDNIAIGCG----HNNEGLFVGAAGLLGLGGGSLSFPSQINASTFS 56
           G+   ET  LGS +      GC      +N        GL+G+  GSLSF +Q+    FS
Sbjct: 158 GNLAFETFRLGSLTKPATIFGCMDSGFSSNSEEDSKTTGLIGMNRGSLSFVNQMGYPKFS 217

Query: 57  YCLVDRDSDSTSTLEFDSSLP---PNAVTAPLLRNHELDTF----YYLGLTGISVGGDLL 109
           YC+   DS     L  ++S P   P + T  +  +  L  F    Y + L GI V   +L
Sbjct: 218 YCISGFDSAGVLLLG-NASFPWLKPLSYTPLVQISTPLPYFDRVAYTVQLEGIKVKNKVL 276

Query: 110 PISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPT--DGVALF-- 165
            + ++ F  D +G G  +VDSGT  T L    Y AL++ F+  TR +     D   +F  
Sbjct: 277 SLPKSVFVPDHTGAGQTMVDSGTQFTFLLGPVYTALKNEFLSQTRGILKVLNDDNFVFQG 336

Query: 166 --DTCYDF-SSRSSVE-VPTVSFHFPEGKVLPLPAKN--YLIPVDSNG---TFCFAFAPT 216
             D CY   SSR +++ +P VS  F +G  + +  +   Y +P +  G    +CF F  +
Sbjct: 337 AMDLCYLLDSSRPNLQNLPVVSLMF-QGAEMSVSGERLLYRVPGEVRGRDSVWCFTFGNS 395

Query: 217 S---SSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
                   +IG+  QQ   + F+L  S IG    +C
Sbjct: 396 DLLGVEAFVIGHHHQQNVWMEFDLEKSRIGLADVRC 431


>gi|359484086|ref|XP_002263357.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 417

 Score = 82.4 bits (202), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 73/252 (28%), Positives = 103/252 (40%), Gaps = 48/252 (19%)

Query: 35  GLLGLGGGSLSFPSQIN--ASTFSYCLV-----------------DRDSDSTSTLEFDSS 75
           G+ G G G LS PSQ+      FS+C +                 D    S   L+F S 
Sbjct: 164 GIAGFGRGVLSLPSQLGFLQKGFSHCFLGFKFANNPNISSPLVIGDLAISSNDHLQFTS- 222

Query: 76  LPPNAVTAPLLRNHELDTFYYLGLTGISVG-GDLLPISETAFKIDESGNGGIIVDSGTAV 134
                    LL+N     +YY+GL  I+VG    + +  +  + D  GNGG+I+DSGT  
Sbjct: 223 ---------LLKNPMYPNYYYIGLEAITVGNATAIQVPSSLREFDSHGNGGMIIDSGTTY 273

Query: 135 TRLQTETYNAL---RDAFVRGTRALSPTDGVALFDTCYDFSSRSSVE------VPTVSFH 185
           T L    Y  L     + +   RA    +    FD CY     ++V       +P++SFH
Sbjct: 274 THLPGPFYTQLLSMLQSIITYPRA-QEQEARTGFDLCYRIPCPNNVVTDHDHLLPSISFH 332

Query: 186 FPEGKVLPLPAKNYLI----PVDSNGTFCFAFAPTSSSLS----IIGNVQQQGTRVSFNL 237
           F     L LP  N+      P +S    C        S S    + G+ QQQ  +V ++L
Sbjct: 333 FSNNVSLVLPQGNHFYAMGAPSNSTVVKCLLLQNMDDSDSGPAGVFGSFQQQNVKVVYDL 392

Query: 238 RNSLIGFTPNKC 249
               IGF P  C
Sbjct: 393 EKERIGFQPMDC 404


>gi|14532550|gb|AAK64003.1| AT3g61820/F15G16_210 [Arabidopsis thaliana]
          Length = 362

 Score = 82.4 bits (202), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 60/105 (57%), Positives = 72/105 (68%), Gaps = 9/105 (8%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
           GDF TET+T   A VD++ +GCGH+NEGLFVGAAGLLGLG G LSFPSQ        FSY
Sbjct: 226 GDFSTETLTFHGARVDHVPLGCGHDNEGLFVGAAGLLGLGRGGLSFPSQTKNRYNGKFSY 285

Query: 58  CLVDR-----DSDSTSTLEF-DSSLPPNAVTAPLLRNHELDTFYY 96
           CLVDR      S   ST+ F ++++P  +V  PLL N +LDTFYY
Sbjct: 286 CLVDRTSSGSSSKPPSTIVFGNAAVPKTSVFTPLLTNPKLDTFYY 330


>gi|297800470|ref|XP_002868119.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297313955|gb|EFH44378.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 499

 Score = 82.4 bits (202), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 86/302 (28%), Positives = 126/302 (41%), Gaps = 66/302 (21%)

Query: 5   TETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINA------STFSYC 58
           +++++L S SV N   GC H      +G AG    G G LS P+Q++       ++FSYC
Sbjct: 197 SDSLSLPSVSVANFTFGCAHTTLAEPIGVAGF---GRGRLSLPAQLSVHSPHLGNSFSYC 253

Query: 59  LVDRDSDSTSTLE---------FDSSLPPNA------------------VTAPLLRNHEL 91
           LV    DS               D      A                  V   +L N + 
Sbjct: 254 LVSHSFDSDRVRRPSPLILGRFVDKKEKRVATTDDDDDGDETKKKKNEFVFTEMLVNPKH 313

Query: 92  DTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAF-- 149
             FY + L GIS+G   +P      +ID++G GG++VDSGT  T L  + YN++ + F  
Sbjct: 314 PYFYSVSLQGISIGKRNIPAPAMLRRIDKNGGGGVVVDSGTTFTMLPAKFYNSVVEEFDS 373

Query: 150 ------VRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFP-EGKVLPLPAKNYLIP 202
                  R  R + P+ G++    CY  +   +V+VP +  HF   G  + LP +NY   
Sbjct: 374 RVGRVHERADR-VEPSSGMS---PCYYLN--QTVKVPALVLHFAGNGSTVTLPRRNYFYE 427

Query: 203 V----------DSNGTFCFAFAPTSSSL-----SIIGNVQQQGTRVSFNLRNSLIGFTPN 247
                         G          S L     +I+GN QQQG  V ++L N  +GF   
Sbjct: 428 FMDGGDGKEEKRKVGCLMLMNGGDESELRGGTGAILGNYQQQGFEVVYDLLNRRVGFAKR 487

Query: 248 KC 249
           KC
Sbjct: 488 KC 489


>gi|242072067|ref|XP_002451310.1| hypothetical protein SORBIDRAFT_05g027510 [Sorghum bicolor]
 gi|241937153|gb|EES10298.1| hypothetical protein SORBIDRAFT_05g027510 [Sorghum bicolor]
          Length = 509

 Score = 82.4 bits (202), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 78/262 (29%), Positives = 105/262 (40%), Gaps = 30/262 (11%)

Query: 1   GDFVTETVTLGSAS-VDNIAIGCGHNNEGLFVGA--AGLLGLGGGSLSFPSQIN---AST 54
           G  V + ++L   S V     GC H   G F  +  AG++ LG G  S  SQ +      
Sbjct: 265 GTLVADQLSLSPTSQVPKFEFGCSHAARGSFSRSKTAGIMALGRGVQSLVSQTSTKYGQV 324

Query: 55  FSYCLVDRDSDSTS-TLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISE 113
           FSYC     S      L             P+L+   L   Y + L  I+V G  L +  
Sbjct: 325 FSYCFPPTASHKGFFVLGVPRRSSSRYAVTPMLKTPML---YQVRLEAIAVAGQRLDVPP 381

Query: 114 TAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSS 173
           T F        G  +DS T +TRL    Y ALR AF        P       DTCYDF+ 
Sbjct: 382 TVFA------AGAALDSRTVITRLPPTAYQALRSAFRDKMSMYRPAAANGQLDTCYDFTG 435

Query: 174 RSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTF---CFAFAPTS---SSLSIIGNVQ 227
            SS+ +PT+S  F              + +D +G     C AFA T+    +  IIG +Q
Sbjct: 436 VSSIMLPTISLVFDR--------TGAGVQLDPSGVLFGSCLAFASTAGDDRATGIIGFLQ 487

Query: 228 QQGTRVSFNLRNSLIGFTPNKC 249
            Q   V +N+    +GF    C
Sbjct: 488 LQTIEVLYNVAGGSVGFRRGAC 509


>gi|296085344|emb|CBI29076.3| unnamed protein product [Vitis vinifera]
          Length = 434

 Score = 82.4 bits (202), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 73/252 (28%), Positives = 103/252 (40%), Gaps = 48/252 (19%)

Query: 35  GLLGLGGGSLSFPSQIN--ASTFSYCLV-----------------DRDSDSTSTLEFDSS 75
           G+ G G G LS PSQ+      FS+C +                 D    S   L+F S 
Sbjct: 181 GIAGFGRGVLSLPSQLGFLQKGFSHCFLGFKFANNPNISSPLVIGDLAISSNDHLQFTS- 239

Query: 76  LPPNAVTAPLLRNHELDTFYYLGLTGISVG-GDLLPISETAFKIDESGNGGIIVDSGTAV 134
                    LL+N     +YY+GL  I+VG    + +  +  + D  GNGG+I+DSGT  
Sbjct: 240 ---------LLKNPMYPNYYYIGLEAITVGNATAIQVPSSLREFDSHGNGGMIIDSGTTY 290

Query: 135 TRLQTETYNAL---RDAFVRGTRALSPTDGVALFDTCYDFSSRSSVE------VPTVSFH 185
           T L    Y  L     + +   RA    +    FD CY     ++V       +P++SFH
Sbjct: 291 THLPGPFYTQLLSMLQSIITYPRA-QEQEARTGFDLCYRIPCPNNVVTDHDHLLPSISFH 349

Query: 186 FPEGKVLPLPAKNYLI----PVDSNGTFCFAFAPTSSSLS----IIGNVQQQGTRVSFNL 237
           F     L LP  N+      P +S    C        S S    + G+ QQQ  +V ++L
Sbjct: 350 FSNNVSLVLPQGNHFYAMGAPSNSTVVKCLLLQNMDDSDSGPAGVFGSFQQQNVKVVYDL 409

Query: 238 RNSLIGFTPNKC 249
               IGF P  C
Sbjct: 410 EKERIGFQPMDC 421


>gi|226495677|ref|NP_001146995.1| pepsin A precursor [Zea mays]
 gi|195606284|gb|ACG24972.1| pepsin A [Zea mays]
          Length = 504

 Score = 82.4 bits (202), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 80/287 (27%), Positives = 115/287 (40%), Gaps = 63/287 (21%)

Query: 12  SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN---ASTFSYCLVDRDSDSTS 68
           + +VDN    C H   G  VG AG    G G LS P Q++   +  FSYCLV      + 
Sbjct: 222 AVAVDNFTFACAHTALGEPVGVAGF---GRGPLSLPGQLSPQLSGRFSYCLV------SH 272

Query: 69  TLEFDSSLPPNA--------------------VTAPLLRNHELDTFYYLGLTGISVGGDL 108
           +   D  + P+                     V  PLL N +   FY + L  +SVG   
Sbjct: 273 SFRADRLIRPSPLILGRSPDDADAAAAETDGFVYTPLLHNPKHPYFYSVALEAVSVGAAR 332

Query: 109 LPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGT---------RALSPT 159
           +       ++D +GNGG++VDSGT  T L  E Y  + +AF R           RA   T
Sbjct: 333 IQARPELARVDRAGNGGMVVDSGTTFTMLPNEMYARVAEAFARAMAAAGFARAERAEEQT 392

Query: 160 DGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSN----GTF-----C 210
                   CY +++ S   VP ++ HF     + LP +NY +   S     GT      C
Sbjct: 393 G----LTPCYRYAA-SDRGVPPLALHFRGNATVALPRRNYFMGFKSEDAGAGTRKDDVGC 447

Query: 211 FAFAPTSSS--------LSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
                   +           +GN QQQG  V +++    +GF   +C
Sbjct: 448 LMLMNGGDASGEEGDGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRC 494


>gi|297801286|ref|XP_002868527.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297314363|gb|EFH44786.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 444

 Score = 82.0 bits (201), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 78/269 (28%), Positives = 121/269 (44%), Gaps = 25/269 (9%)

Query: 1   GDFVTETVTL-GSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
           G+ V E  T   S +   + +GC   +        G+LG+  G LSF SQ   S FSYC+
Sbjct: 176 GNLVKEKFTFSNSQTTPPLILGCAKES----TDVKGILGMNLGRLSFISQAKISKFSYCI 231

Query: 60  VDRDSDS--TSTLEFDSSLPPNA--------VTAPL-LRNHELDTFYY-LGLTGISVGGD 107
             R +     ST  F     PN+        +T P   R   LD   Y + L GI +G  
Sbjct: 232 PTRSNRPGLASTGSFYLGENPNSRGFKYVSLLTFPQSQRMPNLDPLAYTVPLLGIRIGQK 291

Query: 108 LLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR--GTRALSPTDGVALF 165
            L I  + F+ D  G+G  +VDSG+  T L    Y+ +++  VR  G+R        +  
Sbjct: 292 RLNIPSSVFRPDAGGSGQTMVDSGSEFTHLVDVAYDKVKEEIVRLVGSRLKKGYVYGSTA 351

Query: 166 DTCYDFSSRSSV--EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS---SSL 220
           D C+D + +  +   +  + F F  G  + +  +  L+ V   G  C     +S   ++ 
Sbjct: 352 DMCFDGNHQMVIGRLIGDLVFEFGRGVEILVEKQRLLVNV-GGGIHCVGIGRSSMLGAAS 410

Query: 221 SIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           +IIGNV QQ   V F++ N  +GF+  +C
Sbjct: 411 NIIGNVHQQNLWVEFDVANRRVGFSKAEC 439


>gi|224138580|ref|XP_002326638.1| predicted protein [Populus trichocarpa]
 gi|222833960|gb|EEE72437.1| predicted protein [Populus trichocarpa]
          Length = 496

 Score = 82.0 bits (201), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 80/278 (28%), Positives = 106/278 (38%), Gaps = 47/278 (16%)

Query: 15  VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINA------STFSYCLVDRDSDSTS 68
           V+N   GC H      +G AG    G G LS P+Q+        + FSYCLV    DS  
Sbjct: 213 VNNFTFGCAHTALAEPIGVAGF---GRGVLSLPAQLATLSPQLGNQFSYCLVSHSFDS-D 268

Query: 69  TLEFDSSL------------------PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP 110
            L   S L                   P  V   +L N E   FY +GL GIS+G   +P
Sbjct: 269 RLRRPSPLILGRYDHDEKERRVNGVNKPRFVYTSMLDNLEHPYFYCVGLEGISIGRKKIP 328

Query: 111 ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDT--- 167
                 K+D  G+GG++VDSGT  T L    Y ++   F      ++    V   DT   
Sbjct: 329 APGFLRKVDGEGSGGLVVDSGTTFTMLPASLYGSVVAEFENRVGRVNERARVIEEDTGLS 388

Query: 168 -CYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPV----------DSNGTFCFAFAPT 216
            CY F +        V      G  + LP +NY                 G         
Sbjct: 389 PCYYFDNNVVNVPSVVLHFVGNGSSVVLPRRNYFYEFLDGGDGKGKKRKVGCLMLMNGGE 448

Query: 217 SSSLS-----IIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
            + LS      +GN QQQG  V ++L N  +GF   +C
Sbjct: 449 EAELSGGPGATLGNYQQQGFEVVYDLENKRVGFARRQC 486


>gi|357500973|ref|XP_003620775.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|357500991|ref|XP_003620784.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355495790|gb|AES76993.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355495799|gb|AES77002.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 438

 Score = 82.0 bits (201), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 83/268 (30%), Positives = 123/268 (45%), Gaps = 27/268 (10%)

Query: 1   GDFVTETVTLGS-----ASVDNIAIGCGHNNEGLFVGAAGLLGLGGGS-LSFPSQINAS- 53
           GD   ET+TL S      S     IGCG NN G F   +  +   GG   S  +Q+  S 
Sbjct: 175 GDLSLETLTLESTTGRPVSFPKTVIGCGTNNIGSFKRVSSGVVGLGGGPASLITQLGPSI 234

Query: 54  --TFSYCLVDRD------SDSTSTLEF-DSSLPP--NAVTAPLLR-NHELDTFYYLGLTG 101
              FSYCLV         S  +S L F D ++    N ++ P+++ +H    FYYL +  
Sbjct: 235 GGKFSYCLVRMSITLKNMSMGSSKLNFGDVAIVSGHNVLSTPIVKKDHSF--FYYLTIEA 292

Query: 102 ISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDG 161
            SVG   +  + ++  ++E   G II+DS T VT + ++ Y  L  A V         D 
Sbjct: 293 FSVGDKRVEFAGSSKGVEE---GNIIIDSSTIVTFVPSDVYTKLNSAIVDLVTLERVDDP 349

Query: 162 VALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLS 221
              F  CY+ SS    + P ++ HF    +L L A N  + V +    CFAFAP++   +
Sbjct: 350 NQQFSLCYNVSSDEEYDFPYMTAHFKGADIL-LYATNTFVEV-ARDVLCFAFAPSNGG-A 406

Query: 222 IIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           I G+  QQ   V ++L+   + F    C
Sbjct: 407 IFGSFSQQDFMVGYDLQQKTVSFKSVDC 434


>gi|67633548|gb|AAY78698.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 392

 Score = 82.0 bits (201), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 80/269 (29%), Positives = 113/269 (42%), Gaps = 36/269 (13%)

Query: 1   GDFVTETVTLGSAS-----VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS-- 53
           G   TETVT+ S S     +    IGCGHN+       +G++GL  G  S  +Q+     
Sbjct: 135 GTLATETVTIHSTSGEPFVMPETTIGCGHNSSWFKPTFSGMVGLSWGPSSLITQMGGEYP 194

Query: 54  -TFSYCLVDRDSDSTSTLEFDSSLPPNAVTA-------PLLRNHELDTFYYLGLTGISVG 105
              SYC     S  TS + F +    NA+ A        +         YYL L  +SVG
Sbjct: 195 GLMSYCFA---SQGTSKINFGT----NAIVAGDGVVSTTMFLTTAKPGLYYLNLDAVSVG 247

Query: 106 GDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDA---FVRGTRALSPTDGV 162
              +    T F   E   G II+DSGT +T       N +R+A   +V   R   PT   
Sbjct: 248 DTHVETMGTTFHALE---GNIIIDSGTTLTYFPVSYCNLVREAVDHYVTAVRTADPTGND 304

Query: 163 ALFDTCYDFSSRSSVEV-PTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSL- 220
            L   CY      ++++ P ++ HF  G  L L   N  I   + GTFC A    +    
Sbjct: 305 ML---CY---YTDTIDIFPVITMHFSGGADLVLDKYNMYIETITRGTFCLAIICNNPPQD 358

Query: 221 SIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           +I GN  Q    V ++  + L+ F+P  C
Sbjct: 359 AIFGNRAQNNFLVGYDSSSLLVFFSPTNC 387


>gi|15232960|ref|NP_186923.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|6728988|gb|AAF26986.1|AC018363_31 putative aspartyl protease [Arabidopsis thaliana]
 gi|21593593|gb|AAM65560.1| putative aspartyl protease [Arabidopsis thaliana]
 gi|332640332|gb|AEE73853.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 488

 Score = 82.0 bits (201), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 70/263 (26%), Positives = 116/263 (44%), Gaps = 26/263 (9%)

Query: 2   DFVTETVTLGSASVDNIAIGCGHNNEGLF----VGAAGLLGLGGGSLSFPSQINAS---- 53
           D VT     GS +   I  GCG    G          G++G G  + SF SQ+ +     
Sbjct: 188 DLVTGNRQTGSTN-GTIIFGCGSKQSGQLGESQAAVDGIMGFGQSNSSFISQLASQGKVK 246

Query: 54  -TFSYCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPIS 112
            +F++CL   +++          + P   T P+L        Y + L  I VG  +L +S
Sbjct: 247 RSFAHCL--DNNNGGGIFAIGEVVSPKVKTTPMLSK---SAHYSVNLNAIEVGNSVLELS 301

Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFS 172
             AF  D   + G+I+DSGT +  L    YN L +  +     L+       F TC+ ++
Sbjct: 302 SNAF--DSGDDKGVIIDSGTTLVYLPDAVYNPLLNEILASHPELTLHTVQESF-TCFHYT 358

Query: 173 SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFA------PTSSSLSIIGNV 226
            +     PTV+F F +   L +  + YL  V  + T+CF +          +SL+I+G++
Sbjct: 359 DKLD-RFPTVTFQFDKSVSLAVYPREYLFQVRED-TWCFGWQNGGLQTKGGASLTILGDM 416

Query: 227 QQQGTRVSFNLRNSLIGFTPNKC 249
                 V +++ N +IG+T + C
Sbjct: 417 ALSNKLVVYDIENQVIGWTNHNC 439


>gi|361068721|gb|AEW08672.1| Pinus taeda anonymous locus CL1136Contig1_04 genomic sequence
 gi|383173175|gb|AFG69965.1| Pinus taeda anonymous locus CL1136Contig1_04 genomic sequence
 gi|383173176|gb|AFG69966.1| Pinus taeda anonymous locus CL1136Contig1_04 genomic sequence
          Length = 80

 Score = 82.0 bits (201), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 44/80 (55%), Positives = 54/80 (67%), Gaps = 1/80 (1%)

Query: 86  LRNHELDTFYYLGLTGISVGGD-LLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNA 144
           L N +LDTFYY+ L GISVGG  L  I  + FK+D +GNGG+I+DSGT+VTRL    Y A
Sbjct: 1   LSNPKLDTFYYVELVGISVGGRRLTSIPASVFKMDATGNGGVIIDSGTSVTRLVESAYTA 60

Query: 145 LRDAFVRGTRALSPTDGVAL 164
           +RDAF  GT  L    G +L
Sbjct: 61  MRDAFRAGTGNLKSAGGFSL 80


>gi|115458646|ref|NP_001052923.1| Os04g0448500 [Oryza sativa Japonica Group]
 gi|38344830|emb|CAD40872.2| OSJNBa0064H22.11 [Oryza sativa Japonica Group]
 gi|113564494|dbj|BAF14837.1| Os04g0448500 [Oryza sativa Japonica Group]
          Length = 464

 Score = 82.0 bits (201), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 71/283 (25%), Positives = 118/283 (41%), Gaps = 35/283 (12%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFV--GAAGLLGLGGGSLSFPSQINASTFSYC 58
           G    + + +G  +   +A GC  ++ G      A+G++GLG G LS  SQ++   F+YC
Sbjct: 179 GTLAVDKLVIGEDAFRGVAFGCSTSSTGGAPPPQASGVVGLGRGPLSLVSQLSVRRFAYC 238

Query: 59  LVDRDSDSTSTLEF--DSSLPPNA---VTAPLLRNHELDTFYYLGLTGISVGGDLLPI-- 111
           L    S     L    D+    NA   +  P+ R+    ++YYL L G+ +G   + +  
Sbjct: 239 LPPPASRIPGKLVLGADADAARNATNRIAVPMRRDPRYPSYYYLNLDGLLIGDRAMSLPP 298

Query: 112 ---------------------SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFV 150
                                + TA  + ++   G+I+D  + +T L+   Y+ L +   
Sbjct: 299 TTTTTATATATAPAPAPTPSPNATAVAVGDANRYGMIIDIASTITFLEASLYDELVNDLE 358

Query: 151 RGTRALSPTDGVALFDTCY---DFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNG 207
              R    T      D C+   D  +   V VP V+  F +G+ L L           +G
Sbjct: 359 VEIRLPRGTGSSLGLDLCFILPDGVAFDRVYVPAVALAF-DGRWLRLDKARLFAEDRESG 417

Query: 208 TFCFAFAPTSS-SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
             C       + S+SI+GN QQQ  +V +NLR   + F  + C
Sbjct: 418 MMCLMVGRAEAGSVSILGNFQQQNMQVLYNLRRGRVTFVQSPC 460


>gi|449445943|ref|XP_004140731.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
          Length = 430

 Score = 82.0 bits (201), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 75/271 (27%), Positives = 117/271 (43%), Gaps = 31/271 (11%)

Query: 1   GDFVTETVTLG-SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
           G+ V E  T   S S   + +GC   +        G+LG+  G LSF SQ   S FSYC+
Sbjct: 165 GNLVREKFTFSKSLSTPPVILGCAQAS----TENRGILGMNHGRLSFISQAKISKFSYCV 220

Query: 60  VDR------------DSDSTSTLEFDSSLP-PNAVTAPLLRNHELDTFYY-LGLTGISVG 105
             R            D+ ++S  ++ + L  P + ++P      LD   Y L +  I + 
Sbjct: 221 PSRTGSNPTGLFYLGDNPNSSKFKYVTMLTFPESQSSP-----NLDPLAYTLPMKAIKIA 275

Query: 106 GDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVA-- 163
           G  L I   AFK D  G+G  ++DSG+ +T L  E Y  +++  VR   A+     V   
Sbjct: 276 GKRLNIPPAAFKPDAGGSGQTMIDSGSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYAD 335

Query: 164 LFDTCYDFSSRSSV--EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS--- 218
           + D C+D    + V   +  +SF F  G  + +     ++     G  C     +     
Sbjct: 336 VADMCFDAGVTAEVGRRIGGISFEFDNGVEIFVGRGEGVLTEVEKGVKCVGIGRSERLGI 395

Query: 219 SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
             +IIG V QQ   V ++L N  +GF   +C
Sbjct: 396 GSNIIGTVHQQNMWVEYDLANKRVGFGGAEC 426


>gi|259490398|ref|NP_001159203.1| uncharacterized protein LOC100304289 [Zea mays]
 gi|223942623|gb|ACN25395.1| unknown [Zea mays]
          Length = 378

 Score = 82.0 bits (201), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 73/247 (29%), Positives = 119/247 (48%), Gaps = 15/247 (6%)

Query: 13  ASVDNIAIGCGHNNEGL-FVGAAGLLGLGGGSLSFPSQINAS---TFSYCLVDR--DSDS 66
           A +  + +GC    +G  F  + G+L LG  ++SF S+  A     FSYCLVD     ++
Sbjct: 135 AKLQGVVLGCTATYDGQSFQSSDGVLSLGNSNISFASRAAARFGGRFSYCLVDHLAPRNA 194

Query: 67  TSTLEFDSSLPPNAVTA---PLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGN 123
           +S L F          A   PL+ +  +  FY + +  + V G+ L I    + +     
Sbjct: 195 SSYLTFGPGPEGGGAPAARTPLVLDRRVSPFYAVAVDAVYVAGEALDIPADVWDVGR--G 252

Query: 124 GGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVS 183
           GG I+DSGT++T L T  Y A+  A + G  A  P   +  F+ CY++++  + E+P + 
Sbjct: 253 GGAILDSGTSLTVLATPAYRAVVAA-LGGRLAALPRVAMDPFEYCYNWTA-GAPEIPKLE 310

Query: 184 FHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS-SSLSIIGNVQQQGTRVSFNLRNSLI 242
             F     L  PAK+Y+I   + G  C      +   +S+IGN+ QQ     F+LR+  +
Sbjct: 311 VSFAGSARLEPPAKSYVIDA-APGVKCIGVQEGAWPGVSVIGNILQQEHLWEFDLRDRWL 369

Query: 243 GFTPNKC 249
            F   +C
Sbjct: 370 RFKHTRC 376


>gi|224101053|ref|XP_002334311.1| predicted protein [Populus trichocarpa]
 gi|222871031|gb|EEF08162.1| predicted protein [Populus trichocarpa]
          Length = 496

 Score = 81.6 bits (200), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 80/278 (28%), Positives = 106/278 (38%), Gaps = 47/278 (16%)

Query: 15  VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN------ASTFSYCLVDRDSDSTS 68
           V+N   GC H      +G AG    G G LS P+Q+        + FSYCLV    DS  
Sbjct: 213 VNNFTFGCAHTALAEPIGVAGF---GRGVLSLPAQLATLSPQLGNQFSYCLVSHSFDS-D 268

Query: 69  TLEFDSSL------------------PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP 110
            L   S L                   P  V   +L N E   FY +GL GIS+G   +P
Sbjct: 269 RLRRPSPLILGRYDHDEKERRVNGVNKPRFVYTSMLDNLEHPYFYCVGLEGISIGRKKIP 328

Query: 111 ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDT--- 167
                 K+D  G+GG++VDSGT  T L    Y ++   F      ++    V   DT   
Sbjct: 329 APGFLRKVDGEGSGGLVVDSGTTFTMLPASLYGSVVAEFENRVGRVNERARVIEEDTGLS 388

Query: 168 -CYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPV----------DSNGTFCFAFAPT 216
            CY F +        V      G  + LP +NY                 G         
Sbjct: 389 PCYYFDNNVVNVPSVVLHFVGNGSSVVLPRRNYFYEFLDGGDGKGKKRKVGCLMLMNGGD 448

Query: 217 SSSLS-----IIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
            + LS      +GN QQQG  V ++L N  +GF   +C
Sbjct: 449 EAELSGGPGATLGNYQQQGFEVVYDLENKRVGFARRQC 486


>gi|413948408|gb|AFW81057.1| hypothetical protein ZEAMMB73_038743 [Zea mays]
          Length = 469

 Score = 81.6 bits (200), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 73/247 (29%), Positives = 119/247 (48%), Gaps = 15/247 (6%)

Query: 13  ASVDNIAIGCGHNNEGL-FVGAAGLLGLGGGSLSFPSQINAS---TFSYCLVDR--DSDS 66
           A +  + +GC    +G  F  + G+L LG  ++SF S+  A     FSYCLVD     ++
Sbjct: 226 AKLQGVVLGCTATYDGQSFQSSDGVLSLGNSNISFASRAAARFGGRFSYCLVDHLAPRNA 285

Query: 67  TSTLEFDSSLPPNAVTA---PLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGN 123
           +S L F          A   PL+ +  +  FY + +  + V G+ L I    + +     
Sbjct: 286 SSYLTFGPGPEGGGAPAARTPLVLDRRVSPFYAVAVDAVYVAGEALDIPADVWDVGR--G 343

Query: 124 GGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVS 183
           GG I+DSGT++T L T  Y A+  A + G  A  P   +  F+ CY++++  + E+P + 
Sbjct: 344 GGAILDSGTSLTVLATPAYRAV-VAALGGRLAALPRVAMDPFEYCYNWTA-GAPEIPKLE 401

Query: 184 FHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS-SSLSIIGNVQQQGTRVSFNLRNSLI 242
             F     L  PAK+Y+I   + G  C      +   +S+IGN+ QQ     F+LR+  +
Sbjct: 402 VSFAGSARLEPPAKSYVIDA-APGVKCIGVQEGAWPGVSVIGNILQQEHLWEFDLRDRWL 460

Query: 243 GFTPNKC 249
            F   +C
Sbjct: 461 RFKHTRC 467


>gi|116310064|emb|CAH67085.1| H0818E04.2 [Oryza sativa Indica Group]
 gi|116310187|emb|CAH67199.1| OSIGBa0152K17.11 [Oryza sativa Indica Group]
          Length = 464

 Score = 81.6 bits (200), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 71/283 (25%), Positives = 118/283 (41%), Gaps = 35/283 (12%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFV--GAAGLLGLGGGSLSFPSQINASTFSYC 58
           G    + + +G  +   +A GC  ++ G      A+G++GLG G LS  SQ++   F+YC
Sbjct: 179 GTLAVDKLVIGEDAFRGVAFGCSTSSTGGAPPPQASGVVGLGRGPLSLVSQLSVRRFAYC 238

Query: 59  LVDRDSDSTSTLEF--DSSLPPNA---VTAPLLRNHELDTFYYLGLTGISVGGDLLPI-- 111
           L    S     L    D+    NA   +  P+ R+    ++YYL L G+ +G   + +  
Sbjct: 239 LPPPASRIPGKLVLGADADAARNATNRIAVPMRRDPRYPSYYYLNLDGLLIGDRTMSLPP 298

Query: 112 ---------------------SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFV 150
                                + TA  + ++   G+I+D  + +T L+   Y+ L +   
Sbjct: 299 TTTTTATATATAPAPAPTPSPNATAVAVGDANRYGMIIDIASTITFLEASLYDELVNDLE 358

Query: 151 RGTRALSPTDGVALFDTCY---DFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNG 207
              R    T      D C+   D  +   V VP V+  F +G+ L L           +G
Sbjct: 359 VEIRLPRGTGSSLGLDLCFILPDGVAFDRVYVPAVALAF-DGRWLRLDKARLFAEDRESG 417

Query: 208 TFCFAFAPTSS-SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
             C       + S+SI+GN QQQ  +V +NLR   + F  + C
Sbjct: 418 MMCLMVGRAEAGSVSILGNFQQQNMQVLYNLRRGRVTFVQSPC 460


>gi|222629275|gb|EEE61407.1| hypothetical protein OsJ_15596 [Oryza sativa Japonica Group]
          Length = 466

 Score = 81.6 bits (200), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 74/262 (28%), Positives = 105/262 (40%), Gaps = 44/262 (16%)

Query: 12  SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLVDRDSDSTSTLE 71
           S +V+N    C H      VG AG    G G LS P+Q+  S       D  +   S  +
Sbjct: 215 SMAVENFTFACAHTALAEPVGVAGF---GRGPLSLPAQLAPSLSGS--TDAAAIGASETD 269

Query: 72  FDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSG 131
           F        V  PLL N +   FY + L  +SVGG  +        +D  GNGG++VDSG
Sbjct: 270 F--------VYTPLLHNPKHPYFYSVALEAVSVGGKRIQAQPELGDVDRDGNGGMVVDSG 321

Query: 132 TAVTRLQTETYNALRD-----------AFVRGTRALSPTDGVALFDTCYDFSSRSSVEVP 180
           T  T L ++T+  + D               G  A +   G+A    CY +S  S   VP
Sbjct: 322 TTFTMLPSDTFARVADEFARAMAAARFTRAEGAEAQT---GLA---PCYHYSP-SDRAVP 374

Query: 181 TVSFHFPEGKVLPLPAKNYLIPVDS---NGTFCFAFAPTSSS----------LSIIGNVQ 227
            V+ HF     + LP +NY +   S       C        +             +GN Q
Sbjct: 375 PVALHFRGNATVALPRRNYFMGFKSEEGRSVGCLMLMNVGGNNDDGEDGGGPAGTLGNFQ 434

Query: 228 QQGTRVSFNLRNSLIGFTPNKC 249
           QQG  V +++    +GF   +C
Sbjct: 435 QQGFEVVYDVDAGRVGFARRRC 456


>gi|115483166|ref|NP_001065176.1| Os10g0537800 [Oryza sativa Japonica Group]
 gi|21717159|gb|AAM76352.1|AC074196_10 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433285|gb|AAP54823.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113639785|dbj|BAF27090.1| Os10g0537800 [Oryza sativa Japonica Group]
 gi|215692411|dbj|BAG87831.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 394

 Score = 81.6 bits (200), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 69/263 (26%), Positives = 117/263 (44%), Gaps = 25/263 (9%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNE-GLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
           G   T+T  +G+A   ++A GC   ++     G +G++GLG    S  +Q   + FSYCL
Sbjct: 139 GKVGTDTFAVGTAKA-SLAFGCVVASDIDTMGGPSGIVGLGRTPWSLVTQTGVAAFSYCL 197

Query: 60  VDRDSDSTSTLEFDSSLP----PNAVTAPLL----RNHELDTFYYLGLTGISVGGDLLPI 111
              D+   S L   SS        A + P +      ++L  +Y + L G+  G  ++P+
Sbjct: 198 APHDAGKNSALFLGSSAKLAGGGKAASTPFVNISGNGNDLSNYYKVQLEGLKAGDAMIPL 257

Query: 112 SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDF 171
             +           +++D+ + ++ L    Y A++ A      A      V  FD C+  
Sbjct: 258 PPS--------GSTVLLDTFSPISFLVDGAYQAVKKAVTVAVGAPPMATPVEPFDLCFPK 309

Query: 172 SSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS-----SSLSIIGNV 226
           S  S    P + F F  G  + +PA NYL+    NGT C A   ++     + LS++G++
Sbjct: 310 SGASGA-APDLVFTFRGGAAMTVPATNYLLDYK-NGTVCLAMLSSARLNSTTELSLLGSL 367

Query: 227 QQQGTRVSFNLRNSLIGFTPNKC 249
           QQ+     F+L    + F P  C
Sbjct: 368 QQENIHFLFDLDKETLSFEPADC 390


>gi|115483168|ref|NP_001065177.1| Os10g0538200 [Oryza sativa Japonica Group]
 gi|21717168|gb|AAM76361.1|AC074196_19 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433289|gb|AAP54827.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113639786|dbj|BAF27091.1| Os10g0538200 [Oryza sativa Japonica Group]
 gi|215686408|dbj|BAG87693.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 394

 Score = 81.6 bits (200), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 69/263 (26%), Positives = 118/263 (44%), Gaps = 25/263 (9%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNE-GLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
           G   T+T  +G+A   ++A GC   ++     G +G++GLG    S  +Q   + FSYCL
Sbjct: 139 GKVGTDTFAVGTAKA-SLAFGCVVASDIDTMGGPSGIVGLGRTPWSLVTQTGVAAFSYCL 197

Query: 60  VDRDSDSTSTLEFDSSLP----PNAVTAPLL----RNHELDTFYYLGLTGISVGGDLLPI 111
              D+   S L   SS        A + P +      ++L  +Y + L G+  G  ++P+
Sbjct: 198 APHDAGRNSALFLGSSAKLAGGGKAASTPFVNISGNGNDLSNYYKVQLEGLKAGDAMIPL 257

Query: 112 SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDF 171
             +           +++D+ + ++ L    Y A++ A      A      V  FD C+  
Sbjct: 258 PPS--------GSTVLLDTFSPISFLVDGAYQAVKKAVTAAVGAPPMATPVEPFDLCFP- 308

Query: 172 SSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS-----SSLSIIGNV 226
            S +S   P + F F  G  + +PA NYL+    NGT C A   ++     + LS++G++
Sbjct: 309 KSGASGAAPDLVFTFRGGAAMTVPATNYLLDYK-NGTVCLAMLSSARLNSTTELSLLGSL 367

Query: 227 QQQGTRVSFNLRNSLIGFTPNKC 249
           QQ+     F+L    + F P  C
Sbjct: 368 QQENIHFLFDLDKETLSFEPADC 390


>gi|356570798|ref|XP_003553571.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 500

 Score = 81.6 bits (200), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 74/260 (28%), Positives = 110/260 (42%), Gaps = 23/260 (8%)

Query: 6   ETVTLGSASVDN----IAIGCGHNNEGLFV----GAAGLLGLGGGSLSFPSQINA----- 52
           +TV LG + V N    I  GC     G          G+ G G G+LS  SQ+++     
Sbjct: 190 DTVLLGQSVVANSSSTIIFGCSTYQSGDLTKTDKAVDGIFGFGPGALSVISQLSSRGVTP 249

Query: 53  STFSYCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPIS 112
             FS+CL   + +    L     L P+ V +PL+ +      Y L L  I+V G LLPI 
Sbjct: 250 KVFSHCLKGGE-NGGGVLVLGEILEPSIVYSPLVPSQP---HYNLNLQSIAVNGQLLPID 305

Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFS 172
              F    + N G IVDSGT +  L  E YN    A        S    ++  + CY  S
Sbjct: 306 SNVFA--TTNNQGTIVDSGTTLAYLVQEAYNPFVKAITAAVSQFSKPI-ISKGNQCYLVS 362

Query: 173 SRSSVEVPTVSFHFPEGKVLPLPAKNYLIP---VDSNGTFCFAFAPTSSSLSIIGNVQQQ 229
           +      P VS +F  G  + L  ++YL+    +D    +C  F       +I+G++  +
Sbjct: 363 NSVGDIFPQVSLNFMGGASMVLNPEHYLMHYGFLDGAAMWCIGFQKVEQGFTILGDLVLK 422

Query: 230 GTRVSFNLRNSLIGFTPNKC 249
                ++L N  IG+    C
Sbjct: 423 DKIFVYDLANQRIGWADYDC 442


>gi|218200805|gb|EEC83232.1| hypothetical protein OsI_28526 [Oryza sativa Indica Group]
          Length = 450

 Score = 81.6 bits (200), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 79/271 (29%), Positives = 109/271 (40%), Gaps = 57/271 (21%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFV-----------------GAAGLLGLGGGS 43
           G   T+TV LG ASVD    GCG +N GL                    AAG L LGG +
Sbjct: 212 GVLATDTVALGGASVDGFVFGCGLSNRGLRRPGSAASSPTASPPGTSGDAAGSLSLGGDT 271

Query: 44  LSFPSQINASTFSYCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGIS 103
            S+    NA+  SY          + +  D + PP               FY++ +TG S
Sbjct: 272 SSY---RNATPVSY----------TRMIADPAQPP---------------FYFMNVTGAS 303

Query: 104 VGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR--GTRALSPTDG 161
           V          A      G   +++DSGT +TRL    Y A+R  F R  G         
Sbjct: 304 V-------GGAAVAAAGLGAANVLLDSGTVITRLAPSVYRAVRAEFARQFGAERYPAAPP 356

Query: 162 VALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGT-FCFAFAPTS--S 218
            +L D CY+ +    V+VP ++     G  + + A   L     +G+  C A A  S   
Sbjct: 357 FSLLDACYNLTGHDEVKVPLLTLRLEAGADMTVDAAGMLFMARKDGSQVCLAMASLSFED 416

Query: 219 SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
              IIGN QQ+  RV ++   S +GF    C
Sbjct: 417 QTPIIGNYQQKNKRVVYDTVGSRLGFADEDC 447


>gi|326490597|dbj|BAJ89966.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 450

 Score = 81.6 bits (200), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 73/243 (30%), Positives = 104/243 (42%), Gaps = 25/243 (10%)

Query: 32  GAAGLLGLGGGSLSFPSQINASTFSYCLVDRDSDSTSTLEFDSSLPPNAVTAPLLR-NHE 90
            A GLLG+  GSLSF +Q     F+YC+          L  D    P     PL+  +  
Sbjct: 197 AATGLLGMNRGSLSFVTQTATLRFAYCIAPGQGPGILLLGGDGGAAPPLNYTPLIEISQP 256

Query: 91  LDTF----YYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALR 146
           L  F    Y + L GI VG  LL I ++    D +G G  +VDSGT  T L  + Y AL+
Sbjct: 257 LPYFDRVAYSVQLEGIRVGSALLQIPKSVLTPDHTGAGQTMVDSGTQFTFLLADAYAALK 316

Query: 147 DAFVRGTRALSPTDG------VALFDTCY----DFSSRSSVEVPTVSFHFPEGKVLPLPA 196
             F+   R+L    G         FD C+    +  S +S  +P V       +V     
Sbjct: 317 AEFLNQARSLLAPLGEPGFVFQGAFDACFRGPEERVSAASRLLPEVGLVLRGAEVAVAGE 376

Query: 197 K-NYLIPVDSNG------TFCFAFAPTS---SSLSIIGNVQQQGTRVSFNLRNSLIGFTP 246
           K  Y +P +  G       +C  F  +     S  +IG+  QQ   V ++L+N  +GF P
Sbjct: 377 KLLYSVPGERRGEEGAEAVWCLTFGNSDMAGMSAYVIGHHHQQDVWVEYDLQNGRVGFAP 436

Query: 247 NKC 249
            +C
Sbjct: 437 ARC 439


>gi|30686482|ref|NP_850251.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|122215044|sp|Q3EBM5.1|ASPR1_ARATH RecName: Full=Probable aspartic protease At2g35615; Flags:
           Precursor
 gi|330254036|gb|AEC09130.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 447

 Score = 81.6 bits (200), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 85/275 (30%), Positives = 137/275 (49%), Gaps = 33/275 (12%)

Query: 1   GDFVTETVTLGSASVDNIA-----IGCGHNNEGLF-VGAAGLLGLGGGSLSFPSQINAS- 53
           GD  TETV++ SAS   ++      GCG+NN G F    +G++GLGGG LS  SQ+ +S 
Sbjct: 176 GDVATETVSIDSASGSPVSFPGTVFGCGYNNGGTFDETGSGIIGLGGGHLSLISQLGSSI 235

Query: 54  --TFSYCLVDRDS--DSTSTLEFDSSLPPNA-------VTAPLLRNHELDTFYYLGLTGI 102
              FSYCL  + +  + TS +   ++  P++       V+ PL+    L T+YYL L  I
Sbjct: 236 SKKFSYCLSHKSATTNGTSVINLGTNSIPSSLSKDSGVVSTPLVDKEPL-TYYYLTLEAI 294

Query: 103 SVGGDLLPISETAFKIDESG-----NGGIIVDSGTAVTRLQTETYNALRDAF---VRGTR 154
           SVG   +P + +++  ++ G     +G II+DSGT +T L+   ++    A    V G +
Sbjct: 295 SVGKKKIPYTGSSYNPNDDGILSETSGNIIIDSGTTLTLLEAGFFDKFSSAVEESVTGAK 354

Query: 155 ALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFA 214
            +S   G  L   C+  S  + + +P ++ HF  G  + L   N  + + S    C +  
Sbjct: 355 RVSDPQG--LLSHCFK-SGSAEIGLPEITVHF-TGADVRLSPINAFVKL-SEDMVCLSMV 409

Query: 215 PTSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           PT + ++I GN  Q    V ++L    + F    C
Sbjct: 410 PT-TEVAIYGNFAQMDFLVGYDLETRTVSFQHMDC 443


>gi|449457263|ref|XP_004146368.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 469

 Score = 81.3 bits (199), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 77/250 (30%), Positives = 112/250 (44%), Gaps = 28/250 (11%)

Query: 17  NIAIGCGH-----NNEGLFVGAAGLLGLGGG-SLSFPSQINASTFSYCLVDRD----SDS 66
           NI  GCGH     NN+  +    G+ GLG    ++  +Q+  + FSYC+ D +    + +
Sbjct: 226 NITFGCGHMNIKTNNDDAY---NGVFGLGAYPHITMATQL-GNKFSYCIGDINNPLYTHN 281

Query: 67  TSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGI 126
              L   S +  ++    +   H     YY+ L  ISVG   L I   AFKI   G+GG+
Sbjct: 282 HLVLGQGSYIEGDSTPLQIHFGH-----YYVTLQSISVGSKTLKIDPNAFKISSDGSGGV 336

Query: 127 IVDSGTAVTRLQTETYNALRDAFV---RGTRALSPTDGVALFDTCYD-FSSRSSVEVPTV 182
           ++DSG   T+L    +  L D  V   +G     PT        C+    SR  V  P V
Sbjct: 337 LIDSGMTYTKLANGGFELLYDEIVDLMKGLLERIPTQR-KFEGLCFKGVVSRDLVGFPAV 395

Query: 183 SFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS---SLSIIGNVQQQGTRVSFNLRN 239
           +FHF  G  L L + + L        FC A  P++S   +LS+IG + QQ   V F+L  
Sbjct: 396 TFHFAGGADLVLESGS-LFRQHGGDRFCLAILPSNSELLNLSVIGILAQQNYNVGFDLEQ 454

Query: 240 SLIGFTPNKC 249
             + F    C
Sbjct: 455 MKVFFRRIDC 464


>gi|326524806|dbj|BAK04339.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 460

 Score = 81.3 bits (199), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 76/262 (29%), Positives = 113/262 (43%), Gaps = 29/262 (11%)

Query: 14  SVDNIAIGCGHNNEGLFVGA----AGLLGLGGGSLSF--------PSQINASTFSYCL-- 59
            V  + IGC HN++G    +    AG+LGLG  + S            +    FSYCL  
Sbjct: 194 EVTGVVIGCTHNSKGFNFNSHGVLAGVLGLGRQAPSLIWTLGQHRHGTVQVHRFSYCLPS 253

Query: 60  -VDRDSDSTSTLEFDSSLP--PNAVTAPLLRNHELDTF----YYLGLTGISVGGDLLPIS 112
                SD  + L FD  +P   + V+  ++      +     Y++ LTGISV G  L   
Sbjct: 254 HGSSSSDHHTFLRFDDDVPNTQHMVSTKIMYMDSTTSRDFRAYFVSLTGISVAGKPLQDV 313

Query: 113 ETAFKIDESGN---GGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCY 169
           +  FK    G     G   D+GT    +    YN L+DA VR  + L        +  C+
Sbjct: 314 KELFKRHVHGQVWTSGCAFDAGTPTMVMIMPAYNKLKDAVVRHLKPLGLQIVSGQYHLCF 373

Query: 170 DFSSRSSVEVPTVSFHFPEGKV-LPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQ 228
             +S+    +PTV   F E +  L LP +   + V  +   C A    S  ++IIG +QQ
Sbjct: 374 RATSQLWQHLPTVMLQFAETEARLVLPPQRLFVAVGYD--ICLAVV-RSYDITIIGAMQQ 430

Query: 229 QGTRVSFNLRNSLIGFTP-NKC 249
              R  +++R+  I F P N C
Sbjct: 431 VDKRFVYDVRHGRIYFVPENAC 452


>gi|194690050|gb|ACF79109.1| unknown [Zea mays]
          Length = 166

 Score = 81.3 bits (199), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 57/169 (33%), Positives = 82/169 (48%), Gaps = 15/169 (8%)

Query: 84  PLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYN 143
           PLL+      FY + LTGI+VGG    +  T F      +   IVDSGT +T L    YN
Sbjct: 7   PLLQGP----FYLVNLTGITVGGQ--EVESTGF------SARAIVDSGTVITSLVPSVYN 54

Query: 144 ALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPV 203
           A+R  F+          G ++ DTC++ +    V+VP+++  F  G  + + +   L  V
Sbjct: 55  AVRAEFMSQLAEYPQAPGFSILDTCFNMTGLKEVQVPSLTLVFDGGAEVEVDSGGVLYFV 114

Query: 204 DSNGT-FCFAFAPTSS--SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
            S+ +  C A A   S    SIIGN QQ+  RV F+   S +GF    C
Sbjct: 115 SSDSSQVCLAVASLKSEDETSIIGNYQQKNLRVVFDTSASQVGFAQETC 163


>gi|255550723|ref|XP_002516410.1| pepsin A, putative [Ricinus communis]
 gi|223544445|gb|EEF45965.1| pepsin A, putative [Ricinus communis]
          Length = 416

 Score = 81.3 bits (199), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 70/244 (28%), Positives = 111/244 (45%), Gaps = 32/244 (13%)

Query: 35  GLLGLGGGSLSFPSQIN--ASTFSYCLV-----DRDSDSTSTLEFDSSL--PPNAVTAPL 85
           G+ G   G+LSFPSQ+      FS+C +     +  + S+  +  D++L    N    P+
Sbjct: 164 GIAGFVRGTLSFPSQLGLLKKGFSHCFLAFKYANNPNISSPLVIGDTALSSKDNMQFTPM 223

Query: 86  LRNHELDTFYYLGLTGISVG---GDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETY 142
           L++     +YY+GL  I+VG      +P++   F  D  GNGG+++DSGT  T L    Y
Sbjct: 224 LKSPMYPNYYYIGLEAITVGNVSATTVPLNLREF--DSQGNGGMLIDSGTTYTHLPEPFY 281

Query: 143 NALRDAF---VRGTRALSPTDGVALFDTCYDFSSRSSVEV------PTVSFHFPEGKVLP 193
           + L   F   +   RA +  +  A FD CY     ++         P+++FHF       
Sbjct: 282 SQLLSIFKAIITYPRA-TEVEMRAGFDLCYKVPCPNNRLTDDDNLFPSITFHFLNNVSFV 340

Query: 194 LPAKNYLI----PVDSNGTFCFAFAPTSSS----LSIIGNVQQQGTRVSFNLRNSLIGFT 245
           LP  N+      P +S    C  F   + S      + G+ QQQ  ++ ++L    IGF 
Sbjct: 341 LPQGNHFYAMSAPSNSTVVKCLLFQSMADSDYGPAGVFGSFQQQNVQIVYDLEKERIGFQ 400

Query: 246 PNKC 249
           P  C
Sbjct: 401 PMDC 404


>gi|356553832|ref|XP_003545255.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 427

 Score = 81.3 bits (199), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 81/273 (29%), Positives = 115/273 (42%), Gaps = 29/273 (10%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHN-------NEGLFVGAAGLLGLGGGSLSFPSQINAS 53
           G    ET +L  A+      GC  +       NE       GL+G+  GSLS  +Q+   
Sbjct: 149 GTLAAETFSLAGAAQPGTLFGCMDSAGYTSDINED--AKTTGLMGMNRGSLSLVTQMVLP 206

Query: 54  TFSYCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTF-----YYLGLTGISVGGDL 108
            FSYC+   D+     L    S P      PL+       +     Y + L GI V   L
Sbjct: 207 KFSYCISGEDAFGVLLLGDGPSAPSPLQYTPLVTATTSSPYFDRVAYTVQLEGIKVSEKL 266

Query: 109 LPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRAL--SPTDGVALF- 165
           L + ++ F  D +G G  +VDSGT  T L    YN+L+D F+  T+ +     D   +F 
Sbjct: 267 LQLPKSVFVPDHTGAGQTMVDSGTQFTFLLGPVYNSLKDEFLEQTKGVLTRIEDPNFVFE 326

Query: 166 ---DTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDS--NGTFCFAFAPTSSSL 220
              D CY  +  S   VP V+  F  G  + +  +  L  V    +  +CF F   S  L
Sbjct: 327 GAMDLCYH-APASLAAVPAVTLVF-SGAEMRVSGERLLYRVSKGRDWVYCFTFG-NSDLL 383

Query: 221 SI----IGNVQQQGTRVSFNLRNSLIGFTPNKC 249
            I    IG+  QQ   + F+L  S +GFT   C
Sbjct: 384 GIEAYVIGHHHQQNVWMEFDLVKSRVGFTETTC 416


>gi|449485448|ref|XP_004157171.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
          Length = 430

 Score = 81.3 bits (199), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 74/271 (27%), Positives = 117/271 (43%), Gaps = 31/271 (11%)

Query: 1   GDFVTETVTLG-SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
           G+ V E  T   S S   + +GC   +        G+LG+  G LSF SQ   S FSYC+
Sbjct: 165 GNLVREKFTFSKSLSTPPVILGCAQAS----TENRGILGMNRGRLSFISQAKISKFSYCV 220

Query: 60  VDR------------DSDSTSTLEFDSSLP-PNAVTAPLLRNHELDTFYY-LGLTGISVG 105
             R            D+ ++S  ++ + L  P + ++P      LD   Y L +  I + 
Sbjct: 221 PSRTGSNPTGLFYLGDNPNSSKFKYVTMLTFPESQSSP-----NLDPLAYTLPMKAIKIA 275

Query: 106 GDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVA-- 163
           G  L +   AFK D  G+G  ++DSG+ +T L  E Y  +++  VR   A+     V   
Sbjct: 276 GKRLNVPPAAFKPDAGGSGQTMIDSGSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYAD 335

Query: 164 LFDTCYDFSSRSSV--EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS--- 218
           + D C+D    + V   +  +SF F  G  + +     ++     G  C     +     
Sbjct: 336 VADMCFDAGVTAEVGRRIGGISFEFDNGVEIFVGRGEGVLTEVEKGVKCVGIGRSERLGI 395

Query: 219 SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
             +IIG V QQ   V ++L N  +GF   +C
Sbjct: 396 GSNIIGTVHQQNMWVEYDLANKRVGFGGAEC 426


>gi|255556768|ref|XP_002519417.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
 gi|223541280|gb|EEF42831.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
          Length = 494

 Score = 80.9 bits (198), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 76/269 (28%), Positives = 115/269 (42%), Gaps = 27/269 (10%)

Query: 1   GDFVTET----VTLGSASVDN----IAIGCGHNNEGLFV----GAAGLLGLGGGSLSFPS 48
           G +V++T      LG + + N    I  GC     G          G+ G G G LS  S
Sbjct: 178 GYYVSDTFYFDAVLGESLIANSSAAIVFGCSTYQSGDLTKTDKAVDGIFGFGQGELSVIS 237

Query: 49  QINAS-----TFSYCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGIS 103
           Q+++       FS+CL   DS     L     L P  V +PL+ +      Y L L  I+
Sbjct: 238 QLSSHGITPRVFSHCLKGEDSGG-GILVLGEILEPGIVYSPLVPSQP---HYNLDLQSIA 293

Query: 104 VGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVA 163
           V G LLPI   AF    S N G I+D+GT +  L  E Y+    A       L+ T  + 
Sbjct: 294 VSGQLLPIDPAAFA--TSSNRGTIIDTGTTLAYLVEEAYDPFVSAITAAVSQLA-TPTIN 350

Query: 164 LFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDS---NGTFCFAFAPTSSSL 220
             + CY  S+  S   P VSF+F  G  + L  + YL+ + +      +C  F      +
Sbjct: 351 KGNQCYLVSNSVSEVFPPVSFNFAGGATMLLKPEEYLMYLTNYAGAALWCIGFQKIQGGI 410

Query: 221 SIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           +I+G++  +     ++L +  IG+    C
Sbjct: 411 TILGDLVLKDKIFVYDLAHQRIGWANYDC 439


>gi|4063754|gb|AAC98462.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|20197956|gb|AAM15330.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
          Length = 389

 Score = 80.9 bits (198), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 78/266 (29%), Positives = 112/266 (42%), Gaps = 33/266 (12%)

Query: 1   GDFVTETVTLGSAS-----VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS-- 53
           G  VTETVT+ S S     +    IGCG NN G   G AG++GL  G  S  +Q+     
Sbjct: 135 GTLVTETVTIHSTSGQPFVMPETIIGCGRNNSGFKPGFAGVVGLDRGPKSLITQMGGEYP 194

Query: 54  -TFSYCLVDRDSDSTSTLEFDSSLPPNAVTA-------PLLRNHELDTFYYLGLTGISVG 105
              SYC   +    TS + F +    NA+ A        +        FYYL L  +SVG
Sbjct: 195 GLMSYCFAGK---GTSKINFGA----NAIVAGDGVVSTTVFVKTAKPGFYYLNLDAVSVG 247

Query: 106 GDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALS-PTDGVAL 164
              +    T F    +  G I++DSG+ +T       N +R A  +   A+  P   +  
Sbjct: 248 NTRIETVGTPF---HALKGNIVIDSGSTLTYFPESYCNLVRKAVEQVVTAVRFPRSDIL- 303

Query: 165 FDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS-SLSII 223
              CY   S++    P ++ HF  G  L L   N  +  ++ G FC A    S    +I 
Sbjct: 304 ---CY--YSKTIDIFPVITMHFSGGADLVLDKYNMYVASNTGGVFCLAIICNSPIEEAIF 358

Query: 224 GNVQQQGTRVSFNLRNSLIGFTPNKC 249
           GN  Q    V ++  + L+ F P  C
Sbjct: 359 GNRAQNNFLVGYDSSSLLVSFKPTNC 384


>gi|30683732|ref|NP_180371.2| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|28392898|gb|AAO41885.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|56382011|gb|AAV85724.1| At2g28040 [Arabidopsis thaliana]
 gi|330252978|gb|AEC08072.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 395

 Score = 80.9 bits (198), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 78/266 (29%), Positives = 112/266 (42%), Gaps = 33/266 (12%)

Query: 1   GDFVTETVTLGSAS-----VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS-- 53
           G  VTETVT+ S S     +    IGCG NN G   G AG++GL  G  S  +Q+     
Sbjct: 141 GTLVTETVTIHSTSGQPFVMPETIIGCGRNNSGFKPGFAGVVGLDRGPKSLITQMGGEYP 200

Query: 54  -TFSYCLVDRDSDSTSTLEFDSSLPPNAVTA-------PLLRNHELDTFYYLGLTGISVG 105
              SYC   +    TS + F +    NA+ A        +        FYYL L  +SVG
Sbjct: 201 GLMSYCFAGK---GTSKINFGA----NAIVAGDGVVSTTVFVKTAKPGFYYLNLDAVSVG 253

Query: 106 GDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALS-PTDGVAL 164
              +    T F    +  G I++DSG+ +T       N +R A  +   A+  P   +  
Sbjct: 254 NTRIETVGTPF---HALKGNIVIDSGSTLTYFPESYCNLVRKAVEQVVTAVRFPRSDIL- 309

Query: 165 FDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS-SLSII 223
              CY   S++    P ++ HF  G  L L   N  +  ++ G FC A    S    +I 
Sbjct: 310 ---CY--YSKTIDIFPVITMHFSGGADLVLDKYNMYVASNTGGVFCLAIICNSPIEEAIF 364

Query: 224 GNVQQQGTRVSFNLRNSLIGFTPNKC 249
           GN  Q    V ++  + L+ F P  C
Sbjct: 365 GNRAQNNFLVGYDSSSLLVSFKPTNC 390


>gi|356553263|ref|XP_003544977.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 445

 Score = 80.5 bits (197), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 74/272 (27%), Positives = 117/272 (43%), Gaps = 31/272 (11%)

Query: 1   GDFVTETVTLG-SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
           G+ V E +    S +   + +GC   +      A G+LG+  G LSFP Q   + FSYC+
Sbjct: 177 GNLVREKLAFSPSQTTPPLILGCSSESRD----ARGILGMNLGRLSFPFQAKVTKFSYCV 232

Query: 60  VDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTF-------------YYLGLTGISVGG 106
             R   + +     S    N   +   R   + TF             Y + + GI +GG
Sbjct: 233 PTRQPANNNNFPTGSFYLGNNPNSARFRYVSMLTFPQSQRMPNLDPLAYTVPMQGIRIGG 292

Query: 107 DLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR--GTRALSPTDGVAL 164
             L I  + F+ +  G+G  +VDSG+  T L    Y+ +R+  +R  G R         +
Sbjct: 293 RKLNIPPSVFRPNAGGSGQTMVDSGSEFTFLVDVAYDRVREEIIRVLGPRVKKGYVYGGV 352

Query: 165 FDTCYDFSSRSSVEVPT----VSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS--- 217
            D C+D    +++E+      V+F F +G  + +P +  L  V   G  C     +    
Sbjct: 353 ADMCFD---GNAMEIGRLLGDVAFEFEKGVEIVVPKERVLADV-GGGVHCVGIGRSERLG 408

Query: 218 SSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           ++ +IIGN  QQ   V F+L N  IGF    C
Sbjct: 409 AASNIIGNFHQQNLWVEFDLANRRIGFGVADC 440


>gi|297724111|ref|NP_001174419.1| Os05g0403000 [Oryza sativa Japonica Group]
 gi|50878436|gb|AAT85210.1| hypothetical protein [Oryza sativa Japonica Group]
 gi|222631539|gb|EEE63671.1| hypothetical protein OsJ_18489 [Oryza sativa Japonica Group]
 gi|255676353|dbj|BAH93147.1| Os05g0403000 [Oryza sativa Japonica Group]
          Length = 437

 Score = 80.5 bits (197), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 73/267 (27%), Positives = 110/267 (41%), Gaps = 35/267 (13%)

Query: 13  ASVDNIAIGCGHN--NEGLFVGAAGLLGLGGGSLSFPSQIN-----ASTFSYCLVDRDSD 65
           A+V      CGH    EGL  GA G++ L     +FP+Q+      +  F+ CL    + 
Sbjct: 156 ATVPEFLFTCGHTFLTEGLANGATGMVSLSRARFAFPTQLARTFGFSRRFALCLPPASAA 215

Query: 66  ------------------STSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGD 107
                             S S+L +   L     TA      E    Y +GLTGI V G 
Sbjct: 216 GVVVFGDAPYVFQPGVDLSKSSLIYTPLLVNAVRTAGKYTTGETSIEYLIGLTGIKVNGR 275

Query: 108 LLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDT 167
            +P++ T   ID++G GG  + + +  T L+T  Y A+ DAF   T  +     VA F+ 
Sbjct: 276 DVPLNATLLAIDKNGVGGTTLSTASPYTVLETSIYKAVIDAFAAETATIPRVPAVAPFEL 335

Query: 168 CYD----FSSRSSVEVPTVSFHFPEGKV-LPLPAKNYLIPVDSNGTFCFAFAPTSSSL-- 220
           CYD     S+R+   VPT+        V   +   N ++P    G  C        +L  
Sbjct: 336 CYDGRKVGSTRAGPAVPTIELVLQREAVSWIMYGANSMVPAK-GGALCLGVVDGGPALYP 394

Query: 221 --SIIGNVQQQGTRVSFNLRNSLIGFT 245
              +IG    +   + F+L  S +GF+
Sbjct: 395 SSVVIGGHMMEDNLLEFDLEGSRLGFS 421


>gi|356558489|ref|XP_003547539.1| PREDICTED: uncharacterized protein LOC100817234 [Glycine max]
          Length = 739

 Score = 80.5 bits (197), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 85/246 (34%), Positives = 123/246 (50%), Gaps = 20/246 (8%)

Query: 12  SASVDNIAIGCGHNNEGLFVGAA-GLLGLGGGSLSFPSQINAS---TFSYCLVDR-DSDS 66
           S S   I IGCG NN G F     G++GLGGG +S  S I  S    +SYCLV   + +S
Sbjct: 55  SVSFPKIPIGCGLNNAGTFDSKCFGIVGLGGGVVSLISHIGLSIDSKYSYCLVPLFEFNS 114

Query: 67  TSTLEFDSSLPPNA---VTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGN 123
           TS + F  +        V+ P++     DTFYYL L G+SVG   +   + +   +  GN
Sbjct: 115 TSKINFGENAVVEGLGTVSTPIIPG-SFDTFYYLKLEGMSVGSKRIDFVDASTSNELKGN 173

Query: 124 GGIIVDSGTAVTRLQTETYNALR---DAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVP 180
             II+DSGT +T L    Y  L    +A +   R ++ TD +     CY     +++EVP
Sbjct: 174 --IIIDSGTTLTILLENFYTKLEAEVEAHINLER-VNSTDQI--LSLCYKSPPNNAIEVP 228

Query: 181 TVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNS 240
            ++ HF  G  + L + N  + V  +  + FAFAP +S  SI GN+ Q    V ++L   
Sbjct: 229 IITTHFA-GVDIVLNSLNTFVSVFDDAMW-FAFAPVASG-SIFGNLAQMNHLVGYDLLRK 285

Query: 241 LIGFTP 246
            + F P
Sbjct: 286 TVSFKP 291


>gi|18421660|ref|NP_568551.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|10177438|dbj|BAB10671.1| unnamed protein product [Arabidopsis thaliana]
 gi|15809850|gb|AAL06853.1| AT5g37540/mpa22_p_70 [Arabidopsis thaliana]
 gi|20260182|gb|AAM12989.1| unknown protein [Arabidopsis thaliana]
 gi|23197748|gb|AAN15401.1| unknown protein [Arabidopsis thaliana]
 gi|332006821|gb|AED94204.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 442

 Score = 80.5 bits (197), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 80/271 (29%), Positives = 122/271 (45%), Gaps = 29/271 (10%)

Query: 1   GDFVTETVTLG-SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
           G+ V E  T   S +   + +GC   +        G+LG+  G LSF SQ   S FSYC+
Sbjct: 175 GNLVKEKFTFSNSQTTPPLILGCAKES----TDEKGILGMNLGRLSFISQAKISKFSYCI 230

Query: 60  VDRDSDS--TSTLEFDSSLPPNA--------VTAPL-LRNHELDTFYY-LGLTGISVGGD 107
             R +     ST  F     PN+        +T P   R   LD   Y + L GI +G  
Sbjct: 231 PTRSNRPGLASTGSFYLGDNPNSRGFKYVSLLTFPQSQRMPNLDPLAYTVPLQGIRIGQK 290

Query: 108 LLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR--GTRALSPTDGVALF 165
            L I  + F+ D  G+G  +VDSG+  T L    Y+ +++  VR  G+R        +  
Sbjct: 291 RLNIPGSVFRPDAGGSGQTMVDSGSEFTHLVDVAYDKVKEEIVRLVGSRLKKGYVYGSTA 350

Query: 166 DTCYDFSSRSSVEVPT----VSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS---S 218
           D C+D     S+E+      + F F  G  + +  ++ L+ V   G  C     +S   +
Sbjct: 351 DMCFD--GNHSMEIGRLIGDLVFEFGRGVEILVEKQSLLVNV-GGGIHCVGIGRSSMLGA 407

Query: 219 SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           + +IIGNV QQ   V F++ N  +GF+  +C
Sbjct: 408 ASNIIGNVHQQNLWVEFDVTNRRVGFSKAEC 438


>gi|125606590|gb|EAZ45626.1| hypothetical protein OsJ_30294 [Oryza sativa Japonica Group]
          Length = 431

 Score = 80.5 bits (197), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 76/269 (28%), Positives = 126/269 (46%), Gaps = 23/269 (8%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAA---GLLGLGGGSLSFPSQINASTFSY 57
           G F TET  LG+ +V NI  GCG  N+G +   A   G+   G G +S  +Q+    FSY
Sbjct: 158 GYFATETFALGNVTVANITFGCGTRNQGYYDNVAGVFGVGRGGRGGVSLLNQLGIDRFSY 217

Query: 58  CLVDRDSDSTSTLEFDSS-------LPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP 110
           C     +  +S +    S           A + P++ +  L + Y++ L G++VG  L+ 
Sbjct: 218 CFSSSGAPGSSAVFLGGSPELATNATTTPAASTPMVADPVLKSGYFVKLVGVTVGATLVD 277

Query: 111 ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTD-----GVALF 165
           ++  +    E G   +++DS + VT L   TY  +R A V     L   +     GV L 
Sbjct: 278 VAGASSA--EGGGRALVIDSTSPVTVLDEATYGPVRRALVAQLAPLKEANANASAGVGL- 334

Query: 166 DTCYDFSSRSSVEVP---TVSFHFPEGKV-LPLPAKNYLIPVDSNGTFCFAFAPTSSS-L 220
           D C++ ++  +   P   T++ HF  G   L LP  +YL    + G  C    P+SS+ +
Sbjct: 335 DLCFELAAGGATPTPPNVTMTLHFDGGAADLVLPPASYLAKDSAGGLICLTMTPSSSNGV 394

Query: 221 SIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
            ++G+     T V ++L  +++ F P  C
Sbjct: 395 PVLGSWALLDTLVLYDLAKNVVSFQPLDC 423


>gi|50511404|gb|AAT77327.1| hypothetical protein [Oryza sativa Japonica Group]
 gi|222631431|gb|EEE63563.1| hypothetical protein OsJ_18380 [Oryza sativa Japonica Group]
          Length = 480

 Score = 80.5 bits (197), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 78/272 (28%), Positives = 117/272 (43%), Gaps = 31/272 (11%)

Query: 1   GDFVTETVTLGSASVDN-----IAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTF 55
           G+   +  T G  S D      +  GC  + EG F GA+G+LGL  G+LS  SQ+N   F
Sbjct: 163 GNLAVQNFTFGDDSEDTAVKGVVTFGCSSSTEGDF-GASGVLGLNKGNLSLVSQLNLGRF 221

Query: 56  SYCLVDR--DSDSTSTLEFDSSLPPNAVTAP-----------------LLRNHELDTFYY 96
           SY        +D+ +  +F      + +T P                  +R+  LD  Y+
Sbjct: 222 SYYFAPEVNTTDNNAADDFIVFGDDDGITVPGNSGGSRPRYTPFFTTGAVRSANLD-LYF 280

Query: 97  LGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRAL 156
           + LTGI VGG  L +              ++  S   VT L+   Y  L+   V    + 
Sbjct: 281 VELTGIRVGGKDLQLGGGGGGSAGGSLEAVLSTS-VPVTYLEKNAYGLLKKELVSALGSN 339

Query: 157 SPTDGVAL-FDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP 215
           +  DG AL  D CY        ++P ++F F    V+ L   NYL   +  G  C    P
Sbjct: 340 NTEDGSALGLDLCYRSQHMDRAKIPDIAFVFGGNAVMKLQQWNYLYQDEDTGLECLTIPP 399

Query: 216 T---SSSLSIIGNVQQQGTRVSFNLRNSLIGF 244
           +   S  LS+IG++ Q GT + ++L  S +GF
Sbjct: 400 SPDDSDGLSLIGSMIQTGTYMIYDLHKSRLGF 431


>gi|356499109|ref|XP_003518386.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 428

 Score = 80.5 bits (197), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 81/278 (29%), Positives = 121/278 (43%), Gaps = 39/278 (14%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHN-------NEGLFVGAAGLLGLGGGSLSFPSQINAS 53
           G    ET +L  A+      GC  +       NE       GL+G+  GSLS  +Q++  
Sbjct: 150 GTLAAETFSLAGAAQPGTLFGCMDSAGYTSDINED--SKTTGLMGMNRGSLSLVTQMSLP 207

Query: 54  TFSYCLVDRDS----------DSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGIS 103
            FSYC+   D+          D+ S L++ + L     ++P          Y + L GI 
Sbjct: 208 KFSYCISGEDALGVLLLGDGTDAPSPLQY-TPLVTATTSSPYFNR----VAYTVQLEGIK 262

Query: 104 VGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRAL--SPTDG 161
           V   LL + ++ F  D +G G  +VDSGT  T L    Y++L+D F+  T+ +     D 
Sbjct: 263 VSEKLLQLPKSVFVPDHTGAGQTMVDSGTQFTFLLGSVYSSLKDEFLEQTKGVLTRIEDP 322

Query: 162 VALF----DTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVD--SNGTFCFAFAP 215
             +F    D CY  +  S   VP V+  F  G  + +  +  L  V   S+  +CF F  
Sbjct: 323 NFVFEGAMDLCYH-APASFAAVPAVTLVF-SGAEMRVSGERLLYRVSKGSDWVYCFTFG- 379

Query: 216 TSSSLSI----IGNVQQQGTRVSFNLRNSLIGFTPNKC 249
            S  L I    IG+  QQ   + F+L  S +GFT   C
Sbjct: 380 NSDLLGIEAYVIGHHHQQNVWMEFDLLKSRVGFTQTTC 417


>gi|413953789|gb|AFW86438.1| hypothetical protein ZEAMMB73_078928 [Zea mays]
          Length = 155

 Score = 80.1 bits (196), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 56/159 (35%), Positives = 78/159 (49%), Gaps = 14/159 (8%)

Query: 93  TFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRG 152
           TF  + L GI+VGG  L +  +AF      +GG+IVD GT +T LQ+  Y ALR AF + 
Sbjct: 9   TFSTVTLAGINVGGKKLDLRPSAF------SGGMIVDCGTVITGLQSTAYRALRSAFRKA 62

Query: 153 TRA--LSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFC 210
             A  L P   +   DTCY+ +   +V VP ++  F  G  + L   N  +    NG   
Sbjct: 63  MEAYRLLPNGDL---DTCYNLTGYKNVVVPKIALTFTGGATINLDVPNGSL---VNGCLA 116

Query: 211 FAFAPTSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           FA +    S  ++GNV Q+   V F+   S  GF    C
Sbjct: 117 FAESGPDGSAGVLGNVNQRAFEVLFDTSTSKFGFRAKAC 155


>gi|326525377|dbj|BAK07958.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 463

 Score = 80.1 bits (196), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 70/253 (27%), Positives = 110/253 (43%), Gaps = 22/253 (8%)

Query: 18  IAIGCGHNNEGLF--VGAAGLLGLGGGSL-----SFPSQI---NASTFSYCLVDRDSDST 67
           I  GC H  E        AG+LGLG G       +F  Q+   +   FSYC         
Sbjct: 207 IVFGCAHQTEHFKNQRAVAGILGLGMGPAGKPPTAFTKQVLPAHGGRFSYCPFVPGMSMY 266

Query: 68  STLEFDSSLP----PNA--VTAPLLRNHELDTFYYLGLTGISVGGDLLP-ISETAFKIDE 120
           S L F S +P    PN    + P+L        Y++ L G+SVG + L  ++   F+ + 
Sbjct: 267 SYLRFGSDIPSHPPPNVHRQSTPVLAPAHNSEAYFVKLAGVSVGANRLSGVTPAMFRRNA 326

Query: 121 SGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVP 180
            G GG +VD GT +T      Y  +  A  +  +       V   +TC    +     +P
Sbjct: 327 HGAGGCVVDIGTRMTAFIHSAYVHIDHAVRQHLQRRGAHIVVVRGNTCVQQPAPHHDVLP 386

Query: 181 TVSFHFPEGKVLPLPAKNYLIP--VDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLR 238
           +++ HF  G  L +  ++  +P  V  +   CF F  +S+ L++IG  QQ   R  F+L 
Sbjct: 387 SMTLHFENGAWLRVMPEHVFMPFVVGGHHYQCFGFV-SSTDLTVIGARQQVNHRFIFDLH 445

Query: 239 NS--LIGFTPNKC 249
           ++  ++ F P  C
Sbjct: 446 DTIPIMSFNPEDC 458


>gi|356542355|ref|XP_003539632.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 444

 Score = 80.1 bits (196), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 84/263 (31%), Positives = 118/263 (44%), Gaps = 19/263 (7%)

Query: 1   GDFVTETVTLGSASVDNI-----AIGCGHNNEGLFVGAAGLLGLGGGS----LSFPSQIN 51
           GD   ET+TLGS    ++      IGCGHNN+G F      +   GG     +S  S   
Sbjct: 184 GDLSVETLTLGSTDGSSVQFPKTVIGCGHNNKGTFQREGSGIVGLGGGPVSLISQLSSSI 243

Query: 52  ASTFSYCLVD--RDSDSTSTLEF-DSSLPPN--AVTAPLLRNHELDTFYYLGLTGISVGG 106
              FSYCL      S+S+S L F D ++      V+ P++  + L  FY+L L   SVG 
Sbjct: 244 GGKFSYCLAPLFSQSNSSSKLNFGDEAVVSGRGTVSTPIVPKNGLG-FYFLTLEAFSVGD 302

Query: 107 DLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFD 166
           + +    ++      G G II+DSGT +T L  + Y  L  A           D      
Sbjct: 303 NRI-EFGSSSFESSGGEGNIIIDSGTTLTILPEDDYLNLESAVADAIELERVEDPSKFLR 361

Query: 167 TCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNV 226
            CY  +S   + VP ++ HF +G  + L   +  I VD  G  CFAF  +S    I GN+
Sbjct: 362 LCYRTTSSDELNVPVITAHF-KGADVELNPISTFIEVDE-GVVCFAFR-SSKIGPIFGNL 418

Query: 227 QQQGTRVSFNLRNSLIGFTPNKC 249
            QQ   V ++L    + F P  C
Sbjct: 419 AQQNLLVGYDLVKQTVSFKPTDC 441


>gi|356537161|ref|XP_003537098.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 601

 Score = 79.7 bits (195), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 69/275 (25%), Positives = 112/275 (40%), Gaps = 30/275 (10%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
           G  ++E +   + +V +  +GC   +        G+ G G G  S P+Q+N + FSYCL+
Sbjct: 326 GFLLSENLNFPAKNVSDFLVGCSVVS---VYQPGGIAGFGRGEESLPAQMNLTRFSYCLL 382

Query: 61  DRDSDST---STLEFDSS-----LPPNAVTA------PLLRNHELDTFYYLGLTGISVGG 106
               D +   S L  +++        N V+       P  +      +YY+ L  I VG 
Sbjct: 383 SHQFDESPENSDLVMEATNSGEGKKTNGVSYTAFLKNPSTKKPAFGAYYYITLRKIVVGE 442

Query: 107 DLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRG---TRALSPTDGVA 163
             + +     + D +G+GG IVDSG+ +T ++   ++ + + FV+    TRA        
Sbjct: 443 KRVRVPRRMLEPDVNGDGGFIVDSGSTLTFMERPIFDLVAEEFVKQVNYTRARELEKQFG 502

Query: 164 LFDTCYDFS-SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSS--- 219
           L   C+  +    +   P + F F  G  + LP  NY   V      C        +   
Sbjct: 503 L-SPCFVLAGGAETASFPEMRFEFRGGAKMRLPVANYFSRVGKGDVACLTIVSDDVAGQG 561

Query: 220 -----LSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
                  I+GN QQQ   V  +L N   GF    C
Sbjct: 562 GAVGPAVILGNYQQQNFYVECDLENERFGFRSQSC 596


>gi|449441618|ref|XP_004138579.1| PREDICTED: uncharacterized protein LOC101220661 [Cucumis sativus]
          Length = 2819

 Score = 79.7 bits (195), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 79/274 (28%), Positives = 121/274 (44%), Gaps = 30/274 (10%)

Query: 1    GDFVTETVTLGSASVDNIAIGCGHN----NEGLFVGAAGLLGLGGGSLSFPSQINASTFS 56
            G+  ++   +GS+++     GC  +    N        GL+G+  GSLSF +Q+    FS
Sbjct: 1089 GNLASDNFRIGSSALPGTLFGCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVTQLGLPKFS 1148

Query: 57   YCLVDRDSDSTSTL-EFDSSLPPNAVTAPLLR-NHELDTF----YYLGLTGISVGGDLLP 110
            YC+  RDS       +   S   N    PL++ +  L  F    Y + L GI VG  +LP
Sbjct: 1149 YCISGRDSSGVLLFGDLHLSWLGNLTYTPLVQISTPLPYFDRVAYTVQLDGIRVGNKILP 1208

Query: 111  ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALF----- 165
            + ++ F  D +G G  +VDSGT  T L    Y ALR+ F+  T+ +    G   F     
Sbjct: 1209 LPKSIFAPDHTGAGQTMVDSGTQFTFLLGPVYTALRNEFLEQTKGVLAPLGDPNFVFQGA 1268

Query: 166  -DTCYDFSSRSSV-EVPTVSFHFPEGK-VLPLPAKNYLIPVDSNG---TFCFAFAPTSSS 219
             D CY  ++   +  +P+VS  F   + V+      Y +P    G    +C  F   S  
Sbjct: 1269 MDLCYSVAAGGKLPTLPSVSLMFRGAEMVVGGEVLLYRVPEMMKGNEWVYCLTFG-NSDL 1327

Query: 220  LSI----IGNVQQQGTRVSFNLRNSLIGFTPNKC 249
            L I    IG+  QQ   + F+    L+ F  + C
Sbjct: 1328 LGIEAFVIGHHHQQNVWMEFD----LVAFAADLC 1357


>gi|357460823|ref|XP_003600693.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355489741|gb|AES70944.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 431

 Score = 79.7 bits (195), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 77/270 (28%), Positives = 115/270 (42%), Gaps = 28/270 (10%)

Query: 1   GDFVTETVTLG-SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
           G+ V E  T   S S   + +GC   +        G+LG+  G LSF  Q   + FSYC+
Sbjct: 164 GNLVREKFTFSRSVSTPPLILGCATES----TDPRGILGMNLGRLSFAKQSKITKFSYCV 219

Query: 60  VDRDSDS--TSTLEFDSSLPPNA--------VTAPLLRNHELDTFYY-LGLTGISVGGDL 108
             R +    T T  F     P++        +T+   R    D   Y + + GI + G  
Sbjct: 220 PPRQTRPGFTPTGSFYLGNNPSSKGFKYVGMMTSSRQRMPNFDPLAYTIPMVGIRIAGKK 279

Query: 109 LPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR--GTRALSPTDGVALFD 166
           L IS   F+ D  G+G  ++DSG+  T L +E Y+ +R   VR  G R         + D
Sbjct: 280 LNISPAVFRADAGGSGQTMIDSGSEFTYLVSEAYDKVRAQVVRAVGPRLKKGYVYGGVAD 339

Query: 167 TCYDFSSRSSVEVPT----VSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS---SS 219
            C+D  S  +VE+      + F F  G  + +P +  L  V   G  C     +    ++
Sbjct: 340 MCFD--SVKAVEIGRLIGEMVFEFERGVEVVIPKERVLADV-GGGVHCVGIGSSDKLGAA 396

Query: 220 LSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
            +IIGN  QQ   V F+L    +GF    C
Sbjct: 397 SNIIGNFHQQNLWVEFDLVRRRVGFGKADC 426


>gi|224074147|ref|XP_002304273.1| predicted protein [Populus trichocarpa]
 gi|222841705|gb|EEE79252.1| predicted protein [Populus trichocarpa]
          Length = 496

 Score = 79.7 bits (195), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 72/276 (26%), Positives = 102/276 (36%), Gaps = 45/276 (16%)

Query: 16  DNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINA------STFSYCLVDRDSDS--- 66
           +N   GC H      +G AG    G G LS P+Q+        + FSYCLV    DS   
Sbjct: 214 NNFTFGCAHTTLAEPIGVAGF---GRGVLSLPAQLATLSPQLGNQFSYCLVSHSFDSDRV 270

Query: 67  --------------TSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPIS 112
                               +    P+ V   +L N     FY +GL GIS+G   +P  
Sbjct: 271 RRPSPLILGRYDHDEKERRVNGVKKPSFVYTSMLDNPRHPYFYCVGLEGISIGRKKIPAP 330

Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDT----C 168
           +   K+D  G+GG++VDSGT  T L    Y+ +   F      ++    V   +T    C
Sbjct: 331 DFLRKVDRKGSGGVVVDSGTTFTMLPASLYDFVVAEFENRVGRVNERASVIEENTGLSPC 390

Query: 169 YDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPV--------DSNGTFCFAFAPTSSSL 220
           Y F +        V      G  + LP +NY                  C          
Sbjct: 391 YYFDNNVVNVPRVVLHFVGNGSSVVLPRRNYFYEFLDGGHGKGKKRKVGCLMLMNGGDEA 450

Query: 221 SI-------IGNVQQQGTRVSFNLRNSLIGFTPNKC 249
            +       +GN QQQG  V ++L N  +GF   +C
Sbjct: 451 ELSGGPGATLGNYQQQGFEVVYDLENRRVGFARRQC 486


>gi|357491945|ref|XP_003616260.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355517595|gb|AES99218.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 441

 Score = 79.3 bits (194), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 78/271 (28%), Positives = 121/271 (44%), Gaps = 31/271 (11%)

Query: 1   GDFVTETVTLG-SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
           G+ V E + L  S +   I +GC + ++     A G+LG+  G LSFP+Q   + FSY +
Sbjct: 165 GNLVRENIALSPSLTTPPIILGCANQSDD----ARGILGMNLGRLSFPNQAKITKFSYFV 220

Query: 60  -VDRDSDSTSTLEFDSSLPPNAVT---APLL--------RNHELDTFYY-LGLTGISVGG 106
            V +    + +L   ++  PN+       LL        R   LD   + L + GIS+GG
Sbjct: 221 PVKQTQPGSGSLYLGNN--PNSSCFRYVKLLTFSKSQSQRMPNLDPLAFTLPMQGISIGG 278

Query: 107 DLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTD----GV 162
             L I  + FK D +G G  I+DSG+  + +  + YN +R+  V+   +    D    GV
Sbjct: 279 KKLNIPPSVFKPDTTGFGQTIIDSGSEFSYMVDKAYNVIRNELVKKVGSKIKKDYIYGGV 338

Query: 163 ALFDTCYD-FSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLS 221
           A  D C+D  ++     V  + F F +G  + +P +  LI VD  G  CF          
Sbjct: 339 A--DICFDGDATEIGRLVGDMVFEFEKGVEIVIPKERVLIEVDG-GVHCFGIGRAEGLGG 395

Query: 222 IIGNVQ---QQGTRVSFNLRNSLIGFTPNKC 249
               +    QQ   V F+L    +GF    C
Sbjct: 396 GGNIIGNFYQQNLWVEFDLAKHRVGFRGANC 426


>gi|356495752|ref|XP_003516737.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 396

 Score = 79.3 bits (194), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 85/263 (32%), Positives = 115/263 (43%), Gaps = 21/263 (7%)

Query: 1   GDFVTETVTLGSAS-----VDNIAIGCGHNNEGLF-VGAAGLLGLGGGSLSFPSQIN--- 51
           G    ETVT  S       V +I  GCGH+N G F     G++GLGGG LS  SQ     
Sbjct: 138 GVLARETVTFSSTDGEPVVVGDIVFGCGHSNSGTFNENDMGIIGLGGGPLSLVSQFGNLY 197

Query: 52  -ASTFSYCLV--DRDSDSTSTLEFD--SSLPPNAVTAPLLRNHELDTFYYLGLTGISVGG 106
            +  FS CLV    D  +  T+ F   S +    V A  L + E  T Y + L GISVG 
Sbjct: 198 GSKRFSQCLVPFHADPHTLGTISFGDASDVSGEGVAATPLVSEEGQTPYLVTLEGISVGD 257

Query: 107 DLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFD 166
             +  + +         G I++DSGT  T L  E Y+ L       +  L P D      
Sbjct: 258 TFVSFNSSEML----SKGNIMIDSGTPATYLPQEFYDRLVKELKVQSNML-PIDDDPDLG 312

Query: 167 TCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNV 226
           T   + S +++E P +  HF    V  +P + ++ P D  G FCFA A T+    I GN 
Sbjct: 313 TQLCYRSETNLEGPILIAHFEGADVQLMPIQTFIPPKD--GVFCFAMAGTTDGEYIFGNF 370

Query: 227 QQQGTRVSFNLRNSLIGFTPNKC 249
            Q    + F+L    + F    C
Sbjct: 371 AQSNVLIGFDLDRKTVSFKATDC 393


>gi|115484725|ref|NP_001067506.1| Os11g0215400 [Oryza sativa Japonica Group]
 gi|77549255|gb|ABA92052.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113644728|dbj|BAF27869.1| Os11g0215400 [Oryza sativa Japonica Group]
          Length = 428

 Score = 79.3 bits (194), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 68/245 (27%), Positives = 112/245 (45%), Gaps = 26/245 (10%)

Query: 19  AIGCGHNNEGL--FVGAAGLLGLGGGSLSFPSQINAS--TFSYCLVDRDSD----STSTL 70
           + GC  ++ G   F    GLLG+G G +S   Q + +   FSYCL  + S+    S +T 
Sbjct: 191 SFGCNMDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDCFSYCLPLQKSERGFFSKTTG 250

Query: 71  EF---DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGII 127
            F     +   +     ++   +    +++ LT ISV G+ L +S + F        G++
Sbjct: 251 YFSLGKVATRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSVF-----SRKGVV 305

Query: 128 VDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDT---CYDFSSRSSVEVPTVSF 184
            DSG+ ++ +     + L        R L    G A  ++   CYD  S    ++P +S 
Sbjct: 306 FDSGSELSYIPDRALSVLSQRI----RELLLKRGAAEEESERNCYDMRSVDEGDMPAISL 361

Query: 185 HFPEGKVLPLPAKNYLIP--VDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNSLI 242
           HF +G    L +    +   V     +C AFAPT S +SIIG++ Q    V ++L+  LI
Sbjct: 362 HFDDGARFDLGSHGVFVERSVQEQDVWCLAFAPTES-VSIIGSLMQTSKEVVYDLKRQLI 420

Query: 243 GFTPN 247
           G  P+
Sbjct: 421 GIGPS 425


>gi|222634868|gb|EEE65000.1| hypothetical protein OsJ_19937 [Oryza sativa Japonica Group]
          Length = 402

 Score = 79.3 bits (194), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 59/156 (37%), Positives = 73/156 (46%), Gaps = 17/156 (10%)

Query: 98  GLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALS 157
           G  GI VGG  L +    F       GG ++DS   +T+L    Y ALR AF R   A  
Sbjct: 260 GTMGIEVGGRRLNVPPVVFA------GGAVMDSSVIITQLPPTAYRALRLAF-RSAMAAY 312

Query: 158 P--TDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP 215
           P    G A  DTCYDF   +SV VP VS  F  G V+ L A   ++        C AF P
Sbjct: 313 PRVAGGRAGLDTCYDFVRFTSVTVPAVSLVFDGGAVVRLDAMGVMV------EGCLAFVP 366

Query: 216 TSS--SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           T    +L  IGNVQQQ   V +++    +GF    C
Sbjct: 367 TPGDFALGFIGNVQQQTHEVLYDVVGGSVGFRRGAC 402


>gi|125552953|gb|EAY98662.1| hypothetical protein OsI_20585 [Oryza sativa Indica Group]
          Length = 429

 Score = 79.3 bits (194), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 72/255 (28%), Positives = 110/255 (43%), Gaps = 50/255 (19%)

Query: 35  GLLGLGGGSLSFPSQIN--ASTFSYCLVD----RDSDSTSTL---EFDSSLPPNAVTAPL 85
           G+ G G G LS PSQ+      FS+C +     R+ + TS+L   +   S   + +  P+
Sbjct: 181 GIAGFGKGILSLPSQLGFLDKGFSHCFLGFRFARNPNFTSSLIMGDLALSAKDDFLFTPM 240

Query: 86  LRNHELDTFYYLGLTGISVG-GDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNA 144
           L++     FYY+GL G+S+G G  +    +   ID  GNGG+IVD+GT  T L    Y A
Sbjct: 241 LKSITNPNFYYIGLEGVSIGDGAAIAAPPSLSSIDSEGNGGMIVDTGTTYTHLPDPFYTA 300

Query: 145 LRDAFVRGTRALSPTDGVALFDTCYDFSSRSSV----------------EVPTVSFHFPE 188
           +          LS    V L++  YD   R+                  E+P ++FHF  
Sbjct: 301 I----------LSSLASVILYERSYDLEMRTGFDLCFKIPCTHTPCTQDELPLINFHFLG 350

Query: 189 GKVLPLPAKN--YLI--PVDSNGTFCFAF----------APTSSSLSIIGNVQQQGTRVS 234
              L LP  +  Y +  P +S    C  F             +   +++G+ Q Q   V 
Sbjct: 351 DVKLTLPKDSCYYAVTAPKNSVVVKCLLFQRMDDEDDVGGANNGPGAVLGSFQMQNVEVV 410

Query: 235 FNLRNSLIGFTPNKC 249
           +++    IGF P  C
Sbjct: 411 YDMEAGRIGFQPKDC 425


>gi|125532795|gb|EAY79360.1| hypothetical protein OsI_34488 [Oryza sativa Indica Group]
          Length = 342

 Score = 79.0 bits (193), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 72/262 (27%), Positives = 117/262 (44%), Gaps = 28/262 (10%)

Query: 5   TETVTLGSASVDNIAIGCGHN-NEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLVDR- 62
           T+T  +G+A+  ++A GC  + N    +GA+G++GLG    S   Q+NA+ FSYCL    
Sbjct: 88  TDTFAIGTATA-SLAFGCAMDSNIKQLLGASGVVGLGRTPWSLVGQMNATAFSYCLAPHG 146

Query: 63  DSDSTSTLEFDSSLP----PNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKI 118
            +   S L   +S       +A T PL+   +  + Y + L GI   GD++        I
Sbjct: 147 AAGKKSALLLGASAKLAGGKSAATTPLVNTSDDSSDYMIHLEGIKF-GDVI--------I 197

Query: 119 DESGNGGII-VDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCY-----DFS 172
           +   NG ++ VD+   V+ L    ++A++ A      A         FD C+        
Sbjct: 198 EPPPNGSVVLVDTIFGVSFLVDAAFHAIKKAVTVAVGAAPMATPTKPFDLCFPKAAAAAG 257

Query: 173 SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS-----SSLSIIGNVQ 227
           + SS+ +P V   F     L +P   Y+     NGT C A   ++     + LSI+G + 
Sbjct: 258 ANSSLPLPDVVLTFQGAAALTVPPSKYMYDA-GNGTVCLAMMSSAMLNLTTELSILGRLH 316

Query: 228 QQGTRVSFNLRNSLIGFTPNKC 249
           Q+     F+L    + F P  C
Sbjct: 317 QENIHFLFDLDKETLSFEPADC 338


>gi|297829808|ref|XP_002882786.1| hypothetical protein ARALYDRAFT_478632 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297328626|gb|EFH59045.1| hypothetical protein ARALYDRAFT_478632 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 449

 Score = 78.6 bits (192), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 79/271 (29%), Positives = 121/271 (44%), Gaps = 28/271 (10%)

Query: 1   GDFVTETVTLG-----SASVDNIAIGCGHNNEGLFVGAA-GLLGLGGGSLSFPSQINA-- 52
           G F  ET+T+G      A +  + +GC  +  G     A G+LGL     SF S   +  
Sbjct: 184 GVFAKETITVGLTNGRKARLRGLLVGCSSSFSGQSFQGADGVLGLAFSDFSFTSTATSLF 243

Query: 53  -STFSYCLVDRDSDS--TSTLEFDSSLPPNAVTAPLLRNHELDT-----FYYLGLTGISV 104
            +  SYCLVD  S+   ++ L F  S    +      R   LD      FY + + GIS+
Sbjct: 244 GAKLSYCLVDHLSNKNISNYLIFGYSSSSTSTKTAPGRTTPLDLTLIPPFYAINIIGISI 303

Query: 105 GGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPT--DGV 162
           G D+L I    +  D +  GG I+DSGT++T L    Y  +     R    L     +G+
Sbjct: 304 GDDMLDIPTQVW--DATTGGGTILDSGTSLTLLAEAAYKPVVTGLARYLVELKRVKPEGI 361

Query: 163 ALFDTCYDFSSRSSV---EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS- 218
            +    Y FSS S     ++P ++FH   G       K+YL+   + G  C  F    + 
Sbjct: 362 PIE---YCFSSTSGFNESKLPQLTFHLKGGARFEPHRKSYLVDA-APGVKCLGFMSAGTP 417

Query: 219 SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           + +++GN+ QQ     F+L  S + F P+ C
Sbjct: 418 ATNVVGNIMQQNYLWEFDLMASTLSFAPSTC 448


>gi|297724243|ref|NP_001174485.1| Os05g0511050 [Oryza sativa Japonica Group]
 gi|222632192|gb|EEE64324.1| hypothetical protein OsJ_19161 [Oryza sativa Japonica Group]
 gi|255676482|dbj|BAH93213.1| Os05g0511050 [Oryza sativa Japonica Group]
          Length = 432

 Score = 78.6 bits (192), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 72/258 (27%), Positives = 110/258 (42%), Gaps = 53/258 (20%)

Query: 35  GLLGLGGGSLSFPSQIN--ASTFSYCLVD----RDSDSTSTL---EFDSSLPPNAVTAPL 85
           G+ G G G LS PSQ+      FS+C +     R+ + TS+L   +   S   + +  P+
Sbjct: 181 GIAGFGKGILSLPSQLGFLDKGFSHCFLGFRFARNPNFTSSLIMGDLALSAKDDFLFTPM 240

Query: 86  LRNHELDTFYYLGLTGISVG-GDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNA 144
           L++     FYY+GL G+S+G G  +    +   ID  GNGG+IVD+GT  T L    Y A
Sbjct: 241 LKSITNPNFYYIGLEGVSIGDGAAIAAPPSLSSIDSEGNGGMIVDTGTTYTHLPDPFYTA 300

Query: 145 LRDAFVRGTRALSPTDGVALFDTCYDFSSRSSV----------------EVPTVSFHFPE 188
           +          LS    V L++  YD   R+                  E+P ++FHF  
Sbjct: 301 I----------LSSLASVILYERSYDLEMRTGFDLCFKIPCTHTPCTQDELPLINFHFLG 350

Query: 189 GKVLPLPAKN--YLI--PVDSNGTFCFAFAPTSSSL-------------SIIGNVQQQGT 231
              L LP  +  Y +  P +S    C  F    +               +++G+ Q Q  
Sbjct: 351 DVKLTLPKDSCYYAVTAPKNSVVVKCLLFQRMDNDDDDDDVGGANNGPGAVLGSFQMQNV 410

Query: 232 RVSFNLRNSLIGFTPNKC 249
            V +++    IGF P  C
Sbjct: 411 EVVYDMEAGRIGFQPKDC 428


>gi|125532788|gb|EAY79353.1| hypothetical protein OsI_34482 [Oryza sativa Indica Group]
          Length = 394

 Score = 78.6 bits (192), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 68/263 (25%), Positives = 116/263 (44%), Gaps = 25/263 (9%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNE-GLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
           G   T+T  +G+A   ++A GC   ++     G +G++GLG    S  +Q   + FSYCL
Sbjct: 139 GKVGTDTFAVGTAKA-SLAFGCVVASDIDTMGGPSGIVGLGRTPWSLVTQTGVAAFSYCL 197

Query: 60  VDRDSDSTSTLEFDSSLP----PNAVTAPLL----RNHELDTFYYLGLTGISVGGDLLPI 111
              D+   S L   SS        A + P +      ++L  +Y + L G+  G  ++P+
Sbjct: 198 APHDAGKNSALFLGSSAKLAGGGKAASTPFVNISGNGNDLSNYYKVQLEGLKAGDAMIPL 257

Query: 112 SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDF 171
             +           +++D+ + ++ L    Y A++ A      A      V  FD C+  
Sbjct: 258 PPS--------GSTVLLDTFSPISFLVDGAYQAVKKAVTVAVGAPPMATPVEPFDLCFPK 309

Query: 172 SSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS-----SSLSIIGNV 226
           S  S    P + F F  G  + + A NYL+    NGT C A   ++     + LS++G++
Sbjct: 310 SGASGA-APDLVFTFRGGAAMTVAASNYLLDYK-NGTVCLAMLSSARLNSTTELSLLGSL 367

Query: 227 QQQGTRVSFNLRNSLIGFTPNKC 249
           QQ+     F+L    + F P  C
Sbjct: 368 QQENIHFLFDLDKETLSFEPADC 390


>gi|125552155|gb|EAY97864.1| hypothetical protein OsI_19785 [Oryza sativa Indica Group]
          Length = 508

 Score = 78.6 bits (192), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 78/272 (28%), Positives = 116/272 (42%), Gaps = 31/272 (11%)

Query: 1   GDFVTETVTLGSASVDN-----IAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTF 55
           G+   +  T G  S D      +  GC  + EG F GA+G+LGL  GSLS  SQ+N   F
Sbjct: 191 GNLAVQNFTFGDDSEDTAVKGVVTFGCSSSTEGDF-GASGVLGLNKGSLSLVSQLNLGRF 249

Query: 56  SYCLVDR--DSDSTSTLEFDSSLPPNAVTAP-----------------LLRNHELDTFYY 96
           SY        +D+ +  +F      + +T P                  + +  LD  Y+
Sbjct: 250 SYYFAPEVNTTDNNAADDFIVFGDDDGITVPGTSGGSRPRYTPFFTTGAVSSANLD-LYF 308

Query: 97  LGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRAL 156
           + LTGI VGG  L +              ++  S   VT L+   Y  L+   V    + 
Sbjct: 309 VELTGIRVGGKDLQLGGGGGGSAGGSLEAVLSTS-VPVTYLEKNAYGLLKKELVSALGSN 367

Query: 157 SPTDGVAL-FDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP 215
           +  DG AL  D CY        ++P ++F F    V+ L   NYL   +  G  C    P
Sbjct: 368 NTEDGSALGLDLCYRSQHMDRAKIPDIAFVFGGNAVMKLQQWNYLYQDEDTGLECLTILP 427

Query: 216 T---SSSLSIIGNVQQQGTRVSFNLRNSLIGF 244
           +   S  LS+IG++ Q GT + ++L  S +GF
Sbjct: 428 SPDDSDGLSLIGSMIQTGTYMIYDLHKSRLGF 459


>gi|224032957|gb|ACN35554.1| unknown [Zea mays]
          Length = 144

 Score = 78.6 bits (192), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 50/123 (40%), Positives = 69/123 (56%), Gaps = 3/123 (2%)

Query: 127 IVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHF 186
           I+DSGT +TRL T  Y+AL  A     +        ++ DTC+   + + + VP V+  F
Sbjct: 24  IIDSGTVITRLPTGVYSALSKAVAGAMKGTPRASAFSILDTCFQGQA-ARLRVPEVTMAF 82

Query: 187 PEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNSLIGFTP 246
             G  L L A+N L+ VDS  T C AFAP  S+ +IIGN QQQ   V ++++NS IGF  
Sbjct: 83  AGGAALKLAARNLLVDVDS-ATTCLAFAPARSA-AIIGNTQQQTFSVVYDVKNSKIGFAA 140

Query: 247 NKC 249
             C
Sbjct: 141 GGC 143


>gi|356542694|ref|XP_003539801.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 489

 Score = 78.6 bits (192), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 69/228 (30%), Positives = 105/228 (46%), Gaps = 23/228 (10%)

Query: 35  GLLGLGGGSLSFPSQ-----INASTFSYCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNH 89
           G+ G G   +S  SQ     I    FS+CL   D+     L     + PN V +PL+++ 
Sbjct: 220 GIFGFGQQGMSVISQLSLQGIAPRVFSHCL-KGDNSGGGVLVLGEIVEPNIVYSPLVQSQ 278

Query: 90  ELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAF 149
                Y L L  ISV G ++PI+   F    S N G IVDSGT +  L  E YN     F
Sbjct: 279 P---HYNLNLQSISVNGQIVPIAPAVFA--TSNNRGTIVDSGTTLAYLAEEAYN----PF 329

Query: 150 VRGTRALSPTDGVALF---DTCYDFSSRSSVEV-PTVSFHFPEGKVLPLPAKNYLIPVDS 205
           V    AL P    ++    + CY  ++ S+V++ P VS +F  G  L L  ++YL+  + 
Sbjct: 330 VNAITALVPQSVRSVLSRGNQCYLITTSSNVDIFPQVSLNFAGGASLVLRPQDYLMQQNY 389

Query: 206 NG---TFCFAFAPT-SSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
            G    +C  F      S++I+G++  +     ++L    IG+    C
Sbjct: 390 IGEGSVWCIGFQRIPGQSITILGDLVLKDKIFVYDLAGQRIGWANYDC 437


>gi|357131275|ref|XP_003567264.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-2-like, partial [Brachypodium distachyon]
          Length = 364

 Score = 78.6 bits (192), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 85/283 (30%), Positives = 118/283 (41%), Gaps = 36/283 (12%)

Query: 1   GDFVTETVTLGSASVD-NIAIGC---GHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFS 56
           G   T+   +GSA+     A GC     ++    V +AGLLG+  G+LSF SQ     FS
Sbjct: 73  GALATDVFAVGSATPSLRAAFGCMASAFDSSPDGVASAGLLGMNRGALSFVSQAGTRRFS 132

Query: 57  YCLVDRDSDSTSTLEFDSSLP---PNAVTAPLLRNHELDTF----YYLGLTGISVGGDLL 109
           YC+ DRD D+   L   S LP   P   T     +  L  F    Y + L GI VG   L
Sbjct: 133 YCISDRD-DAGVLLLGHSDLPNFLPLNYTPLYQPSLPLPYFDRVAYSVQLLGILVGSKPL 191

Query: 110 PISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGT----RAL-SPTDGV-A 163
           PI  +    D +G G  +VDSGT  T L  + Y AL+  F R +    RAL  P+     
Sbjct: 192 PIPASVLAPDHTGAGQTMVDSGTQFTFLLGDAYAALKAEFYRQSTPFLRALDEPSFAFQG 251

Query: 164 LFDTCYD----FSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPV----------DSNGTF 209
            FDTC+      S      +P+V+  F  G  + +     L  V          D +  +
Sbjct: 252 AFDTCFRVPRGMSPPPGRLLPSVTLRF-NGAEMVVGGDRLLYKVPGERRGGAGADDDAVW 310

Query: 210 CFAFAPTSS---SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           C  F           +IG+  Q    V ++L    +G    +C
Sbjct: 311 CLTFGNADMVPIMAYVIGHHHQMNLWVEYDLERGRVGLAQVRC 353


>gi|50878437|gb|AAT85211.1| hypothetical protein [Oryza sativa Japonica Group]
          Length = 435

 Score = 78.6 bits (192), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 70/253 (27%), Positives = 108/253 (42%), Gaps = 31/253 (12%)

Query: 22  CGHNN--EGLFVGAAGLLGLGGGSLSFPSQINA-----STFSYCLVDRDS-------DST 67
           CG  +  +GL   A G++ L     + P+Q+ +       F+ CL   +S       D+ 
Sbjct: 169 CGATSLTKGLGAAATGMMSLSRARFALPTQVASIFRFSRKFALCLAPAESSGVVVFGDAP 228

Query: 68  STLEFDSSLPPNAVTAPLLRNH------ELDTFYYLGLTGISVGGDLLPISETAFKIDES 121
              +    L  + +  PLL N       +  T Y++G+TGI V G  +P++ T   I +S
Sbjct: 229 YEFQPVMDLSKSLIYTPLLVNPVTTTGGDKSTEYFIGVTGIKVNGRAVPLNATLLAIAKS 288

Query: 122 GNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYD----FSSRSSV 177
           G GG  +   +  T L+T  Y A+ DAF   T  +     VA F  CYD     S+R+  
Sbjct: 289 GVGGTKLSMLSPYTVLETSIYKAVTDAFAAETAMIPRVPAVAPFKLCYDGTMVGSTRAGP 348

Query: 178 EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCF-----AFAPTSSSLSIIGNVQQQGTR 232
            VPTV        V  +      +    +G  CF       AP +S   +IG    +   
Sbjct: 349 AVPTVELVLQSKAVSWVVFGANSMVATKDGALCFGVVDGGVAPETS--VVIGGHMMEDNL 406

Query: 233 VSFNLRNSLIGFT 245
           + F+L  S +GFT
Sbjct: 407 LEFDLEGSRLGFT 419


>gi|125552105|gb|EAY97814.1| hypothetical protein OsI_19735 [Oryza sativa Indica Group]
          Length = 424

 Score = 78.2 bits (191), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 75/273 (27%), Positives = 115/273 (42%), Gaps = 67/273 (24%)

Query: 5   TETVTLGSASVDNIAIGCGHNNE---GLFVGAAGLLGLGGGSLSFPSQINASTFSYCLVD 61
           T+  T  S+S   +A GC        G   GA+G++GLG G+LS             L  
Sbjct: 188 TDAFTFPSSSSVTLAFGCVSQTRISPGALTGASGIIGLGRGALS-------------LNP 234

Query: 62  RDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDES 121
           +DS                            TFYYL L G++ G   + +   AF + E+
Sbjct: 235 KDS-------------------------PFSTFYYLPLVGLAAGNATVALPAGAFDLREA 269

Query: 122 G----NGGIIVDSGTAVTRLQTETYNALRDAF---VRGTRALSPTDGV--ALFDTCY--- 169
                 GG ++DSG+  TRL    + AL       +RG+ +L P         + C    
Sbjct: 270 APKVWAGGALIDSGSPFTRLVDPAHRALTKELARQLRGSGSLVPPPAKLGGALELCVEAG 329

Query: 170 -DFSSRSSVEVPTVSFHFPEG----KVLPLPAKNYLIPVDSNGTFCFAFAPTSS------ 218
            D  S ++  VP++   F +G    + L +PA+ Y   V+++ T+C A   ++S      
Sbjct: 330 DDGDSLAAAAVPSLVLRFDDGVGGGRELVIPAEKYWARVEAS-TWCMAVVSSASGNATLP 388

Query: 219 --SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
               +IIGN  QQ  RV ++L N L+ F P  C
Sbjct: 389 TNETTIIGNFMQQDMRVLYDLANGLLSFQPANC 421


>gi|357440289|ref|XP_003590422.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355479470|gb|AES60673.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 498

 Score = 78.2 bits (191), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 72/249 (28%), Positives = 104/249 (41%), Gaps = 19/249 (7%)

Query: 13  ASVDNIAIGCGHNNEGLFV----GAAGLLGLGGGSLSFPSQ-----INASTFSYCLVDRD 63
           AS   I  GC     G          G+LG G G LS  SQ     I    FS+CL   D
Sbjct: 203 ASSATIVFGCSTYQSGDLTKTDKAVDGILGFGPGELSVVSQLSSRGITPKVFSHCL-KGD 261

Query: 64  SDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGN 123
            +    L     L P+ V +PL+ +      Y L L  I+V G +L I+   F    S  
Sbjct: 262 GNGGGILVLGEILEPSIVYSPLVPSQP---HYNLNLQSIAVNGQVLSINPAVFA--TSDK 316

Query: 124 GGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVS 183
            G I+DSGT ++ L  E Y+ L +A        + T  ++    CY   +      PTVS
Sbjct: 317 RGTIIDSGTTLSYLVQEAYDPLVNAVDTAVSQFA-TSFISKGSQCYLVLTSIDDSFPTVS 375

Query: 184 FHFPEGKVLPLPAKNYLIP---VDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNS 240
           F+F  G  + L    YL+     D    +C  F      ++I+G++  +   V ++L   
Sbjct: 376 FNFEGGASMDLKPSQYLLNRGFQDGAKMWCIGFQKVQEGVTILGDLVLKDKIVVYDLARQ 435

Query: 241 LIGFTPNKC 249
            IG+T   C
Sbjct: 436 QIGWTNYDC 444


>gi|326529727|dbj|BAK04810.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 488

 Score = 78.2 bits (191), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 81/288 (28%), Positives = 129/288 (44%), Gaps = 42/288 (14%)

Query: 1   GDFVTETVTLGS--ASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYC 58
           G  V++T+ L    A+  N A+GC  +   +    +GL G G G+ S P+Q+  + FSYC
Sbjct: 199 GLLVSDTLRLSPRGAASRNFAVGC--SLASVHQPPSGLAGFGRGAPSVPAQLGVNKFSYC 256

Query: 59  LVDRDSDSTSTLEFDSSLPPNAV--------TAPLLRNH----ELDTFYYLGLTGISVGG 106
           L+ R  D  + +  +  L  ++          APLL+N         +YYL LTGI+VGG
Sbjct: 257 LLSRRFDDDAAISGELVLGASSAGKAKAMMQYAPLLKNAGARPPYSVYYYLSLTGIAVGG 316

Query: 107 DLLPISETAF-KIDESGNGGIIVDSGTAVTRLQTETYNALRDAFV-----RGTRALSPTD 160
             + +   A   +   G GG I+DSGT  T L    +  +  A V     R  R+    +
Sbjct: 317 KSVALPARALAPVSGGGGGGAIIDSGTTFTYLDPTVFKPVAAAMVAAVGGRYNRSKD-VE 375

Query: 161 GVALFDTCYDFSSRS-SVEVPTVSFHFPEGKVLPLPAKNYLIPVD-----SNGTFCFAFA 214
           G      C+   + + ++++P +S HF  G  + LP +NY +        +    C A  
Sbjct: 376 GALGLRPCFALPAGARTMDLPELSLHFSGGAEMRLPIENYFLAAGPASGVAPEAICLAVV 435

Query: 215 PTSSSLS-------------IIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
              SS S             I+G+ QQQ  +V ++L  + +GF    C
Sbjct: 436 SDVSSASGGAGVSGGGGPAIILGSFQQQNYQVEYDLEKNRLGFRQQPC 483


>gi|326518194|dbj|BAK07349.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 435

 Score = 78.2 bits (191), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 74/258 (28%), Positives = 110/258 (42%), Gaps = 29/258 (11%)

Query: 15  VDNIAIGCGHNNEG---LFVGA-AGLLGLGGGSLSFPSQINA-----STFSYCLVDRDSD 65
           VD +  GC H  +G   L  G  AG L L     SF SQ+ A     S FSYCL    S 
Sbjct: 176 VDKLTFGCAHTTDGFERLNHGVLAGALSLSRHPTSFLSQLTARRLADSRFSYCLFPGQSH 235

Query: 66  STST---LEFDSSLPPN---AVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAF--K 117
             +    L F   +P +     T+ L       + YY+G+T IS+ G  +   + AF  +
Sbjct: 236 PNARHGFLRFGRDIPRHDHAHSTSLLFTGRGSGSMYYIGVTSISLNGKRIIGLQPAFFRR 295

Query: 118 IDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR------GTRALSPTDGVALFDTCYDF 171
             ++  GG +VD GT +TRL  E YN +    V         RA +P  G  L   C  F
Sbjct: 296 NPQTRRGGSVVDPGTPLTRLVREAYNIVEAELVAYMQTQGSRRAPAPVQGHRL---C--F 350

Query: 172 SSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGT 231
            S     +P+++ +  E +         L    ++   CF   P    ++++G  QQ  T
Sbjct: 351 VSWGHAHLPSMTINMNEDRAKLFIKPELLFLKVTHEHLCFLVVP-DEEMTVLGAAQQVDT 409

Query: 232 RVSFNLRNSLIGFTPNKC 249
           R +F+L  + + F    C
Sbjct: 410 RFTFDLHANRLYFAQEHC 427


>gi|224079535|ref|XP_002305886.1| predicted protein [Populus trichocarpa]
 gi|222848850|gb|EEE86397.1| predicted protein [Populus trichocarpa]
          Length = 436

 Score = 77.8 bits (190), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 76/268 (28%), Positives = 113/268 (42%), Gaps = 24/268 (8%)

Query: 1   GDFVTETVTLG-SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
           G+ V E +T   S S   + +GC  +         G+LG+  G LSF SQ   + FSYC+
Sbjct: 170 GNLVREKITFSTSQSTPPLILGCAEDASD----DKGILGMNLGRLSFASQAKITKFSYCV 225

Query: 60  VDRDSDS--TSTLEFDSSLPPNAVTAPLL---------RNHELDTFYY-LGLTGISVGGD 107
             R      T T  F     PN+     +         R   LD   + + L GI +G  
Sbjct: 226 PTRQVRPGFTPTGSFYLGENPNSAGFQYISLLTFSQSQRMPNLDPLAHTVALQGIRIGNK 285

Query: 108 LLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR--GTRALSPTDGVALF 165
            L I  +AF+ D SG G  ++DSG+  T L    YN +R+  VR  G R         + 
Sbjct: 286 KLNIPVSAFRADPSGAGQSMIDSGSEFTYLVDVAYNKVREEVVRLAGPRLKKGYVYSGVS 345

Query: 166 DTCYDFSSRSSVE-VPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS---SSLS 221
           D C+D ++      +  + F F +G  + +     L  V   G  C     +    ++ +
Sbjct: 346 DMCFDGNAMEIGRLIGNMVFEFDKGVEIVIEKGRVLADV-GGGVHCVGIGRSEMLGAASN 404

Query: 222 IIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           IIGN  QQ   V F++ N  +GF    C
Sbjct: 405 IIGNFHQQNLWVEFDIANRRVGFGKADC 432


>gi|222632517|gb|EEE64649.1| hypothetical protein OsJ_19503 [Oryza sativa Japonica Group]
          Length = 505

 Score = 77.8 bits (190), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 57/181 (31%), Positives = 84/181 (46%), Gaps = 9/181 (4%)

Query: 74  SSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTA 133
           SS P      PLL +  +  FY + +  +SV G  L I    +  D   NGG I+DSGT+
Sbjct: 327 SSAPAPGSRTPLLLDARVRPFYAVAVDSVSVDGVALDIPAEVW--DVGSNGGTIIDSGTS 384

Query: 134 VTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSR----SSVEVPTVSFHFPEG 189
           +T L T  Y A+  A       L P   +  FD CY++++R      + VP ++  F   
Sbjct: 385 LTVLATPAYKAVVAALSEQLAGL-PRVAMDPFDYCYNWTARGDGGGDLAVPKLAVQFAGS 443

Query: 190 KVLPLPAKNYLIPVDSNGTFCFAFAPTS-SSLSIIGNVQQQGTRVSFNLRNSLIGFTPNK 248
             L  PAK+Y+I   + G  C      +   +S+IGN+ QQ     F+L N  + F    
Sbjct: 444 ARLEPPAKSYVIDA-APGVKCIGVQEGAWPGVSVIGNILQQEHLWEFDLNNRWLRFRQTS 502

Query: 249 C 249
           C
Sbjct: 503 C 503


>gi|359488213|ref|XP_002263620.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 434

 Score = 77.8 bits (190), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 77/268 (28%), Positives = 114/268 (42%), Gaps = 24/268 (8%)

Query: 1   GDFVTETVTLGSA-SVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
           G+ V E  T  S+ +   + +GC  ++        G+LG+  G LSF S    S FSYC+
Sbjct: 168 GNLVREKFTFSSSQTTPPLILGCATDSSD----TQGILGMNLGRLSFSSLAKISKFSYCV 223

Query: 60  VDRDSDSTSTLEFDSSLPPNAVTAPL-----------LRNHELDTFYY-LGLTGISVGGD 107
             R S S S+      L PN  +A              R   LD   Y L + GI + G 
Sbjct: 224 PPRRSQSGSSPTGSFYLGPNPSSAGFKYVNLMTYRQSQRMPNLDPLAYTLPMLGIRINGK 283

Query: 108 LLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR--GTRALSPTDGVALF 165
            L IS +AF+ D SG G  ++DSGT  T L  E Y+ +++  V+  G +           
Sbjct: 284 KLNISTSAFRADPSGAGQTLIDSGTWFTFLVDEAYSKVKEEIVKLAGPKLKKGYVYGGSL 343

Query: 166 DTCYDFSSRS-SVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS---SSLS 221
           D C+D  +      +  ++F F  G  + +  +  L  V   G  C     +     + +
Sbjct: 344 DMCFDGDAMVIGRMIGNMAFEFENGVEIVVEREKMLADV-GGGVQCLGIGRSDLLGVASN 402

Query: 222 IIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           IIGN  QQ   V F+L    +GF    C
Sbjct: 403 IIGNFHQQDLWVEFDLVGRRVGFGRTDC 430


>gi|297740191|emb|CBI30373.3| unnamed protein product [Vitis vinifera]
          Length = 218

 Score = 77.4 bits (189), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 59/221 (26%), Positives = 87/221 (39%), Gaps = 24/221 (10%)

Query: 50  INASTFSYCLVDRDSDSTST-----LEFDSSLPPNAVTAPLLRNHELDTFYY-LGLTGIS 103
           +    F+YCL   D D T       L++           P L++     FYY LG+  I 
Sbjct: 1   MGVKKFAYCLNSHDYDDTRNSGKLILDYRDGKTKGLSYTPFLKSPPASAFYYHLGVKDIK 60

Query: 104 VGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTE-----TYNALRDAFVRGTRALSP 158
           +G  LL I          G  G+I+DSG       T        N L+    +  R+L  
Sbjct: 61  IGNKLLRIPSKYLAPGSDGRSGVIIDSGYGGAGYMTGPVFKIVTNELKKQMSKYRRSLEA 120

Query: 159 TDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYL----------IPVDSNGT 208
                L   CY+F+   S+++P + + F  G  + +P KNY             +D+NGT
Sbjct: 121 ETQTGL-TPCYNFTGHKSIKIPPLIYQFRGGANMVVPGKNYFGISPQESLACFLMDTNGT 179

Query: 209 FCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
                 P  S   I+GN Q     V ++L+N   GF    C
Sbjct: 180 NALEITPDPS--IILGNSQHVDYYVEYDLKNDRFGFRRQTC 218


>gi|356551755|ref|XP_003544239.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-1-like [Glycine max]
          Length = 249

 Score = 77.4 bits (189), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 61/196 (31%), Positives = 88/196 (44%), Gaps = 17/196 (8%)

Query: 57  YCLVDRDSDSTS-TLEFDSSLPPNAV-TAPLLRNHELDTFYYLGLTGISVGGDLLPISET 114
           YCL    S   S +L+   +  P  + T PLLRN +  + YY+ LTGI+VG   + +   
Sbjct: 67  YCLPSFQSSYFSGSLKLGPTGQPRRIRTTPLLRNPQRPSLYYVNLTGINVGRVRVSLPTD 126

Query: 115 AFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSR 174
               D +   G I+DSGT +TR     YNA+RD F    +      G     T  + +  
Sbjct: 127 YLAFDPNKGSGTIIDSGTVITRFVXPVYNAIRDEFRYQVK------GPCFVKTYENLA-- 178

Query: 175 SSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSL-SIIGNVQQQGTRV 233
                P +   F  G  + LP +N LI     G  C A A   +++ S + N QQQ  RV
Sbjct: 179 -----PLIKLRF-TGLDVTLPYENTLIHTAYGGMACLAMAAAPNNVNSALTNFQQQNLRV 232

Query: 234 SFNLRNSLIGFTPNKC 249
            F+  N+ +G     C
Sbjct: 233 LFDTVNNRVGIARELC 248


>gi|255545620|ref|XP_002513870.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223546956|gb|EEF48453.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 535

 Score = 77.4 bits (189), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 71/268 (26%), Positives = 115/268 (42%), Gaps = 32/268 (11%)

Query: 1   GDFVTETVTLGSASVDN----------IAIGCGHNNEGLFV-GAA--GLLGLGGGSLSFP 47
           G  V + + L S S D+          + +GCG    G ++ GAA  G++GLG GS+S P
Sbjct: 199 GFLVEDILHLASVSDDSNSTQKRVQASVILGCGRKQTGGYLDGAAPDGVMGLGPGSISVP 258

Query: 48  SQINAS-----TFSYCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGI 102
           S +  +     +FS C    D + + T+ F      +  + PLL        Y + +   
Sbjct: 259 SLLAKAGLIRKSFSLCF---DVNGSGTILFGDQGHTSQKSTPLLPTQGNYDAYLIEVESY 315

Query: 103 SVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGV 162
            VG   L   ++ FK         +VDSG + T L  + YN +   F +   A   +   
Sbjct: 316 CVGNSCL--KQSGFKA--------LVDSGASFTYLPIDVYNKIVLEFDKQVNAQRISSQG 365

Query: 163 ALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSN-GTFCFAFAPTSSSLS 221
             ++ CY+ SS+    VP +   F   + L +    Y +P +     FC    PT  +  
Sbjct: 366 GPWNYCYNTSSKQLDNVPAMRLSFLMNQSLLIHNSTYYVPQNQEFAVFCLTLQPTDLNYG 425

Query: 222 IIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           IIG     G RV F++ N  +G++ + C
Sbjct: 426 IIGQNYMTGYRVVFDMENLKLGWSSSNC 453


>gi|297822477|ref|XP_002879121.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297324960|gb|EFH55380.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 711

 Score = 77.4 bits (189), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 76/263 (28%), Positives = 107/263 (40%), Gaps = 24/263 (9%)

Query: 1   GDFVTETVTLGSAS-----VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS-- 53
           G   T+TVT+ S S     +    IGCG NN        G +GL  G LS  +Q+     
Sbjct: 454 GTLATDTVTIHSTSGEPFVMAETIIGCGRNNSWFRPSFEGFVGLNWGPLSLITQMGGEYP 513

Query: 54  -TFSYCLVDRDSDSTSTLEFDSSLPP---NAVTAPLLRNHELDTFYYLGLTGISVGGDLL 109
              SYC      + TS + F ++        V+  +        FYYL L  +SVG   +
Sbjct: 514 GLMSYCFA---GNGTSKINFGTNAIVGGGGVVSTTMFVTTARPGFYYLNLDAVSVGDTRI 570

Query: 110 PISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCY 169
               T F   E   G I++DSGT +T       N +R A      A+   D       CY
Sbjct: 571 ETLGTPFHALE---GNIVIDSGTTLTYFPESYCNLVRQAVEHVVPAVPAADPTGNDLLCY 627

Query: 170 DFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFA---PTSSSLSIIGNV 226
            +S+ + +  P ++ HF  G  L L   N  +   S G FC A     PT    +I GN 
Sbjct: 628 -YSNTTEI-FPVITMHFSGGADLVLDKYNMFMESYSGGLFCLAIICNNPTQE--AIFGNR 683

Query: 227 QQQGTRVSFNLRNSLIGFTPNKC 249
            Q    V ++  + L+ F P  C
Sbjct: 684 AQNNFLVGYDSSSLLVSFKPTNC 706



 Score = 63.5 bits (153), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 68/249 (27%), Positives = 103/249 (41%), Gaps = 44/249 (17%)

Query: 1   GDFVTETVTLGSAS-----VDNIAIGCGHNNEG--LFVGAAGLLGLGGGSLSFPSQINAS 53
           G   TETVT+ S S     +    IGC  NN G      ++G++GL  GSLS  SQ+  +
Sbjct: 141 GTLATETVTIHSTSGVPFVMPETIIGCSRNNSGSGFRPSSSGIVGLSRGSLSLISQMGGA 200

Query: 54  TFSYCLVDRDSDSTSTLEFDSSLPPN-AVTAPLLRNHELDTFYYLGLTGISVGGDLLPIS 112
                                  P +  V+  +         YYL L  +SVG   +   
Sbjct: 201 ----------------------YPGDGVVSTTMFAKTAKRGQYYLNLDAVSVGDTRIETV 238

Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR---GTRALSPTDGVALFDTCY 169
            T F    + NG I++DSGT +T       N +R A  R     R + P+    L   CY
Sbjct: 239 GTPF---HALNGNIVIDSGTPLTYFPVSYCNLVRKAVERVVTADRVVDPSRNDML---CY 292

Query: 170 DFSSRSSVEV-PTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS-SSLSIIGNVQ 227
                +++E+ P ++ HF  G  L L   N  + ++  G FC A    + + ++I GN  
Sbjct: 293 ---YSNTIEIFPVITVHFSGGADLVLDKYNMYMELNRGGVFCLAIICNNPTQVAIFGNRA 349

Query: 228 QQGTRVSFN 236
           Q    V ++
Sbjct: 350 QNNFLVGYD 358


>gi|22165127|gb|AAM93743.1| putative chloroplast nucleoid DNA binding protein [Oryza sativa
           Japonica Group]
          Length = 265

 Score = 77.4 bits (189), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 72/262 (27%), Positives = 115/262 (43%), Gaps = 28/262 (10%)

Query: 5   TETVTLGSASVDNIAIGCGHN-NEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLVDR- 62
           T+T  +G+A+  ++A GC  + N    +GA+G++GLG    S   Q+NA+ FSYCL    
Sbjct: 11  TDTFAIGTATA-SLAFGCAMDSNIKQLLGASGVVGLGRTPWSLVGQMNATAFSYCLAPHG 69

Query: 63  DSDSTSTLEFDSSLP----PNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKI 118
            +   S L   +S       +A T PL+   +  + Y + L GI   GD++        I
Sbjct: 70  AAGKKSALLLGASAKLAGGKSAATTPLVNTSDDSSDYMIHLEGIKF-GDVI--------I 120

Query: 119 DESGNGGII-VDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCY-----DFS 172
               NG ++ VD+   V+ L    + A++ A      A         FD C+        
Sbjct: 121 APPPNGSVVLVDTIFGVSFLVDAAFQAIKKAVTVAVGAAPMATPTKPFDLCFPKAAAAAG 180

Query: 173 SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS-----SSLSIIGNVQ 227
           + SS+ +P V   F     L +P   Y+     NGT C A   ++     + LSI+G + 
Sbjct: 181 ANSSLPLPDVVLTFQGAAALTVPPSKYMYDA-GNGTVCLAMMSSAMLNLTTELSILGRLH 239

Query: 228 QQGTRVSFNLRNSLIGFTPNKC 249
           Q+     F+L    + F P  C
Sbjct: 240 QENIHFLFDLDKETLSFEPADC 261


>gi|449432735|ref|XP_004134154.1| PREDICTED: basic 7S globulin-like [Cucumis sativus]
 gi|449527085|ref|XP_004170543.1| PREDICTED: basic 7S globulin-like [Cucumis sativus]
          Length = 435

 Score = 77.0 bits (188), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 73/268 (27%), Positives = 110/268 (41%), Gaps = 34/268 (12%)

Query: 12  SASVDNIAIGCGHNN--EGLFVGAAGLLGLGGGSLSFPSQINAS-----TFSYCLVDRDS 64
           + S+ N    CG     EGL  G  G+ G G   +S PSQ  A+      F+ CL    S
Sbjct: 151 AVSIPNFLFVCGSTFLLEGLAPGVTGMAGFGRNGISLPSQFAAAFSFNRKFAVCLSGSTS 210

Query: 65  ------------------DSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGG 106
                             D T++  +         TA +    E  T Y++G+T I V  
Sbjct: 211 SPGVIFSGNGPYHFLPNIDLTNSFTYTPLFINPVSTAGVSSAGEKSTEYFIGVTSIVVNS 270

Query: 107 DLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFD 166
             +P++ T  KID +GNGG  + +    T L++  Y AL  AF      +     VA F+
Sbjct: 271 KPVPLNTTLLKIDSNGNGGTKISTVNPFTVLESSIYKALVKAFTTEVSKVPRVGAVAPFE 330

Query: 167 TCYDF----SSRSSVEVPTVSFHFPEGKVL-PLPAKNYLIPVDSNGTFCFAFAPTSSSLS 221
            CY      S+R    VPT+       KV+  +   N ++ V+ +   C  F      + 
Sbjct: 331 VCYSSKSFPSTRLGAGVPTIDLVLQNKKVIWSMFGANSMVQVN-DEVLCLGFVDGGVDVR 389

Query: 222 ---IIGNVQQQGTRVSFNLRNSLIGFTP 246
              +IG  Q +   + F+L  S +GFTP
Sbjct: 390 TAIVIGAHQIEDKLLEFDLATSRLGFTP 417


>gi|297811183|ref|XP_002873475.1| hypothetical protein ARALYDRAFT_909036 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297319312|gb|EFH49734.1| hypothetical protein ARALYDRAFT_909036 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 292

 Score = 77.0 bits (188), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 71/248 (28%), Positives = 101/248 (40%), Gaps = 50/248 (20%)

Query: 6   ETVTLGSASV-DNIAIGCGHNNEG-LFVGAAGLLGLGGGSLSFPSQINASTFSYCLVDRD 63
           E  TL S+   D +  GCG NN G  + G AGLLG   G L+F S               
Sbjct: 90  EKFTLMSSDFFDGVNFGCGENNTGDYYEGVAGLLGNTSGHLTFGS--------------- 134

Query: 64  SDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGN 123
                     + +  +    P+  +   D FYYL + GI+V    L I            
Sbjct: 135 ----------TGISKSVKFTPVSSSPSKD-FYYLNIEGITVCDKQLEIPS---------- 173

Query: 124 GGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTD-GVALFDTCYDFSSRSSVEVPTV 182
               ++S T         Y AL+ AF       + T  G +  DTCYDF+   +V +  +
Sbjct: 174 ----IESSTP------RAYAALKSAFKEKMSKYTITSSGDSELDTCYDFTGLKTVTITKI 223

Query: 183 SFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT-SSSLSIIGNVQQQGTRVSFNLRNSL 241
           +F F  G V+ L  K  L         C AFA     +++I G+VQQQ  +V ++     
Sbjct: 224 AFSFSGGTVVELDPKGILYSSSERSKLCLAFAEYPDDNVAIFGSVQQQTLQVVYDGVGGR 283

Query: 242 IGFTPNKC 249
           +GF PN C
Sbjct: 284 VGFAPNGC 291


>gi|21717166|gb|AAM76359.1|AC074196_17 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433306|gb|AAP54835.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|125575546|gb|EAZ16830.1| hypothetical protein OsJ_32301 [Oryza sativa Japonica Group]
          Length = 373

 Score = 77.0 bits (188), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 72/262 (27%), Positives = 115/262 (43%), Gaps = 28/262 (10%)

Query: 5   TETVTLGSASVDNIAIGCGHN-NEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLVDR- 62
           T+T  +G+A+  ++A GC  + N    +GA+G++GLG    S   Q+NA+ FSYCL    
Sbjct: 119 TDTFAIGTATA-SLAFGCAMDSNIKQLLGASGVVGLGRTPWSLVGQMNATAFSYCLAPHG 177

Query: 63  DSDSTSTLEFDSSLP----PNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKI 118
            +   S L   +S       +A T PL+   +  + Y + L GI   GD++        I
Sbjct: 178 AAGKKSALLLGASAKLAGGKSAATTPLVNTSDDSSDYMIHLEGIKF-GDVI--------I 228

Query: 119 DESGNGGII-VDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYD-----FS 172
               NG ++ VD+   V+ L    + A++ A      A         FD C+        
Sbjct: 229 APPPNGSVVLVDTIFGVSFLVDAAFQAIKKAVTVAVGAAPMATPTKPFDLCFPKAAAAAG 288

Query: 173 SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS-----SSLSIIGNVQ 227
           + SS+ +P V   F     L +P   Y+     NGT C A   ++     + LSI+G + 
Sbjct: 289 ANSSLPLPDVVLTFQGAAALTVPPSKYMYDA-GNGTVCLAMMSSAMLNLTTELSILGRLH 347

Query: 228 QQGTRVSFNLRNSLIGFTPNKC 249
           Q+     F+L    + F P  C
Sbjct: 348 QENIHFLFDLDKETLSFEPADC 369


>gi|125572775|gb|EAZ14290.1| hypothetical protein OsJ_04214 [Oryza sativa Japonica Group]
          Length = 465

 Score = 76.6 bits (187), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 52/149 (34%), Positives = 74/149 (49%), Gaps = 5/149 (3%)

Query: 5   TETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLVDRDS 64
           TE  T G   +D +  GCG  N G F G +G++GLG G+LS  SQ+    FSY     DS
Sbjct: 149 TEAFTFGDTRIDGVVFGCGLKNVGDFSGVSGVIGLGRGNLSLVSQLQVDRFSYHFAPDDS 208

Query: 65  -DSTSTLEFDSSLPP---NAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKI-D 119
            D+ S + F     P   + ++  LL +    + YY+ L GI V G  L I    F + +
Sbjct: 209 VDTQSFILFGDDATPQTSHTLSTRLLASDANPSLYYVELAGIQVDGKDLAIPSGTFDLRN 268

Query: 120 ESGNGGIIVDSGTAVTRLQTETYNALRDA 148
           + G+GG+ +     VT L+   Y  LR A
Sbjct: 269 KDGSGGVFLSITDLVTVLEEAAYKPLRQA 297


>gi|15226358|ref|NP_180389.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|4803959|gb|AAD29831.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|330252998|gb|AEC08092.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 756

 Score = 76.6 bits (187), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 74/265 (27%), Positives = 119/265 (44%), Gaps = 24/265 (9%)

Query: 1   GDFVTETVTLGSAS-----VDNIAIGCGHNN-----EGLFVGAAGLLGLGGGSLSFPSQI 50
           G   TETVT+ S S     +    IGCG +N      G    ++G++GL  G LS  SQ+
Sbjct: 495 GILATETVTIPSTSGEPFVMAETKIGCGLDNTNLQYSGFASSSSGIVGLNMGPLSLISQM 554

Query: 51  N---ASTFSYCLVDRDSDSTSTLEF--DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVG 105
           +       SYC   +    TS + F  ++ +  +   A  +   + + FYYL L  +SV 
Sbjct: 555 DLPYPGLISYCFSGQ---GTSKINFGTNAIVAGDGTVAADMFIKKDNPFYYLNLDAVSVE 611

Query: 106 GDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALF 165
            +L+    T F  ++   G I +DSGT +T       N +R+A  +   A+   D  +  
Sbjct: 612 DNLIATLGTPFHAED---GNIFIDSGTTLTYFPMSYCNLVREAVEQVVTAVKVPDMGSDN 668

Query: 166 DTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSL-SIIG 224
             CY +S    +  P ++ HF  G  L L   N  +   + G FC A      S+ ++ G
Sbjct: 669 LLCY-YSDTIDI-FPVITMHFSGGADLVLDKYNMYLETITGGIFCLAIGCNDPSMPAVFG 726

Query: 225 NVQQQGTRVSFNLRNSLIGFTPNKC 249
           N  Q    V ++  +++I F+P  C
Sbjct: 727 NRAQNNFLVGYDPSSNVISFSPTNC 751



 Score = 67.8 bits (164), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 73/252 (28%), Positives = 110/252 (43%), Gaps = 24/252 (9%)

Query: 1   GDFVTETVTLGSAS-----VDNIAIGCG-HN----NEGLFVGAAGLLGLGGGSLSFPSQI 50
           G   TETVT+ S S     +    IGCG HN    N G    ++G++GL  G  S  SQ+
Sbjct: 156 GILATETVTIHSTSGEPFVMAETTIGCGLHNTDLDNSGFASSSSGIVGLNMGPRSLISQM 215

Query: 51  N---ASTFSYCLVDRDSDSTSTLEF--DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVG 105
           +       SYC   +    TS + F  ++ +  +   A  +   + + FYYL L  +SV 
Sbjct: 216 DLPYPGLISYCFSGQ---GTSKINFGTNAIVAGDGTVAADMFIKKDNPFYYLNLDAVSVE 272

Query: 106 GDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALF 165
            + +    T F  ++   G I++DSG+ VT       N +R A  +   A+   D     
Sbjct: 273 DNRIETLGTPFHAED---GNIVIDSGSTVTYFPVSYCNLVRKAVEQVVTAVRVPDPSGND 329

Query: 166 DTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS-SSLSIIG 224
             CY FS    +  P ++ HF  G  L L   N  +  +S G FC A    S +  +I G
Sbjct: 330 MLCY-FSETIDI-FPVITMHFSGGADLVLDKYNMYMESNSGGLFCLAIICNSPTQEAIFG 387

Query: 225 NVQQQGTRVSFN 236
           N  Q    V ++
Sbjct: 388 NRAQNNFLVGYD 399


>gi|224114179|ref|XP_002332420.1| predicted protein [Populus trichocarpa]
 gi|222832373|gb|EEE70850.1| predicted protein [Populus trichocarpa]
          Length = 449

 Score = 76.6 bits (187), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 69/275 (25%), Positives = 120/275 (43%), Gaps = 32/275 (11%)

Query: 1   GDFVTETVTLGS-----ASVDNIAIGCGHNNEGLFVG-------AAGLLGLGGGSLSFPS 48
           G+   ET T  S      ++ +I+ GC  ++  +           +G+LG+G G  SF +
Sbjct: 177 GNLANETFTFYSNHGKHTALKSISFGCSTDSRNMIYAFLLDKNPVSGVLGMGWGPRSFLA 236

Query: 49  Q---INASTFSYCLVDRDSDSTSTLEFDSSL--PPNAVTAPLLRNHELDTFYYLGLTGIS 103
           Q   I+   FSYC+   ++ +T  L F   +    N  T  +++  +    Y++ L GIS
Sbjct: 237 QLGSISHGKFSYCITANNTHNT-YLRFGKHVVKSKNLQTTKIMQV-KPSAAYHVNLLGIS 294

Query: 104 VGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVA 163
           V G  L I++T   + + G+ G I+D+GT  T L    ++ L  A    +  LS    + 
Sbjct: 295 VNGVKLNITKTDLAVRKDGSRGCIIDAGTLATLLVKPIFDTLHTAL---SNHLSSNQNLK 351

Query: 164 LF-------DTCYD-FSSRSSVEVPTVSFHFPEGKVLPLPAKNYLI-PVDSNGTFCFAFA 214
            +       D CY+  S      +P V+FH     +   P   +L    +    FC +  
Sbjct: 352 RWVIHKLHKDLCYEQLSDAGRKNLPVVTFHLENADLEVKPEAIFLFREFEGKNVFCLSML 411

Query: 215 PTSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
            +  S +IIG  QQ   +  ++ +  ++ F P  C
Sbjct: 412 -SDDSKTIIGAYQQMKQKFVYDTKARVLSFGPEDC 445


>gi|414869114|tpg|DAA47671.1| TPA: hypothetical protein ZEAMMB73_872184 [Zea mays]
          Length = 492

 Score = 76.6 bits (187), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 77/267 (28%), Positives = 116/267 (43%), Gaps = 22/267 (8%)

Query: 1   GDFVTETVTLG-SASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAST---FS 56
           G F  + +T+  S +V +    C        +   G L L     S PS++  S    FS
Sbjct: 230 GTFSQDVLTVAPSVAVQDFTFVCLDAGASDGMPEVGTLDLSRDRNSLPSRLAGSASAAFS 289

Query: 57  YCLVDR-DSDSTSTLEFDSSLPPNAVTA--PLLRNHELD--TFYYLGLTGISVGGDLLPI 111
           YC+    DS    +L  D+++  +  TA  PLL + + D    Y++ + G+S+G   LPI
Sbjct: 290 YCMPQYPDSPGFLSLGDDATVRGDNCTAHAPLLSSDDPDLANMYFIDVVGMSLGDVDLPI 349

Query: 112 SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPT-DGVALFDTCYD 170
               F      N   IV++GT  T L  + Y  LRDAF +     + +  G   FDTCY+
Sbjct: 350 PSGTF----GNNASTIVEAGTTFTMLAPDAYTPLRDAFRQAMAQYNRSVPGFYDFDTCYN 405

Query: 171 FSSRSSVEVPTVSFHFPEGKVLPLPAKNYL-IPVDSNGTF---CFAFAPTSSSL----SI 222
           F+    + VP V F F  G  L +     L   + S G F   C AF+          ++
Sbjct: 406 FTGLQELTVPLVEFKFGNGDSLLIDGDQMLYYDIPSEGPFTVTCLAFSTLDVDDDDVSAV 465

Query: 223 IGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           IG      T V +++    +GF P  C
Sbjct: 466 IGAYSLATTEVVYDVAGGTVGFIPESC 492


>gi|449455475|ref|XP_004145478.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
 gi|449518962|ref|XP_004166504.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
          Length = 449

 Score = 76.3 bits (186), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 70/253 (27%), Positives = 106/253 (41%), Gaps = 38/253 (15%)

Query: 35  GLLGLGGGSLSFPSQINAST--FSYCLV----DRDSDSTSTLEFD----SSLPPNAVTAP 84
           G+ G G G LS P Q+  S   FS+C +      + + +S L       SS   N    P
Sbjct: 179 GIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKDENLQFTP 238

Query: 85  LLRNHELDTFYYLGLTGISVG-GD---LLPISETAFKIDESGNGGIIVDSGTAVTRLQTE 140
           LL++     +YY+GL  I++G GD      +S    +ID  GNGG+++DSGT  T L   
Sbjct: 239 LLKSPMYPNYYYIGLESITIGNGDNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEP 298

Query: 141 TYNALRD--AFVRGTRALSPTDGVALFDTCYDFSSRSS-------VEVPTVSFHFPEGKV 191
            Y+ L      V G       +    FD CY    +++        ++P+++FHF     
Sbjct: 299 LYSQLISNLELVIGYPRAKQVELNTGFDLCYKVPCKNNNSSFVDDAQLPSITFHFLNNVS 358

Query: 192 LPLPAKNYLI----PVDSNGTFCFAFAPTSSSLS-----------IIGNVQQQGTRVSFN 236
           + LP  N       P++S    C  +                   I G+ QQQ   V ++
Sbjct: 359 VVLPQGNNFYAMAAPINSTVVKCLLYQSMDGVGDDNDSDDNGPAGIFGSFQQQNIEVVYD 418

Query: 237 LRNSLIGFTPNKC 249
           L    +GF P  C
Sbjct: 419 LEKERLGFQPMDC 431


>gi|297825301|ref|XP_002880533.1| hypothetical protein ARALYDRAFT_481251 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297326372|gb|EFH56792.1| hypothetical protein ARALYDRAFT_481251 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 430

 Score = 76.3 bits (186), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 71/247 (28%), Positives = 103/247 (41%), Gaps = 20/247 (8%)

Query: 18  IAIGCGHNN-EGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLVDRDSDSTSTLEFDSSL 76
           IA GCGH N E L     G+LGLG    S   Q+  S FSYC+ D  + +    +     
Sbjct: 179 IAFGCGHENGEQLESEFTGILGLGAKPTSLAVQL-GSKFSYCIGDLANKNYGYNQLVLGE 237

Query: 77  PPNAVTAPLLRNHELDT-FYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVT 135
             + +  P     E +   YY+ L GISVG   L I    FK       G+I+D+GT  T
Sbjct: 238 DADILGDPTPIEFETENGIYYMNLEGISVGDKQLNIEPVVFK-RRGSRTGVILDTGTLYT 296

Query: 136 RLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEV---PTVSFHFPEGKVL 192
            L    Y   R+ +      L P      F     +  R + E+   P V+FHF  G  L
Sbjct: 297 WLADIAY---RELYNEIKSILDPKLERFWFRDFLCYHGRVNEELIGFPVVTFHFAGGAEL 353

Query: 193 PLPAKNYLIPVDSNGT----FCFAFAPTSSS------LSIIGNVQQQGTRVSFNLRNSLI 242
            + A +   P+  + T    FC +  PT+         + IG + QQ   ++++L+   I
Sbjct: 354 AMEATSMFYPMTESDTYHNVFCMSVRPTTEHGGEYKDFTAIGLMAQQYYNIAYDLKERNI 413

Query: 243 GFTPNKC 249
                 C
Sbjct: 414 YLQRIDC 420


>gi|222623568|gb|EEE57700.1| hypothetical protein OsJ_08178 [Oryza sativa Japonica Group]
          Length = 441

 Score = 76.3 bits (186), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 45/116 (38%), Positives = 63/116 (54%), Gaps = 3/116 (2%)

Query: 134 VTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLP 193
           +TRL T  Y+AL  A     +  S     ++ DTC+   + S V  P V+  F  G  L 
Sbjct: 328 ITRLPTSVYSALSKAVAAAMKGTSRASAYSILDTCFKGQA-SRVSAPAVTMSFAGGAALK 386

Query: 194 LPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           L A+N L+ VD + T C AFAP  S+ +IIGN QQQ   V +++++S IGF    C
Sbjct: 387 LSAQNLLVDVD-DSTTCLAFAPARSA-AIIGNTQQQTFSVVYDVKSSRIGFAAGGC 440


>gi|224127969|ref|XP_002329222.1| predicted protein [Populus trichocarpa]
 gi|222871003|gb|EEF08134.1| predicted protein [Populus trichocarpa]
          Length = 445

 Score = 76.3 bits (186), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 82/289 (28%), Positives = 125/289 (43%), Gaps = 47/289 (16%)

Query: 2   DFVTETVTLGS-ASVDNIAIGCGHNN--EGLFVGAAGLLGLGGGSLSFPSQINAS----- 53
           D++    TLGS +S+DN    C      +GL  G  GL  LG  +LS P QIN +     
Sbjct: 139 DYLALLNTLGSLSSIDNFIFSCARTGFLKGLAKGVTGLASLGNSNLSIPVQINKAFSSSP 198

Query: 54  -TFSYCLVDRDSDSTSTLEFDSSLPPN----------AVTAPLLRN----------HELD 92
             F+ CL    S     L F S  P N           +  PL+ N          H L 
Sbjct: 199 NCFAMCLSGSISQPGVAL-FGSKGPYNFLHGIDLSKSLLYTPLIFNPLGRDAVPNTHTLS 257

Query: 93  TFYYLGLTGISVGGDLLPISETAFKID-ESGNGGIIVDSGTAVTRLQTETYNALRDAFVR 151
             YY+GLT I V G ++  ++T   ID +SG+GG  + +    T+LQ+  Y A   AF+R
Sbjct: 258 PEYYVGLTAIKVNGKMVAFNKTLLAIDGQSGSGGTRISTVVPYTKLQSSIYKAFTLAFLR 317

Query: 152 GTRA----LSPTDGVALFDTCYDFSSRSSVE----VPTVSFHFPEGKVL-PLPAKNYLIP 202
              +    L+ T  V  F  CY   +  + +    VP +        V+  +   N ++ 
Sbjct: 318 EAASSAFNLTTTKPVKPFSVCYPAGAVKTTQMGPAVPIIELVLDRQDVVWKMFGSNSMVR 377

Query: 203 V--DSNGTFCFAF----APTSSSLSIIGNVQQQGTRVSFNLRNSLIGFT 245
           V   S   +C  F    A    S+ +IG +Q +   + F+L++  +GF+
Sbjct: 378 VTKKSVDVWCLGFVDGGAIDGPSI-MIGGLQLEDNLLQFDLQSKKLGFS 425


>gi|357127503|ref|XP_003565419.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
           distachyon]
          Length = 486

 Score = 76.3 bits (186), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 80/252 (31%), Positives = 115/252 (45%), Gaps = 31/252 (12%)

Query: 15  VDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAST-----FSYCLVD-RDSDSTS 68
           +  +  GC     G F  A GL+GLGGG +S  SQ+ A+T     FSYCL    +++++S
Sbjct: 238 IAKLDFGCSTTTTGTF-RADGLVGLGGGPVSLASQLGATTSLGRKFSYCLAPYANTNASS 296

Query: 69  TLEFDSSL---PPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGG 125
            L F S      P A + PL+   E++T+Y + L  I+V G   P +        +    
Sbjct: 297 ALNFGSRAVVSEPGAASTPLITG-EVETYYTIALDSINVAGTKRPTT--------AAQAH 347

Query: 126 IIVDSGTAVTRLQTETYNALRDAFVRGT---RALSPTDGVALFDTCYDFS---SRSSVEV 179
           IIVDSGT +T L +     L     R     RA SP     + D CYD S      ++ +
Sbjct: 348 IIVDSGTTLTYLDSALLTPLVKDLTRRIKLPRAESPEK---ILDLCYDISGVRGEDALGI 404

Query: 180 PTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS--SSLSIIGNVQQQGTRVSFNL 237
           P V+     G  + L   N  + V   G  C A   TS   S+SI+GN+ QQ   V ++L
Sbjct: 405 PDVTLVLGGGGEVTLKPDNTFVVVQ-EGVLCLALVATSERQSVSILGNIAQQNLHVGYDL 463

Query: 238 RNSLIGFTPNKC 249
               + F    C
Sbjct: 464 EKGTVTFAAADC 475


>gi|242078855|ref|XP_002444196.1| hypothetical protein SORBIDRAFT_07g014645 [Sorghum bicolor]
 gi|241940546|gb|EES13691.1| hypothetical protein SORBIDRAFT_07g014645 [Sorghum bicolor]
          Length = 100

 Score = 76.3 bits (186), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 39/78 (50%), Positives = 56/78 (71%), Gaps = 5/78 (6%)

Query: 77  PPNAVTA---PLLRNHELDTFYYLGLTGISVGGDLLP-ISETAFKIDES-GNGGIIVDSG 131
           PP+A  A   P++RN  ++TFYY+ L GIS+GG  +P ++E+  ++  S G GG+IVDSG
Sbjct: 21  PPSASAASFTPMVRNPRMETFYYVQLVGISLGGARVPGVAESDLRLAPSTGRGGVIVDSG 80

Query: 132 TAVTRLQTETYNALRDAF 149
           T+VTRL   +Y+AL DAF
Sbjct: 81  TSVTRLARRSYSALHDAF 98


>gi|15242307|ref|NP_199325.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|9758987|dbj|BAB09497.1| chloroplast nucleoid DNA-binding protein-like [Arabidopsis
           thaliana]
 gi|332007824|gb|AED95207.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 491

 Score = 75.9 bits (185), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 73/249 (29%), Positives = 111/249 (44%), Gaps = 35/249 (14%)

Query: 35  GLLGLGGGSLSFPSQIN--ASTFSYCL-----VDRDSDSTSTLEFDSSLPPNAVTA---- 83
           G+ G G G LS PSQ+      FS+C      V+  + S+  +   S+L  N   +    
Sbjct: 231 GIAGFGRGLLSLPSQLGFLEKGFSHCFLPFKFVNNPNISSPLILGASALSINLTDSLQFT 290

Query: 84  PLLRNHELDTFYYLGLTGISVGGDLLP--ISETAFKIDESGNGGIIVDSGTAVTRLQTET 141
           P+L        YY+GL  I++G ++ P  +  T  + D  GNGG++VDSGT  T L    
Sbjct: 291 PMLNTPMYPNSYYIGLESITIGTNITPTQVPLTLRQFDSQGNGGMLVDSGTTYTHLPEPF 350

Query: 142 YNALRDAF---VRGTRALSPTDGVALFDTCYDF----SSRSSVE------VPTVSFHFPE 188
           Y+ L       +   RA + T+    FD CY      ++ +S+E       P+++FHF  
Sbjct: 351 YSQLLTTLQSTITYPRA-TETESRTGFDLCYKVPCPNNNLTSLENDVMMIFPSITFHFLN 409

Query: 189 GKVLPLPAKN--YLIPVDSNGTF--CFAFAPTSS----SLSIIGNVQQQGTRVSFNLRNS 240
              L LP  N  Y +   S+G+   C  F            + G+ QQQ  +V ++L   
Sbjct: 410 NATLLLPQGNSFYAMSAPSDGSVVQCLLFQNMEDGDYGPAGVFGSFQQQNVKVVYDLEKE 469

Query: 241 LIGFTPNKC 249
            IGF    C
Sbjct: 470 RIGFQAMDC 478


>gi|356523171|ref|XP_003530215.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 442

 Score = 75.9 bits (185), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 81/277 (29%), Positives = 115/277 (41%), Gaps = 31/277 (11%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHN----NEGLFVGAAGLLGLGGGSLSFPSQINASTFS 56
           G+  ++T   GS+    I  GC ++    N        GL+G+  GSLS  SQ+    FS
Sbjct: 158 GNLASDTFGFGSSFNPGIVFGCMNSSYSTNSESDSNTTGLMGMNLGSLSLVSQLKIPKFS 217

Query: 57  YCLVDRDSDST-----STLEFDSSL---PPNAVTAPLLRNHELDTFYYLGLTGISVGGDL 108
           YC+   D         S   +  SL   P   ++ PL       + Y + L GI +   L
Sbjct: 218 YCISGSDFSGILLLGESNFSWGGSLNYTPLVQISTPLPYFDR--SAYTVRLEGIKISDKL 275

Query: 109 LPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGT----RALSPTDGV-- 162
           L IS   F  D +G G  + D GT  + L    YNALRD F+  T    RAL   + V  
Sbjct: 276 LNISGNLFVPDHTGAGQTMFDLGTQFSYLLGPVYNALRDEFLNQTNGTLRALDDPNFVFQ 335

Query: 163 ALFDTCYDFSSRSSV--EVPTVSFHFPEGKVLPLPAKNYLIPVD-----SNGTFCFAFAP 215
              D CY      S   E+P+VS  F EG  + +     L  V      ++  +CF F  
Sbjct: 336 IAMDLCYRVPVNQSELPELPSVSLVF-EGAEMRVFGDQLLYRVPGFVWGNDSVYCFTFGN 394

Query: 216 TS---SSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           +        IIG+  QQ   + F+L    +G    +C
Sbjct: 395 SDLLGVEAFIIGHHHQQSMWMEFDLVEHRVGLAHARC 431


>gi|296085499|emb|CBI29231.3| unnamed protein product [Vitis vinifera]
          Length = 308

 Score = 75.9 bits (185), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 73/259 (28%), Positives = 111/259 (42%), Gaps = 46/259 (17%)

Query: 1   GDFVTETVTLGS-----ASVDNIAIGCGHNNEGLF-----VGAAGLLGLGGGSLSFPSQI 50
           G   +ET T+GS     AS   +A GCGH+N G F            G     +   S++
Sbjct: 83  GYLSSETFTIGSTEGDPASFPGLAFGCGHSNGGTFNEKDSGLIGLGGGPLSLVMQLSSKV 142

Query: 51  NASTFSYCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLP 110
               FSYCLV   SDST++ + +                        G + +  G     
Sbjct: 143 GGQ-FSYCLVPLSSDSTASSKIN-----------------------FGKSAVVSGSG--- 175

Query: 111 ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYD 170
            + +    +ES    II+DSGT +T L  + Y  +  A  +     + TD    F  CY 
Sbjct: 176 -TSSPAAAEESN---IIIDSGTTLTLLPRDFYTDMESALTKVIGGQTTTDPRGTFSLCY- 230

Query: 171 FSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQG 230
            S    +E+PT++ HF  G  + LP  N  +    +   CF+  P SS+L+I GN+ Q  
Sbjct: 231 -SGVKKLEIPTITAHF-IGADVQLPPLNTFVQAQED-LVCFSMIP-SSNLAIFGNLSQMN 286

Query: 231 TRVSFNLRNSLIGFTPNKC 249
             V ++L+N+ + F P  C
Sbjct: 287 FLVGYDLKNNKVSFKPTDC 305


>gi|356558304|ref|XP_003547447.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-1-like [Glycine max]
          Length = 336

 Score = 75.9 bits (185), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 74/259 (28%), Positives = 109/259 (42%), Gaps = 30/259 (11%)

Query: 4   VTETVTLGSASVDNIAIGCGHN-NEGLFVGAAGLLGLGGGSLSFPSQINASTFSYC---L 59
           V ET   G + + ++ + CGHN       G  G+ GL  G  S  ++I    FSYC   L
Sbjct: 92  VFETTDEGHSQIFDVLVRCGHNIGFNTDPGYNGIRGLNNGPNSLATKI-GQKFSYCVGNL 150

Query: 60  VDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKID 119
            D   +    +  + +      + P   +H    FYY+ L GI VG   L I+   F+I 
Sbjct: 151 ADPYYNYNQLILCEGA-DLEGYSTPFEVHH---GFYYVTLKGIIVGEKRLDIAPITFEIK 206

Query: 120 ESGNGGIIVDSGTAVTRL----QTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRS 175
            +  GG+I DSGT +T L        YN +R+      R L            Y   SR 
Sbjct: 207 GNNTGGVIRDSGTTITYLVDSVHKLLYNEVRNLLSWSFRQLCH----------YGIISRD 256

Query: 176 SVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP-----TSSSLSIIGNVQQQG 230
            V  P V+FHF +G  L L   ++   +  N   C   +P     T+ S S+I  + QQ 
Sbjct: 257 LVGFPVVTFHFADGADLALDTGSFFNQL--NSILCMTVSPASILNTTISPSVIELLAQQS 314

Query: 231 TRVSFNLRNSLIGFTPNKC 249
             V ++L  + + F    C
Sbjct: 315 YNVGYDLLTNFVYFQRIDC 333


>gi|125533812|gb|EAY80360.1| hypothetical protein OsI_35532 [Oryza sativa Indica Group]
          Length = 428

 Score = 75.9 bits (185), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 66/251 (26%), Positives = 112/251 (44%), Gaps = 28/251 (11%)

Query: 14  SVDNIAIGCGHNNEGL--FVGAAGLLGLGGGSLSFPSQINAS--TFSYCLVDRDSD---- 65
            + +   GC  ++ G   F    GLLG+G G +S   Q +     FSYCL  + S+    
Sbjct: 186 KIPSFTFGCNLDSFGANEFGNVDGLLGMGAGPMSVLKQSSPRFDGFSYCLPLQKSERGFF 245

Query: 66  STSTLEF---DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESG 122
           S +T  F     +   +     ++   +    +++ L  ISV G+ L +S + F      
Sbjct: 246 SKTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIF-----S 300

Query: 123 NGGIIVDSGTAVTRLQTETYNAL----RDAFVRGTRALSPTDGVALFDTCYDFSSRSSVE 178
             G++ DSG+ ++ +     + L    R+  +R   A   ++       CYD  S    +
Sbjct: 301 RKGVVFDSGSELSYIPDRALSVLSQRIRELLLRRGAAEEESE-----RNCYDMRSVDEGD 355

Query: 179 VPTVSFHFPEGKVLPLPAKNYLIP--VDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFN 236
           +P +S HF +G    L +    +   V     +C AFAPT S +SIIG++ Q    V ++
Sbjct: 356 MPAISLHFDDGARFDLGSHGVFVERSVQEQDVWCLAFAPTES-VSIIGSLMQTSKEVVYD 414

Query: 237 LRNSLIGFTPN 247
           L+  LIG  P+
Sbjct: 415 LKRQLIGIGPS 425


>gi|224029721|gb|ACN33936.1| unknown [Zea mays]
 gi|413946782|gb|AFW79431.1| pepsin A [Zea mays]
          Length = 534

 Score = 75.9 bits (185), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 73/263 (27%), Positives = 118/263 (44%), Gaps = 18/263 (6%)

Query: 5   TETVTLGS-ASVDNIAIGCGHNNEGLFVGAA-GLLGLGGGSLSFP---SQINASTFSYCL 59
           T TV+ G  A +  + +GC     G  V A  G+L LG G +SF    ++     FS+CL
Sbjct: 244 TVTVSDGRMAKLPGLILGCSVLEAGGSVDAHDGVLSLGNGDMSFAVHAAKRFGQRFSFCL 303

Query: 60  VDRDS--DSTSTLEFD---SSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISET 114
           +  +S  D++S L F    + + P  +   +L N ++   Y   +TG+ VGG+ L I + 
Sbjct: 304 LSANSSRDASSYLTFGPNPAVMGPGTMETDILYNVDVKPAYGAQVTGVLVGGERLDIPDE 363

Query: 115 AFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFS-- 172
            +  +    GG+I+D+ T+VT L  E Y  +  A  R    L     +  F+ CY ++  
Sbjct: 364 VWDAERFVGGGVILDTSTSVTSLVPEAYAPVTAALDRHLSHLPRVYELEGFEYCYKWTFT 423

Query: 173 -----SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP-TSSSLSIIGNV 226
                   +V +P+ +     G  L   AK+ ++P    G  C AF         I+GNV
Sbjct: 424 GDGVDPAHNVTIPSFTVEMAGGARLEPEAKSVVMPEVEPGVACLAFRKLLRGGPGILGNV 483

Query: 227 QQQGTRVSFNLRNSLIGFTPNKC 249
             Q      +  +  I F  +KC
Sbjct: 484 FMQEYIWEIDHGDGKIRFRKDKC 506


>gi|226491620|ref|NP_001149154.1| pepsin A precursor [Zea mays]
 gi|195625132|gb|ACG34396.1| pepsin A [Zea mays]
          Length = 537

 Score = 75.9 bits (185), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 73/266 (27%), Positives = 119/266 (44%), Gaps = 18/266 (6%)

Query: 2   DFVTETVTLGS-ASVDNIAIGCGHNNEGLFVGAA-GLLGLGGGSLSFP---SQINASTFS 56
           +  T TV+ G  A +  + +GC     G  V A  G+L LG G +SF    ++     FS
Sbjct: 244 EKATVTVSDGRMAKLPGLILGCSVLEAGGSVDAHDGVLSLGNGDMSFAVHAAKRFGQRFS 303

Query: 57  YCLVDRDS--DSTSTLEFD---SSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPI 111
           +CL+  +S  D++S L F    + + P  +   +L N ++   Y   +TG+ VGG+ L I
Sbjct: 304 FCLLSANSSRDASSYLTFGPNPAVMGPGTMETDILYNVDVKPAYGAKVTGVLVGGERLDI 363

Query: 112 SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDF 171
            +  +  +    GG+I+D+ T+VT L  E Y  +  A  R    L     +  F+ CY +
Sbjct: 364 PDEVWDAERFVGGGVILDTSTSVTSLVPEAYAPVTAALDRHLSHLPRVYELEGFEYCYKW 423

Query: 172 S-------SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP-TSSSLSII 223
           +          +V +P+ +     G  L   AK+ ++P    G  C AF         I+
Sbjct: 424 TFTGDGVXPAHNVTIPSFTVEMAGGARLEPEAKSVVMPEVEPGVACLAFRKLLRGGPGIL 483

Query: 224 GNVQQQGTRVSFNLRNSLIGFTPNKC 249
           GNV  Q      +  +  I F  +KC
Sbjct: 484 GNVFMQEYIWEIDHGDGKIRFRKDKC 509


>gi|359482287|ref|XP_002263129.2| PREDICTED: aspartic proteinase-like protein 2 isoform 2 [Vitis
           vinifera]
 gi|297740017|emb|CBI30199.3| unnamed protein product [Vitis vinifera]
          Length = 502

 Score = 75.5 bits (184), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 67/224 (29%), Positives = 99/224 (44%), Gaps = 16/224 (7%)

Query: 35  GLLGLGGGSLSFPSQ-----INASTFSYCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNH 89
           G+ G G   LS  SQ     I    FS+CL   + D    L     L PN + +PL+ + 
Sbjct: 229 GIFGFGQQDLSVVSQLSSLGITPKVFSHCL-KGEGDGGGKLVLGEILEPNIIYSPLVPSQ 287

Query: 90  ELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAF 149
              + Y L L  ISV G LLPI    F    S N G IVDSGT +T L    Y+    A 
Sbjct: 288 ---SHYNLNLQSISVNGQLLPIDPAVFA--TSNNQGTIVDSGTTLTYLVETAYDPFVSA- 341

Query: 150 VRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPV---DSN 206
           +  T + S T  ++  + CY  S+      P VS +F  G  + L    YL+ +   D  
Sbjct: 342 ITATVSSSTTPVLSKGNQCYLVSTSVDEIFPPVSLNFAGGASMVLKPGEYLMHLGFSDGA 401

Query: 207 GTFCFAFAPTSS-SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
             +C  F   +   ++I+G++  +     ++L +  IG+    C
Sbjct: 402 AMWCIGFQKVAEPGITILGDLVLKDKIFVYDLAHQRIGWANYDC 445


>gi|413950928|gb|AFW83577.1| aspartic proteinase nepenthesin-2 [Zea mays]
          Length = 163

 Score = 75.5 bits (184), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 51/165 (30%), Positives = 83/165 (50%), Gaps = 10/165 (6%)

Query: 91  LDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFV 150
           +  FY + + G+SV G+LL I    + + +   GG I+DSGT++T L +  Y A+  A  
Sbjct: 1   MRPFYAVAVNGVSVDGELLRIPRLVWDVQK--GGGAILDSGTSLTVLVSPAYRAVVAALG 58

Query: 151 RGTRALSPTDGVALFDTCYDFSS-----RSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDS 205
           +    L P   +  FD CY+++S       +V VP ++ HF     L  P K+Y+I   +
Sbjct: 59  KKLVGL-PRVAMDPFDYCYNWTSPLTGEDLAVAVPALAVHFAGSARLQPPPKSYVIDA-A 116

Query: 206 NGTFCFAFAP-TSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
            G  C          +S+IGN+ QQ     F+L+N  + F  ++C
Sbjct: 117 PGVKCIGLQEGDWPGVSVIGNILQQEHLWEFDLKNRRLRFKRSRC 161


>gi|255563741|ref|XP_002522872.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223537956|gb|EEF39570.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 448

 Score = 75.5 bits (184), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 74/260 (28%), Positives = 108/260 (41%), Gaps = 22/260 (8%)

Query: 10  LGSASVDNIAI--GCGHNNEGL-----FVGAAGLLGLGGGSLSFPSQINAST---FSYCL 59
           L SA  D I    GC  +N+            G++GL    +S   Q+N  T   FSYCL
Sbjct: 186 LQSAENDRIPFYFGCSRDNQNFSTFESSGKGGGIIGLNMSPVSLLQQMNHITKNRFSYCL 245

Query: 60  ----VDRDSDSTSTLEFDSSLPPNA---VTAPLLRNHELDTFYYLGLTGISVGGDLLPIS 112
               +   S +TS L F + +  +    ++ P +    +   Y+L L  +SV G+ + I 
Sbjct: 246 NLFDLSSPSHATSLLRFGNDIRKSRRKYLSTPFVSPRGMPN-YFLNLIDVSVAGNRMQIP 304

Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR--GTRALSPTDGVALFDTCYD 170
              F +   G GG I+DSGTAVT +    Y  +  AF            +       CY 
Sbjct: 305 PGTFALKPDGTGGTIIDSGTAVTYISQTAYFPVITAFKNYFDQHGFQRVNIQLSGYICYK 364

Query: 171 FSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS-SSLSIIGNVQQQ 229
               +    P+++FHF        P   YL  V   G FC A  P S    +IIG + Q 
Sbjct: 365 QQGHTFHNYPSMAFHFQGADFFVEPEYVYLT-VQDRGAFCVALQPISPQQRTIIGALNQA 423

Query: 230 GTRVSFNLRNSLIGFTPNKC 249
            T+  ++  N  + FTP  C
Sbjct: 424 NTQFIYDAANRQLLFTPENC 443


>gi|242091057|ref|XP_002441361.1| hypothetical protein SORBIDRAFT_09g025220 [Sorghum bicolor]
 gi|241946646|gb|EES19791.1| hypothetical protein SORBIDRAFT_09g025220 [Sorghum bicolor]
          Length = 439

 Score = 75.5 bits (184), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 70/256 (27%), Positives = 108/256 (42%), Gaps = 44/256 (17%)

Query: 35  GLLGLGGGSLSFPSQIN--ASTFSYCLVD----RDSDSTSTLEF------DSSLPPNAVT 82
           G+ G G G+LS PSQ+      FS+C +     R+ + TS L         +S     V 
Sbjct: 183 GIAGFGRGALSLPSQLGFLGKGFSHCFLGFRFARNPNFTSPLVMGDLALSSASTDGGFVF 242

Query: 83  APLLRNHELDTFYYLGLTGISVG----GDLLPISETAFKIDESGNGGIIVDSGTAVTRLQ 138
            P+L +     FYY+GL G+ +G    G  +    +   ID  GNGG++VD+GT  T+L 
Sbjct: 243 TPMLTSATYPNFYYVGLEGVVLGDDDGGSAMAAPPSLSGIDAQGNGGVLVDTGTTYTQLP 302

Query: 139 TETYNALRDAFVRG------TRALSPTDGVALFDTCYDF-SSRSSV---EVPTVSFHFPE 188
              Y ++  + +        +R L    G   FD C+    +R+     E+P ++ H   
Sbjct: 303 DPFYASVLASLISAAPPYERSRDLEARTG---FDLCFKVPCARAPCADDELPPITLHLAG 359

Query: 189 GKVLPLPAKNYLIPV----DSNGTFCFAF-----------APTSSSLSIIGNVQQQGTRV 233
           G  L LP  +   PV    DS    C  F                  +++G+ Q Q   V
Sbjct: 360 GARLALPKLSSYYPVTAIRDSVVVKCLLFQRMEMEDDGDGTSGGGPAAVLGSFQMQNVEV 419

Query: 234 SFNLRNSLIGFTPNKC 249
            ++L    +GF P  C
Sbjct: 420 VYDLAAGRVGFRPRDC 435


>gi|255576176|ref|XP_002528982.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223531572|gb|EEF33401.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 542

 Score = 75.1 bits (183), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 66/241 (27%), Positives = 100/241 (41%), Gaps = 22/241 (9%)

Query: 18  IAIGCGHNNEGLF---VGAAGLLGLGGGSLSFPSQINAS-----TFSYCLVDRDSDSTST 69
           + IGCG    G +   V   GL+GLG   +S PS +  +     +FS C    D D +  
Sbjct: 235 VVIGCGMKQSGGYLDGVAPDGLMGLGLAEISVPSFLAKAGLIRNSFSMCF---DEDDSGR 291

Query: 70  LEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVD 129
           + F    P    + P L      T Y +G+ G  VG   L   +T+F+         +VD
Sbjct: 292 IFFGDQGPTTQQSTPFLTLDGNYTTYVVGVEGFCVGSSCL--KQTSFRA--------LVD 341

Query: 130 SGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEG 189
           +GT+ T L    Y  + + F R   A   +     +  CY  SS    +VP+V   FP  
Sbjct: 342 TGTSFTFLPNGVYERITEEFDRQVNATISSFNGYPWKYCYKSSSNHLTKVPSVKLIFPLN 401

Query: 190 KVLPLPAKNYLIP-VDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNK 248
               +    ++I  +     FC A  PT   +  IG     G RV F+  N  +G++ + 
Sbjct: 402 NSFVIHNPVFMIYGIQGITGFCLAIQPTEGDIGTIGQNFMAGYRVVFDRENMKLGWSHSS 461

Query: 249 C 249
           C
Sbjct: 462 C 462


>gi|388498308|gb|AFK37220.1| unknown [Lotus japonicus]
          Length = 363

 Score = 75.1 bits (183), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 48/121 (39%), Positives = 67/121 (55%), Gaps = 10/121 (8%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAS---TFSY 57
           G+   E ++ G  SV N   GCG NN+GLF G +GL+GLG  +LS  SQ N++    FSY
Sbjct: 237 GELGAEHLSFGGISVSNFVFGCGKNNKGLFGGVSGLMGLGRSNLSLISQTNSTFGGVFSY 296

Query: 58  CLVDRDSDSTSTLEFDS------SLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPI 111
           CL   D+ ++ +L   +      +L P A T  ++ N +L  FY L LTGI VG  L  +
Sbjct: 297 CLPPTDAGASGSLAMGNESSVFKNLTPIAYTR-MVPNPQLSNFYMLNLTGIDVGVWLFKL 355

Query: 112 S 112
            
Sbjct: 356 Q 356


>gi|224136436|ref|XP_002322329.1| predicted protein [Populus trichocarpa]
 gi|222869325|gb|EEF06456.1| predicted protein [Populus trichocarpa]
          Length = 486

 Score = 75.1 bits (183), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 67/242 (27%), Positives = 103/242 (42%), Gaps = 28/242 (11%)

Query: 35  GLLGLGGGSLSFPSQIN--ASTFSYCLVDRDSDSTSTLEFDSSLPPNAVTA-------PL 85
           G+ G G G+LS  SQ+      FS+C +     +   +     +   A+T+       P+
Sbjct: 234 GIAGFGRGTLSMVSQLGFLQKGFSHCFLAFKYANNPNISSPLVVGDIALTSKDDMQFTPM 293

Query: 86  LRNHELDTFYYLGLTGISVGG-DLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNA 144
           L +     FYY+GL  I+VG      +  +  + D  GNGG+ +DSGT  T L    Y+ 
Sbjct: 294 LNSPMYPNFYYVGLEAITVGNVSATEVPSSLREFDSLGNGGMKIDSGTTYTHLPEPFYSQ 353

Query: 145 LRDAFVRGTRALSPTDGVAL---FDTCYDFSS------RSSVEVPTVSFHFPEGKVLPLP 195
           +  + ++ T       G+ +   FD CY           S   +P+++FHF     L LP
Sbjct: 354 VL-SILQSTINYPRDTGMEMQTGFDLCYKVPRPNNNTLTSDDLLPSITFHFLNNVSLVLP 412

Query: 196 AKNYLIPVDSNG----TFCFAFAPTSS----SLSIIGNVQQQGTRVSFNLRNSLIGFTPN 247
             N+  PV + G      C  F  T         + G+ QQQ   V ++L    IGF P 
Sbjct: 413 QGNHFYPVSAPGNPAVVKCLMFQSTDDGDDGPAGVFGSFQQQNVEVVYDLEKERIGFQPM 472

Query: 248 KC 249
            C
Sbjct: 473 DC 474


>gi|302797823|ref|XP_002980672.1| hypothetical protein SELMODRAFT_113025 [Selaginella moellendorffii]
 gi|300151678|gb|EFJ18323.1| hypothetical protein SELMODRAFT_113025 [Selaginella moellendorffii]
          Length = 152

 Score = 75.1 bits (183), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 52/149 (34%), Positives = 70/149 (46%), Gaps = 10/149 (6%)

Query: 111 ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALF-DTCY 169
           I  +AFKID  GNGG   DSGT V+ L    + AL +AF R    L+ T G     + CY
Sbjct: 1   IPRSAFKIDRLGNGGTYFDSGTTVSFLVEPAHTALVEAFGRRVLHLNRTSGSDFTNELCY 60

Query: 170 DFSSRSSV--EVPTVSFHFPEGKVLPLPAKNYLIPVDSNG---TFCFAF----APTSSSL 220
           D ++  S     P V+ HF     + L   +  +P+       T C AF    A     +
Sbjct: 61  DVAAGYSRLPRAPLVTLHFKNNVDMELREASVWVPLARTPQVVTICLAFVNAGAVAQGGV 120

Query: 221 SIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           ++IGN QQQ   +  +L  S IGF P  C
Sbjct: 121 NVIGNYQQQDYLIEHDLERSRIGFAPANC 149


>gi|226500708|ref|NP_001149229.1| aspartic proteinase nepenthesin-2 [Zea mays]
 gi|195625632|gb|ACG34646.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
          Length = 471

 Score = 75.1 bits (183), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 67/256 (26%), Positives = 104/256 (40%), Gaps = 24/256 (9%)

Query: 12  SASVDNIAIGCGHNNEGLFVGAA--GLLGLGGGSLSFPSQINASTFSYCLVDRDSDSTST 69
           S S   +AIGC  +    F   +  G+ GLG  + S P Q+N S FSYCL         +
Sbjct: 218 SQSFKEVAIGCSTSATLKFKDPSIKGVFGLGRSATSLPRQLNFSKFSYCLSSYQEPDLPS 277

Query: 70  LEFDSSLP----------PNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKID 119
               ++ P              T  L  N +  T Y++ L  IS+GG   P   T     
Sbjct: 278 YLLLTAAPDMATGAVGGGAAVATTALQPNSDYKTLYFVHLQNISIGGTRFPAVST----- 332

Query: 120 ESGNGGIIVDSGTAVTRLQTETYNAL---RDAFVRGTRALSPTDGVALFDTCYDFSSRSS 176
           +SG G + VD+G + TRL+   +  L    D  ++  + +    G      CY   S ++
Sbjct: 333 KSG-GNMFVDTGASFTRLEGTVFAKLVTELDRIMKERKYVKEQPGRNNGQICYSPPSTAA 391

Query: 177 VE---VPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRV 233
            E   +P +  HF +   + LP  +YL    S        +     +S++GN Q Q T +
Sbjct: 392 DESSKLPDMVLHFADSANMVLPWDSYLWKTTSKLCLAIYKSNIKGGISVLGNFQMQNTHM 451

Query: 234 SFNLRNSLIGFTPNKC 249
             +  N  + F    C
Sbjct: 452 LLDTGNEKLSFVRADC 467


>gi|125532796|gb|EAY79361.1| hypothetical protein OsI_34489 [Oryza sativa Indica Group]
          Length = 405

 Score = 75.1 bits (183), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 69/269 (25%), Positives = 109/269 (40%), Gaps = 38/269 (14%)

Query: 9   TLGSASVDNIAIGCGHNNEGL------------FVGAAGLLGLGGGSLSFPSQINASTFS 56
           T G A  D  AIG      G               G +G++GLG    S  +Q+N + FS
Sbjct: 143 TGGMAGTDTFAIGAAKETLGFGCVVMTDKRLKTIGGPSGIVGLGRTPWSLVTQMNVTAFS 202

Query: 57  YCLVDRDSDS----TSTLEF----DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDL 108
           YCL  + S +     +  +     +SS P    T+    ++  + +Y + L GI  GG  
Sbjct: 203 YCLAGKSSGALFLGATAKQLAGGKNSSTPFVIKTSAGSSDNGSNPYYMVKLAGIKAGGAP 262

Query: 109 LPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTC 168
           L       +   S    +++D+ +  + L    Y AL+ A                +D C
Sbjct: 263 L-------QAASSSGSTVLLDTVSRASYLADGAYKALKKALTAAVGVQPVASPPKPYDLC 315

Query: 169 YDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS--------SL 220
             FS   + + P + F F  G  L +P  NYL+    NGT C     ++S          
Sbjct: 316 --FSKAVAGDAPELVFTFDGGAALTVPPANYLL-ASGNGTVCLTIGSSASLNLTGELEGA 372

Query: 221 SIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           SI+G++QQ+   V F+L+   + F P  C
Sbjct: 373 SILGSLQQENVHVLFDLKEETLSFKPADC 401


>gi|18400416|ref|NP_565559.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|20197296|gb|AAM15014.1| predicted protein [Arabidopsis thaliana]
 gi|330252412|gb|AEC07506.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 458

 Score = 75.1 bits (183), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 72/246 (29%), Positives = 101/246 (41%), Gaps = 19/246 (7%)

Query: 18  IAIGCGHNN-EGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLVDRDSDSTSTLEFDSSL 76
           IA GCG+ N E L     G+LGLG    S   Q+  S FSYC+ D  + +    +     
Sbjct: 208 IAFGCGYENGEQLESHFTGILGLGAKPTSLAVQL-GSKFSYCIGDLANKNYGYNQLVLGE 266

Query: 77  PPNAVTAPLLRNHELD-TFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVT 135
             + +  P     E + + YY+ L GISVG   L I    FK       G+I+DSGT  T
Sbjct: 267 DADILGDPTPIEFETENSIYYMNLEGISVGDTQLNIEPVVFK-RRGPRTGVILDSGTLYT 325

Query: 136 RLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFSSRSSVEV---PTVSFHFPEGKVL 192
            L    Y   R+ +      L P      F     +  R S E+   P V+FHF  G  L
Sbjct: 326 WLADIAY---RELYNEIKSILDPKLERFWFRDFLCYHGRVSEELIGFPVVTFHFAGGAEL 382

Query: 193 PLPAKNYLIPVDSNGT---FCFAFAPTS------SSLSIIGNVQQQGTRVSFNLRNSLIG 243
            + A +   P+    T   FC +  PT          + IG + QQ   + ++L+   I 
Sbjct: 383 AMEATSMFYPLSEPNTFNVFCMSVKPTKEHGGEYKEFTAIGLMAQQYYNIGYDLKEKNIY 442

Query: 244 FTPNKC 249
                C
Sbjct: 443 LQRIDC 448


>gi|125558627|gb|EAZ04163.1| hypothetical protein OsI_26305 [Oryza sativa Indica Group]
          Length = 404

 Score = 75.1 bits (183), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 81/264 (30%), Positives = 111/264 (42%), Gaps = 55/264 (20%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
           G   TET+ +G AS   +  GC   N G+   ++G++GLG   LS  SQ+  + FSYCL 
Sbjct: 178 GYLATETLHVGGASFPGVTFGCSTEN-GVGNSSSGIVGLGRSPLSLVSQVGVARFSYCLR 236

Query: 61  DRDSDSTSTLEFDSSLPP---NAVTAPLLRNHEL--DTFYYLGLTGISVGGDLLPISETA 115
                  S + F S       N  + PLL N E+   ++YY+ LTGI+VG   LP+    
Sbjct: 237 SNADAGDSPILFGSLAKVTGGNVQSTPLLENPEMPSSSYYYVNLTGITVGATDLPM---- 292

Query: 116 FKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYD---FS 172
                            A+  L T          V GTR          FD C+D     
Sbjct: 293 -----------------AMANLTT----------VNGTR--------FGFDLCFDATAAG 317

Query: 173 SRSSVEVPTVSFHFPEGKVLPLPAKNY--LIPVDSNGTF---CFAFAPTSS--SLSIIGN 225
               V VPT+   F  G    +  ++Y  ++ VDS G     C    P S   S+SIIGN
Sbjct: 318 GGGGVPVPTLVLRFAGGAEYAVRRRSYFGVVEVDSQGRAAVECLLVLPASEKLSISIIGN 377

Query: 226 VQQQGTRVSFNLRNSLIGFTPNKC 249
           V Q    V ++L   +  F P  C
Sbjct: 378 VMQMDLHVLYDLDGGMFSFAPADC 401


>gi|414587000|tpg|DAA37571.1| TPA: hypothetical protein ZEAMMB73_036171 [Zea mays]
          Length = 459

 Score = 74.7 bits (182), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 77/281 (27%), Positives = 115/281 (40%), Gaps = 43/281 (15%)

Query: 8   VTLGSASVDNIAIG----------CGHNNEG-LFVGAAGLLGLGGGSLSFPSQINASTFS 56
           VT G+ ++D +AIG          C  ++ G     A+GL+GLG G LS  SQ++   F 
Sbjct: 179 VTKGTLAIDKLAIGGDVFHAVVFGCSDSSVGGPAAQASGLVGLGRGPLSLVSQLSVHRFM 238

Query: 57  YCLVDRDSDSTSTLEFDSSLPP-----NAVTAPLLRNHELDTFYYLGLTGISVGGDLLPI 111
           YCL    S ++  L   +         + VT  +  +    ++YYL L G++VG      
Sbjct: 239 YCLPPPMSRTSGKLVLGAGADAVRNMSDRVTVTMSSSTRYPSYYYLNLDGLAVGDQTPGT 298

Query: 112 SETA-------------------FKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRG 152
           +  A                        +   G+IVD  + ++ L+T  Y+ L D     
Sbjct: 299 TRNATSPPSGGAGGGGGGGGGGIVGAGGANAYGMIVDVASTISFLETSLYDELADDLEEE 358

Query: 153 TRALSPTDGVAL-FDTCYDFSS---RSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGT 208
            R    T  + L  D C+          V VPTVS  F +G+ L L      +   ++G 
Sbjct: 359 IRLPRATPSLRLGLDLCFILPEGVGMDRVYVPTVSLSF-DGRWLELDRDRLFV---TDGR 414

Query: 209 FCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
                   +S +SI+GN Q Q  RV FNLR   I F    C
Sbjct: 415 MMCLMIGRTSGVSILGNFQLQNMRVLFNLRRGKITFAKASC 455


>gi|302783200|ref|XP_002973373.1| hypothetical protein SELMODRAFT_98841 [Selaginella moellendorffii]
 gi|300159126|gb|EFJ25747.1| hypothetical protein SELMODRAFT_98841 [Selaginella moellendorffii]
          Length = 389

 Score = 74.7 bits (182), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 72/273 (26%), Positives = 125/273 (45%), Gaps = 27/273 (9%)

Query: 1   GDFVTETVTLGSAS----VDNIAIGCGHNNEGLF--VGAAGLLGLGGGSLSFPSQINA-- 52
           GD V++  T+ S        N+++GCG ++ GL   +  +G +G   G++SF  Q++A  
Sbjct: 89  GDLVSDIATMDSVRNRKVAANLSLGCGRDSGGLLELLDTSGFVGFDKGNVSFMGQLSALG 148

Query: 53  --STFSYCLVD---RDSDSTSTLEF-DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGG 106
             S F YCL     R        +  ++S+  +    P++ N +    Y++ L+ IS+  
Sbjct: 149 YRSKFIYCLPSDTFRGKLVIGNYKLRNASISSSMAYTPMITNPQAAELYFINLSTISIDK 208

Query: 107 DLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRAL-----SPTDG 161
           +   +    F    +G GG ++D+ T ++ L ++ Y  L  A    T  L     S  D 
Sbjct: 209 NKFQVPIQGFL--SNGTGGTVIDTTTFLSYLTSDFYTQLVQAIKNYTTNLVEVSSSVADA 266

Query: 162 VALFDTCYDFSSRSSVEVP-TVSFHFPEGKVLPLPAKNYLIPVDS-NGTFCFAFAPTSS- 218
           + + + CY+ S+ S    P T+++HF  G  + +     L   DS N T C A   + S 
Sbjct: 267 LGV-ELCYNISANSDFPPPATLTYHFLGGAGVEVSTWFLLDDSDSVNNTICMAIGRSESV 325

Query: 219 --SLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
             +L++IG  QQ    V ++L     GF    C
Sbjct: 326 GPNLNVIGTYQQLDLTVEYDLEQMRYGFGAQGC 358


>gi|238007638|gb|ACR34854.1| unknown [Zea mays]
 gi|413948713|gb|AFW81362.1| pepsin A [Zea mays]
          Length = 538

 Score = 74.7 bits (182), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 74/263 (28%), Positives = 119/263 (45%), Gaps = 18/263 (6%)

Query: 5   TETVTLGS-ASVDNIAIGCGHNNEGLFVGAA-GLLGLGGGSLSFP---SQINASTFSYCL 59
           T TV+ G  A +  + +GC     G  V A  G+L LG G +SF    ++     FS+CL
Sbjct: 246 TVTVSDGRMAKLPGLILGCSVLEAGGSVDAHDGVLSLGNGEMSFAVHAAKRFGQRFSFCL 305

Query: 60  VDRDS--DSTSTLEF---DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISET 114
           +  +S  D++S L F    + + P  +   ++ N ++   Y   +TGI VGG+ L I + 
Sbjct: 306 LSANSSRDASSYLTFGPNPAVMGPGTMETDIVYNVDVKPAYGPLVTGIFVGGERLDIPQE 365

Query: 115 AFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDFS-- 172
            +  ++   GG+I+D+ T+VT L  E Y A+  A  R    L     +  F+ CY ++  
Sbjct: 366 IWDAEKVVGGGVILDTSTSVTSLVPEAYAAVTSALDRHLSHLPRVYELDGFEYCYRWTFA 425

Query: 173 -----SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP-TSSSLSIIGNV 226
                   +V VP ++     G  L   AK+ ++P    G  C AF         I+GNV
Sbjct: 426 GDGVDLAHNVTVPRLTVEMAGGARLEPEAKSVVMPEVVPGVACLAFRKLPRGGPGILGNV 485

Query: 227 QQQGTRVSFNLRNSLIGFTPNKC 249
             Q      +     + F  +KC
Sbjct: 486 LMQEYIWEIDHGKGKMRFRKDKC 508


>gi|293333354|ref|NP_001169607.1| uncharacterized protein LOC100383488 [Zea mays]
 gi|224030351|gb|ACN34251.1| unknown [Zea mays]
          Length = 342

 Score = 74.7 bits (182), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 77/281 (27%), Positives = 115/281 (40%), Gaps = 43/281 (15%)

Query: 8   VTLGSASVDNIAIG----------CGHNNEG-LFVGAAGLLGLGGGSLSFPSQINASTFS 56
           VT G+ ++D +AIG          C  ++ G     A+GL+GLG G LS  SQ++   F 
Sbjct: 62  VTKGTLAIDKLAIGGDVFHAVVFGCSDSSVGGPAAQASGLVGLGRGPLSLVSQLSVHRFM 121

Query: 57  YCLVDRDSDSTSTLEFDSSLPP-----NAVTAPLLRNHELDTFYYLGLTGISVGGDLLPI 111
           YCL    S ++  L   +         + VT  +  +    ++YYL L G++VG      
Sbjct: 122 YCLPPPMSRTSGKLVLGAGADAVRNMSDRVTVTMSSSTRYPSYYYLNLDGLAVGDQTPGT 181

Query: 112 SETA-------------------FKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRG 152
           +  A                        +   G+IVD  + ++ L+T  Y+ L D     
Sbjct: 182 TRNATSPPSGGAGGGGGGGGGGIVGAGGANAYGMIVDVASTISFLETSLYDELADDLEEE 241

Query: 153 TRALSPTDGVAL-FDTCYDFSS---RSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGT 208
            R    T  + L  D C+          V VPTVS  F +G+ L L      +   ++G 
Sbjct: 242 IRLPRATPSLRLGLDLCFILPEGVGMDRVYVPTVSLSF-DGRWLELDRDRLFV---TDGR 297

Query: 209 FCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
                   +S +SI+GN Q Q  RV FNLR   I F    C
Sbjct: 298 MMCLMIGRTSGVSILGNFQLQNMRVLFNLRRGKITFAKASC 338


>gi|359476206|ref|XP_002262837.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 462

 Score = 74.7 bits (182), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 82/257 (31%), Positives = 115/257 (44%), Gaps = 19/257 (7%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGG-SLSFPSQINAS---TFS 56
           G FV + VTL          GCG +  G F  A+G+LGL  G   S  SQ  +     FS
Sbjct: 206 GVFVCDEVTLKPDVFPKFQFGCGDSGGGDFGSASGVLGLAQGEQYSLISQTASKFKKKFS 265

Query: 57  YCLVDRDSDSTSTL--EFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISET 114
           YC    ++   S L  E   S  P+     LL N    + Y++ L GISV    L +S +
Sbjct: 266 YCFPHNENTRGSLLFGEKAISASPSLKFTRLL-NPSSGSVYFVELIGISVAKKRLNVSSS 324

Query: 115 AFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR---GTRALSPTDGVALFDTCYDF 171
            F      + G I+DSGT +T L T  Y ALR AF +      ++SP       DTCY+ 
Sbjct: 325 LF-----ASPGTIIDSGTVITHLPTAAYEALRTAFQQEMLHCPSVSPPPQEKPLDTCYNL 379

Query: 172 S--SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS--SSLSIIGNVQ 227
                 ++++P +  HF     + L     L         C AFA  S  S ++IIGN Q
Sbjct: 380 KGCGGRNIKLPEIVLHFVGEVDVSLHPSGILWANGDLTQACLAFARKSHPSHVTIIGNRQ 439

Query: 228 QQGTRVSFNLRNSLIGF 244
           Q   +V +++    +GF
Sbjct: 440 QVSLKVVYDIEGGRLGF 456


>gi|226492334|ref|NP_001147965.1| pepsin A precursor [Zea mays]
 gi|195614874|gb|ACG29267.1| pepsin A [Zea mays]
          Length = 538

 Score = 74.3 bits (181), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 74/266 (27%), Positives = 120/266 (45%), Gaps = 18/266 (6%)

Query: 2   DFVTETVTLGS-ASVDNIAIGCGHNNEGLFVGAA-GLLGLGGGSLSFP---SQINASTFS 56
           +  T TV+ G  A +  + +GC     G  V A  G+L LG G +SF    ++     FS
Sbjct: 243 EKATVTVSDGRMAKLPGLILGCSVLEAGGSVDAHDGVLSLGNGEMSFAVHAAKRFGQRFS 302

Query: 57  YCLVDRDS--DSTSTLEF---DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPI 111
           +CL+  +S  D++S L F    + + P  +   ++ N ++   Y   +TGI VGG+ L I
Sbjct: 303 FCLLSANSSRDASSYLTFGPNPAVMGPGTMETDIVYNVDVKPAYGPLVTGIFVGGERLDI 362

Query: 112 SETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYDF 171
            +  +  ++   GG+I+D+ T+VT L  E Y A+  A  R    L     +  F+ CY +
Sbjct: 363 PQEIWDAEKVVGGGVILDTSTSVTSLVPEAYAAVTSALDRHLSHLPRVYELDGFEYCYRW 422

Query: 172 S-------SRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAP-TSSSLSII 223
           +          +V VP ++     G  L   AK+ ++P    G  C AF         I+
Sbjct: 423 TFAGDGVDLTHNVTVPRLTVEMAGGARLEPEAKSVVMPEVVPGVACLAFRKLPRGGPGIL 482

Query: 224 GNVQQQGTRVSFNLRNSLIGFTPNKC 249
           GNV  Q      +     + F  +KC
Sbjct: 483 GNVLMQEYIWEIDHGKGKMRFRKDKC 508


>gi|224146829|ref|XP_002336347.1| predicted protein [Populus trichocarpa]
 gi|222834772|gb|EEE73235.1| predicted protein [Populus trichocarpa]
          Length = 445

 Score = 74.3 bits (181), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 81/289 (28%), Positives = 124/289 (42%), Gaps = 47/289 (16%)

Query: 2   DFVTETVTLGS-ASVDNIAIGCGHNN--EGLFVGAAGLLGLGGGSLSFPSQINAS----- 53
           D++    TLGS +S+DN    C      +GL  G  GL  LG  +LS P QIN +     
Sbjct: 139 DYLALLNTLGSLSSIDNFIFSCARTGFLKGLAKGVTGLASLGNSNLSIPVQINKAFSSSP 198

Query: 54  -TFSYCLVDRDSDSTSTLEFDSSLPPN----------AVTAPLLRN----------HELD 92
             F+ CL    S     L F S  P N           +  PL+ N          H L 
Sbjct: 199 NCFAMCLSGSISQPGVAL-FGSKGPYNFLHGIDLSKSLLYTPLIFNPLGRDAVPNTHTLS 257

Query: 93  TFYYLGLTGISVGGDLLPISETAFKID-ESGNGGIIVDSGTAVTRLQTETYNALRDAFVR 151
             YY+GLT I V G ++  ++T   ID +SG+GG  + +    T+LQ+  Y A   AF+R
Sbjct: 258 PEYYVGLTAIKVNGKMVTFNKTLLAIDAQSGSGGTRISTVVPYTKLQSSIYKAFTLAFLR 317

Query: 152 GTRA----LSPTDGVALFDTCYDFSSRSSVE----VPTVSFHFPEGKVL-PLPAKNYLIP 202
              +    L+ T  V  F  CY  S+  + +    VP +        V+  +   N ++ 
Sbjct: 318 EAASSAFNLTTTKPVKPFSVCYPASAVKTTQMGPAVPIIELVLDRQDVVWKMFGSNSMMR 377

Query: 203 VDSNGT--FCFAF----APTSSSLSIIGNVQQQGTRVSFNLRNSLIGFT 245
           V       +C       A    S+ +IG +Q +   + F+L++  +GF+
Sbjct: 378 VTKKSVDLWCLGVVDGGAIDGPSI-MIGGLQLEDNLLQFDLQSKKLGFS 425


>gi|356539352|ref|XP_003538162.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 489

 Score = 74.3 bits (181), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 66/228 (28%), Positives = 106/228 (46%), Gaps = 23/228 (10%)

Query: 35  GLLGLGGGSLSFPSQINAS-----TFSYCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNH 89
           G+ G G   +S  SQ+++       FS+CL   D+     L     + PN V +PL+ + 
Sbjct: 220 GIFGFGQQGMSVISQLSSQGIAPRVFSHCL-KGDNSGGGVLVLGEIVEPNIVYSPLVPSQ 278

Query: 90  ELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAF 149
                Y L L  ISV G ++ I+ + F    S N G IVDSGT +  L  E YN     F
Sbjct: 279 P---HYNLNLQSISVNGQIVRIAPSVFA--TSNNRGTIVDSGTTLAYLAEEAYN----PF 329

Query: 150 VRGTRALSPTDGVALF---DTCYDFSSRSSVEV-PTVSFHFPEGKVLPLPAKNYLIP--- 202
           V    A+ P    ++    + CY  ++ S+V++ P VS +F  G  L L  ++YL+    
Sbjct: 330 VIAIAAVIPQSVRSVLSRGNQCYLITTSSNVDIFPQVSLNFAGGASLVLRPQDYLMQQNF 389

Query: 203 VDSNGTFCFAFAPTS-SSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           +     +C  F   S  S++I+G++  +     ++L    IG+    C
Sbjct: 390 IGEGSVWCIGFQKISGQSITILGDLVLKDKIFVYDLAGQRIGWANYDC 437


>gi|449440161|ref|XP_004137853.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
 gi|449521209|ref|XP_004167622.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 492

 Score = 73.9 bits (180), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 71/270 (26%), Positives = 116/270 (42%), Gaps = 29/270 (10%)

Query: 1   GDFVTETV----TLGSASVDN----IAIGCGHNNEGLFVGAA----GLLGLGGGSLSFPS 48
           G +V+E++     +G + + N    +  GC     G    +     G+ G G G LS  S
Sbjct: 176 GYYVSESMYFDMVMGQSMIANSSASVVFGCSTYQSGDLTKSDHAIDGIFGFGPGDLSVIS 235

Query: 49  QINA-----STFSYCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGIS 103
           Q++A       FS+CL   + +    L     L P  V +PL+ +      Y L L  IS
Sbjct: 236 QLSARGITPKVFSHCL-KGEGNGGGILVLGEVLEPGIVYSPLVPSQP---HYNLYLQSIS 291

Query: 104 VGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRG-TRALSPTDGV 162
           V G  LPI  + F    S N G I+DSGT +  L  E Y     A     +++++PT  +
Sbjct: 292 VNGQTLPIDPSVFA--TSINRGTIIDSGTTLAYLVEEAYTPFVSAITAAVSQSVTPT--I 347

Query: 163 ALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPV---DSNGTFCFAFAPTSSS 219
           +  + CY  S+      P VS +F     + L  + YL+ +   D    +C  F      
Sbjct: 348 SKGNQCYLVSTSVGEIFPLVSLNFAGSASMVLKPEEYLMHLGFYDGAALWCIGFQKVQEG 407

Query: 220 LSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           ++I+G++  +     ++L    IG+    C
Sbjct: 408 VTILGDLVMKDKIFVYDLARQRIGWASYDC 437


>gi|9759039|dbj|BAB09366.1| aspartyl protease-like [Arabidopsis thaliana]
          Length = 478

 Score = 73.6 bits (179), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 67/273 (24%), Positives = 114/273 (41%), Gaps = 35/273 (12%)

Query: 1   GDFVTETVTLGSAS--------VDNIAIGCGHNNEGLF----VGAAGLLGLGGGSLSFPS 48
           GDF+ + +TL   +           +  GCG N  G          G++G G  + S  S
Sbjct: 168 GDFIKDNITLEQVTGNLRTAPLAQEVVFGCGKNQSGQLGQTDSAVDGIMGFGQSNTSIIS 227

Query: 49  QINA-----STFSYCLVDRDSDSTSTL-EFDSSLPPNAVTAPLLRNHELDTFYYLGLTGI 102
           Q+ A       FS+CL + +      + E +S   P   T P++ N      Y + L G+
Sbjct: 228 QLAAGGSTKRIFSHCLDNMNGGGIFAVGEVES---PVVKTTPIVPNQ---VHYNVILKGM 281

Query: 103 SVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGV 162
            V GD  PI         +G+GG I+DSGT +  L    YN+L +      +       V
Sbjct: 282 DVDGD--PIDLPPSLASTNGDGGTIIDSGTTLAYLPQNLYNSLIEKITAKQQV--KLHMV 337

Query: 163 ALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFA------PT 216
                C+ F+S +    P V+ HF +   L +   +YL  +  +  +CF +         
Sbjct: 338 QETFACFSFTSNTDKAFPVVNLHFEDSLKLSVYPHDYLFSLRED-MYCFGWQSGGMTTQD 396

Query: 217 SSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
            + + ++G++      V ++L N +IG+  + C
Sbjct: 397 GADVILLGDLVLSNKLVVYDLENEVIGWADHNC 429


>gi|22165126|gb|AAM93742.1| putative chloroplast nucleoid DNA binding protein [Oryza sativa
           Japonica Group]
 gi|31433307|gb|AAP54836.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
 gi|125575547|gb|EAZ16831.1| hypothetical protein OsJ_32302 [Oryza sativa Japonica Group]
          Length = 405

 Score = 73.6 bits (179), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 68/269 (25%), Positives = 108/269 (40%), Gaps = 38/269 (14%)

Query: 9   TLGSASVDNIAIGCGHNNEGL------------FVGAAGLLGLGGGSLSFPSQINASTFS 56
           T G A  D  AIG      G               G +G++GLG    S  +Q+N + FS
Sbjct: 143 TGGKAGTDTFAIGAAKETLGFGCVVMTDKRLKTIGGPSGIVGLGRTPWSLVTQMNVTAFS 202

Query: 57  YCLVDRDSDS----TSTLEF----DSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDL 108
           YCL  + S +     +  +     +SS P    T+    ++  + +Y + L GI  GG  
Sbjct: 203 YCLAGKSSGALFLGATAKQLAGGKNSSTPFVIKTSAGSSDNGSNPYYMVKLAGIKTGGAP 262

Query: 109 LPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTC 168
           L       +   S    +++D+ +  + L    Y AL+ A                +D C
Sbjct: 263 L-------QAASSSGSTVLLDTVSRASYLADGAYKALKKALTAAVGVQPVASPPKPYDLC 315

Query: 169 YDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSS--------SL 220
             F    + + P + F F  G  L +P  NYL+    NGT C     ++S          
Sbjct: 316 --FPKAVAGDAPELVFTFDGGAALTVPPANYLL-ASGNGTVCLTIGSSASLNLTGELEGA 372

Query: 221 SIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           SI+G++QQ+   V F+L+   + F P  C
Sbjct: 373 SILGSLQQENVHVLFDLKEETLSFKPADC 401


>gi|30692930|ref|NP_198475.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|66792626|gb|AAY56415.1| At5g36260 [Arabidopsis thaliana]
 gi|332006680|gb|AED94063.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 482

 Score = 73.6 bits (179), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 67/273 (24%), Positives = 114/273 (41%), Gaps = 35/273 (12%)

Query: 1   GDFVTETVTLGSAS--------VDNIAIGCGHNNEGLF----VGAAGLLGLGGGSLSFPS 48
           GDF+ + +TL   +           +  GCG N  G          G++G G  + S  S
Sbjct: 172 GDFIKDNITLEQVTGNLRTAPLAQEVVFGCGKNQSGQLGQTDSAVDGIMGFGQSNTSIIS 231

Query: 49  QINA-----STFSYCLVDRDSDSTSTL-EFDSSLPPNAVTAPLLRNHELDTFYYLGLTGI 102
           Q+ A       FS+CL + +      + E +S   P   T P++ N      Y + L G+
Sbjct: 232 QLAAGGSTKRIFSHCLDNMNGGGIFAVGEVES---PVVKTTPIVPNQ---VHYNVILKGM 285

Query: 103 SVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGV 162
            V GD  PI         +G+GG I+DSGT +  L    YN+L +      +       V
Sbjct: 286 DVDGD--PIDLPPSLASTNGDGGTIIDSGTTLAYLPQNLYNSLIEKITAKQQV--KLHMV 341

Query: 163 ALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFA------PT 216
                C+ F+S +    P V+ HF +   L +   +YL  +  +  +CF +         
Sbjct: 342 QETFACFSFTSNTDKAFPVVNLHFEDSLKLSVYPHDYLFSLRED-MYCFGWQSGGMTTQD 400

Query: 217 SSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
            + + ++G++      V ++L N +IG+  + C
Sbjct: 401 GADVILLGDLVLSNKLVVYDLENEVIGWADHNC 433


>gi|53791672|dbj|BAD53242.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
           Group]
          Length = 504

 Score = 73.6 bits (179), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 70/248 (28%), Positives = 109/248 (43%), Gaps = 24/248 (9%)

Query: 17  NIAIGCGHNNEGLFV----GAAGLLGLGGGSLSFPSQINA-----STFSYCLVDRDSDST 67
           +I  GC ++  G          G+ G G   LS  SQ+N+       FS+CL   D +  
Sbjct: 213 SIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCLKGSD-NGG 271

Query: 68  STLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGII 127
             L     + P  V  PL+ +      Y L L  I V G  LPI  + F    S   G I
Sbjct: 272 GILVLGEIVEPGLVYTPLVPSQP---HYNLNLESIVVNGQKLPIDSSLFTT--SNTQGTI 326

Query: 128 VDSGTAVTRLQTETYNALRDAFVRGTRALSPT--DGVALFDTCYDFSSRSSVEVPTVSFH 185
           VDSGT +  L    Y+   +A    T A+SP+    V+  + C+  SS      PTVS +
Sbjct: 327 VDSGTTLAYLADGAYDPFVNAI---TAAVSPSVRSLVSKGNQCFVTSSSVDSSFPTVSLY 383

Query: 186 FPEGKVLPLPAKNYLIP---VDSNGTFCFAFAPTS-SSLSIIGNVQQQGTRVSFNLRNSL 241
           F  G  + +  +NYL+    +D+N  +C  +       ++I+G++  +     ++L N  
Sbjct: 384 FMGGVAMTVKPENYLLQQASIDNNVLWCIGWQRNQGQQITILGDLVLKDKIFVYDLANMR 443

Query: 242 IGFTPNKC 249
           +G+T   C
Sbjct: 444 MGWTDYDC 451


>gi|297805182|ref|XP_002870475.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297316311|gb|EFH46734.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 480

 Score = 73.6 bits (179), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 68/273 (24%), Positives = 114/273 (41%), Gaps = 35/273 (12%)

Query: 1   GDFVTETVTLGSAS--------VDNIAIGCGHNNEGLF----VGAAGLLGLGGGSLSFPS 48
           GDFV + +TL   +           +  GCG N  G          G++G G  + S  S
Sbjct: 171 GDFVKDNITLDQVTGNLRTAPLAQEVVFGCGKNQSGQLGQTESAVDGIMGFGQSNTSVIS 230

Query: 49  QINA-----STFSYCLVDRDSDSTSTL-EFDSSLPPNAVTAPLLRNHELDTFYYLGLTGI 102
           Q+ A       FS+CL + +      + E +S   P   T PL+ N      Y + L G+
Sbjct: 231 QLAAGGSVKRIFSHCLDNMNGGGIFAIGEVES---PVVKTTPLVPNQ---VHYNVILKGM 284

Query: 103 SVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGV 162
            V G+  PI         +G+GG I+DSGT +  L    YN+L +      +       V
Sbjct: 285 DVDGE--PIDLPPSLASTNGDGGTIIDSGTTLAYLPQNLYNSLIEKITAKQQV--KLHMV 340

Query: 163 ALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFA------PT 216
                C+ F+S +    P V+ HF +   L +   +YL  +  +  +CF +         
Sbjct: 341 QETFACFSFTSNTDKAFPVVNLHFEDSLKLSVYPHDYLFSLRED-MYCFGWQSGGMTTQD 399

Query: 217 SSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
            + + ++G++      V ++L N +IG+  + C
Sbjct: 400 GADVILLGDLVLSNKLVVYDLENEVIGWADHNC 432


>gi|42571079|ref|NP_973613.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|110737616|dbj|BAF00749.1| putative protease [Arabidopsis thaliana]
 gi|330254187|gb|AEC09281.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 507

 Score = 73.6 bits (179), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 73/269 (27%), Positives = 110/269 (40%), Gaps = 27/269 (10%)

Query: 1   GDFVTET----VTLGSASVDN----IAIGCGHNNEGLFV----GAAGLLGLGGGSLSFPS 48
           G ++T+T      LG + V N    I  GC     G          G+ G G G LS  S
Sbjct: 196 GYYMTDTFYFDAILGESLVANSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVS 255

Query: 49  QINAS-----TFSYCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGIS 103
           Q+++       FS+CL   D            L P  V +PL+ +      Y L L  I 
Sbjct: 256 QLSSRGITPPVFSHCL-KGDGSGGGVFVLGEILVPGMVYSPLVPSQP---HYNLNLLSIG 311

Query: 104 VGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVA 163
           V G +LP+    F  + S   G IVD+GT +T L  E Y+   +A       L  T  ++
Sbjct: 312 VNGQMLPLDAAVF--EASNTRGTIVDTGTTLTYLVKEAYDLFLNAISNSVSQLV-TPIIS 368

Query: 164 LFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIP---VDSNGTFCFAFAPTSSSL 220
             + CY  S+  S   P+VS +F  G  + L  ++YL      D    +C  F       
Sbjct: 369 NGEQCYLVSTSISDMFPSVSLNFAGGASMMLRPQDYLFHYGIYDGASMWCIGFQKAPEEQ 428

Query: 221 SIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           +I+G++  +     ++L    IG+    C
Sbjct: 429 TILGDLVLKDKVFVYDLARQRIGWASYDC 457


>gi|218189149|gb|EEC71576.1| hypothetical protein OsI_03949 [Oryza sativa Indica Group]
          Length = 504

 Score = 73.6 bits (179), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 70/248 (28%), Positives = 109/248 (43%), Gaps = 24/248 (9%)

Query: 17  NIAIGCGHNNEGLFV----GAAGLLGLGGGSLSFPSQINA-----STFSYCLVDRDSDST 67
           +I  GC ++  G          G+ G G   LS  SQ+N+       FS+CL   D +  
Sbjct: 213 SIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCLKGSD-NGG 271

Query: 68  STLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGII 127
             L     + P  V  PL+ +      Y L L  I V G  LPI  + F    S   G I
Sbjct: 272 GILVLGEIVEPGLVYTPLVPSQP---HYNLNLESIVVNGQKLPIDSSLFTT--SNTQGTI 326

Query: 128 VDSGTAVTRLQTETYNALRDAFVRGTRALSPT--DGVALFDTCYDFSSRSSVEVPTVSFH 185
           VDSGT +  L    Y+   +A    T A+SP+    V+  + C+  SS      PTVS +
Sbjct: 327 VDSGTTLAYLADGAYDPFVNAI---TAAVSPSVRSLVSKGNQCFVTSSSVDSSFPTVSLY 383

Query: 186 FPEGKVLPLPAKNYLIP---VDSNGTFCFAFAPTS-SSLSIIGNVQQQGTRVSFNLRNSL 241
           F  G  + +  +NYL+    +D+N  +C  +       ++I+G++  +     ++L N  
Sbjct: 384 FMGGVAMTVKPENYLLQQASIDNNVLWCIGWQRNQGQQITILGDLVLKDKIFVYDLANMR 443

Query: 242 IGFTPNKC 249
           +G+T   C
Sbjct: 444 MGWTDYDC 451


>gi|302756119|ref|XP_002961483.1| hypothetical protein SELMODRAFT_76765 [Selaginella moellendorffii]
 gi|300170142|gb|EFJ36743.1| hypothetical protein SELMODRAFT_76765 [Selaginella moellendorffii]
          Length = 388

 Score = 73.6 bits (179), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 66/233 (28%), Positives = 102/233 (43%), Gaps = 26/233 (11%)

Query: 35  GLLGLGGGSLSFPSQINAS-----TFSYCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNH 89
           G++G G   LS P+Q+ A       FS+CL + +      L       P     PL+ + 
Sbjct: 144 GIIGFGQLELSVPNQLAAQQNIPRVFSHCL-EGEKRGGGILVIGGIAEPGMTYTPLVPD- 201

Query: 90  ELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAF 149
                Y + L GISV  + LPI    F    + + G+I+DSGT +    +  YN    A 
Sbjct: 202 --SVHYNVVLRGISVNSNRLPIDAEDFS--STNDTGVIMDSGTTLAYFPSGAYNVFVQAI 257

Query: 150 VRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLI-----PVD 204
              T A +P     +   C+  S R S   P V+ +F EG  + L   NYL+     P  
Sbjct: 258 REATSA-TPVRVQGMDTQCFLVSGRLSDLFPNVTLNF-EGGAMELQPDNYLMWGGTAPTG 315

Query: 205 SNGTFCFAFAPTSSS--------LSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           +   +C  +  +SSS        L+I+G++  +   V ++L NS IG+    C
Sbjct: 316 TTDVWCIGWQSSSSSAGPKDGSQLTILGDIVLKDKLVVYDLDNSRIGWMSYNC 368


>gi|356537015|ref|XP_003537027.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 476

 Score = 73.6 bits (179), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 65/223 (29%), Positives = 96/223 (43%), Gaps = 14/223 (6%)

Query: 35  GLLGLGGGSLSFPSQ-----INASTFSYCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNH 89
           G+ G G G LS  SQ     I    FS+CL   D +    L     L P+ V +PL+ + 
Sbjct: 211 GIFGFGPGPLSVVSQLSSQGITPKVFSHCL-KGDGNGGGILVLGEILEPSIVYSPLVPSQ 269

Query: 90  ELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAF 149
                Y L L  I+V G  LPI+   F I  +  GG IVD GT +  L  E Y+ L  A 
Sbjct: 270 P---HYNLNLQSIAVNGQPLPINPAVFSISNN-RGGTIVDCGTTLAYLIQEAYDPLVTA- 324

Query: 150 VRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIP---VDSN 206
           +    + S     +  + CY  S+      P VS +F  G  + L  + YL+    +D  
Sbjct: 325 INTAVSQSARQTNSKGNQCYLVSTSIGDIFPLVSLNFEGGASMVLKPEQYLMHNGYLDGA 384

Query: 207 GTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
             +C  F       SI+G++  +   V +++    IG+    C
Sbjct: 385 EMWCVGFQKLQEGASILGDLVLKDKIVVYDIAQQRIGWANYDC 427


>gi|18408451|ref|NP_564867.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|12322615|gb|AAG51309.1|AC026480_16 unknown protein [Arabidopsis thaliana]
 gi|14334808|gb|AAK59582.1| unknown protein [Arabidopsis thaliana]
 gi|15293195|gb|AAK93708.1| unknown protein [Arabidopsis thaliana]
 gi|332196351|gb|AEE34472.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 430

 Score = 73.6 bits (179), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 80/273 (29%), Positives = 119/273 (43%), Gaps = 34/273 (12%)

Query: 1   GDFVTETVTLGSASV-DNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
           G+ V E +T  +  +   + +GC   +        G+LG+  G LSF SQ   S FSYC+
Sbjct: 164 GNLVKEKITFSNTEITPPLILGCATESSD----DRGILGMNRGRLSFVSQAKISKFSYCI 219

Query: 60  VDRDSDS--TSTLEFDSSLPPNA--------VTAPL-LRNHELDTFYY-LGLTGISVGGD 107
             + +    T T  F     PN+        +T P   R   LD   Y + + GI  G  
Sbjct: 220 PPKSNRPGFTPTGSFYLGDNPNSHGFKYVSLLTFPESQRMPNLDPLAYTVPMIGIRFGLK 279

Query: 108 LLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALR-DAFVRGTRALSP---TDGVA 163
            L IS + F+ D  G+G  +VDSG+  T L    Y+ +R +   R  R L       G A
Sbjct: 280 KLNISGSVFRPDAGGSGQTMVDSGSEFTHLVDAAYDKVRAEIMTRVGRRLKKGYVYGGTA 339

Query: 164 LFDTCYDFSSRSSVEVPT----VSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS-- 217
             D C+D    +   +P     + F F  G  + +P +  L+ V   G  C     +S  
Sbjct: 340 --DMCFD---GNVAMIPRLIGDLVFVFTRGVEILVPKERVLVNV-GGGIHCVGIGRSSML 393

Query: 218 -SSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
            ++ +IIGNV QQ   V F++ N  +GF    C
Sbjct: 394 GAASNIIGNVHQQNLWVEFDVTNRRVGFAKADC 426


>gi|222619345|gb|EEE55477.1| hypothetical protein OsJ_03658 [Oryza sativa Japonica Group]
          Length = 530

 Score = 73.6 bits (179), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 70/248 (28%), Positives = 109/248 (43%), Gaps = 24/248 (9%)

Query: 17  NIAIGCGHNNEGLFV----GAAGLLGLGGGSLSFPSQINA-----STFSYCLVDRDSDST 67
           +I  GC ++  G          G+ G G   LS  SQ+N+       FS+CL   D +  
Sbjct: 239 SIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCLKGSD-NGG 297

Query: 68  STLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGII 127
             L     + P  V  PL+ +      Y L L  I V G  LPI  + F    S   G I
Sbjct: 298 GILVLGEIVEPGLVYTPLVPSQP---HYNLNLESIVVNGQKLPIDSSLFTT--SNTQGTI 352

Query: 128 VDSGTAVTRLQTETYNALRDAFVRGTRALSPT--DGVALFDTCYDFSSRSSVEVPTVSFH 185
           VDSGT +  L    Y+   +A    T A+SP+    V+  + C+  SS      PTVS +
Sbjct: 353 VDSGTTLAYLADGAYDPFVNAI---TAAVSPSVRSLVSKGNQCFVTSSSVDSSFPTVSLY 409

Query: 186 FPEGKVLPLPAKNYLIP---VDSNGTFCFAFAPTS-SSLSIIGNVQQQGTRVSFNLRNSL 241
           F  G  + +  +NYL+    +D+N  +C  +       ++I+G++  +     ++L N  
Sbjct: 410 FMGGVAMTVKPENYLLQQASIDNNVLWCIGWQRNQGQQITILGDLVLKDKIFVYDLANMR 469

Query: 242 IGFTPNKC 249
           +G+T   C
Sbjct: 470 MGWTDYDC 477


>gi|383130044|gb|AFG45742.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
          Length = 155

 Score = 73.2 bits (178), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 43/121 (35%), Positives = 56/121 (46%), Gaps = 2/121 (1%)

Query: 92  DTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR 151
           +TFYY+ L G+S+G   L +    F  D  GNGG I+DSGT  T    E Y  +  AF  
Sbjct: 32  NTFYYIDLRGVSIGRKRLNLPSKLFSFDNKGNGGTIIDSGTTFTIFNEEFYKNITAAFAS 91

Query: 152 --GTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTF 209
             G R  S  +       CY+ S    V +P  +FHF  G  + LP  NY     S  + 
Sbjct: 92  QIGFRRASEVEARTGMRLCYNASGVDHVLLPDFAFHFKGGSDMVLPVANYFSYFVSFDSI 151

Query: 210 C 210
           C
Sbjct: 152 C 152


>gi|297827153|ref|XP_002881459.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297327298|gb|EFH57718.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 507

 Score = 73.2 bits (178), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 75/269 (27%), Positives = 109/269 (40%), Gaps = 27/269 (10%)

Query: 1   GDFVTET----VTLGSASVDN----IAIGCGHNNEGLFV----GAAGLLGLGGGSLSFPS 48
           G ++T+T      LG + V N    I  GC     G          G+ G G G LS  S
Sbjct: 196 GYYMTDTFYFDAILGESLVANSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVS 255

Query: 49  QINAS-----TFSYCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGIS 103
           Q+++       FS+CL   D            L P  V +PLL +      Y L L  I 
Sbjct: 256 QLSSRGITPPVFSHCL-KGDGSGGGVFVLGEILVPGMVYSPLLPSQP---HYNLNLLSIG 311

Query: 104 VGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVA 163
           V G +LPI    F  + S   G IVD+GT +T L  E Y+   +A       L  T  ++
Sbjct: 312 VNGQILPIDAAVF--EASNTRGTIVDTGTTLTYLVKEAYDPFLNAISNSVSQLV-TLIIS 368

Query: 164 LFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIP---VDSNGTFCFAFAPTSSSL 220
             + CY  S+  S   P VS +F  G  + L  ++YL      D    +C  F       
Sbjct: 369 NGEQCYLVSTSISDMFPPVSLNFAGGASMMLRPQDYLFHYGFYDGASMWCIGFQKAPEEQ 428

Query: 221 SIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           +I+G++  +     ++L    IG+    C
Sbjct: 429 TILGDLVLKDKVFVYDLARQRIGWANYDC 457


>gi|4415912|gb|AAD20143.1| putative protease [Arabidopsis thaliana]
          Length = 469

 Score = 73.2 bits (178), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 73/269 (27%), Positives = 110/269 (40%), Gaps = 27/269 (10%)

Query: 1   GDFVTET----VTLGSASVDN----IAIGCGHNNEGLFV----GAAGLLGLGGGSLSFPS 48
           G ++T+T      LG + V N    I  GC     G          G+ G G G LS  S
Sbjct: 196 GYYMTDTFYFDAILGESLVANSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVS 255

Query: 49  QINAS-----TFSYCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGIS 103
           Q+++       FS+CL   D            L P  V +PL+ +      Y L L  I 
Sbjct: 256 QLSSRGITPPVFSHCL-KGDGSGGGVFVLGEILVPGMVYSPLVPSQP---HYNLNLLSIG 311

Query: 104 VGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVA 163
           V G +LP+    F  + S   G IVD+GT +T L  E Y+   +A       L  T  ++
Sbjct: 312 VNGQMLPLDAAVF--EASNTRGTIVDTGTTLTYLVKEAYDLFLNAISNSVSQLV-TPIIS 368

Query: 164 LFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIP---VDSNGTFCFAFAPTSSSL 220
             + CY  S+  S   P+VS +F  G  + L  ++YL      D    +C  F       
Sbjct: 369 NGEQCYLVSTSISDMFPSVSLNFAGGASMMLRPQDYLFHYGIYDGASMWCIGFQKAPEEQ 428

Query: 221 SIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           +I+G++  +     ++L    IG+    C
Sbjct: 429 TILGDLVLKDKVFVYDLARQRIGWASYDC 457


>gi|125575539|gb|EAZ16823.1| hypothetical protein OsJ_32295 [Oryza sativa Japonica Group]
          Length = 383

 Score = 73.2 bits (178), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 69/272 (25%), Positives = 115/272 (42%), Gaps = 33/272 (12%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFV--GAAGLLGLGGGSLSFPSQINASTFSYC 58
           G   T+ V +G+A+  ++A GC   ++   +  G +G +GL    LS  +Q+N + FS+C
Sbjct: 116 GKIGTDAVAIGTATAASVAFGCVMASDIKLMDGGPSGFVGLARTPLSLVAQMNVTAFSHC 175

Query: 59  LVDRDSDSTSTLEF--------DSSLPPNAVTAPLLRN--HELDTFYYL-GLTGISVGGD 107
           L   D                        A+T P +++   ++ + YYL  L GI  G  
Sbjct: 176 LAPHDGGGGKNSRLFLGAAAKLAGGGKSAAMTTPFVKSSPDDIKSLYYLINLEGIKAG-- 233

Query: 108 LLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAF---VRGTRALSPTDGVAL 164
                E    + +SG   +++ + + V+ L    Y  L+ A    V G  A  P    ++
Sbjct: 234 ----DEAIITVPQSGR-TVLLQTFSPVSFLVDGVYQDLKKAVTAAVGGPTATPPEQFQSI 288

Query: 165 FDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS------- 217
           FD C+     S    P V   F     L +P  NYL+ V  + T C A A ++       
Sbjct: 289 FDLCFKRGGVSG--APDVVLTFQGAAALTVPPTNYLLDVGDD-TVCVAIASSARLNSTEV 345

Query: 218 SSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           + +SI+G +QQQ     ++L    + F    C
Sbjct: 346 AGMSILGGLQQQNVHFLYDLEKETLSFEAADC 377


>gi|194699670|gb|ACF83919.1| unknown [Zea mays]
          Length = 102

 Score = 73.2 bits (178), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 36/93 (38%), Positives = 51/93 (54%), Gaps = 1/93 (1%)

Query: 158 PTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS 217
           P     +   CY+ S     EVP +S  F +G V   PA+NY I +D +G  C A   T 
Sbjct: 7   PVPDFPVLSPCYNVSGVERPEVPELSLLFADGAVWDFPAENYFIRLDPDGIMCLAVLGTP 66

Query: 218 SS-LSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
            + +SIIGN QQQ   V+++L N+ +GF P +C
Sbjct: 67  RTGMSIIGNFQQQNFHVAYDLHNNRLGFAPRRC 99


>gi|42569679|ref|NP_181205.2| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|330254186|gb|AEC09280.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 512

 Score = 73.2 bits (178), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 73/269 (27%), Positives = 110/269 (40%), Gaps = 27/269 (10%)

Query: 1   GDFVTET----VTLGSASVDN----IAIGCGHNNEGLFV----GAAGLLGLGGGSLSFPS 48
           G ++T+T      LG + V N    I  GC     G          G+ G G G LS  S
Sbjct: 201 GYYMTDTFYFDAILGESLVANSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVS 260

Query: 49  QINAS-----TFSYCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGIS 103
           Q+++       FS+CL   D            L P  V +PL+ +      Y L L  I 
Sbjct: 261 QLSSRGITPPVFSHCL-KGDGSGGGVFVLGEILVPGMVYSPLVPSQP---HYNLNLLSIG 316

Query: 104 VGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVA 163
           V G +LP+    F  + S   G IVD+GT +T L  E Y+   +A       L  T  ++
Sbjct: 317 VNGQMLPLDAAVF--EASNTRGTIVDTGTTLTYLVKEAYDLFLNAISNSVSQLV-TPIIS 373

Query: 164 LFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIP---VDSNGTFCFAFAPTSSSL 220
             + CY  S+  S   P+VS +F  G  + L  ++YL      D    +C  F       
Sbjct: 374 NGEQCYLVSTSISDMFPSVSLNFAGGASMMLRPQDYLFHYGIYDGASMWCIGFQKAPEEQ 433

Query: 221 SIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           +I+G++  +     ++L    IG+    C
Sbjct: 434 TILGDLVLKDKVFVYDLARQRIGWASYDC 462


>gi|291002744|gb|ADD71504.1| xyloglucanase inhibitor 2 [Humulus lupulus]
          Length = 445

 Score = 73.2 bits (178), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 68/274 (24%), Positives = 114/274 (41%), Gaps = 44/274 (16%)

Query: 14  SVDNIAIGCGHNN--EGLFVGAAGLLGLGGGSLSFPSQINAS-----TFSYCLVDRDSDS 66
           S  N+   CG     EGL  G  G+ GLG   ++ PSQ  A+      F+ CL    + +
Sbjct: 155 SFPNVIFTCGSTFLLEGLASGVTGIAGLGRKKIALPSQFAAAFSFKRKFALCL-SSSTRA 213

Query: 67  TSTLEF---------DSSLPPNAVTAPLLRNH----------ELDTFYYLGLTGISVGGD 107
           T  + F         +  +  N +  PL+ N           E    Y++G+ GI V G+
Sbjct: 214 TGVVFFGDGPYIMLPNKDVSQNLIYTPLILNPVSTAGASFEGEPSADYFIGVKGIKVNGE 273

Query: 108 LLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDT 167
            + ++ +   I + G GG  + +    T L+T  Y A+  AF +    +     VA F+ 
Sbjct: 274 DVKLNTSLLSIAKDGTGGTKISTTQPYTSLETSIYKAVIGAFGKAVAKVPRVTAVAPFEL 333

Query: 168 CYDFSSRSSVE----VPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFA--------- 214
           C++ +S SS      VP +    P  K   +   N ++ V S+   C  F          
Sbjct: 334 CFNSTSFSSTRVGPGVPQIDLVLPNNKAWTIFGANSMVQV-SDDVLCLGFVDGGPLHFVD 392

Query: 215 ---PTSSSLSIIGNVQQQGTRVSFNLRNSLIGFT 245
              P + +  +IG  Q +   + F+L +S +GF+
Sbjct: 393 WGIPFTPTAIVIGGHQIEDNLLQFDLGSSTLGFS 426


>gi|357443039|ref|XP_003591797.1| Xyloglucan-specific endoglucanase inhibitor protein [Medicago
           truncatula]
 gi|355480845|gb|AES62048.1| Xyloglucan-specific endoglucanase inhibitor protein [Medicago
           truncatula]
          Length = 436

 Score = 73.2 bits (178), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 72/277 (25%), Positives = 112/277 (40%), Gaps = 48/277 (17%)

Query: 13  ASVDNIAIGCGHN--NEGLFVGAAGLLGLGGGSLSFPSQINAS-----TFSYCLVDRDS- 64
            SV N    CG N    GL  G  G+ GLG   +S PSQ +++      F+ CL  ++  
Sbjct: 144 VSVPNFLFICGSNVVQNGLAKGVKGMAGLGRTKVSLPSQFSSAFSFKNKFAICLGTQNGV 203

Query: 65  ----DSTSTLEFDSSLPPNAVTAPLLRNH----------ELDTFYYLGLTGISVGGDLLP 110
               D      FD S   N +  PL+ N           E    Y++G+  I V    + 
Sbjct: 204 LFFGDGPYLFNFDES--KNLIYTPLITNPVSTSPSSFLGEKSVEYFIGVKSIRVSSKNVK 261

Query: 111 ISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFDTCYD 170
           ++ T   ID++G GG  + +    T ++T  Y A+ DAFV+    +S  + VA F TC+ 
Sbjct: 262 LNTTLLSIDQNGFGGTKISTVNPYTIMETSIYKAVADAFVKALN-VSTVEPVAPFGTCFA 320

Query: 171 ----FSSRSSVEVPTVSFHFP-EGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLS---- 221
                SSR   +VP++      E  V  +   N ++ ++     C  F    S  +    
Sbjct: 321 SQSISSSRMGPDVPSIDLVLQNENVVWNIIGANAMVRINDKDVICLGFVDAGSDFAKTSQ 380

Query: 222 --------------IIGNVQQQGTRVSFNLRNSLIGF 244
                          IG  Q +   + F+L  S +GF
Sbjct: 381 VGFVVGGSKPMTSITIGAHQLENNLLQFDLATSRLGF 417


>gi|21717171|gb|AAM76364.1|AC074196_22 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433290|gb|AAP54828.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|125532789|gb|EAY79354.1| hypothetical protein OsI_34483 [Oryza sativa Indica Group]
          Length = 382

 Score = 73.2 bits (178), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 69/272 (25%), Positives = 115/272 (42%), Gaps = 33/272 (12%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFV--GAAGLLGLGGGSLSFPSQINASTFSYC 58
           G   T+ V +G+A+  ++A GC   ++   +  G +G +GL    LS  +Q+N + FS+C
Sbjct: 115 GKIGTDAVAIGTATAASVAFGCVMASDIKLMDGGPSGFVGLARTPLSLVAQMNVTAFSHC 174

Query: 59  LVDRDSDSTSTLEF--------DSSLPPNAVTAPLLRN--HELDTFYYL-GLTGISVGGD 107
           L   D                        A+T P +++   ++ + YYL  L GI  G  
Sbjct: 175 LAPHDGGGGKNSRLFLGAAAKLAGGGKSAAMTTPFVKSSPDDIKSLYYLINLEGIKAG-- 232

Query: 108 LLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAF---VRGTRALSPTDGVAL 164
                E    + +SG   +++ + + V+ L    Y  L+ A    V G  A  P    ++
Sbjct: 233 ----DEAIITVPQSGR-TVLLQTFSPVSFLVDGVYQDLKKAVTAAVGGPTATPPEQFQSI 287

Query: 165 FDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS------- 217
           FD C+     S    P V   F     L +P  NYL+ V  + T C A A ++       
Sbjct: 288 FDLCFKRGGVSG--APDVVLTFQGAAALTVPPTNYLLDVGDD-TVCVAIASSARLNSTEV 344

Query: 218 SSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           + +SI+G +QQQ     ++L    + F    C
Sbjct: 345 AGMSILGGLQQQNVHFLYDLEKETLSFEAADC 376


>gi|383130042|gb|AFG45741.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
          Length = 155

 Score = 73.2 bits (178), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 43/120 (35%), Positives = 55/120 (45%), Gaps = 2/120 (1%)

Query: 93  TFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR- 151
           TFYY+ L G+S+G   L +    F  D  GNGG I+DSGT  T    E Y  +  AF   
Sbjct: 33  TFYYIDLRGVSIGRKRLNLPSKLFSFDSKGNGGTIIDSGTTFTIFNEEFYKNITAAFASQ 92

Query: 152 -GTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFC 210
            G R  S  +       CY+ S    V +P  +FHF  G  + LP  NY     S  + C
Sbjct: 93  IGFRRASEVEARTGMRLCYNVSGVDHVLLPDFAFHFKGGSDMVLPVANYFSYFVSFDSIC 152


>gi|302776054|ref|XP_002971323.1| hypothetical protein SELMODRAFT_63598 [Selaginella moellendorffii]
 gi|300161305|gb|EFJ27921.1| hypothetical protein SELMODRAFT_63598 [Selaginella moellendorffii]
          Length = 395

 Score = 73.2 bits (178), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 66/233 (28%), Positives = 102/233 (43%), Gaps = 26/233 (11%)

Query: 35  GLLGLGGGSLSFPSQINAS-----TFSYCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNH 89
           G++G G   LS P+Q+ A       FS+CL + +      L       P     PL+ + 
Sbjct: 171 GIIGFGQLELSVPNQLAAQQNIPRVFSHCL-EGEKRGGGILVIGGIAEPGMTYTPLVPDS 229

Query: 90  ELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAF 149
                Y + L GISV  + LPI    F    + + G+I+DSGT +    +  YN    A 
Sbjct: 230 ---VHYNVVLRGISVNSNRLPIDAEDFS--STNDTGVIMDSGTTLAYFPSGAYNVFVQAI 284

Query: 150 VRGTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLI-----PVD 204
              T A +P     +   C+  S R S   P V+ +F EG  + L   NYL+     P  
Sbjct: 285 REATSA-TPVRVQGMDTQCFLVSGRLSDLFPNVTLNF-EGGAMELQPDNYLMWGGTAPTG 342

Query: 205 SNGTFCFAFAPTSSS--------LSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           +   +C  +  +SSS        L+I+G++  +   V ++L NS IG+    C
Sbjct: 343 TTDVWCIGWQSSSSSAGPKDGSQLTILGDIVLKDKLVVYDLDNSRIGWMSYNC 395


>gi|357440781|ref|XP_003590668.1| Basic 7S globulin [Medicago truncatula]
 gi|355479716|gb|AES60919.1| Basic 7S globulin [Medicago truncatula]
          Length = 434

 Score = 73.2 bits (178), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 70/252 (27%), Positives = 111/252 (44%), Gaps = 39/252 (15%)

Query: 27  EGLFVGAAGLLGLGGGSLSFPSQIN-----ASTFSYCLVDRDSDSTSTLEFDS---SLPP 78
           EGL  GA+G+ GLG   L+ PSQ+      A  F+ CL    S S   + F        P
Sbjct: 170 EGLASGASGMAGLGRNKLALPSQLASAFSFAKKFAICL----SSSKGVVLFGDGPYGFLP 225

Query: 79  NAV-------TAPLLRN---------HELDTFYYLGLTGISVGGDLLPISETAFKIDES- 121
           N V         PLL N          E    Y++G+  I + G ++ +  +   ID S 
Sbjct: 226 NVVFDSKSLTYTPLLINPFSTAAFAKSEPSAEYFIGVKTIKIDGKVVSLDTSLLSIDSSN 285

Query: 122 GNGGIIVDSGTAVTRLQTETYNALRDAFVRGT--RALSPTDGVALFDTCYD--FSSRSSV 177
           G GG  + +    T L+   Y A+ DAFV+ +  R +   D VA F+ CY     +R   
Sbjct: 286 GAGGTKISTVDPYTVLEASIYKAVTDAFVKASAARNIKRVDSVAPFEFCYTNVTGTRLGA 345

Query: 178 EVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFA----PTSSSLSIIGNVQQQGTRV 233
           +VPT+  +     +  +   N ++ ++ +   C  F      T +S+ +IG  Q +   +
Sbjct: 346 DVPTIELYLQNNVIWRIFGANSMVNIN-DEVLCLGFVIGGENTWASI-VIGGYQLENNLL 403

Query: 234 SFNLRNSLIGFT 245
            F+L  S +GF+
Sbjct: 404 QFDLAASKLGFS 415


>gi|302783112|ref|XP_002973329.1| hypothetical protein SELMODRAFT_413603 [Selaginella moellendorffii]
 gi|300159082|gb|EFJ25703.1| hypothetical protein SELMODRAFT_413603 [Selaginella moellendorffii]
          Length = 437

 Score = 73.2 bits (178), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 77/257 (29%), Positives = 118/257 (45%), Gaps = 31/257 (12%)

Query: 11  GSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN-----ASTFSYCLVDRDSD 65
           G+A+  +I  GC  N  G +  A G++G G  S + P+QI      +  FS+CL   +  
Sbjct: 192 GNATTSHIFFGCAINITGSWP-ADGIMGFGQISKTVPNQIATQRNMSRVFSHCL-GGEKH 249

Query: 66  STSTLEFDSSLPPNA---VTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKI--DE 120
               LEF     PN    V  PLL    + T Y + L  ISV   +LPI    F    + 
Sbjct: 250 GGGILEFGEE--PNTTEMVFTPLL---NVTTHYNVDLLSISVNSKVLPIDSKEFSYVSNS 304

Query: 121 SGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRA-LSPT-DGVALFDTCYDFSSRSSVE 178
           +   G+I+DSGT+   L T+    L       T A L P  +G+     C+   S  +VE
Sbjct: 305 TNETGVIIDSGTSFALLATKANRILFSEIKNLTTAKLGPKLEGLQ----CFYLKSGLTVE 360

Query: 179 V--PTVSFHFPEGKVLPLPAKNYLIPVD----SNGTFCFAFAPTSSSLSIIGNVQQQGTR 232
              P V+  F  G  + L   NYL+ V+     NG +C+A++ ++  L+I G +  +   
Sbjct: 361 TSFPNVTLTFSGGSTMKLKPDNYLVMVELKKKRNG-YCYAWS-SADGLTIFGEIVLKDKL 418

Query: 233 VSFNLRNSLIGFTPNKC 249
           V +++ N  IG+    C
Sbjct: 419 VFYDVENRRIGWKGQNC 435


>gi|21618176|gb|AAM67226.1| unknown [Arabidopsis thaliana]
          Length = 430

 Score = 73.2 bits (178), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 80/273 (29%), Positives = 119/273 (43%), Gaps = 34/273 (12%)

Query: 1   GDFVTETVTLGSASV-DNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCL 59
           G+ V E +T  +  +   + +GC   +        G+LG+  G LSF SQ   S FSYC+
Sbjct: 164 GNLVKEKITFSNTEITPPLILGCATESSD----DRGILGMNRGRLSFVSQAKISKFSYCI 219

Query: 60  VDRDSDS--TSTLEFDSSLPPNA--------VTAPL-LRNHELDTFYY-LGLTGISVGGD 107
             + +    T T  F     PN+        +T P   R   LD   Y + + GI  G  
Sbjct: 220 PPKSNRPGFTPTGSFYLGDNPNSHGFKYVSLLTFPESQRMPNLDPLAYTVPMIGIRFGLK 279

Query: 108 LLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALR-DAFVRGTRALSP---TDGVA 163
            L IS + F+ D  G+G  +VDSG+  T L    Y+ +R +   R  R L       G A
Sbjct: 280 KLNISGSVFRPDAGGSGQTMVDSGSEFTHLVDAAYDKVRAEIMTRVGRRLKKGYVYGGTA 339

Query: 164 LFDTCYDFSSRSSVEVPT----VSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS-- 217
             D C+D    +   +P     + F F  G  + +P +  L+ V   G  C     +S  
Sbjct: 340 --DMCFD---GNVAMIPRLIGDLVFVFTRGVEIFVPKERVLVNV-GGGIHCVGIGRSSML 393

Query: 218 -SSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
            ++ +IIGNV QQ   V F++ N  +GF    C
Sbjct: 394 GAASNIIGNVHQQNLWVEFDVTNRRVGFAKADC 426


>gi|326499199|dbj|BAK06090.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 505

 Score = 73.2 bits (178), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 67/244 (27%), Positives = 104/244 (42%), Gaps = 27/244 (11%)

Query: 17  NIAIGCGHNNEGLFVGAA---GLLGLGGGSLSFPS-----QINASTFSYCLVDRDSDSTS 68
            I  GCG    G F+ AA   GL GLG   +S PS      + +++FS C      D   
Sbjct: 213 QIMFGCGEVQTGSFLDAAAPNGLFGLGVDMISVPSILAQKGLTSNSFSMCF---GRDGIG 269

Query: 69  TLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIV 128
            + F      +    PL  N +  T Y + +TGI+VG +L+ +  +            I 
Sbjct: 270 RISFGDQGSSDQEETPLDINQKHPT-YAITITGIAVGNNLMDLEVST-----------IF 317

Query: 129 DSGTAVTRLQTETYNALRDAFVRGTRA-LSPTDGVALFDTCYDFSS-RSSVEVPTVSFHF 186
           D+GT+ T L    Y  + D F    +A     D    F+ CYD SS  + ++ P++S   
Sbjct: 318 DTGTSFTYLADPAYTYITDGFHSQVQANRHAADSRIPFEYCYDLSSSEARIQTPSISLRT 377

Query: 187 PEGKVLPLPAKNYLIPVDSNG-TFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNSLIGFT 245
             G + P      +I +  +   +C A    S+ L+IIG     G RV F+    ++G+ 
Sbjct: 378 VGGSLFPAIDPGQVISIQQHEYVYCLAIV-KSTKLNIIGQNFMTGVRVVFDRERKILGWK 436

Query: 246 PNKC 249
              C
Sbjct: 437 KFNC 440


>gi|297846526|ref|XP_002891144.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297336986|gb|EFH67403.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 445

 Score = 72.8 bits (177), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 75/273 (27%), Positives = 125/273 (45%), Gaps = 31/273 (11%)

Query: 1   GDFVTETVTLGSASVDNI-----AIGCGHNNEGLFVGAAGLLGLGGGS-LSFPSQINAS- 53
           G+  TET+++ S+S   +     A GCG+NN G F      +   GG  LS  SQ+ +S 
Sbjct: 176 GEVATETISIDSSSGSPVSFPGTAFGCGYNNGGTFEETGSGIIGLGGGPLSLVSQLGSSI 235

Query: 54  --TFSYCLVDRDSDS---------TSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGI 102
              FSYCL    + +         T+++    S     +T PL++  + +T+Y+L L  I
Sbjct: 236 GKKFSYCLSHTSATTNGTSVINLGTNSMTSKPSKDSAILTTPLIQK-DPETYYFLTLEAI 294

Query: 103 SVGGDLLPIS---ETAFKIDESGNGGIIVDSGTAVTRLQTETYN---ALRDAFVRGTRAL 156
           +VG   LP +     +        G II+DSGT +T L +  Y+   A+ +  V G + +
Sbjct: 295 TVGKTKLPYTGGGGYSLNRKSKKTGNIIIDSGTTLTLLDSGFYDDFGAVVEESVTGAKRV 354

Query: 157 SPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPT 216
           S   G+     C+  S    + +PT++ HF    V   P  +++    S    C +  PT
Sbjct: 355 SDPQGI--LTHCFK-SGDKEIGLPTITMHFTGADVKLSPINSFVKL--SEDIVCLSMIPT 409

Query: 217 SSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
            + ++I GN+ Q    V ++L    + F    C
Sbjct: 410 -TEVAIYGNMVQMDFLVGYDLETKTVSFQRMDC 441


>gi|18414692|ref|NP_567506.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|15809800|gb|AAL06828.1| AT4g16560/dl4305c [Arabidopsis thaliana]
 gi|18377815|gb|AAL67094.1| AT4g16560/dl4305c [Arabidopsis thaliana]
 gi|332658370|gb|AEE83770.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 499

 Score = 72.8 bits (177), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 79/290 (27%), Positives = 115/290 (39%), Gaps = 66/290 (22%)

Query: 17  NIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINA------STFSYCLVDRDSDSTSTL 70
           N   GC H      +G AG    G G LS P+Q+        ++FSYCLV    DS    
Sbjct: 211 NFTFGCAHTTLAEPIGVAGF---GRGRLSLPAQLAVHSPHLGNSFSYCLVSHSFDSDRVR 267

Query: 71  E---------------------------FDSSLPPNAVTAPLLRNHELDTFYYLGLTGIS 103
                                        +       V   +L N +   FY + L GIS
Sbjct: 268 RPSPLILGRFVDKKEKRVGTTDDHDDGDDEKKKKNEFVFTEMLENPKHPYFYSVSLQGIS 327

Query: 104 VGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAF--------VRGTRA 155
           +G   +P      +ID++G GG++VDSGT  T L  + YN++ + F         R  R 
Sbjct: 328 IGKRNIPAPAMLRRIDKNGGGGVVVDSGTTFTMLPAKFYNSVVEEFDSRVGRVHERADR- 386

Query: 156 LSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGK-VLPLPAKNYLIPV----------D 204
           + P+ G++    CY  +   +V+VP +  HF   +  + LP +NY               
Sbjct: 387 VEPSSGMS---PCYYLN--QTVKVPALVLHFAGNRSSVTLPRRNYFYEFMDGGDGKEEKR 441

Query: 205 SNGTFCFAFAPTSSSL-----SIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
             G          S L     +I+GN QQQG  V ++L N  +GF   KC
Sbjct: 442 KIGCLMLMNGGDESELRGGTGAILGNYQQQGFEVVYDLLNRRVGFAKRKC 491


>gi|326500240|dbj|BAK06209.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 505

 Score = 72.8 bits (177), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 67/244 (27%), Positives = 104/244 (42%), Gaps = 27/244 (11%)

Query: 17  NIAIGCGHNNEGLFVGAA---GLLGLGGGSLSFPS-----QINASTFSYCLVDRDSDSTS 68
            I  GCG    G F+ AA   GL GLG   +S PS      + +++FS C      D   
Sbjct: 213 QIMFGCGEVQTGSFLDAAAPNGLFGLGVDMISVPSILAQKGLTSNSFSMCF---GRDGIG 269

Query: 69  TLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIV 128
            + F      +    PL  N +  T Y + +TGI+VG +L+ +  +            I 
Sbjct: 270 RISFGDQGSSDQEETPLDINQKHPT-YAITITGIAVGNNLMDLEVST-----------IF 317

Query: 129 DSGTAVTRLQTETYNALRDAFVRGTRA-LSPTDGVALFDTCYDFSS-RSSVEVPTVSFHF 186
           D+GT+ T L    Y  + D F    +A     D    F+ CYD SS  + ++ P++S   
Sbjct: 318 DTGTSFTYLADPAYTYITDGFHSQVQANRHAADSRIPFEYCYDLSSSEARIQTPSISLRT 377

Query: 187 PEGKVLPLPAKNYLIPVDSNG-TFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNSLIGFT 245
             G + P      +I +  +   +C A    S+ L+IIG     G RV F+    ++G+ 
Sbjct: 378 VGGSLFPAIDPGQVISIQQHEYVYCLAIV-KSTKLNIIGQNFMTGVRVVFDRERKILGWK 436

Query: 246 PNKC 249
              C
Sbjct: 437 KFNC 440


>gi|297794789|ref|XP_002865279.1| hypothetical protein ARALYDRAFT_494467 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297311114|gb|EFH41538.1| hypothetical protein ARALYDRAFT_494467 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 419

 Score = 72.8 bits (177), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 72/249 (28%), Positives = 112/249 (44%), Gaps = 35/249 (14%)

Query: 35  GLLGLGGGSLSFPSQIN--ASTFSYCL-----VDRDSDSTSTLEFDSSLPPNAVTA---- 83
           G+ G G G LS PSQ+      FS+C      V+  + S+  +   S+L  N   +    
Sbjct: 159 GIAGFGRGLLSLPSQLGFLEKGFSHCFLPFKFVNNPNISSPLILGASALSINLTDSLQFT 218

Query: 84  PLLRNHELDTFYYLGLTGISVGGDLLP--ISETAFKIDESGNGGIIVDSGTAVTRLQTET 141
           P+L        YY+GL  I++G ++ P  +  T  + D  GNGG++VDSGT  T L    
Sbjct: 219 PMLNTPVYPNSYYIGLESITIGTNITPTQVPLTLRQFDSQGNGGMLVDSGTTYTHLPNPF 278

Query: 142 YNAL---RDAFVRGTRALSPTDGVALFDTCYDF----SSRSSVE------VPTVSFHFPE 188
           Y+ L     + +   RA + T+    FD CY      ++ +S+E       P+++F+F  
Sbjct: 279 YSQLLTILQSTITYPRA-TETESRTGFDLCYKVPCPNNNLTSLENDVMMVFPSITFNFLN 337

Query: 189 GKVLPLPAKN--YLIPVDSNGTF--CFAFAPTSS----SLSIIGNVQQQGTRVSFNLRNS 240
              L LP  N  Y +   S+G+   C  F            + G+ QQQ  +V ++L   
Sbjct: 338 NATLLLPQGNSFYAMSAPSDGSVVQCLLFQNMEDGNYGPAGVFGSFQQQNVKVVYDLEKE 397

Query: 241 LIGFTPNKC 249
            IGF    C
Sbjct: 398 RIGFQAMDC 406


>gi|361067845|gb|AEW08234.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
 gi|383130032|gb|AFG45736.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
 gi|383130034|gb|AFG45737.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
 gi|383130036|gb|AFG45738.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
 gi|383130046|gb|AFG45743.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
 gi|383130048|gb|AFG45744.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
 gi|383130050|gb|AFG45745.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
 gi|383130054|gb|AFG45747.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
 gi|383130056|gb|AFG45748.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
          Length = 155

 Score = 72.8 bits (177), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 43/120 (35%), Positives = 55/120 (45%), Gaps = 2/120 (1%)

Query: 93  TFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR- 151
           TFYY+ L G+S+G   L +    F  D  GNGG I+DSGT  T    E Y  +  AF   
Sbjct: 33  TFYYIDLRGVSIGRKRLNLPSKLFSFDTKGNGGTIIDSGTTFTIFNEEFYKNITAAFASQ 92

Query: 152 -GTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFC 210
            G R  S  +       CY+ S    V +P  +FHF  G  + LP  NY     S  + C
Sbjct: 93  IGFRRASEVEARTGMRLCYNVSGVDHVLLPDFAFHFKGGSDMVLPVANYFSYFVSFDSIC 152


>gi|357440767|ref|XP_003590661.1| Basic 7S globulin [Medicago truncatula]
 gi|355479709|gb|AES60912.1| Basic 7S globulin [Medicago truncatula]
          Length = 500

 Score = 72.8 bits (177), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 63/255 (24%), Positives = 108/255 (42%), Gaps = 37/255 (14%)

Query: 27  EGLFVGAAGLLGLGGGSLSFPSQINAS-----TFSYCLVDRDS-----DSTSTLEFDSSL 76
            GL  GA+G+ GLG   ++ PSQ+ ++      F++C    D      D   +   D+  
Sbjct: 171 RGLAGGASGMAGLGRTKIALPSQLASAFIFKRKFAFCFSSSDGVIIFGDGPYSFLADNPS 230

Query: 77  PPNAV-------TAPLLRNH----------ELDTFYYLGLTGISVGGDLLPISETAFKID 119
            PN V         PLL NH          E    Y++G+  I + G ++ ++ +   ID
Sbjct: 231 LPNVVFDSKSLTYTPLLINHVSTASAFLQGESSVEYFIGVKTIKIDGKVVSLNSSLLSID 290

Query: 120 ESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGT--RALSPTDGVALFDTCYDFSSRS-- 175
             G GG  + +    T L+   Y A+ DAFV+ +  R ++  D    F+ CY F +    
Sbjct: 291 NKGVGGTKISTVDPYTVLEASIYKAVTDAFVKASVARNITTEDSSPPFEFCYSFDNLPGT 350

Query: 176 --SVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTSSSLS---IIGNVQQQG 230
                VPT+        +  +   N ++ ++ +   C  F     +L    +IG  Q + 
Sbjct: 351 PLGASVPTIELLLQNNVIWSMFGANSMVNIN-DEVLCLGFVNGGVNLRTSIVIGGYQLEN 409

Query: 231 TRVSFNLRNSLIGFT 245
             + F+L  S +GF+
Sbjct: 410 NLLQFDLAASRLGFS 424


>gi|383130038|gb|AFG45739.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
          Length = 154

 Score = 72.8 bits (177), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 41/110 (37%), Positives = 52/110 (47%), Gaps = 2/110 (1%)

Query: 93  TFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR- 151
           TFYY+ L G+S+G   L +    F  D  GNGG I+DSGT  T    E Y  +  AF   
Sbjct: 33  TFYYIDLRGVSIGRKRLNLPSKLFSFDTKGNGGTIIDSGTTFTIFNEEFYKNITAAFASQ 92

Query: 152 -GTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYL 200
            G R  S  +       CY+ S    V +P  +FHF  G  + LP  NY 
Sbjct: 93  IGFRRASEVEARTGMRLCYNVSGVDHVLLPDFAFHFKGGSDMVLPVANYF 142


>gi|242074844|ref|XP_002447358.1| hypothetical protein SORBIDRAFT_06g033560 [Sorghum bicolor]
 gi|241938541|gb|EES11686.1| hypothetical protein SORBIDRAFT_06g033560 [Sorghum bicolor]
          Length = 497

 Score = 72.8 bits (177), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 79/285 (27%), Positives = 124/285 (43%), Gaps = 39/285 (13%)

Query: 1   GDFVTETVTLGSASVDNIAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQINASTFSYCLV 60
           G  + +T+     +V    +GC  +   +    +GL G G G+ S P+Q+  S FSYCL+
Sbjct: 212 GLLIADTLRAPGRAVSGFVLGC--SLVSVHQPPSGLAGFGRGAPSVPAQLGLSKFSYCLL 269

Query: 61  DRDSDSTSTLEFDSSLPPN---AVTAPLLRNHELD-----TFYYLGLTGISVGGDLLPIS 112
            R  D  + +     L  +       PL+++   D      +YYL L+G++VGG  + + 
Sbjct: 270 SRRFDDNAAVSGSLVLGGDNDGMQYVPLVKSAAGDKQPYAVYYYLALSGVTVGGKAVRLP 329

Query: 113 ETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFV-----RGTRALSPTDGVALFDT 167
             AF  + +G+GG IVDSGT  T L    +  + DA V     R  R+    +G+ L   
Sbjct: 330 ARAFAANAAGSGGAIVDSGTTFTYLDPTVFQPVADAVVAAVGGRYKRSKDVEEGLGL-HP 388

Query: 168 CYDFSS-RSSVEVPTVSFHFPEGKVLPLPAKNYLI-----PVD-------SNGTFCFAF- 213
           C+       S+ +P +S HF  G V+ LP +NY +     PV        +    C A  
Sbjct: 389 CFALPQGAKSMALPELSLHFKGGAVMQLPLENYFVVAGRAPVPGAGAGAGAAEAICLAVV 448

Query: 214 ---------APTSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
                            I+G+ QQQ   V ++L    +GF    C
Sbjct: 449 TDFGGSGAGDEGGGPAIILGSFQQQNYLVEYDLEKERLGFRRQPC 493


>gi|383130040|gb|AFG45740.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
          Length = 155

 Score = 72.8 bits (177), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 43/120 (35%), Positives = 55/120 (45%), Gaps = 2/120 (1%)

Query: 93  TFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR- 151
           TFYY+ L G+S+G   L +    F  D  GNGG I+DSGT  T    E Y  +  AF   
Sbjct: 33  TFYYIDLRGVSIGRKRLNLPSKLFSFDTKGNGGTIIDSGTTFTIFNEEFYKNITAAFASQ 92

Query: 152 -GTRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFC 210
            G R  S  +       CY+ S    V +P  +FHF  G  + LP  NY     S  + C
Sbjct: 93  IGFRRASEVEARTGMRLCYNVSGVDHVLLPDFAFHFKGGSDMVLPVANYFSYFVSFDSIC 152


>gi|326515366|dbj|BAK03596.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 452

 Score = 72.8 bits (177), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 75/274 (27%), Positives = 117/274 (42%), Gaps = 29/274 (10%)

Query: 2   DFVTETVTLGS--ASVDNIAIGCGHNNEGLFVG--AAGLLGLGGGSLSFPSQINAS---- 53
           DFV +    GS  +SV+ +  GC HN    +     AG++ L     SF  Q++A     
Sbjct: 174 DFVFDGSGPGSPISSVNGLVFGCAHNTHDFYNHDLWAGVMSLNRHPTSFIRQLSARGLAA 233

Query: 54  -TFSYCLVDRDS-DSTSTLEFDSSLP--PNAVTAPLLRNHELD---TFYYLGLTGISVGG 106
             FSYCL  R   D    L F + +P   +A + PLL          +Y   +     G 
Sbjct: 234 PRFSYCLASRQHRDRRGFLRFGADIPDQSHARSTPLLHGDLAQGGGMYYVGVVGVSLGGR 293

Query: 107 DLLPISETAFKIDE-SGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALF 165
            L  I+   F+++  S  GG I+D GT++T + T  Y+ L    +   R+       A+F
Sbjct: 294 RLTAITPVMFELNRRSLRGGCIIDVGTSLTLMATAPYHVLVAELIAHMRSRGVQH--AIF 351

Query: 166 DTCYDFSSRSSVE-----VPTVSFHF---PEGKVLPLPAKNYLIPVDSNGT--FCFAFAP 215
                   R   E     +P+V+ HF   PE   L +  +   + +    T   C A  P
Sbjct: 352 SPGQKHCFRGKWESIHRHLPSVTLHFQFHPESVALFIRPELLFVAMTGERTDYVCLAIVP 411

Query: 216 TSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
            +   +IIG  Q   TR +F+L+ + + F P +C
Sbjct: 412 YAER-TIIGAGQMLDTRFTFDLQQNRLFFAPEQC 444


>gi|297848856|ref|XP_002892309.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297338151|gb|EFH68568.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 484

 Score = 72.8 bits (177), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 69/276 (25%), Positives = 114/276 (41%), Gaps = 40/276 (14%)

Query: 1   GDFVTETVTLGSASVD--------NIAIGCGHNNEGLFVGAA-----GLLGLGGGSLSFP 47
           G FV + V   S + D        ++  GCG    G    +      G+LG G  + S  
Sbjct: 176 GYFVKDVVQYDSVAGDLKTQTANGSVIFGCGARQSGDLDSSNEEALDGILGFGKANSSMI 235

Query: 48  SQINAS-----TFSYCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGI 102
           SQ+ +S      F++CL  R+            + P     PL+ N      Y + +T +
Sbjct: 236 SQLASSGRVKKIFAHCLDGRNGGGI--FAIGRVVQPKVNMTPLVPNQP---HYNVNMTAV 290

Query: 103 SVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGV 162
            VG + L I    F+  +    G I+DSGT +  L    Y  L    V+   +  P   V
Sbjct: 291 QVGQEFLNIPADLFQPGD--RKGAIIDSGTTLAYLPEIIYEPL----VKKITSQEPALKV 344

Query: 163 ALFDT---CYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS-- 217
            + D    C+ +S R     P V+FHF     L +   +YL P +  G +C  +  ++  
Sbjct: 345 HIVDKDYKCFQYSGRVDEGFPNVTFHFENSVFLRVYPHDYLFPYE--GMWCIGWQNSAMQ 402

Query: 218 ----SSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
                +++++G++      V ++L N LIG+T   C
Sbjct: 403 SRDRRNMTLLGDLVLSNKLVLYDLENQLIGWTEYNC 438


>gi|242086416|ref|XP_002443633.1| hypothetical protein SORBIDRAFT_08g022640 [Sorghum bicolor]
 gi|241944326|gb|EES17471.1| hypothetical protein SORBIDRAFT_08g022640 [Sorghum bicolor]
          Length = 503

 Score = 72.8 bits (177), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 64/214 (29%), Positives = 98/214 (45%), Gaps = 24/214 (11%)

Query: 53  STFSYCLVDR-DSDSTSTLEFDSSLPPNAVTA--PLLRN---HELDTFYYLGLTGISVGG 106
           + FSYCL     S    +L  D+++  + VTA  PL+ N    EL + Y++ L G+S+G 
Sbjct: 297 AAFSYCLPKSPSSQGYLSLAVDATVRHDKVTAHAPLVSNGGDPELASMYFIDLVGMSLGV 356

Query: 107 DLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVR----GTRALSPTDGV 162
           D +PI          GN G+ +D GT  T+L  E Y  LRD+F +       +L   DG 
Sbjct: 357 DDIPIPPAG----SFGNNGVNLDLGTTFTKLTPEVYMTLRDSFRKQMSQNNHSLLGFDG- 411

Query: 163 ALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGT----FCFAFAPTSS 218
             FDTC++ +    + +P + F F  G+ L +     L   D         C AF+   +
Sbjct: 412 --FDTCFNLTGVRDLAMPLLWFKFSNGERLLIDLDQMLYYDDPAAAPFTMACLAFSSLDA 469

Query: 219 SLS---IIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
             S   +IG      T V +++    +GF P  C
Sbjct: 470 GDSFSAVIGTHTLASTEVIYDVAGGKVGFIPRSC 503


>gi|242086414|ref|XP_002443632.1| hypothetical protein SORBIDRAFT_08g022630 [Sorghum bicolor]
 gi|241944325|gb|EES17470.1| hypothetical protein SORBIDRAFT_08g022630 [Sorghum bicolor]
          Length = 556

 Score = 72.4 bits (176), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 54/174 (31%), Positives = 79/174 (45%), Gaps = 17/174 (9%)

Query: 88  NHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRD 147
           N EL + Y++ L GIS+G + L I    F     GN    +D GT  T L  + Y ALR+
Sbjct: 388 NPELASMYFIDLVGISLGDEDLSIPAGTF-----GNRSTNLDVGTTFTILAPDAYTALRE 442

Query: 148 AFVRGTRAL----SPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPV 203
           +F R         SPTD    FDTC++F+  + + +P V   F  G +L + A   L   
Sbjct: 443 SFKRQMSQYNFSSSPTDIAGGFDTCFNFTDLNDLVIPNVQLKFSNGDMLVIDADQMLYYD 502

Query: 204 DSNGT-----FCFAFAPT---SSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           D          C AF+      S  ++IG+     T V +++    +GF P  C
Sbjct: 503 DDTDAAPFTMACLAFSSLDAGDSFAAVIGSYTLATTEVVYDVAGGQVGFIPWSC 556


>gi|449432733|ref|XP_004134153.1| PREDICTED: basic 7S globulin-like [Cucumis sativus]
          Length = 432

 Score = 72.4 bits (176), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 68/267 (25%), Positives = 110/267 (41%), Gaps = 34/267 (12%)

Query: 12  SASVDNIAIGCGHNN--EGLFVGAAGLLGLGGGSLSFPSQINAS-----TFSYCLVD--- 61
           + S+ N    CG     EGL  G +G+ G G   +S PSQ +A+      F+ CL     
Sbjct: 148 AVSIPNFLFVCGPTFLLEGLAGGVSGMAGFGRTGISLPSQFSAAFSFNRKFAVCLSGSTR 207

Query: 62  ---------------RDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGG 106
                          ++ D T +L +         TA +  + E  + Y++G+  I    
Sbjct: 208 SPGVIFSGNGPYHFLQNVDVTKSLTYTPLFINPVSTAGVSTSGEKSSEYFIGVKSIVFNS 267

Query: 107 DLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFD 166
             +PI+ T  KID +GNGG  + +    T L++  YNAL     R  R +     VA F 
Sbjct: 268 KTVPINTTLLKIDSNGNGGTKISTVHPYTVLESSIYNALVKTITRELRNIPRVAAVAPFG 327

Query: 167 TCYDFSSRSSVE----VPTVSFHFPEGKVL-PLPAKNYLIPVDSNGTFCFAFAP---TSS 218
            CY   S  S      +P++       KV+  +   N ++ V+     C  F      + 
Sbjct: 328 VCYKSKSFGSTRLGPGMPSIDLILQNKKVIWRIFGANSMVQVNEE-VLCLGFVDGGVEAR 386

Query: 219 SLSIIGNVQQQGTRVSFNLRNSLIGFT 245
           +  +IG  Q +   + F+L  S +GF+
Sbjct: 387 TAIVIGAYQMEDNLLEFDLATSRLGFS 413


>gi|18390579|ref|NP_563751.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|332189782|gb|AEE27903.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 485

 Score = 72.4 bits (176), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 69/276 (25%), Positives = 113/276 (40%), Gaps = 40/276 (14%)

Query: 1   GDFVTETVTLGSASVD--------NIAIGCGHNNEGLFVGAA-----GLLGLGGGSLSFP 47
           G FV + V   S + D        ++  GCG    G    +      G+LG G  + S  
Sbjct: 176 GYFVKDVVQYDSVAGDLKTQTANGSVIFGCGARQSGDLDSSNEEALDGILGFGKANSSMI 235

Query: 48  SQINAS-----TFSYCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGI 102
           SQ+ +S      F++CL  R+            + P     PL+ N      Y + +T +
Sbjct: 236 SQLASSGRVKKIFAHCLDGRNGGGI--FAIGRVVQPKVNMTPLVPNQP---HYNVNMTAV 290

Query: 103 SVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGV 162
            VG + L I    F+  +    G I+DSGT +  L    Y  L    V+   +  P   V
Sbjct: 291 QVGQEFLTIPADLFQPGD--RKGAIIDSGTTLAYLPEIIYEPL----VKKITSQEPALKV 344

Query: 163 ALFDT---CYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPVDSNGTFCFAFAPTS-- 217
            + D    C+ +S R     P V+FHF     L +   +YL P    G +C  +  ++  
Sbjct: 345 HIVDKDYKCFQYSGRVDEGFPNVTFHFENSVFLRVYPHDYLFP--HEGMWCIGWQNSAMQ 402

Query: 218 ----SSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
                +++++G++      V ++L N LIG+T   C
Sbjct: 403 SRDRRNMTLLGDLVLSNKLVLYDLENQLIGWTEYNC 438


>gi|383147800|gb|AFG55671.1| Pinus taeda anonymous locus CL1877Contig1_03 genomic sequence
 gi|383147802|gb|AFG55672.1| Pinus taeda anonymous locus CL1877Contig1_03 genomic sequence
 gi|383147804|gb|AFG55673.1| Pinus taeda anonymous locus CL1877Contig1_03 genomic sequence
 gi|383147806|gb|AFG55674.1| Pinus taeda anonymous locus CL1877Contig1_03 genomic sequence
          Length = 59

 Score = 72.4 bits (176), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 32/59 (54%), Positives = 42/59 (71%)

Query: 191 VLPLPAKNYLIPVDSNGTFCFAFAPTSSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
           +L LP  NY++PVD+ GT CFAFAPT S  SI+GN+QQQ   VS++  N  IGF  ++C
Sbjct: 1   ILSLPTNNYVVPVDNMGTHCFAFAPTDSGFSIMGNIQQQHIGVSYDTYNGQIGFALDQC 59


>gi|449527083|ref|XP_004170542.1| PREDICTED: LOW QUALITY PROTEIN: basic 7S globulin-like [Cucumis
           sativus]
          Length = 432

 Score = 72.4 bits (176), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 68/267 (25%), Positives = 110/267 (41%), Gaps = 34/267 (12%)

Query: 12  SASVDNIAIGCGHNN--EGLFVGAAGLLGLGGGSLSFPSQINAS-----TFSYCLVD--- 61
           + S+ N    CG     EGL  G +G+ G G   +S PSQ +A+      F+ CL     
Sbjct: 148 AVSIPNFLFVCGPTFLLEGLAGGVSGMAGFGRTGISLPSQFSAAFSFNRKFAVCLSGSTR 207

Query: 62  ---------------RDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGISVGG 106
                          ++ D T +L +         TA +  + E  + Y++G+  I    
Sbjct: 208 SPGVIFSGNGPYHFLQNVDVTKSLTYTPLFINPVSTAGVSTSGEKSSEYFIGVKSIVFNS 267

Query: 107 DLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRALSPTDGVALFD 166
             +PI+ T  KID +GNGG  + +    T L++  YNAL     R  R +     VA F 
Sbjct: 268 KTVPINTTLLKIDSNGNGGTKISTVHPYTVLESSIYNALVKTITRELRNIPRVAAVAPFG 327

Query: 167 TCYDFSSRSSVE----VPTVSFHFPEGKVL-PLPAKNYLIPVDSNGTFCFAFAP---TSS 218
            CY   S  S      +P++       KV+  +   N ++ V+     C  F      + 
Sbjct: 328 VCYKSKSFGSTRLGPGMPSIDLILQNKKVIWRIFGANSMVQVNEE-VLCLGFVDGGVEAR 386

Query: 219 SLSIIGNVQQQGTRVSFNLRNSLIGFT 245
           +  +IG  Q +   + F+L  S +GF+
Sbjct: 387 TAIVIGAYQMEDNLLEFDLATSRLGFS 413


>gi|297740190|emb|CBI30372.3| unnamed protein product [Vitis vinifera]
          Length = 445

 Score = 72.4 bits (176), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 54/196 (27%), Positives = 84/196 (42%), Gaps = 19/196 (9%)

Query: 34  AGLLGLGGGSLSFPSQINASTFSYCLV-DRDSDSTSTLEFDSSLPPNAV----------- 81
           +G+ G G G  S P Q+    FSYCL+  R  DS  + +    + P++            
Sbjct: 234 SGIAGFGRGPSSLPKQMGLKKFSYCLLSHRFDDSPKSSKMTLYVGPDSKDDKTGGLSYTP 293

Query: 82  --TAPLLRNHELDTFYYLGLTGISVGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQT 139
               P+  N     +YY+ L  I VG   + +  +       GNGG IVDSG+  T ++ 
Sbjct: 294 FRKNPVSSNSAFKEYYYVTLRHIIVGDKRVKVPYSFMVAGSDGNGGTIVDSGSTFTFMEK 353

Query: 140 ETYNALRDAFVRG----TRALSPTDGVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLP 195
             + A+   F R     TRA +  + ++    C++ S   SV +P++ F F  G  + LP
Sbjct: 354 PVFEAVATEFDRQMANYTRA-ADVEALSGLKPCFNLSGVGSVALPSLVFQFKGGAKMELP 412

Query: 196 AKNYLIPVDSNGTFCF 211
             NY   V      C 
Sbjct: 413 VANYFSLVGDLSVLCL 428


>gi|224140237|ref|XP_002323490.1| predicted protein [Populus trichocarpa]
 gi|222868120|gb|EEF05251.1| predicted protein [Populus trichocarpa]
          Length = 478

 Score = 72.4 bits (176), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 81/273 (29%), Positives = 115/273 (42%), Gaps = 35/273 (12%)

Query: 1   GDFVTETV----TLGSASVDN----IAIGCGHNNEGLFV----GAAGLLGLGGGSLSFPS 48
           G +V++T+     LG + V N    I  GC     G          G+ G G G LS  S
Sbjct: 163 GYYVSDTLYFDAILGESLVVNSSALIVFGCSTFQSGDLTMTDKAVDGIFGFGQGELSVIS 222

Query: 49  Q-----INASTFSYCLVDRDSDSTSTLEFDSSLPPNAVTAPLLRNHELDTFYYLGLTGIS 103
           Q     I    FS+CL   +      L     L P  V +PL+ +      Y L L  I+
Sbjct: 223 QLSTHGITPRVFSHCL-KGEGIGGGILVLGEILEPGMVYSPLVPSQP---HYNLNLQSIA 278

Query: 104 VGGDLLPISETAFKIDESGNGGIIVDSGTAVTRLQTETYNALRDAFVRGTRAL---SPTD 160
           V G LLPI  + F    S + G IVDSGT +  L  E Y    D FV     +   S T 
Sbjct: 279 VNGKLLPIDPSVFA--TSNSQGTIVDSGTTLAYLVAEAY----DPFVSAVNVIVSPSVTP 332

Query: 161 GVALFDTCYDFSSRSSVEVPTVSFHFPEGKVLPLPAKNYLIPV-DSNG---TFCFAFAPT 216
            ++  + CY  S+  S   P  SF+F  G  + L  ++YLIP   S G    +C  F   
Sbjct: 333 IISKGNQCYLVSTSVSQMFPLASFNFAGGASMVLKPEDYLIPFGPSQGGSVMWCIGFQKV 392

Query: 217 SSSLSIIGNVQQQGTRVSFNLRNSLIGFTPNKC 249
              ++I+G++  +     ++L    IG+    C
Sbjct: 393 -QGVTILGDLVLKDKIFVYDLVRQRIGWANYDC 424


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.317    0.136    0.397 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,971,609,766
Number of Sequences: 23463169
Number of extensions: 171560387
Number of successful extensions: 425754
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 869
Number of HSP's successfully gapped in prelim test: 1033
Number of HSP's that attempted gapping in prelim test: 421022
Number of HSP's gapped (non-prelim): 2007
length of query: 249
length of database: 8,064,228,071
effective HSP length: 139
effective length of query: 110
effective length of database: 9,097,814,876
effective search space: 1000759636360
effective search space used: 1000759636360
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 75 (33.5 bits)