BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 011042
         (495 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|255544139|ref|XP_002513132.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223548143|gb|EEF49635.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 481

 Score =  708 bits (1827), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 354/462 (76%), Positives = 400/462 (86%), Gaps = 17/462 (3%)

Query: 34  HFQILNVNESIKGSRTDHAKMSQYNELFERHNNISSSNTSSDEARWNLELVHRDKMSSSS 93
           HFQ+LNV E+I        K SQY ELF+  N+  +      E +W L+LVHRDK+    
Sbjct: 37  HFQLLNVKEAIT-----ETKASQYQELFDNQNDTLT------EGKWKLKLVHRDKI---- 81

Query: 94  NTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGS 153
            T  N   + H H+FHAR+QRD KRVATL+RRLS   A ++ + V++FG +VVSGM+QGS
Sbjct: 82  -TAFNKSSYDHSHNFHARIQRDKKRVATLIRRLSPRDATSS-YSVEEFGAEVVSGMNQGS 139

Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSS 213
           GEYF+RIGVGSPPR QY+VIDSGSDIVWVQCQPC+QCY Q+DPVFDPADSASF GV CSS
Sbjct: 140 GEYFIRIGVGSPPREQYVVIDSGSDIVWVQCQPCTQCYHQTDPVFDPADSASFMGVPCSS 199

Query: 214 AVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFV 273
           +VC+R+ENAGCHAG CRYEV YGDGSYTKGTLALETLT GRTVV+NVAIGCGH+N+GMFV
Sbjct: 200 SVCERIENAGCHAGGCRYEVMYGDGSYTKGTLALETLTFGRTVVRNVAIGCGHRNRGMFV 259

Query: 274 GAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVR 333
           GAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGT S+GSL FGR A+PVGAAW+PL+R
Sbjct: 260 GAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTDSAGSLEFGRGAMPVGAAWIPLIR 319

Query: 334 NPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRD 393
           NPRAPSFYY+ LSG+GVGGM++PISED+F+L +MG+ GVVMDTGTAVTR+PT AY AFRD
Sbjct: 320 NPRAPSFYYIRLSGVGVGGMKVPISEDVFQLNEMGNGGVVMDTGTAVTRIPTVAYVAFRD 379

Query: 394 AFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAG 453
           AF+ QTGNLPRASGVSIFDTCYNL+GFVSVRVPTVSFYF+GGP+LTLPA NFLIPVDD G
Sbjct: 380 AFIGQTGNLPRASGVSIFDTCYNLNGFVSVRVPTVSFYFAGGPILTLPARNFLIPVDDVG 439

Query: 454 TFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           TFCFAFA SPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC
Sbjct: 440 TFCFAFAASPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 481


>gi|225463766|ref|XP_002267930.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 479

 Score =  699 bits (1804), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 351/461 (76%), Positives = 399/461 (86%), Gaps = 23/461 (4%)

Query: 35  FQILNVNESIKGSRTDHAKMSQYNELFERHNNISSSNTSSDEARWNLELVHRDKMSSSSN 94
           FQ LNV E+I G+R    ++S+ +E                  +W +++VHRD++S  ++
Sbjct: 42  FQHLNVKETIAGTRIIPLEVSEDHE--------------EGGEKWMMKVVHRDQLSFGNS 87

Query: 95  TTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSG 154
                    H+H    R++RD KRVA+L+RRLS GG     + V DFGTDV+SGM+QGSG
Sbjct: 88  DD-------HRHRLDGRLKRDAKRVASLIRRLSSGGG--GSYRVDDFGTDVISGMEQGSG 138

Query: 155 EYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSA 214
           EYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC+QCY QSDPVFDPADSASF+GVSCSS+
Sbjct: 139 EYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCTQCYHQSDPVFDPADSASFTGVSCSSS 198

Query: 215 VCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVG 274
           VCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLT GRT+V++VAIGCGH+N+GMFVG
Sbjct: 199 VCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTFGRTMVRSVAIGCGHRNRGMFVG 258

Query: 275 AAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRN 334
           AAGLLGLGGGSMS VGQLGGQTGGAFSYCLVSRGT SSGSLVFGREALP GAAWVPLVRN
Sbjct: 259 AAGLLGLGGGSMSFVGQLGGQTGGAFSYCLVSRGTDSSGSLVFGREALPAGAAWVPLVRN 318

Query: 335 PRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDA 394
           PRAPSFYY+GL+GLGVGG+R+PISE++FRLT++GD GVVMDTGTAVTRLPT AY+AFRDA
Sbjct: 319 PRAPSFYYIGLAGLGVGGIRVPISEEVFRLTELGDGGVVMDTGTAVTRLPTLAYQAFRDA 378

Query: 395 FVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGT 454
           F+AQT NLPRA+GV+IFDTCY+L GFVSVRVPTVSFYFSGGP+LTLPA NFLIP+DDAGT
Sbjct: 379 FLAQTANLPRATGVAIFDTCYDLLGFVSVRVPTVSFYFSGGPILTLPARNFLIPMDDAGT 438

Query: 455 FCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           FCFAFAPS SGLSI+GNIQQEGIQISFDGANG+VGFGPN+C
Sbjct: 439 FCFAFAPSTSGLSILGNIQQEGIQISFDGANGYVGFGPNIC 479


>gi|356575389|ref|XP_003555824.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 473

 Score =  667 bits (1721), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 337/462 (72%), Positives = 388/462 (83%), Gaps = 13/462 (2%)

Query: 34  HFQILNVNESIKGSRTDHAKMSQYNELFERHNNISSSNTSSDEARWNLELVHRDKMSSSS 93
           HFQ LNV + +  ++ +    + Y  L  +H  ++ +  +S  A++ L+LVHRDK+    
Sbjct: 25  HFQQLNVKQILTETKLN--PTNTYKHL--QHQKLNIATEASSPAKYKLKLVHRDKVP--- 77

Query: 94  NTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGS 153
            T N  H HR +  F+ARMQRD KRVA L R L+ G    A+   + FG+DVVSGM+QGS
Sbjct: 78  -TFNTSHDHRTR--FNARMQRDTKRVAALRRHLAAGKPTYAE---EAFGSDVVSGMEQGS 131

Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSS 213
           GEYFVRIGVGSPPR+QY+VIDSGSDI+WVQC+PC+QCY QSDPVF+PADS+S++GVSC+S
Sbjct: 132 GEYFVRIGVGSPPRNQYVVIDSGSDIIWVQCEPCTQCYHQSDPVFNPADSSSYAGVSCAS 191

Query: 214 AVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFV 273
            VC  ++NAGCH GRCRYEVSYGDGSYTKGTLALETLT GRT+++NVAIGCGH NQGMFV
Sbjct: 192 TVCSHVDNAGCHEGRCRYEVSYGDGSYTKGTLALETLTFGRTLIRNVAIGCGHHNQGMFV 251

Query: 274 GAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVR 333
           GAAGLLGLG G MS VGQLGGQ GG FSYCLVSRG  SSG L FGREA+PVGAAWVPL+ 
Sbjct: 252 GAAGLLGLGSGPMSFVGQLGGQAGGTFSYCLVSRGIQSSGLLQFGREAVPVGAAWVPLIH 311

Query: 334 NPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRD 393
           NPRA SFYYVGLSGLGVGG+R+PISED+F+L+++GD GVVMDTGTAVTRLPT AYEAFRD
Sbjct: 312 NPRAQSFYYVGLSGLGVGGLRVPISEDVFKLSELGDGGVVMDTGTAVTRLPTAAYEAFRD 371

Query: 394 AFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAG 453
           AF+AQT NLPRASGVSIFDTCY+L GFVSVRVPTVSFYFSGGP+LTLPA NFLIPVDD G
Sbjct: 372 AFIAQTTNLPRASGVSIFDTCYDLFGFVSVRVPTVSFYFSGGPILTLPARNFLIPVDDVG 431

Query: 454 TFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           +FCFAFAPS SGLSIIGNIQQEGI+IS DGANGFVGFGPNVC
Sbjct: 432 SFCFAFAPSSSGLSIIGNIQQEGIEISVDGANGFVGFGPNVC 473


>gi|449458736|ref|XP_004147103.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
 gi|449518669|ref|XP_004166359.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
          Length = 482

 Score =  653 bits (1684), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 333/469 (71%), Positives = 382/469 (81%), Gaps = 16/469 (3%)

Query: 30  ASDTHFQILNVNESIKGSRTDHAKMSQYNELFERHNNISSSNTSSDEARWNLELVHRDKM 89
           +S T FQ LNV    K ++ D       + L     +   S   SD   + L L+HRDK+
Sbjct: 27  SSSTKFQYLNV----KATKLDFNDGQILHALNFSDGHRQVSGYKSDNNTFKLNLLHRDKL 82

Query: 90  SSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAK---HEVQDFGTDVV 146
           S         H H H+  F+ RM+RD  RVATLVRRLS G   A K   ++V +F TDV+
Sbjct: 83  S---------HVHGHRRGFNDRMKRDAIRVATLVRRLSHGAPAAVKDSRYKVANFATDVI 133

Query: 147 SGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASF 206
           SGM+ GSGEYFVRIGVGSPPR+QYMVIDSGSDIVWVQC+PCS+CY+QSDPVFDPADS+SF
Sbjct: 134 SGMEAGSGEYFVRIGVGSPPRNQYMVIDSGSDIVWVQCKPCSRCYQQSDPVFDPADSSSF 193

Query: 207 SGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGH 266
           +GVSC S VCDRLEN GC+AGRCRYEVSYGDGSYTKGTLALETLT+G+ ++++VAIGCGH
Sbjct: 194 AGVSCGSDVCDRLENTGCNAGRCRYEVSYGDGSYTKGTLALETLTVGQVMIRDVAIGCGH 253

Query: 267 KNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGA 326
            NQGMF+GAAGLLGLGGGSMS +GQLGGQTGGAFSYCLVSRGTGS+G+L FGR ALPVGA
Sbjct: 254 TNQGMFIGAAGLLGLGGGSMSFIGQLGGQTGGAFSYCLVSRGTGSTGALEFGRGALPVGA 313

Query: 327 AWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTP 386
            W+ L+RNPRAPSFYY+GL+G+GVGG+R+ + E+ F+LT+ G +GVVMDTGTAVTR PT 
Sbjct: 314 TWISLIRNPRAPSFYYIGLAGIGVGGVRVSVPEETFQLTEYGTNGVVMDTGTAVTRFPTA 373

Query: 387 AYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFL 446
           AY AFRD+F AQT NLPRA GVSIFDTCY+L+GF SVRVPTVSFYFS GPVLTLPA NFL
Sbjct: 374 AYVAFRDSFTAQTSNLPRAPGVSIFDTCYDLNGFESVRVPTVSFYFSDGPVLTLPARNFL 433

Query: 447 IPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           IPVD  GTFC AFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPN+C
Sbjct: 434 IPVDGGGTFCLAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNIC 482


>gi|297742733|emb|CBI35367.3| unnamed protein product [Vitis vinifera]
          Length = 521

 Score =  649 bits (1673), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 323/418 (77%), Positives = 365/418 (87%), Gaps = 28/418 (6%)

Query: 78  RWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHE 137
           +W +++VHRD++S  ++         H+H    R++RD KRVA+L+RRLS GG     + 
Sbjct: 132 KWMMKVVHRDQLSFGNSDD-------HRHRLDGRLKRDAKRVASLIRRLSSGGG--GSYR 182

Query: 138 VQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPV 197
           V DFGTDV+SGM+QGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC+QCY QSDPV
Sbjct: 183 VDDFGTDVISGMEQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCTQCYHQSDPV 242

Query: 198 FDPADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVV 257
           FDPADSASF+GVSCSS+VCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLT GRT+V
Sbjct: 243 FDPADSASFTGVSCSSSVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTFGRTMV 302

Query: 258 KNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVF 317
           ++VAIGCGH+N+GMFVGAAGLLGLGGGSMS VGQLGGQTGGAFSYCLVS           
Sbjct: 303 RSVAIGCGHRNRGMFVGAAGLLGLGGGSMSFVGQLGGQTGGAFSYCLVS----------- 351

Query: 318 GREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTG 377
                   AAWVPLVRNPRAPSFYY+GL+GLGVGG+R+PISE++FRLT++GD GVVMDTG
Sbjct: 352 --------AAWVPLVRNPRAPSFYYIGLAGLGVGGIRVPISEEVFRLTELGDGGVVMDTG 403

Query: 378 TAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPV 437
           TAVTRLPT AY+AFRDAF+AQT NLPRA+GV+IFDTCY+L GFVSVRVPTVSFYFSGGP+
Sbjct: 404 TAVTRLPTLAYQAFRDAFLAQTANLPRATGVAIFDTCYDLLGFVSVRVPTVSFYFSGGPI 463

Query: 438 LTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           LTLPA NFLIP+DDAGTFCFAFAPS SGLSI+GNIQQEGIQISFDGANG+VGFGPN+C
Sbjct: 464 LTLPARNFLIPMDDAGTFCFAFAPSTSGLSILGNIQQEGIQISFDGANGYVGFGPNIC 521


>gi|356536463|ref|XP_003536757.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 475

 Score =  648 bits (1672), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 338/463 (73%), Positives = 389/463 (84%), Gaps = 12/463 (2%)

Query: 34  HFQILNVNESIKGSRTDHAKMSQYNELFERHNN-ISSSNTSSDEARWNLELVHRDKMSSS 92
           HFQ LNV + I      +   +Q ++    HN  ++S+  +S  A++ L+LVHRDK+ + 
Sbjct: 24  HFQQLNVKQIILTETKLYPNPTQPSK--HPHNKKLNSATEASSSAKYKLKLVHRDKVPTF 81

Query: 93  SNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQG 152
           +       YH H+  F+ARMQRD KR A+L+RRL+ G    A    + FG+DVVSGM+QG
Sbjct: 82  NT------YHDHRTRFNARMQRDTKRAASLLRRLAAGKPTYA---AEAFGSDVVSGMEQG 132

Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
           SGEYFVRIGVGSPPR+QY+V+DSGSDI+WVQC+PC+QCY QSDPVF+PADS+SFSGVSC+
Sbjct: 133 SGEYFVRIGVGSPPRNQYVVMDSGSDIIWVQCEPCTQCYHQSDPVFNPADSSSFSGVSCA 192

Query: 213 SAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMF 272
           S VC  ++NA CH GRCRYEVSYGDGSYTKGTLALET+T GRT+++NVAIGCGH NQGMF
Sbjct: 193 STVCSHVDNAACHEGRCRYEVSYGDGSYTKGTLALETITFGRTLIRNVAIGCGHHNQGMF 252

Query: 273 VGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLV 332
           VGAAGLLGLGGG MS VGQLGGQTGGAFSYCLVSRG  SSG L FGREA+PVGAAWVPL+
Sbjct: 253 VGAAGLLGLGGGPMSFVGQLGGQTGGAFSYCLVSRGIESSGLLEFGREAMPVGAAWVPLI 312

Query: 333 RNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFR 392
            NPRA SFYY+GLSGLGVGG+R+ ISED+F+L+++GD GVVMDTGTAVTRLPT AYEAFR
Sbjct: 313 HNPRAQSFYYIGLSGLGVGGLRVSISEDVFKLSELGDGGVVMDTGTAVTRLPTVAYEAFR 372

Query: 393 DAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDA 452
           D F+AQT NLPRASGVSIFDTCY+L GFVSVRVPTVSFYFSGGP+LTLPA NFLIPVDD 
Sbjct: 373 DGFIAQTTNLPRASGVSIFDTCYDLFGFVSVRVPTVSFYFSGGPILTLPARNFLIPVDDV 432

Query: 453 GTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           GTFCFAFAPS SGLSIIGNIQQEGIQIS DGANGFVGFGPNVC
Sbjct: 433 GTFCFAFAPSSSGLSIIGNIQQEGIQISVDGANGFVGFGPNVC 475


>gi|224093400|ref|XP_002309912.1| predicted protein [Populus trichocarpa]
 gi|222852815|gb|EEE90362.1| predicted protein [Populus trichocarpa]
          Length = 382

 Score =  638 bits (1645), Expect = e-180,   Method: Compositional matrix adjust.
 Identities = 316/384 (82%), Positives = 352/384 (91%), Gaps = 2/384 (0%)

Query: 112 MQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYM 171
           M RDVKRVA+L+ RLS G   AAK+EV+DFG+DVVSGM+QGSGEYFVRIG+GSPPRSQYM
Sbjct: 1   MHRDVKRVASLIHRLSSG--SAAKYEVEDFGSDVVSGMNQGSGEYFVRIGLGSPPRSQYM 58

Query: 172 VIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCRY 231
           VIDSGSDIVWVQC+PC+QCY Q+DP+FDPADSASF GVSCSSAVCDR+ENAGC++GRCRY
Sbjct: 59  VIDSGSDIVWVQCKPCTQCYHQTDPLFDPADSASFMGVSCSSAVCDRVENAGCNSGRCRY 118

Query: 232 EVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQ 291
           EVSYGDGSYTKGTLALETLT GRTVV+NVAIGCGH N+GMFVGAAGLLGLGGGSMS +GQ
Sbjct: 119 EVSYGDGSYTKGTLALETLTFGRTVVRNVAIGCGHSNRGMFVGAAGLLGLGGGSMSFMGQ 178

Query: 292 LGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVG 351
           L GQTG AFSYCLVSRGT ++G L FG EA+PVGAAW+PLVRNPRAPSFYY+ L GLGVG
Sbjct: 179 LSGQTGNAFSYCLVSRGTNTNGFLEFGSEAMPVGAAWIPLVRNPRAPSFYYIRLLGLGVG 238

Query: 352 GMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIF 411
             R+P+SED+F+L ++G  GVVMDTGTAVTR PT AYEAFR+AF+ QT NLPRASGVSIF
Sbjct: 239 DTRVPVSEDVFQLNELGSGGVVMDTGTAVTRFPTVAYEAFRNAFIEQTQNLPRASGVSIF 298

Query: 412 DTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGN 471
           DTCYNL GF+SVRVPTVSFYFSGGP+LT+PA+NFLIPVDDAGTFCFAFAPSPSGLSI+GN
Sbjct: 299 DTCYNLFGFLSVRVPTVSFYFSGGPILTIPANNFLIPVDDAGTFCFAFAPSPSGLSILGN 358

Query: 472 IQQEGIQISFDGANGFVGFGPNVC 495
           IQQEGIQIS D AN FVGFGPN+C
Sbjct: 359 IQQEGIQISVDEANEFVGFGPNIC 382


>gi|224080963|ref|XP_002306246.1| predicted protein [Populus trichocarpa]
 gi|222855695|gb|EEE93242.1| predicted protein [Populus trichocarpa]
          Length = 382

 Score =  633 bits (1632), Expect = e-179,   Method: Compositional matrix adjust.
 Identities = 315/384 (82%), Positives = 352/384 (91%), Gaps = 2/384 (0%)

Query: 112 MQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYM 171
           MQRDVKRV +L+RR+S G    A + V+DFG++VVSGMDQGSGEYFVRIGVGSPPRSQYM
Sbjct: 1   MQRDVKRVVSLIRRVSSG--STASYGVEDFGSEVVSGMDQGSGEYFVRIGVGSPPRSQYM 58

Query: 172 VIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCRY 231
           VIDSGSDIVWVQC+PC+QCY Q+DP+FDPADSASF GVSCSSAVCD+++NAGC++GRCRY
Sbjct: 59  VIDSGSDIVWVQCKPCTQCYHQTDPLFDPADSASFMGVSCSSAVCDQVDNAGCNSGRCRY 118

Query: 232 EVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQ 291
           EVSYGDGS TKGTLALETLT+GRTVV+NVAIGCGH NQGMFVGAAGLLGLGGGSMS VGQ
Sbjct: 119 EVSYGDGSSTKGTLALETLTLGRTVVQNVAIGCGHMNQGMFVGAAGLLGLGGGSMSFVGQ 178

Query: 292 LGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVG 351
           L  + G AFSYCLVSR T S+G L FG EA+PVGAAW+PL+RNP +PS+YY+GLSGLGVG
Sbjct: 179 LSRERGNAFSYCLVSRVTNSNGFLEFGSEAMPVGAAWIPLIRNPHSPSYYYIGLSGLGVG 238

Query: 352 GMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIF 411
            M++PISED+F LT++G+ GVVMDTGTAVTR PT AYEAFRDAF+ QTGNLPRASGVSIF
Sbjct: 239 DMKVPISEDIFELTELGNGGVVMDTGTAVTRFPTVAYEAFRDAFIDQTGNLPRASGVSIF 298

Query: 412 DTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGN 471
           DTCYNL GF+SVRVPTVSFYFSGGP+LTLPA+NFLIPVDDAGTFCFAFAPSPSGLSI+GN
Sbjct: 299 DTCYNLFGFLSVRVPTVSFYFSGGPILTLPANNFLIPVDDAGTFCFAFAPSPSGLSILGN 358

Query: 472 IQQEGIQISFDGANGFVGFGPNVC 495
           IQQEGIQIS DGAN FVGFGPNVC
Sbjct: 359 IQQEGIQISVDGANEFVGFGPNVC 382


>gi|297834938|ref|XP_002885351.1| pepsin A [Arabidopsis lyrata subsp. lyrata]
 gi|297331191|gb|EFH61610.1| pepsin A [Arabidopsis lyrata subsp. lyrata]
          Length = 471

 Score =  630 bits (1626), Expect = e-178,   Method: Compositional matrix adjust.
 Identities = 326/440 (74%), Positives = 373/440 (84%), Gaps = 12/440 (2%)

Query: 59  ELFERHNNISSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKR 118
           E     NN   S+ S+  +++ L L+HRD+  S       + Y  H H  HARM+RD  R
Sbjct: 41  ETLPDFNNTHFSDDSN--SKYTLRLLHRDRFPS-------VTYRNHHHRLHARMRRDTDR 91

Query: 119 VATLVRRLSGG---GADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDS 175
           V+ ++RR+SG     +  +++EV DFG+DVVSGMDQGSGEYFVRIGVGSPPR QYMVIDS
Sbjct: 92  VSAILRRISGKVVVASSDSRYEVNDFGSDVVSGMDQGSGEYFVRIGVGSPPRDQYMVIDS 151

Query: 176 GSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSY 235
           GSD+VWVQCQPC  CYKQSDPVFDPA S S++GVSC S+VCDR+EN+GCH+G CRYEV Y
Sbjct: 152 GSDMVWVQCQPCKLCYKQSDPVFDPAKSGSYTGVSCGSSVCDRIENSGCHSGGCRYEVMY 211

Query: 236 GDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQ 295
           GDGSYTKGTLALETLT  +TVV+NVA+GCGH+N+GMF+GAAGLLG+GGGSMS VGQL GQ
Sbjct: 212 GDGSYTKGTLALETLTFAKTVVRNVAMGCGHRNRGMFIGAAGLLGIGGGSMSFVGQLSGQ 271

Query: 296 TGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRI 355
           TGGAF YCLVSRGT S+GSLVFGREALPVGA+WVPLVRNPRAPSFYYVGL GLGVGG+RI
Sbjct: 272 TGGAFGYCLVSRGTDSTGSLVFGREALPVGASWVPLVRNPRAPSFYYVGLKGLGVGGVRI 331

Query: 356 PISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCY 415
           P+ + +F LT+ GD GVVMDTGTAVTRLPT AY AFRD F +QT NLPRASGVSIFDTCY
Sbjct: 332 PLPDGVFDLTETGDGGVVMDTGTAVTRLPTGAYAAFRDGFKSQTANLPRASGVSIFDTCY 391

Query: 416 NLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQE 475
           +LSGFVSVRVPTVSFYF+ GPVLTLPA NFL+PVDD+GT+CFAFA SP+GLSIIGNIQQE
Sbjct: 392 DLSGFVSVRVPTVSFYFTEGPVLTLPARNFLMPVDDSGTYCFAFAASPTGLSIIGNIQQE 451

Query: 476 GIQISFDGANGFVGFGPNVC 495
           GIQ+SFDGANGFVGFGPNVC
Sbjct: 452 GIQVSFDGANGFVGFGPNVC 471


>gi|79407941|ref|NP_188636.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|75273243|sp|Q9LHE3.1|ASPG2_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 2;
           Short=AtASPG2; Flags: Precursor
 gi|11994777|dbj|BAB03167.1| nucleoid chloroplast DNA-binding protein-like [Arabidopsis
           thaliana]
 gi|28392860|gb|AAO41867.1| unknown protein [Arabidopsis thaliana]
 gi|332642798|gb|AEE76319.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 470

 Score =  630 bits (1625), Expect = e-178,   Method: Compositional matrix adjust.
 Identities = 325/433 (75%), Positives = 372/433 (85%), Gaps = 11/433 (2%)

Query: 65  NNISSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVR 124
           NN   S+ SS  +++ L L+HRD+  S       + Y  H H  HARM+RD  RV+ ++R
Sbjct: 47  NNTHFSDESS--SKYTLRLLHRDRFPS-------VTYRNHHHRLHARMRRDTDRVSAILR 97

Query: 125 RLSGG--GADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWV 182
           R+SG    +  +++EV DFG+D+VSGMDQGSGEYFVRIGVGSPPR QYMVIDSGSD+VWV
Sbjct: 98  RISGKVIPSSDSRYEVNDFGSDIVSGMDQGSGEYFVRIGVGSPPRDQYMVIDSGSDMVWV 157

Query: 183 QCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTK 242
           QCQPC  CYKQSDPVFDPA S S++GVSC S+VCDR+EN+GCH+G CRYEV YGDGSYTK
Sbjct: 158 QCQPCKLCYKQSDPVFDPAKSGSYTGVSCGSSVCDRIENSGCHSGGCRYEVMYGDGSYTK 217

Query: 243 GTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSY 302
           GTLALETLT  +TVV+NVA+GCGH+N+GMF+GAAGLLG+GGGSMS VGQL GQTGGAF Y
Sbjct: 218 GTLALETLTFAKTVVRNVAMGCGHRNRGMFIGAAGLLGIGGGSMSFVGQLSGQTGGAFGY 277

Query: 303 CLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLF 362
           CLVSRGT S+GSLVFGREALPVGA+WVPLVRNPRAPSFYYVGL GLGVGG+RIP+ + +F
Sbjct: 278 CLVSRGTDSTGSLVFGREALPVGASWVPLVRNPRAPSFYYVGLKGLGVGGVRIPLPDGVF 337

Query: 363 RLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVS 422
            LT+ GD GVVMDTGTAVTRLPT AY AFRD F +QT NLPRASGVSIFDTCY+LSGFVS
Sbjct: 338 DLTETGDGGVVMDTGTAVTRLPTAAYVAFRDGFKSQTANLPRASGVSIFDTCYDLSGFVS 397

Query: 423 VRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFD 482
           VRVPTVSFYF+ GPVLTLPA NFL+PVDD+GT+CFAFA SP+GLSIIGNIQQEGIQ+SFD
Sbjct: 398 VRVPTVSFYFTEGPVLTLPARNFLMPVDDSGTYCFAFAASPTGLSIIGNIQQEGIQVSFD 457

Query: 483 GANGFVGFGPNVC 495
           GANGFVGFGPNVC
Sbjct: 458 GANGFVGFGPNVC 470


>gi|449464952|ref|XP_004150193.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
 gi|449526850|ref|XP_004170426.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
          Length = 476

 Score =  618 bits (1594), Expect = e-174,   Method: Compositional matrix adjust.
 Identities = 327/467 (70%), Positives = 384/467 (82%), Gaps = 12/467 (2%)

Query: 29  AASDTHFQILNVNESIKGSRTDHAKMSQYNELFERHNNISSSNTSSDEARWNLELVHRDK 88
           AA+    Q+LNV ++IK + T  +++ Q  EL E +      N SS +++W L+L HRDK
Sbjct: 22  AATYPATQLLNVKDTIKEAETAPSRLPQDLELHENYPIFELDNNSS-QSQWKLKLFHRDK 80

Query: 89  MSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSG 148
           +  + +         H   F  R+ RD KRV++L+R L    +  +  +V DFG+DVVSG
Sbjct: 81  LPLNFDPD-------HPRRFKERISRDSKRVSSLLRLL----SSGSDEQVTDFGSDVVSG 129

Query: 149 MDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSG 208
            +QGSGEYFVRIGVGSPPRSQY+VIDSGSDIVWVQCQPCS+CY+QSDPVFDPA SA+++G
Sbjct: 130 TEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECYQQSDPVFDPAGSATYAG 189

Query: 209 VSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKN 268
           +SC S+VCDRL+NAGC+ GRCRYEVSYGDGSYT+GTLALETLT GR +++N+AIGCGH N
Sbjct: 190 ISCDSSVCDRLDNAGCNDGRCRYEVSYGDGSYTRGTLALETLTFGRVLIRNIAIGCGHMN 249

Query: 269 QGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAW 328
           +GMF+GAAGLLGLGGG+MS VGQLGGQTGGAFSYCLVSRGT S+G+L FGR A+PVGAAW
Sbjct: 250 RGMFIGAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVSRGTESTGTLEFGRGAMPVGAAW 309

Query: 329 VPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAY 388
           VPL+RNPRAPSFYYVGLSGLGVGG+R+PI E +F LT +G  GVVMDTGTAVTRLP PAY
Sbjct: 310 VPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLGYGGVVMDTGTAVTRLPAPAY 369

Query: 389 EAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIP 448
           EAFRD F+ QT NLPR+  VSIFDTCYNL+GFVSVRVPTVSFYFSGGP+LTLPA NFLIP
Sbjct: 370 EAFRDTFIGQTANLPRSDRVSIFDTCYNLNGFVSVRVPTVSFYFSGGPILTLPARNFLIP 429

Query: 449 VDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           VD  GTFCFAFA S SGLSIIGNIQQEGIQIS DG+NGFVGFGP +C
Sbjct: 430 VDGEGTFCFAFAASASGLSIIGNIQQEGIQISIDGSNGFVGFGPTIC 476


>gi|357439021|ref|XP_003589787.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355478835|gb|AES60038.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 456

 Score =  566 bits (1458), Expect = e-158,   Method: Compositional matrix adjust.
 Identities = 291/466 (62%), Positives = 345/466 (74%), Gaps = 39/466 (8%)

Query: 33  THFQILNVNESIKGSRTDHAKMSQYNELFERHNNISSSNTSSDEARWNLELVHRDKMSSS 92
           ++FQ LNV  +I  ++    K   +N               + + +W  +L HRD +   
Sbjct: 27  SYFQHLNVENAISETKLKPLKQQNHN---------------TQQPQWKTKLFHRDNI--- 68

Query: 93  SNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQ--DFGTDVVSGMD 150
                N+    H+  F +R+ RD+KRV  L+ RL+    +          FG+DVVSG +
Sbjct: 69  -----NLKKTTHKTRFISRINRDIKRVTFLLNRLNKNTQEQQTTTATEASFGSDVVSGTE 123

Query: 151 QGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVS 210
           +GSGEYFVRIG+GSP   QYMVIDSGSDIVW+QC+PC QCY Q+DP+F+PA SASF GV+
Sbjct: 124 EGSGEYFVRIGIGSPAIYQYMVIDSGSDIVWIQCEPCDQCYNQTDPIFNPATSASFIGVA 183

Query: 211 CSSAVCDRLEN-AGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQ 269
           CSS VC++L++   C  GRC Y+V+YGDGSYTKGTLALET+TIGRTV+++ AIGCGH N+
Sbjct: 184 CSSNVCNQLDDDVACRKGRCGYQVAYGDGSYTKGTLALETITIGRTVIQDTAIGCGHWNE 243

Query: 270 GMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWV 329
           GMFVGAAGLLGLGGG MS VGQLG QTGGAF YCLVSR             A+PVGA WV
Sbjct: 244 GMFVGAAGLLGLGGGPMSFVGQLGAQTGGAFGYCLVSR-------------AMPVGAMWV 290

Query: 330 PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYE 389
           PL+ NP  PSFYYV LSGL VGG+R+PISE +F+LT +G  GVVMDTGTA+TRLPT AY 
Sbjct: 291 PLIHNPFYPSFYYVSLSGLAVGGIRVPISEQIFQLTDIGTGGVVMDTGTAITRLPTVAYN 350

Query: 390 AFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPV 449
           AFRDAF+AQT NLPRA GVSIFDTCY+L+GFV+VRVPTVSFYFSGG +LT PA NFLIP 
Sbjct: 351 AFRDAFIAQTTNLPRAPGVSIFDTCYDLNGFVTVRVPTVSFYFSGGQILTFPARNFLIPA 410

Query: 450 DDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           DD GTFCFAFAPSPSGLSIIGNIQQEGIQ+S DG NGFVGFGPNVC
Sbjct: 411 DDVGTFCFAFAPSPSGLSIIGNIQQEGIQVSIDGTNGFVGFGPNVC 456


>gi|414584783|tpg|DAA35354.1| TPA: hypothetical protein ZEAMMB73_186928 [Zea mays]
          Length = 464

 Score =  516 bits (1328), Expect = e-143,   Method: Compositional matrix adjust.
 Identities = 274/461 (59%), Positives = 328/461 (71%), Gaps = 32/461 (6%)

Query: 51  HAKMSQYNELFERHNNISSSNTS-----------SDEARWNLELVHRDKMSSSSNTTNNM 99
           HA   ++  +  RHN  +   ++           S + R +  LV RD ++ S+      
Sbjct: 20  HASSLRFQYIDRRHNFTAKQASTSSSSSPSSAHGSRDRRPSFALVRRDAVTGST------ 73

Query: 100 HYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFG---TDVVSGMDQGSGEY 156
            Y   +H+    + RD  R   L  RLS      A ++   F    + VVSG+D+GSGEY
Sbjct: 74  -YPSRRHAVLDLVARDNARAEYLASRLS-----PAAYQPTGFSGSESKVVSGLDEGSGEY 127

Query: 157 FVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVC 216
           FVR+G+GSPP  QY+V+DSGSD++WVQC+PC +CY Q+DP+FDPA SA+FS V C SAVC
Sbjct: 128 FVRVGIGSPPTEQYLVVDSGSDVIWVQCKPCLECYAQADPLFDPATSATFSAVPCGSAVC 187

Query: 217 DRLENAGC-HAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGA 275
             L  +GC  +G C YEVSYGDGSYTKG LALETLT+G T V+ VAIGCGH+N+G+FVGA
Sbjct: 188 RTLRTSGCGDSGGCDYEVSYGDGSYTKGALALETLTLGGTAVEGVAIGCGHRNRGLFVGA 247

Query: 276 AGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGR-EALPVGAAWVPLVRN 334
           AGLLGLG G MSLVGQLGG  GGAFSYCL SRG   +GSLV GR EA+P GA WVPLVRN
Sbjct: 248 AGLLGLGWGPMSLVGQLGGAAGGAFSYCLASRG---AGSLVLGRSEAVPEGAVWVPLVRN 304

Query: 335 PRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDA 394
           P+APSFYYVGLSG+GVG  R+P+ EDLF+LT+ G  GVVMDTGTAVTRLP  AY A RDA
Sbjct: 305 PQAPSFYYVGLSGIGVGDERLPLQEDLFQLTEDGAGGVVMDTGTAVTRLPQEAYAALRDA 364

Query: 395 FVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGT 454
           FVA  G LPRA GVS+ DTCY+LSG+ SVRVPTVSFYF G   LTLPA N L+ V D G 
Sbjct: 365 FVAAVGALPRAPGVSLLDTCYDLSGYTSVRVPTVSFYFDGAATLTLPARNLLLEV-DGGI 423

Query: 455 FCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           +C AFAPS SG SI+GNIQQEGIQI+ D ANG++GFGP  C
Sbjct: 424 YCLAFAPSSSGPSILGNIQQEGIQITVDSANGYIGFGPTTC 464


>gi|125592062|gb|EAZ32412.1| hypothetical protein OsJ_16623 [Oryza sativa Japonica Group]
          Length = 473

 Score =  514 bits (1325), Expect = e-143,   Method: Compositional matrix adjust.
 Identities = 279/489 (57%), Positives = 336/489 (68%), Gaps = 28/489 (5%)

Query: 12  QVLLLHLLCSIITTSTSAASDTHFQILNVNESIKGSRTDHAKMSQYNELFERHNNISSSN 71
           +V+L  L  S      S AS   F  +N +     + T  A  S        H + +++N
Sbjct: 8   KVILFLLFVSTSVLIVSPASPPRFHYINPH-----NFTTPASSSSSASASAVHRSRNNNN 62

Query: 72  TSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGA 131
            S       L LVHRD +S ++       Y   +H     + RD  RV  L +RL    A
Sbjct: 63  PS-------LSLVHRDAISGAT-------YPSRRHQVVGLVARDNARVEHLEKRLV---A 105

Query: 132 DAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCY 191
             + +  +D  ++VV G+D GSGEYFVR+GVGSPP  QY+V+DSGSD++WVQC+PC QCY
Sbjct: 106 STSPYLPEDLVSEVVPGVDDGSGEYFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQCY 165

Query: 192 KQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAG----RCRYEVSYGDGSYTKGTLAL 247
            Q+DP+FDPA S+SFSGVSC SA+C  L   GC  G    +C Y V+YGDGSYTKG LAL
Sbjct: 166 AQTDPLFDPAASSSFSGVSCGSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELAL 225

Query: 248 ETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR 307
           ETLT+G T V+ VAIGCGH+N G+FVGAAGLLGLG G+MSLVGQLGG  GG FSYCL SR
Sbjct: 226 ETLTLGGTAVQGVAIGCGHRNSGLFVGAAGLLGLGWGAMSLVGQLGGAAGGVFSYCLASR 285

Query: 308 GTGSSGSLVFGR-EALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQ 366
           G G +GSLV GR EA+PVGA WVPLVRN +A SFYYVGL+G+GVGG R+P+ + LF+LT+
Sbjct: 286 GAGGAGSLVLGRTEAVPVGAVWVPLVRNNQASSFYYVGLTGIGVGGERLPLQDSLFQLTE 345

Query: 367 MGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVP 426
            G  GVVMDTGTAVTRLP  AY A R AF    G LPR+  VS+ DTCY+LSG+ SVRVP
Sbjct: 346 DGAGGVVMDTGTAVTRLPREAYAALRGAFDGAMGALPRSPAVSLLDTCYDLSGYASVRVP 405

Query: 427 TVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANG 486
           TVSFYF  G VLTLPA N L+ V  A  FC AFAPS SG+SI+GNIQQEGIQI+ D ANG
Sbjct: 406 TVSFYFDQGAVLTLPARNLLVEVGGA-VFCLAFAPSSSGISILGNIQQEGIQITVDSANG 464

Query: 487 FVGFGPNVC 495
           +VGFGPN C
Sbjct: 465 YVGFGPNTC 473


>gi|90399033|emb|CAJ86229.1| H0402C08.5 [Oryza sativa Indica Group]
 gi|125550227|gb|EAY96049.1| hypothetical protein OsI_17922 [Oryza sativa Indica Group]
          Length = 473

 Score =  514 bits (1325), Expect = e-143,   Method: Compositional matrix adjust.
 Identities = 278/489 (56%), Positives = 336/489 (68%), Gaps = 28/489 (5%)

Query: 12  QVLLLHLLCSIITTSTSAASDTHFQILNVNESIKGSRTDHAKMSQYNELFERHNNISSSN 71
           +V+L  L  S      S AS   F  +N +     + T  A  S        H + +++N
Sbjct: 8   KVILFLLFVSTSVLIVSPASPPRFHYINPH-----NFTTPASSSSSASASAVHRSRNNNN 62

Query: 72  TSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGA 131
            S       L LVHRD +S ++       Y   +H     + RD  RV  L +RL    A
Sbjct: 63  PS-------LSLVHRDAISGAT-------YPSRRHQVVGLVARDNARVEHLEKRLV---A 105

Query: 132 DAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCY 191
             + +  +D  ++VV G+D GSGEYFVR+GVGSPP  QY+V+DSGSD++WVQC+PC QCY
Sbjct: 106 STSPYLPEDLVSEVVPGVDDGSGEYFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQCY 165

Query: 192 KQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAG----RCRYEVSYGDGSYTKGTLAL 247
            Q+DP+FDPA S+SFSGVSC SA+C  L   GC  G    +C Y V+YGDGSYTKG LAL
Sbjct: 166 AQTDPLFDPAASSSFSGVSCGSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELAL 225

Query: 248 ETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR 307
           ETLT+G T V+ VAIGCGH+N G+FVGAAGLLGLG G+MSL+GQLGG  GG FSYCL SR
Sbjct: 226 ETLTLGGTAVQGVAIGCGHRNSGLFVGAAGLLGLGWGAMSLIGQLGGAAGGVFSYCLASR 285

Query: 308 GTGSSGSLVFGR-EALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQ 366
           G G +GSLV GR EA+PVGA WVPLVRN +A SFYYVGL+G+GVGG R+P+ + LF+LT+
Sbjct: 286 GAGGAGSLVLGRTEAVPVGAVWVPLVRNNQASSFYYVGLTGIGVGGERLPLQDGLFQLTE 345

Query: 367 MGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVP 426
            G  GVVMDTGTAVTRLP  AY A R AF    G LPR+  VS+ DTCY+LSG+ SVRVP
Sbjct: 346 DGAGGVVMDTGTAVTRLPREAYAALRGAFDGAMGALPRSPAVSLLDTCYDLSGYASVRVP 405

Query: 427 TVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANG 486
           TVSFYF  G VLTLPA N L+ V  A  FC AFAPS SG+SI+GNIQQEGIQI+ D ANG
Sbjct: 406 TVSFYFDQGAVLTLPARNLLVEVGGA-VFCLAFAPSSSGISILGNIQQEGIQITVDSANG 464

Query: 487 FVGFGPNVC 495
           +VGFGPN C
Sbjct: 465 YVGFGPNTC 473


>gi|242077672|ref|XP_002448772.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
 gi|241939955|gb|EES13100.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
          Length = 471

 Score =  511 bits (1317), Expect = e-142,   Method: Compositional matrix adjust.
 Identities = 267/432 (61%), Positives = 319/432 (73%), Gaps = 23/432 (5%)

Query: 74  SDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADA 133
           S + R +  LV RD ++ ++       Y   +H+    + RD  R   L  RLS      
Sbjct: 53  SRDRRPSFALVRRDAVTGAT-------YPSPRHAVLDLVSRDNARAEYLASRLS-----P 100

Query: 134 AKHEVQDFGTD--VVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCY 191
           A      FG++  VVSG+D+GSGEYFVR+G+GSPP  QY+V+DSGSD++WVQC+PC +CY
Sbjct: 101 AYQPTDFFGSESKVVSGLDEGSGEYFVRVGIGSPPTEQYLVVDSGSDVIWVQCKPCLECY 160

Query: 192 KQSDPVFDPADSASFSGVSCSSAVCDRLENAGC-HAGRCRYEVSYGDGSYTKGTLALETL 250
            Q+DP+FDPA SA+FS VSC SA+C  L  +GC  +G C YEVSYGDGSYTKGTLALETL
Sbjct: 161 AQADPLFDPASSATFSAVSCGSAICRTLRTSGCGDSGGCEYEVSYGDGSYTKGTLALETL 220

Query: 251 TIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRG-- 308
           T+G T V+ VAIGCGH+N+G+FVGAAGLLGLG G MSLVGQLGG  GGAFSYCL SRG  
Sbjct: 221 TLGGTAVEGVAIGCGHRNRGLFVGAAGLLGLGWGPMSLVGQLGGAAGGAFSYCLASRGGS 280

Query: 309 ----TGSSGSLVFGR-EALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFR 363
                 ++GSLV GR EA+P GA WVPLVRNP+APSFYYVG+SG+GVG  R+P+ + LF+
Sbjct: 281 GSGAADAAGSLVLGRSEAVPEGAVWVPLVRNPQAPSFYYVGVSGIGVGDERLPLQDGLFQ 340

Query: 364 LTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSV 423
           LT+ G  GVVMDTGTAVTRLP  AY A RDAFV   G LPRA GVS+ DTCY+LSG+ SV
Sbjct: 341 LTEDGGGGVVMDTGTAVTRLPQEAYAALRDAFVGAVGALPRAPGVSLLDTCYDLSGYTSV 400

Query: 424 RVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDG 483
           RVPTVSFYF G   LTLPA N L+ V D G +C AFAPS SGLSI+GNIQQEGIQI+ D 
Sbjct: 401 RVPTVSFYFDGAATLTLPARNLLLEV-DGGIYCLAFAPSSSGLSILGNIQQEGIQITVDS 459

Query: 484 ANGFVGFGPNVC 495
           ANG++GFGP  C
Sbjct: 460 ANGYIGFGPATC 471


>gi|297603570|ref|NP_001054261.2| Os04g0677100 [Oryza sativa Japonica Group]
 gi|255675885|dbj|BAF16175.2| Os04g0677100 [Oryza sativa Japonica Group]
          Length = 464

 Score =  489 bits (1260), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 271/489 (55%), Positives = 327/489 (66%), Gaps = 37/489 (7%)

Query: 12  QVLLLHLLCSIITTSTSAASDTHFQILNVNESIKGSRTDHAKMSQYNELFERHNNISSSN 71
           +V+L  L  S      S AS   F  +N +     + T  A  S        H + +++N
Sbjct: 8   KVILFLLFVSTSVLIVSPASPPRFHYINPH-----NFTTPASSSSSASASAVHRSRNNNN 62

Query: 72  TSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGA 131
            S       L LVHRD +S ++       Y   +H     + RD  RV  L +RL    A
Sbjct: 63  PS-------LSLVHRDAISGAT-------YPSRRHQVVGLVARDNARVEHLEKRLV---A 105

Query: 132 DAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCY 191
             + +  +D  ++VV G+D GSGEYFVR+GVGSPP  QY+V+DSGSD++WVQC+PC QCY
Sbjct: 106 STSPYLPEDLVSEVVPGVDDGSGEYFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQCY 165

Query: 192 KQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAG----RCRYEVSYGDGSYTKGTLAL 247
            Q+DP+FDPA S+SFSGVSC SA+C  L   GC  G    +C Y V+YGDGSYTKG LAL
Sbjct: 166 AQTDPLFDPAASSSFSGVSCGSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELAL 225

Query: 248 ETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR 307
           ETLT+G T V+ VAIGCGH+N G+FVGAAGLLGLG G+MSLVGQLGG  GG FSYCL SR
Sbjct: 226 ETLTLGGTAVQGVAIGCGHRNSGLFVGAAGLLGLGWGAMSLVGQLGGAAGGVFSYCLASR 285

Query: 308 GTGSSGSLVFGR-EALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQ 366
           G G +GSLV GR EA+P         R  RA SFYYVGL+G+GVGG R+P+ + LF+LT+
Sbjct: 286 GAGGAGSLVLGRTEAVP---------RGRRASSFYYVGLTGIGVGGERLPLQDSLFQLTE 336

Query: 367 MGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVP 426
            G  GVVMDTGTAVTRLP  AY A R AF    G LPR+  VS+ DTCY+LSG+ SVRVP
Sbjct: 337 DGAGGVVMDTGTAVTRLPREAYAALRGAFDGAMGALPRSPAVSLLDTCYDLSGYASVRVP 396

Query: 427 TVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANG 486
           TVSFYF  G VLTLPA N L+ V  A  FC AFAPS SG+SI+GNIQQEGIQI+ D ANG
Sbjct: 397 TVSFYFDQGAVLTLPARNLLVEVGGA-VFCLAFAPSSSGISILGNIQQEGIQITVDSANG 455

Query: 487 FVGFGPNVC 495
           +VGFGPN C
Sbjct: 456 YVGFGPNTC 464


>gi|414584780|tpg|DAA35351.1| TPA: hypothetical protein ZEAMMB73_696016 [Zea mays]
          Length = 524

 Score =  477 bits (1228), Expect = e-132,   Method: Compositional matrix adjust.
 Identities = 256/439 (58%), Positives = 307/439 (69%), Gaps = 31/439 (7%)

Query: 74  SDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADA 133
           S ++R +L LV RD+++ S+       Y   +H+    + RD  R   L  RLS      
Sbjct: 99  SRDSRPSLALVRRDEVTGST-------YPSLRHAVLDLVARDNARAEYLATRLS------ 145

Query: 134 AKHEVQDFG---TDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQC 190
             ++   F    + VVSG+D+GSGEY VR+ VGSPP  QY+V+DSGSD++WVQC+PC +C
Sbjct: 146 PAYQPPGFSGSESKVVSGLDEGSGEYLVRVSVGSPPTEQYLVVDSGSDVMWVQCKPCLEC 205

Query: 191 YKQSDPVFDPADSASFSGVSCSSAVCDRLENAGC---HAGRCRYEVSYGDGSYTKGTLAL 247
           Y Q+DP+FDPA SA+FSGVSC SA+C  L  + C     G C YEVSY DGSYTKG LAL
Sbjct: 206 YVQADPLFDPATSATFSGVSCGSAICRILPTSACGDGELGGCEYEVSYADGSYTKGALAL 265

Query: 248 ETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR 307
           ETLT+G T V+ V IGCGH+N+G+FVGAAGL+GLG G MSLVGQLGG+ GGAFSYCL SR
Sbjct: 266 ETLTLGGTAVEGVVIGCGHRNRGLFVGAAGLMGLGWGPMSLVGQLGGEVGGAFSYCLASR 325

Query: 308 GTGSSGS-------LVFGR-EALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISE 359
           G   SG+       LV GR EA+P GA WVPLVRNPRAPSFYYVGLSG+ VG  R+P+  
Sbjct: 326 GGYGSGAADDDAGWLVLGRSEAVPEGAVWVPLVRNPRAPSFYYVGLSGIEVGDERLPLQA 385

Query: 360 DLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFV-AQTGNLPRASGV--SIFDTCYN 416
            LF+LT+ G   VVMDTGT VTRLP  AY A RDAFV A  G +PRA GV  S+ DTCY+
Sbjct: 386 GLFQLTEDGAGDVVMDTGTTVTRLPQEAYAALRDAFVGALAGAVPRAQGVSSSVLDTCYD 445

Query: 417 LSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEG 476
           LSG+ SVRVPTVSF F G   L L A N L+ V D G +C AFAPS SGLSI+GN QQ G
Sbjct: 446 LSGYASVRVPTVSFCFDGDARLILAARNVLLEV-DMGIYCLAFAPSSSGLSIMGNTQQAG 504

Query: 477 IQISFDGANGFVGFGPNVC 495
           IQI+ D ANG++GFGP  C
Sbjct: 505 IQITVDSANGYIGFGPANC 523


>gi|38344196|emb|CAE05761.2| OSJNBa0064G10.12 [Oryza sativa Japonica Group]
          Length = 451

 Score =  473 bits (1216), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 263/488 (53%), Positives = 318/488 (65%), Gaps = 48/488 (9%)

Query: 12  QVLLLHLLCSIITTSTSAASDTHFQILNVNESIKGSRTDHAKMSQYNELFERHNNISSSN 71
           +V+L  L  S      S AS   F  +N +     + T  A  S        H + +++N
Sbjct: 8   KVILFLLFVSTSVLIVSPASPPRFHYINPH-----NFTTPASSSSSASASAVHRSRNNNN 62

Query: 72  TSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGA 131
            S       L LVHRD +S ++       Y   +H     + RD  RV  L +RL    A
Sbjct: 63  PS-------LSLVHRDAISGAT-------YPSRRHQVVGLVARDNARVEHLEKRLV---A 105

Query: 132 DAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCY 191
             + +  +D  ++VV G+D GSGEYFVR+GVGSPP  QY+V+DSGSD++WVQC+PC QCY
Sbjct: 106 STSPYLPEDLVSEVVPGVDDGSGEYFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQCY 165

Query: 192 KQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAG----RCRYEVSYGDGSYTKGTLAL 247
            Q+DP+FDPA S+SFSGVSC SA+C  L   GC  G    +C Y V+YGDGSYTKG LAL
Sbjct: 166 AQTDPLFDPAASSSFSGVSCGSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELAL 225

Query: 248 ETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR 307
           ETLT+G T V+ VAIGCGH+N G+FVGAAGLLGLG G+MSLVGQLGG  GG FSYCL SR
Sbjct: 226 ETLTLGGTAVQGVAIGCGHRNSGLFVGAAGLLGLGWGAMSLVGQLGGAAGGVFSYCLASR 285

Query: 308 GTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQM 367
           G G +GSL                     A SFYYVGL+G+GVGG R+P+ + LF+LT+ 
Sbjct: 286 GAGGAGSL---------------------ASSFYYVGLTGIGVGGERLPLQDSLFQLTED 324

Query: 368 GDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPT 427
           G  GVVMDTGTAVTRLP  AY A R AF    G LPR+  VS+ DTCY+LSG+ SVRVPT
Sbjct: 325 GAGGVVMDTGTAVTRLPREAYAALRGAFDGAMGALPRSPAVSLLDTCYDLSGYASVRVPT 384

Query: 428 VSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGF 487
           VSFYF  G VLTLPA N L+ V  A  FC AFAPS SG+SI+GNIQQEGIQI+ D ANG+
Sbjct: 385 VSFYFDQGAVLTLPARNLLVEVGGA-VFCLAFAPSSSGISILGNIQQEGIQITVDSANGY 443

Query: 488 VGFGPNVC 495
           VGFGPN C
Sbjct: 444 VGFGPNTC 451


>gi|326530426|dbj|BAJ97639.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 479

 Score =  461 bits (1185), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 257/428 (60%), Positives = 317/428 (74%), Gaps = 22/428 (5%)

Query: 76  EARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAK 135
           + R +L L+HRD +S  +       Y   +H+      RD  RV  L RRLS        
Sbjct: 66  DGRPSLALLHRDAVSGRT-------YPSTRHAMLGLAARDGARVEYLQRRLS------PT 112

Query: 136 HEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD 195
               + G++VVSG+ +GSGEYFVR+GVGSPP  QY+V+DSGSD++W+QC+PC++CY+Q+D
Sbjct: 113 TMTTEVGSEVVSGISEGSGEYFVRVGVGSPPTEQYLVVDSGSDVIWIQCRPCAECYQQAD 172

Query: 196 PVFDPADSASFSGVSCSSAVCDRLE--NAGC-HAGRCRYEVSYGDGSYTKGTLALETLTI 252
           P+FDPA SASF+ V C S VC  L   ++GC  +G CRY+VSYGDGSYT+G LA+ETLT 
Sbjct: 173 PLFDPAASASFTAVPCDSGVCRTLPGGSSGCADSGACRYQVSYGDGSYTQGVLAMETLTF 232

Query: 253 G-RTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGS 311
           G  T V+ VAIGCGH+N+G+FVGAAGLLGLG G MSLVGQLGG  GGAFSYCL SRG  +
Sbjct: 233 GDSTPVQGVAIGCGHRNRGLFVGAAGLLGLGWGPMSLVGQLGGAAGGAFSYCLASRGADA 292

Query: 312 -SGSLVFGR-EALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGD 369
            +GSLVFGR +A+PVGA WVPL+RN + PSFYYVGL+GLGVGG R+P+ + LF LT+ G 
Sbjct: 293 GAGSLVFGRDDAMPVGAVWVPLLRNAQQPSFYYVGLTGLGVGGERLPLQDGLFDLTEDGG 352

Query: 370 DGVVMDTGTAVTRLPTPAYEAFRDAFVAQT-GNLPRASGVSIFDTCYNLSGFVSVRVPTV 428
            GVVMDTGTAVTRLP  AY A RDAF +   G+LPRA GVS+ DTCY+LSG+ SVRVPTV
Sbjct: 353 GGVVMDTGTAVTRLPPDAYAALRDAFASTIGGDLPRAPGVSLLDTCYDLSGYASVRVPTV 412

Query: 429 SFYFS-GGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGF 487
           + YF   G  LTLPA N L+ +   G +C AFA S SGLSI+GNIQQ+GIQI+ D ANG+
Sbjct: 413 ALYFGRDGAALTLPARNLLVEM-GGGVYCLAFAASASGLSILGNIQQQGIQITVDSANGY 471

Query: 488 VGFGPNVC 495
           VGFGP+ C
Sbjct: 472 VGFGPSTC 479


>gi|116787367|gb|ABK24480.1| unknown [Picea sitchensis]
          Length = 496

 Score =  451 bits (1160), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 229/428 (53%), Positives = 298/428 (69%), Gaps = 14/428 (3%)

Query: 79  WNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLS---------GG 129
           W+++LVHRD +           Y R       +++R+  RV  L +R+           G
Sbjct: 71  WSVQLVHRDSLLFKGAANATASYERR---LEEKLRREAARVRALEQRIERKLKLKKDPAG 127

Query: 130 GADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQ 189
             +       +FG++VVSGM+QGSGEYF RIG+G+P R QYMV+D+GSD+VW+QC+PC +
Sbjct: 128 SYENVAGVTAEFGSEVVSGMEQGSGEYFTRIGIGTPTREQYMVLDTGSDVVWIQCEPCRE 187

Query: 190 CYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALET 249
           CY Q+DP+F+P+ S SFS V C SAVC +L+   CH G C YEVSYGDGSYT G+ A ET
Sbjct: 188 CYSQADPIFNPSSSVSFSTVGCDSAVCSQLDANDCHGGGCLYEVSYGDGSYTVGSYATET 247

Query: 250 LTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGT 309
           LT G T ++NVAIGCGH N G+FVGAAGLLGLG GS+S   QLG QTG AFSYCLV R +
Sbjct: 248 LTFGTTSIQNVAIGCGHDNVGLFVGAAGLLGLGAGSLSFPAQLGTQTGRAFSYCLVDRDS 307

Query: 310 GSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRI-PISEDLFRLTQ-M 367
            SSG+L FG E++P+G+ + PLV NP  P+FYY+ +  + VGG+ +  +  + FR+ +  
Sbjct: 308 ESSGTLEFGPESVPIGSIFTPLVANPFLPTFYYLSMVAISVGGVILDSVPSEAFRIDETT 367

Query: 368 GDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPT 427
           G  G+++D+GTAVTRL T AY+A RDAF+A T +LPRA G+SIFDTCY+LS   SV +P 
Sbjct: 368 GRGGIIIDSGTAVTRLQTSAYDALRDAFIAGTQHLPRADGISIFDTCYDLSALQSVSIPA 427

Query: 428 VSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGF 487
           V F+FS G    LPA N LIP+D  GTFCFAFAP+ S LSI+GNIQQ+GI++SFD AN  
Sbjct: 428 VGFHFSNGAGFILPAKNCLIPMDSMGTFCFAFAPADSNLSIMGNIQQQGIRVSFDSANSL 487

Query: 488 VGFGPNVC 495
           VGF  + C
Sbjct: 488 VGFAIDQC 495


>gi|148906646|gb|ABR16474.1| unknown [Picea sitchensis]
          Length = 538

 Score =  446 bits (1147), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 238/443 (53%), Positives = 298/443 (67%), Gaps = 14/443 (3%)

Query: 64  HNNISSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLV 123
            +      T   +  W++++VHRD +           Y R        ++RD +RV  L 
Sbjct: 99  RDEYEKRETKPRQTPWSVQVVHRDSLLVKDAANATASYERR---LEETLRRDARRVRGLE 155

Query: 124 ----RRLSGGGADAAKHE-----VQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVID 174
               +RL      A  HE       +FG +VVSGM QGSGEYF RIGVG+P R QYMV+D
Sbjct: 156 QRIEKRLRLNKDPAGSHENVAEVAAEFGGEVVSGMAQGSGEYFTRIGVGTPMREQYMVLD 215

Query: 175 SGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVS 234
           +GSD+VW+QC+PCS+CY Q DP+F+P+ SASFS + C+SAVC  L+   CH G C Y+VS
Sbjct: 216 TGSDVVWIQCEPCSKCYSQVDPIFNPSLSASFSTLGCNSAVCSYLDAYNCHGGGCLYKVS 275

Query: 235 YGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGG 294
           YGDGSYT G+ A E LT G T V+NVAIGCGH N G+FVGAAGLLGLG G +S   QLG 
Sbjct: 276 YGDGSYTIGSFATEMLTFGTTSVRNVAIGCGHDNAGLFVGAAGLLGLGAGLLSFPSQLGT 335

Query: 295 QTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMR 354
           QTG AFSYCLV R + SSG+L FG E++P+G+   PL+ NP  P+FYYV L  + VGG  
Sbjct: 336 QTGRAFSYCLVDRFSESSGTLEFGPESVPLGSILTPLLTNPSLPTFYYVPLISISVGGAL 395

Query: 355 I-PISEDLFRLTQM-GDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFD 412
           +  +  D+FR+ +  G  G ++D+GTAVTRL TP Y+A RDAFVA T  LP+A GVSIFD
Sbjct: 396 LDSVPPDVFRIDETSGRGGFIVDSGTAVTRLQTPVYDAVRDAFVAGTRQLPKAEGVSIFD 455

Query: 413 TCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNI 472
           TCY+LSG   V VPTV F+FS G  L LPA N++IP+D  GTFCFAFAP+ S LSI+GNI
Sbjct: 456 TCYDLSGLPLVNVPTVVFHFSNGASLILPAKNYMIPMDFMGTFCFAFAPATSDLSIMGNI 515

Query: 473 QQEGIQISFDGANGFVGFGPNVC 495
           QQ+GI++SFD AN  VGF    C
Sbjct: 516 QQQGIRVSFDTANSLVGFALRQC 538


>gi|116786826|gb|ABK24255.1| unknown [Picea sitchensis]
 gi|148906052|gb|ABR16185.1| unknown [Picea sitchensis]
          Length = 485

 Score =  443 bits (1139), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 232/430 (53%), Positives = 298/430 (69%), Gaps = 16/430 (3%)

Query: 79  WNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEV 138
           W++ LVHRD M  +SN  N + Y         R++RD  RVA +  RL        +  +
Sbjct: 59  WSIPLVHRDAMKGNSNKNNELSYAER---MQQRLKRDAARVAAINSRLELAVNGIKRSSL 115

Query: 139 ------------QDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQP 186
                        DF + VVSGMDQGSGEYF RIGVG+P R Q MV+D+GSD+ W+QC+P
Sbjct: 116 KPDSSSSFTMAESDFQSPVVSGMDQGSGEYFSRIGVGAPRRDQLMVLDTGSDVTWIQCEP 175

Query: 187 CSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGC-HAGRCRYEVSYGDGSYTKGTL 245
           CS CY+QSDP+++PA S+S+  V C + +C +L+ +GC   G C Y+VSYGDGSYT+G  
Sbjct: 176 CSDCYQQSDPIYNPALSSSYKLVGCQANLCQQLDVSGCSRNGSCLYQVSYGDGSYTQGNF 235

Query: 246 ALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLV 305
           A ETLT+G   ++NVAIGCGH N+G+FVGAAGLLGLGGGS+S   QL  + G  FSYCLV
Sbjct: 236 ATETLTLGGAPLQNVAIGCGHDNEGLFVGAAGLLGLGGGSLSFPSQLTDENGKIFSYCLV 295

Query: 306 SRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLT 365
            R + SS +L FGR A+P GA   P+++N R  +FYYV LSG+ VGG  + IS+ +F + 
Sbjct: 296 DRDSESSSTLQFGRAAVPNGAVLAPMLKNSRLDTFYYVSLSGISVGGKMLSISDSVFGID 355

Query: 366 QMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRV 425
             G+ GV++D+GTAVTRL T AY++ RDAF A T NLP   GVS+FDTCY+LS   SV V
Sbjct: 356 ASGNGGVIVDSGTAVTRLQTAAYDSLRDAFRAGTKNLPSTDGVSLFDTCYDLSSKESVDV 415

Query: 426 PTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGAN 485
           PTV F+FSGG  ++LPA N+L+PVD  GTFCFAFAP+ S LSI+GNIQQ+GI++SFD AN
Sbjct: 416 PTVVFHFSGGGSMSLPAKNYLVPVDSMGTFCFAFAPTSSSLSIVGNIQQQGIRVSFDRAN 475

Query: 486 GFVGFGPNVC 495
             VGF  N C
Sbjct: 476 NQVGFAVNKC 485


>gi|148907930|gb|ABR17085.1| unknown [Picea sitchensis]
          Length = 498

 Score =  439 bits (1129), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 228/434 (52%), Positives = 297/434 (68%), Gaps = 10/434 (2%)

Query: 71  NTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHAR-------MQRDVKRVATLV 123
            T    + W++E+VHRD +   +       Y R       R       ++R ++R  TL 
Sbjct: 66  ETKPRRSPWSVEVVHRDALLLKNAANATASYERRLKEKLRREAVRVRGLERQIERTLTLN 125

Query: 124 RRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQ 183
           +       + A+ +  DFG +VVSGM+QGSGEYF RIGVG+P R QYMV+D+GSD+ W+Q
Sbjct: 126 KDPVNRYENVAEVDA-DFGGEVVSGMEQGSGEYFTRIGVGTPTREQYMVLDTGSDVAWIQ 184

Query: 184 CQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKG 243
           C+PC +CY Q+DP+F+P+ SASFS V C SAVC +L+   CH+G C YE SYGDGSY+ G
Sbjct: 185 CEPCRECYSQADPIFNPSYSASFSTVGCDSAVCSQLDAYDCHSGGCLYEASYGDGSYSTG 244

Query: 244 TLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYC 303
           + A ETLT G T V NVAIGCGHKN G+F+GAAGLLGLG G++S   Q+G QTG  FSYC
Sbjct: 245 SFATETLTFGTTSVANVAIGCGHKNVGLFIGAAGLLGLGAGALSFPNQIGTQTGHTFSYC 304

Query: 304 LVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRI-PISEDLF 362
           LV R + SSG L FG +++PVG+ + PL +NP  P+FYY+ ++ + VGG  +  I  ++F
Sbjct: 305 LVDRESDSSGPLQFGPKSVPVGSIFTPLEKNPHLPTFYYLSVTAISVGGALLDSIPPEVF 364

Query: 363 RLTQM-GDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFV 421
           R+ +  G  G ++D+GT VTRL T AY+A RDAFVA TG LPR   VSIFDTCY+LSG  
Sbjct: 365 RIDETSGHGGFIIDSGTVVTRLVTSAYDAVRDAFVAGTGQLPRTDAVSIFDTCYDLSGLQ 424

Query: 422 SVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISF 481
            V VPTV F+FS G  L LPA N+LIP+D  GTFCFAFAP+ S +SI+GN QQ+ I++SF
Sbjct: 425 FVSVPTVGFHFSNGASLILPAKNYLIPMDTVGTFCFAFAPAASSVSIMGNTQQQHIRVSF 484

Query: 482 DGANGFVGFGPNVC 495
           D AN  VGF  + C
Sbjct: 485 DSANSLVGFAFDQC 498


>gi|148908573|gb|ABR17396.1| unknown [Picea sitchensis]
          Length = 350

 Score =  417 bits (1071), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 209/349 (59%), Positives = 264/349 (75%), Gaps = 2/349 (0%)

Query: 149 MDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSG 208
           M+QGSGEYF RIG+G+P R QYMV+D+GSD+VW+QC+PC +CY Q+DP+F+P+ S SFS 
Sbjct: 1   MEQGSGEYFTRIGIGTPTREQYMVLDTGSDVVWIQCEPCRECYSQADPIFNPSSSVSFST 60

Query: 209 VSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKN 268
           V C SAVC +L+   CH G C YEVSYGDGSYT G+ A ETLT G T ++NVAIGCGH N
Sbjct: 61  VGCDSAVCSQLDANDCHGGGCLYEVSYGDGSYTVGSYATETLTFGTTSIQNVAIGCGHDN 120

Query: 269 QGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAW 328
            G+FVGAAGLLGLG GS+S   QLG QTG AFSYCLV R + SSG+L FG E++P+G+ +
Sbjct: 121 VGLFVGAAGLLGLGAGSLSFPAQLGTQTGRAFSYCLVDRDSESSGTLEFGPESVPIGSIF 180

Query: 329 VPLVRNPRAPSFYYVGLSGLGVGGMRI-PISEDLFRLTQ-MGDDGVVMDTGTAVTRLPTP 386
            PLV NP  P+FYY+ +  + VGG+ +  +  + FR+ +  G  G+++D+GTAVTRL T 
Sbjct: 181 TPLVANPFLPTFYYLSMVAISVGGVILDSVPSEAFRIDETTGRGGIIIDSGTAVTRLQTS 240

Query: 387 AYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFL 446
           AY+A RDAF+A T +LPRA G+SIFDTCY+LS   SV +P V F+FS G    LPA N L
Sbjct: 241 AYDALRDAFIAGTQHLPRADGISIFDTCYDLSALQSVSIPAVGFHFSNGAGFILPAKNCL 300

Query: 447 IPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           IP+D  GTFCFAFAP+ S LSI+GNIQQ+GI++SFD AN  VGF  + C
Sbjct: 301 IPMDSMGTFCFAFAPADSNLSIMGNIQQQGIRVSFDSANSLVGFAIDQC 349


>gi|357166728|ref|XP_003580821.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 479

 Score =  414 bits (1063), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 247/445 (55%), Positives = 296/445 (66%), Gaps = 28/445 (6%)

Query: 68  SSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLS 127
           + S  SS   R +L+L+HRD +S + + +        +H+  A   RD  RVA L RRLS
Sbjct: 46  APSVPSSTTRRPSLQLLHRDTVSGTKHPS-------RRHAVLALASRDTARVAYLQRRLS 98

Query: 128 GGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC 187
              + ++   V+  GT V      GSGEY VR+G+GSPP  Q++V D+GSD++WVQC PC
Sbjct: 99  PSPSPSSTSSVESGGTIV----SHGSGEYLVRVGIGSPPLEQHLVADTGSDVIWVQCSPC 154

Query: 188 SQCYKQSDPVFDPADSASFSGVSCSSAVCDRLEN-----AGCHAGRCRYEVSYGDGSYTK 242
           S CY Q DP+FDPA+SASFS V C+S VC           G   G C Y+VSYGD SYT 
Sbjct: 155 SDCYAQGDPLFDPANSASFSPVPCNSGVCRAAARYSSSSCGGGGGECEYKVSYGDKSYTN 214

Query: 243 GTLALETLTI-GRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFS 301
           G LALETLT+ G T V+ VA+GCGH+N+G+F  AAGLLGLG G MSLVGQLGG  GGAFS
Sbjct: 215 GVLALETLTLDGGTEVQGVAMGCGHENRGLFAEAAGLLGLGWGPMSLVGQLGGAAGGAFS 274

Query: 302 YCLVSRGTGSSGS---LVFGRE-ALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPI 357
           YCL    +G       LV GRE A P GA WVPLVRNP APSFYYVG++GLGV G R+ +
Sbjct: 275 YCLAGYYSGEGSGSGSLVLGREDAAPTGAVWVPLVRNPDAPSFYYVGVNGLGVAGERLQL 334

Query: 358 SEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFV-AQTGNLPRASGVSIFDTCYN 416
            + LF L   G  GVVMDTGTAVTRLP  AY A R AF  A     PRA GVS+FDTCY+
Sbjct: 335 QDGLFDLGDDGGGGVVMDTGTAVTRLPAEAYAALRGAFAGAFEEGAPRAPGVSLFDTCYD 394

Query: 417 LSGFVSVRVPTVSFYFSG------GPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIG 470
           LSG+ SVRVPTV+ YF G         LTLPA N L+PVDD GT+C AFA   SG SI+G
Sbjct: 395 LSGYASVRVPTVALYFGGGGQGQEAASLTLPARNLLVPVDDGGTYCLAFAAVASGPSILG 454

Query: 471 NIQQEGIQISFDGANGFVGFGPNVC 495
           NIQQ+GI+I+ D A+G+VGFGP  C
Sbjct: 455 NIQQQGIEITVDSASGYVGFGPATC 479


>gi|118484458|gb|ABK94105.1| unknown [Populus trichocarpa]
          Length = 499

 Score =  412 bits (1060), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 211/411 (51%), Positives = 281/411 (68%), Gaps = 13/411 (3%)

Query: 95  TTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEV---------QDFGTDV 145
           T   +H+  ++    +R+ RD  R  +L  RL     D +K ++         +D  T V
Sbjct: 91  TIYKIHHKDYKSLVLSRLHRDTVRFNSLTARLQLALEDISKSDLKPLETEIKPEDLSTPV 150

Query: 146 VSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSAS 205
            SG  QGSGEYF R+GVG+P R  YMV+D+GSDI W+QCQPC+ CY+Q+DP+FDP  S++
Sbjct: 151 TSGTSQGSGEYFTRVGVGNPARQFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPTASST 210

Query: 206 FSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGC 264
           ++ V+C S  C  LE + C +G+C Y+V+YGDGSYT G  A E+++ G +  VKNVA+GC
Sbjct: 211 YAPVTCQSQQCSSLEMSSCRSGQCLYQVNYGDGSYTFGDFATESVSFGNSGSVKNVALGC 270

Query: 265 GHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPV 324
           GH N+G+FVGAAGLLGLGGG +SL  QL      +FSYCLV+R +  S +L F    L V
Sbjct: 271 GHDNEGLFVGAAGLLGLGGGPLSLTNQLKAT---SFSYCLVNRDSAGSSTLDFNSAQLGV 327

Query: 325 GAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLP 384
            +   PL++N +  +FYYVGLSG+ VGG  + I E  FRL + G+ G+++D GTA+TRL 
Sbjct: 328 DSVTAPLMKNRKIDTFYYVGLSGMSVGGQMVSIPESTFRLDESGNGGIIVDCGTAITRLQ 387

Query: 385 TPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASN 444
           T AY   RDAFV  T NL   S V++FDTCY+LSG  SVRVPTVSF+F+ G    LPA+N
Sbjct: 388 TQAYNPLRDAFVRMTQNLKLTSAVALFDTCYDLSGQASVRVPTVSFHFADGKSWNLPAAN 447

Query: 445 FLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           +LIPVD AGT+CFAFAP+ S LSIIGN+QQ+G +++FD AN  +GF PN C
Sbjct: 448 YLIPVDSAGTYCFAFAPTTSSLSIIGNVQQQGTRVTFDLANNRMGFSPNKC 498


>gi|297834758|ref|XP_002885261.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297331101|gb|EFH61520.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 500

 Score =  401 bits (1030), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 224/510 (43%), Positives = 322/510 (63%), Gaps = 25/510 (4%)

Query: 1   MAFSQTTLLLKQVLLLHLLCSIITTSTSAASDTHFQILNVNESIKGSRTDHAKMSQYNEL 60
           MAF +   LL  V L   L +   +S S ++ T   +L+V  S++ ++T  +     + L
Sbjct: 1   MAFPRFLSLLTTVTLSLFLTATDASSRSLSTSTKTTVLDVVSSLQQTQTILSLDPTRSSL 60

Query: 61  F-ERHNNISSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRV 119
              +  +IS     +  +  +LEL  RD + +S        +  ++    +R++RD  RV
Sbjct: 61  TATKPESISDPVFFNSSSPLSLELHSRDTLVAS-------QHKDYKSLVLSRLERDSSRV 113

Query: 120 ATLVR--RLSGGGADAA----------KHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPR 167
           A +    R +  G D +          +++ +   T VVSG+ QGSGEYF RIGVG+P +
Sbjct: 114 AGIAAKIRFAVEGIDRSDLKPVNNEDTRYQPEALTTPVVSGVSQGSGEYFSRIGVGTPAK 173

Query: 168 SQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAG 227
             Y+V+D+GSD+ W+QC+PCS CY+QSDPVF+P  S+++  ++CS+  C  LE + C + 
Sbjct: 174 EMYLVLDTGSDVNWIQCEPCSDCYQQSDPVFNPTSSSTYKSLTCSAPQCSLLETSACRSN 233

Query: 228 RCRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQGMFVGAAGLLGLGGGSM 286
           +C Y+VSYGDGS+T G LA +T+T G +  + +VA+GCGH N+G+F GAAGLLGLGGG++
Sbjct: 234 KCLYQVSYGDGSFTVGELATDTVTFGNSGKINDVALGCGHDNEGLFTGAAGLLGLGGGAL 293

Query: 287 SLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLS 346
           S+  Q+      +FSYCLV R +G S SL F    L  G A  PL+RN +  +FYYVGLS
Sbjct: 294 SITNQMKAT---SFSYCLVDRDSGKSSSLDFNSVQLGSGDATAPLLRNQKIDTFYYVGLS 350

Query: 347 GLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPR-A 405
           G  VGG ++ + + +F +   G  GV++D GTAVTRL T AY + RDAF+  T NL +  
Sbjct: 351 GFSVGGQKVMMPDAIFDVDASGSGGVILDCGTAVTRLQTQAYNSLRDAFLKLTTNLKKGT 410

Query: 406 SGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSG 465
           S +S+FDTCY+ S   SV+VPTV+F+F+GG  L LPA N+LIPVDD GTFCFAFAP+ S 
Sbjct: 411 SSISLFDTCYDFSSLSSVKVPTVAFHFTGGKSLDLPAKNYLIPVDDNGTFCFAFAPTSSS 470

Query: 466 LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           LSIIGN+QQ+G +I++D AN  +G   N C
Sbjct: 471 LSIIGNVQQQGTRITYDLANKIIGLSGNKC 500


>gi|15229656|ref|NP_188478.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|75273882|sp|Q9LS40.1|ASPG1_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 1;
           Short=AtASPG1; Flags: Precursor
 gi|11994113|dbj|BAB01116.1| CND41, chloroplast nucleoid DNA binding protein-like [Arabidopsis
           thaliana]
 gi|23297732|gb|AAN13013.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
           thaliana]
 gi|332642583|gb|AEE76104.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 500

 Score =  399 bits (1024), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 227/514 (44%), Positives = 322/514 (62%), Gaps = 33/514 (6%)

Query: 1   MAFSQTTLLLKQVLLLHLLCSIITTSTSAASDTHFQILNVNESIKGSRT----DHAKMSQ 56
           MAF +   LL  V L   L +   +S S ++     +L+V  S++ ++T    D  + S 
Sbjct: 1   MAFPRFLSLLAVVTLSLFLTTTDASSRSLSTPPKTNVLDVVSSLQQTQTILSLDPTRSSL 60

Query: 57  YNELFERHNNISSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFH-ARMQRD 115
                E  ++    N+SS     +LEL  RD   +S         H+   S   +R++RD
Sbjct: 61  TTTKPESLSDPVFFNSSS---PLSLELHSRDTFVASQ--------HKDYKSLTLSRLERD 109

Query: 116 VKRVATLVR--RLSGGGADAA----------KHEVQDFGTDVVSGMDQGSGEYFVRIGVG 163
             RVA +V   R +  G D +          +++ +D  T VVSG  QGSGEYF RIGVG
Sbjct: 110 SSRVAGIVAKIRFAVEGVDRSDLKPVYNEDTRYQTEDLTTPVVSGASQGSGEYFSRIGVG 169

Query: 164 SPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAG 223
           +P +  Y+V+D+GSD+ W+QC+PC+ CY+QSDPVF+P  S+++  ++CS+  C  LE + 
Sbjct: 170 TPAKEMYLVLDTGSDVNWIQCEPCADCYQQSDPVFNPTSSSTYKSLTCSAPQCSLLETSA 229

Query: 224 CHAGRCRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQGMFVGAAGLLGLG 282
           C + +C Y+VSYGDGS+T G LA +T+T G +  + NVA+GCGH N+G+F GAAGLLGLG
Sbjct: 230 CRSNKCLYQVSYGDGSFTVGELATDTVTFGNSGKINNVALGCGHDNEGLFTGAAGLLGLG 289

Query: 283 GGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYY 342
           GG +S+  Q+      +FSYCLV R +G S SL F    L  G A  PL+RN +  +FYY
Sbjct: 290 GGVLSITNQMKAT---SFSYCLVDRDSGKSSSLDFNSVQLGGGDATAPLLRNKKIDTFYY 346

Query: 343 VGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNL 402
           VGLSG  VGG ++ + + +F +   G  GV++D GTAVTRL T AY + RDAF+  T NL
Sbjct: 347 VGLSGFSVGGEKVVLPDAIFDVDASGSGGVILDCGTAVTRLQTQAYNSLRDAFLKLTVNL 406

Query: 403 PR-ASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAP 461
            + +S +S+FDTCY+ S   +V+VPTV+F+F+GG  L LPA N+LIPVDD+GTFCFAFAP
Sbjct: 407 KKGSSSISLFDTCYDFSSLSTVKVPTVAFHFTGGKSLDLPAKNYLIPVDDSGTFCFAFAP 466

Query: 462 SPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           + S LSIIGN+QQ+G +I++D +   +G   N C
Sbjct: 467 TSSSLSIIGNVQQQGTRITYDLSKNVIGLSGNKC 500


>gi|19424106|gb|AAL87345.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
           thaliana]
          Length = 500

 Score =  398 bits (1023), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 227/514 (44%), Positives = 322/514 (62%), Gaps = 33/514 (6%)

Query: 1   MAFSQTTLLLKQVLLLHLLCSIITTSTSAASDTHFQILNVNESIKGSRT----DHAKMSQ 56
           MAF +   LL  V L   L +   +S S ++     +L+V  S++ ++T    D  + S 
Sbjct: 1   MAFPRFLSLLAVVTLSLFLTTTDASSRSLSTPPKTNVLDVVSSLQQTQTILSLDPTRSSL 60

Query: 57  YNELFERHNNISSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFH-ARMQRD 115
                E  ++    N+SS     +LEL  RD   +S         H+   S   +R++RD
Sbjct: 61  TTTKPESLSDPVFFNSSS---PLSLELHSRDTFVASQ--------HKDYKSLTLSRLERD 109

Query: 116 VKRVATLVR--RLSGGGADAA----------KHEVQDFGTDVVSGMDQGSGEYFVRIGVG 163
             RVA +V   R +  G D +          +++ +D  T VVSG  QGSGEYF RIGVG
Sbjct: 110 SSRVAGIVAKIRFAVEGVDRSDLKPVYNEDTRYQTEDLTTPVVSGASQGSGEYFSRIGVG 169

Query: 164 SPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAG 223
           +P +  Y+V+D+GSD+ W+QC+PC+ CY+QSDPVF+P  S+++  ++CS+  C  LE + 
Sbjct: 170 TPAKDMYLVLDTGSDVNWIQCEPCADCYQQSDPVFNPTSSSTYKSLTCSAPQCSLLETSA 229

Query: 224 CHAGRCRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQGMFVGAAGLLGLG 282
           C + +C Y+VSYGDGS+T G LA +T+T G +  + NVA+GCGH N+G+F GAAGLLGLG
Sbjct: 230 CRSNKCLYQVSYGDGSFTVGELATDTVTFGNSGKINNVALGCGHDNEGLFTGAAGLLGLG 289

Query: 283 GGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYY 342
           GG +S+  Q+      +FSYCLV R +G S SL F    L  G A  PL+RN +  +FYY
Sbjct: 290 GGVLSITNQMKAT---SFSYCLVDRDSGKSSSLDFNSVQLGGGDATAPLLRNKKIDTFYY 346

Query: 343 VGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNL 402
           VGLSG  VGG ++ + + +F +   G  GV++D GTAVTRL T AY + RDAF+  T NL
Sbjct: 347 VGLSGFSVGGEKVVLPDAIFDVDASGSGGVILDCGTAVTRLQTQAYNSLRDAFLKLTVNL 406

Query: 403 PR-ASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAP 461
            + +S +S+FDTCY+ S   +V+VPTV+F+F+GG  L LPA N+LIPVDD+GTFCFAFAP
Sbjct: 407 KKGSSSISLFDTCYDFSSLSTVKVPTVAFHFTGGKSLDLPAKNYLIPVDDSGTFCFAFAP 466

Query: 462 SPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           + S LSIIGN+QQ+G +I++D +   +G   N C
Sbjct: 467 TSSSLSIIGNVQQQGTRITYDLSKNVIGLSGNKC 500


>gi|255564685|ref|XP_002523337.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223537425|gb|EEF39053.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 469

 Score =  397 bits (1021), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 220/432 (50%), Positives = 284/432 (65%), Gaps = 16/432 (3%)

Query: 68  SSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLS 127
           S ++T+   A ++++L H D +S +S           +  F  R+QRD  RV  +     
Sbjct: 49  SPTDTAESSATFSVQLHHVDALSFNSTP---------ETLFTTRLQRDAARVEAISYLAE 99

Query: 128 GGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC 187
             G    K     F + V+SG+ QGSGEYF RIGVG+PPR  YMV+D+GSDIVW+QC PC
Sbjct: 100 TAGT--GKRVGTGFSSSVISGLAQGSGEYFTRIGVGTPPRYVYMVLDTGSDIVWIQCAPC 157

Query: 188 SQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGR--CRYEVSYGDGSYTKGTL 245
            +CY QSDPVFDP  S SF+ ++C S +C RL++ GC+  +  C Y+VSYGDGS+T G  
Sbjct: 158 KRCYAQSDPVFDPRKSRSFASIACRSPLCHRLDSPGCNTQKQTCMYQVSYGDGSFTFGDF 217

Query: 246 ALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLV 305
           + ETLT  RT V  VA+GCGH N+G+FVGAAGLLGLG G +S   Q G +    FSYCLV
Sbjct: 218 STETLTFRRTRVARVALGCGHDNEGLFVGAAGLLGLGRGRLSFPSQTGRRFNHKFSYCLV 277

Query: 306 SRGTGSS-GSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIP-ISEDLFR 363
            R   S   S+VFG  A+   A + PLV NP+  +FYYV L G+ VGG R+P I+  LF+
Sbjct: 278 DRSASSKPSSMVFGDSAVSRTARFTPLVSNPKLDTFYYVELLGISVGGTRVPGITASLFK 337

Query: 364 LTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSV 423
           L Q G+ GV++D+GT+VTRL  PAY AFRDAF A   NL RA   S+FDTC++LSG   V
Sbjct: 338 LDQTGNGGVIIDSGTSVTRLTRPAYIAFRDAFRAGASNLKRAPQFSLFDTCFDLSGKTEV 397

Query: 424 RVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDG 483
           +VPTV  +F G  V +LPASN+LIPVD +G FC AFA +  GLSIIGNIQQ+G ++ +D 
Sbjct: 398 KVPTVVLHFRGADV-SLPASNYLIPVDTSGNFCLAFAGTMGGLSIIGNIQQQGFRVVYDL 456

Query: 484 ANGFVGFGPNVC 495
           A   VGF P+ C
Sbjct: 457 AGSRVGFAPHGC 468


>gi|224111722|ref|XP_002315953.1| predicted protein [Populus trichocarpa]
 gi|222864993|gb|EEF02124.1| predicted protein [Populus trichocarpa]
          Length = 484

 Score =  395 bits (1015), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 219/473 (46%), Positives = 302/473 (63%), Gaps = 35/473 (7%)

Query: 37  ILNVNESIKGSR---TDHAKMSQYNELFERHNNISSSNTSSDEARWNLELVHRDKMSSSS 93
           +L+V  SI+ ++   +   KMS +N+            T+S E    +EL+ R  +  ++
Sbjct: 33  VLDVAASIQRTKNIFSSGPKMSPFNQ--------QEKETTSSE--LTVELLSRTSIQKTT 82

Query: 94  NTTNNMHYHRHQHSFHARMQRDVKRVATLVRRL-----SGGGADAAKHEV------QDFG 142
           +T        ++    +R+QRD  RV +LV RL     S   +D    E       +D  
Sbjct: 83  HTG-------YKSLTLSRLQRDSARVKSLVTRLDLAINSISSSDLKPLETDSEFKPEDLQ 135

Query: 143 TDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPAD 202
           + ++SG  QGSGEYF R+G+G PP   Y+++D+GSD+ WVQC PC+ CY+Q+DP+F+PA 
Sbjct: 136 SPIISGTSQGSGEYFSRVGIGKPPSQAYLILDTGSDVNWVQCAPCADCYQQADPIFEPAS 195

Query: 203 SASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAI 262
           SASFS +SC++  C  L+ + C    C YEVSYGDGSYT G    ET+T+G   V NVAI
Sbjct: 196 SASFSTLSCNTRQCRSLDVSECRNDTCLYEVSYGDGSYTVGDFVTETITLGSAPVDNVAI 255

Query: 263 GCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREAL 322
           GCGH N+G+FVGAAGLLGLGGGS+S   Q+      +FSYCLV R + S+ +L F    L
Sbjct: 256 GCGHNNEGLFVGAAGLLGLGGGSLSFPSQINAT---SFSYCLVDRDSESASTLEFN-STL 311

Query: 323 PVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTR 382
           P  A   PL+RN    +FYYVGL+GL VGG  + I E  F++ + G+ GV++D+GTA+TR
Sbjct: 312 PPNAVSAPLLRNHHLDTFYYVGLTGLSVGGELVSIPESAFQIDESGNGGVIVDSGTAITR 371

Query: 383 LPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPA 442
           L T  Y + RDAFV +T +LP  +G+++FDTCY+LS   +V VPTVSF+F  G  L LPA
Sbjct: 372 LQTDVYNSLRDAFVKRTRDLPSTNGIALFDTCYDLSSKGNVEVPTVSFHFPDGKELPLPA 431

Query: 443 SNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
            N+L+P+D  GTFCFAFAP+ S LSIIGN+QQ+G ++ +D  N  VGF PN C
Sbjct: 432 KNYLVPLDSEGTFCFAFAPTASSLSIIGNVQQQGTRVVYDLVNHLVGFVPNKC 484


>gi|312281631|dbj|BAJ33681.1| unnamed protein product [Thellungiella halophila]
          Length = 502

 Score =  395 bits (1014), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 223/515 (43%), Positives = 317/515 (61%), Gaps = 33/515 (6%)

Query: 1   MAFSQTTLLLKQVLLLHLLCSIITTSTSAASDTHFQILNVNESIKGSR----TDHAKMSQ 56
           MAF +   LL  V L   L +   +S S ++     +L+V  S++ ++     D  + S 
Sbjct: 1   MAFPRFLSLLSVVTLSICLTTTDASSRSLSTSHKTTVLDVVSSLQQTQHILSVDPTRSSL 60

Query: 57  YNEL--FERHNNISSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQR 114
              +  F+  ++    N+SS     +LEL  RD + +S        +  ++    +R++R
Sbjct: 61  TARIPEFKPESDPVFLNSSS---PLSLELHSRDTLVAS-------QHKDYKSLVLSRLER 110

Query: 115 DVKRVATLVRR------------LSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGV 162
           D  RVA +  +            L     D  + + +D  T VVSG  QGSGEYF RIGV
Sbjct: 111 DSSRVAGIAAKIRFAVEGIDRSDLKPVDIDETRFQPEDLTTPVVSGTSQGSGEYFSRIGV 170

Query: 163 GSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENA 222
           G+P +  Y+V+D+GSD+ W+QC PCS+CY+QSDP+FDP  S++F  ++CS   C  L+ +
Sbjct: 171 GTPAKEMYVVLDTGSDVNWIQCLPCSECYQQSDPIFDPTSSSTFKSLTCSDPKCASLDVS 230

Query: 223 GCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQGMFVGAAGLLGL 281
            C + +C Y+VSYGDGS+T G  A +T+T G +  V +VA+GCGH N+G+F GAAGLLGL
Sbjct: 231 ACRSNKCLYQVSYGDGSFTVGNYATDTVTFGESGKVNDVALGCGHDNEGLFTGAAGLLGL 290

Query: 282 GGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFY 341
           GGG++S+  Q+  +   +FSYCLV R +  S SL F    +  G A  PL+RN +  +FY
Sbjct: 291 GGGALSMTNQIKAK---SFSYCLVDRDSAKSSSLDFNSVQIGAGDATAPLLRNSKMDTFY 347

Query: 342 YVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGN 401
           YVGLSG  VGG ++ I   LF +   G  GV++D GTAVTRL T AY + RDAFV  T +
Sbjct: 348 YVGLSGFSVGGQQVSIPSSLFEVDASGAGGVILDCGTAVTRLQTQAYNSLRDAFVKLTTD 407

Query: 402 LPR-ASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFA 460
             +  S +S+FDTCY+ S   +V+VPTV+F+F+GG  L LPA N+LIP+DDAGTFCFAFA
Sbjct: 408 FKKGTSPISLFDTCYDFSSLSTVKVPTVTFHFTGGKSLNLPAKNYLIPIDDAGTFCFAFA 467

Query: 461 PSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           P+ S LSIIGN+QQ+G +I++D AN  +G   N C
Sbjct: 468 PTSSSLSIIGNVQQQGTRITYDLANNLIGLSANKC 502


>gi|224126221|ref|XP_002329620.1| predicted protein [Populus trichocarpa]
 gi|222870359|gb|EEF07490.1| predicted protein [Populus trichocarpa]
          Length = 357

 Score =  395 bits (1014), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 200/358 (55%), Positives = 260/358 (72%), Gaps = 4/358 (1%)

Query: 139 QDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVF 198
           +D  T V SG  QGSGEYF R+GVG+P R  YMV+D+GSDI W+QCQPC+ CY+Q+DP+F
Sbjct: 3   EDLSTPVTSGTSQGSGEYFTRVGVGNPARQFYMVLDTGSDINWLQCQPCTDCYQQTDPIF 62

Query: 199 DPADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT-VV 257
           DP  S++++ V+C S  C  LE + C +G+C Y+V+YGDGSYT G  A E+++ G +  V
Sbjct: 63  DPTASSTYAPVTCQSQQCSSLEMSSCRSGQCLYQVNYGDGSYTFGDFATESVSFGNSGSV 122

Query: 258 KNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVF 317
           KNVA+GCGH N+G+FVGAAGLLGLGGG +SL  QL      +FSYCLV+R +  S +L F
Sbjct: 123 KNVALGCGHDNEGLFVGAAGLLGLGGGPLSLTNQLKAT---SFSYCLVNRDSAGSSTLDF 179

Query: 318 GREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTG 377
               L V +   PL++N +  +FYYVGLSG+ VGG  + I E  FRL + G+ G+++D G
Sbjct: 180 NSAQLGVDSVTAPLMKNRKIDTFYYVGLSGMSVGGQMVSIPESTFRLDESGNGGIIVDCG 239

Query: 378 TAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPV 437
           TA+TRL T AY   RDAFV  T NL   S V++FDTCY+LSG  SVRVPTVSF+F+ G  
Sbjct: 240 TAITRLQTQAYNPLRDAFVRMTQNLKLTSAVALFDTCYDLSGQASVRVPTVSFHFADGKS 299

Query: 438 LTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
             LPA+N+LIPVD AGT+CFAFAP+ S LSIIGN+QQ+G +++FD AN  +GF PN C
Sbjct: 300 WNLPAANYLIPVDSAGTYCFAFAPTTSSLSIIGNVQQQGTRVTFDLANNRMGFSPNKC 357


>gi|255576511|ref|XP_002529147.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223531426|gb|EEF33260.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 479

 Score =  394 bits (1012), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 207/431 (48%), Positives = 279/431 (64%), Gaps = 24/431 (5%)

Query: 85  HRDKMSSSSNTTNNMH----YHRHQHSFH-----ARMQRDVKRVATLVRRLSGGGADAAK 135
            ++ ++SSS  T  +H      + +H  +     +R++RD  RV ++  RL       + 
Sbjct: 53  QQEIVTSSSQLTMELHSRTSVQKTKHPDYRSLTLSRLERDSARVKSINTRLDLAIHGLST 112

Query: 136 HEVQDFGTD-----------VVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQC 184
            +++   TD           ++SG  QGSGEYF R+G+G P    YMV+D+GSD+ W+QC
Sbjct: 113 SDLKPLDTDSQFRAEDLQGPIISGTSQGSGEYFSRVGIGKPSSPVYMVLDTGSDVNWIQC 172

Query: 185 QPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGT 244
            PC+ CY Q+DP+F+PA S S+S +SC +  C  L+ + C    C YEVSYGDGSYT G 
Sbjct: 173 APCADCYHQADPIFEPASSTSYSPLSCDTKQCQSLDVSECRNNTCLYEVSYGDGSYTVGD 232

Query: 245 LALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCL 304
              ET+T+G   V NVAIGCGH N+G+F+GAAGLLGLGGG +S   Q+      +FSYCL
Sbjct: 233 FVTETITLGSASVDNVAIGCGHNNEGLFIGAAGLLGLGGGKLSFPSQINA---SSFSYCL 289

Query: 305 VSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRL 364
           V R + S+ +L F    LP  A   PL+RN    +FYYVG++GL VGG  + I E +F +
Sbjct: 290 VDRDSDSASTLEFNSALLP-HAITAPLLRNRELDTFYYVGMTGLSVGGELLSIPESMFEM 348

Query: 365 TQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVR 424
            + G+ G+++D+GTAVTRL T AY A RDAFV  T +LP  S V++FDTCY+LS   SV 
Sbjct: 349 DESGNGGIIIDSGTAVTRLQTAAYNALRDAFVKGTKDLPVTSEVALFDTCYDLSRKTSVE 408

Query: 425 VPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGA 484
           VPTV+F+ +GG VL LPA+N+LIPVD  GTFCFAFAP+ S LSIIGN+QQ+G ++ FD A
Sbjct: 409 VPTVTFHLAGGKVLPLPATNYLIPVDSDGTFCFAFAPTSSALSIIGNVQQQGTRVGFDLA 468

Query: 485 NGFVGFGPNVC 495
           N  VGF P  C
Sbjct: 469 NSLVGFEPRQC 479


>gi|15222611|ref|NP_173922.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|12321511|gb|AAG50814.1|AC079281_16 hypothetical protein [Arabidopsis thaliana]
 gi|20466516|gb|AAM20575.1| unknown protein [Arabidopsis thaliana]
 gi|23198172|gb|AAN15613.1| unknown protein [Arabidopsis thaliana]
 gi|110736960|dbj|BAF00436.1| hypothetical protein [Arabidopsis thaliana]
 gi|332192515|gb|AEE30636.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 483

 Score =  394 bits (1012), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 217/476 (45%), Positives = 304/476 (63%), Gaps = 32/476 (6%)

Query: 31  SDTHFQILNVNESIKGSRTDHAKMSQYNELFERHNNISSSNTSSDEARWNLELVHRDKMS 90
           S T   ILNV +SI   RT +    + N+  E+ ++ SSS        ++L+L  R  + 
Sbjct: 29  STTTTSILNVADSIH--RTKYTSSFRLNQQEEQTHSASSS--------FSLQLHSRVSVR 78

Query: 91  SSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAK-----------HEVQ 139
            +        +  ++    AR+ RD  RV +L+ RL     + +K            E Q
Sbjct: 79  GT-------EHSDYKSLTLARLNRDTARVKSLITRLDLAINNISKADLKPISTMYTTEEQ 131

Query: 140 DFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFD 199
           D    ++SG  QGSGEYF R+G+G P R  YMV+D+GSD+ W+QC PC+ CY Q++P+F+
Sbjct: 132 DIEAPLISGTTQGSGEYFTRVGIGKPAREVYMVLDTGSDVNWLQCTPCADCYHQTEPIFE 191

Query: 200 PADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKN 259
           P+ S+S+  +SC +  C+ LE + C    C YEVSYGDGSYT G  A ETLTIG T+V+N
Sbjct: 192 PSSSSSYEPLSCDTPQCNALEVSECRNATCLYEVSYGDGSYTVGDFATETLTIGSTLVQN 251

Query: 260 VAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGR 319
           VA+GCGH N+G+FVGAAGLLGLGGG ++L  QL      +FSYCLV R + S+ ++ FG 
Sbjct: 252 VAVGCGHSNEGLFVGAAGLLGLGGGLLALPSQLNTT---SFSYCLVDRDSDSASTVDFGT 308

Query: 320 EALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTA 379
              P  A   PL+RN +  +FYY+GL+G+ VGG  + I +  F + + G  G+++D+GTA
Sbjct: 309 SLSP-DAVVAPLLRNHQLDTFYYLGLTGISVGGELLQIPQSSFEMDESGSGGIIIDSGTA 367

Query: 380 VTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLT 439
           VTRL T  Y + RD+FV  T +L +A+GV++FDTCYNLS   +V VPTV+F+F GG +L 
Sbjct: 368 VTRLQTEIYNSLRDSFVKGTLDLEKAAGVAMFDTCYNLSAKTTVEVPTVAFHFPGGKMLA 427

Query: 440 LPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           LPA N++IPVD  GTFC AFAP+ S L+IIGN+QQ+G +++FD AN  +GF  N C
Sbjct: 428 LPAKNYMIPVDSVGTFCLAFAPTASSLAIIGNVQQQGTRVTFDLANSLIGFSSNKC 483


>gi|255541796|ref|XP_002511962.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223549142|gb|EEF50631.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 495

 Score =  392 bits (1008), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 208/413 (50%), Positives = 287/413 (69%), Gaps = 16/413 (3%)

Query: 94  NTTNNMHYHRHQHSFHARMQRDVKRVATLVRRL-------SGGGADAAKHEVQ--DFGTD 144
           +T +   +  ++    +R+ RD  RV  +  RL       S       + E+Q  D  T 
Sbjct: 88  DTIHKTPHKDYKALVLSRLHRDSSRVQAITTRLQLILNGVSKSDLKPLQTEIQPQDLSTP 147

Query: 145 VVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSA 204
           V SG  QGSGEYF R+GVG+P +S YMV+D+GSDI W+QCQPCS CY+QSDP+F PA S+
Sbjct: 148 VSSGTSQGSGEYFTRVGVGNPAKSYYMVLDTGSDINWIQCQPCSDCYQQSDPIFTPAASS 207

Query: 205 SFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIG 263
           S+S ++C S  C+ L+ + C  G+CRY+V+YGDGS+T G    ET++ G +  V ++A+G
Sbjct: 208 SYSPLTCDSQQCNSLQMSSCRNGQCRYQVNYGDGSFTFGDFVTETMSFGGSGTVNSIALG 267

Query: 264 CGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALP 323
           CGH N+G+FVGAAGLLGLGGG +SL  QL      +FSYCLV+R + +S +L F   + P
Sbjct: 268 CGHDNEGLFVGAAGLLGLGGGPLSLTSQLKAT---SFSYCLVNRDSAASSTLDF--NSAP 322

Query: 324 VGAAWV-PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTR 382
           VG + + PL+++ +  +FYYVGLSG+ VGG  + I +++F+L   GD GV++D GTA+TR
Sbjct: 323 VGDSVIAPLLKSSKIDTFYYVGLSGMSVGGELLRIPQEVFKLDDSGDGGVIVDCGTAITR 382

Query: 383 LPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPA 442
           L + AY + RD+FV+ + +L   SGV++FDTCY+LSG  SV+VPTVSF+F GG    LPA
Sbjct: 383 LQSEAYNSLRDSFVSMSRHLRSTSGVALFDTCYDLSGQSSVKVPTVSFHFDGGKSWDLPA 442

Query: 443 SNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           +N+LIPVD AGT+CFAFAP+ S LSIIGN+QQ+G ++SFD AN  VGF  N C
Sbjct: 443 ANYLIPVDSAGTYCFAFAPTTSSLSIIGNVQQQGTRVSFDLANNRVGFSTNKC 495


>gi|449434646|ref|XP_004135107.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 486

 Score =  392 bits (1008), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 214/461 (46%), Positives = 293/461 (63%), Gaps = 28/461 (6%)

Query: 52  AKMSQYNELF--ERHNNISSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFH 109
           A + +  ++F  E  ++     T SD +  +L+L  R  +  +S       +  ++    
Sbjct: 37  ASIQRTQQVFAVEPKSSTPDETTVSDPSSLSLQLNSRISVMKAS-------HSDYKSLTL 89

Query: 110 ARMQRDVKRVATLVRRLSGG-----GAD----------AAKHEVQDFGTDVVSGMDQGSG 154
           +R++RD  RV +L  R+        G D           ++   +DF + +VSG  QGSG
Sbjct: 90  SRLKRDSARVRSLTARIDLAIRGITGTDLEPLGNGGGGGSQFGTEDFESPIVSGASQGSG 149

Query: 155 EYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSA 214
           EYF R+G+G PP   YMV+D+GSD+ WVQC PC++CY+Q+DP+F+P  SASF+ +SC + 
Sbjct: 150 EYFSRVGIGRPPSPVYMVLDTGSDVSWVQCAPCAECYEQTDPIFEPTSSASFTSLSCETE 209

Query: 215 VCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVG 274
            C  L+ + C  G C YEVSYGDGSYT G    ET+T+G T + N+AIGCGH N+G+F+G
Sbjct: 210 QCKSLDVSECRNGTCLYEVSYGDGSYTVGDFVTETVTLGSTSLGNIAIGCGHNNEGLFIG 269

Query: 275 AAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRN 334
           AAGLLGLGGGS+S   QL      +FSYCLV R + S+ +L F     P  A   PL RN
Sbjct: 270 AAGLLGLGGGSLSFPSQLN---ASSFSYCLVDRDSDSTSTLDFNSPITP-DAVTAPLHRN 325

Query: 335 PRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDA 394
           P   +F+Y+GL+G+ VGG  +PI E  F++++ G+ G+++D+GTAVTRL T  Y   RDA
Sbjct: 326 PNLDTFFYLGLTGMSVGGAVLPIPETSFQMSEDGNGGIIVDSGTAVTRLQTTVYNVLRDA 385

Query: 395 FVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGT 454
           FV  T +L  A GV++FDTCY+LS    V VPTVSF+F+ G  L LPA N+LIPVD  GT
Sbjct: 386 FVKSTHDLQTARGVALFDTCYDLSSKSRVEVPTVSFHFANGNELPLPAKNYLIPVDSEGT 445

Query: 455 FCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           FCFAFAP+ S LSI+GN QQ+G ++ FD AN  VGF PN C
Sbjct: 446 FCFAFAPTDSTLSILGNAQQQGTRVGFDLANSLVGFSPNKC 486


>gi|116787333|gb|ABK24467.1| unknown [Picea sitchensis]
          Length = 497

 Score =  392 bits (1008), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 223/482 (46%), Positives = 314/482 (65%), Gaps = 42/482 (8%)

Query: 36  QILNVNESIKGSRTDHAKMS--QYNELF--ERHNNISSSNTSSDEARWNLELVHRDKMSS 91
           Q+L+V  ++K  R   +K+S  +++E    E  N+I             L++VHRD +SS
Sbjct: 34  QVLDVEAALK-LRISRSKVSAQEWSETVQGEEKNSIV------------LQVVHRDSLSS 80

Query: 92  SSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRR---------------LSGGGADAAKH 136
           SSNT+        +     R++RD  RV ++  R               L+G   DA + 
Sbjct: 81  SSNTSL------VKEILQERLKRDAARVDSINARVQLAAMGVSKAEMKPLNGSSIDA-RF 133

Query: 137 EVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDP 196
           + +DF + ++SG+ QGSGEYF R+GVG+PPR  YMV+D+GSDI+W+QC PC++CY Q+DP
Sbjct: 134 DAKDFSSSIISGLAQGSGEYFTRLGVGTPPRYTYMVLDTGSDIMWIQCLPCAKCYGQTDP 193

Query: 197 VFDPADSASFSGVSCSSAVCDRLENAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRT 255
           +F+PA S+++  V C++ +C +L+ +GC   R C Y+VSYGDGS+T G  + ETLT    
Sbjct: 194 LFNPAASSTYRKVPCATPLCKKLDISGCRNKRYCEYQVSYGDGSFTVGDFSTETLTFRGQ 253

Query: 256 VVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR-GTGSSGS 314
           V++ VA+GCGH N+G+F+GAAGLLGLG GS+S   Q G Q    FSYCLV R  +G++ S
Sbjct: 254 VIRRVALGCGHDNEGLFIGAAGLLGLGRGSLSFPSQTGAQFSKRFSYCLVDRSASGTASS 313

Query: 315 LVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRI-PISEDLFRLTQMGDDGVV 373
           L+FG+ A+P  A + PL+ NP+  +FYYV L G+ VGG R+  I   +FR+   G+ GV+
Sbjct: 314 LIFGKAAIPKSAIFTPLLSNPKLDTFYYVELVGISVGGRRLTSIPASVFRMDATGNGGVI 373

Query: 374 MDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFS 433
           +D+GT+VTRL   AY   RDAF   TGNL  A G S+FDTCY+LSG  +V+VPT+ F+F 
Sbjct: 374 IDSGTSVTRLVDSAYSTMRDAFRVGTGNLKSAGGFSLFDTCYDLSGLKTVKVPTLVFHFQ 433

Query: 434 GGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPN 493
           GG  ++LPA+N+LIPVD + TFCFAFA +  GLSIIGNIQQ+G ++ FD     VGF   
Sbjct: 434 GGAHISLPATNYLIPVDSSATFCFAFAGNTGGLSIIGNIQQQGYRVVFDSLANRVGFKAG 493

Query: 494 VC 495
            C
Sbjct: 494 SC 495


>gi|449530542|ref|XP_004172253.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
           CELL 1-like [Cucumis sativus]
          Length = 486

 Score =  390 bits (1003), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 214/461 (46%), Positives = 292/461 (63%), Gaps = 28/461 (6%)

Query: 52  AKMSQYNELF--ERHNNISSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFH 109
           A + +  ++F  E  ++     T SD +  +L+L  R  +  +S       +  ++    
Sbjct: 37  ASIQRTQQVFAVEPKSSTPDETTVSDPSSLSLQLNSRISVMKAS-------HSDYKSLTL 89

Query: 110 ARMQRDVKRVATLVRRLSGG-----GAD----------AAKHEVQDFGTDVVSGMDQGSG 154
           +R++RD  RV +L  R+        G D           ++   +DF + +VSG  QGSG
Sbjct: 90  SRLKRDSARVRSLTARIDLAIRGITGTDLEPLGNGGGGGSQFGTEDFESPIVSGASQGSG 149

Query: 155 EYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSA 214
           EYF R+G+G PP   YMV+D+GSD+ WVQC PC++CY+Q+DP F+P  SASF+ +SC + 
Sbjct: 150 EYFSRVGIGRPPSPVYMVLDTGSDVSWVQCAPCAECYEQTDPXFEPTSSASFTSLSCETE 209

Query: 215 VCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVG 274
            C  L+ + C  G C YEVSYGDGSYT G    ET+T+G T + N+AIGCGH N+G+F+G
Sbjct: 210 QCKSLDVSECRNGTCLYEVSYGDGSYTVGDFVTETVTLGSTSLGNIAIGCGHNNEGLFIG 269

Query: 275 AAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRN 334
           AAGLLGLGGGS+S   QL      +FSYCLV R + S+ +L F     P  A   PL RN
Sbjct: 270 AAGLLGLGGGSLSFPSQLN---ASSFSYCLVDRDSDSTSTLDFNSPITP-DAVTAPLHRN 325

Query: 335 PRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDA 394
           P   +F+Y+GL+G+ VGG  +PI E  F++++ G+ G+++D+GTAVTRL T  Y   RDA
Sbjct: 326 PNLDTFFYLGLTGMSVGGAVLPIPETSFQMSEDGNGGIIVDSGTAVTRLQTTVYNVLRDA 385

Query: 395 FVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGT 454
           FV  T +L  A GV++FDTCY+LS    V VPTVSF+F+ G  L LPA N+LIPVD  GT
Sbjct: 386 FVKSTHDLQTARGVALFDTCYDLSSKSRVEVPTVSFHFANGNELPLPAKNYLIPVDSEGT 445

Query: 455 FCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           FCFAFAP+ S LSI+GN QQ+G ++ FD AN  VGF PN C
Sbjct: 446 FCFAFAPTDSTLSILGNAQQQGTRVGFDLANSLVGFSPNKC 486


>gi|297845610|ref|XP_002890686.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297336528|gb|EFH66945.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 486

 Score =  390 bits (1002), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 212/471 (45%), Positives = 306/471 (64%), Gaps = 33/471 (7%)

Query: 37  ILNVNESIKGSRTDHAKMSQYNELFERHNNISSSNTSSDEARWNLELVHRDKMSSSSNTT 96
           ILNV +SI   RT +    + N+  E+ ++ SSS        ++L+L  R  +  +    
Sbjct: 37  ILNVADSIH--RTKYTSSFRLNQQEEQTHSRSSS--------FSLQLHSRVSVRGT---- 82

Query: 97  NNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQ------------DFGTD 144
               +  ++    AR+ RD  RV +L+ RL     + +K +++            D    
Sbjct: 83  ---EHSDYKSLTLARLNRDTARVKSLITRLDLAINNISKADLKPVTTMYTTTEEEDIEAP 139

Query: 145 VVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSA 204
           ++SG  QGSGEYF R+G+G+P R  YMV+D+GSD+ W+QC PC+ CY Q++P+F+P+ S+
Sbjct: 140 LISGTTQGSGEYFTRVGIGNPAREVYMVLDTGSDVNWLQCTPCADCYHQTEPIFEPSSSS 199

Query: 205 SFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGC 264
           S+  +SC +  C+ LE + C    C YEVSYGDGSYT G  A ETLTIG T+V+NVA+GC
Sbjct: 200 SYEPLSCDTPQCNALEVSECRNATCLYEVSYGDGSYTVGDFATETLTIGSTLVQNVAVGC 259

Query: 265 GHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPV 324
           GH N+G+FVGAAGLLGLGGG ++L  QL      +FSYCLV R + S+ ++ FG  +LP 
Sbjct: 260 GHSNEGLFVGAAGLLGLGGGLLALPSQLNTT---SFSYCLVDRDSDSASTVEFG-TSLPP 315

Query: 325 GAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLP 384
            A   PL+RN +  +FYY+GL+G+ VGG  + I +  F + + G  G+++D+GTAVTRL 
Sbjct: 316 DAVVAPLLRNHQLDTFYYLGLTGISVGGELLQIPQSSFEMDESGSGGIIIDSGTAVTRLQ 375

Query: 385 TPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASN 444
           T  Y + RD+F+  T +L +A+GV++FDTCYNLS   ++ VPTV+F+F GG +L LPA N
Sbjct: 376 TGIYNSLRDSFLKGTSDLEKAAGVAMFDTCYNLSAKTTIEVPTVAFHFPGGKMLALPAKN 435

Query: 445 FLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           ++IPVD  GTFC AFAP+ S L+IIGN+QQ+G +++FD AN  +GF  N C
Sbjct: 436 YMIPVDSVGTFCLAFAPTASSLAIIGNVQQQGTRVTFDLANSLIGFSSNKC 486


>gi|449443786|ref|XP_004139658.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
 gi|449475416|ref|XP_004154449.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 453

 Score =  390 bits (1002), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 217/432 (50%), Positives = 282/432 (65%), Gaps = 26/432 (6%)

Query: 68  SSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLS 127
           S S  SS +A   L+L H D +S +   T+          F+ R+ RD  RV  L  R +
Sbjct: 43  SQSLQSSPDAPLTLDLHHLDSLSLNKTPTD---------LFNLRLHRDTLRVHALNSRAA 93

Query: 128 GGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC 187
           G            F + VVSG+ QGSGEYF R+GVG+PPR  YMV+D+GSD+VW+QC PC
Sbjct: 94  G------------FSSSVVSGLSQGSGEYFTRLGVGTPPRYLYMVLDTGSDVVWLQCSPC 141

Query: 188 SQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGR--CRYEVSYGDGSYTKGTL 245
            +CY QSDP+F+P  S SF+G+ CSS +C RL+++GC   R  C Y+VSYGDGS+T G  
Sbjct: 142 RKCYSQSDPIFNPYKSKSFAGIPCSSPLCRRLDSSGCSTRRHTCLYQVSYGDGSFTTGDF 201

Query: 246 ALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLV 305
           A ETLT     +  VA+GCGH N+G+FVGAAGLLGLG G +S   Q G +    FSYCLV
Sbjct: 202 ATETLTFRGNKIAKVALGCGHHNEGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLV 261

Query: 306 SRGTGSS-GSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIP-ISEDLFR 363
            R   S   S+VFG  A+   A + PL+RNP+  +FYYVGL G+ VGG+R+  +S  LF+
Sbjct: 262 DRSASSKPSSMVFGDAAISRLARFTPLIRNPKLDTFYYVGLIGISVGGVRVRGVSPSLFK 321

Query: 364 LTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSV 423
           L   G+ GV++D+GT+VTRL  PAY A RDAF     +L R    S+FDTCY+LSG  SV
Sbjct: 322 LDSAGNGGVIIDSGTSVTRLTRPAYTALRDAFRVGARHLKRGPEFSLFDTCYDLSGQSSV 381

Query: 424 RVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDG 483
           +VPTV  +F G   + LPA+N+LIPVD+ G+FCFAFA + SGLSIIGNIQQ+G ++ +D 
Sbjct: 382 KVPTVVLHFRGAD-MALPATNYLIPVDENGSFCFAFAGTISGLSIIGNIQQQGFRVVYDL 440

Query: 484 ANGFVGFGPNVC 495
           A   +GF P  C
Sbjct: 441 AGSRIGFAPRGC 452


>gi|242056497|ref|XP_002457394.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
 gi|241929369|gb|EES02514.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
          Length = 509

 Score =  387 bits (995), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 206/355 (58%), Positives = 250/355 (70%), Gaps = 7/355 (1%)

Query: 145 VVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSA 204
           VVSG+ QGSGEYF R+G+GSP R  YMV+D+GSD+ WVQCQPC+ CY+QSDPVFDP+ SA
Sbjct: 158 VVSGVGQGSGEYFSRVGIGSPARELYMVLDTGSDVTWVQCQPCADCYQQSDPVFDPSLSA 217

Query: 205 SFSGVSCSSAVCDRLENAGCH--AGRCRYEVSYGDGSYTKGTLALETLTIG-RTVVKNVA 261
           S++ VSC S  C  L+ A C    G C YEV+YGDGSYT G  A ETLT+G  T V NVA
Sbjct: 218 SYAAVSCDSPRCRDLDTAACRNATGACLYEVAYGDGSYTVGDFATETLTLGDSTPVTNVA 277

Query: 262 IGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREA 321
           IGCGH N+G+FVGAAGLL LGGG +S   Q+   T   FSYCLV R + ++ +L FG + 
Sbjct: 278 IGCGHDNEGLFVGAAGLLALGGGPLSFPSQISAST---FSYCLVDRDSPAASTLQFGADG 334

Query: 322 LPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQM-GDDGVVMDTGTAV 380
                   PLVR+PR  +FYYV LSG+ VGG  + I    F +    G  GV++D+GTAV
Sbjct: 335 AEADTVTAPLVRSPRTGTFYYVALSGISVGGQALSIPSSAFAMDATSGSGGVIVDSGTAV 394

Query: 381 TRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTL 440
           TRL + AY A RDAFV  T +LPR SGVS+FDTCY+LS   SV VP VS  F GG  L L
Sbjct: 395 TRLQSSAYAALRDAFVRGTPSLPRTSGVSLFDTCYDLSDRTSVEVPAVSLRFEGGGALRL 454

Query: 441 PASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           PA N+LIPVD AGT+C AFAP+ + +SIIGN+QQ+G ++SFD A G VGF PN C
Sbjct: 455 PAKNYLIPVDGAGTYCLAFAPTNAAVSIIGNVQQQGTRVSFDTAKGVVGFTPNKC 509


>gi|414876414|tpg|DAA53545.1| TPA: hypothetical protein ZEAMMB73_483039 [Zea mays]
          Length = 506

 Score =  387 bits (993), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 212/377 (56%), Positives = 258/377 (68%), Gaps = 10/377 (2%)

Query: 123 VRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWV 182
           +R  +G    AA   +Q     VVSG+ QGSGEYF R+G+GSP R  YMV+D+GSD+ WV
Sbjct: 136 LRPANGSAVFAASAAIQG---PVVSGVGQGSGEYFSRVGIGSPARQLYMVLDTGSDVTWV 192

Query: 183 QCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCH--AGRCRYEVSYGDGSY 240
           QCQPC+ CY+QSDPVFDP+ SAS++ VSC S  C  L+ A C    G C YEV+YGDGSY
Sbjct: 193 QCQPCADCYQQSDPVFDPSLSASYAAVSCDSQRCRDLDTAACRNATGACLYEVAYGDGSY 252

Query: 241 TKGTLALETLTIG-RTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGA 299
           T G  A ETLT+G  T V NVAIGCGH N+G+FVGAAGLL LGGG +S   Q+   T   
Sbjct: 253 TVGDFATETLTLGDSTPVGNVAIGCGHDNEGLFVGAAGLLALGGGPLSFPSQISAST--- 309

Query: 300 FSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISE 359
           FSYCLV R + ++ +L FG  A   G    PLVR+PR  +FYYV LSG+ VGG  + I  
Sbjct: 310 FSYCLVDRDSPAASTLQFGDGAAEAGTVTAPLVRSPRTSTFYYVALSGISVGGQPLSIPA 369

Query: 360 DLFRLTQM-GDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLS 418
             F +    G  GV++D+GTAVTRL + AY A RDAFV    +LPR SGVS+FDTCY+LS
Sbjct: 370 SAFAMDATSGSGGVIVDSGTAVTRLQSAAYAALRDAFVQGAPSLPRTSGVSLFDTCYDLS 429

Query: 419 GFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQ 478
              SV VP VS  F GG  L LPA N+LIPVD AGT+C AFAP+ + +SIIGN+QQ+G +
Sbjct: 430 DRTSVEVPAVSLRFEGGGALRLPAKNYLIPVDGAGTYCLAFAPTNAAVSIIGNVQQQGTR 489

Query: 479 ISFDGANGFVGFGPNVC 495
           +SFD A G VGF PN C
Sbjct: 490 VSFDTARGAVGFTPNKC 506


>gi|297848386|ref|XP_002892074.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297337916|gb|EFH68333.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 485

 Score =  386 bits (992), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 218/430 (50%), Positives = 281/430 (65%), Gaps = 18/430 (4%)

Query: 73  SSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRD---VKRVATLVRRLSGG 129
           S  E+   L L H D +SS+            Q  F +R+QRD   VK +ATL  ++ G 
Sbjct: 66  SDSESSITLNLDHIDALSSNKTP---------QELFSSRLQRDSRRVKSIATLAAQIPGR 116

Query: 130 GADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQ 189
               A      F + VVSG+ QGSGEYF R+GVG+P R  YMV+D+GSDIVW+QC PC +
Sbjct: 117 NVTHAPR-TGGFSSSVVSGLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRR 175

Query: 190 CYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGR--CRYEVSYGDGSYTKGTLAL 247
           CY QSDP+FDP  S +++ + CSS  C RL++AGC+  R  C Y+VSYGDGS+T G  + 
Sbjct: 176 CYSQSDPIFDPRKSKTYATIPCSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFST 235

Query: 248 ETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR 307
           ETLT  R  VK VA+GCGH N+G+FVGAAGLLGLG G +S  GQ G +    FSYCLV R
Sbjct: 236 ETLTFRRNRVKGVALGCGHDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDR 295

Query: 308 GTGSS-GSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIP-ISEDLFRLT 365
              S   S+VFG  A+   A + PL+ NP+  +FYYV L G+ VGG R+P ++  LF+L 
Sbjct: 296 SASSKPSSVVFGNAAVSRIARFTPLLSNPKLDTFYYVELLGISVGGTRVPGVAASLFKLD 355

Query: 366 QMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRV 425
           Q+G+ GV++D+GT+VTRL  PAY A RDAF      L RA   S+FDTC++LS    V+V
Sbjct: 356 QIGNGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKALKRAPDFSLFDTCFDLSNMNEVKV 415

Query: 426 PTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGAN 485
           PTV  +F G  V +LPA+N+LIPVD  G FCFAFA +  GLSIIGNIQQ+G ++ +D A+
Sbjct: 416 PTVVLHFRGADV-SLPATNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLAS 474

Query: 486 GFVGFGPNVC 495
             VGF P  C
Sbjct: 475 SRVGFAPGGC 484


>gi|356498306|ref|XP_003517994.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 484

 Score =  385 bits (990), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 205/397 (51%), Positives = 261/397 (65%), Gaps = 15/397 (3%)

Query: 110 ARMQRDVKRVATLVRRLS-----------GGGADAAKHEVQDFGTDVVSGMDQGSGEYFV 158
           +R+ RD  RV  L  RL                  A+ E       VVSG  QGSGEYF+
Sbjct: 92  SRLARDSARVKALQTRLDLFLKRVSNSDLHPAESKAEFESNALQGPVVSGTSQGSGEYFL 151

Query: 159 RIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDR 218
           R+G+G PP   Y+V+D+GSD+ W+QC PCS+CY+QSDP+FDP  S S+S + C    C  
Sbjct: 152 RVGIGKPPSQAYVVLDTGSDVSWIQCAPCSECYQQSDPIFDPISSNSYSPIRCDEPQCKS 211

Query: 219 LENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGL 278
           L+ + C  G C YEVSYGDGSYT G  A ET+T+G   V+NVAIGCGH N+G+FVGAAGL
Sbjct: 212 LDLSECRNGTCLYEVSYGDGSYTVGEFATETVTLGSAAVENVAIGCGHNNEGLFVGAAGL 271

Query: 279 LGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAP 338
           LGLGGG +S   Q+   +   FSYCLV+R + +  +L F    LP  AA  PL+RNP   
Sbjct: 272 LGLGGGKLSFPAQVNATS---FSYCLVNRDSDAVSTLEFN-SPLPRNAATAPLMRNPELD 327

Query: 339 SFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQ 398
           +FYY+GL G+ VGG  +PI E  F +  +G  G+++D+GTAVTRL +  Y+A RDAFV  
Sbjct: 328 TFYYLGLKGISVGGEALPIPESSFEVDAIGGGGIIIDSGTAVTRLRSEVYDALRDAFVKG 387

Query: 399 TGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFA 458
              +P+A+GVS+FDTCY+LS   SV +PTVSF F  G  L LPA N+LIPVD  GTFCFA
Sbjct: 388 AKGIPKANGVSLFDTCYDLSSRESVEIPTVSFRFPEGRELPLPARNYLIPVDSVGTFCFA 447

Query: 459 FAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           FAP+ S LSIIGN+QQ+G ++ FD AN  VGF  + C
Sbjct: 448 FAPTTSSLSIIGNVQQQGTRVGFDIANSLVGFSVDSC 484


>gi|356502456|ref|XP_003520035.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 484

 Score =  385 bits (990), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 206/406 (50%), Positives = 266/406 (65%), Gaps = 16/406 (3%)

Query: 102 HRHQHSFH-ARMQRDVKRVATLVRRLS-----------GGGADAAKHEVQDFGTDVVSGM 149
           HR   S   +R+ RD  RV +L  RL                  A+ E       VVSG 
Sbjct: 83  HRDYKSLTLSRLARDSARVKSLQTRLDLVLKRVSNSDLHPAESNAEFEANALQGPVVSGT 142

Query: 150 DQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGV 209
            QGSGEYF+R+G+G PP   Y+V+D+GSD+ W+QC PCS+CY+QSDP+FDP  S S+S +
Sbjct: 143 SQGSGEYFLRVGIGKPPSQAYVVLDTGSDVSWIQCAPCSECYQQSDPIFDPVSSNSYSPI 202

Query: 210 SCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQ 269
            C +  C  L+ + C  G C YEVSYGDGSYT G  A ET+T+G   V+NVAIGCGH N+
Sbjct: 203 RCDAPQCKSLDLSECRNGTCLYEVSYGDGSYTVGEFATETVTLGTAAVENVAIGCGHNNE 262

Query: 270 GMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWV 329
           G+FVGAAGLLGLGGG +S   Q+   +   FSYCLV+R + +  +L F    LP      
Sbjct: 263 GLFVGAAGLLGLGGGKLSFPAQVNATS---FSYCLVNRDSDAVSTLEFN-SPLPRNVVTA 318

Query: 330 PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYE 389
           PL RNP   +FYY+GL G+ VGG  +PI E +F +  +G  G+++D+GTAVTRL +  Y+
Sbjct: 319 PLRRNPELDTFYYLGLKGISVGGEALPIPESIFEVDAIGGGGIIIDSGTAVTRLRSEVYD 378

Query: 390 AFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPV 449
           A RDAFV     +P+A+GVS+FDTCY+LS   SV+VPTVSF+F  G  L LPA N+LIPV
Sbjct: 379 ALRDAFVKGAKGIPKANGVSLFDTCYDLSSRESVQVPTVSFHFPEGRELPLPARNYLIPV 438

Query: 450 DDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           D  GTFCFAFAP+ S LSI+GN+QQ+G ++ FD AN  VGF  + C
Sbjct: 439 DSVGTFCFAFAPTTSSLSIMGNVQQQGTRVGFDIANSLVGFSADSC 484


>gi|15223368|ref|NP_171637.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|9665144|gb|AAF97328.1|AC023628_9 Unknown protein [Arabidopsis thaliana]
 gi|22135930|gb|AAM91547.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
           thaliana]
 gi|30387595|gb|AAP31963.1| At1g01300 [Arabidopsis thaliana]
 gi|332189147|gb|AEE27268.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 485

 Score =  385 bits (988), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 217/422 (51%), Positives = 281/422 (66%), Gaps = 18/422 (4%)

Query: 81  LELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRD---VKRVATLVRRLSGGGADAAKHE 137
           L L H D +SS+  T + +        F +R+QRD   VK +ATL  ++ G     A   
Sbjct: 74  LNLDHIDALSSN-KTPDEL--------FSSRLQRDSRRVKSIATLAAQIPGRNVTHAPRP 124

Query: 138 VQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPV 197
              F + VVSG+ QGSGEYF R+GVG+P R  YMV+D+GSDIVW+QC PC +CY QSDP+
Sbjct: 125 -GGFSSSVVSGLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPI 183

Query: 198 FDPADSASFSGVSCSSAVCDRLENAGCHAGR--CRYEVSYGDGSYTKGTLALETLTIGRT 255
           FDP  S +++ + CSS  C RL++AGC+  R  C Y+VSYGDGS+T G  + ETLT  R 
Sbjct: 184 FDPRKSKTYATIPCSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFRRN 243

Query: 256 VVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSS-GS 314
            VK VA+GCGH N+G+FVGAAGLLGLG G +S  GQ G +    FSYCLV R   S   S
Sbjct: 244 RVKGVALGCGHDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASSKPSS 303

Query: 315 LVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIP-ISEDLFRLTQMGDDGVV 373
           +VFG  A+   A + PL+ NP+  +FYYVGL G+ VGG R+P ++  LF+L Q+G+ GV+
Sbjct: 304 VVFGNAAVSRIARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVI 363

Query: 374 MDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFS 433
           +D+GT+VTRL  PAY A RDAF      L RA   S+FDTC++LS    V+VPTV  +F 
Sbjct: 364 IDSGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPDFSLFDTCFDLSNMNEVKVPTVVLHFR 423

Query: 434 GGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPN 493
           G  V +LPA+N+LIPVD  G FCFAFA +  GLSIIGNIQQ+G ++ +D A+  VGF P 
Sbjct: 424 GADV-SLPATNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLASSRVGFAPG 482

Query: 494 VC 495
            C
Sbjct: 483 GC 484


>gi|449439383|ref|XP_004137465.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 491

 Score =  384 bits (987), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 206/445 (46%), Positives = 293/445 (65%), Gaps = 24/445 (5%)

Query: 61  FERHNNISSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVA 120
           F++  ++  SN+S     ++L+L  RD +       +N  +  ++    +R+ RD  RV 
Sbjct: 61  FQQQVHLVPSNSS---FSFSLQLHPRDSL-------HNAGHKDYKSLVLSRLSRDSSRVK 110

Query: 121 TLVRRLSGGGADAAKHEVQ---------DFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYM 171
           ++  RL    ++  + +++         D  T ++SG  QGSGEYF R+GVG P +  YM
Sbjct: 111 SIYDRLEFALSELKRSDLEPLKTEILPEDLSTPIISGTSQGSGEYFSRVGVGQPAKPFYM 170

Query: 172 VIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCRY 231
           V+D+GSDI W+QCQPC+ CY+Q+DP+FDP  S+SF+ + C S  C  LE +GC A +C Y
Sbjct: 171 VLDTGSDINWLQCQPCTDCYQQTDPIFDPRSSSSFASLPCESQQCQALETSGCRASKCLY 230

Query: 232 EVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVG 290
           +VSYGDGS+T G   +ETLT G + ++ NVA+GCGH N+G+FVG+AGLLGLGGGS+SL  
Sbjct: 231 QVSYGDGSFTVGEFVIETLTFGNSGMINNVAVGCGHDNEGLFVGSAGLLGLGGGSLSLTS 290

Query: 291 QLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGV 350
           Q+      +FSYCLV R + SS  L F   A P  +   PL+++ +  +FYYVGL+G+ V
Sbjct: 291 QM---KASSFSYCLVDRDSSSSSDLEFNSAA-PSDSVNAPLLKSGKVDTFYYVGLTGMSV 346

Query: 351 GGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSI 410
           GG  + I  +LF++   G  G+++D+GTA+TRL T AY   RDAFV++T  L + +G ++
Sbjct: 347 GGQLLSIPPNLFQMDDSGYGGIIVDSGTAITRLQTQAYNTLRDAFVSRTPYLKKTNGFAL 406

Query: 411 FDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIG 470
           FDTCY+LS    V +PTVSF F+GG  L LP  N+LIPVD  GTFCFAFAP+ S LSIIG
Sbjct: 407 FDTCYDLSSQSRVTIPTVSFEFAGGKSLQLPPKNYLIPVDSVGTFCFAFAPTTSSLSIIG 466

Query: 471 NIQQEGIQISFDGANGFVGFGPNVC 495
           N+QQ+G ++ +D AN  VGF P+ C
Sbjct: 467 NVQQQGTRVHYDLANSVVGFSPHKC 491


>gi|21594980|gb|AAM66061.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
           thaliana]
          Length = 485

 Score =  384 bits (987), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 216/422 (51%), Positives = 278/422 (65%), Gaps = 18/422 (4%)

Query: 81  LELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRV---ATLVRRLSGGGADAAKHE 137
           L L H D +SS+            Q  F +R+QRD +RV   ATL  ++ G     A   
Sbjct: 74  LNLDHIDALSSNKTP---------QELFSSRLQRDSRRVRSIATLAAQIPGRNVTHAPRP 124

Query: 138 VQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPV 197
              F + VVSG+ QGSGEYF R+GVG+P R  YMV+D+GSDIVW+QC PC +CY QSDP+
Sbjct: 125 -GGFSSSVVSGLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPI 183

Query: 198 FDPADSASFSGVSCSSAVCDRLENAGCHAGR--CRYEVSYGDGSYTKGTLALETLTIGRT 255
           FDP  S +++ + CSS  C RL++AGC+  R  C Y+VSYGDGS+T G  + ETLT  R 
Sbjct: 184 FDPRKSKTYATIPCSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFRRN 243

Query: 256 VVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSS-GS 314
            VK VA+GCGH N+G+FVGAAGLLGLG G +S  GQ G +    FSYCLV R   S   S
Sbjct: 244 RVKGVALGCGHDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASSKPSS 303

Query: 315 LVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIP-ISEDLFRLTQMGDDGVV 373
           +VFG  A+   A + PL+ NP+  +FYYVGL G+ VGG R+P ++  LF+L Q+G+ GV+
Sbjct: 304 VVFGNAAVSRIARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVI 363

Query: 374 MDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFS 433
           +D+GT+VTRL  PAY A RDAF      L RA   S+FDTC++LS    V+VPTV  +F 
Sbjct: 364 IDSGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPNFSLFDTCFDLSNMNEVKVPTVVLHFR 423

Query: 434 GGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPN 493
              V +LPA+N+LIPVD  G FCFAFA +  GLSIIGNIQQ+G ++ +D A+  VGF P 
Sbjct: 424 RADV-SLPATNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLASSRVGFAPG 482

Query: 494 VC 495
            C
Sbjct: 483 GC 484


>gi|225423917|ref|XP_002281973.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
          Length = 491

 Score =  383 bits (983), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 203/409 (49%), Positives = 277/409 (67%), Gaps = 19/409 (4%)

Query: 102 HRHQHSFH-----ARMQRDVKRVATLVRRLSGGGADAAKHEVQDFG---------TDVVS 147
           H+  H  +     AR++RD  RV +L  R+    A   K +++            T +VS
Sbjct: 87  HKSSHKDYKSLVLARLERDSDRVRSLATRMDLAIAGITKSDLKPVEKELEAEALETPLVS 146

Query: 148 GMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFS 207
           G  QGSGEYF R+G+GSPP+  YMV+D+GSD+ WVQC PC+ CY+Q+DP+F+P+ S+S++
Sbjct: 147 GASQGSGEYFSRVGIGSPPKHVYMVVDTGSDVNWVQCAPCADCYQQADPIFEPSFSSSYA 206

Query: 208 GVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTI-GRTVVKNVAIGCGH 266
            ++C +  C  L+ + C    C YEVSYGDGSYT G  A ET+T+ G   + NVAIGCGH
Sbjct: 207 PLTCETHQCKSLDVSECRNDSCLYEVSYGDGSYTVGDFATETITLDGSASLNNVAIGCGH 266

Query: 267 KNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGA 326
            N+G+FVGAAGLLGLGGGS+S   Q+      +FSYCLV+R T S+ +L F    +P  +
Sbjct: 267 DNEGLFVGAAGLLGLGGGSLSFPSQINA---SSFSYCLVNRDTDSASTLEFN-SPIPSHS 322

Query: 327 AWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTP 386
              PL+RN +  +FYY+G++G+GVGG  + I    F + + G+ G+++D+GTAVTRL + 
Sbjct: 323 VTAPLLRNNQLDTFYYLGMTGIGVGGQMLSIPRSSFEVDESGNGGIIVDSGTAVTRLQSD 382

Query: 387 AYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFL 446
            Y + RD+FV  T +LP  SGV++FDTCY+LS   SV VPTVSF+F  G  L LPA N+L
Sbjct: 383 VYNSLRDSFVRGTQHLPSTSGVALFDTCYDLSSRSSVEVPTVSFHFPDGKYLALPAKNYL 442

Query: 447 IPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           IPVD AGTFCFAFAP+ S LSIIGN+QQ+G ++S+D +N  VGF PN C
Sbjct: 443 IPVDSAGTFCFAFAPTTSALSIIGNVQQQGTRVSYDLSNSLVGFSPNGC 491


>gi|225455876|ref|XP_002275164.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
          Length = 496

 Score =  382 bits (982), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 213/488 (43%), Positives = 303/488 (62%), Gaps = 27/488 (5%)

Query: 22  IITTSTSAASDTHFQILNVNESIKGSR---TDHAKMSQYNELFERHNNISSSNTSSDEAR 78
           + +   S  +D+H  +L+V+ SI+ +    +  + +S+ ++  +R    +S + +S  + 
Sbjct: 22  VFSRELSLDTDSHSSVLDVSGSIRKTLDVLSHKSSVSKPSD--QRDEKTTSFSPTSLASS 79

Query: 79  WNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEV 138
           ++LEL  R+ +   S       +  ++    +R+ RD  RV  +  +L    +   K ++
Sbjct: 80  FSLELHPRELLHGGS-------HKDYRALMLSRLARDSARVKAINTKLQLAVSGTDKSDL 132

Query: 139 ----------QDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS 188
                     QDF T V SG  QGSGEYF+R+G+G P ++ YMVID+GSD+ W+QC+PC 
Sbjct: 133 VPMDTEILHPQDFSTPVTSGTSQGSGEYFLRVGIGRPSKTFYMVIDTGSDVNWLQCKPCD 192

Query: 189 QCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALE 248
            CY+Q DP+FDPA S+SFS + C +  C  L+   C    C Y+VSYGDGSYT G  A E
Sbjct: 193 DCYQQVDPIFDPASSSSFSRLGCQTPQCRNLDVFACRNDSCLYQVSYGDGSYTVGDFATE 252

Query: 249 TLTIGRT-VVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR 307
           T++ G +  V  VAIGCGH N+G+FVGAAGL+GLGGG +SL  Q+      +FSYCLV+R
Sbjct: 253 TVSFGNSGSVDKVAIGCGHDNEGLFVGAAGLIGLGGGPLSLTSQI---KASSFSYCLVNR 309

Query: 308 GTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQM 367
            +  S +L F   A P  +   P+ +N +  +FYYVG++G+ VGG ++ I   +F +   
Sbjct: 310 DSVDSSTLEFN-SAKPSDSVTAPIFKNSKVDTFYYVGITGMSVGGEKLAIPPSIFEVDGS 368

Query: 368 GDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPT 427
           G  G+++D GTAVTRL T AY A RD FV  T +LP  SG ++FDTCYNLS   SVRVPT
Sbjct: 369 GKGGIIVDCGTAVTRLQTQAYNALRDTFVKLTKDLPSTSGFALFDTCYNLSSRTSVRVPT 428

Query: 428 VSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGF 487
           V+F F GG  L LP SN+LIPVD AGTFC AFAP+ + LSIIGN+QQ+G ++++D AN  
Sbjct: 429 VAFLFDGGKSLPLPPSNYLIPVDSAGTFCLAFAPTTASLSIIGNVQQQGTRVTYDLANSQ 488

Query: 488 VGFGPNVC 495
           V F    C
Sbjct: 489 VSFSSRKC 496


>gi|449486856|ref|XP_004157423.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 491

 Score =  381 bits (979), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 204/445 (45%), Positives = 291/445 (65%), Gaps = 24/445 (5%)

Query: 61  FERHNNISSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVA 120
           F++  ++  SN+S     ++L+L  RD +       +N  +  ++    +R+ RD  RV 
Sbjct: 61  FQQQVHLVPSNSS---FSFSLQLHPRDSL-------HNAGHKDYKSLVLSRLSRDSSRVK 110

Query: 121 TLVRRLSGGGADAAKHEVQ---------DFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYM 171
           ++  RL    ++  + +++         D  T ++SG  QGSGEYF R+GVG P +  YM
Sbjct: 111 SIYDRLEFALSELKRSDLEPLKTEILPEDLSTPIISGTSQGSGEYFSRVGVGQPAKPFYM 170

Query: 172 VIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCRY 231
           V+D+GSDI W+QCQPC+ CY+Q+DP+FDP  S+SF+ + C S  C  LE +GC A +C Y
Sbjct: 171 VLDTGSDINWLQCQPCTDCYQQTDPIFDPRSSSSFASLPCESQQCQALETSGCRASKCLY 230

Query: 232 EVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVG 290
           +VSYGDGS+T G    ETLT G + ++ +VA+GCGH N+G+FVG+AGLLGLGGG +SL  
Sbjct: 231 QVSYGDGSFTVGEFVTETLTFGNSGMINDVAVGCGHDNEGLFVGSAGLLGLGGGPLSLTS 290

Query: 291 QLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGV 350
           Q+      +FSYCLV R + SS  L F   A P  +   PL+++ +  +FYYVGL+G+ V
Sbjct: 291 QM---KASSFSYCLVDRDSSSSSDLEFNSAA-PSDSVNAPLLKSGKVDTFYYVGLTGMSV 346

Query: 351 GGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSI 410
           GG  + I  +LF++   G  G+++D+GTA+TRL T AY   RDAFV++T  L + +G ++
Sbjct: 347 GGQLLSIPPNLFQMDDSGYGGIIVDSGTAITRLQTQAYNTLRDAFVSRTPYLKKTNGFAL 406

Query: 411 FDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIG 470
           FDTCY+LS    V +PTVSF F+GG  L LP  N+LIPVD  GTFCFAFAP+ S LSIIG
Sbjct: 407 FDTCYDLSSQSRVTIPTVSFEFAGGKSLQLPPKNYLIPVDSVGTFCFAFAPTTSSLSIIG 466

Query: 471 NIQQEGIQISFDGANGFVGFGPNVC 495
           N+QQ+G ++ +D AN  VGF P+ C
Sbjct: 467 NVQQQGTRVHYDLANSVVGFSPHKC 491


>gi|224130548|ref|XP_002320868.1| predicted protein [Populus trichocarpa]
 gi|222861641|gb|EEE99183.1| predicted protein [Populus trichocarpa]
          Length = 488

 Score =  380 bits (976), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 217/467 (46%), Positives = 289/467 (61%), Gaps = 17/467 (3%)

Query: 34  HFQILNVNESIKGSRTDHAKMSQYNELFERHNNISSSNTSSDEARWNLELVHRDKMSSSS 93
            FQ L +N          A      + F   +  +S  +SS     +++L H D +SS  
Sbjct: 33  QFQTLTLNPLPNKPTISWADTEPGTQTFT--DQTTSEPSSSATTFLSVQLHHIDALSSDK 90

Query: 94  NTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSG-GGADAAKHEVQDFGTDVVSGMDQG 152
           ++         Q  F++R+ RD  RV +L+   +  GG +  +     F + V+SG+ QG
Sbjct: 91  SS---------QDLFNSRLVRDAARVKSLISLAATVGGTNLTRARGPGFSSSVISGLAQG 141

Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
           SGEYF R+GVG+P R  YMV+D+GSDIVW+QC PC +CY Q+DPVFDP  S SF+ + C 
Sbjct: 142 SGEYFTRLGVGTPARYVYMVLDTGSDIVWIQCAPCIKCYSQTDPVFDPTKSRSFANIPCG 201

Query: 213 SAVCDRLENAGCHAGR--CRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQG 270
           S +C RL+  GC   +  C Y+VSYGDGS+T G  + ETLT   T V  V +GCGH N+G
Sbjct: 202 SPLCRRLDYPGCSTKKQICLYQVSYGDGSFTVGEFSTETLTFRGTRVGRVVLGCGHDNEG 261

Query: 271 MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSS-GSLVFGREALPVGAAWV 329
           +FVGAAGLLGLG G +S   Q+G +    FSYCL  R   S   S+VFG  A+     + 
Sbjct: 262 LFVGAAGLLGLGRGRLSFPSQIGRRFNSKFSYCLGDRSASSRPSSIVFGDSAISRTTRFT 321

Query: 330 PLVRNPRAPSFYYVGLSGLGVGGMRIP-ISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAY 388
           PL+ NP+  +FYYV L G+ VGG R+  IS  LF+L   G+ GV++D+GT+VTRL   AY
Sbjct: 322 PLLSNPKLDTFYYVELLGISVGGTRVSGISASLFKLDSTGNGGVIIDSGTSVTRLTRAAY 381

Query: 389 EAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIP 448
            A RDAF+    NL RA   S+FDTC++LSG   V+VPTV  +F G  V  LPASN+LIP
Sbjct: 382 VALRDAFLVGASNLKRAPEFSLFDTCFDLSGKTEVKVPTVVLHFRGADV-PLPASNYLIP 440

Query: 449 VDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           VD++G+FCFAFA + SGLSIIGNIQQ+G ++ +D A   VGF P  C
Sbjct: 441 VDNSGSFCFAFAGTASGLSIIGNIQQQGFRVVYDLATSRVGFAPRGC 487


>gi|224067990|ref|XP_002302634.1| predicted protein [Populus trichocarpa]
 gi|222844360|gb|EEE81907.1| predicted protein [Populus trichocarpa]
          Length = 490

 Score =  380 bits (975), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 217/467 (46%), Positives = 290/467 (62%), Gaps = 15/467 (3%)

Query: 34  HFQILNVNESIKGSRTDHAKMSQYNELFERHNNISSSNTSSDEARWNLELVHRDKMSSSS 93
            FQ L VN          A     +E   +    S+S  +S     +++L H D +SS  
Sbjct: 33  QFQTLTVNPLPNKPTLSWADTEPESEPETQTLTDSTSTEASTTTSLSVQLHHLDALSSDE 92

Query: 94  NTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSG-GGADAAKHEVQDFGTDVVSGMDQG 152
                      Q  F++R+ RD  RV +L    +  G  +  +     F + V SG+ QG
Sbjct: 93  TP---------QDLFNSRLARDASRVKSLTSLAAAVGSTNRTRARGPGFSSSVTSGLAQG 143

Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
           SGEYF R+GVG+P R  +MV+D+GSD+VW+QC PC +CY Q+DPVF+P  S SF+ + C 
Sbjct: 144 SGEYFTRLGVGTPARYVFMVLDTGSDVVWIQCAPCKKCYSQTDPVFNPTKSRSFANIPCG 203

Query: 213 SAVCDRLENAGCHAGR--CRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQG 270
           S +C RL++ GC   +  C Y+VSYGDGS+T G  + ETLT   T V  VA+GCGH N+G
Sbjct: 204 SPLCRRLDSPGCSTKKHICLYQVSYGDGSFTYGEFSTETLTFRGTRVGRVALGCGHDNEG 263

Query: 271 MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGS-LVFGREALPVGAAWV 329
           +F+GAAGLLGLG G +S   Q+G +    FSYCLV R   S  S +VFG  A+   A + 
Sbjct: 264 LFIGAAGLLGLGRGRLSFPSQIGRRFSRKFSYCLVDRSASSKPSYMVFGDSAISRTARFT 323

Query: 330 PLVRNPRAPSFYYVGLSGLGVGGMRIP-ISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAY 388
           PLV NP+  +FYYV L G+ VGG R+P I+  LF+L   G+ GV++D+GT+VTRL  PAY
Sbjct: 324 PLVSNPKLDTFYYVELLGVSVGGTRVPGITASLFKLDSTGNGGVIIDSGTSVTRLTRPAY 383

Query: 389 EAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIP 448
            A RDAF     NL RA   S+FDTC++LSG   V+VPTV  +F G  V +LPASN+LIP
Sbjct: 384 VALRDAFRVGASNLKRAPEFSLFDTCFDLSGKTEVKVPTVVLHFRGADV-SLPASNYLIP 442

Query: 449 VDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           VD++G+FCFAFA + SGLSI+GNIQQ+G ++ +D A   VGF P  C
Sbjct: 443 VDNSGSFCFAFAGTMSGLSIVGNIQQQGFRVVYDLAASRVGFAPRGC 489


>gi|356531224|ref|XP_003534178.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 492

 Score =  379 bits (974), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 214/475 (45%), Positives = 301/475 (63%), Gaps = 26/475 (5%)

Query: 33  THFQILNVNESIKGSRTDHAKMSQYNELFERHNNISSSNTSSDEAR--WNLELVHRDKMS 90
           +H  +L+V+ S+  +   H  +S   +L E  ++ + + TS   +   ++L+L  R+   
Sbjct: 32  SHTNVLDVSSSLHQA---HQILSFNPQLLEEQSSETETPTSPSSSSSSFSLQLHPRE--- 85

Query: 91  SSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEV----------QD 140
               T  N  +  ++    +R+ RD  RV +L  +L    +   + ++          +D
Sbjct: 86  ----TLLNEQHPNYKTLVLSRLARDTARVNSLNTKLQLALSSLNRSDLYPTETELLRPED 141

Query: 141 FGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDP 200
             T V SG  QGSGEYF R+GVG P +  YMV+D+GSD+ W+QC+PCS CY+QSDP+FDP
Sbjct: 142 LSTPVSSGTAQGSGEYFSRVGVGQPSKPFYMVLDTGSDVNWLQCKPCSDCYQQSDPIFDP 201

Query: 201 ADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNV 260
             S+S++ ++C +  C  LE + C  G+C Y+VSYGDGS+T G    ET++ G   V  V
Sbjct: 202 TASSSYNPLTCDAQQCQDLEMSACRNGKCLYQVSYGDGSFTVGEYVTETVSFGAGSVNRV 261

Query: 261 AIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGRE 320
           AIGCGH N+G+FVG+AGLLGLGGG +SL  Q+      +FSYCLV R +G S +L F   
Sbjct: 262 AIGCGHDNEGLFVGSAGLLGLGGGPLSLTSQIKAT---SFSYCLVDRDSGKSSTLEFN-S 317

Query: 321 ALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAV 380
             P  +   PL++N +  +FYYV L+G+ VGG  + +  + F + Q G  GV++D+GTA+
Sbjct: 318 PRPGDSVVAPLLKNQKVNTFYYVELTGVSVGGEIVTVPPETFAVDQSGAGGVIVDSGTAI 377

Query: 381 TRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTL 440
           TRL T AY + RDAF  +T NL  A GV++FDTCY+LS   SVRVPTVSF+FSG     L
Sbjct: 378 TRLRTQAYNSVRDAFKRKTSNLRPAEGVALFDTCYDLSSLQSVRVPTVSFHFSGDRAWAL 437

Query: 441 PASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           PA N+LIPVD AGT+CFAFAP+ S +SIIGN+QQ+G ++SFD AN  VGF PN C
Sbjct: 438 PAKNYLIPVDGAGTYCFAFAPTTSSMSIIGNVQQQGTRVSFDLANSLVGFSPNKC 492


>gi|449432044|ref|XP_004133810.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 471

 Score =  379 bits (973), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 210/398 (52%), Positives = 266/398 (66%), Gaps = 12/398 (3%)

Query: 105 QHSFHARMQRDVKRVATLVRRLSGGGADAAKHE----VQDFGTDVVSGMDQGSGEYFVRI 160
           +  FH R+QRD  RV    ++LS  GA +           F + V+SG+ QGSGEYF RI
Sbjct: 78  EELFHLRLQRDAIRV----KKLSSLGATSRNLSKPGGTTGFSSSVISGLAQGSGEYFTRI 133

Query: 161 GVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLE 220
           GVG+PP+  YMV+D+GSDIVW+QC PC  CY Q+DPVF+P  S SF+ V C + +C RLE
Sbjct: 134 GVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKSGSFAKVLCRTPLCRRLE 193

Query: 221 NAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLL 279
           + GC+  + C Y+VSYGDGSYT G    ETLT  RT V+ VA+GCGH N+G+FVGAAGLL
Sbjct: 194 SPGCNQRQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVEQVALGCGHDNEGLFVGAAGLL 253

Query: 280 GLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSS-GSLVFGREALPVGAAWVPLVRNPRAP 338
           GLG G +S   Q G      FSYCLV R   S   S+VFG  A+   A + PL+ NPR  
Sbjct: 254 GLGRGGLSFPSQAGRTFNQKFSYCLVDRSASSKPSSVVFGNSAVSRTARFTPLLTNPRLD 313

Query: 339 SFYYVGLSGLGVGGMRIP-ISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVA 397
           +FYYV L G+ VGG  +  I+   F+L + G+ GV++D GT+VTRL  PAY A RDAF A
Sbjct: 314 TFYYVELLGISVGGTPVSGITASHFKLDRTGNGGVIIDCGTSVTRLNKPAYIALRDAFRA 373

Query: 398 QTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCF 457
              +L  A   S+FDTCY+LSG  +V+VPTV  +F G  V +LPASN+LIPVD +G FCF
Sbjct: 374 GASSLKSAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGADV-SLPASNYLIPVDGSGRFCF 432

Query: 458 AFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           AFA + SGLSIIGNIQQ+G ++ +D A+  VGF P  C
Sbjct: 433 AFAGTTSGLSIIGNIQQQGFRVVYDLASSRVGFSPRGC 470


>gi|115434442|ref|NP_001041979.1| Os01g0140100 [Oryza sativa Japonica Group]
 gi|12328547|dbj|BAB21205.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
 gi|113531510|dbj|BAF03893.1| Os01g0140100 [Oryza sativa Japonica Group]
 gi|125568961|gb|EAZ10476.1| hypothetical protein OsJ_00309 [Oryza sativa Japonica Group]
 gi|215697206|dbj|BAG91200.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 504

 Score =  376 bits (965), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 206/367 (56%), Positives = 254/367 (69%), Gaps = 11/367 (2%)

Query: 132 DAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCY 191
           +A+  E+Q     VVSG+  GSGEYF R+GVGSP R  YMV+D+GSD+ WVQCQPC+ CY
Sbjct: 146 EASAAEIQG---PVVSGVGLGSGEYFSRVGVGSPARQLYMVLDTGSDVTWVQCQPCADCY 202

Query: 192 KQSDPVFDPADSASFSGVSCSSAVCDRLENAGCH--AGRCRYEVSYGDGSYTKGTLALET 249
           +QSDPVFDP+ S S++ V+C +  C  L+ A C    G C YEV+YGDGSYT G  A ET
Sbjct: 203 QQSDPVFDPSLSTSYASVACDNPRCHDLDAAACRNSTGACLYEVAYGDGSYTVGDFATET 262

Query: 250 LTIGRTV-VKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRG 308
           LT+G +  V +VAIGCGH N+G+FVGAAGLL LGGG +S   Q+   T   FSYCLV R 
Sbjct: 263 LTLGDSAPVSSVAIGCGHDNEGLFVGAAGLLALGGGPLSFPSQISATT---FSYCLVDRD 319

Query: 309 TGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMG 368
           + SS +L FG  A        PL+R+PR  +FYYVGLSGL VGG  + I    F +   G
Sbjct: 320 SPSSSTLQFGDAA--DAEVTAPLIRSPRTSTFYYVGLSGLSVGGQILSIPPSAFAMDSTG 377

Query: 369 DDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTV 428
             GV++D+GTAVTRL + AY A RDAFV  T +LPR SGVS+FDTCY+LS   SV VP V
Sbjct: 378 AGGVIVDSGTAVTRLQSSAYAALRDAFVRGTQSLPRTSGVSLFDTCYDLSDRTSVEVPAV 437

Query: 429 SFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFV 488
           S  F+GG  L LPA N+LIPVD AGT+C AFAP+ + +SIIGN+QQ+G ++SFD A   V
Sbjct: 438 SLRFAGGGELRLPAKNYLIPVDGAGTYCLAFAPTNAAVSIIGNVQQQGTRVSFDTAKSTV 497

Query: 489 GFGPNVC 495
           GF  N C
Sbjct: 498 GFTTNKC 504


>gi|356522504|ref|XP_003529886.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 473

 Score =  375 bits (963), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 208/424 (49%), Positives = 275/424 (64%), Gaps = 14/424 (3%)

Query: 76  EARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAK 135
           E   +L L H D +S +   +           FH R++RD  RV TL    +        
Sbjct: 59  EPTTSLSLHHIDALSFNKTPS---------QLFHLRLERDAARVKTLTHLAAATNKTRPA 109

Query: 136 HEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD 195
           +    F + VVSG+ QGSGEYF R+GVG+PP+  YMV+D+GSD+VW+QC+PC++CY Q+D
Sbjct: 110 NPGSGFSSSVVSGLSQGSGEYFTRLGVGTPPKYLYMVLDTGSDVVWLQCKPCTKCYSQTD 169

Query: 196 PVFDPADSASFSGVSCSSAVCDRLENAGCHAGR--CRYEVSYGDGSYTKGTLALETLTIG 253
            +FDP+ S SF+G+ C S +C RL++ GC      C+Y+VSYGDGS+T G  + ETLT  
Sbjct: 170 QIFDPSKSKSFAGIPCYSPLCRRLDSPGCSLKNNLCQYQVSYGDGSFTFGDFSTETLTFR 229

Query: 254 RTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR-GTGSS 312
           R  V  VAIGCGH N+G+FVGAAGLLGLG G +S   Q G +    FSYCL  R  +   
Sbjct: 230 RAAVPRVAIGCGHDNEGLFVGAAGLLGLGRGGLSFPTQTGTRFNNKFSYCLTDRTASAKP 289

Query: 313 GSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIP-ISEDLFRLTQMGDDG 371
            S+VFG  A+   A + PLV+NP+  +FYYV L G+ VGG  +  IS   FRL   G+ G
Sbjct: 290 SSIVFGDSAVSRTARFTPLVKNPKLDTFYYVELLGISVGGAPVRGISASFFRLDSTGNGG 349

Query: 372 VVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFY 431
           V++D+GT+VTRL  PAY + RDAF     +L RA   S+FDTCY+LSG   V+VPTV  +
Sbjct: 350 VIIDSGTSVTRLTRPAYVSLRDAFRVGASHLKRAPEFSLFDTCYDLSGLSEVKVPTVVLH 409

Query: 432 FSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFG 491
           F G  V +LPA+N+L+PVD++G+FCFAFA + SGLSIIGNIQQ+G ++ FD A   VGF 
Sbjct: 410 FRGADV-SLPAANYLVPVDNSGSFCFAFAGTMSGLSIIGNIQQQGFRVVFDLAGSRVGFA 468

Query: 492 PNVC 495
           P  C
Sbjct: 469 PRGC 472


>gi|125524353|gb|EAY72467.1| hypothetical protein OsI_00323 [Oryza sativa Indica Group]
          Length = 500

 Score =  374 bits (960), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 220/441 (49%), Positives = 276/441 (62%), Gaps = 33/441 (7%)

Query: 75  DEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRD-----------------VK 117
           +E R  L L  RD +        +  Y   +    AR++RD                 V 
Sbjct: 73  EEGRLALRLHSRDFLPEEQGRQRHASY---RSLVLARLRRDSARAAAVSARAAMAADGVS 129

Query: 118 RVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGS 177
           R   +   ++   A AA  E+Q     VVSG+  GSGEYF R+GVGSP R  YMV+D+GS
Sbjct: 130 RFDLVPANVTAFEASAA--EIQG---PVVSGVGLGSGEYFSRVGVGSPARQLYMVLDTGS 184

Query: 178 DIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCH--AGRCRYEVSY 235
           D+ WVQCQPC+ CY+QSDPVFDP+ S S++ V+C +  C  L+ A C    G C YEV+Y
Sbjct: 185 DVTWVQCQPCADCYQQSDPVFDPSLSTSYASVACDNPRCHDLDAAACRNSTGACLYEVAY 244

Query: 236 GDGSYTKGTLALETLTIGRTV-VKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGG 294
           GDGSYT G  A ETLT+G +  V +VAIGCGH N+G+FVGAAGLL LGGG +S   Q+  
Sbjct: 245 GDGSYTVGDFATETLTLGDSAPVSSVAIGCGHDNEGLFVGAAGLLALGGGPLSFPSQISA 304

Query: 295 QTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMR 354
            T   FSYCLV R + SS +L FG  A        PL+R+PR  +FYYVGLSG+ VGG  
Sbjct: 305 TT---FSYCLVDRDSPSSSTLQFGDAA--DAEVTAPLIRSPRTSTFYYVGLSGISVGGQI 359

Query: 355 IPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTC 414
           + I    F +   G  GV++D+GTAVTRL + AY A RDAFV  T +LPR SGVS+FDTC
Sbjct: 360 LSIPPSAFAMDGTGAGGVIVDSGTAVTRLQSSAYAALRDAFVRGTQSLPRTSGVSLFDTC 419

Query: 415 YNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQ 474
           Y+LS   SV VP VS  F+GG  L LPA N+LIPVD AGT+C AFAP+ + +SIIGN+QQ
Sbjct: 420 YDLSDRTSVEVPAVSLRFAGGGELRLPAKNYLIPVDGAGTYCLAFAPTNAAVSIIGNVQQ 479

Query: 475 EGIQISFDGANGFVGFGPNVC 495
           +G ++SFD A   VGF  N C
Sbjct: 480 QGTRVSFDTAKSTVGFTSNKC 500


>gi|225470916|ref|XP_002263964.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
 gi|147788999|emb|CAN64659.1| hypothetical protein VITISV_009613 [Vitis vinifera]
          Length = 489

 Score =  373 bits (957), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 209/442 (47%), Positives = 282/442 (63%), Gaps = 18/442 (4%)

Query: 62  ERHNNISSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVAT 121
           E    IS+   S  +    + L HRD ++ ++           +  F+ R+QRD  RV  
Sbjct: 57  ETETQISTLPVSETDPTMTMHLEHRDVLAFNAT---------PEALFNLRLQRDAFRVEA 107

Query: 122 LVR-----RLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSG 176
           L +          G +    +   F + V SG+ QGSGEYF R+GVG+PP+  YMV+D+G
Sbjct: 108 LSKMAAAAGGRRAGRNGTHAQGGGFSSSVTSGLAQGSGEYFTRLGVGTPPKYVYMVLDTG 167

Query: 177 SDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGR-CRYEVSY 235
           SD+VW+QC PC +CY Q+DPVFDP  S SFS +SC S +C RL++ GC++ + C Y+V+Y
Sbjct: 168 SDVVWIQCAPCRKCYSQTDPVFDPKKSGSFSSISCRSPLCLRLDSPGCNSRQSCLYQVAY 227

Query: 236 GDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQ 295
           GDGS+T G  + ETLT   T V  VA+GCGH N+G+FVGAAGLLGLG G +S   Q G +
Sbjct: 228 GDGSFTFGEFSTETLTFRGTRVPKVALGCGHDNEGLFVGAAGLLGLGRGRLSFPTQTGLR 287

Query: 296 TGGAFSYCLVSRGTGSS-GSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMR 354
            G  FSYCLV R   S   S+VFG+ A+   A + PL+ NP+  +FYY+ L+G+ VGG R
Sbjct: 288 FGRKFSYCLVDRSASSKPSSVVFGQSAVSRTAVFTPLITNPKLDTFYYLELTGISVGGAR 347

Query: 355 IP-ISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDT 413
           +  I+  LF+L   G+ GV++D+GT+VTRL   AY + RDAF A   +L RA   S+FDT
Sbjct: 348 VAGITASLFKLDTAGNGGVIIDSGTSVTRLTRRAYVSLRDAFRAGAADLKRAPDYSLFDT 407

Query: 414 CYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQ 473
           C++LSG   V+VPTV  +F G  V +LPA+N+LIPVD  G FCFAFA + SGLSIIGNIQ
Sbjct: 408 CFDLSGKTEVKVPTVVMHFRGADV-SLPATNYLIPVDTNGVFCFAFAGTMSGLSIIGNIQ 466

Query: 474 QEGIQISFDGANGFVGFGPNVC 495
           Q+G ++ FD A   +GF    C
Sbjct: 467 QQGFRVVFDVAASRIGFAARGC 488


>gi|356573235|ref|XP_003554768.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 472

 Score =  372 bits (956), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 208/420 (49%), Positives = 280/420 (66%), Gaps = 15/420 (3%)

Query: 80  NLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQ 139
           +L L H D +SS+            +  F  R+QRD KRV  +V  L+      A+    
Sbjct: 63  SLHLHHIDALSSNKTP---------EQLFQLRLQRDAKRVEGVVA-LAALNQSHARRSGS 112

Query: 140 DFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFD 199
            F + ++SG+ QGSGEYF RIGVG+P R  YMV+D+GSD+VW+QC PC +CY Q+DPVFD
Sbjct: 113 SFSSSIISGLAQGSGEYFTRIGVGTPARYVYMVLDTGSDVVWLQCAPCRKCYTQADPVFD 172

Query: 200 PADSASFSGVSCSSAVCDRLENAGCHAGR--CRYEVSYGDGSYTKGTLALETLTIGRTVV 257
           P  S +++G+ C + +C RL++ GC+     C+Y+VSYGDGS+T G  + ETLT  RT V
Sbjct: 173 PTKSRTYAGIPCGAPLCRRLDSPGCNNKNKVCQYQVSYGDGSFTFGDFSTETLTFRRTRV 232

Query: 258 KNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR-GTGSSGSLV 316
             VA+GCGH N+G+F+GAAGLLGLG G +S   Q G +    FSYCLV R  +    S+V
Sbjct: 233 TRVALGCGHDNEGLFIGAAGLLGLGRGRLSFPVQTGRRFNQKFSYCLVDRSASAKPSSVV 292

Query: 317 FGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIP-ISEDLFRLTQMGDDGVVMD 375
           FG  A+   A + PL++NP+  +FYY+ L G+ VGG  +  +S  LFRL   G+ GV++D
Sbjct: 293 FGDSAVSRTARFTPLIKNPKLDTFYYLELLGISVGGSPVRGLSASLFRLDAAGNGGVIID 352

Query: 376 TGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGG 435
           +GT+VTRL  PAY A RDAF     +L RA+  S+FDTC++LSG   V+VPTV  +F G 
Sbjct: 353 SGTSVTRLTRPAYIALRDAFRVGASHLKRAAEFSLFDTCFDLSGLTEVKVPTVVLHFRGA 412

Query: 436 PVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
            V +LPA+N+LIPVD++G+FCFAFA + SGLSIIGNIQQ+G ++SFD A   VGF P  C
Sbjct: 413 DV-SLPATNYLIPVDNSGSFCFAFAGTMSGLSIIGNIQQQGFRVSFDLAGSRVGFAPRGC 471


>gi|326517992|dbj|BAK07248.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 500

 Score =  372 bits (954), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 202/367 (55%), Positives = 254/367 (69%), Gaps = 11/367 (2%)

Query: 132 DAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCY 191
           +A+  E+Q     VVSG+ QGSGEYF R+GVG P R  YMV+D+GSD+ W+QCQPC+ CY
Sbjct: 142 EASAAEIQG---PVVSGVGQGSGEYFSRVGVGRPARQLYMVLDTGSDVTWLQCQPCADCY 198

Query: 192 KQSDPVFDPADSASFSGVSCSSAVCDRLENAGCH--AGRCRYEVSYGDGSYTKGTLALET 249
            QSDPV+DP+ S S++ V C S  C  L+ A C    G C YEV+YGDGSYT G  A ET
Sbjct: 199 AQSDPVYDPSVSTSYATVGCDSPRCRDLDAAACRNSTGSCLYEVAYGDGSYTVGDFATET 258

Query: 250 LTIGRTV-VKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRG 308
           LT+G +  V NVAIGCGH N+G+FVGAAGLL LGGG +S   Q+   T   FSYCLV R 
Sbjct: 259 LTLGDSAPVSNVAIGCGHDNEGLFVGAAGLLALGGGPLSFPSQISATT---FSYCLVDRD 315

Query: 309 TGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMG 368
           + SS +L FG    P  A   PL+R+PR  +FYYV LSG+ VGG  + I    F +   G
Sbjct: 316 SPSSSTLQFGDSEQP--AVTAPLIRSPRTNTFYYVALSGISVGGEALSIPSSAFAMDDAG 373

Query: 369 DDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTV 428
             GV++D+GTAVTRL + AY A R+AFV  T +LPRASGVS+FDTCY+L+G  SV+VP V
Sbjct: 374 SGGVIVDSGTAVTRLQSGAYGALREAFVQGTQSLPRASGVSLFDTCYDLAGRSSVQVPAV 433

Query: 429 SFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFV 488
           + +F GG  L LPA N+LIPVD AGT+C AFA +   +SIIGN+QQ+G+++SFD A   V
Sbjct: 434 ALWFEGGGELKLPAKNYLIPVDAAGTYCLAFAGTSGPVSIIGNVQQQGVRVSFDTAKNTV 493

Query: 489 GFGPNVC 495
           GF  + C
Sbjct: 494 GFTADKC 500


>gi|449529638|ref|XP_004171805.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like, partial
           [Cucumis sativus]
          Length = 384

 Score =  365 bits (938), Expect = 2e-98,   Method: Compositional matrix adjust.
 Identities = 204/383 (53%), Positives = 258/383 (67%), Gaps = 8/383 (2%)

Query: 120 ATLVRRLSGGGADAAKHE----VQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDS 175
           A  V++LS  GA +           F + V+SG+ QGSGEYF RIGVG+PP+  YMV+D+
Sbjct: 2   AIRVKKLSSLGATSRNLSKPGGTTGFSSSVISGLAQGSGEYFTRIGVGTPPKYVYMVLDT 61

Query: 176 GSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGR-CRYEVS 234
           GSDIVW+QC PC  CY Q+DPVF+P  S SF+ V C + +C RLE+ GC+  + C Y+VS
Sbjct: 62  GSDIVWLQCAPCKNCYSQTDPVFNPVKSGSFAKVLCRTPLCRRLESPGCNQRQTCLYQVS 121

Query: 235 YGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGG 294
           YGDGSYT G    ETLT  RT V+ VA+GCGH N+G+FVGAAGLLGLG G +S   Q G 
Sbjct: 122 YGDGSYTTGEFVTETLTFRRTKVEQVALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQAGR 181

Query: 295 QTGGAFSYCLVSRGTGSS-GSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGM 353
                FSYCLV R   S   S+VFG  A+   A + PL+ NPR  +FYYV L G+ VGG 
Sbjct: 182 TFNQKFSYCLVDRSASSKPSSVVFGNSAVSRTARFTPLLTNPRLDTFYYVELLGISVGGT 241

Query: 354 RIP-ISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFD 412
            +  I+   F+L + G+ GV++D GT+VTRL  PAY A RDAF A   +L  A   S+FD
Sbjct: 242 PVSGITASHFKLDRTGNGGVIIDCGTSVTRLNKPAYIALRDAFRAGASSLKSAPEFSLFD 301

Query: 413 TCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNI 472
           TCY+LSG  +V+VPTV  +F G  V +LPASN+LIPVD +G FCFAFA + SGLSIIGNI
Sbjct: 302 TCYDLSGKTTVKVPTVVLHFRGADV-SLPASNYLIPVDGSGRFCFAFAGTTSGLSIIGNI 360

Query: 473 QQEGIQISFDGANGFVGFGPNVC 495
           QQ+G ++ +D A+  VGF P  C
Sbjct: 361 QQQGFRVVYDLASSRVGFSPRGC 383


>gi|356504173|ref|XP_003520873.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 461

 Score =  363 bits (931), Expect = 1e-97,   Method: Compositional matrix adjust.
 Identities = 198/395 (50%), Positives = 266/395 (67%), Gaps = 10/395 (2%)

Query: 105 QHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGS 164
           +  FH R+QRD KRV  L+ ++      A +     F + ++SG+ QGSGEYF RIGVG+
Sbjct: 72  EQLFHLRLQRDAKRVEALLNQI-----HARRSAGSSFSSSIISGLAQGSGEYFTRIGVGT 126

Query: 165 PPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGC 224
           P R  YMV+D+GSD+VW+QC PC +CY Q+D VFDP  S +++G+ C + +C RL++ GC
Sbjct: 127 PARYVYMVLDTGSDVVWLQCAPCRKCYTQTDHVFDPTKSRTYAGIPCGAPLCRRLDSPGC 186

Query: 225 HAGR--CRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLG 282
                 C+Y+VSYGDGS+T G  + ETLT  R  V  VA+GCGH N+G+F GAAGLLGLG
Sbjct: 187 SNKNKVCQYQVSYGDGSFTFGDFSTETLTFRRNRVTRVALGCGHDNEGLFTGAAGLLGLG 246

Query: 283 GGSMSLVGQLGGQTGGAFSYCLVSR-GTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFY 341
            G +S   Q G +    FSYCLV R  +    S++FG  A+   A + PL++NP+  +FY
Sbjct: 247 RGRLSFPVQTGRRFNHKFSYCLVDRSASAKPSSVIFGDSAVSRTAHFTPLIKNPKLDTFY 306

Query: 342 YVGLSGLGVGGMRIP-ISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTG 400
           Y+ L G+ VGG  +  +S  LFRL   G+ GV++D+GT+VTRL  PAY A RDAF     
Sbjct: 307 YLELLGISVGGAPVRGLSASLFRLDAAGNGGVIIDSGTSVTRLTRPAYIALRDAFRIGAS 366

Query: 401 NLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFA 460
           +L RA   S+FDTC++LSG   V+VPTV  +F G  V +LPA+N+LIPVD++G+FCFAFA
Sbjct: 367 HLKRAPEFSLFDTCFDLSGLTEVKVPTVVLHFRGADV-SLPATNYLIPVDNSGSFCFAFA 425

Query: 461 PSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
            + SGLSIIGNIQQ+G +IS+D     VGF P  C
Sbjct: 426 GTMSGLSIIGNIQQQGFRISYDLTGSRVGFAPRGC 460


>gi|357127293|ref|XP_003565317.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial
           [Brachypodium distachyon]
          Length = 540

 Score =  361 bits (926), Expect = 6e-97,   Method: Compositional matrix adjust.
 Identities = 198/361 (54%), Positives = 248/361 (68%), Gaps = 15/361 (4%)

Query: 145 VVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSA 204
           VVSG+ QGSGEYF RIG+GSP R  YMV+D+GSD+ W+QC PC+ CY QSDP+FDPA S+
Sbjct: 185 VVSGVGQGSGEYFSRIGIGSPARQLYMVLDTGSDVTWLQCAPCADCYAQSDPLFDPALSS 244

Query: 205 SFSGVSCSSAVCDRLENAGCHAG------RCRYEVSYGDGSYTKGTLALETLTIG---RT 255
           S++ V C S  C  L+ + CH         C YEV+YGDGSYT G  A ETLT+G     
Sbjct: 245 SYATVPCDSPHCRALDASACHNNAANGNSSCVYEVAYGDGSYTVGDFATETLTLGGDGSA 304

Query: 256 VVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSL 315
            V +VAIGCGH N+G+FVGAAGLL LGGG +S   Q+   +   FSYCLV R + S+ +L
Sbjct: 305 AVHDVAIGCGHDNEGLFVGAAGLLALGGGPLSFPSQI---SATEFSYCLVDRDSPSASTL 361

Query: 316 VFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIP-ISEDLFRLTQMGDDGVVM 374
            FG  A        PL+R+PR+ +FYYV L+G+ VGG  +  I    F + + G  GV++
Sbjct: 362 QFG--ASDSSTVTAPLMRSPRSNTFYYVALNGISVGGETLSDIPPAAFAMDEQGSGGVIV 419

Query: 375 DTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSG 434
           D+GTAVTRL + AY A RDAFV  T  LPRASGVS+FDTCY+L+G  SV+VP VS  F G
Sbjct: 420 DSGTAVTRLQSSAYSALRDAFVRGTQALPRASGVSLFDTCYDLAGRSSVQVPAVSLRFEG 479

Query: 435 GPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNV 494
           G  L LPA N+LIPVD AGT+C AFA +   +SI+GN+QQ+GI++SFD A   VGF PN 
Sbjct: 480 GGELKLPAKNYLIPVDGAGTYCLAFAATGGAVSIVGNVQQQGIRVSFDTAKNTVGFSPNK 539

Query: 495 C 495
           C
Sbjct: 540 C 540


>gi|388515789|gb|AFK45956.1| unknown [Medicago truncatula]
          Length = 225

 Score =  358 bits (920), Expect = 3e-96,   Method: Compositional matrix adjust.
 Identities = 182/225 (80%), Positives = 203/225 (90%)

Query: 271 MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVP 330
           MFVGAAGLLGLG G MS VGQLGGQ GG FSYCLVSRGT SSGSL FGRE++PVGA+WV 
Sbjct: 1   MFVGAAGLLGLGSGPMSFVGQLGGQAGGTFSYCLVSRGTESSGSLEFGRESVPVGASWVS 60

Query: 331 LVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEA 390
           L+ NPRAPSFYY+GLSGLGVGG+R+PISED+FRL ++G+ GVVMDTGTAVTRLP  AY A
Sbjct: 61  LIHNPRAPSFYYIGLSGLGVGGLRVPISEDIFRLNELGEGGVVMDTGTAVTRLPAAAYNA 120

Query: 391 FRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVD 450
           FRDAFVAQT NLP+ SGVSIFDTCY+L+GFV+VRVPT+SFYF GGP+LTLPA NFLIPVD
Sbjct: 121 FRDAFVAQTTNLPKTSGVSIFDTCYDLNGFVTVRVPTISFYFLGGPILTLPARNFLIPVD 180

Query: 451 DAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
             GTFCFAFAPS SGLSIIGNIQQEGI+IS DGANG++GFGPN+C
Sbjct: 181 SVGTFCFAFAPSSSGLSIIGNIQQEGIEISVDGANGYIGFGPNIC 225


>gi|312282359|dbj|BAJ34045.1| unnamed protein product [Thellungiella halophila]
          Length = 484

 Score =  355 bits (912), Expect = 2e-95,   Method: Compositional matrix adjust.
 Identities = 198/402 (49%), Positives = 266/402 (66%), Gaps = 15/402 (3%)

Query: 108 FHARMQRDVKRVATLVRRLS-GGGADAAKHEVQD---FGTDVVSGMDQGSGEYFVRIGVG 163
           F+ R+QRD  RV +L    +   G +  K   +    F   V+SG+ QGSGEYF+R+GVG
Sbjct: 84  FNLRLQRDSLRVESLTSLAAVSAGRNVTKRPPRSAGGFSGVVISGLSQGSGEYFMRLGVG 143

Query: 164 SPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAG 223
           +P  + YMV+D+GSD+VW+QC PC  CY QSDPVF+PA S +F+ V C S +C RL+++ 
Sbjct: 144 TPATNMYMVLDTGSDVVWLQCSPCKVCYNQSDPVFNPAKSKTFATVPCGSRLCRRLDDSS 203

Query: 224 -CHAGR---CRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLL 279
            C + R   C Y+VSYGDGS+T G  + ETLT     V +VA+GCGH N+G+FVGAAGLL
Sbjct: 204 ECVSRRSKACLYQVSYGDGSFTVGDFSTETLTFHGARVDHVALGCGHDNEGLFVGAAGLL 263

Query: 280 GLGGGSMSLVGQLGGQTGGAFSYCLVSR-----GTGSSGSLVFGREALPVGAAWVPLVRN 334
           GLG G +S   Q   +  G FSYCLV R      +    ++VFG  A+P  A + PL+ N
Sbjct: 264 GLGRGGLSFPSQTKNRYNGKFSYCLVDRTSSGSSSKPPSTIVFGNGAVPKTAVFTPLLTN 323

Query: 335 PRAPSFYYVGLSGLGVGGMRIP-ISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRD 393
           P+  +FYY+ L G+ VGG R+P +SE  F+L   G+ GV++D+GT+VTRL   AY A RD
Sbjct: 324 PKLDTFYYLQLLGISVGGSRVPGVSESQFKLDATGNGGVIIDSGTSVTRLTQSAYVALRD 383

Query: 394 AFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAG 453
           AF      L RA   S+FDTC++LSG  +V+VPTV F+F+GG V +LPASN+LIPV++ G
Sbjct: 384 AFRLGATRLKRAPSYSLFDTCFDLSGMTTVKVPTVVFHFTGGEV-SLPASNYLIPVNNQG 442

Query: 454 TFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
            FCFAFA +   LSIIGNIQQ+G ++++D     VGF    C
Sbjct: 443 RFCFAFAGTMGSLSIIGNIQQQGFRVAYDLVGSRVGFLSRAC 484


>gi|15228618|ref|NP_191741.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|6850873|emb|CAB71112.1| putative protein [Arabidopsis thaliana]
 gi|332646739|gb|AEE80260.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 483

 Score =  354 bits (908), Expect = 6e-95,   Method: Compositional matrix adjust.
 Identities = 200/429 (46%), Positives = 273/429 (63%), Gaps = 22/429 (5%)

Query: 81  LELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLS-GGGADAAKHEVQ 139
           + L H D +SS S+ +           F+ R+QRD  RV ++    +   G +A K   +
Sbjct: 63  VHLSHVDALSSFSDAS-------PADLFNLRLQRDSLRVKSITSLAAVSTGRNATKRTPR 115

Query: 140 D---FGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDP 196
               F   V+SG+ QGSGEYF+R+GVG+P  + YMV+D+GSD+VW+QC PC  CY Q+D 
Sbjct: 116 TAGGFSGAVISGLSQGSGEYFMRLGVGTPATNVYMVLDTGSDVVWLQCSPCKACYNQTDA 175

Query: 197 VFDPADSASFSGVSCSSAVCDRLENAG-CHAGR---CRYEVSYGDGSYTKGTLALETLTI 252
           +FDP  S +F+ V C S +C RL+++  C   R   C Y+VSYGDGS+T+G  + ETLT 
Sbjct: 176 IFDPKKSKTFATVPCGSRLCRRLDDSSECVTRRSKTCLYQVSYGDGSFTEGDFSTETLTF 235

Query: 253 GRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR----- 307
               V +V +GCGH N+G+FVGAAGLLGLG G +S   Q   +  G FSYCLV R     
Sbjct: 236 HGARVDHVPLGCGHDNEGLFVGAAGLLGLGRGGLSFPSQTKNRYNGKFSYCLVDRTSSGS 295

Query: 308 GTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIP-ISEDLFRLTQ 366
            +    ++VFG  A+P  + + PL+ NP+  +FYY+ L G+ VGG R+P +SE  F+L  
Sbjct: 296 SSKPPSTIVFGNAAVPKTSVFTPLLTNPKLDTFYYLQLLGISVGGSRVPGVSESQFKLDA 355

Query: 367 MGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVP 426
            G+ GV++D+GT+VTRL  PAY A RDAF      L RA   S+FDTC++LSG  +V+VP
Sbjct: 356 TGNGGVIIDSGTSVTRLTQPAYVALRDAFRLGATKLKRAPSYSLFDTCFDLSGMTTVKVP 415

Query: 427 TVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANG 486
           TV F+F GG V +LPASN+LIPV+  G FCFAFA +   LSIIGNIQQ+G ++++D    
Sbjct: 416 TVVFHFGGGEV-SLPASNYLIPVNTEGRFCFAFAGTMGSLSIIGNIQQQGFRVAYDLVGS 474

Query: 487 FVGFGPNVC 495
            VGF    C
Sbjct: 475 RVGFLSRAC 483


>gi|297821064|ref|XP_002878415.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297324253|gb|EFH54674.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 486

 Score =  353 bits (907), Expect = 9e-95,   Method: Compositional matrix adjust.
 Identities = 202/449 (44%), Positives = 278/449 (61%), Gaps = 22/449 (4%)

Query: 61  FERHNNISSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVA 120
           +    + S  + S      ++ L H D +SS S+ +           F  R+QRD  RV 
Sbjct: 46  WPESKSFSDESVSESTTSLSVHLSHVDALSSFSDAS-------PVDLFKLRLQRDSLRVK 98

Query: 121 TLVRRLS-GGGADAAKHEVQD---FGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSG 176
           ++    +   G +A K   +    F   V+SG+ QGSGEYF+R+GVG+P  + YMV+D+G
Sbjct: 99  SITSLAAVSTGRNATKRTPRSAGGFSGAVISGLSQGSGEYFMRLGVGTPATNVYMVLDTG 158

Query: 177 SDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAG-CHAGR---CRYE 232
           SD+VW+QC PC  CY QSD +FDP  S +F+ V C S +C RL+++  C   R   C Y+
Sbjct: 159 SDVVWLQCSPCKACYNQSDVIFDPKKSKTFATVPCGSRLCRRLDDSSECVTRRSKTCLYQ 218

Query: 233 VSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQL 292
           VSYGDGS+T+G  + ETLT     V +V +GCGH N+G+FVGAAGLLGLG G +S   Q 
Sbjct: 219 VSYGDGSFTEGDFSTETLTFHGARVDHVPLGCGHDNEGLFVGAAGLLGLGRGGLSFPSQT 278

Query: 293 GGQTGGAFSYCLVSR-----GTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSG 347
             +  G FSYCLV R      +    ++VFG +A+P  + + PL+ NP+  +FYY+ L G
Sbjct: 279 KSRYNGKFSYCLVDRTSSGSSSKPPSTIVFGNDAVPKTSVFTPLLTNPKLDTFYYLQLLG 338

Query: 348 LGVGGMRIP-ISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRAS 406
           + VGG R+P +SE  F+L   G+ GV++D+GT+VTRL   AY A RDAF      L RA 
Sbjct: 339 ISVGGSRVPGVSESQFKLDATGNGGVIIDSGTSVTRLTQSAYVALRDAFRLGATKLKRAP 398

Query: 407 GVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGL 466
             S+FDTC++LSG  +V+VPTV F+F GG V +LPASN+LIPV+  G FCFAFA +   L
Sbjct: 399 SYSLFDTCFDLSGMTTVKVPTVVFHFGGGEV-SLPASNYLIPVNTEGRFCFAFAGTMGSL 457

Query: 467 SIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           SIIGNIQQ+G ++++D     VGF    C
Sbjct: 458 SIIGNIQQQGFRVAYDLVGSRVGFLSRAC 486


>gi|168008816|ref|XP_001757102.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162691600|gb|EDQ77961.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 406

 Score =  353 bits (905), Expect = 1e-94,   Method: Compositional matrix adjust.
 Identities = 194/406 (47%), Positives = 261/406 (64%), Gaps = 19/406 (4%)

Query: 108 FHARMQRDVKRVATLVRRL---------SGGGADAAKHEVQDFGTDVVSGMDQGSGEYFV 158
            H  + RD  RVA++  R+         S       K   QDF   VVSG+  GSGEYF+
Sbjct: 1   MHVTISRDNLRVASIHGRINQTVNGLTRSRSRDRQTKVPSQDFQAPVVSGLSLGSGEYFI 60

Query: 159 RIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDR 218
           RI VG+PPR  Y+V+D+GSDI+W+QC PC  CY QSD +FDP  S+++S + CS+  C  
Sbjct: 61  RISVGTPPRRMYLVMDTGSDILWLQCAPCVNCYHQSDAIFDPYKSSTYSTLGCSTRQCLN 120

Query: 219 LENAGCHAGRCRYEVSYGDGSYTKGTLALETLT------IGRTVVKNVAIGCGHKNQGMF 272
           L+   C A +C Y+V YGDGS+T G    + ++      +G+ V+  + +GCGH N+G F
Sbjct: 121 LDIGTCQANKCLYQVDYGDGSFTTGEFGTDDVSLNSTSGVGQVVLNKIPLGCGHDNEGYF 180

Query: 273 VGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSS--GSLVFGREALP-VGAAWV 329
           VGAAGLLGLG G +S   Q+  Q GG FSYCL  R T S+   SLVFG  A+P  GA + 
Sbjct: 181 VGAAGLLGLGKGPLSFPNQVDPQNGGRFSYCLTDRETDSTEGSSLVFGEAAVPPAGARFT 240

Query: 330 PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYE 389
           P   N R P+FYY+ ++G+ VGG  + I    F+L  +G+ GV++D+GT+VTRL   AY 
Sbjct: 241 PQDSNMRVPTFYYLKMTGISVGGTILTIPTSAFQLDSLGNGGVIIDSGTSVTRLQNAAYA 300

Query: 390 AFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPV 449
           + RDAF A T +L   +G S+FDTCY+LSG  SV VPTV+ +F GG  L LPASN+LIPV
Sbjct: 301 SLRDAFRAGTSDLAPTAGFSLFDTCYDLSGLASVDVPTVTLHFQGGTDLKLPASNYLIPV 360

Query: 450 DDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           D++ TFC AFA + +G SIIGNIQQ+G ++ +D  +  VGF P+ C
Sbjct: 361 DNSNTFCLAFAGT-TGPSIIGNIQQQGFRVIYDNLHNQVGFVPSQC 405


>gi|302763741|ref|XP_002965292.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
 gi|300167525|gb|EFJ34130.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
          Length = 423

 Score =  352 bits (902), Expect = 3e-94,   Method: Compositional matrix adjust.
 Identities = 193/426 (45%), Positives = 265/426 (62%), Gaps = 18/426 (4%)

Query: 84  VHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEV----- 138
           +HRD   S     N   +   ++    R+ RD  R+ ++  R+S G A   K  +     
Sbjct: 1   MHRDSADSPYRPANATVHGLVRN----RLHRDELRLLSISSRISLGVAGIPKSSLTNPLK 56

Query: 139 -------QDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCY 191
                  QDF T + SG+  GSGEYFV +GVG+PPR+  MV D+GSD++W+QC PC  CY
Sbjct: 57  NTNPFLQQDFETPLRSGLSDGSGEYFVSLGVGTPPRTVNMVADTGSDVLWLQCLPCQSCY 116

Query: 192 KQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLT 251
            Q+DP+F+P+ S++F  ++C S++C +L   GC   +C Y+VSYGDGS+T G  + ETL+
Sbjct: 117 GQTDPLFNPSFSSTFQSITCGSSLCQQLLIRGCRRNQCLYQVSYGDGSFTVGEFSTETLS 176

Query: 252 IGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGS 311
            G   V +VAIGCGH NQG+F GAAGLLGLG G +S   Q+G   G  FSYCL +R +  
Sbjct: 177 FGSNAVNSVAIGCGHNNQGLFTGAAGLLGLGKGLLSFPSQVGQLYGSVFSYCLPTRESTG 236

Query: 312 SGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRL-TQMGDD 370
           S  L+FG +A+   A +  L+ NP+  +FYYV + G+ VGG  + I      L +  G+ 
Sbjct: 237 SVPLIFGNQAVASNAQFTTLLTNPKLDTFYYVEMVGIKVGGTSVNIPAGSLSLDSSTGNG 296

Query: 371 GVVMDTGTAVTRLPTPAYEAFRDAFVA-QTGNLPRASGVSIFDTCYNLSGFVSVRVPTVS 429
           GV++D+GTAVTRL T AY   RDAF A    +    SG S+FDTCY+LSG  S+ +P VS
Sbjct: 297 GVILDSGTAVTRLVTSAYNPMRDAFRAGMPSDAKMTSGFSLFDTCYDLSGRSSIMLPAVS 356

Query: 430 FYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVG 489
           F F+GG  + LPA N ++PVD++GT+C AFAP+    SIIGNIQQ+  ++SFD     VG
Sbjct: 357 FVFNGGATMALPAQNIMVPVDNSGTYCLAFAPNSENFSIIGNIQQQSFRMSFDSTGNRVG 416

Query: 490 FGPNVC 495
            G N C
Sbjct: 417 IGANQC 422


>gi|302809855|ref|XP_002986620.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
 gi|300145803|gb|EFJ12477.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
          Length = 423

 Score =  352 bits (902), Expect = 4e-94,   Method: Compositional matrix adjust.
 Identities = 193/426 (45%), Positives = 265/426 (62%), Gaps = 18/426 (4%)

Query: 84  VHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEV----- 138
           +HRD   S     N   +   ++    R+ RD  R+ ++  R+S G A   K  +     
Sbjct: 1   MHRDSADSPYRPANATVHGLVRN----RLHRDELRLLSISSRISLGVAGIPKSSLTNPLK 56

Query: 139 -------QDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCY 191
                  QDF T + SG+  GSGEYFV +GVG+PPR+  MV D+GSD++W+QC PC  CY
Sbjct: 57  NTNPFLQQDFETPLRSGLSDGSGEYFVSLGVGTPPRTVNMVADTGSDVLWLQCLPCQSCY 116

Query: 192 KQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLT 251
            Q+DP+F+P+ S++F  ++C S++C +L   GC   +C Y+VSYGDGS+T G  + ETL+
Sbjct: 117 GQTDPLFNPSFSSTFQSITCGSSLCQQLLIRGCRRNQCLYQVSYGDGSFTVGEFSTETLS 176

Query: 252 IGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGS 311
            G   V +VAIGCGH NQG+F GAAGLLGLG G +S   Q+G   G  FSYCL +R +  
Sbjct: 177 FGSNAVNSVAIGCGHNNQGLFTGAAGLLGLGKGLLSFPSQVGQLYGSVFSYCLPTRESTG 236

Query: 312 SGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRL-TQMGDD 370
           S  L+FG +A+   A +  L+ NP+  +FYYV + G+ VGG  + I      L +  G+ 
Sbjct: 237 SVPLIFGNQAVASNAQFTTLLTNPKLDTFYYVEMVGIKVGGTSVSIPAGSLSLDSSTGNG 296

Query: 371 GVVMDTGTAVTRLPTPAYEAFRDAFVA-QTGNLPRASGVSIFDTCYNLSGFVSVRVPTVS 429
           GV++D+GTAVTRL T AY   RDAF A    +    SG S+FDTCY+LSG  S+ +P VS
Sbjct: 297 GVILDSGTAVTRLVTSAYNPMRDAFRAGMPSDAKMTSGFSLFDTCYDLSGRSSIMLPAVS 356

Query: 430 FYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVG 489
           F F+GG  + LPA N ++PVD++GT+C AFAP+    SIIGNIQQ+  ++SFD     VG
Sbjct: 357 FVFNGGATMALPAQNIMVPVDNSGTYCLAFAPNSENFSIIGNIQQQSFRMSFDSTGNRVG 416

Query: 490 FGPNVC 495
            G N C
Sbjct: 417 IGANQC 422


>gi|302774174|ref|XP_002970504.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
 gi|300162020|gb|EFJ28634.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
          Length = 483

 Score =  351 bits (900), Expect = 5e-94,   Method: Compositional matrix adjust.
 Identities = 194/435 (44%), Positives = 261/435 (60%), Gaps = 18/435 (4%)

Query: 75  DEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAA 134
           D    +LEL+HR+ +   +        H H+      +QRD +RV  +  +    G    
Sbjct: 52  DGGTLSLELIHRNSLLREAKE----KLHTHEQLLLETLQRDEQRVRWIESKAQLAGKKKD 107

Query: 135 KHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQS 194
           +    D    V SG+  GSGEYFVR+GVG+P RS +MV+D+GSD+ W+QCQPC  CYKQ+
Sbjct: 108 EASSTDLNGPVTSGLLYGSGEYFVRLGVGTPARSLFMVVDTGSDLPWLQCQPCKSCYKQA 167

Query: 195 DPVFDPADSASFSGVSCSSAVCDRLENAGCH-----AGRCRYEVSYGDGSYTKGTLALET 249
           DP+FDP +S+SF  + C S +C  LE   C        RC Y+V+YGDGS++ G  + + 
Sbjct: 168 DPIFDPRNSSSFQRIPCLSPLCKALEIHSCSGSRGATSRCSYQVAYGDGSFSVGDFSSDL 227

Query: 250 LTIGR-TVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQL-----GGQTGGAFSYC 303
            T+G  +   +VA GCG  N+G+F GAAGLLGLG G +S   Q+        T  +FSYC
Sbjct: 228 FTLGTGSKAMSVAFGCGFDNEGLFAGAAGLLGLGAGKLSFPSQIFASSTNSSTANSFSYC 287

Query: 304 LVSRG---TGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISED 360
           LV R    T SS SL+FG  A+P  AA  PL++NP+  +FYY  + G+ VGG ++PIS  
Sbjct: 288 LVDRSNPMTRSSSSLIFGAAAIPSTAALSPLLKNPKLDTFYYAAMIGVSVGGAQLPISLK 347

Query: 361 LFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGF 420
             +L+Q G  GV++D+GT+VTR PT  Y   RDAF   T NLP A   S+FDTCYN SG 
Sbjct: 348 SLQLSQSGSGGVIIDSGTSVTRFPTSVYATIRDAFRNATTNLPSAPRYSLFDTCYNFSGK 407

Query: 421 VSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQIS 480
            SV VP +  +F  G  L LP +N+LIP++ AG+FC AFAP+   L IIGNIQQ+  +I 
Sbjct: 408 ASVDVPALVLHFENGADLQLPPTNYLIPINTAGSFCLAFAPTSMELGIIGNIQQQSFRIG 467

Query: 481 FDGANGFVGFGPNVC 495
           FD     + F P  C
Sbjct: 468 FDLQKSHLAFAPQQC 482


>gi|302776610|ref|XP_002971459.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
 gi|300160591|gb|EFJ27208.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
          Length = 357

 Score =  350 bits (897), Expect = 1e-93,   Method: Compositional matrix adjust.
 Identities = 185/360 (51%), Positives = 246/360 (68%), Gaps = 10/360 (2%)

Query: 143 TDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPAD 202
           + V SG+  GSGEYFVR+G+GSP + QY+V+D+GSD+ W+QC PC  CYKQ+D VFDP  
Sbjct: 1   SQVTSGLAFGSGEYFVRVGIGSPTKLQYLVMDTGSDVPWIQCSPCKSCYKQNDAVFDPRA 60

Query: 203 SASFSGVSCSSAVCDRLENAGCHA--GRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNV 260
           S+SF  +SCS+  C  L+   C +   RC Y+VSYGDGS+T G LA ++ ++ R     V
Sbjct: 61  SSSFRRLSCSTPQCKLLDVKACASTDNRCLYQVSYGDGSFTVGDLASDSFSVSRGRTSPV 120

Query: 261 AIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTG--SSGSLVFG 318
             GCGH N+G+FVGAAGLLGLG G +S   QL  +    FSYCLVSR  G  +S +L+FG
Sbjct: 121 VFGCGHDNEGLFVGAAGLLGLGAGKLSFPSQLSSR---KFSYCLVSRDNGVRASSALLFG 177

Query: 319 REALPVGA--AWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQ-MGDDGVVMD 375
             ALP  A  A+  L++NP+  +FYY GLSG+ +GG  + I    F+L+   G  GV++D
Sbjct: 178 DSALPTSASFAYTQLLKNPKLDTFYYAGLSGISIGGTLLSIPSTAFKLSSSTGRGGVIID 237

Query: 376 TGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGG 435
           +GT+VTRLPT AY   RDAF + T  LPRA+  S+FDTCY+ S   SV +PTVSF+F GG
Sbjct: 238 SGTSVTRLPTYAYTVMRDAFRSATQKLPRAADFSLFDTCYDFSALTSVTIPTVSFHFEGG 297

Query: 436 PVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
             + LP SN+L+PVD +GTFCFAF+ +   LSIIGNIQQ+ ++++ D  +  VGF P  C
Sbjct: 298 ASVQLPPSNYLVPVDTSGTFCFAFSKTSLDLSIIGNIQQQTMRVAIDLDSSRVGFAPRQC 357


>gi|302765224|ref|XP_002966033.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
 gi|300166847|gb|EFJ33453.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
          Length = 357

 Score =  349 bits (895), Expect = 2e-93,   Method: Compositional matrix adjust.
 Identities = 185/360 (51%), Positives = 245/360 (68%), Gaps = 10/360 (2%)

Query: 143 TDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPAD 202
           + V SG+  GSGEYFVR+G+GSP + QY+V+D+GSD+ W+QC PC  CYKQ+D VFDP  
Sbjct: 1   SQVTSGLAFGSGEYFVRVGIGSPTKLQYLVMDTGSDVPWIQCSPCKSCYKQNDAVFDPRA 60

Query: 203 SASFSGVSCSSAVCDRLENAGCHA--GRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNV 260
           S+SF  +SCS+  C  L+   C +   RC Y+VSYGDGS+T G LA ++  + R     V
Sbjct: 61  SSSFRRLSCSTPQCKLLDVKACASTDNRCLYQVSYGDGSFTVGDLASDSFLVSRGRTSPV 120

Query: 261 AIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTG--SSGSLVFG 318
             GCGH N+G+FVGAAGLLGLG G +S   QL  +    FSYCLVSR  G  +S +L+FG
Sbjct: 121 VFGCGHDNEGLFVGAAGLLGLGAGKLSFPSQLSSR---KFSYCLVSRDNGVRASSALLFG 177

Query: 319 REALPVGA--AWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQ-MGDDGVVMD 375
             ALP  A  A+  L++NP+  +FYY GLSG+ +GG  + I    F+L+   G  GV++D
Sbjct: 178 DSALPTSASFAYTQLLKNPKLDTFYYAGLSGISIGGTLLSIPSTAFKLSSSTGRGGVIID 237

Query: 376 TGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGG 435
           +GT+VTRLPT AY   RDAF + T  LPRA+  S+FDTCY+ S   SV +PTVSF+F GG
Sbjct: 238 SGTSVTRLPTYAYTVMRDAFRSATQKLPRAADFSLFDTCYDFSALTSVTIPTVSFHFEGG 297

Query: 436 PVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
             + LP SN+L+PVD +GTFCFAF+ +   LSIIGNIQQ+ ++++ D  +  VGF P  C
Sbjct: 298 ASVQLPPSNYLVPVDTSGTFCFAFSKTSLDLSIIGNIQQQTMRVAIDLDSSRVGFAPRQC 357


>gi|293336306|ref|NP_001168599.1| uncharacterized protein LOC100382383 [Zea mays]
 gi|223949441|gb|ACN28804.1| unknown [Zea mays]
          Length = 326

 Score =  347 bits (891), Expect = 6e-93,   Method: Compositional matrix adjust.
 Identities = 189/329 (57%), Positives = 229/329 (69%), Gaps = 7/329 (2%)

Query: 171 MVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCH--AGR 228
           MV+D+GSD+ WVQCQPC+ CY+QSDPVFDP+ SAS++ VSC S  C  L+ A C    G 
Sbjct: 1   MVLDTGSDVTWVQCQPCADCYQQSDPVFDPSLSASYAAVSCDSQRCRDLDTAACRNATGA 60

Query: 229 CRYEVSYGDGSYTKGTLALETLTIG-RTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMS 287
           C YEV+YGDGSYT G  A ETLT+G  T V NVAIGCGH N+G+FVGAAGLL LGGG +S
Sbjct: 61  CLYEVAYGDGSYTVGDFATETLTLGDSTPVGNVAIGCGHDNEGLFVGAAGLLALGGGPLS 120

Query: 288 LVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSG 347
              Q+   T   FSYCLV R + ++ +L FG  A   G    PLVR+PR  +FYYV LSG
Sbjct: 121 FPSQISAST---FSYCLVDRDSPAASTLQFGDGAAEAGTVTAPLVRSPRTSTFYYVALSG 177

Query: 348 LGVGGMRIPISEDLFRLTQM-GDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRAS 406
           + VGG  + I    F +    G  GV++D+GTAVTRL + AY A RDAFV    +LPR S
Sbjct: 178 ISVGGQPLSIPASAFAMDATSGSGGVIVDSGTAVTRLQSAAYAALRDAFVQGAPSLPRTS 237

Query: 407 GVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGL 466
           GVS+FDTCY+LS   SV VP VS  F GG  L LPA N+LIPVD AGT+C AFAP+ + +
Sbjct: 238 GVSLFDTCYDLSDRTSVEVPAVSLRFEGGGALRLPAKNYLIPVDGAGTYCLAFAPTNAAV 297

Query: 467 SIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           SIIGN+QQ+G ++SFD A G VGF PN C
Sbjct: 298 SIIGNVQQQGTRVSFDTARGAVGFTPNKC 326


>gi|326506192|dbj|BAJ86414.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326516358|dbj|BAJ92334.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 492

 Score =  343 bits (881), Expect = 9e-92,   Method: Compositional matrix adjust.
 Identities = 206/444 (46%), Positives = 273/444 (61%), Gaps = 26/444 (5%)

Query: 65  NNISSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVR 124
           N  S+ +  +   R+   LVHRD  S ++     + Y         R++RD KR A L  
Sbjct: 62  NLASAEDAPASTVRF--RLVHRDDFSVNATAAELLAY---------RLERDAKRAARL-- 108

Query: 125 RLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQC 184
             + G A+  +         VVSG+ QGSGEYF +IGVG+P     MV+D+GSD+VW+QC
Sbjct: 109 SAAAGPANGTRRGGGGVVAPVVSGLAQGSGEYFTKIGVGTPATPALMVLDTGSDVVWLQC 168

Query: 185 QPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGR--CRYEVSYGDGSYTK 242
            PC +CY+QS  VFDP  S S++ V C++ +C RL++ GC   R  C Y+V+YGDGS T 
Sbjct: 169 APCRRCYEQSGQVFDPRRSRSYNAVGCAAPLCRRLDSGGCDLRRSACLYQVAYGDGSVTA 228

Query: 243 GTLALETLTI-GRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFS 301
           G  A ETLT  G   V  VA+GCGH N+G+FV AAGLLGLG GS+S   Q+  + G +FS
Sbjct: 229 GDFATETLTFAGGARVARVALGCGHDNEGLFVAAAGLLGLGRGSLSFPTQISRRYGRSFS 288

Query: 302 YCLVSRGTGS-----SGSLVFGREAL--PVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMR 354
           YCLV R + +     S ++ FG  A+   V +++ P+V+NPR  +FYYV L G+ VGG R
Sbjct: 289 YCLVDRTSSANTASRSSTVTFGSGAVGSTVASSFTPMVKNPRMETFYYVQLIGISVGGAR 348

Query: 355 IP--ISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASG-VSIF 411
           +P   + DL      G  GV++D+GT+VTRL  PAY A RDAF      L  + G  S+F
Sbjct: 349 VPGVANSDLRLDPSSGRGGVIVDSGTSVTRLARPAYSALRDAFRGAAAGLRLSPGGFSLF 408

Query: 412 DTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGN 471
           DTCY+LSG   V+VPTVS +F+GG    LP  N+LIPVD  GTFCFAFA +  G+SIIGN
Sbjct: 409 DTCYDLSGRKVVKVPTVSMHFAGGAEAALPPENYLIPVDSKGTFCFAFAGTDGGVSIIGN 468

Query: 472 IQQEGIQISFDGANGFVGFGPNVC 495
           IQQ+G ++ FDG    V F P  C
Sbjct: 469 IQQQGFRVVFDGDGQRVAFTPKGC 492


>gi|414881704|tpg|DAA58835.1| TPA: hypothetical protein ZEAMMB73_701358 [Zea mays]
          Length = 485

 Score =  342 bits (878), Expect = 2e-91,   Method: Compositional matrix adjust.
 Identities = 204/433 (47%), Positives = 267/433 (61%), Gaps = 30/433 (6%)

Query: 80  NLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQ 139
           +  +VHRD  + ++ T   +  HR        +QRD +R A    R+S        +  +
Sbjct: 66  HFRVVHRDTFAVNA-TAGELLKHR--------LQRDKRRAA----RISEAAGAGGGNGRK 112

Query: 140 DFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFD 199
                VVSG+ QGSGEYF +IGVG+P     MV+D+GSD+VWVQC PC +CY+QS PVFD
Sbjct: 113 GVAAPVVSGLAQGSGEYFTKIGVGTPATQALMVLDTGSDVVWVQCAPCRRCYEQSGPVFD 172

Query: 200 PADSASFSGVSCSSAVCDRLENAGC--HAGRCRYEVSYGDGSYTKGTLALETLTI-GRTV 256
           P  S+S+  V C +A+C RL++ GC    G C Y+V+YGDGS T G    ETLT  G   
Sbjct: 173 PRRSSSYGAVGCGAALCRRLDSGGCDLRRGACMYQVAYGDGSVTAGDFVTETLTFAGGAR 232

Query: 257 VKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR--------- 307
           V  VA+GCGH N+G+FV AAGLLGLG G +S   Q+  + G +FSYCLV R         
Sbjct: 233 VARVALGCGHDNEGLFVAAAGLLGLGRGGLSFPTQISRRYGRSFSYCLVDRTSSGAGAAP 292

Query: 308 GTGSSGSLVFGREALPV-GAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIP-ISEDLFRL- 364
           G+  S ++ FG  ++    A++ P+VRNPR  +FYYV L G+ VGG R+P ++E   RL 
Sbjct: 293 GSHRSSTVSFGAGSVGASSASFTPMVRNPRMETFYYVQLVGISVGGARVPGVAESDLRLD 352

Query: 365 TQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRAS--GVSIFDTCYNLSGFVS 422
              G  GV++D+GT+VTRL   +Y A RDAF A      R S  G S+FDTCY+L G   
Sbjct: 353 PSTGRGGVIVDSGTSVTRLARASYSALRDAFRAAAAGGLRLSPGGFSLFDTCYDLGGRRV 412

Query: 423 VRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFD 482
           V+VPTVS +F+GG    LP  N+LIPVD  GTFCFAFA +  G+SIIGNIQQ+G ++ FD
Sbjct: 413 VKVPTVSMHFAGGAEAALPPENYLIPVDSRGTFCFAFAGTDGGVSIIGNIQQQGFRVVFD 472

Query: 483 GANGFVGFGPNVC 495
           G    VGF P  C
Sbjct: 473 GDGQRVGFAPKGC 485


>gi|242053503|ref|XP_002455897.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
 gi|241927872|gb|EES01017.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
          Length = 493

 Score =  341 bits (875), Expect = 4e-91,   Method: Compositional matrix adjust.
 Identities = 207/443 (46%), Positives = 270/443 (60%), Gaps = 46/443 (10%)

Query: 80  NLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRL------------S 127
           +  +VHRD  ++++     +   RH      R+QRD +R A + +              S
Sbjct: 70  HFRVVHRDAFAANATAAELL---RH------RLQRDKRRAARISKAAAGGGAGAANGTRS 120

Query: 128 GGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC 187
            GGA AA          VVSG+ QGSGEYF +IGVG+P     MV+D+GSD+VW+QC PC
Sbjct: 121 RGGAVAAP---------VVSGLAQGSGEYFTKIGVGTPSTPALMVLDTGSDVVWLQCAPC 171

Query: 188 SQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGR--CRYEVSYGDGSYTKGTL 245
            +CY QS PVFDP  S+S+  V C++ +C RL++ GC   R  C Y+V+YGDGS T G  
Sbjct: 172 RRCYDQSGPVFDPRRSSSYGAVDCAAPLCRRLDSGGCDLRRRACLYQVAYGDGSVTAGDF 231

Query: 246 ALETLTI-GRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCL 304
           A ETLT  G   V  VA+GCGH N+G+FV AAGLLGLG GS+S   Q+  + G +FSYCL
Sbjct: 232 ATETLTFAGGARVARVALGCGHDNEGLFVAAAGLLGLGRGSLSFPTQISRRYGKSFSYCL 291

Query: 305 VSR---------GTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRI 355
           V R             S ++ FG  +    A++ P+VRNPR  +FYYV L G+ VGG R+
Sbjct: 292 VDRTSSSSSGAASRSRSSTVTFGPPSASA-ASFTPMVRNPRMETFYYVQLVGISVGGARV 350

Query: 356 P-ISEDLFRL-TQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRAS-GVSIFD 412
           P ++E   RL    G  GV++D+GT+VTRL  P+Y A RDAF A    L  +  G S+FD
Sbjct: 351 PGVAESDLRLDPSTGRGGVIVDSGTSVTRLARPSYSALRDAFRAAAAGLRLSPGGFSLFD 410

Query: 413 TCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNI 472
           TCY+L G   V+VPTVS +F+GG    LP  N+LIPVD  GTFCFAFA +  G+SIIGNI
Sbjct: 411 TCYDLGGRKVVKVPTVSMHFAGGAEAALPPENYLIPVDSRGTFCFAFAGTDGGVSIIGNI 470

Query: 473 QQEGIQISFDGANGFVGFGPNVC 495
           QQ+G ++ FDG    VGF P  C
Sbjct: 471 QQQGFRVVFDGDGQRVGFAPKGC 493


>gi|302793638|ref|XP_002978584.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
 gi|300153933|gb|EFJ20570.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
          Length = 407

 Score =  338 bits (866), Expect = 5e-90,   Method: Compositional matrix adjust.
 Identities = 186/406 (45%), Positives = 249/406 (61%), Gaps = 14/406 (3%)

Query: 104 HQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVG 163
           H+      +QRD +RV  +  +    G    +    D    V SG+  GSGEYFVR+G+G
Sbjct: 2   HEQLLLETLQRDERRVRWIESKAKLAGKKKDEASSTDLNGPVTSGLLYGSGEYFVRLGLG 61

Query: 164 SPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAG 223
           +P RS +MV+D+GSD+ W+QCQPC  CYKQ+DP+FDP +S+SF  + C S +C  LE   
Sbjct: 62  TPARSLFMVVDTGSDLPWLQCQPCKSCYKQADPIFDPRNSSSFQRIPCLSPLCKALEVHS 121

Query: 224 CH-----AGRCRYEVSYGDGSYTKGTLALETLTIGR-TVVKNVAIGCGHKNQGMFVGAAG 277
           C        RC Y+V+YGDGS++ G  + +  T+G  +   +VA GCG  N+G+F GAAG
Sbjct: 122 CSGSRGATSRCSYQVAYGDGSFSVGDFSSDLFTLGTGSKAMSVAFGCGFDNEGLFAGAAG 181

Query: 278 LLGLGGGSMSLVGQL-----GGQTGGAFSYCLVSRG---TGSSGSLVFGREALPVGAAWV 329
           LLGLG G +S   Q+        T  +FSYCLV R    T SS SL+FG  A+P  AA  
Sbjct: 182 LLGLGAGKLSFPSQIFASSTNSSTANSFSYCLVDRSNPMTRSSSSLIFGVAAIPSTAALS 241

Query: 330 PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYE 389
           PL++NP+  +FYY  + G+ VGG ++PIS    +L+Q G  GV++D+GT+VTR PT  Y 
Sbjct: 242 PLLKNPKLDTFYYAAMIGVSVGGAQLPISLKSLQLSQSGSGGVIIDSGTSVTRFPTSVYA 301

Query: 390 AFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPV 449
             RDAF   T NLP A   S+FDTCYN SG  SV VP +  +F  G  L LP +N+LIP+
Sbjct: 302 TIRDAFRNATINLPSAPRYSLFDTCYNFSGKASVDVPALVLHFENGADLQLPPTNYLIPI 361

Query: 450 DDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           + AG+FC AFAP+   L IIGNIQQ+  +I FD     + F P  C
Sbjct: 362 NTAGSFCLAFAPTSMELGIIGNIQQQSFRIGFDLQKSHLAFAPQQC 407


>gi|357135412|ref|XP_003569303.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 494

 Score =  337 bits (863), Expect = 1e-89,   Method: Compositional matrix adjust.
 Identities = 210/469 (44%), Positives = 283/469 (60%), Gaps = 27/469 (5%)

Query: 52  AKMSQYNELFERHNNISSSNTSSDEARWNLELVHRDKMSSSSNTT-NNMHY---HRHQHS 107
           AK  Q   L     + +  + SS+ AR + + V    ++++ + T + + +   HR    
Sbjct: 28  AKPVQTQSLLVTPLSPTPFSASSELARGDDKDVFAGNLAAAEDATPSTVQFSVVHRDDFV 87

Query: 108 FHA--------RMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVR 159
            +A        R+QRD KR A +        A+  +         VVSG+ QGSGEYF +
Sbjct: 88  VNATAAELLGHRLQRDGKRAARISAAAGA--ANGTRRTGSGVVAPVVSGLAQGSGEYFTK 145

Query: 160 IGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRL 219
           IGVG+P     MV+D+GSD+VW+QC PC +CY QS  VFDP  S S+  V CS+ +C RL
Sbjct: 146 IGVGTPATPALMVLDTGSDVVWLQCAPCRRCYDQSGQVFDPRRSRSYGAVGCSAPLCRRL 205

Query: 220 ENAGCHAGR--CRYEVSYGDGSYTKGTLALETLTI-GRTVVKNVAIGCGHKNQGMFVGAA 276
           ++ GC   R  C Y+V+YGDGS T G  A ETLT  G   V  +A+GCGH N+G+FV AA
Sbjct: 206 DSGGCDLRRKACLYQVAYGDGSVTAGDFATETLTFAGGARVARIALGCGHDNEGLFVAAA 265

Query: 277 GLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGS-----SGSLVFGREAL--PVGAAWV 329
           GLLGLG GS+S   Q+  + G +FSYCLV R + +     S ++ FG  A+   V A++ 
Sbjct: 266 GLLGLGRGSLSFPAQISRRYGRSFSYCLVDRTSSANPASHSSTVTFGSGAVGSTVAASFT 325

Query: 330 PLVRNPRAPSFYYVGLSGLGVGGMRIP-ISEDLFRL-TQMGDDGVVMDTGTAVTRLPTPA 387
           P+V+NPR  +FYYV L G+ VGG R+  +++   RL    G  GV++D+GT+VTRL  PA
Sbjct: 326 PMVKNPRMETFYYVQLVGISVGGARVSGVADSDLRLDPSSGRGGVIVDSGTSVTRLARPA 385

Query: 388 YEAFRDAFVAQTGNLPRASG-VSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFL 446
           Y A RDAF A    L  + G  S+FDTCY+LSG   V+VPTVS +F+GG    LP  N+L
Sbjct: 386 YSALRDAFRAAAAGLRLSPGGFSLFDTCYDLSGRKVVKVPTVSMHFAGGAEAALPPENYL 445

Query: 447 IPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           IPVD  GTFCFAFA +  G+SIIGNIQQ+G ++ FDG    VGF P  C
Sbjct: 446 IPVDSKGTFCFAFAGTDGGVSIIGNIQQQGFRVVFDGDGQRVGFVPKGC 494


>gi|168013126|ref|XP_001759252.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162689565|gb|EDQ75936.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 385

 Score =  333 bits (854), Expect = 1e-88,   Method: Compositional matrix adjust.
 Identities = 177/370 (47%), Positives = 245/370 (66%), Gaps = 10/370 (2%)

Query: 135 KHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQS 194
           K   QDF   V+SG+  GSGEYF+R+ VG+PPR  Y+V+D+GSDI+W+QC PC  CY Q 
Sbjct: 16  KVPSQDFQAPVISGLSLGSGEYFIRVSVGTPPRGMYLVMDTGSDILWLQCAPCVSCYHQC 75

Query: 195 DPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTI-- 252
           D VFDP  S+++S + C+S  C  L+  GC   +C Y+V YGDGS++ G  A + +++  
Sbjct: 76  DEVFDPYKSSTYSTLGCNSRQCLNLDVGGCVGNKCLYQVDYGDGSFSTGEFATDAVSLNS 135

Query: 253 ----GRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRG 308
               G+ V+  + +GCGH N+G FVGAAGLLGLG G +S   Q+  + GG FSYCL  R 
Sbjct: 136 TSGGGQVVLNKIPLGCGHDNEGYFVGAAGLLGLGKGPLSFPNQINSENGGRFSYCLTGRD 195

Query: 309 TGSS--GSLVFGREALP-VGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLT 365
           T S+   SL+FG  A+P  G  + P   N R  +FYY+ ++G+ VGG  + I    F+L 
Sbjct: 196 TDSTERSSLIFGDAAVPPAGVRFTPQASNLRVSTFYYLKMTGISVGGSILTIPTSAFQLD 255

Query: 366 QMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRV 425
            +G+ GV++D+GT+VTRL   AY + R+AF A T +L   +  S+FDTCYNLS   SV V
Sbjct: 256 SLGNGGVIIDSGTSVTRLQNAAYASLREAFRAGTSDLVLTTEFSLFDTCYNLSDLSSVDV 315

Query: 426 PTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGAN 485
           PTV+ +F GG  L LPASN+L+PVD++ TFC AFA + +G SIIGNIQQ+G ++ +D  +
Sbjct: 316 PTVTLHFQGGADLKLPASNYLVPVDNSSTFCLAFAGT-TGPSIIGNIQQQGFRVIYDNLH 374

Query: 486 GFVGFGPNVC 495
             VGF P+ C
Sbjct: 375 NQVGFVPSQC 384


>gi|302762735|ref|XP_002964789.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
 gi|300167022|gb|EFJ33627.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
          Length = 390

 Score =  330 bits (847), Expect = 8e-88,   Method: Compositional matrix adjust.
 Identities = 171/390 (43%), Positives = 238/390 (61%), Gaps = 6/390 (1%)

Query: 112 MQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYM 171
           M+RD  R+  +  R+        +         V SG+  GSGEYF R+G+GSP RS Y+
Sbjct: 1   MERDEARLRWIHHRIQSSDHRHRRGRSLLQTAQVSSGLSLGSGEYFARMGIGSPQRSYYL 60

Query: 172 VIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCRY 231
            +D+GSD+ W+QC PCS CY Q DP++DP++S+S+  V C SA+C  L+ + C    C Y
Sbjct: 61  ELDTGSDVTWIQCAPCSSCYSQVDPIYDPSNSSSYRRVYCGSALCQALDYSACQGMGCSY 120

Query: 232 EVSYGDGSYTKGTLALETLTIG---RTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSL 288
            V YGD S + G L +E+  +G    T ++N+A GCGH N G+F G AGLLG+GGG++S 
Sbjct: 121 RVVYGDSSASSGDLGIESFYLGPNSSTAMRNIAFGCGHSNSGLFRGEAGLLGMGGGTLSF 180

Query: 289 VGQLGGQTGGAFSYCLVSRGT---GSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGL 345
             Q+    G AFSYCLV R +     S  L+FGR A+P  A + PL++NPR  +FYY  L
Sbjct: 181 FSQIAASIGPAFSYCLVDRYSQLQSRSSPLIFGRTAIPFAARFTPLLKNPRIDTFYYAIL 240

Query: 346 SGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRA 405
           +G+ VGG  +PI    F LT  G  G ++D+GT+VTR+   AY   RDA+ A + NLP A
Sbjct: 241 TGISVGGTALPIPPAQFALTGNGTGGAILDSGTSVTRVVPAAYAVLRDAYRAASRNLPPA 300

Query: 406 SGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSG 465
            GV + DTC+N  G  +V++P++  +F     + LP  N LIPVD +GTFC AFAPS   
Sbjct: 301 PGVYLLDTCFNFQGLPTVQIPSLVLHFDNDVDMVLPGGNILIPVDRSGTFCLAFAPSSMP 360

Query: 466 LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           +S+IGN+QQ+  +I FD     +   P  C
Sbjct: 361 ISVIGNVQQQTFRIGFDLQRSLIAIAPREC 390


>gi|449440933|ref|XP_004138238.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 487

 Score =  330 bits (846), Expect = 9e-88,   Method: Compositional matrix adjust.
 Identities = 192/412 (46%), Positives = 262/412 (63%), Gaps = 17/412 (4%)

Query: 97  NNMHYHRHQHSFHARMQRDVKRVA----TLVRRLSGG---GADAAKHEVQDFGT-DVVSG 148
           +N  Y  +     AR+ RD  RV      L R L+GG   G    +  + D  T  VVSG
Sbjct: 80  HNPSYKDYNTLVRARLTRDAARVQFLNRNLERSLNGGTHFGESINESLIGDSITAPVVSG 139

Query: 149 MDQGSG-EYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQ---CYKQSDPVFDPADSA 204
             +GSG EY  +IGVG P +  Y+V D+GSD+ W+QCQPC+    CYKQ DP+FDP  S+
Sbjct: 140 QSKGSGAEYLAQIGVGQPVKLFYLVPDTGSDVTWLQCQPCASENTCYKQFDPIFDPKSSS 199

Query: 205 SFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIG 263
           S+S +SC+S  C  L+ A C++  C Y+V YGDGS+T G LA ETL+ G +  + N+ IG
Sbjct: 200 SYSPLSCNSQQCKLLDKANCNSDTCIYQVHYGDGSFTTGELATETLSFGNSNSIPNLPIG 259

Query: 264 CGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALP 323
           CGH N+G+F G AGL+GLGGG++SL  QL      +FSYCLV+  + SS +L F    +P
Sbjct: 260 CGHDNEGLFAGGAGLIGLGGGAISLSSQL---KASSFSYCLVNLDSDSSSTLEFNSN-MP 315

Query: 324 VGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRL 383
             +   PLV+N R  S+ YV + G+ VGG  +PIS   F + + G  G+++D+GT ++RL
Sbjct: 316 SDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEIDESGLGGIIVDSGTIISRL 375

Query: 384 PTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPAS 443
           P+  YE+ R+AFV  T +L  A G+S+FDTCYN SG  +V VPT++F  S G  L LPA 
Sbjct: 376 PSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFSGQSNVEVPTIAFVLSEGTSLRLPAR 435

Query: 444 NFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           N+LI +D AGT+C AF  + S LSIIG+ QQ+GI++S+D  N  VGF  N C
Sbjct: 436 NYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDLTNSLVGFSTNKC 487


>gi|302756591|ref|XP_002961719.1| hypothetical protein SELMODRAFT_64161 [Selaginella moellendorffii]
 gi|300170378|gb|EFJ36979.1| hypothetical protein SELMODRAFT_64161 [Selaginella moellendorffii]
          Length = 357

 Score =  329 bits (843), Expect = 2e-87,   Method: Compositional matrix adjust.
 Identities = 166/357 (46%), Positives = 230/357 (64%), Gaps = 6/357 (1%)

Query: 145 VVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSA 204
           + SG+  GSGEYF R+G+G+P RS Y+ +D+GSD+ W+QC PCS CY Q DP++DP++S+
Sbjct: 1   ISSGLSLGSGEYFARMGIGNPQRSYYLELDTGSDVTWIQCAPCSSCYSQVDPIYDPSNSS 60

Query: 205 SFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIG---RTVVKNVA 261
           S+  V C SA+C  L+ + C    C Y V YGD S + G L +E+  +G    T ++N+A
Sbjct: 61  SYRRVYCGSALCQALDYSACQGMGCSYRVVYGDSSASSGDLGIESFYLGPNSSTAMRNIA 120

Query: 262 IGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGT---GSSGSLVFG 318
            GCGH N G+F G AGLLG+GGG++S   Q+    G AFSYCLV R +     S  L+FG
Sbjct: 121 FGCGHSNSGLFRGEAGLLGMGGGTLSFFSQIAASIGPAFSYCLVDRYSQLQSRSSPLIFG 180

Query: 319 REALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGT 378
           R A+P  A + PL++NPR  +FYY  L+G+ VGG  +PI    F LT  G  G ++D+GT
Sbjct: 181 RTAIPFAARFTPLLKNPRINTFYYAVLTGISVGGTPLPIPPAQFALTGNGTGGAILDSGT 240

Query: 379 AVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVL 438
           +VTR+  PAY   RDA+ A + NLP A GV + DTC+N  G  +V++P++  +F  G  +
Sbjct: 241 SVTRVVPPAYAVLRDAYRAASRNLPPAPGVYLLDTCFNFQGLPTVQIPSLVLHFDNGVDM 300

Query: 439 TLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
            LP  N LIPVD +GTFC AFAPS   +S+IGN+QQ+  +I FD     +   P  C
Sbjct: 301 VLPGGNILIPVDRSGTFCLAFAPSSMPISVIGNVQQQTFRIGFDLQRSLIAIAPREC 357


>gi|449527149|ref|XP_004170575.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 487

 Score =  329 bits (843), Expect = 2e-87,   Method: Compositional matrix adjust.
 Identities = 192/412 (46%), Positives = 262/412 (63%), Gaps = 17/412 (4%)

Query: 97  NNMHYHRHQHSFHARMQRDVKRVA----TLVRRLSGG---GADAAKHEVQDFGT-DVVSG 148
           +N  Y  +     AR+ RD  RV      L R L+GG   G    +  + D  T  VVSG
Sbjct: 80  HNPSYKDYNTLVRARLTRDAARVQFLNRNLERSLNGGTHFGESINESLIGDSITAPVVSG 139

Query: 149 MDQGSG-EYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQ---CYKQSDPVFDPADSA 204
             +GSG EY  +IGVG P +  Y+V D+GSD+ W+QCQPC+    CYKQ DP+FDP  S+
Sbjct: 140 QSKGSGAEYLAQIGVGQPVKLFYLVPDTGSDVTWLQCQPCASENTCYKQFDPIFDPKSSS 199

Query: 205 SFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIG 263
           S+S +SC+S  C  L+ A C++  C Y+V YGDGS+T G LA ETL+ G +  + N+ IG
Sbjct: 200 SYSPLSCNSQQCKLLDKANCNSDTCIYQVHYGDGSFTTGELATETLSFGNSNSIPNLPIG 259

Query: 264 CGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALP 323
           CGH N+G+F G AGL+GLGGG++SL  QL      +FSYCLV+  + SS +L F    +P
Sbjct: 260 CGHDNEGLFAGGAGLIGLGGGAISLSSQL---KASSFSYCLVNLDSDSSSTLEFN-SYMP 315

Query: 324 VGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRL 383
             +   PLV+N R  S+ YV + G+ VGG  +PIS   F + + G  G+++D+GT ++RL
Sbjct: 316 SDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEIDESGLGGIIVDSGTIISRL 375

Query: 384 PTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPAS 443
           P+  YE+ R+AFV  T +L  A G+S+FDTCYN SG  +V VPT++F  S G  L LPA 
Sbjct: 376 PSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFSGQSNVEVPTIAFVLSEGTSLRLPAR 435

Query: 444 NFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           N+LI +D AGT+C AF  + S LSIIG+ QQ+GI++S+D  N  VGF  N C
Sbjct: 436 NYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDLTNSIVGFSTNKC 487


>gi|449440931|ref|XP_004138237.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 523

 Score =  328 bits (842), Expect = 3e-87,   Method: Compositional matrix adjust.
 Identities = 187/394 (47%), Positives = 247/394 (62%), Gaps = 14/394 (3%)

Query: 106 HSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSP 165
            S + +++  +K      RR++G  +             V SG  QG+GEYF RIGVG P
Sbjct: 140 QSLNRKLELSLKGGKQFGRRINGSDS------TNSLTAPVTSGASQGAGEYFARIGVGQP 193

Query: 166 PRSQYMVIDSGSDIVWVQCQPC---SQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENA 222
            +S + V D+GSD+ W+QCQPC   + CYKQ  P+FDP  S+S+S +SC S  C  L+ A
Sbjct: 194 VQSYFFVPDTGSDVSWLQCQPCDGENGCYKQIGPIFDPKSSSSYSPLSCDSEQCHLLDEA 253

Query: 223 GCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQGMFVGAAGLLGL 281
            C A  C YEV YGDGS+T G LA ET +   +  + N+ IGCGH N+G+FVGA GL+GL
Sbjct: 254 ACDANSCIYEVEYGDGSFTVGELATETFSFRHSNSIPNLPIGCGHDNEGLFVGADGLIGL 313

Query: 282 GGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFY 341
           GGG++SL  QL      +FSYCLV   + SS +L F  +  P  +   PLV+N R P+F 
Sbjct: 314 GGGAISLSSQL---EATSFSYCLVDLDSESSSTLDFNADQ-PSDSLTSPLVKNDRFPTFR 369

Query: 342 YVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGN 401
           YV + G+ VGG  +PIS   F + + G  G+++D+GT +T +P+  Y+  RDAFV  T N
Sbjct: 370 YVKVIGMSVGGKPLPISSSSFEIDESGSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKN 429

Query: 402 LPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAP 461
           LP A GVS FDTCY+LS   +V VPT++F   G   L LPA N LI VD AGTFC AF P
Sbjct: 430 LPPAPGVSPFDTCYDLSSQSNVEVPTIAFILPGENSLQLPAKNCLIQVDSAGTFCLAFLP 489

Query: 462 SPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           S   LSIIGN+QQ+GI++S+D AN  VGF  + C
Sbjct: 490 STFPLSIIGNVQQQGIRVSYDLANSLVGFSTDKC 523


>gi|115438214|ref|NP_001043485.1| Os01g0598600 [Oryza sativa Japonica Group]
 gi|15290061|dbj|BAB63755.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
           Group]
 gi|113533016|dbj|BAF05399.1| Os01g0598600 [Oryza sativa Japonica Group]
 gi|125526702|gb|EAY74816.1| hypothetical protein OsI_02707 [Oryza sativa Indica Group]
          Length = 500

 Score =  328 bits (842), Expect = 3e-87,   Method: Compositional matrix adjust.
 Identities = 197/434 (45%), Positives = 262/434 (60%), Gaps = 28/434 (6%)

Query: 81  LELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATL-----VRRLSGGGADAAK 135
           L +VHRD  + ++     + +         R++RD +R + +         + G      
Sbjct: 76  LRVVHRDDFAVNATAAELLAH---------RLRRDKRRASRISAAAGGAAAANGTRVGGG 126

Query: 136 HEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD 195
                F   VVSG+ QGSGEYF +IGVG+P     MV+D+GSD+VW+QC PC +CY QS 
Sbjct: 127 GGGSGFVAPVVSGLAQGSGEYFTKIGVGTPVTPALMVLDTGSDVVWLQCAPCRRCYDQSG 186

Query: 196 PVFDPADSASFSGVSCSSAVCDRLENAGCHAGR--CRYEVSYGDGSYTKGTLALETLTIG 253
            +FDP  S S+  V C++ +C RL++ GC   R  C Y+V+YGDGS T G  A ETLT  
Sbjct: 187 QMFDPRASHSYGAVDCAAPLCRRLDSGGCDLRRKACLYQVAYGDGSVTAGDFATETLTFA 246

Query: 254 RTV-VKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSS 312
               V  VA+GCGH N+G+FV AAGLLGLG GS+S   Q+  + G +FSYCLV R + S+
Sbjct: 247 SGARVPRVALGCGHDNEGLFVAAAGLLGLGRGSLSFPSQISRRFGRSFSYCLVDRTSSSA 306

Query: 313 GS------LVFGREAL--PVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIP--ISEDLF 362
            +      + FG  A+     A++ P+V+NPR  +FYYV L G+ VGG R+P     DL 
Sbjct: 307 SATSRSSTVTFGSGAVGPSAAASFTPMVKNPRMETFYYVQLMGISVGGARVPGVAVSDLR 366

Query: 363 RLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASG-VSIFDTCYNLSGFV 421
                G  GV++D+GT+VTRL  PAY A RDAF A    L  + G  S+FDTCY+LSG  
Sbjct: 367 LDPSTGRGGVIVDSGTSVTRLARPAYAALRDAFRAAAAGLRLSPGGFSLFDTCYDLSGLK 426

Query: 422 SVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISF 481
            V+VPTVS +F+GG    LP  N+LIPVD  GTFCFAFA +  G+SIIGNIQQ+G ++ F
Sbjct: 427 VVKVPTVSMHFAGGAEAALPPENYLIPVDSRGTFCFAFAGTDGGVSIIGNIQQQGFRVVF 486

Query: 482 DGANGFVGFGPNVC 495
           DG    +GF P  C
Sbjct: 487 DGDGQRLGFVPKGC 500


>gi|302817380|ref|XP_002990366.1| hypothetical protein SELMODRAFT_43971 [Selaginella moellendorffii]
 gi|300141928|gb|EFJ08635.1| hypothetical protein SELMODRAFT_43971 [Selaginella moellendorffii]
          Length = 420

 Score =  327 bits (838), Expect = 9e-87,   Method: Compositional matrix adjust.
 Identities = 170/356 (47%), Positives = 231/356 (64%), Gaps = 2/356 (0%)

Query: 141 FGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDP 200
           F + ++SG+  GSG+YF RIGVG+P RS YMV D+GSD+ W+QC PC +CY+Q DP+F+P
Sbjct: 66  FASPLISGIAGGSGDYFARIGVGTPARSVYMVADTGSDVSWLQCSPCRKCYRQQDPIFNP 125

Query: 201 ADSASFSGVSCSSAVCDRLENAGC-HAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKN 259
           + S+SF  ++C+S++C +L+  GC     C Y+VSYGDGS+T G  + ETL+ G   V++
Sbjct: 126 SLSSSFKPLACASSICGKLKIKGCSRKNECMYQVSYGDGSFTVGDFSTETLSFGEHAVRS 185

Query: 260 VAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGR 319
           VA+GCG  NQG+F GAAGLLGLG G +S   Q G      FSYCL  R +  + SLVFG 
Sbjct: 186 VAMGCGRNNQGLFHGAAGLLGLGRGPLSFPSQTGTSYASVFSYCLPRRESAIAASLVFGP 245

Query: 320 EALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTA 379
            A+P  A +  L+ N R  ++YYVGL+ + V G  + I  D F +   G  GV++D+GTA
Sbjct: 246 SAVPEKARFTKLLPNRRLDTYYYVGLARIRVAGSPVNIPPDAFAMGSRGTGGVIVDSGTA 305

Query: 380 VTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLT 439
           ++RL TPAY A RDAF +     P A G+S+FDTCY+LS   +  +P V   F GG  + 
Sbjct: 306 ISRLTTPAYTALRDAFRSLV-TFPSAPGISLFDTCYDLSSMKTATLPAVVLDFDGGASMP 364

Query: 440 LPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           LPA   L+ VDD GT+C AFAP     SIIGN+QQ+  +IS D     +G  P+ C
Sbjct: 365 LPADGILVNVDDEGTYCLAFAPEEEAFSIIGNVQQQTFRISIDNQKEQMGIAPDQC 420


>gi|449527151|ref|XP_004170576.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 523

 Score =  327 bits (837), Expect = 1e-86,   Method: Compositional matrix adjust.
 Identities = 187/394 (47%), Positives = 247/394 (62%), Gaps = 14/394 (3%)

Query: 106 HSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSP 165
            S + +++  +K      RR++G  +             V SG  QG+GEYF RIGVG P
Sbjct: 140 QSLNRKLELSLKGGKQFGRRINGSDS------TNSLTAPVTSGASQGAGEYFARIGVGQP 193

Query: 166 PRSQYMVIDSGSDIVWVQCQPC---SQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENA 222
            +S + V D+GSD+ W+QCQPC   + CYKQ  P+FDP  S+S+S +SC S  C  L+ A
Sbjct: 194 VQSYFFVPDTGSDVSWLQCQPCDGENGCYKQIGPIFDPKSSSSYSPLSCDSEQCHLLDEA 253

Query: 223 GCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQGMFVGAAGLLGL 281
            C A  C YEV YGDGS+T G LA ET +   +  + N+ IGCGH N+G+FVGAAGL+GL
Sbjct: 254 ACDANSCIYEVEYGDGSFTVGELATETFSFRHSNSIPNLPIGCGHDNEGLFVGAAGLIGL 313

Query: 282 GGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFY 341
           GGG++SL  QL      +FSYCLV   + SS +L F  +  P  +   PLV+N R P+F 
Sbjct: 314 GGGAISLSSQL---EATSFSYCLVDLDSESSSTLDFNADQ-PSDSLTSPLVKNDRFPTFR 369

Query: 342 YVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGN 401
           YV + G+ VGG  +PIS   F + + G  G+++D+GT +T +P+  Y+  RDAFV  T N
Sbjct: 370 YVKVIGMSVGGKPLPISSSSFEIDESGSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKN 429

Query: 402 LPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAP 461
           LP A GVS FDTCY+LS   +V VPT++F   G   L LPA N L  VD AGTFC AF P
Sbjct: 430 LPPAPGVSPFDTCYDLSSQSNVEVPTIAFILPGENSLQLPAKNCLFQVDSAGTFCLAFLP 489

Query: 462 SPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           S   LSIIGN+QQ+GI++S+D AN  VGF  + C
Sbjct: 490 STFPLSIIGNVQQQGIRVSYDLANSLVGFSTDKC 523


>gi|115465771|ref|NP_001056485.1| Os05g0590000 [Oryza sativa Japonica Group]
 gi|49328116|gb|AAT58814.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
 gi|113580036|dbj|BAF18399.1| Os05g0590000 [Oryza sativa Japonica Group]
          Length = 481

 Score =  326 bits (835), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 197/448 (43%), Positives = 267/448 (59%), Gaps = 28/448 (6%)

Query: 63  RHNNISSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATL 122
           +   +S +   ++ +  +  L HR+  + ++  ++ + +   + +  A         AT 
Sbjct: 47  QEQQLSLAAPRTNASTLHFRLAHREHFALNATASDLLAHLLARDAARAAALLAAPNNATR 106

Query: 123 VRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWV 182
            RR  G            F   ++SG+ QGSGEYF ++GVG+P  +  MV+D+GSD+VW+
Sbjct: 107 PRRRGG------------FAAPLLSGLPQGSGEYFAQVGVGTPATTALMVLDTGSDVVWL 154

Query: 183 QCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGR--CRYEVSYGDGSY 240
           QC PC  CY QS  VFDP  S S++ V C + +C RL++AGC   R  C Y+V+YGDGS 
Sbjct: 155 QCAPCRHCYAQSGRVFDPRRSRSYAAVDCVAPICRRLDSAGCDRRRNSCLYQVAYGDGSV 214

Query: 241 TKGTLALETLTIGR-TVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGA 299
           T G  A ETLT  R   V+ VAIGCGH N+G+F+ A+GLLGLG G +S   Q+    G +
Sbjct: 215 TAGDFASETLTFARGARVQRVAIGCGHDNEGLFIAASGLLGLGRGRLSFPSQIARSFGRS 274

Query: 300 FSYCLVSRGTG--------SSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVG 351
           FSYCLV R +         S+ +   G  A   GA++ P+ RNPR  +FYYV L G  VG
Sbjct: 275 FSYCLVDRTSSVRPSSTRSSTVTFGAGAVAAAAGASFTPMGRNPRMATFYYVHLLGFSVG 334

Query: 352 GMRIP-ISEDLFRLT-QMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRAS--G 407
           G R+  +S+   RL    G  GV++D+GT+VTRL  P YEA RDAF A    L R S  G
Sbjct: 335 GARVKGVSQSDLRLNPTTGRGGVILDSGTSVTRLARPVYEAVRDAFRAAAVGL-RVSPGG 393

Query: 408 VSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLS 467
            S+FDTCYNLSG   V+VPTVS + +GG  + LP  N+LIPVD +GTFCFA A +  G+S
Sbjct: 394 FSLFDTCYNLSGRRVVKVPTVSMHLAGGASVALPPENYLIPVDTSGTFCFAMAGTDGGVS 453

Query: 468 IIGNIQQEGIQISFDGANGFVGFGPNVC 495
           IIGNIQQ+G ++ FDG    VGF P  C
Sbjct: 454 IIGNIQQQGFRVVFDGDAQRVGFVPKSC 481


>gi|47777372|gb|AAT38006.1| unknow protein [Oryza sativa Japonica Group]
          Length = 475

 Score =  326 bits (835), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 197/448 (43%), Positives = 267/448 (59%), Gaps = 28/448 (6%)

Query: 63  RHNNISSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATL 122
           +   +S +   ++ +  +  L HR+  + ++  ++ + +   + +  A         AT 
Sbjct: 41  QEQQLSLAAPRTNASTLHFRLAHREHFALNATASDLLAHLLARDAARAAALLAAPNNATR 100

Query: 123 VRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWV 182
            RR  G            F   ++SG+ QGSGEYF ++GVG+P  +  MV+D+GSD+VW+
Sbjct: 101 PRRRGG------------FAAPLLSGLPQGSGEYFAQVGVGTPATTALMVLDTGSDVVWL 148

Query: 183 QCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGR--CRYEVSYGDGSY 240
           QC PC  CY QS  VFDP  S S++ V C + +C RL++AGC   R  C Y+V+YGDGS 
Sbjct: 149 QCAPCRHCYAQSGRVFDPRRSRSYAAVDCVAPICRRLDSAGCDRRRNSCLYQVAYGDGSV 208

Query: 241 TKGTLALETLTIGR-TVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGA 299
           T G  A ETLT  R   V+ VAIGCGH N+G+F+ A+GLLGLG G +S   Q+    G +
Sbjct: 209 TAGDFASETLTFARGARVQRVAIGCGHDNEGLFIAASGLLGLGRGRLSFPSQIARSFGRS 268

Query: 300 FSYCLVSRGTG--------SSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVG 351
           FSYCLV R +         S+ +   G  A   GA++ P+ RNPR  +FYYV L G  VG
Sbjct: 269 FSYCLVDRTSSVRPSSTRSSTVTFGAGAVAAAAGASFTPMGRNPRMATFYYVHLLGFSVG 328

Query: 352 GMRIP-ISEDLFRLT-QMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRAS--G 407
           G R+  +S+   RL    G  GV++D+GT+VTRL  P YEA RDAF A    L R S  G
Sbjct: 329 GARVKGVSQSDLRLNPTTGRGGVILDSGTSVTRLARPVYEAVRDAFRAAAVGL-RVSPGG 387

Query: 408 VSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLS 467
            S+FDTCYNLSG   V+VPTVS + +GG  + LP  N+LIPVD +GTFCFA A +  G+S
Sbjct: 388 FSLFDTCYNLSGRRVVKVPTVSMHLAGGASVALPPENYLIPVDTSGTFCFAMAGTDGGVS 447

Query: 468 IIGNIQQEGIQISFDGANGFVGFGPNVC 495
           IIGNIQQ+G ++ FDG    VGF P  C
Sbjct: 448 IIGNIQQQGFRVVFDGDAQRVGFVPKSC 475


>gi|125553531|gb|EAY99240.1| hypothetical protein OsI_21202 [Oryza sativa Indica Group]
          Length = 475

 Score =  325 bits (833), Expect = 3e-86,   Method: Compositional matrix adjust.
 Identities = 197/448 (43%), Positives = 267/448 (59%), Gaps = 28/448 (6%)

Query: 63  RHNNISSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATL 122
           +   +S +   ++ +  +  L HR+  + ++  ++ + +   + +  A         AT 
Sbjct: 41  QEQQLSLAAPRTNASTLHFRLAHREHFALNATASDLLAHLLARDAARAAALLAAPNNATR 100

Query: 123 VRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWV 182
            RR  G            F   ++SG+ QGSGEYF ++GVG+P  +  MV+D+GSD+VW+
Sbjct: 101 PRRRGG------------FAAPLLSGLPQGSGEYFAQVGVGTPATTALMVLDTGSDVVWL 148

Query: 183 QCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGR--CRYEVSYGDGSY 240
           QC PC  CY QS  VFDP  S S++ V C + +C RL++AGC   R  C Y+V+YGDGS 
Sbjct: 149 QCAPCRHCYAQSGRVFDPRRSRSYAAVDCVAPICRRLDSAGCDRRRNSCLYQVAYGDGSV 208

Query: 241 TKGTLALETLTIGR-TVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGA 299
           T G  A ETLT  R   V+ VAIGCGH N+G+F+ A+GLLGLG G +S   Q+    G +
Sbjct: 209 TAGDFASETLTFARGARVQRVAIGCGHDNEGLFIAASGLLGLGRGRLSFPTQIARSFGRS 268

Query: 300 FSYCLVSRGTG--------SSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVG 351
           FSYCLV R +         S+ +   G  A   GA++ P+ RNPR  +FYYV L G  VG
Sbjct: 269 FSYCLVDRTSSVRPSSTRSSTVTFGAGAVAAAAGASFTPMGRNPRMATFYYVHLLGFSVG 328

Query: 352 GMRIP-ISEDLFRLT-QMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRAS--G 407
           G R+  +S+   RL    G  GV++D+GT+VTRL  P YEA RDAF A    L R S  G
Sbjct: 329 GARVKGVSQSDLRLNPTTGRGGVILDSGTSVTRLARPVYEAVRDAFRAAAVGL-RVSPGG 387

Query: 408 VSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLS 467
            S+FDTCYNLSG   V+VPTVS + +GG  + LP  N+LIPVD +GTFCFA A +  G+S
Sbjct: 388 FSLFDTCYNLSGRRVVKVPTVSMHLAGGASVALPPENYLIPVDTSGTFCFAMAGTDGGVS 447

Query: 468 IIGNIQQEGIQISFDGANGFVGFGPNVC 495
           IIGNIQQ+G ++ FDG    VGF P  C
Sbjct: 448 IIGNIQQQGFRVVFDGDAQRVGFVPKSC 475


>gi|302795261|ref|XP_002979394.1| hypothetical protein SELMODRAFT_53966 [Selaginella moellendorffii]
 gi|300153162|gb|EFJ19802.1| hypothetical protein SELMODRAFT_53966 [Selaginella moellendorffii]
          Length = 353

 Score =  323 bits (829), Expect = 8e-86,   Method: Compositional matrix adjust.
 Identities = 169/352 (48%), Positives = 230/352 (65%), Gaps = 2/352 (0%)

Query: 145 VVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSA 204
           ++SG+  GSG+YF RIGVG+P RS YMV D+GSD+ W+QC PC +CY+Q DP+F+P+ S+
Sbjct: 3   LISGIAGGSGDYFARIGVGTPARSVYMVADTGSDVSWLQCSPCRKCYRQQDPIFNPSLSS 62

Query: 205 SFSGVSCSSAVCDRLENAGC-HAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIG 263
           SF  ++C+S++C +L+  GC    +C Y+VSYGDGS+T G  + ETL+ G   V++VA+G
Sbjct: 63  SFKPLACASSICGKLKIKGCSRKNKCMYQVSYGDGSFTVGDFSTETLSFGEHAVRSVAMG 122

Query: 264 CGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALP 323
           CG  NQG+F GAAGLLGLG G +S   Q G      FSYCL  R +  + SLVFG  A+P
Sbjct: 123 CGRNNQGLFHGAAGLLGLGRGPLSFPSQTGTSYASVFSYCLPRRESAIAASLVFGPSAVP 182

Query: 324 VGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRL 383
             A +  L+ N R  ++YYVGL+ + V G  + I  D F +   G  GV++D+GTA++RL
Sbjct: 183 EKARFTKLLPNRRLDTYYYVGLARIRVAGSPVNIPPDAFAMGSRGTGGVIVDSGTAISRL 242

Query: 384 PTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPAS 443
            TPAY A RDAF +     P A G+S+FDTCY+LS   +  +P V   F GG  + LPA 
Sbjct: 243 TTPAYTALRDAFRSLV-TFPSAPGISLFDTCYDLSSMKTATLPAVVLDFDGGASMPLPAD 301

Query: 444 NFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
             L+ VDD GT+C AFAP     SIIGN+QQ+  +IS D     +G  P+ C
Sbjct: 302 GILVNVDDEGTYCLAFAPEEEAFSIIGNVQQQTFRISIDNQKEQMGIAPDQC 353


>gi|168064509|ref|XP_001784204.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162664276|gb|EDQ51002.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 367

 Score =  320 bits (819), Expect = 2e-84,   Method: Compositional matrix adjust.
 Identities = 159/368 (43%), Positives = 231/368 (62%), Gaps = 14/368 (3%)

Query: 141 FGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDP 200
           F   + SG+  G+GEYF  +GVG+P R  Y+V+D+GSDI W+QC PC+ CYKQ D +F+P
Sbjct: 1   FEAPIFSGLAFGTGEYFAVVGVGTPRRDMYLVVDTGSDITWLQCAPCTNCYKQKDALFNP 60

Query: 201 ADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTI------GR 254
           + S+SF  + CSS++C  L+  GC + +C Y+  YGDGS+T G L  + + +      G+
Sbjct: 61  SSSSSFKVLDCSSSLCLNLDVMGCLSNKCLYQADYGDGSFTMGELVTDNVVLDDAFGPGQ 120

Query: 255 TVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSS-- 312
            V+ N+ +GCGH N+G F  AAG+LGLG G +S    L   T   FSYCL  R +  +  
Sbjct: 121 VVLTNIPLGCGHDNEGTFGTAAGILGLGRGPLSFPNNLDASTRNIFSYCLPDRESDPNHK 180

Query: 313 GSLVFGREALPVGAA----WVPLVRNPRAPSFYYVGLSGLGVGG-MRIPISEDLFRLTQM 367
            +LVFG  A+P  A     ++P +RNPR  ++YYV ++G+ VGG +   I   +F+L   
Sbjct: 181 STLVFGDAAIPHTATGSVKFIPQLRNPRVATYYYVQITGISVGGNLLTNIPASVFQLDSH 240

Query: 368 GDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPT 427
           G+ G + D+GT +TRL   AY A RDAF A T +L  A+   IFDTCY+ +G  S+ VPT
Sbjct: 241 GNGGTIFDSGTTITRLEARAYTAVRDAFRAATMHLTSAADFKIFDTCYDFTGMNSISVPT 300

Query: 428 VSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGF 487
           V+F+F G   + LP SN+++PV +   FCFAFA S  G S+IGN+QQ+  ++ +D  +  
Sbjct: 301 VTFHFQGDVDMRLPPSNYIVPVSNNNIFCFAFAAS-MGPSVIGNVQQQSFRVIYDNVHKQ 359

Query: 488 VGFGPNVC 495
           +G  P+ C
Sbjct: 360 IGLLPDQC 367


>gi|116787398|gb|ABK24493.1| unknown [Picea sitchensis]
          Length = 479

 Score =  299 bits (765), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 156/389 (40%), Positives = 229/389 (58%), Gaps = 17/389 (4%)

Query: 112 MQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYM 171
             RD  R+ T+  + +G  +  +   +Q        G   G+G Y V  G G+P ++  +
Sbjct: 101 FDRDNDRLNTIWSKNNGTYSTMSNLPLQP-------GSKVGTGNYIVTAGFGTPAKNSLL 153

Query: 172 VIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAG-CHAGRCR 230
           +ID+GSD+ W+QC+PCS CY Q DP+F+P  S+S+  +SC S+ C  L     C  G C 
Sbjct: 154 IIDTGSDVTWIQCKPCSDCYSQVDPIFEPQQSSSYKHLSCLSSACTELTTMNHCRLGGCV 213

Query: 231 YEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVG 290
           YE++YGDGS ++G  + ETLT+G     + A GCGH N G+F G+AGLLGLG  ++S   
Sbjct: 214 YEINYGDGSRSQGDFSQETLTLGSDSFPSFAFGCGHTNTGLFKGSAGLLGLGRTALSFPS 273

Query: 291 QLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLG 349
           Q   + GG FSYCL     + S+GS   G+ ++P  A +VPLV N   PSFY+VGL+G+ 
Sbjct: 274 QTKSKYGGQFSYCLPDFVSSTSTGSFSVGQGSIPATATFVPLVSNSNYPSFYFVGLNGIS 333

Query: 350 VGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVS 409
           VGG R+ I   +     +G  G ++D+GT +TRL   AY+A + +F ++T NLP A   S
Sbjct: 334 VGGERLSIPPAV-----LGRGGTIVDSGTVITRLVPQAYDALKTSFRSKTRNLPSAKPFS 388

Query: 410 IFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVD-DAGTFCFAFAPSPSGLS- 467
           I DTCY+LS +  VR+PT++F+F     + + A   L  +  D    C AFA +   +S 
Sbjct: 389 ILDTCYDLSSYSQVRIPTITFHFQNNADVAVSAVGILFTIQSDGSQVCLAFASASQSIST 448

Query: 468 -IIGNIQQEGIQISFDGANGFVGFGPNVC 495
            IIGN QQ+ ++++FD   G +GF P  C
Sbjct: 449 NIIGNFQQQRMRVAFDTGAGRIGFAPGSC 477


>gi|308044575|ref|NP_001183392.1| uncharacterized protein LOC100501808 [Zea mays]
 gi|238011188|gb|ACR36629.1| unknown [Zea mays]
          Length = 342

 Score =  298 bits (763), Expect = 4e-78,   Method: Compositional matrix adjust.
 Identities = 174/342 (50%), Positives = 223/342 (65%), Gaps = 17/342 (4%)

Query: 171 MVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGC--HAGR 228
           MV+D+GSD+VWVQC PC +CY+QS PVFDP  S+S+  V C +A+C RL++ GC    G 
Sbjct: 1   MVLDTGSDVVWVQCAPCRRCYEQSGPVFDPRRSSSYGAVGCGAALCRRLDSGGCDLRRGA 60

Query: 229 CRYEVSYGDGSYTKGTLALETLTI-GRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMS 287
           C Y+V+YGDGS T G    ETLT  G   V  VA+GCGH N+G+FV AAGLLGLG G +S
Sbjct: 61  CMYQVAYGDGSVTAGDFVTETLTFAGGARVARVALGCGHDNEGLFVAAAGLLGLGRGGLS 120

Query: 288 LVGQLGGQTGGAFSYCLVSR---------GTGSSGSLVFGREALPVGAA-WVPLVRNPRA 337
              Q+  + G +FSYCLV R         G+  S ++ FG  ++   +A + P+VRNPR 
Sbjct: 121 FPTQISRRYGRSFSYCLVDRTSSGAGAAPGSHRSSTVSFGAGSVGASSASFTPMVRNPRM 180

Query: 338 PSFYYVGLSGLGVGGMRIP-ISEDLFRLT-QMGDDGVVMDTGTAVTRLPTPAYEAFRDAF 395
            +FYYV L G+ VGG R+P ++E   RL    G  GV++D+GT+VTRL   +Y A RDAF
Sbjct: 181 ETFYYVQLVGISVGGARVPGVAESDLRLDPSTGRGGVIVDSGTSVTRLARASYSALRDAF 240

Query: 396 VAQTGNLPRAS--GVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAG 453
            A      R S  G S+FDTCY+L G   V+VPTVS +F+GG    LP  N+LIPVD  G
Sbjct: 241 RAAAAGGLRLSPGGFSLFDTCYDLGGRRVVKVPTVSMHFAGGAEAALPPENYLIPVDSRG 300

Query: 454 TFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           TFCFAFA +  G+SIIGNIQQ+G ++ FDG    VGF P  C
Sbjct: 301 TFCFAFAGTDGGVSIIGNIQQQGFRVVFDGDGQRVGFAPKGC 342


>gi|449441139|ref|XP_004138341.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
 gi|449477464|ref|XP_004155031.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 336

 Score =  297 bits (761), Expect = 6e-78,   Method: Compositional matrix adjust.
 Identities = 163/338 (48%), Positives = 219/338 (64%), Gaps = 8/338 (2%)

Query: 162 VGSPPRSQYMVIDSGSDIVWVQCQPCSQ---CYKQSDPVFDPADSASFSGVSCSSAVCDR 218
           VG P +  + V+D+GSD+ W+QC PC+    CY+Q  P+FDP  S+S++ VSC S  C  
Sbjct: 3   VGQPQQPSFFVLDTGSDVTWLQCLPCAGKNGCYEQITPIFDPELSSSYNPVSCDSEQCQL 62

Query: 219 LENAGCHAGRCRYEVSYGDGSYTKGTLALETLT-IGRTVVKNVAIGCGHKNQGMFVGAAG 277
           L+ AGC+   C Y+V YGDGS+T G LA ETLT +    + N++IGCGH N+G+FVGA G
Sbjct: 63  LDEAGCNVNSCIYKVEYGDGSFTIGELATETLTFVHSNSIPNISIGCGHDNEGLFVGADG 122

Query: 278 LLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRA 337
           L+GLGGG++S+  QL      +FSYCLV   + S  +L F  +  P  +   PLV+N R 
Sbjct: 123 LIGLGGGAISISSQL---KASSFSYCLVDIDSPSFSTLDFNTDP-PSDSLISPLVKNDRF 178

Query: 338 PSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVA 397
           PSF YV + G+ VGG  +PIS   F + + G  G+++D+GT +T+LP+  YE  R+AF+ 
Sbjct: 179 PSFRYVKVIGMSVGGKPLPISSSRFEIDESGLGGIIVDSGTTITQLPSDVYEVLREAFLG 238

Query: 398 QTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCF 457
            T NLP A  +S FDTCY+LS   +V VPT++F   G   L LPA N LI VD AGTFC 
Sbjct: 239 LTTNLPPAPEISPFDTCYDLSSQSNVEVPTIAFILPGENSLQLPAKNCLIQVDSAGTFCL 298

Query: 458 AFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           AF  +   LSIIGN QQ+GI++S+D  N  VGF  N C
Sbjct: 299 AFVSATFPLSIIGNFQQQGIRVSYDLTNSLVGFSTNKC 336


>gi|242041575|ref|XP_002468182.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
 gi|241922036|gb|EER95180.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
          Length = 490

 Score =  289 bits (740), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 192/444 (43%), Positives = 252/444 (56%), Gaps = 36/444 (8%)

Query: 73  SSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGAD 132
           ++  +  ++ L+HRD+ ++  N T      R       R+QRDV R A ++ + +  G  
Sbjct: 62  AASSSTLHIRLLHRDRFAA--NATPAQLLAR-------RLQRDVLRAAWIISKAAANGTP 112

Query: 133 ---AAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQ 189
              A     + F   VVS     SGEY  +I VG+P     + +D+ SD+ W+QCQPC +
Sbjct: 113 PPVAGLSSARGFVAPVVSRAPT-SGEYIAKIAVGTPGVEALLALDTASDLTWLQCQPCRR 171

Query: 190 CYKQSDPVFDPADSASFSGVSCSSAVCDRLENAG---CHAGRCRYEVSYGDGSYTKGTLA 246
           CY QS PVFDP  S S+  +S ++A C  L  +G      G C Y V YGDGS T G   
Sbjct: 172 CYPQSGPVFDPRHSTSYREMSFNAADCQALGRSGGGDAKRGTCVYTVGYGDGSTTVGDFI 231

Query: 247 LETLTI-GRTVVKNVAIGCGHKNQGMF-VGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCL 304
            ETLT  G   +  ++IGCGH N+G+F   AAG+LGLG G MS   Q+     G FSYCL
Sbjct: 232 EETLTFAGGVRLPRISIGCGHDNKGLFGAPAAGILGLGRGLMSFPNQI--DHNGTFSYCL 289

Query: 305 VS--RGTGS-SGSLVFGREAL----PVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIP- 356
           V    G GS S +L FG  A+    PV  ++ P V N   P+FYYV L+G+ VGG+R+P 
Sbjct: 290 VDFLSGPGSLSSTLTFGAGAVDTSPPV--SFTPTVLNLNMPTFYYVRLTGISVGGVRVPG 347

Query: 357 -ISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRAS--GVS-IFD 412
               DL      G  GV++D+GTAVTRL  PAY AFRDAF A   +L + S  G S  FD
Sbjct: 348 VTERDLQLDPYTGRGGVIVDSGTAVTRLARPAYTAFRDAFRAVAVDLGQVSIGGPSGFFD 407

Query: 413 TCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPS-PSGLSIIGN 471
           TCY + G    +VPTVS +F+G   + L   N+LIPVD  GT CFAFA +    +SIIGN
Sbjct: 408 TCYTVGGRGMKKVPTVSMHFAGSVEVKLQPKNYLIPVDSMGTVCFAFAATGDHSVSIIGN 467

Query: 472 IQQEGIQISFDGANGFVGFGPNVC 495
           IQQ+G +I +D   G VGF PN C
Sbjct: 468 IQQQGFRIVYD-IGGRVGFAPNSC 490


>gi|357521081|ref|XP_003630829.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355524851|gb|AET05305.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 526

 Score =  285 bits (729), Expect = 4e-74,   Method: Compositional matrix adjust.
 Identities = 141/351 (40%), Positives = 205/351 (58%), Gaps = 5/351 (1%)

Query: 140 DFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFD 199
           D    +  G+  G+  + V+IGVG PP+  YM+ D  +D  W+QCQPC +CY Q D +FD
Sbjct: 171 DLNASLNPGITTGTSNFLVQIGVGGPPQKFYMIFDLQTDFTWLQCQPCIKCYDQPDSIFD 230

Query: 200 PADSASFSGVSCSSAVCDRLENAGC-HAGRCRYEVSYGDGSYTKGTLALETLTIGRT-VV 257
           P+ S+S++ +SC +  C+ L N+ C   G CRY ++Y DG+ T+G L  ET++   +  V
Sbjct: 231 PSQSSSYTLLSCETKHCNLLPNSSCSDDGYCRYNITYKDGTNTEGVLINETVSFESSGWV 290

Query: 258 KNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVF 317
             V++GC +KNQG FVG+ G  GLG GS+S   ++      + SYCLV    G S S + 
Sbjct: 291 DRVSLGCSNKNQGPFVGSDGTFGLGRGSLSFPSRINA---SSMSYCLVESKDGYSSSTLE 347

Query: 318 GREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTG 377
                  G+    L++NP+A + YYVGL G+ VGG +I +    F +   G+ G+++ + 
Sbjct: 348 FNSPPCSGSVKAKLLQNPKAENLYYVGLKGIKVGGEKIDVPNSTFTIDPYGNGGMIVSSS 407

Query: 378 TAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPV 437
           + +T L    Y   RDAFVA+T +L R      FDTCYNLS   +V +P + F  + G  
Sbjct: 408 SLITMLENDTYNVVRDAFVAKTQHLERLKAFLQFDTCYNLSSNNTVELPILEFEVNDGKS 467

Query: 438 LTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFV 488
             LP  ++L  VD  GTFCFAFAPS    SI+G +QQ G +++FD  N FV
Sbjct: 468 WLLPKESYLYAVDKNGTFCFAFAPSKGSFSILGTLQQYGTRVTFDLVNSFV 518


>gi|242058537|ref|XP_002458414.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
 gi|241930389|gb|EES03534.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
          Length = 448

 Score =  281 bits (718), Expect = 7e-73,   Method: Compositional matrix adjust.
 Identities = 162/371 (43%), Positives = 213/371 (57%), Gaps = 20/371 (5%)

Query: 145 VVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSA 204
           V+SG+   SGEYF  I VG PP    +VID+GSD++W+QC PC  CY+Q  P++DP  S+
Sbjct: 77  VMSGVPFDSGEYFAVINVGDPPTRALVVIDTGSDLIWLQCVPCRHCYRQVTPLYDPRSSS 136

Query: 205 SFSGVSCSSAVC-DRLENAGCHA--GRCRYEVSYGDGSYTKGTLALETLTI-GRTVVKNV 260
           +   + C+S  C D L   GC A  G C Y V YGDGS + G LA + L     T V NV
Sbjct: 137 THRRIPCASPRCRDVLRYPGCDARTGGCVYMVVYGDGSASSGDLATDRLVFPDDTHVHNV 196

Query: 261 AIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCL---VSRGTGSSGSLVF 317
            +GCGH N G+   AAGLLG+G G +S   QL    G  FSYCL   +SR    S  LVF
Sbjct: 197 TLGCGHDNVGLLESAAGLLGVGRGQLSFPTQLAPAYGHVFSYCLGDRLSRAQNGSSYLVF 256

Query: 318 GREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIP--ISEDLFRLTQMGDDGVVMD 375
           GR   P   A+ PL  NPR PS YYV + G  VGG R+    +  L      G  G+V+D
Sbjct: 257 GRTPEPPSTAFTPLRTNPRRPSLYYVDMVGFSVGGERVTGFSNASLALNPATGRGGIVVD 316

Query: 376 TGTAVTRLPTPAYEAFRDAF---VAQTGNLPR-ASGVSIFDTCYNLSG----FVSVRVPT 427
           +GTA++R    AY A RDAF    A  G + + A+  S+FD CY+L G      +VRVP+
Sbjct: 317 SGTAISRFARDAYAAVRDAFDSHAAAAGTMRKLATKFSVFDACYDLRGNGAPAAAVRVPS 376

Query: 428 VSFYFSGGPVLTLPASNFLIPV---DDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGA 484
           +  +F+GG  + LP +N+LIPV   D    FC     +  GL+++GN+QQ+G  + FD  
Sbjct: 377 IVLHFAGGADMALPQANYLIPVQGGDRRTYFCLGLQAADDGLNVLGNVQQQGFGLVFDVE 436

Query: 485 NGFVGFGPNVC 495
            G +GF PN C
Sbjct: 437 RGRIGFTPNGC 447


>gi|414880708|tpg|DAA57839.1| TPA: hypothetical protein ZEAMMB73_997765 [Zea mays]
          Length = 452

 Score =  279 bits (714), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 162/369 (43%), Positives = 212/369 (57%), Gaps = 18/369 (4%)

Query: 145 VVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSA 204
           V+SG+   SGEYF  IGVG PP    +VID+GSD++W+QC PC +CY+Q  P++DP +S 
Sbjct: 81  VMSGVPFDSGEYFAVIGVGDPPTHALVVIDTGSDLIWLQCLPCRRCYRQVTPLYDPRNSK 140

Query: 205 SFSGVSCSSAVCD-RLENAGCHA--GRCRYEVSYGDGSYTKGTLALETLTI-GRTVVKNV 260
           +   + C+S  C   L   GC A  G C Y V YGDGS + G LA +TL +   T V NV
Sbjct: 141 THRRIPCASPQCRGVLRYPGCDARTGGCVYMVVYGDGSASSGDLATDTLVLPDDTRVHNV 200

Query: 261 AIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCL---VSRGTGSSGSLVF 317
            +GCGH N+G+   AAGLLG G G +S   QL    G  FSYCL   +SR   SS  LVF
Sbjct: 201 TLGCGHDNEGLLASAAGLLGAGRGQLSFPTQLAPAYGHVFSYCLGDRMSRARNSSSYLVF 260

Query: 318 GREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIP--ISEDLFRLTQMGDDGVVMD 375
           GR       A+ PL  NPR PS YYV + G  VGG R+    +  L      G  GVV+D
Sbjct: 261 GRTPELPSTAFTPLRTNPRRPSLYYVDMVGFSVGGERVAGFSNASLALNPATGRGGVVVD 320

Query: 376 TGTAVTRLPTPAYEAFRDAFV---AQTGNLPRASGVSIFDTCYNLSGF---VSVRVPTVS 429
           +GTA++R    AY A RDAFV   A  G     +  S+FDTCY++ G      VRVP++ 
Sbjct: 321 SGTAISRFTRDAYAAVRDAFVSHAAAAGMRRLRNKFSVFDTCYDVHGNGPGTGVRVPSIV 380

Query: 430 FYFSGGPVLTLPASNFLIPV---DDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANG 486
            +F+    + LP +N+LIPV   D    FC     +  GL+++GN+QQ+G  + FD   G
Sbjct: 381 LHFAAAADMALPQANYLIPVVGGDRRTYFCLGLQAADDGLNVLGNVQQQGFGVVFDVERG 440

Query: 487 FVGFGPNVC 495
            +GF PN C
Sbjct: 441 RIGFTPNGC 449


>gi|116789442|gb|ABK25248.1| unknown [Picea sitchensis]
          Length = 366

 Score =  279 bits (714), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 143/290 (49%), Positives = 194/290 (66%), Gaps = 8/290 (2%)

Query: 71  NTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHAR-------MQRDVKRVATLV 123
            T    + W++E+VHRD +   +       Y R       R       ++R ++R  TL 
Sbjct: 66  ETKPRRSPWSVEVVHRDALLLKNAANATASYERRLKEKLRREAVRVRGLERQIERTLTLN 125

Query: 124 RRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQ 183
           +       + A+ +  DFG +VVSGM+QGSGEYF RIGVG+P R QYMV+D+GSD+ W+Q
Sbjct: 126 KDPVNRYENVAEVDA-DFGGEVVSGMEQGSGEYFTRIGVGTPTREQYMVLDTGSDVAWIQ 184

Query: 184 CQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKG 243
           C+PC +CY Q+DP+F+P+ SASFS V C SAVC +L+   CH+G C YE SYGDGSY+ G
Sbjct: 185 CEPCRECYSQADPIFNPSYSASFSTVGCDSAVCSQLDAYDCHSGGCLYEASYGDGSYSTG 244

Query: 244 TLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYC 303
           + A ETLT G T V NVAIGCGHKN G+F+GAAGLLGLG G++S   Q+G QTG  FSYC
Sbjct: 245 SFATETLTFGTTSVANVAIGCGHKNVGLFIGAAGLLGLGAGALSFPNQIGTQTGHTFSYC 304

Query: 304 LVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGM 353
           LV R + SSG L FG +++PVG+ + PL +NP  P+FYY+ ++ + +  +
Sbjct: 305 LVDRESDSSGPLQFGPKSVPVGSIFTPLEKNPHLPTFYYLSVTAISISAI 354


>gi|148905906|gb|ABR16115.1| unknown [Picea sitchensis]
          Length = 482

 Score =  278 bits (712), Expect = 3e-72,   Method: Compositional matrix adjust.
 Identities = 152/393 (38%), Positives = 227/393 (57%), Gaps = 21/393 (5%)

Query: 112 MQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYM 171
            +RD  R+ T+  + SG     +   +Q       SG   G+G Y V  G G+P ++  +
Sbjct: 100 FERDNARLNTIRSKNSGPYTTMSNLPLQ-------SGTTVGTGNYIVTAGFGTPAKNSLL 152

Query: 172 VIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRL-----ENAGCHA 226
           +ID+GSD+ W+QC+PC+ CY Q D +F+P  S+S+  + C SA C  L         C  
Sbjct: 153 IIDTGSDLTWIQCKPCADCYSQVDAIFEPKQSSSYKTLPCLSATCTELITSESNPTPCLL 212

Query: 227 GRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSM 286
           G C YE++YGDGS ++G  + ETLT+G    +N A GCGH N G+F G++GLLGLG  S+
Sbjct: 213 GGCVYEINYGDGSSSQGDFSQETLTLGSDSFQNFAFGCGHTNTGLFKGSSGLLGLGQNSL 272

Query: 287 SLVGQLGGQTGGAFSYCLV-SRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGL 345
           S   Q   + GG F+YCL     + S+GS   G+ ++P  A + PLV N   P+FY+VGL
Sbjct: 273 SFPSQSKSKYGGQFAYCLPDFGSSTSTGSFSVGKGSIPASAVFTPLVSNFMYPTFYFVGL 332

Query: 346 SGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRA 405
           +G+ VGG R+ I   +     +G    ++D+GT +TRL   AY A + +F ++T +LP A
Sbjct: 333 NGISVGGDRLSIPPAV-----LGRGSTIVDSGTVITRLLPQAYNALKTSFRSKTRDLPSA 387

Query: 406 SGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGT-FCFAFAPSPS 464
              SI DTCY+LS    VR+PT++F+F     + +     L+PV + G+  C AFA +  
Sbjct: 388 KPFSILDTCYDLSRHSQVRIPTITFHFQNNADVAVSDVGILVPVQNGGSQVCLAFASASQ 447

Query: 465 --GLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
             G +IIGN QQ+ ++++FD   G +GF    C
Sbjct: 448 MDGFNIIGNFQQQRMRVAFDTGAGRIGFASGSC 480


>gi|449466304|ref|XP_004150866.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
 gi|449485213|ref|XP_004157102.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 473

 Score =  275 bits (703), Expect = 4e-71,   Method: Compositional matrix adjust.
 Identities = 163/441 (36%), Positives = 245/441 (55%), Gaps = 36/441 (8%)

Query: 71  NTSSDEARWNLELVHR-----DKMSSSSNTTNNMHYHRHQHSF-HARMQRDVKRVATLVR 124
           +TSS   + +LE++HR     D++S++      +   + +  F H+++  +++ V     
Sbjct: 53  HTSSLGEQSSLEVIHRHGPCGDEVSNAPTAAEMLVKDQSRVDFIHSKIAGELESV----D 108

Query: 125 RLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQC 184
           RL G  A     +         SG   GSG Y V +G+G+P +   ++ D+GSD+ W QC
Sbjct: 109 RLRGSKATKIPAK---------SGATIGSGNYIVSVGLGTPKKYLSLIFDTGSDLTWTQC 159

Query: 185 QPCSQ-CYKQSDPVFDPADSASFSGVSCSSAVCDRLENA-----GCHAGR-CRYEVSYGD 237
           QPC++ CY Q DPVF P+ S ++S +SCSS  C +LE+      GC A R C Y + YGD
Sbjct: 160 QPCARYCYNQKDPVFVPSQSTTYSNISCSSPDCSQLESGTGNQPGCSAARACIYGIQYGD 219

Query: 238 GSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQT 296
            S++ G  A ETLT+  T V++N   GCG  N+G+F  AAGL+GLG   +S+V Q   + 
Sbjct: 220 QSFSVGYFAKETLTLTSTDVIENFLFGCGQNNRGLFGSAAGLIGLGQDKISIVKQTAQKY 279

Query: 297 GGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIP 356
           G  FSYCL  + + S+G L FG         + P+ +     +FY V + G+ VGG +IP
Sbjct: 280 GQVFSYCL-PKTSSSTGYLTFGGGGGGGALKYTPITKAHGVANFYGVDIVGMKVGGTQIP 338

Query: 357 ISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYN 416
           IS  +F  +     G ++D+GT +TRLP  AY A + AF       P+A  +SI DTCY+
Sbjct: 339 ISSSVFSTS-----GAIIDSGTVITRLPPDAYSALKSAFEKGMAKYPKAPELSILDTCYD 393

Query: 417 LSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFA--PSPSGLSIIGNIQQ 474
           LS + ++++P V F F GG  L L     +     +   C AFA    PS ++IIGN+QQ
Sbjct: 394 LSKYSTIQIPKVGFVFKGGEELDLDGIGIMYGASTS-QVCLAFAGNQDPSTVAIIGNVQQ 452

Query: 475 EGIQISFDGANGFVGFGPNVC 495
           + +Q+ +D   G +GFG N C
Sbjct: 453 KTLQVVYDVGGGKIGFGYNGC 473


>gi|225430551|ref|XP_002283470.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 490

 Score =  275 bits (702), Expect = 5e-71,   Method: Compositional matrix adjust.
 Identities = 161/431 (37%), Positives = 232/431 (53%), Gaps = 24/431 (5%)

Query: 76  EARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHAR-MQRDVKRVATLVRRLSGGGADAA 134
           + R +LE+VH+    S       +  H+     H + + +D  RVA++  RL+   A  +
Sbjct: 72  DQRASLEVVHKHGPCS------KLRPHKANSPSHTQILAQDESRVASIQSRLAKNLAGGS 125

Query: 135 KHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC-SQCYKQ 193
             +         S    GSG Y V +G+GSP R    + D+GSD+ W QC+PC   CY+Q
Sbjct: 126 NLKASKATLPSKSASTLGSGNYVVTVGLGSPKRDLTFIFDTGSDLTWTQCEPCVGYCYQQ 185

Query: 194 SDPVFDPADSASFSGVSCSSAVCDRLENA-----GCHAGRCRYEVSYGDGSYTKGTLALE 248
            + +FDP+ S S+S VSC S  C++LE+A     GC +  C Y + YGDGSY+ G  A E
Sbjct: 186 REHIFDPSTSLSYSNVSCDSPSCEKLESATGNSPGCSSSTCLYGIRYGDGSYSIGFFARE 245

Query: 249 TLTIGRT-VVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR 307
            L++  T V  N   GCG  N+G+F G AGLLGL    +SLV Q   + G  FSYCL   
Sbjct: 246 KLSLTSTDVFNNFQFGCGQNNRGLFGGTAGLLGLARNPLSLVSQTAQKYGKVFSYCL-PS 304

Query: 308 GTGSSGSLVFGR-EALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQ 366
            + S+G L FG  +       + P   N   PSFY++ + G+ VG  ++PI + +F    
Sbjct: 305 SSSSTGYLSFGSGDGDSKAVKFTPSEVNSDYPSFYFLDMVGISVGERKLPIPKSVFSTA- 363

Query: 367 MGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVP 426
               G ++D+GT ++RLP   Y + +  F     + PR  GVSI DTCY+LS + +V+VP
Sbjct: 364 ----GTIIDSGTVISRLPPTVYSSVQKVFRELMSDYPRVKGVSILDTCYDLSKYKTVKVP 419

Query: 427 TVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFA--PSPSGLSIIGNIQQEGIQISFDGA 484
            +  YFSGG  + L A   +I V      C AFA       ++IIGN+QQ+ I + +D A
Sbjct: 420 KIILYFSGGAEMDL-APEGIIYVLKVSQVCLAFAGNSDDDEVAIIGNVQQKTIHVVYDDA 478

Query: 485 NGFVGFGPNVC 495
            G VGF P+ C
Sbjct: 479 EGRVGFAPSGC 489


>gi|357130848|ref|XP_003567056.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 448

 Score =  274 bits (700), Expect = 8e-71,   Method: Compositional matrix adjust.
 Identities = 154/370 (41%), Positives = 207/370 (55%), Gaps = 10/370 (2%)

Query: 136 HEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD 195
           H+     + V+SG+   SGEYF  +GVG+PP    +VID+GSD+VW+QC+PC  CY+Q  
Sbjct: 79  HDDDHLHSPVISGLPFASGEYFASVGVGTPPTPALLVIDTGSDVVWLQCKPCVHCYRQLS 138

Query: 196 PVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGR- 254
           P++DP  S++++   CS   C   +      G C Y + YGD S T G LA + L     
Sbjct: 139 PLYDPRGSSTYAQTPCSPPQCRNPQTCDGTTGGCGYRIVYGDASSTSGNLATDRLVFSND 198

Query: 255 TVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCL--VSRGTGSS 312
           T V NV +GCGH N+G+F  AAGLLG+  G+ S   Q+    G  F+YCL   +R   SS
Sbjct: 199 TSVGNVTLGCGHDNEGLFGSAAGLLGVARGNNSFATQVADSYGRYFAYCLGDRTRSGSSS 258

Query: 313 GSLVFGREAL-PVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIP--ISEDLFRLTQMGD 369
             LVFGR A  P  + + PL  NPR PS YYV + G  VGG  +    +  L      G 
Sbjct: 259 SYLVFGRTAPEPPSSVFTPLRSNPRRPSLYYVDMVGFSVGGEPVTGFSNASLSLDPATGR 318

Query: 370 DGVVMDTGTAVTRLPTPAYEAFRDAF---VAQTGNLPRASGVSIFDTCYNLSGFVSVRVP 426
            GVV+D+GT++TR    AY A RDAF    A+ G      G+S+FD CY+L G      P
Sbjct: 319 GGVVVDSGTSITRFARDAYGALRDAFDARAAKVGMRKVGRGISVFDACYDLRGVAVADAP 378

Query: 427 TVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAF-APSPSGLSIIGNIQQEGIQISFDGAN 485
            V  +F+GG  + LP  N+L+P +     CFA  A    GLS+IGN+ Q+  ++ FD  N
Sbjct: 379 GVVLHFAGGADVALPPENYLVPEESGRYHCFALEAAGHDGLSVIGNVLQQRFRVVFDVEN 438

Query: 486 GFVGFGPNVC 495
             VGF PN C
Sbjct: 439 ERVGFEPNGC 448


>gi|225430555|ref|XP_002285593.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
          Length = 481

 Score =  273 bits (699), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 154/441 (34%), Positives = 235/441 (53%), Gaps = 22/441 (4%)

Query: 65  NNISSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVR 124
           +++ S +   D+ R +LE++H+    S  +        R Q      + +D  RV ++  
Sbjct: 52  SSVCSPSPKGDDKRASLEVIHKHGPCSKLSQDKGRSPSRTQM-----LDQDESRVNSIRS 106

Query: 125 RLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQC 184
           RL+   AD  K +         SG   G+G Y V +G+G+P R    + D+GSD+ W QC
Sbjct: 107 RLAKNPADGGKLKGSKVTLPSKSGSTIGTGNYVVTVGLGTPKRDLTFIFDTGSDLTWTQC 166

Query: 185 QPCSQ-CYKQSDPVFDPADSASFSGVSCSSAVCDRLENA-----GCHAGRCRYEVSYGDG 238
           +PC++ CY Q +P+F+P+ S S++ +SCSS  CD L++       C A  C Y + YGD 
Sbjct: 167 EPCARYCYHQQEPIFNPSKSTSYTNISCSSPTCDELKSGTGNSPSCSASTCVYGIQYGDQ 226

Query: 239 SYTKGTLALETLTIGRT-VVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTG 297
           SY+ G  A + L +  T V  N   GCG  N+G+FVG AGL+GLG  ++SLV Q   + G
Sbjct: 227 SYSVGFFAQDKLALTSTDVFNNFLFGCGQNNRGLFVGVAGLIGLGRNALSLVSQTAQKYG 286

Query: 298 GAFSYCLVSRGTGSSGSLVFGREA-LPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIP 356
             FSYCL S  + S+G L FG          + P + N + PSFY++ L  + VGG ++ 
Sbjct: 287 KLFSYCLPSTSS-STGYLTFGSGGGTSKAVKFTPSLVNSQGPSFYFLNLIAISVGGRKLS 345

Query: 357 ISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYN 416
            S  +F        G ++D+GT ++RLP  AY   R +F  Q    P+A+  SI DTCY+
Sbjct: 346 TSASVFSTA-----GTIIDSGTVISRLPPTAYSDLRASFQQQMSKYPKAAPASILDTCYD 400

Query: 417 LSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFA--PSPSGLSIIGNIQQ 474
            S + +V VP ++ YFS G  + L  S     + +    C AFA     + ++I+GN+QQ
Sbjct: 401 FSQYDTVDVPKINLYFSDGAEMDLDPSGIFY-ILNISQVCLAFAGNSDATDIAILGNVQQ 459

Query: 475 EGIQISFDGANGFVGFGPNVC 495
           +   + +D A G +GF P  C
Sbjct: 460 KTFDVVYDVAGGRIGFAPGGC 480


>gi|224092220|ref|XP_002309515.1| predicted protein [Populus trichocarpa]
 gi|222855491|gb|EEE93038.1| predicted protein [Populus trichocarpa]
          Length = 473

 Score =  273 bits (699), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 165/452 (36%), Positives = 236/452 (52%), Gaps = 47/452 (10%)

Query: 58  NELFERHNNISSSNTSSDEARWNLELVHRD-------KMSSSSNTTNNMHYHRHQHSFHA 110
           N+ F+  N++S            LE+VHR            ++N  +NM           
Sbjct: 54  NQTFKVSNSLS------------LEVVHRSGPCIQVLNQEKAANAPSNMEI--------- 92

Query: 111 RMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQY 170
            + +D  RV ++  RLS  G    K         V SG   GSG+Y V +G+G+P +   
Sbjct: 93  -LLQDRHRVDSIHARLSSHGVFQEKQAT----LPVQSGASIGSGDYAVTVGLGTPKKEFT 147

Query: 171 MVIDSGSDIVWVQCQPCSQ-CYKQSDPVFDPADSASFSGVSCSSAVCDRLENAG---CHA 226
           ++ D+GSD+ W QC+PC++ CYKQ +P  DP  S S+  +SCSSA C  L+  G   C +
Sbjct: 148 LIFDTGSDLTWTQCEPCAKTCYKQKEPRLDPTKSTSYKNISCSSAFCKLLDTEGGESCSS 207

Query: 227 GRCRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQGMFVGAAGLLGLGGGS 285
             C Y+V YGDGSY+ G  A ETLT+  + V KN   GCG +N G+F GAAGLLGLG   
Sbjct: 208 PTCLYQVQYGDGSYSIGFFATETLTLSSSNVFKNFLFGCGQQNSGLFRGAAGLLGLGRTK 267

Query: 286 MSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGL 345
           +SL  Q   +    FSYCL +  + S G L FG +       + PL  + ++  FY + +
Sbjct: 268 LSLPSQTAQKYKKLFSYCLPA-SSSSKGYLSFGGQVSKT-VKFTPLSEDFKSTPFYGLDI 325

Query: 346 SGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRA 405
           + L VGG ++ I   +F  +     G V+D+GT +TRLP+ AY A   AF     + P  
Sbjct: 326 TELSVGGNKLSIDASIFSTS-----GTVIDSGTVITRLPSTAYSALSSAFQKLMTDYPST 380

Query: 406 SGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSG 465
            G SIFDTCY+ S   ++++P V   F GG  + +  S  L PV+     C AFA +   
Sbjct: 381 DGYSIFDTCYDFSKNETIKIPKVGVSFKGGVEMDIDVSGILYPVNGLKKVCLAFAGNGDD 440

Query: 466 L--SIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           +  +I GN QQ+  Q+ +D A G VGF P+ C
Sbjct: 441 VKAAIFGNTQQKTYQVVYDDAKGRVGFAPSGC 472


>gi|242073260|ref|XP_002446566.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
 gi|241937749|gb|EES10894.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
          Length = 452

 Score =  272 bits (696), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 174/434 (40%), Positives = 231/434 (53%), Gaps = 41/434 (9%)

Query: 79  WNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEV 138
             + L H D   +  N +      R     H RM R V R AT V+ ++GGG        
Sbjct: 40  LRVRLTHVD---AHGNYSRLQLLQRAARRSHHRMSRLVAR-ATGVKAVAGGG-------- 87

Query: 139 QDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVF 198
                D+   +  G+GE+ + + +G+P  S   ++D+GSD+VW QC+PC  C+KQS PVF
Sbjct: 88  -----DLQVPVHAGNGEFLMDVAIGTPALSYAAIVDTGSDLVWTQCKPCVDCFKQSTPVF 142

Query: 199 DPADSASFSGVSCSSAVCDRLENAGC-HAGRCRYEVSYGDGSYTKGTLALETLTIGRTVV 257
           DP+ S++++ V CSSA+C  L  + C  A +C Y  +YGD S T+G LA ET T+G+   
Sbjct: 143 DPSSSSTYATVPCSSALCSDLPTSTCTSASKCGYTYTYGDASSTQGVLASETFTLGKEKK 202

Query: 258 K--NVAIGCGHKNQGM-FVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS--RGTGSS 312
           K   VA GCG  N+G  F   AGL+GLG G +SLV QLG      FSYCL S   G G S
Sbjct: 203 KLPGVAFGCGDTNEGDGFTQGAGLVGLGRGPLSLVSQLGLD---KFSYCLTSLDDGDGKS 259

Query: 313 GSLVFG--------REALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRL 364
             L+ G            PV     PLV+NP  PSFYYV L+GL VG  RI +    F +
Sbjct: 260 PLLLGGSAAAISESAATAPV--QTTPLVKNPSQPSFYYVSLTGLTVGSTRITLPASAFAI 317

Query: 365 TQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSI-FDTCYN--LSGFV 421
              G  GV++D+GT++T L    Y A + AFVAQ   LP   G  I  D C+     G  
Sbjct: 318 QDDGTGGVIVDSGTSITYLELQGYRALKKAFVAQMA-LPTVDGSEIGLDLCFQGPAKGVD 376

Query: 422 SVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISF 481
            V+VP +  +F GG  L LPA N+++    +G  C   APS  GLSIIGN QQ+  Q  +
Sbjct: 377 EVQVPKLVLHFDGGADLDLPAENYMVLDSASGALCLTVAPS-RGLSIIGNFQQQNFQFVY 435

Query: 482 DGANGFVGFGPNVC 495
           D A   + F P  C
Sbjct: 436 DVAGDTLSFAPVQC 449


>gi|8979711|emb|CAB96832.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
           thaliana]
          Length = 446

 Score =  271 bits (694), Expect = 4e-70,   Method: Compositional matrix adjust.
 Identities = 158/397 (39%), Positives = 214/397 (53%), Gaps = 17/397 (4%)

Query: 109 HARMQR-DVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPR 167
           H  + R D  RV ++  +LS   A     E +        G   GSG Y V +G+G+P  
Sbjct: 56  HVEILRLDQARVNSIHSKLSKKLATDHVSESKSTDLPAKDGSTLGSGNYIVTVGLGTPKN 115

Query: 168 SQYMVIDSGSDIVWVQCQPCSQ-CYKQSDPVFDPADSASFSGVSCSSAVCDRLE----NA 222
              ++ D+GSD+ W QCQPC + CY Q +P+F+P+ S S+  VSCSSA C  L     NA
Sbjct: 116 DLSLIFDTGSDLTWTQCQPCVRTCYDQKEPIFNPSKSTSYYNVSCSSAACGSLSSATGNA 175

Query: 223 G-CHAGRCRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQGMFVGAAGLLG 280
           G C A  C Y + YGD S++ G LA E  T+  + V   V  GCG  NQG+F G AGLLG
Sbjct: 176 GSCSASNCIYGIQYGDQSFSVGFLAKEKFTLTNSDVFDGVYFGCGENNQGLFTGVAGLLG 235

Query: 281 LGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSF 340
           LG   +S   Q        FSYCL S  +  +G L FG   +     + P+       SF
Sbjct: 236 LGRDKLSFPSQTATAYNKIFSYCLPSSAS-YTGHLTFGSAGISRSVKFTPISTITDGTSF 294

Query: 341 YYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTG 400
           Y + +  + VGG ++PI   +F        G ++D+GT +TRLP  AY A R +F A+  
Sbjct: 295 YGLNIVAITVGGQKLPIPSTVFS-----TPGALIDSGTVITRLPPKAYAALRSSFKAKMS 349

Query: 401 NLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFA 460
             P  SGVSI DTC++LSGF +V +P V+F FSGG V+ L  S  +  V      C AFA
Sbjct: 350 KYPTTSGVSILDTCFDLSGFKTVTIPKVAFSFSGGAVVEL-GSKGIFYVFKISQVCLAFA 408

Query: 461 --PSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
                S  +I GN+QQ+ +++ +DGA G VGF PN C
Sbjct: 409 GNSDDSNAAIFGNVQQQTLEVVYDGAGGRVGFAPNGC 445


>gi|22326716|ref|NP_196638.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|18700103|gb|AAL77663.1| AT5g10770/T30N20_40 [Arabidopsis thaliana]
 gi|24111269|gb|AAN46758.1| At5g10770/T30N20_40 [Arabidopsis thaliana]
 gi|332004211|gb|AED91594.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 474

 Score =  271 bits (693), Expect = 5e-70,   Method: Compositional matrix adjust.
 Identities = 162/428 (37%), Positives = 225/428 (52%), Gaps = 23/428 (5%)

Query: 78  RWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQR-DVKRVATLVRRLSGGGADAAKH 136
           + +L + HR        T + ++  +     H  + R D  RV ++  +LS   A     
Sbjct: 59  KSSLHVTHRH------GTCSRLNNGKATSPDHVEILRLDQARVNSIHSKLSKKLATDHVS 112

Query: 137 EVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQ-CYKQSD 195
           E +        G   GSG Y V +G+G+P     ++ D+GSD+ W QCQPC + CY Q +
Sbjct: 113 ESKSTDLPAKDGSTLGSGNYIVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVRTCYDQKE 172

Query: 196 PVFDPADSASFSGVSCSSAVCDRLE----NAG-CHAGRCRYEVSYGDGSYTKGTLALETL 250
           P+F+P+ S S+  VSCSSA C  L     NAG C A  C Y + YGD S++ G LA E  
Sbjct: 173 PIFNPSKSTSYYNVSCSSAACGSLSSATGNAGSCSASNCIYGIQYGDQSFSVGFLAKEKF 232

Query: 251 TIGRT-VVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGT 309
           T+  + V   V  GCG  NQG+F G AGLLGLG   +S   Q        FSYCL S  +
Sbjct: 233 TLTNSDVFDGVYFGCGENNQGLFTGVAGLLGLGRDKLSFPSQTATAYNKIFSYCLPSSAS 292

Query: 310 GSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGD 369
             +G L FG   +     + P+       SFY + +  + VGG ++PI   +F       
Sbjct: 293 -YTGHLTFGSAGISRSVKFTPISTITDGTSFYGLNIVAITVGGQKLPIPSTVFST----- 346

Query: 370 DGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVS 429
            G ++D+GT +TRLP  AY A R +F A+    P  SGVSI DTC++LSGF +V +P V+
Sbjct: 347 PGALIDSGTVITRLPPKAYAALRSSFKAKMSKYPTTSGVSILDTCFDLSGFKTVTIPKVA 406

Query: 430 FYFSGGPVLTLPASNFLIPVDDAGTFCFAFA--PSPSGLSIIGNIQQEGIQISFDGANGF 487
           F FSGG V+ L  S  +  V      C AFA     S  +I GN+QQ+ +++ +DGA G 
Sbjct: 407 FSFSGGAVVEL-GSKGIFYVFKISQVCLAFAGNSDDSNAAIFGNVQQQTLEVVYDGAGGR 465

Query: 488 VGFGPNVC 495
           VGF PN C
Sbjct: 466 VGFAPNGC 473


>gi|356567196|ref|XP_003551807.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 490

 Score =  271 bits (692), Expect = 7e-70,   Method: Compositional matrix adjust.
 Identities = 160/423 (37%), Positives = 230/423 (54%), Gaps = 21/423 (4%)

Query: 72  TSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGG-G 130
           T   + + +LE+VH+    S  N  +        HS    + +D +RV  +  RLS   G
Sbjct: 63  TKGPKTKASLEVVHKHGPCSQLNDHDGKAKSTTPHS--DILNQDKERVKYINSRLSKNLG 120

Query: 131 ADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQ- 189
            D++  E+        SG   GSG YFV +G+G+P R   ++ D+GSD+ W QC+PC++ 
Sbjct: 121 QDSSVEELDSATLPAKSGSLIGSGNYFVVVGLGTPKRDLSLIFDTGSDLTWTQCEPCARS 180

Query: 190 CYKQSDPVFDPADSASFSGVSCSSAVCDRLENA-----GCHAGR--CRYEVSYGDGSYTK 242
           CYKQ D +FDP+ S S+S ++C+SA+C +L  A     GC A    C Y + YGD S++ 
Sbjct: 181 CYKQQDVIFDPSKSTSYSNITCTSALCTQLSTATGNDPGCSASTKACIYGIQYGDSSFSV 240

Query: 243 GTLALETLTIGRT-VVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFS 301
           G  + E LT+  T VV N   GCG  NQG+F G+AGL+GLG   +S V Q   +    FS
Sbjct: 241 GYFSRERLTVTATDVVDNFLFGCGQNNQGLFGGSAGLIGLGRHPISFVQQTAAKYRKIFS 300

Query: 302 YCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDL 361
           YCL S  + S+G L FG  A      + P     R  SFY + ++ + VGG+++P+S   
Sbjct: 301 YCLPSTSS-STGHLSFGPAATGRYLKYTPFSTISRGSSFYGLDITAIAVGGVKLPVSSST 359

Query: 362 FRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFV 421
           F        G ++D+GT +TRLP  AY A R AF       P A  +SI DTCY+LSG+ 
Sbjct: 360 F-----STGGAIIDSGTVITRLPPTAYGALRSAFRQGMSKYPSAGELSILDTCYDLSGYK 414

Query: 422 SVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPS--PSGLSIIGNIQQEGIQI 479
              +PT+ F F+GG  + LP    L  V      C AFA +   S ++I GN+QQ  I++
Sbjct: 415 VFSIPTIEFSFAGGVTVKLPPQGILF-VASTKQVCLAFAANGDDSDVTIYGNVQQRTIEV 473

Query: 480 SFD 482
            +D
Sbjct: 474 VYD 476


>gi|326491519|dbj|BAJ94237.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326524456|dbj|BAK00611.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 499

 Score =  270 bits (689), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 147/356 (41%), Positives = 205/356 (57%), Gaps = 13/356 (3%)

Query: 145 VVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS-QCYKQSDPVFDPADS 203
             SG    +G Y V +G+G+P     +V D+GSD  WVQC+PC  +CYKQ +P+FDPA S
Sbjct: 152 ATSGRAVSTGNYVVTVGLGTPASKYTVVFDTGSDTTWVQCRPCVVKCYKQKEPLFDPAKS 211

Query: 204 ASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIG 263
           ++++ VSC+ + C  L+  GC  G C Y V YGDGSYT G  A +TLTI    +K    G
Sbjct: 212 STYANVSCTDSACADLDTNGCTGGHCLYAVQYGDGSYTVGFFAQDTLTIAHDAIKGFRFG 271

Query: 264 CGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALP 323
           CG KN G+F   AGL+GLG G  SL  Q   + GGAF+YCL +  TG +G L FG  +  
Sbjct: 272 CGEKNNGLFGKTAGLMGLGRGKTSLTVQAYNKYGGAFAYCLPALTTG-TGYLDFGPGSAG 330

Query: 324 VGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRL 383
             A   P++ + +  +FYYVG++G+ VGG ++P++E +F        G ++D+GT +TRL
Sbjct: 331 NNARLTPMLTD-KGQTFYYVGMTGIRVGGQQVPVAESVFSTA-----GTLVDSGTVITRL 384

Query: 384 PTPAYEAFRDAF--VAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLP 441
           P  AY A   AF  V       +A G SI DTCY+ +G   V +PTVS  F GG  L + 
Sbjct: 385 PATAYTALSSAFDKVMLARGYKKAPGYSILDTCYDFTGLSDVELPTVSLVFQGGACLDVD 444

Query: 442 ASNFLIPVDDAGTFCFAFAPS--PSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
            S  +  + +A   C AFA +     ++I+GN QQ+   + +D     VGF P  C
Sbjct: 445 VSGIVYAISEA-QVCLAFASNGDDESVAIVGNTQQKTYGVLYDLGKKTVGFAPGSC 499


>gi|449453902|ref|XP_004144695.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-1-like, partial [Cucumis sativus]
          Length = 716

 Score =  269 bits (687), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 155/399 (38%), Positives = 221/399 (55%), Gaps = 23/399 (5%)

Query: 111 RMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQY 170
           R++R V R    + RL+     AA   V   G  V + +  G+GE+ +++ +GSPPRS  
Sbjct: 324 RLRRGVARGKNRLHRLNAMVLAAANATV---GDQVKAPVVAGNGEFLMKLAIGSPPRSFS 380

Query: 171 MVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCR 230
            ++D+GSD++W QC+PC QC+ QS P+FDP  S+SF  +SCSS +C  L  + C +  C 
Sbjct: 381 AIMDTGSDLIWTQCKPCQQCFDQSTPIFDPKQSSSFYKISCSSELCGALPTSTCSSDGCE 440

Query: 231 YEVSYGDGSYTKGTLALETLTIGRTVVKNVAI-----GCGHKNQGM-FVGAAGLLGLGGG 284
           Y  +YGD S T+G LA ET T G +    ++I     GCG+ N G  F   AGL+GLG G
Sbjct: 441 YLYTYGDSSSTQGVLAFETFTFGDSTEDQISIPGLGFGCGNDNNGDGFSQGAGLVGLGRG 500

Query: 285 SMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREA--LPVGA----AWVPLVRNPRAP 338
            +SLV QL  Q    F+YCL +       SL+ G  A   P  +       PL++NP  P
Sbjct: 501 PLSLVSQLKEQ---KFAYCLTAIDDSKPSSLLLGSLANITPKTSKDEMKTTPLIKNPSQP 557

Query: 339 SFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQ 398
           SFYY+ L G+ VGG ++ I +  F L   G  GV++D+GT +T +   A+ + ++ F+AQ
Sbjct: 558 SFYYLSLQGISVGGTQLSIPKSTFELHDDGSGGVIIDSGTTITYVENSAFTSLKNEFIAQ 617

Query: 399 TGNLP-RASGVSIFDTCYNL-SGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFC 456
             NLP   SG    D C+NL +G   V VP ++F+F G   L LP  N++I    AG  C
Sbjct: 618 M-NLPVDDSGTGGLDLCFNLPAGTNQVEVPKLTFHFKGAD-LELPGENYMIGDSKAGLLC 675

Query: 457 FAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
            A   S  G+SI GN+QQ+   +  D     + F P  C
Sbjct: 676 LAIG-SSRGMSIFGNLQQQNFMVVHDLQEETLSFLPTQC 713


>gi|222632750|gb|EEE64882.1| hypothetical protein OsJ_19741 [Oryza sativa Japonica Group]
          Length = 456

 Score =  269 bits (687), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 179/444 (40%), Positives = 248/444 (55%), Gaps = 39/444 (8%)

Query: 63  RHNNISSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATL 122
           +   +S +   ++ +  +  L HR+  + ++  ++ + +   + +  A         AT 
Sbjct: 41  QEQQLSLAAPRTNASTLHFRLAHREHFALNATASDLLAHLLARDAARAAALLAAPNNATR 100

Query: 123 VRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWV 182
            RR  G            F   ++SG+ QG+GEYF ++GVG+P  +  MV+D+GSD+VW 
Sbjct: 101 PRRRGG------------FAAPLLSGLPQGTGEYFAQVGVGTPATTALMVLDTGSDVVWA 148

Query: 183 QCQ---PCSQCYKQ-SDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGR--CRYEVSYG 236
             +   P  +  +Q S     PA +  ++   C + +C RL++AGC   R  C Y+V+YG
Sbjct: 149 PVRALPPLLRAVRQGSSTGAAPAPTPRWN---CVAPICRRLDSAGCDRRRNSCLYQVAYG 205

Query: 237 DGSYTKGTLALETLTIGRTV-VKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQ 295
           DGS T G  A ETLT  R   V+ VAIGCGH N+G+F+ A+GLLGLG G +S   Q+   
Sbjct: 206 DGSVTAGDFASETLTFARGARVQRVAIGCGHDNEGLFIAASGLLGLGRGRLSFPSQIARS 265

Query: 296 TGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRI 355
            G +FSYCLV R +                  W      PR  +FYYV L G  VGG R+
Sbjct: 266 FGRSFSYCLVDRTSSRRARPS---------RRWG---GTPRMATFYYVHLLGFSVGGARV 313

Query: 356 P-ISEDLFRLT-QMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRAS--GVSIF 411
             +S+   RL    G  GV++D+GT+VTRL  P YEA RDAF A    L R S  G S+F
Sbjct: 314 KGVSQSDLRLNPTTGRGGVILDSGTSVTRLARPVYEAVRDAFRAAAVGL-RVSPGGFSLF 372

Query: 412 DTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGN 471
           DTCYNLSG   V+VPTVS + +GG  + LP  N+LIPVD +GTFCFA A +  G+SIIGN
Sbjct: 373 DTCYNLSGRRVVKVPTVSMHLAGGASVALPPENYLIPVDTSGTFCFAMAGTDGGVSIIGN 432

Query: 472 IQQEGIQISFDGANGFVGFGPNVC 495
           IQQ+G ++ FDG    VGF P  C
Sbjct: 433 IQQQGFRVVFDGDAQRVGFVPKSC 456


>gi|242062704|ref|XP_002452641.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
 gi|241932472|gb|EES05617.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
          Length = 466

 Score =  269 bits (687), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 166/425 (39%), Positives = 230/425 (54%), Gaps = 33/425 (7%)

Query: 90  SSSSNTTNNMHYHRH-----------QHSFHARMQRDVKRVATLVRRLSGG-GADAAKHE 137
           S+S   T  +H HRH             S   R+QRD  R A + R+ SG  G D  + +
Sbjct: 56  STSGGITVPLH-HRHGPCSPVPSNKMPASLEERLQRDQLRAAYIKRKFSGAKGGDVEQSD 114

Query: 138 VQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPV 197
                T +  G    + EY + +G+GSP  +Q M +D+GSD+ WVQC+PCSQC+ + D +
Sbjct: 115 AATVPTTL--GTSLSTLEYVITVGIGSPAVTQTMSMDTGSDVSWVQCKPCSQCHSEVDSL 172

Query: 198 FDPADSASFSGVSCSSAVCDRLENA----GCHAGRCRYEVSYGDGSYTKGTLALETLTIG 253
           FDP+ S+++S  SCSSA C +L  +    GC + +C+Y VSY DGS T GT + +TLT+G
Sbjct: 173 FDPSASSTYSPFSCSSAACVQLSQSQQGNGCSSSQCQYIVSYVDGSSTTGTYSSDTLTLG 232

Query: 254 RTVVKNVAIGCGHKNQGMFVGAA-GLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSS 312
              +K    GC     G F     GL+GLGG + SLV Q  G  G AFSYCL     GSS
Sbjct: 233 SNAIKGFQFGCSQSESGGFSDQTDGLMGLGGDAQSLVSQTAGTFGKAFSYCLPPT-PGSS 291

Query: 313 GSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGV 372
           G L  G  A   G    P++R+ + P++Y V L  + VGG ++ I   +F        G 
Sbjct: 292 GFLTLG-AASRSGFVKTPMLRSTQIPTYYGVLLEAIRVGGQQLNIPTSVFSA------GS 344

Query: 373 VMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYF 432
           VMD+GT +TRLP  AY A   AF A     P A    I DTC++ SG  SV +P+V+  F
Sbjct: 345 VMDSGTVITRLPPTAYSALSSAFKAGMKKYPPAQPSGILDTCFDFSGQSSVSIPSVALVF 404

Query: 433 SGGPVLTLPASNFLIPVDDAGTFCFAFAPSP--SGLSIIGNIQQEGIQISFDGANGFVGF 490
           SGG V+ L  +  ++ +D+   +C AFA +   S L  IGN+QQ   ++ +D   G VGF
Sbjct: 405 SGGAVVNLDFNGIMLELDN---WCLAFAANSDDSSLGFIGNVQQRTFEVLYDVGGGAVGF 461

Query: 491 GPNVC 495
               C
Sbjct: 462 RAGAC 466


>gi|224053042|ref|XP_002297678.1| predicted protein [Populus trichocarpa]
 gi|222844936|gb|EEE82483.1| predicted protein [Populus trichocarpa]
          Length = 482

 Score =  268 bits (686), Expect = 4e-69,   Method: Compositional matrix adjust.
 Identities = 164/459 (35%), Positives = 235/459 (51%), Gaps = 46/459 (10%)

Query: 61  FERHNNISSSNTSSDEARWN-----LELVHRDKMSSSSNTTNN-MHYHRHQHSFHAR-MQ 113
           F+     +SS   S E RW      LE+ H+D  S      N  +  H     F  R +Q
Sbjct: 43  FQWKQGSNSSTCLSQETRWENGATILEMKHKDSCSGKILDWNKKLKKHLIMDDFQLRSLQ 102

Query: 114 RDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVI 173
             +K +      +SG   D    +  D    + SG+   +  Y V + +G   R   +++
Sbjct: 103 SRMKSI------ISGRNID----DSVDAPIPLTSGIRLQTLNYIVTVELGG--RKMTVIV 150

Query: 174 DSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENA-------GCHA 226
           D+GSD+ WVQCQPC +CY Q DPVF+P+ S S+  V CSS  C  L++A       G + 
Sbjct: 151 DTGSDLSWVQCQPCKRCYNQQDPVFNPSTSPSYRTVLCSSPTCQSLQSATGNLGVCGSNP 210

Query: 227 GRCRYEVSYGDGSYTKGTLALETLTIGR-TVVKNVAIGCGHKNQGMFVGAAGLLGLGGGS 285
             C Y V+YGDGSYT+G L  E L +G  T V N   GCG  NQG+F GA+GL+GLG  S
Sbjct: 211 PSCNYVVNYGDGSYTRGELGTEHLDLGNSTAVNNFIFGCGRNNQGLFGGASGLVGLGRSS 270

Query: 286 MSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFG------REALPVGAAWVPLVRNPRAPS 339
           +SL+ Q     GG FSYCL    T +SGSLV G      +   P+  ++  ++ NP+ P 
Sbjct: 271 LSLISQTSAMFGGVFSYCLPITETEASGSLVMGGNSSVYKNTTPI--SYTRMIPNPQLP- 327

Query: 340 FYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQT 399
           FY++ L+G+ VG + +       +    G DG+++D+GT +TRLP   Y+A +D FV Q 
Sbjct: 328 FYFLNLTGITVGSVAV-------QAPSFGKDGMMIDSGTVITRLPPSIYQALKDEFVKQF 380

Query: 400 GNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASN-FLIPVDDAGTFCFA 458
              P A    I DTC+NLSG+  V +P +  +F G   L +  +  F     DA   C A
Sbjct: 381 SGFPSAPAFMILDTCFNLSGYQEVEIPNIKMHFEGNAELNVDVTGVFYFVKTDASQVCLA 440

Query: 459 FA--PSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
            A     + + IIGN QQ+  ++ +D     +GF    C
Sbjct: 441 IASLSYENEVGIIGNYQQKNQRVIYDTKGSMLGFAAEAC 479


>gi|449523529|ref|XP_004168776.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
          Length = 461

 Score =  268 bits (685), Expect = 5e-69,   Method: Compositional matrix adjust.
 Identities = 156/408 (38%), Positives = 223/408 (54%), Gaps = 23/408 (5%)

Query: 102 HRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIG 161
           H    +   R++R V R    + RL+     AA   V   G  V + +  G+GE+ +++ 
Sbjct: 60  HVKNLTRFERLRRGVARGKNRLHRLNAMVLAAANATV---GDQVKAPVVAGNGEFLMKLA 116

Query: 162 VGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLEN 221
           +GSPPRS   ++D+GSD++W QC+PC QC+ QS P+FDP  S+SF  +SCSS +C  L  
Sbjct: 117 IGSPPRSFSAIMDTGSDLIWTQCKPCQQCFDQSTPIFDPKQSSSFYKISCSSELCGALPT 176

Query: 222 AGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAI-----GCGHKNQGM-FVGA 275
           + C +  C Y  +YGD S T+G LA ET T G +    ++I     GCG+ N G  F   
Sbjct: 177 STCSSDGCEYLYTYGDSSSTQGVLAFETFTFGDSTEDQISIPGLGFGCGNDNNGDGFSQG 236

Query: 276 AGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREA--LPVGA----AWV 329
           AGL+GLG G +SLV QL  Q    F+YCL +       SL+ G  A   P  +       
Sbjct: 237 AGLVGLGRGPLSLVSQLKEQ---KFAYCLTAIDDSKPSSLLLGSLANITPKTSKDEMKTT 293

Query: 330 PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYE 389
           PL++NP  PSFYY+ L G+ VGG ++ I +  F L   G  GV++D+GT +T +   A+ 
Sbjct: 294 PLIKNPSQPSFYYLSLQGISVGGTQLSIPKSTFELHDDGSGGVIIDSGTTITYVENSAFT 353

Query: 390 AFRDAFVAQTGNLP-RASGVSIFDTCYNL-SGFVSVRVPTVSFYFSGGPVLTLPASNFLI 447
           + ++ F+AQ  NLP   SG    D C+NL +G   V VP ++F+F G   L LP  N++I
Sbjct: 354 SLKNEFIAQM-NLPVDDSGTGGLDLCFNLPAGTNQVEVPKLTFHFKGAD-LELPGENYMI 411

Query: 448 PVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
               AG  C A   S  G+SI GN+QQ+   +  D     + F P  C
Sbjct: 412 GDSKAGLLCLAIG-SSRGMSIFGNLQQQNFMVVHDLQEETLSFLPTQC 458


>gi|18461217|dbj|BAB84414.1| chloroplast nucleoid DNA-binding protein cnd41-like [Oryza sativa
           Japonica Group]
          Length = 446

 Score =  268 bits (684), Expect = 6e-69,   Method: Compositional matrix adjust.
 Identities = 166/404 (41%), Positives = 225/404 (55%), Gaps = 27/404 (6%)

Query: 116 VKRVATLVRRLSGGGADAAKHE-----VQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQY 170
            KR + L +RL+   ADAA++           + V SG+   SGEYF  +GVG+P     
Sbjct: 44  AKRGSLLRQRLA---ADAARYASLVDATGRLHSPVFSGIPFESGEYFALVGVGTPSTKAM 100

Query: 171 MVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHA---- 226
           +VID+GSD+VW+QC PC +CY Q   VFDP  S+++  V CSS  C  L   GC +    
Sbjct: 101 LVIDTGSDLVWLQCSPCRRCYAQRGQVFDPRRSSTYRRVPCSSPQCRALRFPGCDSGGAA 160

Query: 227 -GRCRYEVSYGDGSYTKGTLALETLTIGR-TVVKNVAIGCGHKNQGMFVGAAGLLGLGGG 284
            G CRY V+YGDGS + G LA + L     T V NV +GCG  N+G+F  AAGLLG+G G
Sbjct: 161 GGGCRYMVAYGDGSSSTGDLATDKLAFANDTYVNNVTLGCGRDNEGLFDSAAGLLGVGRG 220

Query: 285 SMSLVGQLGGQTGGAFSYCLVSRGTGSSGS--LVFGREALPVGAAWVPLVRNPRAPSFYY 342
            +S+  Q+    G  F YCL  R + S+ S  LVFGR   P   A+  L+ NPR PS YY
Sbjct: 221 KISISTQVAPAYGSVFEYCLGDRTSRSTRSSYLVFGRTPEPPSTAFTALLSNPRRPSLYY 280

Query: 343 VGLSGLGVGGMRIP--ISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTG 400
           V ++G  VGG R+    +  L   T  G  GVV+D+GTA++R    AY A RDAF A+  
Sbjct: 281 VDMAGFSVGGERVTGFSNASLALDTATGRGGVVVDSGTAISRFARDAYAALRDAFDARAR 340

Query: 401 NLPRASGV---SIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVD----DAG 453
                      S+FD CY+L G  +   P +  +F+GG  + LP  N+ +PVD     A 
Sbjct: 341 AAGMRRLAGEHSVFDACYDLRGRPAASAPLIVLHFAGGADMALPPENYFLPVDGGRRRAA 400

Query: 454 TF--CFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           ++  C  F  +  GLS+IGN+QQ+G ++ FD     +GF P  C
Sbjct: 401 SYRRCLGFEAADDGLSVIGNVQQQGFRVVFDVEKERIGFAPKGC 444


>gi|326520736|dbj|BAJ92731.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 499

 Score =  267 bits (683), Expect = 7e-69,   Method: Compositional matrix adjust.
 Identities = 147/356 (41%), Positives = 204/356 (57%), Gaps = 13/356 (3%)

Query: 145 VVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS-QCYKQSDPVFDPADS 203
             SG    +G Y V +G+G+P     +V D+GSD  WVQC+PC  +CYKQ  P+FDPA S
Sbjct: 152 ATSGRAVSTGNYVVTVGLGTPASKYTVVFDTGSDTTWVQCRPCVVKCYKQKGPLFDPAKS 211

Query: 204 ASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIG 263
           ++++ VSC+ + C  L+  GC  G C Y V YGDGSYT G  A +TLTI    +K    G
Sbjct: 212 STYANVSCTDSACADLDTNGCTGGHCLYAVQYGDGSYTVGFFAQDTLTIAHDAIKGFRFG 271

Query: 264 CGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALP 323
           CG KN G+F   AGL+GLG G  SL  Q   + GGAF+YCL +  TG +G L FG  +  
Sbjct: 272 CGEKNNGLFGKTAGLMGLGRGKTSLTVQAYNKYGGAFAYCLPALTTG-TGYLDFGPGSAG 330

Query: 324 VGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRL 383
             A   P++ + +  +FYYVG++G+ VGG ++P++E +F        G ++D+GT +TRL
Sbjct: 331 NNARLTPMLTD-KGQTFYYVGMTGIRVGGQQVPVAESVFSTA-----GTLVDSGTVITRL 384

Query: 384 PTPAYEAFRDAF--VAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLP 441
           P  AY A   AF  V       +A G SI DTCY+ +G   V +PTVS  F GG  L + 
Sbjct: 385 PATAYTALSSAFDKVMLARGYKKAPGYSILDTCYDFTGLSDVELPTVSLVFQGGACLDVD 444

Query: 442 ASNFLIPVDDAGTFCFAFAPS--PSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
            S  +  + +A   C AFA +     ++I+GN QQ+   + +D     VGF P  C
Sbjct: 445 VSGIVYAISEA-QVCLAFASNGDDESVAIVGNTQQKTYGVLYDLGKKTVGFAPGSC 499


>gi|297811185|ref|XP_002873476.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297319313|gb|EFH49735.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 475

 Score =  267 bits (682), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 161/432 (37%), Positives = 226/432 (52%), Gaps = 34/432 (7%)

Query: 89  MSSSSNTTNNMHYHRHQHSFHARMQR--------------DVKRVATLVRRLSGGGADAA 134
           +S  ++TT +  +  H+H   +R+                D  RV ++  +LS       
Sbjct: 52  LSPRASTTKSSLHVTHRHGTCSRLNNGKATSPDHVEILRLDQARVNSIHSKLSK--KLTT 109

Query: 135 KHEVQDFGTDVVS--GMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQ-CY 191
            H  Q   TD+ +  G   GSG Y V +G+G+P     ++ D+GSD+ W QCQPC + CY
Sbjct: 110 NHVSQSQSTDLPAKDGSTLGSGNYIVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVRTCY 169

Query: 192 KQSDPVFDPADSASFSGVSCSSAVCDRLE----NAG-CHAGRCRYEVSYGDGSYTKGTLA 246
            Q +P+F+P+ S S+  VSCSSA C  L     NAG C A  C Y + YGD S++ G LA
Sbjct: 170 DQKEPIFNPSKSTSYYNVSCSSAACGSLSSATGNAGSCSASNCIYGIQYGDQSFSVGFLA 229

Query: 247 LETLTIGRT-VVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLV 305
            +  T+  + V   V  GCG  NQG+F G AGLLGLG   +S   Q        FSYCL 
Sbjct: 230 KDKFTLTSSDVFDGVYFGCGENNQGLFTGVAGLLGLGRDKLSFPSQTATAYNKIFSYCLP 289

Query: 306 SRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLT 365
           S  +  +G L FG   +     + P+       SFY + +  + VGG ++PI   +F   
Sbjct: 290 SSAS-YTGHLTFGSAGISRSVKFTPISTITDGTSFYGLNIVAITVGGQKLPIPSTVFS-- 346

Query: 366 QMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRV 425
                G ++D+GT +TRLP  AY A R +F A+    P  SGVSI DTC++LSGF +V +
Sbjct: 347 ---TPGALIDSGTVITRLPPKAYAALRSSFKAKMSKYPTTSGVSILDTCFDLSGFKTVTI 403

Query: 426 PTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFA--PSPSGLSIIGNIQQEGIQISFDG 483
           P V+F FSGG V+ L +             C AFA     S  +I GN+QQ+ +++ +DG
Sbjct: 404 PKVAFSFSGGAVVELGSKGIFYAF-KISQVCLAFAGNSDDSNAAIFGNVQQQTLEVVYDG 462

Query: 484 ANGFVGFGPNVC 495
           A G VGF PN C
Sbjct: 463 AGGRVGFAPNGC 474


>gi|125527523|gb|EAY75637.1| hypothetical protein OsI_03542 [Oryza sativa Indica Group]
          Length = 446

 Score =  266 bits (681), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 165/404 (40%), Positives = 224/404 (55%), Gaps = 27/404 (6%)

Query: 116 VKRVATLVRRLSGGGADAAKHE-----VQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQY 170
            KR + L +RL+   ADAA++           + V SG+   SGEYF  +GVG+P     
Sbjct: 44  AKRGSLLRQRLA---ADAARYASLVDATGRLHSPVFSGIPFESGEYFALVGVGTPSTKAM 100

Query: 171 MVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHA---- 226
           +VID+GSD+VW+QC PC +CY Q   VFDP  S+++  V CSS  C  L   GC +    
Sbjct: 101 LVIDTGSDLVWLQCSPCRRCYAQRGQVFDPRRSSTYRRVPCSSPQCRALRFPGCDSGGAA 160

Query: 227 -GRCRYEVSYGDGSYTKGTLALETLTIGR-TVVKNVAIGCGHKNQGMFVGAAGLLGLGGG 284
            G CRY V+YGDGS + G LA + L     T V NV +GCG  N+G+F  AAGLLG+  G
Sbjct: 161 GGGCRYMVAYGDGSSSTGELATDKLAFANDTYVNNVTLGCGRDNEGLFDSAAGLLGVARG 220

Query: 285 SMSLVGQLGGQTGGAFSYCLVSRGTGSSGS--LVFGREALPVGAAWVPLVRNPRAPSFYY 342
            +S+  Q+    G  F YCL  R + S+ S  LVFGR   P   A+  L+ NPR PS YY
Sbjct: 221 KISISTQVAPAYGSVFEYCLGDRTSRSTRSSYLVFGRTPEPPSTAFTALLSNPRRPSLYY 280

Query: 343 VGLSGLGVGGMRIP--ISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTG 400
           V ++G  VGG R+    +  L   T  G  GVV+D+GTA++R    AY A RDAF A+  
Sbjct: 281 VDMAGFSVGGERVTGFSNASLALDTATGRGGVVVDSGTAISRFARDAYAALRDAFDARAR 340

Query: 401 NLPRASGV---SIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVD----DAG 453
                      S+FD CY+L G  +   P +  +F+GG  + LP  N+ +PVD     A 
Sbjct: 341 AAGMRRLAGEHSVFDACYDLRGRPAASAPLIVLHFAGGADMALPPENYFLPVDGGRRRAA 400

Query: 454 TF--CFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           ++  C  F  +  GLS+IGN+QQ+G ++ FD     +GF P  C
Sbjct: 401 SYRRCLGFEAADDGLSVIGNVQQQGFRVVFDVEKERIGFAPKGC 444


>gi|147866052|emb|CAN80962.1| hypothetical protein VITISV_022007 [Vitis vinifera]
          Length = 150

 Score =  265 bits (678), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 125/147 (85%), Positives = 141/147 (95%)

Query: 349 GVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGV 408
           GVGG+R+PISE++FRLT++GD GVVMDTGTAVTRLPT AY+AFRDAF+AQT NLPRA+GV
Sbjct: 4   GVGGIRVPISEEVFRLTELGDGGVVMDTGTAVTRLPTLAYQAFRDAFLAQTANLPRATGV 63

Query: 409 SIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSI 468
           +IFDTCY+L GFVSVRVPTVSFYFSGGP+LTLPA NFLIP+DDAGTFCFAFAPS SGLSI
Sbjct: 64  AIFDTCYDLLGFVSVRVPTVSFYFSGGPILTLPARNFLIPMDDAGTFCFAFAPSTSGLSI 123

Query: 469 IGNIQQEGIQISFDGANGFVGFGPNVC 495
           +GNIQQEGIQISFDGANG+VGFGPN+C
Sbjct: 124 LGNIQQEGIQISFDGANGYVGFGPNIC 150


>gi|356527089|ref|XP_003532146.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 488

 Score =  265 bits (676), Expect = 5e-68,   Method: Compositional matrix adjust.
 Identities = 154/419 (36%), Positives = 228/419 (54%), Gaps = 22/419 (5%)

Query: 76  EARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGG-GADAA 134
           + + +LE+VH+    S  N  +     +  HS    + +D +RV  +  R+S   G D++
Sbjct: 66  KRKASLEVVHKHGPCSQLNNHDGKAKSKTPHS--EILNQDKERVKYINSRISKNLGQDSS 123

Query: 135 KHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQ-CYKQ 193
             E+        SG   GSG YFV +G+G+P R   ++ D+GSD+ W QC+PC++ CYKQ
Sbjct: 124 VSELDSVTLPAKSGSLIGSGNYFVVVGLGTPKRDLSLIFDTGSDLTWTQCEPCARSCYKQ 183

Query: 194 SDPVFDPADSASFSGVSCSSAVCDRLENA-----GCHAGR--CRYEVSYGDGSYTKGTLA 246
            D +FDP+ S S+S ++C+S +C +L  A     GC A    C Y + YGD S++ G  +
Sbjct: 184 QDAIFDPSKSTSYSNITCTSTLCTQLSTATGNEPGCSASTKACIYGIQYGDSSFSVGYFS 243

Query: 247 LETLTIGRT-VVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLV 305
            E L++  T +V N   GCG  NQG+F G+AGL+GLG   +S V Q        FSYCL 
Sbjct: 244 RERLSVTATDIVDNFLFGCGQNNQGLFGGSAGLIGLGRHPISFVQQTAAVYRKIFSYCLP 303

Query: 306 SRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLT 365
           +  + S+G L FG         + P     R  SFY + ++G+ VGG ++P+S   F   
Sbjct: 304 ATSS-STGRLSFGTTTTSY-VKYTPFSTISRGSSFYGLDITGISVGGAKLPVSSSTF--- 358

Query: 366 QMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRV 425
                G ++D+GT +TRLP  AY A R AF       P A  +SI DTCY+LSG+    +
Sbjct: 359 --STGGAIIDSGTVITRLPPTAYTALRSAFRQGMSKYPSAGELSILDTCYDLSGYEVFSI 416

Query: 426 PTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPS--PSGLSIIGNIQQEGIQISFD 482
           P + F F+GG  + LP    L  V  A   C AFA +   S ++I GN+QQ+ I++ +D
Sbjct: 417 PKIDFSFAGGVTVQLPPQGILY-VASAKQVCLAFAANGDDSDVTIYGNVQQKTIEVVYD 474


>gi|242066142|ref|XP_002454360.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
 gi|241934191|gb|EES07336.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
          Length = 466

 Score =  265 bits (676), Expect = 5e-68,   Method: Compositional matrix adjust.
 Identities = 167/425 (39%), Positives = 237/425 (55%), Gaps = 30/425 (7%)

Query: 90  SSSSNTTNNMHYHRH----------QHSFHARMQRDVKRVATLVRRLSGGGADAAKH--- 136
           SS+   T  +H HRH            +   R+ RD  R A + R+ SGGG + ++    
Sbjct: 53  SSTGAATVPLH-HRHGPCSPLPTKKMPTLEERLHRDQLRAAYIQRKFSGGGVNGSRGGAG 111

Query: 137 EVQDFGTDVVS--GMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQS 194
           +VQ     V +  G    + EY + + +GSP +SQ M+ID+GSD+ WVQC+PCSQC+ Q+
Sbjct: 112 DVQQSHATVPTTLGTSLDTLEYLITVRLGSPGKSQTMLIDTGSDVSWVQCKPCSQCHSQA 171

Query: 195 DPVFDPADSASFSGVSCSSAVCDRL--ENAGCHAGRCRYEVSYGDGSYTKGTLALETLTI 252
           DP+FDP+ S+++S  SCSSA C +L  E  GC + +C+Y V+YGDGS T GT + +TL +
Sbjct: 172 DPLFDPSSSSTYSPFSCSSAACAQLGQEGNGCSSSQCQYTVTYGDGSSTTGTYSSDTLAL 231

Query: 253 GRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSS 312
           G   V+    GC +   G      GL+GLGGG+ SLV Q  G  G AFSYCL +  + SS
Sbjct: 232 GSNAVRKFQFGCSNVESGFNDQTDGLMGLGGGAQSLVSQTAGTFGAAFSYCLPAT-SSSS 290

Query: 313 GSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGV 372
           G L  G  A   G    P++R+ + P+FY V +  + VGG ++ I   +F        G 
Sbjct: 291 GFLTLG--AGTSGFVKTPMLRSSQVPTFYGVRIQAIRVGGRQLSIPTSVF------SAGT 342

Query: 373 VMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYF 432
           +MD+GT +TRLP  AY A   AF A     P A    I DTC++ SG  SV +PTV+  F
Sbjct: 343 IMDSGTVLTRLPPTAYSALSSAFKAGMKQYPSAPPSGILDTCFDFSGQSSVSIPTVALVF 402

Query: 433 SGGPVLTLPASNFLIPVDDAGTFCFAFAPSP--SGLSIIGNIQQEGIQISFDGANGFVGF 490
           SGG V+ + +   ++   ++   C AFA +   S L IIGN+QQ   ++ +D   G VGF
Sbjct: 403 SGGAVVDIASDGIMLQTSNS-ILCLAFAANSDDSSLGIIGNVQQRTFEVLYDVGGGAVGF 461

Query: 491 GPNVC 495
               C
Sbjct: 462 KAGAC 466


>gi|242084336|ref|XP_002442593.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
 gi|241943286|gb|EES16431.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
          Length = 482

 Score =  264 bits (675), Expect = 7e-68,   Method: Compositional matrix adjust.
 Identities = 182/428 (42%), Positives = 248/428 (57%), Gaps = 43/428 (10%)

Query: 80  NLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQ 139
            + LVHRD  + +++  + +           R+QRD++R A ++ +       AA     
Sbjct: 67  QVRLVHRDSFAVNASAADLLA---------RRLQRDMRRAAWIITK-------AATPADP 110

Query: 140 DFGTDVVSGMDQGSGEYFVRIGVGSPPRSQ-----YMVIDSGSDIVWVQCQPCSQCYKQS 194
           + GT VV+G    SGEY  +I VG+P  +       +  D GSD+ W+QC PC +CY Q 
Sbjct: 111 ENGT-VVTGAPT-SGEYIAKITVGTPYENDSSFEALLSPDMGSDVTWLQCMPCFRCYHQP 168

Query: 195 DPVFDPADSASFSGVSCSSAVCDRL-ENAGC--HAGRCRYEVSYGDGSYTKGTLALETLT 251
            PV++   S+S S V C +  C  L  + GC      C+Y+V YGDGS + G   +ETLT
Sbjct: 169 GPVYNRLKSSSASDVGCYAPACRALGSSGGCVQFLNECQYKVEYGDGSSSAGDFGVETLT 228

Query: 252 IGRTV-VKNVAIGCGHKNQGMFVG-AAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGT 309
               V V  VAIGCG  NQG+F   AAG+LGLG GS+S   Q+ G+ G +FSYCL  +GT
Sbjct: 229 FPPGVRVPGVAIGCGSDNQGLFPAPAAGILGLGRGSLSFPSQIAGRYGRSFSYCLAGQGT 288

Query: 310 -GSSGSLVFGREA-----LPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIP-ISEDLF 362
            G S +L FG  A          ++ P++ N R  +FYYVGL G+ VGG+R+  ++E   
Sbjct: 289 GGRSSTLTFGSGASATTTTTTPPSFTPMLTNSRMYTFYYVGLVGISVGGVRVRGVTESDL 348

Query: 363 RL-TQMGDDGVVMDTGTAVTRLPTPAYEAFRDAF-VAQTGNL--PRASG-VSIFDTCY-N 416
           RL    G  GV++D+GTAVTRL  PAY AFRDAF VA    L  P   G  + FDTCY +
Sbjct: 349 RLDPSTGHGGVIVDSGTAVTRLSGPAYAAFRDAFRVAAVKELGWPSPGGPFAFFDTCYSS 408

Query: 417 LSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVD-DAGTFCFAFAPS-PSGLSIIGNIQQ 474
           + G V  +VP VS +F+GG  + LP  N+LIPVD + GT CFAFA S   G+SIIGNIQ 
Sbjct: 409 VRGRVMKKVPAVSMHFAGGVEVKLPPQNYLIPVDSNKGTMCFAFAGSGDRGVSIIGNIQL 468

Query: 475 EGIQISFD 482
           +G ++ +D
Sbjct: 469 QGFRVVYD 476


>gi|226530755|ref|NP_001150335.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
 gi|195638492|gb|ACG38714.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
          Length = 461

 Score =  263 bits (672), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 167/423 (39%), Positives = 231/423 (54%), Gaps = 30/423 (7%)

Query: 90  SSSSNTTNNMHYHRH----------QHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQ 139
           SSS+       +HRH            +    + RD  R A + R+ SGGG      +  
Sbjct: 52  SSSAGAATVPLHHRHGPCSPLPTKKMPTLEETLHRDQLRAAYIQRKFSGGGGAGGDVQRS 111

Query: 140 DFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFD 199
           D       G    + EY + +G+GSP  SQ M+ID+GSD+ WVQC+PCSQC+ Q+DP+FD
Sbjct: 112 DATVPTALGTSLNTLEYLITVGLGSPATSQTMLIDTGSDVSWVQCKPCSQCHSQADPLFD 171

Query: 200 PADSASFSGVSCSSAVCDRL--ENAGC-HAGRCRYEVSYGDGSYTKGTLALETLTIGRTV 256
           P+ S+++S  SC SA C +L  E  GC  + +C+Y V+YGDGS T GT + +TL +G + 
Sbjct: 172 PSSSSTYSPFSCGSAACAQLGQEGNGCSSSSQCQYIVTYGDGSSTTGTYSSDTLALGSSA 231

Query: 257 VKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLV 316
           VK+   GC +   G      GL+GLGGG+ SLV Q  G  G AFSYCL    + SSG L 
Sbjct: 232 VKSFQFGCSNVESGFNDQTDGLMGLGGGAQSLVSQTAGTLGRAFSYCLPPTPS-SSGFLT 290

Query: 317 FGREALPVGAAWV--PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVM 374
            G       + +V  P++R+ + P+FY V L  + VGG ++ I   +F        G VM
Sbjct: 291 LGAAGGSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVFSA------GTVM 344

Query: 375 DTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSG 434
           D+GT +TRLP  AY A   AF A     P A    I DTC++ SG  SV +P+V+  FSG
Sbjct: 345 DSGTVITRLPPTAYSALSSAFKAGMKQYPPAQPSGILDTCFDFSGQSSVSIPSVALVFSG 404

Query: 435 GPVLTLPASNFLIPVDDAGTFCFAFAPSP--SGLSIIGNIQQEGIQISFDGANGFVGFGP 492
           G V++L AS  ++      + C AFA +   S L IIGN+QQ   ++ +D   G VGF  
Sbjct: 405 GAVVSLDASGIIL------SNCLAFAANSDDSSLGIIGNVQQRTFEVLYDVGRGVVGFRA 458

Query: 493 NVC 495
             C
Sbjct: 459 GAC 461


>gi|357167693|ref|XP_003581287.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 468

 Score =  263 bits (671), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 160/414 (38%), Positives = 222/414 (53%), Gaps = 31/414 (7%)

Query: 102 HRHQHSFHARMQ-------RDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSG 154
           H   H  + ++Q       R   R++ LV R + G   AA         D+   +  G+G
Sbjct: 63  HVDAHGNYTKLQLLRRAARRSHHRMSRLVARTATGSVKAAA------APDLQVPVHAGNG 116

Query: 155 EYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSA 214
           E+ + + +G+P  +   ++D+GSD+VW QC+PC +C+ QS PVFDP+ S+++S + CSS+
Sbjct: 117 EFLMDMSIGTPALAYAAIVDTGSDLVWTQCKPCVECFNQSTPVFDPSSSSTYSTLPCSSS 176

Query: 215 VCDRLENAGC--HAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGM- 271
           +C  L  + C   A  C Y  +YGD S T+G LA ET T+ +T +  VA GCG  N+G  
Sbjct: 177 LCSDLPTSTCTSAAKDCGYTYTYGDASSTQGVLAAETFTLAKTKLPGVAFGCGDTNEGDG 236

Query: 272 FVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREAL----PVGAA 327
           F   AGL+GLG G +SLV QLG    G FSYCL S    S   L+ G  A        AA
Sbjct: 237 FTQGAGLVGLGRGPLSLVSQLG---LGKFSYCLTSLDDTSKSPLLLGSLAAISTDTASAA 293

Query: 328 WV---PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLP 384
            +   PL++NP  PSFYYV L  L VG  RIP+    F +   G  GV++D+GT++T L 
Sbjct: 294 AIQTTPLIKNPSQPSFYYVTLKALTVGSTRIPLPGSAFAVQDDGTGGVIVDSGTSITYLE 353

Query: 385 TPAYEAFRDAFVAQTGNLPRASGVSI-FDTCYN--LSGFVSVRVPTVSFYFSGGPVLTLP 441
              Y   + AF AQ   LP A G ++  D C+    SG   V VP +  +F GG  L LP
Sbjct: 354 LQGYRPLKKAFAAQM-KLPVADGSAVGLDLCFKAPASGVDDVEVPKLVLHFDGGADLDLP 412

Query: 442 ASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           A N+++    +G  C     S  GLSIIGN QQ+ IQ  +D     + F P  C
Sbjct: 413 AENYMVLDSASGALCLTVMGS-RGLSIIGNFQQQNIQFVYDVDKDTLSFAPVQC 465


>gi|356508918|ref|XP_003523200.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 453

 Score =  262 bits (670), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 156/395 (39%), Positives = 226/395 (57%), Gaps = 20/395 (5%)

Query: 111 RMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQY 170
           R+Q  +KR  + ++RL+     A+  + +D    + + +  G+GEY + + +G+PP S  
Sbjct: 66  RVQHGIKRGKSRLQRLNAMVLAASTLDSED---QLEAPIHAGNGEYLMELAIGTPPVSYP 122

Query: 171 MVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCR 230
            V+D+GSD++W QC+PC+QCYKQ  P+FDP  S+SFS VSC S++C  + ++ C  G C 
Sbjct: 123 AVLDTGSDLIWTQCKPCTQCYKQPTPIFDPKKSSSFSKVSCGSSLCSAVPSSTCSDG-CE 181

Query: 231 YEVSYGDGSYTKGTLALETLTIGRTV----VKNVAIGCGHKNQGM-FVGAAGLLGLGGGS 285
           Y  SYGD S T+G LA ET T G++     V N+  GCG  N+G  F  A+GL+GLG G 
Sbjct: 182 YVYSYGDYSMTQGVLATETFTFGKSKNKVSVHNIGFGCGEDNEGDGFEQASGLVGLGRGP 241

Query: 286 MSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWV---PLVRNPRAPSFYY 342
           +SLV QL       FSYCL          L+ G       A  V   PL++NP  PSFYY
Sbjct: 242 LSLVSQLKEP---RFSYCLTPMDDTKESILLLGSLGKVKDAKEVVTTPLLKNPLQPSFYY 298

Query: 343 VGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNL 402
           + L G+ VG  R+ I +  F +   G+ GV++D+GT +T +   A+EA +  F++QT  L
Sbjct: 299 LSLEGISVGDTRLSIEKSTFEVGDDGNGGVIIDSGTTITYIEQKAFEALKKEFISQT-KL 357

Query: 403 PRASGVSI-FDTCYNL-SGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFA 460
           P     S   D C++L SG   V +P + F+F GG  L LPA N++I   + G  C A  
Sbjct: 358 PLDKTSSTGLDLCFSLPSGSTQVEIPKIVFHFKGGD-LELPAENYMIGDSNLGVACLAMG 416

Query: 461 PSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
            S SG+SI GN+QQ+ I ++ D     + F P  C
Sbjct: 417 AS-SGMSIFGNVQQQNILVNHDLEKETISFVPTSC 450


>gi|326494754|dbj|BAJ94496.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326514480|dbj|BAJ96227.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 449

 Score =  262 bits (669), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 146/355 (41%), Positives = 199/355 (56%), Gaps = 17/355 (4%)

Query: 152 GSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSC 211
           G+GE+ + + +G+P  +   +ID+GSD+VW QC+PC +C+ QS PVFDP+ S++++ + C
Sbjct: 98  GNGEFLMDMSIGTPAVAYAAIIDTGSDLVWTQCKPCVECFNQSTPVFDPSSSSTYAALPC 157

Query: 212 SSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGM 271
           SS +C  L ++ C + +C Y  +YGD S T+G LA ET T+ +T + +VA GCG  N+G 
Sbjct: 158 SSTLCSDLPSSKCTSAKCGYTYTYGDSSSTQGVLAAETFTLAKTKLPDVAFGCGDTNEGD 217

Query: 272 -FVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREA-------LP 323
            F   AGL+GLG G +SLV QLG      FSYCL S    S   L+ G  A         
Sbjct: 218 GFTQGAGLVGLGRGPLSLVSQLGLN---KFSYCLTSLDDTSKSPLLLGSLATISESAAAA 274

Query: 324 VGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRL 383
                 PL+RNP  PSFYYV L GL VG   I +    F +   G  GV++D+GT++T L
Sbjct: 275 SSVQTTPLIRNPSQPSFYYVNLKGLTVGSTHITLPSSAFAVQDDGTGGVIVDSGTSITYL 334

Query: 384 PTPAYEAFRDAFVAQTGNLPRASGVSI-FDTCYN--LSGFVSVRVPTVSFYFSGGPVLTL 440
               Y A + AF AQ   LP A G  I  DTC+    SG   V VP + F+  G   L L
Sbjct: 335 ELQGYRALKKAFAAQM-KLPAADGSGIGLDTCFEAPASGVDQVEVPKLVFHLDGAD-LDL 392

Query: 441 PASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           PA N+++    +G  C     S  GLSIIGN QQ+ IQ  +D     + F P  C
Sbjct: 393 PAENYMVLDSGSGALCLTVMGS-RGLSIIGNFQQQNIQFVYDVGENTLSFAPVQC 446


>gi|413943688|gb|AFW76337.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
 gi|413943689|gb|AFW76338.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
          Length = 499

 Score =  261 bits (668), Expect = 4e-67,   Method: Compositional matrix adjust.
 Identities = 151/355 (42%), Positives = 204/355 (57%), Gaps = 15/355 (4%)

Query: 148 GMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC-SQCYKQSDPVFDPADSASF 206
           G+  G+G Y V + +G+P     +V D+GSD  WVQCQPC + CY+Q +P+FDP  SA++
Sbjct: 153 GVALGTGNYVVPVRLGTPAERFTVVFDTGSDTTWVQCQPCVAYCYRQKEPLFDPTKSATY 212

Query: 207 SGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGH 266
           + +SCSS+ C  L  +GC  G C Y + YGDGSYT G  A +TLT+    +KN   GCG 
Sbjct: 213 ANISCSSSYCSDLYVSGCSGGHCLYGIQYGDGSYTIGFYAQDTLTLAYDTIKNFRFGCGE 272

Query: 267 KNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGA 326
           KN+G+F  AAGLLGLG G  SL  Q   + GG F+YCL +   G +G L  G  A    A
Sbjct: 273 KNRGLFGRAAGLLGLGRGKTSLPVQAYDKYGGVFAYCLPATSAG-TGFLDLGPGAPAANA 331

Query: 327 AWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTP 386
              P++ + R P+FYYVG++G+ VGG  +PI   +F        G ++D+GT +TRLP  
Sbjct: 332 RLTPMLVD-RGPTFYYVGMTGIKVGGHVLPIPGSVFSTA-----GTLVDSGTVITRLPPS 385

Query: 387 AYEAFRDAFVAQTGNL--PRASGVSIFDTCYNLSGFV--SVRVPTVSFYFSGGPVLTLPA 442
           AY   R AF      L    A   SI DTCY+L+G    S+ +P VS  F GG  L + A
Sbjct: 386 AYAPLRSAFSKAMQGLGYSAAPAFSILDTCYDLTGHKGGSIALPAVSLVFQGGACLDVDA 445

Query: 443 SNFLIPVDDAGTFCFAFAPSP--SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           S  L  V D    C AFAP+   + ++I+GN QQ+   + +D     VGF P  C
Sbjct: 446 SGILY-VADVSQACLAFAPNADDTDVAIVGNTQQKTHGVLYDIGKKIVGFAPGAC 499


>gi|224072755|ref|XP_002303865.1| predicted protein [Populus trichocarpa]
 gi|222841297|gb|EEE78844.1| predicted protein [Populus trichocarpa]
          Length = 412

 Score =  261 bits (668), Expect = 4e-67,   Method: Compositional matrix adjust.
 Identities = 153/431 (35%), Positives = 225/431 (52%), Gaps = 40/431 (9%)

Query: 83  LVHRDKMSSSSNTTNNMHYHRH-QHSFHAR-MQRDVKRVATLVRRLSGGGADAAKHEVQD 140
           + H+D  S      N     R    +F  R +Q  +K +      LSG   D+   ++  
Sbjct: 1   MKHKDSCSGKILDWNKKLQKRLIMDNFQLRSLQSRIKNII-----LSGNIDDSVDTQIP- 54

Query: 141 FGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDP 200
               + SG+   S  Y V + +G   R   +++D+GSD+ WVQCQPC++CY Q DPVF+P
Sbjct: 55  ----LTSGIRLQSLNYIVTVELGG--RKMTVIVDTGSDLSWVQCQPCNRCYNQQDPVFNP 108

Query: 201 ADSASFSGVSCSSAVCDRLENA-------GCHAGRCRYEVSYGDGSYTKGTLALETLTIG 253
           + S S+  V C+S  C  L+ A       G +   C Y V+YGDGSYT G + +E L +G
Sbjct: 109 SKSPSYRTVLCNSLTCRSLQLATGNSGVCGSNPPTCNYVVNYGDGSYTSGEVGMEHLNLG 168

Query: 254 RTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSG 313
            T V N   GCG KNQG+F GA+GL+GLG   +SL+ Q+    GG FSYCL +    +SG
Sbjct: 169 NTTVNNFIFGCGRKNQGLFGGASGLVGLGRTDLSLISQISPMFGGVFSYCLPTTEAEASG 228

Query: 314 SLVFG------REALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQM 367
           SLV G      +   P+  ++  ++ NP  P FY++ L+G+ VGG+ +       +    
Sbjct: 229 SLVMGGNSSVYKNTTPI--SYTRMIHNPLLP-FYFLNLTGITVGGVEV-------QAPSF 278

Query: 368 GDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPT 427
           G D +++D+GT ++RLP   Y+A +  FV Q    P A    I D+C+NLSG+  V++P 
Sbjct: 279 GKDRMIIDSGTVISRLPPSIYQALKAEFVKQFSGYPSAPSFMILDSCFNLSGYQEVKIPD 338

Query: 428 VSFYFSGGPVLTLPASNFLIPVD-DAGTFCFAFA--PSPSGLSIIGNIQQEGIQISFDGA 484
           +  YF G   L +  +     V  DA   C A A  P    + IIGN QQ+  +I +D  
Sbjct: 339 IKMYFEGSAELNVDVTGVFYSVKTDASQVCLAIASLPYEDEVGIIGNYQQKNQRIIYDTK 398

Query: 485 NGFVGFGPNVC 495
              +GF    C
Sbjct: 399 GSMLGFAEEAC 409


>gi|293332735|ref|NP_001168472.1| uncharacterized protein LOC100382248 [Zea mays]
 gi|223948487|gb|ACN28327.1| unknown [Zea mays]
          Length = 434

 Score =  261 bits (667), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 151/355 (42%), Positives = 204/355 (57%), Gaps = 15/355 (4%)

Query: 148 GMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC-SQCYKQSDPVFDPADSASF 206
           G+  G+G Y V + +G+P     +V D+GSD  WVQCQPC + CY+Q +P+FDP  SA++
Sbjct: 88  GVALGTGNYVVPVRLGTPAERFTVVFDTGSDTTWVQCQPCVAYCYRQKEPLFDPTKSATY 147

Query: 207 SGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGH 266
           + +SCSS+ C  L  +GC  G C Y + YGDGSYT G  A +TLT+    +KN   GCG 
Sbjct: 148 ANISCSSSYCSDLYVSGCSGGHCLYGIQYGDGSYTIGFYAQDTLTLAYDTIKNFRFGCGE 207

Query: 267 KNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGA 326
           KN+G+F  AAGLLGLG G  SL  Q   + GG F+YCL +   G +G L  G  A    A
Sbjct: 208 KNRGLFGRAAGLLGLGRGKTSLPVQAYDKYGGVFAYCLPATSAG-TGFLDLGPGAPAANA 266

Query: 327 AWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTP 386
              P++ + R P+FYYVG++G+ VGG  +PI   +F        G ++D+GT +TRLP  
Sbjct: 267 RLTPMLVD-RGPTFYYVGMTGIKVGGHVLPIPGSVFSTA-----GTLVDSGTVITRLPPS 320

Query: 387 AYEAFRDAFVAQTGNL--PRASGVSIFDTCYNLSGFV--SVRVPTVSFYFSGGPVLTLPA 442
           AY   R AF      L    A   SI DTCY+L+G    S+ +P VS  F GG  L + A
Sbjct: 321 AYAPLRSAFSKAMQGLGYSAAPAFSILDTCYDLTGHKGGSIALPAVSLVFQGGACLDVDA 380

Query: 443 SNFLIPVDDAGTFCFAFAPSP--SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           S  L  V D    C AFAP+   + ++I+GN QQ+   + +D     VGF P  C
Sbjct: 381 SGILY-VADVSQACLAFAPNADDTDVAIVGNTQQKTHGVLYDIGKKIVGFAPGAC 434


>gi|225438315|ref|XP_002272802.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 436

 Score =  261 bits (666), Expect = 8e-67,   Method: Compositional matrix adjust.
 Identities = 148/388 (38%), Positives = 207/388 (53%), Gaps = 17/388 (4%)

Query: 111 RMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQY 170
           R+QR +KR    ++RLS   A         F + V + +  G+GE+ +++ +G+P  +  
Sbjct: 60  RLQRAMKRGKLRLQRLSAKTAS--------FESSVEAPVHAGNGEFLMKLAIGTPAETYS 111

Query: 171 MVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCR 230
            ++D+GSD++W QC+PC  C+ Q  P+FDP  S+SFS + CSS +C  L  + C  G C 
Sbjct: 112 AIMDTGSDLIWTQCKPCKDCFDQPTPIFDPKKSSSFSKLPCSSDLCAALPISSCSDG-CE 170

Query: 231 YEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGM-FVGAAGLLGLGGGSMSLV 289
           Y  SYGD S T+G LA ET   G   V  +  GCG  N G  F   AGL+GLG G +SL+
Sbjct: 171 YLYSYGDYSSTQGVLATETFAFGDASVSKIGFGCGEDNDGSGFSQGAGLVGLGRGPLSLI 230

Query: 290 GQLGGQTGGAFSYCLVSRGTGSS-GSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGL 348
            QLG      FSYCL S        SL+ G EA    A   PL++NP  PSFYY+ L G+
Sbjct: 231 SQLGEP---KFSYCLTSMDDSKGISSLLVGSEATMKNAITTPLIQNPSQPSFYYLSLEGI 287

Query: 349 GVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGV 408
            VG   +PI +  F +   G  G+++D+GT +T L   A+ A +  F++Q       SG 
Sbjct: 288 SVGDTLLPIEKSTFSIQNDGSGGLIIDSGTTITYLEDSAFAALKKEFISQLKLDVDESGS 347

Query: 409 SIFDTCYNLSGFVS-VRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLS 467
           +  D C+ L    S V VP + F+F G   L LPA N++I     G  C     S SG+S
Sbjct: 348 TGLDLCFTLPPDASTVDVPQLVFHFEGAD-LKLPAENYIIADSGLGVICLTMG-SSSGMS 405

Query: 468 IIGNIQQEGIQISFDGANGFVGFGPNVC 495
           I GN QQ+ I +  D     + F P  C
Sbjct: 406 IFGNFQQQNIVVLHDLEKETISFAPAQC 433


>gi|356516413|ref|XP_003526889.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 453

 Score =  261 bits (666), Expect = 8e-67,   Method: Compositional matrix adjust.
 Identities = 153/394 (38%), Positives = 222/394 (56%), Gaps = 17/394 (4%)

Query: 111 RMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQY 170
           R+Q  +KR  + +++L+      A     D    + + +  G+GEY + + +G+PP S  
Sbjct: 65  RVQHGIKRGKSRLQKLNA--MVLAASSTPDSEDQLEAPIHAGNGEYLIELAIGTPPVSYP 122

Query: 171 MVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCR 230
            V+D+GSD++W QC+PC++CYKQ  P+FDP  S+SFS VSC S++C  L ++ C  G C 
Sbjct: 123 AVLDTGSDLIWTQCKPCTRCYKQPTPIFDPKKSSSFSKVSCGSSLCSALPSSTCSDG-CE 181

Query: 231 YEVSYGDGSYTKGTLALETLTIGRTV----VKNVAIGCGHKNQGM-FVGAAGLLGLGGGS 285
           Y  SYGD S T+G LA ET T G++     V N+  GCG  N+G  F  A+GL+GLG G 
Sbjct: 182 YVYSYGDYSMTQGVLATETFTFGKSKNKVSVHNIGFGCGEDNEGDGFEQASGLVGLGRGP 241

Query: 286 MSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWV---PLVRNPRAPSFYY 342
           +SLV QL  Q    FSYCL          L+ G       A  V   PL++NP  PSFYY
Sbjct: 242 LSLVSQLKEQ---RFSYCLTPIDDTKESVLLLGSLGKVKDAKEVVTTPLLKNPLQPSFYY 298

Query: 343 VGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNL 402
           + L  + VG  R+ I +  F +   G+ GV++D+GT +T +   AYEA +  F++QT   
Sbjct: 299 LSLEAISVGDTRLSIEKSTFEVGDDGNGGVIIDSGTTITYVQQKAYEALKKEFISQTKLA 358

Query: 403 PRASGVSIFDTCYNL-SGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAP 461
              +  +  D C++L SG   V +P + F+F GG  L LPA N++I   + G  C A   
Sbjct: 359 LDKTSSTGLDLCFSLPSGSTQVEIPKLVFHFKGGD-LELPAENYMIGDSNLGVACLAMGA 417

Query: 462 SPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           S SG+SI GN+QQ+ I ++ D     + F P  C
Sbjct: 418 S-SGMSIFGNVQQQNILVNHDLEKETISFVPTSC 450


>gi|413923784|gb|AFW63716.1| aspartic proteinase nepenthesin-2 [Zea mays]
          Length = 531

 Score =  260 bits (665), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 166/423 (39%), Positives = 230/423 (54%), Gaps = 30/423 (7%)

Query: 90  SSSSNTTNNMHYHRH----------QHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQ 139
           SSS+       +HRH            +    + RD  R A + R+ SGGG      +  
Sbjct: 122 SSSAGAATVPLHHRHGPCSPLPTKKMPTLEETLHRDQLRAAYIQRKFSGGGGAGGDVQRS 181

Query: 140 DFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFD 199
           D       G    + EY + +G+GSP  SQ M+ID+GSD+ WVQC+PCSQC+ Q+DP+FD
Sbjct: 182 DATVPTALGTSLNTLEYLITVGLGSPATSQTMLIDTGSDVSWVQCKPCSQCHSQADPLFD 241

Query: 200 PADSASFSGVSCSSAVCDRL--ENAGC-HAGRCRYEVSYGDGSYTKGTLALETLTIGRTV 256
           P+ S+++S  SC SA C +L  E  GC  + +C+Y V+YGDGS T GT + +TL +G + 
Sbjct: 242 PSSSSTYSPFSCGSADCAQLGQEGNGCSSSSQCQYIVTYGDGSSTTGTYSSDTLALGSSA 301

Query: 257 VKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLV 316
           V++   GC +   G      GL+GLGGG+ SLV Q  G  G AFSYCL    + SSG L 
Sbjct: 302 VRSFQFGCSNVESGFNDQTDGLMGLGGGAQSLVSQTAGTLGRAFSYCLPPTPS-SSGFLT 360

Query: 317 FGREALPVGAAWV--PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVM 374
            G       + +V  P++R+ + P+FY V L  + VGG ++ I   +F        G VM
Sbjct: 361 LGAAGGSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVF------SAGTVM 414

Query: 375 DTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSG 434
           D+GT +TRLP  AY A   AF A     P A    I DTC++ SG  SV +P+V+  FSG
Sbjct: 415 DSGTVITRLPPTAYSALSSAFKAGMKQYPPAQPSGILDTCFDFSGQSSVSIPSVALVFSG 474

Query: 435 GPVLTLPASNFLIPVDDAGTFCFAFA--PSPSGLSIIGNIQQEGIQISFDGANGFVGFGP 492
           G V++L AS  ++      + C AFA     S L IIGN+QQ   ++ +D   G VGF  
Sbjct: 475 GAVVSLDASGIIL------SNCLAFAGNSDDSSLGIIGNVQQRTFEVLYDVGRGVVGFRA 528

Query: 493 NVC 495
             C
Sbjct: 529 GAC 531


>gi|409179878|gb|AFV26024.1| aspartic proteinase nepenthesin 1 [Nepenthes mirabilis]
          Length = 437

 Score =  260 bits (664), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 148/412 (35%), Positives = 230/412 (55%), Gaps = 18/412 (4%)

Query: 91  SSSNTTNNMHYHRHQHSFHARMQ-----RDVKRVATLVRRLSGGGADAAKHEVQDFG-TD 144
           S+S T  N H+      F   ++     +++ +   L R +  G     + E    G + 
Sbjct: 24  STSRTALNHHHEPKVAGFQIMLEHVDSGKNLTKFELLERAVERGSRRLQRLEAMLNGPSG 83

Query: 145 VVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSA 204
           V + +  G GEY + + +G+P +    ++D+GSD++W QCQPC+QC+ QS P+F+P  S+
Sbjct: 84  VETPVYAGDGEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPCTQCFNQSTPIFNPQGSS 143

Query: 205 SFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGC 264
           SFS + CSS +C  L++  C    C+Y   YGDGS T+G++  ETLT G   + N+  GC
Sbjct: 144 SFSTLPCSSQLCQALQSPTCSNNSCQYTYGYGDGSETQGSMGTETLTFGSVSIPNITFGC 203

Query: 265 GHKNQGMFVG-AAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREA-- 321
           G  NQG   G  AGL+G+G G +SL  QL       FSYC+   G+ +S +L+ G  A  
Sbjct: 204 GENNQGFGQGNGAGLVGMGRGPLSLPSQLDVT---KFSYCMTPIGSSTSSTLLLGSLANS 260

Query: 322 LPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRL-TQMGDDGVVMDTGTAV 380
           +  G+    L+ + + P+FYY+ L+GL VG   +PI   +F+L +  G  G+++D+GT +
Sbjct: 261 VTAGSPNTTLIESSQIPTFYYITLNGLSVGSTPLPIDPSVFKLNSNNGTGGIIIDSGTTL 320

Query: 381 TRLPTPAYEAFRDAFVAQTGNLPRASGVSI-FDTCYNL-SGFVSVRVPTVSFYFSGGPVL 438
           T     AY+A R AF++Q  NL   +G S  FD C+ + S   ++++PT   +F GG  L
Sbjct: 321 TYFADNAYQAVRQAFISQM-NLSVVNGSSSGFDLCFQMPSDQSNLQIPTFVMHFDGGD-L 378

Query: 439 TLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGF 490
            LP+ N+ I   + G  C A   S  G+SI GNIQQ+ + + +D  N  V F
Sbjct: 379 VLPSENYFISPSN-GLICLAMGSSSQGMSIFGNIQQQNLLVVYDTGNSVVSF 429


>gi|194707866|gb|ACF88017.1| unknown [Zea mays]
          Length = 461

 Score =  259 bits (663), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 166/423 (39%), Positives = 230/423 (54%), Gaps = 30/423 (7%)

Query: 90  SSSSNTTNNMHYHRH----------QHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQ 139
           SSS+       +HRH            +    + RD  R A + R+ SGGG      +  
Sbjct: 52  SSSAGAATVPLHHRHGPCSPLPTKKMPTLEETLHRDQLRAAYIQRKFSGGGGAGGDVQRS 111

Query: 140 DFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFD 199
           D       G    + EY + +G+GSP  SQ M+ID+GSD+ WVQC+PCSQC+ Q+DP+FD
Sbjct: 112 DATVPTALGTSLNTLEYLITVGLGSPATSQTMLIDTGSDVSWVQCKPCSQCHSQADPLFD 171

Query: 200 PADSASFSGVSCSSAVCDRL--ENAGC-HAGRCRYEVSYGDGSYTKGTLALETLTIGRTV 256
           P+ S+++S  SC SA C +L  E  GC  + +C+Y V+YGDGS T GT + +TL +G + 
Sbjct: 172 PSSSSTYSPFSCGSADCAQLGQEGNGCSSSSQCQYIVTYGDGSSTTGTYSSDTLALGSSA 231

Query: 257 VKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLV 316
           V++   GC +   G      GL+GLGGG+ SLV Q  G  G AFSYCL    + SSG L 
Sbjct: 232 VRSFQFGCSNVESGFNDQTDGLMGLGGGAQSLVSQTAGTLGRAFSYCLPPTPS-SSGFLT 290

Query: 317 FGREALPVGAAWV--PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVM 374
            G       + +V  P++R+ + P+FY V L  + VGG ++ I   +F        G VM
Sbjct: 291 LGAAGGSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVFSA------GTVM 344

Query: 375 DTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSG 434
           D+GT +TRLP  AY A   AF A     P A    I DTC++ SG  SV +P+V+  FSG
Sbjct: 345 DSGTVITRLPPTAYSALSSAFKAGMKQYPPAQPSGILDTCFDFSGQSSVSIPSVALVFSG 404

Query: 435 GPVLTLPASNFLIPVDDAGTFCFAFA--PSPSGLSIIGNIQQEGIQISFDGANGFVGFGP 492
           G V++L AS  ++      + C AFA     S L IIGN+QQ   ++ +D   G VGF  
Sbjct: 405 GAVVSLDASGIIL------SNCLAFAGNSDDSSLGIIGNVQQRTFEVLYDVGRGVVGFRA 458

Query: 493 NVC 495
             C
Sbjct: 459 GAC 461


>gi|165292434|dbj|BAF98915.1| aspartic proteinase nepenthesin I [Nepenthes alata]
          Length = 437

 Score =  259 bits (663), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 148/412 (35%), Positives = 231/412 (56%), Gaps = 18/412 (4%)

Query: 91  SSSNTTNNMHYHRHQHSFHARMQ-----RDVKRVATLVRRLSGGGADAAKHEVQDFG-TD 144
           S+S T  N H+      F   ++     +++ +   L R +  G     + E    G + 
Sbjct: 24  STSRTALNHHHEPKVAGFQIMLEHVDSGKNLTKFELLERAVERGSRRLQRLEAMLNGPSG 83

Query: 145 VVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSA 204
           V + +  G GEY + + +G+P +    ++D+GSD++W QCQPC+QC+ QS P+F+P  S+
Sbjct: 84  VETPVYAGDGEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPCTQCFNQSTPIFNPQGSS 143

Query: 205 SFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGC 264
           SFS + CSS +C  L++  C    C+Y   YGDGS T+G++  ETLT G   + N+  GC
Sbjct: 144 SFSTLPCSSQLCQALQSPTCSNNSCQYTYGYGDGSETQGSMGTETLTFGSVSIPNITFGC 203

Query: 265 GHKNQGMFVG-AAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREA-- 321
           G  NQG   G  AGL+G+G G +SL  QL       FSYC+   G+ +S +L+ G  A  
Sbjct: 204 GENNQGFGQGNGAGLVGMGRGPLSLPSQLDVT---KFSYCMTPIGSSNSSTLLLGSLANS 260

Query: 322 LPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRL-TQMGDDGVVMDTGTAV 380
           +  G+    L+++ + P+FYY+ L+GL VG   +PI   +F+L +  G  G+++D+GT +
Sbjct: 261 VTAGSPNTTLIQSSQIPTFYYITLNGLSVGSTPLPIDPSVFKLNSNNGTGGIIIDSGTTL 320

Query: 381 TRLPTPAYEAFRDAFVAQTGNLPRASGVSI-FDTCYNL-SGFVSVRVPTVSFYFSGGPVL 438
           T     AY+A R AF++Q  NL   +G S  FD C+ + S   ++++PT   +F GG  L
Sbjct: 321 TYFVDNAYQAVRQAFISQM-NLSVVNGSSSGFDLCFQMPSDQSNLQIPTFVMHFDGGD-L 378

Query: 439 TLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGF 490
            LP+ N+ I   + G  C A   S  G+SI GNIQQ+ + + +D  N  V F
Sbjct: 379 VLPSENYFISPSN-GLICLAMGSSSQGMSIFGNIQQQNLLVVYDTGNSVVSF 429


>gi|45735840|dbj|BAD12875.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
 gi|45735966|dbj|BAD12995.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
 gi|125583491|gb|EAZ24422.1| hypothetical protein OsJ_08175 [Oryza sativa Japonica Group]
          Length = 475

 Score =  259 bits (663), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 163/427 (38%), Positives = 223/427 (52%), Gaps = 30/427 (7%)

Query: 81  LELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQD 140
           L L HR    + +   + +       SF   ++ D +R   + RR+SG  A A   ++  
Sbjct: 67  LRLTHRHGPCAPAGKASALG---SPPSFLDTLRADQRRAEYIQRRVSGAAAAAPGMQLAG 123

Query: 141 FGTDVVS---GMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQ--CYKQSD 195
                V    G   G+ +Y V + +G+P  +Q + +D+GSD+ WVQC+PC    CY Q D
Sbjct: 124 SKAATVPANLGFSIGTLQYVVTVSLGTPAVAQTLEVDTGSDVSWVQCKPCPSPPCYSQRD 183

Query: 196 PVFDPADSASFSGVSCSSAVCDRLE--NAGCHAGRCRYEVSYGDGSYTKGTLALETLTI- 252
           P+FDP  S+S+S V C++A C +L   + GC  G+C Y VSYGDGS T G  + +TLT+ 
Sbjct: 184 PLFDPTRSSSYSAVPCAAASCSQLALYSNGCSGGQCGYVVSYGDGSTTTGVYSSDTLTLT 243

Query: 253 GRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSS 312
           G   +K    GCGH  QG+F G  GLLGLG    SLV Q     GG FSYCL      S 
Sbjct: 244 GSNALKGFLFGCGHAQQGLFAGVDGLLGLGRQGQSLVSQASSTYGGVFSYCLPPT-QNSV 302

Query: 313 GSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGV 372
           G +  G  +   G +  PL+     P++Y V L+G+ VGG  + I   +F        G 
Sbjct: 303 GYISLGGPSSTAGFSTTPLLTASNDPTYYIVMLAGISVGGQPLSIDASVFA------SGA 356

Query: 373 VMDTGTAVTRLPTPAYEAFRDAFVAQTG--NLPRASGVSIFDTCYNLSGFVSVRVPTVSF 430
           V+DTGT VTRLP  AY A R AF A       P A    I DTCY+ + + +V +PT+S 
Sbjct: 357 VVDTGTVVTRLPPTAYSALRSAFRAAMAPYGYPSAPATGILDTCYDFTRYGTVTLPTISI 416

Query: 431 YFSGGPVLTLPASNFLIPVDDAGTFCFAFAPS--PSGLSIIGNIQQEGIQISFDGANGFV 488
            F GG  + L  S  L       + C AFAP+   S  SI+GN+QQ   ++ FDG+   V
Sbjct: 417 AFGGGAAMDLGTSGILT------SGCLAFAPTGGDSQASILGNVQQRSFEVRFDGST--V 468

Query: 489 GFGPNVC 495
           GF P  C
Sbjct: 469 GFMPASC 475


>gi|326512608|dbj|BAJ99659.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 484

 Score =  259 bits (663), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 159/422 (37%), Positives = 227/422 (53%), Gaps = 25/422 (5%)

Query: 91  SSSNTTNNMHYHRHQHSFHAR---------MQRDVKRVATLVRRLSGGGADAAKHEVQDF 141
           S+S+  N +H         AR         +  D  RV ++ R+++   +          
Sbjct: 70  SNSSALNVVHRQGPCSPLQARGAPPPHAELLNDDQARVDSIHRKIAAAASPVLDQARGKK 129

Query: 142 GTDVVS--GMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFD 199
           G  + +  G+  G+G Y V +G+G+P R   +V D+GSD+ WVQC PCS CY+Q DP+FD
Sbjct: 130 GVTLPAQRGISLGTGNYVVSMGLGTPARDMTVVFDTGSDLSWVQCTPCSDCYEQKDPLFD 189

Query: 200 PADSASFSGVSCSSAVCDRLENAGC-HAGRCRYEVSYGDGSYTKGTLALETLTIGRT-VV 257
           PA S+++S V C+S  C  L++  C    +CRYEV YGD S T G LA +TLT+ ++ V+
Sbjct: 190 PARSSTYSAVPCASPECQGLDSRSCSRDKKCRYEVVYGDQSQTDGALARDTLTLTQSDVL 249

Query: 258 KNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVF 317
                GCG ++ G+F  A GL+GLG   +SL  Q   + G  FSYCL S  + ++G L  
Sbjct: 250 PGFVFGCGEQDTGLFGRADGLVGLGREKVSLSSQAASKYGAGFSYCLPSSPS-AAGYLSL 308

Query: 318 GREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTG 377
           G  A P  A +  +     +PSFYYV L G+ V G  + +S  +F        G V+D+G
Sbjct: 309 GGPA-PANARFTAMETRHDSPSFYYVRLVGVKVAGRTVRVSPIVFSAA-----GTVIDSG 362

Query: 378 TAVTRLPTPAYEAFRDAFVAQTGN--LPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGG 435
           T +TRLP   Y A R AF    G     RA  +SI DTCY+ +G  +VR+P+V+  F+GG
Sbjct: 363 TVITRLPPRVYAALRSAFARSMGRYGYKRAPALSILDTCYDFTGHTTVRIPSVALVFAGG 422

Query: 436 PVLTLPASNFLIPVDDAGTFCFAFAPSPSGLS--IIGNIQQEGIQISFDGANGFVGFGPN 493
             + L  S  L  V      C AFAP+  G    IIGN QQ+ + + +D A   +GFG N
Sbjct: 423 AAVGLDFSGVLY-VAKVSQACLAFAPNGDGADAGIIGNTQQKTLAVVYDVARQKIGFGAN 481

Query: 494 VC 495
            C
Sbjct: 482 GC 483


>gi|242045120|ref|XP_002460431.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
 gi|241923808|gb|EER96952.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
          Length = 481

 Score =  259 bits (662), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 152/402 (37%), Positives = 219/402 (54%), Gaps = 23/402 (5%)

Query: 109 HAR-MQRDVKRVATLVRRLSG---GGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGS 164
           HA  + RD  RV ++ R  +      AD      +        G+  G+  Y V +G+G+
Sbjct: 87  HAEILDRDQDRVDSIHRLAAARPSSTADDPSSASKGVSLPARRGVPLGTANYIVSVGLGT 146

Query: 165 PPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGC 224
           P R   +V D+GSD+ WVQC+PC  CY+Q DP+FDP+ S ++S V C +  C RL++  C
Sbjct: 147 PKRDLLVVFDTGSDLSWVQCKPCDGCYQQHDPLFDPSQSTTYSAVPCGAQECRRLDSGSC 206

Query: 225 HAGRCRYEVSYGDGSYTKGTLALETLTIGRTV-------VKNVAIGCGHKNQGMFVGAAG 277
            +G+CRYEV YGD S T G LA +TLT+G +        ++    GCG  + G+F  A G
Sbjct: 207 SSGKCRYEVVYGDMSQTDGNLARDTLTLGPSSSSSSSDQLQEFVFGCGDDDTGLFGKADG 266

Query: 278 LLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRA 337
           L GLG   +SL  Q   + G  FSYCL S  T + G L  G  A P  A +  +V     
Sbjct: 267 LFGLGRDRVSLASQAAAKYGAGFSYCLPSSST-AEGYLSLG-SAAPPNARFTAMVTRSDT 324

Query: 338 PSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAF-- 395
           PSFYY+ L G+ V G  + +S  +FR       G V+D+GT +TRLP+ AY A R +F  
Sbjct: 325 PSFYYLNLVGIKVAGRTVRVSPAVFRTP-----GTVIDSGTVITRLPSRAYAALRSSFAG 379

Query: 396 VAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTF 455
           + +  +  RA  +SI DTCY+ +G   V++P+V+  F GG  L L     L  V +    
Sbjct: 380 LMRRYSYKRAPALSILDTCYDFTGRNKVQIPSVALLFDGGATLNLGFGEVLY-VANKSQA 438

Query: 456 CFAFAPS--PSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           C AFA +   + ++I+GN+QQ+   + +D AN  +GFG   C
Sbjct: 439 CLAFASNGDDTSIAILGNMQQKTFAVVYDVANQKIGFGAKGC 480


>gi|125540927|gb|EAY87322.1| hypothetical protein OsI_08726 [Oryza sativa Indica Group]
          Length = 464

 Score =  259 bits (661), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 169/457 (36%), Positives = 235/457 (51%), Gaps = 36/457 (7%)

Query: 51  HAKMSQYNELFERHNNISSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHA 110
           H ++  ++ L  R +  S  N +S      L L HR    + +   + +       SF  
Sbjct: 32  HIQLRDWDSL--RVSAASPRNGTSAV----LRLTHRHGPCAPAGKASALG---SPPSFLD 82

Query: 111 RMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVS---GMDQGSGEYFVRIGVGSPPR 167
            ++ D +R   + RR+SG  A A   ++       V    G   G+ +Y V + +G+P  
Sbjct: 83  TLRADQRRAEYIQRRVSGAAAAAPGMQLAGSKAATVPANLGFSIGTLQYVVTVSLGTPAV 142

Query: 168 SQYMVIDSGSDIVWVQCQPCSQ--CYKQSDPVFDPADSASFSGVSCSSAVCDRLE--NAG 223
           +Q + +D+GSD+ WVQC+PC    CY Q DP+FDP  S+S+S V C++A C +L   + G
Sbjct: 143 AQTLEVDTGSDVSWVQCKPCPSPPCYSQRDPLFDPTRSSSYSAVPCAAASCSQLALYSNG 202

Query: 224 CHAGRCRYEVSYGDGSYTKGTLALETLTI-GRTVVKNVAIGCGHKNQGMFVGAAGLLGLG 282
           C  G+C Y VSYGDGS T G  + +TLT+ G   +K    GCGH  QG+F G  GLLGLG
Sbjct: 203 CSGGQCGYVVSYGDGSTTTGVYSSDTLTLTGSNALKGFLFGCGHAQQGLFAGVDGLLGLG 262

Query: 283 GGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYY 342
               SLV Q     GG FSYCL      S G +  G  +   G +  PL+     P++Y 
Sbjct: 263 RQGQSLVSQASSTYGGVFSYCLPPT-QNSVGYISLGGPSSTAGFSTTPLLTASNDPTYYI 321

Query: 343 VGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTG-- 400
           V L+G+ VGG  + I   +F        G V+DTGT VTRLP  AY A R AF A     
Sbjct: 322 VMLAGISVGGQPLSIDASVFA------SGAVVDTGTVVTRLPPTAYSALRSAFRAAMAPY 375

Query: 401 NLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFA 460
             P A    I DTCY+ + + +V +PT+S  F GG  + L  S  L       + C AFA
Sbjct: 376 GYPSAPATGILDTCYDFTRYGTVTLPTISIAFGGGAAMDLGTSGILT------SGCLAFA 429

Query: 461 PS--PSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           P+   S  SI+GN+QQ   ++ FDG+   VGF P  C
Sbjct: 430 PTGGDSQASILGNVQQRSFEVRFDGST--VGFMPASC 464


>gi|414885970|tpg|DAA61984.1| TPA: hypothetical protein ZEAMMB73_915310 [Zea mays]
          Length = 523

 Score =  258 bits (660), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 148/392 (37%), Positives = 213/392 (54%), Gaps = 14/392 (3%)

Query: 109 HAR-MQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPR 167
           HA  + RD  RV ++ R  +G          +        G+  G+  Y V +G+G+P R
Sbjct: 140 HAEILDRDQDRVDSIHRMTAGPWTAGQSSASKGVSLPAHRGLRLGTANYIVSVGLGTPRR 199

Query: 168 SQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAG 227
              +V D+GSD+ WVQC+PC+ CYKQ DP+FDP+ S ++S V C +  C  L++  C +G
Sbjct: 200 DLLVVFDTGSDLSWVQCKPCNNCYKQHDPLFDPSQSTTYSAVPCGAQEC--LDSGTCSSG 257

Query: 228 RCRYEVSYGDGSYTKGTLALETLTIGRTV--VKNVAIGCGHKNQGMFVGAAGLLGLGGGS 285
           +CRYEV YGD S T G LA +TLT+G +   ++    GCG  + G+F  A GL GLG   
Sbjct: 258 KCRYEVVYGDMSQTDGNLARDTLTLGPSSDQLQGFVFGCGDDDTGLFGRADGLFGLGRDR 317

Query: 286 MSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGL 345
           +SL  Q   + G  FSYCL S    + G L  G  A P  A +  +V     PSFYY+ L
Sbjct: 318 VSLASQAAARYGAGFSYCLPSSWR-AEGYLSLGSAAAPPHAQFTAMVTRSDTPSFYYLDL 376

Query: 346 SGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRA 405
            G+ V G  + ++  +F+       G V+D+GT +TRLP+ AY A R +F        RA
Sbjct: 377 VGIKVAGRTVRVAPAVFKAP-----GTVIDSGTVITRLPSRAYSALRSSFAGFMRRYKRA 431

Query: 406 SGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPS--P 463
             +SI DTCY+ +G   V++P+V+  F GG  L L     L  V +    C AFA +   
Sbjct: 432 PALSILDTCYDFTGRTKVQIPSVALLFDGGATLNLGFGGVLY-VANRSQACLAFASNGDD 490

Query: 464 SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           + + I+GN+QQ+   + +D AN  +GFG   C
Sbjct: 491 TSVGILGNMQQKTFAVVYDLANQKIGFGAKGC 522


>gi|223950123|gb|ACN29145.1| unknown [Zea mays]
 gi|413923785|gb|AFW63717.1| hypothetical protein ZEAMMB73_445506 [Zea mays]
          Length = 385

 Score =  257 bits (657), Expect = 8e-66,   Method: Compositional matrix adjust.
 Identities = 160/396 (40%), Positives = 222/396 (56%), Gaps = 20/396 (5%)

Query: 107 SFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPP 166
           +    + RD  R A + R+ SGGG      +  D       G    + EY + +G+GSP 
Sbjct: 3   TLEETLHRDQLRAAYIQRKFSGGGGAGGDVQRSDATVPTALGTSLNTLEYLITVGLGSPA 62

Query: 167 RSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRL--ENAGC 224
            SQ M+ID+GSD+ WVQC+PCSQC+ Q+DP+FDP+ S+++S  SC SA C +L  E  GC
Sbjct: 63  TSQTMLIDTGSDVSWVQCKPCSQCHSQADPLFDPSSSSTYSPFSCGSADCAQLGQEGNGC 122

Query: 225 -HAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGG 283
             + +C+Y V+YGDGS T GT + +TL +G + V++   GC +   G      GL+GLGG
Sbjct: 123 SSSSQCQYIVTYGDGSSTTGTYSSDTLALGSSAVRSFQFGCSNVESGFNDQTDGLMGLGG 182

Query: 284 GSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWV--PLVRNPRAPSFY 341
           G+ SLV Q  G  G AFSYCL    + SSG L  G       + +V  P++R+ + P+FY
Sbjct: 183 GAQSLVSQTAGTLGRAFSYCLPPTPS-SSGFLTLGAAGGSGTSGFVKTPMLRSSQVPTFY 241

Query: 342 YVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGN 401
            V L  + VGG ++ I   +F        G VMD+GT +TRLP  AY A   AF A    
Sbjct: 242 GVRLQAIRVGGRQLSIPASVFSA------GTVMDSGTVITRLPPTAYSALSSAFKAGMKQ 295

Query: 402 LPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFA- 460
            P A    I DTC++ SG  SV +P+V+  FSGG V++L AS  ++      + C AFA 
Sbjct: 296 YPPAQPSGILDTCFDFSGQSSVSIPSVALVFSGGAVVSLDASGIIL------SNCLAFAG 349

Query: 461 -PSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
               S L IIGN+QQ   ++ +D   G VGF    C
Sbjct: 350 NSDDSSLGIIGNVQQRTFEVLYDVGRGVVGFRAGAC 385


>gi|255543383|ref|XP_002512754.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223547765|gb|EEF49257.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 414

 Score =  257 bits (657), Expect = 8e-66,   Method: Compositional matrix adjust.
 Identities = 147/433 (33%), Positives = 224/433 (51%), Gaps = 42/433 (9%)

Query: 83  LVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRL----SGGGADAAKHEV 138
           + HRD  +SS  +T+              +  D  RV +L  R+    SG   DA   ++
Sbjct: 1   MKHRDFCNSSGKSTD------WNKKLQKSLILDDFRVRSLQSRIKSIFSGNNIDALDSQI 54

Query: 139 QDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVF 198
                 + SG+   +  Y V + +G   R+  +++D+GSD+ WVQCQPC  CY Q DP+F
Sbjct: 55  P-----LSSGVRLQTLNYIVTVEIGG--RNMTVIVDTGSDLTWVQCQPCRLCYNQQDPLF 107

Query: 199 DPADSASFSGVSCSSAVCDRLENA-------GCHAGRCRYEVSYGDGSYTKGTLALETLT 251
           +P+ S S+  + C+S+ C  L+ A       G +   C Y V+YGDGSYT+G L +E L 
Sbjct: 108 NPSGSPSYQTILCNSSTCQSLQYATGNLGVCGSNTPTCNYVVNYGDGSYTRGDLGMEQLN 167

Query: 252 IGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGS 311
           +G T V N   GCG  N+G+F GA+GL+GLG   +SLV Q      G FSYCL +    +
Sbjct: 168 LGTTHVSNFIFGCGRNNKGLFGGASGLMGLGKSDLSLVSQTSAIFEGVFSYCLPTTAADA 227

Query: 312 SGSLVFG------REALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLT 365
           SGSL+ G      +   P+  ++  ++ NP+ P+FY++ L+G+ +GG+ +       +  
Sbjct: 228 SGSLILGGNSSVYKNTTPI--SYTRMIANPQLPTFYFLNLTGISIGGVAL-------QAP 278

Query: 366 QMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRV 425
                G+++D+GT +TRLP P Y   +  F+ Q    P A   SI DTC+NL+G+  V +
Sbjct: 279 NYRQSGILIDSGTVITRLPPPVYRDLKAEFLKQFSGFPSAPPFSILDTCFNLNGYDEVDI 338

Query: 426 PTVSFYFSGGPVLTLPASN-FLIPVDDAGTFCFAFAPSP--SGLSIIGNIQQEGIQISFD 482
           PT+   F G   LT+  +  F     DA   C A A       + IIGN QQ   ++ ++
Sbjct: 339 PTIRMQFEGNAELTVDVTGIFYFVKTDASQVCLALASLSFDDEIPIIGNYQQRNQRVIYN 398

Query: 483 GANGFVGFGPNVC 495
                +GF    C
Sbjct: 399 TKESKLGFAAEAC 411


>gi|242089069|ref|XP_002440367.1| hypothetical protein SORBIDRAFT_09g030430 [Sorghum bicolor]
 gi|241945652|gb|EES18797.1| hypothetical protein SORBIDRAFT_09g030430 [Sorghum bicolor]
          Length = 462

 Score =  257 bits (656), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 176/426 (41%), Positives = 226/426 (53%), Gaps = 58/426 (13%)

Query: 83  LVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATL------VRRLSGGGADAAKH 136
           L HR+  ++ + T   +  HR        + RD  R   +      V R  GG       
Sbjct: 82  LAHREAFAAPNATAAQLLAHR--------LARDAARAEAISVSARNVTRAGGG------- 126

Query: 137 EVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDP 196
               F   VVSG+ QGSGEYF  +GVG+PP    +V+D+GSD+VW+QC PC QCY QS  
Sbjct: 127 ----FSAPVVSGLAQGSGEYFASVGVGTPPTPALLVLDTGSDVVWLQCAPCRQCYAQSGR 182

Query: 197 VFDPADSASFSGVSCSSAVCDRLENAGCHAGR-----CRYEVSYGDGSYTKGTLALETLT 251
           VFDP  S S++ V C +  C  L+  G          C Y+V+YGDGS T G LA ETL 
Sbjct: 183 VFDPRRSRSYAAVRCGAPPCRGLDAGGGGGCDRRRGTCLYQVAYGDGSVTAGDLATETLW 242

Query: 252 IGR-TVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTG 310
             R   V  VA+GCGH N+G+FV AAGLLGLG G +SL  Q   + G  FSYC   +G+ 
Sbjct: 243 FARGARVPRVAVGCGHDNEGLFVAAAGLLGLGRGRLSLPTQTARRYGRRFSYCF--QGSD 300

Query: 311 SSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDD 370
                +       VG A V                 G+G   +R+  S         G  
Sbjct: 301 LDHRTIIRTVHQHVGGARV----------------RGVGERSLRLDPS--------TGRG 336

Query: 371 GVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRAS-GVSIFDTCYNLSGFVSVRVPTVS 429
           GV++D+GT+VTRL  P Y A R+AF A  G L  A  G S+FDTCY+L G   V+VPTVS
Sbjct: 337 GVILDSGTSVTRLARPVYVAVREAFRAAAGGLRLAPGGFSLFDTCYDLRGRRVVKVPTVS 396

Query: 430 FYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVG 489
            + +GG  + LP  N+LIPVD  GTFC A A +  G+SI+GNIQQ+G ++ FDG    V 
Sbjct: 397 VHLAGGAEVALPPENYLIPVDTRGTFCLALAGTDGGVSIVGNIQQQGFRVVFDGDRQRVA 456

Query: 490 FGPNVC 495
             P  C
Sbjct: 457 LVPKSC 462


>gi|242066140|ref|XP_002454359.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
 gi|241934190|gb|EES07335.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
          Length = 460

 Score =  257 bits (656), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 166/425 (39%), Positives = 224/425 (52%), Gaps = 35/425 (8%)

Query: 90  SSSSNTTNNMHYHRH----------QHSFHARMQRDVKRVATLVRRLSGG----GADAAK 135
           SSS  TT  +H HRH            S   R+ RD  R A + R+ SG     G  A  
Sbjct: 52  SSSGATTVPLH-HRHGPCSPLPTKKMPSLEDRLHRDQLRAAYIKRKFSGDVKKDGQGAGG 110

Query: 136 HEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD 195
            E          G    + EY + + +GSP ++Q ++IDSGSD+ WVQC+PC QC+ Q D
Sbjct: 111 VEQSHVTVPTTLGTSLNTLEYLITVRLGSPAKTQTVLIDSGSDVSWVQCKPCLQCHSQVD 170

Query: 196 PVFDPADSASFSGVSCSSAVCDRL--ENAGC-HAGRCRYEVSYGDGSYTKGTLALETLTI 252
           P+FDP+ S+++S  SCSSA C +L  +  GC  + +C+Y V Y DGS T GT + +TL +
Sbjct: 171 PLFDPSLSSTYSPFSCSSAACAQLGQDGNGCSSSSQCQYIVRYADGSSTTGTYSSDTLAL 230

Query: 253 GRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSS 312
           G   + N   GC H   G      GL+GLGGG+ SL  Q  G  G AFSYCL    + SS
Sbjct: 231 GSNTISNFQFGCSHVESGFNDLTDGLMGLGGGAPSLASQTAGTFGTAFSYCLPPTPS-SS 289

Query: 313 GSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGV 372
           G L  G  A   G    P++R+   P+FY V L  + VGG ++ I   +F        G+
Sbjct: 290 GFLTLG--AGTSGFVKTPMLRSSPVPTFYGVRLEAIRVGGTQLSIPTSVF------SAGM 341

Query: 373 VMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYF 432
           VMD+GT +TRLP  AY A   AF A       A   SI DTC++ SG  SVR+P+V+  F
Sbjct: 342 VMDSGTIITRLPRTAYSALSSAFKAGMKQYRPAPPRSIMDTCFDFSGQSSVRLPSVALVF 401

Query: 433 SGGPVLTLPASNFLIPVDDAGTFCFAFAPSP--SGLSIIGNIQQEGIQISFDGANGFVGF 490
           SGG V+ L A+  ++        C AFA +   S   I+GN+QQ   ++ +D   G VGF
Sbjct: 402 SGGAVVNLDANGIIL------GNCLAFAANSDDSSPGIVGNVQQRTFEVLYDVGGGAVGF 455

Query: 491 GPNVC 495
               C
Sbjct: 456 KAGAC 460


>gi|356526294|ref|XP_003531753.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 414

 Score =  257 bits (656), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 153/424 (36%), Positives = 231/424 (54%), Gaps = 46/424 (10%)

Query: 100 HYHRHQHSFHARMQRD-------VKRVATLVRRLSGGGADAAKHEVQDFGTDV--VSGMD 150
           H    +  ++ R+Q+        V+ +   +RR+      A+ H V+   T +   SG++
Sbjct: 6   HCSEKKIDWNRRLQKQLILDDLRVRSMQNRIRRV------ASTHNVEASQTQIPLSSGIN 59

Query: 151 QGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVS 210
             +  Y V +G+GS  ++  ++ID+GSD+ WVQC+PC  CY Q  P+F P+ S+S+  VS
Sbjct: 60  LQTLNYIVTMGLGS--KNMTVIIDTGSDLTWVQCEPCMSCYNQQGPIFKPSTSSSYQSVS 117

Query: 211 CSSAVCDRLENAGCHAG--------RCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAI 262
           C+S+ C  L+ A  + G         C Y V+YGDGSYT G L +E L+ G   V +   
Sbjct: 118 CNSSTCQSLQFATGNTGACGSSNPSTCNYVVNYGDGSYTNGELGVEALSFGGVSVSDFVF 177

Query: 263 GCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGRE-- 320
           GCG  N+G+F G +GL+GLG   +SLV Q     GG FSYCL +   GSSGSLV G E  
Sbjct: 178 GCGRNNKGLFGGVSGLMGLGRSYLSLVSQTNATFGGVFSYCLPTTEAGSSGSLVMGNESS 237

Query: 321 ----ALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGG--MRIPISEDLFRLTQMGDDGVVM 374
               A P+   +  ++ NP+  +FY + L+G+ VGG  ++ P+S         G+ G+++
Sbjct: 238 VFKNANPI--TYTRMLSNPQLSNFYILNLTGIDVGGVALKAPLS--------FGNGGILI 287

Query: 375 DTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSG 434
           D+GT +TRLP+  Y+A +  F+ +    P A G SI DTC+NL+G+  V +PT+S  F G
Sbjct: 288 DSGTVITRLPSSVYKALKAEFLKKFTGFPSAPGFSILDTCFNLTGYDEVSIPTISLRFEG 347

Query: 435 GPVLTLPAS-NFLIPVDDAGTFCFAFAPSPSGL--SIIGNIQQEGIQISFDGANGFVGFG 491
              L + A+  F +  +DA   C A A        +IIGN QQ   ++ +D     VGF 
Sbjct: 348 NAQLNVDATGTFYVVKEDASQVCLALASLSDAYDTAIIGNYQQRNQRVIYDTKQSKVGFA 407

Query: 492 PNVC 495
              C
Sbjct: 408 EEPC 411


>gi|225216879|gb|ACN85177.1| aspartic proteinase nepenthesin-1 precursor [Oryza nivara]
 gi|225216896|gb|ACN85193.1| aspartic proteinase nepenthesin-1 precursor [Oryza rufipogon]
          Length = 515

 Score =  256 bits (655), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 149/354 (42%), Positives = 205/354 (57%), Gaps = 15/354 (4%)

Query: 148 GMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS-QCYKQSDPVFDPADSASF 206
           G   G+G Y V +G+G+P     +V D+GSD  WVQCQPC   CY+Q + +FDPA S+++
Sbjct: 171 GRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVACYEQREKLFDPASSSTY 230

Query: 207 SGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGR-TVVKNVAIGCG 265
           + VSC++  C  L+ +GC  G C Y V YGDGSY+ G  A++TLT+     VK    GCG
Sbjct: 231 ANVSCAAPACSDLDVSGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYDAVKGFRFGCG 290

Query: 266 HKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVG 325
            +N G+F  AAGLLGLG G  SL  Q  G+ GG F++CL +R TG +G L FG  + P  
Sbjct: 291 ERNDGLFGEAAGLLGLGRGKTSLPVQTYGKYGGVFAHCLPARSTG-TGYLDFGAGSPPAT 349

Query: 326 AAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPT 385
                L  N   P+FYYVG++G+ VGG  +PI+  +F        G ++D+GT +TRLP 
Sbjct: 350 TTTPMLTGN--GPTFYYVGMTGIRVGGRLLPIAPSVFAAA-----GTIVDSGTVITRLPP 402

Query: 386 PAYEAFRDAFVAQTG--NLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPAS 443
            AY + R AF A        +A+ VS+ DTCY+ +G   V +PTVS  F GG  L + AS
Sbjct: 403 AAYSSLRSAFAAAMAARGYRKAAAVSLLDTCYDFTGMSQVAIPTVSLLFQGGAALDVDAS 462

Query: 444 NFLIPVDDAGTFCFAFAPSPSG--LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
             +  V  A   C AFA +  G  + I+GN Q +   +++D     VGF P  C
Sbjct: 463 GIMYTV-SASQVCLAFAGNEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 515


>gi|61214233|sp|Q766C3.1|NEP1_NEPGR RecName: Full=Aspartic proteinase nepenthesin-1; AltName:
           Full=Nepenthesin-I; Flags: Precursor
 gi|41016421|dbj|BAD07474.1| aspartic proteinase nepenthesin I [Nepenthes gracilis]
          Length = 437

 Score =  256 bits (655), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 146/409 (35%), Positives = 223/409 (54%), Gaps = 21/409 (5%)

Query: 102 HRHQHSF--------HARMQRDVKRVATLVRRLSGGGADAAKHEVQDFG-TDVVSGMDQG 152
           HRH+           H    +++ +   L R +  G     + E    G + V + +  G
Sbjct: 32  HRHEAKVTGFQIMLEHVDSGKNLTKFQLLERAIERGSRRLQRLEAMLNGPSGVETSVYAG 91

Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
            GEY + + +G+P +    ++D+GSD++W QCQPC+QC+ QS P+F+P  S+SFS + CS
Sbjct: 92  DGEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPCTQCFNQSTPIFNPQGSSSFSTLPCS 151

Query: 213 SAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMF 272
           S +C  L +  C    C+Y   YGDGS T+G++  ETLT G   + N+  GCG  NQG  
Sbjct: 152 SQLCQALSSPTCSNNFCQYTYGYGDGSETQGSMGTETLTFGSVSIPNITFGCGENNQGFG 211

Query: 273 VG-AAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREA--LPVGAAWV 329
            G  AGL+G+G G +SL  QL       FSYC+   G+ +  +L+ G  A  +  G+   
Sbjct: 212 QGNGAGLVGMGRGPLSLPSQLDVT---KFSYCMTPIGSSTPSNLLLGSLANSVTAGSPNT 268

Query: 330 PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRL-TQMGDDGVVMDTGTAVTRLPTPAY 388
            L+++ + P+FYY+ L+GL VG  R+PI    F L +  G  G+++D+GT +T     AY
Sbjct: 269 TLIQSSQIPTFYYITLNGLSVGSTRLPIDPSAFALNSNNGTGGIIIDSGTTLTYFVNNAY 328

Query: 389 EAFRDAFVAQTGNLPRASGVSI-FDTCYNL-SGFVSVRVPTVSFYFSGGPVLTLPASNFL 446
           ++ R  F++Q  NLP  +G S  FD C+   S   ++++PT   +F GG  L LP+ N+ 
Sbjct: 329 QSVRQEFISQI-NLPVVNGSSSGFDLCFQTPSDPSNLQIPTFVMHFDGGD-LELPSENYF 386

Query: 447 IPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           I   + G  C A   S  G+SI GNIQQ+ + + +D  N  V F    C
Sbjct: 387 ISPSN-GLICLAMGSSSQGMSIFGNIQQQNMLVVYDTGNSVVSFASAQC 434


>gi|115468912|ref|NP_001058055.1| Os06g0610800 [Oryza sativa Japonica Group]
 gi|51091964|dbj|BAD35493.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
           sativa Japonica Group]
 gi|113596095|dbj|BAF19969.1| Os06g0610800 [Oryza sativa Japonica Group]
 gi|218198535|gb|EEC80962.1| hypothetical protein OsI_23679 [Oryza sativa Indica Group]
          Length = 519

 Score =  256 bits (654), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 149/354 (42%), Positives = 205/354 (57%), Gaps = 15/354 (4%)

Query: 148 GMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS-QCYKQSDPVFDPADSASF 206
           G   G+G Y V +G+G+P     +V D+GSD  WVQCQPC   CY+Q + +FDPA S+++
Sbjct: 175 GRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVACYEQREKLFDPASSSTY 234

Query: 207 SGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGR-TVVKNVAIGCG 265
           + VSC++  C  L+ +GC  G C Y V YGDGSY+ G  A++TLT+     VK    GCG
Sbjct: 235 ANVSCAAPACSDLDVSGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYDAVKGFRFGCG 294

Query: 266 HKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVG 325
            +N G+F  AAGLLGLG G  SL  Q  G+ GG F++CL +R TG +G L FG  + P  
Sbjct: 295 ERNDGLFGEAAGLLGLGRGKTSLPVQTYGKYGGVFAHCLPARSTG-TGYLDFGAGSPPAT 353

Query: 326 AAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPT 385
                L  N   P+FYYVG++G+ VGG  +PI+  +F        G ++D+GT +TRLP 
Sbjct: 354 TTTPMLTGN--GPTFYYVGMTGIRVGGRLLPIAPSVFAAA-----GTIVDSGTVITRLPP 406

Query: 386 PAYEAFRDAFVAQTG--NLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPAS 443
            AY + R AF A        +A+ VS+ DTCY+ +G   V +PTVS  F GG  L + AS
Sbjct: 407 AAYSSLRSAFAAAMAARGYRKAAAVSLLDTCYDFTGMSQVAIPTVSLLFQGGAALDVDAS 466

Query: 444 NFLIPVDDAGTFCFAFAPSPSG--LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
             +  V  A   C AFA +  G  + I+GN Q +   +++D     VGF P  C
Sbjct: 467 GIMYTV-SASQVCLAFAGNEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 519


>gi|225216988|gb|ACN85278.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
          Length = 518

 Score =  256 bits (653), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 144/356 (40%), Positives = 202/356 (56%), Gaps = 14/356 (3%)

Query: 147 SGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQ-CYKQSDPVFDPADSAS 205
           SG   G+G Y V +G+G+P     +V D+GSD  WVQCQPC   CY+Q + +FDPA S++
Sbjct: 170 SGRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQQEKLFDPARSST 229

Query: 206 FSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGR-TVVKNVAIGC 264
           ++ VSC++  C  L+  GC  G C Y V YGDGSY+ G  A++TLT+     VK    GC
Sbjct: 230 YANVSCAAPACFDLDTRGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYDAVKGFRFGC 289

Query: 265 GHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGR-EALP 323
           G +N+G+F  AAGLLGLG G  SL  Q   + GG F++CL +R +G +G L FG      
Sbjct: 290 GERNEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARSSG-TGYLDFGPGSPAA 348

Query: 324 VGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRL 383
            GA     +     P+FYYVG++G+ VGG  + I + +F        G ++D+GT +TRL
Sbjct: 349 AGARLTTPMLTDNGPTFYYVGMTGIRVGGQLLSIPQSVFATA-----GTIVDSGTVITRL 403

Query: 384 PTPAYEAFRDAFVAQTG--NLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLP 441
           P PAY + R AFV+        +A  VS+ DTCY+ +G   V +PTVS  F GG +L + 
Sbjct: 404 PPPAYSSLRSAFVSAMAARGYKKAPAVSLLDTCYDFTGMSQVAIPTVSLLFQGGAILDVD 463

Query: 442 ASNFLIPVDDAGTFCFAFAPSPSG--LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           AS  +         C  FA +  G  + I+GN Q +   +++D     VGF P  C
Sbjct: 464 ASGIMYAA-SVSQVCLGFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 518


>gi|225216914|gb|ACN85210.1| aspartic proteinase nepenthesin-1 precursor [Oryza glaberrima]
          Length = 516

 Score =  255 bits (652), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 149/354 (42%), Positives = 204/354 (57%), Gaps = 15/354 (4%)

Query: 148 GMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS-QCYKQSDPVFDPADSASF 206
           G   G+G Y V +G+G+P     +V D+GSD  WVQCQPC   CY+Q + +FDPA S+++
Sbjct: 172 GRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVACYEQREKLFDPASSSTY 231

Query: 207 SGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGR-TVVKNVAIGCG 265
           + VSC++  C  L+ +GC  G C Y V YGDGSY+ G  A++TLT+     VK    GCG
Sbjct: 232 ANVSCAAPACSDLDVSGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYDAVKGFRFGCG 291

Query: 266 HKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVG 325
            +N G+F  AAGLLGLG G  SL  Q  G+ GG F++CL  R TG +G L FG  + P  
Sbjct: 292 ERNDGLFGEAAGLLGLGRGKTSLPVQTYGKYGGVFAHCLPPRSTG-TGYLDFGAGSPPAT 350

Query: 326 AAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPT 385
                L  N   P+FYYVG++G+ VGG  +PI+  +F        G ++D+GT +TRLP 
Sbjct: 351 TTTPMLTGN--GPTFYYVGMTGIRVGGRLLPIAPSVFAAA-----GTIVDSGTVITRLPP 403

Query: 386 PAYEAFRDAFVAQTG--NLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPAS 443
            AY + R AF A        +A+ VS+ DTCY+ +G   V +PTVS  F GG  L + AS
Sbjct: 404 AAYSSLRSAFAAAMAARGYRKAAAVSLLDTCYDFTGMSQVAIPTVSLLFQGGAALDVDAS 463

Query: 444 NFLIPVDDAGTFCFAFAPSPSG--LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
             +  V  A   C AFA +  G  + I+GN Q +   +++D     VGF P  C
Sbjct: 464 GIMYTV-SASQVCLAFAGNEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 516


>gi|356557203|ref|XP_003546907.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 470

 Score =  254 bits (650), Expect = 5e-65,   Method: Compositional matrix adjust.
 Identities = 152/431 (35%), Positives = 228/431 (52%), Gaps = 35/431 (8%)

Query: 81  LELVHRDKMSSSSNTTNNMHYHRHQHSFHAR-MQRDVKRVATLVRRLSGGGADAAKHEVQ 139
           LE+  R + S S    + +         H R +Q  +++     R  S   AD+++ +V 
Sbjct: 56  LEMKDRGECSESERKGDWVEKQLVLDGLHVRSIQNHIRK-----RTSSSQIADSSETQV- 109

Query: 140 DFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFD 199
                + SG+   +  Y V +G+GS   S  +++D+GSD+ WVQC+PC  CY Q+ P+F 
Sbjct: 110 ----PLTSGIKFQTLNYIVTMGLGSQNMS--VIVDTGSDLTWVQCEPCRSCYNQNGPLFK 163

Query: 200 PADSASFSGVSCSSAVCDRLENAGC-----HAGRCRYEVSYGDGSYTKGTLALETLTIGR 254
           P+ S S+  + C+S  C  LE   C      +  C Y V+YGDGSYT G L +E L  G 
Sbjct: 164 PSTSPSYQPILCNSTTCQSLELGACGSDPSTSATCDYVVNYGDGSYTSGELGIEKLGFGG 223

Query: 255 TVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRG-TGSSG 313
             V N   GCG  N+G+F GA+GL+GLG   +S++ Q     GG FSYCL S    G+SG
Sbjct: 224 ISVSNFVFGCGRNNKGLFGGASGLMGLGRSELSMISQTNATFGGVFSYCLPSTDQAGASG 283

Query: 314 SLVFGREA------LPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQM 367
           SLV G ++       P+  A+  ++ N +  +FY + L+G+ VGG+ + +    F     
Sbjct: 284 SLVMGNQSGVFKNVTPI--AYTRMLPNLQLSNFYILNLTGIDVGGVSLHVQASSF----- 336

Query: 368 GDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPT 427
           G+ GV++D+GT ++RL    Y+A +  F+ Q    P A G SI DTC+NL+G+  V +PT
Sbjct: 337 GNGGVILDSGTVISRLAPSVYKALKAKFLEQFSGFPSAPGFSILDTCFNLTGYDQVNIPT 396

Query: 428 VSFYFSGGPVLTLPASN-FLIPVDDAGTFCFAFA--PSPSGLSIIGNIQQEGIQISFDGA 484
           +S YF G   L + A+  F +  +DA   C A A       + IIGN QQ   ++ +D  
Sbjct: 397 ISMYFEGNAELNVDATGIFYLVKEDASRVCLALASLSDEYEMGIIGNYQQRNQRVLYDAK 456

Query: 485 NGFVGFGPNVC 495
              VGF    C
Sbjct: 457 LSQVGFAKEPC 467


>gi|125564143|gb|EAZ09523.1| hypothetical protein OsI_31798 [Oryza sativa Indica Group]
          Length = 485

 Score =  254 bits (649), Expect = 6e-65,   Method: Compositional matrix adjust.
 Identities = 150/399 (37%), Positives = 220/399 (55%), Gaps = 22/399 (5%)

Query: 109 HAR-MQRDVKRVATLVRRLSGGG-----ADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGV 162
           HA  ++RD  RV ++ R+++G G      D A+   Q        G+  G+G Y V +G+
Sbjct: 96  HAEILERDQARVDSIHRKVAGAGGAPSVVDPARASEQGVSLPAQRGISLGTGNYVVSVGL 155

Query: 163 GSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENA 222
           G+P +   ++ D+GSD+ WVQC+PC+ CY+Q DP+FDP+ S++++ V+C +  C  L+ +
Sbjct: 156 GTPAKQYAVIFDTGSDLSWVQCKPCADCYEQQDPLFDPSLSSTYAAVACGAPECQELDAS 215

Query: 223 GCHA-GRCRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQGMFVGAAGLLG 280
           GC +  RCRYEV YGD S T G L  +TLT+  +  +     GCG +N G+F    GL G
Sbjct: 216 GCSSDSRCRYEVQYGDQSQTDGNLVRDTLTLSASDTLPGFVFGCGDQNAGLFGQVDGLFG 275

Query: 281 LGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSF 340
           LG   +SL  Q     G  F+YCL S  +G  G L  G  A P  A +  L  +   PSF
Sbjct: 276 LGREKVSLPSQGAPSYGPGFTYCLPSSSSG-RGYLSLG-GAPPANAQFTALA-DGATPSF 332

Query: 341 YYVGLSGLGVGG--MRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQ 398
           YY+ L G+ VGG  +RIP +              V+D+GT +TRLP  AY   R AF   
Sbjct: 333 YYIDLVGIKVGGRAIRIPATAFAAAGG------TVIDSGTVITRLPPRAYAPLRAAFARS 386

Query: 399 TGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFA 458
                +A  +SI DTCY+ +G  + ++PTV   F+GG  ++L  +  L  V      C A
Sbjct: 387 MAQYKKAPALSILDTCYDFTGHRTAQIPTVELAFAGGATVSLDFTGVLY-VSKVSQACLA 445

Query: 459 FAPSP--SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           FAP+   S ++I+GN QQ+   +++D AN  +GFG   C
Sbjct: 446 FAPNADDSSIAILGNTQQKTFAVTYDVANQRIGFGAKGC 484


>gi|297842769|ref|XP_002889266.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297335107|gb|EFH65525.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 489

 Score =  254 bits (649), Expect = 7e-65,   Method: Compositional matrix adjust.
 Identities = 156/465 (33%), Positives = 236/465 (50%), Gaps = 41/465 (8%)

Query: 58  NELFERHNNISSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHS-----FHARM 112
            ++   HNNI S   S + +        R       +TT  M  HR   S     +  +M
Sbjct: 35  KKILSVHNNIWSPKKSYEASS---SCFSRSLGKGRESTTLEMK-HRELCSGKTIDWGKKM 90

Query: 113 QR----DVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRS 168
           +R    D  RV +L  R+    +   +  V +    + SG+   +  Y V + +G   ++
Sbjct: 91  RRALLLDNIRVQSLQLRIKAMTSSTTEQSVSETQIPLTSGIKLETLNYIVTVELGG--KN 148

Query: 169 QYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAG- 227
             +++D+GSD+ WVQCQPC  CY Q  P++DP+ S+S+  V C+S+ C  L  A  ++G 
Sbjct: 149 MSLIVDTGSDLTWVQCQPCRSCYNQQGPLYDPSVSSSYKTVFCNSSTCQDLVAATGNSGP 208

Query: 228 ----------RCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAG 277
                      C Y VSYGDGSYT+G LA E++ +G T ++N+  GCG  N+G+F GA+G
Sbjct: 209 CGGFNGVVKTTCEYVVSYGDGSYTRGDLASESIVLGDTKLENLVFGCGRNNKGLFGGASG 268

Query: 278 LLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGRE----ALPVGAAWVPLVR 333
           L+GLG  S+SLV Q      G FSYCL S   G+SG+L FG +           + PLV+
Sbjct: 269 LMGLGRSSVSLVSQTLKTFNGVFSYCLPSLEDGASGTLSFGNDFSVYKNSTSVFYTPLVQ 328

Query: 334 NPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRD 393
           NP+  SFY + L+G  +GG+ +         T     G+++D+GT +TRLP   Y+A + 
Sbjct: 329 NPQLRSFYILNLTGASIGGVELK--------TLSFGRGILIDSGTVITRLPPSIYKAVKT 380

Query: 394 AFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASN-FLIPVDDA 452
            F+ Q    P A G SI DTC+NL+ +  + +PT+   F G   L +  +  F     DA
Sbjct: 381 EFLKQFSGFPSAPGYSILDTCFNLTSYEDISIPTIKMIFEGNAELEVDVTGVFYFVKPDA 440

Query: 453 GTFCFAFA--PSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
              C A A     + + IIGN QQ+  ++ +D     +G     C
Sbjct: 441 SLVCLALASLSYENEVGIIGNYQQKNQRVIYDTTQERLGIAGENC 485


>gi|115479815|ref|NP_001063501.1| Os09g0482200 [Oryza sativa Japonica Group]
 gi|50725878|dbj|BAD33407.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|50725881|dbj|BAD33410.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|113631734|dbj|BAF25415.1| Os09g0482200 [Oryza sativa Japonica Group]
 gi|125606112|gb|EAZ45148.1| hypothetical protein OsJ_29786 [Oryza sativa Japonica Group]
          Length = 485

 Score =  254 bits (649), Expect = 7e-65,   Method: Compositional matrix adjust.
 Identities = 150/399 (37%), Positives = 220/399 (55%), Gaps = 22/399 (5%)

Query: 109 HAR-MQRDVKRVATLVRRLSGGGA-----DAAKHEVQDFGTDVVSGMDQGSGEYFVRIGV 162
           HA  ++RD  RV ++ R+++G G      D A+   Q        G+  G+G Y V +G+
Sbjct: 96  HAEILERDQARVDSIHRKVAGAGGAPSVVDPARASEQGVSLPAQRGISLGTGNYVVSVGL 155

Query: 163 GSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENA 222
           G+P +   ++ D+GSD+ WVQC+PC+ CY+Q DP+FDP+ S++++ V+C +  C  L+ +
Sbjct: 156 GTPAKQYAVIFDTGSDLSWVQCKPCADCYEQQDPLFDPSLSSTYAAVACGAPECQELDAS 215

Query: 223 GCHA-GRCRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQGMFVGAAGLLG 280
           GC +  RCRYEV YGD S T G L  +TLT+  +  +     GCG +N G+F    GL G
Sbjct: 216 GCSSDSRCRYEVQYGDQSQTDGNLVRDTLTLSASDTLPGFVFGCGDQNAGLFGQVDGLFG 275

Query: 281 LGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSF 340
           LG   +SL  Q     G  F+YCL S  +G  G L  G  A P  A +  L  +   PSF
Sbjct: 276 LGREKVSLPSQGAPSYGPGFTYCLPSSSSG-RGYLSLG-GAPPANAQFTALA-DGATPSF 332

Query: 341 YYVGLSGLGVGG--MRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQ 398
           YY+ L G+ VGG  +RIP +              V+D+GT +TRLP  AY   R AF   
Sbjct: 333 YYIDLVGIKVGGRAIRIPATAFAAAGG------TVIDSGTVITRLPPRAYAPLRAAFARS 386

Query: 399 TGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFA 458
                +A  +SI DTCY+ +G  + ++PTV   F+GG  ++L  +  L  V      C A
Sbjct: 387 MAQYKKAPALSILDTCYDFTGHRTAQIPTVELAFAGGATVSLDFTGVLY-VSKVSQACLA 445

Query: 459 FAPSP--SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           FAP+   S ++I+GN QQ+   +++D AN  +GFG   C
Sbjct: 446 FAPNADDSSIAILGNTQQKTFAVAYDVANQRIGFGAKGC 484


>gi|125571060|gb|EAZ12575.1| hypothetical protein OsJ_02481 [Oryza sativa Japonica Group]
          Length = 501

 Score =  254 bits (649), Expect = 7e-65,   Method: Compositional matrix adjust.
 Identities = 178/449 (39%), Positives = 237/449 (52%), Gaps = 57/449 (12%)

Query: 81  LELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATL-----VRRLSGGGADAAK 135
           L +VHRD  + ++     + +         R++RD +R + +         + G      
Sbjct: 76  LRVVHRDDFAVNATAAELLAH---------RLRRDKRRASRISAAAGGAAAANGTRVGGG 126

Query: 136 HEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD 195
                F   VVSG+ QGSGEYF +IGVG+P     MV+D+GSD+VW+QC PC +CY QS 
Sbjct: 127 GGGSGFVAPVVSGLAQGSGEYFTKIGVGTPVTPALMVLDTGSDVVWLQCAPCRRCYDQSG 186

Query: 196 PVFDPADSASFSGVSCSSAVCDRLENAGCHAGR--CRYEVSYGDGSYTKGTLALETLTIG 253
            +FDP  S S+  V C++ +C RL++ GC   R  C Y+V+YGDGS T G  A ETLT  
Sbjct: 187 QMFDPRASHSYGAVDCAAPLCRRLDSGGCDLRRKACLYQVAYGDGSVTAGDFATETLTFA 246

Query: 254 RTV-VKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR----- 307
               V  VA+GCGH N+G+FV AAGLLGLG GS+S   Q+  + G +FSYCLV R     
Sbjct: 247 SGARVPRVALGCGHDNEGLFVAAAGLLGLGRGSLSFPSQISRRFGRSFSYCLVDRTSSSA 306

Query: 308 -----------GTGSSGSLVFGREAL-PVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRI 355
                      G+G+ G+L  GR  L P G          RA   +          G   
Sbjct: 307 SATSRSSTVTFGSGARGAL--GRRVLHPDGEEPQDGDVLLRAAHGHQRRRRARPGRGRVR 364

Query: 356 PISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRAS--------- 406
           P  +        G  GV++D+G      P+PA+   R           RA+         
Sbjct: 365 PPPD-----PSTGRGGVIVDSGR-----PSPAWA--RAGRTPPCATRSRAAAAGLRLSPG 412

Query: 407 GVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGL 466
           G S+FDTCY+LSG   V+VPTVS +F+GG    LP  N+LIPVD  GTFCFAFA +  G+
Sbjct: 413 GFSLFDTCYDLSGLKVVKVPTVSMHFAGGAEAALPPENYLIPVDSRGTFCFAFAGTDGGV 472

Query: 467 SIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           SIIGNIQQ+G ++ FDG    +GF P  C
Sbjct: 473 SIIGNIQQQGFRVVFDGDGQRLGFVPKGC 501


>gi|115458644|ref|NP_001052922.1| Os04g0448300 [Oryza sativa Japonica Group]
 gi|113564493|dbj|BAF14836.1| Os04g0448300 [Oryza sativa Japonica Group]
 gi|215766465|dbj|BAG98773.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215767943|dbj|BAH00172.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 454

 Score =  254 bits (649), Expect = 8e-65,   Method: Compositional matrix adjust.
 Identities = 155/408 (37%), Positives = 219/408 (53%), Gaps = 21/408 (5%)

Query: 100 HYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVR 159
           +Y RHQ       +R   R++ LV R +G    ++K      G D+   +  G+GE+ + 
Sbjct: 53  NYSRHQL-LRRAARRSHHRMSRLVARATGVPMTSSKAA---GGGDLQVPVHAGNGEFLMD 108

Query: 160 IGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRL 219
           + +G+P  +   ++D+GSD+VW QC+PC  C+KQS PVFDP+ S++++ V CSSA C  L
Sbjct: 109 VSIGTPALAYSAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSSSTYATVPCSSASCSDL 168

Query: 220 ENAGC-HAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGM-FVGAAG 277
             + C  A +C Y  +YGD S T+G LA ET T+ ++ +  V  GCG  N+G  F   AG
Sbjct: 169 PTSKCTSASKCGYTYTYGDSSSTQGVLATETFTLAKSKLPGVVFGCGDTNEGDGFSQGAG 228

Query: 278 LLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREA-------LPVGAAWVP 330
           L+GLG G +SLV QLG      FSYCL S    ++  L+ G  A               P
Sbjct: 229 LVGLGRGPLSLVSQLGLDK---FSYCLTSLDDTNNSPLLLGSLAGISEASAAASSVQTTP 285

Query: 331 LVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEA 390
           L++NP  PSFYYV L  + VG  RI +    F +   G  GV++D+GT++T L    Y A
Sbjct: 286 LIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTGGVIVDSGTSITYLEVQGYRA 345

Query: 391 FRDAFVAQTGNLPRASGVSI-FDTCYN--LSGFVSVRVPTVSFYFSGGPVLTLPASNFLI 447
            + AF AQ   LP A G  +  D C+     G   V VP + F+F GG  L LPA N+++
Sbjct: 346 LKKAFAAQMA-LPAADGSGVGLDLCFRAPAKGVDQVEVPRLVFHFDGGADLDLPAENYMV 404

Query: 448 PVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
               +G  C     S  GLSIIGN QQ+  Q  +D  +  + F P  C
Sbjct: 405 LDGGSGALCLTVMGS-RGLSIIGNFQQQNFQFVYDVGHDTLSFAPVQC 451


>gi|255563827|ref|XP_002522914.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223537841|gb|EEF39457.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 442

 Score =  254 bits (648), Expect = 9e-65,   Method: Compositional matrix adjust.
 Identities = 147/391 (37%), Positives = 219/391 (56%), Gaps = 18/391 (4%)

Query: 111 RMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQY 170
           R+Q  +KR    + RL+     A+ +       ++ S +  G+GE+ + + +G+PP +  
Sbjct: 61  RIQHGIKRANHRLERLNAMVLAASSN------AEINSPVLSGNGEFLMNLAIGTPPETYS 114

Query: 171 MVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCR 230
            ++D+GSD++W QC+PC+QC+ Q  P+FDP  S+SFS +SCSS +C  L  + C +  C 
Sbjct: 115 AIMDTGSDLIWTQCKPCTQCFDQPSPIFDPKKSSSFSKLSCSSQLCKALPQSSC-SDSCE 173

Query: 231 YEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGM-FVGAAGLLGLGGGSMSLV 289
           Y  +YGD S T+GT+A ET T G+  + NV  GCG  N+G  F   +GL+GLG G +SLV
Sbjct: 174 YLYTYGDYSSTQGTMATETFTFGKVSIPNVGFGCGEDNEGDGFTQGSGLVGLGRGPLSLV 233

Query: 290 GQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAA----WVPLVRNPRAPSFYYVGL 345
            QL       FSYCL S     + +L+ G  A   G +      PL++NP  PSFYY+ L
Sbjct: 234 SQL---KEAKFSYCLTSIDDTKTSTLLMGSLASVNGTSAAIRTTPLIQNPLQPSFYYLSL 290

Query: 346 SGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRA 405
            G+ VGG R+PI E  F+L   G  G+++D+GT +T L   A++  +  F +Q G     
Sbjct: 291 EGISVGGTRLPIKESTFQLQDDGTGGLIIDSGTTITYLEESAFDLVKKEFTSQMGLPVDN 350

Query: 406 SGVSIFDTCYNLSGFVS-VRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPS 464
           SG +  + CYNL    S + VP +  +F+G   L LP  N++I     G  C A   S  
Sbjct: 351 SGATGLELCYNLPSDTSELEVPKLVLHFTGAD-LELPGENYMIADSSMGVICLAMG-SSG 408

Query: 465 GLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           G+SI GN+QQ+ + +S D     + F P  C
Sbjct: 409 GMSIFGNVQQQNMFVSHDLEKETLSFLPTNC 439


>gi|225216930|gb|ACN85225.1| aspartic proteinase nepenthesin-1 precursor [Oryza punctata]
 gi|225216938|gb|ACN85232.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
          Length = 516

 Score =  254 bits (648), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 144/354 (40%), Positives = 201/354 (56%), Gaps = 14/354 (3%)

Query: 148 GMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQ-CYKQSDPVFDPADSASF 206
           G   G+G Y V +G+G+P     +V D+GSD  WVQCQPC   CY+Q + +FDPA S+++
Sbjct: 171 GRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQREKLFDPARSSTY 230

Query: 207 SGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGR-TVVKNVAIGCG 265
           + VSC++  C  L+  GC  G C Y V YGDGSY+ G  A++TLT+     VK    GCG
Sbjct: 231 ANVSCAAPACSDLDTRGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYDAVKGFRFGCG 290

Query: 266 HKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVG 325
            +N+G+F  AAGLLGLG G  SL  Q   + GG F++CL +R TG +G L FG  +    
Sbjct: 291 ERNEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARSTG-TGYLDFGAGSPAAR 349

Query: 326 AAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPT 385
               P++ +   P+FYYVGL+G+ VGG  + I + +F        G ++D+GT +TRLP 
Sbjct: 350 LTTTPMLVD-NGPTFYYVGLTGIRVGGRLLYIPQSVFATA-----GTIVDSGTVITRLPP 403

Query: 386 PAYEAFRDAFVAQTG--NLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPAS 443
            AY + R AF A        +A  VS+ DTCY+ +G   V +PTVS  F GG  L + AS
Sbjct: 404 AAYSSLRSAFAAAMSARGYKKAPAVSLLDTCYDFAGMSQVAIPTVSLLFQGGARLDVDAS 463

Query: 444 NFLIPVDDAGTFCFAFAPSPSG--LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
             +     A   C AFA +  G  + I+GN Q +   +++D     V F P  C
Sbjct: 464 GIMYAA-SASQVCLAFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVVSFSPGAC 516


>gi|38344829|emb|CAD40873.2| OSJNBa0064H22.10 [Oryza sativa Japonica Group]
 gi|116310063|emb|CAH67084.1| H0818E04.1 [Oryza sativa Indica Group]
 gi|116310186|emb|CAH67198.1| OSIGBa0152K17.10 [Oryza sativa Indica Group]
          Length = 444

 Score =  254 bits (648), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 155/408 (37%), Positives = 219/408 (53%), Gaps = 21/408 (5%)

Query: 100 HYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVR 159
           +Y RHQ       +R   R++ LV R +G    ++K      G D+   +  G+GE+ + 
Sbjct: 43  NYSRHQL-LRRAARRSHHRMSRLVARATGVPMTSSKAA---GGGDLQVPVHAGNGEFLMD 98

Query: 160 IGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRL 219
           + +G+P  +   ++D+GSD+VW QC+PC  C+KQS PVFDP+ S++++ V CSSA C  L
Sbjct: 99  VSIGTPALAYSAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSSSTYATVPCSSASCSDL 158

Query: 220 ENAGC-HAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGM-FVGAAG 277
             + C  A +C Y  +YGD S T+G LA ET T+ ++ +  V  GCG  N+G  F   AG
Sbjct: 159 PTSKCTSASKCGYTYTYGDSSSTQGVLATETFTLAKSKLPGVVFGCGDTNEGDGFSQGAG 218

Query: 278 LLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREA-------LPVGAAWVP 330
           L+GLG G +SLV QLG      FSYCL S    ++  L+ G  A               P
Sbjct: 219 LVGLGRGPLSLVSQLGLDK---FSYCLTSLDDTNNSPLLLGSLAGISEASAAASSVQTTP 275

Query: 331 LVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEA 390
           L++NP  PSFYYV L  + VG  RI +    F +   G  GV++D+GT++T L    Y A
Sbjct: 276 LIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTGGVIVDSGTSITYLEVQGYRA 335

Query: 391 FRDAFVAQTGNLPRASGVSI-FDTCYN--LSGFVSVRVPTVSFYFSGGPVLTLPASNFLI 447
            + AF AQ   LP A G  +  D C+     G   V VP + F+F GG  L LPA N+++
Sbjct: 336 LKKAFAAQMA-LPAADGSGVGLDLCFRAPAKGVDQVEVPRLVFHFDGGADLDLPAENYMV 394

Query: 448 PVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
               +G  C     S  GLSIIGN QQ+  Q  +D  +  + F P  C
Sbjct: 395 LDGGSGALCLTVMGS-RGLSIIGNFQQQNFQFVYDVGHDTLSFAPVQC 441


>gi|356523155|ref|XP_003530207.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 412

 Score =  253 bits (647), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 149/421 (35%), Positives = 229/421 (54%), Gaps = 42/421 (9%)

Query: 100 HYHRHQHSFHARMQRD-------VKRVATLVRRLSGGGADAAKHEVQDFGTDV--VSGMD 150
           H    +  ++ R+Q+        V+ +   +RR+       + H V+   T +   SG++
Sbjct: 6   HCSEKKIDWNRRLQKQLISDDLRVRSMQNRIRRV------VSSHNVEASQTQIPLSSGIN 59

Query: 151 QGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVS 210
             +  Y V +G+GS   +  ++ID+GSD+ WVQC+PC  CY Q  P+F P+ S+S+  VS
Sbjct: 60  LQTLNYIVTMGLGSTNMT--VIIDTGSDLTWVQCEPCMSCYNQQGPIFKPSTSSSYQSVS 117

Query: 211 CSSAVCDRLENA-------GCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIG 263
           C+S+ C  L+ A       G +   C Y V+YGDGSYT G L +E L+ G   V +   G
Sbjct: 118 CNSSTCQSLQFATGNTGACGSNPSTCNYVVNYGDGSYTNGELGVEQLSFGGVSVSDFVFG 177

Query: 264 CGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREA-- 321
           CG  N+G+F G +GL+GLG   +SLV Q     GG FSYCL +  +G+SGSLV G E+  
Sbjct: 178 CGRNNKGLFGGVSGLMGLGRSYLSLVSQTNATFGGVFSYCLPTTESGASGSLVMGNESSV 237

Query: 322 ----LPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTG 377
                P+   +  ++ NP+  +FY + L+G+ V G+ +       ++   G+ GV++D+G
Sbjct: 238 FKNVTPI--TYTRMLPNPQLSNFYILNLTGIDVDGVAL-------QVPSFGNGGVLIDSG 288

Query: 378 TAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPV 437
           T +TRLP+  Y+A +  F+ Q    P A G SI DTC+NL+G+  V +PT+S +F G   
Sbjct: 289 TVITRLPSSVYKALKALFLKQFTGFPSAPGFSILDTCFNLTGYDEVSIPTISMHFEGNAE 348

Query: 438 LTLPAS-NFLIPVDDAGTFCFAFAPSPSGL--SIIGNIQQEGIQISFDGANGFVGFGPNV 494
           L + A+  F +  +DA   C A A        +IIGN QQ   ++ +D     VGF    
Sbjct: 349 LKVDATGTFYVVKEDASQVCLALASLSDAYDTAIIGNYQQRNQRVIYDTKQSKVGFAEES 408

Query: 495 C 495
           C
Sbjct: 409 C 409


>gi|242066176|ref|XP_002454377.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
 gi|241934208|gb|EES07353.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
          Length = 474

 Score =  253 bits (646), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 179/506 (35%), Positives = 260/506 (51%), Gaps = 55/506 (10%)

Query: 9   LLKQVLLLHLLCS----IITTSTSAASDTHFQILNVNESIKGSRTDHAKMSQYNELFERH 64
           ++++ LLL L+C+     +  S  AA    +  ++     + S T     S  + + +R 
Sbjct: 5   VVRRALLLSLICAGALGFLPCSHGAAVAPGYVTVSAAR-FRPSST----CSSLDPVAQRR 59

Query: 65  NNISSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVR 124
            N +S+          L L H+    + S  ++         S    ++ D +R   ++R
Sbjct: 60  RNGTSAV---------LRLTHKHGPCAPSRASS-----LATPSVADTLRADQRRAEYILR 105

Query: 125 RLSGGGADAAKHEVQDFGTDVVS---GMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVW 181
           R+SG G         +  T  V    G + G+  Y V + +G+P  +Q + +D+GSD+ W
Sbjct: 106 RVSGRGTPQLWDSKAEAATATVPANWGFNIGTLNYVVTVSLGTPGVAQTLEVDTGSDLSW 165

Query: 182 VQCQPCSQ--CYKQSDPVFDPADSASFSGVSCSSAVCDRL--ENAGCHAGRCRYEVSYGD 237
           VQC PC+   CY Q DP+FDPA S+S++ V C   VC  L    + C A +C Y VSYGD
Sbjct: 166 VQCTPCAAPACYSQKDPLFDPAQSSSYAAVPCGGPVCGGLGIYASSCSAAQCGYVVSYGD 225

Query: 238 GSYTKGTLALETLTIG-RTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQT 296
           GS T G  + +TLT+     V+    GCGH   G F G  GLLGLG    SLV Q  G  
Sbjct: 226 GSKTTGVYSSDTLTLSPNDAVRGFFFGCGHAQSG-FTGNDGLLGLGREEASLVEQTAGTY 284

Query: 297 GGAFSYCLVSRGTGSSGSLVFGRE--ALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMR 354
           GG FSYCL +R + ++G L  G    A P G +   L+ +P A ++Y V L+G+ VGG +
Sbjct: 285 GGVFSYCLPTRPS-TTGYLTLGGPSGAAPPGFSTTQLLSSPNAATYYVVMLTGISVGGQQ 343

Query: 355 IPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNL--PRASGVSIFD 412
           + +   +F        G V+DTGT +TRLP  AY A R AF +   +   P A    I D
Sbjct: 344 LSVPSSVFA------GGTVVDTGTVITRLPPTAYAALRSAFRSGMASYGYPSAPATGILD 397

Query: 413 TCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTF-CFAFAPSPS--GLSII 469
           TCYN SG+ +V +P V+  FSGG  +TL A   L       +F C AFAPS S  G++I+
Sbjct: 398 TCYNFSGYGTVTLPNVALTFSGGATVTLGADGIL-------SFGCLAFAPSGSDGGMAIL 450

Query: 470 GNIQQEGIQISFDGANGFVGFGPNVC 495
           GN+QQ   ++  DG +  VGF P+ C
Sbjct: 451 GNVQQRSFEVRIDGTS--VGFKPSSC 474


>gi|242093566|ref|XP_002437273.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
 gi|241915496|gb|EER88640.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
          Length = 503

 Score =  253 bits (646), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 147/355 (41%), Positives = 203/355 (57%), Gaps = 13/355 (3%)

Query: 147 SGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC-SQCYKQSDPVFDPADSAS 205
           SG+   +G Y V I +G+P     +V D+GSD  WVQCQPC + CY+Q +P+F P  SA+
Sbjct: 156 SGLSLNTGNYVVPIRLGTPAARFTVVFDTGSDTTWVQCQPCVAYCYQQKEPLFTPTKSAT 215

Query: 206 FSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCG 265
           ++ +SC+S+ C  L+  GC  G C Y V YGDGSYT G  A +TLT+G   VK+   GCG
Sbjct: 216 YANISCTSSYCSDLDTRGCSGGHCLYAVQYGDGSYTVGFYAQDTLTLGYDTVKDFRFGCG 275

Query: 266 HKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVG 325
            KN+G+F  AAGL+GLG G  S+  Q   +  G F+YC+ +  +G +G L FG  A    
Sbjct: 276 EKNRGLFGKAAGLMGLGRGKTSVPVQAYDKYSGVFAYCIPATSSG-TGFLDFGPGAPAAA 334

Query: 326 AAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPT 385
            A +  +     P+FYYVG++G+ VGG  + I   +F      D G ++D+GT +TRLP 
Sbjct: 335 NARLTPMLVDNGPTFYYVGMTGIKVGGHLLSIPATVF-----SDAGALVDSGTVITRLPP 389

Query: 386 PAYEAFRDAFVAQTGNL--PRASGVSIFDTCYNLSGFV-SVRVPTVSFYFSGGPVLTLPA 442
            AYE  R AF      L    A   SI DTCY+L+G+  S+ +P VS  F GG  L + A
Sbjct: 390 SAYEPLRSAFAKGMEGLGYKTAPAFSILDTCYDLTGYQGSIALPAVSLVFQGGACLDVDA 449

Query: 443 SNFLIPVDDAGTFCFAFAPS--PSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           S  L  V D    C AFA +   + ++I+GN QQ+   + +D     VGF P  C
Sbjct: 450 SGILY-VADVSQACLAFAANDDDTDMTIVGNTQQKTYSVLYDLGKKVVGFAPGAC 503


>gi|145693992|gb|ABP93696.1| unknown protein isoform 1 [Lemna minor]
          Length = 350

 Score =  253 bits (645), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 138/352 (39%), Positives = 201/352 (57%), Gaps = 14/352 (3%)

Query: 148 GMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS-QCYKQSDPVFDPADSASF 206
           G+  G+  Y + +G G+P ++Q ++ D+GS++ W+QC+PC   CY Q +P+FDP  S+++
Sbjct: 8   GLYIGTANYVITVGFGTPKKNQTVIFDTGSNVNWIQCKPCVVSCYPQQEPLFDPTLSSTY 67

Query: 207 SGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGR-TVVKNVAIGCG 265
             +SC+SA C  L + GC    C Y V+YGDGS T G LA ET T+    V  N   GCG
Sbjct: 68  RNISCTSAACTGLSSRGCSGSTCVYGVTYGDGSSTVGFLATETFTLAAGNVFNNFIFGCG 127

Query: 266 HKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVG 325
             NQG+F GAAGL+GLG    SL  QL    G  FSYCL S  + ++G L  G      G
Sbjct: 128 QNNQGLFTGAAGLIGLGRSPYSLNSQLATSLGNIFSYCLPSTSS-ATGYLNIGNPLRTPG 186

Query: 326 AAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPT 385
             +  ++ N RAP+ Y++ L G+ VGG R+ +S  +F+       G ++D+GT +TRLP 
Sbjct: 187 --YTAMLTNSRAPTLYFIDLIGISVGGTRLALSSTVFQ-----SVGTIIDSGTVITRLPP 239

Query: 386 PAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNF 445
            AY A R AF A      RA+  SI DTCY+ S   +V  PT+  +++G  V T+P +  
Sbjct: 240 TAYGALRTAFRAAMTQYTRAAAASILDTCYDFSRTTTVTFPTIKLHYTGLDV-TIPGAGV 298

Query: 446 LIPVDDAGTFCFAFA--PSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
              V  +   C AFA     + + IIGN+QQ  +++++D A   +GF    C
Sbjct: 299 FY-VISSSQVCLAFAGNSDSTQIGIIGNVQQRTMEVTYDNALKRIGFAAGAC 349


>gi|356527091|ref|XP_003532147.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 482

 Score =  252 bits (644), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 155/431 (35%), Positives = 230/431 (53%), Gaps = 22/431 (5%)

Query: 76  EARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSG--GGADA 133
           + + +LE+VH+    S  N +          S +  M  D +RV  +  RLS   GG + 
Sbjct: 62  KRKASLEVVHKHGPCSQLNHSGKA---EATISHNDIMNLDNERVKYIQSRLSKNLGGENR 118

Query: 134 AKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS-QCYK 192
            K E+        SG   GS +Y+V +G+G+P R   ++ D+GS + W QC+PC+  CYK
Sbjct: 119 VK-ELDSTTLPAKSGRLIGSADYYVVVGLGTPKRDLSLIFDTGSYLTWTQCEPCAGSCYK 177

Query: 193 QSDPVFDPADSASFSGVSCSSAVCDRLENAGCHA---GRCRYEVSYGDGSYTKGTLALET 249
           Q DP+FDP+ S+S++ + C+S++C +  +AGC +     C Y+V YGD S ++G L+ E 
Sbjct: 178 QQDPIFDPSKSSSYTNIKCTSSLCTQFRSAGCSSSTDASCIYDVKYGDNSISRGFLSQER 237

Query: 250 LTIGRT-VVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRG 308
           LTI  T +V +   GCG  N+G+F G AGL+GL    +S V Q        FSYCL S  
Sbjct: 238 LTITATDIVHDFLFGCGQDNEGLFRGTAGLMGLSRHPISFVQQTSSIYNKIFSYCLPSTP 297

Query: 309 TGSSGSLVFGREALP-VGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIP-ISEDLFRLTQ 366
           + S G L FG  A       + P        SFY + + G+ VGG ++P +S   F    
Sbjct: 298 S-SLGHLTFGASAATNANLKYTPFSTISGENSFYGLDIVGISVGGTKLPAVSSSTFSA-- 354

Query: 367 MGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVP 426
               G ++D+GT +TRLP  AY A R AF       P A G  + DTCY+ SG+  + VP
Sbjct: 355 ---GGSIIDSGTVITRLPPTAYAALRSAFRQFMMKYPVAYGTRLLDTCYDFSGYKEISVP 411

Query: 427 TVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSG--LSIIGNIQQEGIQISFDGA 484
            + F F+GG  + LP    L   + A   C AFA + +G  ++I GN+QQ+ +++ +D  
Sbjct: 412 RIDFEFAGGVKVELPLVGILYG-ESAQQLCLAFAANGNGNDITIFGNVQQKTLEVVYDVE 470

Query: 485 NGFVGFGPNVC 495
            G +GFG   C
Sbjct: 471 GGRIGFGAAGC 481


>gi|147862576|emb|CAN79341.1| hypothetical protein VITISV_006338 [Vitis vinifera]
          Length = 436

 Score =  252 bits (644), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 146/388 (37%), Positives = 207/388 (53%), Gaps = 17/388 (4%)

Query: 111 RMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQY 170
           R+QR VKR    ++RLS   A         F   V + +  G+GE+ + + +G+P  +  
Sbjct: 60  RLQRAVKRGRLRLQRLSAKTAS--------FEPSVEAPVHAGNGEFLMNLAIGTPAETYS 111

Query: 171 MVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCR 230
            ++D+GSD++W QC+PC  C+ Q  P+FDP  S+SFS + CSS +C  L  + C  G C 
Sbjct: 112 AIMDTGSDLIWTQCKPCKVCFDQPTPIFDPEKSSSFSKLPCSSDLCVALPISSCSDG-CE 170

Query: 231 YEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQG-MFVGAAGLLGLGGGSMSLV 289
           Y  SYGD S T+G LA ET T G   V  +  GCG  N+G  +   AGL+GLG G +SL+
Sbjct: 171 YRYSYGDHSSTQGVLATETFTFGDASVSKIGFGCGEDNRGRAYSQGAGLVGLGRGPLSLI 230

Query: 290 GQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGL 348
            QLG      FSYCL S   +    +L+ G EA    A   PL++NP  PSFYY+ L G+
Sbjct: 231 SQLGVP---KFSYCLTSIDDSKGISTLLVGSEATVKSAIPTPLIQNPSRPSFYYLSLEGI 287

Query: 349 GVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGV 408
            VG   +PI +  F +   G  G+++D+GT +T L   A+ A +  F++Q      ASG 
Sbjct: 288 SVGDTLLPIEKSTFSIQDDGSGGLIIDSGTTITYLKDNAFAALKKEFISQMKLDVDASGS 347

Query: 409 SIFDTCYNLSGFVS-VRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLS 467
           +  + C+ L    S V VP + F+F G   L LP  N++I        C     S SG+S
Sbjct: 348 TELELCFTLPPDGSPVEVPQLVFHFEGVD-LKLPKENYIIEDSALRVICLTMG-SSSGMS 405

Query: 468 IIGNIQQEGIQISFDGANGFVGFGPNVC 495
           I GN QQ+ I +  D     + F P  C
Sbjct: 406 IFGNFQQQNIVVLHDLEKETISFAPAQC 433


>gi|168043550|ref|XP_001774247.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162674374|gb|EDQ60883.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 419

 Score =  252 bits (644), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 156/418 (37%), Positives = 222/418 (53%), Gaps = 26/418 (6%)

Query: 82  ELVHRDKMSSS--SNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQ 139
           EL+HR+  SS   SNT+           F A ++R  +R A L + +   G        +
Sbjct: 21  ELIHREHPSSPLRSNTSKTT-----TEIFLAAVKRGAERRAQLSKHILAEG--------R 67

Query: 140 DFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFD 199
            F T V SG    +GEY + I  GSPP+   +++D+GSD++W QC PC  C   +  +FD
Sbjct: 68  LFSTPVASG----NGEYLIDISFGSPPQKASVIVDTGSDLIWTQCLPCETCNAAASVIFD 123

Query: 200 PADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKN 259
           P  S+++  VSC+S  C  L    C    C+Y+  YGDGS T G L+ ET+T+G   + N
Sbjct: 124 PVKSSTYDTVSCASNFCSSLPFQSCTT-SCKYDYMYGDGSSTSGALSTETVTVGTGTIPN 182

Query: 260 VAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGR 319
           VA GCGH N G F GAAG++GLG G +SL+ Q    T   FSYCLV  G+  +  ++ G 
Sbjct: 183 VAFGCGHTNLGSFAGAAGIVGLGQGPLSLISQASSITSKKFSYCLVPLGSTKTSPMLIGD 242

Query: 320 EALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTA 379
            A   G A+  L+ N   P+FYY  L+G+ V G  +      F +   G  G ++D+GT 
Sbjct: 243 SAAAGGVAYTALLTNTANPTFYYADLTGISVSGKAVTYPVGTFSIDASGQGGFILDSGTT 302

Query: 380 VTRLPTPAYEAFRDAFVAQTGNLPRASGVSIF--DTCYNLSGFVSVRVPTVSFYFSGGPV 437
           +T L T A+ A   A  A+    P A G S++  D C++ +G  +   PT++F+F G   
Sbjct: 303 LTYLETGAFNALVAALKAEV-PFPEADG-SLYGLDYCFSTAGVANPTYPTMTFHFKGAD- 359

Query: 438 LTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
             LP  N  + +D  G+ C A A S +G SI+GNIQQ+   I  D  N  VGF    C
Sbjct: 360 YELPPENVFVALDTGGSICLAMAAS-TGFSIMGNIQQQNHLIVHDLVNQRVGFKEANC 416


>gi|242084330|ref|XP_002442590.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
 gi|241943283|gb|EES16428.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
          Length = 494

 Score =  252 bits (644), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 183/463 (39%), Positives = 254/463 (54%), Gaps = 38/463 (8%)

Query: 57  YNELFERHNNISSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDV 116
           Y+      +N S S++S+     ++ L+HRD  + ++     +           R+QRD 
Sbjct: 46  YSAPAAADDNFSVSSSSA----LHIHLLHRDSFAVNATAAELLAR---------RLQRDE 92

Query: 117 KRVATLVRRLSGGGADAAKHEV---QDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVI 173
            R A ++ + +  G       +   +     VVS     SGEY  +I VG+P     + +
Sbjct: 93  LRAAWIISKAAANGTPPPVVGLSTGRGLVAPVVSRAPT-SGEYMAKIAVGTPAVQALLAL 151

Query: 174 DSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAG---CHAGRCR 230
           D+ SD+ W+QCQPC +CY QS PVFDP  S S+  ++  +  C  L  +G      G C 
Sbjct: 152 DTASDLTWLQCQPCRRCYPQSGPVFDPRHSTSYGEMNYDAPDCQALGRSGGGDAKRGTCI 211

Query: 231 YEVSYGDG----SYTKGTLALETLTIGRTVVKN-VAIGCGHKNQGMF-VGAAGLLGLGGG 284
           Y V YGDG    S + G L  ETLT    V +  ++IGCGH N+G+F   AAG+LGLG G
Sbjct: 212 YTVQYGDGHGSTSTSVGDLVEETLTFAGGVRQAYLSIGCGHDNKGLFGAPAAGILGLGRG 271

Query: 285 SMSLVGQLGGQ-TGGAFSYCLVS--RGTGS-SGSLVFGREALPVG--AAWVPLVRNPRAP 338
            +S+  Q+       +FSYCLV    G GS S +L FG  A+     A++ P V N   P
Sbjct: 272 QISIPHQIAFLGYNASFSYCLVDFISGPGSPSSTLTFGAGAVDTSPPASFTPTVLNQNMP 331

Query: 339 SFYYVGLSGLGVGGMRIP--ISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFV 396
           +FYYV L G+ VGG+R+P     DL      G  GV++D+GT VTRL  PAY AFRDAF 
Sbjct: 332 TFYYVRLIGVSVGGVRVPGVTERDLQLDPYTGRGGVILDSGTTVTRLARPAYVAFRDAFR 391

Query: 397 AQTGNLPRAS--GVS-IFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAG 453
           A   +L + S  G S +FDTCY + G   V+VP VS +F+GG  ++L   N+LIPVD  G
Sbjct: 392 AAATSLGQVSTGGPSGLFDTCYTVGGRAGVKVPAVSMHFAGGVEVSLQPKNYLIPVDSRG 451

Query: 454 TFCFAFAPS-PSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           T CFAFA +    +S+IGNI Q+G ++ +D A   VGF PN C
Sbjct: 452 TVCFAFAGTGDRSVSVIGNILQQGFRVVYDLAGQRVGFAPNNC 494


>gi|225437854|ref|XP_002264056.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
          Length = 436

 Score =  252 bits (643), Expect = 4e-64,   Method: Compositional matrix adjust.
 Identities = 146/388 (37%), Positives = 207/388 (53%), Gaps = 17/388 (4%)

Query: 111 RMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQY 170
           R+QR VKR    ++RLS   A         F   V + +  G+GE+ + + +G+P  +  
Sbjct: 60  RLQRAVKRGRLRLQRLSAKTAS--------FEPSVEAPVHAGNGEFLMNLAIGTPAETYS 111

Query: 171 MVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCR 230
            ++D+GSD++W QC+PC  C+ Q  P+FDP  S+SFS + CSS +C  L  + C  G C 
Sbjct: 112 AIMDTGSDLIWTQCKPCKVCFDQPTPIFDPEKSSSFSKLPCSSDLCVALPISSCSDG-CE 170

Query: 231 YEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQG-MFVGAAGLLGLGGGSMSLV 289
           Y  SYGD S T+G LA ET T G   V  +  GCG  N+G  +   AGL+GLG G +SL+
Sbjct: 171 YRYSYGDHSSTQGVLATETFTFGDASVSKIGFGCGEDNRGRAYSQGAGLVGLGRGPLSLI 230

Query: 290 GQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGL 348
            QLG      FSYCL S   +    +L+ G EA    A   PL++NP  PSFYY+ L G+
Sbjct: 231 SQLGVP---KFSYCLTSIDDSKGISTLLVGSEATVKSAIPTPLIQNPSRPSFYYLSLEGI 287

Query: 349 GVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGV 408
            VG   +PI +  F +   G  G+++D+GT +T L   A+ A +  F++Q      ASG 
Sbjct: 288 SVGDTLLPIEKSTFSIQDDGSGGLIIDSGTTITYLKDSAFAALKKEFISQMKLDVDASGS 347

Query: 409 SIFDTCYNLSGFVS-VRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLS 467
           +  + C+ L    S V VP + F+F G   L LP  N++I        C     S SG+S
Sbjct: 348 TELELCFTLPPDGSPVDVPQLVFHFEGVD-LKLPKENYIIEDSALRVICLTMG-SSSGMS 405

Query: 468 IIGNIQQEGIQISFDGANGFVGFGPNVC 495
           I GN QQ+ I +  D     + F P  C
Sbjct: 406 IFGNFQQQNIVVLHDLEKETISFAPAQC 433


>gi|224060469|ref|XP_002300215.1| predicted protein [Populus trichocarpa]
 gi|222847473|gb|EEE85020.1| predicted protein [Populus trichocarpa]
          Length = 439

 Score =  251 bits (642), Expect = 5e-64,   Method: Compositional matrix adjust.
 Identities = 153/392 (39%), Positives = 222/392 (56%), Gaps = 20/392 (5%)

Query: 111 RMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQY 170
           R++  VKR    ++RL       A   V    +++ + +  G+GE+ +++ +G+PP +  
Sbjct: 58  RIRHGVKRGRNRLQRLQ------AMALVASSSSEIEAPVLPGNGEFLMKLAIGTPPETYS 111

Query: 171 MVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCR 230
            ++D+GSD++W QC+PC+QC+ QS P+FDP  S+SFS +SCSS +C+ L  + C+ G C 
Sbjct: 112 AILDTGSDLIWTQCKPCTQCFHQSTPIFDPKKSSSFSKLSCSSQLCEALPQSSCNNG-CE 170

Query: 231 YEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGM-FVGAAGLLGLGGGSMSLV 289
           Y  SYGD S T+G LA ETLT G+  V NVA GCG  N+G  F   AGL+GLG G +SLV
Sbjct: 171 YLYSYGDYSSTQGILASETLTFGKASVPNVAFGCGADNEGSGFSQGAGLVGLGRGPLSLV 230

Query: 290 GQLGGQTGGAFSYCLVSRGTGSSGSLVFGR----EALPVGAAWVPLVRNPRAPSFYYVGL 345
            QL       FSYCL +     + +L+ G      A        PL+ +P  PSFYY+ L
Sbjct: 231 SQLKEP---KFSYCLTTVDDTKTSTLLMGSLASVNASSSAIKTTPLIHSPAHPSFYYLSL 287

Query: 346 SGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLP-R 404
            G+ VG  R+PI +  F L   G  G+++D+GT +T L   A+      F A+  NLP  
Sbjct: 288 EGISVGDTRLPIKKSTFSLQDDGSGGLIIDSGTTITYLEESAFNLVAKEFTAKI-NLPVD 346

Query: 405 ASGVSIFDTCYNL-SGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSP 463
           +SG +  D C+ L SG  ++ VP + F+F G   L LPA N++I     G  C A   S 
Sbjct: 347 SSGSTGLDVCFTLPSGSTNIEVPKLVFHFDGAD-LELPAENYMIGDSSMGVACLAMG-SS 404

Query: 464 SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           SG+SI GN+QQ+ + +  D     + F P  C
Sbjct: 405 SGMSIFGNVQQQNMLVLHDLEKETLSFLPTQC 436


>gi|168041176|ref|XP_001773068.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162675615|gb|EDQ62108.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 365

 Score =  251 bits (641), Expect = 6e-64,   Method: Compositional matrix adjust.
 Identities = 140/355 (39%), Positives = 205/355 (57%), Gaps = 17/355 (4%)

Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSS 213
           GEY   + +G+P R   +++D+GSD+ WVQC PC +CY Q+D +F P  S SF+ ++C S
Sbjct: 11  GEYLATVRLGTPERVFSVIVDTGSDLTWVQCSPCGKCYSQNDALFLPNTSTSFTKLACGS 70

Query: 214 AVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIG-----RTVVKNVAIGCGHKN 268
           A+C+ L    C+   C Y  SYGDGS T G    +T+T+      +  V N A GCGH N
Sbjct: 71  ALCNGLPFPMCNQTTCVYWYSYGDGSLTTGDFVYDTITMDGINGQKQQVPNFAFGCGHDN 130

Query: 269 QGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS--RGTGSSGSLVFGREALPV-- 324
           +G F GA G+LGLG G +S   QL     G FSYCLV        +  L+FG  A+P+  
Sbjct: 131 EGSFAGADGILGLGQGPLSFHSQLKSVYNGKFSYCLVDWLAPPTQTSPLLFGDAAVPILP 190

Query: 325 GAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLP 384
              ++P++ NP+ P++YYV L+G+ VG   + IS  +F +  +G  G + D+GT VT+L 
Sbjct: 191 DVKYLPILANPKVPTYYYVKLNGISVGDNLLNISSTVFDIDSVGGAGTIFDSGTTVTQLA 250

Query: 385 TPAYEAFRDAFVAQTGNLPRA-SGVSIFDTCYNLSGFVSVRVPTV---SFYFSGGPVLTL 440
             AY+    A  A T    R    +S  D C  LSGF   ++PTV   +F+F GG  + L
Sbjct: 251 EAAYKEVLAAMNASTMAYSRKIDDISRLDLC--LSGFPKDQLPTVPAMTFHFEGGD-MVL 307

Query: 441 PASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           P SN+ I ++ + ++CFA   SP  ++IIG++QQ+  Q+ +D A   +GF P  C
Sbjct: 308 PPSNYFIYLESSQSYCFAMTSSPD-VNIIGSVQQQNFQVYYDTAGRKLGFVPKDC 361


>gi|125548488|gb|EAY94310.1| hypothetical protein OsI_16079 [Oryza sativa Indica Group]
          Length = 423

 Score =  251 bits (641), Expect = 6e-64,   Method: Compositional matrix adjust.
 Identities = 142/356 (39%), Positives = 197/356 (55%), Gaps = 17/356 (4%)

Query: 152 GSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSC 211
           G+GE+ + + +G+P  +   ++D+GSD+VW QC+PC  C+KQS PVFDP+ S++++ V C
Sbjct: 70  GNGEFLMDVSIGTPALAYSAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSSSTYATVPC 129

Query: 212 SSAVCDRLENAGC-HAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQG 270
           SSA C  L  + C  A +C Y  +YGD S T+G LA ET T+ ++ +  V  GCG  N+G
Sbjct: 130 SSASCSDLPTSKCTSASKCGYTYTYGDSSSTQGVLATETFTLAKSKLPGVVFGCGDTNEG 189

Query: 271 M-FVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREA-------L 322
             F   AGL+GLG G +SLV QLG      FSYCL S    ++  L+ G  A        
Sbjct: 190 DGFSQGAGLVGLGRGPLSLVSQLGLD---KFSYCLTSLDDTNNSPLLLGSLAGISEASAA 246

Query: 323 PVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTR 382
                  PL++NP  PSFYYV L  + VG  RI +    F +   G  GV++D+GT++T 
Sbjct: 247 ASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTGGVIVDSGTSITY 306

Query: 383 LPTPAYEAFRDAFVAQTGNLPRASGVSI-FDTCYN--LSGFVSVRVPTVSFYFSGGPVLT 439
           L    Y A + AF AQ   LP A G  +  D C+     G   V VP + F+F GG  L 
Sbjct: 307 LEVQGYRALKKAFAAQMA-LPAADGSGVGLDLCFRAPAKGVDQVEVPRLVFHFDGGADLD 365

Query: 440 LPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           LPA N+++    +G  C     S  GLSIIGN QQ+  Q  +D  +  + F P  C
Sbjct: 366 LPAENYMVLDGGSGALCLTVMGS-RGLSIIGNFQQQNFQFVYDVGHDTLSFAPVQC 420


>gi|61214232|sp|Q766C2.1|NEP2_NEPGR RecName: Full=Aspartic proteinase nepenthesin-2; AltName:
           Full=Nepenthesin-II; Flags: Precursor
 gi|41016423|dbj|BAD07475.1| aspartic proteinase nepenthesin II [Nepenthes gracilis]
          Length = 438

 Score =  251 bits (640), Expect = 7e-64,   Method: Compositional matrix adjust.
 Identities = 147/354 (41%), Positives = 202/354 (57%), Gaps = 20/354 (5%)

Query: 152 GSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSC 211
           G GEY + + +G+P  S   ++D+GSD++W QC+PC+QC+ Q  P+F+P DS+SFS + C
Sbjct: 92  GDGEYLMNVAIGTPDSSFSAIMDTGSDLIWTQCEPCTQCFSQPTPIFNPQDSSSFSTLPC 151

Query: 212 SSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGM 271
            S  C  L +  C+   C+Y   YGDGS T+G +A ET T   + V N+A GCG  NQG 
Sbjct: 152 ESQYCQDLPSETCNNNECQYTYGYGDGSTTQGYMATETFTFETSSVPNIAFGCGEDNQGF 211

Query: 272 FVG-AAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREA--LPVGAAW 328
             G  AGL+G+G G +SL  QLG    G FSYC+ S G+ S  +L  G  A  +P G+  
Sbjct: 212 GQGNGAGLIGMGWGPLSLPSQLG---VGQFSYCMTSYGSSSPSTLALGSAASGVPEGSPS 268

Query: 329 VPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAY 388
             L+ +   P++YY+ L G+ VGG  + I    F+L   G  G+++D+GT +T LP  AY
Sbjct: 269 TTLIHSSLNPTYYYITLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSGTTLTYLPQDAY 328

Query: 389 EAFRDAFVAQTGNLP----RASGVSIFDTCYNL-SGFVSVRVPTVSFYFSGGPVLTLPAS 443
            A   AF  Q  NLP     +SG+S   TC+   S   +V+VP +S  F GG VL L   
Sbjct: 329 NAVAQAFTDQI-NLPTVDESSSGLS---TCFQQPSDGSTVQVPEISMQFDGG-VLNLGEQ 383

Query: 444 NFLI-PVDDAGTFCFAFAPSPS-GLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           N LI P +  G  C A   S   G+SI GNIQQ+  Q+ +D  N  V F P  C
Sbjct: 384 NILISPAE--GVICLAMGSSSQLGISIFGNIQQQETQVLYDLQNLAVSFVPTQC 435


>gi|449436215|ref|XP_004135889.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
          Length = 496

 Score =  251 bits (640), Expect = 8e-64,   Method: Compositional matrix adjust.
 Identities = 139/408 (34%), Positives = 215/408 (52%), Gaps = 30/408 (7%)

Query: 108 FHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPR 167
           F  R+  D   V +L             H++ D    + SG    +  Y V +G+G   +
Sbjct: 97  FQNRIILDAINVNSLFSHFKSAIFPGQTHQLSDSQIPISSGARLQTLNYIVTVGIGG--Q 154

Query: 168 SQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAG 227
           +  +++D+GSD+ WVQC PC  CY Q +P+F+P++S+SF  + C+S  C  L+     +G
Sbjct: 155 NSTLIVDTGSDLTWVQCLPCRLCYNQQEPLFNPSNSSSFLSLPCNSPTCVALQPTAGSSG 214

Query: 228 --------RCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLL 279
                    C Y++ YGDGSY++G L  E LT+G+T + N   GCG  N+G+F GA+GL+
Sbjct: 215 LCSNKNSTSCDYQIDYGDGSYSRGELGFEKLTLGKTEIDNFIFGCGRNNKGLFGGASGLM 274

Query: 280 GLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFG-------REALPVGAAWVPLV 332
           GL    +SLV Q     G  FSYCL + G GSSGSL  G       +   P+  ++  ++
Sbjct: 275 GLARSELSLVSQTSSLFGSVFSYCLPTTGVGSSGSLTLGGADFSNFKNISPI--SYTRMI 332

Query: 333 RNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGV--VMDTGTAVTRLPTPAYEA 390
           +NP+  +FY++ L+G+ +GG+ + +     RL+   ++GV  ++D+GT +TRL    Y+A
Sbjct: 333 QNPQMSNFYFLNLTGISIGGVNLNVP----RLSS--NEGVLSLLDSGTVITRLSPSIYKA 386

Query: 391 FRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASN-FLIPV 449
           F+  F  Q        G SI +TC+NL+G+  V +PTV F F G   + +     F    
Sbjct: 387 FKAEFEKQFSGYRTTPGFSILNTCFNLTGYEEVNIPTVKFIFEGNAEMIVDVEGVFYFVK 446

Query: 450 DDAGTFCFAFAP--SPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
            DA   C AFA         IIGN QQ+  ++ ++     VGF    C
Sbjct: 447 SDASQICLAFASLGYEDQTMIIGNYQQKNQRVIYNSKESKVGFAGEPC 494


>gi|20975624|emb|CAD31717.1| putative nucleoid DNA-binding protein [Cicer arietinum]
          Length = 144

 Score =  249 bits (637), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 117/144 (81%), Positives = 132/144 (91%)

Query: 352 GMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIF 411
           G+R+PISED+FRL ++G+ GVVMDTGTAVTRLPT AY+AFRDAF+ QT NLPR+S VSIF
Sbjct: 1   GVRVPISEDVFRLNELGEGGVVMDTGTAVTRLPTAAYDAFRDAFIGQTTNLPRSSDVSIF 60

Query: 412 DTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGN 471
           DTCY+L GFVSVRVPT+SFYF GGP+LTLPA NFLIPV+D GTFCFAFAPSPSGLSIIGN
Sbjct: 61  DTCYDLYGFVSVRVPTISFYFLGGPILTLPARNFLIPVNDVGTFCFAFAPSPSGLSIIGN 120

Query: 472 IQQEGIQISFDGANGFVGFGPNVC 495
           IQQEGI+IS DG NGFVGFGPN+C
Sbjct: 121 IQQEGIEISVDGVNGFVGFGPNIC 144


>gi|449509162|ref|XP_004163513.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
          Length = 417

 Score =  249 bits (637), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 139/411 (33%), Positives = 216/411 (52%), Gaps = 30/411 (7%)

Query: 105 QHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGS 164
           +  F  R+  D   V +L             H++ D    + SG    +  Y V +G+G 
Sbjct: 15  EKIFQNRIILDAINVNSLFSHFKSAIFPGQTHQLSDSQIPISSGARLQTLNYIVTVGIGG 74

Query: 165 PPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGC 224
             ++  +++D+GSD+ WVQC PC  CY Q +P+F+P++S+SF  + C+S  C  L+    
Sbjct: 75  --QNSTLIVDTGSDLTWVQCLPCRLCYNQQEPLFNPSNSSSFLSLPCNSPTCVALQPTAG 132

Query: 225 HAG--------RCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAA 276
            +G         C Y++ YGDGSY++G L  E LT+G+T + N   GCG  N+G+F GA+
Sbjct: 133 SSGLCSNKNSTSCDYQIDYGDGSYSRGELGFEKLTLGKTEIDNFIFGCGRNNKGLFGGAS 192

Query: 277 GLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFG-------REALPVGAAWV 329
           GL+GL    +SLV Q     G  FSYCL + G GSSGSL  G       +   P+  ++ 
Sbjct: 193 GLMGLARSELSLVSQTSSLFGSVFSYCLPTTGVGSSGSLTLGGADFSNFKNISPI--SYT 250

Query: 330 PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGV--VMDTGTAVTRLPTPA 387
            +++NP+  +FY++ L+G+ +GG+ + +     RL+   ++GV  ++D+GT +TRL    
Sbjct: 251 RMIQNPQMSNFYFLNLTGISIGGVNLNVP----RLSS--NEGVLSLLDSGTVITRLSPSI 304

Query: 388 YEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASN-FL 446
           Y+AF+  F  Q        G SI +TC+NL+G+  V +PTV F F G   + +     F 
Sbjct: 305 YKAFKAEFEKQFSGYRTTPGFSILNTCFNLTGYEEVNIPTVKFIFEGNAEMIVDVEGVFY 364

Query: 447 IPVDDAGTFCFAFAP--SPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
               DA   C AFA         IIGN QQ+  ++ ++     VGF    C
Sbjct: 365 FVKSDASQICLAFASLGYEDQTMIIGNYQQKNQRVIYNSKESKVGFAGEPC 415


>gi|2541876|dbj|BAA22813.1| CND41, chloroplast nucleoid DNA binding protein [Nicotiana tabacum]
          Length = 502

 Score =  249 bits (637), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 138/366 (37%), Positives = 210/366 (57%), Gaps = 26/366 (7%)

Query: 147 SGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQ-CYKQSDPVFDPADSAS 205
           SG+  G+G Y V +G+G+P +   ++ D+GSD+ W QCQPC + CY Q  P+FDP+ S +
Sbjct: 145 SGLPLGTGNYIVNVGLGTPKKDLSLIFDTGSDLTWTQCQPCVKSCYAQQQPIFDPSTSKT 204

Query: 206 FSGVSCSSAVCDRLENA-----GCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT-VVKN 259
           +S +SC+SA C  L++A     GC +  C Y + YGD S+T G  A + LT+ +  V   
Sbjct: 205 YSNISCTSAACSSLKSATGNSPGCSSSNCVYGIQYGDSSFTIGFFAKDKLTLTQNDVFDG 264

Query: 260 VAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCL-VSRGTGSSGSLVFG 318
              GCG  N+G+F   AGL+GLG   +S+V Q   + G  FSYCL  SRG  S+G L FG
Sbjct: 265 FMFGCGQNNKGLFGKTAGLIGLGRDPLSIVQQTAQKFGKYFSYCLPTSRG--SNGHLTFG 322

Query: 319 R-------EALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDG 371
                   +A+  G  + P   + +  ++Y++ + G+ VGG  + IS  LF+     + G
Sbjct: 323 NGNGVKASKAVKNGITFTPFASS-QGTAYYFIDVLGISVGGKALSISPMLFQ-----NAG 376

Query: 372 VVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFY 431
            ++D+GT +TRLP+ AY + + AF       P A  +S+ DTCY+LS + S+ +P +SF 
Sbjct: 377 TIIDSGTVITRLPSTAYGSLKSAFKQFMSKYPTAPALSLLDTCYDLSNYTSISIPKISFN 436

Query: 432 FSGGPVLTLPASNFLIPVDDAGTFCFAFAPS--PSGLSIIGNIQQEGIQISFDGANGFVG 489
           F+G   + L  +  LI  + A   C AFA +     + I GNIQQ+ +++ +D A G +G
Sbjct: 437 FNGNANVELDPNGILI-TNGASQVCLAFAGNGDDDSIGIFGNIQQQTLEVVYDVAGGQLG 495

Query: 490 FGPNVC 495
           FG   C
Sbjct: 496 FGYKGC 501


>gi|225216995|gb|ACN85284.1| aspartic proteinase nepenthesin-1 precursor [Oryza australiensis]
          Length = 519

 Score =  249 bits (635), Expect = 3e-63,   Method: Compositional matrix adjust.
 Identities = 141/357 (39%), Positives = 203/357 (56%), Gaps = 16/357 (4%)

Query: 147 SGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQ-CYKQSDPVFDPADSAS 205
           SG   G+G Y V +G+G+P     +V D+GSD  WVQCQPC   CY+Q + +FDPA S++
Sbjct: 171 SGRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQREKLFDPARSST 230

Query: 206 FSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGR-TVVKNVAIGC 264
           ++ +SC++  C  L+  GC  G C Y V YGDGSY+ G  A++TLT+     VK    GC
Sbjct: 231 YANISCAAPACSDLDTRGCSGGNCLYGVQYGDGSYSIGFFAMDTLTLSSYDAVKGFRFGC 290

Query: 265 GHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPV 324
           G +N+G+F  AAGLLGLG G  SL  Q   + GG F++CL +R +G +G L FG  +   
Sbjct: 291 GERNEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARSSG-TGYLDFGPGSPAA 349

Query: 325 GAAWV--PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTR 382
             A +  P++ +   P+FYYVG++G+ VGG  + I + +F        G ++D+GT +TR
Sbjct: 350 AGARLTTPMLTD-NGPTFYYVGMTGIRVGGQLLSIPQSVFTTA-----GTIVDSGTVITR 403

Query: 383 LPTPAYEAFRDAFVAQTG--NLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTL 440
           LP  AY + R AF +        +A  VS+ DTCY+ +G   V +PTVS  F GG  L +
Sbjct: 404 LPPAAYSSLRSAFASAMAARGYKKAPAVSLLDTCYDFTGMSQVAIPTVSLLFQGGARLDV 463

Query: 441 PASNFLIPVDDAGTFCFAFAPSPSG--LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
            AS  +         C  FA +  G  + I+GN Q +   +++D     VGF P  C
Sbjct: 464 DASGIMYAA-SVSQVCLGFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 519


>gi|225217008|gb|ACN85295.1| aspartic proteinase nepenthesin-1 precursor [Oryza coarctata]
          Length = 500

 Score =  248 bits (634), Expect = 4e-63,   Method: Compositional matrix adjust.
 Identities = 154/428 (35%), Positives = 226/428 (52%), Gaps = 21/428 (4%)

Query: 77  ARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGA-DAAK 135
           +R  + +VHR    S     ++     H+    A    D  R  ++ RR+S        K
Sbjct: 85  SRTRMPIVHRHGPCSPLADAHDGKLPSHEEILAA----DQNRAKSIQRRVSTTTTVSRGK 140

Query: 136 HEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQ-CYKQS 194
            +         SG   G+G Y V IG+G+P     +V D+GSD  WVQC+PC   CYKQ 
Sbjct: 141 PKRNRPSLPASSGSALGTGNYVVTIGLGTPAGRYTVVFDTGSDTTWVQCEPCVVVCYKQQ 200

Query: 195 DPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGR 254
           + +FDPA S++++ +SC++  C  L   GC  G C Y V YGDGSY+ G  A++TLT+  
Sbjct: 201 EKLFDPARSSTYANISCAAPACSDLYIKGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSS 260

Query: 255 -TVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSG 313
              +K    GCG +N+G++  AAGLLGLG G  SL  Q   + GG F++C  +R +G +G
Sbjct: 261 YDAIKGFRFGCGERNEGLYGEAAGLLGLGRGKTSLPVQAYDKYGGVFAHCFPARSSG-TG 319

Query: 314 SLVFGREALPVGAAWV--PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDG 371
            L FG  +LP  +A +  P++ +   P+FYYVGL+G+ VGG  + I + +F  +     G
Sbjct: 320 YLDFGPGSLPAVSAKLTTPMLVD-NGPTFYYVGLTGIRVGGKLLSIPQSVFTTS-----G 373

Query: 372 VVMDTGTAVTRLPTPAYEAFRDAFVAQTGN--LPRASGVSIFDTCYNLSGFVSVRVPTVS 429
            ++D+GT +TRLP  AY + R AF +        +A  +S+ DTCY+ +G   V +PTVS
Sbjct: 374 TIVDSGTVITRLPPAAYSSLRSAFASAMAERGYKKAPALSLLDTCYDFTGMSEVAIPTVS 433

Query: 430 FYFSGGPVLTLPASNFLIPVDDAGTFCFAFA--PSPSGLSIIGNIQQEGIQISFDGANGF 487
             F GG  L + AS  +I        C  FA       + I+GN Q +   + +D     
Sbjct: 434 LLFQGGASLDVHASG-IIYAASVSQACLGFAGNKEDDDVGIVGNTQLKTFGVVYDIGKKV 492

Query: 488 VGFGPNVC 495
           VGF P  C
Sbjct: 493 VGFCPGAC 500


>gi|357137788|ref|XP_003570481.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 455

 Score =  248 bits (633), Expect = 6e-63,   Method: Compositional matrix adjust.
 Identities = 154/402 (38%), Positives = 216/402 (53%), Gaps = 24/402 (5%)

Query: 108 FHARMQRDVKRVATLVRRLSGGGADAAKH-EVQDFGTDVVS-----GMDQGSGEYFVRIG 161
           F A +  D  R+A+   RL+   + ++     Q  G+ + S     G   G G Y  R+G
Sbjct: 63  FSAVLTHDAARIASFAARLAKKSSPSSASATTQAAGSSLASVPLTPGTSVGVGNYVTRMG 122

Query: 162 VGSPPRSQYMVIDSGSDIVWVQCQPCS-QCYKQSDPVFDPADSASFSGVSCSSAVCDRLE 220
           +G+P +   MV+D+GS + W+QC PC   C++QS PVFDP  S+S++ VSCSS  CD L 
Sbjct: 123 LGTPAKPYIMVVDTGSSLTWLQCSPCRVSCHRQSGPVFDPKTSSSYAAVSCSSPQCDGLS 182

Query: 221 NAGCHAGRCR------YEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVG 274
            A  +   C       Y+ SYGD S++ G L+ +T++ G   V N   GCG  N+G+F  
Sbjct: 183 TATLNPAVCSPSNVCIYQASYGDSSFSVGYLSKDTVSFGANSVPNFYYGCGQDNEGLFGR 242

Query: 275 AAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRN 334
           +AGL+GL    +SL+ QL    G +FSYCL S  T SSG L  G    P G ++ P+V N
Sbjct: 243 SAGLMGLARNKLSLLYQLAPTLGYSFSYCLPS--TSSSGYLSIGSYN-PGGYSYTPMVSN 299

Query: 335 PRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDA 394
               S Y++ LSG+ V G  + +S      ++      ++D+GT +TRLPT  Y A   A
Sbjct: 300 TLDDSLYFISLSGMTVAGKPLAVSS-----SEYTSLPTIIDSGTVITRLPTSVYTALSKA 354

Query: 395 F-VAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAG 453
              A  G+  RA+  SI DTC+         VP VS  FSGG  L L A N L+ VD A 
Sbjct: 355 VAAAMKGSTKRAAAYSILDTCFEGQASKLRAVPAVSMAFSGGATLKLSAGNLLVDVDGAT 414

Query: 454 TFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           T C AFAP+ S  +IIGN QQ+   + +D  +  +GF    C
Sbjct: 415 T-CLAFAPARSA-AIIGNTQQQTFSVVYDVKSNRIGFAAAGC 454


>gi|225216949|gb|ACN85242.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
          Length = 519

 Score =  247 bits (631), Expect = 8e-63,   Method: Compositional matrix adjust.
 Identities = 145/357 (40%), Positives = 202/357 (56%), Gaps = 16/357 (4%)

Query: 147 SGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQ-CYKQSDPVFDPADSAS 205
           SG   G+G Y V +G+G+P     +V D+GSD  WVQCQPC   CY+Q + +FDPA S++
Sbjct: 171 SGRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQREKLFDPARSST 230

Query: 206 FSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGR-TVVKNVAIGC 264
           ++ VSC++  C  L   GC  G C Y V YGDGSY+ G  A++TLT+     VK    GC
Sbjct: 231 YANVSCAAPACSDLNIHGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYDAVKGFRFGC 290

Query: 265 GHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPV 324
           G +N+G+F  AAGLLGLG G  SL  Q   + GG F++CL +R TG +G L FG  +L  
Sbjct: 291 GERNEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARSTG-TGYLDFGAGSLAA 349

Query: 325 GAAWV--PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTR 382
             A +  P++     P+FYYVG++G+ VGG  + I + +F        G ++D+GT +TR
Sbjct: 350 ARARLTTPMLTE-NGPTFYYVGMTGIRVGGQLLSIPQSVFATA-----GTIVDSGTVITR 403

Query: 383 LPTPAYEAFR--DAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTL 440
           LP  AY + R   A         +A  VS+ DTCY+ +G   V +PTVS  F GG  L +
Sbjct: 404 LPPAAYSSLRYAFAAAMAARGYKKAPAVSLLDTCYDFTGMSQVAIPTVSLLFQGGARLDV 463

Query: 441 PASNFLIPVDDAGTFCFAFAPSPSG--LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
            AS  +     A   C AFA +  G  + I+GN Q +   +++D     VGF P  C
Sbjct: 464 DASGIMY-AASASQVCLAFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFYPGAC 519


>gi|409179880|gb|AFV26025.1| aspartic proteinase nepenthesin 2 [Nepenthes mirabilis]
          Length = 437

 Score =  246 bits (629), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 145/350 (41%), Positives = 199/350 (56%), Gaps = 13/350 (3%)

Query: 152 GSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSC 211
           GSGEY + + +G+P  S   ++D+GSD++W QC+PC+QC+ Q  P+F+P DS+SFS + C
Sbjct: 92  GSGEYLMNVAIGTPASSLSAIMDTGSDLIWTQCEPCTQCFSQPTPIFNPQDSSSFSTLPC 151

Query: 212 SSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGM 271
            S  C  L +  C+   C+Y   YGDGS T+G +A ET T   + V N+A GCG  NQG 
Sbjct: 152 ESQYCQDLPSESCY-NDCQYTYGYGDGSSTQGYMATETFTFETSSVPNIAFGCGEDNQGF 210

Query: 272 FVG-AAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREA--LPVGAAW 328
             G  AGL+G+G G +SL  QLG    G FSYC+ S G+ S  +L  G  A  +P G+  
Sbjct: 211 GQGNGAGLIGMGWGPLSLPSQLG---VGQFSYCMTSSGSSSPSTLALGSAASGVPEGSPS 267

Query: 329 VPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAY 388
             L+ +   P++YY+ L G+ VGG  + I    F+L   G  G+++D+GT +T LP  AY
Sbjct: 268 TTLIHSSLNPTYYYITLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSGTTLTYLPQDAY 327

Query: 389 EAFRDAFVAQTGNLPRASGVSIFDTCYNL-SGFVSVRVPTVSFYFSGGPVLTLPASNFLI 447
            A   AF  Q    P     S   TC+ L S   +V+VP +S  F GG VL L   N LI
Sbjct: 328 NAVAQAFTDQINLSPVDESSSGLSTCFQLPSDGSTVQVPEISMQFDGG-VLNLGEENVLI 386

Query: 448 -PVDDAGTFCFAF-APSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
            P +  G  C A  + S  G+SI GNIQQ+  Q+ +D  N  V F P  C
Sbjct: 387 SPAE--GVICLAMGSSSQQGISIFGNIQQQETQVLYDLQNLAVSFVPTQC 434


>gi|24430421|dbj|BAC22609.1| 41 kD chloroplast nucleoid DNA binding protein (CND41) [Nicotiana
           sylvestris]
          Length = 502

 Score =  246 bits (629), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 137/366 (37%), Positives = 208/366 (56%), Gaps = 26/366 (7%)

Query: 147 SGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQ-CYKQSDPVFDPADSAS 205
           SG+  G+G Y V +G+G+P +   ++ D+GSD+ W QCQPC + CY Q  P+FDP+ S +
Sbjct: 145 SGLPLGTGNYIVNVGLGTPKKDLSLIFDTGSDLTWTQCQPCVKSCYAQQQPIFDPSASKT 204

Query: 206 FSGVSCSSAVCDRLENA-----GCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT-VVKN 259
           +S +SC+S  C  L++A     GC +  C Y + YGD S+T G  A +TLT+ +  V   
Sbjct: 205 YSNISCTSTACSGLKSATGNSPGCSSSNCVYGIQYGDSSFTVGFFAKDTLTLTQNDVFDG 264

Query: 260 VAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCL-VSRGTGSSGSLVFG 318
              GCG  N+G+F   AGL+GLG   +S+V Q   + G  FSYCL  SRG  S+G L FG
Sbjct: 265 FMFGCGQNNRGLFGKTAGLIGLGRDPLSIVQQTAQKFGKYFSYCLPTSRG--SNGHLTFG 322

Query: 319 R-------EALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDG 371
                   +A+  G  + P   + +  +FY++ + G+ VGG  + IS  LF+     + G
Sbjct: 323 NGNGVKTSKAVKNGITFTPFASS-QGATFYFIDVLGISVGGKALSISPMLFQ-----NAG 376

Query: 372 VVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFY 431
            ++D+GT +TRLP+  Y + +  F       P A  +S+ DTCY+LS + S+ +P +SF 
Sbjct: 377 TIIDSGTVITRLPSTVYGSLKSTFKQFMSKYPTAPALSLLDTCYDLSNYTSISIPKISFN 436

Query: 432 FSGGPVLTLPASNFLIPVDDAGTFCFAFAPS--PSGLSIIGNIQQEGIQISFDGANGFVG 489
           F+G   + L  +  LI  + A   C AFA +     + I GNIQQ+ +++ +D A G +G
Sbjct: 437 FNGNANVDLEPNGILI-TNGASQVCLAFAGNGDDDTIGIFGNIQQQTLEVVYDVAGGQLG 495

Query: 490 FGPNVC 495
           FG   C
Sbjct: 496 FGYKGC 501


>gi|225217056|gb|ACN85339.1| aspartic proteinase nepenthesin-1 precursor [Oryza granulata]
          Length = 521

 Score =  246 bits (628), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 142/356 (39%), Positives = 196/356 (55%), Gaps = 14/356 (3%)

Query: 147 SGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQ-CYKQSDPVFDPADSAS 205
           SG   G+G Y V IG+G+P     +V D+GSD  WVQCQPC   CYKQ + +FDPA S++
Sbjct: 173 SGRALGTGNYVVTIGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYKQQEKLFDPARSST 232

Query: 206 FSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGR-TVVKNVAIGC 264
           ++ VSC++  C  L   GC  G C Y V YGDGSY+ G  A++TLT+     VK    GC
Sbjct: 233 YANVSCAAPACSDLYTRGCSGGHCLYSVQYGDGSYSIGFFAMDTLTLSSYDAVKGFRFGC 292

Query: 265 GHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGR-EALP 323
           G +N+G+F  AAGLLGLG G  SL  Q   + GG F++CL +R +G +G L FG      
Sbjct: 293 GERNEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARSSG-TGYLDFGPGSPAA 351

Query: 324 VGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRL 383
           VGA     +     P+FYYVG++G+ VGG  + I + +F        G ++D+GT +TRL
Sbjct: 352 VGARQTTPMLTDNGPTFYYVGMTGIRVGGQLLSIPQSVFSTA-----GTIVDSGTVITRL 406

Query: 384 PTPAYEAFRDAFVAQTG--NLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLP 441
           P  AY + R AF +        +A  +S+ DTCY+ +G   V +P VS  F GG  L + 
Sbjct: 407 PPAAYSSLRSAFASAMAARGYKKAPALSLLDTCYDFTGMSEVAIPKVSLLFQGGAYLDVN 466

Query: 442 ASNFLIPVDDAGTFCFAFAPSP--SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           AS  +         C  FA +     + I+GN Q +   + +D     VGF P  C
Sbjct: 467 ASGIMYAA-SLSQVCLGFAANEDDDDVGIVGNTQLKTFGVVYDIGKKTVGFSPGAC 521


>gi|326495920|dbj|BAJ90582.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 476

 Score =  246 bits (627), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 172/430 (40%), Positives = 225/430 (52%), Gaps = 43/430 (10%)

Query: 81  LELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQD 140
           L L HR   S++S             SF    + D +RV  + RR+SGGGA  AK  +Q 
Sbjct: 75  LRLAHRCGPSTAS------------ASFAEVQRADEQRVEYIQRRVSGGGARGAKGALQQ 122

Query: 141 FGT-----DVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQ--CYKQ 193
             T      V + M  G+ +Y V + +G+P  SQ + +D+GSD+ WVQC+PCS   C  Q
Sbjct: 123 LATGSRSATVPTTMGVGTFQYVVTVSLGTPGVSQTVEVDTGSDVSWVQCKPCSAPACNSQ 182

Query: 194 SDPVFDPADSASFSGVSCSSAVCD--RLENAGCHAGRCRYEVSYGDGSYTKGTLALETLT 251
            D +FDPA S+++S V C +  C   R+  AGC   +C Y VSYGDGS T G    +TL 
Sbjct: 183 RDQLFDPAKSSTYSAVPCGADACSELRIYEAGCSGSQCGYVVSYGDGSNTTGVYGSDTLA 242

Query: 252 IGR-TVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTG 310
           +     V     GCGH   GMF G  GLL LG  SMSL  Q  G  GG FSYCL S+ + 
Sbjct: 243 LAPGNTVGTFLFGCGHAQAGMFAGIDGLLALGRQSMSLKSQAAGAYGGVFSYCLPSKQS- 301

Query: 311 SSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDD 370
           ++G L  G  +   G A   L+    AP+FY V L+G+ VGG ++ +    F        
Sbjct: 302 AAGYLTLGGPSSASGFATTGLLTAWAAPTFYMVMLTGISVGGQQVAVPASAFA------G 355

Query: 371 GVVMDTGTAVTRLPTPAYEAFRDAF---VAQTGNLPRASGVSIFDTCYNLSGFVSVRVPT 427
           G V+DTGT +TRLP  AY A R AF   +A  G  P A    I DTCY+ S +  V +PT
Sbjct: 356 GTVVDTGTVITRLPPTAYAALRSAFRGAIAPCG-YPSAPANGILDTCYDFSRYGVVTLPT 414

Query: 428 VSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPS--PSGLSIIGNIQQEGIQISFDGAN 485
           V+  FSGG  L L A   L       + C AFAP+      +I+GN+QQ    + FDG+ 
Sbjct: 415 VALTFSGGATLALEAPGIL------SSGCLAFAPNGGDGDAAILGNVQQRSFAVRFDGST 468

Query: 486 GFVGFGPNVC 495
             VGF P  C
Sbjct: 469 --VGFMPGAC 476


>gi|326490656|dbj|BAJ89995.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 456

 Score =  245 bits (626), Expect = 3e-62,   Method: Compositional matrix adjust.
 Identities = 161/422 (38%), Positives = 224/422 (53%), Gaps = 28/422 (6%)

Query: 88  KMSSSSNTTNNMHYHRH----------QHSFHARMQRDVKRVATLVRRLSGGGADAAKHE 137
           K++ SS       +HRH            +    ++RD  R A + R+ SG    A   E
Sbjct: 49  KVAPSSGVVTVPLHHRHGPCSTVPSTNAPTLEDMLRRDQLRAAYITRKYSGVNGSAGDVE 108

Query: 138 VQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPV 197
             D       G    + EY + +G+GSP  +Q M+ID+GSD+ WVQC+PCSQC+ Q+D +
Sbjct: 109 GSDVTVPTTLGTSLDTLEYLITVGMGSPAVAQTMLIDTGSDVSWVQCKPCSQCHSQADSL 168

Query: 198 FDPADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVV 257
           FDP+ S+++S  SC+SA C +L   GC + +C+Y V YGDGS   GT + +TL +G + V
Sbjct: 169 FDPSSSSTYSAFSCTSAACAQLRQRGCSSSQCQYTVKYGDGSTGSGTYSSDTLALGSSTV 228

Query: 258 KNVAIGCGHKNQGMFV--GAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSL 315
           +N   GC     G  +    AGL+GLGGG+ SL  Q  G  G AFSYCL     GSSG L
Sbjct: 229 ENFQFGCSQSESGNLLQDQTAGLMGLGGGAESLATQTAGTFGKAFSYCLPPT-PGSSGFL 287

Query: 316 VFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMD 375
             G           P++R+ + PS+Y V L  + VGG ++ I    F        G +MD
Sbjct: 288 TLGASTSGF-VVKTPMLRSTQVPSYYGVLLQAIRVGGRQLNIPASAFSA------GSIMD 340

Query: 376 TGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGG 435
           +GT +TRLP  AY A   AF A     P A  + IFDTC++ SG  SV +PTV+  FSGG
Sbjct: 341 SGTIITRLPRTAYSALSSAFKAGMKQYPPAQPMGIFDTCFDFSGQSSVSIPTVALVFSGG 400

Query: 436 PVLTLPASNFLIPVDDAGTFCFAFAPSP--SGLSIIGNIQQEGIQISFDGANGFVGFGPN 493
            V+ L +   ++        C AFA +   + L IIGN+QQ   ++ +D   G VGF   
Sbjct: 401 AVVDLASDGIIL------GSCLAFAANSDDTSLGIIGNVQQRTFEVLYDVGGGAVGFKAG 454

Query: 494 VC 495
            C
Sbjct: 455 AC 456


>gi|326531454|dbj|BAJ97731.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 476

 Score =  245 bits (626), Expect = 4e-62,   Method: Compositional matrix adjust.
 Identities = 172/430 (40%), Positives = 224/430 (52%), Gaps = 43/430 (10%)

Query: 81  LELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQD 140
           L L HR   S++S             SF    + D +RV  + RR+SGGGA  AK  +Q 
Sbjct: 75  LRLAHRCGPSTAS------------ASFAEVQRADEQRVEYIQRRVSGGGARGAKGALQQ 122

Query: 141 FGT-----DVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQ--CYKQ 193
             T      V + M  G+ +Y V + +G+P  SQ + +D+GSD+ WVQC+PCS   C  Q
Sbjct: 123 LATGSRSATVPTTMGVGTFQYVVTVSLGTPGVSQTVEVDTGSDVSWVQCKPCSAPACNSQ 182

Query: 194 SDPVFDPADSASFSGVSCSSAVCD--RLENAGCHAGRCRYEVSYGDGSYTKGTLALETLT 251
            D +FDPA S+++S V C +  C   R+  AGC   +C Y VSYGDGS T G    +TL 
Sbjct: 183 RDQLFDPAKSSTYSAVPCGADACSELRIYEAGCSGSQCGYVVSYGDGSNTTGVYGSDTLA 242

Query: 252 IGR-TVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTG 310
           +     V     GCGH   GMF G  GLL LG  SMSL  Q  G  GG FSYCL S+ + 
Sbjct: 243 LAPGNTVGTFLFGCGHAQAGMFAGIDGLLALGRQSMSLKSQAAGAYGGVFSYCLPSKQS- 301

Query: 311 SSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDD 370
           ++G L  G      G A   L+    AP+FY V L+G+ VGG ++ +    F        
Sbjct: 302 AAGYLTLGGPTSASGFATTGLLTAWAAPTFYMVMLTGISVGGQQVAVPASAFA------G 355

Query: 371 GVVMDTGTAVTRLPTPAYEAFRDAF---VAQTGNLPRASGVSIFDTCYNLSGFVSVRVPT 427
           G V+DTGT +TRLP  AY A R AF   +A  G  P A    I DTCY+ S +  V +PT
Sbjct: 356 GTVVDTGTVITRLPPTAYAALRSAFRGAIAPYG-YPSAPANGILDTCYDFSRYGVVTLPT 414

Query: 428 VSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPS--PSGLSIIGNIQQEGIQISFDGAN 485
           V+  FSGG  L L A   L       + C AFAP+      +I+GN+QQ    + FDG+ 
Sbjct: 415 VALTFSGGATLALEAPGIL------SSGCLAFAPNGGDGDAAILGNVQQRSFAVRFDGST 468

Query: 486 GFVGFGPNVC 495
             VGF P  C
Sbjct: 469 --VGFMPGAC 476


>gi|168000296|ref|XP_001752852.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162696015|gb|EDQ82356.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 384

 Score =  244 bits (624), Expect = 5e-62,   Method: Compositional matrix adjust.
 Identities = 152/394 (38%), Positives = 214/394 (54%), Gaps = 23/394 (5%)

Query: 112 MQRDVKRVATLVRRLSGGGADAAKHEVQDFGT-DVVSGMDQGSGEYFVRIGVGSPPRSQY 170
           +QR  +RVA    +LS    DA       FG+ +  S +  G+GEY + + +GSPP+S  
Sbjct: 4   VQRSHERVAFYTLKLS---PDA-------FGSQEFQSPVKAGNGEYLMTLTLGSPPQSFD 53

Query: 171 MVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCD--RLENAGCHAGR 228
           +++D+GSD+ WVQC PC  CY+Q  P FDP+ S SF   +C+  +C+   L    C A  
Sbjct: 54  VIVDTGSDLNWVQCLPCRVCYQQPGPKFDPSKSRSFRKAACTDNLCNVSALPLKACAANV 113

Query: 229 CRYEVSYGDGSYTKGTLALETLTI----GRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGG 284
           C+Y+ +YGD S T G LA ET+++    G   V N A GCG +N G F GAAGL+GLG G
Sbjct: 114 CQYQYTYGDQSNTNGDLAFETISLNNGAGTQSVPNFAFGCGTQNLGTFAGAAGLVGLGQG 173

Query: 285 SMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVG 344
            +SL  QL       FSYCLVS  + S+  L FG  A      +  +V N R P++YYV 
Sbjct: 174 PLSLNSQLSHTFANKFSYCLVSLNSLSASPLTFGSIAAAANIQYTSIVVNARHPTYYYVQ 233

Query: 345 LSGLGVGGMRIPISEDLFRLTQ-MGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLP 403
           L+ + VGG  + ++  +F + Q  G  G ++D+GT +T L  PAY A   A+ +   N P
Sbjct: 234 LNSIEVGGQPLNLAPSVFAIDQSTGRGGTIIDSGTTITMLTLPAYSAVLRAYESFV-NYP 292

Query: 404 RASGVSI-FDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVD-DAGTFCFAFAP 461
           R  G +   D C+N++G  +  VP + F F G     +   N  + VD  A T C A   
Sbjct: 293 RLDGSAYGLDLCFNIAGVSNPSVPDMVFKFQGAD-FQMRGENLFVLVDTSATTLCLAMGG 351

Query: 462 SPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           S  G SIIGNIQQ+   + +D     +GF    C
Sbjct: 352 S-QGFSIIGNIQQQNHLVVYDLEAKKIGFATADC 384


>gi|24417232|gb|AAN60226.1| unknown [Arabidopsis thaliana]
          Length = 464

 Score =  244 bits (624), Expect = 6e-62,   Method: Compositional matrix adjust.
 Identities = 147/386 (38%), Positives = 214/386 (55%), Gaps = 12/386 (3%)

Query: 112 MQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYM 171
           ++RD  RV ++  +LS   A+    E +       SG+  GSG Y V IG+G+P     +
Sbjct: 89  IRRDQARVESIYSKLSKNSANEVS-EAKSTELPAKSGITLGSGNYIVTIGIGTPKHDLSL 147

Query: 172 VIDSGSDIVWVQCQPC-SQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCR 230
           V D+GSD+ W QC+PC   CY Q +P F+P+ S+++  VSCSS +C+  E+  C A  C 
Sbjct: 148 VFDTGSDLTWTQCEPCLGSCYSQKEPKFNPSSSSTYQNVSCSSPMCEDAES--CSASNCV 205

Query: 231 YEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLV 289
           Y + YGD S+T+G LA E  T+  + V+++V  GCG  NQG+F G AGLLGLG G +SL 
Sbjct: 206 YSIGYGDKSFTQGFLAKEKFTLTNSDVLEDVYFGCGENNQGLFDGVAGLLGLGPGKLSLP 265

Query: 290 GQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLG 349
            Q        FSYCL S  + S+G L FG   +     + P+   P A + Y + + G+ 
Sbjct: 266 AQTTTTYNNIFSYCLPSFTSNSTGHLTFGSAGISESVKFTPISSFPSAFN-YGIDIIGIS 324

Query: 350 VGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVS 409
           VG   + I+ + F       +G ++D+GT  TRLPT  Y   R  F  +  +    SG  
Sbjct: 325 VGDKELAITPNSFST-----EGAIIDSGTVFTRLPTKVYAELRSVFKEKMSSYKSTSGYG 379

Query: 410 IFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSII 469
           +FDTCY+ +G  +V  PT++F F+GG V+ L  S   +P+      C AFA +    +I 
Sbjct: 380 LFDTCYDFTGLDTVTYPTIAFSFAGGTVVELDGSGISLPI-KISQVCLAFAGNDDLPAIF 438

Query: 470 GNIQQEGIQISFDGANGFVGFGPNVC 495
           GN+QQ  + + +D A G VGF PN C
Sbjct: 439 GNVQQTTLDVVYDVAGGRVGFAPNGC 464


>gi|294461757|gb|ADE76437.1| unknown [Picea sitchensis]
          Length = 325

 Score =  244 bits (623), Expect = 6e-62,   Method: Compositional matrix adjust.
 Identities = 138/335 (41%), Positives = 186/335 (55%), Gaps = 21/335 (6%)

Query: 170 YMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLE--NAGCHAG 227
           +++ID+GSDI W+QC PC QCYKQ D +F PA SA++  + C+S +C +L+  +  C   
Sbjct: 2   FLLIDTGSDITWIQCDPCPQCYKQQDSLFQPAGSATYKPLPCNSTMCQQLQSFSHSCLNS 61

Query: 228 RCRYEVSYGDGSYTKGTLALETLTIGR-----TVVKNVAIGCGHKNQGMFVGAAGLLGLG 282
            C Y VSYGD S T+G  ALETLT+         V N A GCGH N+G+F GAAGL+GLG
Sbjct: 62  SCNYMVSYGDKSTTRGDFALETLTLRSDDTILVSVPNFAFGCGHANKGLFNGAAGLMGLG 121

Query: 283 GGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREA-LPVGAAWVPLVRNPRAPSF 340
             S+    Q     G  FSYCL S   T  SG L FG  A L     + PLV +   PS 
Sbjct: 122 KSSIGFPAQTSVAFGKVFSYCLPSVSSTIPSGILHFGEAAMLDYDVRFTPLVDSSSGPSQ 181

Query: 341 YYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTG 400
           Y+V ++G+ VG   +PIS             V++D+GT ++R    AYE  RDAF     
Sbjct: 182 YFVSMTGINVGDELLPISA-----------TVMVDSGTVISRFEQSAYERLRDAFTQILP 230

Query: 401 NLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFA 460
            L  A  V+ FDTC+ +S    + +P ++ +F     L L   + L PVDD G  CFAFA
Sbjct: 231 GLQTAVSVAPFDTCFRVSTVDDINIPLITLHFRDDAELRLSPVHILYPVDD-GVMCFAFA 289

Query: 461 PSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           PS SG S++GN QQ+ ++  +D     +G     C
Sbjct: 290 PSSSGRSVLGNFQQQNLRFVYDIPKSRLGISAFEC 324


>gi|224126751|ref|XP_002329464.1| predicted protein [Populus trichocarpa]
 gi|222870144|gb|EEF07275.1| predicted protein [Populus trichocarpa]
          Length = 439

 Score =  244 bits (622), Expect = 8e-62,   Method: Compositional matrix adjust.
 Identities = 151/414 (36%), Positives = 227/414 (54%), Gaps = 25/414 (6%)

Query: 99  MHYHRHQHSFHARMQR-DVKRVATLVRRLSGGGADAAKHEVQDF---------GTDVVSG 148
           + + + Q+ F A+++  D  +  T   R+  G     +H +Q F          +++ + 
Sbjct: 31  LEHPKVQNGFRAKLKHVDSGKNLTKFERIQHG-VKRGRHRLQRFKAMALVASSNSEIDAP 89

Query: 149 MDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSG 208
           +  G+GE+ +++ +G+PP +   ++D+GSD++W QC+PC+QC+ Q  P+FDP  S+SFS 
Sbjct: 90  VLPGNGEFLMKLAIGTPPETYSAIMDTGSDLIWTQCKPCTQCFDQPTPIFDPKKSSSFSK 149

Query: 209 VSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKN 268
           +SCSS +C+ L  + C  G C Y   YGD S T+G LA ETLT G+  V  VA GCG  N
Sbjct: 150 LSCSSKLCEALPQSTCSDG-CEYLYGYGDYSSTQGMLASETLTFGKVSVPEVAFGCGEDN 208

Query: 269 QGM-FVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGR----EALP 323
           +G  F   +GL+GLG G +SLV QL       FSYCL S     + +L+ G     +A  
Sbjct: 209 EGSGFSQGSGLVGLGRGPLSLVSQLKEP---KFSYCLTSVDDTKASTLLMGSLASVKASD 265

Query: 324 VGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRL 383
                 PL++N   PSFYY+ L G+ VG   +PI +  F L + G  G+++D+GT +T L
Sbjct: 266 SEIKTTPLIQNSAQPSFYYLSLEGISVGDTSLPIKKSTFSLQEDGSGGLIIDSGTTITYL 325

Query: 384 PTPAYEAFRDAFVAQTGNLP-RASGVSIFDTCYNL-SGFVSVRVPTVSFYFSGGPVLTLP 441
              A++     F +Q  NLP   SG +  + C+ L SG   + VP + F+F G   L LP
Sbjct: 326 EQSAFDLVAKEFTSQI-NLPVDNSGSTGLEVCFTLPSGSTDIEVPKLVFHFDGAD-LELP 383

Query: 442 ASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           A N++I     G  C A   S SG+SI GNIQQ+ + +  D     + F P  C
Sbjct: 384 AENYMIADASMGVACLAMG-SSSGMSIFGNIQQQNMLVLHDLEKETLSFLPTQC 436


>gi|413923780|gb|AFW63712.1| hypothetical protein ZEAMMB73_689747 [Zea mays]
          Length = 470

 Score =  244 bits (622), Expect = 9e-62,   Method: Compositional matrix adjust.
 Identities = 145/395 (36%), Positives = 219/395 (55%), Gaps = 27/395 (6%)

Query: 112 MQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVS-----GMDQGSGEYFVRIGVGSPP 166
           ++ D +R   ++RR+SG GA     ++ D+     +     G D G+  Y V   +G+P 
Sbjct: 92  LRADQRRAEHILRRVSGRGAP----QLWDYKAAAATVPANWGYDIGTSNYVVTASLGTPG 147

Query: 167 RSQYMVIDSGSDIVWVQCQPCS--QCYKQSDPVFDPADSASFSGVSCSSAVCDRL--ENA 222
            +Q + +D+GSD+ WVQC+PC+   CY+Q DP+FDPA S+S++ V C  + C  L    +
Sbjct: 148 MAQTLEVDTGSDLSWVQCKPCAAPSCYRQKDPLFDPAQSSSYAAVPCGRSACAGLGIYAS 207

Query: 223 GCHAGRCRYEVSYGDGSYTKGTLALETLTIG-RTVVKNVAIGCGH-KNQGMFVGAAGLLG 280
            C A +C Y VSYGDGS T G  + +TLT+     V+    GCGH ++ G+F G  GLLG
Sbjct: 208 ACSAAQCGYVVSYGDGSNTTGVYSSDTLTLAANATVQGFLFGCGHAQSGGLFTGIDGLLG 267

Query: 281 LGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSF 340
            G    SLV Q  G  GG FSYCL ++ + +    + G   +  G +   L+ +P AP++
Sbjct: 268 FGREQPSLVQQTAGAYGGVFSYCLPTKSSTTGYLTLGGPSGVAPGFSTTQLLPSPNAPTY 327

Query: 341 YYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTG 400
           Y V L+G+ VGG  + +    F        G V+DTGT +TRLP  AY A R AF +   
Sbjct: 328 YVVMLTGISVGGQPLSVPASAFA------AGTVVDTGTVITRLPPAAYAALRSAFRSGMA 381

Query: 401 NLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFA 460
           + P A  + I DTCY+ +G+ +V + +V+  FS G  +TL A   +      G   FA +
Sbjct: 382 SYPSAPPIGILDTCYSFAGYGTVNLTSVALTFSSGATMTLGADGIM----SFGCLAFASS 437

Query: 461 PSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
            S   ++I+GN+QQ   ++  DG++  VGF P+ C
Sbjct: 438 GSDGSMAILGNVQQRSFEVRIDGSS--VGFRPSSC 470


>gi|168059885|ref|XP_001781930.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162666576|gb|EDQ53226.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 355

 Score =  244 bits (622), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 137/355 (38%), Positives = 201/355 (56%), Gaps = 17/355 (4%)

Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSS 213
           GEY   + +G+P R   +++D+GSD+ WVQC PC  CY Q+D +F P  S SF+ ++C +
Sbjct: 1   GEYLATVRLGTPERVFSVIVDTGSDLTWVQCSPCGTCYSQNDSLFIPNTSTSFTKLACGT 60

Query: 214 AVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIG-----RTVVKNVAIGCGHKN 268
            +C+ L    C+   C Y  SYGDGS + G    +T+T+      +  V N A GCGH N
Sbjct: 61  ELCNGLPYPMCNQTTCVYWYSYGDGSLSTGDFVYDTITMDGINGQKQQVPNFAFGCGHDN 120

Query: 269 QGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS--RGTGSSGSLVFGREALPV-- 324
           +G F GA G+LGLG G +S   QL     G FSYCLV        +  L+FG  A+P   
Sbjct: 121 EGSFAGADGILGLGQGPLSFPSQLKTVFNGKFSYCLVDWLAPPTQTSPLLFGDAAVPTFP 180

Query: 325 GAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLP 384
           G  ++ L+ NP+ P++YYV L+G+ VGG  + IS   F +  +G  G + D+GT VT+L 
Sbjct: 181 GVKYISLLTNPKVPTYYYVKLNGISVGGKLLNISSTAFDIDSVGRAGTIFDSGTTVTQLA 240

Query: 385 TPAYEAFRDAFVAQTGNLPRASGVSI-FDTCYNLSGFVSVRVPTV---SFYFSGGPVLTL 440
              ++    A  A T + PR S  S   D C  L GF   ++PTV   +F+F GG  + L
Sbjct: 241 GEVHQEVLAAMNASTMDYPRKSDDSSGLDLC--LGGFAEGQLPTVPSMTFHFEGGD-MEL 297

Query: 441 PASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           P SN+ I ++ + ++CF+   SP  ++IIG+IQQ+  Q+ +D     +GF P  C
Sbjct: 298 PPSNYFIFLESSQSYCFSMVSSPD-VTIIGSIQQQNFQVYYDTVGRKIGFVPKSC 351


>gi|356524287|ref|XP_003530761.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 481

 Score =  243 bits (621), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 154/431 (35%), Positives = 226/431 (52%), Gaps = 23/431 (5%)

Query: 78  RWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGG-GADAAKH 136
           + +LE+VH+    S  N        +   S    M  D +RV  +  RLS   G + +  
Sbjct: 60  KASLEVVHKHGPCSQLNHNGKA---KTTISHTDIMNLDNERVKYIQSRLSKNLGRENSVK 116

Query: 137 EVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS-QCYKQSD 195
           E+        SG   GS  YFV +G+G+P R   +V D+GSD+ W QC+PC+  CYKQ D
Sbjct: 117 ELDSTTLPAKSGSLIGSANYFVVVGLGTPKRDLSLVFDTGSDLTWTQCEPCAGSCYKQQD 176

Query: 196 PVFDPADSASFSGVSCSSAVCDRLENAGCHA------GRCRYEVSYGDGSYTKGTLALET 249
            +FDP+ S+S+  ++C+S++C +L +AG  +        C Y + YGD S + G L+ E 
Sbjct: 177 AIFDPSKSSSYINITCTSSLCTQLTSAGIKSRCSSSTTACIYGIQYGDKSTSVGFLSQER 236

Query: 250 LTIGRT-VVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRG 308
           LTI  T +V +   GCG  N+G+F G+AGL+GLG   +S V Q        FSYCL S  
Sbjct: 237 LTITATDIVDDFLFGCGQDNEGLFSGSAGLIGLGRHPISFVQQTSSIYNKIFSYCLPSTS 296

Query: 309 TGSSGSLVFGREALP-VGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIP-ISEDLFRLTQ 366
           + S G L FG  A       + PL       +FY + + G+ VGG ++P +S   F    
Sbjct: 297 S-SLGHLTFGASAATNANLKYTPLSTISGDNTFYGLDIVGISVGGTKLPAVSSSTFSA-- 353

Query: 367 MGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVP 426
               G ++D+GT +TRL   AY A R AF       P A+   +FDTCY+ SG+  + VP
Sbjct: 354 ---GGSIIDSGTVITRLAPTAYAALRSAFRQGMEKYPVANEDGLFDTCYDFSGYKEISVP 410

Query: 427 TVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAP--SPSGLSIIGNIQQEGIQISFDGA 484
            + F F+GG  + LP    LI    A   C AFA   + + ++I GN+QQ+ +++ +D  
Sbjct: 411 KIDFEFAGGVTVELPLVGILIG-RSAQQVCLAFAANGNDNDITIFGNVQQKTLEVVYDVE 469

Query: 485 NGFVGFGPNVC 495
            G +GFG   C
Sbjct: 470 GGRIGFGAAGC 480


>gi|223949775|gb|ACN28971.1| unknown [Zea mays]
 gi|414590177|tpg|DAA40748.1| TPA: hypothetical protein ZEAMMB73_257146 [Zea mays]
          Length = 510

 Score =  243 bits (620), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 163/448 (36%), Positives = 232/448 (51%), Gaps = 41/448 (9%)

Query: 85  HRDKMSSSSNTTNNMHYHR-------HQHSFHARMQRDVKRVATLVRRLSGGG------- 130
            + + +SSS +      HR        + SF  + ++D  R+ T+ RR +  G       
Sbjct: 64  EQKQPASSSPSLQLRMKHRSAEGGRTRKESFLDKAEKDAVRIETMHRRAARSGVARMPAS 123

Query: 131 ADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQC 190
           +   +   +     V SG+  GSGEY + + VG+PPR   M++D+GSD+ W+QC PC  C
Sbjct: 124 SSPRRALSERMVATVESGVAVGSGEYLIDVYVGTPPRRFRMIMDTGSDLNWLQCAPCLDC 183

Query: 191 YKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGR---------CRYEVSYGDGSYT 241
           ++Q  PVFDPA S+S+  V+C    C  +  A   A R         C Y   YGD S T
Sbjct: 184 FEQRGPVFDPAASSSYRNVTCGDQRCGLV--APPEAPRACRRPAEDSCPYYYWYGDQSNT 241

Query: 242 KGTLALETLTIGRTV------VKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQ 295
            G LALE+ T+  T       V  V  GCGH+N+G+F GAAGLLGLG G +S   QL   
Sbjct: 242 TGDLALESFTVNLTAPGASRRVDGVVFGCGHRNRGLFHGAAGLLGLGRGPLSFASQLRAV 301

Query: 296 TGGAFSYCLVSRGTGSSGSLVFGREALPVG------AAWVPLVRNPRAPSFYYVGLSGLG 349
            G  FSYCLV  G+ +   +VFG + L +        A+ P   +P A +FYYV L G+ 
Sbjct: 302 YGHTFSYCLVEHGSDAGSKVVFGEDYLVLAHPQLKYTAFAP-TSSP-ADTFYYVKLKGVL 359

Query: 350 VGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNL-PRASGV 408
           VGG  + IS D + + + G  G ++D+GT ++    PAY+  R AFV     L P     
Sbjct: 360 VGGDLLNISSDTWDVGKDGSGGTIIDSGTTLSYFVEPAYQVIRQAFVDLMSRLYPLIPDF 419

Query: 409 SIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSP-SGLS 467
            + + CYN+SG     VP +S  F+ G V   PA N+ + +D  G  C A   +P +G+S
Sbjct: 420 PVLNPCYNVSGVERPEVPELSLLFADGAVWDFPAENYFVRLDPDGIMCLAVRGTPRTGMS 479

Query: 468 IIGNIQQEGIQISFDGANGFVGFGPNVC 495
           IIGN QQ+   + +D  N  +GF P  C
Sbjct: 480 IIGNFQQQNFHVVYDLQNNRLGFAPRRC 507


>gi|326532056|dbj|BAK01404.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 506

 Score =  243 bits (620), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 154/410 (37%), Positives = 219/410 (53%), Gaps = 32/410 (7%)

Query: 113 QRDVKRVATLVRRLSGGGADAAKHEV-------QDFGTDVVSGMDQGSGEYFVRIGVGSP 165
           ++D  R+ T+ RR +  G+ AA+ +        +     V SG+  GSGEY V + +G+P
Sbjct: 99  EKDAVRIDTMHRRAALSGSAAARRDSAPRRALSERVVATVESGVPVGSGEYLVDVYLGTP 158

Query: 166 PRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCH 225
           PR   M++D+GSD+ W+QC PC  C++QS P+FDPA S S+  V+C    C  +      
Sbjct: 159 PRRFRMIMDTGSDLNWLQCAPCLDCFEQSGPIFDPAASISYRNVTCGDDRCRLVSPPAES 218

Query: 226 AGR---------CRYEVSYGDGSYTKGTLALETLTI-----GRTVVKNVAIGCGHKNQGM 271
           A R         C Y   YGD S T G LALE  T+     G   V  VA GCGH+N+G+
Sbjct: 219 APRECRRPRSDPCPYYYWYGDQSNTTGDLALEAFTVNLTQSGTRRVDGVAFGCGHRNRGL 278

Query: 272 FVGAAGLLGLGGGSMSLVGQLGGQTGG-AFSYCLVSRGTGSSGSLVFGREALPVGAA--- 327
           F GAAGLLGLG G +S   QL G  GG AFSYCLV  G+ +   ++FG +   +      
Sbjct: 279 FHGAAGLLGLGRGPLSFASQLRGVYGGHAFSYCLVEHGSAAGSKIIFGHDDALLAHPQLN 338

Query: 328 WVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPA 387
           +        A +FYY+ L  + VGG  + IS D      +   G ++D+GT ++  P PA
Sbjct: 339 YTAFAPTTDADTFYYLQLKSILVGGEAVNISSD-----TLSAGGTIIDSGTTLSYFPEPA 393

Query: 388 YEAFRDAFVAQ-TGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFL 446
           Y+A R AF+ + + + P   G  +   CYN+SG   V VP +S  F+ G     PA N+ 
Sbjct: 394 YQAIRQAFIDRMSPSYPLILGFPVLSPCYNVSGAEKVEVPELSLVFADGAAWEFPAENYF 453

Query: 447 IPVDDAGTFCFAFAPSP-SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           I ++  G  C A   +P SG+SIIGN QQ+   + +D  +  +GF P  C
Sbjct: 454 IRLEPEGIMCLAVLGTPRSGMSIIGNYQQQNFHVLYDLEHNRLGFAPRRC 503


>gi|226504334|ref|NP_001141706.1| uncharacterized protein LOC100273835 precursor [Zea mays]
 gi|194705620|gb|ACF86894.1| unknown [Zea mays]
 gi|414885968|tpg|DAA61982.1| TPA: hypothetical protein ZEAMMB73_231717 [Zea mays]
          Length = 477

 Score =  243 bits (620), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 142/393 (36%), Positives = 211/393 (53%), Gaps = 15/393 (3%)

Query: 112 MQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYM 171
           + RD  RV  + R+++     A+  + +     V  G    +  YF  + +G+P     +
Sbjct: 90  LGRDQDRVDAIRRKVAAVTTAASSSKPKGVPLQVGWGKYLDTTNYFTSLRLGTPATDLLV 149

Query: 172 VIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHA----G 227
            +D+GSD  W+QC+PC  CY+Q + +FDP+ S+++S ++CSS  C  L ++  H      
Sbjct: 150 ELDTGSDQSWIQCKPCPDCYEQHEALFDPSKSSTYSDITCSSRECQELGSSHKHNCSSDK 209

Query: 228 RCRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQGMFVGAAGLLGLGGGSM 286
           +C YE++Y D SYT G LA +TLT+  T  V     GCGH N G F    GLLGLG G  
Sbjct: 210 KCPYEITYADDSYTVGNLARDTLTLSPTDAVPGFVFGCGHNNAGSFGEIDGLLGLGRGKA 269

Query: 287 SLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFG--REALPVGAAWVPLVRNPRAPSFYYVG 344
           SL  Q+  + G  FSYCL S  + ++G L F     A P  A +  +V   + PSFYY+ 
Sbjct: 270 SLSSQVAARYGAGFSYCLPSSPS-ATGYLSFSGAAAAAPTNAQFTEMVAG-QHPSFYYLN 327

Query: 345 LSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPR 404
           L+G+ V G  I +   +F        G ++D+GTA + LP  AY A R +  +  G   R
Sbjct: 328 LTGITVAGRAIKVPPSVFATAA----GTIIDSGTAFSCLPPSAYAALRSSVRSAMGRYKR 383

Query: 405 ASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSP- 463
           A   +IFDTCY+L+G  +VR+P+V+  F+ G  + L  S  L    +    C AF P+P 
Sbjct: 384 APSSTIFDTCYDLTGHETVRIPSVALVFADGATVHLHPSGVLYTWSNVSQTCLAFLPNPD 443

Query: 464 -SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
            + L ++GN QQ  + + +D  N  VGFG N C
Sbjct: 444 DTSLGVLGNTQQRTLAVIYDVDNQKVGFGANGC 476


>gi|388504358|gb|AFK40245.1| unknown [Medicago truncatula]
          Length = 480

 Score =  243 bits (620), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 152/469 (32%), Positives = 243/469 (51%), Gaps = 36/469 (7%)

Query: 46  GSRTDHAKMSQYNELFERHNNISSSNTSSDEARWN---LELVHRDKMSSSSNTTNNMHYH 102
           G   +  KM +  ++ +R++   S      E+R     + L  +D+   S    N     
Sbjct: 26  GCELEQKKMFKV-QMLQRNHQFGSKGCILPESRKEKGAIVLEMKDRGYCSERKINWNRKL 84

Query: 103 RHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGV 162
           + Q  F     R ++    +  ++SG  +     E+Q     + SG++  +  Y V IG+
Sbjct: 85  QKQLIFDDLRVRSMQN--RIRAKVSGHNSSEQSSEIQ---IPLASGINLETLNYIVTIGL 139

Query: 163 GSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLE-- 220
           G+  ++  ++ID+GSD+ WVQC PC  CY Q  PVF+P++S+S++ + C+S+ C  L+  
Sbjct: 140 GN--QNMTVIIDTGSDLTWVQCDPCMSCYSQQGPVFNPSNSSSYNSLLCNSSTCQNLQFT 197

Query: 221 ---NAGCHAGR---CRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVG 274
                 C +     C + VSYGDGS+T G L +E L+ G   V N   GCG  N+G+F G
Sbjct: 198 TGNTEACESNNPSSCNHTVSYGDGSFTDGELGVEHLSFGGISVSNFVFGCGRNNKGLFGG 257

Query: 275 AAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREA------LPVGAAW 328
            +G++GLG  ++S++ Q     GG FSYCL +  +G+SGSLV G E+       P+  A+
Sbjct: 258 VSGIMGLGRSNLSMISQTNTTFGGVFSYCLPTTDSGASGSLVIGNESSLFKNLTPI--AY 315

Query: 329 VPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAY 388
             +V NP+  +FY + L+G+ VGG+ I       + T  G+ G+++D+GT +TRL    Y
Sbjct: 316 TSMVSNPQLSNFYVLNLTGIDVGGVAI-------QDTSFGNGGILIDSGTVITRLAPSLY 368

Query: 389 EAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIP 448
            A +  F+ Q    P A  +SI DTC+NL+G   V +PT+S +F     L + A   L  
Sbjct: 369 NALKAEFLKQFSGYPIAPALSILDTCFNLTGIEEVSIPTLSMHFENNVDLNVDAVGILYM 428

Query: 449 VDDAGTFCFAFA--PSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
             D    C A A     + ++IIGN QQ   ++ +D     +GF    C
Sbjct: 429 PKDGSQVCLALASLSDENDMAIIGNYQQRNQRVIYDAKQSKIGFAREDC 477


>gi|225216973|gb|ACN85264.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
          Length = 517

 Score =  243 bits (619), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 144/357 (40%), Positives = 204/357 (57%), Gaps = 16/357 (4%)

Query: 147 SGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQ-CYKQSDPVFDPADSAS 205
           SG   G+G Y V +G+G+P     +V D+GSD  WVQCQPC   CY+Q + +FDP  S++
Sbjct: 169 SGRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQQEKLFDPVRSST 228

Query: 206 FSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGR-TVVKNVAIGC 264
           ++ VSC++  C  L   GC  G C Y V YGDGSY+ G  A++TLT+     VK    GC
Sbjct: 229 YANVSCAAPACSDLNIHGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYDAVKGFRFGC 288

Query: 265 GHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPV 324
           G +N+G+F  AAGLLGLG G  SL  Q   + GG F++CL +R TG +G L FG  +   
Sbjct: 289 GERNEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARSTG-TGYLDFGAGSPAA 347

Query: 325 GAAWV--PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTR 382
            +A +  P++ +   P+FYY+G++G+ VGG  + I + +F        G ++D+GT +TR
Sbjct: 348 ASARLTTPMLTD-NGPTFYYIGMTGIRVGGQLLSIPQSVFATA-----GTIVDSGTVITR 401

Query: 383 LPTPAYEAFR--DAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTL 440
           LP PAY + R   A         +A  VS+ DTCY+ +G   V +PTVS  F GG  L +
Sbjct: 402 LPPPAYSSLRYAFAAAMAARGYKKAPAVSLLDTCYDFTGMSQVAIPTVSLLFQGGARLDV 461

Query: 441 PASNFLIPVDDAGTFCFAFAPSPSG--LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
            AS  +     A   C AFA +  G  + I+GN Q +   +++D     VGF P VC
Sbjct: 462 DASGIMY-AASASQVCLAFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFYPGVC 517


>gi|15238250|ref|NP_196637.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|8979710|emb|CAB96831.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
           thaliana]
 gi|18176136|gb|AAL59990.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
 gi|22136986|gb|AAM91722.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
 gi|110740988|dbj|BAE98588.1| nucleoid DNA-binding protein cnd41 - like protein [Arabidopsis
           thaliana]
 gi|332004210|gb|AED91593.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 464

 Score =  243 bits (619), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 146/386 (37%), Positives = 213/386 (55%), Gaps = 12/386 (3%)

Query: 112 MQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYM 171
           ++RD  RV ++  +LS   A+    E +       SG+  GSG Y V IG+G+P     +
Sbjct: 89  IRRDQARVESIYSKLSKNSANEVS-EAKSTELPAKSGITLGSGNYIVTIGIGTPKHDLSL 147

Query: 172 VIDSGSDIVWVQCQPC-SQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCR 230
           V D+GSD+ W QC+PC   CY Q +P F+P+ S+++  VSCSS +C+  E+  C A  C 
Sbjct: 148 VFDTGSDLTWTQCEPCLGSCYSQKEPKFNPSSSSTYQNVSCSSPMCEDAES--CSASNCV 205

Query: 231 YEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLV 289
           Y + YGD S+T+G LA E  T+  + V+++V  GCG  NQG+F G AGLLGLG G +SL 
Sbjct: 206 YSIVYGDKSFTQGFLAKEKFTLTNSDVLEDVYFGCGENNQGLFDGVAGLLGLGPGKLSLP 265

Query: 290 GQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLG 349
            Q        FSYCL S  + S+G L FG   +     + P+   P A + Y + + G+ 
Sbjct: 266 AQTTTTYNNIFSYCLPSFTSNSTGHLTFGSAGISESVKFTPISSFPSAFN-YGIDIIGIS 324

Query: 350 VGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVS 409
           VG   + I+ + F       +G ++D+GT  TRLPT  Y   R  F  +  +    SG  
Sbjct: 325 VGDKELAITPNSFST-----EGAIIDSGTVFTRLPTKVYAELRSVFKEKMSSYKSTSGYG 379

Query: 410 IFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSII 469
           +FDTCY+ +G  +V  PT++F F+G  V+ L  S   +P+      C AFA +    +I 
Sbjct: 380 LFDTCYDFTGLDTVTYPTIAFSFAGSTVVELDGSGISLPI-KISQVCLAFAGNDDLPAIF 438

Query: 470 GNIQQEGIQISFDGANGFVGFGPNVC 495
           GN+QQ  + + +D A G VGF PN C
Sbjct: 439 GNVQQTTLDVVYDVAGGRVGFAPNGC 464


>gi|242086412|ref|XP_002443631.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
 gi|241944324|gb|EES17469.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
          Length = 507

 Score =  242 bits (617), Expect = 3e-61,   Method: Compositional matrix adjust.
 Identities = 182/468 (38%), Positives = 252/468 (53%), Gaps = 39/468 (8%)

Query: 67  ISSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHY-HRHQHSFHA--------RMQRDVK 117
           +S    SS EA  +    H++ M++SS++  ++   HR   + +A        R+QRD  
Sbjct: 40  LSPHAHSSPEAAEDGAHAHQEDMAASSSSAMHVRLLHRDSFAVNATGAELLARRLQRDEL 99

Query: 118 RVATLVRRLSGGGADAAKHEVQDFGTDVVSGM---DQGSGEYFVRIGVGSPPRSQYMVID 174
           R A ++   +  G           G  +V+ +      SG+Y  +I VG+P     + +D
Sbjct: 100 RAAWIISTAAANGTPPPDVVGLSTGRGLVAPVVSRAPTSGDYIAKIAVGTPAVEALLALD 159

Query: 175 SGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAG---CHAGRCRY 231
           + SD+ W+QCQPC +CY QS PVFDP  S S+  ++  +  C  L  +G      G C Y
Sbjct: 160 TASDLTWLQCQPCRRCYPQSGPVFDPRHSTSYGEMNYDAPDCQALGRSGGGDAKRGTCIY 219

Query: 232 EVSYGDG------SYTKGTLALETLTIGRTVVKN-VAIGCGHKNQGMF-VGAAGLLGLGG 283
            V YGDG      S + G L  ETLT    V +  ++IGCGH N+G+F   AAG+LGL  
Sbjct: 220 TVLYGDGDGHGSTSTSVGDLVEETLTFAGGVRQAYLSIGCGHDNKGLFGAPAAGILGLSR 279

Query: 284 GSMSLVGQLGGQ-TGGAFSYCLVS--RGTGS-SGSLVFGREALPVG--AAWVPLVRNPRA 337
           G +S+  Q+       +FSYCLV    G GS S +L FG  A+     A++ P V N   
Sbjct: 280 GQISIPHQIAFLGYNASFSYCLVDFISGPGSPSSTLTFGAGAVDTSPPASFTPTVLNQNM 339

Query: 338 PSFYYVGLSGLGVGGMRIP--ISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAF 395
           P+FYYV L G+ VGG+R+P     DL      G  GV++D+GT VTRL  PAY AFRDAF
Sbjct: 340 PTFYYVRLIGVSVGGVRVPGVTERDLQLDPYTGHGGVILDSGTTVTRLARPAYTAFRDAF 399

Query: 396 VAQTGNLPRAS--GVS-IFDTCYNLSGFVSVR----VPTVSFYFSGGPVLTLPASNFLIP 448
            A    L + S  G S +FDTCY + G   +R    VP VS +F+GG  L+L   N+LI 
Sbjct: 400 RAAATGLGQVSTGGPSGLFDTCYTVGGRAGLRHCVKVPAVSMHFAGGVELSLQPKNYLIT 459

Query: 449 VDDAGTFCFAFAPS-PSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           VD  GT CFAFA +    +S+IGNI Q+G ++ +D     VGF PN C
Sbjct: 460 VDSRGTVCFAFAGTGDRSVSVIGNILQQGFRVVYDIGGQRVGFAPNSC 507


>gi|302768196|ref|XP_002967518.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
 gi|300165509|gb|EFJ32117.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
          Length = 398

 Score =  241 bits (616), Expect = 5e-61,   Method: Compositional matrix adjust.
 Identities = 135/371 (36%), Positives = 204/371 (54%), Gaps = 21/371 (5%)

Query: 143 TDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPAD 202
           TD  S +  G G+Y   I +G+P +   ++ D+GSD++W+QC+PC  C+ Q DP+FDP  
Sbjct: 27  TDYESPVASGGGDYVTTISLGTPAKVFSVIADTGSDLIWIQCKPCQACFNQKDPIFDPEG 86

Query: 203 SASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT-----VV 257
           S+S++ +SC   +CD L    C +  C Y   YGDGS T+GTL+ ET+T+  T       
Sbjct: 87  SSSYTTMSCGDTLCDSLPRKSC-SPDCDYSYGYGDGSGTRGTLSSETVTLTSTQGEKLAA 145

Query: 258 KNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGS--L 315
           KN+A GCGH N+G F  A+GL+GLG G++S V QLG   G  FSYCLV      S +  +
Sbjct: 146 KNIAFGCGHLNRGSFNDASGLVGLGRGNLSFVSQLGDLFGHKFSYCLVPWRDAPSKTSPM 205

Query: 316 VFGREA------LPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGD 369
            FG E+        +  A+ P++ NP   SFYYV L  + + G  + I    F +   G 
Sbjct: 206 FFGDESSSHSSGKKLHYAFTPMIHNPAMESFYYVKLKDISIAGRALRIPAGSFDIKPDGS 265

Query: 370 DGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSI-FDTCYNLSG---FVSVRV 425
            G++ D+GT +T LP   Y+    A  ++  + P+  G S   D CY++SG      +++
Sbjct: 266 GGMIFDSGTTLTLLPDAPYQIVLRALRSKI-SFPKIDGSSAGLDLCYDVSGSKASYKMKI 324

Query: 426 PTVSFYFSGGPVLTLPASNFLIPVDDAGTF-CFAFAPSPSGLSIIGNIQQEGIQISFDGA 484
           P + F+F G     LP  N+ I  +DAGT  C A   S   + I GN+ Q+  ++ +D  
Sbjct: 325 PAMVFHFEGAD-YQLPVENYFIAANDAGTIVCLAMVSSNMDIGIYGNMMQQNFRVMYDIG 383

Query: 485 NGFVGFGPNVC 495
           +  +G+ P+ C
Sbjct: 384 SSKIGWAPSQC 394


>gi|224142007|ref|XP_002324352.1| predicted protein [Populus trichocarpa]
 gi|222865786|gb|EEF02917.1| predicted protein [Populus trichocarpa]
          Length = 460

 Score =  241 bits (615), Expect = 6e-61,   Method: Compositional matrix adjust.
 Identities = 174/498 (34%), Positives = 247/498 (49%), Gaps = 57/498 (11%)

Query: 14  LLLHLLCSIITTSTSAASDTHFQILNVNESIKGSRTDHAKMSQYNELFERHNNISSSNTS 73
           LLL  L        + A+ ++  I+ VN  +  +  +H+            + +S+S   
Sbjct: 3   LLLFSLEKGYAVEENEATKSYLHIIKVNSLLPTTACNHS------------SKVSNS--- 47

Query: 74  SDEARWNLELVHRD-------KMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRL 126
                 +LE+VHR             ++  +NM              RD  RV ++  RL
Sbjct: 48  -----LSLEVVHRHGPCIGIVNQEKGADAPSNMEI----------FLRDQNRVDSIHARL 92

Query: 127 SGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQP 186
           S  G    K   Q     V SG   G+G+Y V +G+G+P +   ++ D+GSDI W QC+P
Sbjct: 93  SSRGMFPEK---QATTLPVQSGASIGAGDYVVTVGLGTPKKEFTLIFDTGSDITWTQCEP 149

Query: 187 CSQ-CYKQSDPVFDPADSASFSGVSCSSAVCDRLENA-----GCHAGRCRYEVSYGDGSY 240
           C + CYKQ +P  +P+ S S+  +SCSSA+C  + +       C +  C Y+V YGDGSY
Sbjct: 150 CVKTCYKQKEPRLNPSTSTSYKNISCSSALCKLVASGKKFSQSCSSSTCLYQVQYGDGSY 209

Query: 241 TKGTLALETLTIGRT-VVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGA 299
           + G  A ETLT+  + V KN   GCG +N G+F GAAGLLGLG   ++L  Q        
Sbjct: 210 SIGFFATETLTLSSSNVFKNFLFGCGQQNNGLFGGAAGLLGLGRTKLALPSQTAKTYKKL 269

Query: 300 FSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISE 359
           FSYCL +  + S G L  G + +     + PL  +  +  FY + ++GL VGG ++ I E
Sbjct: 270 FSYCLPA-SSSSKGYLSLGGQ-VSKSVKFTPLSADFDSTPFYGLDITGLSVGGRKLSIDE 327

Query: 360 DLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSG 419
             F        G V+D+GT +TRL   AY     AF     + P  SG SIFDTCY+ S 
Sbjct: 328 SAFSA------GTVIDSGTVITRLSPTAYSELSSAFQNLMTDYPSTSGYSIFDTCYDFSK 381

Query: 420 FVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFA--PSPSGLSIIGNIQQEGI 477
           + +VR+P V   F GG  + +  S  L PV+     C AFA     S  SI GN+QQ   
Sbjct: 382 YDTVRIPKVGVTFKGGVEMDIDVSGILYPVNGLKKVCLAFAGNDDDSDTSIFGNVQQRTY 441

Query: 478 QISFDGANGFVGFGPNVC 495
           Q+ +DGA G VGF P  C
Sbjct: 442 QVVYDGAKGRVGFAPGGC 459


>gi|413918484|gb|AFW58416.1| hypothetical protein ZEAMMB73_998053 [Zea mays]
          Length = 475

 Score =  241 bits (615), Expect = 6e-61,   Method: Compositional matrix adjust.
 Identities = 156/427 (36%), Positives = 219/427 (51%), Gaps = 38/427 (8%)

Query: 102 HRHQHSFHARMQ-------RDVKRVATLVRRLSGGGADAAKHEVQDFG----TDVVSGMD 150
           H   H  ++R+Q       R   R++ LV R +G  + ++             D+   + 
Sbjct: 51  HVDAHGNYSRLQLLQRAARRSHHRMSRLVARATGAASTSSSKAAAAGDGSGGKDLQVPVH 110

Query: 151 QGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVS 210
            G+GE+ + + VG+P      ++D+GSD+VW QC+PC +C+ Q+ PVFDPA S++++ + 
Sbjct: 111 AGNGEFLMDLSVGTPALPYAAIVDTGSDLVWTQCKPCVECFNQTTPVFDPAASSTYAALP 170

Query: 211 CSSAVCDRLENAGCHAGRCR--------YEVSYGDGSYTKGTLALETLTIGRTVVKNVAI 262
           CSSA+C  L  + C +            Y  +YGD S T+G LA ET T+ R  V  VA 
Sbjct: 171 CSSALCADLPTSTCASSSSSSSASSPCGYTYTYGDASSTQGVLATETFTLARQKVPGVAF 230

Query: 263 GCGHKNQGM-FVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVF---- 317
           GCG  N+G  F   AGL+GLG G +SLV QLG      FSYCL S    +  S +     
Sbjct: 231 GCGDTNEGDGFTQGAGLVGLGRGPLSLVSQLGIDR---FSYCLTSLDDAAGRSPLLLGSA 287

Query: 318 ---GREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVM 374
                 A    A   PLV+NP  PSFYYV L+GL VG  R+ +    F +   G  GV++
Sbjct: 288 AGISASAATAPAQTTPLVKNPSQPSFYYVSLTGLTVGSTRLALPSSAFAIQDDGTGGVIV 347

Query: 375 DTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSI-FDTCYN-----LSGFVSVRVPTV 428
           D+GT++T L   AY A R AFVA   +LP      I  D C+      +   V V+VP +
Sbjct: 348 DSGTSITYLELRAYRALRKAFVAHM-SLPTVDASEIGLDLCFQGPAGAVDQDVQVQVPKL 406

Query: 429 SFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFV 488
             +F GG  L LPA N+++    +G  C     S  GLSIIGN QQ+  Q  +D A   +
Sbjct: 407 VLHFDGGADLDLPAENYMVLDSASGALCLTVMAS-RGLSIIGNFQQQNFQFVYDVAGDTL 465

Query: 489 GFGPNVC 495
            F P  C
Sbjct: 466 SFAPAEC 472


>gi|255558694|ref|XP_002520371.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
 gi|223540418|gb|EEF41987.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
          Length = 557

 Score =  241 bits (615), Expect = 6e-61,   Method: Compositional matrix adjust.
 Identities = 157/441 (35%), Positives = 217/441 (49%), Gaps = 50/441 (11%)

Query: 104 HQHSFHARMQRDVKRVATLVRR---------LSGGGADAAKHEV---------------- 138
           H+ SF A   RD+ R+ TL +R         LS    +  K  V                
Sbjct: 115 HKESFVASTTRDLTRIQTLHKRILEKKNQNALSRLNKEEPKQPVVAPAASPESYPANGLS 174

Query: 139 -QDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPV 197
            Q   T + SG+  GSGEYF+ + +G+PPR   +++D+GSD+ W+QC PC  C+ Q+ P 
Sbjct: 175 GQLMAT-LESGVSLGSGEYFMDVFIGTPPRHFSLILDTGSDLNWIQCVPCYDCFVQNGPY 233

Query: 198 FDPADSASFSGVSCSSAVCDRLENAG----CHAGR--CRYEVSYGDGSYTKGTLALETLT 251
           +DP +S+SF  + C    C  + +      C A    C Y   YGD S T G  ALET T
Sbjct: 234 YDPKESSSFKNIGCHDPRCHLVSSPDPPQPCKAENQTCPYFYWYGDSSNTTGDFALETFT 293

Query: 252 IGRTV---------VKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSY 302
           +  T          V+NV  GCGH N+G+F GAAGLLGLG G +S   QL    G +FSY
Sbjct: 294 VNLTSPAGKSEFKRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSY 353

Query: 303 CLVSRG--TGSSGSLVFGREALPVGAAWV---PLVRNPRAP--SFYYVGLSGLGVGGMRI 355
           CLV R   T  S  L+FG +   +    V    LV     P  +FYYV +  + VGG  +
Sbjct: 354 CLVDRNSDTNVSSKLIFGEDKDLLNHPEVNFTSLVAGKENPVDTFYYVQIKSIMVGGEVL 413

Query: 356 PISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCY 415
            I E+ + L+  G  G ++D+GT ++    P+YE  +DAFV +    P      I D CY
Sbjct: 414 KIPEETWHLSPEGAGGTIVDSGTTLSYFAEPSYEIIKDAFVKKVKGYPVIKDFPILDPCY 473

Query: 416 NLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSP-SGLSIIGNIQQ 474
           N+SG   + +P     F  G V   P  N+ I ++     C A   +P S LSIIGN QQ
Sbjct: 474 NVSGVEKMELPEFRILFEDGAVWNFPVENYFIKLEPEEIVCLAILGTPRSALSIIGNYQQ 533

Query: 475 EGIQISFDGANGFVGFGPNVC 495
           +   I +D     +G+ P  C
Sbjct: 534 QNFHILYDTKKSRLGYAPMKC 554


>gi|125590542|gb|EAZ30892.1| hypothetical protein OsJ_14967 [Oryza sativa Japonica Group]
          Length = 516

 Score =  241 bits (615), Expect = 7e-61,   Method: Compositional matrix adjust.
 Identities = 139/346 (40%), Positives = 190/346 (54%), Gaps = 17/346 (4%)

Query: 162 VGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLEN 221
           +G+P  +   ++D+GSD+VW QC+PC  C+KQS PVFDP+ S++++ V CSSA C  L  
Sbjct: 173 IGTPALAYSAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSSSTYATVPCSSASCSDLPT 232

Query: 222 AGC-HAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGM-FVGAAGLL 279
           + C  A +C Y  +YGD S T+G LA ET T+ ++ +  V  GCG  N+G  F   AGL+
Sbjct: 233 SKCTSASKCGYTYTYGDSSSTQGVLATETFTLAKSKLPGVVFGCGDTNEGDGFSQGAGLV 292

Query: 280 GLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREA-------LPVGAAWVPLV 332
           GLG G +SLV QLG      FSYCL S    ++  L+ G  A               PL+
Sbjct: 293 GLGRGPLSLVSQLGLDK---FSYCLTSLDDTNNSPLLLGSLAGISEASAAASSVQTTPLI 349

Query: 333 RNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFR 392
           +NP  PSFYYV L  + VG  RI +    F +   G  GV++D+GT++T L    Y A +
Sbjct: 350 KNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTGGVIVDSGTSITYLEVQGYRALK 409

Query: 393 DAFVAQTGNLPRASGVSI-FDTCYN--LSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPV 449
            AF AQ   LP A G  +  D C+     G   V VP + F+F GG  L LPA N+++  
Sbjct: 410 KAFAAQMA-LPAADGSGVGLDLCFRAPAKGVDQVEVPRLVFHFDGGADLDLPAENYMVLD 468

Query: 450 DDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
             +G  C     S  GLSIIGN QQ+  Q  +D  +  + F P  C
Sbjct: 469 GGSGALCLTVMGS-RGLSIIGNFQQQNFQFVYDVGHDTLSFAPVQC 513


>gi|357123876|ref|XP_003563633.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 503

 Score =  241 bits (614), Expect = 9e-61,   Method: Compositional matrix adjust.
 Identities = 140/357 (39%), Positives = 201/357 (56%), Gaps = 17/357 (4%)

Query: 148 GMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS-QCYKQSDPVFDPADSASF 206
           G+  G+  Y V IG+G+PP    +V D+GSD  WVQC+PC   CYKQ D +FDPA S+++
Sbjct: 155 GLSLGTANYVVPIGLGTPPSRFTVVFDTGSDTTWVQCRPCVVSCYKQKDRLFDPAKSSTY 214

Query: 207 SGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGH 266
           + VSC+   C  L+ +GC+AG C Y + YGDGSYT G  A +TL + +  +K    GCG 
Sbjct: 215 ANVSCADPACADLDASGCNAGHCLYGIQYGDGSYTVGFFAKDTLAVAQDAIKGFKFGCGE 274

Query: 267 KNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVF---GREALP 323
           KN+G+F   AGLLGLG G  S+  Q   + GG+FSYCL +  + ++G L F      +  
Sbjct: 275 KNRGLFGQTAGLLGLGRGPTSITVQAYEKYGGSFSYCLPAS-SAATGYLEFGPLSPSSSG 333

Query: 324 VGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRI-PISEDLFRLTQMGDDGVVMDTGTAVTR 382
             A   P++ + + P+FYYVGL+G+ VGG ++  I E +F      + G ++D+GT +TR
Sbjct: 334 SNAKTTPMLTD-KGPTFYYVGLTGIRVGGKQLGAIPESVFS-----NSGTLVDSGTVITR 387

Query: 383 LP--TPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTL 440
           LP    A  +   A         +A+  SI DTCY+ +G   V +PTVS  F GG  L L
Sbjct: 388 LPDTAYAALSSAFAAAMAASGYKKAAAYSILDTCYDFTGLSQVSLPTVSLVFQGGACLDL 447

Query: 441 PASNFLIPVDDAGTFCFAFAPS--PSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
            AS  +  +  +   C  FA +     + I+GN QQ    + +D +   VGF P  C
Sbjct: 448 DASGIVYAISQS-QVCLGFASNGDDESVGIVGNTQQRTYGVLYDVSKKVVGFAPGAC 503


>gi|224142005|ref|XP_002324351.1| predicted protein [Populus trichocarpa]
 gi|222865785|gb|EEF02916.1| predicted protein [Populus trichocarpa]
          Length = 472

 Score =  240 bits (613), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 156/391 (39%), Positives = 212/391 (54%), Gaps = 20/391 (5%)

Query: 114 RDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVI 173
           RD  RV ++  RLS  G    K   Q     V SG   G+G+Y V +G+G+P +   ++ 
Sbjct: 92  RDQNRVDSIHARLSSRGMFPEK---QATTLPVQSGASIGAGDYVVTVGLGTPKKEFTLIF 148

Query: 174 DSGSDIVWVQCQPCSQ-CYKQSDPVFDPADSASFSGVSCSSAVCDRLENA-----GCHAG 227
           D+GSDI W QC+PC + CYKQ +P  +P+ S S+  +SCSSA+C  + +       C + 
Sbjct: 149 DTGSDITWTQCEPCVKTCYKQKEPRLNPSTSTSYKNISCSSALCKLVASGKKFSQSCSSS 208

Query: 228 RCRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQGMFVGAAGLLGLGGGSM 286
            C Y+V YGDGSY+ G  A ETLT+  + V KN   GCG +N G+F GAAGLLGLG   +
Sbjct: 209 TCLYQVQYGDGSYSIGFFATETLTLSSSNVFKNFLFGCGQQNNGLFGGAAGLLGLGRTKL 268

Query: 287 SLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLS 346
           +L  Q        FSYCL +  + S G L  G + +     + PL  +  +  FY + ++
Sbjct: 269 ALPSQTAKTYKKLFSYCLPA-SSSSKGYLSLGGQ-VSKSVKFTPLSADFDSTPFYGLDIT 326

Query: 347 GLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRAS 406
           GL VGG ++ I E  F        G V+D+GT +TRL   AY     AF     + P  S
Sbjct: 327 GLSVGGRKLSIDESAFSA------GTVIDSGTVITRLSPTAYSELSSAFQNLMTDYPSTS 380

Query: 407 GVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFA--PSPS 464
           G SIFDTCY+ S + +VR+P V   F GG  + +  S  L PV+     C AFA     S
Sbjct: 381 GYSIFDTCYDFSKYDTVRIPKVGVTFKGGVEMDIDVSGILYPVNGLKKVCLAFAGNDDDS 440

Query: 465 GLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
             SI GN+QQ   Q+ +DGA G VGF P  C
Sbjct: 441 DTSIFGNVQQRTYQVVYDGAKGRVGFAPGGC 471


>gi|225216960|gb|ACN85252.1| aspartic proteinase nepenthesin-1 precursor [Oryza officinalis]
          Length = 519

 Score =  240 bits (613), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 145/357 (40%), Positives = 204/357 (57%), Gaps = 16/357 (4%)

Query: 147 SGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQ-CYKQSDPVFDPADSAS 205
           SG   G+G Y V +G+G+P     +V D+GSD  WVQCQPC   CY+Q + +FDPA S++
Sbjct: 171 SGRALGTGNYVVTVGLGTPVSRYTVVFDTGSDTTWVQCQPCVVVCYEQREKLFDPARSST 230

Query: 206 FSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGR-TVVKNVAIGC 264
           ++ VSC++  C  L   GC  G C Y V YGDGSY+ G  A++TLT+     VK    GC
Sbjct: 231 YANVSCAAPACSDLNIHGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYDAVKGFRFGC 290

Query: 265 GHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPV 324
           G +N+G+F  AAGLLGLG G  SL  Q   + GG F++CL +R TG +G L FG  +L  
Sbjct: 291 GERNEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARSTG-TGYLDFGAGSLAA 349

Query: 325 GAAWV--PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTR 382
            +A +  P++ +   P+FYYVG++G+ VGG  + I + +F        G ++D+GT +TR
Sbjct: 350 ASARLTTPMLTD-NGPTFYYVGMTGIRVGGQLLSIPQSVFATA-----GTIVDSGTVITR 403

Query: 383 LPTPAYEAFR--DAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTL 440
           LP  AY + R   A         +A  VS+ DTCY+ +G   V +PTVS  F GG  L +
Sbjct: 404 LPPAAYSSLRYAFAAAMAARGYKKAPAVSLLDTCYDFTGMSQVAIPTVSLLFQGGARLDV 463

Query: 441 PASNFLIPVDDAGTFCFAFAPSPSG--LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
            AS  +     A   C AFA +  G  + I+GN Q +   +++D     VGF P  C
Sbjct: 464 DASGIMY-AASASQVCLAFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFYPGAC 519


>gi|302753526|ref|XP_002960187.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
 gi|300171126|gb|EFJ37726.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
          Length = 398

 Score =  240 bits (612), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 135/371 (36%), Positives = 203/371 (54%), Gaps = 21/371 (5%)

Query: 143 TDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPAD 202
           TD  S +  G G+Y   I +G+P +   ++ D+GSD++W+QC+PC  C+ Q DP+FDP  
Sbjct: 27  TDYESPVASGGGDYVTTISLGTPAKVFSVIADTGSDLIWIQCKPCQACFNQKDPIFDPEG 86

Query: 203 SASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT-----VV 257
           S+S++ +SC   +CD L    C +  C Y   YGDGS T+GTL+ ET+T+  T       
Sbjct: 87  SSSYTTMSCGDTLCDSLPRKSC-SPNCDYSYGYGDGSGTRGTLSSETVTLTSTQGEKLAA 145

Query: 258 KNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGS--L 315
           KN+A GCGH N+G F  A+GL+GLG G++S V QLG   G  FSYCLV      S +  +
Sbjct: 146 KNIAFGCGHLNRGSFNDASGLVGLGRGNLSFVSQLGDLFGHKFSYCLVPWRDAPSKTSPM 205

Query: 316 VFGREA------LPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGD 369
            FG E+        +  A+ P++ NP   SFYYV L  + + G  + I    F +   G 
Sbjct: 206 FFGDESSSHSSGKKLHYAFTPMIHNPAMESFYYVKLKDISIAGRALRIPAGSFDIKPDGS 265

Query: 370 DGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSI-FDTCYNLSGFVS---VRV 425
            G++ D+GT +T LP   Y+    A  ++  + P   G S   D CY++SG  +    ++
Sbjct: 266 GGMIFDSGTTLTLLPDAPYQIVLRALRSKV-SFPEIDGSSAGLDLCYDVSGSKASYKKKI 324

Query: 426 PTVSFYFSGGPVLTLPASNFLIPVDDAGTF-CFAFAPSPSGLSIIGNIQQEGIQISFDGA 484
           P + F+F G     LP  N+ I  +DAGT  C A   S   + I GN+ Q+  ++ +D  
Sbjct: 325 PAMVFHFEGAD-HQLPVENYFIAANDAGTIVCLAMVSSNMDIGIYGNMMQQNFRVMYDIG 383

Query: 485 NGFVGFGPNVC 495
           +  +G+ P+ C
Sbjct: 384 SSKIGWAPSQC 394


>gi|293329689|dbj|BAJ04354.1| pollen allergen CPA63 [Cryptomeria japonica]
          Length = 472

 Score =  240 bits (612), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 145/402 (36%), Positives = 207/402 (51%), Gaps = 16/402 (3%)

Query: 107 SFHARMQRDVKRVATLVRRLSGGGADAAKHEV---QDFGTDVVSGMDQGSGEYFVRIGVG 163
           S+   +   +K      R +  GG  A K  V   +D    + SG    S  Y +++G G
Sbjct: 72  SWWTAVSESIKGDTARYRAMVKGGWSAGKTMVNPQEDADIPLASGQAISSSNYIIKLGFG 131

Query: 164 SPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCD--RLEN 221
           +PP+S Y V+D+GS+I W+ C PCS C  +  P F+P+ S++++ ++C+S  C   R+  
Sbjct: 132 TPPQSFYTVLDTGSNIAWIPCNPCSGCSSKQQP-FEPSKSSTYNYLTCASQQCQLLRVCT 190

Query: 222 AGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGL 281
              ++  C     YGD S     L+ ETL++G   V+N   GC +  +G+      L+G 
Sbjct: 191 KSDNSVNCSLTQRYGDQSEVDEILSSETLSVGSQQVENFVFGCSNAARGLIQRTPSLVGF 250

Query: 282 GGGSMSLVGQLGGQTGGAFSYCLVSRGTGS-SGSLVFGREALPV-GAAWVPLVRNPRAPS 339
           G   +S V Q        FSYCL S  + + +GSL+ G+EAL   G  + PL+ N R PS
Sbjct: 251 GRNPLSFVSQTATLYDSTFSYCLPSLFSSAFTGSLLLGKEALSAQGLKFTPLLSNSRYPS 310

Query: 340 FYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQT 399
           FYYVGL+G+ VG   + I      L +    G ++D+GT +TRL  PAY A RD+F +Q 
Sbjct: 311 FYYVGLNGISVGEELVSIPAGTLSLDESTGRGTIIDSGTVITRLVEPAYNAMRDSFRSQL 370

Query: 400 GNLPRASGVSIFDTCYNL-SGFVSVRVPTVSFYFSGGPVLTLPASNFLIP-VDDAGTFCF 457
            NL  AS   +FDTCYN  SG   V  P ++ +F     LTLP  N L P  DD    C 
Sbjct: 371 SNLTMASPTDLFDTCYNRPSG--DVEFPLITLHFDDNLDLTLPLDNILYPGNDDGSVLCL 428

Query: 458 AFAPSPSG----LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           AF   P G    LS  GN QQ+ ++I  D A   +G     C
Sbjct: 429 AFGLPPGGGDDVLSTFGNYQQQKLRIVHDVAESRLGIASENC 470


>gi|293333012|ref|NP_001168410.1| uncharacterized protein LOC100382179 precursor [Zea mays]
 gi|223948083|gb|ACN28125.1| unknown [Zea mays]
 gi|413953783|gb|AFW86432.1| hypothetical protein ZEAMMB73_737575 [Zea mays]
          Length = 466

 Score =  239 bits (610), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 158/434 (36%), Positives = 215/434 (49%), Gaps = 32/434 (7%)

Query: 82  ELVHRDKMSSSSNTTNNMHYHRH----------QHSFHARMQRDVKRVATLVRRLSGGGA 131
           E+    K++SS N       HRH          + S    + RD  R A +  +LS    
Sbjct: 45  EVCSGQKVTSSKNGATLPLVHRHGPCSPVMSKEKPSHEETLGRDQLRAANIHAKLSSPRN 104

Query: 132 DAAKHEVQDFGTDV--VSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS- 188
            +AK E+Q  G  +   SG   G+ EY + + +G+P  +Q M ID+GSD+ WVQC PC+ 
Sbjct: 105 SSAK-ELQQSGVTIPTSSGYSLGTPEYVITVSLGTPAVTQVMSIDTGSDVSWVQCAPCAA 163

Query: 189 -QCYKQSDPVFDPADSASFSGVSCSSAVCDRL--ENAGCHAGRCRYEVSYGDGSYTKGTL 245
             C  Q D +FDPA SA++S  SCSSA C +L  E  GC    C+Y V Y D S T GT 
Sbjct: 164 QSCSSQKDKLFDPAKSATYSAFSCSSAQCAQLGGEGNGCLNSHCQYIVKYVDHSNTTGTY 223

Query: 246 ALETLTIGRT-VVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCL 304
             +TL +  +  VKN   GC H+  G      GL+GLGG + SLV Q     G AFSYCL
Sbjct: 224 GSDTLGLTTSDAVKNFQFGCSHRANGFVGQLDGLMGLGGDTESLVSQTAATYGKAFSYCL 283

Query: 305 VSRGTGSSGSLVFGREALPVGAAW---VPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDL 361
               + + G L  G  A    ++     PLVR    P+FY V L  + V G ++ +   +
Sbjct: 284 PPSSSSAGGFLTLGAAAGGTSSSRYSRTPLVRF-NVPTFYGVFLQAITVAGTKLNVPASV 342

Query: 362 FRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFV 421
           F          V+D+GT +T+LP  AY+A R AF  +    P A+ V I DTC++ SG  
Sbjct: 343 F------SGASVVDSGTVITQLPPTAYQALRTAFKKEMKAYPSAAPVGILDTCFDFSGIK 396

Query: 422 SVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISF 481
           +VRVP V+  FS G V+ L  S        AG   F          I+GN+QQ   ++ F
Sbjct: 397 TVRVPVVTLTFSRGAVMDLDVSGIFY----AGCLAFTATAQDGDTGILGNVQQRTFEMLF 452

Query: 482 DGANGFVGFGPNVC 495
           D     +GF P  C
Sbjct: 453 DVGGSTLGFRPGAC 466


>gi|357143680|ref|XP_003573011.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 510

 Score =  239 bits (610), Expect = 3e-60,   Method: Compositional matrix adjust.
 Identities = 167/472 (35%), Positives = 239/472 (50%), Gaps = 55/472 (11%)

Query: 71  NTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRH-----------QHSFHARMQRDVKRV 119
           NT+  +A  + +L+  ++     + +  +H  R            + SF    Q+D  R+
Sbjct: 44  NTAVADAGCDGKLLAEEEEQKDRSPSLKLHMSRRSPAEATAGRTRKDSFLESAQKDGVRI 103

Query: 120 ATLVRRLS------GGGADAAKHEVQDFGTDVV----SGMDQGSGEYFVRIGVGSPPRSQ 169
           AT+ RR++       G   A+    +     +V    SG+  GSGEY V + VG+PPR  
Sbjct: 104 ATMHRRVALQAQAQPGRRSASSSPRRALSERLVATVESGVAVGSGEYLVEVYVGTPPRRF 163

Query: 170 YMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAG----CH 225
            M++D+GSD+ W+QC PC  C+ Q  PVFDP  S S+  V+C    C  +        C 
Sbjct: 164 QMIMDTGSDLNWLQCAPCLDCFDQRGPVFDPMASTSYRNVTCGDTRCGLVSPPAAPRTCR 223

Query: 226 AGR---CRYEVSYGDGSYTKGTLALETLTIGRTV-----VKNVAIGCGHKNQGMFVGAAG 277
           + R   C Y   YGD S T G LALE  T+  T      V  V +GCGH+N+G+F GAAG
Sbjct: 224 SSRSDPCPYYYWYGDQSNTTGDLALEAFTVNLTASSSRRVDGVVLGCGHRNRGLFHGAAG 283

Query: 278 LLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPR- 336
           LLGLG G +S   QL    G AFSYCLV  G+     +VFG + +        L+ +P+ 
Sbjct: 284 LLGLGRGPLSFASQLRAVYGHAFSYCLVDHGSAVGSKIVFGDDNV--------LLSHPQL 335

Query: 337 -----APS-----FYYVGLSGLGVGGMRIPISEDLFRLTQM-GDDGVVMDTGTAVTRLPT 385
                APS     FYYV L G+ VGG  + I  + + +++  G  G ++D+GT ++  P 
Sbjct: 336 NYTAFAPSAAENTFYYVQLKGILVGGEMLDIPSNTWGVSKEDGSGGTIIDSGTTLSYFPE 395

Query: 386 PAYEAFRDAFVAQTGN-LPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASN 444
           PAY+A R AFV +     P  +   +   CYN+SG   V VP  S  F+ G V   PA N
Sbjct: 396 PAYKAIRQAFVDRMDKAYPLIADFPVLSPCYNVSGVERVEVPEFSLLFADGAVWDFPAEN 455

Query: 445 FLIPVDDAGTFCFAFAPSP-SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           + I +D  G  C A   +P S +SIIGN QQ+   + +D  +  +GF P  C
Sbjct: 456 YFIRLDTEGIMCLAVLGTPRSAMSIIGNYQQQNFHVLYDLHHNRLGFAPRRC 507


>gi|224142009|ref|XP_002324353.1| predicted protein [Populus trichocarpa]
 gi|222865787|gb|EEF02918.1| predicted protein [Populus trichocarpa]
          Length = 412

 Score =  239 bits (609), Expect = 3e-60,   Method: Compositional matrix adjust.
 Identities = 155/391 (39%), Positives = 211/391 (53%), Gaps = 20/391 (5%)

Query: 114 RDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVI 173
           RD  RV ++  RLS  G    K         V SG   G+G+Y V +G+G+P +   ++ 
Sbjct: 32  RDQNRVDSIHARLSSRGMFPEKQATT---LPVQSGASIGAGDYVVTVGLGTPKKEFTLIF 88

Query: 174 DSGSDIVWVQCQPCSQ-CYKQSDPVFDPADSASFSGVSCSSAVCDRLENA-----GCHAG 227
           D+GSDI W QC+PC + CYKQ +P  +P+ S S+  +SCSSA+C  + +       C + 
Sbjct: 89  DTGSDITWTQCEPCVKTCYKQKEPRLNPSTSTSYKNISCSSALCKLVASGKKFSQSCSSS 148

Query: 228 RCRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQGMFVGAAGLLGLGGGSM 286
            C Y+V YGDGSY+ G  A ETLT+  + V KN   GCG +N G+F GAAGLLGLG   +
Sbjct: 149 TCLYQVQYGDGSYSIGFFATETLTLSSSNVFKNFLFGCGQQNNGLFGGAAGLLGLGRTKL 208

Query: 287 SLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLS 346
           +L  Q        FSYCL +  + S G L  G + +     + PL  +  +  FY + ++
Sbjct: 209 ALPSQTAKTYKKLFSYCLPA-SSSSKGYLSLGGQ-VSKSVKFTPLSADFDSTPFYGLDIT 266

Query: 347 GLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRAS 406
           GL VGG ++ I E  F        G V+D+GT +TRL   AY     AF     + P  S
Sbjct: 267 GLSVGGRQLSIDESAFSA------GTVIDSGTVITRLSPTAYSELSSAFQNLMTDYPSTS 320

Query: 407 GVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFA--PSPS 464
           G SIFDTCY+ S + +VR+P V   F GG  + +  S  L PV+     C AFA     S
Sbjct: 321 GYSIFDTCYDFSKYDTVRIPKVGVTFKGGVEMDIDVSGILYPVNGLKKVCLAFAGNDDDS 380

Query: 465 GLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
             SI GN+QQ   Q+ +DGA G VGF P  C
Sbjct: 381 DTSIFGNVQQRTYQVVYDGAKGRVGFAPGGC 411


>gi|413923782|gb|AFW63714.1| hypothetical protein ZEAMMB73_300584 [Zea mays]
          Length = 458

 Score =  238 bits (608), Expect = 4e-60,   Method: Compositional matrix adjust.
 Identities = 151/396 (38%), Positives = 220/396 (55%), Gaps = 20/396 (5%)

Query: 107 SFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPP 166
           +   R++RD  R A + R+ SG G D  + +     T +  G    + EY + +G+GSP 
Sbjct: 76  TLEERLRRDQLRAAYIKRKFSGAG-DIEQSDAATVPTTL--GTSLSTLEYVITVGIGSPA 132

Query: 167 RSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRL----ENA 222
            +Q M +D+GSD+ WVQC+PCSQC+ + D +FDP+ S+++S  SCSSA C +L    E  
Sbjct: 133 VTQTMSMDTGSDVSWVQCKPCSQCHSEVDSLFDPSSSSTYSPFSCSSAPCAQLSQSQEGN 192

Query: 223 GCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAA-GLLGL 281
           GC + +C+Y V+YGD S T GT + +TLT+G + + +   GC     G F     GL+GL
Sbjct: 193 GCMSSQCQYIVNYGDSSSTTGTYSSDTLTLGSSAMTDFQFGCSQSESGGFNDQTDGLMGL 252

Query: 282 GGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFY 341
           GGG+ SL  Q  G  G AFSYCL    +GSSG L  G  +   G    P++R+ + P++Y
Sbjct: 253 GGGAQSLASQTAGTFGTAFSYCLPPT-SGSSGFLTLGTGS--SGFVKTPMLRSTQIPTYY 309

Query: 342 YVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGN 401
            V L  + VG  ++ +   +F        G +MD+GT +TRLP  AY A   AF A    
Sbjct: 310 VVLLESIKVGSQQLNLPTSVFSA------GSLMDSGTIITRLPPTAYSALSSAFKAGMQQ 363

Query: 402 LPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAP 461
            P A+   I DTC++ SG  S+ +PTV+  FSGG  + L     ++ +  +   C AF P
Sbjct: 364 YPPATPSGILDTCFDFSGQSSISIPTVTLVFSGGAAVDLAFDGIMLEISSS-IRCLAFTP 422

Query: 462 S--PSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           +   S L IIGN+QQ   ++ +D   G VGF    C
Sbjct: 423 NGDDSSLGIIGNVQQRTFEVLYDVGGGAVGFKAGAC 458


>gi|116790042|gb|ABK25480.1| unknown [Picea sitchensis]
          Length = 460

 Score =  238 bits (607), Expect = 5e-60,   Method: Compositional matrix adjust.
 Identities = 140/392 (35%), Positives = 211/392 (53%), Gaps = 20/392 (5%)

Query: 111 RMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQY 170
           R +R +KR    + +L     +    E   +          G+GE+ +++ +G+P  S  
Sbjct: 79  RFKRAIKRSQDRLEKLQMSVDEVKAVEAPVYA---------GNGEFLMKMAIGTPSLSFS 129

Query: 171 MVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCR 230
            ++D+GSD+ W QC+PC+ CY Q  P++DP+ S+++S V CSS++C  L    C    C 
Sbjct: 130 AILDTGSDLTWTQCKPCTDCYPQPTPIYDPSQSSTYSKVPCSSSMCQALPMYSCSGANCE 189

Query: 231 YEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGS-MSLV 289
           Y  SYGD S T+G L+ E+ T+    + ++A GCG +N+G      G L   G   +SL+
Sbjct: 190 YLYSYGDQSSTQGILSYESFTLTSQSLPHIAFGCGQENEGGGFSQGGGLVGFGRGPLSLI 249

Query: 290 GQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALPVGAAWV---PLVRNPRAPSFYYVGL 345
            QLG   G  FSYCLVS   + S  S +F  +   + A  V   PLV++   P+FYY+ L
Sbjct: 250 SQLGQSLGNKFSYCLVSITDSPSKTSPLFIGKTASLNAKTVSSTPLVQSRSRPTFYYLSL 309

Query: 346 SGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRA 405
            G+ VGG  + I++  F L   G  GV++D+GT VT L    Y+  + A ++   NLP+ 
Sbjct: 310 EGISVGGQLLDIADGTFDLQLDGTGGVIIDSGTTVTYLEQSGYDVVKKAVISSI-NLPQV 368

Query: 406 SGVSI-FDTCYN-LSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSP 463
            G +I  D C+   SG  +   PT++F+F G     LP  N+ I  D +G  C A  PS 
Sbjct: 369 DGSNIGLDLCFEPQSGSSTSHFPTITFHFEGAD-FNLPKENY-IYTDSSGIACLAMLPS- 425

Query: 464 SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           +G+SI GNIQQ+  QI +D     + F P VC
Sbjct: 426 NGMSIFGNIQQQNYQILYDNERNVLSFAPTVC 457


>gi|222635451|gb|EEE65583.1| hypothetical protein OsJ_21095 [Oryza sativa Japonica Group]
          Length = 441

 Score =  238 bits (606), Expect = 6e-60,   Method: Compositional matrix adjust.
 Identities = 163/450 (36%), Positives = 232/450 (51%), Gaps = 42/450 (9%)

Query: 71  NTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGG- 129
            +SSD  R ++ LVHR    + S  +        + S   R++RD  R   +V + +GG 
Sbjct: 9   TSSSDPNRASVPLVHRHGPCAPSAASGG------KPSLAERLRRDRARTNYIVTKATGGR 62

Query: 130 GADAAKHEVQDFGTDVVS--GMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC 187
            A  A  +    GT + +  G    S EY V +G+G+P   Q ++ID+GSD+ WVQC+PC
Sbjct: 63  TAATALSDAAGGGTSIPTFLGDSVNSLEYVVTLGIGTPAVQQTVLIDTGSDLSWVQCKPC 122

Query: 188 S--QCYKQSDPVFDPADSASFSGVSCSSAVCDRLENA----GC------HAGRCRYEVSY 235
              +CY Q DP+FDP+ S+S++ V C S  C +L       GC       A  C Y + Y
Sbjct: 123 GAGECYAQKDPLFDPSSSSSYASVPCDSDACRKLAAGAYGHGCTGVSGGAAALCEYGIEY 182

Query: 236 GDGSYTKGTLALETLTIGR-TVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGG 294
           G+ + T G  + ETLT+    VV +   GCG    G +    GLLGLGG   SLV Q   
Sbjct: 183 GNRATTTGVYSTETLTLKPGVVVADFGFGCGDHQHGPYEKFDGLLGLGGAPESLVSQTSS 242

Query: 295 QTGGAFSYCLVSRGTGSSGSLVFG------REALPVGAAWVPLVRNPRAPSFYYVGLSGL 348
           Q GG FSYCL    +G +G L  G            G ++ P+ R P  P+FY V L+G+
Sbjct: 243 QFGGPFSYCLPPT-SGGAGFLTLGAPPNSSSSTAASGLSFTPMRRLPSVPTFYIVTLTGI 301

Query: 349 GVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAF---VAQTGNLPRA 405
            VGG  + I    F        G+V+D+GT +T LP  AY A R AF   +++   LP +
Sbjct: 302 SVGGAPLAIPPSAF------SSGMVIDSGTVITGLPATAYAALRSAFRSAMSEYRLLPPS 355

Query: 406 SGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSG 465
           +G  + DTCY+ +G  +V VPT+S  FSGG  + L A   ++ VD  G   FA A + + 
Sbjct: 356 NG-GVLDTCYDFTGHANVTVPTISLTFSGGATIDLAAPAGVL-VD--GCLAFAGAGTDNA 411

Query: 466 LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           + IIGN+ Q   ++ +D   G VGF    C
Sbjct: 412 IGIIGNVNQRTFEVLYDSGKGTVGFRAGAC 441


>gi|302809015|ref|XP_002986201.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
 gi|300146060|gb|EFJ12732.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
          Length = 449

 Score =  237 bits (604), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 159/462 (34%), Positives = 232/462 (50%), Gaps = 63/462 (13%)

Query: 81  LELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEV-- 138
           +EL HRD    +SN          +      ++RD+ R+ +  +R+S     +A  E   
Sbjct: 1   MELKHRDHRQPTSN---------RRSLLLESLKRDITRLQSFQKRVSEKLTASANPEAYL 51

Query: 139 ------------------QDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIV 180
                             ++  + V SG + G+GEYF+ + VG+PPR   ++ID+GSD+ 
Sbjct: 52  EMTNSSSTKSPPSPSSSWEEVDSTVESGAELGAGEYFMDVFVGNPPRHFLLIIDTGSDLT 111

Query: 181 WVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCH-------AGRCRYEV 233
           W+QC+PC  C+ QS PVFDP+ S SF  + C++A CD + +  C           C+Y  
Sbjct: 112 WLQCKPCKACFDQSGPVFDPSQSTSFKIIPCNAAACDLVVHDECRDNSSKTSPKTCKYFY 171

Query: 234 SYGDGSYTKGTLALETLTIGRT------VVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMS 287
            YGD S T G LALE+L++  +       ++++ IGCGH N+G+F GA GLLGLG G++S
Sbjct: 172 WYGDSSRTSGDLALESLSVSLSDHPSSLEIRDMVIGCGHSNKGLFQGAGGLLGLGQGALS 231

Query: 288 LVGQL-GGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGA-----------AWVPLVR-N 334
              QL     G +FSYCLV R    S S      A+  GA            + P VR N
Sbjct: 232 FPSQLRSSPIGQSFSYCLVDRTNNLSVS-----SAISFGAGFALSRHFDQMKFTPFVRTN 286

Query: 335 PRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDA 394
               +FYY+G+ G+ +    +PI  + F +   G  G ++D+GT +T L   AY A   A
Sbjct: 287 NSVETFYYLGIQGIKIDQELLPIPAERFAIATNGSGGTIIDSGTTLTYLNRDAYRAVESA 346

Query: 395 FVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLI-PVDDAG 453
           F+A+  + PRA    I   CYN +G  +V  P +S  F  G  L LP  N+ I P     
Sbjct: 347 FLARI-SYPRADPFDILGICYNATGRAAVPFPALSIVFQNGAELDLPQENYFIQPDPQEA 405

Query: 454 TFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
             C A  P+  G+SIIGN QQ+ I   +D  +  +GF    C
Sbjct: 406 KHCLAILPT-DGMSIIGNFQQQNIHFLYDVQHARLGFANTDC 446


>gi|357448247|ref|XP_003594399.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355483447|gb|AES64650.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 452

 Score =  237 bits (604), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 149/467 (31%), Positives = 243/467 (52%), Gaps = 54/467 (11%)

Query: 60  LFERHNNISSSNTSS----DEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRD 115
            F  H  I+SS   S     +    L+L H   + S  N+T+ +        F     +D
Sbjct: 8   FFSAHLAIASSLKDSGLKHKQPDMQLKLYHMTSLKSPPNSTSLL--------FAYMFAKD 59

Query: 116 VKRVATLVRRLSGGG-ADAAKHEV--QDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMV 172
            +R+     RL+    A+A+  +V  +  G  + SG+  GSG Y+V++G+GSP +   M+
Sbjct: 60  EERIRYFHSRLAKNSDANASSKKVGPKLAGIPLKSGLSMGSGNYYVKMGLGSPTKYYTMI 119

Query: 173 IDSGSDIVWVQCQPCS-QCYKQSDPVFDPADSASFSGVSC-------------SSAVCDR 218
           +D+GS   W+QCQPC+  C+ Q DPVF+P+ S ++  V C             +   C +
Sbjct: 120 VDTGSSFSWLQCQPCTIYCHIQEDPVFNPSASKTYKTVPCSSSQCSSLKSATLNEPTCSK 179

Query: 219 LENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQGMFVGAAG 277
             NA      C Y+ SYGD S++ G L+ + LT+  +  + +   GCG  NQG+F    G
Sbjct: 180 QSNA------CVYKASYGDSSFSLGYLSQDVLTLTPSQTLSSFVYGCGQDNQGLFGRTDG 233

Query: 278 LLGLGGGSMSLVGQLGGQTGGAFSYCLVSR----GTGSSGSLVFGREALPVGAAW--VPL 331
           ++GL    +S++ QL G+ G AFSYCL +      +   G L  G  +L   +++   PL
Sbjct: 234 IIGLANNELSMLSQLSGKYGNAFSYCLPTSFSTPNSPKEGFLSIGTSSLTPSSSYKFTPL 293

Query: 332 VRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAF 391
           ++NP  PS Y++ L  + V G  + ++   +++        ++D+GT +TRLPTP Y   
Sbjct: 294 LKNPNNPSLYFIDLESITVAGRPLGVAASSYKVP------TIIDSGTVITRLPTPVYTTL 347

Query: 392 RDAFVA-QTGNLPRASGVSIFDTCY--NLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIP 448
           ++A+V   +    +A G+S+ DTC+  +L+G +S   P +   F GG  L L   N L+ 
Sbjct: 348 KNAYVTILSKKYQQAPGISLLDTCFKGSLAG-ISEVAPDIRIIFKGGADLQLKGHNSLVE 406

Query: 449 VDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           + + G  C A A S S ++IIGN QQ+ +++++D  N  VGF P  C
Sbjct: 407 L-ETGITCLAMAGS-SSIAIIGNYQQQTVKVAYDVGNSRVGFAPGGC 451


>gi|167997964|ref|XP_001751688.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162696786|gb|EDQ83123.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 418

 Score =  237 bits (604), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 138/368 (37%), Positives = 197/368 (53%), Gaps = 15/368 (4%)

Query: 141 FGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDP 200
           F + VVSG   GSG+YFV   +G+PP+   +++DSGSD++WVQC PC QCY Q  P++ P
Sbjct: 49  FQSPVVSGSTLGSGQYFVDFFLGTPPQKFSLIVDSGSDLLWVQCSPCRQCYAQDSPLYVP 108

Query: 201 ADSASFSGVSCSSAVCDRL---ENAGC---HAGRCRYEVSYGDGSYTKGTLALETLTIGR 254
           ++S++FS V C S+ C  +   E   C   + G C YE  Y D S +KG  A E+ T+  
Sbjct: 109 SNSSTFSPVPCLSSDCLLIPATEGFPCDFRYPGACAYEYLYADTSSSKGVFAYESATVDG 168

Query: 255 TVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR--GTGSS 312
             +  VA GCG  NQG F  A G+LGLG G +S   Q+G   G  F+YCLV+    T  S
Sbjct: 169 VRIDKVAFGCGSDNQGSFAAAGGVLGLGQGPLSFGSQVGYAYGNKFAYCLVNYLDPTSVS 228

Query: 313 GSLVFGREALPV--GAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDD 370
            SL+FG E +       + P+V NP++P+ YYV +  + VGG  +PIS+  + +  +G+ 
Sbjct: 229 SSLIFGDELISTIHDMQYTPIVSNPKSPTLYYVQIEKVTVGGKSLPISDSAWEIDLLGNG 288

Query: 371 GVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSF 430
           G + D+GT +T     AY     AF +   + PRA  V   D C  L+G      P+ + 
Sbjct: 289 GSIFDSGTTLTYWFPSAYSHILAAFDSGV-HYPRAESVQGLDLCVELTGVDQPSFPSFTI 347

Query: 431 YFSGGPVLTLPASNFLIPVDDAGTFCFAFA--PSP-SGLSIIGNIQQEGIQISFDGANGF 487
            F  G V    A N+ + V      C A A   SP  G + IGN+ Q+   + +D     
Sbjct: 348 EFDDGAVFQPEAENYFVDV-APNVRCLAMAGLASPLGGFNTIGNLLQQNFFVQYDREENL 406

Query: 488 VGFGPNVC 495
           +GF P  C
Sbjct: 407 IGFAPAKC 414


>gi|168054484|ref|XP_001779661.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162668975|gb|EDQ55572.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 419

 Score =  236 bits (603), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 138/370 (37%), Positives = 194/370 (52%), Gaps = 15/370 (4%)

Query: 139 QDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVF 198
            DF + VVSG   GSG+YFV   +G+PP+   +++DSGSD++WVQC PC QCY Q  P++
Sbjct: 48  HDFQSPVVSGSTLGSGQYFVDFFLGTPPQKFSLIVDSGSDLLWVQCAPCLQCYAQDTPLY 107

Query: 199 DPADSASFSGVSCSSAVCDRL---ENAGC---HAGRCRYEVSYGDGSYTKGTLALETLTI 252
            P++S++F+ V C S  C  +   E   C   + G C YE  Y D S +KG  A E+ T+
Sbjct: 108 APSNSSTFNPVPCLSPECLLIPATEGFPCDFHYPGACAYEYRYADTSLSKGVFAYESATV 167

Query: 253 GRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR--GTG 310
               +  VA GCG  NQG F  A G+LGLG G +S   Q+G   G  F+YCLV+    T 
Sbjct: 168 DDVRIDKVAFGCGRDNQGSFAAAGGVLGLGQGPLSFGSQVGYAYGNKFAYCLVNYLDPTS 227

Query: 311 SSGSLVFGREALPV--GAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMG 368
            S  L+FG E +       + P+V N R P+ YYV +  + VGG  +PIS   + L  +G
Sbjct: 228 VSSWLIFGDELISTIHDLQFTPIVSNSRNPTLYYVQIEKVMVGGESLPISHSAWSLDFLG 287

Query: 369 DDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTV 428
           + G + D+GT VT    PAY     AF  +    PRA+ V   D C +++G      P+ 
Sbjct: 288 NGGSIFDSGTTVTYWLPPAYRNILAAF-DKNVRYPRAASVQGLDLCVDVTGVDQPSFPSF 346

Query: 429 SFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPS---GLSIIGNIQQEGIQISFDGAN 485
           +    GG V      N+ + V      C A A  PS   G + IGN+ Q+   + +D   
Sbjct: 347 TIVLGGGAVFQPQQGNYFVDV-APNVQCLAMAGLPSSVGGFNTIGNLLQQNFLVQYDREE 405

Query: 486 GFVGFGPNVC 495
             +GF P  C
Sbjct: 406 NRIGFAPAKC 415


>gi|54290731|dbj|BAD62401.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
          Length = 521

 Score =  236 bits (603), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 163/448 (36%), Positives = 231/448 (51%), Gaps = 42/448 (9%)

Query: 73  SSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGG-GA 131
           SSD  R ++ LVHR    + S  +        + S   R++RD  R   +V + +GG  A
Sbjct: 91  SSDPNRASVPLVHRHGPCAPSAASGG------KPSLAERLRRDRARTNYIVTKATGGRTA 144

Query: 132 DAAKHEVQDFGTDVVS--GMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS- 188
             A  +    GT + +  G    S EY V +G+G+P   Q ++ID+GSD+ WVQC+PC  
Sbjct: 145 ATALSDAAGGGTSIPTFLGDSVNSLEYVVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCGA 204

Query: 189 -QCYKQSDPVFDPADSASFSGVSCSSAVCDRLENA----GC------HAGRCRYEVSYGD 237
            +CY Q DP+FDP+ S+S++ V C S  C +L       GC       A  C Y + YG+
Sbjct: 205 GECYAQKDPLFDPSSSSSYASVPCDSDACRKLAAGAYGHGCTGVSGGAAALCEYGIEYGN 264

Query: 238 GSYTKGTLALETLTIGR-TVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQT 296
            + T G  + ETLT+    VV +   GCG    G +    GLLGLGG   SLV Q   Q 
Sbjct: 265 RATTTGVYSTETLTLKPGVVVADFGFGCGDHQHGPYEKFDGLLGLGGAPESLVSQTSSQF 324

Query: 297 GGAFSYCLVSRGTGSSGSLVFG------REALPVGAAWVPLVRNPRAPSFYYVGLSGLGV 350
           GG FSYCL    +G +G L  G            G ++ P+ R P  P+FY V L+G+ V
Sbjct: 325 GGPFSYCLPPT-SGGAGFLTLGAPPNSSSSTAASGLSFTPMRRLPSVPTFYIVTLTGISV 383

Query: 351 GGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAF---VAQTGNLPRASG 407
           GG  + I    F        G+V+D+GT +T LP  AY A R AF   +++   LP ++G
Sbjct: 384 GGAPLAIPPSAF------SSGMVIDSGTVITGLPATAYAALRSAFRSAMSEYRLLPPSNG 437

Query: 408 VSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLS 467
             + DTCY+ +G  +V VPT+S  FSGG  + L A   ++ VD  G   FA A + + + 
Sbjct: 438 -GVLDTCYDFTGHANVTVPTISLTFSGGATIDLAAPAGVL-VD--GCLAFAGAGTDNAIG 493

Query: 468 IIGNIQQEGIQISFDGANGFVGFGPNVC 495
           IIGN+ Q   ++ +D   G VGF    C
Sbjct: 494 IIGNVNQRTFEVLYDSGKGTVGFRAGAC 521


>gi|356555807|ref|XP_003546221.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 457

 Score =  236 bits (603), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 153/441 (34%), Positives = 228/441 (51%), Gaps = 41/441 (9%)

Query: 80  NLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQ 139
            L L H   + SS  +T+         SF   + +D +RV  L  RL+   + +      
Sbjct: 32  QLNLYHVKGLDSSQTSTS-------PFSFSDMITKDEERVRFLHSRLTNKESASNSATTD 84

Query: 140 DFG------TDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS-QCYK 192
             G      T + SG+  GSG Y+V+IGVG+P +   M++D+GS + W+QCQPC   C+ 
Sbjct: 85  KLGGPSLVSTPLKSGLSIGSGNYYVKIGVGTPAKYFSMIVDTGSSLSWLQCQPCVIYCHV 144

Query: 193 QSDPVFDPADSASFSG-----VSCSSAVCDRLENAGCH--AGRCRYEVSYGDGSYTKGTL 245
           Q DP+F P+ S ++         CSS     L   GC    G C Y+ SYGD S++ G L
Sbjct: 145 QVDPIFTPSVSKTYKALSCSSSQCSSLKSSTLNAPGCSNATGACVYKASYGDTSFSIGYL 204

Query: 246 ALETLTIGRTVVKN--VAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYC 303
           + + LT+  +   +     GCG  NQG+F  +AG++GL    +S++GQL  + G AFSYC
Sbjct: 205 SQDVLTLTPSAAPSSGFVYGCGQDNQGLFGRSAGIIGLANDKLSMLGQLSNKYGNAFSYC 264

Query: 304 LVSRGTGSSGSLVFGREALPVGAA--------WVPLVRNPRAPSFYYVGLSGLGVGGMRI 355
           L S  +    S V G   L +GA+        + PLV+NP+ PS Y++GL+ + V G  +
Sbjct: 265 LPSSFSAQPNSSVSGF--LSIGASSLSSSPYKFTPLVKNPKIPSLYFLGLTTITVAGKPL 322

Query: 356 PISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVA-QTGNLPRASGVSIFDTC 414
            +S   + +        ++D+GT +TRLP   Y A + +FV   +    +A G SI DTC
Sbjct: 323 GVSASSYNVP------TIIDSGTVITRLPVAIYNALKKSFVMIMSKKYAQAPGFSILDTC 376

Query: 415 YNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQ 474
           +  S      VP +   F GG  L L   N L+ ++  GT C A A S + +SIIGN QQ
Sbjct: 377 FKGSVKEMSTVPEIRIIFRGGAGLELKVHNSLVEIEK-GTTCLAIAASSNPISIIGNYQQ 435

Query: 475 EGIQISFDGANGFVGFGPNVC 495
           +   +++D AN  +GF P  C
Sbjct: 436 QTFTVAYDVANSKIGFAPGGC 456


>gi|168040957|ref|XP_001772959.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162675692|gb|EDQ62184.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 351

 Score =  236 bits (603), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 133/345 (38%), Positives = 195/345 (56%), Gaps = 9/345 (2%)

Query: 152 GSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSC 211
           GSGEY ++I +G+PP+    ++D+GSD+ WVQC PC++C++Q DP+F P  S+S+S  SC
Sbjct: 4   GSGEYVLQISLGTPPQQFSAIVDTGSDLCWVQCAPCARCFEQPDPLFIPLASSSYSNASC 63

Query: 212 SSAVCDRLENAGCHA-GRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQG 270
           + ++CD L    C     C Y  SYGDGS T+G  A ET+T+  + +  +  GCGH  +G
Sbjct: 64  TDSLCDALPRPTCSMRNTCTYSYSYGDGSNTRGDFAFETVTLNGSTLARIGFGCGHNQEG 123

Query: 271 MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRG-TGSSGSLVFGREALPVGAAWV 329
            F GA GL+GLG G +SL  QL       FSYCLV +  TG+   + FG  A    A++ 
Sbjct: 124 TFAGADGLIGLGQGPLSLPSQLNSSFTHIFSYCLVDQSTTGTFSPITFGNAAENSRASFT 183

Query: 330 PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYE 389
           PL++N   PS+YYVG+  + VG  R+P     FR+   G  GV++D+GT +T     A+ 
Sbjct: 184 PLLQNEDNPSYYYVGVESISVGNRRVPTPPSAFRIDANGVGGVILDSGTTITYWRLAAFI 243

Query: 390 AFRDAFVAQTGNLPRASGVSI-FDTCYNLSGF--VSVRVPTVSFYFSGGPVLTLPASNFL 446
                   Q  + P A       + CY++S     S+ +P+++ + +      +P SN  
Sbjct: 244 PILAELRRQI-SYPEADPTPYGLNLCYDISSVSASSLTLPSMTVHLTNVD-FEIPVSNLW 301

Query: 447 IPVDDAG-TFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGF 490
           + VD+ G T C A + S    SIIGN+QQ+   I  D AN  VGF
Sbjct: 302 VLVDNFGETVCTAMSTS-DQFSIIGNVQQQNNLIVTDVANSRVGF 345


>gi|21595063|gb|AAM66069.1| putative aspartyl protease [Arabidopsis thaliana]
          Length = 484

 Score =  236 bits (602), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 155/460 (33%), Positives = 235/460 (51%), Gaps = 41/460 (8%)

Query: 58  NELFERHNNISSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHS-----FHARM 112
            ++   HNNI S   S + +        R       +TT  M  HR   S        +M
Sbjct: 32  KKILSVHNNIWSPKKSYEAST---SCFSRSLGKGRESTTLEMK-HRELCSGKTIDLGKKM 87

Query: 113 QR----DVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRS 168
           +R    D  RV +L  ++    +   +  V +    + SG+   S  Y V + +G   ++
Sbjct: 88  RRALVLDNIRVQSLQLKIKAMTSSTTEQSVSETQIPLTSGIKLESLNYIVTVELGG--KN 145

Query: 169 QYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGR 228
             +++D+GSD+ WVQCQPC  CY Q  P++DP+ S+S+  V C+S+ C  L  A  ++G 
Sbjct: 146 MSLIVDTGSDLTWVQCQPCRSCYNQQGPLYDPSVSSSYKTVFCNSSTCQDLVAATSNSGP 205

Query: 229 C-----------RYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAG 277
           C            Y VSYGDGSYT+G LA E++ +G T ++N   GCG  N+G+F G++G
Sbjct: 206 CGGNNGVVKTPCEYVVSYGDGSYTRGDLASESILLGDTKLENFVFGCGRNNKGLFGGSSG 265

Query: 278 LLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREAL----PVGAAWVPLVR 333
           L+GLG  S+SLV Q      G FSYCL S   G+SGSL FG ++         ++ PLV+
Sbjct: 266 LMGLGRSSVSLVSQTLKTFNGVFSYCLPSLEDGASGSLSFGNDSSVYTNSTSVSYTPLVQ 325

Query: 334 NPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRD 393
           NP+  SFY + L+G  +GG+ +  S    R       G+++D+GT +TRLP   Y+A + 
Sbjct: 326 NPQLRSFYILNLTGASIGGVELK-SSSFGR-------GILIDSGTVITRLPPSIYKAVKI 377

Query: 394 AFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASN-FLIPVDDA 452
            F+ Q    P A G SI DTC+NL+ +  + +P +   F G   L +  +  F     DA
Sbjct: 378 EFLKQFSGFPTAPGYSILDTCFNLTSYEDISIPIIKMIFQGNAELEVDVTGVFYFVKPDA 437

Query: 453 GTFCFAFA--PSPSGLSIIGNIQQEGIQISFDGANGFVGF 490
              C A A     + + IIGN QQ+  ++ +D     +G 
Sbjct: 438 SLVCLALASLSYENEVGIIGNYQQKNQRVIYDSTQERLGI 477


>gi|302806531|ref|XP_002985015.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
 gi|300147225|gb|EFJ13890.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
          Length = 533

 Score =  236 bits (601), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 159/463 (34%), Positives = 232/463 (50%), Gaps = 63/463 (13%)

Query: 80  NLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEV- 138
            +EL HRD    + N          +      ++RD+ R+ +  +R+S     +A  E  
Sbjct: 84  KMELKHRDHGQPTRN---------RRSLLLESLKRDITRLQSFQKRVSEKLTASANPEAY 134

Query: 139 -------------------QDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDI 179
                              ++  + V SG + G+GEYF+ + VG+PPR   ++ID+GSD+
Sbjct: 135 LEMTNSSSTKSPPSPSSSWEEVDSTVESGAELGAGEYFMDVFVGNPPRHFLLIIDTGSDL 194

Query: 180 VWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCH-------AGRCRYE 232
            W+QC+PC  C+ QS PVFDP+ S SF  + C++A CD + +  C           C+Y 
Sbjct: 195 TWLQCKPCKACFDQSGPVFDPSQSTSFKIIPCNAAACDLVVHDECRDNSSKTSPKTCKYF 254

Query: 233 VSYGDGSYTKGTLALETLTIGRT------VVKNVAIGCGHKNQGMFVGAAGLLGLGGGSM 286
             YGD S T G LALE+L++  +       ++++ IGCGH N+G+F GA GLLGLG G++
Sbjct: 255 YWYGDSSRTSGDLALESLSVSLSDHPSSLEIRDMVIGCGHSNKGLFQGAGGLLGLGQGAL 314

Query: 287 SLVGQL-GGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAA-----------WVPLVR- 333
           S   QL     G +FSYCLV R    S S      A+  GA            + P VR 
Sbjct: 315 SFPSQLRSSPIGQSFSYCLVDRTNNLSVS-----SAISFGAGFALSRHFDQMRFTPFVRT 369

Query: 334 NPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRD 393
           N    +FYY+G+ G+ +    +PI  + F +   G  G ++D+GT +T L   AY A   
Sbjct: 370 NNSVETFYYLGIQGIKIDQELLPIPAERFAIAPNGSGGTIIDSGTTLTYLNRDAYRAVES 429

Query: 394 AFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLI-PVDDA 452
           AF+A+  + PRA    I   CYN +G  +V  PT+S  F  G  L LP  N+ I P    
Sbjct: 430 AFLARI-SYPRADPFDILGICYNATGRTAVPFPTLSIVFQNGAELDLPQENYFIQPDPQE 488

Query: 453 GTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
              C A  P+  G+SIIGN QQ+ I   +D  +  +GF    C
Sbjct: 489 AKHCLAILPT-DGMSIIGNFQQQNIHFLYDVQHARLGFANTDC 530


>gi|18412482|ref|NP_565219.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|19699359|gb|AAL91289.1| At1g79720/F19K16_30 [Arabidopsis thaliana]
 gi|26450464|dbj|BAC42346.1| unknown protein [Arabidopsis thaliana]
 gi|115646741|gb|ABJ17101.1| At1g79720 [Arabidopsis thaliana]
 gi|332198170|gb|AEE36291.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 484

 Score =  236 bits (601), Expect = 3e-59,   Method: Compositional matrix adjust.
 Identities = 155/460 (33%), Positives = 235/460 (51%), Gaps = 41/460 (8%)

Query: 58  NELFERHNNISSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHS-----FHARM 112
            ++   HNNI S   S + +        R       +TT  M  HR   S        +M
Sbjct: 32  KKILSVHNNIWSPKKSYEAST---SCFSRSLGKGRESTTLEMK-HRELCSGKTIDLGKKM 87

Query: 113 QR----DVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRS 168
           +R    D  RV +L  ++    +   +  V +    + SG+   S  Y V + +G   ++
Sbjct: 88  RRALVLDNIRVQSLQLKIKAMTSSTTEQSVSETQIPLTSGIKLESLNYIVTVELGG--KN 145

Query: 169 QYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGR 228
             +++D+GSD+ WVQCQPC  CY Q  P++DP+ S+S+  V C+S+ C  L  A  ++G 
Sbjct: 146 MSLIVDTGSDLTWVQCQPCRSCYNQQGPLYDPSVSSSYKTVFCNSSTCQDLVAATSNSGP 205

Query: 229 C-----------RYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAG 277
           C            Y VSYGDGSYT+G LA E++ +G T ++N   GCG  N+G+F G++G
Sbjct: 206 CGGNNGVVKTPCEYVVSYGDGSYTRGDLASESILLGDTKLENFVFGCGRNNKGLFGGSSG 265

Query: 278 LLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREAL----PVGAAWVPLVR 333
           L+GLG  S+SLV Q      G FSYCL S   G+SGSL FG ++         ++ PLV+
Sbjct: 266 LMGLGRSSVSLVSQTLKTFNGVFSYCLPSLEDGASGSLSFGNDSSVYTNSTSVSYTPLVQ 325

Query: 334 NPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRD 393
           NP+  SFY + L+G  +GG+ +  S    R       G+++D+GT +TRLP   Y+A + 
Sbjct: 326 NPQLRSFYILNLTGASIGGVELK-SSSFGR-------GILIDSGTVITRLPPSIYKAVKI 377

Query: 394 AFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASN-FLIPVDDA 452
            F+ Q    P A G SI DTC+NL+ +  + +P +   F G   L +  +  F     DA
Sbjct: 378 EFLKQFSGFPTAPGYSILDTCFNLTSYEDISIPIIKMIFQGNAELEVDVTGVFYFVKPDA 437

Query: 453 GTFCFAFA--PSPSGLSIIGNIQQEGIQISFDGANGFVGF 490
              C A A     + + IIGN QQ+  ++ +D     +G 
Sbjct: 438 SLVCLALASLSYENEVGIIGNYQQKNQRVIYDTTQERLGI 477


>gi|326516344|dbj|BAJ92327.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 464

 Score =  235 bits (600), Expect = 4e-59,   Method: Compositional matrix adjust.
 Identities = 151/438 (34%), Positives = 223/438 (50%), Gaps = 42/438 (9%)

Query: 81  LELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGG----ADAAKH 136
           LEL H    SS   +             HA +  D  RV++L RR+   G    +DAA  
Sbjct: 43  LELRHHASFSSGGKS--------RAEEAHAVLASDAARVSSLQRRIGSYGLIRSSDAASA 94

Query: 137 EVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDP 196
             +     V SG    +  Y   +G+G    +  +++D+ S++ WVQC+PC  C+ Q +P
Sbjct: 95  S-KLAQVPVTSGARLRTLNYVATVGIGGGEAT--VIVDTASELTWVQCEPCDACHDQQEP 151

Query: 197 VFDPADSASFSGVSCSSAVCDRLENAGCHAGR--------CRYEVSYGDGSYTKGTLALE 248
           +FDP+ S S++ V C+S+ CD L  A   +G+        C Y +SY DGSY++G LA +
Sbjct: 152 LFDPSSSPSYAAVPCNSSSCDALRVATGMSGQACDDQPAACSYTLSYRDGSYSRGVLAHD 211

Query: 249 TLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRG 308
            L++    ++    GCG  NQG F G +GL+GLG   +SL+ Q   Q GG FSYCL  + 
Sbjct: 212 RLSLAGEDIQGFVFGCGTSNQGPFGGTSGLMGLGRSQLSLISQTMDQFGGVFSYCLPPKE 271

Query: 309 TGSSGSLVFG------REALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLF 362
           +GSSGSLV G      R + P+   +  +V +P    FY   L+G+ VGG      ED+ 
Sbjct: 272 SGSSGSLVLGDDASVYRNSTPI--VYTAMVSDPLQGPFYLANLTGITVGG------EDVQ 323

Query: 363 R--LTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGF 420
               +  G    ++D+GT +T L    Y A R  FV+Q    P+A+  SI DTC++L+G 
Sbjct: 324 SPGFSAGGGGKAIVDSGTIITSLVPSVYAAVRAEFVSQLAEYPQAAPFSILDTCFDLTGL 383

Query: 421 VSVRVPTVSFYFSGGPVLTLPASNFLIPVD-DAGTFCFAFA--PSPSGLSIIGNIQQEGI 477
             V+VP++   F GG  + + +   L  V  DA   C A A   S     IIGN QQ+ +
Sbjct: 384 REVQVPSLKLVFDGGAEVEVDSKGVLYVVTGDASQVCLALASLKSEYDTPIIGNYQQKNL 443

Query: 478 QISFDGANGFVGFGPNVC 495
           ++ FD     +GF    C
Sbjct: 444 RVIFDTVGSQIGFAQETC 461


>gi|224092218|ref|XP_002309514.1| predicted protein [Populus trichocarpa]
 gi|222855490|gb|EEE93037.1| predicted protein [Populus trichocarpa]
          Length = 474

 Score =  235 bits (600), Expect = 4e-59,   Method: Compositional matrix adjust.
 Identities = 152/401 (37%), Positives = 214/401 (53%), Gaps = 21/401 (5%)

Query: 103 RHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGV 162
           R    F  + Q  V  +   + ++SG G      E         SG+  G+G Y V +G+
Sbjct: 86  RSHVEFLLQDQLRVDSIQARLSKISGHGI----FEEMVTKLPAQSGIAIGTGNYVVTVGL 141

Query: 163 GSPPRSQYMVIDSGSDIVWVQCQPC-SQCYKQSDPVFDPADSASFSGVSCSSAVCDRLEN 221
           G+P     +V D+GS I W QCQPC   CY Q +  FDP  S S++ VSCSSA C+ L  
Sbjct: 142 GTPKEDFTLVFDTGSGITWTQCQPCLGSCYPQKEQKFDPTKSTSYNNVSCSSASCNLLPT 201

Query: 222 A--GCHAGR--CRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQGMFVGAA 276
           +  GC A    C Y++ YGD SY++G  A ETLTI  + V  N   GCG  N G+F  AA
Sbjct: 202 SERGCSASNSTCLYQIIYGDQSYSQGFFATETLTISSSDVFTNFLFGCGQSNNGLFGQAA 261

Query: 277 GLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPR 336
           GLLGL   S+SL  Q   +    FSYCL S  + S+G L FG + +   A + P+  +P 
Sbjct: 262 GLLGLSSSSVSLPSQTAEKYQKQFSYCLPSTPS-STGYLNFGGK-VSQTAGFTPI--SPA 317

Query: 337 APSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFV 396
             SFY + + G+ V G ++PI   +F  +     G ++D+GT +TRLP  AY+A ++AF 
Sbjct: 318 FSSFYGIDIVGISVAGSQLPIDPSIFTTS-----GAIIDSGTVITRLPPTAYKALKEAFD 372

Query: 397 AQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFC 456
            +  N P+ +G  + DTCY+ S + +V  P VS  F GG  + + AS  L  V+     C
Sbjct: 373 EKMSNYPKTNGDELLDTCYDFSNYTTVSFPKVSVSFKGGVEVDIDASGILYLVNGVKMVC 432

Query: 457 FAFAPSP--SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
            AFA +   S   I GN QQ+  ++ +DGA G +GF    C
Sbjct: 433 LAFAANKDDSEFGIFGNHQQKTYEVVYDGAKGMIGFAAGAC 473


>gi|224085379|ref|XP_002307559.1| predicted protein [Populus trichocarpa]
 gi|222857008|gb|EEE94555.1| predicted protein [Populus trichocarpa]
          Length = 455

 Score =  235 bits (600), Expect = 4e-59,   Method: Compositional matrix adjust.
 Identities = 143/417 (34%), Positives = 212/417 (50%), Gaps = 36/417 (8%)

Query: 110 ARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGT--------DVVSGMDQGSGEYFVRIG 161
           +R+++D +R    ++ +    A       + +GT         + SG+  GSGEYF+ + 
Sbjct: 41  SRLKKDKERPEKQIKTVVATAASP-----ESYGTGLSGQLMATLESGVTLGSGEYFMDVF 95

Query: 162 VGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLEN 221
           +G+PP+   +++D+GSD+ W+QC PC  C++Q+ P +DP +S+SF  + C    C  + +
Sbjct: 96  IGTPPKHYSLILDTGSDLNWIQCVPCHDCFEQNGPYYDPKESSSFRNIGCHDPRCHLVSS 155

Query: 222 AG----CHAGR--CRYEVSYGDGSYTKGTLALETLTIGRTV---------VKNVAIGCGH 266
                 C A    C Y   YGD S T G  A ET T+  T          V+NV  GCGH
Sbjct: 156 PDPPLPCKAENQTCPYFYWYGDSSNTTGDFATETFTVNLTSPTGKSEFKRVENVMFGCGH 215

Query: 267 KNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRG--TGSSGSLVFGREALPV 324
            N+G+F GA+GLLGLG G +S   QL    G +FSYCLV R   T  S  L+FG +   +
Sbjct: 216 WNRGLFHGASGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTNVSSKLIFGEDKDLL 275

Query: 325 GAA---WVPLVRNPRAP--SFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTA 379
                 +  LV     P  +FYYV +  + VGG  + I E  + +T  G  G ++D+GT 
Sbjct: 276 NHPELNFTTLVGGKENPVDTFYYVQIKSIMVGGEVLNIPESTWNMTSDGVGGTIVDSGTT 335

Query: 380 VTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLT 439
           ++    PAY+  +DAFV +    P      I D CYN+SG   + +P     F+ G V  
Sbjct: 336 LSYFTEPAYQIIKDAFVKKVKGYPIVQDFPILDPCYNVSGVEKIDLPDFGILFADGAVWN 395

Query: 440 LPASNFLIPVDDAGTFCFAFAPSP-SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
            P  N+ I +D     C A   +P S LSIIGN QQ+   + +D     +G+ P  C
Sbjct: 396 FPVENYFIRLDPEEVVCLAILGTPRSALSIIGNYQQQNFHVLYDTKKSRLGYAPMNC 452


>gi|86438622|emb|CAJ26370.1| chloroplast nucleoid binding protein [Brachypodium sylvaticum]
          Length = 443

 Score =  235 bits (599), Expect = 4e-59,   Method: Compositional matrix adjust.
 Identities = 151/408 (37%), Positives = 209/408 (51%), Gaps = 34/408 (8%)

Query: 112 MQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYM 171
           ++ D  RV ++ R ++   A       QD       G+  G+G Y V +G+G+P R   +
Sbjct: 45  LEHDQARVDSIHRMIANETAVVG----QDVSLPAERGISVGTGNYVVSVGLGTPARDLTV 100

Query: 172 VIDSGSDIVWVQCQPCSQ--CYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAG-- 227
           V D+GSD+ WVQC PCS   CY Q DP+F P+ S++FS V C    C R   + C +   
Sbjct: 101 VFDTGSDLSWVQCGPCSSGGCYHQQDPLFAPSSSSTFSAVRCGEPECPRARQS-CSSSPG 159

Query: 228 --RCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVA-----------IGCGHKNQGMFVG 274
             RC YEV YGD S T G L  +TLT+G T   N +            GCG  N G+F  
Sbjct: 160 DDRCPYEVVYGDKSRTVGHLGNDTLTLGTTPSTNASENNSNKLPGFVFGCGENNTGLFGK 219

Query: 275 AAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREA-LPVGAAWVPLVR 333
           A GL GLG G +SL  Q  G+ G  FSYCL S  + + G L  G  A  P  A + P++ 
Sbjct: 220 ADGLFGLGRGKVSLSSQAAGKYGEGFSYCLPSSSSNAHGYLSLGTPAPAPAHARFTPMLN 279

Query: 334 NPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRD 393
               PSFYYV L G+ V G  I +S        +   G+++D+GT +TRL   AY A R 
Sbjct: 280 RSNTPSFYYVKLVGIRVAGRAIKVSSR----PALWPAGLIVDSGTVITRLAPRAYSALRT 335

Query: 394 AFVAQTGN--LPRASGVSIFDTCYNLSGF--VSVRVPTVSFYFSGGPVLTLPASNFLIPV 449
           AF++  G     RA  +SI DTCY+ +     +V +P V+  F+GG  +++  S  L  V
Sbjct: 336 AFLSAMGKYGYKRAPRLSILDTCYDFTAHANATVSIPAVALVFAGGATISVDFSGVLY-V 394

Query: 450 DDAGTFCFAFAPSPSGLS--IIGNIQQEGIQISFDGANGFVGFGPNVC 495
                 C AFAP+ +G S  I+GN QQ  + + +D     +GF    C
Sbjct: 395 AKVAQACLAFAPNGNGRSAGILGNTQQRTVAVVYDVGRQKIGFAAKGC 442


>gi|125539168|gb|EAY85563.1| hypothetical protein OsI_06935 [Oryza sativa Indica Group]
          Length = 514

 Score =  234 bits (598), Expect = 6e-59,   Method: Compositional matrix adjust.
 Identities = 150/379 (39%), Positives = 201/379 (53%), Gaps = 36/379 (9%)

Query: 145 VVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSA 204
           V SG+  GSGEY V + VG+PPR   M++D+GSD+ W+QC PC  C++Q  PVFDPA S 
Sbjct: 141 VESGVAVGSGEYLVDLYVGTPPRRFQMIMDTGSDLNWLQCAPCLDCFEQRGPVFDPATSL 200

Query: 205 SFSGVSCSSAVCDRLENA----GC---HAGRCRYEVSYGDGSYTKGTLALETLTIGRTV- 256
           S+  V+C    C  +        C   H+  C Y   YGD S T G LALE  T+  T  
Sbjct: 201 SYRNVTCGDPRCGLVAPPTAPRACRRPHSDPCPYYYWYGDQSNTTGDLALEAFTVNLTAP 260

Query: 257 -----VKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGS 311
                V +V  GCGH N+G+F GAAGLLGLG G++S   QL    G AFSYCLV  G+  
Sbjct: 261 GASRRVDDVVFGCGHSNRGLFHGAAGLLGLGRGALSFASQLRAVYGHAFSYCLVDHGSSV 320

Query: 312 SGSLVFGREALPVGAAWVPLVRNPR-------------APSFYYVGLSGLGVGGMRIPIS 358
              +VFG +   +G        +PR             A +FYYV L G+ VGG ++ IS
Sbjct: 321 GSKIVFGDDDALLG--------HPRLNYTAFAPSAAAAADTFYYVQLKGVLVGGEKLNIS 372

Query: 359 EDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGN-LPRASGVSIFDTCYNL 417
              + + + G  G ++D+GT ++    PAYE  R AFV +     P  +   +   CYN+
Sbjct: 373 PSTWDVGKDGSGGTIIDSGTTLSYFAEPAYEVIRRAFVERMDKAYPLVADFPVLSPCYNV 432

Query: 418 SGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSP-SGLSIIGNIQQEG 476
           SG   V VP  S  F+ G V   PA N+ + +D  G  C A   +P S +SIIGN QQ+ 
Sbjct: 433 SGVERVEVPEFSLLFADGAVWDFPAENYFVRLDPDGIMCLAVLGTPRSAMSIIGNFQQQN 492

Query: 477 IQISFDGANGFVGFGPNVC 495
             + +D  N  +GF P  C
Sbjct: 493 FHVLYDLQNNRLGFAPRRC 511


>gi|115445765|ref|NP_001046662.1| Os02g0314600 [Oryza sativa Japonica Group]
 gi|46391044|dbj|BAD15987.1| putative chloroplast nucleoid DNA binding protein [Oryza sativa
           Japonica Group]
 gi|113536193|dbj|BAF08576.1| Os02g0314600 [Oryza sativa Japonica Group]
 gi|125581836|gb|EAZ22767.1| hypothetical protein OsJ_06441 [Oryza sativa Japonica Group]
 gi|215697168|dbj|BAG91162.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 514

 Score =  234 bits (598), Expect = 7e-59,   Method: Compositional matrix adjust.
 Identities = 150/379 (39%), Positives = 201/379 (53%), Gaps = 36/379 (9%)

Query: 145 VVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSA 204
           V SG+  GSGEY V + VG+PPR   M++D+GSD+ W+QC PC  C++Q  PVFDPA S 
Sbjct: 141 VESGVAVGSGEYLVDLYVGTPPRRFQMIMDTGSDLNWLQCAPCLDCFEQRGPVFDPAASL 200

Query: 205 SFSGVSCSSAVCDRLENA----GC---HAGRCRYEVSYGDGSYTKGTLALETLTIGRTV- 256
           S+  V+C    C  +        C   H+  C Y   YGD S T G LALE  T+  T  
Sbjct: 201 SYRNVTCGDPRCGLVAPPTAPRACRRPHSDPCPYYYWYGDQSNTTGDLALEAFTVNLTAP 260

Query: 257 -----VKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGS 311
                V +V  GCGH N+G+F GAAGLLGLG G++S   QL    G AFSYCLV  G+  
Sbjct: 261 GASRRVDDVVFGCGHSNRGLFHGAAGLLGLGRGALSFASQLRAVYGHAFSYCLVDHGSSV 320

Query: 312 SGSLVFGREALPVGAAWVPLVRNPR-------------APSFYYVGLSGLGVGGMRIPIS 358
              +VFG +   +G        +PR             A +FYYV L G+ VGG ++ IS
Sbjct: 321 GSKIVFGDDDALLG--------HPRLNYTAFAPSAAAAADTFYYVQLKGVLVGGEKLNIS 372

Query: 359 EDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGN-LPRASGVSIFDTCYNL 417
              + + + G  G ++D+GT ++    PAYE  R AFV +     P  +   +   CYN+
Sbjct: 373 PSTWDVGKDGSGGTIIDSGTTLSYFAEPAYEVIRRAFVERMDKAYPLVADFPVLSPCYNV 432

Query: 418 SGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSP-SGLSIIGNIQQEG 476
           SG   V VP  S  F+ G V   PA N+ + +D  G  C A   +P S +SIIGN QQ+ 
Sbjct: 433 SGVERVEVPEFSLLFADGAVWDFPAENYFVRLDPDGIMCLAVLGTPRSAMSIIGNFQQQN 492

Query: 477 IQISFDGANGFVGFGPNVC 495
             + +D  N  +GF P  C
Sbjct: 493 FHVLYDLQNNRLGFAPRRC 511


>gi|413944392|gb|AFW77041.1| hypothetical protein ZEAMMB73_800604 [Zea mays]
          Length = 476

 Score =  234 bits (597), Expect = 8e-59,   Method: Compositional matrix adjust.
 Identities = 136/349 (38%), Positives = 184/349 (52%), Gaps = 14/349 (4%)

Query: 155 EYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS-QCYKQSDPVFDPADSASFSGVSCSS 213
           E+ V +G G+P ++  ++ D+GSD+ W+QC PCS  CYKQ DP+FDP  SA++S V C  
Sbjct: 134 EFVVTVGFGTPAQTYTVIFDTGSDVSWIQCLPCSGHCYKQHDPIFDPTKSATYSVVPCGH 193

Query: 214 AVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQGMF 272
             C   + + C  G C Y+V YGDGS + G L+ ETL++  T  +   A GCG  N G F
Sbjct: 194 PQCAAADGSKCSNGTCLYKVEYGDGSSSAGVLSHETLSLTSTRALPGFAFGCGQTNLGDF 253

Query: 273 VGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFG--REALPVGAAWVP 330
               GL+GLG G +SL  Q     GG FSYCL S  T + G L  G    A      +  
Sbjct: 254 GDVDGLIGLGRGQLSLSSQAAASFGGTFSYCLPSDNT-THGYLTIGPTTPASNDDVQYTA 312

Query: 331 LVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEA 390
           +V+    PSFY+V L  + +GG  +P+   LF      DDG  +D+GT +T LP  AY A
Sbjct: 313 MVQKQDYPSFYFVELVSIDIGGYILPVPPTLFT-----DDGTFLDSGTILTYLPPEAYTA 367

Query: 391 FRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVD 450
            RD F         A     FDTCY+ +G  ++ +P VSF FS G V  L     LI  D
Sbjct: 368 LRDRFKFTMTQYKPAPAYDPFDTCYDFTGQSAIFIPAVSFKFSDGSVFDLSFFGILIFPD 427

Query: 451 DAGTF--CFAFAPSPSGL--SIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           D      C  F   PS +  +I+GN+QQ   ++ +D A   +GF    C
Sbjct: 428 DTAPAIGCLGFVARPSAMPFTIVGNMQQRNTEVIYDVAAEKIGFASASC 476


>gi|356569916|ref|XP_003553140.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 560

 Score =  234 bits (596), Expect = 9e-59,   Method: Compositional matrix adjust.
 Identities = 138/373 (36%), Positives = 199/373 (53%), Gaps = 25/373 (6%)

Query: 147 SGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASF 206
           SG+  GSGEYF+ + VG+PP+   +++D+GSD+ W+QC PC  C++Q+ P +DP DS+SF
Sbjct: 186 SGVSLGSGEYFMDVFVGTPPKHFSLILDTGSDLNWIQCVPCYACFEQNGPYYDPKDSSSF 245

Query: 207 SGVSCSSAVC------DRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT----- 255
             ++C    C      D  +        C Y   YGD S T G  ALET T+  T     
Sbjct: 246 KNITCHDPRCQLVSSPDPPQPCKGETQSCPYFYWYGDSSNTTGDFALETFTVNLTTPEGK 305

Query: 256 ----VVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGS 311
               +V+NV  GCGH N+G+F GAAGLLGLG G +S   QL    G +FSYCLV R + S
Sbjct: 306 PELKIVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFATQLQSLYGHSFSYCLVDRNSNS 365

Query: 312 SGS--LVFGREALPVG------AAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFR 363
           S S  L+FG +   +        ++V    NP   +FYYV +  + VGG  + I E+ + 
Sbjct: 366 SVSSKLIFGEDKELLSHPNLNFTSFVGGKENP-VDTFYYVLIKSIMVGGEVLKIPEETWH 424

Query: 364 LTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSV 423
           L+  G  G ++D+GT +T    PAYE  ++AF+ +    P          CYN+SG   +
Sbjct: 425 LSAQGGGGTIIDSGTTLTYFAEPAYEIIKEAFMRKIKGFPLVETFPPLKPCYNVSGVEKM 484

Query: 424 RVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSP-SGLSIIGNIQQEGIQISFD 482
            +P  +  F+ G +   P  N+ I ++     C A   +P S LSIIGN QQ+   I +D
Sbjct: 485 ELPEFAILFADGAMWDFPVENYFIQIEPEDVVCLAILGTPRSALSIIGNYQQQNFHILYD 544

Query: 483 GANGFVGFGPNVC 495
                +G+ P  C
Sbjct: 545 LKKSRLGYAPMKC 557


>gi|195607464|gb|ACG25562.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
          Length = 478

 Score =  234 bits (596), Expect = 9e-59,   Method: Compositional matrix adjust.
 Identities = 164/433 (37%), Positives = 228/433 (52%), Gaps = 40/433 (9%)

Query: 81  LELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQD 140
           L L HR    + S  ++         S    ++ D +R   ++RR+SG        +   
Sbjct: 68  LRLTHRHGPCAPSRASS-----LAAPSVADTLRADQRRAEYILRRVSGRAPQLWDSKAAA 122

Query: 141 FGTDVVS--GMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS---QCYKQSD 195
               V +  G D G+  Y V   +G+P  +Q M +D+GSD+ WVQC+PCS    CY Q D
Sbjct: 123 AAATVPASWGYDIGTLNYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCSAAPSCYSQKD 182

Query: 196 PVFDPADSASFSGVSCSSAVCDRL---ENAGCHAGRCRYEVSYGDGSYTKGTLALETLTI 252
           P+FDPA S+S++ V C   VC  L     + C A +C Y VSYGDGS T G  + +TLT+
Sbjct: 183 PLFDPAQSSSYAAVPCGGPVCAGLGIYAASACSAAQCGYVVSYGDGSNTTGVYSSDTLTL 242

Query: 253 -GRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGS 311
              + V+    GCGH   G+F G  GLLGLG    SLV Q  G  GG FSYCL ++ + +
Sbjct: 243 SASSAVQGFFFGCGHAQSGLFNGVDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPS-T 301

Query: 312 SGSLVFGREALPVGAA----WVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQM 367
           +G L  G    P GAA       L+ +P AP++Y V L+G+ VGG ++ +    F     
Sbjct: 302 AGYLTLGLGG-PSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFA---- 356

Query: 368 GDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGN--LPRASGVSIFDTCYNLSGFVSVRV 425
              G V+DTGT +TRLP  AY A R AF +   +   P A    I DTCYN +G+ +V +
Sbjct: 357 --GGTVVDTGTVITRLPPTAYAALRSAFRSGMASYGYPTAPSNGILDTCYNFAGYGTVTL 414

Query: 426 PTVSFYFSGGPVLTLPASNFLIPVDDAGTF-CFAFAPSPS--GLSIIGNIQQEGIQISFD 482
           P V+  F  G  + L A   L       +F C AFAPS S  G++I+GN+QQ   ++  D
Sbjct: 415 PNVALTFGSGATVMLGADGIL-------SFGCLAFAPSGSDGGMAILGNVQQRSFEVRID 467

Query: 483 GANGFVGFGPNVC 495
           G +  VGF P+ C
Sbjct: 468 GTS--VGFKPSSC 478


>gi|224286173|gb|ACN40797.1| unknown [Picea sitchensis]
          Length = 383

 Score =  234 bits (596), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 145/393 (36%), Positives = 215/393 (54%), Gaps = 22/393 (5%)

Query: 112 MQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYM 171
           M+R ++R    + +L    A    H+++D  T V    D GSGEY +++ +G+P  S   
Sbjct: 1   MKRAIQRSQERLEKLQITSA-VNTHQMKDIETPVTP--DIGSGEYLIQMAIGTPALSLSA 57

Query: 172 VIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHA-GRCR 230
           ++D+GSD+VW +C PC+ C   +  ++DP+ S+++S V C S++C       C+  G C 
Sbjct: 58  IMDTGSDLVWTKCNPCTDC--STSSIYDPSSSSTYSKVLCQSSLCQPPSIFSCNNDGDCE 115

Query: 231 YEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVG 290
           Y   YGD S T G L+ ET +I    + N+  GCGH NQG F    GL+G G GS+SLV 
Sbjct: 116 YVYPYGDRSSTSGILSDETFSISSQSLPNITFGCGHDNQG-FDKVGGLVGFGRGSLSLVS 174

Query: 291 QLGGQTGGAFSYCLVSRGTGSSGSLVF-----GREALPVGAAWVPLVRNPRAPSFYYVGL 345
           QLG   G  FSYCLVSR   S  S +F       EA  VG+   PLV++  + + YY+ L
Sbjct: 175 QLGPSMGNKFSYCLVSRTDSSKTSPLFIGNTASLEATTVGS--TPLVQS-SSTNHYYLSL 231

Query: 346 SGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRA 405
            G+ VGG  + I    F +   G  G+++D+GT +T L   AY+A ++A V+   NLP+A
Sbjct: 232 EGISVGGQSLAIPTGTFDIQSDGSGGLIIDSGTTLTFLQQTAYDAVKEAMVSSI-NLPQA 290

Query: 406 SGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSG 465
            G    D C+N  G  +   P+++F+F G     +P  N+L P   +   C A  P+ S 
Sbjct: 291 DGQ--LDLCFNQQGSSNPGFPSMTFHFKGAD-YDVPKENYLFPDSTSDIVCLAMMPTNSN 347

Query: 466 L---SIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           L   +I GN+QQ+  QI +D  N  + F P  C
Sbjct: 348 LGNMAIFGNVQQQNYQILYDNENNVLSFAPTAC 380


>gi|7715602|gb|AAF68120.1|AC010793_15 F20B17.14 [Arabidopsis thaliana]
 gi|12324588|gb|AAG52249.1|AC011717_17 putative aspartyl protease; 105611-106921 [Arabidopsis thaliana]
          Length = 436

 Score =  233 bits (595), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 142/402 (35%), Positives = 217/402 (53%), Gaps = 32/402 (7%)

Query: 111 RMQR----DVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPP 166
           +M+R    D  RV +L  ++    +   +  V +    + SG+   S  Y V + +G   
Sbjct: 38  KMRRALVLDNIRVQSLQLKIKAMTSSTTEQSVSETQIPLTSGIKLESLNYIVTVELGG-- 95

Query: 167 RSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHA 226
           ++  +++D+GSD+ WVQCQPC  CY Q  P++DP+ S+S+  V C+S+ C  L  A  ++
Sbjct: 96  KNMSLIVDTGSDLTWVQCQPCRSCYNQQGPLYDPSVSSSYKTVFCNSSTCQDLVAATSNS 155

Query: 227 GRC-----------RYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGA 275
           G C            Y VSYGDGSYT+G LA E++ +G T ++N   GCG  N+G+F G+
Sbjct: 156 GPCGGNNGVVKTPCEYVVSYGDGSYTRGDLASESILLGDTKLENFVFGCGRNNKGLFGGS 215

Query: 276 AGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREAL----PVGAAWVPL 331
           +GL+GLG  S+SLV Q      G FSYCL S   G+SGSL FG ++         ++ PL
Sbjct: 216 SGLMGLGRSSVSLVSQTLKTFNGVFSYCLPSLEDGASGSLSFGNDSSVYTNSTSVSYTPL 275

Query: 332 VRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAF 391
           V+NP+  SFY + L+G  +GG+ +  S    R       G+++D+GT +TRLP   Y+A 
Sbjct: 276 VQNPQLRSFYILNLTGASIGGVELK-SSSFGR-------GILIDSGTVITRLPPSIYKAV 327

Query: 392 RDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASN-FLIPVD 450
           +  F+ Q    P A G SI DTC+NL+ +  + +P +   F G   L +  +  F     
Sbjct: 328 KIEFLKQFSGFPTAPGYSILDTCFNLTSYEDISIPIIKMIFQGNAELEVDVTGVFYFVKP 387

Query: 451 DAGTFCFAFA--PSPSGLSIIGNIQQEGIQISFDGANGFVGF 490
           DA   C A A     + + IIGN QQ+  ++ +D     +G 
Sbjct: 388 DASLVCLALASLSYENEVGIIGNYQQKNQRVIYDTTQERLGI 429


>gi|356524289|ref|XP_003530762.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-2-like [Glycine max]
          Length = 392

 Score =  233 bits (594), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 141/398 (35%), Positives = 211/398 (53%), Gaps = 21/398 (5%)

Query: 112 MQRDVKRVATLVRRLSGG-GADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQY 170
           M  D +RV  +  RLS   G +    ++        SG   GS  Y V +G+G+P R   
Sbjct: 1   MNLDNERVKYIQSRLSKNLGRENTVKDLDSTTLPAESGSLIGSANYVVVVGLGTPKRDLS 60

Query: 171 MVIDSGSDIVWVQCQPCS-QCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHA--- 226
           +V D+GSD+ W QC+PC+  CYKQ D +FDP+ S+S++ ++C+S++C +L + G  +   
Sbjct: 61  LVFDTGSDLTWTQCEPCAGSCYKQQDAIFDPSKSSSYTNITCTSSLCTQLTSDGIKSECS 120

Query: 227 ----GRCRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQGMFVGAAGLLGL 281
                 C Y+  YGD S + G L+ E LTI  T +V +   GCG  N+G+F G+AGL+GL
Sbjct: 121 SSTDASCIYDAKYGDNSTSVGFLSQERLTITATDIVDDFLFGCGQDNEGLFNGSAGLMGL 180

Query: 282 GGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGA-AWVPLVRNPRAPSF 340
           G   +S+V Q        FSYCL +  + S G L FG  A    +  + PL       SF
Sbjct: 181 GRHPISIVQQTSSNYNKIFSYCLPATSS-SLGHLTFGASAATNASLIYTPLSTISGDNSF 239

Query: 341 YYVGLSGLGVGGMRIP-ISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQT 399
           Y + +  + VGG ++P +S   F        G ++D+GT +TRL    Y A R AF    
Sbjct: 240 YGLDIVSISVGGTKLPAVSSSTFSA-----GGSIIDSGTVITRLAPTVYAALRSAFRRXM 294

Query: 400 GNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAF 459
              P A+   + DTCY+LSG+  + VP + F FSGG  + L     L  V+     C AF
Sbjct: 295 EKYPVANEAGLLDTCYDLSGYKEISVPRIDFEFSGGVTVELXHRGILX-VESEQQVCLAF 353

Query: 460 AP--SPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           A   S + +++ GN+QQ+ +++ +D   G +GFG   C
Sbjct: 354 AANGSDNDITVFGNVQQKTLEVVYDVKGGRIGFGAAGC 391


>gi|30678047|ref|NP_565298.2| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|30102688|gb|AAP21262.1| At2g03200 [Arabidopsis thaliana]
 gi|110736021|dbj|BAE99983.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|330250580|gb|AEC05674.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 461

 Score =  233 bits (593), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 137/402 (34%), Positives = 213/402 (52%), Gaps = 23/402 (5%)

Query: 111 RMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQY 170
           ++QR + R    + RL G  A  A     D   ++ +    GSGE+ + + +G+P     
Sbjct: 63  KIQRGINRGFHRLNRL-GAVAVLAVASKPDDTNNIKAPTHGGSGEFLMELSIGNPAVKYS 121

Query: 171 MVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGR-- 228
            ++D+GSD++W QC+PC++C+ Q  P+FDP  S+S+S V CSS +C+ L  + C+  +  
Sbjct: 122 AIVDTGSDLIWTQCKPCTECFDQPTPIFDPEKSSSYSKVGCSSGLCNALPRSNCNEDKDA 181

Query: 229 CRYEVSYGDGSYTKGTLALETLTI-GRTVVKNVAIGCGHKNQGM-FVGAAGLLGLGGGSM 286
           C Y  +YGD S T+G LA ET T      +  +  GCG +N+G  F   +GL+GLG G +
Sbjct: 182 CEYLYTYGDYSSTRGLLATETFTFEDENSISGIGFGCGVENEGDGFSQGSGLVGLGRGPL 241

Query: 287 SLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALPV----GAAW-------VPLVRN 334
           SL+ QL       FSYCL S   + +S SL  G  A  +    GA+        + L+RN
Sbjct: 242 SLISQLKET---KFSYCLTSIEDSEASSSLFIGSLASGIVNKTGASLDGEVTKTMSLLRN 298

Query: 335 PRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDA 394
           P  PSFYY+ L G+ VG  R+ + +  F L + G  G+++D+GT +T L   A++  ++ 
Sbjct: 299 PDQPSFYYLELQGITVGAKRLSVEKSTFELAEDGTGGMIIDSGTTITYLEETAFKVLKEE 358

Query: 395 FVAQTGNLPRASGVSIFDTCYNL-SGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAG 453
           F ++       SG +  D C+ L     ++ VP + F+F G   L LP  N+++     G
Sbjct: 359 FTSRMSLPVDDSGSTGLDLCFKLPDAAKNIAVPKMIFHFKGAD-LELPGENYMVADSSTG 417

Query: 454 TFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
             C A   S +G+SI GN+QQ+   +  D     V F P  C
Sbjct: 418 VLCLAMG-SSNGMSIFGNVQQQNFNVLHDLEKETVSFVPTEC 458


>gi|255568540|ref|XP_002525244.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223535541|gb|EEF37210.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 460

 Score =  233 bits (593), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 139/401 (34%), Positives = 221/401 (55%), Gaps = 24/401 (5%)

Query: 110 ARMQRDVKRVATLVRRLSGGGADAAKH------EVQDFGTDVVSGMDQGSGEYFVRIGVG 163
           +R +  VK +++ +R+    GA  ++H      E       +  G+  GSG Y++++G+G
Sbjct: 68  SRDEEHVKFLSSRLRKKDVQGASFSRHKSGHLLEPNSANIPLNPGLSIGSGNYYLKLGLG 127

Query: 164 SPPRSQYMVIDSGSDIVWVQCQPC-SQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENA 222
           SPP+   M++D+GS + W+QC+PC   C+ Q DP+F+P+ S ++  + CSS+ C  L+ A
Sbjct: 128 SPPKYYTMILDTGSSLSWLQCKPCVVYCHSQVDPLFEPSASNTYRPLYCSSSECSLLKAA 187

Query: 223 G-----CHA-GRCRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQGMFVGA 275
                 C A G C Y  SYGD SY+ G L+ + LT+  +  + +   GCG  N+G+F  A
Sbjct: 188 TLNDPLCTASGVCVYTASYGDASYSMGYLSRDLLTLTPSQTLPSFTYGCGQDNEGLFGKA 247

Query: 276 AGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNP 335
           AG++GL    +S++ QL  + G AFSYCL +  +   G L  G+ + P    + P++RN 
Sbjct: 248 AGIVGLARDKLSMLAQLSPKYGYAFSYCLPTSTSSGGGFLSIGKIS-PSSYKFTPMIRNS 306

Query: 336 RAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAF 395
           + PS Y++ L+ + V G  + ++   +++        ++D+GT VTRLP   Y A R+AF
Sbjct: 307 QNPSLYFLRLAAITVAGRPVGVAAAGYQVP------TIIDSGTVVTRLPISIYAALREAF 360

Query: 396 VA-QTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGT 454
           V   +    +A   SI DTC+  S       P +   F GG  L+L A N LI  D  G 
Sbjct: 361 VKIMSRRYEQAPAYSILDTCFKGSLKSMSGAPEIRMIFQGGADLSLRAPNILIEADK-GI 419

Query: 455 FCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
            C AFA S + ++IIGN QQ+   I++D +   +GF P  C
Sbjct: 420 ACLAFA-SSNQIAIIGNHQQQTYNIAYDVSASKIGFAPGGC 459


>gi|125555058|gb|EAZ00664.1| hypothetical protein OsI_22685 [Oryza sativa Indica Group]
          Length = 465

 Score =  233 bits (593), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 164/458 (35%), Positives = 236/458 (51%), Gaps = 40/458 (8%)

Query: 61  FERHNNISSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVA 120
           FE     S+S+ +SD  R ++ LVHR    + S  +        + S   R++RD  R  
Sbjct: 25  FEPEAACSTSSANSDPNRASVPLVHRHGPCAPSAASGG------KPSLAERLRRDRARAN 78

Query: 121 TLVRRLSGGGADAA--KHEVQDFGTDVVS--GMDQGSGEYFVRIGVGSPPRSQYMVIDSG 176
            +V + +GG   A      V   GT + +  G    S EY V +G+G+P   Q ++ID+G
Sbjct: 79  YIVTKAAGGRTAATAVSDAVGGGGTSIPTFLGDSVDSLEYVVTLGIGTPAVQQIVLIDTG 138

Query: 177 SDIVWVQCQPCS--QCYKQSDPVFDPADSASFSGVSCSSAVCDRLENA----GCHAGR-- 228
           SD+ WVQC+PC   +CY Q DP+FDP+ S+S++ V C S  C +L       GC +G   
Sbjct: 139 SDLSWVQCKPCGAGECYAQKDPLFDPSSSSSYASVPCDSDACRKLAAGAYGHGCTSGAAA 198

Query: 229 -CRYEVSYGDGSYTKGTLALETLTIGR-TVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSM 286
            C Y + YG+ + T G  + ETLT+    VV +   GCG    G +    GLLGLGG   
Sbjct: 199 LCEYGIEYGNRATTTGVYSTETLTLKPGVVVADFGFGCGDHQHGPYEKFDGLLGLGGAPE 258

Query: 287 SLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGRE------ALPVGAAWVPLVRNPRAPSF 340
           SLV Q   Q GG FSYCL    +G +G L  G            G  + P+ R P  P+F
Sbjct: 259 SLVSQTSSQFGGPFSYCLPPT-SGGAGFLALGAPNSSSSSTAAAGFLFTPMRRIPSVPTF 317

Query: 341 YYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAF---VA 397
           Y V L+G+ VGG  + +    F        G+V+D+GT +T LP  AY A R AF   ++
Sbjct: 318 YVVTLTGISVGGAPLAVPPSAF------SSGMVIDSGTVITGLPATAYAALRSAFRSAMS 371

Query: 398 QTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCF 457
           +   LP ++G ++ DTCY+ +G  +V VPT++  FSGG  + L A+   + VD  G   F
Sbjct: 372 EYRLLPPSNG-AVLDTCYDFTGHTNVTVPTIALTFSGGATIDL-ATPAGVLVD--GCLAF 427

Query: 458 AFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           A A +   + IIGN+ Q   ++ +D   G VGF    C
Sbjct: 428 AGAGTDDTIGIIGNVNQRTFEVLYDSGKGTVGFRAGAC 465


>gi|168005153|ref|XP_001755275.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162693403|gb|EDQ79755.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 429

 Score =  232 bits (592), Expect = 3e-58,   Method: Compositional matrix adjust.
 Identities = 155/420 (36%), Positives = 213/420 (50%), Gaps = 24/420 (5%)

Query: 79  WNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEV 138
           +  EL++R+  SS   +            F A ++R  +R A L + +  G         
Sbjct: 28  FRAELIYREHQSSPLRSET---LKTPSEIFIAAVKRGHERRARLAKHVLAGD-------- 76

Query: 139 QDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVF 198
           Q F T V SG    +GEY + I  G+PP+    ++D+GSD+ WVQC PC  CY+     F
Sbjct: 77  QLFETPVASG----NGEYLIDISYGNPPQKSTAIVDTGSDLNWVQCLPCKSCYETLSAKF 132

Query: 199 DPADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVK 258
           DP+ SAS+  + C S  C  L    C A  C+Y+  YGDGS T G L+ + +TIG   + 
Sbjct: 133 DPSKSASYKTLGCGSNFCQDLPFQSC-AASCQYDYMYGDGSSTSGALSTDDVTIGTGKIP 191

Query: 259 NVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFG 318
           NVA GCG+ N G F GA GL+GLG G +SLV QLGG     FSYCLV  G+  +  L  G
Sbjct: 192 NVAFGCGNSNLGTFAGAGGLVGLGKGPLSLVSQLGGTATKKFSYCLVPLGSTKTSPLYIG 251

Query: 319 REALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGT 378
              L  G A+ P++ N   P+FYY  L G+ V G  +    + F +   G  G+++D+GT
Sbjct: 252 DSTLAGGVAYTPMLTNNNYPTFYYAELQGISVEGKAVNYPANTFDIAATGRGGLILDSGT 311

Query: 379 AVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIF---DTCYNLSGFVSVRVPTVSFYFSGG 435
            +T L     +AF     A    LP       F   + C++ +G  +   PTV F+F+G 
Sbjct: 312 TLTYLDV---DAFNPMVAALKAALPYPEADGSFYGLEYCFSTAGVANPTYPTVVFHFNGA 368

Query: 436 PVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
            V   P + F I +D  GT C A A S +G SI GNIQQ    I  D  N  +GF    C
Sbjct: 369 DVALAPDNTF-IALDFEGTTCLAMA-SSTGFSIFGNIQQLNHVIVHDLVNKRIGFKSANC 426


>gi|225217022|gb|ACN85307.1| aspartic proteinase nepenthesin-1 precursor [Oryza ridleyi]
          Length = 525

 Score =  232 bits (591), Expect = 4e-58,   Method: Compositional matrix adjust.
 Identities = 136/352 (38%), Positives = 196/352 (55%), Gaps = 16/352 (4%)

Query: 152 GSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQ-CYKQSDPVFDPADSASFSGVS 210
           G+G Y V IG+G+P     +V D+GSD  WVQC+PC   CY+Q + +FDPA S++ + +S
Sbjct: 182 GTGNYVVTIGLGTPAGRYTVVFDTGSDTTWVQCEPCVVVCYEQQEKLFDPARSSTDANIS 241

Query: 211 CSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGR-TVVKNVAIGCGHKNQ 269
           C++  C  L   GC  G C Y V YGDGSY+ G  A++TLT+     +K    GCG +N+
Sbjct: 242 CAAPACSDLYTKGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYDAIKGFRFGCGERNE 301

Query: 270 GMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWV 329
           G+F  AAGLLGLG G  SL  Q   + GG F++C  +R +G +G L FG  + P  +  +
Sbjct: 302 GLFGEAAGLLGLGRGKTSLPVQAYDKYGGVFAHCFPARSSG-TGYLDFGPGSSPAVSTKL 360

Query: 330 --PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPA 387
             P++ +    +FYYVGL+G+ VGG  + I   +F        G ++D+GT +TRLP  A
Sbjct: 361 TTPMLVD-NGLTFYYVGLTGIRVGGKLLSIPPSVFTTA-----GTIVDSGTVITRLPPAA 414

Query: 388 YEAFRDAFVAQTG--NLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNF 445
           Y + R AF +        +A  +S+ DTCY+ +G   V +PTVS  F GG  L + AS  
Sbjct: 415 YSSLRSAFASAIAARGYKKAPALSLLDTCYDFTGMSQVAIPTVSLLFQGGASLDVDASG- 473

Query: 446 LIPVDDAGTFCFAFAPSPS--GLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           +I        C  FA +     + I+GN Q +   + +D     VGF P  C
Sbjct: 474 IIYAASVSQACLGFAANEEDDDVGIVGNTQLKTFGVVYDIGKKVVGFSPGAC 525


>gi|226492391|ref|NP_001140482.1| uncharacterized protein LOC100272542 precursor [Zea mays]
 gi|224030447|gb|ACN34299.1| unknown [Zea mays]
 gi|414887506|tpg|DAA63520.1| TPA: hypothetical protein ZEAMMB73_432695 [Zea mays]
          Length = 512

 Score =  232 bits (591), Expect = 4e-58,   Method: Compositional matrix adjust.
 Identities = 146/375 (38%), Positives = 204/375 (54%), Gaps = 24/375 (6%)

Query: 145 VVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSA 204
           V SG+  GS EY + + VG+PPR   M++D+GSD+ W+QC PC  C++Q  PVFDPA S+
Sbjct: 135 VESGVAVGSAEYLMDVYVGTPPRRFQMIMDTGSDLNWLQCAPCLDCFEQRGPVFDPAASS 194

Query: 205 SFSGVSCSSAVCDRL------ENAGCH---AGRCRYEVSYGDGSYTKGTLALETLTIGRT 255
           S+  ++C    C  +          C       C Y   YGD S + G LALE+ T+  T
Sbjct: 195 SYRNLTCGDPRCGHVAPPEAPAPRACRRPGEDPCPYYYWYGDQSNSTGDLALESFTVNLT 254

Query: 256 V------VKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGG-AFSYCLVSRG 308
                  V  V  GCGH+N+G+F GAAGLLGLG G +S   QL    GG  FSYCLV  G
Sbjct: 255 APGASSRVDGVVFGCGHRNRGLFHGAAGLLGLGRGPLSFASQLRAVYGGHTFSYCLVDHG 314

Query: 309 TGSSGSLVFGREALPVGAAWVPLVRNPRAP------SFYYVGLSGLGVGGMRIPISEDLF 362
           +  +  +VFG +     AA   L     AP      +FYYV L+G+ VGG  + IS D +
Sbjct: 315 SDVASKVVFGEDDALALAAHPRLKYTAFAPASSPADTFYYVRLTGVLVGGELLNISSDTW 374

Query: 363 RLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQ-TGNLPRASGVSIFDTCYNLSGFV 421
             ++ G  G ++D+GT ++    PAY+  R AF+ + +G+ P      +   CYN+SG  
Sbjct: 375 DASEGGSGGTIIDSGTTLSYFVEPAYQVIRRAFIDRMSGSYPPVPDFPVLSPCYNVSGVE 434

Query: 422 SVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSP-SGLSIIGNIQQEGIQIS 480
              VP +S  F+ G V   PA N+ I +D  G  C A   +P +G+SIIGN QQ+   ++
Sbjct: 435 RPEVPELSLLFADGAVWDFPAENYFIRLDPDGIMCLAVLGTPRTGMSIIGNFQQQNFHVA 494

Query: 481 FDGANGFVGFGPNVC 495
           +D  N  +GF P  C
Sbjct: 495 YDLHNNRLGFAPRRC 509


>gi|115479485|ref|NP_001063336.1| Os09g0452400 [Oryza sativa Japonica Group]
 gi|51535935|dbj|BAD38017.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
 gi|113631569|dbj|BAF25250.1| Os09g0452400 [Oryza sativa Japonica Group]
 gi|215693279|dbj|BAG88661.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 441

 Score =  232 bits (591), Expect = 4e-58,   Method: Compositional matrix adjust.
 Identities = 135/361 (37%), Positives = 193/361 (53%), Gaps = 27/361 (7%)

Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSS 213
           GEY + +G+GSPPR    +ID+GSD++W QC PC  C +Q  P F+PA S S++ + CSS
Sbjct: 86  GEYLMDVGIGSPPRYFSAMIDTGSDLIWTQCAPCLLCVEQPTPYFEPAKSTSYASLPCSS 145

Query: 214 AVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIG----RTVVKNVAIGCGHKNQ 269
           A+C+ L +  C    C Y+  YGD + + G LA ET T G    R  V  V+ GCG+ N 
Sbjct: 146 AMCNALYSPLCFQNACVYQAFYGDSASSAGVLANETFTFGTNSTRVAVPRVSFGCGNMNA 205

Query: 270 GMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREAL------- 322
           G     +G++G G G++SLV QLG      FSYCL S  + ++  L FG  A        
Sbjct: 206 GTLFNGSGMVGFGRGALSLVSQLGSPR---FSYCLTSFMSPATSRLYFGAYATLNSTNTS 262

Query: 323 ---PVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQM-GDDGVVMDTGT 378
              PV +   P + NP  P+ Y++ ++G+ V G  +PI   +F + +  G  GV++D+GT
Sbjct: 263 SSGPVQS--TPFIVNPALPTMYFLNMTGISVAGDLLPIDPSVFAINETDGTGGVIIDSGT 320

Query: 379 AVTRLPTPAYEAFRDAFVAQTGNLPRASGV--SIFDTCYNLSGFVS--VRVPTVSFYFSG 434
            VT L  PAY   + AFVA  G LPRA+      FDTC+         V +P +  +F G
Sbjct: 321 TVTFLAQPAYAMVQGAFVAWVG-LPRANATPSDTFDTCFKWPPPPRRMVTLPEMVLHFDG 379

Query: 435 GPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNV 494
              + LP  N+++     G  C A  PS  G SIIG+ Q +   + +D  N  + F P  
Sbjct: 380 AD-MELPLENYMVMDGGTGNLCLAMLPSDDG-SIIGSFQHQNFHMLYDLENSLLSFVPAP 437

Query: 495 C 495
           C
Sbjct: 438 C 438


>gi|217074470|gb|ACJ85595.1| unknown [Medicago truncatula]
 gi|388505166|gb|AFK40649.1| unknown [Medicago truncatula]
          Length = 452

 Score =  232 bits (591), Expect = 4e-58,   Method: Compositional matrix adjust.
 Identities = 148/467 (31%), Positives = 242/467 (51%), Gaps = 54/467 (11%)

Query: 60  LFERHNNISSSNTSS----DEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRD 115
            F  H  I+SS   S     +    L+L     + S  N+T+ +        F     +D
Sbjct: 8   FFSAHLAIASSLKDSGLKHKQPDMQLKLYPMTSLKSPPNSTSLL--------FAYMFAKD 59

Query: 116 VKRVATLVRRLSGGG-ADAAKHEV--QDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMV 172
            +R+     RL+    A+A+  +V  +  G  + SG+  GSG Y+V++G+GSP +   M+
Sbjct: 60  EERIRYFHSRLAKNSDANASFKKVGPKLAGIPLKSGLSMGSGNYYVKMGLGSPTKYYTMI 119

Query: 173 IDSGSDIVWVQCQPCS-QCYKQSDPVFDPADSASFSGVSC-------------SSAVCDR 218
           +D+GS   W+QCQPC+  C+ Q DPVF+P+ S ++  V C             +   C +
Sbjct: 120 VDTGSSFSWLQCQPCTIYCHIQEDPVFNPSASKTYKTVPCSSSQCSSLKSATLNEPTCSK 179

Query: 219 LENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQGMFVGAAG 277
             NA      C Y+ SYGD S++ G L+ + LT+  +  + +   GCG  NQG+F    G
Sbjct: 180 QSNA------CVYKASYGDSSFSLGYLSQDVLTLTPSQTLSSFVYGCGQDNQGLFGRTDG 233

Query: 278 LLGLGGGSMSLVGQLGGQTGGAFSYCLVSR----GTGSSGSLVFGREALPVGAAW--VPL 331
           ++GL    +S++ QL G+ G AFSYCL +      +   G L  G  +L   +++   PL
Sbjct: 234 IIGLANNELSMLSQLSGKYGNAFSYCLPTSFSTPNSPKEGFLSIGTSSLTPSSSYKFTPL 293

Query: 332 VRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAF 391
           ++NP  PS Y++ L  + V G  + ++   +++        ++D+GT +TRLPTP Y   
Sbjct: 294 LKNPNNPSLYFIDLESITVAGRPLGVAASSYKVP------TIIDSGTVITRLPTPVYTTL 347

Query: 392 RDAFVA-QTGNLPRASGVSIFDTCY--NLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIP 448
           ++A+V   +    +A G+S+ DTC+  +L+G +S   P +   F GG  L L   N L+ 
Sbjct: 348 KNAYVTILSKKYQQAPGISLLDTCFKGSLAG-ISEVAPDIRIIFKGGADLQLKGHNSLVE 406

Query: 449 VDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           + + G  C A A S S ++IIGN QQ+ +++++D  N  VGF P  C
Sbjct: 407 L-ETGITCLAMAGS-SSIAIIGNYQQQTVKVAYDVGNSRVGFAPGGC 451


>gi|125563957|gb|EAZ09337.1| hypothetical protein OsI_31609 [Oryza sativa Indica Group]
 gi|125605916|gb|EAZ44952.1| hypothetical protein OsJ_29595 [Oryza sativa Japonica Group]
          Length = 438

 Score =  231 bits (590), Expect = 4e-58,   Method: Compositional matrix adjust.
 Identities = 135/361 (37%), Positives = 193/361 (53%), Gaps = 27/361 (7%)

Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSS 213
           GEY + +G+GSPPR    +ID+GSD++W QC PC  C +Q  P F+PA S S++ + CSS
Sbjct: 83  GEYLMDVGIGSPPRYFSAMIDTGSDLIWTQCAPCLLCVEQPTPYFEPAKSTSYASLPCSS 142

Query: 214 AVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIG----RTVVKNVAIGCGHKNQ 269
           A+C+ L +  C    C Y+  YGD + + G LA ET T G    R  V  V+ GCG+ N 
Sbjct: 143 AMCNALYSPLCFQNACVYQAFYGDSASSAGVLANETFTFGTNSTRVAVPRVSFGCGNMNA 202

Query: 270 GMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREAL------- 322
           G     +G++G G G++SLV QLG      FSYCL S  + ++  L FG  A        
Sbjct: 203 GTLFNGSGMVGFGRGALSLVSQLGSPR---FSYCLTSFMSPATSRLYFGAYATLNSTNTS 259

Query: 323 ---PVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQM-GDDGVVMDTGT 378
              PV +   P + NP  P+ Y++ ++G+ V G  +PI   +F + +  G  GV++D+GT
Sbjct: 260 SSGPVQS--TPFIVNPALPTMYFLNMTGISVAGDLLPIDPSVFAINETDGTGGVIIDSGT 317

Query: 379 AVTRLPTPAYEAFRDAFVAQTGNLPRASGV--SIFDTCYNLSGFVS--VRVPTVSFYFSG 434
            VT L  PAY   + AFVA  G LPRA+      FDTC+         V +P +  +F G
Sbjct: 318 TVTFLAQPAYAMVQGAFVAWVG-LPRANATPSDTFDTCFKWPPPPRRMVTLPEMVLHFDG 376

Query: 435 GPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNV 494
              + LP  N+++     G  C A  PS  G SIIG+ Q +   + +D  N  + F P  
Sbjct: 377 AD-MELPLENYMVMDGGTGNLCLAMLPSDDG-SIIGSFQHQNFHMLYDLENSLLSFVPAP 434

Query: 495 C 495
           C
Sbjct: 435 C 435


>gi|297817972|ref|XP_002876869.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297322707|gb|EFH53128.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 462

 Score =  231 bits (590), Expect = 5e-58,   Method: Compositional matrix adjust.
 Identities = 137/402 (34%), Positives = 214/402 (53%), Gaps = 23/402 (5%)

Query: 111 RMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQY 170
           ++QR + R    + RL G  A  A     D   ++ +    GSGE+ + + +G+P     
Sbjct: 64  KIQRGINRGFHRLNRL-GAVAVLAVASNPDDTNNIKAPTHGGSGEFLMELSIGNPAVKYA 122

Query: 171 MVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGR-- 228
            ++D+GSD++W QC+PC++C+ Q  P+FDP  S+S+S V CSS +C+ L  + C+  +  
Sbjct: 123 AIVDTGSDLIWTQCKPCTECFDQPTPIFDPEKSSSYSKVGCSSGLCNALPRSNCNEDKDS 182

Query: 229 CRYEVSYGDGSYTKGTLALETLTI-GRTVVKNVAIGCGHKNQGM-FVGAAGLLGLGGGSM 286
           C Y  +YGD S T+G LA ET T      +  +  GCG +N+G  F   +GL+GLG G +
Sbjct: 183 CEYLYTYGDYSSTRGLLATETFTFEDENSISGIGFGCGVENEGDGFSQGSGLVGLGRGPL 242

Query: 287 SLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALPV----GA-------AWVPLVRN 334
           SL+ QL       FSYCL S   + +S SL  G  A  +    GA         + L+RN
Sbjct: 243 SLISQLKET---KFSYCLTSIEDSEASSSLFIGSLASGIVNKTGANLDGEVTKTMSLLRN 299

Query: 335 PRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDA 394
           P  PSFYY+ L G+ VG  R+ + +  F L++ G  G+++D+GT +T L   A++  ++ 
Sbjct: 300 PDQPSFYYLELQGITVGAKRLSVEKSTFELSEDGTGGMIIDSGTTITYLEETAFKVLKEE 359

Query: 395 FVAQTGNLPRASGVSIFDTCYNL-SGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAG 453
           F ++       SG +  D C+ L +   ++ VP + F+F G   L LP  N+++     G
Sbjct: 360 FTSRMSLPVDDSGSTGLDLCFKLPNAAKNIAVPKLIFHFKGAD-LELPGENYMVADSSTG 418

Query: 454 TFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
             C A   S +G+SI GN+QQ+   +  D     V F P  C
Sbjct: 419 VLCLAMG-SSNGMSIFGNVQQQNFNVLHDLEKETVTFVPTEC 459


>gi|357154085|ref|XP_003576664.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 509

 Score =  231 bits (590), Expect = 5e-58,   Method: Compositional matrix adjust.
 Identities = 145/370 (39%), Positives = 197/370 (53%), Gaps = 29/370 (7%)

Query: 148 GMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQ--CYKQSDPVFDPADSAS 205
           G+  G+G Y V +G+G+P R   +V D+GSD+ WVQC PCS   CYKQ DP+F P+DS++
Sbjct: 146 GISVGTGNYVVSVGLGTPARDLTVVFDTGSDLSWVQCGPCSSGGCYKQQDPLFAPSDSST 205

Query: 206 FSGVSCSSAVCDRLENAGCHAG--RCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVA-- 261
           FS V C +  C   ++ G   G  RC YEV YGD S T+G L  +TLT+G     N +  
Sbjct: 206 FSAVRCGARECRARQSCGGSPGDDRCPYEVVYGDKSRTQGHLGNDTLTLGTMAPANASAE 265

Query: 262 ---------IGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSS 312
                     GCG  N G+F  A GL GLG G +SL  Q  G+ G  FSYCL S  + + 
Sbjct: 266 NDNKLPGFVFGCGENNTGLFGQADGLFGLGRGKVSLSSQAAGKFGEGFSYCLPSSSSSAP 325

Query: 313 GSLVFGREA-LPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDG 371
           G L  G     P  A + P++     PSFYYV L G+ V G  I +S     L       
Sbjct: 326 GYLSLGTPVPAPAHAQFTPMLNRTTTPSFYYVKLVGIRVAGRAIRVSSPRVALP------ 379

Query: 372 VVMDTGTAVTRLPTPAYEAFRDAFVAQTGN--LPRASGVSIFDTCYNLSGF--VSVRVPT 427
           +++D+GT +TRL   AY A R AF++  G     RA  +SI DTCY+ +     +V +P 
Sbjct: 380 LIVDSGTVITRLAPRAYRALRAAFLSAMGKYGYKRAPRLSILDTCYDFTAHANATVSIPA 439

Query: 428 VSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLS--IIGNIQQEGIQISFDGAN 485
           V+  F+GG  +++  S  L  V      C AFAP+  G S  I+GN QQ  + + +D A 
Sbjct: 440 VALVFAGGATISVDFSGVLY-VAKVAQACLAFAPNGDGRSAGILGNTQQRTLAVVYDVAR 498

Query: 486 GFVGFGPNVC 495
             +GF    C
Sbjct: 499 QKIGFAAKGC 508


>gi|115479237|ref|NP_001063212.1| Os09g0423500 [Oryza sativa Japonica Group]
 gi|50725896|dbj|BAD33424.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
           sativa Japonica Group]
 gi|50726136|dbj|BAD33657.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
           sativa Japonica Group]
 gi|113631445|dbj|BAF25126.1| Os09g0423500 [Oryza sativa Japonica Group]
 gi|215767614|dbj|BAG99842.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222641596|gb|EEE69728.1| hypothetical protein OsJ_29412 [Oryza sativa Japonica Group]
          Length = 473

 Score =  231 bits (589), Expect = 6e-58,   Method: Compositional matrix adjust.
 Identities = 143/402 (35%), Positives = 212/402 (52%), Gaps = 36/402 (8%)

Query: 115 DVKRVATLVRRLSGGGADAAKHEVQDFGT---DVVSGMDQGSGEYFVRIGVGSPPRSQYM 171
           D  RV++L RR +GGG+ A         T    V SG    +  Y   +G+G    +  +
Sbjct: 84  DAARVSSLQRR-AGGGSWAEDEAAAAAATGRVPVTSGARLRTLNYVATVGLGGGEAT--V 140

Query: 172 VIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLE---------NA 222
           ++D+ S++ WVQC PC+ C+ Q  P+FDPA S S++ + C+S+ CD L+           
Sbjct: 141 IVDTASELTWVQCAPCASCHDQQGPLFDPASSPSYAVLPCNSSSCDALQVATGSAAGACG 200

Query: 223 GCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLG 282
           G     C Y +SY DGSY++G LA + L++   V+     GCG  NQG F G +GL+GLG
Sbjct: 201 GGEQPSCSYTLSYRDGSYSQGVLAHDKLSLAGEVIDGFVFGCGTSNQGPFGGTSGLMGLG 260

Query: 283 GGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFG------REALPVGAAWVPLVRNPR 336
              +SL+ Q   Q GG FSYCL  + + SSGSLV G      R + P+   +  +V +P 
Sbjct: 261 RSQLSLISQTMDQFGGVFSYCLPLKESESSGSLVLGDDTSVYRNSTPI--VYTTMVSDPV 318

Query: 337 APSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFV 396
              FY+V L+G+ +GG  +  S             V++D+GT +T L    Y A +  F+
Sbjct: 319 QGPFYFVNLTGITIGGQEVESSA----------GKVIVDSGTIITSLVPSVYNAVKAEFL 368

Query: 397 AQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPV-DDAGTF 455
           +Q    P+A G SI DTC+NL+GF  V++P++ F F G   + + +S  L  V  D+   
Sbjct: 369 SQFAEYPQAPGFSILDTCFNLTGFREVQIPSLKFVFEGNVEVEVDSSGVLYFVSSDSSQV 428

Query: 456 CFAFA--PSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           C A A   S    SIIGN QQ+ +++ FD     +GF    C
Sbjct: 429 CLALASLKSEYETSIIGNYQQKNLRVIFDTLGSQIGFAQETC 470


>gi|125563764|gb|EAZ09144.1| hypothetical protein OsI_31414 [Oryza sativa Indica Group]
          Length = 472

 Score =  231 bits (588), Expect = 7e-58,   Method: Compositional matrix adjust.
 Identities = 143/402 (35%), Positives = 212/402 (52%), Gaps = 36/402 (8%)

Query: 115 DVKRVATLVRRLSGGGADAAKHEVQDFGT---DVVSGMDQGSGEYFVRIGVGSPPRSQYM 171
           D  RV++L RR +GGG+ A         T    V SG    +  Y   +G+G    +  +
Sbjct: 83  DAARVSSLQRR-AGGGSWAEDEAAAAAATGRVPVTSGARLRTLNYVATVGLGGGEAT--V 139

Query: 172 VIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLE---------NA 222
           ++D+ S++ WVQC PC+ C+ Q  P+FDPA S S++ + C+S+ CD L+           
Sbjct: 140 IVDTASELTWVQCAPCASCHDQQGPLFDPASSPSYAVLPCNSSSCDALQVATGSAAGACG 199

Query: 223 GCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLG 282
           G     C Y +SY DGSY++G LA + L++   V+     GCG  NQG F G +GL+GLG
Sbjct: 200 GGEQPSCSYTLSYRDGSYSQGVLAHDKLSLAGEVIDGFVFGCGTSNQGPFGGTSGLMGLG 259

Query: 283 GGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFG------REALPVGAAWVPLVRNPR 336
              +SL+ Q   Q GG FSYCL  + + SSGSLV G      R + P+   +  +V +P 
Sbjct: 260 RSQLSLISQTMDQFGGVFSYCLPLKESESSGSLVLGDDTSVYRNSTPI--VYTTMVSDPV 317

Query: 337 APSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFV 396
              FY+V L+G+ +GG  +  S             V++D+GT +T L    Y A +  F+
Sbjct: 318 QGPFYFVNLTGITIGGQEVESSA----------GKVIVDSGTIITSLVPSVYNAVKAEFL 367

Query: 397 AQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPV-DDAGTF 455
           +Q    P+A G SI DTC+NL+GF  V++P++ F F G   + + +S  L  V  D+   
Sbjct: 368 SQFAEYPQAPGFSILDTCFNLTGFREVQIPSLKFVFEGNVEVEVDSSGVLYFVSSDSSQV 427

Query: 456 CFAFA--PSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           C A A   S    SIIGN QQ+ +++ FD     +GF    C
Sbjct: 428 CLALASLKSEYETSIIGNYQQKNLRVIFDTLGSQIGFAQETC 469


>gi|242095588|ref|XP_002438284.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
 gi|241916507|gb|EER89651.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
          Length = 487

 Score =  231 bits (588), Expect = 8e-58,   Method: Compositional matrix adjust.
 Identities = 159/448 (35%), Positives = 225/448 (50%), Gaps = 49/448 (10%)

Query: 80  NLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQ 139
            L++VHR  + +  +     H+H     +   ++RD  RV ++ RRL+     AA+    
Sbjct: 56  TLQIVHRACLQTGDDIAVPDHHH-----YTGILRRDRHRVRSIYRRLT-----AAETTTT 105

Query: 140 DFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC--SQCYKQSDPV 197
                   G+   S EY V IG+G+PPR+  ++ D+GSD+ WVQC PC  S CY Q +P+
Sbjct: 106 TTTIPARLGLAFQSLEYVVTIGIGTPPRNFTVLFDTGSDLTWVQCLPCPDSSCYPQQEPL 165

Query: 198 FDPADSASFSGVSCSSAVCDR--LENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIG-- 253
           FDP+ S+++  V CS+  C    ++   C A  C Y V YGD S T G+LA ET T+   
Sbjct: 166 FDPSKSSTYVDVPCSAPECHIGGVQQTRCGATSCEYSVKYGDESETHGSLAEETFTLSPP 225

Query: 254 ---RTVVKNVAIGCGHKNQGMF----VGAAGLLGLGGGSMSLVGQLGGQT---GGAFSYC 303
                    V  GC H+   +F    +G AGLLGLG G  S++ Q        GG FSYC
Sbjct: 226 SPLAPAATGVVFGCSHEYISVFNDTGMGVAGLLGLGRGDSSILSQTRRSINSGGGVFSYC 285

Query: 304 LVSRGTGSSGSLVFGREALP----VGAAWVPLVRN-PRAPSFYYVGLSGLGVGGMRIPIS 358
           L  RG+ +    + G  A P       ++ PL+    +  S Y V L+G+ V G  + I 
Sbjct: 286 LPPRGSSTGYLTIGGGAAAPQQQYSNLSFTPLITTISQLRSAYVVNLAGVSVNGAAVDIP 345

Query: 359 EDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGN---LPRASGVSIFDTCY 415
              F L      G V+D+GT VT +P  AY   RD F    G+   LP  S + + DTCY
Sbjct: 346 ASAFSL------GAVIDSGTVVTHMPAAAYYPLRDEFRLHMGSYKMLPEGS-MKLLDTCY 398

Query: 416 NLSGFVSVRVPTVSFYFSGGPVLTLPASNFL--IPVDDAG-----TFCFAFAPSPS-GLS 467
           +++G   V  P V+  F GG  + + AS  L  +P +D         C AF P+ S GL 
Sbjct: 399 DVTGQDVVTAPRVALEFGGGARIDVDASGILLVLPAEDGSGQSLTLACLAFLPTNSAGLV 458

Query: 468 IIGNIQQEGIQISFDGANGFVGFGPNVC 495
           I+GN+QQ    + FD   G +GFGPN C
Sbjct: 459 IVGNMQQRAYNVVFDVDGGRIGFGPNGC 486


>gi|218188634|gb|EEC71061.1| hypothetical protein OsI_02803 [Oryza sativa Indica Group]
          Length = 479

 Score =  231 bits (588), Expect = 8e-58,   Method: Compositional matrix adjust.
 Identities = 157/454 (34%), Positives = 230/454 (50%), Gaps = 37/454 (8%)

Query: 62  ERHNNISSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVAT 121
           E    I ++  SS +   ++ L HR    S ++  +       +      ++RD  R   
Sbjct: 43  EFWGGIEATIPSSSDGTSSVTLSHRYGPCSPADPNSGEKRPTDEE----LLRRDQLRADY 98

Query: 122 LVRRLSGGGADAAKHEVQDFGTDVVS--GMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDI 179
           + R+ SG    AA  + Q     V +  G    + EY + +G+GSP  +Q +VID+GSD+
Sbjct: 99  IRRKFSGSNGTAAGEDGQSSKVSVPTTLGSSLDTLEYVISVGLGSPAMTQRVVIDTGSDV 158

Query: 180 VWVQCQPC---SQCYKQSDPVFDPADSASFSGVSCSSAVCDRL----ENAGCHA-GRCRY 231
            WVQC+PC   S C+  +  +FDPA S++++  +CS+A C +L    E  GC A  RC+Y
Sbjct: 159 SWVQCEPCPAPSPCHAHAGALFDPAASSTYAAFNCSAAACAQLGDSGEANGCDAKSRCQY 218

Query: 232 EVSYGDGSYTKGTLALETLTI-GRTVVKNVAIGCGHKN--QGMFVGAAGLLGLGGGSMSL 288
            V YGDGS T GT + + LT+ G  VV+    GC H     GM     GL+GLGG + SL
Sbjct: 219 IVKYGDGSNTTGTYSSDVLTLSGSDVVRGFQFGCSHAELGAGMDDKTDGLIGLGGDAQSL 278

Query: 289 VGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGA-----AWVPLVRNPRAPSFYYV 343
           V Q   + G +FSYCL +    SSG L  G  A   G      A  P++R+ + P++Y+ 
Sbjct: 279 VSQTAARYGKSFSYCLPAT-PASSGFLTLGAPASGGGGGASRFATTPMLRSKKVPTYYFA 337

Query: 344 GLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLP 403
            L  + VGG ++ +S  +F        G ++D+GT +TRLP  AY A   AF A      
Sbjct: 338 ALEDIAVGGKKLGLSPSVFAA------GSLVDSGTVITRLPPAAYAALSSAFRAGMTRYA 391

Query: 404 RASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPS- 462
           RA  + I DTC+N +G   V +PTV+  F+GG V+ L A   +         C AFAP+ 
Sbjct: 392 RAEPLGILDTCFNFTGLDKVSIPTVALVFAGGAVVDLDAHGIV------SGGCLAFAPTR 445

Query: 463 -PSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
                  IGN+QQ   ++ +D   G  GF    C
Sbjct: 446 DDKAFGTIGNVQQRTFEVLYDVGGGVFGFRAGAC 479


>gi|118487542|gb|ABK95598.1| unknown [Populus trichocarpa]
          Length = 471

 Score =  231 bits (588), Expect = 8e-58,   Method: Compositional matrix adjust.
 Identities = 147/437 (33%), Positives = 230/437 (52%), Gaps = 37/437 (8%)

Query: 89  MSSSSNTTNNMHYHRHQHSF--------HARMQRDVKRVATLVRRLS--GGGADAAKH-- 136
           ++ SS   N  H H H  S            +  D + V  L  RL+  G G+ +AK   
Sbjct: 41  INQSSIHLNIYHVHGHGSSLTPNSSSLLSDVLLHDEEHVKALSDRLANKGLGSGSAKPPK 100

Query: 137 -----EVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS-QC 190
                E       +  G+  GSG Y+V++G+G+PP+   M++D+GS + W+QCQPC+  C
Sbjct: 101 SGHLLEPNSASIPLNPGLSIGSGNYYVKLGLGTPPKYYAMILDTGSSLSWLQCQPCAVYC 160

Query: 191 YKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCH-------AGRCRYEVSYGDGSYTKG 243
           + Q+DP++DP+ S ++  +SC+S  C RL+ A  +       +  C Y  SYGD S++ G
Sbjct: 161 HAQADPLYDPSVSKTYKKLSCASVECSRLKAATLNDPLCETDSNACLYTASYGDTSFSIG 220

Query: 244 TLALETLTIGRT-VVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSY 302
            L+ + LT+  +  +     GCG  NQG+F  AAG++GL    +S++ QL  + G AFSY
Sbjct: 221 YLSQDLLTLTSSQTLPQFTYGCGQDNQGLFGRAAGIIGLARDKLSMLAQLSTKYGHAFSY 280

Query: 303 CLVSRGTGSSGSLVFGREAL-PVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDL 361
           CL +  +GSSG       ++ P    + P++ + + PS Y++ L+ + V G  + ++  +
Sbjct: 281 CLPTANSGSSGGGFLSIGSISPTSYKFTPMLTDSKNPSLYFLRLTAITVSGRPLDLAAAM 340

Query: 362 FRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVA-QTGNLPRASGVSIFDTCYNLSGF 420
           +R+  +      +D+GT +TRLP   Y A R AFV   +    +A   SI DTC+  S  
Sbjct: 341 YRVPTL------IDSGTVITRLPMSMYAALRQAFVKIMSTKYAKAPAYSILDTCFKGSLK 394

Query: 421 VSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSP--SGLSIIGNIQQEGIQ 478
               VP +   F GG  LTL A + LI  D  G  C AFA S   + ++IIGN QQ+   
Sbjct: 395 SISAVPEIKMIFQGGADLTLRAPSILIEADK-GITCLAFAGSSGTNQIAIIGNRQQQTYN 453

Query: 479 ISFDGANGFVGFGPNVC 495
           I++D +   +GF P  C
Sbjct: 454 IAYDVSTSRIGFAPGSC 470


>gi|326506682|dbj|BAJ91382.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326525815|dbj|BAJ88954.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 473

 Score =  231 bits (588), Expect = 9e-58,   Method: Compositional matrix adjust.
 Identities = 140/384 (36%), Positives = 203/384 (52%), Gaps = 23/384 (5%)

Query: 119 VATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSD 178
           VA+L R       D +   V      +  G   G G Y  R+G+G+P +   MV+D+GS 
Sbjct: 105 VASLYRANDDAAVDGSLASVP-----LTPGTSYGVGNYVTRMGLGTPAKPYIMVVDTGSS 159

Query: 179 IVWVQCQPCS-QCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCR------Y 231
           + W+QC PC   C++QS PVFDP  S+S++ VSCS+  C+ L  A  +   C       Y
Sbjct: 160 LTWLQCSPCRVSCHRQSGPVFDPKTSSSYAAVSCSTPQCNDLSTATLNPAACSSSDVCIY 219

Query: 232 EVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQ 291
           + SYGD S++ G L+ +T++ G   V N   GCG  N+G+F  +AGL+GL    +SL+ Q
Sbjct: 220 QASYGDSSFSVGYLSKDTVSFGSNSVPNFYYGCGQDNEGLFGRSAGLMGLARNKLSLLYQ 279

Query: 292 LGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVG 351
           L    G +FSYCL S  +    S+       P   ++ P+V +    S Y++ LSG+ V 
Sbjct: 280 LAPTLGYSFSYCLPSSSSSGYLSIGSYN---PGQYSYTPMVSSTLDDSLYFIKLSGMTVA 336

Query: 352 GMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIF 411
           G  + +S      ++      ++D+GT +TRLPT  Y+A   A         RA   SI 
Sbjct: 337 GKPLAVSS-----SEYSSLPTIIDSGTVITRLPTTVYDALSKAVAGAMKGTKRADAYSIL 391

Query: 412 DTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGN 471
           DTC+ +    S+RVP VS  FSGG  L L A N L+ VD + T C AFAP+ S  +IIGN
Sbjct: 392 DTCF-VGQASSLRVPAVSMAFSGGAALKLSAQNLLVDVDSSTT-CLAFAPARSA-AIIGN 448

Query: 472 IQQEGIQISFDGANGFVGFGPNVC 495
            QQ+   + +D  +  +GF    C
Sbjct: 449 TQQQTFSVVYDVKSNRIGFAAGGC 472


>gi|224143371|ref|XP_002324933.1| predicted protein [Populus trichocarpa]
 gi|222866367|gb|EEF03498.1| predicted protein [Populus trichocarpa]
          Length = 463

 Score =  230 bits (587), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 141/433 (32%), Positives = 222/433 (51%), Gaps = 29/433 (6%)

Query: 70  SNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVR-RLSG 128
           S+ + +E   +L+LVHR    +   T+          SF+  ++RD  RV ++++ R S 
Sbjct: 52  SSKALNEGSSSLKLVHRFGPCNPHRTST-----APASSFNEILRRDKLRVDSIIQARRSM 106

Query: 129 GGADAAKH---EVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQ 185
               + +H    V  +G   ++  D     Y V +G+G+P +   ++ D+GS ++W QC+
Sbjct: 107 NLTSSVEHMKSSVPFYGLSKITASD-----YIVNVGIGTPKKEMPLIFDTGSGLIWTQCK 161

Query: 186 PCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTL 245
           PC  CY +  PVFDP  SASF G+ CSS +C  +   GC + +C Y  +Y D S + GTL
Sbjct: 162 PCKACYPKV-PVFDPTKSASFKGLPCSSKLCQSIRQ-GCSSPKCTYLTAYVDNSSSTGTL 219

Query: 246 ALETLTIG--RTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYC 303
           A ET++    +   KN+ IGC  +  G  +G +G++GL    +SL  Q        FSYC
Sbjct: 220 ATETISFSHLKYDFKNILIGCSDQVSGESLGESGIMGLNRSPISLASQTANIYDKLFSYC 279

Query: 304 LVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYY-VGLSGLGVGGMRIPISEDLF 362
           + S   GS+G L FG + +P    + P+ +   APS  Y + ++G+ VGG ++ I    F
Sbjct: 280 IPST-PGSTGHLTFGGK-VPNDVRFSPVSKT--APSSDYDIKMTGISVGGRKLLIDASAF 335

Query: 363 RLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVS 422
           ++         +D+G  +TRLP  AY A R  F       P        DTCY+ S + +
Sbjct: 336 KIAS------TIDSGAVLTRLPPKAYSALRSVFREMMKGYPLLDQDDFLDTCYDFSNYST 389

Query: 423 VRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFD 482
           V +P++S +F GG  + +  S  +  V  +  +C AFA     +SI GN QQ+   + FD
Sbjct: 390 VAIPSISVFFEGGVEMDIDVSGIMWQVPGSKVYCLAFAELDDEVSIFGNFQQKTYTVVFD 449

Query: 483 GANGFVGFGPNVC 495
           GA   +GF P  C
Sbjct: 450 GAKERIGFAPGGC 462


>gi|242095602|ref|XP_002438291.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
 gi|241916514|gb|EER89658.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
          Length = 464

 Score =  230 bits (587), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 152/432 (35%), Positives = 213/432 (49%), Gaps = 29/432 (6%)

Query: 82  ELVHRDKMSSSSNTTNNMHYHRH----------QHSFHARMQRDVKRVATLVRRLSGGGA 131
           E+    K++ S N +     HRH          + S    ++RD  R A +  ++S    
Sbjct: 44  EVCSGHKVTPSKNGSTLALSHRHGPCSPVISKEKPSHEETLRRDQLRAAYIQAKVSSRYN 103

Query: 132 DAAKHEVQDFGT-DVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS-- 188
           + AK   Q   T    SG   G+ EY + + +G+P  +Q M ID+GSD+ WVQC PC+  
Sbjct: 104 NVAKELQQSAVTIPTSSGYSLGTTEYVITVTIGTPAVTQVMSIDTGSDVSWVQCAPCAAQ 163

Query: 189 QCYKQSDPVFDPADSASFSGVSCSSAVCDRL--ENAGCHAGRCRYEVSYGDGSYTKGTLA 246
            C  Q D +FDPA SA++S  SC SA C +L  E  GC   +C+Y V YGDGS T GT  
Sbjct: 164 SCSSQKDKLFDPAMSATYSAFSCGSAQCAQLGDEGNGCLKSQCQYIVKYGDGSNTAGTYG 223

Query: 247 LETLTIGRT-VVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLV 305
            +TL++  +  VK+   GC H+  G      GL+GLGG + SLV Q     G AFSYCL 
Sbjct: 224 SDTLSLTSSDAVKSFQFGCSHRAAGFVGELDGLMGLGGDTESLVSQTAATYGKAFSYCLP 283

Query: 306 SRGTGSSGSLVFGREALPVGAAW--VPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFR 363
              +   G L  G       + +   P+VR    P+FY V L G+ V G  + +   +F 
Sbjct: 284 PPSSSGGGFLTLGAAGGASSSRYSHTPMVRF-SVPTFYGVFLQGITVAGTMLNVPASVF- 341

Query: 364 LTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSV 423
                    V+D+GT +T+LP  AY+A R AF  +    P A+ V   DTC++ SGF ++
Sbjct: 342 -----SGASVVDSGTVITQLPPTAYQALRTAFKKEMKAYPSAAPVGSLDTCFDFSGFNTI 396

Query: 424 RVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDG 483
            VPTV+  FS G  + L  S  L     AG   F          I+GN+QQ   ++ FD 
Sbjct: 397 TVPTVTLTFSRGAAMDLDISGILY----AGCLAFTATAHDGDTGILGNVQQRTFEMLFDV 452

Query: 484 ANGFVGFGPNVC 495
               +GF    C
Sbjct: 453 GGRTIGFRSGAC 464


>gi|242081123|ref|XP_002445330.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
 gi|241941680|gb|EES14825.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
          Length = 543

 Score =  230 bits (586), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 143/417 (34%), Positives = 209/417 (50%), Gaps = 34/417 (8%)

Query: 104 HQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVG 163
           H       +  D  R  +   R+    A AA  +       + SG+   +  Y   I +G
Sbjct: 133 HDRYLRRLLAADESRANSFQLRIRNDRAAAASTQSGSAEVPLTSGIRFQTLNYVTTIALG 192

Query: 164 -----SPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDR 218
                SP  +  +++D+GSD+ WVQC+PCS CY Q DP+FDPA SA+++ V C+++ C  
Sbjct: 193 GGSSGSPAANLTVIVDTGSDLTWVQCKPCSACYAQRDPLFDPAGSATYAAVRCNASACAA 252

Query: 219 LENAG------CHAG--RCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQG 270
              A       C  G  RC Y ++YGDGS+++G LA +T+ +G   +     GCG  N+G
Sbjct: 253 SLKAATGTPGSCGGGNERCYYALAYGDGSFSRGVLATDTVALGGASLDGFVFGCGLSNRG 312

Query: 271 MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTG-SSGSLVFG------REALP 323
           +F G AGL+GLG   +SLV Q   + GG FSYCL +  +G +SGSL  G      R   P
Sbjct: 313 LFGGTAGLMGLGRTELSLVSQTALRYGGVFSYCLPATTSGDASGSLSLGGDASSYRNTTP 372

Query: 324 VGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRL 383
           V  A+  ++ +P  P FY++ ++G  VGG  +           +G   V++D+GT +TRL
Sbjct: 373 V--AYTRMIADPAQPPFYFLNVTGAAVGGTALAAQ-------GLGASNVLIDSGTVITRL 423

Query: 384 PTPAYEAFRDAFVAQ--TGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLP 441
               Y   R  F  Q      P A G SI DTCY+L+G   V+VP ++    GG  +T+ 
Sbjct: 424 APSVYRGVRAEFTRQFAAAGYPTAPGFSILDTCYDLTGHDEVKVPLLTLRLEGGAEVTVD 483

Query: 442 ASNFLIPV-DDAGTFCFAFA--PSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           A+  L  V  D    C A A         IIGN QQ+  ++ +D     +GF    C
Sbjct: 484 AAGMLFVVRKDGSQVCLAMASLSYEDQTPIIGNYQQKNKRVVYDTVGSRLGFADEDC 540


>gi|242084332|ref|XP_002442591.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
 gi|241943284|gb|EES16429.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
          Length = 493

 Score =  230 bits (586), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 187/451 (41%), Positives = 239/451 (52%), Gaps = 54/451 (11%)

Query: 80  NLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVA--------------TLVRR 125
           ++ L+HRD  S + N T      R       R+QRD  R A              T V  
Sbjct: 62  HVRLLHRD--SFAVNATPAQLLAR-------RLQRDELRAAWIIKAAAPAAAANDTPVVG 112

Query: 126 LSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQ 185
           LS GGA         F   VVS     SGEY  +I VG+P     + +D+GSDI W+QCQ
Sbjct: 113 LSSGGA---------FVAPVVSRAPTTSGEYMAKIAVGTPAVEALLAMDTGSDITWLQCQ 163

Query: 186 PCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRL-ENAGCHAGR--CRYEVSYG-DGSYT 241
           PC +CY QS PVFDP  S S+  +   +  C  L  + G  A R  C Y V YG DGS T
Sbjct: 164 PCRRCYPQSGPVFDPRHSTSYREMGYDAPDCQALGRSGGGDAKRMTCVYAVGYGDDGSTT 223

Query: 242 KGTLALETLTI-GRTVVKNVAIGCGHKNQGMFVG-AAGLLGLGGGSMSLVGQLG--GQTG 297
            G    ETLT  G   V +++IGCGH N+G+F   AAG+LGLG G +S   Q+   G   
Sbjct: 224 VGDFIEETLTFAGGVQVPHMSIGCGHDNKGLFAAPAAGILGLGRGQISCPSQIAALGYNV 283

Query: 298 GAFSYCLVSRGTGSSGSLVFGREALPVGAA-------WVPLVRNPRAPSFYY-VGLSGLG 349
            +FSYCL      S G  V     +  GAA       + P V+N    +FYY   +    
Sbjct: 284 TSFSYCLADFFLSSPGRSVSSTLTIGDGAAAGSPPPSFTPTVQNLNMATFYYVRLVGVSV 343

Query: 350 VGGMRIPISEDLFRLTQ-MGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRAS-- 406
            G     ++ED  +L    G  GV++D+GTAVTRL   AY AFRDAF A   +L + S  
Sbjct: 344 GGVRVPGVTEDDLKLDPYTGRGGVILDSGTAVTRLARRAYIAFRDAFRAAAVDLGQVSIG 403

Query: 407 GVS-IFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPS-PS 464
           G S  FDTCY + G  +++VPTVS +F+GG  LTLP  N+LIPVD  GT CFAFA +   
Sbjct: 404 GPSGFFDTCYTMGG-RAMKVPTVSMHFAGGVELTLPPKNYLIPVDSMGTVCFAFAGTGDR 462

Query: 465 GLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
            +SIIGNIQQ+G ++ ++   G VGF PN C
Sbjct: 463 SVSIIGNIQQQGFRVVYNIGGGRVGFAPNSC 493


>gi|242079447|ref|XP_002444492.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
 gi|241940842|gb|EES13987.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
          Length = 441

 Score =  230 bits (586), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 151/440 (34%), Positives = 215/440 (48%), Gaps = 42/440 (9%)

Query: 74  SDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADA 133
           +D   + L+L H D  +S         Y + Q    A + R   RVA L        A  
Sbjct: 23  NDNVGFQLKLTHVDAGTS---------YTKPQLLSRA-IARSKARVAAL------QSAAV 66

Query: 134 AKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQ 193
           +   V D  T     +   SGEY V + +G+PP     ++D+GSD++W QC PC  C  Q
Sbjct: 67  SPAPVADPITAARVLVTASSGEYLVDLAIGTPPLYYTAIMDTGSDLIWTQCAPCLLCAAQ 126

Query: 194 SDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIG 253
             P FD   SA++  + C S+ C  L +  C    C Y+  YGD + T G LA ET T G
Sbjct: 127 PTPYFDVKRSATYRALPCRSSRCAALSSPSCFKKMCVYQYYYGDTASTAGVLANETFTFG 186

Query: 254 -----RTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRG 308
                +    N++ GCG  N G    ++G++G G G +SLV QLG      FSYCL S  
Sbjct: 187 AASSTKVRAANISFGCGSLNAGELANSSGMVGFGRGPLSLVSQLGPSR---FSYCLTSYL 243

Query: 309 TGSSGSLVFGREA----------LPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPIS 358
           + +   L FG  A           PV +   P V NP  P+ Y++ + G+ +G  R+PI 
Sbjct: 244 SPTPSRLYFGVFANLNSTNTSSGSPVQS--TPFVINPALPNMYFLSVKGISLGTKRLPID 301

Query: 359 EDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSI-FDTCYNL 417
             +F +   G  GV++D+GT++T L   AYEA R   +A T  LP  +   I  DTC+  
Sbjct: 302 PLVFAINDDGTGGVIIDSGTSITWLQQDAYEAVRRG-LASTIPLPAMNDTDIGLDTCFQW 360

Query: 418 SGF--VSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQE 475
                V+V VP   F+F G   +TLP  N+++     G  C A AP+  G +IIGN QQ+
Sbjct: 361 PPPPNVTVTVPDFVFHFDGA-NMTLPPENYMLIASTTGYLCLAMAPTSVG-TIIGNYQQQ 418

Query: 476 GIQISFDGANGFVGFGPNVC 495
            + + +D AN F+ F P  C
Sbjct: 419 NLHLLYDIANSFLSFVPAPC 438


>gi|242066168|ref|XP_002454373.1| hypothetical protein SORBIDRAFT_04g029630 [Sorghum bicolor]
 gi|241934204|gb|EES07349.1| hypothetical protein SORBIDRAFT_04g029630 [Sorghum bicolor]
          Length = 458

 Score =  229 bits (584), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 148/404 (36%), Positives = 212/404 (52%), Gaps = 23/404 (5%)

Query: 100 HYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVR 159
           H H    S  AR+ +      T +RR S    DA        G     G   G G Y  R
Sbjct: 69  HDHARIASLAARLAKTPSSRPTKLRRGSSSSPDAESLASVPLG----PGTSVGVGNYVTR 124

Query: 160 IGVGSPPRSQYMVIDSGSDIVWVQCQPC-SQCYKQSDPVFDPADSASFSGVSCSSAVCDR 218
           +G+G+P +S  MV+D+GS + W+QC PC   C++QS PVF+P  S+S++ VSCS+  CD 
Sbjct: 125 MGLGTPAKSYVMVVDTGSSLTWLQCSPCLVSCHRQSGPVFNPRSSSSYASVSCSAPQCDA 184

Query: 219 LENAGCHAGRCR------YEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMF 272
           L  A  +   C       Y+ SYGD S++ G L+ +T++ G T V N   GCG  N+G+F
Sbjct: 185 LTTATLNPSTCSTSNVCIYQASYGDSSFSVGYLSKDTVSFGSTSVPNFYYGCGQDNEGLF 244

Query: 273 VGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLV 332
             +AGL+GL    +SL+ QL    G +FSYCL +  + S    +      P   ++ P+ 
Sbjct: 245 GQSAGLIGLARNKLSLLYQLAPSMGYSFSYCLPTSSSSSGYLSIGSYN--PGQYSYTPMA 302

Query: 333 RNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFR 392
           ++    S Y++ ++G+ V G  + +S   +          ++D+GT +TRLPT  Y A  
Sbjct: 303 KSSLDDSLYFIKMTGITVAGKPLSVSASAY-----SSLPTIIDSGTVITRLPTDVYSALS 357

Query: 393 DAFVAQTGNLPRASGVSIFDTCYNLSGFVS-VRVPTVSFYFSGGPVLTLPASNFLIPVDD 451
            A        PRAS  SI DTC+   G  S +RVP VS  F+GG  L L A+N L+ VD 
Sbjct: 358 KAVAGAMKGTPRASAFSILDTCFQ--GQASRLRVPQVSMAFAGGAALKLKATNLLVDVDS 415

Query: 452 AGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           A T C AFAP+ S  +IIGN QQ+   + +D  N  +GF    C
Sbjct: 416 ATT-CLAFAPARSA-AIIGNTQQQTFSVVYDVKNSKIGFAAGGC 457


>gi|293335955|ref|NP_001168399.1| uncharacterized protein LOC100382168 precursor [Zea mays]
 gi|223948009|gb|ACN28088.1| unknown [Zea mays]
 gi|413922066|gb|AFW61998.1| hypothetical protein ZEAMMB73_694403 [Zea mays]
          Length = 507

 Score =  229 bits (584), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 134/359 (37%), Positives = 195/359 (54%), Gaps = 35/359 (9%)

Query: 163 GSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVC-DRLEN 221
           GSP  +  +++D+GSD+ WVQC+PCS CY Q DP+FDPA SA+++ V C+++ C D L  
Sbjct: 155 GSPAANLTVIVDTGSDLTWVQCKPCSACYAQRDPLFDPAGSATYAAVRCNASACADSLRA 214

Query: 222 A----------GCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGM 271
           A          G  + +C Y ++YGDGS+++G LA +T+ +G   +     GCG  N+G+
Sbjct: 215 ATGTPGSCGSTGAGSEKCYYALAYGDGSFSRGVLATDTVALGGASLGGFVFGCGLSNRGL 274

Query: 272 FVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTG-SSGSLVFG---------REA 321
           F G AGL+GLG   +SLV Q   + GG FSYCL +  +G +SGSL  G         R  
Sbjct: 275 FGGTAGLMGLGRTELSLVSQTASRYGGVFSYCLPAATSGDASGSLSLGGGDDAASSYRNT 334

Query: 322 LPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVT 381
            PV  A+  ++ +P  P FY++ ++G  VGG  +           +G   V++D+GT +T
Sbjct: 335 TPV--AYTRMIADPAQPPFYFLNVTGAAVGGTALAAQ-------GLGASNVLIDSGTVIT 385

Query: 382 RLPTPAYEAFRDAFVAQTG--NLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLT 439
           RL    Y A R  F+ Q G    P A G SI DTCY+L+G   V+VP ++    GG  +T
Sbjct: 386 RLAPSVYRAVRAEFMRQFGAAGYPAAPGFSILDTCYDLTGHDEVKVPLLTLRLEGGADVT 445

Query: 440 LPASNFLIPV-DDAGTFCFAFA--PSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           + A+  L  V  D    C A A         IIGN QQ+  ++ +D     +GF    C
Sbjct: 446 VDAAGMLFVVRKDGSQVCLAMASLSYEDETPIIGNYQQKNKRVVYDTLGSRLGFADEDC 504


>gi|359485189|ref|XP_002279141.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 546

 Score =  229 bits (584), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 134/372 (36%), Positives = 197/372 (52%), Gaps = 23/372 (6%)

Query: 147 SGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASF 206
           SG+  GSGEYF+ + VG+PP+   +++D+GSD+ W+QC PC +C++Q+ P +DP  S+S+
Sbjct: 172 SGVSLGSGEYFIDVFVGTPPKHFSLILDTGSDLNWIQCVPCYECFEQNGPHYDPGQSSSY 231

Query: 207 SGVSCSSAVC------DRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTV---- 256
             + C  + C      D  +        C Y   YGD S T G  ALET T+  T+    
Sbjct: 232 RNIGCHDSRCHLVSSPDPPQPCKAENQTCPYYYWYGDSSNTTGDFALETFTVNLTMSSGK 291

Query: 257 -----VKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGS 311
                V+NV  GCGH N+G+F GAAGLLGLG G +S   QL    G +FSYCLV R + +
Sbjct: 292 PELRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDA 351

Query: 312 --SGSLVFGREALPVGAA---WVPLVRNPRAP--SFYYVGLSGLGVGGMRIPISEDLFRL 364
             S  L+FG +   +      +  LV     P  +FYYV +  + VGG  + I E+ +++
Sbjct: 352 NVSSKLIFGEDKDLLSHPELNFTTLVAGKENPVDTFYYVQIKSIVVGGEVVNIPEEKWQI 411

Query: 365 TQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVR 424
              G  G ++D+GT ++    PAY+  ++AF+A+    P      + + CYN++G     
Sbjct: 412 ATDGSGGTIIDSGTTLSYFAEPAYQVIKEAFMAKVKGYPVVKDFPVLEPCYNVTGVEQPD 471

Query: 425 VPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFA-FAPSPSGLSIIGNIQQEGIQISFDG 483
           +P     FS G V   P  N+ I ++     C A     PS LSIIGN QQ+   I +D 
Sbjct: 472 LPDFGIVFSDGAVWNFPVENYFIEIEPREVVCLAILGTPPSALSIIGNYQQQNFHILYDT 531

Query: 484 ANGFVGFGPNVC 495
               +GF P  C
Sbjct: 532 KKSRLGFAPTKC 543


>gi|356532386|ref|XP_003534754.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 463

 Score =  229 bits (583), Expect = 3e-57,   Method: Compositional matrix adjust.
 Identities = 154/463 (33%), Positives = 237/463 (51%), Gaps = 42/463 (9%)

Query: 58  NELFERHNNISSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVK 117
           + L E  +N    N    +    L L H   + SS  +T+         SF   + +D +
Sbjct: 17  SSLVEFQDN---DNPRQKQEGMQLNLYHVKGLDSSQTSTSPF-------SFSDMITKDEE 66

Query: 118 RVATLVRRLSGGGA--DAAKHEVQDFGTDVVS------GMDQGSGEYFVRIGVGSPPRSQ 169
           RV  L  RL+   +  ++A  +    G  +VS      G+  GSG Y+V+IG+G+P +  
Sbjct: 67  RVRFLHSRLTNKESVRNSATTDKLRGGPSLVSTTPLKSGLSIGSGNYYVKIGLGTPAKYF 126

Query: 170 YMVIDSGSDIVWVQCQPCS-QCYKQSDPVFDPADSASFSGVSC-----SSAVCDRLENAG 223
            M++D+GS + W+QCQPC   C+ Q DP+F P+ S ++  + C     SS     L   G
Sbjct: 127 SMIVDTGSSLSWLQCQPCVIYCHVQVDPIFTPSTSKTYKALPCSSSQCSSLKSSTLNAPG 186

Query: 224 CH--AGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKN--VAIGCGHKNQGMFVGAAGLL 279
           C    G C Y+ SYGD S++ G L+ + LT+  +   +     GCG  NQG+F  ++G++
Sbjct: 187 CSNATGACVYKASYGDTSFSIGYLSQDVLTLTPSEAPSSGFVYGCGQDNQGLFGRSSGII 246

Query: 280 GLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGS-----SGSLVFGREALPVGA-AWVPLVR 333
           GL    +S++GQL  + G AFSYCL S  +       SG L  G  +L      + PLV+
Sbjct: 247 GLANDKISMLGQLSKKYGNAFSYCLPSSFSAPNSSSLSGFLSIGASSLTSSPYKFTPLVK 306

Query: 334 NPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRD 393
           N + PS Y++ L+ + V G  + +S   + +        ++D+GT +TRLP   Y A + 
Sbjct: 307 NQKIPSLYFLDLTTITVAGKPLGVSASSYNVP------TIIDSGTVITRLPVAVYNALKK 360

Query: 394 AFV-AQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDA 452
           +FV   +    +A G SI DTC+  S      VP +   F GG  L L A N L+ ++  
Sbjct: 361 SFVLIMSKKYAQAPGFSILDTCFKGSVKEMSTVPEIQIIFRGGAGLELKAHNSLVEIEK- 419

Query: 453 GTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           GT C A A S + +SIIGN QQ+  ++++D AN  +GF P  C
Sbjct: 420 GTTCLAIAASSNPISIIGNYQQQTFKVAYDVANFKIGFAPGGC 462


>gi|242045564|ref|XP_002460653.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
 gi|241924030|gb|EER97174.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
          Length = 525

 Score =  229 bits (583), Expect = 3e-57,   Method: Compositional matrix adjust.
 Identities = 170/513 (33%), Positives = 244/513 (47%), Gaps = 56/513 (10%)

Query: 22  IITTSTSAASDTHFQILNVNESIKGSRTDHAKMSQYNELFERHNNISSSNTSSDEARWNL 81
           +   S+S      F   N   ++ G   D  K+    E  +     +S + S       L
Sbjct: 27  VKINSSSPLFGVEFPPFNTAVAVTG--CDSGKLVAAEEALDEQKQPASPSPS-----LKL 79

Query: 82  ELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRL--SGGGADAAKHEVQ 139
            L HR      +           + S     ++D  R+ T+ RR   SGGG   A    +
Sbjct: 80  RLNHRAAEGGRT----------REESLLDLAEKDAVRIETMYRRAARSGGGRMPASSSPR 129

Query: 140 DFGTD-----VVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQS 194
              ++     V SG+  GSGEY + + VG+PPR   M++D+GSD+ W+QC PC  C++Q 
Sbjct: 130 RALSERMVATVESGVAVGSGEYLMDVYVGTPPRRFRMIMDTGSDLNWLQCAPCLDCFEQR 189

Query: 195 DPVFDPADSASFSGVSCSSAVCDRLENAG---------CH---AGRCRYEVSYGDGSYTK 242
            PVFDPA S+S+  V+C    C  +             C       C Y   YGD S T 
Sbjct: 190 GPVFDPAASSSYRNVTCGDHRCGHVAPPPEPEASSPRTCRRPGEDPCPYYYWYGDQSNTT 249

Query: 243 GTLALETLTIGRTV------VKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQT 296
           G LALE+ T+  T       V  V  GCGH+N+G+F GAAGLLGLG G +S   QL    
Sbjct: 250 GDLALESFTVNLTAPGASRRVDGVVFGCGHRNRGLFHGAAGLLGLGRGPLSFASQLRAVY 309

Query: 297 GGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVR----------NPRAPSFYYVGLS 346
           G  FSYCLV  G+     +VFG +   +  A  P ++          +  A +FYYV L 
Sbjct: 310 GHTFSYCLVDHGSDVGSKVVFGEDDDALALAAHPQLKYTAFAPASSSSSPADTFYYVKLK 369

Query: 347 GLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQ-TGNLPRA 405
           G+ VGG  + IS D + + + G  G ++D+GT ++    PAY+  R AF+ + + + P  
Sbjct: 370 GVLVGGELLNISSDTWDVGKDGSGGTIIDSGTTLSYFVEPAYQVIRHAFMDRMSRSYPLV 429

Query: 406 SGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAG--TFCFAFAPSP 463
               +   CYN+SG     VP +S  F+ G V   PA N+ I +D  G    C A   +P
Sbjct: 430 PEFPVLSPCYNVSGVERPEVPELSLLFADGAVWDFPAENYFIRLDPDGGSIMCLAVLGTP 489

Query: 464 -SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
            +G+SIIGN QQ+   + +D  N  +GF P  C
Sbjct: 490 RTGMSIIGNFQQQNFHVVYDLQNNRLGFAPRRC 522


>gi|242063796|ref|XP_002453187.1| hypothetical protein SORBIDRAFT_04g001340 [Sorghum bicolor]
 gi|241933018|gb|EES06163.1| hypothetical protein SORBIDRAFT_04g001340 [Sorghum bicolor]
          Length = 493

 Score =  228 bits (582), Expect = 4e-57,   Method: Compositional matrix adjust.
 Identities = 175/462 (37%), Positives = 240/462 (51%), Gaps = 47/462 (10%)

Query: 73  SSDEARWNLELVHRDKMSSSSNTTNNMHYHRH----------QHSFHARMQRDVKRVATL 122
           S    R N  +V  +  + + + T  +H HRH            +   R+ RD  R A +
Sbjct: 40  SHQSLRTNKSVVCSESRAPAVHATVPLH-HRHGPCSPLPNKKMPTLEERLHRDKLRAAYI 98

Query: 123 VRRLS--------GGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPP-RSQYMVI 173
            R+LS        G G D    +          G    + EY + + +GSPP +SQ M+I
Sbjct: 99  HRKLSRGKKQGGGGAGGDVVVQQSHAMTVPTTLGTSLDTLEYVITVRLGSPPGKSQTMLI 158

Query: 174 DSGSDIVWVQCQPC-SQCYKQSDPVFDPADSASFSGVSCSSAVCDRL-----ENAGCHAG 227
           D+GSDI WV+C+PC  QC  Q DP+FDP+ S+++S  SCSSA C +L      N    +G
Sbjct: 159 DTGSDISWVRCKPCWQQCRPQVDPLFDPSLSSTYSPFSCSSAACAQLFQEGNANGCSSSG 218

Query: 228 RCRYEVSYGDGSY-TKGTLALETLTIGR----TVVKNVAIGCGHKNQGMFVGAAGLLGLG 282
           +C+Y   YGDGS  T GT + +TL +G      VV     GC H   G+    AGL+GLG
Sbjct: 219 QCQYIAMYGDGSVGTTGTYSSDTLALGSNSNTVVVSKFRFGCSHAETGITGLTAGLMGLG 278

Query: 283 GGSMSLVGQLGGQTGG-AFSYCLVSRGTGSSGSLVFGREALP-VGAAWVPLVRNPRAPSF 340
           GG+ SLV Q  G  G  AFSYCL    + SSG L  G       G    P++R+ + P+F
Sbjct: 279 GGAQSLVSQTAGTFGTTAFSYCLPPTPS-SSGFLTLGAAGTSSAGFVKTPMLRSSQVPAF 337

Query: 341 YYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVA--- 397
           Y V L  + VGG ++ I   +F        G++MD+GT VTRLP  AY +   AF A   
Sbjct: 338 YGVRLEAIRVGGRQLSIPTTVF------SAGMIMDSGTVVTRLPPTAYSSLSSAFKAGMK 391

Query: 398 QTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFS--GGPVLTLPASNFLIPVDDAGTF 455
           Q    P ++G    DTC+++SG  SV +PTV+  FS  GG V+ L AS  L+ ++ +  F
Sbjct: 392 QYPPAPSSAGGGFLDTCFDMSGQSSVSMPTVALVFSGAGGAVVNLDASGILLQMETSSIF 451

Query: 456 CFAF-APSPSGLS-IIGNIQQEGIQISFDGANGFVGFGPNVC 495
           C AF A S  G + IIGN+QQ   Q+ +D A G VGF    C
Sbjct: 452 CLAFVATSDDGSTGIIGNVQQRTFQVLYDVAGGAVGFKAGAC 493


>gi|145693994|gb|ABP93697.1| unknown protein isoform 2 [Lemna minor]
          Length = 351

 Score =  228 bits (582), Expect = 5e-57,   Method: Compositional matrix adjust.
 Identities = 133/353 (37%), Positives = 197/353 (55%), Gaps = 15/353 (4%)

Query: 148 GMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS-QCYKQSDPVFDPADSASF 206
           G+  GSG Y + +G G+P R+Q +V D+GSD+ W+QC+PC+ +CY Q +P+FDP+ S+++
Sbjct: 8   GLFIGSGNYVITVGFGTPTRTQTVVFDTGSDVNWLQCKPCAVRCYAQQEPLFDPSLSSTY 67

Query: 207 SGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCG 265
             VSC+   C  L   GC +  C Y V YGDGS T G LA++T  +      KN   GCG
Sbjct: 68  RNVSCTEPACVGLSTRGCSSSTCLYGVFYGDGSSTIGFLAMDTFMLTPAQKFKNFIFGCG 127

Query: 266 HKNQGMFVGAAGLLGLGGGSM-SLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPV 324
             N G+F G AGL+GLG  S  SL  Q+    G  FSYCL S  + ++G L  G      
Sbjct: 128 QNNTGLFQGTAGLVGLGRSSTYSLNSQVAPSLGNVFSYCLPSTSS-ATGYLNIGNPQNTP 186

Query: 325 GAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLP 384
           G  +  ++ + R P+ Y++ L G+ VGG R+ +S  +F+       G ++D+GT +TRLP
Sbjct: 187 G--YTAMLTDTRVPTLYFIDLIGISVGGTRLSLSSTVFQ-----SVGTIIDSGTVITRLP 239

Query: 385 TPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASN 444
             AY A + A  A       A  V+I DTCY+ S   SV  P +  +F+G  V  +PA+ 
Sbjct: 240 PTAYSALKTAVRAAMTQYTLAPAVTILDTCYDFSRTTSVVYPVIVLHFAGLDV-RIPATG 298

Query: 445 FLIPVDDAGTFCFAFA--PSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
               V ++   C AFA     + + IIGN+QQ  +++++D     +GF    C
Sbjct: 299 VFF-VFNSSQVCLAFAGNTDSTMIGIIGNVQQLTMEVTYDNELKRIGFSAGAC 350


>gi|356537173|ref|XP_003537104.1| PREDICTED: uncharacterized protein LOC100817302 [Glycine max]
          Length = 328

 Score =  228 bits (580), Expect = 7e-57,   Method: Compositional matrix adjust.
 Identities = 108/142 (76%), Positives = 121/142 (85%)

Query: 354 RIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDT 413
           ++ ISEDL+R+T +GD+G VMDTG  VTRLPT AY AFRDAFVAQT NLPRA GVSIF+T
Sbjct: 187 QLNISEDLYRVTDLGDEGAVMDTGITVTRLPTVAYGAFRDAFVAQTTNLPRAPGVSIFNT 246

Query: 414 CYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQ 473
           CY+L+GFV+VRVPTV FYFSGG +LT+   NFLIP DD GTF FAFA SPS LSIIGNIQ
Sbjct: 247 CYDLNGFVTVRVPTVLFYFSGGQILTILTQNFLIPADDVGTFYFAFAASPSALSIIGNIQ 306

Query: 474 QEGIQISFDGANGFVGFGPNVC 495
           QEGIQIS DGANGF+GFG NVC
Sbjct: 307 QEGIQISVDGANGFLGFGRNVC 328


>gi|357494221|ref|XP_003617399.1| 60S ribosomal protein L18a [Medicago truncatula]
 gi|355518734|gb|AET00358.1| 60S ribosomal protein L18a [Medicago truncatula]
          Length = 749

 Score =  228 bits (580), Expect = 7e-57,   Method: Compositional matrix adjust.
 Identities = 141/413 (34%), Positives = 209/413 (50%), Gaps = 29/413 (7%)

Query: 110 ARMQRDVKRVATLVRR----LSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSP 165
           +R+Q+  K+     +     +S   A + ++  Q   T + SG+  GSGEYF+ + +G+P
Sbjct: 143 SRLQKSTKKQTNSKQSYKPAVSPVAAASPEYSSQLVAT-LESGVSLGSGEYFMDVFIGTP 201

Query: 166 PRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVC------DRL 219
           P+   +++D+GSD+ W+QC PC  C++QS P +DP +S+SF  ++C    C      D  
Sbjct: 202 PKHYSLILDTGSDLNWIQCVPCIACFEQSGPYYDPKESSSFENITCHDPRCKLVSSPDPP 261

Query: 220 ENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTV---------VKNVAIGCGHKNQG 270
           +        C Y   YGD S T G  ALET T+  T          V+NV  GCGH N+G
Sbjct: 262 KPCKDENQTCPYFYWYGDSSNTTGDFALETFTVNLTTPNGKSEQKHVENVMFGCGHWNRG 321

Query: 271 MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRG--TGSSGSLVFGREALPVGAAW 328
           +F GAAGLLGLG G +S   QL    G +FSYCLV R   T  S  L+FG +   +    
Sbjct: 322 LFHGAAGLLGLGRGPLSFASQLQSIYGHSFSYCLVDRNSDTSVSSKLIFGEDKELLSHPN 381

Query: 329 VPLV-----RNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRL 383
           +              +FYYVG+  + V G  + I E+ + L++ G  G ++D+GT +T  
Sbjct: 382 LNFTSFVGGEENSVDTFYYVGIKSIMVDGEVLKIPEETWHLSKEGGGGTIIDSGTTLTYF 441

Query: 384 PTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPAS 443
             PAYE  ++AF+ +        G      CYN+SG   + +P     FS G +   P  
Sbjct: 442 AEPAYEIIKEAFMKKIKGYELVEGFPPLKPCYNVSGIEKMELPDFGILFSDGAMWDFPVE 501

Query: 444 NFLIPVDDAGTFCFAFAPSP-SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           N+ I + +    C A   +P S LSIIGN QQ+   I +D     +G+ P  C
Sbjct: 502 NYFIQI-EPDLVCLAILGTPKSALSIIGNYQQQNFHILYDMKKSRLGYAPMKC 553


>gi|359497446|ref|XP_003635520.1| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
           vinifera]
          Length = 354

 Score =  228 bits (580), Expect = 7e-57,   Method: Compositional matrix adjust.
 Identities = 138/362 (38%), Positives = 210/362 (58%), Gaps = 27/362 (7%)

Query: 148 GMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS-QCYKQSDPVFDPADSASF 206
           G   GSG Y+V++G+GSP R   M++D+GS + W+QC+PC   C+ Q+DP+FDP+ S ++
Sbjct: 5   GASIGSGNYYVKVGLGSPARYYSMIVDTGSSLSWLQCKPCVVYCHVQADPLFDPSASKTY 64

Query: 207 SGVSCSSAVCDRLENAGCH-------AGRCRYEVSYGDGSYTKGTLALETLTIGRT-VVK 258
             +SC+S+ C  L +A  +       +  C Y  SYGD SY+ G L+ + LT+  +  + 
Sbjct: 65  KSLSCTSSQCSSLVDATLNNPLCETSSNVCVYTASYGDSSYSMGYLSQDLLTLAPSQTLP 124

Query: 259 NVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFG 318
               GCG  ++G+F  AAG+LGLG   +S++GQ+  + G AFSYCL +RG G  G L  G
Sbjct: 125 GFVYGCGQDSEGLFGRAAGILGLGRNKLSMLGQVSSKFGYAFSYCLPTRGGG--GFLSIG 182

Query: 319 REALPVGAAW--VPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDT 376
           + +L  G+A+   P+  +P  PS Y++ L+ + VGG  + ++   +R+        ++D+
Sbjct: 183 KASL-AGSAYKFTPMTTDPGNPSLYFLRLTAITVGGRALGVAAAQYRVP------TIIDS 235

Query: 377 GTAVTRLPTPAYEAFRDAFVA-QTGNLPRASGVSIFDTCY--NLSGFVSVRVPTVSFYFS 433
           GT +TRLP   Y  F+ AFV   +    RA G SI DTC+  NL    S  VP V   F 
Sbjct: 236 GTVITRLPMSVYTPFQQAFVKIMSSKYARAPGFSILDTCFKGNLKDMQS--VPEVRLIFQ 293

Query: 434 GGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPN 493
           GG  L L   N L+ VD+ G  C AFA + +G++IIGN QQ+  +++ D +   +GF   
Sbjct: 294 GGADLNLRPVNVLLQVDE-GLTCLAFAGN-NGVAIIGNHQQQTFKVAHDISTARIGFATG 351

Query: 494 VC 495
            C
Sbjct: 352 GC 353


>gi|255548662|ref|XP_002515387.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223545331|gb|EEF46836.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 463

 Score =  228 bits (580), Expect = 7e-57,   Method: Compositional matrix adjust.
 Identities = 167/503 (33%), Positives = 236/503 (46%), Gaps = 72/503 (14%)

Query: 17  HLLCSIITTSTSAASDTHFQILNVNESIKGSRTDHAKMSQYNELFERHNNISSSN----T 72
           HLLC  +  S S      F+         G +       Q     E  N + S++    T
Sbjct: 8   HLLCLCLVISLSTTYAFGFE---------GRKIAQENHLQLIHAIEISNLLPSADCEHST 58

Query: 73  SSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGG-- 130
              + + +L++VH+    S  N  N      +  +    +  D  RV ++  +LS     
Sbjct: 59  KVAQNKASLKVVHKHGPCSQLNQQNG-----NAPNLVEILLEDQSRVDSIHAKLSDHSGV 113

Query: 131 --ADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS 188
              DAAK   +       SGM  G+G Y V IG+GSP +   ++ D+GSD+ W +C    
Sbjct: 114 KETDAAKLPTK-------SGMSLGTGNYIVSIGLGSPKKDLMLIFDTGSDLTWARCS--- 163

Query: 189 QCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAG-----CHAGRCRYEVSYGDGSYTKG 243
                +   FDP  S S++ VSCS+ +C  + +A      C A  C Y + YGDGSY+ G
Sbjct: 164 -----AAETFDPTKSTSYANVSCSTPLCSSVISATGNPSRCAASTCVYGIQYGDGSYSIG 218

Query: 244 TLALETLTIGRT-VVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSY 302
            L  E LTIG T +  N   GCG    G+F  AAGLLGLG   +S+V Q   +    FSY
Sbjct: 219 FLGKERLTIGSTDIFNNFYFGCGQDVDGLFGKAAGLLGLGRDKLSVVSQTAPKYNQLFSY 278

Query: 303 CLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLF 362
           CL S  + S+G L FG  +    A + PL   P   SFY + L+G+ VGG ++ I   +F
Sbjct: 279 CLPS--SSSTGFLSFG-SSQSKSAKFTPLSSGPS--SFYNLDLTGITVGGQKLAIPLSVF 333

Query: 363 RLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVS 422
                   G ++D+GT VTRLP  AY A R AF     + P    +SI DTCY+ S + +
Sbjct: 334 STA-----GTIIDSGTVVTRLPPAAYSALRSAFRKAMASYPMGKPLSILDTCYDFSKYKT 388

Query: 423 VRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTF--------CFAFAPSPSG--LSIIGNI 472
           ++VP +   FSGG           + VD AG F        C AFA +      +I GN 
Sbjct: 389 IKVPKIVISFSGG---------VDVDVDQAGIFVANGLKQVCLAFAGNTGARDTAIFGNT 439

Query: 473 QQEGIQISFDGANGFVGFGPNVC 495
           QQ   ++ +D + G VGF P  C
Sbjct: 440 QQRNFEVVYDVSGGKVGFAPASC 462


>gi|357492389|ref|XP_003616483.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355517818|gb|AES99441.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 434

 Score =  227 bits (579), Expect = 8e-57,   Method: Compositional matrix adjust.
 Identities = 147/433 (33%), Positives = 223/433 (51%), Gaps = 45/433 (10%)

Query: 79  WNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEV 138
           +++EL+HRD   S          +++QH F    +R + R               A H  
Sbjct: 28  FSVELIHRDSPKSPYYKPTE---NKYQH-FVDAARRSINR---------------ANHFF 68

Query: 139 QDFGTDVV-SGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPV 197
           +D  T    S +    G Y +   VG+PP   Y + D+GSDIVW+QC+PC QCY Q+ P+
Sbjct: 69  KDSDTSTPESTVIPDRGGYLMTYSVGTPPTKIYGIADTGSDIVWLQCEPCEQCYNQTTPI 128

Query: 198 FDPADSASFSGVSCSSAVCDRLENAGC-HAGRCRYEVSYGDGSYTKGTLALETLTIGRT- 255
           F+P+ S+S+  + CSS +C  + +  C     C+Y++SYGD S+++G L+++TL++  T 
Sbjct: 129 FNPSKSSSYKNIPCSSKLCHSVRDTSCSDQNSCQYKISYGDSSHSQGDLSVDTLSLESTS 188

Query: 256 ----VVKNVAIGCGHKNQGMFVGA-AGLLGLGGGSMSLVGQLGGQTGGAFSYCLV---SR 307
                   + IGCG  N G F GA +G++GLGGG +SL+ QLG   GG FSYCLV   ++
Sbjct: 189 GSPVSFPKIVIGCGTDNAGTFGGASSGIVGLGGGPVSLITQLGSSIGGKFSYCLVPLLNK 248

Query: 308 GTGSSGSLVFGREALPVGAAWV--PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLT 365
            + +S  L FG  A+  G   V  PL++  + P FY++ L    VG  R+         +
Sbjct: 249 ESNASSILSFGDAAVVSGDGVVSTPLIK--KDPVFYFLTLQAFSVGNKRVEFGGS----S 302

Query: 366 QMGDD--GVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVS-IFDTCYNLSGFVS 422
           + GDD   +++D+GT +T +P+  Y     A V     L R    +  F  CY+L     
Sbjct: 303 EGGDDEGNIIIDSGTTLTLIPSDVYTNLESA-VVDLVKLDRVDDPNQQFSLCYSLKS-NE 360

Query: 423 VRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFD 482
              P ++ +F G  V     S F +P+ D G  CFAF PSP   SI GN+ Q+ + + +D
Sbjct: 361 YDFPIITVHFKGADVELHSISTF-VPITD-GIVCFAFQPSPQLGSIFGNLAQQNLLVGYD 418

Query: 483 GANGFVGFGPNVC 495
                V F P  C
Sbjct: 419 LQQKTVSFKPTDC 431


>gi|242044724|ref|XP_002460233.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
 gi|241923610|gb|EER96754.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
          Length = 512

 Score =  227 bits (579), Expect = 9e-57,   Method: Compositional matrix adjust.
 Identities = 136/375 (36%), Positives = 200/375 (53%), Gaps = 31/375 (8%)

Query: 145 VVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSA 204
           V SG    +  Y   +G+G    +  +++D+ S++ WVQC PC  C+ Q DP+FDP+ S 
Sbjct: 142 VTSGAKLRTLNYVATVGLGGGEAT--VIVDTASELTWVQCAPCESCHDQQDPLFDPSSSP 199

Query: 205 SFSGVSCSSAVCDRLE---------NAGCH-----AGRCRYEVSYGDGSYTKGTLALETL 250
           S++ V C+S+ CD L+          A C      A  C Y +SY DGSY++G LA + L
Sbjct: 200 SYAAVPCNSSSCDALQLATGGTSGGAAACQGQDQSAAACSYTLSYRDGSYSRGVLAHDRL 259

Query: 251 TIGRTVVKNVAIGCGHKNQG-MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGT 309
           ++   V+     GCG  NQG  F G +GL+GLG   +SLV Q   Q GG FSYCL  + +
Sbjct: 260 SLAGEVIDGFVFGCGTSNQGPPFGGTSGLMGLGRSQLSLVSQTMDQFGGVFSYCLPLKES 319

Query: 310 GSSGSLVFG------REALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFR 363
            SSGSLV G      R + P+   +  +V +P    FY+V L+G+ VGG  +   E    
Sbjct: 320 DSSGSLVIGDDSSVYRNSTPI--VYASMVSDPLQGPFYFVNLTGITVGGQEV---ESSGF 374

Query: 364 LTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSV 423
            +  G    ++D+GT +T L    Y A +  F++Q    P+A G SI DTC+N++G   V
Sbjct: 375 SSGGGGGKAIIDSGTVITSLVPSIYNAVKAEFLSQFAEYPQAPGFSILDTCFNMTGLREV 434

Query: 424 RVPTVSFYFSGGPVLTLPASNFLIPV-DDAGTFCFAFAP--SPSGLSIIGNIQQEGIQIS 480
           +VP++   F GG  + + +   L  V  D+   C A AP  S    +IIGN QQ+ +++ 
Sbjct: 435 QVPSLKLVFDGGVEVEVDSGGVLYFVSSDSSQVCLAMAPLKSEYETNIIGNYQQKNLRVI 494

Query: 481 FDGANGFVGFGPNVC 495
           FD +   VGF    C
Sbjct: 495 FDTSGSQVGFAQETC 509


>gi|356553775|ref|XP_003545228.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 559

 Score =  227 bits (579), Expect = 9e-57,   Method: Compositional matrix adjust.
 Identities = 138/372 (37%), Positives = 198/372 (53%), Gaps = 24/372 (6%)

Query: 147 SGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASF 206
           SG+  GSGEYF+ + VG+PP+   +++D+GSD+ W+QC PC  C++QS P +DP DS+SF
Sbjct: 186 SGVSLGSGEYFMDVFVGTPPKHFSLILDTGSDLNWIQCVPCIACFEQSGPYYDPKDSSSF 245

Query: 207 SGVSCSSAVCDRLENAG----CHAGR--CRYEVSYGDGSYTKGTLALETLTIGRTV---- 256
             +SC    C  + +      C A    C Y   YGDGS T G  ALET T+  T     
Sbjct: 246 RNISCHDPRCQLVSSPDPPNPCKAENQSCPYFYWYGDGSNTTGDFALETFTVNLTTPNGK 305

Query: 257 -----VKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGS 311
                V+NV  GCGH N+G+F GAAGLLGLG G +S   Q+    G +FSYCLV R + +
Sbjct: 306 SELKHVENVMFGCGHWNRGLFHGAAGLLGLGKGPLSFASQMQSLYGQSFSYCLVDRNSNA 365

Query: 312 SGS--LVFGREALPVGAAWVPLV-----RNPRAPSFYYVGLSGLGVGGMRIPISEDLFRL 364
           S S  L+FG +   +    +        ++    +FYYV ++ + V    + I E+ + L
Sbjct: 366 SVSSKLIFGEDKELLSHPNLNFTSFGGGKDGSVDTFYYVQINSVMVDDEVLKIPEETWHL 425

Query: 365 TQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVR 424
           +  G  G ++D+GT +T    PAYE  ++AFV +        G+     CYN+SG   + 
Sbjct: 426 SSEGAGGTIIDSGTTLTYFAEPAYEIIKEAFVRKIKGYELVEGLPPLKPCYNVSGIEKME 485

Query: 425 VPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSP-SGLSIIGNIQQEGIQISFDG 483
           +P     F+ G V   P  N+ I + D    C A   +P S LSIIGN QQ+   I +D 
Sbjct: 486 LPDFGILFADGAVWNFPVENYFIQI-DPDVVCLAILGNPRSALSIIGNYQQQNFHILYDM 544

Query: 484 ANGFVGFGPNVC 495
               +G+ P  C
Sbjct: 545 KKSRLGYAPMKC 556


>gi|195625122|gb|ACG34391.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
          Length = 471

 Score =  227 bits (578), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 147/416 (35%), Positives = 213/416 (51%), Gaps = 39/416 (9%)

Query: 108 FHARMQRDVKRVATLVRRLSG------------------GGADAAKHEVQDFGTDVV--S 147
           F   +  D  RVA L  RL+                   GGA    H   D    V    
Sbjct: 66  FSTVLTHDDARVAHLASRLAASDPPSRRPTSLRKQKKAAGGASGGHHLDDDSLASVPLSP 125

Query: 148 GMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS-QCYKQSDPVFDPADSASF 206
           G   G G Y  ++G+G+P  S  MV+D+GS + W+QC PC   C++Q  P+FDP  S+++
Sbjct: 126 GTSVGVGNYVTQLGLGTPSTSYAMVVDTGSSLTWLQCSPCVVSCHRQVGPLFDPRASSTY 185

Query: 207 SGVSCSSAVCDRLENA-----GCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRTVVKNV 260
           + V CS++ CD L+ A      C A   C Y+ SYGD S++ G+L+ +T++ G T   + 
Sbjct: 186 ASVRCSASQCDELQAATLNPSACSASNVCIYQASYGDSSFSVGSLSTDTVSFGSTRYPSF 245

Query: 261 AIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGRE 320
             GCG  N+G+F  +AGL+GL    +SL+ QL    G +FSYCL +    S+G L  G  
Sbjct: 246 YYGCGQDNEGLFGRSAGLIGLARNKLSLLYQLAPSLGYSFSYCLPT--AASTGYLSIGPY 303

Query: 321 ALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAV 380
                 ++ P+  +    S Y++ LSG+ VGG  + +S      ++      ++D+GT +
Sbjct: 304 NTGHYYSYTPMASSSLDASLYFITLSGMSVGGSPLAVSP-----SEYSSLPTIIDSGTVI 358

Query: 381 TRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVS-VRVPTVSFYFSGGPVLT 439
           TRLPT  + A   A         RA   SI DTC+   G  S +RVPTV+  F+GG  + 
Sbjct: 359 TRLPTAVHTALSKAVAQAMAGAQRAPAFSILDTCFE--GQASQLRVPTVAMAFAGGASMK 416

Query: 440 LPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           L   N LI VDD+ T C AFAP+ S  +IIGN QQ+   + +D A   +GF    C
Sbjct: 417 LTTRNVLIDVDDSTT-CLAFAPTDS-TAIIGNTQQQTFSVIYDVAQSRIGFSAGGC 470


>gi|449519146|ref|XP_004166596.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 752

 Score =  226 bits (576), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 135/373 (36%), Positives = 192/373 (51%), Gaps = 24/373 (6%)

Query: 147 SGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASF 206
           SG+  GSGEYF+ + +GSPP+   +++D+GSD+ W+QC PC  C++Q+ P +DP DS SF
Sbjct: 187 SGVSLGSGEYFIDVFIGSPPKHFSLILDTGSDLNWIQCVPCFDCFEQNGPYYDPKDSISF 246

Query: 207 SGVSCSSAVC------DRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTV---- 256
             ++C+   C      D           C Y   YGD S T G  ALET T+  T     
Sbjct: 247 RNITCNDPRCQLVSSPDPPRPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTSSTTG 306

Query: 257 ------VKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR--G 308
                 V+NV  GCGH N+G+F GAAGLLGLG G +S   QL    G +FSYCLV R   
Sbjct: 307 KSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRDSD 366

Query: 309 TGSSGSLVFGREALPVGAA---WVPLVRNPRAP--SFYYVGLSGLGVGGMRIPISEDLFR 363
           T  S  L+FG +   +      +  L+     P  +FYY+ +  + VGG ++ I E+ + 
Sbjct: 367 TSVSSKLIFGEDKDLLTHPELNFTSLIAGKENPVDTFYYLQIKSIFVGGEKLQIPEENWN 426

Query: 364 LTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSV 423
           L+  G  G ++D+GT ++    PAY   ++AF+ +           I   CYN+SG   +
Sbjct: 427 LSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKLVEDFPILHPCYNVSGTDEL 486

Query: 424 RVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSP-SGLSIIGNIQQEGIQISFD 482
             P     F+ G V   P  N+ I +      C A   +P S LSIIGN QQ+   I +D
Sbjct: 487 NFPEFLIQFADGAVWNFPVENYFIRIQQLDIVCLAMLGTPKSALSIIGNYQQQNFHILYD 546

Query: 483 GANGFVGFGPNVC 495
             N  +G+ P  C
Sbjct: 547 TKNSRLGYAPMRC 559


>gi|413953772|gb|AFW86421.1| hypothetical protein ZEAMMB73_098827 [Zea mays]
          Length = 482

 Score =  226 bits (576), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 160/440 (36%), Positives = 221/440 (50%), Gaps = 43/440 (9%)

Query: 80  NLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQ 139
            +++VHR  + S    T   H   H H +   ++RD  RV ++ RRL+G G  AA     
Sbjct: 61  TIQIVHRACLQSGDRKTVPDH---HPH-YTGILRRDHNRVRSIHRRLTGAGDTAAT---- 112

Query: 140 DFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQ-CYKQSDPVF 198
                   G+   S EY V IG+G+P R+  ++ D+GSD+ WVQC+PC+  CY+Q +P+F
Sbjct: 113 ---IPASLGLAFHSLEYVVTIGIGTPARNFTVLFDTGSDLTWVQCKPCTDSCYQQQEPLF 169

Query: 199 DPADSASFSGVSCSSAVCDR--LENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTV 256
           DP+ S+++  V C +  C     ++  C    C Y V YGD S T+G LA E  T+  + 
Sbjct: 170 DPSKSSTYVDVPCGTPQCKIGGGQDLTCGGTTCEYSVKYGDQSVTRGNLAQEAFTLSPSA 229

Query: 257 --VKNVAIGCGHKNQGMFVGA------AGLLGLGGGSMSLVGQL-GGQTGGAFSYCLVSR 307
                V  GC H+      GA      AGLLGLG G  S++ Q   G +G  FSYCL  R
Sbjct: 230 PPAAGVVFGCSHEYSSGVKGAEEEMSVAGLLGLGRGDSSILSQTRRGNSGDVFSYCLPPR 289

Query: 308 GTGSSGSLVFGREALP-VGAAWVPLVR-NPRAPSFYYVGLSGLGVGGMRIPISEDLFRLT 365
           G+ S+G L  G  A P    ++ PLV  N +  S Y V L G+ V G  +PI    F + 
Sbjct: 290 GS-SAGYLTIGAAAPPQSNLSFTPLVTDNSQLSSVYVVNLVGISVSGAALPIDASAFYI- 347

Query: 366 QMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGN---LPRASGVSIFDTCYNLSGFVS 422
                G V+D+GT +T +P  AY   RD F    G    LP    V   DTCY+++G   
Sbjct: 348 -----GTVIDSGTVITHMPAAAYYVLRDEFRRHMGGYTMLPEGH-VESLDTCYDVTGHDV 401

Query: 423 VRVPTVSFYFSGGPVLTLPASNFLI--PVDDAGT----FCFAFAPSP-SGLSIIGNIQQE 475
           V  P V+  F GG  + + AS  L+   VD +G      C AF P+   G  IIGN+QQ 
Sbjct: 402 VTAPPVALEFGGGARIDVDASGILLVFAVDASGQSLTLACLAFVPTNLPGFVIIGNMQQR 461

Query: 476 GIQISFDGANGFVGFGPNVC 495
              + FD     +GFG N C
Sbjct: 462 AYNVVFDVEGRRIGFGANGC 481


>gi|449453872|ref|XP_004144680.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
           CELL 1-like [Cucumis sativus]
          Length = 757

 Score =  226 bits (575), Expect = 3e-56,   Method: Compositional matrix adjust.
 Identities = 135/373 (36%), Positives = 192/373 (51%), Gaps = 24/373 (6%)

Query: 147 SGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASF 206
           SG+  GSGEYF+ + +GSPP+   +++D+GSD+ W+QC PC  C++Q+ P +DP DS SF
Sbjct: 187 SGVSLGSGEYFIDVFIGSPPKHFSLILDTGSDLNWIQCVPCFDCFEQNGPYYDPKDSISF 246

Query: 207 SGVSCSSAVC------DRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTV---- 256
             ++C+   C      D           C Y   YGD S T G  ALET T+  T     
Sbjct: 247 RNITCNDPRCQLVSSPDPPRPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTSSTTG 306

Query: 257 ------VKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR--G 308
                 V+NV  GCGH N+G+F GAAGLLGLG G +S   QL    G +FSYCLV R   
Sbjct: 307 KSEFRRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRDSD 366

Query: 309 TGSSGSLVFGREALPVGAA---WVPLVRNPRAP--SFYYVGLSGLGVGGMRIPISEDLFR 363
           T  S  L+FG +   +      +  L+     P  +FYY+ +  + VGG ++ I E+ + 
Sbjct: 367 TSVSSKLIFGEDKDLLTHPELNFTSLIAGKENPVDTFYYLQIKSIFVGGEKLQIPEENWN 426

Query: 364 LTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSV 423
           L+  G  G ++D+GT ++    PAY   ++AF+ +           I   CYN+SG   +
Sbjct: 427 LSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKLVEDFPILHPCYNVSGTDEL 486

Query: 424 RVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSP-SGLSIIGNIQQEGIQISFD 482
             P     F+ G V   P  N+ I +      C A   +P S LSIIGN QQ+   I +D
Sbjct: 487 NFPEFLIQFADGAVWNFPVENYFIRIQQLDIVCLAMLGTPKSALSIIGNYQQQNFHILYD 546

Query: 483 GANGFVGFGPNVC 495
             N  +G+ P  C
Sbjct: 547 TKNSRLGYAPMRC 559


>gi|357158688|ref|XP_003578209.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 443

 Score =  225 bits (574), Expect = 4e-56,   Method: Compositional matrix adjust.
 Identities = 130/362 (35%), Positives = 188/362 (51%), Gaps = 28/362 (7%)

Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSS 213
           GEY + +G+G+PPR    ++D+GSD++W QC PC  C  Q  P FDPA S S++ + C+S
Sbjct: 87  GEYLMSMGIGTPPRYYSAILDTGSDLIWTQCAPCMLCVDQPTPFFDPAQSPSYAKLPCNS 146

Query: 214 AVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIG----RTVVKNVAIGCGHKNQ 269
            +C+ L    C+   C Y+  YGD + T G L+ ET T G    R  V  +A GCG+ N 
Sbjct: 147 PMCNALYYPLCYRNVCVYQYFYGDSANTAGVLSNETFTFGTNDTRVTVPRIAFGCGNLNA 206

Query: 270 GMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREAL------- 322
           G     +G++G G G +SLV QLG      FSYCL S  +     L FG  A        
Sbjct: 207 GSLFNGSGMVGFGRGPLSLVSQLGSPR---FSYCLTSFMSPVPSRLYFGAYATLNSTSAS 263

Query: 323 ---PVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQM-GDDGVVMDTGT 378
              PV +   P + NP  P+ YY+ ++G+ VGG  +PI   +F +    G  GV++D+G+
Sbjct: 264 TGEPVQS--TPFIVNPGLPTMYYLNMTGISVGGELLPIDPSVFAINDADGTGGVIIDSGS 321

Query: 379 AVTRLPTPAYEAFRDAFVAQTGNLPRASGVS---IFDTCYNLSGFVS--VRVPTVSFYFS 433
            +T L   AY+    AF  Q G LP  +  S   + DTC+         V +P ++F+F 
Sbjct: 322 TITYLARAAYDMVHQAFADQVG-LPLTNATSLADVLDTCFVWPPPPRKIVTMPELAFHFE 380

Query: 434 GGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPN 493
           G   + LP  N+++   D G  C A A S  G SIIG+ Q +   + +D  N  + F P 
Sbjct: 381 GA-NMELPLENYMLIDGDTGNLCLAIAASDDG-SIIGSFQHQNFHVLYDNENSLLSFTPA 438

Query: 494 VC 495
            C
Sbjct: 439 TC 440


>gi|357153697|ref|XP_003576537.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 474

 Score =  225 bits (574), Expect = 4e-56,   Method: Compositional matrix adjust.
 Identities = 142/409 (34%), Positives = 208/409 (50%), Gaps = 34/409 (8%)

Query: 112 MQRDVKRVATLVRRLSGGGADAAKHEVQD----FGTDVVSGMDQGSGEYFVRIGVGSPPR 167
           +  D  RV++L RR+    + +   E +         + SG +  +  Y   +G+G+   
Sbjct: 72  LSSDAARVSSLQRRIESYRSSSEGEEEEASKLALQVPITSGANLRTLNYVATVGLGAAEA 131

Query: 168 SQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGC--- 224
           +  +V+D+ S++ WVQCQPC  C+ Q DP+FDP+ S S++ V C+S+ CD L  A     
Sbjct: 132 T--VVVDTASELTWVQCQPCESCHDQQDPLFDPSSSPSYAAVPCNSSSCDALRVAMAAGT 189

Query: 225 --------HAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGM-FVGA 275
                       C Y +SY DGSY++G LA + L +    ++    GCG  NQG  F G 
Sbjct: 190 SPCADDNEQQPACSYALSYRDGSYSRGVLARDKLRLAGQDIEGFVFGCGTSNQGAPFGGT 249

Query: 276 AGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFG------REALPVGAAWV 329
           +GL+GLG   +SLV Q   Q GG FSYCL  R +GSSGSLV G      R + P+    +
Sbjct: 250 SGLMGLGRSHVSLVSQTMDQFGGVFSYCLPMRESGSSGSLVLGDDSSAYRNSTPIVYTAM 309

Query: 330 PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYE 389
                P    FY++ L+G+ VGG    +    F   +     V++D+GT +T L    Y 
Sbjct: 310 VSDSGPLQGPFYFLNLTGITVGGQE--VESPWFSAGR-----VIIDSGTIITTLVPSVYN 362

Query: 390 AFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPV 449
           A R  F++Q    P+A   SI DTC+NL+G   V+VP++ F F G   + + +   L  V
Sbjct: 363 AVRAEFLSQLAEYPQAPAFSILDTCFNLTGLKEVQVPSLKFVFEGSVEVEVDSKGVLYFV 422

Query: 450 -DDAGTFCFAFA--PSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
             DA   C A A   S    SIIGN QQ+ +++ FD     +GF    C
Sbjct: 423 SSDASQVCLALASLKSEYDTSIIGNYQQKNLRVIFDTLGSQIGFAQETC 471


>gi|226494448|ref|NP_001141341.1| uncharacterized protein LOC100273432 precursor [Zea mays]
 gi|194704078|gb|ACF86123.1| unknown [Zea mays]
 gi|413953775|gb|AFW86424.1| aspartic proteinase nepenthesin-2 [Zea mays]
          Length = 471

 Score =  225 bits (574), Expect = 4e-56,   Method: Compositional matrix adjust.
 Identities = 146/416 (35%), Positives = 211/416 (50%), Gaps = 39/416 (9%)

Query: 108 FHARMQRDVKRVATLVRRLSG------------------GGADAAKHEVQD--FGTDVVS 147
           F   +  D  RVA L  RL+                   GGA    H   D      +  
Sbjct: 66  FSTVLTHDDARVAHLASRLAASDPPSRRPTSLRKQKKAAGGASGGHHLDDDSLASVPLSP 125

Query: 148 GMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS-QCYKQSDPVFDPADSASF 206
           G   G G Y  ++G+G+P  S  MV+D+GS + W+QC PC   C++Q  P+FDP  S+++
Sbjct: 126 GTSVGVGNYVTQLGLGTPSTSYAMVVDTGSSLTWLQCSPCVVSCHRQVGPLFDPRASSTY 185

Query: 207 SGVSCSSAVCDRLENA-----GCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRTVVKNV 260
           + V CS++ CD L+ A      C A   C Y+ SYGD S++ G L+ +T++ G T   + 
Sbjct: 186 TSVRCSASQCDELQAATLNPSACSASNVCIYQASYGDSSFSVGYLSTDTVSFGSTSYPSF 245

Query: 261 AIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGRE 320
             GCG  N+G+F  +AGL+GL    +SL+ QL    G +FSYCL +    S+G L  G  
Sbjct: 246 YYGCGQDNEGLFGRSAGLIGLARNKLSLLYQLAPSLGYSFSYCLPT--AASTGYLSIGPY 303

Query: 321 ALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAV 380
                 ++ P+  +    S Y++ LSG+ VGG  + +S      ++      ++D+GT +
Sbjct: 304 NTGHYYSYTPMASSSLDASLYFITLSGMSVGGSPLAVSP-----SEYSSLPTIIDSGTVI 358

Query: 381 TRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVS-VRVPTVSFYFSGGPVLT 439
           TRLPT  + A   A         RA   SI DTC+   G  S +RVPTV   F+GG  + 
Sbjct: 359 TRLPTAVHTALSKAVAQAMAGAQRAPAFSILDTCFE--GQASQLRVPTVVMAFAGGASMK 416

Query: 440 LPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           L   N LI VDD+ T C AFAP+ S  +IIGN QQ+   + +D A   +GF    C
Sbjct: 417 LTTRNVLIDVDDSTT-CLAFAPTDS-TAIIGNTQQQTFSVIYDVAQSRIGFSAGGC 470


>gi|357487631|ref|XP_003614103.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355515438|gb|AES97061.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 431

 Score =  225 bits (573), Expect = 5e-56,   Method: Compositional matrix adjust.
 Identities = 146/431 (33%), Positives = 225/431 (52%), Gaps = 44/431 (10%)

Query: 79  WNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEV 138
           ++ EL+HRD   SS +       ++ QH  +A  +R + R   L +       ++  +  
Sbjct: 28  FSFELIHRD---SSKSPLYKPAQNKFQHVVNA-ARRSINRANRLFKDSLSNTPESTVY-- 81

Query: 139 QDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVF 198
                  V+G     GEY +   VG+PP + Y V+D+GSDIVW+QC+PC QCYKQ+ P+F
Sbjct: 82  -------VNG-----GEYLMTYSVGTPPFNVYGVVDTGSDIVWLQCKPCEQCYKQTTPIF 129

Query: 199 DPADSASFSGVSCSSAVCDRLENAGCHA-GRCRYEVSYGDGSYTKGTLALETLTIGRTVV 257
           +P+ S+S+  + CSS +C  +    C+    C Y +++ D SY++G L++ETLT+  T  
Sbjct: 130 NPSKSSSYKNIPCSSNLCQSVRYTSCNKQNSCEYTINFSDQSYSQGELSVETLTLDSTTG 189

Query: 258 KNVA-----IGCGHKNQGMFVG-AAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR--GT 309
            +V+     IGCGH N+GMF G  +G++GLG G +SL  QL    GG FSYCL+     +
Sbjct: 190 HSVSFPKTVIGCGHNNRGMFQGETSGIVGLGIGPVSLTTQLKSSIGGKFSYCLLPLLVDS 249

Query: 310 GSSGSLVFGREALPVGAAWV--PLV-RNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQ 366
             +  L FG  A+  G   V  P V ++P+A  FYY+ L    VG  RI      F +  
Sbjct: 250 NKTSKLNFGDAAVVSGDGVVSTPFVKKDPQA--FYYLTLEAFSVGNKRIE-----FEVLD 302

Query: 367 MGDDG-VVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVS-IFDTCYNLSGFVSVR 424
             ++G +++D+GT +T LP+  Y     A VAQ   L R    + + + CY+++      
Sbjct: 303 DSEEGNIILDSGTTLTLLPSHVYTNLESA-VAQLVKLDRVDDPNQLLNLCYSITS-DQYD 360

Query: 425 VPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGA 484
            P ++ +F G  +   P S F    D  G  C AF  S +G  I GN+ Q  + + +D  
Sbjct: 361 FPIITAHFKGADIKLNPISTFAHVAD--GVVCLAFTSSQTG-PIFGNLAQLNLLVGYDLQ 417

Query: 485 NGFVGFGPNVC 495
              V F P+ C
Sbjct: 418 QNIVSFKPSDC 428


>gi|296082170|emb|CBI21175.3| unnamed protein product [Vitis vinifera]
          Length = 386

 Score =  224 bits (572), Expect = 7e-56,   Method: Compositional matrix adjust.
 Identities = 151/430 (35%), Positives = 207/430 (48%), Gaps = 68/430 (15%)

Query: 76  EARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHAR-MQRDVKRVATLVRRLSGGGADAA 134
           + R +LE+VH+    S       +  H+     H + + +D  RVA++  RL+   A  +
Sbjct: 14  DQRASLEVVHKHGPCS------KLRPHKANSPSHTQILAQDESRVASIQSRLAKNLAGGS 67

Query: 135 KHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC-SQCYKQ 193
             +         S    GSG Y V +G+GSP R    + D+GSD+ W QC+PC   CY+Q
Sbjct: 68  NLKASKATLPSKSASTLGSGNYVVTVGLGSPKRDLTFIFDTGSDLTWTQCEPCVGYCYQQ 127

Query: 194 SDPVFDPADSASFSGVSCSSAVCDRLENA-----GCHAGRCRYEVSYGDGSYTKGTLALE 248
            + +FDP+ S S+S VSC S  C++LE+A     GC +  C Y + YGDGSY+ G  A E
Sbjct: 128 REHIFDPSTSLSYSNVSCDSPSCEKLESATGNSPGCSSSTCLYGIRYGDGSYSIGFFARE 187

Query: 249 TLTIGRT-VVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR 307
            L++  T V  N   GCG  N+G+F G AGLLGL    +SLV Q   + G  FSYCL   
Sbjct: 188 KLSLTSTDVFNNFQFGCGQNNRGLFGGTAGLLGLARNPLSLVSQTAQKYGKVFSYCL-PS 246

Query: 308 GTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQM 367
            + S+G L FG                                       S D       
Sbjct: 247 SSSSTGYLSFG---------------------------------------SGD------- 260

Query: 368 GDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPT 427
           GD   V  T     RLP   Y + +  F     + PR  GVSI DTCY+LS + +V+VP 
Sbjct: 261 GDSKAVKFT----PRLPPTVYSSVQKVFRELMSDYPRVKGVSILDTCYDLSKYKTVKVPK 316

Query: 428 VSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFA--PSPSGLSIIGNIQQEGIQISFDGAN 485
           +  YFSGG  + L A   +I V      C AFA       ++IIGN+QQ+ I + +D A 
Sbjct: 317 IILYFSGGAEMDL-APEGIIYVLKVSQVCLAFAGNSDDDEVAIIGNVQQKTIHVVYDDAE 375

Query: 486 GFVGFGPNVC 495
           G VGF P+ C
Sbjct: 376 GRVGFAPSGC 385


>gi|357137345|ref|XP_003570261.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 458

 Score =  224 bits (571), Expect = 8e-56,   Method: Compositional matrix adjust.
 Identities = 159/428 (37%), Positives = 219/428 (51%), Gaps = 41/428 (9%)

Query: 91  SSSNTTNNMHYHRH----------QHSFHARMQRDVKRVATLVRRLS---GGGADAAKHE 137
           SSS TT  +  HRH          + +    ++RD  R   +  +LS   G G D  +  
Sbjct: 49  SSSGTTVPLS-HRHGPCSPAPSTVEPTMAELLRRDQLRAKYIQAKLSVNSGSGTDGVQQS 107

Query: 138 VQ-DFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDP 196
                 T + S +D  +  Y + + +G+P  +Q ++ID+GSD+ WV C   ++    S  
Sbjct: 108 AAITLPTTLGSALD--TLAYVITVSIGTPAMTQAVMIDTGSDVSWVHCH--ARAGAGSSL 163

Query: 197 VFDPADSASFSGVSCSSAVCDRLE--NAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIG 253
            FDP  S++++  SCSSA C RLE  + GC     C+Y V YGDGS T GT   +TL + 
Sbjct: 164 FFDPGKSSTYTPFSCSSAACTRLEGRDNGCSLNSTCQYTVRYGDGSNTTGTYGSDTLALN 223

Query: 254 RT-VVKNVAIGCGHKN---QGMFVGAA-GLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRG 308
            T  V+N   GC   +   +G+      GL+GLGGG+ SLV Q     G AFSYCL +  
Sbjct: 224 STEKVENFQFGCSETSDPGEGLDEDQTDGLMGLGGGAPSLVSQTAATYGSAFSYCLPAT- 282

Query: 309 TGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMG 368
           T SSG L  G      G    P+ R+ RAP+FY+V L G+ VGG  + IS  +F      
Sbjct: 283 TRSSGFLTLGASTGTSGFVTTPMFRSRRAPTFYFVILQGINVGGDPVAISPTVFAA---- 338

Query: 369 DDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTV 428
             G +MD+GT +TRLP  AY A   AF A     PRA   SI DTC++ +G  +V +P V
Sbjct: 339 --GSIMDSGTIITRLPPRAYSALSAAFRAGMRRYPRARAFSILDTCFDFTGQDNVSIPAV 396

Query: 429 SFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGL-SIIGNIQQEGIQISFDGANGF 487
              FSGG V+ L A   +         C AFAP+  G+ SIIGN+QQ   ++  D     
Sbjct: 397 ELVFSGGAVVDLDADGIMY------GSCLAFAPATGGIGSIIGNVQQRTFEVLHDVGQSV 450

Query: 488 VGFGPNVC 495
           +GF P  C
Sbjct: 451 LGFRPGAC 458


>gi|242046218|ref|XP_002460980.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
 gi|241924357|gb|EER97501.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
          Length = 517

 Score =  224 bits (571), Expect = 8e-56,   Method: Compositional matrix adjust.
 Identities = 159/449 (35%), Positives = 224/449 (49%), Gaps = 41/449 (9%)

Query: 88  KMSSSSNTTNNMHYHRH--------QHSFHARMQRDVKRVATLVRRLSGGGADAA----- 134
           K  +S + +  +H +R         + S      +D  R+ T+ RR +  G D       
Sbjct: 66  KQPASLSPSLKLHMNRRAAEGGRTRKESVLDLADKDAVRIETMHRRAARSGGDRTPASPS 125

Query: 135 ----KHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQC 190
               +   +     V SG+  GSGEY + + VG+PPR   M++D+GSD+ W+QC PC  C
Sbjct: 126 SSPRRALSERMVATVESGVAVGSGEYLMDVYVGTPPRRFRMIMDTGSDLNWLQCAPCLDC 185

Query: 191 YKQSDPVFDPADSASFSGVSCSSAVCDRLENA----GCH---AGRCRYEVSYGDGSYTKG 243
           + Q  PVFDPA S+S+  V+C    C  +        C       C Y   YGD S T G
Sbjct: 186 FDQVGPVFDPAASSSYRNVTCGDQRCGLVAPPEPPRACRRPGEDSCPYYYWYGDQSNTTG 245

Query: 244 TLALETLTIGRTV------VKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTG 297
            LALE+ T+  T       V +V  GCGH N+G+F GAAGLLGLG G +S   QL    G
Sbjct: 246 DLALESFTVNLTAPGASRRVDDVVFGCGHWNRGLFHGAAGLLGLGRGPLSFASQLRAVYG 305

Query: 298 GAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVR-------NPRAPSFYYVGLSGLGV 350
             FSYCLV  G+  +  +VFG +     AA  P +        +  A +FYYV L G+ V
Sbjct: 306 HTFSYCLVDHGSDVASKVVFGEDDALALAAAHPQLNYTAFAPASSPADTFYYVKLKGVLV 365

Query: 351 GGMRIPISEDLF--RLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTG-NLPRASG 407
           GG  + IS D +     + G  G ++D+GT ++    PAY+  R AF+ + G + P    
Sbjct: 366 GGELLNISSDTWGVGEGEGGSGGTIIDSGTTLSYFVEPAYQVIRQAFIDRMGRSYPLIPD 425

Query: 408 VSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSP-SGL 466
             +   CYN+SG     VP +S  F+ G V   PA N+ I +D  G  C A   +P +G+
Sbjct: 426 FPVLSPCYNVSGVDRPEVPELSLLFADGAVWDFPAENYFIRLDPDGIMCLAVLGTPRTGM 485

Query: 467 SIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           SIIGN QQ+   + +D  N  +GF P  C
Sbjct: 486 SIIGNFQQQNFHVVYDLKNNRLGFAPRRC 514


>gi|168025534|ref|XP_001765289.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162683608|gb|EDQ70017.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 372

 Score =  223 bits (569), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 128/353 (36%), Positives = 196/353 (55%), Gaps = 13/353 (3%)

Query: 152 GSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSC 211
           G GE+ V I +G+PP+   ++ID+GSD+ W+Q +PC  C++Q+DP+FDP+ S++++ ++C
Sbjct: 21  GYGEFLVPIYLGTPPQKAVVIIDTGSDLTWIQSEPCRACFEQADPIFDPSKSSTYNKIAC 80

Query: 212 SSAVC-DRLENAGCH-AGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQ 269
           SS+ C D L    C  A  C Y   YGDGS T+G  + ET+T   T  + V  G    N 
Sbjct: 81  SSSACADLLGTQTCSAAANCIYAYGYGDGSVTRGYFSKETITATDTAGEEVKFGASVYNT 140

Query: 270 GMF--VGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLV---SRGTGSSGSLVFGREALPV 324
           G F   G  G+LGLG G +S+  QLG   G  FSYCLV   S G+ +S ++ FG  A+P 
Sbjct: 141 GTFGDTGGEGILGLGQGPVSMPSQLGSVLGNKFSYCLVDWLSAGSETS-TMYFGDAAVPS 199

Query: 325 GAA-WVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRL 383
           G   + P+V N   P++YY+ + G+ VGG  + I + ++ +   G  G ++D+GT +T L
Sbjct: 200 GEVQYTPIVPNADHPTYYYIAVQGISVGGSLLDIDQSVYEIDSGGSGGTIIDSGTTITYL 259

Query: 384 PTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPAS 443
               + A   A+ +Q    P  +  +  D C+N  G  S   P ++ +   G  L LP +
Sbjct: 260 QQEVFNALVAAYTSQV-RYPTTTSATGLDLCFNTRGTGSPVFPAMTIHLD-GVHLELPTA 317

Query: 444 NFLIPVDDAGTFCFAFAPSPS-GLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           N  I + +    C AFA +    ++I GNIQQ+   I +D  N  +GF P  C
Sbjct: 318 NTFISL-ETNIICLAFASALDFPIAIFGNIQQQNFDIVYDLDNMRIGFAPADC 369


>gi|255548664|ref|XP_002515388.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223545332|gb|EEF46837.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 494

 Score =  223 bits (569), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 153/441 (34%), Positives = 232/441 (52%), Gaps = 34/441 (7%)

Query: 69  SSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHAR--MQRDVKRVATLVRRL 126
           S+   S E +  L++VH+    S           R  H   A+  + +D  RV ++  +L
Sbjct: 73  STQVPSIENKAFLKVVHKHGPCSD---------LRQGHKAEAQYILLQDQSRVDSIHSKL 123

Query: 127 SGGGADAAKHEVQDFGTDVVSGMDQ---GSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQ 183
           S    D+   +V+      +   D    GSG YFV +G+G+P +   ++ D+GSD+ W Q
Sbjct: 124 S---KDSGLSDVKATAATTLPAKDGSIIGSGNYFVTVGLGTPKKDFSLIFDTGSDLTWTQ 180

Query: 184 CQPCSQ-CYKQSDPVFDPADSASFSGVSCSSAVCDRLENA-----GCHAGRCRYEVSYGD 237
           C+PC + CY Q + +F+P+ S S++ +SC S +CD L +A      C +  C Y + YGD
Sbjct: 181 CEPCVKSCYNQKEAIFNPSQSTSYANISCGSTLCDSLASATGNIFNCASSTCVYGIQYGD 240

Query: 238 GSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQT 296
            S++ G    E L++  T V  +   GCG  N+G+F GAAGLLGLG   +SLV Q   + 
Sbjct: 241 SSFSIGFFGKEKLSLTATDVFNDFYFGCGQNNKGLFGGAAGLLGLGRDKLSLVSQTAQRY 300

Query: 297 GGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIP 356
              FSYCL    + S+G L FG       A++ PL       SFY + L+G+ VGG ++ 
Sbjct: 301 NKIFSYCL-PSSSSSTGFLTFGGST-SKSASFTPLATISGGSSFYGLDLTGISVGGRKLA 358

Query: 357 ISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYN 416
           IS  +F        G ++D+GT +TRLP  AY A    F       P A  +SI DTC++
Sbjct: 359 ISPSVFSTA-----GTIIDSGTVITRLPPAAYSALSSTFRKLMSQYPAAPALSILDTCFD 413

Query: 417 LSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFA--PSPSGLSIIGNIQQ 474
            S   ++ VP +  +FSGG V+ +  +  +  V+D    C AFA     S ++I GN+QQ
Sbjct: 414 FSNHDTISVPKIGLFFSGGVVVDIDKTG-IFYVNDLTQVCLAFAGNSDASDVAIFGNVQQ 472

Query: 475 EGIQISFDGANGFVGFGPNVC 495
           + +++ +DGA G VGF P  C
Sbjct: 473 KTLEVVYDGAAGRVGFAPAGC 493


>gi|242092900|ref|XP_002436940.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
 gi|241915163|gb|EER88307.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
          Length = 465

 Score =  223 bits (569), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 155/444 (34%), Positives = 223/444 (50%), Gaps = 38/444 (8%)

Query: 68  SSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLS 127
           SS N     A  ++ LVHR    ++S  ++         SF   ++    R   +  R S
Sbjct: 44  SSVNLEPSSATLSVPLVHRYGPCAASQYSD-----MPTPSFSETLRHSRARTNYIKSRAS 98

Query: 128 GGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC 187
            G A           T +   +D  S EY V +G G+P   Q +++D+GSD+ WVQC PC
Sbjct: 99  TGMASTPDDAAVTVPTRLGGFVD--SLEYMVTLGFGTPSVPQVLLMDTGSDVSWVQCAPC 156

Query: 188 --SQCYKQSDPVFDPADSASFSGVSCSSAVCDRLEN---AGCHAG--RCRYEVSYGDGSY 240
             ++CY Q DP+FDP+ S++++ ++C +  C++L +    GC +G  +C Y V YGDGS 
Sbjct: 157 NSTECYPQKDPLFDPSKSSTYAPIACGADACNKLGDHYRNGCTSGGTQCGYRVEYGDGSS 216

Query: 241 TKGTLALETLTIGRTV-VKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGA 299
           T+G  + ET+T    + VK+   GCGH  +G      GLLGLGG   SLV Q     GGA
Sbjct: 217 TRGVYSNETITFAPGITVKDFHFGCGHDQRGPSDKFDGLLGLGGAPESLVVQTASVYGGA 276

Query: 300 FSYCLVSRGTGSSGSLVFGREALPVGA------AWVPLVRNPRAPSFYYVGLSGLGVGGM 353
           FSYCL +  +  +G L  G    P  A       + P+   P   + Y V ++G+ VGG 
Sbjct: 277 FSYCLPALNS-EAGFLALGVR--PSAATNTSAFVFTPMWHLPMDATSYMVNMTGISVGGK 333

Query: 354 RIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDT 413
            + I    FR       G+++D+GT VT LP  AY A   A        P  +    FDT
Sbjct: 334 PLDIPRSAFR------GGMLIDSGTIVTELPETAYNALNAALRKAFAAYPMVASED-FDT 386

Query: 414 CYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPS-PS-GLSIIGN 471
           CYN +G+ +V VP V+  FSGG  + L   N ++  D     C AF  S P  GL IIGN
Sbjct: 387 CYNFTGYSNVTVPRVALTFSGGATIDLDVPNGILVKD-----CLAFRESGPDVGLGIIGN 441

Query: 472 IQQEGIQISFDGANGFVGFGPNVC 495
           + Q  +++ +D  +G VGF    C
Sbjct: 442 VNQRTLEVLYDAGHGKVGFRAGAC 465


>gi|449454654|ref|XP_004145069.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
 gi|449470632|ref|XP_004153020.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
 gi|449499016|ref|XP_004160697.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 434

 Score =  223 bits (568), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 151/432 (34%), Positives = 224/432 (51%), Gaps = 42/432 (9%)

Query: 79  WNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEV 138
           + +EL+HRD   S    ++  H+ R  ++      R+     T+V       +D A+  +
Sbjct: 27  FTVELIHRDSPKSPMYNSSETHFDRIVNALRRSSHRN-----TVVLE-----SDTAEAPI 76

Query: 139 QDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVF 198
            + G           GEY V I VG+PP S   V D+GSD++W QC+PCS CY+Q+ P+F
Sbjct: 77  FNNG-----------GEYLVEISVGTPPFSIVAVADTGSDVIWTQCKPCSNCYQQNAPMF 125

Query: 199 DPADSASFSGVSCSSAVCDRL-ENAGC-HAGRCRYEVSYGDGSYTKGTLALETLTIGRTV 256
           DP+ S ++  V+CSS VC    + + C     C Y ++YGD S+++G LA++T+T+  T 
Sbjct: 126 DPSKSTTYKNVACSSPVCSYSGDGSSCSDDSECLYSIAYGDDSHSQGNLAVDTVTMQSTS 185

Query: 257 VKNVA-----IGCGHKNQGMF-VGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTG 310
            + VA     IGCGH N G F    +G++GLG G  SLV QLG  TGG FSYCL+  GTG
Sbjct: 186 GRPVAFPRTVIGCGHDNAGTFNANVSGIVGLGRGPASLVTQLGPATGGKFSYCLIPIGTG 245

Query: 311 S---SGSLVFGREALPVGAAWV--PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLT 365
           S   S  L FG  A   G+  V  P+  + +  +FY + L  + VG  +    E   +L 
Sbjct: 246 STNDSTKLNFGSNANVSGSGTVSTPIYSSAQYKTFYSLKLEAVSVGDTKFNFPEGASKLG 305

Query: 366 QMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIF-DTCYNLSGFVSVR 424
             G+  +++D+GT +T LP+    +F  A ++Q+ +LP A   S F D C+  +      
Sbjct: 306 --GESNIIIDSGTTLTYLPSALLNSFGSA-ISQSMSLPHAQDPSEFLDYCFATTT-DDYE 361

Query: 425 VPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSP-SGLSIIGNIQQEGIQISFDG 483
           +P V+ +F G  V  L   N  + + D  T C AF   P   + I GNI Q    + +D 
Sbjct: 362 MPPVTMHFEGADV-PLQRENLFVRLSD-DTICLAFGSFPDDNIFIYGNIAQSNFLVGYDI 419

Query: 484 ANGFVGFGPNVC 495
            N  V F P  C
Sbjct: 420 KNLAVSFQPAHC 431


>gi|357118064|ref|XP_003560779.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 472

 Score =  223 bits (567), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 154/446 (34%), Positives = 229/446 (51%), Gaps = 48/446 (10%)

Query: 75  DEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAA 134
           D  R ++ L HR    +   ++      + + SF  R++ D  R   ++R+ SG      
Sbjct: 50  DPTRASVPLAHRHGPCAPKGSSAT---DKKKPSFAERLRSDRARADHILRKASG------ 100

Query: 135 KHEVQDFGTDVVSGMDQG---SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC--SQ 189
           +  + + G   +     G   S EY V +G+G+P   Q ++ID+GSD+ WVQC+PC  S 
Sbjct: 101 RRMMSEGGGASIPTYLGGFVDSLEYVVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCNASD 160

Query: 190 CYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAG----------RCRYEVSYGDGS 239
           CY Q DP+FDP+ S++F+ + C+S  C +L   G   G          +C Y + YG+G+
Sbjct: 161 CYPQKDPLFDPSKSSTFATIPCASDACKQLPVDGYDNGCTNNTSGMPPQCGYAIEYGNGA 220

Query: 240 YTKGTLALETLTIGRT-VVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGG 298
            T+G  + ETL +G + VVK+   GCG    G +    GLLGLGG   SLV Q     GG
Sbjct: 221 ITEGVYSTETLALGSSAVVKSFRFGCGSDQHGPYDKFDGLLGLGGAPESLVSQTASVYGG 280

Query: 299 AFSYCLVSRGTGSSGSLVFGREALP----VGAAWVPL-VRNPRAPSFYYVGLSGLGVGGM 353
           AFSYCL    +G +G L  G          G  + P+   +P+  +FY V L+G+ VGG 
Sbjct: 281 AFSYCLPPLNSG-AGFLTLGAPNSTNNSNSGFVFTPMHAFSPKIATFYVVTLTGISVGGK 339

Query: 354 RIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAF---VAQTGNLPRASGVSI 410
            + I   +F        G ++D+GT +T +PT AY+A R AF   +A+   LP A   S 
Sbjct: 340 ALDIPPAVFA------KGNIVDSGTVITGIPTTAYKALRTAFRSAMAEYPLLPPAD--SA 391

Query: 411 FDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSG-LSII 469
            DTCYN +G  +V VP V+  F GG  + L   + ++ V+D    C AFA +  G   II
Sbjct: 392 LDTCYNFTGHGTVTVPKVALTFVGGATVDLDVPSGVL-VED----CLAFADAGDGSFGII 446

Query: 470 GNIQQEGIQISFDGANGFVGFGPNVC 495
           GN+    I++ +D   G +GF    C
Sbjct: 447 GNVNTRTIEVLYDSGKGHLGFRAGAC 472


>gi|356499344|ref|XP_003518501.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 561

 Score =  223 bits (567), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 136/372 (36%), Positives = 194/372 (52%), Gaps = 24/372 (6%)

Query: 147 SGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASF 206
           SG+  GSGEYF+ + VG+PP+   +++D+GSD+ W+QC PC  C++QS P +DP DS+SF
Sbjct: 188 SGVSLGSGEYFMDVFVGTPPKHFSLILDTGSDLNWIQCVPCIACFEQSGPYYDPKDSSSF 247

Query: 207 SGVSCSSAVC------DRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTV---- 256
             +SC    C      D  +        C Y   YGDGS T G  ALET T+  T     
Sbjct: 248 RNISCHDPRCQLVSAPDPPKPCKAENQSCPYFYWYGDGSNTTGDFALETFTVNLTTPNGT 307

Query: 257 -----VKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGS 311
                V+NV  GCGH N+G+F GAAGLLGLG G +S   Q+    G +FSYCLV R + +
Sbjct: 308 SELKHVENVMFGCGHWNRGLFHGAAGLLGLGKGPLSFASQMQSLYGQSFSYCLVDRNSNA 367

Query: 312 SGS--LVFGREALPVGAAWVPLV-----RNPRAPSFYYVGLSGLGVGGMRIPISEDLFRL 364
           S S  L+FG +   +    +        ++    +FYYV +  + V    + I E+ + L
Sbjct: 368 SVSSKLIFGEDKELLSHPNLNFTSFGGGKDGSVDTFYYVQIKSVMVDDEVLKIPEETWHL 427

Query: 365 TQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVR 424
           +  G  G ++D+GT +T    PAYE  ++AFV +        G+     CYN+SG   + 
Sbjct: 428 SSEGAGGTIIDSGTTLTYFAEPAYEIIKEAFVRKIKGYQLVEGLPPLKPCYNVSGIEKME 487

Query: 425 VPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSP-SGLSIIGNIQQEGIQISFDG 483
           +P     F+   V   P  N+ I + D    C A   +P S LSIIGN QQ+   I +D 
Sbjct: 488 LPDFGILFADEAVWNFPVENYFIWI-DPEVVCLAILGNPRSALSIIGNYQQQNFHILYDM 546

Query: 484 ANGFVGFGPNVC 495
               +G+ P  C
Sbjct: 547 KKSRLGYAPMKC 558


>gi|357492401|ref|XP_003616489.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355517824|gb|AES99447.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 434

 Score =  223 bits (567), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 146/433 (33%), Positives = 223/433 (51%), Gaps = 45/433 (10%)

Query: 79  WNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEV 138
           +++EL+HRD   S          +++QH F    +R + R               A H  
Sbjct: 28  FSVELIHRDSPKSPYYKPTE---NKYQH-FVDAARRSINR---------------ANHFF 68

Query: 139 QDFGTDVV-SGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPV 197
           +D  T    S +    G Y +   VG+PP   Y + D+GSDIVW+QC+PC QCY Q+ P+
Sbjct: 69  KDSDTSTPESTVIPDRGGYLMTYSVGTPPTKIYGIADTGSDIVWLQCEPCEQCYNQTTPI 128

Query: 198 FDPADSASFSGVSCSSAVCDRLENAGC-HAGRCRYEVSYGDGSYTKGTLALETLTIGRTV 256
           F+P+ S+S+  + C S +C  + +  C     C+Y++SYGD S+++G L+++TL++  T 
Sbjct: 129 FNPSKSSSYKNIPCLSKLCHSVRDTSCSDQNSCQYKISYGDSSHSQGDLSVDTLSLESTS 188

Query: 257 VKNVA-----IGCGHKNQGMFVGA-AGLLGLGGGSMSLVGQLGGQTGGAFSYCLV---SR 307
              V+     IGCG  N G F GA +G++GLGGG +SL+ QLG   GG FSYCLV   ++
Sbjct: 189 GSPVSFPKTVIGCGTDNAGTFGGASSGIVGLGGGPVSLITQLGSSIGGKFSYCLVPLLNK 248

Query: 308 GTGSSGSLVFGREALPVGAAWV--PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLT 365
            + +S  L FG  A+  G   V  PL++  + P FY++ L    VG  R+         +
Sbjct: 249 ESNASSILSFGDAAVVSGDGVVSTPLIK--KDPVFYFLTLQAFSVGNKRVEFGGS----S 302

Query: 366 QMGDD--GVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVS-IFDTCYNLSGFVS 422
           + GDD   +++D+GT +T +P+  Y     A V     L R    +  F  CY+L     
Sbjct: 303 EGGDDEGNIIIDSGTTLTLIPSDVYTNLESA-VVDLVKLDRVDDPNQQFSLCYSLKS-NE 360

Query: 423 VRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFD 482
              P ++ +F G  +     S F +P+ D G  CFAF PSP   SI GN+ Q+ + + +D
Sbjct: 361 YDFPIITAHFKGADIELHSISTF-VPITD-GIVCFAFQPSPQLGSIFGNLAQQNLLVGYD 418

Query: 483 GANGFVGFGPNVC 495
                V F P  C
Sbjct: 419 LQQKTVSFKPTDC 431


>gi|297805038|ref|XP_002870403.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297316239|gb|EFH46662.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 440

 Score =  222 bits (565), Expect = 4e-55,   Method: Compositional matrix adjust.
 Identities = 149/432 (34%), Positives = 221/432 (51%), Gaps = 40/432 (9%)

Query: 79  WNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEV 138
           +  +L+HRD   S           R +++ H    R V RV            D ++ + 
Sbjct: 31  FTADLIHRDSPKSPFYNPTETSSQRLRNAIH----RSVSRVFHFT--------DISQKDA 78

Query: 139 QDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVF 198
            D    +   +   SGEY + I +G+PP     + D+GSD++W QC+PC  CY Q DP+F
Sbjct: 79  SDNAPQI--DLTSNSGEYLMNISLGTPPFPIMAIADTGSDLLWTQCKPCDDCYTQVDPLF 136

Query: 199 DPADSASFSGVSCSSAVCDRLEN-AGC--HAGRCRYEVSYGDGSYTKGTLALETLTIGRT 255
           DP  S+++  VSCSS+ C  LEN A C      C Y  SYGD SYTKG +A++TLT+G T
Sbjct: 137 DPKASSTYKDVSCSSSQCTALENQASCSTEDNTCSYSTSYGDRSYTKGNIAVDTLTLGST 196

Query: 256 -----VVKNVAIGCGHKNQGMF-VGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLV--SR 307
                 +KN+ IGCGH N G F    +G++GLGGG++SL+ QLG    G FSYCLV  + 
Sbjct: 197 DTRPVQLKNIIIGCGHNNAGTFNKKGSGIVGLGGGAVSLITQLGDSIDGKFSYCLVPLTS 256

Query: 308 GTGSSGSLVFGREALPVGAAWV--PLVRNPRAPSFYYVGLSGLGVGGMRI--PISEDLFR 363
               +  + FG  A+  G   V  PL+   +  +FYY+ L  + VG   +  P S+    
Sbjct: 257 ENDRTSKINFGTNAVVSGTGVVSTPLIAKSQE-TFYYLTLKSISVGSKEVQYPGSD---- 311

Query: 364 LTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSV 423
            +  G+  +++D+GT +T LPT  Y    DA  +      +    +    CY+ +G   +
Sbjct: 312 -SGSGEGNIIIDSGTTLTLLPTEFYSELEDAVASSIDAEKKQDPQTGLSLCYSATG--DL 368

Query: 424 RVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDG 483
           +VP ++ +F G  V   P++ F+   +D    CFAF  SPS  SI GN+ Q    + +D 
Sbjct: 369 KVPAITMHFDGADVNLKPSNCFVQISEDL--VCFAFRGSPS-FSIYGNVAQMNFLVGYDT 425

Query: 484 ANGFVGFGPNVC 495
            +  V F P  C
Sbjct: 426 VSKTVSFKPTDC 437


>gi|15242803|ref|NP_198319.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|75130158|sp|Q6XBF8.1|CDR1_ARATH RecName: Full=Aspartic proteinase CDR1; AltName: Full=Protein
           CONSTITUTIVE DISEASE RESISTANCE 1; Flags: Precursor
 gi|37935737|gb|AAP72988.1| CDR1 [Arabidopsis thaliana]
 gi|91806924|gb|ABE66189.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|109946613|gb|ABG48485.1| At5g33340 [Arabidopsis thaliana]
 gi|332006513|gb|AED93896.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 437

 Score =  221 bits (563), Expect = 7e-55,   Method: Compositional matrix adjust.
 Identities = 147/430 (34%), Positives = 218/430 (50%), Gaps = 39/430 (9%)

Query: 79  WNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEV 138
           +  +L+HRD   S           R +++ H    R V RV     +      D      
Sbjct: 31  FTADLIHRDSPKSPFYNPMETSSQRLRNAIH----RSVNRVFHFTEK------DNTPQPQ 80

Query: 139 QDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVF 198
            D  ++        SGEY + + +G+PP     + D+GSD++W QC PC  CY Q DP+F
Sbjct: 81  IDLTSN--------SGEYLMNVSIGTPPFPIMAIADTGSDLLWTQCAPCDDCYTQVDPLF 132

Query: 199 DPADSASFSGVSCSSAVCDRLEN-AGC--HAGRCRYEVSYGDGSYTKGTLALETLTIGRT 255
           DP  S+++  VSCSS+ C  LEN A C  +   C Y +SYGD SYTKG +A++TLT+G +
Sbjct: 133 DPKTSSTYKDVSCSSSQCTALENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDTLTLGSS 192

Query: 256 -----VVKNVAIGCGHKNQGMF-VGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLV--SR 307
                 +KN+ IGCGH N G F    +G++GLGGG +SL+ QLG    G FSYCLV  + 
Sbjct: 193 DTRPMQLKNIIIGCGHNNAGTFNKKGSGIVGLGGGPVSLIKQLGDSIDGKFSYCLVPLTS 252

Query: 308 GTGSSGSLVFGREALPVGAAWV--PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLT 365
               +  + FG  A+  G+  V  PL+      +FYY+ L  + VG  +I   +     +
Sbjct: 253 KKDQTSKINFGTNAIVSGSGVVSTPLIAKASQETFYYLTLKSISVGSKQI---QYSGSDS 309

Query: 366 QMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRV 425
           +  +  +++D+GT +T LPT  Y    DA  +      +    S    CY+ +G   ++V
Sbjct: 310 ESSEGNIIIDSGTTLTLLPTEFYSELEDAVASSIDAEKKQDPQSGLSLCYSATG--DLKV 367

Query: 426 PTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGAN 485
           P ++ +F G  V  L +SN  + V +    CFAF  SPS  SI GN+ Q    + +D  +
Sbjct: 368 PVITMHFDGADV-KLDSSNAFVQVSE-DLVCFAFRGSPS-FSIYGNVAQMNFLVGYDTVS 424

Query: 486 GFVGFGPNVC 495
             V F P  C
Sbjct: 425 KTVSFKPTDC 434


>gi|116831531|gb|ABK28718.1| unknown [Arabidopsis thaliana]
          Length = 438

 Score =  221 bits (563), Expect = 7e-55,   Method: Compositional matrix adjust.
 Identities = 147/430 (34%), Positives = 218/430 (50%), Gaps = 39/430 (9%)

Query: 79  WNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEV 138
           +  +L+HRD   S           R +++ H    R V RV     +      D      
Sbjct: 31  FTADLIHRDSPKSPFYNPMETSSQRLRNAIH----RSVNRVFHFTEK------DNTPQPQ 80

Query: 139 QDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVF 198
            D  ++        SGEY + + +G+PP     + D+GSD++W QC PC  CY Q DP+F
Sbjct: 81  IDLTSN--------SGEYLMNVSIGTPPFPIMAIADTGSDLLWTQCAPCDDCYTQVDPLF 132

Query: 199 DPADSASFSGVSCSSAVCDRLEN-AGC--HAGRCRYEVSYGDGSYTKGTLALETLTIGRT 255
           DP  S+++  VSCSS+ C  LEN A C  +   C Y +SYGD SYTKG +A++TLT+G +
Sbjct: 133 DPKTSSTYKDVSCSSSQCTALENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDTLTLGSS 192

Query: 256 -----VVKNVAIGCGHKNQGMF-VGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLV--SR 307
                 +KN+ IGCGH N G F    +G++GLGGG +SL+ QLG    G FSYCLV  + 
Sbjct: 193 DTRPMQLKNIIIGCGHNNAGTFNKKGSGIVGLGGGPVSLIKQLGDSIDGKFSYCLVPLTS 252

Query: 308 GTGSSGSLVFGREALPVGAAWV--PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLT 365
               +  + FG  A+  G+  V  PL+      +FYY+ L  + VG  +I   +     +
Sbjct: 253 KKDQTSKINFGTNAIVSGSGVVSTPLIAKASQETFYYLTLKSISVGSKQI---QYSGSDS 309

Query: 366 QMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRV 425
           +  +  +++D+GT +T LPT  Y    DA  +      +    S    CY+ +G   ++V
Sbjct: 310 ESSEGNIIIDSGTTLTLLPTEFYSELEDAVASSIDAEKKQDPQSGLSLCYSATG--DLKV 367

Query: 426 PTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGAN 485
           P ++ +F G  V  L +SN  + V +    CFAF  SPS  SI GN+ Q    + +D  +
Sbjct: 368 PVITMHFDGADV-KLDSSNAFVQVSE-DLVCFAFRGSPS-FSIYGNVAQMNFLVGYDTVS 424

Query: 486 GFVGFGPNVC 495
             V F P  C
Sbjct: 425 KTVSFKPTDC 434


>gi|224142001|ref|XP_002324349.1| predicted protein [Populus trichocarpa]
 gi|222865783|gb|EEF02914.1| predicted protein [Populus trichocarpa]
          Length = 490

 Score =  221 bits (562), Expect = 8e-55,   Method: Compositional matrix adjust.
 Identities = 156/438 (35%), Positives = 233/438 (53%), Gaps = 31/438 (7%)

Query: 73  SSDEARWNLELVHR----DKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSG 128
           S+++ + +L++VH+     K+S    +    H           + +D  RV ++  RLS 
Sbjct: 68  SNNDNKASLKVVHKHGPCSKLSQDEASAAPTHTEI--------LLQDQSRVKSIHSRLSN 119

Query: 129 GGADAAKH-EVQDFGT-DVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQP 186
                 K  +V D  T     G   GSG Y V +G+G+P +   ++ D+GSDI W QCQP
Sbjct: 120 SKTSGGKDVKVTDSTTIPAKDGSTVGSGNYIVTVGLGTPKKDLSLIFDTGSDITWTQCQP 179

Query: 187 CSQ-CYKQSDPVFDPADSASFSGVSCSSAVCDRLENA-----GCHAGRCRYEVSYGDGSY 240
           C++ CYKQ + +FDP+ S S++ +SCSS++C+ L +A     GC +  C Y + YGD S+
Sbjct: 180 CARSCYKQKEQIFDPSQSTSYTNISCSSSICNSLTSATGNTPGCASSACVYGIQYGDSSF 239

Query: 241 TKGTLALETLTIGRT-VVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGA 299
           + G    E LT+  T    N+  GCG  NQG+F G+AGLLGLG   +S+V Q   +    
Sbjct: 240 SVGFFGTEKLTLTSTDAFNNIYFGCGQNNQGLFGGSAGLLGLGRDKLSVVSQTAQKYNKI 299

Query: 300 FSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISE 359
           FSYCL    + S+G L FG  A    A + PL      PSFY +  +G+ VGG ++ IS 
Sbjct: 300 FSYCL-PSSSSSTGFLTFGGSA-SKNAKFTPLSTISAGPSFYGLDFTGISVGGKKLAISA 357

Query: 360 DLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSG 419
            +F        G ++D+GT +TRLP  AY A R +F       P    +SI DTCY+ S 
Sbjct: 358 SVFSTA-----GAIIDSGTVITRLPPAAYSALRASFRNLMSKYPMTKALSILDTCYDFSS 412

Query: 420 FVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFA--PSPSGLSIIGNIQQEGI 477
           + ++ VP + F FS G  + + A+  L         C AFA     + + I GN+QQ+ +
Sbjct: 413 YTTISVPKIGFSFSSGIEVDIDATGILY-ASSLSQVCLAFAGNSDATDVFIFGNVQQKTL 471

Query: 478 QISFDGANGFVGFGPNVC 495
           ++ +DG+ G VGF P  C
Sbjct: 472 EVFYDGSAGKVGFAPGGC 489


>gi|359473000|ref|XP_002278677.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 458

 Score =  220 bits (560), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 135/390 (34%), Positives = 199/390 (51%), Gaps = 27/390 (6%)

Query: 133 AAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYK 192
           +A H  Q   + VVSG   GSG+YFV + +G+PP+   +V D+GSD+VWV+C  C  C +
Sbjct: 66  SALHTPQSLKSPVVSGASTGSGQYFVDLRLGTPPQKLLLVADTGSDLVWVKCSACRNCTR 125

Query: 193 QSD-PVFDPADSASFSGVSCSSAVCDRL---ENAGCHAGR----CRYEVSYGDGSYTKGT 244
            +    F    S +FS   C  + C  +   ++  C+  R    CRYE SYGDGS T G 
Sbjct: 126 HTPGSAFLARHSTTFSPNHCYDSACQLVPLPKHHRCNHARLHSPCRYEYSYGDGSKTSGF 185

Query: 245 LALETLTI----GRTV-VKNVAIGCGHKNQG------MFVGAAGLLGLGGGSMSLVGQLG 293
            + ET T+    GR   +K +A GC  +  G       F GA G++GLG G +SL  QLG
Sbjct: 186 FSKETTTLNTSSGREAKLKGIAFGCAFRISGPSVSGASFNGAHGVMGLGRGPISLSSQLG 245

Query: 294 GQTGGAFSYCLVSRGTGSSGS--LVFGREALPVGAA-----WVPLVRNPRAPSFYYVGLS 346
            + G  FSYCL+      S +  L+ G     V        + PL  NP +P+FYY+G+ 
Sbjct: 246 HRFGNKFSYCLMDHDISPSPTSYLLIGSTQNDVAPGKRRMRFTPLHINPLSPTFYYIGIE 305

Query: 347 GLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRAS 406
            + V G+++PI+  ++ L ++G+ G ++D+GT +T LP PAY         +      A 
Sbjct: 306 SVSVDGIKLPINPSVWALDELGNGGTIVDSGTTLTFLPEPAYLQILTVIKRRVRLPSPAE 365

Query: 407 GVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVD-DAGTFCFAFAPSPSG 465
               FD C N+S     R+P +SF   G  V + P  N+ +  D D          +PSG
Sbjct: 366 PTPGFDLCVNVSEIEHPRLPKLSFKLGGDSVFSPPPRNYFVDTDEDVKCLALQAVMTPSG 425

Query: 466 LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
            S+IGN+ Q+G  + FD     +GF  + C
Sbjct: 426 FSVIGNLMQQGFLLEFDKDRTRLGFSRHGC 455


>gi|224130878|ref|XP_002320947.1| predicted protein [Populus trichocarpa]
 gi|222861720|gb|EEE99262.1| predicted protein [Populus trichocarpa]
          Length = 440

 Score =  220 bits (560), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 133/427 (31%), Positives = 212/427 (49%), Gaps = 31/427 (7%)

Query: 79  WNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEV 138
           + ++L+HRD   S         ++  + +   R+   ++R  + V       A +   + 
Sbjct: 32  FTVDLIHRDSPLSP--------FYNSEETDLQRINNALRRSISRVHHFDPIAAASVSPKA 83

Query: 139 QDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVF 198
            +  +DV S      GEY + + +G+PP     + D+GSD++W QC+PC +CYKQ DP+F
Sbjct: 84  AE--SDVTSNR----GEYLMSLSLGTPPFKIMGIADTGSDLIWTQCKPCERCYKQVDPLF 137

Query: 199 DPADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVK 258
           DP  S ++   SC +  C  L+ + C    C+Y+ SYGD SYT G +A +T+T+  T   
Sbjct: 138 DPKSSKTYRDFSCDARQCSLLDQSTCSGNICQYQYSYGDRSYTMGNVASDTITLDSTTGS 197

Query: 259 NVA-----IGCGHKNQGMFVGA-AGLLGLGGGSMSLVGQLGGQTGGAFSYCLV--SRGTG 310
            V+     IGCGH+N G F    +G++GLG G +SL+ Q+G   GG FSYCLV  S   G
Sbjct: 198 PVSFPKTVIGCGHENDGTFSDKGSGIVGLGAGPLSLISQMGSSVGGKFSYCLVPLSSRAG 257

Query: 311 SSGSLVFGREALPVGAAW--VPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMG 368
           +S  L FG  A+  G      PL+ +    SFY++ L  + VG  RI   +        G
Sbjct: 258 NSSKLNFGSNAVVSGPGVQSTPLLSSETMSSFYFLTLEAMSVGNERIKFGDSSL---GTG 314

Query: 369 DDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTV 428
           +  +++D+GT +T +P   +     A   Q               CY+ +    ++VP +
Sbjct: 315 EGNIIIDSGTTLTIVPDDFFSNLSTAVGNQVEGRRAEDPSGFLSVCYSATS--DLKVPAI 372

Query: 429 SFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFV 488
           + +F+G  V   P + F+   DD    C AFA + SG+SI GN+ Q    + ++     +
Sbjct: 373 TAHFTGADVKLKPINTFVQVSDDV--VCLAFASTTSGISIYGNVAQMNFLVEYNIQGKSL 430

Query: 489 GFGPNVC 495
            F P  C
Sbjct: 431 SFKPTDC 437


>gi|20197342|gb|AAC34482.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
          Length = 353

 Score =  220 bits (560), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 124/355 (34%), Positives = 192/355 (54%), Gaps = 22/355 (6%)

Query: 158 VRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCD 217
           + + +G+P      ++D+GSD++W QC+PC++C+ Q  P+FDP  S+S+S V CSS +C+
Sbjct: 1   MELSIGNPAVKYSAIVDTGSDLIWTQCKPCTECFDQPTPIFDPEKSSSYSKVGCSSGLCN 60

Query: 218 RLENAGCHAGR--CRYEVSYGDGSYTKGTLALETLTI-GRTVVKNVAIGCGHKNQGM-FV 273
            L  + C+  +  C Y  +YGD S T+G LA ET T      +  +  GCG +N+G  F 
Sbjct: 61  ALPRSNCNEDKDACEYLYTYGDYSSTRGLLATETFTFEDENSISGIGFGCGVENEGDGFS 120

Query: 274 GAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALPV----GAAW 328
             +GL+GLG G +SL+ QL       FSYCL S   + +S SL  G  A  +    GA+ 
Sbjct: 121 QGSGLVGLGRGPLSLISQLKET---KFSYCLTSIEDSEASSSLFIGSLASGIVNKTGASL 177

Query: 329 -------VPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVT 381
                  + L+RNP  PSFYY+ L G+ VG  R+ + +  F L + G  G+++D+GT +T
Sbjct: 178 DGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFELAEDGTGGMIIDSGTTIT 237

Query: 382 RLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNL-SGFVSVRVPTVSFYFSGGPVLTL 440
            L   A++  ++ F ++       SG +  D C+ L     ++ VP + F+F G   L L
Sbjct: 238 YLEETAFKVLKEEFTSRMSLPVDDSGSTGLDLCFKLPDAAKNIAVPKMIFHFKGAD-LEL 296

Query: 441 PASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           P  N+++     G  C A   S +G+SI GN+QQ+   +  D     V F P  C
Sbjct: 297 PGENYMVADSSTGVLCLAMG-SSNGMSIFGNVQQQNFNVLHDLEKETVSFVPTEC 350


>gi|226503109|ref|NP_001147206.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
 gi|195608496|gb|ACG26078.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
 gi|413921850|gb|AFW61782.1| aspartic proteinase nepenthesin-2 [Zea mays]
          Length = 441

 Score =  220 bits (560), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 136/412 (33%), Positives = 201/412 (48%), Gaps = 26/412 (6%)

Query: 102 HRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIG 161
           H    + + ++Q   + +A    R++   + A    V D  T     +   SGEY V + 
Sbjct: 35  HVDAGTSYTKLQLLSRAIARSKARVAALQSAAVLPPVVDPITAARVLVTASSGEYLVDLA 94

Query: 162 VGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLEN 221
           +G+PP     ++D+GSD++W QC PC  C  Q  P FD   SA++  + C S+ C  L +
Sbjct: 95  IGTPPLYYTAIMDTGSDLIWTQCAPCLLCADQPTPYFDVKKSATYRALPCRSSRCASLSS 154

Query: 222 AGCHAGRCRYEVSYGDGSYTKGTLALETLTIG-----RTVVKNVAIGCGHKNQGMFVGAA 276
             C    C Y+  YGD + T G LA ET T G     +    N+A GCG  N G    ++
Sbjct: 155 PSCFKKMCVYQYYYGDTASTAGVLANETFTFGAANSTKVRATNIAFGCGSLNAGDLANSS 214

Query: 277 GLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREA----------LPVGA 326
           G++G G G +SLV QLG      FSYCL S  + +   L FG  A           PV +
Sbjct: 215 GMVGFGRGPLSLVSQLGPSR---FSYCLTSYLSATPSRLYFGVYANLSSTNTSSGSPVQS 271

Query: 327 AWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTP 386
              P V NP  P+ Y++ L  + +G   +PI   +F +   G  GV++D+GT++T L   
Sbjct: 272 --TPFVINPALPNMYFLSLKAISLGTKLLPIDPLVFAINDDGTGGVIIDSGTSITWLQQD 329

Query: 387 AYEAFRDAFVAQTGNLPRASGVSI-FDTCYNLSGF--VSVRVPTVSFYFSGGPVLTLPAS 443
           AYEA R   V+    LP  +   I  DTC+       V+V VP + F+F    +  LP  
Sbjct: 330 AYEAVRRGLVSAIP-LPAMNDTDIGLDTCFQWPPPPNVTVTVPDLVFHFDSANMTLLP-E 387

Query: 444 NFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           N+++     G  C   AP+  G +IIGN QQ+ + + +D  N F+ F P  C
Sbjct: 388 NYMLIASTTGYLCLVMAPTGVG-TIIGNYQQQNLHLLYDIGNSFLSFVPAPC 438


>gi|297720195|ref|NP_001172459.1| Os01g0608366 [Oryza sativa Japonica Group]
 gi|53792202|dbj|BAD52835.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
           Group]
 gi|255673454|dbj|BAH91189.1| Os01g0608366 [Oryza sativa Japonica Group]
          Length = 452

 Score =  219 bits (559), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 149/430 (34%), Positives = 220/430 (51%), Gaps = 37/430 (8%)

Query: 73  SSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGAD 132
           SS +   ++ L HR    S ++  +       +      ++RD  R   + R+ SG    
Sbjct: 27  SSSDGTSSVTLSHRYGPCSPADPNSGEKRPTDEE----LLRRDQLRADYIRRKFSGSNGT 82

Query: 133 AAKHEVQDFGTDVVS--GMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC--- 187
           AA  + Q     V +  G    + EY + +G+GSP  +Q +VID+GSD+ WVQC+PC   
Sbjct: 83  AAGEDGQSSKVSVPTTLGSSLDTLEYVISVGLGSPAVTQRVVIDTGSDVSWVQCEPCPAP 142

Query: 188 SQCYKQSDPVFDPADSASFSGVSCSSAVCDRL----ENAGCHA-GRCRYEVSYGDGSYTK 242
           S C+  +  +FDPA S++++  +CS+A C +L    E  GC A  RC+Y V YGDGS T 
Sbjct: 143 SPCHAHAGALFDPAASSTYAAFNCSAAACAQLGDSGEANGCDAKSRCQYIVKYGDGSNTT 202

Query: 243 GTLALETLTI-GRTVVKNVAIGCGHKN--QGMFVGAAGLLGLGGGSMSLVGQLGGQTGGA 299
           GT + + LT+ G  VV+    GC H     GM     GL+GLGG + S V Q   + G +
Sbjct: 203 GTYSSDVLTLSGSDVVRGFQFGCSHAELGAGMDDKTDGLIGLGGDAQSPVSQTAARYGKS 262

Query: 300 FSYCLVSRGTGSSGSLVFGREALPVGA-----AWVPLVRNPRAPSFYYVGLSGLGVGGMR 354
           F YCL +    SSG L  G  A   G      A  P++R+ + P++Y+  L  + VGG +
Sbjct: 263 FFYCLPAT-PASSGFLTLGAPASGGGGGASRFATTPMLRSKKVPTYYFAALEDIAVGGKK 321

Query: 355 IPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTC 414
           + +S  +F        G ++D+GT +TRLP  AY A   AF A      RA  + I DTC
Sbjct: 322 LGLSPSVFAA------GSLVDSGTVITRLPPAAYAALSSAFRAGMTRYARAEPLGILDTC 375

Query: 415 YNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPS--PSGLSIIGNI 472
           +N +G   V +PTV+  F+GG V+ L A   +         C AFAP+        IGN+
Sbjct: 376 FNFTGLDKVSIPTVALVFAGGAVVDLDAHGIV------SGGCLAFAPTRDDKAFGTIGNV 429

Query: 473 QQEGIQISFD 482
           QQ   ++ +D
Sbjct: 430 QQRTFEVLYD 439


>gi|242092878|ref|XP_002436929.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
 gi|241915152|gb|EER88296.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
          Length = 505

 Score =  219 bits (559), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 138/353 (39%), Positives = 188/353 (53%), Gaps = 19/353 (5%)

Query: 155 EYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS-QCYKQSDPVFDPADSASFSGVSCSS 213
           E+ V +G GSP ++  + ID+GSD+ W+QC PCS  CYKQ DPVFDP  SA++S V C  
Sbjct: 160 EFVVTVGFGSPAQNYTLSIDTGSDVSWIQCLPCSGHCYKQHDPVFDPTKSATYSAVPCGH 219

Query: 214 AVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTV-VKNVAIGCGHKNQGMF 272
             C        ++G C Y+V+YGDGS T G L+ ETL++  T  +   A GCG  N G F
Sbjct: 220 PQCAAAGGKCSNSGTCLYKVTYGDGSSTAGVLSHETLSLSSTRDLPGFAFGCGQTNLGEF 279

Query: 273 VGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGA------ 326
            G  GL+GLG G++SL  Q     G  FSYCL S  T + G L  G    P  +      
Sbjct: 280 GGVDGLVGLGRGALSLPSQAAATFGATFSYCLPSYDT-THGYLTMG-STTPAASNDDDDV 337

Query: 327 AWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTP 386
            +  +++    PS Y+V +  + +GG  +P+   +F       DG + D+GT +T LP  
Sbjct: 338 QYTAMIQKEDYPSLYFVEVVSIDIGGYILPVPPTVFT-----RDGTLFDSGTILTYLPPE 392

Query: 387 AYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFL 446
           AY + RD F         A     FDTCY+ +G  ++ +P V+F FS G V  L     L
Sbjct: 393 AYASLRDRFKFTMTQYKPAPAYDPFDTCYDFTGHNAIFMPAVAFKFSDGAVFDLSPVAIL 452

Query: 447 IPVDDA--GTFCFAFAPSPSGL--SIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           I  DD    T C AF P PS +  +IIGN QQ G ++ +D A   +GFG   C
Sbjct: 453 IYPDDTAPATGCLAFVPRPSTMPFNIIGNTQQRGTEVIYDVAAEKIGFGQFTC 505


>gi|357124468|ref|XP_003563922.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 450

 Score =  219 bits (558), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 141/401 (35%), Positives = 209/401 (52%), Gaps = 28/401 (6%)

Query: 108 FHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPR 167
           F A +  D  R+A L  RL    A   K  V      + SG   G G Y  R+G+G+P  
Sbjct: 64  FSAFITHDAARIAGLASRL----ATKDKDWVAASSVPLASGASVGVGNYITRLGLGTPTT 119

Query: 168 SQYMVIDSGSDIVWVQCQPCS-QCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCH- 225
           +  MV+DSGS + W+QC PC+  C+ Q+ P++DP  S++++ V CS+  C  L+ A  + 
Sbjct: 120 TYVMVVDSGSSLTWLQCAPCAVSCHPQAGPLYDPRASSTYAAVPCSAPQCAELQAATLNP 179

Query: 226 -----AGRCRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQGMFVGAAGLL 279
                +G C+Y+ SYGDGS++ G L+ +T+++  +        GCG  N G+F  AAGL+
Sbjct: 180 SSCSGSGVCQYQASYGDGSFSFGYLSKDTVSLSSSGSFPGFYYGCGQDNVGLFGRAAGLI 239

Query: 280 GLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREA---LPVGAAWVPLVRNPR 336
           GL    +SL+ QL    G +F+YCL +    S+G L FG  +    P   ++  +V +  
Sbjct: 240 GLARNKLSLLSQLAPSVGNSFAYCLPTSAAASAGYLSFGSNSDNKNPGKYSYTSMVSSSL 299

Query: 337 APSFYYVGLSGLGVGG--MRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDA 394
             S Y+V L+G+ V G  + +P SE        G    ++D+GT +TRLPTP Y A   A
Sbjct: 300 DASLYFVSLAGMSVAGSPLAVPSSE-------YGSLPTIIDSGTVITRLPTPVYTALSKA 352

Query: 395 FVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGT 454
            V      P A   SI  TC+       + VP V+  F+GG  L L   N L+ V++  T
Sbjct: 353 -VGAALAAPSAPAYSILQTCFK-GQVAKLPVPAVNMAFAGGATLRLTPGNVLVDVNETTT 410

Query: 455 FCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
            C AFAP+ S  +IIGN QQ+   + +D     +GF    C
Sbjct: 411 -CLAFAPTDS-TAIIGNTQQQTFSVVYDVKGSRIGFAAGGC 449


>gi|242044886|ref|XP_002460314.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
 gi|241923691|gb|EER96835.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
          Length = 444

 Score =  219 bits (558), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 128/357 (35%), Positives = 186/357 (52%), Gaps = 20/357 (5%)

Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSS 213
           GEY + +G+G+P R    ++D+GSD++W QC PC  C  Q  P FDPA+S+++  + CS+
Sbjct: 90  GEYLMEMGIGTPARFYSAILDTGSDLIWTQCAPCLLCVDQPTPYFDPANSSTYRSLGCSA 149

Query: 214 AVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIG----RTVVKNVAIGCGHKNQ 269
             C+ L    C+   C Y+  YGD + T G LA ET T G    R  +  ++ GCG+ N 
Sbjct: 150 PACNALYYPLCYQKTCVYQYFYGDSASTAGVLANETFTFGTNDTRVTLPRISFGCGNLNA 209

Query: 270 GMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREAL--PVGAA 327
           G     +G++G G GS+SLV QLG      FSYCL S  +     L FG  A      A+
Sbjct: 210 GSLANGSGMVGFGRGSLSLVSQLGSP---RFSYCLTSFLSPVRSRLYFGAYATLNSTNAS 266

Query: 328 WV---PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQM-GDDGVVMDTGTAVTRL 383
            V   P + NP  P+ Y++ ++G+ VGG R+PI   +  +    G  G ++D+GT +T L
Sbjct: 267 TVQSTPFIINPALPTMYFLNMTGISVGGNRLPIDPAVLAINDTDGTGGTIIDSGTTITYL 326

Query: 384 PTPAYEAFRDAFVA---QTGNLPRASGVSIFDTCYNLSGFV--SVRVPTVSFYFSGGPVL 438
             PAY A R+AFV     T  L   +  S+ DTC+        SV +P +  +F G    
Sbjct: 327 AEPAYYAVREAFVLYLNSTLPLLDVTETSVLDTCFQWPPPPRQSVTLPQLVLHFDGAD-W 385

Query: 439 TLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
            LP  N+++     G  C A A S  G SIIG+ Q +   + +D  N  + F P  C
Sbjct: 386 ELPLQNYMLVDPSTGGLCLAMATSSDG-SIIGSYQHQNFNVLYDLENSLLSFVPAPC 441


>gi|224126551|ref|XP_002329582.1| predicted protein [Populus trichocarpa]
 gi|222870291|gb|EEF07422.1| predicted protein [Populus trichocarpa]
          Length = 440

 Score =  219 bits (557), Expect = 4e-54,   Method: Compositional matrix adjust.
 Identities = 144/435 (33%), Positives = 215/435 (49%), Gaps = 40/435 (9%)

Query: 76  EARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAK 135
            A +  ELVHRD   S    +   H  R    ++  M+R V RV    R      A  + 
Sbjct: 28  NAGFTTELVHRDSPKSPLYNSQQTHLQR----WNKAMRRSVSRVHHFQRT----AATVSP 79

Query: 136 HEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD 195
            EV+       S +    GEY + + +G+PP     + D+GSD++W QC PC +CYKQ  
Sbjct: 80  KEVE-------SEIIANGGEYLMSLSLGTPPFEILAIADTGSDLIWTQCTPCDKCYKQIA 132

Query: 196 PVFDPADSASFSGVSCSSAVCDRL-ENAGCHAGR-CRYEVSYGDGSYTKGTLALETLTI- 252
           P+FDP  S ++  +SC +  C  L E++ C + + C+Y   YGD S+T G LA++T+T+ 
Sbjct: 133 PLFDPKSSKTYRDLSCDTRQCQNLGESSSCSSEQLCQYSYYYGDRSFTNGNLAVDTVTLP 192

Query: 253 ----GRTVVKNVAIGCGHKNQGMFVGA-AGLLGLGGGSMSLVGQLGGQTGGAFSYCLV-- 305
               G        IGCG +N G F    +G++GLGGG MSL+ Q+G   GG FSYCLV  
Sbjct: 193 STNGGPVYFPKTVIGCGRRNNGTFDKKDSGIIGLGGGPMSLISQMGSSVGGKFSYCLVPF 252

Query: 306 -SRGTGSSGSLVFGREALPVGAAW--VPLV-RNPRAPSFYYVGLSGLGVGGMRIPISEDL 361
            S   G+S  L FGR A+  G+     PL+ +NP   +FYY+ L  + VG  +I      
Sbjct: 253 SSESAGNSSKLHFGRNAVVSGSGVQSTPLISKNPD--TFYYLTLEAMSVGDKKIEFGG-- 308

Query: 362 FRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVS-IFDTCYNLSGF 420
                  +  +++D+GT++T  P   +  F  A      N  R    S +   CY  +  
Sbjct: 309 -SSFGGSEGNIIIDSGTSLTLFPVNFFTEFATAVENAVINGERTQDASGLLSHCYRPTP- 366

Query: 421 VSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQIS 480
             ++VP ++ +F+G  V+    + F++  DD    C AF  + SG +I GN+ Q    I 
Sbjct: 367 -DLKVPVITAHFNGADVVLQTLNTFILISDDV--LCLAFNSTQSG-AIFGNVAQMNFLIG 422

Query: 481 FDGANGFVGFGPNVC 495
           +D     V F P  C
Sbjct: 423 YDIQGKSVSFKPTDC 437


>gi|115448353|ref|NP_001047956.1| Os02g0720900 [Oryza sativa Japonica Group]
 gi|45735844|dbj|BAD12879.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
 gi|45735970|dbj|BAD12999.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
 gi|113537487|dbj|BAF09870.1| Os02g0720900 [Oryza sativa Japonica Group]
 gi|125540930|gb|EAY87325.1| hypothetical protein OsI_08729 [Oryza sativa Indica Group]
 gi|215692622|dbj|BAG88042.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 458

 Score =  219 bits (557), Expect = 4e-54,   Method: Compositional matrix adjust.
 Identities = 144/407 (35%), Positives = 212/407 (52%), Gaps = 33/407 (8%)

Query: 108 FHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTD-----------VVSGMDQGSGEY 156
           F A +  D  R+++L  RL+     +A+    D   D           +  G   G G Y
Sbjct: 65  FTAVLTHDDARISSLAARLAK--TPSARATSLDADADAGLAGSLASVPLSPGASVGVGNY 122

Query: 157 FVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC-SQCYKQSDPVFDPADSASFSGVSCSSAV 215
             R+G+G+P     MV+D+GS + W+QC PC   C++QS PVF+P  S++++ V CS+  
Sbjct: 123 VTRMGLGTPATQYVMVVDTGSSLTWLQCSPCLVSCHRQSGPVFNPKSSSTYASVGCSAQQ 182

Query: 216 CDRLENA-----GCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQ 269
           C  L +A      C +   C Y+ SYGD S++ G L+ +T++ G T + N   GCG  N+
Sbjct: 183 CSDLPSATLNPSACSSSNVCIYQASYGDSSFSVGYLSKDTVSFGSTSLPNFYYGCGQDNE 242

Query: 270 GMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWV 329
           G+F  +AGL+GL    +SL+ QL    G +F+YCL S  +    SL       P   ++ 
Sbjct: 243 GLFGRSAGLIGLARNKLSLLYQLAPSLGYSFTYCLPSSSSSGYLSLGSYN---PGQYSYT 299

Query: 330 PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYE 389
           P+V +    S Y++ LSG+ V G   P+S      + +     ++D+GT +TRLPT  Y 
Sbjct: 300 PMVSSSLDDSLYFIKLSGMTVAGN--PLSVSSSAYSSL---PTIIDSGTVITRLPTSVYS 354

Query: 390 AFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVS-VRVPTVSFYFSGGPVLTLPASNFLIP 448
           A   A  A      RAS  SI DTC+   G  S V  P V+  F+GG  L L A N L+ 
Sbjct: 355 ALSKAVAAAMKGTSRASAYSILDTCFK--GQASRVSAPAVTMSFAGGAALKLSAQNLLVD 412

Query: 449 VDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           VDD+ T C AFAP+ S  +IIGN QQ+   + +D  +  +GF    C
Sbjct: 413 VDDSTT-CLAFAPARSA-AIIGNTQQQTFSVVYDVKSSRIGFAAGGC 457


>gi|242092898|ref|XP_002436939.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
 gi|241915162|gb|EER88306.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
          Length = 469

 Score =  218 bits (556), Expect = 4e-54,   Method: Compositional matrix adjust.
 Identities = 134/353 (37%), Positives = 197/353 (55%), Gaps = 25/353 (7%)

Query: 155 EYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC--SQCYKQSDPVFDPADSASFSGVSCS 212
           EY V +G G+P   Q +++D+GSD+ WVQC PC  ++CY Q DP+FDP+ S++++ ++C+
Sbjct: 130 EYVVTLGFGTPSVPQVLLMDTGSDVSWVQCTPCNSTKCYPQKDPLFDPSKSSTYAPIACN 189

Query: 213 SAVCDRLEN---AGCHAG--RCRYEVSYGDGSYTKGTLALETLTIGRTV-VKNVAIGCGH 266
           +  C +L +    GC +G  +C Y V Y DGS+++G  + ETLT+   + V++   GCG 
Sbjct: 190 TDACRKLGDHYHNGCTSGGTQCGYSVEYADGSHSRGVYSNETLTLAPGITVEDFHFGCGR 249

Query: 267 KNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGA 326
             +G      GLLGLGG  +SLV Q     GGAFSYCL +  +  +G LV G       +
Sbjct: 250 DQRGPSDKYDGLLGLGGAPVSLVVQTSSVYGGAFSYCLPALNS-EAGFLVLGSPPSGNKS 308

Query: 327 AWV--PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLP 384
           A+V  P+   P   +FY V ++G+ VGG  + I +  FR       G+++D+GT  T LP
Sbjct: 309 AFVFTPMRHLPGYATFYMVTMTGISVGGKPLHIPQSAFR------GGMIIDSGTVDTELP 362

Query: 385 TPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASN 444
             AY A   A        P       FDTCYN +G+ ++ VP V+F FSGG  + L   N
Sbjct: 363 ETAYNALEAALRKALKAYPLVPS-DDFDTCYNFTGYSNITVPRVAFTFSGGATIDLDVPN 421

Query: 445 FLIPVDDAGTFCFAFAPS--PSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
            ++ V+D    C AF  S    GL IIGN+ Q  +++ +D   G VGF    C
Sbjct: 422 GIL-VND----CLAFQESGPDDGLGIIGNVNQRTLEVLYDAGRGNVGFRAGAC 469


>gi|242095586|ref|XP_002438283.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
 gi|241916506|gb|EER89650.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
          Length = 470

 Score =  218 bits (556), Expect = 4e-54,   Method: Compositional matrix adjust.
 Identities = 138/391 (35%), Positives = 202/391 (51%), Gaps = 27/391 (6%)

Query: 117 KRVATLVRR-LSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDS 175
           +R  T +R+  +  GA     +       +  G   G G Y   +G+G+P  S  MV+D+
Sbjct: 94  RRPTTSLRKPKAAAGASGGPLDDSLASVPLTPGTSVGVGNYVTELGLGTPATSYAMVVDT 153

Query: 176 GSDIVWVQCQPCS-QCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCR---- 230
           GS + W+QC PC   C++Q  P++DP  S++++ V CS++ CD L+ A  +   C     
Sbjct: 154 GSSLTWLQCSPCVVSCHRQVGPLYDPRASSTYATVPCSASQCDELQAATLNPSACSVRNV 213

Query: 231 --YEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSL 288
             Y+ SYGD S++ G L+ +T++ G     N   GCG  N+G+F  +AGL+GL    +SL
Sbjct: 214 CIYQASYGDSSFSVGYLSRDTVSFGSGSYPNFYYGCGQDNEGLFGRSAGLIGLARNKLSL 273

Query: 289 VGQLGGQTGGAFSYCL---VSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGL 345
           + QL    G +FSYCL    S G  S G    G        ++ P+  +    S Y+V L
Sbjct: 274 LYQLAPSLGYSFSYCLPTPASTGYLSIGPYTSGHY------SYTPMASSSLDASLYFVTL 327

Query: 346 SGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRA 405
           SG+ VGG  + +S       +      ++D+GT +TRLPT  Y A   A  A    +  A
Sbjct: 328 SGMSVGGSPLAVSP-----AEYSSLPTIIDSGTVITRLPTAVYTALSKAVAAAMVGVQSA 382

Query: 406 SGVSIFDTCYNLSGFVS-VRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPS 464
              SI DTC+   G  S +RVP V+  F+GG  L L   N LI VDD+ T C AFAP+ S
Sbjct: 383 PAFSILDTCFQ--GQASQLRVPAVAMAFAGGATLKLATQNVLIDVDDSTT-CLAFAPTDS 439

Query: 465 GLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
             +IIGN QQ+   + +D A   +GF    C
Sbjct: 440 -TTIIGNTQQQTFSVVYDVAQSRIGFAAGGC 469


>gi|413938607|gb|AFW73158.1| aspartic proteinase nepenthesin-2 [Zea mays]
          Length = 478

 Score =  218 bits (555), Expect = 5e-54,   Method: Compositional matrix adjust.
 Identities = 164/433 (37%), Positives = 228/433 (52%), Gaps = 40/433 (9%)

Query: 81  LELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQD 140
           L L HR    + S  ++         S    ++ D +R   ++RR+SG        +   
Sbjct: 68  LRLTHRHGPCAPSRASS-----LAAPSVADTLRADQRRAEYILRRVSGRAPQLWDSKAAA 122

Query: 141 FGTDVVS--GMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS---QCYKQSD 195
               V +  G D G+  Y V   +G+P  +Q M +D+GSD+ WVQC+PCS    CY Q D
Sbjct: 123 AAATVPASWGYDIGTLNYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCSAAPSCYSQKD 182

Query: 196 PVFDPADSASFSGVSCSSAVCDRL---ENAGCHAGRCRYEVSYGDGSYTKGTLALETLTI 252
           P+FDPA S+S++ V C   VC  L     + C A +C Y VSYGDGS T G  + +TLT+
Sbjct: 183 PLFDPAQSSSYAAVPCGGPVCAGLGIYAASACSAAQCGYVVSYGDGSNTTGVYSSDTLTL 242

Query: 253 -GRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGS 311
              + V+    GCGH   G+F G  GLLGLG    SLV Q  G  GG FSYCL ++ + +
Sbjct: 243 SASSAVQGFFFGCGHAQSGLFNGVDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPS-T 301

Query: 312 SGSLVFGREALPVGAA----WVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQM 367
           +G L  G    P GAA       L+ +P AP++Y V L+G+ VGG ++ +    F    +
Sbjct: 302 AGYLTLGVGG-PSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGGTV 360

Query: 368 GDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGN--LPRASGVSIFDTCYNLSGFVSVRV 425
                 +DTGT VTRLP  AY A R AF +   +   P A    I DTCYN +G+ +V +
Sbjct: 361 ------VDTGTVVTRLPPTAYAALRSAFRSGMASYGYPTAPSNGILDTCYNFAGYGTVTL 414

Query: 426 PTVSFYFSGGPVLTLPASNFLIPVDDAGTF-CFAFAPSPS--GLSIIGNIQQEGIQISFD 482
           P V+  F  G  +TL A   L       +F C AFAPS S  G++I+GN+QQ   ++  D
Sbjct: 415 PNVALTFGSGATVTLGADGIL-------SFGCLAFAPSGSDGGMAILGNVQQRSFEVRID 467

Query: 483 GANGFVGFGPNVC 495
           G +  VGF P+ C
Sbjct: 468 GTS--VGFKPSSC 478


>gi|15217764|ref|NP_176663.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|5042418|gb|AAD38257.1|AC006193_13 Hypothetical Protein [Arabidopsis thaliana]
 gi|332196174|gb|AEE34295.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 431

 Score =  218 bits (555), Expect = 6e-54,   Method: Compositional matrix adjust.
 Identities = 148/429 (34%), Positives = 225/429 (52%), Gaps = 38/429 (8%)

Query: 79  WNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEV 138
           + ++L+HRD   S         ++    +   RM+  ++R A    + S    DA+ +  
Sbjct: 26  FTIDLIHRDSPKSP--------FYNSAETSSQRMRNAIRRSARSTLQFSND--DASPNSP 75

Query: 139 QDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVF 198
           Q F T          GEY + I +G+PP     + D+GSD++W QC PC  CY+Q+ P+F
Sbjct: 76  QSFIT-------SNRGEYLMNISIGTPPVPILAIADTGSDLIWTQCNPCEDCYQQTSPLF 128

Query: 199 DPADSASFSGVSCSSAVCDRLENAGCHAGR--CRYEVSYGDGSYTKGTLALETLTIG--- 253
           DP +S+++  VSCSS+ C  LE+A C      C Y ++YGD SYTKG +A++T+T+G   
Sbjct: 129 DPKESSTYRKVSCSSSQCRALEDASCSTDENTCSYTITYGDNSYTKGDVAVDTVTMGSSG 188

Query: 254 -RTV-VKNVAIGCGHKNQGMFVGA-AGLLGLGGGSMSLVGQLGGQTGGAFSYCLV--SRG 308
            R V ++N+ IGCGH+N G F  A +G++GLGGGS SLV QL     G FSYCLV  +  
Sbjct: 189 RRPVSLRNMIIGCGHENTGTFDPAGSGIIGLGGGSTSLVSQLRKSINGKFSYCLVPFTSE 248

Query: 309 TGSSGSLVFGREALPVGAAWVPLVRNPRAP-SFYYVGLSGLGVGGMRIPISEDLFRLTQM 367
           TG +  + FG   +  G   V      + P ++Y++ L  + VG  +I  +  +F     
Sbjct: 249 TGLTSKINFGTNGIVSGDGVVSTSMVKKDPATYYFLNLEAISVGSKKIQFTSTIF---GT 305

Query: 368 GDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVS-IFDTCYNLSGFVSVRVP 426
           G+  +V+D+GT +T LP+  Y    ++ VA T    R      I   CY  S   S +VP
Sbjct: 306 GEGNIVIDSGTTLTLLPSNFYYEL-ESVVASTIKAERVQDPDGILSLCYRDSS--SFKVP 362

Query: 427 TVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANG 486
            ++ +F GG V     + F+   +D    CFAFA +   L+I GN+ Q    + +D  +G
Sbjct: 363 DITVHFKGGDVKLGNLNTFVAVSEDVS--CFAFAANEQ-LTIFGNLAQMNFLVGYDTVSG 419

Query: 487 FVGFGPNVC 495
            V F    C
Sbjct: 420 TVSFKKTDC 428


>gi|212274713|ref|NP_001130791.1| uncharacterized protein LOC100191895 precursor [Zea mays]
 gi|194690124|gb|ACF79146.1| unknown [Zea mays]
 gi|194708040|gb|ACF88104.1| unknown [Zea mays]
 gi|223950469|gb|ACN29318.1| unknown [Zea mays]
 gi|414885521|tpg|DAA61535.1| TPA: hypothetical protein ZEAMMB73_650724 [Zea mays]
          Length = 500

 Score =  218 bits (554), Expect = 7e-54,   Method: Compositional matrix adjust.
 Identities = 141/380 (37%), Positives = 201/380 (52%), Gaps = 43/380 (11%)

Query: 145 VVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSA 204
           V SG    +  Y   +G+G    +  +++D+ S++ WVQC PC  C+ Q  P+FDP+ S 
Sbjct: 132 VSSGARLRTLNYVATVGLGGGEAT--VIVDTASELTWVQCAPCESCHDQQGPLFDPSSSP 189

Query: 205 SFSGVSCSSAVCDRLEN-----AG-----CHAGR---CRYEVSYGDGSYTKGTLALETLT 251
           S++ V C S  CD L+      AG     C AGR   C Y +SY DGSY++G LA + L+
Sbjct: 190 SYAAVPCDSPSCDALQQQLATGAGAGAPPCDAGRPAACSYALSYRDGSYSRGVLAHDRLS 249

Query: 252 IGRTVVKNVAIGCGHKNQG-MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCL-VSRGT 309
           +   V+     GCG  NQG  F G +GL+GLG   +SLV Q   Q GG FSYCL +SR +
Sbjct: 250 LAGEVIDGFVFGCGTSNQGPPFGGTSGLMGLGRSQLSLVSQTVDQFGGVFSYCLPLSRES 309

Query: 310 GSSGSLVFG------REALPVGAAWV-----PLVRNPRAPSFYYVGLSGLGVGGMRIPIS 358
            +SGSLV G      R + PV    +     PL++ P    FY V L+G+ VGG  +  +
Sbjct: 310 DASGSLVLGDDPSAYRNSTPVVYTSMVSNSDPLLQGP----FYLVNLTGITVGGQEVEST 365

Query: 359 EDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLS 418
               R         ++D+GT +T L    Y A R  F++Q    P+A G SI DTC+N++
Sbjct: 366 GFSAR--------AIVDSGTVITSLVPSVYNAVRAEFMSQLAEYPQAPGFSILDTCFNMT 417

Query: 419 GFVSVRVPTVSFYFSGGPVLTLPASNFLIPV-DDAGTFCFAFA--PSPSGLSIIGNIQQE 475
           G   V+VP+++  F GG  + + +   L  V  D+   C A A   S    SIIGN QQ+
Sbjct: 418 GLKEVQVPSLTLVFDGGAEVEVDSGGVLYFVSSDSSQVCLAVASLKSEDETSIIGNYQQK 477

Query: 476 GIQISFDGANGFVGFGPNVC 495
            +++ FD +   VGF    C
Sbjct: 478 NLRVVFDTSASQVGFAQETC 497


>gi|115448349|ref|NP_001047954.1| Os02g0720600 [Oryza sativa Japonica Group]
 gi|45735841|dbj|BAD12876.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
           [Oryza sativa Japonica Group]
 gi|45735967|dbj|BAD12996.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
           [Oryza sativa Japonica Group]
 gi|113537485|dbj|BAF09868.1| Os02g0720600 [Oryza sativa Japonica Group]
 gi|125583492|gb|EAZ24423.1| hypothetical protein OsJ_08176 [Oryza sativa Japonica Group]
 gi|215740989|dbj|BAG97484.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 463

 Score =  216 bits (550), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 153/447 (34%), Positives = 221/447 (49%), Gaps = 44/447 (9%)

Query: 78  RWNLELV--HRDKMSSSSNTTNNMHYHRH-----------QHSFHARMQRDVKRVATLVR 124
             N E V   R+ +SSS + T     HRH           + +    ++RD  R   + R
Sbjct: 32  ELNSEAVCSERNAISSSLSGTTVALNHRHGPCSPVPSSKKRPTEEELLKRDQLRAEHIQR 91

Query: 125 RLS-----GGGADAAKHEVQD-FGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSD 178
           + +      G  D  + +V     T + S +D  + EY + +G+G+P  +Q + ID+GSD
Sbjct: 92  KFAMNAAVDGAGDLQQSKVSSSVPTKLGSSLD--TLEYVISVGLGTPAVTQTVTIDTGSD 149

Query: 179 IVWVQCQPCSQ--CYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAG----RCRYE 232
           + WVQC PC    CY Q+  +FDPA S+++  VSC++A C +LE  G   G     C+Y 
Sbjct: 150 VSWVQCNPCPNPPCYAQTGALFDPAKSSTYRAVSCAAAECAQLEQQGNGCGATNYECQYG 209

Query: 233 VSYGDGSYTKGTLALETLTI--GRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVG 290
           V YGDGS T GT + +TLT+      VK    GC H   G      GL+GLGGG+ SLV 
Sbjct: 210 VQYGDGSTTNGTYSRDTLTLSGASDAVKGFQFGCSHVESGFSDQTDGLMGLGGGAQSLVS 269

Query: 291 QLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGV 350
           Q     G +FSYCL    +GSSG L  G      G     ++R+ + P+FY   L  + V
Sbjct: 270 QTAAAYGNSFSYCLPPT-SGSSGFLTLGGGGGVSGFVTTRMLRSRQIPTFYGARLQDIAV 328

Query: 351 GGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSI 410
           GG ++ +S  +F        G V+D+GT +TRLP  AY A   AF A       A   SI
Sbjct: 329 GGKQLGLSPSVFAA------GSVVDSGTIITRLPPTAYSALSSAFKAGMKQYRSAPARSI 382

Query: 411 FDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPS--PSGLSI 468
            DTC++ +G   + +PTV+  FSGG  + L  +  +         C AFA +       I
Sbjct: 383 LDTCFDFAGQTQISIPTVALVFSGGAAIDLDPNGIMY------GNCLAFAATGDDGTTGI 436

Query: 469 IGNIQQEGIQISFDGANGFVGFGPNVC 495
           IGN+QQ   ++ +D  +  +GF    C
Sbjct: 437 IGNVQQRTFEVLYDVGSSTLGFRSGAC 463


>gi|226509408|ref|NP_001141440.1| uncharacterized protein LOC100273550 precursor [Zea mays]
 gi|194704586|gb|ACF86377.1| unknown [Zea mays]
 gi|413938617|gb|AFW73168.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
          Length = 478

 Score =  216 bits (550), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 163/433 (37%), Positives = 227/433 (52%), Gaps = 40/433 (9%)

Query: 81  LELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADA--AKHEV 138
           L L HR    + S  ++         S    ++ D +R   ++RR+SG       +K   
Sbjct: 68  LRLTHRHGPCAPSRASS-----LAAPSVADTLRADQRRAEYILRRVSGRAPQLWDSKAAA 122

Query: 139 QDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS---QCYKQSD 195
                    G D G+  Y V   +G+P  +Q M +D+GSD+ WVQC+PC+    CY Q D
Sbjct: 123 AVATVPASWGYDIGTLNYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCAAAPSCYSQKD 182

Query: 196 PVFDPADSASFSGVSCSSAVCDRL---ENAGCHAGRCRYEVSYGDGSYTKGTLALETLTI 252
           P+FDPA S+S++ V C   VC  L     + C A +C Y VSYGDGS T G  + +TLT+
Sbjct: 183 PLFDPAQSSSYAAVPCGGPVCAGLGIYAASACSAAQCGYVVSYGDGSNTTGVYSSDTLTL 242

Query: 253 -GRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGS 311
              + V+    GCGH   G+F G  GLLGLG    SLV Q  G  GG FSYCL ++ + +
Sbjct: 243 SASSAVQGFFFGCGHAQSGLFNGVDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPS-T 301

Query: 312 SGSLVFGREALPVGAA----WVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQM 367
           +G L  G    P GAA       L+ +P AP++Y V L+G+ VGG ++ +    F    +
Sbjct: 302 AGYLTLGVGG-PSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGGTV 360

Query: 368 GDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGN--LPRASGVSIFDTCYNLSGFVSVRV 425
                 +DTGT VTRLP  AY A R AF +   +   P A    I DTCYN +G+ +V +
Sbjct: 361 ------VDTGTVVTRLPPTAYAALRSAFRSGMASYGYPTAPSNGILDTCYNFAGYGTVTL 414

Query: 426 PTVSFYFSGGPVLTLPASNFLIPVDDAGTF-CFAFAPSPS--GLSIIGNIQQEGIQISFD 482
           P V+  F  G  +TL A   L       +F C AFAPS S  G++I+GN+QQ   ++  D
Sbjct: 415 PNVALTFGSGATVTLGADGIL-------SFGCLAFAPSGSDGGMAILGNVQQRSFEVRID 467

Query: 483 GANGFVGFGPNVC 495
           G +  VGF P+ C
Sbjct: 468 GTS--VGFKPSSC 478


>gi|125561848|gb|EAZ07296.1| hypothetical protein OsI_29544 [Oryza sativa Indica Group]
          Length = 448

 Score =  216 bits (550), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 132/362 (36%), Positives = 186/362 (51%), Gaps = 26/362 (7%)

Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSS 213
           GEY + + +G+PP     ++D+GSD++W QC PC  C  Q  P F PA SA++  V C S
Sbjct: 90  GEYLMDLAIGTPPLRYTAMVDTGSDLIWTQCAPCVLCADQPTPYFRPARSATYRLVPCRS 149

Query: 214 AVCDRLENAGC-HAGRCRYEVSYGDGSYTKGTLALETLTIG-----RTVVKNVAIGCGHK 267
            +C  L    C     C Y+  YGD + T G LA ET T G     + +V +VA GCG+ 
Sbjct: 150 PLCAALPYPACFQRSVCVYQYYYGDEASTAGVLASETFTFGAANSSKVMVSDVAFGCGNI 209

Query: 268 NQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREAL----- 322
           N G    ++G++GLG G +SLV QLG      FSYCL S  +     L FG  A      
Sbjct: 210 NSGQLANSSGMVGLGRGPLSLVSQLGPSR---FSYCLTSFLSPEPSRLNFGVFATLNGTN 266

Query: 323 ------PVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDT 376
                 PV +   PLV N   PS Y++ L G+ +G  R+PI   +F +   G  GV +D+
Sbjct: 267 ASSSGSPVQS--TPLVVNAALPSLYFMSLKGISLGQKRLPIDPLVFAINDDGTGGVFIDS 324

Query: 377 GTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSI-FDTCYNL--SGFVSVRVPTVSFYFS 433
           GT++T L   AY+A R   V+    LP  +   I  +TC+       V+V VP +  +F 
Sbjct: 325 GTSLTWLQQDAYDAVRHELVSVLRPLPPTNDTEIGLETCFPWPPPPSVAVTVPDMELHFD 384

Query: 434 GGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPN 493
           GG  +T+P  N+++     G  C A   S    +IIGN QQ+ + I +D AN  + F P 
Sbjct: 385 GGANMTVPPENYMLIDGATGFLCLAMIRS-GDATIIGNYQQQNMHILYDIANSLLSFVPA 443

Query: 494 VC 495
            C
Sbjct: 444 PC 445


>gi|449515033|ref|XP_004164554.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 430

 Score =  216 bits (549), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 146/430 (33%), Positives = 225/430 (52%), Gaps = 43/430 (10%)

Query: 79  WNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEV 138
           +   L HRD + S    ++  HY R  ++F    +R + R ATL+ R +  GA       
Sbjct: 30  FTTSLFHRDSLLSPLEFSSLSHYDRLTNAF----RRSLSRSATLLNRAATNGA------- 78

Query: 139 QDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVF 198
                D+ + +  GSGEY + + +G+PP     + D+GSD++W QC PC +CYKQS P+F
Sbjct: 79  ----LDLQAPLTPGSGEYLMSVSIGTPPVDYIGMADTGSDLMWAQCLPCLKCYKQSRPIF 134

Query: 199 DPADSASFSGVSCSSAVCDRLENAGCHA-GRCRYEVSYGDGSYTKGTLALETLTIGRTVV 257
           DP  S SFS V C+S  C  ++++ C A G C Y  +YGD +YTKG L  E +TIG + V
Sbjct: 135 DPLKSTSFSHVPCNSQNCKAIDDSHCGAQGVCDYSYTYGDQTYTKGDLGFEKITIGSSSV 194

Query: 258 KNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGA--FSYCLVSRGTGSSGSL 315
           K+V IGCGH++ G F  A+G++GLGGG +SLV Q+   +G +  FSYCL +  + ++G +
Sbjct: 195 KSV-IGCGHESGGGFGFASGVIGLGGGQLSLVSQMSQTSGISRRFSYCLPTLLSHANGKI 253

Query: 316 VFGREALPVGAAWV--PLV-RNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGV 372
            FG+ A+  G   V  PL+ +NP   ++YYV L  + +G  R         +       V
Sbjct: 254 NFGQNAVVSGPGVVSTPLISKNPV--TYYYVTLEAISIGNER--------HMASAKQGNV 303

Query: 373 VMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGV----SIFDTCYN--LSGFVSVRVP 426
           ++D+GT ++ LP   Y    D  V+    + +A  V    + +D C++  ++   S  +P
Sbjct: 304 IIDSGTTLSFLPKELY----DGVVSSLLKVVKAKRVKDPGNFWDLCFDDGINVATSSGIP 359

Query: 427 TVSFYFSGGP-VLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGAN 485
            ++  FSGG  V  LP + F    ++        A       IIGN+      I +D   
Sbjct: 360 IITAQFSGGANVNLLPVNTFQKVANNVNCLTLTPASPTDEFGIIGNLALANFLIGYDLEA 419

Query: 486 GFVGFGPNVC 495
             + F P VC
Sbjct: 420 KRLSFKPTVC 429


>gi|115476828|ref|NP_001062010.1| Os08g0469000 [Oryza sativa Japonica Group]
 gi|42407407|dbj|BAD09565.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
 gi|113623979|dbj|BAF23924.1| Os08g0469000 [Oryza sativa Japonica Group]
 gi|125603713|gb|EAZ43038.1| hypothetical protein OsJ_27627 [Oryza sativa Japonica Group]
          Length = 448

 Score =  216 bits (549), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 132/362 (36%), Positives = 186/362 (51%), Gaps = 26/362 (7%)

Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSS 213
           GEY + + +G+PP     ++D+GSD++W QC PC  C  Q  P F PA SA++  V C S
Sbjct: 90  GEYLMDLAIGTPPLRYTAMVDTGSDLIWTQCAPCVLCADQPTPYFRPARSATYRLVPCRS 149

Query: 214 AVCDRLENAGC-HAGRCRYEVSYGDGSYTKGTLALETLTIG-----RTVVKNVAIGCGHK 267
            +C  L    C     C Y+  YGD + T G LA ET T G     + +V +VA GCG+ 
Sbjct: 150 PLCAALPYPACFQRSVCVYQYYYGDEASTAGVLASETFTFGAANSSKVMVSDVAFGCGNI 209

Query: 268 NQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREAL----- 322
           N G    ++G++GLG G +SLV QLG      FSYCL S  +     L FG  A      
Sbjct: 210 NSGQLANSSGMVGLGRGPLSLVSQLGPSR---FSYCLTSFLSPEPSRLNFGVFATLNGTN 266

Query: 323 ------PVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDT 376
                 PV +   PLV N   PS Y++ L G+ +G  R+PI   +F +   G  GV +D+
Sbjct: 267 ASSSGSPVQS--TPLVVNAALPSLYFMSLKGISLGQKRLPIDPLVFAINDDGTGGVFIDS 324

Query: 377 GTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSI-FDTCYNL--SGFVSVRVPTVSFYFS 433
           GT++T L   AY+A R   V+    LP  +   I  +TC+       V+V VP +  +F 
Sbjct: 325 GTSLTWLQQDAYDAVRRELVSVLRPLPPTNDTEIGLETCFPWPPPPSVAVTVPDMELHFD 384

Query: 434 GGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPN 493
           GG  +T+P  N+++     G  C A   S    +IIGN QQ+ + I +D AN  + F P 
Sbjct: 385 GGANMTVPPENYMLIDGATGFLCLAMIRS-GDATIIGNYQQQNMHILYDIANSLLSFVPA 443

Query: 494 VC 495
            C
Sbjct: 444 PC 445


>gi|255548660|ref|XP_002515386.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223545330|gb|EEF46835.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 387

 Score =  215 bits (548), Expect = 4e-53,   Method: Compositional matrix adjust.
 Identities = 145/392 (36%), Positives = 207/392 (52%), Gaps = 15/392 (3%)

Query: 112 MQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYM 171
           + +D  RV ++  R S   A +   E+Q     V SG+  G+G Y V++ +G+P  S  +
Sbjct: 2   LLQDQLRVKSMHARFSNKNAGSHFKEMQA-DIPVQSGIPLGAGNYLVKMALGTPKLSLSL 60

Query: 172 VIDSGSDIVWVQCQPC-SQCYKQSDPVFDPADSASFSGVSCSSA----VCDRLENAGCHA 226
            +D+GSDI W QC+PC   CY+Q+   FDP  S+S+  VSCSS+    + D     GC +
Sbjct: 61  ALDTGSDITWTQCEPCVGSCYRQAQTKFDPRKSSSYKNVSCSSSSCRIITDSGGARGCVS 120

Query: 227 GRCRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQGMFVGAAGLLGLGGGS 285
             C Y+V YGDGSY+ G  A E LTI  + V+ N   GCG +N G F   AGLLGLG G 
Sbjct: 121 STCIYKVQYGDGSYSVGFFATEKLTISPSDVISNFLFGCGQQNAGRFGRIAGLLGLGRGK 180

Query: 286 MSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGL 345
           +SL  Q   +    F+YCL S  + S+G L  G + +P    + PL    +   FY + +
Sbjct: 181 LSLALQTSEKYNNLFTYCLPSFSSSSTGHLTLGGQ-VPKSVKFTPLSPAFKNTPFYGIDI 239

Query: 346 SGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRA 405
            GL VGG  +PI   +F      + G ++D+GT +TRL    Y A    F     + P+ 
Sbjct: 240 KGLSVGGHVLPIDASVF-----SNAGAIIDSGTVITRLQPTVYSALSSKFQQLMKDYPKT 294

Query: 406 SGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPS- 464
            G SI DTCY+ SG  S+ VP +SF+F GG  + +     L  ++     C AFAP+   
Sbjct: 295 DGFSILDTCYDFSGNESISVPRISFFFKGGVEVDIKFFGILTVINAWDKVCLAFAPNDDD 354

Query: 465 -GLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
               + GN QQ+   +  D A G +GF P+ C
Sbjct: 355 GDFVVFGNSQQQTYDVVHDLAKGRIGFAPSGC 386


>gi|356569424|ref|XP_003552901.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 536

 Score =  215 bits (548), Expect = 4e-53,   Method: Compositional matrix adjust.
 Identities = 127/373 (34%), Positives = 191/373 (51%), Gaps = 24/373 (6%)

Query: 147 SGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASF 206
           SG   G+GEYF+ + VG+PP+  ++++D+GSD+ W+QC PC  C++Q+ P ++P +S+S+
Sbjct: 161 SGASLGTGEYFIDMFVGTPPKHVWLILDTGSDLSWIQCDPCYDCFEQNGPHYNPNESSSY 220

Query: 207 SGVSCSSAVC------DRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTV---- 256
             +SC    C      D L++       C Y   Y DGS T G  ALET T+  T     
Sbjct: 221 RNISCYDPRCQLVSSPDPLQHCKTENQTCPYFYDYADGSNTTGDFALETFTVNLTWPNGK 280

Query: 257 -----VKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS--RGT 309
                V +V  GCGH N+G F GA GLLGLG G +S   QL    G +FSYCL      T
Sbjct: 281 EKFKHVVDVMFGCGHWNKGFFHGAGGLLGLGRGPLSFPSQLQSIYGHSFSYCLTDLFSNT 340

Query: 310 GSSGSLVFGREALPV---GAAWVPLVRNPRAP--SFYYVGLSGLGVGGMRIPISEDLFRL 364
             S  L+FG +   +      +  L+     P  +FYY+ +  + VGG  + I E  +  
Sbjct: 341 SVSSKLIFGEDKELLNHHNLNFTKLLAGEETPDDTFYYLQIKSIVVGGEVLDIPEKTWHW 400

Query: 365 TQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVR 424
           +  G  G ++D+G+ +T  P  AY+  ++AF  +      A+   I   CYN+SG + V 
Sbjct: 401 SSEGVGGTIIDSGSTLTFFPDSAYDVIKEAFEKKIKLQQIAADDFIMSPCYNVSGAMQVE 460

Query: 425 VPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSP--SGLSIIGNIQQEGIQISFD 482
           +P    +F+ G V   PA N+    +     C A   +P  S L+IIGN+ Q+   I +D
Sbjct: 461 LPDYGIHFADGAVWNFPAENYFYQYEPDEVICLAILKTPNHSHLTIIGNLLQQNFHILYD 520

Query: 483 GANGFVGFGPNVC 495
                +G+ P  C
Sbjct: 521 VKRSRLGYSPRRC 533


>gi|356537728|ref|XP_003537377.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 543

 Score =  215 bits (548), Expect = 4e-53,   Method: Compositional matrix adjust.
 Identities = 123/379 (32%), Positives = 189/379 (49%), Gaps = 30/379 (7%)

Query: 147 SGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASF 206
           SG   G+GEYF+ + VG+PP+  ++++D+GSD+ W+QC PC  C++Q+   + P DS+++
Sbjct: 162 SGASLGTGEYFLDMFVGTPPKHVWLILDTGSDLSWIQCDPCYDCFEQNGSHYYPKDSSTY 221

Query: 207 SGVSCSSAVC------DRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTV---- 256
             +SC    C      D L++       C Y   Y DGS T G  A ET T+  T     
Sbjct: 222 RNISCYDPRCQLVSSSDPLQHCKAENQTCPYFYDYADGSNTTGDFASETFTVNLTWPNGK 281

Query: 257 -----VKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS--RGT 309
                V +V  GCGH N+G F GA+GLLGLG G +S   Q+    G +FSYCL      T
Sbjct: 282 EKFKQVVDVMFGCGHWNKGFFYGASGLLGLGRGPISFPSQIQSIYGHSFSYCLTDLFSNT 341

Query: 310 GSSGSLVFGREALPV---GAAWVPLVRNPRAP--SFYYVGLSGLGVGGMRIPISEDLFRL 364
             S  L+FG +   +      +  L+     P  +FYY+ +  + VGG  + ISE  +  
Sbjct: 342 SVSSKLIFGEDKELLNNHNLNFTTLLAGEETPDETFYYLQIKSIMVGGEVLDISEQTWHW 401

Query: 365 TQ-----MGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSG 419
           +          G ++D+G+ +T  P  AY+  ++AF  +      A+   +   CYN+SG
Sbjct: 402 SSEGAAADAGGGTIIDSGSTLTFFPDSAYDIIKEAFEKKIKLQQIAADDFVMSPCYNVSG 461

Query: 420 -FVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSP--SGLSIIGNIQQEG 476
             + V +P    +F+ G V   PA N+    +     C A   +P  S L+IIGN+ Q+ 
Sbjct: 462 AMMQVELPDFGIHFADGGVWNFPAENYFYQYEPDEVICLAIMKTPNHSHLTIIGNLLQQN 521

Query: 477 IQISFDGANGFVGFGPNVC 495
             I +D     +G+ P  C
Sbjct: 522 FHILYDVKRSRLGYSPRRC 540


>gi|449515031|ref|XP_004164553.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 430

 Score =  215 bits (547), Expect = 5e-53,   Method: Compositional matrix adjust.
 Identities = 140/430 (32%), Positives = 217/430 (50%), Gaps = 43/430 (10%)

Query: 79  WNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEV 138
           +   L HRD + S    ++  HY R  ++F    +R + R A L+ R +  GA   +   
Sbjct: 30  FTTSLFHRDSLLSPLEFSSLSHYDRLANAF----RRSLSRSAALLNRAATSGAVGLQ--- 82

Query: 139 QDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVF 198
                   S +  GSGEY + + +G+PP     + D+GSD+ W QC PC +CY+Q  P+F
Sbjct: 83  --------SSIGPGSGEYLMSVSIGTPPVDYLGIADTGSDLTWAQCLPCLKCYQQLRPIF 134

Query: 199 DPADSASFSGVSCSSAVCDRLENAGCHA-GRCRYEVSYGDGSYTKGTLALETLTIGRTVV 257
           +P  S SFS V C++  C  +++  C   G C Y  +YGD +Y+KG L  E +TIG + V
Sbjct: 135 NPLKSTSFSHVPCNTQTCHAVDDGHCGVQGVCDYSYTYGDRTYSKGDLGFEKITIGSSSV 194

Query: 258 KNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGA---FSYCLVSRGTGSSGS 314
           K+V IGCGH + G F  A+G++GLGGG +SLV Q+  QT G    FSYCL +  + ++G 
Sbjct: 195 KSV-IGCGHASSGGFGFASGVIGLGGGQLSLVSQM-SQTSGISRRFSYCLPTLLSHANGK 252

Query: 315 LVFGREALPVGAAWV--PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGV 372
           + FG  A+  G   V  PL+ +    ++YY+ L  + +G  R         +       V
Sbjct: 253 INFGENAVVSGPGVVSTPLI-SKNTVTYYYITLEAISIGNER--------HMAFAKQGNV 303

Query: 373 VMDTGTAVTRLPTPAYEAFRDAFV----AQTGNLPRASGVSIFDTCYN--LSGFVSVRVP 426
           ++D+GT +T LP   Y+    + +    A+    P  S     D C++  ++   S+ +P
Sbjct: 304 IIDSGTTLTILPKELYDGVVSSLLKVVKAKRVKDPHGS----LDLCFDDGINAAASLGIP 359

Query: 427 TVSFYFSGGP-VLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGAN 485
            ++ +FSGG  V  LP + F    D+        A   +   IIGN+ Q    I +D   
Sbjct: 360 VITAHFSGGANVNLLPINTFRKVADNVNCLTLKAASPTTEFGIIGNLAQANFLIGYDLEA 419

Query: 486 GFVGFGPNVC 495
             + F P VC
Sbjct: 420 KRLSFKPTVC 429


>gi|125540928|gb|EAY87323.1| hypothetical protein OsI_08727 [Oryza sativa Indica Group]
          Length = 463

 Score =  214 bits (546), Expect = 6e-53,   Method: Compositional matrix adjust.
 Identities = 152/447 (34%), Positives = 221/447 (49%), Gaps = 44/447 (9%)

Query: 78  RWNLELV--HRDKMSSSSNTTNNMHYHRH-----------QHSFHARMQRDVKRVATLVR 124
             N E V   R+ +SSS + T     HRH           + +    ++RD  R   + R
Sbjct: 32  ELNSEAVCSERNAISSSLSGTTVALNHRHGPCSPVPSSKKRPTEEELLKRDQLRAEHIQR 91

Query: 125 RLS-----GGGADAAKHEVQD-FGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSD 178
           + +      G  D  + +V     T + S +D  + EY + +G+G+P  +Q + ID+GSD
Sbjct: 92  KFAMNAAVDGAGDLQQSKVSSSVPTKLGSSLD--TLEYVISVGLGTPAVTQTVTIDTGSD 149

Query: 179 IVWVQCQPCSQ--CYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAG----RCRYE 232
           + WVQC PC    C+ Q+  +FDPA S+++  VSC++A C +LE  G   G     C+Y 
Sbjct: 150 VSWVQCNPCPNPPCHAQTGALFDPAKSSTYRAVSCAAAECAQLEQQGNGCGATNYECQYG 209

Query: 233 VSYGDGSYTKGTLALETLTI--GRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVG 290
           V YGDGS T GT + +TLT+      VK    GC H   G      GL+GLGGG+ SLV 
Sbjct: 210 VQYGDGSTTNGTYSRDTLTLSGASDAVKGFQFGCSHLESGFSDQTDGLMGLGGGAQSLVS 269

Query: 291 QLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGV 350
           Q     G +FSYCL    +GSSG L  G      G     ++R+ + P+FY   L  + V
Sbjct: 270 QTAAAYGNSFSYCLPPT-SGSSGFLTLGGGGGASGFVTTRMLRSKQIPTFYGARLQDIAV 328

Query: 351 GGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSI 410
           GG ++ +S  +F        G V+D+GT +TRLP  AY A   AF A       A   SI
Sbjct: 329 GGKQLGLSPSVFAA------GSVVDSGTIITRLPPTAYSALSSAFKAGMKQYRSAPARSI 382

Query: 411 FDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPS--PSGLSI 468
            DTC++ +G   + +PTV+  FSGG  + L  +  +         C AFA +       I
Sbjct: 383 LDTCFDFAGQTQISIPTVALVFSGGAAIDLDPNGIMY------GNCLAFAATGDDGTTGI 436

Query: 469 IGNIQQEGIQISFDGANGFVGFGPNVC 495
           IGN+QQ   ++ +D  +  +GF    C
Sbjct: 437 IGNVQQRTFEVLYDVGSSTLGFRSGAC 463


>gi|326515172|dbj|BAK03499.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 494

 Score =  214 bits (544), Expect = 9e-53,   Method: Compositional matrix adjust.
 Identities = 138/343 (40%), Positives = 182/343 (53%), Gaps = 31/343 (9%)

Query: 168 SQYMVIDSGSDIVWVQCQPCS--QCYKQSDPVFDPADSASFSGVSCSSAVCDRLENA--- 222
           SQ +V+D+ SDI WVQC PC   QC+ Q DP++DPA S++F+ + C S  C  L ++   
Sbjct: 168 SQTVVVDTSSDIPWVQCLPCPIPQCHLQKDPLYDPAKSSTFAPIPCGSPACKELGSSYGN 227

Query: 223 GCH--AGRCRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQGMFVGA-AGL 278
           GC      C+Y V+YGDG  T GT   +TLT+  T VVK+   GC H  +G F    AG+
Sbjct: 228 GCSPTTDECKYIVNYGDGKATTGTYVTDTLTMSPTIVVKDFRFGCSHAVRGSFSNQNAGI 287

Query: 279 LGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGA----AWVPLVRN 334
           L LGGG  SL+ Q     G AFSYC+      S+G L  G    PV A    ++ PL++N
Sbjct: 288 LALGGGRGSLLEQTADAYGNAFSYCIPK--PSSAGFLSLGG---PVEASLKFSYTPLIKN 342

Query: 335 PRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDA 394
             AP+FY V L  + V G ++ +    F        G VMD+G  VT+LP   Y A R A
Sbjct: 343 KHAPTFYIVHLEAIIVAGKQLAVPPTAFAT------GAVMDSGAVVTQLPPQVYAALRAA 396

Query: 395 F-VAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTL-PASNFLIPVDDA 452
           F  A     P A+ V   DTCY+ + F  V+VP VS  F+GG  L L PAS  L      
Sbjct: 397 FRSAMAAYGPLAAPVRNLDTCYDFTRFPDVKVPKVSLVFAGGATLDLEPASIIL-----D 451

Query: 453 GTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           G   FA  P    +  IGN+QQ+  ++ +D   G VGF    C
Sbjct: 452 GCLAFAATPGEESVGFIGNVQQQTYEVLYDVGGGKVGFRRGAC 494


>gi|242058093|ref|XP_002458192.1| hypothetical protein SORBIDRAFT_03g028480 [Sorghum bicolor]
 gi|241930167|gb|EES03312.1| hypothetical protein SORBIDRAFT_03g028480 [Sorghum bicolor]
          Length = 468

 Score =  214 bits (544), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 153/440 (34%), Positives = 217/440 (49%), Gaps = 53/440 (12%)

Query: 89  MSSSSNTTNNMHYHRH------------QHSFHARMQRDVKRVATLVRRLSGG--GADAA 134
           +   SNT +    HRH              SF  R++R+  R   ++ R+S G  G DA 
Sbjct: 49  LDPGSNTVSVPLVHRHGPCAPTQLSSDKPSSFTDRLRRNRARSKYIMSRVSKGMMGDDAD 108

Query: 135 KHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC--SQCYK 192
                  G  V       S EY V +G+G+P  SQ ++ID+GSD+ WVQCQPC  + CY 
Sbjct: 109 VSIPTHLGGSV------DSLEYVVTVGLGTPSVSQVLLIDTGSDLSWVQCQPCNSTTCYP 162

Query: 193 QSDPVFDPADSASFSGVSCSSAVCDRLEN----AGCHAG----RCRYEVSYGDGSYTKGT 244
           Q DP+FDP+ S++++ + C++  C  L +     GC +G    +C + ++YGDGS T+G 
Sbjct: 163 QKDPLFDPSKSSTYAPIPCNTDACRDLTDDGYGGGCASGDGAAQCGFAITYGDGSQTRGV 222

Query: 245 LALETLTIGRTV-VKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYC 303
            + ETL +   V VK+   GCGH   G      GLLGLGG   SLV Q     GGAFSYC
Sbjct: 223 YSNETLALAPGVAVKDFRFGCGHDQDGANDKYDGLLGLGGAPESLVVQTASVYGGAFSYC 282

Query: 304 L------VSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPI 357
           L      V       G    G      G  + P++R     +FY V ++G+ VGG  I +
Sbjct: 283 LPALNNQVGFLALGGGGAPSGGVVNTSGFVFTPMIREEE--TFYVVNMTGITVGGEPIDV 340

Query: 358 SEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNL 417
               F        G+++D+GT VT L   AY A + AF       P      + DTCY+ 
Sbjct: 341 PPSAFS------GGMIIDSGTVVTELQHTAYNALQAAFRKAMAAYPLVRNGEL-DTCYDF 393

Query: 418 SGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPS--PSGLSIIGNIQQE 475
           SG+ +V +P V+  FSGG  + L   N ++ +DD    C AF  S       I+GN+ Q 
Sbjct: 394 SGYSNVTLPKVALTFSGGATIDLDVPNGIL-LDD----CLAFQESGPDDQPGILGNVNQR 448

Query: 476 GIQISFDGANGFVGFGPNVC 495
            +++ +D   G VGF   VC
Sbjct: 449 TLEVLYDAGRGRVGFRAAVC 468


>gi|125558631|gb|EAZ04167.1| hypothetical protein OsI_26309 [Oryza sativa Indica Group]
          Length = 441

 Score =  214 bits (544), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 130/356 (36%), Positives = 189/356 (53%), Gaps = 19/356 (5%)

Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQ-PCSQCYKQSDPVFDPADSASFSGVSC 211
           +  Y V I +G+PP     V+D+GSD++W QC  PC +C+ Q  P++ PA SA+++ VSC
Sbjct: 89  TATYLVDIAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSATYANVSC 148

Query: 212 SSAVCDRLENAGCHAGR----CRYEVSYGDGSYTKGTLALETLTIGR-TVVKNVAIGCGH 266
            S +C  L++           C Y  SYGDG+ T G LA ET T+G  T V+ VA GCG 
Sbjct: 149 RSPMCQALQSPWSRCSPPDTGCAYYFSYGDGTSTDGVLATETFTLGSDTAVRGVAFGCGT 208

Query: 267 KNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREA-LPVG 325
           +N G    ++GL+G+G G +SLV QLG      FSYC       ++  L  G  A L   
Sbjct: 209 ENLGSTDNSSGLVGMGRGPLSLVSQLGVTR---FSYCFTPFNATAASPLFLGSSARLSSA 265

Query: 326 AAWVPLVRNP-----RAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAV 380
           A   P V +P     R  S+YY+ L G+ VG   +PI   +FRLT MGD GV++D+GT  
Sbjct: 266 AKTTPFVPSPSGGARRRSSYYYLSLEGITVGDTLLPIDPAVFRLTPMGDGGVIIDSGTTF 325

Query: 381 TRLPTPAYEAFRDAFVAQTGNLPRASGVSI-FDTCYNLSGFVSVRVPTVSFYFSGGPVLT 439
           T L   A+ A   A  ++   LP ASG  +    C+  +   +V VP +  +F G   + 
Sbjct: 326 TALEESAFVALARALASRV-RLPLASGAHLGLSLCFAAASPEAVEVPRLVLHFDGAD-ME 383

Query: 440 LPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           L   ++++    AG  C     S  G+S++G++QQ+   I +D   G + F P  C
Sbjct: 384 LRRESYVVEDRSAGVACLGMV-SARGMSVLGSMQQQNTHILYDLERGILSFEPAKC 438


>gi|115472517|ref|NP_001059857.1| Os07g0533600 [Oryza sativa Japonica Group]
 gi|22831047|dbj|BAC15910.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|50508280|dbj|BAD32129.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|113611393|dbj|BAF21771.1| Os07g0533600 [Oryza sativa Japonica Group]
 gi|215766673|dbj|BAG98901.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 441

 Score =  213 bits (543), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 130/356 (36%), Positives = 189/356 (53%), Gaps = 19/356 (5%)

Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQ-PCSQCYKQSDPVFDPADSASFSGVSC 211
           +  Y V I +G+PP     V+D+GSD++W QC  PC +C+ Q  P++ PA SA+++ VSC
Sbjct: 89  TATYLVDIAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSATYANVSC 148

Query: 212 SSAVCDRLENAGCHAGR----CRYEVSYGDGSYTKGTLALETLTIGR-TVVKNVAIGCGH 266
            S +C  L++           C Y  SYGDG+ T G LA ET T+G  T V+ VA GCG 
Sbjct: 149 RSPMCQALQSPWSRCSPPDTGCAYYFSYGDGTSTDGVLATETFTLGSDTAVRGVAFGCGT 208

Query: 267 KNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREA-LPVG 325
           +N G    ++GL+G+G G +SLV QLG      FSYC       ++  L  G  A L   
Sbjct: 209 ENLGSTDNSSGLVGMGRGPLSLVSQLGVTR---FSYCFTPFNATAASPLFLGSSARLSSA 265

Query: 326 AAWVPLVRNP-----RAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAV 380
           A   P V +P     R  S+YY+ L G+ VG   +PI   +FRLT MGD GV++D+GT  
Sbjct: 266 AKTTPFVPSPSGGARRRSSYYYLSLEGITVGDTLLPIDPAVFRLTPMGDGGVIIDSGTTF 325

Query: 381 TRLPTPAYEAFRDAFVAQTGNLPRASGVSI-FDTCYNLSGFVSVRVPTVSFYFSGGPVLT 439
           T L   A+ A   A  ++   LP ASG  +    C+  +   +V VP +  +F G   + 
Sbjct: 326 TALEERAFVALARALASRV-RLPLASGAHLGLSLCFAAASPEAVEVPRLVLHFDGAD-ME 383

Query: 440 LPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           L   ++++    AG  C     S  G+S++G++QQ+   I +D   G + F P  C
Sbjct: 384 LRRESYVVEDRSAGVACLGMV-SARGMSVLGSMQQQNTHILYDLERGILSFEPAKC 438


>gi|413953782|gb|AFW86431.1| hypothetical protein ZEAMMB73_038825 [Zea mays]
          Length = 462

 Score =  213 bits (543), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 138/353 (39%), Positives = 185/353 (52%), Gaps = 15/353 (4%)

Query: 147 SGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS-QCYKQSDPVFDPADSAS 205
           +G   G+ E+ V +G G+P ++  ++ D+GSD+ W+QC PCS  CYKQ DP+FDP  SA+
Sbjct: 111 TGTSLGTLEFVVTVGFGTPAQTYTLMFDTGSDVSWIQCLPCSGHCYKQHDPIFDPTKSAT 170

Query: 206 FSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGC 264
           +S V C    C          G C Y+V YGDGS T G L+ ETL++     +   A GC
Sbjct: 171 YSAVPCGHPQCAAAGGKCSSNGTCLYKVQYGDGSSTAGVLSHETLSLTSARALPGFAFGC 230

Query: 265 GHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPV 324
           G  N G F    GL+GLG G +SL  Q     G AFSYCL S  T S G L  G      
Sbjct: 231 GETNLGDFGDVDGLIGLGRGQLSLSSQAAASFGAAFSYCLPSYNT-SHGYLTIGTTTPAS 289

Query: 325 GA---AWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVT 381
           G+    +  +++    PSFY+V L  + VGG  +P+   LF       DG ++D+GT +T
Sbjct: 290 GSDGVRYTAMIQKQDYPSFYFVDLVSIVVGGFVLPVPPILFT-----RDGTLLDSGTVLT 344

Query: 382 RLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLP 441
            LP  AY A RD F         A     FDTCY+ +G  ++ +P VSF FS G    L 
Sbjct: 345 YLPPEAYTALRDRFKFTMTQYKPAPAYDPFDTCYDFAGQNAIFMPLVSFKFSDGSSFDLS 404

Query: 442 ASNFLIPVDD--AGTFCFAFAPSPSGL--SIIGNIQQEGIQISFDGANGFVGF 490
               LI  DD    T C AF P PS +  +I+GN QQ   ++ +D A   +GF
Sbjct: 405 PFGVLIFPDDTAPATGCLAFVPRPSTMPFTIVGNTQQRNTEMIYDVAAEKIGF 457


>gi|297745479|emb|CBI40559.3| unnamed protein product [Vitis vinifera]
          Length = 436

 Score =  213 bits (542), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 127/334 (38%), Positives = 174/334 (52%), Gaps = 74/334 (22%)

Query: 62  ERHNNISSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVAT 121
           E    IS+   S  +    + L HRD ++ ++           +  F+ R+QRD  RV  
Sbjct: 84  ETETQISTLPVSETDPTMTMHLEHRDVLAFNATP---------EALFNLRLQRDAFRVEA 134

Query: 122 LVR-----RLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSG 176
           L +          G +    +   F + V SG+ QGSGEYF R+GVG+PP+  YMV+D+G
Sbjct: 135 LSKMAAAAGGRRAGRNGTHAQGGGFSSSVTSGLAQGSGEYFTRLGVGTPPKYVYMVLDTG 194

Query: 177 SDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGR-CRYEVSY 235
           SD+VW+QC PC +CY Q+DPVFDP  S SFS +SC S +C RL++ GC++ + C Y+V+Y
Sbjct: 195 SDVVWIQCAPCRKCYSQTDPVFDPKKSGSFSSISCRSPLCLRLDSPGCNSRQSCLYQVAY 254

Query: 236 GDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQ 295
           GDGS+T G  + ETLT   T V  VA+GCGH N+G+FVGAAGLLGLG             
Sbjct: 255 GDGSFTFGEFSTETLTFRGTRVPKVALGCGHDNEGLFVGAAGLLGLG------------- 301

Query: 296 TGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRI 355
                                                R PR        L+   VGG R+
Sbjct: 302 -------------------------------------RQPR--------LNRPPVGGARV 316

Query: 356 P-ISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAY 388
             I+  LF+L   G+ GV++D+GT+VTRL   AY
Sbjct: 317 AGITASLFKLDTAGNGGVIIDSGTSVTRLTRRAY 350


>gi|224142011|ref|XP_002324354.1| predicted protein [Populus trichocarpa]
 gi|222865788|gb|EEF02919.1| predicted protein [Populus trichocarpa]
          Length = 471

 Score =  213 bits (542), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 141/400 (35%), Positives = 205/400 (51%), Gaps = 36/400 (9%)

Query: 112 MQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMD------QGSGEYFVRIGVGSP 165
           ++RD  RV ++           AKH +    T V + M          G Y V +G+G+P
Sbjct: 92  LRRDQLRVKSI----------RAKHSMNSSTTGVFNEMKTRVPTTHFGGGYAVTVGLGTP 141

Query: 166 PRSQYMVIDSGSDIVWVQCQPCSQ-CYKQSDPVFDPADSASFSGVSCSSAVCDRL--ENA 222
            +   ++ D+GSD+ W QC+PCS  C+ Q+D  FDP  S S+  +SCSS  C  +  E+A
Sbjct: 142 KKDFSLLFDTGSDLTWTQCEPCSGGCFPQNDEKFDPTKSTSYKNLSCSSEPCKSIGKESA 201

Query: 223 -GCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQGMFVGAAGLL 279
            GC +   C Y V YG G YT G LA ETLTI  + V +N  IGCG +N G F G AGLL
Sbjct: 202 QGCSSSNSCLYGVKYGTG-YTVGFLATETLTITPSDVFENFVIGCGERNGGRFSGTAGLL 260

Query: 280 GLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPS 339
           GLG   ++L  Q        FSYCL +  + S+G L FG   +   A + P+    + P 
Sbjct: 261 GLGRSPVALPSQTSSTYKNLFSYCLPAS-SSSTGHLSFGG-GVSQAAKFTPITS--KIPE 316

Query: 340 FYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQT 399
            Y + +SG+ VGG ++PI   +FR       G ++D+GT +T LP+ A+ A   AF    
Sbjct: 317 LYGLDVSGISVGGRKLPIDPSVFRTA-----GTIIDSGTTLTYLPSTAHSALSSAFQEMM 371

Query: 400 GNLPRASGVSIFDTCYNLSGFV--SVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCF 457
            N     G S    CY+ S     ++ +P +S +F GG  + +  S   I  +     C 
Sbjct: 372 TNYTLTKGTSGLQPCYDFSKHANDNITIPQISIFFEGGVEVDIDDSGIFIAANGLEEVCL 431

Query: 458 AFAP--SPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           AF    + + ++I GN+QQ+  ++ +D A G VGF P  C
Sbjct: 432 AFKDNGNDTDVAIFGNVQQKTYEVVYDVAKGMVGFAPGGC 471


>gi|54290717|dbj|BAD62387.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
           sativa Japonica Group]
 gi|215734915|dbj|BAG95637.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 469

 Score =  212 bits (540), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 144/419 (34%), Positives = 207/419 (49%), Gaps = 46/419 (10%)

Query: 108 FHARMQRDVKRVATLVRRLSG-----------------------GGADAAKHEVQDFGTD 144
           F A +  D  R+A L  RL+                        GG+ A+   V      
Sbjct: 65  FSAVVTHDDARIAHLASRLANNHPTSPSSSSLLHGHRKKKAGGVGGSQASSSSVP----- 119

Query: 145 VVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS-QCYKQSDPVFDPADS 203
           +  G     G Y  R+G+G+P  S  MV+D+GS + W+QC PCS  C++Q+ PVFDP  S
Sbjct: 120 LTPGASVAVGNYVTRLGLGTPATSYVMVVDTGSSLTWLQCSPCSVSCHRQAGPVFDPRAS 179

Query: 204 ASFSGVSCSSAVCDRLENAGCHAGRCR------YEVSYGDGSYTKGTLALETLTIGRTVV 257
            +++ V CSS+ C  L+ A  +   C       Y+ SYGD SY+ G L+ +T++ G    
Sbjct: 180 GTYAAVQCSSSECGELQAATLNPSACSVSNVCIYQASYGDSSYSVGYLSKDTVSFGSGSF 239

Query: 258 KNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVF 317
                GCG  N+G+F  +AGL+GL    +SL+ QL    G AFSYCL +  + ++G L  
Sbjct: 240 PGFYYGCGQDNEGLFGRSAGLIGLAKNKLSLLYQLAPSLGYAFSYCLPTS-SAAAGYLSI 298

Query: 318 GREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTG 377
           G    P   ++ P+  +    S Y+V LSG+ V G  + +    +R         ++D+G
Sbjct: 299 GSYN-PGQYSYTPMASSSLDASLYFVTLSGISVAGAPLAVPPSEYRSLP-----TIIDSG 352

Query: 378 TAVTRLPTPAYEAFRDAFVAQTGNLPRASGV-SIFDTCYNLSGFVSVRVPTVSFYFSGGP 436
           T +TRLP   Y A   A  A   +    +   SI DTC+  S    +RVP V   F+GG 
Sbjct: 353 TVITRLPPNVYTALSRAVAAAMASAAPRAPTYSILDTCFRGSA-AGLRVPRVDMAFAGGA 411

Query: 437 VLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
            L L   N LI VDD+ T C AFAP+  G +IIGN QQ+   + +D A   +GF    C
Sbjct: 412 TLALSPGNVLIDVDDSTT-CLAFAPT-GGTAIIGNTQQQTFSVVYDVAQSRIGFAAGGC 468


>gi|224142013|ref|XP_002324355.1| predicted protein [Populus trichocarpa]
 gi|222865789|gb|EEF02920.1| predicted protein [Populus trichocarpa]
          Length = 477

 Score =  212 bits (540), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 137/395 (34%), Positives = 198/395 (50%), Gaps = 27/395 (6%)

Query: 112 MQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYM 171
           + +D  RV +   RLS   +     E+Q   T + + +    G Y V +G+G+P +   +
Sbjct: 99  LLQDQLRVKSFQVRLSMNPSSGVFKEMQ---TTIPASIVPTGGAYVVTVGLGTPKKDFTL 155

Query: 172 VIDSGSDIVWVQCQPC-SQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAG-----CH 225
             D+GSD+ W QC+PC   C+ Q+ P FDP  S S+  VSCSS  C  +         C 
Sbjct: 156 SFDTGSDLTWTQCEPCLGGCFPQNQPKFDPTTSTSYKNVSCSSEFCKLIAEGNYPAQDCI 215

Query: 226 AGRCRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQGMFVGAAGLLGLGGG 284
           +  C Y + YG G YT G LA ETL I  + V KN   GC  +++G F G  GLLGLG  
Sbjct: 216 SNTCLYGIQYGSG-YTIGFLATETLAIASSDVFKNFLFGCSEESRGTFNGTTGLLGLGRS 274

Query: 285 SMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVG 344
            ++L  Q   +    FSYCL +  + S+G L FG E +   A   P+  +P+    Y + 
Sbjct: 275 PIALPSQTTNKYKNLFSYCLPASPS-STGHLSFGVE-VSQAAKSTPI--SPKLKQLYGLN 330

Query: 345 LSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPR 404
             G+ V G  +PI+  + R         ++D+GT  T LP+P Y A   AF     N   
Sbjct: 331 TVGISVRGRELPINGSISR--------TIIDSGTTFTFLPSPTYSALGSAFREMMANYTL 382

Query: 405 ASGVSIFDTCYNLS--GFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAP- 461
            +G S F  CY+ S  G  ++ +P +S +F GG  + +  S  +IPV+     C AFA  
Sbjct: 383 TNGTSSFQPCYDFSNIGNGTLTIPGISIFFEGGVEVEIDVSGIMIPVNGLKEVCLAFADT 442

Query: 462 -SPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
            S S  +I GN QQ+  ++ +D A G VGF P  C
Sbjct: 443 GSDSDFAIFGNYQQKTYEVIYDVAKGMVGFAPKGC 477


>gi|297817208|ref|XP_002876487.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297322325|gb|EFH52746.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 520

 Score =  212 bits (540), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 141/428 (32%), Positives = 213/428 (49%), Gaps = 48/428 (11%)

Query: 114 RDVKRVATLVRR-LSGGGADAAKHEVQDFGTDVV--------------------SGMDQG 152
           RD+ R+ TL +R L+    +    + +    +VV                    SGM  G
Sbjct: 92  RDLTRIQTLHKRVLAKKNQNTVSQKQKKKNKEVVTTPVASSVEEQAGQLVATLESGMTLG 151

Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
           SGEYF+ + VGSPP+   +++D+GSD+ W+QC PC  C++Q+   +DP  SAS+  ++C+
Sbjct: 152 SGEYFMDVLVGSPPKHFSLILDTGSDLNWIQCLPCHDCFQQNGAFYDPKASASYKNITCN 211

Query: 213 SAVCDRLENAG----CHAGR--CRYEVSYGDGSYTKGTLALETLTIGRTV---------V 257
              C+ +        C +    C Y   YGD S T G  A+ET T+  T          V
Sbjct: 212 DPRCNLVSPPDPPKPCKSDNQSCPYYYWYGDSSNTTGDFAVETFTVNLTTSGGSSELYNV 271

Query: 258 KNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRG--TGSSGSL 315
           +N+  GCGH N+G+F GAAGLLGLG G +S   QL    G +FSYCLV R   T  S  L
Sbjct: 272 ENMMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTNVSSKL 331

Query: 316 VFGREALPVG------AAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGD 369
           +FG +   +        ++V    N    +FYYV +  + V G  + I E+ + ++  G 
Sbjct: 332 IFGEDKDLLSHPNLNFTSFVARKEN-LVDTFYYVQIKSIIVAGEVLNIPEETWNISSDGA 390

Query: 370 DGVVMDTGTAVTRLPTPAYEAFRDAFVAQ-TGNLPRASGVSIFDTCYNLSGFVSVRVPTV 428
            G ++D+GT ++    PAYE  ++    +  G  P      I D C+N+SG  S+++P +
Sbjct: 391 GGTIIDSGTTLSYFAEPAYEFIKNKIAEKAKGKYPVYRDFPILDPCFNVSGIDSIQLPEL 450

Query: 429 SFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSP-SGLSIIGNIQQEGIQISFDGANGF 487
              F+ G V   P  N  I +++    C A   +P S  SIIGN QQ+   I +D     
Sbjct: 451 GIAFADGAVWNFPTENSFIWLNE-DLVCLAILGTPKSAFSIIGNYQQQNFHILYDTKRSR 509

Query: 488 VGFGPNVC 495
           +G+ P  C
Sbjct: 510 LGYAPTKC 517


>gi|356546370|ref|XP_003541599.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 434

 Score =  211 bits (538), Expect = 5e-52,   Method: Compositional matrix adjust.
 Identities = 146/433 (33%), Positives = 209/433 (48%), Gaps = 46/433 (10%)

Query: 79  WNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEV 138
           +++E++HRD   S         + R  ++ H    R V R     +        AAK   
Sbjct: 29  FSVEMIHRDSSRSPFFRPTETQFQRVANAVH----RSVNRANHFHK-----AHKAAK--- 76

Query: 139 QDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVF 198
                   + + Q  GEY +   VG PP   Y +ID+GSD++W+QC+PC +CY Q+  +F
Sbjct: 77  --------ATITQNDGEYLISYSVGIPPFQLYGIIDTGSDMIWLQCKPCEKCYNQTTRIF 128

Query: 199 DPADSASFSGVSCSSAVCDRLENAGCHAGR---CRYEVSYGDGSYTKGTLALETLTIGRT 255
           DP+ S ++  +  SS  C  +E+  C +     C Y + YGDGSY++G L++ETLT+G T
Sbjct: 129 DPSKSNTYKILPFSSTTCQSVEDTSCSSDNRKMCEYTIYYGDGSYSQGDLSVETLTLGST 188

Query: 256 -----VVKNVAIGCGHKNQGMFVG-AAGLLGLGGGSMSLVGQL---GGQTGGAFSYCLVS 306
                  +   IGCG  N   F G ++G++GLG G +SL+ QL       G  FSYCL S
Sbjct: 189 NGSSVKFRRTVIGCGRNNTVSFEGKSSGIVGLGNGPVSLINQLRRRSSSIGRKFSYCLAS 248

Query: 307 RGTGSSGSLVFGREALPVGAAWV--PLV-RNPRAPSFYYVGLSGLGVGGMRIPISEDLFR 363
               SS  L FG  A+  G   V  P+V  +P+   FYY+ L    VG  RI  +   FR
Sbjct: 249 MSNISS-KLNFGDAAVVSGDGTVSTPIVTHDPKV--FYYLTLEAFSVGNNRIEFTSSSFR 305

Query: 364 LTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASG-VSIFDTCYNLSGFVS 422
             + G+  +++D+GT +T LP   Y     A VA    L R    +     CY  S F  
Sbjct: 306 FGEKGN--IIIDSGTTLTLLPNDIYSKLESA-VADLVELDRVKDPLKQLSLCYR-STFDE 361

Query: 423 VRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFD 482
           +  P +  +FSG  V  L A N  I V+  G  C AF  S  G  I GN+ Q+   + +D
Sbjct: 362 LNAPVIMAHFSGADV-KLNAVNTFIEVEQ-GVTCLAFISSKIG-PIFGNMAQQNFLVGYD 418

Query: 483 GANGFVGFGPNVC 495
                V F P  C
Sbjct: 419 LQKKIVSFKPTDC 431


>gi|15231625|ref|NP_191467.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|15983376|gb|AAL11556.1|AF424562_1 AT3g59080/F17J16_130 [Arabidopsis thaliana]
 gi|7529751|emb|CAB86936.1| putative protein [Arabidopsis thaliana]
 gi|20466704|gb|AAM20669.1| putative protein [Arabidopsis thaliana]
 gi|23198236|gb|AAN15645.1| putative protein [Arabidopsis thaliana]
 gi|332646352|gb|AEE79873.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 535

 Score =  211 bits (538), Expect = 5e-52,   Method: Compositional matrix adjust.
 Identities = 140/439 (31%), Positives = 220/439 (50%), Gaps = 31/439 (7%)

Query: 81  LELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQD 140
           LEL  RD ++        +    +Q++   + +++ K V T     +   A + + +   
Sbjct: 101 LELQIRD-LTRIQTLHKRVLEKNNQNTVSQKQKKNDKEVVT-----TTPVASSVEEQAGQ 154

Query: 141 FGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDP 200
               + SGM  GSGEYF+ + VGSPP+   +++D+GSD+ W+QC PC  C++Q+   +DP
Sbjct: 155 LVATLESGMTLGSGEYFMDVLVGSPPKHFSLILDTGSDLNWIQCLPCYDCFQQNGAFYDP 214

Query: 201 ADSASFSGVSCSSAVCDRLENAG----CHAGR--CRYEVSYGDGSYTKGTLALETLTIGR 254
             SAS+  ++C+   C+ + +      C +    C Y   YGD S T G  A+ET T+  
Sbjct: 215 KASASYKNITCNDQRCNLVSSPDPPMPCKSDNQSCPYYYWYGDSSNTTGDFAVETFTVNL 274

Query: 255 TV---------VKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLV 305
           T          V+N+  GCGH N+G+F GAAGLLGLG G +S   QL    G +FSYCLV
Sbjct: 275 TTNGGSSELYNVENMMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLV 334

Query: 306 SRG--TGSSGSLVFGREALPVGAAWVPLV-----RNPRAPSFYYVGLSGLGVGGMRIPIS 358
            R   T  S  L+FG +   +    +        +     +FYYV +  + V G  + I 
Sbjct: 335 DRNSDTNVSSKLIFGEDKDLLSHPNLNFTSFVAGKENLVDTFYYVQIKSILVAGEVLNIP 394

Query: 359 EDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQ-TGNLPRASGVSIFDTCYNL 417
           E+ + ++  G  G ++D+GT ++    PAYE  ++    +  G  P      I D C+N+
Sbjct: 395 EETWNISSDGAGGTIIDSGTTLSYFAEPAYEFIKNKIAEKAKGKYPVYRDFPILDPCFNV 454

Query: 418 SGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSP-SGLSIIGNIQQEG 476
           SG  +V++P +   F+ G V   P  N  I +++    C A   +P S  SIIGN QQ+ 
Sbjct: 455 SGIHNVQLPELGIAFADGAVWNFPTENSFIWLNE-DLVCLAMLGTPKSAFSIIGNYQQQN 513

Query: 477 IQISFDGANGFVGFGPNVC 495
             I +D     +G+ P  C
Sbjct: 514 FHILYDTKRSRLGYAPTKC 532


>gi|413953792|gb|AFW86441.1| hypothetical protein ZEAMMB73_342504 [Zea mays]
          Length = 459

 Score =  211 bits (538), Expect = 5e-52,   Method: Compositional matrix adjust.
 Identities = 154/456 (33%), Positives = 222/456 (48%), Gaps = 66/456 (14%)

Query: 69  SSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRH-----------QHSFHARMQRDVK 117
           ++  S+   RW         +   SNT +    HRH           + S   R++R   
Sbjct: 41  AATCSTSRVRW---------LDEGSNTVSVPLVHRHGPCAPSTRSSDEPSLSERLRRSRA 91

Query: 118 RVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGS 177
           R   ++ R S        H           G    S EY V +G+G+P  SQ ++ID+GS
Sbjct: 92  RSKYIMSRASKSNVSIPTHL----------GGSVDSLEYVVTVGLGTPAVSQVLLIDTGS 141

Query: 178 DIVWVQCQPC--SQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAG----CHAG---- 227
           D+ WVQC PC  + CY Q DP+FDP+ S++++ + C++  C  L   G    C +G    
Sbjct: 142 DLSWVQCAPCNSTTCYPQKDPLFDPSRSSTYAPIPCNTDACRDLTRDGYGSDCTSGSGGG 201

Query: 228 -RCRYEVSYGDGSYTKGTLALETLTIGRTV-VKNVAIGCGHKNQGMFVGAAGLLGLGGGS 285
            +C Y ++YGDGS T G  + ETLT+   V VK+   GCGH   G      GLLGLGG  
Sbjct: 202 AQCGYAITYGDGSQTTGVYSNETLTMAPGVTVKDFHFGCGHDQDGPNDKYDGLLGLGGAP 261

Query: 286 MSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPV----GAAWVPLVRNPRAPSFY 341
            SLV Q     GGAFSYCL +     +G L  G    PV    G  + P+VR  +  +FY
Sbjct: 262 ESLVVQTSSVYGGAFSYCLPA-ANDQAGFLALGA---PVNDASGFVFTPMVREQQ--TFY 315

Query: 342 YVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGN 401
            V ++G+ VGG  I +    F        G+++D+GT VT L   AY A + AF      
Sbjct: 316 VVNMTGITVGGEPIDVPPSAFS------GGMIIDSGTVVTELQHTAYAALQAAFRKAMAA 369

Query: 402 LPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAF-- 459
            P      + DTCYN +G  +V VP V+  FSGG  + L   + ++ +D+    C AF  
Sbjct: 370 YPLLPNGEL-DTCYNFTGHSNVTVPRVALTFSGGATVDLDVPDGIL-LDN----CLAFQE 423

Query: 460 APSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           A   +   I+GN+ Q  +++ +D  +G VGFG + C
Sbjct: 424 AGPDNQPGILGNVNQRTLEVLYDVGHGRVGFGADAC 459


>gi|413938615|gb|AFW73166.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
          Length = 386

 Score =  211 bits (538), Expect = 6e-52,   Method: Compositional matrix adjust.
 Identities = 158/398 (39%), Positives = 213/398 (53%), Gaps = 35/398 (8%)

Query: 116 VKRVATLVRRLSGG--GADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVI 173
           V+RV  L   L  G  G     H           G D G+  Y V   +G+P  +Q M +
Sbjct: 6   VRRVVLLSSLLCAGALGFLPCSHAAAVATVPASWGYDIGTLNYVVTASLGTPGVAQTMEV 65

Query: 174 DSGSDIVWVQCQPCS---QCYKQSDPVFDPADSASFSGVSCSSAVCDRL---ENAGCHAG 227
           D+GSD+ WVQC+PC+    CY Q DP+FDPA S+S++ V C   VC  L     + C A 
Sbjct: 66  DTGSDLSWVQCKPCAAAPSCYSQKDPLFDPAQSSSYAAVPCGGPVCAGLGIYAASACSAA 125

Query: 228 RCRYEVSYGDGSYTKGTLALETLTI-GRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSM 286
           +C Y VSYGDGS T G  + +TLT+   + V+    GCGH   G+F G  GLLGLG    
Sbjct: 126 QCGYVVSYGDGSNTTGVYSSDTLTLSASSAVQGFFFGCGHAQSGLFNGVDGLLGLGREQP 185

Query: 287 SLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAA----WVPLVRNPRAPSFYY 342
           SLV Q  G  GG FSYCL ++ + ++G L  G    P GAA       L+ +P AP++Y 
Sbjct: 186 SLVEQTAGTYGGVFSYCLPTKPS-TAGYLTLGVGG-PSGAAPGFSTTQLLPSPNAPTYYV 243

Query: 343 VGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGN- 401
           V L+G+ VGG ++ +    F    +      +DTGT VTRLP  AY A R AF +   + 
Sbjct: 244 VMLTGISVGGQQLSVPASAFAGGTV------VDTGTVVTRLPPTAYAALRSAFRSGMASY 297

Query: 402 -LPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTF-CFAF 459
             P A    I DTCYN +G+ +V +P V+  F  G  +TL A   L       +F C AF
Sbjct: 298 GYPTAPSNGILDTCYNFAGYGTVTLPNVALTFGSGATVTLGADGIL-------SFGCLAF 350

Query: 460 APSPS--GLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           APS S  G++I+GN+QQ   ++  DG +  VGF P+ C
Sbjct: 351 APSGSDGGMAILGNVQQRSFEVRIDGTS--VGFKPSSC 386


>gi|15228044|ref|NP_181826.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|20197868|gb|AAM15292.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|20197965|gb|AAD21712.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|330255100|gb|AEC10194.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 527

 Score =  211 bits (537), Expect = 7e-52,   Method: Compositional matrix adjust.
 Identities = 140/436 (32%), Positives = 210/436 (48%), Gaps = 44/436 (10%)

Query: 103 RHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDV----------------- 145
           R  HS      +D+ R+ TL  R +       +   +   +D+                 
Sbjct: 90  RTTHSVVDLQIQDLTRIKTLHARFNKSKKQKNEKVRKKITSDISLVGAPEVSPGKLIATL 149

Query: 146 VSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSAS 205
            SGM  GSGEYF+ + VG+PP+   +++D+GSD+ W+QC PC  C+ Q+   +DP  SAS
Sbjct: 150 ESGMTLGSGEYFMDVLVGTPPKHFSLILDTGSDLNWLQCLPCYDCFHQNGMFYDPKTSAS 209

Query: 206 FSGVSCSSAVCDRLENAG----CHAGR--CRYEVSYGDGSYTKGTLALETLTIGRTV--- 256
           F  ++C+   C  + +      C +    C Y   YGD S T G  A+ET T+  T    
Sbjct: 210 FKNITCNDPRCSLISSPDPPVQCESDNQSCPYFYWYGDRSNTTGDFAVETFTVNLTTTEG 269

Query: 257 ------VKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR--G 308
                 V N+  GCGH N+G+F GA+GLLGLG G +S   QL    G +FSYCLV R   
Sbjct: 270 GSSEYKVGNMMFGCGHWNRGLFSGASGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSN 329

Query: 309 TGSSGSLVFGREALPVGAAWVPLV-----RNPRAPSFYYVGLSGLGVGGMRIPISEDLFR 363
           T  S  L+FG +   +    +        +     +FYY+ +  + VGG  + I E+ + 
Sbjct: 330 TNVSSKLIFGEDKDLLNHTNLNFTSFVNGKENSVETFYYIQIKSILVGGKALDIPEETWN 389

Query: 364 LTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTG-NLPRASGVSIFDTCYNLSGFV- 421
           ++  GD G ++D+GT ++    PAYE  ++ F  +   N P      + D C+N+SG   
Sbjct: 390 ISSDGDGGTIIDSGTTLSYFAEPAYEIIKNKFAEKMKENYPIFRDFPVLDPCFNVSGIEE 449

Query: 422 -SVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSP-SGLSIIGNIQQEGIQI 479
            ++ +P +   F  G V   PA N  I + +    C A   +P S  SIIGN QQ+   I
Sbjct: 450 NNIHLPELGIAFVDGTVWNFPAENSFIWLSE-DLVCLAILGTPKSTFSIIGNYQQQNFHI 508

Query: 480 SFDGANGFVGFGPNVC 495
            +D     +GF P  C
Sbjct: 509 LYDTKRSRLGFTPTKC 524


>gi|449454652|ref|XP_004145068.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
 gi|449470630|ref|XP_004153019.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 435

 Score =  210 bits (535), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 147/435 (33%), Positives = 210/435 (48%), Gaps = 50/435 (11%)

Query: 79  WNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEV 138
           + +EL+HRD   S        HYHR               VA  +RR       +  H  
Sbjct: 30  FTVELIHRDSPKSPMYNPLENHYHR---------------VADTLRR-------SISHNT 67

Query: 139 QDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVF 198
                 V + +    GEY +++ VG+PP     V D+GSDI+W QC+PC+ CY+Q  P+F
Sbjct: 68  GLVTNTVEAPIYNNRGEYLMKLSVGTPPFPIIAVADTGSDIIWTQCEPCTNCYQQDLPMF 127

Query: 199 DPADSASFSGVSCSSAVCDRL--ENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT- 255
           +P+ S ++  VSCSS VC     +N+      C Y +SYGD S+++G  A++TLT+G T 
Sbjct: 128 NPSKSTTYRKVSCSSPVCSFTGEDNSCSFKPDCTYSISYGDNSHSQGDFAVDTLTMGSTS 187

Query: 256 ----VVKNVAIGCGHKNQGMF-VGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGT- 309
                    AIGCGH N G F    +G++GLG G  SL+ Q+G   GG FSYCL   G  
Sbjct: 188 GRVVAFPRTAIGCGHDNAGSFDANVSGIVGLGLGPASLIKQMGSAVGGKFSYCLTPIGND 247

Query: 310 -GSSGSLVFGREALPVGAAWV--PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQ 366
            G S  L FG  A   G+  V  P+  + +  SFY + L  + VG        + F  T 
Sbjct: 248 DGGSNKLNFGSNANVSGSGAVSTPIYISDKFKSFYSLKLKAVSVG------RNNTFYSTA 301

Query: 367 M----GDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIF-DTCYNLSGFV 421
                G   +++D+GT +T LP   Y  F  A ++ + NL R    + F + C+  +   
Sbjct: 302 NSILGGKANIIIDSGTTLTLLPVDLYHNFAKA-ISNSINLQRTDDPNQFLEYCFETTT-D 359

Query: 422 SVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFA-PSPSGLSIIGNIQQEGIQIS 480
             +VP ++ +F G   L L   N LI V D    C AFA    + +SI GNI Q    + 
Sbjct: 360 DYKVPFIAMHFEGAN-LRLQRENVLIRVSD-NVICLAFAGAQDNDISIYGNIAQINFLVG 417

Query: 481 FDGANGFVGFGPNVC 495
           +D  N  + F P  C
Sbjct: 418 YDVTNMSLSFKPMNC 432


>gi|357140068|ref|XP_003571594.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 533

 Score =  210 bits (534), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 156/480 (32%), Positives = 227/480 (47%), Gaps = 38/480 (7%)

Query: 40  VNESIKGSRTDHAKMSQYNELFERHNNISSSNTSSDEARWNLELVHRDKMSSSSNTTNNM 99
             E+I+G R   AK + +    ++      +   S      LEL H     SS+ T  + 
Sbjct: 67  ARETIQGRRYAQAKQAGFLAGEDKKAAEEPAARRSRSTTAVLELKHH----SSTATVPDH 122

Query: 100 HYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVV--SGMDQGSGEYF 157
              R ++  H  +  D  R A+L  R     +     +      +V   SG+   +  Y 
Sbjct: 123 PAARERYLKHL-LAADSARAASLQLRKPKPASSTTTTQASAAAAEVPLGSGIRYQTLNYV 181

Query: 158 VRIGVGSP-PRSQYMVIDSGSDIVWVQCQPC--SQCYKQSDPVFDPADSASFSGVSCSSA 214
             I +G    ++  +++D+GSD+ WVQC+PC  S CY Q DP+FDPA S +F+ V C S 
Sbjct: 182 TTIALGGGGAKNLTVIVDTGSDLTWVQCEPCPGSSCYAQRDPLFDPAASPTFAAVPCGSP 241

Query: 215 VCDR------------LENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTV-VKNVA 261
            C                +AG    RC Y +SYGDGS+++G LA +TL +G T  +    
Sbjct: 242 ACAASLKDATGAPGSCARSAGNSEQRCYYALSYGDGSFSRGVLAQDTLGLGTTTKLDGFV 301

Query: 262 IGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFG--- 318
            GCG  N+G+F G AGL+GLG   +SLV Q   + GG FSYCL +  T S+GSL  G   
Sbjct: 302 FGCGLSNRGLFGGTAGLMGLGRTDLSLVSQTAARFGGVFSYCLPAT-TTSTGSLSLGPGP 360

Query: 319 REALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGT 378
             + P   A+  ++ +P  P FY++ ++G  V                 G   V++D+GT
Sbjct: 361 SSSFP-NMAYTRMIADPTQPPFYFINITGAAV------GGGAALTAPGFGAGNVLVDSGT 413

Query: 379 AVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVL 438
            +TRL    Y+A R  F A+    P A G SI D CY+L+G   V VP ++    GG  +
Sbjct: 414 VITRLAPSVYKAVRAEF-ARRFEYPAAPGFSILDACYDLTGRDEVNVPLLTLTLEGGAQV 472

Query: 439 TLPASNFLIPV-DDAGTFCFAFA--PSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           T+ A+  L  V  D    C A A  P      IIGN QQ   ++ +D     +GF    C
Sbjct: 473 TVDAAGMLFVVRKDGSQVCLAMASLPYEDQTPIIGNYQQRNKRVVYDTVGSRLGFADEDC 532


>gi|357143854|ref|XP_003573079.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 417

 Score =  209 bits (533), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 135/357 (37%), Positives = 189/357 (52%), Gaps = 23/357 (6%)

Query: 155 EYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSA 214
           EY + + +G+PP     + D+GSD+ W QCQPC  C+ Q  PV+DP+ S++FS V CSSA
Sbjct: 65  EYLMELAIGTPPVPFVALADTGSDLTWTQCQPCKLCFPQDTPVYDPSASSTFSPVPCSSA 124

Query: 215 VCD---RLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTV------VKNVAIGCG 265
            C    R  N    +  CRY  SY DG+Y+ G L  ETLTIG +V      V +VA GCG
Sbjct: 125 TCLPTWRSRNCSNPSSPCRYIYSYSDGAYSVGILGTETLTIGSSVPGQTVSVGSVAFGCG 184

Query: 266 HKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVF--GREALP 323
             N G  + + G +GLG G++SL+ QLG    G FSYCL      +  S  F      L 
Sbjct: 185 TDNGGDSLNSTGTVGLGRGTLSLLAQLG---VGKFSYCLTDFFNSTMDSPFFLGTLAELA 241

Query: 324 VGAAWV---PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAV 380
            G   V   PL+++P  PS Y+V L G+ +G +R+PI    F L   G+ G+++D+GT  
Sbjct: 242 PGPGTVQSTPLLQSPLNPSRYFVNLQGISLGDVRLPIPNGTFDLRADGNGGMMVDSGTTF 301

Query: 381 TRLPTPAYEAFRDAF--VAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVL 438
           T L   A   FR+    VAQ    P  +  S+   C+  S      +P +  +F+GG  +
Sbjct: 302 TIL---AKSGFREVVDRVAQLLGQPPVNASSLDSPCFP-SPDGEPFMPDLVLHFAGGADM 357

Query: 439 TLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
            L   N++   +D  +FC     SPS  S +GN QQ+ IQ+ FD   G + F P  C
Sbjct: 358 RLHRDNYMSYNEDDSSFCLNIVGSPSTWSRLGNFQQQNIQMLFDMTVGQLSFLPTDC 414


>gi|302811785|ref|XP_002987581.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
 gi|300144735|gb|EFJ11417.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
          Length = 511

 Score =  209 bits (532), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 132/391 (33%), Positives = 208/391 (53%), Gaps = 31/391 (7%)

Query: 135 KHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQS 194
           K+ V  F + VV+ + Q   EY+V + VG+P     +++D+GSD+ W+QC PC  C    
Sbjct: 119 KNTVTGFTSPVVT-LGQAGLEYYVPLQVGTPAVEVVLIMDTGSDVSWIQCVPCKDCVPAL 177

Query: 195 DPVFDPADSASFSGVSCSSAVCDR----LENAGCHAGR-CRYEVSYGDGSYTKGTLALET 249
            P F+P  S+SF  + C+S+ C      ++     +GR C + + YGDGS + G LA+ET
Sbjct: 178 RPPFNPRHSSSFFKLPCASSTCTNVYQGVKPFCSPSGRTCLFSIQYGDGSLSSGLLAMET 237

Query: 250 LT-------IGRTV-VKNVAIGCGH-KNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAF 300
           +         G  V + N+ +GC     +G+  GA+GLLG+    +S   QL  +    F
Sbjct: 238 IAGNTPNFGDGEPVKLSNITLGCADIDREGLPTGASGLLGMDRRPISFPSQLSSRYARKF 297

Query: 301 SYCLVSR--GTGSSGSLVFGR-EALPVGAAWVPLVRNPRAPS----FYYVGLSGLGVGGM 353
           S+C   +     SSG + FG  + +     + PLV+NP  PS    +YYVGL G+ V   
Sbjct: 298 SHCFPDKIAHLNSSGLVFFGESDIISPYLRYTPLVQNPAVPSASLDYYYVGLVGISVDES 357

Query: 354 RIPISEDLFRLTQM-GDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFD 412
           R+P+S   F + ++ G  G ++D+GTA T L  PA++A R  F+A+T +L +    S F 
Sbjct: 358 RLPLSHKNFDIDKVTGSGGTIIDSGTAFTYLKKPAFQAMRREFLARTSHLAKVDDNSGFT 417

Query: 413 TCYNLS----GFVSVRVPTVSFYFSGGPVLTLPASNFLIPV---DDAGTFCFAFAPSPS- 464
            CYN++       S  +P+++ +F GG  + LP ++ LIPV   ++  T C AF  S   
Sbjct: 418 PCYNITSGTAALESTILPSITLHFRGGLDVVLPKNSILIPVSSSEEQTTLCLAFLMSGDI 477

Query: 465 GLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
             +IIGN QQ+ + + +D     +G  P  C
Sbjct: 478 PFNIIGNYQQQNLWVEYDLEKLRLGIAPAQC 508


>gi|414877221|tpg|DAA54352.1| TPA: hypothetical protein ZEAMMB73_214154 [Zea mays]
          Length = 447

 Score =  209 bits (531), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 136/363 (37%), Positives = 192/363 (52%), Gaps = 20/363 (5%)

Query: 149 MDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSG 208
           +  G  EY + + +G+PP     + D+GSD+ W QCQPC  C+ Q  P++D A S+SFS 
Sbjct: 86  LRSGQAEYLMELAIGTPPVPFVALADTGSDLTWTQCQPCKLCFPQDTPIYDTAVSSSFSP 145

Query: 209 VSCSSAVCDRL---ENAGCHAGRCRYEVSYGDGSYTKGTLALETLTI---GRTVVKNVAI 262
           V C+SA C  +    N    +  CRY  +YGDG+Y+ G L  ETLT        V  +A 
Sbjct: 146 VPCASATCLPIWSSRNCTASSSPCRYRYAYGDGAYSAGVLGTETLTFPGAPGVSVGGIAF 205

Query: 263 GCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGS-LVFG--- 318
           GCG  N G+   + G +GLG GS+SLV QLG    G FSYCL      S GS ++FG   
Sbjct: 206 GCGVDNGGLSYNSTGTVGLGRGSLSLVAQLG---VGKFSYCLTDFFNTSLGSPVLFGALA 262

Query: 319 REALPVGAAWV---PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMD 375
             A P   A V   PLV++P  P++YYV L G+ +G  R+PI    F L   G  G+++D
Sbjct: 263 ELAAPSTGAAVQSTPLVQSPYVPTWYYVSLEGISLGDARLPIPNGTFDLRDDGSGGMIVD 322

Query: 376 TGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCY-NLSGFVSV-RVPTVSFYFS 433
           +GT  T L   A+    D  VA     P  +  S+   C+   +G   +  +P +  +F+
Sbjct: 323 SGTTFTFLVESAFRVVVD-HVAGVLRQPVVNASSLDSPCFPAATGEQQLPAMPDMVLHFA 381

Query: 434 GGPVLTLPASNFLIPVDDAGTFCFAFAPSPSG-LSIIGNIQQEGIQISFDGANGFVGFGP 492
           GG  + L   N++    +  +FC   A SPS  +SI+GN QQ+ IQ+ FD   G + F P
Sbjct: 382 GGADMRLHRDNYMSFNQEESSFCLNIAGSPSADVSILGNFQQQNIQMLFDITVGQLSFMP 441

Query: 493 NVC 495
             C
Sbjct: 442 TDC 444


>gi|293335828|ref|NP_001170221.1| uncharacterized protein LOC100384173 precursor [Zea mays]
 gi|224034427|gb|ACN36289.1| unknown [Zea mays]
          Length = 443

 Score =  209 bits (531), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 143/415 (34%), Positives = 209/415 (50%), Gaps = 39/415 (9%)

Query: 102 HRHQHSFHARMQRDVKRVATL--VRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVR 159
           +  +      ++R   RVATL  +  L+ G A  A   +    +D         GEY + 
Sbjct: 44  YTEEQLLSRALRRSSARVATLQSLAALAPGDAITAAR-ILVLASD---------GEYLME 93

Query: 160 IGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRL 219
           +G+G+P R    ++D+GSD++W QC PC  C  Q  P FDPA SA++  + C+S  C+ L
Sbjct: 94  MGIGTPTRYYSAILDTGSDLIWTQCAPCLLCVDQPTPYFDPARSATYRSLGCASPACNAL 153

Query: 220 ENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIG----RTVVKNVAIGCGHKNQGMFVGA 275
               C+   C Y+  YGD + T G LA ET T G    R  +  ++ GCG+ N G+    
Sbjct: 154 YYPLCYQKVCVYQYFYGDSASTAGVLANETFTFGTNETRVSLPGISFGCGNLNAGLLANG 213

Query: 276 AGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREAL---------PVGA 326
           +G++G G GS+SLV QLG      FSYCL S  +     L FG  A          PV +
Sbjct: 214 SGMVGFGRGSLSLVSQLGSPR---FSYCLTSFLSPVPSRLYFGVYATLNSTNASSEPVQS 270

Query: 327 AWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQM-GDDGVVMDTGTAVTRLPT 385
              P V NP  P+ Y++ ++G+ VGG  +PI   +F +    G  G ++D+GT +T L  
Sbjct: 271 --TPFVVNPALPTMYFLNMTGISVGGYLLPIDPAVFAINDTDGTGGTIIDSGTTITYLAE 328

Query: 386 PAYEAFRDAFVAQ-TGNLPRASGVSIFDTCYNLSGFV--SVRVPTVSFYFSGGPVLTLPA 442
           PAY+A R AF +Q T  L   +  S+ DTC+        SV +P +  +F G     LP 
Sbjct: 329 PAYDAVRAAFASQITLPLLNVTDASVLDTCFQWPPPPRQSVTLPQLVLHFDGAD-WELPL 387

Query: 443 SNFLIPVDDA--GTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
            N+++ VD +  G  C A A S S  SIIG+ Q +   + +D  N  + F P  C
Sbjct: 388 QNYML-VDPSTGGGLCLAMA-SSSDGSIIGSYQHQNFNVLYDLENSLMSFVPAPC 440


>gi|449499012|ref|XP_004160696.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 435

 Score =  208 bits (530), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 147/435 (33%), Positives = 209/435 (48%), Gaps = 50/435 (11%)

Query: 79  WNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEV 138
           + +EL+HRD   S        HYHR               VA  +RR       +  H  
Sbjct: 30  FTVELIHRDSPKSPMYNPLENHYHR---------------VADTLRR-------SISHNT 67

Query: 139 QDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVF 198
                 V + +    GEY +++ VG+PP     V D+GSDI+W QC PC+ CY+Q  P+F
Sbjct: 68  GLVTNTVEAPIYNNRGEYLMKLSVGTPPFPIIAVADTGSDIIWTQCVPCTNCYQQDLPMF 127

Query: 199 DPADSASFSGVSCSSAVCDRL--ENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT- 255
           +P+ S ++  VSCSS VC     +N+      C Y +SYGD S+++G  A++TLT+G T 
Sbjct: 128 NPSKSTTYRKVSCSSPVCSFTGEDNSCSFKPDCTYSISYGDNSHSQGDFAVDTLTMGSTS 187

Query: 256 ----VVKNVAIGCGHKNQGMF-VGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGT- 309
                    AIGCGH N G F    +G++GLG G  SL+ Q+G   GG FSYCL   G  
Sbjct: 188 GRVVAFPRTAIGCGHDNAGSFDANVSGIVGLGLGPASLIKQMGSAVGGKFSYCLTPIGND 247

Query: 310 -GSSGSLVFGREALPVGAAWV--PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQ 366
            G S  L FG  A   G+  V  P+  + +  SFY + L  + VG        + F  T 
Sbjct: 248 DGGSNKLNFGSNANVSGSGAVSTPIYISDKFKSFYSLKLKAVSVG------RNNTFYSTA 301

Query: 367 M----GDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIF-DTCYNLSGFV 421
                G   +++D+GT +T LP   Y  F  A ++ + NL R    + F + C+  +   
Sbjct: 302 NSILGGKANIIIDSGTTLTLLPVDLYHNFAKA-ISNSINLQRTDDPNQFLEYCFETTT-D 359

Query: 422 SVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFA-PSPSGLSIIGNIQQEGIQIS 480
             +VP ++ +F G   L L   N LI V D    C AFA    + +SI GNI Q    + 
Sbjct: 360 DYKVPFIAMHFEGAN-LRLQRENVLIRVSD-NVICLAFAGAQDNDISIYGNIAQINFLVG 417

Query: 481 FDGANGFVGFGPNVC 495
           +D  N  + F P  C
Sbjct: 418 YDVTNMSLSFKPMNC 432


>gi|357118076|ref|XP_003560785.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 477

 Score =  208 bits (529), Expect = 6e-51,   Method: Compositional matrix adjust.
 Identities = 137/358 (38%), Positives = 195/358 (54%), Gaps = 17/358 (4%)

Query: 147 SGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS-QCYKQSDPVFDPADSAS 205
           +G +  + E+ V +G G+P ++  +++D+GSD+ W+QC+PCS  CY+Q DP FDPA S+S
Sbjct: 128 TGTNLDTLEFVVVVGFGTPAQTAAIILDTGSDLSWIQCKPCSGHCYRQHDPDFDPAKSSS 187

Query: 206 FSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGC 264
           ++ V C + VC       C+   C Y V YGDGS T G L+ +TLT   +        GC
Sbjct: 188 YAAVPCGTPVC-AAAGGMCNGTTCLYGVQYGDGSSTTGVLSRDTLTFNSSSKFTGFTFGC 246

Query: 265 GHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFG--REAL 322
           G KN G F    GLLGLG G +SL  Q     GG FSYCL S  T + G L  G  +   
Sbjct: 247 GEKNIGDFGEVDGLLGLGRGKLSLPSQAAPSFGGVFSYCLPSYNT-TPGYLNIGATKPTS 305

Query: 323 PVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTR 382
            V   +  +++ P+ PSFY++ L  + +GG  +P+   +F  T     G ++D+GT +T 
Sbjct: 306 TVPVQYTAMIKKPQYPSFYFIELVSINIGGYILPVPPSVFTKT-----GTLLDSGTILTY 360

Query: 383 LPTPAYEAFRDAF-VAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLP 441
           LP PAY + RD F     GN P A      DTCY+ +G  ++ +P VSF FS G V  L 
Sbjct: 361 LPPPAYTSLRDRFKFTMQGNKP-APPYEPLDTCYDFTGQGAIVIPAVSFNFSDGAVFDLD 419

Query: 442 ASNFLIPVDDAGTF--CFAFAPSPSGL--SIIGNIQQEGIQISFDGANGFVGFGPNVC 495
               +I  DDA     C AF   P+ +  SI+GN QQ   ++ +D  +  +GF P  C
Sbjct: 420 FYGIMIFPDDAKPLIGCLAFVSRPAAMPFSIVGNTQQRAAEVIYDVPSQKIGFIPISC 477


>gi|297828001|ref|XP_002881883.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297327722|gb|EFH58142.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 529

 Score =  207 bits (528), Expect = 7e-51,   Method: Compositional matrix adjust.
 Identities = 136/419 (32%), Positives = 211/419 (50%), Gaps = 30/419 (7%)

Query: 106 HSFHARMQRDVK-RVATLVRRLSGGGADAAKHEVQ--DFGTDVVSGMDQGSGEYFVRIGV 162
            + HAR ++  K R   + ++++   +     EV        + SGM  GSGEYF+ + V
Sbjct: 109 QTLHARFKKSKKQRNEKVKKKITSDISLVGAPEVSPGKLIATLESGMTLGSGEYFMDVLV 168

Query: 163 GSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLEN- 221
           G+PP+   +++D+GSD+ W+QC PC  C+ Q++  +DP  SASF  ++C+   C  + + 
Sbjct: 169 GTPPKHFSLILDTGSDLNWLQCLPCYDCFHQNEAFYDPKTSASFKNITCNDPRCSLISSP 228

Query: 222 ---AGCHAGR--CRYEVSYGDGSYTKGTLALETLTIGRTV---------VKNVAIGCGHK 267
                C +    C Y   YGD S T G  A+ET T+  T          V+N+  GCGH 
Sbjct: 229 EPPVQCKSDNQSCPYFYWYGDRSNTTGDFAVETFTVNLTTTEGRSSEYKVENMMFGCGHW 288

Query: 268 NQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRG--TGSSGSLVFGREALPVG 325
           N+G+F GA+GLLGLG G +S   QL    G +FSYCLV R   T  S  L+FG +   + 
Sbjct: 289 NRGLFSGASGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTNVSSKLIFGEDKDLLN 348

Query: 326 AAWVPLV-----RNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAV 380
              +        +     +FYY+ +  + VGG  + I E+ + ++  G  G ++D+GT +
Sbjct: 349 HTNLNFTSFVNGKENSVETFYYIQIKSILVGGEALDIPEETWNISPDGAGGTIIDSGTTL 408

Query: 381 TRLPTPAYEAFRDAFVAQTG-NLPRASGVSIFDTCYNLSGFV--SVRVPTVSFYFSGGPV 437
           +    PAYE  ++ F  +   N        + D C+N+SG    ++ +P +   F+ G V
Sbjct: 409 SYFAEPAYEIIKNKFAEKMKENYLVFRDFPVLDPCFNVSGIEENNIHLPELGIAFADGAV 468

Query: 438 LTLPASNFLIPVDDAGTFCFAFAPSP-SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
              PA N  I + +    C A   +P S  SIIGN QQ+   I +D     +GF P  C
Sbjct: 469 WNFPAENSFIWLSE-DLVCLAILGTPKSTFSIIGNYQQQNFHILYDTKMSRLGFTPTKC 526


>gi|168014386|ref|XP_001759733.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162689272|gb|EDQ75645.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 392

 Score =  207 bits (528), Expect = 7e-51,   Method: Compositional matrix adjust.
 Identities = 133/374 (35%), Positives = 192/374 (51%), Gaps = 23/374 (6%)

Query: 141 FGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDP 200
           F T +VSG   GSG+YFV   +G+P +  ++++D+GSD+ +VQC PC  CY+Q  P++ P
Sbjct: 19  FRTPLVSGTTLGSGQYFVDFSLGTPEQKFHLIVDTGSDLAFVQCAPCDLCYEQDGPLYQP 78

Query: 201 ADSASFSGVSCSSAVCDRLE---NAGCHA--------GRCRYEVSYGDGSYTKGTLALET 249
           ++S++F+ V C SA C  +     A C +        G C YE  YGD S T G  A ET
Sbjct: 79  SNSSTFTPVPCDSAECLLIPAPVGAPCSSSYPESPPQGACSYEYRYGDNSSTVGVFAYET 138

Query: 250 LTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGT 309
            T+G   V +VA GCG++NQG FV A G+LGLG G++S   Q G      F+YCL S  +
Sbjct: 139 ATVGGIRVNHVAFGCGNRNQGSFVSAGGVLGLGQGALSFTSQAGYAFENKFAYCLTSYLS 198

Query: 310 GSS--GSLVFGREALPV--GAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLT 365
            +S   SL+FG + +       + PLV NP  PS YYV +  +  GG  + I +  +++ 
Sbjct: 199 PTSVFSSLIFGDDMMSTIHDLQFTPLVSNPLNPSVYYVQIVRICFGGETLLIPDSAWKID 258

Query: 366 QMGDDGVVMDTGTAVTRLPTPAYEAFRDAF---VAQTGNLPRASGVSIFDTCYNLSGFVS 422
            +G+ G + D+GT VT     AY     AF   V      P   G+ +   C N+SG   
Sbjct: 259 SVGNGGTIFDSGTTVTYWSPQAYARIIAAFEKSVPYPRAPPSPQGLPL---CVNVSGIDH 315

Query: 423 VRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPS-GLSIIGNIQQEGIQISF 481
              P+ +  F  G        N+ I V      C A   S S G ++IGNI Q+   + +
Sbjct: 316 PIYPSFTIEFDQGATYRPNQGNYFIEV-SPNIDCLAMLESSSDGFNVIGNIIQQNYLVQY 374

Query: 482 DGANGFVGFGPNVC 495
           D     +GF    C
Sbjct: 375 DREEHRIGFAHANC 388


>gi|242045118|ref|XP_002460430.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
 gi|241923807|gb|EER96951.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
          Length = 488

 Score =  207 bits (528), Expect = 8e-51,   Method: Compositional matrix adjust.
 Identities = 141/402 (35%), Positives = 203/402 (50%), Gaps = 31/402 (7%)

Query: 112 MQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVS--GMDQGSGEYFVRIGVGSPPRSQ 169
           ++RD  RV  + R+++      A       G  +++  G    +  Y   + +G+P    
Sbjct: 99  LRRDQDRVDAIRRKVT------ASSNKPKGGVSLLANWGKSLSTTNYVASLRLGTPATEL 152

Query: 170 YMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLE-------NA 222
            + +D+GSD  WVQC+PC+ CY+Q DPVFDP  S+++S V C +  C  L         +
Sbjct: 153 VVELDTGSDQSWVQCKPCADCYEQRDPVFDPTASSTYSAVPCGARECQELASSSSSRNCS 212

Query: 223 GCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT-------VVKNVAIGCGHKNQGMFVGA 275
             +   C YEVSY D S+T G LA +TLT+  +        V     GCGH N G F   
Sbjct: 213 SDNNKNCPYEVSYDDDSHTVGDLARDTLTLSPSPSPSPADTVPGFVFGCGHSNAGTFGEV 272

Query: 276 AGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNP 335
            GLLGLG G  SL  Q+  + G AFSYCL S  + ++G L FG  A    A +  +V   
Sbjct: 273 DGLLGLGLGKASLPSQVAARYGAAFSYCLPSSPS-AAGYLSFGGAAARANAQFTEMVTG- 330

Query: 336 RAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAF 395
           + P+ YY+ L+G+ V G  I +    F        G ++D+GTA +RLP  AY A R +F
Sbjct: 331 QDPTSYYLNLTGIVVAGRAIKVPASAFATAA----GTIIDSGTAFSRLPPSAYAALRSSF 386

Query: 396 VAQTG--NLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAG 453
            +  G     RA    IFDTCY+ +G  +VR+P V   F+ G  + L  S  L   +D  
Sbjct: 387 RSAMGRYRYKRAPSSPIFDTCYDFTGHETVRIPAVELVFADGATVHLHPSGVLYTWNDVA 446

Query: 454 TFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
             C AF P+   L I+GN QQ  + + +D  +  +GFG   C
Sbjct: 447 QTCLAFVPN-HDLGILGNTQQRTLAVIYDVGSQRIGFGRKGC 487


>gi|357143847|ref|XP_003573077.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 420

 Score =  207 bits (528), Expect = 8e-51,   Method: Compositional matrix adjust.
 Identities = 139/398 (34%), Positives = 207/398 (52%), Gaps = 36/398 (9%)

Query: 112 MQRDVKRVATLVRRLSGGGADAAK-HEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQY 170
           M+R V R  + +R LSG  A + + H VQ               EY + + +G PP    
Sbjct: 42  MRRAVHR--SRLRALSGYDATSPRLHSVQV--------------EYLMELAIGKPPVPFV 85

Query: 171 MVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGC-HAGRC 229
            + D+GSD+ W QCQPC  C+ Q  PV+DP+ S++FS + CSSA C  + +  C  +  C
Sbjct: 86  ALADTGSDLTWTQCQPCKLCFPQDTPVYDPSASSTFSPLPCSSATCLPIWSRNCTPSSLC 145

Query: 230 RYEVSYGDGSYTKGTLALETLTIGRT----VVKNVAIGCGHKNQGMFVGAAGLLGLGGGS 285
           RY  +YGDG+Y+ G L  ETLT+G +     V  VA GCG  N G  + + G +GLG G+
Sbjct: 146 RYRYAYGDGAYSAGILGTETLTLGPSSAPVSVGGVAFGCGTDNGGDSLNSTGTVGLGRGT 205

Query: 286 MSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREA-LPVGAAWV---PLVRNPRAPSF 340
           +SL+ QLG    G FSYCL     +      + G  A L  G + V   PL+++P+ PS 
Sbjct: 206 LSLLAQLG---VGKFSYCLTDFFNSALDSPFLLGTLAELAPGPSTVQSTPLLQSPQNPSR 262

Query: 341 YYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAF--VAQ 398
           Y+V L G+ +G +R+PI    F L   G  G+++D+GT  T L   A   FR+    VA+
Sbjct: 263 YFVSLQGISLGDVRLPIPNGTFDLRGDGTGGMIVDSGTTFTIL---AESGFREVVGRVAR 319

Query: 399 TGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFA 458
               P  +  S+   C+         +P +  +F+GG  + L   N++   ++  +FC  
Sbjct: 320 VLGQPPVNASSLDAPCFPAPAGEPPYMPDLVLHFAGGADMRLYRDNYMSYNEEDSSFCLN 379

Query: 459 FA-PSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
            A  +P   S++GN QQ+ IQ+ FD   G + F P  C
Sbjct: 380 IAGTTPESTSVLGNFQQQNIQMLFDTTVGQLSFLPTDC 417


>gi|125553832|gb|EAY99437.1| hypothetical protein OsI_21406 [Oryza sativa Indica Group]
          Length = 409

 Score =  207 bits (527), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 150/425 (35%), Positives = 211/425 (49%), Gaps = 61/425 (14%)

Query: 112 MQRDVKRVATLVRRLSGGGADAA-KHEVQDFG-TDVVSGMDQGSGEYFVRIGVGSPPR-- 167
           +  D  RVA + +RL+G   D A  H+  + G T VVS +   +G      G+G  P   
Sbjct: 5   LDADQLRVAYIQKRLAGDTGDGADPHKFVEGGDTHVVSSLQVATGA-----GIGQKPHLT 59

Query: 168 --------------------SQYMVIDSGSDIVWVQCQPCSQ--CYKQSDPVFDPADSAS 205
                               SQ ++IDSGSD+ WVQCQPC    C+ Q DP+FDPA S +
Sbjct: 60  TTRLGTTATTNSAPDGTSAVSQTVIIDSGSDVPWVQCQPCPLLVCHPQRDPLFDPATSTT 119

Query: 206 FSGVSCSSAVCDRL--ENAGCHAG-RCRYEVSYGDGSYTKGTLALETLTIGR-TVVKNVA 261
           ++ V CSSA C RL     GC A  +C++ ++Y +G+   GT + + LT+G   VV+   
Sbjct: 120 YAAVPCSSAACARLGPYRRGCLANSQCQFGITYANGATATGTYSSDDLTLGPYDVVRGFL 179

Query: 262 IGCGHKNQG--MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGR 319
            GC H +QG       AG L LGGGS S V Q   Q    FSYC V   T S G ++FG 
Sbjct: 180 FGCAHADQGSTFSYDVAGTLALGGGSQSFVQQTASQYSRVFSYC-VPPSTSSFGFIMFGV 238

Query: 320 EALPVGAAWVP-LVRNP------RAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGV 372
              P  AA VP  V  P       +P+FY V L  + V G  +P+   +F  +       
Sbjct: 239 P--PQRAALVPTFVSTPLLSSSTMSPTFYRVLLRSIIVAGRPLPVPPTVFSASS------ 290

Query: 373 VMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYF 432
           V+D+ T ++R+P  AY+A R AF +       A  VSI DTCY+ SG  S+ +P+++  F
Sbjct: 291 VIDSATVISRIPPTAYQALRAAFRSAMTMYRPAPPVSILDTCYDFSGVRSITLPSIALVF 350

Query: 433 SGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGL--SIIGNIQQEGIQISFDGANGFVGF 490
            GG  + L A+  L+        C AFAP+ S      IGN+QQ  +++ +D     + F
Sbjct: 351 DGGATVNLDAAGILL------QGCLAFAPTASDRMPGFIGNVQQRTLEVVYDVPGKAIRF 404

Query: 491 GPNVC 495
               C
Sbjct: 405 RSAAC 409


>gi|326503794|dbj|BAK02683.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 456

 Score =  207 bits (527), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 144/392 (36%), Positives = 201/392 (51%), Gaps = 32/392 (8%)

Query: 112 MQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYM 171
           ++ D  R   + R+LSG   D  +       T + S +D  + EY + +G+GSP  +Q M
Sbjct: 89  LEHDQLRAKYIQRKLSG--TDGLQPLDLTVPTTLGSALD--TMEYVITVGIGSPAVTQTM 144

Query: 172 VIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLEN--AGCHAGRC 229
           +ID+GSD+ WV+C            +FDP+ S +++  SCSSA C +L N   GC    C
Sbjct: 145 MIDTGSDVSWVRCNSTDGL-----TLFDPSKSTTYAPFSCSSAACAQLGNNGDGCSNSGC 199

Query: 230 RYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQGMFVGAA--GLLGLGGGSM 286
           +Y V YGDGS T GT + +TL +  +  V +   GC H  +  F G    GL+GLGG + 
Sbjct: 200 QYRVQYGDGSNTTGTYSSDTLALSASDTVTDFHFGCSHHEED-FDGEKIDGLMGLGGDAQ 258

Query: 287 SLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGR-EALPVGAAWVPLVRNPRAPSFYYVGL 345
           SLV Q     G +FSYCL      +SG L FG       G    P++R P+AP+ Y V L
Sbjct: 259 SLVSQTAATYGKSFSYCLPPTNR-TSGFLTFGAPNGTSGGFVTTPMLRWPKAPTLYGVLL 317

Query: 346 SGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNL--P 403
             + VGG  + I   +        +G VMD+GT +T LP  AY A   AF +    L   
Sbjct: 318 QDISVGGTPLGIQPSVL------SNGSVMDSGTVITWLPRRAYSALSSAFRSSMTRLRHQ 371

Query: 404 RASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSP 463
           RA+ + I DTCY+ +G V+V +P VS    GG V+ L  +  +I        C AFA + 
Sbjct: 372 RAAPLGILDTCYDFTGLVNVSIPAVSLVLDGGAVVDLDGNGIMI------QDCLAFAAT- 424

Query: 464 SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           SG SIIGN+QQ   ++  D   G  GF    C
Sbjct: 425 SGDSIIGNVQQRTFEVLHDVGQGVFGFRSGAC 456


>gi|302822373|ref|XP_002992845.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
 gi|300139393|gb|EFJ06135.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
          Length = 510

 Score =  207 bits (526), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 130/391 (33%), Positives = 208/391 (53%), Gaps = 31/391 (7%)

Query: 135 KHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQS 194
           K+ +  F + VV+ + Q   EY+V + +G+P     +++D+GSD+ W+QC PC  C    
Sbjct: 118 KNALTGFTSPVVT-LGQAGLEYYVPLQLGTPAVEVVLIMDTGSDVSWIQCVPCKDCVPAL 176

Query: 195 DPVFDPADSASFSGVSCSSAVCDR----LENAGCHAGR-CRYEVSYGDGSYTKGTLALET 249
            P F+P  S+SF  + C+S+ C      ++     +GR C + + YGDGS + G LA+ET
Sbjct: 177 RPPFNPRHSSSFFKLPCASSTCTNVYQGVKPFCSPSGRTCLFSIQYGDGSLSSGLLAMET 236

Query: 250 LT-------IGRTV-VKNVAIGCGH-KNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAF 300
           +         G  V + N+ +GC     +G+  GA+GLLG+    +S   QL  +    F
Sbjct: 237 IAGNTPNFGDGEPVKLSNITLGCADIDREGLPTGASGLLGMDRRPISFPSQLSSRYARKF 296

Query: 301 SYCLVSR--GTGSSGSLVFGR-EALPVGAAWVPLVRNPRAPS----FYYVGLSGLGVGGM 353
           S+C   +     SSG + FG  + +     + PLV+NP  PS    +YYVGL G+ V   
Sbjct: 297 SHCFPDKIAHLNSSGLVFFGESDIISPYLRYTPLVQNPAVPSASLDYYYVGLVGISVDES 356

Query: 354 RIPISEDLFRLTQM-GDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFD 412
           R+P+S   F + ++ G  G ++D+GTA T L  PA++A R  F+A+T +L +    S F 
Sbjct: 357 RLPLSHKNFDIDKVTGSGGTIIDSGTAFTYLKKPAFQAMRREFLARTSHLAKVDDNSGFT 416

Query: 413 TCYNLS----GFVSVRVPTVSFYFSGGPVLTLPASNFLIPV---DDAGTFCFAFAPSPS- 464
            CYN++       S  +P+++ +F GG  + LP ++ LIPV   ++  T C AF  S   
Sbjct: 417 PCYNITSGTAALESTILPSITLHFRGGLDVVLPKNSILIPVSSSEEQTTLCLAFQMSGDI 476

Query: 465 GLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
             +IIGN QQ+ + + +D     +G  P  C
Sbjct: 477 PFNIIGNYQQQNLWVEYDLEKLRLGIAPAQC 507


>gi|414589630|tpg|DAA40201.1| TPA: hypothetical protein ZEAMMB73_629620 [Zea mays]
          Length = 443

 Score =  207 bits (526), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 143/415 (34%), Positives = 208/415 (50%), Gaps = 39/415 (9%)

Query: 102 HRHQHSFHARMQRDVKRVATL--VRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVR 159
           +  +      ++R   RVATL  +  L+ G A  A   +    +D         GEY + 
Sbjct: 44  YTEEQLLSRALRRSSARVATLQSLAALAPGDAITAAR-ILVLASD---------GEYLME 93

Query: 160 IGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRL 219
           +G+G+P R    ++D+GSD++W QC PC  C  Q  P FDPA SA++  + C+S  C+ L
Sbjct: 94  MGIGTPTRYYSAILDTGSDLIWTQCAPCLLCVDQPTPYFDPARSATYRSLGCASPACNAL 153

Query: 220 ENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIG----RTVVKNVAIGCGHKNQGMFVGA 275
               C+   C Y+  YGD + T G LA ET T G    R  +  ++ GCG+ N G     
Sbjct: 154 YYPLCYQKVCVYQYFYGDSASTAGVLANETFTFGTNETRVSLPGISFGCGNLNAGSLANG 213

Query: 276 AGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREAL---------PVGA 326
           +G++G G GS+SLV QLG      FSYCL S  +     L FG  A          PV +
Sbjct: 214 SGMVGFGRGSLSLVSQLGSPR---FSYCLTSFLSPVPSRLYFGVYATLNSTNASSEPVQS 270

Query: 327 AWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQM-GDDGVVMDTGTAVTRLPT 385
              P V NP  P+ Y++ ++G+ VGG  +PI   +F +    G  G ++D+GT +T L  
Sbjct: 271 --TPFVVNPALPTMYFLNMTGISVGGYLLPIDPAVFAINDTDGTGGTIIDSGTTITYLAE 328

Query: 386 PAYEAFRDAFVAQ-TGNLPRASGVSIFDTCYNLSGFV--SVRVPTVSFYFSGGPVLTLPA 442
           PAY+A R AF +Q T  L   +  S+ DTC+        SV +P +  +F G     LP 
Sbjct: 329 PAYDAVRAAFASQITLPLLNVTDASVLDTCFQWPPPPRQSVTLPQLVLHFDGAD-WELPL 387

Query: 443 SNFLIPVDDA--GTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
            N+++ VD +  G  C A A S S  SIIG+ Q +   + +D  N  + F P  C
Sbjct: 388 QNYML-VDPSTGGGLCLAMA-SSSDGSIIGSYQHQNFNVLYDLENSLMSFVPAPC 440


>gi|225427554|ref|XP_002266533.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
           vinifera]
          Length = 447

 Score =  206 bits (523), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 153/451 (33%), Positives = 226/451 (50%), Gaps = 38/451 (8%)

Query: 60  LFERHNNISSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRV 119
           +F  H +  S   +S++  ++ +L+ RD   S     +   + R Q +FH    R + R 
Sbjct: 16  IFFIHFSGLSHTEASNKGGFSTDLISRDSPLSPFYNPSETQFDRLQKAFH----RSISRA 71

Query: 120 ATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDI 179
                R +G   ++ +  V              +GEY + I +G+PP S + + D+GSD+
Sbjct: 72  NHF--RANGVSTNSIQSPVI-----------SNNGEYLMNISLGTPPVSMHGIADTGSDL 118

Query: 180 VWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRL-ENAGC-HAGRCRYEVSYGD 237
           +W QC+PC  CY+Q +P+FDPA S ++  +SC    C  L    GC     C Y  SYGD
Sbjct: 119 LWRQCKPCDSCYEQIEPIFDPAKSKTYQILSCEGKSCSNLGGQGGCSDDNTCIYSYSYGD 178

Query: 238 GSYTKGTLALETLTIGRTV-----VKNVAIGCGHKNQGMF-VGAAGLLGLGGGSMSLVGQ 291
           GS+T G LA++TLTIG T      V  V  GCGH N G F +  +GL+GLGGG +S++ Q
Sbjct: 179 GSHTSGDLAVDTLTIGSTTGRPVSVPKVVFGCGHNNGGTFELHGSGLVGLGGGPLSMISQ 238

Query: 292 LGGQTGGAFSYCLVSRGTGSSGS--LVFGREALPVGAAWVPLVRNPRAP-SFYYVGLSGL 348
           L    GG FSYCLV  G   S S  + FG   +  GA  V      R P +FYY+ L  +
Sbjct: 239 LRPLIGGRFSYCLVPLGNDPSVSSKMHFGSRGIVSGAGAVSTPLASRQPDTFYYLTLESM 298

Query: 349 GVGGMRIP---ISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRA 405
            VG  ++     S+    L    +  +++D+GT +T LP   Y       V+  G  P  
Sbjct: 299 SVGSKKLAYKGFSKVGSPLADADEGNIIIDSGTTLTLLPQDFYGTLESNVVSAIGGKPVR 358

Query: 406 SGVSIFDTCY-NLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPS 464
              ++F  CY NLSG   +R+PT++ +F G  +   P + F+   +D   FCFA  P  S
Sbjct: 359 DPNNVFSLCYSNLSG---LRIPTITAHFVGADLELKPLNTFVQVQEDL--FCFAMIPV-S 412

Query: 465 GLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
            L+I GN+ Q    + +D  +  V F P  C
Sbjct: 413 DLAIFGNLAQMNFLVGYDLKSRTVSFKPTDC 443


>gi|357143850|ref|XP_003573078.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 431

 Score =  206 bits (523), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 141/401 (35%), Positives = 204/401 (50%), Gaps = 37/401 (9%)

Query: 112 MQRDVKRVATLVRRLSGGGADAAK-HEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQY 170
           M+R   R  + +R LSG  A++ + H VQ               EY + + +G+PP    
Sbjct: 48  MRRAAHR--SRLRALSGYDANSPRLHSVQV--------------EYLMELAIGTPPVPFV 91

Query: 171 MVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVC---DRLENAGCHAG 227
            + D+GSD+ W QCQPC  C+ Q  PV+DP+ S++FS V CSSA C    R  N    + 
Sbjct: 92  ALADTGSDLTWTQCQPCKLCFPQDTPVYDPSASSTFSPVPCSSATCLPVLRSRNCSTPSS 151

Query: 228 RCRYEVSYGDGSYTKGTLALETLTIGRTV------VKNVAIGCGHKNQGMFVGAAGLLGL 281
            CRY  SY DG+Y+ G L  ETLT+G +V      V +VA GCG  N G  + + G +GL
Sbjct: 152 LCRYGYSYSDGAYSAGILGTETLTLGSSVPGQAVSVSDVAFGCGTDNGGDSLNSTGTVGL 211

Query: 282 GGGSMSLVGQLGGQTGGAFSYCLVS--RGTGSSGSLVFGREALPVGAAWV---PLVRNPR 336
           G G++SL+ QLG    G FSYCL      T  S  L+     L  G   V   PL+++P 
Sbjct: 212 GRGTLSLLAQLG---VGKFSYCLTDFFNSTLDSPFLLGTLAELAPGPGAVQSTPLLQSPL 268

Query: 337 APSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFV 396
            PS Y V L G+ +G +R+PI    F L      G+V+D+GT  + LP   +    D  V
Sbjct: 269 NPSRYVVSLQGITLGDVRLPIPNKTFDLHANSTGGMVVDSGTTFSILPESGFRVVVD-HV 327

Query: 397 AQTGNLPRASGVSIFDTCYNL-SGFVSVR-VPTVSFYFSGGPVLTLPASNFLIPVDDAGT 454
           AQ    P  +  S+   C+   +G   +  +P +  +F+GG  + L   N++    +  +
Sbjct: 328 AQVLGQPPVNASSLDSPCFPAPAGERQLPFMPDLVLHFAGGADMRLHRDNYMSYNQEDSS 387

Query: 455 FCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           FC     + S  S++GN QQ+ IQ+ FD   G + F P  C
Sbjct: 388 FCLNIVGTTSTWSMLGNFQQQNIQMLFDMTVGQLSFLPTDC 428


>gi|45735845|dbj|BAD12880.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
 gi|45735971|dbj|BAD13000.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
          Length = 333

 Score =  205 bits (522), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 129/344 (37%), Positives = 189/344 (54%), Gaps = 20/344 (5%)

Query: 160 IGVGSPPRSQYMVIDSGSDIVWVQCQPC-SQCYKQSDPVFDPADSASFSGVSCSSAVCDR 218
           +G+G+P     MV+D+GS + W+QC PC   C++QS PVF+P  S++++ V CS+  C  
Sbjct: 1   MGLGTPATQYVMVVDTGSSLTWLQCSPCLVSCHRQSGPVFNPKSSSTYASVGCSAQQCSD 60

Query: 219 LENA-----GCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMF 272
           L +A      C +   C Y+ SYGD S++ G L+ +T++ G T + N   GCG  N+G+F
Sbjct: 61  LPSATLNPSACSSSNVCIYQASYGDSSFSVGYLSKDTVSFGSTSLPNFYYGCGQDNEGLF 120

Query: 273 VGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLV 332
             +AGL+GL    +SL+ QL    G +F+YCL S  +    SL       P   ++ P+V
Sbjct: 121 GRSAGLIGLARNKLSLLYQLAPSLGYSFTYCLPSSSSSGYLSLGSYN---PGQYSYTPMV 177

Query: 333 RNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFR 392
            +    S Y++ LSG+ V G   P+S      + +     ++D+GT +TRLPT  Y A  
Sbjct: 178 SSSLDDSLYFIKLSGMTVAGN--PLSVSSSAYSSL---PTIIDSGTVITRLPTSVYSALS 232

Query: 393 DAFVAQTGNLPRASGVSIFDTCYNLSGFVS-VRVPTVSFYFSGGPVLTLPASNFLIPVDD 451
            A  A      RAS  SI DTC+   G  S V  P V+  F+GG  L L A N L+ VDD
Sbjct: 233 KAVAAAMKGTSRASAYSILDTCFK--GQASRVSAPAVTMSFAGGAALKLSAQNLLVDVDD 290

Query: 452 AGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           + T C AFAP+ S  +IIGN QQ+   + +D  +  +GF    C
Sbjct: 291 STT-CLAFAPARSA-AIIGNTQQQTFSVVYDVKSSRIGFAAGGC 332


>gi|125560845|gb|EAZ06293.1| hypothetical protein OsI_28528 [Oryza sativa Indica Group]
          Length = 525

 Score =  205 bits (522), Expect = 4e-50,   Method: Compositional matrix adjust.
 Identities = 126/353 (35%), Positives = 187/353 (52%), Gaps = 37/353 (10%)

Query: 171 MVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGC------ 224
           +++D+GSD+ WVQC+PCS CY Q DP+FDP+ SAS++ V C+++ C+    A        
Sbjct: 179 VIVDTGSDLTWVQCKPCSVCYAQRDPLFDPSGSASYAAVPCNASACEASLKAATGVPGSC 238

Query: 225 ----------HAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVG 274
                      + RC Y ++YGDGS+++G LA +T+ +G   V     GCG  N+G+F G
Sbjct: 239 ATVGGGGGGGKSERCYYSLAYGDGSFSRGVLATDTVALGGASVDGFVFGCGLSNRGLFGG 298

Query: 275 AAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTG-SSGSLVFG------REALPVGAA 327
            AGL+GLG   +SLV Q   + GG FSYCL +  +G ++GSL  G      R A PV  +
Sbjct: 299 TAGLMGLGRTELSLVSQTAPRFGGVFSYCLPAATSGDAAGSLSLGGDTSSYRNATPV--S 356

Query: 328 WVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPA 387
           +  ++ +P  P FY++ ++G  VGG  +           +G   V++D+GT +TRL    
Sbjct: 357 YTRMIADPAQPPFYFMNVTGASVGGAAV-------AAAGLGAANVLLDSGTVITRLAPSV 409

Query: 388 YEAFRDAFVAQTG--NLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNF 445
           Y A R  F  Q G    P A   S+ D CYNL+G   V+VP ++    GG  +T+ A+  
Sbjct: 410 YRAVRAEFARQFGAERYPAAPPFSLLDACYNLTGHDEVKVPLLTLRLEGGADMTVDAAGM 469

Query: 446 L-IPVDDAGTFCFAFAPS--PSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           L +   D    C A A         IIGN QQ+  ++ +D     +GF    C
Sbjct: 470 LFMARKDGSQVCLAMASLSFEDQTPIIGNYQQKNKRVVYDTVGSRLGFADEDC 522


>gi|115475621|ref|NP_001061407.1| Os08g0267300 [Oryza sativa Japonica Group]
 gi|37806402|dbj|BAC99940.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
           sativa Japonica Group]
 gi|113623376|dbj|BAF23321.1| Os08g0267300 [Oryza sativa Japonica Group]
          Length = 524

 Score =  205 bits (521), Expect = 4e-50,   Method: Compositional matrix adjust.
 Identities = 124/353 (35%), Positives = 184/353 (52%), Gaps = 37/353 (10%)

Query: 171 MVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGC------ 224
           +++D+GSD+ WVQC+PCS CY Q DP+FDP+ SAS++ V C+++ C+    A        
Sbjct: 178 VIVDTGSDLTWVQCKPCSVCYAQRDPLFDPSGSASYAAVPCNASACEASLKAATGVPGSC 237

Query: 225 ----------HAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVG 274
                      + RC Y ++YGDGS+++G LA +T+ +G   V     GCG  N+G+F G
Sbjct: 238 ATVGGGGGGGKSERCYYSLAYGDGSFSRGVLATDTVALGGASVDGFVFGCGLSNRGLFGG 297

Query: 275 AAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTG-SSGSLVFG------REALPVGAA 327
            AGL+GLG   +SLV Q   + GG FSYCL +  +G ++GSL  G      R A PV  +
Sbjct: 298 TAGLMGLGRTELSLVSQTAPRFGGVFSYCLPAATSGDAAGSLSLGGDTSSYRNATPV--S 355

Query: 328 WVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPA 387
           +  ++ +P  P FY++ ++G  V                +G   V++D+GT +TRL    
Sbjct: 356 YTRMIADPAQPPFYFMNVTGASV-------GGAAVAAAGLGAANVLLDSGTVITRLAPSV 408

Query: 388 YEAFRDAFVAQTG--NLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNF 445
           Y A R  F  Q G    P A   S+ D CYNL+G   V+VP ++    GG  +T+ A+  
Sbjct: 409 YRAVRAEFARQFGAERYPAAPPFSLLDACYNLTGHDEVKVPLLTLRLEGGADMTVDAAGM 468

Query: 446 L-IPVDDAGTFCFAFAPS--PSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           L +   D    C A A         IIGN QQ+  ++ +D     +GF    C
Sbjct: 469 LFMARKDGSQVCLAMASLSFEDQTPIIGNYQQKNKRVVYDTVGSRLGFADEDC 521


>gi|242050426|ref|XP_002462957.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
 gi|241926334|gb|EER99478.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
          Length = 452

 Score =  205 bits (521), Expect = 5e-50,   Method: Compositional matrix adjust.
 Identities = 133/367 (36%), Positives = 194/367 (52%), Gaps = 28/367 (7%)

Query: 150 DQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS-QCYKQSDPVFDPADSASFSG 208
           + G+G Y + + VG+PP +   +ID+GSD+ W QC PC+  C+ Q  P++DPA S++FS 
Sbjct: 90  ENGAGAYHMILSVGTPPLAFPAIIDTGSDLTWTQCAPCTTACFAQPTPLYDPARSSTFSK 149

Query: 209 VSCSSAVCDRLENA--GCHAGRCRYEVSYGDGSYTKGTLALETLTI--------GRTVVK 258
           + C+S +C  L +A   C+A  C Y+  Y  G +T G LA +TL I          +   
Sbjct: 150 LPCASPLCQALPSAFRACNATGCVYDYRYAVG-FTAGYLAADTLAIGDGDGDGDASSSFA 208

Query: 259 NVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFG 318
            VA GC   N G   GA+G++GLG  ++SL+ Q+G    G FSYCL S     +  ++FG
Sbjct: 209 GVAFGCSTANGGDMDGASGIVGLGRSALSLLSQIGV---GRFSYCLRSDADAGASPILFG 265

Query: 319 REALPVG--AAWVPLVRNP-----RAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDG 371
             A   G       L+RNP     RAP +YYV L+G+ VG   +P++   F  T  G  G
Sbjct: 266 ALANVTGDKVQSTALLRNPVAARRRAP-YYYVNLTGIAVGSTDLPVTSSTFGFTAAGAGG 324

Query: 372 VVMDTGTAVTRLPTPAYEAFRDAFVAQT-GNLPRASGVSI-FDTCYNLSGFVSVRVPTVS 429
           V++D+GT  T L    Y   R AF++QT G L R SG    FD C+  +G     VP + 
Sbjct: 325 VIVDSGTTFTYLAEAGYTMLRQAFLSQTAGLLTRVSGAQFDFDLCFE-AGAADTPVPRLV 383

Query: 430 FYFSGGPVLTLPASNFLIPVDDAGTF-CFAFAPSPSGLSIIGNIQQEGIQISFDGANGFV 488
           F F+GG    +P  ++   VD+ G   C    P+  G+S+IGN+ Q  + + +D      
Sbjct: 384 FRFAGGAEYAVPRQSYFDAVDEGGRVACLLVLPT-RGVSVIGNVMQMDLHVLYDLDGATF 442

Query: 489 GFGPNVC 495
            F P  C
Sbjct: 443 SFAPADC 449


>gi|297814776|ref|XP_002875271.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297321109|gb|EFH51530.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 451

 Score =  205 bits (521), Expect = 5e-50,   Method: Compositional matrix adjust.
 Identities = 138/389 (35%), Positives = 191/389 (49%), Gaps = 30/389 (7%)

Query: 135 KHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQS 194
           +  V    + VVSG   GSG+YFV + +G PP+S  ++ D+GSD+VWV+C  C  C   S
Sbjct: 62  RKPVPFVKSPVVSGASSGSGQYFVDLRIGQPPQSLLLIADTGSDLVWVKCSACRNCSHHS 121

Query: 195 DP-VFDPADSASFSGVSCSSAVCDRLENAG----CHAGR----CRYEVSYGDGSYTKGTL 245
              VF P  S++FS   C   VC  +   G    C+  R    C YE  Y DGS T G  
Sbjct: 122 PATVFFPRHSSTFSPAHCYDPVCRLVPKPGRAPRCNHTRIHSTCPYEYGYADGSLTSGLF 181

Query: 246 ALETLTI-----GRTVVKNVAIGCGHKNQGM------FVGAAGLLGLGGGSMSLVGQLGG 294
           A ET ++         +K+VA GCG +  G       F GA G++GLG G +S   QLG 
Sbjct: 182 ARETTSLKTSSGKEAKLKSVAFGCGFRISGQSVSGTSFNGANGVMGLGRGPISFASQLGR 241

Query: 295 QTGGAFSYCLV--SRGTGSSGSLVFGREALPVGAA-WVPLVRNPRAPSFYYVGLSGLGVG 351
           + G  FSYCL+  +     +  L+ G     V    + PL+ NP +P+FYYV L  + V 
Sbjct: 242 RFGNKFSYCLMDYTLSPPPTSYLIIGDGGDAVSKLFFTPLLTNPLSPTFYYVKLKSVFVN 301

Query: 352 GMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSI- 410
           G ++ I   ++ +   G+ G VMD+GT +  L  PAY     A V Q   LP A  ++  
Sbjct: 302 GAKLRIDPSIWEIDDSGNGGTVMDSGTTLAFLADPAYRLVIAA-VKQRIKLPNADELTPG 360

Query: 411 FDTCYNLSGFVSVR--VPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAF--APSPSGL 466
           FD C N+SG       +P + F FSGG V   P  N+ I  ++    C A        G 
Sbjct: 361 FDLCVNVSGVTKPEKILPRLKFEFSGGAVFVPPPRNYFIETEEQIQ-CLAIQSVDPKVGF 419

Query: 467 SIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           S+IGN+ Q+G    FD     +GF    C
Sbjct: 420 SVIGNLMQQGFLFEFDRDRSRLGFSRRGC 448


>gi|224124882|ref|XP_002329972.1| predicted protein [Populus trichocarpa]
 gi|222871994|gb|EEF09125.1| predicted protein [Populus trichocarpa]
          Length = 332

 Score =  204 bits (520), Expect = 7e-50,   Method: Compositional matrix adjust.
 Identities = 120/338 (35%), Positives = 188/338 (55%), Gaps = 20/338 (5%)

Query: 171 MVIDSGSDIVWVQCQPCS-QCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCH---- 225
           M++D+GS + W+QCQPC+  C+ Q+DP++DP+ S ++  +SC+S  C RL+ A  +    
Sbjct: 1   MILDTGSSLSWLQCQPCAVYCHAQADPLYDPSVSKTYKKLSCASVECSRLKAATLNDPLC 60

Query: 226 ---AGRCRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQGMFVGAAGLLGL 281
              +  C Y  SYGD S++ G L+ + LT+  +  +     GCG  NQG+F  AAG++GL
Sbjct: 61  ETDSNACLYTASYGDTSFSIGYLSQDLLTLTSSQTLPQFTYGCGQDNQGLFGRAAGIIGL 120

Query: 282 GGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREAL-PVGAAWVPLVRNPRAPSF 340
               +S++ QL  + G AFSYCL +  +GSSG       ++ P    + P++ + + PS 
Sbjct: 121 ARDKLSMLAQLSTKYGHAFSYCLPTANSGSSGGGFLSIGSISPTSYKFTPMLTDSKNPSL 180

Query: 341 YYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVA-QT 399
           Y++ L+ + V G  + ++  ++R+  +      +D+GT +TRLP   Y A R AFV   +
Sbjct: 181 YFLRLTAITVSGRPLDLAAAMYRVPTL------IDSGTVITRLPMSMYAALRQAFVKIMS 234

Query: 400 GNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAF 459
               +A   SI DTC+  S      VP +   F GG  LTL A + LI  D  G  C AF
Sbjct: 235 TKYAKAPAYSILDTCFKGSLKSISAVPEIKMIFQGGADLTLRAPSILIEADK-GITCLAF 293

Query: 460 APS--PSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           A S   + ++IIGN QQ+   I++D +   +GF P  C
Sbjct: 294 AGSSGTNQIAIIGNRQQQTYNIAYDVSTSRIGFAPGSC 331


>gi|125558622|gb|EAZ04158.1| hypothetical protein OsI_26300 [Oryza sativa Indica Group]
          Length = 435

 Score =  204 bits (519), Expect = 8e-50,   Method: Compositional matrix adjust.
 Identities = 130/359 (36%), Positives = 192/359 (53%), Gaps = 17/359 (4%)

Query: 149 MDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSG 208
           ++ G G Y + I VG+P  +  +V D+GSD++W QC PC++C++Q  P F PA S++FS 
Sbjct: 79  LENGVGGYNMNISVGTPLLTFPVVADTGSDLIWTQCAPCTKCFQQPAPPFQPASSSTFSK 138

Query: 209 VSCSSAVCDRLENA--GCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGH 266
           + C+S+ C  L N+   C+A  C Y   YG G YT G LA ETL +G     +VA GC  
Sbjct: 139 LPCTSSFCQFLPNSIRTCNATGCVYNYKYGSG-YTAGYLATETLKVGDASFPSVAFGCST 197

Query: 267 KNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREA-LPVG 325
           +N G+    +G+ GLG G++SL+ QLG    G FSYCL S     +  ++FG  A L  G
Sbjct: 198 EN-GVGNSTSGIAGLGRGALSLIPQLG---VGRFSYCLRSGSAAGASPILFGSLANLTDG 253

Query: 326 AAW-VPLVRNPRA-PSFYYVGLSGLGVGGMRIPISEDLFRLTQMG-DDGVVMDTGTAVTR 382
                P V NP   PS+YYV L+G+ VG   +P++   F  TQ G   G ++D+GT +T 
Sbjct: 254 NVQSTPFVNNPAVHPSYYYVNLTGITVGETDLPVTTSTFGFTQNGLGGGTIVDSGTTLTY 313

Query: 383 LPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLS-GFVSVRVPTVSFYFSGGPVLTLP 441
           L    YE  + AF++QT N+   +G    D C+  + G   + VP++   F GG    +P
Sbjct: 314 LAKDGYEMVKQAFLSQTANVTTVNGTRGLDLCFKSTGGGGGIAVPSLVLRFDGGAEYAVP 373

Query: 442 ASNFLIPVDDAGTF---CFAFAPSP--SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
                +  D  G+    C    P+     +S+IGN+ Q  + + +D   G   F P  C
Sbjct: 374 TYFAGVETDSQGSVTVACLMMLPAKGDQPMSVIGNVMQMDMHLLYDLDGGIFSFSPADC 432


>gi|255566835|ref|XP_002524401.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
 gi|223536362|gb|EEF38012.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
          Length = 455

 Score =  204 bits (519), Expect = 8e-50,   Method: Compositional matrix adjust.
 Identities = 122/380 (32%), Positives = 195/380 (51%), Gaps = 29/380 (7%)

Query: 139 QDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQC-YKQSDPV 197
             F + V+SG   GSG+YFV + +G+PP++  +V D+GSD++WV+C PC  C ++     
Sbjct: 69  NSFRSPVISGASSGSGQYFVSLRIGTPPQTLLLVADTGSDLIWVKCSPCRNCSHRSPGSA 128

Query: 198 FDPADSASFSGVSCSSAVCDRLENA---GCHAGR----CRYEVSYGDGSYTKGTLALETL 250
           F    S ++S + C S  C  + +     C+  R    CRY+ +Y D S T G  + E L
Sbjct: 129 FFARHSTTYSAIHCYSPQCQLVPHPHPNPCNRTRLHSPCRYQYTYADSSTTTGFFSKEAL 188

Query: 251 TIGRTVVK-----NVAIGCGHKNQG------MFVGAAGLLGLGGGSMSLVGQLGGQTGGA 299
           T+  +  K      ++ GCG +  G       F GA G++GLG   +S   QLG + G  
Sbjct: 189 TLNTSTGKVKKLNGLSFGCGFRISGPSLTGASFEGAQGVMGLGRAPISFSSQLGRRFGSK 248

Query: 300 FSYCLVS---RGTGSSGSLVFGREALPVGA----AWVPLVRNPRAPSFYYVGLSGLGVGG 352
           FSYCL+        +S   + G + + V      ++ PL+ NP +P+FYY+ + G+ V G
Sbjct: 249 FSYCLMDYTLSPPPTSFLTIGGAQNVAVSKKGIMSFTPLLINPLSPTFYYIAIKGVYVNG 308

Query: 353 MRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFD 412
           +++PI+  ++ +  +G+ G ++D+GT +T +  PAY     AF  +      A     FD
Sbjct: 309 VKLPINPSVWSIDDLGNGGTIIDSGTTLTFITEPAYTEILKAFKKRVKLPSPAEPTPGFD 368

Query: 413 TCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAP--SPSGLSIIG 470
            C N+SG     +P +SF  +GG V + P  N+ I   D    C A  P     G S++G
Sbjct: 369 LCMNVSGVTRPALPRMSFNLAGGSVFSPPPRNYFIETGDQ-IKCLAVQPVSQDGGFSVLG 427

Query: 471 NIQQEGIQISFDGANGFVGF 490
           N+ Q+G  + FD     +GF
Sbjct: 428 NLMQQGFLLEFDRDKSRLGF 447


>gi|115466068|ref|NP_001056633.1| Os06g0119600 [Oryza sativa Japonica Group]
 gi|55296446|dbj|BAD68569.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|55296924|dbj|BAD68375.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|113594673|dbj|BAF18547.1| Os06g0119600 [Oryza sativa Japonica Group]
 gi|215694767|dbj|BAG89958.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215737752|dbj|BAG96882.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 495

 Score =  204 bits (519), Expect = 8e-50,   Method: Compositional matrix adjust.
 Identities = 146/458 (31%), Positives = 225/458 (49%), Gaps = 53/458 (11%)

Query: 59  ELFERHNNISSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKR 118
           + F   + ++  +  S  + W   L H     S + ++ N        S    +  D +R
Sbjct: 44  KTFCSGHKVAPGDVPSPNSTWA-PLHHLYGPCSPAPSSANSTAADVAASMADMVDDDQRR 102

Query: 119 VATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPR----------- 167
              + +RL+G     A  + Q       +   + +G+Y    G+GS P            
Sbjct: 103 ADYIQKRLTG-----ATDDKQPMAFSSRTSQYEKNGQYATNGGLGSVPHLKSLSTTATTN 157

Query: 168 ---------SQYMVIDSGSDIVWVQCQPCS--QCYKQSDPVFDPADSASFSGVSCSSAVC 216
                    +Q ++IDSGSD+ WVQC+PC    C++Q DP+FDPA S +++ V C+SA C
Sbjct: 158 SAPDGTSAVTQTVIIDSGSDVSWVQCKPCPLPMCHRQRDPLFDPAMSTTYAAVPCTSAAC 217

Query: 217 DRL--ENAGCHA-GRCRYEVSYGDGSYTKGTLALETLTIGR-TVVKNVAIGCGHKNQG-- 270
            +L     GC A  +C++ ++YGDGS   GT + + LT+G   V++    GC H ++G  
Sbjct: 218 AQLGPYRRGCSANAQCQFGINYGDGSTATGTYSFDDLTLGPYDVIRGFRFGCAHADRGSA 277

Query: 271 MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFG----REALPVGA 326
                AG L LGGGS SLV Q   + G  FSYCL    + S G LV G    R  L    
Sbjct: 278 FDYDVAGSLALGGGSQSLVQQTATRYGRVFSYCLPPTAS-SLGFLVLGVPPERAQLIPSF 336

Query: 327 AWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTP 386
              PL+ +  AP+FY V L  + V G  + +   +F  +       V+D+ T ++RLP  
Sbjct: 337 VSTPLLSSSMAPTFYRVLLRAIIVAGRPLAVPPAVFSASS------VIDSSTIISRLPPT 390

Query: 387 AYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFL 446
           AY+A R AF +       A  VSI DTCY+ +G  S+ +P+++  F GG  + L A+  L
Sbjct: 391 AYQALRAAFRSAMTMYRAAPPVSILDTCYDFTGVRSITLPSIALVFDGGATVNLDAAGIL 450

Query: 447 IPVDDAGTFCFAFAPSPSGL--SIIGNIQQEGIQISFD 482
           +     G+ C AFAP+ S      IGN+QQ+ +++ +D
Sbjct: 451 L-----GS-CLAFAPTASDRMPGFIGNVQQKTLEVVYD 482


>gi|297737850|emb|CBI27051.3| unnamed protein product [Vitis vinifera]
          Length = 256

 Score =  204 bits (518), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 110/216 (50%), Positives = 152/216 (70%), Gaps = 5/216 (2%)

Query: 135 KHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQS 194
           K   +   T +VSG  QGSGEYF R+G+GSPP+  YMV+D+GSD+ WVQC PC+ CY+Q+
Sbjct: 32  KTIAEALETPLVSGASQGSGEYFSRVGIGSPPKHVYMVVDTGSDVNWVQCAPCADCYQQA 91

Query: 195 DPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTI-G 253
           DP+F+P+ S+S++ ++C +  C  L+ + C    C YEVSYGDGSYT G  A ET+T+ G
Sbjct: 92  DPIFEPSFSSSYAPLTCETHQCKSLDVSECRNDSCLYEVSYGDGSYTVGDFATETITLDG 151

Query: 254 RTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSG 313
              + NVAIGCGH N+G+FVGAAGLLGLGGGS+S   Q+      +FSYCLV+R T S+ 
Sbjct: 152 SASLNNVAIGCGHDNEGLFVGAAGLLGLGGGSLSFPSQIN---ASSFSYCLVNRDTDSAS 208

Query: 314 SLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLG 349
           +L F    +P  +   PL+RN +  +FYY+G++G+G
Sbjct: 209 TLEF-NSPIPSHSVTAPLLRNNQLDTFYYLGMTGIG 243


>gi|14532550|gb|AAK64003.1| AT3g61820/F15G16_210 [Arabidopsis thaliana]
          Length = 362

 Score =  204 bits (518), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 119/275 (43%), Positives = 167/275 (60%), Gaps = 20/275 (7%)

Query: 81  LELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLS-GGGADAAKHEVQ 139
           + L H D +SS S+ +           F+ R+QRD  RV ++    +   G +A K   +
Sbjct: 63  VHLSHVDALSSFSDAS-------PADLFNLRLQRDSLRVKSITSLAAVSTGRNATKRTPR 115

Query: 140 D---FGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDP 196
               F   V+SG+ QGSGEYF+R+GVG+P  + YMV+D+GSD+VW+QC PC  CY Q+D 
Sbjct: 116 TAGGFSGAVISGLSQGSGEYFMRLGVGTPATNVYMVLDTGSDVVWLQCSPCKACYNQTDA 175

Query: 197 VFDPADSASFSGVSCSSAVCDRLENAG-CHAGR---CRYEVSYGDGSYTKGTLALETLTI 252
           +FDP  S +F+ V C S +C RL+++  C   R   C Y+VSYGDGS+T+G  + ETLT 
Sbjct: 176 IFDPKKSKTFATVPCGSRLCRRLDDSSECVTRRSKTCLYQVSYGDGSFTEGDFSTETLTF 235

Query: 253 GRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR----- 307
               V +V +GCGH N+G+FVGAAGLLGLG G +S   Q   +  G FSYCLV R     
Sbjct: 236 HGARVDHVPLGCGHDNEGLFVGAAGLLGLGRGGLSFPSQTKNRYNGKFSYCLVDRTSSGS 295

Query: 308 GTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYY 342
            +    ++VFG  A+P  + + PL+ NP+  +FYY
Sbjct: 296 SSKPPSTIVFGNAAVPKTSVFTPLLTNPKLDTFYY 330


>gi|115472515|ref|NP_001059856.1| Os07g0532800 [Oryza sativa Japonica Group]
 gi|50508274|dbj|BAD32123.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
 gi|113611392|dbj|BAF21770.1| Os07g0532800 [Oryza sativa Japonica Group]
          Length = 436

 Score =  203 bits (517), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 129/360 (35%), Positives = 191/360 (53%), Gaps = 18/360 (5%)

Query: 149 MDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSG 208
           ++ G G Y + I VG+P  +  +V D+GSD++W QC PC++C++Q  P F PA S++FS 
Sbjct: 79  LENGVGGYNMNISVGTPLLTFSVVADTGSDLIWTQCAPCTKCFQQPAPPFQPASSSTFSK 138

Query: 209 VSCSSAVCDRLENA--GCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGH 266
           + C+S+ C  L N+   C+A  C Y   YG G YT G LA ETL +G     +VA GC  
Sbjct: 139 LPCTSSFCQFLPNSIRTCNATGCVYNYKYGSG-YTAGYLATETLKVGDASFPSVAFGCST 197

Query: 267 KNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREA-LPVG 325
           +N G+    +G+ GLG G++SL+ QLG    G FSYCL S     +  ++FG  A L  G
Sbjct: 198 EN-GVGNSTSGIAGLGRGALSLIPQLG---VGRFSYCLRSGSAAGASPILFGSLANLTDG 253

Query: 326 AAW-VPLVRNPRA-PSFYYVGLSGLGVGGMRIPISEDLFRLTQMG-DDGVVMDTGTAVTR 382
                P V NP   PS+YYV L+G+ VG   +P++   F  TQ G   G ++D+GT +T 
Sbjct: 254 NVQSTPFVNNPAVHPSYYYVNLTGITVGETDLPVTTSTFGFTQNGLGGGTIVDSGTTLTY 313

Query: 383 LPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYN--LSGFVSVRVPTVSFYFSGGPVLTL 440
           L    YE  + AF++QT ++   +G    D C+     G   + VP++   F GG    +
Sbjct: 314 LAKDGYEMVKQAFLSQTADVTTVNGTRGLDLCFKSTGGGGGGIAVPSLVLRFDGGAEYAV 373

Query: 441 PASNFLIPVDDAGTF---CFAFAPSP--SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           P     +  D  G+    C    P+     +S+IGN+ Q  + + +D   G   F P  C
Sbjct: 374 PTYFAGVETDSQGSVTVACLMMLPAKGDQPMSVIGNVMQMDMHLLYDLDGGIFSFAPADC 433


>gi|219362525|ref|NP_001136612.1| uncharacterized protein LOC100216735 [Zea mays]
 gi|194696366|gb|ACF82267.1| unknown [Zea mays]
 gi|413953802|gb|AFW86451.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
          Length = 411

 Score =  203 bits (516), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 145/428 (33%), Positives = 217/428 (50%), Gaps = 51/428 (11%)

Query: 81  LELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQD 140
           + LVHR    + + + +         SF    +R   R + +VR   G       H    
Sbjct: 22  VPLVHRHGPCAPAPSLST-----DTRSFADIFRRSRARPSYIVR---GKKVSVPAH---- 69

Query: 141 FGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS--QCYKQSDPVF 198
            GT V+S       EY VR+  G+P   Q +VID+GSD+ W+QC+PCS  QC+ Q DP++
Sbjct: 70  LGTSVMSL------EYVVRVSFGTPAVPQVVVIDTGSDVSWLQCKPCSSGQCFPQKDPLY 123

Query: 199 DPADSASFSGVSCSSAVCDRLE----NAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIG 253
           DP+ S+++S V C+S VC +L      +GC +G+ C + +SY DG+ T G  + + LT+ 
Sbjct: 124 DPSHSSTYSAVPCASDVCKKLAADAYGSGCTSGKQCGFAISYADGTSTVGAYSQDKLTLA 183

Query: 254 R-TVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSS 312
              +V+N   GCGH    +     G+LGLG     L   LG + GG FSYCL S  +   
Sbjct: 184 PGAIVQNFYFGCGHGKHAVRGLFDGVLGLG----RLRESLGARYGGVFSYCLPSV-SSKP 238

Query: 313 GSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGV 372
           G L  G    P G  + P+   P  P+F  V L+G+ VGG ++ +    F        G+
Sbjct: 239 GFLALGAGKNPSGFVFTPMGTVPGQPTFSTVTLAGINVGGKKLDLRPSAF------SGGM 292

Query: 373 VMDTGTAVTRLPTPAYEAFRDAF---VAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVS 429
           ++D+GT +T L + AY A R AF   +     LP        DTCYNL+G+ +V VP ++
Sbjct: 293 IVDSGTVITGLQSTAYRALRSAFRKAMEAYRLLPNGD----LDTCYNLTGYKNVVVPKIA 348

Query: 430 FYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPS-PSGLS-IIGNIQQEGIQISFDGANGF 487
             F+GG  + L   N ++ V+     C AFA S P G + ++GN+ Q   ++ FD +   
Sbjct: 349 LTFTGGATINLDVPNGIL-VNG----CLAFAESGPDGSAGVLGNVNQRAFEVLFDTSTSK 403

Query: 488 VGFGPNVC 495
            GF    C
Sbjct: 404 FGFRAKAC 411


>gi|356557014|ref|XP_003546813.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 435

 Score =  202 bits (514), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 137/448 (30%), Positives = 209/448 (46%), Gaps = 48/448 (10%)

Query: 68  SSSNTSSDEAR-----WNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATL 122
           S S  SS EAR     ++++L+HRD  SS         ++    +   R+     R  + 
Sbjct: 13  SLSTLSSREAREGLRGFSVDLIHRDSPSSP--------FYNPSLTPSERIINAALRSMSR 64

Query: 123 VRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWV 182
           ++R+S    +    E         S +    GEY +R  +GSPP  +  ++D+GS ++W+
Sbjct: 65  LQRVSHFLDENKLPE---------SLLIPDKGEYLMRFYIGSPPVERLAMVDTGSSLIWL 115

Query: 183 QCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENA--GC-HAGRCRYEVSYGDGS 239
           QC PC  C+ Q  P+F+P  S+++   +C S  C  L+ +   C   G+C Y + YGD S
Sbjct: 116 QCSPCHNCFPQETPLFEPLKSSTYKYATCDSQPCTLLQPSQRDCGKLGQCIYGIMYGDKS 175

Query: 240 YTKGTLALETLTIGRT------VVKNVAIGCGHKNQGMFVGA---AGLLGLGGGSMSLVG 290
           ++ G L  ETL+ G T         N   GCG  N      +    G+ GLG G +SLV 
Sbjct: 176 FSVGILGTETLSFGSTGGAQTVSFPNTIFGCGVDNNFTIYTSNKVMGIAGLGAGPLSLVS 235

Query: 291 QLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPV--GAAWVPLVRNPRAPSFYYVGLSGL 348
           QLG Q G  FSYCL+   + S+  L FG EA+    G    PL+  P  P++Y++ L  +
Sbjct: 236 QLGAQIGHKFSYCLLPYDSTSTSKLKFGSEAIITTNGVVSTPLIIKPSLPTYYFLNLEAV 295

Query: 349 GVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGV 408
            +G   +         T   D  +V+D+GT +T L    Y  F  +     G        
Sbjct: 296 TIGQKVVS--------TGQTDGNIVIDSGTPLTYLENTFYNNFVASLQETLGVKLLQDLP 347

Query: 409 SIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPS-GLS 467
           S   TC+      ++ +P ++F F+G  V   P  N LIP+ D+   C A  PS   G+S
Sbjct: 348 SPLKTCF--PNRANLAIPDIAFQFTGASVALRP-KNVLIPLTDSNILCLAVVPSSGIGIS 404

Query: 468 IIGNIQQEGIQISFDGANGFVGFGPNVC 495
           + G+I Q   Q+ +D     V F P  C
Sbjct: 405 LFGSIAQYDFQVEYDLEGKKVSFAPTDC 432


>gi|194696934|gb|ACF82551.1| unknown [Zea mays]
 gi|413936470|gb|AFW71021.1| hypothetical protein ZEAMMB73_589717 [Zea mays]
 gi|413953801|gb|AFW86450.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
          Length = 445

 Score =  202 bits (514), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 143/416 (34%), Positives = 211/416 (50%), Gaps = 52/416 (12%)

Query: 102 HRHQHSFHA-RMQRDVKRVATLVRR--------LSGGGADAAKHEVQDFGTDVVSGMDQG 152
           HRH     A  +  D +  A + RR        + G       H     GT V+S     
Sbjct: 60  HRHGPCAPAPSLSTDTRSFADIFRRSRARPSYIVRGKKVSVPAH----LGTSVMSL---- 111

Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS--QCYKQSDPVFDPADSASFSGVS 210
             EY VR+  G+P   Q +VID+GSD+ W+QC+PCS  QC+ Q DP++DP+ S+++S V 
Sbjct: 112 --EYVVRVSFGTPAVPQVVVIDTGSDVSWLQCKPCSSGQCFPQKDPLYDPSHSSTYSAVP 169

Query: 211 CSSAVCDRLE----NAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGR-TVVKNVAIGC 264
           C+S VC +L      +GC +G+ C + +SY DG+ T G  + + LT+    +V+N   GC
Sbjct: 170 CASDVCKKLAADAYGSGCTSGKQCGFAISYADGTSTVGAYSQDKLTLAPGAIVQNFYFGC 229

Query: 265 GHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPV 324
           GH    +     G+LGLG     L   LG + GG FSYCL S  +   G L  G    P 
Sbjct: 230 GHGKHAVRGLFDGVLGLG----RLRESLGARYGGVFSYCLPSV-SSKPGFLALGAGKNPS 284

Query: 325 GAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLP 384
           G  + P+   P  P+F  V L+G+ VGG ++ +    F        G+++D+GT +T L 
Sbjct: 285 GFVFTPMGTVPGQPTFSTVTLAGINVGGKKLDLRPSAF------SGGMIVDSGTVITGLQ 338

Query: 385 TPAYEAFRDAF---VAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLP 441
           + AY A R AF   +     LP        DTCYNL+G+ +V VP ++  F+GG  + L 
Sbjct: 339 STAYRALRSAFRKAMEAYRLLPNGD----LDTCYNLTGYKNVVVPKIALTFTGGATINLD 394

Query: 442 ASNFLIPVDDAGTFCFAFAPS-PSGLS-IIGNIQQEGIQISFDGANGFVGFGPNVC 495
             N ++ V+     C AFA S P G + ++GN+ Q   ++ FD +    GF    C
Sbjct: 395 VPNGIL-VNG----CLAFAESGPDGSAGVLGNVNQRAFEVLFDTSTSKFGFRAKAC 445


>gi|356546378|ref|XP_003541603.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 439

 Score =  202 bits (514), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 146/428 (34%), Positives = 209/428 (48%), Gaps = 33/428 (7%)

Query: 79  WNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEV 138
           +++E++HRD   S                 +   +   +RVA  VRR    G    K  V
Sbjct: 31  FSVEMIHRDSSRSP---------------LYRPTETPFQRVANAVRRSINRGNHFKKAFV 75

Query: 139 QDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVF 198
                +  S +    GEY +R  VGSPP     ++D+GSDI+W+QC+PC  CYKQ+ P+F
Sbjct: 76  STDSAE--STVVASQGEYLMRYSVGSPPFQVLGIVDTGSDILWLQCEPCEDCYKQTTPIF 133

Query: 199 DPADSASFSGVSCSSAVCDRLENAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRTVV 257
           DP+ S ++  + CSS  C+ L N  C +   C Y + YGDGS++ G L++ETLT+G T  
Sbjct: 134 DPSKSKTYKTLPCSSNTCESLRNTACSSDNVCEYSIDYGDGSHSDGDLSVETLTLGSTDG 193

Query: 258 KNV-----AIGCGHKNQGMFVGAAGLLGLGGGS-MSLVGQLGGQTGGAFSYCL--VSRGT 309
            +V      IGCGH N G F      +   GG  +SL+ QL    GG FSYCL  +   +
Sbjct: 194 SSVHFPKTVIGCGHNNGGTFQEEGSGIVGLGGGPVSLISQLSSSIGGKFSYCLAPIFSES 253

Query: 310 GSSGSLVFGREALPVGAAWVPLVRNP-RAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMG 368
            SS  L FG  A+  G   V    +P     FY++ L    VG  RI  S      +  G
Sbjct: 254 NSSSKLNFGDAAVVSGRGTVSTPLDPLNGQVFYFLTLEAFSVGDNRIEFSGSSSSGSGSG 313

Query: 369 DDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVS-IFDTCYNLSGFVSVRVPT 427
           D  +++D+GT +T LP   Y     A V+    L RA   S +   CY  +    + +P 
Sbjct: 314 DGNIIIDSGTTLTLLPQEDYLNLESA-VSDVIKLERARDPSKLLSLCYKTTS-DELDLPV 371

Query: 428 VSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGF 487
           ++ +F G  V   P S F +PV + G  CFAF  S  G +I GN+ Q+ + + +D     
Sbjct: 372 ITAHFKGADVELNPISTF-VPV-EKGVVCFAFISSKIG-AIFGNLAQQNLLVGYDLVKKT 428

Query: 488 VGFGPNVC 495
           V F P  C
Sbjct: 429 VSFKPTDC 436


>gi|302757235|ref|XP_002962041.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
 gi|300170700|gb|EFJ37301.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
          Length = 367

 Score =  202 bits (513), Expect = 4e-49,   Method: Compositional matrix adjust.
 Identities = 136/369 (36%), Positives = 193/369 (52%), Gaps = 28/369 (7%)

Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
           SG Y + I +GSPP+    ++D+GSD+VW+QC+PCSQCY QSDP++DP+ S++F+  SCS
Sbjct: 1   SGAYTMEIELGSPPKKFNAIVDTGSDLVWIQCKPCSQCYSQSDPIYDPSASSTFAKTSCS 60

Query: 213 SAVCDRLENAGC--HAGRCRYEVSYGDGSYTKGTLALETLTI-----GRTVVKNVAIGCG 265
           ++ C  L  +GC   A  C Y   YGD S T+G  ALETLT+           N   GCG
Sbjct: 61  TSSCQSLPASGCSSSAKTCIYGYQYGDSSSTQGDFALETLTLRSSGGSSKAFPNFQFGCG 120

Query: 266 HKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGS--LVFGREALP 323
             N G F GAAG++GLG G +SL  QLG      FSYCLV     SS +  L+FG  A  
Sbjct: 121 RLNSGSFGGAAGIVGLGQGKISLSTQLGSAINNKFSYCLVDFDDDSSKTSPLIFGSSAST 180

Query: 324 -VGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLF-------------RLTQMGD 369
             GA   P++ N    ++Y+VGL G+ VGG ++ ++                 R  ++  
Sbjct: 181 GSGAISTPIIPNSGRSTYYFVGLEGISVGGKQLSLATRAIDFLSVRSKKKLRVRALEVNS 240

Query: 370 DGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSI-FDTCYNLSGFVSVRVPTV 428
            G + D+GT +T L    Y   + AF A + +LP     S  FD CY++S   + + P +
Sbjct: 241 GGTIFDSGTTLTLLDDAVYSKVKSAF-ASSVSLPTVDASSSGFDLCYDVSKSKNFKFPAL 299

Query: 429 SFYFSGGPVLTLPASNFLIPVDDAGTF-CFAF-APSPSGLSIIGNIQQEGIQISFDGANG 486
           +  F G    + P  N+ + VD A T  C A       GL IIGN+ Q+   + +D    
Sbjct: 300 TLAFKGTK-FSPPQKNYFVIVDTAETVACLAMGGSGSLGLGIIGNLMQQNYHVVYDRGTS 358

Query: 487 FVGFGPNVC 495
            +   P  C
Sbjct: 359 TISMSPAQC 367


>gi|148910443|gb|ABR18297.1| unknown [Picea sitchensis]
          Length = 452

 Score =  202 bits (513), Expect = 4e-49,   Method: Compositional matrix adjust.
 Identities = 144/400 (36%), Positives = 215/400 (53%), Gaps = 22/400 (5%)

Query: 101 YHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRI 160
           +     ++ + M   ++  A  +R L       ++   QD   +V   +  GSGEY +++
Sbjct: 66  FRPPNRTWESLMSEKIRGDANRLRFLK----RTSRSSKQDANANV--PVRSGSGEYIIQV 119

Query: 161 GVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLE 220
             G+P +S Y +ID+GSD+ W+ C+ C  C+  + P+FDPA S+S+   +C S  C  + 
Sbjct: 120 DFGTPKQSMYTLIDTGSDVAWIPCKQCQGCHSTA-PIFDPAKSSSYKPFACDSQPCQEIS 178

Query: 221 NAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLG 280
                  +C++EVSYGDG+   GTLA + +T+G   + N + GC          + GL+G
Sbjct: 179 GNCGGNSKCQFEVSYGDGTQVDGTLASDAITLGSQYLPNFSFGCAESLSEDTSPSPGLMG 238

Query: 281 LGGGSMSLVGQLGGQT--GGAFSYCLVSRGTGSSGSLVFGREALPVGAA--WVPLVRNPR 336
           LGGGS+SL+ Q       GG FSYCL    + SSGSLV G+EA    ++  +  L+++P 
Sbjct: 239 LGGGSLSLLTQAPTAELFGGTFSYCL-PSSSTSSGSLVLGKEAAVSSSSLKFTTLIKDPS 297

Query: 337 APSFYYVGLSGLGVGGMRIPISEDLFRLTQMGD-DGVVMDTGTAVTRLPTPAYEAFRDAF 395
            P+FY+V L  + VG  RI +       T +    G ++D+GT +T L   AY A RDAF
Sbjct: 298 IPTFYFVTLKAISVGNTRISVPG-----TNIASGGGTIIDSGTTITHLVPSAYTALRDAF 352

Query: 396 VAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTF 455
             Q  +L + + V   DTCY+LS   SV VPT++ +      L LP  N LI   ++G  
Sbjct: 353 RQQLSSL-QPTPVEDMDTCYDLSS-SSVDVPTITLHLDRNVDLVLPKENILI-TQESGLA 409

Query: 456 CFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           C AF+ S    SIIGN+QQ+  +I FD  N  VGF    C
Sbjct: 410 CLAFS-STDSRSIIGNVQQQNWRIVFDVPNSQVGFAQEQC 448


>gi|297605079|ref|NP_001056639.2| Os06g0121800 [Oryza sativa Japonica Group]
 gi|255676668|dbj|BAF18553.2| Os06g0121800 [Oryza sativa Japonica Group]
          Length = 487

 Score =  201 bits (511), Expect = 8e-49,   Method: Compositional matrix adjust.
 Identities = 138/348 (39%), Positives = 178/348 (51%), Gaps = 27/348 (7%)

Query: 161 GVGSPPRSQYMVIDSGSDIVWVQCQPCS--QCYKQSDPVFDPADSASFSGVSCSSAVCDR 218
            +  P  +Q M ID+  D+ W+QC PC   +CY Q + +FDP  S + + V C SA C  
Sbjct: 154 AIDDPILAQPMSIDTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGE 213

Query: 219 L--ENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIG-RTVVKNVAIGCGHKNQGMF-VG 274
           L    AGC   +C+Y V YGDG  T GT  ++ LT+   TVV N   GC H  +G F   
Sbjct: 214 LGRYGAGCSNNQCQYFVDYGDGRATSGTYMVDALTLNPSTVVMNFRFGCSHAVRGNFSAS 273

Query: 275 AAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGA---AWVPL 331
            +G + LGGG  SL+ Q     G AFSYC+      SSG L  G  A   GA   A  PL
Sbjct: 274 TSGTMSLGGGRQSLLSQTAATFGNAFSYCVPD--PSSSGFLSLGGPADGGGAGRFARTPL 331

Query: 332 VRNPRA-PSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEA 390
           VRNP   P+ Y V L G+ VGG R+ +   +F        G VMD+   +T+LP  AY A
Sbjct: 332 VRNPSIIPTLYLVRLRGIEVGGRRLNVPPVVF------AGGAVMDSSVIITQLPPTAYRA 385

Query: 391 FRDAFVAQTGNLPR-ASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPV 449
            R AF +     PR A G +  DTCY+   F SV VP VS  F GG V+ L A   ++  
Sbjct: 386 LRLAFRSAMAAYPRVAGGRAGLDTCYDFVRFTSVTVPAVSLVFDGGAVVRLDAMGVMV-- 443

Query: 450 DDAGTFCFAFAPSPS--GLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
                 C AF P+P    L  IGN+QQ+  ++ +D   G VGF    C
Sbjct: 444 ----EGCLAFVPTPGDFALGFIGNVQQQTHEVLYDVGGGSVGFRRGAC 487


>gi|224099307|ref|XP_002311432.1| predicted protein [Populus trichocarpa]
 gi|222851252|gb|EEE88799.1| predicted protein [Populus trichocarpa]
          Length = 458

 Score =  201 bits (510), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 134/388 (34%), Positives = 199/388 (51%), Gaps = 37/388 (9%)

Query: 143 TDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDP--VFDP 200
           + ++SG   GSG+YFV I +GSPP++  +V D+GSD+ WV+C  C        P   F  
Sbjct: 70  SPLMSGASSGSGQYFVSIRLGSPPQTLLLVADTGSDLTWVRCSACKTNCSIHPPGSTFLA 129

Query: 201 ADSASFSGVSCSSAVCDRLENAG---CHAGR----CRYEVSYGDGSYTKGTLALETLTI- 252
             S +FS   C S++C  +       C+  R    CRYE  Y DGS T G  + ET T+ 
Sbjct: 130 RHSTTFSPTHCFSSLCQLVPQPNPNPCNHTRLHSTCRYEYVYSDGSKTSGFFSKETTTLN 189

Query: 253 ---GRTV-VKNVAIGCGHKNQG------MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSY 302
              GR + +K++A GCG    G       F GA+G++GLG G +S   QLG + G +FSY
Sbjct: 190 TSSGREMKLKSIAFGCGFHASGPSLIGSSFNGASGVMGLGRGPISFASQLGRRFGRSFSY 249

Query: 303 CLVSRGTGSS-------GSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRI 355
           CL+              G +V  ++      ++ PL+ NP AP+FYY+ + G+ V G+++
Sbjct: 250 CLLDYTLSPPPTSYLMIGDVVSTKKDNKSMMSFTPLLINPEAPTFYYISIKGVFVDGVKL 309

Query: 356 PISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPR-----ASGVSI 410
            I   ++ L ++G+ G V+D+GT +T L  PAY     AF  +   LP      AS  S 
Sbjct: 310 HIDPSVWSLDELGNGGTVIDSGTTLTFLTEPAYREILSAFKREV-KLPSPTPGGASTRSG 368

Query: 411 FDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAP--SPSG-LS 467
           FD C N++G    R P +S    G  + + P  N+ I + + G  C A  P  + SG  S
Sbjct: 369 FDLCVNVTGVSRPRFPRLSLELGGESLYSPPPRNYFIDISE-GIKCLAIQPVEAESGRFS 427

Query: 468 IIGNIQQEGIQISFDGANGFVGFGPNVC 495
           +IGN+ Q+G  + FD     +GF    C
Sbjct: 428 VIGNLMQQGFLLEFDRGKSRLGFSRRGC 455


>gi|218197468|gb|EEC79895.1| hypothetical protein OsI_21423 [Oryza sativa Indica Group]
          Length = 471

 Score =  201 bits (510), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 138/348 (39%), Positives = 178/348 (51%), Gaps = 27/348 (7%)

Query: 161 GVGSPPRSQYMVIDSGSDIVWVQCQPCS--QCYKQSDPVFDPADSASFSGVSCSSAVCDR 218
            +  P  +Q M ID+  D+ W+QC PC   +CY Q + +FDP  S + + V C SA C  
Sbjct: 138 AIDDPILAQPMSIDTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGE 197

Query: 219 L--ENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIG-RTVVKNVAIGCGHKNQGMF-VG 274
           L    AGC   +C+Y V YGDG  T GT  ++ LT+   TVV N   GC H  +G F   
Sbjct: 198 LGRYGAGCSNNQCQYFVDYGDGRATSGTYMVDALTLNPSTVVMNFRFGCSHAVRGNFSAS 257

Query: 275 AAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGA---AWVPL 331
            +G + LGGG  SL+ Q     G AFSYC+      SSG L  G  A   GA   A  PL
Sbjct: 258 TSGTMSLGGGRQSLLSQTAATFGNAFSYCVPD--PSSSGFLSLGGPADGGGAGRFARTPL 315

Query: 332 VRNPRA-PSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEA 390
           VRNP   P+ Y V L G+ VGG R+ +   +F        G VMD+   +T+LP  AY A
Sbjct: 316 VRNPSIIPTLYLVRLRGIEVGGRRLNVPPVVFA------GGAVMDSSVIITQLPPTAYRA 369

Query: 391 FRDAFVAQTGNLPR-ASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPV 449
            R AF +     PR A G +  DTCY+   F SV VP VS  F GG V+ L A   ++  
Sbjct: 370 LRLAFRSAMAAYPRVAGGRAGLDTCYDFVRFTSVTVPAVSLVFDGGAVVRLDAMGVMV-- 427

Query: 450 DDAGTFCFAFAPSPS--GLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
                 C AF P+P    L  IGN+QQ+  ++ +D   G VGF    C
Sbjct: 428 ----EGCLAFVPTPGDFALGFIGNVQQQTHEVLYDVGGGSVGFRRGAC 471


>gi|326500408|dbj|BAK06293.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 475

 Score =  201 bits (510), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 132/340 (38%), Positives = 164/340 (48%), Gaps = 25/340 (7%)

Query: 169 QYMVIDSGSDIVWVQCQPCS--QCYKQSDPVFDPADSASFSGVSCSSAVCDRL--ENAGC 224
           Q M ID+  D+ W+QC PC   QCY Q DP+FDP  S++ + V C S  C  L     GC
Sbjct: 148 QTMAIDTTVDVPWIQCAPCPIPQCYPQRDPLFDPTTSSTAAAVRCRSPACRSLGPYGNGC 207

Query: 225 HA----GRCRYEVSYGDGSYTKGTLALETLTI-GRTVVKNVAIGCGHKNQGMFVG-AAGL 278
                   CRY + Y D   T GT   +TLTI G T V+N   GC H  +G F    AG 
Sbjct: 208 SNRSANAECRYLIEYSDDRATAGTYMTDTLTISGTTAVRNFRFGCSHAVRGRFSDLTAGT 267

Query: 279 LGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGA---AWVPLVRNP 335
           + LGGG+ SL+ Q     G AFSYC+      +SG L  G  A        A  PLVR+ 
Sbjct: 268 MSLGGGAQSLLAQTARSLGNAFSYCVPQ--ASASGFLSIGGPATTNSTTVFATTPLVRSA 325

Query: 336 RAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAF 395
             PS Y V L G+ V G R+ I    F        G VMD+   +T+LP  AY A R AF
Sbjct: 326 INPSLYLVRLQGIVVAGRRLGIPPVAF------SAGAVMDSSAVITQLPPTAYRALRRAF 379

Query: 396 VAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTF 455
                  PR+      DTCY+  G  +VRVP VS  F GG V+ L     +I     G  
Sbjct: 380 RNAMRAYPRSGATGTLDTCYDFLGLTNVRVPAVSLVFGGGAVVVLDPPAVMI----GGCL 435

Query: 456 CFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
            F    S   L  IGN+QQ+  ++ +D A G VGF    C
Sbjct: 436 AFTATSSDLALGFIGNVQQQTHEVLYDVAAGGVGFRRGAC 475


>gi|125543640|gb|EAY89779.1| hypothetical protein OsI_11321 [Oryza sativa Indica Group]
          Length = 434

 Score =  200 bits (509), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 130/362 (35%), Positives = 188/362 (51%), Gaps = 32/362 (8%)

Query: 155 EYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSA 214
           EY V + +G+PP+   + +D+GSD++W QCQPC  C+ Q+ P FDP+ S++ S  SC S 
Sbjct: 81  EYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTSCDST 140

Query: 215 VCDRLENAGCHAGR------CRYEVSYGDGSYTKGTLALETLTI--GRTVVKNVAIGCGH 266
           +C  L  A C + +      C Y  SYGD S T G L ++  T       V  VA GCG 
Sbjct: 141 LCQGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTFVGAGASVPGVAFGCGL 200

Query: 267 KNQGMFV-GAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVF-------- 317
            N G+F     G+ G G G +SL  QL     G FS+C  +       +++         
Sbjct: 201 FNNGVFKSNETGIAGFGRGPLSLPSQL---KVGNFSHCFTAVNGLKPSTVLLDLPADLYK 257

Query: 318 -GREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDT 376
            GR A+       PL++NP  P+FYY+ L G+ VG  R+P+ E  F L + G  G ++D+
Sbjct: 258 SGRGAV----QSTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFTL-KNGTGGTIIDS 312

Query: 377 GTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVR--VPTVSFYFSG 434
           GTA+T LPT  Y   RDAF AQ   LP  SG +  D  + LS  +  +  VP +  +F G
Sbjct: 313 GTAMTSLPTRVYRLVRDAFAAQV-KLPVVSGNTT-DPYFCLSAPLRAKPYVPKLVLHFEG 370

Query: 435 GPVLTLPASNFLIPVDDAGTFCFAFAPSPSG-LSIIGNIQQEGIQISFDGANGFVGFGPN 493
              + LP  N++  V+DAG+     A    G ++ IGN QQ+ + + +D  N  + F P 
Sbjct: 371 A-TMDLPRENYVFEVEDAGSSILCLAIIEGGEVTTIGNFQQQNMHVLYDLQNSKLSFVPA 429

Query: 494 VC 495
            C
Sbjct: 430 QC 431


>gi|115452685|ref|NP_001049943.1| Os03g0318400 [Oryza sativa Japonica Group]
 gi|108707841|gb|ABF95636.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113548414|dbj|BAF11857.1| Os03g0318400 [Oryza sativa Japonica Group]
          Length = 434

 Score =  200 bits (509), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 130/362 (35%), Positives = 188/362 (51%), Gaps = 32/362 (8%)

Query: 155 EYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSA 214
           EY V + +G+PP+   + +D+GSD++W QCQPC  C+ Q+ P FDP+ S++ S  SC S 
Sbjct: 81  EYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTSCDST 140

Query: 215 VCDRLENAGCHAGR------CRYEVSYGDGSYTKGTLALETLTI--GRTVVKNVAIGCGH 266
           +C  L  A C + +      C Y  SYGD S T G L ++  T       V  VA GCG 
Sbjct: 141 LCQGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTFVGAGASVPGVAFGCGL 200

Query: 267 KNQGMFV-GAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVF-------- 317
            N G+F     G+ G G G +SL  QL     G FS+C  +       +++         
Sbjct: 201 FNNGVFKSNETGIAGFGRGPLSLPSQL---KVGNFSHCFTAVNGLKPSTVLLDLPADLYK 257

Query: 318 -GREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDT 376
            GR A+       PL++NP  P+FYY+ L G+ VG  R+P+ E  F L + G  G ++D+
Sbjct: 258 SGRGAV----QSTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFAL-KNGTGGTIIDS 312

Query: 377 GTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVR--VPTVSFYFSG 434
           GTA+T LPT  Y   RDAF AQ   LP  SG +  D  + LS  +  +  VP +  +F G
Sbjct: 313 GTAMTSLPTRVYRLVRDAFAAQV-KLPVVSGNTT-DPYFCLSAPLRAKPYVPKLVLHFEG 370

Query: 435 GPVLTLPASNFLIPVDDAGTFCFAFAPSPSG-LSIIGNIQQEGIQISFDGANGFVGFGPN 493
              + LP  N++  V+DAG+     A    G ++ IGN QQ+ + + +D  N  + F P 
Sbjct: 371 A-TMDLPRENYVFEVEDAGSSILCLAIIEGGEVTTIGNFQQQNMHVLYDLQNSKLSFVPA 429

Query: 494 VC 495
            C
Sbjct: 430 QC 431


>gi|242092892|ref|XP_002436936.1| hypothetical protein SORBIDRAFT_10g011730 [Sorghum bicolor]
 gi|241915159|gb|EER88303.1| hypothetical protein SORBIDRAFT_10g011730 [Sorghum bicolor]
          Length = 469

 Score =  200 bits (508), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 153/447 (34%), Positives = 223/447 (49%), Gaps = 43/447 (9%)

Query: 69  SSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSG 128
           ++  +SD +R ++ L++R    + ++         ++ S    ++RD  R   ++R+ SG
Sbjct: 46  AAQVTSDPSRASMPLMYRHGPCAPASAAAT-----NRPSPAEMLRRDRARRNHILRKASG 100

Query: 129 GGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC- 187
                 +            G    S +Y V +G G+P   Q ++ID+GSD+ WVQCQPC 
Sbjct: 101 ------RRITLGVSIPTSLGAFVDSLQYVVTLGFGTPAVPQVLLIDTGSDLSWVQCQPCN 154

Query: 188 -SQCYKQSDPVFDPADSASFSGVSCSSAVCDRLE---------NAGCHAGRCRYEVSYGD 237
            S CY Q DPVFDP+ S++++ V C S  C  L+         N+   A  C+Y + YG+
Sbjct: 155 SSTCYPQKDPVFDPSASSTYAPVPCGSEACRDLDPDSYANGCTNSSSGASLCQYGIQYGN 214

Query: 238 GSYTKGTLALETLTI---GRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGG 294
           G  T G  + ETLT+     TVV N + GCG   +G+F    GLLGLGG   SLV Q  G
Sbjct: 215 GDTTVGVYSTETLTLSPEAATVVNNFSFGCGLVQKGVFDLFDGLLGLGGAPESLVSQTTG 274

Query: 295 QTGGAFSYCLVSRGTGSSGSLVFGREAL----PVGAAWVPLVRNPRAPSFYYVGLSGLGV 350
             GGAFSYCL + G  ++G L  G  A       G  + PL       +FY V L+G+ V
Sbjct: 275 TYGGAFSYCLPA-GNSTAGFLALGAPATGGNNTAGFQFTPL--QVVETTFYLVKLTGISV 331

Query: 351 GGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLP--RASGV 408
           GG ++ I   +F        G+++D+GT VT LP  AY A R AF +     P    +  
Sbjct: 332 GGKQLDIEPTVFA------GGMIIDSGTIVTGLPETAYSALRTAFRSAMSAYPLLPPNDD 385

Query: 409 SIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSI 468
              DTCY+ +G  +V VPTV+  F GG  + L   + ++ +D  G   F    S     I
Sbjct: 386 EDLDTCYDFTGNTNVTVPTVALTFEGGVTIDLDVPSGVL-LD--GCLAFVAGASDGDTGI 442

Query: 469 IGNIQQEGIQISFDGANGFVGFGPNVC 495
           IGN+ Q   ++ +D A G VGF    C
Sbjct: 443 IGNVNQRTFEVLYDSARGHVGFRAGAC 469


>gi|242041115|ref|XP_002467952.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
 gi|241921806|gb|EER94950.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
          Length = 774

 Score =  199 bits (507), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 127/364 (34%), Positives = 185/364 (50%), Gaps = 29/364 (7%)

Query: 155 EYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSA 214
           EY V + +G+PP+   +++D+GSD+VW QC+PC  C+ ++    DP++S++F  + CSS 
Sbjct: 414 EYLVHLAIGTPPQPVQLILDTGSDLVWTQCRPCPVCFSRALGPLDPSNSSTFDVLPCSSP 473

Query: 215 VCDRLENAGCHAGR-----CRYEVSYGDGSYTKGTLALETLTI------GRTVVKNVAIG 263
           VCD L  + C         C Y  +Y DGS T G L  ET T       G+  V ++A G
Sbjct: 474 VCDNLTWSSCGKHNWGNQTCVYVYAYADGSITTGHLDAETFTFAAADGTGQATVPDLAFG 533

Query: 264 CGHKNQGMFV-GAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREAL 322
           CG  N G+F     G+ G G G++SL  QL       FS+C  +       S++ G  A 
Sbjct: 534 CGLFNNGIFTSNETGIAGFGRGALSLPSQLKVDN---FSHCFTAITGSEPSSVLLGLPAN 590

Query: 323 PVGAA-----WVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTG 377
               A       PLV+N  +   YY+ L G+ VG  R+PI E  F L Q G  G ++D+G
Sbjct: 591 LYSDADGAVQSTPLVQNFSSLRAYYLSLKGITVGSTRLPIPESTFALKQDGTGGTIIDSG 650

Query: 378 TAVTRLPTPAYEAFRDAFVAQTGNLP--RASGVSIFDTCYNLSGFVSVR--VPTVSFYFS 433
           T +T LP  AY+   DAF AQ   LP   A+  S+   C++ S     +  VP +  +F 
Sbjct: 651 TGMTTLPQDAYKLVHDAFTAQV-RLPVDNATSSSLSRLCFSFSVPRRAKPDVPKLVLHFE 709

Query: 434 GGPVLTLPASNFLIPVDDAG--TFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFG 491
           G   L LP  N++   +DAG    C A   +   L+IIGN QQ+ + + +D     + F 
Sbjct: 710 GA-TLDLPRENYMFEFEDAGGSVTCLAIN-AGDDLTIIGNYQQQNLHVLYDLVRNMLSFV 767

Query: 492 PNVC 495
           P  C
Sbjct: 768 PAQC 771


>gi|357514995|ref|XP_003627786.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355521808|gb|AET02262.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 436

 Score =  199 bits (507), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 134/436 (30%), Positives = 207/436 (47%), Gaps = 49/436 (11%)

Query: 79  WNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEV 138
           +++EL+HRD + S                 +   Q   +      RR         K+ +
Sbjct: 28  FSVELIHRDSLKSP---------------LYKPTQNKYQYFVDAARRSINRANHFYKYSL 72

Query: 139 QDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVF 198
            +     V       GEY +   VG+PP   Y ++D+GSDIVW+QC+PC +CY Q+ P+F
Sbjct: 73  ANIPQSTVI---PDIGEYLMTYSVGTPPFKLYGIVDTGSDIVWLQCEPCQECYNQTTPMF 129

Query: 199 DPADSASFSGVSCSSAVCDRLENAGCH-AGRCRYEVSYGDGSYTKGTLALETLTI----G 253
           +P+ S+S+  + C S +C  +E+  C+    C Y   YGD S++ G L+++TLT+    G
Sbjct: 130 NPSKSSSYKNIPCPSKLCQSMEDTSCNDKNYCEYSTYYGDNSHSGGDLSVDTLTLESTNG 189

Query: 254 RTV-VKNVAIGCGHKNQGMFVGA-AGLLGLGGGSMSLVGQLGGQTGGAFSYCL------V 305
            TV   N+ IGCG  N   + GA +G++G G G  S + QLG  TGG FSYCL       
Sbjct: 190 LTVSFPNIVIGCGTNNILSYEGASSGIVGFGSGPASFITQLGSSTGGKFSYCLTPLFSVT 249

Query: 306 SRGTGSSGSLVFGREALPVGAAWVP---LVRNPRAPSFYYVGLSGLGVGGMRIPISEDLF 362
           +  + ++  L FG  A   G   V    L ++P   +FYY+ L    VG  R+ I     
Sbjct: 250 NIQSNATSKLNFGDAATVSGDGVVTTPILKKDPE--TFYYLTLEAFSVGNRRVEIGG--- 304

Query: 363 RLTQMGDD--GVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASG-VSIFDTCYNLSG 419
                GD+   +++D+GT +T L    Y +F ++ V     L R        + CY++  
Sbjct: 305 --VPNGDNEGNIIIDSGTTLTSLTKDDY-SFLESAVVDLVKLERVDDPTQTLNLCYSVKA 361

Query: 420 FVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQI 479
                 P ++ +F G  V   P S F+   D  G FC AF  S    +I GN+ Q+ + +
Sbjct: 362 -EGYDFPIITMHFKGADVDLHPISTFVSVAD--GVFCLAFESSQDH-AIFGNLAQQNLMV 417

Query: 480 SFDGANGFVGFGPNVC 495
            +D     V F P+ C
Sbjct: 418 GYDLQQKIVSFKPSDC 433


>gi|218197465|gb|EEC79892.1| hypothetical protein OsI_21411 [Oryza sativa Indica Group]
          Length = 720

 Score =  199 bits (507), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 145/454 (31%), Positives = 222/454 (48%), Gaps = 53/454 (11%)

Query: 59  ELFERHNNISSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKR 118
           + F   + ++  +  S  + W   L H     S + ++ N        S    +  D +R
Sbjct: 44  KTFCSGHKVAPGDVPSPNSTWA-PLHHLYGPCSPAPSSANSTAADVAASMADMVDDDQRR 102

Query: 119 VATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPR----------- 167
              + +RL+G     A  + Q       +   + +G+Y    G+GS P            
Sbjct: 103 ADYIQKRLTG-----ATDDKQPMAFSSRTSQYEKNGQYATNGGLGSVPHLKSLSTTATTN 157

Query: 168 ---------SQYMVIDSGSDIVWVQCQPCS--QCYKQSDPVFDPADSASFSGVSCSSAVC 216
                    +Q ++IDSGSD+ WVQC+PC    C++Q DP+FDPA S +++ V C+SA C
Sbjct: 158 SAPDGTSAVTQTVIIDSGSDVSWVQCKPCPLPMCHRQRDPLFDPAMSTTYAAVPCTSAAC 217

Query: 217 DRL--ENAGCHA-GRCRYEVSYGDGSYTKGTLALETLTIGR-TVVKNVAIGCGHKNQG-- 270
            +L     GC A  +C++ ++YGDGS   GT + + LT+G   V++    GC H ++G  
Sbjct: 218 AQLGPYRRGCSANAQCQFGINYGDGSTATGTYSFDDLTLGPYDVIRGFRFGCAHADRGSA 277

Query: 271 MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFG----REALPVGA 326
                AG L LGGGS SLV Q   + G  FSYCL    + S G LV G    R  L    
Sbjct: 278 FDYDVAGSLALGGGSQSLVQQTATRYGRVFSYCLPPTAS-SLGFLVLGVPPERAQLIPSF 336

Query: 327 AWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTP 386
              PL+ +  AP+FY V L  + V G  + +   +F  +       V+D+ T ++RLP  
Sbjct: 337 VSTPLLSSSMAPTFYRVLLRAIIVAGRPLAVPPAVFSASS------VIDSSTIISRLPPT 390

Query: 387 AYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFL 446
           AY+A R AF +       A  VSI DTCY+ +G  S+ +P+++  F GG  + L A+  L
Sbjct: 391 AYQALRAAFRSAMTMYRAAPPVSILDTCYDFTGVRSITLPSIALVFDGGATVNLDAAGIL 450

Query: 447 IPVDDAGTFCFAFAPSPSGL--SIIGNIQQEGIQ 478
           +     G+ C AFAP+ S      IGN+QQ+ ++
Sbjct: 451 L-----GS-CLAFAPTASDRMPGFIGNVQQKTLE 478



 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 84/283 (29%), Positives = 125/283 (44%), Gaps = 51/283 (18%)

Query: 223 GCHA-GRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGL 281
           GC A  +C++ ++YGDGS   GT + + LT+G   V           QG+ +  A     
Sbjct: 479 GCSANAQCQFGINYGDGSTATGTYSFDDLTLGPYDVDR---------QGLPLRTAT---- 525

Query: 282 GGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVP-LVRNP----- 335
                        Q G  FSYC +     S G +  G    P  AA VP  V  P     
Sbjct: 526 -------------QYGRVFSYC-IPPSPSSLGFITLGVP--PQRAALVPTFVSTPLLSSS 569

Query: 336 -RAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDA 394
              P+FY V L  + V G  +P+   +F  +       V+ + T ++RLP  AY+A R A
Sbjct: 570 SMPPTFYRVLLRAIIVAGRPLPVPPTVFSTSS------VIASTTVISRLPPTAYQALRAA 623

Query: 395 FVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGT 454
           F         A  VSI DTCY+ +G  S+ +P+++  F GG  + L A+  L+       
Sbjct: 624 FRRAMTMYRTAPPVSILDTCYDFTGVRSITLPSIALVFDGGATVNLDAAGILL------Q 677

Query: 455 FCFAFAPSPSGL--SIIGNIQQEGIQISFDGANGFVGFGPNVC 495
            C AFAP+ +      IGN+QQ  +++ +D     + F    C
Sbjct: 678 GCLAFAPTATDRMPGFIGNVQQRTLEVVYDVPGKAIRFRSAAC 720


>gi|449451908|ref|XP_004143702.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
 gi|449529900|ref|XP_004171936.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
          Length = 459

 Score =  199 bits (507), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 130/380 (34%), Positives = 198/380 (52%), Gaps = 35/380 (9%)

Query: 143 TDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQC-YKQSDPVFDPA 201
           + ++SG   GSG+YFV I +G+PP+S  +V D+GSD+VWV+C  C  C +      F P 
Sbjct: 75  SPLISGASTGSGQYFVDIRLGTPPQSLLLVADTGSDLVWVKCSACRNCSHHPPSSAFLPR 134

Query: 202 DSASFSGVSCSSAVCDRLENAGCHA-------GRCRYEVSYGDGS-----YTKGTLALET 249
            S+SFS   C    C  L +A  H          CR+  SY DGS     ++K T  L++
Sbjct: 135 HSSSFSPFHCFDPHCRLLPHAPHHLCNHTRLHSPCRFLYSYADGSLSSGFFSKETTTLKS 194

Query: 250 LTIGRTVVKNVAIGCGHKNQG------MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYC 303
           L+     +K ++ GCG +  G       F GA G++GLG GS+S   QLG + G  FSYC
Sbjct: 195 LSGSEIHLKGLSFGCGFRISGPSVSGAQFNGARGVMGLGRGSISFSSQLGRRFGNKFSYC 254

Query: 304 LVSRGTGSSGSLVF----GREALPVGAA----WVPLVRNPRAPSFYYVGLSGLGVGGMRI 355
           L+        +       G  +LP+  A    + PL  NP +P+FYY+ +  + + G+++
Sbjct: 255 LMDYTLSPPPTSFLMIGGGLHSLPLTNATKISYTPLQINPLSPTFYYITIHSITIDGVKL 314

Query: 356 PISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSI-FDTC 414
           PI+  ++ + + G+ G V+D+GT +T L   AYE    + V +   LP A+ ++  FD C
Sbjct: 315 PINPAVWEIDEQGNGGTVVDSGTTLTYLTKTAYEEVLKS-VRRRVKLPNAAELTPGFDLC 373

Query: 415 YNLSGFVSVR--VPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAF--APSPSGLSIIG 470
            N SG  S R  +P + F   GG V   P  N+ +  ++ G  C A     S +G S+IG
Sbjct: 374 VNASG-ESRRPSLPRLRFRLGGGAVFAPPPRNYFLETEE-GVMCLAIRAVESGNGFSVIG 431

Query: 471 NIQQEGIQISFDGANGFVGF 490
           N+ Q+G  + FD     +GF
Sbjct: 432 NLMQQGFLLEFDKEESRLGF 451


>gi|413921976|gb|AFW61908.1| hypothetical protein ZEAMMB73_608282 [Zea mays]
          Length = 459

 Score =  199 bits (506), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 137/373 (36%), Positives = 192/373 (51%), Gaps = 30/373 (8%)

Query: 149 MDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSG 208
           +  G  EY + + +G+PP     + D+GSD+ W QC+PC  C+ Q  P++D A SASFS 
Sbjct: 88  LRSGQAEYLMELAIGTPPVPFVALADTGSDLTWTQCKPCKLCFPQDTPIYDTAASASFSP 147

Query: 209 VSCSSAVCDRL--ENAGCHA---GRCRYEVSYGDGSYTKGTLALETLTIGRT-------- 255
           V C+SA C  +   +  C A     CRY  +Y DG+Y+ G L  ETLT   +        
Sbjct: 148 VPCASATCLPIWRSSRNCTATTTSPCRYRYAYDDGAYSAGVLGTETLTFAGSSPGAPGPG 207

Query: 256 -VVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGS 314
             V  VA GCG  N G+   + G +GLG GS+SLV QLG    G FSYCL      S GS
Sbjct: 208 VSVGGVAFGCGVDNGGLSYNSTGTVGLGRGSLSLVAQLG---VGKFSYCLTDFFNTSLGS 264

Query: 315 -LVFG---REALP--VGAAWV---PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLT 365
            ++FG     A P  +G A V   PLV+ P  PS YYV L G+ +G  R+PI    F L 
Sbjct: 265 PVLFGSLAELAAPSTIGGAAVQSTPLVQGPYNPSRYYVSLEGISLGDARLPIPNGTFDLR 324

Query: 366 QMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVS--V 423
             G  G+++D+GT  T L   A+    +  VA   N P  +  S+   C+  +       
Sbjct: 325 DDGSGGMIVDSGTIFTVLVESAFRVVVN-HVAGVLNQPVVNASSLDSPCFPATAGEQQLP 383

Query: 424 RVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGL-SIIGNIQQEGIQISFD 482
            +P +  +F+GG  + L   N++    ++ +FC   A +PS   SI+GN QQ+ IQ+ FD
Sbjct: 384 DMPDMLLHFAGGADMRLHRDNYMSFNQESSSFCLNIAGAPSAYGSILGNFQQQNIQMLFD 443

Query: 483 GANGFVGFGPNVC 495
              G + F P  C
Sbjct: 444 ITVGQLSFVPTDC 456


>gi|356546372|ref|XP_003541600.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 433

 Score =  199 bits (506), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 142/434 (32%), Positives = 213/434 (49%), Gaps = 49/434 (11%)

Query: 79  WNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEV 138
           +++E++HRD   S                F +  +   +RVA  V R      + A H  
Sbjct: 29  FSVEMIHRDSSRSP---------------FFSPTETQFQRVANAVHR----SINRANHLN 69

Query: 139 QDF------GTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYK 192
           Q F       T V+S +    GEY +   VG+P    + ++D+GSDI+W+QCQPC +CY+
Sbjct: 70  QSFVSPNSPETTVISAL----GEYLISYSVGTPSLQVFGILDTGSDIIWLQCQPCKKCYE 125

Query: 193 QSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGR-CRYEVSYGDGSYTKGTLALETLT 251
           Q+ P+FD + S ++  + C S  C  ++   C + + C Y + Y DGS + G L++ETLT
Sbjct: 126 QTTPIFDSSKSQTYKTLPCPSNTCQSVQGTFCSSRKHCLYSIHYVDGSQSLGDLSVETLT 185

Query: 252 IGRT-----VVKNVAIGCGHKNQ-GMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLV 305
           +G T           IGCG  N  G+    +G++GLG G MSL+ QL   TGG FSYCLV
Sbjct: 186 LGSTNGSPVQFPGTVIGCGRYNAIGIEEKNSGIVGLGRGPMSLITQLSPSTGGKFSYCLV 245

Query: 306 SRGTGSSGSLVFGREALPVGAAWV--PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFR 363
              + +S  L FG  A+  G   V  PL        FY++ L    VG  RI        
Sbjct: 246 PGLSTASSKLNFGNAAVVSGRGTVSTPLFSK-NGLVFYFLTLEAFSVGRNRIEFGSP--- 301

Query: 364 LTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVS-IFDTCYNLS-GFV 421
               G   +++D+GT +T LP   Y    +A VA+T  L R    + +   CY ++   +
Sbjct: 302 -GSGGKGNIIIDSGTTLTALPNGVYSKL-EAAVAKTVILQRVRDPNQVLGLCYKVTPDKL 359

Query: 422 SVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISF 481
              VP ++ +FSG  V TL A N  + V D    CFAF P+ +G ++ GN+ Q+ + + +
Sbjct: 360 DASVPVITAHFSGADV-TLNAINTFVQVAD-DVVCFAFQPTETG-AVFGNLAQQNLLVGY 416

Query: 482 DGANGFVGFGPNVC 495
           D     V F    C
Sbjct: 417 DLQMNTVSFKHTDC 430


>gi|195638734|gb|ACG38835.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
          Length = 465

 Score =  199 bits (506), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 143/396 (36%), Positives = 208/396 (52%), Gaps = 16/396 (4%)

Query: 107 SFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPP 166
           S  AR+ +      TL+     G + ++  +       +  G   G G Y  R+G+G+P 
Sbjct: 78  SLAARLAKTPSSRPTLLDESRAGSSSSSPDDESLASVPLGPGTSVGVGNYVTRMGLGTPA 137

Query: 167 RSQYMVIDSGSDIVWVQCQPCS-QCYKQSDPVFDPADSASFSGVS-----CSSAVCDRLE 220
           +S  MV+D+GS + W+QC PC   C++QS PVF+P  S+S++ VS     CS      L 
Sbjct: 138 KSYVMVVDTGSSLTWLQCSPCVVSCHRQSGPVFNPKASSSYASVSCSAQQCSDLTTATLN 197

Query: 221 NAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLL 279
            A C     C Y+ SYGD S++ G L+ +T++ G T V N   GCG  N+G+F  +AGL+
Sbjct: 198 PASCSTSNVCIYQASYGDSSFSVGYLSKDTVSFGSTSVPNFYYGCGQDNEGLFGQSAGLI 257

Query: 280 GLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPS 339
           GL    +SL+ QL    G +FSYCL +  + SSG L  G    P   ++ P+  +    S
Sbjct: 258 GLARNKLSLLYQLAPSMGYSFSYCLPTSSSSSSGYLSIGSYN-PGQYSYTPMASSSLDDS 316

Query: 340 FYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQT 399
            Y++ ++G+ V G   P+S      + +     ++D+GT +TRLPT  Y A   A     
Sbjct: 317 LYFIKMTGIKVAGK--PLSVSSSAYSSL---PTIIDSGTVITRLPTGVYSALSKAVAGAM 371

Query: 400 GNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAF 459
              PRAS  SI DTC+       +RVP V+  F+GG  L L A N L+ VD A T C AF
Sbjct: 372 KGTPRASAFSILDTCFQGQA-ARLRVPEVTMAFAGGAALKLAARNLLVDVDSATT-CLAF 429

Query: 460 APSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           AP+ S  +IIGN QQ+   + +D  N  +GF    C
Sbjct: 430 APARSA-AIIGNTQQQTFSVVYDVKNSKIGFAAAGC 464


>gi|413944387|gb|AFW77036.1| hypothetical protein ZEAMMB73_461996 [Zea mays]
          Length = 472

 Score =  199 bits (506), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 154/441 (34%), Positives = 217/441 (49%), Gaps = 40/441 (9%)

Query: 73  SSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGAD 132
           +SD  R ++ L HR    + + T++         S   R++RD  R   + R+    G  
Sbjct: 54  TSDPNRASMPLAHRHGPCAPATTSS-------WPSLAERLRRDRARRDHITRKAKASGRT 106

Query: 133 AAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC--SQC 190
               +V    T + + +D  S EY V +G+G+P   Q ++ID+GSD+ WVQC+PC  S C
Sbjct: 107 TTLSDVS-IPTSLGAAVD--SLEYVVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCNSSSC 163

Query: 191 YKQSDPVFDPADSASFSGVSCSSAVCDRL----ENAGCH----AGRCRYEVSYGDGSYTK 242
           Y Q DP++DP  S++++ V C S  C  L     + GC        C+Y + YG+   T 
Sbjct: 164 YPQKDPLYDPTASSTYAPVPCDSKACKDLVPDAYDHGCTNSSGTSLCQYGIEYGNRDTTV 223

Query: 243 GTLALETLTIGRTV-VKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFS 301
           G  + ETLT+   V VK+   GCG   QG F    GLLGLGG   SLV Q     GGAFS
Sbjct: 224 GVYSTETLTLSPQVSVKDFGFGCGLVQQGTFDLFDGLLGLGGAPESLVSQTAETYGGAFS 283

Query: 302 YCLVSRGTGSSGSLVFGREAL---PVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPIS 358
           YCL   G  ++G L  G         G  + PL   P   +FY V L+G+ VGG  + I 
Sbjct: 284 YCL-PPGNSTTGFLALGAPTNNNDTAGFLFTPLHSLPEQATFYLVNLTGVSVGGKPLDIP 342

Query: 359 EDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLP--RASGVSIFDTCYN 416
             +         G+++D+GT +T LP  AY A R AF       P    +   + DTCYN
Sbjct: 343 PTVLS------GGMIIDSGTIITGLPDTAYSALRTAFRTAMSAYPLLPPNNDDVLDTCYN 396

Query: 417 LSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFA--PSPSGLSIIGNIQQ 474
            +G  +V VPTV+  F GG  + L   + ++  D     C AFA   S   + IIGN+ Q
Sbjct: 397 FTGIANVTVPTVALTFDGGATIDLDVPSGVLIQD-----CLAFAGGASDGDVGIIGNVNQ 451

Query: 475 EGIQISFDGANGFVGFGPNVC 495
              ++ +D   G VGF P  C
Sbjct: 452 RTFEVLYDSGRGHVGFRPGAC 472


>gi|358346736|ref|XP_003637421.1| Aspartic proteinase nepenthesin-1, partial [Medicago truncatula]
 gi|355503356|gb|AES84559.1| Aspartic proteinase nepenthesin-1, partial [Medicago truncatula]
          Length = 280

 Score =  199 bits (506), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 114/231 (49%), Positives = 147/231 (63%), Gaps = 28/231 (12%)

Query: 91  SSSNTTNNMHYH-RHQHSFHA--------RMQRDVKRVATLVRRLSGGGADAAKHEVQDF 141
           +SS +T ++  H R   S HA        R+ RD  RV  +  +L+           Q+F
Sbjct: 64  TSSTSTLSLQLHSRASLSSHADYKSLTLSRLDRDSARVKYITTKLN-----------QNF 112

Query: 142 GTD-----VVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDP 196
            TD     ++SG  QGSGEYF RIG+G PP   YMV+D+GSDI WVQC PC+ CY+Q+DP
Sbjct: 113 NTDKLSGPIISGTSQGSGEYFSRIGIGEPPSQAYMVLDTGSDISWVQCAPCADCYRQADP 172

Query: 197 VFDPADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTV 256
           +F+P  SAS++ +SC +A C  L+ + C  G C Y+VSYGDGSYT G    ET+TIG   
Sbjct: 173 IFEPTASASYAPLSCEAAQCRYLDQSQCRNGNCLYQVSYGDGSYTVGDFVTETVTIGVNK 232

Query: 257 VKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR 307
           VKNVA+GCGH N+G+FVGAAGL+GLGGG +S   QL      +FSYCLV R
Sbjct: 233 VKNVALGCGHNNEGLFVGAAGLIGLGGGPLSFPAQLNST---SFSYCLVDR 280


>gi|225427550|ref|XP_002266461.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
          Length = 439

 Score =  199 bits (506), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 135/430 (31%), Positives = 212/430 (49%), Gaps = 38/430 (8%)

Query: 79  WNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEV 138
           ++++L+HRD   S     +     R   +F    +R V RV     R +   +D  +  +
Sbjct: 32  FSVDLIHRDSPHSPFFDPSKTQAERLTDAF----RRSVSRVGRF--RPTAMTSDGIQSRI 85

Query: 139 QDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVF 198
                         +GEY + + +G+PP     ++D+GSD+ W QC+PC+ CYKQ  P+F
Sbjct: 86  V-----------PSAGEYLMNLYIGTPPVPVIAIVDTGSDLTWTQCRPCTHCYKQVVPLF 134

Query: 199 DPADSASFSGVSCSSAVCDRL-ENAGC-HAGRCRYEVSYGDGSYTKGTLALETLTIGRTV 256
           DP +S+++   SC ++ C  L ++  C    +C +  SY DGS+T G LA ETLT+  T 
Sbjct: 135 DPKNSSTYRDSSCGTSFCLALGKDRSCSKEKKCTFRYSYADGSFTGGNLASETLTVDSTA 194

Query: 257 VKNV-----AIGCGHKNQGMF-VGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTG 310
            K V     A GCGH + G+F   ++G++GLGGG +SL+ QL     G FSYCL+   T 
Sbjct: 195 GKPVSFPGFAFGCGHSSGGIFDKSSSGIVGLGGGELSLISQLKSTINGLFSYCLLPVSTD 254

Query: 311 SSGS--LVFGREALPVGAAWV--PLVRNPRAP-SFYYVGLSGLGVGGMRIPISEDLFRLT 365
           SS S  + FG      G   V  PLV+  ++P +FYY+ L G+ VG  R+P  +   + T
Sbjct: 255 SSISSRINFGASGRVSGYGTVSTPLVQ--KSPDTFYYLTLEGISVGKKRLPY-KGYSKKT 311

Query: 366 QMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRV 425
           ++ +  +++D+GT  T LP   Y     +               IF  CYN +    +  
Sbjct: 312 EVEEGNIIVDSGTTYTFLPQEFYSKLEKSVANSIKGKRVRDPNGIFSLCYNTTA--EINA 369

Query: 426 PTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGAN 485
           P ++ +F    V   P + F+   +D    CF  AP+ S + ++GN+ Q    + FD   
Sbjct: 370 PIITAHFKDANVELQPLNTFMRMQEDL--VCFTVAPT-SDIGVLGNLAQVNFLVGFDLRK 426

Query: 486 GFVGFGPNVC 495
             V F    C
Sbjct: 427 KRVSFKAADC 436


>gi|223975883|gb|ACN32129.1| unknown [Zea mays]
 gi|223975971|gb|ACN32173.1| unknown [Zea mays]
 gi|224034191|gb|ACN36171.1| unknown [Zea mays]
 gi|413938623|gb|AFW73174.1| aspartic proteinase nepenthesin-1 isoform 1 [Zea mays]
 gi|413938624|gb|AFW73175.1| aspartic proteinase nepenthesin-1 isoform 2 [Zea mays]
          Length = 465

 Score =  199 bits (505), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 143/396 (36%), Positives = 208/396 (52%), Gaps = 16/396 (4%)

Query: 107 SFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPP 166
           S  AR+ +      TL+     G + ++  +       +  G   G G Y  R+G+G+P 
Sbjct: 78  SLAARLAKTPSSRPTLLDESRAGSSSSSPDDESLASVPLGPGTSVGVGNYVTRMGLGTPA 137

Query: 167 RSQYMVIDSGSDIVWVQCQPCS-QCYKQSDPVFDPADSASFSGVS-----CSSAVCDRLE 220
           +S  MV+D+GS + W+QC PC   C++QS PVF+P  S+S++ VS     CS      L 
Sbjct: 138 KSYVMVVDTGSSLTWLQCSPCVVSCHRQSGPVFNPKASSSYASVSCSAQQCSDLTTATLN 197

Query: 221 NAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLL 279
            A C     C Y+ SYGD S++ G L+ +T++ G T V N   GCG  N+G+F  +AGL+
Sbjct: 198 PASCSTSNVCIYQASYGDSSFSVGYLSKDTVSFGSTSVPNFYYGCGQDNEGLFGQSAGLI 257

Query: 280 GLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPS 339
           GL    +SL+ QL    G +FSYCL +  + SSG L  G    P   ++ P+  +    S
Sbjct: 258 GLARNKLSLLYQLAPSMGYSFSYCLPTSSSSSSGYLSIGSYN-PGQYSYTPMASSSLDDS 316

Query: 340 FYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQT 399
            Y++ ++G+ V G   P+S      + +     ++D+GT +TRLPT  Y A   A     
Sbjct: 317 LYFIKMTGIKVAGK--PLSVSSSAYSSL---PTIIDSGTVITRLPTGVYSALSKAVAGAM 371

Query: 400 GNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAF 459
              PRAS  SI DTC+       +RVP V+  F+GG  L L A N L+ VD A T C AF
Sbjct: 372 KGTPRASAFSILDTCFQGQA-ARLRVPEVTMAFAGGAALKLAARNLLVDVDSATT-CLAF 429

Query: 460 APSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           AP+ S  +IIGN QQ+   + +D  N  +GF    C
Sbjct: 430 APARSA-AIIGNTQQQTFSVVYDVKNSKIGFAAGGC 464


>gi|358346443|ref|XP_003637277.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355503212|gb|AES84415.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 434

 Score =  198 bits (504), Expect = 4e-48,   Method: Compositional matrix adjust.
 Identities = 133/428 (31%), Positives = 212/428 (49%), Gaps = 41/428 (9%)

Query: 81  LELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQD 140
           +E++HRD   S         + R  +  H    R + RV    +  S             
Sbjct: 30  IEMIHRDFSKSPLYHPTVTKFQRAYNVVH----RSINRVNYFTKEFSLNKNQP------- 78

Query: 141 FGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDP 200
                VS +    GEY +   VG+PP   Y  +D+GS+IVW+QCQPC+ C+ Q+ P+F+P
Sbjct: 79  -----VSTLTPELGEYLISYSVGTPPFKVYGFMDTGSNIVWLQCQPCNTCFNQTSPIFNP 133

Query: 201 ADSASFSGVSCSSAVCDRLENA--GCHAG--RCRYEVSYGDGSYTKGTLALETLTIGRT- 255
           + S+S+  + C+S+ C    +    C  G   C Y ++YG  + ++G L+ ++LT+  T 
Sbjct: 134 SKSSSYKNIPCTSSTCKDTNDTHISCSNGGDVCEYSITYGGDAKSQGDLSNDSLTLDSTS 193

Query: 256 ----VVKNVAIGCGHKNQGM-FVGAAGLLGLGGGSMSLVGQLGGQT-GGAFSYCLV--SR 307
               +  N+ IGCGH N       ++G++G+G G MSL+ Q+G  + G  FSYCL+  + 
Sbjct: 194 GSSVLFPNIVIGCGHINVLQDNSQSSGVVGMGRGPMSLIKQVGSSSVGSKFSYCLIPYNS 253

Query: 308 GTGSSGSLVFGREALPVGAAWV--PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLT 365
            + SS  L+FG + +  G   V  P+V+     ++Y++ L    VG  RI   E     T
Sbjct: 254 DSNSSSKLIFGEDVVVSGEIVVSTPMVKVNGQENYYFLTLEAFSVGNNRIEYGERSNAST 313

Query: 366 QMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVS-IFDTCYNLSGFVSVR 424
           Q     +++D+GT +T LP   + +   ++VAQ   LPR          CYN +G   + 
Sbjct: 314 Q----NILIDSGTPLTMLPN-LFLSKLVSYVAQEVKLPRIEPPDHHLSLCYNTTG-KQLN 367

Query: 425 VPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGA 484
           VP ++ +F+G  V  L ++    P +D G  CF F  S +GL I GNI Q  + I +D  
Sbjct: 368 VPDITAHFNGADV-KLNSNGTFFPFED-GIMCFGFI-SSNGLEIFGNIAQNNLLIDYDLE 424

Query: 485 NGFVGFGP 492
              + F P
Sbjct: 425 KEIISFKP 432


>gi|115442309|ref|NP_001045434.1| Os01g0954900 [Oryza sativa Japonica Group]
 gi|20161865|dbj|BAB90778.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
 gi|113534965|dbj|BAF07348.1| Os01g0954900 [Oryza sativa Japonica Group]
 gi|215766867|dbj|BAG99095.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 445

 Score =  198 bits (503), Expect = 5e-48,   Method: Compositional matrix adjust.
 Identities = 120/350 (34%), Positives = 174/350 (49%), Gaps = 16/350 (4%)

Query: 155 EYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSA 214
            Y  R G+G+P ++  + ID  +D  WV C  C+ C   S P F P  S+++  V C S 
Sbjct: 101 NYIARAGLGTPAQTLLVAIDPSNDAAWVPCSACAGC-AASSPSFSPTQSSTYRTVPCGSP 159

Query: 215 VCDRLENAGCHAG---RCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGM 271
            C ++ +  C AG    C + ++Y   ++ +  L  ++L +   VV +   GC     G 
Sbjct: 160 QCAQVPSPSCPAGVGSSCGFNLTYAASTF-QAVLGQDSLALENNVVVSYTFGCLRVVSGN 218

Query: 272 FVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALPVGAAWVP 330
            V   GL+G G G +S + Q     G  FSYCL + R +  SG+L  G    P      P
Sbjct: 219 SVPPQGLIGFGRGPLSFLSQTKDTYGSVFSYCLPNYRSSNFSGTLKLGPIGQPKRIKTTP 278

Query: 331 LVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEA 390
           L+ NP  PS YYV + G+ VG   + + +       +   G ++D GT  TRL  P Y A
Sbjct: 279 LLYNPHRPSLYYVNMIGIRVGSKVVQVPQSALAFNPVTGSGTIIDAGTMFTRLAAPVYAA 338

Query: 391 FRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVD 450
            RDAF  +    P A  +  FDTCYN    V+V VPTV+F F+G   +TLP  N +I   
Sbjct: 339 VRDAFRGRV-RTPVAPPLGGFDTCYN----VTVSVPTVTFMFAGAVAVTLPEENVMIHSS 393

Query: 451 DAGTFCFAFAPSPS-----GLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
             G  C A A  PS      L+++ ++QQ+  ++ FD ANG VGF   +C
Sbjct: 394 SGGVACLAMAAGPSDGVNAALNVLASMQQQNQRVLFDVANGRVGFSRELC 443


>gi|18391062|ref|NP_563851.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|2160166|gb|AAB60729.1| F21M12.13 gene product [Arabidopsis thaliana]
 gi|21593996|gb|AAM65914.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
 gi|26983826|gb|AAN86165.1| unknown protein [Arabidopsis thaliana]
 gi|332190367|gb|AEE28488.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 449

 Score =  198 bits (503), Expect = 5e-48,   Method: Compositional matrix adjust.
 Identities = 129/362 (35%), Positives = 184/362 (50%), Gaps = 17/362 (4%)

Query: 145 VVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSA 204
           V SG     G Y VR  +G+PP+  +MV+D+ +D VW+ C  CS C   +   F+   S+
Sbjct: 93  VASGNQLHIGNYVVRAKLGTPPQLMFMVLDTSNDAVWLPCSGCSGC-SNASTSFNTNSSS 151

Query: 205 SFSGVSCSSAVCDRLENAGCHAGR-----CRYEVSYGDGSYTKGTLALETLTIGRTVVKN 259
           ++S VSCS+A C +     C +       C +  SYG  S    +L  +TLT+   V+ N
Sbjct: 152 TYSTVSCSTAQCTQARGLTCPSSSPQPSVCSFNQSYGGDSSFSASLVQDTLTLAPDVIPN 211

Query: 260 VAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFG 318
            + GC +   G  +   GL+GLG G MSLV Q      G FSYCL S R    SGSL  G
Sbjct: 212 FSFGCINSASGNSLPPQGLMGLGRGPMSLVSQTTSLYSGVFSYCLPSFRSFYFSGSLKLG 271

Query: 319 REALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGT 378
               P    + PL+RNPR PS YYV L+G+ VG +++P+             G ++D+GT
Sbjct: 272 LLGQPKSIRYTPLLRNPRRPSLYYVNLTGVSVGSVQVPVDPVYLTFDANSGAGTIIDSGT 331

Query: 379 AVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVL 438
            +TR   P YEA RD F  Q  N+   S +  FDTC++         P ++ + +    L
Sbjct: 332 VITRFAQPVYEAIRDEFRKQV-NVSSFSTLGAFDTCFSADN--ENVAPKITLHMTSLD-L 387

Query: 439 TLPASNFLIPVDDAGTF-CFAFA----PSPSGLSIIGNIQQEGIQISFDGANGFVGFGPN 493
            LP  N LI    AGT  C + A     + + L++I N+QQ+ ++I FD  N  +G  P 
Sbjct: 388 KLPMENTLI-HSSAGTLTCLSMAGIRQNANAVLNVIANLQQQNLRILFDVPNSRIGIAPE 446

Query: 494 VC 495
            C
Sbjct: 447 PC 448


>gi|242044888|ref|XP_002460315.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
 gi|241923692|gb|EER96836.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
          Length = 456

 Score =  198 bits (503), Expect = 6e-48,   Method: Compositional matrix adjust.
 Identities = 137/407 (33%), Positives = 199/407 (48%), Gaps = 28/407 (6%)

Query: 112 MQRDVKRVATL--VRRLSGGGADAAKHEVQDFGTDV-VSGMDQGSGEYFVRIGVGSPPRS 168
           MQR   R A L  VR  +     + K++ Q       VS    G  EY V + +G+PP+ 
Sbjct: 55  MQRSKARAAALSAVRNRAASARFSGKNDDQRTTPPTGVSVRPSGDLEYVVDLAIGTPPQP 114

Query: 169 QYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCH-AG 227
              ++D+GSD++W QC PC+ C  Q DP+F P +SAS+  + C+  +C  + + GC    
Sbjct: 115 VSALLDTGSDLIWTQCAPCASCLAQPDPLFAPGESASYEPMRCAGQLCSDILHHGCEMPD 174

Query: 228 RCRYEVSYGDGSYTKGTLALETLTI-----GRTVVKNVAIGCGHKNQGMFVGAAGLLGLG 282
            C Y  +YGDG+ T G  A E  T       R +   +  GCG  N G     +G++G G
Sbjct: 175 TCTYRYNYGDGTMTMGVYATERFTFTSSGGDRLMTVPLGFGCGSMNVGSLNNGSGIVGFG 234

Query: 283 GGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPV------GAAWVPLVRNPR 336
              +SLV QL  +    FSYCL S G+G   +L+FG  +  V           PL+++ +
Sbjct: 235 RNPLSLVSQLSIRR---FSYCLTSYGSGRKSTLLFGSLSGGVYGDATGPVQTTPLLQSLQ 291

Query: 337 APSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFV 396
            P+FYYV L+GL VG  R+ I E  F L   G  GV++D+GTA+T LP         AF 
Sbjct: 292 NPTFYYVHLAGLTVGARRLRIPESAFALRPDGSGGVIVDSGTALTLLPGAVLAEVVRAFR 351

Query: 397 AQTGNLPRASGVSIFD-TCYNL-------SGFVSVRVPTVSFYFSGGPVLTLPASNFLIP 448
            Q   LP A+G +  D  C+ +       S    V VP + F+F     L LP  N+++ 
Sbjct: 352 QQL-RLPFANGGNPEDGVCFLVPAAWRRSSSTSQVPVPRMVFHFQDAD-LDLPRRNYVLD 409

Query: 449 VDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
               G  C   A S    S IGN+ Q+ +++ +D     + F P  C
Sbjct: 410 DHRKGRLCLLLADSGDDGSTIGNLVQQDMRVLYDLEAETLSFAPAQC 456


>gi|15230868|ref|NP_189198.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|11994761|dbj|BAB03090.1| chloroplast nucleoid DNA binding protein-like; nucellin-like
           protein [Arabidopsis thaliana]
 gi|189339286|gb|ACD89063.1| At3g25700 [Arabidopsis thaliana]
 gi|332643533|gb|AEE77054.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 452

 Score =  198 bits (503), Expect = 6e-48,   Method: Compositional matrix adjust.
 Identities = 133/381 (34%), Positives = 189/381 (49%), Gaps = 30/381 (7%)

Query: 143 TDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDP-VFDPA 201
           + VVSG   GSG+YFV + +G PP+S  ++ D+GSD+VWV+C  C  C   S   VF P 
Sbjct: 71  SPVVSGAASGSGQYFVDLRIGQPPQSLLLIADTGSDLVWVKCSACRNCSHHSPATVFFPR 130

Query: 202 DSASFSGVSCSSAVCDRLENAG----CHAGR----CRYEVSYGDGSYTKGTLALETLTI- 252
            S++FS   C   VC  +        C+  R    C YE  Y DGS T G  A ET ++ 
Sbjct: 131 HSSTFSPAHCYDPVCRLVPKPDRAPICNHTRIHSTCHYEYGYADGSLTSGLFARETTSLK 190

Query: 253 ----GRTVVKNVAIGCGHKNQGM------FVGAAGLLGLGGGSMSLVGQLGGQTGGAFSY 302
                   +K+VA GCG +  G       F GA G++GLG G +S   QLG + G  FSY
Sbjct: 191 TSSGKEARLKSVAFGCGFRISGQSVSGTSFNGANGVMGLGRGPISFASQLGRRFGNKFSY 250

Query: 303 CLV--SRGTGSSGSLVFGREALPVGAA-WVPLVRNPRAPSFYYVGLSGLGVGGMRIPISE 359
           CL+  +     +  L+ G     +    + PL+ NP +P+FYYV L  + V G ++ I  
Sbjct: 251 CLMDYTLSPPPTSYLIIGNGGDGISKLFFTPLLTNPLSPTFYYVKLKSVFVNGAKLRIDP 310

Query: 360 DLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSI-FDTCYNLS 418
            ++ +   G+ G V+D+GT +  L  PAY +   A V +   LP A  ++  FD C N+S
Sbjct: 311 SIWEIDDSGNGGTVVDSGTTLAFLAEPAYRSVIAA-VRRRVKLPIADALTPGFDLCVNVS 369

Query: 419 GFVSVR--VPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAF--APSPSGLSIIGNIQQ 474
           G       +P + F FSGG V   P  N+ I  ++    C A        G S+IGN+ Q
Sbjct: 370 GVTKPEKILPRLKFEFSGGAVFVPPPRNYFIETEEQIQ-CLAIQSVDPKVGFSVIGNLMQ 428

Query: 475 EGIQISFDGANGFVGFGPNVC 495
           +G    FD     +GF    C
Sbjct: 429 QGFLFEFDRDRSRLGFSRRGC 449


>gi|125595861|gb|EAZ35641.1| hypothetical protein OsJ_19928 [Oryza sativa Japonica Group]
          Length = 629

 Score =  198 bits (503), Expect = 6e-48,   Method: Compositional matrix adjust.
 Identities = 137/398 (34%), Positives = 204/398 (51%), Gaps = 52/398 (13%)

Query: 115 DVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPR------- 167
           D +R   + +RL+G     A  + Q       +   + +G+Y    G+GS P        
Sbjct: 8   DQRRADYIQKRLTG-----ATDDKQPMAFSSRTSQYEKNGQYATNGGLGSVPHLKSLSTT 62

Query: 168 -------------SQYMVIDSGSDIVWVQCQPCS--QCYKQSDPVFDPADSASFSGVSCS 212
                        +Q ++IDSGSD+ WVQC+PC    C++Q DP+FDPA S +++ V C+
Sbjct: 63  ATTNSAPDGTSAVTQTVIIDSGSDVSWVQCKPCPLPMCHRQRDPLFDPAMSTTYAAVPCT 122

Query: 213 SAVCDRL--ENAGCHA-GRCRYEVSYGDGSYTKGTLALETLTIGR-TVVKNVAIGCGHKN 268
           SA C +L     GC A  +C++ ++YGDGS   GT + + LT+G   V++    GC H +
Sbjct: 123 SAACAQLGPYRRGCSANAQCQFGINYGDGSTATGTYSFDDLTLGPYDVIRGFRFGCAHAD 182

Query: 269 QG--MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFG----REAL 322
           +G       AG L LGGGS SLV Q   + G  FSYCL    + S G LV G    R  L
Sbjct: 183 RGSAFDYDVAGSLALGGGSQSLVQQTATRYGRVFSYCLPPTAS-SLGFLVLGVPPERAQL 241

Query: 323 PVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTR 382
                  PL+ +  AP+FY V L  + V G  + +   +F  +       V+D+ T ++R
Sbjct: 242 IPSFVSTPLLSSSMAPTFYRVLLRAIIVAGRPLAVPPAVFSASS------VIDSSTIISR 295

Query: 383 LPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPA 442
           LP  AY+A R AF +       A  VSI DTCY+ +G  S+ +P+++  F GG  + L A
Sbjct: 296 LPPTAYQALRAAFRSAMTMYRAAPPVSILDTCYDFTGVRSITLPSIALVFDGGATVNLDA 355

Query: 443 SNFLIPVDDAGTFCFAFAPSPSGL--SIIGNIQQEGIQ 478
           +  L+     G+ C AFAP+ S      IGN+QQ+ ++
Sbjct: 356 AGILL-----GS-CLAFAPTASDRMPGFIGNVQQKTLE 387



 Score = 99.8 bits (247), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 84/283 (29%), Positives = 125/283 (44%), Gaps = 51/283 (18%)

Query: 223 GCHA-GRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGL 281
           GC A  +C++ ++YGDGS   GT + + LT+G   V           QG+ +  A     
Sbjct: 388 GCSANAQCQFGINYGDGSTATGTYSFDDLTLGPYDVDR---------QGLPLRTAT---- 434

Query: 282 GGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVP-LVRNP----- 335
                        Q G  FSYC +     S G +  G    P  AA VP  V  P     
Sbjct: 435 -------------QYGRVFSYC-IPPSPSSLGFITLGVP--PQRAALVPTFVSTPLLSSS 478

Query: 336 -RAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDA 394
              P+FY V L  + V G  +P+   +F  +       V+ + T ++RLP  AY+A R A
Sbjct: 479 SMPPTFYRVLLRAIIVAGRPLPVPPTVFSTSS------VIASTTVISRLPPTAYQALRAA 532

Query: 395 FVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGT 454
           F         A  VSI DTCY+ +G  S+ +P+++  F GG  + L A+  L+       
Sbjct: 533 FRRAMTMYRTAPPVSILDTCYDFTGVRSITLPSIALVFDGGATVNLDAAGILL------Q 586

Query: 455 FCFAFAPSPSGL--SIIGNIQQEGIQISFDGANGFVGFGPNVC 495
            C AFAP+ +      IGN+QQ  +++ +D     + F    C
Sbjct: 587 GCLAFAPTATDRMPGFIGNVQQRTLEVVYDVPGKAIRFRSAAC 629


>gi|125529158|gb|EAY77272.1| hypothetical protein OsI_05246 [Oryza sativa Indica Group]
          Length = 426

 Score =  198 bits (503), Expect = 7e-48,   Method: Compositional matrix adjust.
 Identities = 120/350 (34%), Positives = 174/350 (49%), Gaps = 16/350 (4%)

Query: 155 EYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSA 214
            Y  R G+G+P ++  + ID  +D  WV C  C+ C   S P F P  S+++  V C S 
Sbjct: 82  NYIARAGLGTPAQTLLVAIDPSNDAAWVPCSACAGC-AASSPSFSPTQSSTYRTVPCGSP 140

Query: 215 VCDRLENAGCHAG---RCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGM 271
            C ++ +  C AG    C + ++Y   ++ +  L  ++L +   VV +   GC     G 
Sbjct: 141 QCAQVPSPSCPAGVGSSCGFNLTYAASTF-QAVLGQDSLALENNVVVSYTFGCLRVVSGN 199

Query: 272 FVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALPVGAAWVP 330
            V   GL+G G G +S + Q     G  FSYCL + R +  SG+L  G    P      P
Sbjct: 200 SVPPQGLIGFGRGPLSFLSQTKDTYGSVFSYCLPNYRSSNFSGTLKLGPIGQPKRIKTTP 259

Query: 331 LVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEA 390
           L+ NP  PS YYV + G+ VG   + + +       +   G ++D GT  TRL  P Y A
Sbjct: 260 LLYNPHRPSLYYVNMIGIRVGSKVVQVPQSALAFNPVTGSGTIIDAGTMFTRLAAPVYAA 319

Query: 391 FRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVD 450
            RDAF  +    P A  +  FDTCYN    V+V VPTV+F F+G   +TLP  N +I   
Sbjct: 320 VRDAFRGRV-RTPVAPPLGGFDTCYN----VTVSVPTVTFMFAGAVAVTLPEENVMIHSS 374

Query: 451 DAGTFCFAFAPSPS-----GLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
             G  C A A  PS      L+++ ++QQ+  ++ FD ANG VGF   +C
Sbjct: 375 SGGVACLAMAAGPSDGVNAALNVLASMQQQNQRVLFDVANGRVGFSRELC 424


>gi|224286159|gb|ACN40790.1| unknown [Picea sitchensis]
          Length = 452

 Score =  197 bits (502), Expect = 7e-48,   Method: Compositional matrix adjust.
 Identities = 135/349 (38%), Positives = 196/349 (56%), Gaps = 16/349 (4%)

Query: 152 GSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSC 211
           GSGEY +++  G+P +S Y +ID+GSD+ W+ C+ C  C+  + P+FDPA S+S+   +C
Sbjct: 111 GSGEYIIQVDFGTPKQSMYTLIDTGSDVAWIPCKQCQGCHSTA-PIFDPAKSSSYKPFAC 169

Query: 212 SSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGM 271
            S  C  +        +C++EV YGDG+   GTLA + +T+G   + N + GC       
Sbjct: 170 DSQPCQEISGNCGGNSKCQFEVLYGDGTQVDGTLASDAITLGSQYLPNFSFGCAESLSED 229

Query: 272 FVGAAGLLGLGGGSMSLVGQLGGQT--GGAFSYCLVSRGTGSSGSLVFGREALPVGAA-- 327
              + GL+GLGGGS+SL+ Q       GG FSYCL    + SSGSLV G+EA    ++  
Sbjct: 230 TYSSPGLMGLGGGSLSLLTQAPTAELFGGTFSYCL-PSSSTSSGSLVLGKEAAVSSSSLK 288

Query: 328 WVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGD-DGVVMDTGTAVTRLPTP 386
           +  L+++P  P+FY+V L  + VG  RI +       T +    G ++D+GT +T L   
Sbjct: 289 FTTLIKDPSFPTFYFVTLKAISVGNTRISVPA-----TNIASGGGTIIDSGTTITYLVPS 343

Query: 387 AYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFL 446
           AY+  RDAF  Q  +L + + V   DTCY+LS   SV VPT++ +      L LP  N L
Sbjct: 344 AYKDLRDAFRQQLSSL-QPTPVEDMDTCYDLSS-SSVDVPTITLHLDRNVDLVLPKENIL 401

Query: 447 IPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           I   ++G  C AF+ S    SIIGN+QQ+  +I FD  N  VGF    C
Sbjct: 402 I-TQESGLSCLAFS-STDSRSIIGNVQQQNWRIVFDVPNSQVGFAQEQC 448


>gi|312283333|dbj|BAJ34532.1| unnamed protein product [Thellungiella halophila]
          Length = 428

 Score =  197 bits (502), Expect = 7e-48,   Method: Compositional matrix adjust.
 Identities = 127/349 (36%), Positives = 179/349 (51%), Gaps = 15/349 (4%)

Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
           S  Y VR  +G+P +   + +D+ +D  WV C  C  C   S  +FDP+ S+S   + C 
Sbjct: 88  SPTYIVRANIGTPAQPMLVALDTSNDAAWVPCSGCVGC--ASSVLFDPSKSSSSRNLQCD 145

Query: 213 SAVCDRLENAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGM 271
           +  C +  N  C AG+ C + ++YG GS  + +L  +TLT+   V+K+   GC  K  G 
Sbjct: 146 APQCKQAPNPTCTAGKSCGFNMTYG-GSTIEASLTQDTLTLANDVIKSYTFGCISKATGT 204

Query: 272 FVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLV-SRGTGSSGSLVFGREALPVGAAWVP 330
            + A GL+GLG G +SL+ Q        FSYCL  S+ +  SGSL  G +  PV     P
Sbjct: 205 SLPAQGLMGLGRGPLSLISQTQNLYMSTFSYCLPNSKSSNFSGSLRLGPKYQPVRIKTTP 264

Query: 331 LVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEA 390
           L++NPR  S YYV L G+ VG   + I             G + D+GT  TRL  PAY A
Sbjct: 265 LLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDASTGAGTIFDSGTVFTRLVEPAYVA 324

Query: 391 FRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVD 450
            R+ F  +  N   A+ +  FDTCY  SG  SV  P+V+F F+G  V TLP  N LI   
Sbjct: 325 VRNEFRRRIKNA-NATSLGGFDTCY--SG--SVVYPSVTFMFAGMNV-TLPPDNLLIHSS 378

Query: 451 DAGTFCFAFAPSP----SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
              T C A A +P    S L++I ++QQ+  ++  D  N  +G     C
Sbjct: 379 SGSTSCLAMAAAPNNVNSVLNVIASMQQQNHRVLIDLPNSRLGISRETC 427


>gi|413938618|gb|AFW73169.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
          Length = 324

 Score =  197 bits (502), Expect = 8e-48,   Method: Compositional matrix adjust.
 Identities = 142/341 (41%), Positives = 192/341 (56%), Gaps = 33/341 (9%)

Query: 171 MVIDSGSDIVWVQCQPCS---QCYKQSDPVFDPADSASFSGVSCSSAVCDRL---ENAGC 224
           M +D+GSD+ WVQC+PC+    CY Q DP+FDPA S+S++ V C   VC  L     + C
Sbjct: 1   MEVDTGSDLSWVQCKPCAAAPSCYSQKDPLFDPAQSSSYAAVPCGGPVCAGLGIYAASAC 60

Query: 225 HAGRCRYEVSYGDGSYTKGTLALETLTI-GRTVVKNVAIGCGHKNQGMFVGAAGLLGLGG 283
            A +C Y VSYGDGS T G  + +TLT+   + V+    GCGH   G+F G  GLLGLG 
Sbjct: 61  SAAQCGYVVSYGDGSNTTGVYSSDTLTLSASSAVQGFFFGCGHAQSGLFNGVDGLLGLGR 120

Query: 284 GSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAA----WVPLVRNPRAPS 339
              SLV Q  G  GG FSYCL ++ + ++G L  G    P GAA       L+ +P AP+
Sbjct: 121 EQPSLVEQTAGTYGGVFSYCLPTKPS-TAGYLTLGVGG-PSGAAPGFSTTQLLPSPNAPT 178

Query: 340 FYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQT 399
           +Y V L+G+ VGG ++ +    F    +      +DTGT VTRLP  AY A R AF +  
Sbjct: 179 YYVVMLTGISVGGQQLSVPASAFAGGTV------VDTGTVVTRLPPTAYAALRSAFRSGM 232

Query: 400 GN--LPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTF-C 456
            +   P A    I DTCYN +G+ +V +P V+  F  G  +TL A   L       +F C
Sbjct: 233 ASYGYPTAPSNGILDTCYNFAGYGTVTLPNVALTFGSGATVTLGADGIL-------SFGC 285

Query: 457 FAFAPSPS--GLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
            AFAPS S  G++I+GN+QQ   ++  DG +  VGF P+ C
Sbjct: 286 LAFAPSGSDGGMAILGNVQQRSFEVRIDGTS--VGFKPSSC 324


>gi|219886223|gb|ACL53486.1| unknown [Zea mays]
 gi|238015146|gb|ACR38608.1| unknown [Zea mays]
 gi|413938611|gb|AFW73162.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
 gi|413938612|gb|AFW73163.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
 gi|413938613|gb|AFW73164.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
 gi|413938614|gb|AFW73165.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
          Length = 467

 Score =  197 bits (501), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 136/355 (38%), Positives = 190/355 (53%), Gaps = 16/355 (4%)

Query: 148 GMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS-QCYKQSDPVFDP-----A 201
           G   G G Y  R+G+G+P +S  MV+D+GS + W+QC PC   C++QS PVF+P      
Sbjct: 121 GTSVGVGNYVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVSCHRQSGPVFNPKASSSY 180

Query: 202 DSASFSGVSCSSAVCDRLENAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRTVVKNV 260
            S S S   CS      L  A C     C Y+ SYGD S++ G L+ +T++ G T V N 
Sbjct: 181 TSVSCSAQQCSDLTTATLNPASCSTSNVCIYQASYGDSSFSVGYLSKDTVSFGSTSVPNF 240

Query: 261 AIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGRE 320
             GCG  N+G+F  +AGL+GL    +SL+ QL    G +FSYCL +  + SSG L  G  
Sbjct: 241 YYGCGQDNEGLFGQSAGLIGLARNKLSLLYQLAPSMGYSFSYCLPTSSSSSSGYLSIGSY 300

Query: 321 ALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAV 380
             P   ++ P+  +    S Y++ ++G+ V G   P+S      + +     ++D+GT +
Sbjct: 301 N-PGQYSYTPMASSSLDDSLYFIKMTGIKVAGK--PLSVSSSAYSSL---PTIIDSGTVI 354

Query: 381 TRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTL 440
           TRLPT  Y A   A        PRAS  SI DTC+       +RVP V+  F+GG  L L
Sbjct: 355 TRLPTGVYSALSKAVAGAMKGTPRASAFSILDTCFQGQA-ARLRVPEVTMAFAGGAALKL 413

Query: 441 PASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
            A N L+ VD A T C AFAP+ S  +IIGN QQ+   + +D  N  +GF    C
Sbjct: 414 AARNLLVDVDSATT-CLAFAPARSA-AIIGNTQQQTFSVVYDVKNSKIGFAAGGC 466


>gi|212275300|ref|NP_001130675.1| uncharacterized protein LOC100191778 precursor [Zea mays]
 gi|194706308|gb|ACF87238.1| unknown [Zea mays]
          Length = 467

 Score =  197 bits (501), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 136/355 (38%), Positives = 190/355 (53%), Gaps = 16/355 (4%)

Query: 148 GMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS-QCYKQSDPVFDP-----A 201
           G   G G Y  R+G+G+P +S  MV+D+GS + W+QC PC   C++QS PVF+P      
Sbjct: 121 GTSVGVGNYVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVSCHRQSGPVFNPKASSSY 180

Query: 202 DSASFSGVSCSSAVCDRLENAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRTVVKNV 260
            S S S   CS      L  A C     C Y+ SYGD S++ G L+ +T++ G T V N 
Sbjct: 181 TSVSCSAQQCSDLTTATLSPASCSTSNVCIYQASYGDSSFSVGYLSKDTVSFGSTSVPNF 240

Query: 261 AIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGRE 320
             GCG  N+G+F  +AGL+GL    +SL+ QL    G +FSYCL +  + SSG L  G  
Sbjct: 241 YYGCGQDNEGLFGQSAGLIGLARNKLSLLYQLAPSMGYSFSYCLPTSSSSSSGYLSIGSY 300

Query: 321 ALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAV 380
             P   ++ P+  +    S Y++ ++G+ V G   P+S      + +     ++D+GT +
Sbjct: 301 N-PGQYSYTPMASSSLDDSLYFIKMTGIKVAGK--PLSVSSSAYSSL---PTIIDSGTVI 354

Query: 381 TRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTL 440
           TRLPT  Y A   A        PRAS  SI DTC+       +RVP V+  F+GG  L L
Sbjct: 355 TRLPTGVYSALSKAVAGAMKGTPRASAFSILDTCFQGQA-ARLRVPEVTMAFAGGAALKL 413

Query: 441 PASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
            A N L+ VD A T C AFAP+ S  +IIGN QQ+   + +D  N  +GF    C
Sbjct: 414 AARNLLVDVDSATT-CLAFAPARSA-AIIGNTQQQTFSVVYDVKNSKIGFAAGGC 466


>gi|357113696|ref|XP_003558637.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 432

 Score =  197 bits (500), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 131/411 (31%), Positives = 191/411 (46%), Gaps = 27/411 (6%)

Query: 106 HSFHARMQRDVKRVATLVR----RLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIG 161
           H+ H      ++ +  L R    RL    + AA   V      V SG  Q    Y VR G
Sbjct: 27  HNVHPPSSSPLESIIALAREDDARLLFLSSKAASTGVSS--APVASG--QSPPSYVVRAG 82

Query: 162 VGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLEN 221
           +GSP +   + +D+ +D  W  C PC  C   S  +F PA+S S++ + CSS +C  L+ 
Sbjct: 83  LGSPAQPILLALDTSADATWAHCSPCGTC-PSSGSLFAPANSTSYAPLPCSSTMCTVLQG 141

Query: 222 AGCHA----------GRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGM 271
             C A            C +   + D S+ + +LA + L +G+  + N A GC     G 
Sbjct: 142 QPCPAQDPYDSSAPLPMCAFTKPFADASF-QASLASDWLHLGKDAIPNYAFGCVSAVSGP 200

Query: 272 F--VGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALPVGAAW 328
              +   GLLGLG G M+L+ Q+G    G FSYCL S +    SGSL  G    P G  +
Sbjct: 201 TANLPKQGLLGLGRGPMALLSQVGNMYNGVFSYCLPSYKSYYFSGSLRLGAAGQPRGVRY 260

Query: 329 VPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAY 388
            P+++NP   S YYV ++GL VG   + +    F        G V+D+GT +TR   P Y
Sbjct: 261 TPMLKNPNRSSLYYVNVTGLSVGRAPVKVPAGSFAFDPATGAGTVVDSGTVITRWTPPVY 320

Query: 389 EAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIP 448
            A R+ F          + +  FDTC+N     +   P V+ +  GG  L LP  N LI 
Sbjct: 321 AALREEFRRHVAAPSGYTSLGAFDTCFNTDEVAAGVAPAVTVHMDGGLDLALPMENTLIH 380

Query: 449 VDDAGTFCFAFAPSPSG----LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
                  C A A +P      ++++ N+QQ+ +++ FD AN  VGF    C
Sbjct: 381 SSATPLACLAMAEAPQNVNAVVNVLANLQQQNLRVVFDVANSRVGFARESC 431


>gi|21450872|gb|AAK44106.2|AF370291_1 unknown protein [Arabidopsis thaliana]
          Length = 375

 Score =  197 bits (500), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 129/362 (35%), Positives = 184/362 (50%), Gaps = 17/362 (4%)

Query: 145 VVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSA 204
           V SG     G Y VR  +G+PP+  +MV+D+ +D VW+ C  CS C   +   F+   S+
Sbjct: 19  VASGNQLHIGNYVVRAKLGTPPQLMFMVLDTSNDAVWLPCSGCSGC-SNASTSFNTNSSS 77

Query: 205 SFSGVSCSSAVCDRLENAGCHAGR-----CRYEVSYGDGSYTKGTLALETLTIGRTVVKN 259
           ++S VSCS+A C +     C +       C +  SYG  S    +L  +TLT+   V+ N
Sbjct: 78  TYSTVSCSTAQCTQARGLTCPSSSPQPSVCSFNQSYGGDSSFSASLVQDTLTLAPDVIPN 137

Query: 260 VAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFG 318
            + GC +   G  +   GL+GLG G MSLV Q      G FSYCL S R    SGSL  G
Sbjct: 138 FSFGCINSASGNSLPPQGLMGLGRGPMSLVSQTTSLYSGVFSYCLPSFRSFYFSGSLKLG 197

Query: 319 REALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGT 378
               P    + PL+RNPR PS YYV L+G+ VG +++P+             G ++D+GT
Sbjct: 198 LLGQPKSIRYTPLLRNPRRPSLYYVNLTGVSVGSVQVPVDPVYLTFDANSGAGTIIDSGT 257

Query: 379 AVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVL 438
            +TR   P YEA RD F  Q  N+   S +  FDTC++         P ++ + +    L
Sbjct: 258 VITRFAQPVYEAIRDEFRKQV-NVSSFSTLGAFDTCFSADN--ENVAPKITLHMTSLD-L 313

Query: 439 TLPASNFLIPVDDAGTF-CFAFA----PSPSGLSIIGNIQQEGIQISFDGANGFVGFGPN 493
            LP  N LI    AGT  C + A     + + L++I N+QQ+ ++I FD  N  +G  P 
Sbjct: 314 KLPMENTLI-HSSAGTLTCLSMAGIRQNANAVLNVIANLQQQNLRILFDVPNSRIGIAPE 372

Query: 494 VC 495
            C
Sbjct: 373 PC 374


>gi|413952718|gb|AFW85367.1| hypothetical protein ZEAMMB73_231535 [Zea mays]
          Length = 443

 Score =  196 bits (499), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 119/367 (32%), Positives = 172/367 (46%), Gaps = 35/367 (9%)

Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
           + EY VR+ VG+P R   + +D+GSD+VW QC PC  C+ Q  PV DPA S++++ + C 
Sbjct: 81  TNEYLVRLAVGTPRRPVALTLDTGSDLVWTQCAPCRDCFDQDLPVLDPAASSTYAALPCG 140

Query: 213 SAVCDRLENAGC------HAGRCRYEVSYGDGSYTKGTLALETLTIGRT-------VVKN 259
           +A C  L    C      +   C Y   YGD S T G +A +  T G +         + 
Sbjct: 141 AARCRALPFTSCGVRTLGNHRSCIYAYHYGDKSLTVGEIATDRFTFGDSGGSGESLHTRR 200

Query: 260 VAIGCGHKNQGMF-VGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFG 318
           +  GCGH N+G+F     G+ G G G  SL  QL   +   FSYC  S     S  +  G
Sbjct: 201 LTFGCGHLNKGVFQSNETGIAGFGRGRWSLPSQLNVTS---FSYCFTSMFESKSSLVTLG 257

Query: 319 -------REALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDG 371
                    A        P+++NP  PS Y++ L G+ VG  R+P+ E  FR T      
Sbjct: 258 GSPAALYSHAHSGEVRTTPILKNPSQPSLYFLSLKGISVGKTRLPVPETKFRST------ 311

Query: 372 VVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVR---VPTV 428
            ++D+G ++T LP   YEA +  F AQ G  P     S  D C+ L      R   VP++
Sbjct: 312 -IIDSGASITTLPEEVYEAVKAEFAAQVGLPPSGVEGSALDLCFALPVTALWRRPAVPSL 370

Query: 429 SFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFV 488
           + +  G     LP SN++     A   C     +P   ++IGN QQ+   + +D  N  +
Sbjct: 371 TLHLEGAD-WELPRSNYVFEDLGARVMCIVLDAAPGEQTVIGNFQQQNTHVVYDLENDRL 429

Query: 489 GFGPNVC 495
            F P  C
Sbjct: 430 SFAPARC 436


>gi|317106730|dbj|BAJ53226.1| JHL06P13.6 [Jatropha curcas]
          Length = 445

 Score =  196 bits (499), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 141/444 (31%), Positives = 213/444 (47%), Gaps = 40/444 (9%)

Query: 68  SSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLS 127
           +S   S +   +   L+HRD   S      N ++ R Q SFH  + R          R +
Sbjct: 22  TSLTASMNNGSFTASLIHRDSPISPLYNPKNTYFDRLQSSFHRSISR--------ANRFT 73

Query: 128 GGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC 187
                AAK        D++     G GEYF+RI +G+PP    ++ D+GSD++WVQCQPC
Sbjct: 74  PNSVSAAK----TLEYDII----PGGGEYFMRISIGTPPIEVLVIADTGSDLIWVQCQPC 125

Query: 188 SQCYKQSDPVFDPADSASFSGVSCSSAVCDRLEN--AGCHA----GRCRYEVSYGDGSYT 241
            +CYKQ  P+F+P  S+++  V C +  C+ L +    C A      C Y  SYGD S+T
Sbjct: 126 QECYKQKSPIFNPKQSSTYRRVLCETRYCNALNSDMRACSAHGFFKACGYSYSYGDHSFT 185

Query: 242 KGTLALETLTIGRT--VVKNVAIGCGHKNQGMF-VGAAGLLGLGGGSMSLVGQLGGQTGG 298
            G LA E   IG T   ++ +A GCG+ N G F    +G++GLGGGS+SL+ QLG +   
Sbjct: 186 MGYLATERFIIGSTNNSIQELAFGCGNSNGGNFDEVGSGIVGLGGGSLSLISQLGTKIDN 245

Query: 299 AFSYCLV---SRGTGSSGSLVFGREALPVGA---AWVPLV-RNPRAPSFYYVGLSGLGVG 351
            FSYCLV    +   S G +VFG  +   G+      PLV + P   +FYY+ L  + VG
Sbjct: 246 KFSYCLVPILEKSNFSLGKIVFGDNSFISGSDTYVSTPLVSKEPE--TFYYLTLEAISVG 303

Query: 352 GMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIF 411
             R+   E+      +    +++D+GT +T L +  Y                +    IF
Sbjct: 304 NERLAY-ENSRNDGNVEKGNIIIDSGTTLTFLDSKLYNKLELVLEKAVEGERVSDPNGIF 362

Query: 412 DTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGN 471
             C+     + + +P ++ +F+   V   P + F    +D    CF   PS +G++I GN
Sbjct: 363 SICFRDK--IGIELPIITVHFTDADVELKPINTFAKAEEDL--LCFTMIPS-NGIAIFGN 417

Query: 472 IQQEGIQISFDGANGFVGFGPNVC 495
           + Q    + +D     V F P  C
Sbjct: 418 LAQMNFLVGYDLDKNCVSFMPTDC 441


>gi|223946005|gb|ACN27086.1| unknown [Zea mays]
          Length = 336

 Score =  196 bits (498), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 120/341 (35%), Positives = 171/341 (50%), Gaps = 26/341 (7%)

Query: 173 IDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCRYE 232
           +D+GSD++W QC PC  C  Q  P FD   SA++  + C S+ C  L +  C    C Y+
Sbjct: 1   MDTGSDLIWTQCAPCLLCADQPTPYFDVKKSATYRALPCRSSRCASLSSPSCFKKMCVYQ 60

Query: 233 VSYGDGSYTKGTLALETLTIG-----RTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMS 287
             YGD + T G LA ET T G     +    N+A GCG  N G    ++G++G G G +S
Sbjct: 61  YYYGDTASTAGVLANETFTFGAANSTKVRATNIAFGCGSLNAGDLANSSGMVGFGRGPLS 120

Query: 288 LVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREA----------LPVGAAWVPLVRNPRA 337
           LV QLG      FSYCL S  + +   L FG  A           PV +   P V NP  
Sbjct: 121 LVSQLGPS---RFSYCLTSYLSATPSRLYFGVYANLSSTNTSSGSPVQS--TPFVINPAL 175

Query: 338 PSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVA 397
           P+ Y++ L  + +G   +PI   +F +   G  GV++D+GT++T L   AYEA R   V+
Sbjct: 176 PNMYFLSLKAISLGTKLLPIDPLVFAINDDGTGGVIIDSGTSITWLQQDAYEAVRRGLVS 235

Query: 398 QTGNLPRASGVSI-FDTCYNL--SGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGT 454
               LP  +   I  DTC+       V+V VP + F+F    +  LP  N+++     G 
Sbjct: 236 AIP-LPAMNDTDIGLDTCFQWPPPPNVTVTVPDLVFHFDSANMTLLP-ENYMLIASTTGY 293

Query: 455 FCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
            C   AP+  G +IIGN QQ+ + + +D  N F+ F P  C
Sbjct: 294 LCLVMAPTGVG-TIIGNYQQQNLHLLYDIGNSFLSFVPAPC 333


>gi|302781668|ref|XP_002972608.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
 gi|300160075|gb|EFJ26694.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
          Length = 430

 Score =  196 bits (498), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 136/385 (35%), Positives = 192/385 (49%), Gaps = 38/385 (9%)

Query: 147 SGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQ----PCSQCYKQS---DPVFD 199
           SG   G G+Y V +  G+PP+   ++ D+GSD++W+QC     P + C K++    P F 
Sbjct: 45  SGAFLGLGQYLVSMAFGTPPQEVLLIADTGSDLIWLQCSTTAAPPAFCPKKACSRRPAFV 104

Query: 200 PADSASFSGVSCSSAVCDRLENAGCHAGRCR--------YEVSYGDGSYTKGTLALETLT 251
            + SA+ S V CS+A C  +     H   C         Y   Y DGS T G LA +T T
Sbjct: 105 ASKSATLSVVPCSAAQCLLVPAPRGHGPSCSPAAPVPCGYAYDYADGSSTTGFLARDTAT 164

Query: 252 I-----GRTVVKNVAIGCGHKNQG-MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLV 305
           I     G   V+ VA GCG +NQG  F G  G++GLG G +S   Q G      FSYCL+
Sbjct: 165 ISNGTSGGAAVRGVAFGCGTRNQGGSFSGTGGVIGLGQGQLSFPAQSGSLFAQTFSYCLL 224

Query: 306 SRGTG----SSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDL 361
               G    SS  L  GR       A+ PLV NP AP+FYYVG+  + VG   +P+    
Sbjct: 225 DLEGGRRGRSSSFLFLGRPERRAAFAYTPLVSNPLAPTFYYVGVVAIRVGNRVLPVPGSE 284

Query: 362 FRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIF----DTCYNL 417
           + +  +G+ G V+D+G+ +T L   AY     AF A   +LPR    + F    + CYN+
Sbjct: 285 WAIDVLGNGGTVIDSGSTLTYLRLGAYLHLVSAFAASV-HLPRIPSSATFFQGLELCYNV 343

Query: 418 SGFVSVR-----VPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAP--SPSGLSIIG 470
           S   S+       P ++  F+ G  L LP  N+L+ V D    C A  P  SP   +++G
Sbjct: 344 SSSSSLAPANGGFPRLTIDFAQGLSLELPTGNYLVDVAD-DVKCLAIRPTLSPFAFNVLG 402

Query: 471 NIQQEGIQISFDGANGFVGFGPNVC 495
           N+ Q+G  + FD A+  +GF    C
Sbjct: 403 NLMQQGYHVEFDRASARIGFARTEC 427


>gi|297805036|ref|XP_002870402.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297316238|gb|EFH46661.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 435

 Score =  196 bits (497), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 150/448 (33%), Positives = 220/448 (49%), Gaps = 41/448 (9%)

Query: 60  LFERHNNISSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRV 119
           LF  H  I S+  +  +  +  +L+HRD   S           R +++ H    R   RV
Sbjct: 14  LFSSH--ILSNVNAKPKLGFTTDLIHRDSPKSPFYNPAETPSQRIRNAIH----RSFNRV 67

Query: 120 ATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDI 179
           +           DA+ +  Q   TD+        GEY + + +G+PP     V D+GS++
Sbjct: 68  SHFTDL---SEMDASLNSPQ---TDIT----PCGGEYLMNLSLGTPPSPIMAVADTGSNL 117

Query: 180 VWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLEN-AGC--HAGRCRYEVSYG 236
           +W QC+PC  CY Q DP+FDP  S+++  VSCSS+ C  LEN A C      C Y VSY 
Sbjct: 118 IWTQCKPCDDCYTQVDPLFDPKASSTYKDVSCSSSQCTALENQASCSTEDKTCSYLVSYA 177

Query: 237 DGSYTKGTLALETLTIGRT-----VVKNVAIGCGHKNQGMFVGA-AGLLGLGGGSMSLVG 290
           DGSYT G  A++TLT+G T      +KN+ IGCG  N   F    +G++GLGGG++SL+ 
Sbjct: 178 DGSYTMGKFAVDTLTLGSTDNRPVQLKNIIIGCGQNNAVTFRNKSSGVVGLGGGAVSLIK 237

Query: 291 QLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWV--PLVRNPRAPSFYYVGLSGL 348
           QLG    G FSYCLV     +S  + FG  A+  G   V  PLV   R  +FYY+ L  +
Sbjct: 238 QLGDSIDGKFSYCLVPENDQTS-KINFGTNAVVSGPGTVSTPLVVKSRD-TFYYLTLKSI 295

Query: 349 GVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGV 408
            VG   +   +   +        +V+D+GT +T LP   Y    +A VA   N  ++   
Sbjct: 296 SVGSKNMQTPDSNIK------GNMVIDSGTTLTLLPVKYYIEIENA-VASLINADKSKDE 348

Query: 409 SIFDT-CYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLS 467
            I  + CYN +    + +P ++ +F G  V   P ++F    +D    C AF  S     
Sbjct: 349 RIGSSLCYNATA--DLNIPVITMHFEGADVKLYPYNSFFKVTEDL--VCLAFGMSFYRNG 404

Query: 468 IIGNIQQEGIQISFDGANGFVGFGPNVC 495
           I GN+ Q+   + +D A+  + F P  C
Sbjct: 405 IYGNVAQKNFLVGYDTASKTMSFKPTDC 432


>gi|115451209|ref|NP_001049205.1| Os03g0186900 [Oryza sativa Japonica Group]
 gi|49532749|dbj|BAD26705.1| Radc1 [Oryza sativa Japonica Group]
 gi|108706569|gb|ABF94364.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113547676|dbj|BAF11119.1| Os03g0186900 [Oryza sativa Japonica Group]
 gi|215692805|dbj|BAG88249.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215767626|dbj|BAG99854.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 438

 Score =  196 bits (497), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 135/416 (32%), Positives = 190/416 (45%), Gaps = 33/416 (7%)

Query: 106 HSFHARMQRDVKRVATLVR----RLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIG 161
           H+ H      ++ +  L R    RL    + AA   V      V SG  Q    Y VR G
Sbjct: 29  HNVHPSSPSPLESIIALARDDDARLLFLSSKAATAGVSS--APVASG--QAPPSYVVRAG 84

Query: 162 VGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLEN 221
           +GSP +   + +D+ +D  W  C PC  C   S  +F PA+S+S++ + CSS+ C   + 
Sbjct: 85  LGSPSQQLLLALDTSADATWAHCSPCGTCPSSS--LFAPANSSSYASLPCSSSWCPLFQG 142

Query: 222 AGCHAGR--------------CRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHK 267
             C A +              C +   + D S+ +  LA +TL +G+  + N   GC   
Sbjct: 143 QACPAPQGGGDAAPPPATLPTCAFSKPFADASF-QAALASDTLRLGKDAIPNYTFGCVSS 201

Query: 268 NQGMFVGAA--GLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREA-LP 323
             G        GLLGLG G M+L+ Q G    G FSYCL S R    SGSL  G     P
Sbjct: 202 VTGPTTNMPRQGLLGLGRGPMALLSQAGSLYNGVFSYCLPSYRSYYFSGSLRLGAGGGQP 261

Query: 324 VGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRL 383
               + P++RNP   S YYV ++GL VG   + +    F        G V+D+GT +TR 
Sbjct: 262 RSVRYTPMLRNPHRSSLYYVNVTGLSVGHAWVKVPAGSFAFDAATGAGTVVDSGTVITRW 321

Query: 384 PTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPAS 443
             P Y A R+ F  Q       + +  FDTC+N     +   P V+ +  GG  L LP  
Sbjct: 322 TAPVYAALREEFRRQVAAPSGYTSLGAFDTCFNTDEVAAGGAPAVTVHMDGGVDLALPME 381

Query: 444 NFLIPVDDAGTFCFAFAPSP----SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           N LI        C A A +P    S +++I N+QQ+ I++ FD AN  VGF    C
Sbjct: 382 NTLIHSSATPLACLAMAEAPQNVNSVVNVIANLQQQNIRVVFDVANSRVGFAKESC 437


>gi|356525748|ref|XP_003531485.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 436

 Score =  196 bits (497), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 132/446 (29%), Positives = 201/446 (45%), Gaps = 46/446 (10%)

Query: 67  ISSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRL 126
           +SS   S  +  ++++L+HRD   S         +++   +   R+     R    + R 
Sbjct: 17  VSSREVSEGQRGFSIDLIHRDSPLSP--------FYKPSLTPSDRIINTALRSIYQLNRA 68

Query: 127 SGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQP 186
           S    +  K         +        GEY +R  +G+PP  +  + D+ SD++WVQC P
Sbjct: 69  SHSDLNEKK--------TLERVRIPNHGEYLMRFYIGTPPVERLAIADTASDLIWVQCSP 120

Query: 187 CSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCH--AGRCRYEVSYGDGSYTKGT 244
           C  C+ Q  P+F+P  S++F+ +SC S  C       C      C Y  +YGDGS TKG 
Sbjct: 121 CETCFPQDTPLFEPHKSSTFANLSCDSQPCTSSNIYYCPLVGNLCLYTNTYGDGSSTKGV 180

Query: 245 LALETLTIGRTVV--KNVAIGCGHKNQGMFV---GAAGLLGLGGGSMSLVGQLGGQTGGA 299
           L  E++  G   V       GCG  N  M        G++GLG G +SLV QLG Q G  
Sbjct: 181 LCTESIHFGSQTVTFPKTIFGCGSNNDFMHQISNKVTGIVGLGAGPLSLVSQLGDQIGHK 240

Query: 300 FSYCLVSRGTGSSGSLVFGREALPVGAAWV--PLVRNPRAPSFYYVGLSGLGVGGMRIPI 357
           FSYCL+   + S+  L FG +    G   V  PL+ +P  PS+Y++ L G+ +G   + +
Sbjct: 241 FSYCLLPFTSTSTIKLKFGNDTTITGNGVVSTPLIIDPHYPSYYFLHLVGITIGQKMLQV 300

Query: 358 SEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAF----RDAF-VAQTG-NLPRASGVSIF 411
                R T   +  +++D GT +T L    Y  F    R+A  +++T  ++P       F
Sbjct: 301 -----RTTDHTNGNIIIDLGTVLTYLEVNFYHNFVTLLREALGISETKDDIPYP-----F 350

Query: 412 DTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPS--PSGLSII 469
           D C+      ++  P + F F+G  V   P  N     DD    C A  P     G S+ 
Sbjct: 351 DFCF--PNQANITFPKIVFQFTGAKVFLSP-KNLFFRFDDLNMICLAVLPDFYAKGFSVF 407

Query: 470 GNIQQEGIQISFDGANGFVGFGPNVC 495
           GN+ Q   Q+ +D     V F P  C
Sbjct: 408 GNLAQVDFQVEYDRKGKKVSFAPADC 433


>gi|357450869|ref|XP_003595711.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355484759|gb|AES65962.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 443

 Score =  196 bits (497), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 131/360 (36%), Positives = 182/360 (50%), Gaps = 28/360 (7%)

Query: 155 EYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSA 214
           +Y + + +G+PP   Y  +D+GSD++W+QC PC+ CYKQ +P+FDP  S+++S ++  S 
Sbjct: 58  DYLMELSIGTPPVKTYAQVDTGSDLIWLQCIPCTNCYKQLNPMFDPQSSSTYSNIAYGSE 117

Query: 215 VCDRLENAGCHAGR--CRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAI-----GCGHK 267
            C +L +  C   +  C Y  SY D S T+G LA ETLT+  T  K VA+     GCGH 
Sbjct: 118 SCSKLYSTSCSPDQNNCNYTYSYEDDSITEGVLAQETLTLTSTTGKPVALKGVIFGCGHN 177

Query: 268 NQGMFVGAA-GLLGLGGGSMSLVGQLGGQTGGA-FSYCLVSRGTGSS--GSLVFGR--EA 321
           N G+F     G++GLG G +SLV Q+G   GG  FS CLV   T  S    + FG+  E 
Sbjct: 178 NNGVFNDKEMGIIGLGRGPLSLVSQIGSSFGGKMFSQCLVPFHTNPSITSPMSFGKGSEV 237

Query: 322 LPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVT 381
           L  G    PLV      +FY+V L G+ V  + +P + D   L  +    +V+D+GT  T
Sbjct: 238 LGNGVVSTPLVSKNTHQAFYFVTLLGISVEDINLPFN-DGSSLEPITKGNMVIDSGTPTT 296

Query: 382 RLPTPAYEAFRDAFVAQTGNLPRASGVSI-----FDTCYNLSGFVSVRVPTVSFYFSGGP 436
            LP    E F    V +  N      + I     +  CY      +++  T++ +F G  
Sbjct: 297 LLP----EDFYHRLVEEVRNKVALDPIPIDPTLGYQLCYRTP--TNLKGTTLTAHFEGAD 350

Query: 437 VLTLPASNFLIPVDDAGTFCFAFAPSPSG-LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           VL  P   F IPV D G FCFAF  + S    I GN  Q    I FD     V F    C
Sbjct: 351 VLLTPTQIF-IPVQD-GIFCFAFTSTFSNEYGIYGNHAQSNYLIGFDLEKQLVSFKATDC 408


>gi|357143328|ref|XP_003572882.1| PREDICTED: uncharacterized protein LOC100846829 [Brachypodium
           distachyon]
          Length = 836

 Score =  195 bits (495), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 157/405 (38%), Positives = 207/405 (51%), Gaps = 29/405 (7%)

Query: 107 SFHARMQRDVKRVATLVRRLSG----GGAD--AAKHEVQDFGTDVVSGMDQGSGEYFVRI 160
           SF   ++ D +R   + RR+SG    GG     A    +        G   G+ +Y V +
Sbjct: 445 SFAEVLRADERRAEYIQRRMSGAKGPGGLQQFTAASSSKSVTIPANIGHSIGTLQYVVTV 504

Query: 161 GVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYK--QSDPVFDPADSASFSGVSCSSAVCDR 218
            +G+P  +Q + +D+GSD+ WVQC PC+      Q D +FDPA S+S+S V C++  C  
Sbjct: 505 SLGTPGVAQTVEVDTGSDVSWVQCAPCAAPACYAQKDQLFDPAKSSSYSAVPCAADACSE 564

Query: 219 LEN--AGCHAG-RCRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQGMFVG 274
           L     GC AG +C Y VSYGDGS T G    +TLT+     V     GCGH   G+F G
Sbjct: 565 LSTYGHGCAAGSQCGYVVSYGDGSNTTGVYGSDTLTLTDADAVTGFLFGCGHAQAGLFAG 624

Query: 275 AAGLLGLGGGSMSLVGQLGGQT-GGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVR 333
             GLL LG   MSL  Q  G   GG FSYCL    + S+G L  G  +   G A   L+ 
Sbjct: 625 IDGLLALGRKGMSLTSQTSGAYGGGVFSYCLPPSPS-STGFLTLGGPSSASGFATTGLLT 683

Query: 334 NPRAPSFYYVGLSGLGVGGMRIP-ISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFR 392
               P+FY V L+G+GVGG ++  +    F        G V+DTGT +TRLP  AY A R
Sbjct: 684 AWDVPTFYMVMLTGIGVGGQQLSGVPASAFA------GGTVVDTGTVITRLPPTAYAALR 737

Query: 393 DAFVAQTG--NLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVD 450
            AF A       P A    I DTCYN + + +V +PTVS  FSGG  L L A  FL    
Sbjct: 738 AAFRAAMAPYGYPAAPATGILDTCYNFTDYGTVTLPTVSLTFSGGATLKLDAPGFL---- 793

Query: 451 DAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
            +G   FA        +I+GN+QQ    + FDG++  VGF P+ C
Sbjct: 794 SSGCLAFATNSGDGDPAILGNVQQRSFAVRFDGSS--VGFMPHSC 836


>gi|125542690|gb|EAY88829.1| hypothetical protein OsI_10302 [Oryza sativa Indica Group]
          Length = 440

 Score =  194 bits (494), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 134/416 (32%), Positives = 190/416 (45%), Gaps = 33/416 (7%)

Query: 106 HSFHARMQRDVKRVATLVR----RLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIG 161
           H+ H      ++ +  L R    RL    + AA   V      V SG  Q    Y VR G
Sbjct: 31  HNVHPSSPSPLESIIALARDDDARLLFLSSKAATAGVSS--APVASG--QAPPSYVVRAG 86

Query: 162 VGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLEN 221
           +GSP +   + +D+ +D  W  C PC  C   S  +F PA+S+S++ + CSS+ C   + 
Sbjct: 87  LGSPSQQLLLALDTSADATWAHCSPCGTCPSSS--LFAPANSSSYASLPCSSSWCPLFQG 144

Query: 222 AGCHAGR--------------CRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHK 267
             C A +              C +   + D S+ +  LA +TL +G+  + N   GC   
Sbjct: 145 QACPAPQGGGDAAPPPATLPTCAFSKPFADASF-QAALASDTLRLGKDAIPNYTFGCVSS 203

Query: 268 NQGMFVGAA--GLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREA-LP 323
             G        GLLGLG G M+L+ Q G    G FSYCL S R    SGSL  G     P
Sbjct: 204 VTGPTTNMPRQGLLGLGRGPMALLSQAGSLYNGVFSYCLPSYRSYYFSGSLRLGAGGGQP 263

Query: 324 VGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRL 383
               + P++RNP   S YYV ++GL VG   + +    F        G V+D+GT +TR 
Sbjct: 264 RSVRYTPMLRNPHRSSLYYVNVTGLSVGRAWVKVPAGSFAFDAATGAGTVVDSGTVITRW 323

Query: 384 PTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPAS 443
             P Y A R+ F  Q       + +  FDTC+N     +   P V+ +  GG  L LP  
Sbjct: 324 TAPVYAALREEFRRQVAAPSGYTSLGAFDTCFNTDEVAAGGAPAVTVHMDGGVDLALPME 383

Query: 444 NFLIPVDDAGTFCFAFAPSP----SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           N LI        C A A +P    S +++I N+QQ+ I++ FD AN  +GF    C
Sbjct: 384 NTLIHSSATPLACLAMAEAPQNVNSVVNVIANLQQQNIRVVFDVANSRIGFAKESC 439


>gi|357515001|ref|XP_003627789.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355521811|gb|AET02265.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 415

 Score =  194 bits (494), Expect = 7e-47,   Method: Compositional matrix adjust.
 Identities = 139/427 (32%), Positives = 204/427 (47%), Gaps = 54/427 (12%)

Query: 79  WNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEV 138
           + LEL+HRD   S                F+   Q   +R+A  VRR         K+ +
Sbjct: 29  FTLELIHRDSSKSP---------------FYQPTQNKYERIANAVRRSINRVNHFYKYSL 73

Query: 139 QDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVF 198
               +   S ++   GEY +   +G+PP   +  +D+GSD+VW+QC+PC QCY Q  P+F
Sbjct: 74  T---STPQSTVNSDKGEYLMSYSIGTPPFKVFGFVDTGSDLVWLQCEPCKQCYPQITPIF 130

Query: 199 DPADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVK 258
           DP+ S+S+  + C S  C  +    C                 +G L++ETLT+  T   
Sbjct: 131 DPSLSSSYQNIPCLSDTCHSMRTTSCDV---------------RGYLSVETLTLDSTTGY 175

Query: 259 NVA-----IGCGHKNQGMFVG-AAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSS 312
           +V+     IGCG++N G F G ++G++GLG G MSL  QLG   GG FSYCL      S+
Sbjct: 176 SVSFPKTMIGCGYRNTGTFHGPSSGIVGLGSGPMSLPSQLGTSIGGKFSYCLGPWLPNST 235

Query: 313 GSLVFGREALPV--GAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDD 370
             L FG  A+    GA   P+V+   A S YY+ L    VG   I         T  G++
Sbjct: 236 SKLNFGDAAIVYGDGAMTTPIVKK-DAQSGYYLTLEAFSVGNKLIEFGGP----TYGGNE 290

Query: 371 G-VVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVS-IFDTCYNLSGFVSVRVPTV 428
           G +++D+GT  T LP   Y  F  A VA+  NL      +  F  CYN++ +     P +
Sbjct: 291 GNILIDSGTTFTFLPYDVYYRFESA-VAEYINLEHVEDPNGTFKLCYNVA-YHGFEAPLI 348

Query: 429 SFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFV 488
           + +F G  +     S F I V D G  C AF PS +  +I GN+ Q+ + + ++     V
Sbjct: 349 TAHFKGADIKLYYISTF-IKVSD-GIACLAFIPSQT--AIFGNVAQQNLLVGYNLVQNTV 404

Query: 489 GFGPNVC 495
            F P  C
Sbjct: 405 TFKPVDC 411


>gi|357450863|ref|XP_003595708.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355484756|gb|AES65959.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 407

 Score =  194 bits (493), Expect = 9e-47,   Method: Compositional matrix adjust.
 Identities = 129/361 (35%), Positives = 181/361 (50%), Gaps = 35/361 (9%)

Query: 155 EYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSA 214
           EY + + +G+PP   Y   D+GSD+VW QC PC++CYKQ +P+FDP  S+S++ ++C + 
Sbjct: 59  EYLMELSIGTPPIKIYAEADTGSDLVWFQCIPCTKCYKQQNPMFDPRSSSSYTNITCGTE 118

Query: 215 VCDRLENAGCHAGR--CRYEVSYGDGSYTKGTLALETLTIGRTVVKNVA-----IGCGHK 267
            C++L+++ C   +  C Y  SY D S T+G LA ETLT+  T  + VA      GCGH 
Sbjct: 119 SCNKLDSSLCSTDQKTCNYTYSYADNSITQGVLAQETLTLTSTTGEPVAFQGIIFGCGHN 178

Query: 268 NQGMFVGAAGLLGLGGGSMSLVGQLG---GQTGGAFSYCLVSRGTGSS--GSLVFGR--E 320
           N G      GL+GLG G +SL+ Q+G   G  G  FS CLV   T  S    + FG+  E
Sbjct: 179 NSGFNDREMGLIGLGRGPLSLISQIGSSLGAGGNMFSQCLVPFNTDPSITSQMNFGKGSE 238

Query: 321 ALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAV 380
            L  G    PL+   +  + Y+  L G+ V  + +P S     L  +    +++D+GT +
Sbjct: 239 VLGNGTVSTPLIS--KDGTGYFATLLGISVEDINLPFSNGS-SLGTITKGNILIDSGTTI 295

Query: 381 TRLPTPAYEAFRDAFVAQTGNLPRASGVSI--FDTCY----NLSGFVSVRVPTVSFYFSG 434
           T LP    E F    + Q  N        I  ++ CY    NL+G      PT++ +F G
Sbjct: 296 TYLP----EEFYHRLIEQVRNKVALEPFRIDGYELCYQTPTNLNG------PTLTIHFEG 345

Query: 435 GPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNV 494
           G VL  PA  F IPV D   FCFA   +       GN  Q    I FD     V F    
Sbjct: 346 GDVLLTPAQMF-IPVQD-DNFCFAVFDTNEEYVTYGNYAQSNYLIGFDLERQVVSFKATD 403

Query: 495 C 495
           C
Sbjct: 404 C 404


>gi|255566010|ref|XP_002523993.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223536720|gb|EEF38361.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 439

 Score =  193 bits (491), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 136/433 (31%), Positives = 208/433 (48%), Gaps = 41/433 (9%)

Query: 79  WNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEV 138
           + +EL++RD   S                F+   +   +R+ + VRR        +  + 
Sbjct: 29  FTVELINRDSPKSP---------------FYNPRETPTQRIVSAVRRSMSRVHHFSPTKN 73

Query: 139 QDFGTDVV-SGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPV 197
            D  TD   S M    GEY ++  +G+P      + D+GSD++W QC+PC QCY+Q  P+
Sbjct: 74  SDIFTDTAQSEMISNQGEYLMKFSLGTPAFDILAIADTGSDLIWTQCKPCDQCYEQDAPL 133

Query: 198 FDPADSASFSGVSCSSAVCDRLENAGCHAGR----CRYEVSYGDGSYTKGTLALETLTIG 253
           FDP  S+++  +SCS+  CD L+     +G     C Y  SYGD S+T G +A +T+T+G
Sbjct: 134 FDPKSSSTYRDISCSTKQCDLLKEGASCSGEGNKTCHYSYSYGDRSFTSGNVAADTITLG 193

Query: 254 RT-----VVKNVAIGCGHKNQGMFV-GAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLV-- 305
            T     ++    IGCGH N G F    +G++GLGGG +SL+ QLG    G FSYCLV  
Sbjct: 194 STSGRPVLLPKAIIGCGHNNGGSFTEKGSGIVGLGGGPISLISQLGSTIDGKFSYCLVPL 253

Query: 306 SRGTGSSGSLVFGREALPVGAAW--VPLV-RNPRAPSFYYVGLSGLGVGGMRIPISEDLF 362
           S    +S  L FG   +  G      PL+ ++P   +FY++ L  + VG  RI      F
Sbjct: 254 SSNATNSSKLNFGSNGIVSGGGVQSTPLISKDPD--TFYFLTLEAVSVGSERIKFPGSSF 311

Query: 363 RLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVS 422
             ++     +++D+GT +T  P   +     A        P      I   CY++     
Sbjct: 312 GTSE---GNIIIDSGTTLTLFPEDFFSELSSAVQDAVAGTPVEDPSGILSLCYSIDA--D 366

Query: 423 VRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFD 482
           ++ P+++ +F G  V   P + F + V D    CFAF P  SG +I GN+ Q    + +D
Sbjct: 367 LKFPSITAHFDGADVKLNPLNTF-VQVSDT-VLCFAFNPINSG-AIFGNLAQMNFLVGYD 423

Query: 483 GANGFVGFGPNVC 495
                V F P  C
Sbjct: 424 LEGKTVSFKPTDC 436


>gi|356546376|ref|XP_003541602.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 450

 Score =  193 bits (491), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 150/445 (33%), Positives = 217/445 (48%), Gaps = 34/445 (7%)

Query: 66  NISSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRR 125
           NIS SN+    + +++E++HRD   S          +RH        +   +RVA  +RR
Sbjct: 22  NISFSNSKVLNSGFSVEMIHRDSSRSP--------LYRH-------TETPFQRVANAMRR 66

Query: 126 LSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQ 185
                    K           S +    GEY +   VG+PP     V+D+GS I W+QCQ
Sbjct: 67  SINRANHFNKKSFVASTNTAESTVKASQGEYLMSYSVGTPPFEILGVVDTGSGITWMQCQ 126

Query: 186 PCSQCYKQSDPVFDPADSASFSGVSCSSAVCDR-LENAGCHAGR--CRYEVSYGDGSYTK 242
            C  CY+Q+ P+FDP+ S ++  + CSS +C   +    C + +  C+Y + YGDGS+++
Sbjct: 127 RCEDCYEQTTPIFDPSKSKTYKTLPCSSNMCQSVISTPSCSSDKIGCKYTIKYGDGSHSQ 186

Query: 243 GTLALETLTIGRT-----VVKNVAIGCGHKNQGMF-VGAAGLLGLGGGSMSLVGQLGGQT 296
           G L++ETLT+G T        N  IGCGH N+G F    +G++GLGGG +SL+ QL    
Sbjct: 187 GDLSVETLTLGSTNGSSVQFPNTVIGCGHNNKGTFQGEGSGVVGLGGGPVSLISQLSSSI 246

Query: 297 GGAFSYCLVS--RGTGSSGSLVFGREALP--VGAAWVPLVRNPRAPSFYYVGLSGLGVGG 352
           GG FSYCL      + SS  L FG  A+   +GA   PLV    +  FYY+ L    VG 
Sbjct: 247 GGKFSYCLAPMFSQSNSSSKLNFGDAAVVSGLGAVSTPLVSKTGSEVFYYLTLEAFSVGD 306

Query: 353 MRIP-ISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIF 411
            RI  +       +  G+  +++D+GT +T LP   Y     A VA      R S  S F
Sbjct: 307 KRIEFVGGSSSSGSSNGEGNIIIDSGTTLTLLPQEDYSNLESA-VADAIQANRVSDPSNF 365

Query: 412 -DTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIG 470
              CY  +    + VP ++ +F G  V   P S F+   +  G  CFAF  S   +SI G
Sbjct: 366 LSLCYQTTPSGQLDVPVITAHFKGADVELNPISTFVQVAE--GVVCFAFH-SSEVVSIFG 422

Query: 471 NIQQEGIQISFDGANGFVGFGPNVC 495
           N+ Q  + + +D     V F P  C
Sbjct: 423 NLAQLNLLVGYDLMEQTVSFKPTDC 447


>gi|357118074|ref|XP_003560784.1| PREDICTED: aspartic proteinase nepenthesin-2-like, partial
           [Brachypodium distachyon]
          Length = 452

 Score =  193 bits (491), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 135/357 (37%), Positives = 182/357 (50%), Gaps = 15/357 (4%)

Query: 147 SGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS-QCYKQSDPVFDPADSAS 205
           +G +  + E+ V +G GSP ++   + D+GSD+ W+QCQPCS  CYKQ DPVFDPA S+S
Sbjct: 103 TGTNLKTPEFVVVVGFGSPAQTSATMFDTGSDLSWIQCQPCSGHCYKQHDPVFDPAKSSS 162

Query: 206 FSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTV-VKNVAIGC 264
           ++ V C +  C       C+   C Y V YGDGS T G LA ETLT   +        GC
Sbjct: 163 YAVVPCGTTEC-AAAGGECNGTTCVYGVEYGDGSSTTGVLARETLTFSSSSEFTGFIFGC 221

Query: 265 GHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALP- 323
           G  N G F    GLLGLG GS+SL  Q     GG FSYCL S  T + G L  G   +  
Sbjct: 222 GETNLGDFGEVDGLLGLGRGSLSLSSQAAPAFGGIFSYCLPSYNT-TPGYLSIGATPVTG 280

Query: 324 -VGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTR 382
            +   +  +V  P  PSFY++ L  + +GG  +P+    F  T     G ++D+GT +T 
Sbjct: 281 QIPVQYTAMVNKPDYPSFYFIELVSINIGGYVLPVPPSEFTKT-----GTLLDSGTILTY 335

Query: 383 LPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPA 442
           LP PAY A RD F         A      DTCY+ +G   + +P VSF FS G V  L  
Sbjct: 336 LPPPAYTALRDRFKFTMQGSKPAPPYDELDTCYDFTGQSGILIPGVSFNFSDGAVFNLNF 395

Query: 443 SNFLIPVDDA--GTFCFAFAPSPSGL--SIIGNIQQEGIQISFDGANGFVGFGPNVC 495
              +   DD      C AF   P+ +  S++G+  Q   ++ +D     +GF P  C
Sbjct: 396 FGIMTFPDDTKPAVGCLAFVSRPADMPFSVVGSTTQRSAEVIYDVPAQKIGFIPASC 452


>gi|297843774|ref|XP_002889768.1| hypothetical protein ARALYDRAFT_471076 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297335610|gb|EFH66027.1| hypothetical protein ARALYDRAFT_471076 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 449

 Score =  193 bits (491), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 127/362 (35%), Positives = 180/362 (49%), Gaps = 18/362 (4%)

Query: 145 VVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSA 204
           V SG     G Y VR  +G+PP+  +MV+D+ +D VW+ C  CS C   +   F+   S+
Sbjct: 94  VASGNQLHIGNYVVRARLGTPPQLMFMVLDTSNDAVWLPCSGCSGC-SNASTSFNTNSSS 152

Query: 205 SFSGVSCSSAVCDRLENAGCHAGR-----CRYEVSYGDGSYTKGTLALETLTIGRTVVKN 259
           ++S VSCS+  C +     C +       C +  SYG  S     L  +TLT+   V+ N
Sbjct: 153 TYSTVSCSTTQCTQARGLTCPSSTPQPSICSFNQSYGGDSSFSANLVQDTLTLSPDVIPN 212

Query: 260 VAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFG 318
            + GC +   G  +   GL+GLG G MSLV Q      G FSYCL S R    SGSL  G
Sbjct: 213 FSFGCINSASGNSLPPQGLMGLGRGPMSLVSQTTSLYSGVFSYCLPSFRSFYFSGSLKLG 272

Query: 319 REALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGT 378
               P    + PL+RNPR PS YYV L+G+ VG +++P+             G ++D+GT
Sbjct: 273 LLGQPKSIRYTPLLRNPRRPSLYYVNLTGVSVGSVQVPVDPVYLTFDSNSGAGTIIDSGT 332

Query: 379 AVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVL 438
            +TR   P YEA RD F  Q       S +  FDTC++         P ++ + +    L
Sbjct: 333 VITRFAQPVYEAIRDEFRKQVNG--SFSTLGAFDTCFSADN--ENVTPKITLHMTSLD-L 387

Query: 439 TLPASNFLIPVDDAGTF-CFAFA----PSPSGLSIIGNIQQEGIQISFDGANGFVGFGPN 493
            LP  N LI    AGT  C + A     + + L++I N+QQ+ ++I FD  N  +G  P 
Sbjct: 388 KLPMENTLI-HSSAGTLTCLSMAGIRQNANAVLNVIANLQQQNLRILFDVPNSRIGIAPE 446

Query: 494 VC 495
            C
Sbjct: 447 PC 448


>gi|255543963|ref|XP_002513044.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223548055|gb|EEF49547.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 431

 Score =  193 bits (490), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 134/423 (31%), Positives = 194/423 (45%), Gaps = 32/423 (7%)

Query: 80  NLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQ 139
           NL++ H     S    +  + +        A+ Q  ++ +++LV R S     + +  VQ
Sbjct: 33  NLQVFHVYSPCSPFWPSKPLKWEESVLQMQAKDQARLQFLSSLVARKSVVPIASGRQIVQ 92

Query: 140 DFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFD 199
                        S  Y VR  +G+P ++  + +D+ +D  W+   PCS C   S  VF+
Sbjct: 93  -------------SPTYIVRAKIGTPAQTMLLAMDTSNDAAWI---PCSGCVGCSSTVFN 136

Query: 200 PADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKN 259
              S +F  V C +  C ++ N+ C    C + ++YG  S     L+ + +T+    + +
Sbjct: 137 NVKSTTFKTVGCEAPQCKQVPNSKCGGSACAFNMTYGSSSIA-ANLSQDVVTLATDSIPS 195

Query: 260 VAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFG 318
              GC  +  G  +   GLLGLG G MSL+ Q        FSYCL S R    SGSL  G
Sbjct: 196 YTFGCLTEATGSSIPPQGLLGLGRGPMSLLSQTQNLYQSTFSYCLPSFRSLNFSGSLRLG 255

Query: 319 REALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGT 378
               P      PL++NPR  S YYV L  + VG   + I             G + D+GT
Sbjct: 256 PVGQPKRIKTTPLLKNPRRSSLYYVNLMAIRVGRRVVDIPPSALAFNPTTGAGTIFDSGT 315

Query: 379 AVTRLPTPAYEAFRDAFVAQTGNLPRASGVSI--FDTCYNLSGFVSVRVPTVSFYFSGGP 436
             TRL  PAY A RDAF  + GN   A+  S+  FDTCY       +  PT++F FSG  
Sbjct: 316 VFTRLVAPAYTAVRDAFRKRVGN---ATVTSLGGFDTCYT----SPIVAPTITFMFSGMN 368

Query: 437 VLTLPASNFLIPVDDAGTFCFAFAPSP----SGLSIIGNIQQEGIQISFDGANGFVGFGP 492
           V TLP  N LI    +   C A A +P    S L++I N+QQ+  +I FD  N  +G   
Sbjct: 369 V-TLPPDNLLIHSTASSITCLAMAAAPDNVNSVLNVIANMQQQNHRILFDVPNSRLGVAR 427

Query: 493 NVC 495
             C
Sbjct: 428 EPC 430


>gi|302780575|ref|XP_002972062.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
 gi|300160361|gb|EFJ26979.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
          Length = 429

 Score =  193 bits (490), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 137/385 (35%), Positives = 191/385 (49%), Gaps = 38/385 (9%)

Query: 147 SGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQ----PCSQCYKQS---DPVFD 199
           SG   G G+Y V +  G+PP+   ++ D+GSD++W+QC     P + C K++    P F 
Sbjct: 44  SGAFLGLGQYLVSMAFGTPPQEVLLIADTGSDLIWLQCSTTAAPPAFCPKKACSRRPAFV 103

Query: 200 PADSASFSGVSCSSAVC-----DRLENAGCHAGR---CRYEVSYGDGSYTKGTLALETLT 251
            + SA+ S V CS+A C      R     C       C Y   Y DGS T G LA +T T
Sbjct: 104 ASKSATLSVVPCSAAQCLLVPAPRGHGPACSPAAPVPCGYAYDYADGSSTTGFLARDTAT 163

Query: 252 I-----GRTVVKNVAIGCGHKNQG-MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLV 305
           I     G   V+ VA GCG +NQG  F G  G++GLG G +S   Q G      FSYCL+
Sbjct: 164 ISNGTSGGAAVRGVAFGCGTRNQGGSFSGTGGVIGLGQGQLSFPAQSGSLFAQTFSYCLL 223

Query: 306 SRGTG----SSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDL 361
               G    SS  L  GR       A+ PLV NP AP+FYYVG+  + VG   +P+    
Sbjct: 224 DLEGGRRGRSSSFLFLGRPERRAAFAYTPLVSNPLAPTFYYVGVVAIRVGNRVLPVPGSE 283

Query: 362 FRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIF----DTCYNL 417
           + +  +G+ G V+D+G+ +T L   AY     AF A   +LPR    + F    + CYN+
Sbjct: 284 WAIDVLGNGGTVIDSGSTLTYLRLGAYLHLVSAFAASV-HLPRIPSSATFFQGLELCYNV 342

Query: 418 SGFVSVR-----VPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAP--SPSGLSIIG 470
           S   S        P ++  F+ G  L LP  N+L+ V D    C A  P  SP   +++G
Sbjct: 343 SSSSSSAPANGGFPRLTIDFAQGLSLELPTGNYLVDVAD-DVKCLAIRPTLSPFAFNVLG 401

Query: 471 NIQQEGIQISFDGANGFVGFGPNVC 495
           N+ Q+G  + FD A+  +GF    C
Sbjct: 402 NLMQQGYHVEFDRASARIGFARTEC 426


>gi|356542355|ref|XP_003539632.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 444

 Score =  192 bits (489), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 148/433 (34%), Positives = 214/433 (49%), Gaps = 39/433 (9%)

Query: 79  WNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEV 138
           +++E++HRD   S         Y+R         +   +RVA  +RR         K  +
Sbjct: 32  FSVEIIHRDSSRSP--------YYRP-------TETQFQRVANALRRSINRANHFNKPNL 76

Query: 139 QDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVF 198
                   S +    GEY +   VG+PP     ++D+GSDI+W+QCQPC  CY Q+ P+F
Sbjct: 77  VASTNTAESTVIASQGEYLMSYSVGTPPFQILGIVDTGSDIIWLQCQPCEDCYNQTTPIF 136

Query: 199 DPADSASFSGVSCSSAVCDRLENAG-CHAG--RCRYEVSYGDGSYTKGTLALETLTIGRT 255
           DP+ S ++  + CSS +C  +++A  C +    C Y ++YGD S+++G L++ETLT+G T
Sbjct: 137 DPSQSKTYKTLPCSSNICQSVQSAASCSSNNDECEYTITYGDNSHSQGDLSVETLTLGST 196

Query: 256 VVKNV-----AIGCGHKNQGMFV-GAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS--R 307
              +V      IGCGH N+G F    +G++GLGGG +SL+ QL    GG FSYCL     
Sbjct: 197 DGSSVQFPKTVIGCGHNNKGTFQREGSGIVGLGGGPVSLISQLSSSIGGKFSYCLAPLFS 256

Query: 308 GTGSSGSLVFGREALPVGAAWVPLVRNPRAPS----FYYVGLSGLGVGGMRIPISEDLFR 363
            + SS  L FG EA+  G      V  P  P     FY++ L    VG  RI      F 
Sbjct: 257 QSNSSSKLNFGDEAVVSGRG---TVSTPIVPKNGLGFYFLTLEAFSVGDNRIEFGSSSFE 313

Query: 364 LTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIF-DTCYNLSGFVS 422
            +    + +++D+GT +T LP   Y     A VA    L R    S F   CY  +    
Sbjct: 314 SSGGEGN-IIIDSGTTLTILPEDDYLNLESA-VADAIELERVEDPSKFLRLCYRTTSSDE 371

Query: 423 VRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFD 482
           + VP ++ +F G  V   P S F I VD+ G  CFAF  S  G  I GN+ Q+ + + +D
Sbjct: 372 LNVPVITAHFKGADVELNPISTF-IEVDE-GVVCFAFRSSKIG-PIFGNLAQQNLLVGYD 428

Query: 483 GANGFVGFGPNVC 495
                V F P  C
Sbjct: 429 LVKQTVSFKPTDC 441


>gi|225427558|ref|XP_002266614.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
           vinifera]
          Length = 444

 Score =  192 bits (489), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 141/418 (33%), Positives = 205/418 (49%), Gaps = 27/418 (6%)

Query: 95  TTNNMHYHRHQHSFHARMQRDVKRVATLVRR--LSGGGADAAKHEVQDFGTDVVSGMDQG 152
           TT+ +        F+   +   +R+    RR  L G    A +    D  +DV+SG    
Sbjct: 35  TTDFISRDSPHSPFYNPSETKYQRLQKAFRRSILRGNHFRAMRASPNDIQSDVISG---- 90

Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
            G Y + I +G+PP     + D+GSD++W QC PC  CY+Q +P+FDP +S ++  + C 
Sbjct: 91  GGAYLMNISLGTPPVPMLGIADTGSDLIWRQCLPCPNCYEQVEPLFDPKESETYKTLDCD 150

Query: 213 SAVCDRLENAGC--HAGRCRYEVSYGDGSYTKGTLALETLTIGRT-----VVKNVAIGCG 265
           +  C  L   G       C Y  SYGD SYT+G L+ +TLTIG T         +A GCG
Sbjct: 151 NEFCQDLGQQGSCDDDNTCTYSYSYGDRSYTRGDLSSDTLTIGSTEGDPASFPGIAFGCG 210

Query: 266 HKNQGMF-VGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGS--SGSLVFGREAL 322
           H N G F     GL+GLGGG +SLV QL  + GG FSYCLV   + S  S  + FG+  +
Sbjct: 211 HDNGGTFNEKDGGLIGLGGGPLSLVMQLSSEVGGQFSYCLVPLSSDSTVSSKINFGKSGV 270

Query: 323 PVGAAWV--PLVRNPRAPSFYYVGLSGLGVGGMRIP---ISEDLFRLTQMGDDGVVMDTG 377
             G+  V  PL++     +FYY+ L GL VG   +     SE+      + +  +++D+G
Sbjct: 271 VSGSGTVSTPLIKG-TPDTFYYLTLEGLSVGSETVAFKGFSENKSSPAAVEEGNIIIDSG 329

Query: 378 TAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPV 437
           T +T LP   Y     A     G         IF  CY  S   ++ +PT++ +F+G  V
Sbjct: 330 TTLTLLPQDFYTDVESALTNAIGGQTTTDPNGIFSLCY--SSVNNLEIPTITAHFTGADV 387

Query: 438 LTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
              P + F+   +D    CF+  PS S L+I GN+ Q    + +D  N  V F    C
Sbjct: 388 QLPPLNTFVQVQEDL--VCFSMIPS-SNLAIFGNLAQINFLVGYDLKNNKVSFKQTDC 442


>gi|194699094|gb|ACF83631.1| unknown [Zea mays]
 gi|413938606|gb|AFW73157.1| hypothetical protein ZEAMMB73_333672 [Zea mays]
          Length = 452

 Score =  192 bits (489), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 151/429 (35%), Positives = 210/429 (48%), Gaps = 58/429 (13%)

Query: 81  LELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQD 140
           L L HR    + S  ++         S    ++ D +R   ++RR+SG        +   
Sbjct: 68  LRLTHRHGPCAPSRASS-----LAAPSVADTLRADQRRAEYILRRVSGRAPQLWDSKAAA 122

Query: 141 FGTDVVS--GMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS---QCYKQSD 195
               V +  G D G+  Y V   +G+P  +Q M +D+GSD+ WVQC+PCS    CY Q D
Sbjct: 123 AAATVPASWGYDIGTLNYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCSAAPSCYSQKD 182

Query: 196 PVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT 255
           P+FDPA S+S++ V C   VC  L   G +A         G                   
Sbjct: 183 PLFDPAQSSSYAAVPCGGPVCAGL---GIYAASACSAAQCG------------------- 220

Query: 256 VVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSL 315
            V+    GCGH   G+F G  GLLGLG    SLV Q  G  GG FSYCL ++ + ++G L
Sbjct: 221 AVQGFFFGCGHAQSGLFNGVDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPS-TAGYL 279

Query: 316 VFGREALPVGAA----WVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDG 371
             G    P GAA       L+ +P AP++Y V L+G+ VGG ++ +    F    +    
Sbjct: 280 TLGVGG-PSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGGTV---- 334

Query: 372 VVMDTGTAVTRLPTPAYEAFRDAFVAQTGN--LPRASGVSIFDTCYNLSGFVSVRVPTVS 429
             +DTGT VTRLP  AY A R AF +   +   P A    I DTCYN +G+ +V +P V+
Sbjct: 335 --VDTGTVVTRLPPTAYAALRSAFRSGMASYGYPTAPSNGILDTCYNFAGYGTVTLPNVA 392

Query: 430 FYFSGGPVLTLPASNFLIPVDDAGTF-CFAFAPSPS--GLSIIGNIQQEGIQISFDGANG 486
             F  G  +TL A   L       +F C AFAPS S  G++I+GN+QQ   ++  DG + 
Sbjct: 393 LTFGSGATVTLGADGIL-------SFGCLAFAPSGSDGGMAILGNVQQRSFEVRIDGTS- 444

Query: 487 FVGFGPNVC 495
            VGF P+ C
Sbjct: 445 -VGFKPSSC 452


>gi|118486912|gb|ABK95290.1| unknown [Populus trichocarpa]
          Length = 438

 Score =  192 bits (489), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 121/350 (34%), Positives = 170/350 (48%), Gaps = 16/350 (4%)

Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSS 213
             Y VR+ +G+P +  +MV+D+ +D  WV   PCS C   S   F P  S +   + CS 
Sbjct: 96  ANYVVRVKLGTPGQQMFMVLDTSNDAAWV---PCSGCTGFSSTTFLPNASTTLGSLDCSG 152

Query: 214 AVCDRLENAGCHA---GRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQG 270
           A C ++    C A     C +  SYG  S    TL  + +T+   V+     GC +   G
Sbjct: 153 AQCSQVRGFSCPATGSSACLFNQSYGGDSSLTATLVQDAITLANDVIPGFTFGCINAVSG 212

Query: 271 MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALPVGAAWV 329
             +   GLLGLG G +SL+ Q G    G FSYCL S +    SGSL  G    P      
Sbjct: 213 GSIPPQGLLGLGRGPISLISQAGAMYSGVFSYCLPSFKSYYFSGSLKLGPVGQPKSIRTT 272

Query: 330 PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYE 389
           PL+RNP  PS YYV L+G+ VG +++PI  +          G ++D+GT +TR   P Y 
Sbjct: 273 PLLRNPHRPSLYYVNLTGVSVGRIKVPIPSEQLVFDPNTGAGTIIDSGTVITRFVQPVYF 332

Query: 390 AFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPV 449
           A RD F  Q  N P  S +  FDTC+  +       P ++ +F G   L LP  N LI  
Sbjct: 333 AIRDEFRKQV-NGP-ISSLGAFDTCFAATN--EAEAPAITLHFEGL-NLVLPMENSLIHS 387

Query: 450 DDAGTFCFAFAPSP----SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
                 C + A +P    S L++I N+QQ+ ++I FD  N  +G    +C
Sbjct: 388 SSGSLACLSMAAAPNNVNSVLNVIANLQQQNLRIMFDTTNSRLGIARELC 437


>gi|224067042|ref|XP_002302336.1| predicted protein [Populus trichocarpa]
 gi|222844062|gb|EEE81609.1| predicted protein [Populus trichocarpa]
          Length = 438

 Score =  192 bits (489), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 120/350 (34%), Positives = 170/350 (48%), Gaps = 16/350 (4%)

Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSS 213
             Y VR+ +G+P +  +MV+D+ +D  WV C  C+ C   S   F P  S +   + CS 
Sbjct: 96  ANYVVRVKLGTPGQQMFMVLDTSNDAAWVPCSGCTGC---SSTTFLPNASTTLGSLDCSG 152

Query: 214 AVCDRLENAGCHA---GRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQG 270
           A C ++    C A     C +  SYG  S    TL  + +T+   V+     GC +   G
Sbjct: 153 AQCSQVRGFSCPATGSSACLFNQSYGGDSSLTATLVQDAITLANDVIPGFTFGCINAVSG 212

Query: 271 MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALPVGAAWV 329
             +   GLLGLG G +SL+ Q G    G FSYCL S +    SGSL  G    P      
Sbjct: 213 GSIPPQGLLGLGRGPISLISQAGAMYSGVFSYCLPSFKSYYFSGSLKLGPVGQPKSIRTT 272

Query: 330 PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYE 389
           PL+RNP  PS YYV L+G+ VG +++PI  +          G ++D+GT +TR   P Y 
Sbjct: 273 PLLRNPHRPSLYYVNLTGVSVGRIKVPIPSEQLVFDPNTGAGTIIDSGTVITRFVQPVYF 332

Query: 390 AFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPV 449
           A RD F  Q  N P  S +  FDTC+  +       P ++ +F G   L LP  N LI  
Sbjct: 333 AIRDEFRKQV-NGP-ISSLGAFDTCFAATN--EAEAPAITLHFEGL-NLVLPMENSLIHS 387

Query: 450 DDAGTFCFAFAPSP----SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
                 C + A +P    S L++I N+QQ+ ++I FD  N  +G    +C
Sbjct: 388 SSGSLACLSMAAAPNNVNSVLNVIANLQQQNLRIMFDTTNSRLGIARELC 437


>gi|242092368|ref|XP_002436674.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
 gi|241914897|gb|EER88041.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
          Length = 461

 Score =  192 bits (488), Expect = 4e-46,   Method: Compositional matrix adjust.
 Identities = 122/379 (32%), Positives = 178/379 (46%), Gaps = 49/379 (12%)

Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
           + EY V + VG+PPR   + +D+GSD+VW QC PC  C+ Q  P+ DPA S++++ + C 
Sbjct: 89  TNEYLVHLAVGTPPRPVALTLDTGSDLVWTQCAPCRDCFHQGLPLLDPAASSTYAALPCG 148

Query: 213 SAVCDRLENAGCHAG----------RCRYEVSYGDGSYTKGTLALETLTIG--------R 254
           +  C  L    C  G           C Y   YGD S T G +A +  T G        R
Sbjct: 149 APRCRALPFTSCGGGGRSSWGNGNRSCAYIYHYGDKSVTVGEIATDRFTFGGDNGDGDSR 208

Query: 255 TVVKNVAIGCGHKNQGMF-VGAAGLLGLGGGSMSLVGQLGGQTGGAFSYC---------- 303
              + +  GCGH N+G+F     G+ G G G  SL  QL   T   FSYC          
Sbjct: 209 LPTRRLTFGCGHFNKGVFQSNETGIAGFGRGRWSLPSQLNVTT---FSYCFTSMFESKSS 265

Query: 304 LVSRGTGSSGSLVFGREALPVGAA-WVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLF 362
           LV+ G   + +L++   A   G     PL++NP  PS Y++ L G+ VG  R+ + E   
Sbjct: 266 LVTLGGAPAAALLYSHAAHISGEVRTTPLLKNPSQPSLYFLSLKGISVGKTRLAVPEAKL 325

Query: 363 RLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGV---SIFDTCYNLSG 419
           R T       ++D+G ++T LP   YEA +  F AQ G  P  +GV   S  D C+ L  
Sbjct: 326 RST-------IIDSGASITTLPEAVYEAVKAEFAAQVGLPP--TGVVEGSALDLCFALPV 376

Query: 420 FVSVR---VPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEG 476
               R   VP+++ +  G     LP  N++     A   C     +P   ++IGN QQ+ 
Sbjct: 377 TALWRRPPVPSLTLHLDGAD-WELPRGNYVFEDLAARVMCVVLDAAPGDQTVIGNFQQQN 435

Query: 477 IQISFDGANGFVGFGPNVC 495
             + +D  N ++ F P  C
Sbjct: 436 THVVYDLENDWLSFAPARC 454


>gi|357122568|ref|XP_003562987.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 455

 Score =  192 bits (487), Expect = 5e-46,   Method: Compositional matrix adjust.
 Identities = 150/477 (31%), Positives = 236/477 (49%), Gaps = 60/477 (12%)

Query: 54  MSQYNELFERHNNISSSNTSSDEARWNLELVHRD-KMSSSSNTTNNMHYHRHQHSFHARM 112
           M+ ++ L      I +S+ ++   R  L  +H D ++++S      +    H+H+  AR 
Sbjct: 1   MASFSVLLILACTILASDAAA-AVRVGLTRIHADPEVTASEFVRGALRRDMHRHARFARE 59

Query: 113 QRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMV 172
           Q            L+   A AA   V   G      +  G GEY + + +G+PP S   +
Sbjct: 60  Q------------LAPSSAAAAGLTV---GAPTQKDLRNG-GEYIMTLSIGTPPLSYRAI 103

Query: 173 IDSGSDIVWVQCQPC--------SQCYKQSDPVFDPADSASFSGVSCSS--AVCDRLENA 222
            D+GSD++W QC PC        +QC+KQS  +++P+ S +F  + C+S  ++C  +   
Sbjct: 104 ADTGSDLIWTQCAPCGDTVTDTDNQCFKQSGCLYNPSSSTTFGVLPCNSPLSMCAAMAGP 163

Query: 223 GCHAG-RCRYEVSYGDGSYTKGTLALETLTIGRTV------VKNVAIGCGHKNQGMFVGA 275
               G  C Y  +YG G +T G  ++ET T G +       V N+A GC + +   + G+
Sbjct: 164 SPPPGCACMYNQTYGTG-WTAGVQSVETFTFGSSSTPPAVRVPNIAFGCSNASSNDWNGS 222

Query: 276 AGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREAL-------PVGAA 327
           AGL+GLG GSMSLV QLG    GAFSYCL   +   S+ +L+ G  A        PV + 
Sbjct: 223 AGLVGLGRGSMSLVSQLG---AGAFSYCLTPFQDANSTSTLLLGPSAAAALKGTGPVRS- 278

Query: 328 WVPLVRNP-RAP--SFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLP 384
             P V  P +AP  ++YY+ L+G+ VG   + I  D F L   G  G+++D+GT +T L 
Sbjct: 279 -TPFVAGPSKAPMSTYYYLNLTGISVGETALAIPPDAFSLRADGTGGLIIDSGTTITTLV 337

Query: 385 TPAYEAFRDAFVA-QTGNLPRASGV---SIFDTCYNLSGFV-SVRVPTVSFYFSGGPVLT 439
             AY+  R A  +     LP A G    +  D C+ L        +P+++ +F GG  + 
Sbjct: 338 DSAYQQVRAAVRSLLVTRLPLAHGPDHSTGLDLCFALKASTPPPAMPSMTLHFEGGADMV 397

Query: 440 LPASNFLIPVDDAGTFCFAFAPSPSG-LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           LP  N++I    +G +C A      G +S++GN QQ+ I + +D     + F P VC
Sbjct: 398 LPVENYMI--LGSGVWCLAMRNQTVGAMSMVGNYQQQNIHVLYDVRKETLSFAPAVC 452


>gi|413953788|gb|AFW86437.1| hypothetical protein ZEAMMB73_618532 [Zea mays]
          Length = 469

 Score =  191 bits (486), Expect = 5e-46,   Method: Compositional matrix adjust.
 Identities = 130/356 (36%), Positives = 183/356 (51%), Gaps = 25/356 (7%)

Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC--SQCYKQSDPVFDPADSASFSGVS 210
           S EY   +G+G+P   Q +++D+GS + WVQC+PC  SQCY Q  P+FDP  S+S+S V 
Sbjct: 126 SQEYVATVGLGTPAVPQTLILDTGSSLTWVQCKPCNSSQCYPQRLPLFDPNTSSSYSPVP 185

Query: 211 CSSAVCDRL----ENAGCHAGR---CRYEVSYGDGSYTKGTLALETLTIGR-TVVKNVAI 262
           C S  C  L    +  GC +     C YE+ YG G+   G  + + LT+G   +VK    
Sbjct: 186 CDSQECRALAAGIDGDGCTSDGDWGCAYEIHYGSGATPAGEYSTDALTLGPGAIVKRFHF 245

Query: 263 GCGHKNQ-GMFVGAAGLLGLGGGSMSLVGQLGGQTGG-AFSYCLVSRGTGSSGSLVFGRE 320
           GCGH  Q G F  A G+LGLG    SL  Q   + GG  FS+CL   G  S+G L  G  
Sbjct: 246 GCGHHQQRGKFDMADGVLGLGRLPQSLAWQASARRGGGVFSHCLPPTGV-STGFLALGAP 304

Query: 321 ALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAV 380
                  + PL+     P FY +  + + V G  + I   +FR      +GV+ D+GT +
Sbjct: 305 HDTSAFVFTPLLTMDDQPWFYQLMPTAISVAGQLLDIPPAVFR------EGVITDSGTVL 358

Query: 381 TRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTL 440
           + L   AY A R AF +     P A  V   DTC+N +G+ +V VPTVS  F GG  + L
Sbjct: 359 SALQETAYTALRTAFRSAMAEYPLAPPVGHLDTCFNFTGYDNVTVPTVSLTFRGGATVHL 418

Query: 441 PASNFLIPVDDAGTFCFAFAPSPSGLS-IIGNIQQEGIQISFDGANGFVGFGPNVC 495
            AS+ ++ +D     C AF  S    + +IG++ Q  I++ +D     VGF    C
Sbjct: 419 DASSGVL-MDG----CLAFWSSGDEYTGLIGSVSQRTIEVLYDMPGRKVGFRTGAC 469


>gi|297820186|ref|XP_002877976.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297323814|gb|EFH54235.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 425

 Score =  191 bits (486), Expect = 5e-46,   Method: Compositional matrix adjust.
 Identities = 121/349 (34%), Positives = 176/349 (50%), Gaps = 15/349 (4%)

Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
           S  Y VR  +G+P ++  + +D+ +D  W+ C  C  C   S  +FDP+ S+S   + C 
Sbjct: 85  SPTYIVRANIGTPAQAMLVALDTSNDAAWIPCSGCVGC--SSSVLFDPSKSSSSRTLQCE 142

Query: 213 SAVCDRLENAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGM 271
           +  C +  N  C   + C + ++YG GS  +  L  +TLT+   V+ N   GC +K  G 
Sbjct: 143 APQCKQAPNPSCTVSKSCGFNMTYG-GSAIEAYLTQDTLTLATDVIPNYTFGCINKASGT 201

Query: 272 FVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLV-SRGTGSSGSLVFGREALPVGAAWVP 330
            + A GL+GLG G +SL+ Q        FSYCL  S+ +  SGSL  G +  P+     P
Sbjct: 202 SLPAQGLMGLGRGPLSLISQSQNLYQSTFSYCLPNSKSSNFSGSLRLGPKNQPIRIKTTP 261

Query: 331 LVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEA 390
           L++NPR  S YYV L G+ VG   + I             G + D+GT  TRL  PAY A
Sbjct: 262 LLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDPATGAGTIFDSGTVYTRLVEPAYVA 321

Query: 391 FRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVD 450
            R+ F  +  N   A+ +  FDTCY  SG  SV  P+V+F F+G  V TLP  N LI   
Sbjct: 322 MRNEFRRRVKNA-NATSLGGFDTCY--SG--SVVFPSVTFMFAGMNV-TLPPDNLLIHSS 375

Query: 451 DAGTFCFAFAPSPSG----LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
                C A A +P+     L++I ++QQ+  ++  D  N  +G     C
Sbjct: 376 AGNLSCLAMAAAPTNVNSVLNVIASMQQQNHRVLIDVPNSRLGISRETC 424


>gi|326507654|dbj|BAK03220.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 442

 Score =  191 bits (486), Expect = 6e-46,   Method: Compositional matrix adjust.
 Identities = 135/404 (33%), Positives = 210/404 (51%), Gaps = 36/404 (8%)

Query: 112 MQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYM 171
           ++RD+ R A   R L+  G        +    D+ +G     GEY + + +G+PP S   
Sbjct: 52  LRRDMHRHARFTRELASSGDRTVAAPTRK---DLPNG-----GEYIMTLAIGTPPLSYPA 103

Query: 172 VIDSGSDIVWVQCQPC-SQCYKQSDPVFDPADSASFSGVSCSSAV--CDRLENAGCHAG- 227
           + D+GSD++W QC PC SQC+KQ+   ++P+ S +F  + C+S+V  C  L       G 
Sbjct: 104 IADTGSDLIWTQCAPCGSQCFKQAGQPYNPSSSTTFGVLPCNSSVSMCAALAGPSPPPGC 163

Query: 228 RCRYEVSYGDGSYTKGTLALETLTIG-----RTVVKNVAIGCGHKNQGMFVGAAGLLGLG 282
            C Y  +YG G +T G  ++ET T G     +T V  +A GC + +   + G+AGL+GLG
Sbjct: 164 SCMYNQTYGTG-WTAGIQSVETFTFGSTPADQTRVPGIAFGCSNASSDDWNGSAGLVGLG 222

Query: 283 GGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALP--VGAAWVPLVRNP-RAP 338
            GSMSLV QLG    G FSYCL   +   S+ +L+ G  A     G    P V +P +AP
Sbjct: 223 RGSMSLVSQLG---AGMFSYCLTPFQDANSTSTLLLGPSAALNGTGVLTTPFVASPSKAP 279

Query: 339 --SFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFV 396
             ++YY+ L+G+ +G   + I  + F L   G  G+++D+GT +T L   AY+  R A +
Sbjct: 280 MSTYYYLNLTGISIGTTALSIPPNAFALRTDGTGGLIIDSGTTITSLVDAAYQQVRAA-I 338

Query: 397 AQTGNLPRASGVSI--FDTCYNLSGFVSV--RVPTVSFYFSGGPVLTLPASNFLIPVDDA 452
                LP A G      D C+ L+   S    +P+++F+F G  ++ LP  N++I    +
Sbjct: 339 ESLVTLPVADGSDSTGLDLCFALTSETSTPPSMPSMTFHFDGADMV-LPVDNYMI--LGS 395

Query: 453 GTFCFAFAPSPSG-LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           G +C A      G +S  GN QQ+ + + +D     + F P  C
Sbjct: 396 GVWCLAMRNQTVGAMSTFGNYQQQNVHLLYDIHEETLSFAPAKC 439


>gi|358346726|ref|XP_003637416.1| Aspartic proteinase nepenthesin-2, partial [Medicago truncatula]
 gi|355503351|gb|AES84554.1| Aspartic proteinase nepenthesin-2, partial [Medicago truncatula]
          Length = 165

 Score =  191 bits (485), Expect = 8e-46,   Method: Compositional matrix adjust.
 Identities = 93/165 (56%), Positives = 117/165 (70%)

Query: 331 LVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEA 390
           L RNP+  ++YYVGL G+ VGG  + I E  F +   G+ G+++D+GTAVTRL +  Y  
Sbjct: 1   LRRNPQLDTYYYVGLVGISVGGELLAIPETSFEVDSAGNGGIIVDSGTAVTRLQSDVYNV 60

Query: 391 FRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVD 450
            RDAFV  T +L   + VS+FDTCY+LS   SV VPTV+F+F  G VL LPA N+L+PVD
Sbjct: 61  VRDAFVKGTKDLLATNEVSLFDTCYDLSSKTSVEVPTVAFHFGEGKVLVLPAKNYLVPVD 120

Query: 451 DAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
             GTFCFAFAP+ S LSIIGNIQQ+G ++SFD AN  VGF PN C
Sbjct: 121 SVGTFCFAFAPTMSSLSIIGNIQQQGTRVSFDLANSLVGFSPNRC 165


>gi|357124861|ref|XP_003564115.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 477

 Score =  191 bits (485), Expect = 8e-46,   Method: Compositional matrix adjust.
 Identities = 128/384 (33%), Positives = 176/384 (45%), Gaps = 47/384 (12%)

Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQ-SDPVFDPADSASFSGVSC 211
           + EY V + VG+PPR   + +D+GSD+VW QC PC  C+ Q + PV DPA S++ + V C
Sbjct: 91  TNEYLVHLSVGTPPRPVALTLDTGSDLVWTQCAPCLNCFDQGAIPVLDPAASSTHAAVRC 150

Query: 212 SSAVCDRLENAGCHAG-------RCRYEVSYGDGSYTKGTLALETLTIGR--------TV 256
            + VC  L    C  G        C Y   YGD S T G LA +  T G           
Sbjct: 151 DAPVCRALPFTSCGRGGSSWGERSCVYVYHYGDKSITVGKLASDRFTFGPGDNADGGGVS 210

Query: 257 VKNVAIGCGHKNQGMF-VGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSL 315
            + +  GCGH N+G+F     G+ G G G  SL  QLG  +   FSYC  S    +S  +
Sbjct: 211 ERRLTFGCGHFNKGIFQANETGIAGFGRGRWSLPSQLGVTS---FSYCFTSMFESTSSLV 267

Query: 316 VFGREA----LPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDG 371
             G       L       PL+R+P  PS Y++ L  + VG  RIPI E   R  ++ +  
Sbjct: 268 TLGVAPAELHLTGQVQSTPLLRDPSQPSLYFLSLKAITVGATRIPIPE---RRQRLREAS 324

Query: 372 VVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNL-------SGF---- 420
            ++D+G ++T LP   YEA +  FVAQ G    A   S  D C+ L       S F    
Sbjct: 325 AIIDSGASITTLPEDVYEAVKAEFVAQVGLPVSAVEGSALDLCFALPSAAAPKSAFGWRW 384

Query: 421 ------VSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSG---LSIIGN 471
                 + VRVP + F+  GG    LP  N++     A   C     +  G     +IGN
Sbjct: 385 RGRGRAMPVRVPRLVFHLGGGADWELPRENYVFEDYGARVMCLVLDAATGGGDQTVVIGN 444

Query: 472 IQQEGIQISFDGANGFVGFGPNVC 495
            QQ+   + +D  N  + F P  C
Sbjct: 445 YQQQNTHVVYDLENDVLSFAPARC 468


>gi|21617933|gb|AAM66983.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
          Length = 425

 Score =  191 bits (484), Expect = 9e-46,   Method: Compositional matrix adjust.
 Identities = 122/349 (34%), Positives = 175/349 (50%), Gaps = 15/349 (4%)

Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
           S  Y VR  +G+P +   + +D+ +D  W+ C  C  C   S  +FDP+ S+S   + C 
Sbjct: 85  SPTYIVRANIGTPAQPMLVALDTSNDAAWIPCSGCVGC--SSSVLFDPSKSSSSRTLQCE 142

Query: 213 SAVCDRLENAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGM 271
           +  C +  N  C   + C + ++YG GS  +  L  +TLT+   V+ N   GC +K  G 
Sbjct: 143 APQCKQAPNPSCTVSKSCGFNMTYG-GSTIEAYLTQDTLTLASDVIPNYTFGCINKASGT 201

Query: 272 FVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLV-SRGTGSSGSLVFGREALPVGAAWVP 330
            + A GL+GLG G +SL+ Q        FSYCL  S+ +  SGSL  G +  P+     P
Sbjct: 202 SLPAQGLMGLGRGPLSLISQSQNLYQSTFSYCLPNSKSSNFSGSLRLGPKNQPIRIKTTP 261

Query: 331 LVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEA 390
           L++NPR  S YYV L G+ VG   + I             G + D+GT  TRL  PAY A
Sbjct: 262 LLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDPATGAGTIFDSGTVYTRLVEPAYVA 321

Query: 391 FRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVD 450
            R+ F  +  N   A+ +  FDTCY  SG  SV  P+V+F F+G  V TLP  N LI   
Sbjct: 322 VRNEFRRRVKNA-NATSLGGFDTCY--SG--SVVFPSVTFMFAGMNV-TLPPDNLLIHSS 375

Query: 451 DAGTFCFAFAPSP----SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
                C A A +P    S L++I ++QQ+  ++  D  N  +G     C
Sbjct: 376 AGNLSCLAMAAAPVNVNSVLNVIASMQQQNHRVLIDVPNSRLGISRETC 424


>gi|357500973|ref|XP_003620775.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|357500991|ref|XP_003620784.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355495790|gb|AES76993.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355495799|gb|AES77002.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 438

 Score =  191 bits (484), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 142/435 (32%), Positives = 218/435 (50%), Gaps = 46/435 (10%)

Query: 79  WNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEV 138
           +++EL+HRD   SS +       +++QH   A + R + RV       S   + A+  E 
Sbjct: 28  FSIELIHRD---SSKSPFYKPTQNKYQHVVDA-VHRSINRV-----NHSNKNSLASTPE- 77

Query: 139 QDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVF 198
                   S +    G+Y +   VG+PP   Y ++D+GSDIVW+QC+PC QCY Q+ P F
Sbjct: 78  --------STVISYEGDYIMSYSVGTPPIKSYGIVDTGSDIVWLQCEPCEQCYNQTTPKF 129

Query: 199 DPADSASFSGVSCSSAVCDRLENAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRTVV 257
           +P+ S+S+  +SCSS +C  + +  C+  + C Y ++YG+ S+++G L+LETLT+  T  
Sbjct: 130 NPSKSSSYKNISCSSKLCQSVRDTSCNDKKNCEYSINYGNQSHSQGDLSLETLTLESTTG 189

Query: 258 KNVA-----IGCGHKNQGMFV-GAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRG--- 308
           + V+     IGCG  N G F   ++G++GLGGG  SL+ QLG   GG FSYCLV      
Sbjct: 190 RPVSFPKTVIGCGTNNIGSFKRVSSGVVGLGGGPASLITQLGPSIGGKFSYCLVRMSITL 249

Query: 309 ---TGSSGSLVFGREALPVG--AAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFR 363
              +  S  L FG  A+  G      P+V+   +  FYY+ +    VG  R+      F 
Sbjct: 250 KNMSMGSSKLNFGDVAIVSGHNVLSTPIVKKDHS-FFYYLTIEAFSVGDKRVE-----FA 303

Query: 364 LTQMG--DDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVS-IFDTCYNLSGF 420
            +  G  +  +++D+ T VT +P+  Y     A V     L R    +  F  CYN+S  
Sbjct: 304 GSSKGVEEGNIIIDSSTIVTFVPSDVYTKLNSAIVDLV-TLERVDDPNQQFSLCYNVSSD 362

Query: 421 VSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQIS 480
                P ++ +F G  +L    + F+    D    CFAFAPS  G +I G+  Q+   + 
Sbjct: 363 EEYDFPYMTAHFKGADILLYATNTFVEVARDV--LCFAFAPSNGG-AIFGSFSQQDFMVG 419

Query: 481 FDGANGFVGFGPNVC 495
           +D     V F    C
Sbjct: 420 YDLQQKTVSFKSVDC 434


>gi|15232503|ref|NP_191008.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|7288018|emb|CAB81805.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
 gi|17979257|gb|AAL49945.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
 gi|21700851|gb|AAM70549.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
 gi|332645705|gb|AEE79226.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 425

 Score =  191 bits (484), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 122/349 (34%), Positives = 175/349 (50%), Gaps = 15/349 (4%)

Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
           S  Y VR  +G+P +   + +D+ +D  W+ C  C  C   S  +FDP+ S+S   + C 
Sbjct: 85  SPTYIVRANIGTPAQPMLVALDTSNDAAWIPCSGCVGC--SSSVLFDPSKSSSSRTLQCE 142

Query: 213 SAVCDRLENAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGM 271
           +  C +  N  C   + C + ++YG GS  +  L  +TLT+   V+ N   GC +K  G 
Sbjct: 143 APQCKQAPNPSCTVSKSCGFNMTYG-GSTIEAYLTQDTLTLASDVIPNYTFGCINKASGT 201

Query: 272 FVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLV-SRGTGSSGSLVFGREALPVGAAWVP 330
            + A GL+GLG G +SL+ Q        FSYCL  S+ +  SGSL  G +  P+     P
Sbjct: 202 SLPAQGLMGLGRGPLSLISQSQNLYQSTFSYCLPNSKSSNFSGSLRLGPKNQPIRIKTTP 261

Query: 331 LVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEA 390
           L++NPR  S YYV L G+ VG   + I             G + D+GT  TRL  PAY A
Sbjct: 262 LLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDPATGAGTIFDSGTVYTRLVEPAYVA 321

Query: 391 FRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVD 450
            R+ F  +  N   A+ +  FDTCY  SG  SV  P+V+F F+G  V TLP  N LI   
Sbjct: 322 VRNEFRRRVKNA-NATSLGGFDTCY--SG--SVVFPSVTFMFAGMNV-TLPPDNLLIHSS 375

Query: 451 DAGTFCFAFAPSP----SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
                C A A +P    S L++I ++QQ+  ++  D  N  +G     C
Sbjct: 376 AGNLSCLAMAAAPVNVNSVLNVIASMQQQNHRVLIDVPNSRLGISRETC 424


>gi|414590468|tpg|DAA41039.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
          Length = 469

 Score =  191 bits (484), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 136/420 (32%), Positives = 206/420 (49%), Gaps = 33/420 (7%)

Query: 102 HRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGS----GEYF 157
           H    +   +  RD  R     +R    G D  +   +  G   VS   +      GEY 
Sbjct: 54  HSDPDTTAPQFVRDALRRDMHRQRSRSFGRDRDRELAESDGRTTVSARTRKDLPNGGEYL 113

Query: 158 VRIGVGSPPRSQYMVIDSGSDIVWVQCQPC-SQCYKQSDPVFDPADSASFSGVSCSSAV- 215
           + + +G+PP     V D+GSD++W QC PC +QC++Q  P+++PA S +FS + C+S++ 
Sbjct: 114 MTLAIGTPPLPYAAVADTGSDLIWTQCAPCGTQCFEQPAPLYNPASSTTFSVLPCNSSLS 173

Query: 216 -CDRLENAGCHAGRC--RYEVSYGDGSYTKGTLALETLTIGRTV-----VKNVAIGCGHK 267
            C            C   Y  +YG G +T G    ET T G +      V  VA GC + 
Sbjct: 174 MCAGALAGAAPPPGCACMYNQTYGTG-WTAGVQGSETFTFGSSAADQARVPGVAFGCSNA 232

Query: 268 NQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALP--V 324
           +   + G+AGL+GLG GS+SLV QLG    G FSYCL   + T S+ +L+ G  A     
Sbjct: 233 SSSDWNGSAGLVGLGRGSLSLVSQLGA---GRFSYCLTPFQDTNSTSTLLLGPSAALNGT 289

Query: 325 GAAWVPLVRNP-RAP--SFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVT 381
           G    P V +P RAP  ++YY+ L+G+ +G   +PIS   F L   G  G+++D+GT +T
Sbjct: 290 GVRSTPFVASPARAPMSTYYYLNLTGISLGAKALPISPGAFSLKPDGTGGLIIDSGTTIT 349

Query: 382 RLPTPAYEAFRDAFVAQTGNLPRASGVSI--FDTCYNLSGFVSVR---VPTVSFYFSGGP 436
            L   AY+  R A  +    LP   G      D C+ L    S     +P+++ +F G  
Sbjct: 350 SLANAAYQQVRAAVKSLVTTLPTVDGSDSTGLDLCFALPAPTSAPPAVLPSMTLHFDGAD 409

Query: 437 VLTLPASNFLIPVDDAGTFCFAFAPSPSG-LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           ++ LPA +++I    +G +C A      G +S  GN QQ+ + I +D     + F P  C
Sbjct: 410 MV-LPADSYMI--SGSGVWCLAMRNQTDGAMSTFGNYQQQNMHILYDVREETLSFAPAKC 466


>gi|302758750|ref|XP_002962798.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
 gi|300169659|gb|EFJ36261.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
          Length = 427

 Score =  190 bits (483), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 137/397 (34%), Positives = 205/397 (51%), Gaps = 43/397 (10%)

Query: 138 VQDF-GTD------VVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQC 190
           +QDF G D      +VSG   GSG+YFV + VG+P +   +++D+GSD+ W+QC P +  
Sbjct: 34  IQDFQGEDPALFSRLVSGSSIGSGQYFVELRVGTPAKKFPLIVDTGSDLTWIQCNPPNTT 93

Query: 191 YKQSD---PVFDPADSASFSGVSCSSAVCDRLE---NAGC---HAGRCRYEVSYGDGSYT 241
              S    P +D + S+S+  + C+   C  L     + C       C Y   Y D S T
Sbjct: 94  ANSSSPPAPWYDKSSSSSYREIPCTDDECQFLPAPIGSSCSITSPSPCDYTYGYSDQSRT 153

Query: 242 KGTLALETLTIG---------------RTVVKNVAIGCGHKNQGM-FVGAAGLLGLGGGS 285
            G LA ET+++                R  +KNVA+GC  ++ G  F+GA+G+LGLG G 
Sbjct: 154 TGILAYETISMKSRKRSGKRAGNHKTRRIRIKNVALGCSRESVGASFLGASGVLGLGQGP 213

Query: 286 MSLVGQLGGQT-GGAFSYCLVS--RGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYY 342
           +SL  Q      GG FSYCLV   RG+ +S  LV GR       A  P+VRNP A SFYY
Sbjct: 214 ISLATQTRHTALGGIFSYCLVDYLRGSNASSFLVMGRTHW-RKLAHTPIVRNPAAQSFYY 272

Query: 343 VGLSGLGVGGMRIP-ISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGN 401
           V ++G+ V G  +  I+   + +   G+ G + D+GT ++ L  PAY     A  A    
Sbjct: 273 VNVTGVAVDGKPVDGIASSDWGIDGDGNKGTIFDSGTTLSYLREPAYSKVLGALNASI-Y 331

Query: 402 LPRASGVSI-FDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFA 460
           LPRA  +   F+ CYN++  +   +P +   F GG V+ LP +N+++ V +    C A  
Sbjct: 332 LPRAQEIPEGFELCYNVTR-MEKGMPKLGVEFQGGAVMELPWNNYMVLVAE-NVQCVALQ 389

Query: 461 P--SPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
              + +G +I+GN+ Q+   I +D A   +GF  + C
Sbjct: 390 KVTTTNGSNILGNLLQQDHHIEYDLAKARIGFKWSPC 426


>gi|225427556|ref|XP_002266575.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
          Length = 445

 Score =  190 bits (483), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 139/418 (33%), Positives = 206/418 (49%), Gaps = 27/418 (6%)

Query: 95  TTNNMHYHRHQHSFHARMQRDVKRVATLVRR--LSGGGADAAKHEVQDFGTDVVSGMDQG 152
           TT+ +     +  F+   +   +R+    RR  L G    A +    D  ++V+SG    
Sbjct: 35  TTDFISRDSPRSPFYNPSETKYQRLQKAFRRSILRGNHFRAIRASPNDIQSNVISG---- 90

Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
            G Y + I +G+PP S   + D+GSD++W QC PC  CYKQ +P+FDP  S ++  + C+
Sbjct: 91  GGSYLMNISLGTPPVSMLGIADTGSDLIWRQCLPCDDCYKQVEPLFDPKKSKTYKTLGCN 150

Query: 213 SAVCDRLENAGC--HAGRCRYEVSYGDGSYTKGTLALETLTIGRT-----VVKNVAIGCG 265
           +  C  L   G       C    SYGD SYT+  L+ ET TIG T         +A GCG
Sbjct: 151 NDFCQDLGQQGSCGDDNTCTSSYSYGDQSYTRRDLSSETFTIGSTEGDPASFPGLAFGCG 210

Query: 266 HKNQGMF-VGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGS--LVFGREAL 322
           H N G F    +GL+GLGGG +SLV QL  + GG FSYCLV   + S+ S  + FG+ A+
Sbjct: 211 HSNGGTFNEKDSGLIGLGGGPLSLVMQLSSKVGGQFSYCLVPLSSDSTASSKINFGKSAV 270

Query: 323 PVGAAWV--PLVRNPRAPSFYYVGLSGLGVGGMRIP---ISEDLFRLTQMGDDGVVMDTG 377
             G+  V  PL++     +FYY+ L G+ +G  ++     S++        +  +++D+G
Sbjct: 271 VSGSGTVSTPLIKG-TPDTFYYLTLEGMSLGSEKVAFKGFSKNKSSPAAAEESNIIIDSG 329

Query: 378 TAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPV 437
           T +T LP   Y     A     G          F  CY  SG   + +PT++ +F G  V
Sbjct: 330 TTLTLLPRDFYTDMESALTKVIGGQTTTDPRGTFSLCY--SGVKKLEIPTITAHFIGADV 387

Query: 438 LTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
              P + F+   +D    CF+  PS S L+I GN+ Q    + +D  N  V F P  C
Sbjct: 388 QLPPLNTFVQAQEDL--VCFSMIPS-SNLAIFGNLSQMNFLVGYDLKNNKVSFKPTDC 442


>gi|359492937|ref|XP_002283889.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 439

 Score =  190 bits (483), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 118/350 (33%), Positives = 175/350 (50%), Gaps = 16/350 (4%)

Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSS 213
           G Y VR+ +G+P +  +MV+D+  D  WV C  C+ C   S P F P  S++++ + CS 
Sbjct: 97  GNYVVRVKLGTPGQLMFMVLDTSRDAAWVPCADCAGC---SSPTFSPNTSSTYASLQCSV 153

Query: 214 AVCDRLENAGCHA---GRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQG 270
             C ++    C       C +  +YG  S     L+ ++L +    + + + GC +   G
Sbjct: 154 PQCTQVRGLSCPTTGTAACFFNQTYGGDSSFSAMLSQDSLGLAVDTLPSYSFGCVNAVSG 213

Query: 271 MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALPVGAAWV 329
             +   GLLGLG G MSL+ Q G    G FSYC  S +    SGSL  G    P      
Sbjct: 214 STLPPQGLLGLGRGPMSLLSQSGSLYSGVFSYCFPSFKSYYFSGSLRLGPLGQPKNIRTT 273

Query: 330 PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYE 389
           PL+RNP  P+ YYV L+G+ VG + +P++ +L         G ++D+GT +TR   P Y 
Sbjct: 274 PLLRNPHRPTLYYVNLTGVSVGRVLVPVAPELLAFDPNTGAGTIIDSGTVITRFVEPVYA 333

Query: 390 AFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPV 449
           A RD F  Q    P A+ +  FDTC+  +       P V+F+F+G   L LP  N LI  
Sbjct: 334 AIRDEFRKQVKG-PFAT-IGAFDTCFAATN--EDIAPPVTFHFTGMD-LKLPLENTLIHS 388

Query: 450 DDAGTFCFAFAPSP----SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
                 C A A +P    S L++I N+QQ+ ++I FD  N  +G    +C
Sbjct: 389 SAGSLACLAMAAAPNNVNSVLNVIANLQQQNLRIMFDVTNSRLGIARELC 438


>gi|413938616|gb|AFW73167.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
          Length = 452

 Score =  190 bits (483), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 150/429 (34%), Positives = 209/429 (48%), Gaps = 58/429 (13%)

Query: 81  LELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADA--AKHEV 138
           L L HR    + S  ++         S    ++ D +R   ++RR+SG       +K   
Sbjct: 68  LRLTHRHGPCAPSRASS-----LAAPSVADTLRADQRRAEYILRRVSGRAPQLWDSKAAA 122

Query: 139 QDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS---QCYKQSD 195
                    G D G+  Y V   +G+P  +Q M +D+GSD+ WVQC+PC+    CY Q D
Sbjct: 123 AVATVPASWGYDIGTLNYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCAAAPSCYSQKD 182

Query: 196 PVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT 255
           P+FDPA S+S++ V C   VC  L   G +A         G                   
Sbjct: 183 PLFDPAQSSSYAAVPCGGPVCAGL---GIYAASACSAAQCG------------------- 220

Query: 256 VVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSL 315
            V+    GCGH   G+F G  GLLGLG    SLV Q  G  GG FSYCL ++ + ++G L
Sbjct: 221 AVQGFFFGCGHAQSGLFNGVDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPS-TAGYL 279

Query: 316 VFGREALPVGAA----WVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDG 371
             G    P GAA       L+ +P AP++Y V L+G+ VGG ++ +    F    +    
Sbjct: 280 TLGVGG-PSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGGTV---- 334

Query: 372 VVMDTGTAVTRLPTPAYEAFRDAFVAQTGN--LPRASGVSIFDTCYNLSGFVSVRVPTVS 429
             +DTGT VTRLP  AY A R AF +   +   P A    I DTCYN +G+ +V +P V+
Sbjct: 335 --VDTGTVVTRLPPTAYAALRSAFRSGMASYGYPTAPSNGILDTCYNFAGYGTVTLPNVA 392

Query: 430 FYFSGGPVLTLPASNFLIPVDDAGTF-CFAFAPSPS--GLSIIGNIQQEGIQISFDGANG 486
             F  G  +TL A   L       +F C AFAPS S  G++I+GN+QQ   ++  DG + 
Sbjct: 393 LTFGSGATVTLGADGIL-------SFGCLAFAPSGSDGGMAILGNVQQRSFEVRIDGTS- 444

Query: 487 FVGFGPNVC 495
            VGF P+ C
Sbjct: 445 -VGFKPSSC 452


>gi|413952720|gb|AFW85369.1| hypothetical protein ZEAMMB73_571116 [Zea mays]
          Length = 451

 Score =  189 bits (481), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 128/382 (33%), Positives = 183/382 (47%), Gaps = 32/382 (8%)

Query: 141 FGTDVVSGMDQGSGEYFVRIGVGSP-PRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFD 199
           +G  V +     SGEY +   +G+P P+   + +D+GSD+VW QC PC  C+ Q  P+FD
Sbjct: 72  YGQPVTATAVPSSGEYLIHFNIGTPRPQRVALTMDTGSDLVWTQCTPCPVCFDQPFPLFD 131

Query: 200 PADSASFSGVSCSSAVCDR---LENAGC--HAGRCRYEVSYGDGSYTKGTLALETLTI-- 252
           P+ S++F  V+C   +C     L  + C     RC Y  SYGD S T G +  +T T   
Sbjct: 132 PSVSSTFRAVACPDPICRPSSGLSVSACALKTFRCFYLCSYGDKSITAGYIFKDTFTFMS 191

Query: 253 ------GRTVVKNVAIGCGHKNQGMFV-GAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLV 305
                     V  +A GCG  N G+F    +G+ G G G +SL  QL     G FSYCL 
Sbjct: 192 PNGEGAPPVAVSGLAFGCGDYNTGVFASNESGIAGFGRGPLSLPSQL---RVGRFSYCLT 248

Query: 306 SRGTGSSG--SLVF------GREALPVGA-AWVPLVRNPRAPSFYYVGLSGLGVGGMRIP 356
           S     S   S VF      G  A   G     P++ +P  P+FYY+ L G+ VG  R+P
Sbjct: 249 SHDETESNKTSAVFLGTPPNGLRAHSSGPFRSTPIIHSPSFPTFYYLSLEGITVGKTRLP 308

Query: 357 ISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDT--C 414
           +   +F L + G  G V+D+GT VT  P   +E  ++ FVAQ   LPR    S      C
Sbjct: 309 VDSSVFALKKDGSGGTVIDSGTGVTTFPAAVFEQLKNEFVAQL-PLPRYDNTSEVGNLLC 367

Query: 415 YNL-SGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQ 473
           +    G   V VP + F+ +    + LP  N++    D+G  C     +   + +IGN Q
Sbjct: 368 FQRPKGGKQVPVPKLIFHLASAD-MDLPRENYIPEDTDSGVMCLMINGAEVDMVLIGNFQ 426

Query: 474 QEGIQISFDGANGFVGFGPNVC 495
           Q+ + I +D  N  + F    C
Sbjct: 427 QQNMHIVYDVENSKLLFASAQC 448


>gi|449462553|ref|XP_004149005.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 418

 Score =  189 bits (481), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 132/430 (30%), Positives = 208/430 (48%), Gaps = 55/430 (12%)

Query: 79  WNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEV 138
           +   L HRD + S    ++  HY R  ++F    +R + R A L+ R +  GA   +  +
Sbjct: 30  FTTSLFHRDSLLSPLEFSSLSHYDRLANAF----RRSLSRSAALLNRAATSGAVGLQSSI 85

Query: 139 QDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVF 198
                                  +G+PP     + D+GSD+ W QC PC +CY+Q  P+F
Sbjct: 86  -----------------------IGTPPVDYLGIADTGSDLTWAQCLPCLKCYQQLRPIF 122

Query: 199 DPADSASFSGVSCSSAVCDRLENAGCHA-GRCRYEVSYGDGSYTKGTLALETLTIGRTVV 257
           +P  S SFS V C++  C  +++  C   G C Y  +YGD +Y+KG L  E +TIG + V
Sbjct: 123 NPLKSTSFSHVPCNTQTCHAVDDGHCGVQGVCDYSYTYGDRTYSKGDLGFEKITIGSSSV 182

Query: 258 KNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGA---FSYCLVSRGTGSSGS 314
           K+V IGCGH + G F  A+G++GLGGG +SLV Q+  QT G    FSYCL +  + ++G 
Sbjct: 183 KSV-IGCGHASSGGFGFASGVIGLGGGQLSLVSQM-SQTSGISRRFSYCLPTLLSHANGK 240

Query: 315 LVFGREALPVGAAWV--PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGV 372
           + FG+ A+  G   V  PL+ +    ++YY+ L  + +G  R         +       V
Sbjct: 241 INFGQNAVVSGPGVVSTPLI-SKNTVTYYYITLEAISIGNER--------HMAFAKQGNV 291

Query: 373 VMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGV----SIFDTCYN--LSGFVSVRVP 426
           ++D+GT ++ LP   Y    D  V+    + +A  V    + +D C++  ++   S  +P
Sbjct: 292 IIDSGTTLSFLPKELY----DGVVSSLLKVVKAKRVKDPGNFWDLCFDDGINVATSSGIP 347

Query: 427 TVSFYFSGGP-VLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGAN 485
            ++  FSGG  V  LP + F    ++        A       IIGN+      I +D   
Sbjct: 348 IITAQFSGGANVNLLPVNTFQKVANNVNCLTLTPASPTDEFGIIGNLALANFLIGYDLEA 407

Query: 486 GFVGFGPNVC 495
             + F P VC
Sbjct: 408 KRLSFKPTVC 417


>gi|255553149|ref|XP_002517617.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223543249|gb|EEF44781.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 449

 Score =  189 bits (481), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 145/447 (32%), Positives = 223/447 (49%), Gaps = 53/447 (11%)

Query: 76  EAR---WNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGAD 132
           EAR   ++  L+HRD   S      + ++ R ++SFH  + R          R       
Sbjct: 26  EARNAGFSANLIHRDSSVSPLYNPRDTYFDRLRNSFHRSISR--------ANRFKPNSI- 76

Query: 133 AAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYK 192
           +A+  VQ   +D+V G     GEY +RI +G+P      + D+GSD++WVQCQPC  CYK
Sbjct: 77  SARALVQ---SDIVPG----GGEYLMRISIGNPQVEILAIADTGSDLIWVQCQPCEMCYK 129

Query: 193 QSDPVFDPADSASFSGVSCSSAVCDRL--ENAGCHA----GRCRYEVSYGDGSYTKGTLA 246
           Q+ P+FDP  S+S+  V C +  C++L  E   C A      C Y  SYGD S++ G LA
Sbjct: 130 QNSPIFDPRRSSSYRNVLCGNEFCNKLDGEARSCDARGFVKTCGYTYSYGDQSFSDGHLA 189

Query: 247 LETLTIGRT---------VVKNVAIGCGHKNQGMF-VGAAGLLGLGGGSMSLVGQLGGQT 296
           +E   IG T           + VA GCG KN G F    +G++GLGGGSMSLV QLG + 
Sbjct: 190 IERFGIGSTNSNTSAAIAYFQEVAFGCGTKNGGTFDELGSGIIGLGGGSMSLVSQLGPKL 249

Query: 297 GGAFSYCLV--SRGTGSSGSLVFGREALPVGAAW----VPLVRNPRAP-SFYYVGLSGLG 349
            G FSYCLV  S  +  +  + FG +    G+ +     PL+  P+ P ++YY+ L  + 
Sbjct: 250 SGKFSYCLVPTSEQSNYTSKINFGNDINISGSNYNVVSTPLL--PKKPETYYYLTLEAIS 307

Query: 350 VGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVS 409
           V   R+P +       + G+  +++D+GT +T L +  +    D+ V +     R S   
Sbjct: 308 VENKRLPYTNLWNGEVEKGN--IIIDSGTTLTFLDSEFFNNL-DSAVEEAVKGERVSDPH 364

Query: 410 -IFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSI 468
            +F+ C+      ++ +P ++ +F+G  V   P + F    +D    CF   PS + ++I
Sbjct: 365 GLFNICFKDEK--AIELPIITAHFTGADVELQPVNTFAKVEED--LLCFTMIPS-NDIAI 419

Query: 469 IGNIQQEGIQISFDGANGFVGFGPNVC 495
            GN+ Q    + +D     V F P  C
Sbjct: 420 FGNLAQMNFLVGYDLEKKAVSFLPTDC 446


>gi|50508279|dbj|BAD32128.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
           Group]
 gi|125600536|gb|EAZ40112.1| hypothetical protein OsJ_24555 [Oryza sativa Japonica Group]
          Length = 455

 Score =  189 bits (480), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 138/419 (32%), Positives = 207/419 (49%), Gaps = 36/419 (8%)

Query: 105 QHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGS 164
           +HS   R  RD  R+A L    +     A          +V + ++ G+G Y + I +G+
Sbjct: 42  KHSEAVR--RDGHRLAFLSYAATAAAGKATTTGTNSSSVNVQAQLENGAGAYNMNISLGT 99

Query: 165 PPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD--PVFDPADSASFSGVSCSSAVCDRLENA 222
           PP    +++D+GS+++W QC PC++C+ +    PV  PA S++FS + C+ + C  L  +
Sbjct: 100 PPLDFPVIVDTGSNLIWAQCAPCTRCFPRPTPAPVLQPARSSTFSRLPCNGSFCQYLPTS 159

Query: 223 G----CHA-GRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAG 277
                C+A   C Y  +YG G YT G LA ETLT+G      VA GC  +N      ++G
Sbjct: 160 SRPRTCNATAACAYNYTYGSG-YTAGYLATETLTVGDGTFPKVAFGCSTENG--VDNSSG 216

Query: 278 LLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALPVGAAWV---PLVR 333
           ++GLG G +SLV QL     G FSYCL S    G +  ++FG  A     + V   PL++
Sbjct: 217 IVGLGRGPLSLVSQLA---VGRFSYCLRSDMADGGASPILFGSLAKLTEGSVVQSTPLLK 273

Query: 334 NP--RAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMG-DDGVVMDTGTAVTRLPTPAYEA 390
           NP  +  + YYV L+G+ V    +P++   F  TQ G   G ++D+GT +T L    Y  
Sbjct: 274 NPYLQRSTHYYVNLTGIAVDSTELPVTGSTFGFTQTGLGGGTIVDSGTTLTYLAKDGYAM 333

Query: 391 FRDAFVAQTGNL----PRASGVSIFDTCYNLS---GFVSVRVPTVSFYFSGGPVLTLPAS 443
            + AF +Q  NL    P +      D CY  S   G  +VRVP ++  F+GG    +P  
Sbjct: 334 VKQAFQSQMANLNQTTPASGAPYDLDLCYKPSAGGGGKAVRVPRLALRFAGGAKYNVPVQ 393

Query: 444 NFL--IPVDDAGTF---CFAFAPSPSGL--SIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           N+   +  D  G     C    P+   L  SIIGN+ Q  + + +D   G   F P  C
Sbjct: 394 NYFAGVEADSQGRVTVACLLVLPATDDLPISIIGNLMQMDMHLLYDIDGGMFSFAPADC 452


>gi|125558629|gb|EAZ04165.1| hypothetical protein OsI_26307 [Oryza sativa Indica Group]
          Length = 455

 Score =  189 bits (480), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 138/419 (32%), Positives = 207/419 (49%), Gaps = 36/419 (8%)

Query: 105 QHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGS 164
           +HS   R  RD  R+A L    +     A          +V + ++ G+G Y + I +G+
Sbjct: 42  KHSEAVR--RDGHRLAFLSYAATAAAGKATTTGTNSSSVNVQAQLENGAGAYNMNISLGT 99

Query: 165 PPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD--PVFDPADSASFSGVSCSSAVCDRLENA 222
           PP    +++D+GS+++W QC PC++C+ +    PV  PA S++FS + C+ + C  L  +
Sbjct: 100 PPLDFPVIVDTGSNLIWAQCAPCTRCFPRPTPAPVLQPARSSTFSRLPCNGSFCQYLPTS 159

Query: 223 G----CHA-GRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAG 277
                C+A   C Y  +YG G YT G LA ETLT+G      VA GC  +N      ++G
Sbjct: 160 SRPRTCNATAACAYNYTYGSG-YTAGYLATETLTVGDGTFPKVAFGCSTENG--VDNSSG 216

Query: 278 LLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALPVGAAWV---PLVR 333
           ++GLG G +SLV QL     G FSYCL S    G +  ++FG  A     + V   PL++
Sbjct: 217 IVGLGRGPLSLVSQLA---VGRFSYCLRSDMADGGASPILFGSLAKLTERSVVQSTPLLK 273

Query: 334 NP--RAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMG-DDGVVMDTGTAVTRLPTPAYEA 390
           NP  +  + YYV L+G+ V    +P++   F  TQ G   G ++D+GT +T L    Y  
Sbjct: 274 NPYLQRSTHYYVNLTGIAVDSTELPVTGSTFGFTQTGLGGGTIVDSGTTLTYLAKDGYAM 333

Query: 391 FRDAFVAQTGNL----PRASGVSIFDTCYNLS---GFVSVRVPTVSFYFSGGPVLTLPAS 443
            + AF +Q  NL    P +      D CY  S   G  +VRVP ++  F+GG    +P  
Sbjct: 334 VKQAFQSQMANLNQTTPASGAPYDLDLCYKPSAGGGGKAVRVPRLALRFAGGAKYNVPVQ 393

Query: 444 NFL--IPVDDAGTF---CFAFAPSPSGL--SIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           N+   +  D  G     C    P+   L  SIIGN+ Q  + + +D   G   F P  C
Sbjct: 394 NYFAGVEADSQGRVTVACLLVLPATDDLPISIIGNLMQMDMHLLYDIDGGMFSFAPADC 452


>gi|115452683|ref|NP_001049942.1| Os03g0317300 [Oryza sativa Japonica Group]
 gi|108707833|gb|ABF95628.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113548413|dbj|BAF11856.1| Os03g0317300 [Oryza sativa Japonica Group]
          Length = 448

 Score =  189 bits (479), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 137/424 (32%), Positives = 203/424 (47%), Gaps = 43/424 (10%)

Query: 106 HSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGS-------GEYFV 158
           H+ HA   R +     L+RR++      +   +   G    + MD GS        EY V
Sbjct: 31  HATHADAGRGLS-TRELLRRMAARSKARSARLLS--GRAASARMDPGSYTDGVPDTEYLV 87

Query: 159 RIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDR 218
            + +G+PP+   +++D+GSD+ W QC PC  C++QS P F+P+ S +FS + C   +C  
Sbjct: 88  HMAIGTPPQPVQLILDTGSDLTWTQCAPCVSCFRQSLPRFNPSRSMTFSVLPCDLRICRD 147

Query: 219 LENAGCHA-----GRCRYEVSYGDGSYTKGTLALETLT-------IGRTVVKNVAIGCGH 266
           L  + C       G C Y  +Y D S T G L  +T +       IG   V ++  GCG 
Sbjct: 148 LTWSSCGEQSWGNGICVYAYAYADHSITTGHLDSDTFSFASADHAIGGASVPDLTFGCGL 207

Query: 267 KNQGMFV-GAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVF-------G 318
            N G+FV    G+ G   G++S+  QL       FSYC  +  TGS  S VF        
Sbjct: 208 FNNGIFVSNETGIAGFSRGALSMPAQLKVDN---FSYCFTAI-TGSEPSPVFLGVPPNLY 263

Query: 319 REALPVGAAWV---PLVR-NPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVM 374
            +A   G   V    L+R +      YY+ L G+ VG  R+PI E +F L + G  G ++
Sbjct: 264 SDAAGGGHGVVQSTALIRYHSSQLKAYYISLKGVTVGTTRLPIPESVFALKEDGTGGTIV 323

Query: 375 DTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSG 434
           D+GT +T LP   Y    DAFVAQT      S  S+   C+++       VP +  +F G
Sbjct: 324 DSGTGMTMLPEAVYNLVCDAFVAQTKLTVHNSTSSLSQLCFSVPPGAKPDVPALVLHFEG 383

Query: 435 GPVLTLPASNFLIPVDDAGTF---CFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFG 491
              L LP  N++  +++AG     C A   +   LS+IGN QQ+ + + +D AN  + F 
Sbjct: 384 A-TLDLPRENYMFEIEEAGGIRLTCLAIN-AGEDLSVIGNFQQQNMHVLYDLANDMLSFV 441

Query: 492 PNVC 495
           P  C
Sbjct: 442 PARC 445


>gi|343198386|gb|AEM05966.1| nodulin 41 [Phaseolus vulgaris]
          Length = 437

 Score =  189 bits (479), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 122/360 (33%), Positives = 177/360 (49%), Gaps = 29/360 (8%)

Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
           +GEY +R  +G+PP  +    D+GSD++WVQC PC+ C+ QS P+F P  S++F   +C 
Sbjct: 87  NGEYLMRFYIGTPPVERLATADTGSDLIWVQCSPCASCFPQSTPLFQPLKSSTFMPTTCR 146

Query: 213 SAVCDRL--ENAGC-HAGRCRYEVSYGDG-SYTKGTLALETLT------IGRTVVKNVAI 262
           S  C  L  E  GC  +G C Y   YGD  S+++G L+ ETL       +      N   
Sbjct: 147 SQPCTLLLPEQKGCGKSGECIYTYKYGDQYSFSEGLLSTETLRFDSQGGVQTVAFPNSFF 206

Query: 263 GCG-HKNQGMF--VGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGR 319
           GCG + N  +F      G++GLG G +SLV Q+G Q G  FSYCL+  G+ S+  L FG 
Sbjct: 207 GCGLYNNITVFPSYKLTGIMGLGAGPLSLVSQIGDQIGHKFSYCLLPLGSTSTSKLKFGN 266

Query: 320 EALPVGAAWV--PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTG 377
           E++  G   V  P++  P  P++Y++ L  + V    +P        T   D  V++D+G
Sbjct: 267 ESIITGEGVVSTPMIIKPWLPTYYFLNLEAVTVAQKTVP--------TGSTDGNVIIDSG 318

Query: 378 TAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNL-SGFVSVRVPTVSFYFSGGP 436
           T +T L    Y  F  +             +S    C+     FV    P ++F F+G  
Sbjct: 319 TLLTYLGESFYYNFAASLQESLAVELVQDVLSPLPFCFPYRDNFV---FPEIAFQFTGAR 375

Query: 437 VLTLPASNFLIPVDDAGTFCFAFAPSP-SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           V   PA N  +  +D  T C   APS  SG+SI G+  Q   Q+ +D     V F P  C
Sbjct: 376 VSLKPA-NLFVMTEDRNTVCLMIAPSSVSGISIFGSFSQIDFQVEYDLEGKKVSFQPTDC 434


>gi|242050430|ref|XP_002462959.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
 gi|241926336|gb|EER99480.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
          Length = 448

 Score =  189 bits (479), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 130/367 (35%), Positives = 193/367 (52%), Gaps = 36/367 (9%)

Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS--QCYKQSDPVFDPADSASFSGVSC 211
           GEY + + +G+PP S   + D+GSD++W QC PCS  QC+ Q  P+++PA S +F  + C
Sbjct: 90  GEYLMTLSIGTPPLSYPAIADTGSDLIWTQCAPCSGDQCFAQPAPLYNPASSTTFGVLPC 149

Query: 212 SSAVCDRLENAGCHAGR-------CRYEVSYGDGSYTKGTLALETLTIGRTV-----VKN 259
           +S++      AG  AG+       C Y  +YG G +T G    ET T G        V  
Sbjct: 150 NSSLS---MCAGVLAGKAPPPGCACMYNQTYGTG-WTAGVQGSETFTFGSAAADQARVPG 205

Query: 260 VAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFG 318
           +A GC + +   + G+AGL+GLG GS+SLV QLG    G FSYCL   + T S+ +L+ G
Sbjct: 206 IAFGCSNASSSDWNGSAGLVGLGRGSLSLVSQLG---AGRFSYCLTPFQDTNSTSTLLLG 262

Query: 319 REALP--VGAAWVPLVRNP-RAP--SFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVV 373
             A     G    P V +P +AP  ++YY+ L+G+ +G   + IS D F L   G  G++
Sbjct: 263 PSAALNGTGVRSTPFVASPAKAPMSTYYYLNLTGISLGAKALSISPDAFSLKADGTGGLI 322

Query: 374 MDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSI--FDTCYNLSGFVSV--RVPTVS 429
           +D+GT +T L   AY+  R A V     LP   G      D CY L    S    +P+++
Sbjct: 323 IDSGTTITSLVNAAYQQVRAA-VQSLVTLPAIDGSDSTGLDLCYALPTPTSAPPAMPSMT 381

Query: 430 FYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSG-LSIIGNIQQEGIQISFDGANGFV 488
            +F G  ++ LPA +++I    +G +C A      G +S  GN QQ+ + I +D  N  +
Sbjct: 382 LHFDGADMV-LPADSYMI--SGSGVWCLAMRNQTDGAMSTFGNYQQQNMHILYDVRNEML 438

Query: 489 GFGPNVC 495
            F P  C
Sbjct: 439 SFAPAKC 445


>gi|359483137|ref|XP_002272278.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 402

 Score =  189 bits (479), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 101/282 (35%), Positives = 154/282 (54%), Gaps = 18/282 (6%)

Query: 223 GCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLG 282
           G  A  C Y ++YGDGS+T+G L  E L  G  +VK+   GCG  N+G+F G +GL+GLG
Sbjct: 127 GSAAPICNYAINYGDGSFTRGELGHEKLKFGTILVKDFIFGCGRNNKGLFGGVSGLMGLG 186

Query: 283 GGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFG------REALPVGAAWVPLVRNPR 336
              +SL+ Q  G  GG FSYCL S     SGSL+ G      R + P+  ++  ++ NP+
Sbjct: 187 RSDLSLISQTSGIFGGVFSYCLPSTERKGSGSLILGGNSSVYRNSSPI--SYAKMIENPQ 244

Query: 337 APSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFV 396
             +FY++ L+G+ +GG+ +       +   +G   +++D+GT +TRLP   Y+A +  F+
Sbjct: 245 LYNFYFINLTGISIGGVAL-------QAPSVGPSRILVDSGTVITRLPPTIYKALKAEFL 297

Query: 397 AQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASN-FLIPVDDAGTF 455
            Q    P A   SI DTC+NLS +  V +PT+  +F G   LT+  +  F     DA   
Sbjct: 298 KQFTGFPPAPAFSILDTCFNLSAYQEVDIPTIKMHFEGNAELTVDVTGVFYFVKSDASQV 357

Query: 456 CFAFA--PSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           C A A       ++I+GN QQ+ +++ +D     VGF    C
Sbjct: 358 CLALASLEYQDEVAILGNYQQKNLRVIYDTKETKVGFALETC 399


>gi|356540371|ref|XP_003538663.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 374

 Score =  188 bits (478), Expect = 5e-45,   Method: Compositional matrix adjust.
 Identities = 121/356 (33%), Positives = 175/356 (49%), Gaps = 22/356 (6%)

Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSS 213
           G Y + + +G+PP   Y + D+GSD+ W  C PC++CYKQ +P+FDP  S S+  +SC S
Sbjct: 23  GHYLMEVSIGTPPFKIYGIADTGSDLTWTSCVPCNKCYKQRNPIFDPQKSTSYRNISCDS 82

Query: 214 AVCDRLENAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRTV-----VKNVAIGCGHK 267
            +C +L+   C   + C Y  +Y   + T+G LA ET+T+  T      +K +  GCGH 
Sbjct: 83  KLCHKLDTGVCSPQKHCNYTYAYASAAITQGVLAQETITLSSTKGESVPLKGIVFGCGHN 142

Query: 268 NQGMFVG-AAGLLGLGGGSMSLVGQLGGQTGGA-FSYCLVSRGT----GSSGSLVFGREA 321
           N G F     G++GLGGG +S + Q+G   GG  FS CLV   T     S  SL  G E 
Sbjct: 143 NTGGFNDREMGIIGLGGGPVSFISQIGSSFGGKRFSQCLVPFHTDVSVSSKMSLGKGSEV 202

Query: 322 LPVGAAWVPLV-RNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAV 380
              G    PLV +  + P  Y+V L G+ VG   +  +    +  + G+  V +D+GT  
Sbjct: 203 SGKGVVSTPLVAKQDKTP--YFVTLLGISVGNTYLHFNGSSSQSVEKGN--VFLDSGTPP 258

Query: 381 TRLPTPAYEAFRDAFVAQTGNLPRASGVSI-FDTCYNLSGFVSVRVPTVSFYFSGGPVLT 439
           T LPT  Y+       ++    P  + + +    CY      ++R P ++ +F GG V  
Sbjct: 259 TILPTQLYDRLVAQVRSEVAMKPVTNDLDLGPQLCYRTKN--NLRGPVLTAHFEGGDVKL 316

Query: 440 LPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           LP   F+ P D  G FC  F  + S   + GN  Q    I FD     V F P  C
Sbjct: 317 LPTQTFVSPKD--GVFCLGFTNTSSDGGVYGNFAQSNYLIGFDLDRQVVSFKPMDC 370


>gi|302758122|ref|XP_002962484.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
 gi|300169345|gb|EFJ35947.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
          Length = 395

 Score =  188 bits (478), Expect = 5e-45,   Method: Compositional matrix adjust.
 Identities = 139/397 (35%), Positives = 207/397 (52%), Gaps = 43/397 (10%)

Query: 138 VQDF-GTD------VVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQC 190
           +QDF G D      +VSG   GSG+YFV + VG+P +   ++ID+GSD+ W+QC P +  
Sbjct: 2   IQDFQGEDPALFSRLVSGSSIGSGQYFVELRVGTPAKKFPLIIDTGSDLTWIQCNPPNTT 61

Query: 191 YKQSD---PVFDPADSASFSGVSCSSAVCDRLE---NAGC---HAGRCRYEVSYGDGSYT 241
              S    P +D + S+S+  + C+   C  L     + C       C Y   Y D S T
Sbjct: 62  ANSSSPPAPWYDKSSSSSYREIPCTDDECLFLPAPIGSSCSIKSPSPCDYTYGYSDQSRT 121

Query: 242 KGTLALETLTIG--------------RTV-VKNVAIGCGHKNQGM-FVGAAGLLGLGGGS 285
            G LA ET+++               RT+ +KNVA+GC  ++ G  F+GA+G+LGLG G 
Sbjct: 122 TGILAYETISMKSRKRSGKRAGNHKTRTIRIKNVALGCSRESVGASFLGASGVLGLGQGP 181

Query: 286 MSLVGQLGGQT-GGAFSYCLVS--RGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYY 342
           +SL  Q      GG FSYCLV   RG+ +S  LV GR       A  P+VRNP A SFYY
Sbjct: 182 ISLATQTRHTALGGIFSYCLVDYLRGSNASSFLVMGRTRW-RKLAHTPIVRNPAAQSFYY 240

Query: 343 VGLSGLGVGGMRIP-ISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGN 401
           V ++G+ V G  +  I+   + +   G+ G + D+GT ++ L  PAY     A  A    
Sbjct: 241 VNVTGVAVDGKPVDGIASSDWGIDGDGNKGTIFDSGTTLSYLREPAYSKVLGALNASI-Y 299

Query: 402 LPRASGVSI-FDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFA 460
           LPRA  +   F+ CYN++  +   +P +   F GG V+ LP +N+++ V +    C A  
Sbjct: 300 LPRAQEIPEGFELCYNVTR-MEKGMPKLGVEFQGGAVMELPWNNYMVLVAE-NVQCVALQ 357

Query: 461 P--SPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
              + +G +I+GN+ Q+   I +D A   +GF  + C
Sbjct: 358 KVTTTNGSNILGNLLQQDHHIEYDLAKARIGFKWSPC 394


>gi|125586057|gb|EAZ26721.1| hypothetical protein OsJ_10629 [Oryza sativa Japonica Group]
          Length = 474

 Score =  188 bits (478), Expect = 5e-45,   Method: Compositional matrix adjust.
 Identities = 137/424 (32%), Positives = 203/424 (47%), Gaps = 43/424 (10%)

Query: 106 HSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGS-------GEYFV 158
           H+ HA   R +     L+RR++      +   +   G    + MD GS        EY V
Sbjct: 57  HATHADAGRGLS-TRELLRRMAARSKARSARLLS--GRAASARMDPGSYTDGVPDTEYLV 113

Query: 159 RIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDR 218
            + +G+PP+   +++D+GSD+ W QC PC  C++QS P F+P+ S +FS + C   +C  
Sbjct: 114 HMAIGTPPQPVQLILDTGSDLTWTQCAPCVSCFRQSLPRFNPSRSMTFSVLPCDLRICRD 173

Query: 219 LENAGCHA-----GRCRYEVSYGDGSYTKGTLALETLT-------IGRTVVKNVAIGCGH 266
           L  + C       G C Y  +Y D S T G L  +T +       IG   V ++  GCG 
Sbjct: 174 LTWSSCGEQSWGNGICVYAYAYADHSITTGHLDSDTFSFASADHAIGGASVPDLTFGCGL 233

Query: 267 KNQGMFV-GAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVF-------G 318
            N G+FV    G+ G   G++S+  QL       FSYC  +  TGS  S VF        
Sbjct: 234 FNNGIFVSNETGIAGFSRGALSMPAQLKVDN---FSYCFTAI-TGSEPSPVFLGVPPNLY 289

Query: 319 REALPVGAAWV---PLVR-NPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVM 374
            +A   G   V    L+R +      YY+ L G+ VG  R+PI E +F L + G  G ++
Sbjct: 290 SDAAGGGHGVVQSTALIRYHSSQLKAYYISLKGVTVGTTRLPIPESVFALKEDGTGGTIV 349

Query: 375 DTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSG 434
           D+GT +T LP   Y    DAFVAQT      S  S+   C+++       VP +  +F G
Sbjct: 350 DSGTGMTMLPEAVYNLVCDAFVAQTKLTVHNSTSSLSQLCFSVPPGAKPDVPALVLHFEG 409

Query: 435 GPVLTLPASNFLIPVDDAGTF---CFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFG 491
              L LP  N++  +++AG     C A   +   LS+IGN QQ+ + + +D AN  + F 
Sbjct: 410 A-TLDLPRENYMFEIEEAGGIRLTCLAIN-AGEDLSVIGNFQQQNMHVLYDLANDMLSFV 467

Query: 492 PNVC 495
           P  C
Sbjct: 468 PARC 471


>gi|79315693|ref|NP_001030891.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332646353|gb|AEE79874.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 499

 Score =  188 bits (478), Expect = 5e-45,   Method: Compositional matrix adjust.
 Identities = 133/433 (30%), Positives = 204/433 (47%), Gaps = 55/433 (12%)

Query: 81  LELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQD 140
           LEL  RD ++        +    +Q++   + +++ K V T     +   A + + +   
Sbjct: 101 LELQIRD-LTRIQTLHKRVLEKNNQNTVSQKQKKNDKEVVT-----TTPVASSVEEQAGQ 154

Query: 141 FGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDP 200
               + SGM  GSGEYF+ + VGSPP+   +++D+GSD+ W+QC PC  C++Q+D     
Sbjct: 155 LVATLESGMTLGSGEYFMDVLVGSPPKHFSLILDTGSDLNWIQCLPCYDCFQQND----- 209

Query: 201 ADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTV---- 256
                                       C Y   YGD S T G  A+ET T+  T     
Sbjct: 210 -------------------------NQSCPYYYWYGDSSNTTGDFAVETFTVNLTTNGGS 244

Query: 257 -----VKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRG--T 309
                V+N+  GCGH N+G+F GAAGLLGLG G +S   QL    G +FSYCLV R   T
Sbjct: 245 SELYNVENMMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDT 304

Query: 310 GSSGSLVFGREALPVGAAWVPLV-----RNPRAPSFYYVGLSGLGVGGMRIPISEDLFRL 364
             S  L+FG +   +    +        +     +FYYV +  + V G  + I E+ + +
Sbjct: 305 NVSSKLIFGEDKDLLSHPNLNFTSFVAGKENLVDTFYYVQIKSILVAGEVLNIPEETWNI 364

Query: 365 TQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQ-TGNLPRASGVSIFDTCYNLSGFVSV 423
           +  G  G ++D+GT ++    PAYE  ++    +  G  P      I D C+N+SG  +V
Sbjct: 365 SSDGAGGTIIDSGTTLSYFAEPAYEFIKNKIAEKAKGKYPVYRDFPILDPCFNVSGIHNV 424

Query: 424 RVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSP-SGLSIIGNIQQEGIQISFD 482
           ++P +   F+ G V   P  N  I +++    C A   +P S  SIIGN QQ+   I +D
Sbjct: 425 QLPELGIAFADGAVWNFPTENSFIWLNE-DLVCLAMLGTPKSAFSIIGNYQQQNFHILYD 483

Query: 483 GANGFVGFGPNVC 495
                +G+ P  C
Sbjct: 484 TKRSRLGYAPTKC 496


>gi|125555056|gb|EAZ00662.1| hypothetical protein OsI_22683 [Oryza sativa Indica Group]
          Length = 491

 Score =  188 bits (478), Expect = 5e-45,   Method: Compositional matrix adjust.
 Identities = 134/354 (37%), Positives = 185/354 (52%), Gaps = 23/354 (6%)

Query: 155 EYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS---QCYKQSDPVFDPADSASFSGVSC 211
           E+ V +G+G+P +   ++ D+GSD+ WVQCQPC     C+ Q DP+FDP+ S++++ V C
Sbjct: 148 EFVVAVGLGTPAQPSALIFDTGSDLSWVQCQPCGSSGHCHPQQDPLFDPSKSSTYAAVHC 207

Query: 212 SSAVCDRLENAG--CHAGR--CRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGH 266
               C     AG  C      C Y V YGDGS T G L+ +TL +  +  +     GCG 
Sbjct: 208 GEPQC---AAAGGLCSEDNTTCLYLVHYGDGSSTTGVLSRDTLALTSSRALAGFPFGCGT 264

Query: 267 KNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFG-REALPVG 325
           +N G F    GLLGLG G +SL  Q     G  FSYCL S  + ++G L  G   A   G
Sbjct: 265 RNLGDFGRVDGLLGLGRGELSLPSQAAASFGAVFSYCLPSSNS-TTGYLTIGATPATDTG 323

Query: 326 AA-WVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLP 384
           AA +  ++R P+ PSFY+V L  + +GG  +P+   +F        G ++D+GT +T LP
Sbjct: 324 AAQYTAMLRKPQFPSFYFVELVSIDIGGYILPVPPAVFT-----RGGTLLDSGTVLTYLP 378

Query: 385 TPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASN 444
             AYE  RD F         A    + D CY+ +G   V VP VSF F  G V  L    
Sbjct: 379 AQAYELLRDRFRLTMERYTPAPPNDVLDACYDFAGESEVIVPAVSFRFGDGAVFELDFFG 438

Query: 445 FLIPVDDAGTFCFAFAPSPSG---LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
            +I +D+    C AFA   +G   LSIIGN QQ   ++ +D A   +GF P  C
Sbjct: 439 VMIFLDE-NVGCLAFAAMDAGGLPLSIIGNTQQRSAEVIYDVAAEKIGFVPASC 491


>gi|125543634|gb|EAY89773.1| hypothetical protein OsI_11315 [Oryza sativa Indica Group]
          Length = 474

 Score =  188 bits (477), Expect = 6e-45,   Method: Compositional matrix adjust.
 Identities = 125/368 (33%), Positives = 183/368 (49%), Gaps = 33/368 (8%)

Query: 155 EYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSA 214
           EY V + +G+PP+   +++D+GSD+ W QC PC  C++QS P F+P+ S +FS + C   
Sbjct: 110 EYLVHMAIGTPPQPVQLILDTGSDLTWTQCAPCVSCFRQSLPRFNPSRSMTFSVLPCDLR 169

Query: 215 VCDRLENAGCHA-----GRCRYEVSYGDGSYTKGTLALETLT-------IGRTVVKNVAI 262
           +C  L  + C       G C Y  +Y D S T G L  +T +       IG   V ++  
Sbjct: 170 ICRDLTWSSCGEQSWGNGICVYAYAYADHSITTGHLDSDTFSFASADHAIGGASVPDLTF 229

Query: 263 GCGHKNQGMFV-GAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVF---- 317
           GCG  N G+FV    G+ G   G++S+  QL       FSYC  +  TGS  S VF    
Sbjct: 230 GCGLFNNGIFVSNETGIAGFSRGALSMPAQLKVDN---FSYCFTAI-TGSEPSPVFLGVP 285

Query: 318 ---GREALPVGAAWV---PLVR-NPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDD 370
                +A   G   V    L+R +      YY+ L G+ VG  R+PI E +F L + G  
Sbjct: 286 PNLYSDAAGGGHGVVQSTALIRYHSSQLKAYYISLKGVTVGTTRLPIPESVFALKEDGTG 345

Query: 371 GVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSF 430
           G ++D+GT +T LP   Y    DAFVAQT      S  S+   C+++       VP +  
Sbjct: 346 GTIVDSGTGMTMLPEAVYNLVCDAFVAQTKLTVHNSTSSLSQLCFSVPPGAKPDVPALVL 405

Query: 431 YFSGGPVLTLPASNFLIPVDDAGTF---CFAFAPSPSGLSIIGNIQQEGIQISFDGANGF 487
           +F G   L LP  N++  +++AG     C A   +   LS+IGN QQ+ + + +D AN  
Sbjct: 406 HFEGA-TLDLPRENYMFEIEEAGGIRLTCLAIN-AGEDLSVIGNFQQQNMHVLYDLANDM 463

Query: 488 VGFGPNVC 495
           + F P  C
Sbjct: 464 LSFVPARC 471


>gi|357486591|ref|XP_003613583.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355514918|gb|AES96541.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 437

 Score =  188 bits (477), Expect = 6e-45,   Method: Compositional matrix adjust.
 Identities = 133/432 (30%), Positives = 201/432 (46%), Gaps = 38/432 (8%)

Query: 79  WNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEV 138
           +++EL+H     S    T   H+ R  ++    M+    RV  L    S          V
Sbjct: 26  FSVELIHPISSKSPFYNTAESHFQRMSNN----MKHSTNRVHYLNHVFSFPPNKVPNIVV 81

Query: 139 QDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVF 198
             F          G G Y +   +G+PP   Y V+D+ +D +W QC PC  C+  + P+F
Sbjct: 82  SPF---------MGDG-YIISFLIGTPPFQLYGVMDTANDNIWFQCNPCKPCFNTTSPMF 131

Query: 199 DPADSASFSGVSCSSAVCDRLENAGCHAGR---CRYEVSYGDGSYTKGTLALETLTIGRT 255
           DP+ S+++  + CSS  C  +EN  C +     C Y  +YG  +Y++G L+++TLT+   
Sbjct: 132 DPSKSSTYKTIPCSSPKCKNVENTHCSSDDKKVCEYSFTYGGEAYSQGDLSIDTLTLNSN 191

Query: 256 -----VVKNVAIGCGHKNQGMFVG-AAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS--R 307
                  KN+ IGCGH+N+G   G  +G +GLG G +S + QL    GG FSYCLV    
Sbjct: 192 NDTPISFKNIVIGCGHRNKGPLEGYVSGNIGLGRGPLSFISQLNSSIGGKFSYCLVPLFS 251

Query: 308 GTGSSGSLVFGREALP--VGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLT 365
             G SG L FG +++   VG    P+         Y   L+ L VG   I       +  
Sbjct: 252 NEGISGKLHFGDKSVVSGVGTVSTPITAGEIG---YSTTLNALSVGDHIIKFENSTSKND 308

Query: 366 QMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRA-SGVSIFDTCYNLSGFVSVR 424
            +G+   ++D+GT +T LP   Y    ++ V     L RA S    F  CY  +   ++ 
Sbjct: 309 NLGN--TIIDSGTTLTILPENVYSRL-ESIVTSMVKLERAKSPNQQFKLCYKAT-LKNLD 364

Query: 425 VPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGL-SIIGNIQQEGIQISFDG 483
           VP ++ +F+G  V  L + N   P+D     CFAF    +   +IIGNI Q+   + FD 
Sbjct: 365 VPIITAHFNGADV-HLNSLNTFYPIDHE-VVCFAFVSVGNFPGTIIGNIAQQNFLVGFDL 422

Query: 484 ANGFVGFGPNVC 495
               + F P  C
Sbjct: 423 QKNIISFKPTDC 434


>gi|242070719|ref|XP_002450636.1| hypothetical protein SORBIDRAFT_05g008470 [Sorghum bicolor]
 gi|241936479|gb|EES09624.1| hypothetical protein SORBIDRAFT_05g008470 [Sorghum bicolor]
          Length = 410

 Score =  188 bits (477), Expect = 6e-45,   Method: Compositional matrix adjust.
 Identities = 139/419 (33%), Positives = 208/419 (49%), Gaps = 44/419 (10%)

Query: 84  VHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGT 143
             R   ++ + T   ++  R  H  H       +R++ L  RL    + +A+  +Q    
Sbjct: 26  ARRSFRATMTRTEPAINLTRAAHKSH-------QRLSMLAARLDDAASGSAQTPLQ---- 74

Query: 144 DVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADS 203
                +D G G Y +   +G+PP+    + D+GSD++W +C  C++C  Q  P + P  S
Sbjct: 75  -----LDSGGGAYDMTFSIGTPPQELSALADTGSDLIWAKCGACTRCVPQGSPSYYPNKS 129

Query: 204 ASFSGVSCSSAVCDRLENAGCHAG--RCRYEVSYGDGS----YTKGTLALETLTIGRTVV 257
           +SFS + CS ++C  L ++ C AG   C Y+ SYG  S    YT+G L  ET T+G   V
Sbjct: 130 SSFSKLPCSGSLCSDLPSSQCSAGGAECDYKYSYGLASDPHHYTQGYLGSETFTLGSDAV 189

Query: 258 KNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVF 317
             +  GC   ++G +   +GL+GLG G +SLV QL     GAFSYCL S    +S  L+F
Sbjct: 190 PGIGFGCTTMSEGGYGSGSGLVGLGRGPLSLVSQL---NVGAFSYCLTSDAAKTS-PLLF 245

Query: 318 GREALP-VGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDT 376
           G  AL   G    PL+R   +  +Y V L  + +G                G  G++ D+
Sbjct: 246 GSGALTGAGVQSTPLLRT--STYYYTVNLESISIGAATT---------AGTGSSGIIFDS 294

Query: 377 GTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGP 436
           GT V  L  PAY   ++A ++QT NL  ASG   ++ C+  SG V    P++  +F GG 
Sbjct: 295 GTTVAFLAEPAYTLAKEAVLSQTTNLTMASGRDGYEVCFQTSGAV---FPSMVLHFDGGD 351

Query: 437 VLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
            + LP  N+   VDD+ + C+    SPS LSI+GNI Q    I +D     + F P  C
Sbjct: 352 -MDLPTENYFGAVDDSVS-CWIVQKSPS-LSIVGNIMQMNYHIRYDVEKSMLSFQPANC 407


>gi|356555042|ref|XP_003545848.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 431

 Score =  187 bits (476), Expect = 8e-45,   Method: Compositional matrix adjust.
 Identities = 139/435 (31%), Positives = 200/435 (45%), Gaps = 51/435 (11%)

Query: 79  WNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKH-- 136
           +++E++HRD   S                F+   +   +RV   VRR      + A H  
Sbjct: 27  FSVEIIHRDSSRSP---------------FYRATETQFQRVTNAVRR----SMNRANHFN 67

Query: 137 EVQDFGTDVVSGMDQ-GSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD 195
           ++  +   V S +     G+Y +   +G+PP   Y ++D+ SDI+WVQCQ C  CY  + 
Sbjct: 68  QISVYSNAVESPVTLLDDGDYLMSYSLGTPPFPVYGIVDTASDIIWVQCQLCETCYNDTS 127

Query: 196 PVFDPADSASFSGVSCSSAVCDRLENAGCHAGR---CRYEVSYGDGSYTKGTLALETLTI 252
           P+FDP+ S ++  + CSS  C  ++   C +     C + V+Y DGS+++G L +ET+T+
Sbjct: 128 PMFDPSYSKTYKNLPCSSTTCKSVQGTSCSSDERKICEHTVNYKDGSHSQGDLIVETVTL 187

Query: 253 G----------RTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSY 302
           G          RTV     IGC  +N  +   + G++GLGGG +SLV QL       FSY
Sbjct: 188 GSYNDPFVHFPRTV-----IGC-IRNTNVSFDSIGIVGLGGGPVSLVPQLSSSISKKFSY 241

Query: 303 CLVSRGTGSSGSLVFGREALPVGAAWVPL-VRNPRAPSFYYVGLSGLGVGGMRIPISEDL 361
           CL      SS  L FG  A+  G   V   +       FYY+ L    VG  RI      
Sbjct: 242 CLAPISDRSS-KLKFGDAAMVSGDGTVSTRIVFKDWKKFYYLTLEAFSVGNNRIEFRSS- 299

Query: 362 FRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASG-VSIFDTCYNLSGF 420
                 G   +++D+GT  T LP   Y     A VA    L RA   +  F  CY  S +
Sbjct: 300 -SSRSSGKGNIIIDSGTTFTVLPDDVYSKLESA-VADVVKLERAEDPLKQFSLCYK-STY 356

Query: 421 VSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQIS 480
             V VP ++ +FSG  V  L A N  I V      C AF  S SG +I GN+ Q+   + 
Sbjct: 357 DKVDVPVITAHFSGADV-KLNALNTFI-VASHRVVCLAFLSSQSG-AIFGNLAQQNFLVG 413

Query: 481 FDGANGFVGFGPNVC 495
           +D     V F P  C
Sbjct: 414 YDLQRKIVSFKPTDC 428


>gi|356528671|ref|XP_003532923.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 440

 Score =  187 bits (476), Expect = 9e-45,   Method: Compositional matrix adjust.
 Identities = 137/454 (30%), Positives = 209/454 (46%), Gaps = 52/454 (11%)

Query: 64  HNNISSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLV 123
           H+  S +  S   + +++ L+HR+   S     +     R +++      R  +R+    
Sbjct: 14  HSIASFAEASKTLSGFSINLIHRESPLSPFYNPSLTPSERIKNTVLRSFARSKRRL---- 69

Query: 124 RRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQ 183
            RLS         +  D     ++  D+   EY +R  +G+PP  ++ + D+GSD++WVQ
Sbjct: 70  -RLS---------QNDDRSPGTITIPDEPITEYLMRFYIGTPPVERFAIADTGSDLIWVQ 119

Query: 184 CQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENA--GC--HAGRCRYEVSYGDGS 239
           C PC +C  Q+ P+FDP  S++F  V C S  C  L  +   C   +G+C Y+  YGD +
Sbjct: 120 CAPCEKCVPQNAPLFDPRKSSTFKTVPCDSQPCTLLPPSQRACVGKSGQCYYQYIYGDHT 179

Query: 240 YTKGTLALETLTIGRTVVKNVAI-------GCGHKNQGMFVGAA---GLLGLGGGSMSLV 289
              G L  E++  G    KN AI       GC   N      +    GL+GLG G +SL+
Sbjct: 180 LVSGILGFESINFGS---KNNAIKFPKLTFGCTFSNNDTVDESKRNMGLVGLGVGPLSLI 236

Query: 290 GQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALP---VGAAWVPLVRNPRAPSFYYVGLS 346
            QLG Q G  FSYC     + S+  + FG +A+     G    PL+     PS+YY+ L 
Sbjct: 237 SQLGYQIGRKFSYCFPPLSSNSTSKMRFGNDAIVKQIKGVVSTPLIIKSIGPSYYYLNLE 296

Query: 347 GLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRAS 406
           G+ +G  ++  SE         D  +++D+GT+ T L     ++F + FVA    +    
Sbjct: 297 GVSIGNKKVKTSE------SQTDGNILIDSGTSFTILK----QSFYNKFVALVKEVYGVE 346

Query: 407 GVSI----FDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAP- 461
            V I    ++ C+   G    R P V F F+G  V  + ASN L   +D    C    P 
Sbjct: 347 AVKIPPLVYNFCFENKG-KRKRFPDVVFLFTGAKV-RVDASN-LFEAEDNNLLCMVALPT 403

Query: 462 SPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           S    SI GN  Q G Q+ +D   G V F P  C
Sbjct: 404 SDEDDSIFGNHAQIGYQVEYDLQGGMVSFAPADC 437


>gi|225427552|ref|XP_002266498.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
          Length = 441

 Score =  187 bits (475), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 136/432 (31%), Positives = 210/432 (48%), Gaps = 40/432 (9%)

Query: 79  WNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEV 138
           ++++L+HRD   S     +     R   +FH    R   RV     R S   +D  +   
Sbjct: 32  FSVDLIHRDSPHSPFFDPSKTRTERLTDAFH----RSASRVGRF--RQSAMTSDGIQ--- 82

Query: 139 QDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVF 198
                   S +   +GEY + + +G+PP     ++D+GSD+ W QC+PC+ CYKQ  P F
Sbjct: 83  --------SRLVPSAGEYIMNLSIGTPPVPVIAIVDTGSDLTWTQCRPCTHCYKQVVPFF 134

Query: 199 DPADSASFSGVSCSSAVCDRLEN-AGCHAG-RCRYEVSYGDGSYTKGTLALETLTIGRTV 256
           DP +S+++   SC ++ C  L N   C  G +C +  SY DGS+T G LA+ETLT+  T 
Sbjct: 135 DPKNSSTYRDSSCGTSFCLALGNDRSCRNGKKCTFMYSYADGSFTGGNLAVETLTVASTA 194

Query: 257 VKNV-----AIGCGHKNQGMF-VGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTG 310
            K V     A GC H++ G+F   ++G++GLG   +S++ QL     G FSYCL+   T 
Sbjct: 195 GKPVSFPGFAFGCVHRSGGIFDEHSSGIVGLGVAELSMISQLKSTINGRFSYCLLPVFTD 254

Query: 311 SSGS--LVFGREALPVGAAWV--PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQ 366
           SS S  + FGR  +  GA  V  PLV       +Y + L G  VG  R+   +   +  +
Sbjct: 255 SSMSSRINFGRSGIVSGAGTVSTPLVMKGPDTYYYLITLEGFSVGKKRLSY-KGFSKKAE 313

Query: 367 MGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRA---SGVSIFDTCYNLSGFVSV 423
           + +  +++D+GT  T LP   Y    ++ VA +    R    +G+S    CYN +    +
Sbjct: 314 VEEGNIIVDSGTTYTYLPLEFYVKLEES-VAHSIKGKRVRDPNGIS--SLCYNTT-VDQI 369

Query: 424 RVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDG 483
             P ++ +F    V   P + FL   +D    CF   P+ S + I+GN+ Q    + FD 
Sbjct: 370 DAPIITAHFKDANVELQPWNTFLRMQEDL--VCFTVLPT-SDIGILGNLAQVNFLVGFDL 426

Query: 484 ANGFVGFGPNVC 495
               V F    C
Sbjct: 427 RKKRVSFKAADC 438


>gi|242050428|ref|XP_002462958.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
 gi|241926335|gb|EER99479.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
          Length = 460

 Score =  187 bits (475), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 130/367 (35%), Positives = 186/367 (50%), Gaps = 30/367 (8%)

Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQ-PCSQCYKQSDPVFDPADSASFSGVSC 211
           +  Y V   +G+PP +   V+D+GSD++W QC  PC +C+ Q  P++ PA S +++ VSC
Sbjct: 97  TATYLVDFAIGTPPLALSAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSVTYANVSC 156

Query: 212 SSAVCDRLEN-------------AGCHAGRCRYEVSYGDGSYTKGTLALETLTIGR-TVV 257
            S +CD L +                  G C Y  SYGDGS T G LA ET T G  T V
Sbjct: 157 GSRLCDALPSLRPSSRCSASASAPAPERGGCTYYYSYGDGSSTDGVLATETFTFGAGTTV 216

Query: 258 KNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLV 316
            ++A GCG  N G    ++GL+G+G G +SLV QLG      FSYC      T +S  L 
Sbjct: 217 HDLAFGCGTDNLGGTDNSSGLVGMGRGPLSLVSQLGVTK---FSYCFTPFNDTTTSSPLF 273

Query: 317 FGREA-LPVGAAWVPLVRNPRAP---SFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGV 372
            G  A L   A   P V +P  P   S+YY+ L G+ VG   +PI   +FRLT  G  G+
Sbjct: 274 LGSSASLSPAAKSTPFVPSPSGPRRSSYYYLSLEGITVGDTLLPIDPAVFRLTASGRGGL 333

Query: 373 VMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSI-FDTCYNLS---GFVSVRVPTV 428
           ++D+GT  T L   A+     A  A+   LP ASG  +    C+      G  +V VP +
Sbjct: 334 IIDSGTTFTALEERAFVVLARAVAARV-ALPLASGAHLGLSVCFAAPQGRGPEAVDVPRL 392

Query: 429 SFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFV 488
             +F G   + LP S+ ++    AG  C     S  G+S++G++QQ+ + + +D     +
Sbjct: 393 VLHFDGAD-MELPRSSAVVEDRVAGVACLGIV-SARGMSVLGSMQQQNMHVRYDVGRDVL 450

Query: 489 GFGPNVC 495
            F P  C
Sbjct: 451 SFEPANC 457


>gi|115467014|ref|NP_001057106.1| Os06g0209100 [Oryza sativa Japonica Group]
 gi|51091210|dbj|BAD35903.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
 gi|113595146|dbj|BAF19020.1| Os06g0209100 [Oryza sativa Japonica Group]
 gi|125554496|gb|EAZ00102.1| hypothetical protein OsI_22105 [Oryza sativa Indica Group]
          Length = 454

 Score =  187 bits (474), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 125/383 (32%), Positives = 179/383 (46%), Gaps = 42/383 (10%)

Query: 145 VVSGMDQGSG----EYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQ-SDPVFD 199
           V +G+  G G    EY + + VG+PPR   + +D+GSD+VW QC PC  C++Q + PV D
Sbjct: 75  VRAGLGAGGGIVTNEYLMHVSVGTPPRPVALTLDTGSDLVWTQCAPCLDCFEQGAAPVLD 134

Query: 200 PADSASFSGVSCSSAVCDRLENAGCHAGR------CRYEVSYGDGSYTKGTLALETLTI- 252
           PA S++ + + C + +C  L    C  GR      C Y   YGD S T G LA ++ T  
Sbjct: 135 PAASSTHAALPCDAPLCRALPFTSC-GGRSWGDRSCVYVYHYGDRSLTVGQLATDSFTFG 193

Query: 253 -----GRTVVKNVAIGCGHKNQGMF-VGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS 306
                G    + V  GCGH N+G+F     G+ G G G  SL  QL   +   FSYC  S
Sbjct: 194 GDDNAGGLAARRVTFGCGHINKGIFQANETGIAGFGRGRWSLPSQLNVTS---FSYCFTS 250

Query: 307 R-GTGSSGSLVFGREALPV----------GAAWVPLVRNPRAPSFYYVGLSGLGVGGMRI 355
              T SS  +  G  A  +                L++NP  PS Y+V L G+ VGG R+
Sbjct: 251 MFDTKSSSVVTLGAAAAELLHTHHAAHTGDVRTTRLIKNPSQPSLYFVPLRGISVGGARV 310

Query: 356 PISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCY 415
            + E   R         ++D+G ++T LP   YEA +  FV+Q G    A+G +  D C+
Sbjct: 311 AVPESRLR------SSTIIDSGASITTLPEDVYEAVKAEFVSQVGLPAAAAGSAALDLCF 364

Query: 416 NLSGFVSVR---VPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNI 472
            L      R   VP ++ +  GG    LP  N++     A   C     +     +IGN 
Sbjct: 365 ALPVAALWRRPAVPALTLHLDGGADWELPRGNYVFEDYAARVLCVVLDAAAGEQVVIGNY 424

Query: 473 QQEGIQISFDGANGFVGFGPNVC 495
           QQ+   + +D  N  + F P  C
Sbjct: 425 QQQNTHVVYDLENDVLSFAPARC 447


>gi|449462551|ref|XP_004149004.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
 gi|449515029|ref|XP_004164552.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 434

 Score =  186 bits (473), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 137/396 (34%), Positives = 210/396 (53%), Gaps = 31/396 (7%)

Query: 112 MQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYM 171
            +R   R ATL+  L+       +           S +   SGE+ + I +G+PP +   
Sbjct: 57  FRRSFSRSATLLTHLTSVSTACIR-----------SPIIPDSGEFLMSIFIGTPPVNVIA 105

Query: 172 VIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGC--HAGRC 229
           + D+GSD+ W QC PC +C+ QS P+F+P  S+S+  VSC+S  C  LE+  C      C
Sbjct: 106 IADTGSDLTWTQCLPCRECFNQSQPIFNPRRSSSYRKVSCASDTCRSLESYHCGPDLQSC 165

Query: 230 RYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAA-GLLGLGGGSMSL 288
            Y  SYGD S+T G LA + +TIG   +    IGCGH+N G F G   G++GLGGGS+SL
Sbjct: 166 SYGYSYGDRSFTYGDLASDQITIGSFKLPKTVIGCGHQNGGTFGGVTSGIIGLGGGSLSL 225

Query: 289 VGQLGGQTG--GAFSYCLVSRGTGS--SGSLVFGREALPVGAAWV--PLVRNPRAP-SFY 341
           V Q+    G    FSYCL +  + +  +G++ FGR+A+  G   V  PLV  PR+P +FY
Sbjct: 226 VSQMRTIAGVKPRFSYCLPTFFSNANITGTISFGRKAVVSGRQVVSTPLV--PRSPDTFY 283

Query: 342 YVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRD--AFVAQT 399
           ++ L  + VG  R   +  +  +T  G+  +++D+GT +T LP   Y       A V + 
Sbjct: 284 FLTLEAISVGKKRFKAANGISAMTNHGN--IIIDSGTTLTLLPRSLYYGVFSTLARVIKA 341

Query: 400 GNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAF 459
             +   SG  I + CY+      + +P ++ +F+GG  + L   N   PV D  T C  F
Sbjct: 342 KRVDDPSG--ILELCYSAGQVDDLNIPIITAHFAGGADVKLLPVNTFAPVADNVT-CLTF 398

Query: 460 APSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           AP+ + ++I GN+ Q   ++ +D  N  + F P +C
Sbjct: 399 APA-TQVAIFGNLAQINFEVGYDLGNKRLSFEPKLC 433


>gi|226492465|ref|NP_001150925.1| aspartic proteinase nepenthesin-1 [Zea mays]
 gi|195642996|gb|ACG40966.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
          Length = 472

 Score =  186 bits (473), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 127/365 (34%), Positives = 191/365 (52%), Gaps = 30/365 (8%)

Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC-SQCYKQSDPVFDPADSASFSGVSCS 212
           GEY + + +G+PP     V D+GSD++W QC PC +QC++Q  P+++PA S +FS + C+
Sbjct: 112 GEYLMTLAIGTPPLPYAAVADTGSDLIWTQCAPCGTQCFEQPAPLYNPASSTTFSVLPCN 171

Query: 213 SAV--CDRLENAGCHAGRC--RYEVSYGDGSYTKGTLALETLTIGRTV-----VKNVAIG 263
           S++  C            C   Y  +YG G +T G    ET T G +      V  VA G
Sbjct: 172 SSLSMCAGALAGAAPPPGCACMYYQTYGTG-WTAGVQGSETFTFGSSAADQARVPGVAFG 230

Query: 264 CGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREAL 322
           C + +   + G+AGL+GLG GS+SLV QLG    G FSYCL   + T S+ +L+ G  A 
Sbjct: 231 CSNASSSDWNGSAGLVGLGRGSLSLVSQLGA---GRFSYCLTPFQDTNSTSTLLLGPSAA 287

Query: 323 P--VGAAWVPLVRNP-RAP--SFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTG 377
               G    P V +P RAP  ++YY+ L+G+ +G   +PIS   F L   G  G+++D+G
Sbjct: 288 LNGTGVRSTPFVASPARAPMSTYYYLNLTGISLGAKALPISPGAFSLKPDGTGGLIIDSG 347

Query: 378 TAVTRLPTPAYEAFRDAFVAQ-TGNLPRASGVSI--FDTCYNLSGFVSVR---VPTVSFY 431
           T +T L   AY+  R A  +Q    LP   G      D C+ L    S     +P+++ +
Sbjct: 348 TTITSLANAAYQQVRAAVKSQLVTTLPTVDGSDSTGLDLCFALPAPTSAPPAVLPSMTLH 407

Query: 432 FSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSG-LSIIGNIQQEGIQISFDGANGFVGF 490
           F G  ++ LPA +++I    +G +C A      G +S  GN QQ+ + I +D     + F
Sbjct: 408 FDGADMV-LPADSYMI--SGSGVWCLAMRNQTDGAMSTFGNYQQQNMHILYDVREETLSF 464

Query: 491 GPNVC 495
            P  C
Sbjct: 465 APAKC 469


>gi|357457681|ref|XP_003599121.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355488169|gb|AES69372.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 439

 Score =  186 bits (472), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 127/433 (29%), Positives = 194/433 (44%), Gaps = 39/433 (9%)

Query: 79  WNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEV 138
           +++EL+H D   S           R  +     +   +KR   L    S    D  K  +
Sbjct: 27  FSVELIHPDSSRSPFYNIRETQLQRISNV----VTHSIKRAHYLNHVFSLSHNDLPKPTI 82

Query: 139 QDFGTDVVSGMDQGSGEYFV-RIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPV 197
             +           +G Y+V    +G+PP   Y V+D+GSD +W QC+PC  C  Q+ P+
Sbjct: 83  IPY-----------AGSYYVMSYSIGTPPFQLYGVVDTGSDGIWFQCKPCKPCLNQTSPI 131

Query: 198 FDPADSASFSGVSCSSAVCDRLENAGCHAGR---CRYEVSYGDGSYTKGTLALETLTIGR 254
           F+P+ S+++  + CSS +C R E   C + R   C YE++Y D S ++G ++ +TLT+  
Sbjct: 132 FNPSKSSTYKNIRCSSPICKRGEKTRCSSNRKRKCEYEITYLDRSGSQGDISKDTLTLNS 191

Query: 255 T-----VVKNVAIGCGHKNQGMFVG-AAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRG 308
                     + IGCGHKN     G A+G++G G G+ S+V QLG   GG FSYCL S  
Sbjct: 192 NDGSPISFPKIVIGCGHKNSLTTEGLASGIIGFGRGNFSIVSQLGSSIGGKFSYCLASLF 251

Query: 309 TGS--SGSLVFGREALPVGAAWVPLVRNPRAPSF----YYVGLSGLGVGGMRIPISEDLF 362
           + +  S  L FG  A+  G     +V  P   SF    Y+  L    VG   I + +   
Sbjct: 252 SKANISSKLYFGDMAVVSGHG---VVSTPLIQSFYVGNYFTNLEAFSVGDHIIKLKDS-- 306

Query: 363 RLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVS 422
            L    +   V+D+G+ +T+LP   Y     A ++                CY  +    
Sbjct: 307 SLIPDNEGNAVIDSGSTITQLPNDVYSQLETAVISMVKLKRVKDPTQQLSLCYKTT-LKK 365

Query: 423 VRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFD 482
             VP ++ +F G  V  L A N  I ++     CFAF  S     + GNI Q+   + +D
Sbjct: 366 YEVPIITAHFRGADV-KLNAFNTFIQMNHE-VMCFAFNSSAFPWVVYGNIAQQNFLVGYD 423

Query: 483 GANGFVGFGPNVC 495
                + F P  C
Sbjct: 424 TLKNIISFKPTNC 436


>gi|326521034|dbj|BAJ92880.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 448

 Score =  186 bits (472), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 126/347 (36%), Positives = 176/347 (50%), Gaps = 16/347 (4%)

Query: 156 YFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAV 215
           Y VR  +G+PP+   + +D+ +D  W+ C  C+ C   +   F+PA S S+  V C S  
Sbjct: 108 YVVRARLGTPPQQLLLAVDTSNDAAWIPCSGCAGCPTTTP--FNPAASKSYRAVPCGSPA 165

Query: 216 CDRLENAGC--HAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFV 273
           C R  N  C  +   C + ++Y D S  +  L+ ++L +   VVK+   GC  K  G   
Sbjct: 166 CSRAPNPSCSLNTKSCGFSLTYADSSL-EAALSQDSLAVANDVVKSYTFGCLQKATGTAT 224

Query: 274 GAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALPVGAAWVPLV 332
              GLLGLG G +S + Q      G FSYCL S +    SG+L  GR+  P+     PL+
Sbjct: 225 PPQGLLGLGRGPLSFLSQTKDMYEGTFSYCLPSFKSLNFSGTLRLGRKGQPLRIKTTPLL 284

Query: 333 RNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFR 392
            NP   S YYV ++G+ VG   +PI             G V+D+GT  TRL  PAY A R
Sbjct: 285 VNPHRSSLYYVSMTGIRVGKKVVPIPPAALAFDPATGAGTVLDSGTMFTRLVAPAYVAVR 344

Query: 393 DAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDA 452
           D    +    P +S +  FDTCYN     +V+ P V+F F+G  V TLPA N +I     
Sbjct: 345 DEVRRRIRGAPLSS-LGGFDTCYN----TTVKWPPVTFMFTGMQV-TLPADNLVIHSTYG 398

Query: 453 GTFCFAFAPSPSG----LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
            T C A A +P G    L++I ++QQ+  +I FD  NG VGF    C
Sbjct: 399 TTSCLAMAAAPDGVNTVLNVIASMQQQNHRILFDVPNGRVGFAREQC 445


>gi|413925432|gb|AFW65364.1| hypothetical protein ZEAMMB73_378208 [Zea mays]
          Length = 418

 Score =  186 bits (471), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 138/411 (33%), Positives = 202/411 (49%), Gaps = 40/411 (9%)

Query: 98  NMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYF 157
            M  H    +F     R  +R++ L  RL    A +A+  +Q         MD G G Y 
Sbjct: 32  TMTRHEPTINFTRAAHRSRERLSILATRLGAASAGSAQSPLQ---------MDSGGGAYD 82

Query: 158 VRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCD 217
           +   +G+PP++   + D+GSD++W +C  C +C  +    + P  S+SFS + CSSA+C 
Sbjct: 83  MTFSMGTPPQTLSALADTGSDLIWAKCGACKRCAPRGSASYYPTKSSSFSKLPCSSALCR 142

Query: 218 RLEN---AGCHAGR-----CRYEVSYGDGS----YTKGTLALETLTIGRTVVKNVAIGCG 265
            LE+   A C   R     C Y  SYG  S    YT+G +  ET T+G   V+ +  GC 
Sbjct: 143 TLESQSLATCGGTRARGAVCSYRYSYGLSSNPHHYTQGYMGSETFTLGSDAVQGIGFGCT 202

Query: 266 HKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALP-V 324
             ++G +   +GL+GLG G +SLV QL     GAFSYCL S  + SS  L+FG  AL   
Sbjct: 203 TMSEGGYGSGSGLVGLGRGKLSLVRQL---KVGAFSYCLTSDPSTSS-PLLFGAGALTGP 258

Query: 325 GAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLP 384
           G    PLV N +  +FY V L  + +G  + P           G  G++ D+GT +T L 
Sbjct: 259 GVQSTPLV-NLKTSTFYTVNLDSISIGAAKTP---------GTGRHGIIFDSGTTLTFLA 308

Query: 385 TPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASN 444
            PAY       ++QT NL R  G   ++ C+  SG      P++  +F GG  + L   N
Sbjct: 309 EPAYTLAEAGLLSQTTNLTRVPGTDGYEVCFQTSG--GAVFPSMVLHFDGGD-MALKTEN 365

Query: 445 FLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           +   V+D+ + C+    SPS +SI+GNI Q    I +D     + F P  C
Sbjct: 366 YFGAVNDSVS-CWLVQKSPSEMSIVGNIMQMDYHIRYDLDKSVLSFQPTNC 415


>gi|414589628|tpg|DAA40199.1| TPA: hypothetical protein ZEAMMB73_627989 [Zea mays]
          Length = 452

 Score =  185 bits (470), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 133/411 (32%), Positives = 194/411 (47%), Gaps = 39/411 (9%)

Query: 112 MQRDVKRVATL--VR---RLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPP 166
           M+R   R A L  VR   R SG      K+E Q     V+     G  EY V + +G+PP
Sbjct: 54  MRRSKARAAALSAVRNRARFSG------KNE-QQTPAGVLPVRPSGDLEYVVDLAIGTPP 106

Query: 167 RSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCH- 225
           +    ++D+GSD++W QC PC+ C  Q DP+F P  SAS+  + C+  +C  + +  C  
Sbjct: 107 QPVSALLDTGSDLIWTQCAPCASCLSQPDPLFAPGQSASYEPMRCAGTLCSDILHHSCER 166

Query: 226 AGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKN-------VAIGCGHKNQGMFVGAAGL 278
              C Y  +YGDG+ T G  A E  T   +           +  GCG  N G     +G+
Sbjct: 167 PDTCTYRYNYGDGTMTVGVYATERFTFASSGGGGLTTTTVPLGFGCGSVNVGSLNNGSGI 226

Query: 279 LGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPV------GAAWVPLV 332
           +G G   +SLV QL  +    FSYCL S  +    +L+FG  +  V           PL+
Sbjct: 227 VGFGRNPLSLVSQLSIRR---FSYCLTSYASRRQSTLLFGSLSDGVYGDATGRVQTTPLL 283

Query: 333 RNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFR 392
           ++P+ P+FYYV  +GL VG  R+ I E  F L   G  GV++D+GTA+T LP        
Sbjct: 284 QSPQNPTFYYVHFTGLTVGARRLRIPESAFALRPDGSGGVIVDSGTALTLLPAAVLAEVV 343

Query: 393 DAFVAQTGNLPRASGVSIFD-TCYNL-------SGFVSVRVPTVSFYFSGGPVLTLPASN 444
            AF  Q   LP A+G +  D  C+ +       S    + VP +  +F G   L LP  N
Sbjct: 344 RAFRQQL-RLPFANGGNPEDGVCFLVPAAWRRSSSTSQMPVPRMVLHFQGAD-LDLPRRN 401

Query: 445 FLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           +++     G  C   A S    S IGN+ Q+ +++ +D     +   P  C
Sbjct: 402 YVLDDHRRGRLCLLLADSGDDGSTIGNLVQQDMRVLYDLEAETLSIAPARC 452


>gi|54290728|dbj|BAD62398.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
          Length = 486

 Score =  185 bits (469), Expect = 5e-44,   Method: Compositional matrix adjust.
 Identities = 130/351 (37%), Positives = 182/351 (51%), Gaps = 17/351 (4%)

Query: 155 EYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS---QCYKQSDPVFDPADSASFSGVSC 211
           E+ V +G+G+P +   ++ D+GSD+ WVQCQPC     C+ Q DP+FDP+ S++++ V C
Sbjct: 143 EFVVAVGLGTPAQPSALIFDTGSDLSWVQCQPCGSSGHCHPQQDPLFDPSKSSTYAAVHC 202

Query: 212 SSAVCDRLENAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQ 269
               C    +        C Y V YGDGS T G L+ +TL +  +  +     GCG +N 
Sbjct: 203 GEPQCAAAGDLCSEDNTTCLYLVRYGDGSSTTGVLSRDTLALTSSRALTGFPFGCGTRNL 262

Query: 270 GMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFG-REALPVGAA- 327
           G F    GLLGLG G +SL  Q     G  FSYCL S  + ++G L  G   A   GAA 
Sbjct: 263 GDFGRVDGLLGLGRGELSLPSQAAASFGAVFSYCLPSSNS-TTGYLTIGATPATDTGAAQ 321

Query: 328 WVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPA 387
           +  ++R P+ PSFY+V L  + +GG  +P+   +F        G ++D+GT +T LP  A
Sbjct: 322 YTAMLRKPQFPSFYFVELVSIDIGGYVLPVPPAVFT-----RGGTLLDSGTVLTYLPAQA 376

Query: 388 YEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLI 447
           Y   RD F         A    + D CY+ +G   V VP VSF F  G V  L     +I
Sbjct: 377 YALLRDRFRLTMERYTPAPPNDVLDACYDFAGESEVVVPAVSFRFGDGAVFELDFFGVMI 436

Query: 448 PVDDAGTFCFAFAPSPSG---LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
            +D+    C AFA   +G   LSIIGN QQ   ++ +D A   +GF P  C
Sbjct: 437 FLDE-NVGCLAFAAMDTGGLPLSIIGNTQQRSAEVIYDVAAEKIGFVPASC 486


>gi|225436202|ref|XP_002271145.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
           vinifera]
          Length = 440

 Score =  185 bits (469), Expect = 5e-44,   Method: Compositional matrix adjust.
 Identities = 134/424 (31%), Positives = 193/424 (45%), Gaps = 53/424 (12%)

Query: 78  RWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHE 137
           R++++L+HRD   S          +    +   R+ R  +R       +S   A  + + 
Sbjct: 34  RFSIDLIHRDSPKSP--------LYNPSETPAERLDRFFRRF------MSFSEASISPNT 79

Query: 138 VQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPV 197
            +         +   +GEY ++I +G+PP   Y + D+GSD++W QC PC  CYKQ +P+
Sbjct: 80  PE-------PPVSSNNGEYLMKISIGTPPFDVYGIYDTGSDLMWTQCLPCLSCYKQKNPM 132

Query: 198 FDPADSASFSGVSCSSAVCDRLENAGCHAGR--CRYEVSYGDGSYTKGTLALETLTIGR- 254
           FDP+ S SF  VSC S  C  L+   C   +  C +   YGDGS  +G +A ETLT+   
Sbjct: 133 FDPSKSTSFKEVSCESQQCRLLDTVSCSQPQKLCDFSYGYGDGSLAQGVIATETLTLNSN 192

Query: 255 ----TVVKNVAIGCGHKNQGMF-VGAAGLLGLGGGSMSLVGQLGGQ--TGGAFSYCLVSR 307
               T + N+  GCGH N G F     GL G GG  +SL  Q+     +G  FS CLV  
Sbjct: 193 SGQPTSILNIVFGCGHNNSGTFNENEMGLFGTGGRPLSLTSQIMSTLGSGRKFSQCLVPF 252

Query: 308 GTGSS--GSLVFGREALPVGAAWV--PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFR 363
            T  S    ++FG EA   G+  V  PLV     P++Y+V L G+ VG    P S     
Sbjct: 253 RTDPSITSKIIFGPEAEVSGSDVVSTPLVTK-DDPTYYFVTLDGISVGDKLFPFSSS--- 308

Query: 364 LTQMGDDG-VVMDTGTAVTRLPTPAY----EAFRDAFVAQTGNLPRASGVSIFDTCYNLS 418
            + M   G V +D GT  T LP   Y    +  ++A   +    P          CY  +
Sbjct: 309 -SPMATKGNVFIDAGTPPTLLPRDFYNRLVQGVKEAIPMEPVQDPDLQP----QLCYRSA 363

Query: 419 GFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQ 478
             +    P ++ +F G  V   P + F+ P +  G +CFA  P      I GN  Q    
Sbjct: 364 TLID--GPILTAHFDGADVQLKPLNTFISPKE--GVYCFAMQPIDGDTGIFGNFVQMNFL 419

Query: 479 ISFD 482
           I FD
Sbjct: 420 IGFD 423


>gi|222618833|gb|EEE54965.1| hypothetical protein OsJ_02555 [Oryza sativa Japonica Group]
          Length = 393

 Score =  185 bits (469), Expect = 6e-44,   Method: Compositional matrix adjust.
 Identities = 125/385 (32%), Positives = 180/385 (46%), Gaps = 78/385 (20%)

Query: 112 MQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVS--GMDQGSGEYFVRIGVGSPPRSQ 169
           ++RD  R   + R+ SG    AA  + Q     V +  G    + EY + +G+GSP  +Q
Sbjct: 60  LRRDQLRADYIRRKFSGSNGTAAGEDGQSSKVSVPTTLGSSLDTLEYVISVGLGSPAVTQ 119

Query: 170 YMVIDSGSDIVWVQCQPC---SQCYKQSDPVFDPADSASFSGVSCSSAVCDRL----ENA 222
            +VID+GSD+ WVQC+PC   S C+  +  +FDPA S++++  +CS+A C +L    E  
Sbjct: 120 RVVIDTGSDVSWVQCEPCPAPSPCHAHAGALFDPAASSTYAAFNCSAAACAQLGDSGEAN 179

Query: 223 GCHA-GRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKN--QGMFVGAAGLL 279
           GC A  RC+Y V YGDGS T GT                  GC H     GM     GL+
Sbjct: 180 GCDAKSRCQYIVKYGDGSNTTGT--------------GFQFGCSHAELGAGMDDKTDGLI 225

Query: 280 GLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPS 339
           GLGG + SLV Q                                         R+ + P+
Sbjct: 226 GLGGDAQSLVSQTA--------------------------------------ARSKKVPT 247

Query: 340 FYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQT 399
           +Y+  L  + VGG ++ +S  +F        G ++D+GT +TRLP  AY A   AF A  
Sbjct: 248 YYFAALEDIAVGGKKLGLSPSVFAA------GSLVDSGTVITRLPPAAYAALSSAFRAGM 301

Query: 400 GNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAF 459
               RA  + I DTC+N +G   V +PTV+  F+GG V+ L A   +         C AF
Sbjct: 302 TRYARAEPLGILDTCFNFTGLDKVSIPTVALVFAGGAVVDLDAHGIV------SGGCLAF 355

Query: 460 APS--PSGLSIIGNIQQEGIQISFD 482
           AP+        IGN+QQ   ++ +D
Sbjct: 356 APTRDDKAFGTIGNVQQRTFEVLYD 380


>gi|222637180|gb|EEE67312.1| hypothetical protein OsJ_24552 [Oryza sativa Japonica Group]
          Length = 420

 Score =  184 bits (468), Expect = 7e-44,   Method: Compositional matrix adjust.
 Identities = 121/360 (33%), Positives = 179/360 (49%), Gaps = 34/360 (9%)

Query: 149 MDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSG 208
           ++ G G Y + I VG+P  +  +V D+GSD++W QC PC++C++Q  P F PA S++FS 
Sbjct: 79  LENGVGGYNMNISVGTPLLTFSVVADTGSDLIWTQCAPCTKCFQQPAPPFQPASSSTFSK 138

Query: 209 VSCSSAVCDRLENA--GCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGH 266
           + C+S+ C  L N+   C+A  C Y   YG G YT G LA ETL +G     +VA GC  
Sbjct: 139 LPCTSSFCQFLPNSIRTCNATGCVYNYKYGSG-YTAGYLATETLKVGDASFPSVAFGCST 197

Query: 267 KNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREA-LPVG 325
           +N           GLG   + +         G FSYCL S     +  ++FG  A L  G
Sbjct: 198 EN-----------GLGQLDLGV---------GRFSYCLRSGSAAGASPILFGSLANLTDG 237

Query: 326 AAW-VPLVRNPRA-PSFYYVGLSGLGVGGMRIPISEDLFRLTQMG-DDGVVMDTGTAVTR 382
                P V NP   PS+YYV L+G+ VG   +P++   F  TQ G   G ++D+GT +T 
Sbjct: 238 NVQSTPFVNNPAVHPSYYYVNLTGITVGETDLPVTTSTFGFTQNGLGGGTIVDSGTTLTY 297

Query: 383 LPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYN--LSGFVSVRVPTVSFYFSGGPVLTL 440
           L    YE  + AF++QT ++   +G    D C+     G   + VP++   F GG    +
Sbjct: 298 LAKDGYEMVKQAFLSQTADVTTVNGTRGLDLCFKSTGGGGGGIAVPSLVLRFDGGAEYAV 357

Query: 441 PASNFLIPVDDAGTF---CFAFAPSP--SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           P     +  D  G+    C    P+     +S+IGN+ Q  + + +D   G   F P  C
Sbjct: 358 PTYFAGVETDSQGSVTVACLMMLPAKGDQPMSVIGNVMQMDMHLLYDLDGGIFSFAPADC 417


>gi|357158691|ref|XP_003578210.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 459

 Score =  184 bits (467), Expect = 8e-44,   Method: Compositional matrix adjust.
 Identities = 126/405 (31%), Positives = 190/405 (46%), Gaps = 26/405 (6%)

Query: 112 MQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYM 171
           +QR   R A L     GG    A+ + Q+     +     G  EY V + VG+PP+    
Sbjct: 60  VQRSKARAAALSVARLGGSNKGARQQDQNQQQPGLPVRPSGDLEYLVDLAVGTPPQPVSA 119

Query: 172 VIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCH-AGRCR 230
           ++D+GSD++W QC PC+ C  Q DP+F P  S+S+  + C+  +C+ + +  C     C 
Sbjct: 120 LLDTGSDLIWTQCAPCASCLPQPDPIFSPGASSSYEPMRCAGELCNDILHHSCQRPDTCT 179

Query: 231 YEVSYGDGSYTKGTLALETLTIGR--------TVVKNVAIGCGHKNQGMFVGAAGLLGLG 282
           Y  SYGDG+ T+G  A E  T            +   +  GCG  N+G     +G++G G
Sbjct: 180 YRYSYGDGTTTRGVYATERFTFSSSSSGGETTKLSAPLGFGCGTMNKGSLNNGSGIVGFG 239

Query: 283 GGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGR------EALPVGAAWVPLVRNPR 336
              +SLV QL  +    FSYCL    +G   +L+FG       +A         L+R+ +
Sbjct: 240 RAPLSLVSQLAIRR---FSYCLTPYASGRKSTLLFGSLRGGVYDAATATVQTTRLLRSRQ 296

Query: 337 APSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFV 396
            P+FYYV  +G+ VG  R+ I    F L   G  G ++D+GTA+T  P P       AF 
Sbjct: 297 NPTFYYVPFTGVTVGARRLRIPISAFALRPDGSGGAIVDSGTALTLFPAPVLAEVVRAFR 356

Query: 397 AQTGNLP-RASGVSIFD--TCYNLSGFVSVR---VPTVSFYFSGGPVLTLPASNFLIPVD 450
           +Q   LP  A+G S  D   C+  +     R   VP + F+  G   L LP  N+++   
Sbjct: 357 SQL-RLPFAANGSSGPDDGVCFAAAASRVPRPAVVPRMVFHLQGAD-LDLPRRNYVLDDQ 414

Query: 451 DAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
             G  C   A S    + IGN  Q+ +++ +D     + F P  C
Sbjct: 415 RKGNLCLLLADSGDSGTTIGNFVQQDMRVLYDLEADTLSFAPAQC 459


>gi|147811402|emb|CAN61225.1| hypothetical protein VITISV_006732 [Vitis vinifera]
          Length = 440

 Score =  184 bits (467), Expect = 9e-44,   Method: Compositional matrix adjust.
 Identities = 133/424 (31%), Positives = 192/424 (45%), Gaps = 53/424 (12%)

Query: 78  RWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHE 137
           R++++L+HRD   S          +    +   R+ R  +R       +S   A  + + 
Sbjct: 34  RFSIDLIHRDSPKSP--------LYNPSETPAERLDRFFRRF------MSFSEASISPNT 79

Query: 138 VQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPV 197
            +         +   +GEY ++I +G+PP   Y + D+GSD++W QC PC  CYKQ +P+
Sbjct: 80  PE-------PPVSSNNGEYLMKISIGTPPFDVYGIYDTGSDLMWTQCLPCLSCYKQKNPM 132

Query: 198 FDPADSASFSGVSCSSAVCDRLENAGCHAGR--CRYEVSYGDGSYTKGTLALETLTIGRT 255
           FDP+ S SF  VSC S  C  L+   C   +  C +   YGDGS  +G +A ETLT+   
Sbjct: 133 FDPSKSTSFKEVSCESQQCRLLDTVSCSQPQKLCDFSYGYGDGSLAQGVIATETLTLNSN 192

Query: 256 -----VVKNVAIGCGHKNQGMF-VGAAGLLGLGGGSMSLVGQLGGQ--TGGAFSYCLVSR 307
                 + N+  GCGH N G F     GL G GG  +SL  Q+     +G  FS CLV  
Sbjct: 193 SGQPXSIXNIVFGCGHNNSGTFNENEMGLFGTGGRPLSLTSQIMSTLGSGRKFSQCLVPF 252

Query: 308 GTGSS--GSLVFGREALPVGAAWV--PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFR 363
            T  S    ++FG EA   G+  V  PLV     P++Y+V L G+ VG    P S     
Sbjct: 253 RTDPSITSKIIFGPEAEVSGSXVVSTPLVTK-DDPTYYFVTLDGISVGDKLFPFSSS--- 308

Query: 364 LTQMGDDG-VVMDTGTAVTRLPTPAY----EAFRDAFVAQTGNLPRASGVSIFDTCYNLS 418
            + M   G V +D GT  T LP   Y    +  ++A   +    P          CY  +
Sbjct: 309 -SPMATKGNVFIDAGTPPTLLPRDFYNRLVQGVKEAIPMEPVQDPDLQP----QLCYRSA 363

Query: 419 GFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQ 478
             +    P ++ +F G  V   P + F+ P +  G +CFA  P      I GN  Q    
Sbjct: 364 TLID--GPILTAHFDGADVQLKPLNTFISPKE--GVYCFAMQPIDGDTGIFGNFVQMNFL 419

Query: 479 ISFD 482
           I FD
Sbjct: 420 IGFD 423


>gi|449461377|ref|XP_004148418.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
 gi|449518059|ref|XP_004166061.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
          Length = 436

 Score =  184 bits (466), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 120/350 (34%), Positives = 171/350 (48%), Gaps = 15/350 (4%)

Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSS 213
           G Y VR+ +G+P ++ YMV+D+ +D  W  C  C  C   S   F   +S++F+ + CS 
Sbjct: 93  GNYVVRVQLGTPGQTMYMVLDTSNDAAWAPCSGCIGC--SSTTTFSAQNSSTFATLDCSK 150

Query: 214 AVCDRLENAGCHAG---RCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQG 270
             C +     C       C +  +YG  S    TL  ++L +G  V+ N + GC     G
Sbjct: 151 PECTQARGLSCPTTGNVDCLFNQTYGGDSTFSATLVQDSLHLGPNVIPNFSFGCISSASG 210

Query: 271 MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALPVGAAWV 329
             +   GL+GLG G +SL+ Q G    G FSYCL S +    SGSL  G    P      
Sbjct: 211 SSIPPQGLMGLGRGPLSLISQSGSLYSGLFSYCLPSFKSYYFSGSLKLGPVGQPKAIRTT 270

Query: 330 PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYE 389
           PL+ NP  PS YYV L+G+ VG + +PIS +L         G ++D+GT +TR     Y 
Sbjct: 271 PLLHNPHRPSLYYVNLTGISVGRVLVPISPELLAFDPNTGAGTIIDSGTVITRFVPAIYT 330

Query: 390 AFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPV 449
           A RD F  Q G     S +  FDTC+  +  VS   P ++ + SG   L LP  N LI  
Sbjct: 331 AVRDEFRKQVGG--SFSPLGAFDTCFATNNEVS--APAITLHLSGLD-LKLPMENSLIHS 385

Query: 450 DDAGTFCFAFAPSP----SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
                 C A A +P    S +++I N+QQ+  +I FD  N  +G    +C
Sbjct: 386 SAGSLACLAMAAAPNNVNSVVNVIANLQQQNHRILFDINNSKLGIARELC 435


>gi|242051593|ref|XP_002454942.1| hypothetical protein SORBIDRAFT_03g001790 [Sorghum bicolor]
 gi|241926917|gb|EES00062.1| hypothetical protein SORBIDRAFT_03g001790 [Sorghum bicolor]
          Length = 431

 Score =  184 bits (466), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 139/408 (34%), Positives = 191/408 (46%), Gaps = 51/408 (12%)

Query: 106 HSFHARMQRDVK-RVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGS 164
           H    R  R  K RVA L  RL+G           D    +    D+G   Y V IG+G+
Sbjct: 54  HDMWRRSARASKARVARLEARLTG-----------DMSVPLARISDEG---YTVTIGIGT 99

Query: 165 PPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAG- 223
           PP+   ++ D+ SD+ W QC   +   KQ +P+FDPA S+SF+ V+CSS +C   +N G 
Sbjct: 100 PPQLHTLIADTASDLTWTQCNLFNDTAKQVEPLFDPAKSSSFAFVTCSSKLCTE-DNPGT 158

Query: 224 --CHAGRCRYEVSYGDGSYTKGTLALETLTI---GRTVVKNVAIGCGHKNQGMFVGAAGL 278
             C    CRY   Y       G LA E+ T+    + +  +   GCG    G  +GA+G+
Sbjct: 159 KRCSNKTCRYVYPYVSVE-AAGVLAYESFTLSDNNQHICMSFGFGCGALTDGNLLGASGI 217

Query: 279 LGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVR----N 334
           LG+    +S+V QL       FSYCL       S  L FG        AW  L R     
Sbjct: 218 LGMSPAILSMVSQLAIPK---FSYCLTPYTDRKSSPLFFG--------AWADLGRYKTTG 266

Query: 335 PRAPS---FYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAF 391
           P   S   +YYV L GL +G  R+ +    F L Q    G V+D G  V +L  PA+ A 
Sbjct: 267 PIQKSLTFYYYVPLVGLSLGTRRLDVPAATFALKQ---GGTVVDLGCTVGQLAEPAFTAL 323

Query: 392 RDAFVAQTGNLPRAS-GVSIFDTCYNLSGFV---SVRVPTVSFYFSGGPVLTLPASNFLI 447
           ++A V  T NLP  +  V  +  C+ L   V   +V+ P +  YF GG  + LP  N+  
Sbjct: 324 KEA-VLHTLNLPLTNRTVKDYKVCFALPSGVAMGAVQTPPLVLYFDGGADMVLPRDNYF- 381

Query: 448 PVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
               AG  C A  P   G+SIIGN+QQ+   + FD  +    F P +C
Sbjct: 382 QEPTAGLMCLALVPG-GGMSIIGNVQQQNFHLLFDVHDSKFLFAPTIC 428


>gi|4063754|gb|AAC98462.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|20197956|gb|AAM15330.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
          Length = 389

 Score =  184 bits (466), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 115/352 (32%), Positives = 167/352 (47%), Gaps = 36/352 (10%)

Query: 155 EYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSA 214
           EY +++ +G+PP     V+D+GS+ +W QC PC  CY Q+ P+FDP+ S++F  + C + 
Sbjct: 58  EYLMKLQIGTPPFEIEAVLDTGSEHIWTQCLPCVHCYNQTAPIFDPSKSSTFKEIRCDT- 116

Query: 215 VCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT-----VVKNVAIGCGHKNQ 269
                     H   C YE+ YG  SYTKGTL  ET+TI  T     V+    IGCG  N 
Sbjct: 117 ----------HDHSCPYELVYGGKSYTKGTLVTETVTIHSTSGQPFVMPETIIGCGRNNS 166

Query: 270 GMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWV 329
           G   G AG++GL  G  SL+ Q+GG+  G  SYC   +GT     + FG  A+  G   V
Sbjct: 167 GFKPGFAGVVGLDRGPKSLITQMGGEYPGLMSYCFAGKGT---SKINFGANAIVAGDGVV 223

Query: 330 P---LVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTP 386
                V+  + P FYY+ L  + VG  RI      F   +     +V+D+G+ +T  P  
Sbjct: 224 STTVFVKTAK-PGFYYLNLDAVSVGNTRIETVGTPFHALK---GNIVIDSGSTLTYFPES 279

Query: 387 AYEAFRDAF--VAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASN 444
                R A   V      PR+  +  +    ++        P ++ +FSGG  L L   N
Sbjct: 280 YCNLVRKAVEQVVTAVRFPRSDILCYYSKTIDI-------FPVITMHFSGGADLVLDKYN 332

Query: 445 FLIPVDDAGTFCFA-FAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
             +  +  G FC A    SP   +I GN  Q    + +D ++  V F P  C
Sbjct: 333 MYVASNTGGVFCLAIICNSPIEEAIFGNRAQNNFLVGYDSSSLLVSFKPTNC 384


>gi|30683732|ref|NP_180371.2| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|28392898|gb|AAO41885.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|56382011|gb|AAV85724.1| At2g28040 [Arabidopsis thaliana]
 gi|330252978|gb|AEC08072.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 395

 Score =  184 bits (466), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 115/352 (32%), Positives = 167/352 (47%), Gaps = 36/352 (10%)

Query: 155 EYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSA 214
           EY +++ +G+PP     V+D+GS+ +W QC PC  CY Q+ P+FDP+ S++F  + C + 
Sbjct: 64  EYLMKLQIGTPPFEIEAVLDTGSEHIWTQCLPCVHCYNQTAPIFDPSKSSTFKEIRCDT- 122

Query: 215 VCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT-----VVKNVAIGCGHKNQ 269
                     H   C YE+ YG  SYTKGTL  ET+TI  T     V+    IGCG  N 
Sbjct: 123 ----------HDHSCPYELVYGGKSYTKGTLVTETVTIHSTSGQPFVMPETIIGCGRNNS 172

Query: 270 GMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWV 329
           G   G AG++GL  G  SL+ Q+GG+  G  SYC   +GT     + FG  A+  G   V
Sbjct: 173 GFKPGFAGVVGLDRGPKSLITQMGGEYPGLMSYCFAGKGT---SKINFGANAIVAGDGVV 229

Query: 330 P---LVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTP 386
                V+  + P FYY+ L  + VG  RI      F   +     +V+D+G+ +T  P  
Sbjct: 230 STTVFVKTAK-PGFYYLNLDAVSVGNTRIETVGTPFHALK---GNIVIDSGSTLTYFPES 285

Query: 387 AYEAFRDAF--VAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASN 444
                R A   V      PR+  +  +    ++        P ++ +FSGG  L L   N
Sbjct: 286 YCNLVRKAVEQVVTAVRFPRSDILCYYSKTIDI-------FPVITMHFSGGADLVLDKYN 338

Query: 445 FLIPVDDAGTFCFA-FAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
             +  +  G FC A    SP   +I GN  Q    + +D ++  V F P  C
Sbjct: 339 MYVASNTGGVFCLAIICNSPIEEAIFGNRAQNNFLVGYDSSSLLVSFKPTNC 390


>gi|357456413|ref|XP_003598487.1| Peptidase A1, putative [Medicago truncatula]
 gi|355487535|gb|AES68738.1| Peptidase A1, putative [Medicago truncatula]
          Length = 414

 Score =  184 bits (466), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 124/350 (35%), Positives = 174/350 (49%), Gaps = 18/350 (5%)

Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
           S  Y VR  +G+PP++  + +D+ +D  W+ C  C  C   +  +F P  S +F  VSC+
Sbjct: 75  SPTYIVRAKIGTPPQTLLLAMDTSNDAAWIPCTACDGC---ASTLFAPEKSTTFKNVSCA 131

Query: 213 SAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMF 272
           +  C ++ N GC    C + ++YG  S     L  +T+T+    V +   GC  K  G  
Sbjct: 132 APECKQVPNPGCGVSSCNFNLTYGSSSIA-ANLVQDTITLATDPVPSYTFGCVSKTTGTS 190

Query: 273 VGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALPVGAAWVPL 331
               GLLGLG G +SL+ Q        FSYCL S +    SGSL  G  A P    + PL
Sbjct: 191 APPQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPSFKSLNFSGSLRLGPVAQPKRIKYTPL 250

Query: 332 VRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAF 391
           ++NPR  S YYV L  + VG   + I             G + D+GT  TRL  P Y A 
Sbjct: 251 LKNPRRSSLYYVNLEAIRVGRKVVDIPPAALAFNPTTGAGTIFDSGTVFTRLVAPVYVAV 310

Query: 392 RDAFVAQTGNLPRASGVSI--FDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPV 449
           RD F  + G  P+ +  S+  FDTCYN    V + VPT++F F+G  V TLP  N LI  
Sbjct: 311 RDEFRRRVG--PKLTVTSLGGFDTCYN----VPIVVPTITFIFTGMNV-TLPQDNILIHS 363

Query: 450 DDAGTFCFAFAPSP----SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
               T C A A +P    S L++I N+QQ+  ++ +D  N  VG    +C
Sbjct: 364 TAGSTTCLAMAGAPDNVNSVLNVIANMQQQNHRVLYDVPNSRVGVARELC 413


>gi|125600538|gb|EAZ40114.1| hypothetical protein OsJ_24557 [Oryza sativa Japonica Group]
          Length = 412

 Score =  183 bits (465), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 119/350 (34%), Positives = 176/350 (50%), Gaps = 36/350 (10%)

Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQ-PCSQCYKQSDPVFDPADSASFSGVSC 211
           +  Y V I +G+PP     V+D+GSD++W QC  PC +C+ Q  P++ PA SA+++ VSC
Sbjct: 89  TATYLVDIAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSATYANVSC 148

Query: 212 SSAVCDRLENAGCHAGR----CRYEVSYGDGSYTKGTLALETLTIGR-TVVKNVAIGCGH 266
            S +C  L++           C Y  SYGDG+ T G LA ET T+G  T V+ VA GCG 
Sbjct: 149 RSPMCQALQSPWSRCSPPDTGCAYYFSYGDGTSTDGVLATETFTLGSDTAVRGVAFGCGT 208

Query: 267 KNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGA 326
           +N G    ++GL+G+G G +SLV QLG           V+R   S  +            
Sbjct: 209 ENLGSTDNSSGLVGMGRGPLSLVSQLG-----------VTRPRRSCRARAA------ARG 251

Query: 327 AWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTP 386
              P   +P         L G+ VG   +PI   +FRLT MGD GV++D+GT  T L   
Sbjct: 252 GGAPTTTSP---------LEGITVGDTLLPIDPAVFRLTPMGDGGVIIDSGTTFTALEER 302

Query: 387 AYEAFRDAFVAQTGNLPRASGVSI-FDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNF 445
           A+ A   A  ++   LP ASG  +    C+  +   +V VP +  +F G   + L   ++
Sbjct: 303 AFVALARALASRV-RLPLASGAHLGLSLCFAAASPEAVEVPRLVLHFDGAD-MELRRESY 360

Query: 446 LIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           ++    AG  C     S  G+S++G++QQ+   I +D   G + F P  C
Sbjct: 361 VVEDRSAGVACLGMV-SARGMSVLGSMQQQNTHILYDLERGILSFEPAKC 409


>gi|108707838|gb|ABF95633.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
          Length = 391

 Score =  183 bits (465), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 127/360 (35%), Positives = 179/360 (49%), Gaps = 34/360 (9%)

Query: 155 EYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSA 214
           EY V + +G+PP+   + +D+GSD++W QCQPC  C+ Q+ P FDP+ S++ S  SC S 
Sbjct: 34  EYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTSCDST 93

Query: 215 VCDRLENAGCHAGR------CRYEVSYGDGSYTKGTLALETLTI--GRTVVKNVAIGCGH 266
           +C  L  A C + +      C Y  SYGD S T G L ++  T       V  VA GCG 
Sbjct: 94  LCQGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTFVGAGASVPGVAFGCGL 153

Query: 267 KNQGMFV-GAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPV- 324
            N G+F     G+ G G G +SL  QL     G FS+C  +  TG+  S V     LP  
Sbjct: 154 FNNGVFKSNETGIAGFGRGPLSLPSQL---KVGNFSHCFTTI-TGAIPSTVL--LDLPAD 207

Query: 325 -------GAAWVPLV---RNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVM 374
                       PL+   +N   P+ YY+ L G+ VG  R+P+ E  F LT  G  G ++
Sbjct: 208 LFSNGQGAVQTTPLIQYAKNEANPTLYYLSLKGITVGSTRLPVPESAFALTN-GTGGTII 266

Query: 375 DTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSI-FDTCYNLSGFVSVRVPTVSFYFS 433
           D+GT++T LP   Y+  RD F AQ   LP   G +    TC++        VP +  +F 
Sbjct: 267 DSGTSITSLPPQVYQVVRDEFAAQI-KLPVVPGNATGHYTCFSAPSQAKPDVPKLVLHFE 325

Query: 434 GGPVLTLPASNFLIPV-DDAGT--FCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGF 490
           G   + LP  N++  V DDAG    C A        +IIGN QQ+ + + +D  N  + F
Sbjct: 326 GA-TMDLPRENYVFEVPDDAGNSIICLAINKG-DETTIIGNFQQQNMHVLYDLQNNMLSF 383


>gi|147776733|emb|CAN74676.1| hypothetical protein VITISV_038368 [Vitis vinifera]
          Length = 389

 Score =  183 bits (464), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 102/293 (34%), Positives = 158/293 (53%), Gaps = 24/293 (8%)

Query: 199 DPADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVK 258
           +   +A     S +S VC      G  A  C Y ++YGDGS+T+G L  E L  G  +VK
Sbjct: 52  EDVSNAQIPVTSGNSGVC------GSAAPICNYAINYGDGSFTRGELGHEKLKFGTILVK 105

Query: 259 NVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFG 318
           +   GCG  N+G+F G +GL+GLG   +SL+ Q  G  GG FSYCL S     SGSL+ G
Sbjct: 106 DFIFGCGRNNKGLFGGVSGLMGLGRSDLSLISQTSGIFGGVFSYCLPSTERKGSGSLILG 165

Query: 319 ------REALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGV 372
                 R + P+  ++  ++ NP+  +FY++ L+G+ +GG+ +       +   +G   +
Sbjct: 166 GNSSVYRNSSPI--SYAKMIENPQLYNFYFINLTGISIGGVAL-------QAPSVGPSRI 216

Query: 373 VMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYF 432
           ++D+GT +TRLP   Y+A +  F+ Q    P A   SI DTC+NLS +  V +PT+  +F
Sbjct: 217 LVDSGTVITRLPPTIYKALKAEFLKQFTGFPPAPAFSILDTCFNLSAYQEVDIPTIKMHF 276

Query: 433 SGGPVLTLPASN-FLIPVDDAGTFCFAFA--PSPSGLSIIGNIQQEGIQISFD 482
            G   LT+  +  F     DA   C A A       ++I+GN QQ+ +++ +D
Sbjct: 277 EGNAELTVDVTGVFYFVKSDASQVCLALASLEYQDEVAILGNYQQKNLRVIYD 329


>gi|15226315|ref|NP_180368.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|4510415|gb|AAD21501.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|330252975|gb|AEC08069.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 396

 Score =  182 bits (462), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 117/349 (33%), Positives = 168/349 (48%), Gaps = 31/349 (8%)

Query: 156 YFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAV 215
           Y +++ VG+PP     +ID+GS+I W QC PC  CY+Q+ P+FDP+ S++F         
Sbjct: 65  YLMKLQVGTPPFEIQAIIDTGSEITWTQCLPCVHCYEQNAPIFDPSKSSTF--------- 115

Query: 216 CDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT-----VVKNVAIGCGHKNQG 270
               +   C    C YEV Y D +YT GTLA ET+T+  T     V+    IGCGH N  
Sbjct: 116 ----KEKRCDGHSCPYEVDYFDHTYTMGTLATETITLHSTSGEPFVMPETIIGCGHNNSW 171

Query: 271 MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWV- 329
                +G++GL  G  SL+ Q+GG+  G  SYC   +GT     + FG  A+  G   V 
Sbjct: 172 FKPSFSGMVGLNWGPSSLITQMGGEYPGLMSYCFSGQGT---SKINFGANAIVAGDGVVS 228

Query: 330 -PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAY 388
             +      P FYY+ L  + VG  RI      F   +     +V+D+GT +T  P    
Sbjct: 229 TTMFMTTAKPGFYYLNLDAVSVGNTRIETMGTTFHALE---GNIVIDSGTTLTYFPVSYC 285

Query: 389 EAFRDAFVAQTGNLPRASGVSIFDT-CYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLI 447
              R A V       RA+  +  D  CYN S  + +  P ++ +FSGG  L L   N  +
Sbjct: 286 NLVRQA-VEHVVTAVRAADPTGNDMLCYN-SDTIDI-FPVITMHFSGGVDLVLDKYNMYM 342

Query: 448 PVDDAGTFCFA-FAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
             ++ G FC A    SP+  +I GN  Q    + +D ++  V F P  C
Sbjct: 343 ESNNGGVFCLAIICNSPTQEAIFGNRAQNNFLVGYDSSSLLVSFSPTNC 391


>gi|224072901|ref|XP_002303933.1| predicted protein [Populus trichocarpa]
 gi|222841365|gb|EEE78912.1| predicted protein [Populus trichocarpa]
          Length = 370

 Score =  182 bits (461), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 121/348 (34%), Positives = 165/348 (47%), Gaps = 15/348 (4%)

Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
           S  Y V+  VG+PP++  M +D+  D  W+ C+ C  C   S  VF+   S +F  + C 
Sbjct: 32  SPSYIVKAKVGTPPQTLLMALDNSYDAAWIPCKGCVGC---SSTVFNTVKSTTFKTLGCG 88

Query: 213 SAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMF 272
           +  C ++ N  C    C +  +YG  S     L  +T+ +    V   A GC  K  G  
Sbjct: 89  APQCKQVPNPICGGSTCTWNTTYGS-STILSNLTRDTIALSMDPVPYYAFGCIQKATGSS 147

Query: 273 VGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALPVGAAWVPL 331
           V   GLLG G G +S + Q        FSYCL S R    SGSL  G    P      PL
Sbjct: 148 VPPQGLLGFGRGPLSFLSQTQNLYKSTFSYCLPSFRTLNFSGSLRLGPVGQPPRIKTTPL 207

Query: 332 VRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAF 391
           ++NPR  S YYV L+G+ VG   + I             G + D+GT  TRL  PAY A 
Sbjct: 208 LKNPRRSSLYYVKLNGIRVGRKIVDIPRSALAFNPTTGAGTIFDSGTVFTRLVAPAYIAV 267

Query: 392 RDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDD 451
           R+ F  + GN    S +  FDTCY+    V +  PT++F FSG  V T+P  N LI    
Sbjct: 268 RNEFRKRVGNA-TVSSLGGFDTCYS----VPIVPPTITFMFSGMNV-TMPPENLLIHSTA 321

Query: 452 AGTFCFAFAPSP----SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
             T C A A +P    S L++I ++QQ+  +I FD  N  +G     C
Sbjct: 322 GVTSCLAMAAAPDNVNSVLNVIASMQQQNHRILFDVPNSRLGVAREQC 369


>gi|115466060|ref|NP_001056629.1| Os06g0118700 [Oryza sativa Japonica Group]
 gi|55296436|dbj|BAD68559.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|113594669|dbj|BAF18543.1| Os06g0118700 [Oryza sativa Japonica Group]
 gi|215767921|dbj|BAH00150.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 494

 Score =  182 bits (461), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 120/345 (34%), Positives = 166/345 (48%), Gaps = 35/345 (10%)

Query: 168 SQYMVIDSGSDIVWVQCQPCS--QCYKQSDPVFDPADSASFSGVSCSSAVCDRL---ENA 222
           +Q MV+D+ SD+ WVQC PC    CY Q D ++DP  S+S    SC+S  C +L    N 
Sbjct: 168 TQTMVLDTASDVTWVQCSPCPTPPCYPQKDVLYDPTKSSSSGVFSCNSPTCTQLGPYANG 227

Query: 223 GCHAGRCRYEVSYGDGSYTKGTLALETLTI-GRTVVKNVAIGCGHKNQGMFV---GAAGL 278
             +  +C+Y V Y DG+ T GT   + LTI   T V++   GC H  QG F     AAG+
Sbjct: 228 CTNNNQCQYRVRYPDGTSTAGTYISDLLTITPATAVRSFQFGCSHGVQGSFSFGSSAAGI 287

Query: 279 LGLGGGSMSLVGQLGGQTGGAFSYCL---VSRGTGSSGSLVFGREALPVGAAW----VPL 331
           + LGGG  SLV Q     G  FS+C      RG        F    +P  AAW     P+
Sbjct: 288 MALGGGPESLVSQTAATYGRVFSHCFPPPTRRG--------FFTLGVPRVAAWRYVLTPM 339

Query: 332 VRNPR-APSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEA 390
           ++NP   P+FY V L  + V G RI +   +F        G  +D+ TA+TRLP  AY+A
Sbjct: 340 LKNPAIPPTFYMVRLEAIAVAGQRIAVPPTVFAA------GAALDSRTAITRLPPTAYQA 393

Query: 391 FRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVD 450
            R AF  +      A      DTCY+++G  S  +P ++  F     + L  S  L    
Sbjct: 394 LRQAFRDRMAMYQPAPPKGPLDTCYDMAGVRSFALPRITLVFDKNAAVELDPSGVLF--- 450

Query: 451 DAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
             G   F   P+     IIGNIQ + +++ ++     VGF    C
Sbjct: 451 -QGCLAFTAGPNDQVPGIIGNIQLQTLEVLYNIPAALVGFRHAAC 494


>gi|116310064|emb|CAH67085.1| H0818E04.2 [Oryza sativa Indica Group]
 gi|116310187|emb|CAH67199.1| OSIGBa0152K17.11 [Oryza sativa Indica Group]
          Length = 464

 Score =  182 bits (461), Expect = 5e-43,   Method: Compositional matrix adjust.
 Identities = 132/426 (30%), Positives = 205/426 (48%), Gaps = 49/426 (11%)

Query: 109 HARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVV--SGMDQGSGEYFVRIGVGSPP 166
           H  ++R ++R      RL+G G   A+ E       VV  + +    GEY V++G+G+PP
Sbjct: 45  HELLRRAIQRSRY---RLAGIGM--ARGEAASARKAVVAETPIMPAGGEYLVKLGIGTPP 99

Query: 167 RSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGC-- 224
                 ID+ SD++W QCQPC+ CY Q DP+F+P  S++++ + CSS  CD L+   C  
Sbjct: 100 YKFTAAIDTASDLIWTQCQPCTGCYHQVDPMFNPRVSSTYAALPCSSDTCDELDVHRCGH 159

Query: 225 -HAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQG--MFVGAAGLLGL 281
                C+Y  +Y   + T+GTLA++ L IG    + VA GC   + G      A+G++GL
Sbjct: 160 DDDESCQYTYTYSGNATTEGTLAVDKLVIGEDAFRGVAFGCSTSSTGGAPPPQASGVVGL 219

Query: 282 GGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAW----VPLVRNPRA 337
           G G +SLV QL  +    F+YCL    +   G LV G +A     A     VP+ R+PR 
Sbjct: 220 GRGPLSLVSQLSVRR---FAYCLPPPASRIPGKLVLGADADAARNATNRIAVPMRRDPRY 276

Query: 338 PSFYYVGLSGLGVGGMRIPISEDLFRL--------------------TQMGDD---GVVM 374
           PS+YY+ L GL +G   + +                             +GD    G+++
Sbjct: 277 PSYYYLNLDGLLIGDRTMSLPPTTTTTATATATAPAPAPTPSPNATAVAVGDANRYGMII 336

Query: 375 DTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSI-FDTCYNLS---GFVSVRVPTVSF 430
           D  + +T L    Y+   +    +   LPR +G S+  D C+ L     F  V VP V+ 
Sbjct: 337 DIASTITFLEASLYDELVNDLEVEI-RLPRGTGSSLGLDLCFILPDGVAFDRVYVPAVAL 395

Query: 431 YFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSG-LSIIGNIQQEGIQISFDGANGFVG 489
            F G   L L  +       ++G  C     + +G +SI+GN QQ+ +Q+ ++   G V 
Sbjct: 396 AFDGR-WLRLDKARLFAEDRESGMMCLMVGRAEAGSVSILGNFQQQNMQVLYNLRRGRVT 454

Query: 490 FGPNVC 495
           F  + C
Sbjct: 455 FVQSPC 460


>gi|115458646|ref|NP_001052923.1| Os04g0448500 [Oryza sativa Japonica Group]
 gi|38344830|emb|CAD40872.2| OSJNBa0064H22.11 [Oryza sativa Japonica Group]
 gi|113564494|dbj|BAF14837.1| Os04g0448500 [Oryza sativa Japonica Group]
          Length = 464

 Score =  182 bits (461), Expect = 5e-43,   Method: Compositional matrix adjust.
 Identities = 132/426 (30%), Positives = 205/426 (48%), Gaps = 49/426 (11%)

Query: 109 HARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVV--SGMDQGSGEYFVRIGVGSPP 166
           H  ++R ++R      RL+G G   A+ E       VV  + +    GEY V++G+G+PP
Sbjct: 45  HELLRRAIQRSRY---RLAGIGM--ARGEAASARKAVVAETPIMPAGGEYLVKLGIGTPP 99

Query: 167 RSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGC-- 224
                 ID+ SD++W QCQPC+ CY Q DP+F+P  S++++ + CSS  CD L+   C  
Sbjct: 100 YKFTAAIDTASDLIWTQCQPCTGCYHQVDPMFNPRVSSTYAALPCSSDTCDELDVHRCGH 159

Query: 225 -HAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQG--MFVGAAGLLGL 281
                C+Y  +Y   + T+GTLA++ L IG    + VA GC   + G      A+G++GL
Sbjct: 160 DDDESCQYTYTYSGNATTEGTLAVDKLVIGEDAFRGVAFGCSTSSTGGAPPPQASGVVGL 219

Query: 282 GGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAA----WVPLVRNPRA 337
           G G +SLV QL  +    F+YCL    +   G LV G +A     A     VP+ R+PR 
Sbjct: 220 GRGPLSLVSQLSVRR---FAYCLPPPASRIPGKLVLGADADAARNATNRIAVPMRRDPRY 276

Query: 338 PSFYYVGLSGLGVGGMRIPISEDLFRL--------------------TQMGDD---GVVM 374
           PS+YY+ L GL +G   + +                             +GD    G+++
Sbjct: 277 PSYYYLNLDGLLIGDRAMSLPPTTTTTATATATAPAPAPTPSPNATAVAVGDANRYGMII 336

Query: 375 DTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSI-FDTCYNLS---GFVSVRVPTVSF 430
           D  + +T L    Y+   +    +   LPR +G S+  D C+ L     F  V VP V+ 
Sbjct: 337 DIASTITFLEASLYDELVNDLEVEI-RLPRGTGSSLGLDLCFILPDGVAFDRVYVPAVAL 395

Query: 431 YFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSG-LSIIGNIQQEGIQISFDGANGFVG 489
            F G   L L  +       ++G  C     + +G +SI+GN QQ+ +Q+ ++   G V 
Sbjct: 396 AFDGR-WLRLDKARLFAEDRESGMMCLMVGRAEAGSVSILGNFQQQNMQVLYNLRRGRVT 454

Query: 490 FGPNVC 495
           F  + C
Sbjct: 455 FVQSPC 460


>gi|125553822|gb|EAY99427.1| hypothetical protein OsI_21398 [Oryza sativa Indica Group]
          Length = 469

 Score =  181 bits (460), Expect = 6e-43,   Method: Compositional matrix adjust.
 Identities = 120/345 (34%), Positives = 166/345 (48%), Gaps = 35/345 (10%)

Query: 168 SQYMVIDSGSDIVWVQCQPCS--QCYKQSDPVFDPADSASFSGVSCSSAVCDRL---ENA 222
           +Q MV+D+ SD+ WVQC PC    CY Q D ++DP  S+S    SC+S  C +L    N 
Sbjct: 143 TQTMVLDTASDVTWVQCSPCPTPPCYPQKDVLYDPTKSSSSGVFSCNSPTCTQLGPYANG 202

Query: 223 GCHAGRCRYEVSYGDGSYTKGTLALETLTI-GRTVVKNVAIGCGHKNQGMFV---GAAGL 278
             +  +C+Y V Y DG+ T GT   + LTI   T V++   GC H  QG F     AAG+
Sbjct: 203 CTNNNQCQYRVRYPDGTSTAGTYISDLLTITPATAVRSFQFGCSHGVQGSFSFGSSAAGI 262

Query: 279 LGLGGGSMSLVGQLGGQTGGAFSYCL---VSRGTGSSGSLVFGREALPVGAAW----VPL 331
           + LGGG  SLV Q     G  FS+C      RG        F    +P  AAW     P+
Sbjct: 263 MALGGGPESLVSQTAATYGRVFSHCFPPPTRRG--------FFTLGVPRVAAWRYVLTPM 314

Query: 332 VRNPR-APSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEA 390
           ++NP   P+FY V L  + V G RI +   +F        G  +D+ TA+TRLP  AY+A
Sbjct: 315 LKNPAIPPTFYMVRLEAIAVAGQRIAVPPTVFAA------GAALDSRTAITRLPPTAYQA 368

Query: 391 FRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVD 450
            R AF  +      A      DTCY+++G  S  +P ++  F     + L  S  L    
Sbjct: 369 LRQAFRDRMAMYQPAPPKGPLDTCYDMAGVRSFALPRITLVFDKNAAVELDPSGVLF--- 425

Query: 451 DAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
             G   F   P+     IIGNIQ + +++ ++     VGF    C
Sbjct: 426 -QGCLAFTAGPNDQVPGIIGNIQLQTLEVLYNIPAALVGFRHAAC 469


>gi|238010910|gb|ACR36490.1| unknown [Zea mays]
 gi|413942664|gb|AFW75313.1| hypothetical protein ZEAMMB73_520329 [Zea mays]
          Length = 481

 Score =  181 bits (459), Expect = 7e-43,   Method: Compositional matrix adjust.
 Identities = 122/342 (35%), Positives = 174/342 (50%), Gaps = 34/342 (9%)

Query: 169 QYMVIDSGSDIVWVQCQPCS--QCYKQSDPVFDPADSASFSGVSCSSAVCDRL--ENAGC 224
           Q +V+DS SD+ WVQC PC    C+ Q D  +DP+ S S +  SCSS  C  L     GC
Sbjct: 159 QTVVLDSASDVPWVQCVPCPIPPCHPQVDSFYDPSRSPSSAPFSCSSPTCTALGPYANGC 218

Query: 225 HAGRCRYEVSYGDGSYTKGTLALETLTIGR-TVVKNVAIGCGHKNQGMF-VGAAGLLGLG 282
              +C+Y V Y DGS T G    + LT+     V     GC H  QG F   AAG++ LG
Sbjct: 219 ANNQCQYLVRYPDGSSTSGAYIADLLTLDAGNAVSGFKFGCSHAEQGSFDARAAGIMALG 278

Query: 283 GGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAA----WVPLVRNPRAP 338
           GG  SL+ Q   + G AFSYC+ +  +  SG    G   +P  A+      P+VR  +A 
Sbjct: 279 GGPESLLSQTASRYGNAFSYCIPATAS-DSGFFTLG---VPRRASSRYVVTPMVRFRQAA 334

Query: 339 SFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQ 398
           +FY V L  + VGG R+ ++  +F        G V+D+ TA+TRLP  AY+A R AF + 
Sbjct: 335 TFYGVLLRTITVGGQRLGVAPAVFAA------GSVLDSRTAITRLPPTAYQALRSAFRSS 388

Query: 399 TGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTF--- 455
                 A      DTCY+ +G V++R+P +S  F           N ++P+D +G     
Sbjct: 389 MTMYRSAPPKGYLDTCYDFTGVVNIRLPKISLVFD---------RNAVLPLDPSGILFND 439

Query: 456 CFAFAPSPSGL--SIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           C AF  +       ++G++QQ+ I++ +D   G VGF    C
Sbjct: 440 CLAFTSNADDRMPGVLGSVQQQTIEVLYDVGGGAVGFRQGAC 481


>gi|414886964|tpg|DAA62978.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
          Length = 452

 Score =  181 bits (459), Expect = 7e-43,   Method: Compositional matrix adjust.
 Identities = 135/412 (32%), Positives = 207/412 (50%), Gaps = 47/412 (11%)

Query: 112 MQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYM 171
           ++RD+ R       L+           QD  T         +GEY + + +G+PP     
Sbjct: 57  LRRDMHRHNARKLALAASSGATVSAPTQDSPT---------AGEYLMALAIGTPPLPYQA 107

Query: 172 VIDSGSDIVWVQCQPC-SQCYKQSDPVFDPADSASFSGVSCSSA--VCDRL-------EN 221
           + D+GSD++W QC PC SQC++Q  P+++P+ S +F+ + C+S+  VC            
Sbjct: 108 IADTGSDLIWTQCAPCTSQCFRQPTPLYNPSSSTTFAVLPCNSSLSVCAAALAGTGTAPP 167

Query: 222 AGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTV-----VKNVAIGCGHKNQGMFVGAA 276
            GC    C Y V+YG G +T      ET T G T      V  +A GC   + G    +A
Sbjct: 168 PGC---ACTYNVTYGSG-WTSVFQGSETFTFGSTPAGHARVPGIAFGCSTASSGFNASSA 223

Query: 277 -GLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALPVGAAWV---PL 331
            GL+GLG G +SLV QLG      FSYCL   + T S+ +L+ G  A   G A V   P 
Sbjct: 224 SGLVGLGRGRLSLVSQLGVP---KFSYCLTPYQDTNSTSTLLLGPSASLNGTAGVSSTPF 280

Query: 332 VRNPR-AP--SFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAY 388
           V +P  AP  +FYY+ L+G+ +G   + I  D F L   G  G+++D+GT +T L   AY
Sbjct: 281 VASPSTAPMNTFYYLNLTGISLGTTALSIPPDAFSLNADGTGGLIIDSGTTITLLGNTAY 340

Query: 389 EAFRDAFVAQTGNLPRASGVSI--FDTCYNLSGFVSV--RVPTVSFYFSGGPVLTLPASN 444
           +  R A V+    LP   G +    D C+ L    S    +P+++ +F+G  ++ LPA +
Sbjct: 341 QQVRAAVVSLV-TLPTTDGSADTGLDLCFMLPSSTSAPPAMPSMTLHFNGADMV-LPADS 398

Query: 445 FLIPVDDAGTFCFAFAPSPSG-LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           +++  DD+G +C A      G ++I+GN QQ+ + I +D     + F P  C
Sbjct: 399 YMM-SDDSGLWCLAMQNQTDGEVNILGNYQQQNMHILYDIGQETLSFAPAKC 449


>gi|212722554|ref|NP_001131154.1| uncharacterized protein LOC100192462 precursor [Zea mays]
 gi|194690728|gb|ACF79448.1| unknown [Zea mays]
          Length = 431

 Score =  181 bits (459), Expect = 9e-43,   Method: Compositional matrix adjust.
 Identities = 119/360 (33%), Positives = 171/360 (47%), Gaps = 18/360 (5%)

Query: 151 QGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVS 210
           Q    Y VR G+G+P +   + +D+ +D  W  C PC  C   S   F PA S+S++ + 
Sbjct: 74  QTPPSYVVRAGLGTPVQQLLLALDTSADATWSHCAPCDTCPAGSR--FIPASSSSYASLP 131

Query: 211 CSSAVCDRLENAGCHAGR--------CRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAI 262
           C+S  C   E   C A +        C +   + D S+ + +L  +TL +G+  +   A 
Sbjct: 132 CASDWCPLFEGQPCPANQDASAPLPACAFSKPFADTSF-QASLGSDTLRLGKDAIAGYAF 190

Query: 263 GCGHKNQG--MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGR 319
           GC     G    +   GLLGLG G MSL+ Q G +  G FSYCL S R    SGSL  G 
Sbjct: 191 GCVGAVAGPTTNLPKQGLLGLGRGPMSLLSQTGSRYNGVFSYCLPSYRSYYFSGSLRLGA 250

Query: 320 EALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTA 379
              P    + PL+ NP  PS YYV ++GL VG   + +    F        G V+D+GT 
Sbjct: 251 AGQPRNVRYTPLLTNPHRPSLYYVNVTGLSVGRTWVKVPAGSFAFDPATGAGTVIDSGTV 310

Query: 380 VTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLT 439
           +TR   P Y A R+ F  Q       + +  FDTC+N     +   P V+ +  GG  LT
Sbjct: 311 ITRWTAPVYAALREEFRRQVAAPSGYTSLGAFDTCFNTDEVAAGGAPPVTLHMDGGVDLT 370

Query: 440 LPASNFLIPVDDAGTFCFAFAPSP----SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           LP  N LI        C A A +P    + ++++ N+QQ+ +++  D A   VGF    C
Sbjct: 371 LPMENTLIHSSATPLACLAMAEAPQNVNAVVNVVANLQQQNVRVVVDVAGSRVGFAREPC 430


>gi|194698750|gb|ACF83459.1| unknown [Zea mays]
 gi|194703964|gb|ACF86066.1| unknown [Zea mays]
 gi|219886221|gb|ACL53485.1| unknown [Zea mays]
 gi|219886359|gb|ACL53554.1| unknown [Zea mays]
 gi|223950085|gb|ACN29126.1| unknown [Zea mays]
 gi|414865218|tpg|DAA43775.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
          Length = 431

 Score =  181 bits (458), Expect = 9e-43,   Method: Compositional matrix adjust.
 Identities = 119/360 (33%), Positives = 171/360 (47%), Gaps = 18/360 (5%)

Query: 151 QGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVS 210
           Q    Y VR G+G+P +   + +D+ +D  W  C PC  C   S   F PA S+S++ + 
Sbjct: 74  QTPPSYVVRAGLGTPVQQLLLALDTSADATWSHCAPCDTCPAGSR--FIPASSSSYASLP 131

Query: 211 CSSAVCDRLENAGCHAGR--------CRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAI 262
           C+S  C   E   C A +        C +   + D S+ + +L  +TL +G+  +   A 
Sbjct: 132 CASDWCPLFEGQPCPANQDASAPLPACAFSKPFADTSF-QASLGSDTLRLGKDAIAGYAF 190

Query: 263 GCGHKNQG--MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGR 319
           GC     G    +   GLLGLG G MSL+ Q G +  G FSYCL S R    SGSL  G 
Sbjct: 191 GCVGAVAGPTTNLPKQGLLGLGRGPMSLLSQTGSRYNGVFSYCLPSYRSYYFSGSLRLGA 250

Query: 320 EALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTA 379
              P    + PL+ NP  PS YYV ++GL VG   + +    F        G V+D+GT 
Sbjct: 251 AGQPRNVRYTPLLTNPHRPSLYYVNVTGLSVGRTWVKVPAGSFAFDPATGAGTVIDSGTV 310

Query: 380 VTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLT 439
           +TR   P Y A R+ F  Q       + +  FDTC+N     +   P V+ +  GG  LT
Sbjct: 311 ITRWTAPVYAALREEFRRQVAAPSGYTSLGAFDTCFNTDEVAAGGAPPVTLHMDGGVDLT 370

Query: 440 LPASNFLIPVDDAGTFCFAFAPSP----SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           LP  N LI        C A A +P    + ++++ N+QQ+ +++  D A   VGF    C
Sbjct: 371 LPMENTLIHSSATPLACLAMAEAPQNVNAVVNVVANLQQQNVRVVVDVAGSRVGFAREPC 430


>gi|226508080|ref|NP_001150678.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
 gi|195641018|gb|ACG39977.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
          Length = 450

 Score =  181 bits (458), Expect = 9e-43,   Method: Compositional matrix adjust.
 Identities = 128/371 (34%), Positives = 196/371 (52%), Gaps = 38/371 (10%)

Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC-SQCYKQSDPVFDPADSASFSGVSC 211
           +GEY + + +G+PP     + D+GSD++W QC PC SQC++Q  P+++P+ S +F+ + C
Sbjct: 87  AGEYLMALAIGTPPLPYQAIADTGSDLIWTQCAPCTSQCFRQPTPLYNPSSSTTFAVLPC 146

Query: 212 SSA--VCDRL-------ENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTV-----V 257
           +S+  VC             GC    C Y V+YG G +T      ET T G T      V
Sbjct: 147 NSSLSVCAAALAGTGTAPPPGC---ACTYNVTYGSG-WTSVFQGSETFTFGSTPAGQSRV 202

Query: 258 KNVAIGCGHKNQGMFVGAA-GLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSL 315
             +A GC   + G    +A GL+GLG G +SLV QLG      FSYCL   + T S+ +L
Sbjct: 203 PGIAFGCSTASSGFNASSASGLVGLGRGRLSLVSQLGVP---KFSYCLTPYQDTNSTSTL 259

Query: 316 VFGREALPVGAAWV---PLVRNPR-AP--SFYYVGLSGLGVGGMRIPISEDLFRLTQMGD 369
           + G  A   G A V   P V +P  AP  +FYY+ L+G+ +G   + I  D F L   G 
Sbjct: 260 LLGPSASLNGTAGVSSTPFVASPSTAPMNTFYYLNLTGISLGTTALSIPPDAFLLNADGT 319

Query: 370 DGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSI--FDTCYNLSGFVSV--RV 425
            G+++D+GT +T L   AY+  R A V+    LP   G +    D C+ L    S    +
Sbjct: 320 GGLIIDSGTTITLLGNTAYQQVRAAVVSLV-TLPTTDGSAATGLDLCFMLPSSTSAPPAM 378

Query: 426 PTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSG-LSIIGNIQQEGIQISFDGA 484
           P+++ +F+G  ++ LPA ++++  DD+G +C A      G ++I+GN QQ+ + I +D  
Sbjct: 379 PSMTLHFNGADMV-LPADSYMM-SDDSGLWCLAMQNQTDGEVNILGNYQQQNMHILYDIG 436

Query: 485 NGFVGFGPNVC 495
              + F P  C
Sbjct: 437 QETLSFAPAKC 447


>gi|362799904|dbj|BAL41445.1| aspartyl protease 1 [Linum grandiflorum]
          Length = 449

 Score =  181 bits (458), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 139/447 (31%), Positives = 214/447 (47%), Gaps = 61/447 (13%)

Query: 81  LELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQD 140
           L+L+HRD   S  +T N     R Q SF   + R  + V                    D
Sbjct: 29  LDLIHRDSPLSPLHTPNLTFSDRLQASFLRAISRQSRHV--------------------D 68

Query: 141 FGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDP 200
           F TD++       GEY + + +G+PP     + D+GSD+ W+Q +PC QCY Q  P+FDP
Sbjct: 69  FQTDLLPS----GGEYMMNLSIGTPPFPILAIADTGSDLTWLQSKPCDQCYPQKGPIFDP 124

Query: 201 ADSASFSGVSCSSAVCDRLENAG---CHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVV 257
           ++S +F  + C++A C+ L+ +         C Y  SYGD SYT G LA +T+T+G   V
Sbjct: 125 SNSTTFHKLPCTTAPCNALDESARSCTDPTTCGYTYSYGDHSYTTGYLASDTVTVGNASV 184

Query: 258 --KNVAIGCGHKNQGMF-VGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLV--------- 305
             +NVA GCG +N G F    +G++GLGGG++S V QLG   G  FSYCL+         
Sbjct: 185 QIRNVAFGCGTRNGGNFDEQGSGIVGLGGGNLSFVSQLGDTIGKKFSYCLLPLENEISSQ 244

Query: 306 SRGTGSSGSLVFGREAL-------PVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRI--- 355
              + ++  +VFG   +        V  A  PLV N    ++YY+ +  + VG  ++   
Sbjct: 245 PSDSPATSRIVFGDNPVFSSSSTNGVVFATTPLV-NKEPSTYYYLTIEAITVGRKKLLYS 303

Query: 356 -----PISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGV-- 408
                  S D    + + +  +++D+GT +T L    Y A   A V +   + R + V  
Sbjct: 304 SSSSKTASYDSGSKSSVEEGNIIIDSGTTLTFLEEEFYGALEAALVEEI-KMERVNDVKN 362

Query: 409 SIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSI 468
           S+F  C+  SG   V +P +  +F GG  + L   N  +  ++ G  CF   P+ + + I
Sbjct: 363 SMFSLCFK-SGKEEVELPLMKVHFRGGADVELKPVNTFVRAEE-GLVCFTMLPT-NDVGI 419

Query: 469 IGNIQQEGIQISFDGANGFVGFGPNVC 495
            GN+ Q    + +D     V F P  C
Sbjct: 420 YGNLAQMNFVVGYDLGKRTVSFLPADC 446


>gi|115479489|ref|NP_001063338.1| Os09g0452800 [Oryza sativa Japonica Group]
 gi|51535938|dbj|BAD38020.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
 gi|113631571|dbj|BAF25252.1| Os09g0452800 [Oryza sativa Japonica Group]
 gi|125605918|gb|EAZ44954.1| hypothetical protein OsJ_29597 [Oryza sativa Japonica Group]
 gi|215740967|dbj|BAG97462.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 453

 Score =  181 bits (458), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 131/410 (31%), Positives = 192/410 (46%), Gaps = 35/410 (8%)

Query: 112 MQRDVKRVATLVRRLSGGG----ADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPR 167
           MQR   R A L    +GGG       A+   ++ G  V +    G  EY + + VG+PP+
Sbjct: 53  MQRSKARAAALSVVRNGGGFYGSIAQAREREREPGMAVRA---SGDLEYVLDLAVGTPPQ 109

Query: 168 SQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVC-DRLENAGCHA 226
               ++D+GSD++W QC  C+ C +Q DP+F P  S+S+  + C+  +C D L ++    
Sbjct: 110 PITALLDTGSDLIWTQCDTCTACLRQPDPLFSPRMSSSYEPMRCAGQLCGDILHHSCVRP 169

Query: 227 GRCRYEVSYGDGSYTKGTLALETLTI----GRTVVKNVAIGCGHKNQGMFVGAAGLLGLG 282
             C Y  SYGDG+ T G  A E  T     G T    +  GCG  N G    A+G++G G
Sbjct: 170 DTCTYRYSYGDGTTTLGYYATERFTFASSSGETQSVPLGFGCGTMNVGSLNNASGIVGFG 229

Query: 283 GGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVG--------AAWVPLVRN 334
              +SLV QL  +    FSYCL    +    +L FG  A  VG            P++++
Sbjct: 230 RDPLSLVSQLSIRR---FSYCLTPYASSRKSTLQFGSLA-DVGLYDDATGPVQTTPILQS 285

Query: 335 PRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDA 394
            + P+FYYV  +G+ VG  R+ I    F L   G  GV++D+GTA+T  P         A
Sbjct: 286 AQNPTFYYVAFTGVTVGARRLRIPASAFALRPDGSGGVIIDSGTALTLFPVAVLAEVVRA 345

Query: 395 FVAQTGNLPRASGVSIFD-TCYNLSGFV--------SVRVPTVSFYFSGGPVLTLPASNF 445
           F +Q   LP A+G S  D  C+               V VP + F+F G   L LP  N+
Sbjct: 346 FRSQL-RLPFANGSSPDDGVCFAAPAVAAGGGRMARQVAVPRMVFHFQGAD-LDLPRENY 403

Query: 446 LIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           ++     G  C     S    + IGN  Q+ +++ +D     + F P  C
Sbjct: 404 VLEDHRRGHLCVLLGDSGDDGATIGNFVQQDMRVVYDLERETLSFAPVEC 453


>gi|125563959|gb|EAZ09339.1| hypothetical protein OsI_31611 [Oryza sativa Indica Group]
          Length = 453

 Score =  180 bits (457), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 131/410 (31%), Positives = 192/410 (46%), Gaps = 35/410 (8%)

Query: 112 MQRDVKRVATLVRRLSGGG----ADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPR 167
           MQR   R A L    +GGG       A+   ++ G  V +    G  EY + + VG+PP+
Sbjct: 53  MQRSKARAAALSVVRNGGGFYGSIAQAREREREPGMAVRA---SGDLEYVLDLAVGTPPQ 109

Query: 168 SQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVC-DRLENAGCHA 226
               ++D+GSD++W QC  C+ C +Q DP+F P  S+S+  + C+  +C D L ++    
Sbjct: 110 PITALLDTGSDLIWTQCDTCTACLRQPDPLFSPRMSSSYEPMRCAGQLCGDILHHSCVRP 169

Query: 227 GRCRYEVSYGDGSYTKGTLALETLTI----GRTVVKNVAIGCGHKNQGMFVGAAGLLGLG 282
             C Y  SYGDG+ T G  A E  T     G T    +  GCG  N G    A+G++G G
Sbjct: 170 DTCTYRYSYGDGTTTLGYYATERFTFASSSGETQSVPLGFGCGTMNVGSLNNASGIVGFG 229

Query: 283 GGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVG--------AAWVPLVRN 334
              +SLV QL  +    FSYCL    +    +L FG  A  VG            P++++
Sbjct: 230 RDPLSLVSQLSIRR---FSYCLTPYASSRKSTLQFGSLA-DVGLYDDATGPVQTTPILQS 285

Query: 335 PRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDA 394
            + P+FYYV  +G+ VG  R+ I    F L   G  GV++D+GTA+T  P         A
Sbjct: 286 AQNPTFYYVAFTGVTVGARRLRIPASAFALRPDGSGGVIIDSGTALTLFPAAVLAEVVRA 345

Query: 395 FVAQTGNLPRASGVSIFD-TCYNLSGFV--------SVRVPTVSFYFSGGPVLTLPASNF 445
           F +Q   LP A+G S  D  C+               V VP + F+F G   L LP  N+
Sbjct: 346 FRSQL-RLPFANGSSPDDGVCFAAPAVAAGGGRMARQVAVPRMVFHFQGAD-LDLPRENY 403

Query: 446 LIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           ++     G  C     S    + IGN  Q+ +++ +D     + F P  C
Sbjct: 404 VLEDHRRGHLCVLLGDSGDDGATIGNFVQQDMRVVYDLERETLSFAPVEC 453


>gi|15222357|ref|NP_174430.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|12322538|gb|AAG51267.1|AC027135_8 chloroplast nucleoid DNA binding protein, putative [Arabidopsis
           thaliana]
 gi|67633408|gb|AAY78629.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332193236|gb|AEE31357.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 445

 Score =  180 bits (457), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 130/445 (29%), Positives = 211/445 (47%), Gaps = 50/445 (11%)

Query: 69  SSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSG 128
           +SN+S++     +EL+HRD   S         Y+ H H+   R+     R  +  RR + 
Sbjct: 19  ASNSSANRENLTVELIHRDSPHSP-------LYNPH-HTVSDRLNAAFLRSISRSRRFTT 70

Query: 129 GGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS 188
                         TD+ SG+    GEYF+ I +G+PP   + + D+GSD+ WVQC+PC 
Sbjct: 71  K-------------TDLQSGLISNGGEYFMSISIGTPPSKVFAIADTGSDLTWVQCKPCQ 117

Query: 189 QCYKQSDPVFDPADSASFSGVSCSSAVCDRL--ENAGCHAGR--CRYEVSYGDGSYTKGT 244
           QCYKQ+ P+FD   S+++   SC S  C  L     GC   +  C+Y  SYGD S+TKG 
Sbjct: 118 QCYKQNSPLFDKKKSSTYKTESCDSKTCQALSEHEEGCDESKDICKYRYSYGDNSFTKGD 177

Query: 245 LALETLTIGRTVVKN-----VAIGCGHKNQGMFVGAAGLLGLGGGS-MSLVGQLGGQTGG 298
           +A ET++I  +   +        GCG+ N G F      +   GG  +SLV QLG   G 
Sbjct: 178 VATETISIDSSSGSSVSFPGTVFGCGYNNGGTFEETGSGIIGLGGGPLSLVSQLGSSIGK 237

Query: 299 AFSYCLVSRGTGSSGSLV--FGREALPVGAA------WVPLV-RNPRAPSFYYVGLSGLG 349
            FSYCL      ++G+ V   G  ++P   +        PL+ ++P   ++Y++ L  + 
Sbjct: 238 KFSYCLSHTAATTNGTSVINLGTNSIPSNPSKDSATLTTPLIQKDPE--TYYFLTLEAVT 295

Query: 350 VGGMRIPISEDLFRLTQMGDD---GVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRAS 406
           VG  ++P +   + L          +++D+GT +T L +  Y+ F  A         R S
Sbjct: 296 VGKTKLPYTGGGYGLNGKSSKRTGNIIIDSGTTLTLLDSGFYDDFGTAVEESVTGAKRVS 355

Query: 407 GVS-IFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSG 465
               +   C+  SG   + +P ++ +F+   V   P + F+   +D  T C +  P+ + 
Sbjct: 356 DPQGLLTHCFK-SGDKEIGLPAITMHFTNADVKLSPINAFVKLNED--TVCLSMIPT-TE 411

Query: 466 LSIIGNIQQEGIQISFDGANGFVGF 490
           ++I GN+ Q    + +D     V F
Sbjct: 412 VAIYGNMVQMDFLVGYDLETKTVSF 436


>gi|356508308|ref|XP_003522900.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 439

 Score =  180 bits (457), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 123/353 (34%), Positives = 173/353 (49%), Gaps = 20/353 (5%)

Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
           S  Y VR  +GSPP++  + +D+ +D  W+ C  C  C   +  +F P  S +F  VSC 
Sbjct: 95  SPTYIVRAKIGSPPQTLLLAMDTSNDAAWIPCTACDGC---TSTLFAPEKSTTFKNVSCG 151

Query: 213 SAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMF 272
           S  C+++ N  C    C + ++YG  S     +  +T+T+    + +   GC  K  G  
Sbjct: 152 SPQCNQVPNPSCGTSACTFNLTYGSSSIAANVVQ-DTVTLATDPIPDYTFGCVAKTTGAS 210

Query: 273 VGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALPVGAAWVPL 331
               GLLGLG G +SL+ Q        FSYCL S +    SGSL  G  A P+   + PL
Sbjct: 211 APPQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPSFKSLNFSGSLRLGPVAQPIRIKYTPL 270

Query: 332 VRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAF 391
           ++NPR  S YYV L  + VG   + I  +          G V D+GT  TRL  PAY A 
Sbjct: 271 LKNPRRSSLYYVNLVAIRVGRKVVDIPPEALAFNAATGAGTVFDSGTVFTRLVAPAYTAV 330

Query: 392 RDAF-----VAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFL 446
           RD F     +A   NL   + +  FDTCY     V +  PT++F FSG  V TLP  N L
Sbjct: 331 RDEFQRRVAIAAKANL-TVTSLGGFDTCYT----VPIVAPTITFMFSGMNV-TLPEDNIL 384

Query: 447 IPVDDAGTFCFAFAPSP----SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           I      T C A A +P    S L++I N+QQ+  ++ +D  N  +G    +C
Sbjct: 385 IHSTAGSTTCLAMASAPDNVNSVLNVIANMQQQNHRVLYDVPNSRLGVARELC 437


>gi|226493786|ref|NP_001142400.1| uncharacterized protein LOC100274575 [Zea mays]
 gi|194708650|gb|ACF88409.1| unknown [Zea mays]
          Length = 392

 Score =  180 bits (456), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 128/371 (34%), Positives = 196/371 (52%), Gaps = 38/371 (10%)

Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC-SQCYKQSDPVFDPADSASFSGVSC 211
           +GEY + + +G+PP     + D+GSD++W QC PC SQC++Q  P+++P+ S +F+ + C
Sbjct: 29  AGEYLMALAIGTPPLPYQAIADTGSDLIWTQCAPCTSQCFRQPTPLYNPSSSTTFAVLPC 88

Query: 212 SSA--VCDRL-------ENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTV-----V 257
           +S+  VC             GC    C Y V+YG G +T      ET T G T      V
Sbjct: 89  NSSLSVCAAALAGTGTAPPPGC---ACTYNVTYGSG-WTSVFQGSETFTFGSTPAGHARV 144

Query: 258 KNVAIGCGHKNQGMFVGAA-GLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSL 315
             +A GC   + G    +A GL+GLG G +SLV QLG      FSYCL   + T S+ +L
Sbjct: 145 PGIAFGCSTASSGFNASSASGLVGLGRGRLSLVSQLGVP---KFSYCLTPYQDTNSTSTL 201

Query: 316 VFGREALPVGAAWV---PLVRNPR-AP--SFYYVGLSGLGVGGMRIPISEDLFRLTQMGD 369
           + G  A   G A V   P V +P  AP  +FYY+ L+G+ +G   + I  D F L   G 
Sbjct: 202 LLGPSASLNGTAGVSSTPFVASPSTAPMNTFYYLNLTGISLGTTALSIPPDAFSLNADGT 261

Query: 370 DGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSI--FDTCYNLSGFVSV--RV 425
            G+++D+GT +T L   AY+  R A V+    LP   G +    D C+ L    S    +
Sbjct: 262 GGLIIDSGTTITLLGNTAYQQVRAAVVSLV-TLPTTDGSADTGLDLCFMLPSSTSAPPAM 320

Query: 426 PTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSG-LSIIGNIQQEGIQISFDGA 484
           P+++ +F+G  ++ LPA ++++  DD+G +C A      G ++I+GN QQ+ + I +D  
Sbjct: 321 PSMTLHFNGADMV-LPADSYMM-SDDSGLWCLAMQNQTDGEVNILGNYQQQNMHILYDIG 378

Query: 485 NGFVGFGPNVC 495
              + F P  C
Sbjct: 379 QETLSFAPAKC 389


>gi|195627138|gb|ACG35399.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
          Length = 431

 Score =  180 bits (456), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 119/360 (33%), Positives = 170/360 (47%), Gaps = 18/360 (5%)

Query: 151 QGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVS 210
           Q    Y VR G+G+P +   + +D+ +D  W  C PC  C   S   F PA S+S++ + 
Sbjct: 74  QTPPSYVVRAGLGTPVQQLLLALDTSADATWSHCAPCDTCPAGSR--FIPASSSSYASLP 131

Query: 211 CSSAVCDRLENAGCHAGR--------CRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAI 262
           C+S  C   E   C A +        C +   + D S+ + +L  +TL +G+  +   A 
Sbjct: 132 CASDWCPLFEGQPCPANQDASAPLPACAFSKPFADTSF-QASLGSDTLRLGKDAIAGYAF 190

Query: 263 GCGHKNQG--MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGR 319
           GC     G    +   GLLGLG G MSL+ Q G    G FSYCL S R    SGSL  G 
Sbjct: 191 GCVGAVAGPTTNLPKQGLLGLGRGPMSLLSQTGSTYNGVFSYCLPSYRSYYFSGSLRLGA 250

Query: 320 EALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTA 379
              P    + PL+ NP  PS YYV ++GL VG   + +    F        G V+D+GT 
Sbjct: 251 AGQPRNVRYTPLLTNPHRPSLYYVNVTGLSVGRTWVKVPAGSFAFDPATGAGTVIDSGTV 310

Query: 380 VTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLT 439
           +TR   P Y A R+ F  Q       + +  FDTC+N     +   P V+ +  GG  LT
Sbjct: 311 ITRWTAPVYAALREEFRRQVAAPSGYTSLGAFDTCFNTDEVAAGGAPPVTLHMDGGVDLT 370

Query: 440 LPASNFLIPVDDAGTFCFAFAPSP----SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           LP  N LI        C A A +P    + ++++ N+QQ+ +++  D A   VGF    C
Sbjct: 371 LPMENTLIHSSATPLACLAMAEAPQNVNAVVNVVANLQQQNVRVVVDVAGSRVGFAREPC 430


>gi|326524580|dbj|BAK00673.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 461

 Score =  180 bits (456), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 129/410 (31%), Positives = 188/410 (45%), Gaps = 31/410 (7%)

Query: 112 MQRDVKRVATLVRRLSGGGADAAKHEVQ--DFGTDVVSGMDQGSGEYFVRIGVGSPPRSQ 169
           MQR   R A L    SG G    K   Q        V     G  EY + + +G+PP+  
Sbjct: 57  MQRSKARAAALSVARSGSGRVPGKSAQQGEQHQQPGVPVRPSGDLEYLIDLAIGTPPQPV 116

Query: 170 YMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCH-AGR 228
             ++D+GSD++W QC PC+ C  Q DP+F PA S+S+  + CS  +C+ + +  C     
Sbjct: 117 SALLDTGSDLIWTQCAPCASCLAQPDPLFAPAASSSYVPMRCSGQLCNDILHHSCQRPDT 176

Query: 229 CRYEVSYGDGSYTKGTLALETLTI----GRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGG 284
           C Y  +YGDG+ T G  A E  T     G  +   +  GCG  N G     +G++G G  
Sbjct: 177 CTYRYNYGDGTTTLGVYATERFTFASSSGEKLSVPLGFGCGTMNVGSLNNGSGIVGFGRD 236

Query: 285 SMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGR---------EALPVGAAWVPLVRNP 335
            +SLV QL  +    FSYCL    +    +L+FG          +A         L+++ 
Sbjct: 237 PLSLVSQLSIRR---FSYCLTPYTSTRKSTLMFGSLSDGVFEGDDAATGQVQTTRLLQSR 293

Query: 336 RAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAF 395
           + P+FYYV  +G+ VG  R+ I    F L   G  GV++D+GTA+T  P         AF
Sbjct: 294 QNPTFYYVPFTGVTVGTRRLRIPLSAFALRPDGSGGVIVDSGTALTLFPAAVLTEVLRAF 353

Query: 396 VAQTGNLPRASGVSIFD-TCY---------NLSGFVSVRVPTVSFYFSGGPVLTLPASNF 445
            AQ   LP  S  S  D  C+           S    V VP ++F+F G   L LP  N+
Sbjct: 354 RAQL-RLPFTSSSSPDDGVCFATPMAAGGRRASAATVVSVPRMAFHFQGAD-LELPRRNY 411

Query: 446 LIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           ++     G+ C   A S    + IGN  Q+ +++ +D     + F P  C
Sbjct: 412 VLDDPRRGSLCILLADSGDSGATIGNFVQQDMRVLYDLEAETLSFAPAQC 461


>gi|242046812|ref|XP_002461152.1| hypothetical protein SORBIDRAFT_02g041760 [Sorghum bicolor]
 gi|241924529|gb|EER97673.1| hypothetical protein SORBIDRAFT_02g041760 [Sorghum bicolor]
          Length = 452

 Score =  180 bits (456), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 124/347 (35%), Positives = 170/347 (48%), Gaps = 14/347 (4%)

Query: 156 YFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAV 215
           Y VR  +G+PP+   + +D+ +D  W+ C  C+ C   S P FDPA S S+  V C S +
Sbjct: 110 YVVRARLGTPPQQLLLAVDTSNDAAWIPCAGCAGCPTSSAPPFDPAASTSYRSVPCGSPL 169

Query: 216 CDRLENAGCHAG--RCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFV 273
           C +  NA C  G   C + ++Y D S  +  L+ ++L +    VK    GC  K  G   
Sbjct: 170 CAQAPNAACPPGGKACGFSLTYADSSL-QAALSQDSLAVAGDAVKTYTFGCLQKATGTAA 228

Query: 274 GAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALPVGAAWVPLV 332
              GLLGLG G +S + Q      G FSYCL S +    SG+L  GR   P      PL+
Sbjct: 229 PPQGLLGLGRGPLSFLSQTRDMYQGTFSYCLPSFKSLNFSGTLRLGRNGQPPRIKTTPLL 288

Query: 333 RNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFR 392
            NP   S YYV ++G+ VG   +PI             G V+D+GT  TRL  PAY A R
Sbjct: 289 ANPHRSSLYYVNMTGIRVGRKVVPIPPPALAFDPATGAGTVLDSGTMFTRLVAPAYVAVR 348

Query: 393 DAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDA 452
           D    + G     S +  FDTC+N +   +V  P V+  F G  V TLP  N +I     
Sbjct: 349 DEVRRRVGA--PVSSLGGFDTCFNTT---AVAWPPVTLLFDGMQV-TLPEENVVIHSTYG 402

Query: 453 GTFCFAFAPSPSG----LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
              C A A +P G    L++I ++QQ+  ++ FD  NG VGF    C
Sbjct: 403 TISCLAMAAAPDGVNTVLNVIASMQQQNHRVLFDVPNGRVGFARERC 449


>gi|356557010|ref|XP_003546811.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 437

 Score =  179 bits (455), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 124/363 (34%), Positives = 179/363 (49%), Gaps = 37/363 (10%)

Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
           +GEY + + +G+PP  +  + D+GSD++WVQC PC  C+ Q  P+F+P  S++F   +C 
Sbjct: 89  NGEYLMTLYIGTPPVERLAIADTGSDLIWVQCSPCQNCFPQDTPLFEPLKSSTFKAATCD 148

Query: 213 SAVCDRLENAGCHAGR---CRYEVSYGDGSYTKGTLALETLTIGRT------VVKNVAIG 263
           S  C  +  +    G+   C Y  SYGD S+T G +  ETL+ G T         +   G
Sbjct: 149 SQPCTSVPPSQRQCGKVGQCIYSYSYGDKSFTVGVVGTETLSFGSTGDAQTVSFPSSIFG 208

Query: 264 CGHKNQGMFVGA---AGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGRE 320
           CG  N   F  +    GL+GLGGG +SLV QLG Q G  FSYCL+   + S+  L FG E
Sbjct: 209 CGVYNNFTFHTSDKVTGLVGLGGGPLSLVSQLGPQIGYKFSYCLLPFSSNSTSKLKFGSE 268

Query: 321 ALPV--GAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGT 378
           A+    G    PL+  P  PSFY++ L  + +G   +P        T   D  +++D+GT
Sbjct: 269 AIVTTNGVVSTPLIIKPLFPSFYFLNLEAVTIGQKVVP--------TGRTDGNIIIDSGT 320

Query: 379 AVTRLPTPAYEAFRDAF-----VAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFS 433
            +T L    Y  F  +      V    +LP       F  C+    +  + +P ++F F+
Sbjct: 321 VLTYLEQTFYNNFVASLQEVLSVESAQDLPFP-----FKFCF---PYRDMTIPVIAFQFT 372

Query: 434 GGPVLTLPASNFLIPVDDAGTFCFAFAPSP-SGLSIIGNIQQEGIQISFDGANGFVGFGP 492
           G  V   P  N LI + D    C A  PS  SG+SI GN+ Q   Q+ +D     V F P
Sbjct: 373 GASVALQP-KNLLIKLQDRNMLCLAVVPSSLSGISIFGNVAQFDFQVVYDLEGKKVSFAP 431

Query: 493 NVC 495
             C
Sbjct: 432 TDC 434


>gi|50508275|dbj|BAD32124.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
          Length = 451

 Score =  179 bits (455), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 126/371 (33%), Positives = 192/371 (51%), Gaps = 29/371 (7%)

Query: 149 MDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSG 208
           +D  +G Y + + +G+PP +  ++ D+GS ++W QC PC++C  +  P F PA S++FS 
Sbjct: 83  LDNSAGAYNMNLSIGTPPVTFSVLADTGSSLIWTQCAPCTECAARPAPPFQPASSSTFSK 142

Query: 209 VSCSSAVCDRLENA--GCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGH 266
           + C+S++C  L +    C+A  C Y   YG G +T G LA ETL +G      VA GC  
Sbjct: 143 LPCASSLCQFLTSPYLTCNATGCVYYYPYGMG-FTAGYLATETLHVGGASFPGVAFGCST 201

Query: 267 KNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVG- 325
           +N G+   ++G++GLG   +SLV Q+G    G FSYCL S        ++FG  A   G 
Sbjct: 202 EN-GVGNSSSGIVGLGRSPLSLVSQVG---VGRFSYCLRSDADAGDSPILFGSLAKVTGG 257

Query: 326 -AAWVPLVRNPRAP--SFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVV----MDTGT 378
                PL+ NP  P  S+YYV L+G+ VG   +P++   F  T+    G+V    +D+GT
Sbjct: 258 NVQSTPLLENPEMPSSSYYYVNLTGITVGATDLPVTSTTFGFTRGAGAGLVGGTIVDSGT 317

Query: 379 AVTRLPTPAYEAFRDAFVAQ--TGNL-PRASGVSI-FDTCYNLS---GFVSVRVPTVSFY 431
            +T L    Y   + AF++Q  T NL    +G    FD C++ +   G   V VPT+   
Sbjct: 318 TLTYLVKEGYAMVKRAFLSQMATANLTTTVNGTRFGFDLCFDATAAGGGSGVPVPTLVLR 377

Query: 432 FSGGPVLTLPASNF--LIPVDD---AGTFCFAFAPSPSGL--SIIGNIQQEGIQISFDGA 484
           F+GG    +   ++  ++ VD    A   C    P+   L  SIIGN+ Q  + + +D  
Sbjct: 378 FAGGAEYAVRRRSYVGVVAVDSQGRAAVECLLVLPASEKLSISIIGNVMQMDLHVLYDLD 437

Query: 485 NGFVGFGPNVC 495
            G   F P  C
Sbjct: 438 GGMFSFAPADC 448


>gi|218192703|gb|EEC75130.1| hypothetical protein OsI_11317 [Oryza sativa Indica Group]
          Length = 440

 Score =  179 bits (454), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 122/385 (31%), Positives = 184/385 (47%), Gaps = 21/385 (5%)

Query: 122 LVRRLSGGGADAAKHEVQDFGTDVVS--GMDQG--SGEYFVRIGVGSPPRSQYMVIDSGS 177
           L+RR++      A   +    T  VS    D G    EY + + +G+PP+   + +D+GS
Sbjct: 53  LMRRMALRSKARAPRLLSSSATAPVSPGAYDDGVPMTEYLLHLAIGTPPQPVQLTLDTGS 112

Query: 178 DIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGR----CRYEV 233
           D+VW QCQPC+ C+ QS P +D + S++F+  SC S  C    +      +    C +  
Sbjct: 113 DLVWTQCQPCAVCFNQSLPYYDASRSSTFALPSCDSTQCKLDPSVTMCVNQTVQTCAFSY 172

Query: 234 SYGDGSYTKGTLALETLT-IGRTVVKNVAIGCGHKNQGMF-VGAAGLLGLGGGSMSLVGQ 291
           SYGD S T G L +ET++ +    V  V  GCG  N G+F     G+ G G G +SL  Q
Sbjct: 173 SYGDKSATIGFLDVETVSFVAGASVPGVVFGCGLNNTGIFRSNETGIAGFGRGPLSLPSQ 232

Query: 292 LGGQTGGAFSYCLVSRGTGSSGSLVFGREA--LPVGAAWV---PLVRNPRAPSFYYVGLS 346
           L     G FS+C  +       +++F   A     G   V   PL++NP  P+FYY+ L 
Sbjct: 233 L---KVGNFSHCFTAVSGRKPSTVLFDLPADLYKNGRGTVQTTPLIKNPAHPTFYYLSLK 289

Query: 347 GLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRAS 406
           G+ VG  R+P+ E  F L + G  G ++D+GTA T LP   Y    D F A        S
Sbjct: 290 GITVGSTRLPVPESAFAL-KNGTGGTIIDSGTAFTSLPPRVYRLVHDEFAAHVKLPVVPS 348

Query: 407 GVSIFDTCYNLSGF-VSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSG 465
             +    C++      +  VP +  +F G   + LP  N++    D G      A     
Sbjct: 349 NETGPLLCFSAPPLGKAPHVPKLVLHFEGA-TMHLPRENYVFEAKDGGNCSICLAIIEGE 407

Query: 466 LSIIGNIQQEGIQISFDGANGFVGF 490
           ++IIGN QQ+ + + +D  N  + F
Sbjct: 408 MTIIGNFQQQNMHVLYDLKNSKLSF 432


>gi|357116170|ref|XP_003559856.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 460

 Score =  179 bits (453), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 145/431 (33%), Positives = 198/431 (45%), Gaps = 45/431 (10%)

Query: 94  NTTNNMHYHRHQHSF-HARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQ- 151
           N+ N+        SF  A+  RD  RV  L    SG G           G  + SG    
Sbjct: 41  NSNNDAAPSSSWTSFIAAQTSRDTSRVLYLSSLASGFG-----------GAPLASGRQLL 89

Query: 152 GSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSC 211
            +  Y VR  +G+PP+   + +D+ +D  WV C  C  C   + P F+PA SA+F  V C
Sbjct: 90  HTPTYLVRASLGTPPQRLLLAVDTSNDAAWVPCAGCHGC-PTTAPSFNPASSATFRPVPC 148

Query: 212 SSAVCDRLENAGCHA-----GRCRYEVSYGDGSYTKGTLALETLTIGRT--VVKNVAIGC 264
            +  C +  N  C +       C + +SYGD S    TL+ + L +     V+K    GC
Sbjct: 149 GAPPCSQAPNPSCTSLAKSKNSCGFSLSYGDSSL-DATLSQDNLAVTANGGVIKGYTFGC 207

Query: 265 GHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS---RGTGSSGSLVFGREA 321
             K+ G    A GLLGLG G +  V Q  G   G FSYCL S        SGSL  GR+ 
Sbjct: 208 LTKSNGSAAPAQGLLGLGRGPLGFVAQTKGIYEGTFSYCLPSYYRSAANFSGSLTLGRKG 267

Query: 322 LPVGAAW--VPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTA 379
            P        PL+ +P  PS YYV ++G+ +G   +PI             G V+D+GT 
Sbjct: 268 QPAPEKMKTTPLLASPHRPSLYYVAMTGVRIGKKSVPIPPSALAFDAATGAGTVLDSGTM 327

Query: 380 VTRLPTPAYEAFRDAFVAQ-TGNL----PRASGVSI-----FDTCYNLSGFVSVRVPTVS 429
             RL  PAY A RD    +  G+L       + VS+     FDTCYN+S   +V  P V+
Sbjct: 328 FARLAQPAYAAVRDEVRRRVAGSLRRRGGGGASVSVSSLGGFDTCYNVS---TVAWPAVT 384

Query: 430 FYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSP-----SGLSIIGNIQQEGIQISFDGA 484
             F GG  + LP  N +I      T C A A SP     + L++IG++QQ+  ++ FD  
Sbjct: 385 LVFGGGMEVRLPEENVVIRSTYGSTSCLAMAASPADGVNAALNVIGSLQQQNHRVLFDVP 444

Query: 485 NGFVGFGPNVC 495
           N  VGF    C
Sbjct: 445 NARVGFARERC 455


>gi|357131735|ref|XP_003567490.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 458

 Score =  179 bits (453), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 126/361 (34%), Positives = 175/361 (48%), Gaps = 24/361 (6%)

Query: 155 EYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYK-QSDPVFDPADSASFSGVSCSS 213
            Y  R  +G+PP++  + ID  +D  WV C  C  C    S P FDP  S+++  V C +
Sbjct: 99  SYVARARLGTPPQTLLVAIDPSNDAAWVPCSACLGCAPGASSPSFDPTQSSTYRPVRCGA 158

Query: 214 AVCDRLENA--GCHAG---RCRYEVSYGDGSYTKGTLALETLTI----GRTVVKN-VAIG 263
             C ++  A   C AG    C + +SY   S     L  + L++    G  V  +    G
Sbjct: 159 PQCAQVPPATPSCPAGPGASCAFNLSYAS-STLHAVLGQDALSLSDSNGAAVPDDHYTFG 217

Query: 264 CGH--KNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGRE 320
           C       G  V   GL+G G G +S + Q     G  FSYCL S + +  SG+L  G  
Sbjct: 218 CLRVVTGSGGSVPPQGLVGFGRGPLSFLSQTKATYGSIFSYCLPSYKSSNFSGTLRLGPA 277

Query: 321 ALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRL-TQMGDDGVVMDTGTA 379
             P      PL+ NP  PS YYV + G+ V G  +PI      L    G  G ++D GT 
Sbjct: 278 GQPRRIKTTPLLSNPHRPSLYYVAMVGVRVNGKAVPIPASALALDAATGRGGTIVDAGTM 337

Query: 380 VTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLT 439
            TRL  PAY A R+AF  +  + P A  +  FDTCY ++G  S  VP V+F F+GG  +T
Sbjct: 338 FTRLSPPAYAALRNAF-RRGVSAPAAPALGGFDTCYYVNGTKS--VPAVAFVFAGGARVT 394

Query: 440 LPASNFLIPVDDAGTFCFAFAPSPS-----GLSIIGNIQQEGIQISFDGANGFVGFGPNV 494
           LP  N +I     G  C A A  PS     GL+++ ++QQ+  ++ FD  NG VGF   +
Sbjct: 395 LPEENVVISSTSGGVACLAMAAGPSDGVNAGLNVLASMQQQNHRVVFDVGNGRVGFSREL 454

Query: 495 C 495
           C
Sbjct: 455 C 455


>gi|357481199|ref|XP_003610885.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355512220|gb|AES93843.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 416

 Score =  179 bits (453), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 122/357 (34%), Positives = 175/357 (49%), Gaps = 24/357 (6%)

Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSS 213
           G++ + I +G+PP     ++D+GSD++W+QC PC  CYKQ  P+FDP  S++++ +SC S
Sbjct: 66  GQHLMEIYIGTPPIKITGLVDTGSDLIWIQCAPCLGCYKQIKPMFDPLKSSTYNNISCDS 125

Query: 214 AVCDRLENAGCHA-GRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAI-----GCGHK 267
            +C +L+   C    RC Y   YGD S TKG LA +T T      K V++     GCGH 
Sbjct: 126 PLCHKLDTGVCSPEKRCNYTYGYGDNSLTKGVLAQDTATFTSNTGKPVSLSRFLFGCGHN 185

Query: 268 NQGMFVG-AAGLLGLGGGSMSLVGQLGGQTGG-AFSYCLVSRGT--GSSGSLVFGR--EA 321
           N G F     GL+GLGGG  SL+ Q+G   GG  FS CLV   T    S  + FG+  + 
Sbjct: 186 NTGGFNDHEMGLIGLGGGPTSLISQIGPLFGGKKFSQCLVPFLTDIKISSRMSFGKGSQV 245

Query: 322 LPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVT 381
           L  G    PLV   +  S Y+V L G+ V     P++      + +G   +++D+GT   
Sbjct: 246 LGNGVVTTPLVPREKDTS-YFVTLLGISVEDTYFPMN------STIGKANMLVDSGTPPI 298

Query: 382 RLPTPAYEAFRDAFVAQTGNLPRASGVSI-FDTCYNLSGFVSVRVPTVSFYFSGGPVLTL 440
            LP   Y+        +    P     S+    CY      +++ PT++F+F G  VL  
Sbjct: 299 LLPQQLYDKVFAEVRNKVALKPITDDPSLGTQLCYRTQ--TNLKGPTLTFHFVGANVLLT 356

Query: 441 PASNFLIPVDDA-GTFCFA-FAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           P   F+ P     G FC A +  + S   + GN  Q    I FD     V F P  C
Sbjct: 357 PIQTFIPPTPQTKGIFCLAIYNRTNSDPGVYGNFAQSNYLIGFDLDRQVVSFKPTDC 413


>gi|297822477|ref|XP_002879121.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297324960|gb|EFH55380.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 711

 Score =  179 bits (453), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 118/349 (33%), Positives = 162/349 (46%), Gaps = 31/349 (8%)

Query: 156 YFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAV 215
           Y +++ VG+PP     VID+GS+I W QC PC  CYKQ+ P+FDP+ S++F         
Sbjct: 380 YLMKLQVGTPPFEIEAVIDTGSEITWTQCLPCVHCYKQNAPIFDPSKSSTF--------- 430

Query: 216 CDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT-----VVKNVAIGCGHKNQG 270
               +   CH   C YEV Y D +YTKGTLA +T+TI  T     V+    IGCG  N  
Sbjct: 431 ----KEKRCHDHSCPYEVDYFDKTYTKGTLATDTVTIHSTSGEPFVMAETIIGCGRNNSW 486

Query: 271 MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVP 330
                 G +GL  G +SL+ Q+GG+  G  SYC    GT     + FG  A+  G   V 
Sbjct: 487 FRPSFEGFVGLNWGPLSLITQMGGEYPGLMSYCFAGNGT---SKINFGTNAIVGGGGVVS 543

Query: 331 ---LVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPA 387
               V   R P FYY+ L  + VG  RI   E L       +  +V+D+GT +T  P   
Sbjct: 544 TTMFVTTAR-PGFYYLNLDAVSVGDTRI---ETLGTPFHALEGNIVIDSGTTLTYFPESY 599

Query: 388 YEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLI 447
               R A       +P A        CY  +   +   P ++ +FSGG  L L   N  +
Sbjct: 600 CNLVRQAVEHVVPAVPAADPTGNDLLCYYSN--TTEIFPVITMHFSGGADLVLDKYNMFM 657

Query: 448 PVDDAGTFCFA-FAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
                G FC A    +P+  +I GN  Q    + +D ++  V F P  C
Sbjct: 658 ESYSGGLFCLAIICNNPTQEAIFGNRAQNNFLVGYDSSSLLVSFKPTNC 706



 Score =  142 bits (357), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 101/336 (30%), Positives = 153/336 (45%), Gaps = 49/336 (14%)

Query: 155 EYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSA 214
           EY +++ +G+PP     V+D+GS+++W QC PC  CY Q  P+FDP+ S++F    C   
Sbjct: 64  EYLMKLQIGTPPFEVEAVLDTGSELIWTQCLPCLHCYDQKAPIFDPSKSSTFKETRC--- 120

Query: 215 VCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT-----VVKNVAIGCGHKNQ 269
                 N   H+  C Y++ Y D SYT+GTLA ET+TI  T     V+    IGC   N 
Sbjct: 121 ------NTPDHS--CPYKLVYDDKSYTQGTLATETVTIHSTSGVPFVMPETIIGCSRNNS 172

Query: 270 G--MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAA 327
           G      ++G++GL  GS+SL+ Q+GG             G G   + +F + A      
Sbjct: 173 GSGFRPSSSGIVGLSRGSLSLISQMGG----------AYPGDGVVSTTMFAKTA------ 216

Query: 328 WVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPA 387
                        YY+ L  + VG  RI      F      +  +V+D+GT +T  P   
Sbjct: 217 ---------KRGQYYLNLDAVSVGDTRIETVGTPFHAL---NGNIVIDSGTPLTYFPVSY 264

Query: 388 YEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLI 447
               R A V +     R    S  D     S  + +  P ++ +FSGG  L L   N  +
Sbjct: 265 CNLVRKA-VERVVTADRVVDPSRNDMLCYYSNTIEI-FPVITVHFSGGADLVLDKYNMYM 322

Query: 448 PVDDAGTFCFA-FAPSPSGLSIIGNIQQEGIQISFD 482
            ++  G FC A    +P+ ++I GN  Q    + +D
Sbjct: 323 ELNRGGVFCLAIICNNPTQVAIFGNRAQNNFLVGYD 358


>gi|226491934|ref|NP_001140743.1| uncharacterized protein LOC100272818 [Zea mays]
 gi|194700872|gb|ACF84520.1| unknown [Zea mays]
          Length = 351

 Score =  179 bits (453), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 121/342 (35%), Positives = 174/342 (50%), Gaps = 34/342 (9%)

Query: 169 QYMVIDSGSDIVWVQCQPCS--QCYKQSDPVFDPADSASFSGVSCSSAVCDRL--ENAGC 224
           Q +V+DS SD+ WVQC PC    C+ Q D  +DP+ S + +  SCSS  C  L     GC
Sbjct: 29  QTVVLDSASDVPWVQCVPCPIPPCHPQVDSFYDPSRSPTSAAFSCSSPTCTALGPYANGC 88

Query: 225 HAGRCRYEVSYGDGSYTKGTLALETLTIGR-TVVKNVAIGCGHKNQGMF-VGAAGLLGLG 282
              +C+Y V Y DGS T G    + LT+     V     GC H  QG F   AAG++ LG
Sbjct: 89  ANNQCQYLVRYPDGSSTSGAYIADLLTLDAGNAVSGFKFGCSHAEQGSFDARAAGIMALG 148

Query: 283 GGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAA----WVPLVRNPRAP 338
           GG  SL+ Q   + G AFSYC+ +  +  SG    G   +P  A+      P+VR  +A 
Sbjct: 149 GGPESLLSQTASRYGNAFSYCIPATAS-DSGFFTLG---VPRRASSRYVVTPMVRFRQAA 204

Query: 339 SFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQ 398
           +FY V L  + VGG R+ ++  +F        G V+D+ TA+TRLP  AY+A R AF + 
Sbjct: 205 TFYGVLLRTITVGGQRLGVAPAVFAA------GSVLDSRTAITRLPPTAYQALRAAFRSS 258

Query: 399 TGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTF--- 455
                 A      DTCY+ +G V++R+P +S  F           N ++P+D +G     
Sbjct: 259 MTMYRSAPPKGYLDTCYDFTGVVNIRLPKISLVFD---------RNAVLPLDPSGILFND 309

Query: 456 CFAFAPSPSGL--SIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           C AF  +       ++G++QQ+ I++ +D   G VGF    C
Sbjct: 310 CLAFTSNADDRMPGVLGSVQQQTIEVLYDVGGGAVGFRQGAC 351


>gi|225217039|gb|ACN85323.1| aspartic proteinase nepenthesin-1 precursor [Oryza brachyantha]
          Length = 287

 Score =  179 bits (453), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 112/281 (39%), Positives = 159/281 (56%), Gaps = 16/281 (5%)

Query: 223 GCHAGRCRYEVSYGDGSYTKGTLALETLTIG-RTVVKNVAIGCGHKNQGMFVGAAGLLGL 281
           GC  G C Y V YGDGSYT G  A++TLT+     +K    GCG +N+G+F  AAGLLGL
Sbjct: 15  GCSGGHCLYGVQYGDGSYTIGFFAMDTLTLSSHDAIKGFRFGCGERNEGLFGEAAGLLGL 74

Query: 282 GGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWV---PLVRNPRAP 338
           G G  SL  Q   + GG F++C  +R +G +G L FG  + P  +A +   P++ +   P
Sbjct: 75  GRGKTSLPVQTYDKYGGVFAHCFPARSSG-TGYLEFGPGSSPAVSAKLSTTPMLID-TGP 132

Query: 339 SFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQ 398
           +FYYVG++G+ VGG  +PI + +F        G ++D+GT +TRLP  AY + R AF A 
Sbjct: 133 TFYYVGMTGIRVGGKLLPIPQSVFAAA-----GTIVDSGTVITRLPPAAYSSLRSAFAAS 187

Query: 399 TG--NLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFC 456
                  RA  +S+ DTCY+L+G   V +PTVS  F GG  L + AS  +I        C
Sbjct: 188 MAARGYKRAPALSLLDTCYDLTGASEVAIPTVSLLFQGGVSLDVDASG-IIYAASVSQAC 246

Query: 457 FAFAPSPSG--LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
             FA + +   ++I+GN Q +   + +D A+  VGF P  C
Sbjct: 247 LGFAGNEAADDVAIVGNTQLKTFGVVYDIASKVVGFCPGAC 287


>gi|359496797|ref|XP_002277380.2| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
           vinifera]
          Length = 358

 Score =  179 bits (453), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 107/307 (34%), Positives = 168/307 (54%), Gaps = 28/307 (9%)

Query: 107 SFHARMQRDVKRVATLVRRLSGGGADAAKHEV--------QDFGTDVVSGMDQGSGEYFV 158
           SF   +  D  RV TL  RL+       K  +        +     +  G   GSG Y+V
Sbjct: 61  SFSDVLAWDDARVKTLNSRLTRKDTRFPKSVLTKKDIRFPKSVSVPLNPGASIGSGNYYV 120

Query: 159 RIGVGSPPRSQYMVIDSGSDIVWVQCQPCS-QCYKQSDPVFDPADSASFSGVSCSSAVCD 217
           ++G GSP R   M++D+GS + W+QC+PC   C+ Q+DP+FDP+ S ++  +SC+S+ C 
Sbjct: 121 KVGFGSPARYYSMIVDTGSSLSWLQCKPCVVYCHVQADPLFDPSASKTYKSLSCTSSQCS 180

Query: 218 RLENAGCH-------AGRCRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQ 269
            L +A  +       +  C Y  SYGD SY+ G L+ + LT+  +  +     GCG  + 
Sbjct: 181 SLVDATLNNPLCETSSNVCVYTASYGDSSYSMGYLSQDLLTLAPSQTLPGFVYGCGQDSD 240

Query: 270 GMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAW- 328
           G+F  AAG+LGLG   +S++GQ+  + G AFSYCL +RG G  G L  G+ +L  G+A+ 
Sbjct: 241 GLFGRAAGILGLGRNKLSMLGQVSSKFGYAFSYCLPTRGGG--GFLSIGKASL-AGSAYK 297

Query: 329 -VPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPA 387
             P+  +P  PS Y++ L+ + VGG  + ++   +R+        ++D+GT +TRLP   
Sbjct: 298 FTPMTTDPGNPSLYFLRLTAITVGGRALGVAAAQYRVP------TIIDSGTVITRLPMSV 351

Query: 388 YEAFRDA 394
           Y  F+ A
Sbjct: 352 YTPFQQA 358


>gi|224074591|ref|XP_002304395.1| predicted protein [Populus trichocarpa]
 gi|222841827|gb|EEE79374.1| predicted protein [Populus trichocarpa]
          Length = 443

 Score =  179 bits (453), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 135/438 (30%), Positives = 212/438 (48%), Gaps = 40/438 (9%)

Query: 73  SSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGAD 132
           +S +  ++L L+HRD   S     N+  + R +++F     R + RV     +       
Sbjct: 28  ASPDPGFSLNLIHRDSPLSPLYNPNHTDFDRLRNAF----SRSISRVNVFKTK------- 76

Query: 133 AAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYK 192
               ++  F  D+V       GEYF+++ +G+P     ++ D+GSD+ WVQC PC  CY+
Sbjct: 77  --AVDINSFQNDLV----PNGGEYFMKMSIGTPLVEVIVIADTGSDLTWVQCLPCDPCYR 130

Query: 193 QSDPVFDPADSASFSGVSCSSAVCDRLE--NAGC--HAGRCRYEVSYGDGSYTKGTLALE 248
           Q  P+FDP+ S+S+  + C S  C+ L+     C      C Y  SYGD SYT G LA E
Sbjct: 131 QKSPLFDPSRSSSYRHMLCGSRFCNALDVSEQACTMDTNICEYHYSYGDKSYTNGNLATE 190

Query: 249 TLTIGRTVVKNVAI-----GCGHKNQGMFVG-AAGLLGLGGGSMSLVGQLGGQTGGAFSY 302
             TIG T  + V +     GCG  N G F    +G++GLGGG++SLV QL     G FSY
Sbjct: 191 KFTIGSTSSRPVHLSPIVFGCGTGNGGTFDELGSGIVGLGGGALSLVSQLSSIIKGKFSY 250

Query: 303 CLV--SRGTGSSGSLVFGREALPVGAAWV--PLVRNPRAPSFYYVGLSGLGVGGMRIPIS 358
           CLV  S  +  +  + FG +++  G   V  PLV   +  ++YYV L  + VG  R+P +
Sbjct: 251 CLVPLSEQSNVTSKIKFGTDSVISGPQVVSTPLVSK-QPDTYYYVTLEAISVGNKRLPYT 309

Query: 359 EDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVS-IFDTCYNL 417
             L     +    V++D+GT +T L +  +    +  + +T    R S    +F  C+  
Sbjct: 310 NGLLN-GNVEKGNVIIDSGTTLTFLDSEFFTEL-ERVLEETVKAERVSDPRGLFSVCFRS 367

Query: 418 SGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGI 477
           +G   + +P ++ +F+   V   P + F+    D    CF    S + + I GN+ Q   
Sbjct: 368 AG--DIDLPVIAVHFNDADVKLQPLNTFVKA--DEDLLCFTMI-SSNQIGIFGNLAQMDF 422

Query: 478 QISFDGANGFVGFGPNVC 495
            + +D     V F P  C
Sbjct: 423 LVGYDLEKRTVSFKPTDC 440


>gi|357163818|ref|XP_003579856.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 467

 Score =  178 bits (452), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 136/453 (30%), Positives = 207/453 (45%), Gaps = 56/453 (12%)

Query: 85  HRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTD 144
           H D     ++ T +++   H+    A +QR   R+A++  RL      +++++V      
Sbjct: 25  HLDIARVDASDTESLNLTDHELLRRA-IQRSRDRLASIAPRLL---PTSSRNKVVVAEAP 80

Query: 145 VVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSA 204
           V+S      GEY V++G+G+P       ID+ SD++W QCQPC +CYKQ DPVF+P  S 
Sbjct: 81  VLSA----GGEYLVKLGLGTPQHCFTAAIDTASDLIWTQCQPCVKCYKQLDPVFNPVAST 136

Query: 205 SFSGVSCSSAVCDRLENAGC-------HAGRCRYEVSYGDGSYTKGTLALETLTIGRTVV 257
           S++ V C+S  CD L+   C           C+Y  SYG  + T+G LA++ L IG  V 
Sbjct: 137 SYAVVPCNSDTCDELDTHRCARDGDSDDEDACQYTYSYGGNATTRGILAVDRLAIGDDVF 196

Query: 258 KNVAIGCGHKNQ-GMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLV 316
           + V  GC   +  G     +G++GLG G++SLV QL  +    F YCL    + S+G LV
Sbjct: 197 RGVVFGCSSSSVGGPPPQVSGVVGLGRGALSLVSQLSVRR---FMYCLPPPVSRSAGRLV 253

Query: 317 FGREALPV-----GAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPI-SEDLFRLTQMGDD 370
            G +A            VP+    R PS+YY+ L G+ +G   +   S +    T  G  
Sbjct: 254 LGADAAATVRNASERVVVPMSTGSRYPSYYYLNLDGISIGDRAMSFRSRNRMNATTPGTA 313

Query: 371 ------------------------GVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRAS 406
                                   G+++D  + +T L    YE   D    +   LPR S
Sbjct: 314 AGAPASPVSGSGDGDGSGTGPDAYGMIIDIASTITFLEESLYEEMVDDLEEEI-RLPRGS 372

Query: 407 GVSI-FDTCYNLSGFVS---VRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPS 462
           G  +  D C+ L   V    V  P VS  F G   L L      +    +G  C     +
Sbjct: 373 GSDLGLDLCFILPEGVPMSRVYAPPVSLAFEGV-WLRLDKEQMFVEDRASGMMCLMVGKT 431

Query: 463 PSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
             G+SI+GN QQ+ +Q+ ++   G + F    C
Sbjct: 432 -DGVSILGNYQQQNMQVMYNLRRGRITFIKTAC 463


>gi|242081367|ref|XP_002445452.1| hypothetical protein SORBIDRAFT_07g019450 [Sorghum bicolor]
 gi|241941802|gb|EES14947.1| hypothetical protein SORBIDRAFT_07g019450 [Sorghum bicolor]
          Length = 459

 Score =  178 bits (452), Expect = 5e-42,   Method: Compositional matrix adjust.
 Identities = 126/412 (30%), Positives = 195/412 (47%), Gaps = 25/412 (6%)

Query: 103 RHQHSFHARMQ---RDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVR 159
           RH +   A ++   R  +R+A  + +LSG    A      D     V+        + + 
Sbjct: 51  RHDNWRRAALESNARQARRLAKALDKLSGAAPGAPAAAATDIAAADVTISPYAHQGHSLT 110

Query: 160 IGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCD-- 217
           +GVG+PP+   +++D GSD++W QC       KQ +PVFD A S+SFS + C S +C+  
Sbjct: 111 VGVGTPPQPSKVILDLGSDLLWTQCSLVGPTAKQLEPVFDAARSSSFSVLPCDSKLCEAG 170

Query: 218 RLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIG--RTVVKNVAIGCGHKNQGMFVGA 275
              N  C   +C YE  YG  + T G LA ET T G    V  N+  GCG    G    A
Sbjct: 171 TFTNKTCTDRKCAYENDYGIMTAT-GVLATETFTFGAHHGVSANLTFGCGKLANGTIAEA 229

Query: 276 AGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREA------LPVGAAWV 329
           +G+LGL  G +S++ QL       FSYCL       +  ++FG  A             +
Sbjct: 230 SGILGLSPGPLSMLKQLAITK---FSYCLTPFADRKTSPVMFGAMADLGKYKTTGKVQTI 286

Query: 330 PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYE 389
           PL++NP    +YYV + G+ VG  R+ + ++   +   G  G V+D+ T +  L  PA+ 
Sbjct: 287 PLLKNPVEDIYYYVPMVGMSVGSKRLDVPQETLAIKPDGTGGTVLDSATTLAYLVEPAFT 346

Query: 390 AFRDAFVAQTGNLPRAS-GVSIFDTCYNLSGFVS---VRVPTVSFYFSGGPVLTLPASNF 445
             + A V +   LP A+  V  +  C+ L   +S   V+VP +  +F G   ++LP  N+
Sbjct: 347 ELKKA-VMEGIKLPVANRSVDDYPVCFELPRGMSMEGVQVPPLVLHFDGDAEMSLPRDNY 405

Query: 446 LIPVDDAGTFCFAF--APSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
                  G  C A   AP     ++IGN+QQ+ + + +D  N    + P  C
Sbjct: 406 FQE-PSPGMMCLAVMQAPFEGAPNVIGNVQQQNMHVLYDVGNRKFSYAPTKC 456


>gi|357515189|ref|XP_003627883.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355521905|gb|AET02359.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 434

 Score =  178 bits (452), Expect = 5e-42,   Method: Compositional matrix adjust.
 Identities = 135/420 (32%), Positives = 192/420 (45%), Gaps = 26/420 (6%)

Query: 81  LELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQD 140
           L++ H     S    +  M +     +  A+ Q  ++  ++LV R S     +A+  +Q 
Sbjct: 35  LKVFHIFSQCSPFKPSKPMSWEESVLNLQAKDQARMQYFSSLVARKSVVPIASARQIIQ- 93

Query: 141 FGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDP 200
                       S  Y V+   G+PP++  + +D+ SD  W+ C  C  C   S P F P
Sbjct: 94  ------------SPTYIVKAKFGTPPQTLLLALDTSSDAAWIPCSGCVGC-STSKP-FAP 139

Query: 201 ADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNV 260
             S SF  VSC S  C ++ N  C    C +  +YG  S    ++  +TLT+    +   
Sbjct: 140 IKSTSFRNVSCGSPHCKQVPNPTCGGSACAFNFTYGSSSIA-ASVVQDTLTLATDPIPGY 198

Query: 261 AIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGR 319
             GC +K  G      GLLGLG G +SL+ Q        FSYCL S +    SGSL  G 
Sbjct: 199 TFGCVNKTTGSSAPQQGLLGLGRGPLSLLSQSQNLYKSTFSYCLPSFKSINFSGSLRLGP 258

Query: 320 EALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTA 379
              P    + PL+RNPR  S YYV L  + VG   + I             G + D+GT 
Sbjct: 259 VYQPKRIKYTPLLRNPRRSSLYYVNLVAIKVGRKIVDIPPAALAFNPTTGAGTIFDSGTV 318

Query: 380 VTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLT 439
            TRL  P Y A R+ F  + G     + +  FDTCYN    V + VPT++F FSG  V T
Sbjct: 319 FTRLAEPVYTAVRNEFRRRVGPKLPVTTLGGFDTCYN----VPIVVPTITFLFSGMNV-T 373

Query: 440 LPASNFLIPVDDAGTFCFAFAPSP----SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           LP  N +I      T C A A +P    S L++I N+QQ+  ++ FD  N  +G    +C
Sbjct: 374 LPPDNIVIHSTAGSTTCLAMAGAPDNVNSVLNVIANMQQQNHRVLFDVPNSRIGIARELC 433


>gi|357481191|ref|XP_003610881.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355512216|gb|AES93839.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 427

 Score =  178 bits (451), Expect = 6e-42,   Method: Compositional matrix adjust.
 Identities = 142/446 (31%), Positives = 206/446 (46%), Gaps = 58/446 (13%)

Query: 68  SSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLS 127
           S + T +    ++ +L+H++  +S    +NN H ++ + SF+   ++   + +   R  S
Sbjct: 19  SQTPTEAYNKGFSFKLIHKNSPNSPFYKSNNFHKNKLR-SFYQVPKKSFVQKSPYTRVTS 77

Query: 128 GGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC 187
                                    +G+Y +++ +GSPP   Y ++D+GSD+VW QC PC
Sbjct: 78  N------------------------NGDYLMKLTLGSPPVDIYGLVDTGSDLVWAQCTPC 113

Query: 188 SQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLAL 247
             CY+Q  P+F+P  S ++S + C S  C     +      C Y  SY D S TKG LA 
Sbjct: 114 GGCYRQKSPMFEPLRSKTYSPIPCESEQCSFFGYSCSPQKMCAYSYSYADSSVTKGVLAR 173

Query: 248 ETLTIGRT-----VVKNVAIGCGHKNQGMF-VGAAGLLGLGGGSMSLVGQLGGQTGGA-F 300
           E +T   T     VV ++  GCGH N G F     G++G+GGG +SLV Q+G   G   F
Sbjct: 174 EAITFSSTDGDPVVVGDIIFGCGHSNSGTFNENDMGIIGMGGGPLSLVSQIGTLYGSKRF 233

Query: 301 SYCLVSRGTG--SSGSLVFGREALPVGAAWV--PLVRNPRAPSFYYVGLSGLGVGG--MR 354
           S CLV   T   +SG++ FG E+   G   V  PL       S Y V L G+ VG   +R
Sbjct: 234 SQCLVPFHTDAHTSGTINFGEESDVSGEGVVTTPLASEEGQTS-YLVTLEGISVGDTFVR 292

Query: 355 IPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSI-FDT 413
              SE L +        +++D+GT  T +P   YE   +    Q+  LP      +    
Sbjct: 293 FNSSETLSK------GNIMIDSGTPATYIPQEFYERLVEELKVQSSLLPIEDDPDLGTQL 346

Query: 414 CY----NLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSII 469
           CY    NL G      P ++ +F G  V  LP   F+ P D  G FCFA A S  G  I 
Sbjct: 347 CYRSETNLEG------PILTAHFEGADVQLLPIQTFIPPKD--GVFCFAMAGSTDGDYIF 398

Query: 470 GNIQQEGIQISFDGANGFVGFGPNVC 495
           GN  Q  I + FD     + F P  C
Sbjct: 399 GNFAQSNILMGFDLDRKTISFKPTDC 424


>gi|326490862|dbj|BAJ90098.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 447

 Score =  178 bits (451), Expect = 6e-42,   Method: Compositional matrix adjust.
 Identities = 116/364 (31%), Positives = 172/364 (47%), Gaps = 27/364 (7%)

Query: 152 GSGEYFVRIGVGSP-PRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVS 210
           G  EY +  G+G+P P+   + +D+GSD+VW QC+PC  C+ Q  P FD + S +  GV 
Sbjct: 88  GYTEYLIHFGIGTPRPQQVALEVDTGSDVVWTQCRPCFDCFTQPLPRFDTSASDTVHGVL 147

Query: 211 CSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTI-----GRTVVKNVAIGCG 265
           C+  +C  L    C  G C Y+V+YGD S T G LA ++ T      G+  V ++  GCG
Sbjct: 148 CTDPICRALRPHACFLGGCTYQVNYGDNSVTIGQLAKDSFTFDGKGGGKVTVPDLVFGCG 207

Query: 266 HKNQGMF-VGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFG---REA 321
             N G F     G+ G G G +SL  QLG  +   FSYC  +     S  +  G    + 
Sbjct: 208 QYNTGNFHSNETGIAGFGRGPLSLPRQLGVSS---FSYCFTTIFESKSTPVFLGGAPADG 264

Query: 322 LPVGAAWVPLVRN---PRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGT 378
           L   A   P++     P  P +YY+ L G+ VG  R+ + E  F +   G  G ++D+GT
Sbjct: 265 LRAHATG-PILSTPFLPNHPEYYYLSLKGITVGKTRLAVPESAFVVKADGSGGTIIDSGT 323

Query: 379 AVTRLPTPAYEAFRDAFVAQTGNLPRAS-------GVSIFDTCYNLSGFVSVRVPTVSFY 431
           A+T  P   + +  +AFVAQ   LP  S        +  F T  ++     V VP ++ +
Sbjct: 324 AITAFPRAVFRSLWEAFVAQV-PLPHTSYNDTGEPTLQCFST-ESVPDASKVPVPKMTLH 381

Query: 432 FSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFG 491
             G     LP  N++    D+   C          ++IGN QQ+ + I  D A   +   
Sbjct: 382 LEGAD-WELPRENYMAEYPDSDQLCVVVLAGDDDRTMIGNFQQQNMHIVHDLAGNKLVIE 440

Query: 492 PNVC 495
           P  C
Sbjct: 441 PAQC 444


>gi|356495754|ref|XP_003516738.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 420

 Score =  178 bits (451), Expect = 7e-42,   Method: Compositional matrix adjust.
 Identities = 120/357 (33%), Positives = 176/357 (49%), Gaps = 25/357 (7%)

Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSS 213
           G Y + + +G+PP   Y + D+GSD+ W  C PC+ CYKQ +P+FDP  S ++  +SC S
Sbjct: 70  GHYLMELSIGTPPFKIYGIADTGSDLTWTSCVPCNNCYKQRNPMFDPQKSTTYRNISCDS 129

Query: 214 AVCDRLENAGCHA-GRCRYEVSYGDGSYTKGTLALETLTI----GRTV-VKNVAIGCGHK 267
            +C +L+   C    RC Y  +Y   + T+G LA ET+T+    G++V +K +  GCGH 
Sbjct: 130 KLCHKLDTGVCSPQKRCNYTYAYASAAITRGVLAQETITLSSTKGKSVPLKGIVFGCGHN 189

Query: 268 NQGMFVG-AAGLLGLGGGSMSLVGQLGGQTGGA-FSYCLVSRGTGSSGS--LVFGREALP 323
           N G F     G++GLGGG +SL+ Q+G   GG  FS CLV   T  S S  + FG+ +  
Sbjct: 190 NTGGFNDHEMGIIGLGGGPVSLISQMGSSFGGKRFSQCLVPFHTDVSVSSKMSFGKGSKV 249

Query: 324 VGAAWV--PLV-RNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDG-VVMDTGTA 379
            G   V  PLV +  + P  Y+V L G+ V    +  +      +Q  + G + +D+GT 
Sbjct: 250 SGKGVVSTPLVAKQDKTP--YFVTLLGISVENTYLHFNGS----SQNVEKGNMFLDSGTP 303

Query: 380 VTRLPTPAYEAFRDAFVAQTGNLPRASGVSI-FDTCYNLSGFVSVRVPTVSFYFSGGPVL 438
            T LPT  Y+       ++    P      +    CY      ++R P ++ +F G  V 
Sbjct: 304 PTILPTQLYDQVVAQVRSEVAMKPVTDDPDLGPQLCYRTKN--NLRGPVLTAHFEGADVK 361

Query: 439 TLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
             P   F+ P D  G FC  F  + S   + GN  Q    I FD     V F P  C
Sbjct: 362 LSPTQTFISPKD--GVFCLGFTNTSSDGGVYGNFAQSNYLIGFDLDRQVVSFKPKDC 416


>gi|413921849|gb|AFW61781.1| hypothetical protein ZEAMMB73_702843 [Zea mays]
          Length = 379

 Score =  177 bits (450), Expect = 8e-42,   Method: Compositional matrix adjust.
 Identities = 104/311 (33%), Positives = 153/311 (49%), Gaps = 20/311 (6%)

Query: 102 HRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIG 161
           H    + + ++Q   + +A    R++   + A    V D  T     +   SGEY V + 
Sbjct: 35  HVDAGTSYTKLQLLSRAIARSKARVAALQSAAVLPPVVDPITAARVLVTASSGEYLVDLA 94

Query: 162 VGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLEN 221
           +G+PP     ++D+GSD++W QC PC  C  Q  P FD   SA++  + C S+ C  L +
Sbjct: 95  IGTPPLYYTAIMDTGSDLIWTQCAPCLLCADQPTPYFDVKKSATYRALPCRSSRCASLSS 154

Query: 222 AGCHAGRCRYEVSYGDGSYTKGTLALETLTIG-----RTVVKNVAIGCGHKNQGMFVGAA 276
             C    C Y+  YGD + T G LA ET T G     +    N+A GCG  N G    ++
Sbjct: 155 PSCFKKMCVYQYYYGDTASTAGVLANETFTFGAANSTKVRATNIAFGCGSLNAGDLANSS 214

Query: 277 GLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREA----------LPVGA 326
           G++G G G +SLV QLG      FSYCL S  + +   L FG  A           PV +
Sbjct: 215 GMVGFGRGPLSLVSQLGPSR---FSYCLTSYLSATPSRLYFGVYANLSSTNTSSGSPVQS 271

Query: 327 AWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTP 386
              P V NP  P+ Y++ L  + +G   +PI   +F +   G  GV++D+GT++T L   
Sbjct: 272 --TPFVINPALPNMYFLSLKAISLGTKLLPIDPLVFAINDDGTGGVIIDSGTSITWLQQD 329

Query: 387 AYEAFRDAFVA 397
           AYEA R   V+
Sbjct: 330 AYEAVRRGLVS 340


>gi|222624819|gb|EEE58951.1| hypothetical protein OsJ_10630 [Oryza sativa Japonica Group]
          Length = 440

 Score =  177 bits (449), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 122/385 (31%), Positives = 183/385 (47%), Gaps = 21/385 (5%)

Query: 122 LVRRLSGGGADAAKHEVQDFGTDVVS--GMDQG--SGEYFVRIGVGSPPRSQYMVIDSGS 177
           L+RR++      A   +    T  VS    D G    EY + + +G+PP+   + +D+GS
Sbjct: 53  LMRRMALRSKARAPRLLSSSATAPVSPGAYDDGVPMTEYLLHLAIGTPPQPVQLTLDTGS 112

Query: 178 DIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGR----CRYEV 233
            +VW QCQPC+ C+ QS P +D + S++F+  SC S  C    +      +    C Y  
Sbjct: 113 VLVWTQCQPCAVCFNQSLPYYDASRSSTFALPSCDSTQCKLDPSVTMCVNQTVQTCAYSY 172

Query: 234 SYGDGSYTKGTLALETLT-IGRTVVKNVAIGCGHKNQGMF-VGAAGLLGLGGGSMSLVGQ 291
           SYGD S T G L +ET++ +    V  V  GCG  N G+F     G+ G G G +SL  Q
Sbjct: 173 SYGDKSATIGFLDVETVSFVAGASVPGVVFGCGLNNTGIFRSNETGIAGFGRGPLSLPSQ 232

Query: 292 LGGQTGGAFSYCLVSRGTGSSGSLVFGREA--LPVGAAWV---PLVRNPRAPSFYYVGLS 346
           L     G FS+C  +       +++F   A     G   V   PL++NP  P+FYY+ L 
Sbjct: 233 L---KVGNFSHCFTAVSGRKPSTVLFDLPADLYKNGRGTVQTTPLIKNPAHPTFYYLSLK 289

Query: 347 GLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRAS 406
           G+ VG  R+P+ E  F L + G  G ++D+GTA T LP   Y    D F A        S
Sbjct: 290 GITVGSTRLPVPESAFAL-KNGTGGTIIDSGTAFTSLPPRVYRLVHDEFAAHVKLPVVPS 348

Query: 407 GVSIFDTCYNLSGF-VSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSG 465
             +    C++      +  VP +  +F G   + LP  N++    D G      A     
Sbjct: 349 NETGPLLCFSAPPLGKAPHVPKLVLHFEGA-TMHLPRENYVFEAKDGGNCSICLAIIEGE 407

Query: 466 LSIIGNIQQEGIQISFDGANGFVGF 490
           ++IIGN QQ+ + + +D  N  + F
Sbjct: 408 MTIIGNFQQQNMHVLYDLKNSKLSF 432


>gi|357481205|ref|XP_003610888.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355512223|gb|AES93846.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 413

 Score =  177 bits (449), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 125/389 (32%), Positives = 190/389 (48%), Gaps = 30/389 (7%)

Query: 122 LVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVW 181
           L+R+     +  + + +QD    V + ++   G+Y + + +G+PP      +D+GSD++W
Sbjct: 37  LIRK----SSHLSSNNIQDI---VQAPINAYIGQYLMELYIGTPPIKISGTVDTGSDLIW 89

Query: 182 VQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHA-GRCRYEVSYGDGSY 240
           VQC PC  CY Q +P+FDP  S++++ +SC S +C +     C    RC Y   Y D S 
Sbjct: 90  VQCVPCLGCYNQINPMFDPLKSSTYTNISCDSPLCYKPYIGECSPEKRCDYTYGYADSSL 149

Query: 241 TKGTLALETLTI----GRTV-VKNVAIGCGHKNQGMFVG-AAGLLGLGGGSMSLVGQLGG 294
           TKG LA ET+T+    G+ + ++ +  GCGH N G F     GL+GLGGG  SLV Q+G 
Sbjct: 150 TKGVLAQETVTLTSNTGKPISLQGILFGCGHNNTGNFNDHEMGLIGLGGGPTSLVSQIGP 209

Query: 295 QTGG-AFSYCLVSRGTG--SSGSLVFGR--EALPVGAAWVPLVRNPRAPSFYYVGLSGLG 349
             GG  FS CLV   T    S  + FG+  E L  G    PLV+  +  + YYV L G+ 
Sbjct: 210 LFGGKKFSQCLVPFLTDITISSQMSFGKGSEVLGEGVVTTPLVQREQDMTSYYVTLLGIS 269

Query: 350 VGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVS 409
           V    +P++  + +        +++D+GT    LP   Y+        +    P     S
Sbjct: 270 VEDTYLPMNSTIEK------GNMLVDSGTPPNILPQQLYDRVYVEVKNKVPLEPITDDPS 323

Query: 410 I-FDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDA-GTFCFAFAP-SPSGL 466
           +    CY      +++ PT++++F G  +L  P   F+ P  +  G FC A    + S  
Sbjct: 324 LGPQLCYRTQ--TNLKGPTLTYHFEGANLLLTPIQTFIPPTPETKGVFCLAITNCANSDP 381

Query: 467 SIIGNIQQEGIQISFDGANGFVGFGPNVC 495
            I GN  Q    I FD     V F P  C
Sbjct: 382 GIYGNFAQTNYLIGFDLDRQIVSFKPTDC 410


>gi|125543638|gb|EAY89777.1| hypothetical protein OsI_11319 [Oryza sativa Indica Group]
          Length = 390

 Score =  177 bits (448), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 122/362 (33%), Positives = 175/362 (48%), Gaps = 39/362 (10%)

Query: 155 EYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSS- 213
           EY V + +G+PP+   + +D+GSD++W QC+PC  C+ Q  P FD + S++ + + C S 
Sbjct: 34  EYLVHLAIGTPPQPVQLTLDTGSDLIWTQCKPCVSCFDQPLPYFDTSRSSTNALLPCEST 93

Query: 214 --------AVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLT-IGRTVVKNVAIGC 264
                    VC +L         C Y  SYGD S T G LA +  T +  T +  V  GC
Sbjct: 94  QCKLDPTVTVCVKLNQT---VQTCAYYTSYGDNSVTIGLLAADKFTFVAGTSLPGVTFGC 150

Query: 265 GHKNQGMF-VGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALP 323
           G  N G+F     G+ G G G +SL  QL     G FS+C  +  TG+  S V     LP
Sbjct: 151 GLNNTGVFNSNETGIAGFGRGPLSLPSQL---KVGNFSHCFTTI-TGAIPSTVLLD--LP 204

Query: 324 V--------GAAWVPLV---RNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGV 372
                         PL+   +N   P+ YY+ L G+ VG  R+P+ E  F LT  G  G 
Sbjct: 205 ADLFSNGQGAVQTTPLIQYAKNEANPTLYYLSLKGITVGSTRLPVPESAFALTN-GTGGT 263

Query: 373 VMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSI-FDTCYNLSGFVSVRVPTVSFY 431
           ++D+GT++T LP   Y+  RD F AQ   LP   G +    TC++        VP +  +
Sbjct: 264 IIDSGTSITSLPPQVYQVVRDEFAAQI-KLPVVPGNATGHYTCFSAPSQAKPDVPKLVLH 322

Query: 432 FSGGPVLTLPASNFLIPV-DDAGT--FCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFV 488
           F G   + LP  N++  V DDAG    C A        +IIGN QQ+ + + +D  N  +
Sbjct: 323 FEGA-TMDLPRENYVFEVPDDAGNSIICLAINKG-DETTIIGNFQQQNMHVLYDLQNNML 380

Query: 489 GF 490
            F
Sbjct: 381 SF 382


>gi|297846526|ref|XP_002891144.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297336986|gb|EFH67403.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 445

 Score =  176 bits (447), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 132/446 (29%), Positives = 213/446 (47%), Gaps = 52/446 (11%)

Query: 69  SSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSG 128
           +S +S+     ++EL+HRD   S          +  QH+   R+       A  +R +S 
Sbjct: 19  TSTSSAHRKNLSVELIHRDSPHSP--------LYNPQHTVSDRLN------AAFLRSISR 64

Query: 129 GGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS 188
               + K       TD+ SG+    GEYF+ I +G+PP     + D+GSD+ WVQC+PC 
Sbjct: 65  SRRFSTK-------TDLQSGLISNGGEYFMSISIGTPPSKFLAIADTGSDLTWVQCKPCQ 117

Query: 189 QCYKQSDPVFDPADSASFSGVSCSSAVCDRL--ENAGCHAGR--CRYEVSYGDGSYTKGT 244
           QCYKQ+ P+FD   S+++   SC S  C+ L     GC   R  C+Y  SYGD S+TKG 
Sbjct: 118 QCYKQNTPLFDKKKSSTYKTESCDSITCNALSEHEEGCDESRNACKYRYSYGDESFTKGE 177

Query: 245 LALETLTIGRTVVKNV-----AIGCGHKNQGMFVGAAGLLGLGGGS-MSLVGQLGGQTGG 298
           +A ET++I  +    V     A GCG+ N G F      +   GG  +SLV QLG   G 
Sbjct: 178 VATETISIDSSSGSPVSFPGTAFGCGYNNGGTFEETGSGIIGLGGGPLSLVSQLGSSIGK 237

Query: 299 AFSYCLVSRGTGSSGSLVFG--------REALPVGAAWVPLV-RNPRAPSFYYVGLSGLG 349
            FSYCL      ++G+ V          + +        PL+ ++P   ++Y++ L  + 
Sbjct: 238 KFSYCLSHTSATTNGTSVINLGTNSMTSKPSKDSAILTTPLIQKDPE--TYYFLTLEAIT 295

Query: 350 VGGMRIPIS----EDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRA 405
           VG  ++P +      L R ++   + +++D+GT +T L +  Y+ F            R 
Sbjct: 296 VGKTKLPYTGGGGYSLNRKSKKTGN-IIIDSGTTLTLLDSGFYDDFGAVVEESVTGAKRV 354

Query: 406 SGVS-IFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPS 464
           S    I   C+  SG   + +PT++ +F+G  V   P ++F+   +D    C +  P+ +
Sbjct: 355 SDPQGILTHCFK-SGDKEIGLPTITMHFTGADVKLSPINSFVKLSEDI--VCLSMIPT-T 410

Query: 465 GLSIIGNIQQEGIQISFDGANGFVGF 490
            ++I GN+ Q    + +D     V F
Sbjct: 411 EVAIYGNMVQMDFLVGYDLETKTVSF 436


>gi|388498308|gb|AFK37220.1| unknown [Lotus japonicus]
          Length = 363

 Score =  176 bits (447), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 102/306 (33%), Positives = 159/306 (51%), Gaps = 21/306 (6%)

Query: 59  ELFERHNNISSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKR 118
           ++ +R   + S      E+R     +  +    S  +   +++HR  H+        V+ 
Sbjct: 52  QILQRKQQLGSLGCLHPESRQEKGAIMLEMKDRSYCSKKKVNWHRKLHNQLTLDDLHVRS 111

Query: 119 VATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSD 178
           +   +R++      +   EV      + SG++  +  Y V + +G   +   ++ID+GSD
Sbjct: 112 MQNRLRKM----VSSHSVEVSQIQIPLASGVNFQTLNYIVTMELGG--QDMTVIIDTGSD 165

Query: 179 IVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLE----NAG-CHAG--RCRY 231
           + WVQC+PC  CY Q  PVF P+ S+S+  + C+S+ C  L+    NAG C +    C Y
Sbjct: 166 LTWVQCEPCMSCYNQQGPVFKPSTSSSYQSIPCNSSTCQSLQLTTGNAGACESNPSNCSY 225

Query: 232 EVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQ 291
            V+YGDGSYT G L  E L+ G   V N   GCG  N+G+F G +GL+GLG  ++SL+ Q
Sbjct: 226 AVNYGDGSYTNGELGAEHLSFGGISVSNFVFGCGKNNKGLFGGVSGLMGLGRSNLSLISQ 285

Query: 292 LGGQTGGAFSYCLVSRGTGSSGSLVFGREA------LPVGAAWVPLVRNPRAPSFYYVGL 345
                GG FSYCL     G+SGSL  G E+       P+  A+  +V NP+  +FY + L
Sbjct: 286 TNSTFGGVFSYCLPPTDAGASGSLAMGNESSVFKNLTPI--AYTRMVPNPQLSNFYMLNL 343

Query: 346 SGLGVG 351
           +G+ VG
Sbjct: 344 TGIDVG 349


>gi|388495448|gb|AFK35790.1| unknown [Medicago truncatula]
          Length = 434

 Score =  176 bits (446), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 134/420 (31%), Positives = 191/420 (45%), Gaps = 26/420 (6%)

Query: 81  LELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQD 140
           L++ H     S    +  M +     +  A+ Q  ++  ++LV R S     +A+  +Q 
Sbjct: 35  LKVFHIFSQCSPFKPSKPMSWEESVLNLQAKDQARMQYFSSLVARKSVVPIASARQIIQ- 93

Query: 141 FGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDP 200
                       S  Y V+   G+PP++  + +D+ SD  W+ C  C  C   S P F P
Sbjct: 94  ------------SPTYIVKAKFGTPPQTLLLALDTSSDAAWIPCSGCVGC-STSKP-FAP 139

Query: 201 ADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNV 260
             S SF  VSC S  C ++ N  C    C +  +YG  S    ++  +TLT+    +   
Sbjct: 140 IKSTSFRNVSCGSPHCKQVPNPTCGGSACAFNFTYGSSSIA-ASVVQDTLTLAADPIPGY 198

Query: 261 AIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGR 319
             GC +K  G      GLLGLG G +SL+ Q        FSYCL S +    SGSL  G 
Sbjct: 199 TFGCVNKTTGSSAPQQGLLGLGRGPLSLLSQSQNLYKSTFSYCLPSFKSINFSGSLRLGP 258

Query: 320 EALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTA 379
              P    + PL+RNPR  S YYV L  + VG   + I             G + D+GT 
Sbjct: 259 VYQPKRIKYTPLLRNPRRSSLYYVNLVAIKVGRKIVDIPPAALAFNPTTGAGTIFDSGTV 318

Query: 380 VTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLT 439
            TRL  P Y A R+ F  + G     + +  FDTCYN    V + VPT++F FSG  V  
Sbjct: 319 FTRLAEPVYTAVRNEFRRRVGPKLPVTTLGGFDTCYN----VPIVVPTITFLFSGMNV-A 373

Query: 440 LPASNFLIPVDDAGTFCFAFAPSP----SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           LP  N +I      T C A A +P    S L++I N+QQ+  ++ FD  N  +G    +C
Sbjct: 374 LPPDNIVIHSTAGSTTCLAMAGAPDNVNSVLNVIANMQQQNHRVLFDVPNSRIGIARELC 433


>gi|108707835|gb|ABF95630.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
          Length = 384

 Score =  176 bits (446), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 113/348 (32%), Positives = 170/348 (48%), Gaps = 17/348 (4%)

Query: 155 EYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSA 214
           EY + + +G+PP+   + +D+GS +VW QCQPC+ C+ QS P +D + S++F+  SC S 
Sbjct: 34  EYLLHLAIGTPPQPVQLTLDTGSVLVWTQCQPCAVCFNQSLPYYDASRSSTFALPSCDST 93

Query: 215 VCDRLENAGCHAGR----CRYEVSYGDGSYTKGTLALETLT-IGRTVVKNVAIGCGHKNQ 269
            C    +      +    C Y  SYGD S T G L +ET++ +    V  V  GCG  N 
Sbjct: 94  QCKLDPSVTMCVNQTVQTCAYSYSYGDKSATIGFLDVETVSFVAGASVPGVVFGCGLNNT 153

Query: 270 GMF-VGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREA--LPVGA 326
           G+F     G+ G G G +SL  QL     G FS+C  +       +++F   A     G 
Sbjct: 154 GIFRSNETGIAGFGRGPLSLPSQL---KVGNFSHCFTAVSGRKPSTVLFDLPADLYKNGR 210

Query: 327 AWV---PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRL 383
             V   PL++NP  P+FYY+ L G+ VG  R+P+ E  F L + G  G ++D+GTA T L
Sbjct: 211 GTVQTTPLIKNPAHPTFYYLSLKGITVGSTRLPVPESAFAL-KNGTGGTIIDSGTAFTSL 269

Query: 384 PTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGF-VSVRVPTVSFYFSGGPVLTLPA 442
           P   Y    D F A        S  +    C++      +  VP +  +F G   + LP 
Sbjct: 270 PPRVYRLVHDEFAAHVKLPVVPSNETGPLLCFSAPPLGKAPHVPKLVLHFEGA-TMHLPR 328

Query: 443 SNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGF 490
            N++    D G      A     ++IIGN QQ+ + + +D  N  + F
Sbjct: 329 ENYVFEAKDGGNCSICLAIIEGEMTIIGNFQQQNMHVLYDLKNSKLSF 376


>gi|449449334|ref|XP_004142420.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
          Length = 441

 Score =  176 bits (446), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 140/433 (32%), Positives = 203/433 (46%), Gaps = 27/433 (6%)

Query: 68  SSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLS 127
           S+ N ++D +   L++ H     S    +  + +  +     A+ Q  ++ +++LV R S
Sbjct: 29  SNCNPAADRSS-TLQVFHIFSPCSPFRPSKPLSWADNVLQMQAKDQARLQFLSSLVARRS 87

Query: 128 GGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC 187
                +A+  +Q             S  + VR  +G+P ++  + +D+ +D  W+ C  C
Sbjct: 88  FVPIASARQLIQ-------------SPTFVVRAKIGTPAQTLLLALDTSNDAAWIPCSGC 134

Query: 188 SQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLAL 247
             C   S  VF    S+SF  + C S  C+++ N  C    C + ++YG  S     L  
Sbjct: 135 IGC--PSTTVFSSDKSSSFRPLPCQSPQCNQVPNPSCSGSACGFNLTYG-SSTVAADLVQ 191

Query: 248 ETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS- 306
           + LT+    V +   GC  K  G  V   GLLGLG G +SL+GQ        FSYCL S 
Sbjct: 192 DNLTLATDSVPSYTFGCIRKATGSSVPPQGLLGLGRGPLSLLGQSQSLYQSTFSYCLPSF 251

Query: 307 RGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQ 366
           +    SGSL  G  A P+   + PL+RNPR  S YYV L  + VG   + I         
Sbjct: 252 KSVNFSGSLRLGPVAQPIRIKYTPLLRNPRRSSLYYVNLISIRVGRKIVDIPPSALAFNS 311

Query: 367 MGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVP 426
               G V+D+GT  TRL  PAY A RD F  + G     S +  FDTCY     V +  P
Sbjct: 312 ATGAGTVIDSGTTFTRLVAPAYTAVRDEFRRRVGRNVTVSSLGGFDTCYT----VPIISP 367

Query: 427 TVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSP----SGLSIIGNIQQEGIQISFD 482
           T++F F+G  V TLP  NFLI      T C A A +P    S L++I ++QQ+  +I FD
Sbjct: 368 TITFMFAGMNV-TLPPDNFLIHSTAGSTTCLAMAAAPDNVNSVLNVIASMQQQNHRILFD 426

Query: 483 GANGFVGFGPNVC 495
             N  VG     C
Sbjct: 427 IPNSRVGVARESC 439


>gi|413951280|gb|AFW83929.1| hypothetical protein ZEAMMB73_279135 [Zea mays]
          Length = 451

 Score =  176 bits (446), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 134/438 (30%), Positives = 190/438 (43%), Gaps = 24/438 (5%)

Query: 74  SDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVR----RLSGG 129
           S   R +  L+ R   S S +            +  A+   D  R ATL           
Sbjct: 20  STALRSSTLLLARSPQSVSLSAVPGTPVTAWAATLAAQTASDAARAATLATGPRDPPPAS 79

Query: 130 GADAAKHEVQDFGTDVVSGMDQGS-GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS 188
             DAAK   +     +  G    S   Y  R  +G+P ++  + ID  +D  WV C   +
Sbjct: 80  AVDAAKKGPRRSFVPIAPGRQLLSIPSYVARARLGTPAQALLVAIDPSNDAAWVPCA--A 137

Query: 189 QCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAG---RCRYEVSYGDGSYTKGTL 245
                  P FDP  S+++  V C +  C +     C  G    C + +SY   ++ +  L
Sbjct: 138 CAGCARAPSFDPTRSSTYRPVRCGAPQCSQAPAPSCPGGLGSSCAFNLSYAASTF-QALL 196

Query: 246 ALETLTIGRTV--VKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYC 303
             + L +   V  V     GC H   G  V   GL+G G G +S   Q     G  FSYC
Sbjct: 197 GQDALALHDDVDAVAAYTFGCLHVVTGGSVPPQGLVGFGRGPLSFPSQTKDVYGSVFSYC 256

Query: 304 LVS-RGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLF 362
           L S + +  SG+L  G    P      PL+ NP  PS YYV + G+ VGG  +P+     
Sbjct: 257 LPSYKSSNFSGTLRLGPAGQPKRIKTTPLLSNPHRPSLYYVNMVGIRVGGRPVPVPASAL 316

Query: 363 RLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVS 422
                   G ++D GT  TRL  P Y A RD F ++    P A  +  FDTCYN    V+
Sbjct: 317 AFDPTSGRGTIVDAGTMFTRLSAPVYAAVRDVFRSRV-RAPVAGPLGGFDTCYN----VT 371

Query: 423 VRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSP-----SGLSIIGNIQQEGI 477
           + VPTV+F F G   +TLP  N +I     G  C A A  P     + L+++ ++QQ+  
Sbjct: 372 ISVPTVTFSFDGRVSVTLPEENVVIRSSSGGIACLAMAAGPPDGVDAALNVLASMQQQNH 431

Query: 478 QISFDGANGFVGFGPNVC 495
           ++ FD ANG VGF   +C
Sbjct: 432 RVLFDVANGRVGFSRELC 449


>gi|296085498|emb|CBI29230.3| unnamed protein product [Vitis vinifera]
          Length = 542

 Score =  176 bits (445), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 126/416 (30%), Positives = 195/416 (46%), Gaps = 57/416 (13%)

Query: 79  WNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEV 138
           ++++L+HRD   S     +     R   +F    +R V RV     R +   +D  +  +
Sbjct: 32  FSVDLIHRDSPHSPFFDPSKTQAERLTDAF----RRSVSRVGRF--RPTAMTSDGIQSRI 85

Query: 139 QDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVF 198
                         +GEY + + +G+PP     ++D+GSD+ W QC+PC+ CYKQ  P+F
Sbjct: 86  V-----------PSAGEYLMNLYIGTPPVPVIAIVDTGSDLTWTQCRPCTHCYKQVVPLF 134

Query: 199 DPADSASFSGVSCSSAVCDRL-ENAGC-HAGRCRYEVSYGDGSYTKGTLALETLTIGRTV 256
           DP +S+++   SC ++ C  L ++  C    +C +  SY DGS+T G LA ETLT+  T 
Sbjct: 135 DPKNSSTYRDSSCGTSFCLALGKDRSCSKEKKCTFRYSYADGSFTGGNLASETLTVDSTA 194

Query: 257 VKNV-----AIGCGHKNQGMF-VGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTG 310
            K V     A GCGH + G+F   ++G++GLGGG +SL+ QL     G FSYCL+   T 
Sbjct: 195 GKPVSFPGFAFGCGHSSGGIFDKSSSGIVGLGGGELSLISQLKSTINGLFSYCLLPVSTD 254

Query: 311 SSGS--LVFGREALPVGAAWV--PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQ 366
           SS S  + FG      G   V  PL    R P   Y G S                + T+
Sbjct: 255 SSISSRINFGASGRVSGYGTVSTPL----RLP---YKGYS----------------KKTE 291

Query: 367 MGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVP 426
           + +  +++D+GT  T LP   Y     +               IF  CYN +    +  P
Sbjct: 292 VEEGNIIVDSGTTYTFLPQEFYSKLEKSVANSIKGKRVRDPNGIFSLCYNTTA--EINAP 349

Query: 427 TVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFD 482
            ++ +F    V   P + F+   +D    CF  AP+ S + ++GN+ Q    + FD
Sbjct: 350 IITAHFKDANVELQPLNTFMRMQEDL--VCFTVAPT-SDIGVLGNLAQVNFLVGFD 402



 Score = 39.7 bits (91), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 34/134 (25%), Positives = 58/134 (43%), Gaps = 6/134 (4%)

Query: 363 RLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVS-IFDTCYNLSGFV 421
           +  ++ +  +++D+GT  T LP   Y    ++ VA +    R    + I   CYN +   
Sbjct: 411 KKAEVEEGNIIVDSGTTYTYLPLEFYVKLEES-VAHSIKGKRVRDPNGISSLCYNTT-VD 468

Query: 422 SVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISF 481
            +  P ++ +F    V   P + FL   +D    CF   P+ S + I+GN+ Q    + F
Sbjct: 469 QIDAPIITAHFKDANVELQPWNTFLRMQEDL--VCFTVLPT-SDIGILGNLAQVNFLVGF 525

Query: 482 DGANGFVGFGPNVC 495
           D     V F    C
Sbjct: 526 DLRKKRVSFKAADC 539


>gi|357514989|ref|XP_003627783.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355521805|gb|AET02259.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 416

 Score =  176 bits (445), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 136/433 (31%), Positives = 208/433 (48%), Gaps = 63/433 (14%)

Query: 79  WNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEV 138
           +++EL+HRD   SS +       +++QH  +A  +R + R               A H  
Sbjct: 28  FSVELIHRD---SSKSPLYQPTQNKYQHIVNA-ARRSINR---------------ANHFY 68

Query: 139 QDFGTDVV-SGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPV 197
           +   T+   S +    GEY +   VG+PP   Y + D+GSDIVW+QC+PC +CY Q+ P 
Sbjct: 69  KTALTNTPQSTVIPDHGEYLMTYSVGTPPFKLYGIADTGSDIVWLQCEPCKECYNQTTPK 128

Query: 198 FDPADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVV 257
           F P+ S+++  + CSS +C                 S   G+ +  TL LE+ T      
Sbjct: 129 FKPSKSSTYKNIPCSSDLCK----------------SGQQGNLSVDTLTLESSTGHPISF 172

Query: 258 KNVAIGCGHKNQGMFVGA-AGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR--GTGSSGS 314
               IGCG  N   F GA +G++GLGGG  SL+ QLG      FSYCL+     + ++  
Sbjct: 173 PKTVIGCGTDNTVSFEGASSGIVGLGGGPASLITQLGSSIDAKFSYCLLPNPVESNTTSK 232

Query: 315 LVFGREALPVGAAWV--PLVRNPRAP-SFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDG 371
           L FG  A+  G   V  P+V+  + P  FYY+ L    VG  RI         +  G +G
Sbjct: 233 LNFGDTAVVSGDGVVSTPIVK--KDPIVFYYLTLEAFSVGNKRIEFEGS----SNGGHEG 286

Query: 372 -VVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVS-IFDTCYNLS--GFVSVRVPT 427
            +++D+GT +T +PT  Y     A V +   L R +  + +F+ CY+++  G+     P 
Sbjct: 287 NIIIDSGTTLTVIPTDVYNNLESA-VLELVKLKRVNDPTRLFNLCYSVTSDGY---DFPI 342

Query: 428 VSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPS----PSG-LSIIGNIQQEGIQISFD 482
           ++ +F G  V   P S F+   D  G  C AFA +    PS  +SI GN+ Q+ + + +D
Sbjct: 343 ITTHFKGADVKLHPISTFVDVAD--GIVCLAFATTSAFIPSDVVSIFGNLAQQNLLVGYD 400

Query: 483 GANGFVGFGPNVC 495
                V F P  C
Sbjct: 401 LQQKIVSFKPTDC 413


>gi|222624820|gb|EEE58952.1| hypothetical protein OsJ_10633 [Oryza sativa Japonica Group]
          Length = 415

 Score =  175 bits (444), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 124/353 (35%), Positives = 176/353 (49%), Gaps = 40/353 (11%)

Query: 155 EYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSA 214
           EY V + +G+PP+   + +D+GSD++W QCQPC  C+ Q+ P FDP+ S++ S  SC S 
Sbjct: 88  EYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTSCDST 147

Query: 215 VCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFV- 273
           +C  L  A     R       G G+   G                VA GCG  N G+F  
Sbjct: 148 LCQGLPVASLP--RSDKFTFVGAGASVPG----------------VAFGCGLFNNGVFKS 189

Query: 274 GAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPV--------G 325
              G+ G G G +SL  QL     G FS+C  +  TG+  S V     LP          
Sbjct: 190 NETGIAGFGRGPLSLPSQL---KVGNFSHCFTTI-TGAIPSTVL--LDLPADLFSNGQGA 243

Query: 326 AAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPT 385
               PL++NP  P+FYY+ L G+ VG  R+P+ E  F L + G  G ++D+GTA+T LPT
Sbjct: 244 VQTTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFAL-KNGTGGTIIDSGTAMTSLPT 302

Query: 386 PAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVR--VPTVSFYFSGGPVLTLPAS 443
             Y   RDAF AQ   LP  SG +  D  + LS  +  +  VP +  +F G   + LP  
Sbjct: 303 RVYRLVRDAFAAQV-KLPVVSG-NTTDPYFCLSAPLRAKPYVPKLVLHFEGA-TMDLPRE 359

Query: 444 NFLIPVDDAGTFCFAFAPSPSG-LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           N++  V+DAG+     A    G ++ IGN QQ+ + + +D  N  + F P  C
Sbjct: 360 NYVFEVEDAGSSILCLAIIEGGEVTTIGNFQQQNMHVLYDLQNSKLSFVPAQC 412


>gi|218192707|gb|EEC75134.1| hypothetical protein OsI_11325 [Oryza sativa Indica Group]
          Length = 401

 Score =  175 bits (444), Expect = 5e-41,   Method: Compositional matrix adjust.
 Identities = 114/312 (36%), Positives = 163/312 (52%), Gaps = 31/312 (9%)

Query: 155 EYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSA 214
           EY V + +G+PP+   + +D+GSD++W QCQPC  C+ Q+ P FDP+ S++ S  SC S 
Sbjct: 81  EYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTSCDST 140

Query: 215 VCDRLENAGCHAGR------CRYEVSYGDGSYTKGTLALETLTI--GRTVVKNVAIGCGH 266
           +C  L  A C + +      C Y  SYGD S T G L ++  T       V  VA GCG 
Sbjct: 141 LCQGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTFVGAGASVPGVAFGCGL 200

Query: 267 KNQGMFV-GAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVF-------- 317
            N G+F     G+ G G G +SL  QL     G FS+C  +       +++         
Sbjct: 201 FNNGVFKSNETGIAGFGRGPLSLPSQL---KVGNFSHCFTAVNGLKPSTVLLDLPADLYK 257

Query: 318 -GREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDT 376
            GR A+       PL++NP  P+FYY+ L G+ VG  R+P+ E  F L + G  G ++D+
Sbjct: 258 SGRGAV----QSTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFAL-KNGTGGTIIDS 312

Query: 377 GTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVR--VPTVSFYFSG 434
           GTA+T LPT  Y   RDAF AQ   LP  SG +  D  + LS  +  +  VP +  +F G
Sbjct: 313 GTAMTSLPTRVYRLVRDAFAAQV-KLPVVSG-NTTDPYFCLSAPLRAKPYVPKLVLHFEG 370

Query: 435 GPVLTLPASNFL 446
              + LP  N++
Sbjct: 371 A-TMDLPRENYV 381


>gi|388502342|gb|AFK39237.1| unknown [Lotus japonicus]
          Length = 440

 Score =  175 bits (443), Expect = 5e-41,   Method: Compositional matrix adjust.
 Identities = 116/360 (32%), Positives = 183/360 (50%), Gaps = 32/360 (8%)

Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC--SQCYKQSDPVFDPADSASFSGVS 210
           +G Y +RI +G+P   +  + D+GSD+ WVQC PC  ++C+ Q+ P++DP +S++F+ + 
Sbjct: 93  NGNYLMRIYIGTPSVERLAIADTGSDLTWVQCSPCDNTKCFAQNTPLYDPLNSSTFTLLP 152

Query: 211 CSSAVCDRL---ENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVV---KNVAIGC 264
           C S  C +L   +      G C Y  +YGD SY+ G L+ +++ +    +     +  GC
Sbjct: 153 CDSQPCTQLPYSQYVCSDYGDCIYAYTYGDNSYSYGGLSSDSIRLMLLQLHYNSKICFGC 212

Query: 265 GHKNQGMFVG-----AAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGR 319
           G +N+  F         G++GLG G +SLV QLG + G  FSYCL+   + S+  L FG 
Sbjct: 213 GFQNK--FTADKSGKTTGIVGLGAGPLSLVSQLGDEIGHKFSYCLLPFSSNSNSKLKFGE 270

Query: 320 EALPVGAAWV--PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTG 377
            A+  G   V  PL+  P  P FYY+ L G+ VG   +         T   D  +++D+G
Sbjct: 271 AAIVQGNGVVSTPLIIKPDLP-FYYLNLEGITVGAKTVK--------TGQTDGNIIIDSG 321

Query: 378 TAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSI-FDTCYNLSGFVSVRVPTVSFYFSGGP 436
           + +T L    Y  F  + V +T  +     +   FD C+     +S   P V F+F+GG 
Sbjct: 322 STLTYLEESFYNEFV-SLVKETVAVEEDQYIPYPFDFCFTYKEGMSTP-PDVVFHFTGGD 379

Query: 437 VLTLPASNFLIPVDDAGTFCFAFAPSP-SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           V+  P  N L+ ++D    C    PS   G++I GN+ Q    + +D   G V F P  C
Sbjct: 380 VVLKPM-NTLVLIED-NLICSTVVPSHFDGIAIFGNLGQIDFHVGYDIQGGKVSFAPTDC 437


>gi|414587000|tpg|DAA37571.1| TPA: hypothetical protein ZEAMMB73_036171 [Zea mays]
          Length = 459

 Score =  174 bits (442), Expect = 7e-41,   Method: Compositional matrix adjust.
 Identities = 139/423 (32%), Positives = 193/423 (45%), Gaps = 46/423 (10%)

Query: 105 QHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGS 164
           Q      +QR + R   +V R  GG AD A   V      V      G GEY V++G G+
Sbjct: 47  QELIRRAVQRSLDRPG-IVARSGGGAADEAGKAVASEAPLV-----PGGGEYLVKLGTGT 100

Query: 165 PPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGC 224
           P       ID+ SD+VW+QCQPC  CY+Q DPVF+P  S+S++ V C+S  C +L+   C
Sbjct: 101 PQHFFSAAIDTASDLVWMQCQPCVSCYRQLDPVFNPKLSSSYAVVPCTSDTCAQLDGHRC 160

Query: 225 HA---GRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQ-GMFVGAAGLLG 280
           H    G C+Y   Y     TKGTLA++ L IG  V   V  GC   +  G    A+GL+G
Sbjct: 161 HEDDDGACQYTYKYSGHGVTKGTLAIDKLAIGGDVFHAVVFGCSDSSVGGPAAQASGLVG 220

Query: 281 LGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPV----GAAWVPLVRNPR 336
           LG G +SLV QL       F YCL    + +SG LV G  A  V        V +  + R
Sbjct: 221 LGRGPLSLVSQLSVHR---FMYCLPPPMSRTSGKLVLGAGADAVRNMSDRVTVTMSSSTR 277

Query: 337 APSFYYVGLSGLGVGGMRIPISED-------------------LFRLTQMGDDGVVMDTG 377
            PS+YY+ L GL VG      + +                   +         G+++D  
Sbjct: 278 YPSYYYLNLDGLAVGDQTPGTTRNATSPPSGGAGGGGGGGGGGIVGAGGANAYGMIVDVA 337

Query: 378 TAVTRLPTPAYEAFRDAFVAQTGNLPRAS-GVSI-FDTCYNLS---GFVSVRVPTVSFYF 432
           + ++ L T  Y+   D    +   LPRA+  + +  D C+ L    G   V VPTVS  F
Sbjct: 338 STISFLETSLYDELADDLEEEI-RLPRATPSLRLGLDLCFILPEGVGMDRVYVPTVSLSF 396

Query: 433 SGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGP 492
            G   L L        V D    C     + SG+SI+GN Q + +++ F+   G + F  
Sbjct: 397 DGR-WLELDRDRLF--VTDGRMMCLMIGRT-SGVSILGNFQLQNMRVLFNLRRGKITFAK 452

Query: 493 NVC 495
             C
Sbjct: 453 ASC 455


>gi|242092902|ref|XP_002436941.1| hypothetical protein SORBIDRAFT_10g011760 [Sorghum bicolor]
 gi|241915164|gb|EER88308.1| hypothetical protein SORBIDRAFT_10g011760 [Sorghum bicolor]
          Length = 445

 Score =  174 bits (441), Expect = 8e-41,   Method: Compositional matrix adjust.
 Identities = 126/356 (35%), Positives = 180/356 (50%), Gaps = 22/356 (6%)

Query: 148 GMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS--QCYKQSDPVFDPADSAS 205
           G    S EY   +  G+P   Q +VID+GSD+ W+QC+PCS  QC  Q DP+FDP+ S++
Sbjct: 104 GTSVKSLEYVATVSFGTPAVPQVVVIDTGSDLTWLQCKPCSSGQCSPQKDPLFDPSHSST 163

Query: 206 FSGVSCSSAVCDRLE----NAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGR-TVVKN 259
           +S V C+S  C +L      +GC  G+ C + +SY DG+ T G    + LT+    +VK+
Sbjct: 164 YSAVPCASGECKKLAADAYGSGCSNGQPCGFAISYVDGTSTVGVYGKDKLTLAPGAIVKD 223

Query: 260 VAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGR 319
              GCGH    +     GLLGLG  S SL  Q     GG FSYCL +  +   G L FG 
Sbjct: 224 FYFGCGHSKSSLPGLFDGLLGLGRLSESLGAQY--GGGGGFSYCLPAVNS-KPGFLAFGA 280

Query: 320 EALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTA 379
              P G  + P+ R P  P+F  V L+G+ VGG ++ +    F        G+++D+GT 
Sbjct: 281 GRNPSGFVFTPMGRVPGQPTFSTVTLAGITVGGKKLDLRPSAF------SGGMIVDSGTV 334

Query: 380 VTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLT 439
           VT L +  Y A R AF           G    DTCY+L+G+ +V VP ++  FSGG  + 
Sbjct: 335 VTVLQSTVYRALRAAFREAMKAYRLVHG--DLDTCYDLTGYKNVVVPKIALTFSGGATIN 392

Query: 440 LPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           L   N ++     G   FA         ++GN+ Q   ++ FD +    GF    C
Sbjct: 393 LDVPNGIL---VNGCLAFAETGKDGTAGVLGNVNQRTFEVLFDTSASKFGFRAKAC 445


>gi|359806832|ref|NP_001241567.1| uncharacterized protein LOC100819698 precursor [Glycine max]
 gi|255638149|gb|ACU19388.1| unknown [Glycine max]
          Length = 437

 Score =  174 bits (441), Expect = 8e-41,   Method: Compositional matrix adjust.
 Identities = 136/439 (30%), Positives = 197/439 (44%), Gaps = 28/439 (6%)

Query: 64  HNNISSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLV 123
           HN    +    D     L++ H     S    +  M +        A+ Q  ++ +++LV
Sbjct: 19  HNPKCDATHQHDHDGSTLQVFHVFSPCSPFRPSKPMSWEESVLKLQAKDQARMQYLSSLV 78

Query: 124 RRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQ 183
            R S     + +   Q             S  Y V+  +G+P ++  + +D+ +D  WV 
Sbjct: 79  ARRSIVPIASGRQITQ-------------SPTYIVKAKIGTPAQTLLLAMDTSNDASWVP 125

Query: 184 CQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKG 243
           C  C  C   +   F PA S +F  V C ++ C ++ N  C    C +  +YG  S    
Sbjct: 126 CTACVGCSTTTP--FAPAKSTTFKKVGCGASQCKQVRNPTCDGSACAFNFTYGTSS-VAA 182

Query: 244 TLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYC 303
           +L  +T+T+    V   A GC  K  G  V   GLLGLG G +SL+ Q        FSYC
Sbjct: 183 SLVQDTVTLATDPVPAYAFGCIQKVTGSSVPPQGLLGLGRGPLSLLAQTQKLYQSTFSYC 242

Query: 304 LVSRGTGS-SGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLF 362
           L S  T + SGSL  G  A P    + PL++NPR  S YYV L  + VG   + I  +  
Sbjct: 243 LPSFKTLNFSGSLRLGPVAQPKRIKFTPLLKNPRRSSLYYVNLVAIRVGRRIVDIPPEAL 302

Query: 363 RLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSI--FDTCYNLSGF 420
                   G V D+GT  TRL  PAY A R+ F  +     + +  S+  FDTCY     
Sbjct: 303 AFNANTGAGTVFDSGTVFTRLVEPAYNAVRNEFRRRIAVHKKLTVTSLGGFDTCYT---- 358

Query: 421 VSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSP----SGLSIIGNIQQEG 476
             +  PT++F FSG  V TLP  N LI        C A AP+P    S L++I N+QQ+ 
Sbjct: 359 APIVAPTITFMFSGMNV-TLPPDNILIHSTAGSVTCLAMAPAPDNVNSVLNVIANMQQQN 417

Query: 477 IQISFDGANGFVGFGPNVC 495
            ++ FD  N  +G    +C
Sbjct: 418 HRVLFDVPNSRLGVARELC 436


>gi|15226317|ref|NP_180370.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|4063755|gb|AAC98463.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|20197953|gb|AAM15327.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|330252977|gb|AEC08071.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 392

 Score =  174 bits (441), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 108/348 (31%), Positives = 163/348 (46%), Gaps = 29/348 (8%)

Query: 156 YFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAV 215
           Y +++ VG+PP      ID+GSD++W QC PC+ CY Q  P+FDP++S++F         
Sbjct: 61  YLMKLQVGTPPFEIEAEIDTGSDLIWTQCMPCTNCYSQYAPIFDPSNSSTF--------- 111

Query: 216 CDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT-----VVKNVAIGCGHKNQG 270
               +   C+   C Y++ Y D +Y+KGTLA ET+TI  T     V+    IGCGH +  
Sbjct: 112 ----KEKRCNGNSCHYKIIYADTTYSKGTLATETVTIHSTSGEPFVMPETTIGCGHNSSW 167

Query: 271 MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWV- 329
                +G++GL  G  SL+ Q+GG+  G  SYC  S+GT     + FG  A+  G   V 
Sbjct: 168 FKPTFSGMVGLSWGPSSLITQMGGEYPGLMSYCFASQGT---SKINFGTNAIVAGDGVVS 224

Query: 330 -PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAY 388
             +      P  YY+ L  + VG   +      F   +     +++D+GT +T  P    
Sbjct: 225 TTMFLTTAKPGLYYLNLDAVSVGDTHVETMGTTFHALE---GNIIIDSGTTLTYFPVSYC 281

Query: 389 EAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIP 448
              R+A       +  A        CY  +  + +  P ++ +FSGG  L L   N  I 
Sbjct: 282 NLVREAVDHYVTAVRTADPTGNDMLCY-YTDTIDI-FPVITMHFSGGADLVLDKYNMYIE 339

Query: 449 VDDAGTFCFA-FAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
               GTFC A    +P   +I GN  Q    + +D ++  V F P  C
Sbjct: 340 TITRGTFCLAIICNNPPQDAIFGNRAQNNFLVGYDSSSLLVSFSPTNC 387


>gi|125561849|gb|EAZ07297.1| hypothetical protein OsI_29545 [Oryza sativa Indica Group]
          Length = 451

 Score =  174 bits (440), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 120/370 (32%), Positives = 184/370 (49%), Gaps = 33/370 (8%)

Query: 150 DQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQ----CYKQSDPVFDPADSAS 205
           DQG   + + +G+G+PP+ + +++D+GSD++W QC+  S         S PV+DP +S++
Sbjct: 88  DQG---HSLTVGIGTPPQPRKLIVDTGSDLIWTQCKLSSSTAVAARHGSPPVYDPGESST 144

Query: 206 FSGVSCSSAVCD--RLENAGCHA-GRCRYEVSYGDGSYTKGTLALETLTIG--RTVVKNV 260
           F+ + CS  +C   +     C +  RC YE  YG  +   G LA ET T G  R V   +
Sbjct: 145 FAFLPCSDRLCQEGQFSFKNCTSKNRCVYEDVYGSAAAV-GVLASETFTFGARRAVSLRL 203

Query: 261 AIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFG-- 318
             GCG  + G  +GA G+LGL   S+SL+ QL  Q    FSYCL       +  L+FG  
Sbjct: 204 GFGCGALSAGSLIGATGILGLSPESLSLITQLKIQR---FSYCLTPFADKKTSPLLFGAM 260

Query: 319 ----REALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVM 374
               R           +V NP    +YYV L G+ +G  R+ +      +   G  G ++
Sbjct: 261 ADLSRHKTTRPIQTTAIVSNPVKTVYYYVPLVGISLGHKRLAVPAASLAMRPDGGGGTIV 320

Query: 375 DTGTAVTRLPTPAYEAFRDAFVAQTGNLPRAS-GVSIFDTCYNL------SGFVSVRVPT 427
           D+G+ V  L   A+EA ++A V     LP A+  V  ++ C+ L      +   +V+VP 
Sbjct: 321 DSGSTVAYLVEAAFEAVKEA-VMDVVRLPVANRTVEDYELCFVLPRRTAAAAMEAVQVPP 379

Query: 428 VSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSP--SGLSIIGNIQQEGIQISFDGAN 485
           +  +F GG  + LP  N+      AG  C A   +   SG+SIIGN+QQ+ + + FD  +
Sbjct: 380 LVLHFDGGAAMVLPRDNYFQE-PRAGLMCLAVGKTTDGSGVSIIGNVQQQNMHVLFDVQH 438

Query: 486 GFVGFGPNVC 495
               F P  C
Sbjct: 439 HKFSFAPTQC 448


>gi|388502484|gb|AFK39308.1| unknown [Medicago truncatula]
          Length = 425

 Score =  173 bits (439), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 129/412 (31%), Positives = 192/412 (46%), Gaps = 31/412 (7%)

Query: 81  LELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQD 140
           L+++H     S    +  + +        A+    ++ + +LV R S     + +  +Q 
Sbjct: 31  LQVIHVFSPCSPFRPSKPLSWEESVLQMQAKDTTRLQFLDSLVARKSIVPIASGRQIIQ- 89

Query: 141 FGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDP 200
                       S  Y VR  +G+PP++  + +D+ +D  W+ C  C  C   +  +F P
Sbjct: 90  ------------SPTYIVRAKIGTPPQTLLLAMDTSNDAAWIPCTACDGC---ASTLFAP 134

Query: 201 ADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNV 260
             S +F  VSC++  C ++ N GC      + ++YG  S     L  +T+T+    V + 
Sbjct: 135 EKSTTFKNVSCAAPECKQVPNPGCGVSSRNFNLTYGSSSIA-ANLVQDTITLATDPVPSY 193

Query: 261 AIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGR 319
             GC  K  G      GLLGLG G +SL+ Q        FSYCL S +    SGSL  G 
Sbjct: 194 TFGCVSKTTGTSAPPQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPSFKSLNFSGSLRLGP 253

Query: 320 EALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTA 379
            A P    + PL++NPR  S YYV L  + VG   + I             G + D+GT 
Sbjct: 254 VAQPKRIKYTPLLKNPRRSSLYYVNLEAIRVGRKVVDIPPAALAFNPTTGAGTIFDSGTV 313

Query: 380 VTRLPTPAYEAFRDAFVAQTGNLPRASGVSI--FDTCYNLSGFVSVRVPTVSFYFSGGPV 437
            TRL  P Y A RD F  + G  P+ +  S+  FDTCYN    V + VPT++F F+G  V
Sbjct: 314 FTRLVAPVYVAVRDEFRRRVG--PKLTVTSLGGFDTCYN----VPIVVPTITFIFTGMNV 367

Query: 438 LTLPASNFLIPVDDAGTFCFAFAPSP----SGLSIIGNIQQEGIQISFDGAN 485
            TLP  N LI      T C A A +P    S L++I N+QQ+  ++ +D  N
Sbjct: 368 -TLPQDNILIHSTAGSTTCLAMAGAPDNVNSVLNVIANMQQQNHRVLYDVPN 418


>gi|242069057|ref|XP_002449805.1| hypothetical protein SORBIDRAFT_05g023600 [Sorghum bicolor]
 gi|241935648|gb|EES08793.1| hypothetical protein SORBIDRAFT_05g023600 [Sorghum bicolor]
          Length = 430

 Score =  173 bits (438), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 118/364 (32%), Positives = 179/364 (49%), Gaps = 29/364 (7%)

Query: 149 MDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSG 208
           +  G  EY + + +G+PP     + D+GSD+ W QC+PC  C+ Q  P++D   S+SFS 
Sbjct: 76  LRSGQAEYLMELAIGTPPVPFIALADTGSDLTWTQCKPCKLCFGQDTPIYDTTTSSSFSP 135

Query: 209 VSCSSAVCDRLENAGCH--AGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGH 266
           + CSSA C  + ++ C   +  CRY  +Y DG+Y+     +         V  +A GCG 
Sbjct: 136 LPCSSATCLPIWSSRCSTPSATCRYRYAYDDGAYSPECAGIS--------VGGIAFGCGV 187

Query: 267 KNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALPVG 325
            N G+   + G +GLG GS+SLV QLG    G FSYCL     T  S  + FG  A    
Sbjct: 188 DNGGLSYNSTGTVGLGRGSLSLVAQLG---VGKFSYCLTDFFNTSLSSPVFFGSLAELAA 244

Query: 326 AAW---------VPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLT-QMGDDGVVMD 375
           ++           PLV++P  PS YYV L G+ +G  R+PI    F L    G  G+++D
Sbjct: 245 SSASADAAVVQSTPLVQSPYNPSRYYVSLEGISLGDARLPIPNGTFDLNDDDGSGGMIVD 304

Query: 376 TGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCY--NLSGFVSV-RVPTVSFYF 432
           +GT  T L    +    D      G  P  +  S+   C+    +G   +  +P +  +F
Sbjct: 305 SGTIFTILVETGFRVVVDHVAGVLGQ-PVVNASSLDRPCFPAPAAGVQELPDMPDMVLHF 363

Query: 433 SGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGL-SIIGNIQQEGIQISFDGANGFVGFG 491
           +GG  + L   N++   ++  +FC     + S   S++GN QQ+ IQ+ FD   G + F 
Sbjct: 364 AGGADMRLHRDNYMSFNEEESSFCLNIVGTESASGSVLGNFQQQNIQMLFDITVGQLSFM 423

Query: 492 PNVC 495
           P  C
Sbjct: 424 PTDC 427


>gi|67633548|gb|AAY78698.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 392

 Score =  173 bits (438), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 108/348 (31%), Positives = 163/348 (46%), Gaps = 29/348 (8%)

Query: 156 YFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAV 215
           Y +++ VG+PP      ID+GSD++W QC PC+ CY Q  P+FDP++S++F         
Sbjct: 61  YLMKLQVGTPPFEIEAEIDTGSDLIWTQCMPCTNCYSQYAPIFDPSNSSTF--------- 111

Query: 216 CDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT-----VVKNVAIGCGHKNQG 270
               +   C+   C Y++ Y D +Y+KGTLA ET+TI  T     V+    IGCGH +  
Sbjct: 112 ----KEKRCNGNSCHYKIIYADTTYSKGTLATETVTIHSTSGEPFVMPETTIGCGHNSSW 167

Query: 271 MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWV- 329
                +G++GL  G  SL+ Q+GG+  G  SYC  S+GT     + FG  A+  G   V 
Sbjct: 168 FKPTFSGMVGLSWGPSSLITQMGGEYPGLMSYCFASQGT---SKINFGTNAIVAGDGVVS 224

Query: 330 -PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAY 388
             +      P  YY+ L  + VG   +      F   +     +++D+GT +T  P    
Sbjct: 225 TTMFLTTAKPGLYYLNLDAVSVGDTHVETMGTTFHALE---GNIIIDSGTTLTYFPVSYC 281

Query: 389 EAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIP 448
              R+A       +  A        CY  +  + +  P ++ +FSGG  L L   N  I 
Sbjct: 282 NLVREAVDHYVTAVRTADPTGNDMLCY-YTDTIDI-FPVITMHFSGGADLVLDKYNMYIE 339

Query: 449 VDDAGTFCFA-FAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
               GTFC A    +P   +I GN  Q    + +D ++  V F P  C
Sbjct: 340 TITRGTFCLAIICNNPPQDAIFGNRAQNNFLVGYDSSSLLVFFSPTNC 387


>gi|356539555|ref|XP_003538263.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 438

 Score =  172 bits (437), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 121/353 (34%), Positives = 170/353 (48%), Gaps = 20/353 (5%)

Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
           S  Y VR  +G+PP++  + ID+ +D  W+ C  C  C   +  +F P  S +F  VSC 
Sbjct: 94  SPTYIVRAKIGTPPQTLLLAIDTSNDAAWIPCTACDGC---TSTLFAPEKSTTFKNVSCG 150

Query: 213 SAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMF 272
           S  C+++ +  C    C + ++YG  S     +  +T+T+    +     GC  K  G  
Sbjct: 151 SPECNKVPSPSCGTSACTFNLTYGSSSIAANVVQ-DTVTLATDPIPGYTFGCVAKTTGPS 209

Query: 273 VGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALPVGAAWVPL 331
               GLLGLG G +SL+ Q        FSYCL S +    SGSL  G  A P+   + PL
Sbjct: 210 TPPQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPSFKSLNFSGSLRLGPVAQPIRIKYTPL 269

Query: 332 VRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAF 391
           ++NPR  S YYV L  + VG   + I             G V D+GT  TRL  P Y A 
Sbjct: 270 LKNPRRSSLYYVNLFAIRVGRKIVDIPPAALAFNAATGAGTVFDSGTVFTRLVAPVYTAV 329

Query: 392 RDAF-----VAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFL 446
           RD F     +A   NL   + +  FDTCY     V +  PT++F FSG  V TLP  N L
Sbjct: 330 RDEFRRRVAMAAKANL-TVTSLGGFDTCYT----VPIVAPTITFMFSGMNV-TLPQDNIL 383

Query: 447 IPVDDAGTFCFAFAPSP----SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           I      T C A A +P    S L++I N+QQ+  ++ +D  N  +G    +C
Sbjct: 384 IHSTAGSTSCLAMASAPDNVNSVLNVIANMQQQNHRVLYDVPNSRLGVARELC 436


>gi|326495450|dbj|BAJ85821.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 491

 Score =  172 bits (436), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 134/355 (37%), Positives = 169/355 (47%), Gaps = 48/355 (13%)

Query: 168 SQYMVIDSGSDIVWVQCQPC--SQCYKQSDPVFDPADSASFSGVSCSSAVCDRL--ENAG 223
           SQ M ID+  D+ W+QC PC   QCY Q +  FDP  S++ + V C S  C  L     G
Sbjct: 158 SQTMAIDTTEDVPWIQCLPCLIPQCYPQRNAFFDPRRSSTGAPVRCGSRACRTLGGYANG 217

Query: 224 CH----AGRCRYEVSYGDGSYTKGTLALETLTIG-RTVVKNVAIGCGHKNQGMFVG-AAG 277
           C      G C Y + Y D   T GT   +TLTI   T   N   GC H  +G F   A+G
Sbjct: 218 CSKPNSTGDCLYRIEYSDHRLTLGTYMTDTLTISPSTTFLNFRFGCSHAVRGKFSAQASG 277

Query: 278 LLGLGGGSMSLVGQLGGQTGGAFSYCLV---SRGTGSSGSLVFGREALPVGA-AWVPLVR 333
            + LGGG  SL+ Q     G AFSYC+    + G  S G  V G +    GA A  PLVR
Sbjct: 278 TMSLGGGPQSLLSQTARAYGNAFSYCVPGPSAAGFLSIGGPVNGDDGGGSGAFATTPLVR 337

Query: 334 --NPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAF 391
             N   P+ Y V L G+ V G R+ +   +F        G VMD+   +T+LP  AY A 
Sbjct: 338 SANVINPTIYVVRLQGIEVAGRRLNVPPVVF------SGGTVMDSSAVITQLPPTAYRAL 391

Query: 392 RDAF---------VAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPA 442
           R AF          A TGNL         DTC++  G   V VPTVS  F GG V+ L  
Sbjct: 392 RLAFRNAMRAYKTRAPTGNL---------DTCFDFVGVSKVTVPTVSLVFDGGAVIELGL 442

Query: 443 SNFLIPVDDAGTFCFAFAPSPS--GLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
            + L+        C AFAP  +   L  IGN+QQ+  ++ +D A G VGF    C
Sbjct: 443 LSVLL------DSCLAFAPMAADFALGFIGNVQQQTHEVLYDVAGGAVGFRHGAC 491


>gi|242050432|ref|XP_002462960.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
 gi|241926337|gb|EER99481.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
          Length = 445

 Score =  172 bits (436), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 123/371 (33%), Positives = 193/371 (52%), Gaps = 39/371 (10%)

Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC-SQCYKQSDPVFDPADSASFSGVSC 211
           +GEY + + +G+PP S   + D+GSD++W QC PC SQC++Q  P+++P+ S +F+ + C
Sbjct: 83  AGEYLMTLAIGTPPVSYQAIADTGSDLIWTQCAPCSSQCFQQPTPLYNPSSSTTFAVLPC 142

Query: 212 SS-------AVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIG------RTVVK 258
           +S       A+       GC    C Y ++YG G +T      ET T G      +T V 
Sbjct: 143 NSSLSMCAAALAGTTPPPGC---TCMYNMTYGSG-WTSVYQGSETFTFGSSTPANQTGVP 198

Query: 259 NVAIGCGHKNQGMFV-GAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLV 316
            +A GC + + G     A+GL+GLG GS+SLV QLG      FSYCL   + T S+ +L+
Sbjct: 199 GIAFGCSNASGGFNTSSASGLVGLGRGSLSLVSQLGVP---KFSYCLTPYQDTNSTSTLL 255

Query: 317 FGREAL---PVGAAWVPLVRNPR-AP--SFYYVGLSGLGVGGMRIPISEDLFRLTQMGDD 370
            G  A      G +  P V +P  AP  ++YY+ L+G+ +G   + I      L   G  
Sbjct: 256 LGPSASLNDTGGVSSTPFVASPSDAPMSTYYYLNLTGISLGTTALSIPTTALSLKADGTG 315

Query: 371 GVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSI---FDTCYNLSGFVSV--RV 425
           G ++D+GT +T L   AY+  R A V+    LP   G S     D C+ L    S    +
Sbjct: 316 GFIIDSGTTITLLGNTAYQQVRAAVVSLV-TLPTTDGGSAATGLDLCFELPSSTSAPPTM 374

Query: 426 PTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFA-PSPSGLSIIGNIQQEGIQISFDGA 484
           P+++ +F G  ++ LPA ++++   D+  +C A    +  G+SI+GN QQ+ + I +D  
Sbjct: 375 PSMTLHFDGADMV-LPADSYMM--LDSNLWCLAMQNQTDGGVSILGNYQQQNMHILYDVG 431

Query: 485 NGFVGFGPNVC 495
              + F P  C
Sbjct: 432 QETLTFAPAKC 442


>gi|359476204|ref|XP_002262813.2| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-2-like [Vitis vinifera]
          Length = 460

 Score =  172 bits (435), Expect = 5e-40,   Method: Compositional matrix adjust.
 Identities = 131/408 (32%), Positives = 194/408 (47%), Gaps = 43/408 (10%)

Query: 102 HRHQHSFHARMQRDVKRVATL---VRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFV 158
           H    S      RD  RV+ +     + + G      H    F  D         G + V
Sbjct: 80  HSQPPSPQEIFGRDESRVSFINSKCNQYTSGNLKNHAHNNNLFDED---------GNFLV 130

Query: 159 RIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDR 218
            +  G+P     +++D+GS I W QC+ C  C + S+  FD + S+++S  SC   +   
Sbjct: 131 DVAFGTPXTEIXLILDTGSSITWTQCKACVNCLQDSNRYFDSSASSTYSFGSC---IPST 187

Query: 219 LENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQGMF-VGAA 276
           +EN         Y ++YGD S + G    +T+T+  + V +    GCG  N+G F  G  
Sbjct: 188 VEN--------NYNMTYGDDSTSVGNYGCDTMTLEPSDVFQKFQFGCGRNNKGDFGSGVD 239

Query: 277 GLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAA--WVPLVRN 334
           G+LGLG G +S V Q   +    FSYCL      S GSL+FG +A    ++  +  LV  
Sbjct: 240 GMLGLGQGQLSTVSQTASKFNKVFSYCLPEE--DSIGSLLFGEKATSQSSSLKFTSLVNG 297

Query: 335 P---RAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAF 391
           P   +   +Y+V LS + VG  R+ I   +F        G ++D+ T +TRLP  AY A 
Sbjct: 298 PGTLQESGYYFVNLSDISVGNERLNIPSSVF-----ASPGTIIDSRTVITRLPQRAYSAL 352

Query: 392 RDAFVAQTGNLPRASGV----SIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLI 447
           + AF       P ++G      I DTCYNLSG   V +P +  +F GG  + L  +N ++
Sbjct: 353 KAAFKKAMAKYPLSNGRRKKGDILDTCYNLSGRKDVLLPEIVLHFGGGADVRLNGTN-IV 411

Query: 448 PVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
              DA   C AFA + S L+IIGN QQ  + + +D     +GFG N C
Sbjct: 412 WGSDASRLCLAFAGT-SELTIIGNRQQLSLTVLYDIQGRRIGFGGNGC 458


>gi|225465837|ref|XP_002264626.1| PREDICTED: aspartic proteinase nepenthesin-1-like isoform 1 [Vitis
           vinifera]
          Length = 437

 Score =  172 bits (435), Expect = 5e-40,   Method: Compositional matrix adjust.
 Identities = 131/430 (30%), Positives = 203/430 (47%), Gaps = 27/430 (6%)

Query: 71  NTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGG 130
           N  + +    L+++H     S       + +        A+ +  ++ +++LV R S   
Sbjct: 29  NCETPDQGSTLQVLHVYSPCSPFRPKEPLSWEESVLQMQAKDKARLQFLSSLVARKSVVP 88

Query: 131 ADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQC 190
             + +  VQ+               Y VR  +G+P ++  M +D+ SD+ W+   PC+ C
Sbjct: 89  IASGRQIVQN-------------PTYIVRAKIGTPAQTMLMAMDTSSDVAWI---PCNGC 132

Query: 191 YKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETL 250
              S  +F+   S ++  + C +A C ++    C  G C + ++YG GS     L+ +T+
Sbjct: 133 LGCSSTLFNSPASTTYKSLGCQAAQCKQVPKPTCGGGVCSFNLTYG-GSSLAANLSQDTI 191

Query: 251 TIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGT 309
           T+    V   + GC  K  G  + A GLLGLG G +SL+ Q        FSYCL S +  
Sbjct: 192 TLATDAVPGYSFGCIQKATGGSLPAQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPSFKSL 251

Query: 310 GSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGD 369
             SGSL  G    P    + PL++NPR PS Y+V L  + VG   + +    F       
Sbjct: 252 NFSGSLRLGPVGQPKRIKYTPLLKNPRRPSLYFVNLMAVRVGRRVVDVPPGSFTFNPSTG 311

Query: 370 DGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVS 429
            G + D+GT  TRL TPAY A RDAF  + G     + +  FDTCY     V +  PT++
Sbjct: 312 AGTIFDSGTVFTRLVTPAYIAVRDAFRNRVGRNLTVTSLGGFDTCYT----VPIAAPTIT 367

Query: 430 FYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSP----SGLSIIGNIQQEGIQISFDGAN 485
           F F+G  V TLP  N LI      T C A A +P    S L++I N+QQ+  ++ +D  N
Sbjct: 368 FMFTGMNV-TLPPDNLLIHSTAGSTTCLAMAAAPDNVNSVLNVIANLQQQNHRLLYDVPN 426

Query: 486 GFVGFGPNVC 495
             +G    +C
Sbjct: 427 SRLGVARELC 436


>gi|297823357|ref|XP_002879561.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297325400|gb|EFH55820.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 447

 Score =  171 bits (434), Expect = 5e-40,   Method: Compositional matrix adjust.
 Identities = 127/436 (29%), Positives = 203/436 (46%), Gaps = 49/436 (11%)

Query: 80  NLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQ 139
           ++EL+HRD   S      N    R     +A   R + R   L   LS            
Sbjct: 27  SVELIHRDSPLSPLYNPKNTVTDR----LNAAFLRSISRSRRLNNILSQ----------- 71

Query: 140 DFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFD 199
              TD+ SG+    GE+F+ I +G+PP   + + D+GSD+ WVQC+PC QCYK++ P+FD
Sbjct: 72  ---TDLQSGLIGADGEFFMSITIGTPPMKVFAIADTGSDLTWVQCKPCQQCYKENGPIFD 128

Query: 200 PADSASFSGVSCSSAVCDRLENA--GCHAGR--CRYEVSYGDGSYTKGTLALETLTIGRT 255
              S+++    C S  C  L ++  GC   +  C+Y  SYGD S++KG +A ET++I   
Sbjct: 129 KKKSSTYKSEPCDSRNCHALSSSERGCDESKNVCKYRYSYGDQSFSKGDVATETISIDSA 188

Query: 256 VVKNVA-----IGCGHKNQGMFVGAAGLLGLGGGS-MSLVGQLGGQTGGAFSYCLVSRGT 309
               V+      GCG+ N G F      +   GG  +SL+ QLG      FSYCL  +  
Sbjct: 189 SGSPVSFPGTVFGCGYNNGGTFDETGSGIIGLGGGHLSLISQLGSSISKKFSYCLSHKSA 248

Query: 310 GSSGSLV--FGREALP------VGAAWVPLV-RNPRAPSFYYVGLSGLGVGGMRIPISED 360
            ++G+ V   G  ++P       G    PLV + PR  ++YY+ L  + VG  +IP +  
Sbjct: 249 TTNGTSVINLGTNSIPSSLSKDSGVISTPLVDKEPR--TYYYLTLEAISVGKKKIPYTGS 306

Query: 361 LFRLTQMG-----DDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVS-IFDTC 414
            +     G        +++D+GT +T L +  ++ F  A         R S    +   C
Sbjct: 307 SYNPNDGGIFSETSGNIIIDSGTTLTLLDSGFFDKFGAAVEELVTGAKRVSDPQGLLSHC 366

Query: 415 YNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQ 474
           +  SG   + +P ++ +F+G  V   P + F+   +D    C +  P+ + ++I GN  Q
Sbjct: 367 FK-SGSAEIGLPEITVHFTGADVRLSPINAFVKVSEDM--VCLSMVPT-TEVAIYGNFAQ 422

Query: 475 EGIQISFDGANGFVGF 490
               + +D     V F
Sbjct: 423 MDFLVGYDLETRTVSF 438


>gi|449527515|ref|XP_004170756.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial [Cucumis
           sativus]
          Length = 364

 Score =  171 bits (434), Expect = 6e-40,   Method: Compositional matrix adjust.
 Identities = 126/348 (36%), Positives = 170/348 (48%), Gaps = 13/348 (3%)

Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
           S  + VR  +G+P ++  + +D+ +D  W+ C  C  C   S  VF    S+SF  + C 
Sbjct: 23  SPTFVVRAKIGTPAQTLLLALDTSNDAAWIPCSGCIGC--PSTTVFSSDKSSSFRPLPCQ 80

Query: 213 SAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMF 272
           S  C+++ N  C    C + ++YG  S     L  + LT+    V +   GC  K  G  
Sbjct: 81  SPQCNQVPNPSCSGSACGFNLTYG-SSTVAADLVQDNLTLATDSVPSYTFGCIRKATGSS 139

Query: 273 VGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALPVGAAWVPL 331
           V   GLLGLG G +SL+GQ        FSYCL S +    SGSL  G  A P+   + PL
Sbjct: 140 VPPQGLLGLGRGPLSLLGQSQSLYQSTFSYCLPSFKSVNFSGSLRLGPVAQPIRIKYTPL 199

Query: 332 VRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAF 391
           +RNPR  S YYV L  + VG   + I             G V+D+GT  TRL  PAY A 
Sbjct: 200 LRNPRRSSLYYVNLISIRVGRKIVDIPPSALAFNSATGAGTVIDSGTTFTRLVAPAYTAV 259

Query: 392 RDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDD 451
           RD F  + G     S +  FDTCY     V +  PT++F F+G  V TLP  NFLI    
Sbjct: 260 RDEFRRRVGRNVTVSSLGGFDTCYT----VPIISPTITFMFAGMNV-TLPPDNFLIHSTS 314

Query: 452 AGTFCFAFAPSP----SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
             T C A A +P    S L++I ++QQ+  +I FD  N  VG     C
Sbjct: 315 GSTTCLAMAAAPDNVNSVLNVIASMQQQNHRILFDIPNSRVGVARESC 362


>gi|357118871|ref|XP_003561171.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 506

 Score =  171 bits (434), Expect = 6e-40,   Method: Compositional matrix adjust.
 Identities = 142/431 (32%), Positives = 191/431 (44%), Gaps = 60/431 (13%)

Query: 107 SFHARMQRDVKRVATLVRRLSGGGA---DAAKHEVQDFGTDVVSG-----------MDQG 152
           +  A +Q D  R   + R+LSG  A   DA +   Q   T V S             D  
Sbjct: 94  TLSATLQWDEHRAGHIQRKLSGNAAPMDDAGEETPQS--TQVTSSPAANVNVGKSSTDSA 151

Query: 153 SGEYFVRIGVGS------PPRSQYMVIDSGSDIVWVQCQPCSQ--CYKQSDPVFDPADSA 204
             +  V    G       P  +Q MV+D+ SD+ WVQC PC Q  CY QSD ++DP  S 
Sbjct: 152 FEQGIVPAATGPGGQKKLPGVAQSMVVDTASDVPWVQCAPCPQPQCYAQSDVLYDPTKSI 211

Query: 205 SFSGVSCSSAVCDRLEN--AGC----HAGRCRYEVSYGDGSYTKGTLALETLTIG---RT 255
             +   CSS  C  L     GC    + G C+Y V Y DGS T GT   + LT+    + 
Sbjct: 212 LSAPFPCSSPQCRSLGRYANGCTGAGNTGTCQYRVLYPDGSGTSGTYVSDLLTLNADPKG 271

Query: 256 VVKNVAIGCGHK--NQGMFVG-AAGLLGLGGGSMSLVGQLGG--QTGGAFSYCLVSRGTG 310
            V     GC H     G F    AG + LG G+ SL  Q  G    G  FSYCL   G+ 
Sbjct: 272 AVSKFQFGCSHALLRPGSFNNKTAGFMALGRGAQSLSSQTKGTFSKGNVFSYCLPPTGS- 330

Query: 311 SSGSLVFGREALPVGAA----WVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQ 366
             G L  G   +P  AA      P++++  AP  Y V L G+ V G R+P+   +F    
Sbjct: 331 HKGFLSLG---VPQHAASRYAVTPMLKSKMAPMIYMVRLIGIDVAGQRLPVPPAVFAAN- 386

Query: 367 MGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVP 426
                  MD+ T +TRLP  AY A R AF AQ       +     DTCY+ +G   VR+P
Sbjct: 387 -----AAMDSRTIITRLPPTAYMALRAAFRAQMRAYRAVAPKGQLDTCYDFTGVPMVRLP 441

Query: 427 TVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGL--SIIGNIQQEGIQISFDGA 484
            V+  F     + L  S  ++        C AFAP+ +     IIGN+QQ+ +++ ++  
Sbjct: 442 KVTLVFDRNAAVELDPSGVML------DSCLAFAPNANDFMPGIIGNVQQQTLEVLYNVD 495

Query: 485 NGFVGFGPNVC 495
              VGF    C
Sbjct: 496 GASVGFRRAAC 506


>gi|326520291|dbj|BAK07404.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 464

 Score =  171 bits (434), Expect = 6e-40,   Method: Compositional matrix adjust.
 Identities = 134/421 (31%), Positives = 193/421 (45%), Gaps = 45/421 (10%)

Query: 109 HARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSP-PR 167
           H  ++R V R    +  L     D A     D G     G D GS EY + +G+G+P P+
Sbjct: 52  HELLRRMVARSKARLASLRSSACDTALTAPVDHG-----GSDVGSSEYLIHLGIGTPRPQ 106

Query: 168 SQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDR---LENAGC 224
              + +D+GSD+VW QC  C+ C+ Q  PVF  + S +FS V CS  +C     L  +GC
Sbjct: 107 RVVLHLDTGSDLVWTQCA-CTVCFDQPVPVFRASVSHTFSRVPCSDPLCGHAVYLPLSGC 165

Query: 225 HAG--RCRYEVSYGDGSYTKGTLALETLTIGR-------TVVKNVAIGCGHKNQGMFV-G 274
            A    C Y   Y D S T G +A +T T            V N+  GCG  N G+F   
Sbjct: 166 AARDRSCFYAYGYMDHSITTGKMAEDTFTFKAPDRADTAAAVPNIRFGCGMMNYGLFTPN 225

Query: 275 AAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGA-AWVPLVR 333
            +G+ G G G +SL  QL  +    FSYC  +        ++ G E   + A A  P+  
Sbjct: 226 QSGIAGFGTGPLSLPSQLKVRR---FSYCFTAMEESRVSPVILGGEPENIEAHATGPIQS 282

Query: 334 NPRAP----------SFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRL 383
            P AP           FY++ L G+ VG  R+P +   F L   G  G  +D+GTA+T  
Sbjct: 283 TPFAPGPAGAPVGSQPFYFLSLRGVTVGETRLPFNASTFALKGDGSGGTFIDSGTAITFF 342

Query: 384 PTPAYEAFRDAFVAQTGNLPRASGVSIFDT--CYNLSGFVSV-RVPTVSFYFSGGPVLTL 440
           P   + + R+AFVAQ   LP A G +  D   C+++        VP +  +  G     L
Sbjct: 343 PQAVFRSLREAFVAQV-PLPVAKGYTDPDNLLCFSVPAKKKAPAVPKLILHLEGAD-WEL 400

Query: 441 PASNFLIPVDDAGT-----FCFA-FAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNV 494
           P  N+++  DD G+      C    +   S  +IIGN QQ+ + I +D  +  + F P  
Sbjct: 401 PRENYVLDNDDDGSGAGRKLCVVILSAGNSNGTIIGNFQQQNMHIVYDLESNKMVFAPAR 460

Query: 495 C 495
           C
Sbjct: 461 C 461


>gi|242072067|ref|XP_002451310.1| hypothetical protein SORBIDRAFT_05g027510 [Sorghum bicolor]
 gi|241937153|gb|EES10298.1| hypothetical protein SORBIDRAFT_05g027510 [Sorghum bicolor]
          Length = 509

 Score =  171 bits (433), Expect = 9e-40,   Method: Compositional matrix adjust.
 Identities = 154/536 (28%), Positives = 232/536 (43%), Gaps = 89/536 (16%)

Query: 16  LHLLCSIITTSTSAASDTHFQILNVNES-------IKGSRTDHAKMSQYNELFERHNNIS 68
           L +LC   +    A +D     + V  S        KG R  H  ++ Y+  +   +N  
Sbjct: 7   LLILCIATSLLADAGADDQVNYVVVETSSLKPSAVCKGHRV-HPSVNNYSSSWTPLSNPH 65

Query: 69  SSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARM--------QRDVKRVA 120
              + S E    ++       S+SS   + + + +H+  +  R           ++    
Sbjct: 66  GPCSPSWEEGAAMDY------SASSMVDDMLRWDQHRAGYIQRKLSGNVSHEDTEISDST 119

Query: 121 TLVRRLSGGGAD-----------AAKHEVQDFGTDVVSGMDQ--------GSGEYFVRIG 161
           T +  ++GGGA             AK + QD    VV  +          GS    +R G
Sbjct: 120 TTLESVNGGGAGDFSMGDDGTGGMAKAQQQDTHHQVVEELSSAADPAATGGSRRSRLRPG 179

Query: 162 VGSPPRSQYMVIDSGSDIVWVQCQPC--SQCYKQSDPVFDPADSASFSGVSCSSAVCDRL 219
           V      Q M++D+ SD+ WVQC PC  SQCY Q+D ++DP+ S S    +CSS  C +L
Sbjct: 180 V-----RQLMLLDTASDVAWVQCFPCPASQCYAQTDVLYDPSKSRSSESFACSSPTCRQL 234

Query: 220 --ENAGCH-----AGRCRYEVSYGDGSYTKGTLALETLTIGRTV-VKNVAIGCGHKNQGM 271
                GC      AG+C+Y V Y DGS T GTL  + L++  T  V     GC H  +G 
Sbjct: 235 GPYANGCSSSSNSAGQCQYRVRYPDGSTTSGTLVADQLSLSPTSQVPKFEFGCSHAARGS 294

Query: 272 FV--GAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGA--- 326
           F     AG++ LG G  SLV Q   + G  FSYC     +   G  V G   +P  +   
Sbjct: 295 FSRSKTAGIMALGRGVQSLVSQTSTKYGQVFSYCFPPTAS-HKGFFVLG---VPRRSSSR 350

Query: 327 -AWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPT 385
            A  P+++ P     Y V L  + V G R+ +   +F        G  +D+ T +TRLP 
Sbjct: 351 YAVTPMLKTPM---LYQVRLEAIAVAGQRLDVPPTVFAA------GAALDSRTVITRLPP 401

Query: 386 PAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNF 445
            AY+A R AF  +      A+     DTCY+ +G  S+ +PT+S  F          +  
Sbjct: 402 TAYQALRSAFRDKMSMYRPAAANGQLDTCYDFTGVSSIMLPTISLVFD--------RTGA 453

Query: 446 LIPVDDAGTF---CFAFAPSP---SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
            + +D +G     C AFA +        IIG +Q + I++ ++ A G VGF    C
Sbjct: 454 GVQLDPSGVLFGSCLAFASTAGDDRATGIIGFLQLQTIEVLYNVAGGSVGFRRGAC 509


>gi|30686482|ref|NP_850251.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|122215044|sp|Q3EBM5.1|ASPR1_ARATH RecName: Full=Probable aspartic protease At2g35615; Flags:
           Precursor
 gi|330254036|gb|AEC09130.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 447

 Score =  170 bits (431), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 125/439 (28%), Positives = 204/439 (46%), Gaps = 53/439 (12%)

Query: 79  WNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEV 138
           +++EL+HRD   S          +  Q +   R+     R  +  RR +        H++
Sbjct: 26  FSVELIHRDSPLSP--------IYNPQITVTDRLNAAFLRSVSRSRRFN--------HQL 69

Query: 139 QDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVF 198
               TD+ SG+    GE+F+ I +G+PP   + + D+GSD+ WVQC+PC QCYK++ P+F
Sbjct: 70  SQ--TDLQSGLIGADGEFFMSITIGTPPIKVFAIADTGSDLTWVQCKPCQQCYKENGPIF 127

Query: 199 DPADSASFSGVSCSSAVCDRLENA--GCHAGR--CRYEVSYGDGSYTKGTLALETLTIGR 254
           D   S+++    C S  C  L +   GC      C+Y  SYGD S++KG +A ET++I  
Sbjct: 128 DKKKSSTYKSEPCDSRNCQALSSTERGCDESNNICKYRYSYGDQSFSKGDVATETVSIDS 187

Query: 255 TVVKNVA-----IGCGHKNQGMFVGAAGLLGLGGGS-MSLVGQLGGQTGGAFSYCLVSRG 308
                V+      GCG+ N G F      +   GG  +SL+ QLG      FSYCL  + 
Sbjct: 188 ASGSPVSFPGTVFGCGYNNGGTFDETGSGIIGLGGGHLSLISQLGSSISKKFSYCLSHKS 247

Query: 309 TGSSGSLV--FGREALP------VGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISED 360
             ++G+ V   G  ++P       G    PLV +    ++YY+ L  + VG  +IP +  
Sbjct: 248 ATTNGTSVINLGTNSIPSSLSKDSGVVSTPLV-DKEPLTYYYLTLEAISVGKKKIPYTGS 306

Query: 361 LFRLTQMGDDG--------VVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVS-IF 411
            +      DDG        +++D+GT +T L    ++ F  A         R S    + 
Sbjct: 307 SY---NPNDDGILSETSGNIIIDSGTTLTLLEAGFFDKFSSAVEESVTGAKRVSDPQGLL 363

Query: 412 DTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGN 471
             C+  SG   + +P ++ +F+G  V   P + F+   +D    C +  P+ + ++I GN
Sbjct: 364 SHCFK-SGSAEIGLPEITVHFTGADVRLSPINAFVKLSEDM--VCLSMVPT-TEVAIYGN 419

Query: 472 IQQEGIQISFDGANGFVGF 490
             Q    + +D     V F
Sbjct: 420 FAQMDFLVGYDLETRTVSF 438


>gi|357118738|ref|XP_003561107.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 491

 Score =  170 bits (431), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 138/437 (31%), Positives = 200/437 (45%), Gaps = 56/437 (12%)

Query: 90  SSSSNTTNNMHYHRHQHSFHARMQRD-VKRVATLVRRLSGGGADAAKHEVQDFGTDVVSG 148
           +  S+    + + +H+  +  R   D V    +++ ++S  G    K   Q  GT V   
Sbjct: 80  APPSSVAETLRWDQHRAGYIQRKLEDQVPITRSVITQVSHQGVVQPKVGTQGQGTGV--- 136

Query: 149 MDQGSGEYFVRIGVGSPPR------SQYMVIDSGSDIVWVQCQPCS--QCYKQSDPVFDP 200
             Q +GE      VG  P       +Q MVID+ SD+ WVQC PC    C+ Q+D ++DP
Sbjct: 137 --QPAGE-----PVGDAPTGGSGGVAQTMVIDTASDVPWVQCAPCPAPHCHAQTDVLYDP 189

Query: 201 ADSASFSGVSCSSAVCDRL---ENAGCHAG-RCRYEVSYGDGSYTKGTLALETLTIGR-- 254
           + S+S +   CSS  C  L    N    AG +C+Y V Y DGS + GT   + LT+    
Sbjct: 190 SKSSSSAAFPCSSPACRNLGPYANGCTPAGDQCQYRVQYPDGSASAGTYISDVLTLNPAK 249

Query: 255 --TVVKNVAIGCGHK--NQGMFVG-AAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGT 309
             + +     GC H     G F    +G++ LG G+ SL  Q     G  FSYCL     
Sbjct: 250 PASAISEFRFGCSHALLQPGSFSNKTSGIMALGRGAQSLPTQTKATYGDVFSYCLPPTPV 309

Query: 310 GSSGSLVFGREALPVGA-AWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMG 368
             SG  + G   +     A  P++R+  AP  Y V L  + V G R+P+   +F      
Sbjct: 310 -HSGFFILGVPRVAASRYAVTPMLRSKAAPMLYLVRLIAIEVAGKRLPVPPAVFAA---- 364

Query: 369 DDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLS-----GFVSV 423
             G VMD+ T VTRLP  AY A R AFVA+      A+     DTCY+ S     G   V
Sbjct: 365 --GAVMDSRTIVTRLPPTAYMALRAAFVAEMRAYRAAAPKEHLDTCYDFSGAAPGGGGGV 422

Query: 424 RVPTVSFYFSGGPVLTLPASNFLIPVDDAGTF---CFAFAPSPSG--LSIIGNIQQEGIQ 478
           ++P ++  F G         N  + +D +G     C AFAP+       IIGN+QQ+ ++
Sbjct: 423 KLPKITLVFDG--------PNGAVELDPSGVLLDGCLAFAPNTDDQMTGIIGNVQQQALE 474

Query: 479 ISFDGANGFVGFGPNVC 495
           + ++     VGF    C
Sbjct: 475 VLYNVDGATVGFRRGAC 491


>gi|293332561|ref|NP_001170100.1| uncharacterized protein LOC100384018 precursor [Zea mays]
 gi|224033441|gb|ACN35796.1| unknown [Zea mays]
 gi|413944035|gb|AFW76684.1| hypothetical protein ZEAMMB73_746438 [Zea mays]
          Length = 456

 Score =  170 bits (431), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 100/295 (33%), Positives = 137/295 (46%), Gaps = 28/295 (9%)

Query: 152 GSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSC 211
            + EY V + VG+PPR   + +D+GSD+VW QC PC  C+ Q  P+ DPA S++++ + C
Sbjct: 82  ATNEYLVHLAVGTPPRPVALTLDTGSDLVWTQCAPCRDCFDQGIPLLDPAASSTYAALPC 141

Query: 212 SSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT----------VVKNVA 261
            +  C  L    C    C Y   YGD S T G +A +  T G              + + 
Sbjct: 142 GAPRCRALPFTSCGGRSCVYVYHYGDKSVTVGKIATDRFTFGDNGRRNGDGSLPATRRLT 201

Query: 262 IGCGHKNQGMF-VGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFG-- 318
            GCGH N+G+F     G+ G G G  SL  QL   +   FSYC  S     S  +  G  
Sbjct: 202 FGCGHFNKGVFQSNETGIAGFGRGRWSLPSQLNATS---FSYCFTSMFDSKSSIVTLGGA 258

Query: 319 -----REALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVV 373
                  A        PL +NP  PS Y++ L G+ VG  R+P+ E  FR T       +
Sbjct: 259 PAALYSHAHSGEVRTTPLFKNPSQPSLYFLSLKGISVGKTRLPVPETKFRST-------I 311

Query: 374 MDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTV 428
           +D+G ++T LP   YEA +  F AQ G  P     S  D C+ L      R P V
Sbjct: 312 IDSGASITTLPEEVYEAVKAEFAAQVGLPPSGVEGSALDVCFALPVSALWRRPAV 366


>gi|212275143|ref|NP_001130306.1| uncharacterized protein LOC100191400 precursor [Zea mays]
 gi|194688798|gb|ACF78483.1| unknown [Zea mays]
 gi|194703430|gb|ACF85799.1| unknown [Zea mays]
 gi|194707192|gb|ACF87680.1| unknown [Zea mays]
 gi|223944599|gb|ACN26383.1| unknown [Zea mays]
 gi|223948667|gb|ACN28417.1| unknown [Zea mays]
 gi|414887962|tpg|DAA63976.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
          Length = 450

 Score =  170 bits (430), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 122/347 (35%), Positives = 169/347 (48%), Gaps = 18/347 (5%)

Query: 156 YFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAV 215
           Y VR  +G+PP+   + +D+ +D  W+ C  C+ C   S   FDPA SAS+  V C S +
Sbjct: 112 YVVRASLGTPPQQLLLAVDTSNDASWIPCAGCAGCPTSSAAPFDPASSASYRTVPCGSPL 171

Query: 216 CDRLENAGCHAG--RCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFV 273
           C +  NA C  G   C + ++Y D S  +  L+ ++L +    VK    GC  +  G   
Sbjct: 172 CAQAPNAACPPGGKACGFSLTYADSSL-QAALSQDSLAVAGNAVKAYTFGCLQRATGTAA 230

Query: 274 GAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALPVGAAWVPLV 332
              GLLGLG G +S + Q        FSYCL S +    SG+L  GR   P      PL+
Sbjct: 231 PPQGLLGLGRGPLSFLSQTKDMYEATFSYCLPSFKSLNFSGTLRLGRNGQPQRIKTTPLL 290

Query: 333 RNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFR 392
            NP   S YYV ++G+ VG   +PI             G V+D+GT  TRL  PAY A R
Sbjct: 291 ANPHRSSLYYVNMTGIRVGRKVVPIPA----FDPATGAGTVLDSGTMFTRLVAPAYVAVR 346

Query: 393 DAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDA 452
           D    + G     S +  FDTC+N +   +V  P V+  F G  V TLP  N +I     
Sbjct: 347 DEVRRRVGA--PVSSLGGFDTCFNTT---AVAWPPVTLLFDGMQV-TLPEENVVIHSTYG 400

Query: 453 GTFCFAFAPSPSG----LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
              C A A +P G    L++I ++QQ+  ++ FD  NG VGF    C
Sbjct: 401 TISCLAMAAAPDGVNTVLNVIASMQQQNHRVLFDVPNGRVGFARERC 447


>gi|115466078|ref|NP_001056638.1| Os06g0121500 [Oryza sativa Japonica Group]
 gi|113594678|dbj|BAF18552.1| Os06g0121500 [Oryza sativa Japonica Group]
          Length = 442

 Score =  170 bits (430), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 118/344 (34%), Positives = 157/344 (45%), Gaps = 66/344 (19%)

Query: 161 GVGSPPRSQYMVIDSGSDIVWVQCQPCS--QCYKQSDPVFDPADSASFSGVSCSSAVCDR 218
            +  P  +Q M ID+  D+ W+QC PC   +CY Q + +FDP  S + + V C SA C  
Sbjct: 156 AIDDPILAQPMSIDTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGE 215

Query: 219 L--ENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIG-RTVVKNVAIGCGHKNQGMFVGA 275
           L    AGC   +C+Y V YGDG  T GT  ++ LT+   TVV N   GC H  +G F   
Sbjct: 216 LGRYGAGCSNNQCQYFVDYGDGRATSGTYMVDALTLNPSTVVMNFRFGCSHAVRGNF--- 272

Query: 276 AGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNP 335
                                            + S+   +F R          PLVRNP
Sbjct: 273 ---------------------------------SASTSGTMFAR---------TPLVRNP 290

Query: 336 RA-PSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDA 394
              P+ Y V L G+ VGG R+ +   +F        G VMD+   +T+LP  AY A R A
Sbjct: 291 SIIPTLYLVRLRGIEVGGRRLNVPPVVFA------GGAVMDSSVIITQLPPTAYRALRLA 344

Query: 395 FVAQTGNLPR-ASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAG 453
           F +     PR A G +  DTCY+   F SV VP VS  F GG V+ L A   ++      
Sbjct: 345 FRSAMAAYPRVAGGRAGLDTCYDFVRFTSVTVPAVSLVFDGGAVVRLDAMGVMV------ 398

Query: 454 TFCFAFAPSPS--GLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
             C AF P+P    L  IGN+QQ+  ++ +D   G VGF    C
Sbjct: 399 EGCLAFVPTPGDFALGFIGNVQQQTHEVLYDVVGGSVGFRRGAC 442


>gi|242079449|ref|XP_002444493.1| hypothetical protein SORBIDRAFT_07g022790 [Sorghum bicolor]
 gi|241940843|gb|EES13988.1| hypothetical protein SORBIDRAFT_07g022790 [Sorghum bicolor]
          Length = 449

 Score =  170 bits (430), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 123/375 (32%), Positives = 188/375 (50%), Gaps = 38/375 (10%)

Query: 150 DQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQ-------CYKQSDPVFDPAD 202
           DQG   + + +G+G+PP+ + +++D+GSD++W QC   S+         +Q +P+++P  
Sbjct: 81  DQG---HSLTVGIGTPPQPRTLIVDTGSDLIWTQCSMLSRRTRTAASASRQREPLYEPRR 137

Query: 203 SASFSGVSCSSAVCD--RLENAGC-HAGRCRYEVSYGDGSYTKGTLALETLTIG--RTVV 257
           S+SF+ + CS  +C   +     C    RC Y+  YG      G LA ET T G    V 
Sbjct: 138 SSSFAYLPCSDRLCQEGQFSYKNCARNNRCMYDELYGSAE-AGGVLASETFTFGVNAKVS 196

Query: 258 KNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVF 317
             +  GCG  + G  VGA+GL+GL  G MSLV QL   +   FSYCL       +  L+F
Sbjct: 197 LPLGFGCGALSAGDLVGASGLMGLSPGIMSLVSQL---SVPRFSYCLTPFAERKTSPLLF 253

Query: 318 G-----REALPVGAAWVP-LVRNPRAPS-FYYVGLSGLGVGGMRIPI-SEDLFRLTQMGD 369
           G     R     G      ++RNP   + +YYV L GL +G  R+ + +  L  +   G 
Sbjct: 254 GAMADLRRYRTTGTVQTTSILRNPAMETAYYYVPLVGLSLGTKRLDVPATSLGMIKPDGS 313

Query: 370 DGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSI----FDTCYNLSGFV---S 422
            G ++D+G+ ++ L   A+ A + A V +   LP A+G       ++ C+ L   V   +
Sbjct: 314 GGTIVDSGSTMSYLEETAFRAVKKA-VVEAVRLPVANGTDEDYDDYELCFALPTGVAMEA 372

Query: 423 VRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPS--GLSIIGNIQQEGIQIS 480
           V+ P +  +F GG  +TLP  N+      AG  C A   SP   G+SIIGN+QQ+ + + 
Sbjct: 373 VKTPPLVLHFDGGAAMTLPRDNYF-QEPRAGLMCLAVGTSPDGFGVSIIGNVQQQNMHVL 431

Query: 481 FDGANGFVGFGPNVC 495
           FD  N    F P  C
Sbjct: 432 FDVRNQKFSFAPTKC 446


>gi|125558632|gb|EAZ04168.1| hypothetical protein OsI_26310 [Oryza sativa Indica Group]
 gi|125600539|gb|EAZ40115.1| hypothetical protein OsJ_24558 [Oryza sativa Japonica Group]
          Length = 453

 Score =  169 bits (429), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 140/412 (33%), Positives = 208/412 (50%), Gaps = 40/412 (9%)

Query: 112 MQRDVKRVATLVRRLSGGGADAAKHEVQDFGT--DVVSGMDQGSGEYFVRIGVGSPPRSQ 169
           ++RD+ R A   R L+   + ++        T  D+ +G     GEY + + +G+PP+S 
Sbjct: 51  LRRDMHRRARFGRELASSSSSSSPAGTVSAPTRKDLPNG-----GEYIMTLAIGTPPQSY 105

Query: 170 YMVIDSGSDIVWVQCQPCSQ-CYKQSDPVFDPADSASFSGVSCSSAV--CD---RLENAG 223
             + D+GSD+VW QC PC + C+KQ  P+++P+ S +F  + CSSA+  C    RL  A 
Sbjct: 106 PAIADTGSDLVWTQCAPCGERCFKQPSPLYNPSSSPTFRVLPCSSALNLCAAEARLAGAT 165

Query: 224 CHAG-RCRYEVSYGDGSYTKGTLALETLTIG-----RTVVKNVAIGCGHKNQGMFVGAAG 277
              G  CRY  +YG G +T G    ET T G     +  V  +A GC + +   + G+AG
Sbjct: 166 PPPGCACRYNQTYGTG-WTSGLQGSETFTFGSSPADQVRVPGIAFGCSNASSDDWNGSAG 224

Query: 278 LLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALP-----VGAAWVPL 331
           L+GLG G +SLV QL     G FSYCL   + T S  +L+ G  A        G    P 
Sbjct: 225 LVGLGRGGLSLVSQLAA---GMFSYCLTPFQDTKSKSTLLLGPAAAAAALNGTGVRSTPF 281

Query: 332 VRNPRAP---SFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAY 388
           V +P  P   ++YY+ L+G+ VG   +PI    F L   G  G+++D+GT +T L   AY
Sbjct: 282 VPSPSKPPMSTYYYLNLTGISVGAAALPIPPGAFALRADGTGGLIIDSGTTITSLVDAAY 341

Query: 389 EAFRDAFVAQTGNLPRASGVSI--FDTCYNL--SGFVSVRVPTVSFYFSGGPVLTLPASN 444
           +  R A V     LP   G +    D C+ L  S      +P+++ +F GG  + LP  N
Sbjct: 342 KRVRAA-VRSLVKLPVTDGSNATGLDLCFALPSSSAPPATLPSMTLHFGGGADMVLPVEN 400

Query: 445 FLIPVDDAGTFCFAFAPSPSG-LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           ++I   D G +C A      G LS +GN QQ+ + I +D     + F P  C
Sbjct: 401 YMI--LDGGMWCLAMRSQTDGELSTLGNYQQQNLHILYDVQKETLSFAPAKC 450


>gi|356507997|ref|XP_003522749.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 440

 Score =  169 bits (429), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 124/359 (34%), Positives = 175/359 (48%), Gaps = 16/359 (4%)

Query: 145 VVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSA 204
           + SG     G Y VR+ +G+P +  +MV+D+ +D  +V C  C+ C   SD  F P  S 
Sbjct: 89  IASGQTFNIGNYVVRVKLGTPGQLLFMVLDTSTDEAFVPCSGCTGC---SDTTFSPKAST 145

Query: 205 SFSGVSCSSAVCDRLENAGCHA---GRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVA 261
           S+  + CS   C ++    C A   G C +  SY   S++  TL  ++L +   V+ N +
Sbjct: 146 SYGPLDCSVPQCGQVRGLSCPATGTGACSFNQSYAGSSFS-ATLVQDSLRLATDVIPNYS 204

Query: 262 IGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGRE 320
            GC +   G  V A GLLGLG G +SL+ Q G    G FSYCL S +    SGSL  G  
Sbjct: 205 FGCVNAITGASVPAQGLLGLGRGPLSLLSQSGSNYSGIFSYCLPSFKSYYFSGSLKLGPV 264

Query: 321 ALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAV 380
             P      PL+R+P  PS YYV  +G+ VG + +P   +          G ++D+GT +
Sbjct: 265 GQPKSIRTTPLLRSPHRPSLYYVNFTGISVGRVLVPFPSEYLGFNPNTGSGTIIDSGTVI 324

Query: 381 TRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTL 440
           TR   P Y A R+ F  Q G     S +  FDTC+  +       P ++ +F G   L L
Sbjct: 325 TRFVEPVYNAVREEFRKQVGGTTFTS-IGAFDTCFVKT--YETLAPPITLHFEGLD-LKL 380

Query: 441 PASNFLIPVDDAGTFCFAFAPSP----SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           P  N LI        C A A +P    S L++I N QQ+ ++I FD  N  VG    VC
Sbjct: 381 PLENSLIHSSAGSLACLAMAAAPDNVNSVLNVIANFQQQNLRILFDTVNNKVGIAREVC 439


>gi|55296937|dbj|BAD68388.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
           sativa Japonica Group]
 gi|218197467|gb|EEC79894.1| hypothetical protein OsI_21421 [Oryza sativa Indica Group]
          Length = 424

 Score =  169 bits (429), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 118/344 (34%), Positives = 157/344 (45%), Gaps = 66/344 (19%)

Query: 161 GVGSPPRSQYMVIDSGSDIVWVQCQPCS--QCYKQSDPVFDPADSASFSGVSCSSAVCDR 218
            +  P  +Q M ID+  D+ W+QC PC   +CY Q + +FDP  S + + V C SA C  
Sbjct: 138 AIDDPILAQPMSIDTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGE 197

Query: 219 L--ENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIG-RTVVKNVAIGCGHKNQGMFVGA 275
           L    AGC   +C+Y V YGDG  T GT  ++ LT+   TVV N   GC H  +G F   
Sbjct: 198 LGRYGAGCSNNQCQYFVDYGDGRATSGTYMVDALTLNPSTVVMNFRFGCSHAVRGNF--- 254

Query: 276 AGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNP 335
                                            + S+   +F R          PLVRNP
Sbjct: 255 ---------------------------------SASTSGTMFAR---------TPLVRNP 272

Query: 336 RA-PSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDA 394
              P+ Y V L G+ VGG R+ +   +F        G VMD+   +T+LP  AY A R A
Sbjct: 273 SIIPTLYLVRLRGIEVGGRRLNVPPVVFA------GGAVMDSSVIITQLPPTAYRALRLA 326

Query: 395 FVAQTGNLPR-ASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAG 453
           F +     PR A G +  DTCY+   F SV VP VS  F GG V+ L A   ++      
Sbjct: 327 FRSAMAAYPRVAGGRAGLDTCYDFVRFTSVTVPAVSLVFDGGAVVRLDAMGVMV------ 380

Query: 454 TFCFAFAPSPS--GLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
             C AF P+P    L  IGN+QQ+  ++ +D   G VGF    C
Sbjct: 381 EGCLAFVPTPGDFALGFIGNVQQQTHEVLYDVVGGSVGFRRGAC 424


>gi|147771308|emb|CAN69536.1| hypothetical protein VITISV_043237 [Vitis vinifera]
          Length = 372

 Score =  169 bits (429), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 120/345 (34%), Positives = 175/345 (50%), Gaps = 14/345 (4%)

Query: 156 YFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAV 215
           Y VR  +G+P ++  M +D+ SD+ W+   PC+ C   S  +F+   S ++  + C +A 
Sbjct: 36  YIVRAKIGTPAQTMLMAMDTSSDVAWI---PCNGCLGCSSTLFNSPASTTYKSLGCQAAQ 92

Query: 216 CDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGA 275
           C ++    C  G C + ++YG GS     L+ +T+T+    V   + GC  K  G  + A
Sbjct: 93  CKQVPKPTCGGGVCSFNLTYG-GSSLAANLSQDTITLATDAVPGYSFGCIQKATGGSLPA 151

Query: 276 AGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALPVGAAWVPLVRN 334
            GLLGLG G +SL+ Q        FSYCL S +    SGSL  G    P    + PL++N
Sbjct: 152 QGLLGLGRGPLSLLSQTQNLYQSTFSYCLPSFKSLNFSGSLRLGPVGQPKRIKYTPLLKN 211

Query: 335 PRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDA 394
           PR PS Y+V L  + VG   + +    F        G + D+GT  TRL TPAY A RDA
Sbjct: 212 PRRPSLYFVNLMAVRVGRRVVDVPPGSFTFNPSTGAGTIFDSGTVFTRLVTPAYIAVRDA 271

Query: 395 FVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGT 454
           F  + G     + +  FDTCY     V +  PT++F F+G  V TLP  N LI      T
Sbjct: 272 FRNRVGRNLTVTSLGGFDTCYT----VPIAAPTITFMFTGMNV-TLPPDNLLIHSTAGST 326

Query: 455 FCFAFAPSP----SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
            C A A +P    S L++I N+QQ+  ++ +D  N  +G    +C
Sbjct: 327 TCLAMAAAPDNVNSVLNVIANLQQQNHRLLYDVPNSRLGVARELC 371


>gi|55296886|dbj|BAD68338.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
           sativa Japonica Group]
 gi|55296941|dbj|BAD68392.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
           sativa Japonica Group]
          Length = 424

 Score =  169 bits (429), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 118/344 (34%), Positives = 157/344 (45%), Gaps = 66/344 (19%)

Query: 161 GVGSPPRSQYMVIDSGSDIVWVQCQPCS--QCYKQSDPVFDPADSASFSGVSCSSAVCDR 218
            +  P  +Q M ID+  D+ W+QC PC   +CY Q + +FDP  S + + V C SA C  
Sbjct: 138 AIDDPILAQPMSIDTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGE 197

Query: 219 L--ENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIG-RTVVKNVAIGCGHKNQGMFVGA 275
           L    AGC   +C+Y V YGDG  T GT  ++ LT+   TVV N   GC H  +G F   
Sbjct: 198 LGRYGAGCSNNQCQYFVDYGDGRATSGTYMVDALTLNPSTVVMNFRFGCSHAVRGNF--- 254

Query: 276 AGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNP 335
                                            + S+   +F R          PLVRNP
Sbjct: 255 ---------------------------------SASTSGTMFAR---------TPLVRNP 272

Query: 336 RA-PSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDA 394
              P+ Y V L G+ VGG R+ +   +F        G VMD+   +T+LP  AY A R A
Sbjct: 273 SIIPTLYLVRLRGIEVGGRRLNVPPVVFA------GGAVMDSSVIITQLPPTAYRALRLA 326

Query: 395 FVAQTGNLPR-ASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAG 453
           F +     PR A G +  DTCY+   F SV VP VS  F GG V+ L A   ++      
Sbjct: 327 FRSAMAAYPRVAGGRAGLDTCYDFVRFTSVTVPAVSLVFDGGAVVRLDAMGVMV------ 380

Query: 454 TFCFAFAPSPS--GLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
             C AF P+P    L  IGN+QQ+  ++ +D   G VGF    C
Sbjct: 381 EGCLAFVPTPGDFALGFIGNVQQQTHEVLYDVGGGSVGFRRGAC 424


>gi|326513755|dbj|BAJ87896.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 442

 Score =  169 bits (428), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 123/400 (30%), Positives = 180/400 (45%), Gaps = 36/400 (9%)

Query: 114 RDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVI 173
           R   R A L    SG  A  A   V    TDV S       EY + + +G+P RSQ +V+
Sbjct: 58  RSRARAANLCP-YSGATARPATAPVGRANTDVNS-------EYLIHLSIGAP-RSQPVVL 108

Query: 174 --DSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCRY 231
             D+GSD+VW QC+PC++C+ Q  P FD A S +   V+CS  +C+     GC    C Y
Sbjct: 109 TLDTGSDVVWTQCEPCAECFTQPLPRFDTAASNTVRSVACSDPLCNAHSEHGCFLHGCTY 168

Query: 232 EVSYGDGSYTKGTLALETLTI------GRTVVKNVAIGCGHKNQGMFVGA-AGLLGLGGG 284
              YGDGS + G    ++ T       G+  V ++  GCG  N G F+    G+ G G G
Sbjct: 169 VSGYGDGSLSFGHFLRDSFTFDDGKGGGKVTVPDIGFGCGMYNAGRFLQTETGIAGFGRG 228

Query: 285 SMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSF---- 340
            +SL  QL  +    FSYC  +R    S  +  G        A  P++  P   S     
Sbjct: 229 PLSLPSQLKVRQ---FSYCFTTRFEAKSSPVFLGGAGDLKAHATGPILSTPFVRSLPPGT 285

Query: 341 ----YYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFV 396
               Y +   G+ VG  R+P+ E    +   G     +D+GT +T  P   +   + AF+
Sbjct: 286 DNSHYVLSFKGVTVGKTRLPVPE----IKADGSGATFIDSGTDITTFPDAVFRQLKSAFI 341

Query: 397 AQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFC 456
           AQ   LP        D C++  G  +  +P + F+  G     LP  N++    ++G  C
Sbjct: 342 AQAA-LPVNKTADEDDICFSWDGKKTAAMPKLVFHLEGAD-WDLPRENYVTEDRESGQVC 399

Query: 457 FAFAPS-PSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
            A + S     ++IGN QQ+   I +D A G +   P  C
Sbjct: 400 VAVSTSGQMDRTLIGNFQQQNTHIVYDLAAGKLLLVPAQC 439


>gi|22831049|dbj|BAC15912.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|50508281|dbj|BAD32130.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
          Length = 453

 Score =  169 bits (428), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 140/412 (33%), Positives = 208/412 (50%), Gaps = 40/412 (9%)

Query: 112 MQRDVKRVATLVRRLSGGGADAAKHEVQDFGT--DVVSGMDQGSGEYFVRIGVGSPPRSQ 169
           ++RD+ R A   R L+   + ++        T  D+ +G     GEY + + +G+PP+S 
Sbjct: 51  LRRDMHRRARFGRELASSSSSSSPAGTVSAPTRKDLPNG-----GEYIMTLAIGTPPQSY 105

Query: 170 YMVIDSGSDIVWVQCQPCSQ-CYKQSDPVFDPADSASFSGVSCSSAV--CD---RLENAG 223
             + D+GSD+VW QC PC + C+KQ  P+++P+ S +F  + CSSA+  C    RL  A 
Sbjct: 106 PAIADTGSDLVWTQCAPCGERCFKQPSPLYNPSSSPTFRVLPCSSALNLCAAEARLAGAT 165

Query: 224 CHAG-RCRYEVSYGDGSYTKGTLALETLTIG-----RTVVKNVAIGCGHKNQGMFVGAAG 277
              G  CRY  +YG G +T G    ET T G     +  V  +A GC + +   + G+AG
Sbjct: 166 PPPGCACRYNQTYGTG-WTSGLQGSETFTFGSSPADQVRVPGIAFGCSNASSDDWNGSAG 224

Query: 278 LLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALP-----VGAAWVPL 331
           L+GLG G +SLV QL     G FSYCL   + T S  +L+ G  A        G    P 
Sbjct: 225 LVGLGRGGLSLVSQLAA---GMFSYCLTPFQDTKSKSTLLLGPAAAAAALNGTGVRSTPF 281

Query: 332 VRNPRAP---SFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAY 388
           V +P  P   ++YY+ L+G+ VG   +PI    F L   G  G+++D+GT +T L   AY
Sbjct: 282 VPSPSKPPMSTYYYLNLTGISVGPAALPIPPGAFALRADGTGGLIIDSGTTITSLVDAAY 341

Query: 389 EAFRDAFVAQTGNLPRASGVSI--FDTCYNL--SGFVSVRVPTVSFYFSGGPVLTLPASN 444
           +  R A V     LP   G +    D C+ L  S      +P+++ +F GG  + LP  N
Sbjct: 342 KRVRAA-VRSLVKLPVTDGSNATGLDLCFALPSSSAPPATLPSMTLHFGGGADMVLPVEN 400

Query: 445 FLIPVDDAGTFCFAFAPSPSG-LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           ++I   D G +C A      G LS +GN QQ+ + I +D     + F P  C
Sbjct: 401 YMI--LDGGMWCLAMRSQTDGELSTLGNYQQQNLHILYDVQKETLSFAPAKC 450


>gi|190896584|gb|ACE96805.1| aspartyl protease [Populus tremula]
 gi|190896586|gb|ACE96806.1| aspartyl protease [Populus tremula]
 gi|190896588|gb|ACE96807.1| aspartyl protease [Populus tremula]
 gi|190896590|gb|ACE96808.1| aspartyl protease [Populus tremula]
 gi|190896592|gb|ACE96809.1| aspartyl protease [Populus tremula]
 gi|190896594|gb|ACE96810.1| aspartyl protease [Populus tremula]
 gi|190896596|gb|ACE96811.1| aspartyl protease [Populus tremula]
 gi|190896598|gb|ACE96812.1| aspartyl protease [Populus tremula]
 gi|190896600|gb|ACE96813.1| aspartyl protease [Populus tremula]
 gi|190896602|gb|ACE96814.1| aspartyl protease [Populus tremula]
 gi|190896604|gb|ACE96815.1| aspartyl protease [Populus tremula]
 gi|190896606|gb|ACE96816.1| aspartyl protease [Populus tremula]
 gi|190896610|gb|ACE96818.1| aspartyl protease [Populus tremula]
 gi|190896612|gb|ACE96819.1| aspartyl protease [Populus tremula]
 gi|190896614|gb|ACE96820.1| aspartyl protease [Populus tremula]
 gi|190896616|gb|ACE96821.1| aspartyl protease [Populus tremula]
 gi|190896618|gb|ACE96822.1| aspartyl protease [Populus tremula]
 gi|190896620|gb|ACE96823.1| aspartyl protease [Populus tremula]
 gi|190896622|gb|ACE96824.1| aspartyl protease [Populus tremula]
 gi|190896624|gb|ACE96825.1| aspartyl protease [Populus tremula]
 gi|190896626|gb|ACE96826.1| aspartyl protease [Populus tremula]
 gi|190896628|gb|ACE96827.1| aspartyl protease [Populus tremula]
 gi|190896630|gb|ACE96828.1| aspartyl protease [Populus tremula]
 gi|190896632|gb|ACE96829.1| aspartyl protease [Populus tremula]
 gi|190896634|gb|ACE96830.1| aspartyl protease [Populus tremula]
 gi|190896636|gb|ACE96831.1| aspartyl protease [Populus tremula]
 gi|190896638|gb|ACE96832.1| aspartyl protease [Populus tremula]
 gi|190896640|gb|ACE96833.1| aspartyl protease [Populus tremula]
 gi|190896642|gb|ACE96834.1| aspartyl protease [Populus tremula]
 gi|190896644|gb|ACE96835.1| aspartyl protease [Populus tremula]
 gi|190896646|gb|ACE96836.1| aspartyl protease [Populus tremula]
 gi|190896648|gb|ACE96837.1| aspartyl protease [Populus tremula]
 gi|190896650|gb|ACE96838.1| aspartyl protease [Populus tremula]
 gi|190896652|gb|ACE96839.1| aspartyl protease [Populus tremula]
 gi|190896654|gb|ACE96840.1| aspartyl protease [Populus tremula]
 gi|190896656|gb|ACE96841.1| aspartyl protease [Populus tremula]
 gi|190896658|gb|ACE96842.1| aspartyl protease [Populus tremula]
          Length = 339

 Score =  169 bits (428), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 106/298 (35%), Positives = 146/298 (48%), Gaps = 12/298 (4%)

Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSS 213
             Y VR+ +G+P +  +MV+D+ +D  WV C  C+ C   S   F P  S +   + CS 
Sbjct: 43  ANYVVRVKLGTPGQQMFMVLDTSNDAAWVPCSGCTGC---SSTTFLPNASTTLGSLDCSE 99

Query: 214 AVCDRLENAGCHA---GRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQG 270
           A C ++    C A     C +  SYG  S    TL  + +T+   V+     GC +   G
Sbjct: 100 AQCSQVRGFSCPATGSSACLFNQSYGGDSSLAATLVQDAITLANDVIPGFTFGCINAVSG 159

Query: 271 MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALPVGAAWV 329
             +   GLLGLG G +SL+ Q G    G FSYCL S +    SGSL  G    P      
Sbjct: 160 GSIPPQGLLGLGRGPISLISQAGAMYSGVFSYCLPSFKSYYFSGSLKLGPVGQPKSIRTT 219

Query: 330 PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYE 389
           PL+RNP  PS YYV L+G+ VG +++PI  +          G ++D+GT +TR   P Y 
Sbjct: 220 PLLRNPHRPSLYYVNLTGVSVGRIKVPIPSEQLVFDPNTGAGTIIDSGTVITRFVQPVYF 279

Query: 390 AFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLI 447
           A RD F  Q  N P +S +  FDTC+  +       P V+ +F G   L LP  N LI
Sbjct: 280 AIRDEFRKQV-NGPISS-LGAFDTCFAATN--EAEAPAVTLHFEGL-NLVLPMENSLI 332


>gi|115472519|ref|NP_001059858.1| Os07g0533800 [Oryza sativa Japonica Group]
 gi|113611394|dbj|BAF21772.1| Os07g0533800 [Oryza sativa Japonica Group]
          Length = 458

 Score =  169 bits (428), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 140/412 (33%), Positives = 208/412 (50%), Gaps = 40/412 (9%)

Query: 112 MQRDVKRVATLVRRLSGGGADAAKHEVQDFGT--DVVSGMDQGSGEYFVRIGVGSPPRSQ 169
           ++RD+ R A   R L+   + ++        T  D+ +G     GEY + + +G+PP+S 
Sbjct: 56  LRRDMHRRARFGRELASSSSSSSPAGTVSAPTRKDLPNG-----GEYIMTLAIGTPPQSY 110

Query: 170 YMVIDSGSDIVWVQCQPCSQ-CYKQSDPVFDPADSASFSGVSCSSAV--CD---RLENAG 223
             + D+GSD+VW QC PC + C+KQ  P+++P+ S +F  + CSSA+  C    RL  A 
Sbjct: 111 PAIADTGSDLVWTQCAPCGERCFKQPSPLYNPSSSPTFRVLPCSSALNLCAAEARLAGAT 170

Query: 224 CHAG-RCRYEVSYGDGSYTKGTLALETLTIG-----RTVVKNVAIGCGHKNQGMFVGAAG 277
              G  CRY  +YG G +T G    ET T G     +  V  +A GC + +   + G+AG
Sbjct: 171 PPPGCACRYNQTYGTG-WTSGLQGSETFTFGSSPADQVRVPGIAFGCSNASSDDWNGSAG 229

Query: 278 LLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALP-----VGAAWVPL 331
           L+GLG G +SLV QL     G FSYCL   + T S  +L+ G  A        G    P 
Sbjct: 230 LVGLGRGGLSLVSQLAA---GMFSYCLTPFQDTKSKSTLLLGPAAAAAALNGTGVRSTPF 286

Query: 332 VRNPRAP---SFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAY 388
           V +P  P   ++YY+ L+G+ VG   +PI    F L   G  G+++D+GT +T L   AY
Sbjct: 287 VPSPSKPPMSTYYYLNLTGISVGPAALPIPPGAFALRADGTGGLIIDSGTTITSLVDAAY 346

Query: 389 EAFRDAFVAQTGNLPRASGVSI--FDTCYNL--SGFVSVRVPTVSFYFSGGPVLTLPASN 444
           +  R A V     LP   G +    D C+ L  S      +P+++ +F GG  + LP  N
Sbjct: 347 KRVRAA-VRSLVKLPVTDGSNATGLDLCFALPSSSAPPATLPSMTLHFGGGADMVLPVEN 405

Query: 445 FLIPVDDAGTFCFAFAPSPSG-LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           ++I   D G +C A      G LS +GN QQ+ + I +D     + F P  C
Sbjct: 406 YMI--LDGGMWCLAMRSQTDGELSTLGNYQQQNLHILYDVQKETLSFAPAKC 455


>gi|226508202|ref|NP_001141111.1| hypothetical protein precursor [Zea mays]
 gi|194702684|gb|ACF85426.1| unknown [Zea mays]
 gi|414590469|tpg|DAA41040.1| TPA: hypothetical protein ZEAMMB73_571218 [Zea mays]
          Length = 439

 Score =  169 bits (428), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 119/366 (32%), Positives = 190/366 (51%), Gaps = 36/366 (9%)

Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS-QCYKQSDPVFDPADSASFSGVSCS 212
           GE+ + + +G+PP     + D+GSD++W QC PCS QC++Q  P+++P+ S +FS + C+
Sbjct: 83  GEFLMTLAIGTPPLPFLAIADTGSDLIWTQCAPCSRQCFQQPTPLYNPSSSTTFSALPCN 142

Query: 213 SAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTV------VKNVAIGCGH 266
           S++   L    C    C Y ++YG G +T      ET T G +       V  +A GC +
Sbjct: 143 SSL--GLCAPAC---ACMYNMTYGSG-WTYVFQGTETFTFGSSTPADQVRVPGIAFGCSN 196

Query: 267 KNQGMFVGAA-GLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALPV 324
            + G    +A GL+GLG GS+SLV QLG      FSYCL   + T S+ +L+ G  A   
Sbjct: 197 ASSGFNASSASGLVGLGRGSLSLVSQLGAP---KFSYCLTPYQDTNSTSTLLLGPSASLN 253

Query: 325 GAAWV---PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVT 381
               V   P V +P +  +YY+ L+G+ +G   +PI  + F L   G  G+++D+GT +T
Sbjct: 254 DTGVVSSTPFVASPSS-IYYYLNLTGISLGTTALPIPPNAFSLKADGTGGLIIDSGTTIT 312

Query: 382 RLPTPAYEAFRDAFVAQTGNLPRASGVSI--FDTCYNLSGFVSV--RVPTVSFYFSGGPV 437
            L   AY+  R A ++    LP   G +    D C+ L    S    +P+++ +F G  +
Sbjct: 313 MLGNTAYQQVRAAVLSLV-TLPTTDGSAATGLDLCFELPSSTSAPPSMPSMTLHFDGADM 371

Query: 438 LTLPASNFLI----PVDDAGTFCFAFAPSPSG----LSIIGNIQQEGIQISFDGANGFVG 489
           + LPA N+++    P  D+  +C A           +SI+GN QQ+ + I +D     + 
Sbjct: 372 V-LPADNYMMSLSDPDSDSSLWCLAMQNQTDTDGVVVSILGNYQQQNMHILYDVGKETLS 430

Query: 490 FGPNVC 495
           F P  C
Sbjct: 431 FAPAKC 436


>gi|118482048|gb|ABK92955.1| unknown [Populus trichocarpa]
          Length = 425

 Score =  169 bits (427), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 122/348 (35%), Positives = 165/348 (47%), Gaps = 15/348 (4%)

Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
           S  Y V+  VG+P ++  M +D+ +D  W+ C  C  C   S  VF+   S +F  + C 
Sbjct: 87  SPTYIVKANVGTPAQTFLMALDTSNDAAWIPCNGCVGC---SSTVFNSVTSTTFKTLGCD 143

Query: 213 SAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMF 272
           +  C ++ N  C    C +  +YG GS     L  +T+ +   +V     GC  K  G  
Sbjct: 144 APQCKQVPNPTCGGSTCTWNTTYG-GSTILSNLTRDTIALSTDIVPGYTFGCIQKTTGSS 202

Query: 273 VGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALPVGAAWVPL 331
           V   GLLGLG G +S + Q        FSYCL S R    SG+L  G    P+     PL
Sbjct: 203 VPPQGLLGLGRGPLSFLSQTQDLYKSTFSYCLPSFRTLNFSGTLRLGPAGQPLRIKTTPL 262

Query: 332 VRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAF 391
           ++NPR  S YYV L G+ VG   + I             G + D+GT  TRL  P Y A 
Sbjct: 263 LKNPRRSSLYYVNLIGIRVGRKIVDIPASALAFNPTTGAGTIFDSGTVFTRLVAPVYTAV 322

Query: 392 RDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDD 451
           RD F  + GN    S +  FDTCY  +G   +  PT++F FSG  V TLP  N LI    
Sbjct: 323 RDEFRKRVGNA-IVSSLGGFDTCY--TG--PIVAPTMTFMFSGMNV-TLPTDNLLIRSTA 376

Query: 452 AGTFCFAFAPSP----SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
             T C A A +P    S L++I N+QQ+  +I FD  N  +G     C
Sbjct: 377 GSTSCLAMAAAPDNVNSVLNVIANMQQQNHRILFDVPNSRIGVAREPC 424


>gi|359476193|ref|XP_003631802.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-2-like [Vitis vinifera]
          Length = 496

 Score =  169 bits (427), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 118/342 (34%), Positives = 171/342 (50%), Gaps = 33/342 (9%)

Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSS 213
           G + V +  G+PP+   +++D+GS I W QC+PC +C K S   FDP+ S ++S  SC  
Sbjct: 160 GNFLVDVAFGTPPQKFTLILDTGSSITWTQCKPCVRCLKASRRHFDPSASLTYSLGSCIP 219

Query: 214 AVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQGMF 272
           +                Y ++YGD S + G    +T+T+  + V      GCG  N+G F
Sbjct: 220 STVGN-----------TYNMTYGDKSTSVGNYGCDTMTLEHSDVFPKFQFGCGRNNEGDF 268

Query: 273 -VGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAA--WV 329
             GA G+LGLG G +S V Q   +    FSYCL      S GSL+FG +A    ++  + 
Sbjct: 269 GSGADGMLGLGQGQLSTVSQTASKFKKVFSYCLPEE--DSIGSLLFGEKATSQSSSLKFT 326

Query: 330 PLVRNP-----RAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLP 384
            LV  P         +Y+V L  + VG  R+ I   +F        G ++D+GT +TRLP
Sbjct: 327 SLVNGPGTSGLEESGYYFVKLLDISVGNKRLNIPSSVF-----ASPGTIIDSGTVITRLP 381

Query: 385 TPAYEAFRDAFVAQTGNLPRASGV----SIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTL 440
             AY A + AF       P ++G      I DTCYNLSG   V +P +  +F  G  + L
Sbjct: 382 QRAYSALKAAFKKAMAKYPLSNGRRKKGDILDTCYNLSGRKDVLLPEIVLHFGEGADVRL 441

Query: 441 PASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFD 482
                +I  +DA   C AFA + S L+IIGN QQ  + + +D
Sbjct: 442 NGKR-VIWGNDASRLCLAFAGN-SELTIIGNRQQVSLTVLYD 481


>gi|190896608|gb|ACE96817.1| aspartyl protease [Populus tremula]
          Length = 339

 Score =  169 bits (427), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 106/298 (35%), Positives = 146/298 (48%), Gaps = 12/298 (4%)

Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSS 213
             Y VR+ +G+P +  +MV+D+ +D  WV C  C+ C   S   F P  S +   + CS 
Sbjct: 43  ANYVVRVKLGTPGQQMFMVLDTSNDAAWVPCSGCTGC---SSTTFLPNASTTLGSLDCSE 99

Query: 214 AVCDRLENAGCHA---GRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQG 270
           A C ++    C A     C +  SYG  S    TL  + +T+   V+     GC +   G
Sbjct: 100 AQCSQVRGFSCPATGSSACLFNQSYGGDSSLAATLVQDAITLANDVIPGFTFGCINAVSG 159

Query: 271 MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALPVGAAWV 329
             +   GLLGLG G +SL+ Q G    G FSYCL S +    SGSL  G    P      
Sbjct: 160 GSIPPQGLLGLGRGPISLISQAGAMYSGVFSYCLPSFKSYYFSGSLKLGPVGQPKSIRTT 219

Query: 330 PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYE 389
           PL+RNP  PS YYV L+G+ VG +++PI  +          G ++D+GT +TR   P Y 
Sbjct: 220 PLLRNPHRPSLYYVNLTGVSVGRIKVPIPSEQLVFDPNTGAGTIIDSGTVITRFVQPVYF 279

Query: 390 AFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLI 447
           A RD F  Q  N P +S +  FDTC+  +       P V+ +F G   L LP  N LI
Sbjct: 280 AIRDEFRKQV-NGPISS-LGAFDTCFAETN--EAEAPAVTLHFEGL-NLVLPMENSLI 332


>gi|224057272|ref|XP_002299201.1| predicted protein [Populus trichocarpa]
 gi|118483775|gb|ABK93780.1| unknown [Populus trichocarpa]
 gi|222846459|gb|EEE84006.1| predicted protein [Populus trichocarpa]
          Length = 425

 Score =  169 bits (427), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 122/348 (35%), Positives = 165/348 (47%), Gaps = 15/348 (4%)

Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
           S  Y V+  VG+P ++  M +D+ +D  W+ C  C  C   S  VF+   S +F  + C 
Sbjct: 87  SPTYIVKANVGTPAQTFLMALDTSNDAAWIPCNGCVGC---SSTVFNSVTSTTFKTLGCD 143

Query: 213 SAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMF 272
           +  C ++ N  C    C +  +YG GS     L  +T+ +   +V     GC  K  G  
Sbjct: 144 APQCKQVPNPTCGGSTCTWNTTYG-GSTILSNLTRDTIALSTDIVPGYTFGCIQKTTGSS 202

Query: 273 VGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALPVGAAWVPL 331
           V   GLLGLG G +S + Q        FSYCL S R    SG+L  G    P+     PL
Sbjct: 203 VPPQGLLGLGRGPLSFLSQTQDLYKSTFSYCLPSFRTLNFSGTLRLGPAGQPLRIKTTPL 262

Query: 332 VRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAF 391
           ++NPR  S YYV L G+ VG   + I             G + D+GT  TRL  P Y A 
Sbjct: 263 LKNPRRSSLYYVNLIGIRVGRKIVDIPASALAFNPTTGAGTIFDSGTVFTRLVAPVYTAV 322

Query: 392 RDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDD 451
           RD F  + GN    S +  FDTCY  +G   +  PT++F FSG  V TLP  N LI    
Sbjct: 323 RDEFRKRVGNA-IVSSLGGFDTCY--TG--PIVAPTMTFMFSGMNV-TLPPDNLLIRSTA 376

Query: 452 AGTFCFAFAPSP----SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
             T C A A +P    S L++I N+QQ+  +I FD  N  +G     C
Sbjct: 377 GSTSCLAMAAAPDNVNSVLNVIANMQQQNHRILFDVPNSRIGVAREPC 424


>gi|195626958|gb|ACG35309.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
          Length = 450

 Score =  168 bits (425), Expect = 6e-39,   Method: Compositional matrix adjust.
 Identities = 121/347 (34%), Positives = 169/347 (48%), Gaps = 18/347 (5%)

Query: 156 YFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAV 215
           Y VR  +G+PP+   + +D+ +D  W+ C  C+ C   S   FDPA SAS+  V C S +
Sbjct: 112 YVVRASLGTPPQQLLLAVDTSNDASWIPCAGCAGCPTSSAAPFDPAASASYRTVPCGSPL 171

Query: 216 CDRLENAGCHAG--RCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFV 273
           C +  NA C  G   C + ++Y D S  +  L+ ++L +    VK    GC  +  G   
Sbjct: 172 CAQAPNAACPPGGKACGFSLTYADSSL-QAALSQDSLAVAGNAVKAYTFGCLQRATGTAA 230

Query: 274 GAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALPVGAAWVPLV 332
              GLLGLG G +S + Q        FSYCL S +    SG+L  GR   P      PL+
Sbjct: 231 PPQGLLGLGRGPLSFLSQTKDMYEATFSYCLPSFKSLNFSGTLRLGRNGQPQRIKTTPLL 290

Query: 333 RNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFR 392
            NP   S YYV ++G+ VG   +PI             G V+D+GT  TRL  PAY A R
Sbjct: 291 ANPHRSSLYYVNMTGVRVGRKVVPIPA----FDPATGAGTVLDSGTMFTRLVAPAYVAVR 346

Query: 393 DAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDA 452
           D    + G     S +  FDTC+N +   +V  P ++  F G  V TLP  N +I     
Sbjct: 347 DEVRRRVGA--PVSSLGGFDTCFNTT---AVAWPPMTLLFDGMQV-TLPEENVVIHSTYG 400

Query: 453 GTFCFAFAPSPSG----LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
              C A A +P G    L++I ++QQ+  ++ FD  NG VGF    C
Sbjct: 401 TISCLAMAAAPDGVNTVLNVIASMQQQNHRVLFDVPNGRVGFARERC 447


>gi|147809812|emb|CAN71447.1| hypothetical protein VITISV_040904 [Vitis vinifera]
          Length = 988

 Score =  168 bits (425), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 118/360 (32%), Positives = 173/360 (48%), Gaps = 37/360 (10%)

Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSS 213
           G + V +  G+PP+   +++D+GS I W QC+ C  C K S   FD   S+++S  SC  
Sbjct: 125 GNFLVDVAFGTPPQKFKLILDTGSSITWTQCKACVHCLKDSHRHFDSLASSTYSFGSCIP 184

Query: 214 AVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQGMF 272
           +                Y ++YGD S + G    +T+T+  + V +    GCG  N+G F
Sbjct: 185 STVGNT-----------YNMTYGDKSTSVGNYGCDTMTLEPSDVFQKFQFGCGRNNEGDF 233

Query: 273 -VGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAA--WV 329
             GA G+LGLG G +S V Q   +    FSYCL      S GSL+FG +A    ++  + 
Sbjct: 234 GSGADGMLGLGQGQLSTVSQTASKFKKVFSYCLPEE--NSIGSLLFGEKATSQSSSLKFT 291

Query: 330 PLVRNP-----RAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLP 384
            LV  P         +Y+V L  + VG  R+ I   +F        G ++D+GT +TRLP
Sbjct: 292 SLVNGPGTSGLEESGYYFVKLLDISVGNKRLNIPSSVF-----ASPGTIIDSGTVITRLP 346

Query: 385 TPAYEAFRDAFVAQTGNLPRASGV----SIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTL 440
             AY A + AF       P ++G      + DTCYNLSG   V +P    +F  G  + L
Sbjct: 347 QRAYSALKAAFKKAMAKYPLSNGRRKENDMLDTCYNLSGRKDVLLPEXVLHFGDGADVRL 406

Query: 441 PASNFLIPVDDAGTFCFAFAPSPSG-----LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
                ++  +DA   C AFA +        L+IIGN QQ  + + +D     +GFG N C
Sbjct: 407 NGKR-VVWGNDASRLCLAFAGNSKSTMNPELTIIGNRQQVSLTVLYDIRGRRIGFGGNGC 465


>gi|115473845|ref|NP_001060521.1| Os07g0658600 [Oryza sativa Japonica Group]
 gi|22775625|dbj|BAC15479.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
 gi|50510141|dbj|BAD31106.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
 gi|113612057|dbj|BAF22435.1| Os07g0658600 [Oryza sativa Japonica Group]
          Length = 449

 Score =  168 bits (425), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 124/348 (35%), Positives = 167/348 (47%), Gaps = 16/348 (4%)

Query: 156 YFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAV 215
           Y VR  +G+P +   + +D+ +D  W+ C  C+ C   S   F+PA SAS+  V C S  
Sbjct: 107 YVVRARLGTPAQQLLLAVDTSNDAAWIPCSGCAGCPTSSP--FNPAASASYRPVPCGSPQ 164

Query: 216 CDRLENAGC--HAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFV 273
           C    N  C  +A  C + +SY D S  +  L+ +TL +   VVK    GC  +  G   
Sbjct: 165 CVLAPNPSCSPNAKSCGFSLSYADSSL-QAALSQDTLAVAGDVVKAYTFGCLQRATGTAA 223

Query: 274 GAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALPVGAAWVPLV 332
              GLLGLG G +S + Q     G  FSYCL S +    SG+L  GR   P      PL+
Sbjct: 224 PPQGLLGLGRGPLSFLSQTKDMYGATFSYCLPSFKSLNFSGTLRLGRNGQPRRIKTTPLL 283

Query: 333 RNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFR 392
            NP   S YYV ++G+ VG   + I             G V+D+GT  TRL  P Y A R
Sbjct: 284 ANPHRSSLYYVNMTGIRVGKKVVSIPASALAFDPATGAGTVLDSGTMFTRLVAPVYLALR 343

Query: 393 DAFVAQTGNLPRA-SGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDD 451
           D    + G    A S +  FDTCYN     +V  P V+  F G  V TLP  N +I    
Sbjct: 344 DEVRRRVGAGAAAVSSLGGFDTCYN----TTVAWPPVTLLFDGMQV-TLPEENVVIHTTY 398

Query: 452 AGTFCFAFAPSPSG----LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
             T C A A +P G    L++I ++QQ+  ++ FD  NG VGF    C
Sbjct: 399 GTTSCLAMAAAPDGVNTVLNVIASMQQQNHRVLFDVPNGRVGFARESC 446


>gi|225446261|ref|XP_002265547.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 440

 Score =  167 bits (424), Expect = 8e-39,   Method: Compositional matrix adjust.
 Identities = 121/369 (32%), Positives = 184/369 (49%), Gaps = 42/369 (11%)

Query: 150 DQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGV 209
           D+G   + V   VG PP  Q + ID+GSD++WVQC+PC+ C++QS P+FDP+ S+++  +
Sbjct: 86  DRGQA-FLVNFSVGRPPVPQLVGIDTGSDLLWVQCRPCADCFRQSTPIFDPSKSSTYVDL 144

Query: 210 SCSSAVC-DRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTI-----GRTVVKNVAIG 263
           S  S +C +  +    H  +C Y  SY DGS + G LA E +       G   V +V  G
Sbjct: 145 SYDSPICPNSPQKKYNHLNQCIYNASYADGSTSSGNLATEDIVFETSDQGTVTVSSVVFG 204

Query: 264 CGHKNQGMFVG-AAGLLGLGGGSMSLVGQLGGQTGGAFSYCL--VSRGTGSSGSLVFGRE 320
           CGH N+G F G  +G+LGL  G  S+V +LG +    FSYC+  +     +   LV G +
Sbjct: 205 CGHSNRGRFDGQQSGILGLSAGDQSIVSRLGSR----FSYCIGDLFDPHYTHNQLVLG-D 259

Query: 321 ALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAV 380
            + +  +  P         FYYV L G+ VG  R+ I+ ++F+ T+ G  GVVMD+GT  
Sbjct: 260 GVKMEGSSTPF---HTFNGFYYVTLEGISVGETRLDINPEVFQRTESGQGGVVMDSGTTA 316

Query: 381 TRLPTPAYEAFRDAFVAQTGNLPRASGVS-IFDT-----CY------NLSGFVSVRVPTV 428
           T L    +    D    +   L R      I+ T     CY      +L GF     P +
Sbjct: 317 TFLAKDGF----DPLSNEIQRLVRGHFQQVIYRTIPGWLCYKGRVNEDLRGF-----PEL 367

Query: 429 SFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPS--PSGLSIIGNIQQEGIQISFDGANG 486
           +F+F+ G  L L A++  +   +   FC A   S   +  S+IG + Q+   +++D    
Sbjct: 368 AFHFAEGADLVLDANSLFVQ-KNQDVFCLAVLESNLKNIGSVIGIMAQQHYNVAYDLIGK 426

Query: 487 FVGFGPNVC 495
            V F    C
Sbjct: 427 RVYFQRTDC 435


>gi|255545932|ref|XP_002514026.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223547112|gb|EEF48609.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 437

 Score =  167 bits (424), Expect = 9e-39,   Method: Compositional matrix adjust.
 Identities = 121/350 (34%), Positives = 173/350 (49%), Gaps = 16/350 (4%)

Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSS 213
           G Y VR+ +G+P +  +MV+D+ +D  WV C  C+ C   +        S+++  + CS 
Sbjct: 95  GNYVVRVKLGTPGQFMFMVLDTSNDAAWVPCSGCTGCSSTTFST---NTSSTYGSLDCSM 151

Query: 214 AVCDRLENAGCHA---GRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQG 270
           A C ++    C A     C +  SYG  S    TL  ++L +   V+ N A GC +   G
Sbjct: 152 AQCTQVRGFSCPATGSSSCVFNQSYGGDSSFSATLVEDSLRLVNDVIPNFAFGCINSISG 211

Query: 271 MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALPVGAAWV 329
             V   GLLGLG G +SL+ Q G    G FSYCL S +    SGSL  G    P    + 
Sbjct: 212 GSVPPQGLLGLGRGPLSLIAQSGSLYSGLFSYCLPSFKSYYFSGSLKLGPAGQPKSIRYT 271

Query: 330 PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYE 389
           PL+RNP  PS YYV L+G+ VG   +PI+ +L         G ++D+GT +TR   P Y 
Sbjct: 272 PLLRNPHRPSLYYVNLTGVSVGRTLVPIAPELLAFNPNTGAGTIIDSGTVITRFVQPIYT 331

Query: 390 AFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPV 449
           A RD F  Q       S +  FDTC+  +       P V+ +F+G   L LP  N LI  
Sbjct: 332 AIRDEFRKQVAG--PFSSLGAFDTCFAATN--EAVAPAVTLHFTGL-NLVLPMENSLIHS 386

Query: 450 DDAGTFCFAFAPSP----SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
                 C A A +P    S L++I N+QQ+ +++ FD  N  +G    +C
Sbjct: 387 SAGSLACLAMAAAPNNVNSVLNVIANLQQQNLRLLFDVPNSRLGIARELC 436


>gi|222637611|gb|EEE67743.1| hypothetical protein OsJ_25435 [Oryza sativa Japonica Group]
          Length = 396

 Score =  167 bits (424), Expect = 9e-39,   Method: Compositional matrix adjust.
 Identities = 124/348 (35%), Positives = 167/348 (47%), Gaps = 16/348 (4%)

Query: 156 YFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAV 215
           Y VR  +G+P +   + +D+ +D  W+ C  C+ C   S   F+PA SAS+  V C S  
Sbjct: 54  YVVRARLGTPAQQLLLAVDTSNDAAWIPCSGCAGCPTSSP--FNPAASASYRPVPCGSPQ 111

Query: 216 CDRLENAGC--HAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFV 273
           C    N  C  +A  C + +SY D S  +  L+ +TL +   VVK    GC  +  G   
Sbjct: 112 CVLAPNPSCSPNAKSCGFSLSYADSSL-QAALSQDTLAVAGDVVKAYTFGCLQRATGTAA 170

Query: 274 GAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALPVGAAWVPLV 332
              GLLGLG G +S + Q     G  FSYCL S +    SG+L  GR   P      PL+
Sbjct: 171 PPQGLLGLGRGPLSFLSQTKDMYGATFSYCLPSFKSLNFSGTLRLGRNGQPRRIKTTPLL 230

Query: 333 RNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFR 392
            NP   S YYV ++G+ VG   + I             G V+D+GT  TRL  P Y A R
Sbjct: 231 ANPHRSSLYYVNMTGIRVGKKVVSIPASALAFDPATGAGTVLDSGTMFTRLVAPVYLALR 290

Query: 393 DAFVAQTGNLPRA-SGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDD 451
           D    + G    A S +  FDTCYN     +V  P V+  F G  V TLP  N +I    
Sbjct: 291 DEVRRRVGAGAAAVSSLGGFDTCYN----TTVAWPPVTLLFDGMQV-TLPEENVVIHTTY 345

Query: 452 AGTFCFAFAPSPSG----LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
             T C A A +P G    L++I ++QQ+  ++ FD  NG VGF    C
Sbjct: 346 GTTSCLAMAAAPDGVNTVLNVIASMQQQNHRVLFDVPNGRVGFARESC 393


>gi|147786881|emb|CAN62311.1| hypothetical protein VITISV_008929 [Vitis vinifera]
          Length = 408

 Score =  167 bits (423), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 121/369 (32%), Positives = 184/369 (49%), Gaps = 42/369 (11%)

Query: 150 DQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGV 209
           D+G   + V   VG PP  Q + ID+GSD++WVQC+PC+ C++QS P+FDP+ S+++  +
Sbjct: 54  DRGQA-FLVNFSVGRPPVPQLVGIDTGSDLLWVQCRPCADCFRQSTPIFDPSKSSTYVDL 112

Query: 210 SCSSAVC-DRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTI-----GRTVVKNVAIG 263
           S  S +C +  +    H  +C Y  SY DGS + G LA E +       G   V +V  G
Sbjct: 113 SYDSPICPNSPQKKYNHLNQCIYNASYADGSTSSGNLATEDIVFETSDQGTVTVSSVVFG 172

Query: 264 CGHKNQGMFVG-AAGLLGLGGGSMSLVGQLGGQTGGAFSYCL--VSRGTGSSGSLVFGRE 320
           CGH N+G F G  +G+LGL  G  S+V +LG +    FSYC+  +     +   LV G +
Sbjct: 173 CGHSNRGRFDGQQSGILGLSAGDQSIVSRLGSR----FSYCIGDLFDPHYTHNQLVLG-D 227

Query: 321 ALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAV 380
            + +  +  P         FYYV L G+ VG  R+ I+ ++F+ T+ G  GVVMD+GT  
Sbjct: 228 GVKMEGSSTPF---HTFNGFYYVTLEGISVGETRLDINPEVFQRTESGQGGVVMDSGTTA 284

Query: 381 TRLPTPAYEAFRDAFVAQTGNLPRASGVS-IFDT-----CY------NLSGFVSVRVPTV 428
           T L    +    D    +   L R      I+ T     CY      +L GF     P +
Sbjct: 285 TFLAKDGF----DPLSNEIQRLVRGHFQQVIYRTIPGWLCYKGRVNEDLRGF-----PEL 335

Query: 429 SFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPS--PSGLSIIGNIQQEGIQISFDGANG 486
           +F+F+ G  L L A++  +   +   FC A   S   +  S+IG + Q+   +++D    
Sbjct: 336 AFHFAEGADLVLDANSLFVQ-KNQDVFCLAVLESNLKNIGSVIGIMAQQHYNVAYDLIGK 394

Query: 487 FVGFGPNVC 495
            V F    C
Sbjct: 395 RVYFQRTDC 403


>gi|296090291|emb|CBI40110.3| unnamed protein product [Vitis vinifera]
          Length = 408

 Score =  167 bits (423), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 121/369 (32%), Positives = 184/369 (49%), Gaps = 42/369 (11%)

Query: 150 DQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGV 209
           D+G   + V   VG PP  Q + ID+GSD++WVQC+PC+ C++QS P+FDP+ S+++  +
Sbjct: 54  DRGQA-FLVNFSVGRPPVPQLVGIDTGSDLLWVQCRPCADCFRQSTPIFDPSKSSTYVDL 112

Query: 210 SCSSAVC-DRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTI-----GRTVVKNVAIG 263
           S  S +C +  +    H  +C Y  SY DGS + G LA E +       G   V +V  G
Sbjct: 113 SYDSPICPNSPQKKYNHLNQCIYNASYADGSTSSGNLATEDIVFETSDQGTVTVSSVVFG 172

Query: 264 CGHKNQGMFVG-AAGLLGLGGGSMSLVGQLGGQTGGAFSYCL--VSRGTGSSGSLVFGRE 320
           CGH N+G F G  +G+LGL  G  S+V +LG +    FSYC+  +     +   LV G +
Sbjct: 173 CGHSNRGRFDGQQSGILGLSAGDQSIVSRLGSR----FSYCIGDLFDPHYTHNQLVLG-D 227

Query: 321 ALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAV 380
            + +  +  P         FYYV L G+ VG  R+ I+ ++F+ T+ G  GVVMD+GT  
Sbjct: 228 GVKMEGSSTPF---HTFNGFYYVTLEGISVGETRLDINPEVFQRTESGQGGVVMDSGTTA 284

Query: 381 TRLPTPAYEAFRDAFVAQTGNLPRASGVS-IFDT-----CY------NLSGFVSVRVPTV 428
           T L    +    D    +   L R      I+ T     CY      +L GF     P +
Sbjct: 285 TFLAKDGF----DPLSNEIQRLVRGHFQQVIYRTIPGWLCYKGRVNEDLRGF-----PEL 335

Query: 429 SFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPS--PSGLSIIGNIQQEGIQISFDGANG 486
           +F+F+ G  L L A++  +   +   FC A   S   +  S+IG + Q+   +++D    
Sbjct: 336 AFHFAEGADLVLDANSLFVQ-KNQDVFCLAVLESNLKNIGSVIGIMAQQHYNVAYDLIGK 394

Query: 487 FVGFGPNVC 495
            V F    C
Sbjct: 395 RVYFQRTDC 403


>gi|302786560|ref|XP_002975051.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
 gi|300157210|gb|EFJ23836.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
          Length = 359

 Score =  167 bits (422), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 118/354 (33%), Positives = 173/354 (48%), Gaps = 26/354 (7%)

Query: 152 GSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQC--YKQSDPVFDPADSASFSGV 209
           G GEY + + +G+PP+    +ID+GSD+VW++C  C  C      + +F    S+S+  +
Sbjct: 1   GEGEYMMELSIGTPPQLIPAMIDTGSDLVWLKCDNCDHCDLDHHGETIFFSDASSSYKKL 60

Query: 210 SCSSAVCDRLENAG----CHAGRCRYEVSYGDGSYTKGTLALETLTI--------GRTVV 257
            C+S  C  + +AG    C    C+Y+  YGDGS T G +  + ++          R+  
Sbjct: 61  PCNSTHCSGMSSAGIGPRCEE-TCKYKYEYGDGSRTSGDVGSDRISFRSHGAGEDHRSFF 119

Query: 258 KNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGS--L 315
                GCG K +G +    GL+GLG  S SL+ QLG + G  FSYCLVS  +  S    L
Sbjct: 120 DGFLFGCGRKLKGDWNFTQGLIGLGQKSHSLIQQLGDKLGYKFSYCLVSYDSPPSAKSFL 179

Query: 316 VFGREALPVGAAWV--PLVRNPRAP-SFYYVGLSGLGVGGMRIPI-SEDLFRLTQMGD-- 369
             G  A   G   V  P++       + YYV L  + VGG+ + +  ++    T +G   
Sbjct: 180 FLGSSAALRGHDVVSTPILHGDHLDQTLYYVDLQSITVGGVPVVVYDKESGHNTSVGPFL 239

Query: 370 -DGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTV 428
            +  V+D+GT  T L  P YEA R +   Q   LP     +  D C+N SG  S   P+V
Sbjct: 240 ANKTVIDSGTTYTLLTPPVYEAMRKSIEEQV-ILPTLGNSAGLDLCFNSSGDTSYGFPSV 298

Query: 429 SFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFD 482
           +FYF+    L LP  N +  V      C +   S   LSIIGN+QQ+   I +D
Sbjct: 299 TFYFANQVQLVLPFEN-IFQVTSRDVVCLSMDSSGGDLSIIGNMQQQNFHILYD 351


>gi|302781476|ref|XP_002972512.1| hypothetical protein SELMODRAFT_441822 [Selaginella moellendorffii]
 gi|300159979|gb|EFJ26598.1| hypothetical protein SELMODRAFT_441822 [Selaginella moellendorffii]
          Length = 496

 Score =  166 bits (421), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 121/376 (32%), Positives = 182/376 (48%), Gaps = 42/376 (11%)

Query: 156 YFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAV 215
           + +++G+GS  ++   +ID+GS+ V VQC        +S PVFDPA S S+  V C S +
Sbjct: 100 FSMQLGIGSLQKNLSAIIDTGSEAVLVQCG------SRSRPVFDPAASQSYRQVPCISQL 153

Query: 216 CDRLENAGCH---------AGRCRYEVSYGDGSYTKGTLALETLTIGRT-------VVKN 259
           C  ++    +         +  C Y +SYGD   + G  + + + +  T         ++
Sbjct: 154 CLAVQQQTSNGSSQPCVNSSATCTYSLSYGDSRNSTGDFSQDVIFLNSTNSSGQAVQFRD 213

Query: 260 VAIGCGHKNQGMFV--GAAGLLGLGGGSMSLVGQLGGQTGGA-FSYCLVSR--GTGSSGS 314
           VA GC H  QG  V  G+ G++G   G++SL  QL  + GG+ FSYC  S+     ++G 
Sbjct: 214 VAFGCAHSPQGFLVDLGSLGIVGFNRGNLSLPSQLKDRLGGSKFSYCFPSQPWQPRATGV 273

Query: 315 LVFGREALPVGAA-WVPLVRNPRAPS---FYYVGLSGLGVGGMRIPISEDLFRLT-QMGD 369
           +  G   L      + PL+ NP  P+    YYVGL+ + V G  + I E  F+L    GD
Sbjct: 274 IFLGDSGLSKSKVGYTPLLDNPVTPARSQLYYVGLTSISVDGKTLAIPESAFKLDPSTGD 333

Query: 370 DGVVMDTGTAVTRLPTPAYEAFRDAFVA--QTGNLPRASGVSIFDTCYNLSGFVSVR-VP 426
            G V+D+GT  TR+   AY AFR+AF A  ++G   +    + FD CYN+S   S+  VP
Sbjct: 334 GGTVLDSGTTFTRVVDDAYTAFRNAFAASNRSGLRKKVGAAAGFDDCYNISAGSSLPGVP 393

Query: 427 TVSFYFSGGPVLTLPASNFLIPVDDAG---TFCFAFAPSPSG----LSIIGNIQQEGIQI 479
            V         L L   +  +PV  AG   T C A   S       ++++GN QQ    +
Sbjct: 394 EVRLSLQNNVRLELRFEHLFVPVSAAGNEVTVCLAILSSQKSGFGKINVLGNYQQSNYLV 453

Query: 480 SFDGANGFVGFGPNVC 495
            +D     VGF    C
Sbjct: 454 EYDNERSRVGFERADC 469


>gi|356528627|ref|XP_003532901.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 430

 Score =  166 bits (421), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 113/357 (31%), Positives = 164/357 (45%), Gaps = 30/357 (8%)

Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSS 213
           GEY +R  +G+P   +  + D+GSD+ W+QC PC  CY Q  P+FDP  S+++  V C S
Sbjct: 86  GEYLMRFSLGTPSVERLAIFDTGSDLSWLQCTPCKTCYPQEAPLFDPTQSSTYVDVPCES 145

Query: 214 AVCDRLENAGCHAG---RCRYEVSYGDGSYTKGTLALETLTI--------GRTVVKNVAI 262
             C          G   +C Y   YG  S+T G L  +T++         G T  K+V  
Sbjct: 146 QPCTLFPQNQRECGSSKQCIYLHQYGTDSFTIGRLGYDTISFSSTGMGQGGATFPKSV-F 204

Query: 263 GCGHKNQGMF---VGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGR 319
           GC   +   F     A G +GLG G +SL  QLG Q G  FSYC+V   + S+G L FG 
Sbjct: 205 GCAFYSNFTFKISTKANGFVGLGPGPLSLASQLGDQIGHKFSYCMVPFSSTSTGKLKFGS 264

Query: 320 EALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTA 379
            A        P + NP  PS+Y + L G+ VG  ++        LT      +++D+   
Sbjct: 265 MAPTNEVVSTPFMINPSYPSYYVLNLEGITVGQKKV--------LTGQIGGNIIIDSVPI 316

Query: 380 VTRLPTPAYEAFRDAFVAQTGNLPRASGV-SIFDTCYNLSGFVSVRVPTVSFYFSGGPVL 438
           +T L    Y  F  + V +  N+  A    + F+ C  +    ++  P   F+F+G  V+
Sbjct: 317 LTHLEQGIYTDFISS-VKEAINVEVAEDAPTPFEYC--VRNPTNLNFPEFVFHFTGADVV 373

Query: 439 TLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
             P + F+    D    C    PS  G+SI GN  Q   Q+ +D     V F P  C
Sbjct: 374 LGPKNMFI--ALDNNLVCMTVVPS-KGISIFGNWAQVNFQVEYDLGEKKVSFAPTNC 427


>gi|359476195|ref|XP_002268758.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
 gi|296082174|emb|CBI21179.3| unnamed protein product [Vitis vinifera]
          Length = 460

 Score =  166 bits (420), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 115/347 (33%), Positives = 173/347 (49%), Gaps = 27/347 (7%)

Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSS 213
           G + V +G G+P +   ++ID+GSD  W+QC  CS     +   F+P+ S+S+S  SC  
Sbjct: 127 GLFLVNVGFGTPQQKFNLIIDTGSDTTWIQCNSCSLGNCHNKKTFNPSLSSSYSNRSCIP 186

Query: 214 AVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFV 273
           +                Y + Y D SY+KG    + +T+   V      GCG    G F 
Sbjct: 187 ST------------DTNYTMKYEDNSYSKGVFVCDEVTLKPDVFPKFQFGCGDSGGGEFG 234

Query: 274 GAAGLLGLGGGSM-SLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAW-VPL 331
            A+G+LGL  G   SL+ Q   +    FSYC   +   + GSL+FG +A+    +     
Sbjct: 235 TASGVLGLAKGEQYSLISQTASKFKKKFSYCFPPK-EHTLGSLLFGEKAISASPSLKFTQ 293

Query: 332 VRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAF 391
           + NP +   Y+V L G+ V   R+ +S  LF        G ++D+GT +TRLPT AYEA 
Sbjct: 294 LLNPPSGLGYFVELIGISVAKKRLNVSSSLF-----ASPGTIIDSGTVITRLPTAAYEAL 348

Query: 392 RDAFVAQTGNLPRASGV---SIFDTCYNLSGF--VSVRVPTVSFYFSGGPVLTLPASNFL 446
           R AF  +  + P  S      + DTCYNL G    ++++P +  +F G   ++L  S  L
Sbjct: 349 RTAFQQEMLHCPSISPPPQEKLLDTCYNLKGCGGRNIKLPEIVLHFVGEVDVSLHPSGIL 408

Query: 447 IPVDDAGTFCFAFA--PSPSGLSIIGNIQQEGIQISFDGANGFVGFG 491
               D    C AFA   +PS ++IIGN QQ  +++ +D   G +GFG
Sbjct: 409 WANGDLTQACLAFARKSNPSHVTIIGNRQQVSLKVVYDIEGGRLGFG 455


>gi|302780643|ref|XP_002972096.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
 gi|300160395|gb|EFJ27013.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
          Length = 393

 Score =  166 bits (420), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 130/408 (31%), Positives = 197/408 (48%), Gaps = 39/408 (9%)

Query: 103 RHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGV 162
           +   +  A + +   RV  +  R     A+++        TDV S +    G Y + I V
Sbjct: 7   KRSEAIRALVAKSHARVRWMAAR-----ANSSSWSSMAGTTDVESPLHPDGGGYVMDISV 61

Query: 163 GSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENA 222
           G+P +    + D+GSD+VWVQ +PC+ C   +  +FDP  S++F  + CSS +C  L  +
Sbjct: 62  GTPGKRFRAIADTGSDLVWVQSEPCTGCSGGT--IFDPRQSSTFREMDCSSQLCAELPGS 119

Query: 223 GCHAGR--CRYEVSYG----DGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAA 276
            C  G   C Y   YG    +G + + T++L T + G     + A+GCG  N G F G  
Sbjct: 120 -CEPGSSTCSYSYEYGSGETEGEFARDTISLGTTSDGSQKFPSFAVGCGMVNSG-FDGVD 177

Query: 277 GLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGS-LVFGREALPVGAAWVPLVRNP 335
           GL+GLG G +SL  QL       FSYCLV   + S  S L+FG  A   G         P
Sbjct: 178 GLVGLGQGPVSLTSQLSAAIDSKFSYCLVDINSQSESSPLLFGPSAALHGTGIQSTKITP 237

Query: 336 RA---PSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDG-VVMDTGTAVTRLPTPAYEAF 391
            +   P++Y + ++G+ V G              MG  G  ++D+GT +T +P+  Y   
Sbjct: 238 PSDTYPTYYLLTVNGIAVAGQ------------TMGSPGTTIIDSGTTLTYVPSGVYGRV 285

Query: 392 RDAFVAQTGNLPRASGVSI-FDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVD 450
                +    LPR  G S+  D CY+ S   + + P ++   +G   +T P+SN+ + VD
Sbjct: 286 LSRMESMV-TLPRVDGSSMGLDLCYDRSSNRNYKFPALTIRLAGA-TMTPPSSNYFLVVD 343

Query: 451 DAG-TFCFAFAPSPSGL--SIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           D+G T C A   S SGL  SIIGN+ Q+G  I +D  +  + F    C
Sbjct: 344 DSGDTVCLAMG-SASGLPVSIIGNVMQQGYHILYDRGSSELSFVQAKC 390


>gi|356495752|ref|XP_003516737.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 396

 Score =  166 bits (420), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 127/369 (34%), Positives = 176/369 (47%), Gaps = 32/369 (8%)

Query: 145 VVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSA 204
           V + +   +G+Y +++ +G+PP   Y ++D+GSD+VW QC PC  CY+Q  P+F+P  S 
Sbjct: 39  VFTRVTSNNGDYLMKLTLGTPPVDVYGLVDTGSDLVWAQCTPCQGCYRQKSPMFEPLRSN 98

Query: 205 SFSGVSCSSAVCDRLENAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRT-----VVK 258
           +++ + C S  C+ L    C   + C Y  +Y D S TKG LA ET+T   T     VV 
Sbjct: 99  TYTPIPCDSEECNSLFGHSCSPQKLCAYSYAYADSSVTKGVLARETVTFSSTDGEPVVVG 158

Query: 259 NVAIGCGHKNQGMF-VGAAGLLGLGGGSMSLVGQLGGQTGGA-FSYCLVS--RGTGSSGS 314
           ++  GCGH N G F     G++GLGGG +SLV Q G   G   FS CLV       + G+
Sbjct: 159 DIVFGCGHSNSGTFNENDMGIIGLGGGPLSLVSQFGNLYGSKRFSQCLVPFHADPHTLGT 218

Query: 315 LVFG--REALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGV 372
           + FG   +    G A  PLV      + Y V L G+ VG   +      F  ++M   G 
Sbjct: 219 ISFGDASDVSGEGVAATPLVSE-EGQTPYLVTLEGISVGDTFVS-----FNSSEMLSKGN 272

Query: 373 VM-DTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSI-FDTCY----NLSGFVSVRVP 426
           +M D+GT  T LP   Y+        Q+  LP      +    CY    NL G      P
Sbjct: 273 IMIDSGTPATYLPQEFYDRLVKELKVQSNMLPIDDDPDLGTQLCYRSETNLEG------P 326

Query: 427 TVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANG 486
            +  +F G  V  +P   F+ P D  G FCFA A +  G  I GN  Q  + I FD    
Sbjct: 327 ILIAHFEGADVQLMPIQTFIPPKD--GVFCFAMAGTTDGEYIFGNFAQSNVLIGFDLDRK 384

Query: 487 FVGFGPNVC 495
            V F    C
Sbjct: 385 TVSFKATDC 393


>gi|297810815|ref|XP_002873291.1| hypothetical protein ARALYDRAFT_487523 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297319128|gb|EFH49550.1| hypothetical protein ARALYDRAFT_487523 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 439

 Score =  166 bits (419), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 121/352 (34%), Positives = 174/352 (49%), Gaps = 18/352 (5%)

Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
           S  Y V++ +G+P +   + +D+ SD+ W+ C  C  C   S+  F PA S SF  VSCS
Sbjct: 96  STTYIVKVLIGTPAQPLLLAMDTSSDVAWIPCSGCVGC--PSNTAFSPAKSTSFKNVSCS 153

Query: 213 SAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQG-- 270
           +  C ++ N  C A  C + ++YG  S     L+ +T+ +    +K    GC +K  G  
Sbjct: 154 APQCKQVPNPACGARACSFNLTYGSSSIA-ANLSQDTIRLAADPIKAFTFGCVNKVAGGG 212

Query: 271 MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALPVGAAWV 329
                 GLLGLG G +SL+ Q        FSYCL S R    SGSL  G  + P    + 
Sbjct: 213 TIPPPQGLLGLGRGPLSLMSQAQSVYKSTFSYCLPSFRSLTFSGSLRLGPTSQPQRVKYT 272

Query: 330 PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYE 389
            L+RNPR  S YYV L  + VG   + +             G + D+GT  TRL  P YE
Sbjct: 273 QLLRNPRRSSLYYVNLVAIRVGRKVVDLPPAAIAFNPSTGAGTIFDSGTVYTRLAKPVYE 332

Query: 390 AFRDAFVAQTGNLPRASGVSI--FDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLI 447
           A R+ F  +    P A   S+  FDTCY  SG   V+VPT++F F G   +T+PA N ++
Sbjct: 333 AVRNEFRKRVKP-PTAVVTSLGGFDTCY--SG--QVKVPTITFMFKGV-NMTMPADNLML 386

Query: 448 PVDDAGTFCFAFAPSP----SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
                 T C A A +P    S +++I ++QQ+  ++  D  NG +G     C
Sbjct: 387 HSTAGSTSCLAMASAPENVNSVVNVIASMQQQNHRVLIDVPNGRLGLARERC 438


>gi|302784853|ref|XP_002974198.1| hypothetical protein SELMODRAFT_54030 [Selaginella moellendorffii]
 gi|300157796|gb|EFJ24420.1| hypothetical protein SELMODRAFT_54030 [Selaginella moellendorffii]
          Length = 359

 Score =  164 bits (416), Expect = 7e-38,   Method: Compositional matrix adjust.
 Identities = 116/354 (32%), Positives = 172/354 (48%), Gaps = 26/354 (7%)

Query: 152 GSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQC--YKQSDPVFDPADSASFSGV 209
           G GEY + + +G+PP+    +ID+GSD+VW++C  C  C      + +F    S+S+  +
Sbjct: 1   GEGEYMMELSIGTPPQLIPAMIDTGSDLVWLKCDNCDHCDLDHHGETIFFSDASSSYKKL 60

Query: 210 SCSSAVCDRLENAG----CHAGRCRYEVSYGDGSYTKGTLALETLTI--------GRTVV 257
            C+S  C  + +AG    C    C+Y+  YGDGS T G +  + ++          R+  
Sbjct: 61  PCNSTHCSGMSSAGIGPRCEE-TCKYKYEYGDGSRTSGDVGSDRISFRSHGAGEDHRSFF 119

Query: 258 KNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGS--L 315
                GC  K +G +    GL+GLG  S SL+ QLG + G  FSYCLVS  +  S    L
Sbjct: 120 DGFLFGCARKLKGDWNFTQGLIGLGQKSHSLIQQLGDKLGYKFSYCLVSYDSPPSAKSFL 179

Query: 316 VFGREALPVGAAWV--PLVRNPRAP-SFYYVGLSGLGVGGMRIPI-SEDLFRLTQMGD-- 369
             G  A   G   V  P++       + YYV L  + +GG+ + +  ++    T +G   
Sbjct: 180 FLGSSAALRGHDVVSTPILHGDHLDQTLYYVDLQSITIGGVPVVVYDKESGHNTSVGPFL 239

Query: 370 -DGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTV 428
            +  V+D+GT  T L  P YEA R +   Q   LP     +  D C+N SG  S   P+V
Sbjct: 240 ANKTVIDSGTTYTLLTPPVYEAMRKSIEEQV-ILPTLGNSAGLDLCFNSSGDTSYGFPSV 298

Query: 429 SFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFD 482
           +FYF+    L LP  N +  V      C +   S   LSIIGN+QQ+   I +D
Sbjct: 299 TFYFANQVQLVLPFEN-IFQVTSRDVVCLSMDSSGGDLSIIGNMQQQNFHILYD 351


>gi|356515690|ref|XP_003526531.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 439

 Score =  164 bits (416), Expect = 8e-38,   Method: Compositional matrix adjust.
 Identities = 123/359 (34%), Positives = 173/359 (48%), Gaps = 16/359 (4%)

Query: 145 VVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSA 204
           + SG     G Y VR+ +G+P +  +MV+D+ +D  +V C  C+ C   SD  F P  S 
Sbjct: 88  IASGQAFNIGNYVVRVKLGTPGQLLFMVLDTSTDEAFVPCSGCTGC---SDTTFSPKAST 144

Query: 205 SFSGVSCSSAVCDRLENAGCHA---GRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVA 261
           S+  + CS   C ++    C A   G C +  SY   S++  TL  + L +   V+   +
Sbjct: 145 SYGPLDCSVPQCGQVRGLSCPATGTGACSFNQSYAGSSFS-ATLVQDALRLATDVIPYYS 203

Query: 262 IGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGRE 320
            GC +   G  V A GLLGLG G +SL+ Q G    G FSYCL S +    SGSL  G  
Sbjct: 204 FGCVNAITGASVPAQGLLGLGRGPLSLLSQSGSNYSGIFSYCLPSFKSYYFSGSLKLGPV 263

Query: 321 ALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAV 380
             P      PL+R+P  PS YYV  +G+ VG + +P   +          G ++D+GT +
Sbjct: 264 GQPKSIRTTPLLRSPHRPSLYYVNFTGISVGRVLVPFPSEYLGFNPNTGSGTIIDSGTVI 323

Query: 381 TRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTL 440
           TR   P Y A R+ F  Q G     S +  FDTC+  +       P ++ +F G   L L
Sbjct: 324 TRFVEPVYNAVREEFRKQVGGTTFTS-IGAFDTCFVKT--YETLAPPITLHFEGLD-LKL 379

Query: 441 PASNFLIPVDDAGTFCFAFAPSP----SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           P  N LI        C A A +P    S L++I N QQ+ ++I FD  N  VG    VC
Sbjct: 380 PLENSLIHSSAGSLACLAMAAAPDNVNSVLNVIANFQQQNLRILFDIVNNKVGIAREVC 438


>gi|359476206|ref|XP_002262837.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 462

 Score =  164 bits (415), Expect = 9e-38,   Method: Compositional matrix adjust.
 Identities = 124/389 (31%), Positives = 190/389 (48%), Gaps = 32/389 (8%)

Query: 114 RDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVI 173
           +D  RV ++  R+ G     +  E +D G+          G + V +G G P ++  ++I
Sbjct: 90  QDRSRVRSINARILG---QYSTEESKDGGSPESMHSLNEDGFFLVNVGFGKPQQNLNLII 146

Query: 174 DSGSDIVWVQCQPCS--QCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCRY 231
           D+GSD  W++C  CS   C+ +  P F+P+ S+S+S  SC  +             +  Y
Sbjct: 147 DTGSDTTWIRCNSCSLGNCHNKKIPTFNPSLSSSYSNRSCIPST------------KTNY 194

Query: 232 EVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSM-SLVG 290
            ++Y D SY+KG    + +T+   V      GCG    G F  A+G+LGL  G   SL+ 
Sbjct: 195 TMNYEDNSYSKGVFVCDEVTLKPDVFPKFQFGCGDSGGGDFGSASGVLGLAQGEQYSLIS 254

Query: 291 QLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAW-VPLVRNPRAPSFYYVGLSGLG 349
           Q   +    FSYC       + GSL+FG +A+    +     + NP + S Y+V L G+ 
Sbjct: 255 QTASKFKKKFSYCF-PHNENTRGSLLFGEKAISASPSLKFTRLLNPSSGSVYFVELIGIS 313

Query: 350 VGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGV- 408
           V   R+ +S  LF        G ++D+GT +T LPT AYEA R AF  +  + P  S   
Sbjct: 314 VAKKRLNVSSSLF-----ASPGTIIDSGTVITHLPTAAYEALRTAFQQEMLHCPSVSPPP 368

Query: 409 --SIFDTCYNLSGF--VSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPS-- 462
                DTCYNL G    ++++P +  +F G   ++L  S  L    D    C AFA    
Sbjct: 369 QEKPLDTCYNLKGCGGRNIKLPEIVLHFVGEVDVSLHPSGILWANGDLTQACLAFARKSH 428

Query: 463 PSGLSIIGNIQQEGIQISFDGANGFVGFG 491
           PS ++IIGN QQ  +++ +D   G +GFG
Sbjct: 429 PSHVTIIGNRQQVSLKVVYDIEGGRLGFG 457


>gi|357132618|ref|XP_003567926.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 468

 Score =  164 bits (415), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 122/388 (31%), Positives = 184/388 (47%), Gaps = 35/388 (9%)

Query: 137 EVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQC-----QPCSQCY 191
           E   F   + SG   G+G+YFVR+ VG+P +   +V D+GSD+ WV+C        S   
Sbjct: 85  ESSAFAMPLTSGAYTGTGQYFVRLRVGTPAQPFVLVADTGSDLTWVKCSSPSSSSSSPAA 144

Query: 192 KQSDPVFDPADSASFSGVSCSSAVCD-----RLENAGCHAGRCRYEVSYGDGSYTKGTLA 246
                VF PA S S+S + C S  C       L N       C Y+  Y D S  +G + 
Sbjct: 145 SPPQRVFRPAGSKSWSPLPCDSDTCKSYVPFSLANCSSPPDPCSYDYRYKDNSSARGVVG 204

Query: 247 LETLTIG--------RTVVKNVAIGCGHKNQGM-FVGAAGLLGLGGGSMSLVGQLGGQTG 297
           L++ T+         +  ++ V +GC     G  F  + G+L LG  ++S   +   + G
Sbjct: 205 LDSATVSLSGNDGTRKAKLQEVVLGCTTSYDGQSFKSSDGVLSLGNSNISFASRAASRFG 264

Query: 298 GAFSYCLVSR--GTGSSGSLVFGREALPVGAAW----VPLV--RNPRAPSFYYVGLSGLG 349
           G FSYCLV       ++  L FG      G        PLV   + R   FY+V +  + 
Sbjct: 265 GRFSYCLVDHLAPRNATSFLTFGNGDSSPGDDSSSRRTPLVLLEDARTRPFYFVSVDAVT 324

Query: 350 VGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVS 409
           V G R+ I  D++   + G  G ++D+GT++T L TPAY+A   A   Q   +PR + + 
Sbjct: 325 VAGERLEILPDVWDFRKNG--GAILDSGTSLTILATPAYDAVVKAISKQFAGVPRVN-MD 381

Query: 410 IFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDA-GTFCFAFAP-SPSGLS 467
            F+ CYN +G VS  +P +   F+G   L  P  +++I  D A G  C      +  G+S
Sbjct: 382 PFEYCYNWTG-VSAEIPRMELRFAGAATLAPPGKSYVI--DTAPGVKCIGVVEGAWPGVS 438

Query: 468 IIGNIQQEGIQISFDGANGFVGFGPNVC 495
           +IGNI Q+     FD AN ++ F  + C
Sbjct: 439 VIGNILQQEHLWEFDLANRWLRFKQSRC 466


>gi|302821814|ref|XP_002992568.1| hypothetical protein SELMODRAFT_46291 [Selaginella moellendorffii]
 gi|300139637|gb|EFJ06374.1| hypothetical protein SELMODRAFT_46291 [Selaginella moellendorffii]
          Length = 368

 Score =  164 bits (415), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 120/369 (32%), Positives = 181/369 (49%), Gaps = 42/369 (11%)

Query: 158 VRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCD 217
           +++G+GS  ++   +ID+GS+ V VQC        +S PVFDPA S S+  V C S +C 
Sbjct: 1   MQLGIGSLQKNLSAIIDTGSEAVLVQCG------SRSRPVFDPAASQSYRQVPCISQLCL 54

Query: 218 RLENAGCH---------AGRCRYEVSYGDGSYTKGTLALETLTIGRT-------VVKNVA 261
            ++    +         +  C Y +SYGD   + G  + + + +  T         ++VA
Sbjct: 55  AVQQQTSNGSSQPCVNSSAACTYSLSYGDSRNSTGDFSQDVIFLNSTNSSSQAVQFRDVA 114

Query: 262 IGCGHKNQGMFV--GAAGLLGLGGGSMSLVGQLGGQTGGA-FSYCLVSR--GTGSSGSLV 316
            GC H  QG  V  G+ G++G   G++SL  QL  + GG+ FSYC  S+     ++G + 
Sbjct: 115 FGCAHSPQGFLVDLGSLGIVGFNRGNLSLPSQLKDRLGGSKFSYCFPSQPWQPRATGVIF 174

Query: 317 FGREALPVG-AAWVPLVRNPRAPS---FYYVGLSGLGVGGMRIPISEDLFRLT-QMGDDG 371
            G   L     ++ PL+ NP  P+    YYVGL+ + V G  + I E  F+L    GD G
Sbjct: 175 LGDSGLSKSKVSYTPLLDNPVTPARSQLYYVGLTSISVDGKTLAIPESAFKLDPSTGDGG 234

Query: 372 VVMDTGTAVTRLPTPAYEAFRDAFVA--QTGNLPRASGVSIFDTCYNLSGFVSVR-VPTV 428
            V+D+GT  TR+   AY AFR+AF A  ++G   +    + FD CYN+S   S+  VP V
Sbjct: 235 TVLDSGTTFTRVVDDAYTAFRNAFAASNRSGLRKKVGAAAGFDDCYNISAGSSLPGVPEV 294

Query: 429 SFYFSGGPVLTLPASNFLIPVDDAG---TFCFAFAPSPSG----LSIIGNIQQEGIQISF 481
                    L L   +  +PV  AG   T C A   S       ++++GN QQ    + +
Sbjct: 295 RLSLQNNVRLELRFEHLFVPVSAAGNEVTVCLAILSSQKSGFGKINVLGNYQQSNYLVEY 354

Query: 482 DGANGFVGF 490
           D     VGF
Sbjct: 355 DNERSRVGF 363


>gi|302781726|ref|XP_002972637.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
 gi|300160104|gb|EFJ26723.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
          Length = 393

 Score =  163 bits (413), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 125/364 (34%), Positives = 183/364 (50%), Gaps = 36/364 (9%)

Query: 143 TDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPAD 202
           TDV S +    G Y + I VG+P +    + D+GSD+VWVQ +PC+ C   +  +FDP  
Sbjct: 42  TDVESPLHPDGGGYVMDISVGTPGKRFRAIADTGSDLVWVQSEPCTGCSGGT--IFDPRQ 99

Query: 203 SASFSGVSCSSAVCDRLENAGCHAGR--CRYEVSYGDGSYTKGTLALETLTIGRT----- 255
           S++F  + CSS +C  L  + C  G   C Y   YG G  T+G  A +T+++G T     
Sbjct: 100 SSTFREMDCSSQLCTELPGS-CEPGSSACSYSYEYGSGE-TEGEFARDTISLGTTSGGSQ 157

Query: 256 VVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGS- 314
              + A+GCG  N G F G  GL+GLG G +SL  QL       FSYCLV   + S  S 
Sbjct: 158 KFPSFAVGCGMVNSG-FDGVDGLVGLGQGPVSLTSQLSAAIDSKFSYCLVDINSQSESSP 216

Query: 315 LVFGREALPVGAAWVPLVRNPRA---PSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDG 371
           L+FG  A   G         P +   P++Y + ++G+ V G              MG  G
Sbjct: 217 LLFGPSAALHGTGIQSTKITPPSDTYPTYYLLTVNGIAVAGQ------------TMGSPG 264

Query: 372 -VVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSI-FDTCYNLSGFVSVRVPTVS 429
             ++D+GT +T +P+  Y        +    LPR  G S+  D CY+ S   + + P ++
Sbjct: 265 TTIIDSGTTLTYVPSGVYGRVLSRMESMV-TLPRVDGSSMGLDLCYDRSSNRNYKFPALT 323

Query: 430 FYFSGGPVLTLPASNFLIPVDDAG-TFCFAFAPSPSGL--SIIGNIQQEGIQISFDGANG 486
              +G   +T P+SN+ + VDD+G T C A   S  GL  SIIGN+ Q+G  I +D  + 
Sbjct: 324 IRLAGA-TMTPPSSNYFLVVDDSGDTVCLAMG-SAGGLPVSIIGNVMQQGYHILYDRGSS 381

Query: 487 FVGF 490
            + F
Sbjct: 382 ELSF 385


>gi|302824729|ref|XP_002994005.1| hypothetical protein SELMODRAFT_431957 [Selaginella moellendorffii]
 gi|300138167|gb|EFJ04945.1| hypothetical protein SELMODRAFT_431957 [Selaginella moellendorffii]
          Length = 462

 Score =  163 bits (413), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 115/370 (31%), Positives = 188/370 (50%), Gaps = 36/370 (9%)

Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSC-S 212
           GEY+  I +GSP +   +++D+GS++ W+QC PC  C    D ++D A SAS+  V+C +
Sbjct: 98  GEYYTSIKLGSPGQEAILIVDTGSELTWLQCLPCKVCAPSVDTIYDAARSASYRPVTCNN 157

Query: 213 SAVCDRLEN---AGCHAG-RCRYEVSYGDGSYTKGTLALETLTIGRTV------VKNVAI 262
           S +C        A C  G +C++   YGDGS++ G+L+ +TL +   V      V++ A 
Sbjct: 158 SQLCSNSSQGTYAYCARGSQCQFAAFYGDGSFSYGSLSTDTLIMETVVGGKPVTVQDFAF 217

Query: 263 GCGHKNQGMF-VGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGT--GSSGSLVFGR 319
           GC   +  +   GA+G+LGL  G M+L  QLG + G  FS+C   R +   S+G + FG 
Sbjct: 218 GCAQGDLELVPTGASGILGLNAGKMALPMQLGQRFGWKFSHCFPDRSSHLNSTGVVFFGN 277

Query: 320 EALP---VGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDT 376
             LP   V    V L  +     FY+V L G+ +       S +L  L +     V++D+
Sbjct: 278 AELPHEQVQYTSVALTNSELQRKFYHVALKGVSIN------SHELVFLPR--GSVVILDS 329

Query: 377 GTAVTRLPTPAYEAFRDAFVA-QTGNLPRASGVSIFD--TCYNLSG----FVSVRVPTVS 429
           G++ +    P +   R+AF+  +  +L    G S  D  TC+ +S      +   +P++S
Sbjct: 330 GSSFSSFVRPFHSQLREAFLKHRPPSLKHLEGDSFGDLGTCFKVSNDDIDELHRTLPSLS 389

Query: 430 FYFSGGPVLTLPASNFLIPV---DDAGTFCFAFAP-SPSGLSIIGNIQQEGIQISFDGAN 485
             F  G  + +P+   L+PV    +    CFAF    P+ +++IGN QQ+ + + +D   
Sbjct: 390 LVFEDGVTIGIPSIGVLLPVARFQNHVKMCFAFEDGGPNPVNVIGNYQQQNLWVEYDIQR 449

Query: 486 GFVGFGPNVC 495
             VGF    C
Sbjct: 450 SRVGFARASC 459


>gi|125602787|gb|EAZ42112.1| hypothetical protein OsJ_26672 [Oryza sativa Japonica Group]
          Length = 477

 Score =  162 bits (411), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 110/346 (31%), Positives = 159/346 (45%), Gaps = 70/346 (20%)

Query: 171 MVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGC------ 224
           +++D+GSD+ WVQC+PCS CY Q DP+FDP+ SAS++ V C+++ C+    A        
Sbjct: 178 VIVDTGSDLTWVQCKPCSVCYAQRDPLFDPSGSASYAAVPCNASACEASLKAATGVPGSC 237

Query: 225 ----------HAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVG 274
                      + RC Y ++YGDGS+++G LA +T+ +G   V     GCG  N+G+F G
Sbjct: 238 ATVGGGGGGGKSERCYYSLAYGDGSFSRGVLATDTVALGGASVDGFVFGCGLSNRGLFGG 297

Query: 275 AAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRN 334
            AGL+GLG       G L G                           LP GA        
Sbjct: 298 TAGLMGLGPD-----GALAG---------------------------LPDGAP------- 318

Query: 335 PRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDA 394
              P FY++ ++G  V                +G   V++D+GT +TRL    Y A R  
Sbjct: 319 ---PPFYFMNVTGASV-------GGAAVAAAGLGAANVLLDSGTVITRLAPSVYRAVRAE 368

Query: 395 FVAQTG--NLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFL-IPVDD 451
           F  Q G    P A   S+ D CYNL+G   V+VP ++    GG  +T+ A+  L +   D
Sbjct: 369 FARQFGAERYPAAPPFSLLDACYNLTGHDEVKVPLLTLRLEGGADMTVDAAGMLFMARKD 428

Query: 452 AGTFCFAFAPS--PSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
               C A A         IIGN QQ+  ++ +D     +GF    C
Sbjct: 429 GSQVCLAMASLSFEDQTPIIGNYQQKNKRVVYDTVGSRLGFADEDC 474


>gi|115476830|ref|NP_001062011.1| Os08g0469100 [Oryza sativa Japonica Group]
 gi|42407408|dbj|BAD09566.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
 gi|113623980|dbj|BAF23925.1| Os08g0469100 [Oryza sativa Japonica Group]
          Length = 373

 Score =  162 bits (411), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 118/370 (31%), Positives = 180/370 (48%), Gaps = 36/370 (9%)

Query: 150 DQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQ----SDPVFDPADSAS 205
           DQG   + + +G+  P +   +++D+GSD++W QC+  S         S PV+DP +S++
Sbjct: 13  DQG---HSLTVGIVQPRK---LIVDTGSDLIWTQCKLSSSTAAAARHGSPPVYDPGESST 66

Query: 206 FSGVSCSSAVCD--RLENAGCHA-GRCRYEVSYGDGSYTKGTLALETLTIG--RTVVKNV 260
           F+ + CS  +C   +     C +  RC YE  YG  +   G LA ET T G  R V   +
Sbjct: 67  FAFLPCSDRLCQEGQFSFKNCTSKNRCVYEDVYGSAAAV-GVLASETFTFGARRAVSLRL 125

Query: 261 AIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFG-- 318
             GCG  + G  +GA G+LGL   S+SL+ QL  Q    FSYCL       +  L+FG  
Sbjct: 126 GFGCGALSAGSLIGATGILGLSPESLSLITQLKIQR---FSYCLTPFADKKTSPLLFGAM 182

Query: 319 ----REALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVM 374
               R           +V NP    +YYV L G+ +G  R+ +      +   G  G ++
Sbjct: 183 ADLSRHKTTRPIQTTAIVSNPVETVYYYVPLVGISLGHKRLAVPAASLAMRPDGGGGTIV 242

Query: 375 DTGTAVTRLPTPAYEAFRDAFVAQTGNLPRAS-GVSIFDTCYNL------SGFVSVRVPT 427
           D+G+ V  L   A+EA ++A V     LP A+  V  ++ C+ L      +   +V+VP 
Sbjct: 243 DSGSTVAYLVEAAFEAVKEA-VMDVVRLPVANRTVEDYELCFVLPRRTAAAAMEAVQVPP 301

Query: 428 VSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSP--SGLSIIGNIQQEGIQISFDGAN 485
           +  +F GG  + LP  N+      AG  C A   +   SG+SIIGN+QQ+ + + FD  +
Sbjct: 302 LVLHFDGGAAMVLPRDNYFQE-PRAGLMCLAVGKTTDGSGVSIIGNVQQQNMHVLFDVQH 360

Query: 486 GFVGFGPNVC 495
               F P  C
Sbjct: 361 HKFSFAPTQC 370


>gi|242073262|ref|XP_002446567.1| hypothetical protein SORBIDRAFT_06g018180 [Sorghum bicolor]
 gi|241937750|gb|EES10895.1| hypothetical protein SORBIDRAFT_06g018180 [Sorghum bicolor]
          Length = 453

 Score =  162 bits (411), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 121/375 (32%), Positives = 176/375 (46%), Gaps = 44/375 (11%)

Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSS 213
           GEY V++G+G+P       ID+ SD+VW+QCQPC  CY+Q DP+F+P  S+S++ V CSS
Sbjct: 86  GEYLVKLGIGTPQHYFSAAIDTASDLVWLQCQPCVSCYRQLDPIFNPRLSSSYAVVPCSS 145

Query: 214 AVCDRLENAGCHAGR---CRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQ- 269
             C +L+   C       CRY   Y   + T GTLA++ L +G  V   V +GC   +  
Sbjct: 146 DTCSQLDGHRCDEDDDQACRYNYKYSGNAVTNGTLAIDKLAVGGNVFHAVVLGCSDSSVG 205

Query: 270 GMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGA--- 326
           G    A+GL+GL  G +SL+ QL  +    F YCL    + + G LV G  A   GA   
Sbjct: 206 GPPPQASGLVGLARGPLSLLSQLSVRR---FMYCLPPPMSRTPGKLVLGAGA---GADAV 259

Query: 327 ------AWVPLVRNPRAPSFYYVGLSGLGVGG-----MRIPISEDLFRLTQMGDD----- 370
                   V +  + R PS+YY+   GL VG      +R P S         G       
Sbjct: 260 RNVSDRVTVTMSSSTRYPSYYYLNFDGLAVGDQTPGTIRRPTSPPATGGGVGGGGGDGGS 319

Query: 371 -----GVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSI--FDTCYNLS---GF 420
                G+++D  + ++ L    Y+   D    +   LPRA+  +    D C+ L    G 
Sbjct: 320 GANAYGMIVDVASTISFLEASLYDELADDLEEEI-RLPRATPSTRLGLDLCFILPEGVGI 378

Query: 421 VSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQIS 480
             V VPTVS  F G   L L      +  +D    C     + SG+SI+GN QQ+ + + 
Sbjct: 379 DRVYVPTVSMSFDGR-WLELERDRLFL--EDGRMMCLMIGRT-SGVSILGNYQQQNMHVL 434

Query: 481 FDGANGFVGFGPNVC 495
           ++   G + F    C
Sbjct: 435 YNLRRGKITFAKASC 449


>gi|413922067|gb|AFW61999.1| hypothetical protein ZEAMMB73_694403, partial [Zea mays]
          Length = 328

 Score =  162 bits (411), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 86/214 (40%), Positives = 128/214 (59%), Gaps = 23/214 (10%)

Query: 163 GSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVC-DRLEN 221
           GSP  +  +++D+GSD+ WVQC+PCS CY Q DP+FDPA SA+++ V C+++ C D L  
Sbjct: 103 GSPAANLTVIVDTGSDLTWVQCKPCSACYAQRDPLFDPAGSATYAAVRCNASACADSLRA 162

Query: 222 A----------GCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGM 271
           A          G  + +C Y ++YGDGS+++G LA +T+ +G   +     GCG  N+G+
Sbjct: 163 ATGTPGSCGSTGAGSEKCYYALAYGDGSFSRGVLATDTVALGGASLGGFVFGCGLSNRGL 222

Query: 272 FVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTG-SSGSLVFG---------REA 321
           F G AGL+GLG   +SLV Q   + GG FSYCL +  +G +SGSL  G         R  
Sbjct: 223 FGGTAGLMGLGRTELSLVSQTASRYGGVFSYCLPAATSGDASGSLSLGGGDDAASSYRNT 282

Query: 322 LPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRI 355
            PV  A+  ++ +P  P FY++ ++G  VGG  +
Sbjct: 283 TPV--AYTRMIADPAQPPFYFLNVTGAAVGGTAL 314


>gi|242089623|ref|XP_002440644.1| hypothetical protein SORBIDRAFT_09g004500 [Sorghum bicolor]
 gi|241945929|gb|EES19074.1| hypothetical protein SORBIDRAFT_09g004500 [Sorghum bicolor]
          Length = 469

 Score =  162 bits (410), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 128/414 (30%), Positives = 193/414 (46%), Gaps = 30/414 (7%)

Query: 92  SSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVS-GMD 150
           +  TT  +++ +     H R+     R + + +  S   +  + ++     TD V   MD
Sbjct: 40  TDTTTAAINFTQAALESHRRLSFLASRSSQVDKPQSSSASQLSNND-----TDTVPLRMD 94

Query: 151 QGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVS 210
            G G Y +   +G+PP+    + D+GSD++W +C             + P  S++F+ + 
Sbjct: 95  GGGGAYDMEFSIGTPPQKLTALADTGSDLIWTKCDAGGGAAWGGSSSYHPNASSTFTRLP 154

Query: 211 CSSAVCDRLEN---AGCHAG--RCRYEVSYG---DGSYTKGTLALETLTIGRTVVKNVAI 262
           CS  +C  L +   A C AG   C Y+ +YG   D  +T+G L  ET T+G   V  V  
Sbjct: 155 CSDRLCAALRSYSLARCAAGGAECDYKYAYGLGDDPDFTQGFLGSETFTLGGDAVPGVGF 214

Query: 263 GCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREAL 322
           GC    +G +   AGL+GLG G +SLV QL     G F YCL +  + +S  L+FG  A 
Sbjct: 215 GCTTALEGDYGEGAGLVGLGRGPLSLVSQL---DAGTFMYCLTADASKAS-PLLFGALAT 270

Query: 323 PVGA-AWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVT 381
             GA A V       + +FY V L  + +G                    VV D+GT +T
Sbjct: 271 MTGAGAGVQSTGLLASTTFYAVNLRSITIGSATTAGVGGPGG--------VVFDSGTTLT 322

Query: 382 RLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLP 441
            L  PAY   + AF++QT +L    G   F+ CY       + +P +  +F GG  + LP
Sbjct: 323 YLAEPAYTEAKAAFLSQTTSLTPVEGRYGFEACYEKPDSARL-IPAMVLHFDGGADMALP 381

Query: 442 ASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
            +N+++ VDD G  C+    SPS LSIIGNI Q    +  D     + F P  C
Sbjct: 382 VANYVVEVDD-GVVCWVVQRSPS-LSIIGNIMQMNYLVLHDVRKSVLSFQPANC 433


>gi|242079451|ref|XP_002444494.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
 gi|241940844|gb|EES13989.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
          Length = 445

 Score =  162 bits (410), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 114/367 (31%), Positives = 179/367 (48%), Gaps = 32/367 (8%)

Query: 152 GSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSC 211
           G   + + + +G+PP+ + +++D+GSD++W QC+       +  P++DPA S+SF+   C
Sbjct: 85  GRLHHTLTVSIGTPPQPRTLILDTGSDLIWTQCKLFDTRQHREKPLYDPAKSSSFAAAPC 144

Query: 212 SSAVCD--RLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIG--RTVVKNVAIGCGHK 267
              +C+        C   +C Y  +YG  + TKG LA ET T G  R V  ++  GCG  
Sbjct: 145 DGRLCETGSFNTKNCSRNKCIYTYNYGSAT-TKGELASETFTFGEHRRVSVSLDFGCGKL 203

Query: 268 NQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCL---VSRGT------GSSGSLVFG 318
             G   GA+G+LG+    +SLV QL       FSYCL   + R T      G+   L   
Sbjct: 204 TSGSLPGASGILGISPDRLSLVSQLQIPR---FSYCLTPFLDRNTTSHIFFGAMADLSKY 260

Query: 319 REALPVGAAWVPLVRNPRAPS-FYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTG 377
           R   P+      LV NP   + +YYV L G+ VG  R+ +    F + + G  G  +D+G
Sbjct: 261 RTTGPIQT--TSLVTNPDGSNYYYYVPLIGISVGTKRLNVPVSSFAIGRDGSGGTFVDSG 318

Query: 378 TAVTRLPTPAYEAFRDAFVAQTGNLPRASGVS---IFDTCYNLS----GFV--SVRVPTV 428
                LP+   EA ++A V +   LP  +       ++ C+ L     G V  +V+VP +
Sbjct: 319 DTTGMLPSVVMEALKEAMV-EAVKLPVVNATDHGYEYELCFQLPRNGGGAVETAVQVPPL 377

Query: 429 SFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFV 488
            ++F GG  + L   ++++ V  AG  C   +    G +IIGN QQ+ + + FD  N   
Sbjct: 378 VYHFDGGAAMLLRRDSYMVEV-SAGRMCLVISSGARG-AIIGNYQQQNMHVLFDVENHEF 435

Query: 489 GFGPNVC 495
            F P  C
Sbjct: 436 SFAPTQC 442


>gi|225465839|ref|XP_002264668.1| PREDICTED: aspartic proteinase nepenthesin-1-like isoform 2 [Vitis
           vinifera]
          Length = 451

 Score =  162 bits (410), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 131/444 (29%), Positives = 203/444 (45%), Gaps = 41/444 (9%)

Query: 71  NTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGG 130
           N  + +    L+++H     S       + +        A+ +  ++ +++LV R S   
Sbjct: 29  NCETPDQGSTLQVLHVYSPCSPFRPKEPLSWEESVLQMQAKDKARLQFLSSLVARKSVVP 88

Query: 131 ADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQC 190
             + +  VQ+               Y VR  +G+P ++  M +D+ SD+ W+   PC+ C
Sbjct: 89  IASGRQIVQN-------------PTYIVRAKIGTPAQTMLMAMDTSSDVAWI---PCNGC 132

Query: 191 YKQSDPVFDPADSASFSGVSCSSAVCDRL--------------ENAGCHAGRCRYEVSYG 236
              S  +F+   S ++  + C +A C ++                  C  G C + ++YG
Sbjct: 133 LGCSSTLFNSPASTTYKSLGCQAAQCKQVLHLLSPLLTSPSVVPKPTCGGGVCSFNLTYG 192

Query: 237 DGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQT 296
            GS     L+ +T+T+    V   + GC  K  G  + A GLLGLG G +SL+ Q     
Sbjct: 193 -GSSLAANLSQDTITLATDAVPGYSFGCIQKATGGSLPAQGLLGLGRGPLSLLSQTQNLY 251

Query: 297 GGAFSYCLVS-RGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRI 355
              FSYCL S +    SGSL  G    P    + PL++NPR PS Y+V L  + VG   +
Sbjct: 252 QSTFSYCLPSFKSLNFSGSLRLGPVGQPKRIKYTPLLKNPRRPSLYFVNLMAVRVGRRVV 311

Query: 356 PISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCY 415
            +    F        G + D+GT  TRL TPAY A RDAF  + G     + +  FDTCY
Sbjct: 312 DVPPGSFTFNPSTGAGTIFDSGTVFTRLVTPAYIAVRDAFRNRVGRNLTVTSLGGFDTCY 371

Query: 416 NLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSP----SGLSIIGN 471
                V +  PT++F F+G  V TLP  N LI      T C A A +P    S L++I N
Sbjct: 372 T----VPIAAPTITFMFTGMNV-TLPPDNLLIHSTAGSTTCLAMAAAPDNVNSVLNVIAN 426

Query: 472 IQQEGIQISFDGANGFVGFGPNVC 495
           +QQ+  ++ +D  N  +G    +C
Sbjct: 427 LQQQNHRLLYDVPNSRLGVARELC 450


>gi|357465299|ref|XP_003602931.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355491979|gb|AES73182.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 438

 Score =  162 bits (410), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 120/361 (33%), Positives = 179/361 (49%), Gaps = 20/361 (5%)

Query: 145 VVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSA 204
           + SG     G Y VR+ +G+P +  +MV+D+ +D  ++   P S C   S   F P  S 
Sbjct: 87  IASGQAFNIGNYIVRVKIGTPGQLLFMVLDTSTDEAFI---PSSGCIGCSATTFSPNAST 143

Query: 205 SFSGVSCSSAVCDRLENAGCHA---GRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVA 261
           S+  + CS   C ++    C A   G C +  SY   +Y+  TL  ++L +   V+ + +
Sbjct: 144 SYVPLECSVPQCSQVRGLSCPATGSGACSFNKSYAGSTYS-ATLVQDSLRLATDVIPSYS 202

Query: 262 IGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGRE 320
            G  +   G  + A GLLGLG G +SL+ Q G    G FSYCL S +    SGSL  G  
Sbjct: 203 FGSINAISGSSIPAQGLLGLGRGPLSLLSQTGSLYSGVFSYCLPSFKSYYFSGSLKLGPV 262

Query: 321 ALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAV 380
             P      PL+RNPR PS Y+V L+G+ VG + +P  ++L         G ++D+GT +
Sbjct: 263 GQPKSIRTTPLLRNPRRPSLYFVNLTGITVGKVNVPFPKELLAFDVNTGSGTIIDSGTVI 322

Query: 381 TRLPTPAYEAFRDAFVAQ-TGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLT 439
           TR   P Y A RD F  Q TG     S +  FDTC+ +  + ++  P ++ +F+    L 
Sbjct: 323 TRFVEPVYNAVRDEFRKQVTGPF---SSLGAFDTCF-VKNYETL-APAITLHFTDLD-LK 376

Query: 440 LPASNFLIPVDDAGTFCFAFAPSPSG-----LSIIGNIQQEGIQISFDGANGFVGFGPNV 494
           LP  N LI        C A A +P       L++I N QQ+ +++ FD  N  VG    +
Sbjct: 377 LPLENSLIHSSSGSLACLAMASTPKNVNYTVLNVIANYQQQNLRVLFDTVNNKVGIAREL 436

Query: 495 C 495
           C
Sbjct: 437 C 437


>gi|9759559|dbj|BAB11161.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
 gi|21553652|gb|AAM62745.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
 gi|109134179|gb|ABG25087.1| At5g07030 [Arabidopsis thaliana]
          Length = 439

 Score =  162 bits (409), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 120/353 (33%), Positives = 173/353 (49%), Gaps = 20/353 (5%)

Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
           S  Y V+  +G+P +   + +D+ SD+ W+ C  C  C   S+  F PA S SF  VSCS
Sbjct: 96  STTYIVKALIGTPAQPLLLAMDTSSDVAWIPCSGCVGC--PSNTAFSPAKSTSFKNVSCS 153

Query: 213 SAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQG-- 270
           +  C ++ N  C A  C + ++YG  S     L+ +T+ +    +K    GC +K  G  
Sbjct: 154 APQCKQVPNPTCGARACSFNLTYGSSSIA-ANLSQDTIRLAADPIKAFTFGCVNKVAGGG 212

Query: 271 MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALPVGAAWV 329
                 GLLGLG G +SL+ Q        FSYCL S R    SGSL  G  + P    + 
Sbjct: 213 TIPPPQGLLGLGRGPLSLMSQAQSIYKSTFSYCLPSFRSLTFSGSLRLGPTSQPQRVKYT 272

Query: 330 PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYE 389
            L+RNPR  S YYV L  + VG   + +             G + D+GT  TRL  P YE
Sbjct: 273 QLLRNPRRSSLYYVNLVAIRVGRKVVDLPPAAIAFNPSTGAGTIFDSGTVYTRLAKPVYE 332

Query: 390 AFRDAFVAQTGNLPRASGVSI---FDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFL 446
           A R+ F  +    P  + V+    FDTCY  SG   V+VPT++F F G   +T+PA N +
Sbjct: 333 AVRNEFRKRVK--PTTAVVTSLGGFDTCY--SG--QVKVPTITFMFKGV-NMTMPADNLM 385

Query: 447 IPVDDAGTFCFAFAPSP----SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           +      T C A A +P    S +++I ++QQ+  ++  D  NG +G     C
Sbjct: 386 LHSTAGSTSCLAMAAAPENVNSVVNVIASMQQQNHRVLIDVPNGRLGLARERC 438


>gi|79507883|ref|NP_196320.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332003717|gb|AED91100.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 455

 Score =  162 bits (409), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 120/353 (33%), Positives = 173/353 (49%), Gaps = 20/353 (5%)

Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
           S  Y V+  +G+P +   + +D+ SD+ W+ C  C  C   S+  F PA S SF  VSCS
Sbjct: 112 STTYIVKALIGTPAQPLLLAMDTSSDVAWIPCSGCVGC--PSNTAFSPAKSTSFKNVSCS 169

Query: 213 SAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQG-- 270
           +  C ++ N  C A  C + ++YG  S     L+ +T+ +    +K    GC +K  G  
Sbjct: 170 APQCKQVPNPTCGARACSFNLTYGSSSIA-ANLSQDTIRLAADPIKAFTFGCVNKVAGGG 228

Query: 271 MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALPVGAAWV 329
                 GLLGLG G +SL+ Q        FSYCL S R    SGSL  G  + P    + 
Sbjct: 229 TIPPPQGLLGLGRGPLSLMSQAQSIYKSTFSYCLPSFRSLTFSGSLRLGPTSQPQRVKYT 288

Query: 330 PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYE 389
            L+RNPR  S YYV L  + VG   + +             G + D+GT  TRL  P YE
Sbjct: 289 QLLRNPRRSSLYYVNLVAIRVGRKVVDLPPAAIAFNPSTGAGTIFDSGTVYTRLAKPVYE 348

Query: 390 AFRDAFVAQTGNLPRASGVSI---FDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFL 446
           A R+ F  +    P  + V+    FDTCY  SG   V+VPT++F F G   +T+PA N +
Sbjct: 349 AVRNEFRKRVK--PTTAVVTSLGGFDTCY--SG--QVKVPTITFMFKGV-NMTMPADNLM 401

Query: 447 IPVDDAGTFCFAFAPSP----SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           +      T C A A +P    S +++I ++QQ+  ++  D  NG +G     C
Sbjct: 402 LHSTAGSTSCLAMAAAPENVNSVVNVIASMQQQNHRVLIDVPNGRLGLARERC 454


>gi|357143657|ref|XP_003573000.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-2-like [Brachypodium distachyon]
          Length = 464

 Score =  161 bits (408), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 145/479 (30%), Positives = 217/479 (45%), Gaps = 66/479 (13%)

Query: 54  MSQYNELFERHNNISSSNTSSDEARWNLELVHRDKM--SSSSNTTNNMHYHRH------- 104
           M  Y+ L  R  +  S    S  +     +    K+  SSSS  T  ++ HRH       
Sbjct: 15  MITYHALVARAGDEKSYKVLSASSLKPGAVCAEPKVRDSSSSGATVPLN-HRHGPCSPVP 73

Query: 105 -----QHSFHARMQRDVKRVATLVRRLSG------GGADAAKHEVQDFGTDVVSGMDQGS 153
                Q +F   ++RD  R   + R+ S       GG   ++  V      +  G    +
Sbjct: 74  SGKKKQPTFTELLRRDQLRANYIQRQFSDEHYPRTGGLQQSEATVP-----IALGSLLNT 128

Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSS 213
            EY + + +GSP  +  M ID+GSD+ W++C+           ++DP  S++++  SCS+
Sbjct: 129 LEYVITVSIGSPAVAXTMFIDTGSDVSWLRCK---------SRLYDPGTSSTYAPFSCSA 179

Query: 214 AVCDRL--ENAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRT---VVKNVAIGCGHK 267
             C +L     GC +G  C Y V YGDGS T GT   +TLT+  T   ++     GC   
Sbjct: 180 PACAQLGRRGTGCSSGSTCVYSVKYGDGSNTTGTYGSDTLTLAGTSEPLISGFQFGCSAV 239

Query: 268 NQGMFV-GAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFG--REALPV 324
             G       GL+GLGG + S V Q     G AFSYCL      SSG L  G    +   
Sbjct: 240 EHGFEEDNTDGLMGLGGDAQSFVSQTAATYGSAFSYCLPPTWN-SSGFLTLGAPSSSTSA 298

Query: 325 GAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLP 384
             +  P++R+ +A +FY + L G+ VGG  + I   +F        G ++D+GT +TRLP
Sbjct: 299 AFSTTPMLRSKQAATFYGLLLRGISVGGKTLEIPSSVF------SAGSIVDSGTVITRLP 352

Query: 385 TPAYEAFRDAF---VAQTGNLPRASGVSIFDTCYNLSGF---VSVRVPTVSFYFSGGPVL 438
             AY A   AF   +A+    P A+   + DTC++ +G     +  VP+V+    GG V+
Sbjct: 353 PTAYGALSAAFRDGMARYQYQP-AAPRGLLDTCFDFTGHGEGNNFTVPSVALVLDGGAVV 411

Query: 439 TLPASNFLIPVDDAGTFCFAFAPSPSG--LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
            L  +     V D    C AFA +       IIGN+QQ   ++ +D      GF P  C
Sbjct: 412 DLHPNGI---VQDG---CLAFAATDDDGRTGIIGNVQQRTFEVLYDVGQSVFGFRPGAC 464


>gi|222619890|gb|EEE56022.1| hypothetical protein OsJ_04800 [Oryza sativa Japonica Group]
          Length = 423

 Score =  161 bits (408), Expect = 7e-37,   Method: Compositional matrix adjust.
 Identities = 109/350 (31%), Positives = 159/350 (45%), Gaps = 38/350 (10%)

Query: 155 EYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSA 214
            Y  R G+G+P ++  + ID  +D  WV C  C+ C   S P F P  S+++  V C S 
Sbjct: 101 NYIARAGLGTPAQTLLVAIDPSNDAAWVPCSACAGC-AASSPSFSPTQSSTYRTVPCGSP 159

Query: 215 VCDRLENAGCHAG---RCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGM 271
            C ++ +  C AG    C + ++Y   ++ +  L  ++L +   VV +   GC     G 
Sbjct: 160 QCAQVPSPSCPAGVGSSCGFNLTYAASTF-QAVLGQDSLALENNVVVSYTFGCLRVVNGN 218

Query: 272 FVGAAGLLGLGG-GSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVP 330
              AAG   L    ++ LV   G                        G    P      P
Sbjct: 219 SRAAAGAHRLRPRAALLLVADQGH----------------------LGPIGQPKRIKTTP 256

Query: 331 LVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEA 390
           L+ NP  PS YYV + G+ VG   + + +       +   G ++D GT  TRL  P Y A
Sbjct: 257 LLYNPHRPSLYYVNMIGIRVGSKVVQVPQSALAFNPVTGSGTIIDAGTMFTRLAAPVYAA 316

Query: 391 FRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVD 450
            RDAF  +    P A  +  FDTCYN    V+V VPTV+F F+G   +TLP  N +I   
Sbjct: 317 VRDAFRGRV-RTPVAPPLGGFDTCYN----VTVSVPTVTFMFAGAVAVTLPEENVMIHSS 371

Query: 451 DAGTFCFAFAPSPS-----GLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
             G  C A A  PS      L+++ ++QQ+  ++ FD ANG VGF   +C
Sbjct: 372 SGGVACLAMAAGPSDGVNAALNVLASMQQQNQRVLFDVANGRVGFSRELC 421


>gi|302794412|ref|XP_002978970.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
 gi|300153288|gb|EFJ19927.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
          Length = 462

 Score =  161 bits (407), Expect = 8e-37,   Method: Compositional matrix adjust.
 Identities = 113/370 (30%), Positives = 187/370 (50%), Gaps = 36/370 (9%)

Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSC-S 212
           GEY+  I +GSP +   +++D+GS++ W++C PC  C    D ++D A S S+  V+C +
Sbjct: 98  GEYYTSIKLGSPGQEAILIVDTGSELTWLKCLPCKVCAPSVDTIYDAARSVSYKPVTCNN 157

Query: 213 SAVCDRLEN---AGCHAG-RCRYEVSYGDGSYTKGTLALETLTIGRTV------VKNVAI 262
           S +C        A C  G +C++   YGDGS++ G+L+ +TL +   V      V++ A 
Sbjct: 158 SQLCSNSSQGTYAYCARGSQCQFAAFYGDGSFSYGSLSTDTLIMETVVGGKPVTVQDFAF 217

Query: 263 GCGHKNQGMF-VGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGT--GSSGSLVFGR 319
           GC   +  +   GA+G+LGL  G M+L  QLG + G  FS+C   R +   S+G + FG 
Sbjct: 218 GCAQGDLELVPTGASGILGLNAGKMALPMQLGQRFGWKFSHCFPDRSSHLNSTGVVFFGN 277

Query: 320 EALP---VGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDT 376
             LP   V    V L  +     FY+V L G+ +       S +L  L +     V++D+
Sbjct: 278 AELPHEQVQYTSVALTNSELQRKFYHVALKGVSIN------SHELVLLPR--GSVVILDS 329

Query: 377 GTAVTRLPTPAYEAFRDAFVA-QTGNLPRASGVSIFD--TCYNLSG----FVSVRVPTVS 429
           G++ +    P +   R+AF+  +  +L    G S  D  TC+ +S      +   +P++S
Sbjct: 330 GSSFSSFVRPFHSQLREAFLKHRPPSLKHLEGDSFGDLGTCFKVSNDDIDELHRTLPSLS 389

Query: 430 FYFSGGPVLTLPASNFLIPV---DDAGTFCFAFAP-SPSGLSIIGNIQQEGIQISFDGAN 485
             F  G  + +P+   L+PV    +    CFAF    P+ +++IGN QQ+ + + +D   
Sbjct: 390 LVFEDGVTIGIPSIGVLLPVARYQNHVKMCFAFEDGGPNPVNVIGNYQQQNLWVEYDIQR 449

Query: 486 GFVGFGPNVC 495
             VGF    C
Sbjct: 450 SRVGFARASC 459


>gi|358345197|ref|XP_003636668.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355502603|gb|AES83806.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 294

 Score =  161 bits (407), Expect = 8e-37,   Method: Compositional matrix adjust.
 Identities = 90/217 (41%), Positives = 123/217 (56%), Gaps = 13/217 (5%)

Query: 155 EYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSA 214
           +Y + + +G+PP   Y   D+GSD++W+QC PC+ CYKQ +P+FD   S++FS ++C S 
Sbjct: 58  DYLMELSIGTPPVKIYAQADTGSDLIWLQCIPCTNCYKQLNPMFDSQSSSTFSNIACGSE 117

Query: 215 VCDRLENAGCHAGR--CRYEVSYGDGSYTKGTLALETLTIGRTV-----VKNVAIGCGHK 267
            C +L +  C   +  C+Y  SY DGS T+G LA ETLT+  T       K V  GCGH 
Sbjct: 118 SCSKLYSTSCSPDQINCKYNYSYVDGSETQGVLAQETLTLTSTTGEPVAFKGVIFGCGHN 177

Query: 268 NQGMFVGAA-GLLGLGGGSMSLVGQLGGQTGG-AFSYCLVSRGTGSSGS--LVFGR--EA 321
           N G F     G++GLG G +SLV Q+G   GG  FS CLV   T  S S  + FG+  E 
Sbjct: 178 NNGAFNDKEMGIIGLGRGPLSLVSQIGSSLGGNMFSQCLVPFNTNPSISSPMSFGKGSEV 237

Query: 322 LPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPIS 358
           L  G    PLV      SFY+V L G+ V  + +P +
Sbjct: 238 LGNGVVSTPLVSKTTYQSFYFVTLLGISVEDINLPFN 274


>gi|15226358|ref|NP_180389.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|4803959|gb|AAD29831.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|330252998|gb|AEC08092.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 756

 Score =  161 bits (407), Expect = 8e-37,   Method: Compositional matrix adjust.
 Identities = 111/343 (32%), Positives = 159/343 (46%), Gaps = 37/343 (10%)

Query: 156 YFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAV 215
           Y +++ VG+PP      ID+GSD++W QC PC  CY Q DP+FDP+ S++F+        
Sbjct: 82  YLMKLQVGTPPFEIAAEIDTGSDLIWTQCMPCPDCYSQFDPIFDPSKSSTFN-------- 133

Query: 216 CDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT-----VVKNVAIGCG----- 265
                   CH   C YE+ Y D +Y+KG LA ET+TI  T     V+    IGCG     
Sbjct: 134 -----EQRCHGKSCHYEIIYEDNTYSKGILATETVTIHSTSGEPFVMAETTIGCGLHNTD 188

Query: 266 HKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVG 325
             N G    ++G++GL  G  SL+ Q+     G  SYC   +GT     + FG  A+  G
Sbjct: 189 LDNSGFASSSSGIVGLNMGPRSLISQMDLPYPGLISYCFSGQGT---SKINFGTNAIVAG 245

Query: 326 AAWVPL-VRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLP 384
              V   +   +   FYY+ L  + V   RI   E L       D  +V+D+G+ VT  P
Sbjct: 246 DGTVAADMFIKKDNPFYYLNLDAVSVEDNRI---ETLGTPFHAEDGNIVIDSGSTVTYFP 302

Query: 385 TPAYEAFRDAF--VAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPA 442
                  R A   V     +P  SG  +   CY  S  + +  P ++ +FSGG  L L  
Sbjct: 303 VSYCNLVRKAVEQVVTAVRVPDPSGNDML--CY-FSETIDI-FPVITMHFSGGADLVLDK 358

Query: 443 SNFLIPVDDAGTFCFA-FAPSPSGLSIIGNIQQEGIQISFDGA 484
            N  +  +  G FC A    SP+  +I GN  Q    + +D +
Sbjct: 359 YNMYMESNSGGLFCLAIICNSPTQEAIFGNRAQNNFLVGYDSS 401



 Score =  154 bits (390), Expect = 9e-35,   Method: Compositional matrix adjust.
 Identities = 107/354 (30%), Positives = 158/354 (44%), Gaps = 37/354 (10%)

Query: 156 YFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAV 215
           Y +++ VG+PP      ID+GSDI+W QC PC  CY Q  P+FDP+ S++F         
Sbjct: 421 YLMKLQVGTPPFEIVAEIDTGSDIIWTQCMPCPNCYSQFAPIFDPSKSSTF--------- 471

Query: 216 CDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT-----VVKNVAIGCGHKN-- 268
                   C+   C YE+ Y D +Y+KG LA ET+TI  T     V+    IGCG  N  
Sbjct: 472 ----REQRCNGNSCHYEIIYADKTYSKGILATETVTIPSTSGEPFVMAETKIGCGLDNTN 527

Query: 269 ---QGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVG 325
               G    ++G++GL  G +SL+ Q+     G  SYC   +GT     + FG  A+  G
Sbjct: 528 LQYSGFASSSSGIVGLNMGPLSLISQMDLPYPGLISYCFSGQGT---SKINFGTNAIVAG 584

Query: 326 AAWVPL-VRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLP 384
              V   +   +   FYY+ L  + V    I      F      D  + +D+GT +T  P
Sbjct: 585 DGTVAADMFIKKDNPFYYLNLDAVSVEDNLIATLGTPFHAE---DGNIFIDSGTTLTYFP 641

Query: 385 TPAYEAFRDAF--VAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPA 442
                  R+A   V     +P     ++   CY  S  + +  P ++ +FSGG  L L  
Sbjct: 642 MSYCNLVREAVEQVVTAVKVPDMGSDNLL--CY-YSDTIDI-FPVITMHFSGGADLVLDK 697

Query: 443 SNFLIPVDDAGTFCFAF-APSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
            N  +     G FC A     PS  ++ GN  Q    + +D ++  + F P  C
Sbjct: 698 YNMYLETITGGIFCLAIGCNDPSMPAVFGNRAQNNFLVGYDPSSNVISFSPTNC 751


>gi|242091327|ref|XP_002441496.1| hypothetical protein SORBIDRAFT_09g028060 [Sorghum bicolor]
 gi|241946781|gb|EES19926.1| hypothetical protein SORBIDRAFT_09g028060 [Sorghum bicolor]
          Length = 466

 Score =  160 bits (406), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 137/450 (30%), Positives = 207/450 (46%), Gaps = 52/450 (11%)

Query: 77  ARWNLELVHRDKMSS-SSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAK 135
           AR  LELV     +S S    +++H H +  S  A  +R         RR +  GA A  
Sbjct: 36  ARPRLELVPAAPGASLSDRARDDLHRHAYIRSQLASSRRG--------RRAAEVGASA-- 85

Query: 136 HEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD 195
                F   + SG   G+G+YFVR  VG+P +   +V D+GSD+ WV+C+          
Sbjct: 86  -----FAMPLSSGAYTGTGQYFVRFRVGTPAQPFVLVADTGSDLTWVKCRGAGAAAGTGA 140

Query: 196 P----VFDPADSASFSGVSCSSAVCD-----RLENAGCHAGRCRYEVSYGDGSYTKGTLA 246
                VF  A S S++ ++CSS  C       L N    A  C Y+  Y DGS  +G + 
Sbjct: 141 GSPARVFRTAASKSWAPIACSSDTCTSYVPFSLANCSSPASPCAYDYRYRDGSAARGVVG 200

Query: 247 LETLTIG----------------RTVVKNVAIGCGHKNQGM-FVGAAGLLGLGGGSMSLV 289
            ++ TI                 R  ++ V +GC     G  F  + G+L LG  ++S  
Sbjct: 201 TDSATIALSSGSGRGGGDSSGGRRAKLQGVVLGCAATYDGQSFQSSDGVLSLGNSNISFA 260

Query: 290 GQLGGQTGGAFSYCLVSR--GTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSG 347
            +   + GG FSYCLV       ++  L FG  A    AA  PL+ + R   FY V +  
Sbjct: 261 SRAAARFGGRFSYCLVDHLAPRNATSYLTFGPGAT-APAAQTPLLLDRRMTPFYAVTVDA 319

Query: 348 LGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASG 407
           + V G  + I  D++ + + G  G ++D+GT++T L TPAY A   A       LPR + 
Sbjct: 320 VYVAGEALDIPADVWDVDRNG--GAILDSGTSLTILATPAYRAVVTALSKHLAGLPRVT- 376

Query: 408 VSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDA-GTFCFAFAP-SPSG 465
           +  F+ CYN +   ++ +P +  +F+G   L  PA +++I  D A G  C      S  G
Sbjct: 377 MDPFEYCYNWTDAGALEIPKMEVHFAGSARLEPPAKSYVI--DAAPGVKCIGVQEGSWPG 434

Query: 466 LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           +S+IGNI Q+     FD  + ++ F    C
Sbjct: 435 VSVIGNILQQEHLWEFDLRDRWLRFKHTRC 464


>gi|302143530|emb|CBI22091.3| unnamed protein product [Vitis vinifera]
          Length = 360

 Score =  160 bits (405), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 103/284 (36%), Positives = 145/284 (51%), Gaps = 17/284 (5%)

Query: 229 CRYEVSYGDGSYTKGTLALETLTIGRTV---------VKNVAIGCGHKNQGMFVGAAGLL 279
           C Y   YGD S T G  ALET T+  T+         V+NV  GCGH N+G+F GAAGLL
Sbjct: 74  CPYYYWYGDSSNTTGDFALETFTVNLTMSSGKPELRRVENVMFGCGHWNRGLFHGAAGLL 133

Query: 280 GLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGS--SGSLVFGREALPVGAA---WVPLVRN 334
           GLG G +S   QL    G +FSYCLV R + +  S  L+FG +   +      +  LV  
Sbjct: 134 GLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDANVSSKLIFGEDKDLLSHPELNFTTLVAG 193

Query: 335 PRAP--SFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFR 392
              P  +FYYV +  + VGG  + I E+ +++   G  G ++D+GT ++    PAY+  +
Sbjct: 194 KENPVDTFYYVQIKSIVVGGEVVNIPEEKWQIATDGSGGTIIDSGTTLSYFAEPAYQVIK 253

Query: 393 DAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDA 452
           +AF+A+    P      + + CYN++G     +P     FS G V   P  N+ I ++  
Sbjct: 254 EAFMAKVKGYPVVKDFPVLEPCYNVTGVEQPDLPDFGIVFSDGAVWNFPVENYFIEIEPR 313

Query: 453 GTFCFA-FAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
              C A     PS LSIIGN QQ+   I +D     +GF P  C
Sbjct: 314 EVVCLAILGTPPSALSIIGNYQQQNFHILYDTKKSRLGFAPTKC 357


>gi|356500756|ref|XP_003519197.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 451

 Score =  160 bits (404), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 133/397 (33%), Positives = 190/397 (47%), Gaps = 33/397 (8%)

Query: 114 RDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVI 173
           +D +RV  L         DA+          + SG   G G Y VR+ +GSP +  +MV+
Sbjct: 72  KDPERVVYL------SSLDASLRRKPISAAPIASGQAFGIGSYVVRVKLGSPNQLFFMVL 125

Query: 174 DSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSG-VSCSSAVCDRLENA-GCH---AGR 228
           D+ +D  WV C  C+ C   S   + P  S ++ G V+C +  C +   A  C    +  
Sbjct: 126 DTSTDEAWVPCTGCTGC-SSSSTYYSPQASTTYGGAVACYAPRCAQARGALPCPYTGSKA 184

Query: 229 CRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSL 288
           C +  SY  GS    TL  ++L +G   + + A GC +   G  + A GLLGLG G +SL
Sbjct: 185 CTFNQSYA-GSTFSATLVQDSLRLGIDTLPSYAFGCVNSASGWTLPAQGLLGLGRGPLSL 243

Query: 289 VGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSG 347
             Q      G FSYCL S + +  SGSL  G    P      PL++NPR PS YYV L+G
Sbjct: 244 PSQSSKLYSGIFSYCLPSFQSSYFSGSLKLGPTGQPRRIRTTPLLQNPRRPSLYYVNLTG 303

Query: 348 LGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASG 407
           + VG +++P+  +          G ++D+GT +TR   P Y A RD F  Q      + G
Sbjct: 304 VTVGRVKVPLPIEYLAFDPNKGSGTILDSGTVITRFVGPVYSAIRDEFRNQVKGPFFSRG 363

Query: 408 VSIFDTCY-----NLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPS 462
              FDTC+     NL+  + +R       F+G  V TLP  N LI     G  C A A +
Sbjct: 364 G--FDTCFVKTYENLTPLIKLR-------FTGLDV-TLPYENTLIHTAYGGMACLAMAAA 413

Query: 463 P----SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           P    S L++I N QQ+ +++ FD  N  VG    +C
Sbjct: 414 PNNVNSVLNVIANYQQQNLRVLFDTVNNRVGIARELC 450


>gi|302790323|ref|XP_002976929.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
 gi|300155407|gb|EFJ22039.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
          Length = 373

 Score =  160 bits (404), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 114/371 (30%), Positives = 183/371 (49%), Gaps = 34/371 (9%)

Query: 158 VRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCD 217
           ++  +G+PPR   +++D+ S++ WVQ   C+ C     P F+P  S+SF    C+S+VC 
Sbjct: 1   MQTKIGTPPREVLLLVDTASELTWVQGTSCTNCSPTKVPPFNPGLSSSFISEPCTSSVCL 60

Query: 218 RLENAGCHA------GRCRYEVSYGDGSYTKGTLALETLTI-----GRTVVKNVAIGCGH 266
                G  +      G C ++V+Y DGS   G +A E  ++       + + +V  GC  
Sbjct: 61  GRSKLGFQSACNRSTGSCSFQVAYLDGSEAYGVIAREIFSLQSWDGAASTLGDVIFGCAS 120

Query: 267 KNQGMFVG-AAGLLGLGGGSMSLVGQLGGQTGGA----FSYCLVSRGT--GSSGSLVFGR 319
           K+    V  ++G LGL  GS S   Q+G ++       FSYC  +R     SSG ++FG 
Sbjct: 121 KDLQRPVDFSSGTLGLNRGSFSFPAQIGSRSKSGLSDRFSYCFPNRAEHLNSSGVIIFGD 180

Query: 320 EALPVGA-AWVPLVRNPRAPS---FYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMD 375
             +P     ++ L + P   S   FYYVGL G+ VGG  + I    F++ ++G+ G   D
Sbjct: 181 SGIPAHHFQYLSLEQEPPIASIVDFYYVGLQGISVGGELLHIPRSAFKIDRLGNGGTYFD 240

Query: 376 TGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIF-DTCYNLSGFVSVRVPT---VSFY 431
           +GT V+ L  PA+ A  +AF  +  +L R SG     + CY+++     R+PT   V+ +
Sbjct: 241 SGTTVSFLVEPAHTALVEAFGRRVLHLNRTSGSDFTKELCYDVAA-GDARLPTAPLVTLH 299

Query: 432 FSGGPVLTLPASNFLIPV---DDAGTFCFAF----APSPSGLSIIGNIQQEGIQISFDGA 484
           F     + L  ++  +P+       T C AF    A +  G+++IGN QQ+   I  D  
Sbjct: 300 FKNNVDMELREASVWVPLARTPQVVTICLAFVNAGAVAQGGVNVIGNYQQQDYLIEHDLE 359

Query: 485 NGFVGFGPNVC 495
              +GF P  C
Sbjct: 360 RSRIGFAPANC 370


>gi|218200805|gb|EEC83232.1| hypothetical protein OsI_28526 [Oryza sativa Indica Group]
          Length = 450

 Score =  159 bits (403), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 111/352 (31%), Positives = 167/352 (47%), Gaps = 55/352 (15%)

Query: 171 MVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGC------ 224
           +++D+GSD+ WVQC+PCS CY Q DP+FDP+ SAS++ V C+++ C+    A        
Sbjct: 124 VIVDTGSDLTWVQCKPCSVCYAQRDPLFDPSGSASYAAVPCNASACEASLKAATGVPGSC 183

Query: 225 ----------HAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVG 274
                      + RC Y ++YGDGS+++G LA +T+ +G   V     GCG  N+     
Sbjct: 184 ATVGGGGGGGKSERCYYSLAYGDGSFSRGVLATDTVALGGASVDGFVFGCGLSNR----- 238

Query: 275 AAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFG------REALPVGAAW 328
             GL   G  + S      G +G A            +GSL  G      R A PV  ++
Sbjct: 239 --GLRRPGSAASSPTASPPGTSGDA------------AGSLSLGGDTSSYRNATPV--SY 282

Query: 329 VPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAY 388
             ++ +P  P FY++ ++G  VGG  +           +G   V++D+GT +TRL    Y
Sbjct: 283 TRMIADPAQPPFYFMNVTGASVGGAAV-------AAAGLGAANVLLDSGTVITRLAPSVY 335

Query: 389 EAFRDAFVAQTG--NLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFL 446
            A R  F  Q G    P A   S+ D CYNL+G   V+VP ++     G  +T+ A+  L
Sbjct: 336 RAVRAEFARQFGAERYPAAPPFSLLDACYNLTGHDEVKVPLLTLRLEAGADMTVDAAGML 395

Query: 447 -IPVDDAGTFCFAFAPS--PSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
            +   D    C A A         IIGN QQ+  ++ +D     +GF    C
Sbjct: 396 FMARKDGSQVCLAMASLSFEDQTPIIGNYQQKNKRVVYDTVGSRLGFADEDC 447


>gi|297826117|ref|XP_002880941.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297326780|gb|EFH57200.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 397

 Score =  159 bits (403), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 112/355 (31%), Positives = 160/355 (45%), Gaps = 38/355 (10%)

Query: 156 YFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAV 215
           Y +R+ +G+PP      ID+GSD++W QC PC  CY Q  P+FDP+ S++F         
Sbjct: 61  YLMRLQLGTPPFEIVAEIDTGSDLIWTQCMPCPNCYTQFAPIFDPSKSSTF--------- 111

Query: 216 CDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT-----VVKNVAIGCGHKNQG 270
               +   CH   C YE+ Y D SY+ G LA ET+TI  T     V+   +IGCG  N  
Sbjct: 112 ----KEKRCHGNSCPYEIIYADESYSTGILATETVTIQSTSGEPFVMAETSIGCGLNNSN 167

Query: 271 MFV-----GAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVG 325
           +        ++G++GL  G  SL+ Q+     G  SYC  S+GT     + FG  A+  G
Sbjct: 168 LMTPGYAASSSGIVGLNMGPSSLISQMDLPIPGLISYCFSSQGT---SKINFGTNAVVAG 224

Query: 326 AAWVPL-VRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLP 384
              V   +   +   FYY+ L  + VG  RI   E L       D  + +D+GT  T LP
Sbjct: 225 DGTVAADMFIKKDQPFYYLNLDAVSVGDKRI---ETLGTPFHAQDGNIFIDSGTTYTYLP 281

Query: 385 TP---AYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLP 441
           T           A V     +P  S  ++   CYN         P ++ +F+GG  L L 
Sbjct: 282 TSYCNLVREAVAASVVAANQVPDPSSENLL--CYNWDTM--EIFPVITLHFAGGADLVLD 337

Query: 442 ASNFLIPVDDAGTFCFAF-APSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
             N  +     GTFC A     PS  +I GN     + + +D +   + F P  C
Sbjct: 338 KYNMYVETITGGTFCLAIGCVDPSMPAIFGNRAHNNLLVGYDSSTLVISFSPTNC 392


>gi|297825301|ref|XP_002880533.1| hypothetical protein ARALYDRAFT_481251 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297326372|gb|EFH56792.1| hypothetical protein ARALYDRAFT_481251 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 430

 Score =  158 bits (399), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 118/370 (31%), Positives = 176/370 (47%), Gaps = 43/370 (11%)

Query: 140 DFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQS--DPV 197
           DF  DV   +   +  +FV   VG PP  Q+ ++D+GS ++W+QC PC  C       PV
Sbjct: 54  DFQVDVHQAIK--TSLFFVNFSVGQPPVPQFTIMDTGSSLLWIQCHPCKHCSSNHMIHPV 111

Query: 198 FDPADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTI----G 253
           F+PA S++F   SC    C    N  C + +C YE  Y  G+ +KG LA E LT     G
Sbjct: 112 FNPALSSTFVECSCDDRFCRYAPNGHCSSNKCVYEQVYISGTGSKGVLAKERLTFTTPNG 171

Query: 254 RTVV-KNVAIGCGHKN-QGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGS 311
            TVV + +A GCGH+N + +     G+LGLG    SL  QLG +    FSYC+      +
Sbjct: 172 NTVVTQPIAFGCGHENGEQLESEFTGILGLGAKPTSLAVQLGSK----FSYCIGDLANKN 227

Query: 312 SG--SLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGD 369
            G   LV G +A  +G    P +        YY+ L G+ VG  ++ I   +F+  +   
Sbjct: 228 YGYNQLVLGEDADILGDP-TP-IEFETENGIYYMNLEGISVGDKQLNIEPVVFK-RRGSR 284

Query: 370 DGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFD-TCYN------LSGFVS 422
            GV++DTGT  T L   AY    +   +     P+       D  CY+      L GF  
Sbjct: 285 TGVILDTGTLYTWLADIAYRELYNEIKSILD--PKLERFWFRDFLCYHGRVNEELIGF-- 340

Query: 423 VRVPTVSFYFSGGPVLTLPASNFLIPVDDAGT----FCFAFAPSPS------GLSIIGNI 472
              P V+F+F+GG  L + A++   P+ ++ T    FC +  P+          + IG +
Sbjct: 341 ---PVVTFHFAGGAELAMEATSMFYPMTESDTYHNVFCMSVRPTTEHGGEYKDFTAIGLM 397

Query: 473 QQEGIQISFD 482
            Q+   I++D
Sbjct: 398 AQQYYNIAYD 407


>gi|326523839|dbj|BAJ96930.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 473

 Score =  157 bits (398), Expect = 8e-36,   Method: Compositional matrix adjust.
 Identities = 122/394 (30%), Positives = 187/394 (47%), Gaps = 40/394 (10%)

Query: 133 AAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYK 192
           A   E   F   + SG   G+G+YFV+  VG+P +   +V D+GSD+ WV+C+       
Sbjct: 87  APMPEASAFAMPLTSGAYTGTGQYFVQFRVGTPAQPFVLVADTGSDLTWVKCRGRRASSP 146

Query: 193 QSDP-----VFDPADSASFSGVSCSSAVCDR---LENAGCHAGR-----CRYEVSYGDGS 239
            + P     VF PA+S S++ + CSS  C        A C AG      C Y+  Y D S
Sbjct: 147 DASPLASPRVFRPANSKSWAPIPCSSDTCKSYVPFSLANCSAGTTPPAPCGYDYRYKDKS 206

Query: 240 YTKGTLALETLTIG--------RTVVKNVAIGCGHKNQGM-FVGAAGLLGLGGGSMSLVG 290
             +G +  +  TI         +  ++ V +GC     G  F  + G+L LG  ++S   
Sbjct: 207 SARGVVGTDAATIALSGSGSDRKAKLQEVVLGCTTSYDGQSFQSSDGVLSLGNSNISFAS 266

Query: 291 QLGGQTGGAFSYCLVSR--GTGSSGSLVFGREALPVGAA----WVPLVRNPRAPSFYYVG 344
           +   + GG FSYCLV       ++  L FG    PVGAA      PL+ + +   FY V 
Sbjct: 267 RAAARFGGRFSYCLVDHLAPRNATSYLTFG----PVGAAHSPSRTPLLLDAQVAPFYAVT 322

Query: 345 LSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPR 404
           +  + V G  + I  +++ + + G  G ++D+GT++T L TPAY+A   A   Q   +PR
Sbjct: 323 VDAVSVAGKALNIPAEVWDVKKNG--GAILDSGTSLTILATPAYKAVVAALSKQLARVPR 380

Query: 405 ASGVSIFDTCYNLSGF-VSVRVPTVSFYFSGGPVLTLPASNFLIPVDDA-GTFCFAFAPS 462
            + +  F+ CYN +       VP +   F+G   L  P  +++I  D A G  C      
Sbjct: 381 VT-MDPFEYCYNWTATRRPPAVPRLEVRFAGSARLRPPTKSYVI--DAAPGVKCIGLQEG 437

Query: 463 P-SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
              G+S+IGNI Q+     FD AN ++ F  + C
Sbjct: 438 VWPGVSVIGNILQQEHLWEFDLANRWLRFQESRC 471


>gi|115483168|ref|NP_001065177.1| Os10g0538200 [Oryza sativa Japonica Group]
 gi|21717168|gb|AAM76361.1|AC074196_19 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433289|gb|AAP54827.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113639786|dbj|BAF27091.1| Os10g0538200 [Oryza sativa Japonica Group]
 gi|215686408|dbj|BAG87693.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 394

 Score =  157 bits (397), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 114/356 (32%), Positives = 169/356 (47%), Gaps = 30/356 (8%)

Query: 155 EYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSA 214
            Y     +G+PP+    VID   ++VW QC+ CS+C++Q  P+FDP  S ++    C + 
Sbjct: 50  NYVANFTIGTPPQPASAVIDLAGELVWTQCKQCSRCFEQDTPLFDPTASNTYRAEPCGTP 109

Query: 215 VCDRL--ENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGC-GHKNQGM 271
           +C+ +  ++  C    C Y+ S   G  T G +  +T  +G T   ++A GC    +   
Sbjct: 110 LCESIPSDSRNCSGNVCAYQASTNAGD-TGGKVGTDTFAVG-TAKASLAFGCVVASDIDT 167

Query: 272 FVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVG---AAW 328
             G +G++GLG    SLV Q G     AFSYCL     G + +L  G  A   G   AA 
Sbjct: 168 MGGPSGIVGLGRTPWSLVTQTG---VAAFSYCLAPHDAGRNSALFLGSSAKLAGGGKAAS 224

Query: 329 VPLV----RNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLP 384
            P V          ++Y V L GL  G   IP+              V++DT + ++ L 
Sbjct: 225 TPFVNISGNGNDLSNYYKVQLEGLKAGDAMIPLPPS--------GSTVLLDTFSPISFLV 276

Query: 385 TPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASN 444
             AY+A + A  A  G  P A+ V  FD C+  SG  S   P + F F GG  +T+PA+N
Sbjct: 277 DGAYQAVKKAVTAAVGAPPMATPVEPFDLCFPKSG-ASGAAPDLVFTFRGGAAMTVPATN 335

Query: 445 FLIPVDDAGTFCFAFAPSP-----SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           +L+   + GT C A   S      + LS++G++QQE I   FD     + F P  C
Sbjct: 336 YLLDYKN-GTVCLAMLSSARLNSTTELSLLGSLQQENIHFLFDLDKETLSFEPADC 390


>gi|326499093|dbj|BAK06037.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 471

 Score =  157 bits (397), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 147/462 (31%), Positives = 208/462 (45%), Gaps = 83/462 (17%)

Query: 79  WNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARM----QRDVKRVATLVR---RLSGGGA 131
           +++E +HRD   S         +H    +  AR+    +R   R A L R   R+    A
Sbjct: 35  FSVEFIHRDSARSP--------FHDPSLTAPARVLEAARRSTVRAAALSRSYVRVDAPSA 86

Query: 132 DAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQ-----P 186
           D             VS +     EY + + +G+PP     + D+GSD++W+ C      P
Sbjct: 87  DG-----------FVSELTSTPFEYLMAVNIGTPPTRMVAIADTGSDLIWLNCSYGGDGP 135

Query: 187 CSQCYKQSDP-----VFDPADSASFSGVSCSSAVCDRLENAGCHA-GRCRYEVSYGDGSY 240
                + +D       FDP+ S +F  V C S  C  L  A C A  +CRY  SYGDGS+
Sbjct: 136 GLAAARDADAQPPGVQFDPSKSTTFRLVDCDSVACSELPEASCGADSKCRYSYSYGDGSH 195

Query: 241 TKGTLALETLTIG----------RTVVKNVAIGCGHKNQGMFVGAA---GLLGLGGGSMS 287
           T G L+ ET T             T V NV  GC       FVG++   GL+GLGGG +S
Sbjct: 196 TSGVLSTETFTFADAPGARGDGTTTRVANVNFGCSTT----FVGSSVGDGLVGLGGGDLS 251

Query: 288 LVGQLGGQT--GGAFSYCLVSRGTGSSGSLVFGREALPV--GAAWVPLVRNPRAPSFYYV 343
           LV QLG  T  G  FSYCLV     +S +L FG  A     GA   PL+ + +  ++Y V
Sbjct: 252 LVSQLGADTSLGRRFSYCLVPYSVKASSALNFGPRAAVTDPGAVTTPLIPS-QVKAYYIV 310

Query: 344 GLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQ-TGNL 402
            L  + VG       +            +++D+GT +T LP    EA  D  V + TG +
Sbjct: 311 ELRSVKVGNKTFEAPD---------RSPLIVDSGTTLTFLP----EALVDPLVKELTGRI 357

Query: 403 ---PRASGVSIFDTCYNLSGF----VSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTF 455
              P  S   +   C+++SG     V+  +P V+    GG  +TL A N  + V + GT 
Sbjct: 358 KLPPAQSPERLLPLCFDVSGVREGQVAAMIPDVTVGLGGGAAVTLKAENTFVEVQE-GTL 416

Query: 456 CFAFAPSPSGL--SIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           C A +        SIIGNI Q+ + + +D   G V F P  C
Sbjct: 417 CLAVSAMSEQFPASIIGNIAQQNMHVGYDLDKGTVTFAPAAC 458


>gi|413947545|gb|AFW80194.1| hypothetical protein ZEAMMB73_386053 [Zea mays]
          Length = 456

 Score =  157 bits (396), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 128/377 (33%), Positives = 179/377 (47%), Gaps = 36/377 (9%)

Query: 129 GGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS 188
           GGA  A   V +    V S +   S EY + + VG+PP     + D+GSD+VWV C    
Sbjct: 73  GGASPAPGPVPEADGGVESKIITRSFEYLMYVNVGTPPAQMLAIADTGSDLVWVNCSSNG 132

Query: 189 QCYKQSD--PVFDPADSASFSGVSCSSAVCDRLENAGCHA-GRCRYEVSYGDGSYTKGTL 245
                SD   VF P+ S ++S +SC SA C  L  A C A   C+Y+ +YGDGS T G L
Sbjct: 133 GGGGASDGAVVFHPSRSTTYSLLSCQSAACQALSQASCDADSECQYQYAYGDGSRTIGVL 192

Query: 246 ALETLTI--------GRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTG 297
           + ET +         G+  V  V+ GC   + G F  + GL+GLG G++SLV QLG    
Sbjct: 193 STETFSFAAAGGGGEGQVRVPRVSFGCSTGSAGSF-RSDGLVGLGAGALSLVSQLGAAAR 251

Query: 298 GA--FSYCLVS--RGTGSSGSLVFGREAL--PVGAAWVPLVRNPRAPSFYYVGLSGLGVG 351
            A  FSYCLV       SS +L FG  A+    GAA  PLV +    S+Y V L  + V 
Sbjct: 252 IARRFSYCLVPPYAAANSSSTLSFGARAVVSDPGAASTPLVPS-EVDSYYTVALESVAVA 310

Query: 352 GMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVS-I 410
           G  +                +++D+GT +T L  PA      A + +   LPRA     +
Sbjct: 311 GQDV---------ASANSSRIIVDSGTTLTFL-DPALLRPLVAELERRIRLPRAQPPEQL 360

Query: 411 FDTCYNLSGFVSVR---VPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAP--SPSG 465
              CY++ G        +P V+  F GG  +TL   N    +++ GT C    P      
Sbjct: 361 LQLCYDVQGKSQAEDFGIPDVTLRFGGGASVTLRPENTFSLLEE-GTLCLVLVPVSESQP 419

Query: 466 LSIIGNIQQEGIQISFD 482
           +SI+GNI Q+   + +D
Sbjct: 420 VSILGNIAQQNFHVGYD 436


>gi|356556809|ref|XP_003546713.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-1-like [Glycine max]
          Length = 444

 Score =  157 bits (396), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 131/439 (29%), Positives = 189/439 (43%), Gaps = 29/439 (6%)

Query: 64  HNNISSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLV 123
           HN    +    D     L++ H     S    +  M +        A+ Q  ++ ++ LV
Sbjct: 27  HNPKCDAAYQHDHDGSTLQVFHVFSPCSPFRPSKPMSWEESVLQLQAKDQARMQYLSNLV 86

Query: 124 RRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQ 183
            R S     + +   Q             S  Y VR   G+P ++  + +D+ +D  WV 
Sbjct: 87  ARRSIVPIASGRQITQ-------------SPTYIVRAKFGTPAQTLLLAMDTSNDAAWVP 133

Query: 184 CQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKG 243
           C  C  C   +   F P  S +F  V C ++ C ++ N  C    C +  +YG  S    
Sbjct: 134 CTACVGCSTTTP--FAPPKSTTFKKVGCGASQCKQVRNPTCDGSACAFNFTYGTSS-VAA 190

Query: 244 TLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYC 303
           +L  +T+T+    V     GC  K  G  +   GLLGLG G +SL+ Q        FSYC
Sbjct: 191 SLVQDTVTLATDPVPAYTFGCIQKATGSSLPPQGLLGLGRGPLSLLAQTQKLYQSTFSYC 250

Query: 304 LVSRGTGS-SGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLF 362
           L S  T + SG       A P    + P  +NPR  S YYV L  + VG   + I  +  
Sbjct: 251 LPSFKTLNFSGHXDLXPVAQPRDQVY-PSFKNPRRSSLYYVNLVAIRVGRRIVDIPPEAL 309

Query: 363 RLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSI--FDTCYNLSGF 420
                   G V D+GT  TRL  PAY A R+ F  +     + +  S+  FDTCY     
Sbjct: 310 AFNPXTGAGTVFDSGTVFTRLVEPAYTAVRNEFRRRVSVHKKLTVTSLGGFDTCYT---- 365

Query: 421 VSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSP----SGLSIIGNIQQEG 476
           V +  PT++F FSG  V TLP  N LI        C A AP+P    S L++I N+QQ+ 
Sbjct: 366 VPIVAPTITFMFSGMNV-TLPPDNILIHSTAGSVTCLAMAPAPDNVNSVLNVIANMQQQN 424

Query: 477 IQISFDGANGFVGFGPNVC 495
            ++ FD  N  +G    +C
Sbjct: 425 HRVLFDVPNSRLGVARELC 443


>gi|388516465|gb|AFK46294.1| unknown [Medicago truncatula]
          Length = 434

 Score =  156 bits (394), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 117/352 (33%), Positives = 175/352 (49%), Gaps = 20/352 (5%)

Query: 145 VVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSA 204
           + SG     G Y VR+ +G+P +  +MV+D+ +D  ++   P S C   S   F P  S 
Sbjct: 87  IASGQAFNIGNYIVRVKIGTPGQLLFMVLDTSTDEAFI---PSSGCIGCSATTFSPNAST 143

Query: 205 SFSGVSCSSAVCDRLENAGCHA---GRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVA 261
           S+  + CS   C ++    C A   G C +  SY   +Y+  TL  ++L +   V+ + +
Sbjct: 144 SYVPLECSVPQCSQVRGLSCPATGSGACSFNKSYAGSTYS-ATLVQDSLRLATDVIPSYS 202

Query: 262 IGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGRE 320
            G  +   G  + A GLLGLG G +SL+ Q G    G FSYCL S +    SGSL  G  
Sbjct: 203 FGSINAISGSSIPAQGLLGLGRGPLSLLSQTGSLYSGVFSYCLPSFKSYYFSGSLKLGPV 262

Query: 321 ALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAV 380
             P      PL+RNPR PS Y+V L+G+ VG + +P  ++L         G ++D+GT +
Sbjct: 263 GQPKSIRTTPLLRNPRRPSLYFVNLTGITVGKVNVPFPKELLAFDVNTGSGTIIDSGTVI 322

Query: 381 TRLPTPAYEAFRDAFVAQ-TGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLT 439
           TR   P Y A RD F  Q TG     S +  FDTC+ +  + ++  P ++ +F+    L 
Sbjct: 323 TRFVEPVYNAVRDEFRKQVTGPF---SSLGAFDTCF-VKNYETL-APAITLHFTDLD-LK 376

Query: 440 LPASNFLIPVDDAGTFCFAFAPSPSG-----LSIIGNIQQEGIQISFDGANG 486
           LP  N LI        C A A +P       L++I N QQ+ +++ FD  N 
Sbjct: 377 LPLENSLIHSSSGSLACLAMASTPKNVNYTVLNVIANYQQQNLRVLFDTVNN 428


>gi|84453222|dbj|BAE71208.1| hypothetical protein [Trifolium pratense]
 gi|84453226|dbj|BAE71210.1| hypothetical protein [Trifolium pratense]
          Length = 437

 Score =  156 bits (394), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 123/360 (34%), Positives = 176/360 (48%), Gaps = 19/360 (5%)

Query: 145 VVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSA 204
           + SG     G Y VR+ +G+P +  +MV+D+ +D  +V   P S C   S   F P  S 
Sbjct: 87  IASGQTFNIGNYVVRVKIGTPGQLLFMVLDTSTDEAFV---PSSGCIGCSATTFYPNVST 143

Query: 205 SFSGVSCSSAVCDRLENAGCHA---GRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVA 261
           SF  + CS   C ++    C A   G C +  SY  GS    TL  ++L +   V+ + +
Sbjct: 144 SFVPLDCSVPQCGQVRGLSCPATGSGACSFNQSYA-GSTFSATLVQDSLRLATDVIPSYS 202

Query: 262 IGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGRE 320
            G  +   G  V A GLLGLG G +SL+ Q G    G FSYCL S +    SGSL  G  
Sbjct: 203 FGSINAISGSSVPAQGLLGLGRGPLSLLSQSGAIYSGVFSYCLPSFKSYYFSGSLKLGPV 262

Query: 321 ALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAV 380
             P      PL+ NP  PS YYV L+ + VG + +P+  +L         G ++D+GT +
Sbjct: 263 GQPKSIRTTPLLHNPHRPSLYYVNLTAISVGRVYVPLPSELLAFNPSTGAGTIIDSGTVI 322

Query: 381 TRLPTPAYEAFRDAFVAQ-TGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLT 439
           TR   P Y A RD F  Q TG     S +  FDTC+ +  + ++  P ++ +F+    L 
Sbjct: 323 TRFVEPIYNAVRDEFRKQVTGPF---SSLGAFDTCF-VKNYETL-APAITLHFTDLD-LK 376

Query: 440 LPASNFLIPVDDAGTFCFAFAPSPSG----LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           LP  N LI        C A A +PS     L++I N QQ+ +++ FD  N  VG    +C
Sbjct: 377 LPLENSLIHSSSGSLACLAMAAAPSNVNSVLNVIANFQQQNLRVLFDTVNNKVGIARELC 436


>gi|296082172|emb|CBI21177.3| unnamed protein product [Vitis vinifera]
          Length = 376

 Score =  155 bits (392), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 82/234 (35%), Positives = 128/234 (54%), Gaps = 12/234 (5%)

Query: 65  NNISSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVR 124
           +++ S +   D+ R +LE++H+    S  +        R Q      + +D  RV ++  
Sbjct: 52  SSVCSPSPKGDDKRASLEVIHKHGPCSKLSQDKGRSPSRTQM-----LDQDESRVNSIRS 106

Query: 125 RLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQC 184
           RL+   AD  K +         SG   G+G Y V +G+G+P R    + D+GSD+ W QC
Sbjct: 107 RLAKNPADGGKLKGSKVTLPSKSGSTIGTGNYVVTVGLGTPKRDLTFIFDTGSDLTWTQC 166

Query: 185 QPCSQ-CYKQSDPVFDPADSASFSGVSCSSAVCDRLENA-----GCHAGRCRYEVSYGDG 238
           +PC++ CY Q +P+F+P+ S S++ +SCSS  CD L++       C A  C Y + YGD 
Sbjct: 167 EPCARYCYHQQEPIFNPSKSTSYTNISCSSPTCDELKSGTGNSPSCSASTCVYGIQYGDQ 226

Query: 239 SYTKGTLALETLTIGRT-VVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQ 291
           SY+ G  A + L +  T V  N   GCG  N+G+FVG AGL+GLG  ++SL+ +
Sbjct: 227 SYSVGFFAQDKLALTSTDVFNNFLFGCGQNNRGLFVGVAGLIGLGRNALSLMSK 280



 Score = 66.2 bits (160), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 34/95 (35%), Positives = 53/95 (55%), Gaps = 3/95 (3%)

Query: 403 PRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFA-- 460
           P+A+  SI DTCY+ S + +V VP ++ YFS G  + L  S     + +    C AFA  
Sbjct: 282 PKAAPASILDTCYDFSQYDTVDVPKINLYFSDGAEMDLDPSGIFY-ILNISQVCLAFAGN 340

Query: 461 PSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
              + ++I+GN+QQ+   + +D A G +GF P  C
Sbjct: 341 SDATDIAILGNVQQKTFDVVYDVAGGRIGFAPGGC 375


>gi|356505293|ref|XP_003521426.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 499

 Score =  155 bits (391), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 124/421 (29%), Positives = 188/421 (44%), Gaps = 45/421 (10%)

Query: 103 RHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGV 162
             Q    A   RD  R   +++ + GG  D +     D             G YF ++ +
Sbjct: 39  NQQVELEALRARDRARHGRILQGVVGGVVDFSVQGTSD---------PYFVGLYFTKVKL 89

Query: 163 GSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFSGVSCSSAVCD 217
           GSP +  Y+ ID+GSDI+W+ C  CS C   S        FD A S++ + VSC+  +C 
Sbjct: 90  GSPAKDFYVQIDTGSDILWINCITCSNCPHSSGLGIELDFFDTAGSSTAALVSCADPICS 149

Query: 218 ---RLENAGC--HAGRCRYEVSYGDGS-----YTKGTLALETLTIGRTVVKN----VAIG 263
              +   +GC   A +C Y   YGDGS     Y   T+  +T+ +G+++V N    +  G
Sbjct: 150 YAVQTATSGCSSQANQCSYTFQYGDGSGTTGYYVSDTMYFDTVLLGQSMVANSSSTIVFG 209

Query: 264 CGHKNQGMFV----GAAGLLGLGGGSMSLVGQLG--GQTGGAFSYCLVSRGTGSSGSLVF 317
           C     G          G+ G G G++S++ QL   G T   FS+CL   G    G LV 
Sbjct: 210 CSTYQSGDLTKTDKAVDGIFGFGPGALSVISQLSSRGVTPKVFSHCL-KGGENGGGVLVL 268

Query: 318 GREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTG 377
           G E L     + PLV  P  P  Y + L  + V G  +PI  ++F  T   + G ++D+G
Sbjct: 269 G-EILEPSIVYSPLV--PSLPH-YNLNLQSIAVNGQLLPIDSNVFATTN--NQGTIVDSG 322

Query: 378 TAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPV 437
           T +  L   AY  F DA  A      +   +S  + CY +S  V    P VS  F GG  
Sbjct: 323 TTLAYLVQEAYNPFVDAITAAVSQFSKPI-ISKGNQCYLVSNSVGDIFPQVSLNFMGGAS 381

Query: 438 LTLPASNFLIP---VDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNV 494
           + L   ++L+    +D A  +C  F     G +I+G++  +     +D AN  +G+    
Sbjct: 382 MVLNPEHYLMHYGFLDSAAMWCIGFQKVERGFTILGDLVLKDKIFVYDLANQRIGWADYN 441

Query: 495 C 495
           C
Sbjct: 442 C 442


>gi|357487593|ref|XP_003614084.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355515419|gb|AES97042.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 412

 Score =  154 bits (389), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 127/445 (28%), Positives = 204/445 (45%), Gaps = 76/445 (17%)

Query: 72  TSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVR------R 125
           T +    +N+EL+H   +SS S              F+   +  ++R+++++       R
Sbjct: 20  TKTQNHGFNVELIH--PISSRS-------------PFYNPKETQIQRISSILNYSINRVR 64

Query: 126 LSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQ 185
                   + +++QD    + S M  G   Y +   +G+PP   Y +ID+G+D +W QC+
Sbjct: 65  YLNHVFSFSPNKIQD--VPLSSFMGAG---YVMSYSIGTPPFQLYSLIDTGNDNIWFQCK 119

Query: 186 PCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTL 245
           PC  C  Q+ P+F P+ S+++  + C+S +C   +NA              DG Y    L
Sbjct: 120 PCKPCLNQTSPMFHPSKSSTYKTIPCTSPIC---KNA--------------DGHY----L 158

Query: 246 ALETLTIGRT-----VVKNVAIGCGHKNQGMFVG-AAGLLGLGGGSMSLVGQLGGQTGGA 299
            ++TLT+          KN+ IGCGH+NQG   G  +G +GL  G +S + QL    GG 
Sbjct: 159 GVDTLTLNSNNGTPISFKNIVIGCGHRNQGPLEGYVSGNIGLARGPLSFISQLNSSIGGK 218

Query: 300 FSYCLVSRGTGS--SGSLVFGREALP--VGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRI 355
           FSYCLV   +    S  L FG ++    +G    P+    +  + Y+V L    VG    
Sbjct: 219 FSYCLVPLFSKENVSSKLHFGDKSTVSGLGTVSTPI----KEENGYFVSLEAFSVG---- 270

Query: 356 PISEDLFRLTQMGDDG-VVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVS-IFDT 413
              + + +L    + G  ++D+GT +T LP   Y    ++ V     L R    S  F+ 
Sbjct: 271 ---DHIIKLENSDNRGNSIIDSGTTMTILPKDVYSRL-ESVVLDMVKLKRVKDPSQQFNL 326

Query: 414 CYN-LSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAP--SPSGLSIIG 470
           CY   S  +  +V  ++ +FSG  V  L A N   P+ D    CFAF    + S L+I G
Sbjct: 327 CYQTTSTTLLTKVLIITAHFSGSEV-HLNALNTFYPITDE-VICFAFVSGGNFSSLAIFG 384

Query: 471 NIQQEGIQISFDGANGFVGFGPNVC 495
           N+ Q+   + FD     + F P  C
Sbjct: 385 NVVQQNFLVGFDLNKKTISFKPTDC 409


>gi|115483166|ref|NP_001065176.1| Os10g0537800 [Oryza sativa Japonica Group]
 gi|21717159|gb|AAM76352.1|AC074196_10 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433285|gb|AAP54823.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113639785|dbj|BAF27090.1| Os10g0537800 [Oryza sativa Japonica Group]
 gi|215692411|dbj|BAG87831.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 394

 Score =  154 bits (389), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 113/356 (31%), Positives = 166/356 (46%), Gaps = 30/356 (8%)

Query: 155 EYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSA 214
            Y     +G+PP+    VID   ++VW QC+ C +C++Q  P+FDP  S ++    C + 
Sbjct: 50  NYVANFTIGTPPQPASAVIDLAGELVWTQCKQCGRCFEQGTPLFDPTASNTYRAEPCGTP 109

Query: 215 VCDRLEN--AGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGC-GHKNQGM 271
           +C+ + +    C    C YE S   G  T G +  +T  +G T   ++A GC    +   
Sbjct: 110 LCESIPSDVRNCSGNVCAYEASTNAGD-TGGKVGTDTFAVG-TAKASLAFGCVVASDIDT 167

Query: 272 FVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVG---AAW 328
             G +G++GLG    SLV Q G     AFSYCL     G + +L  G  A   G   AA 
Sbjct: 168 MGGPSGIVGLGRTPWSLVTQTG---VAAFSYCLAPHDAGKNSALFLGSSAKLAGGGKAAS 224

Query: 329 VPLV----RNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLP 384
            P V          ++Y V L GL  G   IP+              V++DT + ++ L 
Sbjct: 225 TPFVNISGNGNDLSNYYKVQLEGLKAGDAMIPLPPS--------GSTVLLDTFSPISFLV 276

Query: 385 TPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASN 444
             AY+A + A     G  P A+ V  FD C+  SG  S   P + F F GG  +T+PA+N
Sbjct: 277 DGAYQAVKKAVTVAVGAPPMATPVEPFDLCFPKSG-ASGAAPDLVFTFRGGAAMTVPATN 335

Query: 445 FLIPVDDAGTFCFAFAPSP-----SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           +L+   + GT C A   S      + LS++G++QQE I   FD     + F P  C
Sbjct: 336 YLLDYKN-GTVCLAMLSSARLNSTTELSLLGSLQQENIHFLFDLDKETLSFEPADC 390


>gi|296087864|emb|CBI35120.3| unnamed protein product [Vitis vinifera]
          Length = 320

 Score =  154 bits (389), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 114/328 (34%), Positives = 165/328 (50%), Gaps = 14/328 (4%)

Query: 173 IDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCRYE 232
           +D+ SD+ W+   PC+ C   S  +F+   S ++  + C +A C ++    C  G C + 
Sbjct: 1   MDTSSDVAWI---PCNGCLGCSSTLFNSPASTTYKSLGCQAAQCKQVPKPTCGGGVCSFN 57

Query: 233 VSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQL 292
           ++YG GS     L+ +T+T+    V   + GC  K  G  + A GLLGLG G +SL+ Q 
Sbjct: 58  LTYG-GSSLAANLSQDTITLATDAVPGYSFGCIQKATGGSLPAQGLLGLGRGPLSLLSQT 116

Query: 293 GGQTGGAFSYCLVS-RGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVG 351
                  FSYCL S +    SGSL  G    P    + PL++NPR PS Y+V L  + VG
Sbjct: 117 QNLYQSTFSYCLPSFKSLNFSGSLRLGPVGQPKRIKYTPLLKNPRRPSLYFVNLMAVRVG 176

Query: 352 GMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIF 411
              + +    F        G + D+GT  TRL TPAY A RDAF  + G     + +  F
Sbjct: 177 RRVVDVPPGSFTFNPSTGAGTIFDSGTVFTRLVTPAYIAVRDAFRNRVGRNLTVTSLGGF 236

Query: 412 DTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSP----SGLS 467
           DTCY     V +  PT++F F+G  V TLP  N LI      T C A A +P    S L+
Sbjct: 237 DTCYT----VPIAAPTITFMFTGMNV-TLPPDNLLIHSTAGSTTCLAMAAAPDNVNSVLN 291

Query: 468 IIGNIQQEGIQISFDGANGFVGFGPNVC 495
           +I N+QQ+  ++ +D  N  +G    +C
Sbjct: 292 VIANLQQQNHRLLYDVPNSRLGVARELC 319


>gi|125548492|gb|EAY94314.1| hypothetical protein OsI_16081 [Oryza sativa Indica Group]
          Length = 417

 Score =  154 bits (388), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 109/344 (31%), Positives = 168/344 (48%), Gaps = 44/344 (12%)

Query: 109 HARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVV--SGMDQGSGEYFVRIGVGSPP 166
           H  ++R ++R      RL+G G   A+ E       VV  + +    GEY V++G+G+PP
Sbjct: 45  HELLRRAIQRSRY---RLAGIGM--ARGEAASARKAVVAETPIMPAGGEYLVKLGIGTPP 99

Query: 167 RSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGC-- 224
                 ID+ SD++W QCQPC+ CY Q DP+F+P  S++++ + CSS  CD L+   C  
Sbjct: 100 YKFTAAIDTASDLIWTQCQPCTGCYHQVDPMFNPRVSSTYAALPCSSDTCDELDVHRCGH 159

Query: 225 -HAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQG--MFVGAAGLLGL 281
                C+Y  +Y   + T+GTLA++ L IG    + VA GC   + G      A+G++GL
Sbjct: 160 DDDESCQYTYTYSGNATTEGTLAVDKLVIGEDAFRGVAFGCSTSSTGGAPPPQASGVVGL 219

Query: 282 GGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAW----VPLVRNPRA 337
           G G +SLV QL  +    F+YCL    +   G LV G +A     A     VP+ R+PR 
Sbjct: 220 GRGPLSLVSQLSVRR---FAYCLPPPASRIPGKLVLGADADAARNATNRIAVPMRRDPRY 276

Query: 338 PSFYYVGLSGLGVGGMRIPISEDLFRL--------------------TQMGDD---GVVM 374
           PS+YY+ L GL +G   + +                             +GD    G+++
Sbjct: 277 PSYYYLNLDGLLIGDRTMSLPPTTTTTATATATATAPAPTPSPNATAVAVGDANRYGMII 336

Query: 375 DTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSI-FDTCYNL 417
           D  + +T L    Y+   +    +   LPR +G S+  D C+ L
Sbjct: 337 DIASTITFLEASLYDELVNDLEVEI-RLPRGTGSSLGLDLCFIL 379


>gi|125532788|gb|EAY79353.1| hypothetical protein OsI_34482 [Oryza sativa Indica Group]
          Length = 394

 Score =  154 bits (388), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 113/356 (31%), Positives = 167/356 (46%), Gaps = 30/356 (8%)

Query: 155 EYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSA 214
            Y     +G+PP+    VID   ++VW QC+ CS+C++Q  P+FDP  S ++    C + 
Sbjct: 50  NYVANFTIGTPPQPASAVIDLAGELVWTQCKQCSRCFEQDTPLFDPTASNTYRAEPCGTP 109

Query: 215 VCDRL--ENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGC-GHKNQGM 271
           +C+ +  ++  C    C Y+ S   G  T G +  +T  +G T   ++A GC    +   
Sbjct: 110 LCESIPSDSRNCSGNVCAYQASTNAGD-TGGKVGTDTFAVG-TAKASLAFGCVVASDIDT 167

Query: 272 FVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVG---AAW 328
             G +G++GLG    SLV Q G     AFSYCL     G + +L  G  A   G   AA 
Sbjct: 168 MGGPSGIVGLGRTPWSLVTQTG---VAAFSYCLAPHDAGKNSALFLGSSAKLAGGGKAAS 224

Query: 329 VPLV----RNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLP 384
            P V          ++Y V L GL  G   IP+              V++DT + ++ L 
Sbjct: 225 TPFVNISGNGNDLSNYYKVQLEGLKAGDAMIPLPPS--------GSTVLLDTFSPISFLV 276

Query: 385 TPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASN 444
             AY+A + A     G  P A+ V  FD C+  SG  S   P + F F GG  +T+ ASN
Sbjct: 277 DGAYQAVKKAVTVAVGAPPMATPVEPFDLCFPKSG-ASGAAPDLVFTFRGGAAMTVAASN 335

Query: 445 FLIPVDDAGTFCFAFAPSP-----SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           +L+   + GT C A   S      + LS++G++QQE I   FD     + F P  C
Sbjct: 336 YLLDYKN-GTVCLAMLSSARLNSTTELSLLGSLQQENIHFLFDLDKETLSFEPADC 390


>gi|125536419|gb|EAY82907.1| hypothetical protein OsI_38120 [Oryza sativa Indica Group]
          Length = 448

 Score =  153 bits (387), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 112/322 (34%), Positives = 158/322 (49%), Gaps = 23/322 (7%)

Query: 142 GTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPA 201
           GT       Q  G+Y ++  +G PP   +  +D+GSD++WV+C PC+ C     P++DPA
Sbjct: 73  GTKAPVTKSQKGGKYIMQFSIGEPPLLIWAEVDTGSDLMWVKCSPCNGCNPPPSPLYDPA 132

Query: 202 DSASFSGVSCSSAVCDRLENAGCHAGRCR-------YEVSYG-DGSY-TKGTLALETLTI 252
            S S   + CSS +C  L      + +C        Y  +YG  G + T+G L  ET T 
Sbjct: 133 RSRSSGKLPCSSQLCQALGRGRIISDQCSDDPPLCGYHYAYGHSGDHSTQGVLGTETFTF 192

Query: 253 GR-TVVKNVAIGCGHKNQG-MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTG 310
           G   V  NV+ G      G  F G AGL+GLG G +SLV QLG    G F+YCL +    
Sbjct: 193 GDGYVANNVSFGRSDTIDGSQFGGTAGLVGLGRGHLSLVSQLG---AGRFAYCLAADPNV 249

Query: 311 SSGSLVFGREALPVGAAWV---PLVRNPRA--PSFYYVGLSGLGVGGMRIPISEDLFRLT 365
            S  L     AL   A  V   PLV NP+    + YYV L G+ VGG R+PI +  F + 
Sbjct: 250 YSTILFGSLAALDTSAGDVSSTPLVTNPKPDRDTHYYVNLQGISVGGSRLPIKDGTFAIN 309

Query: 366 QMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSV-R 424
             G  GV  D+G   T L   AY+  R A  ++   L   +G    DTC+  +   +V +
Sbjct: 310 SDGSGGVFFDSGAIDTSLKDAAYQVVRQAITSEIQRLGYDAG---DDTCFVAANQQAVAQ 366

Query: 425 VPTVSFYFSGGPVLTLPASNFL 446
           +P +  +F  G  ++L   N+L
Sbjct: 367 MPPLVLHFDDGADMSLNGRNYL 388


>gi|255563741|ref|XP_002522872.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223537956|gb|EEF39570.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 448

 Score =  153 bits (387), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 122/436 (27%), Positives = 188/436 (43%), Gaps = 28/436 (6%)

Query: 81  LELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQD 140
           L L+H         +   +H +  +  F+     D +R+  LV        + A      
Sbjct: 15  LSLIHFAISKPDGFSLEIVHRYSRESPFYPGNITDYERITRLVELSKIRAHNLAITTSSG 74

Query: 141 FGTDVVS-GMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFD 199
           F  +     + Q    Y V++ +GSP    Y+V D+GS + W QC+PC++ ++Q  P+F+
Sbjct: 75  FSPEAFRLRISQDDTCYLVKVIIGSPGVPLYLVPDTGSGLFWTQCEPCTRRFRQLPPIFN 134

Query: 200 PADSASFSGVSCSSAVCDRLENA-GCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVK 258
              S ++  + C    C   +N   C   +C Y ++Y  GS T G  A + L        
Sbjct: 135 STASRTYRDLPCQHQFCTNNQNVFQCRDDKCVYRIAYAGGSATAGVAAQDILQSAENDRI 194

Query: 259 NVAIGCGHKNQGM-----FVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCL----VSRGT 309
               GC   NQ            G++GL    +SL+ Q+   T   FSYCL    +S  +
Sbjct: 195 PFYFGCSRDNQNFSTFESSGKGGGIIGLNMSPVSLLQQMNHITKNRFSYCLNLFDLSSPS 254

Query: 310 GSSGSLVFGREALPVGAAWV--PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQM 367
            ++  L FG +       ++  P V +PR    Y++ L  + V G R+ I    F L   
Sbjct: 255 HATSLLRFGNDIRKSRRKYLSTPFV-SPRGMPNYFLNLIDVSVAGNRMQIPPGTFALKPD 313

Query: 368 GDDGVVMDTGTAVTRLPTPAY----EAFRDAFVA---QTGNLPRASGVSIFDTCYNLSGF 420
           G  G ++D+GTAVT +   AY     AF++ F     Q  N+ + SG      CY   G 
Sbjct: 314 GTGGTIIDSGTAVTYISQTAYFPVITAFKNYFDQHGFQRVNI-QLSGY----ICYKQQGH 368

Query: 421 VSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAP-SPSGLSIIGNIQQEGIQI 479
                P+++F+F G      P   +L  V D G FC A  P SP   +IIG + Q   Q 
Sbjct: 369 TFHNYPSMAFHFQGADFFVEPEYVYLT-VQDRGAFCVALQPISPQQRTIIGALNQANTQF 427

Query: 480 SFDGANGFVGFGPNVC 495
            +D AN  + F P  C
Sbjct: 428 IYDAANRQLLFTPENC 443


>gi|125595873|gb|EAZ35653.1| hypothetical protein OsJ_19940 [Oryza sativa Japonica Group]
          Length = 468

 Score =  153 bits (386), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 122/345 (35%), Positives = 160/345 (46%), Gaps = 40/345 (11%)

Query: 161 GVGSPPRSQYMVIDSGSDIVWVQCQPCS--QCYKQSDPVFDPADSASFSGVSCSSAVCDR 218
            +  P  +Q M ID+  D+ W+QC PC   +CY Q + +FDP  S + + V C SA C  
Sbjct: 154 AIDDPILAQPMSIDTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGE 213

Query: 219 LENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMF-VGAAG 277
           L          RY      G +           + R   +     C H  +G F    +G
Sbjct: 214 LG---------RY------GRWLLQQPVPVLRRLRRRQGQPRGRTC-HAVRGNFSASTSG 257

Query: 278 LLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGA---AWVPLVRN 334
            + LGGG  SL+ Q     G AFSYC+      SSG L  G  A   GA   A  PLVRN
Sbjct: 258 TMSLGGGRQSLLSQTAATFGNAFSYCVPD--PSSSGFLSLGGPADGGGAGRFARTPLVRN 315

Query: 335 PRA-PSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRD 393
           P   P+ Y V L G+ VGG R+ +   +F        G VMD+   +T+LP  AY A R 
Sbjct: 316 PSIIPTLYLVRLRGIEVGGRRLNVPPVVF------AGGAVMDSSVIITQLPPTAYRALRL 369

Query: 394 AFVAQTGNLPR-ASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDA 452
           AF +     PR A G +  DTCY+   F SV VP VS  F GG V+ L A   ++     
Sbjct: 370 AFRSAMAAYPRVAGGRAGLDTCYDFVRFTSVTVPAVSLVFDGGAVVRLDAMGVMV----- 424

Query: 453 GTFCFAFAPSPS--GLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
              C AF P+P    L  IGN+QQ+  ++ +D   G VGF    C
Sbjct: 425 -EGCLAFVPTPGDFALGFIGNVQQQTHEVLYDVGGGSVGFRRGAC 468


>gi|356540369|ref|XP_003538662.1| PREDICTED: LOW QUALITY PROTEIN: probable aspartic protease
           At2g35615-like [Glycine max]
          Length = 364

 Score =  153 bits (386), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 128/353 (36%), Positives = 172/353 (48%), Gaps = 35/353 (9%)

Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDP-ADSASFSGVSC 211
           +G+Y +++ +G+PP   Y ++D+ SD+VW QC PC  CYKQ +P+FDP  +  SF   SC
Sbjct: 28  NGDYLMKLTLGTPPVDVYGLVDTDSDLVWAQCTPCQGCYKQKNPMFDPLKECNSFFDHSC 87

Query: 212 SSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTI----GRTVVKNVAIGCGHK 267
           S       E A      C Y  +Y D S TKG LA E  T     G+ +V+++  GCGH 
Sbjct: 88  SP------EKA------CDYVYAYADDSATKGMLAKEIATFSSTDGKPIVESIIFGCGHN 135

Query: 268 NQGMF-VGAAGLLGLGGGSMSLVGQLGGQTGGA-FSYCLVS--RGTGSSGSLVFGREALP 323
           N G+F     GL+GLGGG +SLV Q+G   G   FS CLV       +SG++  G EA  
Sbjct: 136 NTGVFNENDMGLIGLGGGPLSLVSQMGNLYGSKRFSQCLVPFHADPHTSGTISLG-EASD 194

Query: 324 V---GAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVM-DTGTA 379
           V   G    PLV      + Y V L G+ VG   +P     F  ++M   G +M D+GT 
Sbjct: 195 VSGEGVVTTPLVSE-EGQTPYLVTLEGISVGDTFVP-----FNSSEMLSKGNIMIDSGTP 248

Query: 380 VTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLT 439
            T LP   Y+   +    Q  NLP         T        ++  P ++ +F G  V  
Sbjct: 249 ETYLPQEFYDRLVEELKVQI-NLPPIHVDPDLGTQLCYKSETNLEGPILTAHFEGADVKL 307

Query: 440 LPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGP 492
           LP   F+ P D  G FCFA   +  GL I GN  Q  + I FD     V F P
Sbjct: 308 LPLQTFIPPKD--GVFCFAMTGTTDGLYIFGNFAQSNVLIGFDLDKRIVFFKP 358


>gi|224114179|ref|XP_002332420.1| predicted protein [Populus trichocarpa]
 gi|222832373|gb|EEE70850.1| predicted protein [Populus trichocarpa]
          Length = 449

 Score =  152 bits (384), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 105/371 (28%), Positives = 171/371 (46%), Gaps = 36/371 (9%)

Query: 156 YFVRIGVGSPP--------RSQYMVIDSGSDIVWVQCQPC----SQCYKQSDPVFDPADS 203
           +  ++GVGS          ++ Y  ID+G+++ W+QC+ C    + C+   DP +  + S
Sbjct: 80  FLAQVGVGSFQEKSHRTHFKTYYFQIDTGNELSWIQCEGCQNKGNMCFPHKDPPYTSSQS 139

Query: 204 ASFSGVSCSS-AVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTI-----GRTVV 257
            S+  VSC+  + C   E   C  G C Y V+YG GSYT G LA ET T        T +
Sbjct: 140 KSYKPVSCNQHSFC---EPNQCKEGLCAYNVTYGPGSYTSGNLANETFTFYSNHGKHTAL 196

Query: 258 KNVAIGCGHKNQGMFVG-------AAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTG 310
           K+++ GC   ++ M           +G+LG+G G  S + QLG  + G FSYC+ +  T 
Sbjct: 197 KSISFGCSTDSRNMIYAFLLDKNPVSGVLGMGWGPRSFLAQLGSISHGKFSYCITANNTH 256

Query: 311 SSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDD 370
           ++  L FG+  +         +   +  + Y+V L G+ V G+++ I++    + + G  
Sbjct: 257 NT-YLRFGKHVVKSKNLQTTKIMQVKPSAAYHVNLLGISVNGVKLNITKTDLAVRKDGSR 315

Query: 371 GVVMDTGTAVTRLPTPAYEAFRDAF---VAQTGNLPRASGVSIF-DTCYN-LSGFVSVRV 425
           G ++D GT  T L  P ++    A    ++   NL R     +  D CY  LS      +
Sbjct: 316 GCIIDAGTLATLLVKPIFDTLHTALSNHLSSNQNLKRWVIHKLHKDLCYEQLSDAGRKNL 375

Query: 426 PTVSFYFSGGPVLTLPASNFLI-PVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGA 484
           P V+F+     +   P + FL    +    FC +   S    +IIG  QQ   +  +D  
Sbjct: 376 PVVTFHLENADLEVKPEAIFLFREFEGKNVFCLSML-SDDSKTIIGAYQQMKQKFVYDTK 434

Query: 485 NGFVGFGPNVC 495
              + FGP  C
Sbjct: 435 ARVLSFGPEDC 445


>gi|359476197|ref|XP_003631803.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 414

 Score =  152 bits (384), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 113/340 (33%), Positives = 160/340 (47%), Gaps = 37/340 (10%)

Query: 161 GVGSPPRSQYMVIDSGSD-IVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRL 219
           G   PP  Q ++ +   D I W QC+PC +C K S   FDP+ S ++S  SC  +     
Sbjct: 79  GHSQPPSPQEILAEMNPDSITWTQCKPCVRCLKDSHRHFDPSASLTYSLGSCIPSTVGN- 137

Query: 220 ENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQGMF-VGAAG 277
                      Y ++YGD S + G    +T+T+  + V      GCG  N+G F  GA G
Sbjct: 138 ----------TYNMTYGDKSTSVGNYGCDTMTLEPSDVFPKFQFGCGRNNEGDFGSGADG 187

Query: 278 LLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGA-AWVPLVRNP- 335
           +LGLG G +S V Q   +    FSYCL      S GSL+FG +A    +  +  LV  P 
Sbjct: 188 MLGLGQGQLSTVSQTASKFKKVFSYCLPEE--DSIGSLLFGEKATSQSSLKFTSLVNGPG 245

Query: 336 ----RAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAF 391
                   +Y+V L  + VG  R+ +   +F        G ++D+GT +T LP  AY A 
Sbjct: 246 TSGLEESGYYFVKLLDISVGNKRLNVPSSVF-----ASPGTIIDSGTVITCLPQRAYSAL 300

Query: 392 RDAFVAQTGNLPRASGV----SIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLI 447
             AF       P ++G      I DTCYNLSG   V +P +  +F  G  + L     +I
Sbjct: 301 TAAFKKAMAKYPLSNGRRKKGDILDTCYNLSGRKDVLLPEIVLHFGEGADVRLNGKR-VI 359

Query: 448 PVDDAGTFCFAFAPSP-----SGLSIIGNIQQEGIQISFD 482
             +DA   C AFA +      S L+IIGN QQ  + + +D
Sbjct: 360 WGNDASRLCLAFAGNSKSTMNSELTIIGNRQQVSLTVLYD 399


>gi|297803034|ref|XP_002869401.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
 gi|297315237|gb|EFH45660.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
          Length = 438

 Score =  152 bits (384), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 112/348 (32%), Positives = 166/348 (47%), Gaps = 34/348 (9%)

Query: 156 YFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSS-- 213
           + V I +GSPP +Q + +D+ SD++W+QC+PC  CY QS P+FDP+ S +    SC +  
Sbjct: 85  FLVNISIGSPPVTQLLHMDTASDLLWLQCRPCINCYAQSLPIFDPSRSYTHRNESCRTSQ 144

Query: 214 -AVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIG-------RTVVKNVAIGCG 265
            ++     NA   +  C Y + Y DG+ +KG LA E L             + +V  GCG
Sbjct: 145 YSMPSLRFNAKTRS--CEYSMRYMDGTGSKGILAKEMLMFNTIYDESSSAALHDVVFGCG 202

Query: 266 HKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGS--SGSLVFGREALP 323
           H N G  +   G+LGLG G  SLV + G +    FSYC  S    S     LV G +   
Sbjct: 203 HDNYGEPLVGTGILGLGYGEFSLVHRFGTK----FSYCFGSLDDPSYPHNVLVLGDDGAN 258

Query: 324 VGAAWVPL-VRNPRAPSFYYVGLSGLGVGGMRIPISEDLF-RLTQMGDDGVVMDTGTAVT 381
           +     PL + N     FYYV +  + V G+ +PI   +F R  Q G  G ++DTG ++T
Sbjct: 259 ILGDTTPLEIYN----GFYYVTIEAISVDGIILPIDPWVFNRNHQTGLGGTIIDTGNSLT 314

Query: 382 RLPTPAYEAFRDAFVAQTGNLPRASGVSIFDT----CYN---LSGFVSVRVPTVSFYFSG 434
            L   AY+  ++           A+ V+  D     CYN       V    P V+F+FS 
Sbjct: 315 SLVEEAYKPLKNKIEDYFEGRFTAADVNQDDMFKVECYNGNLERDLVESGFPIVTFHFSD 374

Query: 435 GPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFD 482
           G  L+L   +  + +     FC A   +P  ++ IG   Q+   I +D
Sbjct: 375 GAELSLDVKSVFMKL-SPNVFCLAV--TPGNMNSIGATAQQSYNIGYD 419


>gi|125547728|gb|EAY93550.1| hypothetical protein OsI_15341 [Oryza sativa Indica Group]
          Length = 418

 Score =  152 bits (384), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 118/368 (32%), Positives = 169/368 (45%), Gaps = 66/368 (17%)

Query: 155 EYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQ--PCSQCYKQSDPVFDPADSASFSGVSCS 212
           EY V +  G+PP+   + +D+GSDI W QC+  P S C+ Q+ P+FDP+ S+SF+ + CS
Sbjct: 87  EYLVHLAAGTPPQEVQLTLDTGSDITWTQCKRCPASACFNQTLPLFDPSASSSFASLPCS 146

Query: 213 SAVCDRLENAG----CHAGRCRYEVSYGDGSYTKGTLALETLTIGR-------TVVKNVA 261
           S  C+     G      +  C Y +SYGDGS ++G +  E  T            V  + 
Sbjct: 147 SPACETTPPCGGGNDATSRPCNYSISYGDGSVSRGEIGREVFTFASGTGEGSSAAVPGLV 206

Query: 262 IGCGHKNQGMFV-GAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGRE 320
            GCGH N+G+F     G+ G G GS+SL  QL     G FS+C  +  TGS  S V    
Sbjct: 207 FGCGHANRGVFTSNETGIAGFGRGSLSLPSQL---KVGNFSHCFTTI-TGSKTSAVL--- 259

Query: 321 ALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLT---QMGDDGVVMDTG 377
                                      LG+ G+  P +  L R     +        ++G
Sbjct: 260 ---------------------------LGLPGVAPPSASPLGRRRGSYRCRSTPRSSNSG 292

Query: 378 TAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFD-TCYN--LSGFVSVRVPTVSFYFSG 434
           T++T LP   Y A R+ F AQ   LP   G +    TC++  L G     VPT++ +F G
Sbjct: 293 TSITSLPPRTYRAVREEFAAQV-KLPVVPGNATDPFTCFSAPLRG-PKPDVPTMALHFEG 350

Query: 435 GPVLTLPASNFLIPV---DDAGT----FCFAFAPSPSGLSIIGNIQQEGIQISFDGANGF 487
              + LP  N++  V   DDAG      C A      G  I+GNIQQ+ + + +D  N  
Sbjct: 351 A-TMRLPQENYVFEVVDDDDAGNSSRIICLAVI--EGGEIILGNIQQQNMHVLYDLQNSK 407

Query: 488 VGFGPNVC 495
           + F P  C
Sbjct: 408 LSFVPAQC 415


>gi|242094478|ref|XP_002437729.1| hypothetical protein SORBIDRAFT_10g001440 [Sorghum bicolor]
 gi|241915952|gb|EER89096.1| hypothetical protein SORBIDRAFT_10g001440 [Sorghum bicolor]
          Length = 486

 Score =  152 bits (384), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 136/431 (31%), Positives = 199/431 (46%), Gaps = 60/431 (13%)

Query: 103 RHQHSFHARMQRDVKRVATLVRRLSGG--GADAAK---------HEVQDFGTDVVS---G 148
           R + S    +++D  RV  + RR+SG   GA A+K          E Q      +S   G
Sbjct: 78  RTKPSLADVLRQDRLRVHHIHRRVSGSSRGARASKGSFKEPVSVEETQLHHQAAISVEVG 137

Query: 149 MDQGSGEYFVRI-------GVGSPPRSQYMVIDSGSDIVWVQCQPCS--QCYKQSDPVFD 199
             Q S E    I       G  SPP +  +V+D+  D+ W++C PC+  QC       +D
Sbjct: 138 TSQTSSEPSSGIHPAAATDGSSSPPVT--VVLDTAGDVPWMRCVPCTFAQCAD-----YD 190

Query: 200 PADSASFSGVSCSSAVCDRLEN--AGCHA-GRCRYEV-SYGDGSYTKGTLALETLTIGR- 254
           P  S+++S   C+S+ C +L     GC A G+C+Y V + GD   T GT + + LTI   
Sbjct: 191 PTRSSTYSAFPCNSSACKQLGRYANGCDANGQCQYMVVTAGDSFTTSGTYSSDVLTINSG 250

Query: 255 TVVKNVAIGCGHKNQGMFVGAA-GLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSG 313
             V+    GC    QG F   A G++ LG G  SL+ Q     G AFSYCL    T    
Sbjct: 251 DRVEGFRFGCSQNEQGSFENQADGIMALGRGVQSLMAQTSSTYGDAFSYCLPPTETTKG- 309

Query: 314 SLVFGREALPVGAAW----VPLVR-----NPRAPSFYYVGLSGLGVGGMRIPISEDLFRL 364
              F +  +P+GA++     P+++     +  A + Y   L  + V G  + +  ++F  
Sbjct: 310 ---FFQIGVPIGASYRFVTTPMLKERGGASAAAATLYRALLLAITVDGKELNVPAEVFAA 366

Query: 365 TQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVR 424
                 G VMD+ T +TRLP  AY A R AF  +      A      DTCY+L+G    R
Sbjct: 367 ------GTVMDSRTIITRLPVTAYGALRAAFRNRM-RYRVAPPQEELDTCYDLTGVRYPR 419

Query: 425 VPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGA 484
           +P ++  F G  V+ +  S  L+     G   FA     S  SI+GN+QQ+ IQ+  D  
Sbjct: 420 LPRIALVFDGNAVVEMDRSGILL----NGCLAFASNDDDSSPSILGNVQQQTIQVLHDVG 475

Query: 485 NGFVGFGPNVC 495
            G +GF    C
Sbjct: 476 GGRIGFRSAAC 486


>gi|242056193|ref|XP_002457242.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
 gi|241929217|gb|EES02362.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
          Length = 457

 Score =  152 bits (384), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 139/452 (30%), Positives = 196/452 (43%), Gaps = 49/452 (10%)

Query: 69  SSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSG 128
           +++ ++ E  ++++ +HRD   S        H     H+      R   R   L R  SG
Sbjct: 23  TASAAAGEGGFSVDFIHRDSARSPYR-----HPALSPHARALAAARRSLRGEVLGRSYSG 77

Query: 129 GGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS 188
               AA     D G  V S +   S EY + + VG+PP     + D+GSD+VWV C    
Sbjct: 78  ASPAAAPVSAADGG--VESKIITRSFEYLMYVNVGTPPTQLLAIADTGSDLVWVNCSSSG 135

Query: 189 QCYKQSDP----VFDPADSASFSGVSCSSAVCDRLENAGCHA-GRCRYEVSYGDGSYTKG 243
                +D     VF P  S+++S +SC S  C  L  A C A   C+Y+ SYGDGS T G
Sbjct: 136 GGLADADAGGNVVFQPTRSSTYSQLSCQSNACQALSQASCDADSECQYQYSYGDGSRTIG 195

Query: 244 TLALETLTI------GRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQT- 296
            L+ ET +       G+  V  V  GC   + G F  + GL+GLG G+ SLV QLG  T 
Sbjct: 196 VLSTETFSFVDGGGKGQVRVPRVNFGCSTASAGTFR-SDGLVGLGAGAFSLVSQLGATTH 254

Query: 297 -GGAFSYCLV-SRGTGSSGSLVFGREAL--PVGAAWVPLVRNPRAPSFYYVGLSGLGVGG 352
                SYCL+ S    SS +L FG  A+    GAA  PLV +    S+Y V L  + VGG
Sbjct: 255 IDRKLSYCLIPSYDANSSSTLNFGSRAVVSEPGAASTPLVPS-DVDSYYTVALESVAVGG 313

Query: 353 MRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPT----PAYEAFRDAFVAQTGNLPRASGV 408
             +             D  +++D+GT +T L      P           Q    P     
Sbjct: 314 QEVATH----------DSRIIVDSGTTLTFLDPALLGPLVTELERRIKLQRVQPPE---- 359

Query: 409 SIFDTCYNLSGFVSVR---VPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAP--SP 463
            +   CY++ G        +P V+  F GG  +TL   N    + + GT C    P    
Sbjct: 360 QLLQLCYDVQGKSETDNFGIPDVTLRFGGGAAVTLRPENTFSLLQE-GTLCLVLVPVSES 418

Query: 464 SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
             +SI+GNI Q+   + +D     V F    C
Sbjct: 419 QPVSILGNIAQQNFHVGYDLDARTVTFAAADC 450


>gi|449516339|ref|XP_004165204.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 456

 Score =  152 bits (383), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 125/437 (28%), Positives = 201/437 (45%), Gaps = 45/437 (10%)

Query: 67  ISSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRL 126
           ISS+  ++  +R   +L+HR+         N     R +    + ++R    + + ++ L
Sbjct: 26  ISSTLITTKPSRLATKLIHRNSYLHPLYDQNETVEDRSKREQTSSIER-FDFLESKIKEL 84

Query: 127 SGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQP 186
              G +A    +           ++GSG + V + +GSPP +Q +V+D+GS ++WVQC P
Sbjct: 85  KSVGNEARSSLIP---------FNRGSG-FLVNLSIGSPPVTQLVVVDTGSSLLWVQCLP 134

Query: 187 CSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHA-GRCRYEVSYGDGSYTKGTL 245
           C  C++QS   FDP  S SF  + C     + +    C+   +  Y++ Y  G  ++G L
Sbjct: 135 CINCFQQSTSWFDPLKSVSFKTLGCGFPGYNYINGYKCNRFNQAEYKLRYLGGDSSQGIL 194

Query: 246 A-----LETLTIGRTVVKNVAIGCGHKNQGMFVGAA--GLLGLGG-GSMSLVGQLGGQTG 297
           A      ETL  G+    N+  GCGH N       A  G+ GLG    +++  QLG +  
Sbjct: 195 AKESLLFETLDEGKIKKSNITFGCGHMNIKTNNDDAYNGVFGLGAYPHITMATQLGNK-- 252

Query: 298 GAFSYCL--VSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSF--YYVGLSGLGVGGM 353
             FSYC+  ++    +   LV G+       +++     P    F  YYV L  + VG  
Sbjct: 253 --FSYCIGDINNPLYTHNHLVLGQ------GSYIEGDSTPLQIHFGHYYVTLQSISVGSK 304

Query: 354 RIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFV-AQTGNLPRASGVSIFD 412
            + I  + F+++  G  GV++D+G   T+L    +E   D  V    G L R      F+
Sbjct: 305 TLKIDPNAFKISSDGSGGVLIDSGMTYTKLANGGFELLYDEIVDLMKGLLERIPTQRKFE 364

Query: 413 -TCYNLSGFVS---VRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPS---G 465
             C+   G VS   V  P V+F+F+GG  L L + + L        FC A  PS S    
Sbjct: 365 GLCF--KGVVSRDLVGFPAVTFHFAGGADLVLESGS-LFRQHGGDRFCLAILPSNSELLN 421

Query: 466 LSIIGNIQQEGIQISFD 482
           LS+IG + Q+   + FD
Sbjct: 422 LSVIGILAQQNYNVGFD 438


>gi|356513697|ref|XP_003525547.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 252

 Score =  152 bits (383), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 91/238 (38%), Positives = 133/238 (55%), Gaps = 25/238 (10%)

Query: 100 HYHRHQHSFHARMQRD-------VKRVATLVRRLSGGGADAAKHEVQDFGTDV--VSGMD 150
           H    +  ++ R+Q+        V+ +   +RR+      A+ H V+   T +   SG++
Sbjct: 6   HCSEKKIDWNRRLQKQLILDDLRVRSMQNRIRRV------ASTHNVEASQTQIPLSSGIN 59

Query: 151 QGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVS 210
             +  Y V +G+GS  ++  ++ID+ SD+ WVQC+PC  CY Q  P+F P+ S+S+  VS
Sbjct: 60  LQTLNYIVTMGLGS--KNMTVIIDTRSDLTWVQCEPCMSCYNQQGPIFKPSTSSSYQSVS 117

Query: 211 CSSAVCDRLENAGCHAG--------RCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAI 262
           C+S+ C  L+ A  + G         C Y V+YGDGSYT G L +E L+ G   V +   
Sbjct: 118 CNSSTCQSLQFATGNTGACGSSNPSTCNYVVNYGDGSYTNGDLGVEALSFGGVSVSDFVF 177

Query: 263 GCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGRE 320
           GCG  N+G+F G +GL+GLG   +SLV Q     GG FSYCL +   GSSGSLV G E
Sbjct: 178 GCGRNNKGLFGGVSGLMGLGRSYLSLVSQTNATFGGVFSYCLPTTEAGSSGSLVMGNE 235


>gi|15234607|ref|NP_194733.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|4938480|emb|CAB43839.1| putative protein [Arabidopsis thaliana]
 gi|7269904|emb|CAB80997.1| putative protein [Arabidopsis thaliana]
 gi|67633776|gb|AAY78812.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332660311|gb|AEE85711.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 427

 Score =  151 bits (382), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 110/346 (31%), Positives = 161/346 (46%), Gaps = 30/346 (8%)

Query: 156 YFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAV 215
           + V I +GSPP +Q + +D+ SD++W+QC PC  CY QS P+FDP+ S +    +C ++ 
Sbjct: 85  FLVNISIGSPPITQLLHMDTASDLLWIQCLPCINCYAQSLPIFDPSRSYTHRNETCRTSQ 144

Query: 216 CDRLE-NAGCHAGRCRYEVSYGDGSYTKGTLALETLTIG-------RTVVKNVAIGCGHK 267
                     +   C Y + Y D + +KG LA E L             + +V  GCGH 
Sbjct: 145 YSMPSLKFNANTRSCEYSMRYVDDTGSKGILAREMLLFNTIYDESSSAALHDVVFGCGHD 204

Query: 268 NQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGS--SGSLVFGREALPVG 325
           N G  +   G+LGLG G  SLV + G +    FSYC  S    S     LV G +   + 
Sbjct: 205 NYGEPLVGTGILGLGYGEFSLVHRFGKK----FSYCFGSLDDPSYPHNVLVLGDDGANIL 260

Query: 326 AAWVPL-VRNPRAPSFYYVGLSGLGVGGMRIPISEDLF-RLTQMGDDGVVMDTGTAVTRL 383
               PL + N     FYYV +  + V G+ +PI   +F R  Q G  G ++DTG ++T L
Sbjct: 261 GDTTPLEIHN----GFYYVTIEAISVDGIILPIDPRVFNRNHQTGLGGTIIDTGNSLTSL 316

Query: 384 PTPAYEAFRDAFVAQTGNLPRASGVSIFDT----CYN---LSGFVSVRVPTVSFYFSGGP 436
              AY+  ++           A+ VS  D     CYN       V    P V+F+FS G 
Sbjct: 317 VEEAYKPLKNRIEDIFEGRFTAADVSQDDMIKMECYNGNFERDLVESGFPIVTFHFSEGA 376

Query: 437 VLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFD 482
            L+L   +  + +     FC A   +P  L+ IG   Q+   I +D
Sbjct: 377 ELSLDVKSLFMKL-SPNVFCLAV--TPGNLNSIGATAQQSYNIGYD 419


>gi|357159298|ref|XP_003578403.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 442

 Score =  151 bits (382), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 124/378 (32%), Positives = 187/378 (49%), Gaps = 38/378 (10%)

Query: 144 DVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC---SQCYKQSDPVFDP 200
           DV + + + + +Y     +GSPP+    +ID+GSD++W QC        C KQ  P ++ 
Sbjct: 74  DVSAQVHRATRQYIASYLIGSPPQRTEALIDTGSDLIWTQCATTCLPKSCAKQGLPYYNL 133

Query: 201 ADSASFSGVSCSSAVCDRLENAGCHA----GRCRYEVSYGDGSYTKGTLALETLTIGRTV 256
           + S++F  V C+        N G H     G C +  SYG G    G+L  E+     + 
Sbjct: 134 SQSSTFVPVPCADKAGFCAAN-GVHLCGLDGSCTFIASYGAGRVI-GSLGTESFAF-ESG 190

Query: 257 VKNVAIGC---GHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS--RGTGS 311
             ++A GC        G    A+GL+GLG G +SLV Q+G      FSYCL      +G+
Sbjct: 191 TTSLAFGCVSLTRITSGALNDASGLIGLGRGRLSLVSQIGATR---FSYCLTPYFHSSGA 247

Query: 312 SGSL-VFGREALPVGAAWVPLVRNPR---APSFYYVGLSGLGVGGMRIP-ISEDLFRLTQ 366
           S  L V    +L  G A +P V++P+     +FYY+ L G+ VG  R+P ++   F+L Q
Sbjct: 248 SSHLFVGASASLGGGGASMPFVKSPKDYPYSTFYYLPLEGITVGKTRLPAVNSTTFQLRQ 307

Query: 367 MGD----DGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGN-----LPRASGVSIFDTCYNL 417
           +       GV++DTG+ +T+L + AYEA ++   AQ GN      P  SG+ +   C   
Sbjct: 308 LFKGYWAGGVIIDTGSPLTQLASHAYEALKEEVAAQLGNGSLVPAPEDSGLEL---CVAR 364

Query: 418 SGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGI 477
            GF  V VP + F+F GG  + +PA+++  PVD A   C          SIIGN QQ+ +
Sbjct: 365 EGFQKV-VPALVFHFGGGADMAVPAASYWAPVDKAAA-CMMILEGGYD-SIIGNFQQQDM 421

Query: 478 QISFDGANGFVGFGPNVC 495
            + +D   G   F    C
Sbjct: 422 HLLYDLRRGRFSFQTADC 439


>gi|414878073|tpg|DAA55204.1| TPA: hypothetical protein ZEAMMB73_344109 [Zea mays]
          Length = 440

 Score =  151 bits (381), Expect = 8e-34,   Method: Compositional matrix adjust.
 Identities = 135/447 (30%), Positives = 199/447 (44%), Gaps = 67/447 (14%)

Query: 81  LELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQD 140
           LEL H D               +   S   RM+R  +R  T  R  S G A A  H  + 
Sbjct: 26  LELTHVDA--------------KQNCSTEERMRRATER--THRRLASMGEASAPVHWAES 69

Query: 141 FGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQ--CYKQSDPVF 198
                         +Y     +G PP+    +ID+GS+++W QC  C    C+ Q+   +
Sbjct: 70  --------------QYIAEYLIGDPPQQAEAIIDTGSNLIWTQCSTCQPAGCFSQNLSFY 115

Query: 199 DPADSASFSGVSCSSAVCDRLENAGCHAGR--CRYEVSYGDGSYTKGTLALETLTIGRTV 256
           DP+ S +   V+C+   C       C      C    +YG G    G L  E  T  +  
Sbjct: 116 DPSRSRTARPVACNDTACALGSETRCARDNKACAVLTAYGAG-VIGGVLGTEAFTF-QPQ 173

Query: 257 VKNV--AIGCGHKNQ---GMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLV---SRG 308
            +NV  A GC    +   G   GA+G++GLG G++SLV QLG      FSYCL    S+ 
Sbjct: 174 SENVSLAFGCIAATRLTPGSLDGASGIIGLGRGNLSLVSQLGDNK---FSYCLTPYFSQS 230

Query: 309 TGSSGSLVFGREALPVG---AAWVPLVRNPRA---PSFYYVGLSGLGVGGMRIPISEDLF 362
           T +S   V     L  G   A  VP ++NP      +FYY+ L+G+ VG  ++ + E  F
Sbjct: 231 TNTSRLFVGASAGLSSGGAPATSVPFLKNPDVDPFSTFYYLPLTGITVGDAKLAVPEAAF 290

Query: 363 RLTQMGD---DGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGN--LPRASGVSIFDTCYNL 417
            L Q+      G ++D+G+  T L   AY+A RD  V Q G   +P  +G    D C  +
Sbjct: 291 DLRQVATGLWAGTLIDSGSPFTSLVDVAYQALRDELVQQLGASIVPPPAGAEGLDLCAAV 350

Query: 418 S-GFVSVRVPTVSFYF-SGGPVLTLPASNFLIPVDDAGTFCFAFA---PSPS----GLSI 468
           + G V   VP +  +F SGG  + +P  N+  PVDD+      F+   P+ +      +I
Sbjct: 351 AHGDVGKLVPPLVLHFGSGGGDVAVPPENYWGPVDDSTACMVVFSSGGPNSTLPMNETTI 410

Query: 469 IGNIQQEGIQISFDGANGFVGFGPNVC 495
           IGN  Q+ + + +D   G + F P  C
Sbjct: 411 IGNYMQQDMHLLYDLEKGMLSFQPADC 437


>gi|125527257|gb|EAY75371.1| hypothetical protein OsI_03267 [Oryza sativa Indica Group]
          Length = 484

 Score =  151 bits (381), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 129/450 (28%), Positives = 200/450 (44%), Gaps = 78/450 (17%)

Query: 110 ARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQ 169
           ARM R+  R+A +  R    G   A      F   + SG   G+G+YFVR  VG+P +  
Sbjct: 47  ARMDRE--RMAFISSR----GRRRAAETASAFAMPLSSGAYTGTGQYFVRFRVGTPAQPF 100

Query: 170 YMVIDSGSDIVWVQCQ------------------PCSQCYKQSDPVFDPADSASFSGVSC 211
            +V D+GSD+ WV+C                   P     +++   F P  S +++ + C
Sbjct: 101 LLVADTGSDLTWVKCHRAAAAASASPRNASSLPAPAPASPRRT---FRPDKSRTWAPIPC 157

Query: 212 SSAVCDR-----LENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIG-------RTVVKN 259
           SSA C       L      A  C Y+  Y DGS  +GT+ +++ TI        +  ++ 
Sbjct: 158 SSATCRESLPFSLAACATPANPCAYDYRYKDGSAARGTVGVDSATIALSGRAARKAKLRG 217

Query: 260 VAIGCGHKNQGM-FVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR--GTGSSGSLV 316
           V +GC     G  F+ + G+L LG  ++S   +   + GG FSYCLV       ++  L 
Sbjct: 218 VVLGCTTSYNGQSFLASDGVLSLGYSNISFASRAASRFGGRFSYCLVDHLAPRNATSYLT 277

Query: 317 FG--------REALPVGAA-----------------WVPLVRNPRAPSFYYVGLSGLGVG 351
           FG        R +  + +                    PLV + R   FY V + G+ V 
Sbjct: 278 FGPNPAFSSRRPSEGIASCKPAPAPTPAPAGAPGARQTPLVLDHRTRPFYAVTVKGVSVA 337

Query: 352 GMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIF 411
           G  + I   ++ + Q G  G ++D+GT++T L  PAY A   A   +   LPR + +  F
Sbjct: 338 GELLKIPRAVWDVEQGG--GAILDSGTSLTMLAKPAYRAVVAALSKRLAGLPRVT-MDPF 394

Query: 412 DTCYNLSGF----VSVRVPTVSFYFSGGPVLTLPASNFLIPVDDA-GTFCFAFAPSP-SG 465
           D CYN +      V+  +P ++ +F+G   L  PA +++I  D A G  C      P  G
Sbjct: 395 DYCYNWTSPSGSDVAAPLPMLAVHFAGSARLEPPAKSYVI--DAAPGVKCIGLQEGPWPG 452

Query: 466 LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           LS+IGNI Q+     +D  N  + F  + C
Sbjct: 453 LSVIGNILQQEHLWEYDLKNRRLRFKRSRC 482


>gi|413944596|gb|AFW77245.1| hypothetical protein ZEAMMB73_545774 [Zea mays]
 gi|414876929|tpg|DAA54060.1| TPA: hypothetical protein ZEAMMB73_875469 [Zea mays]
          Length = 459

 Score =  151 bits (381), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 123/362 (33%), Positives = 179/362 (49%), Gaps = 33/362 (9%)

Query: 149 MDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQC--QPCSQCYKQSDPVFDPADSASF 206
           MD   G Y +   +G+PP+    + D+GSD++W +C     + C  Q  P + P  S++F
Sbjct: 84  MDDSGGAYDMEFSMGTPPQKLTALADTGSDLIWAKCGGACTTSCEPQGSPSYLPNASSTF 143

Query: 207 SGVSCSSAVCDRLEN---AGCHAG--RCRYEVSYG----DGSYTKGTLALETLTIGRTVV 257
           + + CS  +C  L +   A C A    C Y  SYG    D  YT+G LA ET T+G   V
Sbjct: 144 AKLPCSDRLCSLLRSDSVAWCAAAGAECDYRYSYGLGDDDHHYTQGFLARETFTLGADAV 203

Query: 258 KNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVF 317
            +V  GC   ++G +   +GL+GLG G +SLV QL   T   F YCL S  + +S  L+F
Sbjct: 204 PSVRFGCTTASEGGYGSGSGLVGLGRGPLSLVSQLNAST---FMYCLTSDASKAS-PLLF 259

Query: 318 GREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIP-ISEDLFRLTQMGDDGVVMDT 376
           G  A   GA  V       + +FY V L  + +G    P + E          +GVV D+
Sbjct: 260 GSLASLTGAQ-VQSTGLLASTTFYAVNLRSISIGSATTPGVGE---------PEGVVFDS 309

Query: 377 GTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSG---FVSVRVPTVSFYFS 433
           GT +T L  PAY   + AF++QT +L +      F+ C+         +  VPT+  +F 
Sbjct: 310 GTTLTYLAEPAYSEAKAAFLSQT-SLDQVEDTDGFEACFQKPANGRLSNAAVPTMVLHFD 368

Query: 434 GGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPN 493
           G   + LP +N+++ V+D G  C+    SPS LSIIGNI Q    +  D     + F P 
Sbjct: 369 GAD-MALPVANYVVEVED-GVVCWIVQRSPS-LSIIGNIMQVNYLVLHDVHRSVLSFQPA 425

Query: 494 VC 495
            C
Sbjct: 426 NC 427


>gi|147814824|emb|CAN65806.1| hypothetical protein VITISV_015630 [Vitis vinifera]
          Length = 449

 Score =  150 bits (380), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 116/376 (30%), Positives = 173/376 (46%), Gaps = 34/376 (9%)

Query: 150 DQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQ------PCSQCYKQS---DPVFDP 200
           D G G+YFV   VG+P +   +V D+GSD+ W+ C+       CS    +      VF  
Sbjct: 77  DYGIGQYFVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHKRVFHA 136

Query: 201 ADSASFSGVSCSSAVCD-------RLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTI- 252
             S+SF  + C + +C         L N       C Y+  Y DGS   G  A ET+T+ 
Sbjct: 137 NLSSSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTALGFFANETVTVE 196

Query: 253 ---GRTV-VKNVAIGCGHKNQGM-FVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR 307
              GR + + NV IGC    QG  F  A G++GLG    S   +   + GG FSYCLV  
Sbjct: 197 LKEGRKMKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSFAIKAAEKFGGKFSYCLVDH 256

Query: 308 GTGS--SGSLVFG----REALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDL 361
            +    S  L FG    +EAL     +  LV      SFY V + G+ +GG  + I  ++
Sbjct: 257 LSHKNVSNYLTFGSSRSKEALLNNMTYTELVLG-MVNSFYAVNMMGISIGGAMLKIPSEV 315

Query: 362 FRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRAS-GVSIFDTCYNLSGF 420
           + +   G  G ++D+G+++T L  PAY+    A         +    +   + C+N +GF
Sbjct: 316 WDVK--GAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEMDIGPLEYCFNSTGF 373

Query: 421 VSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAP-SPSGLSIIGNIQQEGIQI 479
               VP + F+F+ G     P  +++I   D G  C  F   +  G S++GNI Q+    
Sbjct: 374 EESLVPRLVFHFADGAEFEPPVKSYVISAAD-GVRCLGFVSVAWPGTSVVGNIMQQNHLW 432

Query: 480 SFDGANGFVGFGPNVC 495
            FD     +GF P+ C
Sbjct: 433 EFDLGLKKLGFAPSSC 448


>gi|356555248|ref|XP_003545946.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 453

 Score =  150 bits (379), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 126/450 (28%), Positives = 191/450 (42%), Gaps = 67/450 (14%)

Query: 81  LELVHR--DKMSSSSNTTNNMH----YHRHQHSFHARMQRDVKRVATLVRRLSGGGADAA 134
           LELVHR  ++ +      + +     + +       RM +    V+    R  G      
Sbjct: 35  LELVHRHHERFAGGGGDVDRVEAVKGFVKRDKLRRQRMNQRWGVVSNYDSRRKGFEMTTT 94

Query: 135 KHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQS 194
             EV+     + SG D   GEYF  + VGSP +  ++V+D+GS+  W+ C          
Sbjct: 95  PAEVE---MPMHSGRDDALGEYFAEVKVGSPGQRFWLVVDTGSEFTWLNC---------- 141

Query: 195 DPVFDPADSASFSGVSCSSAVCD-------RLENAGCHAGRCRYEVSYGDGSYTKGTLAL 247
                   S SF  V+C+S  C         L      +  C Y++SY DGS  KG    
Sbjct: 142 --------SKSFEAVTCASRKCKVDLSELFSLSVCPKPSDPCLYDISYADGSSAKGFFGT 193

Query: 248 ETLTIGRT-----VVKNVAIGCGHKNQGMFVGA------AGLLGLGGGSMSLVGQLGGQT 296
           +++T+G T      + N+ IGC    + M  G        G+LGLG    S + +   + 
Sbjct: 194 DSITVGLTNGKQGKLNNLTIGC---TKSMLNGVNFNEETGGILGLGFAKDSFIDKAANKY 250

Query: 297 GGAFSYCLVSRGTGSSGSLVFGREALPVG----AAWVPLVRNPRA---PSFYYVGLSGLG 349
           G  FSYCLV   +  S S       L +G    A  +  +R       P FY V + G+ 
Sbjct: 251 GAKFSYCLVDHLSHRSVS-----SNLTIGGHHNAKLLGEIRRTELILFPPFYGVNVVGIS 305

Query: 350 VGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVS 409
           +GG  + I   ++     G  G ++D+GT +T L  PAYEA  +A       + R +G  
Sbjct: 306 IGGQMLKIPPQVWDFNAEG--GTLIDSGTTLTSLLLPAYEAVFEALTKSLTKVKRVTGED 363

Query: 410 I--FDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAP--SPSG 465
               + C++  GF    VP + F+F+GG     P  +++I V      C    P     G
Sbjct: 364 FDALEFCFDAEGFDDSVVPRLVFHFAGGARFEPPVKSYIIDVAPL-VKCIGIVPIDGIGG 422

Query: 466 LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
            S+IGNI Q+     FD +   VGF P+ C
Sbjct: 423 ASVIGNIMQQNHLWEFDLSTNTVGFAPSTC 452


>gi|356570798|ref|XP_003553571.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 500

 Score =  150 bits (378), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 123/421 (29%), Positives = 185/421 (43%), Gaps = 45/421 (10%)

Query: 103 RHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGV 162
             Q    A   RD  R   +++ + GG  D +     D             G YF ++ +
Sbjct: 39  NQQVELEALRARDRARHGRILQGVVGGVVDFSVQGTSD---------PYFVGLYFTKVKL 89

Query: 163 GSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFSGVSCSSAVCD 217
           GSP +  Y+ ID+GSDI+W+ C  CS C   S        FD A S++ + VSC   +C 
Sbjct: 90  GSPAKEFYVQIDTGSDILWINCITCSNCPHSSGLGIELDFFDTAGSSTAALVSCGDPICS 149

Query: 218 ---RLENAGC--HAGRCRYEVSYGDGS-----YTKGTLALETLTIGRTVVKN----VAIG 263
              +   + C   A +C Y   YGDGS     Y   T+  +T+ +G++VV N    +  G
Sbjct: 150 YAVQTATSECSSQANQCSYTFQYGDGSGTTGYYVSDTMYFDTVLLGQSVVANSSSTIIFG 209

Query: 264 CGHKNQGMFV----GAAGLLGLGGGSMSLVGQLG--GQTGGAFSYCLVSRGTGSSGSLVF 317
           C     G          G+ G G G++S++ QL   G T   FS+CL   G    G LV 
Sbjct: 210 CSTYQSGDLTKTDKAVDGIFGFGPGALSVISQLSSRGVTPKVFSHCL-KGGENGGGVLVL 268

Query: 318 GREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTG 377
           G E L     + PLV  P  P  Y + L  + V G  +PI  ++F  T   + G ++D+G
Sbjct: 269 G-EILEPSIVYSPLV--PSQPH-YNLNLQSIAVNGQLLPIDSNVFATTN--NQGTIVDSG 322

Query: 378 TAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPV 437
           T +  L   AY  F  A  A      +   +S  + CY +S  V    P VS  F GG  
Sbjct: 323 TTLAYLVQEAYNPFVKAITAAVSQFSKPI-ISKGNQCYLVSNSVGDIFPQVSLNFMGGAS 381

Query: 438 LTLPASNFLIP---VDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNV 494
           + L   ++L+    +D A  +C  F     G +I+G++  +     +D AN  +G+    
Sbjct: 382 MVLNPEHYLMHYGFLDGAAMWCIGFQKVEQGFTILGDLVLKDKIFVYDLANQRIGWADYD 441

Query: 495 C 495
           C
Sbjct: 442 C 442


>gi|298204765|emb|CBI25263.3| unnamed protein product [Vitis vinifera]
          Length = 359

 Score =  149 bits (377), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 85/276 (30%), Positives = 131/276 (47%), Gaps = 49/276 (17%)

Query: 223 GCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLG 282
           G  A  C Y ++YGDGS+T+G L  E L  G  +VK+   GCG  N+G+F G +GL+GLG
Sbjct: 127 GSAAPICNYAINYGDGSFTRGELGHEKLKFGTILVKDFIFGCGRNNKGLFGGVSGLMGLG 186

Query: 283 GGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYY 342
              +SL+ Q                                          NP+  +FY+
Sbjct: 187 RSDLSLISQTS---------------------------------------ENPQLYNFYF 207

Query: 343 VGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNL 402
           + L+G+ +GG+ +       +   +G   +++D+GT +TRLP   Y+A +  F+ Q    
Sbjct: 208 INLTGISIGGVAL-------QAPSVGPSRILVDSGTVITRLPPTIYKALKAEFLKQFTGF 260

Query: 403 PRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASN-FLIPVDDAGTFCFAFA- 460
           P A   SI DTC+NLS +  V +PT+  +F G   LT+  +  F     DA   C A A 
Sbjct: 261 PPAPAFSILDTCFNLSAYQEVDIPTIKMHFEGNAELTVDVTGVFYFVKSDASQVCLALAS 320

Query: 461 -PSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
                 ++I+GN QQ+ +++ +D     VGF    C
Sbjct: 321 LEYQDEVAILGNYQQKNLRVIYDTKETKVGFALETC 356


>gi|242091325|ref|XP_002441495.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
 gi|241946780|gb|EES19925.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
          Length = 466

 Score =  149 bits (376), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 118/368 (32%), Positives = 175/368 (47%), Gaps = 29/368 (7%)

Query: 147 SGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASF 206
           SG   G+G+YFV++ VG+P +   +V D+GSD+ WV+C   S   +    VF P  S S+
Sbjct: 107 SGAYSGTGQYFVKLRVGTPVQEFTLVADTGSDLTWVKCAGASPPGR----VFRPKTSRSW 162

Query: 207 SGVSCSSAVCD-----RLENAGCHAGRCRYEVSYGDGSY-TKGTLALETLTI----GRTV 256
           + + CSS  C       L N    A  C Y+  Y +GS   +G +  E+ TI    G+  
Sbjct: 163 APIPCSSDTCKLDVPFTLANCSSPASPCTYDYRYKEGSAGARGIVGTESATIALPGGKVA 222

Query: 257 -VKNVAIGCGHKNQGM-FVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR--GTGSS 312
            +K+V +GC   + G  F  A G+L LG   +S   Q   + GG+FSYCLV       ++
Sbjct: 223 QLKDVVLGCSSSHDGQSFRSADGVLSLGNAKISFATQAAARFGGSFSYCLVDHLAPRNAT 282

Query: 313 GSLVFGREALP-VGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDG 371
           G L FG   +P   A    L  +P  P FY V +  + V G  + I  +++        G
Sbjct: 283 GYLAFGPGQVPRTPATQTKLFLDPEMP-FYGVKVDAIHVAGKALDIPAEVW---DAKSGG 338

Query: 372 VVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGF---VSVRVPTV 428
           V++D+G  +T L  PAY+A   A       +P+ S    F+ CYN +         +P +
Sbjct: 339 VILDSGNTLTVLAAPAYKAVVAALSKHLDGVPKVS-FPPFEHCYNWTARRPGAPEIIPKL 397

Query: 429 SFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSP-SGLSIIGNIQQEGIQISFDGANGF 487
           +  F+G   L  PA +++I V   G  C         GLS+IGNI Q+     FD  N  
Sbjct: 398 AVQFAGSARLEPPAKSYVIDVKP-GVKCIGVQEGEWPGLSVIGNIMQQEHLWEFDLKNMQ 456

Query: 488 VGFGPNVC 495
           V F  + C
Sbjct: 457 VRFKQSNC 464


>gi|115465373|ref|NP_001056286.1| Os05g0557100 [Oryza sativa Japonica Group]
 gi|113579837|dbj|BAF18200.1| Os05g0557100 [Oryza sativa Japonica Group]
 gi|125553268|gb|EAY98977.1| hypothetical protein OsI_20935 [Oryza sativa Indica Group]
          Length = 494

 Score =  149 bits (376), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 118/404 (29%), Positives = 185/404 (45%), Gaps = 55/404 (13%)

Query: 141 FGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQ----------- 189
           F   + SG   G+G+YFVR  VG+P +   ++ D+GSD+ WV+C+  +            
Sbjct: 95  FAMPLSSGAYTGTGQYFVRFRVGTPAQPFVLIADTGSDLTWVKCRGAASPSHATATASPA 154

Query: 190 ----CYKQSDPVFDPADSASFSGVSCSSAVCD-----RLENAGCHAGRCRYEVSYGDGSY 240
                      VF P DS ++S + CSS  C       L N       C Y+  Y D S 
Sbjct: 155 AAPSPAVAPPRVFRPGDSKTWSPIPCSSETCKSTIPFSLANCSSSTAACSYDYRYNDNSA 214

Query: 241 TKGTLALETLTIG-------------RTVVKNVAIGC--GHKNQGMFVGAAGLLGLGGGS 285
            +G +  ++ T+              +  ++ V +GC   H  QG F  + G+L LG  +
Sbjct: 215 ARGVVGTDSATVALSGGRGGGGGGDRKAKLQGVVLGCTTAHAGQG-FEASDGVLSLGYSN 273

Query: 286 MSLVGQLGGQTGGAFSYCLVSR--GTGSSGSLVFG------REALPVGAAWVPLVRNPRA 337
           +S   +   + GG FSYCLV       ++  L FG        + P   +  PL+ + R 
Sbjct: 274 ISFASRAASRFGGRFSYCLVDHLAPRNATSYLTFGAGPDAASSSAPAPGSRTPLLLDARV 333

Query: 338 PSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVA 397
             FY V +  + V G+ + I  +++ +   G  G ++D+GT++T L TPAY+A   A   
Sbjct: 334 RPFYAVAVDSVSVDGVALDIPAEVWDVGSNG--GTIIDSGTSLTVLATPAYKAVVAALSE 391

Query: 398 QTGNLPRASGVSIFDTCYNLS----GFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDA- 452
           Q   LPR + +  FD CYN +    G   + VP ++  F+G   L  PA +++I  D A 
Sbjct: 392 QLAGLPRVA-MDPFDYCYNWTARGDGGGDLAVPKLAVQFAGSARLEPPAKSYVI--DAAP 448

Query: 453 GTFCFAFAP-SPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           G  C      +  G+S+IGNI Q+     FD  N ++ F    C
Sbjct: 449 GVKCIGVQEGAWPGVSVIGNILQQEHLWEFDLNNRWLRFRQTSC 492


>gi|21717166|gb|AAM76359.1|AC074196_17 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433306|gb|AAP54835.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|125575546|gb|EAZ16830.1| hypothetical protein OsJ_32301 [Oryza sativa Japonica Group]
          Length = 373

 Score =  149 bits (375), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 113/358 (31%), Positives = 165/358 (46%), Gaps = 34/358 (9%)

Query: 156 YFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAV 215
           Y   + +G+PP+    +I    + VW QC PC +C+KQ  P+F+ + S+++    C +A+
Sbjct: 28  YMANLTIGTPPQPASAIIHLAGEFVWTQCSPCRRCFKQDLPLFNRSASSTYRPEPCGTAL 87

Query: 216 CDRLENAGCHA-GRCRYEVS--YGDGSYTKGTLALETLTIGRTVVKNVAIGCG-HKNQGM 271
           C+ +  + C   G C YEV   +GD S   GT   +T  IG T   ++A GC    N   
Sbjct: 88  CESVPASTCSGDGVCSYEVETMFGDTSGIGGT---DTFAIG-TATASLAFGCAMDSNIKQ 143

Query: 272 FVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRG-TGSSGSLVFGREALPVG---AA 327
            +GA+G++GLG    SLVGQ+      AFSYCL   G  G   +L+ G  A   G   AA
Sbjct: 144 LLGASGVVGLGRTPWSLVGQMNAT---AFSYCLAPHGAAGKKSALLLGASAKLAGGKSAA 200

Query: 328 WVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPA 387
             PLV      S Y + L G+  G        D+          V++DT   V+ L   A
Sbjct: 201 TTPLVNTSDDSSDYMIHLEGIKFG--------DVIIAPPPNGSVVLVDTIFGVSFLVDAA 252

Query: 388 YEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFV-----SVRVPTVSFYFSGGPVLTLPA 442
           ++A + A     G  P A+    FD C+  +        S+ +P V   F G   LT+P 
Sbjct: 253 FQAIKKAVTVAVGAAPMATPTKPFDLCFPKAAAAAGANSSLPLPDVVLTFQGAAALTVPP 312

Query: 443 SNFLIPVDDAGTFCFAFAPSP-----SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           S ++    + GT C A   S      + LSI+G + QE I   FD     + F P  C
Sbjct: 313 SKYMYDAGN-GTVCLAMMSSAMLNLTTELSILGRLHQENIHFLFDLDKETLSFEPADC 369


>gi|297741705|emb|CBI32837.3| unnamed protein product [Vitis vinifera]
          Length = 455

 Score =  148 bits (374), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 121/407 (29%), Positives = 188/407 (46%), Gaps = 30/407 (7%)

Query: 95  TTNNMHYHRHQHSFH----ARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMD 150
           +TN +H H     +       + +D    +TL R            +  DF   V   + 
Sbjct: 44  STNLIHIHSPSSPYKNVKAESLAKDTALESTLSRHAYLRARQQKALQPADF---VPPPLI 100

Query: 151 QGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVS 210
           +    +   + +G+PP + Y+V+D+GSD+ W+QC+PC  CYKQ DP+++   S S++ + 
Sbjct: 101 RDKSAFLANLSIGNPPTNVYVVLDTGSDLFWIQCEPCDVCYKQKDPIYNRTKSDSYTEML 160

Query: 211 CSSAVCDRL--ENAGCHAGRCRYEVSYGDGSYTKGTLALETLTI-----GRTVVKNVAIG 263
           C+   C  L  E     +G C Y+ SY DGS T G L+ E +              V  G
Sbjct: 161 CNEPPCLSLGREGQCSDSGSCLYQTSYADGSRTSGLLSYEKVAFTSHYSDEDKTAQVGFG 220

Query: 264 CGHKNQGMFVGA--AGLLGLGGGSMSLVGQLG--GQTGGAFSYCLVSRGTGSSGS-LVFG 318
           CG +N      +   G+LGLG G +SLV QL   G+   +F+YC  +    ++G  LVFG
Sbjct: 221 CGLQNLNFVTSSRDGGVLGLGPGLVSLVSQLSAIGKVSKSFAYCFGNLSNPNAGGFLVFG 280

Query: 319 REALPVGAAWVPLVRNPRAPSFYYVGLSGLGVG--GMRIPISEDLFRLTQMGDDGVVMDT 376
            +A  +     P+V       FYYV L G+G+G    R+ I+   F     G  GV++D+
Sbjct: 281 -DATYLNGDMTPMV----IAEFYYVNLLGIGLGVEEPRLDINSSSFERKPDGSGGVIIDS 335

Query: 377 GTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYN-LSGFVSVRVPTVSFYFSGG 435
           G+ ++  P   YE  R+A V +       S ++    C+    G      PT+  Y    
Sbjct: 336 GSTLSIFPPEVYEVVRNAVVDKLKKGYNISPLTSSPDCFEGKIGRDLPLFPTLVLYLEST 395

Query: 436 PVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFD 482
            +L    S FL   D+   FC  F  S  GLSIIG + Q+  +  ++
Sbjct: 396 GILNDRWSIFLQRYDE--LFCLGFT-SGEGLSIIGTLAQQSYKFGYN 439


>gi|449467979|ref|XP_004151699.1| PREDICTED: probable aspartic protease At2g35615-like, partial
           [Cucumis sativus]
          Length = 209

 Score =  148 bits (374), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 79/195 (40%), Positives = 114/195 (58%), Gaps = 17/195 (8%)

Query: 79  WNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEV 138
           +   L HRD + S    ++  HY R  ++F    +R + R ATL+ R +  GA       
Sbjct: 30  FTTSLFHRDSLLSPLEFSSLSHYDRLTNAF----RRSLSRSATLLNRAATNGA------- 78

Query: 139 QDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVF 198
                D+ + +  GSGEY + + +G+PP     + D+GSD++W QC PC +CYKQS P+F
Sbjct: 79  ----LDLQAPLTPGSGEYLMSVSIGTPPVDYIGMADTGSDLMWAQCLPCLKCYKQSRPIF 134

Query: 199 DPADSASFSGVSCSSAVCDRLENAGCHA-GRCRYEVSYGDGSYTKGTLALETLTIGRTVV 257
           DP  S SFS V C+S  C  ++++ C A G C Y  +YGD +YTKG L  E +TIG + V
Sbjct: 135 DPLKSTSFSHVPCNSQNCKAIDDSHCGAQGVCDYSYTYGDQTYTKGDLGFEKITIGSSSV 194

Query: 258 KNVAIGCGHKNQGMF 272
           K+V IGCGH++ G F
Sbjct: 195 KSV-IGCGHESGGGF 208


>gi|125536523|gb|EAY83011.1| hypothetical protein OsI_38231 [Oryza sativa Indica Group]
          Length = 469

 Score =  148 bits (373), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 126/381 (33%), Positives = 183/381 (48%), Gaps = 33/381 (8%)

Query: 135 KHEVQDFGTDVVSGMDQGSGEYFVRIGVGSP-PRSQYMVIDSGSDIVWVQCQPCSQCYKQ 193
           K + Q  G +  SG    +    + I VG+P  ++   ++D  S  VW QC PC+     
Sbjct: 70  KQQQQQLGGEAASG---AAPPLVINITVGTPVAQTVSGLVDITSYFVWAQCAPCAAAAGC 126

Query: 194 SDP---VFDPADSASFSGVSCSSAVCDRLENAGCHAG----------RC-RYEVSYG-DG 238
             P    F P  SA+FS + CSS +C  +    C             RC  Y ++YG   
Sbjct: 127 LPPPATAFRPNGSATFSPLPCSSDMCLPVLRETCGRAGAAANATAGARCDSYSLTYGGSA 186

Query: 239 SYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGG 298
           + T G LA +T T G T V  V  GC   + G F GA+G++G+G G++SL+ QL     G
Sbjct: 187 ANTSGYLATDTFTFGATAVPGVVFGCSDASYGDFAGASGVIGIGRGNLSLISQL---QFG 243

Query: 299 AFSYCLVSRGTGSSGS----LVFGREALPVGA--AWVPLVRNPRAPSFYYVGLSGLGVGG 352
            FSY L++      GS    + FG +A+P        PL+ +   P FYYV L+G+ V G
Sbjct: 244 KFSYQLLAPEATDDGSADSVIRFGDDAVPKTKRGQSTPLLSSTLYPDFYYVNLTGVRVDG 303

Query: 353 MRI-PISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSI- 410
            R+  I    F L   G  GV++ + T VT L   AY+  R A  ++ G LP  +G +  
Sbjct: 304 NRLDAIPAGTFDLRANGTGGVILSSTTPVTYLEQAAYDVVRAAVASRIG-LPAVNGSAAL 362

Query: 411 -FDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSII 469
             D CYN S    V+VP ++  F GG  + L A+N+    +D G  C    PS  G S++
Sbjct: 363 ELDLCYNASSMAKVKVPKLTLVFDGGADMDLSAANYFYIDNDTGLECLTMLPSQGG-SVL 421

Query: 470 GNIQQEGIQISFDGANGFVGF 490
           G + Q G  + +D   G + F
Sbjct: 422 GTLLQTGTNMIYDVDAGRLTF 442


>gi|147858841|emb|CAN78694.1| hypothetical protein VITISV_037475 [Vitis vinifera]
          Length = 442

 Score =  148 bits (373), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 121/409 (29%), Positives = 189/409 (46%), Gaps = 34/409 (8%)

Query: 95  TTNNMHYHRHQHSFH----ARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMD 150
           +TN +H H     +       + +D    +TL R            +  DF   V   + 
Sbjct: 31  STNLIHIHSPSSPYKNVKAESLAKDTALESTLSRHAYLRARQQKALQPADF---VPPPLI 87

Query: 151 QGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVS 210
           +    +   + +G+PP + Y+V+D+GSD+ W+QC+PC  CYKQ DP+++   S S++ + 
Sbjct: 88  RDKSAFLANLSIGNPPTNVYVVLDTGSDLFWIQCEPCDVCYKQKDPIYNRTKSDSYTEML 147

Query: 211 CSSAVCDRLENAG--CHAGRCRYEVSYGDGSYTKGTLALETLTI-----GRTVVKNVAIG 263
           C+   C  L   G    +G C Y+ +Y DG+ T G L+ E +              V  G
Sbjct: 148 CNEPPCVSLGREGQCSDSGSCLYQTAYADGARTSGLLSYEKVAFTSHYSDEDKTAQVGFG 207

Query: 264 CGHKNQGMFVGA--AGLLGLGGGSMSLVGQLG--GQTGGAFSYCLVSRGTGSSGS-LVFG 318
           CG +N          G+LGLG G +SLV QL   G+   +F+YC  +    ++G  LVFG
Sbjct: 208 CGLQNLNFITSNRDGGVLGLGPGLVSLVSQLSAIGKVSKSFAYCFGNISNPNAGGFLVFG 267

Query: 319 REALPVGAAWVPLVRNPRAPSFYYVGL--SGLGVGGMRIPISEDLFRLTQMGDDGVVMDT 376
            +A  +     P+V       FYYV L   GLGVG  R+ I+   F     G  GV++D+
Sbjct: 268 -DATYLNGDMTPMV----IAEFYYVNLLGIGLGVGEPRLDINSSSFERKPDGSGGVIIDS 322

Query: 377 GTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRV---PTVSFYFS 433
           G+ ++  P   YE  R+A V +       S ++    C+   G +   +   PT+  Y  
Sbjct: 323 GSTLSVFPPEVYEVVRNAVVDKLKKGYNISPLTSSPDCFE--GKIERDLPLFPTLVLYLE 380

Query: 434 GGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFD 482
              +L    S FL   D+   FC  F  S  GLSIIG + Q+  +  ++
Sbjct: 381 STGILNDRWSIFLQRYDE--LFCLGFT-SGEGLSIIGTLAQQSYKFGYN 426


>gi|18400416|ref|NP_565559.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|20197296|gb|AAM15014.1| predicted protein [Arabidopsis thaliana]
 gi|330252412|gb|AEC07506.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 458

 Score =  148 bits (373), Expect = 8e-33,   Method: Compositional matrix adjust.
 Identities = 116/367 (31%), Positives = 175/367 (47%), Gaps = 37/367 (10%)

Query: 140 DFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQS--DPV 197
           +F  DV   +   +  + V   VG PP  Q  ++D+GS ++W+QCQPC  C       PV
Sbjct: 82  NFQVDVEQAIK--TSLFLVNFSVGQPPVPQLTIMDTGSSLLWIQCQPCKHCSSDHMIHPV 139

Query: 198 FDPADSASFSGVSCSSAVCDRLENAGC-HAGRCRYEVSYGDGSYTKGTLALETLTI---- 252
           F+PA S++F   SC    C    N  C  + +C YE  Y  G+ +KG LA E LT     
Sbjct: 140 FNPALSSTFVECSCDDRFCRYAPNGHCGSSNKCVYEQVYISGTGSKGVLAKERLTFTTPN 199

Query: 253 GRTVV-KNVAIGCGHKN-QGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTG 310
           G TVV + +A GCG++N + +     G+LGLG    SL  QLG +    FSYC+      
Sbjct: 200 GNTVVTQPIAFGCGYENGEQLESHFTGILGLGAKPTSLAVQLGSK----FSYCIGDLANK 255

Query: 311 SSG--SLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMG 368
           + G   LV G +A  +G    P +      S YY+ L G+ VG  ++ I   +F+  +  
Sbjct: 256 NYGYNQLVLGEDADILGDP-TP-IEFETENSIYYMNLEGISVGDTQLNIEPVVFK-RRGP 312

Query: 369 DDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFD-TCYNLSGFVSVRV-- 425
             GV++D+GT  T L   AY    +   +     P+       D  CY+  G VS  +  
Sbjct: 313 RTGVILDSGTLYTWLADIAYRELYNEIKSILD--PKLERFWFRDFLCYH--GRVSEELIG 368

Query: 426 -PTVSFYFSGGPVLTLPASNFLIPVDDAGT---FCFAFAPSPS------GLSIIGNIQQE 475
            P V+F+F+GG  L + A++   P+ +  T   FC +  P+          + IG + Q+
Sbjct: 369 FPVVTFHFAGGAELAMEATSMFYPLSEPNTFNVFCMSVKPTKEHGGEYKEFTAIGLMAQQ 428

Query: 476 GIQISFD 482
              I +D
Sbjct: 429 YYNIGYD 435


>gi|296090179|emb|CBI39998.3| unnamed protein product [Vitis vinifera]
          Length = 334

 Score =  148 bits (373), Expect = 8e-33,   Method: Compositional matrix adjust.
 Identities = 110/342 (32%), Positives = 150/342 (43%), Gaps = 57/342 (16%)

Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
           +GEY ++I +G+PP   Y + D+GSD++W QC PC  CYKQ +P+FDP+ S SF  VSC 
Sbjct: 21  NGEYLMKISIGTPPFDVYGIYDTGSDLMWTQCLPCLSCYKQKNPMFDPSKSTSFKEVSCE 80

Query: 213 SAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMF 272
           S  C  L+                                  T + N+  GCGH N G F
Sbjct: 81  SQQCRLLDTP--------------------------------TSILNIVFGCGHNNSGTF 108

Query: 273 -VGAAGLLGLGGGSMSLVGQLGGQ--TGGAFSYCLVSRGTGSS--GSLVFGREALPVGAA 327
                GL G GG  +SL  Q+     +G  FS CLV   T  S    ++FG EA   G+ 
Sbjct: 109 NENEMGLFGTGGRPLSLTSQIMSTLGSGRKFSQCLVPFRTDPSITSKIIFGPEAEVSGSD 168

Query: 328 WV--PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDG-VVMDTGTAVTRLP 384
            V  PLV     P++Y+V L G+ VG    P S      + M   G V +D GT  T LP
Sbjct: 169 VVSTPLVTK-DDPTYYFVTLDGISVGDKLFPFSSS----SPMATKGNVFIDAGTPPTLLP 223

Query: 385 TPAY----EAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTL 440
              Y    +  ++A   +    P          CY  +  +    P ++ +F G  V   
Sbjct: 224 RDFYNRLVQGVKEAIPMEPVQDPDLQP----QLCYRSATLID--GPILTAHFDGADVQLK 277

Query: 441 PASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFD 482
           P + F+ P +  G +CFA  P      I GN  Q    I FD
Sbjct: 278 PLNTFISPKE--GVYCFAMQPIDGDTGIFGNFVQMNFLIGFD 317


>gi|242117573|dbj|BAH80056.1| hypothetical protein [Oryza sativa Indica Group]
          Length = 469

 Score =  147 bits (372), Expect = 8e-33,   Method: Compositional matrix adjust.
 Identities = 126/381 (33%), Positives = 183/381 (48%), Gaps = 33/381 (8%)

Query: 135 KHEVQDFGTDVVSGMDQGSGEYFVRIGVGSP-PRSQYMVIDSGSDIVWVQCQPCSQCYKQ 193
           K + Q  G +  SG    +    + I VG+P  ++   ++D  S  VW QC PC+     
Sbjct: 70  KQQQQQLGGEAASG---AAPPLVINITVGTPVAQTVSGLVDITSYFVWAQCAPCAAAAGC 126

Query: 194 SDP---VFDPADSASFSGVSCSSAVCDRLENAGCHAG----------RC-RYEVSYG-DG 238
             P    F P  SA+FS + CSS +C  +    C             RC  Y ++YG   
Sbjct: 127 LPPPATAFRPNGSATFSPLPCSSDMCLPVLRETCGRAGAAANATAGARCDSYSLTYGGSA 186

Query: 239 SYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGG 298
           + T G LA +T T G T V  V  GC   + G F GA+G++G+G G++SL+ QL     G
Sbjct: 187 ANTSGYLATDTFTFGATAVPGVVFGCSDASYGDFAGASGVIGIGRGNLSLISQL---QFG 243

Query: 299 AFSYCLVSRGTGSSGS----LVFGREALPVGAAW--VPLVRNPRAPSFYYVGLSGLGVGG 352
            FSY L++      GS    + FG +A+P        PL+ +   P FYYV L+G+ V G
Sbjct: 244 KFSYQLLAPEATDDGSADSVIRFGDDAVPKTKRGRSTPLLSSTLYPDFYYVNLTGVRVDG 303

Query: 353 MRI-PISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSI- 410
            R+  I    F L   G  GV++ + T VT L   AY+  R A  ++ G LP  +G +  
Sbjct: 304 NRLDAIPAGTFDLRANGTGGVILSSTTPVTYLEQAAYDVVRAAVASRIG-LPAVNGSAAL 362

Query: 411 -FDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSII 469
             D CYN S    V+VP ++  F GG  + L A+N+    +D G  C    PS  G S++
Sbjct: 363 ELDLCYNASSMAKVKVPKLTLVFDGGADMDLSAANYFYIDNDTGLECLTMLPSQGG-SVL 421

Query: 470 GNIQQEGIQISFDGANGFVGF 490
           G + Q G  + +D   G + F
Sbjct: 422 GTLLQTGTNMIYDVDAGRLTF 442


>gi|115489316|ref|NP_001067145.1| Os12g0583300 [Oryza sativa Japonica Group]
 gi|77556903|gb|ABA99699.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113649652|dbj|BAF30164.1| Os12g0583300 [Oryza sativa Japonica Group]
 gi|125537189|gb|EAY83677.1| hypothetical protein OsI_38901 [Oryza sativa Indica Group]
          Length = 446

 Score =  147 bits (372), Expect = 9e-33,   Method: Compositional matrix adjust.
 Identities = 117/360 (32%), Positives = 172/360 (47%), Gaps = 24/360 (6%)

Query: 155 EYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQ--CYKQSDPVFDPADSASFSGVSCS 212
           +Y     +G PP+    +ID+GSD+VW QC  C +  C +Q+ P ++ + S++F+ V C+
Sbjct: 89  QYVAEYLIGDPPQRAEALIDTGSDLVWTQCSTCLRKVCARQALPYYNSSASSTFAPVPCA 148

Query: 213 SAVC---DRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGC---GH 266
           + +C   D + +    A  C     YG G    GTL  E     ++    +A GC     
Sbjct: 149 ARICAANDDIIHFCDLAAGCSVIAGYGAG-VVAGTLGTEAFAF-QSGTAELAFGCVTFTR 206

Query: 267 KNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS--RGTGSSGSLVFGREALPV 324
             QG   GA+GL+GLG G +SLV Q G      FSYCL       G++G L  G  A   
Sbjct: 207 IVQGALHGASGLIGLGRGRLSLVSQTGATK---FSYCLTPYFHNNGATGHLFVGASASLG 263

Query: 325 GAAWV---PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMG----DDGVVMDTG 377
           G   V     V+ P+   FYY+ L GL VG  R+PI   +F L ++       GV++D+G
Sbjct: 264 GHGDVMTTQFVKGPKGSPFYYLPLIGLTVGETRLPIPATVFDLREVAPGLFSGGVIIDSG 323

Query: 378 TAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGF-VSVRVPTVSFYFSGGP 436
           +  T L   AY+A      A+      A      D    ++   V   VP V F+F GG 
Sbjct: 324 SPFTSLVHDAYDALASELAARLNGSLVAPPPDADDGALCVARRDVGRVVPAVVFHFRGGA 383

Query: 437 VLTLPASNFLIPVDDAGTFCFAFAPSP-SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
            + +PA ++  PVD A       +  P    S+IGN QQ+ +++ +D ANG   F P  C
Sbjct: 384 DMAVPAESYWAPVDKAAACMAIASAGPYRRQSVIGNYQQQNMRVLYDLANGDFSFQPADC 443


>gi|20160862|dbj|BAB89801.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
          Length = 488

 Score =  147 bits (372), Expect = 9e-33,   Method: Compositional matrix adjust.
 Identities = 108/339 (31%), Positives = 160/339 (47%), Gaps = 21/339 (6%)

Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
           +G Y    G+G+PP+     +D  SD+VW  C   +         F+P  S + + V C+
Sbjct: 97  AGMYVFSYGIGTPPQQVSGALDISSDLVWTACGATAP--------FNPVRSTTVADVPCT 148

Query: 213 SAVCDRLENAGCHAG--RCRYEVSYGDGSY-TKGTLALETLTIGRTVVKNVAIGCGHKNQ 269
              C +     C AG   C Y   YG G+  T G L  E  T G T +  V  GCG KN 
Sbjct: 149 DDACQQFAPQTCGAGASECAYTYMYGGGAANTTGLLGTEAFTFGDTRIDGVVFGCGLKNV 208

Query: 270 GMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLV-FGREALPVGAAW 328
           G F G +G++GLG G++SLV QL       FSY      +  + S + FG +A P  +  
Sbjct: 209 GDFSGVSGVIGLGRGNLSLVSQLQVDR---FSYHFAPDDSVDTQSFILFGDDATPQTSHT 265

Query: 329 VP--LVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRL-TQMGDDGVVMDTGTAVTRLPT 385
           +   L+ +   PS YYV L+G+ V G  + I    F L  + G  GV +     VT L  
Sbjct: 266 LSTRLLASDANPSLYYVELAGIQVDGKDLAIPSGTFDLRNKDGSGGVFLSITDLVTVLEE 325

Query: 386 PAYEAFRDAFVAQTGNLPRASGVSI-FDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASN 444
            AY+  R A  ++ G LP  +G ++  D CY        +VP+++  F+GG V+ L   N
Sbjct: 326 AAYKPLRQAVASKIG-LPAVNGSALGLDLCYTGESLAKAKVPSMALVFAGGAVMELELGN 384

Query: 445 FLIPVDDAGTFCFAFAPSPSGL-SIIGNIQQEGIQISFD 482
           +       G  C    PS +G  S++G++ Q G  + +D
Sbjct: 385 YFYMDSTTGLACLTILPSSAGDGSVLGSLIQVGTHMMYD 423


>gi|125571687|gb|EAZ13202.1| hypothetical protein OsJ_03122 [Oryza sativa Japonica Group]
          Length = 453

 Score =  147 bits (372), Expect = 9e-33,   Method: Compositional matrix adjust.
 Identities = 122/396 (30%), Positives = 181/396 (45%), Gaps = 36/396 (9%)

Query: 112 MQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYM 171
           +QR   R++ L  R       A     Q       + + +GSG+Y +  G+G+P      
Sbjct: 55  VQRSRSRLSMLAARAVSNAGAAPGESAQ-------TPLKKGSGDYAMSFGIGTPATGLSG 107

Query: 172 VIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCH------ 225
             D+GSD++W +C  C++C  +  P + P  S+S + V+C    C  L    C       
Sbjct: 108 EADTGSDLIWTKCGACARCSPRGSPSYYPTSSSSAAFVACGDRTCGELPRPLCSNVAGGG 167

Query: 226 --AGRCRYEVSYGDGS----YTKGTLALETLTIG--RTVVKNVAIGCGHKNQGMFVGAAG 277
             +G C Y  +YG+      YT+G L  ET T G        +A GC  +++G F   +G
Sbjct: 168 SGSGNCSYHYAYGNARDTHHYTEGILMTETFTFGDDAAAFPGIAFGCTLRSEGGFGTGSG 227

Query: 278 LLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGA-----AWVPLV 332
           L+GLG G +SLV QL  +   AF Y L S  +  S  + FG  A   G         PL+
Sbjct: 228 LVGLGRGKLSLVTQLNVE---AFGYRLSSDLSAPS-PISFGSLADVTGGNGDSFMSTPLL 283

Query: 333 RNP--RAPSFYYVGLSGLGVGGMRIPISEDLFRLTQ-MGDDGVVMDTGTAVTRLPTPAYE 389
            NP  +   FYYVGL+G+ VGG  + I    F   +  G  GV+ D+GT +T LP PAY 
Sbjct: 284 TNPVVQDLPFYYVGLTGISVGGKLVQIPSGTFSFDRSTGAGGVIFDSGTTLTMLPDPAYT 343

Query: 390 AFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPV 449
             RD  ++Q G        +  D      G  +   P++  +F GG  + L   N+L  +
Sbjct: 344 LVRDELLSQMGFQKPPPAANDDDLICFTGGSSTTTFPSMVLHFDGGADMDLSTENYLPQM 403

Query: 450 ---DDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFD 482
              +     C++   S   L+IIGNI Q    + FD
Sbjct: 404 QGQNGETARCWSVVKSSQALTIIGNIMQMDFHVVFD 439


>gi|125527370|gb|EAY75484.1| hypothetical protein OsI_03384 [Oryza sativa Indica Group]
          Length = 453

 Score =  147 bits (372), Expect = 9e-33,   Method: Compositional matrix adjust.
 Identities = 122/396 (30%), Positives = 181/396 (45%), Gaps = 36/396 (9%)

Query: 112 MQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYM 171
           +QR   R++ L  R       A     Q       + + +GSG+Y +  G+G+P      
Sbjct: 55  VQRSRSRLSMLAARAVSNAGAAPGESAQ-------TPLKKGSGDYAMSFGIGTPATGLSG 107

Query: 172 VIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCH------ 225
             D+GSD++W +C  C++C  +  P + P  S+S + V+C    C  L    C       
Sbjct: 108 EADTGSDLIWTKCGACARCSPRGSPSYYPTSSSSAAFVACGDRTCGELPRPLCSNVAGGG 167

Query: 226 --AGRCRYEVSYGDGS----YTKGTLALETLTIG--RTVVKNVAIGCGHKNQGMFVGAAG 277
             +G C Y  +YG+      YT+G L  ET T G        +A GC  +++G F   +G
Sbjct: 168 SGSGNCSYHYAYGNARDTHHYTEGILMTETFTFGDDAAAFPGIAFGCTLRSEGGFGTGSG 227

Query: 278 LLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGA-----AWVPLV 332
           L+GLG G +SLV QL  +   AF Y L S  +  S  + FG  A   G         PL+
Sbjct: 228 LVGLGRGKLSLVTQLNVE---AFGYRLSSDLSAPS-PISFGSLADVTGGNGDSFMSTPLL 283

Query: 333 RNP--RAPSFYYVGLSGLGVGGMRIPISEDLFRLTQ-MGDDGVVMDTGTAVTRLPTPAYE 389
            NP  +   FYYVGL+G+ VGG  + I    F   +  G  GV+ D+GT +T LP PAY 
Sbjct: 284 TNPVVQDLPFYYVGLTGISVGGKLVQIPSGTFSFDRSTGAGGVIFDSGTTLTMLPDPAYT 343

Query: 390 AFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPV 449
             RD  ++Q G        +  D      G  +   P++  +F GG  + L   N+L  +
Sbjct: 344 LVRDELLSQMGFQKPPPAANDDDLICFTGGSSTTTFPSMVLHFDGGADMDLSTENYLPQM 403

Query: 450 ---DDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFD 482
              +     C++   S   L+IIGNI Q    + FD
Sbjct: 404 QGQNGETARCWSVVKSSQALTIIGNIMQMDFHVVFD 439


>gi|125532796|gb|EAY79361.1| hypothetical protein OsI_34489 [Oryza sativa Indica Group]
          Length = 405

 Score =  147 bits (371), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 113/365 (30%), Positives = 167/365 (45%), Gaps = 41/365 (11%)

Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSS 213
           G Y     +G+PP+    V+D   ++VW QC PC  C++Q  P+FDP  S++F G+ C S
Sbjct: 55  GLYVANFTIGTPPQPVSAVVDLTGELVWTQCTPCQPCFEQDLPLFDPTKSSTFRGLPCGS 114

Query: 214 AVCDRLENA--GCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGC---GHKN 268
            +C+ +  +   C +  C YE     G  T G    +T  IG    + +  GC     K 
Sbjct: 115 HLCESIPESSRNCTSDVCIYEAPTKAGD-TGGMAGTDTFAIG-AAKETLGFGCVVMTDKR 172

Query: 269 QGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGA-- 326
                G +G++GLG    SLV Q+      AFSYCL  +   SSG+L  G  A  +    
Sbjct: 173 LKTIGGPSGIVGLGRTPWSLVTQMNVT---AFSYCLAGK---SSGALFLGATAKQLAGGK 226

Query: 327 -AWVPLVRNPRAPS-------FYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGT 378
            +  P V    A S       +Y V L+G+  GG  +       +        V++DT +
Sbjct: 227 NSSTPFVIKTSAGSSDNGSNPYYMVKLAGIKAGGAPL-------QAASSSGSTVLLDTVS 279

Query: 379 AVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVL 438
             + L   AY+A + A  A  G  P AS    +D C+  S  V+   P + F F GG  L
Sbjct: 280 RASYLADGAYKALKKALTAAVGVQPVASPPKPYDLCF--SKAVAGDAPELVFTFDGGAAL 337

Query: 439 TLPASNFLIPVDDAGTFCFAFAPSPS--------GLSIIGNIQQEGIQISFDGANGFVGF 490
           T+P +N+L+   + GT C     S S        G SI+G++QQE + + FD     + F
Sbjct: 338 TVPPANYLLASGN-GTVCLTIGSSASLNLTGELEGASILGSLQQENVHVLFDLKEETLSF 396

Query: 491 GPNVC 495
            P  C
Sbjct: 397 KPADC 401


>gi|359494621|ref|XP_002265771.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 449

 Score =  147 bits (371), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 115/376 (30%), Positives = 172/376 (45%), Gaps = 34/376 (9%)

Query: 150 DQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQ------PCSQCYKQS---DPVFDP 200
           D G G+Y V   VG+P +   +V D+GSD+ W+ C+       CS    +      VF  
Sbjct: 77  DYGIGQYSVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHKRVFHA 136

Query: 201 ADSASFSGVSCSSAVCD-------RLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTI- 252
             S+SF  + C + +C         L N       C Y+  Y DGS   G  A ET+T+ 
Sbjct: 137 NLSSSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTALGFFANETVTVE 196

Query: 253 ---GRTV-VKNVAIGCGHKNQGM-FVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR 307
              GR + + NV IGC    QG  F  A G++GLG    S   +   + GG FSYCLV  
Sbjct: 197 LKEGRKMKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSFAIKAAEKFGGKFSYCLVDH 256

Query: 308 GTGS--SGSLVFG----REALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDL 361
            +    S  L FG    +EAL     +  LV      SFY V + G+ +GG  + I  ++
Sbjct: 257 LSHKNVSNYLTFGSSRSKEALLNNMTYTELVLG-MVNSFYAVNMMGISIGGAMLKIPSEV 315

Query: 362 FRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRAS-GVSIFDTCYNLSGF 420
           + +   G  G ++D+G+++T L  PAY+    A         +    +   + C+N +GF
Sbjct: 316 WDVK--GAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEMDIGPLEYCFNSTGF 373

Query: 421 VSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAP-SPSGLSIIGNIQQEGIQI 479
               VP + F+F+ G     P  +++I   D G  C  F   +  G S++GNI Q+    
Sbjct: 374 EESLVPRLVFHFADGAEFEPPVKSYVISAAD-GVRCLGFVSVAWPGTSVVGNIMQQNHLW 432

Query: 480 SFDGANGFVGFGPNVC 495
            FD     +GF P+ C
Sbjct: 433 EFDLGLKKLGFAPSSC 448


>gi|226492150|ref|NP_001146362.1| hypothetical protein precursor [Zea mays]
 gi|219886805|gb|ACL53777.1| unknown [Zea mays]
 gi|414878074|tpg|DAA55205.1| TPA: hypothetical protein ZEAMMB73_415404 [Zea mays]
          Length = 440

 Score =  147 bits (370), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 132/452 (29%), Positives = 198/452 (43%), Gaps = 74/452 (16%)

Query: 80  NLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRL-SGGGADAAKHEV 138
            LEL H D               +  ++   R++R  +R     RRL S GG  A  H  
Sbjct: 24  RLELTHVDA--------------KEHYTVEERVRRATERTH---RRLASMGGVTAPIH-- 64

Query: 139 QDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC-SQCYKQSDPV 197
             +G         G  +Y     +G PP+    +ID+GS+++W QC  C   C++Q+ P 
Sbjct: 65  --WG---------GQSQYIAEYLIGDPPQRAEAIIDTGSNLIWTQCSRCRPTCFRQNLPY 113

Query: 198 FDPADSASFSGVSCSSAVCDRLENAGCHAGR--CRYEVSYGDGSYTKGTLALETLTIGRT 255
           +DP+ S +   V C+ A C       C +    C     YG G+   GTLA E LT    
Sbjct: 114 YDPSRSRAARAVGCNDAACALGSETQCLSDNKTCAVVTGYGAGN-IAGTLATENLTFQSE 172

Query: 256 VVKNVAIGC---GHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS------ 306
            V ++  GC      + G   GA+G++GLG G +SL  QLG      FSYCL        
Sbjct: 173 TV-SLVFGCIVVTKLSPGSLNGASGIIGLGRGKLSLPSQLGDTR---FSYCLTPYFEDTI 228

Query: 307 ----RGTGSSGSLVFGR-EALPVGAAWVPLVRNPRA---PSFYYVGLSGLGVGGMRIPIS 358
                  G+S  L+ G   + PV    VP VR+P      +FYY+ L+G+  G +++ + 
Sbjct: 229 EPSHMVVGASAGLINGSASSTPVTT--VPFVRSPSDDPFSTFYYLPLTGITAGKVKLAVP 286

Query: 359 EDLFRLTQMGD---DGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGN--LPRASGVSIFDT 413
              F L Q+      G  +D+G  +T L   AY+A R     Q G   +   +G + FD 
Sbjct: 287 SAAFDLRQVAPGMWTGTFIDSGAPLTSLVDVAYQALRAELARQLGAALVQPLAGTTGFDL 346

Query: 414 CYNLSGFVSVRVPTVSFYFSG----GPVLTLPASNFLIPVDDAGTFCFAFAP------SP 463
           C  L     + VP +  +F G    G  L +P +N+  PVD A      F+         
Sbjct: 347 CVALKDAERL-VPPLVLHFGGGSGTGTDLVVPPANYWAPVDSATACMVVFSSVDRKSLPM 405

Query: 464 SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           +  ++IGN  Q+ + + +D A G + F P  C
Sbjct: 406 NETTVIGNYMQQNMHVLYDLAGGVLSFQPADC 437


>gi|3641868|emb|CAA09458.1| hypothetical protein [Cicer arietinum]
          Length = 110

 Score =  147 bits (370), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 71/109 (65%), Positives = 81/109 (74%)

Query: 387 AYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFL 446
           AYE+ RDAF   T NL  A GV+IFDTCY+LS   SVRVPTVSF+F    V  LPA N+L
Sbjct: 2   AYESVRDAFKRLTQNLRSAEGVAIFDTCYDLSSLRSVRVPTVSFHFGNDRVWDLPAKNYL 61

Query: 447 IPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           IPVD  GTFCFAFAP+ S LSIIGN+QQ+G ++SFD AN  VGF PN C
Sbjct: 62  IPVDSDGTFCFAFAPTSSSLSIIGNVQQQGTRVSFDIANSLVGFSPNKC 110


>gi|168064205|ref|XP_001784055.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162664441|gb|EDQ51161.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 459

 Score =  146 bits (369), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 119/377 (31%), Positives = 182/377 (48%), Gaps = 52/377 (13%)

Query: 146 VSGMDQG--SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVF 198
           +SG D    +G Y+ RI +G+PP+  Y+ +D+GSD+ WV C PC+ C + S+      +F
Sbjct: 36  ISGDDDTFTTGLYYTRIYLGTPPQQFYVHVDTGSDVAWVNCVPCTNCKRASNVALPISIF 95

Query: 199 DPADSASFSGVSCSSAVCDRLENAGC--HAGRCRYEVSYGDGSYTKGTLALETLTIGRTV 256
           DP  S S + +SC+   C    N+ C  ++  C Y   YGDGS T G L  + L+  +  
Sbjct: 96  DPEKSTSKTSISCTDEECYLASNSKCSFNSMSCPYSTLYGDGSSTAGYLINDVLSFNQVP 155

Query: 257 VKN---------VAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQ--TGGAFSYCLV 305
             N         +  GCG    G ++   GL+G G   +SL  QL  Q  +   F++CL 
Sbjct: 156 SGNSTATSGTARLTFGCGSNQTGTWL-TDGLVGFGQAEVSLPSQLSKQNVSVNIFAHCLQ 214

Query: 306 SRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLT 365
               G SG+LV G    P G  + P+V  P+  S Y V L  +GV G  +  +   F L+
Sbjct: 215 GDNKG-SGTLVIGHIREP-GLVYTPIV--PKQ-SHYNVELLNIGVSGTNV-TTPTAFDLS 268

Query: 366 QMGDDGVVMDTGTAVTRLPTPAYEAF----RDAFVAQTGNLPRASGVSIFDTCYNLSGFV 421
             G  GV+MD+GT +T L  PAY+ F    RD    ++G LP A     F     + G+ 
Sbjct: 269 NSG--GVIMDSGTTLTYLVQPAYDQFQAKVRDCM--RSGVLPVA-----FQFFCTIEGY- 318

Query: 422 SVRVPTVSFYFSGGPVLTLPASNFL---IPVDDAGTFCFAFAPSPS-----GLSIIGNIQ 473
               P V+ YF+GG  + L  S++L   +       +CF++  S S       +I G+  
Sbjct: 319 ---FPNVTLYFAGGAAMLLSPSSYLYKEMLTTGLSAYCFSWLESTSVYGYLSYTIFGDNV 375

Query: 474 QEGIQISFDGANGFVGF 490
            +   + +D  N  +G+
Sbjct: 376 LKDQLVVYDNVNNRIGW 392


>gi|125555051|gb|EAZ00657.1| hypothetical protein OsI_22678 [Oryza sativa Indica Group]
          Length = 435

 Score =  146 bits (369), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 112/371 (30%), Positives = 166/371 (44%), Gaps = 46/371 (12%)

Query: 152 GSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC---SQCYKQSDPVFDPADSASFSG 208
           G+ EY V  G G+P +   +  D+   +  ++C+PC   + C    DP F+P+ S+SF+ 
Sbjct: 84  GALEYRVLAGYGAPAQRFPVAFDTNFGVSVLRCKPCVGGAPC----DPAFEPSRSSSFAA 139

Query: 209 VSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVV-KNVAIGCGH- 266
           + C S  C       C    C + + +G+ +   GTL  +TLT+  +        GC   
Sbjct: 140 IPCGSPEC----AVECTGASCPFTIQFGNVTVANGTLVRDTLTLPPSATFAGFTFGCIEV 195

Query: 267 -KNQGMFVGAAGLLGLGGGSMSLVGQL----GGQTGGAFSYCLVSRGTGSSGSLVFGREA 321
             +   F GA GL+ L   S SL  ++       +  AFSYCL S    SS      R  
Sbjct: 196 GADADTFDGAVGLIDLSRSSHSLASRVISNGATTSAAAFSYCLPSSSATSS------RGF 249

Query: 322 LPVGAA----------WVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDG 371
           L +GA+          + P+  NP  P+ Y+V L G+ VGG  +P+   +F        G
Sbjct: 250 LSIGASRPEYSGGDIKYAPMSSNPNHPNSYFVELVGISVGGEDLPVPPAVF-----AAHG 304

Query: 372 VVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFY 431
            +++  T  T L   AY A RDAF       P A    + DTCYNL+G  S+ VPTV+  
Sbjct: 305 TLLEAATEFTFLAPAAYAALRDAFRRDMAPYPAAPPFRVLDTCYNLTGLASLAVPTVALR 364

Query: 432 FSGGPVLTLPASNFLIPVDDAGTFC-------FAFAPSPSGLSIIGNIQQEGIQISFDGA 484
           F+GG  L L     +   D +  F         A       +S+IG + Q   ++ +D  
Sbjct: 365 FAGGTELELDVRQMMYFADPSSVFSSVACLAFAAAPLPAFPVSVIGTLAQRSTEVVYDLR 424

Query: 485 NGFVGFGPNVC 495
            G VGF P  C
Sbjct: 425 GGRVGFIPGRC 435


>gi|22165126|gb|AAM93742.1| putative chloroplast nucleoid DNA binding protein [Oryza sativa
           Japonica Group]
 gi|31433307|gb|AAP54836.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
 gi|125575547|gb|EAZ16831.1| hypothetical protein OsJ_32302 [Oryza sativa Japonica Group]
          Length = 405

 Score =  146 bits (368), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 112/365 (30%), Positives = 167/365 (45%), Gaps = 41/365 (11%)

Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSS 213
           G Y     +G+PP+    V+D   ++VW QC PC  C++Q  P+FDP  S++F G+ C S
Sbjct: 55  GLYVANFTIGTPPQPVSAVVDLTGELVWTQCTPCQPCFEQDLPLFDPTKSSTFRGLPCGS 114

Query: 214 AVCDRLENA--GCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGC---GHKN 268
            +C+ +  +   C +  C YE     G  T G    +T  IG    + +  GC     K 
Sbjct: 115 HLCESIPESSRNCTSDVCIYEAPTKAGD-TGGKAGTDTFAIG-AAKETLGFGCVVMTDKR 172

Query: 269 QGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGA-- 326
                G +G++GLG    SLV Q+      AFSYCL  +   SSG+L  G  A  +    
Sbjct: 173 LKTIGGPSGIVGLGRTPWSLVTQMNVT---AFSYCLAGK---SSGALFLGATAKQLAGGK 226

Query: 327 -AWVPLVRNPRAPS-------FYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGT 378
            +  P V    A S       +Y V L+G+  GG  +       +        V++DT +
Sbjct: 227 NSSTPFVIKTSAGSSDNGSNPYYMVKLAGIKTGGAPL-------QAASSSGSTVLLDTVS 279

Query: 379 AVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVL 438
             + L   AY+A + A  A  G  P AS    +D C+  +  V+   P + F F GG  L
Sbjct: 280 RASYLADGAYKALKKALTAAVGVQPVASPPKPYDLCFPKA--VAGDAPELVFTFDGGAAL 337

Query: 439 TLPASNFLIPVDDAGTFCFAFAPSPS--------GLSIIGNIQQEGIQISFDGANGFVGF 490
           T+P +N+L+   + GT C     S S        G SI+G++QQE + + FD     + F
Sbjct: 338 TVPPANYLLASGN-GTVCLTIGSSASLNLTGELEGASILGSLQQENVHVLFDLKEETLSF 396

Query: 491 GPNVC 495
            P  C
Sbjct: 397 KPADC 401


>gi|297736090|emb|CBI24128.3| unnamed protein product [Vitis vinifera]
          Length = 378

 Score =  145 bits (367), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 115/376 (30%), Positives = 172/376 (45%), Gaps = 34/376 (9%)

Query: 150 DQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQ------PCSQCYKQS---DPVFDP 200
           D G G+Y V   VG+P +   +V D+GSD+ W+ C+       CS    +      VF  
Sbjct: 6   DYGIGQYSVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHKRVFHA 65

Query: 201 ADSASFSGVSCSSAVCD-------RLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTI- 252
             S+SF  + C + +C         L N       C Y+  Y DGS   G  A ET+T+ 
Sbjct: 66  NLSSSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTALGFFANETVTVE 125

Query: 253 ---GRTV-VKNVAIGCGHKNQGM-FVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR 307
              GR + + NV IGC    QG  F  A G++GLG    S   +   + GG FSYCLV  
Sbjct: 126 LKEGRKMKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSFAIKAAEKFGGKFSYCLVDH 185

Query: 308 GTGS--SGSLVFG----REALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDL 361
            +    S  L FG    +EAL     +  LV      SFY V + G+ +GG  + I  ++
Sbjct: 186 LSHKNVSNYLTFGSSRSKEALLNNMTYTELVLG-MVNSFYAVNMMGISIGGAMLKIPSEV 244

Query: 362 FRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRAS-GVSIFDTCYNLSGF 420
           + +   G  G ++D+G+++T L  PAY+    A         +    +   + C+N +GF
Sbjct: 245 WDVK--GAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEMDIGPLEYCFNSTGF 302

Query: 421 VSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAP-SPSGLSIIGNIQQEGIQI 479
               VP + F+F+ G     P  +++I   D G  C  F   +  G S++GNI Q+    
Sbjct: 303 EESLVPRLVFHFADGAEFEPPVKSYVISAAD-GVRCLGFVSVAWPGTSVVGNIMQQNHLW 361

Query: 480 SFDGANGFVGFGPNVC 495
            FD     +GF P+ C
Sbjct: 362 EFDLGLKKLGFAPSSC 377


>gi|110739922|dbj|BAF01866.1| chloroplast nucleoid DNA binding protein like [Arabidopsis
           thaliana]
          Length = 142

 Score =  145 bits (367), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 74/139 (53%), Positives = 97/139 (69%), Gaps = 1/139 (0%)

Query: 357 ISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYN 416
           ++  LF+L Q+G+ GV++D+GT+VTRL  PAY A RDAF      L RA   S+FDTC++
Sbjct: 4   VTASLFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPDFSLFDTCFD 63

Query: 417 LSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEG 476
           LS    V+VPTV  +F G  V +LPA+N+LIPVD  G FCFAFA +  GLSIIGNIQQ+G
Sbjct: 64  LSNMNEVKVPTVVLHFRGADV-SLPATNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQG 122

Query: 477 IQISFDGANGFVGFGPNVC 495
            ++ +D A+  VGF P  C
Sbjct: 123 FRVVYDLASSRVGFAPGGC 141


>gi|300681506|emb|CBH32600.1| pepsin A, putative, expressed [Triticum aestivum]
          Length = 477

 Score =  145 bits (367), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 118/395 (29%), Positives = 177/395 (44%), Gaps = 49/395 (12%)

Query: 145 VVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDP-------- 196
           + SG   G G+YFVR  VG+P +   +V D+GSD+ WV+C+  +       P        
Sbjct: 86  LTSGAYTGIGQYFVRFRVGTPAQPFLLVADTGSDLTWVKCRRPASANSSLSPADSGPGPG 145

Query: 197 -VFDPADSASFSGVSCSSAVCDR-----LENAGCHAGRCRYEVSYGDGSYTKGTLALETL 250
             F P DS +++ +SC+S  C +     L         C Y+  Y DGS  +GT+  E+ 
Sbjct: 146 RAFRPEDSRTWAPISCASDTCTKSLPFSLATCPTPGSPCAYDYRYKDGSAARGTVGTESA 205

Query: 251 TIG-------RTVVKNVAIGCGHKNQG-MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSY 302
           TI        +  +K + +GC     G  F  + G+L LG   +S       + GG FSY
Sbjct: 206 TIALSGREERKAKLKGLVLGCSSSYTGPSFEASDGVLSLGYSGISFASHAASRFGGRFSY 265

Query: 303 CLVSRGTGSSGS--LVFG--------------REALPVGAAWVPLVRNPRAPSFYYVGLS 346
           CLV   +  + +  L FG                A    A   PL+ + R   FY V L 
Sbjct: 266 CLVDHLSPRNATSYLTFGPNPAVSSPRASPSSCAAAAPRARQTPLLLDRRMRPFYDVSLK 325

Query: 347 GLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRAS 406
            + V G  + I   ++ +   G  GV++D+GT++T L  PAY A   A       LPR +
Sbjct: 326 AISVAGEFLKIPRAVWDVEAGG--GVILDSGTSLTVLAKPAYRAVVAALSKGLAGLPRVT 383

Query: 407 GVSIFDTCYNLSGF----VSVRVPTVSFYFSGGPVLTLPASNFLIPVDDA-GTFCFAFAP 461
            +  F+ CYN +        V VP ++ +F+G   L  P  +++I  D A G  C     
Sbjct: 384 -MDPFEYCYNWTSPSGKDADVAVPKMAVHFAGAARLEPPGKSYVI--DAAPGVKCIGLQE 440

Query: 462 SP-SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
            P  G+S+IGNI Q+     FD  N  + F  + C
Sbjct: 441 GPWPGISVIGNILQQEHLWEFDIKNRRLKFQRSRC 475


>gi|54290724|dbj|BAD62394.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
          Length = 523

 Score =  145 bits (367), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 111/371 (29%), Positives = 165/371 (44%), Gaps = 46/371 (12%)

Query: 152 GSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC---SQCYKQSDPVFDPADSASFSG 208
           G+ EY V  G G+P +   +  D+   +  ++C+PC   + C    DP F+P+ S+SF+ 
Sbjct: 172 GALEYRVLAGYGAPAQRFPVAFDTNFGVSVLRCKPCVGGAPC----DPAFEPSRSSSFAA 227

Query: 209 VSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVV-KNVAIGCGH- 266
           + C S  C       C    C + + +G+ +   GTL  +TLT+  +        GC   
Sbjct: 228 IPCGSPEC----AVECTGASCPFTIQFGNVTVANGTLVRDTLTLPPSATFAGFTFGCIEV 283

Query: 267 -KNQGMFVGAAGLLGLGGGSMSLVGQL----GGQTGGAFSYCLVSRGTGSSGSLVFGREA 321
             +   F GA GL+ L   S SL  ++       +  AFSYCL S    SS      R  
Sbjct: 284 GADADTFDGAVGLIDLSRSSHSLASRVISNGATTSAAAFSYCLPSSSATSS------RGF 337

Query: 322 LPVGAA----------WVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDG 371
           L +GA+          + P+  NP  P+ Y+V L G+ VGG  +P+   +F        G
Sbjct: 338 LSIGASRPEYSGGDIKYAPMSSNPNHPNSYFVDLVGISVGGEDLPVPPAVF-----AAHG 392

Query: 372 VVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFY 431
            +++  T  T L   AY A RDAF       P A    + DTCYNL+G  S+ VP V+  
Sbjct: 393 TLLEAATEFTFLAPAAYAALRDAFRKDMAPYPAAPPFRVLDTCYNLTGLASLAVPAVALR 452

Query: 432 FSGGPVLTLPASNFLIPVDDAGTFC-------FAFAPSPSGLSIIGNIQQEGIQISFDGA 484
           F+GG  L L     +   D +  F         A       +S+IG + Q   ++ +D  
Sbjct: 453 FAGGTELELDVRQMMYFADPSSVFSSVACLAFAAAPLPAFPVSVIGTLAQRSTEVVYDLR 512

Query: 485 NGFVGFGPNVC 495
            G VGF P  C
Sbjct: 513 GGRVGFIPGRC 523


>gi|449457263|ref|XP_004146368.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 469

 Score =  145 bits (366), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 124/450 (27%), Positives = 201/450 (44%), Gaps = 58/450 (12%)

Query: 67  ISSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRL 126
           ISS+  ++  +R   +L+HR+         N     R +    + ++R    + + ++ L
Sbjct: 26  ISSTLITTKPSRLATKLIHRNSYLHPLYDQNETVEDRSKREQTSSIER-FDFLESKIKEL 84

Query: 127 SGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQP 186
              G +A    +           ++GSG + V + +GSPP +Q +V+D+GS ++WVQC P
Sbjct: 85  KSVGNEARSSLIP---------FNRGSG-FLVNLSIGSPPVTQLVVVDTGSSLLWVQCLP 134

Query: 187 CSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHA-GRCRYEVSYGDGSYTKGTL 245
           C  C++QS   FDP  S SF  + C     + +    C+   +  Y++ Y  G  ++G L
Sbjct: 135 CINCFQQSTSWFDPLKSVSFKTLGCGFPGYNYINGYKCNRFNQAEYKLRYLGGDSSQGIL 194

Query: 246 ALETL------------------TIGRTVVKNVAIGCGHKNQGMFVGAA--GLLGLGG-G 284
           A E+L                   I +    N+  GCGH N       A  G+ GLG   
Sbjct: 195 AKESLLFETLDEGRVFQYNAISTQISKIKKSNITFGCGHMNIKTNNDDAYNGVFGLGAYP 254

Query: 285 SMSLVGQLGGQTGGAFSYCL--VSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSF-- 340
            +++  QLG +    FSYC+  ++    +   LV G+       +++     P    F  
Sbjct: 255 HITMATQLGNK----FSYCIGDINNPLYTHNHLVLGQ------GSYIEGDSTPLQIHFGH 304

Query: 341 YYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFV-AQT 399
           YYV L  + VG   + I  + F+++  G  GV++D+G   T+L    +E   D  V    
Sbjct: 305 YYVTLQSISVGSKTLKIDPNAFKISSDGSGGVLIDSGMTYTKLANGGFELLYDEIVDLMK 364

Query: 400 GNLPRASGVSIFD-TCYNLSGFVS---VRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTF 455
           G L R      F+  C+   G VS   V  P V+F+F+GG  L L + + L        F
Sbjct: 365 GLLERIPTQRKFEGLCF--KGVVSRDLVGFPAVTFHFAGGADLVLESGS-LFRQHGGDRF 421

Query: 456 CFAFAPSPS---GLSIIGNIQQEGIQISFD 482
           C A  PS S    LS+IG + Q+   + FD
Sbjct: 422 CLAILPSNSELLNLSVIGILAQQNYNVGFD 451


>gi|125528511|gb|EAY76625.1| hypothetical protein OsI_04577 [Oryza sativa Indica Group]
          Length = 492

 Score =  145 bits (366), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 107/343 (31%), Positives = 160/343 (46%), Gaps = 25/343 (7%)

Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
           +G Y    G+G+PP+     +D  SD+VW  C   +         F+P  S + + V C+
Sbjct: 97  AGMYVFSYGIGTPPQQVSGALDISSDLVWTACGATAP--------FNPVRSTTVADVPCT 148

Query: 213 SAVCDRLENAGCHAG------RCRYEVSYGDGSY-TKGTLALETLTIGRTVVKNVAIGCG 265
              C +     C AG       C Y   YG G+  T G L  E  T G T +  V  GCG
Sbjct: 149 DDACQQFAPQTCGAGAGAGSSECAYTYMYGGGAANTTGLLGTEAFTFGDTRIDGVVFGCG 208

Query: 266 HKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLV-FGREALPV 324
            +N G F G +G++GLG G++SLV QL       FSY      +  + S + FG +A P 
Sbjct: 209 LQNVGDFSGVSGVIGLGRGNLSLVSQLQVDR---FSYHFAPDDSVDTQSFILFGDDATPQ 265

Query: 325 GAAWVP--LVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRL-TQMGDDGVVMDTGTAVT 381
            +  +   L+ +   PS YYV L+G+ V G  + I    F L  + G  GV +     VT
Sbjct: 266 TSHTLSTRLLASDANPSLYYVELAGIQVDGKDLAIPSGTFDLRNKDGSGGVFLSITDLVT 325

Query: 382 RLPTPAYEAFRDAFVAQTGNLPRASGVSI-FDTCYNLSGFVSVRVPTVSFYFSGGPVLTL 440
            L   AY+  R A  ++ G LP  +G ++  D CY        +VP+++  F+GG V+ L
Sbjct: 326 VLEEAAYKPLRQAVASKIG-LPAVNGSALGLDLCYTGESLAKAKVPSMALVFAGGAVMEL 384

Query: 441 PASNFLIPVDDAGTFCFAFAPSPSGL-SIIGNIQQEGIQISFD 482
              N+       G  C    PS +G  S++G++ Q G  + +D
Sbjct: 385 ELGNYFYMDSTTGLACLTILPSSAGDGSVLGSLIQVGTHMMYD 427


>gi|125596976|gb|EAZ36756.1| hypothetical protein OsJ_21092 [Oryza sativa Japonica Group]
          Length = 435

 Score =  145 bits (365), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 111/371 (29%), Positives = 165/371 (44%), Gaps = 46/371 (12%)

Query: 152 GSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC---SQCYKQSDPVFDPADSASFSG 208
           G+ EY V  G G+P +   +  D+   +  ++C+PC   + C    DP F+P+ S+SF+ 
Sbjct: 84  GALEYRVLAGYGAPAQRFPVAFDTNFGVSVLRCKPCVGGAPC----DPAFEPSRSSSFAA 139

Query: 209 VSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVV-KNVAIGCGH- 266
           + C S  C       C    C + + +G+ +   GTL  +TLT+  +        GC   
Sbjct: 140 IPCGSPEC----AVECTGASCPFTIQFGNVTVANGTLVRDTLTLPPSATFAGFTFGCIEV 195

Query: 267 -KNQGMFVGAAGLLGLGGGSMSLVGQL----GGQTGGAFSYCLVSRGTGSSGSLVFGREA 321
             +   F GA GL+ L   S SL  ++       +  AFSYCL S    SS      R  
Sbjct: 196 GADADTFDGAVGLIDLSRSSHSLASRVISNGATTSAAAFSYCLPSSSATSS------RGF 249

Query: 322 LPVGAA----------WVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDG 371
           L +GA+          + P+  NP  P+ Y+V L G+ VGG  +P+   +F        G
Sbjct: 250 LSIGASRPEYSGGDIKYAPMSSNPNHPNSYFVDLVGISVGGEDLPVPPAVF-----AAHG 304

Query: 372 VVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFY 431
            +++  T  T L   AY A RDAF       P A    + DTCYNL+G  S+ VP V+  
Sbjct: 305 TLLEAATEFTFLAPAAYAALRDAFRKDMAPYPAAPPFRVLDTCYNLTGLASLAVPAVALR 364

Query: 432 FSGGPVLTLPASNFLIPVDDAGTFC-------FAFAPSPSGLSIIGNIQQEGIQISFDGA 484
           F+GG  L L     +   D +  F         A       +S+IG + Q   ++ +D  
Sbjct: 365 FAGGTELELDVRQMMYFADPSSVFSSVACLAFAAAPLPAFPVSVIGTLAQRSTEVVYDLR 424

Query: 485 NGFVGFGPNVC 495
            G VGF P  C
Sbjct: 425 GGRVGFIPGRC 435


>gi|255554715|ref|XP_002518395.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
 gi|223542240|gb|EEF43782.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
          Length = 489

 Score =  144 bits (364), Expect = 9e-32,   Method: Compositional matrix adjust.
 Identities = 112/384 (29%), Positives = 179/384 (46%), Gaps = 45/384 (11%)

Query: 145 VVSGMDQGSGEYFVRIGVGSP-PRSQYMVIDSGSDIVWVQCQPCSQCYKQSDP----VFD 199
           + SG D G  +YFV I +G+P P+   +V D+GSD+ W+ C+   +   + +P    VF 
Sbjct: 108 IHSGADSGQSQYFVSIRIGTPRPQKFILVTDTGSDLTWMNCEYWCKSCPKPNPHPGRVFR 167

Query: 200 PADSASFSGVSCSSAVC-----DRLENAGCHA--GRCRYEVSYGDGSYTKGTLALETLTI 252
             DS+SF  + CSS  C     D      C      C ++  Y +G    G  A ET+T+
Sbjct: 168 ANDSSSFRTIPCSSDDCKIELQDYFSLTECPNPNAPCLFDYRYLNGPRAIGVFANETVTV 227

Query: 253 G-----RTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR 307
           G     +  + +V IGC            G++GLG    SL  +L    G  FSYCLV  
Sbjct: 228 GLNDHKKIRLFDVLIGCTESFNETNGFPDGVMGLGYRKHSLALRLAEIFGNKFSYCLVDH 287

Query: 308 GTGSSGS--LVFGREALPVGAAWVPLVRNPRAP----------SFYYVGLSGLGVGGMRI 355
            + S+    L FG          +P ++ P+            +FY V +SG+ VGG  +
Sbjct: 288 LSSSNHKNFLSFGD---------IPEMKLPKMQHTELLLGYINAFYPVNVSGISVGGSML 338

Query: 356 PISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDT-- 413
            IS D++ +T +G  G+++D+GT++T L   AY+   DA         +   + + +   
Sbjct: 339 SISSDIWNVTGVG--GMIVDSGTSLTMLAGEAYDKVVDALKPIFDKHKKVVPIELPELNN 396

Query: 414 -CYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSP-SGLSIIGN 471
            C+   GF    VP +  +F+ G +   P  +++I V + G  C     +   G SI+GN
Sbjct: 397 FCFEDKGFDRAAVPRLLIHFADGAIFKPPVKSYIIDVAE-GIKCLGIIKADFPGSSILGN 455

Query: 472 IQQEGIQISFDGANGFVGFGPNVC 495
           + Q+     +D   G +GFGP+ C
Sbjct: 456 VMQQNHLWEYDLGRGKLGFGPSSC 479


>gi|212722026|ref|NP_001131674.1| uncharacterized protein LOC100193034 precursor [Zea mays]
 gi|194692214|gb|ACF80191.1| unknown [Zea mays]
 gi|413946454|gb|AFW79103.1| hypothetical protein ZEAMMB73_752316 [Zea mays]
          Length = 441

 Score =  144 bits (363), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 127/414 (30%), Positives = 192/414 (46%), Gaps = 36/414 (8%)

Query: 107 SFHARMQRDVKRVATLVRRLSG--GGADAAKHEVQD---FGTDVVSGMDQGSGEYFVRIG 161
           S  AR + D +R A +  +L    GG      EV         + SG   G+G+YFV++ 
Sbjct: 37  SVTARARGDRRRHAYISAQLPSRRGGRQRVAAEVASSSAVSLPMSSGAYAGTGQYFVKVL 96

Query: 162 VGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDP--VFDPADSASFSGVSCSSAVCD-- 217
           VG+P +   +V D+GS++ WV+C         S P  VF P  S S++ V CSS  C   
Sbjct: 97  VGTPAQEFTLVADTGSELTWVKC-----AGGASPPGLVFRPEASKSWAPVPCSSDTCKLD 151

Query: 218 ---RLENAGCHAGRCRYEVSYGDGSY-TKGTLALETLTI----GRTV-VKNVAIGCGHKN 268
               L N    A  C Y+  Y +GS    G +  ++ TI    G+   +++V +GC   +
Sbjct: 152 VPFSLANCSSSASPCSYDYRYKEGSAGALGVVGTDSATIALPGGKVAQLQDVVLGCSSTH 211

Query: 269 QGM-FVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR--GTGSSGSLVFGREALP-V 324
            G  F    G+L LG   +S   +   + GG+FSYCLV       ++G L FG   +P  
Sbjct: 212 DGQSFKSVDGVLSLGNAKISFASRAAARFGGSFSYCLVDHLAPRNATGYLAFGPGQVPRT 271

Query: 325 GAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLP 384
            A    L  +P  P FY V +  + V G  + I  +++        GV++D+GT +T L 
Sbjct: 272 PATQTKLFLDPAMP-FYGVKVDAVHVAGQALDIPAEVW---DPKSGGVILDSGTTLTVLA 327

Query: 385 TPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFV--SVRVPTVSFYFSGGPVLTLPA 442
           TPAY+A   A       +P+      F+ CYN +     +  +P ++  F+G   L  PA
Sbjct: 328 TPAYKAVVAALTKLLAGVPKVD-FPPFEHCYNWTAPRPGAPEIPKLAVQFTGCARLEPPA 386

Query: 443 SNFLIPVDDAGTFCFAFAPSP-SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
            +++I V   G  C         G+S+IGNI Q+     FD  N  V F P+ C
Sbjct: 387 KSYVIDVKP-GVKCIGLQEGEWPGVSVIGNIMQQEHLWEFDLKNMEVRFMPSTC 439


>gi|255556768|ref|XP_002519417.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
 gi|223541280|gb|EEF42831.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
          Length = 494

 Score =  144 bits (363), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 119/409 (29%), Positives = 185/409 (45%), Gaps = 44/409 (10%)

Query: 114 RDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVI 173
           RD  R A L++   GG  D     VQ      + G+      YF R+ +G+PPR   + I
Sbjct: 48  RDHLRHARLLQGFVGGVVD---FSVQGSSDPYLVGL------YFTRVKLGTPPREFNVQI 98

Query: 174 DSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFSGVSCSSAVC-DRLENAGC--- 224
           D+GSD++WV C  CS C + S        FD   S++   V CS  +C  +++       
Sbjct: 99  DTGSDVLWVTCSSCSNCPQTSGLGIQLNYFDTTSSSTARLVPCSHPICTSQIQTTATQCP 158

Query: 225 -HAGRCRYEVSYGDGSYTKGTLALETL----TIGRTVVKN----VAIGCGHKNQGMFV-- 273
             + +C Y   YGDGS T G    +T      +G +++ N    +  GC     G     
Sbjct: 159 PQSNQCSYAFQYGDGSGTSGYYVSDTFYFDAVLGESLIANSSAAIVFGCSTYQSGDLTKT 218

Query: 274 --GAAGLLGLGGGSMSLVGQLG--GQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWV 329
                G+ G G G +S++ QL   G T   FS+CL  +G  S G ++   E L  G  + 
Sbjct: 219 DKAVDGIFGFGQGELSVISQLSSHGITPRVFSHCL--KGEDSGGGILVLGEILEPGIVYS 276

Query: 330 PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYE 389
           PLV  P  P  Y + L  + V G  +PI    F  +   + G ++DTGT +  L   AY+
Sbjct: 277 PLV--PSQPH-YNLDLQSIAVSGQLLPIDPAAFATS--SNRGTIIDTGTTLAYLVEEAYD 331

Query: 390 AFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPV 449
            F  A  A    L   + ++  + CY +S  VS   P VSF F+GG  + L    +L+ +
Sbjct: 332 PFVSAITAAVSQLATPT-INKGNQCYLVSNSVSEVFPPVSFNFAGGATMLLKPEEYLMYL 390

Query: 450 DD---AGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
            +   A  +C  F     G++I+G++  +     +D A+  +G+    C
Sbjct: 391 TNYAGAALWCIGFQKIQGGITILGDLVLKDKIFVYDLAHQRIGWANYDC 439


>gi|21717162|gb|AAM76355.1|AC074196_13 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433286|gb|AAP54824.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
          Length = 397

 Score =  144 bits (362), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 105/348 (30%), Positives = 158/348 (45%), Gaps = 28/348 (8%)

Query: 162 VGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLEN 221
           +G+PP++    ID   ++VW QC  C  C+KQ  PVF P  S++F    C + VC  +  
Sbjct: 60  IGTPPQAASAFIDLTGELVWTQCSQCIHCFKQDLPVFVPNASSTFKPEPCGTDVCKSIPT 119

Query: 222 AGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGC-GHKNQGMFVGAAGLLG 280
             C +  C Y+   G G +T G +A +T  IG     ++  GC    +     G +G +G
Sbjct: 120 PKCASDVCAYDGVTGLGGHTVGIVATDTFAIGTAAPASLGFGCVVASDIDTMGGPSGFIG 179

Query: 281 LGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREA-LPVGAAWVPLVR---NPR 336
           LG    SLV Q+       FSYCL    TG +  L  G  A L  G AW P V+   N  
Sbjct: 180 LGRTPWSLVAQMKLTR---FSYCLAPHDTGKNSRLFLGASAKLAGGGAWTPFVKTSPNDG 236

Query: 337 APSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTA-VTRLPTPAYEAFRDAF 395
              +Y + L  +  G   I +          G + V++ T    V+ L    Y+ F+ A 
Sbjct: 237 MSQYYPIELEEIKAGDATITMPR--------GRNTVLVQTAVVRVSLLVDSVYQEFKKAV 288

Query: 396 VAQTGNLPRASGV-SIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGT 454
           +A  G  P A+ V + F+ C+  +G      P + F F  G  LT+P +N+L  V +  T
Sbjct: 289 MASVGAAPTATPVGAPFEVCFPKAGVSG--APDLVFTFQAGAALTVPPANYLFDVGN-DT 345

Query: 455 FCFAFA-------PSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
            C +          +  GL+I+G+ QQE + + FD     + F P  C
Sbjct: 346 VCLSVMSIALLNITALDGLNILGSFQQENVHLLFDLDKDMLSFEPADC 393


>gi|357125298|ref|XP_003564331.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 524

 Score =  144 bits (362), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 127/376 (33%), Positives = 167/376 (44%), Gaps = 63/376 (16%)

Query: 168 SQYMVIDSGSDIVWVQCQPCSQCYKQS--DPVFDPADSASFSGVSCSSAVCDRLENAG-- 223
           +Q M ID+  DI W+QC+PC         + +FDP  S S + V C S  C  L N G  
Sbjct: 164 AQTMAIDTTIDIPWIQCRPCPPPQCYPQRNALFDPTKSFSAAAVPCGSRACRALGNYGNG 223

Query: 224 -----------------CHAGRCRYEVSYGDGSYTKGTLALETLTIGR-TVVKNVAIGCG 265
                               G C Y V+Y DG  + GT   + LTI   T   N   GC 
Sbjct: 224 CSNNSRRNKKKNKSKSNNSTGDCNYRVAYSDGRVSSGTYMTDILTISPGTSFLNFRFGCS 283

Query: 266 HKNQGMFVG-AAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFG------ 318
           H  +G F G  +G + LGGG  SL+ Q     G AFSYC+      +SG L  G      
Sbjct: 284 HGVRGSFSGETSGTMSLGGGRQSLLSQTARAYGNAFSYCVPK--PSASGFLSLGGAINDG 341

Query: 319 --REALPVGAAWVPLVRNPRA--PSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVM 374
                 P      PL+RN R   P++Y V L G+ V G R+ +   +F        G +M
Sbjct: 342 DSDSDSPSSFVTTPLMRNARIVNPTYYVVRLQGIDVAGRRLNVPPVVF------SGGTLM 395

Query: 375 DTGTAVTRLPTPAYEAFRDAFV------------AQTGNLPRASGVSIFDTCYNLSGFVS 422
           D+   VT+LP  AY A R AF               T + P A G  I DTCY+  G  +
Sbjct: 396 DSSAVVTQLPPTAYRALRLAFRNAMRGYRMNTRNGSTSSTP-AGGEMILDTCYDFEGLDN 454

Query: 423 VRVPTVSFYFSGGPVLTL-PASNFLIPVDDAGTFCFAFAPSPS--GLSIIGNIQQEGIQI 479
           V VPTVS  F GG V+ L P +  ++        C AF P+P+   L  IGN+QQ+  ++
Sbjct: 455 VTVPTVSLVFFGGAVVDLDPTTAVMM------EGCLAFVPTPADFDLGFIGNVQQQTHEV 508

Query: 480 SFDGANGFVGFGPNVC 495
            +D     VGF    C
Sbjct: 509 LYDVGARNVGFRRGAC 524


>gi|449463971|ref|XP_004149703.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 641

 Score =  144 bits (362), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 101/359 (28%), Positives = 166/359 (46%), Gaps = 29/359 (8%)

Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
           +G Y  R+ +G+PP+   +++D+GS + +V C  C QC +  DP FDP  S+++  + C+
Sbjct: 80  NGYYTTRLWIGTPPQQFALIVDTGSTVTYVPCSTCEQCGRHQDPKFDPESSSTYKPIKCN 139

Query: 213 -SAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIG---RTVVKNVAIGCGHKN 268
              +CD          +C YE  Y + S + G L  + ++ G     + +    GC +  
Sbjct: 140 IDCICD------SDGVQCVYERQYAEMSTSSGVLGEDVISFGNQSELIPQRAVFGCENME 193

Query: 269 QGMFVG--AAGLLGLGGGSMSLVGQL--GGQTGGAFSYCLVSRGTGSSGSLVFGREALPV 324
            G      A G++GLG G +SLV QL   G    +FS C      G  G++V G  + P 
Sbjct: 194 TGDLFSQRADGIMGLGTGDLSLVDQLVEKGAINDSFSLCYGGMDIG-GGAMVLGGISPPS 252

Query: 325 GAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLP 384
              +     +P    +Y V L  + V G ++P+S  +F     G  G V+D+GT    LP
Sbjct: 253 DMIFT--YSDPVRSPYYNVDLKEIHVAGKKLPLSSGIFD----GRYGAVLDSGTTYAYLP 306

Query: 385 TPAYEAFRDAFVAQTGNLPRASG--VSIFDTCYNLSG----FVSVRVPTVSFYFSGGPVL 438
             A+ AF+DA + +  +L +  G   +  D C++ +G     +S + PTV   F  G  L
Sbjct: 307 AEAFSAFKDAIMDEIHSLKKIDGPDPNFKDICFSGAGSDAAELSNKFPTVDMVFENGQKL 366

Query: 439 TLPASNFLIPVDDA-GTFCFA-FAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           +L   N+        G +C   F       +++G I      + +D AN  +GF    C
Sbjct: 367 SLTPENYFFRHSKVHGAYCLGIFENGNDQTTLLGGIVVRNTLVMYDRANSKIGFWKTNC 425


>gi|296085638|emb|CBI29432.3| unnamed protein product [Vitis vinifera]
          Length = 337

 Score =  144 bits (362), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 85/233 (36%), Positives = 128/233 (54%), Gaps = 19/233 (8%)

Query: 107 SFHARMQRDVKRVATLVRRLSGGGADAAKHEV--------QDFGTDVVSGMDQGSGEYFV 158
           SF   +  D  RV TL  RL+       K  +        +     +  G   GSG Y+V
Sbjct: 61  SFSDVLAWDDARVKTLNSRLTRKDTRFPKSVLTKKDIRFPKSVSVPLNPGASIGSGNYYV 120

Query: 159 RIGVGSPPRSQYMVIDSGSDIVWVQCQPCS-QCYKQSDPVFDPADSASFSGVSCSSAVCD 217
           ++G GSP R   M++D+GS + W+QC+PC   C+ Q+DP+FDP+ S ++  +SC+S+ C 
Sbjct: 121 KVGFGSPARYYSMIVDTGSSLSWLQCKPCVVYCHVQADPLFDPSASKTYKSLSCTSSQCS 180

Query: 218 RLENAGCH-------AGRCRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQ 269
            L +A  +       +  C Y  SYGD SY+ G L+ + LT+  +  +     GCG  + 
Sbjct: 181 SLVDATLNNPLCETSSNVCVYTASYGDSSYSMGYLSQDLLTLAPSQTLPGFVYGCGQDSD 240

Query: 270 GMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREAL 322
           G+F  AAG+LGLG   +S++GQ+  + G AFSYCL +RG G  G L  G+ +L
Sbjct: 241 GLFGRAAGILGLGRNKLSMLGQVSSKFGYAFSYCLPTRGGG--GFLSIGKASL 291


>gi|449508297|ref|XP_004163275.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 641

 Score =  144 bits (362), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 101/359 (28%), Positives = 166/359 (46%), Gaps = 29/359 (8%)

Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
           +G Y  R+ +G+PP+   +++D+GS + +V C  C QC +  DP FDP  S+++  + C+
Sbjct: 80  NGYYTTRLWIGTPPQQFALIVDTGSTVTYVPCSTCEQCGRHQDPKFDPESSSTYKPIKCN 139

Query: 213 -SAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIG---RTVVKNVAIGCGHKN 268
              +CD          +C YE  Y + S + G L  + ++ G     + +    GC +  
Sbjct: 140 IDCICD------SDGVQCVYERQYAEMSTSSGVLGEDVISFGNQSELIPQRAVFGCENME 193

Query: 269 QGMFVG--AAGLLGLGGGSMSLVGQL--GGQTGGAFSYCLVSRGTGSSGSLVFGREALPV 324
            G      A G++GLG G +SLV QL   G    +FS C      G  G++V G  + P 
Sbjct: 194 TGDLFSQRADGIMGLGTGDLSLVDQLVEKGAINDSFSLCYGGMDIG-GGAMVLGGISPPS 252

Query: 325 GAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLP 384
              +     +P    +Y V L  + V G ++P+S  +F     G  G V+D+GT    LP
Sbjct: 253 DMIFT--YSDPVRSPYYNVDLKEIHVAGKKLPLSSGIFD----GRYGAVLDSGTTYAYLP 306

Query: 385 TPAYEAFRDAFVAQTGNLPRASG--VSIFDTCYNLSG----FVSVRVPTVSFYFSGGPVL 438
             A+ AF+DA + +  +L +  G   +  D C++ +G     +S + PTV   F  G  L
Sbjct: 307 AEAFSAFKDAIMDEIHSLKKIDGPDPNFKDICFSGAGSDAAELSNKFPTVDMVFENGQKL 366

Query: 439 TLPASNFLIPVDDA-GTFCFA-FAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           +L   N+        G +C   F       +++G I      + +D AN  +GF    C
Sbjct: 367 SLTPENYFFRHSKVHGAYCLGIFENGNDQTTLLGGIVVRNTLVMYDRANSKIGFWKTNC 425


>gi|168030587|ref|XP_001767804.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162680886|gb|EDQ67318.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 399

 Score =  143 bits (361), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 108/369 (29%), Positives = 179/369 (48%), Gaps = 35/369 (9%)

Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQC-YKQS----------DPVFDPA 201
            G Y  R+ +G+PP    +++D+GS + +V C  C+ C + Q+          DP F P 
Sbjct: 37  KGYYTSRVFIGTPPNEFALIVDTGSTVTYVPCSSCTHCGHHQASFSTHRLFCRDPRFKPE 96

Query: 202 DSASFSGVSCSSAVCDR-LENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIG---RTVV 257
           +S+S+  + C S+ C   L ++  H  +C+YE  Y + S +KG L  + L  G   R   
Sbjct: 97  NSSSYQKIGCRSSDCITGLCDSNSH--QCKYERMYAEMSTSKGVLGKDLLDFGPASRLQS 154

Query: 258 KNVAIGCGHKNQG-MFVGAA-GLLGLGGGSMSLVGQL--GGQTGGAFSYCLVSRGTGSSG 313
           + ++ GC     G +++  A G++GLG G +S+V QL   G    +FS C      G  G
Sbjct: 155 QLLSFGCETAESGDLYLQVADGIMGLGRGPLSIVDQLVGNGAIEDSFSLCYGGMDEGG-G 213

Query: 314 SLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVV 373
           S+V G  A+P  +  V    +PR  ++Y + L+ + V G  + +  ++F     G  G +
Sbjct: 214 SMVLG--AIPAPSGMVFAKSDPRRSNYYNLELTEIQVQGASLKLDSNVFN----GKFGTI 267

Query: 374 MDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASG--VSIFDTCYNLSGFVSVRV----PT 427
           +D+GT    LP  A+EAF DA VAQ G+L    G   +  D CY  +G  +  +    P 
Sbjct: 268 LDSGTTYAYLPDRAFEAFTDAVVAQLGSLQAVDGPDPNYPDICYAGAGTDTKELGKHFPL 327

Query: 428 VSFYFSGGPVLTLPASNFLIP-VDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANG 486
           V F F+    ++L   N+L       G +C  F  +    +++G I    + +++D  N 
Sbjct: 328 VDFVFAENQKVSLAPENYLFKHTKVPGAYCLGFFKNQDATTLLGGIIVRNMLVTYDRYNH 387

Query: 487 FVGFGPNVC 495
            +GF    C
Sbjct: 388 QIGFLKTNC 396


>gi|302802500|ref|XP_002983004.1| hypothetical protein SELMODRAFT_13348 [Selaginella moellendorffii]
 gi|300149157|gb|EFJ15813.1| hypothetical protein SELMODRAFT_13348 [Selaginella moellendorffii]
          Length = 332

 Score =  143 bits (361), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 107/360 (29%), Positives = 168/360 (46%), Gaps = 46/360 (12%)

Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSS 213
           G Y+  I +GSPP+   +V+D+GSD+ WV+C PCS         FD   S ++  ++C+ 
Sbjct: 1   GVYYSTITLGSPPKDFSLVMDTGSDLTWVRCDPCS---PDCSSTFDRLASNTYKALTCAD 57

Query: 214 AVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNV------AIGCGHK 267
                            Y   YGDGS+T+G L+++TL +       +        GCG  
Sbjct: 58  ----------------DYSYGYGDGSFTQGDLSVDTLKMAGAASDELEEFPGFVFGCGSL 101

Query: 268 NQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSS---GSLVFGREAL-- 322
            +G+  G  G+L L  GS+S   Q+G + G  FSYCL+ +   +S     +VFG  A+  
Sbjct: 102 LKGLISGEVGILALSPGSLSFPSQIGEKYGNKFSYCLLRQTAQNSLKKSPMVFGEAAVEL 161

Query: 323 --PVGAAWVPLVRNPRAPS--FYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGT 378
             P       L   P   S  +Y V L G+ VG  R+ +S   F   Q  D   + D+GT
Sbjct: 162 KEPGSGKLQELQYTPIGESSIYYTVRLDGISVGNQRLDLSPSAFLNGQ--DKPTIFDSGT 219

Query: 379 AVTRLPTPAYEAFRDAFVAQTGNLPRASGVSI--FDTCYNLSGFVSVRVPTVSFYFSGGP 436
            +T LP    ++ + +  +       A  V+I   D C+ +       +P ++F+F+GG 
Sbjct: 220 TLTMLPPGVCDSIKQSLASMVSG---AEFVAIKGLDACFRVPPSSGQGLPDITFHFNGGA 276

Query: 437 VLTLPASNFLIPVDDAGTF-CFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
                 SN++I   D G+  C  F P+ + +SI GN+QQ+   +  D  N  +GF    C
Sbjct: 277 DFVTRPSNYVI---DLGSLQCLIFVPT-NEVSIFGNLQQQDFFVLHDMDNRRIGFKETDC 332


>gi|359476191|ref|XP_003631801.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 439

 Score =  143 bits (360), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 111/358 (31%), Positives = 167/358 (46%), Gaps = 62/358 (17%)

Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSS 213
           G + V +  G+PP++  +++D+GS I W QC+ C+                         
Sbjct: 126 GNFLVDVAFGTPPQNFTLILDTGSSITWTQCKACT------------------------- 160

Query: 214 AVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQGMF 272
                +EN         Y ++YGD S + G    +T+T+  + V +    G G  N+G F
Sbjct: 161 -----VEN--------NYNMTYGDDSTSVGNYGCDTMTLEPSDVFQKFQFGRGRNNKGDF 207

Query: 273 -VGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAA--WV 329
             G  G+LGLG G +S V Q   +    FSYCL      S GSL+FG +A    ++  + 
Sbjct: 208 GSGVDGMLGLGQGQLSTVSQTASKFNKVFSYCLPEED--SIGSLLFGEKATSQSSSLKFT 265

Query: 330 PLVRNP---RAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTP 386
            LV  P   +   +Y+V LS + VG  R+ I   +F        G ++D+ T +TRLP  
Sbjct: 266 SLVNGPGTLQESGYYFVNLSDISVGNERLNIPSSVF-----ASPGTIIDSRTVITRLPQR 320

Query: 387 AYEAFRDAFVAQTGNLPRASGV----SIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPA 442
           AY A + AF       P ++G      I DTCYNLSG   V +P +  +F GG  + L  
Sbjct: 321 AYSALKAAFKKAMAKYPLSNGRRKKGDILDTCYNLSGRKDVLLPEIVLHFGGGADVRLNG 380

Query: 443 SNFLIPVDDAGTFCFAFAPSPSG-----LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           +N +   D++   C AFA +        L+IIGN QQ  + + +D   G +GF  N C
Sbjct: 381 TNIVWGSDES-RLCLAFAGNSKSTMNPELTIIGNRQQLSLTVLYDIQGGRIGFRSNGC 437


>gi|4415912|gb|AAD20143.1| putative protease [Arabidopsis thaliana]
          Length = 469

 Score =  142 bits (359), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 129/414 (31%), Positives = 184/414 (44%), Gaps = 50/414 (12%)

Query: 114 RDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGS------GEYFVRIGVGSPPR 167
           RD  R A   R L GGG  ++   V DF         QGS      G YF ++ +GSPP 
Sbjct: 62  RDRVRHA---RILLGGGRQSSVGGVVDFPV-------QGSSDPYLVGLYFTKVKLGSPPT 111

Query: 168 SQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFSGVSCSSAVCDRL--- 219
              + ID+GSDI+WV C  CS C   S        FD   S +   V+CS  +C  +   
Sbjct: 112 EFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSLTAGSVTCSDPICSSVFQT 171

Query: 220 ENAGC-HAGRCRYEVSYGDGSYTKGTLALETL----TIGRTVVKN----VAIGCGHKNQG 270
             A C    +C Y   YGDGS T G    +T      +G ++V N    +  GC     G
Sbjct: 172 TAAQCSENNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVANSSAPIVFGCSTYQSG 231

Query: 271 MFV----GAAGLLGLGGGSMSLVGQLG--GQTGGAFSYCLVSRGTGSSGSLVFGREALPV 324
                     G+ G G G +S+V QL   G T   FS+CL  +G GS G +    E L  
Sbjct: 232 DLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCL--KGDGSGGGVFVLGEILVP 289

Query: 325 GAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLP 384
           G  + PLV  P  P  Y + L  +GV G  +P+   +F  +     G ++DTGT +T L 
Sbjct: 290 GMVYSPLV--PSQPH-YNLNLLSIGVNGQMLPLDAAVFEASNT--RGTIVDTGTTLTYLV 344

Query: 385 TPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASN 444
             AY+ F +A       L     +S  + CY +S  +S   P+VS  F+GG  + L   +
Sbjct: 345 KEAYDLFLNAISNSVSQLVTPI-ISNGEQCYLVSTSISDMFPSVSLNFAGGASMMLRPQD 403

Query: 445 FLIP---VDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           +L      D A  +C  F  +P   +I+G++  +     +D A   +G+    C
Sbjct: 404 YLFHYGIYDGASMWCIGFQKAPEEQTILGDLVLKDKVFVYDLARQRIGWASYDC 457


>gi|42571079|ref|NP_973613.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|110737616|dbj|BAF00749.1| putative protease [Arabidopsis thaliana]
 gi|330254187|gb|AEC09281.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 507

 Score =  142 bits (359), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 129/414 (31%), Positives = 184/414 (44%), Gaps = 50/414 (12%)

Query: 114 RDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGS------GEYFVRIGVGSPPR 167
           RD  R A   R L GGG  ++   V DF         QGS      G YF ++ +GSPP 
Sbjct: 62  RDRVRHA---RILLGGGRQSSVGGVVDFPV-------QGSSDPYLVGLYFTKVKLGSPPT 111

Query: 168 SQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFSGVSCSSAVCDRL--- 219
              + ID+GSDI+WV C  CS C   S        FD   S +   V+CS  +C  +   
Sbjct: 112 EFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSLTAGSVTCSDPICSSVFQT 171

Query: 220 ENAGC-HAGRCRYEVSYGDGSYTKGTLALETL----TIGRTVVKN----VAIGCGHKNQG 270
             A C    +C Y   YGDGS T G    +T      +G ++V N    +  GC     G
Sbjct: 172 TAAQCSENNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVANSSAPIVFGCSTYQSG 231

Query: 271 MFV----GAAGLLGLGGGSMSLVGQLG--GQTGGAFSYCLVSRGTGSSGSLVFGREALPV 324
                     G+ G G G +S+V QL   G T   FS+CL  +G GS G +    E L  
Sbjct: 232 DLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCL--KGDGSGGGVFVLGEILVP 289

Query: 325 GAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLP 384
           G  + PLV  P  P  Y + L  +GV G  +P+   +F  +     G ++DTGT +T L 
Sbjct: 290 GMVYSPLV--PSQPH-YNLNLLSIGVNGQMLPLDAAVFEASNT--RGTIVDTGTTLTYLV 344

Query: 385 TPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASN 444
             AY+ F +A       L     +S  + CY +S  +S   P+VS  F+GG  + L   +
Sbjct: 345 KEAYDLFLNAISNSVSQLVTPI-ISNGEQCYLVSTSISDMFPSVSLNFAGGASMMLRPQD 403

Query: 445 FLI---PVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           +L      D A  +C  F  +P   +I+G++  +     +D A   +G+    C
Sbjct: 404 YLFHYGIYDGASMWCIGFQKAPEEQTILGDLVLKDKVFVYDLARQRIGWASYDC 457


>gi|125586059|gb|EAZ26723.1| hypothetical protein OsJ_10631 [Oryza sativa Japonica Group]
          Length = 339

 Score =  142 bits (359), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 110/345 (31%), Positives = 162/345 (46%), Gaps = 30/345 (8%)

Query: 162 VGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPAD-SASFSGVSCSSAVCDRLE 220
           +G+PP    + +++G++++W    P  +C++Q+ P F+P   S      SC S       
Sbjct: 1   MGTPPNPVKLKLENGNELIWNHSNPSPECFEQAFPYFEPLTFSRGLPFASCGSP--KFWP 58

Query: 221 NAGCHAGRCRYEVSYGDGSYTKGTLALETLTI--GRTVVKNVAIGCGHKNQGMFV-GAAG 277
           N       C Y  SYGD S T G L ++  T       V  VA GCG  N G+F     G
Sbjct: 59  NQ-----TCVYTYSYGDKSVTTGFLEVDKFTFVGAGASVPGVAFGCGLFNNGVFKSNETG 113

Query: 278 LLGLGGGSMSLVGQLGGQTGGAFSYCL--VSRGTGSSGSLVFGREALPVGAAWV---PLV 332
           + G G G +SL  QL     G FS+C   ++    S+  L    +    G   V   PL+
Sbjct: 114 IAGFGRGPLSLPSQL---KVGNFSHCFTTITGAIPSTVLLDLPADLFSNGQGAVQTTPLI 170

Query: 333 ---RNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYE 389
              +N   P+ YY+ L G+ VG  R+P+ E  F LT  G  G ++D+GT++T LP   Y+
Sbjct: 171 QYAKNEANPTLYYLSLKGITVGSTRLPVPESAFALTN-GTGGTIIDSGTSITSLPPQVYQ 229

Query: 390 AFRDAFVAQTGNLPRASGVSI-FDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIP 448
             RD F AQ   LP   G +    TC++        VP +  +F G   + LP  N++  
Sbjct: 230 VVRDEFAAQI-KLPVVPGNATGHYTCFSAPSQAKPDVPKLVLHFEGA-TMDLPRENYVFE 287

Query: 449 V-DDAGT--FCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGF 490
           V DDAG    C A        +IIGN QQ+ + + +D  N  + F
Sbjct: 288 VPDDAGNSIICLAINKG-DETTIIGNFQQQNMHVLYDLQNNMLSF 331


>gi|115434870|ref|NP_001042193.1| Os01g0178600 [Oryza sativa Japonica Group]
 gi|55296112|dbj|BAD67831.1| putative CDR1 [Oryza sativa Japonica Group]
 gi|55296252|dbj|BAD67993.1| putative CDR1 [Oryza sativa Japonica Group]
 gi|113531724|dbj|BAF04107.1| Os01g0178600 [Oryza sativa Japonica Group]
 gi|125569253|gb|EAZ10768.1| hypothetical protein OsJ_00604 [Oryza sativa Japonica Group]
          Length = 454

 Score =  142 bits (358), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 127/374 (33%), Positives = 179/374 (47%), Gaps = 34/374 (9%)

Query: 144 DVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQP-CSQCYKQSDPV--FDP 200
           DVVS +   S EY + + +GSPPRS   + D+GSD+VWV+C+   +     + P   FDP
Sbjct: 89  DVVSKVVSRSFEYLMTVNLGSPPRSMLAIADTGSDLVWVKCKKGNNDTSSAAAPTTQFDP 148

Query: 201 ADSASFSGVSCSSAVCDRLENAGCHAG-RCRYEVSYGDGSYTKGTLALETLTI-----GR 254
           + S+++  VSC +  C+ L  A C  G  C Y  +YGDGS T G L+ ET T      GR
Sbjct: 149 SRSSTYGRVSCQTDACEALGRATCDDGSNCAYLYAYGDGSNTTGVLSTETFTFDDGGSGR 208

Query: 255 TV----VKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQT--GGAFSYCLVSRG 308
           +     V  V  GC     G F  A GL+GLGGG++SLV QLGG T  G  FSYCLV   
Sbjct: 209 SPRQVRVGGVKFGCSTATAGSFP-ADGLVGLGGGAVSLVTQLGGATSLGRRFSYCLVPHS 267

Query: 309 TGSSGSLVFG--REALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQ 366
             +S +L FG   +    GAA  PLV      ++Y V L  + VG   +           
Sbjct: 268 VNASSALNFGALADVTEPGAASTPLVAG-DVDTYYTVVLDSVKVGNKTV---------AS 317

Query: 367 MGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGF---VSV 423
                +++D+GT +T L         D    +    P  S   +   CYN++G       
Sbjct: 318 AASSRIIVDSGTTLTFLDPSLLGPIVDELSRRITLPPVQSPDGLLQLCYNVAGREVEAGE 377

Query: 424 RVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSG--LSIIGNIQQEGIQISF 481
            +P ++  F GG  + L   N  + V + GT C A   +     +SI+GN+ Q+ I + +
Sbjct: 378 SIPDLTLEFGGGAAVALKPENAFVAVQE-GTLCLAIVATTEQQPVSILGNLAQQNIHVGY 436

Query: 482 DGANGFVGFGPNVC 495
           D   G V F    C
Sbjct: 437 DLDAGTVTFAGADC 450


>gi|242041951|ref|XP_002468370.1| hypothetical protein SORBIDRAFT_01g044790 [Sorghum bicolor]
 gi|241922224|gb|EER95368.1| hypothetical protein SORBIDRAFT_01g044790 [Sorghum bicolor]
          Length = 408

 Score =  142 bits (358), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 107/353 (30%), Positives = 149/353 (42%), Gaps = 27/353 (7%)

Query: 151 QGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVS 210
           Q    Y VR G+G+P +   + +D+ +D  W  C PC  C   S   F PA S+S++ + 
Sbjct: 74  QTPPSYVVRAGLGTPVQQLLLALDTSADATWSHCAPCDTCPAGSR--FIPASSSSYASLP 131

Query: 211 CSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKN---VAIGCGHK 267
           C+S  C             R     G+         +  L       ++    A  CG  
Sbjct: 132 CASDWCPLF----------RRPAVPGEPGRVGAAADVRLLQAASRTPRSGVLAATRCGWA 181

Query: 268 NQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALPVGA 326
                           G MSL+ Q G +  G FSYCL S R    SGSL  G    P   
Sbjct: 182 RTPS-------PATRSGPMSLLSQTGSRYNGVFSYCLPSYRSYYFSGSLRLGAAGQPRNV 234

Query: 327 AWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTP 386
            + PL+ NP  PS YYV ++GL VG   +      F        G V+D+GT +TR   P
Sbjct: 235 RYTPLLTNPHRPSLYYVNVTGLSVGRALVKAPAGSFAFDPSTGAGTVIDSGTVITRWTAP 294

Query: 387 AYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFL 446
            Y A RD F  Q       + +  FDTC+N     +   P V+ +  GG  LTLP  N L
Sbjct: 295 VYAALRDEFRRQVAAPSGYTSLGAFDTCFNTDEVAAGGAPPVTLHMGGGVDLTLPMENTL 354

Query: 447 IPVDDAGTFCFAFAPSP----SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           I        C A A +P    S ++++ N+QQ+ +++  D A   VGF    C
Sbjct: 355 IHSSATPLACLAMAEAPQNVNSVVNVVANLQQQNVRVVVDVAGSRVGFAREPC 407


>gi|168014188|ref|XP_001759635.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162689174|gb|EDQ75547.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 485

 Score =  142 bits (358), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 112/361 (31%), Positives = 175/361 (48%), Gaps = 27/361 (7%)

Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQC-YKQS--DPVFDPADSASFSGV 209
            G Y  R+ +G+P +   +++D+GS + +V C  C+ C + Q+  DP F P +S+S+  V
Sbjct: 96  KGYYTSRVFIGTPAQEFALIVDTGSTVTYVPCSSCTHCGHHQACFDPRFKPDNSSSYQTV 155

Query: 210 SCSSAVC-DRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIG---RTVVKNVAIGCG 265
           SC+S  C  ++ +A  H  +C+YE  Y + S +KG L  + L  G   R     +  GC 
Sbjct: 156 SCNSPDCITKMCDARVH--QCKYERVYAEMSSSKGVLGKDLLGFGNGSRLQPHPLLFGCE 213

Query: 266 HKNQG--MFVGAAGLLGLGGGSMSLVGQL--GGQTGGAFSYCLVSRGTGSSGSLVFGREA 321
               G      A G++GLG G +S+V QL   G    +FS C      G  GS+V G  A
Sbjct: 214 TAETGDLYLQHADGIMGLGRGPLSIVDQLVGTGAMEDSFSLCYGGMDEGG-GSMVLG--A 270

Query: 322 LPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVT 381
           +P   A V    +P   ++Y + LS + V G+ + +  ++F     G  G V+D+GT   
Sbjct: 271 IPPPPAMVFAKSDPNRSNYYNLELSEIQVQGVSLNVPSEVFN----GRLGTVLDSGTTYA 326

Query: 382 RLPTPAYEAFRDAFVAQTGNLPRASG--VSIFDTCYNLSGFVSVRV----PTVSFYFSGG 435
            LP  A++AF+DA   Q G+L    G   S  D C+  +G  S  +    P V F FSG 
Sbjct: 327 YLPDKAFDAFKDAITQQLGSLQAVPGPDPSYPDVCFAGAGSDSKALGKHFPPVDFVFSGN 386

Query: 436 PVLTLPASNFLIP-VDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNV 494
             + L   N+L       G +C  F  +    +++G I      +++D AN  +GF    
Sbjct: 387 QKVFLAPENYLFKHTKVPGAYCLGFFKNQDATTLLGGIVVRNTLVTYDRANHQIGFFKTN 446

Query: 495 C 495
           C
Sbjct: 447 C 447


>gi|218189440|gb|EEC71867.1| hypothetical protein OsI_04576 [Oryza sativa Indica Group]
          Length = 508

 Score =  142 bits (357), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 113/344 (32%), Positives = 158/344 (45%), Gaps = 21/344 (6%)

Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQC-----YKQSDPVFDPADSASFS 207
           +G Y +   VG+PP+    V+D  SD VW+QC  C+ C        S P F    S++  
Sbjct: 94  TGMYVLSFSVGTPPQVVTGVLDITSDFVWMQCSACATCGADAPAATSAPPFYAFLSSTIR 153

Query: 208 GVSCSSAVCDRLENAGCHA--GRCRYEVSYGDGS--YTKGTLALETLTIGRTVVKNVAIG 263
            V C++  C RL    C A    C Y   YG G+   T G LA++           V  G
Sbjct: 154 EVRCANRGCQRLVPQTCSADDSPCGYSYVYGGGAANTTAGLLAVDAFAFATVRADGVIFG 213

Query: 264 CGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLV-FGREAL 322
           C    +G      G++GLG G +SLV QL     G FSY L        GS + F  +A 
Sbjct: 214 CAVATEGDI---GGVIGLGRGELSLVSQL---QIGRFSYYLAPDDAVDVGSFILFLDDAK 267

Query: 323 PVGAAWV--PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAV 380
           P  +  V  PLV N  + S YYV L+G+ V G  + I    F L   G  GVV+     V
Sbjct: 268 PRTSRAVSTPLVANRASRSLYYVELAGIRVDGEDLAIPRGTFDLQADGSGGVVLSITIPV 327

Query: 381 TRLPTPAYEAFRDAFVAQTGNLPRASGVSI-FDTCYNLSGFVSVRVPTVSFYFSGGPVLT 439
           T L   AY+  R A  ++ G L  A G  +  D CY      + +VP+++  F+GG V+ 
Sbjct: 328 TFLDAGAYKVVRQAMASKIG-LRAADGSELGLDLCYTSESLATAKVPSMALVFAGGAVME 386

Query: 440 LPASNFLIPVDDAGTFCFAFAPSPSGL-SIIGNIQQEGIQISFD 482
           L   N+       G  C    PSP+G  S++G++ Q G  + +D
Sbjct: 387 LEMGNYFYMDSTTGLECLTILPSPAGDGSLLGSLIQVGTHMIYD 430


>gi|218184943|gb|EEC67370.1| hypothetical protein OsI_34481 [Oryza sativa Indica Group]
          Length = 367

 Score =  141 bits (356), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 104/348 (29%), Positives = 157/348 (45%), Gaps = 28/348 (8%)

Query: 162 VGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLEN 221
           +G+PP++    ID   ++VW QC  C  C+KQ  PVF P  S++F    C + VC  +  
Sbjct: 30  IGTPPQAASAFIDLTGELVWTQCSQCIHCFKQDLPVFVPNASSTFKPEPCGTDVCKSIPT 89

Query: 222 AGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGC-GHKNQGMFVGAAGLLG 280
             C +  C ++   G G +T G +A +T  IG     ++  GC    +     G +G +G
Sbjct: 90  PKCASDVCAFDGVTGLGGHTVGIVATDTFAIGTAAPASLGFGCVVASDIDTMGGPSGFIG 149

Query: 281 LGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREA-LPVGAAWVPLVR---NPR 336
           LG    SLV Q+       FSYCL    TG +  L  G  A L  G AW P V+   N  
Sbjct: 150 LGRTPWSLVAQMKLTR---FSYCLAPHDTGKNSRLFLGASAKLAGGGAWTPFVKTSPNDG 206

Query: 337 APSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTA-VTRLPTPAYEAFRDAF 395
              +Y + L  +  G   I +          G + V++ T    V+ L    Y+ F+ A 
Sbjct: 207 MSQYYPIELEEIKAGDATITMPR--------GRNTVLVQTAVVRVSLLVDSVYQEFKKAV 258

Query: 396 VAQTGNLPRASGV-SIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGT 454
           +A  G  P A+ V   F+ C+  +G      P + F F  G  LT+P +N+L  V +  T
Sbjct: 259 MASVGAAPTATPVGEPFEVCFPKAGVSG--APDLVFTFQAGAALTVPPANYLFDVGN-DT 315

Query: 455 FCFAFA-------PSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
            C +          +  GL+I+G+ QQE + + FD     + F P  C
Sbjct: 316 VCLSVMSIALLNITALDGLNILGSFQQENVHLLFDLDKDMLSFEPADC 363


>gi|413952263|gb|AFW84912.1| aspartic proteinase nepenthesin-2 [Zea mays]
          Length = 509

 Score =  141 bits (356), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 118/415 (28%), Positives = 188/415 (45%), Gaps = 47/415 (11%)

Query: 113 QRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMV 172
           +RD  R     RRL GG A      V+      + G+      YF R+ +G+P +  ++ 
Sbjct: 54  RRDAARHRVSRRRLLGGVAGVVDFPVEGSANPYMVGL------YFTRVKLGNPAKEFFVQ 107

Query: 173 IDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFSGVSCSSAVCDR--------L 219
           ID+GSDI+WV C PC+ C   S        F+P  S++ S ++CS   C           
Sbjct: 108 IDTGSDILWVTCSPCTGCPTSSGLNIQLESFNPDSSSTASRITCSDDRCTAGFQTGEAIC 167

Query: 220 ENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKN---------VAIGCGHKNQG 270
           + +   +  C Y  +YGDGS T G    +T+    TV+ N         +  GC +   G
Sbjct: 168 QTSNSQSSPCGYTFTYGDGSGTSGYYVSDTMFF-ETVMGNEQTANSSASIVFGCSNSQSG 226

Query: 271 MFVGA----AGLLGLGGGSMSLVGQLG--GQTGGAFSYCLVSRGTGSSGSLVFGREALPV 324
               A     G+ G G   +S++ QL   G +   FS+CL  +G+ + G ++   E +  
Sbjct: 227 DLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCL--KGSDNGGGILVLGEIVEP 284

Query: 325 GAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLP 384
           G  + PLV  P  P  Y + L  + V G ++PI   LF  T     G ++D+GT +  L 
Sbjct: 285 GLVYTPLV--PSQP-HYNLNLESIAVNGQKLPIDSSLF--TTSNTQGTIVDSGTTLAYLA 339

Query: 385 TPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASN 444
             AY+ F  A  A      R S VS    C+  S  V    PTV+ YF GG  +++   N
Sbjct: 340 DGAYDPFVSAIAAAVSPSVR-SLVSKGSQCFITSSSVDSSFPTVTLYFMGGVAMSVKPEN 398

Query: 445 FLI---PVDDAGTFCFAFAPSP-SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           +L+    VD++  +C  +  +    ++I+G++  +     +D AN  +G+    C
Sbjct: 399 YLLQQASVDNSVLWCIGWQRNQGQEITILGDLVLKDKIFVYDLANMRMGWADYDC 453


>gi|226508052|ref|NP_001150337.1| LOC100283967 precursor [Zea mays]
 gi|195638522|gb|ACG38729.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
          Length = 507

 Score =  141 bits (356), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 118/415 (28%), Positives = 188/415 (45%), Gaps = 47/415 (11%)

Query: 113 QRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMV 172
           +RD  R     RRL GG A      V+      + G+      YF R+ +G+P +  ++ 
Sbjct: 52  RRDAARHRVSRRRLLGGVAGVVDFPVEGSANPYMVGL------YFTRVKLGNPAKEFFVQ 105

Query: 173 IDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFSGVSCSSAVCDR--------L 219
           ID+GSDI+WV C PC+ C   S        F+P  S++ S ++CS   C           
Sbjct: 106 IDTGSDILWVTCSPCTGCPTSSGLNIQLESFNPDSSSTASRITCSDDRCTAGFQTGEAIC 165

Query: 220 ENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKN---------VAIGCGHKNQG 270
           + +   +  C Y  +YGDGS T G    +T+    TV+ N         +  GC +   G
Sbjct: 166 QTSNSQSSPCGYTFTYGDGSGTSGYYVSDTMFF-ETVMGNEQTANSSASIVFGCSNSQSG 224

Query: 271 MFVGA----AGLLGLGGGSMSLVGQLG--GQTGGAFSYCLVSRGTGSSGSLVFGREALPV 324
               A     G+ G G   +S++ QL   G +   FS+CL  +G+ + G ++   E +  
Sbjct: 225 DLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCL--KGSDNGGGILVLGEIVEP 282

Query: 325 GAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLP 384
           G  + PLV  P  P  Y + L  + V G ++PI   LF  T     G ++D+GT +  L 
Sbjct: 283 GLVYTPLV--PSQP-HYNLNLESIAVNGQKLPIDSSLF--TTSNTQGTIVDSGTTLAYLA 337

Query: 385 TPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASN 444
             AY+ F  A  A      R S VS    C+  S  V    PTV+ YF GG  +++   N
Sbjct: 338 DGAYDPFVSAIAAAVSPSVR-SLVSKGSQCFITSSSVDSSFPTVTLYFMGGVAMSVKPEN 396

Query: 445 FLI---PVDDAGTFCFAFAPSP-SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           +L+    VD++  +C  +  +    ++I+G++  +     +D AN  +G+    C
Sbjct: 397 YLLQQASVDNSVLWCIGWQRNQGQEITILGDLVLKDKIFVYDLANMRMGWADYDC 451


>gi|297827153|ref|XP_002881459.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297327298|gb|EFH57718.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 507

 Score =  141 bits (355), Expect = 8e-31,   Method: Compositional matrix adjust.
 Identities = 131/417 (31%), Positives = 187/417 (44%), Gaps = 56/417 (13%)

Query: 114 RDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGS------GEYFVRIGVGSPPR 167
           RD  R A   R L GGG  ++   V DF         QGS      G YF ++ +GSPP 
Sbjct: 62  RDRVRHA---RILLGGGRQSSVGGVVDFPV-------QGSSDPYLVGLYFTKVKLGSPPT 111

Query: 168 SQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFSGVSCSSAVCDRL--- 219
              + ID+GSDI+WV C  CS C   S        FD   S +   V+CS  +C  +   
Sbjct: 112 EFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSFTAGSVTCSDPICSSVFQT 171

Query: 220 ENAGC-HAGRCRYEVSYGDGSYTKGTLALETL----TIGRTVVKN----VAIGCGHKNQG 270
             A C    +C Y   YGDGS T G    +T      +G ++V N    +  GC     G
Sbjct: 172 TAAQCSENNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVANSSAPIVFGCSTYQSG 231

Query: 271 MFV----GAAGLLGLGGGSMSLVGQLG--GQTGGAFSYCLVSRGTGSSGSLVFGREALPV 324
                     G+ G G G +S+V QL   G T   FS+CL  +G GS G +    E L  
Sbjct: 232 DLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCL--KGDGSGGGVFVLGEILVP 289

Query: 325 GAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLP 384
           G  + PL+  P  P  Y + L  +GV G  +PI   +F  +     G ++DTGT +T L 
Sbjct: 290 GMVYSPLL--PSQPH-YNLNLLSIGVNGQILPIDAAVFEASNT--RGTIVDTGTTLTYLV 344

Query: 385 TPAYEAFRDAF---VAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLP 441
             AY+ F +A    V+Q   L  ++G    + CY +S  +S   P VS  F+GG  + L 
Sbjct: 345 KEAYDPFLNAISNSVSQLVTLIISNG----EQCYLVSTSISDMFPPVSLNFAGGASMMLR 400

Query: 442 ASNFLIP---VDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
             ++L      D A  +C  F  +P   +I+G++  +     +D A   +G+    C
Sbjct: 401 PQDYLFHYGFYDGASMWCIGFQKAPEEQTILGDLVLKDKVFVYDLARQRIGWANYDC 457


>gi|297798978|ref|XP_002867373.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297313209|gb|EFH43632.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 434

 Score =  141 bits (355), Expect = 9e-31,   Method: Compositional matrix adjust.
 Identities = 114/375 (30%), Positives = 167/375 (44%), Gaps = 42/375 (11%)

Query: 144 DVVSGMDQ--GSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPA 201
           D+VS +        +   I +G PP  Q ++ID+GSD+ W+QC PC +CY Q+ P F P+
Sbjct: 74  DIVSHVTPIPNPAAFLANISIGDPPVPQLLLIDTGSDLTWIQCLPC-KCYPQTIPFFHPS 132

Query: 202 DSASFSGVSCSSAV-----CDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTI---- 252
            S+++   SC SA        R E      G CRY + Y D S T+G LA E LT     
Sbjct: 133 RSSTYRNASCESAPHAMPQIFRDEK----TGNCRYHLRYRDFSNTRGILAKEKLTFQTSD 188

Query: 253 -GRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS--RGT 309
            G     N+  GCG  N G F   +G+LGLG G+ S+V +     G  FSYC  S    T
Sbjct: 189 EGLISKPNIVFGCGQDNSG-FTQYSGVLGLGPGTFSIVTR---NFGSKFSYCFGSLIDPT 244

Query: 310 GSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGD 369
                L+ G  A   G      +   R    YY+ L  + +G   + I   +F+  +   
Sbjct: 245 YPHNFLILGNGARIEGDPTPLQIFQDR----YYLDLQAISLGEKLLDIEPGIFQRYR-SK 299

Query: 370 DGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPR--ASGVSIFDTCY------NLSGFV 421
            G V+DTG + T L   AYE   +      G + R         + CY      +L GF 
Sbjct: 300 GGTVIDTGCSPTILAREAYETLSEEIDFLLGEVLRRVKDWEQYTNHCYEGNLKLDLYGF- 358

Query: 422 SVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSP-SGLSIIGNIQQEGIQIS 480
               P V+F+F+GG  L L   +  +  +   +FC A   +    +S+IG + Q+   + 
Sbjct: 359 ----PVVTFHFAGGAELALDVESLFVSSESGDSFCLAMTMNTFDDMSVIGAMAQQNYNVG 414

Query: 481 FDGANGFVGFGPNVC 495
           ++     V F    C
Sbjct: 415 YNLRTMKVYFQRTDC 429


>gi|218189149|gb|EEC71576.1| hypothetical protein OsI_03949 [Oryza sativa Indica Group]
          Length = 504

 Score =  140 bits (354), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 111/366 (30%), Positives = 172/366 (46%), Gaps = 37/366 (10%)

Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFSG 208
           G YF R+ +GSPP+  ++ ID+GSDI+WV C PC+ C   S        F+P  S++ S 
Sbjct: 89  GLYFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSK 148

Query: 209 VSCSSAVCD---RLENAGCHAGR---CRYEVSYGDGSYTKGTLALETL----TIGRTVVK 258
           + CS   C    +   A C       C Y  +YGDGS T G    +T+     +G     
Sbjct: 149 IPCSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDSVMGNEQTA 208

Query: 259 N----VAIGCGHKNQGMFV----GAAGLLGLGGGSMSLVGQLG--GQTGGAFSYCLVSRG 308
           N    +  GC +   G          G+ G G   +S+V QL   G +   FS+CL  +G
Sbjct: 209 NSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCL--KG 266

Query: 309 TGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMG 368
           + + G ++   E +  G  + PLV  P  P  Y + L  + V G ++PI   LF  T   
Sbjct: 267 SDNGGGILVLGEIVEPGLVYTPLV--PSQP-HYNLNLESIVVNGQKLPIDSSLF--TTSN 321

Query: 369 DDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTV 428
             G ++D+GT +  L   AY+ F +A  A      R S VS  + C+  S  V    PTV
Sbjct: 322 TQGTIVDSGTTLAYLADGAYDPFVNAITAAVSPSVR-SLVSKGNQCFVTSSSVDSSFPTV 380

Query: 429 SFYFSGGPVLTLPASNFLI---PVDDAGTFCFAFAPSP-SGLSIIGNIQQEGIQISFDGA 484
           S YF GG  +T+   N+L+    +D+   +C  +  +    ++I+G++  +     +D A
Sbjct: 381 SLYFMGGVAMTVKPENYLLQQASIDNNVLWCIGWQRNQGQQITILGDLVLKDKIFVYDLA 440

Query: 485 NGFVGF 490
           N  +G+
Sbjct: 441 NMRMGW 446


>gi|53791672|dbj|BAD53242.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
           Group]
          Length = 504

 Score =  140 bits (354), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 111/366 (30%), Positives = 172/366 (46%), Gaps = 37/366 (10%)

Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFSG 208
           G YF R+ +GSPP+  ++ ID+GSDI+WV C PC+ C   S        F+P  S++ S 
Sbjct: 89  GLYFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSK 148

Query: 209 VSCSSAVCD---RLENAGCHAGR---CRYEVSYGDGSYTKGTLALETL----TIGRTVVK 258
           + CS   C    +   A C       C Y  +YGDGS T G    +T+     +G     
Sbjct: 149 IPCSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDTVMGNEQTA 208

Query: 259 N----VAIGCGHKNQGMFV----GAAGLLGLGGGSMSLVGQLG--GQTGGAFSYCLVSRG 308
           N    +  GC +   G          G+ G G   +S+V QL   G +   FS+CL  +G
Sbjct: 209 NSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCL--KG 266

Query: 309 TGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMG 368
           + + G ++   E +  G  + PLV  P  P  Y + L  + V G ++PI   LF  T   
Sbjct: 267 SDNGGGILVLGEIVEPGLVYTPLV--PSQP-HYNLNLESIVVNGQKLPIDSSLF--TTSN 321

Query: 369 DDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTV 428
             G ++D+GT +  L   AY+ F +A  A      R S VS  + C+  S  V    PTV
Sbjct: 322 TQGTIVDSGTTLAYLADGAYDPFVNAITAAVSPSVR-SLVSKGNQCFVTSSSVDSSFPTV 380

Query: 429 SFYFSGGPVLTLPASNFLI---PVDDAGTFCFAFAPSP-SGLSIIGNIQQEGIQISFDGA 484
           S YF GG  +T+   N+L+    +D+   +C  +  +    ++I+G++  +     +D A
Sbjct: 381 SLYFMGGVAMTVKPENYLLQQASIDNNVLWCIGWQRNQGQQITILGDLVLKDKIFVYDLA 440

Query: 485 NGFVGF 490
           N  +G+
Sbjct: 441 NMRMGW 446


>gi|413948408|gb|AFW81057.1| hypothetical protein ZEAMMB73_038743 [Zea mays]
          Length = 469

 Score =  140 bits (354), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 126/418 (30%), Positives = 193/418 (46%), Gaps = 35/418 (8%)

Query: 107 SFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPP 166
           S   R + D +R A +  +L+     AA      F   + SG   G+G+YFVR  VG+P 
Sbjct: 56  SLGERARDDARRHAYIRSQLASRRRRAADVGASAFAMPLSSGAYTGTGQYFVRFRVGTPA 115

Query: 167 RSQYMVIDSGSDIVWVQCQPCSQCYKQSDPV--FDPADSASFSGVSCSSAVCD-----RL 219
           +   +V D+GSD+ WV+C+  +       P   F  ++S S++ ++CSS  C       L
Sbjct: 116 QPFVLVADTGSDLTWVKCRGAAGPPASDPPAREFRASESRSWAPLACSSDTCTSYVPFSL 175

Query: 220 ENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIG---------------RTVVKNVAIGC 264
            N    A  C Y+  Y DGS  +G +  +  TI                R  ++ V +GC
Sbjct: 176 ANCSSPASPCAYDYRYKDGSAARGVVGTDAATIALSGSGSEDGSGGGGRRAKLQGVVLGC 235

Query: 265 GHKNQGM-FVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS----RGTGSSGSLVFGR 319
                G  F  + G+L LG  ++S   +   + GG FSYCLV     R   S  +   G 
Sbjct: 236 TATYDGQSFQSSDGVLSLGNSNISFASRAAARFGGRFSYCLVDHLAPRNASSYLTFGPGP 295

Query: 320 EALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTA 379
           E     AA  PLV + R   FY V +  + V G  + I  D++ + + G  G ++D+GT+
Sbjct: 296 EGGGAPAARTPLVLDRRVSPFYAVAVDAVYVAGEALDIPADVWDVGRGG--GAILDSGTS 353

Query: 380 VTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLT 439
           +T L TPAY A   A   +   LPR + +  F+ CYN +      +P +   F+G   L 
Sbjct: 354 LTVLATPAYRAVVAALGGRLAALPRVA-MDPFEYCYNWTAGAP-EIPKLEVSFAGSARLE 411

Query: 440 LPASNFLIPVDDA-GTFCFAFAP-SPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
            PA +++I  D A G  C      +  G+S+IGNI Q+     FD  + ++ F    C
Sbjct: 412 PPAKSYVI--DAAPGVKCIGVQEGAWPGVSVIGNILQQEHLWEFDLRDRWLRFKHTRC 467


>gi|224068901|ref|XP_002326227.1| predicted protein [Populus trichocarpa]
 gi|222833420|gb|EEE71897.1| predicted protein [Populus trichocarpa]
          Length = 477

 Score =  140 bits (353), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 122/422 (28%), Positives = 185/422 (43%), Gaps = 47/422 (11%)

Query: 102 HRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIG 161
           + H    H    RD  R A L++   GG  D     VQ      + G+      YF ++ 
Sbjct: 21  NNHGLELHQLRARDRLRHARLLQGFVGGVVD---FSVQGSSDPYLVGL------YFTKVK 71

Query: 162 VGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFSGVSCSSAVC 216
           +GSPPR   + ID+GSD++WV C  C+ C + S        FD + S++   V CS  +C
Sbjct: 72  LGSPPREFNVQIDTGSDVLWVCCNSCNNCPRTSGLGIQLNFFDSSSSSTAGQVRCSDPIC 131

Query: 217 DR-----LENAGCHAGRCRYEVSYGDGSYTKGTLALETL----TIGRTVVKN----VAIG 263
                           +C Y   YGDGS T G    +TL     +G++++ N    +  G
Sbjct: 132 TSAVQTTATQCSSQTDQCSYTFQYGDGSGTSGYYVSDTLYFDAILGQSLIDNSSALIVFG 191

Query: 264 CGHKNQGMFV----GAAGLLGLGGGSMSLVGQLG--GQTGGAFSYCLVSRGTGSSGSLVF 317
           C     G          G+ G G G +S++ QL   G T   FS+CL  +G GS G ++ 
Sbjct: 192 CSAYQSGDLTKTDKAVDGIFGFGQGELSVISQLSTRGITPRVFSHCL--KGDGSGGGILV 249

Query: 318 GREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTG 377
             E L  G  + PLV  P  P  Y + L  + V G  +PI    F  +     G ++D+G
Sbjct: 250 LGEILEPGIVYSPLV--PSQPH-YNLNLLSIAVNGQLLPIDPAAFATSN--SQGTIVDSG 304

Query: 378 TAVTRLPTPAYEAFRDAFVAQTGNLPRASGV-SIFDTCYNLSGFVSVRVPTVSFYFSGGP 436
           T +  L   AY+ F  A  A     P  + + S  + CY +S  VS   P  SF F+GG 
Sbjct: 305 TTLAYLVAEAYDPFVSAVNAIVS--PSVTPITSKGNQCYLVSTSVSQMFPLASFNFAGGA 362

Query: 437 VLTLPASNFLIPVDDAG---TFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPN 493
            + L   ++LIP   +G    +C  F     G++I+G++  +     +D     +G+   
Sbjct: 363 SMVLKPEDYLIPFGSSGGSAMWCIGFQ-KVQGVTILGDLVLKDKIFVYDLVRQRIGWANY 421

Query: 494 VC 495
            C
Sbjct: 422 DC 423


>gi|297811181|ref|XP_002873474.1| hypothetical protein ARALYDRAFT_909035 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297319311|gb|EFH49733.1| hypothetical protein ARALYDRAFT_909035 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 293

 Score =  140 bits (353), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 78/198 (39%), Positives = 112/198 (56%), Gaps = 5/198 (2%)

Query: 112 MQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYM 171
           ++RD  RV ++  +LS   AD    + +       +G+  GS  Y V IG+G+P     +
Sbjct: 91  LRRDEARVESIHSKLSKNIADEVS-KAKSTKLPAKNGIILGSPNYIVTIGIGTPKHDISL 149

Query: 172 VIDSGSDIVWVQCQPC-SQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCR 230
           + D+GSD+ W QC+PC   CY Q +P F+P+ S+S+  VSCSS +C   E+  C A  C 
Sbjct: 150 MFDTGSDLTWTQCEPCLGSCYSQKEPKFNPSSSSSYHNVSCSSPMCGNPES--CSASNCL 207

Query: 231 YEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLV 289
           Y + YGDGS T G LA E  T+  + V+ ++  GCG  N+G+F+G+AG+LGLG G  S  
Sbjct: 208 YGIGYGDGSVTVGFLAKEKFTLTNSDVLDDIYFGCGENNKGVFIGSAGILGLGPGKFSFP 267

Query: 290 GQLGGQTGGAFSYCLVSR 307
            Q        FSYC   R
Sbjct: 268 LQTTTTYNNIFSYCCGCR 285


>gi|42569679|ref|NP_181205.2| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|330254186|gb|AEC09280.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 512

 Score =  140 bits (352), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 126/412 (30%), Positives = 184/412 (44%), Gaps = 41/412 (9%)

Query: 114 RDVKRVATLVRRLSGGGADAAKHEVQDF----GTDVVSGMDQGSGEYFVRIGVGSPPRSQ 169
           RD  R A   R L GGG  ++   V DF     +D      + +  YF ++ +GSPP   
Sbjct: 62  RDRVRHA---RILLGGGRQSSVGGVVDFPVQGSSDPYLVGSKMTMLYFTKVKLGSPPTEF 118

Query: 170 YMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFSGVSCSSAVCDRL---EN 221
            + ID+GSDI+WV C  CS C   S        FD   S +   V+CS  +C  +     
Sbjct: 119 NVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSLTAGSVTCSDPICSSVFQTTA 178

Query: 222 AGC-HAGRCRYEVSYGDGSYTKGTLALETL----TIGRTVVKN----VAIGCGHKNQGMF 272
           A C    +C Y   YGDGS T G    +T      +G ++V N    +  GC     G  
Sbjct: 179 AQCSENNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVANSSAPIVFGCSTYQSGDL 238

Query: 273 V----GAAGLLGLGGGSMSLVGQLG--GQTGGAFSYCLVSRGTGSSGSLVFGREALPVGA 326
                   G+ G G G +S+V QL   G T   FS+CL  +G GS G +    E L  G 
Sbjct: 239 TKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCL--KGDGSGGGVFVLGEILVPGM 296

Query: 327 AWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTP 386
            + PLV  P  P  Y + L  +GV G  +P+   +F  +     G ++DTGT +T L   
Sbjct: 297 VYSPLV--PSQPH-YNLNLLSIGVNGQMLPLDAAVFEASNT--RGTIVDTGTTLTYLVKE 351

Query: 387 AYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFL 446
           AY+ F +A       L     +S  + CY +S  +S   P+VS  F+GG  + L   ++L
Sbjct: 352 AYDLFLNAISNSVSQLVTPI-ISNGEQCYLVSTSISDMFPSVSLNFAGGASMMLRPQDYL 410

Query: 447 I---PVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
                 D A  +C  F  +P   +I+G++  +     +D A   +G+    C
Sbjct: 411 FHYGIYDGASMWCIGFQKAPEEQTILGDLVLKDKVFVYDLARQRIGWASYDC 462


>gi|356498789|ref|XP_003518231.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 446

 Score =  140 bits (352), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 114/383 (29%), Positives = 171/383 (44%), Gaps = 51/383 (13%)

Query: 132 DAAKHEV--QDFGTDVVSGMDQGSGE--YFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC 187
           D   H++  Q F  D +S +        + +   +G PP  Q  V+D+GS + WV C PC
Sbjct: 65  DHYSHKILKQTFSNDYISNLVPSPRYVVFLMNFSIGEPPIPQLAVMDTGSSLTWVMCHPC 124

Query: 188 SQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSY-----GDGSYTK 242
           S C +QS P+FDP+ S+++S +SCS   C++ +      G C Y V Y       G Y +
Sbjct: 125 SSCSQQSVPIFDPSKSSTYSNLSCSE--CNKCDVVN---GECPYSVEYVGSGSSQGIYAR 179

Query: 243 GTLALETLTIGRTVVKNVAIGCGHK-----NQGMFVGAAGLLGLGGGSMSLVGQLGGQTG 297
             L LET+      V ++  GCG K     N   + G  G+ GLG G  SL+   G +  
Sbjct: 180 EQLTLETIDESIIKVPSLIFGCGRKFSISSNGYPYQGINGVFGLGSGRFSLLPSFGKK-- 237

Query: 298 GAFSYCLVS-RGTGSS-GSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRI 355
             FSYC+ + R T      LV G +A   G +    V N      YYV L  + +GG ++
Sbjct: 238 --FSYCIGNLRNTNYKFNRLVLGDKANMQGDSTTLNVIN----GLYYVNLEAISIGGRKL 291

Query: 356 PISEDLF-RLTQMGDDGVVMDTGTAVTRLPTPAYEAFR---DAFVAQTGNLPRASGVSIF 411
            I   LF R     + GV++D+G   T L    +E      +  +     L +    + +
Sbjct: 292 DIDPTLFERSITDNNSGVIIDSGADHTWLTKYGFEVLSFEVENLLEGVLVLAQQDKHNPY 351

Query: 412 DTCY------NLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAP---- 461
             CY      +LSGF     P V+F+F+ G VL L  ++  I   +   FC A  P    
Sbjct: 352 TLCYSGVVSQDLSGF-----PLVTFHFAEGAVLDLDVTSMFIQTTE-NEFCMAMLPGNYF 405

Query: 462 --SPSGLSIIGNIQQEGIQISFD 482
                  S IG + Q+   + +D
Sbjct: 406 GDDYESFSSIGMLAQQNYNVGYD 428


>gi|326529233|dbj|BAK01010.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 441

 Score =  140 bits (352), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 129/440 (29%), Positives = 196/440 (44%), Gaps = 40/440 (9%)

Query: 83  LVHRDKMSSSSNTTNNMHY---HRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQ 139
           L  R  + +SS+T   +     H    + +   +R  + VA    RL    A   + +  
Sbjct: 12  LCFRASLVTSSSTGAGLRMKLTHVDDKAGYTTEERVRRAVAVSRERL----AYTQQQQQL 67

Query: 140 DFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQC-QPC--SQCYKQSDP 196
               DV + +   + +Y     +G PP+    +ID+GS+++W QC   C    C KQ  P
Sbjct: 68  RASGDVSAPVHLATRQYIAEYLIGDPPQRAAALIDTGSNLIWTQCGTTCGLKACAKQDLP 127

Query: 197 VFDPADSASFSGVSCSSAVCDRLENAGCHA----GRCRYEVSYGDGSYTKGTLALETLTI 252
            ++ + S++F+ V C+ +      N G H     G C +  SYG GS   G+L  E  T 
Sbjct: 128 YYNLSRSSTFAAVPCADSAKLCAAN-GVHLCGLDGSCTFAASYGAGS-VFGSLGTEAFTF 185

Query: 253 GRTVVKNVAIGC---GHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS--R 307
                K +  GC       +G   GA+GL+GLG G +SLV Q G      FSYCL    R
Sbjct: 186 QSGAAK-LGFGCVSLTRITKGALNGASGLIGLGRGRLSLVSQTGATK---FSYCLTPYLR 241

Query: 308 GTGSSGSLVFGREALPVG----AAWVPLVRNPR---APSFYYVGLSGLGVGGMRIPISED 360
             G+S  L  G  A   G       +P V++P      +FYY+ L G+ VG  ++PI   
Sbjct: 242 NHGASSHLFVGASASLSGGGGAVTSIPFVKSPEDYPYSTFYYLPLVGISVGETKLPIPSA 301

Query: 361 LFRLTQMG----DDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTG-NLPRASGVSIFDTCY 415
            F L ++       GV++DTG+ VT L   AY A  D    Q   +L +    +  D C 
Sbjct: 302 AFELRRVAAGYWSGGVIIDTGSPVTSLAEAAYSALSDEVARQLNRSLVQPPADTGLDLCV 361

Query: 416 NLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQE 475
                  V VP + F+F GG  + + A ++  PVD + T C          ++IGN QQ+
Sbjct: 362 ARQDVDKV-VPVLVFHFGGGADMAVSAGSYWGPVDKS-TACMLIEEGGYE-TVIGNFQQQ 418

Query: 476 GIQISFDGANGFVGFGPNVC 495
            + + +D   G + F    C
Sbjct: 419 DVHLLYDIGKGELSFQTADC 438


>gi|125575542|gb|EAZ16826.1| hypothetical protein OsJ_32298 [Oryza sativa Japonica Group]
          Length = 396

 Score =  140 bits (352), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 107/356 (30%), Positives = 156/356 (43%), Gaps = 34/356 (9%)

Query: 162 VGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLEN 221
           +G+PP+    +ID   ++VW QC  CS+C+KQ  P+F P  S++F    C +  C     
Sbjct: 49  IGTPPQPASAIIDVAGELVWTQCSRCSRCFKQDLPLFIPNASSTFRPEPCGTDACKSTPT 108

Query: 222 AGCHAGRCRYEVSYG---DGSYTKGTLALETLTIGRTVVKNVAIGC-GHKNQGMFVGAAG 277
           + C    C YE +     D   T G +  ET  IG T   ++A GC    +     G +G
Sbjct: 109 SNCSGDVCTYESTTNIRLDRHTTLGIVGTETFAIG-TATASLAFGCVVASDIDTMDGTSG 167

Query: 278 LLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVG---AAWVPLVR- 333
            +GLG    SLV Q+       FSYCL  RGTG S  L  G  A   G    +  P ++ 
Sbjct: 168 FIGLGRTPRSLVAQMKLT---KFSYCLSPRGTGKSSRLFLGSSAKLAGGESTSTAPFIKT 224

Query: 334 NPRAPS--FYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAF 391
           +P   S  +Y + L  +  G   I         T      +VM T +  + L   AY AF
Sbjct: 225 SPDDDSHHYYLLSLDAIRAGNTTI--------ATAQSGGILVMHTVSPFSLLVDSAYRAF 276

Query: 392 RDAFVAQTG---NLPRASGVSIFDTCY-NLSGFVSVRVPTVSFYFSGGPVLTLPASNFLI 447
           + A     G     P A+    FD C+   +GF     P + F F G   LT+P + +LI
Sbjct: 277 KKAVTEAVGGAAEQPMATPPQPFDLCFKKAAGFSRATAPDLVFTFQGAAALTVPPAKYLI 336

Query: 448 PV-DDAGTFCFAFAPSP-------SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
            V ++  T C A             G+S++G++QQE +   +D     + F P  C
Sbjct: 337 DVGEEKDTACAAILSMAWLNRTGLEGVSVLGSLQQEDVHFLYDLKKETLSFEPADC 392


>gi|356558300|ref|XP_003547445.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 447

 Score =  139 bits (351), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 129/451 (28%), Positives = 206/451 (45%), Gaps = 61/451 (13%)

Query: 67  ISSSNT--SSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVK----RVA 120
            SS+NT  S    R   +L+H         + ++ HY  ++ +   RM+ D++    R+A
Sbjct: 21  FSSTNTISSGKPQRLVSKLIH-------PGSVHHPHYKPNETA-KDRMELDIQHSAARLA 72

Query: 121 TLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIV 180
            +  R+ G       ++ +      VS    G       I +G PP  Q +V+D+GSDI+
Sbjct: 73  NIQARIEGSLVSNNDYKAR------VSPSLTGR-TIMANISIGQPPIPQLVVMDTGSDIL 125

Query: 181 WVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGD--- 237
           WV C PC+ C      +FDP+ S++FS + C +  CD     GC      + V+Y D   
Sbjct: 126 WVMCTPCTNCDNDLGLLFDPSKSSTFSPL-CKTP-CDF---EGCRCDPIPFTVTYADNST 180

Query: 238 --GSYTKGTLALETLTIGRTVVKNVAIGCGHK-NQGMFVGAAGLLGLGGGSMSLVGQLGG 294
             G++ + T+  ET   G + + +V  GCGH        G  G+LGL  G  SLV +LG 
Sbjct: 181 ASGTFGRDTVVFETTDEGTSRISDVLFGCGHNIGHDTDPGHNGILGLNNGPDSLVTKLGQ 240

Query: 295 QTGGAFSYCL--VSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGG 352
           +    FSYC+  ++    +   L+ G  A   G +    V N     FYYV + G+ VG 
Sbjct: 241 K----FSYCIGNLADPYYNYHQLILGEGADLEGYSTPFEVYN----GFYYVTMEGISVGE 292

Query: 353 MRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGV--SI 410
            R+ I+ + F + +    GV++DTG+ +T L    ++          G   R + +  S 
Sbjct: 293 KRLDIAPETFEMKENRAGGVIIDTGSTITFLVDSVHKLLSKEVRNLLGWSFRQATIEKSP 352

Query: 411 FDTCY------NLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPS-- 462
           +  C+      +L GF     P V+F+FS G  L L + +F   ++D   FC    P   
Sbjct: 353 WMQCFYGSISRDLVGF-----PVVTFHFSDGADLALDSGSFFNQLND-NVFCMTVGPVSS 406

Query: 463 ---PSGLSIIGNIQQEGIQISFDGANGFVGF 490
               S  S+IG + Q+   + +D  N FV F
Sbjct: 407 LNIKSKPSLIGLLAQQSYNVGYDLVNQFVYF 437


>gi|255565759|ref|XP_002523869.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223536957|gb|EEF38595.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 447

 Score =  139 bits (351), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 115/401 (28%), Positives = 173/401 (43%), Gaps = 48/401 (11%)

Query: 133 AAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQP---CSQ 189
           A  H +++  T  V       G Y + +  G+PP++   V+D+GS  VW  C     C+ 
Sbjct: 56  ARAHHLKNPQTTPV--FSHSYGGYSISLSFGTPPQTLSFVMDTGSSFVWFPCTLRYLCNN 113

Query: 190 C-YKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCR------------YEVSYG 236
           C +      F P  S+S   + C +  C  +         C             Y + YG
Sbjct: 114 CSFTSRISPFLPKHSSSSKIIGCKNPKCSWIHQTDLRCTDCDNNSRNCSQICPPYLILYG 173

Query: 237 DGSYTKGTLAL-ETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQ 295
            G  T G +AL ETL +   +V N  +GC   +       AG+ G G G  SL  QLG  
Sbjct: 174 SG--TTGGVALSETLHLHGLIVPNFLVGCSVFSSRQ---PAGIAGFGRGPSSLPSQLGLT 228

Query: 296 TGGAFSYCLVSRG---TGSSGSLVFGREA----LPVGAAWVPLVRNPRA---PSF---YY 342
               FSYCL+S     T  S SLV   ++          + PLV+NP+    P+F   YY
Sbjct: 229 ---KFSYCLLSHKFDDTQESSSLVLDSQSDSDKKTAALMYTPLVKNPKVQDKPAFSVYYY 285

Query: 343 VGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNL 402
           V L  + +GG  + I        + G+ G ++D+GT  T + T A+E   + F++Q  N 
Sbjct: 286 VSLRRISIGGRSVKIPYKYLSPDKDGNGGTIIDSGTTFTYMSTEAFEILSNEFISQVKNY 345

Query: 403 PRA---SGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAF 459
            RA     +S    C+N+SG   + +P +  +F GG  + LP  N+   +      CF  
Sbjct: 346 ERALMVEALSGLKPCFNVSGAKELELPQLRLHFKGGADVELPLENYFAFLGSREVACFTV 405

Query: 460 ----APSPSGLS-IIGNIQQEGIQISFDGANGFVGFGPNVC 495
               A   SG   I+GN Q +   + +D  N  +GF    C
Sbjct: 406 VTDGAEKASGPGMILGNFQMQNFYVEYDLQNERLGFKKESC 446


>gi|356540510|ref|XP_003538731.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 417

 Score =  139 bits (351), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 123/394 (31%), Positives = 176/394 (44%), Gaps = 59/394 (14%)

Query: 155 EYFVRIGVGS-PPRSQYMVIDSGSDIVWVQCQP-----CSQCYKQSDPVF---------- 198
           +Y +   +GS P +S  + +D+GSD+VW  C P     C   +  + P+           
Sbjct: 18  DYTLSFNLGSHPSQSITLYMDTGSDLVWFPCAPFECILCEGKFNATKPLNITRSHRVSCQ 77

Query: 199 DPADSASFSGVS----CSSAVC--DRLENAGCHAGRCR-YEVSYGDGSYTKGTLALETLT 251
            PA S + S VS    C+ A C  D +E + C +  C  +  +YGDGS+    L  +TL+
Sbjct: 78  SPACSTAHSSVSSHDLCAIARCPLDNIETSDCSSATCPPFYYAYGDGSFI-AHLHRDTLS 136

Query: 252 IGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQT---GGAFSYCLVSRG 308
           + +  +KN   GC H          G+ G G G +SL  QL   +   G  FSYCLVS  
Sbjct: 137 MSQLFLKNFTFGCAHT---ALAEPTGVAGFGRGLLSLPAQLATLSPNLGNRFSYCLVSHS 193

Query: 309 -----TGSSGSLVFGR----EALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISE 359
                      L+ G      +  V   +  ++RNP+   FY VGL+G+ VG   I   E
Sbjct: 194 FDKERVRKPSPLILGHYDDYSSERVEFVYTSMLRNPKHSYFYCVGLTGISVGKRTILAPE 253

Query: 360 DLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNL-PRASGVSI---FDTCY 415
            L R+ + GD GVV+D+GT  T LP   Y +    F  + G +  RAS V        CY
Sbjct: 254 MLRRVDRRGDGGVVVDSGTTFTMLPASLYNSVVAEFDRRVGRVHKRASEVEEKTGLGPCY 313

Query: 416 NLSGFVSVRVPTVSFYFSGGPV-LTLPASNFLIPVDD--------AGTFCFAFAPSPSGL 466
            L G   V VPTV+++F G    + LP  N+     D         G          + L
Sbjct: 314 FLEGL--VEVPTVTWHFLGNNSNVMLPRMNYFYEFLDGEDEARRKVGCLMLMNGGDDTEL 371

Query: 467 S-----IIGNIQQEGIQISFDGANGFVGFGPNVC 495
           S     I+GN QQ+G ++ +D  N  VGF    C
Sbjct: 372 SGGPGAILGNYQQQGFEVVYDLENQRVGFAKRQC 405


>gi|357130715|ref|XP_003566992.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 479

 Score =  139 bits (350), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 113/398 (28%), Positives = 178/398 (44%), Gaps = 52/398 (13%)

Query: 145 VVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDP-------- 196
           + S    G G+YFVR  VG+P +   +V D+GSD+ WV+C+P       ++         
Sbjct: 84  LTSAAYTGIGQYFVRFRVGTPAQPFLLVADTGSDLTWVKCRPAKAAAASTNSSSSASASS 143

Query: 197 ---VFDPADSASFSGVSCSSAVCDR-----LENAGCHAGRCRYEVSYGDGSYTKGTLALE 248
               F P  S +++ + C+S  C +     L         C Y+  Y DGS  +GT+  E
Sbjct: 144 PRRAFRPEKSKTWAPIPCASDTCSKSLPFSLSTCPTPGSPCAYDYRYKDGSAARGTVGTE 203

Query: 249 TLTIG-------------RTVVKNVAIGC-GHKNQGMFVGAAGLLGLGGGSMSLVGQLGG 294
           + TI              +  ++ + +GC G      F  + G+L LG  ++S       
Sbjct: 204 SATIALSSSSSSSKNKVKKAKLQGLVLGCTGSYTGPSFEASDGVLSLGYSNVSFASHAAS 263

Query: 295 QTGGAFSYCLVSRGTGSSGS--LVFGRE---------ALPVGAAWVPLVRNPRAPSFYYV 343
           + GG FSYCLV   +  + +  L FG           A   GA   PLV + R   FY V
Sbjct: 264 RFGGRFSYCLVDHLSPRNATSYLTFGPNSALSGPCPAAAGPGARQTPLVLDSRMRPFYDV 323

Query: 344 GLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLP 403
            +  + V G  + I  D++ +   G  GV++D+GT++T L  PAY A   A   +    P
Sbjct: 324 SIKAISVDGELLKIPRDVWEVD--GGGGVIVDSGTSLTVLAKPAYRAVVAALGKKLARFP 381

Query: 404 RASGVSIFDTCYNLSGFVSV----RVPTVSFYFSGGPVLTLPASNFLIPVDDA-GTFCFA 458
           R + +  F+ CYN +          +P ++ +F+G   L  P+ +++I  D A G  C  
Sbjct: 382 RVA-MDPFEYCYNWTSPSRKDEGDDLPKLAVHFAGSARLEPPSKSYVI--DAAPGVKCIG 438

Query: 459 FAPSP-SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
               P  G+S+IGNI Q+     FD  N  + F  + C
Sbjct: 439 VQEGPWPGISVIGNILQQEHLWEFDLKNRRLRFKRSRC 476


>gi|21717175|gb|AAM76368.1|AC074196_26 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433292|gb|AAP54830.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
          Length = 418

 Score =  139 bits (350), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 101/356 (28%), Positives = 158/356 (44%), Gaps = 37/356 (10%)

Query: 162 VGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLEN 221
           +G+PP+    +ID   ++VW QC  CS+C+KQ  P+F P  S++F    C +  C  +  
Sbjct: 73  IGTPPQPASAIIDVAGELVWTQCSMCSRCFKQDLPLFVPNASSTFRPEPCGTDACKSIPT 132

Query: 222 AGCHAGRCRYE--VSYGDGSYTKGTLALETLTIGRTVVKNVAIGC----GHKNQGMFVGA 275
           + C +  C YE  ++   G +T G +A +T  IG T   ++  GC    G    G   G 
Sbjct: 133 SNCSSNMCTYEGTINSKLGGHTLGIVATDTFAIG-TATASLGFGCVVASGIDTMG---GP 188

Query: 276 AGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVG---AAWVPLV 332
           +GL+GLG    SLV Q+       FSYCL    +G +  L+ G  A   G   +   P V
Sbjct: 189 SGLIGLGRAPSSLVSQMNIT---KFSYCLTPHDSGKNSRLLLGSSAKLAGGGNSTTTPFV 245

Query: 333 RNPRA---PSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYE 389
           +         +Y + L G+  G   I +            + V++ T   ++ L   AY+
Sbjct: 246 KTSPGDDMSQYYPIQLDGIKAGDAAIALPPS--------GNTVLVQTLAPMSFLVDSAYQ 297

Query: 390 AFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYF-SGGPVLTLPASNFLIP 448
           A +       G  P A+ +  FD C+  +G  +   P + F F  G   LT+P   +LI 
Sbjct: 298 ALKKEVTKAVGAAPTATPLQPFDLCFPKAGLSNASAPDLVFTFQQGAAALTVPPPKYLID 357

Query: 449 V-DDAGTFCFAFAPSP--------SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           V ++ GT C A   +           L+I+G++QQE      D     + F P  C
Sbjct: 358 VGEEKGTVCMAILSTSWLNTTALDENLNILGSLQQENTHFLLDLEKKTLSFEPADC 413


>gi|125571841|gb|EAZ13356.1| hypothetical protein OsJ_03278 [Oryza sativa Japonica Group]
          Length = 447

 Score =  139 bits (350), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 79/176 (44%), Positives = 103/176 (58%), Gaps = 14/176 (7%)

Query: 116 VKRVATLVRRLSGGGADAAKHE-----VQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQY 170
            KR + L +RL+   ADAA++           + V SG+   SGEYF  +GVG+P     
Sbjct: 44  AKRGSLLRQRLA---ADAARYASLVDATGRLHSPVFSGIPFESGEYFALVGVGTPSTKAM 100

Query: 171 MVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHA---- 226
           +VID+GSD+VW+QC PC +CY Q   VFDP  S+++  V CSS  C  L   GC +    
Sbjct: 101 LVIDTGSDLVWLQCSPCRRCYAQRGQVFDPRRSSTYRRVPCSSPQCRALRFPGCDSGGAA 160

Query: 227 -GRCRYEVSYGDGSYTKGTLALETLTIGR-TVVKNVAIGCGHKNQGMFVGAAGLLG 280
            G CRY V+YGDGS + G LA + L     T V NV +GCG  N+G+F  AAGLLG
Sbjct: 161 GGGCRYMVAYGDGSSSTGDLATDKLAFANDTYVNNVTLGCGRDNEGLFDSAAGLLG 216



 Score = 80.1 bits (196), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 48/130 (36%), Positives = 70/130 (53%), Gaps = 9/130 (6%)

Query: 375 DTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGV---SIFDTCYNLSGFVSVRVPTVSFY 431
           D+GTA++R    AY A RDAF A+             S+FD CY+L G  +   P +  +
Sbjct: 316 DSGTAISRFARDAYAALRDAFDARARAAGMRRLAGEHSVFDACYDLRGRPAASAPLIVLH 375

Query: 432 FSGGPVLTLPASNFLIPVD----DAGTF--CFAFAPSPSGLSIIGNIQQEGIQISFDGAN 485
           F+GG  + LP  N+ +PVD     A ++  C  F  +  GLS+IGN+QQ+G ++ FD   
Sbjct: 376 FAGGADMALPPENYFLPVDGGRRRAASYRRCLGFEAADDGLSVIGNVQQQGFRVVFDVEK 435

Query: 486 GFVGFGPNVC 495
             +GF P  C
Sbjct: 436 ERIGFAPKGC 445


>gi|15234606|ref|NP_194732.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|4938479|emb|CAB43838.1| putative protein [Arabidopsis thaliana]
 gi|7269903|emb|CAB80996.1| putative protein [Arabidopsis thaliana]
 gi|67633774|gb|AAY78811.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332660310|gb|AEE85710.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 424

 Score =  139 bits (350), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 110/361 (30%), Positives = 163/361 (45%), Gaps = 40/361 (11%)

Query: 156 YFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAV 215
           +   I +G+PP  Q ++ID+GSD+ W+ C PC +CY Q+ P F P+ S+++   SC SA 
Sbjct: 78  FLANISIGNPPVPQLLLIDTGSDLTWIHCLPC-KCYPQTIPFFHPSRSSTYRNASCVSAP 136

Query: 216 -----CDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTI-----GRTVVKNVAIGCG 265
                  R E      G C+Y + Y D S T+G LA E LT      G    +N+  GCG
Sbjct: 137 HAMPQIFRDEK----TGNCQYHLRYRDFSNTRGILAEEKLTFETSDDGLISKQNIVFGCG 192

Query: 266 HKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCL--VSRGTGSSGSLVFGREALP 323
             N G F   +G+LGLG G+ S+V +     G  FSYC   ++  T     L+ G  A  
Sbjct: 193 QDNSG-FTKYSGVLGLGPGTFSIVTR---NFGSKFSYCFGSLTNPTYPHNILILGNGAKI 248

Query: 324 VGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRL 383
            G      +   R    YY+ L  +  G   + I    F+  +    G V+DTG + T L
Sbjct: 249 EGDPTPLQIFQDR----YYLDLQAISFGEKLLDIEPGTFQRYR-SQGGTVIDTGCSPTIL 303

Query: 384 PTPAYEAFRDAFVAQTGN-LPRASGVSIFDT-CY------NLSGFVSVRVPTVSFYFSGG 435
              AYE   +      G  L R      + T CY      +L GF     P V+F+F+GG
Sbjct: 304 AREAYETLSEEIDFLLGEVLRRVKDWDQYTTPCYEGNLKLDLYGF-----PVVTFHFAGG 358

Query: 436 PVLTLPASNFLIPVDDAGTFCFAFAPSP-SGLSIIGNIQQEGIQISFDGANGFVGFGPNV 494
             L L   +  +  +   +FC A   +    +S+IG + Q+   + ++     V F    
Sbjct: 359 AELALDVESLFVSSESGDSFCLAMTMNTFDDMSVIGAMAQQNYNVGYNLRTMKVYFQRTD 418

Query: 495 C 495
           C
Sbjct: 419 C 419


>gi|30682289|ref|NP_187876.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|11994412|dbj|BAB02414.1| chloroplast nucleoid DNA binding protein-like [Arabidopsis
           thaliana]
 gi|332641715|gb|AEE75236.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 461

 Score =  139 bits (349), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 120/397 (30%), Positives = 178/397 (44%), Gaps = 39/397 (9%)

Query: 130 GADAAKHEVQD--------FGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVW 181
           GAD  +H +             D+ SG+D G+ +YF  I VG+P +   +V+D+GS++ W
Sbjct: 72  GADQKRHSLISRKRNSTVGVKMDLGSGIDYGTAQYFTEIRVGTPAKKFRVVVDTGSELTW 131

Query: 182 VQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCD-------RLENAGCHAGRCRYEVS 234
           V C+  ++  K +  VF   +S SF  V C +  C         L      +  C Y+  
Sbjct: 132 VNCRYRARG-KDNRRVFRADESKSFKTVGCLTQTCKVDLMNLFSLTTCPTPSTPCSYDYR 190

Query: 235 YGDGSYTKGTLALETLTIGRT-----VVKNVAIGCGHKNQGM-FVGAAGLLGLGGGSMSL 288
           Y DGS  +G  A ET+T+G T      +    IGC     G  F GA G+LGL     S 
Sbjct: 191 YADGSAAQGVFAKETITVGLTNGRMARLPGHLIGCSSSFTGQSFQGADGVLGLAFSDFSF 250

Query: 289 VGQLGGQTGGAFSYCLVSRGTGS--SGSLVFG--REALPVGAAWVPLVRNPRAPSFYYVG 344
                   G  FSYCLV   +    S  L+FG  R          PL    R P FY + 
Sbjct: 251 TSTATSLYGAKFSYCLVDHLSNKNVSNYLIFGSSRSTKTAFRRTTPLDLT-RIPPFYAIN 309

Query: 345 LSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPR 404
           + G+ +G   + I   ++  T  G  G ++D+GT++T L   AY+            L R
Sbjct: 310 VIGISLGYDMLDIPSQVWDATSGG--GTILDSGTSLTLLADAAYKQVVTGLARYLVELKR 367

Query: 405 AS--GVSIFDTCYNL-SGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDA-GTFCFAF- 459
               GV I + C++  SGF   ++P ++F+  GG        ++L  VD A G  C  F 
Sbjct: 368 VKPEGVPI-EYCFSFTSGFNVSKLPQLTFHLKGGARFEPHRKSYL--VDAAPGVKCLGFV 424

Query: 460 -APSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
            A +P+  ++IGNI Q+     FD     + F P+ C
Sbjct: 425 SAGTPA-TNVIGNIMQQNYLWEFDLMASTLSFAPSAC 460


>gi|357127507|ref|XP_003565421.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
           distachyon]
          Length = 438

 Score =  139 bits (349), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 125/435 (28%), Positives = 194/435 (44%), Gaps = 56/435 (12%)

Query: 79  WNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVR-----RLSGGGADA 133
           +++E +HRD + S         +H    +  AR+++  +R  ++ R     R++   A A
Sbjct: 4   FSVEFIHRDSVKS--------LFHDPTLTPEARLRQAARR--SMARHAHAARINNSAAAA 53

Query: 134 AKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQ 193
                 D   DVVS M   + EY + + V +PP     + D+GS +VW++C+        
Sbjct: 54  GASGSDDSDADVVSPMVPQNFEYLMALDVSTPPVRMLALADTGSSLVWLKCK-------- 105

Query: 194 SDPVFDPADSASFSGVSCSSAVCDRL-ENAGCHA-----GRCRYEVSYGDGSYTKGTLAL 247
             P      S+S++ + C +  C  L + A C A       C Y  ++ DGS T G + +
Sbjct: 106 -LPAAHTPASSSYARLPCDAFACKALGDAASCRATGSGNNICVYRYAFADGSCTAGPVTV 164

Query: 248 ETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGA--FSYCLV 305
           +  T        +  GC  + +G+ V   GL+GL  G +SLV QL  +T  A  FSYCLV
Sbjct: 165 DAFTFS----TRLDFGCATRTEGLSVPDDGLVGLANGPISLVSQLSAKTPFAHKFSYCLV 220

Query: 306 --SRGTGSSGSLVFGREAL---PVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISED 360
             S     S SL FG  A+     GAA  PLV   R  SFY + L  + V G  +P+   
Sbjct: 221 PYSSSETVSSSLNFGSHAIVSSSPGAATTPLVAG-RNKSFYTIALDSIKVAGKPVPL--- 276

Query: 361 LFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRA-SGVSIFDTCYNLSG 419
                Q     +++D+GT +T LP    +    A  A    LPR  S  +++  CY++  
Sbjct: 277 -----QTTTTKLIVDSGTMLTYLPKAVLDPLVAALTAAI-KLPRVKSPETLYAVCYDVRR 330

Query: 420 F----VSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQE 475
                V   +P V+    GG  + LP  N  +  +   T C A   S     I+GN+ Q+
Sbjct: 331 RAPEDVGKSIPDVTLVLGGGGEVRLPWGNTFVVENKGTTVCLALVESHLPEFILGNVAQQ 390

Query: 476 GIQISFDGANGFVGF 490
            + + FD     V F
Sbjct: 391 NLHVGFDLERRTVSF 405


>gi|125558627|gb|EAZ04163.1| hypothetical protein OsI_26305 [Oryza sativa Indica Group]
          Length = 404

 Score =  139 bits (349), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 110/364 (30%), Positives = 167/364 (45%), Gaps = 62/364 (17%)

Query: 149 MDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSG 208
           +D  +G Y + + +G+PP +  ++ D+GS ++W QC PC++C  +  P F PA S++FS 
Sbjct: 83  LDNSAGAYNMNLSIGTPPVTFSVLADTGSSLIWTQCAPCTECAARPAPPFQPASSSTFSK 142

Query: 209 VSCSSAVCDRLENA--GCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGH 266
           + C+S++C  L +    C+A  C Y   YG G +T G LA ETL +G      V  GC  
Sbjct: 143 LPCASSLCQFLTSPYRTCNATGCVYYYPYGMG-FTAGYLATETLHVGGASFPGVTFGCST 201

Query: 267 KNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVG- 325
           +N G+   ++G++GLG   +SLV Q+G      FSYCL S        ++FG  A   G 
Sbjct: 202 EN-GVGNSSSGIVGLGRSPLSLVSQVG---VARFSYCLRSNADAGDSPILFGSLAKVTGG 257

Query: 326 -AAWVPLVRNPRAP--SFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTR 382
                PL+ NP  P  S+YYV L+G+ VG   +P++                        
Sbjct: 258 NVQSTPLLENPEMPSSSYYYVNLTGITVGATDLPMA------------------------ 293

Query: 383 LPTPAYEAFRDAFVAQTGNLPRASGVSI-FDTCYN---LSGFVSVRVPTVSFYFSGGPVL 438
                             NL   +G    FD C++     G   V VPT+   F+GG   
Sbjct: 294 ----------------MANLTTVNGTRFGFDLCFDATAAGGGGGVPVPTLVLRFAGGAEY 337

Query: 439 TLPASNF--LIPVDD---AGTFCFAFAPSPSGL--SIIGNIQQEGIQISFDGANGFVGFG 491
            +   ++  ++ VD    A   C    P+   L  SIIGN+ Q  + + +D   G   F 
Sbjct: 338 AVRRRSYFGVVEVDSQGRAAVECLLVLPASEKLSISIIGNVMQMDLHVLYDLDGGMFSFA 397

Query: 492 PNVC 495
           P  C
Sbjct: 398 PADC 401


>gi|222619345|gb|EEE55477.1| hypothetical protein OsJ_03658 [Oryza sativa Japonica Group]
          Length = 530

 Score =  139 bits (349), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 110/364 (30%), Positives = 171/364 (46%), Gaps = 37/364 (10%)

Query: 156 YFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFSGVS 210
           YF R+ +GSPP+  ++ ID+GSDI+WV C PC+ C   S        F+P  S++ S + 
Sbjct: 117 YFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSKIP 176

Query: 211 CSSAVCD---RLENAGCHAGR---CRYEVSYGDGSYTKGTLALETL----TIGRTVVKN- 259
           CS   C    +   A C       C Y  +YGDGS T G    +T+     +G     N 
Sbjct: 177 CSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDTVMGNEQTANS 236

Query: 260 ---VAIGCGHKNQGMFV----GAAGLLGLGGGSMSLVGQLG--GQTGGAFSYCLVSRGTG 310
              +  GC +   G          G+ G G   +S+V QL   G +   FS+CL  +G+ 
Sbjct: 237 SASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCL--KGSD 294

Query: 311 SSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDD 370
           + G ++   E +  G  + PLV  P  P  Y + L  + V G ++PI   LF  T     
Sbjct: 295 NGGGILVLGEIVEPGLVYTPLV--PSQP-HYNLNLESIVVNGQKLPIDSSLF--TTSNTQ 349

Query: 371 GVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSF 430
           G ++D+GT +  L   AY+ F +A  A      R S VS  + C+  S  V    PTVS 
Sbjct: 350 GTIVDSGTTLAYLADGAYDPFVNAITAAVSPSVR-SLVSKGNQCFVTSSSVDSSFPTVSL 408

Query: 431 YFSGGPVLTLPASNFLI---PVDDAGTFCFAFAPSP-SGLSIIGNIQQEGIQISFDGANG 486
           YF GG  +T+   N+L+    +D+   +C  +  +    ++I+G++  +     +D AN 
Sbjct: 409 YFMGGVAMTVKPENYLLQQASIDNNVLWCIGWQRNQGQQITILGDLVLKDKIFVYDLANM 468

Query: 487 FVGF 490
            +G+
Sbjct: 469 RMGW 472


>gi|293333354|ref|NP_001169607.1| uncharacterized protein LOC100383488 [Zea mays]
 gi|224030351|gb|ACN34251.1| unknown [Zea mays]
          Length = 342

 Score =  138 bits (348), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 112/346 (32%), Positives = 158/346 (45%), Gaps = 40/346 (11%)

Query: 182 VQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHA---GRCRYEVSYGDG 238
           +QCQPC  CY+Q DPVF+P  S+S++ V C+S  C +L+   CH    G C+Y   Y   
Sbjct: 1   MQCQPCVSCYRQLDPVFNPKLSSSYAVVPCTSDTCAQLDGHRCHEDDDGACQYTYKYSGH 60

Query: 239 SYTKGTLALETLTIGRTVVKNVAIGCGHKNQ-GMFVGAAGLLGLGGGSMSLVGQLGGQTG 297
             TKGTLA++ L IG  V   V  GC   +  G    A+GL+GLG G +SLV QL     
Sbjct: 61  GVTKGTLAIDKLAIGGDVFHAVVFGCSDSSVGGPAAQASGLVGLGRGPLSLVSQLSVHR- 119

Query: 298 GAFSYCLVSRGTGSSGSLVFGREALPV----GAAWVPLVRNPRAPSFYYVGLSGLGVGGM 353
             F YCL    + +SG LV G  A  V        V +  + R PS+YY+ L GL VG  
Sbjct: 120 --FMYCLPPPMSRTSGKLVLGAGADAVRNMSDRVTVTMSSSTRYPSYYYLNLDGLAVGDQ 177

Query: 354 RIPISED-------------------LFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDA 394
               + +                   +         G+++D  + ++ L T  Y+   D 
Sbjct: 178 TPGTTRNATSPPSGGAGGGGGGGGGGIVGAGGANAYGMIVDVASTISFLETSLYDELADD 237

Query: 395 FVAQTGNLPRAS-GVSI-FDTCYNLS---GFVSVRVPTVSFYFSGGPVLTLPASNFLIPV 449
              +   LPRA+  + +  D C+ L    G   V VPTVS  F G   L L        V
Sbjct: 238 LEEEI-RLPRATPSLRLGLDLCFILPEGVGMDRVYVPTVSLSFDGR-WLELDRDRLF--V 293

Query: 450 DDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
            D    C     + SG+SI+GN Q + +++ F+   G + F    C
Sbjct: 294 TDGRMMCLMIGRT-SGVSILGNFQLQNMRVLFNLRRGKITFAKASC 338


>gi|17979392|gb|AAL49921.1| unknown protein [Arabidopsis thaliana]
          Length = 439

 Score =  138 bits (348), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 118/396 (29%), Positives = 176/396 (44%), Gaps = 37/396 (9%)

Query: 130 GADAAKHEVQD--------FGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVW 181
           GAD  +H +             D+ SG+D G+ +YF  I VG+P +   +V+D+GS++ W
Sbjct: 50  GADQKRHSLISRKRNSTVGVKMDLGSGIDYGTAQYFTEIRVGTPAKKFRVVVDTGSELTW 109

Query: 182 VQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCD-------RLENAGCHAGRCRYEVS 234
           V C+  ++  K +  VF   +S SF  V C +  C         L      +  C Y+  
Sbjct: 110 VNCRYRARG-KDNRRVFRADESKSFKTVGCLTQTCKVDLMNLFSLTTCPTPSTPCSYDYR 168

Query: 235 YGDGSYTKGTLALETLTIGRT-----VVKNVAIGCGHKNQGM-FVGAAGLLGLGGGSMSL 288
           Y DGS  +G  A ET+T+G T      +    IGC     G  F GA G+LGL     S 
Sbjct: 169 YADGSAAQGVFAKETITVGLTNGRMARLPGHLIGCSSSFTGQSFQGADGVLGLAFSDFSF 228

Query: 289 VGQLGGQTGGAFSYCLVSRGTGS--SGSLVFG--REALPVGAAWVPLVRNPRAPSFYYVG 344
                   G  FSYCLV   +    S  L+FG  R          PL    R P FY + 
Sbjct: 229 TSTATSLYGAKFSYCLVDHLSNKNVSNYLIFGSSRSTKTAFRRTTPLDLT-RIPPFYAIN 287

Query: 345 LSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPR 404
           + G+ +G   + I   ++  T  G  G ++D+GT++T L   AY+            L R
Sbjct: 288 VIGISLGYDMLDIPSQVWDATSGG--GTILDSGTSLTLLADAAYKQVVTGLARYLVELKR 345

Query: 405 AS--GVSIFDTCYNL-SGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDA-GTFCFAFA 460
               GV I + C++  SGF   ++P ++F+  GG        ++L  VD A G  C  F 
Sbjct: 346 VKPEGVPI-EYCFSFTSGFNVSKLPQLTFHLKGGARFEPHRKSYL--VDAAPGVKCLGFV 402

Query: 461 PSPS-GLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
            + +   ++IGNI Q+     FD     + F P+ C
Sbjct: 403 SAGTPATNVIGNIMQQNYLWEFDLMASTLSFAPSAC 438


>gi|413946455|gb|AFW79104.1| hypothetical protein ZEAMMB73_209101 [Zea mays]
          Length = 480

 Score =  138 bits (348), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 118/382 (30%), Positives = 180/382 (47%), Gaps = 39/382 (10%)

Query: 147 SGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-PVFDPADSAS 205
           SG   G+G+YFVR  VG+P +   +V D+GSD+ WV+C         +   VF  A S S
Sbjct: 103 SGAYTGTGQYFVRFRVGTPAQPFVLVADTGSDLTWVKCSGAGDGTGDAPRRVFRAAASRS 162

Query: 206 FSGVSCSSAVCD-----RLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIG------- 253
           ++ ++CSS  C       L N    A  C Y+  Y DGS  +G +  ++ TI        
Sbjct: 163 WAPIACSSDTCTSYVPFSLANCSSPASPCAYDYRYNDGSAARGVVGTDSATIALSGSESR 222

Query: 254 -----RTVVKNVAIGCGHKNQGM-FVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR 307
                R  ++ V +GC     G  F  + G+L LG  ++S   +   + GG FSYCLV  
Sbjct: 223 DGGGRRAKLQGVVLGCTASYDGQSFQSSDGVLSLGNSNISFASRAAARFGGRFSYCLVDH 282

Query: 308 --GTGSSGSLVFGREALPVGAAW----------VPLVRNPRAPSFYYVGLSGLGVGGMRI 355
                ++  L FG      GAA            PL+ + R   FY V +  + V G  +
Sbjct: 283 LAPRNATSYLTFGPPGPEGGAAASSSSSSAAARTPLLLDRRMSPFYAVAVDAVHVAGEAL 342

Query: 356 PISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCY 415
            I  D++ + + G  G ++D+GT++T L TPAY A   A   +   LPR S +  F+ CY
Sbjct: 343 DIPADVWDVARGG--GAILDSGTSLTVLATPAYRAVVAALSERLAGLPRVS-MDPFEYCY 399

Query: 416 NLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDA-GTFCFAFAP-SPSGLSIIGNIQ 473
           N +   ++ +P +   F+G   L  PA +++  VD A G  C      +  G+S+IGNI 
Sbjct: 400 NWTA-AALEIPGLEVRFAGSARLQPPAKSYV--VDAAPGVKCIGVQEGAWPGVSVIGNIL 456

Query: 474 QEGIQISFDGANGFVGFGPNVC 495
           Q+     FD  + ++ F    C
Sbjct: 457 QQDHLWEFDLRDRWLRFKHTRC 478


>gi|125553836|gb|EAY99441.1| hypothetical protein OsI_21409 [Oryza sativa Indica Group]
          Length = 376

 Score =  138 bits (347), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 113/320 (35%), Positives = 154/320 (48%), Gaps = 33/320 (10%)

Query: 65  NNISSSNTSSDEARW-NLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLV 123
           + ++ SN +S  + W  L LV      + S  T+N        S    +  D  RVA + 
Sbjct: 48  HKVAPSNEASLNSTWAPLHLVSGPCSPAYSRGTDNSSTDDDVTSIAKMLDADQHRVAYIQ 107

Query: 124 RRLSGG----GADAAKHEVQ--DFGTDVVSGMDQGSGEYFVRIGVGSPPR-----SQYMV 172
           +RL+GG    G   A  + Q  D GT  +   + G G     IG  + P       Q ++
Sbjct: 108 KRLAGGDTSNGVAGASWDGQTTDVGT-YLPASNVGVGAKM--IGTTAAPDGTSAVRQTVI 164

Query: 173 IDSGSDIVWVQCQPCSQ--CYKQSDPVFDPADSASFSGVSCSSAVCDRL--ENAGCHAG- 227
           IDSGSD+ WVQCQPC    C+ Q DP+FDPA S ++S V CSSA C RL     GC A  
Sbjct: 165 IDSGSDVPWVQCQPCPLLVCHPQRDPLFDPATSTTYSAVPCSSAACARLGPYRRGCSANV 224

Query: 228 RCRYEVSYGDGSYTKGTLALETLTIGR-TVVKNVAIGCGHKNQG--MFVGAAGLLGLGGG 284
           +C++  +Y DG+   GT + + LT+G   VV+    GC H ++G       +G L LGGG
Sbjct: 225 QCQFGFTYTDGATATGTYSSDDLTLGPYDVVRGFLFGCAHADRGSTFSFDVSGTLALGGG 284

Query: 285 SMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVP-LVRNP------RA 337
           + S V Q   Q G  FSYC +     S G +  G    P  AA VP  V  P        
Sbjct: 285 AQSFVQQTATQYGRVFSYC-IPPSPSSLGFITLGVP--PQRAALVPTFVSTPLLSSSSMP 341

Query: 338 PSFYYVGLSGLGVGGMRIPI 357
           P+FY V L  + V G  +P+
Sbjct: 342 PTFYRVLLRAIIVAGRPLPV 361


>gi|222637181|gb|EEE67313.1| hypothetical protein OsJ_24553 [Oryza sativa Japonica Group]
          Length = 414

 Score =  138 bits (347), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 111/330 (33%), Positives = 165/330 (50%), Gaps = 28/330 (8%)

Query: 189 QCYKQSDPVFDPADSASFSGVSCSSAVCDRLENA--GCHAGRCRYEVSYGDGSYTKGTLA 246
           +C  +  P F PA S++FS + C+S++C  L +    C+A  C Y   YG G +T G LA
Sbjct: 87  ECAARPAPPFQPASSSTFSKLPCASSLCQFLTSPYLTCNATGCVYYYPYGMG-FTAGYLA 145

Query: 247 LETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS 306
            ETL +G      VA GC  +N G+   ++G++GLG   +SLV Q+G    G FSYCL S
Sbjct: 146 TETLHVGGASFPGVAFGCSTEN-GVGNSSSGIVGLGRSPLSLVSQVGV---GRFSYCLRS 201

Query: 307 RGTGSSGSLVFGREALPVGAAWVP-LVRNPRAPS--FYYVGLSGLGVGGMRIPISEDLFR 363
                   ++FG  A   G    P ++ NP  PS  +YYV L+G+ VG   +P++   F 
Sbjct: 202 DADAGDSPILFGSLAKVTGGKSSPAILENPEMPSSSYYYVNLTGITVGATDLPVTSTTFG 261

Query: 364 LTQMGDDGVV----MDTGTAVTRLPTPAYEAFRDAFVAQ--TGNL-PRASGVSI-FDTCY 415
            T+    G+V    +D+GT +T L    Y   + AF++Q  T NL    +G    FD C+
Sbjct: 262 FTRGAGAGLVGGTIVDSGTTLTYLVKEGYAMVKRAFLSQMATANLTTTVNGTRFGFDLCF 321

Query: 416 NLS---GFVSVRVPTVSFYFSGGPVLTLPASNF--LIPVDD---AGTFCFAFAPSPSGL- 466
           + +   G   V VPT+   F+GG    +   ++  ++ VD    A   C    P+   L 
Sbjct: 322 DANAAGGGSGVPVPTLVLRFAGGAEYAVRRRSYVGVVEVDSQGRAAVECLLVLPASEKLS 381

Query: 467 -SIIGNIQQEGIQISFDGANGFVGFGPNVC 495
            SIIGN+ Q  + + +D   G   F P  C
Sbjct: 382 ISIIGNVMQMDLHVLYDLDGGMFSFAPADC 411


>gi|296087361|emb|CBI33735.3| unnamed protein product [Vitis vinifera]
          Length = 633

 Score =  137 bits (346), Expect = 9e-30,   Method: Compositional matrix adjust.
 Identities = 101/358 (28%), Positives = 166/358 (46%), Gaps = 29/358 (8%)

Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS- 212
           G Y  RI +G+PP++  +++D+GS + +V C  C QC K  DP F P  S+++  + CS 
Sbjct: 90  GYYTTRIWIGTPPQTFALIVDTGSTLTYVPCSTCEQCGKHQDPNFQPDWSSTYQPLKCSM 149

Query: 213 SAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT---VVKNVAIGCGHKNQ 269
              CD           C Y+  Y + S + G L  + ++ G+      +    GC +   
Sbjct: 150 ECTCDS------EMMHCVYDRQYAEMSSSSGVLGEDIVSFGKQSELKPQRTVFGCENVET 203

Query: 270 GMFVG--AAGLLGLGGGSMSLVGQL--GGQTGGAFSYCLVSRGTGSSGSLVFGREALPVG 325
           G      A G++GLG G +S+V QL   G  G +FS C      G  G++V G  + P G
Sbjct: 204 GDIYSQRADGIMGLGRGDLSIVDQLVEKGVIGNSFSLCYGGMDVG-GGAMVLGGISPPAG 262

Query: 326 AAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPT 385
             +     +P   ++Y + L  + + G ++PI+  +F     G  G ++D+GT    LP 
Sbjct: 263 MVFTH--SDPARSAYYNIDLKEIHIAGKQLPINPMVFD----GKYGTILDSGTTYAYLPE 316

Query: 386 PAYEAFRDAFVAQTGNLPRASGV--SIFDTCYNLSGF----VSVRVPTVSFYFSGGPVLT 439
           PA++AF+DA + +  +L    G   +  D C++  G     +S   P V   FS G  L+
Sbjct: 317 PAFKAFKDAIMKELNSLKLIQGPDRNYNDICFSGVGSDVSQLSKTFPAVDLVFSNGNRLS 376

Query: 440 LPASNFLIPVDDA-GTFCFA-FAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           L   N+L     A G +C   F       +++G I      + +D  +  +GF    C
Sbjct: 377 LSPENYLFQHSKAHGAYCLGIFQNENDQTTLLGGIIVRNTLVMYDREHLKIGFWKTNC 434


>gi|225438908|ref|XP_002279194.1| PREDICTED: aspartic proteinase-like protein 2-like [Vitis vinifera]
          Length = 634

 Score =  137 bits (345), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 101/358 (28%), Positives = 166/358 (46%), Gaps = 29/358 (8%)

Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS- 212
           G Y  RI +G+PP++  +++D+GS + +V C  C QC K  DP F P  S+++  + CS 
Sbjct: 90  GYYTTRIWIGTPPQTFALIVDTGSTLTYVPCSTCEQCGKHQDPNFQPDWSSTYQPLKCSM 149

Query: 213 SAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT---VVKNVAIGCGHKNQ 269
              CD           C Y+  Y + S + G L  + ++ G+      +    GC +   
Sbjct: 150 ECTCDS------EMMHCVYDRQYAEMSSSSGVLGEDIVSFGKQSELKPQRTVFGCENVET 203

Query: 270 GMFVG--AAGLLGLGGGSMSLVGQL--GGQTGGAFSYCLVSRGTGSSGSLVFGREALPVG 325
           G      A G++GLG G +S+V QL   G  G +FS C      G  G++V G  + P G
Sbjct: 204 GDIYSQRADGIMGLGRGDLSIVDQLVEKGVIGNSFSLCYGGMDVG-GGAMVLGGISPPAG 262

Query: 326 AAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPT 385
             +     +P   ++Y + L  + + G ++PI+  +F     G  G ++D+GT    LP 
Sbjct: 263 MVFTH--SDPARSAYYNIDLKEIHIAGKQLPINPMVFD----GKYGTILDSGTTYAYLPE 316

Query: 386 PAYEAFRDAFVAQTGNLPRASGV--SIFDTCYNLSGF----VSVRVPTVSFYFSGGPVLT 439
           PA++AF+DA + +  +L    G   +  D C++  G     +S   P V   FS G  L+
Sbjct: 317 PAFKAFKDAIMKELNSLKLIQGPDRNYNDICFSGVGSDVSQLSKTFPAVDLVFSNGNRLS 376

Query: 440 LPASNFLIPVDDA-GTFCFA-FAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           L   N+L     A G +C   F       +++G I      + +D  +  +GF    C
Sbjct: 377 LSPENYLFQHSKAHGAYCLGIFQNENDQTTLLGGIIVRNTLVMYDREHLKIGFWKTNC 434


>gi|255547548|ref|XP_002514831.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
 gi|223545882|gb|EEF47385.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
          Length = 488

 Score =  137 bits (345), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 107/373 (28%), Positives = 171/373 (45%), Gaps = 41/373 (10%)

Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFS 207
           +G YF +IG+G+PP+  Y+ +D+GSDI+WV C  C +C  +SD      ++DP  S S +
Sbjct: 79  AGLYFAKIGLGNPPKDYYVQVDTGSDILWVNCANCDKCPTKSDLGVKLTLYDPQSSTSAT 138

Query: 208 GVSCSSAVCDRLENA---GCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRTVVK----- 258
            + C    C    N    GC     C+Y V YGDGS T G    + L   R         
Sbjct: 139 RIYCDDDFCAATYNGVLQGCTKDLPCQYSVVYGDGSSTAGFFVKDNLQFDRVTGNLQTSS 198

Query: 259 ---NVAIGCGHKNQGMFVGAA----GLLGLGGGSMSLVGQL--GGQTGGAFSYCLVSRGT 309
              +V  GCG K  G    ++    G+LG G  + S++ QL   G+    F++CL +   
Sbjct: 199 ANGSVIFGCGAKQSGELGTSSEALDGILGFGQANSSMISQLAAAGKVKRVFAHCLDNVKG 258

Query: 310 GSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGD 369
           G  G    G    P      P+V  P  P  Y V +  + VGG  + +  D+F     GD
Sbjct: 259 G--GIFAIGEVVSP-KVNTTPMV--PNQPH-YNVVMKEIEVGGNVLELPTDIF---DTGD 309

Query: 370 -DGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTV 428
             G ++D+GT +  LP   YE+     V++   L   +    F TC+  +G V+   P V
Sbjct: 310 RRGTIIDSGTTLAYLPEVVYESMMTKIVSEQPGLKLHTVEEQF-TCFQYTGNVNEGFPVV 368

Query: 429 SFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPS------PSGLSIIGNIQQEGIQISFD 482
            F+F+G   LT+   ++L  + +   +CF +  S         ++++G++      + +D
Sbjct: 369 KFHFNGSLSLTVNPHDYLFQIHEE-VWCFGWQNSGMQSKDGRDMTLLGDLVLSNKLVLYD 427

Query: 483 GANGFVGFGPNVC 495
             N  +G+    C
Sbjct: 428 LENQAIGWTDYNC 440


>gi|357127503|ref|XP_003565419.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
           distachyon]
          Length = 486

 Score =  137 bits (345), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 140/485 (28%), Positives = 208/485 (42%), Gaps = 61/485 (12%)

Query: 49  TDHAKMSQYNELFERHNNISSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSF 108
           TDH     YN   +  N   S    ++E  +++E +HRD + S  +      + R   + 
Sbjct: 14  TDHV----YNLRHKAINLFVSPAVGAEEDGFSVEFIHRDSVKSPFHDPALTPHGRALAAA 69

Query: 109 HARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRS 168
                R  +    L RR SG  +          G  VV+ +     EY + I VG+PP  
Sbjct: 70  RRSAARAAELHHLLARRSSGAPSPGT-------GAGVVAEVVSRQFEYLMAIEVGTPPVR 122

Query: 169 QYMVIDSGSDIVWVQCQPCSQCYKQSDP---VFDPADSASFSGVSCSSAVCDRLENAGCH 225
              + D+GSD+VWV+C+        + P    F P+ S+++  V C +  C  L +A   
Sbjct: 123 VLAIADTGSDLVWVKCKGKDNDNNSTAPPSVYFVPSASSTYGRVGCDTKACRALSSAASC 182

Query: 226 A--GRCRYEVSYGDGSYTKGTLALETLTIG----------------------RTVVKNVA 261
           +  G C Y  SYGDGS   G L+ ET T                        +  +  + 
Sbjct: 183 SPDGSCEYLYSYGDGSRASGQLSTETFTFSTIADSSKTNSHGNNNNNSSSHGQVEIAKLD 242

Query: 262 IGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQT--GGAFSYCLVSRG-TGSSGSLVFG 318
            GC     G F  A GL+GLGGG +SL  QLG  T  G  FSYCL     T +S +L FG
Sbjct: 243 FGCSTTTTGTF-RADGLVGLGGGPVSLASQLGATTSLGRKFSYCLAPYANTNASSALNFG 301

Query: 319 REAL--PVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDT 376
             A+    GAA  PL+      ++Y + L  + V G + P        T      +++D+
Sbjct: 302 SRAVVSEPGAASTPLITG-EVETYYTIALDSINVAGTKRP--------TTAAQAHIIVDS 352

Query: 377 GTAVTRLPTPAYEAFRDAFVAQTGNLPRA-SGVSIFDTCYNLSGFV---SVRVPTVSFYF 432
           GT +T L +            +   LPRA S   I D CY++SG     ++ +P V+   
Sbjct: 353 GTTLTYLDSALLTPLVKDLTRRI-KLPRAESPEKILDLCYDISGVRGEDALGIPDVTLVL 411

Query: 433 SGGPVLTLPASNFLIPVDDAGTFCFAFAPSPS--GLSIIGNIQQEGIQISFDGANGFVGF 490
            GG  +TL   N  + V + G  C A   +     +SI+GNI Q+ + + +D   G V F
Sbjct: 412 GGGGEVTLKPDNTFVVVQE-GVLCLALVATSERQSVSILGNIAQQNLHVGYDLEKGTVTF 470

Query: 491 GPNVC 495
               C
Sbjct: 471 AAADC 475


>gi|238011160|gb|ACR36615.1| unknown [Zea mays]
          Length = 461

 Score =  136 bits (343), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 123/427 (28%), Positives = 188/427 (44%), Gaps = 75/427 (17%)

Query: 139 QDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQC-------- 190
           + F   + SG   G+G+YFVR  VG+P R   +V D+GSD+ WV+C+  +          
Sbjct: 38  EAFAMPLSSGAYTGTGQYFVRFRVGTPARPFLLVADTGSDLTWVKCRRHAAPAPAPAPAP 97

Query: 191 ---YKQSDP-----------------VFDPADSASFSGVSCSSAVCDR---LENAGCHA- 226
              Y    P                 VF P  S +++ + CSS  C        A C   
Sbjct: 98  GYNYGYGAPASNDSSSVSAAASSPARVFRPDRSRTWAPIPCSSDTCTASLPFSLAACPTP 157

Query: 227 -GRCRYEVSYGDGSYTKGTLALETLTIG-----------RTVVKNVAIGCGHKNQGM-FV 273
              C YE  Y DGS  +GT+  ++ TI            R  ++ V +GC     G  F+
Sbjct: 158 GSPCAYEYRYKDGSAARGTVGTDSATIALSGRRAGKKQRRAKLRGVVLGCTTSYTGESFL 217

Query: 274 GAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR--GTGSSGSLVFGRE----------- 320
            + G+L LG  ++S   +   + GG FSYCLV       ++  L FG             
Sbjct: 218 ASDGVLSLGYSNVSFASRAAARFGGRFSYCLVDHLAPRNATSYLTFGPNPAVSSASASRT 277

Query: 321 -----ALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMD 375
                A   GA   PL+ + R   FY V ++G+ V G  + I   ++ + + G  G ++D
Sbjct: 278 ACAGSAAAPGARQTPLLLDHRMRPFYAVAVNGVSVDGELLRIPRLVWDVQKGG--GAILD 335

Query: 376 TGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVS-----VRVPTVSF 430
           +GT++T L +PAY A   A   +   LPR + +  FD CYN +  ++     V VP ++ 
Sbjct: 336 SGTSLTVLVSPAYRAVVAALGKKLVGLPRVA-MDPFDYCYNWTSPLTGEDLAVAVPALAV 394

Query: 431 YFSGGPVLTLPASNFLIPVDDA-GTFCFAFAPSP-SGLSIIGNIQQEGIQISFDGANGFV 488
           +F+G   L  P  +++I  D A G  C         G+S+IGNI Q+     FD  N  +
Sbjct: 395 HFAGSARLQPPPKSYVI--DAAPGVKCIGLQEGDWPGVSVIGNILQQEHLWEFDLKNRRL 452

Query: 489 GFGPNVC 495
            F  + C
Sbjct: 453 RFKRSRC 459


>gi|56784779|dbj|BAD82000.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
           Group]
          Length = 486

 Score =  136 bits (343), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 110/344 (31%), Positives = 156/344 (45%), Gaps = 21/344 (6%)

Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQC-----YKQSDPVFDPADSASFS 207
           +G Y +   VG+PP+    V+D  SD VW+QC  C+ C        S P F    S++  
Sbjct: 94  TGMYVLSFSVGTPPQVVTGVLDITSDFVWMQCSACATCGADAPAATSAPPFYAFLSSTIR 153

Query: 208 GVSCSSAVCDRLENAGCHA--GRCRYEVSYGDGS--YTKGTLALETLTIGRTVVKNVAIG 263
            V C++  C RL    C A    C Y   YG G+   T G LA++           V  G
Sbjct: 154 EVRCANRGCQRLVPQTCSADDSPCGYSYVYGGGAANTTAGLLAVDAFAFATVRADGVIFG 213

Query: 264 CGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLV-FGREAL 322
           C    +G      G++GLG G +S V QL     G FSY L        GS + F  +A 
Sbjct: 214 CAVATEG---DIGGVIGLGRGELSPVSQL---QIGRFSYYLAPDDAVDVGSFILFLDDAK 267

Query: 323 PVGAAWV--PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAV 380
           P  +  V  PLV +  + S YYV L+G+ V G  + I    F L   G  GVV+     V
Sbjct: 268 PRTSRAVSTPLVASRASRSLYYVELAGIRVDGEDLAIPRGTFDLQADGSGGVVLSITIPV 327

Query: 381 TRLPTPAYEAFRDAFVAQTGNLPRASGVSI-FDTCYNLSGFVSVRVPTVSFYFSGGPVLT 439
           T L   AY+  R A  ++   L  A G  +  D CY      + +VP+++  F+GG V+ 
Sbjct: 328 TFLDAGAYKVVRQAMASKI-ELRAADGSELGLDLCYTSESLATAKVPSMALVFAGGAVME 386

Query: 440 LPASNFLIPVDDAGTFCFAFAPSPSGL-SIIGNIQQEGIQISFD 482
           L   N+       G  C    PSP+G  S++G++ Q G  + +D
Sbjct: 387 LEMGNYFYMDSTTGLECLTILPSPAGDGSLLGSLIQVGTHMIYD 430


>gi|449456068|ref|XP_004145772.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
 gi|449496218|ref|XP_004160076.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 500

 Score =  136 bits (343), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 111/368 (30%), Positives = 172/368 (46%), Gaps = 42/368 (11%)

Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD---PV--FDPADSASFSG 208
           G Y+ R+ +G+PP+  Y+ ID+GSD++WV C  C+ C   S    P+  FDP  S + S 
Sbjct: 81  GLYYTRVQLGNPPKDFYVQIDTGSDVLWVSCNSCNGCPATSGLQIPLNFFDPGSSTTASL 140

Query: 209 VSCSSAVCD---RLENAGC--HAGRCRYEVSYGDGSYTKGTLALETLTIGRTV------- 256
           VSCS  +C    +  ++ C   + +C Y   YGDGS T G   ++ + +   +       
Sbjct: 141 VSCSDQICALGVQSSDSACFGQSNQCAYVFQYGDGSGTSGYYVMDMIHLDVVIDSSVTSN 200

Query: 257 -VKNVAIGCGHKNQGMFV----GAAGLLGLGGGSMSLVGQLG--GQTGGAFSYCLVSRGT 309
              +V  GC     G          G+ G G   +S++ QL   G     FS+CL  +G 
Sbjct: 201 SSASVVFGCSTSQTGDLTKSDRAVDGIFGFGQQDLSVISQLSSRGIAPKVFSHCL--KGD 258

Query: 310 GSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGD 369
            S G ++   E +     + PLV  P  P  Y + L  + V G  +PIS  +F  +    
Sbjct: 259 DSGGGILVLGEIVEPNVVYTPLV--PSQPH-YNLNLQSISVNGQVLPISPAVFATS--SS 313

Query: 370 DGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIF---DTCYNLSGFVSVRVP 426
            G ++D+GT +  L   AY    +AFV    N+   S  S+    + CY  S  VS   P
Sbjct: 314 QGTIIDSGTTLAYLAEEAY----NAFVVAVTNIVSQSTQSVVLKGNRCYVTSSSVSDIFP 369

Query: 427 TVSFYFSGGPVLTLPASNFLIPVDDAG---TFCFAFAPSP-SGLSIIGNIQQEGIQISFD 482
            VS  F+GG  L L A ++LI  +  G    +C  F   P  G++I+G++  +     +D
Sbjct: 370 QVSLNFAGGASLVLGAQDYLIQQNSVGGTTVWCIGFQKIPGQGITILGDLVLKDKIFIYD 429

Query: 483 GANGFVGF 490
            AN  +G+
Sbjct: 430 LANQRIGW 437


>gi|326501496|dbj|BAK02537.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 476

 Score =  136 bits (342), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 119/417 (28%), Positives = 184/417 (44%), Gaps = 56/417 (13%)

Query: 125 RLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQC 184
           R +  G+ AA  E+      + SG   G G+YFVR  VG+P +   +V D+GSD+ WV+C
Sbjct: 68  RETAAGSSAAAFEMP-----LTSGAYTGIGQYFVRFRVGTPAQPFLLVADTGSDLTWVKC 122

Query: 185 -QPCSQCYKQSDP---VFDPADSASFSGVSCSSAVCDR-----LENAGCHAGRCRYEVSY 235
            +P +   +        F P DS +++ +SC+S  C +     L         C Y+  Y
Sbjct: 123 RRPAANSSESGSGSGRAFRPEDSRTWAPISCASDTCTKSLPFSLATCPTPGSPCAYDYRY 182

Query: 236 GDGSYTKGTLALETLTIG---------RTVVKNVAIGCGHKNQG-MFVGAAGLLGLGGGS 285
            DGS  +GT+  E+ TI          +  +K + +GC     G  F  + G+L LG   
Sbjct: 183 KDGSAARGTVGTESATIALSGRGREERKAKLKGLVLGCTSSYTGPSFEVSDGVLSLGYSD 242

Query: 286 MSLVGQLGGQTGGAFSYCLVSRGTGSSGS--LVFG----------------------REA 321
           +S       +  G FSYCLV   +  + +  L FG                         
Sbjct: 243 VSFASHAASRFAGRFSYCLVDHLSPRNATSYLTFGPNPAVASSSSPSSPAPASCTAAAPR 302

Query: 322 LPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVT 381
               A   PL+ + R   FY V +  + V G  + I   ++ +   G  GV++D+GT++T
Sbjct: 303 PRPRARQTPLLLDRRMRPFYDVAVKAVSVAGQFLKIPRAVWDVDAGG--GVILDSGTSLT 360

Query: 382 RLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFV-SVRVPTVSFYFSGGPVLTL 440
            L  PAY A   A       LPR + +  F+ CYN +     V +P ++ +F+G   L  
Sbjct: 361 VLAKPAYRAVVAALSEGLAGLPRVT-MDPFEYCYNWTSPSGDVTLPKMAVHFAGAARLEP 419

Query: 441 PASNFLIPVDDA-GTFCFAFAPSP-SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           P  +++I  D A G  C      P  G+S+IGNI Q+     FD  N  + F  + C
Sbjct: 420 PGKSYVI--DAAPGVKCIGLQEGPWPGISVIGNILQQEHLWEFDIKNRRLKFQRSRC 474


>gi|194707632|gb|ACF87900.1| unknown [Zea mays]
          Length = 423

 Score =  136 bits (342), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 108/374 (28%), Positives = 174/374 (46%), Gaps = 41/374 (10%)

Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFSG 208
           G YF R+ +G+P +  ++ ID+GSDI+WV C PC+ C   S        F+P  S++ S 
Sbjct: 3   GLYFTRVKLGNPAKEFFVQIDTGSDILWVTCSPCTGCPTSSGLNIQLESFNPDSSSTASR 62

Query: 209 VSCSSAVCDR--------LENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKN- 259
           ++CS   C           + +   +  C Y  +YGDGS T G    +T+    TV+ N 
Sbjct: 63  ITCSDDRCTAGFQTGEAICQTSNSQSSPCGYTFTYGDGSGTSGYYVSDTMFF-ETVMGNE 121

Query: 260 --------VAIGCGHKNQGMFVGA----AGLLGLGGGSMSLVGQLG--GQTGGAFSYCLV 305
                   +  GC +   G    A     G+ G G   +S++ QL   G +   FS+CL 
Sbjct: 122 QTANSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCL- 180

Query: 306 SRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLT 365
            +G+ + G ++   E +  G  + PLV  P  P  Y + L  + V G ++PI   LF  T
Sbjct: 181 -KGSDNGGGILVLGEIVEPGLVYTPLV--PSQP-HYNLNLESIAVNGQKLPIDSSLF--T 234

Query: 366 QMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRV 425
                G ++D+GT +  L   AY+ F  A  A      R S VS    C+  S  V    
Sbjct: 235 TSNTQGTIVDSGTTLAYLADGAYDPFVSAIAAAVSPSVR-SLVSKGSQCFITSSSVDSSF 293

Query: 426 PTVSFYFSGGPVLTLPASNFLI---PVDDAGTFCFAFAPSP-SGLSIIGNIQQEGIQISF 481
           PTV+ YF GG  +++   N+L+    VD++  +C  +  +    ++I+G++  +     +
Sbjct: 294 PTVTLYFMGGVAMSVKPENYLLQQASVDNSVLWCIGWQRNQGQEITILGDLVLKDKIFVY 353

Query: 482 DGANGFVGFGPNVC 495
           D AN  +G+    C
Sbjct: 354 DLANMRMGWADYDC 367


>gi|356528623|ref|XP_003532899.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 507

 Score =  135 bits (340), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 109/408 (26%), Positives = 165/408 (40%), Gaps = 62/408 (15%)

Query: 147 SGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQC---------------------- 184
           +G D   GEYF  + VGSP +  ++  D+GS+  W  C                      
Sbjct: 102 AGRDDALGEYFTEVKVGSPGQRFWLAADTGSEFTWFNCVMRNATTTATTKKTRKNKTKKK 161

Query: 185 -------------QPCSQCYKQSDP---VFDPADSASFSGVSCSSAVCD-------RLEN 221
                        +   +   +S+P   VF P  S SF  V+C+S  C         L  
Sbjct: 162 HHHHSKRNRTRTTRRTKKKKAKSNPCKGVFCPHRSKSFQAVTCASQKCKIDLSQLFSLSL 221

Query: 222 AGCHAGRCRYEVSYGDGSYTKGTLALETLTI-----GRTVVKNVAIGCGHKNQG---MFV 273
               +  C Y++SY DGS  KG    +T+T+         + N+ IGC    +       
Sbjct: 222 CPKPSDPCLYDISYADGSSAKGFFGTDTITVDLKNGKEGKLNNLTIGCTKSMENGVNFNE 281

Query: 274 GAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGT--GSSGSLVFGREALPVGAAWVPL 331
              G+LGLG    S + +   + G  FSYCLV   +    S  L  G          +  
Sbjct: 282 DTGGILGLGFAKDSFIDKAAYEYGAKFSYCLVDHLSHRNVSSYLTIGGHHNAKLLGEIKR 341

Query: 332 VRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAF 391
                 P FY V + G+ +GG  + I   ++     G  G ++D+GT +T L  PAYE  
Sbjct: 342 TELILFPPFYGVNVVGISIGGQMLKIPPQVWDFNSQG--GTLIDSGTTLTALLVPAYEPV 399

Query: 392 RDAFVAQTGNLPRASGVSI--FDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPV 449
            +A +     + R +G      D C++  GF    VP + F+F+GG     P  +++I V
Sbjct: 400 FEALIKSLTKVKRVTGEDFGALDFCFDAEGFDDSVVPRLVFHFAGGARFEPPVKSYIIDV 459

Query: 450 DDAGTFCFAFAP--SPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
                 C    P     G S+IGNI Q+     FD +   +GF P++C
Sbjct: 460 APL-VKCIGIVPIDGIGGASVIGNIMQQNHLWEFDLSTNTIGFAPSIC 506


>gi|255571588|ref|XP_002526740.1| aspartic-type endopeptidase, putative [Ricinus communis]
 gi|223533929|gb|EEF35654.1| aspartic-type endopeptidase, putative [Ricinus communis]
          Length = 471

 Score =  135 bits (340), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 108/364 (29%), Positives = 170/364 (46%), Gaps = 36/364 (9%)

Query: 156 YFVRIGVGSPPRSQYMVIDSGSDIVWVQC-QP-CSQCYKQSDPVFDPADSASFSGVSCSS 213
           Y ++  +GSPP   Y + D+GS+IVW+QC  P C+ CYKQ  P+F+P  S++++   C  
Sbjct: 108 YVMKFNIGSPPVETYAIPDTGSNIVWIQCGSPICTNCYKQKIPLFNPTKSSTYAIRLCGH 167

Query: 214 AVCDRL-----ENAGCHAG--RCRYEVSYGDGSYTKGTLALETLTIGRTVVK------NV 260
             C +      E  GC +    CRY +SY D S+++GT++ + +T    + +       +
Sbjct: 168 RECKQALWGLGEYLGCKSSVQVCRYHISYEDHSFSEGTISTDIITFPEHIAEFGNYSLRM 227

Query: 261 AIGCGHKNQGM------FVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCL----VSRGTG 310
             GCG+ N            A G++GLG    SLVGQL   T G FSYC+    V +  G
Sbjct: 228 FFGCGYNNSETPGQDPNSFTAPGVVGLGNEMASLVGQL---TLGQFSYCISTPDVQKPNG 284

Query: 311 SSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIP-ISEDLFRLTQMGD 369
           +   + FG  A   G +   L  N     + +  + G+ V   ++    E +F+  + G 
Sbjct: 285 TI-EIRFGLAASISGHS-TALANNLEG-WYIFQNVDGIYVDDTKVKGYPEWVFQFAEGGI 341

Query: 370 DGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLP--RASGVSIFDTCYNLSGFVSVRVPT 427
            G++MD+GT  T L   A +A       Q    P  +    S +  CYN + F+   VP 
Sbjct: 342 GGLIMDSGTTYTELYFSALDALIGELKEQIELAPDTQDHSNSNYSLCYNAANFLLTYVPA 401

Query: 428 VSFYFSGGPVLTLPASNFLIPVDDAG-TFCFAFAPSPSGLSIIGNIQQEGIQISFDGANG 486
           +   F+       P +     +D+    +C A   + SG+SIIG  Q   I+I +D    
Sbjct: 402 IELKFTDNKEAYFPFTLRNAWIDNGNDQYCLAMFGT-SGISIIGIYQHRDIKIGYDLKYN 460

Query: 487 FVGF 490
            V F
Sbjct: 461 LVSF 464


>gi|413944032|gb|AFW76681.1| hypothetical protein ZEAMMB73_606599 [Zea mays]
          Length = 315

 Score =  135 bits (339), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 97/316 (30%), Positives = 142/316 (44%), Gaps = 28/316 (8%)

Query: 203 SASFSGVSCSSAVCDRLENAGCHAG-----RCRYEVSYGDGSYTKGTLALETLTIGR--- 254
           S++F  V+C   +C         A      +C Y  SYGD S T G +  +T T      
Sbjct: 2   SSTFKAVACPDPICRPSSGVSVSACAMENFQCFYLCSYGDRSITAGHIFKDTFTFMSPNG 61

Query: 255 --TVVKNVAIGCGHKNQGMFV-GAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGS 311
               V  +A GCG  N G+FV   +G+ G G G  SL  QL     G FSYCL       
Sbjct: 62  VPVAVSELAFGCGDYNTGLFVSNESGIAGFGRGPQSLPSQL---KVGRFSYCLTLVTESK 118

Query: 312 SGSLVFGREALPVGA--------AWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFR 363
           S  ++ G    P G            P++ NP  P+FYY+ L G+ VG  R+P  + +F 
Sbjct: 119 SSVVILGTPPDPDGLRAHTTGPFQSTPIIYNPLIPTFYYLSLEGITVGKTRLPFDKSVFA 178

Query: 364 LTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIF--DTCYNL-SGF 420
           L + G  G V+D+GT++T LP   +E  ++  VAQ   LPR           C+    G 
Sbjct: 179 LKKDGSGGTVIDSGTSLTTLPEAVFELLQEELVAQF-PLPRYDNTPEVGDRLCFRRPKGG 237

Query: 421 VSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAF-APSPSGLSIIGNIQQEGIQI 479
             V VP +  + +G   + LP  N+ +   D+G  C        + + +IGN QQ+ + +
Sbjct: 238 KQVPVPKLILHLAGAD-MDLPRDNYFVEEPDSGVMCLQINGAEDTTMVLIGNFQQQNMHV 296

Query: 480 SFDGANGFVGFGPNVC 495
            +D  N  + F P  C
Sbjct: 297 VYDVENNKLLFAPAQC 312


>gi|125595855|gb|EAZ35635.1| hypothetical protein OsJ_19925 [Oryza sativa Japonica Group]
          Length = 335

 Score =  134 bits (338), Expect = 8e-29,   Method: Compositional matrix adjust.
 Identities = 102/278 (36%), Positives = 138/278 (49%), Gaps = 47/278 (16%)

Query: 112 MQRDVKRVATLVRRLSGGGADAA-KHEVQDFG-TDVVSGMDQGSGEYFVRIGVGSPPR-- 167
           +  D  RVA + +RL+G   D A  H+  + G T VVS +   +G      G+G  P   
Sbjct: 5   LDADQLRVAYIQKRLAGDTGDGADPHKFVEGGDTHVVSSLQVATGA-----GIGQKPHLT 59

Query: 168 --------------------SQYMVIDSGSDIVWVQCQPCSQ--CYKQSDPVFDPADSAS 205
                               SQ ++IDSGSD+ WVQCQPC    C+ Q DP+FDPA S +
Sbjct: 60  TTRLGTTATTNSAPDGTSAVSQTVIIDSGSDVPWVQCQPCPLLVCHPQRDPLFDPATSTT 119

Query: 206 FSGVSCSSAVCDRL--ENAGCHAG-RCRYEVSYGDGSYTKGTLALETLTIG-RTVVKNVA 261
           ++ V CSSA C RL     GC A  +C++ ++Y +G+   GT + + LT+G   VV+   
Sbjct: 120 YAAVPCSSAACARLGPYRRGCLANSQCQFGITYANGATATGTYSSDDLTLGPYDVVRGFL 179

Query: 262 IGCGHKNQG--MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGR 319
            GC H +QG       AG L LGGGS S V Q   Q    FSYC V   T S G ++FG 
Sbjct: 180 FGCAHADQGSTFSYDVAGTLALGGGSQSFVQQTASQYSRVFSYC-VPPSTSSFGFIMFGV 238

Query: 320 EALPVGAAWVP-LVRNP------RAPSFYYVGLSGLGV 350
              P  AA VP  V  P       +P+FY + L  + +
Sbjct: 239 P--PQRAALVPTFVSTPLLSSSTMSPTFYSITLPSIAL 274



 Score = 38.9 bits (89), Expect = 5.9,   Method: Compositional matrix adjust.
 Identities = 23/78 (29%), Positives = 38/78 (48%), Gaps = 8/78 (10%)

Query: 420 FVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGL--SIIGNIQQEGI 477
           F S+ +P+++  F GG  + L A+  L+        C AFAP+ S      IGN+QQ  +
Sbjct: 264 FYSITLPSIALVFDGGATVNLDAAGILL------QGCLAFAPTASDRMPGFIGNVQQRTL 317

Query: 478 QISFDGANGFVGFGPNVC 495
           ++ +D     + F    C
Sbjct: 318 EVVYDVPGKAIRFRSAAC 335


>gi|297828736|ref|XP_002882250.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297328090|gb|EFH58509.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 488

 Score =  134 bits (338), Expect = 8e-29,   Method: Compositional matrix adjust.
 Identities = 107/369 (28%), Positives = 173/369 (46%), Gaps = 39/369 (10%)

Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPV----FDPADSASFSGV 209
           G YF +IG+G+P R  ++ +D+GSDI+WV C  C +C ++SD V    +D   S++   V
Sbjct: 83  GLYFAKIGLGTPSRDFHVQVDTGSDILWVNCAGCIRCPRKSDLVELTPYDADASSTAKSV 142

Query: 210 SCSSAVCDRL-ENAGCHAGR-CRYEVSYGDGSYTKGTLA-----LETLTIGR---TVVKN 259
           SCS   C  + + + CH+G  C+Y + YGDGS T G L      L+ +T  R   +    
Sbjct: 143 SCSDNFCSYVNQRSECHSGSTCQYVILYGDGSSTNGYLVRDVVHLDLVTGNRQTGSTNGT 202

Query: 260 VAIGCGHKNQGMF----VGAAGLLGLGGGSMSLVGQLG--GQTGGAFSYCLVSRGTGSSG 313
           +  GCG K  G          G++G G  + S + QL   G+   +F++CL +   G  G
Sbjct: 203 IIFGCGSKQSGQLGESQAAVDGIMGFGQSNSSFISQLASQGKVKRSFAHCLDNNNGG--G 260

Query: 314 SLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDD-GV 372
               G    P      P++      + Y V L+ + VG   + +S D F     GDD GV
Sbjct: 261 IFAIGEVVSP-KVKTTPMLSK---SAHYSVNLNAIEVGNSVLQLSSDAF---DSGDDKGV 313

Query: 373 VMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYF 432
           ++D+GT +  LP   Y    +  +A    L   +    F TC++    +  R PTV+F F
Sbjct: 314 IIDSGTTLVYLPDAVYNPLMNQILASHQELNLHTVQDSF-TCFHYIDRLD-RFPTVTFQF 371

Query: 433 SGGPVLTLPASNFLIPVDDAGTFCFAF------APSPSGLSIIGNIQQEGIQISFDGANG 486
                L +    +L  V +  T+CF +          + L+I+G++      + +D  N 
Sbjct: 372 DKSVSLAVYPQEYLFQVRE-DTWCFGWQNGGLQTKGGASLTILGDMALSNKLVVYDIENQ 430

Query: 487 FVGFGPNVC 495
            +G+  + C
Sbjct: 431 VIGWTNHNC 439


>gi|125532792|gb|EAY79357.1| hypothetical protein OsI_34486 [Oryza sativa Indica Group]
          Length = 396

 Score =  134 bits (338), Expect = 9e-29,   Method: Compositional matrix adjust.
 Identities = 114/358 (31%), Positives = 165/358 (46%), Gaps = 34/358 (9%)

Query: 156 YFVRIGVGSPPRSQYMVIDSGSDIVWVQC-QPCSQCYKQSDPVFDPADSASFSGVSCSSA 214
           Y V + +G+PP+    +ID G ++VW QC Q C +C+KQ  P+FD   S++F    C +A
Sbjct: 51  YVVNLTIGTPPQPVSAIIDIGGELVWTQCAQHCRRCFKQDLPLFDTNASSTFRPEPCGAA 110

Query: 215 VCDRLEN---AGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQ-G 270
           VC+ +     AG   G C YE S   G  T G +  + + IG      +A GC   ++  
Sbjct: 111 VCESIPTRSCAGDGGGACGYEASTSFG-RTVGRIGTDAVAIGTAATARLAFGCAVASEMD 169

Query: 271 MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPV----GA 326
              G++G +GLG  ++SL  Q+      AFSYCL    TG S +L  G  A       GA
Sbjct: 170 TMWGSSGSVGLGRTNLSLAAQMNAT---AFSYCLAPPDTGKSSALFLGASAKLAGAGKGA 226

Query: 327 AWVPLVRNPRAP-----SFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVT 381
              P V+    P       Y + L  +  G   I        + Q G+  +++ T T VT
Sbjct: 227 GTTPFVKTSTPPHSGLSRSYLLRLEAIRAGNATI-------AMPQSGNT-IMVSTATPVT 278

Query: 382 RLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLP 441
            L    Y   R A     G  P    V  +D C+  +   S   P +   F GG  +T+P
Sbjct: 279 ALVDSVYRDLRKAVADAVGAAPVPPPVQNYDLCFPKAS-ASGGAPDLVLAFQGGAEMTVP 337

Query: 442 ASNFLIPVDDAG--TFCFAFAPSPS--GLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
            S++L    DAG  T C A   SP+  G+SI+G++QQ  I + FD     + F P  C
Sbjct: 338 VSSYLF---DAGNDTACVAILGSPALGGVSILGSLQQVNIHLLFDLDKETLSFEPADC 392


>gi|21717157|gb|AAM76350.1|AC074196_8 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433294|gb|AAP54832.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
          Length = 396

 Score =  134 bits (337), Expect = 9e-29,   Method: Compositional matrix adjust.
 Identities = 114/358 (31%), Positives = 165/358 (46%), Gaps = 34/358 (9%)

Query: 156 YFVRIGVGSPPRSQYMVIDSGSDIVWVQC-QPCSQCYKQSDPVFDPADSASFSGVSCSSA 214
           Y V + +G+PP+    +ID G ++VW QC Q C +C+KQ  P+FD   S++F    C +A
Sbjct: 51  YVVNLTIGTPPQPVSAIIDIGGELVWTQCAQHCRRCFKQDLPLFDTNASSTFRPEPCGAA 110

Query: 215 VCDRLEN---AGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQ-G 270
           VC+ +     AG   G C YE S   G  T G +  + + IG      +A GC   ++  
Sbjct: 111 VCESIPTRSCAGDGGGACGYEASTSFG-RTVGRIGTDAVAIGTAATARLAFGCAVASEMD 169

Query: 271 MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPV----GA 326
              G++G +GLG  ++SL  Q+      AFSYCL    TG S +L  G  A       GA
Sbjct: 170 TMWGSSGSVGLGRTNLSLAAQMNAT---AFSYCLAPPDTGKSSALFLGASAKLAGAGKGA 226

Query: 327 AWVPLVRNPRAPS-----FYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVT 381
              P V+    P+      Y + L  +  G   I        + Q G+  + + T T VT
Sbjct: 227 GTTPFVKTSTPPNSGLSRSYLLRLEAIRAGNATI-------AMPQSGNT-ITVSTATPVT 278

Query: 382 RLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLP 441
            L    Y   R A     G  P    V  +D C+  +   S   P +   F GG  +T+P
Sbjct: 279 ALVDSVYRDLRKAVADAVGAAPVPPPVQNYDLCFPKAS-ASGGAPDLVLAFQGGAEMTVP 337

Query: 442 ASNFLIPVDDAG--TFCFAFAPSPS--GLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
            S++L    DAG  T C A   SP+  G+SI+G++QQ  I + FD     + F P  C
Sbjct: 338 VSSYLF---DAGNDTACVAILGSPALGGVSILGSLQQVNIHLLFDLDKETLSFEPADC 392


>gi|326512066|dbj|BAJ96014.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 485

 Score =  134 bits (337), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 101/336 (30%), Positives = 163/336 (48%), Gaps = 35/336 (10%)

Query: 147 SGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPA 201
           +G+   +G YF +IG+G+P +S Y+ +D+GSDI+WV C  C  C ++S       ++DP+
Sbjct: 72  NGLPTETGLYFTQIGIGTPAKSYYVQVDTGSDILWVNCVFCDTCPRKSGLGIELTLYDPS 131

Query: 202 DSASFSGVSCSSAVCDRLEN----AGCHAGRCRYEVSYGDGSYTKGTLALETLTIG---- 253
            S+S +GV+C    C         +   A  C+Y +SYGDGS T G    + L       
Sbjct: 132 GSSSGTGVTCGQDFCVATHGGVIPSCVPAAPCQYSISYGDGSSTTGFFVTDFLQYNQVSG 191

Query: 254 --RTVVKNVAI--GCGHKNQGMFVGAA----GLLGLGGGSMSLVGQL--GGQTGGAFSYC 303
             +T + N +I  GCG K  G    ++    G+LG G  + S++ QL   G+    F++C
Sbjct: 192 NSQTTLANTSITFGCGAKIGGDLGSSSQALDGILGFGQSNSSMLSQLAAAGKVRKVFAHC 251

Query: 304 LVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFR 363
           L +   G  G    G    P   +  PLV  P  P  Y V L  + VGG+++ +  ++F 
Sbjct: 252 LDTINGG--GIFAIGDVVQP-KVSTTPLV--PGMPH-YNVNLEAIDVGGVKLQLPTNIFD 305

Query: 364 LTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSV 423
           + +    G ++D+GT +  LP   Y A      AQ G++P  +       C+  SG V  
Sbjct: 306 IGE--SKGTIIDSGTTLAYLPGVVYNAIMSKVFAQYGDMPLKNDQDF--QCFRYSGSVDD 361

Query: 424 RVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAF 459
             P ++F+F GG  L +   ++L    +   +C  F
Sbjct: 362 GFPIITFHFEGGLPLNIHPHDYLF--QNGELYCMGF 395


>gi|356565521|ref|XP_003550988.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 634

 Score =  134 bits (337), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 99/362 (27%), Positives = 166/362 (45%), Gaps = 35/362 (9%)

Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
           +G Y  R+ +G+PP+   +++D+GS + +V C  C QC +  DP F P  S+++  V C+
Sbjct: 81  NGYYTTRLWIGTPPQMFALIVDTGSTVTYVPCSTCEQCGRHQDPKFQPESSSTYQPVKCT 140

Query: 213 SAVCDRLENAGCHAGR--CRYEVSYGDGSYTKGTLALETLTIG---RTVVKNVAIGCGHK 267
                   +  C + R  C YE  Y + S + G L  + ++ G       +    GC + 
Sbjct: 141 I-------DCNCDSDRMQCVYERQYAEMSTSSGVLGEDLISFGNQSELAPQRAVFGCENV 193

Query: 268 NQGMFVG--AAGLLGLGGGSMSLVGQLGGQT--GGAFSYCLVSRGTGSSGSLVFGREALP 323
             G      A G++GLG G +S++ QL  +     +FS C      G  G++V G  + P
Sbjct: 194 ETGDLYSQHADGIMGLGRGDLSIMDQLVDKNVISDSFSLCYGGMDVG-GGAMVLGGISPP 252

Query: 324 --VGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVT 381
             +  A+   VR+P    +Y + L  + V G R+P++ ++F     G  G V+D+GT   
Sbjct: 253 SDMAFAYSDPVRSP----YYNIDLKEIHVAGKRLPLNANVFD----GKHGTVLDSGTTYA 304

Query: 382 RLPTPAYEAFRDAFVAQTGNLPRASG--VSIFDTCYNLSGF----VSVRVPTVSFYFSGG 435
            LP  A+ AF+DA V +  +L + SG   +  D C++ +G     +S   P V   F  G
Sbjct: 305 YLPEAAFLAFKDAIVKELQSLKKISGPDPNYNDICFSGAGIDVSQLSKSFPVVDMVFENG 364

Query: 436 PVLTLPASNFLIPVDDA-GTFCF-AFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPN 493
              TL   N++       G +C   F       +++G I      + +D     +GF   
Sbjct: 365 QKYTLSPENYMFRHSKVRGAYCLGVFQNGNDQTTLLGGIIVRNTLVVYDREQTKIGFWKT 424

Query: 494 VC 495
            C
Sbjct: 425 NC 426


>gi|359474399|ref|XP_003631454.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 485

 Score =  134 bits (337), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 127/455 (27%), Positives = 187/455 (41%), Gaps = 78/455 (17%)

Query: 109 HARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGS-PPR 167
           H+  +        L++  S   A    H  +     +  G D     Y +   +GS PP+
Sbjct: 31  HSLSKSQFNSTPHLLKFTSARSATRFHHRHRQISLPLSPGSD-----YTLSFNLGSHPPQ 85

Query: 168 SQYMVIDSGSDIVWVQCQP--CSQCYKQSDPV----FDPADSASFSGVSCSSAVCDRLEN 221
              + +D+GSD+VW  C P  C  C  + D        P +  S + VSC S  C     
Sbjct: 86  PISLYMDTGSDLVWFPCAPFECILCEGKYDTAATGGLSPPNITSSASVSCKSPACSAAHT 145

Query: 222 AG-----CHAGRCRYEV----------------SYGDGSYTKGTLALETLTIGRT---VV 257
           +      C   RC  E+                +YGDGS     L  ++L++  +   V+
Sbjct: 146 SLSSSDLCAMARCPLELIETSDCSSFSCPPFYYAYGDGSLV-ARLYRDSLSMPASSPLVL 204

Query: 258 KNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGG---QTGGAFSYCLVSRGTGSS-- 312
            N   GC H   G  VG AG    G G +SL  QL       G  FSYCLVS    +   
Sbjct: 205 HNFTFGCAHTALGEPVGVAGF---GRGVLSLPAQLASFSPHLGNQFSYCLVSHSFDADRV 261

Query: 313 ---GSLVFGREALP------VGA-----AWVPLVRNPRAPSFYYVGLSGLGVGGMRIPIS 358
                L+ GR +L       VG       +  ++ NP+ P FY VGL G+ VG  +IP+ 
Sbjct: 262 RRPSPLILGRYSLDDEKKKRVGHDRGEFVYTAMLDNPKHPYFYCVGLEGITVGNRKIPVP 321

Query: 359 EDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNL-PRASGVSI---FDTC 414
           E L R+ + G+ G+V+D+GT  T LP   YE+    F  + G +  RA+ +        C
Sbjct: 322 EILKRVDRRGNGGMVVDSGTTFTMLPAGLYESLVTEFNHRMGRVYKRATQIEERTGLGPC 381

Query: 415 YNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDA--------GTFCFAF------A 460
           Y  S   + +VP V+ +F G   + LP +N+     D            C         A
Sbjct: 382 Y-YSDDSAAKVPAVALHFVGNSTVILPRNNYYYEFFDGRDGQKKKRKVGCLMLMNGGDEA 440

Query: 461 PSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
            S    + +GN QQ+G ++ +D     VGF    C
Sbjct: 441 ESGGPAATLGNYQQQGFEVVYDLEKHRVGFARRKC 475


>gi|413951979|gb|AFW84628.1| putative aspartic protease family protein [Zea mays]
          Length = 435

 Score =  134 bits (337), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 114/369 (30%), Positives = 166/369 (44%), Gaps = 38/369 (10%)

Query: 158 VRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVC- 216
           V + VG+PP++  MV+D+GS++ W+ C         +D  F P  SA+F+ V C SA C 
Sbjct: 63  VSLAVGTPPQNVTMVLDTGSELSWLLCATGRAAAAAAD-SFRPRASATFAAVPCGSARCS 121

Query: 217 --DRLENAGCHAG--RCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGC---GHKNQ 269
             D      C A   RCR  +SY DGS + G LA +   +G       A GC    + + 
Sbjct: 122 SRDLPAPPSCDAASRRCRVSLSYADGSASDGALATDVFAVGDAPPLRSAFGCMSAAYDSS 181

Query: 270 GMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALP-VGAAW 328
              V  AGLLG+  G++S V Q   +    FSYC+  R    +G L+ G   LP +   +
Sbjct: 182 PDAVATAGLLGMNRGALSFVTQASTRR---FSYCISDR--DDAGVLLLGHSDLPFLPLNY 236

Query: 329 VPLVR-NPRAPSF----YYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRL 383
            PL +  P  P F    Y V L G+ VGG  +PI   +      G    ++D+GT  T L
Sbjct: 237 TPLYQPTPPLPYFDRVAYSVQLLGIRVGGKPLPIPPSVLAPDHTGAGQTMVDSGTQFTFL 296

Query: 384 PTPAYEAFRDAFVAQTGNL------PRASGVSIFDTCYNLSG---FVSVRVPTVSFYFSG 434
              AY A +  F+ QT  L      P  +    FDTC+ +       S R+P V+  F+G
Sbjct: 297 LGDAYSAVKAEFLKQTKPLLPALEDPSFAFQEAFDTCFRVPKGRPPPSARLPPVTLLFNG 356

Query: 435 GPVLTLPASNFLIPVDDA-----GTFCFAFAPS---PSGLSIIGNIQQEGIQISFDGANG 486
              +++     L  V        G +C  F  +   P    +IG+  Q  + + +D   G
Sbjct: 357 A-QMSVAGDRLLYKVPGERRGADGVWCLTFGNADMVPLTAYVIGHHHQMNLWVEYDLERG 415

Query: 487 FVGFGPNVC 495
            VG  P  C
Sbjct: 416 RVGLAPVKC 424


>gi|218187618|gb|EEC70045.1| hypothetical protein OsI_00635 [Oryza sativa Indica Group]
          Length = 570

 Score =  134 bits (337), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 122/368 (33%), Positives = 171/368 (46%), Gaps = 51/368 (13%)

Query: 144 DVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQP-CSQCYKQSDPV--FDP 200
           DVVS +   S EY + + +GSPPRS   + D+GSD+VWV+C+   +     + P   FDP
Sbjct: 89  DVVSKVVSRSFEYLMTVNLGSPPRSMLAIADTGSDLVWVKCKKGNNDTSSAAAPTTQFDP 148

Query: 201 ADSASFSGVSCSSAVCDRLENAGCHAG-RCRYEVSYGDGSYTKGTLALETLTI-----GR 254
           + S+++  VSC +  C+ L  A C  G  C Y  +YGDGS T G L+ ET T      GR
Sbjct: 149 SRSSTYGRVSCQTDACEALGRATCDDGSNCAYLYAYGDGSNTTGVLSTETFTFDDGGAGR 208

Query: 255 TV----VKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQT--GGAFSYCLVSRG 308
           +     +  V  GC     G F  A GL+GLGGG++SLV QLGG T  G  FSYCLV   
Sbjct: 209 SPRQVRIGGVKFGCSTATAGSF-PADGLVGLGGGAVSLVTQLGGATSLGRRFSYCLVPHS 267

Query: 309 TGSSGSLVFG--REALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQ 366
             +S +L FG   +    GAA  PLV N    S                           
Sbjct: 268 VNASSALNFGALADVTEPGAASTPLVGNKTVAS--------------------------- 300

Query: 367 MGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGF---VSV 423
                +++D+GT +T L         D    +    P  S   +   CYN++G       
Sbjct: 301 AASSRIIVDSGTTLTFLDPSLLGPIVDELSRRITLPPVQSPDGLLQLCYNVAGREVEAGE 360

Query: 424 RVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPS--GLSIIGNIQQEGIQISF 481
            +P ++  F GG  + L   N  + V + GT C A   +     +SI+GN+ Q+ I + +
Sbjct: 361 SIPDLTLEFGGGAAVALKPENAFVAVQE-GTLCLAIVATTEQQPVSILGNLAQQNIHVGY 419

Query: 482 DGANGFVG 489
           D   G VG
Sbjct: 420 DLDAGTVG 427


>gi|15232960|ref|NP_186923.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|6728988|gb|AAF26986.1|AC018363_31 putative aspartyl protease [Arabidopsis thaliana]
 gi|21593593|gb|AAM65560.1| putative aspartyl protease [Arabidopsis thaliana]
 gi|332640332|gb|AEE73853.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 488

 Score =  134 bits (336), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 111/394 (28%), Positives = 182/394 (46%), Gaps = 42/394 (10%)

Query: 132 DAAKHEVQDFGTDVVSGMD---QGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS 188
           D  +H       D+  G D   +  G YF +IG+G+P R  ++ +D+GSDI+WV C  C 
Sbjct: 58  DVHRHSRLLSAIDIPLGGDSQPESIGLYFAKIGLGTPSRDFHVQVDTGSDILWVNCAGCI 117

Query: 189 QCYKQSDPV----FDPADSASFSGVSCSSAVCDRL-ENAGCHAGR-CRYEVSYGDGSYTK 242
           +C ++SD V    +D   S++   VSCS   C  + + + CH+G  C+Y + YGDGS T 
Sbjct: 118 RCPRKSDLVELTPYDVDASSTAKSVSCSDNFCSYVNQRSECHSGSTCQYVIMYGDGSSTN 177

Query: 243 GTLA-----LETLTIGR---TVVKNVAIGCGHKNQGMF----VGAAGLLGLGGGSMSLVG 290
           G L      L+ +T  R   +    +  GCG K  G          G++G G  + S + 
Sbjct: 178 GYLVKDVVHLDLVTGNRQTGSTNGTIIFGCGSKQSGQLGESQAAVDGIMGFGQSNSSFIS 237

Query: 291 QLG--GQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGL 348
           QL   G+   +F++CL +   G  G    G    P      P++      + Y V L+ +
Sbjct: 238 QLASQGKVKRSFAHCLDNNNGG--GIFAIGEVVSP-KVKTTPMLSK---SAHYSVNLNAI 291

Query: 349 GVGGMRIPISEDLFRLTQMGDD-GVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASG 407
            VG   + +S + F     GDD GV++D+GT +  LP   Y    +  +A    L   + 
Sbjct: 292 EVGNSVLELSSNAF---DSGDDKGVIIDSGTTLVYLPDAVYNPLLNEILASHPELTLHTV 348

Query: 408 VSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAF------AP 461
              F TC++ +  +  R PTV+F F     L +    +L  V +  T+CF +        
Sbjct: 349 QESF-TCFHYTDKLD-RFPTVTFQFDKSVSLAVYPREYLFQVRE-DTWCFGWQNGGLQTK 405

Query: 462 SPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
             + L+I+G++      + +D  N  +G+  + C
Sbjct: 406 GGASLTILGDMALSNKLVVYDIENQVIGWTNHNC 439


>gi|222640709|gb|EEE68841.1| hypothetical protein OsJ_27628 [Oryza sativa Japonica Group]
          Length = 375

 Score =  134 bits (336), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 108/373 (28%), Positives = 167/373 (44%), Gaps = 63/373 (16%)

Query: 148 GMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQ---CYKQSDPVFD---PA 201
           G DQG   + + +G+  P   + +++D+GSD++W QC+  S      +   P      PA
Sbjct: 38  GSDQG---HSLTVGIVQP---RKLIVDTGSDLIWTQCKLSSSTAAAARHGSPPLSRTAPA 91

Query: 202 DSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIG--RTVVKN 259
            + +F+    +SA                            G LA ET T G  R V   
Sbjct: 92  RTGAFTRTCTASAAA-------------------------VGVLASETFTFGARRAVSLR 126

Query: 260 VAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFG- 318
           +  GCG  + G  +GA G+LGL   S+SL+ QL  Q    FSYCL       +  L+FG 
Sbjct: 127 LGFGCGALSAGSLIGATGILGLSPESLSLITQLKIQR---FSYCLTPFADKKTSPLLFGA 183

Query: 319 -------REALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDG 371
                  +   P+    +  V NP    +YYV L G+ +G  R+ +      +   G  G
Sbjct: 184 MADLSRHKTTRPIQTTAI--VSNPVETVYYYVPLVGISLGHKRLAVPAASLAMRPDGGGG 241

Query: 372 VVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRAS-GVSIFDTCYNL------SGFVSVR 424
            ++D+G+ V  L   A+EA ++A V     LP A+  V  ++ C+ L      +   +V+
Sbjct: 242 TIVDSGSTVAYLVEAAFEAVKEA-VMDVVRLPVANRTVEDYELCFVLPRRTAAAAMEAVQ 300

Query: 425 VPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSP--SGLSIIGNIQQEGIQISFD 482
           VP +  +F GG  + LP  N+      AG  C A   +   SG+SIIGN+QQ+ + + FD
Sbjct: 301 VPPLVLHFDGGAAMVLPRDNYFQE-PRAGLMCLAVGKTTDGSGVSIIGNVQQQNMHVLFD 359

Query: 483 GANGFVGFGPNVC 495
             +    F P  C
Sbjct: 360 VQHHKFSFAPTQC 372


>gi|297795137|ref|XP_002865453.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297311288|gb|EFH41712.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 665

 Score =  134 bits (336), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 94/357 (26%), Positives = 157/357 (43%), Gaps = 26/357 (7%)

Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
           +G Y  R+ +G+PP+   +++D+GS + +V C  C QC K  DP F P  S+S+  + C+
Sbjct: 77  NGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCSTCKQCGKHQDPKFQPELSSSYKALKCN 136

Query: 213 SAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIG---RTVVKNVAIGCGHKNQ 269
                   N       C YE  Y + S + G L+ + ++ G   +   +    GC +   
Sbjct: 137 PDC-----NCDDEGKLCVYERRYAEMSSSSGVLSEDLISFGNESQLTPQRAVFGCENVET 191

Query: 270 GMFVG--AAGLLGLGGGSMSLVGQL--GGQTGGAFSYCLVSRGTGSSGSLVFGREALPVG 325
           G      A G++GLG G +S+V QL   G     FS C      G  G++V G+ + P G
Sbjct: 192 GDLFSQRADGIMGLGRGKLSVVDQLVDKGVIEDVFSLCYGGMEVG-GGAMVLGKISPPAG 250

Query: 326 AAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPT 385
             +     +P    +Y + L  + V G  + ++  +F     G  G V+D+GT     P 
Sbjct: 251 MVFSH--SDPFRSPYYNIDLKQMHVAGKSLKLNPKVFN----GKHGTVLDSGTTYAYFPK 304

Query: 386 PAYEAFRDAFVAQTGNLPRASG--VSIFDTCYNLSGFVSVRV----PTVSFYFSGGPVLT 439
            A+ A +DA + +  +L R  G   +  D C++ +G     +    P +   F  G  L 
Sbjct: 305 EAFIAIKDAIIKEIPSLKRIHGPDPNYDDVCFSGAGRDVAEIHNFFPEIDMEFGNGQKLI 364

Query: 440 LPASNFLIPVDDA-GTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           L   N+L       G +C    P     +++G I      +++D  N  +GF    C
Sbjct: 365 LSPENYLFRHTKVRGAYCLGIFPDRDSTTLLGGIVVRNTLVTYDRENDKLGFLKTNC 421


>gi|255537017|ref|XP_002509575.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223549474|gb|EEF50962.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 459

 Score =  133 bits (335), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 117/434 (26%), Positives = 193/434 (44%), Gaps = 63/434 (14%)

Query: 82  ELVHRDKMSSSSNTTNNMHYHRHQHSF---HARMQ--RDVKRVATLVRRLSGGGADAAKH 136
           +L+HRD + S +   N+    R +      +AR    + + +  + V    GG   AA  
Sbjct: 38  KLIHRDSIFSPAYNPNDSIKDRAKRMLKNSNARFDYVQAISKRNSAVVDYDGGDTSAADD 97

Query: 137 EVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDP 196
               +   ++S +      + V   +G PP  QY V+D+GS + W+QC+PC  C++Q  P
Sbjct: 98  A---YEASLLSEL----CTFLVNFSIGQPPVPQYAVMDTGSSLTWIQCEPCINCHQQKGP 150

Query: 197 VFDPADSASFSGVSCSSAVCDRLEN--AGCHAGRCRYEVSYGDGSYTKGTLALETLTI-- 252
           +++P+ S++      S +  DR +      H   C Y  +Y D + T+GT A E L    
Sbjct: 151 LYNPSSSST----YVSCSDFDRTDTTFTATHGSDCNYSQTYADKTTTRGTYAREQLLFET 206

Query: 253 ---GRTVVKNVAIGCGHKNQ---GMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS 306
              G T++ +V  GCGH N    G    A+G+ GLG    S++ +L    G  FSYC+  
Sbjct: 207 PDDGITIMHDVIFGCGHNNTQLPGPTGYASGVFGLGDSGSSIISKL----GFGFSYCI-- 260

Query: 307 RGTGSSGSLVFGREALPVGAAW------VPLVRNPRAPSFYYVGLSGLGVGGMRIPISED 360
              G+ G  ++G   L +G          PLV  PR    YY+ L G+ +G  R+ I   
Sbjct: 261 ---GNIGDPLYGFHRLTLGNKLKIEGYSTPLV--PRG--LYYITLVGISIGQERLDIDPI 313

Query: 361 LFRLTQMG--DDGVVMDTGTAVTRLPTPAYEAFRDAFVA-QTGNLPRASGVSI-FDTCY- 415
           +F+   +      +V+D+G  ++ +P  AY   RD   +  +G L R   ++     CY 
Sbjct: 314 VFQRVDLNGISSRIVIDSGATLSYIPRQAYNVVRDKVSSILSGFLSRYRYIARHLSLCYI 373

Query: 416 -----NLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLS--I 468
                +L GF     P  +F+ + G  L            D    C A  P+ S     +
Sbjct: 374 GKLNQDLQGF-----PDATFHLADGADLVFQVEGLFFQYTD-NVLCLALVPTESDEETCL 427

Query: 469 IGNIQQEGIQISFD 482
           IG + Q+   +++D
Sbjct: 428 IGLLAQQYYNVAYD 441


>gi|255585473|ref|XP_002533429.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223526717|gb|EEF28949.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 448

 Score =  133 bits (335), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 121/411 (29%), Positives = 189/411 (45%), Gaps = 57/411 (13%)

Query: 101 YHRHQHSFHARMQRDVK----RVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEY 156
           Y     S   R +R VK    R+A L  ++ G         + DF  +++    +    +
Sbjct: 48  YFNPNASVAERAERIVKTSATRIAYLYAQIKG------DIHMNDFELNLLPSTYEPL--F 99

Query: 157 FVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVC 216
            V   +G P   Q  ++D+GS+I+WV+C PC +C +Q+ P+ DP+ S++++ + C++ +C
Sbjct: 100 LVNFSMGQPATPQLAIMDTGSNILWVRCAPCKRCTQQNGPLLDPSKSSTYASLPCTNTMC 159

Query: 217 DRLENAGCH-AGRCRYEVSYGDGSYTKGTLALETLTI-----GRTVVKNVAIGCGHKNQG 270
               +A C+   +C Y +SY  G  + G LA E L       G   V +V  GC H+N G
Sbjct: 160 HYAPSAYCNRLNQCGYNLSYATGLSSAGVLATEQLIFHSSDEGVNAVPSVVFGCSHEN-G 218

Query: 271 MFVGA--AGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSG--SLVFGREALPVGA 326
            +      G+ GLG G  S V ++G +    FSYCL +      G   LVFG +A   G 
Sbjct: 219 DYKDRRFTGVFGLGKGITSFVTRMGSK----FSYCLGNIADPHYGYNQLVFGEKANFEGY 274

Query: 327 AWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTP 386
           +    V N      YYV L G+ VG  R+ I    F + +  +   ++D+GTA+T L   
Sbjct: 275 STPLKVVNGH----YYVTLEGISVGEKRLDIDSTAFSM-KGNEKSALIDSGTALTWLAES 329

Query: 387 AYEAFRDAFVAQTGN---LPRASGVSIFDTCYNLSGFVS---VRVPTVSFYFSGGPVLTL 440
           A+ A  D  V Q  +   +P   G      CY   G VS   +  P V+F+FSGG  L L
Sbjct: 330 AFRAL-DNEVRQLLDGVLMPFWRGSF---ACYK--GTVSQDLIGFPVVTFHFSGGADLDL 383

Query: 441 PASNFL---------IPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFD 482
              +           I V  A     A+       S+IG + Q+   +++D
Sbjct: 384 DTESMFYQATPDILCIAVRQAS----AYGNDFKSFSVIGLMAQQYYNMAYD 430


>gi|326511104|dbj|BAK03251.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 631

 Score =  133 bits (334), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 110/389 (28%), Positives = 175/389 (44%), Gaps = 43/389 (11%)

Query: 126 LSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQ 185
           L+ GG  +A+  + D   D+++     +G Y  R+ +G+PP+   +++DSGS + +V C 
Sbjct: 66  LAEGGRPSARMRLHD---DLLT-----NGYYTTRLHIGTPPQEFALIVDSGSTVTYVPCA 117

Query: 186 PCSQCYKQSDPVFDPADSASFSGVSCS-SAVCDRLENAGCHAGRCRYEVSYGDGSYTKGT 244
            C QC    DP F P  S+++S V C+    CD  +N      +C YE  Y + S + G 
Sbjct: 118 SCEQCGNHQDPRFQPDLSSTYSPVKCNVDCTCDSDKN------QCTYERQYAEMSSSSGV 171

Query: 245 LALETLTIG---RTVVKNVAIGCGHKNQGMFVG--AAGLLGLGGGSMSLVGQL--GGQTG 297
           L  + ++ G       +    GC +   G      A G++GLG G +S++ QL   G  G
Sbjct: 172 LGEDIVSFGTESELKPQRAVFGCENSETGDLFSQHADGIMGLGRGQLSIMDQLVDKGVIG 231

Query: 298 GAFSYCLVSRGTGSSGSLVFGREALPVGAAWV--PLVRNPRAPSFYYVGLSGLGVGGMRI 355
            +FS C      G  G++V G    P G  +     VR+P    +Y + L  + V G  +
Sbjct: 232 DSFSMCYGGMDIG-GGAMVLGAMPAPPGMIYTHSNAVRSP----YYNIELKEMHVAGKAL 286

Query: 356 PISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGV--SIFDT 413
            +   +F     G  G V+D+GT    LP  A+ AF+DA  +Q   L +  G   +  D 
Sbjct: 287 RVDPRIFD----GKHGTVLDSGTTYAYLPEQAFVAFKDAVSSQVHPLKKIRGPDSNYKDI 342

Query: 414 CY-----NLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDA-GTFCF-AFAPSPSGL 466
           C+     N+S    V  P V   F  G  L+L   N+L       G +C   F       
Sbjct: 343 CFAGAGRNVSQLSEV-FPKVDMVFGNGQKLSLSPENYLFRHSKVEGAYCLGVFQNGKDPT 401

Query: 467 SIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           +++G I      +++D  N  +GF    C
Sbjct: 402 TLLGGIVVRNTLVTYDRHNEKIGFWKTNC 430


>gi|21717154|gb|AAM76347.1|AC074196_5 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433293|gb|AAP54831.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
 gi|125532791|gb|EAY79356.1| hypothetical protein OsI_34485 [Oryza sativa Indica Group]
          Length = 397

 Score =  133 bits (334), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 108/357 (30%), Positives = 157/357 (43%), Gaps = 35/357 (9%)

Query: 162 VGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLEN 221
           +G+PP+    +ID   ++VW QC  CS+C+KQ  P+F P  S++F    C +  C     
Sbjct: 49  IGTPPQPASAIIDVAGELVWTQCSRCSRCFKQDLPLFIPNASSTFRPEPCGTDACKSTPT 108

Query: 222 AGCHAGRCRYEVSYG---DGSYTKGTLALETLTIGRTVVKNVAIGC-GHKNQGMFVGAAG 277
           + C    C YE +     D   T G +  ET  IG T   ++A GC    +     G +G
Sbjct: 109 SNCSGDVCTYESTTNIRLDRHTTLGIVGTETFAIG-TATASLAFGCVVASDIDTMDGTSG 167

Query: 278 LLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVG---AAWVPLVR- 333
            +GLG    SLV Q+       FSYCL  RGTG S  L  G  A   G    +  P ++ 
Sbjct: 168 FIGLGRTPRSLVAQMKLT---KFSYCLSPRGTGKSSRLFLGSSAKLAGGESTSTAPFIKT 224

Query: 334 NPRAPS--FYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAF 391
           +P   S  +Y + L  +  G   I         T      +VM T +  + L   AY AF
Sbjct: 225 SPDDDSHHYYLLSLDAIRAGNTTIA--------TAQSGGILVMHTVSPFSLLVDSAYRAF 276

Query: 392 RDAFVAQTG---NLPRASGVSIFDTCY-NLSGFVSVRVPTVSFYFS-GGPVLTLPASNFL 446
           + A     G     P A+    FD C+   +GF     P + F F  GG  LT+P + +L
Sbjct: 277 KKAVTEAVGGAAAPPMATPPQPFDLCFKKAAGFSRATAPDLVFTFQGGGAALTVPPAKYL 336

Query: 447 IPV-DDAGTFCFAFAPSP-------SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           I V ++  T C A             G+S++G++QQE +   +D     + F P  C
Sbjct: 337 IDVGEEKDTACAAILSMARLNRTGLEGVSVLGSLQQENVHFLYDLKKETLSFEPADC 393


>gi|356503843|ref|XP_003520712.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 474

 Score =  132 bits (333), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 115/388 (29%), Positives = 164/388 (42%), Gaps = 53/388 (13%)

Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQP---CSQC-YKQSD----PVFDPADSAS 205
           G Y + + +G+PP++   V+D+GS +VW  C     CS C +   D    P F P +S++
Sbjct: 90  GGYSIDLNLGTPPQTSPFVLDTGSSLVWFPCTSRYLCSHCNFPNIDTTKIPTFIPKNSST 149

Query: 206 FSGVSCSSAVCDRL--------------ENAGCHAGRCRYEVSYGDGSYTKGTLALETLT 251
              + C +  C  +              E+  C      Y + YG GS T G L L+ L 
Sbjct: 150 AKLLGCRNPKCGYIFGSDVQFRCPQCKPESQNCSLTCPAYIIQYGLGS-TAGFLLLDNLN 208

Query: 252 IGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR---G 308
                V    +GC   +       +G+ G G G  SL  Q+  +    FSYCLVS     
Sbjct: 209 FPGKTVPQFLVGCSILS---IRQPSGIAGFGRGQESLPSQMNLK---RFSYCLVSHRFDD 262

Query: 309 TGSSGSLVFGR----EALPVGAAWVPLVRNPRA--PSF---YYVGLSGLGVGGMRIPISE 359
           T  S  LV       +    G ++ P   NP    P+F   YY+ L  + VGG  + I  
Sbjct: 263 TPQSSDLVLQISSTGDTKTNGLSYTPFRSNPSTNNPAFKEYYYLTLRKVIVGGKDVKIPY 322

Query: 360 DLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQ-TGNLPRASGVSI---FDTCY 415
                   G+ G ++D+G+  T +  P Y      FV Q   N  RA           C+
Sbjct: 323 TFLEPGSDGNGGTIVDSGSTFTFMERPVYNLVAQEFVKQLEKNYSRAEDAETQSGLSPCF 382

Query: 416 NLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCF-------AFAPSPSGLSI 468
           N+SG  +V  P ++F F GG  +T P  N+   V DA   C        A  P  +G +I
Sbjct: 383 NISGVKTVTFPELTFKFKGGAKMTQPLQNYFSLVGDAEVVCLTVVSDGGAGPPKTTGPAI 442

Query: 469 I-GNIQQEGIQISFDGANGFVGFGPNVC 495
           I GN QQ+   I +D  N   GFGP  C
Sbjct: 443 ILGNYQQQNFYIEYDLENERFGFGPRSC 470


>gi|259490398|ref|NP_001159203.1| uncharacterized protein LOC100304289 [Zea mays]
 gi|223942623|gb|ACN25395.1| unknown [Zea mays]
          Length = 378

 Score =  132 bits (333), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 117/378 (30%), Positives = 178/378 (47%), Gaps = 35/378 (9%)

Query: 147 SGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPV--FDPADSA 204
           SG   G+G+YFVR  VG+P +   +V D+GSD+ WV+C+  +       P   F  ++S 
Sbjct: 5   SGAYTGTGQYFVRFRVGTPAQPFVLVADTGSDLTWVKCRGAAGPPASDPPAREFRASESR 64

Query: 205 SFSGVSCSSAVCD-----RLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIG------ 253
           S++ ++CSS  C       L N    A  C Y+  Y DGS  +G +  +  TI       
Sbjct: 65  SWAPLACSSDTCTSYVPFSLANCSSPASPCAYDYRYKDGSAARGVVGTDAATIALSGSGS 124

Query: 254 ---------RTVVKNVAIGCGHKNQGM-FVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYC 303
                    R  ++ V +GC     G  F  + G+L LG  ++S   +   + GG FSYC
Sbjct: 125 EDGSGGGGRRAKLQGVVLGCTATYDGQSFQSSDGVLSLGNSNISFASRAAARFGGRFSYC 184

Query: 304 LVS----RGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISE 359
           LV     R   S  +   G E     AA  PLV + R   FY V +  + V G  + I  
Sbjct: 185 LVDHLAPRNASSYLTFGPGPEGGGAPAARTPLVLDRRVSPFYAVAVDAVYVAGEALDIPA 244

Query: 360 DLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSG 419
           D++ + + G  G ++D+GT++T L TPAY A   A   +   LPR + +  F+ CYN + 
Sbjct: 245 DVWDVGRGG--GAILDSGTSLTVLATPAYRAVVAALGGRLAALPRVA-MDPFEYCYNWTA 301

Query: 420 FVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDA-GTFCFAFAP-SPSGLSIIGNIQQEGI 477
                +P +   F+G   L  PA +++I  D A G  C      +  G+S+IGNI Q+  
Sbjct: 302 GAP-EIPKLEVSFAGSARLEPPAKSYVI--DAAPGVKCIGVQEGAWPGVSVIGNILQQEH 358

Query: 478 QISFDGANGFVGFGPNVC 495
              FD  + ++ F    C
Sbjct: 359 LWEFDLRDRWLRFKHTRC 376


>gi|296085499|emb|CBI29231.3| unnamed protein product [Vitis vinifera]
          Length = 308

 Score =  132 bits (333), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 114/364 (31%), Positives = 161/364 (44%), Gaps = 87/364 (23%)

Query: 144 DVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADS 203
           D+ S +  G G Y + I +G+PP S   + D+GSD++W QC PC  CYKQ +P+FDP  S
Sbjct: 17  DIQSNVISGGGSYLMNISLGTPPVSMLGIADTGSDLIWRQCLPCDDCYKQVEPLFDPKKS 76

Query: 204 ASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT-----VVK 258
            ++                                  T G L+ ET TIG T        
Sbjct: 77  KTYK---------------------------------TLGYLSSETFTIGSTEGDPASFP 103

Query: 259 NVAIGCGHKNQGMF-VGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGS--L 315
            +A GCGH N G F    +GL+GLGGG +SLV QL  + GG FSYCLV   + S+ S  +
Sbjct: 104 GLAFGCGHSNGGTFNEKDSGLIGLGGGPLSLVMQLSSKVGGQFSYCLVPLSSDSTASSKI 163

Query: 316 VFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMD 375
            FG+ A+  G+                         G   P + +        +  +++D
Sbjct: 164 NFGKSAVVSGS-------------------------GTSSPAAAE--------ESNIIID 190

Query: 376 TGTAVTRLPTPAYEAFRDAFVA----QTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFY 431
           +GT +T LP   Y     A       QT   PR +    F  CY  SG   + +PT++ +
Sbjct: 191 SGTTLTLLPRDFYTDMESALTKVIGGQTTTDPRGT----FSLCY--SGVKKLEIPTITAH 244

Query: 432 FSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFG 491
           F G  V   P + F+   +D    CF+  PS S L+I GN+ Q    + +D  N  V F 
Sbjct: 245 FIGADVQLPPLNTFVQAQEDL--VCFSMIPS-SNLAIFGNLSQMNFLVGYDLKNNKVSFK 301

Query: 492 PNVC 495
           P  C
Sbjct: 302 PTDC 305


>gi|225458774|ref|XP_002283258.1| PREDICTED: aspartic proteinase-like protein 2 [Vitis vinifera]
 gi|302142232|emb|CBI19435.3| unnamed protein product [Vitis vinifera]
          Length = 659

 Score =  132 bits (333), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 101/359 (28%), Positives = 162/359 (45%), Gaps = 30/359 (8%)

Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
           +G Y  R+ +G+PP+   +++D+GS + +V C  C  C K  DP F P +S+++  V C+
Sbjct: 85  NGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCSDCEHCGKHQDPRFQPDESSTYHPVKCN 144

Query: 213 -SAVCDRLENAGCHAG-RCRYEVSYGDGSYTKGTLALETLTIG---RTVVKNVAIGCGHK 267
               CD       H G  C YE  Y + S + G L  + ++ G     V +    GC + 
Sbjct: 145 MDCNCD-------HDGVNCVYERRYAEMSSSSGVLGEDIISFGNQSEVVPQRAVFGCENV 197

Query: 268 NQGMFVG--AAGLLGLGGGSMSLVGQLGGQT--GGAFSYCLVSRGTGSSGSLVFGREALP 323
             G      A G++GLG G +S+V QL  +     +FS C      G  G++V G   +P
Sbjct: 198 ETGDLYSQRADGIMGLGRGQLSIVDQLVDKNVINDSFSLCYGGMHVG-GGAMVLG--GIP 254

Query: 324 VGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRL 383
                V    +P    +Y + L  + V G  + +S   F        G V+D+GT    L
Sbjct: 255 PPPDMVFSRSDPYRSPYYNIELKEIHVAGKPLKLSPSTFDRKH----GTVLDSGTTYAYL 310

Query: 384 PTPAYEAFRDAFVAQTGNLPRASG--VSIFDTCYNLSGF----VSVRVPTVSFYFSGGPV 437
           P  A+ AFRDA + ++ NL +  G   +  D C++ +G     +S   P V   FS G  
Sbjct: 311 PEEAFVAFRDAIIKKSHNLKQIHGPDPNYNDICFSGAGRDVSQLSKAFPEVDMVFSNGQK 370

Query: 438 LTLPASNFLIPVDDA-GTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           L+L   N+L       G +C     +    +++G I      +++D  N  +GF    C
Sbjct: 371 LSLTPENYLFQHTKVHGAYCLGIFRNGDSTTLLGGIIVRNTLVTYDRENEKIGFWKTNC 429


>gi|326501422|dbj|BAK02500.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 631

 Score =  132 bits (333), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 110/389 (28%), Positives = 175/389 (44%), Gaps = 43/389 (11%)

Query: 126 LSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQ 185
           L+ GG  +A+  + D   D+++     +G Y  R+ +G+PP+   +++DSGS + +V C 
Sbjct: 66  LAEGGRPSARMRLHD---DLLT-----NGYYTTRLHIGTPPQEFALIVDSGSTVTYVPCA 117

Query: 186 PCSQCYKQSDPVFDPADSASFSGVSCS-SAVCDRLENAGCHAGRCRYEVSYGDGSYTKGT 244
            C QC    DP F P  S+++S V C+    CD  +N      +C YE  Y + S + G 
Sbjct: 118 SCEQCGNHQDPRFQPDLSSTYSPVKCNVDCTCDSDKN------QCTYERQYAEMSSSSGV 171

Query: 245 LALETLTIG---RTVVKNVAIGCGHKNQGMFVG--AAGLLGLGGGSMSLVGQL--GGQTG 297
           L  + ++ G       +    GC +   G      A G++GLG G +S++ QL   G  G
Sbjct: 172 LGEDIVSFGTESELKPQRAVFGCENSETGDLFSQHADGIMGLGRGQLSIMDQLVDKGVIG 231

Query: 298 GAFSYCLVSRGTGSSGSLVFGREALPVGAAWV--PLVRNPRAPSFYYVGLSGLGVGGMRI 355
            +FS C      G  G++V G    P G  +     VR+P    +Y + L  + V G  +
Sbjct: 232 DSFSMCYGGMDIG-GGAMVLGAMPAPPGMIYTHSNAVRSP----YYNIELKEMHVAGKAL 286

Query: 356 PISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASG--VSIFDT 413
            +   +F     G  G V+D+GT    LP  A+ AF+DA  +Q   L +  G   +  D 
Sbjct: 287 RVDPRIFD----GKHGTVLDSGTTYAYLPEQAFVAFKDAVSSQVHPLKKIRGPDPNYKDI 342

Query: 414 CY-----NLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDA-GTFCF-AFAPSPSGL 466
           C+     N+S    V  P V   F  G  L+L   N+L       G +C   F       
Sbjct: 343 CFAGAGRNVSQLSEV-FPKVDMVFGNGQKLSLSPENYLFRHSKVEGAYCLGVFQNGKDPT 401

Query: 467 SIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           +++G I      +++D  N  +GF    C
Sbjct: 402 TLLGGIVVRNTLVTYDRHNEKIGFWKTNC 430


>gi|302756119|ref|XP_002961483.1| hypothetical protein SELMODRAFT_76765 [Selaginella moellendorffii]
 gi|300170142|gb|EFJ36743.1| hypothetical protein SELMODRAFT_76765 [Selaginella moellendorffii]
          Length = 388

 Score =  132 bits (333), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 111/372 (29%), Positives = 170/372 (45%), Gaps = 47/372 (12%)

Query: 156 YFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFSGVS 210
           YF ++G+G+P +   + +D+GSD++WV C+PCS C ++S       ++DP +S++ S VS
Sbjct: 2   YFTQVGLGNPVKHYIVQVDTGSDVLWVNCRPCSGCPRKSALNIPLTMYDPRESSTTSLVS 61

Query: 211 CSSAVC---DRLENAGCH--AGRCRYEVSYGDGSYTKGTL---ALETLTIGRTVVKN--- 259
           CS  +C    R   A C      C Y  SYGDGS ++G     A++   I    + N   
Sbjct: 62  CSDPLCVRGRRFAEAQCSQATNNCEYIFSYGDGSTSEGYYVRDAMQYNVISSNGLANTTS 121

Query: 260 -VAIGCGHKNQGMFV----GAAGLLGLGGGSMSLVGQLGGQTG--GAFSYCLVSRGTGSS 312
            V  GC  +  G          G++G G   +S+  QL  Q      FS+CL   G    
Sbjct: 122 QVLFGCSIRQTGDLSTSQQAVDGIIGFGQLELSVPNQLAAQQNIPRVFSHCL--EGEKRG 179

Query: 313 GSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGV 372
           G ++        G  + PLV +      Y V L G+ V   R+PI  + F  T   D GV
Sbjct: 180 GGILVIGGIAEPGMTYTPLVPD---SVHYNVVLRGISVNSNRLPIDAEDFSSTN--DTGV 234

Query: 373 VMDTGTAVTRLPTPAYEAFRDAFVAQTGNLP-RASGVSIFDTCYNLSGFVSVRVPTVSFY 431
           +MD+GT +   P+ AY  F  A    T   P R  G+     C+ +SG +S   P V+  
Sbjct: 235 IMDSGTTLAYFPSGAYNVFVQAIREATSATPVRVQGMDT--QCFLVSGRLSDLFPNVTLN 292

Query: 432 FSGGPVLTLPASNFLI-----PVDDAGTFCFAFAPSPSG--------LSIIGNIQQEGIQ 478
           F GG  + L   N+L+     P      +C  +  S S         L+I+G+I  +   
Sbjct: 293 FEGG-AMELQPDNYLMWGGTAPTGTTDVWCIGWQSSSSSAGPKDGSQLTILGDIVLKDKL 351

Query: 479 ISFDGANGFVGF 490
           + +D  N  +G+
Sbjct: 352 VVYDLDNSRIGW 363


>gi|357510893|ref|XP_003625735.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355500750|gb|AES81953.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 535

 Score =  132 bits (333), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 125/448 (27%), Positives = 188/448 (41%), Gaps = 91/448 (20%)

Query: 132 DAAKH---EVQDFGTDVVSGMDQGS------GEYFVRIGVGSPPRSQYMVIDSGSDIVWV 182
           D A+H    +QD G  ++    QG+      G YF ++ +GSP +  Y+ ID+GSDI+W+
Sbjct: 38  DRARHGGRILQDGGGGILDFSVQGTSDPYLVGLYFTKVKMGSPAKEFYVQIDTGSDILWL 97

Query: 183 QCQPCSQCYKQSD-----PVFDPADSASFSGVSCSSAVCD---RLENAGC--HAGRCRYE 232
            C  C+ C K S        FD A S++ + VSCS  VC    +   + C   A +C Y 
Sbjct: 98  NCNTCNNCPKSSGLGIDLNYFDTASSSTAALVSCSDPVCSYAVQTATSQCSSQANQCSYT 157

Query: 233 VSYGDGSYTKGTLALETL----TIGRTVVKN----VAIGCGHKNQGMFV----GAAGLLG 280
             YGDGS T G    + +     +G++V  N    V  GC     G          G+ G
Sbjct: 158 FQYGDGSGTSGYYVYDAMYFDVIMGQSVFSNSSSTVVFGCSTYQSGDLARTEKAVDGIFG 217

Query: 281 LGGGSMSLVGQLG--GQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAP 338
            G G++S+V Q+   G     FS+CL  +G GS G ++   E L     + PLV  P  P
Sbjct: 218 FGPGALSVVSQVSSQGMAPKVFSHCL--KGQGSGGGILVLGEILEPNIVYTPLV--PLQP 273

Query: 339 SFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDA---- 394
             Y + L  + V G  +PI +D+F      + G ++D+GT +  L   AY+ F +A    
Sbjct: 274 H-YNLNLQSIAVNGQILPIDQDVFATGN--NRGTIVDSGTTLAYLVQEAYDPFLNAGSPC 330

Query: 395 -----FVAQTGNLPRASG-------------------------------VSIF------- 411
                F   T N+    G                               VS F       
Sbjct: 331 HFFTHFNEPTNNIKYEDGNNNHQSRVKRHYYDEVTLRLVLKHSAIITTTVSQFSKPIISK 390

Query: 412 -DTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIP---VDDAGTFCFAFAPSPSGLS 467
            + CY +   +    P VS  F GG  + L    +LI    +D A  +C  F     G +
Sbjct: 391 GNQCYLVPTSLGDIFPLVSLNFMGGASMVLKPEQYLIHYGFLDGAAMWCIGFQKVQKGYT 450

Query: 468 IIGNIQQEGIQISFDGANGFVGFGPNVC 495
           I+G++  +     +D AN  +G+    C
Sbjct: 451 ILGDLVLKDKIFVYDLANQRIGWTDYDC 478


>gi|9757837|dbj|BAB08274.1| unnamed protein product [Arabidopsis thaliana]
          Length = 586

 Score =  132 bits (332), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 94/357 (26%), Positives = 158/357 (44%), Gaps = 26/357 (7%)

Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
           +G Y  R+ +G+PP+   +++D+GS + +V C  C QC K  DP F P  S S+  + C+
Sbjct: 73  NGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCSTCKQCGKHQDPKFQPELSTSYQALKCN 132

Query: 213 SAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIG---RTVVKNVAIGCGHKNQ 269
                   N       C YE  Y + S + G L+ + ++ G   +   +    GC ++  
Sbjct: 133 PDC-----NCDDEGKLCVYERRYAEMSSSSGVLSEDLISFGNESQLSPQRAVFGCENEET 187

Query: 270 GMFVG--AAGLLGLGGGSMSLVGQL--GGQTGGAFSYCLVSRGTGSSGSLVFGREALPVG 325
           G      A G++GLG G +S+V QL   G     FS C      G  G++V G+ + P G
Sbjct: 188 GDLFSQRADGIMGLGRGKLSVVDQLVDKGVIEDVFSLCYGGMEVG-GGAMVLGKISPPPG 246

Query: 326 AAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPT 385
             +     +P    +Y + L  + V G  + ++  +F     G  G V+D+GT     P 
Sbjct: 247 MVFS--HSDPFRSPYYNIDLKQMHVAGKSLKLNPKVFN----GKHGTVLDSGTTYAYFPK 300

Query: 386 PAYEAFRDAFVAQTGNLPRASG--VSIFDTCYNLSGFVSVRV----PTVSFYFSGGPVLT 439
            A+ A +DA + +  +L R  G   +  D C++ +G     +    P ++  F  G  L 
Sbjct: 301 EAFIAIKDAVIKEIPSLKRIHGPDPNYDDVCFSGAGRDVAEIHNFFPEIAMEFGNGQKLI 360

Query: 440 LPASNFLIPVDDA-GTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           L   N+L       G +C    P     +++G I      +++D  N  +GF    C
Sbjct: 361 LSPENYLFRHTKVRGAYCLGIFPDRDSTTLLGGIVVRNTLVTYDRENDKLGFLKTNC 417


>gi|42568291|ref|NP_199124.3| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332007527|gb|AED94910.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 631

 Score =  132 bits (332), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 94/357 (26%), Positives = 158/357 (44%), Gaps = 26/357 (7%)

Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
           +G Y  R+ +G+PP+   +++D+GS + +V C  C QC K  DP F P  S S+  + C+
Sbjct: 73  NGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCSTCKQCGKHQDPKFQPELSTSYQALKCN 132

Query: 213 SAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIG---RTVVKNVAIGCGHKNQ 269
                   N       C YE  Y + S + G L+ + ++ G   +   +    GC ++  
Sbjct: 133 PDC-----NCDDEGKLCVYERRYAEMSSSSGVLSEDLISFGNESQLSPQRAVFGCENEET 187

Query: 270 GMFVG--AAGLLGLGGGSMSLVGQL--GGQTGGAFSYCLVSRGTGSSGSLVFGREALPVG 325
           G      A G++GLG G +S+V QL   G     FS C      G  G++V G+ + P G
Sbjct: 188 GDLFSQRADGIMGLGRGKLSVVDQLVDKGVIEDVFSLCYGGMEVG-GGAMVLGKISPPPG 246

Query: 326 AAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPT 385
             +     +P    +Y + L  + V G  + ++  +F     G  G V+D+GT     P 
Sbjct: 247 MVFS--HSDPFRSPYYNIDLKQMHVAGKSLKLNPKVFN----GKHGTVLDSGTTYAYFPK 300

Query: 386 PAYEAFRDAFVAQTGNLPRASG--VSIFDTCYNLSGFVSVRV----PTVSFYFSGGPVLT 439
            A+ A +DA + +  +L R  G   +  D C++ +G     +    P ++  F  G  L 
Sbjct: 301 EAFIAIKDAVIKEIPSLKRIHGPDPNYDDVCFSGAGRDVAEIHNFFPEIAMEFGNGQKLI 360

Query: 440 LPASNFLIPVDDA-GTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           L   N+L       G +C    P     +++G I      +++D  N  +GF    C
Sbjct: 361 LSPENYLFRHTKVRGAYCLGIFPDRDSTTLLGGIVVRNTLVTYDRENDKLGFLKTNC 417


>gi|302764208|ref|XP_002965525.1| hypothetical protein SELMODRAFT_406966 [Selaginella moellendorffii]
 gi|300166339|gb|EFJ32945.1| hypothetical protein SELMODRAFT_406966 [Selaginella moellendorffii]
          Length = 464

 Score =  132 bits (332), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 105/358 (29%), Positives = 164/358 (45%), Gaps = 38/358 (10%)

Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSS 213
           G Y+  I +GSPP+   +V+D+GSD+ WV+C PCS         FD   S ++  ++C+ 
Sbjct: 122 GVYYSSITLGSPPKDFSLVMDTGSDLTWVRCDPCS---PDCSSTFDRLASNTYKALTCAD 178

Query: 214 ----AVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQ 269
                V  RL     H+GR   +     G+ +     LE              GCG   +
Sbjct: 179 DLRLPVLLRLWRRLFHSGRSLRDTLKMAGAASD---ELEEF-------PGFVFGCGSLLK 228

Query: 270 GMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSS---GSLVFGREAL---- 322
           G+  G  G+L L  GS+S   Q+G + G  FSYCL+ +   +S     +VFG  A+    
Sbjct: 229 GLISGEVGILALSPGSLSFPSQIGEKYGNKFSYCLLRQTAQNSLKKSPMVFGEAAVELKE 288

Query: 323 PVGAAWVPLVRNPRAPS--FYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAV 380
           P       L   P   S  +Y V L G+ VG  R+ +S   F   Q  D   + D+GT +
Sbjct: 289 PGSGKPQELQYTPIGESSIYYTVRLDGISVGNQRLDLSPSTFLNGQ--DKPTIFDSGTTL 346

Query: 381 TRLPTPAYEAFRDAFVAQTGNLPRASGVSI--FDTCYNLSGFVSVRVPTVSFYFSGGPVL 438
           T LP+   ++ + +  +       A  V+I   D C+ +       +P ++F+F+GG   
Sbjct: 347 TMLPSGVCDSIKQSLASMVSG---AEFVAIKGLDACFRVPPSSGQGLPDITFHFNGGADF 403

Query: 439 TLPASNFLIPVDDAGTF-CFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
               SN++I   D G+  C  F P+ + +SI GN+QQ+   +  D  N  +GF    C
Sbjct: 404 VTRPSNYVI---DLGSLQCLIFVPT-NEVSIFGNLQQQDFFVLHDMDNRRIGFKETDC 457


>gi|302776054|ref|XP_002971323.1| hypothetical protein SELMODRAFT_63598 [Selaginella moellendorffii]
 gi|300161305|gb|EFJ27921.1| hypothetical protein SELMODRAFT_63598 [Selaginella moellendorffii]
          Length = 395

 Score =  132 bits (332), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 112/374 (29%), Positives = 171/374 (45%), Gaps = 47/374 (12%)

Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFSG 208
           G YF ++G+G+P +   + +D+GSD++WV C+PCS C ++S       ++DP +S++ S 
Sbjct: 27  GLYFTQVGLGNPVKHYIVQVDTGSDVLWVNCRPCSGCPRKSALNIPLTMYDPRESSTTSL 86

Query: 209 VSCSSAVC---DRLENAGCH--AGRCRYEVSYGDGSYTKGTL---ALETLTIGRTVVKN- 259
           VSCS  +C    R   A C      C Y  SYGDGS ++G     A++   I    + N 
Sbjct: 87  VSCSDPLCVRGRRFAEAQCSQTTNNCEYIFSYGDGSTSEGYYVRDAMQYNVISSNGLANT 146

Query: 260 ---VAIGCGHKNQGMFV----GAAGLLGLGGGSMSLVGQLGGQTG--GAFSYCLVSRGTG 310
              V  GC  +  G          G++G G   +S+  QL  Q      FS+CL   G  
Sbjct: 147 TSQVLFGCSIRQTGDLSTSQQAVDGIIGFGQLELSVPNQLAAQQNIPRVFSHCL--EGEK 204

Query: 311 SSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDD 370
             G ++        G  + PLV +      Y V L G+ V   R+PI  + F  T   D 
Sbjct: 205 RGGGILVIGGIAEPGMTYTPLVPD---SVHYNVVLRGISVNSNRLPIDAEDFSSTN--DT 259

Query: 371 GVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLP-RASGVSIFDTCYNLSGFVSVRVPTVS 429
           GV+MD+GT +   P+ AY  F  A    T   P R  G+     C+ +SG +S   P V+
Sbjct: 260 GVIMDSGTTLAYFPSGAYNVFVQAIREATSATPVRVQGMDT--QCFLVSGRLSDLFPNVT 317

Query: 430 FYFSGGPVLTLPASNFLI-----PVDDAGTFCFAFAPSPSG--------LSIIGNIQQEG 476
             F GG  + L   N+L+     P      +C  +  S S         L+I+G+I  + 
Sbjct: 318 LNFEGG-AMELQPDNYLMWGGTAPTGTTDVWCIGWQSSSSSAGPKDGSQLTILGDIVLKD 376

Query: 477 IQISFDGANGFVGF 490
             + +D  N  +G+
Sbjct: 377 KLVVYDLDNSRIGW 390


>gi|356563324|ref|XP_003549914.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 480

 Score =  132 bits (332), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 120/406 (29%), Positives = 167/406 (41%), Gaps = 71/406 (17%)

Query: 155 EYFVRIGVGSPPRSQ--YMVIDSGSDIVWVQCQP--CSQCY-KQSDPVFDPADSASFS-G 208
           +Y +   +G   ++Q   + +D+GSD+VW  C P  C  C  K ++P   P  + + S  
Sbjct: 69  DYTLSFNLGPQAQAQPITLYMDTGSDLVWFPCAPFKCILCEGKPNEPNASPPTNITQSVA 128

Query: 209 VSCSSAVCDRLENAG-----CHAGRCRYE----------------VSYGDGSYTKGTLAL 247
           VSC S  C    N       C A RC  E                 +YGDGS     L  
Sbjct: 129 VSCKSPACSAAHNLAPPSDLCAAARCPLESIETSDCANFKCPPFYYAYGDGSLI-ARLYR 187

Query: 248 ETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLG---GQTGGAFSYCL 304
           +TL++    ++N   GC H          G+ G G G +SL  QL     Q G  FSYCL
Sbjct: 188 DTLSLSSLFLRNFTFGCAHTT---LAEPTGVAGFGRGLLSLPAQLATLSPQLGNRFSYCL 244

Query: 305 VSRGTGSS-----GSLVFGR----EALPVGA-----AWVPLVRNPRAPSFYYVGLSGLGV 350
           VS    S        L+ GR    E   +G       +  ++ NP+ P FY V L G+ V
Sbjct: 245 VSHSFDSERVRKPSPLILGRYEEKEKEKIGGGVAEFVYTSMLENPKHPYFYTVSLIGIAV 304

Query: 351 GGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTG-NLPRASGVS 409
           G   IP  E L R+   GD GVV+D+GT  T LP   Y +  D F  + G +  RA  + 
Sbjct: 305 GKRTIPAPEMLRRVNNRGDGGVVVDSGTTFTMLPAGFYNSVVDEFDRRVGRDNKRARKIE 364

Query: 410 I---FDTCYNLSGFVSVRVPTVSFYFSGGP--VLTLPASNFLIPVDD----------AGT 454
                  CY L+      VP ++  F+GG    + LP  N+     D           G 
Sbjct: 365 EKTGLAPCYYLNSVAD--VPALTLRFAGGKNSSVVLPRKNYFYEFSDGSDGAKGKRKVGC 422

Query: 455 FCFAFAPSPSGLS-----IIGNIQQEGIQISFDGANGFVGFGPNVC 495
                    + LS      +GN QQ+G ++ +D     VGF    C
Sbjct: 423 LMLMNGGDEADLSGGPGATLGNYQQQGFEVEYDLEEKRVGFARRQC 468


>gi|255538124|ref|XP_002510127.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223550828|gb|EEF52314.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 641

 Score =  132 bits (332), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 116/427 (27%), Positives = 187/427 (43%), Gaps = 48/427 (11%)

Query: 85  HRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTD 144
           HR  +     +T+N+  HR    F +   R         R+L       A   + D   D
Sbjct: 36  HRPMIIPLHLSTSNISSHRK--PFTSNYHR---------RQLHNSDLPNAHMRLYD---D 81

Query: 145 VVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSA 204
           ++S     +G Y  R+ +G+PP+   +++D+GS + +V C  C QC K  DP F P  S+
Sbjct: 82  LLS-----NGYYTTRLFIGTPPQEFALIVDTGSTVTYVPCSTCEQCGKHQDPRFQPESSS 136

Query: 205 SFSGVSCS-SAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIG---RTVVKNV 260
           ++  + C+ S  CD          +C YE  Y + S + G LA + L+ G       +  
Sbjct: 137 TYKPMQCNPSCNCDD------EGKQCTYERRYAEMSSSSGLLAEDVLSFGNESELTPQRA 190

Query: 261 AIGCGHKNQGMFVG--AAGLLGLGGGSMSLVGQLGGQ--TGGAFSYCLVSRGTGSSGSLV 316
             GC     G      A G++GLG G +S+V QL  +   G +FS C         G++V
Sbjct: 191 IFGCETVETGELFSQRADGIMGLGRGPLSVVDQLVIKEVVGNSFSLCYGGMDV-VGGAMV 249

Query: 317 FGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDT 376
            G   +P     V    +P   ++Y + L  L V G R+ ++  +F     G  G V+D+
Sbjct: 250 LGN--IPPPPDMVFAHSDPYRSAYYNIELKELHVAGKRLKLNPRVFD----GKHGTVLDS 303

Query: 377 GTAVTRLPTPAYEAFRDAFVAQTGNLPRASG--VSIFDTCYNLSGF----VSVRVPTVSF 430
           GT    LP  A+ AF+DA + +   L +  G   S  D C++ +G     +S   P V+ 
Sbjct: 304 GTTYAYLPEEAFVAFKDAIIKEIKFLKQIHGPDPSYNDICFSGAGRDVSQLSKIFPEVNM 363

Query: 431 YFSGGPVLTLPASNFLIP-VDDAGTFCFA-FAPSPSGLSIIGNIQQEGIQISFDGANGFV 488
            F  G  L+L   N+L      +G +C   F       +++G I      +++D  N  +
Sbjct: 364 VFGNGQKLSLSPENYLFRHTKVSGAYCLGIFQNGKDPTTLLGGIVVRNTLVTYDRDNDKI 423

Query: 489 GFGPNVC 495
           GF    C
Sbjct: 424 GFWKTNC 430


>gi|356570895|ref|XP_003553619.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 470

 Score =  132 bits (331), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 114/391 (29%), Positives = 169/391 (43%), Gaps = 59/391 (15%)

Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQP---CSQC-YKQSD----PVFDPADSAS 205
           G Y + + +G+PP++   V+D+GS +VW  C     CS C +   D    P F P +S++
Sbjct: 86  GGYSIDLNLGTPPQTSPFVLDTGSSLVWFPCTSHYLCSHCNFPNIDPTKIPTFIPKNSST 145

Query: 206 FSGVSCSSAVCDRL--ENAGCHAGRCR-------------YEVSYGDGSYTKGTLALETL 250
              + C +  C  L   +      +C+             Y + YG G+ T G L L+ L
Sbjct: 146 AKLLGCRNPKCGYLFGPDVESRCPQCKKPGSQNCSLTCPSYIIQYGLGA-TAGFLLLDNL 204

Query: 251 TIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR--- 307
                 V    +GC   +       +G+ G G G  SL  Q+  +    FSYCLVS    
Sbjct: 205 NFPGKTVPQFLVGCSILS---IRQPSGIAGFGRGQESLPSQMNLK---RFSYCLVSHRFD 258

Query: 308 GTGSSGSLVFGR----EALPVGAAWVPLVRNPRAPS----FYYVGLSGLGVGGMRIPISE 359
            T  S  LV       +    G ++ P   NP   S    +YYV L  L VGG+ + I  
Sbjct: 259 DTPQSSDLVLQISSTGDTKTNGLSYTPFRSNPSNNSVFREYYYVTLRKLIVGGVDVKIPY 318

Query: 360 DLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTG-------NLPRASGVSIFD 412
                   G+ G ++D+G+  T +  P Y      F+ Q G       N+   SG+S   
Sbjct: 319 KFLEPGSDGNGGTIVDSGSTFTFMERPVYNLVAQEFLRQLGKKYSREENVEAQSGLS--- 375

Query: 413 TCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCF-------AFAPSPSG 465
            C+N+SG  ++  P  +F F GG  ++ P  N+   V DA   CF       A  P  +G
Sbjct: 376 PCFNISGVKTISFPEFTFQFKGGAKMSQPLLNYFSFVGDAEVLCFTVVSDGGAGQPKTAG 435

Query: 466 LSII-GNIQQEGIQISFDGANGFVGFGPNVC 495
            +II GN QQ+   + +D  N   GFGP  C
Sbjct: 436 PAIILGNYQQQNFYVEYDLENERFGFGPRNC 466


>gi|302817726|ref|XP_002990538.1| hypothetical protein SELMODRAFT_131679 [Selaginella moellendorffii]
 gi|300141706|gb|EFJ08415.1| hypothetical protein SELMODRAFT_131679 [Selaginella moellendorffii]
          Length = 434

 Score =  132 bits (331), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 115/367 (31%), Positives = 165/367 (44%), Gaps = 41/367 (11%)

Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD---PV--FDPADSASFS 207
           +G YF ++ +G+PPR+  + +D+GSD++WV C PC  C   SD   P+  +D   SAS S
Sbjct: 33  AGLYFTQVQLGTPPRTYNLQVDTGSDLLWVNCHPCIGCPAFSDLKIPIVPYDVKASASSS 92

Query: 208 GVSCSSAVC---DRLENAGCH-AGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIG 263
            V CS   C    ++  +GC+   +C Y   YGDGS T G L  + L         V  G
Sbjct: 93  KVPCSDPSCTLITQISESGCNDQNQCGYSFQYGDGSGTLGYLVEDVLHYMVNATATVIFG 152

Query: 264 CGHKNQGMFVGAA----GLLGLGGGSMSLVGQLG--GQTGGAFSYCLVSRGTGSSGSLVF 317
           CG K  G    +     G++G G   +S   QL   G+T   F++CL   G    G LV 
Sbjct: 153 CGFKQSGDLSTSERALDGIIGFGASDLSFNSQLAKQGKTPNVFAHCL-DGGERGGGILVL 211

Query: 318 GREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTG 377
           G    P    + PLV  P   S Y V L  + V    + I   LF    M   G + D+G
Sbjct: 212 GNVIEP-DIQYTPLV--PYM-SHYNVVLQSISVNNANLTIDPKLFSNDVM--QGTIFDSG 265

Query: 378 TAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTC-YNLSGFVSVRVPTVSFYFSGGP 436
           T +  LP  AY+AF  A          +  V+ F  C   LS F+    P V  YF G  
Sbjct: 266 TTLAYLPDEAYQAFTQAV---------SLVVAPFLLCDTRLSRFIYKLFPNVVLYFEGAS 316

Query: 437 VLTLPASNFLI---PVDDAGTFCFAF-----APSPSGLSIIGNIQQEGIQISFDGANGFV 488
            +TL  + +LI      +A  +C  +     A S    +I G++  +   + +D   G +
Sbjct: 317 -MTLTPAEYLIRQASAANAPIWCMGWQSMGSAESELQYTIFGDLVLKNKLVVYDLERGRI 375

Query: 489 GFGPNVC 495
           G+ P  C
Sbjct: 376 GWRPFDC 382


>gi|356532672|ref|XP_003534895.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 449

 Score =  132 bits (331), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 120/418 (28%), Positives = 190/418 (45%), Gaps = 56/418 (13%)

Query: 101 YHRHQHSFHARMQRDVK----RVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEY 156
           +++   +   RM+ D++    R A +  R+ G      +++ +      VS    G    
Sbjct: 49  HYKPNETAKDRMELDIQHSAARFAYIQARIEGSLVSNNEYKAR------VSPSLTGR-TI 101

Query: 157 FVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVC 216
              I +G PP  Q +V+D+GSDI+WV C PC+ C      +FDP+ S++FS + C +  C
Sbjct: 102 MANISIGQPPIPQLVVMDTGSDILWVMCTPCTNCDNHLGLLFDPSMSSTFSPL-CKTP-C 159

Query: 217 DRLENAGCHAGRCR---YEVSYGDGSYTKG-----TLALETLTIGRTVVKNVAIGCGHK- 267
           D     GC   RC    + V+Y D S   G     T+  ET   G + + +V  GCGH  
Sbjct: 160 DF---KGC--SRCDPIPFTVTYADNSTASGMFGRDTVVFETTDEGTSRIPDVLFGCGHNI 214

Query: 268 NQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCL--VSRGTGSSGSLVFGREALPVG 325
            Q    G  G+LGL  G  SL  ++G +    FSYC+  ++    +   L+ G  A   G
Sbjct: 215 GQDTDPGHNGILGLNNGPDSLATKIGQK----FSYCIGDLADPYYNYHQLILGEGADLEG 270

Query: 326 AAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPT 385
            +    V N     FYYV + G+ VG  R+ I+ + F + +    GV++DTG+ +T L  
Sbjct: 271 YSTPFEVHN----GFYYVTMEGISVGEKRLDIAPETFEMKKNRTGGVIIDTGSTITFLVD 326

Query: 386 PAYEAFRDAFVAQTGNLPRASGV--SIFDTCY------NLSGFVSVRVPTVSFYFSGGPV 437
             +           G   R + +  S +  C+      +L GF     P V+F+F+ G  
Sbjct: 327 SVHRLLSKEVRNLLGWSFRQTTIEKSPWMQCFYGSISRDLVGF-----PVVTFHFADGAD 381

Query: 438 LTLPASNFLIPVDDAGTFCFAFAPS-----PSGLSIIGNIQQEGIQISFDGANGFVGF 490
           L L + +F   ++D   FC    P       S  S+IG + Q+   + +D  N FV F
Sbjct: 382 LALDSGSFFNQLND-NVFCMTVGPVSSLNLKSKPSLIGLLAQQSYSVGYDLVNQFVYF 438


>gi|297812425|ref|XP_002874096.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297319933|gb|EFH50355.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 493

 Score =  132 bits (331), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 123/404 (30%), Positives = 175/404 (43%), Gaps = 58/404 (14%)

Query: 103 RHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTD------VVSGMDQGSGEY 156
            H+        RD  R   L++ L G         V DF  D      VV       G Y
Sbjct: 38  NHEMELSQLKARDKARHGRLLQSLGG---------VIDFPVDGTFDPFVV-------GLY 81

Query: 157 FVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFSGVSC 211
           + +I +GSPPR  Y+ +D+GSD++WV C  C+ C + S        FDP  S + + VSC
Sbjct: 82  YTKIRLGSPPRDFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTATPVSC 141

Query: 212 SSAVCD---RLENAGC--HAGRCRYEVSYGDGSYTKGTLALETL----TIGRTVVKN--- 259
           S   C    +  ++GC      C Y   YGDGS T G    + L     +G ++V N   
Sbjct: 142 SDQRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLVPNSTA 201

Query: 260 -VAIGCGHKNQGMFV----GAAGLLGLGGGSMSLVGQLGGQ--TGGAFSYCLVSRGTGSS 312
            V  GC     G  V       G+ G G   MS++ QL  Q      FS+CL     G  
Sbjct: 202 PVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGLAPRVFSHCLKGE-NGGG 260

Query: 313 GSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGV 372
           G LV G    P    + PLV  P  P  Y V L  + V G  +PI+  +F  +     G 
Sbjct: 261 GILVLGEIVEP-NMVFTPLV--PSQPH-YNVNLLSISVNGQALPINPSVFSTSN--GQGT 314

Query: 373 VMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYF 432
           ++DTGT +  L   AY  F +A         R   VS  + CY ++  V+   P VS  F
Sbjct: 315 IIDTGTTLAYLSEAAYVPFVEAITNAVSQSVRPV-VSKGNQCYVIATSVADIFPPVSLNF 373

Query: 433 SGGPVLTLPASNFLIPVDDAG---TFCFAFAP-SPSGLSIIGNI 472
           +GG  + L   ++LI  ++ G    +C  F      G++I+G++
Sbjct: 374 AGGASMFLNPQDYLIQQNNVGGTAVWCIGFQRIQNQGITILGDL 417


>gi|18405138|ref|NP_565911.1| Eukaryotic aspartyl protease family protein [Arabidopsis thaliana]
 gi|13877759|gb|AAK43957.1|AF370142_1 unknown protein [Arabidopsis thaliana]
 gi|15293231|gb|AAK93726.1| unknown protein [Arabidopsis thaliana]
 gi|20196976|gb|AAB87120.2| expressed protein [Arabidopsis thaliana]
 gi|20197046|gb|AAM14894.1| expressed protein [Arabidopsis thaliana]
 gi|330254616|gb|AEC09710.1| Eukaryotic aspartyl protease family protein [Arabidopsis thaliana]
          Length = 442

 Score =  132 bits (331), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 117/380 (30%), Positives = 172/380 (45%), Gaps = 60/380 (15%)

Query: 158 VRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVC- 216
           V + VG PP++  MV+D+GS++ W+ C+           VF+P  S+++S V CSS +C 
Sbjct: 67  VTLAVGDPPQNISMVLDTGSELSWLHCKKSPNL----GSVFNPVSSSTYSPVPCSSPICR 122

Query: 217 ----DRLENAGCHAGR--CRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHK--- 267
               D    A C      C   +SY D +  +G LA ET  IG         GC      
Sbjct: 123 TRTRDLPIPASCDPKTHLCHVAISYADATSIEGNLAHETFVIGSVTRPGTLFGCMDSGLS 182

Query: 268 -NQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREAL---- 322
            N      + GL+G+  GS+S V QLG      FSYC+   G+ SSG L+ G  +     
Sbjct: 183 SNSEEDAKSTGLMGMNRGSLSFVNQLGFS---KFSYCI--SGSDSSGFLLLGDASYSWLG 237

Query: 323 PVGAAWVPLV-RNPRAPSF----YYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTG 377
           P+   + PLV ++   P F    Y V L G+ VG   + + + +F     G    ++D+G
Sbjct: 238 PI--QYTPLVLQSTPLPYFDRVAYTVQLEGIRVGSKILSLPKSVFVPDHTGAGQTMVDSG 295

Query: 378 TAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIF------DTCY--------NLSGFVSV 423
           T  T L  P Y A ++ F+ QT ++ R      F      D CY        N SG    
Sbjct: 296 TQFTFLMGPVYTALKNEFITQTKSVLRLVDDPDFVFQGTMDLCYKVGSTTRPNFSG---- 351

Query: 424 RVPTVSFYFSGGPVLTLPASNFLIPVDDAGT------FCFAFAPSP-SGLS--IIGNIQQ 474
            +P VS  F G   +++     L  V+ AG+      +CF F  S   G+   +IG+  Q
Sbjct: 352 -LPMVSLMFRGAE-MSVSGQKLLYRVNGAGSEGKEEVYCFTFGNSDLLGIEAFVIGHHHQ 409

Query: 475 EGIQISFDGANGFVGFGPNV 494
           + + + FD A   VGF  NV
Sbjct: 410 QNVWMEFDLAKSRVGFAGNV 429


>gi|255567949|ref|XP_002524952.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223535787|gb|EEF37449.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 394

 Score =  132 bits (331), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 98/341 (28%), Positives = 157/341 (46%), Gaps = 41/341 (12%)

Query: 124 RRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQ 183
           RRL G     A+  + D   D++      +G Y  RI +G+PP++  +++D+GS + +V 
Sbjct: 66  RRLQGSARPNARMRLYD---DLLL-----NGYYTTRIWIGTPPQTFALIVDTGSTVTYVP 117

Query: 184 CQPCSQCYKQSDPVFDPADSASFSGVSCS-SAVCDRLENAGCHAGRCRYEVSYGDGSYTK 242
           C  C QC +  DP F+P  S+++  VSC+    CD          +C YE  Y + S + 
Sbjct: 118 CSTCEQCGRHQDPKFEPELSSTYQPVSCNIDCTCDN------ERKQCVYERQYAEMSSSS 171

Query: 243 GTLALETLTIG---RTVVKNVAIGCGHKNQGMFVG--AAGLLGLGGGSMSLVGQL--GGQ 295
           G L  + ++ G     V +    GC ++  G      A G++GLG G +S+V QL   G 
Sbjct: 172 GVLGEDIISFGNQSELVPQRAIFGCENQETGDLYSQRADGIMGLGRGDLSIVDQLVEKGV 231

Query: 296 TGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRI 355
              +FS C      G  G+++ G  + P G  +     +P    +Y + L  + V G ++
Sbjct: 232 ISDSFSLCYGGMDIG-GGAMILGGISPPSGMVFAE--SDPVRSQYYNIDLKAIHVAGKQL 288

Query: 356 PISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCY 415
            +   +F     G  G V+D+GT    LP  A+ AF+DA + +  +L +  G    D  Y
Sbjct: 289 HLDPSIFD----GKHGTVLDSGTTYAYLPEAAFTAFKDAMMKELTSLKQIHGP---DPNY 341

Query: 416 NLSGF---------VSVRVPTVSFYFSGGPVLTLPASNFLI 447
           N   F         +S   P V   FS G  L+L   N+L 
Sbjct: 342 NDICFSGAESDVSQLSNTFPAVEMVFSNGQKLSLSPENYLF 382


>gi|194701538|gb|ACF84853.1| unknown [Zea mays]
 gi|194703714|gb|ACF85941.1| unknown [Zea mays]
          Length = 208

 Score =  132 bits (331), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 89/221 (40%), Positives = 119/221 (53%), Gaps = 17/221 (7%)

Query: 279 LGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWV--PLVRNPR 336
           +GLGGG+ SLV Q  G  G AFSYCL    + SSG L  G       + +V  P++R+ +
Sbjct: 1   MGLGGGAQSLVSQTAGTLGRAFSYCLPPTPS-SSGFLTLGAAGGSGTSGFVKTPMLRSSQ 59

Query: 337 APSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFV 396
            P+FY V L  + VGG ++ I   +F        G VMD+GT +TRLP  AY A   AF 
Sbjct: 60  VPTFYGVRLQAIRVGGRQLSIPASVF------SAGTVMDSGTVITRLPPTAYSALSSAFK 113

Query: 397 AQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFC 456
           A     P A    I DTC++ SG  SV +P+V+  FSGG V++L AS  ++      + C
Sbjct: 114 AGMKQYPPAQPSGILDTCFDFSGQSSVSIPSVALVFSGGAVVSLDASGIIL------SNC 167

Query: 457 FAFA--PSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
            AFA     S L IIGN+QQ   ++ +D   G VGF    C
Sbjct: 168 LAFAGNSDDSSLGIIGNVQQRTFEVLYDVGRGVVGFRAGAC 208


>gi|357122155|ref|XP_003562781.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
           distachyon]
          Length = 629

 Score =  131 bits (330), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 109/397 (27%), Positives = 177/397 (44%), Gaps = 37/397 (9%)

Query: 115 DVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVID 174
           +  R+A+  R L  GG  +A+  + D   D+++     +G Y  R+ +G+PP+   +++D
Sbjct: 52  NASRLASSRRVLGDGGRPSARMRLHD---DLLT-----NGYYTTRLYIGTPPQEFALIVD 103

Query: 175 SGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSS-AVCDRLENAGCHAGRCRYEV 233
           SGS + +V C  C QC    DP F P  S+++S V CS+   CD          +C YE 
Sbjct: 104 SGSTVTYVPCASCEQCGNHQDPRFQPDLSSTYSPVKCSADCTCD------SDKSQCTYER 157

Query: 234 SYGDGSYTKGTLALETLTIG---RTVVKNVAIGCGHKNQGMFVG--AAGLLGLGGGSMSL 288
            Y + S + G L  + ++ G       +    GC +   G      A G++GLG G +S+
Sbjct: 158 QYAEMSSSSGVLGEDIVSFGTESELKPQRAVFGCENSETGDLFSQHADGIMGLGRGQLSI 217

Query: 289 VGQL--GGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLS 346
           + QL   G  G +FS C      G  G++V G  A+P     V    +P    +Y + L 
Sbjct: 218 MDQLVDKGVIGDSFSMCYGGMDIG-GGAMVLG--AMPAPPDMVFSRSDPVRSPYYNIELK 274

Query: 347 GLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRAS 406
            + V G  + +   +F        G V+D+GT    LP  A+ AF+DA  ++   L +  
Sbjct: 275 EIHVAGKALRLDPRIFD----SKHGTVLDSGTTYAYLPEQAFVAFKDAVTSKVRPLKKIR 330

Query: 407 G--VSIFDTCYNLSGF----VSVRVPTVSFYFSGGPVLTLPASNFLIPVDDA-GTFCF-A 458
           G   +  D C+  +G     +S   P V   F  G  L+L   N+L       G +C   
Sbjct: 331 GPDPNYKDICFAGAGRNVSQLSQAFPDVDMVFGDGQKLSLSPENYLFRHSKVEGAYCLGV 390

Query: 459 FAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           F       +++G I      +++D  N  +GF    C
Sbjct: 391 FQNGKDPTTLLGGIVVRNTLVTYDRHNEKIGFWKTNC 427


>gi|356532674|ref|XP_003534896.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 446

 Score =  131 bits (330), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 112/361 (31%), Positives = 168/361 (46%), Gaps = 53/361 (14%)

Query: 157 FVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVC 216
            V + +G P   Q +V+D+GSDI+W+ C PC+ C      +FDP+ S++FS + C +   
Sbjct: 102 LVNLSIGQPSIPQLVVMDTGSDILWIMCNPCTNCDNHLGLLFDPSMSSTFSPL-CKTPCG 160

Query: 217 DRLENAGCHAGRCRYEVSYGDGSYTKGT-----LALETLTIGRTVVKNVAIGCGHKNQGM 271
            +    GC      + +SY D S   GT     L  ET   G + + +V IGCGH N G 
Sbjct: 161 FK----GCKCDPIPFTISYVDNSSASGTFGRDILVFETTDEGTSQISDVIIGCGH-NIGF 215

Query: 272 FV--GAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGA-AW 328
               G  G+LGL  G  SL  Q+G +    FSYC+     G+     +    L +G  A 
Sbjct: 216 NSDPGYNGILGLNNGPNSLATQIGRK----FSYCI-----GNLADPYYNYNQLRLGEGAD 266

Query: 329 VPLVRNPRA--PSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTP 386
           +     P      FYYV + G+ VG  R+ I+ + F + + G  GV++D+GT +T L   
Sbjct: 267 LEGYSTPFEVYHGFYYVTMEGISVGEKRLDIALETFEMKRNGTGGVILDSGTTITYLVDS 326

Query: 387 AYEAFRDAFVAQTGNLPRASGVSI------FDTCYNLSGFVS---VRVPTVSFYFSGGPV 437
           A++   +    +  NL + S   +      +  CY   G +S   V  P V+F+F  G  
Sbjct: 327 AHKLLYN----EVRNLLKWSFRQVIFENAPWKLCY--YGIISRDLVGFPVVTFHFVDGAD 380

Query: 438 LTLPASNFLIPVDDAGTFCFAFAP--------SPSGLSIIGNIQQEGIQISFDGANGFVG 489
           L L   +F    DD   FC   +P        SP   S+IG + Q+   + +D  N FV 
Sbjct: 381 LALDTGSFFSQRDD--IFCMTVSPASILNTTISP---SVIGLLAQQSYNVGYDLVNQFVY 435

Query: 490 F 490
           F
Sbjct: 436 F 436


>gi|356539818|ref|XP_003538390.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 457

 Score =  131 bits (330), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 102/364 (28%), Positives = 163/364 (44%), Gaps = 36/364 (9%)

Query: 158 VRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVC- 216
           V + +G+PP+ Q MV+D+GS + W+QC   +         FDP+ S++FS + C+  VC 
Sbjct: 99  VDLPIGTPPQVQPMVLDTGSQLSWIQCHKKAPAKPPPTASFDPSLSSTFSTLPCTHPVCK 158

Query: 217 ----DRLENAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRTV-VKNVAIGCGHKNQG 270
               D      C   R C Y   Y DG+Y +G L  E  T  R++    + +GC  ++  
Sbjct: 159 PRIPDFTLPTSCDQNRLCHYSYFYADGTYAEGNLVREKFTFSRSLFTPPLILGCATES-- 216

Query: 271 MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR----GTGSSGSLVFGREALPVGA 326
                 G+LG+  G +S   Q        FSYC+ +R    G   +GS   G        
Sbjct: 217 --TDPRGILGMNRGRLSFASQ---SKITKFSYCVPTRVTRPGYTPTGSFYLGHNPNSNTF 271

Query: 327 AWVPLV---RNPRAPSF----YYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTA 379
            ++ ++   R+ R P+     Y V L G+ +GG ++ IS  +FR    G    ++D+G+ 
Sbjct: 272 RYIEMLTFARSQRMPNLDPLAYTVALQGIRIGGRKLNISPAVFRADAGGSGQTMLDSGSE 331

Query: 380 VTRLPTPAYEAFRDAFVAQTGNLPRASGVSIF----DTCYNLSGF-VSVRVPTVSFYFSG 434
            T L   AY+  R   V   G  PR     ++    D C++ +   +   +  + F F  
Sbjct: 332 FTYLVNEAYDKVRAEVVRAVG--PRMKKGYVYGGVADMCFDGNAIEIGRLIGDMVFEFEK 389

Query: 435 GPVLTLPASNFLIPVDDAGTFCFAFAPSP---SGLSIIGNIQQEGIQISFDGANGFVGFG 491
           G  + +P    L  V + G  C   A S    +  +IIGN  Q+ + + FD  N  +GFG
Sbjct: 390 GVQIVVPKERVLATV-EGGVHCIGIANSDKLGAASNIIGNFHQQNLWVEFDLVNRRMGFG 448

Query: 492 PNVC 495
              C
Sbjct: 449 TADC 452


>gi|357117301|ref|XP_003560410.1| PREDICTED: uncharacterized protein LOC100833752 [Brachypodium
           distachyon]
          Length = 473

 Score =  131 bits (330), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 110/371 (29%), Positives = 166/371 (44%), Gaps = 36/371 (9%)

Query: 156 YFVRIGVGSPP--RSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSS 213
           Y V +GVG+     +  + +D  +   W+QC PC  C  Q +PVFDPA S +F  VS  +
Sbjct: 101 YAVAVGVGTEHGYENYELEMDMAAGFSWMQCAPCHPCLPQLNPVFDPAKSPTFRPVSGHN 160

Query: 214 AVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGR-----TVVKNVAIGCGHK- 267
           AV  R        GRC + ++Y +G+   G LA +T +          +  +  GC ++ 
Sbjct: 161 AVLCRPPYHPLQDGRCGFGIAYRNGASAAGYLARDTFSFPTGDNNFQHLPGIVFGCANRI 220

Query: 268 ----NQGMFVGAAGLLGLGGGSMSLVG---QLGGQTGGAFSYCLVSRGTGSSGSLVFGRE 320
                 G   G  G +G+G     L G   QL    GG FSYC +  GT +   L FG +
Sbjct: 221 ARFDTHGALAGVLG-MGMGAEGKPLTGFMRQLYHNGGGRFSYCPIVPGTTAYSFLRFGND 279

Query: 321 ---ALPVGAAWVPL-VRNPRAPS-FYYVGLSGLGVGGMRIP-ISEDLFRLTQMGDDGVVM 374
                P G     + V  P   S  YYV L+G+ VG +R+P ++ ++F   Q G  G  +
Sbjct: 280 IPSQPPAGVHRQSMAVLAPTTTSEAYYVKLAGISVGALRVPGVTPEMFERDQHGRGGCAI 339

Query: 375 DTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSI----FDTCYNLSGFVSVRVPTVSF 430
           D GT +T +   AY     A     G+L R     +       C + +  +  R+P+++ 
Sbjct: 340 DIGTKMTAIVQTAYAHVEAAV---RGHLQRNRARFVQSPGHHLCVHRTPAIEERLPSMTL 396

Query: 431 YFSGGPVLTL-PASNFLI---PVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANG 486
           +F GGP L + P   FL+   P       C    P  + +++IG +QQ   +  FD  N 
Sbjct: 397 HFVGGPWLRVKPQHLFLVVGSPTGGGEYLCLGLVPD-AEMTVIGAMQQIDTRFIFDLHNN 455

Query: 487 --FVGFGPNVC 495
              V F P  C
Sbjct: 456 IPIVSFNPEDC 466


>gi|168042409|ref|XP_001773681.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162675069|gb|EDQ61569.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 399

 Score =  131 bits (330), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 107/376 (28%), Positives = 173/376 (46%), Gaps = 46/376 (12%)

Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD------PVFDPADSASF 206
           +G Y+ +I +G+PP   Y+ +D+GSD+ W+ C PC+ C  ++         +DP+ S++ 
Sbjct: 34  TGLYYTKIYLGTPPVGYYVQVDTGSDVTWLNCAPCTSCVTETQLPSIKLTTYDPSRSSTD 93

Query: 207 SGVSCSSAVCDRL----ENAGCHAGRCRYEVSYGDGSYTKGTLALETLTI----GRTVVK 258
             +SC  + C       E +   AG C Y  +YGDGS T+G    + +T       T V 
Sbjct: 94  GALSCRDSNCGAALGSNEVSCTSAGYCAYSTTYGDGSSTQGYFIQDVMTFQEIHNNTQVN 153

Query: 259 ---NVAIGCGHKNQGMFVGAA----GLLGLGGGSMSLVGQLG--GQTGGAFSYCLVSRGT 309
              +V  GCG    G  + ++    GL+G G  ++S+  QL   G+ G  F++CL     
Sbjct: 154 GTASVYFGCGTTQSGNLLMSSRALDGLIGFGQAAVSIPSQLASMGKVGNRFAHCLQGDNQ 213

Query: 310 GSSGSLVFGREALPVGAAWVPLV-RNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMG 368
           G  G++V G  + P   ++ P+V RN      Y VG+  + V G  +      F  T   
Sbjct: 214 G-GGTIVIGSVSEP-NISYTPIVSRN-----HYAVGMQNIAVNGRNVTTPAS-FDTTSTS 265

Query: 369 DDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGF-VSVRVPT 427
             GV+MD+GT +  L  PAY  F +A      +   +S  S    C  L+   +    PT
Sbjct: 266 AGGVIMDSGTTLAYLVDPAYTQFVNAV-----STFESSMFSSHSQCLQLAWCSLQADFPT 320

Query: 428 VSFYFSGGPVLTLPASNFLI--PVDDA-GTFCFAFAPSPS-----GLSIIGNIQQEGIQI 479
           V  +F  G V+ L   N+L   P+ +    +C  +  S +       SI+G+I  +   +
Sbjct: 321 VKLFFDAGAVMNLTPRNYLYSQPLQNGQAAYCMGWQKSTTKAGYLSYSILGDIVLKDHLV 380

Query: 480 SFDGANGFVGFGPNVC 495
            +D  N  VG+    C
Sbjct: 381 VYDNDNRVVGWKSFDC 396


>gi|242085924|ref|XP_002443387.1| hypothetical protein SORBIDRAFT_08g018620 [Sorghum bicolor]
 gi|241944080|gb|EES17225.1| hypothetical protein SORBIDRAFT_08g018620 [Sorghum bicolor]
          Length = 460

 Score =  131 bits (329), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 122/453 (26%), Positives = 191/453 (42%), Gaps = 68/453 (15%)

Query: 81  LELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQD 140
           LEL H D               +   +   RM+R  +R    +  ++GGG +A+      
Sbjct: 35  LELTHVDA--------------KQNCTTKERMRRATERTHRRLASMAGGGGEAS------ 74

Query: 141 FGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQ--CYKQSDPVF 198
                 + +     +Y     +G PP+    +ID+GS+++W QC  C    C+ Q    +
Sbjct: 75  ------APIHWNETQYIAEYLIGDPPQQAAAIIDTGSNLIWTQCSTCRANGCFGQDLTFY 128

Query: 199 DPADSASFSGVSCSSAVCDRLENAGC--HAGRCRYEVSYGDGSYTKGTLALETLTI--GR 254
           DP+ S +   V+C+   C       C      C    +YG G+   G L  E  T   G+
Sbjct: 129 DPSRSRTAKPVACNDTACLLGSETRCARDGKACAVLTAYGAGA-IGGFLGTEVFTFGHGQ 187

Query: 255 TVVKNV--AIGC---GHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGT 309
           +   NV  A GC        G   GA+G++GLG G +SL  QLG      FSYCL    +
Sbjct: 188 SSENNVSLAFGCITASRLTPGSLDGASGIIGLGRGKLSLPSQLGDNK---FSYCLTPYFS 244

Query: 310 GSSGSLVF------GREALPVGAAWVPLVRNPRA---PSFYYVGLSGLGVGGMRIPISED 360
            ++ +         G       A  VP ++NP      SFYY+ L+G+ VG  ++ +   
Sbjct: 245 DAANTSTLFVGASAGLSGGGAPATSVPFLKNPDDDPFDSFYYLPLTGITVGTAKLDVPAA 304

Query: 361 LFRLTQMGDD---GVVMDTGTAVTRLPTPAYEAFRDAFVAQTGN--LPRASGVSIFDTCY 415
            F L ++      G ++D+G+  T L   AY+A RD  V Q G   +P  +G    D C 
Sbjct: 305 AFDLREVAPAKWGGTLIDSGSPFTSLIDVAYQALRDELVRQLGASVVPPPAGAEGLDLCV 364

Query: 416 N--LSGFVSVRVPTVSFYFSGGPV----LTLPASNFLIPVDDAGTFCFAFA---PSPS-- 464
                G     VP +  +F  G      + +P  N+  PVDD+      F+   P+ +  
Sbjct: 365 GGVAPGDAGKLVPPLVLHFGSGGGGGGDVVVPPENYWGPVDDSTACMVVFSSGGPNSTLP 424

Query: 465 --GLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
               +IIGN  Q+ + + +D   G + F P  C
Sbjct: 425 LNETTIIGNYMQQDMHLLYDLGQGVLSFQPADC 457


>gi|255566008|ref|XP_002523992.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223536719|gb|EEF38360.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 442

 Score =  131 bits (329), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 121/439 (27%), Positives = 190/439 (43%), Gaps = 48/439 (10%)

Query: 74  SDEARWNLELVHRDKMSS----SSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGG 129
           +++  +  EL+HRD  +S    +S TT+             R+   V+R A  V R +  
Sbjct: 32  AEKLSFTTELIHRDSPNSPLFNASETTD------------IRLANAVERSADRVNRFN-- 77

Query: 130 GADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQ 189
             D   + +     +  S +D G  ++ ++I +G PP    + + +GSD+VW+ C     
Sbjct: 78  --DLISNSIT--AAEFPSILDNG--DFLMKISIGIPPTELLVNVATGSDLVWIPCLSFKP 131

Query: 190 CYKQSD-PVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVS-YGDGSYTKGTLAL 247
           C    D   FDP +S+++  V C S  C     A C    C Y        S   G LA+
Sbjct: 132 CTHNCDLRFFDPMESSTYKNVPCDSYRCQITNAATCQFSDCFYSCDPRHQDSCPDGDLAM 191

Query: 248 ETLTIGRT-----VVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSY 302
           +TLT+  T     ++ N    CG++  G + G  G+LGLG GS+SL+ ++     G FS+
Sbjct: 192 DTLTLNSTTGKSFMLPNTGFICGNRIGGDYPGV-GILGLGHGSLSLLNRISHLIDGKFSH 250

Query: 303 CLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPR-APSFYYVGLSGLGVGGMRIP---IS 358
           C+V   +  +  L FG +A+  G+A      +    P  Y +   G+ VG   I    I 
Sbjct: 251 CIVPYSSNQTSKLSFGDKAVVSGSAMFSTRLDMTGGPYSYTLSFYGISVGNKSISAGGIG 310

Query: 359 EDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFR-DAFVAQTGNLPRASGVSIFDTCYNL 417
            D +       +G+ MD+GT  T  P   Y     D   A                CY  
Sbjct: 311 SDYYM------NGLGMDSGTMFTYFPEYFYSQLEYDVRYAIQQEPLYPDPTRRLRLCYRY 364

Query: 418 SGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGL-SIIGNIQQEG 476
           S   S   PT++ +F GG V    +++F+   +D    C AFA S S   ++ G  QQ  
Sbjct: 365 SPDFS--PPTITMHFEGGSVELSSSNSFIRMTED--IVCLAFATSSSEQDAVFGYWQQTN 420

Query: 477 IQISFDGANGFVGFGPNVC 495
           + I +D   GF+ F    C
Sbjct: 421 LLIGYDLDAGFLSFLKTDC 439


>gi|242053991|ref|XP_002456141.1| hypothetical protein SORBIDRAFT_03g031170 [Sorghum bicolor]
 gi|241928116|gb|EES01261.1| hypothetical protein SORBIDRAFT_03g031170 [Sorghum bicolor]
          Length = 519

 Score =  131 bits (329), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 121/433 (27%), Positives = 189/433 (43%), Gaps = 81/433 (18%)

Query: 139 QDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQ-----PCSQCYKQ 193
           + F   + SG   G+G+YFVR  VG+P R   +V D+GSD+ WV+C        +  Y  
Sbjct: 90  EAFAMPLSSGAYTGTGQYFVRFRVGTPARPFLLVADTGSDLTWVKCHRHDHDAPAPGYGY 149

Query: 194 SDP----------------------VFDPADSASFSGVSCSSAVCDR---LENAGCHA-- 226
           + P                      VF P  S +++ + CSS  C        A C    
Sbjct: 150 AAPASNDSSTSSLSAAAASSSSHARVFRPDRSRTWAPIPCSSDTCTASLPFSLAACPTPG 209

Query: 227 GRCRYEVSYGDGSYTKGTLALETLTIG-----------RTVVKNVAIGCGHKNQG-MFVG 274
             C Y+  Y DGS  +GT+  ++ TI            +  ++ V +GC     G  F+ 
Sbjct: 210 SPCAYDYRYKDGSAARGTVGTDSATIALSGRGAKKKQRQAKLRGVVLGCTTSYTGDSFLA 269

Query: 275 AAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR--GTGSSGSLVFG-------------- 318
           + G+L LG  ++S   +   + GG FSYCLV       ++  L FG              
Sbjct: 270 SDGVLSLGYSNISFASRAAARFGGRFSYCLVDHLAPRNATSYLTFGPNPAVSSSPPSKTA 329

Query: 319 ---------REALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGD 369
                        P GA   PL+ + R   FY V ++G+ V G  + I   ++ + + G 
Sbjct: 330 CAGGGSPAAAPPGPGGARQTPLLLDHRMRPFYAVTVNGISVDGELLRIPRLVWDVAKGG- 388

Query: 370 DGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGF-----VSVR 424
            G ++D+GT++T L +PAY A   A   +   LPR + +  FD CYN +       ++V 
Sbjct: 389 -GAILDSGTSLTVLVSPAYRAVVAALNKKLAGLPRVT-MDPFDYCYNWTSPSTGEDLTVA 446

Query: 425 VPTVSFYFSGGPVLTLPASNFLIPVDDA-GTFCFAFAPSP-SGLSIIGNIQQEGIQISFD 482
           +P ++ +F+G   L  PA +++I  D A G  C         G+S+IGNI Q+     FD
Sbjct: 447 MPELAVHFAGSARLQPPAKSYVI--DAAPGVKCIGLQEGEWPGVSVIGNILQQEHLWEFD 504

Query: 483 GANGFVGFGPNVC 495
             N  + F  + C
Sbjct: 505 LKNRRLRFKRSRC 517


>gi|302803839|ref|XP_002983672.1| hypothetical protein SELMODRAFT_118648 [Selaginella moellendorffii]
 gi|300148509|gb|EFJ15168.1| hypothetical protein SELMODRAFT_118648 [Selaginella moellendorffii]
          Length = 388

 Score =  131 bits (329), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 114/369 (30%), Positives = 165/369 (44%), Gaps = 45/369 (12%)

Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD---PV--FDPADSASFS 207
           +G YF ++ +G+PPR+  + +D+GSD++WV C PC  C   SD   P+  +D   SAS S
Sbjct: 33  AGLYFTQVQLGTPPRTYNLQVDTGSDLLWVNCHPCIGCPAFSDLKIPIVPYDVKASASSS 92

Query: 208 GVSCSSAVC---DRLENAGCH-AGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIG 263
            V CS   C    ++  +GC+   +C Y   YGDGS T G L  + L         V  G
Sbjct: 93  KVPCSDPSCTLITQISESGCNDQNQCGYSFQYGDGSGTLGYLVEDVLHYMVNATATVIFG 152

Query: 264 CGHKNQGMFVGAA----GLLGLGGGSMSLVGQLG--GQTGGAFSYCLVSRGTGSSGSLVF 317
           CG K  G    +     G++G G   +S   QL   G+T   F++CL   G    G LV 
Sbjct: 153 CGFKQSGDLSTSERALDGIIGFGASDLSFNSQLAKQGKTPNVFAHCL-DGGERGGGILVL 211

Query: 318 GREALPVGAAWVPLVRNPRAPSFYY--VGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMD 375
           G    P    + PLV     P  Y+  V L  + V    + I   LF    M   G + D
Sbjct: 212 GNVIEP-DIQYTPLV-----PYMYHYNVVLQSISVNNANLTIDPKLFSNDVM--QGTIFD 263

Query: 376 TGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTC-YNLSGFVSVRVPTVSFYFSG 434
           +GT +  LP  AY+AF  A          +  V+ F  C   LS F+    P V  YF G
Sbjct: 264 SGTTLAYLPDEAYQAFTQAV---------SLVVAPFLLCDTRLSRFIYKLFPNVVLYFEG 314

Query: 435 GPVLTLPASNFLI---PVDDAGTFCFAF-----APSPSGLSIIGNIQQEGIQISFDGANG 486
              +TL  + +LI      +A  +C  +     A S    +I G++  +   + +D   G
Sbjct: 315 AS-MTLTPAEYLIRQASAANAPIWCMGWQSMGSAESELQYTIFGDLVLKNKLVVYDLERG 373

Query: 487 FVGFGPNVC 495
            +G+ P  C
Sbjct: 374 RIGWRPFDC 382


>gi|54290725|dbj|BAD62395.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
          Length = 500

 Score =  131 bits (329), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 110/367 (29%), Positives = 160/367 (43%), Gaps = 37/367 (10%)

Query: 155 EYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC---SQCYKQSDPVFDPADSASFSGVSC 211
           +Y V +G G+P +   M  D+G  I  V+C  C   + C   +   FDP+ S++F+ V C
Sbjct: 145 DYTVVVGYGTPAQQLAMAFDTGLGISLVRCAACRPGAPCDGLAS--FDPSRSSTFAPVPC 202

Query: 212 SSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTV-VKNVAIGCGHKNQG 270
            S  C     +GC +G            +  G +A + LT+  +  V +   GC   + G
Sbjct: 203 GSPDC----RSGCSSGSTP-SCPLTSFPFLSGAVAQDVLTLTPSASVDDFTFGCVEGSSG 257

Query: 271 MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVG----- 325
             +GAAGLL L   S S+  +L    GG FSYCL    T S G L  G   +P       
Sbjct: 258 EPLGAAGLLDLSRDSRSVASRLAADAGGTFSYCLPLSTTSSHGFLAIGEADVPHNRTARV 317

Query: 326 AAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPT 385
            A  PLV +P  P+ Y + L+G+ +GG  IPI              +V+DT    T +  
Sbjct: 318 TAVAPLVYDPAFPNHYVIDLAGVSLGGRDIPIPPH----AATASAAMVLDTALPYTYMKP 373

Query: 386 PAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFV-SVRVPTVSFYF-----SGGPVLT 439
             Y   RDAF       PRA  +   DTCYN +G    V +P V   F      GG  + 
Sbjct: 374 SMYAPLRDAFRRAMARYPRAPAMGDLDTCYNFTGVRHEVLIPLVHLTFRGIGGGGGGQVL 433

Query: 440 LPASNFLIPVDDAGTF----CFAFAPSPSG-------LSIIGNIQQEGIQISFDGANGFV 488
              ++ +  + + G F    C AFA  PS          ++G + Q  +++  D   G +
Sbjct: 434 GLGADQMFYMSEPGNFFSVTCLAFAALPSDGDAEAPLAMVMGTLAQSSMEVVHDVPGGKI 493

Query: 489 GFGPNVC 495
           GF P  C
Sbjct: 494 GFIPGSC 500


>gi|115441003|ref|NP_001044781.1| Os01g0844500 [Oryza sativa Japonica Group]
 gi|19571042|dbj|BAB86469.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
           Group]
 gi|20160609|dbj|BAB89555.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
           Group]
 gi|113534312|dbj|BAF06695.1| Os01g0844500 [Oryza sativa Japonica Group]
 gi|125572614|gb|EAZ14129.1| hypothetical protein OsJ_04051 [Oryza sativa Japonica Group]
          Length = 442

 Score =  130 bits (328), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 113/371 (30%), Positives = 164/371 (44%), Gaps = 40/371 (10%)

Query: 158 VRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYK--QSDPVFDPADSASFSGVSCSSAV 215
           V + VG+PP++  MV+D+GS++ W+ C P        +S   F P  S +F+ V C SA 
Sbjct: 68  VSLAVGTPPQNVTMVLDTGSELSWLLCAPGGGGGGGGRSALSFRPRASLTFASVPCDSAQ 127

Query: 216 C---DRLENAGCH--AGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGC---GHK 267
           C   D      C   + +CR  +SY DGS + G LA E  T+G+      A GC      
Sbjct: 128 CRSRDLPSPPACDGASKQCRVSLSYADGSSSDGALATEVFTVGQGPPLRAAFGCMATAFD 187

Query: 268 NQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALP-VGA 326
                V  AGLLG+  G++S V Q   +    FSYC+  R    +G L+ G   LP +  
Sbjct: 188 TSPDGVATAGLLGMNRGALSFVSQASTRR---FSYCISDR--DDAGVLLLGHSDLPFLPL 242

Query: 327 AWVPLVRNPRAPSFYY------VGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAV 380
            + PL + P  P  Y+      V L G+ VGG  +PI   +      G    ++D+GT  
Sbjct: 243 NYTPLYQ-PAMPLPYFDRVAYSVQLLGIRVGGKPLPIPASVLAPDHTGAGQTMVDSGTQF 301

Query: 381 TRLPTPAYEAFRDAFVAQTG------NLPRASGVSIFDTCYNLSG--FVSVRVPTVSFYF 432
           T L   AY A +  F  QT       N P  +    FDTC+ +        R+P V+  F
Sbjct: 302 TFLLGDAYSALKAEFSRQTKPWLPALNDPNFAFQEAFDTCFRVPQGRAPPARLPAVTLLF 361

Query: 433 SGGPVLTLPASNFLIPVDDA-----GTFCFAFAPS---PSGLSIIGNIQQEGIQISFDGA 484
           +G   +T+     L  V        G +C  F  +   P    +IG+  Q  + + +D  
Sbjct: 362 NGA-QMTVAGDRLLYKVPGERRGGDGVWCLTFGNADMVPITAYVIGHHHQMNVWVEYDLE 420

Query: 485 NGFVGFGPNVC 495
            G VG  P  C
Sbjct: 421 RGRVGLAPIRC 431


>gi|388518257|gb|AFK47190.1| unknown [Lotus japonicus]
          Length = 478

 Score =  130 bits (328), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 107/379 (28%), Positives = 170/379 (44%), Gaps = 39/379 (10%)

Query: 147 SGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPA 201
           +G+   +G YF +IG+GSP +  Y+ +D+GSDI+WV C  C++C ++SD      ++DP 
Sbjct: 60  NGLPTVTGLYFTKIGLGSPSKDYYVQVDTGSDILWVNCVECTRCPRKSDIGIGLTLYDPK 119

Query: 202 DSASFSGVSCSSAVCDRLENA---GCHAGR-CRYEVSYGDGSYTKGTLALETLTIGR--- 254
            S +   VSC    C         GC A   C Y +SYGDGS T G    + LT  R   
Sbjct: 120 RSKTSEFVSCEHNFCSSTYEGRILGCKAENPCPYSISYGDGSATTGYYVQDYLTFNRVNG 179

Query: 255 ---TVVKNVAI--GCGHKNQGMFVGAA-----GLLGLGGGSMSLVGQLG--GQTGGAFSY 302
              T  +N +I  GCG    G F  ++     G++G G  + S++ QL   G+    FS+
Sbjct: 180 NPHTATQNSSIIFGCGAAQSGTFASSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSH 239

Query: 303 CLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLF 362
           CL    T   G +    E +       PLV N    + Y V L  + V G  + +  D F
Sbjct: 240 CL---DTNVGGGIFSIGEVVEPKVKTTPLVPN---MAHYNVILKNIEVDGDILQLPSDTF 293

Query: 363 RLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVS 422
             ++ G  G V+D+GT +  LP   Y+      +A+   L +   V    +C+  +G V 
Sbjct: 294 D-SENG-KGTVIDSGTTLAYLPRIVYDQLMSKVLAKQPRL-KVYLVEEQYSCFQYTGNVD 350

Query: 423 VRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPS------GLSIIGNIQQEG 476
              P V  +F     LT+   ++L        +C  +  S S       ++++G+     
Sbjct: 351 SGFPIVKLHFEDSLSLTVYPHDYLFNYKGDSYWCIGWQKSASETKNGKDMTLLGDFVLSN 410

Query: 477 IQISFDGANGFVGFGPNVC 495
             + +D  N  +G+    C
Sbjct: 411 KLVVYDLENMTIGWTDYNC 429


>gi|218184944|gb|EEC67371.1| hypothetical protein OsI_34484 [Oryza sativa Indica Group]
          Length = 396

 Score =  130 bits (328), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 105/363 (28%), Positives = 159/363 (43%), Gaps = 38/363 (10%)

Query: 156 YFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAV 215
           Y     +G+PP+    ++D   ++VW QC  C +C+KQ  PVF P  S++F    C +AV
Sbjct: 45  YVANFTIGTPPQPASAIVDVAGELVWTQCSACRRCFKQDLPVFVPNASSTFKPEPCGTAV 104

Query: 216 CDRLENAGCHAGRCRYEVSYGDGSY----TKGTLALETLTIGRTVVKNVAIGC-GHKNQG 270
           C+ +    C    C Y+   G  +     T G  A +T  IG   V+ +A GC    +  
Sbjct: 105 CESIPTRSCSGDVCSYK---GPPTQLRGNTSGFAATDTFAIGTATVR-LAFGCVVASDID 160

Query: 271 MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGA---A 327
              G +G +GLG    SLV Q+       FSYCL  R TG S  L  G  A   G+   +
Sbjct: 161 TMDGPSGFIGLGRTPWSLVAQMKLTR---FSYCLSPRNTGKSSRLFLGSSAKLAGSESTS 217

Query: 328 WVPLVR---NPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLP 384
             P ++   +    ++Y + L  +  G   I         T      +VM T +  + L 
Sbjct: 218 TAPFIKTSPDDDGSNYYLLSLDAIRAGNTTI--------ATAQSGGILVMHTVSPFSLLV 269

Query: 385 TPAYEAFRDAFVAQTG---NLPRASGVSIFDTCY-NLSGFVSVRVPTVSFYFSGGPVLTL 440
             AY+AF+ A     G     P A+    FD C+   +GF     P + F F G   LT+
Sbjct: 270 DSAYKAFKKAVTEAVGGAAAPPMATPPQPFDLCFKKAAGFSRATAPDLVFTFQGAAALTV 329

Query: 441 PASNFLIPV-DDAGTFCFAFAPSP-------SGLSIIGNIQQEGIQISFDGANGFVGFGP 492
           P + +LI V ++  T C A             G+S++G++QQE +   +D     + F P
Sbjct: 330 PPAKYLIDVGEEKDTACAAILSMAWLNRTGLEGVSVLGSLQQEDVHFLYDLKKETLSFEP 389

Query: 493 NVC 495
             C
Sbjct: 390 ADC 392


>gi|413937238|gb|AFW71789.1| hypothetical protein ZEAMMB73_638381 [Zea mays]
          Length = 598

 Score =  130 bits (328), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 88/246 (35%), Positives = 117/246 (47%), Gaps = 11/246 (4%)

Query: 256 VVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGS 314
           VV     GC     G  V   GL+G G G +S   Q     G  FSYCL S + +  S +
Sbjct: 356 VVAAYTFGCLRVVTGGSVPPQGLVGFGCGPLSFPSQNKDVYGFVFSYCLPSYKSSNFSST 415

Query: 315 LVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVM 374
           L  G    P      PL+ NP  PS YYV + G+ VGG  + +             G ++
Sbjct: 416 LRLGPAGQPKRIKMTPLLSNPHRPSLYYVNMVGIHVGGRPMLVPASALAFDPASGRGTIV 475

Query: 375 DTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSG 434
           D GT  TRL  P Y A RD F ++    P    +  FDTCYN    V++ VPTV+F F G
Sbjct: 476 DAGTMFTRLSAPVYAAVRDVFRSRV-RAPVTGPLGGFDTCYN----VTISVPTVTFSFDG 530

Query: 435 GPVLTLPASNFLIPVDDAGTFCFAFAPSPSG-----LSIIGNIQQEGIQISFDGANGFVG 489
              +TLP  N +I     G  C A A  PS      L+++ ++QQ+  ++ FD ANG VG
Sbjct: 531 RVSVTLPEENVVIRSSSDGIACLAMAAGPSDGVDAVLNVLASMQQQNHRVLFDVANGRVG 590

Query: 490 FGPNVC 495
           F   +C
Sbjct: 591 FSRELC 596


>gi|356546036|ref|XP_003541438.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 486

 Score =  130 bits (328), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 117/422 (27%), Positives = 185/422 (43%), Gaps = 49/422 (11%)

Query: 104 HQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVG 163
           H+    A   RD  R A ++R ++GG  D +     D             G Y+ ++ +G
Sbjct: 35  HRVEVAALKARDRARHARMLRGVAGGVVDFSVQGTSD---------PNSVGLYYTKVKMG 85

Query: 164 SPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFSGVSCSSAVC-D 217
           +PP+   + ID+GSDI+WV C  CS C + S        FD   S++ + + CS  +C  
Sbjct: 86  TPPKEFNVQIDTGSDILWVNCNTCSNCPQSSQLGIELNFFDTVGSSTAALIPCSDPICTS 145

Query: 218 RLENAGCH----AGRCRYEVSYGDGSYTKGTLALE----TLTIGRTVVKN----VAIGCG 265
           R++ A         +C Y   YGDGS T G    +    +L +G+    N    +  GC 
Sbjct: 146 RVQGAAAECSPRVNQCSYTFQYGDGSGTSGYYVSDAMYFSLIMGQPPAVNSSATIVFGCS 205

Query: 266 HKNQGMFV----GAAGLLGLGGGSMSLVGQLG--GQTGGAFSYCLVSRGTGSSGSLVFGR 319
               G          G+ G G G +S+V QL   G T   FS+CL  +G G  G ++   
Sbjct: 206 ISQSGDLTKTDKAVDGIFGFGPGPLSVVSQLSSRGITPKVFSHCL--KGDGDGGGVLVLG 263

Query: 320 EALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTA 379
           E L     + PLV  P  P  Y + L  + V G  +PI+  +F ++     G ++D GT 
Sbjct: 264 EILEPSIVYSPLV--PSQP-HYNLNLQSIAVNGQLLPINPAVFSISN-NRGGTIVDCGTT 319

Query: 380 VTRLPTPAYEAFRDAF---VAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGP 436
           +  L   AY+    A    V+Q+     + G    + CY +S  +    P+VS  F GG 
Sbjct: 320 LAYLIQEAYDPLVTAINTAVSQSARQTNSKG----NQCYLVSTSIGDIFPSVSLNFEGGA 375

Query: 437 VLTLPASNFLIP---VDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPN 493
            + L    +L+    +D A  +C  F     G SI+G++  +   + +D A   +G+   
Sbjct: 376 SMVLKPEQYLMHNGYLDGAEMWCIGFQKFQEGASILGDLVLKDKIVVYDIAQQRIGWANY 435

Query: 494 VC 495
            C
Sbjct: 436 DC 437


>gi|449511696|ref|XP_004164029.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 639

 Score =  130 bits (327), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 98/358 (27%), Positives = 166/358 (46%), Gaps = 27/358 (7%)

Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
           +G Y  R+ +GSPP+   +++D+GS + +V C  C QC    DP F P  S+++  V C 
Sbjct: 86  NGYYTTRLWIGSPPQEFALIVDTGSTVTYVPCSNCVQCGNHQDPRFQPELSSTYQPVKC- 144

Query: 213 SAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT---VVKNVAIGCGHKNQ 269
           +A C+  EN      +C YE  Y + S + G LA + ++ G+    V +    GC     
Sbjct: 145 NADCNCDEN----GVQCTYERRYAEMSTSSGVLAEDVMSFGKESELVPQRAVFGCETMES 200

Query: 270 GMFVG--AAGLLGLGGGSMSLVGQLGGQ--TGGAFSYCLVSRGTGSSGSLVFGREALPVG 325
           G      A G++GLG G++S++ QL G+     +FS C      G  G++V G  + P G
Sbjct: 201 GDLYTQRADGIMGLGRGTLSVMDQLVGKGVVSNSFSLCYGGMDVG-GGAMVLGGISSPPG 259

Query: 326 AAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPT 385
             +     +P    +Y + L  + V G  + ++   F     G  G ++D+GT     P 
Sbjct: 260 MVFSH--SDPSRSPYYNIELKEIHVAGKPLKLNPRTFD----GKYGAILDSGTTYAYFPE 313

Query: 386 PAYEAFRDAFVAQTGNLPRASG--VSIFDTCYNLSGFVSVRVPT----VSFYFSGGPVLT 439
            AY AF+DA + +   L + SG   +  D C++ +G     +P     V   F+ G  ++
Sbjct: 314 KAYYAFKDAIMKKISFLKQISGPDPNFKDICFSGAGRDVTELPKVFPEVDMVFANGQKIS 373

Query: 440 LPASNFLIP-VDDAGTFCFA-FAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           L   N+L      +G +C   F       +++G I      ++++  N  +GF    C
Sbjct: 374 LSPENYLFRHTKVSGAYCLGIFKNGNDQTTLLGGIIVRNTLVTYNRENSTIGFWKTNC 431


>gi|297827577|ref|XP_002881671.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297327510|gb|EFH57930.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 438

 Score =  130 bits (327), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 115/377 (30%), Positives = 173/377 (45%), Gaps = 54/377 (14%)

Query: 158 VRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVC- 216
           V + VGSPP++  MV+D+GS++ W+ C+           VF+P  S+++S V CSS +C 
Sbjct: 63  VTLAVGSPPQNISMVLDTGSELSWLHCKKSPNL----GSVFNPVSSSTYSPVPCSSPICR 118

Query: 217 ----DRLENAGCHAGR--CRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHK--- 267
               D    A C      C   +SY D +  +G LA +T  IG         GC      
Sbjct: 119 TRTRDLPIPASCDPKTHFCHVAISYADATSIEGNLAHDTFVIGSVTRPGTLFGCMDSGLS 178

Query: 268 -NQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREAL---- 322
            +      + GL+G+  GS+S V QLG      FSYC+   G+ SSG L+ G  +     
Sbjct: 179 SDSEEDAKSTGLMGMNRGSLSFVNQLGFS---KFSYCI--SGSDSSGILLLGDASYSWLG 233

Query: 323 PVGAAWVPLV-RNPRAPSF----YYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTG 377
           P+   + PLV +    P F    Y V L G+ VG   + + + +F     G    ++D+G
Sbjct: 234 PI--QYTPLVLQTTPLPYFDRVAYTVQLEGIRVGSKILSLPKSVFVPDHTGAGQTMVDSG 291

Query: 378 TAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIF------DTCYNLSGFVSVR-----VP 426
           T  T L  P Y A ++ F+AQT ++ R      F      D CY +    S R     +P
Sbjct: 292 TQFTFLMGPVYTALKNEFIAQTKSVLRIVDDPNFVFQGTMDLCYRVGS--STRPNFTGLP 349

Query: 427 TVSFYFSGGPVLTLPASNFLIPVDDAGT------FCFAFAPSP-SGLS--IIGNIQQEGI 477
            +S  F G   +++     L  V+ AG+      +CF F  S   G+   +IG+  Q+ +
Sbjct: 350 VISLMFRGAE-MSVSGQKLLYRVNGAGSEGKEEVYCFTFGNSDLLGIEAFVIGHHHQQNV 408

Query: 478 QISFDGANGFVGFGPNV 494
            + FD A   VGF  NV
Sbjct: 409 WMEFDLAKSRVGFAGNV 425


>gi|125528357|gb|EAY76471.1| hypothetical protein OsI_04407 [Oryza sativa Indica Group]
          Length = 441

 Score =  130 bits (327), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 113/371 (30%), Positives = 164/371 (44%), Gaps = 40/371 (10%)

Query: 158 VRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYK--QSDPVFDPADSASFSGVSCSSAV 215
           V + VG+PP++  MV+D+GS++ W+ C P        +S   F P  S +F+ V C SA 
Sbjct: 67  VSLAVGTPPQNVTMVLDTGSELSWLLCAPGGGGGGGGRSALSFRPRASLTFASVPCGSAQ 126

Query: 216 C---DRLENAGCH--AGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGC---GHK 267
           C   D      C   + +CR  +SY DGS + G LA E  T+G+      A GC      
Sbjct: 127 CRSRDLPSPPACDGASKQCRVSLSYADGSSSDGALATEVFTVGQGPPLRAAFGCMATAFD 186

Query: 268 NQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALP-VGA 326
                V  AGLLG+  G++S V Q   +    FSYC+  R    +G L+ G   LP +  
Sbjct: 187 TSPDGVATAGLLGMNRGALSFVSQASTRR---FSYCISDR--DDAGVLLLGHSDLPFLPL 241

Query: 327 AWVPLVRNPRAPSFYY------VGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAV 380
            + PL + P  P  Y+      V L G+ VGG  +PI   +      G    ++D+GT  
Sbjct: 242 NYTPLYQ-PAMPLPYFDRVAYSVQLLGIRVGGKPLPIPASVLAPDHTGAGQTMVDSGTQF 300

Query: 381 TRLPTPAYEAFRDAFVAQTG------NLPRASGVSIFDTCYNLSG--FVSVRVPTVSFYF 432
           T L   AY A +  F  QT       N P  +    FDTC+ +        R+P V+  F
Sbjct: 301 TFLLGDAYSALKAEFSRQTKPWLPALNDPNFAFQEAFDTCFRVPQGRAPPARLPAVTLLF 360

Query: 433 SGGPVLTLPASNFLIPVDDA-----GTFCFAFAPS---PSGLSIIGNIQQEGIQISFDGA 484
           +G   +T+     L  V        G +C  F  +   P    +IG+  Q  + + +D  
Sbjct: 361 NGA-QMTVAGDRLLYKVPGERRGGDGVWCLTFGNADMVPITAYVIGHHHQMNVWVEYDLE 419

Query: 485 NGFVGFGPNVC 495
            G VG  P  C
Sbjct: 420 RGRVGLAPIRC 430


>gi|21717169|gb|AAM76362.1|AC074196_20 putative nucleoid DNA binding protein, 3'-partial [Oryza sativa
           Japonica Group]
          Length = 377

 Score =  130 bits (327), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 103/339 (30%), Positives = 156/339 (46%), Gaps = 34/339 (10%)

Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSS 213
           G Y     +G+PP+    V+D   ++VW QC PC  C++Q  P+FDP  S++F G+ C S
Sbjct: 55  GLYVANFTIGTPPQPVSAVVDLTGELVWTQCTPCQPCFEQDLPLFDPTKSSTFRGLPCGS 114

Query: 214 AVCDRLENA--GCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGC---GHKN 268
            +C+ +  +   C +  C YE     G  T G    +T  IG    + +  GC     K 
Sbjct: 115 HLCESIPESSRNCTSDVCIYEAPTKAGD-TGGKAGTDTFAIG-AAKETLGFGCVVMTDKR 172

Query: 269 QGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGA-- 326
                G +G++GLG    SLV Q+      AFSYCL  +   SSG+L  G  A  +    
Sbjct: 173 LKTIGGPSGIVGLGRTPWSLVTQMNVT---AFSYCLAGK---SSGALFLGATAKQLAGGK 226

Query: 327 -AWVPLVRNPRAPS-------FYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGT 378
            +  P V    A S       +Y V L+G+  GG  +       +        V++DT +
Sbjct: 227 NSSTPFVIKTSAGSSDNGSNPYYMVKLAGIKTGGAPL-------QAASSSGSTVLLDTVS 279

Query: 379 AVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVL 438
             + L   AY+A + A  A  G  P AS    +D C+  +  V+   P + F F GG  L
Sbjct: 280 RASYLADGAYKALKKALTAAVGVQPVASPPKPYDLCFPKA--VAGDAPELVFTFDGGAAL 337

Query: 439 TLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGI 477
           T+P +N+L+   + GT C     S S L++ G ++   I
Sbjct: 338 TVPPANYLLASGN-GTVCLTIGSSAS-LNLTGELEGASI 374


>gi|449447285|ref|XP_004141399.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 609

 Score =  130 bits (326), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 98/358 (27%), Positives = 166/358 (46%), Gaps = 27/358 (7%)

Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
           +G Y  R+ +GSPP+   +++D+GS + +V C  C QC    DP F P  S+++  V C 
Sbjct: 86  NGYYTTRLWIGSPPQEFALIVDTGSTVTYVPCSNCVQCGNHQDPRFQPELSSTYQPVKC- 144

Query: 213 SAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT---VVKNVAIGCGHKNQ 269
           +A C+  EN      +C YE  Y + S + G LA + ++ G+    V +    GC     
Sbjct: 145 NADCNCDEN----GVQCTYERRYAEMSTSSGVLAEDVMSFGKESELVPQRAVFGCETMES 200

Query: 270 GMFVG--AAGLLGLGGGSMSLVGQLGGQ--TGGAFSYCLVSRGTGSSGSLVFGREALPVG 325
           G      A G++GLG G++S++ QL G+     +FS C      G  G++V G  + P G
Sbjct: 201 GDLYTQRADGIMGLGRGTLSVMDQLVGKGVVSNSFSLCYGGMDVG-GGAMVLGGISSPPG 259

Query: 326 AAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPT 385
             +     +P    +Y + L  + V G  + ++   F     G  G ++D+GT     P 
Sbjct: 260 MVFSH--SDPSRSPYYNIELKEIHVAGKPLKLNPRTFD----GKYGAILDSGTTYAYFPE 313

Query: 386 PAYEAFRDAFVAQTGNLPRASG--VSIFDTCYNLSGFVSVRVPT----VSFYFSGGPVLT 439
            AY AF+DA + +   L + SG   +  D C++ +G     +P     V   F+ G  ++
Sbjct: 314 KAYYAFKDAIMKKISFLKQISGPDPNFKDICFSGAGRDVTELPKVFPEVDMVFANGQKIS 373

Query: 440 LPASNFLIP-VDDAGTFCFA-FAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           L   N+L      +G +C   F       +++G I      ++++  N  +GF    C
Sbjct: 374 LSPENYLFRHTKVSGAYCLGIFKNGNDQTTLLGGIIVRNTLVTYNRENSTIGFWKTNC 431


>gi|224136884|ref|XP_002326969.1| predicted protein [Populus trichocarpa]
 gi|222835284|gb|EEE73719.1| predicted protein [Populus trichocarpa]
          Length = 626

 Score =  130 bits (326), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 98/359 (27%), Positives = 164/359 (45%), Gaps = 29/359 (8%)

Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
           +G Y  R+ +G+PP+   +++D+GS + +V C  C QC K  DP F P  S+++  V C+
Sbjct: 74  NGYYTTRLFIGTPPQEFALIVDTGSTVTYVPCSSCEQCGKHQDPRFQPDLSSTYRPVKCN 133

Query: 213 -SAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIG---RTVVKNVAIGCGHKN 268
            S  CD          +C YE  Y + S + G +A + ++ G       +    GC +  
Sbjct: 134 PSCNCDD------EGKQCTYERRYAEMSSSSGVIAEDVVSFGNESELKPQRAVFGCENVE 187

Query: 269 QGMFVG--AAGLLGLGGGSMSLVGQL--GGQTGGAFSYCLVSRGTGSSGSLVFGREALPV 324
            G      A G++GLG G +S+V QL   G  G +FS C      G  G++V G+ + P 
Sbjct: 188 TGDLYSQRADGIMGLGRGRLSVVDQLVDKGVIGDSFSLCYGGMDVG-GGAMVLGQISPPP 246

Query: 325 GAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLP 384
              +     NP    +Y + L  L V G  + +   +F        G V+D+GT     P
Sbjct: 247 NMVFS--HSNPYRSPYYNIELKELHVAGKPLKLKPKVFDEKH----GTVLDSGTTYAYFP 300

Query: 385 TPAYEAFRDAFVAQTGNLPRASG--VSIFDTCYNLSG----FVSVRVPTVSFYFSGGPVL 438
             A+ A +DA + +  +L +  G   +  D C++ +G     +S   P V+  F  G  L
Sbjct: 301 EAAFHALKDAIMKEIRHLKQIPGPDPNYHDICFSGAGREVSHLSKVFPEVNMVFGSGQKL 360

Query: 439 TLPASNFLI-PVDDAGTFCFAFAPSPSGL-SIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           +L   N+L      +G +C     + + L +++G I      +++D  N  +GF    C
Sbjct: 361 SLSPENYLFRHTKVSGAYCLGIFQNGNDLTTLLGGIVVRNTLVTYDRENDKIGFWKTNC 419


>gi|148907752|gb|ABR17002.1| unknown [Picea sitchensis]
          Length = 454

 Score =  130 bits (326), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 109/373 (29%), Positives = 168/373 (45%), Gaps = 47/373 (12%)

Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFS 207
           +G Y+ RI +G+PPR  Y+ ID+GSDI+WV C+PC+ C   S        FDP  S++ S
Sbjct: 38  AGLYYTRIELGTPPRPFYVQIDTGSDILWVNCKPCNACPLTSGLGVALNFFDPRGSSTAS 97

Query: 208 GVSCSSAVC---DRLENAGCHAGR-CRYEVSYGDGSYTKGTLALETLT----IGRTVVKN 259
            +SC  + C   +++  + C   R C Y   YGDGS T G    +       + + V  N
Sbjct: 98  PLSCIDSKCVSSNQISESVCTTDRYCGYSFEYGDGSGTLGYYVSDEFDYNQYVNQYVTNN 157

Query: 260 ----VAIGCGHKNQGMFV----GAAGLLGLGGGSMSLVGQLGGQ--TGGAFSYCLVSRGT 309
               +  GC +   G          G+ G G   +S+V QL  Q      FS+CL     
Sbjct: 158 ASAKITFGCSYNQSGDLTKPDRAVDGIFGFGQNDLSVVSQLNSQGLAPKIFSHCLEGADP 217

Query: 310 GSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGD 369
           G  G LV G    P G  + P+V  P  P  Y + L G+ V G ++ I   +F  T    
Sbjct: 218 G-GGILVLGEITEP-GMVYTPIV--PSQPH-YNLNLQGIAVNGQQLSIDPQVFATTNT-- 270

Query: 370 DGVVMDTGTAVTRLPTPAYEAFRDAFVA---QTGNLPRASGVSIFDTCYNLSGFVSVRVP 426
            G ++D GT +  L   AYE F +  +A   Q+       G   F T +++        P
Sbjct: 271 RGTIIDCGTTLAYLAEEAYEPFVNTIIAAVSQSTQPFMLKGNPCFLTVHSIDEI----FP 326

Query: 427 TVSFYFSGGPVLTLPASNFLI---PVDDAGTFCFAF------APSPSGLSIIGNIQQEGI 477
           +V+ YF G P + L   ++LI     D +  +C  +      A   S ++I+G++  +  
Sbjct: 327 SVTLYFEGAP-MDLKPKDYLIQQLSPDSSPVWCIGWQKSGQQATDSSKMTILGDLVLKDK 385

Query: 478 QISFDGANGFVGF 490
              +D  N  +G+
Sbjct: 386 VFVYDLENQRIGW 398


>gi|30688682|ref|NP_197676.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|110736370|dbj|BAF00154.1| protease-like protein [Arabidopsis thaliana]
 gi|332005704|gb|AED93087.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 493

 Score =  130 bits (326), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 121/404 (29%), Positives = 174/404 (43%), Gaps = 58/404 (14%)

Query: 103 RHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTD------VVSGMDQGSGEY 156
            H+        RD  R   L++ L G         V DF  D      VV       G Y
Sbjct: 38  NHEMELSQLKARDEARHGRLLQSLGG---------VIDFPVDGTFDPFVV-------GLY 81

Query: 157 FVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFSGVSC 211
           + ++ +G+PPR  Y+ +D+GSD++WV C  C+ C + S        FDP  S + S +SC
Sbjct: 82  YTKLRLGTPPRDFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTASPISC 141

Query: 212 SSAVCD---RLENAGC--HAGRCRYEVSYGDGSYTKGTLALETL----TIGRTVVKN--- 259
           S   C    +  ++GC      C Y   YGDGS T G    + L     +G ++V N   
Sbjct: 142 SDQRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLVPNSTA 201

Query: 260 -VAIGCGHKNQGMFV----GAAGLLGLGGGSMSLVGQLGGQ--TGGAFSYCLVSRGTGSS 312
            V  GC     G  V       G+ G G   MS++ QL  Q      FS+CL     G  
Sbjct: 202 PVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCLKGE-NGGG 260

Query: 313 GSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGV 372
           G LV G    P    + PLV  P  P  Y V L  + V G  +PI+  +F  +     G 
Sbjct: 261 GILVLGEIVEP-NMVFTPLV--PSQPH-YNVNLLSISVNGQALPINPSVFSTSN--GQGT 314

Query: 373 VMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYF 432
           ++DTGT +  L   AY  F +A         R   VS  + CY ++  V    P VS  F
Sbjct: 315 IIDTGTTLAYLSEAAYVPFVEAITNAVSQSVRPV-VSKGNQCYVITTSVGDIFPPVSLNF 373

Query: 433 SGGPVLTLPASNFLIPVDDAG---TFCFAFAP-SPSGLSIIGNI 472
           +GG  + L   ++LI  ++ G    +C  F      G++I+G++
Sbjct: 374 AGGASMFLNPQDYLIQQNNVGGTAVWCIGFQRIQNQGITILGDL 417


>gi|51038078|gb|AAT93881.1| hypothetical protein [Oryza sativa Japonica Group]
          Length = 481

 Score =  130 bits (326), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 122/412 (29%), Positives = 174/412 (42%), Gaps = 75/412 (18%)

Query: 152 GSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC----------SQCYKQSDPVFDPA 201
           G  +Y    G+G PP+    V+D+GSD+VW QC  C            C+ Q+ P ++ +
Sbjct: 74  GKTQYIASYGIGDPPQPAEAVVDTGSDLVWTQCSTCRLPAAAAAGGGGCFPQNLPYYNFS 133

Query: 202 DSASFSGVSCSS---AVCD-RLENAGCHAG------RCRYEVSYGDGSYTKGTLALETLT 251
            S +   V C     A+C    E AGC  G       C    SYG G    G L  +  T
Sbjct: 134 LSRTARAVPCDDDDGALCGVAPETAGCARGGGSGDDACVVAASYGAG-VALGVLGTDAFT 192

Query: 252 IGRTVVKNVAIGCGHKNQ---GMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-- 306
              +    +A GC  + +   G   GA+G++GLG G++SLV QL       FSYCL    
Sbjct: 193 FPSSSSVTLAFGCVSQTRISPGALNGASGIIGLGRGALSLVSQLNATE---FSYCLTPYF 249

Query: 307 RGTGSSGSLVFG---------------REALPVGAAWVPLVRNPR-AP--SFYYVGLSGL 348
           R T S   L  G                   PV    VP  +NP+ +P  +FYY+ L GL
Sbjct: 250 RDTVSPSHLFVGDGELAGLSAAAGGGGGGGAPVTT--VPFAKNPKDSPFSTFYYLPLVGL 307

Query: 349 GVGGMRIPISEDLFRLTQMGDD----GVVMDTGTAVTRLPTPAYEAFRDAFVAQ---TGN 401
             G   + +    F L +        G ++D+G+  TRL  PA+ A       Q   +G+
Sbjct: 308 AAGNATVALPAGAFDLREAAPKVWAGGALIDSGSPFTRLVDPAHRALTKELARQLRGSGS 367

Query: 402 L--PRASGVSIFDTCYNL----SGFVSVRVPTVSFYFS----GGPVLTLPASNFLIPVDD 451
           L  P A      + C           +  VP +   F     GG  L +PA  +   V +
Sbjct: 368 LVPPPAKLGGALELCVEAGDDGDSLAAAAVPPLVLRFDDGVGGGRELVIPAEKYWARV-E 426

Query: 452 AGTFCFAFAPSPSG--------LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           A T+C A   S SG         +IIGN  Q+ +++ +D ANG + F P  C
Sbjct: 427 ASTWCMAVVSSASGNATLPTNETTIIGNFMQQDMRVLYDLANGLLSFQPANC 478


>gi|225449446|ref|XP_002283126.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 436

 Score =  130 bits (326), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 112/371 (30%), Positives = 166/371 (44%), Gaps = 43/371 (11%)

Query: 158 VRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVC- 216
           V + VGSPP++  MV+D+GS++ W+ C+     +     VFDP  S+S+S + C+S  C 
Sbjct: 65  VSLTVGSPPQTVTMVLDTGSELSWLHCKKAPNLHS----VFDPLRSSSYSPIPCTSPTCR 120

Query: 217 ----DRLENAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGH----K 267
               D      C   + C   +SY D S  +G LA +T  IG + +     GC       
Sbjct: 121 TRTRDFSIPVSCDKKKLCHAIISYADASSIEGNLASDTFHIGNSAIPATIFGCMDSGFSS 180

Query: 268 NQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAA 327
           N        GL+G+  GS+S V Q+G Q    FSYC+   G  SSG L+FG  +     A
Sbjct: 181 NSDEDSKTTGLIGMNRGSLSFVTQMGLQ---KFSYCI--SGQDSSGILLFGESSFSWLKA 235

Query: 328 --WVPLVR-NPRAPSF----YYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAV 380
             + PLV+ +   P F    Y V L G+ V    + + + ++     G    ++D+GT  
Sbjct: 236 LKYTPLVQISTPLPYFDRVAYTVQLEGIKVANSMLQLPKSVYAPDHTGAGQTMVDSGTQF 295

Query: 381 TRLPTPAYEAFRDAFVAQTG------NLPRASGVSIFDTCYN--LSGFVSVRVPTVSFYF 432
           T L  P Y A ++ FV QT         P        D CY   L+      +PTV+  F
Sbjct: 296 TFLLGPVYTALKNEFVRQTKASLKVLEDPNFVFQGAMDLCYRVPLTRRTLPPLPTVTLMF 355

Query: 433 SGGPVLTLPASNFLIPVDDA-----GTFCFAFAPSP-SGLS--IIGNIQQEGIQISFDGA 484
            G   +++ A   +  V          +CF F  S   G+   IIG+  Q+ + + FD A
Sbjct: 356 RGAE-MSVSAERLMYRVPGVIRGSDSVYCFTFGNSELLGVESYIIGHHHQQNVWMEFDLA 414

Query: 485 NGFVGFGPNVC 495
              VGF    C
Sbjct: 415 KSRVGFAEVRC 425


>gi|21717173|gb|AAM76366.1|AC074196_24 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433291|gb|AAP54829.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
          Length = 413

 Score =  130 bits (326), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 107/363 (29%), Positives = 158/363 (43%), Gaps = 38/363 (10%)

Query: 156 YFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAV 215
           Y     +G+PP+    ++D   ++VW QC  C +C+KQ  PVF P  S++F    C +AV
Sbjct: 62  YVANFTIGTPPQPASAIVDVAGELVWTQCSACRRCFKQDLPVFVPNASSTFKPEPCGTAV 121

Query: 216 CDRLENAGCHAGRCRYEVSYGDGSY----TKGTLALETLTIGRTVVKNVAIGC-GHKNQG 270
           C+ +    C    C Y+   G  +     T G  A +T  IG   V+ +A GC    +  
Sbjct: 122 CESIPTRSCSGDVCSYK---GPPTQLRGNTSGFAATDTFAIGTATVR-LAFGCVVASDID 177

Query: 271 MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVG---AA 327
              G +G +GLG    SLV Q+       FSYCL  R TG S  L  G  A   G    +
Sbjct: 178 TMDGPSGFIGLGRTPWSLVAQMKLTR---FSYCLSPRNTGKSSRLFLGSSAKLAGGESTS 234

Query: 328 WVPLVR-NPRAPS--FYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLP 384
             P ++ +P   S  +Y + L  +  G   I         T      +VM T +  + L 
Sbjct: 235 TAPFIKTSPDDDSHHYYLLSLDAIRAGNTTI--------ATAQSGGILVMHTVSPFSLLV 286

Query: 385 TPAYEAFRDAFVAQTG---NLPRASGVSIFDTCY-NLSGFVSVRVPTVSFYFSGGPVLTL 440
             AY AF+ A     G     P A+    FD C+   +GF     P + F F G   LT+
Sbjct: 287 DSAYRAFKKAVTEAVGGAAAPPMATPPQPFDLCFKKAAGFSRATAPDLVFTFQGAAALTV 346

Query: 441 PASNFLIPV-DDAGTFCFAFAPSP-------SGLSIIGNIQQEGIQISFDGANGFVGFGP 492
           P + +LI V ++  T C A             G+S++G++QQE +   +D     + F P
Sbjct: 347 PPAKYLIDVGEEKDTACAAILSMAWLNRTGLEGVSVLGSLQQEDVHFLYDLKKETLSFEP 406

Query: 493 NVC 495
             C
Sbjct: 407 ADC 409


>gi|147821993|emb|CAN70318.1| hypothetical protein VITISV_016757 [Vitis vinifera]
          Length = 429

 Score =  129 bits (325), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 112/371 (30%), Positives = 166/371 (44%), Gaps = 43/371 (11%)

Query: 158 VRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVC- 216
           V + VGSPP++  MV+D+GS++ W+ C+     +     VFDP  S+S+S + C+S  C 
Sbjct: 58  VSLTVGSPPQTVTMVLDTGSELSWLHCKKAPNLHS----VFDPLRSSSYSPIPCTSPTCR 113

Query: 217 ----DRLENAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGH----K 267
               D      C   + C   +SY D S  +G LA +T  IG + +     GC       
Sbjct: 114 TRTRDFSIPVSCDKKKLCHAIISYADASSIEGNLASDTFHIGNSAIPATIFGCMDSGFSS 173

Query: 268 NQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAA 327
           N        GL+G+  GS+S V Q+G Q    FSYC+   G  SSG L+FG  +     A
Sbjct: 174 NSDEDSKTTGLIGMNRGSLSFVTQMGLQ---KFSYCI--SGQDSSGILLFGESSFSWLKA 228

Query: 328 --WVPLVR-NPRAPSF----YYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAV 380
             + PLV+ +   P F    Y V L G+ V    + + + ++     G    ++D+GT  
Sbjct: 229 LKYTPLVQISTPLPYFDRVAYTVQLEGIKVANSMLQLPKSVYAPDHTGAGQTMVDSGTQF 288

Query: 381 TRLPTPAYEAFRDAFVAQTG------NLPRASGVSIFDTCYN--LSGFVSVRVPTVSFYF 432
           T L  P Y A ++ FV QT         P        D CY   L+      +PTV+  F
Sbjct: 289 TFLLGPVYTALKNEFVRQTKASLKVLEDPNFVFQGAMDLCYRVPLTRRTLPPLPTVTLMF 348

Query: 433 SGGPVLTLPASNFLIPVDDA-----GTFCFAFAPSP-SGLS--IIGNIQQEGIQISFDGA 484
            G   +++ A   +  V          +CF F  S   G+   IIG+  Q+ + + FD A
Sbjct: 349 RGAE-MSVSAERLMYRVPGVIRGSDSVYCFTFGNSELLGVESYIIGHHHQQNVWMEFDLA 407

Query: 485 NGFVGFGPNVC 495
              VGF    C
Sbjct: 408 KSRVGFAEVRC 418


>gi|125543639|gb|EAY89778.1| hypothetical protein OsI_11320 [Oryza sativa Indica Group]
          Length = 488

 Score =  129 bits (325), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 102/316 (32%), Positives = 145/316 (45%), Gaps = 31/316 (9%)

Query: 196 PVFDPADSASFSGVSCSSAVCDRLENAGCHAGR------CRYEVSYGDGSYTKGTLALET 249
           P FD + S++    SC S +C  L  A C   +      C Y   Y D S T G L ++ 
Sbjct: 175 PYFDRSTSSTLLLTSCDSTLCQGLLVASCGNTKFWPNQTCVYTYYYNDKSVTTGLLEVDK 234

Query: 250 LTIGR-TVVKNVAIGCGHKNQGMFV-GAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR 307
            T G    V  VA GCG  N G+F     G+ G G G +SL  QL     G FS+C  + 
Sbjct: 235 FTFGAGASVPGVAFGCGLFNNGVFKSNETGIAGFGRGPLSLPSQL---KVGNFSHCFTAV 291

Query: 308 GTGSSGSLVF---------GREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPIS 358
                 +++          GR A+       PL++N   P+ YY+ L G+ VG  R+P+ 
Sbjct: 292 NGLKQSTVLLDLLADLYKNGRGAV----QSTPLIQNSANPTLYYLSLKGITVGSTRLPVP 347

Query: 359 EDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFD-TCYNL 417
           E  F LT  G  G ++D+GT++T LP   Y+  RD F AQ   LP   G +    TC++ 
Sbjct: 348 ESAFALTN-GTGGTIIDSGTSITSLPPQVYQVVRDEFAAQI-KLPVVPGNATGPYTCFSA 405

Query: 418 SGFVSVRVPTVSFYFSGGPVLTLPASNFLIPV-DDAGT--FCFAFAPSPSGLSIIGNIQQ 474
                  VP +  +F G   + LP  N++  V DDAG    C A        + IGN QQ
Sbjct: 406 PSQAKPDVPKLVLHFEGA-TMDLPRENYVFEVPDDAGNSMICLAINELGDERATIGNFQQ 464

Query: 475 EGIQISFDGANGFVGF 490
           + + + +D  N  + F
Sbjct: 465 QNMHVLYDLQNNMLSF 480



 Score = 70.1 bits (170), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 49/139 (35%), Positives = 69/139 (49%), Gaps = 8/139 (5%)

Query: 344 GLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLP 403
           G  G+ VG  R+P+ E  F LT  G  G ++D+GT++T LP   Y+  RD F AQ   LP
Sbjct: 38  GRPGITVGSTRLPVPESAFALTN-GTGGTIIDSGTSITSLPPQVYQVVRDEFAAQI-KLP 95

Query: 404 RASGVSIFD-TCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPV-DDAGT--FCFAF 459
              G +    TC++        VP +  +F G   + LP  N++  V DDAG    C A 
Sbjct: 96  VVPGNATGPYTCFSAPSQAKPDVPKLVLHFEGA-TMDLPRENYVFEVPDDAGNSIICLAI 154

Query: 460 APSPSGLSIIGNIQQEGIQ 478
                  +IIGN QQ+ + 
Sbjct: 155 NKG-DETTIIGNFQQQNMH 172


>gi|77555282|gb|ABA98078.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
          Length = 409

 Score =  129 bits (325), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 94/264 (35%), Positives = 137/264 (51%), Gaps = 14/264 (5%)

Query: 236 GDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQ 295
           G  + T G LA +T T G T V  V  GC   + G F GA+G++G+G G++SL+ QL   
Sbjct: 124 GSAANTSGYLATDTFTFGATAVPGVVFGCSDASYGDFAGASGVIGIGRGNLSLISQL--- 180

Query: 296 TGGAFSYCLV---SRGTGSSGSLV-FGREALPVGAAW--VPLVRNPRAPSFYYVGLSGLG 349
             G FSY L+   +   GS+ S++ FG +A+P        PL+ +   P FYYV L+G+ 
Sbjct: 181 QFGKFSYQLLAPEATDDGSADSVIRFGDDAVPKTKRGRSTPLLSSTLYPDFYYVNLTGVR 240

Query: 350 VGGMRI-PISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGV 408
           V G R+  I    F L   G  GV++ + T VT L   AY+  R A  ++ G LP  +G 
Sbjct: 241 VDGNRLDAIPAGTFDLRANGTGGVILSSTTPVTYLEQAAYDVVRAAVASRIG-LPAVNGS 299

Query: 409 SI--FDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGL 466
           +    D CYN S    V+VP ++  F GG  + L A+N+    +D G  C    PS  G 
Sbjct: 300 AALELDLCYNASSMAKVKVPKLTLVFDGGADMDLSAANYFYIDNDTGLECLTMLPSQGG- 358

Query: 467 SIIGNIQQEGIQISFDGANGFVGF 490
           S++G + Q G  + +D   G + F
Sbjct: 359 SVLGTLLQTGTNMIYDVDAGRLTF 382


>gi|297829808|ref|XP_002882786.1| hypothetical protein ARALYDRAFT_478632 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297328626|gb|EFH59045.1| hypothetical protein ARALYDRAFT_478632 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 449

 Score =  129 bits (325), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 119/456 (26%), Positives = 193/456 (42%), Gaps = 65/456 (14%)

Query: 73  SSDEARWNLELVHRDKMSSS--SNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGG 130
           S+++    L+L HRD +  +  S   + +   + +HS  +R            R+  GG 
Sbjct: 25  STEDTAVRLKLAHRDTLWPNPLSRIEDIIGADQKRHSLISRK-----------RKFKGG- 72

Query: 131 ADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQC 190
                        D+ SG+D G+ +YF  + VG+P +   +V+D+GS++ WV C+     
Sbjct: 73  ----------VKMDLGSGIDYGTAQYFTEVRVGTPAKKFRVVVDTGSELTWVNCR----- 117

Query: 191 YK-------QSDPVFDPADSASFSGVSCSSAVCD-------RLENAGCHAGRCRYEVSYG 236
           Y+       ++  VF   +S SF  V C +  C         L      +  C Y+  Y 
Sbjct: 118 YRGRGKGKVKNRRVFRAEESKSFKTVGCFTQTCKVDLMNLFSLSTCPTPSTPCSYDYRYA 177

Query: 237 DGSYTKGTLALETLTIGRT-----VVKNVAIGCGHKNQGMFVGAA-GLLGLGGGSMSLVG 290
           DGS  +G  A ET+T+G T      ++ + +GC     G     A G+LGL     S   
Sbjct: 178 DGSAAQGVFAKETITVGLTNGRKARLRGLLVGCSSSFSGQSFQGADGVLGLAFSDFSFTS 237

Query: 291 QLGGQTGGAFSYCLVSRGTGS--SGSLVFGREALPVGAAWVPLVRNP----RAPSFYYVG 344
                 G   SYCLV   +    S  L+FG  +        P    P      P FY + 
Sbjct: 238 TATSLFGAKLSYCLVDHLSNKNISNYLIFGYSSSSTSTKTAPGRTTPLDLTLIPPFYAIN 297

Query: 345 LSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPR 404
           + G+ +G   + I   ++  T  G  G ++D+GT++T L   AY+            L R
Sbjct: 298 IIGISIGDDMLDIPTQVWDATTGG--GTILDSGTSLTLLAEAAYKPVVTGLARYLVELKR 355

Query: 405 ASGVSI-FDTCY-NLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDA-GTFCFAF-- 459
                I  + C+ + SGF   ++P ++F+  GG        ++L  VD A G  C  F  
Sbjct: 356 VKPEGIPIEYCFSSTSGFNESKLPQLTFHLKGGARFEPHRKSYL--VDAAPGVKCLGFMS 413

Query: 460 APSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           A +P+  +++GNI Q+     FD     + F P+ C
Sbjct: 414 AGTPA-TNVVGNIMQQNYLWEFDLMASTLSFAPSTC 448


>gi|220702733|gb|ACL81165.1| aspartyl protease [Mirabilis jalapa]
          Length = 499

 Score =  129 bits (325), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 116/402 (28%), Positives = 167/402 (41%), Gaps = 66/402 (16%)

Query: 155 EYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQP--CSQCYKQSDP-VFDPADSASFSGVSC 211
           +Y +   + S   S YM  D+GSDIVW  C P  C  C  + +P    P + +  S +SC
Sbjct: 93  DYTLTFSINSQTLSVYM--DTGSDIVWFPCSPFECILCEGKFEPGTLTPLNVSKSSLISC 150

Query: 212 SSAVC--------------------DRLENAGCHAGRC-RYEVSYGDGSYT----KGTLA 246
            S  C                    D +E + C    C  +  +YGDGS      K  L 
Sbjct: 151 KSRACSTAHNSPSTSDLCAIAKCPLDEIETSDCSNYHCPSFYYAYGDGSLIAKLHKHNLI 210

Query: 247 LETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQT---GGAFSYC 303
           + + +     +K+   GC H   G  +G AG    G GS+SL  QL   +   G  FSYC
Sbjct: 211 MPSTSNKPFSLKDFTFGCAHSALGEPIGVAGF---GFGSLSLPAQLANLSPDLGNQFSYC 267

Query: 304 LVSRGTGSS-----GSLVFGREALP-----VGAAWVPLVRNPRAPSFYYVGLSGLGVGGM 353
           LVS    S+       L+ G+             + P++ NP+ P FY V +  + VG  
Sbjct: 268 LVSHSFDSTKLHHPSPLILGKVKERDFDEITQFVYTPMLDNPKHPYFYSVSMEAISVGSS 327

Query: 354 RIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNL-PRASGVSI-- 410
           R+     L R+ + G+ GVV+D+GT  T LPT  Y +       + G +  RAS      
Sbjct: 328 RVRAPNALIRIDRDGNGGVVVDSGTTYTMLPTGFYNSVATELDRRVGRVFKRASETESKT 387

Query: 411 -FDTCYNLSG----FVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDA-------GTFCFA 458
               CY L G     + + VP ++F+F G   + LP  N+     D           C  
Sbjct: 388 GLSPCYYLEGNGVERLGLVVPRLAFHFGGNYSVVLPRRNYFYEFLDGEDEKKGRKVGCLM 447

Query: 459 FA----PSPSGL-SIIGNIQQEGIQISFDGANGFVGFGPNVC 495
                  S  G  + +GN QQ+G Q+ +D     VGF P  C
Sbjct: 448 LMDGGDESEGGPGATLGNYQQQGFQVVYDLEERRVGFAPRKC 489


>gi|225464832|ref|XP_002272243.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 467

 Score =  129 bits (325), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 112/391 (28%), Positives = 171/391 (43%), Gaps = 61/391 (15%)

Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQP---CSQC-YKQSDP---VFDPADSASF 206
           G Y + +  G+PP++  +++D+GSD+VW  C     C  C +  S+P   +F P  S+S 
Sbjct: 88  GAYSIPLSFGTPPQTLPLIMDTGSDLVWFPCTHRYVCRNCSFSTSNPSSNIFIPKSSSSS 147

Query: 207 SGVSCSSAVCDRLENAGCHAGRCR---------------YEVSYGDGSYTKGTLALETLT 251
             + C +  C  +  +   + RCR               Y V YG G  T G +  ETL 
Sbjct: 148 KVLGCVNPKCGWIHGSKVQS-RCRDCEPTSPNCTQICPPYLVFYGSG-ITGGIMLSETLD 205

Query: 252 IGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR---G 308
           +    V N  +GC   +       AG+ G G G  SL  QLG +    FSYCL+SR    
Sbjct: 206 LPGKGVPNFIVGCSVLSTSQ---PAGISGFGRGPPSLPSQLGLK---KFSYCLLSRRYDD 259

Query: 309 TGSSGSLVFGREA----LPVGAAWVPLVRNPRAPS------FYYVGLSGLGVGGMRIPIS 358
           T  S SLV   E+       G ++ P V+NP+         +YY+GL  + VGG  + I 
Sbjct: 260 TTESSSLVLDGESDSGEKTAGLSYTPFVQNPKVAGKHAFSVYYYLGLRHITVGGKHVKIP 319

Query: 359 EDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFV--AQTGNLPRASGVSIFDTCYN 416
                    GD G ++D+GT  T +    +E     F    Q+       G++    C+N
Sbjct: 320 YKYLIPGADGDGGTIIDSGTTFTYMKGEIFELVAAEFEKQVQSKRATEVEGITGLRPCFN 379

Query: 417 LSGFVSVRVPTVSFYFSGGPVLTLPASNFL------------IPVDDAGTFCFAFAPSPS 464
           +SG  +   P ++  F GG  + LP +N++            I  D A    F+  P+  
Sbjct: 380 ISGLNTPSFPELTLKFRGGAEMELPLANYVAFLGGDDVVCLTIVTDGAAGKEFSGGPA-- 437

Query: 465 GLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
              I+GN QQ+   + +D  N  +GF    C
Sbjct: 438 --IILGNFQQQNFYVEYDLRNERLGFRQQSC 466


>gi|326503602|dbj|BAJ86307.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 461

 Score =  129 bits (325), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 111/373 (29%), Positives = 163/373 (43%), Gaps = 44/373 (11%)

Query: 158 VRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVC- 216
           V + VG+PP++  MV+D+GS++ W+ C P     K S   F P  S++F+ V C+SA C 
Sbjct: 87  VSLAVGTPPQNVTMVLDTGSELSWLLCAPAGARNKFSAMSFRPRASSTFAAVPCASAQCR 146

Query: 217 --DRLENAGCH--AGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGC---GHKNQ 269
             D      C   + RC   +SY DGS + G LA +   +G       A GC      + 
Sbjct: 147 SRDLPSPPACDGASSRCSVSLSYADGSSSDGALATDVFAVGSGPPLRAAFGCMSSAFDSS 206

Query: 270 GMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWV 329
              V +AGLLG+  G++S V Q   +    FSYC+  R    +G L+ G   LP    ++
Sbjct: 207 PDGVASAGLLGMNRGALSFVSQASTRR---FSYCISDR--DDAGVLLLGHSDLPT---FL 258

Query: 330 PLVRNPR------APSF----YYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTA 379
           PL   P        P F    Y V L G+ VGG  +PI   +      G    ++D+GT 
Sbjct: 259 PLNYTPMYQPALPLPYFDRVAYSVQLLGIRVGGKHLPIPASVLAPDHTGAGQTMVDSGTQ 318

Query: 380 VTRLPTPAYEAFRDAFVAQTGNL------PRASGVSIFDTCYNLS---GFVSVRVPTVSF 430
            T L   AY A +  F  Q   L      P  +    FDTC+ +       + R+P V+ 
Sbjct: 319 FTFLLGDAYSALKAEFTRQARPLLPALDDPSFAFQEAFDTCFRVPQGRSPPTARLPGVTL 378

Query: 431 YFSGGPVLTLPASNFLIPVDDA-----GTFCFAFAPS---PSGLSIIGNIQQEGIQISFD 482
            F+G   + +     L  V        G +C  F  +   P    +IG+  Q  + + +D
Sbjct: 379 LFNGA-EMAVAGDRLLYKVPGERRGGDGVWCLTFGNADMVPIMAYVIGHHHQMNVWVEYD 437

Query: 483 GANGFVGFGPNVC 495
              G VG  P  C
Sbjct: 438 LERGRVGLAPVRC 450


>gi|224102847|ref|XP_002312826.1| predicted protein [Populus trichocarpa]
 gi|222849234|gb|EEE86781.1| predicted protein [Populus trichocarpa]
          Length = 445

 Score =  129 bits (325), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 110/389 (28%), Positives = 162/389 (41%), Gaps = 55/389 (14%)

Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQP---CSQCYKQSDPV------FDPADSA 204
           G Y V +  G+PP++   ++D+GSDIVW  C     C  C   S         F P +S+
Sbjct: 65  GGYSVSLSFGTPPQTLSFIMDTGSDIVWFPCTSHYLCKHCSFSSSSPSSRIQPFIPKESS 124

Query: 205 SFSGVSCSSAVCDRLENAG------CHAGRC------RYEVSYGDGSYTKGTLAL-ETLT 251
           S   + C +  C  + ++       C    C       Y + YG G  T G +AL ETL 
Sbjct: 125 SSKLLGCKNPKCSWIHHSNINCDQDCSIKSCLNQTCPPYMIFYGSG--TTGGVALSETLH 182

Query: 252 IGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR---- 307
           +      N  +GC   +       AG+ G G G  SL  QLG    G FSYCL+S     
Sbjct: 183 LHSLSKPNFLVGCSVFSSHQ---PAGIAGFGRGLSSLPSQLGL---GKFSYCLLSHRFDD 236

Query: 308 GTGSSGSLVFGREALPV-----GAAWVPLVRNPRAPS------FYYVGLSGLGVGGMRIP 356
            T  S SLV   E L          + P V+NP+  +      +YY+GL  + VGG  + 
Sbjct: 237 DTKKSSSLVLDMEQLDSDKKTNALVYTPFVKNPKVDNKSSFSVYYYLGLRRITVGGHHVK 296

Query: 357 ISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSI---FDT 413
           +        + G+ GV++D+GT  T +   A+E   D F+ Q  +  R   +        
Sbjct: 297 VPYKYLSPGEDGNGGVIIDSGTTFTFMAREAFEPLSDEFIRQIKDYRRVKEIEDAIGLRP 356

Query: 414 CYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLS------ 467
           C+N+S   +V  P +  YF GG  + LP  N+   V              +G        
Sbjct: 357 CFNVSDAKTVSFPELRLYFKGGADVALPVENYFAFVGGEVACLTVVTDGVAGPERVGGPG 416

Query: 468 -IIGNIQQEGIQISFDGANGFVGFGPNVC 495
            I+GN Q +   + +D  N  +GF    C
Sbjct: 417 MILGNFQMQNFYVEYDLRNERLGFKQEKC 445


>gi|242094480|ref|XP_002437730.1| hypothetical protein SORBIDRAFT_10g001450 [Sorghum bicolor]
 gi|241915953|gb|EER89097.1| hypothetical protein SORBIDRAFT_10g001450 [Sorghum bicolor]
          Length = 507

 Score =  129 bits (325), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 101/339 (29%), Positives = 153/339 (45%), Gaps = 38/339 (11%)

Query: 169 QYMVIDSGSDIVWVQCQPCSQCYKQSDPV--FDPADSASFSGVSCSSAVCD---RLENAG 223
           Q +V+D+ SD+ WVQC P +           +DPA S+++  ++C+SA C    RL    
Sbjct: 124 QTVVLDTASDVPWVQCHPLASSATTDSSSSSYDPARSSTYYALACNSAACTELGRLYRGA 183

Query: 224 CHAGRCRYEVSYGD--------GSYTKGTLALETLTIGRTVVKNVAIGCGH--KNQG--- 270
           C   +C+Y V            G+Y    L L T         +   GC H    QG   
Sbjct: 184 CVNNQCQYRVPIPSSPASSSSSGTYGSDLLKL-TADPADGASMSFKFGCSHGEAKQGGEG 242

Query: 271 -MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPV----G 325
            +    AG++ LGGG  SLV Q     G AFSYC+ +  +   G  V G     +    G
Sbjct: 243 SIDNATAGIMALGGGPESLVSQNAAMYGSAFSYCIPATESRRPGFFVLGGGVGDLSGAGG 302

Query: 326 AAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPT 385
            A  P++R  R P+ Y V L  + V G ++ ++  +F        G V+D+ TA+TRLP 
Sbjct: 303 YAVTPMLRYARVPTLYRVRLLAIAVDGQQLNVTPSVFA------SGSVLDSRTAITRLPP 356

Query: 386 PAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNF 445
            AY+A R+AF ++      A      DTCY+ +G   V VP V+    G  V+ L     
Sbjct: 357 TAYQALREAFRSRMAMYREAPPQGNLDTCYDFAGAFLVMVPRVALLLDGNAVVALDRQGI 416

Query: 446 LIPVDDAGTFCFAFAPSPSGL--SIIGNIQQEGIQISFD 482
           L         C  F  +       I+GN+QQ+ +++ ++
Sbjct: 417 LF------HDCLVFTSNTDDRMPGILGNVQQQTMEVLYN 449


>gi|413937239|gb|AFW71790.1| hypothetical protein ZEAMMB73_638381 [Zea mays]
          Length = 537

 Score =  129 bits (324), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 88/246 (35%), Positives = 117/246 (47%), Gaps = 11/246 (4%)

Query: 256 VVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGS 314
           VV     GC     G  V   GL+G G G +S   Q     G  FSYCL S + +  S +
Sbjct: 295 VVAAYTFGCLRVVTGGSVPPQGLVGFGCGPLSFPSQNKDVYGFVFSYCLPSYKSSNFSST 354

Query: 315 LVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVM 374
           L  G    P      PL+ NP  PS YYV + G+ VGG  + +             G ++
Sbjct: 355 LRLGPAGQPKRIKMTPLLSNPHRPSLYYVNMVGIHVGGRPMLVPASALAFDPASGRGTIV 414

Query: 375 DTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSG 434
           D GT  TRL  P Y A RD F ++    P    +  FDTCYN    V++ VPTV+F F G
Sbjct: 415 DAGTMFTRLSAPVYAAVRDVFRSRV-RAPVTGPLGGFDTCYN----VTISVPTVTFSFDG 469

Query: 435 GPVLTLPASNFLIPVDDAGTFCFAFAPSPSG-----LSIIGNIQQEGIQISFDGANGFVG 489
              +TLP  N +I     G  C A A  PS      L+++ ++QQ+  ++ FD ANG VG
Sbjct: 470 RVSVTLPEENVVIRSSSDGIACLAMAAGPSDGVDAVLNVLASMQQQNHRVLFDVANGRVG 529

Query: 490 FGPNVC 495
           F   +C
Sbjct: 530 FSRELC 535


>gi|357476865|ref|XP_003608718.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355509773|gb|AES90915.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 482

 Score =  129 bits (324), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 117/400 (29%), Positives = 165/400 (41%), Gaps = 74/400 (18%)

Query: 160 IGVGSPPRSQYMVIDSGSDIVWVQCQP--CSQCYKQSDPVFDPADSASFS---GVSCSSA 214
           +G  S P + YM  D+GSD+VW  C P  C  C  +     DP+   + S    +SC+S 
Sbjct: 81  LGPHSQPITLYM--DTGSDLVWFPCTPFNCILCELKPKLTSDPSPPTNISHSTPISCNSH 138

Query: 215 VC--------------------DRLENAGCHAGRC-RYEVSYGDGSYTKGTLALETLTIG 253
            C                    D +E   C +  C  +  +YGDGS    +L  +TL++ 
Sbjct: 139 ACSVAHSSTPSSDLCTMAHCPLDSIETKDCGSFHCPPFYYAYGDGSLI-ASLYRDTLSLS 197

Query: 254 RTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLG---GQTGGAFSYCLVSRGTG 310
              + N   GC H     F    G+ G G G +SL  QL     Q G  FSYCLVS    
Sbjct: 198 TLQLTNFTFGCAHTT---FSEPTGVAGFGRGLLSLPAQLATHSPQLGNRFSYCLVSHSFR 254

Query: 311 SS-----GSLVFGREALP--------VGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPI 357
           S        L+ GR            V   +  ++ NP+   FY VGL G+ VG   +P 
Sbjct: 255 SERIRKPSPLILGRYNDEKQSNGDEVVEFVYTSMLENPKHSYFYTVGLKGISVGKKTVPA 314

Query: 358 SEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAF--VAQTGN--LPRASGVSIFDT 413
            + L R+ + GD GVV+D+GT  T LP   Y +  + F   A+  N   P     +    
Sbjct: 315 PKILRRVNKKGDGGVVVDSGTTFTMLPEKFYNSVVEGFDRRARKSNRRAPEIEQKTGLSP 374

Query: 414 CYNLSGFVSVRVPTVSFYFSG-GPVLTLPASNFLIPVDDAG--------TFCFAF----- 459
           CY L+   +  VP V+  F G    + LP  N+     D G          C  F     
Sbjct: 375 CYYLN--TAAIVPAVTLRFVGMNSSVVLPRKNYFYEFMDGGDGVRRKERVGCLMFMNGGD 432

Query: 460 ----APSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
               +  P G  ++GN QQ+G ++ +D     VGF    C
Sbjct: 433 EAEMSGGPGG--VLGNYQQQGFEVEYDLEKKRVGFARRKC 470


>gi|357142295|ref|XP_003572524.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
           distachyon]
          Length = 494

 Score =  129 bits (324), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 111/376 (29%), Positives = 172/376 (45%), Gaps = 44/376 (11%)

Query: 147 SGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPA 201
           +G+   +G YF +IG+G+P +  Y+ +D+GSDI+WV C  C  C ++S       ++DP 
Sbjct: 80  NGIPTDTGLYFTQIGIGTPSKGYYVQVDTGSDILWVNCISCDSCPRKSGLGIDLTLYDPT 139

Query: 202 DSASFSGVSCSSAVCDRLENAG----CHAGR-CRYEVSYGDGSYTKGTLALETLTI---- 252
            SAS   V+C    C    N G    C A   C+Y ++YGDGS T G    + L      
Sbjct: 140 ASASSKTVTCGQEFCATATNGGVPPSCAANSPCQYSITYGDGSSTTGFFVADFLQYDQVS 199

Query: 253 --GRTVVKN--VAIGCGHKNQGMF----VGAAGLLGLGGGSMSLVGQL--GGQTGGAFSY 302
             G+T + N  V  GCG K  G      V   G+LG G  + S++ QL   G+    FS+
Sbjct: 200 GDGQTNLANASVTFGCGAKIGGALGSSNVALDGILGFGQANSSMLSQLTSAGKVTKIFSH 259

Query: 303 CLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLF 362
           CL +   G  G    G    P      PLV  P  P  Y V L  + VGG  + +  ++F
Sbjct: 260 CLDTVNGG--GIFAIGNVVQP-KVKTTPLV--PGMPH-YNVVLKTIDVGGSTLQLPTNIF 313

Query: 363 RLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFD-TCYNLSGFV 421
            +   G  G ++D+GT +  LP   Y+A   A  +   N P  +  ++ D  C+  SG V
Sbjct: 314 DIGG-GSRGTIIDSGTTLAYLPEVVYKAVLSAVFS---NHPDVTLKNVQDFLCFQYSGSV 369

Query: 422 SVRVPTVSFYFSGG-PVLTLPASNFLIPVDDAGTFCFAF------APSPSGLSIIGNIQQ 474
               P V+F+F G  P++  P        +D   +C  F      +     + ++G++  
Sbjct: 370 DNGFPEVTFHFDGDLPLVVYPHDYLFQNTEDV--YCVGFQSGGVQSKDGKDMVLLGDLAL 427

Query: 475 EGIQISFDGANGFVGF 490
               + +D  N  +G+
Sbjct: 428 SNKLVVYDLENQVIGW 443


>gi|356513737|ref|XP_003525567.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial [Glycine
           max]
          Length = 455

 Score =  129 bits (324), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 118/395 (29%), Positives = 161/395 (40%), Gaps = 72/395 (18%)

Query: 166 PRSQ----YMVIDSGSDIVWVQCQP--CSQCYKQSDPVFDPADSASFSGVSCSSAVCDRL 219
           PR+Q     + +D+GSD+VW  C P  C  C  + +    P ++     VSC S  C   
Sbjct: 56  PRAQAQPITLYMDTGSDLVWFPCAPFKCILCEGKPN-ASPPVNTTRSVAVSCKSPACSAA 114

Query: 220 ENAG-----CHAGRCRYE----------------VSYGDGSYTKGTLALETLTIGRTVVK 258
            N       C A RC  E                 +YGDGS     L  +TL++    ++
Sbjct: 115 HNLASPSDLCAAARCPLESIETSDCANFKCPPFYYAYGDGSLI-ARLYRDTLSLSSLFLR 173

Query: 259 NVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLG---GQTGGAFSYCLVSRGTGSS--- 312
           N   GC +          G+ G G G +SL  QL     Q G  FSYCLVS    S    
Sbjct: 174 NFTFGCAYTT---LAEPTGVAGFGRGLLSLPAQLATLSPQLGNRFSYCLVSHSFDSERVR 230

Query: 313 --GSLVFGR-----EALPVGA-----AWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISED 360
               L+ GR     E   VG       + P++ NP+ P FY VGL G+ VG   +P  E 
Sbjct: 231 KPSPLILGRYEEEEEEEKVGGGVAEFVYTPMLENPKHPYFYTVGLIGISVGKRIVPAPEM 290

Query: 361 LFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNL-PRASGVSI---FDTCYN 416
           L R+   GD GVV+D+GT  T LP   Y +  D F    G +  RA  +        CY 
Sbjct: 291 LRRVNNRGDGGVVVDSGTTFTMLPAGFYNSVVDEFDRGVGRVNERARKIEEKTGLAPCYY 350

Query: 417 LSGFVSVRVPTVSFYFSGG-PVLTLPASNFLIPVDD----------AGTFCFAFAPSPSG 465
           L+      VP ++  F+GG   + LP  N+     D           G          + 
Sbjct: 351 LNSVAE--VPVLTLRFAGGNSSVVLPRKNYFYEFLDGRDAAKGKRRVGCLMLMNGGDEAE 408

Query: 466 LS-----IIGNIQQEGIQISFDGANGFVGFGPNVC 495
           LS      +GN QQ+G ++ +D     VGF    C
Sbjct: 409 LSGGPGATLGNYQQQGFEVEYDLEEKRVGFARRQC 443


>gi|242050744|ref|XP_002463116.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
 gi|241926493|gb|EER99637.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
          Length = 632

 Score =  129 bits (324), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 113/405 (27%), Positives = 179/405 (44%), Gaps = 44/405 (10%)

Query: 111 RMQRDVKRVATLVRRLSGGGADA-AKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQ 169
           R   +  R+A   RR  G GA   A+  + D   D+++     +G Y  R+ +G+PP+  
Sbjct: 51  RSYPNASRLAASSRRGLGDGAHPNARMRLHD---DLLT-----NGYYTTRLYIGTPPQEF 102

Query: 170 YMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS-SAVCDRLENAGCHAGR 228
            +++DSGS + +V C  C QC    DP F P  S+S+S V C+    CD          +
Sbjct: 103 ALIVDSGSTVTYVPCASCEQCGNHQDPRFQPDLSSSYSPVKCNVDCTCD------SDKKQ 156

Query: 229 CRYEVSYGDGSYTKGTLALETLTIGRT---VVKNVAIGCGHKNQGMFVG--AAGLLGLGG 283
           C YE  Y + S + G L  + ++ GR      +    GC +   G      A G++GLG 
Sbjct: 157 CTYERQYAEMSSSSGVLGEDIVSFGRESELKPQRAVFGCENSETGDLFSQHADGIMGLGR 216

Query: 284 GSMSLVGQL--GGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFY 341
           G +S++ QL   G    +FS C      G  G++V G   +P  +  V    +P    +Y
Sbjct: 217 GQLSIMDQLVEKGVISDSFSLCYGGMDIG-GGAMVLG--GVPAPSDMVFSHSDPLRSPYY 273

Query: 342 YVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGN 401
            + L  + V G  + +   +F        G V+D+GT    LP  A+ AF+DA  ++  +
Sbjct: 274 NIELKEIHVAGKALRVDSRVFN----SKHGTVLDSGTTYAYLPEQAFVAFKDAVTSKVHS 329

Query: 402 LPRASG--VSIFDTCY-----NLSGFVSVRVPTVSFYFSGGPVLTLPASNFLI---PVDD 451
           L +  G   +  D C+     N+S    V  P V   F  G  L+L   N+L     VD 
Sbjct: 330 LKKIRGPDPNYKDICFAGAGRNVSKLHEV-FPDVDMVFGNGQKLSLTPENYLFRHSKVD- 387

Query: 452 AGTFCF-AFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
            G +C   F       +++G I      +++D  N  +GF    C
Sbjct: 388 -GAYCLGVFQNGKDPTTLLGGIIVRNTLVTYDRHNEKIGFWKTNC 431


>gi|357440289|ref|XP_003590422.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355479470|gb|AES60673.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 498

 Score =  129 bits (324), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 117/416 (28%), Positives = 179/416 (43%), Gaps = 44/416 (10%)

Query: 104 HQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVG 163
           H+        RD  R   ++R   GG  D       D  T        G G Y  ++ +G
Sbjct: 39  HRVEIDTLRARDRVRHGRILRASVGGVVDFRVQGSSDPST-------LGYGLYTTKVKMG 91

Query: 164 SPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFSGVSCSSAVC-D 217
           +PPR   + ID+GSDI+W+ C  CS C K S        FD   S++ + V CS  +C  
Sbjct: 92  TPPREFTVQIDTGSDILWINCNTCSNCPKSSGLGIELNFFDTVGSSTAALVPCSDPMCAS 151

Query: 218 RLENAGC----HAGRCRYEVSYGDGSYTKGTLALET----LTIGRTVVKNVA------IG 263
            ++ A         +C Y   Y DGS T G    +     + +G++   NVA       G
Sbjct: 152 AIQGAAAQCSPQVNQCSYTFQYEDGSGTSGVYVSDAMYFDMILGQSTPANVASSATIVFG 211

Query: 264 CGHKNQGMFV----GAAGLLGLGGGSMSLVGQLG--GQTGGAFSYCLVSRGTGSSGSLVF 317
           C     G          G+LG G G +S+V QL   G T   FS+CL  +G G+ G ++ 
Sbjct: 212 CSTYQSGDLTKTDKAVDGILGFGPGELSVVSQLSSRGITPKVFSHCL--KGDGNGGGILV 269

Query: 318 GREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTG 377
             E L     + PLV  P  P  Y + L  + V G  + I+  +F  +     G ++D+G
Sbjct: 270 LGEILEPSIVYSPLV--PSQPH-YNLNLQSIAVNGQVLSINPAVFATSD--KRGTIIDSG 324

Query: 378 TAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPV 437
           T ++ L   AY+   +A           S +S    CY +   +    PTVSF F GG  
Sbjct: 325 TTLSYLVQEAYDPLVNAVDTAVSQFA-TSFISKGSQCYLVLTSIDDSFPTVSFNFEGGAS 383

Query: 438 LTLPASNFLIP---VDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGF 490
           + L  S +L+     D A  +C  F     G++I+G++  +   + +D A   +G+
Sbjct: 384 MDLKPSQYLLNRGFQDGAKMWCIGFQKVQEGVTILGDLVLKDKIVVYDLARQQIGW 439


>gi|359482287|ref|XP_002263129.2| PREDICTED: aspartic proteinase-like protein 2 isoform 2 [Vitis
           vinifera]
 gi|297740017|emb|CBI30199.3| unnamed protein product [Vitis vinifera]
          Length = 502

 Score =  129 bits (323), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 124/410 (30%), Positives = 185/410 (45%), Gaps = 45/410 (10%)

Query: 114 RDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVI 173
           RD  R   L+R + GG  D   +   D             G YF ++ +GSPPR   + I
Sbjct: 53  RDQARHGRLLRGVVGGVVDFTVYGTSD---------PYLVGLYFTKVKLGSPPREFNVQI 103

Query: 174 DSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFSGVSCSSAVCDRLEN---AGC- 224
           D+GSDI+WV C  C+ C + S        FDP+ S++ S VSCS  +C  L     A C 
Sbjct: 104 DTGSDILWVTCNSCNDCPRTSGLGIELSFFDPSSSSTTSLVSCSHPICTSLVQTTAAECS 163

Query: 225 -HAGRCRYEVSYGDGSYTKGTLALETL----TIGRTVVKN----VAIGCGHKNQGMFV-- 273
             + +C Y   YGDGS T G    + L     +G +++ N    +  GC     G     
Sbjct: 164 PQSNQCSYSFHYGDGSGTTGYYVSDMLYFDTVLGDSLIANSSASIVFGCSTYQSGDLTKV 223

Query: 274 --GAAGLLGLGGGSMSLVGQLG--GQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWV 329
                G+ G G   +S+V QL   G T   FS+CL   G G  G LV G E L     + 
Sbjct: 224 DKAIDGIFGFGQQDLSVVSQLSSLGITPKVFSHCLKGEGDG-GGKLVLG-EILEPNIIYS 281

Query: 330 PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYE 389
           PLV    + S Y + L  + V G  +PI   +F  +   + G ++D+GT +T L   AY+
Sbjct: 282 PLV---PSQSHYNLNLQSISVNGQLLPIDPAVFATSN--NQGTIVDSGTTLTYLVETAYD 336

Query: 390 AFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPV 449
            F  A  A   +      +S  + CY +S  V    P VS  F+GG  + L    +L+ +
Sbjct: 337 PFVSAITATVSS-STTPVLSKGNQCYLVSTSVDEIFPPVSLNFAGGASMVLKPGEYLMHL 395

Query: 450 ---DDAGTFCFAFAP-SPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
              D A  +C  F   +  G++I+G++  +     +D A+  +G+    C
Sbjct: 396 GFSDGAAMWCIGFQKVAEPGITILGDLVLKDKIFVYDLAHQRIGWANYDC 445


>gi|297806153|ref|XP_002870960.1| hypothetical protein ARALYDRAFT_908082 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297316797|gb|EFH47219.1| hypothetical protein ARALYDRAFT_908082 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 453

 Score =  129 bits (323), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 118/375 (31%), Positives = 167/375 (44%), Gaps = 58/375 (15%)

Query: 165 PPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPV--FDPADSASFSGVSCSSAVC-----D 217
           PP++  MVID+GS++ W++C   S      +PV  FDP  S+S+S + CSS  C     D
Sbjct: 82  PPQNISMVIDTGSELSWLRCNRSSN----PNPVNNFDPTRSSSYSPIPCSSPTCRTRTRD 137

Query: 218 RLENAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGR-TVVKNVAIGCGHKNQG----M 271
            L  A C + + C   +SY D S ++G LA E    G  T   N+  GC     G     
Sbjct: 138 FLIPASCDSDKLCHATLSYADASSSEGNLAAEIFHFGNSTNDSNLIFGCMGSVSGSDPEE 197

Query: 272 FVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALP--VGAAWV 329
                GLLG+  GS+S + Q+G      FSYC +S      G L+ G           + 
Sbjct: 198 DTKTTGLLGMNRGSLSFISQMGFP---KFSYC-ISGTDDFPGFLLLGDSNFTWLTPLNYT 253

Query: 330 PLVR-NPRAPSF----YYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLP 384
           PL+R +   P F    Y V L+G+ V G  +PI + +      G    ++D+GT  T L 
Sbjct: 254 PLIRISTPLPYFDRVAYTVQLTGIKVNGKLLPIPKSVLLPDHTGAGQTMVDSGTQFTFLL 313

Query: 385 TPAYEAFRDAFVAQTGNL------PRASGVSIFDTCYNLSGF-----VSVRVPTVSFYF- 432
            P Y A R  F+ QT  +      P        D CY +S F     +  R+PTVS  F 
Sbjct: 314 GPVYTALRSDFLNQTNGILTVYEDPEFVFQGTMDLCYRISPFRIRTGILHRLPTVSLVFE 373

Query: 433 ------SGGPVLTLPASNFLIPVDDAG---TFCFAFAPSP-SGLS--IIGNIQQEGIQIS 480
                 SG P+L      + +P   AG    +CF F  S   G+   +IG+  Q+ + I 
Sbjct: 374 GAEIAVSGQPLL------YRVPHLTAGNDSVYCFTFGNSDLMGMEAYVIGHHHQQNMWIE 427

Query: 481 FDGANGFVGFGPNVC 495
           FD     +G  P  C
Sbjct: 428 FDLQRSRIGLAPVQC 442


>gi|26451756|dbj|BAC42973.1| unknown protein [Arabidopsis thaliana]
          Length = 442

 Score =  129 bits (323), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 116/380 (30%), Positives = 171/380 (45%), Gaps = 60/380 (15%)

Query: 158 VRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVC- 216
           V + VG PP++  MV+D+GS++ W+ C+           VF+P  S+++S V CSS +C 
Sbjct: 67  VTLAVGDPPQNISMVLDTGSELSWLHCKKSPNL----GSVFNPVSSSTYSPVPCSSPICR 122

Query: 217 ----DRLENAGCHAGR--CRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHK--- 267
               D    A C      C   +SY D +  +G LA ET  IG         GC      
Sbjct: 123 TRTRDLPIPASCDPKTHLCHVAISYADATSIEGNLAHETFVIGSVTRPGTLFGCMDSGLS 182

Query: 268 -NQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREAL---- 322
            N      + GL+G+  GS+S V QLG      FSYC+   G+ SS  L+ G  +     
Sbjct: 183 SNSEEDAKSTGLMGMNRGSLSFVNQLGF---SKFSYCI--SGSDSSVFLLLGDASYSWLG 237

Query: 323 PVGAAWVPLV-RNPRAPSF----YYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTG 377
           P+   + PLV ++   P F    Y V L G+ VG   + + + +F     G    ++D+G
Sbjct: 238 PI--QYTPLVLQSTPLPYFDRVAYTVQLEGIRVGSKILSLPKSVFVPDHTGAGQTMVDSG 295

Query: 378 TAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIF------DTCY--------NLSGFVSV 423
           T  T L  P Y A ++ F+ QT ++ R      F      D CY        N SG    
Sbjct: 296 TQFTFLMGPVYTALKNEFITQTKSVLRLVDDPDFVFQGTMDLCYKVGSTTRPNFSG---- 351

Query: 424 RVPTVSFYFSGGPVLTLPASNFLIPVDDAGT------FCFAFAPSP-SGLS--IIGNIQQ 474
            +P VS  F G   +++     L  V+ AG+      +CF F  S   G+   +IG+  Q
Sbjct: 352 -LPMVSLMFRGAE-MSVSGQKLLYRVNGAGSEGKEEVYCFTFGNSDLLGIEAFVIGHHHQ 409

Query: 475 EGIQISFDGANGFVGFGPNV 494
           + + + FD A   VGF  NV
Sbjct: 410 QNVWMEFDLAKSRVGFAGNV 429


>gi|10177232|dbj|BAB10606.1| protease-like protein [Arabidopsis thaliana]
          Length = 539

 Score =  129 bits (323), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 120/399 (30%), Positives = 173/399 (43%), Gaps = 48/399 (12%)

Query: 103 RHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGS-GEYFVRIG 161
            H+        RD  R   L++ L G         V DF  D     D    G Y+ ++ 
Sbjct: 38  NHEMELSQLKARDEARHGRLLQSLGG---------VIDFPVD--GTFDPFVVGLYYTKLR 86

Query: 162 VGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFSGVSCSSAVC 216
           +G+PPR  Y+ +D+GSD++WV C  C+ C + S        FDP  S + S +SCS   C
Sbjct: 87  LGTPPRDFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTASPISCSDQRC 146

Query: 217 D---RLENAGC--HAGRCRYEVSYGDGSYTKGTLALETL----TIGRTVVKN----VAIG 263
               +  ++GC      C Y   YGDGS T G    + L     +G ++V N    V  G
Sbjct: 147 SWGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLVPNSTAPVVFG 206

Query: 264 CGHKNQGMFV----GAAGLLGLGGGSMSLVGQLGGQ--TGGAFSYCLVSRGTGSSGSLVF 317
           C     G  V       G+ G G   MS++ QL  Q      FS+CL     G  G LV 
Sbjct: 207 CSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCLKGE-NGGGGILVL 265

Query: 318 GREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTG 377
           G    P    + PLV  P  P  Y V L  + V G  +PI+  +F  +     G ++DTG
Sbjct: 266 GEIVEP-NMVFTPLV--PSQPH-YNVNLLSISVNGQALPINPSVFSTSN--GQGTIIDTG 319

Query: 378 TAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPV 437
           T +  L   AY  F +A         R   VS  + CY ++  V    P VS  F+GG  
Sbjct: 320 TTLAYLSEAAYVPFVEAITNAVSQSVRPV-VSKGNQCYVITTSVGDIFPPVSLNFAGGAS 378

Query: 438 LTLPASNFLIPVDDAG---TFCFAFAP-SPSGLSIIGNI 472
           + L   ++LI  ++ G    +C  F      G++I+G++
Sbjct: 379 MFLNPQDYLIQQNNVGGTAVWCIGFQRIQNQGITILGDL 417


>gi|357125326|ref|XP_003564345.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
           distachyon]
          Length = 506

 Score =  129 bits (323), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 108/373 (28%), Positives = 171/373 (45%), Gaps = 41/373 (10%)

Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFSG 208
           G YF R+ +G+P +  ++ ID+GSDI+WV C PC+ C   S        F+P  S++ S 
Sbjct: 87  GLYFTRVKLGNPAKEYFVQIDTGSDILWVACSPCTGCPTSSGLNIQLEFFNPDSSSTSSR 146

Query: 209 VSCSSAVCDRLENAG---CHAGR-----CRYEVSYGDGSYTKGTLALETLTIGRTVVKN- 259
           + CS   C      G   C +       C Y  +YGDGS T G    +T+    TV+ N 
Sbjct: 147 IPCSDDRCTAALQTGEAVCQSSDSPSSPCGYTFTYGDGSGTSGFYVSDTMYF-DTVMGNE 205

Query: 260 --------VAIGCGHKNQGMFV----GAAGLLGLGGGSMSLVGQLG--GQTGGAFSYCLV 305
                   V  GC +   G  +       G+ G G   +S+V QL   G +   FS+CL 
Sbjct: 206 QTANSSASVVFGCSNSQSGDLMKTDRAVDGIFGFGQHQLSVVSQLYSLGVSPKTFSHCL- 264

Query: 306 SRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLT 365
            +G+ + G ++   E +  G  + PLV  P  P  Y + L  + V G ++PI   LF  +
Sbjct: 265 -KGSDNGGGILVLGEIVEPGLVFTPLV--PSQP-HYNLNLESIAVSGQKLPIDSSLFATS 320

Query: 366 QMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRV 425
                G ++D+GT +  L   AY+ F +A  A      R+        C+  +  V    
Sbjct: 321 NT--QGTIVDSGTTLVYLVDGAYDPFINAIAAAVSPSVRSVVSKGIQ-CFVTTSSVDSSF 377

Query: 426 PTVSFYFSGGPVLTLPASNFLI---PVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFD 482
           PT + YF GG  +T+   N+L+    VD+   +C  +  S  G++I+G++  +     +D
Sbjct: 378 PTATLYFKGGVSMTVKPENYLLQQGSVDNNVLWCIGWQRS-QGITILGDLVLKDKIFVYD 436

Query: 483 GANGFVGFGPNVC 495
            AN  +G+    C
Sbjct: 437 LANMRMGWADYDC 449


>gi|218186446|gb|EEC68873.1| hypothetical protein OsI_37494 [Oryza sativa Indica Group]
          Length = 353

 Score =  129 bits (323), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 101/367 (27%), Positives = 163/367 (44%), Gaps = 40/367 (10%)

Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS-QCYKQSDP---VFDPADSASFSG 208
             +YF+ I +G+PP    + ID+GS + WVQC+ C  +CY Q+     +F+P +S+++S 
Sbjct: 3   KNKYFMGISLGTPPVFNLVTIDTGSTLSWVQCKNCQIKCYDQAAKAGQIFNPYNSSTYSK 62

Query: 209 VSCSSAVCDRLE-----NAGC--HAGRCRYEVSYGDGSYTKGTLALETLTIGRT-VVKNV 260
           V CS+  C+ +        GC      C Y + YG G Y+ G L  + LT+     + N 
Sbjct: 63  VGCSTEACNGMHMDLAVEYGCVEEDDTCIYSLRYGSGEYSVGYLGKDRLTLASNRSIDNF 122

Query: 261 AIGCGHKNQGMFVGA-AGLLGLGGGSMSLVGQLGGQTG-GAFSYCLVSRGTGSSGSLVFG 318
             GCG  N  ++ G  AG++G G  S S   Q+  QT   AFSYC   R   + GSL  G
Sbjct: 123 IFGCGEDN--LYNGVNAGIIGFGTKSYSFFNQVCQQTDYTAFSYCF-PRDHENEGSLTIG 179

Query: 319 REALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGT 378
             A  +   W  L+     P+ Y +    + V G+R+ I   ++ +++M     ++D+GT
Sbjct: 180 PYARDINLMWTKLIYYDHKPA-YAIQQLDMMVNGIRLEIDPYIY-ISKM----TIVDSGT 233

Query: 379 AVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCY-------NLSGFVSVRVPTVSFY 431
           A T + +P ++A   A   +        G      C+       N + F +V +  +   
Sbjct: 234 ADTYILSPVFDALDKAMTKEMQAKGYTRGWDERRICFISNSGSANWNDFPTVEMKLIR-- 291

Query: 432 FSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPS---GLSIIGNIQQEGIQISFDGANGFV 488
                 L LP  N      +    C  F P  +   G+ ++GN      ++ FD      
Sbjct: 292 ----STLKLPVENAFYESSN-NVICSTFLPDDAGVRGVQMLGNRAVRSFKLVFDIQAMNF 346

Query: 489 GFGPNVC 495
           GF    C
Sbjct: 347 GFKARAC 353


>gi|297848856|ref|XP_002892309.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297338151|gb|EFH68568.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 484

 Score =  128 bits (322), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 121/432 (28%), Positives = 185/432 (42%), Gaps = 67/432 (15%)

Query: 100 HYHRHQHSFHARMQRDVKRVATLVR--RLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYF 157
            Y R Q S  A  + D +R  T++    L  GG                +G     G Y+
Sbjct: 38  RYPRLQGSLSALKEHDDRRQLTILAGIDLPLGG----------------TGRPDIPGLYY 81

Query: 158 VRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFSGVSCS 212
            +IG+G+P +S Y+ +D+GSDI+WV C  C QC ++S       +++  +S S   VSC 
Sbjct: 82  AKIGIGTPAKSYYVQVDTGSDIMWVNCIQCKQCPRRSTLGIELTLYNIDESDSGKLVSCD 141

Query: 213 SAVCDRLEN---AGCHAGR-CRYEVSYGDGSYTKGTLALETLTIG--------RTVVKNV 260
              C ++     +GC A   C Y   YGDGS T G    + +           +T   +V
Sbjct: 142 DDFCYQISGGPLSGCKANMSCPYLEIYGDGSSTAGYFVKDVVQYDSVAGDLKTQTANGSV 201

Query: 261 AIGCGHKNQGMFVGAA-----GLLGLGGGSMSLVGQLG--GQTGGAFSYCLVSRGTGSSG 313
             GCG +  G    +      G+LG G  + S++ QL   G+    F++CL  R  G  G
Sbjct: 202 IFGCGARQSGDLDSSNEEALDGILGFGKANSSMISQLASSGRVKKIFAHCLDGRNGG--G 259

Query: 314 SLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGD-DGV 372
               GR   P      PLV  P  P  Y V ++ + VG   + I  DLF   Q GD  G 
Sbjct: 260 IFAIGRVVQP-KVNMTPLV--PNQPH-YNVNMTAVQVGQEFLNIPADLF---QPGDRKGA 312

Query: 373 VMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFD---TCYNLSGFVSVRVPTVS 429
           ++D+GT +  LP   YE       +Q   L     V I D    C+  SG V    P V+
Sbjct: 313 IIDSGTTLAYLPEIIYEPLVKKITSQEPALK----VHIVDKDYKCFQYSGRVDEGFPNVT 368

Query: 430 FYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSP------SGLSIIGNIQQEGIQISFDG 483
           F+F     L +   ++L P +  G +C  +  S         ++++G++      + +D 
Sbjct: 369 FHFENSVFLRVYPHDYLFPYE--GMWCIGWQNSAMQSRDRRNMTLLGDLVLSNKLVLYDL 426

Query: 484 ANGFVGFGPNVC 495
            N  +G+    C
Sbjct: 427 ENQLIGWTEYNC 438


>gi|414871328|tpg|DAA49885.1| TPA: hypothetical protein ZEAMMB73_545054 [Zea mays]
          Length = 565

 Score =  128 bits (322), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 86/240 (35%), Positives = 116/240 (48%), Gaps = 11/240 (4%)

Query: 262 IGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGRE 320
            GC     G  V + GL+G   G +S   Q     G  FSYCL S + +  SG+L  G  
Sbjct: 329 FGCLCVVTGGSVPSQGLVGFNRGPLSFPSQNKNVYGSVFSYCLPSYKSSNFSGTLRLGPA 388

Query: 321 ALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAV 380
             P      PL+ NP  PS YYV + G+ VGG  + +             G ++D GT  
Sbjct: 389 GQPKRIKTTPLLSNPHRPSLYYVNMVGIRVGGRPVAVPASALAFDPASGHGTIVDAGTMF 448

Query: 381 TRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTL 440
           TRL  P Y A  D F ++    P A  +  FDTCYN    V++ VPTV+F F G   +TL
Sbjct: 449 TRLSAPVYAAVCDVFRSRV-RAPVAGPLGGFDTCYN----VTISVPTVTFLFDGRVSVTL 503

Query: 441 PASNFLIPVDDAGTFCFAFAPSPSG-----LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           P  N +I     G  C A A  PS      L+++ ++QQ+  ++ FD ANG VGF   +C
Sbjct: 504 PEENVVIRSSLDGIACLAMAAGPSDSVDAVLNVMASMQQQNHRVLFDVANGRVGFSRELC 563


>gi|77553049|gb|ABA95845.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
          Length = 372

 Score =  128 bits (322), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 101/367 (27%), Positives = 163/367 (44%), Gaps = 40/367 (10%)

Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCS-QCYKQSDP---VFDPADSASFSG 208
             +YF+ I +G+PP    + ID+GS + WVQC+ C  +CY Q+     +F+P +S+++S 
Sbjct: 22  KNKYFMGISLGTPPVFNLVTIDTGSTLSWVQCKNCQIKCYDQAAKAGQIFNPYNSSTYSK 81

Query: 209 VSCSSAVCDRLE-----NAGC--HAGRCRYEVSYGDGSYTKGTLALETLTIGRT-VVKNV 260
           V CS+  C+ +        GC      C Y + YG G Y+ G L  + LT+     + N 
Sbjct: 82  VGCSTEACNGMHMDLAVEYGCVEEDDTCIYSLRYGSGEYSVGYLGKDRLTLASNRSIDNF 141

Query: 261 AIGCGHKNQGMFVGA-AGLLGLGGGSMSLVGQLGGQTG-GAFSYCLVSRGTGSSGSLVFG 318
             GCG  N  ++ G  AG++G G  S S   Q+  QT   AFSYC   R   + GSL  G
Sbjct: 142 IFGCGEDN--LYNGVNAGIIGFGTKSYSFFNQVCQQTDYTAFSYCF-PRDHENEGSLTIG 198

Query: 319 REALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGT 378
             A  +   W  L+     P+ Y +    + V G+R+ I   ++ +++M     ++D+GT
Sbjct: 199 PYARDINLMWTKLIYYDHKPA-YAIQQLDMMVNGIRLEIDPYIY-ISKM----TIVDSGT 252

Query: 379 AVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCY-------NLSGFVSVRVPTVSFY 431
           A T + +P ++A   A   +        G      C+       N + F +V +  +   
Sbjct: 253 ADTYILSPVFDALDKAMTKEMQAKGYTRGWDERRICFISNSGSANWNDFPTVEMKLIR-- 310

Query: 432 FSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPS---GLSIIGNIQQEGIQISFDGANGFV 488
                 L LP  N      +    C  F P  +   G+ ++GN      ++ FD      
Sbjct: 311 ----STLKLPVENAFYESSN-NVICSTFLPDDAGVRGVQMLGNRAVRSFKLVFDIQAMNF 365

Query: 489 GFGPNVC 495
           GF    C
Sbjct: 366 GFKARAC 372


>gi|110740049|dbj|BAF01928.1| nucleoid DNA-binding protein cnd41 - like protein [Arabidopsis
           thaliana]
          Length = 183

 Score =  128 bits (322), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 73/186 (39%), Positives = 103/186 (55%), Gaps = 8/186 (4%)

Query: 312 SGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDG 371
           +G L FG   +     + P+       SFY + +  + VGG ++PI   +F        G
Sbjct: 3   TGHLTFGSAGISRSVKFTPISTITDGTSFYGLNIVAITVGGQKLPIPSTVFS-----TPG 57

Query: 372 VVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFY 431
            ++D+GT +TRLP  AY A R +F A+    P  SGVSI DTC++LSGF +V +P V+F 
Sbjct: 58  ALIDSGTVITRLPPKAYAALRSSFKAKMSKYPTTSGVSILDTCFDLSGFKTVTIPKVAFS 117

Query: 432 FSGGPVLTLPASNFLIPVDDAGTFCFAFA--PSPSGLSIIGNIQQEGIQISFDGANGFVG 489
           FSGG V+ L  S  +  V      C AFA     S  +I GN+QQ+ +++ +DGA G VG
Sbjct: 118 FSGGAVVEL-GSKGIFYVFKISQVCLAFAGNSDDSNAAIFGNVQQQTLEVVYDGAGGRVG 176

Query: 490 FGPNVC 495
           F PN C
Sbjct: 177 FAPNGC 182


>gi|108707839|gb|ABF95634.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
          Length = 330

 Score =  128 bits (321), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 102/311 (32%), Positives = 145/311 (46%), Gaps = 32/311 (10%)

Query: 196 PVFDPADSASFSGVSCSSAVCDRLENAGCHAGR------CRYEVSYGDGSYTKGTLALET 249
           P FD + S++    SC S +C  L  A C   +      C Y   Y D S T G + ++ 
Sbjct: 23  PYFDRSTSSTLLLTSCDSTLCQGLLVASCGNTKFWPNQTCVYTYYYNDKSVTTGLIEVDK 82

Query: 250 LTIGR-TVVKNVAIGCGHKNQGMFV-GAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR 307
            T G    V  VA GCG  N G+F     G+ G G G +SL  QL     G FS+C  + 
Sbjct: 83  FTFGAGASVPGVAFGCGLFNNGVFKSNETGIAGFGRGPLSLPSQL---KVGNFSHCFTAV 139

Query: 308 GTGSSGSLVF---------GREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPIS 358
                 +++          GR A+       PL++N   P+FYY+ L G+ VG  R+P+ 
Sbjct: 140 NGLKQSTVLLDLPADLYKNGRGAV----QSTPLIQNSANPTFYYLSLKGITVGSTRLPVP 195

Query: 359 EDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFD-TCYNL 417
           E  F LT  G  G ++D+GT++T LP   Y+  RD F AQ   LP   G +    TC++ 
Sbjct: 196 ESAFALTN-GTGGTIIDSGTSITSLPPQVYQVVRDEFAAQI-KLPVVPGNATGPYTCFSA 253

Query: 418 SGFVSVRVPTVSFYFSGGPVLTLPASNFLIPV-DDAGT--FCFAFAPSPSGLSIIGNIQQ 474
                  VP +  +F G   + LP  N++  V DDAG    C A        +IIGN QQ
Sbjct: 254 PSQAKPDVPKLVLHFEGA-TMDLPRENYVFEVPDDAGNSIICLAINKG-DETTIIGNFQQ 311

Query: 475 EGIQISFDGAN 485
           + + + +D  N
Sbjct: 312 QNMHVLYDLQN 322


>gi|356514298|ref|XP_003525843.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
          Length = 663

 Score =  128 bits (321), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 95/360 (26%), Positives = 161/360 (44%), Gaps = 31/360 (8%)

Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
           +G Y  R+ +G+PP+   +++D+GS + +V C  C QC +  DP F P  S+++  V C+
Sbjct: 109 NGYYTTRLWIGTPPQMFALIVDTGSTVTYVPCSTCEQCGRHQDPKFQPESSSTYQPVKCT 168

Query: 213 SAVCDRLENAGCHAGR--CRYEVSYGDGSYTKGTLALETLTIG---RTVVKNVAIGCGHK 267
                   +  C   R  C YE  Y + S + G L  + ++ G       +    GC + 
Sbjct: 169 I-------DCNCDGDRMQCVYERQYAEMSTSSGVLGEDVISFGNQSELAPQRAVFGCENV 221

Query: 268 NQGMFVG--AAGLLGLGGGSMSLVGQLGGQT--GGAFSYCLVSRGTGSSGSLVFGREALP 323
             G      A G++GLG G +S++ QL  +     +FS C      G  G++V G  + P
Sbjct: 222 ETGDLYSQHADGIMGLGRGDLSIMDQLVDKKVISDSFSLCYGGMDVG-GGAMVLGGISPP 280

Query: 324 VGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRL 383
               +     +P    +Y + L  + V G R+P++ ++F     G  G V+D+GT    L
Sbjct: 281 SDMTFA--YSDPDRSPYYNIDLKEMHVAGKRLPLNANVFD----GKHGTVLDSGTTYAYL 334

Query: 384 PTPAYEAFRDAFVAQTGNLPRASG--VSIFDTCYNLSGF----VSVRVPTVSFYFSGGPV 437
           P  A+ AF+DA V +  +L + SG   +  D C++ +G     +S   P V   F  G  
Sbjct: 335 PEAAFLAFKDAIVKELQSLKQISGPDPNYNDICFSGAGNDVSQLSKSFPVVDMVFGNGHK 394

Query: 438 LTLPASNFLIPVDDA-GTFCFA-FAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
            +L   N++       G +C   F       +++G I      + +D     +GF    C
Sbjct: 395 YSLSPENYMFRHSKVRGAYCLGIFQNGNDQTTLLGGIIVRNTLVMYDREQTKIGFWKTNC 454


>gi|224091849|ref|XP_002309371.1| predicted protein [Populus trichocarpa]
 gi|222855347|gb|EEE92894.1| predicted protein [Populus trichocarpa]
          Length = 438

 Score =  128 bits (321), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 124/457 (27%), Positives = 199/457 (43%), Gaps = 84/457 (18%)

Query: 60  LFERHNNISSSN----TSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRD 115
           +F  H  ++ +N    +S    R   +L+HRD + S         Y+R   +   R +R 
Sbjct: 14  IFSTHFALTIANNLEFSSIQPTRLVTKLIHRDSIVSP--------YYRSNDTVADRTERT 65

Query: 116 VKRVATLVRRLSGGGADAAKHEVQDFG-TDVVSGMDQGSGE--YFVRIGVGSPPRSQYMV 172
           +K  A+L R LS   A   +    DF   D+   +   + E  + V   +G PP  Q  +
Sbjct: 66  MK--ASLAR-LSYLYAKIER----DFDINDLWLNLHPSASEPLFLVNFSMGQPPVPQLAI 118

Query: 173 IDSGSDIVWVQCQPCSQCYKQ-SDPVFDPADSASFSGVSCSSAVCDRLENAGCH-AGRCR 230
           +D+GS ++W+QC PC  C +Q   P+FDP+ S+++  +SC + +C    +  C  + +C 
Sbjct: 119 MDTGSSLLWIQCAPCKSCSQQIIGPMFDPSISSTYDSLSCKNIICRYAPSGECDSSSQCV 178

Query: 231 YEVSYGDGSYTKGTLALETLTI-----GRTVVKNVAIGCGHKNQGMFVGA--AGLLGLGG 283
           Y  +Y +G  + G +A E L       GR  V NV  GC H+N G +      G+ GLG 
Sbjct: 179 YNQTYVEGLPSVGVIATEQLIFGSSDEGRNAVNNVLFGCSHRN-GNYKDRRFTGVFGLGS 237

Query: 284 GSMSLVGQLGGQTGGAFSYCL--VSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFY 341
           G  S+V Q+G +    FSYC+  ++    S   LV   E + +     PL         Y
Sbjct: 238 GITSVVNQMGSK----FSYCIGNIADPDYSYNQLVLS-EGVNMEGYSTPL---DVVDGHY 289

Query: 342 YVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAF---------- 391
            V L G+ VG  R+ I    F+ T+     V++D+GTA T L    Y A           
Sbjct: 290 QVILEGISVGETRLVIDPSAFKRTE-KQRRVIIDSGTAPTWLAENEYRALEREVRNLLDR 348

Query: 392 ------RDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNF 445
                 R++F+   G + +           +L GF     P V+F+F+ G  L       
Sbjct: 349 FLTPFMRESFLCYKGKVGQ-----------DLVGF-----PAVTFHFAEGADLV------ 386

Query: 446 LIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFD 482
              VD        +       S+IG + Q+   +++D
Sbjct: 387 ---VDTEMRQASVYGKDFKDFSVIGLMAQQYYNVAYD 420


>gi|449441618|ref|XP_004138579.1| PREDICTED: uncharacterized protein LOC101220661 [Cucumis sativus]
          Length = 2819

 Score =  128 bits (321), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 113/371 (30%), Positives = 166/371 (44%), Gaps = 48/371 (12%)

Query: 158  VRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVC- 216
            V + VGSPP+   MV+D+GS++ W+ C+           VF+P  S+S+S + CSS +C 
Sbjct: 1002 VSLTVGSPPQQVTMVLDTGSELSWLHCKKSPNLTS----VFNPLSSSSYSPIPCSSPICR 1057

Query: 217  ----DRLENAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHK---- 267
                D      C   + C   VSY D S  +G LA +   IG + +     GC       
Sbjct: 1058 TRTRDLPNPVTCDPKKLCHAIVSYADASSLEGNLASDNFRIGSSALPGTLFGCMDSGFSS 1117

Query: 268  NQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPV--G 325
            N        GL+G+  GS+S V QLG      FSYC+   G  SSG L+FG   L     
Sbjct: 1118 NSEEDAKTTGLMGMNRGSLSFVTQLGLP---KFSYCI--SGRDSSGVLLFGDLHLSWLGN 1172

Query: 326  AAWVPLVR-NPRAPSF----YYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAV 380
              + PLV+ +   P F    Y V L G+ VG   +P+ + +F     G    ++D+GT  
Sbjct: 1173 LTYTPLVQISTPLPYFDRVAYTVQLDGIRVGNKILPLPKSIFAPDHTGAGQTMVDSGTQF 1232

Query: 381  TRLPTPAYEAFRDAFVAQTGNLPRASGVSIF------DTCYNL-SGFVSVRVPTVSFYFS 433
            T L  P Y A R+ F+ QT  +    G   F      D CY++ +G     +P+VS  F 
Sbjct: 1233 TFLLGPVYTALRNEFLEQTKGVLAPLGDPNFVFQGAMDLCYSVAAGGKLPTLPSVSLMFR 1292

Query: 434  ------GGPVLTLPASNFLIPVDDAGTFCFAFAPSP-SGLS--IIGNIQQEGIQISFDGA 484
                  GG VL       +    +   +C  F  S   G+   +IG+  Q+ + + FD  
Sbjct: 1293 GAEMVVGGEVLLYRVPEMM--KGNEWVYCLTFGNSDLLGIEAFVIGHHHQQNVWMEFD-- 1348

Query: 485  NGFVGFGPNVC 495
               V F  ++C
Sbjct: 1349 --LVAFAADLC 1357


>gi|242086034|ref|XP_002443442.1| hypothetical protein SORBIDRAFT_08g019550 [Sorghum bicolor]
 gi|241944135|gb|EES17280.1| hypothetical protein SORBIDRAFT_08g019550 [Sorghum bicolor]
          Length = 443

 Score =  127 bits (320), Expect = 9e-27,   Method: Compositional matrix adjust.
 Identities = 125/428 (29%), Positives = 182/428 (42%), Gaps = 50/428 (11%)

Query: 92  SSNTTNNMHYHRHQHSFHARMQRDVKRVATLVR-------RLSGGGADAAKHEVQDFGTD 144
           +SNT   M         +      V+R   L R       R  GGG  A  H        
Sbjct: 29  TSNTGIRMKLTHVDAKGNYTAPERVRRAIALSRQINLASTRAEGGGVSAPVH-------- 80

Query: 145 VVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQ--CYKQSDPVFDPAD 202
                   + +Y     VG PP+    +ID+GS ++W QC  C +  C +Q  P F+ + 
Sbjct: 81  ------WATRQYIAEYMVGDPPQRAEALIDTGSSLIWTQCTACLRKVCVRQDLPYFNASS 134

Query: 203 SASFSGVSCSSAVCDRLENAGCHA-GRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVA 261
           S SF+ V C    C       C   G C + V+YG G    G L  +  T  ++    +A
Sbjct: 135 SGSFAPVPCQDKACAGNYLHFCALDGTCTFRVTYGAGGII-GFLGTDAFTF-QSGGATLA 192

Query: 262 IGCGHKNQ----GMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS--RGTGSSGSL 315
            GC    +     +  GA+GL+GLG G +SL  Q G +    FSYCL       G+S  L
Sbjct: 193 FGCVSFTRFAAPDVLHGASGLIGLGRGRLSLASQTGAKR---FSYCLTPYFHNNGASSHL 249

Query: 316 VFGREALPVGA-------AWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQM- 367
             G  A   G        A+V   ++    +FYY+ L G+ VG  ++ I    F L ++ 
Sbjct: 250 FVGAAASLSGGGGAVMSMAFVESPKDYPYSTFYYLPLVGITVGETKLAIPSTAFDLQEVE 309

Query: 368 ---GDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQ-TGNLPRASGVSIFDTCYNLS-GFVS 422
               + GV++D+G+  T L   AYE        Q  G+L    G         ++ G + 
Sbjct: 310 EGFWEGGVIIDSGSPFTSLVEDAYEPLMGELARQLNGSLVPPPGEDDGGMALCVARGDLD 369

Query: 423 VRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFD 482
             VPT+  +FSGG  + LP  N+  P++ + T C A        SIIGN QQ+ + I FD
Sbjct: 370 RVVPTLVLHFSGGADMALPPENYWAPLEKS-TACMAIVRG-YLQSIIGNFQQQNMHILFD 427

Query: 483 GANGFVGF 490
              G + F
Sbjct: 428 VGGGRLSF 435


>gi|224109494|ref|XP_002315215.1| predicted protein [Populus trichocarpa]
 gi|222864255|gb|EEF01386.1| predicted protein [Populus trichocarpa]
          Length = 444

 Score =  127 bits (320), Expect = 9e-27,   Method: Compositional matrix adjust.
 Identities = 106/371 (28%), Positives = 168/371 (45%), Gaps = 47/371 (12%)

Query: 160 IGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD--PVFDPADSASFSGVSCSSAVC- 216
           + +G+PP++  MV+D+GS++ W++C+      K+ +   +F+P  S +++ + CSS  C 
Sbjct: 71  LTIGTPPQNITMVLDTGSELSWLRCK------KEPNFTSIFNPLASKTYTKIPCSSQTCK 124

Query: 217 ----DRLENAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGC----GHK 267
               D      C   + C + +SY D S  +G LA ET   G         GC       
Sbjct: 125 TRTSDLTLPVTCDPAKLCHFIISYADASSVEGHLAFETFRFGSLTRPATVFGCMDSGSSS 184

Query: 268 NQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFG--REALPVG 325
           N        GL+G+  GS+S V Q+G +    FSYC+   G  S+G L+ G  R +    
Sbjct: 185 NTEEDAKTTGLMGMNRGSLSFVNQMGFR---KFSYCI--SGLDSTGFLLLGEARYSWLKP 239

Query: 326 AAWVPLVR-NPRAPSF----YYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAV 380
             + PLV+ +   P F    Y V L G+ V    +P+ + +F     G    ++D+GT  
Sbjct: 240 LNYTPLVQISTPLPYFDRVAYSVQLEGIKVNNKVLPLPKSVFVPDHTGAGQTMVDSGTQF 299

Query: 381 TRLPTPAYEAFRDAFVAQTG------NLPRASGVSIFDTCYNLSGFVSV--RVPTVSFYF 432
           T L  P Y A R  F+ QT       N P+       D CY +    S    +P V   F
Sbjct: 300 TFLLGPVYSALRKEFLLQTAGVLRVLNEPQYVFQGAMDLCYLIDSTSSTLPNLPVVKLMF 359

Query: 433 SGGPVLTLPASNFL--IPVDDAG---TFCFAFAPSPS-GLS--IIGNIQQEGIQISFDGA 484
            G   +++     L  +P +  G    +CF F  S   G+S  +IG+ QQ+ + + +D  
Sbjct: 360 RGAE-MSVSGQRLLYRVPGEVRGKDSVWCFTFGNSDELGISSFLIGHHQQQNVWMEYDLE 418

Query: 485 NGFVGFGPNVC 495
           N  +GF    C
Sbjct: 419 NSRIGFAELRC 429


>gi|297805182|ref|XP_002870475.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297316311|gb|EFH46734.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 480

 Score =  127 bits (320), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 100/370 (27%), Positives = 168/370 (45%), Gaps = 40/370 (10%)

Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFSG 208
           G YF +I +GSPP+  Y+ +D+GSDI+WV C PC +C  ++D      ++D   S++   
Sbjct: 75  GLYFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCPKCPVKTDLGIPLSLYDSKASSTSKN 134

Query: 209 VSCSSAVCD-RLENAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRT--------VVK 258
           V C  A C   +++  C A + C Y V YGDGS + G    + +T+ +         + +
Sbjct: 135 VGCEDAFCSFIMQSETCGAKKPCSYHVVYGDGSTSDGDFVKDNITLDQVTGNLRTAPLAQ 194

Query: 259 NVAIGCGHKNQGMFVGAA-----GLLGLGGGSMSLVGQL--GGQTGGAFSYCLVSRGTGS 311
            V  GCG KNQ   +G       G++G G  + S++ QL  GG     FS+CL +   G 
Sbjct: 195 EVVFGCG-KNQSGQLGQTESAVDGIMGFGQSNTSVISQLAAGGSVKRIFSHCLDNMNGG- 252

Query: 312 SGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDG 371
            G    G    PV     PLV N      Y V L G+ V G  I +   L   +  GD G
Sbjct: 253 -GIFAIGEVESPV-VKTTPLVPNQV---HYNVILKGMDVDGEPIDLPPSL--ASTNGDGG 305

Query: 372 VVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFY 431
            ++D+GT +  LP   Y +  +   A+     +   V     C++ +       P V+ +
Sbjct: 306 TIIDSGTTLAYLPQNLYNSLIEKITAK--QQVKLHMVQETFACFSFTSNTDKAFPVVNLH 363

Query: 432 FSGGPVLTLPASNFLIPVDDAGTFCFAF------APSPSGLSIIGNIQQEGIQISFDGAN 485
           F     L++   ++L  + +   +CF +          + + ++G++      + +D  N
Sbjct: 364 FEDSLKLSVYPHDYLFSLRE-DMYCFGWQSGGMTTQDGADVILLGDLVLSNKLVVYDLEN 422

Query: 486 GFVGFGPNVC 495
             +G+  + C
Sbjct: 423 EVIGWADHNC 432


>gi|18390579|ref|NP_563751.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|332189782|gb|AEE27903.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 485

 Score =  127 bits (320), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 121/432 (28%), Positives = 185/432 (42%), Gaps = 67/432 (15%)

Query: 100 HYHRHQHSFHARMQRDVKRVATLVR--RLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYF 157
            Y R Q S  A  + D +R  T++    L  GG                +G     G Y+
Sbjct: 38  RYPRLQGSLTALKEHDDRRQLTILAGIDLPLGG----------------TGRPDIPGLYY 81

Query: 158 VRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFSGVSCS 212
            +IG+G+P +S Y+ +D+GSDI+WV C  C QC ++S       +++  +S S   VSC 
Sbjct: 82  AKIGIGTPAKSYYVQVDTGSDIMWVNCIQCKQCPRRSTLGIELTLYNIDESDSGKLVSCD 141

Query: 213 SAVCDRLEN---AGCHAGR-CRYEVSYGDGSYTKGTLALETLTIG--------RTVVKNV 260
              C ++     +GC A   C Y   YGDGS T G    + +           +T   +V
Sbjct: 142 DDFCYQISGGPLSGCKANMSCPYLEIYGDGSSTAGYFVKDVVQYDSVAGDLKTQTANGSV 201

Query: 261 AIGCGHKNQGMFVGAA-----GLLGLGGGSMSLVGQLG--GQTGGAFSYCLVSRGTGSSG 313
             GCG +  G    +      G+LG G  + S++ QL   G+    F++CL  R  G  G
Sbjct: 202 IFGCGARQSGDLDSSNEEALDGILGFGKANSSMISQLASSGRVKKIFAHCLDGRNGG--G 259

Query: 314 SLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGD-DGV 372
               GR   P      PLV  P  P  Y V ++ + VG   + I  DLF   Q GD  G 
Sbjct: 260 IFAIGRVVQP-KVNMTPLV--PNQPH-YNVNMTAVQVGQEFLTIPADLF---QPGDRKGA 312

Query: 373 VMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFD---TCYNLSGFVSVRVPTVS 429
           ++D+GT +  LP   YE       +Q   L     V I D    C+  SG V    P V+
Sbjct: 313 IIDSGTTLAYLPEIIYEPLVKKITSQEPALK----VHIVDKDYKCFQYSGRVDEGFPNVT 368

Query: 430 FYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSP------SGLSIIGNIQQEGIQISFDG 483
           F+F     L +   ++L P +  G +C  +  S         ++++G++      + +D 
Sbjct: 369 FHFENSVFLRVYPHDYLFPHE--GMWCIGWQNSAMQSRDRRNMTLLGDLVLSNKLVLYDL 426

Query: 484 ANGFVGFGPNVC 495
            N  +G+    C
Sbjct: 427 ENQLIGWTEYNC 438


>gi|449476186|ref|XP_004154665.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
           2-like [Cucumis sativus]
          Length = 478

 Score =  127 bits (319), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 103/374 (27%), Positives = 171/374 (45%), Gaps = 43/374 (11%)

Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFS 207
           +G Y+ RIG+GSPP   ++ +D+GSDI+WV C  CS C K+SD      +++P  S++ +
Sbjct: 70  TGLYYARIGIGSPPNDFHVQVDTGSDILWVNCVGCSNCPKKSDIGVDLQLYNPKSSSTST 129

Query: 208 GVSCSSAVCDRLENA---GCHAG-RCRYEVSYGDGSYTKGTLALETLTIGRTVVKN---- 259
            ++C    C    +A   GC     C+Y+V YGDGS T G    + + + R V  +    
Sbjct: 130 LITCDQPFCSATYDAPIPGCKPDLLCQYKVIYGDGSATAGYFVNDYIQLQRAVGNHKTSE 189

Query: 260 ----VAIGCGHKNQGMFVGAA----GLLGLGGGSMSLVGQLG--GQTGGAFSYCLVSRGT 309
               +  GCG K  G    ++    G+LG G  + S++ QL   G+    F++CL S   
Sbjct: 190 TNGSIVFGCGAKQSGELGSSSEALDGILGFGQANSSMISQLAATGKVKKIFAHCLDSISG 249

Query: 310 GSSGSLVFGREALPVGAAWVPLVRNPRAP--SFYYVGLSGLGVGGMRIPISEDLFRLTQM 367
           G  G    G    P       L   P  P  + Y V L+G+ VG   + +   LF  +  
Sbjct: 250 G--GIFAIGEVVEP------KLXNTPVVPNQAHYNVVLNGVKVGDTALDLPLGLFETSY- 300

Query: 368 GDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPT 427
              G ++D+GT +  LP   Y    +  +    +L   +    F TC+     V    PT
Sbjct: 301 -KRGAIIDSGTTLAYLPESIYLPLMEKILGAQPDLKLRTVDDQF-TCFVFDKNVDDGFPT 358

Query: 428 VSFYFSGGPVLTLPASNFLIPVDDAGTFCFAF----APSPSG--LSIIGNIQQEGIQISF 481
           V+F F    +LT+    +L  + D   +C  +    A S  G  ++++G++  +   + +
Sbjct: 359 VTFKFEESLILTIYPHEYLFQIRD-DVWCVGWQNSGAQSKDGNEVTLLGDLVLQNKLVYY 417

Query: 482 DGANGFVGFGPNVC 495
           +  N  +G+    C
Sbjct: 418 NLENQTIGWTEYNC 431


>gi|294463081|gb|ADE77078.1| unknown [Picea sitchensis]
          Length = 370

 Score =  127 bits (319), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 103/373 (27%), Positives = 163/373 (43%), Gaps = 57/373 (15%)

Query: 173 IDSGSDIVWVQCQ---PCSQCYKQS--DPVFDPADSASFSGVSCSSAVCDRLEN------ 221
           +D+GSD+VWV C     C  C + S  + VF P  S+S   V+C+ + C  L        
Sbjct: 1   MDTGSDLVWVPCTRNYSCINCPEDSASNGVFLPRMSSSLHLVTCADSNCKTLYGNNTELL 60

Query: 222 --------AGCHAGRCRYEVSYGDGSYTKGTLALETLTI------GRTVVKNVAIGCGHK 267
                     C      Y + YG GS T G L  ETL +      G   + + A+GC   
Sbjct: 61  CQSCAGSLKNCSETCPPYGIQYGRGS-TAGLLLTETLNLPLENGEGARAITHFAVGCSIV 119

Query: 268 NQGMFVGAAGLLGLGGGSMSLVGQLGGQTGG-AFSYCLVSR---GTGSSGSLVFGREALP 323
           +       +G+ G G G++S+  QLG   G   F+YCL S           +V G +ALP
Sbjct: 120 SSQQ---PSGIAGFGRGALSMPSQLGEHIGKDRFAYCLQSHRFDEENKKSLMVLGDKALP 176

Query: 324 --VGAAWVPLVRNPRAPS------FYYVGLSGLGVGGMRIP-ISEDLFRLTQMGDDGVVM 374
             +   + P + N RAP       +YY+GL G+ +GG R+  +   L R    G+ G ++
Sbjct: 177 NNIPLNYTPFLTNSRAPPSSQYGVYYYIGLRGVSIGGKRLKQLPSKLLRFDTKGNGGTII 236

Query: 375 DTGTAVTRLPTPAYEAFRDAFVAQ-----TGNLPRASGVSIFDTCYNLSGFVSVRVPTVS 429
           D+GT  T      ++     F +Q      G +   +G+ +   CY+++G  ++ +P  +
Sbjct: 237 DSGTTFTVFSDEIFKHIAAGFASQIGYRRAGEVEDKTGMGL---CYDVTGLENIVLPEFA 293

Query: 430 FYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLS-------IIGNIQQEGIQISFD 482
           F+F GG  + LP +N+        + C     S   L        I+GN QQ+   + +D
Sbjct: 294 FHFKGGSDMVLPVANYFSYFSSFDSICLTMISSRGLLEVDSGPAVILGNDQQQDFYLLYD 353

Query: 483 GANGFVGFGPNVC 495
                +GF    C
Sbjct: 354 REKNRLGFTQQTC 366


>gi|51536458|gb|AAU05467.1| At5g22850 [Arabidopsis thaliana]
 gi|55733777|gb|AAV59285.1| At5g22850 [Arabidopsis thaliana]
          Length = 426

 Score =  127 bits (319), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 117/385 (30%), Positives = 165/385 (42%), Gaps = 53/385 (13%)

Query: 103 RHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTD-----VVSGMDQGSGEYF 157
            H+        RD  R   L++ L G         V DF  D      V G+      Y+
Sbjct: 38  NHEMELSQLKARDEARHGRLLQSLGG---------VIDFPVDGTFDPFVVGL------YY 82

Query: 158 VRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFSGVSCS 212
            ++ +G+PPR  Y+ +D+GSD++WV C  C+ C + S        FDP  S + S +SCS
Sbjct: 83  TKLRLGTPPRDFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTASPISCS 142

Query: 213 SAVCD---RLENAGC--HAGRCRYEVSYGDGSYTKGTLALETL----TIGRTVVKN---- 259
              C    +  ++GC      C Y   YGDGS T G    + L     +G ++V N    
Sbjct: 143 DQRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLVPNSTAP 202

Query: 260 VAIGCGHKNQGMFV----GAAGLLGLGGGSMSLVGQLGGQ--TGGAFSYCLVSRGTGSSG 313
           V  GC     G  V       G+ G G   MS++ QL  Q      FS+CL     G  G
Sbjct: 203 VVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCLKGE-NGGGG 261

Query: 314 SLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVV 373
            LV G    P    + PLV  P  P  Y V L  + V G  +PI+  +F  +     G +
Sbjct: 262 ILVLGEIVEP-NMVFTPLV--PSQPH-YNVNLLSISVNGQALPINPSVFSTSN--GQGTI 315

Query: 374 MDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFS 433
           +DTGT +  L   AY  F +A         R   VS  + CY ++  V    P VS  F+
Sbjct: 316 IDTGTTLAYLSEAAYVPFVEAITNAVSQSVRPV-VSKGNQCYVITTSVGDIFPPVSLNFA 374

Query: 434 GGPVLTLPASNFLIPVDD-AGTFCF 457
           GG  + L   ++LI  ++ A   CF
Sbjct: 375 GGASMFLNPQDYLIQQNNVASALCF 399


>gi|307136234|gb|ADN34070.1| aspartic proteinase nepenthesin-1 precursor [Cucumis melo subsp.
           melo]
          Length = 412

 Score =  127 bits (319), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 115/371 (30%), Positives = 162/371 (43%), Gaps = 44/371 (11%)

Query: 158 VRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVC- 216
           V + VGSPP+   MV+D+GS++ W+ C+           VF+P  S+S+S + CSS VC 
Sbjct: 42  VSLTVGSPPQQVTMVLDTGSELSWLHCKKSPNLTS----VFNPLSSSSYSPIPCSSPVCR 97

Query: 217 ----DRLENAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHK---- 267
               D      C   + C   VSY D S  +G LA +   IG + +     GC       
Sbjct: 98  TRTRDLPNPVTCDPKKLCHAIVSYADASSLEGNLASDNFRIGSSALPGTLFGCMDSGFSS 157

Query: 268 NQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPV--G 325
           N        GL+G+  GS+S V QLG      FSYC+   G  SSG L+FG   L     
Sbjct: 158 NSEEDAKTTGLMGMNRGSLSFVTQLGLP---KFSYCI--SGRDSSGVLLFGDSHLSWLGN 212

Query: 326 AAWVPLVR-NPRAPSF----YYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAV 380
             + PLV+ +   P F    Y V L G+ VG   +P+ + +F     G    ++D+GT  
Sbjct: 213 LTYTPLVQISTPLPYFDRVAYTVQLDGIRVGNKILPLPKSIFAPDHTGAGQTMVDSGTQF 272

Query: 381 TRLPTPAYEAFRDAFVAQTGNLPRASGVSIF------DTCYNL-SGFVSVRVPTVSFYFS 433
           T L  P Y A R+ F+ QT  +    G   F      D CY + +G     +P VS  F 
Sbjct: 273 TFLLGPVYTALRNEFLEQTKGVLAPLGDPNFVFQGAMDLCYRVPAGGKLPELPAVSLMFR 332

Query: 434 ------GGPVLTLPASNFLIPVDDAGTFCFAFAPSP-SGLS--IIGNIQQEGIQISFDGA 484
                 GG VL       +        +C  F  S   G+   +IG+  Q+ + + FD  
Sbjct: 333 GAEMVVGGEVLLYKVPGMM--KGKEWVYCLTFGNSDLLGIEAFVIGHHHQQNVWMEFDLV 390

Query: 485 NGFVGFGPNVC 495
              VGF    C
Sbjct: 391 KSRVGFVETRC 401


>gi|449440014|ref|XP_004137780.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
 gi|449483406|ref|XP_004156582.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
          Length = 449

 Score =  127 bits (318), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 114/371 (30%), Positives = 169/371 (45%), Gaps = 40/371 (10%)

Query: 158 VRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVC- 216
           V + VG+PP++  MVID+GS++ W+ C   SQ    S   F+P  S+S+S + CSS+ C 
Sbjct: 75  VSLTVGTPPQNVTMVIDTGSELSWLHCN-TSQNSSSSSSTFNPVWSSSYSPIPCSSSTCT 133

Query: 217 ----DRLENAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHK---- 267
               D      C + + C   +SY D S ++G LA +T  IG + + NV  GC       
Sbjct: 134 DQTRDFPIRPSCDSNQFCHATLSYADASSSEGNLATDTFYIGSSGIPNVVFGCMDSIFSS 193

Query: 268 NQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAA 327
           N        GL+G+  GS+S V Q+G      FSYC+       SG L+ G       A 
Sbjct: 194 NSEEDSKNTGLMGMNRGSLSFVSQMGFP---KFSYCISEYDF--SGLLLLGDANFSWLAP 248

Query: 328 --WVPLVR-NPRAPSF----YYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAV 380
             + PL+  +   P F    Y V L G+ V    +PI E +F     G    ++D+GT  
Sbjct: 249 LNYTPLIEMSTPLPYFDRVAYTVQLEGIKVAHKLLPIPESVFEPDHTGAGQTMVDSGTQF 308

Query: 381 TRLPTPAYEAFRDAFVAQTGNLPRASGVSIF------DTCYNLSGFVS--VRVPTVSFYF 432
           T L  PAY A RD F+ +T    R    S F      D CY +    +    +P+V+  F
Sbjct: 309 TFLLGPAYTALRDHFLNKTAGSLRVYEDSNFVFQGAMDLCYRVPTNQTRLPPLPSVTLVF 368

Query: 433 SGGPVLTLPASNFL--IPVDDAGT---FCFAFAPSP-SGLS--IIGNIQQEGIQISFDGA 484
            G   +T+     L  +P +  G     CF F  S   G+   +IG++ Q+ + + FD  
Sbjct: 369 RGAE-MTVTGDRILYRVPGERRGNDSIHCFTFGNSDLLGVEAFVIGHLHQQNVWMEFDLK 427

Query: 485 NGFVGFGPNVC 495
              +G     C
Sbjct: 428 KSRIGLAEIRC 438


>gi|293334661|ref|NP_001168795.1| uncharacterized protein LOC100382594 precursor [Zea mays]
 gi|223973065|gb|ACN30720.1| unknown [Zea mays]
          Length = 631

 Score =  127 bits (318), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 111/405 (27%), Positives = 178/405 (43%), Gaps = 44/405 (10%)

Query: 111 RMQRDVKRVATLVRRLSGGGADA-AKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQ 169
           R   +  R+A  +RR  G G    A+  + D   D+++     +G Y  R+ +G+PP+  
Sbjct: 50  RSYPNASRLAASLRRGLGDGVHPNARMRLHD---DLLT-----NGYYTTRLYIGTPPQEF 101

Query: 170 YMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS-SAVCDRLENAGCHAGR 228
            +++DSGS + +V C  C QC    DP F P  S+S+S V C+    CD          +
Sbjct: 102 ALIVDSGSTVTYVPCSSCEQCGNHQDPRFQPDLSSSYSPVKCNVDCTCD------SDKKQ 155

Query: 229 CRYEVSYGDGSYTKGTLALETLTIGRT---VVKNVAIGCGHKNQGMFVG--AAGLLGLGG 283
           C YE  Y + S + G L  + ++ GR      ++   GC +   G      A G++GLG 
Sbjct: 156 CTYERQYAEMSSSSGVLGEDIVSFGRESELKPQHAIFGCENSETGDLFSQHADGIMGLGR 215

Query: 284 GSMSLVGQL--GGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFY 341
           G +S++ QL   G    +FS C      G  G++V G    P    +     +P    +Y
Sbjct: 216 GQLSIMDQLVEKGVISDSFSLCYGGMDIG-GGAMVLGGMLAPPDMIFS--NSDPLRSPYY 272

Query: 342 YVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGN 401
            + L  + V G  + +   +F        G V+D+GT    LP  A+ AF++A  ++  +
Sbjct: 273 NIELKEIHVAGKALRVESRIFN----SKHGTVLDSGTTYAYLPEQAFVAFKEAVTSKVHS 328

Query: 402 LPRASG--VSIFDTCY-----NLSGFVSVRVPTVSFYFSGGPVLTLPASNFLI---PVDD 451
           L +  G   S  D C+     N+S    V  P V   F  G  L+L   N+L     VD 
Sbjct: 329 LKKIRGPDPSYKDICFAGAGRNVSKLHEV-FPDVDMVFGNGQKLSLTPENYLFRHSKVD- 386

Query: 452 AGTFCF-AFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
            G +C   F       +++G I      +++D  N  +GF    C
Sbjct: 387 -GAYCLGVFQNGKDPTTLLGGIIVRNTLVTYDRHNEKIGFWKTNC 430


>gi|449442641|ref|XP_004139089.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 478

 Score =  126 bits (317), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 103/372 (27%), Positives = 172/372 (46%), Gaps = 39/372 (10%)

Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFS 207
           +G Y+ RIG+GSPP   ++ +D+GSDI+WV C  CS C K+SD      +++P  S++ +
Sbjct: 70  TGLYYARIGIGSPPNDFHVQVDTGSDILWVNCVGCSNCPKKSDIGVDLQLYNPKSSSTST 129

Query: 208 GVSCSSAVCDRLENA---GCHAG-RCRYEVSYGDGSYTKGTLALETLTIGRTVVKN---- 259
            ++C    C    +A   GC     C+Y+V YGDGS T G    + + + R V  +    
Sbjct: 130 LITCDQPFCSATYDAPIPGCKPDLLCQYKVIYGDGSATAGYFVNDYIQLQRAVGNHKTSE 189

Query: 260 ----VAIGCGHKNQGMFVGAA----GLLGLGGGSMSLVGQLG--GQTGGAFSYCLVSRGT 309
               +  GCG K  G    ++    G+LG G  + S++ QL   G+    F++CL S   
Sbjct: 190 TNGSIVFGCGAKQSGELGSSSEALDGILGFGQANSSMISQLAATGKVKKIFAHCLDSISG 249

Query: 310 GSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGD 369
           G  G    G    P      P+V N    + Y V L+G+ VG   + +   LF  +    
Sbjct: 250 G--GIFAIGEVVEP-KLKTTPVVPN---QAHYNVVLNGVKVGDTALDLPLGLFETSY--K 301

Query: 370 DGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVS 429
            G ++D+GT +  LP   Y    +  +    +L   +    F TC+     V    PTV+
Sbjct: 302 RGAIIDSGTTLAYLPDSIYLPLMEKILGAQPDLKLRTVDDQF-TCFVFDKNVDDGFPTVT 360

Query: 430 FYFSGGPVLTLPASNFLIPVDDAGTFCFAF----APSPSG--LSIIGNIQQEGIQISFDG 483
           F F    +LT+    +L  + D   +C  +    A S  G  ++++G++  +   + ++ 
Sbjct: 361 FKFEESLILTIYPHEYLFQIRD-DVWCVGWQNSGAQSKDGNEVTLLGDLVLQNKLVYYNL 419

Query: 484 ANGFVGFGPNVC 495
            N  +G+    C
Sbjct: 420 ENQTIGWTEYNC 431


>gi|224104765|ref|XP_002313558.1| predicted protein [Populus trichocarpa]
 gi|222849966|gb|EEE87513.1| predicted protein [Populus trichocarpa]
          Length = 468

 Score =  126 bits (316), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 117/400 (29%), Positives = 181/400 (45%), Gaps = 44/400 (11%)

Query: 132 DAAKH--EVQDFGTDVVSGMDQGS------GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQ 183
           D  +H   +Q  G  VV    QG+      G Y+ R+ +G+PPR  Y+ ID+GSD++WV 
Sbjct: 20  DRVRHGRMLQSSGVGVVDFPVQGTFDPFLVGLYYTRLQLGTPPRDFYVQIDTGSDVLWVS 79

Query: 184 CQPCSQCYKQSD---PV--FDPADSASFSGVSCSSAVCD---RLENAGCHA--GRCRYEV 233
           C  C+ C   S    P+  FDP  S + S +SCS   C    +  ++ C A    C Y  
Sbjct: 80  CGSCNGCPVNSGLHIPLNFFDPGSSPTASLISCSDQRCSLGLQSSDSVCSAQNNLCGYNF 139

Query: 234 SYGDGSYTKGTLALETL----TIGRTVVKN----VAIGCGHKNQGMFV----GAAGLLGL 281
            YGDGS T G    + L     +G +V+ N    +  GC     G          G+ G 
Sbjct: 140 QYGDGSGTSGYYVSDLLHFDTVLGGSVMNNSSAPIVFGCSALQTGDLTKSDRAVDGIFGF 199

Query: 282 GGGSMSLVGQLGGQ--TGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPS 339
           G   MS+V QL  Q  +  AFS+CL  +G  S G ++   E +     + PLV  P  P 
Sbjct: 200 GQQDMSVVSQLASQGISPRAFSHCL--KGDDSGGGILVLGEIVEPNIVYTPLV--PSQPH 255

Query: 340 FYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQT 399
            Y + +  + V G  + I   +F  +     G ++D+GT +  L   AY+ F  A  +  
Sbjct: 256 -YNLNMQSISVNGQTLAIDPSVFGTSS--SQGTIIDSGTTLAYLAEAAYDPFISAITSIV 312

Query: 400 GNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLI---PVDDAGTFC 456
               R   +S  + CY +S  ++   P VS  F+GG  + L   ++LI    +  A  +C
Sbjct: 313 SPSVRPY-LSKGNHCYLISSSINDIFPQVSLNFAGGASMILIPQDYLIQQSSIGGAALWC 371

Query: 457 FAFAP-SPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
             F      G++I+G++  +     +D AN  +G+    C
Sbjct: 372 IGFQKIQGQGITILGDLVLKDKIFVYDIANQRIGWANYDC 411


>gi|30692930|ref|NP_198475.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|66792626|gb|AAY56415.1| At5g36260 [Arabidopsis thaliana]
 gi|332006680|gb|AED94063.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 482

 Score =  126 bits (316), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 99/370 (26%), Positives = 168/370 (45%), Gaps = 40/370 (10%)

Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFSG 208
           G YF +I +GSPP+  Y+ +D+GSDI+WV C PC +C  ++D      ++D   S++   
Sbjct: 76  GLYFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCPKCPVKTDLGIPLSLYDSKTSSTSKN 135

Query: 209 VSCSSAVCD-RLENAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRT--------VVK 258
           V C    C   +++  C A + C Y V YGDGS + G    + +T+ +         + +
Sbjct: 136 VGCEDDFCSFIMQSETCGAKKPCSYHVVYGDGSTSDGDFIKDNITLEQVTGNLRTAPLAQ 195

Query: 259 NVAIGCGHKNQGMFVGAA-----GLLGLGGGSMSLVGQL--GGQTGGAFSYCLVSRGTGS 311
            V  GCG KNQ   +G       G++G G  + S++ QL  GG T   FS+CL +   G 
Sbjct: 196 EVVFGCG-KNQSGQLGQTDSAVDGIMGFGQSNTSIISQLAAGGSTKRIFSHCLDNMNGG- 253

Query: 312 SGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDG 371
            G    G    PV     P+V N      Y V L G+ V G  I +   L   +  GD G
Sbjct: 254 -GIFAVGEVESPV-VKTTPIVPNQV---HYNVILKGMDVDGDPIDLPPSL--ASTNGDGG 306

Query: 372 VVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFY 431
            ++D+GT +  LP   Y +  +   A+     +   V     C++ +       P V+ +
Sbjct: 307 TIIDSGTTLAYLPQNLYNSLIEKITAK--QQVKLHMVQETFACFSFTSNTDKAFPVVNLH 364

Query: 432 FSGGPVLTLPASNFLIPVDDAGTFCFAF------APSPSGLSIIGNIQQEGIQISFDGAN 485
           F     L++   ++L  + +   +CF +          + + ++G++      + +D  N
Sbjct: 365 FEDSLKLSVYPHDYLFSLRE-DMYCFGWQSGGMTTQDGADVILLGDLVLSNKLVVYDLEN 423

Query: 486 GFVGFGPNVC 495
             +G+  + C
Sbjct: 424 EVIGWADHNC 433


>gi|359476199|ref|XP_003631804.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 421

 Score =  125 bits (315), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 101/329 (30%), Positives = 153/329 (46%), Gaps = 41/329 (12%)

Query: 102 HRHQHSFHARMQRDVKRVATL---VRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFV 158
           H    S      RD  RV+ +     + + G      H    F  D         G + V
Sbjct: 80  HSQPPSPQEIFGRDESRVSFINSKCNQYTSGNLKNHAHNNNLFDED---------GNFLV 130

Query: 159 RIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDR 218
            +  G+PP++  +++D+GS I W QC+ C  C + S   F+ + S+++S  SC   +   
Sbjct: 131 DVAFGTPPQNFMLILDTGSSITWTQCKACVNCLQDSHRYFNWSASSTYSSGSC---IPGT 187

Query: 219 LENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQGMF-VGAA 276
           +EN         Y ++YGD S + G    +T+T+  + V +    GCG  N+G F  G  
Sbjct: 188 VEN--------NYNMTYGDDSTSVGNYGCDTMTLEPSDVFQKFQFGCGRNNKGDFGSGVD 239

Query: 277 GLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAA--WVPLVRN 334
           G+LGLG G +S V Q   +    FSYCL      S GSL+FG +A    ++  +  LV  
Sbjct: 240 GMLGLGQGQLSTVSQTASKFNKVFSYCLPEE--DSIGSLLFGEKATSQSSSLKFTSLVNG 297

Query: 335 P---RAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAF 391
           P   +   +Y+V LS + VG  R+ I   +F        G ++D+ T +TRLP  AY A 
Sbjct: 298 PGTLQESGYYFVNLSDISVGNERLNIPSSVF-----ASPGTIIDSRTVITRLPQRAYSAL 352

Query: 392 RDAFVAQTGNLPRASGV----SIFDTCYN 416
           + AF       P ++G      I DTCYN
Sbjct: 353 KAAFKKAMAKYPLSNGRRKKGDILDTCYN 381


>gi|125532793|gb|EAY79358.1| hypothetical protein OsI_34487 [Oryza sativa Indica Group]
          Length = 419

 Score =  125 bits (315), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 108/371 (29%), Positives = 159/371 (42%), Gaps = 48/371 (12%)

Query: 156 YFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC--SQCYKQSDPVFDPADSASFSGVSCSS 213
           Y     +G+PP++   ++D   ++VW QC  C  S C+KQ  PVFDP+ S ++    C S
Sbjct: 62  YVANFTIGTPPQAVSGIVDLSGELVWTQCAACRSSGCFKQELPVFDPSASNTYRAEQCGS 121

Query: 214 AVCDRLENAGCHA-GRCRYEVS--YGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQG 270
            +C  +    C   G C YE    +GD   T G  + + + IG    + +A GC   + G
Sbjct: 122 PLCKSIPTRNCSGDGECGYEAPSMFGD---TFGIASTDAIAIGNAEGR-LAFGCVVASDG 177

Query: 271 MFVGA----AGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGA 326
              GA    +G +GLG    SLVGQ       AFSYCL   G G   +L  G  A   GA
Sbjct: 178 SIDGAMDGPSGFVGLGRTPWSLVGQ---SNVTAFSYCLAPHGPGKKSALFLGASAKLAGA 234

Query: 327 AWVPLVRNPRAP---------------SFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDG 371
                  NP  P                +Y V L G+  G + +  +        +    
Sbjct: 235 G----KSNPPTPLLGQHASNTSDDGSDPYYTVQLEGIKAGDVAVAAASSGGGAITI---- 286

Query: 372 VVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFY 431
           + ++T   ++ LP  AY+A      A  G+   A+    FD C+  +      VP + F 
Sbjct: 287 LQLETFRPLSYLPDAAYQALEKVVTAALGSPSMANPPEPFDLCFQNAAVSG--VPDLVFT 344

Query: 432 FSGGPVLTLPASNFLIPVDDA-GTFCFAFAPSP------SGLSIIGNIQQEGIQISFDGA 484
           F GG  LT P S +L+   +  GT C +   S        G+SI+G++ QE +   FD  
Sbjct: 345 FQGGATLTAPPSKYLLGDGNGNGTVCLSILSSTRLDSADDGVSILGSLLQENVHFLFDLE 404

Query: 485 NGFVGFGPNVC 495
              + F P  C
Sbjct: 405 KETLSFEPADC 415


>gi|224140036|ref|XP_002323393.1| predicted protein [Populus trichocarpa]
 gi|222868023|gb|EEF05154.1| predicted protein [Populus trichocarpa]
          Length = 459

 Score =  125 bits (315), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 113/414 (27%), Positives = 164/414 (39%), Gaps = 52/414 (12%)

Query: 126 LSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQ 185
           LS   A   K    +F         +  G Y + +  G+PP++   V+D+GS +VW  C 
Sbjct: 53  LSLSRAHHIKSPKTNFSLIKTPLFPRSYGGYSISLNFGTPPQTTKFVMDTGSSLVWFPCT 112

Query: 186 P---CSQC-----YKQSDPVFDPADSASFSGVSCSSAVCD---------RLENAGCHAGR 228
               CS+C      K   P F P  S+S   + C +  C          + +     A  
Sbjct: 113 SRYLCSECNFPNIKKTGIPTFLPKLSSSSKLIGCKNPRCSMIFGPEIQSKCQECDSTAQN 172

Query: 229 CR-----YEVSYGDGSYTKGTLALETLTI-GRTVVKNVAIGCGHKNQGMFVGAAGLLGLG 282
           C      Y + YG GS T G L  ETL    +  + +  +GC   +        G+ G G
Sbjct: 173 CTQTCPPYVIQYGSGS-TAGLLLSETLDFPNKKTIPDFLVGCSIFS---IKQPEGIAGFG 228

Query: 283 GGSMSLVGQLGGQTGGAFSYCLVSRG---TGSSGSLVFGREA-----LPVGAAWVPLVRN 334
               SL  QLG +    FSYCLVS     T +S  LV    +        G +  P ++N
Sbjct: 229 RSPESLPSQLGLK---KFSYCLVSHAFDDTPTSSDLVLDTGSGSGVTKTAGLSHTPFLKN 285

Query: 335 PRAP--SFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFR 392
           P      +YYV L  + +G   + +          G+ G ++D+GT  T +  P YE   
Sbjct: 286 PTTAFRDYYYVLLRNIVIGDTHVKVPYKFLVPGTDGNGGTIVDSGTTFTFMENPVYELVA 345

Query: 393 DAFVAQTGNLPRASGVSIFD---TCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPV 449
             F  Q  +   A+ +        CYN+SG  S+ VP + F F GG  + LP SN+   +
Sbjct: 346 KEFEKQMAHYTVATEIQNLTGLRPCYNISGEKSLSVPDLIFQFKGGAKMALPLSNYF-SI 404

Query: 450 DDAGTFCFAFAP--------SPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
            D+G  C                   I+GN QQ    + FD  N   GF    C
Sbjct: 405 VDSGVICLTIVSDNVAGPGLGGGPAIILGNYQQRNFYVEFDLENEKFGFKQQSC 458


>gi|9759039|dbj|BAB09366.1| aspartyl protease-like [Arabidopsis thaliana]
          Length = 478

 Score =  125 bits (314), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 99/370 (26%), Positives = 168/370 (45%), Gaps = 40/370 (10%)

Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFSG 208
           G YF +I +GSPP+  Y+ +D+GSDI+WV C PC +C  ++D      ++D   S++   
Sbjct: 72  GLYFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCPKCPVKTDLGIPLSLYDSKTSSTSKN 131

Query: 209 VSCSSAVCD-RLENAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRT--------VVK 258
           V C    C   +++  C A + C Y V YGDGS + G    + +T+ +         + +
Sbjct: 132 VGCEDDFCSFIMQSETCGAKKPCSYHVVYGDGSTSDGDFIKDNITLEQVTGNLRTAPLAQ 191

Query: 259 NVAIGCGHKNQGMFVGAA-----GLLGLGGGSMSLVGQL--GGQTGGAFSYCLVSRGTGS 311
            V  GCG KNQ   +G       G++G G  + S++ QL  GG T   FS+CL +   G 
Sbjct: 192 EVVFGCG-KNQSGQLGQTDSAVDGIMGFGQSNTSIISQLAAGGSTKRIFSHCLDNMNGG- 249

Query: 312 SGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDG 371
            G    G    PV     P+V N      Y V L G+ V G  I +   L   +  GD G
Sbjct: 250 -GIFAVGEVESPV-VKTTPIVPNQV---HYNVILKGMDVDGDPIDLPPSL--ASTNGDGG 302

Query: 372 VVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFY 431
            ++D+GT +  LP   Y +  +   A+     +   V     C++ +       P V+ +
Sbjct: 303 TIIDSGTTLAYLPQNLYNSLIEKITAK--QQVKLHMVQETFACFSFTSNTDKAFPVVNLH 360

Query: 432 FSGGPVLTLPASNFLIPVDDAGTFCFAF------APSPSGLSIIGNIQQEGIQISFDGAN 485
           F     L++   ++L  + +   +CF +          + + ++G++      + +D  N
Sbjct: 361 FEDSLKLSVYPHDYLFSLRE-DMYCFGWQSGGMTTQDGADVILLGDLVLSNKLVVYDLEN 419

Query: 486 GFVGFGPNVC 495
             +G+  + C
Sbjct: 420 EVIGWADHNC 429


>gi|449493359|ref|XP_004159266.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
          Length = 511

 Score =  125 bits (314), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 111/396 (28%), Positives = 162/396 (40%), Gaps = 53/396 (13%)

Query: 146 VSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPAD--- 202
           VS   +  G Y V +  G+PP++   + D+GS +VW  C    +C + S P  DPA    
Sbjct: 122 VSLFPRSYGAYSVSLAFGTPPQNLSFIFDTGSSLVWFPCTAGYRCSRCSFPYVDPATISK 181

Query: 203 -----SASFSGVSCSSAVC---------DRLENAGCHAGRCR-----YEVSYGDGSYTKG 243
                S+S   V C +  C          R  N    + +C      Y + YG G+ T G
Sbjct: 182 FVPKLSSSVKVVGCRNPKCAWIFGPNLKSRCRNCNSKSRKCSDSCPGYGLQYGSGA-TAG 240

Query: 244 TLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYC 303
            L  ETL +    V +  +GC   +       AG+ G G G  SL  Q+  +    FS+C
Sbjct: 241 ILLSETLDLENKRVPDFLVGCSVMSVHQ---PAGIAGFGRGPESLPSQMRLKR---FSHC 294

Query: 304 LVSRG---TGSSGSLVF-----GREALPVGAAWVPLVRNPRAPS-----FYYVGLSGLGV 350
           LVSRG   +  S  LV        E+      + P   NP   +     +YY+ L  + +
Sbjct: 295 LVSRGFDDSPVSSPLVLDSGSESDESKTKSFIYAPFRENPSVSNAAFREYYYLSLRRILI 354

Query: 351 GGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGV-- 408
           GG  +            G+ G ++D+G+  T L  P +EA  D    Q    PRA  V  
Sbjct: 355 GGKPVKFPYKYLVPDSTGNGGAIIDSGSTFTFLDKPIFEAIADELEKQLVKYPRAKDVEA 414

Query: 409 -SIFDTCYNLSG-FVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGL 466
            S    C+N+     S   P V   F GG  L+L A N+L  V D G  C       + +
Sbjct: 415 QSGLRPCFNIPKEEESAEFPDVVLKFKGGGKLSLAAENYLAMVTDEGVVCLTMMTDEAVV 474

Query: 467 S-------IIGNIQQEGIQISFDGANGFVGFGPNVC 495
                   I+G  QQ+ + + +D A   +GF    C
Sbjct: 475 GGGGGPAIILGAFQQQNVLVEYDLAKQRIGFRKQKC 510


>gi|125532795|gb|EAY79360.1| hypothetical protein OsI_34488 [Oryza sativa Indica Group]
          Length = 342

 Score =  125 bits (314), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 107/357 (29%), Positives = 150/357 (42%), Gaps = 63/357 (17%)

Query: 156 YFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAV 215
           Y   + +G+PP+    +I    + VW QC PC +C+KQ  P+F+                
Sbjct: 28  YMANLTIGTPPQPASAIIHLAGEFVWTQCSPCRRCFKQDLPLFN---------------- 71

Query: 216 CDRLENAGCHAGRCRYEVS--YGDGSYTKGTLALETLTIGRTVVKNVAIGCG-HKNQGMF 272
                         RYEV   +GD S   GT   +T  IG T   ++A GC    N    
Sbjct: 72  --------------RYEVETMFGDTSGIGGT---DTFAIG-TATASLAFGCAMDSNIKQL 113

Query: 273 VGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRG-TGSSGSLVFGREALPVG---AAW 328
           +GA+G++GLG    SLVGQ+      AFSYCL   G  G   +L+ G  A   G   AA 
Sbjct: 114 LGASGVVGLGRTPWSLVGQMNAT---AFSYCLAPHGAAGKKSALLLGASAKLAGGKSAAT 170

Query: 329 VPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAY 388
            PLV      S Y + L G+  G + I    +           V++DT   V+ L   A+
Sbjct: 171 TPLVNTSDDSSDYMIHLEGIKFGDVIIEPPPN--------GSVVLVDTIFGVSFLVDAAF 222

Query: 389 EAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFV-----SVRVPTVSFYFSGGPVLTLPAS 443
            A + A     G  P A+    FD C+  +        S+ +P V   F G   LT+P S
Sbjct: 223 HAIKKAVTVAVGAAPMATPTKPFDLCFPKAAAAAGANSSLPLPDVVLTFQGAAALTVPPS 282

Query: 444 NFLIPVDDAGTFCFAFAPSP-----SGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
            ++    + GT C A   S      + LSI+G + QE I   FD     + F P  C
Sbjct: 283 KYMYDAGN-GTVCLAMMSSAMLNLTTELSILGRLHQENIHFLFDLDKETLSFEPADC 338


>gi|414869114|tpg|DAA47671.1| TPA: hypothetical protein ZEAMMB73_872184 [Zea mays]
          Length = 492

 Score =  125 bits (314), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 105/362 (29%), Positives = 161/362 (44%), Gaps = 28/362 (7%)

Query: 150 DQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGV 209
           D G+ +Y V +G G+P +   M +D+   +  V C+PC+      DP FD + S +F+ V
Sbjct: 143 DAGALDYTVNVGYGTPEQQFPMFLDTIFGVSLVLCKPCAPGSTSCDPAFDTSQSTTFTHV 202

Query: 210 SCSSAVCDRLENAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRTV-VKNVAIGCGHK 267
            C S  C    N  C AG  C + + + +G++++     + LT+  +V V++    C   
Sbjct: 203 PCDSPDCPSTAN--CSAGSVCPFNLFFVEGTFSQ-----DVLTVAPSVAVQDFTFVCLDA 255

Query: 268 NQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVG-- 325
                +   G L L     SL  +L G    AFSYC+  +   S G L  G +A   G  
Sbjct: 256 GASDGMPEVGTLDLSRDRNSLPSRLAGSASAAFSYCM-PQYPDSPGFLSLGDDATVRGDN 314

Query: 326 -AAWVPLVR--NPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTR 382
             A  PL+   +P   + Y++ + G+ +G + +PI    F      +   +++ GT  T 
Sbjct: 315 CTAHAPLLSSDDPDLANMYFIDVVGMSLGDVDLPIPSGTFG----NNASTIVEAGTTFTM 370

Query: 383 LPTPAYEAFRDAFVAQTGNLPRA-SGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLP 441
           L   AY   RDAF        R+  G   FDTCYN +G   + VP V F F  G  L + 
Sbjct: 371 LAPDAYTPLRDAFRQAMAQYNRSVPGFYDFDTCYNFTGLQELTVPLVEFKFGNGDSLLID 430

Query: 442 ASNFL---IPVDDAGTF-CFAFA----PSPSGLSIIGNIQQEGIQISFDGANGFVGFGPN 493
               L   IP +   T  C AF+          ++IG       ++ +D A G VGF P 
Sbjct: 431 GDQMLYYDIPSEGPFTVTCLAFSTLDVDDDDVSAVIGAYSLATTEVVYDVAGGTVGFIPE 490

Query: 494 VC 495
            C
Sbjct: 491 SC 492


>gi|255565531|ref|XP_002523756.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
 gi|223537060|gb|EEF38696.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
          Length = 507

 Score =  125 bits (314), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 111/370 (30%), Positives = 173/370 (46%), Gaps = 41/370 (11%)

Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD---PV--FDPADSASFSG 208
           G YF R+ +GSPP+  Y+ ID+GSD++WV C  C+ C   S    P+  FDP  S + + 
Sbjct: 82  GLYFTRVQLGSPPKDFYVQIDTGSDVLWVSCSSCNGCPVTSGLQIPLTFFDPGSSTTAAL 141

Query: 209 VSCSSAVCD---RLENAGC--HAGRCRYEVSYGDGSYTKG-----TLALETL-------- 250
           VSCS   C    +  ++ C     +C Y   YGDGS T G      + L+TL        
Sbjct: 142 VSCSDQRCTAGIQSSDSLCSSRTNQCGYTFQYGDGSGTSGYYVADLMHLDTLLLSSGELS 201

Query: 251 TIGRTVVKNVAIGCGHKNQGMFV----GAAGLLGLGGGSMSLVGQLGGQ--TGGAFSYCL 304
            I +T   +V+  C     G          G+ G G   MS++ QL  Q  T   FS+CL
Sbjct: 202 QICQTYDSSVSFMCSTLQTGDLTKSDRAVDGIFGFGQQEMSVISQLASQGITPRVFSHCL 261

Query: 305 VSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRL 364
             +G  S G ++   E +     + PLV  P  P  Y + L  + V G  + I   +F  
Sbjct: 262 --KGDDSGGGVLVLGEIVEPNIVYTPLV--PSQPH-YNLYLQSISVAGQTLAIDPSVFGA 316

Query: 365 TQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVR 424
           +   + G ++D+GT +  L   AY+ F  A +    +L   + +S  + CY ++  V+  
Sbjct: 317 SS--NQGTIVDSGTTLAYLAEGAYDPFVSA-ITSVVSLNARTYLSKGNQCYLVTSSVNDV 373

Query: 425 VPTVSFYFSGGPVLTLPASNFLI---PVDDAGTFCFAFAPSP-SGLSIIGNIQQEGIQIS 480
            P VS  F+GG  L L   ++L+    V  A  +C  F  +P   ++I+G++  +     
Sbjct: 374 FPQVSLNFAGGASLILNPQDYLLQQNSVGGAAVWCVGFQKTPGQQITILGDLVLKDKIFV 433

Query: 481 FDGANGFVGF 490
           +D AN  VG+
Sbjct: 434 YDIANQRVGW 443


>gi|224091907|ref|XP_002309394.1| predicted protein [Populus trichocarpa]
 gi|222855370|gb|EEE92917.1| predicted protein [Populus trichocarpa]
          Length = 469

 Score =  125 bits (313), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 112/386 (29%), Positives = 162/386 (41%), Gaps = 52/386 (13%)

Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQP---CSQC-YKQSD----PVFDPADSAS 205
           G Y + +  G+PP++   V+D+GS +VW  C     CS+C +   +    P F P  S+S
Sbjct: 90  GGYSISLNFGTPPQTTKFVMDTGSSLVWFPCTSRYLCSRCDFPNIEVTGIPTFIPKQSSS 149

Query: 206 FSGVSCSSAVCDRL--------------ENAGCHAGRCRYEVSYGDGSYTKGTLALETLT 251
            + + C +  C  L                  C      Y + YG GS T G L  ETL 
Sbjct: 150 SNLIGCKNHKCSWLFGPKVQSKCQECDPTTQNCTQSCPPYVIQYGLGS-TAGLLLSETLD 208

Query: 252 IG-RTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRG-- 308
              +  +    +GC   +        G+ G G    SL  QLG +    FSYCLVS    
Sbjct: 209 FPHKKTIPGFLVGCSLFS---IRQPEGIAGFGRSPESLPSQLGLK---KFSYCLVSHAFD 262

Query: 309 -TGSSGSLVF-----GREALPVGAAWVPLVRNPRAP--SFYYVGLSGLGVGGMRIPISED 360
            T +S  LV        +    G ++ P  +NP A    +YYV L  + +G   + +   
Sbjct: 263 DTPASSDLVLDTGSGSDDTKTPGLSYTPFQKNPTAAFRDYYYVLLRNIVIGDTHVKVPYK 322

Query: 361 LFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGV---SIFDTCYNL 417
                  G+ G ++D+GT  T +  P YE     F  Q  +   A+ V   +    C+N+
Sbjct: 323 FLVPGSDGNGGTIVDSGTTFTFMEKPVYELVAKEFEKQVAHYTVATEVQNQTGLRPCFNI 382

Query: 418 SGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAP---SPSGLS-----II 469
           SG  SV VP   F+F GG  + LP +N+   V D+G  C        S SG+      I+
Sbjct: 383 SGEKSVSVPEFIFHFKGGAKMALPLANYFSFV-DSGVICLTIVSDNMSGSGIGGGPAIIL 441

Query: 470 GNIQQEGIQISFDGANGFVGFGPNVC 495
           GN QQ    + FD  N   GF    C
Sbjct: 442 GNYQQRNFHVEFDLKNERFGFKQQNC 467


>gi|255566006|ref|XP_002523991.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223536718|gb|EEF38359.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 455

 Score =  125 bits (313), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 113/450 (25%), Positives = 198/450 (44%), Gaps = 58/450 (12%)

Query: 74  SDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSG-GGAD 132
           +D+  +  EL+H D       + N+  ++  + + H R+ + ++R A  V RL+    +D
Sbjct: 33  ADKFSFTAELIHID-------SPNSPFFNASETTTH-RLAKALQRSANRVARLNPLSNSD 84

Query: 133 AAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYK 192
              H          + +  G G Y +++ +G+PP   +  ID+GS+++W+ C  C  C+ 
Sbjct: 85  EGVH----------ASIFSGDGNYLMKLLIGTPPTEIHAAIDTGSNVIWIPCINCKDCFN 134

Query: 193 QSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDG-SYTKGTLALETLT 251
           QS  +F+P  S+++    C S  C+   ++      C Y        +   G +A++T+T
Sbjct: 135 QSSSIFNPLASSTYQDAPCDSYQCETTSSSCQSDNVCLYSCDEKHQLNCPNGRIAVDTMT 194

Query: 252 I----GRTV-VKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS 306
           +    GR   +      CG+     F G  G++GLG G++SL  +L   + G FSYCL  
Sbjct: 195 LTSSDGRPFPLPYSDFVCGNSIYKTFAG-VGVIGLGRGALSLTSKLYHLSDGKFSYCLAD 253

Query: 307 RGTGSSGSLVFGREALPVGAAWVPLVR----NPRAPSFYYVGLSGLGVGGMRIPISEDLF 362
             +     + FG ++  +    + +V     + R    YYV L G+ VG  R    +DL+
Sbjct: 254 YYSKQPSKINFGLQSF-ISDDDLEVVSTTLGHHRHSGNYYVTLEGISVGEKR----QDLY 308

Query: 363 RLTQMGDD-------GVVMDTGTAVTRLPTPAYEAFRD----AFVAQTGNLPRASGVSI- 410
            +    DD        +++D+GT  T LP   Y+        A      N P  S     
Sbjct: 309 YV----DDPFAPPVGNMLIDSGTMFTLLPKDFYDYLWSTVSYAIPENPQNHPHNSRFPFS 364

Query: 411 FDTCYNLSG----FVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGL 466
            D    LS     +  ++ P ++ +F+   V     ++F+   +D    CFAFA +  G 
Sbjct: 365 MDNTLKLSPCFWYYPELKFPKITIHFTDADVELSDDNSFIRVAEDV--VCFAFAATQPGQ 422

Query: 467 SII-GNIQQEGIQISFDGANGFVGFGPNVC 495
           S + G+ QQ    + +D   G V F    C
Sbjct: 423 STVYGSWQQMNFILGYDLKRGTVSFKRTDC 452


>gi|224140237|ref|XP_002323490.1| predicted protein [Populus trichocarpa]
 gi|222868120|gb|EEF05251.1| predicted protein [Populus trichocarpa]
          Length = 478

 Score =  125 bits (313), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 117/413 (28%), Positives = 177/413 (42%), Gaps = 52/413 (12%)

Query: 114 RDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVI 173
           RD  R A L++   GG  D +     D             G YF ++ +GSPPR   + I
Sbjct: 33  RDRLRHARLLQGFVGGVVDFSVQGSPD---------PYLVGLYFTKVKLGSPPREFNVQI 83

Query: 174 DSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFSGVSCSSAVCDR-----LENAG 223
           D+GSD++WV C  C+ C + S        FD + S++   V CS  +C       +    
Sbjct: 84  DTGSDVLWVCCNSCNNCPRTSGLGIQLNFFDSSSSSTAGLVHCSDPICTSAVQTTVTQCS 143

Query: 224 CHAGRCRYEVSYGDGSYTKGTLALETL----TIGRTVVKN----VAIGCGHKNQGMFV-- 273
               +C Y   Y DGS T G    +TL     +G ++V N    +  GC     G     
Sbjct: 144 PQTNQCSYTFQYEDGSGTSGYYVSDTLYFDAILGESLVVNSSALIVFGCSTFQSGDLTMT 203

Query: 274 --GAAGLLGLGGGSMSLVGQLG--GQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWV 329
                G+ G G G +S++ QL   G T   FS+CL  +G G  G ++   E L  G  + 
Sbjct: 204 DKAVDGIFGFGQGELSVISQLSTHGITPRVFSHCL--KGEGIGGGILVLGEILEPGMVYS 261

Query: 330 PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYE 389
           PLV  P  P  Y + L  + V G  +PI   +F  +     G ++D+GT +  L   AY 
Sbjct: 262 PLV--PSQPH-YNLNLQSIAVNGKLLPIDPSVFATSN--SQGTIVDSGTTLAYLVAEAY- 315

Query: 390 AFRDAFVAQTGNLPRASGVSIF---DTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFL 446
              D FV+    +   S   I    + CY +S  VS   P  SF F+GG  + L   ++L
Sbjct: 316 ---DPFVSAVNVIVSPSVTPIISKGNQCYLVSTSVSQMFPLASFNFAGGASMVLKPEDYL 372

Query: 447 IPVDDAG----TFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           IP   +      +C  F     G++I+G++  +     +D     +G+    C
Sbjct: 373 IPFGPSQGGSVMWCIGFQ-KVQGVTILGDLVLKDKIFVYDLVRQRIGWANYDC 424


>gi|356568907|ref|XP_003552649.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 490

 Score =  125 bits (313), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 101/379 (26%), Positives = 166/379 (43%), Gaps = 41/379 (10%)

Query: 147 SGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPA 201
           SG     G Y+ +IG+G+PP++ Y+ +D+GSDI+WV C  C +C  +S+      ++D  
Sbjct: 76  SGRPDAVGLYYAKIGIGTPPKNYYLQVDTGSDIMWVNCIQCKECPTRSNLGMDLTLYDIK 135

Query: 202 DSASFSGVSCSSAVCDRLEN---AGCHAG-RCRYEVSYGDGSYTKGTLALETLTIGR--- 254
           +S+S   V C    C  +      GC A   C Y   YGDGS T G    + +   +   
Sbjct: 136 ESSSGKFVPCDQEFCKEINGGLLTGCTANISCPYLEIYGDGSSTAGYFVKDIVLYDQVSG 195

Query: 255 -----TVVKNVAIGCGHKNQGMFVGA-----AGLLGLGGGSMSLVGQLG--GQTGGAFSY 302
                +   ++  GCG +  G    +      G+LG G  + S++ QL   G+    F++
Sbjct: 196 DLKTDSANGSIVFGCGARQSGDLSSSNEEALGGILGFGKANSSMISQLASSGKVKKMFAH 255

Query: 303 CLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLF 362
           CL   G    G    G    P      PL+  P  P  Y V ++ + VG   + +S D  
Sbjct: 256 CL--NGVNGGGIFAIGHVVQP-KVNMTPLL--PDQPH-YSVNMTAVQVGHAFLSLSTD-- 307

Query: 363 RLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVS 422
             TQ    G ++D+GT +  LP   YE      ++Q  +L +   +    TC+  S  V 
Sbjct: 308 TSTQGDRKGTIIDSGTTLAYLPEGIYEPLVYKIISQHPDL-KVRTLHDEYTCFQYSESVD 366

Query: 423 VRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPS------PSGLSIIGNIQQEG 476
              P V+FYF  G  L +   ++L P  D   +C  +  S         ++++G++    
Sbjct: 367 DGFPAVTFYFENGLSLKVYPHDYLFPSGDF--WCIGWQNSGTQSRDSKNMTLLGDLVLSN 424

Query: 477 IQISFDGANGFVGFGPNVC 495
             + +D  N  +G+    C
Sbjct: 425 KLVFYDLENQVIGWTEYNC 443


>gi|388508518|gb|AFK42325.1| unknown [Lotus japonicus]
          Length = 204

 Score =  125 bits (313), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 69/200 (34%), Positives = 105/200 (52%), Gaps = 4/200 (2%)

Query: 298 GAFSYCLVSRGTGSSGSLVFGREALPVGAAW-VPLVRNPRAPSFYYVGLSGLGVGGMRIP 356
             FSYCL S     +  L+ G  A     A   PL+ NP  PSFYY+ L G+ VGG ++ 
Sbjct: 4   AKFSYCLTSMDDSKASVLLLGSLAKATKDAISTPLLTNPSQPSFYYLSLEGIPVGGTQLS 63

Query: 357 ISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYN 416
           I + +F ++  G  GV++D+GT +T L    ++  +  F++Q+      S  +  D C++
Sbjct: 64  IEQSIFDVSDDGSGGVIIDSGTTITYLEKSVFDTLKKEFISQSNLQLDKSSSTGLDVCFS 123

Query: 417 L-SGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQE 475
           L S    V VP + F+F GG  L LPA +++I     G  C A   S +G+SI GN+QQ+
Sbjct: 124 LPSETTQVEVPKLVFHFKGGD-LELPAESYMIADSKLGVACLAMGAS-NGMSIFGNVQQQ 181

Query: 476 GIQISFDGANGFVGFGPNVC 495
            I ++ D     + F P  C
Sbjct: 182 NILVNHDLEKETISFVPTQC 201


>gi|77808087|gb|AAS48510.2| aspartic protease [Fagopyrum esculentum]
 gi|82780908|gb|ABB88696.2| aspartic proteinase-like protein [Fagopyrum esculentum]
          Length = 447

 Score =  124 bits (312), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 110/379 (29%), Positives = 166/379 (43%), Gaps = 45/379 (11%)

Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQ------PCSQCY-----KQSDPVFDPAD 202
           G Y V   +G+PP+   +V+D+GS +VW  C        C  C          P++    
Sbjct: 72  GGYSVIFSLGTPPQKVSLVLDTGSSLVWTPCTIPTATYTCQNCTFSGVDPTKIPIYARNK 131

Query: 203 SASFSGVSCSSAVCDRL--ENAGCH-AGRCRYE-VSYGDGSYTKGTLALETLTIGR-TVV 257
           S++   + C S  C+ +   +  C    RC Y  + YG GS T G L  + L + +   +
Sbjct: 132 SSTVQSLPCRSPKCNWVFGSDLNCSTTKRCPYYGLEYGLGS-TTGQLVSDVLGLSKLNRI 190

Query: 258 KNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR---GTGSSGS 314
            +   GC   +        G+ G G G  S+  QLG      FSYCLVS     T  SG 
Sbjct: 191 PDFLFGCSLVSNRQ---PEGIAGFGRGLASIPAQLGLT---KFSYCLVSHRFDDTPQSGD 244

Query: 315 LVFGR-----EALPVGAAWVPLVRNPR-AP--SFYYVGLSGLGVGGMRIPISEDLFRLTQ 366
           LV  R     +A   G A+ P  ++P  +P   +YY+ LS + VGG  +PI       ++
Sbjct: 245 LVLHRGRRHADAAANGVAYAPFTKSPALSPYSEYYYISLSKILVGGKDVPIPPRYLVPSK 304

Query: 367 MGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGV---SIFDTCYNLSGFVSV 423
            GD G+++D+G+  T +    ++              RA  +   S    CYN++G   V
Sbjct: 305 EGDGGMIVDSGSTFTFMERIIFDPVARELEKHMTKYKRAKEIEDSSGLGPCYNITGQSEV 364

Query: 424 RVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSP------SGLSII-GNIQQEG 476
            VP ++F F GG  + LP +++   V D G  C      P      +G +II GN QQ+ 
Sbjct: 365 DVPKLTFSFKGGANMDLPLTDYFSLVTD-GVVCMTVLTDPDEPGSTTGPAIILGNYQQQN 423

Query: 477 IQISFDGANGFVGFGPNVC 495
             I +D      GF P  C
Sbjct: 424 FYIEYDLKKQRFGFKPQQC 442


>gi|168060150|ref|XP_001782061.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162666472|gb|EDQ53125.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 423

 Score =  124 bits (312), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 115/414 (27%), Positives = 179/414 (43%), Gaps = 59/414 (14%)

Query: 112 MQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYM 171
           ++RD+ R+         G +    H V+      V G     G Y++ + +GSPP+  ++
Sbjct: 9   LERDLSRL---------GKSSVGNHSVRFH----VGGNIYPDGLYYMALLLGSPPKLYFL 55

Query: 172 VIDSGSDIVWVQCQ-PCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCH----- 225
            +D+GSD+ W QC  PC  C      +++P  +     V C   VC +++  G +     
Sbjct: 56  DMDTGSDLTWAQCDAPCRNCAIGPHGLYNPKKAKV---VDCHLPVCAQIQQGGSYECNSD 112

Query: 226 AGRCRYEVSYGDGSYTKGTLALETLTI----GRTVVKNVAIGCGHKNQGMFVGAA----G 277
             +C YEV Y DGS T G L  +TLT+    G  +     IGCG+  QG    +     G
Sbjct: 113 VKQCDYEVEYADGSSTMGVLVEDTLTVRLTNGTLIQTKAIIGCGYDQQGTLAKSPASTDG 172

Query: 278 LLGLGGGSMSLVGQLG--GQTGGAFSYCLVSRGTGSSGSLVFGREALP-VGAAWVPLVRN 334
           ++GL    ++L  QL   G       +CL + G+   G L FG E +P  G  W P++  
Sbjct: 173 VIGLSSSKVALPAQLAEKGIIKNVLGHCL-ADGSNGGGYLFFGDELVPSWGMTWTPMMGK 231

Query: 335 PRAPSFYYVGLSGLGVGGMRIPIS--EDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFR 392
           P     Y   L  +  GG  + ++  EDL R T      V+ D+GT+ T L   AY +  
Sbjct: 232 PEMLG-YQARLQSIRYGGDSLVLNNDEDLTRSTS----SVMFDSGTSFTYLVPQAYASVL 286

Query: 393 DAFVAQTGNLPRASGVSIFDTCYN-LSGFVSV-------RVPTVSF----YFSGGPVLTL 440
            A   Q+G L R    +    C+   S F S+       +  T+ F    +F+    L L
Sbjct: 287 SAVTKQSG-LLRVKSDTTLPYCWRGPSPFQSITDVHQYFKTLTLDFGGRNWFATDSTLDL 345

Query: 441 PASNFLIPVDDAGTFCF----AFAPSPSGLSIIGNIQQEGIQISFDGANGFVGF 490
               +LI V   G  C     A   S    +IIG++   G  + +D     +G+
Sbjct: 346 SPQGYLI-VSTQGNVCLGILDASGASLEVTNIIGDVSMRGYLVVYDNVRDRIGW 398


>gi|224058947|ref|XP_002299658.1| predicted protein [Populus trichocarpa]
 gi|222846916|gb|EEE84463.1| predicted protein [Populus trichocarpa]
          Length = 451

 Score =  124 bits (312), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 122/433 (28%), Positives = 193/433 (44%), Gaps = 46/433 (10%)

Query: 98  NMHYHR-----HQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQG 152
            +H  R     H+       +RD  R + +++   GG  D     VQ      + G   G
Sbjct: 28  TLHLERGVPASHKLKLSQLKERDRVRHSRMLQSSGGGVVD---FPVQGTFDPFLVGFYFG 84

Query: 153 S--GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD---PV--FDPADSAS 205
           S    Y+ R+ +GSPPR  Y+ ID+GSD++WV C  C+ C   S    P+  FDP  S +
Sbjct: 85  SFCRLYYTRLQLGSPPRDFYVQIDTGSDVLWVSCSSCNGCPVSSGLHIPLNFFDPGSSPT 144

Query: 206 FSGVSCSSAVCD---RLENAGCHA--GRCRYEVSYGDGSYTKGTLALETL----TIGRTV 256
            S +SCS   C    +  ++ C A   +C Y   YGDGS T G    + L     +G +V
Sbjct: 145 ASLISCSDQRCSLGLQSSDSVCAAQNNQCGYTFQYGDGSGTSGYYVSDLLHFDTILGGSV 204

Query: 257 VKN----VAIGCGHKNQGMFV----GAAGLLGLGGGSMSLVGQLGGQ--TGGAFSYCLVS 306
           +KN    +  GC     G          G+ G G   MS++ QL  Q  T   FS+CL  
Sbjct: 205 MKNSSAPIVFGCSTLQTGDLTKPDRAVDGIFGFGQQDMSVISQLASQGITPRVFSHCL-- 262

Query: 307 RGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQ 366
           +G  S G ++   E +     + PLV  P  P  Y + L  + V G  + I   +F  + 
Sbjct: 263 KGDDSGGGILVLGEIVEPNIVYTPLV--PSQPH-YNLNLQSIYVNGQTLAIDPSVFATSS 319

Query: 367 MGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVP 426
             + G ++D+GT +  L   AY+ F  A +  T +   +  +S  + CY  S  ++   P
Sbjct: 320 --NQGTIIDSGTTLAYLTEAAYDPFISA-ITSTVSPSVSPYLSKGNQCYLTSSSINDVFP 376

Query: 427 TVSFYFSGGPVLTLPASNFLI---PVDDAGTFCFAFAP-SPSGLSIIGNIQQEGIQISFD 482
            VS  F+GG  + L   ++LI    ++ A  +C  F       ++I+G++  +     +D
Sbjct: 377 QVSLNFAGGTSMILIPQDYLIQQSSINGAALWCVGFQKIQGQEITILGDLVLKDKIFVYD 436

Query: 483 GANGFVGFGPNVC 495
            A   +G+    C
Sbjct: 437 IAGQRIGWANYDC 449


>gi|359476754|ref|XP_002277058.2| PREDICTED: aspartic proteinase-like protein 2-like isoform 1 [Vitis
           vinifera]
          Length = 561

 Score =  124 bits (312), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 99/328 (30%), Positives = 152/328 (46%), Gaps = 33/328 (10%)

Query: 147 SGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPA 201
           +G    +G YF +IG+G+P +  Y+ +D+GSDI+WV C  C +C  +SD      ++D  
Sbjct: 146 NGHPSEAGLYFAKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRCPTKSDLGVDLTLYDMK 205

Query: 202 DSASFSGVSCSSAVCDRLEN--AGCHAG-RCRYEVSYGDGSYTKGTLALETLTIGR---- 254
            S +   V C    C   +    GC  G +C Y V YGDGS T G    + +   R    
Sbjct: 206 ASTTSDAVGCDDNFCSLYDGPLPGCKPGLQCLYSVLYGDGSSTTGYFVQDFVQYNRISGN 265

Query: 255 --TVVKN--VAIGCGHKNQGMFVGAA----GLLGLGGGSMSLVGQLG--GQTGGAFSYCL 304
             T   N  V  GCG+K  G    ++    G+LG G  + S++ QL   G+    FS+CL
Sbjct: 266 FQTTPTNGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCL 325

Query: 305 VSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRL 364
            +   G  G    G E +       PLV+N    + Y V +  + VGG  + +  D F  
Sbjct: 326 DNVDGG--GIFAIG-EVVEPKVNITPLVQN---QAHYNVVMKEIEVGGDPLDVPSDAF-- 377

Query: 365 TQMGD-DGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSV 423
            + GD  G ++D+GT +   P   Y    +  ++Q  +L R   V    TC++ +G V  
Sbjct: 378 -ESGDRKGTIIDSGTTLAYFPQEVYVPLIEKILSQQPDL-RLHTVEQAFTCFDYTGNVDD 435

Query: 424 RVPTVSFYFSGGPVLTLPASNFLIPVDD 451
             PTV+ +F     LT+    +L  V +
Sbjct: 436 GFPTVTLHFDKSISLTVYPHEYLFQVKE 463


>gi|224090744|ref|XP_002309070.1| predicted protein [Populus trichocarpa]
 gi|222855046|gb|EEE92593.1| predicted protein [Populus trichocarpa]
          Length = 404

 Score =  124 bits (311), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 113/372 (30%), Positives = 166/372 (44%), Gaps = 43/372 (11%)

Query: 157 FVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVC 216
            V + VG+PP++  MVID+GS++ W+ C   +  Y  +   FDP  S S+  + CSS  C
Sbjct: 32  IVSLTVGTPPQNVSMVIDTGSELSWLHCNK-TLSYPTT---FDPTRSTSYQTIPCSSPTC 87

Query: 217 -DRLEN----AGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHK--- 267
            +R ++    A C +   C   +SY D S + G LA +   IG + +  +  GC      
Sbjct: 88  TNRTQDFPIPASCDSNNLCHATLSYADASSSDGNLASDVFHIGSSDISGLVFGCMDSVFS 147

Query: 268 -NQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALP--V 324
            N      + GL+G+  GS+S V QLG      FSYC+   GT  SG L+ G   L   V
Sbjct: 148 SNSDEDSKSTGLMGMNRGSLSFVSQLGFP---KFSYCI--SGTDFSGLLLLGESNLTWSV 202

Query: 325 GAAWVPLVR-NPRAPSF----YYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTA 379
              + PL++ +   P F    Y V L G+ V    +PI +  F     G    ++D+GT 
Sbjct: 203 PLNYTPLIQISTPLPYFDRVAYTVQLEGIKVLDKLLPIPKSTFEPDHTGAGQTMVDSGTQ 262

Query: 380 VTRLPTPAYEAFRDAFVAQTGNLPRASGVSIF------DTCY--NLSGFVSVRVPTVSFY 431
            T L  P Y A R AF+ QT ++ R      F      D CY   LS  V   +PTV+  
Sbjct: 263 FTFLLGPVYNALRSAFLNQTSSVLRVLEDPDFVFQGAMDLCYLVPLSQRVLPLLPTVTLV 322

Query: 432 FSGGPVLTLPASNFLIPVD-----DAGTFCFAFAPSP---SGLSIIGNIQQEGIQISFDG 483
           F G   +T+     L  V      +    C +F  S        +IG+  Q+ + + FD 
Sbjct: 323 FRGAE-MTVSGDRVLYRVPGELRGNDSVHCLSFGNSDLLGVEAYVIGHHHQQNVWMEFDL 381

Query: 484 ANGFVGFGPNVC 495
               +G     C
Sbjct: 382 EKSRIGLAQVRC 393


>gi|242095592|ref|XP_002438286.1| hypothetical protein SORBIDRAFT_10g011130 [Sorghum bicolor]
 gi|241916509|gb|EER89653.1| hypothetical protein SORBIDRAFT_10g011130 [Sorghum bicolor]
          Length = 495

 Score =  124 bits (311), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 113/381 (29%), Positives = 174/381 (45%), Gaps = 44/381 (11%)

Query: 144 DVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC------SQCYKQSDPV 197
           +++S +  G  EY V  G G+P +   +  D  S +  ++C+PC       +     D  
Sbjct: 127 NIISSL-PGVFEYTVLAGYGTPAQQLPLFFDV-SGMSNMRCKPCFSGSSGGETTTTCDVA 184

Query: 198 FDPADSASFSGVSCSSAVCDRLENAGCHAG-RCRYEVSYGDGSYTKGTLALETLTIGRTV 256
           FDP+ S+SF  V C S  C       C AG  C + +      +  GT+ ++TLT+  + 
Sbjct: 185 FDPSMSSSFRSVLCGSPDCG---GHSCSAGGSCTFTLQNSTFVFGNGTIVMDTLTLSPSA 241

Query: 257 V-KNVAIGCGHKNQGMFVG--AAGLLGLGGGSMSLVGQLGGQTG---GAFSYCLVSRGTG 310
             +N A+GC   +  +F    A G + L     SL  ++   +     AFSYCL +  T 
Sbjct: 242 TFENFAVGCMQLDNDLFTDGVAVGNIDLSLSRHSLATRVLNSSPPGMAAFSYCLPAD-TD 300

Query: 311 SSGSLVFGREALP-----VGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLT 365
           + G L     AL       G  +VPLV NP  P+FYYV L  + + G  +PI   LF   
Sbjct: 301 THGFLTIA-PALSDYSDHAGVKYVPLVTNPTGPNFYYVDLVAIAINGEDLPIPPALFT-- 357

Query: 366 QMGDDGVVMDTGTAVTRLPTPAYEAFRDAF---VAQTGNLPRASGVSIFDTCYNLSGFVS 422
               +G ++D+ +A T L  P Y A RD F   + Q   +P   G+   DTCYN +   +
Sbjct: 358 ---GNGTMIDSQSAFTYLNPPIYAALRDEFRKAMLQYQPVPAFGGL---DTCYNFTLAEN 411

Query: 423 VRVPTVSFYFSGGPVLTLPASNFLI----PVDDAGTF-CFAFAPSPS---GLSIIGNIQQ 474
           + +P ++  FS G  + L    F+      + D   F C AFA +P      + +G+  Q
Sbjct: 412 IYLPDITLRFSNGETMDLDDRQFMYFFREHLTDGFPFGCLAFAAAPDQNFPWNYLGSQVQ 471

Query: 475 EGIQISFDGANGFVGFGPNVC 495
              +I +D   G V F P+ C
Sbjct: 472 RTKEIVYDVRGGMVAFVPSRC 492


>gi|449440161|ref|XP_004137853.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
 gi|449521209|ref|XP_004167622.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 492

 Score =  124 bits (311), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 111/369 (30%), Positives = 167/369 (45%), Gaps = 35/369 (9%)

Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFSG 208
           G YF ++ +G+PP    + ID+GSDI+WV C  C+ C + S        FD + S+S S 
Sbjct: 77  GLYFTKVKLGTPPMEFTVQIDTGSDILWVNCNSCNGCPRSSGLGIQLNFFDASSSSSSSL 136

Query: 209 VS-----CSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALET----LTIGRTVVKN 259
           VS     C+SA           + +C Y   YGDGS T G    E+    + +G++++ N
Sbjct: 137 VSCSDPICNSAFQTTATQCLTQSNQCSYTFQYGDGSGTSGYYVSESMYFDMVMGQSMIAN 196

Query: 260 ----VAIGCGHKNQGMFVGA----AGLLGLGGGSMSLVGQLG--GQTGGAFSYCLVSRGT 309
               V  GC     G    +     G+ G G G +S++ QL   G T   FS+CL   G 
Sbjct: 197 SSASVVFGCSTYQSGDLTKSDHAIDGIFGFGPGDLSVISQLSARGITPKVFSHCLKGEGN 256

Query: 310 GSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGD 369
           G  G LV G E L  G  + PLV  P  P  Y + L  + V G  +PI   +F  +   +
Sbjct: 257 G-GGILVLG-EVLEPGIVYSPLV--PSQPH-YNLYLQSISVNGQTLPIDPSVFATSI--N 309

Query: 370 DGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVS 429
            G ++D+GT +  L   AY  F  A  A          +S  + CY +S  V    P VS
Sbjct: 310 RGTIIDSGTTLAYLVEEAYTPFVSAITAAVSQ-SVTPTISKGNQCYLVSTSVGEIFPLVS 368

Query: 430 FYFSGGPVLTLPASNFLIPV---DDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANG 486
             F+G   + L    +L+ +   D A  +C  F     G++I+G++  +     +D A  
Sbjct: 369 LNFAGSASMVLKPEEYLMHLGFYDGAALWCIGFQKVQEGVTILGDLVMKDKIFVYDLARQ 428

Query: 487 FVGFGPNVC 495
            +G+    C
Sbjct: 429 RIGWASYDC 437


>gi|356564743|ref|XP_003550608.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 490

 Score =  124 bits (311), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 104/372 (27%), Positives = 172/372 (46%), Gaps = 40/372 (10%)

Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFSG 208
           G Y+ ++ +G+PP    + ID+GSD++WV C  CS C + S        FDP  S++ S 
Sbjct: 73  GLYYTKVQLGTPPVEFNVQIDTGSDVLWVSCNSCSGCPQTSGLQIQLNFFDPGSSSTSSM 132

Query: 209 VSCSSAVCD---RLENAGCHA--GRCRYEVSYGDGSYTKG-----TLALETLTIGRTVVK 258
           ++CS   C+   +  +A C +   +C Y   YGDGS T G      + L T+  G     
Sbjct: 133 IACSDQRCNNGIQSSDATCSSQNNQCSYTFQYGDGSGTSGYYVSDMMHLNTIFEGSVTTN 192

Query: 259 N---VAIGCGHKNQGMFV----GAAGLLGLGGGSMSLVGQLGGQ--TGGAFSYCLVSRGT 309
           +   V  GC ++  G          G+ G G   MS++ QL  Q      FS+CL  +G 
Sbjct: 193 STAPVVFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRVFSHCL--KGD 250

Query: 310 GSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGD 369
            S G ++   E +     +  LV  P  P  Y + L  + V G  + I   +F  +    
Sbjct: 251 SSGGGILVLGEIVEPNIVYTSLV--PAQPH-YNLNLQSIAVNGQTLQIDSSVFATSN--S 305

Query: 370 DGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRA--SGVSIFDTCYNLSGFVSVRVPT 427
            G ++D+GT +  L   AY+ F  A    T ++P++  + VS  + CY ++  V+   P 
Sbjct: 306 RGTIVDSGTTLAYLAEEAYDPFVSAI---TASIPQSVHTVVSRGNQCYLITSSVTEVFPQ 362

Query: 428 VSFYFSGGPVLTLPASNFLI---PVDDAGTFCFAFAP-SPSGLSIIGNIQQEGIQISFDG 483
           VS  F+GG  + L   ++LI    +  A  +C  F      G++I+G++  +   + +D 
Sbjct: 363 VSLNFAGGASMILRPQDYLIQQNSIGGAAVWCIGFQKIQGQGITILGDLVLKDKIVVYDL 422

Query: 484 ANGFVGFGPNVC 495
           A   +G+    C
Sbjct: 423 AGQRIGWANYDC 434


>gi|413936472|gb|AFW71023.1| hypothetical protein ZEAMMB73_589717 [Zea mays]
          Length = 289

 Score =  124 bits (311), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 94/277 (33%), Positives = 140/277 (50%), Gaps = 32/277 (11%)

Query: 228 RCRYEVSYGDGSYTKGTLALETLTIGR-TVVKNVAIGCGHKN---QGMFVGAAGLLGLGG 283
           +C + +SY DG+ T G  + + LT+    +V+N   GCGH     +G+F    G+LGLG 
Sbjct: 36  QCGFAISYADGTSTVGAYSQDKLTLAPGAIVQNFYFGCGHGKHAVRGLF---DGVLGLG- 91

Query: 284 GSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYV 343
               L   LG + GG FSYCL S  +   G L  G    P G  + P+   P  P+F  V
Sbjct: 92  ---RLRESLGARYGGVFSYCLPSVSS-KPGFLALGAGKNPSGFVFTPMGTVPGQPTFSTV 147

Query: 344 GLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAF---VAQTG 400
            L+G+ VGG ++ +    F        G+++D+GT +T L + AY A R AF   +    
Sbjct: 148 TLAGINVGGKKLDLRPSAF------SGGMIVDSGTVITGLQSTAYRALRSAFRKAMEAYR 201

Query: 401 NLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFA 460
            LP        DTCYNL+G+ +V VP ++  F+GG  + L   N ++ V+     C AFA
Sbjct: 202 LLPNGD----LDTCYNLTGYKNVVVPKIALTFTGGATINLDVPNGIL-VNG----CLAFA 252

Query: 461 PS-PSGLS-IIGNIQQEGIQISFDGANGFVGFGPNVC 495
            S P G + ++GN+ Q   ++ FD +    GF    C
Sbjct: 253 ESGPDGSAGVLGNVNQRAFEVLFDTSTSKFGFRAKAC 289


>gi|356535252|ref|XP_003536162.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 475

 Score =  124 bits (310), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 104/379 (27%), Positives = 172/379 (45%), Gaps = 40/379 (10%)

Query: 147 SGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPA 201
           +G+   +G YF ++G+GSPP+  Y+ +D+GSDI+WV C  CS+C ++SD      ++DP 
Sbjct: 61  NGLPTETGLYFTKLGLGSPPKDYYVQVDTGSDILWVNCVKCSRCPRKSDLGIDLTLYDPK 120

Query: 202 DSASFSGVSCSSAVCDRLENA---GCHAG-RCRYEVSYGDGSYTKGTLALETLTIG---- 253
            S +   +SC    C    +    GC +   C Y ++YGDGS T G    + LT      
Sbjct: 121 GSETSELISCDQEFCSATYDGPIPGCKSEIPCPYSITYGDGSATTGYYVQDYLTYNHVND 180

Query: 254 --RTVVKNVAI--GCGHKNQGMFVGAA-----GLLGLGGGSMSLVGQLG--GQTGGAFSY 302
             RT  +N +I  GCG    G    ++     G++G G  + S++ QL   G+    FS+
Sbjct: 181 NLRTAPQNSSIIFGCGAVQSGTLSSSSEEALDGIIGFGQSNSSVLSQLAASGKVKKIFSH 240

Query: 303 CLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLF 362
           CL +   G  G    G    P   +  PLV  PR  + Y V L  + V    + +  D+F
Sbjct: 241 CLDNIRGG--GIFAIGEVVEP-KVSTTPLV--PRM-AHYNVVLKSIEVDTDILQLPSDIF 294

Query: 363 RLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVS 422
                   G ++D+GT +  LP   Y+      +A+   L        F +C+  +G V 
Sbjct: 295 --DSGNGKGTIIDSGTTLAYLPAIVYDELIPKVMARQPRLKLYLVEQQF-SCFQYTGNVD 351

Query: 423 VRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPS------GLSIIGNIQQEG 476
              P V  +F     LT+   ++L    D G +C  +  S +       ++++G++    
Sbjct: 352 RGFPVVKLHFEDSLSLTVYPHDYLFQFKD-GIWCIGWQKSVAQTKNGKDMTLLGDLVLSN 410

Query: 477 IQISFDGANGFVGFGPNVC 495
             + +D  N  +G+    C
Sbjct: 411 KLVIYDLENMAIGWTDYNC 429


>gi|115473125|ref|NP_001060161.1| Os07g0592200 [Oryza sativa Japonica Group]
 gi|29027762|dbj|BAC65898.1| putative CND41, chloroplast nucleoid DNA binding protein [Oryza
           sativa Japonica Group]
 gi|113611697|dbj|BAF22075.1| Os07g0592200 [Oryza sativa Japonica Group]
          Length = 631

 Score =  124 bits (310), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 98/360 (27%), Positives = 156/360 (43%), Gaps = 31/360 (8%)

Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
           +G Y  R+ +G+P +   +++DSGS + +V C  C QC    DP F P  S+++S V C+
Sbjct: 88  NGYYTTRLYIGTPSQEFALIVDSGSTVTYVPCATCEQCGNHQDPRFQPDLSSTYSPVKCN 147

Query: 213 -SAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT---VVKNVAIGCGHKN 268
               CD          +C YE  Y + S + G L  + ++ G+      +    GC +  
Sbjct: 148 VDCTCDN------ERSQCTYERQYAEMSSSSGVLGEDIMSFGKESELKPQRAVFGCENTE 201

Query: 269 QGMFVG--AAGLLGLGGGSMSLVGQL--GGQTGGAFSYCLVSRGTGSSGSLVFGREALPV 324
            G      A G++GLG G +S++ QL   G    +FS C      G  G++V G   +P 
Sbjct: 202 TGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDVG-GGTMVLG--GMPA 258

Query: 325 GAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLP 384
               V    NP    +Y + L  + V G  + +   +F        G V+D+GT    LP
Sbjct: 259 PPDMVFSHSNPVRSPYYNIELKEIHVAGKALRLDPKIFN----SKHGTVLDSGTTYAYLP 314

Query: 385 TPAYEAFRDAFVAQTGNLPRASG--VSIFDTCY-----NLSGFVSVRVPTVSFYFSGGPV 437
             A+ AF+DA   +  +L +  G   +  D C+     N+S    V  P V   F  G  
Sbjct: 315 EQAFVAFKDAVTNKVNSLKKIRGPDPNYKDICFAGAGRNVSQLSEV-FPDVDMVFGNGQK 373

Query: 438 LTLPASNFLIPVDDA-GTFCF-AFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           L+L   N+L       G +C   F       +++G I      +++D  N  +GF    C
Sbjct: 374 LSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDRHNEKIGFWKTNC 433


>gi|414887402|tpg|DAA63416.1| TPA: hypothetical protein ZEAMMB73_414910 [Zea mays]
          Length = 407

 Score =  124 bits (310), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 102/353 (28%), Positives = 161/353 (45%), Gaps = 38/353 (10%)

Query: 111 RMQRDVKRVATLVRRLSGGGADA-AKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQ 169
           R   +  R+A  +RR  G GA   A+  + D   D+++     +G Y  R+ +G+PP+  
Sbjct: 51  RSYPNASRLAASLRRGLGDGAHPNARMRLHD---DLLT-----NGYYTTRLYIGTPPQEF 102

Query: 170 YMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS-SAVCDRLENAGCHAGR 228
            +++DSGS + +V C  C QC    DP F P  S+S+S V C+    CD  +       +
Sbjct: 103 ALIVDSGSTVTYVPCASCEQCGNHQDPRFQPDLSSSYSPVKCNVDCTCDSDKK------Q 156

Query: 229 CRYEVSYGDGSYTKGTLALETLTIGRT---VVKNVAIGCGHKNQGMFVG--AAGLLGLGG 283
           C YE  Y + S + G L  + ++ GR      +    GC +   G      A G++GLG 
Sbjct: 157 CTYERQYAEMSSSSGVLGEDIVSFGRESELKAQRAVFGCENSETGDLFSQHADGIMGLGR 216

Query: 284 GSMSLVGQL--GGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFY 341
           G +S++ QL   G    +FS C      G  G++V G   +P  +  V    +P    +Y
Sbjct: 217 GQLSIMDQLVEKGVINDSFSLCYGGMDIG-GGAMVLG--GVPTPSDMVFSRSDPLRSPYY 273

Query: 342 YVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGN 401
            + L  + V G  + +   +F        G V+D+GT    LP  A+ AF+DA  ++  +
Sbjct: 274 NIELKEIHVAGKALRVDSRIFDSKH----GTVLDSGTTYAYLPEQAFMAFKDAVTSKVHS 329

Query: 402 LPRASG--VSIFDTCY-----NLSGFVSVRVPTVSFYFSGGPVLTLPASNFLI 447
           L +  G   S  D C+     N+S    V  P V   F  G  L+L   N+L 
Sbjct: 330 LKKIRGPDPSYKDICFAGARRNVSKLHEV-FPDVDMVFGNGQKLSLTPENYLF 381


>gi|222628951|gb|EEE61083.1| hypothetical protein OsJ_14969 [Oryza sativa Japonica Group]
          Length = 367

 Score =  123 bits (309), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 71/191 (37%), Positives = 107/191 (56%), Gaps = 12/191 (6%)

Query: 109 HARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVV--SGMDQGSGEYFVRIGVGSPP 166
           H  ++R ++R      RL+G G   A+ E       VV  + +    GEY V++G+G+PP
Sbjct: 45  HELLRRAIQRSRY---RLAGIGM--ARGEAASARKAVVAETPIMPAGGEYLVKLGIGTPP 99

Query: 167 RSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGC-- 224
                 ID+ SD++W QCQPC+ CY Q DP+F+P  S++++ + CSS  CD L+   C  
Sbjct: 100 YKFTAAIDTASDLIWTQCQPCTGCYHQVDPMFNPRVSSTYAALPCSSDTCDELDVHRCGH 159

Query: 225 -HAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQG--MFVGAAGLLGL 281
                C+Y  +Y   + T+GTLA++ L IG    + VA GC   + G      A+G++GL
Sbjct: 160 DDDESCQYTYTYSGNATTEGTLAVDKLVIGEDAFRGVAFGCSTSSTGGAPPPQASGVVGL 219

Query: 282 GGGSMSLVGQL 292
           G G +SLV QL
Sbjct: 220 GRGPLSLVSQL 230



 Score = 50.1 bits (118), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 51/195 (26%), Positives = 83/195 (42%), Gaps = 13/195 (6%)

Query: 306 SRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLT 365
           + GT +   LV G +A    A          AP     G+ GLG G + +     + R  
Sbjct: 177 TEGTLAVDKLVIGEDAFRGVAFGCSTSSTGGAPPPQASGVVGLGRGPLSLVSQLSVRRY- 235

Query: 366 QMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSI-FDTCYNLS---GFV 421
                G+++D  + +T L    Y+   +    +   LPR +G S+  D C+ L     F 
Sbjct: 236 -----GMIIDIASTITFLEASLYDELVNDLEVEI-RLPRGTGSSLGLDLCFILPDGVAFD 289

Query: 422 SVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSG-LSIIGNIQQEGIQIS 480
            V VP V+  F G   L L  +       ++G  C     + +G +SI+GN QQ+ +Q+ 
Sbjct: 290 RVYVPAVALAFDGR-WLRLDKARLFAEDRESGMMCLMVGRAEAGSVSILGNFQQQNMQVL 348

Query: 481 FDGANGFVGFGPNVC 495
           ++   G V F  + C
Sbjct: 349 YNLRRGRVTFVQSPC 363


>gi|222616654|gb|EEE52786.1| hypothetical protein OsJ_35257 [Oryza sativa Japonica Group]
          Length = 346

 Score =  123 bits (309), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 99/360 (27%), Positives = 159/360 (44%), Gaps = 40/360 (11%)

Query: 160 IGVGSPPRSQYMVIDSGSDIVWVQCQPCS-QCYKQSDP---VFDPADSASFSGVSCSSAV 215
           I +G+PP    + ID+GS + WVQC+ C  +CY Q+     +F+P +S+++S V CS+  
Sbjct: 3   ISLGTPPVFNLVTIDTGSTLSWVQCKNCQIKCYDQAAKAGQIFNPYNSSTYSKVGCSTEA 62

Query: 216 CDRLE-----NAGC--HAGRCRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHK 267
           C+ +        GC      C Y + YG G Y+ G L  + LT+     + N   GCG  
Sbjct: 63  CNGMHMDLAVEYGCVEEDDTCIYSLRYGSGEYSVGYLGKDRLTLASNRSIDNFIFGCGED 122

Query: 268 NQGMFVGA-AGLLGLGGGSMSLVGQLGGQTG-GAFSYCLVSRGTGSSGSLVFGREALPVG 325
           N  ++ G  AG++G G  S S   Q+  QT   AFSYC   R   + GSL  G  A  + 
Sbjct: 123 N--LYNGVNAGIIGFGTKSYSFFNQVCQQTDYTAFSYCF-PRDHENEGSLTIGPYARDIN 179

Query: 326 AAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPT 385
             W  L+     P+ Y +    + V G+R+ I   ++ +++M     ++D+GTA T + +
Sbjct: 180 LMWTKLIYYDHKPA-YAIQQLDMMVNGIRLEIDPYIY-ISKM----TIVDSGTADTYILS 233

Query: 386 PAYEAFRDAFVAQTGNLPRASGVSIFDTCY-------NLSGFVSVRVPTVSFYFSGGPVL 438
           P ++A   A   +        G      C+       N + F +V +  +         L
Sbjct: 234 PVFDALDKAMTKEMQAKGYTRGWDERRICFISNSGSANWNDFPTVEMKLIR------STL 287

Query: 439 TLPASNFLIPVDDAGTFCFAFAPSPS---GLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
            LP  N      +    C  F P  +   G+ ++GN      ++ FD      GF    C
Sbjct: 288 KLPVENAFYESSN-NVICSTFLPDDAGVRGVQMLGNRAVRSFKLVFDIQAMNFGFKARAC 346


>gi|297735249|emb|CBI17611.3| unnamed protein product [Vitis vinifera]
          Length = 480

 Score =  123 bits (309), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 98/322 (30%), Positives = 149/322 (46%), Gaps = 33/322 (10%)

Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFS 207
           +G YF +IG+G+P +  Y+ +D+GSDI+WV C  C +C  +SD      ++D   S +  
Sbjct: 71  AGLYFAKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRCPTKSDLGVDLTLYDMKASTTSD 130

Query: 208 GVSCSSAVCDRLEN--AGCHAG-RCRYEVSYGDGSYTKGTLALETLTIGR------TVVK 258
            V C    C   +    GC  G +C Y V YGDGS T G    + +   R      T   
Sbjct: 131 AVGCDDNFCSLYDGPLPGCKPGLQCLYSVLYGDGSSTTGYFVQDFVQYNRISGNFQTTPT 190

Query: 259 N--VAIGCGHKNQGMFVGAA----GLLGLGGGSMSLVGQLG--GQTGGAFSYCLVSRGTG 310
           N  V  GCG+K  G    ++    G+LG G  + S++ QL   G+    FS+CL +   G
Sbjct: 191 NGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCLDNVDGG 250

Query: 311 SSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGD- 369
             G    G    P      PLV+N    + Y V +  + VGG  + +  D F   + GD 
Sbjct: 251 --GIFAIGEVVEP-KVNITPLVQN---QAHYNVVMKEIEVGGDPLDVPSDAF---ESGDR 301

Query: 370 DGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVS 429
            G ++D+GT +   P   Y    +  ++Q  +L R   V    TC++ +G V    PTV+
Sbjct: 302 KGTIIDSGTTLAYFPQEVYVPLIEKILSQQPDL-RLHTVEQAFTCFDYTGNVDDGFPTVT 360

Query: 430 FYFSGGPVLTLPASNFLIPVDD 451
            +F     LT+    +L  V +
Sbjct: 361 LHFDKSISLTVYPHEYLFQVKE 382


>gi|125606590|gb|EAZ45626.1| hypothetical protein OsJ_30294 [Oryza sativa Japonica Group]
          Length = 431

 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 110/367 (29%), Positives = 169/367 (46%), Gaps = 42/367 (11%)

Query: 148 GMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFS 207
           G+ +   E  V +G+G+P  +  +V D+ SD++W QCQPC  C  Q+  ++DP  + +++
Sbjct: 80  GVQEKHVEPHVFLGIGTPAMNVTLVFDTTSDLLWTQCQPCLSCVAQAGDMYDPNKTETYA 139

Query: 208 GVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHK 267
            ++ SS                 Y  +Y   S+T G  A ET  +G   V N+  GCG +
Sbjct: 140 NLTSSS-----------------YNYTYSKQSFTSGYFATETFALGNVTVANITFGCGTR 182

Query: 268 NQGMFVGAA---GLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFG------ 318
           NQG +   A   G+   G G +SL+ QLG      FSYC  S G   S ++  G      
Sbjct: 183 NQGYYDNVAGVFGVGRGGRGGVSLLNQLGIDR---FSYCFSSSGAPGSSAVFLGGSPELA 239

Query: 319 REALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGT 378
             A    AA  P+V +P   S Y+V L G+ VG   + ++       + G   +V+D+ +
Sbjct: 240 TNATTTPAASTPMVADPVLKSGYFVKLVGVTVGATLVDVAGA--SSAEGGGRALVIDSTS 297

Query: 379 AVTRLPTPAYEAFRDAFVAQTGNLPRA-----SGVSIFDTCYNLSGFVSVRVP---TVSF 430
            VT L    Y   R A VAQ   L  A     +GV + D C+ L+   +   P   T++ 
Sbjct: 298 PVTVLDEATYGPVRRALVAQLAPLKEANANASAGVGL-DLCFELAAGGATPTPPNVTMTL 356

Query: 431 YFSGGPV-LTLPASNFLIPVDDAGTFCFAFAPSPS-GLSIIGNIQQEGIQISFDGANGFV 488
           +F GG   L LP +++L      G  C    PS S G+ ++G+       + +D A   V
Sbjct: 357 HFDGGAADLVLPPASYLAKDSAGGLICLTMTPSSSNGVPVLGSWALLDTLVLYDLAKNVV 416

Query: 489 GFGPNVC 495
            F P  C
Sbjct: 417 SFQPLDC 423


>gi|359476756|ref|XP_002277082.2| PREDICTED: aspartic proteinase-like protein 2-like isoform 2 [Vitis
           vinifera]
          Length = 560

 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 98/324 (30%), Positives = 150/324 (46%), Gaps = 33/324 (10%)

Query: 147 SGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPA 201
           +G    +G YF +IG+G+P +  Y+ +D+GSDI+WV C  C +C  +SD      ++D  
Sbjct: 146 NGHPSEAGLYFAKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRCPTKSDLGVDLTLYDMK 205

Query: 202 DSASFSGVSCSSAVCDRLEN--AGCHAG-RCRYEVSYGDGSYTKGTLALETLTIGR---- 254
            S +   V C    C   +    GC  G +C Y V YGDGS T G    + +   R    
Sbjct: 206 ASTTSDAVGCDDNFCSLYDGPLPGCKPGLQCLYSVLYGDGSSTTGYFVQDFVQYNRISGN 265

Query: 255 --TVVKN--VAIGCGHKNQGMFVGAA----GLLGLGGGSMSLVGQLG--GQTGGAFSYCL 304
             T   N  V  GCG+K  G    ++    G+LG G  + S++ QL   G+    FS+CL
Sbjct: 266 FQTTPTNGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCL 325

Query: 305 VSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRL 364
            +   G  G    G E +       PLV+N    + Y V +  + VGG  + +  D F  
Sbjct: 326 DNVDGG--GIFAIG-EVVEPKVNITPLVQN---QAHYNVVMKEIEVGGDPLDVPSDAF-- 377

Query: 365 TQMGD-DGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSV 423
            + GD  G ++D+GT +   P   Y    +  ++Q  +L R   V    TC++ +G V  
Sbjct: 378 -ESGDRKGTIIDSGTTLAYFPQEVYVPLIEKILSQQPDL-RLHTVEQAFTCFDYTGNVDD 435

Query: 424 RVPTVSFYFSGGPVLTLPASNFLI 447
             PTV+ +F     LT+    +L 
Sbjct: 436 GFPTVTLHFDKSISLTVYPHEYLF 459


>gi|297734873|emb|CBI17107.3| unnamed protein product [Vitis vinifera]
          Length = 484

 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 108/369 (29%), Positives = 169/369 (45%), Gaps = 35/369 (9%)

Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD---PV--FDPADSASFSG 208
           G YF R+ +GSPP+  Y+ ID+GSD++WV C  C+ C + S    P+  FDP  S++ S 
Sbjct: 66  GLYFTRVLLGSPPKEFYVQIDTGSDVLWVSCGSCNGCPQSSGLHIPLNFFDPGSSSTASL 125

Query: 209 VSCSSAVCD---RLENAGC--HAGRCRYEVSYGDGSYTKGTLALETLT----IGRTVVK- 258
           +SCS   C    +  +AGC     +C Y   YGDGS T G    + L     +G +V   
Sbjct: 126 ISCSDQRCSLGVQSSDAGCSSQGNQCIYTFQYGDGSGTSGYYVSDLLNFDAIVGSSVTNS 185

Query: 259 --NVAIGCGHKNQGMFV----GAAGLLGLGGGSMSLVGQLGGQ--TGGAFSYCLVSRGTG 310
             ++  GC     G          G+ G G   MS++ Q+  Q  T   FS+CL   G G
Sbjct: 186 SASIVFGCSISQTGDLTKSDRAVDGIFGFGQQDMSVISQMSSQGITPKVFSHCLKGDGGG 245

Query: 311 SSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDD 370
               ++   E +     + PLV  P  P  Y + L  + V G  + I  ++F  +   + 
Sbjct: 246 GGILVL--GEIVEEDIVYSPLV--PSQPH-YNLNLQSISVNGKSLAIDPEVFATST--NR 298

Query: 371 GVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSF 430
           G ++D+GT +  L   AY+ F  A         R   +S    CY ++  V    PTVS 
Sbjct: 299 GTIVDSGTTLAYLAEEAYDPFVSAITEAVSQSVRPL-LSKGTQCYLITSSVKGIFPTVSL 357

Query: 431 YFSGGPVLTLPASNFLI---PVDDAGTFCFAFAP-SPSGLSIIGNIQQEGIQISFDGANG 486
            F+GG  + L   ++L+    + DA  +C  F      G++I+G++  +     +D A  
Sbjct: 358 NFAGGVSMNLKPEDYLLQQNSIGDAAVWCIGFQKIQGQGITILGDLVLKDKIFVYDLAGQ 417

Query: 487 FVGFGPNVC 495
            +G+    C
Sbjct: 418 RIGWANYDC 426


>gi|357460823|ref|XP_003600693.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355489741|gb|AES70944.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 431

 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 102/366 (27%), Positives = 164/366 (44%), Gaps = 44/366 (12%)

Query: 158 VRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPV--FDPADSASFSGVSCSSAV 215
           + + +G+PP++Q MV+D+GS + W+      QC+K+  P   FDP+ S++FS + C+  +
Sbjct: 77  INLPIGTPPQTQPMVLDTGSQLSWI------QCHKKQPPTASFDPSLSSTFSILPCTHPL 130

Query: 216 C-----DRLENAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRTV-VKNVAIGCGHKN 268
           C     D      C   R C Y   Y DG+Y +G L  E  T  R+V    + +GC  ++
Sbjct: 131 CKPRIPDFTLPTSCDQNRLCHYSYFYADGTYAEGNLVREKFTFSRSVSTPPLILGCATES 190

Query: 269 QGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR----GTGSSGSLVFGREALPV 324
                   G+LG+  G +S   Q        FSYC+  R    G   +GS   G      
Sbjct: 191 ----TDPRGILGMNLGRLSFAKQ---SKITKFSYCVPPRQTRPGFTPTGSFYLGNNPSSK 243

Query: 325 GAAWVPLVRNP--RAPSF----YYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGT 378
           G  +V ++ +   R P+F    Y + + G+ + G ++ IS  +FR    G    ++D+G+
Sbjct: 244 GFKYVGMMTSSRQRMPNFDPLAYTIPMVGIRIAGKKLNISPAVFRADAGGSGQTMIDSGS 303

Query: 379 AVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIF----DTCYNLSGFVSV--RVPTVSFYF 432
             T L + AY+  R   V   G  PR     ++    D C++    V +   +  + F F
Sbjct: 304 EFTYLVSEAYDKVRAQVVRAVG--PRLKKGYVYGGVADMCFDSVKAVEIGRLIGEMVFEF 361

Query: 433 SGGPVLTLPASNFLIPVDDAGTFCFAFAPSP---SGLSIIGNIQQEGIQISFDGANGFVG 489
             G  + +P    L  V   G  C     S    +  +IIGN  Q+ + + FD     VG
Sbjct: 362 ERGVEVVIPKERVLADV-GGGVHCVGIGSSDKLGAASNIIGNFHQQNLWVEFDLVRRRVG 420

Query: 490 FGPNVC 495
           FG   C
Sbjct: 421 FGKADC 426


>gi|15241713|ref|NP_195839.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|75181297|sp|Q9LZL3.1|PCS1L_ARATH RecName: Full=Aspartic proteinase PCS1; AltName: Full=Aspartic
           protease 38; Short=AtASP38; AltName: Full=Protein EMBRYO
           DEFECTIVE 24; AltName: Full=Protein PROMOTION OF CELL
           SURVIVAL 1; Flags: Precursor
 gi|7340693|emb|CAB82992.1| putative protein [Arabidopsis thaliana]
 gi|50897174|gb|AAT85726.1| At5g02190 [Arabidopsis thaliana]
 gi|53828617|gb|AAU94418.1| At5g02190 [Arabidopsis thaliana]
 gi|110742159|dbj|BAE99007.1| hypothetical protein [Arabidopsis thaliana]
 gi|332003059|gb|AED90442.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 453

 Score =  123 bits (308), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 115/375 (30%), Positives = 166/375 (44%), Gaps = 58/375 (15%)

Query: 165 PPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPV--FDPADSASFSGVSCSSAVC-----D 217
           PP++  MVID+GS++ W++C   S      +PV  FDP  S+S+S + CSS  C     D
Sbjct: 82  PPQNISMVIDTGSELSWLRCNRSSN----PNPVNNFDPTRSSSYSPIPCSSPTCRTRTRD 137

Query: 218 RLENAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGR-TVVKNVAIGCGHKNQG----M 271
            L  A C + + C   +SY D S ++G LA E    G  T   N+  GC     G     
Sbjct: 138 FLIPASCDSDKLCHATLSYADASSSEGNLAAEIFHFGNSTNDSNLIFGCMGSVSGSDPEE 197

Query: 272 FVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALP--VGAAWV 329
                GLLG+  GS+S + Q+G      FSYC +S      G L+ G           + 
Sbjct: 198 DTKTTGLLGMNRGSLSFISQMGFP---KFSYC-ISGTDDFPGFLLLGDSNFTWLTPLNYT 253

Query: 330 PLVR-NPRAPSF----YYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLP 384
           PL+R +   P F    Y V L+G+ V G  +PI + +      G    ++D+GT  T L 
Sbjct: 254 PLIRISTPLPYFDRVAYTVQLTGIKVNGKLLPIPKSVLVPDHTGAGQTMVDSGTQFTFLL 313

Query: 385 TPAYEAFRDAFVAQTGNL------PRASGVSIFDTCYNLSGF-----VSVRVPTVSFYF- 432
            P Y A R  F+ +T  +      P        D CY +S       +  R+PTVS  F 
Sbjct: 314 GPVYTALRSHFLNRTNGILTVYEDPDFVFQGTMDLCYRISPVRIRSGILHRLPTVSLVFE 373

Query: 433 ------SGGPVLTLPASNFLIP---VDDAGTFCFAFAPSP-SGLS--IIGNIQQEGIQIS 480
                 SG P+L      + +P   V +   +CF F  S   G+   +IG+  Q+ + I 
Sbjct: 374 GAEIAVSGQPLL------YRVPHLTVGNDSVYCFTFGNSDLMGMEAYVIGHHHQQNMWIE 427

Query: 481 FDGANGFVGFGPNVC 495
           FD     +G  P  C
Sbjct: 428 FDLQRSRIGLAPVEC 442


>gi|357482031|ref|XP_003611301.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355512636|gb|AES94259.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 481

 Score =  123 bits (308), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 108/402 (26%), Positives = 164/402 (40%), Gaps = 67/402 (16%)

Query: 155 EYFVRIGVGS-PPRSQYMVIDSGSDIVWVQCQP--CSQCYKQSDPVFDPADSASFSGVSC 211
           +Y +   +GS PP+   + +D+GSD+VW  C P  C  C  +         +     VSC
Sbjct: 74  DYTLSFNLGSNPPQLITLYMDTGSDLVWFPCSPFECILCEGKPQTTKPANITKQTHSVSC 133

Query: 212 SSA-------------VC-------DRLENAGCHAGRCR-YEVSYGDGSYTKGTLALETL 250
            S              +C       D +E + C +  C  +  +YGDGS+    L  +TL
Sbjct: 134 QSPACSAAHASMSSSNLCAISRCPLDYIETSDCSSFSCPPFYYAYGDGSFV-ANLYQQTL 192

Query: 251 TIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGG---QTGGAFSYCLVSR 307
           ++    ++N   GC H          G+ G G G +SL  QL       G  FSYCLVS 
Sbjct: 193 SLSSLHLQNFTFGCAHT---ALAEPTGVAGFGRGILSLPAQLSTLSPHLGNRFSYCLVSH 249

Query: 308 G-----TGSSGSLVFGREALPVGAA---------WVPLVRNPRAPSFYYVGLSGLGVGGM 353
                       L+ GR    +  A         +  ++ NP+ P +Y VGL+G+ VG  
Sbjct: 250 SFDGDRLRRPSPLILGRHNDTITGAGDGESVEFVYTSMLSNPKHPYYYCVGLAGISVGKR 309

Query: 354 RIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNL-PRASGVSI-- 410
            +P  E L R+ + G+ G+V+D+GT  T LP   Y A  + F  +      RAS +    
Sbjct: 310 TVPAPEILKRVDEKGNGGMVVDSGTTFTMLPESFYNAVVNEFDKRVNRFHKRASEIETKT 369

Query: 411 -FDTCYNLSGFVSVRVPTVSFYFSGGPV-LTLPASNFLIPVDDAG--------TFCFAFA 460
               CY L+G    ++P +  +F G    + LP  N+     D G          C    
Sbjct: 370 GLGPCYYLNGL--SQIPVLKLHFVGNNSDVVLPRKNYFYEFMDGGDGIRRKGKVGCMMLM 427

Query: 461 PSPSGLSI-------IGNIQQEGIQISFDGANGFVGFGPNVC 495
                  +       +GN QQ+G ++ +D     VGF    C
Sbjct: 428 NGEDETELDGGPGATLGNYQQQGFEVVYDLEKERVGFAKKEC 469


>gi|242067691|ref|XP_002449122.1| hypothetical protein SORBIDRAFT_05g005410 [Sorghum bicolor]
 gi|241934965|gb|EES08110.1| hypothetical protein SORBIDRAFT_05g005410 [Sorghum bicolor]
          Length = 407

 Score =  123 bits (308), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 101/380 (26%), Positives = 165/380 (43%), Gaps = 60/380 (15%)

Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQ----PCSQCYKQSDPVFDPADSASFSG 208
           +G ++V + +G P +  ++ ID+GS++ W++C     PC  C K   P++ P        
Sbjct: 37  TGHFYVTMNIGEPAKPYFLDIDTGSNLTWIKCHATPGPCKTCNKVPHPLYRPKKL----- 91

Query: 209 VSCSSAVCDRLEN-----AGC--HAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVA 261
           V C+  +CD L         C     +C Y+++Y DG+ + G L L+  ++     +N+A
Sbjct: 92  VPCADPLCDALHKDLGTTKDCREEPDQCHYQINYADGTTSLGVLLLDKFSLPTGSARNIA 151

Query: 262 IGCGH-------KNQGMFVGAAGLLGLGGGSMSLVGQL---GGQTGGAFSYCLVSRGTGS 311
            GCG+       K     V   G+LGLG GS+ LV QL   G  +     +CL S+G   
Sbjct: 152 FGCGYDQMQGPKKKAPEKVPVDGILGLGRGSVDLVSQLKHSGAVSKNVIGHCLSSKG--- 208

Query: 312 SGSLVFGREALPVGAAWVPLVRN-PRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDD 370
            G L  G E +P     +  +    R P+ Y  G + L +G  R PI    F+       
Sbjct: 209 GGYLFIGEENVPSSHLHIIYIYCISREPNHYSPGQATLHLG--RNPIGTKPFK------- 259

Query: 371 GVVMDTGTAVTRLPTPAY----EAFRDAFVAQTGNLPRAS---------GVSIFDTCYNL 417
             + D+G+  T LP   +     A + + +  +  L   +         G   F T ++L
Sbjct: 260 -AIFDSGSTYTYLPENLHAQLVSALKASLIKSSLKLVSDTDTRLHLCWKGPKPFKTVHDL 318

Query: 418 SG-FVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPS-GLSIIGNIQQE 475
              F S+    V+  F  G  +T+P  N+LI +   G  CF     P   L +IG I  +
Sbjct: 319 PKEFKSL----VTLKFDHGVTMTIPPENYLI-ITGHGNACFGILELPGYDLFVIGGISMQ 373

Query: 476 GIQISFDGANGFVGFGPNVC 495
              +  D   G + + P+ C
Sbjct: 374 EQLVIHDNEKGRLAWMPSPC 393


>gi|38344991|emb|CAE01597.2| OSJNBa0008A08.5 [Oryza sativa Japonica Group]
 gi|116309515|emb|CAH66581.1| OSIGBa0137O04.7 [Oryza sativa Indica Group]
 gi|222628622|gb|EEE60754.1| hypothetical protein OsJ_14310 [Oryza sativa Japonica Group]
          Length = 494

 Score =  122 bits (307), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 107/379 (28%), Positives = 171/379 (45%), Gaps = 44/379 (11%)

Query: 148 GMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPAD 202
           G+   +G Y+  IG+G+P +  Y+ +D+GSDI+WV C  C +C ++S       ++DP D
Sbjct: 81  GLPTDTGLYYTEIGIGTPTKRYYVQVDTGSDILWVNCISCDRCPRKSGLGLELTLYDPKD 140

Query: 203 SASFSGVSCSSAVCDRLENA---GCHAGR-CRYEVSYGDGSYTKGTLALETLTI------ 252
           S++ S VSC    C         GC     C Y V+YGDGS T G    + L        
Sbjct: 141 SSTGSKVSCDQGFCAATYGGLLPGCTTSLPCEYSVTYGDGSSTTGYFVSDLLQFDQVSGD 200

Query: 253 GRTVVKN--VAIGCGHKNQGMFVGAA-----GLLGLGGGSMSLVGQL--GGQTGGAFSYC 303
           G+T   N  V  GCG + QG  +G++     G++G G  + S++ QL   G+    F++C
Sbjct: 201 GQTRPANSTVTFGCGSQ-QGGDLGSSNQALDGIIGFGQSNTSMLSQLSAAGKVKKIFAHC 259

Query: 304 LVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFR 363
           L +   G  G    G    P      PLV  P  P  Y V L  + VGG  + +   +F 
Sbjct: 260 LDTINGG--GIFAIGNVVQP-KVKTTPLV--PNMPH-YNVNLKSIDVGGTALKLPSHMFD 313

Query: 364 LTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSV 423
             +    G ++D+GT +T LP   Y+    A  A+  ++   + V  F  C+   G V  
Sbjct: 314 TGE--KKGTIIDSGTTLTYLPEIVYKEIMLAVFAKHKDITFHN-VQEF-LCFQYVGRVDD 369

Query: 424 RVPTVSFYFSGG-PVLTLPASNFLIPVDDAGTFCFAF------APSPSGLSIIGNIQQEG 476
             P ++F+F    P+   P   F    D+   +C  F      +    G+ ++G++    
Sbjct: 370 DFPKITFHFENDLPLNVYPHDYFFENGDNL--YCVGFQNGGLQSKDGKGMVLLGDLVLSN 427

Query: 477 IQISFDGANGFVGFGPNVC 495
             + +D  N  +G+    C
Sbjct: 428 KLVVYDLENQVIGWTEYNC 446


>gi|225440729|ref|XP_002275391.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
 gi|147789748|emb|CAN67404.1| hypothetical protein VITISV_025615 [Vitis vinifera]
          Length = 450

 Score =  122 bits (307), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 99/382 (25%), Positives = 161/382 (42%), Gaps = 47/382 (12%)

Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQ---PCSQC-YKQSD----PVFDPADSAS 205
           G + + +  G+PP+    ++D+GSD+VW  C     C+ C +  +D    P+FDP  S+S
Sbjct: 76  GGHSISLSFGTPPQKLSFLVDTGSDVVWAPCTTDYTCTNCSFSAADPKKVPIFDPKLSSS 135

Query: 206 FSGVSCSSAVCDRLENAGCHAG-------------RCRYEVSYGDGSYTKGTLALETLTI 252
              + C +  C        H G              C Y   YG G+ + G   LE L  
Sbjct: 136 SKILDCRNPKCVSTYFPYVHLGCPRCNGNSKHCSYACPYSTQYGTGA-SSGYFLLENLKF 194

Query: 253 GRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR---GT 309
            R  ++N  +GC   +    + +  L G G    SL  Q+G +    F+YCL S     T
Sbjct: 195 PRKTIRNFLLGCT-TSAARELSSDALAGFGRSMFSLPIQMGVK---KFAYCLNSHDYDDT 250

Query: 310 GSSGSLVFG-REALPVGAAWVPLVRNPRAPSFYY-VGLSGLGVGGMRIPISEDLFRLTQM 367
            +SG L+   R+    G ++ P +++P A +FYY +G+  + +G   + I          
Sbjct: 251 RNSGKLILDYRDGKTKGLSYTPFLKSPPASAFYYHLGVKDIKIGNKLLRIPSKYLAPGSD 310

Query: 368 GDDGVVMDTGT-AVTRLPTPAYEAFRDAFVAQTGNLPR---ASGVSIFDTCYNLSGFVSV 423
           G  GV++D+G      +  P ++   +    Q     R   A   +    CYN +G  S+
Sbjct: 311 GRSGVIIDSGYGGAGYMTGPVFKIVTNELKKQMSKYRRSLEAETQTGLTPCYNFTGHKSI 370

Query: 424 RVPTVSFYFSGGPVLTLPASNFL----------IPVDDAGTFCFAFAPSPSGLSIIGNIQ 473
           ++P + + F GG  + +P  N+             +D  GT      P PS   I+GN Q
Sbjct: 371 KIPPLIYQFRGGANMVVPGKNYFGISPQESLACFLMDTNGTNALEITPDPS--IILGNSQ 428

Query: 474 QEGIQISFDGANGFVGFGPNVC 495
                + +D  N   GF    C
Sbjct: 429 HVDYYVEYDLKNDRFGFRRQTC 450


>gi|307103543|gb|EFN51802.1| hypothetical protein CHLNCDRAFT_59135 [Chlorella variabilis]
          Length = 746

 Score =  122 bits (307), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 105/365 (28%), Positives = 167/365 (45%), Gaps = 31/365 (8%)

Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC-SQC-YKQSDPVFDPADSASFSGVSC 211
           G ++  + +G+P +   +++D+GS + +V C  C S C     D  FDP  S++ S +SC
Sbjct: 76  GYFYATLYLGTPAKKFAVIVDTGSTMTYVPCSSCGSGCGPNHQDAAFDPEASSTASRISC 135

Query: 212 SSAVCD-RLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVV-KNVAIGCGHKNQ 269
           +S  C       GC   +C Y  SY + S + G L  + L +   +    +  GC  +  
Sbjct: 136 TSPKCSCGSPRCGCSTQQCTYTRSYAEQSSSSGILLEDVLALHDGLPGAPIIFGCETRET 195

Query: 270 GMFVG--AAGLLGLGGGSMSLVGQL--GGQTGGAFSYCL-VSRGTGSSGSLVFGREALP- 323
           G      A GL GLG    S+V QL   G     FS C  +  G    G+L+ G   +P 
Sbjct: 196 GEIFRQRADGLFGLGNSDASVVNQLVKAGVIDDVFSLCFGMVEG---DGALLLGDAEVPG 252

Query: 324 -VGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTR 382
            +   + PL+ +   P +Y V +  L V G  +P+S+ LF        G V+D+GT  T 
Sbjct: 253 SISLQYTPLLTSTTHPFYYNVKMLSLAVEGQLLPVSQSLFDQGY----GTVLDSGTTFTY 308

Query: 383 LPTPAYEAFRDAF--VAQTGNLPRASGV--SIFDTCY-------NLSGFVSVRVPTVSFY 431
           +P+P ++AF  A    A +  L R  G      D C+       +L    SV  P++   
Sbjct: 309 MPSPVFKAFAGAVEKYALSHGLKRVPGPDPQFDDICFGQAPSHDDLEALSSV-FPSMEVQ 367

Query: 432 FSGGPVLTLPASNFL-IPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGF 490
           F  G  L L   N+L +   ++G +C     +    +++G I    + + +D AN  VGF
Sbjct: 368 FDQGTSLVLGPLNYLFVHTFNSGKYCLGVFDNGRAGTLLGGITFRNVLVRYDRANQRVGF 427

Query: 491 GPNVC 495
           GP +C
Sbjct: 428 GPALC 432


>gi|363808270|ref|NP_001242239.1| uncharacterized protein LOC100801883 [Glycine max]
 gi|255641727|gb|ACU21134.1| unknown [Glycine max]
          Length = 475

 Score =  122 bits (307), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 107/379 (28%), Positives = 172/379 (45%), Gaps = 40/379 (10%)

Query: 147 SGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPA 201
           +G+   +G YF ++G+GSPPR  Y+ +D+GSDI+WV C  CS+C ++SD      ++DP 
Sbjct: 61  NGLPTETGLYFTKLGLGSPPRDYYVQVDTGSDILWVNCVECSRCPRKSDLGIDLTLYDPK 120

Query: 202 DSASFSGVSCSSAVCDRLENA---GCHAG-RCRYEVSYGDGSYTKGTLALETLTIG---- 253
            S +   VSC    C    +    GC +   C Y ++YGDGS T G    + LT      
Sbjct: 121 GSETSDVVSCDQDFCSATFDGPIPGCKSEIPCPYSITYGDGSATTGYYVQDYLTYNRING 180

Query: 254 --RTVVKNVAI--GCGHKNQGMFVGAA-----GLLGLGGGSMSLVGQLG--GQTGGAFSY 302
             RT  +N +I  GCG    G    ++     G++G G  + S++ QL   G+    FS+
Sbjct: 181 NLRTSPQNSSIIFGCGAVQSGTLGSSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSH 240

Query: 303 CLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLF 362
           CL +   G  G    G    P   +  PLV  PR  + Y V L  + V    + +  D+F
Sbjct: 241 CLDNVRGG--GIFAIGEVVEP-KVSTTPLV--PRM-AHYNVVLKSIEVDTDILQLPSDIF 294

Query: 363 RLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVS 422
               +   G V+D+GT +  LP   Y+      +A+   L        F  C+  +G V 
Sbjct: 295 --DSVNGKGTVIDSGTTLAYLPDIVYDELIQKVLARQPGLKLYLVEQQF-RCFLYTGNVD 351

Query: 423 VRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPS------GLSIIGNIQQEG 476
              P V  +F     LT+   ++L    D G +C  +  S +       ++++G++    
Sbjct: 352 RGFPVVKLHFKDSLSLTVYPHDYLFQFKD-GIWCIGWQRSVAQTKNGKDMTLLGDLVLSN 410

Query: 477 IQISFDGANGFVGFGPNVC 495
             + +D  N  +G+    C
Sbjct: 411 KLVIYDLENMVIGWTDYNC 429


>gi|449458942|ref|XP_004147205.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
 gi|449505000|ref|XP_004162350.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
          Length = 480

 Score =  122 bits (307), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 119/403 (29%), Positives = 170/403 (42%), Gaps = 68/403 (16%)

Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQP-----CSQCYKQSDPVFDPADSASFS- 207
           G+Y +   +GS      + +D+GSD+VW  C P     C    K   P+   A++ S S 
Sbjct: 74  GDYTLSFNLGSESHKISLYMDTGSDLVWFPCSPFECILCEGKPKIQSPLPKIANNKSVSC 133

Query: 208 ----------GVSCSSAVC-------DRLENAGCHAGRCR-YEVSYGDGSYT----KGTL 245
                     G   +S +C       + +E + C +  C  +  +YGDGS      + +L
Sbjct: 134 SAAACSAAHGGSLSASHLCAISRCPLESIEISECSSFSCPPFYYAYGDGSLVARLYRDSL 193

Query: 246 ALETLTIGRTV-VKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLG---GQTGGAFS 301
           +L T      + V+N   GC H   G  VG AG    G G +S+  QL     Q G  FS
Sbjct: 194 SLPTPAPSPPINVRNFTFGCAHTTLGEPVGVAGF---GRGVLSMPSQLATFSPQLGNRFS 250

Query: 302 YCLVSRGTGSS-----GSLVFGREAL-PVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRI 355
           YCLVS    +        L+ GR         +  L+ NP+ P FY VGL+G+ VG +RI
Sbjct: 251 YCLVSHSFAADRVRRPSPLILGRYYTGETEFIYTSLLENPKHPYFYSVGLAGISVGNIRI 310

Query: 356 PISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLP----RASGVSIF 411
           P  E L ++ + G  GVV+D+GT  T LP   YE+    F  +TG +     R    +  
Sbjct: 311 PAPEFLTKVDEGGSGGVVVDSGTTFTMLPAGLYESVVAEFENRTGKVANRARRIEENTGL 370

Query: 412 DTCYNLSGFVSVRVPTVSFYFSGGPV-LTLPASNFLIPVDDAG---------TFCF---- 457
             CY      SV VP V  +F G    + LP  N+     D G           C     
Sbjct: 371 SPCYYYEN--SVGVPRVVLHFVGEKSNVVLPRKNYFYEFLDGGDGVVGRKRKVGCLMLMN 428

Query: 458 -----AFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
                  A  P   + +GN QQ+G ++ +D     VGF    C
Sbjct: 429 GGDEAELAGGPG--ATLGNYQQQGFEVVYDLEKNRVGFARRQC 469


>gi|449522369|ref|XP_004168199.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-2-like [Cucumis sativus]
          Length = 457

 Score =  122 bits (307), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 103/387 (26%), Positives = 158/387 (40%), Gaps = 54/387 (13%)

Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQP---CSQC-YKQSDPV----FDPADSAS 205
           G Y   +  G+P ++ +++ D+GS +VW  C     CS+C + + DP     F P  S+S
Sbjct: 79  GAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSFPKIDPTGIPRFVPKLSSS 138

Query: 206 FSGVSCSSAVCDRL--------------ENAGCHAGRCRYEVSYGDGSYTKGTLALETLT 251
              V C +  C  +              +   C      Y V YG GS T G L  ETL 
Sbjct: 139 SKLVGCQNPKCSWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQYGSGS-TAGLLLSETLD 197

Query: 252 IGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRG--- 308
                + N  +GC   +       +G+ G G GS SL  Q+G +    F+YCL SR    
Sbjct: 198 FPDKXIPNFVVGCSFLSIHQ---PSGIAGFGRGSESLPSQMGLK---KFAYCLASRKFDD 251

Query: 309 TGSSGSLVFGREALP-VGAAWVPLVRNPRA-----PSFYYVGLSGLGVGGMRIPISEDLF 362
           +  SG L+     +   G  + P  +NP         +YY+ +  + VG   + +     
Sbjct: 252 SPHSGQLILDSTGVKSSGLTYTPFRQNPSVSNNAYKEYYYLNIRKIIVGNQAVKVPYKFL 311

Query: 363 RLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFD---TCYNLSG 419
                G+ G ++D+G+  T +  P  E     F  Q  N  RA+ V        C+++S 
Sbjct: 312 VPGPDGNGGSIIDSGSTFTFMDKPVLEVVAREFEKQLANWTRATDVETLTGLRPCFDISK 371

Query: 420 FVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFA-----------PSPSGLSI 468
             SV+ P + F F GG    LP +N+   V  +G  C                 PS   I
Sbjct: 372 EKSVKFPELIFQFKGGAKWALPLNNYFALVSSSGVACLTVVTHQMEDGGGGGGGPS--VI 429

Query: 469 IGNIQQEGIQISFDGANGFVGFGPNVC 495
           +G  QQ+   + +D  N  +GF    C
Sbjct: 430 LGAFQQQNFYVEYDLVNQRLGFRQQTC 456


>gi|225436397|ref|XP_002272121.1| PREDICTED: aspartic proteinase-like protein 2 isoform 2 [Vitis
           vinifera]
          Length = 499

 Score =  122 bits (307), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 108/369 (29%), Positives = 169/369 (45%), Gaps = 35/369 (9%)

Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD---PV--FDPADSASFSG 208
           G YF R+ +GSPP+  Y+ ID+GSD++WV C  C+ C + S    P+  FDP  S++ S 
Sbjct: 81  GLYFTRVLLGSPPKEFYVQIDTGSDVLWVSCGSCNGCPQSSGLHIPLNFFDPGSSSTASL 140

Query: 209 VSCSSAVCD---RLENAGC--HAGRCRYEVSYGDGSYTKGTLALETLT----IGRTVVK- 258
           +SCS   C    +  +AGC     +C Y   YGDGS T G    + L     +G +V   
Sbjct: 141 ISCSDQRCSLGVQSSDAGCSSQGNQCIYTFQYGDGSGTSGYYVSDLLNFDAIVGSSVTNS 200

Query: 259 --NVAIGCGHKNQGMFV----GAAGLLGLGGGSMSLVGQLGGQ--TGGAFSYCLVSRGTG 310
             ++  GC     G          G+ G G   MS++ Q+  Q  T   FS+CL   G G
Sbjct: 201 SASIVFGCSISQTGDLTKSDRAVDGIFGFGQQDMSVISQMSSQGITPKVFSHCLKGDGGG 260

Query: 311 SSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDD 370
               ++   E +     + PLV  P  P  Y + L  + V G  + I  ++F  +   + 
Sbjct: 261 GGILVL--GEIVEEDIVYSPLV--PSQPH-YNLNLQSISVNGKSLAIDPEVFATST--NR 313

Query: 371 GVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSF 430
           G ++D+GT +  L   AY+ F  A         R   +S    CY ++  V    PTVS 
Sbjct: 314 GTIVDSGTTLAYLAEEAYDPFVSAITEAVSQSVRPL-LSKGTQCYLITSSVKGIFPTVSL 372

Query: 431 YFSGGPVLTLPASNFLI---PVDDAGTFCFAFAP-SPSGLSIIGNIQQEGIQISFDGANG 486
            F+GG  + L   ++L+    + DA  +C  F      G++I+G++  +     +D A  
Sbjct: 373 NFAGGVSMNLKPEDYLLQQNSIGDAAVWCIGFQKIQGQGITILGDLVLKDKIFVYDLAGQ 432

Query: 487 FVGFGPNVC 495
            +G+    C
Sbjct: 433 RIGWANYDC 441


>gi|357118398|ref|XP_003560942.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 478

 Score =  122 bits (307), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 116/385 (30%), Positives = 175/385 (45%), Gaps = 52/385 (13%)

Query: 155 EYFVRIGVGSP-PRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSS 213
           EY + + +G+P P+   + +D+GSD+VW QC  C  C+ Q  P FD   S +   V CS 
Sbjct: 99  EYLIHLSIGTPRPQRVALTLDTGSDLVWTQCA-CHVCFAQPFPTFDALASQTTLAVPCSD 157

Query: 214 AVCD--RLENAGC--HAGRCRYEVSYGDGSYTKGTLALETLTI------------GRTVV 257
            +C   +   +GC  +   C Y   Y D S T G +  +T T                 V
Sbjct: 158 PICTSGKYPLSGCTFNDNTCFYLYDYADKSITSGRIVEDTFTFRSPQGNNGSKAHAGVAV 217

Query: 258 KNVAIGCGHKNQGMFV-GAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLV 316
            NV  GCG  N+G+F    +G+ G   G MSL  QL       FS+C  +     +  + 
Sbjct: 218 PNVRFGCGQYNKGIFKSNESGIAGFSRGPMSLPSQL---KVARFSHCFTAIADARTSPVF 274

Query: 317 FGREALP--VGA-AWVPLVRNPRAPS---FYYVGLSGLGVGGMRIPISEDLF--RLTQMG 368
            G    P  +GA A  P+   P A S    YY+ L G+ VG  R+P++   F  + T  G
Sbjct: 275 LGGAPGPDNLGAHATGPVQSTPFANSNGSLYYLTLKGITVGKTRLPLNALAFAGKGTGSG 334

Query: 369 DDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVR---- 424
             G ++D+GT +  LP P Y + R AFVA+   LP A+  S  D    L  F + R    
Sbjct: 335 SGGTIIDSGTGIRTLPGPMYRSLRAAFVARV-KLPVANE-SAADAESTLC-FEAARSASL 391

Query: 425 --------VPTVSFYFSGGPVLTLPASNFLIPV--DDAGT---FCFAF-APSPSGLSIIG 470
                   +P V  + +G     LP  ++++ +  D+ G+    C    +   S L+IIG
Sbjct: 392 PPEAPAPALPKVVLHVAGAD-WDLPRESYVLDLLEDEDGSGSGLCLVMNSAGDSDLTIIG 450

Query: 471 NIQQEGIQISFDGANGFVGFGPNVC 495
           N QQ+ + +++D     + F P  C
Sbjct: 451 NFQQQNMHVAYDLEKNKLVFVPARC 475


>gi|38605896|emb|CAD41523.2| OSJNBb0020O11.8 [Oryza sativa Japonica Group]
          Length = 519

 Score =  122 bits (306), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 117/415 (28%), Positives = 166/415 (40%), Gaps = 93/415 (22%)

Query: 155 EYFVRIGVGSP--PRSQYMVIDSGSDIVWVQCQP--CSQCY-------KQSDPVFDPADS 203
           +Y + + VG P    S  + +D+GSD+VW  C P  C  C          S P+  P DS
Sbjct: 87  DYTLSLSVGPPSTASSVSLFLDTGSDLVWFPCAPFTCMLCEGKATPGGNHSSPLPPPIDS 146

Query: 204 ASFSGVSCSSAVCDRLENAG-----CHAGRCRYEV----------------SYGDGSYTK 242
                +SC+S +C    ++      C A RC  +                 +YGDGS   
Sbjct: 147 RR---ISCASPLCSAAHSSAPTSDLCAAARCPLDAIETDSCASHACPPLYYAYGDGSLVA 203

Query: 243 GTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSY 302
                         V+N    C H      VG AG    G G +SL  QL     G FSY
Sbjct: 204 NLRRGRVGLAASMAVENFTFACAHTALAEPVGVAGF---GRGPLSLPAQLAPSLSGRFSY 260

Query: 303 CLVSRGTGS-----SGSLVFGR--EALPVGAA-----WVPLVRNPRAPSFYYVGLSGLGV 350
           CLV+    +     S  L+ GR  +A  +GA+     + PL+ NP+ P FY V L  + V
Sbjct: 261 CLVAHSFRADRLIRSSPLILGRSTDAAAIGASETDFVYTPLLHNPKHPYFYSVALEAVSV 320

Query: 351 GGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFV-------------- 396
           GG RI    +L  + + G+ G+V+D+GT  T LP+  +    D F               
Sbjct: 321 GGKRIQAQPELGDVDRDGNGGMVVDSGTTFTMLPSDTFARVADEFARAMAAARFTRAEGA 380

Query: 397 -AQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLI---PVDDA 452
            AQTG  P          CY+ S      VP V+ +F G   + LP  N+ +     +  
Sbjct: 381 EAQTGLAP----------CYHYSP-SDRAVPPVALHFRGNATVALPRRNYFMGFKSEEGR 429

Query: 453 GTFCFAFAP------------SPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
              C                  P+G   +GN QQ+G ++ +D   G VGF    C
Sbjct: 430 SVGCLMLMNVGGNNDDGEDGGGPAG--TLGNFQQQGFEVVYDVDAGRVGFARRRC 482


>gi|115484725|ref|NP_001067506.1| Os11g0215400 [Oryza sativa Japonica Group]
 gi|77549255|gb|ABA92052.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113644728|dbj|BAF27869.1| Os11g0215400 [Oryza sativa Japonica Group]
          Length = 428

 Score =  122 bits (306), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 101/368 (27%), Positives = 170/368 (46%), Gaps = 36/368 (9%)

Query: 148 GMDQG--SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDP-VFDPADSA 204
           G D+G  +  Y + +G+G+P ++Q + ID+GS   WV C+ C  C+  ++P  F  + S 
Sbjct: 72  GWDRGLQTSLYVISVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCH--TNPRTFLQSRST 128

Query: 205 SFSGVSCSSAVC-----DRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT-VVK 258
           + + VSC +++C     D       +   C + VSY DGS + G L  +TLT      + 
Sbjct: 129 TCAKVSCGTSMCLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIP 188

Query: 259 NVAIGCGHKNQGM--FVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCL----VSRG--TG 310
             + GC   + G   F    GLLG+G G MS++ Q    T   FSYCL      RG  + 
Sbjct: 189 GFSFGCNMDSFGANEFGNVDGLLGMGAGPMSVLKQ-SSPTFDCFSYCLPLQKSERGFFSK 247

Query: 311 SSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDD 370
           ++G    G+ A      +  +V   +    ++V L+ + V G R+ +S  +F        
Sbjct: 248 TTGYFSLGKVATRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSVF-----SRK 302

Query: 371 GVVMDTGTAVTRLPTPAYEAFRD---AFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPT 427
           GVV D+G+ ++ +P  A           + + G     S       CY++       +P 
Sbjct: 303 GVVFDSGSELSYIPDRALSVLSQRIRELLLKRGAAEEESE----RNCYDMRSVDEGDMPA 358

Query: 428 VSFYFSGGPVLTLPASNFLIP--VDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGAN 485
           +S +F  G    L +    +   V +   +C AFAP+ S +SIIG++ Q   ++ +D   
Sbjct: 359 ISLHFDDGARFDLGSHGVFVERSVQEQDVWCLAFAPTES-VSIIGSLMQTSKEVVYDLKR 417

Query: 486 GFVGFGPN 493
             +G GP+
Sbjct: 418 QLIGIGPS 425


>gi|224065128|ref|XP_002301682.1| predicted protein [Populus trichocarpa]
 gi|222843408|gb|EEE80955.1| predicted protein [Populus trichocarpa]
          Length = 441

 Score =  122 bits (306), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 102/363 (28%), Positives = 159/363 (43%), Gaps = 32/363 (8%)

Query: 157 FVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVC 216
            V + +G+PP++Q M++D+GS + W+QC            VFDP+ S+SFS + C+  +C
Sbjct: 83  LVSLPIGTPPQTQQMILDTGSQLSWIQCHKKVPRKPPPSSVFDPSLSSSFSVLPCNHPLC 142

Query: 217 -----DRLENAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQ 269
                D      C   R C Y   Y DG+  +G L  E +T  R+     + +GC  ++ 
Sbjct: 143 KPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKITFSRSQSTPPLILGCAEESS 202

Query: 270 GMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR----GTGSSGSLVFGREALPVG 325
                A G+LG+  G +S   Q        FSYC+ +R    G   +GS   G      G
Sbjct: 203 ----DAKGILGMNLGRLSFASQ---AKLTKFSYCVPTRQVRPGFTPTGSFYLGENPNSGG 255

Query: 326 AAWVPLV---RNPRAPSF----YYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGT 378
             ++ L+   ++ R P+     Y V + G+ +G  ++ I    FR    G    ++D+G+
Sbjct: 256 FRYINLLTFSQSQRMPNLDPLAYTVAMQGIRIGNQKLNIPISAFRPDPSGAGQTMIDSGS 315

Query: 379 AVTRLPTPAYEAFRDAFVAQTGNLPRASGV--SIFDTCYNLSGFVSVR-VPTVSFYFSGG 435
             T L   AY   R+  V   G   +   V   + D C+N +     R +  + F F  G
Sbjct: 316 EFTYLVDEAYNKVREEVVRLVGARLKKGYVYGGVSDMCFNGNAIEIGRLIGNMVFEFDKG 375

Query: 436 PVLTLPASNFLIPVDDAGTFCFAFAPSP---SGLSIIGNIQQEGIQISFDGANGFVGFGP 492
             + +     L  V   G  C     S    +  +IIGN  Q+ I + FD AN  VGFG 
Sbjct: 376 VEIVVEKERVLADV-GGGVHCVGIGRSEMLGAASNIIGNFHQQNIWVEFDLANRRVGFGK 434

Query: 493 NVC 495
             C
Sbjct: 435 ADC 437


>gi|147801191|emb|CAN68822.1| hypothetical protein VITISV_007106 [Vitis vinifera]
          Length = 443

 Score =  122 bits (306), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 74/199 (37%), Positives = 110/199 (55%), Gaps = 15/199 (7%)

Query: 162 VGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLEN 221
           +G P    Y + D+GS+++W+QC PC+ CY Q+ P+FDPA+S ++  VS  S +C+ +  
Sbjct: 63  LGVPSTLVYGIADTGSELIWLQCLPCTHCYNQTPPIFDPAESYTYETVSSDSPICNAVRR 122

Query: 222 AGCHAG--RCRYEVSYGDGSYTKGTLALETLTI---GRTVVK--NVAIGCGHKNQGMFVG 274
             C  G   C Y+ +YGDG+ TKGTL+ +        RT+V+   +  GC H  +    G
Sbjct: 123 ISCREGDKSCCYQHTYGDGTTTKGTLSTDVFAFEDPTRTIVEVGYLTFGCSHDTKARLKG 182

Query: 275 -AAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGS-LVFGREALPVGAAWVPLV 332
             AG++GL     SLV QL  +    FSYC+V      SGS + FG  A+ +G    PL+
Sbjct: 183 HQAGVVGLNRHPNSLVSQLKVK---KFSYCMVIPDDHGSGSRMYFGSRAVILGGK-TPLL 238

Query: 333 RNPRAPSFYYVGLSGLGVG 351
           +     S Y+V L G+ VG
Sbjct: 239 KGDY--SHYFVTLKGISVG 255



 Score = 62.8 bits (151), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 37/117 (31%), Positives = 60/117 (51%), Gaps = 9/117 (7%)

Query: 182 VQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGR--CRYEVSYGDGS 239
           ++ Q  +QC+ Q+ P+FDP+ S+++S V   +  C +     CH     C Y +SYG GS
Sbjct: 326 LEAQEVAQCFNQTPPIFDPSKSSTYSTVPWDAPTCYQAGGYACHIDEEDCCYRISYGSGS 385

Query: 240 Y-TKGTLALETLTI-----GRTVVKNVAIGCGHKNQGMFVG-AAGLLGLGGGSMSLV 289
             T+GT++++             V ++  GC     G F G   G++GL   S+SLV
Sbjct: 386 TSTEGTISIDAFAFEDNRQNMVDVXHLVFGCSDYTTGTFKGYEVGIVGLNQDSLSLV 442


>gi|449437856|ref|XP_004136706.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
          Length = 457

 Score =  122 bits (306), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 103/387 (26%), Positives = 158/387 (40%), Gaps = 54/387 (13%)

Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQP---CSQC-YKQSDPV----FDPADSAS 205
           G Y   +  G+P ++ +++ D+GS +VW  C     CS+C + + DP     F P  S+S
Sbjct: 79  GAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSFPKIDPTGIPRFVPKLSSS 138

Query: 206 FSGVSCSSAVCDRL--------------ENAGCHAGRCRYEVSYGDGSYTKGTLALETLT 251
              V C +  C  +              +   C      Y V YG GS T G L  ETL 
Sbjct: 139 SKLVGCQNPKCSWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQYGSGS-TAGLLLSETLD 197

Query: 252 IGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRG--- 308
                + N  +GC   +       +G+ G G GS SL  Q+G +    F+YCL SR    
Sbjct: 198 FPDKKIPNFVVGCSFLSIHQ---PSGIAGFGRGSESLPSQMGLK---KFAYCLASRKFDD 251

Query: 309 TGSSGSLVFGREALP-VGAAWVPLVRNPRA-----PSFYYVGLSGLGVGGMRIPISEDLF 362
           +  SG L+     +   G  + P  +NP         +YY+ +  + VG   + +     
Sbjct: 252 SPHSGQLILDSTGVKSSGLTYTPFRQNPSVSNNAYKEYYYLNIRKIIVGNQAVKVPYKFL 311

Query: 363 RLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFD---TCYNLSG 419
                G+ G ++D+G+  T +  P  E     F  Q  N  RA+ V        C+++S 
Sbjct: 312 VPGPDGNGGSIIDSGSTFTFMDKPVLEVVAREFEKQLANWTRATDVETLTGLRPCFDISK 371

Query: 420 FVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFA-----------PSPSGLSI 468
             SV+ P + F F GG    LP +N+   V  +G  C                 PS   I
Sbjct: 372 EKSVKFPELIFQFKGGAKWALPLNNYFALVSSSGVACLTVVTHQMEDGGGGGGGPS--VI 429

Query: 469 IGNIQQEGIQISFDGANGFVGFGPNVC 495
           +G  QQ+   + +D  N  +GF    C
Sbjct: 430 LGAFQQQNFYVEYDLVNQRLGFRQQTC 456


>gi|356495496|ref|XP_003516613.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 645

 Score =  122 bits (306), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 95/368 (25%), Positives = 155/368 (42%), Gaps = 47/368 (12%)

Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
           +G Y  R+ +G+PP+   +++D+GS + +V C  C  C    DP F P DS ++  V C+
Sbjct: 90  NGYYTARLWIGTPPQRFALIVDTGSTVTYVPCSTCRHCGSHQDPKFRPEDSETYQPVKCT 149

Query: 213 SAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTV---VKNVAIGCGHKNQ 269
                   N      +C YE  Y + S + G L  + ++ G       +    GC +   
Sbjct: 150 WQC-----NCDNDRKQCTYERRYAEMSTSSGALGEDVVSFGNQTELSPQRAIFGCENDET 204

Query: 270 GMFVG--AAGLLGLGGGSMSLVGQLGGQT--GGAFSYCLVSR----------GTGSSGSL 315
           G      A G++GLG G +S++ QL  +     +FS C              G      +
Sbjct: 205 GDIYNQRADGIMGLGRGDLSIMDQLVEKKVISDSFSLCYGGMGVGGGAMVLGGISPPADM 264

Query: 316 VFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMD 375
           VF R            VR+P    +Y + L  + V G R+ ++  +F     G  G V+D
Sbjct: 265 VFTRSD---------PVRSP----YYNIDLKEIHVAGKRLHLNPKVFD----GKHGTVLD 307

Query: 376 TGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVS--IFDTCYNLSGF----VSVRVPTVS 429
           +GT    LP  A+ AF+ A + +T +L R SG      D C++ +      +S   P V 
Sbjct: 308 SGTTYAYLPESAFLAFKHAIMKETHSLKRISGPDPRYNDICFSGAEIDVSQISKSFPVVE 367

Query: 430 FYFSGGPVLTLPASNFLIPVDDA-GTFCF-AFAPSPSGLSIIGNIQQEGIQISFDGANGF 487
             F  G  L+L   N+L       G +C   F+      +++G I      + +D  +  
Sbjct: 368 MVFGNGHKLSLSPENYLFRHSKVRGAYCLGVFSNGNDPTTLLGGIVVRNTLVMYDREHTK 427

Query: 488 VGFGPNVC 495
           +GF    C
Sbjct: 428 IGFWKTNC 435


>gi|18409620|ref|NP_566966.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|13430562|gb|AAK25903.1|AF360193_1 unknown protein [Arabidopsis thaliana]
 gi|4886277|emb|CAB43423.1| putative protein [Arabidopsis thaliana]
 gi|14532764|gb|AAK64083.1| unknown protein [Arabidopsis thaliana]
 gi|15450892|gb|AAK96717.1| Unknown protein [Arabidopsis thaliana]
 gi|30387567|gb|AAP31949.1| At3g52500 [Arabidopsis thaliana]
 gi|332645431|gb|AEE78952.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 469

 Score =  122 bits (306), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 114/394 (28%), Positives = 162/394 (41%), Gaps = 66/394 (16%)

Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQP---CSQC-YKQSDPV----FDPADSAS 205
           G Y V +  G+P ++   V D+GS +VW+ C     CS C +   DP     F P +S+S
Sbjct: 88  GGYSVSLSFGTPSQTIPFVFDTGSSLVWLPCTSRYLCSGCDFSGLDPTLIPRFIPKNSSS 147

Query: 206 FSGVSCSSAVCDRL------------ENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIG 253
              + C S  C  L                C  G   Y + YG GS T G L  E L   
Sbjct: 148 SKIIGCQSPKCQFLYGPNVQCRGCDPNTRNCTVGCPPYILQYGLGS-TAGVLITEKLDFP 206

Query: 254 RTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR------ 307
              V +  +GC   +       AG+ G G G +SL  Q+  +    FS+CLVSR      
Sbjct: 207 DLTVPDFVVGCSIISTRQ---PAGIAGFGRGPVSLPSQMNLK---RFSHCLVSRRFDDTN 260

Query: 308 -------GTGS---SGSLVFGREALPVGAAWVPLVRNPRAPS-----FYYVGLSGLGVGG 352
                   TGS   SGS          G  + P  +NP   +     +YY+ L  + VG 
Sbjct: 261 VTTDLDLDTGSGHNSGSKT-------PGLTYTPFRKNPNVSNKAFLEYYYLNLRRIYVGR 313

Query: 353 MRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSI-- 410
             + I          GD G ++D+G+  T +  P +E   + F +Q  N  R   +    
Sbjct: 314 KHVKIPYKYLAPGTNGDGGSIVDSGSTFTFMERPVFELVAEEFASQMSNYTREKDLEKET 373

Query: 411 -FDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAP----SPSG 465
               C+N+SG   V VP + F F GG  L LP SN+   V +  T C         +PSG
Sbjct: 374 GLGPCFNISGKGDVTVPELIFEFKGGAKLELPLSNYFTFVGNTDTVCLTVVSDKTVNPSG 433

Query: 466 LS----IIGNIQQEGIQISFDGANGFVGFGPNVC 495
            +    I+G+ QQ+   + +D  N   GF    C
Sbjct: 434 GTGPAIILGSFQQQNYLVEYDLENDRFGFAKKKC 467


>gi|297801286|ref|XP_002868527.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297314363|gb|EFH44786.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 444

 Score =  122 bits (305), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 102/365 (27%), Positives = 162/365 (44%), Gaps = 35/365 (9%)

Query: 158 VRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPV--FDPADSASFSGVSCSSAV 215
           + + +G+P +SQ +V+D+GS + W+QC P         P   FDP+ S+SFS + CS  +
Sbjct: 83  LSLPIGTPSQSQELVLDTGSQLSWIQCHPKKIKKPLPPPTTSFDPSLSSSFSDLPCSHPL 142

Query: 216 C-----DRLENAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKN 268
           C     D      C + R C Y   Y DG++ +G L  E  T   +     + +GC  ++
Sbjct: 143 CKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKFTFSNSQTTPPLILGCAKES 202

Query: 269 QGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR----GTGSSGSLVFGREALPV 324
                   G+LG+  G +S + Q        FSYC+ +R    G  S+GS   G      
Sbjct: 203 ----TDVKGILGMNLGRLSFISQ---AKISKFSYCIPTRSNRPGLASTGSFYLGENPNSR 255

Query: 325 GAAWVPLVRNPRA-------PSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTG 377
           G  +V L+  P++       P  Y V L G+ +G  R+ I   +FR    G    ++D+G
Sbjct: 256 GFKYVSLLTFPQSQRMPNLDPLAYTVPLLGIRIGQKRLNIPSSVFRPDAGGSGQTMVDSG 315

Query: 378 TAVTRLPTPAYEAFRDAFVAQTGNLPRASGV--SIFDTCY--NLSGFVSVRVPTVSFYFS 433
           +  T L   AY+  ++  V   G+  +   V  S  D C+  N    +   +  + F F 
Sbjct: 316 SEFTHLVDVAYDKVKEEIVRLVGSRLKKGYVYGSTADMCFDGNHQMVIGRLIGDLVFEFG 375

Query: 434 GGPVLTLPASNFLIPVDDAGTFCFAFAPSP---SGLSIIGNIQQEGIQISFDGANGFVGF 490
            G  + +     L+ V   G  C     S    +  +IIGN+ Q+ + + FD AN  VGF
Sbjct: 376 RGVEILVEKQRLLVNV-GGGIHCVGIGRSSMLGAASNIIGNVHQQNLWVEFDVANRRVGF 434

Query: 491 GPNVC 495
               C
Sbjct: 435 SKAEC 439


>gi|21717160|gb|AAM76353.1|AC074196_11 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433304|gb|AAP54833.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
 gi|125575544|gb|EAZ16828.1| hypothetical protein OsJ_32300 [Oryza sativa Japonica Group]
          Length = 419

 Score =  122 bits (305), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 107/371 (28%), Positives = 158/371 (42%), Gaps = 48/371 (12%)

Query: 156 YFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC--SQCYKQSDPVFDPADSASFSGVSCSS 213
           Y     +G+PP++   ++D   ++VW QC  C  S C+KQ  PVFDP+ S ++    C S
Sbjct: 62  YVANFTIGTPPQAVSGIVDLSGELVWTQCAACRSSGCFKQELPVFDPSASNTYRAEQCGS 121

Query: 214 AVCDRLENAGCHA-GRCRYEVS--YGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQG 270
            +C  +    C   G C YE    +GD   T G  + + + IG    + +A GC   + G
Sbjct: 122 PLCKSIPTRNCSGDGECGYEAPSMFGD---TFGIASTDAIAIGNAEGR-LAFGCVVASDG 177

Query: 271 MFVGA----AGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGA 326
              GA    +G +GLG    SLVGQ       AFSYCL   G G   +L  G  A   GA
Sbjct: 178 SIDGAMDGPSGFVGLGRTPWSLVGQ---SNVTAFSYCLALHGPGKKSALFLGASAKLAGA 234

Query: 327 AWVPLVRNPRAP---------------SFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDG 371
                  NP  P                +Y V L G+  G + +  +        +    
Sbjct: 235 G----KSNPPTPLLGQHASNTSDDGSDPYYTVQLEGIKAGDVAVAAASSGGGAITV---- 286

Query: 372 VVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFY 431
           + ++T   ++ LP  AY+A      A  G+   A+    FD C+  +      VP + F 
Sbjct: 287 LQLETFRPLSYLPDAAYQALEKVVTAALGSPSMANPPEPFDLCFQNAAVSG--VPDLVFT 344

Query: 432 FSGGPVLTLPASNFLIPVDDA-GTFCFAFAPSP------SGLSIIGNIQQEGIQISFDGA 484
           F GG  LT   S +L+   +  GT C +   S        G+SI+G++ QE +   FD  
Sbjct: 345 FQGGATLTAQPSKYLLGDGNGNGTVCLSILSSTRLDSADDGVSILGSLLQENVHFLFDLE 404

Query: 485 NGFVGFGPNVC 495
              + F P  C
Sbjct: 405 KETLSFEPADC 415


>gi|115459640|ref|NP_001053420.1| Os04g0535200 [Oryza sativa Japonica Group]
 gi|113564991|dbj|BAF15334.1| Os04g0535200 [Oryza sativa Japonica Group]
 gi|116310090|emb|CAH67110.1| H0502G05.1 [Oryza sativa Indica Group]
 gi|116310464|emb|CAH67468.1| OSIGBa0159I10.13 [Oryza sativa Indica Group]
 gi|215715343|dbj|BAG95094.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215765807|dbj|BAG87504.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215767550|dbj|BAG99778.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|218195278|gb|EEC77705.1| hypothetical protein OsI_16781 [Oryza sativa Indica Group]
          Length = 492

 Score =  122 bits (305), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 117/415 (28%), Positives = 166/415 (40%), Gaps = 93/415 (22%)

Query: 155 EYFVRIGVGSP--PRSQYMVIDSGSDIVWVQCQP--CSQCY-------KQSDPVFDPADS 203
           +Y + + VG P    S  + +D+GSD+VW  C P  C  C          S P+  P DS
Sbjct: 87  DYTLSLSVGPPSTASSVSLFLDTGSDLVWFPCAPFTCMLCEGKATPGGNHSSPLPPPIDS 146

Query: 204 ASFSGVSCSSAVCDRLENAG-----CHAGRCRYEV----------------SYGDGSYTK 242
                +SC+S +C    ++      C A RC  +                 +YGDGS   
Sbjct: 147 RR---ISCASPLCSAAHSSAPTSDLCAAARCPLDAIETDSCASHACPPLYYAYGDGSLVA 203

Query: 243 GTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSY 302
                         V+N    C H      VG AG    G G +SL  QL     G FSY
Sbjct: 204 NLRRGRVGLAASMAVENFTFACAHTALAEPVGVAGF---GRGPLSLPAQLAPSLSGRFSY 260

Query: 303 CLVSRGTGS-----SGSLVFGR--EALPVGAA-----WVPLVRNPRAPSFYYVGLSGLGV 350
           CLV+    +     S  L+ GR  +A  +GA+     + PL+ NP+ P FY V L  + V
Sbjct: 261 CLVAHSFRADRLIRSSPLILGRSTDAAAIGASETDFVYTPLLHNPKHPYFYSVALEAVSV 320

Query: 351 GGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFV-------------- 396
           GG RI    +L  + + G+ G+V+D+GT  T LP+  +    D F               
Sbjct: 321 GGKRIQAQPELGDVDRDGNGGMVVDSGTTFTMLPSDTFARVADEFARAMAAARFTRAEGA 380

Query: 397 -AQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLI---PVDDA 452
            AQTG  P          CY+ S      VP V+ +F G   + LP  N+ +     +  
Sbjct: 381 EAQTGLAP----------CYHYSP-SDRAVPPVALHFRGNATVALPRRNYFMGFKSEEGR 429

Query: 453 GTFCFAFAP------------SPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
              C                  P+G   +GN QQ+G ++ +D   G VGF    C
Sbjct: 430 SVGCLMLMNVGGNNDDGEDGGGPAG--TLGNFQQQGFEVVYDVDAGRVGFARRRC 482


>gi|357476337|ref|XP_003608454.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355509509|gb|AES90651.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 683

 Score =  122 bits (305), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 93/360 (25%), Positives = 159/360 (44%), Gaps = 31/360 (8%)

Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
           +G Y  R+ +G+PP+   +++D+GS + +V C  C QC +  DP F P  S+++  V C+
Sbjct: 78  NGYYTTRLWIGTPPQMFALIVDTGSTVTYVPCSTCEQCGRHQDPKFQPDLSSTYQPVKCT 137

Query: 213 SAVCDRLENAGCHAGR--CRYEVSYGDGSYTKGTLALETLTIG---RTVVKNVAIGCGHK 267
                   +  C   R  C YE  Y + S + G L  + ++ G       +    GC + 
Sbjct: 138 L-------DCNCDNDRMQCVYERQYAEMSTSSGVLGEDVVSFGNQSELAPQRAVFGCENV 190

Query: 268 NQGMFVG--AAGLLGLGGGSMSLVGQLGGQT--GGAFSYCLVSRGTGSSGSLVFGREALP 323
             G      A G++GLG G +S++ QL  +     +FS C      G  G++V G  + P
Sbjct: 191 ETGDLYSQHADGIMGLGRGDLSIMDQLVDKNVVSDSFSLCYGGMDVG-GGAMVLGGISPP 249

Query: 324 VGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRL 383
               +     +P    +Y + L  + V G R+P++  +F     G  G V+D+GT    L
Sbjct: 250 SDMVFA--QSDPVRSPYYNIDLKEIHVAGKRLPLNPSVFD----GKHGSVLDSGTTYAYL 303

Query: 384 PTPAYEAFRDAFVAQTGNLPRASG--VSIFDTCYNLSGF----VSVRVPTVSFYFSGGPV 437
           P  A+ AF++A V +  +  + SG   +  D C++ +G     +S   P V   F  G  
Sbjct: 304 PEEAFLAFKEAIVKELQSFSQISGPDPNYNDLCFSGAGIDVSQLSKTFPVVDMIFGNGHK 363

Query: 438 LTLPASNFLIPVDDA-GTFCFA-FAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
            +L   N++       G +C   F       +++G I      + +D     +GF    C
Sbjct: 364 YSLSPENYMFRHSKVRGAYCLGIFQNGKDPTTLLGGIVVRNTLVLYDREQTKIGFWKTNC 423


>gi|380719867|gb|AFD63134.1| aspartyl protease [Vitis quinquangularis]
          Length = 458

 Score =  121 bits (304), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 104/388 (26%), Positives = 167/388 (43%), Gaps = 60/388 (15%)

Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQP---CSQCY---KQSDPVFDPADSASFS 207
           G + + +  G+PP+    ++D+GS +VW  C     C+ C     +  P+F+P  S+S  
Sbjct: 85  GAHTIPLSFGTPPQKLSFLMDTGSHVVWAPCTTHYTCTNCSFSNPKKVPIFNPELSSSDK 144

Query: 208 GVSCSSAVCDRLENAGCHAG--RC------------RYEVSYGDGSYTKGTLALETLTIG 253
            + C    C    +   H G  RC            +Y + YG G+   G   LE L   
Sbjct: 145 ILGCRDPKCADTSSPBVHLGXPRCNGNSKKCSHACPQYTLQYGTGA-ASGFFLLENLDFP 203

Query: 254 RTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRG---TG 310
              +    +GC   +      +  L G G    SL  Q+G +    F+YCL S     T 
Sbjct: 204 GKTIHKFLVGCT-TSADREPSSDALAGFGRTMFSLPMQMGVK---KFAYCLNSHDYDDTR 259

Query: 311 SSGSLVFG-REALPVGAAWVPLVRNPR-APSFYYVGLSGLGVGG--MRIPISEDLFRLTQ 366
           +SG L+    +    G ++ P  +NP   P +YY+G+  + +G   +RIP       LT 
Sbjct: 260 NSGKLILDYSDGETQGLSYAPFXKNPPDYPIYYYLGVKDMKIGNKVLRIPGK----YLTP 315

Query: 367 MGDD--GVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRA------SGVSIFDTCYNLS 418
             D   GVV+D+G A + +  P ++   +    Q     R+      +GV+    CYN +
Sbjct: 316 GSDSRGGVVIDSGFAYSYMTLPVFKIVTNELKKQMSKYRRSLELEAQTGVT---PCYNFT 372

Query: 419 GFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFA-----------FAPSPSGLS 467
           G  S+++P + + F+GG  + +P  N+ +   +A   CF            F P PS   
Sbjct: 373 GHKSIKIPDLIYQFTGGANMVVPGMNYFLLFSEASLGCFPVTTDSPTSNLEFTPGPS--I 430

Query: 468 IIGNIQQEGIQISFDGANGFVGFGPNVC 495
           I+GN QQ    + FD  N  +GF    C
Sbjct: 431 ILGNYQQVDHYVEFDLKNERLGFRQQTC 458


>gi|255634819|gb|ACU17770.1| unknown [Glycine max]
          Length = 354

 Score =  121 bits (304), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 98/338 (28%), Positives = 156/338 (46%), Gaps = 39/338 (11%)

Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFSG 208
           G Y+ ++ +G+PP    + ID+GSD++WV C  CS C + S        FDP  S++ S 
Sbjct: 23  GLYYTKVQLGTPPVEFNVQIDTGSDVLWVSCNSCSGCPQTSGLQIQLNFFDPGSSSTSSM 82

Query: 209 VSCSSAVCD---RLENAGCHA--GRCRYEVSYGDGSYTKG-----TLALETLTIGRTVVK 258
           ++CS   C+   +  +A C +   +C Y   YGDGS T G      + L T+  G     
Sbjct: 83  IACSDQRCNNGIQSSDATCSSQNNQCSYTFQYGDGSGTSGYYVSDMMHLNTIFEGSVTTN 142

Query: 259 N---VAIGCGHKNQGMFV----GAAGLLGLGGGSMSLVGQLGGQ--TGGAFSYCLVSRGT 309
           +   V  GC ++  G          G+ G G   MS++ QL  Q      FS+CL  +G 
Sbjct: 143 STAPVVFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRVFSHCL--KGD 200

Query: 310 GSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGD 369
            S G ++   E +     +  LV  P  P  Y + L  + V G  + I   +F  +    
Sbjct: 201 SSGGGILVLGEIVEPNIVYTSLV--PAQP-HYNLNLQSIAVNGQTLQIDSSVFATSN--S 255

Query: 370 DGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRA--SGVSIFDTCYNLSGFVSVRVPT 427
            G ++D+GT +  L   AY+ F  A    T ++P++  + VS  + CY ++  V+   P 
Sbjct: 256 RGTIVDSGTTLAYLAEEAYDPFVSAI---TASIPQSVHTAVSRGNQCYLITSSVTEVFPQ 312

Query: 428 VSFYFSGGPVLTLPASNFLI---PVDDAGTFCFAFAPS 462
           VS  F+GG  + L   ++LI    +  A  +C  F  S
Sbjct: 313 VSLNFAGGASMILRPQDYLIQQNSIGGAAVWCIGFQKS 350


>gi|212274314|ref|NP_001130524.1| uncharacterized protein LOC100191623 [Zea mays]
 gi|194689376|gb|ACF78772.1| unknown [Zea mays]
 gi|224031455|gb|ACN34803.1| unknown [Zea mays]
 gi|238011528|gb|ACR36799.1| unknown [Zea mays]
 gi|238015454|gb|ACR38762.1| unknown [Zea mays]
          Length = 304

 Score =  121 bits (304), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 94/309 (30%), Positives = 140/309 (45%), Gaps = 27/309 (8%)

Query: 209 VSCSSAVCDRLENAGCH-AGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKN-------V 260
           + C+  +C  + +  C     C Y  +YGDG+ T G  A E  T   +           +
Sbjct: 1   MRCAGTLCSDILHHSCERPDTCTYRYNYGDGTMTVGVYATERFTFASSGGGGLTTTTVPL 60

Query: 261 AIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGRE 320
             GCG  N G     +G++G G   +SLV QL  +    FSYCL S  +    +L+FG  
Sbjct: 61  GFGCGSVNVGSLNNGSGIVGFGRNPLSLVSQLSIRR---FSYCLTSYASRRQSTLLFGSL 117

Query: 321 ALPV------GAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVM 374
           +  V           PL+++P+ P+FYYV  +GL VG  R+ I E  F L   G  GV++
Sbjct: 118 SDGVYGDATGRVQTTPLLQSPQNPTFYYVHFTGLTVGARRLRIPESAFALRPDGSGGVIV 177

Query: 375 DTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFD-TCYNL-------SGFVSVRVP 426
           D+GTA+T LP         AF  Q   LP A+G +  D  C+ +       S    + VP
Sbjct: 178 DSGTALTLLPAAVLAEVVRAFRQQL-RLPFANGGNPEDGVCFLVPAAWRRSSSTSQMPVP 236

Query: 427 TVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANG 486
            +  +F G   L LP  N+++     G  C   A S    S IGN+ Q+ +++ +D    
Sbjct: 237 RMVLHFQGAD-LDLPRRNYVLDDHRRGRLCLLLADSGDDGSTIGNLVQQDMRVLYDLEAE 295

Query: 487 FVGFGPNVC 495
            +   P  C
Sbjct: 296 TLSIAPARC 304


>gi|226495677|ref|NP_001146995.1| pepsin A precursor [Zea mays]
 gi|195606284|gb|ACG24972.1| pepsin A [Zea mays]
          Length = 504

 Score =  121 bits (304), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 116/396 (29%), Positives = 156/396 (39%), Gaps = 79/396 (19%)

Query: 171 MVIDSGSDIVWVQCQP--CSQCYKQ-----SDPVFDPADSASFSGVSCSSAVCDRLENAG 223
           + +D+GSD+VW  C P  C  C  +     S P+  P DS     + C+S +C     + 
Sbjct: 107 LFLDTGSDLVWFPCAPFTCMLCEGKPTPGRSGPLPPPPDSRR---IPCASPLCSAAHASA 163

Query: 224 -----CHAGRCRYE-----------------VSYGDGSYT----KGTLALETLTIGRTVV 257
                C A RC  E                  +YGDGS      +G +AL         V
Sbjct: 164 PPSDLCAAARCPLEDIETGSCGASHACPPLYYAYGDGSLVAHLRRGRVALGAGARASVAV 223

Query: 258 --KNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGS---- 311
              N    C H   G  VG AG    G G +SL GQL  Q  G FSYCLVS    +    
Sbjct: 224 AVDNFTFACAHTALGEPVGVAGF---GRGPLSLPGQLSPQLSGRFSYCLVSHSFRADRLI 280

Query: 312 -SGSLVFGREALPV--------GAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLF 362
               L+ GR             G  + PL+ NP+ P FY V L  + VG  RI    +L 
Sbjct: 281 RPSPLILGRSPDDADAAAAETDGFVYTPLLHNPKHPYFYSVALEAVSVGAARIQARPELA 340

Query: 363 RLTQMGDDGVVMDTGTAVTRLPTPAYE-----AFRDAFVAQTGNLPRASGVSIFDTCYNL 417
           R+ + G+ G+V+D+GT  T LP   Y        R    A      RA   +    CY  
Sbjct: 341 RVDRAGNGGMVVDSGTTFTMLPNEMYARVAEAFARAMAAAGFARAERAEEQTGLTPCYRY 400

Query: 418 SGFVSVR-VPTVSFYFSGGPVLTLPASNFLIPV-----------DDAGTFCFAFAPSPSG 465
           +   S R VP ++ +F G   + LP  N+ +             DD G          SG
Sbjct: 401 A--ASDRGVPPLALHFRGNATVALPRRNYFMGFKSEDAGAGTRKDDVGCLMLMNGGDASG 458

Query: 466 ------LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
                    +GN QQ+G ++ +D   G VGF    C
Sbjct: 459 EEGDGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRC 494


>gi|367066697|gb|AEX12632.1| hypothetical protein 2_5918_01 [Pinus taeda]
 gi|367066699|gb|AEX12633.1| hypothetical protein 2_5918_01 [Pinus taeda]
 gi|367066701|gb|AEX12634.1| hypothetical protein 2_5918_01 [Pinus taeda]
 gi|367066703|gb|AEX12635.1| hypothetical protein 2_5918_01 [Pinus taeda]
 gi|367066705|gb|AEX12636.1| hypothetical protein 2_5918_01 [Pinus taeda]
 gi|367066707|gb|AEX12637.1| hypothetical protein 2_5918_01 [Pinus taeda]
 gi|367066709|gb|AEX12638.1| hypothetical protein 2_5918_01 [Pinus taeda]
 gi|367066711|gb|AEX12639.1| hypothetical protein 2_5918_01 [Pinus taeda]
 gi|367066713|gb|AEX12640.1| hypothetical protein 2_5918_01 [Pinus taeda]
 gi|367066715|gb|AEX12641.1| hypothetical protein 2_5918_01 [Pinus taeda]
 gi|367066717|gb|AEX12642.1| hypothetical protein 2_5918_01 [Pinus taeda]
          Length = 137

 Score =  121 bits (304), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 50/127 (39%), Positives = 81/127 (63%)

Query: 144 DVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADS 203
           DV + +  G+GE+ +++ +G P  +   ++D+GSD+ W QC PCS CYKQ  P++DP+ S
Sbjct: 9   DVQAPVSAGNGEFLMQLAIGKPSLAYSAILDTGSDLTWTQCMPCSDCYKQPTPIYDPSLS 68

Query: 204 ASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIG 263
           +++  VSC S++C  L  + C +  C Y  +YGD S T+G L+ ET T+    + ++A G
Sbjct: 69  STYGTVSCKSSLCLALPASACISATCEYLYTYGDYSSTQGILSYETFTLSSQSIPHIAFG 128

Query: 264 CGHKNQG 270
           CG  N+G
Sbjct: 129 CGQDNEG 135


>gi|302769978|ref|XP_002968408.1| hypothetical protein SELMODRAFT_89951 [Selaginella moellendorffii]
 gi|300164052|gb|EFJ30662.1| hypothetical protein SELMODRAFT_89951 [Selaginella moellendorffii]
          Length = 492

 Score =  121 bits (304), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 96/364 (26%), Positives = 157/364 (43%), Gaps = 42/364 (11%)

Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFS----GV 209
           G Y  R+ +G+PP    +++D+GS + +V C  C+ C    DP F PA S+S+     G 
Sbjct: 33  GYYTSRVKIGTPPHEFSLIVDTGSTVTYVPCSSCTHCGNHQDPRFSPALSSSYKPLECGS 92

Query: 210 SCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVV---KNVAIGCGH 266
            CS+  CD         G  +Y+  Y + S + G L  + +    +     + +  GC  
Sbjct: 93  ECSTGFCD---------GSRKYQRQYAEKSTSSGVLGKDVIGFSNSSDLGGQRLVFGCET 143

Query: 267 KNQGMFVG--AAGLLGLGGGSMSLVGQLGGQTG--GAFSYCLVSRGTGSSGSLVFGREAL 322
              G      A G++GLG G +S++ QL  +      FS C      G  G+++ G    
Sbjct: 144 AETGDLYDQTADGIIGLGRGPLSIIDQLVEKNAMEDVFSLCYGGMDEG-GGAMILGGFQP 202

Query: 323 PVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTR 382
           P    +     +P    +Y + L G+ VGG  + +  ++F     G  G V+D+GT    
Sbjct: 203 PKDMVFT--ASDPHRSPYYNLMLKGIRVGGSPLRLKPEVFD----GKYGTVLDSGTTYAY 256

Query: 383 LPTPAYEAFRDAFVAQTGNLPRASG--VSIFDTCY--------NLSGFVSVRVPTVSFYF 432
            P  A++AF+ A   Q G+L    G      D CY        NLS F     P+V F F
Sbjct: 257 FPGAAFQAFKSAVKEQVGSLKEVPGPDEKFKDICYAGAGTNVSNLSQF----FPSVDFVF 312

Query: 433 SGGPVLTLPASNFLIP-VDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFG 491
             G  +TL   N+L      +G +C     +    +++G I    + ++++     +GF 
Sbjct: 313 GDGQSVTLSPENYLFRHTKISGAYCLGVFENGDPTTLLGGIIVRNMLVTYNRGKASIGFL 372

Query: 492 PNVC 495
              C
Sbjct: 373 KTKC 376


>gi|125533812|gb|EAY80360.1| hypothetical protein OsI_35532 [Oryza sativa Indica Group]
          Length = 428

 Score =  121 bits (303), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 101/368 (27%), Positives = 171/368 (46%), Gaps = 36/368 (9%)

Query: 148 GMDQG--SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDP-VFDPADSA 204
           G D+G  +  Y + +G+G+P ++Q + ID+GS   WV C+ C  C+  ++P  F  + S 
Sbjct: 72  GWDRGLQTSLYVISVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCH--TNPRTFLQSRST 128

Query: 205 SFSGVSCSSAVC-----DRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT-VVK 258
           + + VSC +++C     D       +   C + VSY DGS + G L  +TLT      + 
Sbjct: 129 TCAKVSCGTSMCLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIP 188

Query: 259 NVAIGCGHKNQGM--FVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCL----VSRG--TG 310
           +   GC   + G   F    GLLG+G G MS++ Q   +  G FSYCL      RG  + 
Sbjct: 189 SFTFGCNLDSFGANEFGNVDGLLGMGAGPMSVLKQSSPRFDG-FSYCLPLQKSERGFFSK 247

Query: 311 SSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDD 370
           ++G    G+ A      +  +V   +    ++V L+ + V G R+ +S  +F        
Sbjct: 248 TTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIF-----SRK 302

Query: 371 GVVMDTGTAVTRLPTPAYEAFRD---AFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPT 427
           GVV D+G+ ++ +P  A           + + G     S       CY++       +P 
Sbjct: 303 GVVFDSGSELSYIPDRALSVLSQRIRELLLRRGAAEEESE----RNCYDMRSVDEGDMPA 358

Query: 428 VSFYFSGGPVLTLPASNFLIP--VDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGAN 485
           +S +F  G    L +    +   V +   +C AFAP+ S +SIIG++ Q   ++ +D   
Sbjct: 359 ISLHFDDGARFDLGSHGVFVERSVQEQDVWCLAFAPTES-VSIIGSLMQTSKEVVYDLKR 417

Query: 486 GFVGFGPN 493
             +G GP+
Sbjct: 418 QLIGIGPS 425


>gi|356543524|ref|XP_003540210.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 493

 Score =  121 bits (303), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 103/370 (27%), Positives = 167/370 (45%), Gaps = 36/370 (9%)

Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFSG 208
           G Y+ ++ +G+PP    + ID+GSD++WV C  C+ C + S        FDP  S++ S 
Sbjct: 76  GLYYTKVQLGTPPVEFNVQIDTGSDVLWVSCNSCNGCPQTSGLQIQLNFFDPGSSSTSSM 135

Query: 209 VSCSSAVCD---RLENAGCHA--GRCRYEVSYGDGSYTKG-----TLALETLTIGRTVVK 258
           ++CS   C+   +  +A C +   +C Y   YGDGS T G      + L T+  G     
Sbjct: 136 IACSDQRCNNGKQSSDATCSSQNNQCSYTFQYGDGSGTSGYYVSDMMHLNTIFEGSMTTN 195

Query: 259 N---VAIGCGHKNQGMFV----GAAGLLGLGGGSMSLVGQLGGQ--TGGAFSYCLVSRGT 309
           +   V  GC ++  G          G+ G G   MS++ QL  Q      FS+CL  +G 
Sbjct: 196 STAPVVFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRIFSHCL--KGD 253

Query: 310 GSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGD 369
            S G ++   E +     +  LV  P  P  Y + L  + V G  + I   +F  +    
Sbjct: 254 SSGGGILVLGEIVEPNIVYTSLV--PAQP-HYNLNLQSISVNGQTLQIDSSVFATSN--S 308

Query: 370 DGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVS 429
            G ++D+GT +  L   AY+ F  A  A      R   VS  + CY ++  V+   P VS
Sbjct: 309 RGTIVDSGTTLAYLAEEAYDPFVSAITAAIPQSVRTV-VSRGNQCYLITSSVTDVFPQVS 367

Query: 430 FYFSGGPVLTLPASNFLI---PVDDAGTFCFAFAP-SPSGLSIIGNIQQEGIQISFDGAN 485
             F+GG  + L   ++LI    +  A  +C  F      G++I+G++  +   + +D A 
Sbjct: 368 LNFAGGASMILRPQDYLIQQNSIGGAAVWCIGFQKIQGQGITILGDLVLKDKIVVYDLAG 427

Query: 486 GFVGFGPNVC 495
             +G+    C
Sbjct: 428 QRIGWANYDC 437


>gi|225440722|ref|XP_002275223.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
 gi|147841923|emb|CAN65212.1| hypothetical protein VITISV_039022 [Vitis vinifera]
          Length = 458

 Score =  120 bits (302), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 103/385 (26%), Positives = 164/385 (42%), Gaps = 54/385 (14%)

Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQP---CSQCY---KQSDPVFDPADSASFS 207
           G + + +  G+PP+    ++D+GS +VW  C     C+ C     +  P+F+P  S+S  
Sbjct: 85  GGHTIPLSFGTPPQKLSFLVDTGSHVVWAPCTTHYTCTNCSFSNPKKVPIFNPELSSSDK 144

Query: 208 GVSCSSAVCDRLENAGCHAG--RC------------RYEVSYGDGSYTKGTLALETLTIG 253
            + C    C    +   H G  RC            +Y + YG G+   G   LE L   
Sbjct: 145 ILGCRDPKCANTSSPDVHLGCPRCNGNSKKCSHACPQYTLQYGTGA-ASGFFLLENLDFP 203

Query: 254 RTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRG---TG 310
              +    +GC   +      +  L G G    SL  Q+G +    F+YCL S     T 
Sbjct: 204 GKTIHKFLVGCT-TSADREPSSDALAGFGRTMFSLPMQMGVK---KFAYCLNSHDYDDTR 259

Query: 311 SSGSLVFG-REALPVGAAWVPLVRNPR-APSFYYVGLSGLGVGG--MRIPISEDLFRLTQ 366
           +SG L+    +    G ++ P ++NP   P +YY+G+  + +G   +RIP       LT 
Sbjct: 260 NSGKLILDYSDGETQGLSYAPFLKNPPDYPFYYYLGVKDMKIGNKLLRIPGK----YLTP 315

Query: 367 MGDD--GVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPR---ASGVSIFDTCYNLSGFV 421
             D   GV++D+G A   +  P ++   +    Q     R   A   S    CYN +G  
Sbjct: 316 GSDSRGGVMIDSGFAYGYMTLPVFKIVTNELKKQMSKYRRSLEAETQSGLTPCYNFTGHK 375

Query: 422 SVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFA-----------FAPSPSGLSIIG 470
           S+++P + + F+GG  + +P  N+ +   +A   CF            F P PS   I+G
Sbjct: 376 SIKIPDLIYQFTGGANMVVPGMNYFLLFSEASLGCFPVTTDSPTNNLEFTPGPS--IILG 433

Query: 471 NIQQEGIQISFDGANGFVGFGPNVC 495
           N QQ    + FD  N  +GF    C
Sbjct: 434 NYQQVDHYVEFDLKNERLGFRQQTC 458


>gi|359482097|ref|XP_002271077.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 458

 Score =  120 bits (302), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 110/373 (29%), Positives = 173/373 (46%), Gaps = 47/373 (12%)

Query: 158 VRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVC- 216
           V + VG+PP++  MV+D+GS++ W++C   +Q ++ +   FDP  S+S+S V CSS  C 
Sbjct: 87  VSLTVGTPPQNVSMVLDTGSELSWLRCNK-TQTFQTT---FDPNRSSSYSPVPCSSLTCT 142

Query: 217 DRLEN----AGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHK---- 267
           DR  +    A C + + C   +SY D S ++G LA +T  IG + +     GC       
Sbjct: 143 DRTRDFPIPASCDSNQLCHAILSYADASSSEGNLASDTFYIGNSDMPGTIFGCMDSSFST 202

Query: 268 NQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGRE----ALP 323
           N        GL+G+  GS+S V Q+       FSYC+    +  SG L+ G       +P
Sbjct: 203 NTEEDSKNTGLMGMNRGSLSFVSQMDFP---KFSYCI--SDSDFSGVLLLGDANFSWLMP 257

Query: 324 VGAAWVPLVR-NPRAPSF----YYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGT 378
           +   + PL++ +   P F    Y V L G+ V    +P+ + +F     G    ++D+GT
Sbjct: 258 LN--YTPLIQISTPLPYFDRVAYTVQLEGIKVSSKLLPLPKSVFVPDHTGAGQTMVDSGT 315

Query: 379 AVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIF------DTCYN--LSGFVSVRVPTVSF 430
             T L  P Y A R+ F+ QT  + R      +      D CY   LS      +PTVS 
Sbjct: 316 QFTFLLGPVYSALRNEFLNQTSQILRVLEDPNYVFQGGMDLCYRVPLSQTSLPWLPTVSL 375

Query: 431 YFSGGPVLTLPASNFL--IPVDDAGT---FCFAFAPS---PSGLSIIGNIQQEGIQISFD 482
            F G   + +     L  +P +  G+   +CF F  S        +IG+  Q+ + + FD
Sbjct: 376 MFRGAE-MKVSGDRLLYRVPGEVRGSDSVYCFTFGNSDLLAVEAYVIGHHHQQNVWMEFD 434

Query: 483 GANGFVGFGPNVC 495
                +GF    C
Sbjct: 435 LEKSRIGFAQVQC 447


>gi|302141829|emb|CBI19032.3| unnamed protein product [Vitis vinifera]
          Length = 382

 Score =  120 bits (302), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 89/246 (36%), Positives = 130/246 (52%), Gaps = 12/246 (4%)

Query: 257 VKNVAIGCGHKNQGMFVG-AAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSL 315
           +  +  GCG  N+   +   AGLLGLG G +SLV QLG Q    FSYCL S     + SL
Sbjct: 139 IPRIGFGCGVNNRATGMDQTAGLLGLGRGVLSLVSQLGTQ---KFSYCLTSIHENKTSSL 195

Query: 316 VFGREAL----PVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDG 371
           +FG  A     P      PL++NP  PS+YY+ L G+ VG   +PI E  F+L + G  G
Sbjct: 196 LFGSLAYSNFNPGKIPRTPLIQNPFLPSYYYLALKGITVGYTLLPIPEFAFQLGKDGSGG 255

Query: 372 VVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNL--SGFVSVRVPTVS 429
           +++D+GT +T L   A++  ++AF++QT      S  +  D C++L       V+VP + 
Sbjct: 256 MILDSGTTITYLQEDAFDVLKNAFISQTELQVANSSTTGLDLCFHLPVKNAAEVKVPKLI 315

Query: 430 FYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVG 489
           F+F G   L LP  N+++   + G  C A   + S LSI GNIQQ+ + +  D     + 
Sbjct: 316 FHFKGLD-LALPVENYMVSDPEMGLICLAIDATGS-LSIFGNIQQQNMLVLHDLKKSTLS 373

Query: 490 FGPNVC 495
             P  C
Sbjct: 374 LVPTQC 379



 Score = 41.6 bits (96), Expect = 0.93,   Method: Compositional matrix adjust.
 Identities = 20/70 (28%), Positives = 38/70 (54%), Gaps = 7/70 (10%)

Query: 112 MQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYM 171
           +QR + R    ++R+SG    A ++  Q       + +  G GE+ V + +G+PP     
Sbjct: 62  IQRGINRGRQRLQRMSGMATTAERNGFQ-------APVHVGDGEFVVNLMIGTPPVPFPA 114

Query: 172 VIDSGSDIVW 181
           ++D+GSD++W
Sbjct: 115 IMDTGSDLIW 124


>gi|218194599|gb|EEC77026.1| hypothetical protein OsI_15382 [Oryza sativa Indica Group]
          Length = 409

 Score =  120 bits (301), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 105/371 (28%), Positives = 167/371 (45%), Gaps = 44/371 (11%)

Query: 156 YFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFSGVS 210
           Y+  IG+G+P +  Y+ +D+GSDI+WV C  C +C ++S       ++DP DS++ S VS
Sbjct: 4   YYTEIGIGTPTKRYYVQVDTGSDILWVNCISCDRCPRKSGLGLELTLYDPKDSSTGSKVS 63

Query: 211 CSSAVCDRLENA---GCHAGR-CRYEVSYGDGSYTKGTLALETLTI------GRTVVKN- 259
           C    C         GC     C Y V+YGDGS T G    + L        G+T   N 
Sbjct: 64  CDQGFCAATYGGLLPGCTTSLPCEYSVTYGDGSSTTGYFVSDLLQFDQVSGDGQTRPANS 123

Query: 260 -VAIGCGHKNQGMFVGAA-----GLLGLGGGSMSLVGQL--GGQTGGAFSYCLVSRGTGS 311
            V  GCG + QG  +G++     G++G G  + S++ QL   G+    F++CL +   G 
Sbjct: 124 TVTFGCGSQ-QGGDLGSSNQALDGIIGFGQSNTSMLSQLSAAGKVKKIFAHCLDTINGG- 181

Query: 312 SGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDG 371
            G    G    P      PLV  P  P  Y V L  + VGG  + +   +F   +    G
Sbjct: 182 -GIFAIGNVVQP-KVKTTPLV--PNMPH-YNVNLKSIDVGGTALKLPSHMFDTGE--KKG 234

Query: 372 VVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFY 431
            ++D+GT +T LP   Y+    A  A+  ++   + V  F  C+   G V    P ++F+
Sbjct: 235 TIIDSGTTLTYLPEIVYKEIMLAVFAKHKDITFHN-VQEF-LCFQYVGRVDDDFPKITFH 292

Query: 432 FSGG-PVLTLPASNFLIPVDDAGTFCFAF------APSPSGLSIIGNIQQEGIQISFDGA 484
           F    P+   P   F    D+   +C  F      +    G+ ++G++      + +D  
Sbjct: 293 FENDLPLNVYPHDYFFENGDNL--YCVGFQNGGLQSKDGKGMVLLGDLVLSNKLVVYDLE 350

Query: 485 NGFVGFGPNVC 495
           N  +G+    C
Sbjct: 351 NQVIGWTEYNC 361


>gi|242059211|ref|XP_002458751.1| hypothetical protein SORBIDRAFT_03g039590 [Sorghum bicolor]
 gi|241930726|gb|EES03871.1| hypothetical protein SORBIDRAFT_03g039590 [Sorghum bicolor]
          Length = 444

 Score =  120 bits (301), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 110/376 (29%), Positives = 163/376 (43%), Gaps = 45/376 (11%)

Query: 158 VRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPV------FDPADSASFSGVSC 211
           V + VG+PP++  MV+D+GS++ W+ C    Q    +         F P  SA+F+ V C
Sbjct: 65  VSLAVGTPPQNVTMVLDTGSELSWLLCATGRQGSAAAGAAAAMGESFRPRASATFAAVPC 124

Query: 212 SSAVC---DRLENAGCHAG--RCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGC-- 264
            S  C   D      C     +C   +SY DGS + G LA +   +G       A GC  
Sbjct: 125 GSTQCSSRDLPAPPSCDGASRQCHVSLSYADGSASDGALATDVFAVGEAPPLRSAFGCMS 184

Query: 265 -GHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALP 323
             + +    V  AGLLG+  G++S V Q   +    FSYC+  R    +G L+ G   LP
Sbjct: 185 TAYDSSPDGVATAGLLGMNRGTLSFVTQASTRR---FSYCISDRD--DAGVLLLGHSDLP 239

Query: 324 -VGAAWVPLVRNPRAPSFYY------VGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDT 376
            +   + PL + P  P  Y+      V L G+ VGG  +PI   +      G    ++D+
Sbjct: 240 FLPLNYTPLYQ-PTLPLPYFDRVAYSVQLLGIRVGGKALPIPASVLAPDHTGAGQTMVDS 298

Query: 377 GTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIF------DTCYNLSG---FVSVRVPT 427
           GT  T L   AY A +  F+ QT  L RA     F      DTC+ +       S R+P 
Sbjct: 299 GTQFTFLLGDAYSALKAEFLKQTKPLLRALDDPSFAFQEALDTCFRVPAGRPPPSARLPP 358

Query: 428 VSFYFSGGPVLTLPASNFLIPVDDA-----GTFCFAFAPS---PSGLSIIGNIQQEGIQI 479
           V+  F+G   +++     L  V        G +C  F  +   P    +IG+  Q  + +
Sbjct: 359 VTLLFNGA-EMSVAGDRLLYKVPGEHRGADGVWCLTFGNADMVPLTAYVIGHHHQMNLWV 417

Query: 480 SFDGANGFVGFGPNVC 495
            +D   G VG  P  C
Sbjct: 418 EYDLERGRVGLAPVKC 433


>gi|356523171|ref|XP_003530215.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 442

 Score =  120 bits (301), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 115/372 (30%), Positives = 164/372 (44%), Gaps = 42/372 (11%)

Query: 158 VRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVC- 216
           + I VG+PP++  MVID+GS++ W+ C   +       P F+P  S+S++ +SCSS  C 
Sbjct: 68  ISITVGTPPQNMSMVIDTGSELSWLHCN-TNTTATIPYPFFNPNISSSYTPISCSSPTCT 126

Query: 217 ----DRLENAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHK---- 267
               D    A C +   C   +SY D S ++G LA +T   G +    +  GC +     
Sbjct: 127 TRTRDFPIPASCDSNNLCHATLSYADASSSEGNLASDTFGFGSSFNPGIVFGCMNSSYST 186

Query: 268 NQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAA 327
           N        GL+G+  GS+SLV QL       FSYC+   G+  SG L+ G      G +
Sbjct: 187 NSESDSNTTGLMGMNLGSLSLVSQLKIP---KFSYCI--SGSDFSGILLLGESNFSWGGS 241

Query: 328 --WVPLVR-NPRAPSF----YYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAV 380
             + PLV+ +   P F    Y V L G+ +    + IS +LF     G    + D GT  
Sbjct: 242 LNYTPLVQISTPLPYFDRSAYTVRLEGIKISDKLLNISGNLFVPDHTGAGQTMFDLGTQF 301

Query: 381 TRLPTPAYEAFRDAFVAQTGNLPRASGVSIF------DTCYNLSGFVS--VRVPTVSFYF 432
           + L  P Y A RD F+ QT    RA     F      D CY +    S    +P+VS  F
Sbjct: 302 SYLLGPVYNALRDEFLNQTNGTLRALDDPNFVFQIAMDLCYRVPVNQSELPELPSVSLVF 361

Query: 433 SG------GPVLTLPASNFLIPVDDAGTFCFAFAPSP-SGLS--IIGNIQQEGIQISFDG 483
            G      G  L      F+   D    +CF F  S   G+   IIG+  Q+ + + FD 
Sbjct: 362 EGAEMRVFGDQLLYRVPGFVWGNDSV--YCFTFGNSDLLGVEAFIIGHHHQQSMWMEFDL 419

Query: 484 ANGFVGFGPNVC 495
               VG     C
Sbjct: 420 VEHRVGLAHARC 431


>gi|297597434|ref|NP_001043968.2| Os01g0696800 [Oryza sativa Japonica Group]
 gi|255673588|dbj|BAF05882.2| Os01g0696800 [Oryza sativa Japonica Group]
          Length = 334

 Score =  120 bits (301), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 100/312 (32%), Positives = 142/312 (45%), Gaps = 29/312 (9%)

Query: 196 PVFDPADSASFSGVSCSSAVCDRLENAGCH--------AGRCRYEVSYGDGS----YTKG 243
           P+  P  S+S + V+C    C  L    C         +G C Y  +YG+      YT+G
Sbjct: 13  PLLYPTSSSSAAFVACGDRTCGELPRPLCSNVAGGGSGSGNCSYHYAYGNARDTHHYTEG 72

Query: 244 TLALETLTIGR--TVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFS 301
            L  ET T G        +A GC  +++G F   +GL+GLG G +SLV QL  +   AF 
Sbjct: 73  ILMTETFTFGDDAAAFPGIAFGCTLRSEGGFGTGSGLVGLGRGKLSLVTQLNVE---AFG 129

Query: 302 YCLVSRGTGSSGSLVFGREALPVGA-----AWVPLVRNP--RAPSFYYVGLSGLGVGGMR 354
           Y L S  +  S  + FG  A   G         PL+ NP  +   FYYVGL+G+ VGG  
Sbjct: 130 YRLSSDLSAPS-PISFGSLADVTGGNGDSFMSTPLLTNPVVQDLPFYYVGLTGISVGGKL 188

Query: 355 IPISEDLFRLTQ-MGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDT 413
           + I    F   +  G  GV+ D+GT +T LP PAY   RD  ++Q G        +  D 
Sbjct: 189 VQIPSGTFSFDRSTGAGGVIFDSGTTLTMLPDPAYTLVRDELLSQMGFQKPPPAANDDDL 248

Query: 414 CYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPV---DDAGTFCFAFAPSPSGLSIIG 470
                G  +   P++  +F GG  + L   N+L  +   +     C++   S   L+IIG
Sbjct: 249 ICFTGGSSTTTFPSMVLHFDGGADMDLSTENYLPQMQGQNGETARCWSVVKSSQALTIIG 308

Query: 471 NIQQEGIQISFD 482
           NI Q    + FD
Sbjct: 309 NIMQMDFHVVFD 320


>gi|224128838|ref|XP_002320434.1| predicted protein [Populus trichocarpa]
 gi|222861207|gb|EEE98749.1| predicted protein [Populus trichocarpa]
          Length = 485

 Score =  120 bits (301), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 100/373 (26%), Positives = 163/373 (43%), Gaps = 43/373 (11%)

Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFSG 208
           G Y+ +IG+G+P +  Y+ +D+GSDI+WV C  C +C K S       +++  +S +   
Sbjct: 76  GLYYAKIGIGTPTKDYYVQVDTGSDIMWVNCIQCRECPKTSSLGIDLTLYNINESDTGKL 135

Query: 209 VSCSSAVCDRLENA---GCHAGR-CRYEVSYGDGSYTKGTLALETLTIGR------TVVK 258
           V C    C  +      GC A   C Y   YGDGS T G    + +   R      T   
Sbjct: 136 VPCDQEFCYEINGGQLPGCTANMSCPYLEIYGDGSSTAGYFVKDVVQYARVSGDLKTTAA 195

Query: 259 N--VAIGCGHKNQGMF-----VGAAGLLGLGGGSMSLVGQLG--GQTGGAFSYCLVSRGT 309
           N  V  GCG +  G           G+LG G  + S++ QL   G+    F++CL   GT
Sbjct: 196 NGSVIFGCGARQSGDLGSSNEEALDGILGFGKSNSSMISQLAVTGKVKKIFAHCL--DGT 253

Query: 310 GSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGD 369
              G  V G    P      PL+  P  P  Y V ++ + VG   + +  D+F   + GD
Sbjct: 254 NGGGIFVIGHVVQP-KVNMTPLI--PNQPH-YNVNMTAVQVGHEFLSLPTDVF---EAGD 306

Query: 370 -DGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTV 428
             G ++D+GT +  LP   Y+      ++Q  +L +   V    TC+  S  +    P V
Sbjct: 307 RKGAIIDSGTTLAYLPEMVYKPLVSKIISQQPDL-KVHTVRDEYTCFQYSDSLDDGFPNV 365

Query: 429 SFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPS------PSGLSIIGNIQQEGIQISFD 482
           +F+F    +L +    +L P +  G +C  +  S         ++++G++      + +D
Sbjct: 366 TFHFENSVILKVYPHEYLFPFE--GLWCIGWQNSGVQSRDRRNMTLLGDLVLSNKLVLYD 423

Query: 483 GANGFVGFGPNVC 495
             N  +G+    C
Sbjct: 424 LENQAIGWTEYNC 436


>gi|367066719|gb|AEX12643.1| hypothetical protein 2_5918_01 [Pinus radiata]
          Length = 137

 Score =  120 bits (301), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 50/127 (39%), Positives = 81/127 (63%)

Query: 144 DVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADS 203
           DV + +  G+GE+ +++ +G P  +   ++D+GSD+ W QC PCS CYKQ  P++DP+ S
Sbjct: 9   DVQAPVSAGNGEFLMQLAIGKPSLAYSAILDTGSDLTWTQCIPCSDCYKQPTPIYDPSLS 68

Query: 204 ASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIG 263
           +++  VSC S++C  L  + C +  C Y  +YGD S T+G L+ ET T+    + ++A G
Sbjct: 69  STYGTVSCKSSLCLALPASACISATCEYLYTYGDYSSTQGILSYETFTLSSQSIPHIAFG 128

Query: 264 CGHKNQG 270
           CG  N+G
Sbjct: 129 CGQDNEG 135


>gi|357164972|ref|XP_003580227.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 492

 Score =  120 bits (301), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 113/403 (28%), Positives = 157/403 (38%), Gaps = 98/403 (24%)

Query: 171 MVIDSGSDIVWVQCQP--CSQCYKQ---------SDPVFDPADSASFSGVSCSSAVCDRL 219
           + +D+GSD+VW  C P  C  C  +         S+P+  P DS     + C+S  C   
Sbjct: 100 LFLDTGSDLVWFPCAPFTCMLCEGKPTPPGNNNSSNPLPPPTDSRR---IPCASPFCSAA 156

Query: 220 ENAG-----CHAGRCRYE-----------------VSYGDGSYTKGTLALETLTIGRTVV 257
            ++      C A RC  +                  +YGDGS                 V
Sbjct: 157 HSSAPPADLCAAARCPLDDIETGSCAASHACPPLYYAYGDGSLVARLRRGRVGIAASVAV 216

Query: 258 KNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLG-GQTGGAFSYCLVSRGTGS----- 311
           +N    C H   G  VG AG    G G +SL  QL      G FSYCLV+    +     
Sbjct: 217 ENFTFACAHTALGEPVGVAGF---GRGPLSLPAQLAPAALSGRFSYCLVAHSFRADRPIR 273

Query: 312 SGSLVFGR-----EALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQ 366
              L+ GR      A   G  + PL+ NP+ P FY V L  + VGG RIP   +L R+ +
Sbjct: 274 PSPLILGRSPGEDPASETGIVYTPLLHNPKHPYFYSVALEAVSVGGTRIPARPELGRVGR 333

Query: 367 MGDDGVVMDTGTAVTRLPTPAYEAFRDAF---------------VAQTGNLPRASGVSIF 411
            GD G+V+D+GT  T LP   Y    + F                 QTG  P       +
Sbjct: 334 AGDGGMVVDSGTTFTMLPNETYARVAEEFGRAMAAARFERAEAAEDQTGLAP----CYYY 389

Query: 412 DTCYNLSGFVSVR-VPTVSFYFSGGPVLTLPASNFLIPV------------------DDA 452
           D   + +   S R VP ++ +F G   + LP  N+ +                    DD 
Sbjct: 390 DHDASAAEEGSARAVPPLAMHFRGEATVVLPRRNYFMGFRSEERRRVGCLMLMNGGEDDG 449

Query: 453 GTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           G         P+G   +GN QQ+G ++ +D   G VGF    C
Sbjct: 450 G--------GPAG--TLGNFQQQGFEVVYDVDAGRVGFARRRC 482


>gi|125555054|gb|EAZ00660.1| hypothetical protein OsI_22681 [Oryza sativa Indica Group]
          Length = 337

 Score =  120 bits (301), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 105/353 (29%), Positives = 151/353 (42%), Gaps = 44/353 (12%)

Query: 171 MVIDSGSDIVWVQCQPC---SQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAG 227
           M  D+G  I   +C  C   + C   +   FDP+ S++F+ V C S  C     +GC +G
Sbjct: 1   MAFDTGLGISLARCAACRPGAPCDGLAS--FDPSRSSTFAPVPCGSPDC----RSGCSSG 54

Query: 228 RCRYEVSYGDGSYTKGTLALETLTIGRTV-VKNVAIGCGHKNQGMFVGAAGLLGLGGGSM 286
                       +  G +A + LT+  +  V +   GC   + G  +GAAGLL L   S 
Sbjct: 55  STP-SCPLTSFPFLSGAVAQDVLTLTPSASVDDFTFGCVEGSSGEPLGAAGLLDLSRDSR 113

Query: 287 SLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVG-----AAWVPLVRNPRAPSFY 341
           SL  +L    GG FSYCL    T S G LV G   +P        A  PLV +P  P+ Y
Sbjct: 114 SLASRLAAGAGGTFSYCLPLSTTSSHGFLVIGEADVPHNRSARVTAVAPLVYDPAFPNHY 173

Query: 342 YVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGN 401
            + L+G+ +GG  IPI              +V+DT    T +    Y   RDAF      
Sbjct: 174 VIDLAGVSLGGRDIPIPPHA---------AMVLDTALPYTYMKPSMYAPLRDAFRRAMAR 224

Query: 402 LPRASGVSIFDTCYNLSGFV-SVRVPTVSFYF-------SGGPVLTLPASNFLIPVDDAG 453
            PRA  +   DTCYN +G    V +P V   F        G   +    ++ ++ + + G
Sbjct: 225 YPRAPAMGDLDTCYNFTGVRHEVLIPLVHLTFRGISGGGGGEGQVLGLGADQMLYMSEPG 284

Query: 454 TF----CFAFAPSPSG-------LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
            F    C AFA  PS          ++G + Q  +++  D   G +GF P  C
Sbjct: 285 NFFSVTCLAFAALPSDGDAAAPLAMVMGTLAQSSMEVVHDVQGGKIGFIPGSC 337


>gi|357482719|ref|XP_003611646.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355512981|gb|AES94604.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 640

 Score =  120 bits (301), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 88/358 (24%), Positives = 157/358 (43%), Gaps = 27/358 (7%)

Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
           +G Y  R+ +G+PP+   +++D+GS + +V C  C  C +  DP F P  S ++  V C+
Sbjct: 86  NGYYTTRLWIGTPPQRFALIVDTGSTVTYVPCSTCEHCGRHQDPKFQPDLSETYQPVKCT 145

Query: 213 SAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIG---RTVVKNVAIGCGHKNQ 269
                   N      +C Y+  Y + S + G L  + ++ G       +    GC +   
Sbjct: 146 PDC-----NCDGDTNQCMYDRQYAEMSSSSGVLGEDVVSFGNLSELAPQRAVFGCENDET 200

Query: 270 GMFVG--AAGLLGLGGGSMSLVGQLGGQT--GGAFSYCLVSRGTGSSGSLVFGREALPVG 325
           G      A G++GLG G +S++ QL  +     +FS C      G  G+++ G  + P  
Sbjct: 201 GDLYSQRADGIMGLGRGDLSIMDQLVDKKVISDSFSLCYGGMDVG-GGAMILGGISPPED 259

Query: 326 AAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPT 385
             +     +P    +Y + L  + V G ++ ++  +F     G  G V+D+GT    LP 
Sbjct: 260 MVFT--HSDPDRSPYYNINLKEMHVAGKKLQLNPKVFD----GKHGTVLDSGTTYAYLPE 313

Query: 386 PAYEAFRDAFVAQTGNLPRASG--VSIFDTCYNLSGF----VSVRVPTVSFYFSGGPVLT 439
            A+ AF+ A + +  +L + +G   +  D C+  +G     ++   P V   F  G  L+
Sbjct: 314 TAFLAFKRAIMKERNSLKQINGPDPNYKDICFTGAGIDVSQLAKSFPVVDMVFENGHKLS 373

Query: 440 LPASNFLIPVDDA-GTFCF-AFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           L   N+L       G +C   F+      +++G I      + +D  N  +GF    C
Sbjct: 374 LSPENYLFRHSKVRGAYCLGVFSNGRDPTTLLGGIFVRNTLVMYDRENSKIGFWKTNC 431


>gi|222613193|gb|EEE51325.1| hypothetical protein OsJ_32293 [Oryza sativa Japonica Group]
          Length = 371

 Score =  120 bits (301), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 97/328 (29%), Positives = 144/328 (43%), Gaps = 29/328 (8%)

Query: 183 QCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTK 242
            C  C  C+KQ  PVF P  S++F    C + VC  +    C +  C Y+   G G +T 
Sbjct: 54  NCSQCIHCFKQDLPVFVPNASSTFKPEPCGTDVCKSIPTPKCASDVCAYDGVTGLGGHTV 113

Query: 243 GTLALETLTIGRTV-VKNVAIGCGHKNQGM-FVGAAGLLGLGGGSMSLVGQLGGQTGGAF 300
           G +A +T  IG     +  A G   +     + G +G +GLG    SLV Q+       F
Sbjct: 114 GIVATDTFAIGTAAPARPPASGASWRATSTPWAGPSGFIGLGRTPWSLVAQMKLTR---F 170

Query: 301 SYCLVSRGTGSSGSLVFGREA-LPVGAAWVPLVR---NPRAPSFYYVGLSGLGVGGMRIP 356
           SYCL    TG +  L  G  A L  G AW P V+   N     +Y + L  +  G   I 
Sbjct: 171 SYCLAPHDTGKNSRLFLGASAKLAGGGAWTPFVKTSPNDGMSQYYPIELEEIKAGDATIT 230

Query: 357 ISEDLFRLTQMGDDGVVMDTGTA-VTRLPTPAYEAFRDAFVAQTGNLPRASGV-SIFDTC 414
           +          G + V++ T    V+ L    Y+ F+ A +A  G  P A+ V + F+ C
Sbjct: 231 MPR--------GRNTVLVQTAVVRVSLLVDSVYQEFKKAVMASVGAAPTATPVGAPFEVC 282

Query: 415 YNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFA-------PSPSGLS 467
           +  +G      P + F F  G  LT+P +N+L  V +  T C +          +  GL+
Sbjct: 283 FPKAGVSG--APDLVFTFQAGAALTVPPANYLFDVGN-DTVCLSVMSIALLNITALDGLN 339

Query: 468 IIGNIQQEGIQISFDGANGFVGFGPNVC 495
           I+G+ QQE + + FD     + F P  C
Sbjct: 340 ILGSFQQENVHLLFDLDKDMLSFEPADC 367


>gi|125564663|gb|EAZ10043.1| hypothetical protein OsI_32347 [Oryza sativa Indica Group]
          Length = 330

 Score =  120 bits (301), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 106/341 (31%), Positives = 160/341 (46%), Gaps = 39/341 (11%)

Query: 171 MVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCR 230
           +V D+ SD++W QCQPC  C  Q+  ++DP  + +++ ++ S+                 
Sbjct: 5   LVFDTTSDLLWTQCQPCLSCVAQAGDMYDPNKTETYANLTSSN----------------- 47

Query: 231 YEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVG 290
           Y  +Y   S+T G  A ET  +G   V N+  GCG +NQG +   AG+ G+G G +SL+ 
Sbjct: 48  YNYTYSKQSFTSGYFATETFALGNVTVANITFGCGTRNQGYYDNVAGVFGVGRGGVSLLN 107

Query: 291 QLGGQTGGAFSYCLVSRGTGSSGSLVFG------REALPVGAAWVPLVRNPRAPSFYYVG 344
           QLG      FSYC  S G   S ++  G        A    AA  P+V +P   S Y+V 
Sbjct: 108 QLGIDR---FSYCFSSSGAPGSSAVFLGGSPELATNATTTPAASTPMVADPVLKSGYFVK 164

Query: 345 LSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPR 404
           L G+ VG  R+ ++       + G   +V+D+ + VT L    Y   R A VAQ   L  
Sbjct: 165 LVGVTVGATRVDVAGA--SSAEGGGRALVIDSTSPVTVLDEATYGPVRRALVAQLAPLKE 222

Query: 405 A-----SGVSIFDTCYNLSGFVSVRVP---TVSFYFSGGPV-LTLPASNFLIPVDDAGTF 455
           A     +GV + D C+ L+   +   P   T++ +F GG   L LP +N+L      G  
Sbjct: 223 ANANASAGVGL-DLCFELAAGGATPTPPNVTMTLHFDGGAADLVLPPANYLAKDSAGGLI 281

Query: 456 CFAFAPSPS-GLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           C    PS S G+ ++G+       + +D A   V F P  C
Sbjct: 282 CLTMTPSSSNGVPVLGSSALLDTLVLYDLAKNVVSFQPLDC 322


>gi|18408451|ref|NP_564867.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|12322615|gb|AAG51309.1|AC026480_16 unknown protein [Arabidopsis thaliana]
 gi|14334808|gb|AAK59582.1| unknown protein [Arabidopsis thaliana]
 gi|15293195|gb|AAK93708.1| unknown protein [Arabidopsis thaliana]
 gi|332196351|gb|AEE34472.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 430

 Score =  120 bits (301), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 98/362 (27%), Positives = 162/362 (44%), Gaps = 33/362 (9%)

Query: 158 VRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVC- 216
           + + +G+PP++Q MV+D+GS + W+QC    +   +    FDP+ S+SFS + CS  +C 
Sbjct: 74  ISLPIGTPPQAQQMVLDTGSQLSWIQCH-RKKLPPKPKTSFDPSLSSSFSTLPCSHPLCK 132

Query: 217 ----DRLENAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQG 270
               D      C + R C Y   Y DG++ +G L  E +T   T +   + +GC  ++  
Sbjct: 133 PRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKITFSNTEITPPLILGCATESS- 191

Query: 271 MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR----GTGSSGSLVFGREALPVGA 326
                 G+LG+  G +S V Q        FSYC+  +    G   +GS   G      G 
Sbjct: 192 ---DDRGILGMNRGRLSFVSQ---AKISKFSYCIPPKSNRPGFTPTGSFYLGDNPNSHGF 245

Query: 327 AWVPLVRNPRA-------PSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTA 379
            +V L+  P +       P  Y V + G+  G  ++ IS  +FR    G    ++D+G+ 
Sbjct: 246 KYVSLLTFPESQRMPNLDPLAYTVPMIGIRFGLKKLNISGSVFRPDAGGSGQTMVDSGSE 305

Query: 380 VTRLPTPAYEAFRDAFVAQTGNLPRASGV--SIFDTCYNLS-GFVSVRVPTVSFYFSGGP 436
            T L   AY+  R   + + G   +   V     D C++ +   +   +  + F F+ G 
Sbjct: 306 FTHLVDAAYDKVRAEIMTRVGRRLKKGYVYGGTADMCFDGNVAMIPRLIGDLVFVFTRGV 365

Query: 437 VLTLPASNFLIPVDDAGTFCFAFAPSP---SGLSIIGNIQQEGIQISFDGANGFVGFGPN 493
            + +P    L+ V   G  C     S    +  +IIGN+ Q+ + + FD  N  VGF   
Sbjct: 366 EILVPKERVLVNV-GGGIHCVGIGRSSMLGAASNIIGNVHQQNLWVEFDVTNRRVGFAKA 424

Query: 494 VC 495
            C
Sbjct: 425 DC 426


>gi|356523724|ref|XP_003530485.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 488

 Score =  120 bits (300), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 99/380 (26%), Positives = 165/380 (43%), Gaps = 43/380 (11%)

Query: 147 SGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPA 201
           SG     G Y+ +IG+G+PP++ Y+ +D+GSDI+WV C  C +C  +S       ++D  
Sbjct: 74  SGRPDAVGLYYAKIGIGTPPKNYYLQVDTGSDIMWVNCIQCKECPTRSSLGMDLTLYDIK 133

Query: 202 DSASFSGVSCSSAVCDRLEN---AGCHAG-RCRYEVSYGDGSYTKGTLALETLTIGR--- 254
           +S+S   V C    C  +      GC A   C Y   YGDGS T G    + +   +   
Sbjct: 134 ESSSGKLVPCDQEFCKEINGGLLTGCTANISCPYLEIYGDGSSTAGYFVKDIVLYDQVSG 193

Query: 255 -----TVVKNVAIGCGHKNQGMFVGAA-----GLLGLGGGSMSLVGQLG--GQTGGAFSY 302
                +   ++  GCG +  G    +      G+LG G  + S++ QL   G+    F++
Sbjct: 194 DLKTDSANGSIVFGCGARQSGDLSSSNEEALDGILGFGKANSSMISQLASSGKVKKMFAH 253

Query: 303 CLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLF 362
           CL   G    G    G    P      PL+  P  P  Y V ++ + VG   + +S D  
Sbjct: 254 CL--NGVNGGGIFAIGHVVQP-KVNMTPLL--PDQPH-YSVNMTAVQVGHTFLSLSTD-- 305

Query: 363 RLTQMGD-DGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFV 421
             +  GD  G ++D+GT +  LP   YE      ++Q  +L +   +    TC+  S  V
Sbjct: 306 -TSAQGDRKGTIIDSGTTLAYLPEGIYEPLVYKMISQHPDL-KVQTLHDEYTCFQYSESV 363

Query: 422 SVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPS------PSGLSIIGNIQQE 475
               P V+F+F  G  L +   ++L P      +C  +  S         ++++G++   
Sbjct: 364 DDGFPAVTFFFENGLSLKVYPHDYLFP--SVNFWCIGWQNSGTQSRDSKNMTLLGDLVLS 421

Query: 476 GIQISFDGANGFVGFGPNVC 495
              + +D  N  +G+    C
Sbjct: 422 NKLVFYDLENQAIGWAEYNC 441


>gi|15229663|ref|NP_190574.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|6522926|emb|CAB62113.1| putative protein [Arabidopsis thaliana]
 gi|53828539|gb|AAU94379.1| At3g50050 [Arabidopsis thaliana]
 gi|55733749|gb|AAV59271.1| At3g50050 [Arabidopsis thaliana]
 gi|332645100|gb|AEE78621.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 632

 Score =  120 bits (300), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 104/390 (26%), Positives = 169/390 (43%), Gaps = 37/390 (9%)

Query: 124 RRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQ 183
           R+L    + +  H       D++      +G Y  R+ +G+PP+   +++DSGS + +V 
Sbjct: 66  RKLHKSDSKSLPHSRMRLYDDLLI-----NGYYTTRLWIGTPPQMFALIVDSGSTVTYVP 120

Query: 184 CQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGR--CRYEVSYGDGSYT 241
           C  C QC K  DP F P  S+++  V C+        +  C   R  C YE  Y + S +
Sbjct: 121 CSDCEQCGKHQDPKFQPEMSSTYQPVKCNM-------DCNCDDDREQCVYEREYAEHSSS 173

Query: 242 KGTLALETLTIG---RTVVKNVAIGCGHKNQGMFVG--AAGLLGLGGGSMSLVGQL--GG 294
           KG L  + ++ G   +   +    GC     G      A G++GLG G +SLV QL   G
Sbjct: 174 KGVLGEDLISFGNESQLTPQRAVFGCETVETGDLYSQRADGIIGLGQGDLSLVDQLVDKG 233

Query: 295 QTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMR 354
               +F  C      G  GS++ G    P    +     +P    +Y + L+G+ V G +
Sbjct: 234 LISNSFGLCYGGMDVG-GGSMILGGFDYPSDMVFTD--SDPDRSPYYNIDLTGIRVAGKQ 290

Query: 355 IPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASG--VSIFD 412
           + +   +F     G+ G V+D+GT    LP  A+ AF +A + +   L +  G   +  D
Sbjct: 291 LSLHSRVFD----GEHGAVLDSGTTYAYLPDAAFAAFEEAVMREVSTLKQIDGPDPNFKD 346

Query: 413 TCYNL--SGFVSVR---VPTVSFYFSGGPVLTLPASNFLIPVDDA-GTFCFAFAPS-PSG 465
           TC+ +  S +VS      P+V   F  G    L   N++       G +C    P+    
Sbjct: 347 TCFQVAASNYVSELSKIFPSVEMVFKSGQSWLLSPENYMFRHSKVHGAYCLGVFPNGKDH 406

Query: 466 LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
            +++G I      + +D  N  VGF    C
Sbjct: 407 TTLLGGIVVRNTLVVYDRENSKVGFWRTNC 436


>gi|21618176|gb|AAM67226.1| unknown [Arabidopsis thaliana]
          Length = 430

 Score =  119 bits (299), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 98/362 (27%), Positives = 162/362 (44%), Gaps = 33/362 (9%)

Query: 158 VRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVC- 216
           + + +G+PP++Q MV+D+GS + W+QC    +   +    FDP+ S+SFS + CS  +C 
Sbjct: 74  ISLPIGTPPQAQQMVLDTGSQLSWIQCH-RKKLPPKPKTSFDPSLSSSFSTLPCSHPLCK 132

Query: 217 ----DRLENAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQG 270
               D      C + R C Y   Y DG++ +G L  E +T   T +   + +GC  ++  
Sbjct: 133 PRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKITFSNTEITPPLILGCATESS- 191

Query: 271 MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR----GTGSSGSLVFGREALPVGA 326
                 G+LG+  G +S V Q        FSYC+  +    G   +GS   G      G 
Sbjct: 192 ---DDRGILGMNRGRLSFVSQ---AKISKFSYCIPPKSNRPGFTPTGSFYLGDNPNSHGF 245

Query: 327 AWVPLVRNPRA-------PSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTA 379
            +V L+  P +       P  Y V + G+  G  ++ IS  +FR    G    ++D+G+ 
Sbjct: 246 KYVSLLTFPESQRMPNLDPLAYTVPMIGIRFGLKKLNISGSVFRPDAGGSGQTMVDSGSE 305

Query: 380 VTRLPTPAYEAFRDAFVAQTGNLPRASGV--SIFDTCYNLS-GFVSVRVPTVSFYFSGGP 436
            T L   AY+  R   + + G   +   V     D C++ +   +   +  + F F+ G 
Sbjct: 306 FTHLVDAAYDKVRAEIMTRVGRRLKKGYVYGGTADMCFDGNVAMIPRLIGDLVFVFTRGV 365

Query: 437 VLTLPASNFLIPVDDAGTFCFAFAPSP---SGLSIIGNIQQEGIQISFDGANGFVGFGPN 493
            + +P    L+ V   G  C     S    +  +IIGN+ Q+ + + FD  N  VGF   
Sbjct: 366 EIFVPKERVLVNV-GGGIHCVGIGRSSMLGAASNIIGNVHQQNLWVEFDVTNRRVGFAKA 424

Query: 494 VC 495
            C
Sbjct: 425 DC 426


>gi|225440720|ref|XP_002275202.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 479

 Score =  119 bits (299), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 100/383 (26%), Positives = 160/383 (41%), Gaps = 48/383 (12%)

Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQP---CSQC-----YKQSDPVFDPADSAS 205
           G + + +  G+PP+    ++D+GS +VW  C     C+ C       +  P+F+P  S+S
Sbjct: 85  GGHSIPLSFGTPPQKLSFLVDTGSHVVWAPCTTHYTCTNCSFSDAEPKKVPIFNPKLSSS 144

Query: 206 FSGVSCSSAVCDRLENAGCHAG---------RCR-----YEVSYGDGSYTKGTLALETLT 251
              + C +  C    +   H G          C      Y + YG G+ + G   LE L 
Sbjct: 145 SKILGCRNPKCVNTSSPDVHLGCPPCNGNSKNCSHACPPYSLQYGTGA-SSGDFLLENLN 203

Query: 252 IGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRG--- 308
                +    +GC     G  V +A L G G    SL  Q+G +    F+YCL S     
Sbjct: 204 FPGKTIHEFLVGCTTSAVGE-VTSAALAGFGRSMFSLPMQMGVK---KFAYCLNSHDYDD 259

Query: 309 TGSSGSLVFG-REALPVGAAWVPLVRNPRA-PSFYYVGLSGLGVGGMRIPISEDLFRLTQ 366
           T +S  L+    +    G ++ P ++NP   P +YY+G+  + +G   + I         
Sbjct: 260 TRNSSKLILDYSDGETKGLSYAPFLKNPPDFPIYYYLGVKDIKIGNKLLRIPSKYLAPGS 319

Query: 367 MGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPR---ASGVSIFDTCYNLSGFVSV 423
            G  G+++D+G A   +  P ++   +    +     R   A        CYN +G  S+
Sbjct: 320 DGRGGLMIDSGFAYGYMTGPVFKKVTNELKKRMSKYRRSLEAEAEIGVTPCYNFTGQKSI 379

Query: 424 RVPTVSFYFSGGPVLTLPASNF--LIP---------VDDAGTFCFAFAPSPSGLSIIGNI 472
           ++P + + F GG  + +P  N+  LIP           DAGT    F P PS   I+GN 
Sbjct: 380 KIPDLIYQFRGGATMVVPGKNYFVLIPEISLACFPLTTDAGTNTLEFTPGPS--IILGNS 437

Query: 473 QQEGIQISFDGANGFVGFGPNVC 495
           Q     + FD  N  +GF    C
Sbjct: 438 QHVDYYVEFDLKNERLGFRQQTC 460


>gi|21668075|gb|AAM74221.1|AF518565_1 putative chloroplast nucleoid DNA-binding protein [Brassica
           oleracea]
          Length = 165

 Score =  119 bits (299), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 66/170 (38%), Positives = 97/170 (57%), Gaps = 8/170 (4%)

Query: 328 WVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPA 387
           + P+       SFY + + G+ VGG ++ I + +F        G ++D+GT ++RLP  A
Sbjct: 1   FTPISTITDGTSFYGLDIVGISVGGQKLAIPQTVFS-----TPGALIDSGTVISRLPPKA 55

Query: 388 YEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLI 447
           Y A R AF A+       S VSI DTC++L+GF +V +PTVSFYF+GG V+ L +   L 
Sbjct: 56  YAALRGAFKAKMSQYKNTSAVSILDTCFDLTGFKTVTIPTVSFYFNGGAVVELGSKGVLY 115

Query: 448 PVDDAGTFCFAFA--PSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
                   C AFA     +  +I GN+QQ+ +++ +DGA G VGF PN C
Sbjct: 116 AF-KMSQVCLAFAGNSDDNNAAIFGNVQQQTLEVVYDGAAGRVGFAPNGC 164


>gi|224081804|ref|XP_002306494.1| predicted protein [Populus trichocarpa]
 gi|222855943|gb|EEE93490.1| predicted protein [Populus trichocarpa]
          Length = 564

 Score =  119 bits (299), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 92/369 (24%), Positives = 156/369 (42%), Gaps = 49/369 (13%)

Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
           +G Y  R+ +G+PP+   +++D+GS + +V C  C QC +  DP F P  S+++  V C+
Sbjct: 10  NGYYTTRLWIGTPPQRFALIVDTGSSVTYVPCSSCEQCGRHQDPKFQPDLSSTYQSVKCN 69

Query: 213 -SAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGR---TVVKNVAIGCGHKN 268
               CD          +C YE  Y + S + G L  + ++ G       +    GC +  
Sbjct: 70  IDCNCDD------EKQQCVYERQYAEMSTSSGVLGEDIISFGNLSALAPQRAVFGCENME 123

Query: 269 QGMFVG--AAGLLGLGGGSMSLVGQL--GGQTGGAFSYC----------LVSRGTGSSGS 314
            G      A G++G+G G +S+V  L   G    +FS C          +V  G     +
Sbjct: 124 TGDLYSQHADGIMGMGRGDLSIVDHLVDKGVINDSFSLCYGGMGIGGGAMVLGGISPPSN 183

Query: 315 LVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVM 374
           +VF +              +P    +Y + L  + V G  +P++  +F     G  G ++
Sbjct: 184 MVFSQS-------------DPVRSPYYNIDLKEIHVAGKPLPLNPTVFD----GKHGTIL 226

Query: 375 DTGTAVTRLPTPAYEAFRDAFVAQTGNLP--RASGVSIFDTCYNLSGF----VSVRVPTV 428
           D+GT    LP  A+ +F+DA + +  +L   R    +  D C++ +G     +S   P V
Sbjct: 227 DSGTTYAYLPEAAFVSFKDAIMKELHSLKPIRGPDPNYNDICFSGAGSDISQLSSSFPAV 286

Query: 429 SFYFSGGPVLTLPASNFLIPVDDA-GTFCFA-FAPSPSGLSIIGNIQQEGIQISFDGANG 486
              F  G  L L   N+L       G +C   F       +++G I      + +D  N 
Sbjct: 287 EMVFGNGQKLLLSPENYLFRHSKVHGAYCLGIFQNGKDPTTLLGGIVVRNTLVLYDRENS 346

Query: 487 FVGFGPNVC 495
            +GF    C
Sbjct: 347 KIGFWKTNC 355


>gi|242076594|ref|XP_002448233.1| hypothetical protein SORBIDRAFT_06g023740 [Sorghum bicolor]
 gi|241939416|gb|EES12561.1| hypothetical protein SORBIDRAFT_06g023740 [Sorghum bicolor]
          Length = 508

 Score =  119 bits (298), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 118/394 (29%), Positives = 158/394 (40%), Gaps = 75/394 (19%)

Query: 171 MVIDSGSDIVWVQCQP--CSQCYKQSDPVFDPADSASFSG--------VSCSSAVC---- 216
           + +D+GSD+VW  C P  C  C  +  P    + SA            V C+S +C    
Sbjct: 111 LFLDTGSDLVWFPCAPFTCMLCEGKPTPSGGHSSSAPLPLPPPPDSRRVPCASPLCSAAH 170

Query: 217 ------DRLENAGC-----HAGRCR--------YEVSYGDGSYTKGTLALETLTIGRTV- 256
                 D    AGC       G CR           +YGDGS     L    + +G +V 
Sbjct: 171 ASAPPSDLCAAAGCPLEDIETGSCRGASHACPPLYYAYGDGSLV-AHLRRGRVGLGASVA 229

Query: 257 VKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSS---- 312
           V N    C H   G  VG AG    G G +SL GQL  Q  G FSYCLVS    +     
Sbjct: 230 VDNFTFACAHTALGEPVGVAGF---GRGPLSLPGQLAPQLSGRFSYCLVSHSFRADRLIR 286

Query: 313 -GSLVFGRE----ALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQM 367
              L+ GR     A   G  + PL+ NP+ P FY V L  + VG  RI    +L R+ + 
Sbjct: 287 PSPLILGRSPDAAAETGGFVYTPLLHNPKHPYFYSVALEAVSVGATRIQARPELARVDRA 346

Query: 368 GDDGVVMDTGTAVTRLPTPAYE-----AFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVS 422
           G+ G+V+D+GT  T LP   Y        R    A      RA   +    CY+ +   S
Sbjct: 347 GNGGMVVDSGTTFTMLPNETYARVAEAFARAMAAAGFARAERAEEQTGLTPCYHYA--AS 404

Query: 423 VR-VPTVSFYFSGGPVLTLPASNFLIPV------------DDAGTFCFAFAPSPSG---- 465
            R VP ++ +F G   + LP  N+ +              DD G          SG    
Sbjct: 405 DRGVPPLALHFRGNATVALPRRNYFMGFKSEEEAGGAGRKDDVGCLMLMNGGDVSGEDGG 464

Query: 466 ----LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
                  +GN QQ+G ++ +D   G VGF    C
Sbjct: 465 DDGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRC 498


>gi|18421660|ref|NP_568551.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|10177438|dbj|BAB10671.1| unnamed protein product [Arabidopsis thaliana]
 gi|15809850|gb|AAL06853.1| AT5g37540/mpa22_p_70 [Arabidopsis thaliana]
 gi|20260182|gb|AAM12989.1| unknown protein [Arabidopsis thaliana]
 gi|23197748|gb|AAN15401.1| unknown protein [Arabidopsis thaliana]
 gi|332006821|gb|AED94204.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 442

 Score =  119 bits (298), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 102/365 (27%), Positives = 163/365 (44%), Gaps = 35/365 (9%)

Query: 158 VRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPV--FDPADSASFSGVSCSSAV 215
           + + +G+P +SQ +V+D+GS + W+QC P         P   FDP+ S+SFS + CS  +
Sbjct: 82  LSLPIGTPSQSQELVLDTGSQLSWIQCHPKKIKKPLPPPTTSFDPSLSSSFSDLPCSHPL 141

Query: 216 C-----DRLENAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKN 268
           C     D      C + R C Y   Y DG++ +G L  E  T   +     + +GC  ++
Sbjct: 142 CKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKFTFSNSQTTPPLILGCAKES 201

Query: 269 QGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR----GTGSSGSLVFGREALPV 324
                   G+LG+  G +S + Q        FSYC+ +R    G  S+GS   G      
Sbjct: 202 ----TDEKGILGMNLGRLSFISQ---AKISKFSYCIPTRSNRPGLASTGSFYLGDNPNSR 254

Query: 325 GAAWVPLVRNPRA-------PSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTG 377
           G  +V L+  P++       P  Y V L G+ +G  R+ I   +FR    G    ++D+G
Sbjct: 255 GFKYVSLLTFPQSQRMPNLDPLAYTVPLQGIRIGQKRLNIPGSVFRPDAGGSGQTMVDSG 314

Query: 378 TAVTRLPTPAYEAFRDAFVAQTGNLPRASGV--SIFDTCY--NLSGFVSVRVPTVSFYFS 433
           +  T L   AY+  ++  V   G+  +   V  S  D C+  N S  +   +  + F F 
Sbjct: 315 SEFTHLVDVAYDKVKEEIVRLVGSRLKKGYVYGSTADMCFDGNHSMEIGRLIGDLVFEFG 374

Query: 434 GGPVLTLPASNFLIPVDDAGTFCFAFAPSP---SGLSIIGNIQQEGIQISFDGANGFVGF 490
            G  + +   + L+ V   G  C     S    +  +IIGN+ Q+ + + FD  N  VGF
Sbjct: 375 RGVEILVEKQSLLVNV-GGGIHCVGIGRSSMLGAASNIIGNVHQQNLWVEFDVTNRRVGF 433

Query: 491 GPNVC 495
               C
Sbjct: 434 SKAEC 438


>gi|255581508|ref|XP_002531560.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223528821|gb|EEF30826.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 407

 Score =  119 bits (298), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 111/373 (29%), Positives = 165/373 (44%), Gaps = 44/373 (11%)

Query: 158 VRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVC- 216
           V + VG+PP++  MVID+GS++ W+ C   +         F+   S S+  + CSS+ C 
Sbjct: 33  VSLTVGTPPQNVSMVIDTGSELSWLYCNKTTTTTSYPT-TFNQTRSISYRPIPCSSSTCT 91

Query: 217 ----DRLENAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHK---- 267
               D    A C +   C   +SY D S ++G LA +T  +G + +  +  GC       
Sbjct: 92  NQTRDFSIPASCDSNSLCHATLSYADASSSEGNLASDTFHMGASDIPGMVFGCMDSVFSS 151

Query: 268 NQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGRE----ALP 323
           N        GL+G+  GS+S V Q+G      FSYC+   GT  SG L+ G      A+P
Sbjct: 152 NSDEDSKNTGLMGMNRGSLSFVSQMGFP---KFSYCI--SGTDFSGMLLLGESNFTWAVP 206

Query: 324 VGAAWVPLVR-NPRAPSF----YYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGT 378
           +   + PLV+ +   P F    Y V L G+ V    +PI + +F     G    ++D+GT
Sbjct: 207 LN--YTPLVQISTPLPYFDRIAYTVQLEGIKVSDRLLPIPKSVFEPDHTGAGQTMVDSGT 264

Query: 379 AVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIF------DTCYN--LSGFVSVRVPTVSF 430
             T L  PAY A R  F+ QT    R      F      D CY   +S  V  R+PTVS 
Sbjct: 265 QFTFLLGPAYTALRSEFLNQTTGFLRVLEDPDFVFQGAMDLCYRVPISQRVLPRLPTVSL 324

Query: 431 YFSGGPVLTLPASNFLIPVD-----DAGTFCFAFAPSP---SGLSIIGNIQQEGIQISFD 482
            F+G   +T+     L  V      +    C +F  S        +IG+  Q+ + + FD
Sbjct: 325 VFNGAE-MTVADERVLYRVPGEIRGNDSVHCLSFGNSDLLGVEAYVIGHHHQQNVWMEFD 383

Query: 483 GANGFVGFGPNVC 495
                +G     C
Sbjct: 384 LERSRIGLAQVRC 396


>gi|255647724|gb|ACU24323.1| unknown [Glycine max]
          Length = 334

 Score =  119 bits (297), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 85/246 (34%), Positives = 123/246 (50%), Gaps = 8/246 (3%)

Query: 145 VVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSA 204
           + SG     G Y VR+ +G+P +  +MV+D+ +D  +V C  C+ C   SD  F P  S 
Sbjct: 89  IASGQTFNIGNYVVRVKLGTPGQLLFMVLDTSTDEAFVPCSGCTGC---SDATFSPKAST 145

Query: 205 SFSGVSCSSAVCDRLENAGCHA---GRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVA 261
           S+  + CS   C ++    C A   G C +  SY   S++  TL  ++L +   V+ N +
Sbjct: 146 SYGPLDCSVPQCGQVRGLSCPATGTGACSFNQSYAGSSFS-ATLVQDSLRLATDVIPNYS 204

Query: 262 IGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGRE 320
            GC +   G  V A GLLGLG G +SL+ Q G    G FSYCL S +    SGSL     
Sbjct: 205 FGCVNAITGASVPAQGLLGLGRGPLSLLSQSGSNYSGIFSYCLPSFKSYYFSGSLKLRPV 264

Query: 321 ALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAV 380
             P      PL+R+P  PS YYV  +G+ VG + +P   +          G ++D+GT +
Sbjct: 265 GQPKSIRTTPLLRSPHRPSLYYVNFTGISVGRVLVPFPSEYLGFNPNTGSGTIIDSGTVI 324

Query: 381 TRLPTP 386
           TR   P
Sbjct: 325 TRFVEP 330


>gi|356541713|ref|XP_003539318.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 640

 Score =  119 bits (297), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 96/359 (26%), Positives = 156/359 (43%), Gaps = 29/359 (8%)

Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
           +G Y  R+ +G+PP+   +++D+GS + +V C  C  C    DP F P  S ++  V C+
Sbjct: 90  NGYYTTRLWIGTPPQRFALIVDTGSTVTYVPCSTCKHCGSHQDPKFRPEASETYQPVKCT 149

Query: 213 SAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIG---RTVVKNVAIGCGHKNQ 269
                   N      +C YE  Y + S + G L  + ++ G       +    GC +   
Sbjct: 150 WQC-----NCDDDRKQCTYERRYAEMSTSSGVLGEDVVSFGNQSELSPQRAIFGCENDET 204

Query: 270 GMFVG--AAGLLGLGGGSMSLVGQLGGQT--GGAFSYCLVSRGTGSSGSLVFGREALPVG 325
           G      A G++GLG G +S++ QL  +     AFS C    G G    ++ G   +   
Sbjct: 205 GDIYNQRADGIMGLGRGDLSIMDQLVEKKVISDAFSLCYGGMGVGGGAMVLGG---ISPP 261

Query: 326 AAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPT 385
           A  V    +P    +Y + L  + V G R+ ++  +F     G  G V+D+GT    LP 
Sbjct: 262 ADMVFTHSDPVRSPYYNIDLKEIHVAGKRLHLNPKVFD----GKHGTVLDSGTTYAYLPE 317

Query: 386 PAYEAFRDAFVAQTGNLPRASGVSIF--DTCY-----NLSGFVSVRVPTVSFYFSGGPVL 438
            A+ AF+ A + +T +L R SG      D C+     N+S  +S   P V   F  G  L
Sbjct: 318 SAFLAFKHAIMKETHSLKRISGPDPHYNDICFSGAEINVSQ-LSKSFPVVEMVFGNGHKL 376

Query: 439 TLPASNFLIPVDDA-GTFCF-AFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           +L   N+L       G +C   F+      +++G I      + +D  +  +GF    C
Sbjct: 377 SLSPENYLFRHSKVRGAYCLGVFSNGNDPTTLLGGIVVRNTLVMYDREHSKIGFWKTNC 435


>gi|224136436|ref|XP_002322329.1| predicted protein [Populus trichocarpa]
 gi|222869325|gb|EEF06456.1| predicted protein [Populus trichocarpa]
          Length = 486

 Score =  118 bits (296), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 104/400 (26%), Positives = 170/400 (42%), Gaps = 67/400 (16%)

Query: 156 YFVRIGVGSPPRSQYMVIDSGSDIVWVQCQ----PCSQC--YKQSDPVF----------- 198
           Y + + +G+PP+   +++D+GSD+ WV C      C +C  Y+ +  +            
Sbjct: 82  YLISLNIGTPPQVIQVLMDTGSDLTWVPCGNLSFDCMECDDYRNNKLMATFSPSYSSSSY 141

Query: 199 ----------------DPADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTK 242
                           +P D+ + +G S S+ V      A C      +  +YG G    
Sbjct: 142 RASCASPFCIDIHSSDNPLDTCTVAGCSLSTLV-----KATCSRPCPSFAYTYGAGGVVT 196

Query: 243 GTLALETLTIGRT---VVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGA 299
           G L  +TL +  +   V K +   C       +    G+ G G G++S+V QLG    G 
Sbjct: 197 GILTRDTLRVNGSSPGVAKEIPKFCFGCVGSAYREPIGIAGFGRGTLSMVSQLGFLQKG- 255

Query: 300 FSYCLV----SRGTGSSGSLVFGREALPVG--AAWVPLVRNPRAPSFYYVGLSGLGVGGM 353
           FS+C +    +     S  LV G  AL       + P++ +P  P+FYYVGL  + VG +
Sbjct: 256 FSHCFLAFKYANNPNISSPLVVGDIALTSKDDMQFTPMLNSPMYPNFYYVGLEAITVGNV 315

Query: 354 R-IPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSI-- 410
               +   L     +G+ G+ +D+GT  T LP P Y     + +  T N PR +G+ +  
Sbjct: 316 SATEVPSSLREFDSLGNGGMKIDSGTTYTHLPEPFYSQVL-SILQSTINYPRDTGMEMQT 374

Query: 411 -FDTCY------NLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAG----TFCFAF 459
            FD CY      N +      +P+++F+F     L LP  N   PV   G      C  F
Sbjct: 375 GFDLCYKVPRPNNNTLTSDDLLPSITFHFLNNVSLVLPQGNHFYPVSAPGNPAVVKCLMF 434

Query: 460 APSPSG----LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
             +  G      + G+ QQ+ +++ +D     +GF P  C
Sbjct: 435 QSTDDGDDGPAGVFGSFQQQNVEVVYDLEKERIGFQPMDC 474


>gi|449433371|ref|XP_004134471.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
 gi|449495479|ref|XP_004159853.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 424

 Score =  118 bits (296), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 116/445 (26%), Positives = 196/445 (44%), Gaps = 49/445 (11%)

Query: 74  SDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRL---SGGG 130
           S+E  +   L+H D   S         ++ H  +  AR++  V R  + +  L   +   
Sbjct: 3   SNEVGFTARLIHHDSPLSP--------FYNHTMTDTARIEATVHRSRSRLNYLYYINKLS 54

Query: 131 ADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC-SQ 189
            +A  ++V    T V  G     GEY +   +G+P       +D+ + ++WVQC  C SQ
Sbjct: 55  ENALDNDVSLSPTLVNEG-----GEYLMSFNIGNPSSQVMGFLDTSNGLIWVQCSNCNSQ 109

Query: 190 CYKQSDPV---FDPADSASFSGVSCSSAVCDRL---ENAGCHAGRCRYEVSYGDGSYTKG 243
           C  +   +   F  + S ++    C S  C+ L   +        C+Y + YGD   T G
Sbjct: 110 CEPEKRGLTTKFLSSKSFTYEMEPCGSNFCNSLTGFQTCNSSDKWCKYRLVYGDNKATSG 169

Query: 244 TLALETL----TIGRTV-VKNVAIGCGHKN-QGMFVGAAGLLGLGGGSMSLVGQLGGQTG 297
            L+ ++     + G  V V  +  GC      G      G +GL    +SL+ QLG +  
Sbjct: 170 ILSSDSFGFDTSDGMLVDVGFLNFGCSEAPLTGDEQSYTGNVGLNQTPLSLISQLGIK-- 227

Query: 298 GAFSYCLVS-RGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIP 356
             FSYCLV     GS+  + FG  +LPV +     +  P + + YYV + G+ +G    P
Sbjct: 228 -KFSYCLVPFNNLGSTSKMYFG--SLPVTSGGQTPLLYPNSDA-YYVKVLGISIGNDE-P 282

Query: 357 ISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVA-----QTGNLPRASGVSIF 411
             + +F + ++ D G ++DTG   + L T A+++    F+      Q  + P+      F
Sbjct: 283 HFDGVFDVYEVRD-GWIIDTGITYSSLETDAFDSLLAKFLTLKDFPQRKDDPKER----F 337

Query: 412 DTCYNLSGFVSVR-VPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIG 470
           + C+ L     +   P V+ +F G  ++    S F + ++D G FC A   S S +SI+G
Sbjct: 338 ELCFELQNANDLESFPDVTVHFDGADLILNVESTF-VKIEDDGIFCLALLRSGSPVSILG 396

Query: 471 NIQQEGIQISFDGANGFVGFGPNVC 495
           N Q +   + +D     + F P  C
Sbjct: 397 NFQLQNYHVGYDLEAQVISFAPVDC 421


>gi|297819684|ref|XP_002877725.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297323563|gb|EFH53984.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 633

 Score =  118 bits (296), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 102/388 (26%), Positives = 167/388 (43%), Gaps = 33/388 (8%)

Query: 124 RRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQ 183
           R+L    + +  H       D++      +G Y  R+ +G+PP+   +++DSGS + +V 
Sbjct: 67  RKLHKSDSKSLPHSRMRLYDDLLI-----NGYYTTRLWIGTPPQMFALIVDSGSTVTYVP 121

Query: 184 CQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRCRYEVSYGDGSYTKG 243
           C  C QC K  DP F P  S+++  V C+        N      +C YE  Y + S +KG
Sbjct: 122 CSDCEQCGKHQDPKFQPELSSTYQPVKCNMDC-----NCDDDKEQCVYEREYAEHSSSKG 176

Query: 244 TLALETLTIG---RTVVKNVAIGCGHKNQGMFVG--AAGLLGLGGGSMSLVGQL--GGQT 296
            L  + ++ G   +   +    GC     G      A G++GLG G +SLV QL   G  
Sbjct: 177 VLGEDLISFGNESQLTPQRAVFGCETVETGDLYSQRADGIIGLGQGDLSLVDQLVDKGLI 236

Query: 297 GGAFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIP 356
             +F  C      G  GS++ G    P    +     +P    +Y + L+G+ V G ++ 
Sbjct: 237 SNSFGLCYGGMDVG-GGSMILGGFDYPSDMIFTD--SDPDRSPYYNIDLTGIRVAGKKLS 293

Query: 357 ISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASG--VSIFDTC 414
           ++  +F     G+ G V+D+GT    LP  A+ AF +A + +   L +  G   +  DTC
Sbjct: 294 LNSRVFD----GEHGAVLDSGTTYAYLPDAAFAAFEEAVMREVSPLKQIDGPDPNFKDTC 349

Query: 415 Y-----NLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDA-GTFCFAFAPS-PSGLS 467
           +     N    +S   P+V   F  G    L   N++       G +C    P+     +
Sbjct: 350 FLVAASNDVSELSKIFPSVEMIFKSGQSWLLSPENYMFRHSKVHGAYCLGVFPNGKDHTT 409

Query: 468 IIGNIQQEGIQISFDGANGFVGFGPNVC 495
           ++G I      + +D  N  VGF    C
Sbjct: 410 LLGGIVVRNTLVVYDRENSKVGFWRTNC 437


>gi|255576064|ref|XP_002528927.1| pepsin A, putative [Ricinus communis]
 gi|223531629|gb|EEF33456.1| pepsin A, putative [Ricinus communis]
          Length = 493

 Score =  118 bits (296), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 111/403 (27%), Positives = 161/403 (39%), Gaps = 81/403 (20%)

Query: 164 SPPRSQYMVIDSGSDIVWVQCQP--CSQCYKQSDPVF----DPADSASFSGVSCSSAVC- 216
           +PP+   + +D+GSD+VW  C+P  C  C  +++        P  S++   V C S+ C 
Sbjct: 91  NPPQHVSLYLDTGSDLVWFPCKPFECILCEGKAENTTASTPPPRLSSTARSVHCKSSACS 150

Query: 217 -------------------DRLENAGCHAGRC-RYEVSYGDGSYT--------KGTLALE 248
                              + +E + CH+  C  +  +YGDGS          K  LA  
Sbjct: 151 AAHSNLPTSDLCAIADCPLESIETSDCHSFSCPSFYYAYGDGSLVARLYHDSIKLPLATP 210

Query: 249 TLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGG---QTGGAFSYCLV 305
           +L++      N   GC H      VG AG    G G +SL  QL     Q G  FSYCLV
Sbjct: 211 SLSL-----HNFTFGCAHTALAEPVGVAGF---GRGVLSLPAQLASFAPQLGNRFSYCLV 262

Query: 306 SRGTGSS-----GSLVFGR--------EALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGG 352
           S    S        L+ G             V   +  ++ NP+ P FY VGL G+ +G 
Sbjct: 263 SHSFNSDRLRLPSPLILGHSDDKEKRVNKDDVQFVYTSMLDNPKHPYFYCVGLEGISIGK 322

Query: 353 MRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNL-PRASGVSI- 410
            +IP  E L R+ + G  GVV+D+GT  T LP   Y +    F  + G +  RA  V   
Sbjct: 323 KKIPAPEFLKRVDREGSGGVVVDSGTTFTMLPASLYNSVVAEFDNRVGRVYERAKEVEDK 382

Query: 411 --FDTCYNLSGFVSVRVPTVSFYFSGGP-VLTLPASNFLIPVDDAG--------TFCFAF 459
                CY     V+  +P++  +F G    + LP  N+     D G          C   
Sbjct: 383 TGLGPCYYYDTVVN--IPSLVLHFVGNESSVVLPKKNYFYDFLDGGDGVRRKRRVGCLML 440

Query: 460 APSPSGLSI-------IGNIQQEGIQISFDGANGFVGFGPNVC 495
                   +       +GN QQ G ++ +D     VGF    C
Sbjct: 441 MNGGEEAELTGGPGATLGNYQQHGFEVVYDLEQRRVGFARRKC 483


>gi|356553832|ref|XP_003545255.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 427

 Score =  118 bits (295), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 106/367 (28%), Positives = 164/367 (44%), Gaps = 40/367 (10%)

Query: 158 VRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVC- 216
           + + +GSPP++  MV+D+GS++ W+ C+         +  F+P  S+S++   C+S+VC 
Sbjct: 61  ISLTIGSPPQNVTMVLDTGSELSWLHCKKLPNL----NSTFNPLLSSSYTPTPCNSSVCM 116

Query: 217 ----DRLENAGCHAGR--CRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGC----GH 266
               D    A C      C   VSY D S  +GTLA ET ++          GC    G+
Sbjct: 117 TRTRDLTIPASCDPNNKLCHVIVSYADASSAEGTLAAETFSLAGAAQPGTLFGCMDSAGY 176

Query: 267 KNQ-GMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGR-EALPV 324
            +         GL+G+  GS+SLV Q+       FSYC+   G  + G L+ G   + P 
Sbjct: 177 TSDINEDAKTTGLMGMNRGSLSLVTQM---VLPKFSYCI--SGEDAFGVLLLGDGPSAPS 231

Query: 325 GAAWVPLVR-NPRAPSF----YYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTA 379
              + PLV     +P F    Y V L G+ V    + + + +F     G    ++D+GT 
Sbjct: 232 PLQYTPLVTATTSSPYFDRVAYTVQLEGIKVSEKLLQLPKSVFVPDHTGAGQTMVDSGTQ 291

Query: 380 VTRLPTPAYEAFRDAFVAQT-GNLPRASGVSI-----FDTCYNLSGFVSVRVPTVSFYFS 433
            T L  P Y + +D F+ QT G L R    +       D CY+    ++  VP V+  FS
Sbjct: 292 FTFLLGPVYNSLKDEFLEQTKGVLTRIEDPNFVFEGAMDLCYHAPASLAA-VPAVTLVFS 350

Query: 434 GGPVLTLPASNFLIPVDDA--GTFCFAFAPSP-SGLS--IIGNIQQEGIQISFDGANGFV 488
           G   + +     L  V       +CF F  S   G+   +IG+  Q+ + + FD     V
Sbjct: 351 GAE-MRVSGERLLYRVSKGRDWVYCFTFGNSDLLGIEAYVIGHHHQQNVWMEFDLVKSRV 409

Query: 489 GFGPNVC 495
           GF    C
Sbjct: 410 GFTETTC 416


>gi|255581545|ref|XP_002531578.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223528808|gb|EEF30814.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 442

 Score =  118 bits (295), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 108/374 (28%), Positives = 166/374 (44%), Gaps = 49/374 (13%)

Query: 158 VRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVC- 216
           V + VGSPP++  MV+D+GS++ W+ C+      +  + VF+P  S ++S V C S  C 
Sbjct: 71  VSLTVGSPPQNVTMVLDTGSELSWLHCKKT----QFLNSVFNPLSSKTYSKVPCLSPTCK 126

Query: 217 ----DRLENAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGH----K 267
               D      C A + C   VSY D +  +G LA ET  +G         GC       
Sbjct: 127 TRTRDLTIPVSCDATKLCHVIVSYADATSIEGNLAFETFRLGSLTKPATIFGCMDSGFSS 186

Query: 268 NQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPV--G 325
           N        GL+G+  GS+S V Q+G      FSYC+   G  S+G L+ G  + P    
Sbjct: 187 NSEEDSKTTGLIGMNRGSLSFVNQMGYP---KFSYCI--SGFDSAGVLLLGNASFPWLKP 241

Query: 326 AAWVPLVR-NPRAPSF----YYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAV 380
            ++ PLV+ +   P F    Y V L G+ V    + + + +F     G    ++D+GT  
Sbjct: 242 LSYTPLVQISTPLPYFDRVAYTVQLEGIKVKNKVLSLPKSVFVPDHTGAGQTMVDSGTQF 301

Query: 381 TRLPTPAYEAFRDAFVAQTGNLPRASGVSIF------DTCYNLSGFVSVR-----VPTVS 429
           T L  P Y A ++ F++QT  + +      F      D CY L    S R     +P VS
Sbjct: 302 TFLLGPVYTALKNEFLSQTRGILKVLNDDNFVFQGAMDLCYLLD---SSRPNLQNLPVVS 358

Query: 430 FYFSGGPVLTLPASNFL--IPVDDAG---TFCFAFAPSP-SGLS--IIGNIQQEGIQISF 481
             F G   +++     L  +P +  G    +CF F  S   G+   +IG+  Q+ + + F
Sbjct: 359 LMFQGAE-MSVSGERLLYRVPGEVRGRDSVWCFTFGNSDLLGVEAFVIGHHHQQNVWMEF 417

Query: 482 DGANGFVGFGPNVC 495
           D     +G     C
Sbjct: 418 DLEKSRIGLADVRC 431


>gi|357148754|ref|XP_003574882.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
           distachyon]
          Length = 488

 Score =  118 bits (295), Expect = 9e-24,   Method: Compositional matrix adjust.
 Identities = 105/384 (27%), Positives = 168/384 (43%), Gaps = 54/384 (14%)

Query: 148 GMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPAD 202
           G+   +G Y+  I +G+PP+  ++ +D+GSDI+WV C  C++C ++SD      ++DP  
Sbjct: 75  GLPTDTGLYYTEIEIGTPPKQYHVQVDTGSDILWVNCISCNKCPRKSDLGIDLRLYDPKG 134

Query: 203 SASFSGVSCSSAVCDRLENAGCHAGR---------CRYEVSYGDGSYTKGTLALETLTIG 253
           S+S S VSC    C     A  + G+         C Y V YGDGS T G    ++L   
Sbjct: 135 SSSGSTVSCDQKFC-----AATYGGKLPGCAKNIPCEYSVMYGDGSSTTGYFVSDSLQYN 189

Query: 254 --------RTVVKNVAIGCGHKNQGMFVGAA-----GLLGLGGGSMSLVGQL--GGQTGG 298
                   R    +V  GCG + QG  +G+      G++G G  + S++ QL   G+   
Sbjct: 190 QVSGDGQTRHANASVIFGCGAQ-QGGDLGSTNQALDGIIGFGQSNTSMLSQLAAAGEVKK 248

Query: 299 AFSYCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPIS 358
            FS+CL +   G  G    G    P   +  PLV  P  P  Y V L  + VGG  + + 
Sbjct: 249 IFSHCLDTIKGG--GIFAIGDVVQPKVKS-TPLV--PDMPH-YNVNLESINVGGTTLQLP 302

Query: 359 EDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFD-TCYNL 417
             +F   +    G ++D+GT +T LP   Y   +D   A     P  +  S+ D  C   
Sbjct: 303 SHMFETGE--KKGTIIDSGTTLTYLPELVY---KDVLAAVFAKHPDTTFHSVQDFLCIQY 357

Query: 418 SGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAF------APSPSGLSIIGN 471
              V    P ++F+F     L +   ++     D   +CF F      +     + ++G+
Sbjct: 358 FQSVDDGFPKITFHFEDDLGLNVYPHDYFFQNGD-NLYCFGFQNGGLQSKDGKDMVLLGD 416

Query: 472 IQQEGIQISFDGANGFVGFGPNVC 495
           +      + +D  N  VG+    C
Sbjct: 417 LVLSNKVVVYDLENQVVGWTDYNC 440


>gi|297819968|ref|XP_002877867.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297323705|gb|EFH54126.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 469

 Score =  118 bits (295), Expect = 9e-24,   Method: Compositional matrix adjust.
 Identities = 113/394 (28%), Positives = 162/394 (41%), Gaps = 66/394 (16%)

Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQP---CSQC-YKQSDPV----FDPADSAS 205
           G Y V +  G+P ++   V D+GS +VW  C     CS C +   DP     F P +S+S
Sbjct: 88  GGYSVSLSFGTPSQTIPFVFDTGSSLVWFPCTSRYLCSDCNFSGLDPTQIPRFIPKNSSS 147

Query: 206 FSGVSCSSAVCDRLENAGCHAGRCR------------YEVSYGDGSYTKGTLALETLTIG 253
              + C +  C  L  A      C             Y + YG GS T G L  E L   
Sbjct: 148 SRVIGCQNPKCQFLFGANVQCRGCDPNTRNCTVPCPPYILQYGLGS-TAGILISEKLDFP 206

Query: 254 RTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR------ 307
              V +  +GC   +       AG+ G G G  SL  Q+  +   +FS+CLVSR      
Sbjct: 207 DLTVPDFVVGCSVISTRT---PAGIAGFGRGPESLPSQMKLK---SFSHCLVSRRFDDTN 260

Query: 308 -------GTGS---SGSLVFGREALPVGAAWVPLVRNPRAPS-----FYYVGLSGLGVGG 352
                   TGS   SGS          G ++ P  +NP   +     +YY+ L  + VG 
Sbjct: 261 VTTDLGLDTGSGHKSGSKT-------PGLSYTPFRKNPNVSNTAFLEYYYLNLRRIYVGS 313

Query: 353 MRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRAS---GVS 409
             + I          G+ G ++D+G+  T +  P +E   + F  Q  N  R      VS
Sbjct: 314 KHVKIPYKFLAPGTNGNGGSIVDSGSTFTFMERPVFELVAEEFATQMSNYTREKDLEKVS 373

Query: 410 IFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAP----SPSG 465
               C+N+SG   V VP + F F GG  + LP SN+   V +A T C         +P G
Sbjct: 374 GIAPCFNISGKGDVTVPELIFEFKGGAKMELPLSNYFSFVGNADTVCLTVVSDNTVNPGG 433

Query: 466 LS----IIGNIQQEGIQISFDGANGFVGFGPNVC 495
            +    I+G+ QQ+   + +D  N   GF    C
Sbjct: 434 GTGPAIILGSFQQQNYLVEYDLENDRFGFAKKKC 467


>gi|125524351|gb|EAY72465.1| hypothetical protein OsI_00321 [Oryza sativa Indica Group]
          Length = 343

 Score =  117 bits (294), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 53/90 (58%), Positives = 66/90 (73%), Gaps = 2/90 (2%)

Query: 145 VVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSA 204
           VVSG+  GSGEYF R+GVGSP R  YMV+D+GSD+ WVQCQPC+ CY+QSDPVFDP+ S 
Sbjct: 156 VVSGVGLGSGEYFSRVGVGSPARQLYMVLDTGSDVTWVQCQPCADCYQQSDPVFDPSLST 215

Query: 205 SFSGVSCSSAVCDRLENAGCH--AGRCRYE 232
           S++ V+C +  C  L+ A C    G C YE
Sbjct: 216 SYASVACDNPRCHDLDAAACRNSTGACLYE 245


>gi|147819672|emb|CAN76394.1| hypothetical protein VITISV_020864 [Vitis vinifera]
          Length = 507

 Score =  117 bits (294), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 96/328 (29%), Positives = 150/328 (45%), Gaps = 37/328 (11%)

Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFS 207
           +G YF +IG+G+P +  Y+ +D+GSDI+WV C  C +C  +SD      ++D   S +  
Sbjct: 75  AGLYFAKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRCPTKSDLGVDLTLYDMKASTTSD 134

Query: 208 GVSCSSAVCDRLEN--AGCHAG-RCRYEVSYGDGSYTKGTLALETLTIGR------TVVK 258
            V C    C   +    GC  G +C Y V YGDGS T G    + +   R      T   
Sbjct: 135 AVGCDDNFCSLYDGPLPGCKPGLQCLYSVLYGDGSSTTGYFVQDFVQYNRISGNFQTTPT 194

Query: 259 N--VAIGCGHKNQGMFVGAA----GLLGLGGGSMSLVGQLG--GQTGGAFSYCLVSRGTG 310
           N  V  GCG+K  G    ++    G+LG G  + S++ QL   G+    FS+CL +   G
Sbjct: 195 NGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCLDNVDGG 254

Query: 311 SSGSLVFGREALP------VGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRL 364
             G    G    P      + +  + ++   RA   Y V +  + VGG  + +  D F  
Sbjct: 255 --GIFAIGEVVEPKVRFLLMNSVMIVVLFLSRA--HYNVVMKEIEVGGDPLDVPSDAF-- 308

Query: 365 TQMGD-DGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSV 423
            + GD  G ++D+GT +   P   Y    +  ++Q  +L R   V    TC++ +G V  
Sbjct: 309 -ESGDRKGTIIDSGTTLAYFPQEVYVPLIEKILSQQPDL-RLHTVEQAFTCFDYTGNVDD 366

Query: 424 RVPTVSFYFSGGPVLTLPASNFLIPVDD 451
             PTV+ +F     LT+    +L  V +
Sbjct: 367 GFPTVTLHFDKSISLTVYPHEYLFQVKE 394


>gi|15242307|ref|NP_199325.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|9758987|dbj|BAB09497.1| chloroplast nucleoid DNA-binding protein-like [Arabidopsis
           thaliana]
 gi|332007824|gb|AED95207.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 491

 Score =  117 bits (294), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 103/396 (26%), Positives = 162/396 (40%), Gaps = 66/396 (16%)

Query: 156 YFVRIGVGSPPRSQYMVIDSGSDIVWVQCQ----PCSQCYK------QSDPVFDPADSAS 205
           Y + + +G+PP++  + +D+GSD+ WV C      C +CY       +S  VF P  S++
Sbjct: 83  YLITLNIGTPPQAVQVYLDTGSDLTWVPCGNLSFDCIECYDLKNNDLKSPSVFSPLHSST 142

Query: 206 FSGVSCSSAVC----------DRLENAGCHAGRC----------RYEVSYGDGSYTKGTL 245
               SC+S+ C          D    AGC                +  +YG+G    G L
Sbjct: 143 SFRDSCASSFCVEIHSSDNPFDPCAVAGCSVSMLLKSTCVRPCPSFAYTYGEGGLISGIL 202

Query: 246 ALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLV 305
             + L      V   + GC       +    G+ G G G +SL  QLG    G FS+C +
Sbjct: 203 TRDILKARTRDVPRFSFGCV---TSTYREPIGIAGFGRGLLSLPSQLGFLEKG-FSHCFL 258

Query: 306 S----RGTGSSGSLVFGREALPVGAA----WVPLVRNPRAPSFYYVGLSGLGVGGMRIP- 356
                     S  L+ G  AL +       + P++  P  P+ YY+GL  + +G    P 
Sbjct: 259 PFKFVNNPNISSPLILGASALSINLTDSLQFTPMLNTPMYPNSYYIGLESITIGTNITPT 318

Query: 357 -ISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSI---FD 412
            +   L +    G+ G+++D+GT  T LP P Y       +  T   PRA+       FD
Sbjct: 319 QVPLTLRQFDSQGNGGMLVDSGTTYTHLPEPFYSQLLTT-LQSTITYPRATETESRTGFD 377

Query: 413 TCY-------NLSGF---VSVRVPTVSFYFSGGPVLTLPASNFLI----PVDDAGTFCFA 458
            CY       NL+     V +  P+++F+F     L LP  N       P D +   C  
Sbjct: 378 LCYKVPCPNNNLTSLENDVMMIFPSITFHFLNNATLLLPQGNSFYAMSAPSDGSVVQCLL 437

Query: 459 FAPSPSG----LSIIGNIQQEGIQISFDGANGFVGF 490
           F     G      + G+ QQ+ +++ +D     +GF
Sbjct: 438 FQNMEDGDYGPAGVFGSFQQQNVKVVYDLEKERIGF 473


>gi|218199944|gb|EEC82371.1| hypothetical protein OsI_26705 [Oryza sativa Indica Group]
          Length = 642

 Score =  117 bits (293), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 98/370 (26%), Positives = 157/370 (42%), Gaps = 41/370 (11%)

Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQC----------YKQSDPVFDPAD 202
           +G Y  R+ +G+P +   +++DSGS + +V C  C QC           +  DP F P  
Sbjct: 89  NGYYTTRLYIGTPSQEFALIVDSGSTVTYVPCATCEQCGNHQSESPNIIEAHDPRFQPDL 148

Query: 203 SASFSGVSCS-SAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT---VVK 258
           S+++S V C+    CD          +C YE  Y + S + G L  + ++ G+      +
Sbjct: 149 SSTYSPVKCNVDCTCDN------ERSQCTYERQYAEMSSSSGVLGEDIMSFGKESELKPQ 202

Query: 259 NVAIGCGHKNQGMFVG--AAGLLGLGGGSMSLVGQL--GGQTGGAFSYCLVSRGTGSSGS 314
               GC +   G      A G++GLG G +S++ QL   G    +FS C      G  G+
Sbjct: 203 RAVFGCENTETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDVG-GGT 261

Query: 315 LVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVM 374
           +V G   +P     V    NP    +Y + L  + V G  + +   +F        G V+
Sbjct: 262 MVLG--GMPAPPDMVFSHSNPVRSPYYNIELKEIHVAGKALRLDPKIFN----SKHGTVL 315

Query: 375 DTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASG--VSIFDTCY-----NLSGFVSVRVPT 427
           D+GT    LP  A+ AF+DA   +  +L +  G   +  D C+     N+S    V  P 
Sbjct: 316 DSGTTYAYLPEQAFVAFKDAVTNKVNSLKKIRGPDPNYKDICFAGAGRNVSQLSEV-FPD 374

Query: 428 VSFYFSGGPVLTLPASNFLIPVDDA-GTFCF-AFAPSPSGLSIIGNIQQEGIQISFDGAN 485
           V   F  G  L+L   N+L       G +C   F       +++G I      +++D  N
Sbjct: 375 VDMVFGNGQKLSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDRHN 434

Query: 486 GFVGFGPNVC 495
             +GF    C
Sbjct: 435 EKIGFWKTNC 444


>gi|449456843|ref|XP_004146158.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
           sativus]
          Length = 547

 Score =  117 bits (293), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 103/362 (28%), Positives = 156/362 (43%), Gaps = 40/362 (11%)

Query: 156 YFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDP--------VFDPADSASFS 207
           Y+  + VG+P     + +D+GSD+ W+ C  C  C    +         ++ P +S++  
Sbjct: 130 YYAEVTVGTPGVPYLVALDTGSDLFWLPCD-CVNCITGLNTTQGPVNFNIYSPNNSSTSK 188

Query: 208 GVSCSSAVCDRLENAGCHAGRCRYEVSY-GDGSYTKGTLALETLTI------GRTVVKNV 260
            V CSS++C  L+     +  C Y+VSY  D + + G L  + L +       + V   +
Sbjct: 189 EVQCSSSLCSHLDQCSSPSDTCPYQVSYLSDNTSSTGYLVEDILHLTTNDVQSKPVNARI 248

Query: 261 AIGCGHKNQGMFVGAA---GLLGLGGGSMSLVGQL--GGQTGGAFSYCLVSRGTGSSGSL 315
            +GCG    G F+ +A   GL GLG  ++S+   L   G    +FS C    G    G +
Sbjct: 249 TLGCGKDQSGAFLSSAAPNGLFGLGIENVSVPSILANAGLISNSFSLCF---GPARMGRI 305

Query: 316 VFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMD 375
            FG +  P G    P     R P+ Y V ++ +GVGG       DL       D  V+ D
Sbjct: 306 EFGDKGSP-GQNETPFNLGRRHPT-YNVSITQIGVGGHI----SDL-------DVAVIFD 352

Query: 376 TGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSI-FDTCYNLS-GFVSVRVPTVSFYFS 433
           +GT+ T L  PAY  F D F +            I F+ CY LS    +   P ++    
Sbjct: 353 SGTSFTYLNDPAYSLFADKFASMVEEKQFTMNSDIPFENCYELSPNQTTFTYPLMNLTMK 412

Query: 434 GGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPN 493
           GG    +     LI  +    FC A A S S ++IIG     G  I FD     +G+  +
Sbjct: 413 GGGHFVINHPIVLISTESKRLFCLAIARSDS-INIIGQNFMTGYHIVFDREKMVLGWKES 471

Query: 494 VC 495
            C
Sbjct: 472 NC 473


>gi|6850312|gb|AAF29389.1|AC009999_9 Contains similarity to nucellin from Hordeum vulgare gb|U87148.
           ESTs gb|T22068, gb|F14251, gb|F14237, gb|F14242 come
           from this gene [Arabidopsis thaliana]
          Length = 388

 Score =  117 bits (293), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 112/379 (29%), Positives = 162/379 (42%), Gaps = 63/379 (16%)

Query: 100 HYHRHQHSFHARMQRDVKRVATLVR--RLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYF 157
            Y R Q S  A  + D +R  T++    L  GG                +G     G Y+
Sbjct: 38  RYPRLQGSLTALKEHDDRRQLTILAGIDLPLGG----------------TGRPDIPGLYY 81

Query: 158 VRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFSGVSCS 212
            +IG+G+P +S Y+ +D+GSDI+WV C  C QC ++S       +++  +S S   VSC 
Sbjct: 82  AKIGIGTPAKSYYVQVDTGSDIMWVNCIQCKQCPRRSTLGIELTLYNIDESDSGKLVSCD 141

Query: 213 SAVCDRLEN---AGCHAG-RCRYEVSYGDGSYTKGTLALETLTIG--------RTVVKNV 260
              C ++     +GC A   C Y   YGDGS T G    + +           +T   +V
Sbjct: 142 DDFCYQISGGPLSGCKANMSCPYLEIYGDGSSTAGYFVKDVVQYDSVAGDLKTQTANGSV 201

Query: 261 AIGCGHKNQGMFVGA-----AGLLGLGGGSMSLVGQLG--GQTGGAFSYCLVSRGTGSSG 313
             GCG +  G    +      G+LG G  + S++ QL   G+    F++CL  R  G  G
Sbjct: 202 IFGCGARQSGDLDSSNEEALDGILGFGKANSSMISQLASSGRVKKIFAHCLDGRNGG--G 259

Query: 314 SLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGD-DGV 372
               GR   P      PLV  P  P  Y V ++ + VG   + I  DLF   Q GD  G 
Sbjct: 260 IFAIGRVVQP-KVNMTPLV--PNQPH-YNVNMTAVQVGQEFLTIPADLF---QPGDRKGA 312

Query: 373 VMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFD---TCYNLSGFVSVRVPTVS 429
           ++D+GT +  LP   YE               A  V I D    C+  SG V    P V+
Sbjct: 313 IIDSGTTLAYLPEIIYEPLVKK--------EPALKVHIVDKDYKCFQYSGRVDEGFPNVT 364

Query: 430 FYFSGGPVLTLPASNFLIP 448
           F+F     L +   ++L P
Sbjct: 365 FHFENSVFLRVYPHDYLFP 383


>gi|224101015|ref|XP_002312106.1| predicted protein [Populus trichocarpa]
 gi|222851926|gb|EEE89473.1| predicted protein [Populus trichocarpa]
          Length = 440

 Score =  117 bits (293), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 102/375 (27%), Positives = 163/375 (43%), Gaps = 51/375 (13%)

Query: 158 VRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCD 217
           V +  G+P ++  MV+D+GS++ W+ C+         + +F+P  S +++ + CSS  C+
Sbjct: 69  VSLTAGTPLQNITMVLDTGSELSWLHCKK----EPNFNSIFNPLASKTYTKIPCSSPTCE 124

Query: 218 RLEN-----AGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHK---- 267
                      C   + C + +SY D S  +G LA ET  +G         GC       
Sbjct: 125 TRTRDLPLPVSCDPAKLCHFIISYADASSVEGNLAFETFRVGSVTGPATVFGCMDSGFSS 184

Query: 268 NQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGAA 327
           N        GL+G+  GS+S V Q+G +    FSYC+  R   SSG L+ G  +     +
Sbjct: 185 NSEEDAKTTGLMGMNRGSLSFVNQMGFR---KFSYCISDRD--SSGVLLLGEASF----S 235

Query: 328 WV-PLVRNPRA------PSF----YYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDT 376
           W+ PL   P        P F    Y V L G+ V    + + + +F     G    ++D+
Sbjct: 236 WLKPLNYTPLVEMSTPLPYFDRVAYSVQLEGIRVSDKVLSLPKSVFVPDHTGAGQTMVDS 295

Query: 377 GTAVTRLPTPAYEAFRDAFVAQTG------NLPRASGVSIFDTCYNLSGFVSV--RVPTV 428
           GT  T L  P Y A +  F+ QT       N PR       D CY +    +    +P V
Sbjct: 296 GTQFTFLLGPVYSALKQEFLLQTKGVLRVLNEPRYVFQGAMDLCYLIEPTRAALPNLPVV 355

Query: 429 SFYFSGGPVLTLPASNFL--IPVDDAG---TFCFAFAPSPS-GLS--IIGNIQQEGIQIS 480
           +  F G   +++     L  +P +  G    +CF F  S S G+   +IG+ QQ+ + + 
Sbjct: 356 NLMFRGAE-MSVSGQRLLYRVPGEVRGKDSVWCFTFGNSDSLGIESFVIGHHQQQNVWME 414

Query: 481 FDGANGFVGFGPNVC 495
           +D     +GF    C
Sbjct: 415 YDLEKSRIGFAEVRC 429


>gi|224144963|ref|XP_002325476.1| predicted protein [Populus trichocarpa]
 gi|222862351|gb|EEE99857.1| predicted protein [Populus trichocarpa]
          Length = 372

 Score =  117 bits (293), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 100/356 (28%), Positives = 159/356 (44%), Gaps = 33/356 (9%)

Query: 156 YFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFSGVS 210
           YF +IG+G+P +  Y+ +D+GSDI+WV C  C +C  +SD      ++DPA S S + VS
Sbjct: 27  YFAKIGLGNPSKDYYVQVDTGSDILWVNCIGCDKCPTKSDLGIKLTLYDPASSVSATRVS 86

Query: 211 CSSAVCDRLENA---GCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGH 266
           C    C    N     C     C+Y V YGDGS T G    + +   R V  N+  G  +
Sbjct: 87  CDDDFCTSTYNGLLPDCKKELPCQYNVVYGDGSSTAGYFVSDAVQFER-VTGNLQTGLSN 145

Query: 267 KNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPVGA 326
                  GA    GLG    +L G L     GAF++CL +   G  G    G    P   
Sbjct: 146 GTVTFGCGAQQSGGLGTSGEALDGIL-----GAFAHCLDNVNGG--GIFAIGELVSP-KV 197

Query: 327 AWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGD-DGVVMDTGTAVTRLPT 385
              P+V N    + Y V +  + VGG  + +  D+F     GD  G ++D+GT +  LP 
Sbjct: 198 NTTPMVPN---QAHYNVYMKEIEVGGTVLELPTDVF---DSGDRRGTIIDSGTTLAYLPE 251

Query: 386 PAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNF 445
             Y++  +   +Q   L   +    F  C+  SG V    P + F+F     LT+   ++
Sbjct: 252 VVYDSMMNEIRSQQPGLSLHTVEEQF-ICFKYSGNVDDGFPDIKFHFKDSLTLTVYPHDY 310

Query: 446 LIPVDDAGTFCFAF------APSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           L  + +   +CF +      +     ++++G++      + +D  N  +G+    C
Sbjct: 311 LFQISE-DIWCFGWQNGGMQSKDGRDMTLLGDLVLSNKLVLYDIENQAIGWTEYNC 365


>gi|356531884|ref|XP_003534506.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 482

 Score =  117 bits (292), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 116/420 (27%), Positives = 179/420 (42%), Gaps = 55/420 (13%)

Query: 119 VATLVRRLSGGGADAAKHEVQDFG--------TDVV---SGMDQGSGEYFVRIGVGSPPR 167
           V  +VR+  G   + A  +  D G         D+    +G    +G Y+ +IG+G  P 
Sbjct: 29  VFPVVRKFKGPAENLAAIKAHDAGRRGRFLSVVDLALGGNGRPTSTGLYYTKIGLG--PN 86

Query: 168 SQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFSGVSCSSAVCDRLEN- 221
             Y+ +D+GSD +WV C  C+ C K+S       ++DP  S +   V C    C    + 
Sbjct: 87  DYYVQVDTGSDTLWVNCVGCTTCPKKSGLGMELTLYDPNSSKTSKVVPCDDEFCTSTYDG 146

Query: 222 --AGCHAG-RCRYEVSYGDGSYTKGTLALETLTIG------RTVVKNVAI--GCGHKNQG 270
             +GC     C Y ++YGDGS T G+   + LT        RTV  N ++  GCG K  G
Sbjct: 147 PISGCKKDMSCPYSITYGDGSTTSGSYIKDDLTFDRVVGDLRTVPDNTSVIFGCGSKQSG 206

Query: 271 MFVGAA-----GLLGLGGGSMSLVGQL--GGQTGGAFSYCLVSRGTGSSGSLVFGREALP 323
                      G++G G  + S++ QL   G+    FS+CL +   G  G    G    P
Sbjct: 207 TLSSTTDTSLDGIIGFGQANSSVLSQLAAAGKVKRVFSHCLDTVNGG--GIFAIGEVVQP 264

Query: 324 VGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRL 383
                 PLV  PR  + Y V L  + V G  I +  D+F  T     G ++D+GT +  L
Sbjct: 265 -KVKTTPLV--PRM-AHYNVVLKDIEVAGDPIQLPTDIFDSTS--GRGTIIDSGTTLAYL 318

Query: 384 PTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVR--VPTVSFYFSGGPVLTLP 441
           P   Y+   +  +AQ   +        F TC++ S   S+    PTV F F  G  LT  
Sbjct: 319 PVSIYDQLLEKTLAQRSGMELYLVEDQF-TCFHYSDEKSLDDAFPTVKFTFEEGLTLTAY 377

Query: 442 ASNFLIPVDDAGTFCFAFAPSPS------GLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
             ++L P  +   +C  +  S +       L ++G++        +D  N  +G+    C
Sbjct: 378 PHDYLFPFKE-DMWCIGWQKSTAQTKDGKDLILLGDLVLTNKLFIYDLDNMSIGWTDYNC 436


>gi|326503488|dbj|BAJ86250.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 551

 Score =  117 bits (292), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 101/446 (22%), Positives = 182/446 (40%), Gaps = 51/446 (11%)

Query: 84  VHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKH-EVQ--- 139
           +  ++     N T +  +  +  +   R  R+   +    +R+  GG    K  +V+   
Sbjct: 110 MEEEEAQRERNETKSFLFQLYPKAHQGRGLREFGDIKLAAKRVDDGGRKVTKKLDVKGAA 169

Query: 140 DFGTD-----VVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQ-PCSQCYKQ 193
             GT+      + G     G+Y+  I VG+PPR  ++ +D+GSD+ W+QC  PC+ C K 
Sbjct: 170 SAGTNSTVLLPIKGNVFPDGQYYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKG 229

Query: 194 SDPVFDPADSASFSGVSCSSAVCDRL---ENAGCHAGRCRYEVSYGDGSYTKGTLALETL 250
             P++ PA       V    ++C  L   +N      +C YE+ Y D S + G LA + +
Sbjct: 230 PHPLYKPAKEKI---VPPRDSLCQELQGDQNYCETCKQCDYEIEYADRSSSMGVLAKDDM 286

Query: 251 TI-----GRTVVKNVAIGCGHKNQGMFVGAA----GLLGLGGGSMSLVGQLG--GQTGGA 299
            +     GR  + +   GC +  QG  + +     G+LGL   ++SL  QL   G     
Sbjct: 287 HLIATNGGREKL-DFVFGCAYDQQGQLLSSPAKTDGILGLSSAAISLPSQLASKGIISNV 345

Query: 300 FSYCLVSRGTGSSGSLVFGREALP-VGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPIS 358
           F +C ++R T   G +  G + +P  G  W P+   P   + Y+     +  G   +   
Sbjct: 346 FGHC-ITRETNGGGYMFLGDDYVPRWGMTWAPIRGGPD--NLYHTEAQKVNYGDQELHAG 402

Query: 359 EDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLS 418
             +          V+ D+G++ T LP   Y+   DA    + +  + S  +    C+   
Sbjct: 403 NSV---------QVIFDSGSSYTYLPEEMYKNLIDAIKEDSPSFVQDSSDTTLPLCWKAD 453

Query: 419 GFVSVRVPTVSFYFSGGPVLTLPASNFLIPVD-----DAGTFCFAFAPSPS----GLSII 469
             V      ++ +F G     +P +  ++P D     D G  C               I+
Sbjct: 454 FSVRSFFKPLNLHF-GRRWFVVPKTFTIVPDDYLIISDKGNVCLGLLNGTEINHGSTIIV 512

Query: 470 GNIQQEGIQISFDGANGFVGFGPNVC 495
           G++   G  + +D     +G+  + C
Sbjct: 513 GDVSLRGKLVVYDNERRQIGWANSEC 538


>gi|222637379|gb|EEE67511.1| hypothetical protein OsJ_24961 [Oryza sativa Japonica Group]
          Length = 641

 Score =  117 bits (292), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 98/370 (26%), Positives = 157/370 (42%), Gaps = 41/370 (11%)

Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQC----------YKQSDPVFDPAD 202
           +G Y  R+ +G+P +   +++DSGS + +V C  C QC           +  DP F P  
Sbjct: 88  NGYYTTRLYIGTPSQEFALIVDSGSTVTYVPCATCEQCGNHQSESPNIIEAHDPRFQPDL 147

Query: 203 SASFSGVSCS-SAVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRT---VVK 258
           S+++S V C+    CD          +C YE  Y + S + G L  + ++ G+      +
Sbjct: 148 SSTYSPVKCNVDCTCDN------ERSQCTYERQYAEMSSSSGVLGEDIMSFGKESELKPQ 201

Query: 259 NVAIGCGHKNQGMFVG--AAGLLGLGGGSMSLVGQL--GGQTGGAFSYCLVSRGTGSSGS 314
               GC +   G      A G++GLG G +S++ QL   G    +FS C      G  G+
Sbjct: 202 RAVFGCENTETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDVG-GGT 260

Query: 315 LVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVM 374
           +V G   +P     V    NP    +Y + L  + V G  + +   +F        G V+
Sbjct: 261 MVLG--GMPAPPDMVFSHSNPVRSPYYNIELKEIHVAGKALRLDPKIFN----SKHGTVL 314

Query: 375 DTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASG--VSIFDTCY-----NLSGFVSVRVPT 427
           D+GT    LP  A+ AF+DA   +  +L +  G   +  D C+     N+S    V  P 
Sbjct: 315 DSGTTYAYLPEQAFVAFKDAVTNKVNSLKKIRGPDPNYKDICFAGAGRNVSQLSEV-FPD 373

Query: 428 VSFYFSGGPVLTLPASNFLIPVDDA-GTFCF-AFAPSPSGLSIIGNIQQEGIQISFDGAN 485
           V   F  G  L+L   N+L       G +C   F       +++G I      +++D  N
Sbjct: 374 VDMVFGNGQKLSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDRHN 433

Query: 486 GFVGFGPNVC 495
             +GF    C
Sbjct: 434 EKIGFWKTNC 443


>gi|125572774|gb|EAZ14289.1| hypothetical protein OsJ_04213 [Oryza sativa Japonica Group]
          Length = 492

 Score =  117 bits (292), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 100/329 (30%), Positives = 144/329 (43%), Gaps = 23/329 (6%)

Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCS 212
           +G Y +   VG+PP+    V+D  SD VW+QC  C+ C          AD+ + +     
Sbjct: 94  TGMYVLSFSVGTPPQVVTGVLDITSDFVWMQCSACATCG---------ADAPAATSAPPF 144

Query: 213 SAVCDRLENAGCHAGRCRYEVSYGDGS--YTKGTLALETLTIGRTVVKNVAIGCGHKNQG 270
            A     +        C Y   YG G+   T G LA++           V  GC    +G
Sbjct: 145 YAFLSFHDTRAPTTPPCGYSYVYGGGAANTTAGLLAVDAFAFATVRADGVIFGCAVATEG 204

Query: 271 MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLV-FGREALPVGAAWV 329
                 G++GLG G +S V QL     G FSY L        GS + F  +A P  +  V
Sbjct: 205 DI---GGVIGLGRGELSPVSQL---QIGRFSYYLAPDDAVDVGSFILFLDDAKPRTSRAV 258

Query: 330 --PLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPA 387
             PLV +  + S YYV L+G+ V G  + I    F L   G  GVV+     VT L   A
Sbjct: 259 STPLVASRASRSLYYVELAGIRVDGEDLAIPRGTFDLQADGSGGVVLSITIPVTFLDAGA 318

Query: 388 YEAFRDAFVAQTGNLPRASGVSI-FDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFL 446
           Y+  R A  ++   L  A G  +  D CY      + +VP+++  F+GG V+ L   N+ 
Sbjct: 319 YKVVRQAMASKI-ELRAADGSELGLDLCYTSESLATAKVPSMALVFAGGAVMELEMGNYF 377

Query: 447 IPVDDAGTFCFAFAPSPSGL-SIIGNIQQ 474
                 G  C    PSP+G  S++G++ Q
Sbjct: 378 YMDSTTGLECLTILPSPAGDGSLLGSLIQ 406


>gi|16209647|gb|AAL14384.1| AT3g52500/F22O6_120 [Arabidopsis thaliana]
          Length = 469

 Score =  117 bits (292), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 113/394 (28%), Positives = 161/394 (40%), Gaps = 66/394 (16%)

Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQP---CSQC-YKQSDPV----FDPADSAS 205
           G Y V +  G+P ++   V D+GS +V + C     CS C +   DP     F P +S+S
Sbjct: 88  GGYSVSLSFGTPSQTIPFVFDTGSSLVCLPCTSRYLCSGCDFSGLDPTLIPRFIPKNSSS 147

Query: 206 FSGVSCSSAVCDRL------------ENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIG 253
              + C S  C  L                C  G   Y + YG GS T G L  E L   
Sbjct: 148 SKIIGCQSPKCQFLYGPNVQCRGCDPNTRNCTVGCPPYILQYGLGS-TAGVLITEKLDFP 206

Query: 254 RTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR------ 307
              V +  +GC   +       AG+ G G G +SL  Q+  +    FS+CLVSR      
Sbjct: 207 DLTVPDFVVGCSIISTRQ---PAGIAGFGRGPVSLPSQMNLK---RFSHCLVSRRFDDTN 260

Query: 308 -------GTGS---SGSLVFGREALPVGAAWVPLVRNPRAPS-----FYYVGLSGLGVGG 352
                   TGS   SGS          G  + P  +NP   +     +YY+ L  + VG 
Sbjct: 261 VTTDLDLDTGSGHNSGSKT-------PGLTYTPFRKNPNVSNKAFLEYYYLNLRRIYVGR 313

Query: 353 MRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSI-- 410
             + I          GD G ++D+G+  T +  P +E   + F +Q  N  R   +    
Sbjct: 314 KHVKIPYKYLAPGTNGDGGSIVDSGSTFTFMERPVFELVAEEFASQMSNYTREKDLEKET 373

Query: 411 -FDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAP----SPSG 465
               C+N+SG   V VP + F F GG  L LP SN+   V +  T C         +PSG
Sbjct: 374 GLGPCFNISGKGDVTVPELIFEFKGGAKLELPLSNYFTFVGNTDTVCLTVVSDKTVNPSG 433

Query: 466 LS----IIGNIQQEGIQISFDGANGFVGFGPNVC 495
            +    I+G+ QQ+   + +D  N   GF    C
Sbjct: 434 GTGPAIILGSFQQQNYLVEYDLENDRFGFAKKKC 467


>gi|297720449|ref|NP_001172586.1| Os01g0776900 [Oryza sativa Japonica Group]
 gi|255673740|dbj|BAH91316.1| Os01g0776900 [Oryza sativa Japonica Group]
          Length = 381

 Score =  117 bits (292), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 85/269 (31%), Positives = 126/269 (46%), Gaps = 32/269 (11%)

Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFSG 208
           G YF R+ +GSPP+  ++ ID+GSDI+WV C PC+ C   S        F+P  S++ S 
Sbjct: 89  GLYFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSK 148

Query: 209 VSCSSAVCD---RLENAGCHAGR---CRYEVSYGDGSYTKGTLALETL----TIGRTVVK 258
           + CS   C    +   A C       C Y  +YGDGS T G    +T+     +G     
Sbjct: 149 IPCSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDTVMGNEQTA 208

Query: 259 N----VAIGCGHKNQGMFV----GAAGLLGLGGGSMSLVGQLG--GQTGGAFSYCLVSRG 308
           N    +  GC +   G          G+ G G   +S+V QL   G +   FS+CL  +G
Sbjct: 209 NSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCL--KG 266

Query: 309 TGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMG 368
           + + G ++   E +  G  + PLV  P  P  Y + L  + V G ++PI   LF  T   
Sbjct: 267 SDNGGGILVLGEIVEPGLVYTPLV--PSQP-HYNLNLESIVVNGQKLPIDSSLF--TTSN 321

Query: 369 DDGVVMDTGTAVTRLPTPAYEAFRDAFVA 397
             G ++D+GT +  L   AY+ F +A  A
Sbjct: 322 TQGTIVDSGTTLAYLADGAYDPFVNAITA 350


>gi|356553263|ref|XP_003544977.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 445

 Score =  116 bits (291), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 99/363 (27%), Positives = 163/363 (44%), Gaps = 45/363 (12%)

Query: 158 VRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPV--FDPADSASFSGVSCSSAV 215
           V + +G+PP+ Q MV+D+GS + W+      QC+ ++ P   FDP+ S+SF  + C+  +
Sbjct: 90  VTLPIGTPPQPQQMVLDTGSQLSWI------QCHNKTPPTASFDPSLSSSFYVLPCTHPL 143

Query: 216 C-----DRLENAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKN 268
           C     D      C   R C Y   Y DG+Y +G L  E L    +     + +GC  ++
Sbjct: 144 CKPRVPDFTLPTTCDQNRLCHYSYFYADGTYAEGNLVREKLAFSPSQTTPPLILGCSSES 203

Query: 269 QGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGS-----SGSLVFGREALP 323
           +     A G+LG+  G +S   Q        FSYC+ +R   +     +GS   G     
Sbjct: 204 R----DARGILGMNLGRLSFPFQ---AKVTKFSYCVPTRQPANNNNFPTGSFYLGNNPNS 256

Query: 324 VGAAWVPLVRNPRA-------PSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDT 376
               +V ++  P++       P  Y V + G+ +GG ++ I   +FR    G    ++D+
Sbjct: 257 ARFRYVSMLTFPQSQRMPNLDPLAYTVPMQGIRIGGRKLNIPPSVFRPNAGGSGQTMVDS 316

Query: 377 GTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIF----DTCYNLSGFVSVRVP-TVSFY 431
           G+  T L   AY+  R+  +   G  PR     ++    D C++ +     R+   V+F 
Sbjct: 317 GSEFTFLVDVAYDRVREEIIRVLG--PRVKKGYVYGGVADMCFDGNAMEIGRLLGDVAFE 374

Query: 432 FSGGPVLTLPASNFLIPVDDAGTFCFAFAPSP---SGLSIIGNIQQEGIQISFDGANGFV 488
           F  G  + +P    L  V   G  C     S    +  +IIGN  Q+ + + FD AN  +
Sbjct: 375 FEKGVEIVVPKERVLADV-GGGVHCVGIGRSERLGAASNIIGNFHQQNLWVEFDLANRRI 433

Query: 489 GFG 491
           GFG
Sbjct: 434 GFG 436


>gi|356537928|ref|XP_003537458.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 445

 Score =  116 bits (291), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 109/373 (29%), Positives = 165/373 (44%), Gaps = 45/373 (12%)

Query: 158 VRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVC- 216
           V + VG+PP+S  MV+D+GS++ W+ C+      +  + VF+P  S+S++ + C S +C 
Sbjct: 72  VSLTVGTPPQSVTMVLDTGSELSWLHCKK----QQNINSVFNPHLSSSYTPIPCMSPICK 127

Query: 217 ----DRLENAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHK---- 267
               D L    C +   C   VSY D +  +G LA +T  I  +    +  G        
Sbjct: 128 TRTRDFLIPVSCDSNNLCHVTVSYADFTSLEGNLASDTFAISGSGQPGIIFGSMDSGFSS 187

Query: 268 NQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGREALPV--G 325
           N        GL+G+  GS+S V Q+G      FSYC+   G  +SG L+FG         
Sbjct: 188 NANEDSKTTGLMGMNRGSLSFVTQMGFP---KFSYCI--SGKDASGVLLFGDATFKWLGP 242

Query: 326 AAWVPLVR-NPRAPSF----YYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAV 380
             + PLV+ N   P F    Y V L G+ VG   + + +++F     G    ++D+GT  
Sbjct: 243 LKYTPLVKMNTPLPYFDRVAYTVRLMGIRVGSKPLQVPKEIFAPDHTGAGQTMVDSGTRF 302

Query: 381 TRLPTPAYEAFRDAFVAQTGNL------PRASGVSIFDTCYNL-SGFVSVRVPTVSFYFS 433
           T L    Y A R+ FVAQT  +      P        D C+ +  G V   VP V+  F 
Sbjct: 303 TFLLGSVYTALRNEFVAQTRGVLTLLEDPNFVFEGAMDLCFRVRRGGVVPAVPAVTMVFE 362

Query: 434 GGPVLTLPASNFLIPVDDAG--------TFCFAFAPSP-SGLS--IIGNIQQEGIQISFD 482
           G   +++     L  V   G         +C  F  S   G+   +IG+  Q+ + + FD
Sbjct: 363 GAE-MSVSGERLLYRVGGDGDVAKGNGDVYCLTFGNSDLLGIEAYVIGHHHQQNVWMEFD 421

Query: 483 GANGFVGFGPNVC 495
             N  VGF    C
Sbjct: 422 LVNSRVGFADTKC 434


>gi|218185380|gb|EEC67807.1| hypothetical protein OsI_35373 [Oryza sativa Indica Group]
          Length = 418

 Score =  116 bits (291), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 103/374 (27%), Positives = 168/374 (44%), Gaps = 46/374 (12%)

Query: 145 VVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQ-PCSQCYKQSDPVFDPADS 203
           ++SG    +G Y+V + +G P +  ++ +D+GSD+ W+QC  PC  C K   P++ P  +
Sbjct: 46  LLSGDVYPTGHYYVTMNIGDPAKPYFLDVDTGSDLTWLQCDAPCQSCNKVPHPLYRPTKN 105

Query: 204 ASFSGVSCSSAVCDRLE-----NAGCHA-GRCRYEVSYGDGSYTKGTLALETLTI----G 253
                V C++++C  L      N  C    +C Y++ Y D + + G L  ++ ++     
Sbjct: 106 KL---VPCANSICTALHSGSSPNKKCTTQQQCDYQIKYTDKASSLGVLVTDSFSLPLRNK 162

Query: 254 RTVVKNVAIGCGHKNQGMFVGAA-----GLLGLGGGSMSLVGQLGGQ--TGGAFSYCLVS 306
             V  +++ GCG+  Q    GAA     GLLGLG GS+SL+ QL  Q  T     +CL +
Sbjct: 163 SNVRPSLSFGCGYDQQVGKNGAAPATTDGLLGLGRGSVSLLSQLKQQGITKNVLGHCLST 222

Query: 307 RGTGSSGSLVFGREALPVG-AAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLT 365
            G    G L FG + +P     WVP+VR+    ++Y  G + L     R  +S       
Sbjct: 223 SG---GGFLFFGDDMVPTSRVTWVPMVRSTSG-NYYSPGSATLYFD--RRSLSTKPME-- 274

Query: 366 QMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQ-TGNLPRASGVSIFDTCYNLSGFVSVR 424
                 VV D+G+  T      Y+A   A     + +L + S  S+         F SV 
Sbjct: 275 ------VVFDSGSTYTYFSAQPYQATISAIKGSLSKSLKQVSDPSLPLCWKGQKAFKSVS 328

Query: 425 -----VPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAF---APSPSGLSIIGNIQQEG 476
                  ++ F F    V+ +P  N+LI V   G  C      + +    SIIG+I  + 
Sbjct: 329 DVKKDFKSLQFIFGKNAVMEIPPENYLI-VTKNGNVCLGILDGSAAKLSFSIIGDITMQD 387

Query: 477 IQISFDGANGFVGF 490
             + +D     +G+
Sbjct: 388 QMVIYDNEKAQLGW 401


>gi|357444933|ref|XP_003592744.1| hypothetical protein MTR_1g115080, partial [Medicago truncatula]
 gi|355481792|gb|AES62995.1| hypothetical protein MTR_1g115080, partial [Medicago truncatula]
          Length = 65

 Score =  116 bits (291), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 54/65 (83%), Positives = 59/65 (90%)

Query: 431 YFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGF 490
           YF GGP+LTLPA NFLIPVD  GTFCFAFAPS SGLSIIGNIQQEGI+IS DGANG++GF
Sbjct: 1   YFLGGPILTLPARNFLIPVDSVGTFCFAFAPSSSGLSIIGNIQQEGIEISVDGANGYIGF 60

Query: 491 GPNVC 495
           GPN+C
Sbjct: 61  GPNIC 65


>gi|194707292|gb|ACF87730.1| unknown [Zea mays]
          Length = 216

 Score =  116 bits (291), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 75/215 (34%), Positives = 103/215 (47%), Gaps = 5/215 (2%)

Query: 286 MSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVG 344
           MSL+ Q G +  G FSYCL S R    SGSL  G    P    + PL+ NP  PS YYV 
Sbjct: 1   MSLLSQTGSRYNGVFSYCLPSYRSYYFSGSLRLGAAGQPRNVRYTPLLTNPHRPSLYYVN 60

Query: 345 LSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPR 404
           ++GL VG   + +    F        G V+D+GT +TR   P Y A R+ F  Q      
Sbjct: 61  VTGLSVGRTWVKVPAGSFAFDPATGAGTVIDSGTVITRWTAPVYAALREEFRRQVAAPSG 120

Query: 405 ASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPS 464
            + +  FDTC+N     +   P V+ +  GG  LTLP  N LI        C A A +P 
Sbjct: 121 YTSLGAFDTCFNTDEVAAGGAPPVTLHMDGGVDLTLPMENTLIHSSATPLACLAMAEAPQ 180

Query: 465 ----GLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
                ++++ N+QQ+ +++  D A   VGF    C
Sbjct: 181 NVNAVVNVVANLQQQNVRVVVDVAGSRVGFAREPC 215


>gi|449495082|ref|XP_004159729.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
           1-like [Cucumis sativus]
          Length = 524

 Score =  116 bits (290), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 103/362 (28%), Positives = 156/362 (43%), Gaps = 40/362 (11%)

Query: 156 YFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDP--------VFDPADSASFS 207
           Y+  + VG+P     + +D+GSD+ W+ C  C  C    +         ++ P +S++  
Sbjct: 107 YYAEVTVGTPGVPYLVALDTGSDLFWLPCD-CVNCITGLNTTQGPVNFNIYSPNNSSTSK 165

Query: 208 GVSCSSAVCDRLENAGCHAGRCRYEVSY-GDGSYTKGTLALETLTI------GRTVVKNV 260
            V CSS++C  L+     +  C Y+VSY  D + + G L  + L +       + V   +
Sbjct: 166 EVQCSSSLCSHLDQCSSPSDTCPYQVSYLSDNTSSTGYLVEDILHLTTNDVQSKPVNARI 225

Query: 261 AIGCGHKNQGMFVGAA---GLLGLGGGSMSLVGQL--GGQTGGAFSYCLVSRGTGSSGSL 315
            +GCG    G F+ +A   GL GLG  ++S+   L   G    +FS C    G    G +
Sbjct: 226 TLGCGKDQSGAFLSSAAPNGLFGLGIENVSVPSILANAGLISNSFSLCF---GPARMGRI 282

Query: 316 VFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMD 375
            FG +  P G    P     R P+ Y V ++ +GVGG       DL       D  V+ D
Sbjct: 283 EFGDKGSP-GQNETPFNLGRRHPT-YNVSITQIGVGGHI----SDL-------DVAVIFD 329

Query: 376 TGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSI-FDTCYNLS-GFVSVRVPTVSFYFS 433
           +GT+ T L  PAY  F D F +            I F+ CY LS    +   P ++    
Sbjct: 330 SGTSFTYLNDPAYSLFADKFASMVEEKQFTMNSDIPFENCYELSPNQTTFTYPLMNLTMK 389

Query: 434 GGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPN 493
           GG    +     LI  +    FC A A S S ++IIG     G  I FD     +G+  +
Sbjct: 390 GGGHFVINHPIVLISTESKRLFCLAIARSDS-INIIGQNFMTGYHIVFDREKMVLGWKES 448

Query: 494 VC 495
            C
Sbjct: 449 NC 450


>gi|222631382|gb|EEE63514.1| hypothetical protein OsJ_18330 [Oryza sativa Japonica Group]
          Length = 464

 Score =  116 bits (290), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 116/392 (29%), Positives = 165/392 (42%), Gaps = 75/392 (19%)

Query: 172 VIDSGSDIVWVQCQPC----------SQCYKQSDPVFDPADSASFSGVSCSS---AVCD- 217
           V+D+GSD+VW QC  C            C+ Q+ P ++ + S +   V C     A+C  
Sbjct: 77  VVDTGSDLVWTQCSTCRLPAVAAAGGGGCFPQNLPYYNFSLSRTARAVPCDDDDGALCGV 136

Query: 218 RLENAGCHAG------RCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQ-- 269
             E AGC  G       C    SYG G    G L  +  T   +    +A GC  + +  
Sbjct: 137 APETAGCARGGGSGDDACVVAASYGAG-VALGVLGTDAFTFPSSSSVTLAFGCVSQTRIS 195

Query: 270 -GMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS--RGTGSSGSLVFG-------- 318
            G   GA+G++GLG G++SLV QL       FSYCL    R T S   L  G        
Sbjct: 196 PGALNGASGIIGLGRGALSLVSQLNATE---FSYCLTPYFRDTVSPSHLFVGDGELAGLR 252

Query: 319 -------REALPVGAAWVPLVRNPR-AP--SFYYVGLSGLGVGGMRIPISEDLFRLTQMG 368
                      PV    VP  +NP+ +P  +FYY+ L GL  G   + +    F L +  
Sbjct: 253 AAAGGGGGGGAPVTT--VPFAKNPKDSPFSTFYYLPLVGLAAGNATVALPAGAFDLREAA 310

Query: 369 DD----GVVMDTGTAVTRLPTPAYEAFRDAFVAQ---TGNL--PRASGVSIFDTCYNL-- 417
                 G ++D+G+  TRL  PA+ A       Q   +G+L  P A      + C     
Sbjct: 311 PKVWAGGALIDSGSPFTRLVDPAHRALTKELARQLRGSGSLVPPPAKLGGALELCVEAGD 370

Query: 418 --SGFVSVRVPTVSFYFS----GGPVLTLPASNFLIPVDDAGTFCFAFAPSPSG------ 465
                 +  VP +   F     GG  L +PA  +   V +A T+C A   S SG      
Sbjct: 371 DGDSLAAAAVPPLVLRFDDGVGGGRELVIPAEKYWARV-EASTWCMAVVSSASGNATLPT 429

Query: 466 --LSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
              +IIGN  Q+ +++ +D ANG + F P  C
Sbjct: 430 NETTIIGNFMQQDMRVLYDLANGLLSFQPANC 461


>gi|168063189|ref|XP_001783556.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162664943|gb|EDQ51645.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 414

 Score =  116 bits (290), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 110/379 (29%), Positives = 162/379 (42%), Gaps = 50/379 (13%)

Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQ-PCSQCYKQSDPVFDPADSASFSGVSCS 212
           G Y++ + +G+P +  Y+ +D+GSD+ W+QC  PC  C      ++DP  +     V C 
Sbjct: 29  GLYYMAMRIGNPAKLYYLDMDTGSDLTWLQCDAPCRSCAVGPHGLYDPKRARV---VDCR 85

Query: 213 SAVCDRLENAG---CHAG--RCRYEVSYGDGSYTKGTLALETLTI----GRTVVKNVAIG 263
              C +++  G   C     +C YEV Y DGS T G L  +T+T+    G        IG
Sbjct: 86  RPTCAQVQRGGQFTCSGDVRQCDYEVDYVDGSSTMGILVEDTITLVLTNGTRFQTRAVIG 145

Query: 264 CGHKNQGMFVGAA----GLLGLGGGSMSLVGQLG--GQTGGAFSYCLVSRGTGSSGSLVF 317
           CG+  QG    A     G++GL    +SL  QL   G       +CL   G+   G L F
Sbjct: 146 CGYDQQGTLAKAPAVTDGVIGLSSSKISLPSQLAAKGIANNVIGHCLAG-GSNGGGYLFF 204

Query: 318 GREALP-VGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDD--GVVM 374
           G   +P +G  W P++  P     Y   L  +  GG       ++  L    DD  G + 
Sbjct: 205 GDTLVPALGMTWTPMIGRPLVEG-YQARLRSIKYGG-------EVLELEGTTDDVGGAMF 256

Query: 375 DTGTAVTRLPTPAYEAFRDAFV--AQTGNLPRASGVSIFDTCYN-LSGFVSV-------R 424
           D+GT+ T L   AY A   A V  AQ   L R    +    C+   S F SV       +
Sbjct: 257 DSGTSFTYLVPNAYTAVLSAVVRQAQRSGLERIKTDTTLPFCWRGPSPFESVADVSAYFK 316

Query: 425 VPTVSF----YFSGGPVLTLPASNFLIPVDDAGTFCF----AFAPSPSGLSIIGNIQQEG 476
             T+ F    ++S G +L L    +LI V   G  C     A   S    +I+G+I   G
Sbjct: 317 TVTLDFGGSTWWSSGKLLELSPEGYLI-VSTQGNVCLGVLDASVASLEVTNILGDISMRG 375

Query: 477 IQISFDGANGFVGFGPNVC 495
             + +D     +G+    C
Sbjct: 376 YLVVYDNMREQIGWVRRNC 394


>gi|297849132|ref|XP_002892447.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297338289|gb|EFH68706.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 493

 Score =  116 bits (290), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 108/367 (29%), Positives = 164/367 (44%), Gaps = 33/367 (8%)

Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFSG 208
           G Y+ ++ +G+PPR   + ID+GSD++WV C  C+ C K S+       FDP  S+S S 
Sbjct: 82  GLYYTKVKLGTPPREFNVQIDTGSDVLWVSCTSCNGCPKTSELQIQLSFFDPGVSSSASL 141

Query: 209 VSCSSAVC--DRLENAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAI--- 262
           VSCS   C  +    +GC     C Y   YGDGS T G    + ++    +   +AI   
Sbjct: 142 VSCSDRRCYSNFQTESGCSPNNLCSYSFKYGDGSGTSGFYISDFMSFDTVITSTLAINSS 201

Query: 263 -----GCGHKNQGMFV----GAAGLLGLGGGSMSLVGQLG--GQTGGAFSYCLVSRGTGS 311
                GC +   G          G+ GLG GS+S++ QL   G     FS+CL    +G 
Sbjct: 202 APFVFGCSNLQTGDLQRPRRAVDGIFGLGQGSLSVISQLAVQGLAPRVFSHCLKGDKSG- 260

Query: 312 SGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDG 371
            G +V G+   P    + PLV  P  P  Y V L  + V G  +PI   +F +     DG
Sbjct: 261 GGIMVLGQIKRP-DTVYTPLV--PSQPH-YNVNLQSIAVNGQILPIDPSVFTIAT--GDG 314

Query: 372 VVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFY 431
            ++DTGT +  LP  AY  F  A         R      +  C+ ++       P VS  
Sbjct: 315 TIIDTGTTLAYLPDEAYSPFIQAIANAVSQYGRPITYESYQ-CFEITAGDVDVFPEVSLS 373

Query: 432 FSGGPVLTLPASNFLIPVDDAGT--FCFAFAP-SPSGLSIIGNIQQEGIQISFDGANGFV 488
           F+GG  + L    +L     +G+  +C  F   S   ++I+G++  +   + +D     +
Sbjct: 374 FAGGASMVLRPHAYLQIFSSSGSSIWCIGFQRMSHRRITILGDLVLKDKVVVYDLVRQRI 433

Query: 489 GFGPNVC 495
           G+    C
Sbjct: 434 GWAEYDC 440


>gi|255549236|ref|XP_002515672.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223545215|gb|EEF46724.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 492

 Score =  115 bits (289), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 102/382 (26%), Positives = 162/382 (42%), Gaps = 47/382 (12%)

Query: 147 SGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPA 201
           SG     G Y+ ++G+G+P +  Y+ +D+GSDI+WV C  C +C + S       +++  
Sbjct: 77  SGRPDTVGLYYAKVGIGTPSKDYYVQVDTGSDIMWVNCIQCRECPRTSSLGMELTLYNIK 136

Query: 202 DSASFSGVSCSSAVCDRLEN---AGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGR--- 254
           DS S   V C    C  +     +GC A   C Y   YGDGS T G    + +   R   
Sbjct: 137 DSVSGKLVPCDEEFCYEVNGGPLSGCTANMSCPYLEIYGDGSSTAGYFVKDVVQYDRVSG 196

Query: 255 ---TVVKN--VAIGCGHKNQGMF-----VGAAGLLGLGGGSMSLVGQLGG--QTGGAFSY 302
              T   N  V  GCG +  G           G+LG G  + S++ QL    +    F++
Sbjct: 197 DLQTTSSNGSVIFGCGARQSGDLGPTSEEALDGILGFGKSNSSMISQLAATRKVKKIFAH 256

Query: 303 CLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGG--MRIPISED 360
           CL   G    G    G    P      PL+  P  P  Y V ++ + VG   + +P  E 
Sbjct: 257 CL--DGINGGGIFAIGHVVQP-KVNMTPLI--PNQPH-YNVNMTAVQVGEDFLHLPTEE- 309

Query: 361 LFRLTQMGD-DGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSG 419
                + GD  G ++D+GT +  LP   YE      ++Q  +L +   V    TC+  SG
Sbjct: 310 ----FEAGDRKGAIIDSGTTLAYLPEIVYEPLVSKIISQQPDL-KVHIVRDEYTCFQYSG 364

Query: 420 FVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPS------PSGLSIIGNIQ 473
            V    P V+F+F     L +    +L P +  G +C  +  S         ++++G++ 
Sbjct: 365 SVDDGFPNVTFHFENSVFLKVHPHEYLFPFE--GLWCIGWQNSGMQSRDRRNMTLLGDLV 422

Query: 474 QEGIQISFDGANGFVGFGPNVC 495
                + +D  N  +G+    C
Sbjct: 423 LSNKLVLYDLENQAIGWTEYNC 444


>gi|225440731|ref|XP_002280866.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 469

 Score =  115 bits (289), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 112/389 (28%), Positives = 163/389 (41%), Gaps = 55/389 (14%)

Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSS 213
           G Y V +  G+P ++   V+D+GS +VW  C     C + S P  DPA   +F     SS
Sbjct: 88  GGYSVSLSFGTPSQTLSFVMDTGSSLVWFPCTSRYVCTRCSFPNIDPAKIPTFIPKLSSS 147

Query: 214 AV-----------------------CDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETL 250
           A                        CD+  +A C      Y + YG G+ T G L LE+L
Sbjct: 148 AKIVGCLNPKCGFVMDSEVRTRCPGCDQ-NSANCTKACPTYAIQYGLGT-TVGLLLLESL 205

Query: 251 TIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR--- 307
                   +  +GC   +       +G+ G G G  SL  Q+G +    FSYCL+S    
Sbjct: 206 VFAERTEPDFVVGCSILSSRQ---PSGIAGFGRGPSSLPKQMGLK---KFSYCLLSHRFD 259

Query: 308 --GTGSSGSLVFG---REALPVGAAWVPLVRNPRA-----PSFYYVGLSGLGVGGMRIPI 357
                S  +L  G   ++    G ++ P  +NP +       +YYV L  + VG  R+ +
Sbjct: 260 DSPKSSKMTLYVGPDSKDDKTGGLSYTPFRKNPVSSNSAFKEYYYVTLRHIIVGDKRVKV 319

Query: 358 SEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGV---SIFDTC 414
                     G+ G ++D+G+  T +  P +EA    F  Q  N  RA+ V   S    C
Sbjct: 320 PYSFMVAGSDGNGGTIVDSGSTFTFMEKPVFEAVATEFDRQMANYTRAADVEALSGLKPC 379

Query: 415 YNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSP-------SGLS 467
           +NLSG  SV +P++ F F GG  + LP +N+   V D    C     +        SG S
Sbjct: 380 FNLSGVGSVALPSLVFQFKGGAKMELPVANYFSLVGDLSVLCLTIVSNEAVGSTLSSGPS 439

Query: 468 II-GNIQQEGIQISFDGANGFVGFGPNVC 495
           II GN Q +     +D  N   GF    C
Sbjct: 440 IILGNYQSQNFYTEYDLENERFGFRRQRC 468


>gi|357157325|ref|XP_003577760.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
          Length = 413

 Score =  115 bits (289), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 107/373 (28%), Positives = 163/373 (43%), Gaps = 60/373 (16%)

Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQ-PCSQCYKQSDPVFDPADSASFSGVSC 211
           +G Y+V + +G P +  ++ ID+GSD+ W+QC  PC  C K   P++ P  +     V C
Sbjct: 49  TGHYYVTMNIGDPAKPYFLDIDTGSDLTWLQCDAPCQSCNKVPHPLYKPTKNKL---VPC 105

Query: 212 SSAVCDRLE-----NAGCHA-GRCRYEVSYGDGSYTKGTLALETLTI----GRTVVKNVA 261
           ++++C  L      N  C    +C Y++ Y D + + G L  +  T+      +V  +  
Sbjct: 106 AASICTTLHSAQSPNKKCAVPQQCDYQIKYTDSASSLGVLVTDNFTLPLRNSSSVRPSFT 165

Query: 262 IGCGH-----KNQGMFVGAAGLLGLGGGSMSLVGQLG--GQTGGAFSYCLVSRGTGSSGS 314
            GCG+     KN  +     GLLGLG GS+SLV QL   G T     +CL + G    G 
Sbjct: 166 FGCGYDQQVGKNGVVQATTDGLLGLGKGSVSLVSQLKVLGITKNVLGHCLSTNG---GGF 222

Query: 315 LVFGREALPVG-AAWVPLVR-------NPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQ 366
           L FG   +P   A WVP+VR       +P + + Y+   S LGV  M             
Sbjct: 223 LFFGDNVVPTSRATWVPMVRSTSGNYYSPGSGTLYFDRRS-LGVKPME------------ 269

Query: 367 MGDDGVVMDTGTAVTRLPTPAYEAFRDAFVA-QTGNLPRASGVSIFDTCYNLSGFVSVR- 424
                VV D+G+  T      Y+A   A  A  + +L + S  S+         F SV  
Sbjct: 270 -----VVFDSGSTYTYFAAQPYQATVSALKAGLSKSLQQVSDPSLPLCWKGQKVFKSVSD 324

Query: 425 ----VPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAF---APSPSGLSIIGNIQQEGI 477
                 ++   F    VL +P  N+LI V   G  C      + +    +IIG+I  +  
Sbjct: 325 VKNDFKSLFLSFVKNSVLEIPPENYLI-VTKNGNACLGILDGSAAKLTFNIIGDITMQDQ 383

Query: 478 QISFDGANGFVGF 490
            I +D   G +G+
Sbjct: 384 LIIYDNERGQLGW 396


>gi|357118734|ref|XP_003561105.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 404

 Score =  115 bits (288), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 96/269 (35%), Positives = 124/269 (46%), Gaps = 26/269 (9%)

Query: 236 GDGSYTKGTLALETLTIGRTVVKNVAIGCGHKNQGMFVG-AAGLLGLGGGSMSLVGQLGG 294
            DG  T  T+A++T TI          GC H  +G F G  +G + LGGG  SL  Q   
Sbjct: 153 ADGDPTSQTMAIDT-TID-VPSSXXRFGCSHSVRGRFSGQTSGTMSLGGGRQSLRSQTAS 210

Query: 295 QTGGAFSYCLVSRGTGSSGSLVFGREALPVGAAW----VPLVRNPRAPSFYYVGLSGLGV 350
             G AFSYC+      +SG L  G      G+       PLV     P+FY V L G+ V
Sbjct: 211 AYGDAFSYCVPQ--PSASGFLSLGGAIGSSGSGSGFASTPLVATAN-PTFYVVRLQGIDV 267

Query: 351 GGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPR--ASGV 408
            G R+ +   +F        G +MD+   VT+LP  AY A R AF        R  A G 
Sbjct: 268 AGRRLNVPPAVF------SAGTLMDSSAVVTQLPPTAYRALRRAFRNAMRRYRRVPAGGK 321

Query: 409 SIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSP--SGL 466
            I DTCY+  G  +V VP VS  FSGG V+ L     ++        C AF P+P  S L
Sbjct: 322 QILDTCYDFEGLGNVTVPAVSLVFSGGAVVRLEPMAVMM------EGCLAFVPTPADSDL 375

Query: 467 SIIGNIQQEGIQISFDGANGFVGFGPNVC 495
             IGN+QQ+  ++ +D     VGF    C
Sbjct: 376 GFIGNVQQQTHEVLYDVGARNVGFRRGAC 404


>gi|218186522|gb|EEC68949.1| hypothetical protein OsI_37668 [Oryza sativa Indica Group]
          Length = 421

 Score =  115 bits (288), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 107/371 (28%), Positives = 165/371 (44%), Gaps = 55/371 (14%)

Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQ-PCSQCYKQSDPVFDPADSASFSGVSCS 212
           G Y+V + +G+PPR  ++ +D+GSD+ W+QC  PC  C K   P++ P  +     V C 
Sbjct: 56  GLYYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVSCSKVPHPLYRPTKNKL---VPCV 112

Query: 213 SAVCDRLEN--AGCH-----AGRCRYEVSYGDGSYTKGTLALETLTI----GRTVVKNVA 261
             +C  L     G H       +C YE+ Y D   + G L  ++  +       V   +A
Sbjct: 113 DQMCAALHGGLTGRHKCDSPKQQCDYEIKYADQGSSLGVLVTDSFALRLANSSIVRPGLA 172

Query: 262 IGCGHKNQGMFVGAA-------GLLGLGGGSMSLVGQLG--GQTGGAFSYCLVSRGTGSS 312
            GCG+  Q   VG++       G+LGLG GS+SL+ QL   G T     +CL +RG    
Sbjct: 173 FGCGYDQQ---VGSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCLSTRG---G 226

Query: 313 GSLVFGREALPVG-AAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDG 371
           G L FG + +P   A W P+ R+  + ++Y  G + L  GG  + +     R  +     
Sbjct: 227 GFLFFGDDIVPYSRATWAPMARS-TSRNYYSPGSANLYFGGRPLGV-----RPME----- 275

Query: 372 VVMDTGTAVTRLPTPAYEAFRDAFVAQ-TGNLPRASGVSIFDTCYNLSGFVSV-----RV 425
           VV D+G++ T      Y+A  DA     + NL      S+         F SV       
Sbjct: 276 VVFDSGSSFTYFSAQPYQALVDAIKGDLSKNLKEVPDHSLPLCWKGKKPFKSVLDVKKEF 335

Query: 426 PTVSFYFSGGP--VLTLPASNFLIPVDDAGTFCFAFAPSP----SGLSIIGNIQQEGIQI 479
            TV   FS G   ++ +P  N+LI V   G  C             L+I+G+I  +   +
Sbjct: 336 KTVVLSFSNGKKALMEIPPENYLI-VTKYGNACLGILNGSEVGLKDLNIVGDITMQDQMV 394

Query: 480 SFDGANGFVGF 490
            +D   G +G+
Sbjct: 395 IYDNERGQIGW 405


>gi|449446233|ref|XP_004140876.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 498

 Score =  115 bits (288), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 103/380 (27%), Positives = 163/380 (42%), Gaps = 43/380 (11%)

Query: 147 SGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPA 201
           SG     G Y+ +IG+G+P +  Y+ +D+GSDIVWV C  C +C + S        +D  
Sbjct: 78  SGRPDAVGLYYAKIGIGTPSKDYYVQVDTGSDIVWVNCIQCRECPRTSSLGMELTPYDLE 137

Query: 202 DSASFSGVSCSSAVCDRLEN---AGCHAG-RCRYEVSYGDGSYTKGTLALETLTIGR--- 254
           +S +   VSC    C  +     +GC     C Y   YGDGS T G    + +   R   
Sbjct: 138 ESTTGKLVSCDEQFCLEVNGGPLSGCTTNMSCPYLQIYGDGSSTAGYFVKDYVQYNRVSG 197

Query: 255 ---TVVKN--VAIGCGHKNQGMFVGAA-----GLLGLGGGSMSLVGQLGG--QTGGAFSY 302
              T   N  +  GCG +  G    +      G+LG G  + S++ QL    +    F++
Sbjct: 198 DLETTAANGSIKFGCGARQSGDLGSSGEEALDGILGFGKSNSSIISQLASTRKVKKMFAH 257

Query: 303 CLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLF 362
           CL   GT   G    G    P      PLV  P  P  Y V ++G+ VG + + IS D+F
Sbjct: 258 CL--DGTNGGGIFAMGHVVQP-KVNMTPLV--PNQPH-YNVNMTGVQVGHIILNISADVF 311

Query: 363 RLTQMGD-DGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFV 421
              + GD  G ++D+GT +  LP   YE      ++Q  NL   +    +  C+  S  V
Sbjct: 312 ---EAGDRKGTIIDSGTTLAYLPELIYEPLVAKILSQQHNLEVQTIHGEYK-CFQYSERV 367

Query: 422 SVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPS------PSGLSIIGNIQQE 475
               P V F+F    +L +    +L   ++   +C  +  S         +++ G++   
Sbjct: 368 DDGFPPVIFHFENSLLLKVYPHEYLFQYENL--WCIGWQNSGMQSRDRKNVTLFGDLVLS 425

Query: 476 GIQISFDGANGFVGFGPNVC 495
              + +D  N  +G+    C
Sbjct: 426 NKLVLYDLENQTIGWTEYNC 445


>gi|224074147|ref|XP_002304273.1| predicted protein [Populus trichocarpa]
 gi|222841705|gb|EEE79252.1| predicted protein [Populus trichocarpa]
          Length = 496

 Score =  115 bits (288), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 112/411 (27%), Positives = 163/411 (39%), Gaps = 75/411 (18%)

Query: 155 EYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQP--CSQCYKQSD-----PVFDPADSASFS 207
           +Y +   + S P S Y+  D+GSD+VW  CQP  C  C  +++         P  S + +
Sbjct: 81  DYTLSFTINSQPISLYL--DTGSDLVWFPCQPFECILCEGKAENASLASTPPPKLSKTAT 138

Query: 208 GVSCSSAVC--------------------DRLENAGCHAGRC-RYEVSYGDGSYT----K 242
            VSC S+ C                    + +E + C    C ++  +YGDGS      +
Sbjct: 139 PVSCKSSACSAVHSNLPSSDLCAISNCPLESIEISDCRKHSCPQFYYAYGDGSLIARLYR 198

Query: 243 GTLALETLTIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGG---QTGGA 299
            ++ L        +  N   GC H      +G AG    G G +SL  QL     Q G  
Sbjct: 199 DSIRLPLSNQTNLIFNNFTFGCAHTTLAEPIGVAGF---GRGVLSLPAQLATLSPQLGNQ 255

Query: 300 FSYCLVSRGTGSS-----GSLVFGR-----EALPVGAAWVP------LVRNPRAPSFYYV 343
           FSYCLVS    S        L+ GR     +   V     P      ++ NPR P FY V
Sbjct: 256 FSYCLVSHSFDSDRVRRPSPLILGRYDHDEKERRVNGVKKPSFVYTSMLDNPRHPYFYCV 315

Query: 344 GLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLP 403
           GL G+ +G  +IP  + L ++ + G  GVV+D+GT  T LP   Y+     F  + G + 
Sbjct: 316 GLEGISIGRKKIPAPDFLRKVDRKGSGGVVVDSGTTFTMLPASLYDFVVAEFENRVGRVN 375

Query: 404 RASGVSIFDT----CYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDD-------- 451
             + V   +T    CY     V      V  +   G  + LP  N+     D        
Sbjct: 376 ERASVIEENTGLSPCYYFDNNVVNVPRVVLHFVGNGSSVVLPRRNYFYEFLDGGHGKGKK 435

Query: 452 AGTFCFAFAP--SPSGLS-----IIGNIQQEGIQISFDGANGFVGFGPNVC 495
               C         + LS      +GN QQ+G ++ +D  N  VGF    C
Sbjct: 436 RKVGCLMLMNGGDEAELSGGPGATLGNYQQQGFEVVYDLENRRVGFARRQC 486


>gi|357502759|ref|XP_003621668.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355496683|gb|AES77886.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 481

 Score =  115 bits (288), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 95/374 (25%), Positives = 162/374 (43%), Gaps = 43/374 (11%)

Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFSG 208
           G Y+ +IG+G+P +  Y+ +D+G+D++WV C  C +C  +S+      +++  +S+S   
Sbjct: 71  GLYYAKIGIGTPSKDYYLQVDTGTDMMWVNCIQCKECPTRSNLGMDLTLYNIKESSSGKL 130

Query: 209 VSCSSAVCDRLEN---AGCHA---GRCRYEVSYGDGSYTKGTLALETLTIG------RTV 256
           V C   +C  +      GC +     C Y   YGDGS T G    + +         +T 
Sbjct: 131 VPCDQELCKEINGGLLTGCTSKTNDSCPYLEIYGDGSSTAGYFVKDVVLFDQVSGDLKTA 190

Query: 257 VKN--VAIGCGHKNQGMFV-----GAAGLLGLGGGSMSLVGQL--GGQTGGAFSYCLVSR 307
             N  V  GCG +  G           G+LG G  + S++ QL   G+    F++CL   
Sbjct: 191 SANGSVIFGCGARQSGDLSYSNEEALDGILGFGKANYSMISQLSSSGKVKKMFAHCL--N 248

Query: 308 GTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQM 367
           G    G    G    P      PL+  P  P  Y V ++ + VG   + +S D     Q 
Sbjct: 249 GVNGGGIFAIGHVVQPT-VNTTPLL--PDQP-HYSVNMTAIQVGHTFLNLSTDASE--QR 302

Query: 368 GDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPT 427
              G ++D+GT +  LP   Y+      ++Q  NL +   +    TC+  SG V    P 
Sbjct: 303 DSKGTIIDSGTTLAYLPDGIYQPLVYKILSQQPNL-KVQTLHDEYTCFQYSGSVDDGFPN 361

Query: 428 VSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPS------PSGLSIIGNIQQEGIQISF 481
           V+FYF  G  L +   ++L   ++   +C  +  S         ++++G++      + +
Sbjct: 362 VTFYFENGLSLKVYPHDYLFLSENL--WCIGWQNSGAQSRDSKNMTLLGDLVLSNKLVFY 419

Query: 482 DGANGFVGFGPNVC 495
           D  N  +G+    C
Sbjct: 420 DLENQVIGWTEYNC 433


>gi|357127505|ref|XP_003565420.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
           distachyon]
          Length = 466

 Score =  115 bits (288), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 119/449 (26%), Positives = 186/449 (41%), Gaps = 82/449 (18%)

Query: 79  WNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEV 138
           +++EL+HRD + S         +H  + + H               R       +     
Sbjct: 27  FSVELIHRDSIKSP--------FHDPKLTRH--------------DRFLAAARRSRARAA 64

Query: 139 QDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQ--------- 189
               +DV S +  G  EY   + VG+PP     V D+GSD+VW++C              
Sbjct: 65  ALLASDVSSDLFYGDFEYLAAVNVGTPPVRFLAVADTGSDLVWLKCNTTQNNNGIVSSDS 124

Query: 190 ----------CYKQSDPVFDPADSASFSGVSCSSAVCDRLE-NAGCHAGR--CRYEVSYG 236
                        ++   F+P DS+S+S V C    C  L  NA C+     C +  SY 
Sbjct: 125 GNNSNSSPPPPPPEAVVYFNPFDSSSYSRVGCDGPSCLALATNASCNGDSHACDFRYSYR 184

Query: 237 DGSYTKGTLALETLTIG------RTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVG 290
           DG+   G LA +T T G       T   ++  GC     G    A G++GLG G +SL  
Sbjct: 185 DGASATGLLAADTFTFGGNINNDTTSTASIDFGCATGTAGREFQADGMVGLGAGPLSLAS 244

Query: 291 QLGGQTGGAFSYCLVSRGTGSSGSLV-FGREAL--PVGAAWVPLV-RNPRAPSFYYVGLS 346
           QLG +    FS+CL +     + S++ FG  A+    GAA  PL+  +  A ++Y + + 
Sbjct: 245 QLGRK----FSFCLTAYDIDDASSILNFGARAVVSDPGAATTPLIASSSNAAAYYAISID 300

Query: 347 GLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVT-----RLPTPAYEAFRDAFVAQTGN 401
            L V G  +P +  + +        V++DTGT +T      L  P  E+   A V     
Sbjct: 301 SLKVAGQPVPGTTSVSK--------VIVDTGTVLTFLDRAALLAPLTESL--ARVMDGAG 350

Query: 402 LPRASGV-SIFDTCYNLSGFVSVR--VPTVSFYFSGGPV--LTLPASNFLIPVDDAGTFC 456
           LPRA       + CY++S    V   +P V+    GG    + L      + V + G  C
Sbjct: 351 LPRAPPPDETLELCYDVSRVKDVDGVIPDVTLVLGGGGGGEVRLTGEGTFVLVKE-GVLC 409

Query: 457 FAF---APSPSGLSIIGNIQQEGIQISFD 482
            A    +P    LS++GN+  + + +  D
Sbjct: 410 LAVVTTSPELQPLSVLGNVALQDLHVGID 438


>gi|326515330|dbj|BAK03578.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 445

 Score =  115 bits (288), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 101/363 (27%), Positives = 158/363 (43%), Gaps = 31/363 (8%)

Query: 158 VRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVCD 217
           V IG G    + ++V+D+ S + W++C  C    +Q  PVFDP+DS+S+  +  +S +C 
Sbjct: 78  VTIGTGRGKSTYFLVLDTASSLPWMRCAHCLPVQRQRSPVFDPSDSSSYRPLHPTSPLC- 136

Query: 218 RLENAGCHAG-RCRYEVSYGDGSYTKGTLALETLTIGRTV--VKNVAIGCGHKNQGMFVG 274
           R  N    AG +C + +         G +  +T+ +G     + +VA GC    +G    
Sbjct: 137 RAPNPVLPAGDKCSFHLP----GEAHGYVGTDTIILGNPTLPIHSVAFGCAQSTEGFDTK 192

Query: 275 A--AGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRG--TGSSGSLVFGRE----ALPVGA 326
              AG LG+G    SL+ Q+  + G  FSYCL+  G   G +G + FG +     L V  
Sbjct: 193 GTFAGTLGMGKLPTSLIMQIKDRVGSRFSYCLIGLGHSPGRNGFIRFGADIPDPTLLVHH 252

Query: 327 AWVPLVRNPRAP-----SFYYVGLSGLGVGGMRIP-ISEDLFRLTQMGDDGVVMDTGTAV 380
               L   P  P     S YYV L G+ + G  IP I + +F     G  G  +D GT V
Sbjct: 253 RIKILPTPPHLPHGVADSAYYVKLLGISLNGTPIPGIRQAMFERRSDGSGGCFVDAGTQV 312

Query: 381 TRLPTPAYEAFRDAF--VAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPV- 437
           T L   AY    +A   + Q     R    + F  C+     +   +P ++  F G    
Sbjct: 313 THLVPAAYAVVEEAVAHMVQQWGYKRVRDPN-FSLCFREHPGIWSHIPKLTLDFEGPASR 371

Query: 438 ----LTLPASNFLIPVDDAGTFCF-AFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGP 492
               L + + N  + VD+    CF  +  S    +++G +QQ   +  FD     + F  
Sbjct: 372 TVAHLEIVSRNLFLKVDNQPLVCFGVYRTSRGSPTVVGAMQQVDTRFIFDLHANTITFHR 431

Query: 493 NVC 495
             C
Sbjct: 432 ESC 434


>gi|224079535|ref|XP_002305886.1| predicted protein [Populus trichocarpa]
 gi|222848850|gb|EEE86397.1| predicted protein [Populus trichocarpa]
          Length = 436

 Score =  115 bits (288), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 104/367 (28%), Positives = 158/367 (43%), Gaps = 40/367 (10%)

Query: 157 FVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVC 216
            V + +G+PP+SQ M++D+GS + W+QC            VFDP+ S+SFS + C+  +C
Sbjct: 78  LVSLPIGTPPQSQQMILDTGSQLSWIQCHKKVPRKPPPSTVFDPSLSSSFSVLPCNHPLC 137

Query: 217 -----DRLENAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRT-VVKNVAIGCGHKNQ 269
                D      C   R C Y   Y DG+  +G L  E +T   +     + +GC     
Sbjct: 138 KPRIPDFTLPTSCDLNRLCHYSYFYADGTLAEGNLVREKITFSTSQSTPPLILGCAEDAS 197

Query: 270 GMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR----GTGSSGSLVFGREALPVG 325
                  G+LG+  G +S   Q        FSYC+ +R    G   +GS   G      G
Sbjct: 198 ----DDKGILGMNLGRLSFASQ---AKITKFSYCVPTRQVRPGFTPTGSFYLGENPNSAG 250

Query: 326 AAWVPLV---RNPRAPSF----YYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGT 378
             ++ L+   ++ R P+     + V L G+ +G  ++ I    FR    G    ++D+G+
Sbjct: 251 FQYISLLTFSQSQRMPNLDPLAHTVALQGIRIGNKKLNIPVSAFRADPSGAGQSMIDSGS 310

Query: 379 AVTRLPTPAYEAFRDAFVAQTGNLPRA------SGVSIFDTCYNLSGFVSVR-VPTVSFY 431
             T L   AY   R+  V   G  PR       SGVS  D C++ +     R +  + F 
Sbjct: 311 EFTYLVDVAYNKVREEVVRLAG--PRLKKGYVYSGVS--DMCFDGNAMEIGRLIGNMVFE 366

Query: 432 FSGGPVLTLPASNFLIPVDDAGTFCFAFAPSP---SGLSIIGNIQQEGIQISFDGANGFV 488
           F  G  + +     L  V   G  C     S    +  +IIGN  Q+ + + FD AN  V
Sbjct: 367 FDKGVEIVIEKGRVLADV-GGGVHCVGIGRSEMLGAASNIIGNFHQQNLWVEFDIANRRV 425

Query: 489 GFGPNVC 495
           GFG   C
Sbjct: 426 GFGKADC 432


>gi|115446115|ref|NP_001046837.1| Os02g0473200 [Oryza sativa Japonica Group]
 gi|47497549|dbj|BAD19621.1| putative aspartic proteinase nepenthesin [Oryza sativa Japonica
           Group]
 gi|47847591|dbj|BAD21978.1| putative aspartic proteinase nepenthesin [Oryza sativa Japonica
           Group]
 gi|113536368|dbj|BAF08751.1| Os02g0473200 [Oryza sativa Japonica Group]
          Length = 494

 Score =  115 bits (287), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 107/380 (28%), Positives = 166/380 (43%), Gaps = 44/380 (11%)

Query: 147 SGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPA 201
           SG+   +G YF RIG+G+P +  Y+ +D+GSDI+WV C  C  C ++S+      ++DP 
Sbjct: 81  SGLATETGLYFTRIGIGTPAKRYYVQVDTGSDILWVNCVSCDGCPRKSNLGIELTMYDPR 140

Query: 202 DSASFSGVSCSSAVCDRLENAG------CHAGRCRYEVSYGDGSYTKGTLALETLTI--- 252
            S S   V+C    C  + N G           C Y +SYGDGS T G    + L     
Sbjct: 141 GSQSGELVTCDQQFC--VANYGGVLPSCTSTSPCEYSISYGDGSSTAGFFVTDFLQYNQV 198

Query: 253 ---GRTVVKN--VAIGCGHKNQGMF----VGAAGLLGLGGGSMSLVGQL--GGQTGGAFS 301
              G+T   N  V+ GCG K  G      +   G+LG G  + S++ QL   G+    F+
Sbjct: 199 SGDGQTTPANASVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRKMFA 258

Query: 302 YCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDL 361
           +CL +   G  G    G    P      PLV  P  P  Y V L G+ VGG  + +  ++
Sbjct: 259 HCLDTVNGG--GIFAIGNVVQP-KVKTTPLV--PDMPH-YNVILKGIDVGGTALGLPTNI 312

Query: 362 FRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFV 421
           F        G ++D+GT +  +P   Y+A   A V           +  F +C+  SG V
Sbjct: 313 FD--SGNSKGTIIDSGTTLAYVPEGVYKALF-AMVFDKHQDISVQTLQDF-SCFQYSGSV 368

Query: 422 SVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAF------APSPSGLSIIGNIQQE 475
               P V+F+F G   L +   ++L   +    +C  F            + ++G++   
Sbjct: 369 DDGFPEVTFHFEGDVSLIVSPHDYLFQ-NGKNLYCMGFQNGGVQTKDGKDMVLLGDLVLS 427

Query: 476 GIQISFDGANGFVGFGPNVC 495
              + +D  N  +G+    C
Sbjct: 428 NKLVLYDLENQAIGWADYNC 447


>gi|384252236|gb|EIE25712.1| acid protease [Coccomyxa subellipsoidea C-169]
          Length = 599

 Score =  115 bits (287), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 105/374 (28%), Positives = 163/374 (43%), Gaps = 41/374 (10%)

Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQ-C-YKQSDPVFDPADSASFSGVSC 211
           G ++  + +G+P R   +++D+GS I +V C  C + C     D  FDPA S+S + + C
Sbjct: 60  GYFYATLHLGTPARQFAVIVDTGSTITYVPCASCGRNCGPHHKDAAFDPASSSSSAVIGC 119

Query: 212 SS--AVCDRLENAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGCGHKN 268
            S   +C R    GC   R C Y+ +Y + S + G L  + L + R     V  GC  K 
Sbjct: 120 DSDKCICGR-PPCGCSEKRECTYQRTYAEQSSSAGLLVSDQLQL-RDGAVEVVFGCETKE 177

Query: 269 QGMFVG--AAGLLGLGGGSMSLVGQLGGQ--TGGAFSYCLVSRGTGSSGSLVFGR---EA 321
            G      A G+LGLG   +SLV QL G       F+ C  S      G+L+ G      
Sbjct: 178 TGEIYNQEADGILGLGNSEVSLVNQLAGSGVIDDVFALCFGS--VEGDGALMLGDVDAAE 235

Query: 322 LPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVT 381
             V   +  L+ +   P +Y V L  L VGG ++P+  + +        G V+D+GT  T
Sbjct: 236 YDVALQYTALLSSLAHPHYYSVQLEALWVGGQQLPVKPERYEEGY----GTVLDSGTTFT 291

Query: 382 RLPTPAYEAFRDAFVAQT---------GNLPRASGVSIF-DTCY---------NLSGFVS 422
            LP+ A++ F++A  A           G  P+    + F D C+         + S    
Sbjct: 292 YLPSEAFQLFKEAVSAYALEHGLNSVKGPDPKEKSFAQFHDICFGGAPHAGHADQSKLEK 351

Query: 423 VRVPTVSFYFSGGPVL-TLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISF 481
           V  P     F+ G  L T P +   +   + G +C     + +  +++G I    I + +
Sbjct: 352 V-FPVFELQFADGVRLRTGPLNYLFMHTGEMGAYCLGVFDNGASGTLLGGISFRNILVQY 410

Query: 482 DGANGFVGFGPNVC 495
           D  N  VGFG   C
Sbjct: 411 DRRNRRVGFGAASC 424


>gi|115487628|ref|NP_001066301.1| Os12g0177500 [Oryza sativa Japonica Group]
 gi|108862256|gb|ABA96613.2| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113648808|dbj|BAF29320.1| Os12g0177500 [Oryza sativa Japonica Group]
 gi|215693997|dbj|BAG89196.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 421

 Score =  115 bits (287), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 107/371 (28%), Positives = 165/371 (44%), Gaps = 55/371 (14%)

Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQ-PCSQCYKQSDPVFDPADSASFSGVSCS 212
           G Y+V + +G+PPR  ++ +D+GSD+ W+QC  PC  C K   P++ P  +     V C 
Sbjct: 56  GLYYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVSCSKVPHPLYRPTKNKL---VPCV 112

Query: 213 SAVCDRLEN--AGCH-----AGRCRYEVSYGDGSYTKGTLALETLTI----GRTVVKNVA 261
             +C  L     G H       +C YE+ Y D   + G L  ++  +       V   +A
Sbjct: 113 DQMCAALHGGLTGRHKCDSPKQQCDYEIKYADQGSSLGVLVTDSFALRLANSSIVRPGLA 172

Query: 262 IGCGHKNQGMFVGAA-------GLLGLGGGSMSLVGQLG--GQTGGAFSYCLVSRGTGSS 312
            GCG+  Q   VG++       G+LGLG GS+SL+ QL   G T     +CL +RG    
Sbjct: 173 FGCGYDQQ---VGSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCLSTRG---G 226

Query: 313 GSLVFGREALPVG-AAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDG 371
           G L FG + +P   A W P+ R+  + ++Y  G + L  GG  + +     R  +     
Sbjct: 227 GFLFFGDDIVPYSRATWAPMARS-TSRNYYSPGSANLYFGGRPLGV-----RPME----- 275

Query: 372 VVMDTGTAVTRLPTPAYEAFRDAFVAQ-TGNLPRASGVSIFDTCYNLSGFVSV-----RV 425
           VV D+G++ T      Y+A  DA     + NL      S+         F SV       
Sbjct: 276 VVFDSGSSFTYFSAQPYQALVDAIKGDLSKNLKEVPDHSLPLCWKGKKPFKSVLDVKKEF 335

Query: 426 PTVSFYFSGGP--VLTLPASNFLIPVDDAGTFCFAFAPSP----SGLSIIGNIQQEGIQI 479
            TV   FS G   ++ +P  N+LI V   G  C             L+I+G+I  +   +
Sbjct: 336 RTVVLSFSNGKKALMEIPPENYLI-VTKYGNACLGILNGSEVGLKDLNIVGDITMQDQMV 394

Query: 480 SFDGANGFVGF 490
            +D   G +G+
Sbjct: 395 IYDNERGQIGW 405


>gi|255571584|ref|XP_002526738.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223533927|gb|EEF35652.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 457

 Score =  115 bits (287), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 98/367 (26%), Positives = 152/367 (41%), Gaps = 30/367 (8%)

Query: 146 VSGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQC--QPCSQCYKQSDPVFDPADS 203
           +S M      Y ++  +GSP    Y + DSGS +VW+QC    C  CY+Q  P+F+P+ S
Sbjct: 91  ISRMSYTDKAYVMKFSIGSPAVDTYAIPDSGSSLVWLQCGTPYCRNCYRQKIPLFNPSKS 150

Query: 204 ASFSGVSCSSAVC-----DRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTV-- 256
            ++    C++A C     D           C+Y   Y D SYT+G ++ +  T    +  
Sbjct: 151 VTYMKRLCNTAECRVALGDEYWRCKKPNQICKYHEDYLDDSYTEGVISTDIFTFPEHISG 210

Query: 257 ----VKNVAIGCGHKNQG-MFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCL---VSRG 308
                  +  GCG+ N         GL+GL     SLVGQ+       FSYC+     + 
Sbjct: 211 FGNYTLRIIFGCGYNNSDPQHFYPPGLVGLTNNKASLVGQMDVD---QFSYCVSIDTEQN 267

Query: 309 TGSSGSLVFGREALPVGAAWVPLVRNPRAPSFY-YVGLSGLGVGGMRIP-ISEDLFRLTQ 366
              S  + FG  A   G +   LV  P +  +Y +  + G+ V    +      +F+ T+
Sbjct: 268 LKGSMEIRFGLAASISGHS-TQLV--PNSDGWYIFKNVDGIYVNEFEVEGYPAWVFKYTE 324

Query: 367 MGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRAS-GVSIFDTCYNLSGFVSVRV 425
            G  G+ MDTGT  T L     +            +P      S F+ CY    F+   +
Sbjct: 325 GGQGGLTMDTGTTYTELHNSVMDPLIKLLEEHITIVPEKDYSNSGFELCYFSDDFLGATL 384

Query: 426 PTVSFYFSGGP--VLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQISFDG 483
           P +   F+       +    N   P +     C A   + +G+SIIG  Q   I+I +D 
Sbjct: 385 PDIELRFTDNKDTYFSFNTRNAWTP-NGRSQMCLAMFRT-NGMSIIGMHQLRDIKIGYDL 442

Query: 484 ANGFVGF 490
            +  V F
Sbjct: 443 HHNIVSF 449


>gi|297840891|ref|XP_002888327.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297334168|gb|EFH64586.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 473

 Score =  115 bits (287), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 95/374 (25%), Positives = 167/374 (44%), Gaps = 51/374 (13%)

Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFSG 208
           G YF +I +GSPP+  ++ +D+GSDI+WV C+PC +C  +++      +FD   S++   
Sbjct: 72  GLYFTKIKLGSPPKEYHVQVDTGSDILWVNCKPCPECPSKTNLNFHLSLFDVNASSTSKK 131

Query: 209 VSCSSAVCDRL-ENAGCH-AGRCRYEVSYGDGSYTKGTLALETLTIGRT--------VVK 258
           V C    C  + ++  C  A  C Y + Y D S ++G    + LT+ +         + +
Sbjct: 132 VGCDDDFCSFISQSDSCQPAVGCSYHIVYADESTSEGNFIRDKLTLEQVTGDLQTGPLGQ 191

Query: 259 NVAIGCGHKNQGMF----VGAAGLLGLGGGSMSLVGQLG--GQTGGAFSYCLVSRGTGSS 312
            V  GCG    G          G++G G  + S++ QL   G     FS+CL +      
Sbjct: 192 EVVFGCGSDQSGQLGKSDSAVDGVMGFGQSNTSVLSQLAATGDAKRVFSHCLDN------ 245

Query: 313 GSLVFGREALPVGAAWVPLVR-NPRAPS--FYYVGLSGLGVGGMRIPISEDLFRLTQMGD 369
              V G     VG    P V+  P  P+   Y V L G+ V G  + +   + R     +
Sbjct: 246 ---VKGGGIFAVGVVDSPKVKTTPMVPNQMHYNVMLMGMDVDGTALDLPPSIMR-----N 297

Query: 370 DGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDT--CYNLSGFVSVRVPT 427
            G ++D+GT +   P   Y++  +  +A+    P    + + DT  C++ S  V V  P 
Sbjct: 298 GGTIVDSGTTLAYFPKVLYDSLIETILARQ---PVKLHI-VEDTFQCFSFSENVDVAFPP 353

Query: 428 VSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAP------SPSGLSIIGNIQQEGIQISF 481
           VSF F     LT+   ++L  ++    +CF +          + + ++G++      + +
Sbjct: 354 VSFEFEDSVKLTVYPHDYLFTLEKE-LYCFGWQAGGLTTGERTEVILLGDLVLSNKLVVY 412

Query: 482 DGANGFVGFGPNVC 495
           D  N  +G+  + C
Sbjct: 413 DLENEVIGWADHNC 426


>gi|224030719|gb|ACN34435.1| unknown [Zea mays]
          Length = 216

 Score =  115 bits (287), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 75/215 (34%), Positives = 102/215 (47%), Gaps = 5/215 (2%)

Query: 286 MSLVGQLGGQTGGAFSYCLVS-RGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVG 344
           MSL+ Q G +  G FSYCL S R    SGSL  G    P      PL+ NP  PS YYV 
Sbjct: 1   MSLLSQTGSRYNGVFSYCLPSYRSYYFSGSLRLGAAGQPRNVRHTPLLTNPHRPSLYYVN 60

Query: 345 LSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPR 404
           ++GL VG   + +    F        G V+D+GT +TR   P Y A R+ F  Q      
Sbjct: 61  VTGLSVGRTWVKVPAGSFAFDPATGAGTVIDSGTVITRWTAPVYAALREEFRRQVAAPSG 120

Query: 405 ASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPS 464
            + +  FDTC+N     +   P V+ +  GG  LTLP  N LI        C A A +P 
Sbjct: 121 YTSLGAFDTCFNTDEVAAGGAPPVTLHMDGGVDLTLPMENTLIHSSATPLACLAMAEAPQ 180

Query: 465 ----GLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
                ++++ N+QQ+ +++  D A   VGF    C
Sbjct: 181 NVNAVVNVVANLQQQNVRVVVDVAGSRVGFAREPC 215


>gi|108862257|gb|ABA96612.2| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
          Length = 451

 Score =  115 bits (287), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 107/371 (28%), Positives = 165/371 (44%), Gaps = 55/371 (14%)

Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQ-PCSQCYKQSDPVFDPADSASFSGVSCS 212
           G Y+V + +G+PPR  ++ +D+GSD+ W+QC  PC  C K   P++ P  +     V C 
Sbjct: 56  GLYYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVSCSKVPHPLYRPTKNKL---VPCV 112

Query: 213 SAVCDRLEN--AGCH-----AGRCRYEVSYGDGSYTKGTLALETLTI----GRTVVKNVA 261
             +C  L     G H       +C YE+ Y D   + G L  ++  +       V   +A
Sbjct: 113 DQMCAALHGGLTGRHKCDSPKQQCDYEIKYADQGSSLGVLVTDSFALRLANSSIVRPGLA 172

Query: 262 IGCGHKNQGMFVGAA-------GLLGLGGGSMSLVGQLG--GQTGGAFSYCLVSRGTGSS 312
            GCG+  Q   VG++       G+LGLG GS+SL+ QL   G T     +CL +RG    
Sbjct: 173 FGCGYDQQ---VGSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCLSTRG---G 226

Query: 313 GSLVFGREALPVG-AAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDG 371
           G L FG + +P   A W P+ R+  + ++Y  G + L  GG  + +     R  +     
Sbjct: 227 GFLFFGDDIVPYSRATWAPMARS-TSRNYYSPGSANLYFGGRPLGV-----RPME----- 275

Query: 372 VVMDTGTAVTRLPTPAYEAFRDAFVAQ-TGNLPRASGVSIFDTCYNLSGFVSV-----RV 425
           VV D+G++ T      Y+A  DA     + NL      S+         F SV       
Sbjct: 276 VVFDSGSSFTYFSAQPYQALVDAIKGDLSKNLKEVPDHSLPLCWKGKKPFKSVLDVKKEF 335

Query: 426 PTVSFYFSGGP--VLTLPASNFLIPVDDAGTFCFAFAPSP----SGLSIIGNIQQEGIQI 479
            TV   FS G   ++ +P  N+LI V   G  C             L+I+G+I  +   +
Sbjct: 336 RTVVLSFSNGKKALMEIPPENYLI-VTKYGNACLGILNGSEVGLKDLNIVGDITMQDQMV 394

Query: 480 SFDGANGFVGF 490
            +D   G +G+
Sbjct: 395 IYDNERGQIGW 405


>gi|222622847|gb|EEE56979.1| hypothetical protein OsJ_06707 [Oryza sativa Japonica Group]
          Length = 494

 Score =  114 bits (286), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 102/338 (30%), Positives = 152/338 (44%), Gaps = 38/338 (11%)

Query: 147 SGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPA 201
           SG+   +G YF RIG+G+P +  Y+ +D+GSDI+WV C  C  C ++S+      ++DP 
Sbjct: 81  SGLATETGLYFTRIGIGTPAKRYYVQVDTGSDILWVNCVSCDGCPRKSNLGIELTMYDPR 140

Query: 202 DSASFSGVSCSSAVCDRLENAG------CHAGRCRYEVSYGDGSYTKGTLALETLTI--- 252
            S S   V+C    C  + N G           C Y +SYGDGS T G    + L     
Sbjct: 141 GSQSGELVTCDQQFC--VANYGGVLPSCTSTSPCEYSISYGDGSSTAGFFVTDFLQYNQV 198

Query: 253 ---GRTVVKN--VAIGCGHKNQGMF----VGAAGLLGLGGGSMSLVGQL--GGQTGGAFS 301
              G+T   N  V+ GCG K  G      +   G+LG G  + S++ QL   G+    F+
Sbjct: 199 SGDGQTTPANASVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRKMFA 258

Query: 302 YCLVSRGTGSSGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDL 361
           +CL +   G  G    G    P      PLV  P  P  Y V L G+ VGG  + +  ++
Sbjct: 259 HCLDTVNGG--GIFAIGNVVQPK-VKTTPLV--PDMPH-YNVILKGIDVGGTALGLPTNI 312

Query: 362 FRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFV 421
           F        G ++D+GT +  +P   Y+A   A V           +  F +C+  SG V
Sbjct: 313 FD--SGNSKGTIIDSGTTLAYVPEGVYKALF-AMVFDKHQDISVQTLQDF-SCFQYSGSV 368

Query: 422 SVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAF 459
               P V+F+F G   L +   ++L   +    +C  F
Sbjct: 369 DDGFPEVTFHFEGDVSLIVSPHDYLFQ-NGKNLYCMGF 405


>gi|18390865|ref|NP_563808.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|11993877|gb|AAG42922.1|AF329505_1 unknown protein [Arabidopsis thaliana]
 gi|20260142|gb|AAM12969.1| unknown protein [Arabidopsis thaliana]
 gi|22136092|gb|AAM91124.1| unknown protein [Arabidopsis thaliana]
 gi|332190140|gb|AEE28261.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 492

 Score =  114 bits (286), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 108/367 (29%), Positives = 164/367 (44%), Gaps = 33/367 (8%)

Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPADSASFSG 208
           G Y+ ++ +G+PPR   + ID+GSD++WV C  C+ C K S+       FDP  S+S S 
Sbjct: 82  GLYYTKVKLGTPPREFNVQIDTGSDVLWVSCTSCNGCPKTSELQIQLSFFDPGVSSSASL 141

Query: 209 VSCSSAVC--DRLENAGCHAGR-CRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAI--- 262
           VSCS   C  +    +GC     C Y   YGDGS T G    + ++    +   +AI   
Sbjct: 142 VSCSDRRCYSNFQTESGCSPNNLCSYSFKYGDGSGTSGYYISDFMSFDTVITSTLAINSS 201

Query: 263 -----GCGHKNQGMFV----GAAGLLGLGGGSMSLVGQLG--GQTGGAFSYCLVSRGTGS 311
                GC +   G          G+ GLG GS+S++ QL   G     FS+CL    +G 
Sbjct: 202 APFVFGCSNLQSGDLQRPRRAVDGIFGLGQGSLSVISQLAVQGLAPRVFSHCLKGDKSG- 260

Query: 312 SGSLVFGREALPVGAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDG 371
            G +V G+   P    + PLV  P  P  Y V L  + V G  +PI   +F +     DG
Sbjct: 261 GGIMVLGQIKRP-DTVYTPLV--PSQPH-YNVNLQSIAVNGQILPIDPSVFTIAT--GDG 314

Query: 372 VVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVPTVSFY 431
            ++DTGT +  LP  AY  F  A         R      +  C+ ++       P VS  
Sbjct: 315 TIIDTGTTLAYLPDEAYSPFIQAVANAVSQYGRPITYESYQ-CFEITAGDVDVFPQVSLS 373

Query: 432 FSGGPVLTLPASNFLIPVDDAGT--FCFAFAP-SPSGLSIIGNIQQEGIQISFDGANGFV 488
           F+GG  + L    +L     +G+  +C  F   S   ++I+G++  +   + +D     +
Sbjct: 374 FAGGASMVLGPRAYLQIFSSSGSSIWCIGFQRMSHRRITILGDLVLKDKVVVYDLVRQRI 433

Query: 489 GFGPNVC 495
           G+    C
Sbjct: 434 GWAEYDC 440


>gi|168022164|ref|XP_001763610.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162685103|gb|EDQ71500.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 308

 Score =  114 bits (286), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 96/308 (31%), Positives = 139/308 (45%), Gaps = 44/308 (14%)

Query: 104 HQHSFHARMQRDVKRVATLVRRLSGGGADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVG 163
           H H+     QR ++R+   V      G +           D+ +      G Y+ RI +G
Sbjct: 5   HYHTLRKHDQRRLRRMLPEVVSFPISGDN-----------DIFA-----MGLYYTRISLG 48

Query: 164 SPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-PV----FDPADSASFSGVSCSSAVCDR 218
           +PP+  Y+ +D+GS++ WV+C PC+ C    D PV    FDP  S +   +SC+ A C  
Sbjct: 49  TPPQQFYVDVDTGSNVAWVKCAPCTGCEHSGDVPVPMSTFDPRKSTTKISISCTDAECGV 108

Query: 219 L-ENAGCHAGR--CRYEVSYGDGSYTKGTLALETLTIGRTVVKN---------VAIGCGH 266
           L +   C   R  C Y + YGDGS T G    +  T  +    N         +  GCG 
Sbjct: 109 LNKKLQCSPERLSCPYSLLYGDGSSTAGYYLNDVFTFNQVPSDNSTAKSGTARLVFGCGG 168

Query: 267 KNQGMFVGAAGLLGLGGGSMSLVGQLGGQ--TGGAFSYCLVSRGTGSSGSLVFGREALPV 324
              G +    GLLG G  ++SL  QL  Q  +   F++CL    +G  GSLV G    P 
Sbjct: 169 TQTGSW-SVDGLLGFGPTTVSLPNQLAQQNISVNIFAHCLQGDVSG-RGSLVIGTIREP- 225

Query: 325 GAAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLP 384
              + P+V        Y V L  +G+ G  +      F L   G  GV++D+GT +T L 
Sbjct: 226 DLVYTPMV---FGEDHYNVQLLNIGISGRNVTTPAS-FDLEYTG--GVIIDSGTTLTYLV 279

Query: 385 TPAYEAFR 392
            PAY+ FR
Sbjct: 280 QPAYDEFR 287


>gi|297734190|emb|CBI15437.3| unnamed protein product [Vitis vinifera]
          Length = 473

 Score =  114 bits (286), Expect = 9e-23,   Method: Compositional matrix adjust.
 Identities = 108/415 (26%), Positives = 174/415 (41%), Gaps = 48/415 (11%)

Query: 117 KRVATLVRRLSGGGADAAKHEVQDFGTDV---VSGMDQGSGEYFVRIGVGSPPRSQYMVI 173
           K V   V  +  GG +     V  F +     V G    +G YF  I VGSPPR  ++ +
Sbjct: 59  KFVDFHVNDMKPGGINKLATSVSAFDSSTIFPVRGDVYPNGLYFTHIFVGSPPRRYFLDM 118

Query: 174 DSGSDIVWVQCQ-PCSQCYKQSDPVFDPADSASFSGVSCSSAVC----DRLENAGCHA-G 227
           D+GSD+ W+QC  PC+ C K  +P++ P      + V    ++C      L+   C    
Sbjct: 119 DTGSDLTWIQCDAPCTSCAKGPNPLYKPKKG---NLVPLKDSLCVEVQRNLKTGYCETCE 175

Query: 228 RCRYEVSYGDGSYTKGTLALETLTI----GRTVVKNVAIGCGHKNQGMFVGAA----GLL 279
           +C YE+ Y D S + G LA + L +    G      +  GC +  QG+ + +     G+L
Sbjct: 176 QCDYEIEYADHSSSMGVLASDDLHLMLANGSLTKLGIMFGCAYDQQGLLLNSLAKTDGIL 235

Query: 280 GLGGGSMSLVGQLGGQ--TGGAFSYCLVSRGTGSSGSLVFGREALPV-GAAWVPLVRNPR 336
           GL    +SL  QL  Q        +CL S  TG  G +  G + +P  G AWVP++ N  
Sbjct: 236 GLSKAKVSLPSQLASQRIINNVLGHCLTSDATG-GGYMFLGDDFVPYWGMAWVPML-NSH 293

Query: 337 APSFYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAF----- 391
           +P+ Y+  +  +  G  ++ +     R  +     VV DTG++ T  P  AY A      
Sbjct: 294 SPN-YHSQIMKISHGSRQLSLGRQDGRTER-----VVFDTGSSYTYFPKEAYYALVASLK 347

Query: 392 --RDAFVAQTGNLPRA-----SGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASN 444
              D  + Q G+ P       +   I         F  + +   S ++       +P   
Sbjct: 348 DVSDEGLIQDGSDPTLPVCWRAKFPIRSVIDVKQFFQPLTLQFRSKWWIVSTKFRIPPEG 407

Query: 445 FLIPVDDAGTFCFAFAPSPS----GLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
           +LI + + G  C       +       I+G+I   G  + +D  N  +G+  + C
Sbjct: 408 YLI-ISNKGNVCLGILDGSNVHDGSTIILGDISLRGKLVVYDNVNQKIGWAQSTC 461


>gi|449442281|ref|XP_004138910.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
 gi|449506266|ref|XP_004162699.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 482

 Score =  114 bits (286), Expect = 9e-23,   Method: Compositional matrix adjust.
 Identities = 101/375 (26%), Positives = 166/375 (44%), Gaps = 45/375 (12%)

Query: 153 SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFD-----PADSASFS 207
           SG YF +IG+G+P +  Y+ +D+GSDI+WV C  C+ C K+SD   +     P+ S++ +
Sbjct: 71  SGLYFAKIGLGTPVQDYYVQVDTGSDILWVNCAGCTNCPKKSDLGIELSLYSPSSSSTSN 130

Query: 208 GVSCSSAVCDRLENA---GCHAG-RCRYEVSYGDGSYTKGTLALETLTIGR------TVV 257
            V+C+   C    +    GC     C Y V+YGDGS T G    + + + R      T  
Sbjct: 131 RVTCNQDFCTSTYDGPIPGCTPELLCEYRVAYGDGSSTAGYFVRDHVVLDRVTGNFQTTS 190

Query: 258 KN--VAIGCGHKNQGMF----VGAAGLLGLGGGSMSLVGQLG--GQTGGAFSYCLVSRGT 309
            N  +  GCG +  G          G+LG G  + S++ QL   G+    F++CL +   
Sbjct: 191 TNGSIVFGCGAQQSGQLGATSAALDGILGFGQANSSMISQLASSGKVKRVFAHCLDN--- 247

Query: 310 GSSGSLVFGREALPVGAAWVPLVR-NPRAP--SFYYVGLSGLGVGGMRIPISEDLFRLTQ 366
                 + G     +G    P VR  P  P  + Y V +  + V    + +  D+F    
Sbjct: 248 ------INGGGIFAIGEVVQPKVRTTPLVPQQAHYNVFMKAIEVDNEVLNLPTDVFDTDL 301

Query: 367 MGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSGFVSVRVP 426
               G ++D+GT +   P   YE       A+   L   +    F TC+   G V    P
Sbjct: 302 R--KGTIIDSGTTLAYFPDVIYEPLISKIFARQSTLKLHTVEEQF-TCFEYDGNVDDGFP 358

Query: 427 TVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAF----APSPSG--LSIIGNIQQEGIQIS 480
           TV+F+F     LT+    +L  + D+  +C  +    A S  G  + ++G++  +   + 
Sbjct: 359 TVTFHFEDSLSLTVYPHEYLFDI-DSNKWCVGWQNSGAQSRDGKDMILLGDLVLQNRLVM 417

Query: 481 FDGANGFVGFGPNVC 495
           +D  N  +G+    C
Sbjct: 418 YDLENQTIGWTEYNC 432


>gi|414881575|tpg|DAA58706.1| TPA: hypothetical protein ZEAMMB73_168363 [Zea mays]
          Length = 506

 Score =  114 bits (286), Expect = 9e-23,   Method: Compositional matrix adjust.
 Identities = 108/392 (27%), Positives = 175/392 (44%), Gaps = 54/392 (13%)

Query: 148 GMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSD-----PVFDPAD 202
           G+   +G YF  I +G+PP+  Y+ +D+GSDI+WV C  CS+C ++S        +DP  
Sbjct: 79  GLPTDTGLYFTEIKLGTPPKRYYVQVDTGSDILWVNCISCSKCPRKSGLGLDLTFYDPKA 138

Query: 203 SASFSGVSCSSAVCDRL---ENAGCHAG-RCRYEVSYGDGSYTKGTLALETLTI------ 252
           S+S S VSC    C      +  GC A   C Y V YGDGS T G    + L        
Sbjct: 139 SSSGSTVSCDQGFCAATYGGKLPGCTANVPCEYSVMYGDGSSTTGFFITDALQFDQVTGD 198

Query: 253 GRTVVKNVAI--GCGHKNQGMFVGAA-----GLLGLGGGSMSLVGQL--GGQTGGAFSYC 303
           G+T   N  I  GCG + QG  +G +     G+LG G  + S++ QL   G+    F++C
Sbjct: 199 GQTQPGNATITFGCGAQ-QGGDLGNSNQALDGILGFGQANTSMLSQLAAAGKAKKIFAHC 257

Query: 304 LVS-RGTG--SSGSLV-----------FGREALPVGAAWVPLVRNPRAPSFYYVGLSGLG 349
           L + +G G  + G++V            G   +P+    + L+  P     Y V L  + 
Sbjct: 258 LDTIKGGGIFAIGNVVQPKCYFVFFFAHGLLNIPLFLLVMILLSRPH----YNVNLKSID 313

Query: 350 VGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVS 409
           VGG  + +   +F   +    G ++D+GT +T LP   ++   D   ++  ++   +   
Sbjct: 314 VGGTTLQLPAHVFETGE--KKGTIIDSGTTLTYLPELVFKQVMDVVFSKHRDIAFHNLQD 371

Query: 410 IFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFA----PSPSG 465
               C+  SG V    PT++F+F     L +    +  P +    +C  F      S  G
Sbjct: 372 FL--CFQYSGSVDDGFPTITFHFEDDLALHVYPHEYFFP-NGNDIYCVGFQNGALQSKDG 428

Query: 466 LSII--GNIQQEGIQISFDGANGFVGFGPNVC 495
             I+  G++      + +D  N  +G+    C
Sbjct: 429 KDIVLMGDLVLSNKLVVYDLENQVIGWTDYNC 460


>gi|297740190|emb|CBI30372.3| unnamed protein product [Vitis vinifera]
          Length = 445

 Score =  114 bits (285), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 104/354 (29%), Positives = 155/354 (43%), Gaps = 47/354 (13%)

Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSS 213
           G Y V +  G+P ++   V+D+GS +VW  C     C + S P  DPA   +F     SS
Sbjct: 104 GGYSVSLSFGTPSQTLSFVMDTGSSLVWFPCTSRYVCTRCSFPNIDPAKIPTFIPKLSSS 163

Query: 214 A------------VCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTIGRTVVKNVA 261
           A            V D   +A C      Y + YG G+ T G L LE+L        +  
Sbjct: 164 AKIVGCLNPKCGFVMDSENSANCTKACPTYAIQYGLGT-TVGLLLLESLVFAERTEPDFV 222

Query: 262 IGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR-----GTGSSGSLV 316
           +GC   +       +G+ G G G  SL  Q+G +    FSYCL+S         S  +L 
Sbjct: 223 VGCSILSSRQ---PSGIAGFGRGPSSLPKQMGLK---KFSYCLLSHRFDDSPKSSKMTLY 276

Query: 317 FG---REALPVGAAWVPLVRNPRA-----PSFYYVGLSGLGVGGMRIPISEDLFRLTQMG 368
            G   ++    G ++ P  +NP +       +YYV L  + VG  R+ +          G
Sbjct: 277 VGPDSKDDKTGGLSYTPFRKNPVSSNSAFKEYYYVTLRHIIVGDKRVKVPYSFMVAGSDG 336

Query: 369 DDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGV---SIFDTCYNLSGFVSVRV 425
           + G ++D+G+  T +  P +EA    F  Q  N  RA+ V   S    C+NLSG  SV +
Sbjct: 337 NGGTIVDSGSTFTFMEKPVFEAVATEFDRQMANYTRAADVEALSGLKPCFNLSGVGSVAL 396

Query: 426 PTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPSGLSIIGNIQQEGIQI 479
           P++ F F GG  + LP +N+   V D    C         L+I+ N   E ++I
Sbjct: 397 PSLVFQFKGGAKMELPVANYFSLVGDLSVLC---------LTIVSN---EAVEI 438


>gi|356499109|ref|XP_003518386.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 428

 Score =  114 bits (285), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 106/367 (28%), Positives = 162/367 (44%), Gaps = 40/367 (10%)

Query: 158 VRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSSAVC- 216
           V + VGSPP++  MV+D+GS++ W+ C+         +  F+P  S+S++   C+S++C 
Sbjct: 62  VSLTVGSPPQNVTMVLDTGSELSWLHCKKLPNL----NSTFNPLLSSSYTPTPCNSSICT 117

Query: 217 ----DRLENAGCHAGR--CRYEVSYGDGSYTKGTLALETLTIGRTVVKNVAIGC----GH 266
               D    A C      C   VSY D S  +GTLA ET ++          GC    G+
Sbjct: 118 TRTRDLTIPASCDPNNKLCHVIVSYADASSAEGTLAAETFSLAGAAQPGTLFGCMDSAGY 177

Query: 267 KNQ-GMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTGSSGSLVFGR-EALPV 324
            +         GL+G+  GS+SLV Q+   +   FSYC+   G  + G L+ G     P 
Sbjct: 178 TSDINEDSKTTGLMGMNRGSLSLVTQM---SLPKFSYCI--SGEDALGVLLLGDGTDAPS 232

Query: 325 GAAWVPLVR-NPRAPSF----YYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTA 379
              + PLV     +P F    Y V L G+ V    + + + +F     G    ++D+GT 
Sbjct: 233 PLQYTPLVTATTSSPYFNRVAYTVQLEGIKVSEKLLQLPKSVFVPDHTGAGQTMVDSGTQ 292

Query: 380 VTRLPTPAYEAFRDAFVAQT-GNLPRASGVSI-----FDTCYNLSGFVSVRVPTVSFYFS 433
            T L    Y + +D F+ QT G L R    +       D CY+     +  VP V+  FS
Sbjct: 293 FTFLLGSVYSSLKDEFLEQTKGVLTRIEDPNFVFEGAMDLCYHAPASFAA-VPAVTLVFS 351

Query: 434 GGPVLTLPASNFLIPVDDAG--TFCFAFAPSP-SGLS--IIGNIQQEGIQISFDGANGFV 488
           G   + +     L  V       +CF F  S   G+   +IG+  Q+ + + FD     V
Sbjct: 352 GAE-MRVSGERLLYRVSKGSDWVYCFTFGNSDLLGIEAYVIGHHHQQNVWMEFDLLKSRV 410

Query: 489 GFGPNVC 495
           GF    C
Sbjct: 411 GFTQTTC 417


>gi|326514838|dbj|BAJ99780.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 430

 Score =  114 bits (285), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 102/365 (27%), Positives = 156/365 (42%), Gaps = 50/365 (13%)

Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQ-PCSQCYKQSDPVFDPADSASFSGVSCS 212
           G Y+V + +G P +  ++ +D+GSD+ W+QC  PC  C K   P + P  +     V C+
Sbjct: 71  GHYYVTMNIGDPAKPYFLDVDTGSDLTWLQCDAPCQSCNKVPHPWYKPTKNKI---VPCA 127

Query: 213 SAVCDRL-ENAGCHAG-RCRYEVSYGDGSYTKGTLALETLTI----GRTVVKNVAIGCGH 266
           +++C  L  N  C    +C Y++ Y D + + G L  +  T+      TV  N+  GCG+
Sbjct: 128 ASLCTSLTPNKKCAVPQQCDYQIKYTDKASSLGVLIADNFTLSLRNSSTVRANLTFGCGY 187

Query: 267 -----KNQGMFVGAAGLLGLGGGSMSLVGQLGGQ--TGGAFSYCLVSRGTGSSGSLVFGR 319
                KN  +     GLLGLG G++SL+ QL  Q  T     +C  + G    G L FG 
Sbjct: 188 DQQVGKNGAVQAATDGLLGLGKGAVSLLSQLKQQGVTKNVLGHCFSTNG---GGFLFFGD 244

Query: 320 EALPVG-AAWVPLVRNPRAPSFYYVGLSGLGVGGMRIPISEDLF---RLTQMGDDGVVMD 375
           + +P     WVP+ R                 G    P S  L+   R   M    VV D
Sbjct: 245 DIVPTSRVTWVPMARTTS--------------GNYYSPGSGTLYFDRRSLGMKPMEVVFD 290

Query: 376 TGTAVTRLPTPAYEAFRDAFVA-QTGNLPRASGVSIFDTCYN----LSGFVSVRVPTVSF 430
           +G+         Y+A   A  A  + +L   S VS+   C+           V+    S 
Sbjct: 291 SGSTYAYFAAEPYQATVSALKAGLSKSLKEVSDVSL-PLCWKGQKVFKSVSEVKNDFKSL 349

Query: 431 YFSGGP--VLTLPASNFLIPVDDAGTFCFAFAPSPSG---LSIIGNIQQEGIQISFDGAN 485
           + S G   V+ +P  N+LI V   G  C       +     +IIG+I  +   I +D   
Sbjct: 350 FLSFGKNSVMEIPPENYLI-VTKYGNVCLGILDGTTAKLKFNIIGDITMQDQMIIYDNEK 408

Query: 486 GFVGF 490
           G +G+
Sbjct: 409 GQLGW 413


>gi|242092874|ref|XP_002436927.1| hypothetical protein SORBIDRAFT_10g011140 [Sorghum bicolor]
 gi|241915150|gb|EER88294.1| hypothetical protein SORBIDRAFT_10g011140 [Sorghum bicolor]
          Length = 484

 Score =  114 bits (285), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 122/459 (26%), Positives = 182/459 (39%), Gaps = 47/459 (10%)

Query: 63  RHNNISSSNTSSDEARWNLELVHRDKMSSSSNTTNNMHYHRHQHSFHARMQRDVKRVATL 122
           +     SS  S    R  L +VHR  +S  S           + S    + RD  R  +L
Sbjct: 47  KQTPTCSSAHSGTSRRDTLPVVHR--LSPCSPLGAARIQQLEKPSVADILHRDALRFRSL 104

Query: 123 VRRLSGG---------GADAAKHEVQDFGTDVVSGMDQGSGEYFVRIGVGSPPRSQYMVI 173
            R  + G         GAD     +   G D +  +  G+ EY V  G G+P +   +  
Sbjct: 105 FRDHNHGSAAPAPTSPGADGGGLSIPSRG-DPIQEL-PGAFEYHVTAGFGTPVQQFTVGF 162

Query: 174 DSGSD-IVWVQCQPCS---QCYKQSDPVFDPADSASFSGVSCSSAVCDRLENAGCHAGRC 229
           D+ +     +QC+PC+    C+      FDP+ S+S + V C S  C    N GC    C
Sbjct: 163 DTTTTGATQLQCKPCAADEPCHH----AFDPSASSSIAHVPCGSPDCPF--NKGCSGHSC 216

Query: 230 RYEVSYGDGSYTKGTLALETLTIGR-TVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSL 288
              VS  +      T   + LT+    +V +    C          + G+L L   S SL
Sbjct: 217 TLSVSINNTLLGNATFFTDKLTLTPWNIVDDFRFVCLEAGFRPDDDSTGILDLSRNSHSL 276

Query: 289 VGQLGGQTGGA--FSYCLVSRGT-------GSSGSLVFGREALPVGAAWVPLVRNPRAPS 339
             +    +  A  FSYCL S  +       G++   + GR+      ++ PL  N    +
Sbjct: 277 ASRAAPSSPDAVAFSYCLPSYPSDVGFLSLGATKPELLGRKV-----SYTPLRSNRHNGN 331

Query: 340 FYYVGLSGLGVGGMRIPISEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQT 399
            Y V L GLG+GG+ +P+         +   G +++  T  T L    Y A RD F    
Sbjct: 332 LYVVELVGLGLGGVDLPVPR-----AAIAGGGTILELHTTFTYLKPKVYAALRDEFRKSM 386

Query: 400 GNLPRASGVSIFDTCYNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTF---C 456
              P A      DTCYN +   S  VP V+  F GG    L     +   +    F   C
Sbjct: 387 SQYPVAPPQGSLDTCYNFTALSSYSVPAVTLKFDGGAEFDLWIDEMMYFPEPGSYFSVGC 446

Query: 457 FAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 495
            AF  +  G ++IG++ Q   ++ +D   G VGF P  C
Sbjct: 447 LAFV-AQDGGAVIGSMAQMSTEVVYDVRGGKVGFVPYRC 484


>gi|147789749|emb|CAN67405.1| hypothetical protein VITISV_025616 [Vitis vinifera]
          Length = 609

 Score =  114 bits (285), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 112/389 (28%), Positives = 162/389 (41%), Gaps = 55/389 (14%)

Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYKQSDPVFDPADSASFSGVSCSS 213
           G Y V +  G+P ++   V+D+GS +VW  C     C + S P  DPA   +F     SS
Sbjct: 88  GGYSVSLSFGTPSQTLSFVMDTGSSLVWFPCTSRYVCTRCSFPNIDPAKIPTFIPKLSSS 147

Query: 214 AV-----------------------CDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETL 250
           A                        CD+  +A C      Y + YG G+ T G L LE+L
Sbjct: 148 AKIVGCLNPKCGFVMDSEVRTRCPGCDQ-NSANCTKACPTYAIQYGLGT-TVGLLLLESL 205

Query: 251 TIGRTVVKNVAIGCGHKNQGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSR--- 307
                   +  +GC   +       +G+ G G G  SL  Q+G +    FSYCL+S    
Sbjct: 206 VFAERTEPDFVVGCSILSSRQ---PSGIAGFGRGPSSLPKQMGLK---KFSYCLLSHRFD 259

Query: 308 --GTGSSGSLVFG---REALPVGAAWVPLVRNPRA-----PSFYYVGLSGLGVGGMRIPI 357
                S  +L  G   ++    G ++ P  +NP +       +YYV L  + VG  R+  
Sbjct: 260 DSPKSSKMTLYVGPDSKDDKTGGLSYTPFRKNPVSSNSAFKEYYYVTLRHIIVGDKRVKX 319

Query: 358 SEDLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGV---SIFDTC 414
                     G+ G ++D+G+  T +  P +EA    F  Q  N  RA+ V   S    C
Sbjct: 320 PYSFMVAGSDGNGGTIVDSGSTFTFMEKPVFEAVATEFDRQMANYTRAADVEALSGLKPC 379

Query: 415 YNLSGFVSVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSP-------SGLS 467
           +NLSG  SV +P++ F F GG  + LP +N+   V D    C     +        SG S
Sbjct: 380 FNLSGVGSVALPSLVFQFKGGAKMELPVANYFSLVGDLSVLCLTIVSNEAVGSTLSSGPS 439

Query: 468 II-GNIQQEGIQISFDGANGFVGFGPNVC 495
           II GN Q +     +D  N   GF    C
Sbjct: 440 IILGNYQSQNFYTEYDLENERFGFRRQRC 468


>gi|238007638|gb|ACR34854.1| unknown [Zea mays]
 gi|413948713|gb|AFW81362.1| pepsin A [Zea mays]
          Length = 538

 Score =  114 bits (285), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 103/385 (26%), Positives = 160/385 (41%), Gaps = 44/385 (11%)

Query: 154 GEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCSQCYK--------------------Q 193
           G Y V +  G+P     +V+D+ +D+ W+ C+   +  K                    +
Sbjct: 125 GMYLVSVRFGTPALPYNLVLDTANDLTWINCRLRRRKGKHYGRTMSVGAGDDGAAAKEAR 184

Query: 194 SDPVFDPADSASFSGVSCSSAVCDRLENAGCH----AGRCRYEVSYGDGSYTKGTLALET 249
               + PA S+S+  + CS   C  L    C     A  C Y     DG+ T G    E 
Sbjct: 185 RKNWYRPAKSSSWRRIRCSQKECALLPYNTCQSPSKAESCSYYQQMQDGTLTMGIYGKEK 244

Query: 250 LTI----GRTV-VKNVAIGCGHKNQGMFVGAA-GLLGLGGGSMSLVGQLGGQTGGAFSYC 303
            T+    GR   +  + +GC     G  V A  G+L LG G MS       + G  FS+C
Sbjct: 245 ATVTVSDGRMAKLPGLILGCSVLEAGGSVDAHDGVLSLGNGEMSFAVHAAKRFGQRFSFC 304

Query: 304 LVSRGTGSSGS--LVFGREALPVGAAWVP--LVRNPRAPSFYYVGLSGLGVGGMRIPISE 359
           L+S  +    S  L FG     +G   +   +V N      Y   ++G+ VGG R+ I +
Sbjct: 305 LLSANSSRDASSYLTFGPNPAVMGPGTMETDIVYNVDVKPAYGPLVTGIFVGGERLDIPQ 364

Query: 360 DLFRLTQMGDDGVVMDTGTAVTRLPTPAYEAFRDAFVAQTGNLPRASGVSIFDTCYNLSG 419
           +++   ++   GV++DT T+VT L   AY A   A      +LPR   +  F+ CY  + 
Sbjct: 365 EIWDAEKVVGGGVILDTSTSVTSLVPEAYAAVTSALDRHLSHLPRVYELDGFEYCYRWT- 423

Query: 420 FV--------SVRVPTVSFYFSGGPVLTLPASNFLIPVDDAGTFCFAFAPSPS-GLSIIG 470
           F         +V VP ++   +GG  L   A + ++P    G  C AF   P  G  I+G
Sbjct: 424 FAGDGVDLAHNVTVPRLTVEMAGGARLEPEAKSVVMPEVVPGVACLAFRKLPRGGPGILG 483

Query: 471 NIQQEGIQISFDGANGFVGFGPNVC 495
           N+  +      D   G + F  + C
Sbjct: 484 NVLMQEYIWEIDHGKGKMRFRKDKC 508


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.319    0.135    0.407 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 8,186,560,293
Number of Sequences: 23463169
Number of extensions: 382816785
Number of successful extensions: 1077599
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 1741
Number of HSP's successfully gapped in prelim test: 2717
Number of HSP's that attempted gapping in prelim test: 1067543
Number of HSP's gapped (non-prelim): 5415
length of query: 495
length of database: 8,064,228,071
effective HSP length: 147
effective length of query: 348
effective length of database: 8,910,109,524
effective search space: 3100718114352
effective search space used: 3100718114352
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 79 (35.0 bits)